data-engineering

IBM PowerHA SystemMirror for AIX 7.1.3 Best Practices and Migration Guide

2015-02-02 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Dino Quintero , William Nespoli Zanatta , Kulwinder Singh , Reshma Prathap , Daniel J. Martin-Corben , Shawn Bodily , Ashraf Ali Thajudeen

IBM data

This IBM® Redbooks® publication positions high availability solutions for IBM Power Systems™ with IBM PowerHA® SystemMirror® Standard and Enterprise Editions (hardware, software, best practices, reference architectures, migration, and tools) with a well-defined and documented deployment model within an IBM Power Systems environment allowing customers a planned foundation for a dynamic high available infrastructure for their enterprise applications. This Redbooks publication documents topics to leverage the strengths of IBM PowerHA SystemMirror Standard and Enterprise Editions 7.1.3 for IBM Power Systems to solve customers' application high availability challenges, and maximize systems' availability, and management. This Redbooks publication focuses on providing the readers with technical information and references on the capabilities of each edition, functionalities, usability, and features that make IBM PowerHA SystemMirror a premier solution for high availability and disaster recovery for IBM Power Systems servers. This Redbooks publication helps strengthen the position of the IBM PowerHA SystemMirror solution with a well-defined and documented best practices, usability, functionality, migration and deployment model within an IBM POWER® system virtualized environment allowing customers a planned foundation for business resilient infrastructure solutions. This Redbooks publication is targeted toward technical professionals (consultants, technical support staff, IT Architects, and IT Specialists) responsible for providing high availability solutions and support with the IBM PowerHA SystemMirror on IBM POWER.

IBM Linear Tape File System Enterprise Edition V1.1.1.2: Installation and Configuration Guide

2015-01-29 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Larry Coyne , Khanh Ngo , Stefan Neff

IBM data

This IBM® Redbooks® publication helps you with the planning, installation, and configuration of the new IBM Linear Tape File System™ (LTFS) Enterprise Edition (EE) V1.1.1.2 for the IBM TS3310, IBM TS3500, and IBM TS4500 tape libraries. LTFS EE enables the use of LTFS for the policy management of tape as a storage tier in an IBM General Parallel File System (IBM GPFS™) based environment and helps encourage the use of tape as a critical tier in the storage environment. LTFS EE can play a major role in reducing the cost of storage for data that does not need the access performance of primary disk. The use of LTFS EE to replace disks with tape in Tier 2 and Tier 3 storage can improve data access over other storage solutions because it improves efficiency and streamlines management for files on tape. LTFS EE simplifies the use of tape by making it transparent to the user and manageable by the administrator under a single infrastructure. This publication is intended for anyone who wants to understand more about LTFS EE planning and implementation. This book is suitable for IBM clients, IBM Business Partners, IBM specialist sales representatives, and technical specialists.

Apache ZooKeeper Essentials

2015-01-28 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Saurav Haloi

Java Python data zookeeper

Apache ZooKeeper Essentials is your comprehensive guide to understanding and utilizing Apache ZooKeeper for coordinating services in distributed systems. This book offers a clear and practical approach to ZooKeeper's architecture and programming, focusing on its application in real-world scenarios. What this Book will help me do Understand the architecture and operational design of Apache ZooKeeper. Effectively use ZooKeeper to coordinate distributed systems. Implement ZooKeeper programming using languages such as Java, C, or Python. Administer and manage ZooKeeper servers and clusters. Utilize tools like Apache Curator to enhance your ZooKeeper experience. Author(s) None Haloi, the author of Apache ZooKeeper Essentials, brings extensive experience in distributed systems and software development. Their expertise ensures a clear and approachable style, ideal for technical learners. Their passion for sharing knowledge is evident through practical examples and focus on real-world applications. Who is it for? This book is ideal for software developers, system architects, and engineers who are looking to enhance their knowledge of distributed systems. Readers should have foundational programming knowledge in languages like Java, C, or Python. While prior experience with ZooKeeper isn't necessary, familiarity with distributed computing will enable you to gain the most from this guide. If you're interested in learning how to leverage ZooKeeper effectively, this book is for you.

ElasticSearch Cookbook - Second Edition

2015-01-28 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Alberto Paro

Analytics Big Data Cloud Computing ELK Java JSON Python data elasticsearch search

The "ElasticSearch Cookbook - Second Edition" is a hands-on guide featuring over 130 advanced recipes to help you harness the power of ElasticSearch, a leading search and analytics engine. Through insightful examples and practical guidance, you'll learn to implement efficient search solutions, optimize queries, and manage ElasticSearch clusters effectively. What this Book will help me do Design and configure ElasticSearch topologies optimized for your specific deployment needs. Develop and utilize custom mappings to optimize your data indexes. Execute advanced queries and filters to refine and retrieve search results effectively. Set up and monitor ElasticSearch clusters for optimal performance. Extend ElasticSearch capabilities through plugin development and integrations using Java and Python. Author(s) Alberto Paro is a technology expert with years of experience working with ElasticSearch, Big Data solutions, and scalable cloud architecture. He has authored multiple books and technical articles on ElasticSearch, leveraging his extensive knowledge to provide practical insights. His approachable and detail-oriented style makes complex concepts accessible to technical professionals. Who is it for? This book is best suited for software developers and IT professionals looking to use ElasticSearch in their projects. Readers should be familiar with JSON, as well as basic programming skills in Java. It is ideal for those who have an understanding of search applications and want to deepen their expertise. Whether you're integrating ElasticSearch into a web application or optimizing your system's search capabilities, this book will provide the skills and knowledge you need.

Elasticsearch: The Definitive Guide

2015-01-28 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Zachary Tong , Clinton Gormley

Analytics ELK data elasticsearch search

Whether you need full-text search or real-time analytics of structured data—or both—the Elasticsearch distributed search engine is an ideal way to put your data to work. This practical guide not only shows you how to search, analyze, and explore data with Elasticsearch, but also helps you deal with the complexities of human language, geolocation, and relationships.

Application Development for IBM CICS Web Services

2015-01-27 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Ian Burnett , James O'Grady , Xue Yong Zhang , San Yong Liu , Jim Harrison (Paramount)

Cloud Computing IBM XML data

This IBM® Redbooks® publication focuses on developing Web service applications in IBM CICS®. It takes the broad view of developing and modernizing CICS applications for XML, Web services, SOAP, and SOA support, and lays out a reference architecture for developing these kinds of applications. We start by discussing Web services in general, then review how CICS implements Web services. We offer an overview of different development approaches: bottom-up, top-down, and meet-in-the-middle. We then look at how you would go about exposing a CICS application as a Web service provider, again looking at the different approaches. The book then steps through the process of creating a CICS Web service requester. We follow this by looking at CICS application aggregation (including 3270 applications) with IBM Rational® Application Developer for IBM System z® and how to implement CICS Web Services using CICS Cloud technology. The first part is concluded with hints and tips to help you when implementing this technology. Part two of this publication provides performance figures for a basic Web service. We investigate some common variables and examine their effects on the performance of CICS as both a requester and provider of Web services.

Implementing High Availability and Disaster Recovery in IBM PureApplication Systems V2

2015-01-27 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Rajeev Gandhi , Addison Goering , Margaret Ticknor , Venkata Gadepalli , Stanley Shieh , Bertrand Portier , Sung-Ik Son , Hendrik Van Run

IBM data

This IBM Redbooks publication describes and demonstrates common, prescriptive scenarios for setting up disaster recovery for common workloads using IBM WebSphere Application Server, IBM DB2, and WebSphere MQ between two IBM PureApplication System racks using the features in PureApplication System V2. The intended audience for this book is pattern developers and operations team members who are setting up production systems using software patterns from IBM that must be highly available or able to recover from a disaster (defined as the complete loss of a data center).

Solr Cookbook - Third Edition - Third Edition

2015-01-23 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Rafal Kuc

data search solr

Master Apache Solr with the comprehensive 'Solr Cookbook - Third Edition', which introduces over 100 practical recipes to help you exploit the full potential of Apache Solr versions 4.x to 5. By following this book, you'll gain actionable insights and solutions to solve real-world problems effectively with Solr. What this Book will help me do Effectively index data from various sources and formats into Solr for optimized searches. Utilize and configure faceting to enhance aggregated data insights. Implement and configure SolrCloud for scalable and robust search infrastructures. Identify and resolve performance bottlenecks in Solr and Solr clusters. Develop and deploy advanced query features like autocomplete and document highlighting. Author(s) Rafal Kuc is a seasoned software architect with years of experience working with Apache Solr in production environments. He specializes in search technologies, distributed systems, and empowering developers with actionable knowledge. Rafal approaches writing with a practical mindset, focusing on how to solve real-world challenges efficiently. Who is it for? This book is ideal for intermediate Solr developers, system architects, or IT professionals responsible for search systems. It assumes a basic familiarity with Solr but provides deep dives into advanced functionalities and configurations. Readers looking to enhance their understanding of Solr 4.x and 5.x capabilities will find this book valuable. Whether you're improving search performance or exploring new Solr features, this book guides you step-by-step.

Getting Started with IBM InfoSphere Optim Workload Replay for DB2

2015-01-18 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Leif Pedersen , Hassi Norlen , Whei-Jen Chen , John Vonau , Tom Toomire , Patrick Titzler , Nisanti Mohanraj

IBM Linux SQL Unix data ibm-db2 relational-databases

This IBM® Redbooks® publication will help you install, configure, and use IBM InfoSphere® Optim™ Workload Replay (InfoSphere Workload Replay), a web-based tool that lets you capture real production SQL workload data and then replay the workload data in a pre-production environment. With InfoSphere Workload Replay, you can set up and run realistic tests for enterprise database changes without the need to create a complex client and application infrastructure to mimic your production environment. The publication goes through the steps to install and configure the InfoSphere Workload Replay appliance and related database components for IBM DB2® for Linux, UNIX, and Windows and for DB2 for IBM z/OS®. The capture, replay, and reporting process, including user ID and roles management, is described in detail to quickly get you up and running. Ongoing operations, such as appliance health monitoring, starting and stopping the product, and backup and restore in your day-to-day management of the product, extensive troubleshooting information, and information about how to integrate InfoSphere Workload Replay with other InfoSphere products are covered in separate chapters.

Implementing the IBM Storwize V7000 Gen2

2015-01-18 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Nancy Kinney , Lev Sturmer , Jon Tate , Morten Dannemand , Massimo Rosati

Big Data IBM data

Data is the new currency of business, the most critical asset of the modern organization. In fact, enterprises that can gain business insights from their data are twice as likely to outperform their competitors. Nevertheless, 72% of them have not started, or are only planning, big data activities. In addition, organizations often spend too much money and time managing where their data is stored. The average firm purchases 24% more storage every year, but uses less than half of the capacity that it already has. The IBM® Storwize® family, including the IBM SAN Volume Controller Data Platform, is a storage virtualization system that enables a single point of control for storage resources. This functionality helps support improved business application availability and greater resource use. The following list describes the business objectives of this system: To manage storage resources in your information technology (IT) infrastructure To make sure that those resources are used to the advantage of your business To do it quickly, efficiently, and in real time, while avoiding increases in administrative costs Storwize functions benefit all virtualized storage. For example, IBM Easy Tier® optimizes use of flash memory. In addition, IBM Real-time Compression™ enhances efficiency even further by enabling the storage of up to five times as much active primary data in the same physical disk space. Finally, high-performance thin provisioning helps automate provisioning. These benefits can help extend the useful life of existing storage assets, reducing costs. Integrating these functions into Storwize also means that they are designed to operate smoothly together, reducing management effort. This IBM Redbooks® publication provides information about the latest features and functions of the Storwize V7000 Gen2 and software version 7.3 implementation, architectural improvements, and Easy Tier.

Data Driven

2015-01-15 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Hilary Mason (Hidden Door) , DJ Patil (GreatPoint Ventures)

Big Data Hadoop data

Succeeding with data isn’t just a matter of putting Hadoop in your machine room, or hiring some physicists with crazy math skills. It requires you to develop a data culture that involves people throughout the organization. In this O’Reilly report, DJ Patil and Hilary Mason outline the steps you need to take if your company is to be truly data-driven—including the questions you should ask and the methods you should adopt. You’ll not only learn examples of how Google, LinkedIn, and Facebook use their data, but also how Walmart, UPS, and other organizations took advantage of this resource long before the advent of Big Data. No matter how you approach it, building a data culture is the key to success in the 21st century. You’ll explore: Data scientist skills—and why every company needs a Spock How the benefits of giving company-wide access to data outweigh the costs Why data-driven organizations use the scientific method to explore and solve data problems Key questions to help you develop a research-specific process for tackling important issues What to consider when assembling your data team Developing processes to keep your data team (and company) engaged Choosing technologies that are powerful, support teamwork, and easy to use and learn

Data Privacy for the Smart Grid

2015-01-15 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Christine Hertzog , Rebecca Herold

Cyber Security data data-security-privacy data security & privacy

Privacy for the Smart Grid provides easy-to-understand guidance on data privacy issues and the implications for creating privacy risk management programs, along with privacy policies and practices required to ensure Smart Grid privacy. It addresses privacy in electric, natural gas, and water grids from two different perspectives of the topic, one from a Smart Grid expert and another from a privacy and information security expert. While considering privacy in the Smart Grid, the book also examines the data created by Smart Grid technologies and machine-to-machine applications.

Digital Privacy in the Marketplace

2015-01-14 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by George Milne

data data-security-privacy data security & privacy

Digital Privacy in the Marketplace focuses on the data ex-changes between marketers and consumers, with special ttention to the privacy challenges that are brought about by new information technologies. The purpose of this book is to provide a background source to help the reader think more deeply about the impact of privacy issues on both consumers and marketers. It covers topics such as: why privacy is needed, the technological, historical and academic theories of privacy, how market exchange af-fects privacy, what are the privacy harms and protections available, and what is the likely future of privacy.

Key Management Models, 3rd Edition

2015-01-14 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Gerben Van den Berg , Paul Pietersma

data data-models

This best selling management book is a true classic. If you want to be a model manager, keep this new, even better 3rd edition close at hand. Key Management Models has the winning combination of brevity and clarity, giving you short, practical overviews of the top classic and cutting edge management models in an easy-to-use, ready reference format. Whether you want to remind yourself about models you’ve already come across, or want to find new ones, you’ll find yourself referring back to it again and again. It's the essential guide to all the management models you’ll ever need to know about. Includes the classic and essential management models from the previous editions. Thoroughly updated to include cutting edge new models. Two-colour illustrations and case studies throughout. The full text downloaded to your computer With eBooks you can: search for key concepts, words and phrases make highlights and notes as you study share your notes with friends eBooks are downloaded to your computer and accessible either offline through the Bookshelf (available as a free download), available online and also via the iPad and Android apps. Upon purchase, you will receive via email the code and instructions on how to access this product. Time limit The eBooks products do not have an expiry date. You will continue to access your digital ebook products whilst you have your Bookshelf installed.

Getting a Big Data Job For Dummies

2015-01-12 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Jason Williamson

Big Data data

Hone your analytic talents and become part of the next big thing Getting a Big Data Job For Dummies is the ultimate guide to landing a position in one of the fastest-growing fields in the modern economy. Learn exactly what "big data" means, why it's so important across all industries, and how you can obtain one of the most sought-after skill sets of the decade. This book walks you through the process of identifying your ideal big data job, shaping the perfect resume, and nailing the interview, all in one easy-to-read guide. Companies from all industries, including finance, technology, medicine, and defense, are harnessing massive amounts of data to reap a competitive advantage. The demand for big data professionals is growing every year, and experts forecast an estimated 1.9 million additional U.S. jobs in big data by 2015. Whether your niche is developing the technology, handling the data, or analyzing the results, turning your attention to a career in big data can lead to a more secure, more lucrative career path. Getting a Big Data Job For Dummies provides an overview of the big data career arc, and then shows you how to get your foot in the door with topics like: The education you need to succeed The range of big data career path options An overview of major big data employers A plan to develop your job-landing strategy Your analytic inclinations may be your ticket to long-lasting success. In a highly competitive job market, developing your data skills can create a situation where you pick your employer rather than the other way around. If you're ready to get in on the ground floor of the next big thing, Getting a Big Data Job For Dummies will teach you everything you need to know to get started today.

Oracle Database 12c Security

2015-01-09 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Scott Gaetjen , William Maroulis , David Knox

Cloud Computing Oracle Cyber Security data oracle-database-solutions

Best Practices for Comprehensive Oracle Database Security Written by renowned experts from Oracle's National Security Group, Oracle Database 12c Security provides proven techniques for designing, implementing, and certifying secure Oracle Database systems in a multitenant architecture. The strategies are also applicable to standalone databases. This Oracle Press guide addresses everything from infrastructure to audit lifecycle and describes how to apply security measures in a holistic manner. The latest security features of Oracle Database 12c are explored in detail with practical and easy-to-understand examples. Connect users to databases in a secure manner Manage identity, authentication, and access control Implement database application security Provide security policies across enterprise applications using Real Application Security Control data access with Oracle Virtual Private Database Control sensitive data using data redaction and transparent sensitive data protection Control data access with Oracle Label Security Use Oracle Database Vault and Transparent Data Encryption for compliance, cybersecurity, and insider threats Implement auditing technologies, including Unified Audit Trail Manage security policies and monitor a secure database environment with Oracle Enterprise Manager Cloud Control

IBM TS4500 Tape Library Guide

2015-01-08 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Larry Coyne , Michael Engelbrecht

IBM data

The IBM® TS4500 tape library is a next-generation tape solution that offers higher storage density and integrated management. This IBM Redbooks® publication gives you a close-up view of the new IBM TS4500 tape library. In the TS4500, IBM delivers the density that today's and tomorrow's data growth require, with the cost-effectiveness and the manageability to grow with business data needs, while preserving existing investments in IBM tape library products. Now, you can achieve both a low cost per terabyte (TB) and a high TB density per square foot, because the TS4500 can store up to 5.5 PBs of data in a single 10 square foot library frame, which is up to 3.4 times more capacity than the IBM TS3500 tape library. This guide describes TS4500 components, feature codes, specifications, supported tape drives, encryption, the new integrated management console, and the command-line interface (CLI) and provides instructions for several specific tasks. It is for anyone who wants to understand more about the IBM TS4500 tape library. It is particularly suitable for IBM clients, IBM Business Partners, IBM specialist sales representatives, and technical specialists.

PHP and MySQL Web Development: A Beginner’s Guide

2015-01-05 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Marty Matthews

HTML JavaScript MySQL SQL data relational-databases

Essential Skills—Made Easy! PHP and MySQL Web Development: A Beginner's Guide takes you from building static web pages to creating comprehensive database-driven web applications. The book reviews HTML, CSS, and JavaScript and then explores PHP--its structure, control statements, arrays, functions, use with forms, and file handling capabilities. Next, the book examines MySQL, including SQL, the MySQL command set, and how to use it with PHP to create a relational database and build secure, databasedriven web applications. This practical resource features complete, step-by-step examples with code that you can use as templates for your own projects. Designed for Easy Learning Key Skills & Concepts--Chapter-opening lists of specific skills covered in the chapter Try This--Hands-on exercises that show you how to apply your skills Notes--Extra information related to the topic being covered Tips--Helpful reminders or alternate ways of doing things Cautions--Errors and pitfalls to avoid Self Tests--End-of-chapter quizzes to reinforce your skills Annotated Syntax--Example code with commentary that describes the programming techniques being illustrated Ready-to-use code at www.mhprofessional.com

Practical Neo4j

2015-01-05 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Gregory Jordan

Big Data Data Modelling Java Neo4j NoSQL Python data graph-databases

" Why have developers at places like Facebook and Twitter increasingly turned to graph databases to manage their highly connected big data? The short answer is that graphs offer superior speed and flexibility to get the job done. It’s time you added skills in graph databases to your toolkit. In Practical Neo4j, database expert Greg Jordan guides you through the background and basics of graph databases and gets you quickly up and running with Neo4j, the most prominent graph database on the market today. Jordan walks you through the data modeling stages for projects such as social networks, recommendation engines, and geo-based applications. The book also dives into the configuration steps as well as the language options used to create your Neo4j-backed applications. Neo4j runs some of the largest connected datasets in the world, and developing with it offers you a fast, proven NoSQL database option. Besides those working for social media, database, and networking companies of all sizes, academics and researchers will find Neo4j a powerful research tool that can help connect large sets of diverse data and provide insights that would otherwise remain hidden. Using Practical Neo4j, you will learn how to harness that power and create elegant solutions that address complex data problems. This book: Explains the basics of graph databases Demonstrates how to configure and maintain Neo4j Shows how to import data into Neo4j from a variety of sources Provides a working example of a Neo4j-based application using an array of language of options including Java, .Net, PHP, Python, Spring, and Ruby As you’ll discover, Neo4j offers a blend of simplicity and speed while allowing data relationships to maintain first-class status. That’s one reason among many that such a wide range of industries and fields have turned to graph databases to analyze deep, dense relationships. After reading this book, you’ll have a potent, elegant tool you can use to develop projects profitably and improve your career options.

Running Applications on Oracle Exadata

2015-01-05 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Joyjeet Banerjee

Oracle data oracle-database-solutions

Maximize Application Performance on Oracle Exadata Written by an enterprise architect specializing in applications on Oracle's engineered systems, Running Applications on Oracle Exadata: Tuning Tips & Techniques reveals proven methods for configuring and tuning Oracle Exadata to achieve peak results from applications. You'll get complete details on application migration, consolidation, and administration. Deliver unparalleled enterprise application performance on Oracle Exadata using the best practices provided in this Oracle Press guide. Understand Oracle Exadata architecture, hardware components, and software features Achieve peak performance from online transaction processing (OLTP) systems Size Oracle Exadata for applications using comparative and predictive methods Migrate and consolidate applications to Oracle Exadata Monitor, manage, and administer all Oracle Exadata components to ensure high availability and performance Develop and implement a backup and recovery strategy Learn best practices for running applications on Oracle Exadata Code examples in the book are available for download at OraclePressBooks.com

talk-data.com

Activity Trend

Top Events

Top Speakers

IBM PowerHA SystemMirror for AIX 7.1.3 Best Practices and Migration Guide

IBM Linear Tape File System Enterprise Edition V1.1.1.2: Installation and Configuration Guide

Apache ZooKeeper Essentials

ElasticSearch Cookbook - Second Edition

Elasticsearch: The Definitive Guide

Application Development for IBM CICS Web Services

Implementing High Availability and Disaster Recovery in IBM PureApplication Systems V2

Solr Cookbook - Third Edition - Third Edition

Getting Started with IBM InfoSphere Optim Workload Replay for DB2

Implementing the IBM Storwize V7000 Gen2

Data Driven

Data Privacy for the Smart Grid

Digital Privacy in the Marketplace

Key Management Models, 3rd Edition

Getting a Big Data Job For Dummies

Oracle Database 12c Security

IBM TS4500 Tape Library Guide

PHP and MySQL Web Development: A Beginner’s Guide

Practical Neo4j

Running Applications on Oracle Exadata