O'Reilly Data Engineering Books

Oracle SOA Suite 12c Handbook

2015-09-01 O'Reilly Amazon

book

Lucas Jellema

data data-engineering oracle-database-solutions Java JSON KPI

Master Oracle SOA Suite 12 c Design, implement, manage, and maintain a highly flexible service-oriented computing infrastructure across your enterprise using the detailed information in this Oracle Press guide. Written by an Oracle ACE director, Oracle SOA Suite 12c Handbook uses a start-to-finish case study to illustrate each concept and technique. Learn expert techniques for designing and implementing components, assembling composite applications, integrating Java, handling complex business logic, and maximizing code reuse. Runtime administration, governance, and security are covered in this practical resource. Get started with the Oracle SOA Suite 12 c development and run time environment Deploy and manage SOA composite applications Expose SOAP/XML REST/JSON through Oracle Service Bus Establish interactions through adapters for Database, JMS, File/FTP, UMS, LDAP, and Coherence Embed custom logic using Java and the Spring component Perform fast data analysis in real time with Oracle Event Processor Implement Event Drive Architecture based on the Event Delivery Network (EDN) Use Oracle Business Rules to encapsulate logic and automate decisions Model complex processes using BPEL, BPMN, and human task components Establish KPIs and evaluate performance using Oracle Business Activity Monitoring Control traffic, audit system activity, and encrypt sensitive data

The Architecture of Privacy

2015-09-01 O'Reilly Amazon

book

John K Grant , Daniel Slate , Ari Gesher , Elissa Lerner , Courtney Bowman

data data-engineering data-security-privacy data security & privacy Analytics Data Analytics

Technology’s influence on privacy not only concerns consumers, political leaders, and advocacy groups, but also the software architects who design new products. In this practical guide, experts in data analytics, software engineering, security, and privacy policy describe how software teams can make privacy-protective features a core part of product functionality, rather than add them late in the development process. Ideal for software engineers new to privacy, this book helps you examine privacy-protective information management architectures and their foundational components—building blocks that you can combine in many ways. Policymakers, academics, students, and advocates unfamiliar with the technical terrain will learn how these tools can help drive policies to maximize privacy protection.

Learning RSLogix 5000 Programming

2015-08-31 O'Reilly Amazon

book

Austin Scott

data data-engineering log-data

Dive into "Learning RSLogix 5000 Programming" and gain comprehensive insights into the RSLogix 5000 and Studio 5000 environments for Rockwell Automation controllers. By the end of this book, you'll master the essentials of programming ControlLogix, CompactLogix, SoftLogix, and designing advanced function block diagrams and sequential routines. What this Book will help me do Learn to program Rockwell Automation controllers using RSLogix 5000 and Studio 5000. Understand the features and functionalities of ControlLogix and CompactLogix platforms. Explore advanced programming techniques such as ladder logic, function block diagrams, and structured text. Familiarize yourself with the latest changes introduced in Studio 5000 and Logix Designer. Gain confidence in troubleshooting, industrial network communication, and automation system application design. Author(s) Austin Scott, a seasoned automation expert, has vast experience working with industrial control systems and Rockwell Automation technologies. His teaching methods focus on practical application and easy comprehension, making technical concepts accessible to beginners and professionals alike. Who is it for? Ideal for PLC programmers, electricians, and automation specialists aiming to enhance their skills with RSLogix 5000. Beginners with basic PLC knowledge will find the step-by-step approach convenient for mastering advanced tools. Aspiring professionals can use this resource to build foundational and advanced programming expertise.

Learning YARN

2015-08-28 O'Reilly Amazon

book

Akhil Arora , Shrey Mehrotra

data data-engineering Hadoop yarn Big Data Spark

"Learning YARN" is your comprehensive guide to master YARN, the resource management layer in the Hadoop ecosystem. Through the book, you'll leverage YARN's capabilities for big data processing, learning to deploy, manage, and scale Hadoop-YARN clusters. What this Book will help me do Understand the main features and benefits of the YARN framework. Gain experience managing Hadoop clusters of varying sizes. Learn to integrate YARN with domain-specific big data tools like Spark. Become skilled at administration and configuration of YARN. Develop and run your own YARN-based applications for distributed computing. Author(s) Akhil Arora and Shrey Mehrotra bring with them years of experience working in big data frameworks and technologies. With expertise in YARN specifically, they aim to bridge the gap for developers and administrators to learn and implement scalable big data solutions. Their extensive knowledge in cluster management and distributed data processing shines through in how this book is structured and detailed. Who is it for? This book is ideal for software developers, big data engineers, and system administrators interested in advancing their knowledge in resource management in Hadoop systems. If you have basic familiarity with Hadoop and need a deeper understanding or feature knowledge of YARN for professional growth, this book is tailored for you. It is also suitable for learners seeking to integrate big data platforms like Spark into YARN clusters.

OCA/OCP Oracle Database 12c All-in-One Exam Guide (Exams 1Z0-061, 1Z0-062, & 1Z0-063), 2nd Edition

2015-08-28 O'Reilly Amazon

book

John Watson , Roopesh Ramklass , Bob Bryla

data data-engineering oracle-database-solutions Oracle SQL

This Oracle Press certification exam guide prepares you for the new Oracle Database 12 c certification track, including the core requirements for OCA and OCP certification. OCA/OCP Oracle Database 12c All-in-One Exam Guide (Exams 1Z0-061, 1Z0-062, & 1Z0-063) covers all of the exam objectives on the Installation and Administration, SQL Fundamentals, and Advanced Administration exams in detail. Each chapter includes examples, practice questions, Inside the Exam sections highlighting key exam topics, a chapter summary, and a two-minute drill to reinforce essential knowledge. 300+ practice exam questions match the format, topics, and difficulty of the real exam. Electronic content includes interactive practice exam software with hundreds of questions that include detailed answers and explanations, and a score report performance assessment tool Ideal as both exam guide and on-the-job reference The most comprehensive single preparation tool for the Oracle Database 12 c OCA and OCP certification exams

Performance Optimization and Tuning Techniques for IBM Power Systems Processors Including IBM POWER8

2015-08-28 O'Reilly Amazon

book

Suresh Warrier , Peter Bergner , Madhusudanan Kandasamy , Alon Shalev Housfater , Steve Munroe , Bill Schmidt , Brian Hall , Bernard King Smith , David Wendt , Will Schmidt , Tulio Magno , Julian Wang , Alex Mericas , Mauricio Oliveira

data data-engineering IBM Linux Superset

This IBM® Redbooks® publication focuses on gathering the correct technical information, and laying out simple guidance for optimizing code performance on IBM POWER8® processor-based systems that run the IBM AIX®, IBM i, or Linux operating systems. There is straightforward performance optimization that can be performed with a minimum of effort and without extensive previous experience or in-depth knowledge. The POWER8 processor contains many new and important performance features, such as support for eight hardware threads in each core and support for transactional memory. The POWER8 processor is a strict superset of the IBM POWER7+™ processor, and so all of the performance features of the POWER7+ processor, such as multiple page sizes, also appear in the POWER8 processor. Much of the technical information and guidance for optimizing performance on POWER8 processors that is presented in this guide also applies to POWER7+ and earlier processors, except where the guide explicitly indicates that a feature is new in the POWER8 processor. This guide strives to focus on optimizations that tend to be positive across a broad set of IBM POWER® processor chips and systems. Specific guidance is given for the POWER8 processor; however, the general guidance is applicable to the IBM POWER7+, IBM POWER7®, IBM POWER6®, IBM POWER5, and even to earlier processors. This guide is directed at personnel who are responsible for performing migration and implementation activities on POWER8 processor-based systems. This includes system administrators, system architects, network administrators, information architects, and database administrators (DBAs).

Expert Oracle Exadata, Second Edition

2015-08-26 O'Reilly Amazon

book

Kerry Osborne , Andy Colvin , Karl Arao , Randy Johnson , Martin Bach , Tanel Poder , Frits Hoogland

data data-engineering oracle-database-solutions Oracle RDBMS SQL

Expert Oracle Exadata, 2nd Edition opens up the internals of Oracle's Exadata platform so that you can fully benefit from the most performant and scalable database hardware appliance capable of running Oracle Database. This edition is fully-updated to cover Exadata 5-2 and Oracle Database 12c. If you're new to Exadata, you'll soon learn that it embodies a change in how you think about and manage relational databases. A key part of that change lies in the concept of offloading SQL processing to the storage layer. In addition there is Oracle's engineering effort in creating a powerful platform for both consolidation and transaction processing. The resulting value proposition in the form of Exadata has truly been a game-changer. Expert Oracle Exadata, 2nd Edition provides a look at the internals and how the combination of hardware and software that comprise Exadata actually work. Authors include Martin Bach, Andy Colvin, and Frits Hoogland, with contributions from Karl Arao, and built on the foundation laid by Kerry Osborne, Randy Johnson, and Tanel Poder in the first edition. They share their real-world experience gained through a great many Exadata implementations, possibly more than any other group of experts today. Always their goal is toward helping you advance your career through success with Exadata in your own environment. This book is intended for readers who want to understand what makes the platform tick and for whom—"how" it does what it is does is as important as what it does. By being exposed to the features that are unique to Exadata, you will gain an understanding of the mechanics that will allow you to fully benefit from the advantages that the platform provides. This book changes how you think about managing SQL performance and processing. It provides a roadmap to successful Exadata implementation. And it removes the "black box" mystique. You'll learn how Exadata actually works and be better able to manage your Exadata engineered systems in support of your business. This book: Changes the way you think about managing SQL performance and processing Provides a roadmap to successful Exadata implementation Removes the "black box" mystique, showing how Exadata actually works

Structured Search for Big Data

2015-08-26 O'Reilly Amazon

book

Mikhail Gilula

data data-engineering search Big Data Data Modelling DWH

The WWW era made billions of people dramatically dependent on the progress of data technologies, out of which Internet search and Big Data are arguably the most notable. Structured Search paradigm connects them via a fundamental concept of key-objects evolving out of keywords as the units of search. The key-object data model and KeySQL revamp the data independence principle making it applicable for Big Data and complement NoSQL with full-blown structured querying functionality. The ultimate goal is extracting Big Information from the Big Data. As a Big Data Consultant, Mikhail Gilula combines academic background with 20 years of industry experience in the database and data warehousing technologies working as a Sr. Data Architect for Teradata, Alcatel-Lucent, and PayPal, among others. He has authored three books, including The Set Model for Database and Information Systems and holds four US Patents in Structured Search and Data Integration. Conceptualizes structured search as a technology for querying multiple data sources in an independent and scalable manner. Explains how NoSQL and KeySQL complement each other and serve different needs with respect to big data Shows the place of structured search in the internet evolution and describes its implementations including the real-time structured internet search

Modernize Your IBM DB2 for IBM z/OS Maintenance with Utility Autonomics

2015-08-20 O'Reilly Amazon

book

Hennie Mynhardt , Dean Brown , Carlos Gomes , Arthur Marais

data data-engineering relational-databases ibm-db2 IBM

IBM® DB2® for IBM z/OS® helps lower the cost of managing data by automating administration, increasing storage efficiency, improving performance, and simplifying the deployment of virtual appliances. By automating tasks such as memory allocation, storage management, and business policy maintenance, DB2 is able to perform many management tasks itself, freeing up Database Administrators to focus on new projects. This IBM Redbooks® publication introduces autonomics for DB2 for z/OS. IBM provides several different components that, when combined, can create an autonomic database environment. All these respective components cover certain aspects of autonomics, which can collaborate into one coherent solution. In our evolution of autonomics and the need to move to smarter systems there has been a bigger drive to the concept of "Active" versus "Passive" autonomics. With the inclusion of the IBM Management Console for IMS™ and DB2 for z/OS and the Autonomics Director, it is now easier than ever to make that transition by leveraging the strength of the DB2 Utilities Solution Pack for z/OS all in one standardized and centralized interface. This publication guides you through the business reasons for adopting autonomic solutions, and provides step-by-step guidance to implement these capabilities in your DB2 for z/OS configuration. This publication is of interest primarily to DB2 Database Administrators and DB2 Systems Programmers, and for anyone looking to understand the benefits of DB2 autonomic solutions.

You: For Sale

2015-08-20 O'Reilly Amazon

book

Stuart Sumner

data data-engineering data-security-privacy data security & privacy Cyber Security

Everything we do online, and increasingly in the real world, is tracked, logged, analyzed, and often packaged and sold on to the highest bidder. Every time you visit a website, use a credit card, drive on the freeway, or go past a CCTV camera, you are logged and tracked. Every day billions of people choose to share their details on social media, which are then sold to advertisers. The Edward Snowden revelations that governments - including those of the US and UK – have been snooping on their citizens, have rocked the world. But nobody seems to realize that this has already been happening for years, with firms such as Google capturing everything you type into a browser and selling it to the highest bidder. Apps take information about where you go, and your contact book details, harvest them and sell them on – and people just click the EULA without caring. No one is revealing the dirty secret that is the tech firms harvesting customers’ personal data and selling it for vast profits – and people are totally unaware of the dangers. You: For Sale is for anyone who is concerned about what corporate and government invasion of privacy means now and down the road. The book sets the scene by spelling out exactly what most users of the Internet and smart phones are exposing themselves to via commonly used sites and apps such as facebook and Google, and then tells you what you can do to protect yourself. The book also covers legal and government issues as well as future trends. With interviews of leading security experts, black market data traders, law enforcement and privacy groups, You: For Sale will help you view your personal data in a new light, and understand both its value, and its danger. Provides a clear picture of how companies and governments harvest and use personal data every time someone logs on Describes exactly what these firms do with the data once they have it – and what you can do to stop it Learn about the dangers of unwittingly releasing private data to tech firms, including interviews with top security experts, black market data traders, law enforcement and privacy groups Understand the legal information and future trends that make this one of the most important issues today

Expert Oracle Application Express, Second Edition

2015-08-13 O'Reilly Amazon

book

Dan McGhan , Martin D’Souza , Nick Buytaert , Doug Gault , Jorge Rimblas , Roel Hartman , Tom Petrus , Christoph Ruepprich , Denes Kubicek , Karen Cannell , Francis Mignault , John Scott , Dimitri Gielis , Raj Mattamal

data data-engineering oracle-database-solutions Oracle SQL

Expert Oracle Application Express, 2nd Edition is newly updated for APEX 5.0 and brings deep insight from some of the best APEX practitioners in the field today. You'll learn about important features in APEX 5.0, and how those can be applied to make your development work easier and with greater impact on your business. Oracle Application Express (APEX) is an entirely web-based development framework that is built into every edition of Oracle Database. The framework rests upon Oracle’s powerful PL/SQL language, enabling power users and developers to rapidly develop applications that easily scale to hundreds, even thousands of concurrent users. APEX has seen meteoric growth and is becoming the tool of choice for ad-hoc application development in the enterprise. The many authors of Expert Oracle Application Express, 2nd Edition build their careers around APEX. They know what it takes to make the product sing—developing secure applications that can be deployed globally to users inside and outside a large enterprise. The authors come together in this book to share some of their deepest and most powerful insights into solving the difficult problems surrounding globalization, configuration and lifecycle management, and more. New in this edition for APEX 5.0 is coverage of Oracle REST Data Services, map integration, jQuery with APEX, and the new Page Designer. You’ll learn about debugging and performance, deep secrets to customizing your application user interface, how to secure applications from intrusion, and about deploying globally in multiple languages. Expert Oracle Application Express, 2nd Edition is truly a book that will move you and your skillset a big step towards the apex of Application Express development. Contains all-new content on Oracle REST Data Services, jQuery in APEX, and map integration Addresses globalization and other concerns of enterprise-level development Shows how to customize APEX for your own application needs

IBM TS7700 Virtualization Engine with R3.2

2015-08-13 O'Reilly Amazon

book

Aderson Pacini , Chen Zhu , Larry Coyne , Michael Scott , Joe Hew , Katja Denefleh , Takahiro Tsuda

data data-engineering IBM SAS

This IBM® Redbooks® publication highlights IBM TS7700 Virtualization Engine Release 3.2 (IBM TS7700). IBM TS7700 is part of a family of IBM System Storage® Enterprise tape products. This book is intended for system architects who want to integrate their storage systems for smoother operation. The IBM TS7700 offers a modular, scalable, and high-performing architecture for mainframe tape virtualization for the IBM System z® environment. It integrates IBM 3592 tape cartridges, high-performance disks, and a new disk cache subsystem into a storage hierarchy. This storage hierarchy is managed by robust storage management firmware with extensive self-management capability. It includes the following advanced functions: Policy management to control physical volume pooling Cache management Redundant copies, including across a grid network Copy mode control The IBM TS7700 Virtualization Engine offers enhanced statistical reporting. It also includes a standards-based Management Interface (MI) for IBM TS7700 management. The IBM TS7700 Release 3.2 continues the next generation of IBM TS7700 Virtualization Engine servers for System z tape: IBM TS7720 features encryption-capable, high-capacity cache using 3 terabyte (TB) serial-attached Small Computer System Interface (SAS) disk drives with Redundant Array of Independent Disks (RAID) 6, providing the ability to scale to very large capacities with the highest level of data protection. IBM TS7740 features encryption-capable 600 gigabyte (GB) SAS drives with RAID 6 protection. Both models write data by policy to physical tape through attachment to high-capacity, high-performance IBM TS1140 and earlier IBM 3592 model tape drives installed in IBM TS3500 tape libraries. Physical tape support is optional on IBM TS7720. These Virtualization Engines are based on IBM POWER7® technology. They offer improved performance for most System z tape workloads compared to the first generation of IBM TS7700 Virtualization Engine servers. IBM TS7700 Virtualization Engine Release 3.2 builds on the existing capabilities of the IBM TS7700 family. It also includes the following enhancements to the IBM TS7700 family: 25 GB logical volume sizes Options for attaching back-end physical tape to IBM TS7720 systems Up to eight repository partitions in a tape-attached IBM TS7720

IBM Software Defined Environment

2015-08-12 O'Reilly Amazon

book

Dino Quintero , Ashish Nainwal , Fabio Martins , Marcin Tabinowski , William M Genovese , KiWaon Kim , Dusan Smolej , Ming Jun MJ Li

data data-engineering IBM Analytics Cloud Computing

This IBM® Redbooks® publication introduces the IBM Software Defined Environment (SDE) solution, which helps to optimize the entire computing infrastructure--compute, storage, and network resources--so that it can adapt to the type of work required. In today's environment, resources are assigned manually to workloads, but that happens automatically in a SDE. In an SDE, workloads are dynamically assigned to IT resources based on application characteristics, best-available resources, and service level policies so that they deliver continuous, dynamic optimization and reconfiguration to address infrastructure issues. Underlying all of this are policy-based compliance checks and updates in a centrally managed environment. Readers get a broad introduction to the new architecture. Think integration, automation, and optimization. Those are enablers of cloud delivery and analytics. SDE can accelerate business success by matching workloads and resources so that you have a responsive, adaptive environment. With the IBM Software Defined Environment, infrastructure is fully programmable to rapidly deploy workloads on optimal resources and to instantly respond to changing business demands. This information is intended for IBM sales representatives, IBM software architects, IBM Systems Technology Group brand specialists, distributors, resellers, and anyone who is developing or implementing SDE.

Introduction to JavaScript Object Notation

2015-08-10 O'Reilly Amazon

book

Lindsay Bassett

data data-engineering storage-formats JSON API HTML

What is JavaScript Object Notation (JSON) and how can you put it to work? This concise guide helps busy IT professionals get up and running quickly with this popular data interchange format, and provides a deep understanding of how JSON works. Author Lindsay Bassett begins with an overview of JSON syntax, data types, formatting, and security concerns before exploring the many ways you can apply JSON today. From Web APIs and server-side language libraries to NoSQL databases and client-side frameworks, JSON has emerged as a viable alternative to XML for exchanging data between different platforms. If you have some programming experience and understand HTML and JavaScript, this is your book. Learn why JSON syntax represents data in name-value pairs Explore JSON data types, including object, string, number, and array Find out how you can combat common security concerns Learn how the JSON schema verifies that data is formatted correctly Examine the relationship between browsers, web APIs, and JSON Understand how web servers can both request and create data Discover how jQuery and other client-side frameworks use JSON Learn why the CouchDB NoSQL database uses JSON to store data

Pro Couchbase Development: A NoSQL Platform for the Enterprise

2015-08-05 O'Reilly Amazon

book

Deepak Vohra

data data-engineering nosql-databases couchbase Big Data Cassandra

Pro Couchbase Development: A NoSQL Platform for the Enterprise discusses programming for Couchbase using Java and scripting languages, querying and searching, handling migration, and integrating Couchbase with Hadoop, HDFS, and JSON. It also discusses migration from other NoSQL databases like MongoDB. This book is for big data developers who use Couchbase NoSQL database or want to use Couchbase for their web applications as well as for those migrating from other NoSQL databases like MongoDB and Cassandra. For example, a reason to migrate from Cassandra is that it is not based on the JSON document model with support for a flexible schema without having to define columns and supercolumns. The target audience is largely Java developers but the book also supports PHP and Ruby developers who want to learn about Couchbase. The author supplies examples in Java, PHP, Ruby, and JavaScript. After reading and using this hands-on guide for developing with Couchbase, you'll be able to build complex enterprise, database and cloud applications that leverage this powerful platform.

Learning NHibernate 4

2015-07-31 O'Reilly Amazon

book

Suhas H Chatekar

data data-engineering database-management-tools object-relational-mapping hibernate XML

Dive into the essentials of NHibernate 4 with this comprehensive guide. Designed for .NET developers, you will discover how to map domain models to databases effectively, perform various database operations, optimize performance, and apply powerful data access patterns using NHibernate. What this Book will help me do Understand how to map domain entities to a database schema using NHibernate's mapping mechanisms. Efficiently configure NHibernate for your application using XML configuration files. Perform CRUD operations and craft data retrieval strategies with NHibernate. Optimize your database-oriented application in terms of performance and memory. Apply NHibernate in real-world projects, including interaction with legacy databases. Author(s) Suhas H. Chatekar is an experienced software engineer who specializes in .NET development and database integration using ORM tools like NHibernate. With a passion for creating clear and operational technical resources, his comprehensive expertise ensures that his works empower developers to achieve practical outcomes. Who is it for? This book is perfect for .NET developers who are new to ORM tools and want to skillfully integrate NHibernate into their projects. Readers who have tried ORM solutions before or used NHibernate but wish to delve deeper into its capabilities will find this book invaluable. If your goal is to model databases effectively and utilize NHibernate for real-world applications, this guide is for you. Beginners to intermediate-level developers will benefit greatly from the step-by-step approach and clear explanations.

Getting Started with Hazelcast, Second Edition

2015-07-30 O'Reilly Amazon

book

Matthew Johns

data data-engineering API DevOps Java

This book is your gateway to mastering Hazelcast, a powerful open-source distributed data grid platform. By using Hazelcast, you'll gain the tools to manage data at scale within your modern applications while improving performance and reliability. What this Book will help me do Gain a comprehensive understanding of distributed data grids and Hazelcast's architecture. Master the configuration and deployment of Hazelcast clusters in various scenarios. Learn to design scalable and resilient systems using Hazelcast's in-memory features. Implement advanced messaging, querying, and processing using Hazelcast APIs. Enhance your applications with distributed caching and data sharing capabilities. Author(s) Matthew Johns is an experienced software engineer and author specializing in distributed systems and Java enterprise development. He has worked extensively in building scalable applications and is passionate about teaching others to leverage modern technologies. His practical approach to programming and clarity of instruction make complex topics accessible and actionable. Who is it for? This book is ideal for Java developers, software architects, and DevOps engineers seeking to enhance their skills in distributed systems. If you're looking to manage data at scale, improve application performance, and build resilient architectures, this book is for you. Whether new to distributed computing or experienced developers exploring Hazelcast, you'll find practical insights for your work. Readers should have basic Java knowledge to get the most out of this book.

IBM GDPS Family of Products: An Introduction to Concepts and Capabilities

2015-07-29 O'Reilly Amazon

book

John Thompson , Sim Schindel , Brian Cooper , David Clitherow

data data-engineering IBM

This IBM® Redbooks® publication presents an overview of the IBM Geographically Dispersed Parallel Sysplex™ (IBM GDPS®) offerings and the roles they play in delivering a business IT resilience solution. The book begins with general concepts of business IT resilience and disaster recovery, along with issues related to high application availability, data integrity, and performance. These topics are considered within the framework of government regulation, increasing application and infrastructure complexity, and the competitive and rapidly changing modern business environment. Next, it describes the GDPS family of offerings with specific reference to how they can help you achieve your defined goals for disaster recovery and high availability. Also covered are the features that simplify and enhance data replication activities, the prerequisites for implementing each offering, and tips for planning for the future and immediate business requirements. Tables provide easy-to-use summaries and comparisons of the offerings, and the additional planning and implementation services available from IBM are explained. Then, several practical client scenarios and requirements are described, along with the most suitable GDPS solution for each case. The introductory chapters of this publication are intended for a broad technical audience, including IT System Architects, Availability Managers, Technical IT Managers, Operations Managers, System Programmers, and Disaster Recovery Planners. The subsequent chapters provide more technical details about the GDPS offerings, and each can be read independently for those readers who are interested in specific topics. Therefore, if you do read all the chapters, be aware that some information is intentionally repeated.

PostgreSQL Replication, Second Edition

2015-07-28 O'Reilly Amazon

book

Hans-Jürgen Schönig

data data-engineering relational-databases postgresql Linux Cyber Security

The second edition of 'PostgreSQL Replication' by Hans-Jürgen Schönig is a comprehensive guide that empowers PostgreSQL database professionals to establish robust replication solutions. Through detailed explanations and expert techniques, you will learn how to enhance the security, scalability, and reliability of your PostgreSQL databases using modern replication methods. What this Book will help me do Master Point-in-Time Recovery to safeguard data and perform database recoveries effectively. Implement both synchronous and asynchronous streaming replication to suit different operational needs. Optimize database performance and scalability using tools like pgpool and PgBouncer. Ensure database high availability and data security through Linux High Availability configurations. Solve replication-related challenges by leveraging advanced knowledge of the PostgreSQL transaction log. Author(s) Hans-Jürgen Schönig, a seasoned PostgreSQL specialist, has years of experience architecting and optimizing PostgreSQL database systems for businesses of all sizes. With a strong focus on practical implementation and a passion for teaching, his writing bridges the gap between theoretical concepts and hands-on solutions, making PostgreSQL topics accessible and actionable. Who is it for? This book is tailored for PostgreSQL administrators and professionals seeking to implement robust database replication. Whether you're familiar with basic database administration or looking to deepen your expertise, this book provides valuable insights into replication strategies. It's ideal for those aiming to boost database performance and enhance operational reliability through advanced PostgreSQL features.

Programming ArcGIS with Python Cookbook, Second Edition

2015-07-28 O'Reilly Amazon

book

Eric Pimpler

data data-engineering location-data geographic-information-system-gis arcgis API

Dive into 'Programming ArcGIS with Python Cookbook, Second Edition,' an essential guide for automating your ArcGIS for Desktop tasks with hands-on Python recipes. Through this book, you will understand how to effectively handle GIS data, automate geoprocessing tasks, and extend ArcGIS functionalities to streamline your workflows and boost your productivity. What this Book will help me do Master the management of map documents, layer files, feature classes, and tables using Python. Automate common ArcGIS tasks such as map production, printing, and creating PDF map books programmatically. Learn to find and correct broken data links and make your datasets reliable. Develop custom geoprocessing tools and share them efficiently among your team or projects. Expand your knowledge by leveraging advanced practices such as Python scripting for ArcGIS Pro and REST API integration. Author(s) Eric Pimpler is an accomplished GIS professional and Python programmer with years of practical experience in geospatial science and technology. He specializes in teaching GIS automation using Python and aims to simplify complex concepts into approachable recipes for learners. Eric's writing is marked by clarity and a methodical approach, ensuring that readers can apply their new knowledge effectively. Who is it for? This book is aimed at GIS professionals, cartographers, or analysts who routinely work with ArcGIS and want to streamline their workflow. If you have foundational experience with ArcGIS and basic Python programming skills, this book will build upon them, offering practical recipes to extend your capabilities. It's perfect for those looking to enhance their efficiency and automate their GIS tasks. By the end of this book, readers will have skills valuable to GIS experts and data analysts alike.

Neo4j Graph Data Modelling

2015-07-27 O'Reilly Amazon

book

Mahesh K Lal

data data-engineering graph-databases Neo4j Data Modelling

Neo4j Graph Data Modelling provides practical guidance in designing and implementing graph databases using Neo4j. This book walks you through modeling concepts, database evolution, and performance optimization. You'll learn how to model real-world domains, write Cypher queries, and adapt your database as requirements change. What this Book will help me do Model data effectively using Neo4j to represent complex relationships. Translate real-world problems into graph database designs efficiently. Write optimized Cypher queries to retrieve and manipulate data. Improve database performance through thoughtful design practices. Adapt and evolve databases seamlessly as application needs change. Author(s) Mahesh K Lal is an experienced developer and database specialist with a deep understanding of graph data modeling. With a focus on practical and accessible instruction, Mahesh's work provides actionable insights into database design. Neo4j Graph Data Modelling reflects his years of hands-on experience with Neo4j. Who is it for? This book is designed for software developers and data professionals looking to explore graph databases. If you aim to effectively model real-world situations using Neo4j or optimize database queries, this guide is for you. Prior experience with databases is helpful but not mandatory.

Oracle Goldengate 12c Implementers Guide

2015-07-27 O'Reilly Amazon

book

John P Jeffries

data data-engineering oracle-database-solutions Oracle

Master Oracle GoldenGate 12c to manage high-volume data replication and integration in real time. This guide provides you with comprehensive knowledge and skills to optimize database processes through GoldenGate's capabilities. What this Book will help me do Install and configure Oracle GoldenGate 12c within your environment effectively. Leverage GoldenGate's high-availability features for robust system setups. Optimize replication processes with advanced configuration and performance tuning techniques. Troubleshoot common GoldenGate issues to ensure seamless operations. Apply best practices for GoldenGate in enterprise database architectures. Author(s) John P Jeffries, the author of this guide, is a seasoned Oracle database consultant with over a decade of experience specializing in high-availability architectures and data replication. His mission is to make complex systems accessible through clear and detailed instructional writing. Who is it for? This book is designed for Oracle database administrators wanting to integrate GoldenGate into their architecture. Ideal for solution architects building robust systems and project managers overseeing database projects. A basic understanding of Oracle databases is assumed, but no prior knowledge of GoldenGate is required.

Spark Cookbook

2015-07-27 O'Reilly Amazon

book

Rishi Yadav

data data-engineering apache-spark AI/ML Analytics Big Data

Spark Cookbook is your practical guide to mastering Apache Spark, encompassing a comprehensive set of patterns and examples. Through its over 60 recipes, you will gain actionable insights into using Spark Core, Spark SQL, Spark Streaming, MLlib, and GraphX effectively for your big data needs. What this Book will help me do Understand how to install and configure Apache Spark in various environments. Build data pipelines and perform real-time analytics with Spark Streaming. Utilize Spark SQL for interactive data querying and reporting. Apply machine learning workflows using MLlib, including supervised and unsupervised models. Develop optimized big data solutions and integrate them into enterprise platforms. Author(s) None Yadav, the author of Spark Cookbook, is an experienced data engineer and technical expert with deep insights into big data processing frameworks. Yadav has spent years working with Spark and its ecosystem, providing practical guidance to developers and data scientists alike. This book reflects their commitment to sharing actionable knowledge. Who is it for? This book is designed for data engineers, developers, and data scientists who work with big data systems and wish to utilize Apache Spark effectively. Whether you're looking to optimize existing Spark applications or explore its libraries for new use cases, this book will provide the guidance you need. A basic familiarity with big data concepts and programming in languages like Java or Python is recommended to make the most out of this book.

ElasticSearch Blueprints

2015-07-24 O'Reilly Amazon

book

Vineeth Mohan

data data-engineering search elasticsearch Analytics ELK

Dive into search technology with "ElasticSearch Blueprints"! This is the perfect project-based guide to help you master Elasticsearch. You will learn how to build and design scalable, effective search solutions, improve search relevancy, manage data efficiently, perform analytics, and visualize your data in comprehensive ways. What this Book will help me do Build and fine-tune scalable search engine features with Elasticsearch. Design and implement accurate ecommerce search solutions using filters. Analyze and visualize data with Elasticsearch's powerful data aggregation capabilities. Increase search relevancy and enhance user query assistance using analyzers. Incorporate enhanced data organization methods, including parent-child relationships. Author(s) None Mohan is an experienced professional specializing in search technologies. With a strong technical background, they have engaged deeply with Elasticsearch, creating solutions that address practical challenges. Their approach focuses on making technical topics accessible, guiding readers step-by-step through projects. Who is it for? This book is tailored for data professionals, application developers, and enthusiasts eager to delve into search technologies. Whether you're beginning with Elasticsearch or aiming to refine your skills, this guide will advance your expertise. By working through practical cases, you'll gain confidence in using Elasticsearch effectively to meet diverse requirements.

Virtualizing Hadoop: How to Install, Deploy, and Optimize Hadoop in a Virtualized Architecture

2015-07-20 O'Reilly Amazon

book

George J. Trujillo Jr. , Charles Kim , Rommel Garcia , Justin Murray , Steven Jones

data data-engineering Hadoop Big Data Cloud Computing Data Management

Plan and Implement Hadoop Virtualization for Maximum Performance, Scalability, and Business Agility Enterprises running Hadoop must absorb rapid changes in big data ecosystems, frameworks, products, and workloads. Virtualized approaches can offer important advantages in speed, flexibility, and elasticity. Now, a world-class team of enterprise virtualization and big data experts guide you through the choices, considerations, and tradeoffs surrounding Hadoop virtualization. The authors help you decide whether to virtualize Hadoop, deploy Hadoop in the cloud, or integrate conventional and virtualized approaches in a blended solution. First, Virtualizing Hadoop reviews big data and Hadoop from the standpoint of the virtualization specialist. The authors demystify MapReduce, YARN, and HDFS and guide you through each stage of Hadoop data management. Next, they turn the tables, introducing big data experts to modern virtualization concepts and best practices. Finally, they bring Hadoop and virtualization together, guiding you through the decisions you’ll face in planning, deploying, provisioning, and managing virtualized Hadoop. From security to multitenancy to day-to-day management, you’ll find reliable answers for choosing your best Hadoop strategy and executing it. Coverage includes the following: • Reviewing the frameworks, products, distributions, use cases, and roles associated with Hadoop • Understanding YARN resource management, HDFS storage, and I/O • Designing data ingestion, movement, and organization for modern enterprise data platforms • Defining SQL engine strategies to meet strict SLAs • Considering security, data isolation, and scheduling for multitenant environments • Deploying Hadoop as a service in the cloud • Reviewing the essential concepts, capabilities, and terminology of virtualization • Applying current best practices, guidelines, and key metrics for Hadoop virtualization • Managing multiple Hadoop frameworks and products as one unified system • Virtualizing master and worker nodes to maximize availability and performance • Installing and configuring Linux for a Hadoop environment

talk-data.com

O'Reilly Data Engineering Books

Top Topics

Top Speakers

Oracle SOA Suite 12c Handbook

The Architecture of Privacy

Learning RSLogix 5000 Programming

Learning YARN

OCA/OCP Oracle Database 12c All-in-One Exam Guide (Exams 1Z0-061, 1Z0-062, & 1Z0-063), 2nd Edition

Performance Optimization and Tuning Techniques for IBM Power Systems Processors Including IBM POWER8

Expert Oracle Exadata, Second Edition

Structured Search for Big Data

Modernize Your IBM DB2 for IBM z/OS Maintenance with Utility Autonomics

You: For Sale

Expert Oracle Application Express, Second Edition

IBM TS7700 Virtualization Engine with R3.2

IBM Software Defined Environment

Introduction to JavaScript Object Notation

Pro Couchbase Development: A NoSQL Platform for the Enterprise

Learning NHibernate 4

Getting Started with Hazelcast, Second Edition

IBM GDPS Family of Products: An Introduction to Concepts and Capabilities

PostgreSQL Replication, Second Edition

Programming ArcGIS with Python Cookbook, Second Edition

Neo4j Graph Data Modelling

Oracle Goldengate 12c Implementers Guide

Spark Cookbook

ElasticSearch Blueprints

Virtualizing Hadoop: How to Install, Deploy, and Optimize Hadoop in a Virtualized Architecture