O'Reilly Data Engineering Books

Guide to IBM PowerHA SystemMirror for AIX Version 7.1.3

2014-09-28 O'Reilly Amazon

book

Dino Quintero , Ashish Nainwal , Kunal Langer , Bernhard Buehler , Minh Pham , Katharina Probs , Bjorn Roden , Isac Silva , Luciano Martins , Marian Tomescu , Bharathraj Keshavamurthy , Alex Abderrazag , Primitivo Cervantes , Matt Radford , Ben Swinney , Yefei Song , Michael Schmut , Sascha Wycisk , Ashraf Ali Thajudeen

data data-engineering IBM

This IBM® Redbooks® publication for IBM Power Systems™ with IBM PowerHA® SystemMirror® Standard and Enterprise Editions (hardware, software, practices, reference architectures, and tools) documents a well-defined deployment model within an IBM Power Systems environment. It guides you through a planned foundation for a dynamic infrastructure for your enterprise applications. This information is for technical consultants, technical support staff, IT architects, and IT specialists who are responsible for providing high availability and support for the IBM PowerHA SystemMirror Standard and Enterprise Editions on IBM POWER® systems.

Building 360-Degree Information Applications

2014-09-26 O'Reilly Amazon

book

Edward Thorne , Uday K Nandam , Colin Dean , Whei-Jen Chen , Soma Shekar Naganna , Bruce Adams

data data-engineering IBM infosphere Data Management Master Data Management

Today's businesses, applications, social media, and online transactions generate more data than ever before. This data can be explored and analyzed to provide tremendous business value. IBM® Watson™ Explorer and IBM InfoSphere® Master Data Management (InfoSphere MDM) enable organizations to simultaneously explore and derive insights from enterprise data that was traditionally stored in "silos" in enterprise applications, different data repositories, and in different data formats. This IBM Redbooks® publication provides information about Watson Explorer 9.0, InfoSphere MDM, and IBM InfoSphere MDM Probabilistic Matching Engine for InfoSphere BigInsights™ (PME for BigInsights). It gives you an overview, describes the architecture, and presents use cases that you can use to accomplish the following tasks: Understand the core capabilities of Watson Explorer, InfoSphere MDM, and PME for BigInsights. Realize the full potential of Watson Explorer applications. Describe the integration and value of the combination of Watson Explorer and InfoSphere MDM. Build a 360-degree information application. Learn by example by following hands-on lab scenarios.

Getting Started with Impala

2014-09-25 O'Reilly Amazon

book

John Russell

data data-engineering Hadoop impala Analytics Big Data

Learn how to write, tune, and port SQL queries and other statements for a Big Data environment, using Impala—the massively parallel processing SQL query engine for Apache Hadoop. The best practices in this practical guide help you design database schemas that not only interoperate with other Hadoop components, and are convenient for administers to manage and monitor, but also accommodate future expansion in data size and evolution of software capabilities. Written by John Russell, documentation lead for the Cloudera Impala project, this book gets you working with the most recent Impala releases quickly. Ideal for database developers and business analysts, the latest revision covers analytics functions, complex types, incremental statistics, subqueries, and submission to the Apache incubator. Getting Started with Impala includes advice from Cloudera’s development team, as well as insights from its consulting engagements with customers. Learn how Impala integrates with a wide range of Hadoop components Attain high performance and scalability for huge data sets on production clusters Explore common developer tasks, such as porting code to Impala and optimizing performance Use tutorials for working with billion-row tables, date- and time-based values, and other techniques Learn how to transition from rigid schemas to a flexible model that evolves as needs change Take a deep dive into joins and the roles of statistics

Oracle BPM Suite 12c Modeling Patterns

2014-09-25 O'Reilly Amazon

book

Vivek Acharya

data data-engineering oracle-database-solutions Cloud Computing KPI Oracle

Dive deep into Oracle BPM Suite 12c with this comprehensive guide on workflow automation and modeling patterns. This book equips you with practical knowledge on designing and implementing intricate Business Process Management solutions using Oracle BPM Suite 12c, making complex processes harmonious and effective. What this Book will help me do Master advanced flow control and branching patterns in Oracle BPM Suite 12c. Implement human workflow integration for better interaction within BPM applications. Learn effective handling of process correlation and exceptions for robust solutions. Explore adaptive case management for handling unpredictable business needs. Understand the usage of predictive analysis and KPIs in business processes. Author(s) None Acharya is a seasoned Oracle BPM and IT solutions expert. With years of practical experience in enterprise process management and cloud-based application development, None combines a technical depth with a passion for sharing knowledge. Their approach to writing ensures a balance of theory and actionable insights. Who is it for? This book is designed for enterprise and solution architects, developers, process analysts, and technical or functional consultants with a focus on process modeling and enterprise IT solutions. Intended readers should have a foundational understanding of BPM concepts. By reading this book, you'll enhance your ability to work with Oracle BPM Suite 12c in professional environments. Ideal for those seeking to deepen their expertise in BPM and workflow automation.

Master Competitive Analytics with Oracle Endeca Information Discovery

2014-09-24 O'Reilly Amazon

book

William Smith , Helen Sun

data data-engineering oracle-database-solutions Analytics Big Data Oracle

Oracle Endeca Information Discovery Best Practices Maximize the powerful capabilities of this self-service enterprise data discovery platform. Master Competitive Analytics with Oracle Endeca Information Discovery reveals how to unlock insights from any type of data, regardless of structure. The first part of the book is a complete technical guide to the product's architecture, components, and implementation. The second part presents a comprehensive collection of business analytics use cases in various industries, including financial services, healthcare, research, manufacturing, retail, consumer packaged goods, and public sector. Step-by-step instructions on implementing some of these use cases are included in this Oracle Press book. Install and manage Oracle Endeca Server Design Oracle Endeca Information Discovery Studio visualizations to facilitate user-driven data exploration and discovery Enable enterprise-driven data exploration with Oracle Endeca Information Discovery Integrator Develop and implement a fraud detection and analysis application Build a healthcare correlation application that integrates claims, patient, and operations analysis; partners; clinical research; and remote monitoring Use an enterprise architecture approach to incrementally establish big data and analytical capabilities

Mastering MariaDB

2014-09-24 O'Reilly Amazon

book

Federico Razzoli

data data-engineering relational-databases MySQL MariaDB Cyber Security

In 'Mastering MariaDB', you'll explore advanced techniques for managing, optimizing, and maintaining your MariaDB database servers. This book teaches you how to analyze query performance, implement best practices for security, and ensure data durability and availability through robust configurations and procedures. What this Book will help me do Learn to analyze query performance to optimize database efficiency. Gain expertise in implementing secure and organized role-based access controls. Achieve effective database backups with reliable recovery strategies. Set up replication environments to enable high availability of data. Explore clustering and sharding techniques for powerful scalability solutions. Author(s) None Razzoli is an experienced database administrator and author, specializing in MariaDB and other open-source database solutions. With an extensive career in database optimization and troubleshooting, None has helped businesses achieve greater performance and reliability with their systems. As a writer, they focus on clear, precise instruction and practical, real-world applications of database technology. Who is it for? This book is ideal for database administrators and system engineers who are already familiar with the basics of MariaDB and want to elevate their expertise. Intermediate MariaDB users looking to optimize their systems, ensure data security, and set up advanced configurations will find this book immensely helpful. It's written for professionals who actively manage database systems and require advanced knowledge to improve performance and reliability. If you're seeking deeper insights into MariaDB's features and scalability techniques, this book is for you.

Reliability and Performance with IBM DB2 Analytics Accelerator V4.1

2014-09-24 O'Reilly Amazon

book

Steve Speller , Ravi Kumar , Anna Griner , Paolo Bruni , Ruiping Li , James Guo , Andy Perkins , Jason Arnold , Jeff Feinsmith , Leticia Cruz , Dino Tonelli , Jonathan Sloan , Chris Harlander , Willie Favero , Johannes Kern

data data-engineering relational-databases ibm-db2 Analytics BI

The IBM® DB2® Analytics Accelerator for IBM z/OS® is a high-performance appliance that integrates the IBM zEnterprise® infrastructure with IBM PureData™ for Analytics, powered by IBM Netezza® technology. With this integration, you can accelerate data-intensive and complex queries in a DB2 for z/OS highly secure and available environment. DB2 and the Analytics Accelerator appliance form a self-managing hybrid environment running online transaction processing and online transactional analytical processing concurrently and efficiently. These online transactions run together with business intelligence and online analytic processing workloads. DB2 Analytics Accelerator V4.1 expands the value of high-performance analytics. DB2 Analytics Accelerator V4.1 opens to static Structured Query Language (SQL) applications and row set processing, minimizes data movement, reduces latency, and improves availability. This IBM Redbooks® publication provides technical decision-makers with an understanding of the benefits of version 4.1 of the Analytics Accelerator with DB2 11 for z/OS. It describes the installation of the new functions, and the advantages to existing analytical processes as measured in our test environment. This book also introduces the DB2 Analytics Accelerator Loader V1.1, a tool that facilitates the data population of the DB2 Analytics Accelerator.

DynamoDB Applied Design Patterns

2014-09-23 O'Reilly Amazon

book

Uchit Hamendra Vyas

data data-engineering nosql-databases DynamoDB API AWS

In "DynamoDB Applied Design Patterns", you'll dive deep into the effective design patterns that optimize the performance of applications using DynamoDB. Through practical examples and best practices, this guide empowers developers to create scalable, efficient, and robust DynamoDB implementations. What this Book will help me do Master how to design effective data models using DynamoDB's native features such as tables, attributes, and indexes. Learn to utilize DynamoDB features like global and local secondary indexes to optimize performance. Gain in-depth knowledge on managing and querying DynamoDB using AWS services and tools. Integrate DynamoDB seamlessly with AWS services such as Redshift, S3, and MapReduce. Leverage advanced DynamoDB API features to retrieve data efficiently for diverse application use cases. Author(s) Uchit Hamendra Vyas is a highly skilled professional specializing in AWS and cloud computing. With years of experience as a developer and architect, he brings practical insights into designing efficient database solutions. His approachable teaching style makes complex topics clear and accessible. Who is it for? This book is designed for developers working with or interested in using DynamoDB in their projects. It assumes a moderate familiarity with database design and AWS concepts. Readers aiming to enhance their DynamoDB skills and optimize performance will greatly benefit. If you're looking to take your NoSQL database knowledge to the next level, this book is for you.

I Heart Logs

2014-09-23 O'Reilly Amazon

book

Jay Kreps

data data-engineering log-data NoSQL

Why a book about logs? That’s easy: the humble log is an abstraction that lies at the heart of many systems, from NoSQL databases to cryptocurrencies. Even though most engineers don’t think much about them, this short book shows you why logs are worthy of your attention. Based on his popular blog posts, LinkedIn principal engineer Jay Kreps shows you how logs work in distributed systems, and then delivers practical applications of these concepts in a variety of common uses—data integration, enterprise architecture, real-time stream processing, data system design, and abstract computing models. Go ahead and take the plunge with logs; you’re going love them. Learn how logs are used for programmatic access in databases and distributed systems Discover solutions to the huge data integration problem when more data of more varieties meet more systems Understand why logs are at the heart of real-time stream processing Learn the role of a log in the internals of online data systems Explore how Jay Kreps applies these ideas to his own work on data infrastructure systems at LinkedIn

MariaDB High Performance

2014-09-23 O'Reilly Amazon

book

Pierre Mavro

data data-engineering relational-databases MySQL MariaDB Linux

Learn how to optimize your MariaDB installations for performance, scalability, and reliability. This comprehensive guide teaches you advanced replication techniques, clustering, sharding data, and optimizing engines. By the end of the book, you'll have the tools to build complex, high-performing database infrastructures. What this Book will help me do Master the setup of advanced replication models such as master/slave and dual-master setups. Implement a Galera Cluster to enhance your database's write scalability and reliability. Configure the Spider engine for effective data sharding across multiple nodes. Identify and mitigate performance bottlenecks through in-depth engine optimization. Deploy and maintain a disaster recovery solution using advanced database strategies. Author(s) Pierre Mavro is an experienced database administrator with years of expertise in MySQL and MariaDB systems. Having worked on numerous high-scale projects, he brings practical insights into creating robust, high-performance database solutions. Pierre enjoys sharing his knowledge and helping developers and administrators navigate complex database challenges. Who is it for? This book is perfect for database administrators and system architects who already have working experience with MariaDB or MySQL. If you're looking to enhance your skills and develop solutions for high traffic applications, this book is for you. Ideal for those comfortable with Linux-based infrastructures and eager to learn. Whether maintaining high-availability systems or scaling databases, you'll find this work essential.

WS-BPEL 2.0 Beginner's Guide

2014-09-22 O'Reilly Amazon

book

Matjaz B Juric

data data-engineering oracle-database-solutions Oracle

The "WS-BPEL 2.0 Beginner's Guide" is an essential resource for getting started with designing and implementing WS-BPEL 2.0-based business processes using Oracle SOA Suite 12c. The book introduces you to core concepts, guides you through practical activities, and equips you with skills to effectively use WS-BPEL for orchestrating and managing web services. What this Book will help me do Understand the core syntax and semantics of WS-BPEL 2.0 for designing business processes. Learn to utilize the Oracle SOA Suite 12c to build and deploy WS-BPEL executable processes. Develop skills to use WS-BPEL features such as variables, conditions, loops, and error handling for robust implementations. Gain practical insights into asynchronous operations, dynamic invocations, and messaging patterns in WS-BPEL. Master advanced WS-BPEL concepts like human task integration, event handling, and compensation mechanisms. Author(s) Matjaz B. Juric is a renowned author and technical expert with extensive experience in SOA and BPM technologies. He has authored multiple books focusing on enterprise systems and has a background in academic research and teaching. His approachable and thorough writing style ensures that readers are able to grasp complex topics and apply them effectively in their work. Who is it for? This book is directed at software professionals such as architects, designers, and developers who are working on or looking to adopt SOA and BPM solutions within their projects. Readers should have a basic understanding of SOA concepts and web services but do not need prior experience with WS-BPEL. It is suitable for those aiming to learn WS-BPEL 2.0 from scratch to become proficient in designing and implementing business processes.

Practical Migration from x86 to Linux on IBM System z

2014-09-18 O'Reilly Amazon

book

Craig Gardner , Berthold Gunreben , Serkan Sahin , Eduardo Simoes Franco , Lydia Parziale , Tito Ogando

data data-engineering IBM Linux Virtual Machine

There are many reasons why you would want to optimize your servers through virtualization using Linux on IBM® System z®: Too many distributed physical servers with low utilization A lengthy provisioning process that delays the implementation of new applications Limitations in data center power and floor space High total cost of ownership (TCO) Difficulty allocating processing power for a dynamic environment Next, we describe total cost of ownership analyses and we guide you in understanding how to analyze your environment before beginning a migration project. We also assist you in determining the expected consolidation ratio for a given workload type. We also describe virtualization concepts along with describing the benefits of migrating from the x86 environment to guests residing on an IBM z/VM® single system image with live guest relocation. This IBM Redbooks publication walks you through a migration approach, includes planning worksheets, as well as a chapter to assist you in analyzing your own systems. We also discuss post migration considerations such as acceptance testing of functionality and performance measurements.

Implementing IBM FlashSystem 840

2014-09-16 O'Reilly Amazon

book

Karen Orlando , Matthew Levan , Detlef Helmbrecht , Carsten Larsen , Chip Elmblad

data data-engineering IBM Cloud Computing

Almost all technological components in the data center are getting faster; central processing units, network, storage area networks (SAN), and memory. All of them have improved their speed by a minimum of 10X; some of them by 100X, for example, data networks. However, spinning disk performance has only increased by 1.2 times. The IBM FlashSystem™ 840 closes this gap. The FlashSystem 840 is optimized for the data center to enable organizations of all sizes to strategically harness the value of stored data. It provides flexible capacity and extreme performance for the most demanding applications, including virtualized or bare-metal online transaction processing (OLTP) and online analytical processing (OLAP) databases, virtual desktop infrastructures (VDI), technical computing applications, and cloud environments. The system accelerates response times with IBM® MicroLatency™ access times as low as 90 µs write latency and 135 µs read latency to enable faster decision making. The introduction of a low capacity 1 TB flash module allows FlashSystem 840 to be configured in capacity points as low as 2 TB in protected RAID 5 mode. Coupled with 10 GB iSCSI, FlashSystem is positioned to bring extreme performance to small and medium-sized businesses (SMB) and growth markets. Implementing the IBM FlashSystem 840 provides value that goes beyond those benefits that are seen on disk-based arrays. These benefits include better user experience, server and application consolidation, development cycle reduction, application scalability, data center footprint savings, and improved price performance economics. This IBM Redbooks® publication introduces clients to the IBM FlashSystem. It provides in-depth knowledge of the product architecture, software and hardware, its implementation, and hints and tips. Also illustrated are use cases that show real-world solutions for tiering, flash-only, and preferred read, as well as examples of the benefits gained by integrating FlashSystem storage into business environments. Also described are product integration scenarios running the IBM FlashSystem 840 with the IBM SAN Volume Controller, the IBM PureFlex® System, and the IBM Storwize® V7000, as well as considerations when integrating with the IBM FlashSystem 840. The preferred practice guidance is provided for your FlashSystem environment with IBM 16 Gbps b-type products and features, focusing on Fibre Channel design. This book is intended for pre-sales and post-sales technical support professionals and storage administrators, and for anyone who wants to understand and learn how to implement this new and exciting technology.

SQL Server Query Performance Tuning,Fourth Edition

2014-09-16 O'Reilly Amazon

book

Grant Fritchey

data data-engineering SQL

Queries not running fast enough? Wondering about the in-memory database features in 2014? Tired of phone calls from frustrated users? Grant Fritchey's book SQL Server Query Performance Tuning is the answer to your SQL Server query performance problems. The book is revised to cover the very latest in performance optimization features and techniques, especially including the newly-added, in-memory database features formerly known under the code name Project Hekaton. This book provides the tools you need to approach your queries with performance in mind. SQL Server Query Performance Tuning leads you through understanding the causes of poor performance, how to identify them, and how to fix them. You’ll learn to be proactive in establishing performance baselines using tools like Performance Monitor and Extended Events. You’ll learn to recognize bottlenecks and defuse them before the phone rings. You’ll learn some quick solutions too, but emphasis is on designing for performance and getting it right, and upon heading off trouble before it occurs. Delight your users. Silence that ringing phone. Put the principles and lessons from SQL Server Query Performance Tuning into practice today. Covers the in-memory features from Project Hekaton Helps establish performance baselines and monitor against them Guides in troubleshooting and eliminating of bottlenecks that frustrate users

Using Flume

2014-09-16 O'Reilly Amazon

book

Hari Shreedharan

data data-engineering log-data API ELK GitHub

How can you get your data from frontend servers to Hadoop in near real time? With this complete reference guide, you’ll learn Flume’s rich set of features for collecting, aggregating, and writing large amounts of streaming data to the Hadoop Distributed File System (HDFS), Apache HBase, SolrCloud, Elastic Search, and other systems. Using Flume shows operations engineers how to configure, deploy, and monitor a Flume cluster, and teaches developers how to write Flume plugins and custom components for their specific use-cases. You’ll learn about Flume’s design and implementation, as well as various features that make it highly scalable, flexible, and reliable. Code examples and exercises are available on GitHub. Learn how Flume provides a steady rate of flow by acting as a buffer between data producers and consumers Dive into key Flume components, including sources that accept data and sinks that write and deliver it Write custom plugins to customize the way Flume receives, modifies, formats, and writes data Explore APIs for sending data to Flume agents from your own applications Plan and deploy Flume in a scalable and flexible way—and monitor your cluster once it’s running

HL7 for BizTalk

2014-09-15 O'Reilly Amazon

book

Vikas Bhardwaj , Howard Edidin

data data-engineering streaming-messaging enterprise-service-bus microsoft-biztalk-server Azure

Vikas Bhardwaj is a Technical Architect at Syntel Inc. Vikas has 14 years of IT experience with Microsoft Technologies like BizTalk Server, .NET, C#, SQL Server. Vikas has implemented various integration solution using BizTalk Server including one of the largest implementation of BizTalk and HL7. Vikas presently lives in Nashville, Tennessee with his wife Poonam and two kids Shivam & Ayaan. You can check out Vikas' blog at http://vikasbhardwaj15.blogspot.com/ and Vikas can be contacted directly at [email protected]. HL7 for BizTalk provides a detailed guide to the planning and delivery of a HL7-compliant system using the dedicated Microsoft BizTalk for HL7 Accelerator. The HL7 Primary Standard, its various versions, and the use of the HL7 Accelerator for BizTalk are broken out and fully explained. HL7 for BizTalk provides clear guidance on the specific healthcare scenarios that HL7 is designed to overcome and provides working case study models of how HL7 solutions can be implemented in BizTalk, deployed in practice and monitored during operation. Special emphasis is given in this book to the BizTalk reporting functionality and its use to provide HL7 oversight within organizations. HL7 for BizTalk is suitable for use with BizTalk versions from 2006 R2 to 2013 R2 to suit the reader organization. All three versions of the HL7 standard and their differences, are explained. Howard S. Edidin is an integrations architect specializing in enterprise application integration. Howard runs his own consulting firm, Edidin Group, Inc, which is a Gold Member of the HL7 International Organization. Howard's firm specializes in delivering HL7 and HIPAA Healthcare solutions and providing guidance in the use of HL7 with BizTalk. Howard is active in several HL7 Working Groups and is involved with the development of a new HL7 Standard. In addition to BizTalk, Howard works with Azure, SQL Server, and SharePoint. Howard and his wife Sharon, live in a northern suburb of Chicago. Howard maintains several blogs, biztalkin-howard.blogspot.com and fhir-biztalk.com. Howard can be contacted directly at [email protected].

IBM z/OS V2.1 DFSMS Technical Update

2014-09-15 O'Reilly Amazon

book

Mary Lovelace , Norbert Schlumberger , Jose Dovidauskas , Anthony Fletcher , Gert Laumann

data data-engineering IBM

Each release of IBM® z/OS® DFSMS builds upon the previous version to provide enhanced storage management, data access, device support, program management, and distributed data access for the z/OS platform in a system-managed storage environment. This IBM Redbooks® publication provides a summary of the functions and enhancements integrated into z/OS V2.1 DFSMS. It provides you with the information that you need to understand and evaluate the content of this DFSMS release, along with practical implementation hints and tips. This book is written for storage professionals and system programmers who have experience with the components of DFSMS. It provides sufficient information so that you can start prioritizing the implementation of new functions and evaluating their applicability in your DFSMS environment.

IBM DS8870 Architecture and Implementation

2014-09-12 O'Reilly Amazon

book

Maged Sallam , Bertrand Dufrasne , Jean Francois Lepine , Jeff Cook , Stephen Manthorpe , Juan Brandenburg

data data-engineering IBM

This IBM® Redbooks® publication describes the concepts, architecture, and implementation of the IBM DS8870. The book provides reference information to assist readers who need to plan for, install, and configure the DS8870. The IBM DS8870 is the most advanced model in the IBM DS8000® series and is equipped with IBM POWER7+™ based controllers. Various configuration options are available that scale from dual 2-core systems up to dual 16-core systems with up to 1 TB of cache. The DS8870 features an integrated high-performance flash enclosure with flash cards that can delivers up to 250,000 IOPS and up to 3.4 GBps bandwidth. A high performance all-flash drive configuration is also available. The DS8870 also features enhanced 8 Gbps device adapters and host adapters. Connectivity options, with up to 128 Fibre Channel/IBM FICON® ports for host connections, make the DS8870 suitable for multiple server environments in open systems and IBM System z® environments. The DS8870 supports advanced disaster recovery solutions, business continuity solutions, and thin provisioning. All disk drives in the DS8870 storage system have the Full Disk Encryption (FDE) feature. The DS8870 also can be integrated in a Lightweight Directory Access Protocol (LDAP) infrastructure. The DS8870 can automatically optimize the use of each storage tier, particularly flash drives and flash cards, through the IBM Easy Tier® feature, which is available at no extra charge.

Professional Microsoft SQL Server 2014 Administration

2014-09-09 O'Reilly Amazon

book

Ross LoForte , Bradley Ball , Brian Knight , Adam Jorgensen , Steven Wort

data data-engineering relational-databases microsoft-sql-server Cloud Computing Microsoft

Learn to take advantage of the opportunities offered by SQL Server 2014 Microsoft's SQL Server 2014 update means big changes for database administrators, and you need to get up to speed quickly because your methods, workflow, and favorite techniques will be different from here on out. The update's enhanced support of large-scale enterprise databases and significant price advantage mean that SQL Server 2014 will become even more widely adopted across the industry. The update includes new backup and recovery tools, new AlwaysOn features, and enhanced cloud capabilities. In-memory OLTP, Buffer Pool Extensions for SSDs, and a new Cardinality Estimator can improve functionality and smooth out the workflow, but only if you understand their full capabilities. Professional Microsoft SQL Server 2014 is your comprehensive guide to working with the new environment. Authors Adam Jorgensen, Bradley Ball, Ross LoForte, Steven Wort, and Brian Knight are the dream team of the SQL Server community, and they put their expertise to work guiding you through the changes. Improve oversight with better management and monitoring Protect your work with enhanced security features Upgrade performance tuning, scaling, replication, and clustering Learn new options for backup and recovery Professional Microsoft SQL Server 2014 includes a companion website with sample code and efficient automation utilities, plus a host of tips, tricks, and workarounds that will make your job as a DBA or database architect much easier. Stop getting frustrated with administrative issues and start taking control. Professional Microsoft SQL Server 2014 is your roadmap to mastering the update and creating solutions that work.

Sams Teach Yourself NoSQL with MongoDB in 24 Hours

2014-09-08 O'Reilly Amazon

book

Brad Dayley

data data-engineering nosql-databases MongoDB Big Data Java

NoSQL database usage is growing at a stunning 50% per year, as organizations discover NoSQL's potential to address even the most challenging Big Data and real-time database problems. Every NoSQL database is different, but one is the most popular by far: MongoDB. Now, in just 24 lessons of one hour or less, you can learn how to leverage MongoDB's immense power. Each short, easy lesson builds on all that's come before, teaching NoSQL concepts and MongoDB techniques from the ground up. Sams Teach Yourself NoSQL with MongoDB in 24 Hours covers all this, and much more: Learning how NoSQL is different, when to use it, and when to use traditional RDBMSes instead Designing and implementing MongoDB databases of diverse types and sizes Storing and interacting with data via Java, PHP, Python, and Node.js/Mongoose Choosing the right NoSQL distribution model for your application Installing and configuring MongoDB Designing MongoDB data models, including collections, indexes, and GridFS Balancing consistency, performance, and durability Leveraging the immense power of Map-Reduce Administering, monitoring, securing, backing up, and repairing MongoDB databases Mastering advanced techniques such as sharding and replication Optimizing performance

Implementing IBM Software Defined Network for Virtual Environments

2014-09-04 O'Reilly Amazon

book

Per Ljungstrøm , Scott Irwin , Sangam Racherla , Pushkar Patil , Alessio M. Tarenzio , David Cain

data data-engineering IBM VMware

This IBM® Redbooks® publication shows how to integrate IBM Software Defined Network for Virtual Environments (IBM SDN VE) seamlessly within a new or existing data center. This book is aimed at pre- and post-sales support, targeting network administrators and other technical professionals that want to get an overview of this new and exciting technology, and see how it fits into the overall vision of a truly Software Defined Environment. It shows you all of the steps that are required to design, install, maintain, and troubleshoot the IBM SDN VE product. It also highlights specific, real-world examples that showcase the power and flexibility that IBM SDN VE has over traditional solutions with a legacy network infrastructure that is applied to virtual systems. This book assumes that you have a general familiarity with networking and virtualization. It does not assume an in-depth understanding of KVM or VMware. It is written for administrators who want to get a quick start with IBM SDN VE in their respective virtualized infrastructure, and to get some virtual machines up and running by using the rich features of the product in a short amount of time (days, not week, or months).

Pro Apache Hadoop, Second Edition

2014-09-03 O'Reilly Amazon

book

Madhu Siddalingaiah , Sameer Wadkar

data data-engineering Hadoop Big Data Cloud Computing HDFS

Pro Apache Hadoop, Second Edition brings you up to speed on Hadoop the framework of big data. Revised to cover Hadoop 2.0, the book covers the very latest developments such as YARN (aka MapReduce 2.0), new HDFS high-availability features, and increased scalability in the form of HDFS Federations. All the old content has been revised too, giving the latest on the ins and outs of MapReduce, cluster design, the Hadoop Distributed File System, and more. This book covers everything you need to build your first Hadoop cluster and begin analyzing and deriving value from your business and scientific data. Learn to solve big-data problems the MapReduce way, by breaking a big problem into chunks and creating small-scale solutions that can be flung across thousands upon thousands of nodes to analyze large data volumes in a short amount of wall-clock time. Learn how to let Hadoop take care of distributing and parallelizing your softwareyou just focus on the code; Hadoop takes care of the rest. Covers all that is new in Hadoop 2.0 Written by a professional involved in Hadoop since day one Takes you quickly to the seasoned pro level on the hottest cloud-computing framework

Oracle PL/SQL Performance Tuning Tips & Techniques

2014-08-29 O'Reilly Amazon

book

Michael Rosenblum , Paul Dorsey

data data-engineering SQL pl-sql pl/sql API

Proven PL/SQL Optimization Solutions In Oracle PL/SQL Performance Tuning Tips & Techniques, Oracle ACE authors with decades of experience building complex production systems for government, industry, and educational organizations present a hands-on approach to enabling optimal results from PL/SQL. The book begins by describing the discovery process required to pinpoint performance problems and then provides measurable and repeatable test cases. In-depth coverage of linking SQL and PL/SQL is followed by deep dives into essential Oracle Database performance tuning tools. Real-world examples and best practices are included throughout this Oracle Press guide. Follow a request-driven nine-step process to identify and address performance problems in web applications Use performance-related database tools, including data dictionary views, logging, tracing, PL/SQL Hierarchical Profiler, PL/Scope, and RUNSTATS Instrument code to pinpoint performance issues using call stack APIs, error stack APIs, and timing markers Embed PL/SQL in SQL and manage user-defined functions Embed SQL in PL/SQL using a set-based approach to handle large volumes of data Properly write and deploy data manipulation language triggers to avoid performance problems Work with advanced datatypes, including LOBs and XML Use caching techniques to avoid redundant operations Effectively use dynamic SQL to reduce the amount of code needed and streamline system management Manage version control and ensure that performance fixes are successfully deployed Code examples in the book are available for download.

Pro Couchbase Server

2014-08-27 O'Reilly Amazon

book

David Ostrovsky , Yaniv Rodenski

data data-engineering nosql-databases couchbase Data Modelling NoSQL

The NoSQL movement has fundamentally changed the database world in recent years. Influenced by the growing needs of web-scale applications, NoSQL databases such as Couchbase Server provide new approaches to scalability, reliability, and performance. With the power and flexibility of Couchbase Server, you can model your data however you want, and easily change the data model any time you want. Pro Couchbase Server is a hands-on guide for developers and administrators who want to take advantage of the power and scalability of Couchbase Server in their applications. This book takes you from the basics of NoSQL database design, through application development, to Couchbase Server administration. Never have document databases been so powerful and performant. Pro Couchbase Server shows what is possible and helps you take full advantage of Couchbase Server and all the performance and scalability that it offers. Helps you design and develop a document database using Couchbase Server. Takes you through deploying and maintaining Couchbase Server. Gives you the tools to scale out your application as needed.

Learning Neo4j

2014-08-25 O'Reilly Amazon

book

Rik Van Bruggen

data data-engineering graph-databases Neo4j Cloud Computing Data Modelling

Dive into the exciting world of graph databases with "Learning Neo4j". This book introduces you to the Neo4j graph database system, showing how graph theory can unlock new ways of organizing and querying complex datasets. Through practical examples, you will explore Neo4j's capabilities and learn to implement real-world applications using graph data models. What this Book will help me do Understand the fundamentals of graph theory and how it relates to databases. Install and set up the Neo4j graph database on local and cloud platforms. Model complex data for use in Neo4j and import various datasets into it. Implement real-world use cases, such as recommendation systems and social networks. Explore visualization tools and resources for enhancing graph database applications. Author(s) The author, None Van Bruggen, is a seasoned expert in data systems with extensive hands-on experience with Neo4j. Drawing from real-world expertise, they provide practical guidance, bridging theoretical concepts to practical utility seamlessly. None Van Bruggen's accessible writing style makes navigating the complexities of graph databases achievable and rewarding for learners. Who is it for? This book is ideal for IT professionals, database administrators, and data analysts looking to harness the power of graph databases. Readers should have a basic understanding of relational databases and data modeling concepts. Whether you're starting with Neo4j or seeking to deepen your knowledge, this book provides the guidance you need. It is particularly great for anyone interested in implementing graph data solutions in real-world scenarios.

talk-data.com

O'Reilly Data Engineering Books

Top Topics

Top Speakers

Guide to IBM PowerHA SystemMirror for AIX Version 7.1.3

Building 360-Degree Information Applications

Getting Started with Impala

Oracle BPM Suite 12c Modeling Patterns

Master Competitive Analytics with Oracle Endeca Information Discovery

Mastering MariaDB

Reliability and Performance with IBM DB2 Analytics Accelerator V4.1

DynamoDB Applied Design Patterns

I Heart Logs

MariaDB High Performance

WS-BPEL 2.0 Beginner's Guide

Practical Migration from x86 to Linux on IBM System z

Implementing IBM FlashSystem 840

SQL Server Query Performance Tuning,Fourth Edition

Using Flume

HL7 for BizTalk

IBM z/OS V2.1 DFSMS Technical Update

IBM DS8870 Architecture and Implementation

Professional Microsoft SQL Server 2014 Administration

Sams Teach Yourself NoSQL with MongoDB in 24 Hours

Implementing IBM Software Defined Network for Virtual Environments

Pro Apache Hadoop, Second Edition

Oracle PL/SQL Performance Tuning Tips & Techniques

Pro Couchbase Server

Learning Neo4j