talk-data.com

Topic

data-engineering

3377

tagged

Activities

Showing filtered results

Filtering by: O'Reilly Data Engineering Books

Architecting and Deploying DB2 with BLU Acceleration

IBM® DB2® with BLU Acceleration is a revolutionary technology that is delivered in DB2 for Linux, UNIX, and Windows Release 10.5. BLU Acceleration delivers breakthrough performance improvements for analytic queries by using dynamic in-memory columnar technologies. Unlike other vendor solutions, BLU Acceleration allows the unified computing of OLTP and analytics data inside a single database, thereby removing barriers and accelerating results for users. With observed hundredfold improvement in query response time, BLU Acceleration provides a simple, fast, and easy-to-use solution for the needs of today's organizations; quick access to business answers can be used to gain a competitive edge, lower costs, and more. This IBM Redbooks® publication introduces the concepts of DB2 with BLU Acceleration. It discusses the steps to move from a relational database to using BLU Acceleration, optimizing BLU usage, and deploying BLU into existing analytic solutions today, with an example of IBM Cognos®. This book also describes integration of DB2 with BLU Acceleration into SAP Business Warehouse (SAP BW) and SAP's near-line storage solution on DB2. This publication is intended to be helpful to a wide-ranging audience, including those readers who want to understand the technologies and those who have planning, deployment, and support responsibilities.

Evaluation Theory, Models, and Applications, 2nd Edition

The gold-standard evaluation reference text. Now in its second edition, Evaluation Theory, Models, and Applications is the vital text on evaluation models, perfect for classroom use as a textbook, and as a professional evaluation reference. The book begins with an overview of the evaluation field and program evaluation standards, and proceeds to cover the most widely used evaluation approaches. With new evaluation designs and the inclusion of the latest literature from the field, this Second Edition is an essential update for professionals and students who want to stay current. Understanding and choosing evaluation approaches is critical to many professions, and Evaluation Theory, Models, and Applications, Second Edition is the benchmark evaluation guide. Authors Daniel L. Stufflebeam and Chris L. S. Coryn, widely considered experts in the evaluation field, introduce and describe 23 program evaluation approaches, including, new to this edition, transformative evaluation, participatory evaluation, consumer feedback, and meta-analysis. Evaluation Theory, Models, and Applications, Second Edition facilitates the process of planning, conducting, and assessing program evaluations. The highlighted evaluation approaches include experimental and quasi-experimental design evaluations; Daniel L. Stufflebeam's CIPP Model; Michael Scriven's Consumer-Oriented Evaluation; Michael Patton's Utilization-Focused Evaluation; Robert Stake's Responsive/Stakeholder-Centered Evaluation; and Case Study Evaluation. Key readings listed at the end of each chapter direct readers to the most important references for each topic. Learning objectives, review questions, student exercises, and instructor support materials complete the collection of tools. Choosing from evaluation approaches can be an overwhelming process, but Evaluation Theory, Models, and Applications, Second Edition updates the core evaluation concepts with the latest research, making this complex field accessible in just one book.

Key Management Deployment Guide: Using the IBM Enterprise Key Management Foundation

In an increasingly interconnected world, data breaches grab headlines. The security of sensitive information is vital, and new requirements and regulatory bodies such as the Payment Card Industry Data Security Standard (PCI-DSS), Health Insurance Portability and Accountability Act (HIPAA), and Sarbanes-Oxley (SOX) create challenges for enterprises that use encryption to protect their information. As encryption becomes more widely adopted, organizations also must contend with an ever-growing set of encryption keys. Effective management of these keys is essential to ensure both the availability and security of the encrypted information. Centralized management of keys and certificates is necessary to perform the complex tasks that are related to key and certificate generation, renewal, and backup and recovery. The IBM® Enterprise Key Management Foundation (EKMF) is a flexible and highly secure key management system for the enterprise. It provides centralized key management on IBM zEnterprise® and distributed platforms for streamlined, efficient, and secure key and certificate management operations. This IBM Redbooks® publication introduces key concepts around a centralized key management infrastructure and depicts the proper planning, implementation, and management of such a system using the IBM Enterprise Key Management Foundation solution.

Building Applications with iBeacon

High-precision location information is increasingly useful for mobile application developers, since it allows devices to interact with the world around them. This practical book shows you how to achieve arm’s reach accuracy with iBeacons, simple transmitters that enable your applications to react to nearby surroundings and then deliver timely, relevant information—especially indoors, where GPS and cell service are inaccurate.

Microsoft SQL Server 2014 Query Tuning & Optimization

Optimize Microsoft SQL Server 2014 queries and applications. Microsoft SQL Server 2014 Query Tuning & Optimization is filled with ready-to-use techniques for creating high-performance queries and applications. The book describes the inner workings of the query processor so you can write better queries and provide the query processor with the quality information it needs to produce efficient execution plans. You’ll also get tips for troubleshooting underperforming queries. In-Memory OLTP (Hekaton), a key new feature of SQL Server 2014, is fully covered in this practical guide. Understand how the query optimizer works. Troubleshoot queries using extended events, SQL trace, dynamic management views (DMVs), the data collector, and other tools. Work with query operators for data access, joins, aggregations, parallelism, and updates. Speed up queries and dramatically improve application performance by creating the right indexes. Understand statistics and how to detect and fix cardinality estimation errors. Maximize OLTP query performance using In-Memory OLTP (Hekaton) features, including memory-optimized tables and natively compiled stored procedures. Monitor and promote plan caching and reuse to improve application performance. Improve the performance of data warehouse queries using columnstore indexes. Handle query processor limitations with hints and other methods.

PostgreSQL Administration Essentials

PostgreSQL Administration Essentials is your practical guide to effectively managing your PostgreSQL databases with confidence. In this book, you will learn techniques for backups and recovery, performance tuning, and replication management, and gain valuable insights into database monitoring and administration, all tailored to ensure the smooth operation and optimization of your database systems.
What this Book will help me do: Back up and restore PostgreSQL databases efficiently to prevent data loss. Set up and manage PostgreSQL replication to ensure high availability. Monitor and analyze database performance to maintain an optimized environment. Handle user permissions securely to manage database access effectively. Detect performance bottlenecks and optimize indexes for faster queries.
Author(s): Hans-Jürgen Schönig is a seasoned database professional with extensive experience working with PostgreSQL in production environments. Having provided consulting and training services in the database field, Hans shares deep insights into practical database administration techniques. Through his books and courses, he focuses on making complex topics accessible and actionable for professionals.
Who is it for? This book is ideal for database administrators who want to quickly gain practical PostgreSQL skills. Database developers or project managers aiming to understand database administration and optimization will find this resource valuable. Beginners wishing to familiarize themselves with PostgreSQL basics to advance their careers will also benefit greatly. For those already experienced in other database platforms, this book provides insights to transition into using PostgreSQL effectively.

IBM System Storage SAN Volume Controller and Storwize V7000 Best Practices and Performance Guidelines

This IBM® Redbooks® publication captures several of the preferred practices that are based on field experience and describes the performance gains that can be achieved by implementing the IBM System Storage® SAN Volume Controller and Storwize® V7000 V7.2. This book begins with a look at the latest developments with SAN Volume Controller and Storwize V7000 and reviews the changes in the previous versions of the product. It highlights configuration guidelines and preferred practices for the storage area network (SAN) topology, clustered system, back-end storage, storage pools and managed disks, volumes, remote copy services, and hosts. Then, this book provides performance guidelines for SAN Volume Controller, back-end storage, and applications. It explains how you can optimize disk performance with the IBM System Storage Easy Tier® function. Next, it provides preferred practices for monitoring, maintaining, and troubleshooting SAN Volume Controller and Storwize V7000. Finally, this book highlights several scenarios that demonstrate the preferred practices and performance guidelines.

IBM PowerKVM Configuration and Use

This IBM® Redbooks® publication presents the new IBM PowerKVM virtualization for scale-out Linux systems. This book describes the concepts of PowerKVM and how you can deploy your virtual machines with the software stack included in the product. It helps you install and configure PowerKVM on your Power System server and provides guidance for managing the supported virtualization features by using the Web interface and command-line interface (CLI). This information is for professionals who want to acquire a better understanding of PowerKVM virtualization technology to optimize Linux workload consolidation and use the new POWER8 processor features. The intended audience also includes people in these roles: clients, sales and marketing professionals, technical support professionals, IBM Business Partners, independent software vendors, the open source community, and IBM OpenPower™ partners. This publication does not replace the latest marketing materials and configuration tools. It is intended as an additional source of information that, together with existing sources, can be used to enhance your knowledge of IBM virtualization solutions. Before you start reading, you must be familiar with the general concepts of kernel-based virtual machine (KVM), Linux, and IBM Power architecture.

IBM MQ V8 Features and Enhancements

The power of IBM® MQ is its flexibility combined with reliability, scalability, and security. This flexibility provides a large number of design and implementation choices. Making informed decisions from this range of choices can simplify the development of applications and the administration of an MQ messaging infrastructure. Applications that access such an infrastructure can be developed using a wide range of programming paradigms and languages. These applications can run within a substantial array of software and hardware environments. Customers can use IBM MQ to integrate and extend the capabilities of existing and varied infrastructures in the information technology (IT) system of a business. IBM MQ V8.0 was released in June 2014. Before that release, the product name was IBM WebSphere® MQ. This IBM Redbooks® publication covers the core enhancements made in IBM MQ V8 and the concepts that must be understood. A broad understanding of the product features is key to making informed design and implementation choices for both the infrastructure and the applications that access it. Details of new areas of function for IBM MQ are introduced throughout this book, such as the changes to security, publish/subscribe clusters, and IBM System z exploitation. This book is for individuals and organizations who make informed decisions about design and applications before implementing an IBM MQ infrastructure or beginning development of an IBM MQ application.

Hadoop in Practice, Second Edition

Hadoop in Practice, Second Edition provides over 100 tested, instantly useful techniques that will help you conquer big data using Hadoop. This revised new edition covers changes and new features in the Hadoop core architecture, including MapReduce 2. Brand new chapters cover YARN and integrating Kafka, Impala, and Spark SQL with Hadoop. You'll also get new and updated techniques for Flume, Sqoop, and Mahout, all of which have seen major new versions recently. In short, this is the most practical, up-to-date coverage of Hadoop available anywhere.
About the Book: It's always a good time to upgrade your Hadoop skills! Hadoop in Practice, Second Edition provides a collection of 104 tested, instantly useful techniques for analyzing real-time streams, moving data securely, machine learning, managing large-scale clusters, and taming big data using Hadoop. This completely revised edition covers changes and new features in Hadoop core, including MapReduce 2 and YARN. You'll pick up hands-on best practices for integrating Spark, Kafka, and Impala with Hadoop, and get new and updated techniques for the latest versions of Flume, Sqoop, and Mahout. In short, this is the most practical, up-to-date coverage of Hadoop available. Readers need to know a programming language like Java and have basic familiarity with Hadoop.
What's Inside: Thoroughly updated for Hadoop 2. How to write YARN applications. Integrate real-time technologies like Storm, Impala, and Spark. Predictive analytics using Mahout and RR.
About the Author: Alex Holmes works on tough big-data problems. He is a software engineer, author, speaker, and blogger specializing in large-scale Hadoop projects.
Quotes:
Very insightful. A deep dive into the Hadoop world. - Andrea Tarocchi, Red Hat, Inc.
The most complete material on Hadoop and its ecosystem known to mankind! - Arthur Zubarev, Vital Insights
Clear and concise, full of insights and highly applicable information. - Edward de Oliveira Ribeiro, DataStax, Inc.
Comprehensive up-to-date coverage of Hadoop 2. - Muthusamy Manigandan, OzoneMedia

Guide to IBM PowerHA SystemMirror for AIX Version 7.1.3

This IBM® Redbooks® publication for IBM Power Systems™ with IBM PowerHA® SystemMirror® Standard and Enterprise Editions (hardware, software, practices, reference architectures, and tools) documents a well-defined deployment model within an IBM Power Systems environment. It guides you through a planned foundation for a dynamic infrastructure for your enterprise applications. This information is for technical consultants, technical support staff, IT architects, and IT specialists who are responsible for providing high availability and support for the IBM PowerHA SystemMirror Standard and Enterprise Editions on IBM POWER® systems.

Building 360-Degree Information Applications

Today's businesses, applications, social media, and online transactions generate more data than ever before. This data can be explored and analyzed to provide tremendous business value. IBM® Watson™ Explorer and IBM InfoSphere® Master Data Management (InfoSphere MDM) enable organizations to simultaneously explore and derive insights from enterprise data that was traditionally stored in "silos" in enterprise applications, different data repositories, and in different data formats. This IBM Redbooks® publication provides information about Watson Explorer 9.0, InfoSphere MDM, and IBM InfoSphere MDM Probabilistic Matching Engine for InfoSphere BigInsights™ (PME for BigInsights). It gives you an overview, describes the architecture, and presents use cases that you can use to accomplish the following tasks: Understand the core capabilities of Watson Explorer, InfoSphere MDM, and PME for BigInsights. Realize the full potential of Watson Explorer applications. Describe the integration and value of the combination of Watson Explorer and InfoSphere MDM. Build a 360-degree information application. Learn by example by following hands-on lab scenarios.

Getting Started with Impala

Learn how to write, tune, and port SQL queries and other statements for a Big Data environment, using Impala, the massively parallel processing SQL query engine for Apache Hadoop. The best practices in this practical guide help you design database schemas that not only interoperate with other Hadoop components, and are convenient for administrators to manage and monitor, but also accommodate future expansion in data size and evolution of software capabilities. Written by John Russell, documentation lead for the Cloudera Impala project, this book gets you working with the most recent Impala releases quickly. Ideal for database developers and business analysts, the latest revision covers analytics functions, complex types, incremental statistics, subqueries, and submission to the Apache incubator. Getting Started with Impala includes advice from Cloudera’s development team, as well as insights from its consulting engagements with customers. Learn how Impala integrates with a wide range of Hadoop components. Attain high performance and scalability for huge data sets on production clusters. Explore common developer tasks, such as porting code to Impala and optimizing performance. Use tutorials for working with billion-row tables, date- and time-based values, and other techniques. Learn how to transition from rigid schemas to a flexible model that evolves as needs change. Take a deep dive into joins and the roles of statistics.

Oracle BPM Suite 12c Modeling Patterns

Dive deep into Oracle BPM Suite 12c with this comprehensive guide on workflow automation and modeling patterns. This book equips you with practical knowledge on designing and implementing intricate Business Process Management solutions using Oracle BPM Suite 12c, making complex processes harmonious and effective.
What this Book will help me do: Master advanced flow control and branching patterns in Oracle BPM Suite 12c. Implement human workflow integration for better interaction within BPM applications. Learn effective handling of process correlation and exceptions for robust solutions. Explore adaptive case management for handling unpredictable business needs. Understand the usage of predictive analysis and KPIs in business processes.
Author(s): Acharya is a seasoned Oracle BPM and IT solutions expert. With years of practical experience in enterprise process management and cloud-based application development, Acharya combines technical depth with a passion for sharing knowledge. Their approach to writing ensures a balance of theory and actionable insights.
Who is it for? This book is designed for enterprise and solution architects, developers, process analysts, and technical or functional consultants with a focus on process modeling and enterprise IT solutions. Intended readers should have a foundational understanding of BPM concepts. By reading this book, you'll enhance your ability to work with Oracle BPM Suite 12c in professional environments. Ideal for those seeking to deepen their expertise in BPM and workflow automation.

Master Competitive Analytics with Oracle Endeca Information Discovery

Oracle Endeca Information Discovery Best Practices. Maximize the powerful capabilities of this self-service enterprise data discovery platform. Master Competitive Analytics with Oracle Endeca Information Discovery reveals how to unlock insights from any type of data, regardless of structure. The first part of the book is a complete technical guide to the product's architecture, components, and implementation. The second part presents a comprehensive collection of business analytics use cases in various industries, including financial services, healthcare, research, manufacturing, retail, consumer packaged goods, and public sector. Step-by-step instructions on implementing some of these use cases are included in this Oracle Press book. Install and manage Oracle Endeca Server. Design Oracle Endeca Information Discovery Studio visualizations to facilitate user-driven data exploration and discovery. Enable enterprise-driven data exploration with Oracle Endeca Information Discovery Integrator. Develop and implement a fraud detection and analysis application. Build a healthcare correlation application that integrates claims, patient, and operations analysis; partners; clinical research; and remote monitoring. Use an enterprise architecture approach to incrementally establish big data and analytical capabilities.

Mastering MariaDB

In 'Mastering MariaDB', you'll explore advanced techniques for managing, optimizing, and maintaining your MariaDB database servers. This book teaches you how to analyze query performance, implement best practices for security, and ensure data durability and availability through robust configurations and procedures.
What this Book will help me do: Learn to analyze query performance to optimize database efficiency. Gain expertise in implementing secure and organized role-based access controls. Achieve effective database backups with reliable recovery strategies. Set up replication environments to enable high availability of data. Explore clustering and sharding techniques for powerful scalability solutions.
Author(s): Razzoli is an experienced database administrator and author, specializing in MariaDB and other open-source database solutions. With an extensive career in database optimization and troubleshooting, Razzoli has helped businesses achieve greater performance and reliability with their systems. As a writer, they focus on clear, precise instruction and practical, real-world applications of database technology.
Who is it for? This book is ideal for database administrators and system engineers who are already familiar with the basics of MariaDB and want to elevate their expertise. Intermediate MariaDB users looking to optimize their systems, ensure data security, and set up advanced configurations will find this book immensely helpful. It's written for professionals who actively manage database systems and require advanced knowledge to improve performance and reliability. If you're seeking deeper insights into MariaDB's features and scalability techniques, this book is for you.

Reliability and Performance with IBM DB2 Analytics Accelerator V4.1

The IBM® DB2® Analytics Accelerator for IBM z/OS® is a high-performance appliance that integrates the IBM zEnterprise® infrastructure with IBM PureData™ for Analytics, powered by IBM Netezza® technology. With this integration, you can accelerate data-intensive and complex queries in a DB2 for z/OS highly secure and available environment. DB2 and the Analytics Accelerator appliance form a self-managing hybrid environment running online transaction processing and online transactional analytical processing concurrently and efficiently. These online transactions run together with business intelligence and online analytic processing workloads. DB2 Analytics Accelerator V4.1 expands the value of high-performance analytics. DB2 Analytics Accelerator V4.1 opens to static Structured Query Language (SQL) applications and row set processing, minimizes data movement, reduces latency, and improves availability. This IBM Redbooks® publication provides technical decision-makers with an understanding of the benefits of version 4.1 of the Analytics Accelerator with DB2 11 for z/OS. It describes the installation of the new functions, and the advantages to existing analytical processes as measured in our test environment. This book also introduces the DB2 Analytics Accelerator Loader V1.1, a tool that facilitates the data population of the DB2 Analytics Accelerator.

DynamoDB Applied Design Patterns

In "DynamoDB Applied Design Patterns", you'll dive deep into the effective design patterns that optimize the performance of applications using DynamoDB. Through practical examples and best practices, this guide empowers developers to create scalable, efficient, and robust DynamoDB implementations.
What this Book will help me do: Master how to design effective data models using DynamoDB's native features such as tables, attributes, and indexes. Learn to utilize DynamoDB features like global and local secondary indexes to optimize performance. Gain in-depth knowledge on managing and querying DynamoDB using AWS services and tools. Integrate DynamoDB seamlessly with AWS services such as Redshift, S3, and MapReduce. Leverage advanced DynamoDB API features to retrieve data efficiently for diverse application use cases.
Author(s): Uchit Hamendra Vyas is a highly skilled professional specializing in AWS and cloud computing. With years of experience as a developer and architect, he brings practical insights into designing efficient database solutions. His approachable teaching style makes complex topics clear and accessible.
Who is it for? This book is designed for developers working with or interested in using DynamoDB in their projects. It assumes a moderate familiarity with database design and AWS concepts. Readers aiming to enhance their DynamoDB skills and optimize performance will greatly benefit. If you're looking to take your NoSQL database knowledge to the next level, this book is for you.

I Heart Logs

Why a book about logs? That’s easy: the humble log is an abstraction that lies at the heart of many systems, from NoSQL databases to cryptocurrencies. Even though most engineers don’t think much about them, this short book shows you why logs are worthy of your attention. Based on his popular blog posts, LinkedIn principal engineer Jay Kreps shows you how logs work in distributed systems, and then delivers practical applications of these concepts in a variety of common uses: data integration, enterprise architecture, real-time stream processing, data system design, and abstract computing models. Go ahead and take the plunge with logs; you’re going to love them. Learn how logs are used for programmatic access in databases and distributed systems. Discover solutions to the huge data integration problem when more data of more varieties meet more systems. Understand why logs are at the heart of real-time stream processing. Learn the role of a log in the internals of online data systems. Explore how Jay Kreps applies these ideas to his own work on data infrastructure systems at LinkedIn.

MariaDB High Performance

Learn how to optimize your MariaDB installations for performance, scalability, and reliability. This comprehensive guide teaches you advanced replication techniques, clustering, sharding data, and optimizing engines. By the end of the book, you'll have the tools to build complex, high-performing database infrastructures.
What this Book will help me do: Master the setup of advanced replication models such as master/slave and dual-master setups. Implement a Galera Cluster to enhance your database's write scalability and reliability. Configure the Spider engine for effective data sharding across multiple nodes. Identify and mitigate performance bottlenecks through in-depth engine optimization. Deploy and maintain a disaster recovery solution using advanced database strategies.
Author(s): Pierre Mavro is an experienced database administrator with years of expertise in MySQL and MariaDB systems. Having worked on numerous high-scale projects, he brings practical insights into creating robust, high-performance database solutions. Pierre enjoys sharing his knowledge and helping developers and administrators navigate complex database challenges.
Who is it for? This book is perfect for database administrators and system architects who already have working experience with MariaDB or MySQL. If you're looking to enhance your skills and develop solutions for high traffic applications, this book is for you. Ideal for those comfortable with Linux-based infrastructures and eager to learn. Whether maintaining high-availability systems or scaling databases, you'll find this work essential.