talk-data.com talk-data.com

Topic

data-engineering

3377

tagged

Activity Trend

1 peak/qtr
2020-Q1 2026-Q1

Activities

Showing filtered results

Filtering by: O'Reilly Data Engineering Books ×
RabbitMQ Essentials

RabbitMQ Essentials provides a comprehensive introduction to RabbitMQ, a versatile message broker solution frequently used in distributed systems. Through a clear set of examples and practical exercises, readers will gain the skills needed to start building robust messaging applications. What this Book will help me do Understand the core concepts and architecture of RabbitMQ. Learn how to set up RabbitMQ and develop messaging applications. Master sending, receiving, and routing messages effectively with RabbitMQ exchanging types such as direct, topic, and fanout. Implement robust error handling and monitoring mechanisms for RabbitMQ implementations. Create intelligent messaging workflows and scalable systems leveraging RabbitMQ's advanced features. Author(s) David Dossot is a seasoned software architect and developer with deep expertise in distributed systems and messaging solutions. With years of practical experience, David brings clarity and actionable insight to the complex subject of message-oriented middleware. His goal is to help technologists bridge the gap between theory and implementation. Who is it for? This book is intended for developers interested in learning how to utilize RabbitMQ for building modern messaging applications. It's suitable for readers with basic programming knowledge who want to deepen their understanding of distributed systems. Ideal for professionals seeking to implement message queuing solutions in their systems or enhance existing setups. The book progresses from beginner-friendly explanations to more advanced topics, making it a perfect choice for developers at any stage of learning RabbitMQ.

Geoinformation, 2nd Edition

Written by a renowned expert, Geoinformation: Remote Sensing, Photogrammetry and Geographic Information Systems, Second Edition gives you an overarching view of how remote sensing, photogrammetry, and geographic information systems work together in an interdisciplinary manner. The book presents the required basic background of the geoinformatics concept in which the different methodologies must be combined. It details the principal components of remote sensing, from theoretical principles to advanced image analysis and interpretation techniques, sensor components, and operating platforms. New and Updated in the Second Edition: Web-based image viewing with Google Earth Aerial platforms Existing digital photogrammetric software systems, including Intergraph image station, Autodesk, and Oracle Spatial Land management and cadaster Imaging sensors such as laser scanning, image spectrometry, radar imaging, and radar interferometry With the advent of high-resolution satellite systems in stereo, the theory of analytical photogrammetry restituting 2D image information into 3D is of increasing importance, merging the remote sensing approach with that of photogrammetry. This text describes the fundamentals of these approaches in detail, with an emphasis on global, regional, and local applications. It provides a short introduction to the GPS satellite positioning system in the context of data integration. An extensive overview of the basic elements of GIS technologies and data management approaches, as well as the widely employed positioning systems such as GPS and GSM networks, complete the presentation of the technological framework for geoinformation. Appropriate for GIS courses at all levels, the book proceeds beyond the science and technology to tackle cost considerations and practical implementation issues, giving you a starting point for multidisciplinary new activities and services in the future.

IBM i and IBM Storwize Family: A Practical Guide to Usage Scenarios

The use of external storage and the benefits of virtualization became a topic of discussion in the IBM® i area during the last several years. The question tends to be, what are the advantages of the use of external storage that is attached to an IBM i environment as opposed to the use of internal storage. The use of IBM PowerVM® virtualization technology to virtualize Power server processors and memory also became common in IBM i environments. However, virtualized access to external storage and network resources by using a VIO server is still not widely used. This IBM Redbooks® publication gives a broad overview of the IBM Storwize® family products and their features and functions. It describes the setup that is required on the storage side and describes and positions the different options for attaching IBM Storwize family products to an IBM i environment. Basic setup and configuration of a VIO server specifically for the needs of an IBM i environment is also described. In addition, different configuration options for a combined setup of IBM PowerHA® SystemMirror® for i and the Storwize family products are described and positioned against each other. Detailed examples are provided for the setup process that is required for these environments. The information that is provided in this book is useful for clients, IBM Business Partners, and IBM service professionals who need to understand how to install and configure their IBM i environment with attachment to the Storwize family products.

Implementing the IBM Storwize V7000 Unified Disk System

This IBM® Redbooks® publication introduces the IBM Storwize® V7000 Unified Disk System, a virtualized storage system that consolidates block and file workloads into a single storage system. Advantages include simplicity of management, reduced cost, highly scalable capacity, performance, and high availability. It also offers improved efficiency and flexibility through built-in solid-state drive optimization, thin provisioning, IBM Real-time Compression™, We suggest that you familiarize yourself with the following Redbooks publications to get the most from this book: Implementing the IBM Storwize V7000 V6.3, SG24-7938 Implementing the IBM System Storage SAN Volume Controller V6.3, SG24-7933 Real-time Compression in SAN Volume Controller and Storwize V7000, REDP-4859 SONAS Implementation and Best Practices Guide, SG24-7962 SONAS Concepts, Architecture, and Planning Guide, SG24-7963

Professional Microsoft SQL Server 2014 Integration Services

Fill the gap between planning and doing with SSIS 2014 The 2014 release of Microsoft's SQL Server Integration Services provides enhancements for managing extraction, transformation, and load operations, plus expanded in-memory capabilities, improved disaster recovery, increased scalability, and much more. The increased functionality will streamline your ETL processes and smooth out your workflow, but the catch is that your workflow must change. New tools come with new best practices, and Professional Microsoft SQL Server 2014 Integration Services will keep you ahead of the curve. SQL Server MVP Brian Knight is the most respected name in the business, and your ultimate guide to navigating the changes to use Microsoft SQL Server Integration Services 2014 to your utmost advantage. Implement new best practices for effective use of SSIS Work through tutorials for hands-on learning of complex techniques Read case studies that illustrate the more advanced concepts Learn directly from the foremost authority on SSIS SQL Server Integration Services is a complex tool, but it's the lifeblood of your work. You need to know it inside out, and you must understand the full potential of its capabilities in order to use it effectively. You need to make sure the right architecture is in place. Professional Microsoft SQL Server 2014 Integration Services is your roadmap to understanding SSIS on a fundamental level, and setting yourself up for success.

Implementing IBM FlashSystem 840

Almost all technological components in the data center are getting faster; central processing units, network, storage area networks (SAN), and memory. All of them have improved their speed by a minimum of 10X; some of them by 100X, for example, data networks. However, spinning disk performance has only increased by 1.2 times. Fortunately, flash storage has come along and closed the gap between the other improvements in server technology and storage. Using the IBM® FlashSystem™ 840 provides value that goes beyond those benefits that are seen on disk-based arrays. These benefits include better user experience, server and application consolidation, development cycle reduction, application scalability, data center footprint savings, and improved price performance economics. This IBM Redbooks® publication introduces clients to the newest member of the IBM FlashSystem™ products: the IBM FlashSystem 840. It will provide in-depth knowledge of the product architecture, software and hardware, its implementation, and hints and tips. Also illustrated are use cases that show real-world solutions for tiering, flash-only, and preferred read, as well as examples of the benefits gained by integrating the FlashSystem storage into business environments. This book is intended for pre-sales and post-sales technical support professionals and storage administrators, and for anyone who wants to understand and learn how to implement this new and exciting technology.

MySQL High Availability, 2nd Edition

Server bottlenecks and failures are a fact of life in any database deployment, but they don’t have to bring everything to a halt. This practical book explains replication, cluster, and monitoring features that can help protect your MySQL system from outages, whether it’s running on hardware, virtual machines, or in the cloud.

Pig Design Patterns

Discover how to simplify Hadoop programming with Pig Design Patterns, helping you create innovative enterprise-level big data solutions. This book takes you step-by-step through practical design patterns for creating efficient data processing workflows with Apache Pig. What this Book will help me do Understand and implement fundamental data processing patterns with Pig. Master advanced Pig techniques for Big Data analytics. Learn to optimize Pig scripts for performance and scalability. Build end-to-end data processing solutions with real-world examples. Integrate Pig workflows into the broader Hadoop ecosystem. Author(s) Pradeep Pasupuleti is an experienced data engineer and software developer specializing in Big Data technologies. With extensive expertise in Hadoop and Pig, Pradeep shares valuable insights and practical techniques beginners and experts alike will appreciate. Who is it for? This book is perfect for software developers and data engineers working with Hadoop who want to streamline their workflow. It is ideal for professionals already familiar with Pig and Hadoop basics looking to advance. It also suits learners aiming to implement optimized data solutions effectively.

IBM Platform Computing Solutions Reference Architectures and Best Practices

This IBM® Redbooks® publication demonstrates and documents that the combination of IBM System x®, IBM GPFS™, IBM GPFS-FPO, IBM Platform Symphony®, IBM Platform HPC, IBM Platform LSF®, IBM Platform Cluster Manager Standard Edition, and IBM Platform Cluster Manager Advanced Edition deliver significant value to clients in need of cost-effective, highly scalable, and robust solutions. IBM depth of solutions can help the clients plan a foundation to face challenges in how to manage, maintain, enhance, and provision computing environments to, for example, analyze the growing volumes of data within their organizations. This IBM Redbooks publication addresses topics to educate, reiterate, confirm, and strengthen the widely held opinion of IBM Platform Computing as the systems software platform of choice within an IBM System x environment for deploying and managing environments that help clients solve challenging technical and business problems. This IBM Redbooks publication addresses topics to that help answer customer’s complex challenge requirements to manage, maintain, and analyze the growing volumes of data within their organizations and provide expert-level documentation to transfer the how-to-skills to the worldwide support teams. This IBM Redbooks publication is targeted toward technical professionals (consultants, technical support staff, IT Architects, and IT Specialists) who are responsible for delivering cost-effective computing solutions that help optimize business results, product development, and scientific discoveries.

Leveraging the IBM BPM Coach Framework in Your Organization

The IBM® Coach Framework is a key element of the IBM Business Process Manager (BPM) product suite. With the Coach Framework, process authors can create and maintain custom web-based user interfaces that are embedded within their business process solutions. This ability to create and maintain custom user interfaces is a key factor in the successful deployment of business process solutions. Coaches have proven to be an extremely powerful element of IBM BPM solutions, and with the release of IBM BPM version 8.0 they were rejuvenated to incorporate the recent advances in browser-based user interfaces. This IBM Redbooks® publication focuses on the capabilities that Coach Framework delivers with IBM BPM version 8.5, but much of what is shared in these pages continues to be of value as IBM evolves coaches in the future. This book has been produced to help you fully benefit from the power of the Coach Framework.

Hadoop For Dummies

Let Hadoop For Dummies help harness the power of your data and rein in the information overload Big data has become big business, and companies and organizations of all sizes are struggling to find ways to retrieve valuable information from their massive data sets with becoming overwhelmed. Enter Hadoop and this easy-to-understand For Dummies guide. Hadoop For Dummies helps readers understand the value of big data, make a business case for using Hadoop, navigate the Hadoop ecosystem, and build and manage Hadoop applications and clusters. Explains the origins of Hadoop, its economic benefits, and its functionality and practical applications Helps you find your way around the Hadoop ecosystem, program MapReduce, utilize design patterns, and get your Hadoop cluster up and running quickly and easily Details how to use Hadoop applications for data mining, web analytics and personalization, large-scale text processing, data science, and problem-solving Shows you how to improve the value of your Hadoop cluster, maximize your investment in Hadoop, and avoid common pitfalls when building your Hadoop cluster From programmers challenged with building and maintaining affordable, scaleable data systems to administrators who must deal with huge volumes of information effectively and efficiently, this how-to has something to help you with Hadoop.

Anonymous Communication Networks

This book examines anonymous communication networks as a solution to Internet privacy concerns. It explores various anonymous communication networks as possible solutions to Internet privacy concerns and identifies specific scenarios where it is best to remain anonymous. The text details the two main approaches to anonymous communication networks: onion routing and mixed networks. Using examples and case studies, it illustrates the usefulness of anonymous communication networks for web browsing, email, e-banking, and e-voting. It also includes guidance to help readers download and install Tor, I2P, JAP/JonDo, and QuickSilver.

IBM Tivoli Storage Productivity Center V5.1 Technical Guide

IBM® Tivoli® Storage Productivity Center V5.1 products offer storage infrastructure management that helps optimize storage management by centralizing, simplifying, automating, and optimizing storage tasks associated with storage systems, data disaster recovery, storage networks, and capacity management. IBM Tivoli Storage Productivity Center V5.1 products include: IBM Tivoli Storage Productivity Center V5.1 IBM Tivoli Storage Productivity Center Select Edition V5.1 Tivoli Storage Productivity Center Select Edition V5.1 offers the same features as Tivoli Storage Productivity Center V5.1 but at attractive entry-level pricing for operations with smaller capacities. It is licensed per storage device, such as disk controllers and their respective expansion units. This IBM Redbooks® publication is intended for storage administrators and users who are installing and using the features and functions in IBM Tivoli Storage Productivity Center V5.1. The information in this book can be used to plan for, install, and customize the components of Tivoli Storage Productivity Center in your storage infrastructure.

Beginning Hibernate, Third Edition

Beginning Hibernate, Third Edition is ideal if you're experienced in Java with databases (the traditional, or "connected," approach), but new to open-source, lightweight Hibernate, a leading object-relational mapping and database-oriented application development framework. This book packs in information about the release of the Hibernate 4.x persistence layer and provides a clear introduction to the current standard for object-relational persistence in Java. And since the book keeps its focus on Hibernate without wasting time on nonessential third-party tools, you'll be able to immediately start building transaction-based engines and applications. Experienced authors Joseph Ottinger with Dave Minter and Jeff Linwood provide more in-depth examples than any other book for Hibernate beginners. The authors also present material in a lively, example-based manner—not a dry, theoretical, hard-to-read fashion. What you'll learn How to build enterprise Java-based transaction-type applications that access complex data with Hibernate How to work with Hibernate 4 Where to integrate into the persistence life cycle How to map using annotations, Hibernate XML files, and more How to search and query with the new version of Hibernate How to integrate with MongoDB using NoSQL Who this book is for This book is for Java developers who want to learn about Hibernate.

Think Bigger
Big data--the enormous amount of data that is created as virtually every movement, transaction, and choice we make becomes digitized--is revolutionizing business. Offering real-world insight and explanations, this book provides a roadmap for organizations looking to develop a profitable big data strategy...and reveals why it's not something they can leave to the I.T. department.

Sharing best practices from companies that have implemented a big data strategy including Walmart, InterContinental Hotel Group, Walt Disney, and Shell, Think Bigger covers the most important big data trends affecting organizations, as well as key technologies like Hadoop and MapReduce, and several crucial types of analyses. In addition, the book offers guidance on how to ensure security, and respect the privacy rights of consumers. It also examines in detail how big data is impacting specific industries--and where opportunities can be found.

Big data is changing the way businesses--and even governments--are operated and managed. Think Bigger is an essential resource for anyone who wants to ensure that their company isn't left in the dust.

IBM zEnterprise System Technical Introduction

In a smarter planet, information-centric processes are exploding in growth. The mainframe has always been the IT industry's leading platform for transaction processing, consolidated and secure data serving, and support for available enterprise-wide applications. IBM® has extended the mainframe platform to help large enterprises reshape their client experiences through information-centric computing and to deliver on key business initiatives. IBM zEnterprise® is recognized as the most reliable and trusted system, and the most secure environment for core business operations. The new zEnterprise System consists of the IBM zEnterprise EC12 (zEC12) or IBM zEnterprise BC12 (zBC12), the IBM zEnterprise Unified Resource Manager, and the IBM zEnterprise IBM BladeCenter® Extension (zBX) Model 003. This IBM Redbooks® publication describes the zEC12 and zBC12, with their improved scalability, performance, security, resiliency, availability, and virtualization. The zEnterprise System has no peer as a trusted platform that also provides the most efficient transaction processing and database management. With efficiency at scale delivering significant cost savings on core processes, resources can be freed up to focus on developing new services to drive growth. This book provides a technical overview of the zEC12, zBC12, zBX Model 003, and Unified Resource Manager. This publication is intended for IT managers, architects, consultants, and anyone else who wants to understand the elements of the zEnterprise System. For this introduction to the zEnterprise System, readers are not expected to be familiar with current IBM System z® technology and terminology.

Responsive Mobile User Experience Using MQTT and IBM MessageSight

IBM® MessageSight is an appliance-based messaging server that is optimized to address the massive scale requirements of machine-to-machine (m2m) and mobile user scenarios. IBM MessageSight makes it easy to connect mobile customers to your existing messaging enterprise system, enabling a substantial number of remote clients to be concurrently connected. The MQTT protocol is a lightweight messaging protocol that uses publish/subscribe architecture to deliver messages over low bandwidth or unreliable networks. A publish/subscribe architecture works well for HTML5, native, and hybrid mobile applications by removing the wait time of a request/response model. This creates a better, richer user experience. The MQTT protocol is simple, which results in a client library with a low footprint. MQTT was proposed as an Organization for the Advancement of Structured Information Standards (OASIS) standard. This book provides information about version 3.1 of the MQTT specification. This IBM Redbooks® publication provides information about how IBM MessageSight, in combination with MQTT, facilitates the expansion of enterprise systems to include mobile devices and m2m communications. This book also outlines how to connect IBM MessageSight to an existing infrastructure, either through the use of IBM WebSphere® MQ connectivity or the IBM Integration Bus (formerly known as WebSphere Message Broker). This book describes IBM MessageSight product features and facilities that are relevant to technical personnel, such as system architects, to help them make informed design decisions regarding the integration of the messaging appliance into their enterprise architecture. Using a scenario-based approach, you learn how to develop a mobile application, and how to integrate IBM MessageSight with other IBM products. This publication is intended to be of use to a wide-ranging audience.

Storm Blueprints: Patterns for Distributed Real-time Computation

"Storm Blueprints: Patterns for Distributed Real-time Computation" takes you on a hands-on journey into understanding and implementing distributed real-time processing with Apache Storm. Through real-world examples and projects, you'll gain a sound understanding of the fundamentals and learn to design systems capable of resilient, scalable, and fast computation. What this Book will help me do Understand the essentials of Apache Storm and its architecture. Learn to deploy and manage Storm in different modes, including distributed clusters. Discover design patterns for real-time data flow in distributed systems. Master the implementation of fault tolerance and continuous availability in processing. Analyze system performance insights through practical integrations and use cases. Author(s) The author(s) of 'Storm Blueprints' bring extensive experience in distributed systems engineering and real-time computations. Their passion for sharing knowledge is evident in this approachable yet comprehensive book. With years of practical experience, they offer insights and proven techniques to empower readers to build practical distributed systems. Who is it for? This book is designed for software engineers and developers working on data pipelines and real-time processing systems. Beginners to Storm will find it an excellent introduction, while those with experience will appreciate the advanced design patterns and use cases. If you aim to leverage Storm effectively in distributed architectures, this guide is tailored for you.

DFSMSrmm Primer

DFSMSrmm from IBM® is the full function tape management system available in IBM OS/390® and IBM z/OS®. With DFSMSrmm, you can manage all types of tape media at the shelf, volume, and data set level, simplifying the tasks of your tape librarian. Are you a new DFSMSrmm user? Then, this IBM Redbooks® publication introduces you to the DFSMSrmm basic concepts and functions. You learn how to manage your tape environment by implementing the DFSMSrmm management policies. Are you already using DFSMSrmm? In that case, this publication provides the most up-to-date information about the new functions and enhancements introduced with the latest release of DFSMSrmm. You will find useful information for implementing these new functions and getting more benefits from DFSMSrmm. Do you want to test DFSMSrmm functions? If you are using another tape management system and are thinking about converting to DFSMSrmm, you can start DFSMSrmm and run it in parallel with your current system for testing purposes. This book is intended to be a starting point for new professionals and a handbook for using the basic DFSMSrmm functions.

Solr in Action

Solr in Action is a comprehensive guide to implementing scalable search using Apache Solr. This clearly written book walks you through well-documented examples ranging from basic keyword searching to scaling a system for billions of documents and queries. It will give you a deep understanding of how to implement core Solr capabilities. About the Technology About the Book Whether you're handling big (or small) data, managing documents, or building a website, it is important to be able to quickly search through your content and discover meaning in it. Apache Solr is your tool: a ready-to-deploy, Lucene-based, open source, full-text search engine. Solr can scale across many servers to enable real-time queries and data analytics across billions of documents. Solr in Action teaches you to implement scalable search using Apache Solr. This easy-to-read guide balances conceptual discussions with practical examples to show you how to implement all of Solr's core capabilities. You'll master topics like text analysis, faceted search, hit highlighting, result grouping, query suggestions, multilingual search, advanced geospatial and data operations, and relevancy tuning. What's Inside How to scale Solr for big data Rich real-world examples Solr as a NoSQL data store Advanced multilingual, data, and relevancy tricks Coverage of versions through Solr 4.7 About the Reader This book assumes basic knowledge of Java and standard database technology. No prior knowledge of Solr or Lucene is required. About the Authors Trey Grainger is a director of engineering at CareerBuilder. Timothy Potter is a senior member of the engineering team at LucidWorks. The authors work on the scalability and reliability of Solr, as well as on recommendation engine and big data analytics technologies. Quotes The knowledge and techniques you need. - From the Foreword by Yonik Seeley, Creator of Solr Readable and immediately applicable ... an excellent book. - John Viviano, InterCorp, Inc. The go-to guide for Solr ... a definitive resource for both beginners and experts. - Scott Anthony, Business Instruments A well-dosed combination of deep technical knowledge and real-world experience. - Alexandre Madurell, Piksel, Inc.