talk-data.com

Topic

data-engineering

3377

tagged

Activities

Filtering by: O'Reilly Data Engineering Books

IBM and Cisco: Together for a World Class Data Center

This IBM® Redbooks® publication is an IBM and Cisco collaboration that articulates how IBM and Cisco can bring the benefits of their respective companies to the modern data center. It documents the architectures, solutions, and benefits that can be achieved by implementing a data center based on IBM server, storage, and integrated systems, with the broader Cisco network. We describe how to design a state-of-the-art data center and networking infrastructure combining Cisco and IBM solutions. The objective is to provide a reference guide for customers looking to build an infrastructure that is optimized for virtualization, is highly available, is interoperable, and is efficient in terms of power and space consumption. It will explain the technologies used to build the infrastructure, provide use cases, and give guidance on deployments.

The Metadata Manual

Cultural heritage professionals have high levels of training in metadata. However, the institutions in which they practice often depend on support staff, volunteers, and students in order to function. With limited time and funding for training in metadata creation for digital collections, there are often many questions about metadata without a reliable, direct source for answers. The Metadata Manual provides such a resource, answering basic metadata questions that may appear, and exploring metadata from a beginner’s perspective. This title covers metadata basics, XML basics, Dublin Core, VRA Core, and CDWA schemes and provides exercises in the creation of metadata. Finally, the book gives an overview of metadata, including mapping and sharing.
Outlines the most popular metadata schema written by practicing metadata librarians
Focuses on what you “need to know”
Does not require coding experience to use and understand
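
As a rough illustration of the Dublin Core scheme named in this description, the Python sketch below builds a small XML record. Only the Dublin Core element namespace is standard; the `record` wrapper element, the `build_dc_record` helper, and the sample values are hypothetical and are not taken from the book.

```python
import xml.etree.ElementTree as ET

# Namespace of the Dublin Core Metadata Element Set, version 1.1.
DC_NS = "http://purl.org/dc/elements/1.1/"


def build_dc_record(fields):
    """Return an XML string with one dc:* child element per entry in `fields`.

    The <record> wrapper is a hypothetical container chosen for this sketch.
    """
    ET.register_namespace("dc", DC_NS)
    record = ET.Element("record")
    for name, value in fields.items():
        child = ET.SubElement(record, f"{{{DC_NS}}}{name}")
        child.text = value
    return ET.tostring(record, encoding="unicode")


if __name__ == "__main__":
    # Sample values are invented for illustration only.
    print(build_dc_record({
        "title": "Harbor photograph",
        "creator": "Unknown",
        "format": "image/tiff",
    }))
```

Each dictionary key becomes one Dublin Core element (`dc:title`, `dc:creator`, and so on), which is the kind of simple field-to-element mapping a beginner would fill in by hand.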

Mastering Financial Modeling: A Professional’s Guide to Building Financial Models in Excel

All the precision of financial modeling--and none of the complexity
Evidence-based decision making is only as good as the external evidence on which it is based. Financial models uncover potential risks on a company’s balance sheet, but the complexity of these instruments has limited their effectiveness. Now, Mastering Financial Modeling offers a simplified method for building the fast and accurate financial models serious evidence-based decision makers need. What sets this practical guide apart is its "learning-on-the-job" approach. Unlike other books that teach modeling in a vacuum, this superior method uses a diverse collection of case studies to convey each step of the building process. "Learning on the job" connects the dots between the proper Excel formulas and functions and the real-world situations where you want to use them. By learning through association, you can absorb the information quickly and have it ready to use when you need it. The book starts right off on building models--from creating a standalone cash flow model through integrating it with an income statement and balance sheet. Along the way, you will master the skill set you need to build advanced financial models. With only a basic knowledge of accounting and finance, individual investors and financial professionals alike can:
Create a core model and customize it for companies in most industries
Understand every working component of a financial model and what each one tells you about a company
Format cells and sheets in Excel for easily repeatable modeling
Written with the practitioner in mind, Mastering Financial Modeling shows you how to ensure your model is ready for real-world application by safeguarding it against modeling errors. It covers a full array of Excel's built-in auditing and testing tools and illustrates how to build customized error-checking tools of your own to catch the inaccuracies that typically fall through the cracks.
Get the most out of your data with Mastering Financial Modeling. Mastering Financial Modeling brings the power of financial models down to earth and puts it in the hands of investors, bankers, and private equity professionals who don't have a passion for crunching numbers. Nowhere else can you get step-by-step instruction on building these valuable tools from an elite World Bank investment officer. Starting from the ground up, Eric Soubeiga shows you how to interpret and build financial models in Microsoft Excel that will accurately assess any company’s valuation and profit potential. Even if you have unsuccessfully tried financial modeling in the past, this book will reach you because it associates every lesson to the business world you work in daily. Chapter by chapter, you will master financial modeling, and in the end, you will:
Command authority over building every aspect of a financial model
Be capable of explaining the accounting and finance concepts behind the mechanics of modeling
Confidently determine a company’s ability to generate cash flows for its capital investors with discounted cash flow (DCF) modeling
Execute powerful spreadsheet calculations in Excel
Most importantly, as a decision maker, the insight you bring to the table through your sophisticated understanding and application of financial modeling will benefit every stakeholder. See what leading professionals around the world already know--Mastering Financial Modeling is the most comprehensive guide on the market for designing, building, and implementing valuation projection models. What it does from there is up to you.
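
The discounted cash flow (DCF) idea mentioned in this description can be sketched in a few lines of Python. This is an illustration under simple assumptions (cash flows arrive at the end of each year, one constant discount rate, no terminal value), not the book's Excel method; the function name and the figures are hypothetical.

```python
def discounted_cash_flow(cash_flows, discount_rate):
    """Present value of cash flows received at the end of years 1, 2, ...

    Each cash flow is divided by (1 + discount_rate) raised to its year.
    """
    return sum(
        cash_flow / (1 + discount_rate) ** year
        for year, cash_flow in enumerate(cash_flows, start=1)
    )


if __name__ == "__main__":
    # Hypothetical three-year projection, discounted at 10% per year.
    projected = [100.0, 110.0, 121.0]
    print(round(discounted_cash_flow(projected, 0.10), 2))  # prints 272.73
```

In this example each cash flow grows at the same 10% rate used for discounting, so every year contributes the same present value of about 90.91.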

Microsoft BizTalk ESB Toolkit 2.1

This book, 'Microsoft BizTalk ESB Toolkit 2.1,' provides a detailed and practical guide to implementing enterprise integration solutions using Microsoft's robust Toolkit. You'll explore architectural principles, key components, and advanced features, allowing you to create efficient and scalable integration infrastructures.
What this Book will help me do
Understand the architecture and core principles of the ESB Toolkit.
Learn to use Itinerary components to manage and drive flexible service compositions.
Master the error handling features of the toolkit to ensure reliability in integration processes.
Explore the ESB Management Portal for operational and administrative tasks.
Gain hands-on experience with the Toolkit's web services for extending applications.
Author(s)
The content in this book is developed by a team of technical writers and software experts who specialize in enterprise integration and BizTalk solutions. With collective years of experience and deep knowledge about BizTalk Server and related enterprise tools, the authors aim to deliver practical and applicable insights in a clear and effective manner.
Who is it for?
This book is ideal for BizTalk developers aiming to deepen their expertise in using the ESB Toolkit, architects wanting to understand its role in enterprise service integration, and IT managers seeking to improve their service-oriented architecture. New users who wish to get acquainted with the ESB Toolkit will also find value in its step-by-step walkthroughs.

IBM Flex System and PureFlex System Network Implementation

To meet today's complex and ever-changing business demands, you need a solid foundation of server, storage, networking, and software resources that are simple to deploy and can quickly and automatically adapt to changing conditions. You also need access to, and the ability to take advantage of, broad expertise and proven best practices in systems management, applications, hardware maintenance, and more. IBM® PureFlex™ System, which is a part of the IBM PureSystems™ family of expert integrated systems, combines advanced IBM hardware and software along with patterns of expertise and integrates them into three optimized configurations that are simple to acquire and deploy so that you can achieve faster time to value. If you want a preconfigured, preintegrated infrastructure with integrated management and cloud capabilities, factory tuned from IBM with x86 and Power Systems™ hybrid solution, IBM PureFlex System is the answer. In this IBM Redbooks® publication, which is aimed at system and network administrators, we show the design and architecture, how to configure hosts and switches, maintain, and troubleshoot using the IBM Flex System™ Ethernet I/O modules (EN2092 1Gb Ethernet Scalable Switch and EN4093R 10Gb Scalable Switch).

IBM System Storage DCS3700 Introduction and Implementation Guide

The IBM® System Storage® DCS3700 consists of IBM System Storage DCS3700 storage subsystem and IBM System Storage DCS3700 expansion unit. The DCS3700 features the latest technologies, including 6 Gbps SAS and 8 Gbps Fibre Channel host interfaces, along with 6 Gbps SAS drives. The DCS3700 also features a 10 Gbps iSCSI host interface with an optional Performance Modules system. The DCS3700 provides a simple, efficient, and flexible approach to storage that is based on seven generations of design knowledge and firmware development. The DCS3700 can act as a cost-effective and fully integrated complement to IBM System x® servers, IBM BladeCenter®, and IBM Power Systems™ for various intensive computing environments. This IBM Redbooks® publication specifically addresses the hardware features, configuration, and implementation of the DCS3700. It presents detailed descriptions of the hardware configurations and options that are offered with the DCS3700. It then presents the concepts and functions that are used in planning and managing the storage subsystems, such as multipathing and path failover. This book offers a step-by-step guide to using the IBM Storage Manager to create arrays, logical drives, and other basic (and advanced) management tasks. This publication also contains practical information about diagnostic tests and troubleshooting, and includes practical examples about how to use scripts and the command-line interface (CLI).

IBM Business Process Manager Version 8.0 Production Topologies

This IBM® Redbooks® publication describes how to build production topologies for IBM Business Process Manager V8.0. This book is an update of the existing book IBM Business Process Manager V7.5 Production Topologies, SG24-7976. It is intended for IT Architects and IT Specialists who want to understand and implement these topologies. Use this book to select the appropriate production topologies for an environment, then follow the step-by-step instructions to build those topologies. Part 1 introduces IBM Business Process Manager and provides an overview of basic topology components, and Process Server and Process Center. This part also provides an overview of the production topologies described in this book, including selection criteria for choosing a topology. IBM Business Process Manager security and the presentation layer are also addressed in this part. Part 2 provides a series of step-by-step instructions for creating production topology environments by using deployment environment patterns. This process includes topologies that incorporate IBM Business Monitor. This part also describes advanced topology topics. Part 3 covers post-installation instructions for implementing production topology environments such as configuring IBM Business Process Manager to use IBM HTTP Server and WebSphere® proxy server. Please note that the additional material referenced in the text is not available from IBM.

Practical Anonymity

For those with legitimate reason to use the Internet anonymously--diplomats, military and other government agencies, journalists, political activists, IT professionals, law enforcement personnel, political refugees and others--anonymous networking provides an invaluable tool, and there are many good reasons why anonymity can serve a very important purpose. Anonymous use of the Internet is made difficult by the many websites that know everything about us, by cookies and ad networks, and by IP-logging ISPs; even nosy officials may get involved. Turning off browser cookies is no longer enough to be left alone in your online life. Practical Anonymity: Hiding in Plain Sight Online shows you how to use the most effective and widely used anonymity tools--the ones that protect diplomats, military and other government agencies--to become invisible online. This practical guide skips the theoretical and technical details and focuses on getting from zero to anonymous as fast as possible. For many, using any of the open-source, peer-reviewed tools for connecting to the Internet via an anonymous network may be (or seem to be) too difficult because most of the information about these tools is burdened with discussions of how they work and how to maximize security. Even tech-savvy users may find the burden too great--but actually using the tools can be pretty simple. The primary market for this book consists of IT professionals who need or want tools for anonymity to test or work around corporate firewalls and router filtering, as well as to provide anonymity tools to their customers.
Simple, step-by-step instructions for configuring and using anonymous networking software
Use of open source, time-proven and peer-reviewed tools for anonymity
Plain-language discussion of actual threats and concrete suggestions for appropriate responses
Easy-to-follow tips for safer computing

Data Model Patterns

This is the digital version of the printed book (Copyright © 1996). Learning the basics of a modeling technique is not the same as learning how to use and apply it. To develop a data model of an organization is to gain insights into its nature that do not come easily. Indeed, analysts are often expected to understand subtleties of an organization's structure that may have evaded people who have worked there for years. Here's help for those analysts who have learned the basics of data modeling (or "entity/relationship modeling") but who need to obtain the insights required to prepare a good model of a real business. Structures common to many types of business are analyzed in areas such as accounting, material requirements planning, process manufacturing, contracts, laboratories, and documents. In each chapter, high-level data models are drawn from the following business areas:
The Enterprise and Its World
The Things of the Enterprise
Procedures and Activities
Contracts
Accounting
The Laboratory
Material Requirements Planning
Process Manufacturing
Documents
Lower-Level Conventions

Pro Oracle Database 12c Administration, Second Edition

Pro Oracle Database 12c Administration is a book focused on results. Author Darl Kuhn draws from a well of experience over a decade deep to lay out real-world techniques that lead to success as an Oracle Database administrator. He gives clear explanations on how to perform critical tasks. He weaves in theory where necessary without bogging you down in unneeded detail. He is not afraid to take a stand on how things should be done. He won't leave you adrift in a sea of choices, showing you three ways to do something and then walking away. Database administration isn't about passing a certified exam, or about pointing-and-clicking your way through a crisis. Database administration is about applying the right solution at the right time, about avoiding risk, about making robust choices that get you home each night in time for dinner with your family. If you have "buck stops here" responsibility for an Oracle database, then Pro Oracle Database 12c Administration is the book you need to help elevate yourself to the level of Professional Oracle Database Administrator.
Covers multi-tenant container and pluggable database implementation and management.
Condenses and organizes the core job of a database administrator into one volume.
Takes a results-oriented approach to getting things done.
Lays a foundation upon which to build a senior level of expertise.
What you'll learn
Create a stable environment consistent across all databases that you manage
Manage pluggable and multi-tenant databases
Take care of job #1: backing up, and then recovering when needed
Manage users and objects, and the security between them
Do battle with "large"—large databases and large objects
Move and distribute data using Data Pump, materialized views, external tables
Automate critical jobs and tackle database troubleshooting problems
Who this book is for
Pro Oracle Database 12c Administration is aimed at new database administrators who aspire to senior positions in which employers and customers trust you to work independently, and with a "buck stops here" attitude.

RMAN Recipes for Oracle Database 12c: A Problem-Solution Approach, Second Edition

RMAN Recipes for Oracle Database 12c is an example-driven approach to the Oracle database administrator's #1 job responsibility: Be able to recover the database. Of all the things you are responsible for as database administrator, nothing is more important than the data itself. Like it or not, the fearsome responsibility of protecting your organization's most critical data falls squarely upon your shoulders: Lose that data and your company could fail. Lose that data and you could be out of a job. Oracle's flagship database product fortunately implements a wide-ranging feature set to aid you in the all-important task of safeguarding against data loss. Recovery Manager, or RMAN, is at the heart of that feature set, and is the tool most often used to initiate database backup and recovery operations. In this book, well-known authors and database experts Darl Kuhn, Sam Alapati, and Arup Nanda have created a set of examples encompassing the gamut of backup and recovery tasks that you might need to perform. Sometimes, especially when the heat is on, a good example is what you need to get started towards a solution. RMAN Recipes for Oracle Database 12c delivers. It'll be the book you reach for when that dreaded call comes in at 3:00am some dreary morning. It'll be the book that lets you sleep at night knowing that, no matter what transpires, you've done your job well and can recover from any outage. RMAN Recipes for Oracle Database 12c gets right to the point with quick and easy-to-read, step-by-step solutions that can help you back up and recover your data with confidence.
What you'll learn
Reliably back up and recover your database using Oracle's Recovery Manager
Let Oracle Database manage your backup files via the Fast Recovery Area
Automate backup and recovery tasks by writing scripts
Troubleshoot RMAN problems and optimize RMAN performance
Recover from the loss of a control file, loss of an online redo log, and from other unusual situations
Who this book is for
RMAN Recipes for Oracle Database 12c is aimed squarely at Oracle database administrators responsible for database backup and recovery operations.

Apache Flume: Distributed Log Collection for Hadoop

Apache Flume: Distributed Log Collection for Hadoop is a focused guide for users looking to efficiently collect and transport log data into systems like Hadoop using Apache Flume. Its step-by-step approach covers the installation, configuration, and customization of Flume to optimize your data ingestion workflows.
What this Book will help me do
Effectively install and set up Apache Flume for your data ingestion processes.
Understand Flume's architecture and capabilities, including sources, channels, and sinks.
Learn to configure reliable data flow paths using failover and load-balancing techniques.
Implement data routing and transformations during data flow using Flume.
Optimize and monitor your Flume operations to enhance reliability and performance.
Author(s)
The authors of this book are experienced software engineers and data administrators with deep knowledge and practical expertise in implementing distributed log collection systems. Their teaching approach combines clear explanation with actionable examples to give you a hands-on learning experience.
Who is it for?
This book is ideal for software engineers, data engineers, and system administrators involved in handling and transporting datasets, especially those with a focus on Hadoop. If you are seeking to understand or optimize Apache Flume for your data processing pipeline, this book will guide you from beginner-friendly setup to advanced customization, helping to enhance your workflows.

Microsoft Access 2013 Inside Out

Conquer Microsoft Access 2013—from the inside out! You’re beyond the basics, so dive right into Access 2013—and use your skills to create sophisticated database apps! This supremely organized reference packs hundreds of timesaving solutions, troubleshooting tips, and workarounds. It’s all muscle and no fluff. Discover how the experts tackle Access 2013—and challenge yourself to new levels of mastery.
Build an Access Services web app with Microsoft SharePoint Server
Automate your Access web app with data macros
Create tables in your Access web app using built-in templates
Aggregate and display your web app data using totals queries
Use the Autocomplete control to quickly search for related data
Create a Summary view to consolidate and group information
Display related data on your views with the Related Items control
Package your web app for use by others in your organization
Plus—download chapters on building desktop databases
For Intermediate and Advanced Users and Database Designers

IBM Flex System and PureFlex System Network Implementation with Juniper Networks

To meet today's complex and ever-changing business demands, you need a solid foundation of server, storage, networking and software resources that is simple to deploy and can quickly and automatically adapt to changing conditions. You also need access to, and the ability to take advantage of, broad expertise and proven best practices in systems management, applications, hardware maintenance and more. IBM® PureFlex™ System, which is a part of the IBM PureSystems™ family of expert integrated systems, combines advanced IBM hardware and software along with patterns of expertise and integrates them into three optimized configurations that are simple to acquire and deploy so you can achieve faster time to value. If you want a pre-configured, pre-integrated infrastructure with integrated management and cloud capabilities, factory tuned from IBM with x86 and Power hybrid solution, IBM PureFlex System is the answer. In this IBM Redbooks® publication, we use EX4500 core switches to demonstrate interoperability with the System Networking switches (RackSwitch™ G8264 top of rack switch and the Flex system fabric EN4093 10Gb scalable switch). We also describe a redundant environment using QFX3500 switches running IBM Virtual-Link Aggregation Group (MC-LAG/vLAG) and Juniper Multi-Chassis Link Aggregation Group.

IBM XIV Storage System Gen3: Architecture, Implementation, and Usage

This IBM® Redbooks® publication describes the concepts, architecture, and implementation of the IBM XIV® Storage System. The XIV Storage System is a scalable enterprise storage system that is based on a grid array of hardware components. It can attach to both Fibre Channel Protocol (FCP) and IP network Small Computer System Interface (iSCSI) capable hosts. This system is a good fit for clients who want to be able to grow capacity without managing multiple tiers of storage. The XIV Storage System is suited for mixed or random access workloads, including online transaction processing, video streaming, images, email, and emerging workload areas, such as Web 2.0 and storage cloud. The focus of this edition is on the XIV Gen3 hardware Release 3.2, running Version 11.2 of the XIV system software. With this version, XIV Storage System offers up to five times the iSCSI throughput with new 10 GbE ports, a performance boost with new CPUs, and enhanced caching with optional solid-state drives (SSDs). The IBM XIV software Version 11.2 also offers support for Windows Server 2012, including space reclamation. In addition, the software enables drive rebuild times as fast as 26 minutes for a fully utilized 2 TB hard disk drive under heavy load. In the first few chapters of this book, we describe many of the unique and powerful concepts that form the basis of the XIV Storage System logical and physical architecture. We explain how the system is designed to eliminate direct dependencies between the hardware elements and the software that governs the system. In subsequent chapters, we explain the planning and preparation tasks that are required to deploy the system in your environment. A step-by-step procedure is presented that describes how to configure and administer the system. Illustrations show how to perform those tasks by using the intuitive, yet powerful XIV Storage Manager GUI or the XIV command-line interface (XCLI).
We describe the performance characteristics of the XIV Storage System and present options that are available for alerting and monitoring, including an enhanced secure remote support capability. This book is intended for IT professionals who want an understanding of the XIV Storage System. It also targets readers who need detailed advice on how to configure and use the system.

Enterprise Data Workflows with Cascading

There is an easier way to build Hadoop applications. With this hands-on book, you’ll learn how to use Cascading, the open source abstraction framework for Hadoop that lets you easily create and manage powerful enterprise-grade data processing applications—without having to learn the intricacies of MapReduce. Working with sample apps based on Java and other JVM languages, you’ll quickly learn Cascading’s streamlined approach to data processing, data filtering, and workflow optimization. This book demonstrates how this framework can help your business extract meaningful information from large amounts of distributed data.
Start working on Cascading example projects right away
Model and analyze unstructured data in any format, from any source
Build and test applications with familiar constructs and reusable components
Work with the Scalding and Cascalog Domain-Specific Languages
Easily deploy applications to Hadoop, regardless of cluster location or data size
Build workflows that integrate several big data frameworks and processes
Explore common use cases for Cascading, including features and tools that support them
Examine a case study that uses a dataset from the Open Data Initiative

Software Development on the SAP HANA Platform

Software Development on the SAP HANA Platform equips you with all the knowledge you need to master developing on this high-performance in-memory technology. From setup and installation to deploying fully functional HANA applications, this book guides you step by step. With hands-on chapters, you'll gain the analytical tools and data management proficiency needed to excel.
What this Book will help me do
Set up a SAP HANA development environment from scratch.
Successfully execute your first development project on SAP HANA.
Utilize each type of view in SAP HANA effectively for data manipulation.
Create users with appropriate authorizations for reporting purposes.
Deploy reporting applications to end-user software seamlessly.
Author(s)
Mark Walker is a seasoned expert in SAP HANA, with years of professional experience in enterprise software development and training. He brings a passion for teaching complex technologies in an approachable and practical way. Mark's hands-on approach ensures that readers not only learn but can confidently apply their new skills.
Who is it for?
This book is designed for software developers and data professionals looking to expand their expertise with SAP HANA. It is ideal for those new to this platform or professionals enhancing their analytical and data management skills. Whether you're starting from scratch or upgrading your capabilities, this book suits your needs. The lessons here will assist in reaching your SAP HANA proficiency goals.

IBM Information Server: Integration and Governance for Emerging Data Warehouse Demands

This IBM® Redbooks® publication is intended for business leaders and IT architects who are responsible for building and extending their data warehouse and Business Intelligence infrastructure. It provides an overview of powerful new capabilities of Information Server in the areas of big data, statistical models, data governance and data quality. The book also provides key technical details that IT professionals can use in solution planning, design, and implementation.

Database Cloud Storage

Implement a Centralized Cloud Storage Infrastructure with Oracle Automatic Storage Management
Build and manage a scalable, highly available cloud storage solution. Filled with detailed examples and best practices, this Oracle Press guide explains how to set up a complete cloud-based storage system using Oracle Automatic Storage Management. Find out how to prepare hardware, build disk groups, efficiently allocate storage space, and handle security. Database Cloud Storage: The Essential Guide to Oracle Automatic Storage Management shows how to monitor your system, maximize throughput, and ensure consistency across servers and clusters.
Set up and configure Oracle Automatic Storage Management
Discover and manage disks and establish disk groups
Create, clone, and administer Oracle databases
Consolidate resources with Oracle Private Database Cloud
Control access, encrypt files, and assign user privileges
Integrate replication, file tagging, and automatic failover
Employ pre-engineered private cloud database consolidation tools
Check for data consistency and resync failed disks
Code examples in the book are available for download

Learning SPARQL, 2nd Edition

Gain hands-on experience with SPARQL, the RDF query language that’s bringing new possibilities to semantic web, linked data, and big data projects. This updated and expanded edition shows you how to use SPARQL 1.1 with a variety of tools to retrieve, manipulate, and federate data from the public web as well as from private sources. Author Bob DuCharme has you writing simple queries right away before providing background on how SPARQL fits into RDF technologies. Using short examples that you can run yourself with open source software, you’ll learn how to update, add to, and delete data in RDF datasets.
Get the big picture on RDF, linked data, and the semantic web
Use SPARQL to find bad data and create new data from existing data
Use datatype metadata and functions in your queries
Learn techniques and tools to help your queries run more efficiently
Use RDF Schemas and OWL ontologies to extend the power of your queries
Discover the roles that SPARQL can play in your applications