talk-data.com

Event

O'Reilly Data Engineering Books

2001-10-19 – 2027-05-25 O'Reilly

Activities tracked

3406

Collection of O'Reilly books on Data Engineering.

Sessions & talks

Showing 2101–2125 of 3406 · Newest first

IBM System Storage N series Software Guide

Corporate workgroups, distributed enterprises, and small to medium-sized companies are increasingly seeking to network and consolidate storage to improve availability, share information, reduce costs, and protect and secure information. These organizations require enterprise-class solutions capable of addressing immediate storage needs cost-effectively, while providing an upgrade path for future requirements. IBM® System Storage® N series storage systems and their software capabilities are designed to meet these requirements. IBM System Storage N series storage systems offer an excellent solution for a broad range of deployment scenarios. They function as multiprotocol storage devices that are designed to let you simultaneously serve both file and block-level data across a single network. These activities are demanding procedures that, for some solutions, require multiple, separately managed systems. The flexibility of IBM System Storage N series storage systems, however, allows them to address the storage needs of a wide range of organizations, including distributed enterprises and data centers for midrange enterprises. IBM System Storage N series storage systems also support sites with compute- and data-intensive enterprise applications, such as database, data warehousing, workgroup collaboration, and messaging. This IBM® Redbooks® publication explains the software features of the IBM System Storage N series storage systems. This book also covers topics such as installation, setup, and administration of those software features from the IBM System Storage N series storage systems and clients, and provides example scenarios.

Oracle Exalytics Revealed

An in-depth Oracle Press guide to Oracle Exalytics In-Memory Machine. Written by Oracle ACE Director Mark Rittman, this e-book offers comprehensive coverage of Oracle Exalytics In-Memory Machine, the best-in-class, preintegrated BI platform. After an overview of the product and its capabilities, you’ll get complete details on installation, configuration, administration, maintenance, and creation of the in-memory adaptive data mart that powers the “speed-of-thought” dashboards. Expert tips for integrating with both Oracle and non-Oracle enterprise data sources are included. Enable your organization to analyze large amounts of data via rich, visual dashboards using Oracle Exalytics In-Memory Machine and this Oracle Press guide.

Professional Microsoft SQL Server 2012 Analysis Services with MDX and DAX

Understand Microsoft's dramatically updated new release of its premier toolset for business intelligence. The first major update to Microsoft's state-of-the-art, complex toolset for business intelligence (BI) in years is now available, and what better way to master it than with this detailed book from key members of the product's development team? If you're a database or data warehouse developer, this is the expert resource you need to build full-scale, multi-dimensional database applications using Microsoft's new SQL Server 2012 Analysis Services and related tools. Discover how to solve real-world BI problems by leveraging a slew of powerful new Analysis Services features and capabilities. These include the new DAX language, which is a more user-friendly version of MDX; PowerPivot, a new tool for performing simplified analysis of data; BISM, Microsoft's new Business Intelligence Semantic Model; and much more. The book serves as an authoritative guide to Microsoft's new SQL Server 2012 Analysis Services BI product and is written by key members of the Microsoft Analysis Services product development team. It covers SQL Server 2012 Analysis Services, a major new release with a host of powerful new features and capabilities. Topics include using the new DAX language, a simplified, more user-friendly version of MDX; PowerPivot, a new tool for performing simplified analysis of data; BISM, Microsoft's new Business Intelligence Semantic Model; and a new, yet-to-be-named BI reporting tool. The book also explores real-world scenarios to help developers build comprehensive solutions. Get thoroughly up to speed on this powerful new BI toolset with the timely and authoritative Professional Microsoft SQL Server 2012 Analysis Services with MDX and DAX.

Student Solutions Manual to Accompany Loss Models: From Data to Decisions, Fourth Edition

Praise for the Third Edition: "This book provides in-depth coverage of modelling techniques used throughout many branches of actuarial science... The exceptional high standard of this book has made it a pleasure to read." --Annals of Actuarial Science. Newly organized to focus exclusively on material tested in the Society of Actuaries' Exam C and the Casualty Actuarial Society's Exam 4, Loss Models: From Data to Decisions, Fourth Edition continues to supply actuaries with a practical approach to the key concepts and techniques needed on the job. With updated material and extensive examples, the book successfully provides the essential methods for using available data to construct models for the frequency and severity of future adverse outcomes. The book continues to equip readers with the tools needed for the construction and analysis of mathematical models that describe the process by which funds flow into and out of an insurance system. Focusing on the loss process, the authors explore key quantitative techniques including random variables, basic distributional quantities, and the recursive method, and discuss techniques for classifying and creating distributions. Parametric, non-parametric, and Bayesian estimation methods are thoroughly covered along with advice for choosing an appropriate model. New features of this Fourth Edition include: expanded discussion of working with large data sets, now including more practical elements of constructing decrement tables; added coverage of methods for simulating several special situations; and an updated presentation of Bayesian estimation, outlining conjugate prior distributions and the linear exponential family as well as related computational issues. Throughout the book, numerous examples showcase the real-world applications of the presented concepts, with an emphasis on calculations and spreadsheet implementation.
A wealth of new exercises taken from previous Exam C/4 exams allows readers to test their comprehension of the material, and a related FTP site features the book's data sets. Loss Models, Fourth Edition is an indispensable resource for students and aspiring actuaries who are preparing to take the SOA and CAS examinations. The book is also a valuable reference for professional actuaries, actuarial students, and anyone who works with loss and risk models. To explore our additional offerings in actuarial exam preparation visit www.wiley.com/go/c4actuarial.

Hadoop in Practice

Hadoop in Practice collects 85 Hadoop examples and presents them in a problem/solution format. Each technique addresses a specific task you'll face, like querying big data using Pig or writing a log file loader. You'll explore each problem step by step, learning both how to build and deploy that specific solution along with the thinking that went into its design. As you work through the tasks, you'll find yourself growing more comfortable with Hadoop and at home in the world of big data.
About the Technology: Hadoop is an open source MapReduce platform designed to query and analyze data distributed across large clusters. Especially effective for big data systems, Hadoop powers mission-critical software at Apple, eBay, LinkedIn, Yahoo, and Facebook. It offers developers handy ways to store, manage, and analyze data.
About the Book: Hadoop in Practice collects 85 battle-tested examples and presents them in a problem/solution format. It balances conceptual foundations with practical recipes for key problem areas like data ingress and egress, serialization, and LZO compression. You'll explore each technique step by step, learning how to build a specific solution along with the thinking that went into it. As a bonus, the book's examples create a well-structured and understandable codebase you can tweak to meet your own needs.
What's Inside: a conceptual overview of Hadoop and MapReduce; 85 practical, tested techniques; real problems, real solutions; how to integrate MapReduce and R.
About the Reader: This book assumes you've already started exploring Hadoop and want concrete advice on how to use it in production.
About the Author: Alex Holmes is a senior software engineer with extensive expertise in solving big data problems using Hadoop. He has presented at JavaOne and Jazoon and is a technical lead at VeriSign.
Quotes:
"Interesting topics that tickle the creative brain." - Mark Kemna, Brillig
"Ties together the Hadoop ecosystem technologies." - Ayon Sinha, Britely
"Comprehensive … high-quality code samples." - Chris Nauroth, The Walt Disney Company
"Covers all of the variants of Hadoop, not just the Apache distribution." - Ted Dunning, MapR Technologies
"Charts a path to the future." - Alexey Gayduk, Grid Dynamics

Structural Equation Modeling: Applications Using Mplus

A reference guide for applications of SEM using Mplus. Structural Equation Modeling: Applications Using Mplus is intended as both a teaching resource and a reference guide. Written in non-mathematical terms, this book focuses on the conceptual and practical aspects of Structural Equation Modeling (SEM). Basic concepts and examples of various SEM models are demonstrated along with recently developed advanced methods, such as mixture modeling and model-based power analysis and sample size estimation for SEM. The statistical modeling program Mplus is also featured; it provides researchers with a flexible tool to analyze their data, with an easy-to-use interface and graphical displays of data and analysis results. Key features: Presents a useful reference guide for applications of SEM while systematically demonstrating various advanced SEM models, such as multi-group and mixture models, using Mplus. Discusses and demonstrates various SEM models using both cross-sectional and longitudinal data with both continuous and categorical outcomes. Provides step-by-step instructions for model specification and estimation, as well as detailed interpretation of Mplus results. Explores different methods for sample size estimation and statistical power analysis for SEM. By following the examples provided in this book, readers will be able to build their own SEM models using Mplus. Teachers, graduate students, and researchers in social sciences and health studies will also benefit from this book.

Pro T-SQL 2012 Programmer's Guide, Third Edition

Pro T-SQL 2012 Programmer's Guide is every developer's key to making full use of SQL Server 2012's powerful, built-in Transact-SQL language. Discussing new and existing features, the book takes you on an expert guided tour of Transact-SQL functionality. Fully functioning examples and downloadable source code bring technically accurate and engaging treatment of Transact-SQL into your own hands. Step-by-step explanations ensure clarity, and an advocacy of best practices will steer you down the road to success. Transact-SQL is the language developers and DBAs use to interact with SQL Server. It's used for everything from querying data, to writing stored procedures, to managing the database. New features in T-SQL 2012 include full support for window functions, stored sequences, the ability to throw errors, data paging, and more. All these important new features are covered in this book. Developers and DBAs alike can benefit from the expressive power of Transact-SQL, and Pro T-SQL 2012 Programmer's Guide provides the gateway to success in applying this increasingly important database language to everyday business and technical tasks. What you'll learn: take full advantage of window functions and other new features in T-SQL 2012; move business logic into the database via procedures and functions; develop using advanced features such as encryption and full-text search; store and retrieve XML data using XQuery and XPath; debug and optimize T-SQL execution; build ADO.NET client applications. Who this book is for: Pro T-SQL 2012 Programmer's Guide is written for SQL Server and Transact-SQL developers who want to implement best practices and take full advantage of all that SQL Server has to offer. .NET programmers will find the book helpful in showing how to run .NET code within SQL Server. Database administrators who need to write triggers and the occasional stored procedure will also benefit from the book.

IBM System Storage N series Hardware Guide

This IBM® Redbooks® publication provides a detailed look at the features, benefits, and capabilities of the IBM System Storage® N series hardware offerings. The IBM System Storage N series systems can help you tackle the challenge of effective data management by using virtualization technology and a unified storage architecture. The N series delivers low- to high-end enterprise storage and data management capabilities with midrange affordability. Built-in serviceability and manageability features help support your efforts to increase reliability; simplify and unify storage infrastructure and maintenance; and deliver exceptional economy. The IBM System Storage N series systems provide a range of reliable, scalable storage solutions to meet various storage requirements. These capabilities are achieved by using network access protocols such as Network File System (NFS), Common Internet File System (CIFS), HTTP, and iSCSI, and storage area network technologies such as Fibre Channel. Using built-in Redundant Array of Independent Disks (RAID) technologies, all data is protected, with options to enhance protection through mirroring, replication, Snapshots, and backup. These storage systems also have simple management interfaces that make installation, administration, and troubleshooting straightforward. This book also addresses high-availability solutions, including clustering and MetroCluster, that support the most demanding business continuity requirements. MetroCluster is a unique solution that combines array-based clustering with synchronous mirroring to deliver continuous availability.

Effective MySQL: Replication Techniques in Depth

The Essential Guide to Scaling Your Business Solution. Written by Oracle ACE Director and MySQL expert Ronald Bradford, with coauthor Chris Schneider, Effective MySQL: Replication Techniques in Depth describes what is needed to understand and implement MySQL replication to build scalable solutions. This book includes detailed syntax examples to demonstrate the features, options, and limitations of native MySQL replication. Providing an evaluation of various new replication features and additional third-party product implementations, this Oracle Press guide helps to ensure your MySQL environment can support the various high-availability needs of your business. Master the strengths and limitations of native asynchronous replication in a MySQL topology. Identify the important features to improve replication for growing business requirements. Recognize the key business factors to determine your optimal high-availability needs. Understand the benefits of using MySQL replication for failover scenarios. Identify the key configuration variables and SQL commands affecting master/slave replication. Learn about the advancements in replication techniques provided by new products, including Tungsten Replicator and Galera. Optimize your replication management with various utilities and toolkits. Find additional detailed information and presentations at EffectiveMySQL.com.

Programming Hive

Need to move a relational database application to Hadoop? This comprehensive guide introduces you to Apache Hive, Hadoop’s data warehouse infrastructure. You’ll quickly learn how to use Hive’s SQL dialect, HiveQL, to summarize, query, and analyze large datasets stored in Hadoop’s distributed filesystem. This example-driven guide shows you how to set up and configure Hive in your environment, provides a detailed overview of Hadoop and MapReduce, and demonstrates how Hive works within the Hadoop ecosystem. You’ll also find real-world case studies that describe how companies have used Hive to solve unique problems involving petabytes of data. Use Hive to create, alter, and drop databases, tables, views, functions, and indexes. Customize data formats and storage options, from files to external databases. Load and extract data from tables, and use queries, grouping, filtering, joining, and other conventional query methods. Gain best practices for creating user defined functions (UDFs). Learn Hive patterns you should use and anti-patterns you should avoid. Integrate Hive with other data processing programs. Use storage handlers for NoSQL databases and other datastores. Learn the pros and cons of running Hive on Amazon’s Elastic MapReduce.

SQL Server 2012 T-SQL Recipes: A Problem-Solution Approach

SQL Server 2012 T-SQL Recipes is an example-based guide to the Transact-SQL language that is at the core of SQL Server 2012. It provides ready-to-implement solutions to common programming and database administration tasks. Learn to create databases, insert and update data, generate reports, secure your data, and more. Tasks and their solutions are broken down into a problem/solution format that is quick and easy to read so that you can get the job done fast when the pressure is on. Solutions in this book are divided into chapters by problem domain. Each chapter is a collection of solutions around a single facet of the language such as writing queries, developing triggers, and applying aggregate functions. Each solution is presented code-first, giving you a working code example to copy from and implement immediately in your own environment. Following each example is an in-depth description of how and why the given solution works. Tradeoffs and alternative approaches are also discussed. Focused on solutions: Look up what you need to do. Learn how to do it. Do it. Current: Newly updated for SQL Server 2012. Comprehensive: Covers all common T-SQL problem domains. What you'll learn: create databases, tables, and indexes; query and manipulate data; store and manage XML inside the database; move business logic into the database; resolve common performance problems; build reports that matter to your business; perform common backup and recovery tasks. Who this book is for: SQL Server 2012 T-SQL Recipes is aimed at technically oriented users of SQL Server who want to extract the full power of the platform through its powerful, built-in programming and scripting language. Target readers include developers who use Microsoft SQL Server 2012 as their back-end database, and the database administrators who create, manage, and secure those databases.

IBM Business Process Manager Security: Concepts and Guidance

This IBM® Redbooks® publication provides information about security for an organization’s business process management (BPM) program, identifies common security holes that often occur in this field, and describes techniques for rectifying them. This book documents preferred practices and common security hardening exercises that you can use to achieve a reasonably well-secured BPM installation. Many of the practices described in this book apply equally to generic Java 2 Platform, Enterprise Edition (J2EE) applications, as well as to BPM. However, the book focuses on aspects that typically do not receive adequate consideration in actual practice. It also addresses the BPM Standard and BPM Advanced Editions equally, although there are topics inherent in BPM Advanced that we considered to be out of scope for this book. This book is not meant as a technical deep-dive into any one topic, technology, or philosophy. IBM offers a variety of training and consulting services that can help you to understand and evaluate the implications of this book’s topic in your own organization.

Microsoft® SQL Server 2012 Management and Administration, Second Edition

If you need to deploy, manage, or secure Microsoft SQL Server 2012, this is the complete, fast-paced, task-based reference you’ve been searching for. Authored by a world-class expert on running SQL Server in the enterprise, this book goes far beyond the basics, taking on the complex tasks that DBAs need to make the most of Microsoft’s first cloud-enabled database platform. Designed for maximum practical usability, it’s packed with expert tips and up-to-date real-world configuration guidance you simply won’t find anywhere else. Having helped influence the design of SQL Server 2012, and drawing on many months of beta testing, Ross Mistry provides immediately usable solutions for installation and upgrades, management and monitoring, performance and availability, security, consolidation, virtualization, troubleshooting, and more. Mistry identifies new features and corresponding best practices in every chapter, helping you take full advantage of new SQL Server innovations ranging from private cloud support to AlwaysOn Availability Groups. Understand how to: efficiently install or upgrade the SQL Server 2012 database engine; administer and configure database engine settings, storage, I/O, and partitioning; transfer data on-premises or to the cloud; manage and optimize indexes; consolidate, virtualize, and optimize SQL Server for private clouds; harden and audit SQL Server 2012 environments; administer security and authorization, including new Contained Databases; encrypt data and communications; design and deploy new AlwaysOn high-availability and disaster recovery features; and implement maintenance best practices, including Policy-Based Management.

XML in Scientific Computing

This book introduces XML to scientists and engineers in a way that illustrates the similarities and differences with traditional programming languages and suggests new ways of saving and sharing the results of scientific calculations. The author discusses XML in the context of scientific computing, demonstrates how XSL can be used to perform various calculations, and explains how to create and navigate through XML documents using traditional languages such as Fortran, C++, and MATLAB. A suite of computer programs is available on the author's website.

Microsoft® SQL Server® 2012 Integration Services

Build and manage data integration solutions with expert guidance from the Microsoft SQL Server Integration Services (SSIS) team. See best practices in action and dive deep into the SSIS engine, SSISDB catalog, and security features. Using the developer enhancements in SQL Server 2012 and the flexible SSIS toolset, you’ll handle complex data integration scenarios more efficiently and acquire the skills you need to build comprehensive solutions. Discover how to: use SSIS to extract, transform, and load data from multiple data sources; apply best practices to optimize package and project configuration and deployment; manage security settings in the SSISDB catalog and control package access; work with SSIS data quality features to profile, cleanse, and increase reliability; monitor, troubleshoot, and tune SSIS solutions with advanced features such as detailed views and data taps; and load data incrementally to capture an easily consumable stream of insert, update, and delete activity.

Pro SQL Server 2012 BI Solutions

Business intelligence projects do not need to cost multi-millions of dollars or take months or even years to complete! Using rapid application development (RAD) techniques along with Microsoft SQL Server 2012, this book guides database administrators, SQL programmers, and report specialists in creating practical, cost-effective business intelligence solutions for their companies and departments. Pro SQL Server 2012 BI Solutions provides practical examples of cost-effective business intelligence projects. Readers will be guided through several complete projects that build a foundation for real-world solutions. Even with limited experience using Microsoft's SQL Server, Integration Server, Analysis Server, and Reporting Server, you can leverage your existing knowledge of SQL programming and database design to provide users with the business intelligence reports they need. The book provides recipes for multiple business intelligence scenarios, progresses from simple to advanced projects using several examples, and shows Microsoft SQL Server technology used to complete real-world business intelligence projects. What you'll learn: how to plan and implement cost-effective business intelligence projects; how to create practical data warehouse databases; how to extract, transform, and load data with Integration Services; how to develop OLAP cubes and dimensions on Analysis Server; how to create Reporting Server reports using both SQL and MDX; how to apply performance-tuning techniques to get the most from your solutions. Who this book is for: Pro SQL Server 2012 BI Solutions is aimed at database administrators, SQL programmers, and report developers who create business intelligence solutions for midsized businesses and departments.

Loss Models: From Data to Decisions, 4th Edition

Praise for the Third Edition: "This book provides in-depth coverage of modelling techniques used throughout many branches of actuarial science. . . . The exceptional high standard of this book has made it a pleasure to read." --Annals of Actuarial Science. Newly organized to focus exclusively on material tested in the Society of Actuaries' Exam C and the Casualty Actuarial Society's Exam 4, Loss Models: From Data to Decisions, Fourth Edition continues to supply actuaries with a practical approach to the key concepts and techniques needed on the job. With updated material and extensive examples, the book successfully provides the essential methods for using available data to construct models for the frequency and severity of future adverse outcomes. The book continues to equip readers with the tools needed for the construction and analysis of mathematical models that describe the process by which funds flow into and out of an insurance system. Focusing on the loss process, the authors explore key quantitative techniques including random variables, basic distributional quantities, and the recursive method, and discuss techniques for classifying and creating distributions. Parametric, non-parametric, and Bayesian estimation methods are thoroughly covered along with advice for choosing an appropriate model. New features of this Fourth Edition include: expanded discussion of working with large data sets, now including more practical elements of constructing decrement tables; added coverage of methods for simulating several special situations; and an updated presentation of Bayesian estimation, outlining conjugate prior distributions and the linear exponential family as well as related computational issues. Throughout the book, numerous examples showcase the real-world applications of the presented concepts, with an emphasis on calculations and spreadsheet implementation.
A wealth of new exercises taken from previous Exam C/4 exams allows readers to test their comprehension of the material, and a related FTP site features the book's data sets. Loss Models, Fourth Edition is an indispensable resource for students and aspiring actuaries who are preparing to take the SOA and CAS examinations. The book is also a valuable reference for professional actuaries, actuarial students, and anyone who works with loss and risk models. To explore our additional offerings in actuarial exam preparation visit www.wiley.com/go/c4actuarial.

Indexing

Indexing consists of both novel and more traditional techniques. Cutting-edge indexing techniques, such as automatic indexing, ontologies, and topic maps, were developed independently of older techniques such as thesauri, but it is now recognized that these older methods also hold expertise. Indexing describes various traditional and novel indexing techniques, giving information professionals and students of library and information sciences a broad and comprehensible introduction to indexing. This title consists of twelve chapters: an Introduction to subject headings and thesauri; Automatic indexing versus manual indexing; Techniques applied in automatic indexing of text material; Automatic indexing of images; The black art of indexing moving images; Automatic indexing of music; Taxonomies and ontologies; Metadata formats and indexing; Tagging; Topic maps; Indexing the web; and The Semantic Web. The book makes difficult and complex techniques understandable, contains many links to and illustrations from websites where new indexing techniques can be experienced, and provides references for further reading.

Building Smarter Planet Solutions with MQTT and IBM WebSphere MQ Telemetry

MQ Telemetry Transport (MQTT) is a messaging protocol that is lightweight enough to be supported by the smallest devices, yet robust enough to ensure that important messages get to their destinations every time. With MQTT, devices such as smart energy meters, cars, trains, satellite receivers, and personal health care devices can communicate with each other and with other systems or applications. This IBM® Redbooks® publication introduces MQTT and takes a scenario-based approach to demonstrate its capabilities. It provides a quick guide to getting started and then shows how to grow to an enterprise-scale MQTT server using IBM WebSphere® MQ Telemetry. Scenarios demonstrate how to integrate MQTT with other IBM products, including WebSphere Message Broker. This book also provides typical usage patterns and guidance on scaling a solution. The intended audience for this book ranges from new users of MQTT and telemetry to those readers who are looking for in-depth knowledge and advanced topics. Please note that the additional material referenced in the text is not available from IBM.

Delivering Continuity and Extreme Capacity with the IBM DB2 pureScale Feature

The IBM® DB2® pureScale® feature offers clustering technology that helps deliver high availability and exceptional scalability transparent to applications. The DB2 pureScale feature helps you to meet your business needs around availability and scalability, and is also easy to configure and administer. This IBM Redbooks® publication addresses the DB2 pureScale feature that is available in IBM DB2 10.1 for Linux, UNIX, and Windows operating systems. It can help you build skills and deploy the DB2 pureScale feature. This book bundles all the information necessary for an in-depth analysis of the functions of the DB2 pureScale feature, including the actual hardware requirements. It includes validated step-by-step hardware and software installation instructions. In addition, this book provides detailed examples about how to work effectively with a DB2 pureScale cluster and how to plan and run an upgrade for all DB2-related components to DB2 10.1. This book is intended for database administrators (DBAs) who use IBM DB2 10.1 for Linux, UNIX, and Windows operating systems and who want to explore and get started with the DB2 pureScale feature.

SQL Server 2012 Integration Services Design Patterns

SQL Server 2012 Integration Services Design Patterns is a book of recipes for SQL Server Integration Services (SSIS). Design patterns in the book show how to solve common problems encountered when developing data integration solutions. Because you do not have to build the code from scratch each time, using design patterns improves your efficiency as an SSIS developer. In SSIS Design Patterns, we take you through several of these snippets in detail, providing the technical details of the resolution. SQL Server 2012 Integration Services Design Patterns does not focus on the problems to be solved; instead, the book delves into why particular problems should be solved in certain ways. You'll learn more about SSIS as a result, and you'll learn by practical example. Where appropriate, SQL Server 2012 Integration Services Design Patterns provides examples of alternative patterns and discusses when and where they should be used. Highlights of the book include sections on ETL Instrumentation, SSIS Frameworks, and Dependency Services. The book takes you through solutions to several common data integration challenges, demonstrates new features in SQL Server 2012 Integration Services, and teaches SSIS using practical examples. What you'll learn: load data from flat file formats; explore patterns for executing SSIS packages; discover a pattern for loading XML data; migrate SSIS packages through your application lifecycle without editing connections; take advantage of SSIS 2012 Dependency Services; build an SSIS Framework to support your application needs. Who this book is for: SQL Server 2012 Integration Services Design Patterns is for the data integration developer who is ready to take their SQL Server Integration Services (SSIS) skills to a more efficient level. It's for the developer interested in locating a previously tested solution quickly.
SQL Server 2012 Integration Services Design Patterns is a great book for ETL (extract, transform, and load) specialists and those seeking practical uses for new features in SQL Server 2012 Integration Services. It's an excellent choice for business intelligence and data warehouse developers.

PeopleSoft PeopleTools Data Management and Upgrade Handbook

Master Oracle’s PeopleSoft PeopleTools Data Administration and Upgrade Toolset
Maximize data integrity, maintain peak application performance, and keep your PeopleSoft environment up to date. PeopleSoft PeopleTools Data Management and Upgrade Handbook explains the latest techniques and provides best practices, case studies, and programming examples. Find out how to develop and deploy data mover scripts, audit database health, apply patches, and generate project comparison reports. This Oracle Press guide offers thorough coverage of Oracle’s PeopleSoft life cycle management tools.
Understand PeopleSoft PeopleTools architecture
Manage metadata using PeopleSoft Application Designer
Create and execute PeopleSoft Data Mover scripts
Learn how to use PeopleSoft Data Archive Manager
Read best practices for applying updates, patches, and fixes
Implement PeopleSoft change control features
Build automated functional tests
Plan and execute application and platform upgrades

Generalized Linear and Nonlinear Models for Correlated Data: Theory and Applications Using SAS

Edward F. Vonesh's Generalized Linear and Nonlinear Models for Correlated Data: Theory and Applications Using SAS is devoted to the analysis of correlated response data using SAS, with special emphasis on applications that require the use of generalized linear models or generalized nonlinear models. Written in a clear, easy-to-understand manner, it provides applied statisticians with the necessary theory, tools, and understanding to conduct complex analyses of continuous and/or discrete correlated data in a longitudinal or clustered data setting. Using numerous complex examples, the book emphasizes real-world applications where the underlying model requires a nonlinear rather than linear formulation, and it compares and contrasts the various estimation techniques for both marginal and mixed-effects models. The SAS procedures MIXED, GENMOD, GLIMMIX, and NLMIXED, as well as user-specified macros, are used extensively in these applications. In addition, the book provides detailed software code with most examples so that readers can begin applying the various techniques immediately.

Getting Started with Storm

Even as big data is turning the world upside down, the next phase of the revolution is already taking shape: real-time data analysis. This hands-on guide introduces you to Storm, a distributed, JVM-based system for processing streaming data. Through simple tutorials, sample Java code, and a complete real-world scenario, you’ll learn how to build fast, fault-tolerant solutions that process results as soon as the data arrives. Discover how easy it is to set up Storm clusters for solving various problems, including continuous data computation, distributed remote procedure calls, and data stream processing.
Learn how to program Storm components: spouts for data input and bolts for data transformation
Discover how data is exchanged between spouts and bolts in a Storm topology
Make spouts fault-tolerant with several commonly used design strategies
Explore bolts: their life cycle, strategies for design, and ways to implement them
Scale your solution by defining each component’s level of parallelism
Study a real-time web analytics system built with Node.js, a Redis server, and a Storm topology
Write spouts and bolts with non-JVM languages such as Python, Ruby, and JavaScript

IBM ProtecTIER Implementation and Best Practices Guide

The IBM® System Storage® TS7650G ProtecTIER® Deduplication Gateway and the IBM System Storage TS7650 ProtecTIER Deduplication Appliance are disk-based data storage systems that can be configured for three available interfaces:
The Virtual Tape Library (VTL) interface is the foundation of ProtecTIER and emulates traditional automated tape libraries.
The Symantec NetBackup OpenStorage (OST) API can be integrated with Symantec NetBackup to provide backup-to-disk without having to emulate traditional tape libraries.
The newly available File System Interface (FSI) supports Common Internet File System (CIFS) as a backup target.
For your existing ProtecTIER solution, we provide best practices and suggestions to boost the performance and effectiveness of data deduplication with regard to your application platforms for your VTL, OST, and FSI systems. When building a new ProtecTIER data deduplication environment, this publication helps IT architects and solution designers plan the best option and scenario for data deduplication in their environments. This guide helps you optimize your deduplication ratio while reducing hardware, power and cooling, and management costs.