talk-data.com talk-data.com

Topic

data-engineering

3377

tagged

Activity Trend

1 peak/qtr
2020-Q1 2026-Q1

Activities

Showing filtered results

Filtering by: O'Reilly Data Engineering Books ×
IBM System Storage DS Storage Manager Copy Services Guide

The purpose of this IBM® Redbooks® publication is to provide customers with guidance and recommendations for how and when to use the IBM System Storage® Copy Services premium features. The topics discussed in this publication apply to the IBM System Storage DS® models DS3000, DS4000®, and DS5000 running the firmware v7.70, and IBM System Storage DS Storage Manager v10.70. Customers in today’s IT world are finding a major need to ensure a good archive of their data and a requirement to create these archives with minimal interruptions. The IBM Midrange System Storage helps to fulfill these requirements by offering three copy services premium features: IBM FlashCopy® VolumeCopy Enhanced Remote Mirroring (ERM) This publication specifically addresses the copy services premium features and can be used in conjunction with the following IBM DS System Storage books: IBM System Storage DS4000 and Storage Manager V10.30, SG24-7010 IBM System Storage DS3000: Introduction and Implementation Guide, SG24-7065 IBM System Storage DS3500: Introduction and Implementation Guide, SG24-7914 IBM Midrange System Storage Hardware Guide, SG24-7676 IBM Midrange System Storage Implementation and Best Practices Guide, SG24-6363

Scaling MongoDB

Create a MongoDB cluster that will to grow to meet the needs of your application. With this short and concise book, you'll get guidelines for setting up and using clusters to store a large volume of data, and learn how to access the data efficiently. In the process, you'll understand how to make your application work with a distributed database system. Scaling MongoDB will help you: Set up a MongoDB cluster through sharding Work with a cluster to query and update data Operate, monitor, and backup your cluster Plan your application to deal with outages By following the advice in this book, you'll be well on your way to building and running an efficient, predictable distributed system using MongoDB.

Database Modeling and Design, 5th Edition

Database Modeling and Design, Fifth Edition, focuses on techniques for database design in relational database systems. This extensively revised fifth edition features clear explanations, lots of terrific examples and an illustrative case, and practical advice, with design rules that are applicable to any SQL-based system. The common examples are based on real-life experiences and have been thoroughly class-tested. This book is immediately useful to anyone tasked with the creation of data models for the integration of large-scale enterprise data. It is ideal for a stand-alone data management course focused on logical database design, or a supplement to an introductory text for introductory database management. In-depth detail and plenty of real-world, practical examples throughout Loaded with design rules and illustrative case studies that are applicable to any SQL, UML, or XML-based system Immediately useful to anyone tasked with the creation of data models for the integration of large-scale enterprise data

Automated Physical Database Design and Tuning

Due to the increasing complexity in application workloads and query engines, database administrators are turning to automated tuning tools that systematically explore the space of physical design alternatives. A critical element of such tuning is physical database design since the choice of physical structures has a significant impact on the performance of the database system. Automated Physical Database Design and Tuning presents a detailed overview of the fundamental ideas and algorithms for automatically recommending changes to the physical design of a database system. The first part of the book introduces the necessary technical background. The author explains SQL, the space of execution plans for answering SQL queries, query optimization, how the choice of access paths (e.g., indexes) is crucial to performance, and the complexity of the physical design problem. The second part extensively discusses automated physical design techniques, covering fundamental research ideas in the last 15 years that have resulted in a new generation of tuning tools. The text focuses on the search space of alternatives, the necessity of a cost model to compare such alternatives, different mechanisms to traverse and enumerate the search space, and practical aspects in real-world tuning tools. In the third part, the author explores new advances in automated physical design. He applies previous approaches to other physical structures, such as materialized views, partitioning, and multidimensional clustering. He also analyzes workload models for new types of applications, generalizes the optimizing function of current physical design tools to cope with other application scenarios, and examines open-ended challenges in physical database design. This book offers valuable insights on well-established principles and cutting-edge research results in automated physical design. It helps readers gain a deeper understanding of how automated tuning tools work in database installations as well as the challenges and opportunities involved in designing next-generation tuning tools.

Developing High Quality Data Models

Developing High Quality Data Models provides an introduction to the key principles of data modeling. It explains the purpose of data models in both developing an Enterprise Architecture and in supporting Information Quality; common problems in data model development; and how to develop high quality data models, in particular conceptual, integration, and enterprise data models. The book is organized into four parts. Part 1 provides an overview of data models and data modeling including the basics of data model notation; types and uses of data models; and the place of data models in enterprise architecture. Part 2 introduces some general principles for data models, including principles for developing ontologically based data models; and applications of the principles for attributes, relationship types, and entity types. Part 3 presents an ontological framework for developing consistent data models. Part 4 provides the full data model that has been in development throughout the book. The model was created using Jotne EPM Technologys EDMVisualExpress data modeling tool. This book was designed for all types of modelers: from those who understand data modeling basics but are just starting to learn about data modeling in practice, through to experienced data modelers seeking to expand their knowledge and skills and solve some of the more challenging problems of data modeling. Uses a number of common data model patterns to explain how to develop data models over a wide scope in a way that is consistent and of high quality Offers generic data model templates that are reusable in many applications and are fundamental for developing more specific templates Develops ideas for creating consistent approaches to high quality data models

Extremely pureXML in DB2 10 for z/OS

The DB2® pureXML® feature offers sophisticated capabilities to store, process and manage XML data in its native hierarchical format. By integrating XML data intact into a relational database structure, users can take full advantage of DB2's relational data management features. In this IBM® Redbooks® publication, we document the steps for the implementation of a simple but meaningful XML application scenario. We have chosen to provide samples in COBOL and Java™ language. The purpose is to provide an easy path to follow to integrate the XML data type for the traditional DB2 for z/OS® user. We also add considerations for the data administrator and suggest best practices for ease of use and better performance.

Mastering XPages: A Step-by-Step Guide to XPages Application Development and the XSP Language

The first complete, practical guide to XPages development - direct from members of the XPages development team at IBM Lotus Martin Donnelly, Mark Wallace, and Tony McGuckin have written the definitive programmer's guide to utilizing this breakthrough technology. Packed with tips, tricks, and best practices from IBM's own XPages developers, Mastering XPages brings together all the information developers need to become experts - whether you’re experienced with Notes/Domino development or not. The authors start from the very beginning, helping developers steadily build your expertise through practical code examples and clear, complete explanations. Readers will work through scores of real-world XPages examples, learning cutting-edge XPages and XSP language skills and gaining deep insight into the entire development process. Drawing on their own experience working directly with XPages users and customers, the authors illuminate both the technology and how it can be applied to solving real business problems. Martin Donnelly previously led a software startup that developed and distributed small business accounting software. Donnelly holds a Commerce degree from University College Cork and an M.S. in Computer Science from Boston University. Mark Wallace has worked at IBM for 15 years on many projects as a technical architect and application developer. Tony McGuckin participates in the Lotus OneUI Web Application and iWidget Adoption Workgroup. He holds a bachelor's degree in Software Engineering from the University of Ulster.

Tivoli Integration Scenarios

This IBM® Redbooks® publication provides a broad view of how Tivoli® system management products work together in several common scenarios. You must achieve seamless integration for operations personnel to work with the solution. This integration is necessary to ensure that the product can be used easily by the users. Product integration contains multiple dimensions, such as security, navigation, data and task integrations. Within the context of the scenarios in this book, you see examples of these integrations. The scenarios implemented in this book are largely based on the input from the integration team, and several clients using IBM products. We based these scenarios on common real-life examples that IT operations often have to deal with. Of course, these scenarios are only a small subset of the possible integration scenarios that can be accomplished by the Tivoli products, but they were chosen to be representative of the integration possibilities using the Tivoli products. We discuss these implementations and benefits that are realized by these integrations, and also provide sample scenarios of how these integrations work. This book is a reference guide for IT architects and IT specialists working on integrating Tivoli products in real-life environments.

Internet Searches for Vetting, Investigations, and Open-Source Intelligence

In the information age, it is critical that we understand the implications and exposure of the activities and data documented on the Internet. Improved efficiencies and the added capabilities of instant communication, high-speed connectivity to browsers, search engines, websites, databases, indexing, searching and analytical applications have made information technology (IT) and the Internet a vital issued for public and private enterprises. The downside is that this increased level of complexity and vulnerability presents a daunting challenge for enterprise and personal security. Internet Searches for Vetting, Investigations, and Open-Source Intelligence provides an understanding of the implications of the activities and data documented by individuals on the Internet. It delineates a much-needed framework for the responsible collection and use of the Internet for intelligence, investigation, vetting, and open-source information. This book makes a compelling case for action as well as reviews relevant laws, regulations, and rulings as they pertain to Internet crimes, misbehaviors, and individuals’ privacy. Exploring technologies such as social media and aggregate information services, the author outlines the techniques and skills that can be used to leverage the capabilities of networked systems on the Internet and find critically important data to complete an up-to-date picture of people, employees, entities, and their activities. Outlining appropriate adoption of legal, policy, and procedural principles—and emphasizing the careful and appropriate use of Internet searching within the law—the book includes coverage of cases, privacy issues, and solutions for common problems encountered in Internet searching practice and information usage, from internal and external threats. The book is a valuable resource on how to utilize open-source, online sources to gather important information and screen and vet employees, prospective employees, corporate partners, and vendors.

Using Google Maps™ and Google Earth™

Using Google™ Maps and Google Earth is more than just a book: it’s the fastest, easiest way to master Google’s amazing mapping applications! Don’t just “read” about it: see it, hear it, live it, with step-by-step screencasts and expert audio tips. Discover how to map your favorite places with Google Maps…see actual locations with Street View…generate driving, walking, and public transit directions…find and learn more about businesses…create and share custom maps and mashups…use Google Maps on iPhone…navigate Google Earth to find locations fast…create life-like Google Earth roadmaps, and tour your route…even explore Google Sky, Google Moon, and Google Earth’s Flight Simulator! Exclusive online Show Me video walks through tasks you’ve just got to see… Tell Me More audio delivers practical, “straight from the experts” insights… Point-Counterpoint audio compares alternative solutions—so you can pick the one that’s best for you. It’s all the help you’ll ever need…where you want it, when you want it!

Camel in Action

NEWER EDITION AVAILABLE Camel in Action, Second Edition is now available. An eBook of this older edition is included at no additional cost when you buy the revised edition! A limited number of pBook copies of this edition are still available. Please contact Manning Support to inquire about purchasing previous edition copies. Camel in Action is a Camel tutorial full of small examples showing how to work with the integration patterns. It starts with core concepts like sending, receiving, routing, and transforming data. It then shows you the entire lifecycle and goes in depth on how to test, deal with errors, scale, deploy, and even monitor your app—details you can find only in the Camel code itself. Written by the developers of Camel, this book distills their experience and practical insights so that you can tackle integration tasks like a pro. About the Technology Apache Camel is a Java framework that lets you implement the standard enterprise integration patterns in a few lines of code. With a concise but sophisticated DSL you snap integration logic into your app, Lego-style, using Java, XML, or Scala. Camel supports over 80 common transports such as HTTP, REST, JMS, and Web Services. About the Book What's Inside Valuable examples in Java and XML Explanations of complex patterns Error handling, testing, deploying, managing, and running Camel Accessible to beginners, useful to experts About the Reader About the Authors Claus Ibsen is a principal engineer working for FuseSource specializing in the enterprise integration space. He has worked on Apache Camel for the last three years where he is a a PMC member, a key contributor, and heads the development and roadmap. Claus lives in Sweden near Malmo with his wife and dog. Jonathan Anstey is a software engineer with varied experience in manufacturing control systems, build infrastructure, and enterprise integration. Lately, Jon has been working on Apache Camel as a PMC member and an active committer while at FuseSource. When he is not hacking on Camel he likes to spend time with his wife and daughter in St. John's, Newfoundland. Quotes I highly recommend this book. It kicks ass! - James Strachan, Cofounder of Apache Camel Strikes the right balance between core concepts and running code. - Gregor Hohpe, Coauthor of Enterprise Integration Patterns Comprehensive guide to enterprise integration with Camel. - Gordon Dickens, Chariot Solutions A deep book... with great examples. - Jeroen Benckhuijsen, Atos Origin Great content from the source developers. - Domingo Suarez Torres, SynergyJ A must-have. - Tijs Rademakers, Atos Origin

Microsoft® Access® 2010 In Depth

Access 2010 In Depth is the beyond-the-basics, beneath-the-surface guide for everyone who wants to streamline their work with Access 2010, and get more done in less time. Legendary Access expert Roger Jennings provides specific, tested, proven solutions to the problems Access database users and developers run into every day: challenges other books ignore or oversimplify. Jennings thoroughly covers all facets of working with Access 2010, and adds new chapters on integration and collaboration with Microsoft SharePoint and emulating Table Triggers with Access Data Macros. New coverage also includes: customizable Ribbon and Themes; the revamped Macro Designer; Quick Start Fields; IntelliSense support in Expression Builder; Application Parts and Navigation Forms; conditional formatting and data bars in reports; and new web database publishing techniques. As with all In Depth books, Access 2010 In Depth presents comprehensive coverage, breakthrough techniques, exclusive shortcuts, quick access to information, troubleshooting help for tough problems, and real-world examples with nothing glossed over or left out. Step-by-step instructions with icons guide readers through essential tasks such as designing tables, entering data, importing external data, designing and executing queries, and designing data entry forms and printed reports. Additional chapters on advanced form and report design emphasize data entry efficiency and presentation clarity. By Roger Jennings, a world-renowned Access expert who has sold more than 1.25 million books! Covers all aspects of working with Access 2010, from the customizable Ribbon and Themes to conditional formatting, advanced web database publishing to SharePoint collaboration For everyone who wants to get the most out of Access 2010, from hobbyists to power users to corporate developers

Data Integration Blueprint and Modeling: Techniques for a Scalable and Sustainable Architecture

Making Data Integration Work: How to Systematically Reduce Cost, Improve Quality, and Enhance Effectiveness Today’s enterprises are investing massive resources in data integration. Many possess thousands of point-to-point data integration applications that are costly, undocumented, and difficult to maintain. Data integration now accounts for a major part of the expense and risk of typical data warehousing and business intelligence projects--and, as businesses increasingly rely on analytics, the need for a blueprint for data integration is increasing now more than ever. This book presents the solution: a clear, consistent approach to defining, designing, and building data integration components to reduce cost, simplify management, enhance quality, and improve effectiveness. Leading IBM data management expert Tony Giordano brings together best practices for architecture, design, and methodology, and shows how to do the disciplined work of getting data integration right. Mr. Giordano begins with an overview of the “patterns” of data integration, showing how to build blueprints that smoothly handle both operational and analytic data integration. Next, he walks through the entire project lifecycle, explaining each phase, activity, task, and deliverable through a complete case study. Finally, he shows how to integrate data integration with other information management disciplines, from data governance to metadata. The book’s appendices bring together key principles, detailed models, and a complete data integration glossary. Coverage includes Implementing repeatable, efficient, and well-documented processes for integrating data Lowering costs and improving quality by eliminating unnecessary or duplicative data integrations Managing the high levels of complexity associated with integrating business and technical data Using intuitive graphical design techniques for more effective process and data integration modeling Building end-to-end data integration applications that bring together many complex data sources

Pro Oracle Database 11g Administration

Pro Oracle Database 11g Administration is a book focused on results. Author Darl Kuhn draws from a well of experience over a decade deep to lay out real-world techniques that lead to success as an Oracle DBA. He gives clear explanations on how to perform critical tasks. He weaves in theory where necessary without bogging you down in unneeded detail. He is not afraid to take a stand on how things should be done. He won't leave you adrift in a sea of choices, showing you three ways to do something and then walking away. Database administration isn't about passing a certified exam, or about pointing-and-clicking your way through a crisis. Database administration is about applying the right solution at the right time, about avoiding risk, about making robust choices that get you home each night in time for dinner with your family. If you have "buck stops here" responsibility for an Oracle database, then Pro Oracle Database 11g Administration is the book you need to help elevate yourself to the level of Professional Oracle Database Administrator. Condenses and organizes the core job of a database administrator into one volume. Takes a results-oriented approach to getting things done. Lays a foundation upon which to build a senior level of expertise

Securing SQL Server

Securing SQL Server: Protecting Your Database from Attackers provides readers with the necessary tools and techniques to help maintain the security of databases within their environment. It begins with a discussion of network security issues, including public versus private IP addresses; accessing an SQL server from home; physical security; and testing network security. The remaining chapters cover database encryption; SQL password security; SQL injection attacks; database backup security; security auditing; and server rights. The Appendix features checklists that database administrators can use to pass external audits. Named a 2011 Systems Administration Book by InfoSec Reviews Author Denny Cherry is an MVP by Microsoft for his expertise in the SQL Server product Learn expert techniques to protect your SQL database environment Discover how to identify what an intruder accessed or damaged

IBM AIX Version 7.1 Differences Guide

This IBM® Redbooks® publication focuses on the enhancements to IBM AIX® Version 7.1 Standard Edition. It is intended to help system administrators, developers, and users understand these enhancements and evaluate potential benefits in their own environments. AIX Version 7.1 introduces many new features, including: - Domain Role Based Access Control - Workload Partition enhancements - Topas performance tool enhancements - Terabyte segment support - Cluster Aware AIX functionality AIX Version 7.1 offers many other new enhancements, and you can explore them all in this publication. For clients who are not familiar with the enhancements of AIX through Version 5.3, a companion publication, AIX Version 6.1 Differences Guide, SG24-7559, is available.

Oracle Streams 11g Data Replication

Master Oracle Streams 11 g Replication Enable real-time information access and data sharing across your distributed framework using the expert information in this Oracle Press guide. Oracle Streams 11g Data Replication explains how to set up and administer a unified enterprise data sharing infrastructure. Learn how to capture, propagate, and apply database changes, transform data, and handle data conflicts. Monitoring, optimizing, and troubleshooting techniques are also covered in this comprehensive volume. Understand Oracle Streams components and architecture Gain in-depth knowledge about capturing, propagating, and applying data manipulation language (DML) and data definition language (DDL) changes Learn how to access and modify the contents of Logical Change Records Build custom procedures for data transformations Configure Oracle Streams replication for the database, schemas, and tables Tune Oracle Streams performance for improved throughput Manage and monitor Oracle Streams using Oracle Enterprise Manager Grid Control Learn from several practical examples and scripts

IBM CICS Explorer

IBM® Customer Information Control System (CICS®) Explorer is the new face of CICS Integration point for CICS tooling with rich CICS views, data, and methods. Are you looking for new ways to accelerate the transfer of knowledge, skills, and best practices to the next generation of technical staff and experts? Do you need to maintain productivity and protect service-levels? CICS Explorer™ and System z® lead the way to platform simplification. IBM CICS Explorer has a common, intuitive, Eclipse-based environment for architects, developers, administrators, system programmers, and operators. The task-oriented views provide integrated access to a broad range of data and control capabilities, and it also has powerful, context-sensitive resource editors. Integration point for CICS TS, CICS Tools, CICS TG, PD Tools, and Rational® Tools are extensible by independent software vendors (ISV), system integrators (SI), and customers who use our Software Development Kit. In this IBM Redbooks® publication, we focus on the new CICS Explorer. The first part of the book is an overview of the CICS Explorer along with all of the CICS Tools’ plug-ins. In the second part of the book, we focus on several scenarios in which you can use the CICS Explorer with the CICS Tools plug-ins to resolve various problems.

Oracle and PL/SQL Recipes: A Problem-Solution Approach

Oracle PL/SQL Recipes is your go to book for PL/SQL programming solutions. It takes a task-oriented approach to PL/SQL programming that lets you quickly look up a specific task and see the pattern for a solution. Then it's as simple as modifying the pattern for your specific application and implementing it. And you're done and home for dinner. Oracle PL/SQL Recipes is another in Apress' ongoing series of recipe books aimed at Oracle practitioners. The recipe format is ideal for the busy professional who just needs to get the job done. Covers the most common PL/SQL programming problems Presents solutions in ready-to-use format Stays short and to-the-point What you'll learn Move business logic into the database Implement robust exception-handling Optimize and debug, taking advantage of Oracle SQL Developer Integrate PL/SQL with web-based applications Interface with Java code and dynamic languages in the database, and running externally Run the UTPLSQL unit-testing framework Who this book is for Oracle PL/SQL Recipes is database administrators and developers wanting to take advantage of Oracle Database's built-in support for procedural logic. Database administrators can use PL/SQL to automate administrative tasks, audit access to sensitive data, and more. Developers will be able to optimize processing, push business logic into the database layer, and interface with Java and web applications.

Hadoop in Action

Hadoop in Action introduces the subject and teaches you how to write programs in the MapReduce style. It starts with a few easy examples and then moves quickly to show Hadoop use in more complex data analysis tasks. Included are best practices and design patterns of MapReduce programming. About the Technology Big data can be difficult to handle using traditional databases. Apache Hadoop is a NoSQL applications framework that runs on distributed clusters. This lets it scale to huge datasets. If you need analytic information from your data, Hadoop's the way to go. About the Book What's Inside Introduction to MapReduce Examples illustrating ideas in practice Hadoop's Streaming API Other related tools, like Pig and Hive About the Reader This book requires basic Java skills. Knowing basic statistical concepts can help with the more advanced examples. About the Author Chuck Lam is a Senior Engineer at RockYou! He has a PhD in pattern recognition from Stanford University. Quotes A guide for beginners, a source of insight for advanced users. - Philipp K. Janert, Principal Value, LLC A nice mix of the what, why, and how of Hadoop. - Paul Stusiak, Falcon Technologies Corp. Demystifies Hadoop. A great resource! - Rick Wagner, Acxiom Corp. Covers it all! Plus, gives you sweet extras no one else does. - John S. Griffin, Overstock.com An excellent introduction to Hadoop and MapReduce. - Kenneth DeLong, BabyCenter, LLC