talk-data.com talk-data.com

Topic

data-engineering

3377

tagged

Activity Trend

1 peak/qtr
2020-Q1 2026-Q1

Activities

Showing filtered results

Filtering by: O'Reilly Data Engineering Books ×
What's New in SQL Server 2012

SQL Server 2012 introduces a wealth of new features and enhancements that database professionals need to master to stay ahead in the ever-evolving industry. This book provides a practical guide to upgrading your knowledge with the latest advancements, from T-SQL improvements to new Business Intelligence tools and cloud capabilities. What this Book will help me do Understand and install the core and advanced features of SQL Server 2012 effectively. Implement new SQL Server Management Studio features for enhanced administration. Utilize Business Intelligence Semantic Models for insightful data analysis. Execute data cleansing projects using Data Quality Services (DQS). Simulate real-world database loads using Distributed Replay for testing purposes. Author(s) The author is an experienced database administrator and SQL Server expert with a career spanning over two decades. With hands-on experience in implementation, administration, and optimization of complex SQL Server environments, the author brings a wealth of practical knowledge to this book. Their approach is to provide concise, actionable insights tailored to the needs of IT professionals. Who is it for? This book is tailored for database administrators, developers, and BI professionals familiar with SQL Server 2008 R2 seeking to efficiently upgrade to SQL Server 2012. If you aim to quickly adopt and utilize the new features and improvements in SQL Server 2012, this book provides the clear and focused learning path you need.

IBM Power Systems 775 for AIX and Linux HPC Solution

This IBM® Redbooks® publication contains information about the IBM Power Systems™ 775 Supercomputer solution for AIX® and Linux HPC customers. This publication provides details about how to plan, configure, maintain, and run HPC workloads in this environment. This IBM Redbooks document is targeted to current and future users of the IBM Power Systems 775 Supercomputer (consultants, IT architects, support staff, and IT specialists) responsible for delivering and implementing IBM Power Systems 775 clustering solutions for their enterprise high-performance computing applications.

Microsoft SQL Server 2012 Master Data Services 2/E, 2nd Edition

Deploy and Maintain an Integrated MDS Architecture Harness your master data and grow revenue while reducing administrative costs. Thoroughly revised to cover the latest MDS features, Microsoft SQL Server 2012 Master Data Services, Second Edition shows how to implement and manage a centralized, customer-focused MDS framework. See how to accurately model business processes, load and cleanse data, enforce business rules, eliminate redundancies, and publish data to external systems. Security, SOA and Web services, and legacy data integration are also covered in this practical guide. Install Microsoft SQL Server 2012 Master Data Services Build custom MDS models and entityspecific staging tables Load and cleanse data from disparate sources Logically group assets into collections and hierarchies Ensure integrity using versioning and business rules Configure security at functional, object, and attribute levels Extend functionality with SOA and Web services Facilitate collaboration using the MDS Excel Add-In Export data to subscribing systems through SQL views

DB2 SQL Tuning Tips for z/OS Developers

The Definitive Solutions-Oriented Guide to IBM® DB2® for z/OS®: Now Fully Updated for Both v9 and v10! The largest database tuning performance gains can often be obtained from tuning application code, and applications that use SQL to retrieve data are the best candidates for tuning. This well-organized, easy-to-understand reference brings together more than 100 SQL-related skills and techniques that any developer can use to build and optimize DB2 applications for consistently superior performance. DB2 tuning expert Tony Andrews (“Tony the Tuner”) draws on more than 20 years of DB2-related experience, empowering you to take performance into your own hands, whether you’re writing new software or tuning existing systems. Tony shows you exactly how to clear bottlenecks, resolve problems, and improve both speed and reliability. This book fully reflects the latest SQL programming best practices for DB2 V9 and DB2 V10 on z/OS: techniques that are taught in no other book and are rarely covered in general DB2 SQL courses. Drawing on his extensive consulting experience and highly praised training with Themis Inc., Tony also presents practical checklists and an invaluable 15-step methodology for optimizing virtually any DB2 application. Coverage includes Empowering developers on knowing what to do and where to look in resolving performance problems in queries or programs Providing many programming and SQL coding examples Establishing standards and guidelines that lead to high-performance SQL Implementing time-efficient code walkthroughs to ensure that your standards are followed Focusing on the small number of SQL statements that consume the most resources Identifying simple solutions that deliver the most sizable benefits Optimizing performance by rewriting query predicates more efficiently Providing a better understanding of SQL optimization and Runstat statistics Recognizing opportunities to tweak your code more effectively than the Optimizer Optimizing SQL code with COBOL applications Efficiently checking for the existence of data, rows, or tables Using Runstats’ newest capabilities to consistently optimize paths to data

IBM PowerVM Best Practices

This IBM® Redbooks® publication provides best practices for planning, installing, maintaining, and monitoring the IBM PowerVM® Enterprise Edition virtualization features on IBM POWER7® processor technology-based servers. PowerVM is a combination of hardware, PowerVM Hypervisor, and software, which includes other virtualization features, such as the Virtual I/O Server. This publication is intended for experienced IT specialists and IT architects who want to learn about PowerVM best practices, and focuses on the following topics: Planning and general best practices Installation, migration, and configuration Administration and maintenance Storage and networking Performance monitoring Security PowerVM advanced features

Implementing an Advanced Application Using Processes, Rules, Events, and Reports

In this IBM® Redbooks® publication we describe how to build an advanced business application from end to end. We use a fictional scenario to define the application, document the deployment methodology, and confirm the roles needed to support its development and deployment. Through step-by-step instructions you learn how to: In addition, we articulate a testing strategy using IBM Rational® Quality Manager and deployment options using IBM Workload Deployer. Please note that the additional material referenced in the text is not available from IBM.

Introduction to Surveillance Studies

Surveillance can be a potent tool for protecting resources and assets. It can also be extremely invasive, calling into question our basic rights to freedom and privacy. This volume explores technological trends, past- and present-day rationales for surveillance and surveillance devices, and current social issues surrounding them. It begins with a historical perspective on the evolution of surveillance technologies, then charts the development of modern-day devices. Enhanced with numerous photos, the book presents the pros and cons and some of the controversy of these increasingly sophisticated technologies, their collective impact, and what the future may hold.

50 Ways to Protect Your Identity in a Digital Age: New Financial Threats You Need to Know and How to Avoid Them, Second Edition

It is better to stop identity theft from happening in the first place than have to fix or repair the situation afterwards. Steve Weisman reveals the threats of new identity theft attacks based on use of Facebook, iPad, iPhone, Android, cloud apps, iPod, and other new technologies -- and shows you how to protect yourself, or how to fix the damage if you've already been attacked! Discover why ID theft is more dangerous than ever, and discover today's most dangerous new threats -- including attacks targeting medical records, personal finance and online banking sites, the elderly, and military service members. Meet the hackers and organized crime groups who want to steal your identity and money -- and learn how to protect your data and your life! Step by step, Weisman shows how to avoid risks, minimize risks you can't completely avoid, and immediately take the right steps if you're ever victimized. He objectively reviews new products and services that promise to fight identity theft, and previews emerging dangers, such as RFID credit cards. If you use a computer, the Internet, a smartphone, cell phone, tablet, or any other communications device, 50 Ways to Protect Your Identity in a Digital Age isn't just an indispensable wake-up call: it's the world's best resource for protecting yourself!

Optimization and Decision Support Design Guide: Using IBM ILOG Optimization Decision Manager

Today many organizations face challenges when developing a realistic plan or schedule that provides the best possible balance between customer service and revenue goals. Optimization technology has long been used to find the best solutions to complex planning and scheduling problems. A decision-support environment that enables the flexible exploration of all the trade-offs and sensitivities needs to provide the following capabilities: Flexibility to develop and compare realistic planning and scheduling scenarios Quality sensitivity analysis and explanations Collaborative planning and scenario sharing Decision recommendations This IBM® Redbooks® publication introduces you to the IBM ILOG® Optimization Decision Manager (ODM) Enterprise. This decision-support application provides the capabilities you need to take full advantage of optimization technology. Applications built with IBM ILOG ODM Enterprise can help users create, compare, and understand planning or scheduling scenarios. They can also adjust any of the model inputs or goals, and fully understanding the binding constraints, trade-offs, sensitivities, and business options. This book enables business analysts, architects, and administrators to design and use their own operational decision management solution. Please note that the additional material referenced in the text is not available from IBM.

Hadoop Operations

If you’ve been asked to maintain large and complex Hadoop clusters, this book is a must. Demand for operations-specific material has skyrocketed now that Hadoop is becoming the de facto standard for truly large-scale data processing in the data center. Eric Sammer, Principal Solution Architect at Cloudera, shows you the particulars of running Hadoop in production, from planning, installing, and configuring the system to providing ongoing maintenance. Rather than run through all possible scenarios, this pragmatic operations guide calls out what works, as demonstrated in critical deployments. Get a high-level overview of HDFS and MapReduce: why they exist and how they work Plan a Hadoop deployment, from hardware and OS selection to network requirements Learn setup and configuration details with a list of critical properties Manage resources by sharing a cluster across multiple groups Get a runbook of the most common cluster maintenance tasks Monitor Hadoop clusters—and learn troubleshooting with the help of real-world war stories Use basic tools and techniques to handle backup and catastrophic failure

Geolocation in iOS

The iPhone and iPad allow user positioning via multiple methods, including GPS. The growing number location-aware, and location-fenced, applications now arriving in the App Store make heavy use of these abilities. This book walks you through the basic tools you need to build geo-aware applications before diving into the available third-party geo-SDKs available for the iOS platform.

IBM System Storage N series Software Guide

Corporate workgroups, distributed enterprises, and small to medium-sized companies are increasingly seeking to network and consolidate storage to improve availability, share information, reduce costs, and protect and secure information. These organizations require enterprise-class solutions capable of addressing immediate storage needs cost-effectively, while providing an upgrade path for future requirements. IBM® System Storage® N series storage systems and their software capabilities are designed to meet these requirements. IBM System Storage N series storage systems offer an excellent solution for a broad range of deployment scenarios. IBM System Storage N series storage systems function as a mulitprotocol storage device that is designed to allow you to simultaneously serve both file and block-level data across a single network. These activities are demanding procedures that, for some solutions, require multiple, separately managed systems. The flexibility of IBM System Storage N series storage systems, however, allows them to address the storage needs of a wide range of organizations, including distributed enterprises and data centers for midrange enterprises. IBM System Storage N series storage systems also support sites with computer and data-intensive enterprise applications, such as database, data warehousing, workgroup collaboration, and messaging. This IBM® Redbooks® publication explains the software features of the IBM System Storage N series storage systems. This book also covers topics such as installation, setup, and administration of those software features from the IBM System Storage N series storage systems and clients and provides example scenarios.

Oracle Exalytics Revealed

An in-depth Oracle Press guide to Oracle Exalytics In-Memory Machine Written by Oracle ACE Director Mark Rittman, this e-book offers comprehensive coverage of Oracle Exalytics In-Memory Machine, the best-in-class, preintegrated BI platform. After an overview of the product and its capabilities, you’ll get complete details on installation, configuration, administration, maintenance, and creation of the in-memory adaptive data mart that powers the “speed-of-thought” dashboards. Expert tips for integrating with both Oracle and non-Oracle enterprise data sources are included. Enable your organization to analyze large amounts of data via rich, visual dashboards using Oracle Exalytics In-Memory Machine and this Oracle Press guide.

Professional Microsoft SQL Server 2012 Analysis Services with MDX and DAX

Understand Microsoft's dramatically updated new release of its premier toolset for business intelligence The first major update to Microsoft's state-of-the-art, complex toolset for business intelligence (BI) in years is now available and what better way to master it than with this detailed book from key members of the product's development team? If you're a database or data warehouse developer, this is the expert resource you need to build full-scale, multi-dimensional, database applications using Microsoft's new SQL Server 2012 Analysis Services and related tools. Discover how to solve real-world BI problems by leveraging a slew of powerful new Analysis Services features and capabilities. These include the new DAX language, which is a more user-friendly version of MDX; PowerPivot, a new tool for performing simplified analysis of data; BISM, Microsoft's new Business Intelligence Semantic Model; and much more. Serves as an authoritative guide to Microsoft's new SQL Server 2012 Analysis Services BI product and is written by key members of the Microsoft Analysis Services product development team Covers SQL Server 2012 Analysis Services, a major new release with a host of powerful new features and capabilities Topics include using the new DAX language, a simplified, more user-friendly version of MDX; PowerPivot, a new tool for performing simplified analysis of data; BISM, Microsoft's new Business Intelligence Semantic Model; and a new, yet-to-be-named BI reporting tool Explores real-world scenarios to help developers build comprehensive solutions Get thoroughly up to speed on this powerful new BI toolset with the timely and authoritative Professional Microsoft SQL Server 2012 Analysis Services with MDX.

Student Solutions Manual to Accompany Loss Models: From Data to Decisions, Fourth Edition

Praise for the Third Edition "This book provides in-depth coverage of modelling techniques used throughout many branches of actuarial science...The exceptional high standard of this book has made it a pleasure to read." --Annals of Actuarial Science Newly organized to focus exclusively on material tested in the Society of Actuaries' Exam C and the Casualty Actuarial Society's Exam 4, Loss Models: From Data to Decisions, Fourth Edition continues to supply actuaries with a practical approach to the key concepts and techniques needed on the job. With updated material and extensive examples, the book successfully provides the essential methods for using available data to construct models for the frequency and severity of future adverse outcomes. The book continues to equip readers with the tools needed for the construction and analysis of mathematical models that describe the process by which funds flow into and out of an insurance system. Focusing on the loss process, the authors explore key quantitative techniques including random variables, basic distributional quantities, and the recursive method, and discuss techniques for classifying and creating distributions. Parametric, non-parametric, and Bayesian estimation methods are thoroughly covered along with advice for choosing an appropriate model. New features of this Fourth Edition include: Expanded discussion of working with large data sets, now including more practical elements of constructing decrement tables Added coverage of methods for simulating several special situations An updated presentation of Bayesian estimation, outlining conjugate prior distributions and the linear exponential family as well as related computational issues Throughout the book, numerous examples showcase the real-world applications of the presented concepts, with an emphasis on calculations and spreadsheet implementation. A wealth of new exercises taken from previous Exam C/4 exams allows readers to test their comprehension of the material, and a related FTP site features the book's data sets. Loss Models, Fourth Edition is an indispensable resource for students and aspiring actuaries who are preparing to take the SOA and CAS examinations. The book is also a valuable reference for professional actuaries, actuarial students, and anyone who works with loss and risk models. To explore our additional offerings in actuarial exam preparation visit www.wiley.com/go/c4actuarial.

Hadoop in Practice

Hadoop in Practice collects 85 Hadoop examples and presents them in a problem/solution format. Each technique addresses a specific task you'll face, like querying big data using Pig or writing a log file loader. You'll explore each problem step by step, learning both how to build and deploy that specific solution along with the thinking that went into its design. As you work through the tasks, you'll find yourself growing more comfortable with Hadoop and at home in the world of big data. About the Technology Hadoop is an open source MapReduce platform designed to query and analyze data distributed across large clusters. Especially effective for big data systems, Hadoop powers mission-critical software at Apple, eBay, LinkedIn, Yahoo, and Facebook. It offers developers handy ways to store, manage, and analyze data. About the Book Hadoop in Practice collects 85 battle-tested examples and presents them in a problem/solution format. It balances conceptual foundations with practical recipes for key problem areas like data ingress and egress, serialization, and LZO compression. You'll explore each technique step by step, learning how to build a specific solution along with the thinking that went into it. As a bonus, the book's examples create a well-structured and understandable codebase you can tweak to meet your own needs. What's Inside Conceptual overview of Hadoop and MapReduce 85 practical, tested techniques Real problems, real solutions How to integrate MapReduce and R About the Reader This book assumes you've already started exploring Hadoop and want concrete advice on how to use it in production. About the Author Alex Holmes is a senior software engineer with extensive expertise in solving big data problems using Hadoop. He has presented at JavaOne and Jazoon and is a technical lead at VeriSign. Quotes Interesting topics that tickle the creative brain. - Mark Kemna, Brillig Ties together the Hadoop ecosystem technologies. - Ayon Sinha, Britely Comprehensive … high-quality code samples. - Chris Nauroth, The Walt Disney Company Covers all of the variants of Hadoop, not just the Apache distribution. - Ted Dunning, MapR Technologies Charts a path to the future. - Alexey Gayduk, Grid Dynamics

Structural Equation Modeling: Applications Using Mplus

A reference guide for applications of SEM using Mplus Structural Equation Modeling: Applications Using Mplus is intended as both a teaching resource and a reference guide. Written in non-mathematical terms, this book focuses on the conceptual and practical aspects of Structural Equation Modeling (SEM). Basic concepts and examples of various SEM models are demonstrated along with recently developed advanced methods, such as mixture modeling and model-based power analysis and sample size estimate for SEM. The statistical modeling program, Mplus, is also featured and provides researchers with a flexible tool to analyze their data with an easy-to-use interface and graphical displays of data and analysis results. Key features: Presents a useful reference guide for applications of SEM whilst systematically demonstrating various advanced SEM models, such as multi-group and mixture models using Mplus. Discusses and demonstrates various SEM models using both cross-sectional and longitudinal data with both continuous and categorical outcomes. Provides step-by-step instructions of model specification and estimation, as well as detail interpretation of Mplus results. Explores different methods for sample size estimate and statistical power analysis for SEM. By following the examples provided in this book, readers will be able to build their own SEM models using Mplus. Teachers, graduate students, and researchers in social sciences and health studies will also benefit from this book.

Pro T-SQL 2012 Programmer's Guide, Third Edition

Pro T-SQL 2012 Programmer's Guide is every developer's key to making full use of SQL Server 2012's powerful, built-in Transact-SQL language. Discussing new and existing features, the book takes you on an expert guided tour of Transact-SQL functionality. Fully functioning examples and downloadable source code bring technically accurate and engaging treatment of Transact-SQL into your own hands. Step-by-step explanations ensure clarity, and an advocacy of best-practices will steer you down the road to success. Transact-SQL is the language developers and DBAs use to interact with SQL Server. It's used for everything from querying data, to writing stored procedures, to managing the database. New features in T-SQL 2012 include full support for window functions, stored sequences, the ability to throw errors, data paging, and more. All these important new features are covered in this book. Developers and DBAs alike can benefit from the expressive power of Transact-SQL, and Pro T-SQL 2012 Programmer's Guide provides the gateway to success in applying this increasingly important database language to everyday business and technical tasks. What you'll learn Take full advantage of window functions and other new features in T-SQL 2012 Move business logic into the database via procedures and functions Develop using advanced features such as encryption and full-text search Store and retrieve XML data using XQuery and XPath Debug and Optimize T-SQL Execution Build ADO.NET Client Applications Who this book is for Pro T-SQL 2012 Programmer's Guide is written for SQL Server and Transact-SQL developers who want to implement best-practices and take full advantage of all that SQL Server has to offer. NET programmers will find the book helpful in showing how to run .NET code within SQL Server. Database administrators who need to write triggers and the occasional stored procedure will also benefit from the book.

IBM System Storage N series Hardware Guide

This IBM® Redbooks® publication provides a detailed look at the features, benefits, and capabilities of the IBM System Storage® N series hardware offerings. The IBM System Storage N series systems can help you tackle the challenge of effective data management by using virtualization technology and a unified storage architecture. The N series delivers low- to high-end enterprise storage and data management capabilities with midrange affordability. Built-in serviceability and manageability features help support your efforts to increase reliability; simplify and unify storage infrastructure and maintenance; and deliver exceptional economy. The IBM System Storage N series systems provide a range of reliable, scalable storage solutions to meet various storage requirements. These capabilities are achieved by using network access protocols such as Network File System (NFS), Common Internet File System (CIFS), HTTP, and iSCSI, and storage area network technologies such as Fibre Channel. Using built-in Redundant Array of Independent Disks (RAID) technologies, all data is protected with options to enhance protection through mirroring, replication, Snapshots, and backup. These storage systems also have simple management interfaces that make installation, administration, and troubleshooting straightforward. In addition, this book also addresses high-availability solutions including clustering and MetroCluster supporting highest business continuity requirements. MetroCluster is a unique solution that combines array-based clustering with synchronous mirroring to deliver continuous availability.

Effective MySQL Replication Techniques in Depth

The Essential Guide to Scaling Your Business Solution Written by Oracle ACE Director and MySQL expert Ronald Bradford, with coauthor Chris Schneider, Effective MySQL: Replication Techniques in Depth describes what is needed to understand and implement MySQL replication to build scalable solutions. This book includes detailed syntax examples to demonstrate the features, options, and limitations of native MySQL replication. Providing an evaluation of various new replication features and additional third-party product implementations, this Oracle Press guide helps to ensure your MySQL environment can support the various high-availability needs of your business. Master the strengths and limitations of native asynchronous replication in a MySQL topology Identify the important features to improve replication for growing business requirements Recognize the key business factors to determine your optimal highavailability needs Understand the benefits of using MySQL replication for failover scenarios Identify the key configuration variables and SQL commands affecting master/ slave replication Learn about the advancements in replication techniques provided by new products, including Tungsten Replicator and Galera Optimize your replication management with various utilities and toolkits Find additional detailed information and presentations at EffectiveMySQL.com.