talk-data.com talk-data.com

Topic

ETL/ELT

ETL/ELT

data_integration data_transformation data_loading

21

tagged

Activity Trend

40 peak/qtr
2020-Q1 2026-Q1

Activities

Showing filtered results

Filtering by: O'Reilly Data Science Books ×
Essential Data Analytics, Data Science, and AI: A Practical Guide for a Data-Driven World

In today’s world, understanding data analytics, data science, and artificial intelligence is not just an advantage but a necessity. This book is your thorough guide to learning these innovative fields, designed to make the learning practical and engaging. The book starts by introducing data analytics, data science, and artificial intelligence. It illustrates real-world applications, and, it addresses the ethical considerations tied to AI. It also explores ways to gain data for practice and real-world scenarios, including the concept of synthetic data. Next, it uncovers Extract, Transform, Load (ETL) processes and explains how to implement them using Python. Further, it covers artificial intelligence and the pivotal role played by machine learning models. It explains feature engineering, the distinction between algorithms and models, and how to harness their power to make predictions. Moving forward, it discusses how to assess machine learning models after their creation, with insights into various evaluation techniques. It emphasizes the crucial aspects of model deployment, including the pros and cons of on-device versus cloud-based solutions. It concludes with real-world examples and encourages embracing AI while dispelling fears, and fostering an appreciation for the transformative potential of these technologies. Whether you’re a beginner or an experienced professional, this book offers valuable insights that will expand your horizons in the world of data and AI. What you will learn: What are Synthetic data and Telemetry data How to analyze data using programming languages like Python and Tableau. What is feature engineering What are the practical Implications of Artificial Intelligence Who this book is for: Data analysts, scientists, and engineers seeking to enhance their skills, explore advanced concepts, and stay up-to-date with ethics. Business leaders and decision-makers across industries are interested in understanding the transformative potential and ethical implications of data analytics and AI in their organizations.

Data Modeling with Microsoft Power BI

Data modeling is the single most overlooked feature in Power BI Desktop, yet it's what sets Power BI apart from other tools on the market. This practical book serves as your fast-forward button for data modeling with Power BI, Analysis Services tabular, and SQL databases. It serves as a starting point for data modeling, as well as a handy refresher. Author Markus Ehrenmueller-Jensen, founder of Savory Data, shows you the basic concepts of Power BI's semantic model with hands-on examples in DAX, Power Query, and T-SQL. If you're looking to build a data warehouse layer, chapters with T-SQL examples will get you started. You'll begin with simple steps and gradually solve more complex problems. This book shows you how to: Normalize and denormalize with DAX, Power Query, and T-SQL Apply best practices for calculations, flags and indicators, time and date, role-playing dimensions and slowly changing dimensions Solve challenges such as binning, budget, localized models, composite models, and key value with DAX, Power Query, and T-SQL Discover and tackle performance issues by applying solutions in DAX, Power Query, and T-SQL Work with tables, relations, set operations, normal forms, dimensional modeling, and ETL

Mastering Tableau 2023 - Fourth Edition

This comprehensive book on Tableau 2023 is your practical guide to mastering data visualization and business intelligence techniques. You will explore the latest features of Tableau, learn how to create insightful dashboards, and gain proficiency in integrating analytics and machine learning workflows. By the end, you'll have the skills to address a variety of analytics challenges using Tableau. What this Book will help me do Master the latest Tableau 2023 features and use cases to tackle analytics challenges. Develop and implement ETL workflows using Tableau Prep Builder for optimized data preparation. Integrate Tableau with programming languages such as Python and R to enhance analytics. Create engaging, visually impactful dashboards for effective data storytelling. Understand and apply data governance to ensure data quality and compliance. Author(s) Marleen Meier is an experienced data visualization expert and Tableau consultant with over a decade of experience helping organizations transform data into actionable insights. Her approach integrates her technical expertise and a keen eye for design to make analytics accessible rather than overwhelming. Her passion for teaching others to use visualization tools effectively shines through in her writing. Who is it for? This book is ideal for business analysts, BI professionals, or data analysts looking to enhance their Tableau expertise. It caters to both newcomers seeking to understand the foundations of Tableau and experienced users aiming to refine their skills in advanced analytics and data visualization. If your goal is to leverage Tableau as a strategic tool in your organization's BI projects, this book is for you.

Transitioning to Microsoft Power Platform: An Excel User Guide to Building Integrated Cloud Applications in Power BI, Power Apps, and Power Automate

Welcome to this step-by-step guide for Excel users, data analysts, and finance specialists. It is designed to take you through practical report and development scenarios, including both the approach and the technical challenges. This book will equip you with an understanding of the overall Power Platform use case for addressing common business challenges. While Power BI continues to be an excellent tool of choice in the BI space, Power Platform is the real game changer. Using an integrated architecture, a small team of citizen developers can build solutions for all kinds of business problems. For small businesses, Power Platform can be used to build bespoke CRM, Finance, and Warehouse management tools. For large businesses, it can be used to build an integration point for existing systems to simplify reporting, operation, and approval processes. The author has drawn on his15 years of hands-on analytics experience to help you pivot from the traditional Excel-based reporting environment. By using different business scenarios, this book provides you with clear reasons why a skill is important before you start to dive into the scenarios. You will use a fast prototyping approach to continue to build exciting reporting, automation, and application solutions and improve them while you acquire new skill sets. The book helps you get started quickly with Power BI. It covers data visualization, collaboration, and governance practices. You will learn about the most practical SQL challenges. And you will learn how to build applications in PowerApps and Power Automate. The book ends with an integrated solution framework that can be adapted to solve a wide range of complex business problems. What You Will Learn Develop reporting solutions and business applications Understand the Power Platform licensing and development environment Apply Data ETL and modeling in Power BI Use Data Storytelling and dashboard design to better visualize data Carry out data operations with SQL and SharePoint lists Develop useful applications using Power Apps Develop automated workflows using Power Automate Integrate solutions with Power BI, Power Apps, and Power Automate to build enterprise solutions Who This Book Is For Next-generation data specialists, including Excel-based users who want to learn Power BI and build internal apps; finance specialists who want to take a different approach to traditional accounting reports; and anyone who wants to enhance their skill set for the future job market.

Serverless Analytics with Amazon Athena

Delve into the serverless world of Amazon Athena with the comprehensive book 'Serverless Analytics with Amazon Athena'. This guide introduces you to the power of Athena, showing you how to efficiently query data in Amazon S3 using SQL without the hassle of managing infrastructure. With clear instructions and practical examples, you'll master querying structured, unstructured, and semi-structured data seamlessly. What this Book will help me do Effectively query and analyze both structured and unstructured data stored in S3 using Amazon Athena. Integrate Athena with other AWS services to create powerful, secure, and cost-efficient data workflows. Develop ETL pipelines and machine learning workflows leveraging Athena's compatibility with AWS Glue. Monitor and troubleshoot Athena queries for consistent performance and build scalable serverless data solutions. Implement security best practices and optimize costs when managing your Athena-driven data solutions. Author(s) None Virtuoso, along with co-authors Mert Turkay Hocanin None and None Wishnick, brings a wealth of experience in cloud solutions, serverless technologies, and data engineering. They excel in demystifying complex technical topics and have a passion for empowering readers with practical skills and knowledge. Who is it for? This book is tailored for business intelligence analysts, application developers, and system administrators who want to harness Amazon Athena for seamless, cost-efficient data analytics. It suits individuals with basic SQL knowledge looking to expand their capabilities in querying and processing data. Whether you're managing growing datasets or building data-driven applications, this book provides the know-how to get it right.

Mastering SAS Programming for Data Warehousing

"Mastering SAS Programming for Data Warehousing" dives into the effective use of SAS for handling large-scale data environments like data warehouses and data lakes. You will learn to design and manage ETL processes using SAS, standardize workflows with macros and arrays, and connect SAS to other systems to enhance reporting and data visualization. What this Book will help me do Master efficient data input/output management in SAS environments. Design and maintain robust ETL pipelines using SAS macros and arrays. Identify and address data warehouse user requirements. Utilize Output Delivery System (ODS) to create professional reports. Integrate SAS with external systems for optimized data processing. Author(s) Monika Wahi brings extensive SAS programming experience coupled with a strong background in data warehousing and data analysis. Her insightful approach demystifies complex topics, focusing on equipping readers with practical skills. Her collaborative writing style makes advanced concepts accessible and applicable to real-world scenarios. Who is it for? This book is designed for data professionals such as architects, managers leading data-intensive projects, and SAS programmers or developers. It's ideal for those with foundational SAS experience who aspire to manage, maintain, or develop data lakes, marts, or warehouses effectively. The book offers a logical progression from basic concepts to advanced implementations, tailored for ambitious learners.

Prepare Your Data for Tableau: A Practical Guide to the Tableau Data Prep Tool

Focus on the most important and most often overlooked factor in a successful Tableau project—data. Without a reliable data source, you will not achieve the results you hope for in Tableau. This book does more than teach the mechanics of data preparation. It teaches you: how to look at data in a new way, to recognize the most common issues that hinder analytics, and how to mitigate those factors one by one. Tableau can change the course of business, but the old adage of "garbage in, garbage out" is the hard truth that hides behind every Tableau sales pitch. That amazing sales demo does not work as well with bad data. The unfortunate reality is that almost all data starts out in a less-than-perfect state. Data prep is hard. Traditionally, we were forced into the world of the database where complex ETL (Extract, Transform, Load) operations created by the data team did all the heavy lifting for us. Fortunately, we have moved past those days. With the introduction of the Tableau Data Prep tool you can now handle most of the common Data Prep and cleanup tasks on your own, at your desk, and without the help of the data team. This essential book will guide you through: The layout and important parts of the Tableau Data Prep tool Connecting to data Data quality and consistency The shape of the data. Is the data oriented in columns or rows? How to decide? Why does it matter? What is the level of detail in the source data? Why is that important? Combining source data to bring in more fields and rows Saving the data flow and the results of our data prep work Common cleanup and setup tasks in Tableau Desktop What You Will Learn Recognize data sources that are good candidates for analytics in Tableau Connect tolocal, server, and cloud-based data sources Profile data to better understand its content and structure Rename fields, adjust data types, group data points, and aggregate numeric data Pivot data Join data from local, server, and cloud-based sources for unified analytics Review the steps and results of each phase of the Data Prep process Output new data sources that can be reviewed in Tableau or any other analytics tool Who This Book Is For Tableau Desktop users who want to: connect to data, profile the data to identify common issues, clean up those issues, join to additional data sources, and save the newly cleaned, joined data so that it can be used more effectively in Tableau

Pentaho Data Integration Quick Start Guide

Pentaho Data Integration Quick Start Guide offers a comprehensive introduction to Pentaho's Extract-Transform-Load (ETL) tools. Through this book, you will learn to design, execute, and monitor data transformations and seamlessly integrate data across various sources. It is designed to simplify and streamline the process for developers and analysts. What this Book will help me do Understand the functionality and usage of Pentaho Data Integration tools to manage your ETL workflows. Utilize tools like Spoon to design, execute, and manage transformations effectively. Learn to connect to and process data from diverse data sources, including files and databases. Develop skills in transforming data using various techniques provided by PDI to create meaningful outcomes. Master creating jobs to sequence tasks and automate data workflows efficiently. Author(s) The author, Carina Roldán, is an experienced professional specializing in data integration and ETL processes using Pentaho tools. She leverages her extensive experience to craft this book with clarity and accessibility, making it simple for readers to grasp key concepts. Her instructional style is straightforward and geared toward minimizing beginners' challenges. Who is it for? This book is ideal for data analysts, business intelligence developers, and software engineers who want to utilize Pentaho Data Integration for ETL tasks and data workflows. No prior in-depth experience with Pentaho is necessary, but familiarity with basic data concepts is recommended. Readers will benefit most if they are seeking practical skills in data integration to solve real-world problems and streamline data processes.

Learning Pentaho Data Integration 8 CE - Third Edition

"Learning Pentaho Data Integration 8 CE" is your comprehensive guide to mastering data manipulation and integration using Pentaho Data Integration (PDI) 8 Community Edition. Through step-by-step instructions and practical examples, you'll learn to explore, transform, validate, and integrate data from multiple sources, equipping you to handle real-world data challenges efficiently. What this Book will help me do Effectively install and understand the foundational concepts of Pentaho Data Integration 8 Community Edition. Efficiently organize, clean, and transform raw data from various sources into useful formats. Perform advanced data operations like metadata injection, managing relational databases, and implementing ETL solutions. Design, create, and deploy comprehensive data warehouse solutions using modern best practices. Streamline daily data processing tasks with flexibility and accuracy while handling errors gracefully. Author(s) The author, Carina Roldán, is an experienced professional in the field of data science and ETL (Extract, Transform, Load) development. Her expertise in leveraging tools like Pentaho Data Integration has allowed her to contribute significantly to BI and data management projects. Her approach in writing this book reflects her commitment to simplifying complex topics for aspiring professionals. Who is it for? This book is ideal for software developers, data analysts, business intelligence professionals, and IT students aiming to enhance their skills in ETL processes using Pentaho Data Integration. Beginners who wish to learn PDI comprehensively and professionals looking to deepen their expertise will both find value in this resource. It's also suitable for individuals involved in data warehouse design and implementation. This book will equip you with the skills to handle diverse data transformation tasks effectively.

Learning Informatica PowerCenter 10.x - Second Edition

Dive into the world of Informatica PowerCenter 10.x, where enterprise data warehousing meets cutting-edge data management solutions. This comprehensive guide walks you through mastering ETL processes and optimizing performance, helping you become proficient in this powerful data integration tool. With step-by-step instructions, you'll build your knowledge from installation to advanced techniques. What this Book will help me do Understand how to install and configure Informatica PowerCenter 10.x for enterprise-level data integration projects, ensuring readiness to start transforming data effectively. Gain hands-on experience with PowerCenter's various developer tools, including Workflow Manager, Workflow Monitor, Designer, and Repository Manager, mastering their practical utilities. Learn and apply essential data warehousing concepts, such as Slowly Changing Dimensions (SCDs) and Incremental Aggregations, to create robust data-handling workflows. Leverage advanced PowerCenter features like pushdown optimization and partitioning to optimize performance for large-scale data processing jobs. Become proficient in migrating sources, targets, and workflows between environments, enabling seamless integration of data management solutions across enterprise systems. Author(s) Rahul Malewar, a seasoned expert in ETL and data integration, brings his extensive experience with Informatica PowerCenter to the table. With years spent working alongside global enterprises to streamline their data operations, Rahul's insights transfer into a hands-on teaching style that simplifies even the most advanced concepts. Apt at bridging technical depth with accessible explanations, he has dedicated his career to empowering learners to unlock the full potential of their data warehousing tools. Who is it for? Perfect for developers and data professionals aiming to elevate their enterprise data management skills, this book is ideally suited for those new to or experienced with Informatica. Whether you're striving to become proficient in PowerCenter or seeking to implement advanced ETL concepts in your projects, this guide will equip you with the expertise to succeed. A foundational understanding of programming and data warehousing concepts is recommended for best results.

Integrated Analytics

Companies are collecting more data than ever. But, given how difficult it is to unify the many internal and external data streams they’ve built, more data doesn’t necessarily translate into better analytics. The real challenge is to provide deep and broad access to “a single source of truth” in their data that the typically slow ETL process for data warehousing cannot achieve. More than just fast access, analysts need the ability to explore data at a granular level. In this O’Reilly report, author Courtney Webster presents a roadmap to data centralization that will help your organization make data accessible, flexible, and actionable. Building a genuine data-driven culture depends on your company’s ability to quickly act upon new findings. This report explains how. Identify stakeholders: build a culture of trust and awareness among decision makers, data analysts, and quality management Create a data plan: define your needs, specify your metrics, identify data sources, and standardize metric definitions Centralize the data: evaluate each data source for existing common fields and, if you can, minor variances, and standardize data references Find the right tool(s) for the job: choose from legacy architecture tools, managed and cloud-only services, and data visualization or data exploration platforms Courtney Webster is a reformed chemist in the Washington, D.C. metro area. She spent a few years after grad school programming robots to do chemistry and is now managing web and mobile applications for clinical research trials.

QlikView Your Business

Unlock the meaning of your data with QlikView The Qlik platform was designed to provide a fast and easy data analytics tool, and QlikView Your Business is your detailed, full-color, step-by-step guide to understanding Qlikview's powerful features and techniques so you can quickly start unlocking your data’s potential. This expert author team brings real-world insight together with practical business analytics, so you can approach, explore, and solve business intelligence problems using the robust Qlik toolset and clearly communicate your results to stakeholders using powerful visualization features in QlikView and Qlik Sense. This book starts at the basic level and dives deep into the most advanced QlikView techniques, delivering tangible value and knowledge to new users and experienced developers alike. As an added benefit, every topic presented in the book is enhanced with tips, tricks, and insightful recommendations that the authors accumulated through years of developing QlikView analytics. This is the book for you: If you are a developer whose job is to load transactional data into Qlik BI environment, and who needs to understand both the basics and the most advanced techniques of Qlik data modelling and scripting If you are a data analyst whose job is to develop actionable and insightful QlikView visualizations to share within your organization If you are a project manager or business person, who wants to get a better understanding of the Qlik Business Intelligence platform and its capabilities What You Will Learn: The book covers three common business scenarios - Sales, Profitability, and Inventory Analysis. Each scenario contains four chapters, covering the four main disciplines of business analytics: Business Case, Data Modeling, Scripting, and Visualizations. The material is organized by increasing levels of complexity. Following our comprehensive tutorial, you will learn simple and advanced QlikView and Qlik Sense concepts, including the following: Data Modeling: Transforming Transactional data into Dimensional models Building a Star Schema Linking multiple fact tables using Link Tables Combing multiple tables into a single fact able using Concatenated Fact models Managing slowly changing dimensions Advanced date handling, using the As of Date table Calculating running balances Basic and Advanced Scripting: How to use the Data Load Script language for implementing data modeling techniques How to build and use the QVD data layer Building a multi-tier data architectures Using variables, loops, subroutines, and other script control statements Advanced scripting techniques for a variety of ETL solutions Building Insightful Visualizations in QlikView: Introduction into QlikView sheet objects — List Boxes, Text Objects, Charts, and more Designing insightful Dashboards in QlikView Using advanced calculation techniques, such as Set Analysis and Advanced Aggregation Using variables for What-If Analysis, as well as using variables for storing calculations, colors, and selection filters Advanced visualization techniques - normalized and non-normalized Mekko charts, Waterfall charts, Whale Tail charts, and more Building Insightful Visualizations in Qlik Sense: Introducing Qlik Sense - how it is different from QlikView and what is similar? Creating Sense sheet objects Building and using the Library of Master Items Exploring Qlik Sense unique features — Storytelling, Geo Mapping, and using Extensions Whether you are jus

Mastering QlikView

"Mastering QlikView" is your advanced guide to unlocking the potential of business intelligence through QlikView. Dive deep into powerful data modeling, performance tuning, and visualization techniques, crafted to empower you in making data-driven decisions and optimizing your BI workflows. What this Book will help me do Understand and implement advanced QlikView data modeling techniques for efficient analysis. Master performance tuning methods to ensure your QlikView applications are fast and scalable. Apply industry best practices for ETL and data loading using QVDs and other QlikView features. Create advanced visualizations and dashboards that distill analytics into actionable insights. Leverage metadata management tools and governance techniques to maintain data integrity and consistency. Author(s) Stephen Redmond, an expert in business intelligence and data visualization, brings years of hands-on experience with QlikView and Qlik Sense. As a seasoned developer and thought leader, Stephen specializes in distilling complex BI methodologies into practical skills. His approachable style makes advanced topics accessible and engaging to readers. Who is it for? This book is tailored for business application developers and system analysts already familiar with QlikView. Ideal for professionals seeking to enhance their BI proficiency with advanced QlikView capabilities. If you're aiming to solve complex data challenges or refine your visualization skills, this book provides the expert guidance to take your knowledge further.

Pentaho Data Integration Cookbook - Second Edition - Second Edition

This cookbook is a comprehensive guide to using Pentaho Data Integration (Kettle) for executing ETL processes effectively. With step-by-step recipes, it covers everything from connecting to diverse data sources to implementing advanced data handling workflows. This book is a valuable resource to streamline and enhance your data integration tasks. What this Book will help me do Learn to configure Kettle to connect with various databases and applications. Understand how to embed Java code for optimized transformations. Discover techniques to reuse and manage transformations and jobs. Master the integration of Kettle with other Pentaho Suite components. Explore advanced data flow control and manipulation tactics. Author(s) The authors of this book are experienced professionals in data integration and Pentaho tools. They bring years of practical industry experience and have a passion for sharing knowledge through clear, hands-on tutorials. Their approach to writing ensures readers can take actionable insights directly to their work. Who is it for? This book is ideal for developers familiar with the fundamental concepts of Kettle who aim to delve deeper into advanced functionalities. Readers should have basic ETL knowledge and the ambition to master Pentaho Data Integration. Experienced users will find valuable tips and learn about new features to automate and enhance their processes.

Pentaho Data Integration Beginner's Guide - Second Edition

This book is a comprehensive guide designed for those new to Pentaho Data Integration. With a focus on practical application and step-by-step learning, this book covers everything from installation to complex data manipulation. By following along, you will acquire the skills you need to efficiently manage and transform data using Pentaho. What this Book will help me do Understand how to install and set up Pentaho Data Integration for professional data manipulation. Master data transformation tasks such as cleaning, sorting, and integrating different data sources. Learn to configure and operate databases within the Pentaho environment, including CRUD operations. Gain hands-on experience with data warehousing concepts and using Pentaho to populate data warehouses. Develop workflows and schedules for automated data processes using Pentaho's advanced tools. Author(s) Carina Roldán is an experienced data professional with extensive expertise in the field of ETL and data integration. Her teaching style is clear, approachable, and heavily reliant on practical examples. She focuses on enabling learners to build real-world skills in a supportive and engaging manner, making complex topics accessible to everyone. Who is it for? This book is perfect for developers, database administrators, and IT professionals looking to venture into ETL tools or seeking a deeper understanding of Pentaho Data Integration. Beginners without prior exposure to Pentaho Data Integration will find it an excellent entry point, while those with some experience will benefit from its in-depth insights. It is also valuable for data warehouse designers and architects aiming to streamline their workflows.

Pentaho® Kettle Solutions: Building Open Source ETL Solutions with Pentaho Data Integration

A complete guide to Pentaho Kettle, the Pentaho Data lntegration toolset for ETL This practical book is a complete guide to installing, configuring, and managing Pentaho Kettle. If you're a database administrator or developer, you'll first get up to speed on Kettle basics and how to apply Kettle to create ETL solutions—before progressing to specialized concepts such as clustering, extensibility, and data vault models. Learn how to design and build every phase of an ETL solution. Shows developers and database administrators how to use the open-source Pentaho Kettle for enterprise-level ETL processes (Extracting, Transforming, and Loading data) Assumes no prior knowledge of Kettle or ETL, and brings beginners thoroughly up to speed at their own pace Explains how to get Kettle solutions up and running, then follows the 34 ETL subsystems model, as created by the Kimball Group, to explore the entire ETL lifecycle, including all aspects of data warehousing with Kettle Goes beyond routine tasks to explore how to extend Kettle and scale Kettle solutions using a distributed "cloud" Get the most out of Pentaho Kettle and your data warehousing with this detailed guide—from simple single table data migration to complex multisystem clustered data integration tasks.

Pentaho 3.2 Data Integration Beginner's Guide

Explore, transform, validate, and integrate your data with ease Get started with Pentaho Data Integration from scratch. Enrich your data transformation operations by embedding Java and JavaScript code in PDI transformations. Create a simple but complete Datamart Project that will cover all key features of PDI. Part of Packt's Beginner's Guide series: Each task gradually develops from the previous task with an organized sequence of instructions accompanied with relevant explanation and a set of challenging tasks that the reader will be able to accomplish. In Detail Pentaho Data Integration (a.k.a. Kettle) is a full-featured open source ETL (Extract, Transform, and Load) solution. Although PDI is a feature-rich tool, effectively capturing, manipulating, cleansing, transferring, and loading data can get complicated. This book is full of practical examples that will help you to take advantage of Pentaho Data Integration's graphical, drag-and-drop design environment. You will quickly get started with Pentaho Data Integration by following the step-by-step guidance in this book. The useful tips in this book will encourage you to exploit powerful features of Pentaho Data Integration and perform ETL operations with ease. Starting with the installation of the PDI software, this book will teach you all the key PDI concepts. Each chapter introduces new features, allowing you to gradually get involved with the tool. First, you will learn to work with plain files, and to do all kinds of data manipulation. Then, the book gives you a primer on databases and teaches you how to work with databases inside PDI. Not only that, you'll be given an introduction to data warehouse concepts and you will learn to load data in a data warehouse. After that, you will learn to implement simple and complex processes. Once you've learned all the basics, you will build a simple datamart that will serve to reinforce all the concepts learned through the book. A practical, easy-to-read guide that gives you full understanding of the Pentaho Data Integration tool and shows you how to use it to your advantage to manipulate data

Pentaho® Solutions: Business Intelligence and Data Warehousing with Pentaho and MySQL®

Your all-in-one resource for using Pentaho with MySQL for Business Intelligence and Data Warehousing Open-source Pentaho provides business intelligence (BI) and data warehousing solutions at a fraction of the cost of proprietary solutions. Now you can take advantage of Pentaho for your business needs with this practical guide written by two major participants in the Pentaho community. The book covers all components of the Pentaho BI Suite. You'll learn to install, use, and maintain Pentaho-and find plenty of background discussion that will bring you thoroughly up to speed on BI and Pentaho concepts. Of all available open source BI products, Pentaho offers the most comprehensive toolset and is the fastest growing open source product suite Explains how to build and load a data warehouse with Pentaho Kettle for data integration/ETL, manually create JFree (pentaho reporting services) reports using direct SQL queries, and create Mondrian (Pentaho analysis services) cubes and attach them to a JPivot cube browser Review deploying reports, cubes and metadata to the Pentaho platform in order to distribute BI solutions to end-users Shows how to set up scheduling, subscription and automatic distribution The companion Web site provides complete source code examples, sample data, and links to related resources.

Market Risk Analysis Volume IV: Value-at-Risk Models

Written by leading market risk academic, Professor Carol Alexander, Value-at-Risk Models forms part four of the Market Risk Analysis four volume set. Building on the three previous volumes this book provides by far the most comprehensive, rigorous and detailed treatment of market VaR models. It rests on the basic knowledge of financial mathematics and statistics gained from Volume I, of factor models, principal component analysis, statistical models of volatility and correlation and copulas from Volume II and, from Volume III, knowledge of pricing and hedging financial instruments and of mapping portfolios of similar instruments to risk factors. A unifying characteristic of the series is the pedagogical approach to practical examples that are relevant to market risk analysis in practice. All together, the Market Risk Analysis four volume set illustrates virtually every concept or formula with a practical, numerical example or a longer, empirical case study. Across all four volumes there are approximately 300 numerical and empirical examples, 400 graphs and figures and 30 case studies many of which are contained in interactive Excel spreadsheets available from the the accompanying CD-ROM . Empirical examples and case studies specific to this volume include: Parametric linear value at risk (VaR)models: normal, Student t and normal mixture and their expected tail loss (ETL); New formulae for VaR based on autocorrelated returns; Historical simulation VaR models: how to scale historical VaR and volatility adjusted historical VaR; Monte Carlo simulation VaR models based on multivariate normal and Student t distributions, and based on copulas; Examples and case studies of numerous applications to interest rate sensitive, equity, commodity and international portfolios; Decomposition of systematic VaR of large portfolios into standard alone and marginal VaR components; Backtesting and the assessment of risk model risk; Hypothetical factor push and historical stress tests, and stress testing based on VaR and ETL. "The ebook version does not provide access to the companion files".