talk-data.com talk-data.com

Topic

data

2093

tagged

Activity Trend

3 peak/qtr
2020-Q1 2026-Q1

Activities

Showing filtered results

Filtering by: O'Reilly Data Science Books ×
Applied Bayesian Modelling, 2nd Edition

This book provides an accessible approach to Bayesian computing and data analysis, with an emphasis on the interpretation of real data sets. Following in the tradition of the successful first edition, this book aims to make a wide range of statistical modeling applications accessible using tested code that can be readily adapted to the reader's own applications. The second edition has been thoroughly reworked and updated to take account of advances in the field. A new set of worked examples is included. The novel aspect of the first edition was the coverage of statistical modeling using WinBUGS and OPENBUGS. This feature continues in the new edition along with examples using R to broaden appeal and for completeness of coverage.

Basic Data Analysis for Time Series with R

Written at a readily accessible level, Basic Data Analysis for Time Series with R emphasizes the mathematical importance of collaborative analysis of data used to collect increments of time or space. Balancing a theoretical and practical approach to analyzing data within the context of serial correlation, the book presents a coherent and systematic regression-based approach to model selection. The book illustrates these principles of model selection and model building through the use of information criteria, cross validation, hypothesis tests, and confidence intervals. Focusing on frequency- and time-domain and trigonometric regression as the primary themes, the book also includes modern topical coverage on Fourier series and Akaike's Information Criterion (AIC). In addition, Basic Data Analysis for Time Series with R also features: Real-world examples to provide readers with practical hands-on experience Multiple R software subroutines employed with graphical displays Numerous exercise sets intended to support readers understanding of the core concepts Specific chapters devoted to the analysis of the Wolf sunspot number data and the Vostok ice core data sets

Discovering Knowledge in Data: An Introduction to Data Mining, 2nd Edition

The field of data mining lies at the confluence of predictive analytics, statistical analysis, and business intelligence. Due to the ever-increasing complexity and size of data sets and the wide range of applications in computer science, business, and health care, the process of discovering knowledge in data is more relevant than ever before. This book provides the tools needed to thrive in today's big data world. The author demonstrates how to leverage a company's existing databases to increase profits and market share, and carefully explains the most current data science methods and techniques. The reader will "learn data mining by doing data mining". By adding chapters on data modelling preparation, imputation of missing data, and multivariate statistical analysis, Discovering Knowledge in Data, Second Edition remains the eminent reference on data mining. The second edition of a highly praised, successful reference on data mining, with thorough coverage of big data applications, predictive analytics, and statistical analysis. Includes new chapters on Multivariate Statistics, Preparing to Model the Data, and Imputation of Missing Data, and an Appendix on Data Summarization and Visualization Offers extensive coverage of the R statistical programming language Contains 280 end-of-chapter exercises Includes a companion website with further resources for all readers, and Powerpoint slides, a solutions manual, and suggested projects for instructors who adopt the book

Making Human Capital Analytics Work: Measuring the ROI of Human Capital Processes and Outcomes

PROVE THE VALUE OF YOUR HR PROGRAM WITH HARD DATA While corporate leaders may well know the value of human capital, they don’t always understand the extent to which the HR function contributes to the bottom line. So when times get tough and business budgets get cut, HR departments often take the first hit. In this groundbreaking guide, the cofounders of ROI Institute, Jack Phillips and Patti Phillips, provide the tools and techniques you need to use analytics to show top decision makers the value of HR in your organization. Focusing on three types of analytics--descriptive, predictive, and prescriptive-- Making Human Capital Analytics Work shows how you can apply analytics by: Developing relationships between variables Predicting the success of HR programs Determining the cost of intangibles that are otherwise diffi cult to value Showing the business value of particular HR programs Calculating and forecasting the ROI of various HR projects and programs Much more than a guide to using data collection and analysis, Making Human Capital Analytics Work is a template for spearheading large-scale change in your organization by dramatically influencing your department's overall image within the organization. The authors take you step-by-step through the processes of using hard data to drive decisions and demonstrate the tangible value of HR. You know that your department is more than administrative and transactional--that it's an integral player in your company's strategy. Apply the lessons in Making Human Capital Analytics Work and ensure that all other stakeholders know too.

Multiple Imputation of Missing Data Using SAS

Find guidance on using SAS for multiple imputation and solving common missing data issues.

Multiple Imputation of Missing Data Using SAS provides both theoretical background and constructive solutions for those working with incomplete data sets in an engaging example-driven format. It offers practical instruction on the use of SAS for multiple imputation and provides numerous examples that use a variety of public release data sets with applications to survey data.

Written for users with an intermediate background in SAS programming and statistics, this book is an excellent resource for anyone seeking guidance on multiple imputation. The authors cover the MI and MIANALYZE procedures in detail, along with other procedures used for analysis of complete data sets. They guide analysts through the multiple imputation process, including evaluation of missing data patterns, choice of an imputation method, execution of the process, and interpretation of results.

Topics discussed include how to deal with missing data problems in a statistically appropriate manner, how to intelligently select an imputation method, how to incorporate the uncertainty introduced by the imputation process, and how to incorporate the complex sample design (if appropriate) through use of the SAS SURVEY procedures.

Discover the theoretical background and see extensive applications of the multiple imputation process in action.

This book is part of the SAS Press program.

Practical Data Analysis with JMP, Second Edition, 2nd Edition

Understand the concepts and techniques of analysis while learning to reason statistically.

Being an effective analyst requires that you know how to properly define a problem and apply suitable statistical techniques, as well as clearly and honestly communicate the results with information-rich visualizations and precise language. Being a well-informed consumer of analyses requires the same set of skills so that you can recognize credible, actionable research when you see it.

Robert Carver's Practical Data Analysis with JMP, Second Edition uses the powerful interactive and visual approach of JMP to introduce readers to the logic and methods of statistical thinking and data analysis. It enables you to discriminate among and to use fundamental techniques of analysis, enabling you to engage in statistical thinking by analyzing real-world problems. “Application Scenarios” at the end of each chapter challenge you to put your knowledge and skills to use with data sets that go beyond mere repetition of chapter examples, and three new review chapters help readers integrate ideas and techniques. In addition, the scope and sequence of the chapters have been updated with more coverage of data management and analysis of data.

The book can stand on its own as a learning resource for professionals or be used to supplement a standard college-level introduction-to-statistics textbook. It includes varied examples and problems that rely on real sets of data, typically starting with an important or interesting research question that an investigator has pursued. Reflective of the broad applicability of statistical reasoning, the problems come from a wide variety of disciplines, including engineering, life sciences, business, economics, among

Practical Data Analysis with JMP, Second Edition introduces you to the major platforms and essential features of JMP and will leave you with a sufficient background and the confidence to continue your exploration independently.

This book is part of the SAS Press program.

Risk-Based Monitoring and Fraud Detection in Clinical Trials Using JMP and SAS

Improve efficiency while reducing costs in clinical trials with centralized monitoring techniques using JMP and SAS.

International guidelines recommend that clinical trial data should be actively reviewed or monitored; the well-being of trial participants and the validity and integrity of the final analysis results are at stake. Traditional interpretation of this guidance for pharmaceutical trials has led to extensive on-site monitoring, including 100% source data verification. On-site review is time consuming, expensive (estimated at up to a third of the cost of a clinical trial), prone to error, and limited in its ability to provide insight for data trends across time, patients, and clinical sites. In contrast, risk-based monitoring (RBM) makes use of central computerized review of clinical trial data and site metrics to determine if and when clinical sites should receive more extensive quality review or intervention.

Risk-Based Monitoring and Fraud Detection in Clinical Trials Using JMP and SAS presents a practical implementation of methodologies within JMP Clinical for the centralized monitoring of clinical trials. Focused on intermediate users, this book describes analyses for RBM that incorporate and extend the recommendations of TransCelerate Biopharm Inc., methods to detect potential patient-or investigator misconduct, snapshot comparisons to more easily identify new or modified data, and other novel visual and analytical techniques to enhance safety and quality reviews. Further discussion highlights recent regulatory guidance documents on risk-based approaches, addresses the requirements for CDISC data, and describes methods to supplement analyses with data captured external to the study database.

Given the interactive, dynamic, and graphical nature of JMP Clinical, any individual from the clinical trial team - including clinicians, statisticians, data managers, programmers, regulatory associates, and monitors - can make use of this book and the numerous examples contained within to streamline, accelerate, and enrich their reviews of clinical trial data.

The analytical methods described in Risk-Based Monitoring and Fraud Detection in Clinical Trials Using JMP and SAS enable the clinical trial team to take a proactive approach to data quality and safety to streamline clinical development activities and address shortcomings while the study is ongoing.

This book is part of the SAS Press

Analytics and Dynamic Customer Strategy: Big Profits from Big Data

Key decisions determine the success of big data strategy Dynamic Customer Strategy: Big Profits from Big Data is a comprehensive guide to exploiting big data for both business-to-consumer and business-to-business marketing. This complete guide provides a process for rigorous decision making in navigating the data-driven industry shift, informing marketing practice, and aiding businesses in early adoption. Using data from a five-year study to illustrate important concepts and scenarios along the way, the author speaks directly to marketing and operations professionals who may not necessarily be big data savvy. With expert insight and clear analysis, the book helps eliminate paralysis-by-analysis and optimize decision making for marketing performance. Nearly seventy-five percent of marketers plan to adopt a big data analytics solution within two years, but many are likely to fail. Despite intensive planning, generous spending, and the best intentions, these initiatives will not succeed without a manager at the helm who is capable of handling the nuances of big data projects. This requires a new way of marketing, and a new approach to data. It means applying new models and metrics to brand new consumer behaviors. Dynamic Customer Strategy clarifies the situation, and highlights the key decisions that have the greatest impact on a company's big data plan. Topics include: Applying the elements of Dynamic Customer Strategy Acquiring, mining, and analyzing data Metrics and models for big data utilization Shifting perspective from model to customer Big data is a tremendous opportunity for marketers and may just be the only factor that will allow marketers to keep pace with the changing consumer and thus keep brands relevant at a time of unprecedented choice. But like any tool, it must be wielded with skill and precision. Dynamic Customer Strategy: Big Profits from Big Data helps marketers shape a strategy that works.

Better Business Decisions from Data

" Everyone encounters statistics on a daily basis. They are used in proposals, reports, requests, and advertisements, among others, to support assertions, opinions, and theories. Unless you're a trained statistician, it can be bewildering. What are the numbers really saying or not saying? Better Business Decisions from Data: Statistical Analysis for Professional Success provides the answers to these questions and more. It will show you how to use statistical data to improve small, every-day management judgments as well as major business decisions with potentially serious consequences. Author Peter Kenny-with deep experience in industry-believes that "while the methods of statistics can be complicated, the meaning of statistics is not." He first outlines the ways in which we are frequently misled by statistical results, either because of our lack of understanding or because we are being misled intentionally. Then he offers sound approaches for understanding and assessing statistical data to make excellent decisions. Kenny assumes no prior knowledge of statistical techniques; he explains concepts simply and shows how the tools are used in various business situations. With the arrival of Big Data, statistical processing has taken on a new level of importance. Kenny lays a foundation for understanding the importance and value of Big Data, and then he shows how mined data can help you see your business in a new light and uncover opportunity. Among other things, this book covers: How statistics can help you assess the probability of a successful outcome How data is collected, sampled, and best interpreted How to make effective forecasts based on the data at hand How to spot the misuse or abuse of statistical evidence in advertisements, reports, and proposals How to commission a statistical analysis Arranged in seven parts-Uncertainties, Data, Samples, Comparisons, Relationships, Forecasts, and Big Data-" Better Business Decisions from Data is a guide for busy people in general management, finance, marketing, operations, and other business disciplines who run across statistics on a daily or weekly basis. You'll return to it again and again as new challenges emerge, making better decisions each time that boost your organization's fortunes—as well as your own.

Using R for Statistics

" R is a popular and growing open source statistical analysis and graphics environment as well as a programming language and platform. If you need to use a variety of statistics, then Using R for Statistics will get you the answers to most of the problems you are likely to encounter. Using R for Statistics is a problem-solution primer for using R to set up your data, pose your problems and get answers using a wide array of statistical tests. The book walks you through R basics and how to use R to accomplish a wide variety statistical operations. You'll be able to navigate the R system, enter and import data, manipulate datasets, calculate summary statistics, create statistical plots and customize their appearance, perform hypothesis tests such as the t-tests and analyses of variance, and build regression models. Examples are built around actual datasets to simulate real-world solutions, and programming basics are explained to assist those who do not have a development background. After reading and using this guide, you'll be comfortable using and applying R to your specific statistical analyses or hypothesis tests. No prior knowledge of R or of programming is assumed, though you should have some experience with statistics. "

Theory and Application of Statistical Energy Analysis, 2nd Edition

This up-to-date second edition provides a comprehensive examination of the theory and application of Statistical Energy Analysis (SEA) in acoustics and vibration. Complete with examples and data taken from real problems this unique book also exploresthe influence of computers on SEA and emphasizes computer based SEA calculations. In addition to a discussion of the relationship between SEA and other procedures used in response estimation, Theory and Application of Statistical Energy Anlaysis, SecondEdition, explores the basic relationships between model and wave descriptions of systems.

Discrete and Continuous Simulation

When it comes to discovering glitches inherent in complex systems—be it a railway or banking, chemical production, medical, manufacturing, or inventory control system—developing a simulation of a system can identify problems with less time, effort, and disruption than it would take to employ the original. Advantageous to both academic and industrial practitioners, Discrete and Continuous Simulation: Theory and Practice offers a detailed view of simulation that is useful in several fields of study. This text concentrates on the simulation of complex systems, covering the basics in detail and exploring the diverse aspects, including continuous event simulation and optimization with simulation. It explores the connections between discrete and continuous simulation, and applies a specific focus to simulation in the supply chain and manufacturing field. It discusses the Monte Carlo simulation, which is the basic and traditional form of simulation. It addresses future trends and technologies for simulation, with particular emphasis given to .NET technologies and cloud computing, and proposes various simulation optimization algorithms from existing literature. Includes chapters on input modeling and hybrid simulation Introduces general probability theory Contains a chapter on Microsoft ® Excel ™ and MATLAB ®/Simulink ® Discusses various probability distributions required for simulation Describes essential random number generators Discrete and Continuous Simulation: Theory and Practice defines the simulation of complex systems. This text benefits academic researchers in industrial/manufacturing/systems engineering, computer sciences, operations research, and researchers in transportation, operations management, healthcare systems, and human–machine systems.

Power Query for Power BI and Excel

" Power Query for Power BI and Excel is a book for people who are tired of copying and pasting data into Excel worksheets. Power Query, part of the Microsoft Power BI suite, is a tool that automates the process of getting data into Excel and will save you hours of dull, repetitive, and error-prone work! Power Query makes it easy to extract data from many different data sources, filter that data, aggregate it, clean it and perform calculations on it, finally loading that data into either your worksheet or directly into the new Excel 2013 Data Model used by Power Pivot. This concise, practical book provides a complete guide to Power Query and how to use it to solve all of your Excel data-loading problems. Power Query for Power BI and Excel goes well beyond the surface of what Power Query can do. The book goes deep into the underlying M language, showing you how to do amazing things that aren't going to be possible from just the GUI interface that is covered in most other books. You'll have full command of the GUI, and you'll be able to drop into the M language to go beyond what the GUI provides. The depth in this book makes it a must-have item for anyone who is pushing Power BI and Excel to their limits in the pursuit of business intelligence from data analysis. " Teaches the basics of using Power Query to load data into Excel Helps you solve common, data-related problems with Power Query Shows how to write your own solutions in the powerful M language

Advanced Backend Optimization

This book is a summary of more than a decade of research in the area of backend optimization. It contains the latest fundamental research results in this field. While existing books are often more oriented toward Masters students, this book is aimed more towards professors and researchers as it contains more advanced subjects. It is unique in the sense that it contains information that has not previously been covered by other books in the field, with chapters on phase ordering in optimizing compilation; register saturation in instruction level parallelism; code size reduction for software pipelining; memory hierarchy effects and instruction level parallelism. Other chapters provide the latest research results in well-known topics such as register need, and software pipelining and periodic register allocation.

Recursive Identification and Parameter Estimation

Recursive Identification and Parameter Estimation describes a recursive approach to solving system identification and parameter estimation problems arising from diverse areas. Supplying rigorous theoretical analysis, it presents the material and proposed algorithms in a manner that makes it easy to understand—providing readers with the modeling and identification skills required for successful theoretical research and effective application. The book begins by introducing the basic concepts of probability theory, including martingales, martingale difference sequences, Markov chains, mixing processes, and stationary processes. Next, it discusses the root-seeking problem for functions, starting with the classic RM algorithm, but with attention mainly paid to the stochastic approximation algorithms with expanding truncations (SAAWET) which serves as the basic tool for recursively solving the problems addressed in the book. The book not only identifies the results of system identification and parameter estimation, but also demonstrates how to apply the proposed approaches for addressing problems in a range of areas, including: Identification of ARMAX systems without imposing restrictive conditions Identification of typical nonlinear systems Optimal adaptive tracking Consensus of multi-agents systems Principal component analysis Distributed randomized PageRank computation This book recursively identifies autoregressive and moving average with exogenous input (ARMAX) and discusses the identification of non-linear systems. It concludes by addressing the problems arising from different areas that are solved by SAAWET. Demonstrating how to apply the proposed approaches to solve problems across a range of areas, the book is suitable for students, researchers, and engineers working in systems and control, signal processing, communication, and mathematical statistics.

The Mystery of Market Movements: An Archetypal Approach to Investment Forecasting and Modelling

A quantifiable framework for unlocking the unconscious forces that shape markets There has long been a notion that subliminal forces play a great part in causing the seemingly irrational financial bubbles, which conventional economic theory, again and again, fails to explain. However, these forces, sometimes labeled 'animal spirits' or 'irrational exuberance, have remained elusive - until now. The Mystery of Market Movements provides you with a methodology to timely predict and profit from changes in human investment behaviour based on the workings of the collective unconscious. Niklas Hageback draws in on one of psychology's most influential ideas - archetypes - to explain how they form investor's perceptions and can be predicted and turned into profit. The Mystery of Market Movements provides; A review of the collective unconscious and its archetypes based on Carl Jung's theories and empirical case studies that highlights and assesses the influences of the collective unconscious on financial bubbles and zeitgeists For the first time being able to objectively measure the impact of archetypal forces on human thoughts and behaviour with a view to provide early warning signals on major turns in the markets. This is done through a step-by-step guide on how to develop a measurement methodology based on an analysis of the language of the unconscious; figurative speech such as metaphors and symbolism, drawn out and deciphered from Big Data sources, allowing for quantification into time series The book is supplemented with an online resource that presents continuously updated bespoken archetypal indexes with predictive capabilities to major financial indexes Investors are often unaware of the real reasons behind their own financial decisions. This book explains why psychological drivers in the collective unconscious dictates not only investment behaviour but also political, cultural and social trends. Understanding these forces allows you to stay ahead of the curve and profit from market tendencies that more traditional methods completely overlook.

Bayesian Networks

Understand the Foundations of Bayesian Networks—Core Properties and Definitions Explained Bayesian Networks: With Examples in R introduces Bayesian networks using a hands-on approach. Simple yet meaningful examples in R illustrate each step of the modeling process. The examples start from the simplest notions and gradually increase in complexity. The authors also distinguish the probabilistic models from their estimation with data sets. The first three chapters explain the whole process of Bayesian network modeling, from structure learning to parameter learning to inference. These chapters cover discrete Bayesian, Gaussian Bayesian, and hybrid networks, including arbitrary random variables. The book then gives a concise but rigorous treatment of the fundamentals of Bayesian networks and offers an introduction to causal Bayesian networks. It also presents an overview of R and other software packages appropriate for Bayesian networks. The final chapter evaluates two real-world examples: a landmark causal protein signaling network paper and graphical modeling approaches for predicting the composition of different body parts. Suitable for graduate students and non-statisticians, this text provides an introductory overview of Bayesian networks. It gives readers a clear, practical understanding of the general approach and steps involved.

Communicating Data with Tableau

Go beyond spreadsheets and tables and design a data presentation that really makes an impact. This practical guide shows you how to use Tableau Software to convert raw data into compelling data visualizations that provide insight or allow viewers to explore the data for themselves. Ideal for analysts, engineers, marketers, journalists, and researchers, this book describes the principles of communicating data and takes you on an in-depth tour of common visualization methods. You’ll learn how to craft articulate and creative data visualizations with Tableau Desktop 8.1 and Tableau Public 8.1. Present comparisons of how much and how many Use blended data sources to create ratios and rates Create charts to depict proportions and percentages Visualize measures of mean, median, and mode Lean how to deal with variation and uncertainty Communicate multiple quantities in the same view Show how quantities and events change over time Use maps to communicate positional data Build dashboards to combine several visualizations

Fundamentals of Applied Probability and Random Processes, 2nd Edition

The long-awaited revision of Fundamentals of Applied Probability and Random Processes expands on the central components that made the first edition a classic. The title is based on the premise that engineers use probability as a modeling tool, and that probability can be applied to the solution of engineering problems. Engineers and students studying probability and random processes also need to analyze data, and thus need some knowledge of statistics. This book is designed to provide students with a thorough grounding in probability and stochastic processes, demonstrate their applicability to real-world problems, and introduce the basics of statistics. The book's clear writing style and homework problems make it ideal for the classroom or for self-study. Demonstrates concepts with more than 100 illustrations, including 2 dozen new drawings Expands readers’ understanding of disruptive statistics in a new chapter (chapter 8) Provides new chapter on Introduction to Random Processes with 14 new illustrations and tables explaining key concepts. Includes two chapters devoted to the two branches of statistics, namely descriptive statistics (chapter 8) and inferential (or inductive) statistics (chapter 9).

Learning NumPy Array

This book, 'Learning NumPy Array,' is the ultimate guide to mastering the fundamental library for numerical computing in Python: NumPy. Through concise explanations and practical examples, you will learn how to create and manipulate arrays, perform complex computations, and leverage NumPy's capabilities to streamline data analysis workflows. What this Book will help me do Install and set up NumPy in your Python environment for numerical computing. Create and manipulate multidimensional arrays to handle and process large data sets. Perform complex mathematical and statistical computations with NumPy's built-in methods. Explore time series analysis and signal processing techniques using NumPy. Optimize and improve the performance of Python code leveraging NumPy's efficient operations. Author(s) Ivan Idris is a seasoned programmer and data scientist with a great passion for Python and numerical computing. With years of experience working on data analysis projects, he has solidified his expertise in Python's scientific libraries, including NumPy. Ivan creates practical, reader-friendly guides that not only teach the technical how-to's but also inspire confidence in solving real-world problems. Who is it for? This book is ideal for Python programmers taking their first steps into the world of numerical computing or data analysis. Beginners looking to understand the basics of handling large numerical datasets in Python will find this resource highly enlightening. Developers and scientists wanting to streamline their calculations using efficient techniques will gain valuable insights. If working with Python in a data-driven environment interests you, this book is for you.