talk-data.com talk-data.com

Event

O'Reilly Data Science Books

2013-08-09 – 2026-02-25 Oreilly Visit website ↗

Activities tracked

794

Collection of O'Reilly books on Data Science.

Filtering by: data-science-tasks ×

Sessions & talks

Showing 576–600 of 794 · Newest first

Search within this event →
Applied Data Mining

Data mining has witnessed substantial advances in recent decades. New research questions and practical challenges have arisen from emerging areas and applications within the various fields closely related to human daily life, e.g. social media and social networking. This book aims to bridge the gap between traditional data mining and the latest advances in newly emerging information services. It explores the extension of well-studied algorithms and approaches into these new research arenas.

Applied Multiple Regression/Correlation Analysis for the Behavioral Sciences, 3rd Edition

This classic text on multiple regression is noted for its nonmathematical, applied, and data-analytic approach. Readers profit from its verbal-conceptual exposition and frequent use of examples. The applied emphasis provides clear illustrations of the principles and provides worked examples of the types of applications that are possible. Researchers learn how to specify regression models that directly address their research questions. An overview of the fundamental ideas of multiple regression and a review of bivariate correlation and regression and other elementary statistical concepts provide a strong foundation for understanding the rest of the text. The third edition features an increased emphasis on graphics and the use of confidence intervals and effect size measures, and an accompanying website with data for most of the numerical examples along with the computer code for SPSS, SAS, and SYSTAT, at www.psypress.com/9780805822236 . Applied Multiple Regression serves as both a textbook for graduate students and as a reference tool for researchers in psychology, education, health sciences, communications, business, sociology, political science, anthropology, and economics. An introductory knowledge of statistics is required. Self-standing chapters minimize the need for researchers to refer to previous chapters.

Measuring Sustainability

' Measuring the sustainability of development is crucial to achieving it, and is one of the most actively studied issues in the area. To date, most studies of measurements or indicators have been largely theoretical.

However, this book, a follow-on to Bell and Morse's highly influential Sustainability Indicators (1999), presents valuable practical advice on how to develop measurements that will work in real-life development contexts. It describes and analyses how to derive, validate and apply indicators in the course of an actual development project - in this case the Mediterranean Action Plan in Malta.

The authors explain the trade-offs and constraints involved and how it is possible to combine the open-ended and flexible perspectives of sustainability with the more linear processes and fixed targets of specific projects through the use of pragmatic and reflective methodologies.

Navigating Strategic Decisions

Based on forty years of experience and research, this book provides guidance on forecasting for strategic decision making. It includes methodology, tools, and models. It also explains how to apply sanity checks to existing forecasts to rank project valuations, identify project risks, and select the higher value creation projects. The author discusses how to assess the feasibility of large projects, analyze forecasting models to determine controllable levers, and create the conditions needed for forecasts to materialize.

Improving Surveys with Paradata: Analytic Uses of Process Information

Explore the practices and cutting-edge research on the new and exciting topic of paradata Paradata are measurements related to the process of collecting survey data. Improving Surveys with Paradata: Analytic Uses of Process Information is the most accessible and comprehensive contribution to this up-and-coming area in survey methodology. Featuring contributions from leading experts in the field, Improving Surveys with Paradata: Analytic Uses of Process Information introduces and reviews issues involved in the collection and analysis of paradata. The book presents readers with an overview of the indispensable techniques and new, innovative research on improving survey quality and total survey error. Along with several case studies, topics include: Using paradata to monitor fieldwork activity in face-to-face, telephone, and web surveys Guiding intervention decisions during data collection Analysis of measurement, nonresponse, and coverage error via paradata Providing a practical, encompassing guide to the subject of paradata, the book is aimed at both producers and users of survey data. Improving Surveys with Paradata: Analytic Uses of Process The book also serves as an excellent resource for courses on data collection, survey methodology, and nonresponse and measurement error.

Excel Dashboards and Reports, 2nd Edition

Learn to use Excel dashboards and reports to better conceptualize data Updated for all the latest features and capabilities of Excel 2013, this go-to resource provides you with in-depth coverage of the individual functions and tools that can be used to create compelling Excel reports. Veteran author Michael Alexander walks you through the most effective ways to present and report data. Featuring a comprehensive review of a wide array of technical and analytical concepts, this essential guide helps you go from reporting data with simple tables full of dull numbers to presenting key information through the use of high-impact, meaningful reports and dashboards that will wow management both visually and substantively. Details how to analyze large amounts of data and report the results in a way that is both visually attractive and effective Describes how to use different perspectives to achieve better visibility into data, as well as how to slice data into various views on the fly Shows how to automate redundant reporting and analysis processes Walks you through creating impressive dashboards, eye-catching visualizations, and real-world What-If analyses Excel Dashboards and Reports, Second Edition is part technical manual, part analytical guidebook, and exactly what you need to become your organization's dashboard dynamo!

Handbook of Statistics

Statistical learning and analysis techniques have become extremely important today, given the tremendous growth in the size of heterogeneous data collections and the ability to process it even from physically distant locations. Recent advances made in the field of machine learning provide a strong framework for robust learning from the diverse corpora and continue to impact a variety of research problems across multiple scientific disciplines. The aim of this handbook is to familiarize beginners as well as experts with some of the recent techniques in this field. The Handbook is divided in two sections: Theory and Applications, covering machine learning, data analytics, biometrics, document recognition and security. very relevant to current research challenges faced in various fields self-contained reference to machine learning emphasis on applications-oriented techniques

Microsoft Visio 2013 Step by Step

The smart way to learn Microsoft Visio 2013—one step at a time! Experience learning made easy—and quickly teach yourself how to create professional-looking business and technical diagrams with Visio 2013. With Step by Step, you set the pace—building and practicing the skills you need, just when you need them! Create dynamic organization charts with Visio Make charts with wizards or build them by hand Build drawings using Visio themes and effects Use data-driven drawings in Microsoft SharePoint Import, manipulate, and visualize business data Draw and then execute SharePoint 2013 workflows

Statistics and Probability with Applications for Engineers and Scientists

Introducing the tools of statistics and probability from the ground up An understanding of statistical tools is essential for engineers and scientists who often need to deal with data analysis over the course of their work. Statistics and Probability with Applications for Engineers and Scientists walks readers through a wide range of popular statistical techniques, explaining step-by-step how to generate, analyze, and interpret data for diverse applications in engineering and the natural sciences. Unique among books of this kind, Statistics and Probability with Applications for Engineers and Scientists covers descriptive statistics first, then goes on to discuss the fundamentals of probability theory. Along with case studies, examples, and real-world data sets, the book incorporates clear instructions on how to use the statistical packages Minitab and Microsoft Office Excel to analyze various data sets. The book also features: Detailed discussions on sampling distributions, statistical estimation of population parameters, hypothesis testing, reliability theory, statistical quality control including Phase I and Phase II control charts, and process capability indices A clear presentation of nonparametric methods and simple and multiple linear regression methods, as well as a brief discussion on logistic regression method Comprehensive guidance on the design of experiments, including randomized block designs, one- and two-way layout designs, Latin square designs, random effects and mixed effects models, factorial and fractional factorial designs, and response surface methodology A companion website containing data sets for Minitab and Microsoft Office Excel, as well as JMP routines and results Assuming no background in probability and statistics, Statistics and Probability with Applications for Engineers and Scientists features a unique, yet tried-and-true, approach that is ideal for all undergraduate students as well as statistical practitioners who analyze and illustrate real-world data in engineering and the natural sciences.

An Accidental Statistician: The Life and Memories of George E. P. Box

Celebrating the life of an admired pioneer in statistics In this captivating and inspiring memoir, world-renowned statistician George E. P. Box offers a firsthand account of his life and statistical work. Writing in an engaging, charming style, Dr. Box reveals the unlikely events that led him to a career in statistics, beginning with his job as a chemist conducting experiments for the British army during World War II. At this turning point in his life and career, Dr. Box taught himself the statistical methods necessary to analyze his own findings when there were no statisticians available to check his work. Throughout his autobiography, Dr. Box expertly weaves a personal and professional narrative to illustrate the effects his work had on his life and vice-versa. Interwoven between his research with time series analysis, experimental design, and the quality movement, Dr. Box recounts coming to the United States, his family life, and stories of the people who mean the most to him. This fascinating account balances the influence of both personal and professional relationships to demonstrate the extraordinary life of one of the greatest and most influential statisticians of our time. An Accidental Statistician also features: Two forewords written by Dr. Box's former colleagues and closest confidants Personal insights from more than a dozen statisticians on how Dr. Box has influenced and continues to touch their careers and lives Numerous, previously unpublished photos from the author's personal collection An Accidental Statistician is a compelling read for statisticians in education or industry, mathematicians, engineers, and anyone interested in the life story of an influential intellectual who altered the world of modern statistics.

Success Probability Estimation with Applications to Clinical Trials

Provides an introduction to the various statistical techniques involved in medical research and drug development with a focus on estimating the success probability of an experiment Success Probability Estimation with Applications to Clinical Trials details the use of success probability estimation in both the planning and analyzing of clinical trials and in widely used statistical tests. Devoted to both statisticians and non-statisticians who are involved in clinical trials, Part I of the book presents new concepts related to success probability estimation and their usefulness in clinical trials, and each section begins with a non-technical explanation of the presented concepts. Part II delves deeper into the techniques for success probability estimation and features applications to both reproducibility probability estimation and conservative sample size estimation. Success Probability Estimation with Applications to Clinical Trials: Addresses the theoretical and practical aspects of the topic and introduces new and promising techniques in the statistical and pharmaceutical industries Features practical solutions for problems that are often encountered in clinical trials Includes success probability estimation for widely used statistical tests, such as parametric and nonparametric models Focuses on experimental planning, specifically the sample size of clinical trials using phase II results and data for planning phase III trials Introduces statistical concepts related to success probability estimation and their usefulness in clinical trials Success Probability Estimation with Applications to Clinical Trials is an ideal reference for statisticians and biostatisticians in the pharmaceutical industry as well as researchers and practitioners in medical centers who are actively involved in health policy, clinical research, and the design and evaluation of clinical trials.

Optimal Resource Allocation: With Practical Statistical Applications and Theory

A UNIQUE ENGINEERING AND STATISTICAL APPROACH TO OPTIMAL RESOURCE ALLOCATION Optimal Resource Allocation: With Practical Statistical Applications and Theory features the application of probabilistic and statistical methods used in reliability engineering during the different phases of life cycles of technical systems. Bridging the gap between reliability engineering and applied mathematics, the book outlines different approaches to optimal resource allocation and various applications of models and algorithms for solving real-world problems. In addition, the fundamental background on optimization theory and various illustrative numerical examples are provided. The book also features: An overview of various approaches to optimal resource allocation, from classical Lagrange methods to modern algorithms based on ideas of evolution in biology Numerous exercises and case studies from a variety of areas, including communications, transportation, energy transmission, and counterterrorism protection The applied methods of optimization with various methods of optimal redundancy problem solutions as well as the numerical examples and statistical methods needed to solve the problems Practical thoughts, opinions, and judgments on real-world applications of reliability theory and solves practical problems using mathematical models and algorithms Optimal Resource Allocation is a must-have guide for electrical, mechanical, and reliability engineers dealing with engineering design and optimal reliability problems. In addition, the book is excellent for graduate and PhD-level courses in reliability theory and optimization.

Applied Logistic Regression, 3rd Edition

A new edition of the definitive guide to logistic regression modeling for health science and other applications This thoroughly expanded Third Edition provides an easily accessible introduction to the logistic regression (LR) model and highlights the power of this model by examining the relationship between a dichotomous outcome and a set of covariables. Applied Logistic Regression, Third Edition emphasizes applications in the health sciences and handpicks topics that best suit the use of modern statistical software. The book provides readers with state-of-the-art techniques for building, interpreting, and assessing the performance of LR models. New and updated features include: A chapter on the analysis of correlated outcome data A wealth of additional material for topics ranging from Bayesian methods to assessing model fit Rich data sets from real-world studies that demonstrate each method under discussion Detailed examples and interpretation of the presented results as well as exercises throughout Applied Logistic Regression, Third Edition is a must-have guide for professionals and researchers who need to model nominal or ordinal scaled outcome variables in public health, medicine, and the social sciences as well as a wide range of other fields and disciplines.

Practical Time Series Analysis Using SAS

Anders Milhøj's Practical Time Series Analysis Using SAS explains and demonstrates through examples how you can use SAS for time series analysis. It offers modern procedures for forecasting, seasonal adjustments, and decomposition of time series that can be used without involved statistical reasoning. The book teaches, with numerous examples, how to apply these procedures with very simple coding. In addition, it also gives the statistical background for interested readers. Beginning with an introductory chapter that covers the practical handling of time series data in SAS using the TIMESERIES and EXPAND procedures, it goes on to explain forecasting, which is found in the ESM procedure; seasonal adjustment, including trading-day correction using PROC X12; and unobserved component models using the UCM procedure.

This book is part of the SAS Press program.

Statistical Analysis with Excel For Dummies, 3rd Edition

Take the mystery out of statistical terms and put Excel to work! If you need to create and interpret statistics in business or classroom settings, this easy-to-use guide is just what you need. It shows you how to use Excel's powerful tools for statistical analysis, even if you've never taken a course in statistics. Learn the meaning of terms like mean and median, margin of error, standard deviation, and permutations, and discover how to interpret the statistics of everyday life. You'll learn to use Excel formulas, charts, PivotTables, and other tools to make sense of everything from sports stats to medical correlations. Statistics have a reputation for being challenging and math-intensive; this friendly guide makes statistical analysis with Excel easy to understand Explains how to use Excel to crunch numbers and interpret the statistics of everyday life: sales figures, gambling odds, sports stats, a grading curve, and much more Covers formulas and functions, charts and PivotTables, samples and normal distributions, probabilities and related distributions, trends, and correlations Clarifies statistical terms such as median vs. mean, margin of error, standard deviation, correlations, and permutations Statistical Analysis with Excel For Dummies, 3rd Edition helps you make sense of statistics and use Excel's statistical analysis tools in your daily life.

Statistical Methods with Applications to Demography and Life Insurance

Suitable for statisticians, mathematicians, actuaries, and students interested in the problems of insurance and analysis of lifetimes, this book presents contemporary statistical techniques for analyzing life distributions and life insurance problems. It mainly focuses on the analysis of an individual life and describes statistical methods based on empirical and related processes. The text not only contains traditional material but also incorporates new problems and techniques not discussed in existing actuarial literature. Coverage ranges from analyzing the tails of distributions of lifetimes to modeling population dynamics with migrations.

Solutions Manual to Accompany Introduction to Linear Regression Analysis, 5th Edition

As the Solutions Manual, this book is meant to accompany the main title, Introduction to Linear Regression Analysis, Fifth Edition. Clearly balancing theory with applications, this book describes both the conventional and less common uses of linear regression in the practical context of today's mathematical and scientific research. Beginning with a general introduction to regression modeling, including typical applications, the book then outlines a host of technical tools that form the linear regression analytical arsenal, including: basic inference procedures and introductory aspects of model adequacy checking; how transformations and weighted least squares can be used to resolve problems of model inadequacy; how to deal with influential observations; and polynomial regression models and their variations. The book also includes material on regression models with autocorrelated errors, bootstrapping regression estimates, classification and regression trees, and regression model validation.

Probability, Statistics and Random Processes

Probability, Statistics and Random Processes is designed to meet the requirements of students and is intended for beginners to help them understand the concepts from the first principles. Spread across 16 chapters, it discusses the theoretical aspects that have been refined and updated to reflect the current developments in the subjects. It expounds on theoretical concepts that have immense practical applications, giving adequate proofs to establish significant theorems.

Underwater Acoustic Modeling and Simulation, Fourth Edition, 4th Edition

Underwater Acoustic Modeling and Simulation, Fourth Edition continues to provide the most authoritative overview of currently available propagation, noise, reverberation, and sonar-performance models. This fourth edition of a bestseller discusses the fundamental processes involved in simulating the performance of underwater acoustic systems and emphasizes the importance of applying the proper modeling resources to simulate the behavior of sound in virtual ocean environments. New to the Fourth Edition Extensive new material that addresses recent advances in inverse techniques and marine-mammal protection Problem sets in each chapter Updated and expanded inventories of available models Designed for readers with an understanding of underwater acoustics but who are unfamiliar with the various aspects of modeling, the book includes sufficient mathematical derivations to demonstrate model formulations and provides guidelines for selecting and using the models. Examples of each type of model illustrate model formulations, model assumptions, and algorithm efficiency. Simulation case studies are also included to demonstrate practical applications. Providing a thorough source of information on modeling resources, this book examines the translation of our physical understanding of sound in the sea into mathematical models that simulate acoustic propagation, noise, and reverberation in the ocean. The text shows how these models are used to predict and diagnose the performance of complex sonar systems operating in the undersea environment.

Essentials of Mathematical Statistics

Part of the Jones and Bartlett Learning International Series in Mathematics

Written for the one-term introductory probability and statistics course for mid- to upper-level math and science majors, Essentials of Mathematical Statistics combines the topics generally found in main-stream elementary statistics books with the essentials of the underlying theory. The book begins with an axiomatic treatment of probability followed by chapters on discrete and continuous random variables and their associated distributions. It then introduces basic statistical concepts including summarizing data and interval parameter estimation, stressing the connection between probability and statistics. Final chapters introduce hypothesis testing, regression, and non-parametric techniques. All chapters provide a balance between conceptual understanding and theoretical understanding of the topics at hand.

Key Features of Essentials of Mathematical Statistics:

  • End-of-section exercises range from computational to conceptual to theoretical.
  • Many sections include a sub-section titled “Software Calculations” which gives detailed descriptions of how to perform the calculations discussed in the section using the software Minitab, R, Excel, and the TI-83/84 calculators.
  • Provides a clear balance between conceptual understanding and theoretical understanding
  • Exercises throughout vary in level of difficulty and scope.
Predictive Analytics: The Power to Predict Who Will Click, Buy, Lie, or Die

"The Freakonomics of big data." —Stein Kretsinger, founding executive of Advertising.com; former lead analyst at Capital One This book is easily understood by all readers. Rather than a "how to" for hands-on techies, the book entices lay-readers and experts alike by covering new case studies and the latest state-of-the-art techniques. You have been predicted — by companies, governments, law enforcement, hospitals, and universities. Their computers say, "I knew you were going to do that!" These institutions are seizing upon the power to predict whether you're going to click, buy, lie, or die. Why? For good reason: predicting human behavior combats financial risk, fortifies healthcare, conquers spam, toughens crime fighting, and boosts sales. How? Prediction is powered by the world's most potent, booming unnatural resource: data. Accumulated in large part as the by-product of routine tasks, data is the unsalted, flavorless residue deposited en masse as organizations churn away. Surprise! This heap of refuse is a gold mine. Big data embodies an extraordinary wealth of experience from which to learn. Predictive analytics unleashes the power of data. With this technology, the computer literally learns from data how to predict the future behavior of individuals. Perfect prediction is not possible, but putting odds on the future — lifting a bit of the fog off our hazy view of tomorrow — means pay dirt. In this rich, entertaining primer, former Columbia University professor and Predictive Analytics World founder Eric Siegel reveals the power and perils of prediction: What type of mortgage risk Chase Bank predicted before the recession. Predicting which people will drop out of school, cancel a subscription, or get divorced before they are even aware of it themselves. Why early retirement decreases life expectancy and vegetarians miss fewer flights. Five reasons why organizations predict death, including one health insurance company. How U.S. Bank, European wireless carrier Telenor, and Obama's 2012 campaign calculated the way to most strongly influence each individual. How IBM's Watson computer used predictive modeling to answer questions and beat the human champs on TV's Jeopardy! How companies ascertain untold, private truths — how Target figures out you're pregnant and Hewlett-Packard deduces you're about to quit your job. How judges and parole boards rely on crime-predicting computers to decide who stays in prison and who goes free. What's predicted by the BBC, Citibank, ConEd, Facebook, Ford, Google, IBM, the IRS, Match.com, MTV, Netflix, Pandora, PayPal, Pfizer, and Wikipedia. A truly omnipresent science, predictive analytics affects everyone, every day. Although largely unseen, it drives millions of decisions, determining whom to call, mail, investigate, incarcerate, set up on a date, or medicate. Predictive analytics transcends human perception. This book's final chapter answers the riddle: What often happens to you that cannot be witnessed, and that you can't even be sure has happened afterward — but that can be predicted in advance? Whether you are a consumer of it — or consumed by it — get a handle on the power of Predictive Analytics.

Introduction to Statistics Through Resampling Methods and R, 2nd Edition

A highly accessible alternative approach to basic statistics Praise for the First Edition: "Certainly one of the most impressive little paperback 200-page introductory statistics books that I will ever see . . . it would make a good nightstand book for every statistician."—Technometrics Written in a highly accessible style, Introduction to Statistics through Resampling Methods and R, Second Edition guides students in the understanding of descriptive statistics, estimation, hypothesis testing, and model building. The book emphasizes the discovery method, enabling readers to ascertain solutions on their own rather than simply copy answers or apply a formula by rote. The Second Edition utilizes the R programming language to simplify tedious computations, illustrate new concepts, and assist readers in completing exercises. The text facilitates quick learning through the use of: More than 250 exercises—with selected "hints"—scattered throughout to stimulate readers' thinking and to actively engage them in applying their newfound skills An increased focus on why a method is introduced Multiple explanations of basic concepts Real-life applications in a variety of disciplines Dozens of thought-provoking, problem-solving questions in the final chapter to assist readers in applying statistics to real-life applications Introduction to Statistics through Resampling Methods and R, Second Edition is an excellent resource for students and practitioners in the fields of agriculture, astrophysics, bacteriology, biology, botany, business, climatology, clinical trials, economics, education, epidemiology, genetics, geology, growth processes, hospital administration, law, manufacturing, marketing, medicine, mycology, physics, political science, psychology, social welfare, sports, and toxicology who want to master and learn to apply statistical methods.

Chi-Squared Goodness of Fit Tests with Applications

Chi-Squared Goodness of Fit Tests with Applications provides a thorough and complete context for the theoretical basis and implementation of Pearson’s monumental contribution and its wide applicability for chi-squared goodness of fit tests. The book is ideal for researchers and scientists conducting statistical analysis in processing of experimental data as well as to students and practitioners with a good mathematical background who use statistical methods. The historical context, especially Chapter 7, provides great insight into importance of this subject with an authoritative author team. This reference includes the most recent application developments in using these methods and models. Systematic presentation with interesting historical context and coverage of the fundamentals of the subject Presents modern model validity methods, graphical techniques, and computer-intensive methods Recent research and a variety of open problems Interesting real-life examples for practitioners

A First Course in Probability and Markov Chains

Provides an introduction to basic structures of probability with a view towards applications in information technology A First Course in Probability and Markov Chains presents an introduction to the basic elements in probability and focuses on two main areas. The first part explores notions and structures in probability, including combinatorics, probability measures, probability distributions, conditional probability, inclusion-exclusion formulas, random variables, dispersion indexes, independent random variables as well as weak and strong laws of large numbers and central limit theorem. In the second part of the book, focus is given to Discrete Time Discrete Markov Chains which is addressed together with an introduction to Poisson processes and Continuous Time Discrete Markov Chains. This book also looks at making use of measure theory notations that unify all the presentation, in particular avoiding the separate treatment of continuous and discrete distributions. A First Course in Probability and Markov Chains: Presents the basic elements of probability. Explores elementary probability with combinatorics, uniform probability, the inclusion-exclusion principle, independence and convergence of random variables. Features applications of Law of Large Numbers. Introduces Bernoulli and Poisson processes as well as discrete and continuous time Markov Chains with discrete states. Includes illustrations and examples throughout, along with solutions to problems featured in this book. The authors present a unified and comprehensive overview of probability and Markov Chains aimed at educating engineers working with probability and statistics as well as advanced undergraduate students in sciences and engineering with a basic background in mathematical analysis and linear algebra.