talk-data.com talk-data.com

Topic

statistics

505

tagged

Activity Trend

1 peak/qtr
2020-Q1 2026-Q1

Activities

Showing filtered results

Filtering by: O'Reilly Data Science Books ×
Theory and Methods of Statistics

Theory and Methods of Statistics covers essential topics for advanced graduate students and professional research statisticians. This comprehensive resource covers many important areas in one manageable volume, including core subjects such as probability theory, mathematical statistics, and linear models, and various special topics, including nonparametrics, curve estimation, multivariate analysis, time series, and resampling. The book presents subjects such as "maximum likelihood and sufficiency," and is written with an intuitive, heuristic approach to build reader comprehension. It also includes many probability inequalities that are not only useful in the context of this text, but also as a resource for investigating convergence of statistical procedures. Codifies foundational information in many core areas of statistics into a comprehensive and definitive resource Serves as an excellent text for select master’s and PhD programs, as well as a professional reference Integrates numerous examples to illustrate advanced concepts Includes many probability inequalities useful for investigating convergence of statistical procedures

Applied Regression and Modeling

The book is divided into three parts – (1) prerequisite to regression analysis followed by a discussion on simple regression, (2) multiple regression analysis with applications, and (3) regression and modeling including the second order models, nonlinear regression, and interaction models in regressions. All these sections provide examples with complete computer analysis and instructions commonly used in modeling and analyzing these problems. The book deals with detailed analysis and interpretation of computer results. This will help readers to appreciate the power of computer in applying regression models. The readers will find that the understanding of computer results is critical to implementing regression and modeling in real world situation. The book is written for juniors, seniors and graduate students in business, MBAs, professional MBAs, and working people in business and industry. Managers, practitioners, professionals, quality professionals, quality engineers, and anyone involved in data analysis, business analytics, and quality and six sigma will find the book to be a valuable resource.

Understanding and Applying Basic Statistical Methods Using R

Features a straightforward and concise resource for introductory statistical concepts, methods, and techniques using R Understanding and Applying Basic Statistical Methods Using R uniquely bridges the gap between advances in the statistical literature and methods routinely used by non-statisticians. Providing a conceptual basis for understanding the relative merits and applications of these methods, the book features modern insights and advances relevant to basic techniques in terms of dealing with non-normality, outliers, heteroscedasticity (unequal variances), and curvature. Featuring a guide to R, the book uses R programming to explore introductory statistical concepts and standard methods for dealing with known problems associated with classic techniques. Thoroughly class-room tested, the book includes sections that focus on either R programming or computational details to help the reader become acquainted with basic concepts and principles essential in terms of understanding and applying the many methods currently available. Covering relevant material from a wide range of disciplines, Understanding and Applying Basic Statistical Methods Using R also includes: Numerous illustrations and exercises that use data to demonstrate the practical importance of multiple perspectives Discussions on common mistakes such as eliminating outliers and applying standard methods based on means using the remaining data Detailed coverage on R programming with descriptions on how to apply both classic and more modern methods using R A companion website with the data and solutions to all of the exercises Understanding and Applying Basic Statistical Methods Using R is an ideal textbook for an undergraduate and graduate-level statistics courses in the science and/or social science departments. The book can also serve as a reference for professional statisticians and other practitioners looking to better understand modern statistical methods as well as R programming.

Network Reliability

In Engineering theory and applications, we think and operate in terms of logics and models with some acceptable and reasonable assumptions. The present text is aimed at providing modelling and analysis techniques for the evaluation of reliability measures (2-terminal, all-terminal, k-terminal reliability) for systems whose structure can be described in the form of a probabilistic graph. Among the several approaches of network reliability evaluation, the multiple-variable-inversion sum-of-disjoint product approach finds a well-deserved niche as it provides the reliability or unreliability expression in a most efficient and compact manner. However, it does require an efficiently enumerated minimal inputs (minimal path, spanning tree, minimal k-trees, minimal cut, minimal global-cut, minimal k-cut) depending on the desired reliability. The present book covers these two aspects in detail through the descriptions of several algorithms devised by the ‘reliability fraternity’ and explained through solved examples to obtain and evaluate 2-terminal, k-terminal and all-terminal network reliability/unreliability measures and could be its USP. The accompanying web-based supplementary information containing modifiable Matlab® source code for the algorithms is another feature of this book. A very concerted effort has been made to keep the book ideally suitable for first course or even for a novice stepping into the area of network reliability. The mathematical treatment is kept as minimal as possible with an assumption on the readers’ side that they have basic knowledge in graph theory, probabilities laws, Boolean laws and set theory.

Threat Forecasting

Drawing upon years of practical experience and using numerous examples and illustrative case studies, Threat Forecasting: Leveraging Big Data for Predictive Analysis discusses important topics, including the danger of using historic data as the basis for predicting future breaches, how to use security intelligence as a tool to develop threat forecasting techniques, and how to use threat data visualization techniques and threat simulation tools. Readers will gain valuable security insights into unstructured big data, along with tactics on how to use the data to their advantage to reduce risk. Presents case studies and actual data to demonstrate threat data visualization techniques and threat simulation tools Explores the usage of kill chain modelling to inform actionable security intelligence Demonstrates a methodology that can be used to create a full threat forecast analysis for enterprise networks of any size

Regression Analysis Microsoft® Excel®

This is today’s most complete guide to regression analysis with Microsoft® Excel for any business analytics or research task. Drawing on 25 years of advanced statistical experience, Microsoft MVP Conrad Carlberg shows how to use Excel’s regression-related worksheet functions to perform a wide spectrum of practical analyses. Carlberg clearly explains all the theory you’ll need to avoid mistakes, understand what your regressions are really doing, and evaluate analyses performed by others. From simple correlations and t-tests through multiple analysis of covariance, Carlberg offers hands-on, step-by-step walkthroughs using meaningful examples. He discusses the consequences of using each option and argument, points out idiosyncrasies and controversies associated with Excel’s regression functions, and shows how to use them reliably in fields ranging from medical research to financial analysis to operations. You don’t need expensive software or a doctorate in statistics to work with regression analyses. Microsoft Excel has all the tools you need—and this book has all the knowledge! Understand what regression analysis can and can’t do, and why Master regression-based functions built into all recent versions of Excel Work with correlation and simple regression Make the most of Excel’s improved LINEST() function Plan and perform multiple regression Distinguish the assumptions that matter from the ones that don’t Extend your analysis options by using regression instead of traditional analysis of variance Add covariates to your analysis to reduce bias and increase statistical power

A Course in Statistics with R

Integrates the theory and applications of statistics using R A Course in Statistics with R has been written to bridge the gap between theory and applications and explain how mathematical expressions are converted into R programs. The book has been primarily designed as a useful companion for a Masters student during each semester of the course, but will also help applied statisticians in revisiting the underpinnings of the subject. With this dual goal in mind, the book begins with R basics and quickly covers visualization and exploratory analysis. Probability and statistical inference, inclusive of classical, nonparametric, and Bayesian schools, is developed with definitions, motivations, mathematical expression and R programs in a way which will help the reader to understand the mathematical development as well as R implementation. Linear regression models, experimental designs, multivariate analysis, and categorical data analysis are treated in a way which makes effective use of visualization techniques and the related statistical techniques underlying them through practical applications, and hence helps the reader to achieve a clear understanding of the associated statistical models. Key features: Integrates R basics with statistical concepts Provides graphical presentations inclusive of mathematical expressions Aids understanding of limit theorems of probability with and without the simulation approach Presents detailed algorithmic development of statistical models from scratch Includes practical applications with over 50 data sets

Regression for Economics, Second Edition

Regression analysis can be used to establish causal relationships between factors and the response variable. However, in order to be able to do so, economic theory must be used to provide the causal relationship and then regression analysis is applied to verify the validity of the theory. Regression analysis is the most commonly used analytical tool and can be understood without complex mathematics.  This book simplifies and demystifies regression analysis. All the examples are from economics and in almost all the cases, real data is used to show the application of the method. By limiting the use of mathematical symbols, the author enables a logical reader to learn regression, without shortchanging the subject.  The book is targeted to all business students and executives who need to understand the concept of regression for practical and professional purposes.

Age-Period-Cohort Analysis

This book explores the ways in which statistical models, methods, and research designs can be used to open new possibilities for APC analysis. Within a single, consistent HAPC-GLMM statistical modeling framework, the authors synthesize APC models and methods for three research designs: age-by-time period tables of population rates or proportions, repeated cross-section sample surveys, and accelerated longitudinal panel studies. They show how the empirical application of the models to various problems leads to many fascinating findings on how outcome variables develop along the age, period, and cohort dimensions.

Constrained Principal Component Analysis and Related Techniques

This book shows how constrained principal component analysis (CPCA) offers a unified framework for regression techniques and PCA. Keeping the use of complicated iterative methods to a minimum, the book includes implementation details and many real application examples. It also offers material for methodologically oriented readers interested in developing statistical techniques of their own. MATLAB programs as well as data to create the book's examples are available on the author's website.

Incomplete Categorical Data Design

A self-contained, systematic introduction, this book shows you how to draw valid statistical inferences from survey data with sensitive characteristics. It guides you in applying the non-randomized response approach in surveys and new non-randomized response designs. The techniques covered integrate the strengths of existing approaches, including randomized response models, incomplete categorical data design, the EM algorithm, the bootstrap method, and the data augmentation algorithm. All R codes for the examples are available online.

Statistical Methods for QTL Mapping

While numerous advanced statistical approaches have recently been developed for quantitative trait loci (QTL) mapping, the methods are scattered throughout the literature. This book brings together many recent statistical techniques that address the data complexity of QTL mapping. It emphasizes the modern statistical methodology for QTL mapping as well as the statistical issues that arise during this process. The book gives the necessary biological background for statisticians without training in genetics and, likewise, covers statistical thinking and principles for geneticists.

Stochastic Financial Models

Developed from the esteemed author's advanced undergraduate and graduate courses at the University of Cambridge, this text provides a hands-on, sound introduction to mathematical finance. Assuming no prior knowledge of stochastic calculus or measure-theoretic probability, the author includes the relevant mathematical background as well as many exercises with solutions. He first presents the classical topics of utility and the mean-variance approach to portfolio choice. Focusing on derivative pricing, the text then covers the binomial model, the general discrete-time model, Brownian motion, the Black-Scholes model, and various interest-rate models.

Transportation Statistics and Microsimulation

By discussing statistical concepts in the context of transportation planning and operations, this text provides the necessary background for making informed transportation-related decisions. It explains the why behind standard methods and uses real-world transportation examples and problems to illustrate key concepts. The book covers the statistical techniques most frequently employed by transportation and pavement professionals. To familiarize readers with the underlying theory and equations, it contains problems that can be solved using SAS's JMP package, which enables users to interactively explore and visualize data.

Regression Analysis with Python

Dive into the world of regression analysis guided by Python in this comprehensive book. From simple linear regression to complex models, you'll gain a deep understanding of how to analyze data and predict outcomes. By the end of this book, you will be equipped with the skills to tidy data, build models, and apply regression techniques to real-world problems. What this Book will help me do Understand and format datasets to prepare them for regression analysis efficiently. Build and implement various regression models, such as linear and logistic regression, to solve data science problems. Develop techniques to combat overfitting and ensure predictive accuracy. Learn to scale and adapt regression models to large datasets and apply incremental learning. Apply the skills gained to make informed business decisions using predictive insights from regression models. Author(s) Luca Massaron and Alberto Boschetti are seasoned data professionals with years of expertise in data science, regression analysis, and Python programming. They are passionate about teaching and have crafted this book to demystify regression for learners interested in predictive analytics. Their approachable style ensures concepts are accessible yet comprehensive. Who is it for? This book is ideal for Python developers and data scientists who have a foundational knowledge of math and statistics. Whether you're looking to delve deeper into predictive modeling or efficiently analyze datasets, this book provides step-by-step guidance. If you've dabbled in data science and wish to expand your skillset to include regression analysis, this book is for you!

Perfect Simulation

This book illustrates the application of perfect simulation ideas and algorithms to a wide range of problems. The author describes numerous protocol methodologies for designing algorithms for specific problems. He first examines the commonly used acceptance/rejection (AR) protocol for creating perfect simulation algorithms. He then covers other major protocols, including coupling from the past (CFTP); the Fill, Machida, Murdoch, and Rosenthal (FMMR) method; the randomness recycler; retrospective sampling; and partially recursive AR, along with multiple variants of these protocols.

Quality of Life and Living Standards Analysis

This book is about the concept of “Quality of Life”. What is necessary for quality of life, and how can it be measured? The approach is a multicriterial scheme reduction which prevents as much information loss as possible when shifting from the set of partial criteria to their convolution. This book is written for researchers, analysts and graduate and postgraduate students of mathematics and economics.

Statistics in Toxicology Using R

The apparent contradiction between statistical significance and biological relevance has diminished the value of statistical methods as a whole in toxicology. Moreover, recommendations for statistical analysis are imprecise in most toxicological guidelines. Addressing these dilemmas, Statistics in Toxicology Using R explains the statistical analysis of selected experimental data in toxicology and presents assay-specific suggestions, such as for the in vitro micronucleus assay. Mostly focusing on hypothesis testing, the book covers standardized bioassays for chemicals, drugs, and environmental pollutants. It is organized according to selected toxicological assays, including:Short-term repeated toxicity studiesLong-term carcinogenicity assaysStudies on reproductive toxicityMutagenicity assaysToxicokinetic studiesThe book also discusses proof of safety (particularly in ecotoxicological assays), toxicogenomics, the analysis of interlaboratory studies and the modeling of dose-response relationships for risk assessment. For each toxicological problem, the author describes the statistics involved, matching data example, R code, and outcomes and their interpretation. This approach allows you to select a certain bioassay, identify the specific data structure, run the R code with the data example, understand the test outcome and interpretation, and replace the data set with your own data and run again.Supporting material for this title can be downloaded here.

Multiple Time Series Modeling Using the SAS VARMAX Procedure

Aimed at econometricians who have completed at least one course in time series modeling, Multiple Time Series Modeling Using the SAS VARMAX Procedure will teach you the time series analytical possibilities that SAS offers today. Estimations of model parameters are now performed in a split second. For this reason, working through the identifications phase to find the correct model is unnecessary. Instead, several competing models can be estimated, and their fit can be compared instantaneously.

Consequently, for time series analysis, most of the Box and Jenkins analysis process for univariate series is now obsolete. The former days of looking at cross-correlations and pre-whitening are over, because distributed lag models are easily fitted by an automatic lag identification method. The same goes for bivariate and even multivariate models, for which PROC VARMAX models are automatically fitted. For these models, other interesting variations arise: Subjects like Granger causality testing, feedback, equilibrium, cointegration, and error correction are easily addressed by PROC VARMAX.

One problem with multivariate modeling is that it includes many parameters, making parameterizations unstable. This instability can be compensated for by application of Bayesian methods, which are also incorporated in PROC VARMAX. Volatility modeling has now become a standard part of time series modeling, because of the popularity of GARCH models. Both univariate and multivariate GARCH models are supported by PROC VARMAX. This feature is especially interesting for financial analytics in which risk is a focus.

This book teaches with examples. Readers who are analyzing a time series for the first time will find PROC VARMAX easy to use; readers who know more advanced theoretical time series models will discover that PROC VARMAX is a useful tool for advanced model building.

A First Course in Statistics, 12th Edition

For courses in introductory statistics. A Contemporary Classic Classic, yet contemporary; theoretical, yet applied—McClave & Sincich’s A First Course in Statistics gives you the best of both worlds. This text offers a trusted, comprehensive introduction to statistics that emphasizes inference and integrates real data throughout. The authors stress the development of statistical thinking, the assessment of credibility, and value of the inferences made from data. This new edition is extensively revised with an eye on clearer, more concise language throughout the text and in the exercises. Ideal for one- or two-semester courses in introductory statistics, this text assumes a mathematical background of basic algebra. Flexibility is built in for instructors who teach a more advanced course, with optional footnotes about calculus and the underlying theory. Also available with MyStatLab MyStatLab™ is an online homework, tutorial, and assessment program designed to work with this text to engage students and improve results. Within its structured environment, students practice what they learn, test their understanding, and pursue a personalized study plan that helps them absorb course material and understand difficult concepts. For this edition, MyStatLab offers 30% new and updated exercises. Note: You are purchasing a standalone product; MyLab™ & Mastering™ does not come packaged with this content. Students, if interested in purchasing this title with MyLab & Mastering, ask your instructor for the correct package ISBN and Course ID. Instructors, contact your Pearson representative for more information. If you would like to purchase both the physical text and MyLab & Mastering, search for: 0134090438 / 9780134090436 * Statistics Plus New MyStatLab with Pearson eText -- Access Card Package Package consists of: 0134080211 / 9780134080215 * Statistics 0321847997 / 9780321847997 * My StatLab Glue-in Access Card 032184839X / 9780321848390 * MyStatLab Inside Sticker for Glue-In Packages