talk-data.com talk-data.com

Topic

data-science

2252

tagged

Activity Trend

1 peak/qtr
2020-Q1 2026-Q1

Activities

2252 activities · Newest first

Illuminating Statistical Analysis Using Scenarios and Simulations

Features an integrated approach of statistical scenarios and simulations to aid readers in developing key intuitions needed to understand the wide ranging concepts and methods of statistics and inference Illuminating Statistical Analysis Using Scenarios and Simulations presents the basic concepts of statistics and statistical inference using the dual mechanisms of scenarios and simulations. This approach helps readers develop key intuitions and deep understandings of statistical analysis. Scenario-specific sampling simulations depict the results that would be obtained by a very large number of individuals investigating the same scenario, each with their own evidence, while graphical depictions of the simulation results present clear and direct pathways to intuitive methods for statistical inference. These intuitive methods can then be easily linked to traditional formulaic methods, and the author does not simply explain the linkages, but rather provides demonstrations throughout for a broad range of statistical phenomena. In addition, induction and deduction are repeatedly interwoven, which fosters a natural "need to know basis" for ordering the topic coverage. Examining computer simulation results is central to the discussion and provides an illustrative way to (re)discover the properties of sample statistics, the role of chance, and to (re)invent corresponding principles of statistical inference. In addition, the simulation results foreshadow the various mathematical formulas that underlie statistical analysis. In addition, this book: • Features both an intuitive and analytical perspective and includes a broad introduction to the use of Monte Carlo simulation and formulaic methods for statistical analysis • Presents straight-forward coverage of the essentials of basic statistics and ensures proper understanding of key concepts such as sampling distributions, the effects of sample size and variance on uncertainty, analysis of proportion, mean and rank differences, covariance, correlation, and regression • Introduces advanced topics such as Bayesian statistics, data mining, model cross-validation, robust regression, and resampling • Contains numerous example problems in each chapter with detailed solutions as well as an appendix that serves as a manual for constructing simulations quickly and easily using Microsoft® Office Excel® Illuminating Statistical Analysis Using Scenarios and Simulations is an ideal textbook for courses, seminars, and workshops in statistics and statistical inference and is appropriate for self-study as well. The book also serves as a thought-provoking treatise for researchers, scientists, managers, technicians, and others with a keen interest in statistical analysis. Jeffrey E. Kottemann, Ph.D., is Professor in the Perdue School at Salisbury University. Dr. Kottemann has published articles in a wide variety of academic research journals in the fields of business administration, computer science, decision sciences, economics, engineering, information systems, psychology, and public administration. He received his Ph.D. in Systems and Quantitative Methods from the University of Arizona.

Statistical Techniques for Transportation Engineering

Statistical Techniques for Transportation Engineering is written with a systematic approach in mind and covers a full range of data analysis topics, from the introductory level (basic probability, measures of dispersion, random variable, discrete and continuous distributions) through more generally used techniques (common statistical distributions, hypothesis testing), to advanced analysis and statistical modeling techniques (regression, AnoVa, and time series). The book also provides worked out examples and solved problems for a wide variety of transportation engineering challenges. Demonstrates how to effectively interpret, summarize, and report transportation data using appropriate statistical descriptors Teaches how to identify and apply appropriate analysis methods for transportation data Explains how to evaluate transportation proposals and schemes with statistical rigor

Beginning Power BI: A Practical Guide to Self-Service Data Analytics with Excel 2016 and Power BI Desktop, Second Edition

Analyze your company's data quickly and easily using Microsoft's latest tools. You will learn to build scalable and robust data models to work from, clean and combine different data sources effectively, and create compelling visualizations and share them with your colleagues. Author Dan Clark takes you through each topic using step-by-step activities and plenty of screen shots to help familiarize you with the tools. This second edition includes new material on advanced uses of Power Query, along with the latest user guidance on the evolving Power BI platform. Beginning Power BI is your hands-on guide to quick, reliable, and valuable data insight. What You'll Learn Simplify data discovery, association, and cleansing Build solid analytical data models Create robust interactive data presentations Combine analytical and geographic data in map-based visualizations Publish and share dashboards and reports Who This Book Is For Business analysts, database administrators, developers, and other professionals looking to better understand and communicate with data

Big Data Visualization

Dive into 'Big Data Visualization' and uncover how to tackle the challenges of visualizing vast quantities of complex data. With a focus on scalable and dynamic techniques, this guide explores the nuances of effective data analysis. You'll master tools and approaches to display, interpret, and communicate data in impactful ways. What this Book will help me do Understand the fundamentals of big data visualization, including unique challenges and solutions. Explore practical techniques for using D3 and Python to visualize and detect anomalies in big data. Learn to leverage dashboards like Tableau to present data insights effectively. Address and improve data quality issues to enhance analysis accuracy. Gain hands-on experience with real-world use cases for tools such as Hadoop and Splunk. Author(s) James D. Miller is an IBM-certified expert specializing in data analytics and visualization. With years of experience handling massive datasets and extracting actionable insights, he is dedicated to sharing his expertise. His practical approach is evident in how he combines tool mastery with a clear understanding of data complexities. Who is it for? This book is designed for data analysts, data scientists, and others involved in interpreting and presenting big datasets. Whether you are a beginner looking to understand big data visualization or an experienced professional seeking advanced tools and techniques, this guide suits your needs perfectly. A foundational knowledge in programming languages like R and big data platforms such as Hadoop is recommended to maximize your learning.

Data Visualization with D3 4.x Cookbook - Second Edition

This book, 'Data Visualization with D3 4.x Cookbook' by Nick Zhu, is your ultimate guide to mastering data visualization using D3.js. Through practical recipes, you'll learn to create dynamic, data-driven visualizations and tackle real-world visualization challenges. The book also introduces techniques to manage and present data powerfully. What this Book will help me do Master D3.js 4.x features to create efficient data visualizations. Utilize pre-built recipes to generate diverse charts and graphs. Acquire expertise in manipulating datasets for visualization. Develop interactive, dynamic web applications with D3. Overcome common visualization challenges with practical solutions. Author(s) Nick Zhu is a professional data engineer and an expert in creating data-driven applications. With years of experience using D3.js, Nick brings his wealth of knowledge to writing, making complex concepts accessible to learners. He creates resources to help others enhance their data visualization skills. Who is it for? This book is ideal for developers and data analysts familiar with web technologies like HTML, CSS, and JavaScript, aiming to expand their skills with D3.js. Whether you're new to D3 or experienced and looking for a comprehensive reference, this book will empower you to create professional-grade visualizations.

The Data Science Handbook

A comprehensive overview of data science covering the analytics, programming, and business skills necessary to master the discipline Finding a good data scientist has been likened to hunting for a unicorn: the required combination of technical skills is simply very hard to find in one person. In addition, good data science is not just rote application of trainable skill sets; it requires the ability to think flexibly about all these areas and understand the connections between them. This book provides a crash course in data science, combining all the necessary skills into a unified discipline. Unlike many analytics books, computer science and software engineering are given extensive coverage since they play such a central role in the daily work of a data scientist. The author also describes classic machine learning algorithms, from their mathematical foundations to real-world applications. Visualization tools are reviewed, and their central importance in data science is highlighted. Classical statistics is addressed to help readers think critically about the interpretation of data and its common pitfalls. The clear communication of technical results, which is perhaps the most undertrained of data science skills, is given its own chapter, and all topics are explained in the context of solving real-world data problems. The book also features: • Extensive sample code and tutorials using Python™ along with its technical libraries • Core technologies of “Big Data,” including their strengths and limitations and how they can be used to solve real-world problems • Coverage of the practical realities of the tools, keeping theory to a minimum; however, when theory is presented, it is done in an intuitive way to encourage critical thinking and creativity • A wide variety of case studies from industry • Practical advice on the realities of being a data scientist today, including the overall workflow, where time is spent, the types of datasets worked on, and the skill sets needed The Data Science Handbook is an ideal resource for data analysis methodology and big data software tools. The book is appropriate for people who want to practice data science, but lack the required skill sets. This includes software professionals who need to better understand analytics and statisticians who need to understand software. Modern data science is a unified discipline, and it is presented as such. This book is also an appropriate reference for researchers and entry-level graduate students who need to learn real-world analytics and expand their skill set. FIELD CADY is the data scientist at the Allen Institute for Artificial Intelligence, where he develops tools that use machine learning to mine scientific literature. He has also worked at Google and several Big Data startups. He has a BS in physics and math from Stanford University, and an MS in computer science from Carnegie Mellon.

JMP 13 Consumer Research, Second Edition, 2nd Edition

JMP 13 Consumer Research focuses on analyses that help users observe and predict subject's behavior, particularly those in the market research field. The Uplift platform predicts consumer behavior based on shifts in marketing efforts. Learn how to tabulate and summarize categorical responses with the Categorical platform. Factor Analysis rotates principal components to help identify which directions have the most variation among the variables. The book also covers Item Analysis, a method for identifying latent traits that might affect an individual's choices. And read about the Choice platform, which market researchers use to estimate probability in consumer spending.

JMP 13 Design of Experiments Guide, Second Edition, 2nd Edition

The JMP 13 Design of Experiments Guide covers classic DOE designs (for example, full factorial, response surface, and mixture designs). Read about more flexible custom designs, which you generate to fit your particular experimental situation. And discover JMP’s definitive screening designs, an efficient way to identify important factor interactions using fewer runs than required by traditional designs. The book also provides guidance on determining an appropriate sample size for your study.

JMP 13 Fitting Linear Models, Second Edition, 2nd Edition

JMP 13 Fitting Linear Models focuses on the Fit Model platform and many of its personalities. Linear and logistic regression, analysis of variance and covariance, and stepwise procedures are covered. Also included are multivariate analysis of variance, mixed models, generalized models, and models based on penalized regression techniques.

JMP 13 Multivariate Methods, Second Edition, 2nd Edition

JMP 13 Multivariate Methods describes techniques for analyzing several variables simultaneously. The book covers descriptive measures, such as correlations. It also describes methods that give insight into the structure of the multivariate data, such as clustering, latent class analysis, principal components, discriminant analysis, and partial least squares.

JMP 13 Predictive and Specialized Modeling, Second Edition, 2nd Edition

JMP 13 Predictive and Specialized Modeling provides details about modeling techniques such as partitioning, neural networks, nonlinear regression, and time series analysis. Topics include the Gaussian platform, which is useful in analyzing computer simulation experiments. The book also covers the Response Screening platform, which is useful in testing the effect of a predictor when you have many responses.

JMP Start Statistics, 6th Edition

This book provides hands-on tutorials with just the right amount of conceptual and motivational material to illustrate how to use the intuitive interface for data analysis in JMP. Each chapter features concept-specific tutorials,

examples, brief reviews of concepts, step-by-step illustrations, and exercises.

Updated for JMP 13, JMP Start Statistics, Sixth Edition includes many new features, including:

The redesigned Formula Editor.

New and improved ways to create formulas in JMP directly from the data table or dialogs.

Interface updates, including improved menu layout.

Updates and enhancements in many analysis platforms.

New ways to get data into JMP and to save and share JMP results.

Many new features that make it easier to use JMP.

Total Survey Error in Practice

Featuring a timely presentation of total survey error (TSE), this edited volume introduces valuable tools for understanding and improving survey data quality in the context of evolving large-scale data sets This book provides an overview of the TSE framework and current TSE research as related to survey design, data collection, estimation, and analysis. It recognizes that survey data affects many public policy and business decisions and thus focuses on the framework for understanding and improving survey data quality. The book also addresses issues with data quality in official statistics and in social, opinion, and market research as these fields continue to evolve, leading to larger and messier data sets. This perspective challenges survey organizations to find ways to collect and process data more efficiently without sacrificing quality. The volume consists of the most up-to-date research and reporting from over 70 contributors representing the best academics and researchers from a range of fields. The chapters are broken out into five main sections: The Concept of TSE and the TSE Paradigm, Implications for Survey Design, Data Collection and Data Processing Applications, Evaluation and Improvement, and Estimation and Analysis. Each chapter introduces and examines multiple error sources, such as sampling error, measurement error, and nonresponse error, which often offer the greatest risks to data quality, while also encouraging readers not to lose sight of the less commonly studied error sources, such as coverage error, processing error, and specification error. The book also notes the relationships between errors and the ways in which efforts to reduce one type can increase another, resulting in an estimate with larger total error. This book: • Features various error sources, and the complex relationships between them, in 25 high-quality chapters on the most up-to-date research in the field of TSE • Provides comprehensive reviews of the literature on error sources as well as data collection approaches and estimation methods to reduce their effects • Presents examples of recent international events that demonstrate the effects of data error, the importance of survey data quality, and the real-world issues that arise from these errors • Spans the four pillars of the total survey error paradigm (design, data collection, evaluation and analysis) to address key data quality issues in official statistics and survey research Total Survey Error in Practice is a reference for survey researchers and data scientists in research areas that include social science, public opinion, public policy, and business. It can also be used as a textbook or supplementary material for a graduate-level course in survey research methods. Paul P. Biemer, PhD, is distinguished fellow at RTI International and associate director of Survey Research and Development at the Odum Institute, University of North Carolina, USA. Edith de Leeuw, PhD, is professor of survey methodology in the Department of Methodology and Statistics at Utrecht University, the Netherlands. Stephanie Eckman, PhD, is fellow at RTI International, USA. Brad Edwards is vice president, director of Field Services, and deputy area director at Westat, USA. Frauke Kreuter, PhD, is professor and director of the Joint Program in Survey Methodology, University of Maryland, USA; professor of statistics and methodology at the University of Mannheim, Germany; and head of the Statistical Methods Research Department at the Institute for Employment Research, Germany. Lars E. Lyberg, PhD, is senior advisor at Inizio, Sweden. N. Clyde Tucker, PhD, is principal survey methodologist at the American Institutes for Research, USA. Brady T. West, PhD, is research associate professor in the Survey Resea

2017 European Data Science Salary Survey

How do data science salaries for people in Europe compare to their counterparts in the rest of the world? Among the more than 1000 people who responded to O’Reilly’s 2016 Data Science Salary Survey, 359 live and work in various European countries as data scientists, analysts, engineers, and related professions. This report takes a deep dive into the survey results from respondents in various regions of Europe, including the tools they use, the compensation they receive, and the roles they play in their respective organizations. Even if you didn’t take part in the survey, you can still plug your own information into the survey’s simple linear model to see where you fit. With this report, you’ll learn: How salaries vary by country and specific regions in Europe Average size of companies by region How salary is affected by a country’s GDP Top industries for data scientists, including software, banking, finance, retail, and ecommerce Most commonly used tools vs tools used by respondents with above-average salaries Primary and secondary job tasks performed by survey respondents To stay up-to-date on this research, your participation is crucial. The survey is now open for the 2017 report; please take just 5 to 10 minutes to participate in the survey here.

Learning Kibana 5.0

Learning Kibana 5.0 is your gateway to mastering the art of data visualization using the powerful features of the Kibana platform. This book guides you through the process of creating stunning interactive dashboards and making data-driven insights accessible with real-time visualizations. Whether you're new to the Elastic stack or seeking to refine your expertise, this book equips you to harness Kibana's full potential. What this Book will help me do Build robust, real-time dashboards in Kibana to visualize complex datasets efficiently. Leverage Timelion to perform time-series data analysis and create metrics-based dashboards. Explore advanced analytics using the Graph plugin to uncover relationships and correlations in data. Learn how to create and deploy custom plugins to tailor Kibana to specific project needs. Understand how to use the Elastic stack to monitor, analyze, and optimize various types of data flows. Author(s) Bahaaldine Azarmi is a seasoned expert in the Elastic stack, known for his dedication to making complex technical topics approachable and practical. With years of experience in data analytics and software development, Bahaaldine shares not only his technical expertise but also his passion for helping professionals achieve their goals through clear, actionable guidance. His writing emphasizes hands-on learning and practical application. Who is it for? This book is perfect for developers, data visualization engineers, and data scientists who aim to hone their skills in data visualization and interactive dashboard development. It assumes a basic understanding of Elasticsearch and Logstash to maximize its practicality. If you aim to advance your career by learning how to optimize data architecture and solve real-world problems using the Elastic stack, this book is ideal for you.

Evolutionary Computation with Biogeography-based Optimization

Evolutionary computation algorithms are employed to minimize functions with large number of variables. Biogeography-based optimization (BBO) is an optimization algorithm that is based on the science of biogeography, which researches the migration patterns of species. These migration paradigms provide the main logic behind BBO. Due to the cross-disciplinary nature of the optimization problems, there is a need to develop multiple approaches to tackle them and to study the theoretical reasoning behind their performance. This manuscript intends to explain the mathematical model of BBO algorithm and its variants created to cope with continuous domain problems (with and without constraints) and combinatorial problems. Due to the cross-disciplinary nature of the optimization problems, there is a need to develop multiple approaches to tackle them and to study the theoretical reasoning behind their performance. This manuscript intends to explain the mathematical model of BBO algorithm and its variants created to cope with continuous domain problems (with and without constraints) and combinatorial problems.

A Panorama of Statistics

A Panorama of Statistics: Perspectives, Puzzles and Paradoxes in Statistics Eric Sowey, School of Economics, The University of New South Wales, Sydney, Australia Peter Petocz, Department of Statistics, Macquarie University, Sydney, Australia This book is a stimulating panoramic tour – quite different from a textbook journey – of the world of statistics in both its theory and practice, for teachers, students and practitioners.At each stop on the tour, the authors investigate unusual and quirky aspects of statistics, highlighting historical, biographical and philosophical dimensions of this field of knowledge. Each chapter opens with perspectives on its theme, often from several points of view. Five original and thought-provoking questions follow. These aim at widening readers’ knowledge and deepening their insight. Scattered among the questions are entertaining puzzles to solve and tantalising paradoxes to explain. Readers can compare their own statistical discoveries with the authors’ detailed answers to all the questions. The writing is lively and inviting, the ideas are rewarding, and the material is extensively cross-referenced. A Panorama of Statistics: Leads readers to discover the fascinations of statistics. Is an enjoyable companion to an undergraduate statistics textbook. Is an enriching source of knowledge for statistics teachers and practitioners. Is unique among statistics books today for its memorable content and engaging style. Lending itself equally to reading through and to dipping into, A Panorama of Statistics will surprise teachers, students and practitioners by the variety of ways in which statistics can capture and hold their interest.

Researching UX: Analytics

Good UX is based on evidence. Qualitative evidence, such as user testing and field research, can only get you so far. To get the full picture of how users are engaging with your website or app, you'll need to use quantitative evidence in the form of analytics. This book will show you, step by step, how you can use website and app analytics data to inform design choices and definitively improve user experience. Offering practical guidelines, with plenty of detailed examples, this book covers: why you need to gather analytics data for your UX projects getting set up with analytics tools analyzing data how to find problems in your analytics using analytics to aid user research, measure and report on outcomes By the end of this book, you'll have a strong understanding of the important role analytics plays in the UX process. It will inspire you to take an "analytics first" approach to your UX projects.

Strategies in Biomedical Data Science

An essential guide to healthcare data problems, sources, and solutions Strategies in Biomedical Data Science provides medical professionals with much-needed guidance toward managing the increasing deluge of healthcare data. Beginning with a look at our current top-down methodologies, this book demonstrates the ways in which both technological development and more effective use of current resources can better serve both patient and payer. The discussion explores the aggregation of disparate data sources, current analytics and toolsets, the growing necessity of smart bioinformatics, and more as data science and biomedical science grow increasingly intertwined. You'll dig into the unknown challenges that come along with every advance, and explore the ways in which healthcare data management and technology will inform medicine, politics, and research in the not-so-distant future. Real-world use cases and clear examples are featured throughout, and coverage of data sources, problems, and potential mitigations provides necessary insight for forward-looking healthcare professionals. Big Data has been a topic of discussion for some time, with much attention focused on problems and management issues surrounding truly staggering amounts of data. This book offers a lifeline through the tsunami of healthcare data, to help the medical community turn their data management problem into a solution. Consider the data challenges personalized medicine entails Explore the available advanced analytic resources and tools Learn how bioinformatics as a service is quickly becoming reality Examine the future of IOT and the deluge of personal device data The sheer amount of healthcare data being generated will only increase as both biomedical research and clinical practice trend toward individualized, patient-specific care. Strategies in Biomedical Data Science provides expert insight into the kind of robust data management that is becoming increasingly critical as healthcare evolves.