talk-data.com talk-data.com

Topic

data-science

2252

tagged

Activity Trend

1 peak/qtr
2020-Q1 2026-Q1

Activities

2252 activities · Newest first

Budgeting, Forecasting and Planning In Uncertain Times

Budgeting, planning and forecasting are critical management tasks that not only impact the future success of an organization, but can threaten its very survival if done badly. Yet in spite of their importance, the speed and complexity of today’s business environment has caused a rapid decrease in the planning time horizon. As a consequence the traditional planning processes have become unsuitable for most organization’s needs. In this book readers will find new, original insights, including: 7 planning models that every organization needs to plan and manage performance 6 ways in which performance can be viewed A planning framework based on best management practices that can cope with an unpredictable business environment The application of technology to planning and latest developments in systems Results of the survey conducted for the book on the state of planning in organizations

Mastering Machine Learning with R - Second Edition

Dive into the world of advanced machine learning techniques with "Mastering Machine Learning with R, Second Edition." This comprehensive guide equips you with the skills to implement sophisticated algorithms and create powerful prediction models using R 3.x. You will explore topics such as supervised and unsupervised learning, decision trees, ensemble methods, and deep learning. What this Book will help me do Implement machine learning workflows using a variety of R packages like XGBOOST. Effectively use linear and logistic regression for statistical analysis and pattern recognition. Develop skills in advanced methods such as support vector machines and neural networks. Learn actionable techniques to create recommendation engines and perform text mining. Gain hands-on experience running R-based machine learning analyses on cloud platforms. Author(s) None Lesmeister, a seasoned data scientist, combines extensive hands-on experience and a passion for teaching to deliver technical concepts in a practical, engaging manner. With a strong background in statistical analysis and machine learning, they are dedicated to providing readers with actionable knowledge and step-by-step guidance. Who is it for? This book is ideal for data scientists, analysts, and machine learning practitioners aiming to deepen their expertise in R. Readers should have a fundamental understanding of machine learning concepts and a basic knowledge of R programming. If you're looking to master advanced learning methods and apply them effectively, this book is tailored for you.

The Big Book of Dashboards

The definitive reference book with real-world solutions you won't find anywhere else The Big Book of Dashboards presents a comprehensive reference for those tasked with building or overseeing the development of business dashboards. Comprising dozens of examples that address different industries and departments (healthcare, transportation, finance, human resources, marketing, customer service, sports, etc.) and different platforms (print, desktop, tablet, smartphone, and conference room display) The Big Book of Dashboards is the only book that matches great dashboards with real-world business scenarios. By organizing the book based on these scenarios and offering practical and effective visualization examples, The Big Book of Dashboards will be the trusted resource that you open when you need to build an effective business dashboard. In addition to the scenarios there's an entire section of the book that is devoted to addressing many practical and psychological factors you will encounter in your work. It's great to have theory and evidenced-based research at your disposal, but what will you do when somebody asks you to make your dashboard 'cooler' by adding packed bubbles and donut charts? The expert authors have a combined 30-plus years of hands-on experience helping people in hundreds of organizations build effective visualizations. They have fought many 'best practices' battles and having endured bring an uncommon empathy to help you, the reader of this book, survive and thrive in the data visualization world. A well-designed dashboard can point out risks, opportunities, and more; but common challenges and misconceptions can make your dashboard useless at best, and misleading at worst. The Big Book of Dashboards gives you the tools, guidance, and models you need to produce great dashboards that inform, enlighten, and engage.

Oracle Business Intelligence Enterprise Edition 12c - Second Edition

This book, "Oracle Business Intelligence Enterprise Edition 12c," is a comprehensive guide to understanding and implementing Oracle's powerful business intelligence tools. Throughout its chapters, readers will learn how to install, configure, and utilize OBIEE 12c to transform organizational data into actionable insights through reports, dashboards, and advanced analytics. What this Book will help me do Install and configure Oracle BI 12c, preparing the essential software and database infrastructure. Create and manage the repository connecting your data to the OBIEE front end for detailed analysis. Develop interactive dashboards and analyses that derive actionable meaning from data. Design pixel-perfect reports using BI Publisher to cater to organizational reporting needs. Set up agents and workflows to provide actionable insights and improve decision-making processes. Author(s) The authors, None Ward and None Khan, are seasoned experts in Oracle Business Intelligence with years of experience implementing and managing OBIEE systems across various industries. Their approach combines succinct practical tutorials with insights gained from real-world projects, making technical concepts accessible and actionable. Who is it for? This book is perfect for IT consultants, data scientists, project managers, and business analysts involved in data-driven decision-making. Beginners to OBIEE but with a basic understanding of databases will find this guide invaluable for starting their BI journey. Professionals already familiar with other BI tools can also benefit from a detailed exploration of OBIEE's features.

Analyzing Data with Power BI and Power Pivot for Excel, First Edition

Renowned DAX experts Alberto Ferrari and Marco Russo teach you how to design data models for maximum efficiency and effectiveness. How can you use Excel and Power BI to gain real insights into your information? As you examine your data, how do you write a formula that provides the numbers you need? The answers to both of these questions lie with the data model. This book introduces the basic techniques for shaping data models in Excel and Power BI. It’s meant for readers who are new to data modeling as well as for experienced data modelers looking for tips from the experts. If you want to use Power BI or Excel to analyze data, the many real-world examples in this book will help you look at your reports in a different way—like experienced data modelers do. As you’ll soon see, with the right data model, the correct answer is always a simple one! By reading this book, you will: • Gain an understanding of the basics of data modeling, including tables, relationships, and keys • Familiarize yourself with star schemas, snowflakes, and common modeling techniques • Learn the importance of granularity • Discover how to use multiple fact tables, like sales and purchases, in a complex data model • Manage calendar-related calculations by using date tables • Track historical attributes, like previous addresses of customers or manager assignments • Use snapshots to compute quantity on hand • Work with multiple currencies in the most efficient way • Analyze events that have durations, including overlapping durations • Learn what data model you need to answer your specific business questions About This Book • For Excel and Power BI users who want to exploit the full power of their favorite tools • For BI professionals seeking new ideas for modeling data

Theory of Probability

First issued in translation as a two-volume work in 1975, this classic book provides the first complete development of the theory of probability from a subjectivist viewpoint. It proceeds from a detailed discussion of the philosophical mathematical aspects to a detailed mathematical treatment of probability and statistics. De Finetti’s theory of probability is one of the foundations of Bayesian theory. De Finetti stated that probability is nothing but a subjective analysis of the likelihood that something will happen and that that probability does not exist outside the mind. It is the rate at which a person is willing to bet on something happening. This view is directly opposed to the classicist/ frequentist view of the likelihood of a particular outcome of an event, which assumes that the same event could be identically repeated many times over, and the 'probability' of a particular outcome has to do with the fraction of the time that outcome results from the repeated trials.

Statistical Intervals, 2nd Edition

Describes statistical intervals to quantify sampling uncertainty,focusing on key application needs and recently developed methodology in an easy-to-apply format Statistical intervals provide invaluable tools for quantifying sampling uncertainty. The widely hailed first edition, published in 1991, described the use and construction of the most important statistical intervals. Particular emphasis was given to intervals—such as prediction intervals, tolerance intervals and confidence intervals on distribution quantiles—frequently needed in practice, but often neglected in introductory courses. Vastly improved computer capabilities over the past 25 years have resulted in an explosion of the tools readily available to analysts. This second edition—more than double the size of the first—adds these new methods in an easy-to-apply format. In addition to extensive updating of the original chapters, the second edition includes new chapters on: • Likelihood-based statistical intervals • Nonparametric bootstrap intervals • Parametric bootstrap and other simulation-based intervals • An introduction to Bayesian intervals • Bayesian intervals for the popular binomial, Poisson and normal distributions • Statistical intervals for Bayesian hierarchical models • Advanced case studies, further illustrating the use of the newly described methods New technical appendices provide justification of the methods and pathways to extensions and further applications. A webpage directs readers to current readily accessible computer software and other useful information. Statistical Intervals: A Guide for Practitioners and Researchers, Second Edition is an up-to-date working guide and reference for all who analyze data, allowing them to quantify the uncertainty in their results using statistical intervals. William Q. Meeker is Professor of Statistics and Distinguished Professor of Liberal Arts and Sciences at Iowa State University. He is co-author of Statistical Methods for Reliability Data (Wiley, 1998) and of numerous publications in the engineering and statistical literature and has won many awards for his research. Gerald J. Hahn served for 46 years as applied statistician and manager of an 18-person statistics group supporting General Electric and has co-authored four books. His accomplishments have been recognized by GE’s prestigious Coolidge Fellowship and 19 professional society awards. Luis A. Escobar is Professor of Statistics at Louisiana State University. He is co-author of Statistical Methods for Reliability Data (Wiley, 1998) and several book chapters. His publications have appeared in the engineering and statistical literature and he has won several research and teaching awards.

D3.js: Cutting-edge Data Visualization

Turn your raw data into real knowledge by creating and deploying complex data visualizations with D3.js About This Book Understand how to best represent your data by developing the right kind of visualization Explore the concepts of D3.js through examples that enable you to quickly create visualizations including charts, network diagrams, and maps Get practical examples of visualizations using real-world data sets that show you how to use D3.js to visualize and interact with information to glean its underlying meaning Who This Book Is For Whether you are new to data and data visualization, a seasoned data scientist, or a computer graphics specialist, this Learning Path will provide you with the skills you need to create web-based and interactive data visualizations. Some basic JavaScript knowledge is expected, but no prior experience with data visualization or D3 is required What You Will Learn Gain a solid understanding of the common D3 development idioms Find out how to write basic D3 code for servers using Node.js Install and use D3.js to create HTML elements within a document Create and style graphical elements such as circles, ellipses, rectangles, lines, paths, and text using SVG Turn your data into bar and scatter charts, and add margins, axes, labels, and legends Use D3.js generators to perform the magic of creating complex visualizations from data Add interactivity to your visualizations, including tool-tips, sorting, hover-to-highlight, and grouping and dragging of visuals Write, test, and distribute a D3-based charting package Make a real-time application with Node and D3 In Detail D3 has emerged as one of the leading platforms to develop beautiful, interactive visualizations over the web. We begin the course by setting up a strong foundation, then build on this foundation as we take you through the entire world of reimagining data using interactive, animated visualizations created in D3.js. In the first module, we cover the various features of D3.js to build a wide range of visualizations. We also focus on the entire process of representing data through visualizations. By the end of this module, you will be ready to use D3 to transform any data into a more engaging and sophisticated visualization. In the next module, you will learn to master the creation of graphical elements from data. Using practical examples provided, you will quickly get to grips with the features of D3.js and use this learning to create your own spectacular data visualizations with D3.js. Over the last leg of this course, you will get acquainted with how to integrate D3 with mapping libraries to provide reverse geocoding and interactive maps among many other advanced features of D3. This module culminates by showing you how to create enterprise-level dashboards to display real-time data. This Learning Path combines some of the best that Packt has to offer in one complete, curated package. It includes content from the following Packt products: Learning D3.js Data Visualization, Second Edition by Andrew H. Rininsland D3.js By Example by Michael Heydt Mastering D3.js by Pablo Navarro Castillo Style and approach This course provides a comprehensive explanation of how to leverage the power of D3.js to create powerful and creative visualizations through step-by-step instructions in the form of modules. Each module help you skill up a level in creating meaningful visualizations. Downloading the example code for this book. You can download the example code files for all Packt books you have purchased from your account at http://www.PacktPub.com. If you purchased this book elsewhere, you can visit http://www.PacktPub.com/support and register to have the code file.

R: Predictive Analysis

Master the art of predictive modeling About This Book Load, wrangle, and analyze your data using the world's most powerful statistical programming language Familiarize yourself with the most common data mining tools of R, such as k-means, hierarchical regression, linear regression, Naïve Bayes, decision trees, text mining and so on. We emphasize important concepts, such as the bias-variance trade-off and over-fitting, which are pervasive in predictive modeling Who This Book Is For If you work with data and want to become an expert in predictive analysis and modeling, then this Learning Path will serve you well. It is intended for budding and seasoned practitioners of predictive modeling alike. You should have basic knowledge of the use of R, although it’s not necessary to put this Learning Path to great use. What You Will Learn Get to know the basics of R’s syntax and major data structures Write functions, load data, and install packages Use different data sources in R and know how to interface with databases, and request and load JSON and XML Identify the challenges and apply your knowledge about data analysis in R to imperfect real-world data Predict the future with reasonably simple algorithms Understand key data visualization and predictive analytic skills using R Understand the language of models and the predictive modeling process In Detail Predictive analytics is a field that uses data to build models that predict a future outcome of interest. It can be applied to a range of business strategies and has been a key player in search advertising and recommendation engines. The power and domain-specificity of R allows the user to express complex analytics easily, quickly, and succinctly. R offers a free and open source environment that is perfect for both learning and deploying predictive modeling solutions in the real world. This Learning Path will provide you with all the steps you need to master the art of predictive modeling with R. We start with an introduction to data analysis with R, and then gradually you’ll get your feet wet with predictive modeling. You will get to grips with the fundamentals of applied statistics and build on this knowledge to perform sophisticated and powerful analytics. You will be able to solve the difficulties relating to performing data analysis in practice and find solutions to working with “messy data”, large data, communicating results, and facilitating reproducibility. You will then perform key predictive analytics tasks using R, such as train and test predictive models for classification and regression tasks, score new data sets and so on. By the end of this Learning Path, you will have explored and tested the most popular modeling techniques in use on real-world data sets and mastered a diverse range of techniques in predictive analytics. This Learning Path combines some of the best that Packt has to offer in one complete, curated package. It includes content from the following Packt products: Data Analysis with R, Tony Fischetti Learning Predictive Analytics with R, Eric Mayor Mastering Predictive Analytics with R, Rui Miguel Forte Style and approach Learn data analysis using engaging examples and fun exercises, and with a gentle and friendly but comprehensive "learn-by-doing" approach. This is a practical course, which analyzes compelling data about life, health, and death with the help of tutorials. It offers you a useful way of interpreting the data that’s specific to this course, but that can also be applied to any other data. This course is designed to be both a guide and a reference for moving beyond the basics of predictive modeling. Downloading the example code for this book. You can download the example code files for all Packt books you have purchased from your account at http://www.PacktPub.com. If you purchased this book elsewhere, you can visit http://www.PacktPub.com/support and register to have the code file.

An Introduction to SAS Visual Analytics

When it comes to business intelligence and analytical capabilities, SAS Visual Analytics is the premier solution for data discovery, visualization, and reporting. An Introduction to SAS Visual Analytics will show you how to make sense of your complex data with the goal of leading you to smarter, data-driven decisions without having to write a single line of code – unless you want to! You will be able to use SAS Visual Analytics to access, prepare, and present your data to anyone anywhere in the world. SAS Visual Analytics automatically highlights key relationships, outliers, clusters, trends and more. These abilities will guide you to critical insights that inspire action from your data. With this book, you will become proficient using SAS Visual Analytics to present data and results in customizable, robust visualizations, as well as guided analyses through auto-charting. With interactive dashboards, charts, and reports, you will create visualizations which convey clear and actionable insights for any size and type of data. This book largely focuses on the version of SAS Visual Analytics on SAS 9.4, although it is available on both 9.4 and SAS Viya platforms. Each version is considered the latest release, with subsequent releases planned to continue on each platform; hence, the Viya version works similarly to the 9.4 version and will look familiar. This book covers new features of each and important differences between the two. With this book, you will learn how to: Build your first report using the SAS Visual Analytics Designer Prepare a dashboard and determine the best layout Effectively use geo-spatial objects to add location analytics to reports Understand and use the elements of data visualizations Prepare and load your data with the SAS Visual Analytics Data Builder Analyze data with a variety of options, including forecasting, word clouds, heat maps, correlation matrix, and more Understand administration activities to keep SAS Visual Analytics humming along Optimize your environment for considerations such as scalability, availability, and efficiency between components of your SAS software deployment and data providers

Statistical Analysis with R For Dummies

Understanding the world of R programming and analysis has never been easier Most guides to R, whether books or online, focus on R functions and procedures. But now, thanks to Statistical Analysis with R For Dummies, you have access to a trusted, easy-to-follow guide that focuses on the foundational statistical concepts that R addresses—as well as step-by-step guidance that shows you exactly how to implement them using R programming. People are becoming more aware of R every day as major institutions are adopting it as a standard. Part of its appeal is that it's a free tool that's taking the place of costly statistical software packages that sometimes take an inordinate amount of time to learn. Plus, R enables a user to carry out complex statistical analyses by simply entering a few commands, making sophisticated analyses available and understandable to a wide audience. Statistical Analysis with R For Dummies enables you to perform these analyses and to fully understand their implications and results. Gets you up to speed on the #1 analytics/data science software tool Demonstrates how to easily find, download, and use cutting-edge community-reviewed methods in statistics and predictive modeling Shows you how R offers intel from leading researchers in data science, free of charge Provides information on using R Studio to work with R Get ready to use R to crunch and analyze your data—the fast and easy way!

Creating a Data-Driven Enterprise with DataOps

Many companies are busy collecting massive amounts of data, but few are taking advantage of this treasure horde to build a truly data insights-driven organization. To do so, the data team must democratize both data and the insights in a way that provides real-time access to all employees in the organization. This report explores DataOps, the process, culture, tools, and people required to scale big data pervasively across the enterprise. Just as DevOps has enabled organizations to improve coordination between developers and the operations team, DataOps closely connects everyone who handles data, including engineers, data scientists, analysts, and business users. Democratizing data with this approach requires removing barriers typical of siloed data, teams, and systems. In this report, Apache Hive creators Ashish Thusoo and Joydeep Sen Sarma examine the characteristics of a data-driven organization that supports a self-service model. Explore related topics such as data lakes, metadata, cloud architecture, and data-infrastructure-as-a-service Examine conclusions from a survey of more than 400 senior executives whose companies are in various stages of data maturity Learn how data pioneers at Facebook, Uber, LinkedIn, Twitter, and eBay created data-driven cultures and self-service data infrastructures for their organizations

Monetizing Your Data

Transforming data into revenue generating strategies and actions Organizations are swamped with data—collected from web traffic, point of sale systems, enterprise resource planning systems, and more , but what to do with it? Monetizing your Data provides a framework and path for business managers to convert ever-increasing volumes of data into revenue generating actions through three disciplines: decision architecture, data science, and guided analytics. There are large gaps between understanding a business problem and knowing which data is relevant to the problem and how to leverage that data to drive significant financial performance. Using a proven methodology developed in the field through delivering meaningful solutions to Fortune 500 companies, this book gives you the analytical tools, methods, and techniques to transform data you already have into information into insights that drive winning decisions. Beginning with an explanation of the analytical cycle, this book guides you through the process of developing value generating strategies that can translate into big returns. The companion website, www.monetizingyourdata.com, provides templates, checklists, and examples to help you apply the methodology in your environment, and the expert author team provides authoritative guidance every step of the way. This book shows you how to use your data to: Monetize your data to drive revenue and cut costs Connect your data to decisions that drive action and deliver value Develop analytic tools to guide managers up and down the ladder to better decisions Turning data into action is key; data can be a valuable competitive advantage, but only if you understand how to organize it, structure it, and uncover the actionable information hidden within it through decision architecture and guided analytics. From multinational corporations to single-owner small businesses, companies of every size and structure stand to benefit from these tools, methods, and techniques; Monetizing your Data walks you through the translation and transformation to help you leverage your data into value creating strategies.

Effective Business Intelligence with QuickSight

Effective Business Intelligence with QuickSight introduces you to Amazon QuickSight, a modern BI tool that enables interactive visualizations powered by the cloud. With comprehensive tutorials, you'll master how to load, prepare, and visualize your data for actionable insights. This book provides real-world examples to showcase how QuickSight integrates into the AWS ecosystem. What this Book will help me do Understand how to effectively use Amazon QuickSight for business intelligence. Learn how to connect QuickSight to data sources like S3, RDS, and more. Create interactive dashboards and visualizations with QuickSight tools. Gain expertise in managing users, permissions, and data security in QuickSight. Execute a real-world big data project using AWS Data Lakes and QuickSight. Author(s) None Nadipalli is a seasoned data architect with extensive experience in cloud computing and business intelligence. With expertise in the AWS ecosystem, she has worked on numerous large-scale data analytics projects. Her writing focuses on providing practical knowledge through easy-to-follow examples and actionable insights. Who is it for? This book is ideal for business intelligence architects, developers, and IT executives seeking to leverage Amazon QuickSight. It is suited for readers with foundational knowledge of AWS who want to enhance their capabilities in BI and data visualization. If your goal is to modernize your business intelligence systems and explore advanced analytics, this book is perfect for you.

Translating Statistics to Make Decisions: A Guide for the Non-Statistician

Examine and solve the common misconceptions and fallacies that non-statisticians bring to their interpretation of statistical results. Explore the many pitfalls that non-statisticians—and also statisticians who present statistical reports to non-statisticians—must avoid if statistical results are to be correctly used for evidence-based business decision making. Victoria Cox, senior statistician at the United Kingdom's Defence Science and Technology Laboratory (Dstl), distills the lessons of her long experience presenting the actionable results of complex statistical studies to users of widely varying statistical sophistication across many disciplines: from scientists, engineers, analysts, and information technologists to executives, military personnel, project managers, and officials across UK government departments, industry, academia, and international partners. The author shows how faulty statistical reasoning often undermines the utility of statistical results even among those with advanced technical training. Translating Statistics teaches statistically naive readers enough about statistical questions, methods, models, assumptions, and statements that they will be able to extract the practical message from statistical reports and better constrain what conclusions cannot be made from the results. To non-statisticians with some statistical training, this book offers brush-ups, reminders, and tips for the proper use of statistics and solutions to common errors. To fellow statisticians, the author demonstrates how to present statistical output to non-statisticians to ensure that the statistical results are correctly understood and properly applied to real-world tasks and decisions. The book avoids algebra and proofs, but it does supply code written in R for those readers who are motivated to work out examples. Pointing along the way to instructive examples of statistics gone awry, Translating Statistics walks readers through the typical course of a statistical study, progressing from the experimental design stage through the data collection process, exploratory data analysis, descriptive statistics, uncertainty, hypothesis testing, statistical modelling and multivariate methods, to graphs suitable for final presentation. The steady focus throughout the book is on how to turn the mathematical artefacts and specialist jargon that are second nature to statisticians into plain English for corporate customers and stakeholders. The final chapter neatly summarizes the book's lessons and insights for accurately communicating statistical reports to the non-statisticians who commission and act on them. What You'll Learn Recognize and avoid common errors and misconceptions that cause statistical studies to be misinterpreted and misused by non-statisticians in organizational settings Gain a practical understanding of the methods, processes, capabilities, and caveats of statistical studies to improve the application of statistical data to business decisions See how to code statistical solutions in R Who This Book Is For Non-statisticians—including both those with and without an introductory statistics course under their belts—who consume statistical reports in organizational settings, and statisticians who seek guidance for reporting statistical studies to non-statisticians in ways that will be accurately understood and will inform sound business and technical decisions

Beginning Data Science in R: Data Analysis, Visualization, and Modelling for the Data Scientist

Discover best practices for data analysis and software development in R and start on the path to becoming a fully-fledged data scientist. This book teaches you techniques for both data manipulation and visualization and shows you the best way for developing new software packages for R. Beginning Data Science in R details how data science is a combination of statistics, computational science, and machine learning. You'll see how to efficiently structure and mine data to extract useful patterns and build mathematical models. This requires computational methods and programming, and R is an ideal programming language for this. This book is based on a number of lecture notes for classes the author has taught on data science and statistical programming using the R programming language. Modern data analysis requires computational skills and usually a minimum of programming. What You Will Learn Perform data science and analytics using statistics and the R programming language Visualize and explore data, including working with large data sets found in big data Build an R package Test and check your code Practice version control Profile and optimize your code Who This Book Is For Those with some data science or analytics background, but not necessarily experience with the R programming language.

Think Like a Data Scientist

Think Like a Data Scientist presents a step-by-step approach to data science, combining analytic, programming, and business perspectives into easy-to-digest techniques and thought processes for solving real world data-centric problems. About the Technology Data collected from customers, scientific measurements, IoT sensors, and so on is valuable only if you understand it. Data scientists revel in the interesting and rewarding challenge of observing, exploring, analyzing, and interpreting this data. Getting started with data science means more than mastering analytic tools and techniques, however; the real magic happens when you begin to think like a data scientist. This book will get you there. About the Book Think Like a Data Scientist teaches you a step-by-step approach to solving real-world data-centric problems. By breaking down carefully crafted examples, you'll learn to combine analytic, programming, and business perspectives into a repeatable process for extracting real knowledge from data. As you read, you'll discover (or remember) valuable statistical techniques and explore powerful data science software. More importantly, you'll put this knowledge together using a structured process for data science. When you've finished, you'll have a strong foundation for a lifetime of data science learning and practice. What's Inside The data science process, step-by-step How to anticipate problems Dealing with uncertainty Best practices in software and scientific thinking About the Reader Readers need beginner programming skills and knowledge of basic statistics. About the Author Brian Godsey has worked in software, academia, finance, and defense and has launched several data-centric start-ups. Quotes Explains difficult concepts and techniques concisely and approachably. - Jenice Tom, CVS Health Goes beyond simple tools and techniques and helps you to conceptualize and solve challenging, real-world data science problems. - Casimir Saternos, Synchronoss Technologies A successful attempt to put the mind of a data scientist on paper. - David Krief, Altansia The book that changed my career path! - Nicolas Boulet-Lavoie, DL Innov

Data Visualization, Volume II

This book discusses data and information visualization techniques-the decision-making tools with applications in health care, finance, manufacturing engineering, process improvement, product design, and others. These tools are an excellent means of viewing the current state of the process and improving them. The initial chapters discuss data analysis, the current trends in visualization, the concepts of systems and processes from which data are collected. The second part is devoted to quality tools-a set of graphical and information visualization tools in data analysis, decision-making, and Lean Six-Sigma quality. The eight basic tools of quality discussed are the Process Maps, Check Sheets, Histograms, Scatter Diagrams, Run Charts, Control Charts, Cause-and-Effect Diagrams, and Pareto Charts. The new quality tools presented are the Affinity, Tree, and Matrix Diagrams, Interrelationship Digraph, Prioritizing Matrices, Process Decision Program Chart, and Activity Network Diagram along with Quality Function Deployment (QFD) and Multivari Charts.

Data Science For Dummies, 2nd Edition

Your ticket to breaking into the field of data science! Jobs in data science are projected to outpace the number of people with data science skills—making those with the knowledge to fill a data science position a hot commodity in the coming years. Data Science For Dummies is the perfect starting point for IT professionals and students interested in making sense of an organization's massive data sets and applying their findings to real-world business scenarios. From uncovering rich data sources to managing large amounts of data within hardware and software limitations, ensuring consistency in reporting, merging various data sources, and beyond, you'll develop the know-how you need to effectively interpret data and tell a story that can be understood by anyone in your organization. Provides a background in data science fundamentals and preparing your data for analysis Details different data visualization techniques that can be used to showcase and summarize your data Explains both supervised and unsupervised machine learning, including regression, model validation, and clustering techniques Includes coverage of big data processing tools like MapReduce, Hadoop, Dremel, Storm, and Spark It's a big, big data world out there—let Data Science For Dummies help you harness its power and gain a competitive edge for your organization.