data-science

HBR Guide to Data Analytics Basics for Managers (HBR Guide Series)

2018-03-13 · O'Reilly Data Science Books O'Reilly Amazon

book

by Harvard Business Review

Analytics Data Analytics data data-science-tools r

Don't let a fear of numbers hold you back. Today's business environment brings with it an onslaught of data. Now more than ever, managers must know how to tease insight from data--to understand where the numbers come from, make sense of them, and use them to inform tough decisions. How do you get started? Whether you're working with data experts or running your own tests, you'll find answers in the HBR Guide to Data Analytics Basics for Managers. This book describes three key steps in the data analysis process, so you can get the information you need, study the data, and communicate your findings to others. You'll learn how to: Identify the metrics you need to measure Run experiments and A/B tests Ask the right questions of your data experts Understand statistical terms and concepts Create effective charts and visualizations Avoid common mistakes

The Routledge Companion to Trust

2018-03-13 · O'Reilly Data Science Books O'Reilly Amazon

book

by Sim B. Sitkin , Rosalind H. Searle , Ann-Marie I. Nienaber

data data-science-tasks statistics survey-methodologies

Comprising six thematic sections, The Routledge Companion to Trust provides a comprehensive survey of trust research. With contributions from international experts, this volume examines the major topics and emerging areas within the field, including essays on the foundations, levels and theories of trust.

Data Stewardship for Open Science

2018-03-09 · O'Reilly Data Science Books O'Reilly Amazon

book

by Barend Mons

data

This book makes readers aware of the need, complexity, and challenges associated with open science, modern science communication, and data stewardship.

Business Case Analysis with R: Simulation Tutorials to Support Complex Business Decisions

2018-03-01 · O'Reilly Data Science Books O'Reilly Amazon

book

by Robert D. Brown III

Data Science Monte Carlo R data data-science-tools r

This tutorial teaches you how to use the statistical programming language R to develop a business case simulation and analysis. It presents a methodology for conducting business case analysis that minimizes decision delay by focusing stakeholders on what matters most and suggests pathways for minimizing the risk in strategic and capital allocation decisions. Business case analysis, often conducted in spreadsheets, exposes decision makers to additional risks that arise just from the use of the spreadsheet environment. R has become one of the most widely used tools for reproducible quantitative analysis, and analysts fluent in this language are in high demand. The R language, traditionally used for statistical analysis, provides a more explicit, flexible, and extensible environment than spreadsheets for conducting business case analysis. The main tutorial follows the case in which a chemical manufacturing company considers constructing a chemical reactor and production facility to bring a new compound to market. There are numerous uncertainties and risks involved, including the possibility that a competitor brings a similar product online. The company must determine the value of making the decision to move forward and where they might prioritize their attention to make a more informed and robust decision. While the example used is a chemical company, the analysis structure it presents can be applied to just about any business decision, from IT projects to new product development to commercial real estate. The supporting tutorials include the perspective of the founder of a professional service firm who wants to grow his business and a member of a strategic planning group in a biomedical device company who wants to know how much to budget in order to refine the quality of information about critical uncertainties that might affect the value of a chosen product development pathway. What You’ll Learn Set upa business case abstraction in an influence diagram to communicate the essence of the problem to other stakeholders Model the inherent uncertainties in the problem with Monte Carlo simulation using the R language Communicate the results graphically Draw appropriate insights from the results Develop creative decision strategies for thorough opportunity cost analysis Calculate the value of information on critical uncertainties between competing decision strategies to set the budget for deeper data analysis Construct appropriate information to satisfy the parameters for the Monte Carlo simulation when little or no empirical data are available Who This Book Is For Financial analysts, data practitioners, and risk/business professionals; also appropriate for graduate level finance, business, or data science students

SQL Server 2017 Machine Learning Services with R

2018-02-27 · O'Reilly Data Science Books O'Reilly Amazon

book

by Julie Koesmarno (Microsoft) , Toma≈æ Ka≈°trun Kaštrun

AI/ML Analytics Data Science SQL data data-science-tools r

Learn how to leverage SQL Server 2017 Machine Learning Services and the R programming language to create robust, efficient data analysis and machine learning solutions. This book provides actionable insights and practical examples to help you implement and manage database-oriented analytics and predictive modeling. What this Book will help me do Understand and use SQL Server 2017 Machine Learning Services integrated with R. Gain experience in installing, configuring, and maintaining R services in SQL Server. Create and operationalize predictive models using RevoScaleR and other R packages. Improve database solutions by incorporating advanced analytics techniques. Monitor and manage R-based services effectively for reliable production solutions. Author(s) Tomaž Kaštrun and None Koesmarno bring a wealth of expertise as practitioners and educators in data science and SQL Server technologies. They share their experience innovatively, making intricate subjects approachable. Their unified teaching method ensures readers can directly benefit from practical examples and real-world applications. Who is it for? This book is tailored for database administrators, data analysts, and data scientists eager to integrate R with SQL Server. It caters to professionals with varying levels of R experience who are looking to enhance their proficiency in database-oriented analytics. Readers will benefit most if they are motivated to design effective, data-driven solutions in SQL Server environments.

A Practical Guide for Informationists

2018-02-23 · O'Reilly Data Science Books O'Reilly Amazon

book

by Antonio P DeRosa

Data Management data healthcare-analytics

A Practical Guide for Informationists: Supporting Research and Clinical Practice guides new informationists to a successful career, giving them a pathway to this savvier, more technically advanced, domain-focused role in modern day information centers and libraries. The book's broad scope serves as an invaluable toolkit for healthcare professionals, researchers and graduate students in information management, library and information science, data management, informatics, etc. Furthermore, it is also ideal as a textbook for courses in medical reference services/medical informatics in MLIS programs. Offer examples (e.g. case studies) of ways of delivering information services to end users Includes recommendations, evidence and worksheets/take-aways/templates to be repurposed and adapted by the reader Aimed at the broad area of healthcare and research libraries

Practical Data Science: A Guide to Building the Technology Stack for Turning Data Lakes into Business Assets

2018-02-21 · O'Reilly Data Science Books O'Reilly Amazon

book

by Andreas François Vermeulen

Data Engineering Data Lake Data Science data

Learn how to build a data science technology stack and perform good data science with repeatable methods. You will learn how to turn data lakes into business assets. The data science technology stack demonstrated in Practical Data Science is built from components in general use in the industry. Data scientist Andreas Vermeulen demonstrates in detail how to build and provision a technology stack to yield repeatable results. He shows you how to apply practical methods to extract actionable business knowledge from data lakes consisting of data from a polyglot of data types and dimensions. What You'll Learn Become fluent in the essential concepts and terminology of data science and data engineering Build and use a technology stack that meets industry criteria Master the methods for retrieving actionable business knowledge Coordinate the handling ofpolyglot data types in a data lake for repeatable results Who This Book Is For Data scientists and data engineers who are required to convert data from a data lake into actionable knowledge for their business, and students who aspire to be data scientists and data engineers

R Projects For Dummies

2018-02-13 · O'Reilly Data Science Books O'Reilly Amazon

book

by Joseph Schmuller

AI/ML Data Science data data-science-tools r

Make the most of R’s extensive toolset R Projects For Dummies offers a unique learn-by-doing approach. You will increase the depth and breadth of your R skillset by completing a wide variety of projects. By using R’s graphics, interactive, and machine learning tools, you’ll learn to apply R’s extensive capabilities in an array of scenarios. The depth of the project experience is unmatched by any other content online or in print. And you just might increase your statistics knowledge along the way, too! R is a free tool, and it’s the basis of a huge amount of work in data science. It's taking the place of costly statistical software that sometimes takes a long time to learn. One reason is that you can use just a few R commands to create sophisticated analyses. Another is that easy-to-learn R graphics enable you make the results of those analyses available to a wide audience. This book will help you sharpen your skills by applying them in the context of projects with R, including dashboards, image processing, data reduction, mapping, and more. Appropriate for R users at all levels Helps R programmers plan and complete their own projects Focuses on R functions and packages Shows how to carry out complex analyses by just entering a few commands If you’re brand new to R or just want to brush up on your skills, R Projects For Dummies will help you complete your projects with ease.

Python Web Scraping Cookbook

2018-02-09 · O'Reilly Data Science Books O'Reilly Amazon

book

by Mei Lu , Lazar Telebak , Michael Heydt

AWS Cloud Computing Data Engineering JavaScript Python Selenium data data-science-tasks web-scraping

Python Web Scraping Cookbook is your comprehensive guide to building efficient and functional web scraping tools using Python. With practical recipes, you'll learn to overcome the challenges of dynamic content, captcha, and irregular web structures while deploying scalable solutions. What this Book will help me do Master the use of Python libraries like BeautifulSoup and Scrapy for scraping data. Perfect techniques for handling JavaScript-heavy sites using Selenium. Learn to overcome web scraping challenges, such as captchas and rate-limiting. Design scalable scraping pipelines with cloud deployment in AWS. Understand web data extraction techniques with XPath, CSS selectors, and more. Author(s) Michael Heydt is a seasoned software engineer and technical author with a focus on data engineering and cloud solutions. Having worked with Python extensively, he brings real-world insights into web scraping. His practical approach simplifies complex concepts. Who is it for? This book is perfect for Python developers and data enthusiasts keen to master web scraping techniques. If you're a programmer with insights into Python scripting and wish to scrape, analyze, and utilize web data efficiently, this book is for you.

SAS Viya

2018-02-08 · O'Reilly Data Science Books O'Reilly Amazon

book

by Kevin D. Smith , Xiangxiang Meng

AI/ML Analytics API Cloud Computing Java Python SAS analytics-platforms data

Learn how to access analytics from SAS Cloud Analytic Services (CAS) using Python and the SAS Viya platform. SAS Viya : The Python Perspective is an introduction to using the Python client on the SAS Viya platform. SAS Viya is a high-performance, fault-tolerant analytics architecture that can be deployed on both public and private cloud infrastructures. While SAS Viya can be used by various SAS applications, it also enables you to access analytic methods from SAS, Python, Lua, and Java, as well as through a REST interface using HTTP or HTTPS. This book focuses on the perspective of SAS Viya from Python. SAS Viya is made up of multiple components. The central piece of this ecosystem is SAS Cloud Analytic Services (CAS). CAS is the cloud-based server that all clients communicate with to run analytical methods. The Python client is used to drive the CAS component directly using objects and constructs that are familiar to Python programmers. Some knowledge of Python would be helpful before using this book; however, there is an appendix that covers the features of Python that are used in the CAS Python client. Knowledge of CAS is not required to use this book. However, you will need to have a CAS server set up and running to execute the examples in this book. With this book, you will learn how to: Install the required components for accessing CAS from Python Connect to CAS, load data, and run simple analyses Work with CAS using APIs familiar to Python users Grasp general CAS workflows and advanced features of the CAS Python client SAS Viya : The Python Perspective covers topics that will be useful to beginners as well as experienced CAS users. It includes examples from creating connections to CAS all the way to simple statistics and machine learning, but it is also useful as a desktop reference.

An Introduction to Discrete-Valued Time Series

2018-02-05 · O'Reilly Data Science Books O'Reilly Amazon

book

by Christian H. Weiss

AI/ML Computer Science Data Science Jenkins data data-science-tasks statistics time-series

A much-needed introduction to the field of discrete-valued time series, with a focus on count-data time series Time series analysis is an essential tool in a wide array of fields, including business, economics, computer science, epidemiology, finance, manufacturing and meteorology, to name just a few. Despite growing interest in discrete-valued time series—especially those arising from counting specific objects or events at specified times—most books on time series give short shrift to that increasingly important subject area. This book seeks to rectify that state of affairs by providing a much needed introduction to discrete-valued time series, with particular focus on count-data time series. The main focus of this book is on modeling. Throughout numerous examples are provided illustrating models currently used in discrete-valued time series applications. Statistical process control, including various control charts (such as cumulative sum control charts), and performance evaluation are treated at length. Classic approaches like ARMA models and the Box-Jenkins program are also featured with the basics of these approaches summarized in an Appendix. In addition, data examples, with all relevant R code, are available on a companion website. Provides a balanced presentation of theory and practice, exploring both categorical and integer-valued series Covers common models for time series of counts as well as for categorical time series, and works out their most important stochastic properties Addresses statistical approaches for analyzing discrete-valued time series and illustrates their implementation with numerous data examples Covers classical approaches such as ARMA models, Box-Jenkins program and how to generate functions Includes dataset examples with all necessary R code provided on a companion website An Introduction to Discrete-Valued Time Series is a valuable working resource for researchers and practitioners in a broad range of fields, including statistics, data science, machine learning, and engineering. It will also be of interest to postgraduate students in statistics, mathematics and economics.

Loss Data Analysis

2018-02-05 · O'Reilly Data Science Books O'Reilly Amazon

book

by Silvia Mayoral , Henryk Gzyl , Erika Gomes-Gonçalves

data data-science-tasks statistics

This volume deals with two complementary topics. On one hand the book deals with the problem of determining the the probability distribution of a positive compound random variable, a problem which appears in the banking and insurance industries, in many areas of operational research and in reliability problems in the engineering sciences. On the other hand, the methodology proposed to solve such problems, which is based on an application of the maximum entropy method to invert the Laplace transform of the distributions, can be applied to many other problems. The book contains applications to a large variety of problems, including the problem of dependence of the sample data used to estimate empirically the Laplace transform of the random variable. Contents Introduction Frequency models Individual severity models Some detailed examples Some traditional approaches to the aggregation problem Laplace transforms and fractional moment problems The standard maximum entropy method Extensions of the method of maximum entropy Superresolution in maxentropic Laplace transform inversion Sample data dependence Disentangling frequencies and decompounding losses Computations using the maxentropic density Review of statistical procedures

Market Data Analysis Using JMP

2018-02-05 · O'Reilly Data Science Books O'Reilly Amazon

book

by Walter R. Paczkowski

analytics-platforms data jmp

With the powerful interactive and visual functionality of JMP, you can dynamically analyze market data to transform it into actionable and useful information with clear, concise, and insightful reports and displays. Market Data Analysis Using JMP is a unique example-driven book because it has a specific application focus: market data analysis. A working knowledge of JMP will help you turn your market data into vital knowledge that will help you succeed in a highly competitive, fast-moving, and dynamic business world. This book can be used as a stand-alone resource for working professionals, or as a supplement to a business school course in market data research. Anyone who works with market data will benefit from reading and studying this book, then using JMP to apply the dynamic analytical concepts to their market data. After reading this book, you will be able to quickly and effortlessly use JMP to: prepare market data for analysis use and interpret sophisticated statistical methods build choice models estimate regression models to turn data into useful and actionable information Market Data Analysis Using JMP will teach you how to use dynamic graphics to illustrate your market data analysis and explore the vast possibilities that your data can offer!

Predictive Analytics for Marketers

2018-02-03 · O'Reilly Data Science Books O'Reilly Amazon

book

by Barry Leventhal

Analytics Marketing data web-analytics

Understand how to apply predictive analytics to better manage a company and its resources more effectively, with this revolutionary book for marketing professionals.

An Introduction to SAS University Edition

2018-02-02 · O'Reilly Data Science Books O'Reilly Amazon

book

by Ron Cody

CSV SAS analytics-platforms data

SAS ® OnDemand for Academics is now the primary software choice for learners. SAS OnDemand for Academics is available for free access to SAS for individual learners as well as university educators and students. Access to SAS University Edition will end Aug. 2, 2021; users will no longer be able to download it after Apr. 30, 2021. Get up and running with the SAS University Edition using Ron Cody’s easy-to-follow, step-by-step guide. Aimed at beginners who have downloaded the free SAS University Edition and want to either use the point-and-click interactive environment of SAS Studio, or who want to write their own SAS programs, or both, An Introduction to SAS University Edition, begins by showing you how to obtain the SAS University Edition, and how you can run SAS on a PC or Macintosh computer. The first part of the book shows you how to perform basic tasks, such as producing a report, summarizing data, producing charts and graphs, and using the SAS Studio built-in tasks. The first part also describes how you can perform basic statistical tests using the interactive point-and-click environment. The second part of the book shows you how to write your own SAS programs, and how to use SAS procedures to perform a variety of tasks. This part of the book also explains how to read data from a variety of sources: text files, Excel workbooks, and CSV files. In order to get familiar with the SAS Studio environment, this book also shows you how to access dozens of interesting data sets that are included with the product.

Simulation of Dynamic Systems with MATLAB® and Simulink®, 3rd Edition

2018-02-02 · O'Reilly Data Science Books O'Reilly Amazon

book

by Randal Allen , Harold Klee

MATLAB data data-science-tools

The book presents an integrated treatment of continuous simulation with all the background and essential prerequisites in one setting. It features updated chapters and two new sections on Black Swan and the Stochastic Information Packet (SIP) and Stochastic Library Units with Relationships Preserved (SLURP) Standard.

Regression Analysis with R

2018-01-31 · O'Reilly Data Science Books O'Reilly Amazon

book

by Pierre Paquay , Giuseppe Ciaburro , Manoj Kumar , Shaikh Salamatullah

Data Science data data-science-tasks regression-analysis statistics

Dive into the world of regression analysis with this hands-on guide that covers everything you need to know about building effective regression models in R. You'll learn both the theoretical foundations and how to apply them using practical examples and R code. By the end, you'll be equipped to interpret regression results and use them to make meaningful predictions. What this Book will help me do Master the fundamentals of regression analysis, from simple linear to logistic regression. Gain expertise in R programming for implementing regression models and analyzing results. Develop skills in handling missing data, feature engineering, and exploratory data analysis. Understand how to identify, prevent, and address overfitting and underfitting issues in modeling. Apply regression techniques in real-world applications, including classification problems and advanced methods like Bagging and Boosting. Author(s) Giuseppe Ciaburro is an experienced data scientist and author with a passion for making complex technical topics accessible. With expertise in R programming and regression analysis, he has worked extensively in statistical modeling and data exploration. Giuseppe's writing combines clear explanations of theory with hands-on examples, ideal for learners and practitioners alike. Who is it for? This book is perfect for aspiring data scientists and analysts eager to understand and apply regression analysis using R. It's suited for readers with a foundational knowledge of statistics and basic R programming experience. Whether you're delving into data science or aiming to strengthen existing skills, this book offers practical insights to reach your goals.

Interval Finite Element Method with MATLAB

2018-01-23 · O'Reilly Data Science Books O'Reilly Amazon

book

by Sukanta Nayak , Snehashish Chakraverty

MATLAB data data-science-tools

Interval Finite Element Method with MATLAB provides a thorough introduction to an effective way of investigating problems involving uncertainty using computational modeling. The well-known and versatile Finite Element Method (FEM) is combined with the concept of interval uncertainties to develop the Interval Finite Element Method (IFEM). An interval or stochastic environment in parameters and variables is used in place of crisp ones to make the governing equations interval, thereby allowing modeling of the problem. The concept of interval uncertainties is systematically explained. Several examples are explored with IFEM using MATLAB on topics like spring mass, bar, truss and frame. Provides a systematic approach to understanding the interval uncertainties caused by vague or imprecise data Describes the interval finite element method in detail Gives step-by-step instructions for how to use MATLAB code for IFEM Provides a range of examples of IFEM in use, with accompanying MATLAB codes

Complex Network Analysis in Python

2018-01-19 · O'Reilly Data Science Books O'Reilly Amazon

book

by Dmitry Zinoviev

Analytics Marketing Matplotlib NumPy Pandas Python data data-science-tasks data-visualization gephi

Construct, analyze, and visualize networks with networkx, a Python language module. Network analysis is a powerful tool you can apply to a multitude of datasets and situations. Discover how to work with all kinds of networks, including social, product, temporal, spatial, and semantic networks. Convert almost any real-world data into a complex network--such as recommendations on co-using cosmetic products, muddy hedge fund connections, and online friendships. Analyze and visualize the network, and make business decisions based on your analysis. If you're a curious Python programmer, a data scientist, or a CNA specialist interested in mechanizing mundane tasks, you'll increase your productivity exponentially. Complex network analysis used to be done by hand or with non-programmable network analysis tools, but not anymore! You can now automate and program these tasks in Python. Complex networks are collections of connected items, words, concepts, or people. By exploring their structure and individual elements, we can learn about their meaning, evolution, and resilience. Starting with simple networks, convert real-life and synthetic network graphs into networkx data structures. Look at more sophisticated networks and learn more powerful machinery to handle centrality calculation, blockmodeling, and clique and community detection. Get familiar with presentation-quality network visualization tools, both programmable and interactive--such as Gephi, a CNA explorer. Adapt the patterns from the case studies to your problems. Explore big networks with NetworKit, a high-performance networkx substitute. Each part in the book gives you an overview of a class of networks, includes a practical study of networkx functions and techniques, and concludes with case studies from various fields, including social networking, anthropology, marketing, and sports analytics. Combine your CNA and Python programming skills to become a better network analyst, a more accomplished data scientist, and a more versatile programmer. What You Need: You will need a Python 3.x installation with the following additional modules: Pandas (>=0.18), NumPy (>=1.10), matplotlib (>=1.5), networkx (>=1.11), python-louvain (>=0.5), NetworKit (>=3.6), and generalizesimilarity. We recommend using the Anaconda distribution that comes with all these modules, except for python-louvain, NetworKit, and generalizedsimilarity, and works on all major modern operating systems.

Analyzing Baseball Data with R

2018-01-17 · O'Reilly Data Science Books O'Reilly Amazon

book

by Max Marchi , Jim Albert

Data Management data data-science-tools r

With its flexible capabilities and open-source platform, R has become a major tool for analyzing detailed, high-quality baseball data. Analyzing Baseball Data with R provides an introduction to R for sabermetricians, baseball enthusiasts, and students interested in exploring the rich sources of baseball data. It equips readers with the necessary skills and software tools to perform all of the analysis steps, from gathering the datasets and entering them in a convenient format to visualizing the data via graphs to performing a statistical analysis. The authors first present an overview of publicly available baseball datasets and a gentle introduction to the type of data structures and exploratory and data management capabilities of R. They also cover the traditional graphics functions in the base package and introduce more sophisticated graphical displays available through the lattice and ggplot2 packages. Much of the book illustrates the use of R through popular sabermetrics topics, including the Pythagorean formula, runs expectancy, career trajectories, simulation of games and seasons, patterns of streaky behavior of players, and fielding measures. Each chapter contains exercises that encourage readers to perform their own analyses using R. All of the datasets and R code used in the text are available online. This book helps readers answer questions about baseball teams, players, and strategy using large, publically available datasets. It offers detailed instructions on downloading the datasets and putting them into formats that simplify data exploration and analysis. Through the book’s various examples, readers will learn about modern sabermetrics and be able to conduct their own baseball analyses.

talk-data.com

Activity Trend

Top Events

Top Speakers

HBR Guide to Data Analytics Basics for Managers (HBR Guide Series)

The Routledge Companion to Trust

Data Stewardship for Open Science

Business Case Analysis with R: Simulation Tutorials to Support Complex Business Decisions

SQL Server 2017 Machine Learning Services with R

A Practical Guide for Informationists

Practical Data Science: A Guide to Building the Technology Stack for Turning Data Lakes into Business Assets

R Projects For Dummies

Python Web Scraping Cookbook

SAS Viya

An Introduction to Discrete-Valued Time Series

Loss Data Analysis

Market Data Analysis Using JMP

Predictive Analytics for Marketers

An Introduction to SAS University Edition

Simulation of Dynamic Systems with MATLAB® and Simulink®, 3rd Edition

Regression Analysis with R

Interval Finite Element Method with MATLAB

Complex Network Analysis in Python

Analyzing Baseball Data with R