talk-data.com

Event

O'Reilly Data Science Books

2013-08-09 – 2026-02-25 Oreilly

Activities tracked

2093

Collection of O'Reilly books on Data Science.

Sessions & talks

Showing 726–750 of 2093 · Newest first

R Graphics, 2nd Edition

Extensively updated to reflect the evolution of statistics and computing, the second edition of the bestselling R Graphics comes complete with new packages and new examples. Paul Murrell, widely known as the leading expert on R graphics, has developed an in-depth resource that helps both neophyte and seasoned users master the intricacies of R graph

R Graphics Cookbook, 2nd Edition

This O’Reilly cookbook provides more than 150 recipes to help scientists, engineers, programmers, and data analysts generate high-quality graphs quickly—without having to comb through all the details of R’s graphing systems. Each recipe tackles a specific problem with a solution you can apply to your own project and includes a discussion of how and why the recipe works. Most of the recipes in this second edition use the updated version of the ggplot2 package, a powerful and flexible way to make graphs in R. You’ll also find expanded content about the visual design of graphics. If you have at least a basic understanding of the R language, you’re ready to get started with this easy-to-use reference. Use R’s default graphics for quick exploration of data Create a variety of bar graphs, line graphs, and scatter plots Summarize data distributions with histograms, density curves, box plots, and more Provide annotations to help viewers interpret data Control the overall appearance of graphics Explore options for using colors in plots Create network graphs, heat maps, and 3D scatter plots Get your data into shape using packages from the tidyverse

Applied Health Analytics and Informatics Using SAS

Leverage health data into insight! Applied Health Analytics and Informatics Using SAS describes health anamatics, a result of the intersection of data analytics and health informatics. Healthcare systems generate nearly a third of the world’s data, and analytics can help to eliminate medical errors, reduce readmissions, provide evidence-based care, demonstrate quality outcomes, and add cost-efficient care. This comprehensive textbook includes data analytics and health informatics concepts, along with applied experiential learning exercises and case studies using SAS Enterprise Miner™ within the healthcare industry setting. Topics covered include: Sampling and modeling health data – both structured and unstructured Exploring health data quality Developing health administration and health data assessment procedures Identifying future health trends Analyzing high-performance health data mining models Applied Health Analytics and Informatics Using SAS is intended for professionals, lifelong learners, senior-level undergraduates, graduate-level students in professional development courses, health informatics courses, health analytics courses, and specialized industry track courses. This textbook is accessible to a wide variety of backgrounds and specialty areas, including administrators, clinicians, and executives. This book is part of the SAS Press program.

Microsoft Power BI Dashboards Step by Step, First Edition

Your hands-on guide to building effective Power BI dashboards Expand your expertise–and teach yourself how to create world-class Power BI business analysis dashboards that bring data to life for better decision-making. If you're an experienced business intelligence professional or manager, you'll get all the guidance, examples, and code you need to succeed–even if you've never used Power BI before. Successfully design, architect, and implement Power BI in your organization Take full advantage of any Microsoft Power BI platform, including Power BI Premium Make upfront decisions that position your Power BI project for success Build rich, live dashboards to monitor crucial data from across your organization Aggregate data and data elements from numerous internal and external data sources Develop dynamic visualizations, including charts, maps, and graphs Bring data to life with stunning interactive reports Ensure dashboard security and compliance Drive user adoption through effective training

Ensemble Classification Methods with Applications in R

An essential guide to two burgeoning topics in machine learning – classification trees and ensemble learning Ensemble Classification Methods with Applications in R introduces the concepts and principles of ensemble classifiers methods and includes a review of the most commonly used techniques. This important resource shows how ensemble classification has become an extension of the individual classifiers. The text puts the emphasis on two areas of machine learning: classification trees and ensemble learning. The authors explore ensemble classification methods’ basic characteristics and explain the types of problems that can emerge in its application. Written by a team of noted experts in the field, the text is divided into two main sections. The first section outlines the theoretical underpinnings of the topic and the second section is designed to include examples of practical applications. The book contains a wealth of illustrative cases of business failure prediction, zoology, ecology and others. This vital guide: Offers an important text that has been tested both in the classroom and at tutorials at conferences Contains authoritative information written by leading experts in the field Presents a comprehensive text that can be applied to courses in machine learning, data mining and artificial intelligence Combines in one volume two of the most intriguing topics in machine learning: ensemble learning and classification trees Written for researchers from many fields such as biostatistics, economics, environment, zoology, as well as students of data mining and machine learning, Ensemble Classification Methods with Applications in R puts the focus on two topics in machine learning: classification trees and ensemble learning.

Learning Apache Drill

Get up to speed with Apache Drill, an extensible distributed SQL query engine that reads massive datasets in many popular file formats such as Parquet, JSON, and CSV. Drill reads data in HDFS or in cloud-native storage such as S3 and works with Hive metastores along with distributed databases such as HBase, MongoDB, and relational databases. Drill works everywhere: on your laptop or in your largest cluster. In this practical book, Drill committers Charles Givre and Paul Rogers show analysts and data scientists how to query and analyze raw data using this powerful tool. Data scientists today spend about 80% of their time just gathering and cleaning data. With this book, you’ll learn how Drill helps you analyze data more effectively to drive down time to insight. Use Drill to clean, prepare, and summarize delimited data for further analysis Query file types including logfiles, Parquet, JSON, and other complex formats Query Hadoop, relational databases, MongoDB, and Kafka with standard SQL Connect to Drill programmatically using a variety of languages Use Drill even with challenging or ambiguous file formats Perform sophisticated analysis by extending Drill’s functionality with user-defined functions Facilitate data analysis for network security, image metadata, and machine learning

R Web Scraping Quick Start Guide

Discover the essentials of web scraping with R through this comprehensive guide. In this book, you will learn powerful techniques to extract valuable data from websites using the R programming language and tools like rvest and RSelenium. By understanding how to write efficient scripts, you will gain the ability to automate data collection and analysis for your projects. What this Book will help me do Understand the fundamentals of web scraping and its applications. Master the use of rvest for extracting data from static websites. Learn advanced techniques for dynamic websites using RSelenium. Write effective RegEx and XPath rules to enhance data extraction. Store, manage, and visualize the scraped data efficiently. Author(s) Aydin is an experienced data analyst and R programmer with a deep passion for data manipulation and analysis. With years of firsthand expertise in utilizing R for various data-related tasks, Aydin brings a practical and methodological approach to teaching complex concepts. His clear instruction style ensures that readers quickly grasp and apply the techniques taught in this book. Who is it for? This book is ideal for R programmers seeking to expand their skills by delving into web scraping techniques. Whether you are a beginner with a basic knowledge of R or a data analyst exploring new ways to extract and utilize data, this guide is tailored for you. It suits readers who aspire to automate data collection and expand their analytical capabilities.

INFORMS Analytics Body of Knowledge

Standardizes the definition and framework of analytics ABOK stands for Analytics Body of Knowledge. Based on the authors’ definition of analytics—which is “a process by which a team of people helps an organization make better decisions (the objective) through the analysis of data (the activity)”— this book from Institute for Operations Research and the Management Sciences (INFORMS) represents the perspectives of some of the most respected experts on analytics. The INFORMS ABOK documents the core concepts and skills with which an analytics professional should be familiar; establishes a dynamic resource that will be used by practitioners to increase their understanding of analytics; and, presents instructors with a framework for developing academic courses and programs in analytics. The INFORMS ABOK offers in-depth insight from peer-reviewed chapters that provide readers with a better understanding of the dynamic field of analytics. Chapters cover: Introduction to Analytics; Getting Started with Analytics; The Analytics Team; The Data; Solution Methodology; Model Building; Machine Learning; Deployment and Life Cycle Management; and The Blossoming Analytics Talent Pool: An Overview of the Analytics Ecosystem. Across industries and academia, readers with various backgrounds in analytics – from novices who are interested in learning more about the basics of analytics to experienced professionals who want a different perspective on some aspect of analytics – will benefit from reading about and implementing the concepts and methods covered by the INFORMS ABOK.

Matplotlib 3.0 Cookbook

Matplotlib 3.0 Cookbook is your go-to guide for mastering the Matplotlib library in Python for creating a wide range of data visualizations. Through 150+ practical recipes, you will learn how to design intuitive and detailed charts, graphs, and dashboards, navigating from simple plots to advanced interactive and 3D visualizations. What this Book will help me do Develop professional-quality data visualizations using Matplotlib. Leverage Matplotlib's API for both quick plotting and advanced customization. Create interactive and animated plots for engaging data representation. Extend Matplotlib functionalities with toolkits like cartopy and axisartist. Integrate Matplotlib figures into GUI applications for broader usage. Author(s) Poladi and Borkar are experienced Python developers and enthusiasts who have collaborated in creating a resourceful guide to Matplotlib. They bring extensive experience in data science visualization and Python programming. Their collaborative effort ensures clarity and an approachable learning curve for anyone delving into graphical data representation using Matplotlib. Who is it for? This book is ideal for data scientists, Python developers, and visualization enthusiasts eager to enhance their technical plotting skills. The content covers both fundamentals and advanced topics, suitable for users ranging from beginners curious about Python visualization to experts seeking streamlined workflows and advanced techniques.

Pervasive Intelligence Now

This book looks at strategies to help companies become more intelligent, connected, and agile. It discusses how companies can define and measure high-impact outcomes and effectively use analytics technology to achieve them. It also looks at the technology needed to implement the analytics necessary to achieve high-impact outcomes, from both an analytics tool and a technical infrastructure perspective. Also discussed are ancillary, but critical, topics such as data security and governance that may not traditionally be a part of analytics discussions but are essential in helping companies maintain a secure environment for their analytics and access the quality data they need to gain critical insights and drive better decision-making.

Jump into JMP Scripting, Second Edition, 2nd Edition

Learn the essentials of the JMP Scripting Language with this beginner’s guide. Written in an easy-to-understand style based on the authors’ extensive experience, Jump into JMP Scripting, Second Edition teaches beginner scripters how to take advantage of the robust JMP Scripting Language (JSL) using step-by-step instructions and real-world situations. The authors demonstrate how JSL offers the freedom to create scripts from the very simple and specific to the most generic and complex. With a new chapter on JSL language foundations, the first half of the book explains the fundamentals of JSL and walks you through creating your first scripts, such as opening a data table, adding columns, or selecting rows. A new chapter on the Dashboard and Application Builders provides helpful tips on creating custom dashboards and learning how to build applications. Also new to this edition, a chapter on advanced topics introduces more helpful tools and concepts in JSL. After learning the basics, you are ready to tackle specific tasks using JSL. The second half of the book provides more than 50 examples using a unique question-and-answer format. This book is part of the SAS Press program.

Data Analytics for IT Networks: Developing Innovative Use Cases, First Edition

Use data analytics to drive innovation and value throughout your network infrastructure Network and IT professionals capture immense amounts of data from their networks. Buried in this data are multiple opportunities to solve and avoid problems, strengthen security, and improve network performance. To achieve these goals, IT networking experts need a solid understanding of data science, and data scientists need a firm grasp of modern networking concepts. Data Analytics for IT Networks fills these knowledge gaps, allowing both groups to drive unprecedented value from telemetry, event analytics, network infrastructure metadata, and other network data sources. Drawing on his pioneering experience applying data science to large-scale Cisco networks, John Garrett introduces the specific data science methodologies and algorithms network and IT professionals need, and helps data scientists understand contemporary network technologies, applications, and data sources. After establishing this shared understanding, Garrett shows how to uncover innovative use cases that integrate data science algorithms with network data. He concludes with several hands-on, Python-based case studies reflecting Cisco Customer Experience (CX) engineers’ supporting its largest customers. These are designed to serve as templates for developing custom solutions ranging from advanced troubleshooting to service assurance. 
Understand the data analytics landscape and its opportunities in Networking See how elements of an analytics solution come together in the practical use cases Explore and access network data sources, and choose the right data for your problem Innovate more successfully by understanding mental models and cognitive biases Walk through common analytics use cases from many industries, and adapt them to your environment Uncover new data science use cases for optimizing large networks Master proven algorithms, models, and methodologies for solving network problems Adapt use cases built with traditional statistical methods Use data science to improve network infrastructure analysis Analyze control and data planes with greater sophistication Fully leverage your existing Cisco tools to collect, analyze, and visualize data

Communication Systems Principles Using MATLAB

Discover the basic telecommunications systems principles in an accessible learn-by-doing format Communication Systems Principles Using MATLAB covers a variety of systems principles in telecommunications in an accessible format without the need to master a large body of theory. The text puts the focus on topics such as radio and wireless modulation, reception and transmission, wired networks and fiber optic communications. The book also explores packet networks and TCP/IP as well as digital source and channel coding, and the fundamentals of data encryption. Since MATLAB® is widely used by telecommunications engineers, it was chosen as the vehicle to demonstrate many of the basic ideas, with code examples presented in every chapter. The text addresses digital communications with coverage of packet-switched networks. Many fundamental concepts such as routing via shortest-path are introduced with simple and concrete examples. The treatment of advanced telecommunications topics extends to OFDM for wireless modulation, and public-key exchange algorithms for data encryption. Throughout the book, the author puts the emphasis on understanding rather than memorization. The text also: Includes many useful take-home skills that can be honed while studying each aspect of telecommunications Offers a coding and experimentation approach with many real-world examples provided Gives information on the underlying theory in order to better understand conceptual developments Suggests a valuable learn-by-doing approach to the topic Written for students of telecommunications engineering, Communication Systems Principles Using MATLAB® is the hands-on resource for mastering the basic concepts of telecommunications in a learn-by-doing format.

Handbook of Healthcare Analytics

How can analytics scholars and healthcare professionals access the most exciting and important healthcare topics and tools for the 21st century? Editors Tinglong Dai and Sridhar Tayur, aided by a team of internationally acclaimed experts, have curated this timely volume to help newcomers and seasoned researchers alike to rapidly comprehend a diverse set of thrusts and tools in this rapidly growing cross-disciplinary field. The Handbook covers a wide range of macro-, meso- and micro-level thrusts—such as market design, competing interests, global health, personalized medicine, residential care and concierge medicine, among others—and structures what has been a highly fragmented research area into a coherent scientific discipline. The handbook also provides an easy-to-comprehend introduction to five essential research tools—Markov decision process, game theory and information economics, queueing games, econometric methods, and data science—by illustrating their uses and applicability on examples from diverse healthcare settings, thus connecting tools with thrusts. The primary audience of the Handbook includes analytics scholars interested in healthcare and healthcare practitioners interested in analytics. This Handbook: Instills analytics scholars with a way of thinking that incorporates behavioral, incentive, and policy considerations in various healthcare settings. This change in perspective—a shift in gaze away from narrow, local and one-off operational improvement efforts that do not replicate, scale or remain sustainable—can lead to new knowledge and innovative solutions that healthcare has been seeking so desperately. Facilitates collaboration between healthcare experts and analytics scholars to frame and tackle their pressing concerns through appropriate modern mathematical tools designed for this very purpose.
The handbook is designed to be accessible to the independent reader, and it may be used in a variety of settings, from a short lecture series on specific topics to a semester-long course.

Data Professionals at Work

Enjoy reading interviews with more than two dozen data professionals to see a picture of what it’s like to work in the industry managing and analyzing data, helping you to know what it takes to move from your current expertise into one of the fastest growing areas of technology today. Data is the hottest word of the century, and data professionals are in high demand. You may already be a data professional such as a database administrator or business intelligence analyst. Or you may be one of the many people who want to work as a data professional, and are curious how to get there. Either way, this collection helps you understand how data professionals work, what makes them successful, and what they do to keep up. You’ll find interviews in this book with database administrators, database programmers, data architects, business intelligence professionals, and analytics professionals. Interviewees work across industry sectors ranging from healthcare and banking to finance and transportation and beyond. Each chapter illuminates a successful professional at the top of their game, who shares what helped them get to the top, and what skills and attitudes combine to make them successful in their respective fields. Interviewees in the book include: Mindy Curnutt, Julie Smith, Kenneth Fisher, Andy Leonard, Jes Borland, Kevin Feasel, Ginger Grant, Vicky Harp, Kendra Little, Jason Brimhall, Tim Costello, Andy Mallon, Steph Locke, Jonathan Stewart, Joseph Sack, John Q. Martin, John Morehouse, Kathi Kellenberger, Argenis Fernandez, Kirsten Benzel, Tracy Boggiano, Dave Walden, Matt Gordon, Jimmy May, Drew Furgiuele, Marlon Ribunal, and Joseph Fleming. All of them have been successful in their careers, and share their perspectives on working and succeeding in the field as data and database professionals.
What You'll Learn Stand out as an outstanding professional in your area of data work by developing the right set of skills and attitudes that lead to success Avoid common mistakes and pitfalls, and recover from operational failures and bad technology decisions Understand current trends and best practices, and stay out in front as the field evolves Break into working with data through database administration, business intelligence, or any of the other career paths represented in this book Manage stress and develop a healthy work-life balance no matter which career path you decide upon Choose a suitable path for yourself from among the different career paths in working with data Who This Book Is For Database administrators and developers, database and business intelligence architects, consultants, and analytic professionals, as well as those intent on moving into one of those career paths. Aspiring data professionals and those in related technical fields who want to make a move toward managing or analyzing data on a full-time basis will find the book useful. Existing data professionals who want to be outstanding and successful at what they do will also appreciate the book's advice and guidance.

Collect, Combine, and Transform Data Using Power Query in Excel and Power BI, First Edition

Using Power Query, you can import, reshape, and cleanse any data from a simple interface, so you can mine that data for all of its hidden insights. Power Query is embedded in Excel, Power BI, and other Microsoft products, and leading Power Query expert Gil Raviv will help you make the most of it. Discover how to eliminate time-consuming manual data preparation, solve common problems, avoid pitfalls, and more. Then, walk through several complete analytics challenges, and integrate all your skills in a realistic chapter-length final project. By the time you're finished, you'll be ready to wrangle any data–and transform it into actionable knowledge. Prepare and analyze your data the easy way, with Power Query · Quickly prepare data for analysis with Power Query in Excel (also known as Get & Transform) and in Power BI · Solve common data preparation problems with a few mouse clicks and simple formula edits · Combine data from multiple sources, multiple queries, and mismatched tables · Master basic and advanced techniques for unpivoting tables · Customize transformations and build flexible data mashups with the M formula language · Address collaboration challenges with Power Query · Gain crucial insights into text feeds · Streamline complex social network analytics so you can do it yourself For all information workers, analysts, and any Excel user who wants to solve their own business intelligence problems.

Continuous Time Dynamical Systems

This book presents the developments in problems of state estimation and optimal control of continuous-time dynamical systems using orthogonal functions since 1975. It deals with both full and reduced-order state estimation and problems of linear time-invariant systems. It also addresses optimal control problems of varieties of continuous-time systems such as linear and nonlinear systems, time-invariant and time-varying systems, as well as delay-free and time-delay systems. Content focuses on development of recursive algorithms for studying state estimation and optimal control problems.

Douglas Montgomery's Introduction to Statistical Quality Control

Master Statistical Quality Control using JMP! Using examples from the popular textbook by Douglas Montgomery, Introduction to Statistical Quality Control: A JMP Companion demonstrates the powerful Statistical Quality Control (SQC) tools found in JMP. Geared toward students and practitioners of SQC who are using these techniques to monitor and improve products and processes, this companion provides step-by-step instructions on how to use JMP to generate the output and solutions found in Montgomery’s book. The authors combine their many years of experience as passionate practitioners of SQC and their expertise using JMP to highlight the recent advances in JMP’s Analyze menu, and in particular, Quality and Process. Key JMP platforms include: Control Chart Builder CUSUM Control Chart Control Chart (XBar, IR, P, NP, C, U, UWMA, EWMA, CUSUM) Process Screening Process Capability Measurement System Analysis Time Series Multivariate Control Chart Multivariate and Principal Components Distribution For anyone who wants to learn how to use JMP to more easily explore data using tools associated with Statistical Process Control, Process Capability Analysis, Measurement System Analysis, Advanced Statistical Process Control, and Process Health Assessment, this book is a must!

Computation for Humanity

This book discusses various aspects of computational science and engineering in combination with applied science. It highlights sustainability to demonstrate how computation can improve different aspects of life. The editors provide a collection of numerous computation-related projects that form a foundation from which to cross-pollinate between different disciplines and further extensive collaboration. They present a clear and profound understanding of computing in today's world. The detailed application examples provide the reader with contributions that behold approaches to provide fundamental solutions to some of the most pertinent humanity-related problems.

Pharmaceutical Quality by Design Using JMP

Solve your pharmaceutical product development and manufacturing problems using JMP. Pharmaceutical Quality by Design Using JMP: Solving Product Development and Manufacturing Problems provides broad-based techniques available in JMP to visualize data and run statistical analyses for areas common in healthcare product manufacturing. As international regulatory agencies push the concept of Quality by Design (QbD), there is a growing emphasis to optimize the processing of products. This book uses practical examples from the pharmaceutical and medical device industries to illustrate easy-to-understand ways of incorporating QbD elements using JMP. Pharmaceutical Quality by Design Using JMP opens by demonstrating the easy navigation of JMP to visualize data through the distribution function and the graph builder and then highlights the following: the powerful dynamic nature of data visualization that enables users to quickly extract meaningful information; tools and techniques designed for the use of structured, multivariate sets of experiments; and examples of complex analysis unique to healthcare products such as particle size distributions/drug dissolution, stability of drug products over time, and blend uniformity/content uniformity. Scientists, engineers, and technicians involved throughout the pharmaceutical and medical device product life cycles will find this book invaluable. This book is part of the SAS Press program.

Getting Started with Tableau 2018.x

Dive into the world of data visualization with "Getting Started with Tableau 2018.x." This comprehensive guide introduces you to both the fundamental and advanced functionalities of Tableau 2018.x, making it easier to create impactful data visualizations. Learn to unlock Tableau's full potential through practical examples and clear explanations. What this Book will help me do Understand the new Tableau 2018.x features like density, extensions, and transparency and how to leverage them. Learn how to connect to data sources, perform transformations, and build efficient data models to support your analysis. Master visualization techniques to design effective and insightful dashboards tailored to business needs. Explore advanced concepts such as calculations, cross-database joins, and data blending to handle complex scenarios. Develop the confidence to publish and interact with content on Tableau Server and share your insights effectively. Author(s) Guillevin and Pires are data visualization experts with extensive experience using Tableau. They aim to make data analysis accessible through hands-on examples and easy-to-follow explanations. Their writing balances clear instruction with practical application, making advanced concepts understandable for all readers. Who is it for? This book is ideal for beginners or experienced BI professionals who wish to gain expertise in Tableau 2018.x. It caters to aspiring analysts and business professionals looking to answer complex business-specific questions through data visualization. Regardless of prior experience in Tableau or other BI tools, this book provides value through a structured learning approach.

MicroStrategy Quick Start Guide

In 'MicroStrategy Quick Start Guide,' you'll learn how to transform your raw business data into actionable insights using MicroStrategy. The book covers everything from setting up and configuring MicroStrategy tools to creating insightful dashboards and managing BI solutions from start to finish. What this Book will help me do Configure the MicroStrategy Intelligence Server and essential tools. Create and utilize MicroStrategy Projects and manage metadata repositories. Design effective MicroStrategy Reports to retrieve key business insights. Develop engaging dashboards for advanced data visualization and storytelling. Administer and secure your MicroStrategy BI solutions for stable operation. Author(s) Rivero Esqueda brings their extensive experience in Business Intelligence solutions to this practical guide. Known for their expertise in MicroStrategy, they are passionate about empowering data analysts and BI professionals to leverage data for better decisions. Their professional insight and accessible approach make this book a valuable resource for readers at all levels. Who is it for? This book is ideal for Business Intelligence professionals or data analysts looking to explore MicroStrategy as their primary BI tool. Readers should have a basic understanding of BI concepts and data analysis. It is tailored to suit beginners as well as professionals transitioning to MicroStrategy. If you are eager to create impactful visualizations and dashboards while mastering MicroStrategy, this is the perfect guide for you.