talk-data.com talk-data.com

Topic

data-science

2091

tagged

Activity Trend

1 peak/qtr
2020-Q1 2026-Q1

Activities

Showing filtered results

Filtering by: O'Reilly Data Science Books ×
Reliability Prediction and Testing Textbook

This textbook reviews the methodologies of reliability prediction as currently used in industries such as electronics, automotive, aircraft, aerospace, off-highway, farm machinery, and others. It then discusses why these are not successful; and, presents methods developed by the authors for obtaining accurate information for successful prediction. The approach is founded on approaches that accurately duplicate the real world use of the product. Their approach is based on two fundamental components needed for successful reliability prediction; first, the methodology necessary; and, second, use of accelerated reliability and durability testing as a source of the necessary data. Applicable to all areas of engineering, this textbook details the newest techniques and tools to achieve successful reliabilityprediction and testing. It demonstrates practical examples of the implementation of the approaches described. This book is a tool for engineers, managers, researchers, in industry, teachers, and students. The reader will learn the importance of the interactions of the influencing factors and the interconnections of safety and human factors in product prediction and testing.

Programming Skills for Data Science: Start Writing Code to Wrangle, Analyze, and Visualize Data with R, First Edition

The Foundational Hands-On Skills You Need to Dive into Data Science “Freeman and Ross have created the definitive resource for new and aspiring data scientists to learn foundational programming skills.” –From the foreword by Jared Lander, series editor Using data science techniques, you can transform raw data into actionable insights for domains ranging from urban planning to precision medicine. brings together all the foundational skills you need to get started, even if you have no programming or data science experience. Programming Skills for Data Science Leading instructors Michael Freeman and Joel Ross guide you through installing and configuring the tools you need to solve professional-level data science problems, including the widely used R language and Git version-control system. They explain how to wrangle your data into a form where it can be easily used, analyzed, and visualized so others can see the patterns you've uncovered. Step by step, you'll master powerful R programming techniques and troubleshooting skills for probing data in new ways, and at larger scales. Freeman and Ross teach through practical examples and exercises that can be combined into complete data science projects. Everything's focused on real-world application, so you can quickly start analyzing your own data and getting answers you can act upon. Learn to Install your complete data science environment, including R and RStudio Manage projects efficiently, from version tracking to documentation Host, manage, and collaborate on data science projects with GitHub Master R language fundamentals: syntax, programming concepts, and data structures Load, format, explore, and restructure data for successful analysis Interact with databases and web APIs Master key principles for visualizing data accurately and intuitively Produce engaging, interactive visualizations with ggplot and other R packages Transform analyses into sharable documents and sites with R Markdown Create interactive web data science applications with Shiny Collaborate smoothly as part of a data science team Register your book for convenient access to downloads, updates, and/or corrections as they become available. See inside book for details.

Pro Power BI Architecture: Sharing, Security, and Deployment Options for Microsoft Power BI Solutions

Architect and deploy a Power BI solution. This book will help you understand the many available options and choose the best combination for hosting, developing, sharing, and deploying a Power BI solution within your organization. Pro Power BI Architecture provides detailed examples and explains the different methods available for sharing and securing Power BI content so that only intended recipients can see it. Commonly encountered problems you will learn to handle include content unexpectedly changing while users are in the process of creating reports and building analysis, methods of sharing analyses that don’t cover all the requirements of your business or organization, and inconsistent security models. The knowledge provided in this book will allow you to choose an architecture and deployment model that suits the needs of your organization, ensuring that you do not spend your time maintaining your solution but onusing it for its intended purpose and gaining business value from mining and analyzing your organization’s data. What You'll Learn Architect and administer enterprise-level Power BI solutions Choose the right sharing method for your Power BI solution Create and manage environments for development, testing, and production Implement row level security in multiple ways to secure your data Save money by choosing the right licensing plan Select a suitable connection type—Live Connection, DirectQuery, or Scheduled Refresh—for your use case Set up a Power BI gateway to bridge between on-premises data sources and the Power BI cloud service Who This Book Is For Data analysts, developers, architects, and managers who want to leverage Power BI for their reporting solution

R Graphics Cookbook, 2nd Edition

This O’Reilly cookbook provides more than 150 recipes to help scientists, engineers, programmers, and data analysts generate high-quality graphs quickly—without having to comb through all the details of R’s graphing systems. Each recipe tackles a specific problem with a solution you can apply to your own project and includes a discussion of how and why the recipe works. Most of the recipes in this second edition use the updated version of the ggplot2 package, a powerful and flexible way to make graphs in R. You’ll also find expanded content about the visual design of graphics. If you have at least a basic understanding of the R language, you’re ready to get started with this easy-to-use reference. Use R’s default graphics for quick exploration of data Create a variety of bar graphs, line graphs, and scatter plots Summarize data distributions with histograms, density curves, box plots, and more Provide annotations to help viewers interpret data Control the overall appearance of graphics Explore options for using colors in plots Create network graphs, heat maps, and 3D scatter plots Get your data into shape using packages from the tidyverse

Applied Health Analytics and Informatics Using SAS

Leverage health data into insight! Applied Health Analytics and Informatics Using SAS describes health anamatics, a result of the intersection of data analytics and health informatics. Healthcare systems generate nearly a third of the world’s data, and analytics can help to eliminate medical errors, reduce readmissions, provide evidence-based care, demonstrate quality outcomes, and add cost-efficient care. This comprehensive textbook includes data analytics and health informatics concepts, along with applied experiential learning exercises and case studies using SAS Enterprise MinerTM within the healthcare industry setting. Topics covered include: Sampling and modeling health data – both structured and unstructured Exploring health data quality Developing health administration and health data assessment procedures Identifying future health trends Analyzing high-performance health data mining models Applied Health Analytics and Informatics Using SAS is intended for professionals, lifelong learners, senior-level undergraduates, graduate-level students in professional development courses, health informatics courses, health analytics courses, and specialized industry track courses. This textbook is accessible to a wide variety of backgrounds and specialty areas, including administrators, clinicians, and executives. This book is part of the SAS Press program.

Microsoft Power BI Dashboards Step by Step, First Edition

Your hands-on guide to building effective Power BI dashboards Expand your expertise–and teach yourself how to create world-class Power BI business analysis dashboards that bring data to life for better decision-making. If you're an experienced business intelligence professional or manager, you'll get all the guidance, examples, and code you need to succeed–even if you've never used Power BI before. Successfully design, architect, and implement Power BI in your organization Take full advantage of any Microsoft Power BI platform, including Power BI Premium Make upfront decisions that position your Power BI project for success Build rich, live dashboards to monitor crucial data from across your organization Aggregate data and data elements from numerous internal and external data sources Develop dynamic visualizations, including charts, maps, and graphs Bring data to life with stunning interactive reports Ensure dashboard security and compliance Drive user adoption through effective training

Ensemble Classification Methods with Applications in R

An essential guide to two burgeoning topics in machine learning – classification trees and ensemble learning Ensemble Classification Methods with Applications in R introduces the concepts and principles of ensemble classifiers methods and includes a review of the most commonly used techniques. This important resource shows how ensemble classification has become an extension of the individual classifiers. The text puts the emphasis on two areas of machine learning: classification trees and ensemble learning. The authors explore ensemble classification methods’ basic characteristics and explain the types of problems that can emerge in its application. Written by a team of noted experts in the field, the text is divided into two main sections. The first section outlines the theoretical underpinnings of the topic and the second section is designed to include examples of practical applications. The book contains a wealth of illustrative cases of business failure prediction, zoology, ecology and others. This vital guide: Offers an important text that has been tested both in the classroom and at tutorials at conferences Contains authoritative information written by leading experts in the field Presents a comprehensive text that can be applied to courses in machine learning, data mining and artificial intelligence Combines in one volume two of the most intriguing topics in machine learning: ensemble learning and classification trees Written for researchers from many fields such as biostatistics, economics, environment, zoology, as well as students of data mining and machine learning, Ensemble Classification Methods with Applications in R puts the focus on two topics in machine learning: classification trees and ensemble learning.

Learning Apache Drill

Get up to speed with Apache Drill, an extensible distributed SQL query engine that reads massive datasets in many popular file formats such as Parquet, JSON, and CSV. Drill reads data in HDFS or in cloud-native storage such as S3 and works with Hive metastores along with distributed databases such as HBase, MongoDB, and relational databases. Drill works everywhere: on your laptop or in your largest cluster. In this practical book, Drill committers Charles Givre and Paul Rogers show analysts and data scientists how to query and analyze raw data using this powerful tool. Data scientists today spend about 80% of their time just gathering and cleaning data. With this book, you’ll learn how Drill helps you analyze data more effectively to drive down time to insight. Use Drill to clean, prepare, and summarize delimited data for further analysis Query file types including logfiles, Parquet, JSON, and other complex formats Query Hadoop, relational databases, MongoDB, and Kafka with standard SQL Connect to Drill programmatically using a variety of languages Use Drill even with challenging or ambiguous file formats Perform sophisticated analysis by extending Drill’s functionality with user-defined functions Facilitate data analysis for network security, image metadata, and machine learning

R Web Scraping Quick Start Guide

Discover the essentials of web scraping with R through this comprehensive guide. In this book, you will learn powerful techniques to extract valuable data from websites using the R programming language and tools like rvest and RSelenium. By understanding how to write efficient scripts, you will gain the ability to automate data collection and analysis for your projects. What this Book will help me do Understand the fundamentals of web scraping and its applications. Master the use of rvest for extracting data from static websites. Learn advanced techniques for dynamic websites using RSelenium. Write effective RegEx and XPath rules to enhance data extraction. Store, manage, and visualize the scraped data efficiently. Author(s) None Aydin is an experienced data analyst and R programmer with a deep passion for data manipulation and analysis. With years of firsthand expertise in utilizing R for various data-related tasks, Aydin brings a practical and methodological approach to teaching complex concepts. His clear instruction style ensures that readers quickly grasp and apply the techniques taught in this book. Who is it for? This book is ideal for R programmers seeking to expand their skills by delving into web scraping techniques. Whether you are a beginner with a basic knowledge of R or a data analyst exploring new ways to extract and utilize data, this guide is tailored for you. It suits readers who aspire to automate data collection and expand their analytical capabilities.

INFORMS Analytics Body of Knowledge

Standardizes the definition and framework of analytics ABOK stands for Analytics Body of Knowledge. Based on the authors’ definition of analytics—which is “a process by which a team of people helps an organization make better decisions (the objective) through the analysis of data (the activity)”— this book from Institute for Operations Research and the Management Sciences (INFORMS) represents the perspectives of some of the most respected experts on analytics. The INFORMS ABOK documents the core concepts and skills with which an analytics professional should be familiar; establishes a dynamic resource that will be used by practitioners to increase their understanding of analytics; and, presents instructors with a framework for developing academic courses and programs in analytics. The INFORMS ABOK offers in-depth insight from peer-reviewed chapters that provide readers with a better understanding of the dynamic field of analytics. Chapters cover: Introduction to Analytics; Getting Started with Analytics; The Analytics Team; The Data; Solution Methodology; Model Building; Machine Learning; Deployment and Life Cycle Management; and The Blossoming Analytics Talent Pool: An Overview of the Analytics Ecosystem. Across industries and academia, readers with various backgrounds in analytics – from novices who are interested in learning more about the basics of analytics to experienced professionals who want a different perspective on some aspect of analytics – will benefit from reading about and implementing the concepts and methods covered by the INFORMS ABOK.

Matplotlib 3.0 Cookbook

Matplotlib 3.0 Cookbook is your go-to guide for mastering the Matplotlib library in Python for creating a wide range of data visualizations. Through 150+ practical recipes, you will learn how to design intuitive and detailed charts, graphs, and dashboards, navigating from simple plots to advanced interactive and 3D visualizations. What this Book will help me do Develop professional-quality data visualizations using Matplotlib. Leverage Matplotlib's API for both quick plotting and advanced customization. Create interactive and animative plots for engaging data representation. Extend Matplotlib functionalities with toolkits like cartopy and axisartist. Integrate Matplotlib figures into GUI applications for broader usage. Author(s) None Poladi and None Borkar are experienced Python developers and enthusiasts who have collaborated in creating a resourceful guide to Matplotlib. They bring extensive experience in data science visualization and Python programming. Their collaborative effort ensures clarity and an approachable learning curve for anyone delving into graphical data representation using Matplotlib. Who is it for? This book is ideal for data scientists, Python developers, and visualization enthusiasts eager to enhance their technical plotting skills. The content covers both fundamentals and advanced topics, suitable for users ranging from beginners curious about Python visualization to experts seeking streamlined workflows and advanced techniques.

Pervasive Intelligence Now
  This book looks at strategies to help companies become more intelligent, connected, and agile. It discusses how companies can define and measure high-impact outcomes and use effectively analytics technology to achieve them. It also looks at the technology needed to implement the analytics necessary to achieve high-impact outcomes—from both analytics tool and technical infrastructure perspective. Also discussed are ancillary, but critical, topics such as data security and governance that may not traditionally be a part of analytics discussions but are essential in helping companies maintain a secure environment for their analytics and access the quality data they need to gain critical insights and drive better decision-making.
Jump into JMP Scripting, Second Edition, 2nd Edition

Learn the essentials of the JMP Scripting Language with this beginner’s guide. Written in an easy-to-understand style based on the authors’ extensive experience, Jump into JMP Scripting, Second Edition teaches beginner scripters how to take advantage of the robust JMP Scripting Language (JSL) using step-by-step instructions and real-world situations. The authors demonstrate how JSL offers the freedom to create scripts from the very simple and specific to the most generic and complex. With a new chapter on JSL language foundations, the first half of the book explains the fundamentals of JSL and walks you through creating your first scripts, such as opening a data table, adding columns, or selecting rows. A new chapter on the Dashboard and Application Builders provides helpful tips on creating custom dashboards and learning how to build applications. Also new to this edition, a chapter on advanced topics introduces more helpful tools and concepts in JSL. After learning the basics, you are ready to tackle specific tasks using JSL. The second half of the book provides more than 50 examples using a unique question-and-answer format. This book is part of the SAS Press program.

Data Analytics for IT Networks: Developing Innovative Use Cases, First Edition

Use data analytics to drive innovation and value throughout your network infrastructure Network and IT professionals capture immense amounts of data from their networks. Buried in this data are multiple opportunities to solve and avoid problems, strengthen security, and improve network performance. To achieve these goals, IT networking experts need a solid understanding of data science, and data scientists need a firm grasp of modern networking concepts. Data Analytics for IT Networks fills these knowledge gaps, allowing both groups to drive unprecedented value from telemetry, event analytics, network infrastructure metadata, and other network data sources. Drawing on his pioneering experience applying data science to large-scale Cisco networks, John Garrett introduces the specific data science methodologies and algorithms network and IT professionals need, and helps data scientists understand contemporary network technologies, applications, and data sources. After establishing this shared understanding, Garrett shows how to uncover innovative use cases that integrate data science algorithms with network data. He concludes with several hands-on, Python-based case studies reflecting Cisco Customer Experience (CX) engineers’ supporting its largest customers. These are designed to serve as templates for developing custom solutions ranging from advanced troubleshooting to service assurance. Understand the data analytics landscape and its opportunities in Networking See how elements of an analytics solution come together in the practical use cases Explore and access network data sources, and choose the right data for your problem Innovate more successfully by understanding mental models and cognitive biases Walk through common analytics use cases from many industries, and adapt them to your environment Uncover new data science use cases for optimizing large networks Master proven algorithms, models, and methodologies for solving network problems Adapt use cases built with traditional statistical methods Use data science to improve network infrastructure analysisAnalyze control and data planes with greater sophistication Fully leverage your existing Cisco tools to collect, analyze, and visualize data

Communication Systems Principles Using MATLAB

Discover the basic telecommunications systems principles in an accessible learn-by-doing format Communication Systems Principles Using MATLAB covers a variety of systems principles in telecommunications in an accessible format without the need to master a large body of theory. The text puts the focus on topics such as radio and wireless modulation, reception and transmission, wired networks and fiber optic communications. The book also explores packet networks and TCP/IP as well as digital source and channel coding, and the fundamentals of data encryption. Since MATLAB® is widely used by telecommunications engineers, it was chosen as the vehicle to demonstrate many of the basic ideas, with code examples presented in every chapter. The text addresses digital communications with coverage of packet-switched networks. Many fundamental concepts such as routing via shortest-path are introduced with simple and concrete examples. The treatment of advanced telecommunications topics extends to OFDM for wireless modulation, and public-key exchange algorithms for data encryption. Throughout the book, the author puts the emphasis on understanding rather than memorization. The text also: Includes many useful take-home skills that can be honed while studying each aspect of telecommunications Offers a coding and experimentation approach with many real-world examples provided Gives information on the underlying theory in order to better understand conceptual developments Suggests a valuable learn-by-doing approach to the topic Written for students of telecommunications engineering, Communication Systems Principles Using MATLAB® is the hands-on resource for mastering the basic concepts of telecommunications in a learn-by-doing format.

Handbook of Healthcare Analytics

How can analytics scholars and healthcare professionals access the most exciting and important healthcare topics and tools for the 21st century? Editors Tinglong Dai and Sridhar Tayur, aided by a team of internationally acclaimed experts, have curated this timely volume to help newcomers and seasoned researchers alike to rapidly comprehend a diverse set of thrusts and tools in this rapidly growing cross-disciplinary field. The Handbook covers a wide range of macro-, meso- and micro-level thrusts—such as market design, competing interests, global health, personalized medicine, residential care and concierge medicine, among others—and structures what has been a highly fragmented research area into a coherent scientific discipline. The handbook also provides an easy-to-comprehend introduction to five essential research tools—Markov decision process, game theory and information economics, queueing games, econometric methods, and data science—by illustrating their uses and applicability on examples from diverse healthcare settings, thus connecting tools with thrusts. The primary audience of the Handbook includes analytics scholars interested in healthcare and healthcare practitioners interested in analytics. This Handbook: Instills analytics scholars with a way of thinking that incorporates behavioral, incentive, and policy considerations in various healthcare settings. This change in perspective—a shift in gaze away from narrow, local and one-off operational improvement efforts that do not replicate, scale or remain sustainable—can lead to new knowledge and innovative solutions that healthcare has been seeking so desperately. Facilitates collaboration between healthcare experts and analytics scholar to frame and tackle their pressing concerns through appropriate modern mathematical tools designed for this very purpose. The handbook is designed to be accessible to the independent reader, and it may be used in a variety of settings, from a short lecture series on specific topics to a semester-long course.

Data Professionals at Work

Enjoy reading interviews with more than two dozen data professionals to see a picture of what it’s like to work in the industry managing and analyzing data, helping you to know what it takes to move from your current expertise into one of the fastest growing areas of technology today. Data is the hottest word of the century, and data professionals are in high demand. You may already be a data professional such as a database administrator or business intelligence analyst. Or you may be one of the many people who want to work as a data professional, and are curious how to get there. Either way, this collection helps you understand how data professionals work, what makes them successful, and what they do to keep up. You’ll find interviews in this book with database administrators, database programmers, data architects, business intelligence professionals, and analytics professionals. Interviewees work across industry sectors ranging from healthcare and banking tofinance and transportation and beyond. Each chapter illuminates a successful professional at the top of their game, who shares what helped them get to the top, and what skills and attitudes combine to make them successful in their respective fields. Interviewees in the book include: Mindy Curnutt, Julie Smith, Kenneth Fisher, Andy Leonard, Jes Borland, Kevin Feasel, Ginger Grant, Vicky Harp, Kendra Little, Jason Brimhall, Tim Costello, Andy Mallon, Steph Locke, Jonathan Stewart, Joseph Sack, John Q. Martin, John Morehouse, Kathi Kellenberger, Argenis Fernandez, Kirsten Benzel, Tracy Boggiano, Dave Walden, Matt Gordon, Jimmy May, Drew Furgiuele, Marlon Ribunal, and Joseph Fleming. All of them have been successful in their careers, and share their perspectives on working and succeeding in the field as data and database professionals. What You'll Learn Stand out as an outstanding professional in your area of data work by developing the right set of skills and attitudes that lead to success Avoid common mistakes and pitfalls, and recover from operational failures and bad technology decisions Understand current trends and best practices, and stay out in front as the field evolves Break into working with data through database administration, business intelligence, or any of the other career paths represented in this book Manage stress and develop a healthy work-life balance no matter which career path you decide upon Choose a suitable path for yourself from among the different career paths in working with data Who This Book Is For Database administrators and developers, database and business intelligence architects, consultants, and analytic professionals, as well as those intent on moving into one of those career paths. Aspiring data professionals and those in related technical fields who want to make a move toward managing or analyzing data on a full-time basis will find the book useful. Existing data professionals who want to be outstanding and successful at what they do will also appreciate the book's advice and guidance.

Collect, Combine, and Transform Data Using Power Query in Excel and Power BI, First Edition

Using Power Query, you can import, reshape, and cleanse any data from a simple interface, so you can mine that data for all of its hidden insights. Power Query is embedded in Excel, Power BI, and other Microsoft products, and leading Power Query expert Gil Raviv will help you make the most of it. Discover how to eliminate time-consuming manual data preparation, solve common problems, avoid pitfalls, and more. Then, walk through several complete analytics challenges, and integrate all your skills in a realistic chapter-length final project. By the time you're finished, you'll be ready to wrangle any data–and transform it into actionable knowledge. Prepare and analyze your data the easy way, with Power Query · Quickly prepare data for analysis with Power Query in Excel (also known as Get & Transform) and in Power BI · Solve common data preparation problems with a few mouse clicks and simple formula edits · Combine data from multiple sources, multiple queries, and mismatched tables · Master basic and advanced techniques for unpivoting tables · Customize transformations and build flexible data mashups with the M formula language · Address collaboration challenges with Power Query · Gain crucial insights into text feeds · Streamline complex social network analytics so you can do it yourself For all information workers, analysts, and any Excel user who wants to solve their own business intelligence problems.