talk-data.com talk-data.com

Event

O'Reilly Data Science Books

2013-08-09 – 2026-02-25 Oreilly Visit website ↗

Activities tracked

2093

Collection of O'Reilly books on Data Science.

Filtering by: data ×

Sessions & talks

Showing 601–625 of 2093 · Newest first

Search within this event →
Advanced Statistics with Applications in R

Advanced Statistics with Applications in R fills the gap between several excellent theoretical statistics textbooks and many applied statistics books where teaching reduces to using existing packages. This book looks at what is under the hood. Many statistics issues including the recent crisis with p-value are caused by misunderstanding of statistical concepts due to poor theoretical background of practitioners and applied statisticians. This book is the product of a forty-year experience in teaching of probability and statistics and their applications for solving real-life problems. There are more than 442 examples in the book: basically every probability or statistics concept is illustrated with an example accompanied with an R code. Many examples, such as Who said π? What team is better? The fall of the Roman empire, James Bond chase problem, Black Friday shopping, Free fall equation: Aristotle or Galilei, and many others are intriguing. These examples cover biostatistics, finance, physics and engineering, text and image analysis, epidemiology, spatial statistics, sociology, etc. Advanced Statistics with Applications in R teaches students to use theory for solving real-life problems through computations: there are about 500 R codes and 100 datasets. These data can be freely downloaded from the author's website dartmouth.edu/~eugened. This book is suitable as a text for senior undergraduate students with major in statistics or data science or graduate students. Many researchers who apply statistics on the regular basis find explanation of many fundamental concepts from the theoretical perspective illustrated by concrete real-world applications.

Managing Data Science

Discover how to successfully manage data science projects and build high-performing teams with 'Managing Data Science.' This book provides actionable insights on handling the entire data science workflow, from conception to production, and addresses common challenges with practical strategies. What this Book will help me do Understand the fundamentals of building scalable and efficient data science pipelines. Acquire techniques to manage every stage of data science projects effectively, from prototype to production. Learn proven strategies for assembling, cultivating, and sustaining a skilled data science team. Explore the latest tools, methodologies, and best practices in ModelOps and DevOps for data science. Gain insights into troubleshooting and optimizing data science workflows to achieve organizational goals. Author(s) None Dubovikov is a seasoned expert in data science and project management, bringing years of hands-on experience to both domains. With a passion for leveraging data to drive business success, None guides readers through building sustainable practices and effective teams in the growing field of data science. Who is it for? This book is perfect for data science professionals, project managers, and business leaders seeking practical guidance to reap the benefits of data-driven decision-making. Designed for readers with a foundational understanding of data science, it helps bridge the gap between technical expertise and managerial efficiency.

Business Analytics, Volume II

This business analytics (BA) text discusses the models based on fact-based data to measure past business performance to guide an organization in visualizing and predicting future business performance and outcomes. It provides a comprehensive overview of analytics in general with an emphasis on predictive analytics. Given the booming interest in analytics and data science, this book is timely and informative. It brings many terms, tools, and methods of analytics together. The first three chapters provide an introduction to BA, importance of analytics, types of BA-descriptive, predictive, and prescriptive-along with the tools and models. Business intelligence (BI) and a case on descriptive analytics are discussed. Additionally, the book discusses on the most widely used predictive models, including regression analysis, forecasting, data mining, and an introduction to recent applications of predictive analytics-machine learning, neural networks, and artificial intelligence. The concluding chapter discusses on the current state, job outlook, and certifications in analytics.

Data Mining for Business Analytics

Data Mining for Business Analytics: Concepts, Techniques, and Applications in Python presents an applied approach to data mining concepts and methods, using Python software for illustration Readers will learn how to implement a variety of popular data mining algorithms in Python (a free and open-source software) to tackle business problems and opportunities. This is the sixth version of this successful text, and the first using Python. It covers both statistical and machine learning algorithms for prediction, classification, visualization, dimension reduction, recommender systems, clustering, text mining and network analysis. It also includes: A new co-author, Peter Gedeck, who brings both experience teaching business analytics courses using Python, and expertise in the application of machine learning methods to the drug-discovery process A new section on ethical issues in data mining Updates and new material based on feedback from instructors teaching MBA, undergraduate, diploma and executive courses, and from their students More than a dozen case studies demonstrating applications for the data mining techniques described End-of-chapter exercises that help readers gauge and expand their comprehension and competency of the material presented A companion website with more than two dozen data sets, and instructor materials including exercise solutions, PowerPoint slides, and case solutions Data Mining for Business Analytics: Concepts, Techniques, and Applications in Python is an ideal textbook for graduate and upper-undergraduate level courses in data mining, predictive analytics, and business analytics. This new edition is also an excellent reference for analysts, researchers, and practitioners working with quantitative methods in the fields of business, finance, marketing, computer science, and information technology. “This book has by far the most comprehensive review of business analytics methods that I have ever seen, covering everything from classical approaches such as linear and logistic regression, through to modern methods like neural networks, bagging and boosting, and even much more business specific procedures such as social network analysis and text mining. If not the bible, it is at the least a definitive manual on the subject.” —Gareth M. James, University of Southern California and co-author (with Witten, Hastie and Tibshirani) of the best-selling book An Introduction to Statistical Learning, with Applications in R

Clustering Methodology for Symbolic Data

Covers everything readers need to know about clustering methodology for symbolic data—including new methods and headings—while providing a focus on multi-valued list data, interval data and histogram data This book presents all of the latest developments in the field of clustering methodology for symbolic data—paying special attention to the classification methodology for multi-valued list, interval-valued and histogram-valued data methodology, along with numerous worked examples. The book also offers an expansive discussion of data management techniques showing how to manage the large complex dataset into more manageable datasets ready for analyses. Filled with examples, tables, figures, and case studies, Clustering Methodology for Symbolic Data begins by offering chapters on data management, distance measures, general clustering techniques, partitioning, divisive clustering, and agglomerative and pyramid clustering. Provides new classification methodologies for histogram valued data reaching across many fields in data science Demonstrates how to manage a large complex dataset into manageable datasets ready for analysis Features very large contemporary datasets such as multi-valued list data, interval-valued data, and histogram-valued data Considers classification models by dynamical clustering Features a supporting website hosting relevant data sets Clustering Methodology for Symbolic Data will appeal to practitioners of symbolic data analysis, such as statisticians and economists within the public sectors. It will also be of interest to postgraduate students of, and researchers within, web mining, text mining and bioengineering.

Spatial Analysis Using Big Data

Spatial Analysis Using Big Data: Methods and Urban Applications helps readers understand the most powerful, state-of-the-art spatial econometric methods, focusing particularly on urban research problems. The methods represent a cluster of potentially transformational socio-economic modeling tools that allow researchers to capture real-time and high-resolution information to potentially reveal new socioeconomic dynamics within urban populations. Each method, written by leading exponents of the discipline, uses real-time urban big data to solve research problems in spatial science. Urban applications of these methods are provided in unsurpassed depth, with chapters on surface temperature mapping, view value analysis, community clustering and spatial-social networks, among many others. Reviews some of the most powerful and challenging modern methods to study big data problems in spatial science Provides computer codes written in R, MATLAB and Python to help implement methods Applies these methods to common problems observed in urban and regional economics

Pro D3.js: Use D3.js to Create Maintainable, Modular, and Testable Charts

Go beyond the basics of D3.js to create maintainable, modular, and testable charts and to package them into a library that can be distributed as open source software or kept for private use. This book will show you how to transform regular D3.js chart code into reusable and extendable modules.You know the basics of working with D3.js, but it's time to become a professional D3.js practitioner. This book is your launching pad to refactoring code, composing complex visualizations from small components, working as a team with other developers, and integrating charts with a Continuous Integration system. You'll begin by creating a production-ready chart using D3.js v5, ES2015, and a test-driven approach and then move on to using and extending Britecharts, the reusable charting library based on Reusable API patterns. Finally, you'll see how to use D3.js along with React to document and build your charts to compose a charting library you can release into the NPM repository. With Pro D3.js, you'll become an accomplished D3.js developer in no time. What You Will Learn Create v5 D3.js charts with ES2016 and unit tests Develop modular, testable and extensible code with the Reusable API pattern Work with and extend Britecharts, a reusable charting library created at Eventbrite Use Webpack and npm to create and publish a charting library from your own chart collections Write reference documentation and build a documentation homepage for your library. Who This Book Is For Data scientists, data visualization engineers, and frontend developers with a fundamental knowledge of D3.js and some experience with JavaScript, as well as data journalists and consultants.

Applications of Computational Intelligence in Data-Driven Trading

“Life on earth is filled with many mysteries, but perhaps the most challenging of these is the nature of Intelligence.” – Prof. Terrence J. Sejnowski, Computational Neurobiologist The main objective of this book is to create awareness about both the promises and the formidable challenges that the era of Data-Driven Decision-Making and Machine Learning are confronted with, and especially about how these new developments may influence the future of the financial industry. The subject of Financial Machine Learning has attracted a lot of interest recently, specifically because it represents one of the most challenging problem spaces for the applicability of Machine Learning. The author has used a novel approach to introduce the reader to this topic: The first half of the book is a readable and coherent introduction to two modern topics that are not generally considered together: the data-driven paradigm and Computational Intelligence. The second half of the book illustrates a set of Case Studies that are contemporarily relevant to quantitative trading practitioners who are dealing with problems such as trade execution optimization, price dynamics forecast, portfolio management, market making, derivatives valuation, risk, and compliance. The main purpose of this book is pedagogical in nature, and it is specifically aimed at defining an adequate level of engineering and scientific clarity when it comes to the usage of the term “ Artificial Intelligence,” especially as it relates to the financial industry. The message conveyed by this book is one of confidence in the possibilities offered by this new era of Data-Intensive Computation. This message is not grounded on the current hype surrounding the latest technologies, but on a deep analysis of their effectiveness and also on the author’s two decades of professional experience as a technologist, quant and academic.

Practical MATLAB: With Modeling, Simulation, and Processing Projects

Apply MATLAB programming to the mathematical modeling of real-life problems from a wide range of topics. This pragmatic book shows you how to solve your programming problems, starting with a brief primer on MATLAB and the fundamentals of the MATLAB programming language. Then, you’ll build fully working examples and computational models found in the financial, engineering, and scientific sectors. As part of this section, you’ll cover signal and image processing, as well as GUIs. After reading and using Practical MATLAB and its accompanying source code, you’ll have the practical know-how and code to apply to your own MATLAB programming projects. What You Will Learn Discover the fundamentals of MATLAB and how to get started with it for problem solving Apply MATLAB to a variety of problems and case studies Carry out economic and financial modeling with MATLAB, including option pricing and compound interest Use MATLAB for simulation problems such as coin flips, dice rolling, random walks, and traffic flows Solve computational biology problems with MATLAB Implement signal processing with MATLAB, including currents, Fast Fourier Transforms (FFTs), and harmonic analysis Process images with filters and edge detection Build applications with GUIs Who This Book Is For People with some prior experience with programming and MATLAB.

Mastering pandas - Second Edition

Mastering pandas is the ultimate guide to harnessing the power of the pandas library for data analysis. Covering everything from installation to advanced techniques, this book provides comprehensive instructions and examples to help you perform efficient data manipulation and visualization. Explore key features of pandas, such as multi-indexing and time series analysis, and become proficient in actionable analytics. What this Book will help me do Master importing and managing datasets of various formats using pandas. Expertly handle missing data and clean datasets for robust analysis. Create powerful visualizations and reports using pandas and Jupyter notebooks. Leverage advanced indexing and grouping techniques to derive insights. Utilize pandas for time series analysis to analyze trends and patterns. Author(s) None Kumar is an experienced data scientist specializing in data analysis and visualization using Python. With a deep understanding of the pandas library, None has been helping professionals and enthusiasts alike to make data-driven decisions. Known for an example-driven teaching style, None bridges complex theoretical concepts with practical applications in data science. Who is it for? If you're a data scientist, analyst, or Python developer seeking to enhance your data analysis capabilities, this book is for you. Prior knowledge of Python is beneficial but not mandatory, as foundational concepts are explained. This guide spans beginner to advanced topics, accommodating users looking to deepen their skills and those aiming to start with pandas.

What Is Augmented Analytics?

As your business tries to make sense of today’s staggering amount of structured and unstructured data, traditional analytics will take you only so far. The key to success over the next few years will depend on augmented analytics, a method that embeds machine learning and natural language processing (NLP) in the process. This report explains how augmented analytics can help you uncover hidden insights, predict results, and even prescribe solutions. Author Alice LaPlante provides best practices for deploying augmented analytics, along with real-world case studies that show you how to take full advantage of this method. IT professionals, business managers, and CFOs will learn ways to democratize data use among business users and executives, using a self-service model. The future belongs to those who can get more from their data. This report shows you how. Get a primer on the key components and learn how they work together Delve into the benefits of—and roadblocks to—adopting augmented analytics Learn how companies use this method in marketing, sales, finance, and human resources Examine case studies of companies including Accenture and Riverbed

Storytelling with Data

Influence action through data! This is not a book. It is a one-of-a-kind immersive learning experience through which you can become—or teach others to be—a powerful data storyteller. Let’s practice! helps you build confidence and credibility to create graphs and visualizations that make sense and weave them into action-inspiring stories. Expanding upon best seller storytelling with data’s foundational lessons, Let’s practice! delivers fresh content, a plethora of new examples, and over 100 hands-on exercises. Author and data storytelling maven Cole Nussbaumer Knaflic guides you along the path to hone core skills and become a well-practiced data communicator. Each chapter includes: ● Practice with Cole: exercises based on real-world examples first posed for you to consider and solve, followed by detailed step-by-step illustration and explanation ● Practice on your own: thought-provoking questions and even more exercises to be assigned or worked through individually, without prescribed solutions ● Practice at work: practical guidance and hands-on exercises for applying storytelling with data lessons on the job, including instruction on when and how to solicit useful feedback and refine for greater impact The lessons and exercises found within this comprehensive guide will empower you to master—or develop in others—data storytelling skills and transition your work from acceptable to exceptional. By investing in these skills for ourselves and our teams, we can all tell inspiring and influential data stories!

Business Statistics with Solutions in R

Business Statistics with Solutions in R covers a wide range of applications of statistics in solving business related problems. It will introduce readers to quantitative tools that are necessary for daily business needs and help them to make evidence-based decisions. The book provides an insight on how to summarize data, analyze it, and draw meaningful inferences that can be used to improve decisions. It will enable readers to develop computational skills and problem-solving competence using the open source language, R. Mustapha Abiodun Akinkunmi uses real life business data for illustrative examples while discussing the basic statistical measures, probability, regression analysis, significance testing, correlation, the Poisson distribution, process control for manufacturing, time series analysis, forecasting techniques, exponential smoothing, univariate and multivariate analysis including ANOVA and MANOVA and more in this valuable reference for policy makers, professionals, academics and individuals interested in the areas of business statistics, applied statistics, statistical computing, finance, management and econometrics.

Prescriptive Analytics: The Final Frontier for Evidence-Based Management and Optimal Decision Making

Make Better Decisions, Leverage New Opportunities, and Automate Decisioning at Scale Prescriptive analytics is more directly linked to successful decision-making than any other form of business analytics. It can help you systematically sort through your choices to optimize decisions, respond to new opportunities and risks with precision, and continually reflect new information into your decisioning process. In Prescriptive Analytics, analytics expert Dr. Dursun Delen illuminates the field’s state-of-the-art methods, offering holistic insight for both professionals and students. Delen’s end-to-end, all-inclusive approach covers optimization, simulation, multi-criteria decision-making methods, inference- and heuristic-based decisioning, and more. Balancing theory and practice, he presents intuitive conceptual illustrations, realistic example problems, and real-world case studies–all designed to deliver knowledge you can use. Discover where prescriptive analytics fits and how it improves decision-making Identify optimal solutions for achieving an objective within real-world constraints Analyze complex systems via Monte-Carlo, discrete, and continuous simulations Apply powerful multi-criteria decision-making and mature expert systems and case-based reasoning Preview emerging techniques based on deep learning and cognitive computing

Simulating Business Processes for Descriptive, Predictive, and Prescriptive Analytics

This book outlines the benefits and limitations of simulation, what is involved in setting up a simulation capability in an organization, the steps involved in developing a simulation model and how to ensure that model results are implemented. In addition, detailed example applications are provided to show where the tool is useful and what it can offer the decision maker. In Simulating Business Processes for Descriptive, Predictive, and Prescriptive Analytics, Andrew Greasley provides an in-depth discussion of Business process simulation and how it can enable business analytics How business process simulation can provide speed, cost, dependability, quality, and flexibility metrics Industrial case studies including improving service delivery while ensuring an efficient use of staff in public sector organizations such as the police service, testing the capacity of planned production facilities in manufacturing, and ensuring on-time delivery in logistics systems State-of-the-art developments in business process simulation regarding the generation of simulation analytics using process mining and modeling people’s behavior Managers and decision makers will learn how simulation provides a faster, cheaper and less risky way of observing the future performance of a real-world system. The book will also benefit personnel already involved in simulation development by providing a business perspective on managing the process of simulation, ensuring simulation results are implemented, and that performance is improved.

Practical Data Analysis with JMP, Third Edition, 3rd Edition

Master the concepts and techniques of statistical analysis using JMP Practical Data Analysis with JMP, Third Edition, highlights the powerful interactive and visual approach of JMP to introduce readers to statistical thinking and data analysis. It helps you choose the best technique for the problem at hand by using real-world cases. It also illustrates best-practice workflow throughout the entire investigative cycle, from asking valuable questions through data acquisition, preparation, analysis, interpretation, and communication of findings. The book can stand on its own as a learning resource for professionals, or it can be used to supplement a college-level textbook for an introductory statistics course. It includes varied examples and problems using real sets of data. Each chapter typically starts with an important or interesting research question that an investigator has pursued. Reflecting the broad applicability of statistical reasoning, the problems come from a wide variety of disciplines, including engineering, life sciences, business, and economics, as well as international and historical examples. Application Scenarios at the end of each chapter challenge you to use your knowledge and skills with data sets that go beyond mere repetition of chapter examples. New in the third edition, chapters have been updated to demonstrate the enhanced capabilities of JMP, including projects, Graph Builder, Query Builder, and Formula Depot.

SAS Certified Professional Prep Guide

The official guide by the SAS Global Certification Program, SAS Certified Professional Prep Guide: Advanced Programming Using SAS 9.4 prepares you to take the new SAS 9.4 Advanced Programming Performance-Based Exam. New in this edition is a workbook whose sample scenarios require you to write code to solve problems and answer questions. Answers to the chapter quizzes and solutions to the sample scenarios in the workbook are included. You will also find links to exam objectives, practice exams, and other resources such as the Base SAS Glossary and a list of practice data sets. Major topics include SQL processing, SAS macro language processing, and advanced SAS programming techniques. All exam topics are covered in the following chapters: SQL Processing with SAS PROC SQL Fundamentals Creating and Managing Tables Joining Tables Using PROC SQL Joining Tables Using Set Operators Using Subqueries Advanced SQL Techniques SAS Macro Language Processing Creating and Using Macro Variables Storing and Processing Text Working with Macro Programs Advanced Macro Techniques Advanced SAS Programming Techniques Defining and Processing Arrays Processing Data Using Hash Objects Using SAS Utility Procedures Using Advanced Functions Practice Programming Scenarios (Workbook)

Practical Time Series Analysis

Time series data analysis is increasingly important due to the massive production of such data through the internet of things, the digitalization of healthcare, and the rise of smart cities. As continuous monitoring and data collection become more common, the need for competent time series analysis with both statistical and machine learning techniques will increase. Covering innovations in time series data analysis and use cases from the real world, this practical guide will help you solve the most common data engineering and analysis challengesin time series, using both traditional statistical and modern machine learning techniques. Author Aileen Nielsen offers an accessible, well-rounded introduction to time series in both R and Python that will have data scientists, software engineers, and researchers up and running quickly. You’ll get the guidance you need to confidently: Find and wrangle time series data Undertake exploratory time series data analysis Store temporal data Simulate time series data Generate and select features for a time series Measure error Forecast and classify time series with machine or deep learning Evaluate accuracy and performance

R Bioinformatics Cookbook

In the "R Bioinformatics Cookbook", you will explore the full potential of the R programming language and the Bioconductor ecosystem to overcome challenges in bioinformatics. By working through real-world examples, you will learn to handle biological data effectively and gain insights into genomics, RNA sequencing, and advanced data visualization. What this Book will help me do Develop skills to analyze RNA sequencing data using R and Bioconductor packages such as edgeR and DESeq. Learn to create professional-grade graphical representations of biological data using ggplot and other visualization tools. Understand how to perform genome-wide studies like variant calling and metagenomics analysis with R. Master the integration of external genomic databases with Ensembl for functional annotation. Explore machine learning applications in bioinformatics including classification and clustering models. Author(s) None MacLean and Dr. Dan Maclean are experienced bioinformatics researchers and R programmers. With a deep understanding of computational biology and visualization techniques, they bring years of academic and practical expertise to help readers excel in bioinformatics. Their approachable writing style ensures that complex topics are made accessible. Who is it for? This book is ideal for bioinformatics professionals and data analysts with an interest in applying R to biological data. It is particularly suited for those with a basic knowledge of R and bioinformatics looking to enhance their analysis skills. Researchers seeking to integrate genomics and computational methods into their workflows will find this book valuable. It's perfect for anyone aiming to tackle intermediate to advanced topics in biological data analysis.

The Little SAS Book, 6th Edition

A classic that just keeps getting better, The Little SAS Book is essential for anyone learning SAS programming. Lora Delwiche and Susan Slaughter offer a user-friendly approach so that readers can quickly and easily learn the most commonly used features of the SAS language. Each topic is presented in a self-contained, two-page layout complete with examples and graphics. Nearly every section has been revised to ensure that the sixth edition is fully up-to-date. This edition is also interface-independent, written for all SAS programmers whether they use SAS Studio, SAS Enterprise Guide, or the SAS windowing environment. New sections have been added covering PROC SQL, iterative DO loops, DO WHILE and DO UNTIL statements, %DO statements, using variable names with special characters, the ODS EXCEL destination, and the XLSX LIBNAME engine. This title belongs on every SAS programmer's bookshelf. It's a resource not just to get you started, but one you will return to as you continue to improve your programming skills. Learn more about the updates to The Little SAS Book, Sixth Edition here. Reviews for The Little SAS Book, Sixth Edition can be read here.

Applied Statistics

Instructs readers on how to use methods of statistics and experimental design with R software Applied statistics covers both the theory and the application of modern statistical and mathematical modelling techniques to applied problems in industry, public services, commerce, and research. It proceeds from a strong theoretical background, but it is practically oriented to develop one's ability to tackle new and non-standard problems confidently. Taking a practical approach to applied statistics, this user-friendly guide teaches readers how to use methods of statistics and experimental design without going deep into the theory. Applied Statistics: Theory and Problem Solutions with R includes chapters that cover R package sampling procedures, analysis of variance, point estimation, and more. It follows on the heels of Rasch and Schott's Mathematical Statistics via that book's theoretical background—taking the lessons learned from there to another level with this book’s addition of instructions on how to employ the methods using R. But there are two important chapters not mentioned in the theoretical back ground as Generalised Linear Models and Spatial Statistics. Offers a practical over theoretical approach to the subject of applied statistics Provides a pre-experimental as well as post-experimental approach to applied statistics Features classroom tested material Applicable to a wide range of people working in experimental design and all empirical sciences Includes 300 different procedures with R and examples with R-programs for the analysis and for determining minimal experimental sizes Applied Statistics: Theory and Problem Solutions with R will appeal to experimenters, statisticians, mathematicians, and all scientists using statistical procedures in the natural sciences, medicine, and psychology amongst others.

Introduction to Biostatistics with JMP

Explore biostatistics using JMP in this refreshing introduction Presented in an easy-to-understand way, Introduction to Biostatistics with JMP introduces undergraduate students in the biological sciences to the most commonly used (and misused) statistical methods that they will need to analyze their experimental data using JMP. It covers many of the basic topics in statistics using biological examples for exercises so that the student biologists can see the relevance to future work in the problems addressed. The book starts by teaching students how to become confident in executing the right analysis by thinking like a statistician then moves into the application of specific tests. Using the powerful capabilities of JMP, the book addresses problems requiring analysis by chi-square tests, t tests, ANOVA analysis, various regression models, DOE, and survival analysis. Topics of particular interest to the biological or health science field include odds ratios, relative risk,

A Gentle Introduction to Statistics Using SASⓇ Studio

Point and click your way to performing statistics! Many people are intimidated by learning statistics, but A Gentle Introduction to Statistics Using SAS Studio is here to help. Whether you need to perform statistical analysis for a project or, perhaps, for a course in education, psychology, sociology, economics, or any other field that requires basic statistical skills, this book teaches the fundamentals of statistics, from designing your experiment through calculating logistic regressions. Serving as an introduction to many common statistical tests and principles, it explains concepts in a non-technical way with little math and very few formulas. Once the basic statistical concepts are covered, the book then demonstrates how to use them with SAS Studio and SAS University Edition’s easy point-and-click interface. Topics included in this book are: How to install and use SAS University Edition Descriptive statistics One-sample tests T tests (for independent or paired samples) One-way analysis of variance (ANOVA) N-way ANOVA Correlation analysis Simple and multiple linear regression Binary logistic regression Categorical data, including two-way tables and chi-square Power and sample size calculations Questions are provided to test your knowledge and practice your skills.

Hands-On SAS for Data Analysis

"Hands-On SAS for Data Analysis" is a practical guide that introduces you to the fundamentals of using SAS for managing and analyzing data effectively. Through a hands-on approach, you'll explore key topics such as data manipulation with SAS 4GL, SQL querying, and creating insightful visualizations and reports. By the end of the book, you'll not only have a robust understanding of SAS but also be prepared for the SAS certification exam. What this Book will help me do Effectively use SAS modules and tools for comprehensive data analysis tasks. Master SAS 4GL functions to perform advanced data manipulation and transformation. Leverage advanced SQL options within SAS to query and analyze datasets. Become proficient in writing SAS Macros to automate repetitive tasks efficiently. Produce professional reports and visualizations using SAS Output Delivery System. Author(s) None Gulati is a renowned expert in data analysis and business intelligence, with years of professional experience in leveraging SAS for enterprise solutions. An experienced trainer and technical author, None has a unique ability to simplify complex concepts. Through this book, None shares practical knowledge that aligns with industry needs and certification goals. Who is it for? This book is designed for data professionals seeking to enhance their skills in SAS programming and data analysis. Whether you're just starting out with SAS or aiming to pass the SAS certification exam, this book will provide valuable insights. Readers with basic knowledge of data management will find this guide especially beneficial.

SAS for R Users

BRIDGES THE GAP BETWEEN SAS AND R, ALLOWING USERS TRAINED IN ONE LANGUAGE TO EASILY LEARN THE OTHER SAS and R are widely-used, very different software environments. Prized for its statistical and graphical tools, R is an open-source programming language that is popular with statisticians and data miners who develop statistical software and analyze data. SAS (Statistical Analysis System) is the leading corporate software in analytics thanks to its faster data handling and smaller learning curve. SAS for R Users enables entry-level data scientists to take advantage of the best aspects of both tools by providing a cross-functional framework for users who already know R but may need to work with SAS. Those with knowledge of both R and SAS are of far greater value to employers, particularly in corporate settings. Using a clear, step-by-step approach, this book presents an analytics workflow that mirrors that of the everyday data scientist. This up-to-date guide is compatible with the latest R packages as well as SAS University Edition. Useful for anyone seeking employment in data science, this book: Instructs both practitioners and students fluent in one language seeking to learn the other Provides command-by-command translations of R to SAS and SAS to R Offers examples and applications in both R and SAS Presents step-by-step guidance on workflows, color illustrations, sample code, chapter quizzes, and more Includes sections on advanced methods and applications Designed for professionals, researchers, and students, SAS for R Users is a valuable resource for those with some knowledge of coding and basic statistics who wish to enter the realm of data science and business analytics. AJAY OHRI is the founder of analytics startup Decisionstats.com. His research interests include spreading open source analytics, analyzing social media manipulation with mechanism design, simpler interfaces to cloud computing, investigating climate change, and knowledge flows. He currently advises startups in analytics off shoring, analytics services, and analytics. He is the author of Python for R Users: A Data Science Approach (Wiley), R for Business Analytics, and R for Cloud Computing.