talk-data.com

Topic: data-science-tools (326 tagged)


Activities

Filtering by: O'Reilly Data Science Books

Hands-On Data Analysis with NumPy and pandas

Dive into 'Hands-On Data Analysis with NumPy and pandas' to explore the world of Python for data analysis. This book guides you through using these powerful Python libraries to handle and manipulate data efficiently. You will learn hands-on techniques to read, sort, group, and visualize data for impactful analysis.

What this Book will help me do
- Learn to set up a Python environment for data analysis with tools like Jupyter notebooks.
- Master data handling using NumPy, focusing on array creation, slicing, and operations.
- Understand the functionalities of pandas for managing datasets, including DataFrame operations.
- Discover techniques for data preparation, such as handling missing data and hierarchical indexing.
- Explore data visualization using pandas and create impactful plots for data insights.

Author(s)
The book is authored by Miller, a seasoned Python developer and data analyst. With a strong background in leveraging Python for data processing, the author focuses on creating content that is practical and accessible. The author's teaching approach emphasizes hands-on practice and understanding, making technical topics approachable and engaging.

Who is it for?
This book is ideal for Python developers at a beginner to intermediate level looking to venture into data analysis. If you are transitioning from general programming to data-focused work or need to enhance your skills in data manipulation and processing, this book will be a strong foundation. It requires no prior experience with data analysis, so it is accessible to many learners.

Mastering Numerical Computing with NumPy

"Mastering Numerical Computing with NumPy" is a comprehensive guide to becoming proficient in numerical computing using Python's NumPy library. This book will teach you how to perform advanced numerical operations, explore data statistically, and build predictive models effectively. By mastering the provided concepts and exercises, you'll be empowered in your scientific computing projects.

What this Book will help me do
- Perform and optimize vector and matrix operations effectively using NumPy.
- Analyze data using exploratory data analysis techniques and predictive modeling.
- Implement unsupervised learning algorithms such as clustering with relevant datasets.
- Understand advanced benchmarks and select optimal configurations for performance.
- Write efficient and scalable programs utilizing advanced NumPy features.

Author(s)
The authors of "Mastering Numerical Computing with NumPy" include domain experts and educators with years of experience in Python programming, numerical computing, and data science. They bring a practical and detailed approach to teaching advanced topics and guide you through every step of mastering NumPy.

Who is it for?
This book is ideal for Python programmers, data analysts, and data science enthusiasts who aim to deepen their understanding of numerical computing. If you have basic mathematics skills and want to utilize NumPy to solve complex data problems, this book is an excellent resource. Whether you're a beginner or an intermediate user, you will find this content approachable and enriching. Advanced users will benefit from the highly specialized content and real-world examples.

Domain-Specific Languages in R: Advanced Statistical Programming

Gain an accelerated introduction to domain-specific languages in R, including coverage of regular expressions. This compact, in-depth book shows you how DSLs are programming languages specialized for a particular purpose, as opposed to general purpose programming languages. Along the way, you’ll learn to specify tasks you want to do in a precise way and achieve programming goals within a domain-specific context. Domain-Specific Languages in R includes examples of DSLs including large data sets or matrix multiplication; pattern matching DSLs for application in computer vision; and DSLs for continuous time Markov chains and their applications in data science. After reading and using this book, you’ll understand how to write DSLs in R and have skills you can extrapolate to other programming languages.

What You'll Learn
- Program with domain-specific languages using R
- Discover the components of DSLs
- Carry out large matrix expressions and multiplications
- Implement metaprogramming with DSLs
- Parse and manipulate expressions

Who This Book Is For
Those with prior programming experience. R knowledge is helpful but not required.

Python vs. R for Data Science

Python and R are two of the mainstream languages in data science. Fundamentally, Python is a language for programmers, whereas R is a language for statisticians. In a data science context, there is a significant degree of overlap when it comes to the capabilities of each language in the fields of regression analysis and machine learning. Your choice of language will depend highly on the environment in which you are operating. In a production environment, Python integrates with other languages much more seamlessly and is therefore the default choice in this context. However, R is much more common in research environments due to its more extensive selection of libraries for statistical analysis.

Foundations of Computational Finance with MATLAB

Graduate from Excel to MATLAB® to keep up with the evolution of finance data

Foundations of Computational Finance with MATLAB® is an introductory text for both finance professionals looking to branch out from the spreadsheet, and for programmers who wish to learn more about finance. As financial data grows in volume and complexity, its very nature has changed to the extent that traditional financial calculators and spreadsheet programs are simply no longer enough. Today’s analysts need more powerful data solutions with more customization and visualization capabilities, and MATLAB provides all of this and more in an easy-to-learn skillset. This book walks you through the basics, and then shows you how to stretch your new skills to create customized solutions. Part I demonstrates MATLAB’s capabilities as they apply to traditional finance concepts, and Part II shows you how to create interactive and reusable code, link with external data sources, communicate graphically, and more.

- Master MATLAB’s basic operations including matrices, arrays, and flexible data structures
- Learn how to build your own customized solutions when the built-ins just won’t do
- Learn how to handle financial data and industry-specific variables including risk and uncertainty
- Adopt more accurate modeling practices for portfolios, options, time series, and more

MATLAB is an integrated development environment that includes everything you need in one well-designed user interface. Available Toolboxes provide tested algorithms that save you hours of code, and the skills you learn using MATLAB make it easier to learn additional languages if you choose to do so. Financial firms are catching up to universities in MATLAB usage, so this is a skill set that will follow you throughout your career. When you’re ready to step into the new age of finance, Foundations of Computational Finance with MATLAB provides the expert instruction you need to get started quickly.

Data Analysis with R, Second Edition

"Data Analysis with R, Second Edition" is your ultimate guide to mastering data analysis in R, encompassing everything from foundational concepts to advanced techniques. You will learn to manipulate, analyze, and visualize data effectively, applying cutting-edge R packages like ggplot2 and dplyr. Through rich examples, this comprehensive book thoroughly prepares you to tackle real-world analytical challenges.

What this Book will help me do
- Understand foundational statistical reasoning and sampling methods.
- Perform hypothesis testing and apply Bayesian methods to data analysis.
- Build and evaluate regression, classification, and time series models.
- Handle messy and missing data using advanced R techniques and methods.
- Optimize performance through parallel processing, Rcpp, and efficient data manipulation.

Author(s)
Chris Burnett is an experienced data analyst with over 15 years of expertise harnessing R for insights. A passionate advocate for accessible computing, Chris integrates practical exercises and rich examples to demystify complex analytical techniques. Their experience and dedication shine in this approachable yet detailed guide.

Who is it for?
This book is ideal for budding and professional data analysts or data scientists who wish to deepen their expertise in R. It suits learners with a basic understanding of R who aim to extend their proficiency in applied data analysis. The guide provides significant value for professionals aiming to implement effective analytical models. Readers seeking to grow within the analytics sector will find this resource indispensable.

HBR Guide to Data Analytics Basics for Managers (HBR Guide Series)

Don't let a fear of numbers hold you back. Today's business environment brings with it an onslaught of data. Now more than ever, managers must know how to tease insight from data--to understand where the numbers come from, make sense of them, and use them to inform tough decisions. How do you get started? Whether you're working with data experts or running your own tests, you'll find answers in the HBR Guide to Data Analytics Basics for Managers. This book describes three key steps in the data analysis process, so you can get the information you need, study the data, and communicate your findings to others.

You'll learn how to:
- Identify the metrics you need to measure
- Run experiments and A/B tests
- Ask the right questions of your data experts
- Understand statistical terms and concepts
- Create effective charts and visualizations
- Avoid common mistakes

Business Case Analysis with R: Simulation Tutorials to Support Complex Business Decisions

This tutorial teaches you how to use the statistical programming language R to develop a business case simulation and analysis. It presents a methodology for conducting business case analysis that minimizes decision delay by focusing stakeholders on what matters most and suggests pathways for minimizing the risk in strategic and capital allocation decisions. Business case analysis, often conducted in spreadsheets, exposes decision makers to additional risks that arise just from the use of the spreadsheet environment. R has become one of the most widely used tools for reproducible quantitative analysis, and analysts fluent in this language are in high demand. The R language, traditionally used for statistical analysis, provides a more explicit, flexible, and extensible environment than spreadsheets for conducting business case analysis. The main tutorial follows the case in which a chemical manufacturing company considers constructing a chemical reactor and production facility to bring a new compound to market. There are numerous uncertainties and risks involved, including the possibility that a competitor brings a similar product online. The company must determine the value of making the decision to move forward and where they might prioritize their attention to make a more informed and robust decision. While the example used is a chemical company, the analysis structure it presents can be applied to just about any business decision, from IT projects to new product development to commercial real estate. The supporting tutorials include the perspective of the founder of a professional service firm who wants to grow his business and a member of a strategic planning group in a biomedical device company who wants to know how much to budget in order to refine the quality of information about critical uncertainties that might affect the value of a chosen product development pathway. 
What You’ll Learn
- Set up a business case abstraction in an influence diagram to communicate the essence of the problem to other stakeholders
- Model the inherent uncertainties in the problem with Monte Carlo simulation using the R language
- Communicate the results graphically
- Draw appropriate insights from the results
- Develop creative decision strategies for thorough opportunity cost analysis
- Calculate the value of information on critical uncertainties between competing decision strategies to set the budget for deeper data analysis
- Construct appropriate information to satisfy the parameters for the Monte Carlo simulation when little or no empirical data are available

Who This Book Is For
Financial analysts, data practitioners, and risk/business professionals; also appropriate for graduate level finance, business, or data science students
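
As a rough illustration of the Monte Carlo approach described above, the sketch below simulates the net present value (NPV) of a capital project whose cost and cash flow are uncertain and which may face a competitor. It is written in Python rather than the book's R, and every number in it (cost range, cash-flow range, competitor probability, discount rate) is a made-up assumption, not a figure from the book.

```python
import random
import statistics


def simulate_npv(n_trials=10_000, years=10, rate=0.10, seed=42):
    """Return a list of simulated NPVs. All input ranges are hypothetical."""
    rng = random.Random(seed)
    # Present value of receiving 1 unit of cash per year for `years` years.
    annuity = sum(1.0 / (1.0 + rate) ** t for t in range(1, years + 1))
    npvs = []
    for _ in range(n_trials):
        # random.triangular takes (low, high, mode).
        capex = rng.triangular(80.0, 150.0, 100.0)       # up-front cost
        annual_cash = rng.triangular(10.0, 40.0, 25.0)   # yearly net cash flow
        # Assumed 30% chance that a competitor enters and cuts cash flow by 40%.
        if rng.random() < 0.3:
            annual_cash *= 0.6
        npvs.append(annual_cash * annuity - capex)
    return npvs


npvs = simulate_npv()
mean_npv = statistics.mean(npvs)
prob_loss = sum(v < 0 for v in npvs) / len(npvs)
print(f"mean NPV: {mean_npv:.1f}, P(NPV < 0): {prob_loss:.2%}")
```

The triangular distribution is used here only because it needs three easily elicited numbers (low, high, most likely), which suits cases where little empirical data is available.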

SQL Server 2017 Machine Learning Services with R

Learn how to leverage SQL Server 2017 Machine Learning Services and the R programming language to create robust, efficient data analysis and machine learning solutions. This book provides actionable insights and practical examples to help you implement and manage database-oriented analytics and predictive modeling.

What this Book will help me do
- Understand and use SQL Server 2017 Machine Learning Services integrated with R.
- Gain experience in installing, configuring, and maintaining R services in SQL Server.
- Create and operationalize predictive models using RevoScaleR and other R packages.
- Improve database solutions by incorporating advanced analytics techniques.
- Monitor and manage R-based services effectively for reliable production solutions.

Author(s)
Tomaž Kaštrun and Koesmarno bring a wealth of expertise as practitioners and educators in data science and SQL Server technologies. They share their experience in a way that makes intricate subjects approachable. Their unified teaching method ensures readers can directly benefit from practical examples and real-world applications.

Who is it for?
This book is tailored for database administrators, data analysts, and data scientists eager to integrate R with SQL Server. It caters to professionals with varying levels of R experience who are looking to enhance their proficiency in database-oriented analytics. Readers will benefit most if they are motivated to design effective, data-driven solutions in SQL Server environments.

R Projects For Dummies

Make the most of R’s extensive toolset

R Projects For Dummies offers a unique learn-by-doing approach. You will increase the depth and breadth of your R skillset by completing a wide variety of projects. By using R’s graphics, interactive, and machine learning tools, you’ll learn to apply R’s extensive capabilities in an array of scenarios. The depth of the project experience is unmatched by any other content online or in print. And you just might increase your statistics knowledge along the way, too! R is a free tool, and it’s the basis of a huge amount of work in data science. It's taking the place of costly statistical software that sometimes takes a long time to learn. One reason is that you can use just a few R commands to create sophisticated analyses. Another is that easy-to-learn R graphics enable you to make the results of those analyses available to a wide audience. This book will help you sharpen your skills by applying them in the context of projects with R, including dashboards, image processing, data reduction, mapping, and more.

- Appropriate for R users at all levels
- Helps R programmers plan and complete their own projects
- Focuses on R functions and packages
- Shows how to carry out complex analyses by just entering a few commands

If you’re brand new to R or just want to brush up on your skills, R Projects For Dummies will help you complete your projects with ease.

Simulation of Dynamic Systems with MATLAB® and Simulink®, 3rd Edition

The book presents an integrated treatment of continuous simulation with all the background and essential prerequisites in one setting. It features updated chapters and two new sections on Black Swan and the Stochastic Information Packet (SIP) and Stochastic Library Units with Relationships Preserved (SLURP) Standard.

Interval Finite Element Method with MATLAB

Interval Finite Element Method with MATLAB provides a thorough introduction to an effective way of investigating problems involving uncertainty using computational modeling. The well-known and versatile Finite Element Method (FEM) is combined with the concept of interval uncertainties to develop the Interval Finite Element Method (IFEM). An interval or stochastic environment in parameters and variables is used in place of crisp ones to make the governing equations interval, thereby allowing modeling of the problem. The concept of interval uncertainties is systematically explained. Several examples are explored with IFEM using MATLAB on topics like spring mass, bar, truss and frame.

- Provides a systematic approach to understanding the interval uncertainties caused by vague or imprecise data
- Describes the interval finite element method in detail
- Gives step-by-step instructions for how to use MATLAB code for IFEM
- Provides a range of examples of IFEM in use, with accompanying MATLAB codes

Analyzing Baseball Data with R

With its flexible capabilities and open-source platform, R has become a major tool for analyzing detailed, high-quality baseball data. Analyzing Baseball Data with R provides an introduction to R for sabermetricians, baseball enthusiasts, and students interested in exploring the rich sources of baseball data. It equips readers with the necessary skills and software tools to perform all of the analysis steps, from gathering the datasets and entering them in a convenient format to visualizing the data via graphs to performing a statistical analysis. The authors first present an overview of publicly available baseball datasets and a gentle introduction to the type of data structures and exploratory and data management capabilities of R. They also cover the traditional graphics functions in the base package and introduce more sophisticated graphical displays available through the lattice and ggplot2 packages. Much of the book illustrates the use of R through popular sabermetrics topics, including the Pythagorean formula, runs expectancy, career trajectories, simulation of games and seasons, patterns of streaky behavior of players, and fielding measures. Each chapter contains exercises that encourage readers to perform their own analyses using R. All of the datasets and R code used in the text are available online. This book helps readers answer questions about baseball teams, players, and strategy using large, publicly available datasets. It offers detailed instructions on downloading the datasets and putting them into formats that simplify data exploration and analysis. Through the book’s various examples, readers will learn about modern sabermetrics and be able to conduct their own baseball analyses.
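
For readers unfamiliar with the Pythagorean formula mentioned above, it estimates a team's winning percentage from runs scored (RS) and runs allowed (RA) as RS^k / (RS^k + RA^k), with k = 2 in the classic form. The sketch below is in Python rather than the book's R; the function name and the season totals are hypothetical and are not drawn from the book's datasets.

```python
def pythagorean_win_pct(runs_scored: float, runs_allowed: float,
                        exponent: float = 2.0) -> float:
    """Expected winning percentage: RS^k / (RS^k + RA^k)."""
    if runs_scored < 0 or runs_allowed < 0:
        raise ValueError("run totals must be non-negative")
    rs = runs_scored ** exponent
    ra = runs_allowed ** exponent
    if rs + ra == 0:
        raise ValueError("at least one run total must be positive")
    return rs / (rs + ra)


# Hypothetical season: 800 runs scored, 700 allowed.
expected = pythagorean_win_pct(800, 700)
print(f"expected winning percentage: {expected:.3f}")  # 0.566
```

The exponent is left as a parameter because analysts often tune it to fit observed results better than the classic value of 2.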

R Programming By Example

"R Programming By Example" serves as an engaging and practical introduction to the R programming language for data analysis and visualization. Through step-by-step examples and comprehensive guides, this book builds your understanding from foundational knowledge to advanced applications in R. You will master programming practices while analyzing real-world scenarios.

What this Book will help me do
- Gain proficiency in leveraging R's versatile features and package ecosystem to tackle data analysis tasks.
- Learn to create and customize high-quality visualizations, including 3D graphs, for enhanced data presentation.
- Understand statistical modeling and descriptive analysis techniques for extracting insights from data.
- Discover efficient programming strategies in R, including code profiling and parallelization, to optimize performance.
- Acquire the skills to interface R with databases and RESTful APIs for robust data integration.

Author(s)
The author, Omar Trejo Navarro, brings a wealth of experience in statistical programming and data analysis. Having worked extensively with R, the author focuses on practical and results-driven teaching, with a passion for making complex topics accessible to learners.

Who is it for?
This book is aimed at aspiring data scientists, statisticians, or analysts looking to learn R. It is particularly suitable for readers familiar with basic programming concepts and who wish to apply R in practical scenarios. Whether you're analyzing data, building models, or creating visualizations, this book will guide you effectively. If you're eager to advance your R skills through hands-on projects, this is for you.

SciPy Recipes

Dive into the world of scientific computing with 'SciPy Recipes', a practical guide tailored for anyone seeking hands-on experience with the SciPy stack. With over 110 detailed recipes, you'll gain expertise in handling real-world data challenges, from statistical computations to crafting intricate visualizations and beyond.

What this Book will help me do
- Learn to use the SciPy Stack libraries like NumPy, pandas, and matplotlib effectively for scientific computing tasks.
- Master data wrangling techniques using pandas for efficient data manipulation.
- Understand the process of creating informative visualizations using matplotlib.
- Perform advanced statistical and numerical computations with simplicity.
- Solve real-world problems like numerical analysis and linear algebra using SciPy components.

Author(s)
Martins, Ruben Oliva Ramos, and V Kishore Ayyadevara bring years of experience in scientific computing and Python programming to this book. Individually, they have contributed extensively to the implementation of computational tools and systems. Together, they've crafted this book to be both accessible to learners and insightful for practitioners, blending instruction with real-world practical applications.

Who is it for?
This book is designed for Python developers, data scientists, and analysts eager to venture into scientific computing. If you have a basic understanding of Python and aspire to effectively manipulate and visualize data using the SciPy stack, this book is perfect for you. It's equally beneficial for those who seek practical solutions to complex computational challenges. Begin your journey into scientific computing with this essential guide.

Adaptive Filtering

This book covers the fundamentals of adaptive filtering, with a focus on the least mean square (LMS) adaptive filter. It discusses random variables, stochastic processes, vectors, matrices, determinants, discrete random signals, and probability distributions, while delivering a concise introduction to MATLAB®, complete with problems, computer experiments, and over 110 functions and script files. The text not only addresses the basics of the LMS adaptive filter algorithm but also explores the Wiener filter and its applications, details the steepest descent method, and develops Newton’s algorithm.
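
To give a feel for the LMS algorithm named above, here is a minimal sketch in plain Python (not the book's MATLAB code). It identifies an unknown FIR system by nudging the filter weights along the instantaneous error at each sample, using the update w ← w + μ·e·x. Texts differ on whether a factor of 2 is folded into the step size; the function name, tap count, step size, and "unknown" system below are all illustrative assumptions.

```python
import random


def lms_filter(x, d, num_taps=4, mu=0.05):
    """Adapt FIR weights so the filter output tracks the desired signal d."""
    w = [0.0] * num_taps
    errors = []
    for n in range(len(x)):
        # Most recent num_taps inputs, newest first, zero-padded at the start.
        window = [x[n - k] if n - k >= 0 else 0.0 for k in range(num_taps)]
        y = sum(wk * xk for wk, xk in zip(w, window))   # filter output
        e = d[n] - y                                    # instantaneous error
        # LMS update: step along the error-weighted input vector.
        w = [wk + mu * e * xk for wk, xk in zip(w, window)]
        errors.append(e)
    return w, errors


# Hypothetical system-identification experiment with a noise-free target.
rng = random.Random(0)
unknown = [0.5, -0.3, 0.2, 0.1]
x = [rng.gauss(0.0, 1.0) for _ in range(5000)]
d = [sum(h * (x[n - k] if n - k >= 0 else 0.0) for k, h in enumerate(unknown))
     for n in range(len(x))]

w, errors = lms_filter(x, d)
print([round(wk, 3) for wk in w])
```

With unit-variance white input and a small step size, the learned weights should settle close to the coefficients of the unknown system; a step size that is too large makes the update unstable.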

Electronically Scanned Arrays MATLAB® Modeling and Simulation

Electronically scanned arrays (ESAs) have become a key technology for sensor electronic systems. MATLAB® provides an excellent framework for ESA design and analysis, and this book is an invaluable resource for those who require simulation analysis tools that provide insight and understanding for ESA design. In addition to covering ESA fundamentals such as pattern synthesis, grating lobes, and instantaneous bandwidth, the text also provides insight into pattern optimization, subarray beamforming, space-based application of ESAs, and ESA reliability modeling. The book provides MATLAB code, giving readers an opportunity to model ESAs and develop an in-depth understanding that other books do not offer.

Pandas for Everyone: Python Data Analysis, First Edition

The Hands-On, Example-Rich Introduction to Pandas Data Analysis in Python

Today, analysts must manage data characterized by extraordinary variety, velocity, and volume. Using the open source Pandas library, you can use Python to rapidly automate and perform virtually any data analysis task, no matter how large or complex. Pandas can help you ensure the veracity of your data, visualize it for effective decision-making, and reliably reproduce analyses across multiple datasets. Pandas for Everyone brings together practical knowledge and insight for solving real problems with Pandas, even if you’re new to Python data analysis. Daniel Y. Chen introduces key concepts through simple but practical examples, incrementally building on them to solve more difficult, real-world problems. Chen gives you a jumpstart on using Pandas with a realistic dataset and covers combining datasets, handling missing data, and structuring datasets for easier analysis and visualization. He demonstrates powerful data cleaning techniques, from basic string manipulation to applying functions simultaneously across dataframes. Once your data is ready, Chen guides you through fitting models for prediction, clustering, inference, and exploration. He provides tips on performance and scalability, and introduces you to the wider Python data analysis ecosystem.

- Work with DataFrames and Series, and import or export data
- Create plots with matplotlib, seaborn, and pandas
- Combine datasets and handle missing data
- Reshape, tidy, and clean datasets so they’re easier to work with
- Convert data types and manipulate text strings
- Apply functions to scale data manipulations
- Aggregate, transform, and filter large datasets with groupby
- Leverage Pandas’ advanced date and time capabilities
- Fit linear models using statsmodels and scikit-learn libraries
- Use generalized linear modeling to fit models with different response variables
- Compare multiple models to select the “best”
- Regularize to overcome overfitting and improve performance
- Use clustering in unsupervised machine learning

Register your product at informit.com/register for convenient access to downloads, updates, and/or corrections as they become available.

R Data Mining

Dive into the world of data mining with 'R Data Mining' and discover how to utilize R's vast tools for uncovering insights in data. This hands-on guide immerses you in real-world cases, teaching both foundational concepts and advanced techniques like regression models and text mining. You'll emerge with a sharp understanding of how to transform raw data into actionable information.

What this Book will help me do
- Gain proficiency in R packages such as dplyr and ggplot2 for data manipulation and visualization.
- Master the CRISP-DM methodology to systematically approach data mining projects.
- Develop skillsets in data cleaning and validation to ensure quality data analysis.
- Understand and implement multiple regression and classification techniques effectively.
- Learn to use ensemble learning methods and produce reporting with R Markdown.

Author(s)
Andrea Cirillo brings extensive expertise in data science and R programming as the author of 'R Data Mining.' Their practical approach, drawing from professional experiences in various industries, makes complex techniques accessible and engaging. Their passion for teaching translates into a meticulously crafted learning journey for aspiring data miners.

Who is it for?
This book is ideal for beginner to intermediate-level data analysts or aspiring data scientists eager to delve into the field of data mining using R. If you're familiar with the basics of programming in R and want to expand into practical applications of data mining methodologies, this is the resource for you. Gain hands-on experience by engaging with real-world datasets and scenarios.

Introduction to MATLAB for Engineers and Scientists: Solutions for Numerical Computation and Modeling

Familiarize yourself with MATLAB using this concise, practical tutorial that is focused on writing code to learn concepts. Starting from the basics, this book covers array-based computing, plotting and working with files, numerical computation formalism, and the primary concepts of approximations. Introduction to MATLAB is useful for industry engineers, researchers, and students who are looking for open-source solutions for numerical computation. In this book you will learn by doing, avoiding technical jargon, which makes the concepts easy to learn. First you’ll see how to run basic calculations, absorbing technical complexities incrementally as you progress toward advanced topics. Throughout, the language is kept simple to ensure that readers at all levels can grasp the concepts.

What You'll Learn
- Apply sample code to your engineering or science problems
- Work with MATLAB arrays, functions, and loops
- Use MATLAB’s plotting functions for data visualization
- Solve numerical computing and computational engineering problems with a MATLAB case study

Who This Book Is For
Engineers, scientists, researchers, and students who are new to MATLAB. Some prior programming experience would be helpful but not required.