talk-data.com

Topic: data-science-tasks (849 tagged)

Activity Trend: 2020-Q1 to 2026-Q1 (1 peak/qtr)

Activities

849 activities · Newest first

Time Series Analysis with Spark

Time Series Analysis with Spark provides a practical introduction to leveraging Apache Spark and Databricks for time series analysis. You'll learn to prepare, model, and deploy robust and scalable time series solutions for real-world applications. From data preparation to advanced generative AI techniques, this guide prepares you to excel in big data analytics.
What this Book will help me do
● Understand the core concepts and architectures of Apache Spark for time series analysis.
● Learn to clean, organize, and prepare time series data for big data environments.
● Gain expertise in choosing, building, and training various time series models tailored to specific projects.
● Master techniques to scale your models in production using Spark and Databricks.
● Explore the integration of advanced technologies such as generative AI to enhance predictions and derive insights.
Author(s)
Yoni Ramaswami, a Senior Solutions Architect at Databricks, has extensive experience in data engineering and AI solutions. With a focus on creating innovative big data and AI strategies across industries, Yoni authored this book to empower professionals to efficiently handle time series data. Yoni's approachable style ensures that both foundational concepts and advanced techniques are accessible to readers.
Who is it for?
This book is ideal for data engineers, machine learning engineers, data scientists, and analysts interested in enhancing their expertise in time series analysis using Apache Spark and Databricks. Whether you're new to time series or looking to refine your skills, you'll find both foundational insights and advanced practices explained clearly. A basic understanding of Spark is helpful but not required.

Time Series Forecasting Using Generative AI: Leveraging AI for Precision Forecasting

"Time Series Forecasting Using Generative AI introduces readers to Generative Artificial Intelligence (Gen AI) in time series analysis, offering an essential exploration of cutting-edge forecasting methodologies." The book covers a wide range of topics, starting with an overview of Generative AI, where readers gain insights into the history and fundamentals of Gen AI with a brief introduction to large language models. The subsequent chapter explains practical applications, guiding readers through the implementation of diverse neural network architectures for time series analysis such as Multi-Layer Perceptrons (MLP), WaveNet, Temporal Convolutional Network (TCN), Bidirectional Temporal Convolutional Network (BiTCN), Recurrent Neural Networks (RNN), Long Short-Term Memory (LSTM), Deep AutoRegressive (DeepAR), and Neural Basis Expansion Analysis (NBEATS) using modern tools. Building on this foundation, the book introduces the power of Transformer architecture, exploring its variants such as Vanilla Transformers, Inverted Transformer (iTransformer), DLinear, NLinear, and Patch Time Series Transformer (PatchTST). Finally, the book delves into foundation models such as Time-LLM, Chronos, TimeGPT, Moirai, and TimesFM, enabling readers to implement sophisticated forecasting models tailored to their specific needs. This book empowers readers with the knowledge and skills needed to leverage Gen AI for accurate and efficient time series forecasting. By providing a detailed exploration of advanced forecasting models and methodologies, this book enables practitioners to make informed decisions and drive business growth through data-driven insights.
● Understand the core history and applications of Gen AI and its potential to revolutionize time series forecasting.
● Learn to implement different neural network architectures such as MLP, WaveNet, TCN, BiTCN, RNN, LSTM, DeepAR, and NBEATS for time series forecasting.
● Discover the potential of Transformer architecture and its variants, such as Vanilla Transformers, iTransformer, DLinear, NLinear, and PatchTST, for time series forecasting.
● Explore complex foundation models like Time-LLM, Chronos, TimeGPT, Moirai, and TimesFM.
● Gain practical knowledge on how to apply Gen AI techniques to real-world time series forecasting challenges and make data-driven decisions.
Who this book is for: Data Scientists, Machine Learning Engineers, Business Analysts, Statisticians, Economists, Financial Analysts, Operations Research Analysts, Data Analysts, Students.

Statistical Quantitative Methods in Finance: From Theory to Quantitative Portfolio Management

Statistical quantitative methods are vital for financial valuation models and benchmarking machine learning models in finance. This book explores the theoretical foundations of statistical models, from ordinary least squares (OLS) to the generalized method of moments (GMM) used in econometrics. It enriches your understanding through practical examples drawn from applied finance, demonstrating the real-world applications of these concepts. Additionally, the book delves into non-linear methods and Bayesian approaches, which are becoming increasingly popular among practitioners thanks to advancements in computational resources. By mastering these topics, you will be equipped to build foundational models crucial for applied data science, a skill highly sought after by software engineering and asset management firms. The book also offers valuable insights into quantitative portfolio management, showcasing how traditional data science tools can be enhanced with machine learning models. These enhancements are illustrated through real-world examples from finance and econometrics, accompanied by Python code. This practical approach ensures that you can apply what you learn, gaining proficiency in the statsmodels library and becoming adept at designing, implementing, and calibrating your models. By understanding and applying these statistical models, you enhance your data science skills and effectively tackle financial challenges. 
What You Will Learn
● Understand the fundamentals of linear regression and its applications in financial data analysis and prediction
● Apply generalized linear models for handling various types of data distributions and enhancing model flexibility
● Gain insights into regime switching models to capture different market conditions and improve financial forecasting
● Benchmark machine learning models against traditional statistical methods to ensure robustness and reliability in financial applications
Who This Book Is For
Data scientists, machine learning engineers, finance professionals, and software engineers
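As a rough illustration of the ordinary least squares starting point mentioned in the description, the sketch below fits a one-variable regression in plain Python. It is not taken from the book, which works with the statsmodels library; the helper name `ols_fit` and the return figures are made up for this example.

```python
def ols_fit(x, y):
    """Fit y = intercept + slope * x by ordinary least squares.

    Hypothetical helper for illustration; returns (intercept, slope).
    """
    n = len(x)
    if n != len(y) or n < 2:
        raise ValueError("x and y must have the same length, at least 2")
    x_mean = sum(x) / n
    y_mean = sum(y) / n
    # Sum of squared deviations of x, and cross-deviations of x and y.
    sxx = sum((xi - x_mean) ** 2 for xi in x)
    sxy = sum((xi - x_mean) * (yi - y_mean) for xi, yi in zip(x, y))
    if sxx == 0:
        raise ValueError("x must not be constant")
    slope = sxy / sxx
    intercept = y_mean - slope * x_mean
    return intercept, slope


# Made-up monthly returns (in percent): a market index vs. a single asset.
market = [1.0, -0.5, 2.0, 0.3, -1.2]
asset = [1.4, -0.9, 2.7, 0.2, -1.6]

alpha, beta = ols_fit(market, asset)
print(f"alpha={alpha:.3f}, beta={beta:.3f}")
```

A closed-form fit like this can serve as the simple baseline against which a more flexible model is compared.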

Learning AI Tools in Tableau

As businesses increasingly rely on data to drive decisions, the role of advanced analytics and AI in enhancing data interpretation is becoming crucial. For professionals tasked with optimizing data analytics platforms like Tableau, staying ahead of the curve with the latest tools isn't just beneficial; it's essential. This insightful guide takes you through the integration of Tableau Pulse and Einstein Copilot, explaining their roles within the broader Tableau and Salesforce ecosystems. Author Ann Jackson, an esteemed analytics professional with deep expertise in Tableau, offers a step-by-step exploration of these tools, backed by real-world use cases that demonstrate their impact across various industries.
By the end of this book, you will:
● Understand the functionalities of Tableau Pulse and Einstein Copilot and how to use them
● Learn to deploy Tableau Pulse effectively, ensuring it aligns with your business objectives
● Navigate discussions on AI's role within Tableau, enhancing your strategic conversations
● Visualize how Tableau Pulse operates through detailed images and scenarios
● Utilize Einstein Copilot in Tableau Desktop/Prep to streamline and enhance data analysis

Probabilistic Forecasts and Optimal Decisions

Account for uncertainties and optimize decision-making with this thorough exposition
Decision theory is a body of thought and research seeking to apply a mathematical-logical framework to assessing probability and optimizing decision-making. It has developed robust tools for addressing all major challenges to decision making. Yet the number of variables and uncertainties affecting each decision outcome, many of them beyond the decider’s control, mean that decision-making is far from a ‘solved problem’. The tools created by decision theory remain to be refined and applied to decisions in which uncertainties are prominent. Probabilistic Forecasts and Optimal Decisions introduces a theoretically-grounded methodology for optimizing decision-making under conditions of uncertainty. Beginning with an overview of the basic elements of probability theory and methods for modeling continuous variates, it proceeds to survey the mathematics of both continuous and discrete models, supporting each with key examples. The result is a crucial window into the complex but enormously rewarding world of decision theory.
Readers of Probabilistic Forecasts and Optimal Decisions will also find:
● Extended case studies supported with real-world data
● Mini-projects running through multiple chapters to illustrate different stages of the decision-making process
● End of chapter exercises designed to facilitate student learning
Probabilistic Forecasts and Optimal Decisions is ideal for advanced undergraduate and graduate students in the sciences and engineering, as well as predictive analytics and decision analytics professionals.

Probability For Dummies, 2nd Edition

Learn how to calculate your chances with easy-to-understand explanations of probability
Probability (the likelihood or chance of an event occurring) is an important branch of mathematics used in business and economics, finance, engineering, physics, and beyond. We see probability at work every day in areas such as weather forecasting, investing, and sports betting. Packed with real-life examples and mathematical problems with thorough explanations, Probability For Dummies helps students, professionals, and the everyday reader learn the basics. Topics include set theory, counting, permutations and combinations, random variables, conditional probability, joint distributions, conditional expectations, and probability modeling. Pass your probability class and play your cards right, with this accessible Dummies guide.
● Understand how probability impacts daily life
● Discover what counting rules are and how to use them
● Practice probability concepts with sample problems and explanations
● Get clear explanations of all the topics in your probability or statistics class
Probability For Dummies is the perfect Dummies guide for college students, amateur and professional gamblers, investors, insurance professionals, and anyone preparing for the actuarial exam.
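For a flavor of the counting rules and conditional probability listed among the topics, here is a minimal Python sketch using a standard 52-card deck. The card examples and the helper `conditional_probability` are assumptions chosen for illustration, not material from the book.

```python
import math

# Permutations: ordered ways to deal 3 cards from a 52-card deck.
ordered_draws = math.perm(52, 3)

# Combinations: distinct 5-card hands, where order does not matter.
poker_hands = math.comb(52, 5)


def conditional_probability(p_a_and_b: float, p_b: float) -> float:
    """Return P(A | B) = P(A and B) / P(B); requires P(B) > 0."""
    if p_b <= 0:
        raise ValueError("P(B) must be positive")
    return p_a_and_b / p_b


# Chance the second card is an ace given that the first card was an ace:
# P(both aces) = (4/52) * (3/51) and P(first is an ace) = 4/52.
p_second_ace_given_first = conditional_probability((4 / 52) * (3 / 51), 4 / 52)

print(ordered_draws, poker_hands, round(p_second_ace_given_first, 4))
```

`math.perm` and `math.comb` are available in the Python standard library from version 3.8.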

Skew-Normal Model Theories and Their Applications

This book focuses on several skew-normal mixed effects models, and systematically explores the statistical inference theories, methods, and applications of parameters of interest. This book is of academic value, since it helps to establish a series of statistical inference theories and methods for skew-normal mixed effects models.

Fuzzy Methods for Assessment and Decision Making

Fuzzy Methods for Assessment and Decision Making presents the assessment of learning and problem-solving skills with qualitative grades. These methods are outcomes of the author’s research work on the subject for more than 20 years. In particular, a hybrid assessment model uses the Center of Gravity (COG) defuzzification technique, closed real intervals (grey numbers), neutrosophic sets, and soft sets as tools. The book starts with the basic mathematical background that is needed for an understanding of its contents. The Rectangular Fuzzy Assessment Model (RFAM) of Subbotin and Voskoglou is presented next, the outcomes of which are compared to those of the GPA index. The book presents innovative fuzzy assessment methods, enabling readers to assess the mean and quality performance of learning or problem-solving skills of a group of students when qualitative (linguistic) grades are used for this purpose. In the case of using linguistic grades for the assessment of a group’s skills, the classical method of calculating the mean value of the (numerical) grades cannot be applied. Also, no safe conclusions can be obtained on comparing the quality performance of two groups when the values of their GPA index are equal.
● Presents innovative, fuzzy assessment methods to enable readers to assess the mean and quality performance of learning
● Discusses fuzzy logic and techniques for decision-making in all domains
● Includes applications of fuzzy decision-making as a hybrid model using soft sets, grey numbers, and neutrosophic sets

Data Analysis and Related Applications 4

This book is a collective work by a number of leading scientists, analysts, engineers, mathematicians and statisticians who have been working at the forefront of data analysis and related applications, arising from data science, operations research, engineering, machine learning or statistics. The chapters of this collaborative work represent a cross-section of current research interests in the above scientific areas. The collected material has been divided into appropriate sections to provide the reader with both theoretical and applied information on data analysis methods, models and techniques, along with appropriate applications. Data Analysis and Related Applications 4 investigates a number of different topics in the areas mentioned above, touching on statistical analysis, stochastic processes, estimation methods, algorithms, distributions and networks, among others.

Data Storytelling with Altair and AI

Great data presentations tell a story. Learn how to organize, visualize, and present data using Python, generative AI, and the cutting-edge Altair data visualization toolkit. Take the fast track to amazing data presentations! Data Storytelling with Altair and AI introduces a stack of useful tools and tried-and-tested methodologies that will rapidly increase your productivity, streamline the visualization process, and leave your audience inspired.
In Data Storytelling with Altair and AI you’ll discover:
● Using Python Altair for data visualization
● Using Generative AI tools for data storytelling
● The main concepts of data storytelling
● Building data stories with the DIKW pyramid approach
● Transforming raw data into a data story
Data Storytelling with Altair and AI teaches you how to turn raw data into effective, insightful data stories. You’ll learn exactly what goes into an effective data story, then combine your Python data skills with the Altair library and AI tools to rapidly create amazing visualizations. Your bosses and decision-makers will love your new presentations, and you’ll love how quick Generative AI makes the whole process!
About the Technology
Every dataset tells a story. After you’ve cleaned, crunched, and organized the raw data, it’s your job to share its story in a way that connects with your audience. Python’s Altair data visualization library, combined with generative AI tools like Copilot and ChatGPT, provides an amazing toolbox for transforming numbers, code, text, and graphics into intuitive data presentations.
About the Book
Data Storytelling with Altair and AI teaches you how to build enhanced data visualizations using these tools. The book uses hands-on examples to build powerful narratives that can inform, inspire, and motivate. It covers the Altair data visualization library, along with AI techniques like generating text with ChatGPT, creating images with DALL-E, and Python coding with Copilot. You’ll learn by practicing with each interesting data story, from tourist arrivals in Portugal to population growth in the USA to fake news, salmon aquaculture, and more.
What's Inside
● The Data-Information-Knowledge-Wisdom (DIKW) pyramid
● Publish data stories using Streamlit, Tableau, and Comet
● Vega and Vega-Lite visualization grammar
About the Reader
For data analysts and data scientists experienced with Python. No previous knowledge of Altair or Generative AI required.
About the Author
Angelica Lo Duca is a researcher at the Institute of Informatics and Telematics of the National Research Council, Italy. The technical editor on this book was Ninoslav Cerkez.
Quotes
This book’s step-by-step approach, illustrated through real-world examples, makes complex data accessible and actionable. - Alexey Grigorev, DataTalks.Club
A clear and concise guide to data storytelling. Highly recommended. - Andrew Madson, Insights x Design
Data storytelling in a way that anyone can do! This book feels ahead of its time. - Avery Smith, Data Career Jumpstart
Excellent hands-on exercises that combine two of my favorite tools: AI and the Altair library. - Jose Berengueres, Author of DataViz and Storytelling

Classification Methods for Remotely Sensed Data, 3rd Edition

The new edition of the bestselling Classification Methods for Remotely Sensed Data covers current state-of-the-art machine learning algorithms and developments in the analysis of remotely sensed data, and presents new AI-based analysis tools and metrics together with ongoing debates on accuracy assessment strategies and XAI methods.

Statistics for Data Science and Analytics

Introductory statistics textbook with a focus on data science topics such as prediction, correlation, and data exploration
Statistics for Data Science and Analytics is a comprehensive guide to statistical analysis using Python, presenting important topics useful for data science such as prediction, correlation, and data exploration. The authors provide an introduction to statistical science and big data, as well as an overview of Python data structures and operations. A range of statistical techniques are presented with their implementation in Python, including hypothesis testing, probability, exploratory data analysis, categorical variables, surveys and sampling, A/B testing, and correlation. The text introduces binary classification, a foundational element of machine learning, validation of statistical models by applying them to holdout data, and probability and inference via the easy-to-understand method of resampling and the bootstrap instead of using a myriad of “kitchen sink” formulas. Regression is taught both as a tool for explanation and for prediction. This book is informed by the authors’ experience designing and teaching both introductory statistics and machine learning at Statistics.com. Each chapter includes practical examples, explanations of the underlying concepts, and Python code snippets to help readers apply the techniques themselves.
Statistics for Data Science and Analytics includes information on sample topics such as:
● Int, float, and string data types, numerical operations, manipulating strings, converting data types, and advanced data structures like lists, dictionaries, and sets
● Experiment design via randomizing, blinding, and before-after pairing, as well as proportions and percents when handling binary data
● Specialized Python packages like numpy, scipy, pandas, scikit-learn and statsmodels (the workhorses of data science) and how to get the most value from them
● Statistical versus practical significance, random number generators, functions for code reuse, and binomial and normal probability distributions
Written by and for data science instructors, Statistics for Data Science and Analytics is an excellent learning resource for data science instructors prescribing a required intro stats course for their programs, as well as other students and professionals seeking to transition to the data science field.
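To make the resampling idea in the description concrete, here is a minimal sketch of a percentile bootstrap confidence interval for a mean, using only the Python standard library. It is an illustrative assumption rather than code from the book: the function name `bootstrap_mean_ci`, its defaults, and the sample values are all made up.

```python
import random
import statistics


def bootstrap_mean_ci(sample, n_resamples=2000, confidence=0.95, seed=0):
    """Percentile bootstrap interval for the mean (hypothetical helper).

    Resamples the data with replacement, records each resample's mean,
    and reads the interval off the sorted means.
    """
    rng = random.Random(seed)  # fixed seed so the sketch is repeatable
    n = len(sample)
    means = sorted(
        statistics.fmean(rng.choices(sample, k=n)) for _ in range(n_resamples)
    )
    alpha = 1.0 - confidence
    # Clamp the indices so they always fall inside the list of means.
    lower_idx = max(0, int((alpha / 2) * n_resamples))
    upper_idx = min(n_resamples - 1, int((1 - alpha / 2) * n_resamples))
    return means[lower_idx], means[upper_idx]


# Made-up measurements, for illustration only.
data = [12.1, 9.8, 11.4, 10.9, 13.2, 10.1, 9.5, 12.7, 11.0, 10.6]
low, high = bootstrap_mean_ci(data)
print(f"95% bootstrap interval for the mean: ({low:.2f}, {high:.2f})")
```

The percentile method shown here is the simplest bootstrap interval; other variants exist and may behave better on small or skewed samples.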

Graph Based Multimedia Analysis

Graph Based Multimedia Analysis applies concepts from graph theory to the problems of analyzing overabundant video data. Video data can be quite diverse: exocentric (captured by a standard camera) or egocentric (captured by a wearable device like Google Glass); of various durations (ranging from a few seconds to several hours); and could be from a single source or multiple sources. Efficient extraction of important information from such a large class of diverse video data can be overwhelming. The book, with its rich repertoire of theoretically elegant solutions, from graph theory in conjunction with deep learning, constrained optimization, and game theory, empowers the audience to achieve tasks like obtaining concise yet useful summaries and precisely recognizing single as well as multiple actions in a computationally efficient manner. The book provides a unique treatise on topics like egocentric video analysis and scalable video processing.
● Addresses a number of challenging state-of-the-art problems in multimedia analysis like summarization, co-summarization, and action recognition
● Handles a wide class of video with different genres, durations, and numbers
● Applies a class of theoretically rich algorithms from the discipline of graph theory, in conjunction with deep learning, constrained optimization and game theory
● Includes thorough complexity analyses of the proposed solutions, and an appendix containing implementable source codes

Bayesian Statistics and Marketing, 2nd Edition

Fine-tune your marketing research with this cutting-edge statistical toolkit
Bayesian Statistics and Marketing illustrates the potential for applying a Bayesian approach to some of the most challenging and important problems in marketing. Analyzing household and consumer data, predicting product performance, and custom-targeting campaigns are only a few of the areas in which Bayesian approaches promise revolutionary results. This book provides a comprehensive, accessible overview of this subject essential for any statistically informed marketing researcher or practitioner. Economists and other social scientists will find a comprehensive treatment of many Bayesian methods that are central to the problems in social science more generally. This includes a practical approach to computationally challenging problems in random coefficient models, non-parametrics, and the problems of endogeneity.
Readers of the second edition of Bayesian Statistics and Marketing will also find:
● Discussion of Bayesian methods in text analysis and Machine Learning
● Updates throughout reflecting the latest research and applications
● Discussion of modern statistical software, including an introduction to the R package bayesm, which implements all models incorporated here
● Extensive case studies throughout to link theory and practice
Bayesian Statistics and Marketing is ideal for advanced students and researchers in marketing, business, and economics departments, as well as for any statistically savvy marketing practitioner.

Artificial Intelligence in Forecasting

Can you forecast the future value by considering historical data? Accurate forecasting requires more than just plugging in historical data into models. Readers will find the latest techniques used by managers in business today, discover the importance of forecasting and learn how it is accomplished.

Biostatistics For Dummies, 2nd Edition

Break down biostatistics, make sense of complex concepts, and pass your class
If you're taking biostatistics, you may need or want a little extra assistance as you make your way through. Biostatistics For Dummies follows a typical biostatistics course at the college level, helping you understand even the most difficult concepts, so you can get the grade you need. Start at the beginning by learning how to read and understand mathematical equations and conduct clinical research. Then, use your knowledge to analyze and graph your data. This new edition includes more example problems with step-by-step walkthroughs on how to use statistical software to analyze large datasets. Biostatistics For Dummies is your go-to guide for making sense of it all.
● Review basic statistics and decode mathematical equations
● Learn how to analyze and graph data from clinical research studies
● Look for relationships with correlation and regression
● Use software to properly analyze large datasets
Anyone studying in clinical science, public health, pharmaceutical sciences, chemistry, and epidemiology-related fields will want this book to get through that biostatistics course.

D3.js in Action, Third Edition

Create stunning web-based data visualizations with D3.js. This totally revised new edition of D3.js in Action guides you from simple charts to powerful interactive graphics. Chapter-by-chapter you’ll assemble an impressive portfolio of visualizations, including intricate networks, maps, and even a complete customized visualization layout. Plus, you'll learn best practices for building interactive graphics, animations, and integrating your work into frontend development frameworks like React and Svelte.
In D3.js in Action, Third Edition you will learn how to:
● Set up a local development environment for D3
● Include D3 in web development projects, including Node-based web apps
● Select and append DOM elements
● Size and position elements on screen
● Assemble components and layouts into creative data visualizations
D3.js in Action, Third Edition has been extensively revised for D3.js version 7, and modern best practices for web visualizations. Its brand new chapters dive into interactive visualizations, cover responsiveness for dataviz, and show you how you can improve accessibility.
About the Technology
With D3.js, you can create sophisticated infographics, charts, and interactive data visualizations using standard frontend tools like JavaScript, HTML, and CSS. Granting D3 its VIS Test of Time award, the IEEE credited this powerful library for bringing data visualization to the mainstream. You’ll be blown away by how beautiful your results can be!
About the Book
D3.js in Action, Third Edition is a roadmap for creating brilliant and beautiful visualizations with D3.js. Like a gentle mentor, it guides you from basic charts all the way to advanced interactive visualizations like networks and maps. You’ll learn to build graphics, create animations, and set up mobile-friendly responsiveness. Each chapter contains a complete data visualization project to put your new skills into action.
What's Inside
● Fully revised for D3.js v7
● Includes 12 complete projects
● Create data visualizations with SVG and canvas
● Combine D3 with React, Svelte, and Angular
About the Reader
For web developers with HTML, CSS, and JavaScript skills.
About the Authors
Elijah Meeks was a data visualization pioneer at Stanford and the first Senior Data Visualization Engineer at Netflix. Anne-Marie Dufour is a Data Visualization Engineer. The technical editor on this book was Jon Borgman.
Quotes
Guides readers through the intricate world of D3 with clarity and practical insight. Whether you’re a seasoned expert or just starting, this book will be invaluable. - Connor Rothschild, Data Visualization Engineer, Moksha Data Studio
Amazing job of explaining the core concepts of D3 while providing all you need to learn other fundamental concepts. - Lindsey Poulter, Visualization Engineer, New York Mets
A navigation tool to explore all possible paths in the world of D3. Clear schematics and nicely selected examples guide the readers through D3’s possibilities. - Matthias Stahl, Head Data & Visualizations, Der SPIEGEL