talk-data.com

Topic: exploratory-data-analysis (56 tagged)

Activities (filtered by: O'Reilly Data Science Books)

Bibliometric Analyses in Data-Driven Decision-Making

The book provides essential insights and practical tools needed to effectively navigate the evolving landscape of scholarly research, helping enhance the understanding of publication trends, citation impacts, and collaboration networks across multiple fields.
Bibliometric Analyses in Data-Driven Decision-Making offers a comprehensive guide to researchers, academics, and practitioners interested in utilizing bibliometric analysis to understand and navigate the dynamic landscape of the increasingly vital field of data-driven decision-making and its applications across many areas. It provides insights into growth, impact, and trends within the field, using bibliometric tools and methodologies. This volume adopts a pragmatic approach, balancing theoretical concepts with practical applications of data-driven decision-making models through the perspectives of bibliometric analyses using real-world examples, case studies, and step-by-step guides.
The reader will find the book:
• Gives practical guidance on conducting bibliometric analyses across a range of applications for data-driven decision-making;
• Illustrates the application of bibliometric tools in the field with real-world case studies;
• Provides in-depth coverage of various bibliometric indicators and metrics;
• Explores emerging trends and challenges in bibliometric analysis;
• Provides a comprehensive overview of software and tools available for bibliometric research.
Audience: Librarians and Information professionals involved in research management, knowledge discovery, and the evaluation of scholarly communication, as well as professionals in industries reliant on cutting-edge research and development, technology assessment, and innovation. Also, a range of researchers and scholars seeking how to apply bibliometric analysis to assess the impact of their work, and advanced insights into bibliometric metrics, collaboration networks, and research trends.

Classification Methods for Remotely Sensed Data, 3rd Edition

The new edition of the bestselling Classification Methods for Remotely Sensed Data covers current state-of-the-art machine learning algorithms and developments in the analysis of remotely sensed data, and presents new AI-based analysis tools and metrics together with ongoing debates on accuracy assessment strategies and XAI methods.

Fundamentals of Data Observability

Quickly detect, troubleshoot, and prevent a wide range of data issues through data observability, a set of best practices that enables data teams to gain greater visibility of data and its usage. If you're a data engineer, data architect, or machine learning engineer who depends on the quality of your data, this book shows you how to focus on the practical aspects of introducing data observability in your everyday work. Author Andy Petrella helps you build the right habits to identify and solve data issues, such as data drifts and poor quality, so you can stop their propagation in data applications, pipelines, and analytics. You'll learn ways to introduce data observability, including setting up a framework for generating and collecting all the information you need.
• Learn the core principles and benefits of data observability
• Use data observability to detect, troubleshoot, and prevent data issues
• Follow the book's recipes to implement observability in your data projects
• Use data observability to create a trustworthy communication framework with data consumers
• Learn how to educate your peers about the benefits of data observability

Data Mining and Predictive Analytics for Business Decisions

With many recent advances in data science, we have many more tools and techniques available for data analysts to extract information from data sets. This book will assist data analysts to move up from simple tools such as Excel for descriptive analytics to answer more sophisticated questions using machine learning. Most of the exercises use R and Python, but rather than focus on coding algorithms, the book employs interactive interfaces to these tools to perform the analysis. Using the CRISP-DM data mining standard, the early chapters cover conducting the preparatory steps in data mining: translating business information needs into framed analytical questions and data preparation. The Jamovi and the JASP interfaces are used with R and the Orange3 data mining interface with Python. Where appropriate, Voyant and other open-source programs are used for text analytics. The techniques covered in this book range from basic descriptive statistics, such as summarization and tabulation, to more sophisticated predictive techniques, such as linear and logistic regression, clustering, classification, and text analytics. Includes companion files with case study files, solution spreadsheets, data sets and charts, etc. from the book.
Features:
• Covers basic descriptive statistics, such as summarization and tabulation, to more sophisticated predictive techniques, such as linear and logistic regression, clustering, classification, and text analytics
• Uses R, Python, Jamovi and JASP interfaces, and the Orange3 data mining interface
• Includes companion files with the case study files from the book, solution spreadsheets, data sets, etc.

Data Analysis and Related Applications, Volume 1

The scientific field of data analysis is constantly expanding due to the rapid growth of the computer industry and the wide applicability of computational and algorithmic techniques, in conjunction with new advances in statistical, stochastic and analytic tools. There is a constant need for new, high-quality publications to cover the recent advances in all fields of science and engineering. This book is a collective work by a number of leading scientists, computer experts, analysts, engineers, mathematicians, probabilists and statisticians who have been working at the forefront of data analysis and related applications. The chapters of this collaborative work represent a cross-section of current concerns, developments and research interests in the above scientific areas. The collected material has been divided into appropriate sections to provide the reader with both theoretical and applied information on data analysis methods, models and techniques, along with related applications.

Data Analysis and Related Applications, Volume 2

The scientific field of data analysis is constantly expanding due to the rapid growth of the computer industry and the wide applicability of computational and algorithmic techniques, in conjunction with new advances in statistical, stochastic and analytic tools. There is a constant need for new, high-quality publications to cover the recent advances in all fields of science and engineering. This book is a collective work by a number of leading scientists, computer experts, analysts, engineers, mathematicians, probabilists and statisticians who have been working at the forefront of data analysis and related applications. The chapters of this collaborative work represent a cross-section of current concerns, developments and research interests in the above scientific areas. The collected material has been divided into appropriate sections to provide the reader with both theoretical and applied information on data analysis methods, models and techniques, along with related applications.

Face Analysis Under Uncontrolled Conditions

Face analysis is essential for a large number of applications such as human-computer interaction or multimedia (e.g. content indexing and retrieval). Although many approaches are under investigation, performance under uncontrolled conditions is still not satisfactory. The variations that impact facial appearance (e.g. pose, expression, illumination, occlusion, motion blur) make it a difficult problem to solve. This book describes the progress towards this goal, from a core building block – landmark detection – to the higher level of micro and macro expression recognition. Specifically, the book addresses the modeling of temporal information to coincide with the dynamic nature of the face. It also includes a benchmark of recent solutions along with details about the acquisition of a dataset for such tasks.

Research Data Sharing and Valorization

As platforms for sharing, re-using and storing data, research data repositories are integral to open science policy. This book provides a comprehensive approach to these data repositories, their functionalities, uses, issues and prospects. Taking France as an example, the current landscape of data repositories is considered, including discussion of the idea of a national repository and a comparative study of several national systems. The international re3data directory is outlined and a collection of six case studies of model repositories, both public and private, are detailed (CDS, Data INRAE, SEANOE, Nakala, Figshare and Data Mendeley). Research Data Sharing and Valorization also includes appendices containing a number of websites and reference texts from the French Ministry of Higher Education, Research and Innovation, and the CNRS. To the authors’ knowledge, it is the first book to be entirely devoted to these new platforms and is aimed at researchers, teachers, students and professionals working with scientific and technical data and information.

Innovative Data Integration and Conceptual Space Modeling for COVID, Cancer, and Cardiac Care

In recent years, scientific research and translation medicine have placed increased emphasis on computational methodology and data curation across many disciplines, both to advance underlying science and to instantiate precision-medicine protocols in the lab and in clinical practice. The nexus of concerns related to oncology, cardiology, and virology (SARS-CoV-2) presents a fortuitous context within which to examine the theory and practice of biomedical data curation. Innovative Data Integration and Conceptual Space Modeling for COVID, Cancer, and Cardiac Care argues that a well-rounded approach to data modeling should optimally embrace multiple perspectives inasmuch as data-modeling is neither a purely formal nor a purely conceptual discipline, but rather a hybrid of both. On the one hand, data models are designed for use by computer software components, and are, consequently, constrained by the mechanistic demands of software environments; data modeling strategies must accept the formal rigors imposed by unambiguous data-sharing and query-evaluation logic. In particular, data models are not well-suited for software-level deployment if such models do not translate seamlessly to clear strategies for querying data and ensuring data integrity as information is moved across multiple points. On the other hand, data modeling is, likewise, constrained by human conceptual tendencies, because the information which is managed by databases and data networks is ultimately intended to be visualized/utilized by humans as the end-user. Thus, at the intersection of both formal and humanistic methodology, data modeling takes on elements of both logico-mathematical frameworks (e.g., type systems and graph theory) and conceptual/philosophical paradigms (e.g., linguistics and cognitive science). 
The authors embrace this two-sided aspect of data models by seeking non-reductionistic points of convergence between formal and humanistic/conceptual viewpoints, and by leveraging biomedical contexts (viz., COVID, Cancer, and Cardiac Care) so as to provide motivating examples and case-studies in this volume.
• Provides an analysis of how conceptual spaces and related cognitive linguistic approaches can inspire programming and query-processing models
• Outlines the vital role that data modeling/curation has played in significant medical breakthroughs
• Presents readers with an overview of how information-management approaches intersect with precision medicine, providing case studies of data-modeling in concrete scientific practice
• Explores applications of image analysis and computer vision in the context of precision medicine
• Examines the role of technology in scientific publishing, replication studies, and dataset curation

Intelligent Data Analytics for Terror Threat Prediction

Intelligent data analytics for terror threat prediction is an emerging field of research at the intersection of information science and computer science, bringing with it a new era of tremendous opportunities and challenges owing to the abundance of readily available criminal data for further analysis. This book provides innovative insights that will help devise interventions for emerging, dynamic scenarios of criminal activity. Furthermore, it presents emerging issues, challenges and management strategies in public safety and crime control development across various domains. The book will play a vital role in improving human life to a great extent. Researchers and practitioners working in the fields of data mining, machine learning and artificial intelligence will greatly benefit from this book, which will be a good addition to the state-of-the-art approaches collected for intelligent data analytics. It will also be very beneficial for those who are new to the field and need to quickly become acquainted with the best performing methods. With this book they will be able to compare different approaches and carry forward their research in the most important areas of this field, which has a direct impact on the betterment of human life by maintaining the security of our society. No other book is currently on the market which provides such a good collection of state-of-the-art methods for intelligent data analytics-based models for terror threat prediction, as intelligent data analytics is a newly emerging field and research in data mining and machine learning is still in the early stage of development.

Intelligent Data Analysis

This book focuses on methods and tools for intelligent data analysis, aimed at narrowing the increasing gap between data gathering and data comprehension. Emphasis is also given to solving problems that result from automated data collection, such as analysis of computer-based patient records, data warehousing tools, intelligent alarming, effective and efficient monitoring, and so on. This book aims to describe the different approaches of Intelligent Data Analysis from a practical point of view: solving common real-life problems with data analysis tools.

Data Analysis and Applications 3, 3rd Edition

Data analysis as an area of importance has grown exponentially, especially during the past couple of decades. This can be attributed to a rapidly growing computer industry and the wide applicability of computational techniques, in conjunction with new advances of analytic tools. This being the case, the need for literature that addresses this is self-evident. New publications are appearing, covering the need for information from all fields of science and engineering, thanks to the universal relevance of data analysis and statistics packages. This book is a collective work by a number of leading scientists, analysts, engineers, mathematicians and statisticians who have been working at the forefront of data analysis. The chapters included in this volume represent a cross-section of current concerns and research interests in these scientific areas. The material is divided into two parts: Computational Data Analysis, and Classification Data Analysis, with methods for both - providing the reader with both theoretical and applied information on data analysis methods, models and techniques and appropriate applications.

End-to-end Data Analytics for Product Development

An interactive guide to the statistical tools used to solve problems during product and process innovation
End to End Data Analytics for Product Development is an accessible guide designed for practitioners in the industrial field. It offers an introduction to data analytics and the design of experiments (DoE) whilst covering the basic statistical concepts useful to an understanding of DoE. The text supports product innovation and development across a range of consumer goods and pharmaceutical organizations in order to improve the quality and speed of implementation through data analytics, statistical design and data prediction. The book reviews information on feasibility screening, formulation and packaging development, sensory tests, and more. The authors – noted experts in the field – explore relevant techniques for data analytics and present the guidelines for data interpretation. In addition, the book contains information on process development and product validation that can be optimized through data understanding, analysis and validation. The authors present an accessible, hands-on approach that uses MINITAB and JMP software.
The book:
• Presents a guide to innovation feasibility and formulation and process development
• Contains the statistical tools used to solve challenges faced during product innovation and feasibility
• Offers information on stability studies which are common especially in chemical or pharmaceutical fields
• Includes a companion website which contains videos summarizing main concepts
Written for undergraduate students and practitioners in industry, End to End Data Analytics for Product Development offers resources for the planning, conducting, analyzing and interpreting of controlled tests in order to develop effective products and processes.

Data Mining for Business Analytics

Data Mining for Business Analytics: Concepts, Techniques, and Applications in Python presents an applied approach to data mining concepts and methods, using Python software for illustration.
Readers will learn how to implement a variety of popular data mining algorithms in Python (a free and open-source software) to tackle business problems and opportunities. This is the sixth version of this successful text, and the first using Python. It covers both statistical and machine learning algorithms for prediction, classification, visualization, dimension reduction, recommender systems, clustering, text mining and network analysis.
It also includes:
• A new co-author, Peter Gedeck, who brings both experience teaching business analytics courses using Python, and expertise in the application of machine learning methods to the drug-discovery process
• A new section on ethical issues in data mining
• Updates and new material based on feedback from instructors teaching MBA, undergraduate, diploma and executive courses, and from their students
• More than a dozen case studies demonstrating applications for the data mining techniques described
• End-of-chapter exercises that help readers gauge and expand their comprehension and competency of the material presented
• A companion website with more than two dozen data sets, and instructor materials including exercise solutions, PowerPoint slides, and case solutions
Data Mining for Business Analytics: Concepts, Techniques, and Applications in Python is an ideal textbook for graduate and upper-undergraduate level courses in data mining, predictive analytics, and business analytics. This new edition is also an excellent reference for analysts, researchers, and practitioners working with quantitative methods in the fields of business, finance, marketing, computer science, and information technology.
“This book has by far the most comprehensive review of business analytics methods that I have ever seen, covering everything from classical approaches such as linear and logistic regression, through to modern methods like neural networks, bagging and boosting, and even much more business specific procedures such as social network analysis and text mining. If not the bible, it is at the least a definitive manual on the subject.” —Gareth M. James, University of Southern California and co-author (with Witten, Hastie and Tibshirani) of the best-selling book An Introduction to Statistical Learning, with Applications in R

Clustering Methodology for Symbolic Data

Covers everything readers need to know about clustering methodology for symbolic data, including new methods and headings, while providing a focus on multi-valued list data, interval data and histogram data
This book presents all of the latest developments in the field of clustering methodology for symbolic data, paying special attention to the classification methodology for multi-valued list, interval-valued and histogram-valued data methodology, along with numerous worked examples. The book also offers an expansive discussion of data management techniques showing how to manage the large complex dataset into more manageable datasets ready for analyses. Filled with examples, tables, figures, and case studies, Clustering Methodology for Symbolic Data begins by offering chapters on data management, distance measures, general clustering techniques, partitioning, divisive clustering, and agglomerative and pyramid clustering.
• Provides new classification methodologies for histogram valued data reaching across many fields in data science
• Demonstrates how to manage a large complex dataset into manageable datasets ready for analysis
• Features very large contemporary datasets such as multi-valued list data, interval-valued data, and histogram-valued data
• Considers classification models by dynamical clustering
• Features a supporting website hosting relevant data sets
Clustering Methodology for Symbolic Data will appeal to practitioners of symbolic data analysis, such as statisticians and economists within the public sectors. It will also be of interest to postgraduate students of, and researchers within, web mining, text mining and bioengineering.

Spatial Analysis Using Big Data

Spatial Analysis Using Big Data: Methods and Urban Applications helps readers understand the most powerful, state-of-the-art spatial econometric methods, focusing particularly on urban research problems. The methods represent a cluster of potentially transformational socio-economic modeling tools that allow researchers to capture real-time and high-resolution information to potentially reveal new socioeconomic dynamics within urban populations. Each method, written by leading exponents of the discipline, uses real-time urban big data to solve research problems in spatial science. Urban applications of these methods are provided in unsurpassed depth, with chapters on surface temperature mapping, view value analysis, community clustering and spatial-social networks, among many others.
• Reviews some of the most powerful and challenging modern methods to study big data problems in spatial science
• Provides computer codes written in R, MATLAB and Python to help implement methods
• Applies these methods to common problems observed in urban and regional economics

Data Analysis and Applications 1

This series of books collects a diverse array of work that provides the reader with theoretical and applied information on data analysis methods, models, and techniques, along with appropriate applications. Volume 1 begins with an introductory chapter by Gilbert Saporta, a leading expert in the field, who summarizes the developments in data analysis over the last 50 years. The book is then divided into three parts: Part 1 presents clustering and regression cases; Part 2 examines grouping and decomposition, GARCH and threshold models, structural equations, and SME modeling; and Part 3 presents symbolic data analysis, time series and multiple choice models, modeling in demography, and data mining.

Intelligent Data Analysis for Biomedical Applications

Intelligent Data Analysis for Biomedical Applications: Challenges and Solutions presents specialized statistical, pattern recognition, machine learning, data abstraction and visualization tools for the analysis of data and discovery of mechanisms that create data. It provides computational methods and tools for intelligent data analysis, with an emphasis on problem-solving relating to automated data collection, such as computer-based patient records, data warehousing tools, intelligent alarming, effective and efficient monitoring, and more. This book provides useful references for educational institutions, industry professionals, researchers, scientists, engineers and practitioners interested in intelligent data analysis, knowledge discovery, and decision support in databases.
• Provides the methods and tools necessary for intelligent data analysis and gives solutions to problems resulting from automated data collection
• Contains an analysis of medical databases to provide diagnostic expert systems
• Addresses the integration of intelligent data analysis techniques within biomedical information systems

Data Analyst

With this book, aspiring data analysts will discover what data analysts do all day, what skills they will need for the role, and what regulations they will be required to adhere to. Practising data analysts can explore useful data analysis tools, methods and techniques, brush up on best practices and look at how they can advance their career.