talk-data.com talk-data.com

Topic

data

5765

tagged

Activity Trend

3 peak/qtr
2020-Q1 2026-Q1

Activities

5765 activities · Newest first

R Data Analysis Projects

Step into the world of advanced data analysis with 'R Data Analysis Projects.' In this hands-on guide, you will learn to build efficient analytics systems and pipelines using R for practical applications in finance, social media, and more. By following real-world projects, you'll enhance your data analysis skills, from implementing recommender systems to performing time-series modeling. What this Book will help me do Develop end-to-end data analysis and visualization solutions using R. Create scalable predictive analytics systems with actionable insights. Leverage RShiny to build interactive dashboards for effective communication. Master popular R packages like dplyr, ggplot2, and recommenderlab. Tackle real-world data challenges in varied domains such as finance and social networks. Author(s) Gopi Subramanian is an experienced data scientist and educator with an extensive background in statistical modeling and analytics. With years of hands-on practice and teaching, Gopi specializes in making complex concepts accessible through practical examples. His passion for R programming and real-world applications shines in his approachable style, making learning empowering and engaging. Who is it for? This book is designed for readers with a foundational understanding of R and data analysis, aiming to advance their skills to a professional level. Ideal for data analysts, R programmers, and aspiring data scientists seeking practical experience in building analytics systems. Whether you're transitioning to or deepening your expertise in R, this guide offers actionable knowledge to enhance your projects.

Statistics for Data Science

Dive into the world of statistics specifically tailored for the needs of data science with 'Statistics for Data Science'. This book guides you from the fundamentals of statistical concepts to their practical application in data analysis, machine learning, and neural networks. Learn with clear explanations and practical R examples to fully grasp statistical methods for data-driven challenges. What this Book will help me do Understand foundational statistical concepts such as variance, standard deviation, and probability. Gain proficiency in using R for programmatically performing statistical computations for data science. Learn techniques for applying statistics in data cleaning, mining, and analysis tasks. Master methods for executing linear regression, regularization, and model assessment. Explore advanced techniques like boosting, SVMs, and neural network applications. Author(s) James D. Miller brings years of experience as a data scientist and educator. He has a deep understanding of how statistics foundationally supports data science and has worked across multiple industries applying these principles. Dedicated to teaching, James simplifies complex statistical concepts into approachable and actionable knowledge for developers aspiring to master data science applications. Who is it for? This book is intended for developers aiming to transition into the field of data science. If you have some basic programming knowledge and a desire to understand statistics essentials for data science applications, this book is designed for you. It's perfect for those who wish to apply statistical methods to practical tasks like data mining and analysis. A prior hands-on experience with R is helpful but not mandatory, as the book explains R methodologies comprehensively.

Practical Data Wrangling

"Practical Data Wrangling" provides a comprehensive guide to cleaning and preparing data for analysis, focusing on techniques in Python and R. As you progress through the book, you'll learn how to handle various datasets, reshape their formats, and prepare them for insights, empowering you to derive more value from your data. What this Book will help me do Understand the data wrangling process and its importance in the data analysis pipeline. Learn how to retrieve, parse, and shape raw data into structured formats. Master packages and tools in Python and R to efficiently clean and manipulate data. Gain proficiency in using regular expressions for text data preparation. Acquire skills to analyze, merge, and transform datasets to meet analytics needs. Author(s) None Visochek has years of experience working with data and analytics, with expertise in using Python and R for solving real-world data challenges. Their teaching approach emphasizes practical examples and accessible explanations, ensuring complex concepts are easy to understand. Who is it for? This book is for data scientists, analysts, or statisticians who work with real-world data and want to optimize their data preparation process. It is ideal for professionals with basic knowledge of Python and R looking to enhance their skills in data wrangling and data preparation techniques. If you're seeking to streamline your data analysis workflow through better wrangling techniques, this book is for you.

The State of Data Analytics and Visualization Adoption

Businesses regardless of industry or company size increasingly rely on data analytics and visualization to gain competitive advantage. That’s why organizations today are racing to gather, store, and analyze data from many sources in a wide range of formats. In the spring of 2017, Zoomdata commissioned an O’Reilly survey to assess the state of data analytics and visualization technology adoption across several industries, including manufacturing, financial services, and healthcare. Roughly 875 respondents answered questions online about their industry, job role, company size, and reasons for using analytics, as well as technologies they use in analytics programs, the perceived value of analytics programs, and many other topics. This report reveals: The industries furthest along in adopting big data analytics and visualization technologies The most commonly analyzed sources of big data The most commonly used technologies for analyzing streaming data Which analytics skills are in most demand The most valued characteristic of big data across all industries The types of users big data analytics and visualization projects typically target If you’re a technology decision maker, a product manager looking to embed analytics, a business user relying on analytics, or a developer pursuing the most marketable skills, this report provides valuable details on today’s data analytics trends.

Basic Applied Bioinformatics

An accessible guide that introduces students in all areas of life sciences to bioinformatics Basic Applied Bioinformatics provides a practical guidance in bioinformatics and helps students to optimize parameters for data analysis and then to draw accurate conclusions from the results. In addition to parameter optimization, the text will also familiarize students with relevant terminology. Basic Applied Bioinformatics is written as an accessible guide for graduate students studying bioinformatics, biotechnology, and other related sub-disciplines of the life sciences. This accessible text outlines the basics of bioinformatics, including pertinent information such as downloading molecular sequences (nucleotide and protein) from databases; BLAST analyses; primer designing and its quality checking, multiple sequence alignment (global and local using freely available software); phylogenetic tree construction (using UPGMA, NJ, MP, ME, FM algorithm and MEGA7 suite), prediction of protein structures and genome annotation, RNASeq data analyses and identification of differentially expressed genes and similar advanced bioinformatics analyses. The authors Chandra Sekhar Mukhopadhyay, Ratan Kumar Choudhary, and Mir Asif Iquebal are noted experts in the field and have come together to provide an updated information on bioinformatics. Salient features of this book includes: Accessible and updated information on bioinformatics tools A practical step-by-step approach to molecular-data analyses Information pertinent to study a variety of disciplines including biotechnology, zoology, bioinformatics and other related fields Worked examples, glossary terms, problems and solutions Basic Applied Bioinformatics gives students studying bioinformatics, agricultural biotechnology, animal biotechnology, medical biotechnology, microbial biotechnology, and zoology an updated introduction to the growing field of bioinformatics.

Exam Ref 70-767 Implementing a SQL Data Warehouse

Prepare for Microsoft Exam 70-767–and help demonstrate your real-world mastery of skills for managing data warehouses. This exam is intended for Extract, Transform, Load (ETL) data warehouse developers who create business intelligence (BI) solutions. Their responsibilities include data cleansing as well as ETL and data warehouse implementation. The reader should have experience installing and implementing a Master Data Services (MDS) model, using MDS tools, and creating a Master Data Manager database and web application. The reader should understand how to design and implement ETL control flow elements and work with a SQL Service Integration Services package. Focus on the expertise measured by these objectives: • Design, and implement, and maintain a data warehouse • Extract, transform, and load data • Build data quality solutionsThis Microsoft Exam Ref: • Organizes its coverage by exam objectives • Features strategic, what-if scenarios to challenge you • Assumes you have working knowledge of relational database technology and incremental database extraction, as well as experience with designing ETL control flows, using and debugging SSIS packages, accessing and importing or exporting data from multiple sources, and managing a SQL data warehouse. Implementing a SQL Data Warehouse About the Exam Exam 70-767 focuses on skills and knowledge required for working with relational database technology. About Microsoft Certification Passing this exam earns you credit toward a Microsoft Certified Professional (MCP) or Microsoft Certified Solutions Associate (MCSA) certification that demonstrates your mastery of data warehouse management Passing this exam as well as Exam 70-768 (Developing SQL Data Models) earns you credit toward a Microsoft Certified Solutions Associate (MCSA) SQL 2016 Business Intelligence (BI) Development certification. See full details at: microsoft.com/learning

Measuring Agreement

Presents statistical methodologies for analyzing common types of data from method comparison experiments and illustrates their applications through detailed case studies Measuring Agreement: Models, Methods, and Applications features statistical evaluation of agreement between two or more methods of measurement of a variable with a primary focus on continuous data. The authors view the analysis of method comparison data as a two-step procedure where an adequate model for the data is found, and then inferential techniques are applied for appropriate functions of parameters of the model. The presentation is accessible to a wide audience and provides the necessary technical details and references. In addition, the authors present chapter-length explorations of data from paired measurements designs, repeated measurements designs, and multiple methods; data with covariates; and heteroscedastic, longitudinal, and categorical data. The book also: • Strikes a balance between theory and applications • Presents parametric as well as nonparametric methodologies • Provides a concise introduction to Cohen’s kappa coefficient and other measures of agreement for binary and categorical data • Discusses sample size determination for trials on measuring agreement • Contains real-world case studies and exercises throughout • Provides a supplemental website containing the related datasets and R code Measuring Agreement: Models, Methods, and Applications is a resource for statisticians and biostatisticians engaged in data analysis, consultancy, and methodological research. It is a reference for clinical chemists, ecologists, and biomedical and other scientists who deal with development and validation of measurement methods. This book can also serve as a graduate-level text for students in statistics and biostatistics.

Python for R Users

The definitive guide for statisticians and data scientists who understand the advantages of becoming proficient in both R and Python The first book of its kind, Python for R Users: A Data Science Approach makes it easy for R programmers to code in Python and Python users to program in R. Short on theory and long on actionable analytics, it provides readers with a detailed comparative introduction and overview of both languages and features concise tutorials with command-by-command translations—complete with sample code—of R to Python and Python to R. Following an introduction to both languages, the author cuts to the chase with step-by-step coverage of the full range of pertinent programming features and functions, including data input, data inspection/data quality, data analysis, and data visualization. Statistical modeling, machine learning, and data mining—including supervised and unsupervised data mining methods—are treated in detail, as are time series forecasting, text mining, and natural language processing. • Features a quick-learning format with concise tutorials and actionable analytics • Provides command-by-command translations of R to Python and vice versa • Incorporates Python and R code throughout to make it easier for readers to compare and contrast features in both languages • Offers numerous comparative examples and applications in both programming languages • Designed for use for practitioners and students that know one language and want to learn the other • Supplies slides useful for teaching and learning either software on a companion website Python for R Users: A Data Science Approach is a valuable working resource for computer scientists and data scientists that know R and would like to learn Python or are familiar with Python and want to learn R. It also functions as textbook for students of computer science and statistics. A. Ohri is the founder of Decisionstats.com and currently works as a senior data scientist. He has advised multiple startups in analytics off-shoring, analytics services, and analytics education, as well as using social media to enhance buzz for analytics products. Mr. Ohri's research interests include spreading open source analytics, analyzing social media manipulation with mechanism design, simpler interfaces for cloud computing, investigating climate change and knowledge flows. His other books include R for Business Analytics and R for Cloud Computing.

Security and Privacy in Cyber-Physical Systems

Written by a team of experts at the forefront of the cyber-physical systems (CPS) revolution, this book provides an in-depth look at security and privacy, two of the most critical challenges facing both the CPS research and development community and ICT professionals. It explores, in depth, the key technical, social, and legal issues at stake, and it provides readers with the information they need to advance research and development in this exciting area. Cyber-physical systems (CPS) are engineered systems that are built from, and depend upon the seamless integration of computational algorithms and physical components. Advances in CPS will enable capability, adaptability, scalability, resiliency, safety, security, and usability far in excess of what today’s simple embedded systems can provide. Just as the Internet revolutionized the way we interact with information, CPS technology has already begun to transform the way people interact with engineered systems. In the years ahead, smart CPS will drive innovation and competition across industry sectors, from agriculture, energy, and transportation, to architecture, healthcare, and manufacturing. A priceless source of practical information and inspiration, Security and Privacy in Cyber-Physical Systems: Foundations, Principles and Applications is certain to have a profound impact on ongoing R&D and education at the confluence of security, privacy, and CPS.

ABCs of IBM z/OS System Programming Volume 1

Abstract The ABCs of IBM® z/OS® System Programming is a 13-volume collection that provides an introduction to the z/OS operating system and the hardware architecture. Whether you are a beginner or an experienced system programmer, the ABCs collection provides the information that you need to start your research into z/OS and related subjects. Whether you want to become more familiar with z/OS in your current environment, or you are evaluating platforms to consolidate your online business applications, the ABCs collection will serve as a powerful technical tool. Volume 1 provides an updated understanding of the software and IBM zSeries architecture, and explains how it is used together with the z/OS operating system. This includes the main components of z/OS needed to customize and install the z/OS operating system. This edition has been significantly updated and revised.

Engineering Biostatistics

Provides a one-stop resource for engineers learning biostatistics using MATLAB® and WinBUGS Through its scope and depth of coverage, this book addresses the needs of the vibrant and rapidly growing bio-oriented engineering fields while implementing software packages that are familiar to engineers. The book is heavily oriented to computation and hands-on approaches so readers understand each step of the programming. Another dimension of this book is in parallel coverage of both Bayesian and frequentist approaches to statistical inference. It avoids taking sides on the classical vs. Bayesian paradigms, and many examples in this book are solved using both methods. The results are then compared and commented upon. Readers have the choice of MATLAB® for classical data analysis and WinBUGS/OpenBUGS for Bayesian data analysis. Every chapter starts with a box highlighting what is covered in that chapter and ends with exercises, a list of software scripts, datasets, and references. Engineering Biostatistics: An Introduction using MATLAB® and WinBUGS also includes: parallel coverage of classical and Bayesian approaches, where appropriate substantial coverage of Bayesian approaches to statistical inference material that has been classroom-tested in an introductory statistics course in bioengineering over several years exercises at the end of each chapter and an accompanying website with full solutions and hints to some exercises, as well as additional materials and examples Engineering Biostatistics: An Introduction using MATLAB® and WinBUGS can serve as a textbook for introductory-to-intermediate applied statistics courses, as well as a useful reference for engineers interested in biostatistical approaches.

Pro MySQL NDB Cluster

Create and run a real-time, highly-available, and high-redundancy version of the world's most popular open-source database, MySQL. You will understand the advantages and disadvantages of the MySQL NDB Cluster solution, and when MySQL NDB Cluster is the right choice. Pro MySQL NDB Cluster walks you through the full lifecycle of a MySQL Cluster installation: starting with the installation and initial configuration, moving through online configuration and schema changes, and completing with online upgrades. Along the way, you will learn to monitor your cluster, make decisions about schema design, implement geographic replication, troubleshoot and optimize performance, and much more. This book covers the many programming APIs that are supported by MySQL NDB Cluster. There's also robust coverage of connecting to MySQL NDB Cluster from Java, SQL, memcached, and even from C++. From any of these languages, you'll be able to connect and store and retrieve data as your applications demand. The book: Covers MySQL NDB Cluster concepts and architecture Takes you through the MySQL NDB Cluster lifecycle from installation to upgrades Guides you through DBA and Developer decisions when working with MySQL NDB Cluster What You'll Learn Understand the shared-nothing architecture behind MySQL NDB Cluster Plan, install, and configure a MySQL NDB Cluster environment Perform everyday tasks such as backing up, restoring, and upgrading Develop applications from Java, memcached, C++, and SQL Troubleshoot and resolve application performance problems Master enterprise-level features such the MySQL NDB Cluster Manager Who This Book Is For Database administrators and developers who are looking into deploying MySQL NDB Cluster, or who already have a cluster in production and want to increase their knowledge and ability to handle routine administrative tasks and troubleshooting. The book also is for those developers wanting to employ MySQL NDB Cluster as their chosen storage engine from Java, memcached, and C++ applications.

MariaDB and MySQL Common Table Expressions and Window Functions Revealed

Walk away from old-fashioned and cumbersome query approaches and answer your business intelligence questions through simple and powerful queries built on common table expressions (CTEs) and window functions. These new features in MariaDB and MySQL help you to write queries without having to wade through a quagmire of brittle self-joins and other crazy techniques from the past. Your queries will generate correct results, be more readable and less brittle in the face of unexpected data, and you’ll be able to adapt them quickly in the face of changing business requirements. MariaDB and MySQL Common Table Expressions and Window Functions Revealed introduces and explains CTEs and window functions, newly available in MariaDB 10.2 and MySQL 8.0, and helps you understand why and how every MariaDB and MySQL database programmer should learn and apply these features in their daily work. CTEs and especially window functions enable easy solutions to many query challenges that in prior releases have been difficult and sometimes impossible to surmount. Mastering these features opens the door to query solutions that are more robust, execute faster, and are easier to maintain over time than prior solutions using older techniques. The book: Takes you step-by-step through the workings of common table expressions and window functions Provides easy-to-follow examples of the new syntax Helps you answer business questions faster and easier than ever What You'll Learn Answer business questions using simple queries that don’t break in the face of unexpected data Avoid writing queries that are a difficult-to-maintain quagmire of self-joins and nested subqueries Recognize situations that call for window functions, and learn when to use these features Reduce the need for performance-robbing self-joins Simplify and speed the execution of analytical queries Create queries that finish in seconds instead of hours Who This Book Is For Database administrators and application developers who want to quickly get up to speed on important features in MariaDB and MySQL for writing business intelligence queries. Any developer writing SQL against MariaDB and MySQL databases will benefit tremendously from the knowledge and techniques this book provides.

Oracle Application Express by Design: Managing Cost, Schedule, and Quality

Learn the many design decisions that must be made before starting to build a large Oracle Application Express (APEX) application for the cloud or enterprise. One of APEX's key strengths is the fact that it is a Rapid Application Development (RAD) tool. This is also a major weakness when it tempts developers to start coding too soon. Small applications that consist of tens of pages can be coded without a lot of design work because they can be re-factored quickly when design flaws are discovered. Design flaws in large cloud and enterprise applications that consist of hundreds or thousands of pages are not so easy to re-factor due to the time needed to redevelop and retest the application, not to mention the risk of breaking functionality in subtle ways. Designing a large application before coding starts is a profitable exercise because a thoughtful design goes a long way in mitigating cost overruns and schedule slippage while simultaneously enhancing quality. This book takes into account perspectives from other non-developer stakeholders such as maintenance developers, business analysts, testers, technical writers, end users, and business owners. Overlooking these perspectives is one of the chief causes of expensive rework late in the development cycle. Oracle Application Express by Design illustrates APEX design principles by using architecture diagrams, screen shots, and explicit code snippets to guide developers through the many design choices and complex interrelationship issues that must be evaluated before embarking on large APEX projects. This book: Guides you through important, up-front APEX design decisions Helps you to optimize your design by keeping all stakeholders in mind Explicit code examples show how design impacts cost, schedule, and quality What You Will Learn Pick and choose from the list of designs before coding begins Bake optimal quality into the underlying fabric of an APEX application Think and design from outside the developer’s narrow perspective Optimize APEX application designs to satisfy multiple stakeholder groups Evaluate design options through hands-on, explicit code examples Define and measure success for large cloud and enterprise APEX applications Who This Book Is For APEX developers and development teams

RabbitMQ in Depth

RabbitMQ in Depth is a practical guide to building and maintaining message-based applications. This book provides detailed coverage of RabbitMQ with an emphasis on why it works the way it does. You'll find examples and detailed explanations based in real-world systems ranging from simple networked services to complex distributed designs. You'll also find the insights you need to make core architectural choices and develop procedures for effective operational management. About the Technology At the heart of most modern distributed applications is a queue that buffers, prioritizes, and routes message traffic. RabbitMQ is a high-performance message broker based on the Advanced Message Queueing Protocol. It's battle tested, ultrafast, and powerful enough to handle anything you can throw at it. It requires a few simple setup steps, and you can instantly start using it to manage low-level service communication, application integration, and distributed system message routing. About the Book What's Inside AMQP, the Advanced Message Queueing Protocol Communicating via MQTT, Stomp, and HTTP Valuable troubleshooting techniques Database integration About the Reader Written for programmers with a basic understanding of messaging-oriented systems. About the Author Gavin M. Roy is an active, open source evangelist and advocate who has been working with internet and enterprise technologies since the mid-90s. Technical editor James Titcumb is a freelance developer, trainer, speaker, and active contributor to open source projects. Quotes An excellent resource for beginners and experts alike...shows how to integrate RabbitMQ into a successful enterprise application. - Ian Dallas, Hewlett-Packard The most comprehensive source for everything RabbitMQ. From terms to code to patterns, it's all here! - Andrew Meredith, Quantum Metric A cheat sheet for getting started and troubleshooting the migration process to RabbitMQ. - Nadia Noori, La Salle University Barcelona Filled with pragmatic advice and pearls of wisdom. - Milos Milivojevic, Mozzart Bet

The Biml Book: Business Intelligence and Data Warehouse Automation

Learn Business Intelligence Markup Language (Biml) for automating much of the repetitive, manual labor involved in data integration. We teach you how to build frameworks and use advanced Biml features to get more out of SQL Server Integration Services (SSIS), Transact-SQL (T-SQL), and SQL Server Analysis Services (SSAS) than you ever thought possible. The first part of the book starts with the basics—getting your development environment configured, Biml syntax, and scripting essentials. Whether a beginner or a seasoned Biml expert, the next part of the book guides you through the process of using Biml to build a framework that captures both your design patterns and execution management. Design patterns are reusable code blocks that standardize the approach you use to perform certain types of data integration, logging, and other key data functions. Design patterns solve common problems encountered when developing data integration solutions. Because you do not have to build the code from scratch each time, design patterns improve your efficiency as a Biml developer. In addition to leveraging design patterns in your framework, you will learn how to build a robust metadata store and how to package your framework into Biml bundles for deployment within your enterprise. In the last part of the book, we teach you more advanced Biml features and capabilities, such as SSAS development, T-SQL recipes, documentation autogeneration, and Biml troubleshooting. The Biml Book: Provides practical and applicable examples Teaches you how to use Biml to reduce development time while improving quality Takes you through solutions to common data integration and BI challenges What You'll Learn Master the basics of Business Intelligence Markup Language (Biml) Study patterns for automating SSIS package generation Build a Biml Framework Import and transform database schemas Automate generation of scripts and projects Who This Book Is For BI developers wishing to quickly locate previously tested solutions, Microsoft BI specialists, those seeking more information about solution automation and code generation, and practitioners of Data Integration Lifecycle Management (DILM) in the DevOps enterprise

Mastering ArcGIS Enterprise Administration

Mastering ArcGIS Enterprise Administration teaches you how to install, configure, and manage ArcGIS Enterprise, guiding you through publishing, optimizing, and securing GIS services for your organizational needs. With this book, you will build a robust GIS infrastructure and gain solutions to common administration challenges. What this Book will help me do Install and configure ArcGIS Enterprise, including its enterprise geodatabase and web services components. Utilize administrative tools like user interfaces, REST API, and Python for system management. Securely publish and manage GIS services, optimizing performance for end users. Apply best practices for securing GIS resources and enabling efficient access. Troubleshoot issues and implement automation to maintain system reliability. Author(s) None Cooper is a seasoned GIS professional with extensive experience in deploying and managing ArcGIS systems for various organizational workflows. With a meticulous approach, they have assisted numerous teams in leveraging GIS technologies to accomplish their objectives. None's commitment to teaching is reflected in this comprehensive guide, which simplifies complex topics to foster effective learning. Who is it for? This book is perfect for GIS analysts, managers, and administrators who aim to learn ArcGIS Enterprise installation and administration. If you already work with ArcGIS or similar GIS platforms, this book can enhance your understanding. It's suitable for those with a basic understanding of geospatial concepts looking to specialize in Enterprise administration. Whether you work as a GIS engineer or database administrator, this guide helps you establish a secure and functional ArcGIS infrastructure.

PHP & MySQL: Novice to Ninja, 6th Edition

PHP & MySQL: Novice to Ninja, 6th Edition is a hands-on guide to learning all the tools, principles, and techniques needed to build a fully functional application using PHP & MySQL. Comprehensively updated to cover PHP 7 and modern best practice, this practical and fun book covers everything from installing PHP and MySQL through to creating a complete online content management system. You'll learn how to: Install PHP & MySQL on Windows, Mac OS X, or Linux Gain a thorough understanding of PHP syntax Use object oriented programming techniques Master database design principles and SQL Develop robust websites that can handle high levels of traffic Build a working content management system (CMS) And much more!