talk-data.com

Topic: data (5765 tagged)

Activities

5765 activities · Newest first

Preparing Data for Analysis with JMP

Access and clean up data easily using JMP®! Data acquisition and preparation commonly consume approximately 75% of the effort and time of total data analysis. JMP provides many visual, intuitive, and even innovative data-preparation capabilities that enable you to make the most of your organization's data. Preparing Data for Analysis with JMP® is organized within a framework of statistical investigations and model-building and illustrates the new data-handling features in JMP, such as the Query Builder. Useful to students and programmers with little or no JMP experience, or those looking to learn the new data-management features and techniques, it uses a practical approach to getting started with plenty of examples. Using step-by-step demonstrations and screenshots, this book walks you through the most commonly used data-management techniques and includes plenty of tips on how to avoid common problems.
With this book, you will learn how to:
• Manage database operations using the JMP Query Builder
• Get data into JMP from other formats, such as Excel, CSV, SAS, HTML, JSON, and the web
• Identify and avoid problems with the help of JMP’s visual and automated data-exploration tools
• Consolidate data from multiple sources with Query Builder for tables
• Deal with common issues and repairs, including: reshaping tables (stack/unstack); managing missing data with techniques such as imputation and Principal Components Analysis; cleaning and correcting dirty data; computing new variables; transforming variables for modelling; reconciling time and date
• Subset and filter your data
• Save data tables for exchange with other platforms

SPSS Statistics for Data Analysis and Visualization

Dive deeper into SPSS Statistics for more efficient, accurate, and sophisticated data analysis and visualization
SPSS Statistics for Data Analysis and Visualization goes beyond the basics of SPSS Statistics to show you advanced techniques that exploit the full capabilities of SPSS. The authors explain when and why to use each technique, and then walk you through the execution with a pragmatic, nuts and bolts example. Coverage includes extensive, in-depth discussion of advanced statistical techniques, data visualization, predictive analytics, and SPSS programming, including automation and integration with other languages like R and Python. You'll learn the best methods to power through an analysis, with more efficient, elegant, and accurate code. IBM SPSS Statistics is complex: true mastery requires a deep understanding of statistical theory, the user interface, and programming. Most users don't encounter all of the methods SPSS offers, leaving many little-known modules undiscovered. This book walks you through tools you may have never noticed, and shows you how they can be used to streamline your workflow and enable you to produce more accurate results.
• Conduct a more efficient and accurate analysis
• Display complex relationships and create better visualizations
• Model complex interactions and master predictive analytics
• Integrate R and Python with SPSS Statistics for more efficient, more powerful code
These "hidden tools" can help you produce charts that simply wouldn't be possible any other way, and the support for other programming languages gives you better options for solving complex problems. If you're ready to take advantage of everything this powerful software package has to offer, SPSS Statistics for Data Analysis and Visualization is the expert-led training you need.

D3.js 4.x Data Visualization - Third Edition

Learn to build beautiful and interactive data visualizations with D3.js 4.x in this approachable and practical guide. From setting up the basics to mastering advanced techniques, this book empowers you to create engaging, sophisticated graphics and charts to communicate information effectively.
What this Book will help me do
• Effectively map data sets to visual representations using D3.js scales.
• Generate dynamic and interactive SVG elements with D3's shape generators.
• Develop reusable D3 components for extensible and testing-friendly charts.
• Leverage advanced D3 layout patterns for large-scale data visualization.
• Write modern, efficient JavaScript for web-based data visualization systems.
Author(s)
Aendrew Rininsland, an experienced data visualization developer, and his co-author bring a wealth of expertise to this book, drawing from years of implementing interactive solutions in professional contexts. Their engaging way of explaining concepts and demonstrating techniques with practical examples makes learning straightforward and impactful.
Who is it for?
This book is ideal for web developers, data analysts, and designers who are interested in enhancing their web-based data visualization skills with D3.js. Some familiarity with JavaScript is beneficial but not strictly required, as the book provides clear guidance to bring readers up to speed. If you aspire to effectively present data through captivating visuals and improve your technical proficiency, this book is a great fit for you.

Machine Learning with Spark - Second Edition

Dive into the world of distributed machine learning with Apache Spark, a powerful framework for handling, processing, and analyzing big data. This book will take you through implementing popular machine learning algorithms using Spark ML, covering end-to-end workflows such as data preparation, model building, predictive analysis, and text processing.
What this Book will help me do
• Learn to implement scalable machine learning solutions using Spark ML.
• Develop the skills to set up and configure Apache Spark environments.
• Master the application of machine learning techniques like clustering, classification, and regression with Spark.
• Efficiently handle and process large-scale datasets using Spark tools.
• Put Spark's capabilities to work in building real-world distributed data processing solutions.
Author(s)
Dua and Ghotra bring a wealth of experience in big data and machine learning to this book. They have been involved in building scalable data systems and implementing machine learning solutions in various industry scenarios. Their approach is hands-on and focused on teaching practical, actionable knowledge.
Who is it for?
This book is perfect for data enthusiasts, data engineers, and machine learning practitioners who are familiar with Python and Scala and eager to apply machine learning concepts in distributed environments. It's aimed at professionals looking to develop their skills in building scalable data systems and implementing advanced machine learning workflows in Spark.

Research Methods in Human-Computer Interaction, 2nd Edition

Research Methods in Human-Computer Interaction is a comprehensive guide to performing research and is essential reading for both quantitative and qualitative methods. Since the first edition was published in 2009, the book has been adopted for use at leading universities around the world, including Harvard University, Carnegie Mellon University, the University of Washington, the University of Toronto, HiOA (Norway), KTH (Sweden), Tel Aviv University (Israel), and many others. Chapters cover a broad range of topics relevant to the collection and analysis of HCI data, going beyond experimental design and surveys, to cover ethnography, diaries, physiological measurements, case studies, crowdsourcing, and other essential elements in the well-informed HCI researcher's toolkit. Continual technological evolution has led to an explosion of new techniques and a need for this updated 2nd edition, to reflect the most recent research in the field and newer trends in research methodology. This Research Methods in HCI revision contains updates throughout, including more detail on statistical tests, coding qualitative data, and data collection via mobile devices and sensors. Other new material covers performing research with children, older adults, and people with cognitive impairments.
• Comprehensive and updated guide to the latest research methodologies and approaches, and now available in EPUB3 format (choose any of the ePub or Mobi formats after purchase of the eBook)
• Expanded discussions of online datasets, crowdsourcing, statistical tests, coding qualitative data, laws and regulations relating to the use of human participants, and data collection via mobile devices and sensors
• New material on performing research with children, older adults, and people with cognitive impairments, two new case studies from Google and Yahoo!, and techniques for expanding the influence of your research to reach non-researcher audiences, including software developers and policymakers

PostgreSQL Administration Cookbook, 9.5/9.6 Edition - Third Edition

Dive into the world of PostgreSQL database management with this hands-on guide. This book takes you through essential administration tasks and advanced features of PostgreSQL 9.5 and 9.6, equipping you with the tools to efficiently manage and optimize your databases.
What this Book will help me do
• Set up and configure PostgreSQL servers for optimal performance and reliability.
• Implement robust backup and disaster recovery strategies tailored to your needs.
• Master replication techniques including high availability and logical replication.
• Analyze and troubleshoot performance issues with advanced diagnostics tools.
• Secure and protect your databases using best practices and security features.
Author(s)
Simon Riggs, Gianni Ciolli, and Bartolini are leading figures in the PostgreSQL community. With extensive experience in database architecture and system administration, they have guided numerous professionals in mastering PostgreSQL. Their practical insights and clear instructions make this book an invaluable resource.
Who is it for?
This book is ideal for system administrators, database administrators, and developers who are responsible for database management. Whether you're aspiring to deepen your expertise in PostgreSQL or are already working with databases and seeking advanced knowledge, this guide caters to intermediate to advanced skill levels.

Good Charts for Persuasive Presentations

The right visual revealed at the right time can turn an unremarkable presentation into a resonant, emotional experience. This two-book collection provides you with the tools you need to craft and deliver presentations that will impress your audience, increase your influence in your organization, and advance your career. Good Charts: The HBR Guide to Making Smarter, More Persuasive Data Visualizations shows how a good visualization can communicate the nature and potential impact of information and ideas more powerfully than any other form of communication. For a long time, "dataviz" was left to specialists: data scientists and professional designers. No longer. A new generation of tools and massive amounts of available data make it easy for anyone to create visualizations that communicate ideas far more effectively than generic spreadsheet charts ever could. What's more, building good charts is quickly becoming a need-to-have skill for managers. If you're not doing it, other managers are, and they're getting noticed for it and getting credit for contributing to your company's success. In Good Charts, dataviz maven Scott Berinato provides an essential guide to how visualization works and how to use this new language to impress and persuade. Dataviz today is where spreadsheets and word processors were in the early 1980s, on the cusp of changing how we work. Berinato lays out a system for thinking visually and building better charts through a process of talking, sketching, and prototyping. This book is much more than a set of static rules for making visualizations. It taps into both well-established and cutting-edge research in visual perception and neuroscience, as well as the emerging field of visualization science, to explore why good charts (and bad ones) create "feelings behind our eyes." Along the way, Berinato also includes many engaging vignettes of dataviz pros, illustrating the ideas in practice.
Good Charts will help you turn plain, uninspiring charts that merely present information into smart, effective visualizations that powerfully convey ideas.
HBR Guide to Persuasive Presentations will teach you how to take the pain out of presentations. Terrified of speaking in front of a group? Or simply looking to polish your skills? No matter where you are on the spectrum, this guide will give you the confidence and the tools you need to get results. Written by presentation expert Nancy Duarte, the HBR Guide to Persuasive Presentations will help you: (1) Win over tough crowds, (2) Organize a coherent narrative, (3) Create powerful messages and visuals, (4) Connect with and engage your audience, (5) Show people why your ideas matter to them, and (6) Strike the right tone, in any situation.

Learning Apache Cassandra - Second Edition

Learning Apache Cassandra is an engaging and in-depth guide to understanding the concepts and practical applications of Apache Cassandra, one of the most robust distributed NoSQL databases available. By the end of this book, you will have the necessary skills to design and manage scalable, high-performance database solutions tailored for modern applications.
What this Book will help me do
• Set up Apache Cassandra and its multi-node clusters confidently and efficiently.
• Master schema design principles, including the use of composite keys, collections, and user-defined types.
• Implement efficient query strategies with secondary indexes and materialized views.
• Understand data distribution strategies and tune consistency levels for different application requirements.
• Dive into advanced topics like user-defined functions, batch operations, and Java client optimizations for scalable database architecture.
Author(s)
Yarabarla brings practical expertise and deep knowledge to the subject of Apache Cassandra. With hands-on industry experience designing scalable database solutions, the author ensures complex topics are presented through clear and actionable insights. This is coupled with real-world scenarios to help you apply your learning effectively.
Who is it for?
This book is ideal for developers and IT professionals interested in learning Apache Cassandra from scratch or enhancing their NoSQL database expertise. It is particularly suited for those transitioning from relational databases to NoSQL systems. Even without prior coding experience, readers can expect to follow along and achieve practical results.

Budgeting, Forecasting and Planning In Uncertain Times

Budgeting, planning and forecasting are critical management tasks that not only impact the future success of an organization, but can threaten its very survival if done badly. Yet in spite of their importance, the speed and complexity of today’s business environment have caused a rapid decrease in the planning time horizon. As a consequence, the traditional planning processes have become unsuitable for most organizations’ needs. In this book readers will find new, original insights, including:
• 7 planning models that every organization needs to plan and manage performance
• 6 ways in which performance can be viewed
• A planning framework based on best management practices that can cope with an unpredictable business environment
• The application of technology to planning and latest developments in systems
• Results of the survey conducted for the book on the state of planning in organizations

Mastering Machine Learning with R - Second Edition

Dive into the world of advanced machine learning techniques with "Mastering Machine Learning with R, Second Edition." This comprehensive guide equips you with the skills to implement sophisticated algorithms and create powerful prediction models using R 3.x. You will explore topics such as supervised and unsupervised learning, decision trees, ensemble methods, and deep learning.
What this Book will help me do
• Implement machine learning workflows using a variety of R packages like XGBoost.
• Effectively use linear and logistic regression for statistical analysis and pattern recognition.
• Develop skills in advanced methods such as support vector machines and neural networks.
• Learn actionable techniques to create recommendation engines and perform text mining.
• Gain hands-on experience running R-based machine learning analyses on cloud platforms.
Author(s)
Lesmeister, a seasoned data scientist, combines extensive hands-on experience and a passion for teaching to deliver technical concepts in a practical, engaging manner. With a strong background in statistical analysis and machine learning, they are dedicated to providing readers with actionable knowledge and step-by-step guidance.
Who is it for?
This book is ideal for data scientists, analysts, and machine learning practitioners aiming to deepen their expertise in R. Readers should have a fundamental understanding of machine learning concepts and a basic knowledge of R programming. If you're looking to master advanced learning methods and apply them effectively, this book is tailored for you.

Oracle Database 12c Release 2 New Features

Leverage the New and Improved Features of Oracle Database 12c
Written by Oracle experts Bob Bryla and Robert G. Freeman, this Oracle Press guide describes the myriad new and enhanced capabilities available in the latest Oracle Database release. Inside, you’ll find everything you need to know to get up and running quickly on Oracle Database 12c Release 2. Supported by contributions from Oracle expert Eric Yen, Oracle Database 12c Release 2 New Features offers detailed coverage of:
• Installing Oracle Database 12c and Grid Infrastructure
• Architectural changes, such as Oracle Multitenant
• The most current information on upgrading and migrating to Oracle Database 12c
• The pre-upgrade information tool and parallel processing for database upgrades
• Oracle Real Application Clusters new features, such as Oracle Flex Cluster, Oracle Flex Automatic Storage Management, and Oracle Automatic Storage Management Cluster File System
• Enhanced and new online operations: tables, indexes, and PDBs
• Oracle RMAN enhancements, including cross-platform backup and recovery
• Oracle Data Guard improvements, such as Fast Sync, and Oracle Active Data Guard new features, such as Far Sync
• SQL, PL/SQL, DML, and DDL new features
• Improvements to partitioning manageability, performance, and availability
• Advanced business intelligence and data warehousing capabilities
• Security enhancements, including privileges analysis, data redaction, and new administrative-level privileges
• Manageability, performance, and optimization improvements

The Big Book of Dashboards

The definitive reference book with real-world solutions you won't find anywhere else
The Big Book of Dashboards presents a comprehensive reference for those tasked with building or overseeing the development of business dashboards. Comprising dozens of examples that address different industries and departments (healthcare, transportation, finance, human resources, marketing, customer service, sports, etc.) and different platforms (print, desktop, tablet, smartphone, and conference room display), The Big Book of Dashboards is the only book that matches great dashboards with real-world business scenarios. By organizing the book based on these scenarios and offering practical and effective visualization examples, The Big Book of Dashboards will be the trusted resource that you open when you need to build an effective business dashboard. In addition to the scenarios, there's an entire section of the book devoted to addressing many practical and psychological factors you will encounter in your work. It's great to have theory and evidence-based research at your disposal, but what will you do when somebody asks you to make your dashboard 'cooler' by adding packed bubbles and donut charts? The expert authors have a combined 30-plus years of hands-on experience helping people in hundreds of organizations build effective visualizations. They have fought many 'best practices' battles and, having endured, bring an uncommon empathy to help you, the reader of this book, survive and thrive in the data visualization world. A well-designed dashboard can point out risks, opportunities, and more; but common challenges and misconceptions can make your dashboard useless at best, and misleading at worst. The Big Book of Dashboards gives you the tools, guidance, and models you need to produce great dashboards that inform, enlighten, and engage.

Oracle Business Intelligence Enterprise Edition 12c - Second Edition

This book, "Oracle Business Intelligence Enterprise Edition 12c," is a comprehensive guide to understanding and implementing Oracle's powerful business intelligence tools. Throughout its chapters, readers will learn how to install, configure, and utilize OBIEE 12c to transform organizational data into actionable insights through reports, dashboards, and advanced analytics.
What this Book will help me do
• Install and configure Oracle BI 12c, preparing the essential software and database infrastructure.
• Create and manage the repository connecting your data to the OBIEE front end for detailed analysis.
• Develop interactive dashboards and analyses that derive actionable meaning from data.
• Design pixel-perfect reports using BI Publisher to cater to organizational reporting needs.
• Set up agents and workflows to provide actionable insights and improve decision-making processes.
Author(s)
The authors, Ward and Khan, are seasoned experts in Oracle Business Intelligence with years of experience implementing and managing OBIEE systems across various industries. Their approach combines succinct practical tutorials with insights gained from real-world projects, making technical concepts accessible and actionable.
Who is it for?
This book is perfect for IT consultants, data scientists, project managers, and business analysts involved in data-driven decision-making. Beginners to OBIEE who have a basic understanding of databases will find this guide invaluable for starting their BI journey. Professionals already familiar with other BI tools can also benefit from a detailed exploration of OBIEE's features.

Analyzing Data with Power BI and Power Pivot for Excel, First Edition

Renowned DAX experts Alberto Ferrari and Marco Russo teach you how to design data models for maximum efficiency and effectiveness. How can you use Excel and Power BI to gain real insights into your information? As you examine your data, how do you write a formula that provides the numbers you need? The answers to both of these questions lie with the data model. This book introduces the basic techniques for shaping data models in Excel and Power BI. It’s meant for readers who are new to data modeling as well as for experienced data modelers looking for tips from the experts. If you want to use Power BI or Excel to analyze data, the many real-world examples in this book will help you look at your reports in a different way, as experienced data modelers do. As you’ll soon see, with the right data model, the correct answer is always a simple one!
By reading this book, you will:
• Gain an understanding of the basics of data modeling, including tables, relationships, and keys
• Familiarize yourself with star schemas, snowflakes, and common modeling techniques
• Learn the importance of granularity
• Discover how to use multiple fact tables, like sales and purchases, in a complex data model
• Manage calendar-related calculations by using date tables
• Track historical attributes, like previous addresses of customers or manager assignments
• Use snapshots to compute quantity on hand
• Work with multiple currencies in the most efficient way
• Analyze events that have durations, including overlapping durations
• Learn what data model you need to answer your specific business questions
About This Book
• For Excel and Power BI users who want to exploit the full power of their favorite tools
• For BI professionals seeking new ideas for modeling data

IBM GDPS Family: An Introduction to Concepts and Capabilities

Abstract This IBM® Redbooks® publication presents an overview of the IBM Geographically Dispersed Parallel Sysplex™ (IBM GDPS®) offerings and the roles they play in delivering a business IT resilience solution. The book begins with general concepts of business IT resilience and disaster recovery, along with issues related to high application availability, data integrity, and performance. These topics are considered within the framework of government regulation, increasing application and infrastructure complexity, and the competitive and rapidly changing modern business environment. Next, it describes the GDPS family of offerings with specific reference to how they can help you achieve your defined goals for disaster recovery and high availability. Also covered are the features that simplify and enhance data replication activities, the prerequisites for implementing each offering, and tips for planning for the future and immediate business requirements. Tables provide easy-to-use summaries and comparisons of the offerings, and the additional planning and implementation services available from IBM are explained. Then, several practical client scenarios and requirements are described, along with the most suitable GDPS solution for each case. The introductory chapters of this publication are intended for a broad technical audience, including IT System Architects, Availability Managers, Technical IT Managers, Operations Managers, System Programmers, and Disaster Recovery Planners. The subsequent chapters provide more technical details about the GDPS offerings, and each can be read independently for those readers who are interested in specific topics. Therefore, if you do read all the chapters, be aware that some information is intentionally repeated.

Theory of Probability

First issued in translation as a two-volume work in 1975, this classic book provides the first complete development of the theory of probability from a subjectivist viewpoint. It proceeds from a detailed discussion of the philosophical and mathematical aspects to a detailed mathematical treatment of probability and statistics. De Finetti’s theory of probability is one of the foundations of Bayesian theory. De Finetti stated that probability is nothing but a subjective analysis of the likelihood that something will happen and that this probability does not exist outside the mind. It is the rate at which a person is willing to bet on something happening. This view is directly opposed to the classicist/frequentist view of the likelihood of a particular outcome of an event, which assumes that the same event could be identically repeated many times over, and the 'probability' of a particular outcome has to do with the fraction of the time that outcome results from the repeated trials.
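
The contrast between the two views can be illustrated with a small sketch. It is not taken from the book; the function names `betting_rate` and `relative_frequency`, the stake and payout figures, and the simulated event are all assumptions made for illustration. The first function reads a probability off the price a person would pay for a bet, and the second estimates one as the fraction of repeated trials in which the outcome occurs.

```python
import random


def betting_rate(stake: float, payout: float) -> float:
    """Subjective reading (illustrative): the probability implied by being
    willing to pay `stake` for a ticket that pays `payout` if the event occurs."""
    if payout <= 0 or not 0 <= stake <= payout:
        raise ValueError("require 0 <= stake <= payout and payout > 0")
    return stake / payout


def relative_frequency(p_true: float, trials: int, seed: int = 0) -> float:
    """Frequentist reading (illustrative): the fraction of simulated,
    identically repeated trials in which the outcome occurs.
    `p_true` is a hypothetical underlying chance used only to drive the simulation."""
    rng = random.Random(seed)
    hits = sum(1 for _ in range(trials) if rng.random() < p_true)
    return hits / trials


if __name__ == "__main__":
    # A person who would pay 25 for a ticket worth 100 if it rains
    # is expressing a personal probability of rain.
    print("betting rate:", betting_rate(25, 100))
    # The frequentist estimate needs a repeatable experiment to count over.
    print("relative frequency:", relative_frequency(0.25, 100_000))
```

The betting rate needs only one person's willingness to bet on a single event, whereas the relative frequency presupposes an experiment that can be repeated under identical conditions.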

MQTT Essentials - A Lightweight IoT Protocol

Dive into the world of MQTT, the preferred protocol for IoT and M2M communication. This book provides a comprehensive guide to understanding, implementing, and securing MQTT-based systems, enabling readers to create efficient and lightweight communication networks for their connected devices.
What this Book will help me do
• Understand the underlying principles and protocol structure of MQTT.
• Securely configure and deploy an MQTT broker for communication.
• Develop Python, Java, and JavaScript-based MQTT client applications.
• Utilize MQTT for real-world IoT use cases such as sensor data interchange.
• Optimize MQTT usage for low-latency and lightweight communication scenarios.
Author(s)
Gastón C. Hillar is an experienced IoT developer and author with a deep understanding of IoT protocols and technologies. With years of practical experience in designing and deploying secure IoT systems, Gastón specializes in breaking down complex topics into digestible and actionable insights. Through his books, he aims to empower developers to effectively integrate IoT technologies into their work.
Who is it for?
The book is tailored for software developers and engineers who are looking to integrate MQTT into their IoT solutions. It's ideal for individuals with pre-existing knowledge in IoT concepts who want to deepen their understanding of MQTT. Readers seeking to secure, optimize, and utilize MQTT for communication and automation tasks will find it especially useful. It's a perfect fit for those working with Python, Java, and web technologies in IoT contexts.

Tabular Modeling in Microsoft SQL Server Analysis Services, Second Edition

Build agile and responsive business intelligence solutions
Create a semantic model and analyze data using the tabular model in SQL Server 2016 Analysis Services to create corporate-level business intelligence (BI) solutions. Led by two BI experts, you will learn how to build, deploy, and query a tabular model by following detailed examples and best practices. This hands-on book shows you how to use the tabular model’s in-memory database to perform rapid analytics, whether you are new to Analysis Services or already familiar with its multidimensional model.
Discover how to:
• Determine when a tabular or multidimensional model is right for your project
• Build a tabular model using SQL Server Data Tools in Microsoft Visual Studio 2015
• Integrate data from multiple sources into a single, coherent view of company information
• Choose a data-modeling technique that meets your organization’s performance and usability requirements
• Implement security by establishing administrative and data user roles
• Define and implement partitioning strategies to reduce processing time
• Use Tabular Model Scripting Language (TMSL) to execute and automate administrative tasks
• Optimize your data model to reduce the memory footprint for VertiPaq
• Choose between in-memory (VertiPaq) and pass-through (DirectQuery) engines for tabular models
• Select the proper hardware and virtualization configurations
• Deploy and manipulate tabular models from C# and PowerShell using AMO and TOM libraries
Get code samples, including complete apps, at: https://aka.ms/tabular/downloads
About This Book
• For BI professionals who are new to SQL Server 2016 Analysis Services or already familiar with previous versions of the product, and who want the best reference for creating and maintaining tabular models.
• Assumes basic familiarity with database design and business analytics concepts.

Implementing the IBM Storwize V7000 and IBM Spectrum Virtualize V7.8

Abstract Continuing its commitment to developing and delivering industry-leading storage technologies, IBM® introduces the IBM Storwize® V7000 solution powered by IBM Spectrum Virtualize™, which is an innovative storage offering that delivers essential storage efficiency technologies and exceptional ease of use and performance, all integrated into a compact, modular design that is offered at a competitive, midrange price. The IBM Storwize V7000 solution incorporates some of the top IBM technologies that are typically found only in enterprise-class storage systems, raising the standard for storage efficiency in midrange disk systems. This cutting-edge storage system extends the comprehensive storage portfolio from IBM and can help change the way organizations address the ongoing information explosion. This IBM Redbooks® publication introduces the features and functions of the IBM Storwize V7000 and IBM Spectrum Virtualize V7.8 system through several examples. This book is aimed at pre-sales and post-sales technical support and marketing and storage administrators. It helps you understand the architecture of the Storwize V7000, how to implement it, and how to take advantage of its industry-leading functions and features.