talk-data.com talk-data.com

Topic

data

5765

tagged

Activity Trend

3 peak/qtr
2020-Q1 2026-Q1

Activities

5765 activities · Newest first

Practical Time Series Analysis

Time series data analysis is increasingly important due to the massive production of such data through the internet of things, the digitalization of healthcare, and the rise of smart cities. As continuous monitoring and data collection become more common, the need for competent time series analysis with both statistical and machine learning techniques will increase. Covering innovations in time series data analysis and use cases from the real world, this practical guide will help you solve the most common data engineering and analysis challengesin time series, using both traditional statistical and modern machine learning techniques. Author Aileen Nielsen offers an accessible, well-rounded introduction to time series in both R and Python that will have data scientists, software engineers, and researchers up and running quickly. You’ll get the guidance you need to confidently: Find and wrangle time series data Undertake exploratory time series data analysis Store temporal data Simulate time series data Generate and select features for a time series Measure error Forecast and classify time series with machine or deep learning Evaluate accuracy and performance

EU GDPR & EU-U.S. Privacy Shield: A pocket guide, second edition

This concise guide is essential reading for US organizations wanting an easy to follow overview of the GDPR and the compliance obligations for handling data of EU citizens, including guidance on the EU-U.S. Privacy Shield.

IBM Spectrum Scale Erasure Code Edition: Planning and Implementation Guide

This IBM® Redpaper introduces the IBM Spectrum® Scale Erasure Code Edition (ECE) as a scalable, high-performance data and file management solution. ECE is designed to run on any commodity server that meets the ECE minimum hardware requirements. ECE provides all the functionality, reliability, scalability, and performance of IBM Spectrum Scale with the added benefit of network-dispersed IBM Spectrum Scale RAID, which provides data protection, storage efficiency, and the ability to manage storage in hyperscale environments that are composed from commodity hardware. In this publication, we explain the benefits of ECE and the use cases where we believe it fits best. We also provide a technical introduction to IBM Spectrum Scale RAID. Next, we explain the key aspects of planning an installation, provide an example of an installation scenario, and describe the key aspects of day-to-day management and a process for problem determination. We conclude with an overview of possible enhancements that are being considered for future versions of IBM Spectrum Scale Erasure Code Edition. Overall knowledge of IBM Spectrum Scale Erasure Code Edition is critical to planning a successful storage system deployment. This paper is targeted toward technical professionals (consultants, technical support staff, IT Architects, and IT Specialists) who are responsible for delivering cost effective storage solutions. The goal of this paper is to describe the benefits of using IBM Spectrum Scale Erasure Code Edition for the creation of high performing storage systems.

R Bioinformatics Cookbook

In the "R Bioinformatics Cookbook", you will explore the full potential of the R programming language and the Bioconductor ecosystem to overcome challenges in bioinformatics. By working through real-world examples, you will learn to handle biological data effectively and gain insights into genomics, RNA sequencing, and advanced data visualization. What this Book will help me do Develop skills to analyze RNA sequencing data using R and Bioconductor packages such as edgeR and DESeq. Learn to create professional-grade graphical representations of biological data using ggplot and other visualization tools. Understand how to perform genome-wide studies like variant calling and metagenomics analysis with R. Master the integration of external genomic databases with Ensembl for functional annotation. Explore machine learning applications in bioinformatics including classification and clustering models. Author(s) None MacLean and Dr. Dan Maclean are experienced bioinformatics researchers and R programmers. With a deep understanding of computational biology and visualization techniques, they bring years of academic and practical expertise to help readers excel in bioinformatics. Their approachable writing style ensures that complex topics are made accessible. Who is it for? This book is ideal for bioinformatics professionals and data analysts with an interest in applying R to biological data. It is particularly suited for those with a basic knowledge of R and bioinformatics looking to enhance their analysis skills. Researchers seeking to integrate genomics and computational methods into their workflows will find this book valuable. It's perfect for anyone aiming to tackle intermediate to advanced topics in biological data analysis.

The Little SAS Book, 6th Edition

A classic that just keeps getting better, The Little SAS Book is essential for anyone learning SAS programming. Lora Delwiche and Susan Slaughter offer a user-friendly approach so that readers can quickly and easily learn the most commonly used features of the SAS language. Each topic is presented in a self-contained, two-page layout complete with examples and graphics. Nearly every section has been revised to ensure that the sixth edition is fully up-to-date. This edition is also interface-independent, written for all SAS programmers whether they use SAS Studio, SAS Enterprise Guide, or the SAS windowing environment. New sections have been added covering PROC SQL, iterative DO loops, DO WHILE and DO UNTIL statements, %DO statements, using variable names with special characters, the ODS EXCEL destination, and the XLSX LIBNAME engine. This title belongs on every SAS programmer's bookshelf. It's a resource not just to get you started, but one you will return to as you continue to improve your programming skills. Learn more about the updates to The Little SAS Book, Sixth Edition here. Reviews for The Little SAS Book, Sixth Edition can be read here.

Mastering Spark with R

If you’re like most R users, you have deep knowledge and love for statistics. But as your organization continues to collect huge amounts of data, adding tools such as Apache Spark makes a lot of sense. With this practical book, data scientists and professionals working with large-scale data applications will learn how to use Spark from R to tackle big data and big compute problems. Authors Javier Luraschi, Kevin Kuo, and Edgar Ruiz show you how to use R with Spark to solve different data analysis problems. This book covers relevant data science topics, cluster computing, and issues that should interest even the most advanced users. Analyze, explore, transform, and visualize data in Apache Spark with R Create statistical models to extract information and predict outcomes; automate the process in production-ready workflows Perform analysis and modeling across many machines using distributed computing techniques Use large-scale data from multiple sources and different formats with ease from within Spark Learn about alternative modeling frameworks for graph processing, geospatial analysis, and genomics at scale Dive into advanced topics including custom transformations, real-time data processing, and creating custom Spark extensions

IBM z15 Technical Introduction

This IBM® Redbooks® publication introduces the latest member of the IBM Z® platform, the IBM z15™ (machine type 8561). It includes information about the Z environment and how it helps integrate data and transactions more securely. It also provides insight for faster and more accurate business decisions. The z15 is a state-of-the-art data and transaction system that delivers advanced capabilities, which are vital to any digital transformation. The z15 is designed for enhanced modularity, which is in an industry-standard footprint. The z15 system excels at the following tasks: Using multicloud integration services Securing data with pervasive encryption Providing resilience with key to zero downtime Transforming a transactional platform into a data powerhouse Getting more out of the platform with IT Operational Analytics Accelerating digital transformation with agile service delivery Revolutionizing business processes Blending open source and Z technologies This book explains how this system uses new innovations and traditional Z strengths to satisfy growing demand for cloud, analytics, and open source technologies. With the z15 as the base, applications can run in a trusted, reliable, and secure environment that improves operations and lessens business risk.

Applied Statistics

Instructs readers on how to use methods of statistics and experimental design with R software Applied statistics covers both the theory and the application of modern statistical and mathematical modelling techniques to applied problems in industry, public services, commerce, and research. It proceeds from a strong theoretical background, but it is practically oriented to develop one's ability to tackle new and non-standard problems confidently. Taking a practical approach to applied statistics, this user-friendly guide teaches readers how to use methods of statistics and experimental design without going deep into the theory. Applied Statistics: Theory and Problem Solutions with R includes chapters that cover R package sampling procedures, analysis of variance, point estimation, and more. It follows on the heels of Rasch and Schott's Mathematical Statistics via that book's theoretical background—taking the lessons learned from there to another level with this book’s addition of instructions on how to employ the methods using R. But there are two important chapters not mentioned in the theoretical back ground as Generalised Linear Models and Spatial Statistics. Offers a practical over theoretical approach to the subject of applied statistics Provides a pre-experimental as well as post-experimental approach to applied statistics Features classroom tested material Applicable to a wide range of people working in experimental design and all empirical sciences Includes 300 different procedures with R and examples with R-programs for the analysis and for determining minimal experimental sizes Applied Statistics: Theory and Problem Solutions with R will appeal to experimenters, statisticians, mathematicians, and all scientists using statistical procedures in the natural sciences, medicine, and psychology amongst others.

IBM Storage for Red Hat OpenShift Blueprint Version 1 Release 1

IBM Storage for Red Hat OpenShift is a comprehensive container-ready solution that includes all the hardware & software components necessary to setup and/or expand your Red Hat OpenShift environment. This blueprint includes Red Hat OpenShift Container Platform and uses Container Storage Interface (CSI) standards. IBM Storage brings enterprise data services to containers. In this blueprint, learn how to: · Combine the benefits of IBM Systems with the performance of IBM Storage solutions so that you can deliver the right services to your clients today! · Build a 24 by 7 by 365 enterprise class private cloud with Red Hat OpenShift Container Platform utilizing new open source Container Storage interface (CSI) drivers · Leverage enterprise class services such as NVMe based flash performance, high data availability, and advanced container security IBM Storage for Red Hat OpenShift Container Platform is designed for your DevOps environment for on-premises deployment with easy-to-consume components built to perform and scale for your enterprise. Simplify your journey to cloud with pre-tested and validated blueprints engineered to enable rapid deployment and peace of mind as you move to a hybrid multicloud environment. You now have the capabilities.

Introduction to Biostatistics with JMP

Explore biostatistics using JMP in this refreshing introduction Presented in an easy-to-understand way, Introduction to Biostatistics with JMP introduces undergraduate students in the biological sciences to the most commonly used (and misused) statistical methods that they will need to analyze their experimental data using JMP. It covers many of the basic topics in statistics using biological examples for exercises so that the student biologists can see the relevance to future work in the problems addressed. The book starts by teaching students how to become confident in executing the right analysis by thinking like a statistician then moves into the application of specific tests. Using the powerful capabilities of JMP, the book addresses problems requiring analysis by chi-square tests, t tests, ANOVA analysis, various regression models, DOE, and survival analysis. Topics of particular interest to the biological or health science field include odds ratios, relative risk,

IBM Storage Solutions for SAP Applications Version 1.3

This paper is intended as an architecture and configuration guide to set up the IBM® System Storage® for the SAP HANA tailored data center integration (SAP HANA TDI) within a storage area network (SAN) environment. SAP HANA TDI allows the SAP customer to attach external storage to the SAP HANA server. The paper also describes the setup and configuration of SAP Landscape Management for SAP HANA systems on IBM infrastructure components: IBM Power Systems™ and IBM Storage based on IBM Spectrum™ Virtualize. This document is written for IT technical specialists and architects with advanced skill levels on SUSE Linux Enterprise Server (SLES) or Red Hat Enterprise Linux (RHEL) and IBM System Storage. This document provides the necessary information to select, verify, and connect IBM System Storage to the SAP HANA server through a Fibre Channel-based SAN. The recommendations in this Blueprint apply to single-node and scale-out configurations, and Intel and IBM Power based SAP HANA systems.

Database Internals

When it comes to choosing, using, and maintaining a database, understanding its internals is essential. But with so many distributed databases and tools available today, it’s often difficult to understand what each one offers and how they differ. With this practical guide, Alex Petrov guides developers through the concepts behind modern database and storage engine internals. Throughout the book, you’ll explore relevant material gleaned from numerous books, papers, blog posts, and the source code of several open source databases. These resources are listed at the end of parts one and two. You’ll discover that the most significant distinctions among many modern databases reside in subsystems that determine how storage is organized and how data is distributed. This book examines: Storage engines: Explore storage classification and taxonomy, and dive into B-Tree-based and immutable Log Structured storage engines, with differences and use-cases for each Storage building blocks: Learn how database files are organized to build efficient storage, using auxiliary data structures such as Page Cache, Buffer Pool and Write-Ahead Log Distributed systems: Learn step-by-step how nodes and processes connect and build complex communication patterns Database clusters: Which consistency models are commonly used by modern databases and how distributed storage systems achieve consistency

Query Store for SQL Server 2019: Identify and Fix Poorly Performing Queries

Apply the new Query Store feature to identify and fix poorly performing queries in SQL Server. Query Store is an important and recent feature in SQL Server that provides insight into the details of query execution and how that execution has changed over time. Query Store helps to identify queries that aren’t performing well, or that have regressed in their performance. Query Store provides detailed information such as wait stats that you need to resolve root causes, and it allows you to force the use of a known good execution plan. With SQL Server 2017 and later you can automate the correction of regressions in performance. Query Store for SQL Server 2019 helps you protect your database’s performance during upgrades of applications or version of SQL Server. The book provides fundamental information on how Query Store works and best practices for implementation and use. You will learn to run and interpret built-in reports, configure automatic plan correction, and troubleshoot queries using Query Store when needed. Query Store for SQL Server 2019 helps you master Query Store and bring value to your organization through consistent query execution times and automate correction of regressions. What You'll Learn Apply best practices in implementing Query Store on production servers Detect and correct regressions in query performance Lower the risk of performance degradation following an upgrade Use tools and techniques to get the most from Query Store Automate regression correction and other uses of Query Store Who This Book Is For SQL Server developers and administrators responsible for query performance on SQL Server. Anyone responsible for identifying poorly performing queries will be able to use Query Store to find these queries and resolve the underlying issues.

IBM Spectrum Discover: Metadata Management for Deep Insight of Unstructured Storage

This IBM® Redpaper publication provides a comprehensive overview of the IBM Spectrum® Discover metadata management software platform. We give a detailed explanation of how the product creates, collects, and analyzes metadata. Several in-depth use cases are used that show examples of analytics, governance, and optimization. We also provide step-by-step information to install and set up the IBM Spectrum Discover trial environment. More than 80% of all data that is collected by organizations is not in a standard relational database. Instead, it is trapped in unstructured documents, social media posts, machine logs, and so on. Many organizations face significant challenges to manage this deluge of unstructured data such as: Pinpointing and activating relevant data for large-scale analytics Lacking the fine-grained visibility that is needed to map data to business priorities Removing redundant, obsolete, and trivial (ROT) data Identifying and classifying sensitive data IBM Spectrum Discover is a modern metadata management software that provides data insight for petabyte-scale file and Object Storage, storage on premises, and in the cloud. This software enables organizations to make better business decisions and gain and maintain a competitive advantage. IBM Spectrum Discover provides a rich metadata layer that enables storage administrators, data stewards, and data scientists to efficiently manage, classify, and gain insights from massive amounts of unstructured data. It improves storage economics, helps mitigate risk, and accelerates large-scale analytics to create competitive advantage and speed critical research.

IBM Storage for Red Hat OpenShift Container Platform V3.11 Blueprint Version 1 Release 1

IBM Storage for Red Hat OpenShift Container Platform is a comprehensive container-ready solution that includes all the hardware & software components necessary to setup and/or expand your Red Hat OpenShift Container Platform V3.11 environment. IBM Storage, bringing enterprise data services to containers. In this blueprint, learn how to: • Combine the benefits of IBM Systems with the performance of IBM Storage solutions so that you can deliver the right services to your clients today! • Build a 24 by 7 by 365 enterprise class private cloud with Red Hat OpenShift Container Platform • Leverage enterprise class services such as NVMe based flash performance, high data availability, and advanced container security IBM Storage for Red Hat OpenShift Container Platform: designed for your DevOps environment for on-premises deployment with easy-to-consume components built to perform and scale for your enterprise. Simplify your journey to cloud with pre-tested and validated blueprints engineered to enable rapid deployment and peace of mind as you move to a hybrid multicloud environment. You now have the capabilities.

A Gentle Introduction to Statistics Using SASⓇ Studio

Point and click your way to performing statistics! Many people are intimidated by learning statistics, but A Gentle Introduction to Statistics Using SAS Studio is here to help. Whether you need to perform statistical analysis for a project or, perhaps, for a course in education, psychology, sociology, economics, or any other field that requires basic statistical skills, this book teaches the fundamentals of statistics, from designing your experiment through calculating logistic regressions. Serving as an introduction to many common statistical tests and principles, it explains concepts in a non-technical way with little math and very few formulas. Once the basic statistical concepts are covered, the book then demonstrates how to use them with SAS Studio and SAS University Edition’s easy point-and-click interface. Topics included in this book are: How to install and use SAS University Edition Descriptive statistics One-sample tests T tests (for independent or paired samples) One-way analysis of variance (ANOVA) N-way ANOVA Correlation analysis Simple and multiple linear regression Binary logistic regression Categorical data, including two-way tables and chi-square Power and sample size calculations Questions are provided to test your knowledge and practice your skills.

Hands-On SAS for Data Analysis

"Hands-On SAS for Data Analysis" is a practical guide that introduces you to the fundamentals of using SAS for managing and analyzing data effectively. Through a hands-on approach, you'll explore key topics such as data manipulation with SAS 4GL, SQL querying, and creating insightful visualizations and reports. By the end of the book, you'll not only have a robust understanding of SAS but also be prepared for the SAS certification exam. What this Book will help me do Effectively use SAS modules and tools for comprehensive data analysis tasks. Master SAS 4GL functions to perform advanced data manipulation and transformation. Leverage advanced SQL options within SAS to query and analyze datasets. Become proficient in writing SAS Macros to automate repetitive tasks efficiently. Produce professional reports and visualizations using SAS Output Delivery System. Author(s) None Gulati is a renowned expert in data analysis and business intelligence, with years of professional experience in leveraging SAS for enterprise solutions. An experienced trainer and technical author, None has a unique ability to simplify complex concepts. Through this book, None shares practical knowledge that aligns with industry needs and certification goals. Who is it for? This book is designed for data professionals seeking to enhance their skills in SAS programming and data analysis. Whether you're just starting out with SAS or aiming to pass the SAS certification exam, this book will provide valuable insights. Readers with basic knowledge of data management will find this guide especially beneficial.

SAP ABAP Objects: A Practical Guide to the Basics and Beyond

Understand ABAP objects—the object-oriented extension of the SAP language ABAP—in the latest release of SAP NetWeaver 7.5, and its newest advancements. This book begins with the programming of objects in general and the basics of the ABAP language that a developer needs to know to get started. The most important topics needed to perform daily support jobs and ensure successful projects are covered. ABAP is a vast community with developers working in a variety of functional areas. You will be able to apply the concepts in this book to your area. SAP ABAP Objects is goal directed, rather than a collection of theoretical topics. It doesn't just touch on the surface of ABAP objects, but goes in depth from building the basic foundation (e.g., classes and objects created locally and globally) to the intermediary areas (e.g., ALV programming, method chaining, polymorphism, simple and nested interfaces), and then finally into the advanced topics (e.g., shared memory, persistent objects). You will know how to use best practices to make better programs via ABAP objects. What You’ll Learn Know the latest advancements in ABAP objects with the new SAP Netweaver system Understand object-oriented ABAP classes and their components Use object creation and instance-methods calls Be familiar with the functions of the global class builder Be exposed to advanced topics Incorporate best practices for making object-oriented ABAP programs Who This Book Is For ABAP developers, ABAP programming analysts, and junior ABAP developers. Included are: ABAP developers for all modules of SAP, both new learners and developers with some experience or little programming experience in general; students studying ABAP at the college/university level; senior non-ABAP programmers with considerable experience who are willing to switch to SAP/ABAP; and any functional consultants who want or have recently switched to ABAP technical.

IBM Power Systems Enterprise AI Solutions

This IBM® Redpaper publication helps the line of business (LOB), data science, and information technology (IT) teams develop an information architecture (IA) for their enterprise artificial intelligence (AI) environment. It describes the challenges that are faced by the three roles when creating and deploying enterprise AI solutions, and how they can collaborate for best results. This publication also highlights the capabilities of the IBM Cognitive Systems and AI solutions: IBM Watson® Machine Learning Community Edition IBM Watson Machine Learning Accelerator (WMLA) IBM PowerAI Vision IBM Watson Machine Learning IBM Watson Studio Local IBM Video Analytics H2O Driverless AI IBM Spectrum® Scale IBM Spectrum Discover This publication examines the challenges through five different use case examples: Artificial vision Natural language processing (NLP) Planning for the future Machine learning (ML) AI teaming and collaboration This publication targets readers from LOBs, data science teams, and IT departments, and anyone that is interested in understanding how to build an IA to support enterprise AI development and deployment.

IBM FlashSystem A9000, IBM FlashSystem A9000R, and IBM XIV Storage System: Host Attachment and Interoperability

This IBM® Redbooks® publication provides information for attaching the IBM FlashSystem® A9000, IBM FlashSystem A9000R, and IBM XIV® Storage System to various host operating system platforms, such as IBM AIX® and Microsoft Windows. This publication was last updated in May 2019 to cover the VLAN tagging and port trunking support available with software version 12.3.2 (see in particular section 2.4, "VLAN tagging" on page 67. The goal is to give an overview of the versatility and compatibility of the IBM Spectrum™ Accelerate family of storage systems with various platforms and environments. The information that is presented here is not meant as a replacement or substitute for the IBM Storage Host Attachment Kit publications or other product publications. It is meant as a complement and to provide usage guidance and practical illustrations. This publication does not address attachments to a secondary system used for Remote Mirroring or data migration. These topics are covered in IBM FlashSystem A9000 and IBM FlashSystem A9000 and A9000R Business Continuity Solutions, REDP-5401.