talk-data.com

Topic: data (5765 tagged)

Activity Trend: peak of 3 per quarter, 2020-Q1 to 2026-Q1

Activities

5765 activities · Newest first

JMP 13 Fitting Linear Models, Second Edition

JMP 13 Fitting Linear Models focuses on the Fit Model platform and many of its personalities. Linear and logistic regression, analysis of variance and covariance, and stepwise procedures are covered. Also included are multivariate analysis of variance, mixed models, generalized models, and models based on penalized regression techniques.

JMP 13 Multivariate Methods, Second Edition

JMP 13 Multivariate Methods describes techniques for analyzing several variables simultaneously. The book covers descriptive measures, such as correlations. It also describes methods that give insight into the structure of the multivariate data, such as clustering, latent class analysis, principal components, discriminant analysis, and partial least squares.

JMP 13 Predictive and Specialized Modeling, Second Edition

JMP 13 Predictive and Specialized Modeling provides details about modeling techniques such as partitioning, neural networks, nonlinear regression, and time series analysis. Topics include the Gaussian platform, which is useful in analyzing computer simulation experiments. The book also covers the Response Screening platform, which is useful in testing the effect of a predictor when you have many responses.

JMP Start Statistics, 6th Edition

This book provides hands-on tutorials with just the right amount of conceptual and motivational material to illustrate how to use the intuitive interface for data analysis in JMP. Each chapter features concept-specific tutorials, examples, brief reviews of concepts, step-by-step illustrations, and exercises.

Updated for JMP 13, JMP Start Statistics, Sixth Edition covers many new features, including:

The redesigned Formula Editor.

New and improved ways to create formulas in JMP directly from the data table or dialogs.

Interface updates, including improved menu layout.

Updates and enhancements in many analysis platforms.

New ways to get data into JMP and to save and share JMP results.

Many new features that make it easier to use JMP.

Mastering Elasticsearch 5.x - Third Edition

This comprehensive guide dives deep into the functionalities of Elasticsearch 5, the widely-used search and analytics engine. Leveraging the power of Apache Lucene, this book will help you understand advanced concepts like querying, indexing, and cluster management to build efficient and scalable search solutions.

What this Book will help me do

Master advanced features of Elasticsearch such as text scoring, sharding, and aggregation.

Understand how to handle big data efficiently using Elasticsearch's architecture.

Learn practical implementation techniques for Elasticsearch features through hands-on examples.

Develop custom plugins for Elasticsearch to tailor its functionalities to specific needs.

Scale and optimize Elasticsearch clusters for high performance in production environments.

Author(s)

Bharvi Dixit is an experienced software engineer and a recognized expert in implementing Elasticsearch solutions. With a strong background in distributed systems and database management, Bharvi's writing is informed by real-world experience and a focus on practical applications.

Who is it for?

This book is ideal for developers and data engineers with existing experience in Elasticsearch who wish to deepen their knowledge. It serves as a valuable resource for professionals tasked with creating scalable search applications. A working understanding of Elasticsearch basics and query DSL is recommended to fully benefit from this guide.

Total Survey Error in Practice

Featuring a timely presentation of total survey error (TSE), this edited volume introduces valuable tools for understanding and improving survey data quality in the context of evolving large-scale data sets.

This book provides an overview of the TSE framework and current TSE research as related to survey design, data collection, estimation, and analysis. It recognizes that survey data affects many public policy and business decisions and thus focuses on the framework for understanding and improving survey data quality. The book also addresses issues with data quality in official statistics and in social, opinion, and market research as these fields continue to evolve, leading to larger and messier data sets. This perspective challenges survey organizations to find ways to collect and process data more efficiently without sacrificing quality.

The volume consists of the most up-to-date research and reporting from over 70 contributors representing the best academics and researchers from a range of fields. The chapters are broken out into five main sections: The Concept of TSE and the TSE Paradigm, Implications for Survey Design, Data Collection and Data Processing Applications, Evaluation and Improvement, and Estimation and Analysis. Each chapter introduces and examines multiple error sources, such as sampling error, measurement error, and nonresponse error, which often offer the greatest risks to data quality, while also encouraging readers not to lose sight of the less commonly studied error sources, such as coverage error, processing error, and specification error. The book also notes the relationships between errors and the ways in which efforts to reduce one type can increase another, resulting in an estimate with larger total error.

This book:
• Features various error sources, and the complex relationships between them, in 25 high-quality chapters on the most up-to-date research in the field of TSE
• Provides comprehensive reviews of the literature on error sources as well as data collection approaches and estimation methods to reduce their effects
• Presents examples of recent international events that demonstrate the effects of data error, the importance of survey data quality, and the real-world issues that arise from these errors
• Spans the four pillars of the total survey error paradigm (design, data collection, evaluation and analysis) to address key data quality issues in official statistics and survey research

Total Survey Error in Practice is a reference for survey researchers and data scientists in research areas that include social science, public opinion, public policy, and business. It can also be used as a textbook or supplementary material for a graduate-level course in survey research methods.

Paul P. Biemer, PhD, is distinguished fellow at RTI International and associate director of Survey Research and Development at the Odum Institute, University of North Carolina, USA. Edith de Leeuw, PhD, is professor of survey methodology in the Department of Methodology and Statistics at Utrecht University, the Netherlands. Stephanie Eckman, PhD, is fellow at RTI International, USA. Brad Edwards is vice president, director of Field Services, and deputy area director at Westat, USA. Frauke Kreuter, PhD, is professor and director of the Joint Program in Survey Methodology, University of Maryland, USA; professor of statistics and methodology at the University of Mannheim, Germany; and head of the Statistical Methods Research Department at the Institute for Employment Research, Germany. Lars E. Lyberg, PhD, is senior advisor at Inizio, Sweden. N. Clyde Tucker, PhD, is principal survey methodologist at the American Institutes for Research, USA. Brady T. West, PhD, is research associate professor in the Survey Resea

IBM Power Systems L and LC Server Positioning Guide

This IBM® Redpaper™ publication is written to assist you in locating the optimal server/workload fit within the IBM Power Systems™ L and IBM OpenPOWER LC product lines. IBM has announced several scale-out servers, and as a partner in the OpenPOWER organization, unique design characteristics that are engineered into the LC line have broadened the suite of available workloads beyond typical client OS hosting. This paper looks at the benefits of the Power Systems L servers and OpenPOWER LC servers, and how they are different, providing unique benefits for Enterprise workloads and use cases.

2017 European Data Science Salary Survey

How do data science salaries for people in Europe compare to their counterparts in the rest of the world? Among the more than 1000 people who responded to O’Reilly’s 2016 Data Science Salary Survey, 359 live and work in various European countries as data scientists, analysts, engineers, and related professions. This report takes a deep dive into the survey results from respondents in various regions of Europe, including the tools they use, the compensation they receive, and the roles they play in their respective organizations. Even if you didn’t take part in the survey, you can still plug your own information into the survey’s simple linear model to see where you fit.

With this report, you’ll learn:

How salaries vary by country and specific regions in Europe

Average size of companies by region

How salary is affected by a country’s GDP

Top industries for data scientists, including software, banking, finance, retail, and ecommerce

Most commonly used tools vs tools used by respondents with above-average salaries

Primary and secondary job tasks performed by survey respondents

To stay up-to-date on this research, your participation is crucial. The survey is now open for the 2017 report; please take just 5 to 10 minutes to participate in the survey here.

Big Data Now: 2016 Edition

Now in its sixth edition, O’Reilly’s annual Big Data Now report recaps the trends, tools, applications, and forecasts we’ve examined throughout 2016. This collection of blog posts, authored by leading thinkers and experts in the field, reflects a unique set of themes we’ve identified as gaining significant attention and traction. Our list of topics for 2016 includes:

Careers in data

Tools and architecture for big data

Intelligent real-time applications

Cloud infrastructure

Machine learning: models and training

Deep learning and artificial intelligence

Geospatial Data and Analysis

Geospatial data, or data with location information, is generated in huge volumes every day by billions of mobile phones, IoT sensors, drones, nanosatellites, and many other sources in an unending stream. This practical ebook introduces you to the landscape of tools and methods for making sense of all that data, and shows you how to apply geospatial analytics to a variety of issues, large and small.

Authors Aurelia Moser, Jon Bruner, and Bill Day provide a complete picture of the geospatial analysis options available, including low-scale commercial desktop GIS tools, medium-scale options such as PostGIS and Lucene-based searching, and true big data solutions built on technologies such as Hadoop. You’ll learn when it makes sense to move from one type of solution to the next, taking increased costs and complexity into account.

Explore the structure of basic webmaps, and the challenges and constraints involved when working with geo data

Dive into low- to medium-scale mapping tools for use in backend and frontend web development

Focus on tools for robust medium-scale geospatial projects that don’t quite justify a big data solution

Learn about innovative platforms and software packages for solving issues of processing and storage of large-scale data

Examine geodata analysis use cases, including disaster relief, urban planning, and agriculture and environmental monitoring

Learning Kibana 5.0

Learning Kibana 5.0 is your gateway to mastering the art of data visualization using the powerful features of the Kibana platform. This book guides you through the process of creating stunning interactive dashboards and making data-driven insights accessible with real-time visualizations. Whether you're new to the Elastic stack or seeking to refine your expertise, this book equips you to harness Kibana's full potential.

What this Book will help me do

Build robust, real-time dashboards in Kibana to visualize complex datasets efficiently.

Leverage Timelion to perform time-series data analysis and create metrics-based dashboards.

Explore advanced analytics using the Graph plugin to uncover relationships and correlations in data.

Learn how to create and deploy custom plugins to tailor Kibana to specific project needs.

Understand how to use the Elastic stack to monitor, analyze, and optimize various types of data flows.

Author(s)

Bahaaldine Azarmi is a seasoned expert in the Elastic stack, known for his dedication to making complex technical topics approachable and practical. With years of experience in data analytics and software development, Bahaaldine shares not only his technical expertise but also his passion for helping professionals achieve their goals through clear, actionable guidance. His writing emphasizes hands-on learning and practical application.

Who is it for?

This book is perfect for developers, data visualization engineers, and data scientists who aim to hone their skills in data visualization and interactive dashboard development. It assumes a basic understanding of Elasticsearch and Logstash to maximize its practicality. If you aim to advance your career by learning how to optimize data architecture and solve real-world problems using the Elastic stack, this book is ideal for you.

Cloud Data Sharing with IBM Spectrum Scale

This IBM® Redpaper™ publication provides information to help you with the sizing, configuration, and monitoring of hybrid cloud solutions using the Cloud data sharing feature of IBM Spectrum Scale™. IBM Spectrum Scale, formerly IBM General Parallel File System (IBM GPFS™), is a scalable data and file management solution that provides a global namespace for large data sets along with several enterprise features. Cloud data sharing allows for the sharing and use of data between various cloud object storage types and IBM Spectrum Scale. Cloud data sharing can help with the movement of data in both directions, between file systems and cloud object storage, so that data is where it needs to be, when it needs to be there. This paper is intended for IT architects, IT administrators, storage administrators, and those who want to learn more about sizing, configuration, and monitoring of hybrid cloud solutions using IBM Spectrum Scale and Cloud data sharing.

IBM DS8880 Integrated Copy Services Manager and LDAP Client on the HMC

IBM® Copy Services Manager (CSM) is a replication management solution that is based on the IBM Tivoli® Productivity Center for Replication technology. CSM inherits all the Tivoli Productivity Center for Replication capabilities and continues to provide Copy Services solutions for most IBM storage offerings. The IBM DS8880, starting with firmware Release 8.1, Licensed Machine Code (LMC) 8.8.10.xx.xx, includes CSM for the IBM System Storage® DS8000®, which is pre-installed on the Hardware Management Console (HMC). If you ordered the CSM feature code as part of your IBM DS8000 system configuration, you only need to activate CSM. CSM as installed on the HMC, or acquired separately, includes a lightweight build of the IBM WebSphere® Liberty server code, to use to authenticate CSM users through a Lightweight Directory Access Protocol (LDAP). The same integrated LDAP support can be used for remote authentication of DS8000 users. Furthermore, if you simply want to take advantage of the CSM LDAP client for DS8000 LDAP authentication, the CSM license and CSM activation are not required. This IBM Redpaper™ publication describes the requirements for setup and usage of CSM on the DS8000 HMC, for both Copy Services management and LDAP authentication.

PostgreSQL High Availability Cookbook - Second Edition

Master the essential strategies for ensuring high availability in PostgreSQL with this practical cookbook. You'll learn how to build resilient PostgreSQL database clusters that can withstand failures, safely replicate data, and scale to meet increasing demands, ensuring your application's reliability.

What this Book will help me do

Understand and apply replication techniques in PostgreSQL to protect your data and ensure consistency.

Set up a robust database cluster using tools like Patroni or Pacemaker to automate failover and maintain availability.

Learn hardware configuration best practices for building a strong database platform.

Optimize resource usage in your PostgreSQL clusters with connection pooling techniques using pgpool and PgBouncer.

Implement advanced monitoring and alerting solutions to effectively track and respond to potential issues in real-time.

Author(s)

Shaun Thomas is a seasoned database administrator and consultant specializing in PostgreSQL high availability and clustering solutions. With years of hands-on experience in building resilient and scalable database systems, Shaun shares actionable insights and methodologies in a clear and accessible manner. His real-world knowledge and passion for database reliability shine through in his practical and effective writing style, making this book an invaluable resource.

Who is it for?

This book is perfect for Linux system administrators and PostgreSQL DBAs seeking to enhance the reliability and resilience of their database systems. If you're responsible for reducing downtime, improving failover processes, or managing databases in high-demand scenarios, this book provides the tools and techniques you need. It's especially helpful for professionals looking to deepen their understanding of PostgreSQL-specific solutions to high availability challenges.

IBM Hyper-Scale Manager for IBM Spectrum Accelerate Family: IBM XIV, IBM FlashSystem A9000 and A9000R, and IBM Spectrum Accelerate

This IBM® Redbooks® publication describes storage management functions and their configuration and usage with the IBM Hyper-Scale Manager management graphical user interface (GUI) for IBM FlashSystem® A9000 and A9000R, IBM XIV® Gen3, and IBM Spectrum™ Accelerate software. The web-based GUI provides a revolutionary object-centered interface design that is aimed toward ease of use together with enhanced efficiency for storage administrators. The first chapter describes general features of the GUI and installation of the IBM Hyper-Scale Manager server. Subsequent chapters illustrate some typical GUI actions, among many other possibilities, to manage and configure the storage systems, to define security roles, and to set up multitenancy. For most of the GUI-based actions that are illustrated in this book, the corresponding XIV Storage System command-line interface (XCLI) commands are also shown. IBM Hyper-Scale Manager based GUI information regarding host attachment and replication is covered in IBM FlashSystem A9000, IBM FlashSystem A9000R, and IBM XIV Storage System: Host Attachment and Interoperability, SG24-8368 and IBM FlashSystem A9000 and A9000R Replication Solutions, REDP-5401.

Elasticsearch 5.x Cookbook - Third Edition

Elasticsearch 5.x Cookbook is a comprehensive guide that teaches you how to leverage the full power of Elasticsearch for high-performance search and analytics. Through step-by-step recipes, you'll explore deployment, query building, plugin integration, and advanced analytics, ensuring you can manage and scale Elasticsearch like a pro.

What this Book will help me do

Understand and deploy complex Elasticsearch cluster topologies for optimal performance.

Create tailored mappings to gain finer control over data indexing and retrieval.

Design and execute advanced queries and analytics using Elasticsearch capabilities.

Integrate Elasticsearch with popular programming languages and big data platforms.

Monitor and improve Elasticsearch cluster health using the best practices and tools.

Author(s)

Alberto Paro is a seasoned software engineer and data scientist with extensive experience in distributed systems and search technologies. Having worked on numerous search-related projects, he brings practical, real-world insights to his writing. Alberto is passionate about teaching and simplifying complex concepts, making this book both approachable and expertly detailed.

Who is it for?

This book is ideal for developers or data engineers seeking to utilize Elasticsearch for advanced search and analytics tasks. If you have some prior knowledge of JSON and programming concepts, particularly Java, you will benefit most from this material. Whether you're looking to integrate Elasticsearch into your systems or to optimize its usage, this book caters to your needs.

Evolutionary Computation with Biogeography-based Optimization

Evolutionary computation algorithms are employed to minimize functions with a large number of variables. Biogeography-based optimization (BBO) is an optimization algorithm that is based on the science of biogeography, which researches the migration patterns of species. These migration paradigms provide the main logic behind BBO. Due to the cross-disciplinary nature of the optimization problems, there is a need to develop multiple approaches to tackle them and to study the theoretical reasoning behind their performance. This manuscript intends to explain the mathematical model of the BBO algorithm and its variants created to cope with continuous domain problems (with and without constraints) and combinatorial problems.

Professional Microsoft SQL Server 2016 Reporting Services and Mobile Reports

Optimize reporting and BI with Microsoft SQL Server 2016

Professional Microsoft SQL Server 2016 Reporting Services and Mobile Reports provides a comprehensive lesson in business intelligence (BI), operational reporting and Reporting Services architecture using a clear, concise tutorial approach. You'll learn effective report solution design based upon many years of experience with successful report solutions. Improve your own reports with advanced, best-practice design, usability, query design, and filtering techniques. Expert guidance provides insight into common report types and explains where each could be made more efficient, while providing step-by-step instruction on Microsoft SQL Server 2016. All changes to the 2016 release are covered in detail, including improvements to the Visual Studio Report Designer (SQL Server Data Tools) and Report Builder, Mobile Dashboard Designer, the new Report Portal Interface, HTML-5 Rendering, Power BI integration, Custom Parameters Pane, and more.

The Microsoft SQL Server 2016 release will include significant changes. New functionality, new capabilities, re-tooled processes, and changing support require a considerable update to existing knowledge. Whether you're starting from scratch or simply upgrading, this book is an essential guide to report design and business intelligence solutions.

Understand BI fundamentals and Reporting Services architecture

Learn the ingredients to a successful report design

Get up to speed on Microsoft SQL Server 2016

Grasp the purpose behind common designs to optimize your reporting

Microsoft SQL Server Reporting Services makes reporting faster, easier, and more powerful than ever in web, desktop and portal solutions. Compatibility with an extensive variety of data sources makes it a go-to solution for organizations across the globe. The 2016 release brings some of the biggest changes in years, and the full depth and breadth of these changes can create a serious snag in your workflow. For a clear tutorial geared toward the working professional, Professional Microsoft SQL Server 2016 Reporting Services and Mobile Reports is the ideal guide for getting up to speed and producing successful reports.

HBase High Performance Cookbook

"HBase High Performance Cookbook" is your guide to mastering the optimization, scaling, and tuning of HBase systems. Covering everything from configuring HBase clusters to designing scalable table structures and performance tuning, this comprehensive book provides practical advice and strategies for leveraging HBase's full potential. By following this book's recipes, you'll supercharge your HBase expertise.

What this Book will help me do

Understand how to configure HBase for optimal performance, improving your data system's efficiency.

Learn to design table structures to maximize scalability and functionality in HBase.

Gain skills in performing CRUD operations and using advanced features like MapReduce within HBase.

Discover practices for integrating HBase with other technologies such as ElasticSearch.

Master the steps involved in setting up and optimizing HBase in cloud environments for enhanced performance.

Author(s)

Ruchir Choudhry is a seasoned data management professional with extensive experience in distributed database systems. He possesses deep expertise in HBase, Hadoop, and other big data technologies. His practical and engaging writing style aims to demystify complex technical topics, making them accessible to developers and architects alike.

Who is it for?

This book is tailored for developers and system architects looking to deepen their understanding of HBase. Whether you are experienced with other NoSQL databases or are new to HBase, this book provides extensive practical knowledge. Ideal for professionals working in big data applications or those eager to optimize and scale their database systems effectively.