talk-data.com talk-data.com

Topic

data

5765

tagged

Activity Trend

3 peak/qtr
2020-Q1 2026-Q1

Activities

5765 activities · Newest first

Redash v5 Quick Start Guide

In the 'Redash v5 Quick Start Guide', you'll learn everything you need to master the Redash data visualization platform and confidently create compelling dashboards. This book covers how to connect to different data sources, use SQL to query data, and design and share insightful visualizations. What this Book will help me do Understand how to install, configure, and troubleshoot Redash for your data projects. Gain skills in managing user roles and permissions to ensure secure data collaboration. Learn to connect Redash to various data sources and fetch, process, and handle data. Master the creation of advanced visualizations to effectively present complex data. Develop proficiency in utilizing the Redash API for integrating programmatic interactions. Author(s) None Leibzon is a recognized expert in data visualization and Business Intelligence tools, with years of experience working with data-driven systems. Drawing from his deep practical knowledge of Redash and its applications, None has crafted this guide to be accessible and highly practical. His goal is to enable learners and professionals to unlock the power of data storytelling through intuitive and actionable visualization. Who is it for? If you're a Data Analyst, BI professional, or Data Developer with basic SQL skills, this book is tailored for you. It assumes no prior knowledge of Redash but benefits those who understand fundamental Business Intelligence concepts. Whether you're looking to create your first visualization or streamline data collaboration, this guide will help you achieve your goals.

Applied Data Visualization with R and ggplot2

Applied Data Visualization with R and ggplot2 introduces the crucial concepts of creating compelling data visualizations using R's powerful ggplot2 library in a straightforward and efficient manner. Through engaging explanations and practical exercises, you'll learn to set up your R environment, understand the components of the grammar of graphics, and design visualizations that bring your data to life. What this Book will help me do Master the setup of RStudio and the application of ggplot2's core structure. Harness the grammar of graphics to create meaningful data visualizations. Design visually appealing and informative custom plots with various ggplot2 features. Understand and apply advanced visualization techniques such as density plots and facet plotting. Develop the ability to communicate insights effectively through data visualizations. Author(s) Dr. Tania Moulik is a respected data visualization practitioner and educator, with years of experience using R and ggplot2. She channels her passion for teaching to enable data professionals to enhance their practice through improved visualizations. Dr. Moulik's clear and systematic approach ensures that learners at any level can unlock the potential of their data with ease. Who is it for? This book is ideal for data professionals looking to enhance their visualization skills with R and ggplot2. If you're a student aiming to delve deeper into data analysis using advanced plotting techniques, this book was written for you. It assumes a foundational knowledge of R programming, but is accessible whether you're building your skills or honing your craft. This book aligns perfectly with anyone driven to transform data into actionable insights and compelling visual narratives.

Getting Started with Tableau 2018.x

Dive into the world of data visualization with "Getting Started with Tableau 2018.x." This comprehensive guide introduces you to both the fundamental and advanced functionalities of Tableau 2018.x, making it easier to create impactful data visualizations. Learn to unlock Tableau's full potential through practical examples and clear explanations. What this Book will help me do Understand the new Tableau 2018.x features like density, extensions, and transparency and how to leverage them. Learn how to connect to data sources, perform transformations, and build efficient data models to support your analysis. Master visualization techniques to design effective and insightful dashboards tailored to business needs. Explore advanced concepts such as calculations, cross-database joins, and data blending to handle complex scenarios. Develop the confidence to publish and interact with content on Tableau Server and share your insights effectively. Author(s) None Guillevin and None Pires are data visualization experts with extensive experience using Tableau. They aim to make data analysis accessible through hands-on examples and easy-to-follow explanations. Their writing balances clear instruction with practical application, making advanced concepts understandable for all readers. Who is it for? This book is ideal for beginners or experienced BI professionals who wish to gain expertise in Tableau 2018.x. It caters to aspiring analysts and business professionals looking to answer complex business-specific questions through data visualization. Regardless of prior experience in Tableau or other BI tools, this book provides value through a structured learning approach.

MicroStrategy Quick Start Guide

In 'MicroStrategy Quick Start Guide,' you'll learn how to transform your raw business data into actionable insights using MicroStrategy. The book covers everything from setting up and configuring MicroStrategy tools to creating insightful dashboards and managing BI solutions from start to finish. What this Book will help me do Configure the MicroStrategy Intelligence Server and essential tools. Create and utilize MicroStrategy Projects and manage metadata repositories. Design effective MicroStrategy Reports to retrieve key business insights. Develop engaging dashboards for advanced data visualization and storytelling. Administer and secure your MicroStrategy BI solutions for stable operation. Author(s) None Rivero Esqueda brings their extensive experience in Business Intelligence solutions to this practical guide. Known for their expertise in MicroStrategy, they are passionate about empowering data analysts and BI professionals to leverage data for better decisions. Their professional insight and accessible approach make this book a valuable resource for readers at all levels. Who is it for? This book is ideal for Business Intelligence professionals or data analysts looking to explore MicroStrategy as their primary BI tool. Readers should have a basic understanding of BI concepts and data analysis. It is tailored to suit beginners as well as professionals transitioning to MicroStrategy. If you are eager to create impactful visualizations and dashboards while mastering MicroStrategy, this is the perfect guide for you.

MongoDB 4 Quick Start Guide

"MongoDB 4 Quick Start Guide" is your gateway into understanding and utilizing MongoDB, the world's leading NoSQL database alternative. Through this approachable guide, you will quickly learn how to install, secure, and effectively perform database operations using MongoDB Version 4. What this Book will help me do Master the installation and configuration of MongoDB to prepare for secure database setups. Execute CRUD operations seamlessly to manage your data through the MongoDB shell. Construct queries using the aggregation pipeline for robust data analysis. Implement replication and sharding to ensure data safety and scaleability. Use the PHP MongoDB driver to integrate MongoDB effectively with web applications. Author(s) None Bierer is an expert in database technologies with extensive experience in NoSQL solutions, particularly MongoDB. Their passion for teaching developers new and efficient ways to work with databases shines through in this practical and hands-on guide. Who is it for? This book is perfect for web developers looking to enhance their understanding of modern databases, IT professionals interested in NoSQL solutions, and DBAs transitioning from relational databases to document-oriented databases. Prior experience with databases can be helpful, but this guide is accessible even for enthusiastic beginners seeking to learn MongoDB.

D3.js Quick Start Guide

D3.js Quick Start Guide is your go-to resource for mastering D3.js, a powerful JavaScript library for creating interactive visualizations in the browser. This book walks you through core concepts, from building scatter plots to creating force-directed graphs, helping you go from beginner to creating stunning visual data representations. What this Book will help me do Create interactive scatter plots showcasing data relationships. Implement bar graphs that dynamically update from API data. Design animated pie charts for visually appealing representations. Develop force-directed graphs to represent networked data. Leverage GeoJSON data for building informative interactive maps. Author(s) None Huntington is an experienced web developer with a clear knack for turning complex topics into understandable concepts. With expertise in data visualization and web technologies, Huntington explains technical subject matter in a friendly and approachable manner, ensuring learners grasp both theoretical and practical aspects effectively. Who is it for? This book is ideal for web developers and data enthusiasts eager to learn how to represent data via interactive visualizations using D3.js. If you have a basic understanding of JavaScript and are looking to enhance your web development skillset with dynamic visualization techniques, this guide is perfect for you. Through easy-to-follow examples, you'll get up to speed quickly and start building professional-looking visualizations right away. Whether you're a data scientist, interactive news developer, or just interested in bringing data to life, this book is your key to mastering D3.js.

Python Data Analytics: With Pandas, NumPy, and Matplotlib

Explore the latest Python tools and techniques to help you tackle the world of data acquisition and analysis. You'll review scientific computing with NumPy, visualization with matplotlib, and machine learning with scikit-learn. This revision is fully updated with new content on social media data analysis, image analysis with OpenCV, and deep learning libraries. Each chapter includes multiple examples demonstrating how to work with each library. At its heart lies the coverage of pandas, for high-performance, easy-to-use data structures and tools for data manipulation Author Fabio Nelli expertly demonstrates using Python for data processing, management, and information retrieval. Later chapters apply what you've learned to handwriting recognition and extending graphical capabilities with the JavaScript D3 library. Whether you are dealing with sales data, investment data, medical data, web page usage, or other data sets, Python Data Analytics, Second Edition is an invaluable reference with its examples of storing, accessing, and analyzing data. What You'll Learn Understand the core concepts of data analysis and the Python ecosystem Go in depth with pandas for reading, writing, and processing data Use tools and techniques for data visualization and image analysis Examine popular deep learning libraries Keras, Theano,TensorFlow, and PyTorch Who This Book Is For Experienced Python developers who need to learn about Pythonic tools for data analysis

R Programming Fundamentals

Master the essentials of programming with R and streamline your data analysis workflow with 'R Programming Fundamentals'. This book introduces key R concepts like data structures and control flow, and guides you through practical applications such as data visualization with ggplot2. By the end, you will progress to completing a full data science project for practical hands-on experience. What this Book will help me do Learn to use R's core features, including package management, data structures, and control flow. Process and clean datasets effectively within R, handling missing values and variable transformation. Master data visualization techniques with ggplot2 to create insightful plots and charts. Develop skills to import diverse datasets such as CSVs, Excel spreadsheets, and SQL databases into R. Construct a data science project end-to-end, applying skills in analysis, visualization, and reporting. Author(s) Kaelen Medeiros is a dedicated teacher with a passion for making complex concepts accessible. Bringing years of experience in data science and statistical computing, Kaelen excels at helping learners understand and leverage R for their data analysis needs. With a focus on practical learning, Kaelen has designed this book to give you the hands-on experience and foundational knowledge you need. Who is it for? This book is perfect for analysts looking to enhance their data science toolkit by learning R. It's especially suited for those with little R programming experience looking to start with foundational concepts. Whether you're an aspiring data scientist or a seasoned professional seeking a refresher, this book offers a structured approach to mastering R effectively.

Web Application Development with R Using Shiny - Third Edition

Transform your R programming into interactive web applications with "Web Application Development with R Using Shiny." This book takes you step-by-step through creating dynamic user interfaces and web solutions with the R Shiny package, empowering you to build impactful tools that showcase your data. What this Book will help me do Create interactive web applications using R Shiny. Apply JavaScript for added functionality and customization in Shiny apps. Effortlessly deploy Shiny apps online for accessibility. Understand Shiny UI functions to design effective user interfaces. Leverage data visualization techniques for insightful analytics in apps. Author(s) Chris Beeley and Shitalkumar R. Sukhdeve bring their profound expertise in R programming and Shiny development to this book. Chris is an experienced data scientist passionate about interactive data solutions, while Shitalkumar, with a strong computing background, shares his hands-on insights. Their collaborative and tutorial approach ensures learners grasp each concept smoothly. Who is it for? This book is ideal for R programmers eager to transition from static data evaluation to engaging, interactive web applications. It caters to professionals and enthusiasts seeking practical, hands-on coding guidance. Readers should have foundational R programming knowledge, ensuring a smooth transition into Shiny concepts.

Inside the Message Passing Interface

A hands-on guide to writing a Message Passing Interface, this book takes the reader on a tour across major MPI implementations, best optimization techniques, application relevant usage hints, and a historical retrospective of the MPI world, all based on a quarter of a century spent inside MPI. Readers will learn to write MPI implementations from scratch, and to design and optimize communication mechanisms using pragmatic subsetting as the guiding principle. Inside the Message Passing Interface also covers MPI quirks and tricks to achieve best performance. Dr. Alexander Supalov created the Intel Cluster Tools product line, including the Intel MP Library that he designed and led between 2003 and 2015. He invented the common MPICH ABI and also guided Intel efforts in the MPI Forum during the development of the MPI-2.1, MPI-2.2, and MPI-3 standards. Before that, Alexander designed new finite-element mesh-generation methods, contributing to the PARMACS and PARASOL interfaces, and developed the first full MPI-2 and IMPI implementations in the world. He graduated from the Moscow Institute of Physics and Technology in 1990, and earned his PhD in applied mathematics at the Institute of Numerical Mathematics of the Russian Academy of Sciences in 1995. Alexander holds 26 patents (more pending worldwide).

Kafka Streams in Action

Kafka Streams in Action teaches you everything you need to know to implement stream processing on data flowing into your Kafka platform, allowing you to focus on getting more from your data without sacrificing time or effort. About the Technology Not all stream-based applications require a dedicated processing cluster. The lightweight Kafka Streams library provides exactly the power and simplicity you need for message handling in microservices and real-time event processing. With the Kafka Streams API, you filter and transform data streams with just Kafka and your application. About the Book Kafka Streams in Action teaches you to implement stream processing within the Kafka platform. In this easy-to-follow book, you’ll explore real-world examples to collect, transform, and aggregate data, work with multiple processors, and handle real-time events. You’ll even dive into streaming SQL with KSQL! Practical to the very end, it finishes with testing and operational aspects, such as monitoring and debugging. What's Inside Using the KStreams API Filtering, transforming, and splitting data Working with the Processor API Integrating with external systems About the Reader Assumes some experience with distributed systems. No knowledge of Kafka or streaming applications required. About the Author Bill Bejeck is a Kafka Streams contributor and Confluent engineer with over 15 years of software development experience. Quotes A great way to learn about Kafka Streams and how it is a key enabler of event-driven applications. - From the Foreword by Neha Narkhede, Cocreator of Apache Kafka A comprehensive guide to Kafka Streams—from introduction to production! - Bojan Djurkovic, Cvent Bridges the gap between message brokering and real-time streaming analytics. - Jim Mantheiy Jr., Next Century Valuable both as an introduction to streams as well as an ongoing reference. - Robin Coe, TD Bank

IBM Spectrum Scale Security

Storage systems must provide reliable and convenient data access to all authorized users while simultaneously preventing threats coming from outside or even inside the enterprise. Security threats come in many forms, from unauthorized access to data, data tampering, denial of service, and obtaining privileged access to systems. According to the Storage Network Industry Association (SNIA), data security in the context of storage systems is responsible for safeguarding the data against theft, prevention of unauthorized disclosure of data, prevention of data tampering, and accidental corruption. This process ensures accountability, authenticity, business continuity, and regulatory compliance. Security for storage systems can be classified as follows: Data storage (data at rest, which includes data durability and immutability) Access to data Movement of data (data in flight) Management of data IBM® Spectrum Scale is a software-defined storage system for high performance, large-scale workloads on-premises or in the cloud. IBM Spectrum™ Scale addresses all four aspects of security by securing data at rest (protecting data at rest with snapshots, and backups and immutability features) and securing data in flight (providing secure management of data, and secure access to data by using authentication and authorization across multiple supported access protocols). These protocols include POSIX, NFS, SMB, Hadoop, and Object (REST). For automated data management, it is equipped with powerful information lifecycle management (ILM) tools that can help administer unstructured data by providing the correct security for the correct data. This IBM Redpaper™ publication details the various aspects of security in IBM Spectrum Scale™, including the following items: Security of data in transit Security of data at rest Authentication Authorization Hadoop security Immutability Secure administration Audit logging Security for transparent cloud tiering (TCT) Security for OpenStack drivers Unless stated otherwise, the functions that are mentioned in this paper are available in IBM Spectrum Scale V4.2.1 or later releases.

EU GDPR: A Pocket Guide, School's edition

The EU General Data Protection Regulation (GDPR) unifies data protection and unifies data protection across the EU. It applies to every organisation in the world that handles EU residents’ personal data – which includes schools. The Regulation introduces a number of key changes for schools – and the change from compliance with the Data Protection Act 1998 (DPA) to GDPR compliance is a complex one. We have revised our popular EU GDPR – A Pocket Guide to include specific expectations of and requirements for schools, and provide an accessible overview of the changes you need to make to comply with the Regulation. GDPR – A Pocket Guide Schools’ Edition sets out: A brief history of data protection and national data protection laws in the EU, including as the UK’s DPA); Explanations of the terms and definitions used in the GDPR; The key requirements of the GDPR; The need to appoint a data protection officer (DPO); The lawful basis of processing data and when consent is needed; How to comply with the Regulation; and A full index of the Regulation, enabling you to find relevant articles quickly and easily. This pocket guide is the ideal resource for anyone wanting a clear, concise primer on the GDPR.

IBM FlashSystem V9000 Model AE3 Product IBM FlashSystem V9000 AC3 with Flash Enclosure Model AE3 Product Guide

This IBM Redbooks® Product Guide describes IBM FlashSystem® V9000, which is a comprehensive all-flash enterprise storage solution that delivers the full capabilities of IBM FlashCore® technology. In addition, it provides a rich set of software-defined storage features, including IBM Real-time Compression™, data reductions, dynamic tiering, thin provisioning, snapshots, cloning, replication, data copy services, and IBM HyperSwap® for high availability. Scale out scale up configurations can now add a hot spare node to further enhance availability. With the release of FlashSystem V9000 Software V8.1, extra functions and features are available, including support for new and more powerful FlashSystem V9000 storage enclosure Model AE3. Software features added include GUI enhancements, a new dashboard, support assistance, and data deduplication. AE3 capacities include Small (3.6 TB), Medium (8.5 TB), and Large (18 TB) IBM MicroLatency® modules for between 14.4 TB and 180 TB usable capacity (TBu), with inline hardware compression increasing the capacity up to 219 TB effective capacity (TBe). New SAS-based small form factor (SFF) and large form factor (LFF) expansion enclosures that provide a mixture of nearline hard disk drives (HDDs) and flash MDisks in a pool that can be used for IBM Easy Tier®. The new IBM FlashSystem V9000 SFF expansion enclosure Model92F offers new tiering options with low-cost solid-state drive (SSD flash drives) and nearline HDDs. Up to 784 drives per node pair of serial-attached SCSI (SAS) expansions are supported per FlashSystem V9000 controller pair, providing up to 480 drives with expansion Model 24F and up to 240 drives with expansion Model 12F. FlashSystem V9000 Software version 8.1 replaces version 7.8, and is available to all IBM FlashSystem V9000 customers with current warranty or software maintenance agreements.

Unstructured Data Analysis

Unstructured data is the most voluminous form of data in the world, and several elements are critical for any advanced analytics practitioner leveraging SAS software to effectively address the challenge of deriving value from that data. This book covers the five critical elements of entity extraction, unstructured data, entity resolution, entity network mapping and analysis, and entity management. By following examples of how to apply processing to unstructured data, readers will derive tremendous long-term value from this book as they enhance the value they realize from SAS products.

Learning, Unlearning and Re-Learning Curves

Learning is an empirical phenomenon whereby people or organisations undergo a level of efficiency improvement with recurring tasks. Alan Jones pragmatic guide to this important element within estimating introduces two key learning curve models: Wright and Crawford and explains where, how and when to apply them.

Power BI Data Analysis and Visualization

Power BI Data Analysis and Visualization provides a roadmap to vendor choices and highlights why Microsoft’s Power BI is a very viable, cost effective option for data visualization. The book covers the fundamentals and most commonly used features of Power BI, but also includes an in-depth discussion of advanced Power BI features such as natural language queries; embedding Power BI dashboards; and live streaming data. It discusses real solutions to extract data from the ERP application, Microsoft Dynamics CRM, and also offers ways to host the Power BI Dashboard as an Azure application, extracting data from popular data sources like Microsoft SQL Server and open-source PostgreSQL. Authored by Microsoft experts, this book uses real-world coding samples and screenshots to spotlight how to create reports, embed them in a webpage, view them across multiple platforms, and more. Business owners, IT professionals, data scientists, and analysts will benefit from this thorough presentation of Power BI and its functions.

Random Number Generators—Principles and Practices

Random Number Generators, Principles and Practices has been written for programmers, hardware engineers, and sophisticated hobbyists interested in understanding random numbers generators and gaining the tools necessary to work with random number generators with confidence and knowledge. Using an approach that employs clear diagrams and running code examples rather than excessive mathematics, random number related topics such as entropy estimation, entropy extraction, entropy sources, PRNGs, randomness testing, distribution generation, and many others are exposed and demystified. If you have ever Wondered how to test if data is really random Needed to measure the randomness of data in real time as it is generated Wondered how to get randomness into your programs Wondered whether or not a random number generator is trustworthy Wanted to be able to choose between random number generator solutions Needed to turn uniform random data into a different distribution Needed to ensure the random numbers from your computer will work for your cryptographic application Wanted to combine more than one random number generator to increase reliability or security Wanted to get random numbers in a floating point format Needed to verify that a random number generator meets the requirements of a published standard like SP800-90 or AIS 31 Needed to choose between an LCG, PCG or XorShift algorithm Then this might be the book for you.

Introducing InnoDB Cluster: Learning the MySQL High Availability Stack

Set up, manage, and configure the new InnoDB Cluster feature in MySQL from Oracle. If you are growing your MySQL installation and want to explore making your servers highly available, this book provides what you need to know about high availability and the new tools that are available in MySQL 8.0.11 and later. Introducing InnoDB Cluster teaches you about the building blocks that make up InnoDB Cluster such as MySQL Group Replication for storing data redundantly, MySQL Router for the routing of inbound connections, and MySQL Shell for simplified setup and configuration, status reporting, and even automatic failover. You will understand how it all works together to ensure that your data are available even when your primary database server goes down. Features described in this book are available in the Community Edition of MySQL, beginning with the version 8.0.11 GA release, making this book relevant for any MySQL users in need of redundancy against failure. Tutorials in the book show how to configure a test environment and plan a production deployment. Examples are provided in the form of a walk-through of a typical MySQL high-availability setup. What You'll Learn Discover the newest high-availability features in MySQL Set up and use InnoDB Cluster as an HA solution Migrate your existing servers to MySQL 8 Employ best practices for using InnoDB Cluster Configure servers for optimal automatic failover to ensure that applications continue when a server fails Configure MySQL Router to load-balance inbound connections to the cluster Who This Book Is For Systems engineers, developers, and database professionals wanting to learn about the powerful high availability (HA) features, beginning with MySQL 8.0.11: MySQL Shell, MySQL Router, and MySQL Group Replication. The book is useful for those designing high-availability systems backed by a database, and for those interested in open source HA solutions.