talk-data.com

Topic: data (5765 tagged)

Activity Trend: 3 peak/qtr, 2020-Q1 to 2026-Q1

Activities

5765 activities · Newest first

Extending Power BI with Python and R

Dive into the world of advanced analytics and visualizations in Power BI with "Extending Power BI with Python and R". This comprehensive guide will teach you how to integrate Python and R scripting into your Power BI projects, allowing you to build data models, transform data, and create rich visualizations. Learn practical techniques to make your Power BI dashboards more interactive and insightful.
What this Book will help me do
Master the integration of Python and R scripts into Power BI to enhance its functionality.
Learn to implement advanced data transformations and enrichments using external APIs.
Create advanced visualizations and custom visuals with R for improved analytics.
Perform advanced data analysis including handling missing data using Python and R.
Leverage machine learning techniques within Power BI projects to extract actionable insights.
Author(s)
Zavarella is a data science expert and renowned author specializing in data analytics and visualization tools. With years of experience working with Power BI, Python, and R in diverse data-driven projects, Zavarella offers a unique perspective on enhancing Power BI capabilities. Passionate about teaching, they craft clear and impactful tutorials for learners.
Who is it for?
This book is perfect for business intelligence professionals, data scientists, and business analysts who already use Power BI and want to augment its features with Python and R. If you have a foundational understanding of Power BI and some basic familiarity with Python and R, this book will help you explore their combined potential for advanced analytics.

Tableau for Business Users: Learn to Automate and Simplify Dashboards for Better Decision Making

Learn Tableau by working through concrete examples and issues that you are likely to face in your day-to-day work. Author Shankar Arul starts by teaching you the fundamentals of data analytics before moving on to the core concepts of Tableau. You will learn how to create calculated fields, and about the currently available calculation functionalities in Tableau, including Basic Expressions, Level of Detail (LOD) Expressions, and Table Calculations. As the book progresses, you’ll be walked through comparisons and trend calculations using tables. A concluding chapter on dashboarding will show you how to build actionable dashboards to communicate analysis and visualizations. You’ll also see how Tableau can complement and communicate with Excel. After completing this book, you will be ready to tackle the challenges of data analytics using Tableau without getting bogged down by the technicalities of the tool.
What Will You Learn
Master the core concepts of Tableau
Automate and simplify dashboards to help business users
Understand the basics of data visualization techniques
Leverage powerful features such as parameters, table calculations, level of detail expressions, and more
Who is This book For
Business analysts, data analysts, as well as financial analysts.

Managing and Visualizing Your BIM Data

Managing and Visualizing Your BIM Data is an essential guide for AEC professionals who want to harness the power of data to enhance their projects. Designed with a hands-on approach, this book delves into using Autodesk Dynamo for data collection and Microsoft Power BI for creating insightful dashboards. By the end, readers will be adept at connecting BIM models to interactive visualizations.
What this Book will help me do
Gain a deep understanding of data collection workflows in Autodesk Dynamo.
Learn to connect Building Information Modeling (BIM) data to Power BI dashboards.
Master the basics and advanced features of Dynamo for BIM data management.
Create dynamic and visually appealing Power BI dashboards for AEC projects.
Explore real-world use cases with expert-guided hands-on examples.
Author(s)
The authors, Pellegrino, Bottiglieri, Crump, Pieper, and Touil, are experienced professionals in the AEC and software development industries. With extensive backgrounds in Building Information Modeling (BIM) and data visualization, they bring practical insights combined with a passion for teaching. Their approach ensures readers not only learn the tools but also understand the reasoning behind best practices.
Who is it for?
This book is ideal for BIM managers and coordinators, design technology managers, and other Architecture, Engineering, and Construction (AEC) professionals. Readers with a foundational knowledge of BIM will find it particularly beneficial for enhancing their data analysis and reporting capabilities. If you're aiming to elevate your skill set in managing BIM data and creating impactful visualizations, this guide is for you.

Serverless Analytics with Amazon Athena

Delve into the serverless world of Amazon Athena with the comprehensive book 'Serverless Analytics with Amazon Athena'. This guide introduces you to the power of Athena, showing you how to efficiently query data in Amazon S3 using SQL without the hassle of managing infrastructure. With clear instructions and practical examples, you'll master querying structured, unstructured, and semi-structured data seamlessly.
What this Book will help me do
Effectively query and analyze both structured and unstructured data stored in S3 using Amazon Athena.
Integrate Athena with other AWS services to create powerful, secure, and cost-efficient data workflows.
Develop ETL pipelines and machine learning workflows leveraging Athena's compatibility with AWS Glue.
Monitor and troubleshoot Athena queries for consistent performance and build scalable serverless data solutions.
Implement security best practices and optimize costs when managing your Athena-driven data solutions.
Author(s)
Virtuoso, along with co-authors Mert Turkay Hocanin and Wishnick, brings a wealth of experience in cloud solutions, serverless technologies, and data engineering. They excel in demystifying complex technical topics and have a passion for empowering readers with practical skills and knowledge.
Who is it for?
This book is tailored for business intelligence analysts, application developers, and system administrators who want to harness Amazon Athena for seamless, cost-efficient data analytics. It suits individuals with basic SQL knowledge looking to expand their capabilities in querying and processing data. Whether you're managing growing datasets or building data-driven applications, this book provides the know-how to get it right.

High Performance MySQL, 4th Edition

How can you realize MySQL's full power? With High Performance MySQL, you'll learn advanced techniques for everything from setting service-level objectives to designing schemas, indexes, and queries to tuning your server, operating system, and hardware to achieve your platform's full potential. This guide also teaches database administrators safe and practical ways to scale applications through replication, load balancing, high availability, and failover. Updated to reflect recent advances in cloud- and self-hosted MySQL, InnoDB performance, and new features and tools, this revised edition helps you design a relational data platform that will scale with your business. You'll learn best practices for database security along with hard-earned lessons in both performance and database stability.
Dive into MySQL's architecture, including key facts about its storage engines
Learn how server configuration works with your hardware and deployment choices
Make query performance part of your software delivery process
Examine enhancements to MySQL's replication and high availability
Compare different MySQL offerings in managed cloud environments
Explore MySQL's full stack optimization from application-side configuration to server tuning
Turn traditional database management tasks into automated processes

Data Science Bookcamp

Learn data science with Python by building five real-world projects! Experiment with card game predictions, tracking disease outbreaks, and more, as you build a flexible and intuitive understanding of data science.
In Data Science Bookcamp you will learn:
Techniques for computing and plotting probabilities
Statistical analysis using SciPy
How to organize datasets with clustering algorithms
How to visualize complex multi-variable datasets
How to train a decision tree machine learning algorithm
In Data Science Bookcamp you’ll test and build your knowledge of Python with the kind of open-ended problems that professional data scientists work on every day. Downloadable data sets and thoroughly-explained solutions help you lock in what you’ve learned, building your confidence and making you ready for an exciting new data science career.
About the Technology
A data science project has a lot of moving parts, and it takes practice and skill to get all the code, algorithms, datasets, formats, and visualizations working together harmoniously. This unique book guides you through five realistic projects, including tracking disease outbreaks from news headlines, analyzing social networks, and finding relevant patterns in ad click data.
About the Book
Data Science Bookcamp doesn’t stop with surface-level theory and toy examples. As you work through each project, you’ll learn how to troubleshoot common problems like missing data, messy data, and algorithms that don’t quite fit the model you’re building. You’ll appreciate the detailed setup instructions and the fully explained solutions that highlight common failure points. In the end, you’ll be confident in your skills because you can see the results.
What's Inside
Web scraping
Organize datasets with clustering algorithms
Visualize complex multi-variable datasets
Train a decision tree machine learning algorithm
About the Reader
For readers who know the basics of Python. No prior data science or machine learning skills required.
About the Author
Leonard Apeltsin is the Head of Data Science at Anomaly, where his team applies advanced analytics to uncover healthcare fraud, waste, and abuse.
Quotes
Valuable and accessible… a solid foundation for anyone aspiring to be a data scientist. - Amaresh Rajasekharan, IBM Corporation
Really good introduction of statistical data science concepts. A must-have for every beginner! - Simone Sguazza, University of Applied Sciences and Arts of Southern Switzerland
A full-fledged tutorial in data science including common Python libraries and language tricks! - Jean-François Morin, Laval University
This book is a complete package for understanding how the data science process works end to end. - Ayon Roy, Internshala

Storage as a Service Offering Guide

IBM® Storage as a Service (STaaS) extends your hybrid cloud experience with a new flexible consumption model enabled for both your on-premises and hybrid cloud infrastructure needs, giving you the agility, cash flow efficiency, and services of cloud storage with the flexibility to dynamically scale up or down and only pay for what you use beyond the minimal capacity. This IBM Redpaper provides a detailed introduction to the IBM STaaS service. The paper is targeted for data center managers and storage administrators.

IBM HyperSwap and Multi-site HA/DR for IBM FlashSystem A9000 and A9000R

IBM® HyperSwap® is the high availability (HA) solution that provides continuous data availability in case of hardware failure, power failure, connectivity failure, or disasters. The HyperSwap capability is available for IBM FlashSystem® A9000 and IBM FlashSystem A9000R, starting with software version 12.2.1. Version 12.3 introduces a function that combines HyperSwap and Asynchronous replication, which creates a solution that entails HA and Disaster Recovery (DR). One side of the HyperSwap pair has an active async link to the third system, and the other side has a standby link. Known as Multi-site HA/DR, this configuration provides HyperSwap active-active HA while keeping data mirrored to a third copy to ensure two levels of business continuity. This IBM Redpaper™ publication gives a broad understanding of the architecture, design, and implementation of HyperSwap and Multi-site HA/DR solution. It also discusses and illustrates various use cases pertaining to their use and functionality. This paper is intended for those users who want to deploy solutions that take advantage of HyperSwap and Multi-site HA/DR for FlashSystem A9000 and A9000R.

Expert Oracle Database Architecture: Techniques and Solutions for High Performance and Productivity

Now in its fourth edition and covering Oracle Database 21c, this best-selling book continues to bring you some of the best thinking on how to apply Oracle Database to produce scalable applications that perform well and deliver correct results. Tom Kyte and Darl Kuhn share a simple philosophy: "you can treat Oracle as a black box and just stick data into it, or you can understand how it works and exploit it as a powerful computing environment." If you choose the latter, then you’ll find that there are few information management problems that you cannot solve quickly and elegantly. This fully revised fourth edition covers the developments and new features up to Oracle Database 21c. Up-to-date features are covered for tables, indexes, data types, sequences, partitioning, data loading, temporary tables, and more. All the examples are demonstrated using modern techniques and are executed in container and pluggable databases. The book’s proof-by-example approach encourages you to let evidence be your guide. Try something. See the result. Understand why the result is what it is. Apply your newfound knowledge with confidence. The book covers features by explaining how each one works, how to implement software using it, and the common pitfalls associated with it. Don’t treat Oracle Database as a black box. Get this book. Dive deeply into Oracle Database’s most powerful features that many do not invest the time to learn about. Set yourself apart from your competition and turbo-charge your career.
What You Will Learn
Identify and effectively resolve application performance issues and bottlenecks
Architect systems to leverage the full power and feature set of Oracle’s database engine
Configure a database to maximize the use of memory structures and background processes
Understand internal locking and latching technology and how it impacts your system
Proactively recommend best practices around performance for table and index structures
Take advantage of advanced features such as table partitioning and parallel execution
Who This Book Is For
Oracle developers and Oracle DBAs. If you’re a developer and want a stronger understanding of Oracle features and architecture that will enable your applications to scale regardless of the workload, this book is for you. If you’re a DBA and want to intelligently work with developers to design applications that effectively leverage Oracle technology, then look no further.

Kafka: The Definitive Guide, 2nd Edition

Every enterprise application creates data, whether it consists of log messages, metrics, user activity, or outgoing messages. Moving all this data is just as important as the data itself. With this updated edition, application architects, developers, and production engineers new to the Kafka streaming platform will learn how to handle data in motion. Additional chapters cover Kafka's AdminClient API, transactions, new security features, and tooling changes. Engineers from Confluent and LinkedIn responsible for developing Kafka explain how to deploy production Kafka clusters, write reliable event-driven microservices, and build scalable stream processing applications with this platform. Through detailed examples, you'll learn Kafka's design principles, reliability guarantees, key APIs, and architecture details, including the replication protocol, the controller, and the storage layer.
You'll examine:
Best practices for deploying and configuring Kafka
Kafka producers and consumers for writing and reading messages
Patterns and use-case requirements to ensure reliable data delivery
Best practices for building data pipelines and applications with Kafka
How to perform monitoring, tuning, and maintenance tasks with Kafka in production
The most critical metrics among Kafka's operational measurements
Kafka's delivery capabilities for stream processing systems

Artificial Intelligence and Machine Learning in Business Management

The focus of this book is to introduce Artificial Intelligence (AI) and Machine Learning (ML) technologies into the context of Business Management. The book gives insights into the implementation and impact of AI and ML to business leaders, managers, technology developers, and implementers.

Essential PySpark for Scalable Data Analytics

Dive into the world of scalable data processing with 'Essential PySpark for Scalable Data Analytics'. This book is a comprehensive guide that helps beginners understand and utilize PySpark to process, analyze, and draw insights from large datasets effectively. With hands-on tutorials and clear explanations, you will gain the confidence to tackle big data analytics challenges.
What this Book will help me do
Understand and apply the distributed computing paradigm for big data.
Learn to perform scalable data ingestion, cleansing, and preparation using PySpark.
Create and utilize data lakes and the Lakehouse paradigm for efficient data storage and access.
Develop and deploy machine learning models with scalability in mind.
Master real-time analytics pipelines and create impactful data visualizations.
Author(s)
Nudurupati is an experienced data engineer and educator, specializing in distributed systems and big data technologies. With years of practical experience in the field, Nudurupati brings a clear and approachable teaching style to technical topics. Passionate about empowering readers, the author has designed this book to be both practical and inspirational for aspiring data practitioners.
Who is it for?
This book is ideal for data professionals including data scientists, engineers, and analysts looking to scale their data analytics processes. It assumes familiarity with basic data science concepts and Python, as well as some experience with SQL-like data analysis. This is particularly suitable for individuals aiming to expand their knowledge in distributed computing and PySpark to handle big data challenges. Achieving scalable and efficient data solutions is at the core of this guide.

Maximizing Tableau Server

Maximizing Tableau Server guides you on how to make the most of your Tableau Server experience. You'll learn to organize, share, and interact with dashboards and data sources effectively. This book empowers you to enhance your productivity with Tableau Server and achieve seamless collaboration with your team.
What this Book will help me do
Navigate Tableau Server's interface to locate and customize content easily.
Manage and organize Tableau Server content for efficient collaboration.
Share, download, and interact with dashboards, enhancing user productivity.
Automate tasks such as subscriptions and data refresh schedules.
Apply best practices to optimize dashboard performance and usability.
Author(s)
Sarsfield and Locker are seasoned data professionals with extensive knowledge of Tableau. They have guided many organizations in utilizing Tableau Server to its full potential. Their practical insights and step-by-step approach demystify Tableau Server for readers of all backgrounds.
Who is it for?
This book is perfect for BI developers, data analysts, and professionals who are new to Tableau Server. If you're aiming to streamline the way you handle and share dashboards and want actionable advice on enhancing efficiency, this book is ideal for you. Basic familiarity with web navigation is all that is needed.

IBM DS8000 and IBM Z Synergy DS8000: Release 9.2 and z/OS 2.5

IBM® Z has a close and unique relationship to its storage. Over the years, improvements to the Z processors and storage software, the disk storage systems, and their communication architecture consistently reinforced this synergy. This IBM Redpaper publication summarizes and highlights the various aspects, advanced functions, and technologies that are often pioneered by IBM, and that make the IBM Z® and the IBM DS8000 products an ideal combination. This paper is intended for users who have some familiarity with IBM Z and the IBM DS8000® series and want a condensed but comprehensive overview of the synergy items up to the IBM z15™ server with z/OS v2.5 and the IBM DS8900 Release 9.2 firmware.

Practical Biomedical Signal Analysis Using MATLAB®, 2nd Edition

Fully updated and with exclusive new content, this second edition presents a coherent treatment of various signal processing methods and applications. The book not only covers the current techniques of biomedical signal processing, but it also offers guidance on which methods are appropriate for a given task and different types of data.

Automating Analytics

Do you have a method for seeing all the data that passes through your organization? The need to democratize access to data and analytics, automate complex and tedious business processes, and amplify human output has led to analytic process automation (APA). Thousands of organizations across nearly every business and industry vertical use this software to accelerate data-driven business outcomes. This report examines the power of APA using technology, business, and real-world examples. If you're a technical business, analytics, or business intelligence leader, you'll learn how to use APA to tackle complex problems, increase productivity, and improve efficiency. You'll discover what APA means for your business and for you.
This report explores:
The importance of data: understand how data is transformed into information and insights for making business decisions
Gathering data with APA: learn how APA differs from your current process
Data democratization: grant data access to employees and empower them to analyze specific tasks and performance
Data reporting: learn how APA blends data tables, fields, and values to help you search for insights at a granular level
Analytics: explore new tools that use AI and ML to improve the analytic process

Beginning Hibernate 6: Java Persistence from Beginner to Pro

Get started with Hibernate, an open source Java persistence layer, and gain a clear introduction to the current standard for object-relational persistence in Java. This updated edition includes the new Hibernate 6.0 framework, which covers new configuration, new object relational mapping changes, and enhanced integration with the more general Spring Boot, Quarkus, and other Java frameworks. The book keeps its focus on Hibernate without wasting time on nonessential third-party tools, so you’ll be able to immediately start building transaction-based engines and applications. Experienced authors Joseph Ottinger with Dave Minter and Jeff Linwood provide more in-depth examples than any other book for Hibernate beginners. They present their material in a lively, example-based manner, not in a dry, theoretical, hard-to-read fashion.
What You'll Learn
Build enterprise Java-based transaction-type applications that access complex data with Hibernate
Work with Hibernate 6 using a present-day build process
Integrate into the persistence life cycle
Search and query with the new version of Hibernate
Keep track of versioned data with Hibernate Envers
Who This Book Is For
Programmers experienced in Java with databases (the traditional, or connected, approach), but new to open-source, lightweight Hibernate.

Modern Analytics Platforms

From a global pandemic to extreme weather, the events of 2020 and 2021 have caused organizations to make quick and constant adjustments to their strategy and operations. This transformation is likely to continue and have a major impact on analytics. Not only do respondents to Experian's annual Global Data Management survey confirm more demand for data insights, but most of them also believe the lack of agility hurt their organization's responses to fast-changing business needs. With this O'Reilly report, you'll learn how organizations have begun to take new approaches to analytics for business reinvention and digital transformation. Chief analytics and data officers and data analytics, data science, and data visualization leaders will explore converged analytics and find out how it differs from legacy and current analytics approaches. You'll see where your organization stands in its journey to convergence, and what you need to do next.
This report helps you:
Examine how three organizations in different industries and with different objectives have benefited from modern analytics
Learn how analytics has evolved to support greater business agility at scale
Examine the alignment of people, processes, tools, and data in converged analytics
Learn the five stages of analytical competition and six dimensions for benchmarking maturity
Explore practices that you can adopt to improve your analytics capabilities and your agility

Optimize Video Streaming Delivery

Media content today is increasingly streamed video, and this trend will only grow as the speed of consumer internet and video quality improve. Traditional video streaming platforms, such as Netflix and Hulu, now account for only a portion of this content as more and more live events are streamed over the internet. And consumer-generated content on video-based social networks such as Twitch and TikTok is now more accessible and gaining popularity. This report focuses on the current state of video delivery, including the challenges content providers face and the various solutions they're pursuing. The findings in this report are based on a recent survey conducted by Edgecast, a content delivery network (CDN) that helps companies accelerate and deliver static and dynamic content to end users around the world.
You'll explore:
The current state of video streaming, how it works, and how streams are delivered
Responses from a survey of CDN users that produce video streams
How content providers are addressing recent video streaming challenges
How the information in this report can help you identify KPIs