talk-data.com talk-data.com

Topic

Analytics

data_analysis insights metrics

4552

tagged

Activity Trend

398 peak/qtr
2020-Q1 2026-Q1

Activities

4552 activities · Newest first

Big Data Visualization

Dive into 'Big Data Visualization' and uncover how to tackle the challenges of visualizing vast quantities of complex data. With a focus on scalable and dynamic techniques, this guide explores the nuances of effective data analysis. You'll master tools and approaches to display, interpret, and communicate data in impactful ways. What this Book will help me do Understand the fundamentals of big data visualization, including unique challenges and solutions. Explore practical techniques for using D3 and Python to visualize and detect anomalies in big data. Learn to leverage dashboards like Tableau to present data insights effectively. Address and improve data quality issues to enhance analysis accuracy. Gain hands-on experience with real-world use cases for tools such as Hadoop and Splunk. Author(s) James D. Miller is an IBM-certified expert specializing in data analytics and visualization. With years of experience handling massive datasets and extracting actionable insights, he is dedicated to sharing his expertise. His practical approach is evident in how he combines tool mastery with a clear understanding of data complexities. Who is it for? This book is designed for data analysts, data scientists, and others involved in interpreting and presenting big datasets. Whether you are a beginner looking to understand big data visualization or an experienced professional seeking advanced tools and techniques, this guide suits your needs perfectly. A foundational knowledge in programming languages like R and big data platforms such as Hadoop is recommended to maximize your learning.

Mastering Elastic Stack

Mastering Elastic Stack is your complete guide to advancing your data analytics expertise using the ELK Stack. With detailed coverage of Elasticsearch, Logstash, Kibana, Beats, and X-Pack, this book equips you with the skills to process and analyze any type of data efficiently. Through practical examples and real-world scenarios, you'll gain the ability to build end-to-end pipelines and create insightful dashboards. What this Book will help me do Build and manage log pipelines using Logstash, Beats, and Elasticsearch for real-time analytics. Develop advanced Kibana dashboards to visualize and interpret complex datasets. Efficiently utilize X-Pack features for alerting, monitoring, and security in the Elastic Stack. Master plugin customization and deployment for a tailored Elastic Stack environment. Apply Elastic Stack solutions to real-world cases for centralized logging and actionable insights. Author(s) The authors, None Kumar Gupta and None Gupta, are experienced technologists who have spent years working at the forefront of data processing and analytics. They are well-versed in Elasticsearch, Logstash, Kibana, and the Elastic ecosystem, having worked extensively in enterprise environments where these tools have transformed operations. Their passion for teaching and thorough understanding of the tools culminate in this comprehensive resource. Who is it for? The ideal reader is a developer already familiar with Elasticsearch, Logstash, and Kibana who wants to deepen their understanding of the stack. If you're involved in creating scalable data pipelines, analyzing complex datasets, or looking to implement centralized logging solutions in your work, this book is an excellent resource. It bridges the gap from intermediate to expert knowledge, allowing you to use the Elastic Stack effectively in various scenarios. Whether you are transitioning from a beginner or enhancing your skill set, this book meets your needs.

The Data Science Handbook

A comprehensive overview of data science covering the analytics, programming, and business skills necessary to master the discipline Finding a good data scientist has been likened to hunting for a unicorn: the required combination of technical skills is simply very hard to find in one person. In addition, good data science is not just rote application of trainable skill sets; it requires the ability to think flexibly about all these areas and understand the connections between them. This book provides a crash course in data science, combining all the necessary skills into a unified discipline. Unlike many analytics books, computer science and software engineering are given extensive coverage since they play such a central role in the daily work of a data scientist. The author also describes classic machine learning algorithms, from their mathematical foundations to real-world applications. Visualization tools are reviewed, and their central importance in data science is highlighted. Classical statistics is addressed to help readers think critically about the interpretation of data and its common pitfalls. The clear communication of technical results, which is perhaps the most undertrained of data science skills, is given its own chapter, and all topics are explained in the context of solving real-world data problems. The book also features: • Extensive sample code and tutorials using Python™ along with its technical libraries • Core technologies of “Big Data,” including their strengths and limitations and how they can be used to solve real-world problems • Coverage of the practical realities of the tools, keeping theory to a minimum; however, when theory is presented, it is done in an intuitive way to encourage critical thinking and creativity • A wide variety of case studies from industry • Practical advice on the realities of being a data scientist today, including the overall workflow, where time is spent, the types of datasets worked on, and the skill sets needed The Data Science Handbook is an ideal resource for data analysis methodology and big data software tools. The book is appropriate for people who want to practice data science, but lack the required skill sets. This includes software professionals who need to better understand analytics and statisticians who need to understand software. Modern data science is a unified discipline, and it is presented as such. This book is also an appropriate reference for researchers and entry-level graduate students who need to learn real-world analytics and expand their skill set. FIELD CADY is the data scientist at the Allen Institute for Artificial Intelligence, where he develops tools that use machine learning to mine scientific literature. He has also worked at Google and several Big Data startups. He has a BS in physics and math from Stanford University, and an MS in computer science from Carnegie Mellon.

Scala: Guide for Data Science Professionals

Scala will be a valuable tool to have on hand during your data science journey for everything from data cleaning to cutting-edge machine learning About This Book Build data science and data engineering solutions with ease An in-depth look at each stage of the data analysis process — from reading and collecting data to distributed analytics Explore a broad variety of data processing, machine learning, and genetic algorithms through diagrams, mathematical formulations, and source code Who This Book Is For This learning path is perfect for those who are comfortable with Scala programming and now want to enter the field of data science. Some knowledge of statistics is expected. What You Will Learn Transfer and filter tabular data to extract features for machine learning Read, clean, transform, and write data to both SQL and NoSQL databases Create Scala web applications that couple with JavaScript libraries such as D3 to create compelling interactive visualizations Load data from HDFS and HIVE with ease Run streaming and graph analytics in Spark for exploratory analysis Bundle and scale up Spark jobs by deploying them into a variety of cluster managers Build dynamic workflows for scientific computing Leverage open source libraries to extract patterns from time series Master probabilistic models for sequential data In Detail Scala is especially good for analyzing large sets of data as the scale of the task doesn’t have any significant impact on performance. Scala’s powerful functional libraries can interact with databases and build scalable frameworks — resulting in the creation of robust data pipelines. The first module introduces you to Scala libraries to ingest, store, manipulate, process, and visualize data. Using real world examples, you will learn how to design scalable architecture to process and model data — starting from simple concurrency constructs and progressing to actor systems and Apache Spark. After this, you will also learn how to build interactive visualizations with web frameworks. Once you have become familiar with all the tasks involved in data science, you will explore data analytics with Scala in the second module. You’ll see how Scala can be used to make sense of data through easy to follow recipes. You will learn about Bokeh bindings for exploratory data analysis and quintessential machine learning with algorithms with Spark ML library. You’ll get a sufficient understanding of Spark streaming, machine learning for streaming data, and Spark graphX. Armed with a firm understanding of data analysis, you will be ready to explore the most cutting-edge aspect of data science — machine learning. The final module teaches you the A to Z of machine learning with Scala. You’ll explore Scala for dependency injections and implicits, which are used to write machine learning algorithms. You’ll also explore machine learning topics such as clustering, dimentionality reduction, Naïve Bayes, Regression models, SVMs, neural networks, and more. This learning path combines some of the best that Packt has to offer into one complete, curated package. It includes content from the following Packt products: Scala for Data Science, Pascal Bugnion Scala Data Analysis Cookbook, Arun Manivannan Scala for Machine Learning, Patrick R. Nicolas Style and approach A complete package with all the information necessary to start building useful data engineering and data science solutions straight away. It contains a diverse set of recipes that cover the full spectrum of interesting data analysis tasks and will help you revolutionize your data analysis skills using Scala. Downloading the example code for this book. You can download the example code files for all Packt books you have purchased from your account at http://www.PacktPub.com. If you purchased this book elsewhere, you can visit http://www.PacktPub.com/support and register to have the code file.

Mastering Elasticsearch 5.x - Third Edition

This comprehensive guide dives deep into the functionalities of Elasticsearch 5, the widely-used search and analytics engine. Leveraging the power of Apache Lucene, this book will help you understand advanced concepts like querying, indexing, and cluster management to build efficient and scalable search solutions. What this Book will help me do Master advanced features of Elasticsearch such as text scoring, sharding, and aggregation. Understand how to handle big data efficiently using Elasticsearch's architecture. Learn practical implementation techniques for Elasticsearch features through hands-on examples. Develop custom plugins for Elasticsearch to tailor its functionalities to specific needs. Scale and optimize Elasticsearch clusters for high performance in production environments. Author(s) Bharvi Dixit is an experienced software engineer and a recognized expert in implementing Elasticsearch solutions. With a strong background in distributed systems and database management, Bharvi's writing is informed by real-world experience and a focus on practical applications. Who is it for? This book is ideal for developers and data engineers with existing experience in Elasticsearch who wish to deepen their knowledge. It serves as a valuable resource for professionals tasked with creating scalable search applications. A working understanding of Elasticsearch basics and query DSL is recommended to fully benefit from this guide.

Wednesday at 9 AM Pacific

Update: Talk is available @ https://www.voiceamerica.com/episode/97300/the-quantum-disruption-in-global-business-driven-by-the-big-analytics

The Quantum disruption in Global Business driven by The Big Analytics Listen to Vishal Kumar, An Author, Innovator, and a Mentor in discussion on one of the most important and relevant subjects of the modern times: The Big Analytics, and how it is changing the landscape of Global Business

Wednesday at 9 AM Pacific Time on VoiceAmerica Business Channel

Featured Guest

Vishal Kumar

Vishal Kumar is CEO & President of AnalyticsWeek. He is a leading advocate for data-driven decision making. He is rated as top 100 global influencers to follow in data analytics by leading research organizations. He has published two books on the topics of analytics. Currently, his work involves using Artificial Intelligence to prepare the workforce for the future. Vishal has been a keynote speaker at various international conferences. He sits as advisor to various analytics startups.

Originally Posted @ VoiceAmerica

About #Podcast:

FutureOfData podcast is a conversation starter to bring leaders, influencers and lead practitioners to come on show and discuss their journey in creating the data driven future.

Wanna Join? If you or any you know wants to join in, Register your interest @ http://play.analyticsweek.com/guest/

Want to sponsor? Email us @ [email protected]

Keywords: FutureOfData Data Analytics Leadership Podcast Big Data Strategy

Geospatial Data and Analysis

Geospatial data, or data with location information, is generated in huge volumes every day by billions of mobile phones, IoT sensors, drones, nanosatellites, and many other sources in an unending stream. This practical ebook introduces you to the landscape of tools and methods for making sense of all that data, and shows you how to apply geospatial analytics to a variety of issues, large and small. Authors Aurelia Moser, Jon Bruner, and Bill Day provide a complete picture of the geospatial analysis options available, including low-scale commercial desktop GIS tools, medium-scale options such as PostGIS and Lucene-based searching, and true big data solutions built on technologies such as Hadoop. You’ll learn when it makes sense to move from one type of solution to the next, taking increased costs and complexity into account. Explore the structure of basic webmaps, and the challenges and constraints involved when working with geo data Dive into low- to medium-scale mapping tools for use in backend and frontend web development Focus on tools for robust medium-scale geospatial projects that don’t quite justify a big data solution Learn about innovative platforms and software packages for solving issues of processing and storage of large-scale data Examine geodata analysis use cases, including disaster relief, urban planning, and agriculture and environmental monitoring

Learning Kibana 5.0

Learning Kibana 5.0 is your gateway to mastering the art of data visualization using the powerful features of the Kibana platform. This book guides you through the process of creating stunning interactive dashboards and making data-driven insights accessible with real-time visualizations. Whether you're new to the Elastic stack or seeking to refine your expertise, this book equips you to harness Kibana's full potential. What this Book will help me do Build robust, real-time dashboards in Kibana to visualize complex datasets efficiently. Leverage Timelion to perform time-series data analysis and create metrics-based dashboards. Explore advanced analytics using the Graph plugin to uncover relationships and correlations in data. Learn how to create and deploy custom plugins to tailor Kibana to specific project needs. Understand how to use the Elastic stack to monitor, analyze, and optimize various types of data flows. Author(s) Bahaaldine Azarmi is a seasoned expert in the Elastic stack, known for his dedication to making complex technical topics approachable and practical. With years of experience in data analytics and software development, Bahaaldine shares not only his technical expertise but also his passion for helping professionals achieve their goals through clear, actionable guidance. His writing emphasizes hands-on learning and practical application. Who is it for? This book is perfect for developers, data visualization engineers, and data scientists who aim to hone their skills in data visualization and interactive dashboard development. It assumes a basic understanding of Elasticsearch and Logstash to maximize its practicality. If you aim to advance your career by learning how to optimize data architecture and solve real-world problems using the Elastic stack, this book is ideal for you.

podcast_episode
by Val Kroll , Julie Hoyer , Tim Wilson (Analytics Power Hour - Columbus (OH) , Moe Kiss (Canva) , Michael Helbling (Search Discovery)

Have you ever been to a really great analytics conference and had just one great conversation after another with other attendees? We have! And, for this episode, we decided to head up into the hills above Budapest and try to bring that experience to you. With a range of fine and foreign libations in hand, a crackling fire toasting our backsides, and a roaming handheld microphone, we asked the questions, and the Superweek 2017 attendees provided the answers. Except when the audience asked the questions...for an episode releasing four weeks hence!   For complete show notes, including links to items mentioned in this episode and a transcript of the show, visit the show page.

Elasticsearch 5.x Cookbook - Third Edition

Elasticsearch 5.x Cookbook is a comprehensive guide that teaches you how to leverage the full power of Elasticsearch for high-performance search and analytics. Through step-by-step recipes, you'll explore deployment, query building, plugin integration, and advanced analytics, ensuring you can manage and scale Elasticsearch like a pro. What this Book will help me do Understand and deploy complex Elasticsearch cluster topologies for optimal performance. Create tailored mappings to gain finer control over data indexing and retrieval. Design and execute advanced queries and analytics using Elasticsearch capabilities. Integrate Elasticsearch with popular programming languages and big data platforms. Monitor and improve Elasticsearch cluster health using the best practices and tools. Author(s) Alberto Paro is a seasoned software engineer and data scientist with extensive experience in distributed systems and search technologies. Having worked on numerous search-related projects, he brings practical, real-world insights to his writing. Alberto is passionate about teaching and simplifying complex concepts, making this book both approachable and expertly detailed. Who is it for? This book is ideal for developers or data engineers seeking to utilize Elasticsearch for advanced search and analytics tasks. If you have some prior knowledge of JSON and programming concepts, particularly Java, you will benefit most from this material. Whether you're looking to integrate Elasticsearch into your systems or to optimize its usage, this book caters to your needs.

Getting the most out of your optimization efforts means understanding the data you’re collecting, from analytics implementation, to report setup, to analysis techniques. Using that data to drive your Optimization, and eventually Personalization efforts, will help you to get the most out of your marketing dollars. In this session, Krista will give you several tips for using your analytics data to identify more opportunities, prioritize your efforts, and begin to personalize your user experience.

talk
by Caleb Whitmore (Analytics Pros, USA)

A fireside 'ask us anything' with Caleb and Charlotte Whitmore, founders of Analytics Pros. From questions about analytics to building a globally leading team and company all while building a family and life together, just about anything is fair game.

The Digital Analytics Power Hour podcast just turned two years old and, since it was inspired by the informal conversations that occur after hours at analytics conferences, Michael and Tim will be recording an episode as a fireside chat at Superweek! The podcast regularly has guests who get peppered with questions from Michael and Tim for 45-50 minutes. For this session, they will be altering the format slightly and peppering selected speakers and attendees with one question each -- the one question they've always wanted to ask the person, a question that came out of one of the sessions, or, possibly, a question that they've heard the person answer so spectacularly in the past that they wanted to ask it again to record the response for posterity!

Miroslav will present Analytics from a completely different perspective. Some things he will reveal about the life and job of an Analytics / data expert will be painful, honest, interesting and most of all, funny. After his Stand-up, you will look at attribution, data collection and Analytics dimensions / metrics, from a new point of view. Please, don't expect everything will be 100% accurate - after all, it's a stand-up.

Correct and accurate measurement of a hybrid app with GA is hard. The Native app outer can be measured in GA using app measurement and the mobile web content can be measured using normal universal analytics GA. The problem is that you can't tell where hybrid app users came from in the mobile web GA - there is no useable traffic source. You can't measure the app and web in the same property but you have hybrid app mobile web and pure mobile web in the mobile web property. This sounds like a complex mess and it is but we've solved it and we'll explain how with a real live demo and technical walk through with Q & A.

Enhanced Ecommerce introduced some new metrics that explain user behavior – something Google Analytics should be used for. Robert, a corporate web analyst or the EEEO (Enhanced Ecommerce Executive Officer) will show us how he uses Cart-to-Detail Rate, Buy-to-Detail Rate and some calculated metrics for better campaign as well as warehouse planning in the EEEO (Enhanced Ecommerce Enabled Organization).

In this session Ron Luhtanen and Petri Mertanen will tell how Annalect Finland is using machine learning in creating predictive models, which are explaining and predicting user behavior from web analytics data. By default quite many web analytics systems report behavioral data on an aggregate level. This is inconvenient for in depth analysis, because it limits the methods to only different regression models. During the session you will hear how browser and session level data is used with modeling algorithm. We will wrap up the session with benefits and outputs of the modeling.

Somewhere along the spectrum of "logging into Google Analytics" and "the machines are in control" is the world of the power analyst who interacts with the data on the fly, applies statistics to large data sets, and develops interactive visualizations that go well beyond the capabilities of Excel. Those power analysts are operating on the fringes of the domain of the "data scientist" -- a role for which no one can really agree on a concrete definition! In this session, Tim -- who has never claimed to be and never will claim to be a data scientist -- will share what he has learned from trying to understand the scope and nature of that role. And, beyond that, how he has grown as a digital analyst, expanded his skills to "program with data" with R, and increased his value to the organizations with which he works as a result.