talk-data.com

Topic

Analytics

data_analysis insights metrics

4552 tagged

Activity Trend: 398 peak/qtr, 2020-Q1 to 2026-Q1

Activities

4552 activities · Newest first

Apache Spark Machine Learning Blueprints

In 'Apache Spark Machine Learning Blueprints', you'll explore how to create sophisticated and scalable machine learning projects using Apache Spark. This project-driven guide covers practical applications including fraud detection, customer analysis, and recommendation engines, helping you leverage Spark's capabilities for advanced data science tasks.
What this Book will help me do:
Learn to set up Apache Spark efficiently for machine learning projects, unlocking its powerful processing capabilities.
Integrate Apache Spark with R for detailed analytical insights, empowering your decision-making processes.
Create predictive models for use cases including customer scoring, fraud detection, and risk assessment with practical implementations.
Understand and utilize Spark's parallel computing architecture for large-scale machine learning tasks.
Develop and refine recommendation systems capable of handling large user bases and datasets using Spark.
Author(s): Alex Liu is a seasoned data scientist and software developer specializing in machine learning and big data technology. With extensive experience in using Apache Spark for predictive analytics, Alex has successfully built and deployed scalable solutions across industries. Their teaching approach combines theory and practical insights, making cutting-edge technologies accessible and actionable.
Who is it for? This book is ideal for data analysts, data scientists, and developers with a foundation in machine learning who are eager to apply their knowledge in big data contexts. If you have a basic familiarity with Apache Spark and its ecosystem, and you're looking to enhance your ability to build machine learning applications, this resource is for you. It's particularly valuable for those aiming to utilize Spark for extensive data operations and gain practical, project-based insights.

IBM z13 Technical Guide

Digital business has been driving the transformation of underlying IT infrastructure to be more efficient, secure, adaptive, and integrated. Information Technology (IT) must be able to handle the explosive growth of mobile clients and employees. IT also must be able to use enormous amounts of data to provide deep and real-time insights to help achieve the greatest business impact. This IBM® Redbooks® publication addresses the IBM Mainframe, the IBM z13™. The IBM z13 is the trusted enterprise platform for integrating data, transactions, and insight. A data-centric infrastructure must always be available with 99.999% or better availability, have flawless data integrity, and be secured from misuse. It needs to be an integrated infrastructure that can support new applications. It needs to have integrated capabilities that can provide new mobile capabilities with real-time analytics delivered by a secure cloud infrastructure. IBM z13 is designed with improved scalability, performance, security, resiliency, availability, and virtualization. The superscalar design allows the z13 to deliver a record level of capacity over the prior IBM z Systems™. In its maximum configuration, z13 is powered by up to 141 client characterizable microprocessors (cores) running at 5 GHz. This configuration can run more than 110,000 millions of instructions per second (MIPS) and supports up to 10 TB of client memory. The IBM z13 Model NE1 is estimated to provide up to 40% more total system capacity than the IBM zEnterprise® EC12 (zEC12) Model HA1. This book provides information about the IBM z13 and its functions, features, and associated software support. Greater detail is offered in areas relevant to technical planning. It is intended for systems engineers, consultants, planners, and anyone who wants to understand the IBM z Systems functions and plan for their usage. It is not intended as an introduction to mainframes. Readers are expected to be generally familiar with existing IBM z Systems technology and terminology.

Streaming Architecture

More and more data-driven companies are looking to adopt stream processing and streaming analytics. With this concise ebook, you’ll learn best practices for designing a reliable architecture that supports this emerging big-data paradigm. Authors Ted Dunning and Ellen Friedman (Real World Hadoop) help you explore some of the best technologies to handle stream processing and analytics, with a focus on the upstream queuing or message-passing layer. To illustrate the effectiveness of these technologies, this book also includes specific use cases. Ideal for developers and non-technical people alike, this book describes:
Key elements in good design for streaming analytics, focusing on the essential characteristics of the messaging layer
New messaging technologies, including Apache Kafka and MapR Streams, with links to sample code
Technology choices for streaming analytics: Apache Spark Streaming, Apache Flink, Apache Storm, and Apache Apex
How stream-based architectures are helpful to support microservices
Specific use cases such as fraud detection and geo-distributed data streams
Ted Dunning is Chief Applications Architect at MapR Technologies, and active in the open source community. He currently serves as VP for Incubator at the Apache Foundation, as a champion and mentor for a large number of projects, and as committer and PMC member of the Apache ZooKeeper and Drill projects. Ted is on Twitter as @ted_dunning.
Ellen Friedman, a committer for the Apache Drill and Apache Mahout projects, is a solutions consultant and well-known speaker and author, currently writing mainly about big data topics. With a PhD in Biochemistry, she has years of experience as a research scientist and has written about a variety of technical topics. Ellen is on Twitter as @Ellen_Friedman.

podcast_episode
by Val Kroll, Julie Hoyer, Dylan Lewis (Intuit), Tim Wilson (Analytics Power Hour - Columbus (OH)), Moe Kiss (Canva), Michael Helbling (Search Discovery)

If you're like most analysts, you've probably changed jobs since the last episode of this podcast hit your earbuds two weeks ago. Or, if you haven't actually changed jobs, then you've at least been hounded by recruiters who wish you would. No matter how you look at it, digital analysts have lots of opportunities to bounce between companies at a frequent pace, and many analysts do just that. On this episode, we talk with Dylan Lewis, who has been doing digital analytics at Intuit since before there were federal taxes (give or take a few years). Give it a listen. You just might decide you need a personal board of directors! If nothing else, this episode might inspire you to check out http://careers.intuit.com, which would be ironic given the topic, but definitely understandable!

The Evolution of Analytics

Machine learning is a hot topic in business. Even data-driven organizations that have spent years developing successful data analysis platforms, with many accurate statistical models in place, are now looking into this decades-old discipline. But how can companies turn hyped opportunities for machine learning into real business value? This report examines the growing momentum of machine learning in the analytics landscape, the challenges machine learning presents to businesses, and examples of how organizations are actively seeking to incorporate modern machine learning techniques into their production data infrastructures. Authors Patrick Hall, Wen Phan, and Katie Whitson look at two companies in depth (one in healthcare and one in finance) that are seeing the real impact of machine learning. Discover how machine learning can help your organization:
Analyze and generate insights from large amounts of varied, messy, and unstructured data unfit for traditional statistical analysis
Increase the predictive accuracy beyond what was previously possible
Augment aging analytical processes and other decision-making tools

Regression Analysis Microsoft® Excel®

This is today’s most complete guide to regression analysis with Microsoft® Excel for any business analytics or research task. Drawing on 25 years of advanced statistical experience, Microsoft MVP Conrad Carlberg shows how to use Excel’s regression-related worksheet functions to perform a wide spectrum of practical analyses. Carlberg clearly explains all the theory you’ll need to avoid mistakes, understand what your regressions are really doing, and evaluate analyses performed by others. From simple correlations and t-tests through multiple analysis of covariance, Carlberg offers hands-on, step-by-step walkthroughs using meaningful examples. He discusses the consequences of using each option and argument, points out idiosyncrasies and controversies associated with Excel’s regression functions, and shows how to use them reliably in fields ranging from medical research to financial analysis to operations. You don’t need expensive software or a doctorate in statistics to work with regression analyses. Microsoft Excel has all the tools you need, and this book has all the knowledge!
Understand what regression analysis can and can’t do, and why
Master regression-based functions built into all recent versions of Excel
Work with correlation and simple regression
Make the most of Excel’s improved LINEST() function
Plan and perform multiple regression
Distinguish the assumptions that matter from the ones that don’t
Extend your analysis options by using regression instead of traditional analysis of variance
Add covariates to your analysis to reduce bias and increase statistical power

Big Data in Practice

The best-selling author of Big Data is back, this time with a unique and in-depth insight into how specific companies use big data. Big data is on the tip of everyone's tongue. Everyone understands its power and importance, but many fail to grasp the actionable steps and resources required to utilise it effectively. This book fills the knowledge gap by showing how major companies are using big data every day, from an up-close, on-the-ground perspective. From technology, media and retail, to sport teams, government agencies and financial institutions, learn the actual strategies and processes being used to learn about customers, improve manufacturing, spur innovation, improve safety and so much more. Organised for easy dip-in navigation, each chapter follows the same structure to give you the information you need quickly. For each company profiled, learn what data was used, what problem it solved and the processes put in place to make it practical, as well as the technical details, challenges and lessons learned from each unique scenario.
Learn how predictive analytics helps Amazon, Target, John Deere and Apple understand their customers
Discover how big data is behind the success of Walmart, LinkedIn, Microsoft and more
Learn how big data is changing medicine, law enforcement, hospitality, fashion, science and banking
Develop your own big data strategy by accessing additional reading materials at the end of each chapter

Apache Hive Cookbook

Apache Hive Cookbook is a comprehensive resource for mastering Apache Hive, a tool that bridges the gap between SQL and Big Data processing. Through guided recipes, you'll acquire essential skills in Hive query development, optimization, and integration with modern big data frameworks.
What this Book will help me do:
Design efficient Hive query structures for big data analytics.
Optimize data storage and query execution using partitions and buckets.
Integrate Hive seamlessly with frameworks like Spark and Hadoop.
Understand and utilize the HiveQL syntax to perform advanced analytical processing.
Implement practical solutions to secure, maintain, and scale Hive environments.
Author(s): Hanish Bansal, Saurabh Chauhan, and Shrey Mehrotra bring their extensive expertise in big data technologies and Hive to this cookbook. With years of practical experience and deep technical knowledge, they offer a collection of solutions and best practices that reflect real-world use cases. Their commitment to clarity and depth makes this book an invaluable resource for exploring Hive to its fullest potential.
Who is it for? This book is perfect for data professionals, engineers, and developers looking to enhance their capabilities in big data analytics using Hive. It caters to those with a foundational understanding of big data frameworks and some familiarity with SQL. Whether you're planning to optimize data handling or integrate Hive with other data tools, this guide helps you achieve your goals. Step into the world of efficient data analytics with Apache Hive through structured learning paths.

podcast_episode
by Val Kroll, Julie Hoyer, Tim Wilson (Analytics Power Hour - Columbus (OH)), Moe Kiss (Canva), Michael Helbling (Search Discovery)

Are you a data scientist? Have you pondered whether you're really a growth hacker? Well...get over yourself! Picking up on a debate that started onstage at eMetrics, Michael, Jim, and Tim discuss whether the role (and requisite skills) of the web analyst is undergoing a fundamental shift. You know, getting more "science-y" (if "science" is "more technical and more maths"). All in 2,852 seconds (each second of which can be pulled into R and used to build a predictive model showing the expected ROI of listening to future episodes; at least, we assume that's what a data scientist could do).

Mastering QlikView Data Visualization

"Mastering QlikView Data Visualization" is your essential guide to becoming proficient in advanced data visualization and analysis using QlikView. Through practical examples and real-world scenarios, this book enables you to create insightful and meaningful QlikView applications tailored to business needs. What this Book will help me do Design and implement advanced QlikView applications using realistic data and scenarios. Understand and fulfill business requirements across varied organizational departments. Create advanced charts and visualizations including frequency polygons and XmR charts. Integrate geographical, sentiment, and planning analysis into your QlikView models. Develop troubleshooting strategies for common QlikView data visualization challenges. Author(s) None Pover, an expert in data analytics and QlikView technologies, has extensive experience in implementing QlikView applications to address real-world business challenges. They are passionate about teaching practical solutions, ensuring readers gain actionable insights. With hands-on expertise, the author delivers clear, structured guidance in technical learning. Who is it for? If you're a QlikView developer wanting to go beyond the basics, this book is perfect for you. It is designed for individuals who have foundational knowledge of QlikView and are looking to enhance their ability to handle advanced projects. Whether you're focusing on analytics for sales, finance, or operations, you'll find this guide extremely useful.

Big Data and Business Analytics

With the increasing barrage of big data, it becomes vital for organizations to make sense of this data in a timely and effective way to improve their decision making and competitive advantage. That's where business analytics come into play. This book explores case studies from industry leaders in big data domains such as cybersecurity, marketing, finance, emergency management, healthcare, and transportation. It offers a concise guide for CEOs and senior managers, as well as for business, management, and technology students interested in this emerging field.

RapidMiner

Written by leaders in the data mining community, including the developers of the RapidMiner software, this book provides an in-depth introduction to the application of data mining and business analytics techniques and tools in scientific research, medicine, industry, commerce, and diverse other sectors. It presents the most powerful and flexible open source software solutions: RapidMiner and RapidAnalytics. The book and software tools cover all relevant steps of the data mining process. The software and their extensions can be freely downloaded at www.RapidMiner.com.

Getting Analytics Right

Ask vital questions before you dive into data
Are your big data and analytics capabilities up to par? Nearly half of the global company executives in a recent Forbes Insight/Teradata survey certainly don’t think theirs are. This new book from O’Reilly examines how things typically go wrong in the data analytics process, and introduces a question-first, data-second strategy that can help your company close the gap between being analytics-invested and truly data-driven. Authors from Tamr, Inc. share insights into why analytics projects often fail, and offer solutions based on their combined experience in engineering, architecture, product strategizing, and marketing. You’ll learn how projects often start from the wrong place, take too long, and don’t go far enough, missteps that lead to incomplete, late, or useless answers to critical business questions. Find out how their question-first, data-second approach, fueled by vastly improved data preparation platforms and cataloging software, can help you create human-machine analytics solutions designed specifically to produce better answers, faster.
Getting Analytics Right was written and presented by people at Tamr, Inc., including Nidhi Aggarwal, Product and Strategy Lead; Byron Berk, Customer Success Lead; Gideon Goldin, Senior UX Architect; Matt Holzapfel, Product Marketing; and Eliot Knudsen, Field Engineer. Tamr, a Cambridge, Massachusetts-based startup, helps companies understand and unify their disparate databases.

I know what you're thinking: they're world-class podcasters when they hide behind editing tools and autotune, but can they do it LIVE? This special recording from the final keynote spot at eMetrics has the three amigos of insight taking questions from Twitter and a live audience. There was bourbon, Jim Sterne, and a disagreement over the future of the industry - all in under 45 minutes. So, turn up the volume (seriously...because the sound levels were low and we did the best we could with a short-turnaround edit) and give it a listen!

"Guests" on the show (aka, people who asked questions who we were able to identify) included: Justin Goodman, Mike Harmanos, Rachelle Maisner, Boaz Vilozny, and KeAndre Boggess.

Business Intelligence Strategy and Big Data Analytics

Business Intelligence Strategy and Big Data Analytics is written for business leaders, managers, and analysts - people who are involved with advancing the use of BI at their companies or who need to better understand what BI is and how it can be used to improve profitability. It is written from a general management perspective, and it draws on observations at 12 companies whose annual revenues range between $500 million and $20 billion. Over the past 15 years, my company has formulated vendor-neutral business-focused BI strategies and program execution plans in collaboration with manufacturers, distributors, retailers, logistics companies, insurers, investment companies, credit unions, and utilities, among others. It is through these experiences that we have validated business-driven BI strategy formulation methods and identified common enterprise BI program execution challenges. In recent years, terms like “big data” and “big data analytics” have been introduced into the business and technical lexicon. Upon close examination, the newer terminology is about the same thing that BI has always been about: analyzing the vast amounts of data that companies generate and/or purchase in the course of business as a means of improving profitability and competitiveness. Accordingly, we will use the terms BI and business intelligence throughout the book, and we will discuss the newer concepts like big data as appropriate. More broadly, the goal of this book is to share methods and observations that will help companies achieve BI success and thereby increase revenues, reduce costs, or both.
Provides ideas for improving the business performance of one’s company or business functions
Emphasizes proven, practical, step-by-step methods that readers can readily apply in their companies
Includes exercises and case studies with road-tested advice about formulating BI strategies and program plans

Global Business Analytics Models: Concepts and Applications in Predictive, Healthcare, Supply Chain, and Finance Analytics

THE COMPLETE GUIDE TO USING ANALYTICS TO MANAGE RISK AND UNCERTAINTY IN COMPLEX GLOBAL BUSINESS ENVIRONMENTS
Practical techniques for developing reliable, actionable intelligence–and using it to craft strategy
Analytical opportunities to solve key managerial problems in global enterprises
Written for working managers: packed with realistic, useful examples
This guide helps global managers use modern analytics to gain reliable, actionable, and timely business intelligence–and use it to manage risk, build winning strategies, and solve urgent problems. Dr. Hokey Min offers a practical, easy-to-understand overview of business analytics in a global context, focusing especially on managerial and strategic implications. After demystifying today’s core quantitative tools, he demonstrates them at work in a wide spectrum of global applications. You’ll build models to help segment global markets, forecast demand, assess risk, plan financing, optimize supply chains, and more. Along the way, you’ll find practical guidance for developing analytic thinking, operationalizing Big Data in global environments, and preparing for future analytical innovations. Whether you’re a global executive, strategist, analyst, marketer, supply chain professional, student or researcher, this book will help you drive real value from analytics–in smarter decisions, improved strategy, and better management.
In today’s global business environments characterized by growing complexity, volatility, and uncertainty, business analytics has become an indispensable tool for managing these challenges. Specifically, global managers need analytics expertise to solve problems, identify opportunities, shape strategy, mitigate risk, and improve their day-to-day operational efficiency. Now, for the first time, there’s an analytics guide designed specifically for decision-makers in global organizations.
Leveraging his experience teaching a number of students and training hundreds of managers and executives, Dr. Hokey Min demystifies the principles and tools of modern business analytics, and demonstrates their real-world use in global business. First, Dr. Min identifies key success factors and mindsets, helping you establish the preconditions for effective analysis. Next, he walks you through the practicalities of collecting, organizing, and analyzing Big Data, and developing models to transform them into actionable insight. Building on these foundations, he illustrates core analytical applications in finance, healthcare, and global supply chains. He concludes by previewing emerging trends in analytics, including the newest tools for automated decision-making.
Compare today’s key quantitative tools: Stats, data mining, OR, and simulation; how they work, when to use them
Get the right data… …and get the data right
Predict the future… …and sense its arrival sooner than others can
Implement high-value analytics applications… …in finance, supply chains, healthcare, and beyond

IT Modernization using Catalogic ECX Copy Data Management and IBM Spectrum Storage

Data is the currency of the new economy, and organizations are increasingly tasked with finding better ways to protect, recover, access, share, and use data. Traditional storage technologies are being stretched to the breaking point. This challenge is not because of storage hardware performance, but because management tools and techniques have not kept pace with new requirements. Primary data growth rates of 35% to 50% annually only amplify the problem. Organizations of all sizes find themselves needing to modernize their IT processes to enable critical new use cases such as storage self-service, Development and Operations (DevOps), and integration of data centers with the Cloud. They are equally challenged with improving management efficiencies for long established IT processes such as data protection, disaster recovery, reporting, and business analytics. Access to copies of data is the one common feature of all these use cases. However, the slow, manual processes common to IT organizations, including a heavy reliance on labor-intensive scripting and disparate tool sets, are no longer able to deliver the speed and agility required in today's fast-paced world. Copy Data Management (CDM) is an IT modernization technology that focuses on using existing data in a manner that is efficient, automated, scalable, and easy to use, delivering the data access that is urgently needed to meet the new use cases. Catalogic ECX, with IBM® storage, provides in-place copy data management that modernizes IT processes, enables key use cases, and does it all within existing infrastructure. This IBM Redbooks® publication shows how Catalogic Software and IBM have partnered together to create an integrated solution that addresses today's IT environment.

Ecommerce Analytics: Analyze and Improve the Impact of Your Digital Strategy

Today's Complete, Focused, Up-to-Date Guide to Analytics for Ecommerce
Profit from analytics throughout the entire customer experience and lifecycle
Make the most of all the fast-changing data sources now available to you
For all ecommerce executives, strategists, entrepreneurs, marketers, analysts, and data scientists
Ecommerce Analytics is the only complete single-source guide to analytics for your ecommerce business. It brings together all the knowledge and skills you need to solve your unique problems, and transform your data into better decisions and customer experiences. Judah Phillips shows how to use analysis to improve ecommerce marketing and advertising, understand customer behavior, increase conversion rates, strengthen loyalty, optimize merchandising and product mix, streamline transactions, and accurately attribute sales. Drawing on extensive experience leading large-scale analytics programs, he also offers expert guidance on building successful analytical teams; surfacing high-value insights via dashboards and visualization; and managing data governance, security, and privacy. Here are the answers you need to make the most of analytics in ecommerce: throughout your organization, across your entire customer lifecycle.

Hadoop Real-World Solutions Cookbook - Second Edition

Master the full potential of big data processing using Hadoop with this comprehensive guide. Featuring over 90 practical recipes, this book helps you streamline data workflows and implement machine learning models with tools like Spark, Hive, and Pig. By the end, you'll confidently handle complex data problems and optimize big data solutions effectively.
What this Book will help me do:
Install and manage a Hadoop 2.x cluster efficiently to suit your data processing needs.
Explore and utilize advanced tools like Hive, Pig, and Flume for seamless big data analysis.
Master data import/export processes with Sqoop and workflow automation using Oozie.
Implement machine learning and analytics tasks using Mahout and Apache Spark.
Store and process data flexibly across formats like Parquet, ORC, RC, and more.
Author(s): Deshpande is an expert in big data processing and analytics with years of hands-on experience in implementing Hadoop-based solutions for real-world problems. Known for a clear and pragmatic writing style, Deshpande brings actionable wisdom and best practices to the forefront, helping readers excel in managing and utilizing big data systems.
Who is it for? Designed for technical enthusiasts and professionals, this book is ideal for those familiar with basic big data concepts. If you are looking to expand your expertise in Hadoop's ecosystem and implement data-driven solutions, this book will guide you through essential skills and advanced techniques to efficiently manage complex big data projects.

You know what it's time we do? It's time we make analytics great again. How can we do that? With three guys who know about winning. Maybe not winning with real estate. Or with steaks. But winning with analytics. Are these three guys winners? Well, for the sake of 40 minutes of audio, let's say they are. And then we'll let you, the people, decide.

Gratuitous pop culture references in this episode include: DJ Khaled, Larry David (on SNL), and Louis CK.