talk-data.com talk-data.com

Topic

data

5765

tagged

Activity Trend

3 peak/qtr
2020-Q1 2026-Q1

Activities

5765 activities · Newest first

Hadoop For Dummies

Let Hadoop For Dummies help harness the power of your data and rein in the information overload Big data has become big business, and companies and organizations of all sizes are struggling to find ways to retrieve valuable information from their massive data sets with becoming overwhelmed. Enter Hadoop and this easy-to-understand For Dummies guide. Hadoop For Dummies helps readers understand the value of big data, make a business case for using Hadoop, navigate the Hadoop ecosystem, and build and manage Hadoop applications and clusters. Explains the origins of Hadoop, its economic benefits, and its functionality and practical applications Helps you find your way around the Hadoop ecosystem, program MapReduce, utilize design patterns, and get your Hadoop cluster up and running quickly and easily Details how to use Hadoop applications for data mining, web analytics and personalization, large-scale text processing, data science, and problem-solving Shows you how to improve the value of your Hadoop cluster, maximize your investment in Hadoop, and avoid common pitfalls when building your Hadoop cluster From programmers challenged with building and maintaining affordable, scaleable data systems to administrators who must deal with huge volumes of information effectively and efficiently, this how-to has something to help you with Hadoop.

Repeated Measurements and Cross-Over Designs

An introduction to state-of-the-art experimental design approaches to better understand and interpret repeated measurement data in cross-over designs. Repeated Measurements and Cross-Over Designs: Features the close tie between the design, analysis, and presentation of results Presents principles and rules that apply very generally to most areas of research, such as clinical trials, agricultural investigations, industrial procedures, quality control procedures, and epidemiological studies Includes many practical examples, such as PK/PD studies in the pharmaceutical industry, k-sample and one sample repeated measurement designs for psychological studies, and residual effects of different treatments in controlling conditions such as asthma, blood pressure, and diabetes. Utilizes SAS(R) software to draw necessary inferences. All SAS output and data sets are available via the book's related website. This book is ideal for a broad audience including statisticians in pre-clinical research, researchers in psychology, sociology, politics, marketing, and engineering.

Statistics: Principles and Methods, 7th Edition

Johnson/Bhattacharyya is unique in its clarity of exposition while maintaining the mathematical correctness of its explanations. Many other books that claim to be easier to understand often sacrifice mathematical rigor. In contrast, Johnson/ Bhattacharyya maintain a focus on accuracy without getting bogged down in unnecessary details.

Anonymous Communication Networks

This book examines anonymous communication networks as a solution to Internet privacy concerns. It explores various anonymous communication networks as possible solutions to Internet privacy concerns and identifies specific scenarios where it is best to remain anonymous. The text details the two main approaches to anonymous communication networks: onion routing and mixed networks. Using examples and case studies, it illustrates the usefulness of anonymous communication networks for web browsing, email, e-banking, and e-voting. It also includes guidance to help readers download and install Tor, I2P, JAP/JonDo, and QuickSilver.

IBM Tivoli Storage Productivity Center V5.1 Technical Guide

IBM® Tivoli® Storage Productivity Center V5.1 products offer storage infrastructure management that helps optimize storage management by centralizing, simplifying, automating, and optimizing storage tasks associated with storage systems, data disaster recovery, storage networks, and capacity management. IBM Tivoli Storage Productivity Center V5.1 products include: IBM Tivoli Storage Productivity Center V5.1 IBM Tivoli Storage Productivity Center Select Edition V5.1 Tivoli Storage Productivity Center Select Edition V5.1 offers the same features as Tivoli Storage Productivity Center V5.1 but at attractive entry-level pricing for operations with smaller capacities. It is licensed per storage device, such as disk controllers and their respective expansion units. This IBM Redbooks® publication is intended for storage administrators and users who are installing and using the features and functions in IBM Tivoli Storage Productivity Center V5.1. The information in this book can be used to plan for, install, and customize the components of Tivoli Storage Productivity Center in your storage infrastructure.

Developing Analytic Talent: Becoming a Data Scientist

Learn what it takes to succeed in the the most in-demand tech job Harvard Business Review calls it the sexiest tech job of the 21st century. Data scientists are in demand, and this unique book shows you exactly what employers want and the skill set that separates the quality data scientist from other talented IT professionals. Data science involves extracting, creating, and processing data to turn it into business value. With over 15 years of big data, predictive modeling, and business analytics experience, author Vincent Granville is no stranger to data science. In this one-of-a-kind guide, he provides insight into the essential data science skills, such as statistics and visualization techniques, and covers everything from analytical recipes and data science tricks to common job interview questions, sample resumes, and source code. The applications are endless and varied: automatically detecting spam and plagiarism, optimizing bid prices in keyword advertising, identifying new molecules to fight cancer, assessing the risk of meteorite impact. Complete with case studies, this book is a must, whether you're looking to become a data scientist or to hire one. Explains the finer points of data science, the required skills, and how to acquire them, including analytical recipes, standard rules, source code, and a dictionary of terms Shows what companies are looking for and how the growing importance of big data has increased the demand for data scientists Features job interview questions, sample resumes, salary surveys, and examples of job ads Case studies explore how data science is used on Wall Street, in botnet detection, for online advertising, and in many other business-critical situations Developing Analytic Talent: Becoming a Data Scientist is essential reading for those aspiring to this hot career choice and for employers seeking the best candidates.

Introduction to Numerical Electrostatics Using MATLAB

Readers are guided step by step through numerous specific problems and challenges, covering all aspects of electrostatics with an emphasis on numerical procedures. The author focuses on practical examples, derives mathematical equations, and addresses common issues with algorithms. Introduction to Numerical Electrostatics contains problem sets, an accompanying web site with simulations, and a complete list of computer codes. Computer source code listings on accompanying web site Problem sets included with book Readers using MATLAB or other simulation packages will gain insight as to the inner workings of these packages, and how to account for their limitations Example computer code is provided in MATLAB Solutions Manual The first book of its kind uniquely devoted to the field of computational electrostatics

Beginning Hibernate, Third Edition

Beginning Hibernate, Third Edition is ideal if you're experienced in Java with databases (the traditional, or "connected," approach), but new to open-source, lightweight Hibernate, a leading object-relational mapping and database-oriented application development framework. This book packs in information about the release of the Hibernate 4.x persistence layer and provides a clear introduction to the current standard for object-relational persistence in Java. And since the book keeps its focus on Hibernate without wasting time on nonessential third-party tools, you'll be able to immediately start building transaction-based engines and applications. Experienced authors Joseph Ottinger with Dave Minter and Jeff Linwood provide more in-depth examples than any other book for Hibernate beginners. The authors also present material in a lively, example-based manner—not a dry, theoretical, hard-to-read fashion. What you'll learn How to build enterprise Java-based transaction-type applications that access complex data with Hibernate How to work with Hibernate 4 Where to integrate into the persistence life cycle How to map using annotations, Hibernate XML files, and more How to search and query with the new version of Hibernate How to integrate with MongoDB using NoSQL Who this book is for This book is for Java developers who want to learn about Hibernate.

Displaying Time Series, Spatial, and Space-Time Data with R

Code and Methods for Creating High-Quality Data GraphicsA data graphic is not only a static image, but it also tells a story about the data. It activates cognitive processes that are able to detect patterns and discover information not readily available with the raw data. This is particularly true for time series, spatial, and space-time datasets.F

Statistical Analysis: Microsoft® Excel® 2013

Use Excel 2013’s statistical tools to transform your data into knowledge Conrad Carlberg shows how to use Excel 2013 to perform core statistical tasks every business professional, student, and researcher should master. Using real-world examples, Carlberg helps you choose the right technique for each problem and get the most out of Excel’s statistical features, including recently introduced consistency functions. Along the way, he clarifies confusing statistical terminology and helps you avoid common mistakes. You’ll learn how to use correlation and regression, analyze variance and covariance, and test statistical hypotheses using the normal, binomial, t, and F distributions. To help you make accurate inferences based on samples from a population, this edition adds two more chapters on inferential statistics, covering crucial topics ranging from experimental design to the statistical power of F tests. Becoming an expert with Excel statistics has never been easier! You’ll find crystal-clear instructions, insider insights, and complete step-by-step projects—all complemented by extensive web-based resources. Master Excel’s most useful descriptive and inferential statistical tools Tell the truth with statistics—and recognize when others don’t Accurately summarize sets of values Infer a population’s characteristics from a sample’s frequency distribution Explore correlation and regression to learn how variables move in tandem Use Excel consistency functions such as STDEV.S() and STDEV.P() Test differences between two means using z tests, t tests, and Excel’s Data Analysis Add-in Use ANOVA to test differences between more than two means Explore statistical power by manipulating mean differences, standard errors, directionality, and alpha Take advantage of Recommended PivotTables, Quick Analysis, and other Excel 2013 shortcuts

Think Bigger
Big data--the enormous amount of data that is created as virtually every movement, transaction, and choice we make becomes digitized--is revolutionizing business. Offering real-world insight and explanations, this book provides a roadmap for organizations looking to develop a profitable big data strategy...and reveals why it's not something they can leave to the I.T. department.

Sharing best practices from companies that have implemented a big data strategy including Walmart, InterContinental Hotel Group, Walt Disney, and Shell, Think Bigger covers the most important big data trends affecting organizations, as well as key technologies like Hadoop and MapReduce, and several crucial types of analyses. In addition, the book offers guidance on how to ensure security, and respect the privacy rights of consumers. It also examines in detail how big data is impacting specific industries--and where opportunities can be found.

Big data is changing the way businesses--and even governments--are operated and managed. Think Bigger is an essential resource for anyone who wants to ensure that their company isn't left in the dust.

Economic and Business Forecasting: Analyzing and Interpreting Econometric Results

Discover the secrets to applying simple econometric techniques to improve forecasting Equipping analysts, practitioners, and graduate students with a statistical framework to make effective decisions based on the application of simple economic and statistical methods, Economic and Business Forecasting offers a comprehensive and practical approach to quantifying and accurate forecasting of key variables. Using simple econometric techniques, author John E. Silvia focuses on a select set of major economic and financial variables, revealing how to optimally use statistical software as a template to apply to your own variables of interest. Presents the economic and financial variables that offer unique insights into economic performance Highlights the econometric techniques that can be used to characterize variables Explores the application of SAS software, complete with simple explanations of SAS-code and output Identifies key econometric issues with practical solutions to those problems Presenting the "ten commandments" for economic and business forecasting, this book provides you with a practical forecasting framework you can use for important everyday business applications.

Excel Dashboards and Reports For Dummies, 2nd Edition

Create dynamic dashboards and put your data on display with For Dummies No matter what business you're in, reports have become a staple of the workplace, but what good is a report if no reads it, or even worse, understands it? This all new edition of Excel Dashboards & Reports For Dummies is here to help you make meaning of all your data and turn it into clear and actionable visualizations. Fully updated for the latest business intelligence and spreadsheet tools in Excel 2013, this book shows you how to analyze large amounts of data, quickly slice data into various views on the fly, automate redundant reporting, create eye-catching visualizations, and more. Helps you move beyond reporting data with simple tables, rows, and columns to designing high-impact reports, dashboards, and visuals Walks you through a wide array of technical and analytical concepts to give you the background you need to select the right tool for interpreting and displaying data Covers how to build a chart, work with pivot tables, group and bucket your data, represent trends, create What-If analyses, and increase the value of your reports Excel Dashboards & Reports For Dummies, 2nd Edition is the business analysis tool you need to transform your raw data into a powerful and effective presentation that is accessible to everyone.

IBM zEnterprise System Technical Introduction

In a smarter planet, information-centric processes are exploding in growth. The mainframe has always been the IT industry's leading platform for transaction processing, consolidated and secure data serving, and support for available enterprise-wide applications. IBM® has extended the mainframe platform to help large enterprises reshape their client experiences through information-centric computing and to deliver on key business initiatives. IBM zEnterprise® is recognized as the most reliable and trusted system, and the most secure environment for core business operations. The new zEnterprise System consists of the IBM zEnterprise EC12 (zEC12) or IBM zEnterprise BC12 (zBC12), the IBM zEnterprise Unified Resource Manager, and the IBM zEnterprise IBM BladeCenter® Extension (zBX) Model 003. This IBM Redbooks® publication describes the zEC12 and zBC12, with their improved scalability, performance, security, resiliency, availability, and virtualization. The zEnterprise System has no peer as a trusted platform that also provides the most efficient transaction processing and database management. With efficiency at scale delivering significant cost savings on core processes, resources can be freed up to focus on developing new services to drive growth. This book provides a technical overview of the zEC12, zBC12, zBX Model 003, and Unified Resource Manager. This publication is intended for IT managers, architects, consultants, and anyone else who wants to understand the elements of the zEnterprise System. For this introduction to the zEnterprise System, readers are not expected to be familiar with current IBM System z® technology and terminology.

It's Not the Size of the Data -- It's How You Use It
Brand tracking, CRM programs, trade shows, online behavior tracking, satisfaction studies. Mounds of marketing metrics are generated across touchpoints and channels. It can be information overload--too much, too scattered. But locked in the vast quantity of information are accurate, data-driven answers to every marketing question. Analytic dashboards are transformative web-based tools that gather, syn the size, and visually display essential data in real time, directly connecting marketing with performance. World renowned marketing expert Koen Pauwels supplies a simple yet rigorous methodology and wealth of case studies to help any size organization, in any industry, turn data into productive action. He explains step by step how to: ● Gain crucial IT support ● Build a rock-solid database ● Select key leading performance indicators ● Design the optimal dashboard layout ● Use marketing analytics to improve decisions and reap rewards Gut decisions are outdated and downright dangerous. Whether you're trying to allocate resources between online and offline marketing, measure the ROI of specific efforts, or scale up a creative campaign, dashboard analytics bring scientific precision and insight to marketing efforts--with far better results.
matplotlib Plotting Cookbook

The "matplotlib Plotting Cookbook" equips you with the skills to create impactful scientific visualizations using Python's matplotlib library. Through a series of concise recipes, this book covers everything from basic plotting to advanced techniques, ensuring you can create impressive graphics for your data. What this Book will help me do Learn to produce standard 2D plots like line, bar, and scatter plots. Master advanced plotting techniques such as 3D plotting and data overlays. Enhance plots with detailed annotations, rich legends, and labeling. Understand the use of colors, styles, and scales to maximize readability. Use matplotlib to generate plots programmatically or integrate with applications. Author(s) Alexandre Devert, the author of the "matplotlib Plotting Cookbook," is an experienced data scientist with a strong foundation in Python and data visualization techniques. Alexandre has worked extensively in the field of data analysis, and his expertise is reflected in the practical examples and hands-on guidance provided throughout this book. He takes a learner-focused approach to presenting technical topics in an accessible way. Who is it for? This book is designed for Python developers, data scientists, and researchers who need to create clear, professional-quality visualizations. If you are at a beginner or intermediate level in using matplotlib or visualization libraries, this book will empower you with essential plotting skills. Readers looking to save time while producing meaningful insights through data visualizations will find this book valuable. It is suitable for those aiming to improve their data representation skills for presentations or publications.

Responsive Mobile User Experience Using MQTT and IBM MessageSight

IBM® MessageSight is an appliance-based messaging server that is optimized to address the massive scale requirements of machine-to-machine (m2m) and mobile user scenarios. IBM MessageSight makes it easy to connect mobile customers to your existing messaging enterprise system, enabling a substantial number of remote clients to be concurrently connected. The MQTT protocol is a lightweight messaging protocol that uses publish/subscribe architecture to deliver messages over low bandwidth or unreliable networks. A publish/subscribe architecture works well for HTML5, native, and hybrid mobile applications by removing the wait time of a request/response model. This creates a better, richer user experience. The MQTT protocol is simple, which results in a client library with a low footprint. MQTT was proposed as an Organization for the Advancement of Structured Information Standards (OASIS) standard. This book provides information about version 3.1 of the MQTT specification. This IBM Redbooks® publication provides information about how IBM MessageSight, in combination with MQTT, facilitates the expansion of enterprise systems to include mobile devices and m2m communications. This book also outlines how to connect IBM MessageSight to an existing infrastructure, either through the use of IBM WebSphere® MQ connectivity or the IBM Integration Bus (formerly known as WebSphere Message Broker). This book describes IBM MessageSight product features and facilities that are relevant to technical personnel, such as system architects, to help them make informed design decisions regarding the integration of the messaging appliance into their enterprise architecture. Using a scenario-based approach, you learn how to develop a mobile application, and how to integrate IBM MessageSight with other IBM products. This publication is intended to be of use to a wide-ranging audience.

Storm Blueprints: Patterns for Distributed Real-time Computation

"Storm Blueprints: Patterns for Distributed Real-time Computation" takes you on a hands-on journey into understanding and implementing distributed real-time processing with Apache Storm. Through real-world examples and projects, you'll gain a sound understanding of the fundamentals and learn to design systems capable of resilient, scalable, and fast computation. What this Book will help me do Understand the essentials of Apache Storm and its architecture. Learn to deploy and manage Storm in different modes, including distributed clusters. Discover design patterns for real-time data flow in distributed systems. Master the implementation of fault tolerance and continuous availability in processing. Analyze system performance insights through practical integrations and use cases. Author(s) The author(s) of 'Storm Blueprints' bring extensive experience in distributed systems engineering and real-time computations. Their passion for sharing knowledge is evident in this approachable yet comprehensive book. With years of practical experience, they offer insights and proven techniques to empower readers to build practical distributed systems. Who is it for? This book is designed for software engineers and developers working on data pipelines and real-time processing systems. Beginners to Storm will find it an excellent introduction, while those with experience will appreciate the advanced design patterns and use cases. If you aim to leverage Storm effectively in distributed architectures, this guide is tailored for you.

DFSMSrmm Primer

DFSMSrmm from IBM® is the full function tape management system available in IBM OS/390® and IBM z/OS®. With DFSMSrmm, you can manage all types of tape media at the shelf, volume, and data set level, simplifying the tasks of your tape librarian. Are you a new DFSMSrmm user? Then, this IBM Redbooks® publication introduces you to the DFSMSrmm basic concepts and functions. You learn how to manage your tape environment by implementing the DFSMSrmm management policies. Are you already using DFSMSrmm? In that case, this publication provides the most up-to-date information about the new functions and enhancements introduced with the latest release of DFSMSrmm. You will find useful information for implementing these new functions and getting more benefits from DFSMSrmm. Do you want to test DFSMSrmm functions? If you are using another tape management system and are thinking about converting to DFSMSrmm, you can start DFSMSrmm and run it in parallel with your current system for testing purposes. This book is intended to be a starting point for new professionals and a handbook for using the basic DFSMSrmm functions.

Practical Data Science with R

NEWER EDITION AVAILABLE IN MEAP Practical Data Science with R, Second Edition is now available in the Manning Early Access Program. An eBook of this older edition is included at no additional cost when you buy the revised edition! You may still purchase Practical Data Science with R (First Edition) using the Buy options on this page. Practical Data Science with R lives up to its name. It explains basic principles without the theoretical mumbo-jumbo and jumps right to the real use cases you'll face as you collect, curate, and analyze the data crucial to the success of your business. You'll apply the R programming language and statistical analysis techniques to carefully explained examples based in marketing, business intelligence, and decision support. About the Technology Business analysts and developers are increasingly collecting, curating, analyzing, and reporting on crucial business data. The R language and its associated tools provide a straightforward way to tackle day-to-day data science tasks without a lot of academic theory or advanced mathematics. About the Book Practical Data Science with R shows you how to apply the R programming language and useful statistical techniques to everyday business situations. Using examples from marketing, business intelligence, and decision support, it shows you how to design experiments (such as A/B tests), build predictive models, and present results to audiences of all levels. What's Inside Data science for the business professional Statistical analysis using the R language Project lifecycle, from planning to delivery Numerous instantly familiar use cases Keys to effective data presentations About the Reader This book is accessible to readers without a background in data science. Some familiarity with basic statistics, R, or another scripting language is assumed. About the Authors Nina Zumel and John Mount are cofounders of a San Francisco-based data science consulting firm. Both hold PhDs from Carnegie Mellon and blog on statistics, probability, and computer science at win-vector.com. Quotes A unique and important addition to any data scientist’s library. - From the Foreword by Jim Porzak, Cofounder Bay Area R Users Group Covers the process end-to-end, from data exploration to modeling to delivering the results. - Nezih Yigitbasi, Intel Full of useful gems for both aspiring and experienced data scientists. - Fred Rahmanian, Siemens Healthcare Hands-on data analysis with real-world examples. Highly recommended. - Dr. Kostas Passadis, IPTO