talk-data.com talk-data.com

Topic

data

5765

tagged

Activity Trend

3 peak/qtr
2020-Q1 2026-Q1

Activities

5765 activities · Newest first

Mastering Python Data Visualization

Mastering Python Data Visualization provides thorough, hands-on guidance for creating impactful visual representations of data by leveraging Python's powerful libraries such as Matplotlib, Pandas, and Scikit-Learn. By following this book, you will gain proficiency in understanding data, performing analyses, and ultimately presenting your findings in a clear and engaging way. What this Book will help me do Effectively transform raw data into insightful visualizations using Python's rich ecosystem of libraries. Understand and apply best practices for selecting the most appropriate visualization techniques for different datasets and objectives. Master the use of Python for interactive plotting, regression analysis, clustering, and classification tasks. Develop a solid foundation in data visualization aesthetics and how to convey information clearly through visuals. Utilize Python for specialized fields such as finance, bioinformatics, and social network analysis, incorporating advanced computation techniques. Author(s) Kirthi Raman is an experienced data scientist and Python advocate with a strong background in technical computing and data visualization. He has hands-on experience in using Python's ecosystem to solve real-world data problems and a passion for sharing knowledge. Raman's writing focuses on blending practical insights with comprehensive explanations, ensuring readers not only learn the tools but also apply them effectively. Who is it for? This book is ideal for data analysts, data scientists, and researchers who want to deepen their knowledge of Python-based data visualization techniques. It requires readers to have a basic understanding of Python and data manipulation. If your goal is to create professional and informative visual narratives that are both visually appealing and data-driven, this book is for you.

Microsoft Mapping: Geospatial Development in Windows 10 with Bing Maps and C#, Second Edition

This revised edition of Microsoft Mapping includes the latest details about SQL Server 2014 and the new 3D and Streetside-capable map control for Windows 10 applications. It contains updated chapters on Microsoft Azure and Power Map for Excel plus a new chapter on Bing Maps for Universal Windows. The book tells a story, from beginning to end, of planning and deploying a single geospatial application built using Microsoft technologies from end-to-end. Readers are expected to have basic familiarity with the fundamentals of developing for Microsoft platforms (some understanding of basic SQL, C#, .NET, and WCF); as readers work through the book they will build on their existing skills so that they will be able to deploy geospatial applications for social networking, data collection, enterprise management, or other purposes.

Hadoop 2 Quick-Start Guide: Learn the Essentials of Big Data Computing in the Apache Hadoop 2 Ecosystem

Get Started Fast with Apache Hadoop ® 2, YARN, and Today’s Hadoop Ecosystem With Hadoop 2.x and YARN, Hadoop moves beyond MapReduce to become practical for virtually any type of data processing. Hadoop 2.x and the Data Lake concept represent a radical shift away from conventional approaches to data usage and storage. Hadoop 2.x installations offer unmatched scalability and breakthrough extensibility that supports new and existing Big Data analytics processing methods and models. Hadoop ® 2 Quick-Start Guide is the first easy, accessible guide to Apache Hadoop 2.x, YARN, and the modern Hadoop ecosystem. Building on his unsurpassed experience teaching Hadoop and Big Data, author Douglas Eadline covers all the basics you need to know to install and use Hadoop 2 on personal computers or servers, and to navigate the powerful technologies that complement it. Eadline concisely introduces and explains every key Hadoop 2 concept, tool, and service, illustrating each with a simple “beginning-to-end” example and identifying trustworthy, up-to-date resources for learning more. This guide is ideal if you want to learn about Hadoop 2 without getting mired in technical details. Douglas Eadline will bring you up to speed quickly, whether you’re a user, admin, devops specialist, programmer, architect, analyst, or data scientist. Coverage Includes Understanding what Hadoop 2 and YARN do, and how they improve on Hadoop 1 with MapReduce Understanding Hadoop-based Data Lakes versus RDBMS Data Warehouses Installing Hadoop 2 and core services on Linux machines, virtualized sandboxes, or clusters Exploring the Hadoop Distributed File System (HDFS) Understanding the essentials of MapReduce and YARN application programming Simplifying programming and data movement with Apache Pig, Hive, Sqoop, Flume, Oozie, and HBase Observing application progress, controlling jobs, and managing workflows Managing Hadoop efficiently with Apache Ambari–including recipes for HDFS to NFSv3 gateway, HDFS snapshots, and YARN configuration Learning basic Hadoop 2 troubleshooting, and installing Apache Hue and Apache Spark

VersaStack Solution by Cisco and IBM with SQL, Spectrum Control, and Spectrum Protect

Dynamic organizations want to accelerate growth while reducing costs. To do so, they must speed the deployment of business applications and adapt quickly to any changes in priorities. Organizations today require an IT infrastructure to be easy, efficient, and versatile. The VersaStack solution by Cisco and IBM® can help you accelerate the deployment of your data centers. It reduces costs by more efficiently managing information and resources while maintaining your ability to adapt to business change. The VersaStack solution combines the innovation of Cisco UCS Integrated Infrastructure with the efficiency of the IBM Storwize® storage system. The Cisco UCS Integrated Infrastructure includes the Cisco Unified Computing System (Cisco UCS), Cisco Nexus and Cisco MDS switches, and Cisco UCS Director. The IBM Storwize V7000 enhances virtual environments with its Data Virtualization, IBM Real-time Compression™, and IBM Easy Tier® features. These features deliver extraordinary levels of performance and efficiency. The VersaStack solution is Cisco Application Centric Infrastructure (ACI) ready. Your IT team can build, deploy, secure, and maintain applications through a more agile framework. Cisco Intercloud Fabric capabilities help enable the creation of open and highly secure solutions for the hybrid cloud. These solutions accelerate your IT transformation while delivering dramatic improvements in operational efficiency and simplicity. Cisco and IBM are global leaders in the IT industry. The VersaStack solution gives you the opportunity to take advantage of integrated infrastructure solutions that are targeted at enterprise applications, analytics, and cloud solutions. The VersaStack solution is backed by Cisco Validated Designs (CVD) to provide faster delivery of applications, greater IT efficiency, and less risk. This IBM Redbooks® publication is aimed at experienced storage administrators that are tasked with deploying a VersaStack solution with Microsoft Sequel (SQL), IBM Spectrum™ Protect, and IBM Spectrum Control™.

Expert Performance Indexing in SQL Server, Second Edition

This book is a deep dive into perhaps the single-most important facet of good performance: indexes, and how to best use them. The book begins in the shallow waters with explanations of the types of indexes and how they are stored in databases. Moving deeper into the topic, and further into the book, you will look at the statistics that are accumulated both by indexes and on indexes. You’ll better understand what indexes are doing in the database and what can be done to mitigate and improve their effect on performance. The final destination is a guided tour through a number of real life scenarios showing approaches you can take to investigate, mitigate, and improve the performance of your database. • Defines the types of indexes and their implementation options • Provides use cases and common patterns in applying indexing • Describes and explain the index metadata and statistics • Provides a framework of strategies and approaches for indexing databases

IBM PowerVC Version 1.2.3: Introduction and Configuration

IBM® Power Virtualization Center (PowerVC™) is an advanced enterprise virtualization management offering for IBM® Power Systems™, which is based on the OpenStack framework. This IBM Redbooks® publication introduces PowerVC and helps you understand its functions, planning, installation, and setup. Starting with PowerVC version 1.2.2, the Express Edition offering is no longer available and the Standard Edition is the only offering. PowerVC supports both large and small deployments, either by managing IBM PowerVM® that is controlled with the Hardware Management Console (HMC) or by managing PowerKVM directly. PowerVC can manage IBM AIX®, IBM i, and Linux workloads that run on POWER® hardware, including IBM PurePower systems. PowerVC editions include the following features and benefits: Virtual Image capture, deployment, and management Policy-based Virtual Machine (VM) placement to improve use Management of real-time optimization and VM resilience to increase productivity Managing real-time optimization and VM resilience to increase productivity VM Mobility with placement policies to reduce the burden on IT staff in a simple-to-install and easy-to-use graphical user interface (GUI) An open and extensible PowerVM management system that you can adapt as you need and that runs in parallel with existing infrastructure, preserving your investment A management system for existing PowerVM deployments You will also find all the details about how we set up the lab environment that is used in this book. This book is for experienced users of IBM PowerVM and other virtualization solutions who want to understand and implement the next generation of enterprise virtualization management for Power Systems. Unless stated otherwise, the content of this book refers to versions 1.2.2 and 1.2.3 of IBM PowerVC. Unless stated otherwise, the content of this book refers to versions 1.2.2 and 1.2.3 of IBM PowerVC Version 1.2.3 Introduction and Configuration IBM PowerVC.

Sams Teach Yourself: Big Data Analytics with Microsoft HDInsight in 24 Hours

Sams Teach Yourself Big Data Analytics with Microsoft HDInsight in 24 Hours In just 24 lessons of one hour or less, Sams Teach Yourself Big Data Analytics with Microsoft HDInsight in 24 Hours helps you leverage Hadoop’s power on a flexible, scalable cloud platform using Microsoft’s newest business intelligence, visualization, and productivity tools. This book’s straightforward, step-by-step approach shows you how to provision, configure, monitor, and troubleshoot HDInsight and use Hadoop cloud services to solve real analytics problems. You’ll gain more of Hadoop’s benefits, with less complexity–even if you’re completely new to Big Data analytics. Every lesson builds on what you’ve already learned, giving you a rock-solid foundation for real-world success. Practical, hands-on examples show you how to apply what you learn Quizzes and exercises help you test your knowledge and stretch your skills Notes and tips point out shortcuts and solutions Learn how to… Master core Big Data and NoSQL concepts, value propositions, and use cases Work with key Hadoop features, such as HDFS2 and YARN Quickly install, configure, and monitor Hadoop (HDInsight) clusters in the cloud Automate provisioning, customize clusters, install additional Hadoop projects, and administer clusters Integrate, analyze, and report with Microsoft BI and Power BI Automate workflows for data transformation, integration, and other tasks Use Apache HBase on HDInsight Use Sqoop or SSIS to move data to or from HDInsight Perform R-based statistical computing on HDInsight datasets Accelerate analytics with Apache Spark Run real-time analytics on high-velocity data streams Write MapReduce, Hive, and Pig programs Register your book at informit.com/register for convenient access to downloads, updates, and corrections as they become available.

Creating Stunning Dashboards with QlikView

Explore the world of QlikView dashboards with this comprehensive guide that walks you through the entire process of creating effective and visually engaging dashboards for your business needs. From identifying KPIs to rolling out your application, this book provides actionable steps and best practices for delivering data-driven results. What this Book will help me do Define key performance indicators (KPIs) based on business objectives and goals. Design and structure dashboards using best practices in data visualization. Master creating various chart types, including bar, line, pie charts, and advanced visualizations, like heat maps. Integrate data from multiple sources, such as ERP systems and spreadsheets, into a cohesive dashboard. Learn the steps to develop mobile-optimized dashboards for accessibility on the go. Author(s) None Villafuerte, a seasoned expert in data visualization and QlikView development, brings a wealth of experience to this book. With years of hands-on work creating impactful dashboards for various business needs, the author's pragmatic and result-oriented approach provides readers with practical and insightful knowledge. Who is it for? The book is tailored for QlikView developers who already possess a basic understanding of scripting and dashboard layout design. It's ideal for professionals aiming to enhance their design and visualization skills. Additionally, business analysts or managers with a technical inclination could also benefit from its comprehensive approach to creating interactive dashboards. If building effective and appealing dashboards that drive business impact is your goal, this book is for you.

Sams Teach Yourself T-SQL in One Hour a Day

Master T-SQL database design, development, and administration the easy way–hands-on! In just one hour a day, you’ll build all the skills you need to create effective database applications with T-SQL and SQL Server. With this complete tutorial, you’ll quickly master the basics and then move on to more advanced features and concepts: Learn the fundamentals of T-SQL from the ground up, one step at a time Succeed with the newest versions of T-SQL, SQL Server, and SQL Server Management Studio Use T-SQL effectively as both an application developer and DBA Master powerful stored procedures, triggers, transactions, and user-defined functions (UDFs) Systematically optimize and secure your SQL Server databases Learn on your own time, at your own pace No previous T-SQL or database programming experience required Learn how to design efficient, reliable SQL Server databases Define efficient tables, table relationships, fields, and constraints Make the most of T-SQL’s SELECT and UPDATE statements Work effectively with simple and complex views and joins Master stored procedure techniques every developer should know Build and use powerful User-Defined Functions (UDFs) Secure databases with authentication, roles, permissions, and principals Configure, maintain, and tune SQL Server for maximum reliability, performance, and value Back up, restore, and audit databases Optimize databases with the SQL Server Profiler, System Monitor, and Index Tuning Wizard Leverage valuable insight and time saving techniques from a world renowned database expert Register your book at informit.com/register for access to source code, example files, updates, and corrections as they become available.

Key Management Development Models, 3rd Edition

Key Management Development Models provides the crucial information you need to develop your skills as a manager. Divided into two parts (Part 1: Developing Yourself & Part 2: Working with Others), each tool, model or idea will ensure you: · understand yourself better · understand how others perceive you · develop your credibility at work · make better choices in your management of others · become a more rounded professional, able to adapt your style to get the best out of yourself and others

Variational Methods for Engineers with Matlab

This book is issued from a 30 years’ experience on the presentation of variational methods to successive generations of students and researchers in Engineering. It gives a comprehensive, pedagogical and engineer-oriented presentation of the foundations of variational methods and of their use in numerical problems of Engineering. Particular applications to linear and nonlinear systems of equations, differential equations, optimization and control are presented. MATLAB programs illustrate the implementation and make the book suitable as a textbook and for self-study. The evolution of knowledge, of the engineering studies and of the society in general has led to a change of focus from students and researchers. New generations of students and researchers do not have the same relations to mathematics as the previous ones. In the particular case of variational methods, the presentations used in the past are not adapted to the previous knowledge, the language and the centers of interest of the new generations. Since these methods remain a core knowledge – thus essential - in many fields (Physics, Engineering, Applied Mathematics, Economics, Image analysis …), a new presentation is necessary in order to address variational methods to the actual context.

Beginning R: An Introduction to Statistical Programming, Second Edition

Beginning R, Second Edition is a hands-on book showing how to use the R language, write and save R scripts, read in data files, and write custom statistical functions as well as use built in functions. This book shows the use of R in specific cases such as one-way ANOVA analysis, linear and logistic regression, data visualization, parallel processing, bootstrapping, and more. It takes a hands-on, example-based approach incorporating best practices with clear explanations of the statistics being done. It has been completely re-written since the first edition to make use of the latest packages and features in R version 3. R is a powerful open-source language and programming environment for statistics and has become the de facto standard for doing, teaching, and learning computational statistics. R is both an object-oriented language and a functional language that is easy to learn, easy to use, and completely free. A large community of dedicated R users and programmers provides an excellent source of R code, functions, and data sets, with a constantly evolving ecosystem of packages providing new functionality for data analysis. R has also become popular in commercial use at companies such as Microsoft, Google, and Oracle. Your investment in learning R is sure to pay off in the long term as R continues to grow into the go to language for data analysis and research.

IBM Content Manager OnDemand Guide

This IBM® Redbooks® publication provides a practical guide to the design, installation, configuration, and maintenance of IBM Content Manager OnDemand Version 9.5. Content Manager OnDemand manages the high-volume storage and retrieval of electronic statements and provides efficient enterprise report management. Content Manager OnDemand transforms formatted computer output and printed reports, such as statements and invoices, into electronic information for easy report management. Content Manager OnDemand helps eliminate costly, high-volume print output by capturing, indexing, archiving, and presenting electronic information for improved customer service. This publication covers the key areas of Content Manager OnDemand, some of which might not be known to the Content Manager OnDemand community or are misunderstood. The book covers various topics, including basic information in administration, database structure, storage management, and security. In addition, the book covers data indexing, loading, conversion, and expiration. Other topics include user exits, performance, retention management, records management, and many more. Because many other resources are available that address subjects on different platforms, this publication is not intended as a comprehensive guide for Content Manager OnDemand. Rather, it is intended to complement the existing Content Manager OnDemand documentation and provide insight into the issues that might be encountered in the setup and use of Content Manager OnDemand. This book is intended for individuals who need to design, install, configure, and maintain Content Manager OnDemand.

Learning Shiny

Have you ever wanted to transform your data analysis in R into interactive, web-based dashboards and applications? "Learning Shiny" is your guide to mastering R's Shiny framework to create dynamic, visual, and engaging web applications. With its step-by-step approach, this book enables you to harness Shiny's features effectively. What this Book will help me do Understand the core principles of R and data processing using tools like apply and lapply, empowering you to handle data programmatically. Learn the Shiny framework fundamentals, including structuring an interactive application using UI and server scripts. Create stunning visualizations and dashboards using libraries like ggplot2 and integrate Shiny seamlessly. Deploy and host Shiny web applications on Linux servers for effective sharing and collaboration. Enhance your applications with JavaScript integrations, using tools like D3.js, for advanced customization. Author(s) Hernan Resnizky is a renowned data scientist and educator with extensive experience in R programming and Shiny application development. Known for his clear teaching style, he has guided numerous professionals in using R for real-world applications. His practical approach ensures readers not only learn techniques but understand how to apply them effectively. Who is it for? "Learning Shiny" is ideal for data scientists looking to showcase their work through interactive web apps and visualizations, and for web developers curious about leveraging the Shiny framework in R. Beginners as well as those with some R experience will find tailored guidance to suit their level. If you aim to expand your toolkit with web-focused R capabilities, this book is for you.

Data Preparation in the Big Data Era

Preparing and cleaning data is notoriously expensive, prone to error, and time consuming: the process accounts for roughly 80% of the total time spent on analysis. As this O’Reilly report points out, enterprises have already invested billions of dollars in big data analytics, so there’s great incentive to modernize methods for cleaning, combining, and transforming data. Author Federico Castanedo, Chief Data Scientist at WiseAthena.com, details best practices for reducing the time it takes to convert raw data into actionable insights. With these tools and techniques in mind, your organization will be well positioned to translate big data into big decisions. Explore the problems organizations face today with traditional prep and integration Define the business questions you want to address before selecting, prepping, and analyzing data Learn new methods for preparing raw data, including date-time and string data Understand how some cleaning actions (like replacing missing values) affect your analysis Examine data curation products: modern approaches that scale Consider your business audience when choosing ways to deliver your analysis

Fast Data: Smart and at Scale

The need for fast data applications is growing rapidly, driven by the IoT, the surge in machine-to-machine (M2M) data, global mobile device proliferation, and the monetization of SaaS platforms. So how do you combine real-time, streaming analytics with real-time decisions in an architecture that’s reliable, scalable, and simple? In this O’Reilly report, Ryan Betts and John Hugg from VoltDB examine ways to develop apps for fast data, using pre-defined patterns. These patterns are general enough to suit both the do-it-yourself, hybrid batch/streaming approach, as well as the simpler, proven in-memory approach available with certain fast database offerings. Their goal is to create a collection of fast data app development recipes. We welcome your contributions, which will be tested and included in future editions of this report.

Hadoop with Python

Hadoop is mostly written in Java, but that doesn't exclude the use of other programming languages with this distributed storage and processing framework, particularly Python. With this concise book, you’ll learn how to use Python with the Hadoop Distributed File System (HDFS), MapReduce, the Apache Pig platform and Pig Latin script, and the Apache Spark cluster-computing framework. Authors Zachary Radtka and Donald Miner from the data science firm Miner & Kasch take you through the basic concepts behind Hadoop, MapReduce, Pig, and Spark. Then, through multiple examples and use cases, you'll learn how to work with these technologies by applying various Python tools. Use the Python library Snakebite to access HDFS programmatically from within Python applications Write MapReduce jobs in Python with mrjob, the Python MapReduce library Extend Pig Latin with user-defined functions (UDFs) in Python Use the Spark Python API (PySpark) to write Spark programs with Python Learn how to use the Luigi Python workflow scheduler to manage MapReduce jobs and Pig scripts Zachary Radtka, a platform engineer at Miner & Kasch, has extensive experience creating custom analytics that run on petabyte-scale data sets.

Private and Open Data in Asia: A Regional Guide

The rise of big data in recent years coincides with the economic and political rise of Asia, especially among the five countries that make up the bulk of the East Asian Internet-using population: China, Japan, Korea, India, and Indonesia. If you’re thinking of entering the Asian market, this O’Reilly report provides an overview of the current state of big data and open data in these countries, and helps you examine whether the benefits of doing business with them outweigh the costs. While Japan and South Korea are highly developed countries with lofty Internet penetration rates, China, India, and Indonesia have enormous populations, relatively low Internet penetration, and enormous growth potential. But access to open data from fields such as healthcare, education, agriculture, transportation, energy, and finance—data vital for building businesses and services—varies from country to country. Each of them has a distinctive character reflecting its national priorities. To help you assess risk vs opportunity in the Asian market, author Franklin Lu reviews these five countries individually to reveal the nature of data privacy laws, open data initiatives, and existing businesses.

Sharpening Your Advanced SAS Skills

This guide presents sophisticated SAS programming techniques, procedures, and tools, such as Proc SQL, hash tables, and SAS Macro programming, for any industry. It empowers both advanced programmers who need a quick refresher and programmers interested in learning new techniques. It shows how to take advantage of the latest SAS options and new SAS procedures. The book illustrates syntax with simple, common task-oriented examples and prepares readers for the advanced SAS certification exam. Mindmaps and process flowcharts are available on the author's website.

The Definitive Guide to DAX: Business intelligence with Microsoft Excel, SQL Server Analysis Services, and Power BI

This comprehensive and authoritative guide will teach you the DAX language for business intelligence, data modeling, and analytics. Leading Microsoft BI consultants Marco Russo and Alberto Ferrari help you master everything from table functions through advanced code and model optimization. You’ll learn exactly what happens under the hood when you run a DAX expression, how DAX behaves differently from other languages, and how to use this knowledge to write fast, robust code. If you want to leverage all of DAX’s remarkable power and flexibility, this no-compromise “deep dive” is exactly what you need. Perform powerful data analysis with DAX for Microsoft SQL Server Analysis Services, Excel, and Power BI Master core DAX concepts, including calculated columns, measures, and error handling Understand evaluation contexts and the CALCULATE and CALCULATETABLE functions Perform time-based calculations: YTD, MTD, previous year, working days, and more Work with expanded tables, complex functions, and elaborate DAX expressions Perform calculations over hierarchies, including parent/child hierarchies Use DAX to express diverse and unusual relationships Measure DAX query performance with SQL Server Profiler and DAX Studio