talk-data.com talk-data.com

Topic

data

5765

tagged

Activity Trend

3 peak/qtr
2020-Q1 2026-Q1

Activities

5765 activities · Newest first

Efficient R optimization

What you’ll learn—and how you can apply it You’ll learn how to optimize your tried and tested code. In this Lesson, learners will understand how to profile code to identify and prevent key bottlenecks in R performance, as well as tricks that may improve performance on row and column operations and matrices. This Lesson also presents an example of specific improvements that can be made to enhance performance of the movie_square() function. This lesson is for you because You already have well-developed code that is mature conceptually and has been tried and tested. Now, you want to optimize this code. Prerequisites: Some knowledge of R and have well-developed R code Materials or downloads needed: Installed RStudio Some examples in this Lesson require a working C++ compiler

Efficient R Programming

There are many excellent R resources for visualization, data science, and package development. Hundreds of scattered vignettes, web pages, and forums explain how to use R in particular domains. But little has been written on how to simply make R work effectively—until now. This hands-on book teaches novices and experienced R users how to write efficient R code. Drawing on years of experience teaching R courses, authors Colin Gillespie and Robin Lovelace provide practical advice on a range of topics—from optimizing the set-up of RStudio to leveraging C++—that make this book a useful addition to any R user’s bookshelf.

Improve the outcome of your data experiments with A-B testing

Data scientists are faced with the need to conduct continual experiments, particularly regarding user interface and product marketing. Designing experiments is a cornerstone of the practice of statistics, with clear application to data science. In this lesson, you’ll learn about A-B testing and hypothesis, or significance tests—critical aspects of experimental design for data science. What you’ll learn—and how you can apply it You will learn the central concepts of A-B testing, understand its role in designing and conducting data science experiments, and the characteristics of a proper A-B test. Through a series of sample tests, you’ll learn how to interpret results, and apply that insight to your analysis of the data. Since A-B tests are typically constructed with a hypothesis in mind, you’ll also learn how to conduct various hypothesis, or significance tests, enabling you to avoid misinterpreting randomness. This lesson is for you because You are a data scientist or analyst working with data, and want to gain beginner-level knowledge of key statistical concepts to improve the design, and outcome of your experimental tests with data. Prerequisites: Basic familiarity with coding in R Materials or downloads needed: n/a

Optimizing Cassandra performance

In this lesson, we look at how to tune Cassandra to improve performance. There are a variety of settings in the configuration file and on individual tables. Although the default settings are appropriate for many use cases, there might be circumstances in which you need to change them. We’ll look at how and why to make these changes. We also see how to use the cassandra-stress test tool that ships with Cassandra to generate load against Cassandra and quickly see how it behaves under stress test circumstances. We can then tune Cassandra appropriately and feel confident that we’re ready to deploy to a production environment. What you’ll learn—and how you can apply it You’ll learn how to monitor and analyze Cassandra performance. You’ll learn about Cassandra features such as caching, memtables, commit logs, SStables, hinted handoff, compaction, and threading to improve responsiveness, consistency, and speed and reduce data loss. We’ll also look at timeout properties and JVM settings. This lesson is for you because… You are a developer, database administrator, or architect who wants to learn how to tune Cassandra. Prerequisites Understanding of Cassandra architecture and data model. If you want to run cassandra-stress Cassandra installed with a running Cassandra cluster. Materials or downloads needed A Cassandra cluster if you want to run cassandra-stress

SAS ODS Graphics Designer by Example

You just got the results from your study, and need to get some quick graphical views of your data before you begin the analysis. Do you need a crash course in the SG procedures (also known as ODS Graphics procedures) just to get a simple histogram? What should you do? The ODS Graphics Designer is the answer. With this application, you can use the interactive drag-and-drop feature to create many graphs, including histograms, box plots, scatter plot matrices, classification panels, and more. You can render your graph in batch with new data and output the results to any open ODS destination, or view the generated Graph Template Language (GTL) code as a leg-up to GTL programming. You can do all this with ease!

SAS(R) ODS Graphics Designer by Example: A Visual Guide to Creating Graphs Interactively describes in detail the features of the ODS Graphics Designer. The designer application lets you, the analyst, create graphs interactively so that you can focus on the analysis, and not on learning graph syntax. This book will take you step-by-step through the features of the designer, providing you with examples of graphs that are commonly used for the analysis of data in the health care, life sciences, and finance industries. The examples in this book will help you create just the right graph with ease!

IBM DB2 12 for z/OS Technical Overview

IBM® DB2® 12 for z/OS® delivers key innovations that increase availability, reliability, scalability, and security for your business-critical information. In addition, DB2 12 for z/OS offers performance and functional improvements for both transactional and analytical workloads and makes installation and migration simpler and faster. DB2 12 for z/OS also allows you to develop applications for the cloud and mobile devices by providing self-provisioning, multitenancy, and self-managing capabilities in an agile development environment. DB2 12 for z/OS is also the first version of DB2 built for continuous delivery. This IBM Redbooks® publication introduces the enhancements made available with DB2 12 for z/OS. The contents help database administrators to understand the new functions and performance enhancements, to plan for ways to use the key new capabilities, and to justify the investment in installing or migrating to DB2 12.

Dynamics of Structures with MATLAB® Applications by Pearsom

This book is designed for undergraduate and graduate students taking a first course in Dynamics of Structures, Structural Dynamics or Earthquake Engineering. It includes several topics on the theory of structural dynamics and the applications of this theory to the analysis of buildings, bridges, towers and other structures subjected to dynamic and earthquake forces. This comprehensive text demonstrates the applications of numerical solution techniques to a large variety of practical, real-world problems under dynamic loads.

About The Authors –

Dr Ashok K. Jain is Professor of Civil Engineering at the Indian Institute of Technology Roorkee (formerly University of Roorkee), obtained his B.E. and M.E. degrees with honours from the University of Roorkee in 1972 and 1974, and a doctorate degree from the University of Michigan, Ann Arbor, in 1978. His main areas of interest include multistoreyed buildings, concrete and steel bridges, and nonlinear seismic response of structures. Besides teaching and research, he has been a structural consultant to various state and central government agencies as well as many private companies. A recipient of several awards, he has been a research fellow at the University of Michigan; a visiting Professor at the McGill University, Montreal; Director, Malaviya National Institute of Technology, Jaipur; and Head of Civil Engineering Department, I.I.T. Roorkee.

Book Contents –

Part 1 Single degree of Freedom Systems Part 2 Multi-degree of Freedom Systems Part 3 Application to Earthquake Engineering Part 4 Wind Load Appendix 1 Measuring Earthquakes: Magnitude and Intensity Appendix 2 MATLAB Basics Answers to Selected Problems Index

Practical Data Science with Hadoop® and Spark: Designing and Building Effective Analytics at Scale

The Complete Guide to Data Science with Hadoop—For Technical Professionals, Businesspeople, and Students Demand is soaring for professionals who can solve real data science problems with Hadoop and Spark. is your complete guide to doing just that. Drawing on immense experience with Hadoop and big data, three leading experts bring together everything you need: high-level concepts, deep-dive techniques, real-world use cases, practical applications, and hands-on tutorials. Practical Data Science with Hadoop® and Spark The authors introduce the essentials of data science and the modern Hadoop ecosystem, explaining how Hadoop and Spark have evolved into an effective platform for solving data science problems at scale. In addition to comprehensive application coverage, the authors also provide useful guidance on the important steps of data ingestion, data munging, and visualization. Once the groundwork is in place, the authors focus on specific applications, including machine learning, predictive modeling for sentiment analysis, clustering for document analysis, anomaly detection, and natural language processing (NLP). This guide provides a strong technical foundation for those who want to do practical data science, and also presents business-driven guidance on how to apply Hadoop and Spark to optimize ROI of data science initiatives. Learn What data science is, how it has evolved, and how to plan a data science career How data volume, variety, and velocity shape data science use cases Hadoop and its ecosystem, including HDFS, MapReduce, YARN, and Spark Data importation with Hive and Spark Data quality, preprocessing, preparation, and modeling Visualization: surfacing insights from huge data sets Machine learning: classification, regression, clustering, and anomaly detection Algorithms and Hadoop tools for predictive modeling Cluster analysis and similarity functions Large-scale anomaly detection NLP: applying data science to human language

R for Data Science

Learn how to use R to turn raw data into insight, knowledge, and understanding. This book introduces you to R, RStudio, and the tidyverse, a collection of R packages designed to work together to make data science fast, fluent, and fun. Suitable for readers with no previous programming experience, R for Data Science is designed to get you doing data science as quickly as possible. Authors Hadley Wickham and Garrett Grolemund guide you through the steps of importing, wrangling, exploring, and modeling your data and communicating the results. You’ll get a complete, big-picture understanding of the data science cycle, along with basic tools you need to manage the details. Each section of the book is paired with exercises to help you practice what you’ve learned along the way. You’ll learn how to: Wrangle—transform your datasets into a form convenient for analysis Program—learn powerful R tools for solving data problems with greater clarity and ease Explore—examine your data, generate hypotheses, and quickly test them Model—provide a low-dimensional summary that captures true "signals" in your dataset Communicate—learn R Markdown for integrating prose, code, and results

Trade-off Analytics

Presents information to create a trade-off analysis framework for use in government and commercial acquisition environments This book presents a decision management process based on decision theory and cost analysis best practices aligned with the ISO/IEC 15288, the Systems Engineering Handbook, and the Systems Engineering Body of Knowledge. It provides a sound trade-off analysis framework to generate the tradespace and evaluate value and risk to support system decision-making throughout the life cycle. Trade-off analysis and risk analysis techniques are examined. The authors present an integrated value trade-off and risk analysis framework based on decision theory. These trade-off analysis concepts are illustrated in the different life cycle stages using multiple examples from defense and commercial domains. Provides techniques to identify and structure stakeholder objectives and creative, doable alternatives Presents the advantages and disadvantages of tradespace creation and exploration techniques for trade-off analysis of concepts, architectures, design, operations, and retirement Covers the sources of uncertainty in the system life cycle and examines how to identify, assess, and model uncertainty using probability Illustrates how to perform a trade-off analysis using the INCOSE Decision Management Process using both deterministic and probabilistic techniques Trade-off Analytics: Creating and Exploring the System Tradespace is written for upper undergraduate students and graduate students studying systems design, systems engineering, industrial engineering and engineering management. This book also serves as a resource for practicing systems designers, systems engineers, project managers, and engineering managers. is a Research Professor in the Department of Industrial Engineering at the University of Arkansas. He is also a senior principal with Innovative Decisions, Inc., a decision and risk analysis firm and has served as Chairman of the Board. Dr. Parnell has published more than 100 papers and book chapters and was lead editor of Gregory S. Parnell, PhD, Decision Making for Systems Engineering and Management, Wiley Series in Systems Engineering (2nd Ed, Wiley 2011) and lead author of the Handbook of Decision Analysis (Wiley 2013). He is a fellow of INFORMS, the INCOSE, MORS, and the Society for Decision Professionals.

Beginning Elastic Stack

Learn how to install, configure and implement the Elastic Stack (Elasticsearch, Logstash and Kibana) – the invaluable tool for anyone deploying a centralized log management solution for servers and apps. You will see how to use and configure Elastic Stack independently and alongside Puppet. Each chapter includes real-world examples and practical troubleshooting tips, enabling you to get up and running with Elastic Stack in record time. Fully customizable and easy to use, Elastic Stack enables you to be on top of your servers all the time, and resolve problems for your clients as fast as possible. Supported by Puppet and available with various plugins. Get started with Beginning Elastic Stack today and see why many consider Elastic Stack the best option for server log management. What You Will Learn: Install and configure Logstash Use Logstash with Elasticsearch and Kibana Use Logstash with Puppet and Foreman Centralize data processing Who This Book Is For: Anyone working on multiple servers who needs to search their logs using a web interface. It is ideal for server administrators who have just started their job and need to look after multiple servers efficiently.

Expert Hadoop® Administration

The Comprehensive, Up-to-Date Apache Hadoop Administration Handbook and Reference “Sam Alapati has worked with production Hadoop clusters for six years. His unique depth of experience has enabled him to write the go-to resource for all administrators looking to spec, size, expand, and secure production Hadoop clusters of any size.” –Paul Dix, Series Editor In leading Hadoop administrator Sam R. Alapati brings together authoritative knowledge for creating, configuring, securing, managing, and optimizing production Hadoop clusters in any environment. Drawing on his experience with large-scale Hadoop administration, Alapati integrates action-oriented advice with carefully researched explanations of both problems and solutions. He covers an unmatched range of topics and offers an unparalleled collection of realistic examples. Expert Hadoop® Administration, Alapati demystifies complex Hadoop environments, helping you understand exactly what happens behind the scenes when you administer your cluster. You’ll gain unprecedented insight as you walk through building clusters from scratch and configuring high availability, performance, security, encryption, and other key attributes. The high-value administration skills you learn here will be indispensable no matter what Hadoop distribution you use or what Hadoop applications you run. Understand Hadoop’s architecture from an administrator’s standpoint Create simple and fully distributed clusters Run MapReduce and Spark applications in a Hadoop cluster Manage and protect Hadoop data and high availability Work with HDFS commands, file permissions, and storage management Move data, and use YARN to allocate resources and schedule jobs Manage job workflows with Oozie and Hue Secure, monitor, log, and optimize Hadoop Benchmark and troubleshoot Hadoop

Mastering Tableau

Mastering Tableau is your comprehensive guide to becoming highly skilled in Tableau, focusing on advanced data visualization and practical applications. You will learn how to create complex dashboards, integrate R, and make the most of Tableau's features to deliver compelling insights. By the end of the book, you'll be ready to tackle real-world business intelligence challenges. What this Book will help me do Master advanced Tableau calculations such as row-level and aggregate-level calculations. Create engaging and efficient dashboards for professional data presentations. Integrate R functionalities with Tableau for predictive and advanced analytics. Design and implement custom geographic visualizations, including polygon maps. Optimize performance and best practices in Tableau for innovative BI solutions. Author(s) Jen Stirrup and None Baldwin are experienced data analysts and Tableau experts with years of practical experience in consulting and teaching. Jen has contributed significantly to the Tableau community through workshops and talks. Together, they provide structured guidance that helps readers master Tableau while emphasizing hands-on learning. Who is it for? This book is for business analysts aiming to enhance their data visualization skills using Tableau. Whether you are an intermediate Tableau user looking to tackle advanced techniques or someone wanting to streamline your BI workflows, this book focuses on practical problem-solving. It equips you to use Tableau effectively to create impactful visualizations and insights.

Who Knew You Could Do That with RPG IV? Modern RPG for the Modern Programmer

Application development is a key part of IBM® i businesses. The IBM i operating system is a modern, robust platform to create and develop applications. The RPG language has been around for a long time, but is still being transformed into a modern business language. This IBM Redbooks® publication is focused on helping the IBM i development community understand the modern RPG language. The world of application development has been rapidly changing over the past years. The good news is that IBM i has been changing right along with it, and has made significant changes to the RPG language. This book is intended to help developers understand what modern RPG looks like and how to move from older versions of RPG to a newer, modern version. Additionally, it covers the basics of Integrated Language Environment® (ILE), interfacing with many other languages, and the best tools for doing development on IBM i. Using modern tools, methodologies, and languages are key to continuing to stay relevant in today's world. Being able to find the right talent for your company is key to your continued success. Using the guidelines and principles in this book can help set you up to find that talent today and into the future. This publication is the result of work that was done by IBM, industry experts, business partners, and some of the original authors of the first edition of this book. This information is important not only for developers, but also business decision makers (CIO for example) to understand that the IBM i is not an 'old' system. IBM i has modern languages and tools. It is a matter of what you choose to do with the IBM i that defines its age.

MDX with Microsoft SQL Server 2016 Analysis Services Cookbook - Third Edition

Dive into the world of multidimensional data analysis with "MDX with Microsoft SQL Server 2016 Analysis Services Cookbook." This book provides over 70 practical recipes to help you understand and utilize MDX queries and calculations effectively. What this Book will help me do Master the fundamentals of MDX concepts and their applications. Learn to create time-aware calculations using the Time dimension. Develop skills to write efficient and flexible MDX queries. Gain insights into creating compact and efficient analytical reports. Understand advanced techniques for capturing MDX queries and metadata-driven calculations. Author(s) None Li and Tomislav Piasevoli are accomplished experts in multidimensional data analysis and business intelligence. Drawing from extensive experience, they offer readers a well-structured and comprehensive approach to mastering MDX. Their pedagogy emphasizes practical, real-world examples promoting clear understanding. Who is it for? This volume is designed for database administrators, multidimensional cube developers, and report writers looking to enhance their strengths in MDX. Readers with intermediate exposure to multidimensional databases will particularly benefit. It also serves as a valuable resource for business analysts and power users aiming to boost data analysis capabilities.

Style and Statistics

A non-technical guide to leveraging retail analytics for personal and competitive advantage Style & Statistics is a real-world guide to analytics in retail. Written specifically for the non-IT crowd, this book explains analytics in an approachable, understandable way, and provides examples of direct application to retail merchandise management, marketing, and operations. The discussion covers current industry trends and emerging-standard processes, and illustrates how analytics is providing new solutions to perennial retail problems. You'll learn how to leverage the benefits of analytics to boost your personal career, and how to interpret data in a way that's useful to the average end business user or shopper. Key concepts are detailed in easy-to-understand language, and numerous examples highlight the growing importance of understanding analytics in the retail environment. The power of analytics has become apparent across industries, but it's left an especially indelible mark on retail. It's a complex topic, but you don't need to be a data scientist to take advantage of the opportunities it brings. This book shows you what you need to know, and how to put analytics to work with retail-specific applications. Learn how analytics can help you be better at your job Dig deeper into the customer's needs, wants, and dreams Streamline merchandise management, pricing, marketing, and more Find solutions for inefficiencies and inaccuracies As the retail customer evolves, so must the retail industry. The retail landscape not only includes in-store but also website, mobile site, mobile apps, and social media . With more and more competition emerging on all sides, retailers need to use every tool at their disposal to create value and gain a competitive advantage. Analytics offers a number of ways to make your company stand out, whether it's through improved operations, customer experience, or any of the other myriad factors that build a great place to shop. Style & Statistics provides an analytics primer with a practical bent, specifically for the retail industry.

Tableau 10 Business Intelligence Cookbook

Tableau 10 Business Intelligence Cookbook is your comprehensive guide to mastering data analysis and visualization using Tableau 10. You will gain confidence in creating powerful, interactive dashboards and visualizations that not only look great but also help convey critical insights effectively. What this Book will help me do Create and customize effective charts including bar charts, line graphs, and scatter plots. Build interactive dashboards that combine visualizations into cohesive data presentations. Leverage Tableau's calculated fields and parameters to implement advanced data transformations. Prepare and clean your data for analysis using built-in Tableau tools to ensure accuracy. Utilize geospatial and mapping features to visualize geographic and location-based data effectively. Author(s) Donabel Santos is an experienced data specialist and Tableau expert with a passion for teaching data visualization techniques. Paul Banoub, a seasoned business intelligence professional, brings practical insights into crafting effective data strategies using Tableau. Together, they create a book that empowers professionals to realize their data visualization goals. Who is it for? This book is ideal for business professionals, data analysts, and technology experts looking to enhance their Tableau skills. Beginners will find the recipes approachable thanks to the step-by-step guidance, while more advanced users will appreciate the depth of techniques covered. Whether you analyze data for business intelligence or strategic planning, this book will provide tools to expand your capabilities.

SQL Server 2016 Reporting Services Cookbook

Dive into the world of Microsoft SQL Server 2016 Reporting Services with this cookbook-style guide that covers operational reporting and mobile dashboards. By following clear, task-oriented recipes, you'll quickly learn how to leverage SSRS 2016 for creating advanced, visually appealing, and functional reports to improve your reporting workflows and decision-making processes. What this Book will help me do Understand the architectural components and key features of SQL Server 2016 Reporting Services. Create advanced reporting solutions tailored to your organization's needs using step-by-step recipes. Utilize Power BI and mobile reporting capabilities for more interactive and accessible data insights. Master administration, security, and performance optimization of reporting environments. Integrate reporting solutions into .NET applications for custom business intelligence enhancements. Author(s) None Priyankara is an industry expert with years of experience in data warehousing and reporting solutions, bringing practical insights to the complex world of SQL Server Reporting Services. Co-author Robert Cain is a seasoned technology trainer and consultant specializing in SQL Server and Power BI. Together, they provide a comprehensive, hands-on guide rooted in real-world applications and best practices. Who is it for? This book is designed for software professionals who are involved in reporting and business intelligence, such as software engineers, architects, and DW/BI experts. If you're responsible for designing, implementing, or managing reporting platforms and want to explore SSRS 2016's capabilities, this is the perfect guide for you.