talk-data.com talk-data.com

Topic

data

5765

tagged

Activity Trend

3 peak/qtr
2020-Q1 2026-Q1

Activities

5765 activities · Newest first

MongoDB Basics

Need a quick and easy to understand introduction to MongoDB and NoSQL databases? MongoDB Basics, from The Definitive Guide to MongoDB, 2E, shows you how a document-oriented database system differs from a relational database, and how to install and get started using it. You'll also learn MongoDB design basics, including geospatial indexing, how to navigate, view, and query your database, and how to use GridFS with a bit of Python.

Probability: An Introduction with Statistical Applications, 2nd Edition

Praise for the First Edition "This is a well-written and impressively presented introduction to probability and statistics. The text throughout is highly readable, and the author makes liberal use of graphs and diagrams to clarify the theory." - The Statistician Thoroughly updated, Probability: An Introduction with Statistical Applications, Second Edition features a comprehensive exploration of statistical data analysis as an application of probability. The new edition provides an introduction to statistics with accessible coverage of reliability, acceptance sampling, confidence intervals, hypothesis testing, and simple linear regression. Encouraging readers to develop a deeper intuitive understanding of probability, the author presents illustrative geometrical presentations and arguments without the need for rigorous mathematical proofs. The Second Edition features interesting and practical examples from a variety of engineering and scientific fields, as well as: Over 880 problems at varying degrees of difficulty allowing readers to take on more challenging problems as their skill levels increase Chapter-by-chapter projects that aid in the visualization of probability distributions New coverage of statistical quality control and quality production An appendix dedicated to the use of Mathematica® and a companion website containing the referenced data sets Featuring a practical and real-world approach, this textbook is ideal for a first course in probability for students majoring in statistics, engineering, business, psychology, operations research, and mathematics. Probability: An Introduction with Statistical Applications, Second Edition is also an excellent reference for researchers and professionals in any discipline who need to make decisions based on data as well as readers interested in learning how to accomplish effective decision making from data.

Even You Can Learn Statistics and Analytics: An Easy to Understand Guide to Statistics and Analytics, Third Edition

Related Content Even You Can Learn Statistics, Fourth Edition, is now available with new and expanded content. Thought you couldn’t learn statistics? You can – and you will! Even You Can Learn Statistics and Analytics, Third Edition is the practical, up-to-date introduction to statistics – for everyone! Now fully updated for "big data" analytics and the newest applications, it'll teach you all the statistical techniques you’ll need for finance, marketing, quality, science, social science, and more – one easy step at a time. Simple jargon-free explanations help you understand every technique, and extensive practical examples and worked problems give you all the hands-on practice you'll need. This edition contains more practical examples than ever – all updated for the newest versions of Microsoft Excel. You'll find downloadable practice files, templates, data sets, and sample models – including complete solutions you can put right to work! Learn how to do all this, and more: Apply statistical techniques to analyze huge data sets and transform them into valuable knowledge Construct and interpret statistical charts and tables with Excel or OpenOffice.org Calc 3 Work with mean, median, mode, standard deviation, Z scores, skewness, and other descriptive statistics Use probability and probability distributions Work with sampling distributions and confidence intervals Test hypotheses with Z, t, chi-square, ANOVA, and other techniques Perform powerful regression analysis and modeling Use multiple regression to develop models that contain several independent variables Master specific statistical techniques for quality and Six Sigma programs Hate math? No sweat. You’ll be amazed at how little you need. Like math? Optional "Equation Blackboard" sections reveal the mathematical foundations of statistics right before your eyes. If you need to understand, evaluate, or use statistics in business, academia, or anywhere else, this is the book you've been searching for!

Neo4j in Action

Neo4j in Action is a comprehensive guide to Neo4j, aimed at application developers and software architects. Using hands-on examples, you'll learn to model graph domains naturally with Neo4j graph structures. The book explores the full power of native Java APIs for graph data manipulation and querying. About the Technology Much of the data today is highly connected--from social networks to supply chains to software dependency management--and more connections are continually being uncovered. Neo4j is an ideal graph database tool for highly connected data. It is mature, production-ready, and unique in enabling developers to simply and efficiently model and query connected data. About the Book Neo4j in Action is a comprehensive guide to designing, implementing, and querying graph data using Neo4j. Using hands-on examples, you'll learn to model graph domains naturally with Neo4j graph structures. The book explores the full power of native Java APIs for graph data manipulation and querying. It also covers Cypher, Neo4j's graph query language. Along the way, you'll learn how to integrate Neo4j into your domain-driven app using Spring Data Neo4j, as well as how to use Neo4j in standalone server or embedded modes. What's Inside Graph database patterns How to model data in social networks How to use Neo4j in your Java applications How to configure and set up Neo4j About the Reader Knowledge of Java basics is required. No prior experience with graph data or Neo4j is assumed. About the Authors Aleksa Vukotic is an architect specializing in graph data models. Nicki Watt, Dominic Fox, Tareq Abedrabbo, and Jonas Partner work at OpenCredo, a Neo Technology partner, and have been involved in many projects using Neo4j. Quotes A pragmatic programmatic tour through Neo4j’s APIs and query language. - From the Foreword by Jim Webber and Ian Robinson, Neo Technology Excellent coverage of one of the most successful NoSQL products. - Pouria Amirian, PhD, University of Oxford A great resource for rethinking your data storage using graphs in Neo4j. - Stephen Kitt, ERDF

Learning PHP, MySQL & JavaScript, 4th Edition

Build interactive, data-driven websites with the potent combination of open-source technologies and web standards, even if you have only basic HTML knowledge. With this popular hands-on guide, you’ll tackle dynamic web programming with the help of today’s core technologies: PHP, MySQL, JavaScript, jQuery, CSS, and HTML5.

Time Series Databases: New Ways to Store and Access Data

Time series data is of growing importance, especially with the rapid expansion of the Internet of Things. This concise guide shows you effective ways to collect, persist, and access large-scale time series data for analysis. You’ll explore the theory behind time series databases and learn practical methods for implementing them. Authors Ted Dunning and Ellen Friedman provide a detailed examination of open source tools such as OpenTSDB and new modifications that greatly speed up data ingestion.

Create Web Charts with D3

Create Web Charts with D3 shows how to convert your data into eye-catching, innovative, animated, and highly interactive browser-based charts. This book is suitable for developers of all experience levels and needs: if you want power and control and need to create data visualization beyond traditional charts, then D3 is the JavaScript library for you. By the end of the book, you will have a good knowledge of all the elements needed to manage data from every possible source, from high-end scientific instruments to Arduino boards, from PHP SQL databases queries to simple HTML tables, and from Matlab calculations to reports in Excel. This book contains content previously published in Beginning JavaScript Charts. Create all kinds of charts using the latest technologies available on browsers Full of step-by-step examples, Create Web Charts with D3 introduces you gradually to all aspects of chart development, from the data source to the choice of which solution to apply. This book provides a number of tools that can be the starting point for any project requiring graphical representations of data, whether using commercial libraries or your own

Fundamentals of Database Indexing and Searching

Fundamentals of Database Indexing and Searching presents well-known database searching and indexing techniques. It focuses on similarity search queries, showing how to use distance functions to measure the notion of dissimilarity. After defining database queries and similarity search queries, the book organizes the most common and representative index structures according to their characteristics. The author first describes low-dimensional index structures, memory-based index structures, and hierarchical disk-based index structures. He then outlines useful distance measures and index structures that use the distance information to efficiently solve similarity search queries. Focusing on the difficult dimensionality phenomenon, he also presents several indexing methods that specifically deal with high-dimensional spaces. In addition, the book covers data reduction techniques, including embedding, various data transforms, and histograms. Through numerous real-world examples, this book explores how to effectively index and search for information in large collections of data. Requiring only a basic computer science background, it is accessible to practitioners and advanced undergraduate students.

Developing Credit Risk Models Using SAS Enterprise Miner and SAS/STAT

Combine complex concepts facing the financial sector with the software toolsets available to analysts.

The credit decisions you make are dependent on the data, models, and tools that you use to determine them. Developing Credit Risk Models Using SAS Enterprise Miner and SAS/STAT: Theory and Applications combines both theoretical explanation and practical applications to define as well as demonstrate how you can build credit risk models using SAS Enterprise Miner and SAS/STAT and apply them into practice.

The ultimate goal of credit risk is to reduce losses through better and more reliable credit decisions that can be developed and deployed quickly. In this example-driven book, Dr. Brown breaks down the required modeling steps and details how this would be achieved through the implementation of SAS Enterprise Miner and SAS/STAT.

Users will solve real-world risk problems as well as comprehensively walk through model development while addressing key concepts in credit risk modeling. The book is aimed at credit risk analysts in retail banking, but its applications apply to risk modeling outside of the retail banking sphere. Those who would benefit from this book include credit risk analysts and managers alike, as well as analysts working in fraud, Basel compliancy, and marketing analytics. It is targeted for intermediate users with a specific business focus and some programming background is required.

Efficient and effective management of the entire credit risk model lifecycle process enables you to make better credit decisions. Developing Credit Risk Models Using SAS Enterprise Miner and SAS/STAT: Theory and Applications demonstrates how practitioners can more accurately develop credit risk models as well as implement them in a timely fashion.

This book is part of the SAS Press Program.

JMP Essentials, 2nd Edition

Grasp essential steps in order to generate meaningful results quickly with JMP.

JMP Essentials: An Illustrated Step-by-Step Guide for New Users, Second Edition is designed for the new or occasional JMP user who needs to generate meaningful graphs or results quickly. Drawing on their own experience working with these customers, the authors provide essential steps for what new users typically need to carry out with JMP. This newest edition has all new instructions and screen shots reflecting the latest release of JMP software. In addition, it has eight new detailed sections and 10 new subsections that include creating maps, filtering data, creating dashboards, and working with Excel data, all of which highlight new, useful and basic level enhancements to JMP.

The format of the book is unique. It adopts a show-and-tell design with essential step-by-step instructions and corresponding screen illustrations, which help users quickly see how to generate the desired results. In most cases, each section completes a JMP task, which maximizes the book's utility as a reference. In addition, each chapter contains a family of features that are carefully crafted to first introduce you to basic features and then on to more advanced ones. JMP Essentials: An Illustrated Step-by-Step Guide for New Users, Second Edition is the quickest and most accessible reference book available.

This is part of the SAS Press program.

Optical Fiber Communication Systems with MATLAB® and Simulink® Models, 2nd Edition

Carefully structured to instill practical knowledge of fundamental issues, Optical Fiber Communication Systems with MATLABdescribes the modeling of optically amplified fiber communications systems using MATLAB ® and Simulink ® Models ® and Simulink ®. This lecture-based book focuses on concepts and interpretation, mathematical procedures, and engineering applications, shedding light on device behavior and dynamics through computer modeling. Supplying a deeper understanding of the current and future state of optical systems and networks, this Second Edition: Reflects the latest developments in optical fiber communications technology Includes new and updated case studies, examples, end-of-chapter problems, and MATLAB ® and Simulink ® models Emphasizes DSP-based coherent reception techniques essential to advancement in short- and long-term optical transmission networks Optical Fiber Communication Systems with MATLAB ® and Simulink ® Models, Second Edition is intended for use in university and professional training courses in the specialized field of optical communications. This text should also appeal to students of engineering and science who have already taken courses in electromagnetic theory, signal processing, and digital communications, as well as to optical engineers, designers, and practitioners in industry.

SAS Certification Prep Guide, 4th Edition
Businesses rely on career professionals with strong SAS knowledge and skills. Set yourself apart from the competition by earning the only globally recognized credential endorsed by SAS.

The SAS Certification Prep Guide: Advanced Programming for SAS 9, Fourth Edition, prepares you to take the Advanced Programming for SAS 9 exam. Major topics include SQL processing with SAS, the SAS macro language, advanced SAS programming techniques, and optimizing SAS programs, as well as a new chapter on creating functions with PROC FCMP. You will also become familiar with the enhancements and new functionality that are available in SAS 9.

New or experienced SAS users will find this guide to be an invaluable resource that covers the objectives tested on the exam. The text contains quizzes that enable you to test your understanding of material in each chapter. Quiz solutions are included at the end of the book. Candidates must earn the SAS Certified Base Programmer for SAS 9 Credential before taking the SAS Advanced Programming for SAS 9 exam.

You’ll find instructions on how to obtain sample data when accessing SAS through SAS Enterprise Guide, SAS Studio, SAS University Edition, and the SAS windowing environment. This edition provides significant improvements to numerous examples, making the code even more efficient.

Experience is a critical component to becoming a SAS Certified Professional. This comprehensive guide along with training in SAS SQL1, SAS Macro Language 1, and SAS Programming 3 are valuable resources designed to help you prepare for the Advanced SAS Certification exam.

Sparse Modeling

Sparse models are particularly useful in scientific applications, such as biomarker discovery in genetic or neuroimaging data, where the interpretability of a predictive model is essential. Sparsity can also dramatically improve the cost efficiency of signal processing. Sparse Modeling: Theory, Algorithms, and Applications provides an introduction to the growing field of sparse modeling, including application examples, problem formulations that yield sparse solutions, algorithms for finding such solutions, and recent theoretical results on sparse recovery. The book gets you up to speed on the latest sparsity-related developments and will motivate you to continue learning about the field. The authors first present motivating examples and a high-level survey of key recent developments in sparse modeling. The book then describes optimization problems involving commonly used sparsity-enforcing tools, presents essential theoretical results, and discusses several state-of-the-art algorithms for finding sparse solutions. The authors go on to address a variety of sparse recovery problems that extend the basic formulation to more sophisticated forms of structured sparsity and to different loss functions. They also examine a particular class of sparse graphical models and cover dictionary learning and sparse matrix factorizations.

Test Scoring and Analysis Using SAS

Develop your own multiple-choice tests, score students, produce student rosters (in print form or Excel), and explore item response theory (IRT).

Aimed at nonstatisticians working in education or training, Test Scoring and Analysis Using SAS describes item analysis and test reliability in easy-to-understand terms, and teaches you SAS programming to score tests, perform item analysis, and estimate reliability. Maximizing flexibility, the scoring and analysis programs enable you to analyze tests with multiple versions, define alternate correct responses for selected items, and repeat the scoring with selected items deleted.

You will be guided step-by-step on how to design multiple-choice items, use analysis to improve your tests, and even detect cheating on students’ submitted multiple-choice tests. Other subjects addressed include reading in data from a variety of sources (text files and Excel workbooks, for example), detecting errors in the input data, and producing class rosters in printed form or Excel workbooks. Also included is a chapter on IRT—widely used in education to calibrate and evaluate items in tests in education such as the SAT and GRE—with instructions for running the new SAS procedure PROC IRT.

This book is part of the SAS Press program.

The Essential Guide to SAS Dates and Times, Second Edition, 2nd Edition

Why does SAS use January 1, 1960 as its arbitrary reference date? How do you convert a value such as 27 January 2003 into a SAS date? How do you put a date into a filename, or label an Excel worksheet with the date?

You'll find the answers to these questions and much more in Derek Morgan's Essential Guide to SAS Dates and Times, Second Edition, which makes it easy to understand how to use and manipulate dates, times, and datetimes in SAS. Updated for SAS 9.4, with additional functions, formats, and capabilities, the Second Edition has a new chapter dedicated to the ISO 8601 standard and the formats and functions that are new to SAS, including how SAS works with Universal Coordinated Time (UTC).

Novice users will appreciate the new "Troubleshooting" appendix, which discusses questions common to newer SAS users in a conversational way and provides clear examples of simple solutions to these questions. Both novice and intermediate users will find the clear, task-based examples on how to accomplish date-related tasks and the detailed explanations of standard formats and functions invaluable. Users working with intervals will appreciate the expanded discussion of the topic, which details the new custom interval capability, among other enhancements to intervals.

Users working with international dates and times will benefit from the detailed discussion of the NLS facility as it relates to dates and times. Included are bonus "Quick Reference Guides" that list both the standard date and time formats and the NLS date and time formats with examples. These guides illustrate how each format displays the same date, time, or datetime, so you can find the format you want to use at a glance.

The Essential Guide to SAS Dates and Times, Second Edition is the most complete and up-to-date collection of examples on how to write complex programs involving dates, times, or datetime values.

This book is part of the SAS Press Program.

Visualization Analysis and Design

This book provides a systematic, comprehensive framework for thinking about visualization in terms of principles and design choices. It features a unified approach encompassing information visualization techniques for abstract data, scientific visualization techniques for spatial data, and visual analytics techniques for interweaving data transformation and analysis with interactive visual exploration. Suitable for both beginners and more experienced designers, the book does not assume any experience with programming, mathematics, human-computer interaction, or graphic design.

Statistical Graphics Procedures by Example

Sanjay Matange and Dan Heath's Statistical Graphics Procedures by Example: Effective Graphs Using SAS shows the innumerable capabilities of SAS Statistical Graphics (SG) procedures. The authors begin with a general discussion of the principles of effective graphics, ODS Graphics, and the SG procedures. They then move on to show examples of the procedures' many features. The book is designed so that you can easily flip through it, find the graph you need, and view the code right next to the example. Among the topics included are how to combine plot statements to create custom graphs; customizing graph axes, legends, and insets; advanced features, such as annotation and attribute maps; tips and tricks for creating the optimal graph for the intended usage; real-world examples from the health and life sciences domain; and ODS styles. The procedures in Statistical Graphics Procedures by Example are specifically designed for the creation of analytical graphs. That makes this book a must-read for analysts and statisticians in the health care, clinical trials, financial, and insurance industries. However, you will find that the examples here apply to all fields. This book is part of the SAS Press program.

GeoServer Cookbook

Unlock the full potential of GeoServer and master the art of serving dynamic maps and geospatial services by using this comprehensive cookbook. With practical, step-by-step instructions, you'll learn advanced techniques to optimize your GeoServer installations, style maps, and handle advanced configurations seamlessly. What this Book will help me do Optimize GeoServer for efficient handling of vector and raster data, ensuring excellent performance for GIS applications. Create visually dynamic and customized maps using advanced CSS styling techniques tailored for GeoServer. Expand the capabilities of your maps by incorporating time and elevation dimensions. Master database configurations, coordinate reference systems handling, and GeoWebCache to enhance GIS system efficiency. Automate and streamline GeoServer configurations to ensure consistent and effective deployment processes. Author(s) None Iacovella, a seasoned expert in GIS technologies with extensive experience in geospatial applications, provides readers with a hands-on and practical approach in this book. Leveraging years of working with GeoServer, their guidance is clear, precise, and comprehensive. Iacovella's focus on real-world applications makes their writing an invaluable resource for GIS practitioners. Who is it for? This book is designed for GIS experts, developers, and system administrators who aim to build professional-grade map services using GeoServer. It is ideal for individuals who already have a foundational understanding of GIS concepts and basic GeoServer usage. Whether you're looking to optimize performance, experiment with advanced configurations, or generate visually striking geographic data representations, this book will be incredibly beneficial. If you're an aspiring geospatial professional, this guide will help you elevate your skills to the next level.

Predictive Analytics and Data Mining

Put Predictive Analytics into ActionLearn the basics of Predictive Analysis and Data Mining through an easy to understand conceptual framework and immediately practice the concepts learned using the open source RapidMiner tool. Whether you are brand new to Data Mining or working on your tenth project, this book will show you how to analyze data, uncover hidden patterns and relationships to aid important decisions and predictions. Data Mining has become an essential tool for any enterprise that collects, stores and processes data as part of its operations. This book is ideal for business users, data analysts, business analysts, business intelligence and data warehousing professionals and for anyone who wants to learn Data Mining.You’ll be able to:1. Gain the necessary knowledge of different data mining techniques, so that you can select the right technique for a given data problem and create a general purpose analytics process.2. Get up and running fast with more than two dozen commonly used powerful algorithms for predictive analytics using practical use cases.3. Implement a simple step-by-step process for predicting an outcome or discovering hidden relationships from the data using RapidMiner, an open source GUI based data mining tool Predictive analytics and Data Mining techniques covered: Exploratory Data Analysis, Visualization, Decision trees, Rule induction, k-Nearest Neighbors, Naïve Bayesian, Artificial Neural Networks, Support Vector machines, Ensemble models, Bagging, Boosting, Random Forests, Linear regression, Logistic regression, Association analysis using Apriori and FP Growth, K-Means clustering, Density based clustering, Self Organizing Maps, Text Mining, Time series forecasting, Anomaly detection and Feature selection. Implementation files can be downloaded from the book companion site at www.LearnPredictiveAnalytics.com Demystifies data mining concepts with easy to understand language Shows how to get up and running fast with 20 commonly used powerful techniques for predictive analysis Explains the process of using open source RapidMiner tools Discusses a simple 5 step process for implementing algorithms that can be used for performing predictive analytics Includes practical use cases and examples

Data Architecture: A Primer for the Data Scientist

Today, the world is trying to create and educate data scientists because of the phenomenon of Big Data. And everyone is looking deeply into this technology. But no one is looking at the larger architectural picture of how Big Data needs to fit within the existing systems (data warehousing systems). Taking a look at the larger picture into which Big Data fits gives the data scientist the necessary context for how pieces of the puzzle should fit together. Most references on Big Data look at only one tiny part of a much larger whole. Until data gathered can be put into an existing framework or architecture it can’t be used to its full potential. Data Architecture a Primer for the Data Scientist addresses the larger architectural picture of how Big Data fits with the existing information infrastructure, an essential topic for the data scientist. Drawing upon years of practical experience and using numerous examples and an easy to understand framework. W.H. Inmon, and Daniel Linstedt define the importance of data architecture and how it can be used effectively to harness big data within existing systems. You’ll be able to: Turn textual information into a form that can be analyzed by standard tools. Make the connection between analytics and Big Data Understand how Big Data fits within an existing systems environment Conduct analytics on repetitive and non-repetitive data Discusses the value in Big Data that is often overlooked, non-repetitive data, and why there is significant business value in using it Shows how to turn textual information into a form that can be analyzed by standard tools Explains how Big Data fits within an existing systems environment Presents new opportunities that are afforded by the advent of Big Data Demystifies the murky waters of repetitive and non-repetitive data in Big Data