talk-data.com talk-data.com

Topic

data-science

2252

tagged

Activity Trend

1 peak/qtr
2020-Q1 2026-Q1

Activities

2252 activities · Newest first

21 Recipes for Mining Twitter

Millions of public Twitter streams harbor a wealth of data, and once you mine them, you can gain some valuable insights. This short and concise book offers a collection of recipes to help you extract nuggets of Twitter information using easy-to-learn Python tools. Each recipe offers a discussion of how and why the solution works, so you can quickly adapt it to fit your particular needs. The recipes include techniques to: Use OAuth to access Twitter data Create and analyze graphs of retweet relationships Use the streaming API to harvest tweets in realtime Harvest and analyze friends and followers Discover friendship cliques Summarize webpages from short URLs This book is a perfect companion to O’Reilly's Mining the Social Web.

Practical Applications of Data Mining

Practical Applications of Data Mining emphasizes both theory and applications of data mining algorithms. Various topics of data mining techniques are identified and described throughout, including clustering, association rules, rough set theory, probability theory, neural networks, classification, and fuzzy logic. Each of these techniques is explored with a theoretical introduction and its effectiveness is demonstrated with various chapter examples. This book will help any database and IT professional understand how to apply data mining techniques to real-world problems.

Following an introduction to data mining principles, Practical Applications of Data Mining introduces association rules to describe the generation of rules as the first step in data mining. It covers classification and clustering methods to show how data can be classified to retrieve information from data. Statistical functions and drough set theory are discussed to demonstrate how statistical and rough set formulas can be used for data analytics and knowlege discovery. Neural networks is an important branch in computational intelligence. It is introduced and explored in the text to investigate the role of neural network algorithms in data analytics.

Entity Resolution and Information Quality

Entity Resolution and Information Quality presents topics and definitions, and clarifies confusing terminologies regarding entity resolution and information quality. It takes a very wide view of IQ, including its six-domain framework and the skills formed by the International Association for Information and Data Quality {IAIDQ). The book includes chapters that cover the principles of entity resolution and the principles of Information Quality, in addition to their concepts and terminology. It also discusses the Fellegi-Sunter theory of record linkage, the Stanford Entity Resolution Framework, and the Algebraic Model for Entity Resolution, which are the major theoretical models that support Entity Resolution. In relation to this, the book briefly discusses entity-based data integration (EBDI) and its model, which serve as an extension of the Algebraic Model for Entity Resolution. There is also an explanation of how the three commercial ER systems operate and a description of the non-commercial open-source system known as OYSTER. The book concludes by discussing trends in entity resolution research and practice. Students taking IT courses and IT professionals will find this book invaluable. First authoritative reference explaining entity resolution and how to use it effectively Provides practical system design advice to help you get a competitive advantage Includes a companion site with synthetic customer data for applicatory exercises, and access to a Java-based Entity Resolution program.

Pro SharePoint 2010 Business Intelligence Solutions

Business Intelligence (BI) software is everywhere. It is the code and tools that allow you to view different components of a business using a single visual platform, making comprehending mountains of data easier. With SharePoint 2010, Microsoft has married one of the most-used team collaboration tools with its business intelligence products, including PerformancePoint Services and Excel Services. Now, users of SharePoint can have a single place to get a view of their business, its organization and processes, and its financial performance. Constructing this type of portal is a process unique to each business. With Pro SharePoint 2010 Business Intelligence Solutions, authors Sahil Malik and Srini Sistla take you through the different tools, services, and frameworks available for building a BI portal atop SharePoint 2010. With comprehensive coverage of Visio Services, PerformancePoint, Excel Services, and a practical look at building scorecards, dashboards, and other reporting solutions, this book answers the fundamental questions about why and how BI can work for your business.

MATLAB®: An Introduction with Applications, Fourth Edition

MATLAB: An Introduction with Applications 4th Edition walks readers through the ins and outs of this powerful software for technical computing. The first chapter describes basic features of the program and shows how to use it in simple arithmetic operations with scalars. The next two chapters focus on the topic of arrays (the basis of MATLAB), while the remaining text covers a wide range of other applications. MATLAB: An Introduction with Applications 4th Edition is presented gradually and in great detail, generously illustrated through computer screen shots and step-by-step tutorials, and applied in problems in mathematics, science, and engineering.

Decision Support Systems for Business Intelligence, Second Edition

This book examines decision making in general, the translation of knowledge about decision making into a DSS model, and the actual programming of a DSS. In addition, it combines the theoretical underpinnings of the topic with practical application using tools and technology currently available. Topics are discussed on three levels: general theory, implications for DSS design, and code development. This approach provides readers with practical examples than can be adopted into systems design. The Second Edition has been completely updated to reflect new technologies as well as the demands upon technology that have evolved since the publication of the First Edition in 1996. The book utilizes a combination of Dreamweaver and Cold Fusion, which are both popular software products that follow industry standards. (The First Edition utilized Level 5 Object, but all references to the software have been removed as it is not deemed a viable tool any longer.) In light of the numerous advances in the field of DSS over the years, there are many topics throughout the book that have updated and revised. For example, data warehousing has increased substantially in importance since the First Edition, and this section has been completely revised. The topic of business intelligence has been added, and similarly, data mining coverage has been increased via additional discussion and examples. In addition, transnational corporations have become more prevalent and are addressed accordingly.

Computational Intelligence and Pattern Analysis in Biological Informatics

An invaluable tool in Bioinformatics, this unique volume provides both theoretical and experimental results, and describes basic principles of computational intelligence and pattern analysis while deepening the reader's understanding of the ways in which these principles can be used for analyzing biological data in an efficient manner. This book synthesizes current research in the integration of computational intelligence and pattern analysis techniques, either individually or in a hybridized manner. The purpose is to analyze biological data and enable extraction of more meaningful information and insight from it. Biological data for analysis include sequence data, secondary and tertiary structure data, and microarray data. These data types are complex and advanced methods are required, including the use of domain-specific knowledge for reducing search space, dealing with uncertainty, partial truth and imprecision, efficient linear and/or sub-linear scalability, incremental approaches to knowledge discovery, and increased level and intelligence of interactivity with human experts and decision makers Chapters authored by leading researchers in CI in biology informatics. Covers highly relevant topics: rational drug design; analysis of microRNAs and their involvement in human diseases. Supplementary material included: program code and relevant data sets correspond to chapters. Note: The ebook version does not provide access to the companion files.

Doing Bayesian Data Analysis

There is an explosion of interest in Bayesian statistics, primarily because recently created computational methods have finally made Bayesian analysis tractable and accessible to a wide audience. Doing Bayesian Data Analysis, A Tutorial Introduction with R and BUGS, is for first year graduate students or advanced undergraduates and provides an accessible approach, as all mathematics is explained intuitively and with concrete examples. It assumes only algebra and ‘rusty’ calculus. Unlike other textbooks, this book begins with the basics, including essential concepts of probability and random sampling. The book gradually climbs all the way to advanced hierarchical modeling methods for realistic data. The text provides complete examples with the R programming language and BUGS software (both freeware), and begins with basic programming examples, working up gradually to complete programs for complex analyses and presentation graphics. These templates can be easily adapted for a large variety of students and their own research needs.The textbook bridges the students from their undergraduate training into modern Bayesian methods. -Accessible, including the basics of essential concepts of probability and random sampling -Examples with R programming language and BUGS software -Comprehensive coverage of all scenarios addressed by non-bayesian textbooks- t-tests, analysis of variance (ANOVA) and comparisons in ANOVA, multiple regression, and chi-square (contingency table analysis). -Coverage of experiment planning -R and BUGS computer programming code on website -Exercises have explicit purposes and guidelines for accomplishment

Data Analysis with Open Source Tools

Collecting data is relatively easy, but turning raw information into something useful requires that you know how to extract precisely what you need. With this insightful book, intermediate to experienced programmers interested in data analysis will learn techniques for working with data in a business environment. You'll learn how to look at data to discover what it contains, how to capture those ideas in conceptual models, and then feed your understanding back into the organization through business plans, metrics dashboards, and other applications. Along the way, you'll experiment with concepts through hands-on workshops at the end of each chapter. Above all, you'll learn how to think about the results you want to achieve -- rather than rely on tools to think for you. Use graphics to describe data with one, two, or dozens of variables Develop conceptual models using back-of-the-envelope calculations, as well asscaling and probability arguments Mine data with computationally intensive methods such as simulation and clustering Make your conclusions understandable through reports, dashboards, and other metrics programs Understand financial calculations, including the time-value of money Use dimensionality reduction techniques or predictive analytics to conquer challenging data analysis situations Become familiar with different open source programming environments for data analysis "Finally, a concise reference for understanding how to conquer piles of data."--Austin King, Senior Web Developer, Mozilla "An indispensable text for aspiring data scientists."--Michael E. Driscoll, CEO/Founder, Dataspora

Telemedicine Technologies: Information Technologies in Medicine and Telehealth

This book brings together a broad range of topics demonstrating how information and wireless technologies can be used in healthcare In this book, the authors focus on how medical information can be reliably transmitted through wireless communication networks. It explains how they can be optimized to carry medical information in various situations by utilizing readily available traditional wireless local area network (WLAN) and broadband wireless access (BWA) systems. In addition, the authors discuss consumer healthcare technology, which is becoming more popular as reduction in manufacturing cost of electronics products makes healthcare products more affordable to the general public. Finally, the book explores topics such as communication networks and services, patient monitoring, information processing, system deployment, data security and privacy, information technology in alternative medicine, multimedia and health informatics, and caring for the community. Key Features: Focuses on the transmission of medical information over wireless communication networks, and addresses topics such as communication networks and services, patient monitoring, information processing, system deployment, data security and privacy, and many others Provides an in-depth introduction to the various factors that need to be considered for supporting healthcare services with information technology Covers advancements in topics such as RFID in healthcare Discusses medical signal processing as well as ECG and signal processing techniques This book will be of interest to advanced students and professors in biomedical engineering, bioinformatics, and information engineering. Medical and IT professionals involved in specifying new facilities, healthcare practitioners in telemedicine, researchers in wireless communications and information technology, and network administrators will also find this book insightful.

Signals and Systems using MATLAB

This new textbook in signals and systems provides a pedagogically rich approach to what can commonly be a mathematically dry subject. With features like historical notes, highlighted common mistakes, and applications in controls, communications, and signal processing, Chaparro helps students appreciate the usefulness of the techniques described in the book. Each chapter contains a section with MatLab applications. Pedagogically rich introduction to signals and systems using historical notes, pointing out "common mistakes", and relating concepts to realistic examples throughout to motivate learning the material Introduces both continuous and discrete systems early, then studies each (separately) in more depth later Extensive set of worked examples and homework assignments, with applications to controls, communications, and signal processing throughout Provides review of all the background math necessary to study the subject MatLab applications in every chapter

Performance Dashboards: Measuring, Monitoring, and Managing Your Business, 2nd Edition

Tips, techniques, and trends on harnessing dashboard technology to optimize business performance In Performance Dashboards, Second Edition, author Wayne Eckerson explains what dashboards are, where they can be used, and why they are important to measuring and managing performance. As Director of Research for The Data Warehousing Institute, a worldwide association of business intelligence professionals, Eckerson interviewed dozens of organizations that have built various types of performance dashboards in different industries and lines of business. Their practical insights explore how you can effectively turbo-charge performance–management initiatives with dashboard technology. Includes all-new case studies, industry research, news chapters on "Architecting Performance Dashboards" and "Launching and Managing the Project" and updated information on designing KPIs, designing dashboard displays, integrating dashboards, and types of dashboards. Provides a solid foundation for understanding performance dashboards, business intelligence, and performance management Addresses the next generation of performance dashboards, such as Mashboards and Visual Discovery tools, and including new techniques for designing dashboards and developing key performance indicators Offers guidance on how to incorporate predictive analytics, what-if modeling, collaboration, and advanced visualization techniques This updated book, which is 75% rewritten, provides a foundation for understanding performance dashboards, business intelligence, and performance management to optimize performance and accelerate results.

Data Mining For Business Intelligence: Concepts, Techniques, and Applications in Microsoft Office Excel® with XLMiner®, Second Edition

Data Mining for Business Intelligence, Second Edition uses real data and actual cases to illustrate the applicability of data mining (DM) intelligence in the development of successful business models. Featuring complimentary access to XLMiner®, the Microsoft Office Excel® add-in, this book allows readers to follow along and implement algorithms at their own speed, with a minimal learning curve. In addition, students and practitioners of DM techniques are presented with hands-on, business-oriented applications. An abundant amount of exercises and examples, now doubled in number in the second edition, are provided to motivate learning and understanding. This book helps readers understand the beneficial relationship that can be established between DM and smart business practices, and is an excellent learning tool for creating valuable strategies and making wiser business decisions. New topics include detailed coverage of visualization (enhanced by Spotfire subroutines) and time series forecasting, among a host of other subject matter.

IBM Cognos Business Intelligence V10.1 Handbook

IBM® Cognos® Business Intelligence (BI) helps organizations meet This IBM Redbooks® publication addresses IBM Cognos Business Intelligence V10.1. You can use this book to: This book uses a fictional business scenario to demonstrate the power of IBM Cognos BI. The book is primarily focused on the roles of Advanced Business User, Professional Report Author, Modeler, Administrator, and IT Architect.

Statistical Programming with SAS/IML Software

SAS/IML software is a powerful tool for data analysts because it enables implementation of statistical algorithms that are not available in any SAS procedure. Rick Wicklin's Statistical Programming with SAS/IML Software is the first book to provide a comprehensive description of the software and how to use it. He presents tips and techniques that enable you to use the IML procedure and the SAS/IML Studio application efficiently. In addition to providing a comprehensive introduction to the software, the book also shows how to create and modify statistical graphs, call SAS procedures and R functions from a SAS/IML program, and implement such modern statistical techniques as simulations and bootstrap methods in the SAS/IML language. Written for data analysts working in all industries, graduate students, and consultants, Statistical Programming with SAS/IML Software includes numerous code snippets and more than 100 graphs.

This book is part of the SAS Press program.

Knight's Microsoft® Business Intelligence 24-Hour Trainer: Leveraging Microsoft SQL Server® Integration, Analysis, and Reporting Services with Excel® and SharePoint®

A book-and-video introduction to Microsoft's Business Intelligence tools If you are just starting to get a handle on Microsoft Business Intelligence (BI) tools, this book and accompanying video provides you with the just the right amount of information to perform basic business analysis and reporting. You'll explore the components and related tools that comprise the Microsoft BI toolset as well as the new BI features of Office 2010. After a basic primer on BI and data modeling, the expert team of authors provides you with step-by-step lessons in the book and videos on the accompanying Video on how to use SQL Server Integration Services, SQL Server Analysis Services, SQL Server Reporting Services, Excel BI (including PowerPivot), and SharePoint. Integrates instructional videos with each of the lessons found in the book to enhance your learning experience Explores the Microsoft Business Intelligence (BI) toolset as well as the new BI features of Office 2010 Encourages you to practice what you've learned in "Try It Out" sections Contains video demonstrations that walk you through how to tackle each lesson featured in the book With Knight's Microsoft Business Intelligence 24-Hour Trainer, veteran authors present you with an ideal introductory book-and-video package so that you can get started working with the BI toolset immediately!

Pentaho® Kettle Solutions: Building Open Source ETL Solutions with Pentaho Data Integration

A complete guide to Pentaho Kettle, the Pentaho Data lntegration toolset for ETL This practical book is a complete guide to installing, configuring, and managing Pentaho Kettle. If you're a database administrator or developer, you'll first get up to speed on Kettle basics and how to apply Kettle to create ETL solutions—before progressing to specialized concepts such as clustering, extensibility, and data vault models. Learn how to design and build every phase of an ETL solution. Shows developers and database administrators how to use the open-source Pentaho Kettle for enterprise-level ETL processes (Extracting, Transforming, and Loading data) Assumes no prior knowledge of Kettle or ETL, and brings beginners thoroughly up to speed at their own pace Explains how to get Kettle solutions up and running, then follows the 34 ETL subsystems model, as created by the Kimball Group, to explore the entire ETL lifecycle, including all aspects of data warehousing with Kettle Goes beyond routine tasks to explore how to extend Kettle and scale Kettle solutions using a distributed "cloud" Get the most out of Pentaho Kettle and your data warehousing with this detailed guide—from simple single table data migration to complex multisystem clustered data integration tasks.

News Search, Blogs and Feeds

This book is about news search and monitoring. Aimed at professionals with a strategic need of monitoring the surrounding world, users with a need to find the best news sources, monitoring services and news search strategies and techniques will benefit from reading this book. The main purpose is to present a practical handbook with an analysis of readily available tools, blending with passages of a theoretical nature. It is also useful for students at LIS programmes and related information programmes and for librarians and information professionals. The authors aim to aid the reader in reaching a greater understanding of the core in news search and monitoring. Presents effective tools to evaluate news search engines and databases Harness the power of RSS (Real Simple Syndication) feeds in online news search and monitoring Learn how to navigate and critically question the news found in the blogosphere

SAS Programming for Enterprise Guide Users, Second Edition

New and updated for SAS Enterprise Guide 4.2!

In this pragmatic, example-driven book, author Neil Constable demonstrates how you can use SAS code to enhance the capabilities of SAS Enterprise Guide. Designed to help you gain extra value from the products you already have, SAS Programming for Enterprise Guide Users contains tips and techniques that show you a variety of features that cannot be accessed directly through the task interfaces. In all cases, techniques are shown with examples that you can try and test, plus additional exercises are included to give you more practice. The end result is more efficient and resilient use of SAS Enterprise Guide in a wider variety of business areas.

Included is a discussion of the following subject areas: the Output Delivery System, advanced formatting, macro variables and macros, advanced reporting using PROC REPORT, highlighting in reports, hyperlinking between reports and graphs, data manipulation using SQL, data manipulation using the DATA step, extended graphics.

By adding small amounts of code in key areas, SAS Enterprise Guide users can get more out of the product than the tasks reveal. Users should be familiar with the SAS Enterprise Guide user interface and tasks. No programming experience is necessary.

This book is part of the SAS Press program.