talk-data.com

Topic: data-science-tasks (849 tagged)

Activities

849 activities · Newest first

Mining the Web

Mining the Web: Discovering Knowledge from Hypertext Data is the first book devoted entirely to techniques for producing knowledge from the vast body of unstructured Web data. Building on an initial survey of infrastructural issues, including Web crawling and indexing, Chakrabarti examines low-level machine learning techniques as they relate specifically to the challenges of Web mining. He then devotes the final part of the book to applications that unite infrastructure and analysis to bring machine learning to bear on systematically acquired and stored data. Here the focus is on results: the strengths and weaknesses of these applications, along with their potential as foundations for further progress. From Chakrabarti's painstaking, critical, and forward-looking work, readers will gain the theoretical and practical understanding they need to contribute to the Web mining effort.

* A comprehensive, critical exploration of statistics-based attempts to make sense of Web Mining.
* Details the special challenges associated with analyzing unstructured and semi-structured data.
* Looks at how classical Information Retrieval techniques have been modified for use with Web data.
* Focuses on today's dominant learning methods: clustering and classification, hyperlink analysis, and supervised and semi-supervised learning.
* Analyzes current applications for resource discovery and social network analysis.
* An excellent way to introduce students to especially vital applications of data mining and machine learning technology.

The Boost Graph Library: User Guide and Reference Manual

The Boost Graph Library (BGL) is the first C++ library to apply the principles of generic programming to the construction of the advanced data structures and algorithms used in graph computations. Problems in such diverse areas as Internet packet routing, molecular biology, scientific computing, and telephone network design can be solved by using graph theory. This book presents an in-depth description of the BGL and provides working examples designed to illustrate the application of BGL to these real-world problems. Written by the BGL developers, The Boost Graph Library: User Guide and Reference Manual gives you all the information you need to take advantage of this powerful new library. Part I is a complete user guide that begins by introducing graph concepts, terminology, and generic graph algorithms. This guide also takes the reader on a tour through the major features of the BGL, all motivated with example problems. Part II is a comprehensive reference manual that provides complete documentation of all BGL concepts, algorithms, and classes.

Readers will find coverage of:

* Graph terminology and concepts
* Generic programming techniques in C++
* Shortest-path algorithms for Internet routing
* Network planning problems using the minimum-spanning tree algorithms
* BGL algorithms with implicitly defined graphs
* BGL interfaces to other graph libraries
* BGL concepts and algorithms
* BGL classes: graph, auxiliary, and adaptor

Groundbreaking in its scope, this book offers the key to unlocking the power of the BGL for the C++ programmer looking to extend the reach of generic programming beyond the Standard Template Library.

Ten Minute Guide to Microsoft® Visio® 2002

Because most people don't have the luxury of sitting down uninterrupted for hours at a time to learn Visio, this 10-Minute Guide focuses on the most often used features, covering them in lessons designed to take 10 minutes or less to complete. In addition, this guide teaches the user how to use Visio without relying on technical jargon, by providing straightforward, easy-to-follow explanations and lists of numbered steps that tell the user which keys to press and which options to select.

Say It With Charts: The Executive’s Guide to Visual Communication, 4th Edition

Step-by-step guide to creating compelling, memorable presentations

A chart that once took ten hours to prepare can now be produced by anyone with ten minutes and a computer keyboard. What hasn't changed, however, are the basics behind creating a powerful visual: what to say, why to say it, and how to say it for the most impact. In Say It With Charts, Fourth Edition, the latest, cutting-edge edition of his best-selling presentation guide, Gene Zelazny reveals time-tested tips for preparing effective presentations. Then, this presentation guru shows you how to combine those tips with today's hottest technologies for sharper, stronger visuals. Look to this comprehensive presentation encyclopedia for information on:

* How to prepare different types of charts (pie, bar, column, line, or dot) and when to use each
* Lettering size, color choice, appropriate chart types, and more
* Techniques for producing dramatic eVisuals using animation, scanned images, sound, video, and links to pertinent websites

Reliability: Modeling, Prediction, and Optimization

Bringing together business and engineering to reliability analysis

With manufactured products exploding in numbers and complexity, reliability studies play an increasingly critical role throughout a product's entire life cycle, from design to post-sale support. Reliability: Modeling, Prediction, and Optimization presents a remarkably broad framework for the analysis of the technical and commercial aspects of product reliability, integrating concepts and methodologies from such diverse areas as engineering, materials science, statistics, probability, operations research, and management. Written in plain language by two highly respected experts in the field, this practical work provides engineers, operations managers, and applied statisticians with both qualitative and quantitative tools for solving a variety of complex, real-world reliability problems. A wealth of examples and case studies accompanies:

* Comprehensive coverage of assessment, prediction, and improvement at each stage of a product's life cycle
* Clear explanations of modeling and analysis for hardware ranging from a single part to whole systems
* Thorough coverage of test design and statistical analysis of reliability data
* A special chapter on software reliability
* Coverage of effective management of reliability, product support, testing, pricing, and related topics
* Lists of sources for technical information, data, and computer programs
* Hundreds of graphs, charts, and tables, as well as over 500 references

PowerPoint slides are available from the Wiley editorial department.

Professional Development with Visio® 2000

Professional Development with Visio 2000 empowers you to create your own Visio solutions quickly and easily. Drawing on client-proven methods and the success of his training seminars worldwide, Visio insider David Edson gives you an understanding of the Visio development platform and guides you through the use of Visual Basic for Applications (VBA), enabling you to create your own Visio solutions. You will benefit from David's expert knowledge of topics including understanding Visio solutions, working with SmartShapes, customizing ShapeSheets, Visio VBA automation, generating Visio drawings with ActiveX Automation, and much more.

Applied Multivariate Statistics with SAS® Software

Real-world problems and data sets are the backbone of Ravindra Khattree and Dayanand Naik's Applied Multivariate Statistics with SAS Software, Second Edition, which provides a unique approach to the topic, integrating statistical methods, data analysis, and applications. Now extensively revised, the book includes new information about mixed effects models, applications of the MIXED procedure, regression diagnostics with the corresponding IML procedure code, and covariance structures. The authors' approach to the information will aid professors, researchers, and students in a variety of disciplines and industries. Extensive SAS code and the corresponding high-resolution output accompany sample problems, and clear explanations of SAS procedures are included. Emphasis is on correct interpretation of the output to draw meaningful conclusions. Featuring both the theoretical and the practical, topics covered include multivariate analysis of experimental data and repeated measures data, graphical representation of data including biplots, and multivariate regression. In addition, a quick introduction to the IML procedure with special reference to multivariate data is available in an appendix. SAS programs and output integrated with the text make it easy to read and follow the examples.

System Identification: Theory for the User, 2nd Edition

The field’s leading text, now completely updated. Modeling dynamical systems: theory, methodology, and applications.

Lennart Ljung’s System Identification: Theory for the User is a complete, coherent description of the theory, methodology, and practice of System Identification. This completely revised Second Edition introduces subspace methods, methods that utilize frequency domain data, and general non-linear black box methods, including neural networks and neuro-fuzzy modeling. The book contains many new computer-based examples designed for Ljung’s market-leading software, System Identification Toolbox for MATLAB. Ljung combines careful mathematics, a practical understanding of real-world applications, and extensive exercises. He introduces both black-box and tailor-made models of linear as well as non-linear systems, and he describes principles, properties, and algorithms for a variety of identification techniques:

* Nonparametric time-domain and frequency-domain methods.
* Parameter estimation methods in a general prediction error setting.
* Frequency domain data and frequency domain interpretations.
* Asymptotic analysis of parameter estimates.
* Linear regressions, iterative search methods, and other ways to compute estimates.
* Recursive (adaptive) estimation techniques.

Ljung also presents detailed coverage of the key issues that can make or break system identification projects, such as defining objectives, designing experiments, controlling the bias distribution of transfer-function estimates, and carefully validating the resulting models. The first edition of System Identification has been the field’s most widely cited reference for over a decade. This new edition will be the new text of choice for anyone concerned with system identification theory and practice.

Modelling Stock Market Volatility

This essay collection focuses on the relationship between continuous time models and Autoregressive Conditionally Heteroskedastic (ARCH) models and their applications. For the first time, Modelling Stock Market Volatility provides new insights about the links between these two classes of models and new work on practical estimation methods for continuous time models. Featuring the pioneering scholarship of Daniel Nelson, the text presents research about the discrete time model, continuous time limits and optimal filtering of ARCH models, and the specification and estimation of continuous time processes. This work will lead to rapid growth in the empirical application of these models as they are increasingly subjected to routine specification testing.

Key Features

* Provides for the first time new insights on the links between continuous time and ARCH models
* Collects seminal scholarship by some of the most renowned researchers in finance and econometrics
* Captures complex arguments underlying the approximation and proper statistical modelling of continuous time volatility dynamics