talk-data.com talk-data.com

Topic

data-science

2252

tagged

Activity Trend

1 peak/qtr
2020-Q1 2026-Q1

Activities

2252 activities · Newest first

Fuzzy Modeling and Genetic Algorithms for Data Mining and Exploration

Fuzzy Modeling and Genetic Algorithms for Data Mining and Exploration is a handbook for analysts, engineers, and managers involved in developing data mining models in business and government. As you’ll discover, fuzzy systems are extraordinarily valuable tools for representing and manipulating all kinds of data, and genetic algorithms and evolutionary programming techniques drawn from biology provide the most effective means for designing and tuning these systems. You don’t need a background in fuzzy modeling or genetic algorithms to benefit, for this book provides it, along with detailed instruction in methods that you can immediately put to work in your own projects. The author provides many diverse examples and also an extended example in which evolutionary strategies are used to create a complex scheduling system. Written to provide analysts, engineers, and managers with the background and specific instruction needed to develop and implement more effective data mining systems Helps you to understand the trade-offs implicit in various models and model architectures Provides extensive coverage of fuzzy SQL querying, fuzzy clustering, and fuzzy rule induction Lays out a roadmap for exploring data, selecting model system measures, organizing adaptive feedback loops, selecting a model configuration, implementing a working model, and validating the final model In an extended example, applies evolutionary programming techniques to solve a complicated scheduling problem Presents examples in C, C++, Java, and easy-to-understand pseudo-code Extensive online component, including sample code and a complete data mining workbench

Analysis of Clinical Trials Using SAS

In Analysis of Clinical Trials Using SAS: A Practical Guide, Alex Dmitrienko, Geert Molenberghs, Christy Chuang-Stein, and Walter Offen bridge the gap between modern statistical methodology and real-world clinical trial applications. Step-by-step instructions illustrated with examples from actual trials and case studies serve to define a statistical method and its relevance in a clinical trials setting and to illustrate how to implement the method rapidly and efficiently using the power of SAS software. Topics reflect the International Conference on Harmonization (ICH) guidelines for the pharmaceutical industry and address important statistical problems encountered in clinical trials, including analysis of stratified data, incomplete data, multiple inferences, issues arising in safety and efficacy monitoring, and reference intervals for extreme safety and diagnostic measurements. Clinical statisticians, research scientists, and graduate students in biostatistics will greatly benefit from the decades of clinical research experience compiled in this book. Numerous ready-to-use SAS macros and example code are included.

This book is part of the SAS Press program.

Visualization Handbook

The Visualization Handbook provides an overview of the field of visualization by presenting the basic concepts, providing a snapshot of current visualization software systems, and examining research topics that are advancing the field. This text is intended for a broad audience, including not only the visualization expert seeking advanced methods to solve a particular problem, but also the novice looking for general background information on visualization topics. The largest collection of state-of-the-art visualization research yet gathered in a single volume, this book includes articles by a “who’s who? of international scientific visualization researchers covering every aspect of the discipline, including: · Virtual environments for visualization · Basic visualization algorithms · Large-scale data visualization · Scalar data isosurface methods · Visualization software and frameworks · Scalar data volume rendering · Perceptual issues in visualization · Various application topics, including information visualization. * Edited by two of the best known people in the world on the subject; chapter authors are authoritative experts in their own fields; * Covers a wide range of topics, in 47 chapters, representing the state-of-the-art of scientific visualization.

Classification, Parameter Estimation and State Estimation: An Engineering Approach Using MATLAB

Classification, Parameter Estimation and State Estimation is a practical guide for data analysts and designers of measurement systems and postgraduates students that are interested in advanced measurement systems using MATLAB. 'Prtools' is a powerful MATLAB toolbox for pattern recognition and is written and owned by one of the co-authors, B. Duin of the Delft University of Technology. After an introductory chapter, the book provides the theoretical construction for classification, estimation and state estimation. The book also deals with the skills required to bring the theoretical concepts to practical systems, and how to evaluate these systems. Together with the many examples in the chapters, the book is accompanied by a MATLAB toolbox for pattern recognition and classification. The appendix provides the necessary documentation for this toolbox as well as an overview of the most useful functions from these toolboxes. With its integrated and unified approach to classification, parameter estimation and state estimation, this book is a suitable practical supplement in existing university courses in pattern classification, optimal estimation and data analysis. Covers all contemporary main methods for classification and estimation. Integrated approach to classification, parameter estimation and state estimation Highlights the practical deployment of theoretical issues. Provides a concise and practical approach supported by MATLAB toolbox. Offers exercises at the end of each chapter and numerous worked out examples. PRtools toolbox (MATLAB) and code of worked out examples available from the internet Many examples showing implementations in MATLAB Enables students to practice their skills using a MATLAB environment

Even You Can Learn Statistics A Guide for Everyone Who Has Ever Been Afraid of Statistics

Even You Can Learn Statistics A Guide for Everyone Who Has Ever Been Afraid Of Statistics One easy step at a time, this book will teach you the key statistical techniques you'll need for finance, quality, marketing, the social sciences, or just about any other field. Each technique is introduced with a simple, jargon-free explanation, practical examples, and hands-on guidance for solving real problems with Excel or a TI-83/84 series calculator, including Plus models. Hate math? No sweat. You'll be amazed how little you need! For those who do have an interest in mathematics, optional "Equation Blackboard" sections review the equations that provide the foundations for important concepts. David M. Levine is a much-honored innovator in statistics education. He is Professor Emeritus of Statistics and Computer Information Systems at Bernard M. Baruch College (CUNY), and co-author of several best-selling books, including Statistics for Managers using Microsoft Excel, Basic Business Statistics, Quality Management, and Six Sigma for Green Belts and Champions. Instructional designer David F. Stephan pioneered the classroom use of personal computers, and is a leader in making Excel more accessible to statistics students. He has co-authored several textbooks with David M. Levine. Here's just some of what you'll learn how to do... Use statistics in your everyday work or study Perform common statistical tasks using a Texas Instruments statistical calculator or Microsoft Excel Build and interpret statistical charts and tables "Test Yourself" at the end of each chapter to review the concepts and methods that you learned in the chapter Work with mean, median, mode, standard deviation, Z scores, skewness, and other descriptive statistics Use probability and probability distributions Work with sampling distributions and confidence intervals Test hypotheses and decision-making risks with Z, t, Chi-Square, ANOVA, and other techniques Perform regression analysis and modeling The easy, practical introduction to statistics–for everyone! Thought you couldn't learn statistics? Think again. You can–and you will! Complementary Web site Downloadable practice files at http://www.ftpress.com/youcanlearnstatistics

PROC SQL: Beyond the Basics Using SAS®

Master the language of PROC SQL! Get faster and better results when you use the SQL procedure with this step-by-step, example-driven guide in your hands. You will find information about implementing both American National Standards Institute (ANSI) guidelines and non-ANSI features. A broad range of SAS users, including applications programmers, database designers and administrators, statisticians, and systems analysts, will find techniques in Kirk Lafler's PROC SQL: Beyond the Basics Using SAS to enhance their data access, manipulation, and presentation prowess. Topics examined include database design; an introduction to SQL, dictionary tables, and tables; coding PROC SQL logic as well as complex queries; working with views; troubleshooting and debugging SQL code; and tuning for performance. With the growing popularity of databases, PROC SQL is becoming the language of choice for SAS users around the world. Intermediate and advanced users will appreciate the straightforward explanations and numerous examples.

Say It With Charts Workbook

Hands-on tips for powerful presentations in this all-new companion to the bestselling Say It with Charts Through four editions, Gene Zelazny's classic how-to Say It with Charts has generated more than $1.5 million in revenues. Now, in the companion Say It with Charts Workbook, Zelazny shows you how to make even more of your visual communication skills, working "one-on-one" with you on how to masterfully use the latest techniques and tools to enliven every presentation. More than just a rote listing of techniques, Say It with Charts Workbook features performance-improving strategies and suggestions that will help keep both you--and, even more important, the audience--comfortable and at ease. Part refresher course, part workbook, part self-test, it arms you with: • Step-by-step instructions and guidelines • Performance-improving strategies and suggestions • Tactics for customizing graphics to specific audiences

Elasticity

Although there are several books in print dealing with elasticity, many focus on specialized topics such as mathematical foundations, anisotropic materials, two-dimensional problems, thermoelasticity, non-linear theory, etc. As such they are not appropriate candidates for a general textbook. This book provides a concise and organized presentation and development of general theory of elasticity. Complemented by a Solutions Manual and including MatLab codes and coding, this text is an excellent book teaching guide. - Contains exercises for student engagement as well as the integration and use of MATLAB Software - Provides development of common solution methodologies and a systematic review of analytical solutions useful in applications of engineering interest - Presents applications of contemporary interest

Preparing for DB2 Near-Realtime Business Intelligence

In this IBM Redbooks publication we discuss primary processes and various alternatives that prepare you in implementing a DB2 near-realtime business intelligence environment. We discuss architectural alternatives and include overviews of software products that you can use in an implementation. As a primary focus, we tested the capabilities for supporting continuous update of a DB2 data warehouse while running a continuous concurrent query workload against that data warehouse. We tested several implementation scenarios and the variables that impact them. The results of our testing and the issues we encountered are provided.

Algorithmic Graph Theory and Perfect Graphs, 2nd Edition

Algorithmic Graph Theory and Perfect Graphs, first published in 1980, has become the classic introduction to the field. This new Annals edition continues to convey the message that intersection graph models are a necessary and important tool for solving real-world problems. It remains a stepping stone from which the reader may embark on one of many fascinating research trails. The past twenty years have been an amazingly fruitful period of research in algorithmic graph theory and structured families of graphs. Especially important have been the theory and applications of new intersection graph models such as generalizations of permutation graphs and interval graphs. These have lead to new families of perfect graphs and many algorithmic results. These are surveyed in the new Epilogue chapter in this second edition. New edition of the "Classic" book on the topic Wonderful introduction to a rich research area Leading author in the field of algorithmic graph theory Beautifully written for the new mathematician or computer scientist Comprehensive treatment

Spidering Hacks

The Internet, with its profusion of information, has made us hungry for ever more, ever better data. Out of necessity, many of us have become pretty adept with search engine queries, but there are times when even the most powerful search engines aren't enough. If you've ever wanted your data in a different form than it's presented, or wanted to collect data from several sites and see it side-by-side without the constraints of a browser, then Spidering Hacks is for you. Spidering Hacks takes you to the next level in Internet data retrieval--beyond search engines--by showing you how to create spiders and bots to retrieve information from your favorite sites and data sources. You'll no longer feel constrained by the way host sites think you want to see their data presented--you'll learn how to scrape and repurpose raw data so you can view in a way that's meaningful to you.Written for developers, researchers, technical assistants, librarians, and power users, Spidering Hacks provides expert tips on spidering and scraping methodologies. You'll begin with a crash course in spidering concepts, tools (Perl, LWP, out-of-the-box utilities), and ethics (how to know when you've gone too far: what's acceptable and unacceptable). Next, you'll collect media files and data from databases. Then you'll learn how to interpret and understand the data, repurpose it for use in other applications, and even build authorized interfaces to integrate the data into your own content. By the time you finish Spidering Hacks, you'll be able to: Aggregate and associate data from disparate locations, then store and manipulate the data as you like Gain a competitive edge in business by knowing when competitors' products are on sale, and comparing sales ranks and product placement on e-commerce sites Integrate third-party data into your own applications or web sites Make your own site easier to scrape and more usable to others Keep up-to-date with your favorite comics strips, news stories, stock tips, and more without visiting the site every dayLike the other books in O'Reilly's popular Hacks series, Spidering Hacks brings you 100 industrial-strength tips and tools from the experts to help you master this technology. If you're interested in data retrieval of any type, this book provides a wealth of data for finding a wealth of data.

The Little SAS® Book: A Primer, Third Edition

Getting started with SAS is easy with The Little SAS Book. This friendly, easy-to-read guide gently introduces you to the most commonly used features of SAS software plus a whole lot more! The book has been revised to include concepts of the Output Delivery System; the STYLE= option in the PRINT, REPORT, and TABULATE procedures; ODS HTML, RTF, PRINTER, and OUTPUT destinations; PROC REPORT; more on PROC TABULATE; exporting data; and the colon modifier for informats. You'll find clear and concise explanations of basic SAS concepts (such as DATA and PROC steps), inputting data, modifying and combining data sets, summarizing and presenting data, basic statistical procedures, and debugging SAS programs. Each topic is presented in a self-contained, two-page layout complete with examples and graphics. This format enables new users to get up and running quickly, while the examples allow you to type in the program and see it work!

Supply Chains: A Manager's Guide

“An excellent summary of the state of supply chain management going into the twenty-first century. Explains the essential concepts clearly and offers practical, down-to-earth advice for making supply chains more efficient and adaptive. Truly a survival guide for executives as they struggle to cope with the increasing competition between supply chains.” — Christian Knoll, Vice President of Global Supply Chain Management, SAP AG “Through real-world case studies and graphic illustrations, David Taylor clearly demonstrates the bottom-line benefits of managing the supply chain effectively. Although the book is written for managers, I recommend it for everyone from the executive suite to the shipping floor because they all have to work together to master the supply chain. But beware—you can expect many passionate employees demanding improvements in your company’s supply chain after reading this book!” — David Myers, President, WinfoSoft Inc., Former Board Member of Supply Chain Council “A comprehensive, thoroughly researched, and well-designed book that gives managers the information they need in a highly readable form. I am already starting to use the techniques in this book to improve our international distribution system.” — Jim Muller, Vice President of Produce Sales, SoFresh Produce “Supply chain management is a deceptively deep subject. Simple business practices combine to form complex systems that seem to defy rational analysis: Companies that form trading partnerships continue to compete despite their best efforts to cooperate; small variations in consumer buying create devastating swings in upstream demand, and so on. In his trademark fashion, Taylor clearly reveals the hidden logic at work in your supply chain and gives you the practical tools you need to make better management decisions. A must-read for every manager who affects a supply chain, and in today's marketplace there are few managers who are exempt from this requirement.” — Adrian J. Bowles, Ph.D., President, CoSource.net “David Taylor has done it again. With his new book, David makes supply chain management easy to grasp for the working manager, just as he did with his earlier guides to business technology. If you work for a company that is part of a supply chain, you need this book.” — Dirk Riehle, Ph.D. “David Taylor has done a masterful job of defining the core issues in supply chain management without getting trapped in the quicksand of jargon. This concise book is well written, highly informative, and easy to read.” — Marcia Robinson, President, E-Business Strategies, author of Services Blueprint: Roadmap “Taylor has done a tremendous job of giving readers an intuitive grasp of a complicated subject. If you’re new to supply chains, this book will give you an invaluable map of the territory. If you're already among the initiated, it will crystallize your insights and help you make better decisions. In either case, you can only come out ahead by reading this book.” — Kevin Dick, Founder of Kevin Dick Associates, author of XML: A Manager’s Guide “My motto for compressing data is ‘squeeze it til it gags.’ In the current business climate, that’s what you have to do to costs, and Taylor shows you many ways to squeeze costs out of your supply chain. He also writes with the same economy: This book contains exactly what you need to manage your supply chain effectively. Nothing is missing, and nothing is extra.” — Charles Ashbacher, President, Charles Ashbacher Technologies Today's fiercest business battles are taking place between competitors' supply chains, with victory dependent on finding a way to deliver products to customers more quickly and efficiently than the competition. For proof, just look to Dell and Amazon.com, both of which revolutionized their industries by changing how companies produce, distribute, and sell physical goods. But they're hardly alone. By revamping their supply chains, Siemens CT improved lead time from six months to two weeks, Gillette slashed $400 million of inventory, and Chrysler saved $1.7 billion a year. It's a high-stakes game, and you don't have a lot of choice about playing: If your company touches a physical product, it's part of a supply chain--and your success ultimately hangs on the weakest link in that chain. In , best-selling author David Taylor explains how to assemble a killer supply chain using the knowledge, technology, and tools employed in supply-chain success stories. Using his signature fast-track summaries and informative graphics, Taylor offers a clear roadmap to understanding and solving the complex problems of supply-chain management. Supply Chains: A Manager's Guide Modern manufacturing has driven down the time and cost of the production process, leaving supply chains as the final frontier for cost reduction and competitive advantage. will quickly give managers the foundation they need to contribute effectively to their company's supply-chain success. Supply Chains: A Manager's Guide

Database Modeling with Microsoft® Visio for Enterprise Architects

This book is for database designers and database administrators using Visio, which is the database component of Microsoft's Visual Studio .NET for Enterprise Architects suite, also included in MSDN subscriptions. This is the only guide to this product that tells DBAs how to get their job done. Although primarily focused on tool features, the book also provides an introduction to data modeling, and includes practical advice on managing database projects. The principal author was the program manager of VEA's database modeling solutions. · Explains how to model databases with Microsoft® Visio for Enterprise Architects (VEA), focusing on tool features.· Provides a platform-independent introduction to data modeling using both Object Role Modeling (ORM) and Entity Relationship Modeling (ERM), and includes practical advice on managing database projects.· Additional ORM models, course notes, and add-ins available online.

Bioinformatics

Life science data integration and interoperability is one of the most challenging problems facing bioinformatics today. In the current age of the life sciences, investigators have to interpret many types of information from a variety of sources: lab instruments, public databases, gene expression profiles, raw sequence traces, single nucleotide polymorphisms, chemical screening data, proteomic data, putative metabolic pathway models, and many others. Unfortunately, scientists are not currently able to easily identify and access this information because of the variety of semantics, interfaces, and data formats used by the underlying data sources. Bioinformatics: Managing Scientific Data tackles this challenge head-on by discussing the current approaches and variety of systems available to help bioinformaticians with this increasingly complex issue. The heart of the book lies in the collaboration efforts of eight distinct bioinformatics teams that describe their own unique approaches to data integration and interoperability. Each system receives its own chapter where the lead contributors provide precious insight into the specific problems being addressed by the system, why the particular architecture was chosen, and details on the system's strengths and weaknesses. In closing, the editors provide important criteria for evaluating these systems that bioinformatics professionals will find valuable. * Provides a clear overview of the state-of-the-art in data integration and interoperability in genomics, highlighting a variety of systems and giving insight into the strengths and weaknesses of their different approaches. * Discusses shared vocabulary, design issues, complexity of use cases, and the difficulties of transferring existing data management approaches to bioinformatics systems, which serves to connect computer and life scientists. * Written by the primary contributors of eight reputable bioinformatics systems in academia and industry including: BioKris, TAMBIS, K2, GeneExpress, P/FDM, MBM, SDSC, SRS, and DiscoveryLink.

BLAST

Sequence similarity is a powerful tool for discovering biological function. Just as the ancient Greeks used comparative anatomy to understand the human body and linguists used the Rosetta stone to decipher Egyptian hieroglyphs, today we can use comparative sequence analysis to understand genomes. BLAST (Basic Local Alignment Search Tool), is a sophisticated software package for rapid searching of nucleotide and protein databases. It is one of the most important software packages used in sequence analysis and bioinformatics. Most users of BLAST, however, seldom move beyond the program's default parameters, and never take advantage of its full power. BLAST is the only book completely devoted to this popular suite of tools. It offers biologists, computational biology students, and bioinformatics professionals a clear understanding of BLAST as well as the science it supports. This book shows you how to move beyond the default parameters, get specific answers using BLAST, and how to interpret your results. The book also contains tutorial and reference sections covering NCBI-BLAST and WU-BLAST, background material to help you understand the statistics behind BLAST, Perl scripts to help you prepare your data and analyze your results, and a wealth of tips and tricks for configuring BLAST to meet your own research needs. Some of the topics covered include: BLAST basics and the NCBI web interface How to select appropriate search parameters BLAST programs: BLASTN, BLASTP, BLASTX, TBLASTN, TBLASTX, PHI-BLAST, and PSI BLAST Detailed BLAST references, including NCBI-BLAST and WU-BLAST Understanding biological sequences Sequence similarity, homology, scoring matrices, scores, and evolution Sequence Alignment Calculating BLAST statistics Industrial-strength BLAST, including developing applications with Perl and BLAST BLAST is the only comprehensive reference with detailed, accurate information on optimizing BLAST searches for high-throughput sequence analysis. This is a book that any biologist should own.

Practical RDF

The Resource Description Framework (RDF) is a structure for describing and interchanging metadata on the Web--anything from library catalogs and worldwide directories to bioinformatics, Mozilla internal data structures, and knowledge bases for artificial intelligence projects. RDF provides a consistent framework and syntax for describing and querying data, making it possible to share website descriptions more easily. RDF's capabilities, however, have long been shrouded by its reputation for complexity and a difficult family of specifications. Practical RDF breaks through this reputation with immediate and solvable problems to help you understand, master, and implement RDF solutions. Practical RDF explains RDF from the ground up, providing real-world examples and descriptions of how the technology is being used in applications like Mozilla, FOAF, and Chandler, as well as infrastructure you can use to build your own applications. This book cuts to the heart of the W3C's often obscure specifications, giving you tools to apply RDF successfully in your own projects.The first part of the book focuses on the RDF specifications. After an introduction to RDF, the book covers the RDF specification documents themselves, including RDF Semantics and Concepts and Abstract Model specifications, RDF constructs, and the RDF Schema. The second section focuses on programming language support, and the tools and utilities that allow developers to review, edit, parse, store, and manipulate RDF/XML. Subsequent sections focus on RDF's data roots, programming and framework support, and practical implementation and use of RDF and RDF/XML.If you want to know how to apply RDF to information processing, Practical RDF is for you. Whether your interests lie in large-scale information aggregation and analysis or in smaller-scale projects like weblog syndication, this book will provide you with a solid foundation for working with RDF.

Business Intelligence

Business Intelligence describes the basic architectural components of a business intelligence environment, ranging from traditional topics such as business process modeling, data modeling, and more modern topics such as business rule systems, data profiling, information compliance and data quality, data warehousing, and data mining. This book progresses through a logical sequence, starting with data model infrastructure, then data preparation, followed by data analysis, integration, knowledge discovery, and finally the actual use of discovered knowledge. The book contains a quick reference guide for business intelligence terminology. Business Intelligence is part of Morgan Kaufmann's Savvy Manager's Guide series. * Provides clear explanations without technical jargon, followed by in-depth descriptions. * Articulates the business value of new technology, while providing relevant introductory technical background. * Contains a handy quick-reference to technologies and terminologies. * Guides managers through developing, administering, or simply understanding business intelligence technology. * Bridges the business-technical gap. * Is Web enhanced. Companion sites to the book and series provide value-added information, links, discussions, and more.