talk-data.com talk-data.com

Topic

data-science

2091

tagged

Activity Trend

1 peak/qtr
2020-Q1 2026-Q1

Activities

Showing filtered results

Filtering by: O'Reilly Data Science Books ×
2013 Data Science Salary Survey

What tools do successful data scientists and analysts use, and how much money do they make? We surveyed hundreds of attendees at the O'Reilly Strata Conferences in Santa Clara, California and New York to understand. Findings from the survey include: Average number of tools and median income for all respondents Distribution of responses by age, location, industry, and position Detailed analysis of tools used by respondents and correlation to their salaries - including by tool clusters (Hadoop, SQL/Excel, and other) Correlation of specialized big data tools usage and salary What tools should you be learning and using? Read this valuable report to gain insight from these potentially career-changing findings.

Information Evaluation

During the reception of a piece of information, we are never passive. Depending on its origin and content, from our personal beliefs and convictions, we bestow upon this piece of information, spontaneously or after reflection, a certain amount of confidence. Too much confidence shows a degree of naivety, whereas an absolute lack of it condemns us as being paranoid. These two attitudes are symmetrically detrimental, not only to the proper perception of this information but also to its use. Beyond these two extremes, each person generally adopts an intermediate position when faced with the reception of information, depending on its provenance and credibility. We still need to understand and explain how these judgements are conceived, in what context and to what end. Spanning the approaches offered by philosophy, military intelligence, algorithmics and information science, this book presents the concepts of information and the confidence placed in it, the methods that militaries, the first to be aware of the need, have or should have adopted, tools to help them, and the prospects that they have opened up. Beyond the military context, the book reveals ways to evaluate information for the good of other fields such as economic intelligence, and, more globally, the informational monitoring by governments and businesses. Contents 1. Information: Philosophical Analysis and Strategic Applications, Mouhamadou El Hady Ba and Philippe Capet. 2. Epistemic Trust, Gloria Origgi. 3. The Fundamentals of Intelligence, Philippe Lemercier. 4. Information Evaluation in the Military Domain: Doctrines, Practices and Shortcomings, Philippe Capet and Adrien Revault d'Allonnes. 5. Multidimensional Approach to Reliability Evaluation of Information Sources, Frédéric Pichon, Christophe Labreuche, Bertrand Duqueroie and Thomas Delavallade. 6. Uncertainty of an Event and its Markers in Natural Language Processing, Mouhamadou El Hady Ba, Stéphanie Brizard, Tanneguy Dulong and Bénédicte Goujon. 7. Quantitative Information Evaluation: Modeling and Experimental Evaluation, Marie-Jeanne Lesot, Frédéric Pichon and Thomas Delavallade. 8. When Reported Information Is Second Hand, Laurence Cholvy. 9. An Architecture for the Evolution of Trust: Definition and Impact of the Necessary Dimensions of Opinion Making, Adrien Revault d'Allonnes. About the Authors Philippe Capet is a project manager and research engineer at Ektimo, working mainly on information management and control in military contexts. Thomas Delavallade is an advanced studies engineer at Thales Communications & Security, working on social media mining in the context of crisis management, cybersecurity and the fight against cybercrime.

Methods and Applications of Statistics in Clinical Trials, Volume 1: Concepts, Principles, Trials, and Designs

A complete guide to the key statistical concepts essential for the design and construction of clinical trials As the newest major resource in the field of medical research, Methods and Applications of Statistics in Clinical Trials, Volume 1: Concepts, Principles, Trials, and Designs presents a timely and authoritative reviewof the central statistical concepts used to build clinical trials that obtain the best results. The referenceunveils modern approaches vital to understanding, creating, and evaluating data obtained throughoutthe various stages of clinical trial design and analysis. Accessible and comprehensive, the first volume in a two-part set includes newly-written articles as well as established literature from the Wiley Encyclopedia of Clinical Trials. Illustrating a variety of statistical concepts and principles such as longitudinal data, missing data, covariates, biased-coin randomization, repeated measurements, and simple randomization, the book also provides in-depth coverage of the various trial designs found within phase I-IV trials. Methods and Applications of Statistics in Clinical Trials, Volume 1: Concepts, Principles, Trials, and Designs also features: Detailed chapters on the type of trial designs, such as adaptive, crossover, group-randomized, multicenter, non-inferiority, non-randomized, open-labeled, preference, prevention, and superiority trials Over 100 contributions from leading academics, researchers, and practitioners An exploration of ongoing, cutting-edge clinical trials on early cancer and heart disease, mother-to-child human immunodeficiency virus transmission trials, and the AIDS Clinical Trials Group Methods and Applications of Statistics in Clinical Trials, Volume 1: Concepts, Principles, Trials, and Designs is an excellent reference for researchers, practitioners, and students in the fields of clinicaltrials, pharmaceutics, biostatistics, medical research design, biology, biomedicine, epidemiology,and public health.

Statistical Analysis in Forensic Science: Evidential Values of Multivariate Physicochemical Data

A practical guide for determining the evidential value of physicochemical data Microtraces of various materials (e.g. glass, paint, fibres, and petroleum products) are routinely subjected to physicochemical examination by forensic experts, whose role is to evaluate such physicochemical data in the context of the prosecution and defence propositions. Such examinations return various kinds of information, including quantitative data. From the forensic point of view, the most suitable way to evaluate evidence is the likelihood ratio. This book provides a collection of recent approaches to the determination of likelihood ratios and describes suitable software, with documentation and examples of their use in practice. The statistical computing and graphics software environment R, pre-computed Bayesian networks using Hugin Researcher and a new package, calcuLatoR, for the computation of likelihood ratios are all explored. Statistical Analysis in Forensic Science will provide an invaluable practical guide for forensic experts and practitioners, forensic statisticians, analytical chemists, and chemometricians. Key features include: Description of the physicochemical analysis of forensic trace evidence. Detailed description of likelihood ratio models for determining the evidential value of multivariate physicochemical data. Detailed description of methods, such as empirical cross-entropy plots, for assessing the performance of likelihood ratio-based methods for evidence evaluation. Routines written using the open-source R software, as well as Hugin Researcher and calcuLatoR. Practical examples and recommendations for the use of all these methods in practice.

Ask, Measure, Learn

You can measure practically anything in the age of social media, but if you don’t know what you’re looking for, collecting mountains of data won’t yield a grain of insight. This non-technical guide shows you how to extract significant business value from big data with Ask-Measure-Learn, a system that helps you ask the right questions, measure the right data, and then learn from the results. Authors Lutz Finger and Soumitra Dutta originally devised this system to help governments and NGOs sift through volumes of data. With this book, these two experts provide business managers and analysts with a high-level overview of the Ask-Measure-Learn system, and demonstrate specific ways to apply social media analytics to marketing, sales, public relations, and customer management, using examples and case studies.

Commercial Data Mining

Whether you are brand new to data mining or working on your tenth predictive analytics project, Commercial Data Mining will be there for you as an accessible reference outlining the entire process and related themes. In this book, you'll learn that your organization does not need a huge volume of data or a Fortune 500 budget to generate business using existing information assets. Expert author David Nettleton guides you through the process from beginning to end and covers everything from business objectives to data sources, and selection to analysis and predictive modeling. Commercial Data Mining includes case studies and practical examples from Nettleton's more than 20 years of commercial experience. Real-world cases covering customer loyalty, cross-selling, and audience prediction in industries including insurance, banking, and media illustrate the concepts and techniques explained throughout the book. Illustrates cost-benefit evaluation of potential projects Includes vendor-agnostic advice on what to look for in off-the-shelf solutions as well as tips on building your own data mining tools Approachable reference can be read from cover to cover by readers of all experience levels Includes practical examples and case studies as well as actionable business insights from author's own experience

Explorations of Mathematical Models in Biology with MATLAB

Explore and analyze the solutions of mathematical models from diverse disciplines As biology increasingly depends on data, algorithms, and models, it has become necessary to use a computing language, such as the user-friendly MATLAB, to focus more on building and analyzing models as opposed to configuring tedious calculations. Explorations of Mathematical Models in Biology with MATLAB provides an introduction to model creation using MATLAB, followed by the translation, analysis, interpretation, and observation of the models. With an integrated and interdisciplinary approach that embeds mathematical modeling into biological applications, the book illustrates numerous applications of mathematical techniques within biology, ecology, and environmental sciences. Featuring a quantitative, computational, and mathematical approach, the book includes: Examples of real-world applications, such as population dynamics, genetics, drug administration, interacting species, and the spread of contagious diseases, to showcase the relevancy and wide applicability of abstract mathematical techniques Discussion of various mathematical concepts, such as Markov chains, matrix algebra, eigenvalues, eigenvectors, first-order linear difference equations, and nonlinear first-order difference equations Coverage of difference equations to model a wide range of real-life discrete time situations in diverse areas as well as discussions on matrices to model linear problems Solutions to selected exercises and additional MATLAB codes Explorations of Mathematical Models in Biology with MATLAB is an ideal textbook for upper-undergraduate courses in mathematical models in biology, theoretical ecology, bioeconomics, forensic science, applied mathematics, and environmental science. The book is also an excellent reference for biologists, ecologists, mathematicians, biomathematicians, and environmental and resource economists.

MATLAB: An Introduction with Applications 5th Edition

More college students use Amos Gilat's MATLAB: An Introduction with Applications than any other MATLAB textbook. This concise book is known for its just-in-time learning approach that gives students information when they need it. The new edition gradually presents the latest MATLAB functionality in detail. Equally effective as a freshmen-level text, self-study tool, or course reference, the book is generously illustrated through computer screen shots and step-by-step tutorials, with abundant and motivating applications to problems in mathematics, science, and engineering.

Getting Started with Beautiful Soup

"Getting Started with Beautiful Soup" is your practical guide to website scraping using Python. It teaches you how to use Beautiful Soup and the urllib2 module to extract data from websites efficiently and effectively. Through hands-on examples and clear explanations, you'll gain the skills to navigate, search, and modify HTML content. What this Book will help me do Navigate and scrape web pages using the Beautiful Soup Python library. Understand and implement the urllib2 module to access web content programmatically. Search and analyze HTML structures efficiently to extract the needed data. Modify and format extracted HTML and XML content effectively. Handle encoding and manage output formats for diverse scraping requirements. Author(s) Vineeth G. Nair is an experienced Python developer with a strong focus on web technologies, data extraction, and automation. His expertise in Python's Beautiful Soup library has helped countless learners and professionals tackle the challenges of web scraping. Vineeth combines a methodical approach to teaching with practical examples, making complex concepts accessible and actionable. Who is it for? This book is ideal for Python enthusiasts, data analysts, and budding developers looking to explore web scraping. Whether you're a beginner or have some programming experience, this book will guide you through the fundamental concepts of extracting web data. If you're aiming to delve into practical, real-world implementations of web scraping, this is the book for you.

QlikView Server and Publisher

This book, "QlikView Server and Publisher," serves as a comprehensive guide for implementing and managing QlikView Server and Publisher in enterprise environments. It walks you through the entire process, highlighting vital configuration techniques, and best practices necessary to maximize the potential of QlikView for your data solutions. What this Book will help me do Understand the best practices for configuring QlikView Server and Publisher in your organization's IT environment. Master the process of installing and maintaining QlikView Server to ensure optimal performance. Learning how to deploy enterprise-grade QlikView Server setups effectively. Gain expertise in setting up and leveraging QlikView Publisher for task scheduling and automation. Acquire the skills to integrate QlikView with advanced security solutions and monitor system efficiency. Author(s) Stephen Redmond is a renowned author and expert on data visualization platforms, particularly QlikView. With years of experience in implementing enterprise data solutions, Stephen brings practical insights to complex technical concepts. He focuses on guiding readers with systematic and approachable methods to master Qlik technologies. Who is it for? This book is designed for IT professionals and system administrators who are tasked with deploying or maintaining QlikView systems. Whether you're new to QlikView or have some experience, you'll find the material clear and beneficial to mastering enterprise implementations. The content suits individuals aiming to improve data-driven decision-making systems in their organization.

Thinking with Data

Many analysts are too concerned with tools and techniques for cleansing, modeling, and visualizing datasets and not concerned enough with asking the right questions. In this practical guide, data strategy consultant Max Shron shows you how to put the why before the how, through an often-overlooked set of analytical skills. Thinking with Data helps you learn techniques for turning data into knowledge you can use. You’ll learn a framework for defining your project, including the data you want to collect, and how you intend to approach, organize, and analyze the results. You’ll also learn patterns of reasoning that will help you unveil the real problem that needs to be solved. Learn a framework for scoping data projects Understand how to pin down the details of an idea, receive feedback, and begin prototyping Use the tools of arguments to ask good questions, build projects in stages, and communicate results Explore data-specific patterns of reasoning and learn how to build more useful arguments Delve into causal reasoning and learn how it permeates data work Put everything together, using extended examples to see the method of full problem thinking in action

Forecasting Offertory Revenue at St. Elizabeth Seton Catholic Church

This new business analytics case study challenges readers to forecast donations, plan budgets, and manage cash flow for a religious institution suffering rapidly falling contributions. Crystallizing realistic analytical challenges faced by non-profit and for-profit organizations of all kinds, it exposes readers to the entire decision-making process, providing opportunities to perform analyses, interpret output, and recommend the best course of action. Author: Matthew J. Drake, Duquesne University.

Forecasting Sales at Ska Brewing Company

This new business analytics case study challenges readers to project trends and plan capacity for a fast-growing craft beer operation, so it can make the best possible decisions about expensive investments in brewing capacity. Crystallizing realistic analytical challenges faced by companies in many industries and markets, it exposes readers to the entire decision-making process, providing opportunities to perform analyses, interpret output, and recommend the best course of action. Author: Eric Huggins, Fort Lewis College.

Statistics for Mining Engineering

Many areas of mining engineering gather and use statistical information, provided by observing the actual operation of equipment, their systems, the development of mining works, surface subsidence that accompanies underground mining, displacement of rocks surrounding surface pits and underground drives and longwalls, amongst others. In addition, the actual modern machines used in surface mining are equipped with diagnostic systems that automatically trace all important machine parameters and send this information to the main producer’s computer. Such data not only provide information on the technical properties of the machine but they also have a statistical character. Furthermore, all information gathered during stand and lab investigations where parts, assemblies and whole devices are tested in order to prove their usefulness, have a stochastic character. All of these materials need to be developed statistically and, more importantly, based on these results mining engineers must make decisions whether to undertake actions, connected with the further operation of the machines, the further development of the works, etc. For these reasons, knowledge of modern statistics is necessary for mining engineers; not only as to how statistical analysis of data should be conducted and statistical synthesis should be done, but also as to understanding the results obtained and how to use them to make appropriate decisions in relation to the mining operation. This book on statistical analysis and synthesis starts with a short repetition of probability theory and also includes a special section on statistical prediction. The text is illustrated with many examples taken from mining practice; moreover the tables required to conduct statistical inference are included.

Growth Curve Modeling: Theory and Applications

Features recent trends and advances in the theory and techniques used to accurately measure and model growth Growth Curve Modeling: Theory and Applications features an accessible introduction to growth curve modeling and addresses how to monitor the change in variables over time since there is no "one size fits all" approach to growth measurement. A review of the requisite mathematics for growth modeling and the statistical techniques needed for estimating growth models are provided, and an overview of popular growth curves, such as linear, logarithmic, reciprocal, logistic, Gompertz, Weibull, negative exponential, and log-logistic, among others, is included. In addition, the book discusses key application areas including economic, plant, population, forest, and firm growth and is suitable as a resource for assessing recent growth modeling trends in the medical field. SAS is utilized throughout to analyze and model growth curves, aiding readers in estimating specialized growth rates and curves. Including derivations of virtually all of the major growth curves and models, Growth Curve Modeling: Theory and Applications also features: Statistical distribution analysis as it pertains to growth modeling Trend estimations Dynamic site equations obtained from growth models Nonlinear regression Yield-density curves Nonlinear mixed effects models for repeated measurements data Growth Curve Modeling: Theory and Applications is an excellent resource for statisticians, public health analysts, biologists, botanists, economists, and demographers who require a modern review of statistical methods for modeling growth curves and analyzing longitudinal data. The book is also useful for upper-undergraduate and graduate courses on growth modeling.

MATLAB for Neuroscientists, 2nd Edition

MATLAB for Neuroscientists serves as the only complete study manual and teaching resource for MATLAB, the globally accepted standard for scientific computing, in the neurosciences and psychology. This unique introduction can be used to learn the entire empirical and experimental process (including stimulus generation, experimental control, data collection, data analysis, modeling, and more), and the 2nd Edition continues to ensure that a wide variety of computational problems can be addressed in a single programming environment. This updated edition features additional material on the creation of visual stimuli, advanced psychophysics, analysis of LFP data, choice probabilities, synchrony, and advanced spectral analysis. Users at a variety of levels—advanced undergraduates, beginning graduate students, and researchers looking to modernize their skills—will learn to design and implement their own analytical tools, and gain the fluency required to meet the computational needs of neuroscience practitioners. The first complete volume on MATLAB focusing on neuroscience and psychology applications Problem-based approach with many examples from neuroscience and cognitive psychology using real data Illustrated in full color throughout Careful tutorial approach, by authors who are award-winning educators with strong teaching experience

Bistatic SAR/GISAR/FISAR Geometry, Signal Models and Imaging Algorithms

Bistatic radar consists of a radar system which comprises a transmitter and receiver which are separated by a distance comparable to the expected target distance. This book provides a general theoretical description of such bistatic technology in the context of synthetic aperture, inverse synthetic aperture and forward scattering radars from the point of view of analytical geometrical and signal formation as well as processing theory. Signal formation and image reconstruction algorithms are developed with the application of high informative linear frequency and phase code modulating techniques, and numerical experiments that confirm theoretical models are carried out. The authors suggest the program implementation of developed algorithms. A theoretical summary of the latest results in the field of bistatic radars is provided, before applying an analytical geometrical description of scenarios of bistatic synthetic aperture, inverse synthetic aperture and forward scattering radars with cooperative and non-cooperative transmitters. Signal models with linear frequency and phase code modulation are developed, and special phase modulations with C/A (coarse acquisition) and P (precision) of GPS satellite transmitters are considered. The authors suggest Matlab implementations of all geometrical models and signal formation and processing algorithms. Contents 1. Bistatic Synthetic Aperture Radar (BSAR) Survey. 2. BSAR Geometry. 3. BSAR Waveforms and Signal Models. 4. BSAR Image Reconstruction Algorithms. 5. Analytical Geometrical Determination of BSAR Resolution. 6. BSAR Experimental Results. 7. BSAR Matlab Implementation. A general theoretical description of bistatic technology within the scope of synthetic aperture, inverse synthetic aperture and forward scattering radars from the point of view of analytical geometrical and signal formation and processing theory. Signal formation and image reconstruction algorithms are developed in this title, with application of high informative linear frequency and phase code modulating techniques. Numerical experiments that confirm theoretical models are carried out and the authors suggest program implementation for the algorithms developed.