talk-data.com talk-data.com

Topic

Unix

operating_system multi_user multitasking

6

tagged

Activity Trend

1 peak/qtr
2020-Q1 2026-Q1

Activities

Showing filtered results

Filtering by: O'Reilly Data Science Books ×
Data Science at the Command Line, 2nd Edition

This thoroughly revised guide demonstrates how the flexibility of the command line can help you become a more efficient and productive data scientist. You'll learn how to combine small yet powerful command-line tools to quickly obtain, scrub, explore, and model your data. To get you started, author Jeroen Janssens provides a Docker image packed with over 100 Unix power tools--useful whether you work with Windows, macOS, or Linux. You'll quickly discover why the command line is an agile, scalable, and extensible technology. Even if you're comfortable processing data with Python or R, you'll learn how to greatly improve your data science workflow by leveraging the command line's power. This book is ideal for data scientists, analysts, engineers, system administrators, and researchers. Obtain data from websites, APIs, databases, and spreadsheets Perform scrub operations on text, CSV, HTML, XML, and JSON files Explore data, compute descriptive statistics, and create visualizations Manage your data science workflow Create your own tools from one-liners and existing Python or R code Parallelize and distribute data-intensive pipelines Model data with dimensionality reduction, regression, and classification algorithms Leverage the command line from Python, Jupyter, R, RStudio, and Apache Spark

Gnuplot in Action, Second Edition

Gnuplot in Action, Second Edition is a major revision of this popular and authoritative guide for developers, engineers, and scientists who want to learn and use gnuplot effectively. Fully updated for gnuplot version 5, the book includes four pages of color illustrations and four bonus appendixes available in the eBook. About the Technology Gnuplot is an open-source graphics program that helps you analyze, interpret, and present numerical data. Available for Unix, Mac, and Windows, it is well-maintained, mature, and totally free. About the Book Gnuplot in Action, Second Edition is a major revision of this authoritative guide for developers, engineers, and scientists. The book starts with a tutorial introduction, followed by a systematic overview of gnuplot's core features and full coverage of gnuplot's advanced capabilities. Experienced readers will appreciate the discussion of gnuplot 5?s features, including new plot types, improved text and color handling, and support for interactive, web-based display formats. The book concludes with chapters on graphical effects and general techniques for understanding data with graphs. It includes four pages of color illustrations. 3D graphics, false-color plots, heatmaps, and multivariate visualizations are covered in chapter-length appendixes available in the eBook. What's Inside Creating different types of graphs in detail Animations, scripting, batch operations Extensive discussion of terminals Updated to cover gnuplot version 5 About the Reader No prior experience with gnuplot is required. This book concentrates on practical applications of gnuplot relevant to users of all levels. About the Author Philipp K. Janert, Ph.D, is a programmer and scientist. He is the author of several books on data analysis and applied math and has been a gnuplot power user and developer for over 20 years. Quotes The highly anticipated, updated version of my go-to-for-everything book on gnuplot. - Ryan Balfanz, Shift Medical, Inc. The essential guide for newcomers and the definitive handbook for advanced users. - Zoltán Vörös, University of Innsbruck Learn how to use gnuplot to convert meaningful data into attention-grabbing visualizations that communicate your message quickly and accurately. - David Kerns, Rincon Research Corporation An accessible guide to gnuplot and best practices of everyday data visualization. - Wesley R. Elsberry,PhD, RealPage, Inc.

Bioinformatics Data Skills

Learn the data skills necessary for turning large sequencing datasets into reproducible and robust biological findings. With this practical guide, you’ll learn how to use freely available open source tools to extract meaning from large complex biological data sets. At no other point in human history has our ability to understand life’s complexities been so dependent on our skills to work with and analyze data. This intermediate-level book teaches the general computational and data skills you need to analyze biological data. If you have experience with a scripting language like Python, you’re ready to get started. Go from handling small problems with messy scripts to tackling large problems with clever methods and tools Process bioinformatics data with powerful Unix pipelines and data tools Learn how to use exploratory data analysis techniques in the R language Use efficient methods to work with genomic range data and range operations Work with common genomics data file formats like FASTA, FASTQ, SAM, and BAM Manage your bioinformatics project with the Git version control system Tackle tedious data processing tasks with with Bash scripts and Makefiles

Gnuplot in Action

Gnuplot in Action is the first comprehensive introduction to gnuplot—from the basics to the power features and beyond. Besides providing a tutorial on gnuplot itself, it demonstrates how to apply and use gnuplot to extract intelligence from data. Particular attention is paid to tricky or poorly-explained areas. You will learn how to apply gnuplot to actual data analysis problems. This book looks at different types of graphs that can be generated with gnuplot and will discuss when and how to use them to extract actual information from data. About the Technology Statistical data is only as valuable as your ability to analyze, interpret, and present it in a meaningful way. Gnuplot is the most widely used program to plot and visualize data for Unix/Linux systems and it is also popular for Windows and the Mac. It's open-source (as in free!), actively maintained, stable, and mature. It can deal with arbitrarily large data sets and is capable of producing high-quality, publication-ready graphics. So far, the only comprehensive documentation available about gnuplot is the online reference documentation, which makes it both hard to get started and almost impossible to get a complete overview over all of its features. If you've never tried gnuplot—or have found it tough to get your arms around—read on. About the Book One of gnuplot's main advantages is that it requires no programming skills nor knowledge of advanced mathematical or statistical concepts. Gnuplot in Action assumes you have no previous knowledge of either gnuplot or statistics and data analysis. The books starts out with basic gnuplot concepts, then describes in depth how to get a graph ready for final presentation and to make it look "just right" by including arrows, labels, and other decorations. Next the book looks at advanced concepts, such as multi-dimensional graphs and false-color plots—powerful features for special purposes. The author also describes advanced applications of gnuplot, such as how to script gnuplot so that it can run unattended as a batch job, and how to call gnuplot from within a CGI script to generate graphics for dynamic websites on demand. What's Inside Creating graphs with gnuplot Data transformations and filters Preparing/polishing graphs for final presentation Publishing graphs in print or on the Web Using gnuplot's power features Gnuplot scripting and programming Types of graphs and when to use them Techniques of graphical analysis How to build, install, and develop for gnuplot Command and Option reference organized by concept About the Reader Gnuplot in Action makes gnuplot easy for anyone who needs to do data analysis, but doesn't have an education in analytical tools and methods. It's perfect for DBAs, programmers, and performance engineers; business analysts and MBAs; and Six-Sigma Black Belts and process engineers. About the Author Philipp K. Janert is Chief Consultant at Principal Value, LLC. He has been a gnuplot user for more than 15 years and regards it as one of the indispensable tools in his toolbox. He has worked for small start-ups and in large corporate environments, both in the US and overseas, including several years at Amazon.com, where he initiated and led several projects to improve Amazon's order fulfillment processes. Philipp K. Janert has written about software and software development for the O'Reilly Network, IBM developerWorks, IEEE Software, and Linux Magazine. He holds a Ph.D. in Theoretical Physics from the University of Washington. Visit his website at www.principal-value.com. Quotes Knee-deep in data? This is your guidebook to exploring it with gnuplot. - Austin King, Mozilla Sparkles with insight about visualization, image perception, and data exploration. - Richard B. Kreckel, GiNaC.de Incredibly useful for beginners - indispensable for advanced users. - Mark Pruett, Systems Architect Dominion Bridges the gap between gnupolt's reference manual and real-world problems. - Mitchell Johnson, Border Stylo A Swiss Army knife for plotting data. - Nishanth Sastry, Computer Laboratory, University of Cambridge/IBM

Developing Bioinformatics Computer Skills

Bioinformatics--the application of computational and analytical methods to biological problems--is a rapidly evolving scientific discipline. Genome sequencing projects are producing vast amounts of biological data for many different organisms, and, increasingly, storing these data in public databases. Such biological databases are growing exponentially, along with the biological literature. It's impossible for even the most zealous researcher to stay on top of necessary information in the field without the aid of computer-based tools. Bioinformatics is all about building these tools. Developing Bioinformatics Computer Skills is for scientists and students who are learning computational approaches to biology for the first time, as well as for experienced biology researchers who are just starting to use computers to handle their data. The book covers the Unix file system, building tools and databases for bioinformatics, computational approaches to biological problems, an introduction to Perl for bioinformatics, data mining, and data visualization. Written in a clear, engaging style, Developing Bioinformatics Computer Skills will help biologists develop a structured approach to biological data as well as the tools they'll need to analyze the data.

Essential Guide to Computing: The Story of Information Technology, The

The complete, easy-to-understand guide to IT—now and in the future! Computers, networks, and pervasive computing Hardware, operating systems, and software How networks work: LANs, WANs, and the Internet E-business, the Web, and security The guide for ANYONE who needs to understand the key technologies driving today's economy and high tech industries! You can't afford not to understand the information revolution that's sweeping the world-but who's got time for all the acronyms and hype most technology books give you? The Essential Guide to Computing demystifies the digital society we live in with an intelligent, thorough, and up-to-date explanation of computer, networking, and Internet technologies. It's perfect for smart professionals who want to get up to speed, but don't have computer science or engineering degrees! You'll find up-to-the-minute coverage on all of today's hottest technologies including: The evolution of computing: from the room-sized "monoliths" of the 1950s to today's global Internet Preview of the next revolution: "pervasive computing" Computer hardware: microprocessors, memory, storage, I/O, displays, and architecture Windows, Macintosh, UNIX/Linux, DOS, NetWare, Palm: what operating systems do, and how they compare Programming languages: from machine language to advanced object-oriented technologies Key software applications: databases, spreadsheets, word processing, voice recognition, and beyond Microsoft and the software industry: where they stand, where they're headed How networks work: LANs, WANs, packet switching, hardware, media, and more The Internet, e-commerce, and security Enterprise applications: data warehousing, Web-centered development, and groupware Whether you're a consumer, investor, marketer, or executive, this is your start-to-finish briefing on the information technologies that have changed the world-and the coming technologies that will transform it yet again!