data

Security on IBM z/VSE

2018-06-14 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Ingo Franzki Helmut Hellner Antoinette Kaschner, Joerg Schmidbauer, Heiko Schnell, Klaus-Dieter Wacker

IBM Cyber Security data-engineering

Abstract One of a firm’s most valuable resources is its data: client lists, accounting data, employee information, and so on. This critical data must be securely managed and controlled, and simultaneously made available to those users authorized to see it. The IBM® z/VSE® system features extensive capabilities to simultaneously share the firm’s data among multiple users and protect them. Threats to this data come from various sources. Insider threats and malicious hackers are not only difficult to detect and prevent, they might be using resources with the business being unaware. This IBM Redbooks® publication was written to assist z/VSE support and security personnel in providing the enterprise with a safe, secure and manageable environment. This book provides an overview of the security that is provided by z/VSE and the processes for the implementation and configuration of z/VSE security components, Basic Security Manager (BSM), IBM CICS® security, TCP/IP security, single sign-on using LDAP, and connector security.

Next-Generation Big Data: A Practical Guide to Apache Kudu, Impala, and Spark

2018-06-12 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Butch Quinto

Alteryx Analytics BI Big Data Cloud Computing Data Governance DataViz DWH Apache HBase HDFS Kafka MySQL +7 more

Utilize this practical and easy-to-follow guide to modernize traditional enterprise data warehouse and business intelligence environments with next-generation big data technologies. Next-Generation Big Data takes a holistic approach, covering the most important aspects of modern enterprise big data. The book covers not only the main technology stack but also the next-generation tools and applications used for big data warehousing, data warehouse optimization, real-time and batch data ingestion and processing, real-time data visualization, big data governance, data wrangling, big data cloud deployments, and distributed in-memory big data computing. Finally, the book has an extensive and detailed coverage of big data case studies from Navistar, Cerner, British Telecom, Shopzilla, Thomson Reuters, and Mastercard. What You’ll Learn Install Apache Kudu, Impala, and Spark to modernize enterprise data warehouse and business intelligence environments, complete with real-world, easy-to-follow examples, and practical advice Integrate HBase, Solr, Oracle, SQL Server, MySQL, Flume, Kafka, HDFS, and Amazon S3 with Apache Kudu, Impala, and Spark Use StreamSets, Talend, Pentaho, and CDAP for real-time and batch data ingestion and processing Utilize Trifacta, Alteryx, and Datameer for data wrangling and interactive data processing Turbocharge Spark with Alluxio, a distributed in-memory storage platform Deploy big data in the cloud using Cloudera Director Perform real-time data visualization and time series analysis using Zoomdata, Apache Kudu, Impala, and Spark Understand enterprise big data topics such as big data governance, metadata management, data lineage, impact analysis, and policy enforcement, and how to use Cloudera Navigator to perform common data governance tasks Implement big data use cases such as big data warehousing, data warehouse optimization, Internet of Things, real-time data ingestion and analytics, complex event processing, and scalable predictive modeling Study real-world big data case studies from innovative companies, including Navistar, Cerner, British Telecom, Shopzilla, Thomson Reuters, and Mastercard Who This Book Is For BI and big data warehouse professionals interested in gaining practical and real-world insight into next-generation big data processing and analytics using Apache Kudu, Impala, and Spark; and those who want to learn more about other advanced enterprise topics

BizTalk

2018-06-11 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Suren Machiraju , Suraj Gaurav

Azure Microsoft SQL data-engineering enterprise-service-bus microsoft-biztalk-server streaming-messaging

Why do businesses continue to use Microsoft’s BizTalk Server as the backbone to integrate line-of-business applications with their trading partners and how do recent changes make it even more effective? With the advent of Azure, we have a unique opportunity to enhance BizTalk functionality including reducing the cost of operations and maintenance. This book offers three solutions for the reader on ways to leverage BizTalk to get more from existing deployments or find ways to modernize the deployment via Azure. Microsoft partners are playing a significant role in enhancing the capabilities of BizTalk and this book includes sections that provide an in-depth review of BizTalk 360 © and the WPC HIPAA DB Toolkit ©. Over the recent past, Web 3.0 has also introduced many new concepts and open source technologies and this book covers ways to leverage these to enhance your BizTalk deployment. The authors start with a survey of the existing BizTalk Server – its history, patterns, and state of affairs –and go on to provide an in-depth elaboration of three messaging patterns that customers use for BizTalk; the advantages of updating to SQL Server 2016; a review of partner solutions that enhance BizTalk; and BizTalk with Web 3.0 for custom solutions. The book concludes with a comparison of the three viable BizTalk Azure application solutions that will enable you to make the best choice for your business.

Implementing IBM FlashSystem 900 Model AE3

2018-06-11 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Jim Cioffi Detlef Helmbrecht Jon Herd, Jeffrey Irving, Christian Karpp, Volker Kiemes, Carsten Larsen, Adrian Orban

Analytics Cloud Computing IBM data-engineering

Abstract Today’s global organizations depend on being able to unlock business insights from massive volumes of data. Now, with IBM® FlashSystem 900 Model AE3, powered by IBM FlashCore® technology, they can make faster decisions based on real-time insights and unleash the power of the most demanding applications, including online transaction processing (OLTP) and analytics databases, virtual desktop infrastructures (VDIs), technical computing applications, and cloud environments. This IBM Redbooks® publication introduces clients to the IBM FlashSystem® 900 Model AE3. It provides in-depth knowledge of the product architecture, software and hardware, implementation, and hints and tips. Also illustrated are use cases that show real-world solutions for tiering, flash-only, and preferred-read, and also examples of the benefits gained by integrating the FlashSystem storage into business environments. This book is intended for pre-sales and post-sales technical support professionals and storage administrators, and for anyone who wants to understand how to implement this new and exciting technology.

Mastering The Faster Web with PHP, MySQL, and JavaScript

2018-06-11 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Andrew Caya

JavaScript MySQL SQL data-engineering relational-databases

Explore cutting-edge web optimization techniques in 'Mastering The Faster Web with PHP, MySQL, and JavaScript'. This comprehensive guide equips developers with the tools and knowledge to create lightning-fast web applications using modern technologies, including PHP 7, asynchronous programming, advanced SQL, and efficient JavaScript. What this Book will help me do Efficiently use profiling and benchmarking tools to identify performance bottlenecks. Optimize PHP 7 applications through efficient data structures and logical improvements. Enhance database performance by identifying and solving inefficient SQL queries. Incorporate modern asynchronous programming and functional programming techniques into your workflow. Integrate seamless UI designs that prioritize application responsiveness and user experience. Author(s) None Caya is a seasoned web developer with extensive experience in PHP, MySQL, and JavaScript. Through their career, they have delved deep into profiling, optimization techniques, and modern web technologies to deliver high-performance web solutions. This book reflects their commitment to providing actionable insights and practical advice to fellow developers. Who is it for? Ideal readers of this book are PHP developers with foundational knowledge in programming and web technologies who aspire to build and optimize modern web applications. Experience in JavaScript is not required, as the book covers essential aspects needed for performance enhancements. If you're aiming to hone your skills in creating faster web solutions, this book suits your goals perfectly.

Microsoft SQL Server 2017 on Linux

2018-06-08 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Benjamin Nevarez

Docker Linux Microsoft Cyber Security SQL SQL Server data-engineering microsoft-sql-server relational-databases

Essential Microsoft® SQL Server® 2017 installation, configuration, and management techniques for Linux Foreword by Kalen Delaney, Microsoft SQL Server MVP This comprehensive guide shows, step-by-step, how to set up, configure, and administer SQL Server 2017 on Linux for high performance and high availability. Written by a SQL Server expert and respected author, Microsoft SQL Server 2017 on Linux teaches valuable Linux skills to Windows-based SQL Server professionals. You will get clear coverage of both Linux and SQL Server and complete explanations of the latest features, tools, and techniques. The book offers clear instruction on adaptive query processing, automatic tuning, disaster recovery, security, and much more. •Understand how SQL Server 2017 on Linux works •Install and configure SQL Server on Linux •Run SQL Server on Docker containers •Learn Linux Administration •Troubleshoot and tune query performance in SQL Server •Learn what is new in SQL Server 2017 •Work with adaptive query processing and automatic tuning techniques •Implement high availability and disaster recovery for SQL Server on Linux •Learn the security features available in SQL Server

MySQL and JSON: A Practical Programming Guide

2018-06-08 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by David Stokes

JavaScript JSON MySQL Oracle data-engineering storage-formats

Practical instruction on using JavaScript Object Notation (JSON) with MySQL This hands-on guide teaches, step by step, how to use JavaScript Object Notation (JSON) with MySQL. Written by a MySQL Community Manager for Oracle , MySQL and JSON: A Practical Programming Guide shows how to quickly get started using JSON with MySQL and clearly explains the latest tools and functions. All content is based on the author’s years of interaction with MySQL professionals. Throughout, real-world examples and sample code guide you through the syntax and application of each method. You will get in-depth coverage of programming with the MySQL Document Store. •See how JavaScript Object Notation (JSON) works with MySQL •Use JSON as string data and JSON as a data type •Find the path, load data, and handle searches with REGEX •Work with JSON and non-JSON output •Build virtual generated columns and stored generated columns •Generate complex geometries using GeoJSON •Convert and manage data with JSON functions •Access JSON data, collections, and tables through MySQL Document Store

IBM z14 Model ZR1 Technical Guide

2018-06-06 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Hervey Kamga Octavian Lascu Frank Packheiser, Martijn Raave, John Troy, Bill White

Analytics Cloud Computing IBM Cyber Security data-engineering

Abstract This IBM® Redbooks® publication describes the new member of the IBM Z® family, IBM z14™ Model ZR1 (Machine Type 3907). It includes information about the Z environment and how it helps integrate data and transactions more securely, and can infuse insight for faster and more accurate business decisions. The z14 ZR1 is a state-of-the-art data and transaction system that delivers advanced capabilities, which are vital to any digital transformation. The z14 ZR1 is designed for enhanced modularity, in an industry standard footprint. A data-centric infrastructure must always be available with a 99.999% or better availability, have flawless data integrity, and be secured from misuse. It also must be an integrated infrastructure that can support new applications. Finally, it must have integrated capabilities that can provide new mobile capabilities with real-time analytics that are delivered by a secure cloud infrastructure. IBM z14 ZR1 servers are designed with improved scalability, performance, security, resiliency, availability, and virtualization. The superscalar design allows z14 ZR1 servers to deliver a record level of capacity over the previous IBM Z platforms. In its maximum configuration, z14 ZR1 is powered by up to 30 client characterizable microprocessors (cores) running at 4.5 GHz. This configuration can run more than 29,000 million instructions per second and up to 8 TB of client memory. The IBM z14 Model ZR1 is estimated to provide up to 54% more total system capacity than the IBM z13s® Model N20. This Redbooks publication provides information about IBM z14 ZR1 and its functions, features, and associated software support. More information is offered in areas that are relevant to technical planning. It is intended for systems engineers, consultants, planners, and anyone who wants to understand the IBM Z servers functions and plan for their usage. It is intended as an introduction to mainframes. Readers are expected to be generally familiar with IBM Z technology and terminology.

Data Analytics with Spark Using Python, First edition

2018-06-04 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Jeffrey Aven

AI/ML Analytics Cloud Computing Data Analytics Data Science Hadoop NoSQL Python Spark Data Streaming apache-spark data-engineering

Spark for Data Professionals introduces and solidifies the concepts behind Spark 2.x, teaching working developers, architects, and data professionals exactly how to build practical Spark solutions. Jeffrey Aven covers all aspects of Spark development, including basic programming to SparkSQL, SparkR, Spark Streaming, Messaging, NoSQL and Hadoop integration. Each chapter presents practical exercises deploying Spark to your local or cloud environment, plus programming exercises for building real applications. Unlike other Spark guides, Spark for Data Professionals explains crucial concepts step-by-step, assuming no extensive background as an open source developer. It provides a complete foundation for quickly progressing to more advanced data science and machine learning topics. This guide will help you: Understand Spark basics that will make you a better programmer and cluster “citizen” Master Spark programming techniques that maximize your productivity Choose the right approach for each problem Make the most of built-in platform constructs, including broadcast variables, accumulators, effective partitioning, caching, and checkpointing Leverage powerful tools for managing streaming, structured, semi-structured, and unstructured data

Decarbonizing Logistics

2018-06-03 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Prof. Alan McKinnon

data-engineering log-data

Learn how to cut logistics-related carbon emissions with this essential guide based on cutting edge research

Big Data Analytics with Hadoop 3

2018-05-31 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Sridhar Alla

Analytics Flink AWS Big Data Cloud Computing Data Analytics Hadoop HDFS Python Spark data-engineering

Big Data Analytics with Hadoop 3 is your comprehensive guide to understanding and leveraging the power of Apache Hadoop for large-scale data processing and analytics. Through practical examples, it introduces the tools and techniques necessary to integrate Hadoop with other popular frameworks, enabling efficient data handling, processing, and visualization. What this Book will help me do Understand the foundational components and features of Apache Hadoop 3 such as HDFS, YARN, and MapReduce. Gain the ability to integrate Hadoop with programming languages like Python and R for data analysis. Learn the skills to utilize tools such as Apache Spark and Apache Flink for real-time data analytics within the Hadoop ecosystem. Develop expertise in setting up a Hadoop cluster and performing analytics in cloud environments such as AWS. Master the process of building practical big data analytics pipelines for end-to-end data processing. Author(s) Sridhar Alla is a seasoned big data professional with extensive industry experience in building and deploying scalable big data analytics solutions. Known for his expertise in Hadoop and related ecosystems, Sridhar combines technical depth with clear communication in his writing, providing practical insights and hands-on knowledge. Who is it for? This book is tailored for data professionals, software engineers, and data scientists looking to expand their expertise in big data analytics using Hadoop 3. Whether you're an experienced developer or new to the big data ecosystem, this book provides the step-by-step guidance and practical examples needed to advance your skills and achieve your analytical goals.

Enabling Hybrid Cloud Storage for IBM Spectrum Scale Using Transparent Cloud Tiering

2018-05-31 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Rob Basham , Amey Gokhale , Jinesh Shah , Anbazhagan Mani , Kedar Karmarkar , Nikhil Khandelwal , Larry Coyne , Sandeep R Patil , Donald Mathisen , Arend Dittmer

Cloud Computing Cloud Storage IBM S3 cloud-storage data-engineering storage-repositories

This IBM® Redbooks® publication provides information to help you with the sizing, configuration, and monitoring of hybrid cloud solutions using the transparent cloud tiering (TCT) functionality of IBM Spectrum™ Scale. IBM Spectrum Scale™ is a scalable data, file, and object management solution that provides a global namespace for large data sets and several enterprise features. The IBM Spectrum Scale feature called transparent cloud tiering allows cloud object storage providers, such as IBM Cloud™ Object Storage, IBM Cloud, and Amazon S3, to be used as a storage tier for IBM Spectrum Scale. Transparent cloud tiering can help cut storage capital and operating costs by moving data that does not require local performance to an on-premise or off-premise cloud object storage provider. Transparent cloud tiering reduces the complexity of cloud object storage by making data transfers transparent to the user or application. This capability can help you adapt to a hybrid cloud deployment model where active data remains directly accessible to your applications and inactive data is placed in the correct cloud (private or public) automatically through IBM Spectrum Scale policies. This publication is intended for IT architects, IT administrators, storage administrators, and those wanting to learn more about sizing, configuration, and monitoring of hybrid cloud solutions using IBM Spectrum Scale and transparent cloud tiering.

Hands-On Data Warehousing with Azure Data Factory

2018-05-31 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Christian Cote , Giuseppe Ciaburro , Michelle Gutzait

AI/ML Analytics Azure ADF BI Cloud Computing Data Engineering Data Lake Databricks DWH ETL/ELT Power BI +6 more

Dive into the world of ETL (Extract, Transform, Load) with 'Hands-On Data Warehousing with Azure Data Factory'. This book guides readers through the essential techniques for working with Azure Data Factory and SQL Server Integration Services to design, implement, and optimize ETL solutions for both on-premises and cloud data environments. What this Book will help me do Understand and utilize Azure Data Factory and SQL Server Integration Services to build ETL solutions. Design scalable and high-performance ETL architectures tailored to modern data problems. Integrate various Azure services, such as Azure Data Lake Analytics, Machine Learning, and Databricks Spark, into your workflows. Troubleshoot and optimize ETL pipelines and address common challenges in data processing. Create insightful Power BI dashboards to visualize and interact with data from your ETL workflows. Author(s) Authors None Cote, Michelle Gutzait, and Giuseppe Ciaburro bring a wealth of experience in data engineering and cloud technologies to this practical guide. Combining expertise in Azure ecosystem and hands-on Data Warehousing, they deliver actionable insights for working professionals. Who is it for? This book is crafted for software professionals working in data engineering, especially those specializing in ETL processes. Readers with a foundational knowledge of SQL Server and cloud infrastructures will benefit most. If you aspire to implement state-of-the-art ETL pipelines or enhance existing workflows with ADF and SSIS, this book is an ideal resource.

Learning PHP, MySQL & JavaScript, 5th Edition

2018-05-29 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Robin Nixon

HTML JavaScript MySQL Cyber Security data-engineering relational-databases

Build interactive, data-driven websites with the potent combination of open source technologies and web standards, even if you have only basic HTML knowledge. In this update to this popular hands-on guide, you’ll tackle dynamic web programming with the latest versions of today’s core technologies: PHP, MySQL, JavaScript, CSS, HTML5, and key jQuery libraries. Web designers will learn how to use these technologies together and pick up valuable web programming practices along the way—including how to optimize websites for mobile devices. At the end of the book, you’ll put everything together to build a fully functional social networking site suitable for both desktop and mobile browsers. Explore MySQL, from database structure to complex queries Use the MySQLi extension, PHP’s improved MySQL interface Create dynamic PHP web pages that tailor themselves to the user Manage cookies and sessions and maintain a high level of security Enhance the JavaScript language with jQuery and jQuery mobile libraries Use Ajax calls for background browser-server communication Style your web pages by acquiring CSS2 and CSS3 skills Implement HTML5 features, including geolocation, audio, video, and the canvas element Reformat your websites into mobile web apps

IBM Storage Networking SAN24B-6 Switch

2018-05-23 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Ernest A. Keenan

IBM data-engineering

This IBM® Redbooks® product guide describes the IBM Storage Networking SAN24B-6 switch. Explosive data growth, coupled with user expectations of unlimited access from anywhere, at any time, is pushing storage environments to the limit. To meet these dynamic business demands, the network must evolve to improve speed, increase efficiency, and reduce costs. Legacy infrastructures were not designed to support the performance requirements of flash-based storage technology. A new approach to storage networking is required to unlock the full capabilities of all-flash arrays. By treating the network as a strategic part of a storage environment, organizations can maximize their productivity and efficiency, even as they rapidly grow their environments. The IBM Storage Networking SAN24B-6 switch provides exceptional value in an entry-level switch, combining high-performance capabilities of 4, 8, 16, and 32 Gbps, point-and-click simplicity, and enterprise-class functionality. The port speed capability is dependent on the transceiver installed. SAN24B-6 provides small to midsized data centers with low-cost access to industry-leading Gen 5 and Gen 6 Fibre Channel technology and the ability to start small and grow on demand from 8 to 24 ports to support an evolving storage environment. In addition, SAN24B-6 is easy to use and install, with a point-and-click user interface that simplifies deployment and saves time.

Storwize HyperSwap with IBM i

2018-05-23 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Jon Tate , Falk Schneider , Jana Jamsek

IBM data-engineering

IBM® Storwize® HyperSwap® is a response to increasing demand for continuous application availability, minimizing downtime in the event of an outage, and non disruptive migrations. IT centers with IBM i can take full advantage of the HyperSwap solution. In this IBM Redpaper™ publication, we provide instructions to implement Storwize HyperSwap with IBM i. We also describe some business continuity scenarios in this area, including solutions with HyperSwap and IBM i Live Partition Mobility, and a solution with HyperSwap and IBM PowerHA® for IBM i.

PostgreSQL 10 Administration Cookbook - Fourth Edition

2018-05-18 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Simon Riggs , Gianni Ciolli

data-engineering postgresql relational-databases

This book offers an extensive collection of practical recipes for administering PostgreSQL 10, covering everything from configuring servers to optimizing performance. By working through these structured solutions, you will develop the skills necessary to manage PostgreSQL databases effectively, making your systems reliable and responsive. What this Book will help me do Implement and leverage the latest PostgreSQL 10 features for better databases. Master techniques for performance tuning and optimization in PostgreSQL. Develop strategies for comprehensive backup and recovery processes. Learn best practices for ensuring replication and high availability. Understand how to diagnose and resolve common PostgreSQL challenges effectively. Author(s) The authors of this book are experienced database professionals with deep knowledge of PostgreSQL. They bring their practical insights and expertise to help administrators and developers achieve the most out of PostgreSQL. They are dedicated to making complex topics approachable and relevant. Who is it for? This book is for current or aspiring database administrators and developers who work with PostgreSQL. It suits those who are familiar with databases and want to gain practical skills in PostgreSQL administration. It is ideal for individuals aiming to improve performance and reliability of their PostgreSQL systems.

IBM Real-time Compression in IBM SAN Volume Controller and IBM Storwize V7000

2018-05-16 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Jon Tate , Christian Burns , Jorge Quintal , Bosmat Tuv-El

IBM data-engineering ibm-system-storage ibm-system-storage-san-volume-controller

IBM® Real-time Compression™ software that is embedded in IBM SAN Volume Controller (SVC) and IBM Storwize® V7000 solution addresses all the requirements of primary storage data reduction, including performance, by using a purpose-built technology called . This IBM Redpaper™ publication addresses the key requirements for primary storage data reduction and gives real world examples of savings that can be made by using compression. SVC and Storwize V7000 is designed to improve storage efficiency by compressing data by as much as 80% through supported real-time compression for block storage. This process enables up to five times as much data to be stored in the same physical disk space. Unlike other approaches to compression, IBM Real-time Compression is used with active primary data, such as production databases and email systems. This configuration dramatically expands the range of candidate data that can benefit from compression. As its name implies, IBM Real-time Compression operates as data is written to disk, avoiding the need to store data that is awaiting compression.

Designing Event-Driven Systems

2018-05-15 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Ben Stopford

API DevOps Kafka Data Streaming data-engineering streaming-messaging

Many forces affect software today: larger datasets, geographical disparities, complex company structures, and the growing need to be fast and nimble in the face of change. Proven approaches such as service-oriented and event-driven architectures are joined by newer techniques such as microservices, reactive architectures, DevOps, and stream processing. Many of these patterns are successful by themselves, but as this practical ebook demonstrates, they provide a more holistic and compelling approach when applied together. Author Ben Stopford explains how service-based architectures and stream processing tools such as Apache Kafka can help you build business-critical systems. You’ll learn how to apply patterns including Event Sourcing and CQRS, and how to build multi-team systems with microservices and SOA using patterns such as "inside out databases" and "event streams as a source of truth." These approaches provide a unique foundation for how these large, autonomous service ecosystems can communicate and share data. Learn why streaming beats request-response based architectures in complex, contemporary use cases Understand why replayable logs such as Kafka provide a backbone for both service communication and shared datasets Explore how event collaboration and event sourcing patterns increase safety and recoverability with functional, event-driven approaches Build service ecosystems that blend event-driven and request-driven interfaces using a replayable log and Kafka’s Streams API Scale beyond individual teams into larger, department- and company-sized architectures, using event streams as a source of truth

Data Science Fundamentals for Python and MongoDB

2018-05-10 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by David Paper

AI/ML Data Science JSON MongoDB Monte Carlo Python data-engineering nosql-databases

Build the foundational data science skills necessary to work with and better understand complex data science algorithms. This example-driven book provides complete Python coding examples to complement and clarify data science concepts, and enrich the learning experience. Coding examples include visualizations whenever appropriate. The book is a necessary precursor to applying and implementing machine learning algorithms. The book is self-contained. All of the math, statistics, stochastic, and programming skills required to master the content are covered. In-depth knowledge of object-oriented programming isn’t required because complete examples are provided and explained. Data Science Fundamentals with Python and MongoDB is an excellent starting point for those interested in pursuing a career in data science. Like any science, the fundamentals of data science are a prerequisite to competency. Without proficiency in mathematics, statistics, data manipulation, and coding, the path to success is “rocky” at best. The coding examples in this book are concise, accurate, and complete, and perfectly complement the data science concepts introduced. What You'll Learn Prepare for a career in data science Work with complex data structures in Python Simulate with Monte Carlo and Stochastic algorithms Apply linear algebra using vectors and matrices Utilize complex algorithms such as gradient descent and principal component analysis Wrangle, cleanse, visualize, and problem solve with data Use MongoDB and JSON to work with data Who This Book Is For The novice yearning to break into the data science world, and the enthusiast looking to enrich, deepen, and develop data science skills through mastering the underlying fundamentalsthat are sometimes skipped over in the rush to be productive. Some knowledge of object-oriented programming will make learning easier.

talk-data.com

Activity Trend

Top Events

Top Speakers

Security on IBM z/VSE

Next-Generation Big Data: A Practical Guide to Apache Kudu, Impala, and Spark

BizTalk

Implementing IBM FlashSystem 900 Model AE3

Mastering The Faster Web with PHP, MySQL, and JavaScript

Microsoft SQL Server 2017 on Linux

MySQL and JSON: A Practical Programming Guide

IBM z14 Model ZR1 Technical Guide

Data Analytics with Spark Using Python, First edition

Decarbonizing Logistics

Big Data Analytics with Hadoop 3

Enabling Hybrid Cloud Storage for IBM Spectrum Scale Using Transparent Cloud Tiering

Hands-On Data Warehousing with Azure Data Factory

Learning PHP, MySQL & JavaScript, 5th Edition

IBM Storage Networking SAN24B-6 Switch

Storwize HyperSwap with IBM i

PostgreSQL 10 Administration Cookbook - Fourth Edition

IBM Real-time Compression in IBM SAN Volume Controller and IBM Storwize V7000

Designing Event-Driven Systems

Data Science Fundamentals for Python and MongoDB