talk-data.com talk-data.com

Topic

AWS Glue

etl data_catalog aws

8

tagged

Activity Trend

10 peak/qtr
2020-Q1 2026-Q1

Activities

8 activities · Newest first

Data Engineering with AWS Cookbook

Data Engineering with AWS Cookbook serves as a comprehensive practical guide for building scalable and efficient data engineering solutions using AWS. With this book, you will master implementing data lakes, orchestrating data pipelines, and creating serving layers using AWS's robust services, such as Glue, EMR, Redshift, and Athena. With hands-on exercises and practical recipes, you will enhance your AWS-based data engineering projects. What this Book will help me do Gain the skills to design centralized data lake solutions and manage them securely at scale. Develop expertise in crafting data pipelines with AWS's ETL technologies like Glue and EMR. Learn to implement and automate governance, orchestration, and monitoring for data platforms. Build high-performance data serving layers using AWS analytics tools like Redshift and QuickSight. Effectively plan and execute data migrations to AWS from on-premises infrastructure. Author(s) Trâm Ngọc Phạm, Gonzalo Herreros González, Viquar Khan, and Huda Nofal bring together years of collective experience in data engineering and AWS cloud solutions. Each author's deep knowledge and passion for cloud technology have shaped this book into a valuable resource, geared towards practical learning and real-world application. Their approach ensures readers are not just learning but building tangible, impactful solutions. Who is it for? This book is geared towards data engineers and big data professionals engaged in or transitioning to cloud-based environments, specifically on AWS. Ideal readers are those looking to optimize workflows and master AWS tools to create scalable, efficient solutions. The content assumes a basic familiarity with AWS concepts like IAM roles and a command-line interface, ensuring all examples are accessible yet meaningful for those seeking advancement in AWS data engineering.

Data Engineering with AWS - Second Edition

Learn data engineering and modern data pipeline design with AWS in this comprehensive guide! You will explore key AWS services like S3, Glue, Redshift, and QuickSight to ingest, transform, and analyze data, and you'll gain hands-on experience creating robust, scalable solutions. What this Book will help me do Understand and implement data ingestion and transformation processes using AWS tools. Optimize data for analytics with advanced AWS-powered workflows. Build end-to-end modern data pipelines leveraging cutting-edge AWS technologies. Design data governance strategies using AWS services for security and compliance. Visualize data and extract insights using Amazon QuickSight and other tools. Author(s) Gareth Eagar is a Senior Data Architect with over 25 years of experience in designing and implementing data solutions across various industries. He combines his deep technical expertise with a passion for teaching, aiming to make complex concepts approachable for learners at all levels. Who is it for? This book is intended for current or aspiring data engineers, data architects, and analysts seeking to leverage AWS for data engineering. It suits beginners with a basic understanding of data concepts who want to gain practical experience as well as intermediate professionals aiming to expand into AWS-based systems.

Serverless ETL and Analytics with AWS Glue

Discover how to harness AWS Glue for your ETL and data analysis workflows with "Serverless ETL and Analytics with AWS Glue." This comprehensive guide introduces readers to the capabilities of AWS Glue, from building data lakes to performing advanced ETL tasks, allowing you to create efficient, secure, and scalable data pipelines with serverless technology. What this Book will help me do Understand and utilize various AWS Glue features for data lake and ETL pipeline creation. Leverage AWS Glue Studio and DataBrew for intuitive data preparation workflows. Implement effective storage optimization techniques for enhanced data analytics. Apply robust data security measures, including encryption and access control, to protect data. Integrate AWS Glue with machine learning tools like SageMaker to build intelligent models. Author(s) The authors of this book include experts across the fields of data engineering and AWS technologies. With backgrounds in data analytics, software development, and cloud architecture, they bring a depth of practical experience. Their approach combines hands-on tutorials with conceptual clarity, ensuring a blend of foundational knowledge and actionable insights. Who is it for? This book is designed for ETL developers, data engineers, and data analysts who are familiar with data management concepts and want to extend their skills into serverless cloud solutions. If you're looking to master AWS Glue for building scalable and efficient ETL pipelines or are transitioning existing systems to the cloud, this book is ideal for you.

Serverless Analytics with Amazon Athena

Delve into the serverless world of Amazon Athena with the comprehensive book 'Serverless Analytics with Amazon Athena'. This guide introduces you to the power of Athena, showing you how to efficiently query data in Amazon S3 using SQL without the hassle of managing infrastructure. With clear instructions and practical examples, you'll master querying structured, unstructured, and semi-structured data seamlessly. What this Book will help me do Effectively query and analyze both structured and unstructured data stored in S3 using Amazon Athena. Integrate Athena with other AWS services to create powerful, secure, and cost-efficient data workflows. Develop ETL pipelines and machine learning workflows leveraging Athena's compatibility with AWS Glue. Monitor and troubleshoot Athena queries for consistent performance and build scalable serverless data solutions. Implement security best practices and optimize costs when managing your Athena-driven data solutions. Author(s) None Virtuoso, along with co-authors Mert Turkay Hocanin None and None Wishnick, brings a wealth of experience in cloud solutions, serverless technologies, and data engineering. They excel in demystifying complex technical topics and have a passion for empowering readers with practical skills and knowledge. Who is it for? This book is tailored for business intelligence analysts, application developers, and system administrators who want to harness Amazon Athena for seamless, cost-efficient data analytics. It suits individuals with basic SQL knowledge looking to expand their capabilities in querying and processing data. Whether you're managing growing datasets or building data-driven applications, this book provides the know-how to get it right.

A First Course in Statistics, 12th Edition

For courses in introductory statistics. A Contemporary Classic Classic, yet contemporary; theoretical, yet applied—McClave & Sincich’s A First Course in Statistics gives you the best of both worlds. This text offers a trusted, comprehensive introduction to statistics that emphasizes inference and integrates real data throughout. The authors stress the development of statistical thinking, the assessment of credibility, and value of the inferences made from data. This new edition is extensively revised with an eye on clearer, more concise language throughout the text and in the exercises. Ideal for one- or two-semester courses in introductory statistics, this text assumes a mathematical background of basic algebra. Flexibility is built in for instructors who teach a more advanced course, with optional footnotes about calculus and the underlying theory. Also available with MyStatLab MyStatLab™ is an online homework, tutorial, and assessment program designed to work with this text to engage students and improve results. Within its structured environment, students practice what they learn, test their understanding, and pursue a personalized study plan that helps them absorb course material and understand difficult concepts. For this edition, MyStatLab offers 30% new and updated exercises. Note: You are purchasing a standalone product; MyLab™ & Mastering™ does not come packaged with this content. Students, if interested in purchasing this title with MyLab & Mastering, ask your instructor for the correct package ISBN and Course ID. Instructors, contact your Pearson representative for more information. If you would like to purchase both the physical text and MyLab & Mastering, search for: 0134090438 / 9780134090436 * Statistics Plus New MyStatLab with Pearson eText -- Access Card Package Package consists of: 0134080211 / 9780134080215 * Statistics 0321847997 / 9780321847997 * My StatLab Glue-in Access Card 032184839X / 9780321848390 * MyStatLab Inside Sticker for Glue-In Packages

RabbitMQ in Action

RabbitMQ in Action is a fast-paced run through building and managing scalable applications using the RabbitMQ messaging server. It starts by explaining how message queuing works, its history, and how RabbitMQ fits in. Then it shows you real-world examples you can apply to your own scalability and interoperability challenges. About the Technology There's a virtual switchboard at the core of most large applications where messages race between servers, programs, and services. RabbitMQ is an efficient and easy-to-deploy queue that handles this message traffic effortlessly in all situations, from web startups to massive enterprise systems. About the Book RabbitMQ in Action teaches you to build and manage scalable applications in multiple languages using the RabbitMQ messaging server. It's a snap to get started. You'll learn how message queuing works and how RabbitMQ fits in. Then, you'll explore practical scalability and interoperability issues through many examples. By the end, you'll know how to make Rabbit run like a well-oiled machine in a 24 x 7 x 365 environment. What's Inside Learn fundamental messaging design patterns Use patterns for on-demand scalability Glue a PHP frontend to a backend written in anything Implement a PubSub-alerting service in 30 minutes flat Configure RabbitMQ's built-in clustering Monitor, manage, extend, and tune RabbitMQ About the Reader Written for developers familiar with Python, PHP, Java, .NET, or any other modern programming language. No RabbitMQ experience required. About the Authors Alvaro Videla is a developer and architect specializing in MQ-based applications. Jason J. W. Williams is CTO of DigiTar, a messaging service provider, where he directs design and development. Quotes In this outstanding work, two experts share their years of experience running large-scale RabbitMQ systems. - Alexis Richardson, VMware Well-written, thoughtful, and easy to follow. - Karsten Strøbæk, Microsoft Soup to nuts on RabbitMQ; a wide variety of in-depth examples. - Patrick Lemiuex, Voxel Internap This book will take you to a messaging wonderland. - David Dossot, Coauthor of Mule in Action

Oracle and Open Source

Oracle & Open Source is the first book to tie together the commercial world of Oracle and the free-wheeling world of open source software. As this book reveals, these two worlds are not as far apart as they may seem. Today, there are many excellent and freely available software tools that Oracle developers and database administrators can use, at no cost, to improve their own coding productivity and their system's performance. Moreover, many of the finest Oracle developers are now making their source code freely available so their peers can build upon this code base. Oracle Corporation is even porting its RDBMS to Linux and starting to incorporate a growing number of open source tools in the company's own software. Oracle & Open Source describes close to 100 open source tools you can use for Oracle development and database administration, from large and widely known open source systems (like Linux, Perl, Apache, TCL/Tk and Python) to more Oracle-specific tools (like Orasoft, Orac, OracleTool, and OraSnap). You'll learn how to obtain the software and how to adapt it to best advantage. The book abounds with code examples, download and installation instructions, and helpful usage hints. Not only does it tell you how to find and use existing open source code; Oracle & Open Source gives you the details and the motivation to build your own open source contributions and release them to the Oracle community. You'll learn all about tools like the Oracle Call Interface (OCI) and Perl-DBI (Database Interface), which provide the glue allowing new open source tools to link into commercial Oracle software. With Oracle & Open Source as a guide, you'll discover an enormous number of highly effective open source tools, while getting involved with the thriving community of open source development.

Essential XML: Beyond Markup

The Extensible Markup Language (XML) has been anointed as the universal duct tape for all software integration problems despite XML's relatively humble origins in the world of document management systems. presents a software engineering-focused view of XML and investigates how XML can be used as a component integration technology much like COM or CORBA. Written for software developers and technical managers, this book demonstrates how XML can be used as the glue between independently developed software components (or in the marketecture terminology Essential XML du jour, how XML can act as the backplane for B2B e-commerce applications). Authors Don Box, Aaron Skonnard, and John Lam cover the key issues, technologies, and techniques involved in using XML as the adhesive between disparate software components and environments. They explain the fundamental abstractions and concepts that permeate all XML technologies, primarily those documented in the XML Information Set (Infoset). XML-based approaches to metadata, declarative, and procedural programming through transformation and programmatic interfaces are covered. Don Box, co-author of the Simple Object Access Protocol (SOAP) specification, provides readers with insight into this emerging XML messaging technology for bridging COM, CORBA, EJB, and the Web. Readers acquire a better understanding of XML's inner workings and come to see how its platform, language, and vendor independence--along with its accessibility--make it an extraordinarily effective solution for software interoperation. 0201709147B04062001