talk-data.com talk-data.com

Topic

data

3406

tagged

Activity Trend

3 peak/qtr
2020-Q1 2026-Q1

Activities

Showing filtered results

Filtering by: O'Reilly Data Engineering Books ×
Fast Data: Smart and at Scale

The need for fast data applications is growing rapidly, driven by the IoT, the surge in machine-to-machine (M2M) data, global mobile device proliferation, and the monetization of SaaS platforms. So how do you combine real-time, streaming analytics with real-time decisions in an architecture that’s reliable, scalable, and simple? In this O’Reilly report, Ryan Betts and John Hugg from VoltDB examine ways to develop apps for fast data, using pre-defined patterns. These patterns are general enough to suit both the do-it-yourself, hybrid batch/streaming approach, as well as the simpler, proven in-memory approach available with certain fast database offerings. Their goal is to create a collection of fast data app development recipes. We welcome your contributions, which will be tested and included in future editions of this report.

Hadoop with Python

Hadoop is mostly written in Java, but that doesn't exclude the use of other programming languages with this distributed storage and processing framework, particularly Python. With this concise book, you’ll learn how to use Python with the Hadoop Distributed File System (HDFS), MapReduce, the Apache Pig platform and Pig Latin script, and the Apache Spark cluster-computing framework. Authors Zachary Radtka and Donald Miner from the data science firm Miner & Kasch take you through the basic concepts behind Hadoop, MapReduce, Pig, and Spark. Then, through multiple examples and use cases, you'll learn how to work with these technologies by applying various Python tools. Use the Python library Snakebite to access HDFS programmatically from within Python applications Write MapReduce jobs in Python with mrjob, the Python MapReduce library Extend Pig Latin with user-defined functions (UDFs) in Python Use the Spark Python API (PySpark) to write Spark programs with Python Learn how to use the Luigi Python workflow scheduler to manage MapReduce jobs and Pig scripts Zachary Radtka, a platform engineer at Miner & Kasch, has extensive experience creating custom analytics that run on petabyte-scale data sets.

Implementing Mobile Document Capture with IBM Datacap Software

Organizations face many challenges in managing ever-increasing documents that they need to conduct their businesses. IBM® content management and imaging solutions can capture, store, manage, integrate, and deliver various forms of content throughout an enterprise. These tools can help reduce costs associated with content management and help organizations deliver improved customer service. The advanced document capture capabilities are provided through IBM Datacap software. This IBM Redbooks® publication focuses on Datacap components, system architecture, functions, and capabilities. It explains how Datacap works, how to design a document image capture solution, and how to implement the solution using Datacap Developer Tools, such as Datacap FastDoc (Admin). FastDoc is the development tool that designers use to create rules and rule sets, configure a document hierarchy and task profiles, and set up a verification panel for image verification. A loan application example explains the advanced technologies of IBM Datacap Version 9. This scenario shows how to develop a versatile capture solution that is able to handle both structured and unstructured documents. Information about high availability, scalability, performance, backup and recovery options, preferable practices, and suggestions for designing and implementing an imaging solution is also included. This book is intended for IT architects and professionals who are responsible for creating, improving, designing, and implementing document imaging solutions for their organizations.

Introducing SQL Server

Introducing SQL Server is a fast and easy introduction to SQL Server and the world of relational databases. You’ll learn how databases work and how to use the T-SQL language by practicing on one of the most widely-used and powerful database engines in the corporate world: Microsoft SQL Server. Do you quake at the sight of a SELECT statement? Start to shiver when people start talking about tables and rows? Fear not, Introducing SQL Server is here to rescue you. The book focuses on the knowledge and skills needed to begin your journey toward becoming a solid and competent SQL Server professional and database programmer. You’ll learn the core concepts of SQL Server, from installing the software to executing and profiling queries. Introducing SQL Server is aimed at SQL Server newcomers as well as at those wanting to improve their database skills. You’ll put a comprehensive database together as you work through the book. You will create tables and learn to use constraints; create reusable functions and stored procedures; and even learn how indexes work and what they bring in terms of increased performance. Introducing SQL Server shows you that databases don’t need to be difficult. Teaches you how to build a SQL Server database from scratch Takes a tutorial-based approach, with each chapter building on the last Covers what you need to know for common SQL Server development tasks

Learning Android Google Maps

Learning Android Google Maps is the ultimate guide to integrating Google Maps into your Android applications. This book takes you through the process of setting up, customizing, and leveraging this powerful feature. By the end, you'll be adept at creating engaging map functionalities applicable for any Android project. What this Book will help me do Understand how to set up the Android development environment and obtain the Google API key to start using Maps. Gain the skills to add features to Google Maps, such as markers, overlays, and custom information windows. Learn how to work with various types of maps, enabling specific applications. Master the ability to connect your map with real-time GPS data, offering user location-based services. Discover how to implement Google Street View and other interactive geographic features into your apps. Author(s) This book is meticulously compiled by developers with extensive experience in building Android applications and implementing Google Maps. Their combined years of hands-on development ensure the instructions are clear, comprehensive, and practical. Their passion for teaching shines as they break down complex topics into easy-to-understand explanations. Who is it for? This book is ideal for Android developers looking to integrate map functionalities into their apps. Beginners can follow along due to its detailed, step-by-step approach, while intermediate developers will appreciate the customization techniques and advanced features covered. If you aim to master Google Maps API in Android development, this book is for you.

Oracle API Management 12c Implementation

Unlock the full potential of Oracle's API Management Solution 12c with this comprehensive guide. Through a practical case study, you'll master how to implement API strategies, develop and manage APIs, and leverage Oracle's tools for API governance and community management. This book equips you with the knowledge needed to align API functionality with business goals. What this Book will help me do Master the principles of API management and the differences between API Management and SOA Governance. Learn how to define and implement an API management strategy aligned with business drivers. Understand the intricacies of Oracle's API Management Solution components: OAC, OAPIM, and OAG. Explore hands-on use cases for applying API management in organizational scenarios. Acquire skills in deploying Oracle API Management products with tips for real-world implementation. Author(s) None Carrasco is an experienced technical architect specializing in SOA Governance and API Management. With a career dedicated to exploring Oracle's solutions, they bring hands-on expertise in applying technical concepts to solve real-world problems. Known for their clear and structured teaching, None ensures readers gain practical knowledge to drive value through technology. Who is it for? This book is ideal for Enterprise Architects, Solution Architects, Technical Architects, and API consultants seeking to deepen their understanding of API management with the Oracle 12c suite. If you're familiar with basic IT infrastructure concepts and looking to integrate business priorities with API development processes, this guide is tailored for you.

IBM Software for SAP Solutions

SAP is a market leader in enterprise business application software. SAP solutions provide a rich set of composable application modules, and configurable functional capabilities that are expected from a comprehensive enterprise business application software suite. In most cases, companies that adopt SAP software remain heterogeneous enterprises running both SAP and non-SAP systems to support their business processes. Regardless of the specific scenario, in heterogeneous enterprises most SAP implementations must be integrated with a variety of non-SAP enterprise systems: Portals Messaging infrastructure Business process management (BPM) tools Enterprise Content Management (ECM) methods and tools Business analytics (BA) and business intelligence (BI) technologies Security Systems of record Systems of engagement The tooling included with SAP software addresses many needs for creating SAP-centric environments. However, the classic approach to implementing SAP functionality generally leaves the business with a rigid solution that is difficult and expensive to change and enhance. When SAP software is used in a large, heterogeneous enterprise environment, SAP clients face the dilemma of selecting the correct set of tools and platforms to implement SAP functionality, and to integrate the SAP solutions with non-SAP systems. This IBM® Redbooks® publication explains the value of integrating IBM software with SAP solutions. It describes how to enhance and extend pre-built capabilities in SAP software with best-in-class IBM enterprise software, enabling clients to maximize return on investment (ROI) in their SAP investment and achieve a balanced enterprise architecture approach. This book describes IBM Reference Architecture for SAP, a prescriptive blueprint for using IBM software in SAP solutions. The reference architecture is focused on defining the use of IBM software with SAP, and is not intended to address the internal aspects of SAP components. The chapters of this book provide a specific reference architecture for many of the architectural domains that are each important for a large enterprise to establish common strategy, efficiency, and balance. The majority of the most important architectural domain topics, such as integration, process optimization, master data management, mobile access, Enterprise Content Management, business intelligence, DevOps, security, systems monitoring, and so on, are covered in the book. However, there are several other architectural domains which are not included in the book. This is not to imply that these other architectural domains are not important or are less important, or that IBM does not offer a solution to address them. It is only reflective of time constraints, available resources, and the complexity of assembling a book on an extremely broad topic. Although more content could have been added, the authors feel confident that the scope of architectural material that has been included should provide organizations with a fantastic head start in defining their own enterprise reference architecture for many of the important architectural domains, and it is hoped that this book provides great value to those reading it. This IBM Redbooks publication is targeted to the following audiences: Client decision makers and solution architects leading enterprise transformation projects and wanting to gain further insight so that they can benefit from the integration of IBM software in large-scale SAP projects. IT architects and consultants integrating IBM technology with SAP solutions.

Big Data for Chimps

Finding patterns in massive event streams can be difficult, but learning how to find them doesn’t have to be. This unique hands-on guide shows you how to solve this and many other problems in large-scale data processing with simple, fun, and elegant tools that leverage Apache Hadoop. You’ll gain a practical, actionable view of big data by working with real data and real problems. Perfect for beginners, this book’s approach will also appeal to experienced practitioners who want to brush up on their skills. Part I explains how Hadoop and MapReduce work, while Part II covers many analytic patterns you can use to process any data. As you work through several exercises, you’ll also learn how to use Apache Pig to process data. Learn the necessary mechanics of working with Hadoop, including how data and computation move around the cluster Dive into map/reduce mechanics and build your first map/reduce job in Python Understand how to run chains of map/reduce jobs in the form of Pig scripts Use a real-world dataset—baseball performance statistics—throughout the book Work with examples of several analytic patterns, and learn when and where you might use them

Pro MongoDB™ Development

Pro MongoDB Development is a critical reference for anyone using MongoDB, a NoSQL database based on the BSON (binary JSON) document model. The book explores many aspects of implementing MongoDB in web applications, whether you are using Java, PHP, Ruby, and JavaScript. Noted expert Deepak Vohra walks you through accessing MongoDB databases with all these languages and working with various other technologies and databases. Vohra discusses using Java EE frameworks Kundera and Spring Data with MongoDB. You learn the nuts and bolts of migrating data from other NoSQL databases (Apache Cassandra and Couchbase) and from relational databases (Oracle Database). And, because NoSQL databases are commonly used with the Hadoop ecosystem, the book also covers using MongoDB with Apache Hive. Each chapter includes details about the software you need and hands on examples of working with MongoDB and these technologies so you know exactly what to do, whatever your MongoDB implementation requires.

DynamoDB Cookbook

This comprehensive guide introduces you to Amazon's DynamoDB, a NoSQL database designed for high scalability and performance. Using this book, you will learn how to build robust web and mobile applications on DynamoDB and integrate it seamlessly with other AWS services for a complete cloud solution. What this Book will help me do Understand the key design concepts of DynamoDB and leverage its performance and scalability in your projects. Learn best practices for operating and managing DynamoDB tables, including optimizing throughput and designing efficient indexes. Master techniques for securing data in DynamoDB, including encryption and access management approaches. Explore integration strategies with other AWS services such as S3, EMR, and Lambda, to develop complex, real-world applications. Learn cost-effective solutions and tips for managing DynamoDB usage to avoid unnecessary expenses while maximizing resources. Author(s) None Deshpande, an expert in AWS and NoSQL databases, brings years of practical experience and engineering best practices to this book. With a strong focus on clear and actionable insights, Deshpande is dedicated to enabling developers to unlock the full potential of DynamoDB and related services for scalable application development. Who is it for? This book is most suited for developers and architects familiar with AWS who aim to deepen their understanding of DynamoDB. It is ideal for individuals looking to harness NoSQL databases for robust and scalable application solutions. The topics covered range from foundational knowledge to advanced integrations, making the book approachable yet comprehensive for both learners and seasoned practitioners.

IBM SONAS Best Practices

As IBM® Scale Out Network Attached Storage (SONAS) is adopted, it is important to provide information about planning, installation, and daily administration. This IBM Redbooks® publication also describes leading tuning practices information gained by those who implement and support SONAS. These preferred practices are based on hands-on experience from the field. Monitoring of the SONAS system is included. This IBM Redbooks publication provides information about IBM SONAS features and function at the 1.5.1 level. This book is the companion to the IBM SONAS Implementation Guide, SG24-7962 IBM Redbooks publication. It is intended for readers who have implemented SONAS and are responsible for daily administration and monitoring.

Managing Ever-Increasing Amounts of Data with IBM DB2 for z/OS: Using Temporal Data Management, Archive Transparency, and the DB2 Analytics Accelerator

IBM® DB2® Version 11.1 for z/OS® (DB2 11 for z/OS or just DB2 11 throughout this book) is the fifteenth release of DB2 for IBM MVS™. The DB2 11 environment is available either for new installations of DB2 or for migrations from DB2 10 for z/OS subsystems only. This IBM Redbooks® publication describes enhancements that are available with DB2 11 for z/OS. The contents help database administrators to understand the new extensions and performance enhancements, to plan for ways to use the key new capabilities, and to justify the investment in installing or migrating to DB2 11. Businesses are faced with a global and increasingly competitive business environment, and they need to collect and analyze ever increasing amounts of data (Figure 1). Governments also need to collect and analyze large amounts of data. The main focus of this book is to introduce recent DB2 capability that can be used to address challenges facing organizations with storing and analyzing exploding amounts of business or organizational data, while managing risk and trying to meet new regulatory and compliance requirements. This book describes recent extensions to DB2 for z/OS in V10 and V11 that can help organizations address these challenges.

Reduce Risk and Improve Security on IBM Mainframes: Volume 2 Mainframe Communication and Networking Security

This IBM® Redbooks® publication documents the strength and value of the IBM security strategy with IBM z Systems hardware and software (referred to in this book by the previous product name, IBM System z®). In an age of increasing security consciousness and more dangerous and advanced persistent threats, System z provides the capabilities to address today’s business security challenges. This book explores how System z hardware is designed to provide integrity, process isolation, and cryptographic capability to help address security requirements. We highlight the features of IBM z/OS® and other operating systems that offer a variety of customizable security elements. We also describe z/OS and other operating systems and additional software that use the building blocks of System z hardware to meet business security needs. We explore these from the perspective of an enterprise security architect and how a modern mainframe must fit into an enterprise security architecture. This book is part of a three-volume series that focuses on guiding principles for optimized mainframe security configuration within a holistic enterprise security architecture. The intended audience includes enterprise security architects, planners, and managers who are interested in exploring how the security design and features of the System z platform, the z/OS operating system, and associated software address current issues, such as data encryption, authentication, authorization, network security, auditing, ease of security administration, and monitoring.

Oracle PL/SQL in 10 Minutes, Sams Teach Yourself

Covers Oracle DBMS versions 10, 11, 12, and XE Sams Teach Yourself Oracle PL/SQL in 10 Minutes offers straightforward, practical answers when you need fast results. By working through the book’s 26 lessons of 10 minutes or less, you’ll learn what you need to know to take advantage of Oracle’s PL/SQL language. This handy pocket guide starts with simple data retrieval and moves on to more complex topics, including the use of joins, subqueries, regular expression and full text-based searches, stored procedures, cursors, triggers, table constraints, and much more. 10 minutes is all you need to learn how to... Use PL/SQL in the Oracle environments and tools Construct complex PL/SQL statements using multiple clauses and operators Retrieve, sort, and format database contents Pinpoint the data you need using a variety of filtering techniques Use string, date-time, and math functions to manipulate your data Join two or more related tables Insert, update, and delete data Create and alter database tables Work with views, stored procedures, cursors, triggers, and more

SAP in 24 Hours, Sams Teach Yourself, Fifth Edition

Thoroughly updated and expanded! Includes new coverage on HANA, the cloud, and using SAP’s applications! In just 24 sessions of one hour or less, you’ll get up and running with the latest SAP technologies, applications, and solutions. Using a straightforward, step-by-step approach, each lesson strengthens your understanding of SAP from both a business and technical perspective, helping you gain practical mastery from the ground up on topics such as security, governance, validations, release management, SLA, and legal issues. Step-by-step instructions carefully walk you through the most common questions, issues, and tasks. Quizzes and exercises help you build and test your knowledge. Notes present interesting pieces of information. Tips offer advice or teach an easier way to do something. Cautions advise you about potential problems and help you steer clear of disaster. Learn how to… Understand SAP terminology, concepts, and solutions Install SAP on premises or in the cloud Master SAP’s revamped user interface Discover how and when to use in-memory HANA databases Integrate SAP Software as a Service (SaaS) solutions such as Ariba, Successfactors, Fieldglass, and hybris Find resources at SAP’s Service Marketplace, Developer Network, and Help Portal Avoid pitfalls in SAP project implementation, migration, and upgrades Discover how SAP fits with mobile devices, social media, big data, and the Internet of Things Start or accelerate your career working with SAP technologies

Expert SQL Server In-Memory OLTP

Expert SQL Server In-Memory OLTP is a deep dive into one of the most significant features of SQL Server 2014 – support for In-Memory Online Transaction Processing. The book describes the architecture and internals of the In-Memory OLTP Engine and explains how to develop, deploy, and maintain systems using it. With it you can dramatically increase transactional throughput to handle thousands of transactions per second supporting millions of customers. Dmitri Korotkevitch is the five-star author of Pro SQL Server Internals, and now brings his same combination of clear thinking and deep expertise to the question of how to recognize the opportunities and benefit from Microsoft’s In-Memory OLTP feature set in SQL Server 2014. Learn the architecture and the internals in order to recognize when in-memory OLTP can make a difference. Learn useful scenarios for thoughtfully incorporating In-Memory support into existing applications. Recognize opportunities for In-Memory OLTP in new development. Don’t be without Dmitri Korotkevitch and the deep expertise he imparts in Expert SQL Server In-Memory OLTP as you move forward in using SQL Server’s new and important In-Memory OLTP feature set. Covers In-Memory OLTP internals and architecture, including data storage, indexing, multi-version concurrency control, transaction logging, and recovery Illustrates In-Memory OLTP programmability and the process of native compilation Guides in using In-Memory OLTP in new development and existing systems.

Agile Data Warehousing for the Enterprise

Building upon his earlier book that detailed agile data warehousing programming techniques for the Scrum master, Ralph's latest work illustrates the agile interpretations of the remaining software engineering disciplines: Requirements management benefits from streamlined templates that not only define projects quickly, but ensure nothing essential is overlooked. Data engineering receives two new "hyper modeling" techniques, yielding data warehouses that can be easily adapted when requirements change without having to invest in ruinously expensive data-conversion programs. Quality assurance advances with not only a stereoscopic top-down and bottom-up planning method, but also the incorporation of the latest in automated test engines. Use this step-by-step guide to deepen your own application development skills through self-study, show your teammates the world's fastest and most reliable techniques for creating business intelligence systems, or ensure that the IT department working for you is building your next decision support system the right way. Learn how to quickly define scope and architecture before programming starts Includes techniques of process and data engineering that enable iterative and incremental delivery Demonstrates how to plan and execute quality assurance plans and includes a guide to continuous integration and automated regression testing Presents program management strategies for coordinating multiple agile data mart projects so that over time an enterprise data warehouse emerges Use the provided 120-day road map to establish a robust, agile data warehousing program

Java Hibernate Cookbook

Learn how to use Hibernate, a powerful Java object-relational mapping framework, with the "Java Hibernate Cookbook." This book is designed to help developers quickly grasp the capabilities of Hibernate, taking you from installation to building real-world applications. What this Book will help me do Install and configure Hibernate for Java development. Utilize annotations to streamline Hibernate configuration. Implement collection mappings such as lists, maps, and sets. Understand and create associations between different entities. Integrate Hibernate with other frameworks to develop enterprise solutions. Author(s) None Prajapati and None Ranapariya are experienced Java developers and authors with a passion for sharing knowledge about cutting-edge frameworks and tools. They have significant expertise with Hibernate and have distilled their experience into this comprehensive guide to help developers understand and utilize Hibernate effectively. Who is it for? This book is ideal for Java developers looking to enhance their understanding of Hibernate. It assumes readers are comfortable with Java programming and have a basic knowledge of SQL. Whether you're improving existing skills or aiming to learn Hibernate for a new project, this book provides clear explanations and structured examples to help you succeed.

Mastering ArcGIS Server Development with JavaScript

Learn how to create interactive and stylish web mapping applications with "Mastering ArcGIS Server Development with JavaScript." By focusing on the ArcGIS JavaScript API along with modern development practices, this book provides you with practical examples and insights to build functional, visually appealing GIS applications for the web. What this Book will help me do Integrate and display data from multiple sources into a single-page application using mapping tools. Customize widgets and maps to ensure user engagement and optimal functionality. Use charts and graphs to intuitively represent geographic data insights. Combine the ArcGIS JavaScript API with popular JavaScript libraries for advanced map-making features. Develop web maps suitable for multiple devices and test for platform compatibility. Author(s) None Doman is a seasoned GIS developer with extensive experience in the ArcGIS ecosystem. Specializing in integrating cutting-edge JavaScript techniques with GIS solutions, Doman focuses on empowering developers through comprehensive and insightful learning materials. Their approachable writing style ensures even complex concepts are accessible and engaging. Who is it for? This book is aimed at professional developers looking to enhance their GIS application development skills. Ideal readers have a fundamental understanding of ArcGIS Server and some experience with developing using JavaScript. It is especially beneficial for those eager to apply GIS solutions in web development scenarios. It's perfect for those ready to deepen their expertise in GIS programming and create impactful web mapping applications.

Building a Scalable Data Warehouse with Data Vault 2.0

The Data Vault was invented by Dan Linstedt at the U.S. Department of Defense, and the standard has been successfully applied to data warehousing projects at organizations of different sizes, from small to large-size corporations. Due to its simplified design, which is adapted from nature, the Data Vault 2.0 standard helps prevent typical data warehousing failures. "Building a Scalable Data Warehouse" covers everything one needs to know to create a scalable data warehouse end to end, including a presentation of the Data Vault modeling technique, which provides the foundations to create a technical data warehouse layer. The book discusses how to build the data warehouse incrementally using the agile Data Vault 2.0 methodology. In addition, readers will learn how to create the input layer (the stage layer) and the presentation layer (data mart) of the Data Vault 2.0 architecture including implementation best practices. Drawing upon years of practical experience and using numerous examples and an easy to understand framework, Dan Linstedt and Michael Olschimke discuss: How to load each layer using SQL Server Integration Services (SSIS), including automation of the Data Vault loading processes. Important data warehouse technologies and practices. Data Quality Services (DQS) and Master Data Services (MDS) in the context of the Data Vault architecture. Provides a complete introduction to data warehousing, applications, and the business context so readers can get-up and running fast Explains theoretical concepts and provides hands-on instruction on how to build and implement a data warehouse Demystifies data vault modeling with beginning, intermediate, and advanced techniques Discusses the advantages of the data vault approach over other techniques, also including the latest updates to Data Vault 2.0 and multiple improvements to Data Vault 1.0