talk-data.com talk-data.com

Topic

data-engineering

3395

tagged

Activity Trend

1 peak/qtr
2020-Q1 2026-Q1

Activities

3395 activities · Newest first

Hiding Behind the Keyboard

Hiding Behind the Keyboard: Uncovering Covert Communication Methods with Forensic Analysis exposes the latest electronic covert communication techniques used by cybercriminals, along with the needed investigative methods for identifying them. The book shows how to use the Internet for legitimate covert communication, while giving investigators the information they need for detecting cybercriminals who attempt to hide their true identity. Intended for practitioners and investigators, the book offers concrete examples on how to communicate securely, serving as an ideal reference for those who truly need protection, as well as those who investigate cybercriminals. Covers high-level strategies, what they can achieve, and how to implement them Shows discovery and mitigation methods using examples, court cases, and more Explores how social media sites and gaming technologies can be used for illicit communications activities Explores the currently in-use technologies such as TAILS and TOR that help with keeping anonymous online

Learning QGIS, Third Edition - Third Edition

Learning QGIS, Third Edition, serves as a comprehensive guide for GIS users looking to enhance their skills using the QGIS platform. By following the structured, step-by-step instructions, you'll master data visualization, manipulation, and advanced mapping techniques. The book emphasizes practical knowledge, enabling you to efficiently handle both data processing and cartographic output. What this Book will help me do Install and effectively navigate the QGIS software interface to enable GIS tasks. Load, visualize, and manage vector and raster spatial data from various sources. Create, edit, and analyze spatial datasets with precision using QGIS tools. Perform and automate complex geoprocessing tasks using the Processing toolbox. Configure advanced cartographic outputs including printable maps tailored to your needs. Author(s) Anita Graser, a notable GIS expert, brings her extensive experience and knowledge in open-source geospatial technologies to this book. She is a core developer of QGIS and regularly publishes content on GIS applications and spatial analysis. Anita excels in presenting complex concepts in a user-friendly manner, making advanced GIS techniques accessible to learners of diverse backgrounds. Who is it for? This book is tailored for GIS professionals, consultants, or developers looking to expand their expertise in QGIS. Whether you're familiar with GIS principles or are an experienced user of other platforms, this book helps bridge the gap to using QGIS effectively. If you're aiming to enhance your mapping and geospatial analysis capabilities, this guide is greatly suited for your ambitions.

IBM z13 and IBM z13s Technical Introduction

This IBM® Redbooks® publication introduces the latest IBM z Systems™ platforms, the IBM z13™ and IBM z13s. It includes information about the z Systems environment and how it can help integrate data, transactions, and insight for faster and more accurate business decisions. The z13 and z13s are state-of-the-art data and transaction systems that deliver advanced capabilities that are vital to modern IT infrastructures. These capabilities include: Accelerated data and transaction serving Integrated analytics Access to the API economy Agile development and operations Efficient, scalable, and secure cloud services End-to-end security for data and transactions This book explains how these systems use both new innovations and traditional z Systems strengths to satisfy growing demand for cloud, analytics, and mobile applications. With one of these z Systems platforms as the base, applications can run in a trusted, reliable, and secure environment that both improves operations and lessens business risk.

IBM PowerKVM: Configuration and Use

This IBM® Redpaper Redbooks® publication presents the IBM PowerKVM virtualization for scale-out Linux systems, including the new LC IBM Power Systems™. PowerKVM is open source server virtualization that is based on the IBM POWER8® processor technology. It includes the Linux open source technology of KVM virtualization, and it complements the performance, scalability, and security qualities of Linux. This book describes the concepts of PowerKVM and how you can deploy your virtual machines with the software stack included in the product. It helps you install and configure PowerKVM on your Power Systems server and provides guidance for managing the supported virtualization features by using the web interface and command-line interface (CLI). This information is for professionals who want to acquire a better understanding of PowerKVM virtualization technology to optimize Linux workload consolidation and use the POWER8 processor features. The intended audience also includes people in these roles: Clients Sales and marketing professionals Technical support professionals IBM Business Partners Independent software vendors Open source community IBM OpenPower partners It does not replace the latest marketing materials and configuration tools. It is intended as an additional source of information that, along with existing sources, can be used to increase your knowledge of IBM virtualization solutions. Before you start reading, you must be familiar with the general concepts of kernel-based virtual machine (KVM), Linux, and IBM Power architecture.

IBM Spectrum Family: IBM Spectrum Control Standard Editon

IBM® Spectrum Control (Spectrum Control), a member of the IBM Spectrum™ Family of products, is the next-generation data management solution for software-defined environments (SDEs). With support for block, file, object workloads, and software-defined storage and predictive analytics, and automated and advanced monitoring to identify proactively storage performance problems, Spectrum Control enables administrators to provide efficient management for heterogeneous storage environments. IBM Spectrum Control™ (formerly IBM Tivoli® Storage Productivity Center) delivers a complete set of functions to manage IBM Spectrum Virtualize™, IBM Spectrum Accelerate™, and IBM Spectrum Scale™ storage infrastructures, and traditional IBM and select third-party storage hardware systems. This IBM Redbooks® publication provides practical examples and use cases that can be deployed with IBM Spectrum Control Standard Edition, with an overview of IBM Spectrum Control Advanced Edition. This book complements the Spectrum Control IBM Knowledge Center, which is referenced for product details, and for installation and implementation details throughout this book. You can find this resource as the following website: IBM Spectrum Control Knowledge Center Also provided are descriptions and an architectural overview of the IBM Spectrum Family, highlighting Spectrum Control, as integrated into software-defined storage environments. This publication is intended for storage administrators, clients who are responsible for maintaining IT and business infrastructures, and anyone who wants to learn more about employing Spectrum Control and Spectrum Control Standard Edition.

Introduction to the New Mainframe: IBM z/VSE Basics

This IBM® Redbooks® publication is based on the book Introduction to the New Mainframe: z/OS Basics, SG24-6366, which was produced by the International Technical Support Organization (ITSO), Poughkeepsie Center. It provides students of information systems technology with the background knowledge and skills necessary to begin using the basic facilities of a mainframe computer. For optimal learning, students are assumed to have successfully completed an introductory course in computer system concepts, such as computer organization and architecture, operating systems, data management, or data communications. They should also have successfully completed courses in one or more programming languages, and be PC literate. This textbook can also be used as a prerequisite for courses in advanced topics, or for internships and special studies. It is not intended to be a complete text covering all aspects of mainframe operation. It is also not a reference book that discusses every feature and option of the mainframe facilities. Others who can benefit from this course include experienced data processing professionals who have worked with non-mainframe platforms, or who are familiar with some aspects of the mainframe but want to become knowledgeable with other facilities and benefits of the mainframe environment. As we go through this course, we suggest that the instructor alternate between text, lecture, discussions, and hands-on exercises. Many of the exercises are cumulative, and are designed to show the student how to design and implement the topic presented. The instructor-led discussions and hands-on exercises are an integral part of the course, and can include topics not covered in this textbook. In this course, we use simplified examples and focus mainly on basic system functions. Hands-on exercises are provided throughout the course to help students explore the mainframe style of computing. At the end of this course, you will be familiar with the following information: Basic concepts of the mainframe, including its usage and architecture Fundamentals of IBM z/VSE® (VSE), an IBM z™ Systems entry mainframe operating system (OS) An understanding of mainframe workloads and the major middleware applications in use on mainframes today The basis for subsequent course work in more advanced, specialized areas of z/VSE, such as system administration or application programming

Ceph Cookbook

Ceph Cookbook is a practical guide offering over 100 detailed recipes to help you effectively design, implement, and manage the Ceph software-defined storage system. Through step-by-step tutorials, readers will master critical tasks, from cluster setup to integration with cloud and virtualization platforms. What this Book will help me do Gain hands-on skills to set up, manage, and maintain a Ceph cluster effectively. Learn to integrate Ceph with popular cloud solutions like OpenStack for optimal performance. Understand techniques for advanced troubleshooting, monitoring, and optimization of storage systems. Develop proficiency in creating scalable storage solutions for enterprise environments. Master best practices in utilizing Ceph's various storage paradigms and technologies. Author(s) Karan Singh is a seasoned technology professional with extensive experience in storage systems and cloud design. With years of experience working with Ceph and an active participant in the open-source community, Karan brings practical insights and in-depth technical knowledge to his writing. His clear and approachable style helps demystify complex concepts for readers. Who is it for? This book is ideal for storage engineers, cloud administrators, and technical architects seeking to understand and deploy software-defined storage solutions. Whether you have foundational knowledge of Linux and storage technologies or are new to Ceph, this book will guide you. Professionals aiming to enhance their cloud infrastructure will find actionable steps and strategies here.

Elasticsearch Server - Third Edition - Third Edition

Master the art of efficient search solutions with the insights and techniques provided in 'Elasticsearch Server - Third Edition'. This comprehensive guide covers everything from the basics of indexing and querying to advanced topics like aggregation and scaling, ensuring you can build robust search infrastructures tailored to your project's needs. What this Book will help me do Gain practical expertise in configuring Elasticsearch indices and retrieving data efficiently. Learn to craft complex queries using the Elasticsearch query domain-specific language (DSL). Understand and implement advanced search features for enhanced functionality. Master the aggregation framework to derive valuable insights from your data. Equip yourself with the skills to monitor and optimize your Elasticsearch cluster for performance and scalability. Author(s) Marek Rogozinski and Rafal Kuc are seasoned experts in search technologies and have extensive experience working with Elasticsearch and related domains. With years of technical experience and a passion for teaching through clear, hands-on examples, they aim to make mastering Elasticsearch accessible and practical for tech professionals and enthusiasts alike. Who is it for? This book is aimed at software developers and IT professionals who are eager to build or strengthen their expertise in Elasticsearch. Whether you're new to search infrastructure or looking to refine your skills, this book is tailored for beginner to intermediate levels. If your goal is to deploy scalable search solutions or understand how to analyze large datasets effectively, this book is for you.

IBM DS8880 Architecture and Implementation (Release 8)

This IBM® Redbooks® publication describes the concepts, architecture, and implementation of the IBM DS8880 family. The book provides reference information to assist readers who need to plan for, install, and configure the DS8880 systems. The IBM DS8000® family is a high-performance, high-capacity, highly secure, and resilient series of disk storage systems. The DS8880 family is the latest and most advanced of the DS8000 offerings to date. The high availability, multiplatform support, including the IBM z Systems™, and simplified management tools help provide a cost-effective path to an on-demand world. The new IBM DS8880 family includes two high-performance models (DS8886 Model 981 with its associated DS8886 Expansion Unit Model 98E, and the DS8884 Model 980 with its associated Expansion Unit Model 98B). Two powerful IBM POWER8® processor-based servers manage the cache to streamline disk I/Os, maximizing performance and throughput. These capabilities are further enhanced with the availability of high-performance flash enclosures (HPFEs). A major change with the introduction of the DS8880 is the reduction of the footprint to a 19-inch rack. Like its predecessors, the DS8880 supports advanced disaster recovery (DR) solutions, business continuity solutions, and thin provisioning. All disk drives in the DS8880 storage system include the Full Disk Encryption (FDE) feature. The DS8880 can automatically optimize the use of each storage tier, particularly flash drives and flash cards, through the IBM Easy Tier® feature. The DS8880 also can be integrated in a Lightweight Directory Access Protocol (LDAP) infrastructure.

Real-Time Big Data Analytics

This book delves into the techniques and tools essential for designing, processing, and analyzing complex datasets in real-time using advanced frameworks like Apache Spark, Storm, and Amazon Kinesis. By engaging with this thorough guide, you'll build proficiency in creating robust, efficient, and scalable real-time data processing architectures tailored to real-world scenarios. What this Book will help me do Learn the fundamentals of real-time data processing and how it differs from batch processing. Gain hands-on experience with Apache Storm for creating robust data-driven solutions. Develop real-world applications using Amazon Kinesis for cloud-based analytics. Perform complex data queries and transformations with Spark SQL and understand Spark RDDs. Master the Lambda Architecture to combine batch and real-time analytics effectively. Author(s) Shilpi Saxena is a renowned expert in big data technologies, holding extensive experience in real-time data analytics. With a career spanning years in the industry, Shilpi has provided innovative solutions for big data challenges in top-tier organizations. Her teaching approach emphasizes practical applicability, making her writings accessible and impactful for developers and architects alike. Who is it for? This book is for software professionals such as Big Data architects, developers, or programmers looking to enhance their skills in real-time big data analytics. If you are familiar with basic programming principles and seek to build solutions for processing large data streams in real-time environments, this book caters to your needs. It is also suitable for those seeking to familiarize themselves with using state-of-the-art tools like Spark SQL, Apache Storm, and Amazon Kinesis. Whether you're extending current expertise or transitioning into this field, this resource helps you achieve your objectives.

Handbook of Big Data

This handbook provides a state-of-the-art overview of the analysis of large-scale datasets. Featuring contributions from statistics and computer science experts in industry and academia, the text instills a working understanding of key statistical and computing ideas that can be readily applied in research and practice. Offering balanced coverage of methodology, theory, and applications, the text describes modern, scalable approaches for analyzing large datasets. It details advances in statistics and machine learning, as well as defines the underlying concepts of the available analytical tools and techniques.

MySQL for the Internet of Things

This book introduces the problems facing Internet of Things developers and explores current technologies and techniques to help you manage, mine, and make sense of the data being collected through the use of the world’s most popular database on the Internet - MySQL. The IoT is poised to change how we interact with and perceive the world around us, and the possibilities are nearly boundless. As more and more connected devices generate data, we will need to solve the problem of how to collect, store, and make sense of IoT data by leveraging the power of database systems. The book begins with an introduction of the MySQL database system and storage of sensor data. Detailed instructions and examples are provided to show how to add database nodes to IoT solutions including how to leverage MySQL high availability, including examples of how to protect data from node outages using advanced features of MySQL. The book closes with a comparison of raw and transformed data showing how transformed data can improve understandability and help you cut through a clutter of superfluous data toward the goal of mining nuggets of useful knowledge.

Advanced Oracle PL/SQL Developer's Guide (Second Edition) - Second Edition

In "Advanced Oracle PL/SQL Developer's Guide (Second Edition)", you'll delve into the advanced capabilities of Oracle PL/SQL, honing skills needed for professional-level certification while mastering the innovations introduced in Oracle Database 12c. This book serves as a comprehensive resource for enhancing your database development expertise. What this Book will help me do Master advanced Oracle PL/SQL development skills aligned with Oracle Database 12c innovations. Understand and implement Virtual Private Database (VPD) for advanced database security. Gain expertise in tuning, profiling, and debugging PL/SQL code for robust application performance. Integrate and utilize Oracle Database 12c features such as Multitenant feature and Database In-Memory. Prepare for the 1Z0-146 Oracle certification to become recognized as an Advanced PL/SQL Developer. Author(s) Saurabh K. Gupta is an experienced Oracle developer and author known for his clarity and depth in explaining advanced technical concepts. With a strong background in Oracle Database and PL/SQL development, he imparts knowledge that bridges the gap between learning and practical application. Gupta's writing emphasizes clarity and hands-on understanding, making complex topics accessible to developers. Who is it for? This book is tailored for advanced Oracle developers looking to deepen their understanding of PL/SQL and integrate Oracle Database 12c's features into their workflow. It is particularly beneficial for professionals preparing for the 1Z0-146 Oracle exam. Readers should have foundational knowledge in PL/SQL and a determination to elevate their technical proficiency.

Fast Data Front Ends for Hadoop

Organizations striving to build applications for streaming data have a new possibility to ponder: the use of ingestion engines at the front end of their Hadoop systems. With this O’Reilly report, you’ll learn how these fast data front ends process data before it reaches the Hadoop Data File System (HDFS), and provide intelligence and context in real time. This helps you reduce response times from hours to minutes, or even minutes to seconds. Author and independent consultant Akmal Chaudhri looks at several popular ingestion engines, including Apache Spark, Apache Storm, and the VoltDB in-memory database. Among them, VoltDB stands out by providing full Atomicity, Consistency, Isolation, and Durability (ACID) support. VoltDB also lets you build a fast data front-end that uses the familiar SQL language and standards. Learn the advantages of ingestion engines as well as the theoretical and practical problems that can come up in an implementation. You’ll discover how this option can handle streaming data, provide state, ensure durability, and support transactions and real-time decisions. Akmal B. Chaudhri is an Independent Consultant, specializing in big data, NoSQL, and NewSQL database technologies. He has previously held roles as a developer, consultant, product strategist, and technical trainer with several blue-chip companies and big data startups. Akmal regularly presents at international conferences and serves on program committees for several major conferences and workshops.

IBM Spectrum Accelerate Deployment, Usage, and Maintenance

This edition applies to IBM® Spectrum Accelerate V11.5.1 and V11.5.3. IBM Spectrum™ Accelerate, a member of IBM Spectrum Storage™, is an agile, software-defined storage solution for enterprise and cloud that builds on the customer-proven and mature IBM XIV® storage software. The key characteristic of Spectrum Accelerate is that it can be easily deployed and run on purpose-built or existing hardware that is chosen by the customer. IBM Spectrum Accelerate™ enables rapid deployment of high-performance and scalable block data storage infrastructure over commodity hardware on-premises or off-premises. This IBM Redbooks® publication provides a broad understanding of IBM Spectrum Accelerate. The book introduces Spectrum Accelerate and describes planning and preparation that are essential for a successful deployment of the solution. The deployment is described through a step-by-step approach, by using a graphical user interface (GUI) based method or a simple command-line interface (CLI) based procedure. Chapters in this book describe the logical configuration of the system, host support and business continuity functions, and migration. Although it makes many references to the XIV storage software, the book also emphasizes where IBM Spectrum Accelerate differs from XIV. Finally, a substantial portion of the book is dedicated to maintenance and troubleshooting to provide detailed guidance for the customer support personnel.

VersaStack Solution by Cisco and IBM with IBM DB2, IBM Spectrum Control, and IBM Spectrum Protect

Dynamic organizations want to accelerate growth while reducing costs. To do so, they must speed the deployment of business applications and adapt quickly to any changes in priorities. Organizations require an IT infrastructure to be easy, efficient, and versatile. The VersaStack solution by Cisco and IBM® can help you accelerate the deployment of your datacenters. It reduces costs by more efficiently managing information and resources while maintaining your ability to adapt to business change. The VersaStack solution combines the innovation of Cisco Unified Computing System (Cisco UCS) Integrated Infrastructure with the efficiency of the IBM Storwize® storage system. The Cisco UCS Integrated Infrastructure includes the Cisco UCS, Cisco Nexus and Cisco MDS switches, and Cisco UCS Director. The IBM Storwize V7000 storage system enhances virtual environments with its Data Virtualization, IBM Real-time Compression™, and IBM Easy Tier® features. These features deliver extraordinary levels of performance and efficiency. The VersaStack solution is Cisco Application Centric Infrastructure (ACI) ready. Your IT team can build, deploy, secure, and maintain applications through a more agile framework. Cisco Intercloud Fabric capabilities help enable the creation of open and highly secure solutions for the hybrid cloud. These solutions accelerate your IT transformation while delivering dramatic improvements in operational efficiency and simplicity. Cisco and IBM are global leaders in the IT industry. The VersaStack solution gives you the opportunity to take advantage of integrated infrastructure solutions that are targeted at enterprise applications, analytics, and cloud solutions. The VersaStack solution is backed by Cisco Validated Designs (CVDs) to provide faster delivery of applications, greater IT efficiency, and less risk. This IBM Redbooks® publication is aimed at experienced storage administrators that are tasked with deploying a VersaStack solution with IBM DB2® High Availability (DB2 HA), IBM Spectrum™ Protect, and IBM Spectrum Control™.

Elasticsearch Essentials

"Elasticsearch Essentials" provides a comprehensive introduction to Elasticsearch, the powerful search and analytics engine. This book delivers a fast-paced, practical guide to harnessing Elasticsearch for creating scalable search and analytics applications. What this Book will help me do Learn to effectively use Elasticsearch REST APIs for search and analytics. Understand and design schema and mappings with best practices. Master data modeling concepts for efficient data queries. Develop skills to create and manage Elasticsearch clusters in production. Learn techniques for ensuring high availability and handling large datasets. Author(s) Bharvi Dixit is a seasoned developer and expert in search technologies with hands-on experience in Elasticsearch and other search solutions. With extensive knowledge in data analytics and large-scale systems, Bharvi ensures readers gain practical skills and insights through well-structured examples and explanations. Who is it for? This book is perfect for developers looking to enhance their skills in building search and analytics solutions with Elasticsearch. It's particularly suited for those familiar with search technologies like Apache Lucene or Solr but new to Elasticsearch. Beginners to intermediate learners in big data and analytics will find the structured approach beneficial. It's ideal for professionals aspiring to develop advanced search implementations with modern tools.

Oracle SQL Developer

Delve into the world of database management with 'Oracle SQL Developer,' an essential guide for mastering the feature-rich SQL Developer 4.1 interface. This book provides a step-by-step approach to using SQL Developer's capabilities for database design, development, and administration, ensuring you can leverage powerful features like data modeling, reports, and REST services to streamline and enhance your workflow. What this Book will help me do Understand the advanced features of SQL Developer 4.1 and how to install and navigate them effectively. Master essential database management tasks, including creating, editing, and deleting database objects. Learn to utilize the SQL worksheet for running SQL scripts, debugging PL/SQL code, and manipulating data. Develop skills in database performance tuning, exporting/importing data, and creating custom reports. Gain proficiency in data modeling and harnessing SQL Developer's extensibility for advanced tasks. Author(s) Ajith Narayanan and Susan Harper bring a wealth of experience to this book. Ajith Narayanan, an Oracle APPS DBA with over 10 years of experience, combines technical expertise with a passion for teaching nuanced database management practices. Co-author Susan Harper adds to this knowledge base, providing a comprehensive and insightful approach to leveraging SQL Developer. Together, they focus on practicality and clarity, enabling readers to understand and apply complex concepts. Who is it for? This book is tailored for Oracle developers, database administrators, and data architects seeking to enhance their efficiency and capabilities using SQL Developer. It suits professionals with a working knowledge of SQL and PL/SQL who aim to optimize their workflows. Beginners with foundational knowledge of Oracle database concepts will also find this an accessible and rewarding resource for learning advanced database management.

Mastering OpenLayers 3

Delve into the world of advanced web mapping with 'Mastering OpenLayers 3.' This comprehensive guide equips you with the knowledge to create responsive, robust web mapping applications using the OpenLayers 3 library, showcasing step-by-step examples and practical insights. What this Book will help me do Learn to effectively utilize OpenLayers 3's advanced features for web mapping. Integrate and customize the library in your own mapping applications proficiently. Develop thematic maps and apply stunning visual effects using advanced techniques. Create mobile-friendly, interactive web mapping solutions. Extend the capabilities of OpenLayers 3 with your own custom classes and scripts. Author(s) None Farkas is a skilled technical author with expertise in web mapping technologies and libraries like OpenLayers. Known for his hands-on approach, None brings clarity to complex topics, making them accessible to developers of various skill levels. Who is it for? This book is perfect for developers with basic to intermediate knowledge of JavaScript and GIS. If you are a front-end developer looking to build dynamic mapping applications or someone aiming to deepen your understanding of OpenLayers 3, this book is for you.

IBM Tape Library Guide for Open Systems

This IBM® Redbooks® publication presents a general introduction to Linear Tape-Open (LTO) technology and the implementation of corresponding IBM products. The book highlights the new generation IBM LTO-7 tape drives, which are the next-generation storage solution that is designed to help midsize and large enterprises respond to storage challenges. This twelfth edition includes information about the latest enhancements to the IBM Ultrium family of tape drives and tape libraries. In particular, it includes details of the latest IBM LTO Ultrium 7 tape drive technology and its implementation in IBM tape libraries. It contains technical information about each IBM tape product for open systems and includes generalized sections about Small Computer System Interface (SCSI) and Fibre Channel connections and multipath architecture configurations. This book also covers tools and techniques for library management. It is intended for anyone who wants to understand more about IBM tape products and their implementation. It is suitable for IBM clients, IBM Business Partners, IBM specialist sales representatives, and technical specialists. If you do not have a background in computer tape storage products, you might need to read other sources of information. In the interest of being concise, topics that are generally understood are not covered in detail.