talk-data.com talk-data.com

Topic

data

3406

tagged

Activity Trend

3 peak/qtr
2020-Q1 2026-Q1

Activities

Showing filtered results

Filtering by: O'Reilly Data Engineering Books ×
Scaling Apache Solr

Become an expert in implementing high-performance, scalable search solutions with Apache Solr in 'Scaling Apache Solr'. This detailed guide teaches you how to architect and manage top-tier search functionalities tailored for different enterprise environments. What this Book will help me do Understand the Apache Solr ecosystem and its core functionality. Apply techniques for scaling and optimizing search for enterprise environments. Implement sharding, replication, and fault tolerance for robust searches. Integrate Solr with various systems and infrastructure to enhance capability. Optimize data indexing and retrieval for high-performance applications. Author(s) Vijay Karambelkar is an experienced software architect with extensive expertise in search technologies, including Solr and Lucene. He has worked on numerous enterprise applications where scalable and efficient search was critical. Vijay's writing is informed by his real-world implementations and is structured to provide practical knowledge to help readers tackle similar challenges. Who is it for? This book is ideal for software developers, architects, and IT professionals who manage or create enterprise search solutions. It's suitable for readers with basic programming knowledge but no experience with Apache Solr. This detailed guide will also benefit those looking to improve performance and scalability in their applications using cutting-edge technology. If scalability, integration, and cloud search solutions are topics you want to master, this book is tailored for you.

SQL Server 2014 Development Essentials

This book is your ultimate guide to mastering database development using Microsoft SQL Server 2014. By diving into this hands-on resource, you will explore the essentials of database design, implementation, and deployment to create robust solutions that meet modern enterprise needs. What this Book will help me do Gain a deep understanding of SQL Server 2014's new features and enhancements. Master database design principles for scalable and efficient solutions. Develop and optimize SQL queries for robust data retrieval and manipulation. Understand advanced database object topics and effective error handling. Learn performance optimization techniques for maintaining database efficiency. Author(s) None A. Masood-Al-Farooq is a seasoned database professional with extensive experience in SQL Server development and administration. They have worked on numerous critical projects in enterprise data management and have a practical, results-driven approach to database solutions. As an author, they focus on equipping readers with actionable insights and techniques through clear explanations and real-world examples. Who is it for? This book is ideal for database developers, administrators, and architects who work with Microsoft SQL Server and wish to expand their expertise in its 2014 version. Beginners to intermediate-level professionals will find it accessible and straightforward, while advanced users can discover new features and optimizations. It caters to anyone looking to design or optimize database solutions effectively. Whether you manage databases or are diving into database software development, this book will enhance your SQL Server 2014 skills.

Implementing the IBM Storwize V7000 V7.2

Continuing its commitment to developing and delivering industry-leading storage technologies, IBM® introduces the IBM Storwize® V7000 solution, an innovative new storage offering that delivers essential storage efficiency technologies and exceptional ease of use and performance, all integrated into a compact, modular design that is offered at a competitive, midrange price. The IBM Storwize V7000 solution incorporates some of the top IBM technologies typically found only in enterprise-class storage systems, raising the standard for storage efficiency in midrange disk systems. This cutting-edge storage system extends the comprehensive storage portfolio from IBM and can help change the way organizations address the ongoing information explosion. This IBM Redbooks® publication introduces the features and functions of the IBM Storwize V7000 system through several examples. This book is aimed at pre- and post-sales technical support and marketing, storage administrators, and will help you understand the architecture of the Storwize V7000, how to implement it, and take advantage of the industry leading functions and features.

Cloudera Administration Handbook

Discover how to effectively administer large Apache Hadoop clusters with the Cloudera Administration Handbook. This guide offers step-by-step instructions and practical examples, enabling you to confidently set up and manage Hadoop environments using Cloudera Manager and CDH5 tools. Through this book, administrators or aspiring experts can unlock the power of distributed computing and streamline cluster operations. What this Book will help me do Gain in-depth understanding of Apache Hadoop architecture and its operational framework. Master the setup, configuration, and management of Hadoop clusters using Cloudera tools. Implement robust security measures in your cluster including Kerberos authentication. Optimize for reliability with advanced HDFS features like High Availability and Federation. Streamline cluster management and address troubleshooting effectively using best practices. Author(s) None Menon is an experienced technologist specializing in distributed computing and data infrastructure. With a strong background in big data platforms and certifications in Hadoop administration, None has helped enterprises optimize their cluster deployments. Their instructional approach combines clarity, practical insights, and a hands-on focus. Who is it for? This book is ideal for systems administrators, data engineers, and IT professionals keen on mastering Hadoop environments. It serves both beginners getting started with cluster setup and seasoned administrators seeking advanced configurations. If you're aiming to efficiently manage Hadoop clusters using Cloudera solutions, this guide provides the knowledge and tools you need.

PostgreSQL 9 High Availability Cookbook

"PostgreSQL 9 High Availability Cookbook" is a guide for PostgreSQL DBAs and developers looking to build a robust and highly available database ecosystem. Through over 100 tested recipes, it delves into vital topics like replication, clustering, and monitoring to ensure system reliability and uptime. What this Book will help me do Set up PostgreSQL replication to enhance data availability and reliability. Implement monitoring solutions to keep your database's performance and health under check. Learn to troubleshoot common database issues to reduce downtime. Configure connection pooling to optimize resource usage and ensure better scalability. Master techniques for clustering and partitioning large datasets to handle growing system needs. Author(s) The author, Shaun Thomas, is a seasoned PostgreSQL administrator with extensive experience in database tuning, high availability solutions, and Linux system management. Shaun brings practical insights from his years of professional practice, aiming to make complex topics approachable. Who is it for? This book caters to intermediate to advanced PostgreSQL administrators and developers. If you are seeking to enhance your database's performance, reliability, and resilience, this book is for you. With its practical recipe approach, it's a great fit for those who enjoy hands-on learning. Whether you're maintaining production systems or scaling for growth, this guide is your ally.

Performance Optimization and Tuning Techniques for IBM Processors, including IBM POWER8

This IBM® Redbooks® publication focuses on gathering the correct technical information, and laying out simple guidance for optimizing code performance on IBM POWER8™ systems that run the AIX®, IBM i, or Linux operating systems. There is much straightforward performance optimization that can be performed with a minimum of effort and without extensive previous experience or in-depth knowledge. The POWER8 processor contains many new and important performance features, such as support for eight hardware threads in each core and support for transactional memory. POWER8 is a strict superset of IBM POWER7+™, and so all of the performance features of POWER7+, such as multiple page sizes, also appear in POWER8. Much of the technical information and guidance for optimizing performance on POWER8 presented in this guide also applies to POWER7+ and earlier processors, except where the guide explicitly indicates that a feature is new in POWER8. This guide strives to focus on optimizations that tend to be positive across a broad set of IBM POWER® processor chips and systems. Specific guidance is given for the POWER8 processor; however, the general guidance is applicable to the IBM POWER7+, IBM POWER7®, IBM POWER6®, IBM POWER5, and even to earlier processors. This guide is directed to personnel who are responsible for performing migration and implementation activities on IBM POWER8-based servers. This includes system administrators, system architects, network administrators, information architects, and database administrators (DBAs).

Understanding Big Data Scalability: Big Data Scalability Series, Part I

Get Started Scaling Your Database Infrastructure for High-Volume Big Data Applications “Understanding Big Data Scalability presents the fundamentals of scaling databases from a single node to large clusters. It provides a practical explanation of what ‘Big Data’ systems are, and fundamental issues to consider when optimizing for performance and scalability. Cory draws on many years of experience to explain issues involved in working with data sets that can no longer be handled with single, monolithic relational databases.... His approach is particularly relevant now that relational data models are making a comeback via SQL interfaces to popular NoSQL databases and Hadoop distributions.... This book should be especially useful to database practitioners new to scaling databases beyond traditional single node deployments.” —Brian O’Krafka, software architect presents a solid foundation for scaling Big Data infrastructure and helps you address each crucial factor associated with optimizing performance in scalable and dynamic Big Data clusters. Understanding Big Data Scalability Database expert Cory Isaacson offers practical, actionable insights for every technical professional who must scale a database tier for high-volume applications. Focusing on today’s most common Big Data applications, he introduces proven ways to manage unprecedented data growth from widely diverse sources and to deliver real-time processing at levels that were inconceivable until recently. Isaacson explains why databases slow down, reviews each major technique for scaling database applications, and identifies the key rules of database scalability that every architect should follow. You’ll find insights and techniques proven with all types of database engines and environments, including SQL, NoSQL, and Hadoop. Two start-to-finish case studies walk you through planning and implementation, offering specific lessons for formulating your own scalability strategy. Coverage includes Understanding the true causes of database performance degradation in today’s Big Data environments Scaling smoothly to petabyte-class databases and beyond Defining database clusters for maximum scalability and performance Integrating NoSQL or columnar databases that aren’t “drop-in” replacements for RDBMSes Scaling application components: solutions and options for each tier Recognizing when to scale your data tier—a decision with enormous consequences for your application environment Why data relationships may be even more important in non-relational databases Why virtually every database scalability implementation still relies on sharding, and how to choose the best approach How to set clear objectives for architecting high-performance Big Data implementations The Big Data Scalability Series is a comprehensive, four-part series, containing information on many facets of database performance and scalability. is the first book in the series. Understanding Big Data Scalability Learn more and join the conversation about Big Data scalability at bigdatascalability.com.

Computing in Geographic Information Systems

Capable of acquiring large volumes of data through sensors deployed in air, land, and sea, and making this information readily available in a continuous time frame, the science of geographical information system (GIS) is rapidly evolving. This popular information system is emerging as a platform for scientific visualization, simulation, and computation of spatio-temporal data. New computing techniques are being researched and implemented to match the increasing capability of modern-day computing platforms and easy availability of spatio-temporal data. This has led to the need for the design, analysis, development, and optimization of new algorithms for extracting spatio-temporal patterns from a large volume of spatial data. considers the computational aspects, and helps students understand the mathematical principles of GIS. It provides a deeper understanding of the algorithms and mathematical methods inherent in the process of designing and developing GIS functions. It examines the associated scientific computations along with the applications of computational geometry, differential geometry, and affine geometry in processing spatial data. It also covers the mathematical aspects of geodesy, cartography, map projection, spatial interpolation, spatial statistics, and coordinate transformation. The book discusses the principles of bathymetry and generation of electronic navigation charts. Computing in Geographic Information Systems The book consists of 12 chapters. Chapters one through four delve into the modeling and preprocessing of spatial data and prepares the spatial data as input to the GIS system. Chapters five through eight describe the various techniques of computing the spatial data using different geometric and statically techniques. Chapters nine through eleven define the technique for image registration computation and measurements of spatial objects and phenomenon. Examines cartographic modeling and map projection Covers the mathematical aspects of different map projections Explores some of the spatial analysis techniques and applications of GIS Introduces the bathymetric principles and systems generated using bathymetric charts Explains concepts of differential geometry, affine geometry, and computational geometry Discusses popular analysis and measurement methods used in GIS This text outlines the key concepts encompassing GIS and spatio-temporal information, and is intended for students, researchers, and professionals engaged in analysis, visualization, and estimation of spatio-temporal events.

Microsoft® Azure™ SQL Database Step by Step

Your hands-on guide to Azure SQL Database fundamentals Expand your expertise—and teach yourself the fundamentals of Microsoft Azure SQL Database. If you have previous programming experience but are new to Azure, this tutorial delivers the step-by-step guidance and coding exercises you need to master core topics and techniques. Discover how to: Perform Azure setup and configuration Explore design and security considerations Use programming and reporting services Migrate data Backup and sync data Work with scalability and high performance Understand the differences between SQL Server and Microsoft Azure SQL Database

IBM Distributed Virtual Switch 5000V Quickstart Guide

The IBM® Distributed Virtual Switch 5000V (DVS 5000V) is a software-based network switching solution that is designed for use with the virtualized network resources in a VMware enhanced data center. It works with VMware vSphere and ESXi 5.0 and beyond to provide an IBM Networking OS management plane and advanced Layer 2 features in the control and data planes. It provides a large-scale, secure, and dynamic integrated virtual and physical environment for efficient virtual machine (VM) networking that is aware of server virtualization events, such as VMotion and Distributed Resource Scheduler (DRS). The DVS 5000V interoperates with any 802.1Qbg compliant physical switch to enable switching of local VM traffic in the hypervisor or in the upstream physical switch. Network administrators who are familiar with IBM System Networking switches can manage the DVS 5000V just like IBM physical switches by using advanced networking, troubleshooting, and management features to make the virtual switch more visible and easier to manage. This IBM Redbooks® publication helps the network and system administrator install, tailor, and quickly configure the IBM Distributed Virtual Switch 5000V (DVS 5000V) for a new or existing virtualization computing environment. It provides several practical applications of the numerous features of the DVS 5000V, including a step-by-step guide to deploying, configuring, maintaining, and troubleshooting the device. Administrators who are already familiar with the CLI interface of IBM System Networking switches will be comfortable with the DVS 5000V. Regardless of whether the reader has previous experience with IBM System Networking, this publication is designed to help you get the DVS 5000V functional quickly, and provide a conceptual explanation of how the DVS 5000V works in tandem with VMware.

FileMaker Pro 13: The Missing Manual

You don’t need a technical background to build powerful databases with FileMaker Pro 13. This crystal-clear guide covers all new FileMaker Pro 13 features, such as its improved layout tools and enhanced mobile support. Whether you’re running a business, printing a catalog, or planning a wedding, you’ll learn how to customize your database to run on a PC, Mac, Web browser, or iOS device. The important stuff you need to know: Get started. Tour FileMaker Pro’s features and create your first database in minutes. Access data anywhere. Use FileMaker Go on your iPad or iPhone—or share data on the Web. Dive into relational data. Solve problems quickly by connecting and combining data tables. Create professional documents. Publish reports, invoices, catalogs, and other documents with ease. Harness processing power. Use calculations and scripts to crunch numbers, search text, and automate tasks. Add visual power and clarity. Create colorful charts to illustrate and summarize your data. Share your database on a secure server. Add the high-level features of FileMaker Pro Advanced and FileMaker Pro Server.

Harnessing the Power of ProtecTIER and Tivoli Storage Manager

This IBM® Redbooks® publication will help you install, tailor, and configure IBM ProtecTIER® products with IBM Tivoli® Storage Manager to harness the performance and the power of the two products working together as a data protection solution. This book goes beyond the preferred practices of each product and provides in-depth explanations of each of the items that are configurable, and the underlying reasons behind the suggestions. This book provides enough detailed information to allow an administrator to make the correct choices about which methods to use when implementing both products to meet and to exceed the business requirements. This publication provides descriptions and guidance about the following topics: Terminology and concepts of ProtecTIER and Tivoli Storage Manager Planning for ProtecTIER to run with Tivoli Storage Manager Setup and configuration of the IBM ProtecTIER device as a storage pool in the Tivoli Storage Manager environment, primarily as a Virtual Tape Library (VTL) interface, with a description as a File System Interface (FSI) Day-to-day administration of ProtecTIER when it is used in a Tivoli Storage Manager environment Overview of how to plan for disaster recovery in a ProtecTIER and Tivoli Storage Manager environment Monitoring and problem solving: How a system administrator can review ProtecTIER logs and Tivoli Storage Manager server logs to identify the source of problems Hints, tips, and use cases for ProtecTIER and Tivoli Storage Manager administrators

Modernizing IBM i Applications from the Database up to the User Interface and Everything in Between

This IBM® Redbooks® publication is focused on melding industry preferred practices with the unique needs of the IBM i community and providing a holistic view of modernization. This book covers key trends for application structure, user interface, data access, and the database. Modernization is a broad term when applied to applications. It is more than a single event. It is a sequence of actions. But even more, it is a process of rethinking how to approach the creation and maintenance of applications. There are tangible deliveries when it comes to modernization, the most notable being a modern user interface (UI), such as a web browser or being able to access applications from a mobile device. The UI, however, is only the beginning. There are many more aspects to modernization. Using modern tools and methodologies can significantly improve productivity and reduce long-term cost while positioning applications for the next decade. It is time to put the past away. Tools and methodologies have undergone significant transformation, improving functionality, usability, and productivity. This is true of the plethora of IBM tools and the wealth of tools available from many Independent Solution Providers (ISVs). This publication is the result of work that was done by IBM, industry experts, and by representatives from many of the ISV Tool Providers. Some of their tools are referenced in the book. In addition to reviewing technologies based on context, there is an explanation of why modernization is important and a description of the business benefits of investing in modernization. This critical information is key for line-of-business executives who want to understand the benefits of a modernization project. This book is appropriate for CIOs, architects, developers, and business leaders. Related information Making the Case for Modernization, IBM Systems Magazine

Large Scale and Big Data

Large Scale and Big Data: Processing and Management provides readers with a central source of reference on the data management techniques currently available for large-scale data processing. Presenting chapters written by leading researchers, academics, and practitioners, it addresses the fundamental challenges associated with Big Data processing tools and techniques across a range of computing environments. The book begins by discussing the basic concepts and tools of large-scale Big Data processing and cloud computing. It also provides an overview of different programming models and cloud-based deployment models. The book’s second section examines the usage of advanced Big Data processing techniques in different domains, including semantic web, graph processing, and stream processing. The third section discusses advanced topics of Big Data processing such as consistency management, privacy, and security. Supplying a comprehensive summary from both the research and applied perspectives, the book covers recent research discoveries and applications, making it an ideal reference for a wide range of audiences, including researchers and academics working on databases, data mining, and web scale data processing. After reading this book, you will gain a fundamental understanding of how to use Big Data-processing tools and techniques effectively across application domains. Coverage includes cloud data management architectures, big data analytics visualization, data management, analytics for vast amounts of unstructured data, clustering, classification, link analysis of big data, scalable data mining, and machine learning techniques.

SAP HCM - A Complete Tutorial

"SAP HCM - A Complete Tutorial" is your comprehensive guide to mastering SAP HCM concepts and configurations. Through practical examples and real-world solutions, this book ensures that you understand and apply the diverse functionalities within SAP HCM effectively. Learn how to handle challenges, automate processes, and deliver value to organizational HR functions. What this Book will help me do Grasp the core principles and features of SAP HCM. Configure and solve real-time issues within the module. Streamline HR processes using SAP HCM tools and techniques. Leverage best practices for customizing and optimizing SAP HCM functions. Develop proficiency in deploying SAP HCM for business needs. Author(s) Karthik S, an experienced professional in ERP systems and SAP HCM solutions, has spent years guiding organizations in deploying SAP-based solutions. Renowned for clear instruction and practical approaches, Karthik has written this book to help readers quickly learn, implement, and benefit from SAP HCM technologies. Who is it for? This book is ideal for IT professionals and ERP consultants interested in mastering SAP HCM. Whether you're new to SAP or looking to deepen existing knowledge, it's tailored for those seeking practical skills to configure and optimize HCM solutions. Readers should have a basic understanding of ERP concepts and a desire to develop SAP-specific expertise.

Healthcare Information Privacy and Security: Regulatory Compliance and Data Security in the Age of Electronic Health Records

Healthcare IT is the growth industry right now, and the need for guidance in regard to privacy and security is huge. Why? With new federal incentives and penalties tied to the HITECH Act, HIPAA, and the implementation of Electronic Health Record (EHR) systems, medical practices and healthcare systems are implementing new software at breakneck speed. Yet privacy and security considerations are often an afterthought, putting healthcare organizations at risk of fines and damage to their reputations. Healthcare Information Privacy and Security: Regulatory Compliance and Data Security in the Age of Electronic Health Records outlines the new regulatory regime, and it also provides IT professionals with the processes and protocols, standards, and governance tools they need to maintain a secure and legal environment for data and records. It’s a concrete resource that will help you understand the issues affecting the law and regulatory compliance, privacy, and security in the enterprise. As healthcare IT security expert Bernard Peter Robichau II shows, the success of a privacy and security initiative lies not just in proper planning but also in identifying who will own the implementation and maintain technologies and processes. From executive sponsors to system analysts and administrators, a properly designed security program requires that that the right people are assigned to the right tasks and have the tools they need. Robichau explains how to design and implement that program with an eye toward long-term success. Putting processes and systems in place is, of course, only the start. Robichau also shows how to manage your security program and maintain operational support including ongoing maintenance and policy updates. (Because regulations never sleep!) This book will help you devise solutions that include: Identity and access management systems Proper application design Physical and environmental safeguards Systemwide and client-based security configurations Safeguards for patient data Training and auditing procedures Governance and policy administration Healthcare Information Privacy and Security is the definitive guide to help you through the process of maintaining privacy and security in the healthcare industry. It will help you keep health information safe, and it will help keep your organization—whether local clinic or major hospital system—on the right side of the law.

GeoComputation, Second Edition, 2nd Edition

A revision of Openshaw and Abrahart’s seminal work, GeoComputation, Second Edition retains influences of its originators while also providing updated, state-of-the-art information on changes in the computational environment. In keeping with the field’s development, this new edition takes a broader view and provides comprehensive coverage across the field of GeoComputation. See What’s New in the Second Edition: Coverage of ubiquitous computing, the GeoWeb, reproducible research, open access, and agent-based modelling Expanded chapter on Genetic Programming and a separate chapter developed on Evolutionary Algorithms Ten chapters updated by the same or new authors and eight new chapters added to reflect state of the art Each chapter is a stand-alone entity that covers a particular topic. You can simply dip in and out or read it from cover to cover. The opening chapter by Stan Openshaw has been preserved, with only a limited number of minor essential modifications having been enacted. This is not just a matter of respect. Openshaw’s work is eloquent, prophetic, and his overall message remains largely unchanged. In contrast to other books on this subject, GeoComputation: Second Edition supplies a state-of-the-art review of all major areas in GeoComputation with chapters written especially for this book by invited specialists. This approach helps develop and expand a computational culture, one that can exploit the ever-increasing richness of modern geographical and geospatial datasets. It also supplies an instructional guide to be kept within easy reach for regular access and when need arises.

IBM System z Connectivity Handbook

This IBM® Redbooks® publication discusses the connectivity options available for use within and beyond the data center for the IBM System z® family of mainframes, which includes these systems: IBM zEnterprise® EC12 (zEC12) IBM zEnterprise BC12 (zBC12) IBM zEnterprise 196 (z196) IBM zEnterprise 114 (z114) IBM System z10® Enterprise Class (z10 EC) IBM System z10 Business Class (z10 BC) This book highlights the hardware and software components, functions, typical uses, coexistence, and relative merits of these connectivity features. It helps readers understand the connectivity alternatives that are available when planning and designing their data center infrastructures. The changes to this edition are based on the System z hardware announcement dated July 23, 2013. This book is intended for data center planners, IT professionals, systems engineers, technical sales staff, and network planners who are involved in the planning of connectivity solutions for IBM System z servers.

Architecting and Deploying IBM DB2 with BLU Acceleration in Your Analytical Environment

IBM® DB2® with BLU Acceleration is a revolutionary technology that is delivered in DB2 for Linux, UNIX, and Windows Release 10.5. BLU Acceleration delivers breakthrough performance improvements for analytic queries by using dynamic in-memory columnar technologies. Different from other vendor solutions, BLU Acceleration allows the unified computing of online transaction processing (OLTP) and analytics data inside a single database, therefore, removing barriers and accelerating results for users. With observed hundredfold improvement in query response time, BLU Acceleration provides a simple, fast, and easy-to-use solution for the needs of today's organizations; quick access to business answers can be used to gain a competitive edge, lower costs, and more. This IBM Redbooks® publication introduces the concepts of DB2 with BLU Acceleration. It discusses the steps to move from a relational database to using BLU Acceleration, optimizing BLU usage, and deploying BLU into existing analytic solutions today, with an example of IBM Cognos®. This book also describes integration of DB2 with BLU Acceleration into SAP Business Warehouse (SAP BW) and SAP's near-line storage solution on DB2. This publication is intended to be helpful to a wide-ranging audience, including those readers who want to understand the technologies and readers who have planning, deployment, and support responsibilities.

Pro SQL Server Internals

Pro SQL Server Internals explains how different SQL Server components work "under the hood" and how they communicate with each other. This is the practical book with a large number of examples that will show you how various design and implementation decisions affect the behavior and performance of your systems. Pro SQL Server Internals covers a multiple SQL Server versions starting with SQL Server 2005 all the way up to the recently released SQL Server 2014. You’ll learn about new SQL Server 2014 features including the new Cardinality Estimator, In-Memory OLTP Engine (codename Hekaton), and Clustered Columnstore Indexes. With Pro SQL Server Internals, you have a solid roadmap for understanding the depth and power of the SQL Server database backend, regardless of the version and edition of SQL Server you use. Pro SQL Server Internals does the following: Explains how to design efficient database schema, indexing, and transaction strategies. Shows how various database objects and technologies are implemented internally and when they should or should not be used. Demonstrates how SQL Server executes queries and works with data and transaction logs.