search

Structured Search for Big Data

2015-08-26 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Mikhail Gilula

Big Data Data Modelling DWH NoSQL Teradata data data-engineering

The WWW era made billions of people dramatically dependent on the progress of data technologies, out of which Internet search and Big Data are arguably the most notable. Structured Search paradigm connects them via a fundamental concept of key-objects evolving out of keywords as the units of search. The key-object data model and KeySQL revamp the data independence principle making it applicable for Big Data and complement NoSQL with full-blown structured querying functionality. The ultimate goal is extracting Big Information from the Big Data. As a Big Data Consultant, Mikhail Gilula combines academic background with 20 years of industry experience in the database and data warehousing technologies working as a Sr. Data Architect for Teradata, Alcatel-Lucent, and PayPal, among others. He has authored three books, including The Set Model for Database and Information Systems and holds four US Patents in Structured Search and Data Integration. Conceptualizes structured search as a technology for querying multiple data sources in an independent and scalable manner. Explains how NoSQL and KeySQL complement each other and serve different needs with respect to big data Shows the place of structured search in the internet evolution and describes its implementations including the real-time structured internet search

ElasticSearch Blueprints

2015-07-24 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Vineeth Mohan

Analytics ELK data data-engineering elasticsearch

Dive into search technology with "ElasticSearch Blueprints"! This is the perfect project-based guide to help you master Elasticsearch. You will learn how to build and design scalable, effective search solutions, improve search relevancy, manage data efficiently, perform analytics, and visualize your data in comprehensive ways. What this Book will help me do Build and fine-tune scalable search engine features with Elasticsearch. Design and implement accurate ecommerce search solutions using filters. Analyze and visualize data with Elasticsearch's powerful data aggregation capabilities. Increase search relevancy and enhance user query assistance using analyzers. Incorporate enhanced data organization methods, including parent-child relationships. Author(s) None Mohan is an experienced professional specializing in search technologies. With a strong technical background, they have engaged deeply with Elasticsearch, creating solutions that address practical challenges. Their approach focuses on making technical topics accessible, guiding readers step-by-step through projects. Who is it for? This book is tailored for data professionals, application developers, and enthusiasts eager to delve into search technologies. Whether you're beginning with Elasticsearch or aiming to refine your skills, this guide will advance your expertise. By working through practical cases, you'll gain confidence in using Elasticsearch effectively to meet diverse requirements.

Lucene 4 Cookbook

2015-06-26 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Vineeth Mohan , Edwood Ng

Java data data-engineering lucene

Lucene 4 Cookbook provides a comprehensive guide for developers looking to integrate Apache Lucene into their search applications. With over 70 hands-on recipes, this book takes you from the basics to advanced topics, enabling you to create effective and high-performing search solutions. What this Book will help me do Learn how to configure Lucene for optimal indexing and searching. Gain skills in analyzing text and implementing custom analyzers. Discover techniques for implementing near real-time search capabilities. Understand how to scale your search application to handle large datasets. Explore modular extensions to enhance and customize your search application. Author(s) Edwood Ng and Vineeth Mohan are seasoned developers with extensive experience in search technologies and open-source projects. Their practical experience with Apache Lucene is reflected in this cookbook, which focuses on real-world applications and challenges. They have a clear, engaging writing style that makes complex concepts accessible to developers. Who is it for? This book is for professional and aspiring software developers who are new to Apache Lucene but eager to explore its full capabilities. It assumes some working knowledge of Java programming, so readers should at least be familiar with basic programming concepts. It's ideal for those looking to add search functionalities to their applications effectively and efficiently.

Search and Foraging

2015-06-23 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Irad Ben-Gal , Eugene Kagan

data data-engineering

This book examines how to program artificial search agents so that they act optimally or demonstrate the same behavior as predicted by the foraging theory for living organisms. It discusses foraging theory as well as search and screening theory in the same mathematical and algorithmic framework. It presents an overview of the main ideas and methods of foraging and search theories, making the concepts of one theory accessible to specialists of the other. Numerical examples illustrate the application of both theories.

Apache Solr Search Patterns

2015-04-24 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Jayant Kumar

Analytics ELK data data-engineering solr

Master Elasticsearch as you uncover advanced Solr techniques in this professional guide. This book dives deeply into deploying and optimizing Solr-powered search engines and explores high-performance techniques. Learn to leverage your data with accessible, comprehensive, and practical insights. What this Book will help me do Learn to customize Solr's query scorer to provide tailored search results. Understand the internals of Solr, including indexing and query facilities, for better optimization. Implement scalable and reliable search clusters using SolrCloud. Explore the use of Solr for spatial, e-commerce, and advertising searches. Combine Solr with front-end technologies like AJAX and advanced tagging with FSTs. Author(s) Jayant Kumar, an experienced developer and search solutions architect, specializes in leveraging Apache Solr. With years of practical experience, he brings unique insights into scaling search platforms. His commitment to imparting clear, actionable knowledge is reflected in this focused resource. Who is it for? This book is ideal for software developers and architects embedded in the Solr ecosystem looking to enhance their expertise. If you are seeking to develop advanced and scalable solutions, master Solr's core capabilities, or improve your analytics and graph-generating skills, this book will support your goals.

Mastering Elasticsearch - Second Edition

2015-02-27 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Marek Rogozinski

ELK data data-engineering elasticsearch

Delve deeper into Elasticsearch in "Mastering Elasticsearch - Second Edition" to gain comprehensive insights into advanced querying, data indexing, and internal workings of Elasticsearch servers. With this book, you'll enhance your ability to implement powerful search solutions and optimize performance with confidence. What this Book will help me do Build advanced querying skills to utilize the Elasticsearch Query DSL effectively. Gain hands-on understanding of optimal data indexing for your Elasticsearch applications. Learn to improve user search experiences by tailoring Elasticsearch functionalities. Master Elasticsearch performance tuning and server optimization techniques. Develop custom Elasticsearch plugins to expand its core capabilities. Author(s) Marek Rogozinski, a seasoned Elasticsearch developer, brings years of professional expertise to this comprehensive guide. With a focus on practical and actionable knowledge, Marek has crafted this edition for users eager to deepen their Elasticsearch proficiency. His hands-on approach ensures you can apply the lessons directly and effectively. Who is it for? Ideal readers are those experienced with Elasticsearch, familiar with Query DSL and indexing techniques, and looking to expand their technical capabilities. Whether you're an Elasticsearch administrator, developer, or enthusiast, this book will enable you to master advanced topics and achieve your goals in search technology.

ElasticSearch Cookbook - Second Edition

2015-01-28 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Alberto Paro

Analytics Big Data Cloud Computing ELK Java JSON Python data data-engineering elasticsearch

The "ElasticSearch Cookbook - Second Edition" is a hands-on guide featuring over 130 advanced recipes to help you harness the power of ElasticSearch, a leading search and analytics engine. Through insightful examples and practical guidance, you'll learn to implement efficient search solutions, optimize queries, and manage ElasticSearch clusters effectively. What this Book will help me do Design and configure ElasticSearch topologies optimized for your specific deployment needs. Develop and utilize custom mappings to optimize your data indexes. Execute advanced queries and filters to refine and retrieve search results effectively. Set up and monitor ElasticSearch clusters for optimal performance. Extend ElasticSearch capabilities through plugin development and integrations using Java and Python. Author(s) Alberto Paro is a technology expert with years of experience working with ElasticSearch, Big Data solutions, and scalable cloud architecture. He has authored multiple books and technical articles on ElasticSearch, leveraging his extensive knowledge to provide practical insights. His approachable and detail-oriented style makes complex concepts accessible to technical professionals. Who is it for? This book is best suited for software developers and IT professionals looking to use ElasticSearch in their projects. Readers should be familiar with JSON, as well as basic programming skills in Java. It is ideal for those who have an understanding of search applications and want to deepen their expertise. Whether you're integrating ElasticSearch into a web application or optimizing your system's search capabilities, this book will provide the skills and knowledge you need.

Elasticsearch: The Definitive Guide

2015-01-28 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Zachary Tong , Clinton Gormley

Analytics ELK data data-engineering elasticsearch

Whether you need full-text search or real-time analytics of structured data—or both—the Elasticsearch distributed search engine is an ideal way to put your data to work. This practical guide not only shows you how to search, analyze, and explore data with Elasticsearch, but also helps you deal with the complexities of human language, geolocation, and relationships.

Solr Cookbook - Third Edition - Third Edition

2015-01-23 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Rafal Kuc

data data-engineering solr

Master Apache Solr with the comprehensive 'Solr Cookbook - Third Edition', which introduces over 100 practical recipes to help you exploit the full potential of Apache Solr versions 4.x to 5. By following this book, you'll gain actionable insights and solutions to solve real-world problems effectively with Solr. What this Book will help me do Effectively index data from various sources and formats into Solr for optimized searches. Utilize and configure faceting to enhance aggregated data insights. Implement and configure SolrCloud for scalable and robust search infrastructures. Identify and resolve performance bottlenecks in Solr and Solr clusters. Develop and deploy advanced query features like autocomplete and document highlighting. Author(s) Rafal Kuc is a seasoned software architect with years of experience working with Apache Solr in production environments. He specializes in search technologies, distributed systems, and empowering developers with actionable knowledge. Rafal approaches writing with a practical mindset, focusing on how to solve real-world challenges efficiently. Who is it for? This book is ideal for intermediate Solr developers, system architects, or IT professionals responsible for search systems. It assumes a basic familiarity with Solr but provides deep dives into advanced functionalities and configurations. Readers looking to enhance their understanding of Solr 4.x and 5.x capabilities will find this book valuable. Whether you're improving search performance or exploring new Solr features, this book guides you step-by-step.

Fundamentals of Database Indexing and Searching

2014-12-02 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Arnab Bhattacharya

Computer Science data data-engineering

Fundamentals of Database Indexing and Searching presents well-known database searching and indexing techniques. It focuses on similarity search queries, showing how to use distance functions to measure the notion of dissimilarity. After defining database queries and similarity search queries, the book organizes the most common and representative index structures according to their characteristics. The author first describes low-dimensional index structures, memory-based index structures, and hierarchical disk-based index structures. He then outlines useful distance measures and index structures that use the distance information to efficiently solve similarity search queries. Focusing on the difficult dimensionality phenomenon, he also presents several indexing methods that specifically deal with high-dimensional spaces. In addition, the book covers data reduction techniques, including embedding, various data transforms, and histograms. Through numerous real-world examples, this book explores how to effectively index and search for information in large collections of data. Requiring only a basic computer science background, it is accessible to practitioners and advanced undergraduate students.

Search: How the Data Explosion Makes Us Smarter

2014-11-04 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Stefan Weitz

data data-engineering

Search is as old as language. We've always needed to find something in the jumble of human creation. The first web was nothing more than passing verbal histories down the generations so others could find and remember how not to get eaten; the first search used the power of written language to build simple indexes in printed books, leading to the Dewey Decimal system and reverse indices in more modern times. Then digital happened. Besides having profound societal impacts, it also made the act of searching almost impossibly complex for both engines and searchers. Information isn’t just words; it is pictures, videos, thoughts tagged with geocode data, routes, physical world data, and, increasingly, the machines themselves reporting their condition and listening to others’. Search: How the Data Explosion Makes Us Smarter, the first in the Insight Labs Library, holds up a mirror to our time to see if search can keep up. Author Stefan Weitz explores the idea of access to help readers understand how we are inventing new ways to search and access data through devices in more places and with more capabilities. We are at the cusp of imbuing our generation with superpowers, but only if we fundamentally rethink what search is, how people can use it, and what we should demand of it.

Understanding Large Temporal Networks and Spatial Networks: Exploration, Pattern Searching, Visualization and Network Evolution

2014-11-03 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Natasa Kejzar , Vladimir Batagelj , Anuska Ferligoj , Patrick Doreian

data data-engineering

This book explores social mechanisms that drive network change and link them to computationally sound models of changing structure to detect patterns. This text identifies the social processes generating these networks and how networks have evolved.

Data Classification

2014-07-25 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Charu C. Aggarwal

AI/ML data data-engineering

Research on the problem of classification tends to be fragmented across such areas as pattern recognition, database, data mining, and machine learning. Addressing the work of these different communities in a unified way, this book explores the underlying algorithms of classification as well as applications of classification in a variety of problem domains, including text, multimedia, social network, and biological data. It presents core methods in data classification, covers recent problem domains, and discusses advanced methods for enhancing the quality of the underlying classification results.

Scaling Apache Solr

2014-07-25 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Hrishikesh Vijay Karambelkar

Cloud Computing data data-engineering solr

Become an expert in implementing high-performance, scalable search solutions with Apache Solr in 'Scaling Apache Solr'. This detailed guide teaches you how to architect and manage top-tier search functionalities tailored for different enterprise environments. What this Book will help me do Understand the Apache Solr ecosystem and its core functionality. Apply techniques for scaling and optimizing search for enterprise environments. Implement sharding, replication, and fault tolerance for robust searches. Integrate Solr with various systems and infrastructure to enhance capability. Optimize data indexing and retrieval for high-performance applications. Author(s) Vijay Karambelkar is an experienced software architect with extensive expertise in search technologies, including Solr and Lucene. He has worked on numerous enterprise applications where scalable and efficient search was critical. Vijay's writing is informed by his real-world implementations and is structured to provide practical knowledge to help readers tackle similar challenges. Who is it for? This book is ideal for software developers, architects, and IT professionals who manage or create enterprise search solutions. It's suitable for readers with basic programming knowledge but no experience with Apache Solr. This detailed guide will also benefit those looking to improve performance and scalability in their applications using cutting-edge technology. If scalability, integration, and cloud search solutions are topics you want to master, this book is tailored for you.

Solr in Action

2014-03-25 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Trey Grainger (Searchkernel) , Timothy Potter

Analytics Big Data Data Analytics Java NoSQL data data-engineering solr

Solr in Action is a comprehensive guide to implementing scalable search using Apache Solr. This clearly written book walks you through well-documented examples ranging from basic keyword searching to scaling a system for billions of documents and queries. It will give you a deep understanding of how to implement core Solr capabilities. About the Technology About the Book Whether you're handling big (or small) data, managing documents, or building a website, it is important to be able to quickly search through your content and discover meaning in it. Apache Solr is your tool: a ready-to-deploy, Lucene-based, open source, full-text search engine. Solr can scale across many servers to enable real-time queries and data analytics across billions of documents. Solr in Action teaches you to implement scalable search using Apache Solr. This easy-to-read guide balances conceptual discussions with practical examples to show you how to implement all of Solr's core capabilities. You'll master topics like text analysis, faceted search, hit highlighting, result grouping, query suggestions, multilingual search, advanced geospatial and data operations, and relevancy tuning. What's Inside How to scale Solr for big data Rich real-world examples Solr as a NoSQL data store Advanced multilingual, data, and relevancy tricks Coverage of versions through Solr 4.7 About the Reader This book assumes basic knowledge of Java and standard database technology. No prior knowledge of Solr or Lucene is required. About the Authors Trey Grainger is a director of engineering at CareerBuilder. Timothy Potter is a senior member of the engineering team at LucidWorks. The authors work on the scalability and reliability of Solr, as well as on recommendation engine and big data analytics technologies. Quotes The knowledge and techniques you need. - From the Foreword by Yonik Seeley, Creator of Solr Readable and immediately applicable ... an excellent book. - John Viviano, InterCorp, Inc. The go-to guide for Solr ... a definitive resource for both beginners and experts. - Scott Anthony, Business Instruments A well-dosed combination of deep technical knowledge and real-world experience. - Alexandre Madurell, Piksel, Inc.

Relevance Ranking for Vertical Search Engines

2014-01-25 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Bo Long , Yi Chang

data data-engineering

In plain, uncomplicated language, and using detailed examples to explain the key concepts, models, and algorithms in vertical search ranking, Relevance Ranking for Vertical Search Engines teaches readers how to manipulate ranking algorithms to achieve better results in real-world applications. This reference book for professionals covers concepts and theories from the fundamental to the advanced, such as relevance, query intention, location-based relevance ranking, and cross-property ranking. It covers the most recent developments in vertical search ranking applications, such as freshness-based relevance theory for new search applications, location-based relevance theory for local search applications, and cross-property ranking theory for applications involving multiple verticals. Foreword by Ron Brachman, Chief Scientist and Head, Yahoo! Labs Introduces ranking algorithms and teaches readers how to manipulate ranking algorithms for the best results Covers concepts and theories from the fundamental to the advanced Discusses the state of the art: development of theories and practices in vertical search ranking applications Includes detailed examples, case studies and real-world situations

Find Out Anything From Anyone, Anytime

2014-01-20 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Karinch Maryann , Pyle James

data data-engineering

With his style of questioning alone, Jim Pyle can get more information than most other interrogators using multiple techniques. Gregory Hartley, coauthor of the best-seller How to Spot a Liar The secret to finding out anything you want to know is amazingly simple: Ask good questions.

Hidden Visual Studio LightSwitch: Secrets from the Real World for Creating Great Apps

2013-03-27 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Alessandro Del Sole

Microsoft SQL data data-engineering lucene

This eBook offers practical tips and tricks as well as useful guidance on how to implement common features in LightSwitch, such as those for working with documents, business analysis, screen customization, optimal server configuration, usage with databases other than SQL Server, and so on. What you can expect to find is solutions for everyday problems, with suggestions on how to implement requirements that are very common in any business application, especially for running across distributed networks in the enterprise. In summary, what you’ll find in this eBook is how to solve problems you will face in the real world. This eBook is intended for developers who have at least basic knowledge of Microsoft Visual Studio LightSwitch, as well as some experience in creating, configuring, and publishing applications.

ElasticSearch Server

2013-02-21 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Rafal Kuc , Marek Rogozinski

Analytics ELK data data-engineering elasticsearch

ElasticSearch Server is an excellent resource for mastering the ElasticSearch open-source search engine. This book takes you through practical steps to implement, configure, and optimize search capabilities, suitable for various data sets and applications, making faster and more accurate search outcomes accessible. What this Book will help me do Understand the core concepts of ElasticSearch, including data indexing, dynamic mapping, and search analysis. Develop practical skills in writing queries and filters to retrieve precise and relevant results. Learn to set up and efficiently manage ElasticSearch clusters for scalability and real-time performance. Implement advanced ElasticSearch functions like autocompletion, faceting, and geo-search. Utilize optimization techniques for cluster monitoring, health-checks, and tuning for reliable performance. Author(s) The authors of ElasticSearch Server are industry professionals with extensive experience in search technologies and system architecture. They have contributed to multiple tools and publications in the field of data search and analytics. Their writing aims to distill complex technical concepts into practical knowledge, making it valuable for readers from all backgrounds. Who is it for? This book is perfect for developers, system architects, and IT professionals seeking a robust and scalable search solution for their projects. Whether you're new to ElasticSearch or looking to deepen your expertise, this book will serve as a practical guide to implement ElasticSearch effectively. The only prerequisites are a basic understanding of databases and general query concepts, so prior search server knowledge is not required.

Designing the Search Experience

2012-12-31 · O'Reilly Data Engineering Books O'Reilly Amazon

book

by Tyler Tate , Tony Russell-Rose

data data-engineering

Search is not just a box and ten blue links. Search is a journey: an exploration where what we encounter along the way changes what we seek. But in order to guide people along this journey, designers must understand both the art and science of search.In Designing the Search Experience, authors Tony Russell-Rose and Tyler Tate weave together the theories of information seeking with the practice of user interface design. Understand how people search, and how the concepts of information seeking, information foraging, and sensemaking underpin the search process Apply the principles of user-centered design to the search box, search results, faceted navigation, mobile interfaces, social search, and much more Design the cross-channel search experiences of tomorrow that span desktop, tablet, mobile, and other devices

talk-data.com

Activity Trend

Top Events

Top Speakers

Structured Search for Big Data

ElasticSearch Blueprints

Lucene 4 Cookbook

Search and Foraging

Apache Solr Search Patterns

Mastering Elasticsearch - Second Edition

ElasticSearch Cookbook - Second Edition

Elasticsearch: The Definitive Guide

Solr Cookbook - Third Edition - Third Edition

Fundamentals of Database Indexing and Searching

Search: How the Data Explosion Makes Us Smarter

Understanding Large Temporal Networks and Spatial Networks: Exploration, Pattern Searching, Visualization and Network Evolution

Data Classification

Scaling Apache Solr

Solr in Action

Relevance Ranking for Vertical Search Engines

Find Out Anything From Anyone, Anytime

Hidden Visual Studio LightSwitch: Secrets from the Real World for Creating Great Apps

ElasticSearch Server

Designing the Search Experience