Generative AI initiatives require new data pipelines that prepare text files for querying by language models. Data engineers, scientists, and other stakeholders collaborate to design and implement these pipelines, which span text sources, tokens, vectors, vector databases, and LMs. Published at: https://www.eckerson.com/articles/the-new-data-pipeline-for-generative-ai-where-and-how-it-works
talk-data.com
Topic
Vector DB
ai
2
tagged
Activity Trend
10
peak/qtr
2020-Q1
2026-Q1
Top Events
DATA MINER Big Data Europe Conference 2020
10
Data Engineering Podcast
10
Data + AI Summit 2025
7
DataFramed
7
AWS re:Invent 2024
4
Databricks DATA + AI Summit 2023
3
DataTalks.Club
3
Google Cloud Next '24
3
Generative AI and Intelligent Agents Virtual Summit
2
PyConDE & PyData Berlin 2023
2
Secrets of Data Analytics Leaders
2
Google Cloud Next '25
2
Filtering by:
Secrets of Data Analytics Leaders
×
Embeddings are a learned way of representing data in space. Vector databases make it easier to work with embeddings generated from deep learning models. They will become an essential tool in the AI stack because they reduce the time to structure data and train models. Published at: https://www.eckerson.com/articles/the-why-what-who-and-where-of-vector-databases