Apache Arrow has become a de facto standard for efficient in-memory columnar data representation. Beyond the standardized, language-independent columnar memory format for tabular data, the Apache Arrow project also maintains a growing set of supplementary specifications and language implementations. This talk will give an overview of recent developments in the Apache Arrow ecosystem, including ADBC, nanoarrow, new data types, and the Arrow PyCapsule protocol.
Filtering by:
Joris Van den Bossche
Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing, and is becoming the de facto standard for tabular data. This talk will give an overview of recent developments in Apache Arrow itself, of how it is being adopted in the PyData ecosystem (and beyond), and of how it can improve your day-to-day data analytics workflows.