talk-data.com talk-data.com

PyData talk 2024-09-25 at 12:25

The expanding Apache Arrow universe - standardizing and accelerating tabular data access and interchange

Topics

Description

Apache Arrow has become a de-facto standard for efficient in-memory columnar data representation. Beyond the standardized and language-independent columnar memory format for tabular data, the Apache Arrow project also has a growing set of supplementary specifications and language implementations. This talk will give an overview of the recent developments in the Apache Arrow ecosystem, including ADBC, nanoarrow, new data types, and the Arrow PyCapsule protocol.