Topic

Scala

programming_language functional_programming jvm

Activities: 2 tagged

Activity Trend: peak of 12 per quarter (2020-Q1 to 2026-Q2)

Top Events

O'Reilly Data Engineering Books 34
Data Engineering Podcast 33
Databricks DATA + AI Summit 2023 11
ADSP: Algorithms + Data Structures = Programs 3
Scala Talks: Hands-On Capture Checking & Scala Native live-coding ☀️ 2
Scala Talks: A deep dive into streaming with fs2 & Scala Meets GenAI 2
Data + AI Summit 2025 2
Scala Talks: Tour of error handling & Functional Programming at Huge Companies 2
Women in Scala: From Paradigms to Percussion & Hands On with Creative Scala 2
Meetup Paris Scala User Group (PSUG) – Hébergé par DataDome! 2
Scala Talks: Write a book about Scala during Covid & AI tooling for developers 2
DataDome x PSUG #116 : My First Year in Scala! + TBA 2

Top Speakers

Tobias Macey 33
Holden Karau (Fight Health Insurance) 3
Conor Hoekstra 3
Bryce Adelstein Lelbach (NVIDIA) 3
Raúl Estrada 2
Zainab Ali (London Scala User Group) 2
Josh Wills 2
Sourav Gulati (Databricks) 2
Mohammed Guller 2
Romeo Kienzler 2
Sandy Ryza (Databricks) 2
Sean Owen (Databricks) 2

Activities

Filtering by: Data + AI Summit 2025

What’s New in Apache Spark™ 4.0?

2025-06-12 · Data + AI Summit 2025

talk

by Daniel Tenedorio (Databricks), Wenchen Fan (Databricks)

AI/ML API Java Python Spark SQL Data Streaming

Join this session for a concise tour of Apache Spark™ 4.0’s most notable enhancements:

SQL features: ANSI by default, scripting, SQL pipe syntax, SQL UDFs, session variables, view schema evolution, etc.
Data types: VARIANT type, string collation
Python features: Python data source, plotting API, etc.
Streaming improvements: state store data source, state store checkpoint v2, arbitrary state v2, etc.
Spark Connect improvements: more API coverage, thin client, unified Scala interface, etc.
Infrastructure: better error messages, structured logging, new Java/Scala version support, etc.

Whether you’re a seasoned Spark user or new to the ecosystem, this talk will prepare you to leverage Spark 4.0’s latest innovations for modern data and AI pipelines.

Breaking Barriers: Building Custom Spark 4.0 Data Connectors with Python

2025-06-11 · Data + AI Summit 2025

talk

by Sourav Gulati (Databricks), Ashish Saraswat (Databricks)

API Java Python Spark Data Streaming

Building a custom Spark data source connector once required Java or Scala expertise, making it complex and limiting. This left many proprietary data sources without public SDKs disconnected from Spark. Additionally, data sources with Python SDKs couldn't harness Spark’s distributed power. Spark 4.0 changes this with a new Python API for data source connectors, allowing developers to build fully functional connectors without Java or Scala. This unlocks new possibilities, from integrating proprietary systems to leveraging untapped data sources. Supporting both batch and streaming, this API makes data ingestion more flexible than ever. In this talk, we’ll demonstrate how to build a Spark connector for Excel using Python, showcasing schema inference, data reads/writes and streaming support. Whether you're a data engineer or Spark enthusiast, you’ll gain the knowledge to integrate Spark with any data source — entirely in Python.
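For readers unfamiliar with the Python data source API mentioned in the abstract, the following is a minimal sketch of a batch connector. It is not the Excel connector from the talk: the `squares` source, the `rows` option, and the class names are hypothetical and chosen only for illustration. The `try`/`except` fallback is also an assumption added here so the classes can be read and exercised without PySpark installed; in a real Spark 4.0 environment only the import is needed.

```python
try:
    from pyspark.sql.datasource import DataSource, DataSourceReader
except ImportError:
    # Assumption for illustration: fall back to plain base classes so the
    # sketch can be tried without a PySpark 4.0 installation.
    DataSource = DataSourceReader = object


class SquaresReader(DataSourceReader):
    """Hypothetical reader that produces rows (n, n * n) for n in [0, rows)."""

    def __init__(self, options):
        # Options arrive as strings, so convert explicitly.
        self.rows = int(options.get("rows", 3))

    def read(self, partition):
        # Each yielded tuple becomes one row matching the declared schema.
        for n in range(self.rows):
            yield (n, n * n)


class SquaresDataSource(DataSource):
    """Hypothetical data source exposed under the short name 'squares'."""

    @classmethod
    def name(cls):
        return "squares"

    def schema(self):
        # Schema given as a DDL string.
        return "n int, square int"

    def reader(self, schema):
        # self.options is populated by the PySpark DataSource base class.
        return SquaresReader(self.options)
```

With an active Spark 4.0 session, such a class would typically be registered with `spark.dataSource.register(SquaresDataSource)` and then read with `spark.read.format("squares").option("rows", 4).load()`. Streaming and write support, which the talk also covers, are omitted from this sketch.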