Jigar Bhati

Member of Technical Staff, OpenAI

Jigar Bhati has 10+ years of professional software development experience at leading technology companies such as OpenAI, Twitter, and Samsung, working on large-scale distributed systems. At OpenAI, he drives AI innovation by designing real-time infrastructure and streaming systems that process millions of events per second. His contributions have been instrumental in launching new products, enhancing safety, and deploying advanced AI capabilities. Previously at Twitter, he played a lead role in developing and scaling Manhattan, a distributed NoSQL database, and productionizing CockroachDB (NewSQL). Together, these systems handled millions of user requests per second, ensuring high availability and platform stability.

Bio from: Data + AI Summit 2025

Talks & appearances

Kafka Forwarder: Simplifying Kafka Consumption at OpenAI

2025-06-10 · Data + AI Summit 2025

talk

Databricks Kafka LLM Data Streaming

At OpenAI, Kafka fuels real-time data streaming at massive scale, but traditional consumers struggle under the burden of partition management, offset tracking, error handling, retries, Dead Letter Queues (DLQ), and dynamic scaling — all while racing to maintain ultra-high throughput. As deployments scale, complexity multiplies. Enter Kafka Forwarder — a game-changing Kafka Consumer Proxy that flips the script on traditional Kafka consumption. By offloading client-side complexity and pushing messages to consumers, it ensures at-least-once delivery, automated retries, and seamless DLQ management via Databricks. The result? Scalable, reliable and effortless Kafka consumption that lets teams focus on what truly matters. Curious how OpenAI simplified self-service, high-scale Kafka consumption? Join us as we walk through the motivation, architecture and challenges behind Kafka Forwarder, and share how we structured the pipeline to seamlessly route DLQ data into Databricks for analysis.
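
The push-with-retries-then-DLQ flow described in the abstract can be illustrated with a minimal sketch. This is not OpenAI's implementation and it uses no Kafka client: `Message`, `ForwardResult`, `forward`, `flaky_push`, and the `max_attempts` parameter are all hypothetical names chosen for illustration. It assumes a proxy that pushes each message to a consumer callback, retries on failure, routes exhausted messages to a dead-letter list, and advances the committed offset only after a message has been delivered or dead-lettered, which is what gives at-least-once semantics.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Iterable, List, Tuple


@dataclass
class Message:
    offset: int
    payload: str


@dataclass
class ForwardResult:
    committed_offset: int = -1  # last offset that is safe to commit
    delivered: List[int] = field(default_factory=list)
    dead_lettered: List[Tuple[int, str]] = field(default_factory=list)


def forward(
    messages: Iterable[Message],
    push: Callable[[Message], None],
    max_attempts: int = 3,
) -> ForwardResult:
    """Push each message to the consumer, retrying before dead-lettering.

    Hypothetical sketch: a real proxy would read from Kafka, push over the
    network, and write dead letters to a durable DLQ topic or table.
    """
    result = ForwardResult()
    for msg in messages:
        last_error = ""
        for _ in range(max_attempts):
            try:
                push(msg)
                result.delivered.append(msg.offset)
                break
            except Exception as exc:  # any consumer failure triggers a retry
                last_error = str(exc)
        else:
            # All attempts failed: route to the dead-letter queue.
            result.dead_lettered.append((msg.offset, last_error))
        # Advance the offset only once the message is handled either way,
        # so a crash mid-message causes redelivery rather than loss.
        result.committed_offset = msg.offset
    return result


# Usage: offset 1 fails once and then succeeds; offset 2 always fails.
attempts: Dict[int, int] = {}


def flaky_push(msg: Message) -> None:
    attempts[msg.offset] = attempts.get(msg.offset, 0) + 1
    if msg.offset == 1 and attempts[1] < 2:
        raise RuntimeError("transient failure")
    if msg.offset == 2:
        raise RuntimeError("permanent failure")


result = forward(
    [Message(0, "a"), Message(1, "b"), Message(2, "c")],
    flaky_push,
)
```

Because the offset is advanced only after the push outcome is known, a consumer may see the same message more than once after a restart, so consumer handlers in a design like this would need to be idempotent.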