Every sprint consumed by fixing parsers is a sprint spent not shipping product- brittle parsing kills velocity. This workshop is about retiring that cycle so you can move from messy, unstructured inputs to production-ready data in seconds. bem ingests and transforms any unstructured input at any volume — PDFs, emails, Excel, Word, CSV, text, JSON, images (PNG, JPEG, HEIC, HEIF, WebP), HTML, and audio (WAV, MP3, M4A) — into clean JSON instantly via API. With primitives like Transform, Join, Split, Route, and Analyze, you define the exact workflow your product needs. Built-in Evals measure + enforce accuracy automatically so quality doesn’t drop as you scale. Flow outputs straight into MotherDuck so you can go from chaos to query without manual cleanup — and your team can focus on shipping, not scraping.
talk-data.com
U
Speaker
Upal Saha
1
talks
Co-Founder & CTO
bem
Upal is Co-Founder + CTO of bem, where the team turns unstructured inputs into workflows + product for startups to Fortune 100 orgs.
He’s spent his entire career in data processing and automation, previously building Silo's backend and food traceability data systems from the ground up and leading enterprise data integrations at Yext.
He also harbors a deeply held dislike for feta cheese.
Bio from: Small Data SF 2025
Filter by Event / Source
Talks & appearances
1 activities · Newest first