talk-data.com

Angela Chu

Speaker

2 talks

Lead Solutions Architect, Databricks

Angela Chu has turned data into information for more than 25 years. The last three have been spent as a Solutions Architect and Streaming Subject Matter Expert at Databricks. She enjoys learning about different technologies, solving complex problems with them, and sharing what she learns with her customers and peers. When not having fun with technology, she spends time with her family and travels to different countries so her kids can experience amazing cultures from around the world.

Bio from: Databricks DATA + AI Summit 2023

Talks & appearances

2 activities · Newest first

Unlock Your Use Cases: A Deep Dive on Structured Streaming’s New TransformWithState API

Don’t you just hate telling your customers “No”? “No, I can’t get you the data that quickly” or “No, that logic isn’t possible to implement” really aren’t fun to say. But what if you had a tool that would allow you to implement those use cases? What if it was in a technology you were already familiar with, say, Spark Structured Streaming? There is a brand-new arbitrary stateful operations API called TransformWithState, and after attending this deep dive you won’t have to say “No” anymore. During this presentation we’ll go through some real-world use cases and build them step by step. We’ll cover everything from state variables, processing vs. event time, watermarks, timers, and state TTL to how you can initialize state with the checkpoint of another stream. Unlock your use cases with the power of Structured Streaming’s TransformWithState!

Structured Streaming: Demystifying Arbitrary Stateful Operations

Let’s face it -- data is messy. And your company’s business requirements? Even messier. You’re staring at your screen, knowing there is a tool that will let you give your business partners the information they need as quickly as they need it. There’s even a Python version of it now. But…it looks kind of scary. You’ve never used it before, and you don’t know where to start. Yes, we’re talking about the dreaded flatMapGroupsWithState. But fear not - we’ve got you covered.

In this session, we’ll take a real-world use case and use it to show you how to break down flatMapGroupsWithState into its basic building blocks. We’ll explain each piece in both Scala and the newly released Python version, and at the end we’ll illustrate how it all comes together to enable the implementation of arbitrary stateful operations with Spark Structured Streaming.

Talk by: Angela Chu
