talk-data.com talk-data.com

Angela Chu

Speaker

Angela Chu

1

talks

Lead Solutions Architect Databricks

Angela Chu has turned data into information for more than 25 years. The last three have been spent as a Solutions Architect and Streaming Subject Matter Expert at Databricks. She enjoys learning about different technologies and solving complex problems with it, and sharing her learnings with her customers and peers. When not having fun with technology, she is spending time with her family and traveling to different countries so her kids can experience amazing cultures from around the world.

Bio from: Databricks DATA + AI Summit 2023

Filtering by: Databricks DATA + AI Summit 2023 ×

Filter by Event / Source

Talks & appearances

Showing 1 of 2 activities

Search activities →
Structured Streaming: Demystifying Arbitrary Stateful Operations

Let’s face it -- data is messy. And your company’s business requirements? Even messier. You’re staring at your screen, knowing there is a tool that will let you give your business partners the information they need as quickly as they need it. There’s even a Python version of it now. But…it looks kind of scary. You’ve never used it before, and you don’t know where to start. Yes, we’re talking about the dreaded flatMapGroupsWithState. But fear not - we’ve got you covered.

In this session, we’ll take a real-word use case and use it to show you how to break down flatMapGroupsWithState into its basic building blocks. We’ll explain each piece in both Scala and the newly-released Python, and at the end we’ll illustrate how it all comes together to enable the implementation of arbitrary stateful operations with Spark Structured Streaming.

Talk by: Angela Chu

Connect with us: Website: https://databricks.com Twitter: https://twitter.com/databricks LinkedIn: https://www.linkedin.com/company/databricks Instagram: https://www.instagram.com/databricksinc Facebook: https://www.facebook.com/databricksinc