talk-data.com talk-data.com

Filter by Source

Select conferences and events

People (237 results)

See all 237 →

Activities & events

Title & Speakers Event

Join us at our first event of the year at The Information Lab on the historic Watling Street in the City of London 🙌 We will be kicking off 2026 by delving into the topic of real-time data with our speakers - Sam, Nicoleta & Anton. We are running this event in collaboration with Confluent.

6pm: Doors Open

6:30pm: Talks Start

🗣️The Speakers🗣️

Load-In to Lights-Out: Data Engineering the World's Biggest Tours and Live Events Sam Malcolm, Head of Architecture & Engineering at Centrus (Sam's Linkedin) Sam’s session dives into lessons from large-scale live event data systems—handling over 10 billion data points per second for global tours like Beyoncé, Coldplay, and Glastonbury. He connects the extreme demands of real-time analytics and high-performance networking to modern cloud data practices, showing how the same principles of speed, resilience, and precision apply when designing reliable, scalable data platforms today.

Should I Stream or Should I Join: From Regular to Delta Joins in Apache Flink Nicoleta Lazar, Senior Data Engineer at Fresha & Anton Borisov, Principal Engineer at Fresha (Niloceta's LinkedIn , Anton's LinkedIn) Joins in the streaming world are where the fun stops and the tradeoffs start. State that grows forever, latency that spikes unpredictably, watermarks that never quite behave, every Flink developer has war stories about this. In this session, Anton Borisov and Nicoleta Lazar break down the join landscape in Apache Flink:

→ Regular joins and the state explosion problem → Interval joins: when they work, when they don't → Temporal joins and the versioned table dance → Lookup joins: the escape hatch and its hidden costs → Delta joins: the new kid and how Fluss enables them, and why it matters

Talks finish by 8pm and there will be a break between the talks. Afterwards, we may head to a pub to continue chatting.

You can sign up by subscribing to this event

🚨IMPORTANT: Please bring a valid form of ID.

See you all on the 22nd January 🤩

Happy Networking 🍻

Checkout Meetup Groups run by Confluent:

  • https://www.meetup.com/london-real-time-data-meetup/
  • https://www.meetup.com/data-streaming-europe/

By attending this event, you agree to abide by our rules of conduct:

  • Respect others' opinions.
  • Keep it appropriate - no harassment of any sort.
  • If you see something or have a complaint, please reach out to one of the organisers or email [email protected].
Data Engineers London: Real Time Data - January 2026

IMPORTANT: PLEASE RSVP @ https://www.meetup.com/data-engineers-london/events/312450363/

Details 6pm: Doors Open

6:30pm: Talks Start

🗣️The Speakers🗣️

Load-In to Lights-Out: Data Engineering the World's Biggest Tours and Live Events Sam Malcolm, Head of Architecture & Engineering at Centrus (Sam's Linkedin) Sam’s session dives into lessons from large-scale live event data systems—handling over 10 billion data points per second for global tours like Beyoncé, Coldplay, and Glastonbury. He connects the extreme demands of real-time analytics and high-performance networking to modern cloud data practices, showing how the same principles of speed, resilience, and precision apply when designing reliable, scalable data platforms today.

Should I Stream or Should I Join: From Regular to Delta Joins in Apache Flink Nicoleta Lazar, Senior Data Engineer at Fresha & Anton Borisov, Principal Engineer at Fresha (Niloceta's LinkedIn , Anton's LinkedIn) Joins in the streaming world are where the fun stops and the tradeoffs start. State that grows forever, latency that spikes unpredictably, watermarks that never quite behave, every Flink developer has war stories about this. In this session, Anton Borisov and Nicoleta Lazar break down the join landscape in Apache Flink:

→ Regular joins and the state explosion problem → Interval joins: when they work, when they don't → Temporal joins and the versioned table dance → Lookup joins: the escape hatch and its hidden costs → Delta joins: the new kid and how Fluss enables them, and why it matters

Talks finish by 8pm and there will be a break between the talks. Afterwards, we may head to a pub to continue chatting.

*** If you are interested in speaking at or hosting a meetup, please reach out to [email protected]

Data Engineers London: Real Time Data - January 2026

IMPORTANT: PLEASE RSVP @ https://www.meetup.com/data-engineers-london/events/312450363/

Details 6pm: Doors Open

6:30pm: Talks Start

🗣️The Speakers🗣️

Load-In to Lights-Out: Data Engineering the World's Biggest Tours and Live Events Sam Malcolm, Head of Architecture & Engineering at Centrus (Sam's Linkedin) Sam’s session dives into lessons from large-scale live event data systems—handling over 10 billion data points per second for global tours like Beyoncé, Coldplay, and Glastonbury. He connects the extreme demands of real-time analytics and high-performance networking to modern cloud data practices, showing how the same principles of speed, resilience, and precision apply when designing reliable, scalable data platforms today.

Should I Stream or Should I Join: From Regular to Delta Joins in Apache Flink Nicoleta Lazar, Senior Data Engineer at Fresha & Anton Borisov, Principal Engineer at Fresha (Niloceta's LinkedIn , Anton's LinkedIn) Joins in the streaming world are where the fun stops and the tradeoffs start. State that grows forever, latency that spikes unpredictably, watermarks that never quite behave, every Flink developer has war stories about this. In this session, Anton Borisov and Nicoleta Lazar break down the join landscape in Apache Flink:

→ Regular joins and the state explosion problem → Interval joins: when they work, when they don't → Temporal joins and the versioned table dance → Lookup joins: the escape hatch and its hidden costs → Delta joins: the new kid and how Fluss enables them, and why it matters

Talks finish by 8pm and there will be a break between the talks. Afterwards, we may head to a pub to continue chatting.

*** If you are interested in speaking at or hosting a meetup, please reach out to [email protected]

Data Engineers London: Real Time Data - January 2026
Martha Scheffler – Data Engineer @ Qarma

Discover how Qarma built a resilient data platform architecture using blue-green deployment strategies powered by Snowflake zero-copy clones and dbt macros. This talk also dives into orchestrating multiple dbt projects with Kestra—from serial execution for ingestion pipelines to parallel processing for scalable analytics delivery.

dbt Snowflake Kestra
Thomas Schrum Nicolet – Platform Engineer @ Velux

Learn how Velux is building a scalable data mesh from the ground up using dbt to design and model data products in Snowflake. This session explores how dbt serves as the single source of truth, and how the team uses custom and enhanced macros to simplify data engineering workflows and accelerate data delivery.

dbt Snowflake data mesh
Olga Surkova – Senior Data Analyst @ Keepit

Refactoring a large dbt project isn’t just about renaming models – it’s about understanding what went wrong and redesigning for long-term scalability. This talk covers DAG restructuring, Jinja pattern improvements, and validation checks.

dbt

This session explores how behavioral design can be effectively delivered through AI-powered platforms to create real-time, personalized journeys at scale.

Join our Behavioral Analyst, Jessica Nicole, as she unpacks how tools like the ecosystem.Ai Prediction Platform enable dynamic experimentation and optimization, using methods such as multi-armed bandits and timely triggers to deliver interactions tailored to individual behavior.

We’ll examine real-world case studies across industries and show how frameworks like EAST and the Make-It Toolkit make it possible to automate proven behavioral mechanisms without needing a full UX team.

Understanding Behavioral Design for AI Systems
Data Meets Art 2025-11-20 · 14:22
Nathalie Miebach – data artist and Artist-in-Residence @ School of Data Science, University of Virginia , Alex Gates – Assistant Professor of Data Science @ University of Virginia

Here we explore the intersections of data, art, and storytelling. Our guest, Nathalie Miebach, is an internationally-recognized data artist and the School of Data Science’s inaugural Artist-in-Residence.

Using materials like reed and paper, she transforms complex datasets into woven sculptures and musical scores, inviting us to view and even hear data in new ways. Joining her is Alex Gates, assistant professor of data science at the University of Virginia research examines how patterns of connection shape creativity, innovation, and discovery.

Together, they discuss what happens when data meets art.

Chapters (00:00:01) - Data Points: When Art Meets Science(00:00:46) - Ian and Nicole: Introduction(00:06:18) - How Stories Get Made(00:09:59) - Basket Weaving Visualizing Data(00:20:33) - Wonders of the World(00:25:47) - Data and Artist Residency(00:27:50) - Breaking Habits in Creativity(00:30:06) - What is Data Science: Craftsmanship?(00:34:50) - How Art Affects Our Understanding of Data

Data Science
UVA Data Points
Jeff Teper , Michael DiPaolo @ Bristol Myers Squibb , Sumit Chauhan @ Microsoft , Nicole Herskowitz @ Microsoft

Frontier Firms aren’t just adopting AI, they’re reimagining work around it. Join Nicole Herskowitz, Sumit Chauhan and Jeff Teper for live demos and real-world examples of the latest innovations in the AI built for work, Microsoft 365 Copilot. Discover what’s new and how Copilot empowers people, teams and business processes in the era of AI.

AI/ML Microsoft
Microsoft Ignite 2025
Csanád Bakos – Data Engineer @ Vinted

While upgrading Flink to its latest versions to enable more AI-related capabilities, one can easily run into tricky savepoint incompatibilities that render existing state snapshots unusable for recovery. This is especially problematic in the case of pipelines with large state. In such cases, doing a backfill can take too long and using the State Processor API leads to downtime or breaking the exactly-once delivery guarantee.

In this talk, I’ll share a state migration pattern that I applied to one of our Flink jobs using regular streaming mode. It involves creating a new stateful operator that conforms to the new requirements, allowing for compatible savepoint creation. Leveraging side outputs and custom key traversal the existing state is forwarded to the new operator. In the meantime, regular processing is uninterrupted.

We’ll explore the core problem and understand the pitfalls and trade-offs of existing solutions such as the State Processor API. Then, a deep-dive into the migration pattern will follow: ensuring correct state handoff between operator versions, setting up triggers to migrate all keys and other technicalities. Lastly, a few words about cleaning up seamlessly. With this session I will add a nice pattern to your toolbox that you can easily apply next time you run into state migration challenges.

flink state processor api
Tides of Change: Real-Time Flow with Postgres, Kafka & Flink
Csanád Bakos – Data Engineer @ Vinted

Talk by Csanád Bakos, Data Engineer at Vinted.

Nicoleta Lazar – Sr. Data Engineer @ Fresha

Talk by Nicoleta Lazar, Senior Data Engineer at Fresha.

Nicoleta Lazar – Sr. Data Engineer @ Fresha

At Fresha, we became the pioneers that put StarRocks to test in production for realtime analytical workloads. But one of the first challenges we faced was getting all the data there reliably and efficiently. We had to think about historical data, and realtime data and orchestrate all of that, such that we can move fast, without breaking too many things. Our tools of choice: Airflow, StarRocks Pipes, Apache Flink. In this talk, I’ll share how we built our data pipelines using Apache Flink and Airflow, what worked and what didn’t for us. Along the way, we’ll explore how Flink helps ensure data consistency, handles failures gracefully, and keeps our real-time workloads running strong.

Airflow starrocks pipes flink
Celeste Hogan – Developer Advocate @ Snowflake

Kafka and Flink tend to get lumped in as "data services", in the sense that they process data, but in comparison to traditional databases they differ quite dramatically in functionality and utility. In this talk, we'll run through the lifetime of a write in Postgres to establish a baseline, understanding all the different services that data hits on its way down to the disk. Then we'll walk through writing data to a Kafka topic, and what 'writing' (or really, streaming) data to a Flink workflow looks like from a similar systems perspective. Along the way, we'll understand the key differences between the services and why some are more suited to long-term data storage than others.

postgresql Kafka flink
Celeste Hogan – Developer Advocate @ Snowflake

Talk by Celeste Hogan, Developer Advocate at Snowflake.

Snowflake
Tides of Change: Real-Time Flow with Postgres, Kafka & Flink
Let's LazyLayout 2025-08-27 · 20:45

Fun using LazyLayout? In this talk we’ll go over what you need to do to create custom LazyLayouts and what can you do with them. After this talk you should have a better understanding of each of the components involved and why you probably don’t need a custom layout after all :D

We often use modern SDKs without thinking about what happens under the hood. This talk follows the journey of a coroutine from launch to execution, giving a high-level view of the key layers it passes through-without getting lost in every tiny detail

Oreoluwa – Data Scientist @ KDE Community

In this workshop, Oreoluwa will walk us through KdeGuiTest, an open-source automation tool used to emulate user interaction with different software applications. KdeGuiTest (previously called KdeEcoTest) is an automation and testing tool which allows one to record and simulate user interactions with the GUI of an application. It is being developed as part of the KDE Eco initiative to create usage scenario scripts for measuring the energy consumption of software.

Python kdeguitest kde
Python for KDE Applications 2025-06-26 · 20:30
Nicolas Fella – KDE Contributor @ KDE

The KDE community has been producing free software for almost 30 years. This does not only include the Plasma desktop environment but hundreds of applications. Virtually all of these applications are written in C++. As part of the "Streamlined Application Development Experience" initiative we want to open the door to include other programming languages as well. In this talk we are going to look at the recent work on enabling writing KDE applications using Python, how to do it, and how to get involved with improving the support for it.

Python kde
Nicole – Advocate @ Opt Green

Nicole discusses environmental impact of proprietary software, why a Linux operating system could be a solution and where you can get help if you decide to switch.

windows 10 Linux open source