talk-data.com
People (4 results)
See all 4 →Activities & events
| Title & Speakers | Event |
|---|---|
|
From Crypto Streams to AI-Powered Predictions
2025-12-01 · 18:30
Olena Kutsenko
– Staff Developer Advocate
@ Confluent
In this 2-hour hands-on workshop, you'll build an end-to-end streaming analytics pipeline that captures live cryptocurrency prices, processes them in real-time, and uses AI to forecast the future. Ingest live crypto data into Apache Kafka using Kafka Connect; tame that chaos with Apache Flink's stream processing; freeze streams into queryable Apache Iceberg tables using Tableflow; and forecast price trends with Flink AI. |
Crypto Streams to AI Predictions: Apache Kafka®, Apache Flink® & Apache Iceberg®
|
|
PLEASE RSVP @ *** Join us for a hands-on workshop by Olena Kutsenko on Monday, December 1st from 6:00pm! In this workshop, you’ll harness the power of Confluent Cloud - the fully managed data streaming platform built on Apache Kafka®, Apache Flink®, and Apache Iceberg® - to build a live crypto-streaming pipeline that ingests, processes, stores, and predicts real-time data. |
Crypto Streams to AI Predictions: Apache Kafka®, Apache Flink® & Apache Iceberg®
|
|
Stop Overfeeding Your AI: A Practical Guide to Context Optimization
2025-08-20 · 18:30
Archana Vaidheeswaran
– Developer Advocate
@ Aleph Alpha
Abstract: Ever notice how your AI interactions start strong but quickly deteriorate with complexity? We've all been there – carefully crafting detailed prompts for AI models, only to receive increasingly mediocre responses as our inputs grow longer. The conventional wisdom says more context equals better results, but real-world evidence suggests otherwise. In this session, I'll share discoveries from analyzing thousands of AI interactions across various domains that reveal a surprising truth: the relationship between prompt length and response quality isn't linear – it's parabolic. There's a sweet spot, and most of us are operating well beyond it. |
|
|
Mastering real-time anomaly detection
2025-08-20 · 16:00
Olena Kutsenko
– Staff Developer Advocate
@ Confluent
Abstract: Detecting problems as they happen is essential in today’s fast-moving, data-driven world. In this talk, you’ll learn how to build a flexible, real-time anomaly detection pipeline using Apache Kafka and Apache Flink, backed by statistical and machine learning models. We’ll start by demystifying what anomaly really means - exploring the different types (point, contextual, and collective anomalies) and the difference between unintentional issues and intentional outliers like fraud or abuse. Then, we’ll look at how anomaly detection is solved in practice: from classical statistical models like ARIMA to deep learning models like LSTM. You’ll learn how ARIMA breaks time series into AutoRegressive, Integrated, and Moving Average components, no math degree required (just a Python library). We’ll also uncover why forgetting is a feature, not a bug, when it comes to LSTMs, and how these models learn to detect complex patterns over time. Throughout, we’ll show how Kafka handles high-throughput streaming data and how Flink enables low-latency, stateful processing to catch issues as they emerge. You’ll leave knowing not just how these systems work, but when to use each type of model depending on your data and goals. Whether you're monitoring system health, tracking IoT devices, or looking for fraud in transactions, this talk will give you the foundations and tools to detect the unexpected - before it becomes a problem. |
|
|
IN PERSON: Apache Kafka® x Apache Iceberg x Apache Flink®
2025-05-07 · 18:00
***IMPORTANT: IF YOU RSVP here you don't need to also RSVP to London Kafka Group.*** Date and Time: 🗓️ Wednesday 7th May, ⏰ 18:00 - 21:00 PM 🕘 Venue: Snowflake, One Crown Place, London EC2A 4EF, U.K. 5th & 6th floors · London Schedule:
🎙️ \~Talk 1\~ Mastering real-time anomaly detection, Olena Kutsenko, Staff Developer Advocate, Confluent Abstract: Detecting problems as they happen is essential in today's fast-moving world. This talk shows how to build a simple, powerful system for real-time anomaly detection in live data. We'll use Apache Kafka for streaming data, Apache Flink for processing it in real time, and various models to detect unusual patterns. Whether it's monitoring systems, or tracking IoT devices, this solution is flexible and reliable. We'll start by exploring how Kafka helps collect and manage fast-moving data streams. Then, we'll demonstrate how Flink processes this data in real time and integrates anomaly detection models to uncover events as they occur. We'll dive into the details of how ARIMA and LSTM work, so even if you’re not into mathematics, you can still understand what happens behind the scenes! This talk is ideal for anyone looking to monitor anomalies in real-time data streams. 🗣️ Speaker 1: Olena is a Staff Developer Advocate at Confluent and a recognized expert in data streaming and analytics. With two decades of experience in software engineering, she has built mission-critical applications, led high-performing teams, and driven large-scale technology adoption at industry leaders like Nokia, HERE Technologies, AWS, and Aiven. 🎙️ \~Talk 2\~ Iced Kaf-fee: Chilling Kafka Data into Iceberg Tables, Danica Fine, Lead Developer Advocate, Open Source at Snowflake Abstract: Have piping-hot, real-time data in Apache Kafka® but want to chill it down into Apache Iceberg™ tables? Let’s see how we can craft the perfect cup of “Iced Kaf-fee” for you and your needs! We’ll start by grinding through the motivation for moving data from Kafka topics into Iceberg tables, exploring the benefits that doing so has to offer your analytics workflows. From there, we’ll open up the menu of options available to cool down your streams, including Apache Flink®, Apache Spark™, and Kafka Connect. Each brewing method has its own recipe, so we’ll compare their pros and cons, walk through use cases for each, and highlight when you might prefer a strong Spark roast over a smooth Flink blend—or maybe a Connect cold brew. Plus, we’ll share a sneak peek at future innovations that are percolating in the community to make sinking your Kafka data into Iceberg even easier. By the end of the session, you’ll have everything you need to whip up the perfect pipeline and serve up your “Iced Kaf-fee” with confidence. 🗣️ Speaker 2: Danica began her career as a software engineer in financial services and pivoted to developer relations, where she focussed primarily on open source technologies under the Apache Software Foundation umbrella such as Apache Kafka and Apache Flink. She now leads the open source advocacy efforts at Snowflake, supporting Apache Iceberg and Apache Polaris (incubating). 🎙️ \~Talk 3\~ Observing all the things: Apache Kafka® and Apache Flink® with OpenTelemetry, Mehreen Tahir Software Engineer, New Relic 🗣️ Speaker 3: Mehreen specializes in machine learning, data science, and artificial intelligence. Mehreen is passionate about observability and the use of telemetry data to improve application performance. She actively contributes to developer communities and has a keen interest in edge analytics and serverless architecture. *** DISCLAIMER NOTE: We are unable to cater for any attendees under the age of 18. If you would like to speak or host our next event please let us know! [email protected] |
IN PERSON: Apache Kafka® x Apache Iceberg x Apache Flink®
|
|
IN PERSON: Apache Kafka to Apache Iceberg examples by Snowflake
2025-05-07 · 17:00
Join us for an a range of talks including Kafka to Apache Iceberg in London hosted by Snowflake! Date and Time: 🗓️ Wednesday 7th May, ⏰ 18:00 - 21:00 PM 🕘 Venue: Snowflake, One Crown Place, London EC2A 4EF, U.K. 5th & 6th floors · London Schedule:
🎙️ \~Talk 1\~ Mastering real-time anomaly detection, Olena Kutsenko, Staff Developer Advocate, Confluent Abstract: Detecting problems as they happen is essential in today's fast-moving world. This talk shows how to build a simple, powerful system for real-time anomaly detection in live data. We'll use Apache Kafka for streaming data, Apache Flink for processing it in real time, and various models to detect unusual patterns. Whether it's monitoring systems, or tracking IoT devices, this solution is flexible and reliable. We'll start by exploring how Kafka helps collect and manage fast-moving data streams. Then, we'll demonstrate how Flink processes this data in real time and integrates anomaly detection models to uncover events as they occur. We'll dive into the details of how ARIMA and LSTM work, so even if you’re not into mathematics, you can still understand what happens behind the scenes! This talk is ideal for anyone looking to monitor anomalies in real-time data streams. 🗣️ Speaker 1: Olena is a Staff Developer Advocate at Confluent and a recognized expert in data streaming and analytics. With two decades of experience in software engineering, she has built mission-critical applications, led high-performing teams, and driven large-scale technology adoption at industry leaders like Nokia, HERE Technologies, AWS, and Aiven. 🎙️ \~Talk 2\~ Iced Kaf-fee: Chilling Kafka Data into Iceberg Tables, Danica Fine, Lead Developer Advocate, Open Source at Snowflake Abstract: Have piping-hot, real-time data in Apache Kafka® but want to chill it down into Apache Iceberg™ tables? Let’s see how we can craft the perfect cup of “Iced Kaf-fee” for you and your needs! We’ll start by grinding through the motivation for moving data from Kafka topics into Iceberg tables, exploring the benefits that doing so has to offer your analytics workflows. From there, we’ll open up the menu of options available to cool down your streams, including Apache Flink®, Apache Spark™, and Kafka Connect. Each brewing method has its own recipe, so we’ll compare their pros and cons, walk through use cases for each, and highlight when you might prefer a strong Spark roast over a smooth Flink blend—or maybe a Connect cold brew. Plus, we’ll share a sneak peek at future innovations that are percolating in the community to make sinking your Kafka data into Iceberg even easier. By the end of the session, you’ll have everything you need to whip up the perfect pipeline and serve up your “Iced Kaf-fee” with confidence. 🗣️ Speaker 2: Danica began her career as a software engineer in financial services and pivoted to developer relations, where she focussed primarily on open source technologies under the Apache Software Foundation umbrella such as Apache Kafka and Apache Flink. She now leads the open source advocacy efforts at Snowflake, supporting Apache Iceberg and Apache Polaris (incubating). 🎙️ \~Talk 3\~ Observing all the things: Apache Kafka® and Apache Flink® with OpenTelemetry, Mehreen Tahir Software Engineer, New Relic 🗣️ Speaker 3: Mehreen specializes in machine learning, data science, and artificial intelligence. Mehreen is passionate about observability and the use of telemetry data to improve application performance. She actively contributes to developer communities and has a keen interest in edge analytics and serverless architecture. |
IN PERSON: Apache Kafka to Apache Iceberg examples by Snowflake
|
|
Apache Kafka in Action
2025-05-04
Alexander Kropp
– author
,
Anatoly Zelenin
– author
Apache Kafka, start to finish. Apache Kafka in Action: From basics to production guides you through the concepts and skills you’ll need to deploy and administer Kafka for data pipelines, event-driven applications, and other systems that process data streams from multiple sources. Authors Anatoly Zelenin and Alexander Kropp have spent years using Kafka in real-world production environments. In this guide, they reveal their hard-won expert insights to help you avoid common Kafka pitfalls and challenges. Inside Apache Kafka in Action you’ll discover: Apache Kafka from the ground up Achieving reliability and performance Troubleshooting Kafka systems Operations, governance, and monitoring Kafka use cases, patterns, and anti-patterns Clear, concise, and practical, Apache Kafka in Action is written for IT operators, software engineers, and IT architects working with Kafka every day. Chapter by chapter, it guides you through the skills you need to deliver and maintain reliable and fault-tolerant data-driven applications. About the Technology Apache Kafka is the gold standard streaming data platform for real-time analytics, event sourcing, and stream processing. Acting as a central hub for distributed data, it enables seamless flow between producers and consumers via a publish-subscribe model. Kafka easily handles millions of events per second, and its rock-solid design ensures high fault tolerance and smooth scalability. About the Book Apache Kafka in Action is a practical guide for IT professionals who are integrating Kafka into data-intensive applications and infrastructures. The book covers everything from Kafka fundamentals to advanced operations, with interesting visuals and real-world examples. Readers will learn to set up Kafka clusters, produce and consume messages, handle real-time streaming, and integrate Kafka into enterprise systems. This easy-to-follow book emphasizes building reliable Kafka applications and taking advantage of its distributed architecture for scalability and resilience. What's Inside Master Kafka’s distributed streaming capabilities Implement real-time data solutions Integrate Kafka into enterprise environments Build and manage Kafka applications Achieve fault tolerance and scalability About the Reader For IT operators, software architects and developers. No experience with Kafka required. About the Authors Anatoly Zelenin is a Kafka expert known for workshops across Europe, especially in banking and manufacturing. Alexander Kropp specializes in Kafka and Kubernetes, contributing to cloud platform design and monitoring. Quotes A great introduction. Even experienced users will go back to it again and again. - Jakub Scholz, Red Hat Approachable, practical, well-illustrated, and easy to follow. A must-read. - Olena Kutsenko, Confluent A zero to hero journey to understanding and using Kafka! - Anthony Nandaa, Microsoft Thoughtfully explores a wide range of topics. A wealth of valuable information seamlessly presented and easily accessible. - Olena Babenko, Aiven Oy |
O'Reilly Data Engineering Books
|
|
Panel Discussion | Building Effective Data Teams: Strategies for Success
2024-12-06 · 23:14
Paul Andrew
,
Olena Kutsenko
,
Martin Zuern
,
Gunnar Morling
– Software Engineer and open-source enthusiast
@ Decodable
🌟 Session Overview 🌟 Session Name: Building Effective Data Teams: Strategies for Success Speaker: Gunnar Morling, Martin Zuern, Olena Kutsenko, Paul Andrew Session Description: Panel Discussion will explore the key strategies for assembling and nurturing high-performing data teams. Expert panelists will discuss best practices for recruiting top talent, fostering collaboration, and creating a culture of innovation within data teams. The session will also address common challenges such as skill gaps, team dynamics, and aligning data initiatives with business goals. 🚀 About Big Data and RPA 2024 🚀 Unlock the future of innovation and automation at Big Data & RPA Conference Europe 2024! 🌟 This unique event brings together the brightest minds in big data, machine learning, AI, and robotic process automation to explore cutting-edge solutions and trends shaping the tech landscape. Perfect for data engineers, analysts, RPA developers, and business leaders, the conference offers dual insights into the power of data-driven strategies and intelligent automation. 🚀 Gain practical knowledge on topics like hyperautomation, AI integration, advanced analytics, and workflow optimization while networking with global experts. Don’t miss this exclusive opportunity to expand your expertise and revolutionize your processes—all from the comfort of your home! 📊🤖✨ 📅 Yearly Conferences: Curious about the evolution of QA? Check out our archive of past Big Data & RPA sessions. Watch the strategies and technologies evolve in our videos! 🚀 🔗 Find Other Years' Videos: 2023 Big Data Conference Europe https://www.youtube.com/playlist?list=PLqYhGsQ9iSEpb_oyAsg67PhpbrkCC59_g 2022 Big Data Conference Europe Online https://www.youtube.com/playlist?list=PLqYhGsQ9iSEryAOjmvdiaXTfjCg5j3HhT 2021 Big Data Conference Europe Online https://www.youtube.com/playlist?list=PLqYhGsQ9iSEqHwbQoWEXEJALFLKVDRXiP 💡 Stay Connected & Updated 💡 Don’t miss out on any updates or upcoming event information from Big Data & RPA Conference Europe. Follow us on our social media channels and visit our website to stay in the loop! 🌐 Website: https://bigdataconference.eu/, https://rpaconference.eu/ 👤 Facebook: https://www.facebook.com/bigdataconf, https://www.facebook.com/rpaeurope/ 🐦 Twitter: @BigDataConfEU, @europe_rpa 🔗 LinkedIn: https://www.linkedin.com/company/73234449/admin/dashboard/, https://www.linkedin.com/company/75464753/admin/dashboard/ 🎥 YouTube: http://www.youtube.com/@DATAMINERLT |
|
|
🌟 Session Overview 🌟 Session Name: Sentiment Analysis in Action: Building Your Real-time Pipeline Speaker: Olena Kutsenko Session Description: Monitoring and interpreting the sentiment of data records is important for a variety of use cases. However, traditional human-based methods fall short in handling huge volumes of information with the required speed and efficiency. AI, however, can address this challenge. AI is only part of the solution. We need to build a data pipeline that ingests data from various channels, processes it using AI-driven sentiment analysis models to classify the sentiment of each individual record, and prepares it to be consumed by applications for aggregation and analysis. In this session, we'll build a system using open-source technologies Apache Kafka and Apache Flink with AI models to obtain real-time sentiment from social media data. Apache Kafka's scalability ensures that no record is left behind, making it a reliable foundation for sentiment analysis. Apache Flink, with its adaptability to fluctuations in data volume and velocity, will enable the analysis of a continuous data stream using an AI model. 🚀 About Big Data and RPA 2024 🚀 Unlock the future of innovation and automation at Big Data & RPA Conference Europe 2024! 🌟 This unique event brings together the brightest minds in big data, machine learning, AI, and robotic process automation to explore cutting-edge solutions and trends shaping the tech landscape. Perfect for data engineers, analysts, RPA developers, and business leaders, the conference offers dual insights into the power of data-driven strategies and intelligent automation. 🚀 Gain practical knowledge on topics like hyperautomation, AI integration, advanced analytics, and workflow optimization while networking with global experts. Don’t miss this exclusive opportunity to expand your expertise and revolutionize your processes—all from the comfort of your home! 📊🤖✨ 📅 Yearly Conferences: Curious about the evolution of QA? Check out our archive of past Big Data & RPA sessions. Watch the strategies and technologies evolve in our videos! 🚀 🔗 Find Other Years' Videos: 2023 Big Data Conference Europe https://www.youtube.com/playlist?list=PLqYhGsQ9iSEpb_oyAsg67PhpbrkCC59_g 2022 Big Data Conference Europe Online https://www.youtube.com/playlist?list=PLqYhGsQ9iSEryAOjmvdiaXTfjCg5j3HhT 2021 Big Data Conference Europe Online https://www.youtube.com/playlist?list=PLqYhGsQ9iSEqHwbQoWEXEJALFLKVDRXiP 💡 Stay Connected & Updated 💡 Don’t miss out on any updates or upcoming event information from Big Data & RPA Conference Europe. Follow us on our social media channels and visit our website to stay in the loop! 🌐 Website: https://bigdataconference.eu/, https://rpaconference.eu/ 👤 Facebook: https://www.facebook.com/bigdataconf, https://www.facebook.com/rpaeurope/ 🐦 Twitter: @BigDataConfEU, @europe_rpa 🔗 LinkedIn: https://www.linkedin.com/company/73234449/admin/dashboard/, https://www.linkedin.com/company/75464753/admin/dashboard/ 🎥 YouTube: http://www.youtube.com/@DATAMINERLT |
|
|
Managing TiDB Upgrades & useful practices with Apache Kafka
2023-10-11 · 14:30
Zoom link for online participants will be published on the date of meetup here: https://bolt.zoom.us/j/93106867043?pwd=NTMvM29DODgvOG5uQUJuQnJqSVN4UT09 During the meetup, we will delve into the cutting-edge technologies powering our TiDB Cloud Data Service and explore TiDB Upgrades. We will also dive into how Bolt manages their TiDB Upgrades with the paranoid TiDB upgrade guide. We discuss a beginner guide to balance your data across Apache Kafka partitions This is a fantastic opportunity to expand your knowledge, gain insights from industry experts, and connect with fellow tech enthusiasts. Whether you’re a seasoned professional or just starting out, these sessions have something valuable for everyone. Agenda: 17:30 – 18:00 Networking 18:00 – 18:10 Opening words 18:15 – 18:55 TiDB Cloud Data Service Presenter: Daniel James – Principal Solutions Engineer at PingCAP 19:05 – 19:45 The Paranoid TiDB Version Upgrader’s Guide. Presenter: Leandro Morgado – Senior MySQL DBA at Bolt 19:45 – 20:00 BREAK 20:05 – 20:45 Beginners guide to balance your data across Apache Kafka partitions Presenter: Olena Kutsenko – Sr. Developer Advocate at Aiven 20:45 – 21:30 Networking |
Managing TiDB Upgrades & useful practices with Apache Kafka
|