talk-data.com
People (4 results)
See all 4 →Activities & events
| Title & Speakers | Event |
|---|---|
|
From Raw Data to Trusted Assets: A Practical Walkthrough with AWS services and Collibra
2025-09-16 · 16:00
Behnaz Derakhshani
– Specialist Data Engineer
@ Diconium
Expect a hands-on journey showing how modern data lake tools and governance platforms connect the dots - making your data discoverable, governed, and productized for real-world use. |
|
|
Yingjun Wu
– Speaker
@ RisingWave Labs
Stream processing systems have traditionally relied on local storage engines such as RocksDB to achieve low latency. While effective in single-node setups, this model doesn't scale well in the cloud, where elasticity and separation of compute and storage are essential. In this talk, we'll explore how RisingWave rethinks the architecture by building directly on top of S3 while still delivering sub-100 ms latency. At the core is Hummock, a log-structured state engine designed for object storage. Hummock organizes state into a three-tier hierarchy: in-memory cache for the hottest keys, disk cache managed by Foyer for warm data, and S3 as the persistent cold tier. This approach ensures queries never directly hit S3, avoiding its variable performance. We'll also examine how remote compaction offloads heavy maintenance tasks from query nodes, eliminating interference between user queries and background operations. Combined with fine-grained caching policies and eviction strategies, this architecture enables both consistent query performance and cloud-native elasticity. Attendees will walk away with a deeper understanding of how to design streaming systems that balance durability, scalability, and low latency in an S3-based environment. |
|
|
Effective Agentic genAI in data streaming
2025-09-16 · 16:00
Erik Schmiegelow
– CEO
@ Hivemind Technologies
Successful gen AI projects strike the balance between impact, accuracy and cost - in this talk, we cover how to create agentic data applications effectively, choosing when and how to integrate them in data streams and keep response quality issues and costs in check. |
|
|
Dear data-loving community, we can't wait to present to you our new Meetup event: This time, it will be a collaboration with RisingWave, a platform for real-time streaming data management and analysis. Yingjun Wu, Founder and CEO at RisingWave Labs, will share his experience in a techy talk, as well as Behnaz Derakhshani, who works as a Specialist Data Engineer at Diconium's data department. Additionally, we're going to welcome external guest speaker Erik Schmiegelow, CEO at Hivemind Technologies. Exciting line-up, right? :D Join us on September 16th in Berlin and bring all your questions! Here are the topics you can expect: Yingjun Wu: Achieving Sub‑100 ms Real‑Time Stream Processing with an S3‑Native Architecture Stream processing systems have traditionally relied on local storage engines such as RocksDB to achieve low latency. While effective in single-node setups, this model doesn't scale well in the cloud, where elasticity and separation of compute and storage are essential. In this talk, we'll explore how RisingWave rethinks the architecture by building directly on top of S3 while still delivering sub-100 ms latency. At the core is Hummock, a log-structured state engine designed for object storage. Hummock organizes state into a three-tier hierarchy: in-memory cache for the hottest keys, disk cache managed by Foyer for warm data, and S3 as the persistent cold tier. This approach ensures queries never directly hit S3, avoiding its variable performance. We'll also examine how remote compaction offloads heavy maintenance tasks from query nodes, eliminating interference between user queries and background operations. Combined with fine-grained caching policies and eviction strategies, this architecture enables both consistent query performance and cloud-native elasticity. Attendees will walk away with a deeper understanding of how to design streaming systems that balance durability, scalability, and low latency in an S3-based environment. Behnaz Derakhshani: From Raw Data to Trusted Assets: A Practical Walkthrough with AWS services and Collibra Expect a hands-on journey of Behnaz showing how modern data lake tools and governance platforms connect the dots, making your data discoverable, governed, and productized for real-world use. Erik Schmiegelow: Effective Agentic GenAI in Data Streaming Successful genAI projects strike the balance between impact, accuracy, and cost. In this talk, Erik will cover how to create agentic data applications effectively, choosing when and how to integrate them in data streams and keep response quality issues and costs in check. What you can expect:
Timetable:
Our goal is to form a local data-loving community, so join us and let's talk data together! -> Our event page, where you can also contact us if you want to present in the future at our Meetup: Data Engineering MeetUp Berlin - applydata --- At the event, sound, image and video recordings are created and published for documentation purposes as well as for the presentation of the event in publicly accessible media, on websites and blogs and for presentation on social media. By participating the event, the participant implicitly consents to the aforementioned photo and/or video recordings. Find more information on data protection here. |
Data Builders’ Evening: Architecture, Engineering & Beyond | Berlin, Sep. 16th
|
|
IN PERSON: How Apache Kafka can power AI applications
2025-02-25 · 18:00
Join us for our AI focused meetup! You'll learn all about how to use Apache Kafka in data intensive AI applications and a bunch more. Date and Time: 🗓️ Tuesday 25th February, ⏰ 18:00 - 20:30 PM 🕘 Venue: Lacon House, London WC1X 8NL, United Kingdom Attending Brands: OSO, Hivemind, Lenses Schedule: 18:00: Doors Open 18:00 - 18:30: Food, drinks, networking 18:30 - 19:00: "Streaming Data Platforms - the convergence of micro services and data lakehouses" - Erik Schmiegelow ( CEO, Hivemind Technologies) 19:00 - 19:30: “K2K - making a Universal Kafka Replicator - (Adamos Loizou is Head of Product at Lenses and Carlos Teixeira is a Software Engineer at Lenses) 19:30- 20:30pm: Additional Q&A, Networking 🎙️ \~Talk 1\~ Talk Title: Streaming Data Platforms - the convergence of micro services and data lakehouses Summary: The data space is experiencing a generational shift from classic batch to near real time data products, driven by business demand and AI use cases. We explore what fundamental architectural changes are required and how to transition from data lakes to streaming streams data platforms 🗣️ Speaker 1: Erik is a director and co-founder of Hivemind Technologies, a technology consultancy focussed on cloud and data engineering with functional programming and Infrastructure as Code. Prior to Hivemind, Erik founded several companies and worked in finance, advertising and industrial sectors building scalable distributed software platforms. 🎙️ \~Talk 2\~ Talk Title: K2K - making a Universal Kafka Replicator - Lenses Summary: Join us as the Lenses dev team shares the story behind building K2K, our new tool for Kafka replication and migration. We'll dive into why we made certain design choices, the technical challenges we had to tackle (spoiler: there were plenty), and what K2K can do right now. Plus, we'll give you a peek at where we're headed and what features we're dreaming up for the future. It's a behind-the-scenes look at turning a complex problem into a practical solution 🗣️ Speaker 1: Adamos Loizou is Head of Product at Lenses.io He's been a software engineer for 15+ years. One day, after getting angry at the screen for debugging Kafka Streams, he went to a meetup. He saw an awesome tool that fixed his problem. That tool was Lenses. He lives in London, UK, wears a beanie and has recently switched his mustache for a beard. 🗣️ Speaker 1: Carlos Teixeira is a Software Engineer at Lenses.io, based in Porto, Portugal. He has experience working with Kafka and enjoys solving problems in distributed systems, with a particular interest in programming with Scala and exploring the intricacies of concurrency. |
IN PERSON: How Apache Kafka can power AI applications
|
|
Brownfield Data Integration: How to Migrate from Complex Legacy Environments to Modern Data Platforms
2024-04-25 · 19:00
Erik Schmiegelow
– CEO
@ Hivemind Technologies
|
|
|
Streaming Feature Pipelines with Quix for Real-Time Coinbase Market AI Systems on Hopsworks
2024-04-25 · 18:30
Javier de la Rúa Martínez
– Research Engineer
@ Hopsworks
AI/ML
Data Streaming
|
|
|
London Analytics Engineering Meetup #12
2024-04-11 · 17:00
We're delighted to announce our next meetup will be hosted by Lyst. Agenda: 6:00pm: Doors open, networking, food, drinks. 7:00pm: Talks start! 8:00pm: Talks finish, Q&A. 8:30pm: Further drinks and networking. Speakers: Will GenAI replace Data Engineers? - Gaurav Tiwari - Engineering Manager @ Spotify Brownfield data integration: how to transition from complex legacy data sources to modern data platforms - Erik Schmiegelow - CEO @ Hivemind Technologies The London Analytics Engineering Meetup is focused on discussing and spreading best practices in the growing field of analytics engineering. Whether you've set up the "Modern Data Stack" many times over or are brand new to tools like dbt, Looker, Snowflake, Bigquery and more, this is the meetup for you. Food and drink will be provided by Lyst with additional support from Spectacles and SELECT. |
London Analytics Engineering Meetup #12
|