talk-data.com
Activities & events
| Title & Speakers | Event |
|---|---|
|
Dear data-loving community, we can't wait to present to you our new Meetup event: This time, it will be a collaboration with RisingWave, a platform for real-time streaming data management and analysis. Yingjun Wu, Founder and CEO at RisingWave Labs, will share his experience in a techy talk, as well as Behnaz Derakhshani, who works as a Specialist Data Engineer at Diconium's data department. Additionally, we're going to welcome external guest speaker Erik Schmiegelow, CEO at Hivemind Technologies. Exciting line-up, right? :D Join us on September 16th in Berlin and bring all your questions! Here are the topics you can expect: Yingjun Wu: Achieving Sub‑100 ms Real‑Time Stream Processing with an S3‑Native Architecture Stream processing systems have traditionally relied on local storage engines such as RocksDB to achieve low latency. While effective in single-node setups, this model doesn't scale well in the cloud, where elasticity and separation of compute and storage are essential. In this talk, we'll explore how RisingWave rethinks the architecture by building directly on top of S3 while still delivering sub-100 ms latency. At the core is Hummock, a log-structured state engine designed for object storage. Hummock organizes state into a three-tier hierarchy: in-memory cache for the hottest keys, disk cache managed by Foyer for warm data, and S3 as the persistent cold tier. This approach ensures queries never directly hit S3, avoiding its variable performance. We'll also examine how remote compaction offloads heavy maintenance tasks from query nodes, eliminating interference between user queries and background operations. Combined with fine-grained caching policies and eviction strategies, this architecture enables both consistent query performance and cloud-native elasticity. Attendees will walk away with a deeper understanding of how to design streaming systems that balance durability, scalability, and low latency in an S3-based environment. 
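The three-tier lookup order described in the abstract (memory cache, then disk cache, then S3) can be illustrated with a small toy model. This is only a sketch under stated assumptions, not RisingWave's or Hummock's actual implementation: the class `TieredStateStore`, its parameters, and the LRU eviction policy are hypothetical, plain dictionaries stand in for the disk cache and for S3, and the fallback read from the cold tier on a miss is a simplification (the abstract says queries never hit S3 directly, and does not describe how that is achieved).

```python
from collections import OrderedDict


class TieredStateStore:
    """Toy read path: memory cache -> disk cache -> object store (hypothetical)."""

    def __init__(self, object_store, memory_capacity=2, disk_capacity=4):
        self.memory = OrderedDict()        # hottest keys, kept in LRU order
        self.disk = OrderedDict()          # warm keys; a dict stands in for a local disk cache
        self.object_store = object_store   # cold tier; a dict stands in for S3
        self.memory_capacity = memory_capacity
        self.disk_capacity = disk_capacity
        self.object_store_reads = 0        # counts how often the cold tier is touched

    @staticmethod
    def _put(tier, capacity, key, value):
        """Insert into a tier and return the evicted (key, value) pair, if any."""
        tier[key] = value
        tier.move_to_end(key)
        if len(tier) > capacity:
            return tier.popitem(last=False)  # evict the least recently used entry
        return None

    def _admit_to_memory(self, key, value):
        evicted = self._put(self.memory, self.memory_capacity, key, value)
        if evicted is not None:
            # Demote the entry evicted from memory to the disk tier.
            self._put(self.disk, self.disk_capacity, *evicted)

    def get(self, key):
        """Return (value, tier_name) for the tier that served the read."""
        if key in self.memory:
            self.memory.move_to_end(key)
            return self.memory[key], "memory"
        if key in self.disk:
            value = self.disk.pop(key)
            self._admit_to_memory(key, value)  # promote warm data to memory
            return value, "disk"
        self.object_store_reads += 1
        value = self.object_store[key]         # raises KeyError if the key is unknown
        self._admit_to_memory(key, value)
        return value, "object_store"


if __name__ == "__main__":
    store = TieredStateStore({"a": 1, "b": 2, "c": 3})
    for key in ["a", "b", "c", "a", "a"]:
        print(key, store.get(key))
```

With a memory capacity of two, reading `a`, `b`, `c` demotes `a` to the disk tier, so the next read of `a` is served from disk and promoted back to memory; only the first three reads touch the cold tier.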
Behnaz Derakhshani: From Raw Data to Trusted Assets: A Practical Walkthrough with AWS services and Collibra Expect a hands-on journey of Behnaz showing how modern data lake tools and governance platforms connect the dots, making your data discoverable, governed, and productized for real-world use. Erik Schmiegelow: Effective Agentic GenAI in Data Streaming Successful genAI projects strike the balance between impact, accuracy, and cost. In this talk, Erik will cover how to create agentic data applications effectively, choosing when and how to integrate them in data streams and keep response quality issues and costs in check. What you can expect:
Timetable:
Our goal is to form a local data-loving community, so join us and let's talk data together! -> Our event page, where you can also contact us if you want to present in the future at our Meetup: Data Engineering MeetUp Berlin - applydata --- At the event, sound, image and video recordings are created and published for documentation purposes as well as for the presentation of the event in publicly accessible media, on websites and blogs and for presentation on social media. By participating in the event, the participant implicitly consents to the aforementioned photo and/or video recordings. Find more information on data protection here. |
Data Builders’ Evening: Architecture, Engineering & Beyond | Berlin, Sep. 16th
|
|
ClickHouse Delhi/Gurgaon Meetup - March 2025
2025-03-22 · 05:00
We are excited to finally have the first ClickHouse Meetup in the vibrant city of Delhi! Join the ClickHouse crew, from Singapore and from different cities in India, for an engaging day of talks, food, and discussion with your fellow database enthusiasts. But here's the deal: to secure your spot, make sure you register ASAP! 🗓️ Agenda:
If anyone from the community is interested in sharing a talk at future meetups, complete this CFP form and we’ll be in touch. _______ 🎤 Session Details: Introduction to ClickHouse Discover the secrets behind ClickHouse's unparalleled efficiency and performance. Rakesh will give an overview of different use cases for which global companies are adopting this groundbreaking database to transform data storage and analytics. Speaker: Rakesh Puttaswamy, Solution Architect @ ClickHouse Rakesh Puttaswamy is a Solution Architect with ClickHouse, working with users across India, with over 12 years of experience in data architecture, big data, data science, and software engineering. Rakesh helps organizations design and implement cutting-edge data-driven solutions. With deep expertise in a broad range of databases and data warehousing technologies, he specializes in building scalable, innovative solutions to enable data transformation and drive business success. 🎤 Session Details: ClickPipes Overview and demo ClickPipes is a powerful integration engine that simplifies data ingestion at scale, making it as easy as a few clicks. With an intuitive onboarding process, setting up new ingestion pipelines takes just a few steps: select your data source, define the schema, and let ClickPipes handle the rest. Designed for continuous ingest, it automates pipeline management, ensuring seamless data flow without manual intervention. In this talk, Kunal will demo the Postgres CDC connector for ClickPipes, enabling seamless, native replication of Postgres data to ClickHouse Cloud in just a few clicks, with no external tools needed, for fast, cost-effective analytics. Speaker: Kunal Gupta, Sr. Software Engineer @ ClickHouse Kunal Gupta is a Senior Software Engineer at ClickHouse, joining through the acquisition of PeerDB in 2024, where he played a pivotal role as a founding engineer. 
With several years of experience in architecting scalable systems and real-time applications, Kunal has consistently driven innovation and technical excellence. Previously, he was a founding engineer for new solutions at ICICIdirect and at AsknBid Tech, leading high-impact teams and advancing code analysis, storage solutions, and enterprise software development. 🎤 Session Details: Optimizing Log Management with Clickhouse: Cost-Effective & Scalable Solutions Efficient log management is essential in today's cloud-native environments, yet traditional solutions like Elasticsearch often face scalability issues, high costs, and performance limitations. This talk will begin with an overview of common logging tools and their challenges, followed by an in-depth look at ClickHouse's architecture. We will compare ClickHouse with Elasticsearch, focusing on improvements in query performance, storage efficiency, and overall cost-effectiveness. A key highlight will be OLX India's migration to ClickHouse, detailing the motivations behind the shift, the migration strategy, key optimizations, and the resulting 50% reduction in log storage costs. By the end of this talk, attendees will gain a clear understanding of when and how to leverage ClickHouse for log management, along with best practices for optimizing performance and reducing operational costs. Speaker: Pushpender Kumar, DevOps Architect @ OLX India Born and raised in Bijnor, Pushpender moved to Delhi to stay ahead in the race of life. He currently works as a DevOps Architect at OLX India, specializing in cloud infrastructure, Kubernetes, and automation, with over 10 years of experience. He successfully reduced log storage costs by 50% using ClickHouse, bringing scalability and efficiency to large-scale logging systems. He is passionate about cloud optimization, DevOps hiring, and performance engineering. 
🎤 Session Details: ClickHouse at Physics Wallah: Empowering Real-Time Analytics at Scale This session explores how Physics Wallah revolutionized its real-time analytics capabilities by leveraging ClickHouse. We'll delve into the journey of implementing ClickHouse to efficiently handle large-scale data processing, optimize query performance, and power diverse use cases such as user activity tracking and engagement analysis. By enabling actionable insights and seamless decision-making, this transformation has significantly enhanced the learning experience for millions of users. Today, more than five customer-facing products at Physics Wallah are powered by ClickHouse, serving over 10 million students and parents, including 1.5 million Daily Active Users. Our in-house ClickHouse cluster, hosted and managed within our EKS infrastructure on AWS Cloud, ingests more than 10 million rows of data daily from various sources. Join us to learn about the architecture, challenges, and key strategies behind this scalable, high-performance analytics solution. Speaker: Utkarsh G. Srivastava, Software Development Engineer III @ Physics Wallah As a versatile Software Engineer with over 7 years of experience in the IT industry, I have had the privilege of taking on diverse roles, with a primary focus on backend development, data engineering, infrastructure, DevOps, and security. Throughout my career, I have played a pivotal role in transformative projects, consistently striving to craft innovative and effective solutions for customers in the SaaS space. 🎤 Session Details: FabFunnel & ClickHouse: Delivering Real-Time Marketing Analytics We are a performance marketing company that relies on real-time reporting to drive data-driven decisions and maximize campaign effectiveness. 
As our client base expanded, we encountered significant challenges with our reporting system—frequent data updates meant handling large datasets inefficiently, leading to slow query execution and delays in delivering insights. This bottleneck hindered our ability to provide timely optimizations for ad campaigns. To address these issues, we needed a solution that could handle rapid data ingestion and querying at scale without the overhead of traditional refresh processes. In this talk, we’ll share how we transformed our reporting infrastructure to achieve real-time insights, enhancing speed, scalability, and efficiency in managing large-scale ad performance data. Speakers: Anmol Jain, SDE-2 (Full stack Developer), & Siddhant Gaba, SDE-2 (Python) @ Idea Clan From competing as a national table tennis player to building high-performance software, Anmol Jain brings a unique mix of strategy and problem-solving to tech. With 3+ years of experience at Idea Clan, they play a key role in scaling Lookfinity and FabFunnel, managing multi-million-dollar ad spends every month. Specializing in ClickHouse, React.js, and Node.js, Anmol focuses on real-time data processing and scalable backend solutions. At this meet-up, they’ll share insights on solving reporting challenges and driving real-time decision-making in performance marketing. Siddhant Gaba is an SDE II at Idea Clan, with expertise in Python, Java, and C#, specializing in scalable backend systems. With four years of experience working with FastAPI, PostgreSQL, MongoDB, and ClickHouse, he focuses on real-time analytics, database optimization, and distributed systems. Passionate about high-performance computing, asynchronous APIs, and system design, he aims to advance real-time data processing. Outside of work, he enjoys playing volleyball. At this meetup, he will share insights on how ClickHouse transformed real-time reporting and scalability. 
🎤 Session Details: From SQL to AI: Building Intelligent Applications with ClickHouse and LangDB As AI becomes a driving force behind innovation, building applications that seamlessly integrate AI capabilities with existing data infrastructures is critical. In this session, we explore the creation of agentic applications using ClickHouse and LangDB. We will introduce the concept of an AI gateway, explaining its role in connecting powerful AI models with the high-performance analytics engine of ClickHouse. By leveraging LangDB, we demonstrate how to directly interact with AI functions as User-Defined Functions (UDFs) in ClickHouse, enabling developers to design and execute complex AI workflows within SQL. Additionally, we will showcase how LangDB facilitates deep visibility into AI function behaviors and agent interactions, providing tools to analyze and optimize the performance of AI-driven logic. Finally, we will highlight how ClickHouse, powered by LangDB APIs, can be used to evaluate and refine the quality of LLM responses, ensuring reliable and efficient AI integrations. Speaker: Matteo Pelati, Co-founder, LangDB.ai Matteo Pelati is a seasoned software engineer with over two decades of experience, specializing in data engineering for the past ten years. He is the co-founder of LangDB, a company based in Singapore building the fastest Open Source AI Gateway. Before founding LangDB, he was part of the early team at DataRobot, where he contributed to scaling their product for enterprise clients. Subsequently, he joined DBS Bank where he built their data platform and team from the ground up. Prior to starting LangDB, Matteo led the data group for Asia Pacific and data engineering at Goldman Sachs. |
ClickHouse Delhi/Gurgaon Meetup - March 2025
|
|
DevFest Berlin 2024
2024-11-23 · 08:00
DevFest Berlin is back! This year we are back at Humboldt University of Berlin, with more than 25 talks & workshops. You can expect a whole day of learning, socialising, and engaging with a vibrant Berlin Tech community! 🎫 Get your ticket here: pretix.eu/devfestberlin/2024/ 🖍 Call for Papers still open: pretalx.com/devfest-berlin-2024/cfp Agenda Day 1 9:00 AM: Registration & Coffee 🥐 ☕️ 9:45 AM: 🎤 Welcoming 10:00 AM: 🎤 Katya Vinnichenko - Introduction to Google Principles of Responsible AI This year's DevFest explores how AI can improve lives globally, from business to healthcare to education. At Google we acknowledge AI's potential, while also recognising the challenges it presents. Thus, we are committed to helping you build and use AI responsibly, ensuring fairness and ethical practices. In my talk you will learn: the main principles of responsible AI at Google; the ethical implications of AI; best practices for developing AI systems and integrating AI into Google products and services; last but not least – how AI will change the role of the developer as we know it. 10:50 AM: 🎤 Oleksii Antypov - DMARC Demystified Discover the essential framework behind DMARC and how it secures email communication across the internet. This session covers the historical evolution of email security, dives into the common challenges of implementing DMARC, and provides actionable best practices for protecting your domain. Ideal for developers, security professionals, and anyone interested in safe email practices. In a world where phishing and email spoofing are constant threats, DMARC stands as a vital defense mechanism. “DMARC Demystified” takes you through a journey from the origins of email security to the modern challenges and solutions that DMARC offers. We'll explore how DMARC works with SPF and DKIM, why it’s essential for organizations of all sizes, and the practical steps to ensure smooth implementation. 
Expect an interactive timeline tracing the milestones of email security, detailed breakdowns of real-world cases, and insights into optimizing DMARC. Walk away with a deeper understanding of email protection, armed with knowledge to strengthen your email systems and protect against threats. 11:40 AM: 🎤 Marcin Chudy - Demystifying App Architecture: The LeanCode Guide At LeanCode we developed over 40 Flutter apps, spanning from huge enterprise apps to nimble startup ventures. Some were developed by a single Flutter dev, some came into light through collaborative efforts across multiple teams. Each of them was different. Each of them presented unique challenges and taught us invaluable lessons. In this talk, we invite you to explore different approaches to architecting Flutter apps. Central to our narrative will be the concept of architectural drivers - key factors or priorities that steer our decisions about how the app is structured and designed. We'll show how we leverage our experience when approaching new projects. Drawing from our successes and failures, we'll present our current Flutter stack which enables us to craft robust, scalable, and maintainable applications. While there is no silver bullet for Flutter architecture, we can still have some sensible defaults. Why do we use BLoC for state management? Why not Riverpod? Why do we love hook 12:30 PM: 🎤 Danny Preussler - Ten things you heard about testing that might be wrong Testing became an essential part of Android development. Many conference talks have been given and even more best practices have been written. But what if, as time evolved, some of the things we thought were true, changed? Let’s start questioning some of these in this talk: Are flaky tests fixable? Are mocks even harmful? Is DI about testing? Did we understand testing in isolation properly? Is the test pyramid still valid? And in times of AI, should we generate tests? Come and join my session to learn more! 
1:10 PM: Lunch 🍔🥤 2:40 PM: 🎤 Andrey Sitnik - Privacy-first architecture: alternatives to GDPR popup and local-first Why and how modern developers could increase the privacy of modern Web. The popularity of clouds, the rise of huge monopolies across the internet, and the growth of shady data brokers recently have made the world a much more dangerous place for ordinary people—here is how we fix it. In this talk, Andrey Sitnik, the creator of PostCSS and the privacy-first open-source RSS reader, will explain how we can stop this dangerous trend and make the web a private place again. — Beginners will find simple steps, which can be applied to any website — Advanced developers will get practical insights into new local-first architecture — Privacy experts could find useful unique privacy tricks from a global world perspective and beyond just U.S. privacy risks 3:30 PM: 🎤 Raphaël VO - Largest Contentful Paint - The unheard story Largest Contentful Paint (LCP) is more than a speed metric — it's the unseen factor shaping user experiences and impacting SEO. While often overlooked, LCP reveals when a page’s core content is truly ready, affecting how users perceive load time and usability. This talk uncovers LCP’s role, why it matters more than we think, and simple strategies to boost LCP for better engagement and rankings. Discover the hidden story behind one of web performance’s most crucial, yet understated metrics. Did you know the speed of a single webpage element could decide if users stay or leave? Largest Contentful Paint (LCP) is that hidden hero, quietly working to load the most important content quickly. This talk unveils LCP’s role in creating faster, more engaging web experiences and why it’s key to winning user loyalty. Dive into the “unheard story” of LCP and discover practical tips to make your site not only faster but unforgettable. 
4:20 PM: 🎤 Ash Davies - Navigation in a Multiplatform World: Choosing the Right Framework for your App Navigation in mobile, desktop, and web applications is a fundamental part of how we structure our architecture, providing both functional clarity and abstraction from platform-level implementation. For a long time, there have been options available specific to each platform, and even options that are part of the platform framework itself. It can be difficult, though, to find the right option for platform-agnostic code that ensures consistency. Some go one step further, providing an opinionated guide on how to architect your application. In this talk, I'll evaluate the options available, how they differ, and to what type of applications they are best suited. I'll also cover how to get started with them and the best-practice guidelines for getting the most out of them in your application. 5:10 PM: 🎤 Vadim Makeev - You don’t know MathML. Almost nobody does Do you speak math? Me neither. Still, math formulas have always been around: from Wikipedia articles to JavaScript APIs and even CSS docs. It looks so alien that I never had a clue how to express it on the web. Apparently, there’s a markup language for that. HTML for content, SVG for vector graphics, and MathML for math! And it’s pretty cross-browser, too. Let’s dive into the basics and quirks of the language of the universe. Even if math is not your love language, you might learn something interesting about the web platform. Day 2 9:00 AM: Registration & Coffee 🥐 ☕️ 10:00 AM: 🎤 Alex Mir – Accessibility matters The regulators are here, and now businesses will care about a11y. Let's make a11y compliance more than a formal check. I believe that it is our job as industry experts to understand why it is important and get our products ready for all groups of people. 
10:50 AM: 🎤 Marco Gomiero - From Android to Multiplatform and beyond With Kotlin Multiplatform getting increasingly established, many Android libraries became multiplatform. But how to make an existing Android library multiplatform? In this talk, we will cover the common challenges faced while migrating Android libraries to Kotlin Multiplatform, like handling platform-specific dependencies, re-organizing the project structure without losing the contributor's history, testing on multiple platforms, and publishing the library. 11:20 AM: 🎤 Muhammad Salman Bediya - Crucial Performance Issue in Flutter Apps: Memory Leaks Memory leaks can be hard to spot but have a big impact on the performance of Flutter apps, especially those running for long periods. In this talk, we’ll explore the most common reasons memory leaks happen in Flutter and Dart, focusing on how asynchronous programming and Streams can make them more challenging. You’ll learn practical tips to identify and fix these issues, helping your apps run smoother and more efficiently. 11:40 AM: 🎤 Andrii Raikov - Maximizing Scalability with Go and Redis: A Telemetry Processing Journey At Delivery Hero, we process 10,000 requests per second using Go and Redis. Join us to learn how this powerful duo handles high-load telemetry data efficiently and cost-effectively, with scalability, resource optimization, and continuous innovation through customized data flows. 12:30 PM: 🎤 Tomek Porozynski - Can You Outsmart an AI? Adventures in Prompt Hacking In this talk combined with hands-on elements, participants will engage in a series of live prompt hacking challenges, accessible directly through their mobile devices. The workshop begins with simple prompt injection techniques and progressively moves to more sophisticated manipulation strategies. After each successful hack, I'll analyze what made it work and transform these insights into practical defense mechanisms. 
Attendees will learn: Common vulnerabilities in AI prompt design, Practical techniques for prompt injection attacks, Essential strategies for securing chatbot applications, Best practices for implementing defensive layers, Real-world examples of prompt security failures and successes Perfect for developers working with AI models, security enthusiasts, or anyone interested in building safer AI applications. No specialized tools needed - just bring your phone and creativity! You'll leave with concrete techniques for both testing and securing your AI systems against prompt manipulation attacks. 1:10 PM: Lunch 🍔🥤 2:40 PM: 🎤 Cesar Martinez - Domain Driven Design Fundamentals for Frontend Developers What can we learn from Domain Driven Design and how to start applying its teachings in your frontend codebase. 3:30 PM: 🎤 Vadym Pinchuk - Effortless optimization of Flutter apps: performance tips for developers In this session, we’ll dive into effortless yet impactful ways to optimize your Flutter applications. Performance improvements don’t always require a full rewrite—sometimes, small adjustments can lead to big gains. We'll explore practical tips and tricks for enhancing app speed, responsiveness, and efficiency with minimal effort. From reducing widget rebuilds to handling large data efficiently and managing state effectively, this talk will provide developers with actionable insights to deliver a smoother user experience. Whether you’re a beginner or an experienced Flutter dev, you’ll walk away with easy-to-apply techniques to optimize your apps without breaking a sweat. 4:20 PM: 🎤 Ian Ballantyne - Generative AI on Mobile and Web with Google AI Edge Generative AI is no longer limited to execution in the cloud. Small language models, such as Gemma 2B, are quickly becoming small and powerful enough for on-device AI, offering benefits like low latency, offline functionality, privacy, and cost-effectiveness. 
Google AI Edge, with MediaPipe and LiteRT (formerly Tensorflow Lite), enables the development and deployment of efficient on-device AI models. These frameworks handle the complexities of model execution and hardware acceleration, allowing developers to focus on creating innovative AI experiences. Think generative AI is just about chatbots? Think again. This talk will go beyond basic conversations with language models and explore how on-device generative AI can be integrated into everyday apps ready to help with tasks, answer questions, and provide creative inspiration, all powered by the information located on-device. Imagine truly useful apps that are quick to respond and still work without an internet connection. 5:10 PM: 🎤 Bogdan Plieshka - Automated Testing Layers in a multidimensional Monorepo: Fast-tracking Quality for hundreds apps In this talk, I’ll dive into the testing layers that make up our quality pipeline at Zattoo, including static analysis, unit, system, and end-to-end testing. We’ll discuss the concept of quality gates, shift-left approach, and affected domain recognition, which helps us maintain reliability across a large, dynamic codebase, bringing total quality feedback for contributors to 3 minutes. I’ll share practices for achieving scalable, fast testing in a high-complexity environment, offering insights for anyone working with large-scale applications or monorepos and looking to streamline QA processes. Day 3 9:00 AM: Registration & Coffee 🥐 ☕️ 10:00 AM: 🎤 Inès Mir & Doruk Deniz Kutukculer - Fellowship of Product. How your team setup affects your experience Did you know there are 2 types of team formation in tech? These formations can change your experience in the team drastically and you better recognise them early to adjust your expectations from the job. And even more importantly, you need to show different qualities on job interviews to get this job in a particular team formation! 
Deniz Doruk Kuetuekcueler, a head of engineering, and Inès Mir, a principal product designer, are trying to figure out how design and engineering can effectively work together in these setups. 10:50 AM: 🎤 Alireza Rahmaty - How we automate the App Release Monitoring at GetYourGuide App release monitoring (ARM) represents a suite of innovative tools designed to monitor the health and stability of iOS and Android app releases. These tools provide real-time updates by sending notifications to Slack channels and logging the app's status throughout the release process. At GetYourGuide, we have developed an ARM to monitor the rollout of our Android and iOS apps from the moment they are submitted to the App Store & Google Play until they are fully released. We ship releases faster and with more confidence using ARM! 11:40 AM: 🎤 Aleksandr Gorbunov - Flutter for frontenders or There and Back Again Every developer, regardless of specialization, may encounter the need to create a UI for a client application. The choice of technology may depend on the developer, or it may be pre-determined by the client, as happened in my case. The peculiarity is that, coming from frontend development in JavaScript, I started building user interfaces in Flutter. Today, there is a vast number of technologies that enable the development of cross-platform applications. These technologies are evolving rapidly, attracting large communities, and more frequently, companies are adopting them. For example, Flutter is a powerful framework that allows developers to create cross-platform applications. With a high probability, every developer may encounter the need to use such development tools, and it’s great that frameworks like Flutter come with detailed documentation and extensive community support, making it relatively easy to start developing with them. Although, at first glance, everything might not seem smooth, and the desire to revert to familiar methods may arise. 
12:05 PM: 🎤 Muhammad Salman Bediya - Crucial Performance Issue in Flutter Apps: Memory Leaks Memory leaks can be hard to spot but have a big impact on the performance of Flutter apps, especially those running for long periods. In this talk, we’ll explore the most common reasons memory leaks happen in Flutter and Dart, focusing on how asynchronous programming and Streams can make them more challenging. You’ll learn practical tips to identify and fix these issues, helping your apps run smoother and more efficiently. 12:30 PM: 🎤 Ole Bulbuk - Native GUIs For All Traditionally, native GUIs are highly platform dependent and often specific to one programming language. In this talk we will explore a way to create GUI applications that supports virtually all platforms and any programming language. It is very effective and easy to use, too. 1:10 PM: Lunch 🍔🥤 2:40 PM: 🎤 Nicole Terc - Tap it! Shake it! Fling it! Sheep it! - The Gesture Animations Dance! Let's have fun with animations, gestures and sensors! Using Compose Multiplatform, we'll go over how to create animations using gestures and sensor events for Android & iOS. We'll cover some basics like how to get the device motion and position information, how to track gestures on the screen, and how you can combine them with animations to have fun! After this talk, you'll have a better understanding of how to use the sensor frameworks, how to make your own gesture effects, and how to create interesting animations in an easy way. Keep it fun, keep it animated! 3:30 PM: 🎤 Andrii Khrystian - From waves to widgets: Sound processing in Flutter In this talk, we'll explore how to work with sound in Flutter apps. We'll go over the basics of adding sound effects and processing audio to make your apps more interesting. You'll learn how to handle audio files and integrate them smoothly with your Flutter projects. This session is great for anyone looking to add audio features to their apps simply and effectively. 
4:20 PM: 🎤 Randy Nel Gupta - From Practice: Migration of an Order Processing System to the Cloud A case study on how an order processing system, processing 50,000 orders daily for an international retailer spread across multiple continents and jurisdictions, is migrated to the cloud. The legacy system is implemented in PL/SQL and must be migrated during ongoing operations. The presentation will cover all aspects from testing, monitoring, to development and the application of Site Reliability Engineering. Furthermore, less technical topics will be introduced, such as the systematic composition of teams to ensure the necessary technical as well as domain-specific expertise. 4:50 PM: 🎤 Wietse Venema - Running open large language models in production with serverless GPUs Many developers are interested in running open large language models, such as Google's Gemma and Llama. Open models give you full control over the deployment options, the timing of model upgrades, the private data that goes into the model, and the ability to fine-tune on specific tasks such as data extraction. Hugging Face TGI is a popular open-source LLM inference server, and Hugging Face TRL is excellent for fine-tuning. You’ll learn how to build and deploy an application that uses an open model on Google Cloud Run with cost-effective GPUs that scale down to zero instances. Day 4 9:00 AM: Registration & Coffee 🥐 ☕️ 10:00 AM: 🎤 Daniel Stamer & Diana Nanova - Workshop: From Prototype to Production In this hands-on technical workshop participants will work on a hilarious web service prototype and deploy it to the cloud, set up build and deployment pipelines, extend the code base to leverage GenAI functionality, use SRE practices to effectively operate the application and finally strengthen the security posture of the overall software delivery process to guard against supply chain attacks. 
1:10 PM: Lunch 🍔🥤 2:40 PM: 🎤 John Nguyen - Building a Chrome Extension using Gemini and Langchain In this workshop, you will learn the basics of creating a Google Chrome Extension (which will also work on any Chromium-based Browser). We will build a simple Page summarizer using Bun, TypeScript, Gemini, and LangChain. We will learn the anatomy of the manifest.json for building a Chrome Extension, Bun's bundler, how to interact with Gemini, and why LangChain is a good idea here. 3:45 PM: 🎤 Guillaume Vernade - How to make the most of Gemini multimodal capabilities? We all know that in Tech there are always dozens of ways of doing anything. But what if we could only use an LLM for a first investigation? Let me show you how I'm trying to solve the mystery of who killed my pond's fishes using the power of Gemini. Day 5 9:00 AM: Registration & Coffee 🥐 ☕️ 10:00 AM: 🎤 Mario Bodemann & Joost van Dijk - Workshop: Passkeys on Android: How to get rid of passwords Passwords. Or two factors? What about multiple factors? Which email did you register with? Why is 'password123' not working on this site, when that password is shared everywhere else? If you recognize some of those questions, I am happy to add another couple: What are passkeys? Or how about: How to use passkeys to replace passwords in an Android app? In this workshop I will walk through the latter two questions: How to build an Android App that registers and signs users in, using passkeys. Expect a quick explanation of this fancy new technology, why it will replace passwords and how you can store them either on your mobile devices or on dedicated hardware. Following that, a fictitious application and service will be built to show you how to use those passkeys and which moving pieces you will need. Expect to use your Android Studio with Kotlin and common best practices to build an Android app, talking to the publicly available backend. 
11:05 AM: 🎤 Anton Borries - Workshop: Adding Homescreen Widgets to Flutter Apps HomeScreen Widgets are a great way to provide more Information to your Users right on their HomeScreens providing more ways for your App to appear in User's lives and help them achieve their goals. In this Workshop we'll look at the necessary steps needed in order to add HomeScreen Widgets to Flutter Apps using the home_widget package 12:10 PM: 🎤 Elena Grahovac - Workshop: Mastering Multiple Engineering Leadership Roles for Maximum Impact As an engineering manager or technical leader, navigating multiple roles that demand a diverse set of skills is a common yet challenging part of the job. In this workshop, we will explore how to effectively balance these multiple roles and responsibilities in a complex engineering environment. Participants will be guided through the creation of their own leadership framework, tailored to adapt to the unique situations and styles of each individual. Beginning with identifying core values and responsibilities, the framework is elaborated into an actionable plan to succeed. This workshop not only offers an opportunity for reflection on personal and professional development but also provides tools and insights to enhance management capabilities and team dynamics. Join us to cultivate a comprehensive approach to leadership that aligns with your unique role, responsibilities, and personal style. 1:10 PM: Lunch 🍔🥤 2:40 PM: 🎤 Gus Martins - Workshop: Gemma for Everyone: Your First Steps with Open Models and AI Dive into the world of open models and AI with Gemma! This workshop will guide you through the basics of using Gemma, Google's powerful family of language models. Learn how to harness Gemma's capabilities for tasks like text generation, question answering, and more. We'll also explore how to fine-tune Gemma on your own data, allowing you to create custom AI solutions tailored to your needs. No prior experience with large language models is required! 
3:45 PM: 🎤 Shahriyar Rzayev - Learn Flask the hard way: Introduce Architecture Patterns Flask is a popular and flexible web framework for Python, but building scalable and maintainable Flask applications can be challenging without a solid understanding of architecture patterns. This workshop aims to provide participants with a detailed explanation of applying architecture patterns to Flask projects. By exploring various design principles and best practices, attendees will learn how to structure their Flask applications for improved scalability, modularity, and maintainability. Focusing on the Repository, Unit of Work, and Use Cases patterns, attendees will gain experience in applying these patterns to enhance code organization, maintainability, and testability. All these layers are wired together using Dependency Injection, which is yet another powerful tool to use in your applications. The application we are going to build is stored in: https://github.com/ShahriyarR/hexagonal-flask-blog-tutorial We are going to completely rewrite the official Blog application described in Flask documentation by applying architecture patterns. All abstraction layers are covered by unit and integration tests, which will give the attendees a detailed view of why it is important to structure the application using architecture patterns. Speakers Aleksandr Gorbunov - Smart Steel Technologies (Full Stack Developer) A skilled developer specializing in JavaScript (JS) and TypeScript (TS), with strong expertise in frontend development. Proficient in the Vue ecosystem (Vue2, Vue3, Composition API, Nuxt 3), using Webpack and Vite for project bundling. Experienced in testing with Vitest, Cypress, and Jest. Adept in CSS preprocessors like SASS and Stylus. 
Additionally, has solid knowledge of Flutter and experie… Andrey Sitnik - Evil Martians (Lead Engineer) With more than 20 years in open source, Andrey Sitnik created a few popular CSS tools (PostCSS, Autoprefixer), local-first framework (Logux), and many small libraries with millions of downloads (like Nano ID). Andrii Khrystian - Dynatrace (Senior Flutter Developer) GDG Linz organiser. Senior Flutter Developer at Dynatrace. Public speaker and tech writer Andrii Raikov - Delivery Hero SE (Principal Software Engineer) Andrii is a Principal Software Engineer at Delivery Hero. He has a total of 15 years of experience with Ruby and has been very passionate about Go for the last 5 years. Anton Borries - 1KOMMA5° (Software Engineer) Anton is a Software Engineer working at 1KOMMA5° He loves building great UI and UX using Flutter. Coming from an Android Background the gap between Flutter and native Features has always tickled his interest. This has lead him into improving the experience of developing HomeScreen Widgets for Flutter Apps Ash Davies Google Developer Expert for Android, enthusiastic speaker, lead engineer at ImmobilieenScout24, Kotlin aficionado, spends more time travelling than working. Daniel Stamer - Google (Cloud Customer Engineer) Daniel is passionate about building modern cloud-native applications on Google's serverless technologies. He works with digital natives out of Germany’s startup capital Berlin and helps to modernize applications or build brand new ones in the cloud. Danny Preussler - SoundCloud (Android Platform Lead) Danny is a developer by heart, living in Berlin and leading the Android team at SoundCloud. He worked for companies like Groupon, Viacom, eBay and Alcatel and started his mobile career long before any Android with Java ME and Blackberry applications. Danny writes and talks about mobile development and testing regularly and is a Google Developer Expert for Android and Kotlin. 
Elena Grahovac - FerretDB (Director of Engineering) Elena has been in software engineering since 2007, focusing on backend systems and infrastructure. Having played the roles of both individual contributor and engineering manager, Elena is passionate about combining technical expertise with strong team collaboration. A dedicated advocate of DevOps practices, she aims to enhance workflows and bring teams together. Elena believes in helping peopl… Gus Martins - Google (Developer Advocate) Katya Vinnichenko - Google (Program Manager) Katya is a Program Manager at Google Developer Relations team. Currently she is leading the Google Developer Groups across Europe, the Middle East and Africa. Marcin Chudy - LeanCode (Senior Flutter Developer) Marcin is a Senior Flutter Developer at LeanCode, currently playing tech lead role in a big project for the banking sector. Previously worked with backend, web frontend with React, finally settling on mobile and falling in love with Flutter at first sight. After work, he enjoys dancing salsa and bachata and attends metal concerts. Marcin is a Senior Flutter Developer at LeanCode and has … Marco Gomiero - Airalo (Senior Android Developer | Kotlin GDE) Marco is an Android engineer, currently working at Airalo. He is a Google Developer Expert for Kotlin, he loves Kotlin and he has experience with native Android and native iOS development, as well as cross-platform development with Flutter and Kotlin Multiplatform. In his spare time, he writes and maintains open-source code, he shares his dev experience by writing on his blog, speaking a… Mario Bodeman - Yubico (Android Developer Advocate) Speaker of talks, coder of code, doer of dones. Muhammad Bediya Muhammad Salman is a Senior Software Engineer specializing in mobile app development with a focus on building scalable, high-quality applications using Flutter, React Native, Xamarin, and Swift. 
With experience leading frontend teams on enterprise-level projects that have reached over 1.5 million users, he brings a strong commitment to creating impactful, user-centered solutions. A dedic… Nicole Terc Android GDE, Boardgame lover, videogame addict and origami enthusiast, Nicole self taught herself to code and has been fooling around with the Android ecosystem for more than 10 years. She has participated in a diverse variety of projects for several clients around the world, including video streaming, news, social media and public transport applications. Regardless of what the current adventu… Ole Bulbuk - Ardan Labs Ole is a backend engineer since the nineties. He has been working for many companies big and small and seen many projects fail or succeed. He loves to be part of the global Go community and working on projects that make the world a better place. In his spare time he is co-organising the Berlin chapter of GDG Golang, develops open source software and enjoys time with his family. Oleksii Antypov - DmarcDkim.com (Founder & CEO) Experienced CTO specializing in early-stage startups. Formerly with Rocket Internet and PocketBook, now focused on accelerating global DMARC adoption. Originally from Ukraine, I relocated to Berlin in 2015 to deepen my expertise in building successful startups from the ground up. Raphaël VO - Ekino (Senior Software Engineer) I’m Raphael Vo, a passionate Senior Software Engineer with over 10 years of experience, specializing in Angular and frontend development. I love turning complex ideas into delightful user experiences and tackling challenges creatively and enthusiastically. When I'm not coding, you’ll find me diving into the latest tech trends or enjoying epic board game nights with friends. As an aspiring spea… Vadim Makeev Frontend developer in love with the Web, browsers, bicycles, and podcasting. He/him, MDN technical writer, Google Developer Expert. 
Alex Mir - mobile.de (Frontend Engineer) Frontend Engineer at car retail platform mobile.de (part of Adevinta / ex-Ebay) Alireza Rahmaty - GetYourGuide (Android Developer) I am Alireza, an Android developer with 6+ years of experience building apps. I have experience building server-driven UI apps, complex UI, localisation and testing, and CI/CD. I sometimes go hiking and play video games. Cesar Martinez - Meyer Sound (Web Developer) Web developer with around 10 years of experience and a passion for software architecture. Currently working at Meyer Sound. Bogdan Plieshka - Zattoo (Principal Engineer) Engineer with over a decade of Frontend development experience, passionate about automation, accessibility, and scaling complex systems. Working at Zattoo as a Principal Engineer, focusing on delivering frontend solutions across Web, React, and React Native for streaming media content. Organizer of the React Berlin Meetup, actively contributing to the development community. Diana Nanova - Google (Customer Engineering Manager) Diana is a Customer Engineering Manager at Google Cloud. Based in the German tech startup capital Berlin, Diana helps digital native customers and startups across various industries to leverage the capabilities of Google Cloud and loves championing Google culture. Doruk Deniz Kutukculer - Zalando (Head of Engineering) IT professional and a leader with over 15 years of experience in the industry. Currently a Head of Engineering at Zalando. Guillaume Vernade - Google (AI Dev Rel) I've been a jack-of-all-trades in the Tech industry, starting as a prototyper building apps on Google Glasses and the first Android watches, then became a Product Owner and an Agile coach. I realized my childhood dream of becoming a video game producer then came back to my other passion: AI. Ian Ballantyne - Google (AI DevRel) Ian is a Developer Relations Engineer for AI at Google. Currently he works on generative AI, such as Gemini and Gemma. 
He is passionate about on-device AI, using technologies such as Google AI Edge to deploy artificial intelligence to web and mobile devices. He has been in Developer Relations at Google for 9 years specializing in helping partners and developers unlock the capability of Google … Inès Mir - Zalando (Principal Product Designer) A principal product designer at Zalando and a content creator. John Nguyen - Eon (Backend Developer) Fullstack developer with a knack for whipping up code recipes using my secret ingredients: a dash of JavaScript, a pinch of Python, and a whole lot of serverless magic. John's journey in software development began as a PHP developer, but he later transitioned to front-end development and became passionate about all things related to JavaScript. While working as a data DevOps engineer in a… Joost van Dijk - Yubico (Developer Advocate) Joost van Dijk is a developer advocate at Yubico. As the inventor of the YubiKey, Yubico makes secure login easy and available for everyone. Joost focuses on securing digital identities and accelerating the adoption of open authentication standards as part of Yubico’s developer program. Randy Gupta Randy is a Google Developer Expert for Cloud and also Organizer of the GDG Düsseldorf. With more than 25 years of professional experience in software development, he is focused today on building microservices applications on top of Kubernetes. Shahriyar Rzayev - Nord Security (Senior Software Engineer) Senior Software Engineer @ Nord Security. Moving forward on Clean Code and Clean Architecture. Previous accomplishments include contributing to open source, providing technical direction, and sharing knowledge about Clean Code and Architectural patterns. An empathetic team player and mentor. Azerbaijan Python Group Leader. Former QA Engineer and Bug Hunter. 
Tomek Porożyński - Atos Vadym Pinchuk - Sky (Mobile Software Engineer) Vadym, a seasoned software engineer, possesses a wealth of experience in Android application development. He has skillfully transitioned his expertise to cross-platform development, utilizing Flutter. Throughout his career, Vadym has collaborated with a diverse range of companies, from industry giants like Samsung, Volvo, Bosch, and Instagram to smaller start-ups. Leveraging his extensiv… Wietse Venema - Google (Google Cloud Engineer) Wietse Venema is an engineer at Google Cloud. He wrote the O’Reilly book on Cloud Run. Hosts Seemran Xec - Sawayo (Software Engineer) A focused developer possessing professional experience of 6+ years in software development for product-based and service-based industries, with businesses acquiring valuable insight and implementing best practices. Collaborated with startups and other businesses as a freelancer/consultant to build, design, and manage the product. I'm passionate about what I do and a lifelong learner. Louis Tsai - Zalando SE (GDG Organizer) Alex Mir - mobile.de (Frontend Engineer) Frontend Engineer at car retail platform mobile.de (part of Adevinta / ex-Ebay) Jhoon Saravia - Greenmates (Mobile Engineer) Software consultant and developer, experienced in Android, Flutter and Full-stack. Interested in working on DEI initiatives as a complement to my core work. Particularly interested in technology, gadgetry, the future, the combination of those three and the impact that driving Diversity, Equity and Inclusion has on all of them both in and out of the workplace.Amateur photographer a… Matthias Geisler - Thermondo (Senior Software Engineer) True believer in (Kotlin) Multiplatform and working with it for over 4 years now. Builds solutions for Android. Maintainer and developer of KMock. Co-Organizer of KUG Berlin, GDG Android Berlin, Rust Berlin and XTC Berlin. 
Emy Jamalian - Atlas Metrics (Software QA Engineer) Complete your event RSVP here: https://gdg.community.dev/events/details/google-gdg-berlin-presents-devfest-berlin-2024/. |
DevFest Berlin 2024
|
|
Madrid dbt Meetup #5 (in-person)
2024-11-21 · 17:45
dbt Meetups are networking events open to all folks working with data! Talks predominantly focus on community members’ experience with dbt; however, you’ll also catch presentations on broader topics such as analytics engineering, data stacks, DataOps, modeling, testing, and team structures. 📍 Venue Host: Utopicus Habana (P.º de La Habana, 9, 11, 28036 Madrid) 🍕 Catering: Drinks & Pizza at the place of the event 🤝 Organizer: Astrafy is organizing this event, enabled by the community team at dbt Labs. To attend, please read the Health and Safety Policy and Terms of Participation: https://www.getdbt.com/legal/health-and-safety-policy 🗓️ Agenda:
🗣️Presentation #1: dbt can be leveraged for more than just basic testing. We will dive into advanced data validation techniques that ensure data quality beyond conventional testing in dbt. We will use Recce, an emerging tool that supports validation checks and improved approval requests. Speaker bio: Alejandro de la Cruz López is an experienced Data Engineer with a strong background in Data Science and Artificial Intelligence. He has led various data projects, optimizing systems and improving infrastructure for several organizations. Alejandro holds multiple professional certifications and has authored articles on data engineering practices. His work focuses on delivering efficient, scalable data solutions in the cloud. --- 🗣️Presentation #2: In this presentation, Miquel Angel will show how Okta dynamically builds all dbt DAGs from upstream to downstream based on tags and the dbt project structure, automates tests inside the DAGs, and uses the same warehouse configuration for both dbt runs and tests. Speaker bio: Data Engineer specialized in ETL, BigData processes, and DevOps. --- 🗣️Presentation #3: Today, we have tools to enforce quality checks on projects at the model level, like dbt_project_evaluator. Those tools are indispensable for allowing teams to scale their dbt transformations. So far, though, we've been focusing on rules at the model level. Could we leverage column-level lineage (CLL) to also define rules at the column level now? The idea of this talk is to build an open-source tool and present what problems it can solve. Speaker bio: Staff Analytics Engineer at dbt Labs ➡️ Join the dbt Slack community: https://www.getdbt.com/community/ 🤝For the best Meetup experience, make sure to join the #local-\ channel in dbt Slack (https://slack.getdbt.com/). ---------------------------------- dbt is the standard in data transformation, used by over 40,000 organizations worldwide. 
Through the application of software engineering best practices like modularity, version control, testing, and documentation, dbt’s analytics engineering workflow helps teams work more efficiently to produce data the entire organization can trust. Learn more: https://www.getdbt.com/ |
Madrid dbt Meetup #5 (in-person)
|
|
Eindhoven Data Community meetup 19 - ASML
2024-11-21 · 16:00
We’re excited to return to ASML for our annual meetup! This year, we have two concurrent tracks featuring a total of four talks. We're also thrilled to welcome a special guest from the US coming over to ASML: Joe Reis, co-author of "Fundamentals of Data Engineering," who will be doing an AMA. Joe \| Ask Me Anything about Data Engineering or Otherwise Joe Reis is here to answer all of your questions about data engineering, the state of the industry and technology, and anything else on your mind. This is a very rare chance to have a free-flowing conversation with Joe Reis. Cristiano & Shashank \| Automating Creating Trusted Data Products: a developer experience-driven approach Creating high-quality data products is a complex task that often burdens data professionals with repetitive activities. Our trusted dataset creation framework aims to alleviate this challenge by providing a comprehensive mechanism that automates essential processes in data product development. This presentation will delve into how it not only simplifies workflows but also improves developer experience by enhancing feedback loops and reducing cognitive load. Juan \| Standardization of Predictive Maintenance Pipelines Juan will show how his team, The Model Factory, is currently setting up a framework that makes all of its predictive maintenance pipelines follow standards that ensure 1) short time-to-market, 2) maintainability, and 3) interpretability of outputs and intermediate calculations. Ismael & Ricardo \| Airflow 3.0: A New Perspective on MLOps and GenAI The new version of Airflow is more than just a tool for data orchestration, and it is coming in early 2025. It is evolving to meet the needs arising from the explosion of GenAI applications, and it is even changing its internal architecture to be faster and more flexible. In this talk, we'll discuss how Airflow 3.0 is evolving to support the requirements of modern applications. 
We'll also provide a practical example of using Airflow with a RAG implementation. It's a look at the future of Airflow, and we hope you'll join us. Program 17:00 – 18:00 🍕 Food Track 1
Track 2
20:00-21:00 🥤 Drinks 20:15-21:00 Tour ASML experience center Joe Reis \| Author, data engineer, "recovering data scientist" Joe Reis, a "recovering data scientist" with 20 years in the data industry, is the co-author of the best-selling O'Reilly book, "Fundamentals of Data Engineering." He’s also the instructor for the wildly popular Data Engineering Professional Specialization on Coursera, created with DeepLearning.ai and AWS. Joe’s extensive experience encompasses data engineering, data architecture, machine learning, and more. He regularly keynotes major data conferences globally, advises and invests in innovative data product companies, writes at Practical Data Modeling and his personal blog, and hosts the popular data podcasts "The Monday Morning Data Chat" and "The Joe Reis Show." In his free time, Joe is dedicated to writing new books and articles, and thinking of ways to advance the data industry. Cristiano Rocha \| Lead Data Engineer Cristiano is a lead engineer at ASML with an educational background in Distributed and Parallel Computing. With 15+ years of experience in on-premises and cloud data-based solutions, Cristiano has a wealth of knowledge in building and maturing high-impact data platforms and self-service analytics programs for large organizations. He has extensive experience in a variety of roles, including data infrastructure engineer, self-service analytics platform engineer, data engineer, big data competence lead, DataOps competence lead, machine learning engineer, and data analyst. Shashank Shekhar \| Senior Data Engineer Shashank is a Senior Data Engineer at ASML with extensive expertise in cross-cloud technologies and architecting and optimizing data pipelines that drive actionable insights. Over 7 years in the industry, Shashank has successfully executed complex data projects, enabling organizations to harness the full potential of their data. 
Juan Manuel Ortiz Sevillano \| Machine Learning Engineer Juan is originally a Data Scientist who turned into a Machine Learning Engineer driven by the need to make ML models produce actual value. He currently focuses on reducing time-to-market and improving maintainability of Predictive Maintenance pipelines at ASML. Ismael Cabral \| Author, Machine Learning Engineer Ismael is a Machine Learning Engineer and Airflow trainer at Xebia Data in the Netherlands. At the same time, he is currently co-authoring the 2nd edition of “Data Pipelines with Apache Airflow”. Ricardo Granados \| Author, Analytics Engineer Ricardo Granados, co-author of Fundamentals of Analytics Engineering, is an analytics engineer specializing in data engineering and analysis. With a master’s in IT management and a focus on data science, he is proficient in using various programming languages and tools. Ricardo is skilled in exploring efficient alternatives and has contributed to multicultural teams, creating business value with data products using modern data stack solutions. As an analytics engineer, he helps companies enhance data value through data modeling, best practices, task automation, and data quality improvement. Note: For security reasons, we must register all visitors in advance. When registering, we ask for additional information such as first and last name, e-mail address and possibly the license plate of your vehicle if you want to use a parking facility. Please use the extra field "Reason for visiting" to register your license plate. Please note: bring a valid ID! |
Eindhoven Data Community meetup 19 - ASML
|
|
Cutting-Edge AI and Machine Learning Innovations
2024-09-10 · 15:30
PLEASE REGISTER HERE: https://meetup-september10.kickstartai-events.org PyData Eindhoven & Kickstart.ai We are very happy to announce that we are organizing the next PyData Eindhoven meetup in collaboration with Kickstart.ai! We will host this meetup at the AI Innovation Center, located at the High Tech Campus in Eindhoven. This meetup is set to take place on September 10, 2024, at the AI Innovation Center in the High Tech Campus, Eindhoven. Mark your calendar and join us for an evening of insights, networking, and cutting-edge technology! Learn the ins and outs of cutting-edge AI and machine learning with our expert speakers, Merel Theisen (Principal Software Engineer at QuantumBlack) and Emilio Oldenziel (Machine Learning Engineer at Eraneos). Discover how software engineering principles can elevate machine learning projects and how AI is optimizing rail traffic control through innovative solutions. Come and connect with fellow AI experts at our Cutting-Edge AI and Machine Learning Innovations meetup, where you'll have the chance to learn from top experts in the field and share your own experiences. Our events provide a platform for collaboration and knowledge sharing, helping us all to advance in the fast-evolving world of AI. And after an evening of learning, enjoy networking drinks and bites, where our speakers will be available to chat and answer any questions you have about AI and machine learning. Program
Embedding Software Engineering Best Practices into Machine Learning Projects with Kedro In this talk, I will explore how software engineering best practices such as modularity, separation of concerns, testability, and reproducibility can elevate the quality and deployability of machine learning projects. Focusing on the Kedro framework, I’ll uncover how these principles integrate into data workflows, making complex projects more manageable and scalable. Attendees will gain practical insights into improving project design, ensuring code quality, and facilitating smoother transitions to production environments. No extensive software engineering background is required, making this an accessible and informative session for all data professionals looking to enhance their knowledge of software principles through Kedro. About I am a Principal Software Engineer at QuantumBlack, where I am currently the tech lead of Kedro, an open-source project part of the Linux Foundation. I have over eight years of experience in the software industry, with most of my career focused on backend product engineering. I am passionate about building products that solve real user problems, and I care deeply about creating robust, well-tested software that follows good engineering principles. I am also a strong advocate for open-source software, and I find working with the community to be both inspiring and energising. ---- Optimizing Rail Traffic Control using a Digital Twin and Reinforcement Learning In this talk, I will do a deep dive into one of our recent customer cases. In the customer’s railway network, train frequency is high, with trains departing every 3 minutes at some stations. Consequently, even a small disruption can affect the punctuality of many subsequent trains. Train dispatchers are tasked with resolving conflicts efficiently to minimize delays. 
I will discuss how a Digital Twin and using Reinforcement Learning (RL) can support dispatchers in making smarter decisions and the challenges of implementing RL solutions. I will demonstrate how combining scientific and practical RL knowledge reduced delays by over 58,000 minutes annually. About Emilio is an expert in the field of machine learning. At the Eraneos Data & AI practice he is responsible for advising on, developing and implementing AI solutions. He has a background in computing science and has worked on AI projects for companies like Porsche, Deutsche Bahn, Enexis and HTM. Transport and Logistics is one of his industry focuses, where he sees a lot of value in applying AI. |
Cutting-Edge AI and Machine Learning Innovations
|
|
DataOps, Observability, and The Cure for Data Team Blues - Christopher Bergh
2024-08-15 · 08:07
Johanna Berer
– Host/Interviewer
@ DataTalks.Club
,
Christopher Bergh
– CEO and Founder
@ DataKitchen
0:00 hi everyone welcome to our event this event is brought to you by DataTalks.Club which is a community of people who love 0:06 data and we have weekly events and today's one is one of such events and I guess we 0:12 are also a community of people who like to wake up early if you're from the States right Christopher or maybe not so 0:19 much because this is the time we usually have uh uh our events uh for our guests 0:27 and presenters from the States we usually do it in the evening of Berlin time but yes unfortunately it kind of 0:34 slipped my mind but anyways we have a lot of events you can check them in the 0:41 description like there's a link um I don't think there are a lot of them right now on that link but we will be 0:48 adding more and more I think we have like five or six uh interviews scheduled so um keep an eye on that do not forget 0:56 to subscribe to our YouTube channel this way you will get notified about all our future streams that will be as awesome 1:02 as the one today and of course very important do not forget to join our community where you can hang out with 1:09 other data enthusiasts during today's interview you can ask any question there's a pinned link in the live chat so click 1:18 on that link ask your question and we will be covering these questions during the interview now I will stop sharing my 1:27 screen and uh there is there's a a message in uh and Christopher is from 1:34 you so we actually have this on YouTube but so they have not seen what you wrote 1:39 but there is a message from to anyone who's watching this right now from Christopher saying hello everyone can I 1:46 call you Chris or you okay I should go I should uh I should look on YouTube then okay yeah but anyways I'll you don't 1:53 need like you we'll need to focus on answering questions and I'll keep an eye 1:58 I'll be keeping an eye on all the questions so um 2:04 yeah if you're ready we can start I'm ready yeah and you prefer Christopher 2:10 not Chris right Chris 
is fine Chris is fine it's a bit shorter um 2:18 okay so this week we'll talk about DataOps again maybe it's a tradition that we talk about DataOps every like once per 2:25 year but we actually skipped one year so because we did not have we haven't had 2:31 Chris for some time so today we have a very special guest Christopher Christopher is the co-founder CEO and 2:37 head chef or head cook at DataKitchen with 25 years of experience maybe this 2:43 is outdated uh cuz probably now you have more and maybe you stopped counting I 2:48 don't know but like with tons of years of experience in analytics and software engineering Christopher is known as the 2:55 co-author of the DataOps Cookbook and DataOps Manifesto and it's not the 3:00 first time we have Christopher here on the podcast we interviewed him two years ago also about DataOps and this one 3:07 will be about DataOps so we'll catch up and see what actually changed in in 3:13 these two years and yeah so welcome to the interview well thank you for having 3:19 me I'm I'm happy to be here and talking all things related to DataOps and why 3:24 why why bother with DataOps and happy to talk about the company or or what's changed 3:30 excited yeah so let's dive in so the questions for today's interview are prepared by Johanna Bayer as always 3:37 thanks Johanna for your help so before we start with our main topic for today 3:42 DataOps uh let's start with your background can you tell us about your career journey so far and also for those who 3:50 have not heard have not listened to the previous podcast maybe you can um talk 3:55 about yourself and also for those who did listen to the previous you can also maybe give a summary of what has changed 4:03 in the last two years so we'll do yeah so um my name is Chris so I guess I'm 4:09 a sort of an engineer so I spent about the first 15 years of my career in 4:15 software sort of working and building some AI systems some non-AI systems uh 4:21 at uh US's NASA and MIT
Lincoln Lab and then some startups and then um 4:30 Microsoft and then about 2005 I got I got the data bug uh I think you know my 4:35 kids were small and I thought oh this data thing was easy and I'd be able to go home uh for dinner at 5 and life 4:41 would be fine um because I was a big you started your own company right and uh it didn't work out that way 4:50 and um and what was interesting is is for me it the problem wasn't doing the 4:57 data like I we had smart people who did data science and data engineering the act of creating things it was like the 5:04 systems around the data that were hard um things it was really hard to not have 5:11 errors in production and I would sort of be driving to work and I had a BlackBerry at the time and I would not look at my 5:18 BlackBerry all all morning I had this long drive to work and I'd sit in the parking lot and take a deep breath and 5:24 look at my BlackBerry and go uh oh is there going to be any problems today and I'd be and if there wasn't I'd walk in 5:30 very happy um and if there was I'd have to like brace myself um and you know and 5:36 then the second problem is the team I worked for we just couldn't go fast enough the customers were super 5:42 demanding they didn't care they all they always thought things should be faster and we are always behind and so um how 5:50 do you you know how do you live in that world where things are breaking left and right you're terrified of making errors 5:57 um and then second you just can't go fast enough um and it's pre-Hadoop era 6:02 right it's like before all this big data tech yeah before this was we were using 6:08 uh SQL Server um and we actually you know we had smart people so we we we 6:14 built an engine in SQL Server that made SQL Server a columnar 6:20 database so we built a columnar database inside of SQL Server um so uh 6:26 in order to make certain things fast and and uh yeah it was it was really uh it's not 6:33 bad I mean the principles are the same right before
Hadoop it's it's still a database there's still indexes there's 6:38 still queries um things like that we we uh at the time uh you would use OLAP 6:43 engines we didn't use those but you those reports you know are for models it's it's not that different um you know 6:50 we had a rack of servers instead of the cloud um so yeah and I think so what what I 6:57 took from that was uh it's just hard to run a team of people to do do data and analytics and it's not 7:05 really I I took it from a manager perspective I started to read Deming and 7:11 think about the work that we do as a factory you know and in a factory that produces insight and not automobiles um 7:18 and so how do you run that factory so it produces things that are good of good 7:24 quality and then second since I had come from software I've been very influenced 7:29 by by the DevOps movement how you automate deployment how you run in an agile way how you 7:35 produce um how you how you change things quickly and how you innovate and so 7:41 those two things of like running you know running a really good solid production line that has very low errors 7:47 um and then second changing that production line at at very very often they're kind of opposite right um and so 7:55 how do you how do you as a manager how do you technically approach that and 8:00 then um 10 years ago when we started DataKitchen um we've always been a profitable company and so we started off 8:07 uh with some customers we started building some software and realized that we couldn't work any other way and that 8:13 the way we work wasn't understood by a lot of people so we had to write a book and a manifesto to kind of share our our 8:21 methods and then so yeah we've been in so we've been in business now about a little over 10 8:28 years oh that's cool and uh like what 8:33 uh so let's talk about DataOps and you mentioned DevOps and how you were inspired by that and by the way like do 8:41 you remember roughly when DevOps as I think
started to appear like when did people start calling these principles 8:49 and like tools around them as de yeah so Agile Manifesto well first of all the I 8:57 mean I had a boss in 1990 at NASA who had this idea build a 9:03 little test a little learn a lot right that was his mantra and then which made 9:09 made a lot of sense um and so and then the sort of agile software manifesto 9:14 came out which is very similar in 2001 and then um the sort of first real 9:22 DevOps was a guy at Twitter started to do automat automated deployment you know 9:27 push a button and that was like 2009-ish and so the first I think DevOps 9:33 meetup was around then so it's it's it's been 15 years I guess 6 like I was 9:39 trying to so I started my career in 2010 so I my first job was a Java 9:44 developer and like I remember for some things like we would just uh SFTP to the 9:52 machine and then put the JAR archive there and then like keep our fingers crossed that it doesn't break uh uh like 10:00 it was not really the I wouldn't call it this way right you were deploying you 10:06 had a deploy process I put it yeah 10:11 right was that so that was documented too it was like put the JAR on production cross your 10:17 fingers I think there was uh like a page on uh some internal wiki uh yeah that 10:25 describes like with passwords and don't like what you should do yeah that was and and I think what's interesting is 10:33 why that changed right and and we laugh at it now but that was why didn't you 10:38 invest in automating deployment or a whole bunch of automated regression 10:44 tests right that would run because I think in software now that would be rare 10:49 that people wouldn't use CI/CD they wouldn't have some automated tests you know functional 10:56 regression tests that would be the |
DataTalks.Club |
|
Let the tools worry — Be happy
2024-08-08 · 16:00
Do you use static analysis to its full potential? Do you write custom rules to assist in your daily development flow? In this talk, Theodor will provide an overview of the areas where static analysis excels and how it can benefit you and your teammates. |
|
|
Beyond Testing: QA as a Product Manager?
2024-08-08 · 16:00
Uliana will show how QA’s involvement in product management activities leads to a richer understanding of the “what” and “why” of product development, driving better decision-making and improved user satisfaction. |
|
|
QA and Product Management: Turning Failures into Success
2024-08-08 · 16:00
Dmitry Suholet – Product Manager @ Qase, Vitaly Sharovatov – Developer Advocate @ Qase
Vitaly Sharovatov (Developer Advocate) and Dmitry Suholet (Product Manager) will present a pair talk on what a product management failure can teach us about the collaboration between QA, developers, and product managers. |
|
|
May 8 - AI, Machine Learning and Computer Vision Meetup
2024-05-08 · 17:00
When: May 8, 2024 – 10:00 AM Pacific / 1:00 PM Eastern
Where: Virtual / Zoom: https://voxel51.com/computer-vision-events/may-8-2024-ai-machine-learning-data-science-meetup/
To Infer or To Defer: Hazy Oracles in Human+AI Collaboration
This talk explores the evolving dynamics of human+AI collaboration, focusing on the concept of the human as a “hazy oracle” rather than an infallible source. It outlines the journey of integrating AI systems more deeply into practical applications through human+AI cooperation, discussing the potential value and challenges. The discussion includes the modeling of interaction errors and the strategic choices between immediate AI inference or seeking additional human input, supported by results from a user study on optimizing these collaborations.
About the Speaker: Jason Corso is a Professor of Robotics, Electrical Engineering, and Computer Science at the University of Michigan, and Co-Founder / Chief Scientist at AI startup Voxel51. His research spans computer vision, robotics, and AI, with over 150 peer-reviewed publications.
From Research to Industry: Bridging Real-World Applications with Anomalib at the CVPR VAND Challenge
This talk highlights the role of Anomalib, an open-source deep learning framework, in advancing anomaly detection within AI systems, particularly showcased at the upcoming CVPR Visual Anomaly and Novelty Detection (VAND) workshop. Anomalib integrates advanced algorithms and tools to facilitate both academic research and practical applications in sectors like manufacturing, healthcare, and security. It features capabilities such as experiment tracking, model optimization, and scalable deployment solutions. Additionally, the discussion will include Anomalib’s participation in the VAND challenge, focusing on robust real-world applications and few-shot learning for anomaly detection.
About the Speaker: Samet Akcay, an AI research engineer and a tech lead, specializes in semi/self-supervised, zero/few-shot anomaly detection, and multi-modality. He is recently known for his open-source contributions to the ML/DL community. He is the lead author of anomalib, a major open-source anomaly detection library. He also maintains the OpenVINO Training Extensions, a low-code transfer learning framework for building computer vision models.
Learning Robot Perception and Control using Vision with Action
To achieve general utility, robots must continue to learn in unstructured environments. In this talk, I describe how our mobile manipulation robot uses vision with action to 1) learn visual control, 2) annotate its own training data, and 3) learn to estimate depth for new objects and the environment. Using these techniques, I describe how I led a small group to win consecutive robot competitions against teams from Stanford, MIT, and other universities.
About the Speaker: Brent Griffin is the Perception Lead at Agility Robotics and was previously an assistant research scientist at the University of Michigan conducting research at the intersection of computer vision, control, and robot learning. He is lead author on publications in all of the top IEEE conferences for computer vision, robotics, and control, and his work has been featured in Popular Science, in IEEE Spectrum, and on the Big Ten Network.
Anomaly Detection with Anomalib and FiftyOne
Most anomaly detection techniques are unsupervised, meaning that anomaly detection models are trained on unlabeled non-anomalous data. Developing the highest-quality dataset and data pipeline is essential to training robust anomaly detection models. In this brief walkthrough, I will illustrate how to leverage open-source FiftyOne and Anomalib to build deployment-ready anomaly detection models. First, we will load and visualize the MVTec AD dataset in the FiftyOne App. Next, we will use Albumentations to test out augmentation techniques. We will then train an anomaly detection model with Anomalib and evaluate the model with FiftyOne.
About the Speaker: Jacob Marks is a Senior Machine Learning Engineer and Researcher at Voxel51, where he leads open source efforts in vector search, semantic search, and generative AI for the FiftyOne data-centric AI toolkit. Prior to joining Voxel51, Jacob worked at Google X, Samsung Research, and Wolfram Research. |
May 8 - AI, Machine Learning and Computer Vision Meetup
|
|
Data Engineering Meetup - Data Applications
2024-05-08 · 17:00
Welcome to the new edition of Data Engineering London on Data Applications! Join us for the third edition of the Data Engineering meetup with a range of talks looking at data applications in various fields. You'll have the chance to network and meet fellow data engineers (and other data enthusiasts)!
When?
18:00 - 18:30 Networking with pizza and drinks from GoCardless
18:30 - 19:30 Talks
19:30 - 20:30 More networking
Where? GoCardless offices (see address)
Speakers and Talks:
1. Data Quality: Prevention is better than the cure - by Andrew Jones (Principal Engineer @ GoCardless)
2. Universal-First Apps: the key to high quality, unified data collection - by Ambroise Laurent (Solution Engineer @ Theodo UK) & Mo Khazali (Head of Mobile @ Theodo UK)
3. The Data Engineering behind Energy Trading Insights - by Clementine (Cici) Whitcomb (Data Engineer @ EDF Energy)
If you have a topic you're passionate about and wish to see discussed, let us know! We're always looking for more talks for our future events. Places are limited, make sure you register! |
Data Engineering Meetup - Data Applications
|
|
DevOps Society London Meetup @ Broadcom
2024-02-22 · 18:00
Our first London DevOps Society event has arrived! We are delighted to be partnered with our hosts Broadcom (who recently acquired VMware), with a trio of really exciting speakers lined up.
Format of the meetup event will be as follows:
6pm – 6:30pm: Networking on arrival, with pizza and drinks provided by ReVybe IT.
6:30pm – 7:40pm: 3 x 20-minute talks, with a 10-minute break between the 2nd and 3rd speakers.
7:40pm onwards: Networking and event close.
We are delighted to welcome 3 speakers to our first event.
#1 – Vahid Mojtahed – Group Director of Data and Insights
Vahid is a pioneer in the world of Data, with a specialism in AI/ML. He has led teams throughout his career, with a focus on innovation and collaboration. Accolades include:
2023: Invited Speaker at Data + AI World Tour by Databricks
2018: Recognized as Exceptional Talent by the British Academy for innovative work on fraud analytics and solution
2016: Invited speaker at Italian Academy of Science
Title: Think Like a Machine, Win Like a Human: How AI & ML are Transforming Businesses Across Industries
Vahid Mojtahed will kick-start the evening by giving us an insight into how data, AI, and ML are empowering businesses and revolutionizing various industries. He will present examples of fraud detection in the agri-food industry, financial forecasting in a global enterprise, and accelerating property transactions using natural language processing.
#2 – Fernando Villalba – DevOps/DevEX/Platform Engineer
Fernando has over a decade of miscellaneous IT experience. He started in IT support ("Have you tried turning it on and off?"), veered to become a SysAdmin ("Don't you dare turn it off") and later segued into DevOps type of roles ("Destroy and replace!"). He has been a consultant for various multi-billion-dollar organizations helping them achieve their highest potential with their DevOps processes.
Title: Platform engineering must kill RTFM
Fernando Villalba will be speaking on how platform engineering must kill “read the f*ck!ng manual”. “It doesn't matter how great your documentation is or how much you tell developers to read the manual, if your design and user experience is terrible, you will get poor performance. In this talk we will explore how and why placing an emphasis on documentation at the expense of design is a bad practice, and we will also explore how this relates to your cloud provider and your internal developer platform. Whether you are looking to build your own platform or find a compass to pick better tooling for your developers, this talk may be helpful for you.”
This talk is based on the following blog posts:
https://nandovillalba.medium.com/platform-engineering-please-kill-rtfm-72de6f01075e
https://nandovillalba.medium.com/ux-on-platform-engineering-1c7ecfaddea7
#3 – Chris Hurford – Strategic Customer Success Manager – Broadcom
Chris is a senior technical leader who has a blend of technical and leadership skills, with a push towards agile software delivery and delivery management in recent years. He is passionate about developing and supporting high quality software products and systems, whilst motivating teams using agile methodologies. Chris has over 25 years of experience in the technology space, starting off as a software developer before moving into management.
Title: An overview of the Tanzu software portfolio and Tanzu Lab's Platform Engineering services
Lastly, we have Chris Hurford, a senior figure within the Broadcom engineering team, delivering a talk on Tanzu, which helps platform engineers accelerate application development and delivery so that developers can focus on building great apps. Tanzu Application Platform is a modular Kubernetes-based platform that allows you to build quickly and ship fast. |
DevOps Society London Meetup @ Broadcom
|
|
Data Engineering Meetup - Relaunch
2024-01-24 · 18:00
Welcome back! Join us for the exciting relaunch of our Data Engineering Community Meetup! This event promises to be a dynamic gathering for data professionals, enthusiasts, and experts who are eager to dive into the latest trends and innovations in the world of data engineering. The theme for this meetup is Data Quality!
When?
18:00 - 18:30 Networking with pizza and drinks
18:30 - 19:30 Talks
19:30 - 20:30 More networking
Where? Theodo UK offices (see address)
Speakers and Talks:
1. Lifecycle of a Data Glitch: Discover, Fix and Future-Proof - by Marc Montanari
2. Anomaly Detection using GenAI - by Chloé Caron
3. Using Elementary to enhance the data quality of your dbt workflows - by Corentin Berteaux
As we finalise our speaker lineup, we invite your contributions. If you have a topic you're passionate about or wish to see discussed, let us know! Your suggestions can help shape our meetup into an even more engaging and relevant event for our data engineering community. Places are limited, make sure you register! |
Data Engineering Meetup - Relaunch
|
|
Panel discussion: What didn’t they tell you about Engineering Management?
2023-12-06 · 17:00
Please use this registration link to RSVP. Ever wondered what they didn't tell you about transitioning to an engineering manager? Curious how to choose between technical leadership and people management? Can introverts excel just as much as extroverts in this role? Find out at the GDG Berlin panel discussion at Shiftmove as we uncover these career mysteries! Dive deep into the insights of becoming an engineering manager, and hear the stories of those who made it. Explore how to retain engineers and what motivates them. Learn the secrets of balancing tech quality and business growth. Whether you're an individual contributor or a team leader, whether you're here to learn or eager to actively participate by sharing your opinions and asking questions, you're wholeheartedly invited. See you at the GDG Berlin meetup at Shiftmove! Agenda 6:00 PM: Registration 6:30 PM: Panel discussion & Q&A In this part we will talk about Career Paths in Engineering Management. 7:15 PM: Break 7:30 PM: Panel discussion & Q&A In the second part of the discussion we will address the topic of Building and Leading High-Performing Engineering Teams. 8:00 PM: Pizza & drinks Panelists Jason Thrasher - Shiftmove (CTO) Jason is an engineer who loves to build technology and teams. He is new to Berlin, having joined Shiftmove to solve mobility problems in the European markets through technological transformation. He has been fortunate to be a part of exits in Silicon Valley, and is currently learning new things about the tech market in Germany as he broadens his world view. CJ Jenkins - Zalando (Applied Science Manager) CJ is a reformed academic who spent years building and deploying machine learning algorithms for various fintechs. She has spent the past 3 years leading data and engineering teams, and loves helping people grow within their role and executing deliverables on time. 
Vishal Varshney - Zalando (Engineering Leader) I am a Senior Engineering Manager for Product Development with 17 years of experience. I have extensive experience in setting up processes and automation infrastructure for agile teams. Moderator Eugenia Zigisova - Limehome GmbH (Frontend Lead) Eugenia is a frontend engineer and lead at Limehome, a digital hotel startup, and a tech event speaker who is passionate about web performance. She has experience working in fast-growing European startups like N26 and Gorillas where she helped to build a web performance culture. Before moving to Berlin, she ran Google Developer Group and led Women Techmakers in Latvia for several years. Hosted By Abhinav Kulshreshtha, GDG Organizer Eugenia Zigisova, Organizer Jerome Mouton, Organizer Louis Tsai, GDG Organizer manjula dube, Organizer Partner Shiftmove (https://www.shiftmove.com/) SHIFTMOVE is the European pioneer for mobility operations. Emerging from the merger of the two market leaders Avrios and Vimcar, SHIFTMOVE enables companies to reap the benefits of holistic operational mobility management. SHIFTMOVE's mission is to integrate all mobility needs of employees, goods and tools in an easy-to-manage, cloud-based software platform. It creates the basis for transforming operational mobility from a pure cost factor into a value driver for the entire organisation. Complete your event RSVP here: https://gdg.community.dev/events/details/google-gdg-berlin-presents-panel-discussion-what-didnt-they-tell-you-about-engineering-management/. |
Panel discussion: What didn’t they tell you about Engineering Management?
|
|
Bratislava dbt Meetup #2
2023-11-23 · 17:00
🤩 BRATISLAVA dbt MEETUP VOL. 2 is coming your way. Join the data community, learn from experts and expand your network. See you there, okay? 🔥 ABOUT THIS EVENT At this second edition, you will catch talks across the whole spectrum of the modern data experience: from getting started with the MDS through data testing to changing careers and entering the fabulous world of analytics. Presentations will be delivered in English. - - TOPICS & SPEAKERS - - 1️⃣ Modern Data Stack vs Modern Data People Speaker: Viktoria Horvath, Analytics Engineer @ Infinite Lambda Viki often helps data teams get started with dbt and the Modern Data Stack. In this talk, she will look into the most common challenges that people face but often do not voice. Join this session to get answers to the burning questions that are sure to come up while working on your first dbt project or having just started a new role in analytics. Viki will share insights, show you how to seek support and point you to valuable resources to set you up for success. 2️⃣ Fighting the Fear of Change: My Shift from a Marketer to an Analytics Engineer Speaker: Alina Serhiienko, Analytics Engineer @ Slido Alina had extensive experience in marketing that she intended to keep building on. Yet, upon moving to Slovakia and starting her dream career at Slido in 2022, she discovered she was too passionate about data not to pursue a career in the field. She made the transition and is now a member of Slido’s analytics team. Alina will be sharing her story, shedding light on how modern tools such as dbt facilitate such a career change and make it easier for folks to enter the data industry. 3️⃣ How dbt Helps You Sleep at Night (Part 2) Speaker: Sean McIntyre, Senior Solutions Architect @ dbt Labs The delivery of high quality data is vital for organisations for making both day-to-day and strategic decisions. 
Sean is going to explore data testing techniques, comment on various strategies and show you how to leverage dbt's testing capabilities to ensure data quality. 🌟 Moderator Miriama Krizkova, Head of Analytics Engineering @ Slido - - ADMISSION - - The event is completely FREE but seats are limited, so book yours by RSVP-ing. - - SCHEDULE - - 17:45 – 18:00: Registration 18:00 – 18:10: Opening remarks 18:10 – 18:40: Talk #1 + Q&A 18:40 – 19:10: Talk #2 + Q&A 19:10 – 19:40: Talk #3 + Q&A 19:40 – 22:00: Networking and drinks The event is suitable for: Folks just starting out on their data journey; People with some experience with the modern data stack; Seasoned professionals who want to keep expanding their knowledge. 🚀 See you soon, data buddies. ---------------------------------- To attend, please read the Required Participation Language for In-Person Events with dbt Labs: https://bit.ly/3QIJXFb ➡️ Join the dbt Slack community: https://www.getdbt.com/community/ 🤝 For the best Meetup experience, make sure to join the #local-czsk channel in dbt Slack (https://slack.getdbt.com/). ---------------------------------- dbt is a data transformation framework that lets analysts and engineers collaborate using their shared knowledge of SQL. Through the application of software engineering best practices like modularity, version control, testing, and documentation, dbt’s analytics engineering workflow helps teams work more efficiently to produce data the entire organisation can trust. Learn more: https://www.getdbt.com/ |
Bratislava dbt Meetup #2
|
|
PyData Trójmiasto #27 - Snowflake, VoiceLab
2023-10-24 · 16:00
We are very happy to welcome you to October's edition of the PyData Trójmiasto meetup! If you are keen to learn more about LLMs or to see how to approach data engineering and data science with Python using the Snowflake platform in action, save your spot! When: 24th October 2023, 18:00 Where: Gdańsk Science and Technology Park, Trzy Lipy 3, Building B, Room 002 Registration: We have up to 70 seats. The event is free to enter. Let us know you're coming by RSVP on Meetup. Agenda: 18:00 - 18:05 - Meeting boarding 18:05 - 18:10 - A few words about PyData 18:10 - 18:55 - How do language models work? by Wojciech Janowski 18:55 - 19:40 - Snowflake for Data Engineering and Data Science with Python by Piotr Pietrzkiewicz and Łukasz Leszewski 19:40 - Pizza & networking! Livestream at: https://www.youtube.com/watch?v=KU80NP312ds About How do language models work? In this lecture, we will explore in detail the theoretical foundations of language models, especially LLMs. With minimal mathematical and algorithmic background required, we will find the answer to the question “How do language models work?”. I explain the entire text generation process. By the end of this presentation, the workings of LLMs will no longer be a mystery to you. About Wojciech Janowski: Wojciech is a member of the NLP Research team at VoiceLab AI. Currently, he is working on Large Language Models (LLMs) and their implementation into new features for production-ready applications. Wojciech and the NLP Research team created the first Polish LLM, known as Trurl. Additionally, Wojciech is pursuing a Ph.D. at Gdańsk University of Technology in collaboration with VoiceLab. About Snowflake for Data Engineering and Data Science with Python Snowflake Data Cloud - a technical run-through of Snowflake through the lens of Tasty Bytes - a food truck company using analytics to run their business. The session will be a mix of Snowflake theory and live demo. 
You will have a chance to learn what Snowflake Data Cloud is, what it looks like, how you interact with it, and how data engineering and data science work in Snowflake. About Piotr Pietrzkiewicz: Throughout his career, Piotr has worked in various roles for companies like Amazon Web Services and Atos. During that time, he supported private, hybrid and public cloud customers. He has 15 years of hands-on experience in transforming and migrating obsolete architectures into modern resilient solutions. At Snowflake, Piotr manages the Sales Engineering team and assists customers on their way towards efficient data usage. About Łukasz Leszewski: Łukasz is the Senior Sales Engineer for Eastern Europe at Snowflake. Before joining, Łukasz held various positions, including consultant, architect, and project manager, at the business analytics software vendor SAS. Based in Warsaw, he is also a lecturer on Data Quality at the Warsaw School of Economics. He has over 15 years of experience in data management and business intelligence. |
PyData Trójmiasto #27 - Snowflake, VoiceLab
|