Search – talk-data.com

Title & Speakers	Event
The State of Airflow 2026: London Airflow Meetup! 2026-01-28 · 17:30 Join fellow Airflow enthusiasts and leaders at Salisbury House for an evening of engaging talks, great food and drinks, and exclusive swag! We'll start you off with a deep dive into the Airflow 2026 survey results, and finish off with a community member presentation on the Apache TinkerPop provider. PRESENTATIONS *Talk #1: The State of Apache Airflow® 2026* Apache Airflow® continues to thrive as the world’s leading open-source data orchestration platform, with 30M downloads per month and over 3k contributors. 2025 marked a major milestone with the release of Airflow 3, which introduced DAG versioning, enhanced security and task isolation, assets, and more. These changes have reshaped how data teams build, operate, and govern their pipelines. In this session, our speaker will share insights from the State of Airflow 2026 report, including: Latest trends in how teams are using Airflow today What’s next for the project and ecosystem A discussion of emerging best practices and evolving use cases Join us to hear directly from a leader in the community and discover how to get the most out of Airflow in the year ahead. *Talk #2: Building the Apache TinkerPop Provider for Airflow* Speaker: Ahmad Farhan, Data Engineer Graph databases are powering everything from recommendation engines to fraud detection, but integrating graph operations into modern data pipelines has often required custom code and workarounds. Earlier this year, Ahmad built a new Apache TinkerPop provider for Airflow, making it easier than ever to orchestrate Gremlin queries, manage graph workloads, and connect Airflow to TinkerPop-enabled systems. In this session, you’ll learn: What the TinkerPop provider does and why it matters for graph-based workloads How to run Gremlin queries and manage graph jobs directly within Airflow Real examples from the development process, including design decisions and lessons learned How this provider opens the door for new use cases in graph analytics and data engineering Join us to explore how Airflow and TinkerPop can work together to streamline graph workflows and unlock new patterns in modern data pipelines. AGENDA 5:30-6 PM: Arrivals, networking, food & drinks 6-7PM: Presentations 7-8PM: Networking	The State of Airflow 2026: London Airflow Meetup!
AI Meetup (March): AI, GenAI and ML 2025-03-25 · 18:00 Important RSVP HERE (Due to room capacity and venue security, it is required to pre-register at the link for admission) Welcome to the AI meetup in London. Join us for deep dive tech talks on AI, GenAI, LLMs and machine learning, food/drink, networking with speakers and fellow developers. Tech Talk: What you need to know about AI factories Speaker: Matt Shore (High Performance Computing & Artificial Intelligence) Abstract: As organisations start to move from proof of concept to production, they need to consider how to build their infrastructure from the ground up to be completely optimised for the next wave of AI’s requirements. From data to power and space, HPE will provide a whistlestop tour of the latest thinking across the AI stack, as well as what it means for each type of role in the organisation. Tech Talk: Accessing and building with open-source models Speaker: Darin Verheijke (Recursal ai) Abstract: In this session, I will discuss the recent improvements of open-source LLM models, the difficulties in running these open-source models for yourself/your company, how you can easily access and make use of all these open-source models through Hugging Face and Featherless.ai and demo some self-made open-source applications for every developer to use. Speakers/Topics: Stay tuned as we are updating speakers and schedules. If you have a keen interest in speaking to our community, we invite you to submit topics for consideration: Submit Topics Sponsors:** We are actively seeking sponsors to support AI developers community. Whether it is by offering venue spaces, providing food, or cash sponsorship. Sponsors will not only speak at the meetups, receive prominent recognition, but also gain exposure to our extensive membership base of 20,000+ AI developers in London and 500K+ worldwide.	AI Meetup (March): AI, GenAI and ML
IN PERSON: Apache Kafka meets Apache Flink 2025-01-23 · 18:00 Join us for our very first meetup of 2025! You'll learn all about how to use Apache Kafka beyond the consumer protocol and get an introduction to Apache Flink. Date and Time: 🗓️ Thursday 23rd January, ⏰ 18:00 - 20:30 PM 🕘 Venue: Confluent Europe Ltd, 262 High Holborn, London WC1V 7EE, United Kingdom Attending Brands: OSO, Streambased, Gravitee & Confluent Schedule: 18:00: Doors Open 18:00 - 18:30: Food, drinks, networking 18:30 - 19:00: "Accessing Kafka: beyond the consumer protocol" - Tom Scott (CEO Streambased) & Linus Hakansson (CPO Gravitee) 19:00 - 19:30: “Flink - Adi Polak (Director Developer Experience Engineering and Advocacy, Confluent) 19:30- 20:30pm: Additional Q&A, Networking 🎙️ \~Talk 1\~ Talk Title: Accessing Kafka: beyond the consumer protocol Summary: The role of Kafka is expanding and with it the use cases it addresses. This brings many cool new features but also highlights some drawbacks. The standard producer/consumer pattern that has served us so well for so many years is no longer a good fit for all the things that Kafka data is used for and it's time to look beyond.Join Linus (Gravitee) and Tom (Streambased) for an in depth look at how you can interact with you Kafka clusters via REST, GraphQL, WebSockets, JDBC/ODBC and even as a simple filesystem.We'll outline the reasoning behind these new access patterns, the features that differentiate them (and the features that unite them) and show some live demos of the opportunities they create. 🗣️ Speaker 1: Tom Scott (CEO Streambased) is the founder of Streambased, Tom is building multi tenant, on-prem and cloud Kafka services to attack common Kafka pain points and break down barriers to starting your data journey.Linus Hakansson Linus Hakansson is the Chief Product Officer at Gravitee, building a next-generation management platform helping organizations secure, control and govern their Kafka and APIs 🗣️ Speaker 2: Linus Hakansson is the Chief Product Officer at Gravitee, building a next-generation management platform helping organizations secure, control and govern their Kafka and APIs 🎙️ \~Talk 2\~ Talk Title: Flink - demystifying data streaming Summary: In an era where data velocity and volume continue to grow, the ability to process and analyze data streams in real-time is pivotal for businesses aiming to optimize operations, enhance decision-making, and maintain competitive advantages. Apache Flink stands out as a comprehensive, open-source stream processing framework designed to meet these challenges head-on. In this session you will learn about data streaming through the lens of Apache Flink, offering insights into its architecture, capabilities, and how it seamlessly facilitates real-time data processing.Objectives: 1. Introduce Stream Processing: Provide a foundational understanding of stream processing - its importance\, use cases\, and when to use in comparison to batch processing. 2. Explore Apache Flink: Deep dive into Apache Flink's architecture\, key features\, and its unique approach to handling stateful computations\, event time processing\, and ensuring fault tolerance at scale. 🗣️ Speaker 1: Adi Polak, Director Developer Experience Engineering and Advocacy, Confluent. Adi is an experienced software engineer and people manager. For most of her professional life, she dealt with data and machine learning for transactional and analytics workloads by building large-scale systems. As a data practitioner, she developed software to solve real-world problems with Apache Spark, Kafka, HDFS, K8s, AWS, and Azure in high-throughput, high-scale production environments for companies like Akamai and Microsoft.Adi has taught Spark to thousands of students throughout the years and is the author of the successful book — Scaling Machine Learning with Spark. When not thinking up new architecture, teaching new tech or pondering on a distributed systems challenge, you can find her at the local cultural scene.	IN PERSON: Apache Kafka meets Apache Flink
Open Source Data Deep Dive: London 2024-11-19 · 19:00 REGISTER HERE FOR LOCATION: https://lu.ma/2etm1zve Come hang out at the OSS Data Deep Dive in London, where we'll explore some of the coolest and innovative use cases of the Iceberg ecosystem. Whether you're new to Iceberg, data lakehouses, or you’re a seasoned data engineer, discover how these tools can boost your data projects. Plus, there'll be plenty of networking, cool swag, and delicious food. Hope to see you there! Agenda: Apache Iceberg REST Catalog: Making Catalog Interoperability Happen with Alex Merced, Senior Evangelist @ Dremio (co-author of “Apache Iceberg: The Definitive Guide”) Apache Polaris: an Open Source Iceberg REST Catalog with Yufei Gu, Senior Software Engineer at Snowflake & Iceberg PMC member, Hadoop PMC member Charting the Course: The Evolution and Future of Apache Iceberg and Polaris (incubating) with Jean-Baptiste Onofre, Board Member @ Apache Software Foundation	Open Source Data Deep Dive: London
Open Source Data Deep Dives 2024-11-19 · 18:00 RSVP HERE: Https://lu.ma/2etm1zve Join us at the OSS Data Deep Dive in London for an in-depth workshop on Data Engineering Best Practices. This event is perfect for professionals who are keen to enhance their skills in handling big data efficiently. Plus, there'll be plenty of networking, cool swag, and delicious food. Hope to see you there. Agenda: Apache Iceberg REST Catalog: Making Catalog Interoperability Happen with Alex Merced, Senior Evangelist @Dremio (co-author of "Apache Iceberg: The Definitive Guide") Apache Polaris: an Open Source iceberg REST Catalog with Yufei Gu, Senior Software Engineer at Snowflake & Iceberg PMC member, Hadoop PMC member Charting the Course: The Evolution and Future of Apache Iceberg and Polaris (incubating) with Jean-Baptistse Onofre, Board Member @Apache Software Foundation. Our expert speakers will delve into topics like data modeling, ETL processes, data pipelines, and database architecture. Whether you are a seasoned data engineer or just starting in the field, this workshop will provide valuable insights and practical tips to streamline your data engineering projects. Don't miss out on this opportunity to network with fellow enthusiasts and take your data engineering skills to the next level!	Open Source Data Deep Dives
AI and Deep Learning for Enterprise #19 2024-10-08 · 18:00 Join us at Civo Tech Junction on October 8th for an evening of talks, food, and conversation with ML and AI industry pros. Please note you will be unable to enter the venue before 6.30pm. RSVPs will close 24 hours before the event, you may be unable to register after this time but you can still watch online. If you can't join us in person you can watch remotely via our YouTube channel. Agenda 06:30pm - Doors open, food and drink served 07:00pm - Welcome 07:05pm - A short talk from our hosts Daemon 07:10pm - Thomas Wood, Director of Fast Data Science "Project Harmony: a free online tool using LLMs for research in psychology and social sciences" Thomas Wood will present our work on Harmony, harmonydata.ac.uk, which is a free online tool that uses generative AI and LLMs to help researchers compare items in questionnaires such as GAD-7 (used to measure anxiety), even when they are written in different languages. Harmony is open source under MIT License and is written in Python, and uses HuggingFace Sentence Transformers to find similarities between questionnaires. Harmony will soon allow researchers to discover datasets using a vector search. 07:50pm - Break 08:00 - Vikram Haridas, Lead Product Manager at Groupon "Implementing AI-Driven Product Innovations: Strategic Insights and Practical Applications" Vikram Haridas, from Groupon, will reveal how AI can supercharge product roadmaps. Learn how to balance excitement with realism as you scale AI features and discover practical use cases. Get insights into Groupon's success with AI-powered deal optimization and automated merchant onboarding, and learn how to implement these strategies in your own business. 08:40 - Shubhangi Goyal, Data Analyst @ ICS.AI Ltd and Nidhi Agrawal Director @UBS, "Generative AI and its use cases" Our session will explore the transformative potential of Generative AI, focusing on its use cases in matching algorithms and its applications in the financial industry. We'll dive into how AI models enhance person-matching processes by analyzing large datasets for customer service, and personalization. Additionally, we’ll examine how Generative AI is revolutionising the financial sector. 09:10pm - Wrap up, drinks at Angel London Our hosts may require that we provide a list of all attendees, please ensure that you register with a name that matches your government issued ID or bank card: if you do not we cannot guarantee you entry to the building. Please RSVP for the event well in advance if you plan to attend in person and unRSVP if you can no longer attend as limited spaces are available.	AI and Deep Learning for Enterprise #19
AI tools for software engineers, but without the hype – with Simon Willison (co-creator of Django) 2024-09-25 · 14:06 Simon Willison – co-creator of the Django Web Framework; founder/creator of Datasette @ Django (Web Framework) and Datasette (open-source project) The first episode of The Pragmatic Engineer Podcast is out. Expect similar episodes every other Wednesday. You can add the podcast in your favorite podcast player, and have future episodes downloaded automatically. Listen now on Apple, Spotify, and YouTube. Brought to you by: • Codeium: Join the 700K+ developers using the IT-approved AI-powered code assistant. • TLDR: Keep up with tech in 5 minutes — On the first episode of the Pragmatic Engineer Podcast, I am joined by Simon Willison. Simon is one of the best-known software engineers experimenting with LLMs to boost his own productivity: he’s been doing this for more than three years, blogging about it in the open. Simon is the creator of Datasette, an open-source tool for exploring and publishing data. He works full-time developing open-source tools for data journalism, centered on Datasette and SQLite. Previously, he was an engineering director at Eventbrite, joining through the acquisition of Lanyrd, a Y Combinator startup he co-founded in 2010. Simon is also a co-creator of the Django Web Framework. He has been blogging about web development since the early 2000s. In today’s conversation, we dive deep into the realm of Gen AI and talk about the following: • Simon’s initial experiments with LLMs and coding tools • Why fine-tuning is generally a waste of time—and when it’s not • RAG: an overview • Interacting with GPTs voice mode • Simon’s day-to-day LLM stack • Common misconceptions about LLMs and ethical gray areas • How Simon’s productivity has increased and his generally optimistic view on these tools • Tips, tricks, and hacks for interacting with GenAI tools • And more! I hope you enjoy this episode. — In this episode, we cover: (02:15) Welcome (05:28) Simon’s ‘scary’ experience with ChatGPT (10:58) Simon’s initial experiments with LLMs and coding tools (12:21) The languages that LLMs excel at (14:50) To start LLMs by understanding the theory, or by playing around? (16:35) Fine-tuning: what it is, and why it’s mostly a waste of time (18:03) Where fine-tuning works (18:31) RAG: an explanation (21:34) The expense of running testing on AI (23:15) Simon’s current AI stack (29:55) Common misconceptions about using LLM tools (30:09) Simon’s stack – continued (32:51) Learnings from running local models (33:56) The impact of Firebug and the introduction of open-source (39:42) How Simon’s productivity has increased using LLM tools (41:55) Why most people should limit themselves to 3-4 programming languages (45:18) Addressing ethical issues and resistance to using generative AI (49:11) Are LLMs are plateauing? Is AGI overhyped? (55:45) Coding vs. professional coding, looking ahead (57:27) The importance of systems thinking for software engineers (1:01:00) Simon’s advice for experienced engineers (1:06:29) Rapid-fire questions — Where to find Simon Willison: • X: https://x.com/simonw • LinkedIn: https://www.linkedin.com/in/simonwillison/ • Website: https://simonwillison.net/ • Mastodon: https://fedi.simonwillison.net/@simon — Referenced: • Simon’s LLM project: https://github.com/simonw/llm • Jeremy Howard’s Fast Ai: https://www.fast.ai/ • jq programming language: https://en.wikipedia.org/wiki/Jq_(programming_language) • Datasette: https://datasette.io/ • GPT Code Interpreter: https://platform.openai.com/docs/assistants/tools/code-interpreter • Open Ai Playground: https://platform.openai.com/playground/chat • Advent of Code: https://adventofcode.com/ • Rust programming language: https://www.rust-lang.org/ • Applied AI Software Engineering: RAG: https://newsletter.pragmaticengineer.com/p/rag • Claude: https://claude.ai/ • Claude 3.5 sonnet: https://www.anthropic.com/news/claude-3-5-sonnet • ChatGPT can now see, hear, and speak: https://openai.com/index/chatgpt-can-now-see-hear-and-speak/ • GitHub Copilot: https://github.com/features/copilot • What are Artifacts and how do I use them?: https://support.anthropic.com/en/articles/9487310-what-are-artifacts-and-how-do-i-use-them • Large Language Models on the command line: https://simonwillison.net/2024/Jun/17/cli-language-models/ • Llama: https://www.llama.com/ • MLC chat on the app store: https://apps.apple.com/us/app/mlc-chat/id6448482937 • Firebug: https://en.wikipedia.org/wiki/Firebug_(software)# • NPM: https://www.npmjs.com/ • Django: https://www.djangoproject.com/ • Sourceforge: https://sourceforge.net/ • CPAN: https://www.cpan.org/ • OOP: https://en.wikipedia.org/wiki/Object-oriented_programming • Prolog: https://en.wikipedia.org/wiki/Prolog • SML: https://en.wikipedia.org/wiki/Standard_ML • Stabile Diffusion: https://stability.ai/ • Chain of thought prompting: https://www.promptingguide.ai/techniques/cot • Cognition AI: https://www.cognition.ai/ • In the Race to Artificial General Intelligence, Where’s the Finish Line?: https://www.scientificamerican.com/article/what-does-artificial-general-intelligence-actually-mean/ • Black swan theory: https://en.wikipedia.org/wiki/Black_swan_theory • Copilot workspace: https://githubnext.com/projects/copilot-workspace • Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems: https://www.amazon.com/Designing-Data-Intensive-Applications-Reliable-Maintainable/dp/1449373321 • Bluesky Global: https://www.blueskyglobal.org/ • The Atrocity Archives (Laundry Files #1): https://www.amazon.com/Atrocity-Archives-Laundry-Files/dp/0441013651 • Rivers of London: https://www.amazon.com/Rivers-London-Ben-Aaronovitch/dp/1625676158/ • Vanilla JavaScript: http://vanilla-js.com/ • jQuery: https://jquery.com/ • Fly.io: https://fly.io/ — Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email [email protected]. Get full access to The Pragmatic Engineer at newsletter.pragmaticengineer.com/subscribe AI/ML C#/.NET GenAI GitHub JavaScript LLM Marketing RAG Rust	The Pragmatic Engineer Listen
From data to insights: Clojure for data deep dive (by Kira McLean) 2024-04-30 · 17:30 THIS IS AN ONLINE EVENT [Connection details will be shared 1h before the start time] The London Clojurians are happy to present: Title: From data to insights: Clojure for data deep dive Speaker: Kira McLean Time: 2024-04-30 @ 18:30 (London time) Local time: click here for local time Kira McLean (https://github.com/kiramclean/) will be presenting: "From data to insights: Clojure for data deep dive" In this session, participants will dive into the lesser-known corners of Clojure's data ecosystem. Learn how to extract meaningful insights from example datasets, uncovering the versatility of libraries like tablecloth, tech.ml.dataset, and fastmath to confidently tackle realistic and complex data challenges. Participants will leave equipped with the tools and techniques to effectively leverage Clojure's robust data science toolkit for insightful real-world data exploration and analysis. Kira has been writing software since 2015, focusing on Clojure for the last 4 years. With a desire to pave the way for Clojure's broader recognition and adoption in the data science community, she's actively developing tools and guides aimed at showcasing the strengths of Clojure's data science toolkit. Her efforts are driven by a vision to broaden Clojure's adoption in the data world by improving the usability and effectiveness it's core libraries. An advocate for Clojure's potential in the world of data science, she's spending this year working exclusively on open source contributions to support and grow the Clojure data science ecosystem, supported by Clojurists Together and other generous sponsors. If you missed this event, you can watch the recording on our YouTube channel: https://www.youtube.com/@LondonClojurians (The recording will be uploaded a couple of days after the event.) Please, consider supporting the London Clojurians with a small donation: https://opencollective.com/london-clojurians/ Your contributions will enable the sustainability of the London Clojurians community and support our varied set of online and in-person events: ClojureBridge London: supports under-represented groups discover Clojure re:Clojure: our free to attend annual community conference monthly meetup events with speakers from all over the world subscription and admin costs such as domain name & StreamYard subscription Thank you to our sponsors: https://juxt.pro/ https://flexiana.com/ And many individual sponsors	From data to insights: Clojure for data deep dive (by Kira McLean)
PyData London - 84th Meetup 2024-04-02 · 18:00 Venue: Riverbank House, 2 Swan Ln, London EC4R 3AD - IMPORTANT: LOCATION UPDATED! Please note: 🚨🚨🚨A valid photo ID is required by building security. 🚨🚨🚨 This event follows the NumFOCUS Code of Conduct, please familiarise yourself with it before the event. If your RSVP status says "You're going" you will be able to get in. No need to show your RSVP confirmation when signing in. If you can no longer make it, please unRSVP as soon as you know. Code of Conduct: This event follows the NumFOCUS Code of Conduct. Please get in touch with the organisers with any questions or concerns regarding the Code of Conduct. As always, there'll be free food & drinks, generously provided by our host, Man Group. Main Talks Building Retrieval Augmented Generation (RAG) powered applications - Aniket Maurya RAG extends the capability and knowledge base of large language models (LLMs) by augmenting prompts with proprietary and domain-specific knowledge without the need to retrain the LLM. It ensures information stays current and reduces hallucination by attributing the source. In this talk, the audience will get an overview of building RAG powered applications using open-source tools. Getting python out of the way when taking ML models from research to production. A deep dive into the Open Neural Network Exchange (ONNX) - Aditya Goel The Python data science ecosystem is unparalleled when it comes to model development and training. When moving models from research to production, Python creates many challenges from latency through to managing environments and dependencies. The Open Neural Network Exchange (ONNX) enables data practitioners to export their model to a self-contained, target independent protobuf representation. When paired with highly performant runtime and compiler technology, this leads to exceptionally high performance inference across many hardware targets, while massively simplifying to process of getting models from research into production. This talk will explore how data practitioners and software engineers can exploit ONNX to rapidly speed up the transfer of models from research to production. ⚡ Lightning Talks Using Google Location Data and Reverse Geocoding to Explore your Personal Travel History - Jessica Walkenhorst In this talk I will demonstrate how you can use reverse geocoding on your Google location data to gain a detailed understanding of your past travel history. Apart from being a fun exercise and bringing back great memories, the results of this analysis can be used to understand times spent abroad, information that is often required in the process of applying for residency permits and foreign passports. 2. Community Lightning Talk - Bring your own! This is an opportunity for guests to bring their own lightning talk and spontaneously present on the evening! Logistics Doors open at 6.30 pm (get there early as you have to sign-in via building security), talks start at 7 pm, drinks from 9 pm in the bar. We will have reduced capacity for this event but there will be plenty of people to discuss data science questions with! Please unRSVP in good time if you realise you can't make it. We're limited by building security on the number of attendees, so please free up your place for your fellow community members! Follow @pydatalondon (https://twitter.com/pydatalondon) for updates and early announcements.	PyData London - 84th Meetup
AI Meetup: ML and LLMs Infrastructure 2024-03-27 · 18:00 * RSVP: https://www.aicamp.ai/event/eventdetails/W2024032710 (Due to limited room capacity, you must pre-register at the link for admission). Welcome to the AI meetup in London. Join us for deep dive tech talks on AI, GenAI, LLMs and machine learning, food/drink, networking with speakers and fellow developers. Agenda:** * 6:00pm\~7:00pm: Checkin, Food/drink and Networking * 7:00pm\~9:00pm: Tech talks and Q&A * 9:00pm: Open discussion and Mixer Tech Talk: Building GenAI and ML systems with OSS Metaflow Speaker: Hugo Bowne-Anderson (Outerbounds) Abstract: This talk explores a framework for how data scientists can deliver value with Generative AI: How can you embed LLMs and foundation models into your pre-existing software stack? How can you do so using Open Source Python? What changes about the production machine learning stack and what remains the same? This talk is aimed squarely at (data) scientists and ML engineers who want to focus on the science, data, and modeling, but want to be able to access all their infrastructural, platform, and software needs with ease! Tech Talk: Harmony, Open source AI tool for psychology research Speaker: Thomas Wood (Fast Data Science) Abstract: In this talk, I will discuss AI for social sciences research and how to build a research tool with NLP and AI with open source tool Harmony, funded by Wellcome. Speakers/Topics: Stay tuned as we are updating speakers and schedules. If you have a keen interest in speaking to our community, we invite you to submit topics for consideration: Submit Topics Sponsors: We are actively seeking sponsors to support AI developers community. Whether it is by offering venue spaces, providing food, or cash sponsorship. Sponsors will not only speak at the meetups, receive prominent recognition, but also gain exposure to our extensive membership base of 10,000+ AI developers in London or 300K+ worldwide. Community on Slack/Discord Event chat: chat and connect with speakers and attendees Sharing blogs, events, job openings, projects collaborations Join Slack/Discord (link is at the bottom of the page) *	AI Meetup: ML and LLMs Infrastructure
PyData London - 82nd Meetup 2024-02-06 · 19:00 Venue: Riverbank House, 2 Swan Ln, London EC4R 3AD - IMPORTANT: LOCATION UPDATED! Please note: 🚨🚨🚨A valid photo ID is required by building security. 🚨🚨🚨 This event follows the NumFOCUS Code of Conduct, please familiarise yourself with it before the event. If your RSVP status says "You're going" you will be able to get in. No need to show your RSVP confirmation when signing in. If you can no longer make it, please unRSVP as soon as you know. Code of Conduct: This event follows the NumFOCUS Code of Conduct. Please get in touch with the organisers with any questions or concerns regarding the Code of Conduct. As always, there'll be free food & drinks, generously provided by our host, Man Group. Main Talks Toolbox of a not-so Data Scientist - Tambe Tabitha Achere This talk is about building data science solutions in scenarios where demos cannot be done on a notebook and dashboards do not suffice as a final deliverable. By the end of this session, the audience will have an idea of how data scientists can build the logic behind full-stack applications without the need to learn a backend framework. I will do a deep dive into one of my projects and there will be lots of code samples accompanied by explanations that led to design decisions. The project I'll be diving into is one in which the data could not be pulled in so if you've ever had to build for data you couldn't see, this session is for you too. I'll highlight the tools, packages and processes that enabled it to be built. Boosting Similarity Search With Real-time Stream Processing - Fawaz Ghali The goal of similarity search and vector databases is to find similar results to the search query for unstructured data, such as text, images, and videos. The unstructured data first is vectorized, and stored in a vector format. There are publicly available tools to create vectors from unstructured data; similarly, there are vector databases to store and perform similarity searches. This is important because of the rising popularity of Large Language Models (LLMs) and their combination with vector databases. Here, we present a hybrid approach by taking the strengths of vector databases and boosting them with traditional search and filtering techniques based on real-time stream processing. Vector databases are good for building high-performance vector search applications. On the other hand, stream processing can be used for real-time fast data storage for structured data (filters, tags, and contextual data). In this work, we're adding context and memory to vector databases to ingest, enrich, predict, and act on your data in a simplified but efficient approach. In this talk, we’ll focus on how Real-time compute APIs help leverage the processing capabilities of a distributed cluster, so you aren’t leaving large potential performance gains on the table. The combination of Real-time storage and computing provides a unique synergy that enables applications to address real-time use cases at any scale. ⚡ Lightning Talks Open-Source Science (OSSci) - Tim Bonnemann Open-Source Science (OSSci) is a new NumFOCUS initiative – launched in July 2022 in partnership with IBM – that aims to accelerate scientific research by improving the ways open source software in science gets done (built, used, funded, sustained, recognized, etc.). OSSci connects scientists, OSS developers and other stakeholders to share best practices, identify common pain points, and explore solutions together. The five OSSci interest groups to date cover domain-specific topics (chemistry/materials, life sciences/healthcare, climate/sustainability) as well as cross-domain topics (reproducibility, map of science), with more to be rolled out in 2024. This lightning talk will provide a brief overview of OSSci’s activities to date, our plans for 2024, and how you can get involved. (Maybe) faster Pandas with CuDF on the GPU (perhaps) - Ian Ozsvald NVIDIA's CuDF promises 100-1000x GPU speed ups with 100% compatibility, with a bit of effort it can be made to work. This talk shows what could work and which bits (including setup!) can be painful Logistics Doors open at 6.30 pm (get there early as you have to sign-in via building security), talks start at 7 pm, drinks from 9 pm in the bar. We will have reduced capacity for this event but there will be plenty of people to discuss data science questions with! Please unRSVP in good time if you realise you can't make it. We're limited by building security on the number of attendees, so please free up your place for your fellow community members! Follow @pydatalondon (https://twitter.com/pydatalondon) for updates and early announcements.	PyData London - 82nd Meetup
AI Deep Dive: LLMs and Vector Databases 2023-09-06 · 17:00 ** Raffle at the end: two Designing Machine Learning Systems, O'Reilly GDG Cloud London is thrilled to be collaborating with AICamp (https://www.meetup.com/London-AI-Tech-Talk/) for a deep dive into the AI world. The event will be held at Cathedral View, Carlisle Place, St. Vincents Centre, London, SW1P 1NL. This meet-up is a unique opportunity to connect with fellow AI enthusiasts, industry practitioners, and researchers in a dynamic and interactive setting. Whether you are a seasoned AI professional or just curious about the latest advancements in Generative AI, LLMs and Vector Databases, this meet-up is for you! Join us for an insightful and thought-provoking discussion on the forefront of AI innovation and practices. This event is sponsored by Weativate. At the end of the event, we'll give away two O'Reilly books: Designing Machine Learning Systems (paperback). Don't miss out, RSVP now! Agenda 6:00 PM: Arrivals and Check In 6:30 PM: Welcome / Community Update 6:45 PM: Zain Hasan - Using Vector Databases with Multimodal Embeddings and Search At Scale Many real-world problems are inherently multimodal, from the communicative modalities humans use such as spoken language and gestures to the force, proprioception, and visual sensors ubiquitous in robotics. In order for machine learning models to address these problems and interact more naturally and wholistically with the world around them and ultimately be more general and powerful reasoning engines we need them to understand data across all of its corresponding image, video, text, audio, and tactile representations. In this talk we will discuss how we can use open-source multimodal models, that can see, hear, read, and feel data(!), to perform cross-modal search(searching audio with images, videos with text etc.) at the billion-object scale with the help of open source vector databases. I will also demonstrate, with live code demos and large-scale datasets, how being able to perform this cross-modal retrieval in real-time can help users add natural search interfaces to their apps. This talk will revolve around how we scaled the usage of multimodal embedding models in production and how you can add cross-modal search into your apps. 7:15 PM: JP Hwang - Bringing LLMs to Your Data In this talk, JP explains how Weaviate redefines what you thought was possible in a database. JP will begin by showing how you can use Weaviate to effectively search data, before moving on to show you how you can use generative search (retrieval augmented generation) with Weaviate to transform your data at retrieval time with LLMs. 7:45 PM: Raffle and Networking Speakers Zain Hasan - Weaviate (Developer Relations) Joon-Pil Hwang - Weaviate (Developer Relations) Hosted By Amanda Cavallaro, GDG Organizer I'm an Aikidoka, Developer Advocate, Software Developer, Google Developers Expert, Linkedin Learning Author and a Full Stack Web Development Specialist. Saverio Terracciano, GDG Organizer Stefano Le Pera, GDG Organizer Lorenzo Turrino, GDG Organizer Alessandro Puccetti, GDG Organizer My name is Alessandro Puccetti, I am Italian 🇮🇹 but I am in fact a citizen of the world 🌎. I love travelling and meeting new people from different cultures, and I enjoy having a particular focus on their food 😉. Kubra Harmankaya, GDG Organizer Natalie Godec, GDG Organizer Bruno Ripa, GDG Organizer I am an italian software architect, in the industry since 2006. I have been in entrepreneurship in Italy, for 6 years, and then continued my career in United Kingdom, in London (2012), a city (or, better, the City) which I consider as my second home. I have worked in several industries (gaming, fintech, digital asset management) and in many companies, with a 3 years parenthesis in Spain (2017-2020), precisely in Barcelona, where I have worked as a contractor for a few USA startups and an european company working in IoT. In March 2020 I made my way back in London, working for Erlang Solutions. Actually I am a contractor and Consultant at BBC Arianna Capizzi, GDG Organizer Jen Kwon, GDG Organizer Complete your event RSVP here: https://gdg.community.dev/events/details/google-gdg-cloud-london-presents-ai-deep-dive-llms-and-vector-databases/.	AI Deep Dive: LLMs and Vector Databases
AI Deep Dive: LLMs and Vector Databases 2023-09-06 · 17:00 Welcome to our in-person AI meetup, in collaboration with Google Developers Group. Join us for deep dive tech talks on AI/ML, food/drink, networking with speakers&peers developers, and win lucky draw prizes. Pre-registration is required: https://www.aicamp.ai/event/eventdetails/W2023090610 [RSVP instructions] Pre-register at the event website. (venue security may not let you in if you don't pre-register) Contact us to submit topics and/or sponsor the meetup on venue/food/swags/prizes. https://forms.gle/JkMt91CZRtoJBSFUA Community on Slack for events chat, speakers office hour and learning resources, job openings and projects collaboration. join slack (search and join #london channel) * Description: This meet-up is a unique opportunity to connect with fellow AI enthusiasts, industry practitioners, and researchers in a dynamic and interactive setting. Whether you are a seasoned AI professional or just curious about the latest advancements in Generative AI, LLMs and Vector Databases, this meet-up is for you! Join us for an insightful and thought-provoking discussion on the forefront of AI innovation and practices. Agenda (BST): * 6:00pm\~6:30pm: Checkin, Food/Snacks/Drink and networking * 6:30pm\~6:45pm: Welcome/community update * 6:45pm\~7:45pm: Tech talks * 7:45pm: Open discussion & Mixer Tech Talk 1: Using Vector Databases with Multimodal Embeddings and Search At Scale Speaker: Zain Hasan @Weaviate Abstract: Many real-world problems are inherently multimodal, from the communicative modalities humans use such as spoken language and gestures to the force, proprioception, and visual sensors ubiquitous in robotics. In order for machine learning models to address these problems and interact more naturally and holistically with the world around them and ultimately be more general and powerful reasoning engines we need them to understand data across all of its corresponding image, video, text, audio, and tactile representations. In this talk we will discuss how we can use open-source multimodal models, that can see, hear, read, and feel data(!), to perform cross-modal search(searching audio with images, videos with text etc.) at the billion-object scale with the help of open source vector databases. I will also demonstrate, with live code demos and large-scale datasets, how being able to perform this cross-modal retrieval in real-time can help users add natural search interfaces to their apps. This talk will revolve around how we scaled the usage of multimodal embedding models in production and how you can add cross-modal search into your apps. Tech Talk 2: Bringing LLMs to Your Data Speaker: JP Huang @Weaviate Abstract: In this talk, JP explains how Weaviate redefines what you thought was possible in a database. JP will begin by showing how you can use Weaviate to effectively search data, before moving on to show you how you can use generative search (retrieval augmented generation) with Weaviate to transform your data at retrieval time with LLMs.	AI Deep Dive: LLMs and Vector Databases
AI Deep Dive: Creating your own ChatGPT with Apache Airflow 2023-07-13 · 17:00 ** Big give-away at the end: two Designing Machine Learning Systems, O'ReillyGDG Cloud London is thrilled to be collaborating with AICamp (https://www.meetup.com/London-AI-Tech-Talk/) for a deep dive into the AI world. The event will be held at GoCardless London office. This meet-up is a unique opportunity to connect with fellow AI enthusiasts, industry practitioners, and researchers in a dynamic and interactive setting. Whether you are a seasoned AI professional or just curious about the latest advancements in NLP, LLMs and Airflow, this meet-up is for you! Join us for an insightful and thought-provoking discussion on the forefront of AI innovation. At the end of the event, we'll give away two O'Reilly books: Designing Machine Learning Systems (paperback). Don't miss out, RSVP now! A big shout-out to our sponsor, Transparent (https://heytransparent.io). Agenda 6:00 PM: Arrivals and Check In 6:30 PM: Welcome / Community Update A quick intro from GDG Cloud London, AICamp and Transparent.io. 6:45 PM: Tatiana Al-Chueyr - Creating your own ChatGPT with Apache Airflow Apache Airflow is an orchestration tool which allows users to build all sorts of pipelines, automating steps and allowing them to run on schedule reliably. Thousands of companies use Airflow to process ETL and machine learning pipelines worldwide. This talk will illustrate creating an Airflow pipeline to process data and train a custom ChatGPT. 7:15 PM: Marty Pitt - Using AI to create data pipelines and Service Orchestration with Orbital and Google Cloud 7:45 PM: Wrap up, Networking and Raffle! We are going to give away two Designing Machine Learning Systems, O'Reilly books after the last talk! Speakers Tatiana Al-Chueyr - Astronomer (Staff Software Engineer) Tatiana is a Staff Software Engineer at Astronomer and builds open-source authoring tools on top of Apache Airflow. She Graduated in Computer Engineering and has worked for over 18 years building highly scalable software for multiple organisations, including the Ministry of Science and Technology in Brazil, TV Globo and the BBC. Marty Pitt - Orbital (Founder) Hosted By Amanda Cavallaro, GDG Organizer I'm an Aikidoka, Developer Advocate, Software Developer, Google Developers Expert, Linkedin Learning Author and a Full Stack Web Development Specialist. Saverio Terracciano, GDG Organizer Stefano Le Pera, GDG Organizer Lorenzo Turrino, GDG Organizer Alessandro Puccetti, GDG Organizer My name is Alessandro Puccetti, I am Italian 🇮🇹 but I am in fact a citizen of the world 🌎. I love travelling and meeting new people from different cultures, and I enjoy having a particular focus on their food 😉. Kubra Harmankaya, GDG Organizer Natalie Godec, GDG Organizer Nodir Siddikov, GDG Organizer Bruno Ripa, GDG Organizer I am an italian software architect, in the industry since 2006. I have been in entrepreneurship in Italy, for 6 years, and then continued my career in United Kingdom, in London (2012), a city (or, better, the City) which I consider as my second home. I have worked in several industries (gaming, fintech, digital asset management) and in many companies, with a 3 years parenthesis in Spain (2017-2020), precisely in Barcelona, where I have worked as a contractor for a few USA startups and an european company working in IoT. In March 2020 I made my way back in London, working for Erlang Solutions. Actually I am a contractor and Consultant at BBC Arianna Capizzi, GDG Organizer Complete your event RSVP here: https://gdg.community.dev/events/details/google-gdg-cloud-london-presents-ai-deep-dive-creating-your-own-chatgpt-with-apache-airflow/.	AI Deep Dive: Creating your own ChatGPT with Apache Airflow
AI Deep Dive: AI Audio Pipelines & LLMs-powered Applications 2023-06-15 · 17:00 Complete your event RSVP here: https://gdg.community.dev/events/details/google-gdg-cloud-london-presents-ai-deep-dive-ai-audio-pipelines-llms-powered-applications/ Welcome to our in-person meet-up about Artificial Intelligence (AI)! GDG Cloud London is thrilled to be collaborating with AICamp (https://www.meetup.com/London-AI-Tech-Talk/) for a deep dive into the AI world. Join us for an engaging discussion on two cutting-edge topics: Haystack, an open source NLP framework by Deepset, and how to get the most out of your Audio/Video Content. The event will be held at RightMove London office on 33 Soho Square. This meet-up is a unique opportunity to connect with fellow AI enthusiasts, industry practitioners, and researchers in a dynamic and interactive setting. Whether you are a seasoned AI professional or just curious about the latest advancements in NLP, LLMs and AI Audio Pipelines, this meet-up is for you! Join us for an insightful and thought-provoking discussion on the forefront of AI innovation. Don't miss out, RSVP now! A big shout-out to our sponsor, Transparent (https://heytransparent.io). Agenda 6:00 PM: Arrivals and Check In 6:30 PM: Welcome / Community Update A quick intro from GDG Cloud London, AICamp and Transparent.io. 6:45 PM: Adam MacVeigh - AI Audio Pipelines How to get the most out of your audio and video content with transcription and Natural Language Processing. 7:15 PM: Tuana Celik - Building LLM Powered NLP Applications with Haystack In this talk we will take a look at Haystack, an open source NLP framework by deepset, and the current state of building NLP applications. We will look at challenges we frequently face using LLMs (such as hallucinations), and what we can do to mitigate them. We will also look at different applications of language models with various prompting implementations such as Agents, chat applications and more 7:45 PM: Wrap up and Networking Speakers Adam MacVeigh - News UK (Senior Data Scientist) Tuana Celik - Deepset (Developer Advocate) Tuana is a developer advocate at deepset, where she focuses on the open source NLP framework: Haystack. With a degree in Computer Science from the University of Bristol, she first started her career as a software engineer. Later, she returned to the world of machine learning as a developer advocate and now dedicates her time to helping the open source NLP community. headshot is attached.… Hosted By Amanda Cavallaro, GDG Organizer I'm an Aikidoka, Developer Advocate, Software Developer, Google Developers Expert, Linkedin Learning Author and a Full Stack Web Development Specialist. Saverio Terracciano, GDG Organizer Stefano Le Pera, GDG Organizer Lorenzo Turrino, GDG Organizer Alessandro Puccetti, GDG Organizer My name is Alessandro Puccetti, I am Italian 🇮🇹 but I am in fact a citizen of the world 🌎. I love travelling and meeting new people from different cultures, and I enjoy having a particular focus on their food 😉. Kubra Harmankaya, GDG Organizer Natalie Godec, GDG Organizer Nodir Siddikov, GDG Organizer Bruno Ripa, GDG Organizer I am an italian software architect, in the industry since 2006. I have been in entrepreneurship in Italy, for 6 years, and then continued my career in United Kingdom, in London (2012), a city (or, better, the City) which I consider as my second home. I have worked in several industries (gaming, fintech, digital asset management) and in many companies, with a 3 years parenthesis in Spain (2017-2020), precisely in Barcelona, where I have worked as a contractor for a few USA startups and an european company working in IoT. In March 2020 I made my way back in London, working for Erlang Solutions. Actually I am a contractor and Consultant at BBC Arianna Capizzi, GDG Organizer Partners AICamp (https://www.meetup.com/London-AI-Tech-Talk/) AICamp is a global online learning platform for developers, engineers, data scientists to learn and practice AI/ML technology. * Online live tech talks, workshops, bootcamps, courses * 100k+ developers members from 100+ countries * Learning groups in 50+ major tech hub cities around the world. Transparent (https://heytransparent.io) A tech recruitment start-up Complete your event RSVP here: https://gdg.community.dev/events/details/google-gdg-cloud-london-presents-ai-deep-dive-ai-audio-pipelines-llms-powered-applications/.	AI Deep Dive: AI Audio Pipelines & LLMs-powered Applications
AI Deep Dive: AI Audio Pipelines&LLMs-powered Applications 2023-06-15 · 17:00 Welcome to our in-person ML monthly meetup, in collaboration with GDG (Google Developers Group) Cloud London. Join us for deep dive tech talks on AI/ML, food/drink, networking with speakers&peers developers, and win lucky draw prizes. Pre-registration is required here: https://www.aicamp.ai/event/eventdetails/W2023061510 [RSVP instructions] Register at the event website. (full name and email is required for badges and check in) Contact us to submit topics and/or sponsor the meetup on venue/food/swags/prizes. https://forms.gle/JkMt91CZRtoJBSFUA Community on Slack for events chat, speakers office hour, sharing learning resources, job openings, etc... join slack (search and join the #london channel) * Description: Join us for an engaging discussion on two cutting-edge topics: LLMs and how to get the most out of your Audio/Video Content. This meet-up is a unique opportunity to connect with fellow AI enthusiasts, industry practitioners, and researchers in a dynamic and interactive setting. Whether you are a seasoned AI professional or just curious about the latest advancements in NLP, LLMs and AI Audio Pipelines, this meet-up is for you! Join us for an insightful and thought-provoking discussion on the forefront of AI innovation. A big shout-out to our sponsor, Transparent (https://heytransparent.io). Agenda (BST): * 6:00pm\~6:30pm: Checkin, Food/Snacks/Drink and networking * 6:30pm\~6:45pm: Welcome/community update * 6:45pm\~7:45pm: Tech talks * 7:45pm: Open discussion & Mixer Tech Talk 1: AI Audio Pipelines Speaker: Adam MacVeigh, Data Scientist @News UK Abstract: How to get the most out of your audio and video content with transcription and Natural Language Processing. Tech Talk 2: Building LLM Powered NLP Applications with Haystack Speaker: Tuana Celik, Developer Advocate @Deepset Abstract: In this talk we will take a look at Haystack, an open source NLP framework by deepset, and the current state of building NLP applications. We will look at challenges we frequently face using LLMs (such as hallucinations), and what we can do to mitigate them. We will also look at different applications of language models with various prompting implementations such as Agents, chat applications and more.	AI Deep Dive: AI Audio Pipelines&LLMs-powered Applications

The State of Airflow 2026: London Airflow Meetup! 2026-01-28 · 17:30

Join fellow Airflow enthusiasts and leaders at Salisbury House for an evening of engaging talks, great food and drinks, and exclusive swag!

We'll start you off with a deep dive into the Airflow 2026 survey results, and finish off with a community member presentation on the Apache TinkerPop provider.

PRESENTATIONS

Talk #1: The State of Apache Airflow® 2026

Apache Airflow® continues to thrive as the world’s leading open-source data orchestration platform, with 30M downloads per month and over 3k contributors. 2025 marked a major milestone with the release of Airflow 3, which introduced DAG versioning, enhanced security and task isolation, assets, and more. These changes have reshaped how data teams build, operate, and govern their pipelines.

In this session, our speaker will share insights from the State of Airflow 2026 report, including:

Latest trends in how teams are using Airflow today
What’s next for the project and ecosystem
A discussion of emerging best practices and evolving use cases

Join us to hear directly from a leader in the community and discover how to get the most out of Airflow in the year ahead.

Talk #2: Building the Apache TinkerPop Provider for Airflow

Speaker: Ahmad Farhan, Data Engineer

Graph databases are powering everything from recommendation engines to fraud detection, but integrating graph operations into modern data pipelines has often required custom code and workarounds. Earlier this year, Ahmad built a new Apache TinkerPop provider for Airflow, making it easier than ever to orchestrate Gremlin queries, manage graph workloads, and connect Airflow to TinkerPop-enabled systems. In this session, you’ll learn:

What the TinkerPop provider does and why it matters for graph-based workloads
How to run Gremlin queries and manage graph jobs directly within Airflow
Real examples from the development process, including design decisions and lessons learned
How this provider opens the door for new use cases in graph analytics and data engineering

Join us to explore how Airflow and TinkerPop can work together to streamline graph workflows and unlock new patterns in modern data pipelines.

AGENDA

5:30-6 PM: Arrivals, networking, food & drinks
6-7PM: Presentations
7-8PM: Networking

The State of Airflow 2026: London Airflow Meetup!

AI Meetup (March): AI, GenAI and ML 2025-03-25 · 18:00

** Important RSVP HERE (Due to room capacity and venue security, it is required to pre-register at the link for admission)

Welcome to the AI meetup in London. Join us for deep dive tech talks on AI, GenAI, LLMs and machine learning, food/drink, networking with speakers and fellow developers.

Tech Talk: What you need to know about AI factories Speaker: Matt Shore (High Performance Computing & Artificial Intelligence) Abstract: As organisations start to move from proof of concept to production, they need to consider how to build their infrastructure from the ground up to be completely optimised for the next wave of AI’s requirements. From data to power and space, HPE will provide a whistlestop tour of the latest thinking across the AI stack, as well as what it means for each type of role in the organisation.

Tech Talk: Accessing and building with open-source models Speaker: Darin Verheijke (Recursal ai) Abstract: In this session, I will discuss the recent improvements of open-source LLM models, the difficulties in running these open-source models for yourself/your company, how you can easily access and make use of all these open-source models through Hugging Face and Featherless.ai and demo some self-made open-source applications for every developer to use.

Speakers/Topics: Stay tuned as we are updating speakers and schedules. If you have a keen interest in speaking to our community, we invite you to submit topics for consideration: Submit Topics

Sponsors: We are actively seeking sponsors to support AI developers community. Whether it is by offering venue spaces, providing food, or cash sponsorship. Sponsors will not only speak at the meetups, receive prominent recognition, but also gain exposure to our extensive membership base of 20,000+ AI developers in London and 500K+ worldwide.

AI Meetup (March): AI, GenAI and ML

IN PERSON: Apache Kafka meets Apache Flink 2025-01-23 · 18:00

Join us for our very first meetup of 2025! You'll learn all about how to use Apache Kafka beyond the consumer protocol and get an introduction to Apache Flink.

Date and Time: 🗓️ Thursday 23rd January, ⏰ 18:00 - 20:30 PM 🕘 Venue: Confluent Europe Ltd, 262 High Holborn, London WC1V 7EE, United Kingdom Attending Brands: OSO, Streambased, Gravitee & Confluent

Schedule: 18:00: Doors Open 18:00 - 18:30: Food, drinks, networking 18:30 - 19:00: "Accessing Kafka: beyond the consumer protocol" - Tom Scott (CEO Streambased) & Linus Hakansson (CPO Gravitee) 19:00 - 19:30: “Flink - Adi Polak (Director Developer Experience Engineering and Advocacy, Confluent) 19:30- 20:30pm: Additional Q&A, Networking

🎙️ \~Talk 1\~ Talk Title: Accessing Kafka: beyond the consumer protocol

Summary: The role of Kafka is expanding and with it the use cases it addresses. This brings many cool new features but also highlights some drawbacks. The standard producer/consumer pattern that has served us so well for so many years is no longer a good fit for all the things that Kafka data is used for and it's time to look beyond.Join Linus (Gravitee) and Tom (Streambased) for an in depth look at how you can interact with you Kafka clusters via REST, GraphQL, WebSockets, JDBC/ODBC and even as a simple filesystem.We'll outline the reasoning behind these new access patterns, the features that differentiate them (and the features that unite them) and show some live demos of the opportunities they create.

🗣️ Speaker 1: Tom Scott (CEO Streambased) is the founder of Streambased, Tom is building multi tenant, on-prem and cloud Kafka services to attack common Kafka pain points and break down barriers to starting your data journey.Linus Hakansson Linus Hakansson is the Chief Product Officer at Gravitee, building a next-generation management platform helping organizations secure, control and govern their Kafka and APIs

🗣️ Speaker 2: Linus Hakansson is the Chief Product Officer at Gravitee, building a next-generation management platform helping organizations secure, control and govern their Kafka and APIs

🎙️ \~Talk 2\~ Talk Title: Flink - demystifying data streaming

Summary: In an era where data velocity and volume continue to grow, the ability to process and analyze data streams in real-time is pivotal for businesses aiming to optimize operations, enhance decision-making, and maintain competitive advantages.

Apache Flink stands out as a comprehensive, open-source stream processing framework designed to meet these challenges head-on. In this session you will learn about data streaming through the lens of Apache Flink, offering insights into its architecture, capabilities, and how it seamlessly facilitates real-time data processing.Objectives: 1. Introduce Stream Processing: Provide a foundational understanding of stream processing - its importance\, use cases\, and when to use in comparison to batch processing. 2. Explore Apache Flink: Deep dive into Apache Flink's architecture\, key features\, and its unique approach to handling stateful computations\, event time processing\, and ensuring fault tolerance at scale.

🗣️ Speaker 1: Adi Polak, Director Developer Experience Engineering and Advocacy, Confluent. Adi is an experienced software engineer and people manager. For most of her professional life, she dealt with data and machine learning for transactional and analytics workloads by building large-scale systems. As a data practitioner, she developed software to solve real-world problems with Apache Spark, Kafka, HDFS, K8s, AWS, and Azure in high-throughput, high-scale production environments for companies like Akamai and Microsoft.Adi has taught Spark to thousands of students throughout the years and is the author of the successful book — Scaling Machine Learning with Spark.

When not thinking up new architecture, teaching new tech or pondering on a distributed systems challenge, you can find her at the local cultural scene.

IN PERSON: Apache Kafka meets Apache Flink

Open Source Data Deep Dive: London 2024-11-19 · 19:00

REGISTER HERE FOR LOCATION: https://lu.ma/2etm1zve

Come hang out at the OSS Data Deep Dive in London, where we'll explore some of the coolest and innovative use cases of the Iceberg ecosystem. Whether you're new to Iceberg, data lakehouses, or you’re a seasoned data engineer, discover how these tools can boost your data projects. Plus, there'll be plenty of networking, cool swag, and delicious food. Hope to see you there! Agenda:

Apache Iceberg REST Catalog: Making Catalog Interoperability Happen with Alex Merced, Senior Evangelist @ Dremio (co-author of “Apache Iceberg: The Definitive Guide”)
Apache Polaris: an Open Source Iceberg REST Catalog with Yufei Gu, Senior Software Engineer at Snowflake & Iceberg PMC member, Hadoop PMC member
Charting the Course: The Evolution and Future of Apache Iceberg and Polaris (incubating) with Jean-Baptiste Onofre, Board Member @ Apache Software Foundation

Open Source Data Deep Dive: London

Open Source Data Deep Dives 2024-11-19 · 18:00

RSVP HERE: Https://lu.ma/2etm1zve

Join us at the OSS Data Deep Dive in London for an in-depth workshop on Data Engineering Best Practices. This event is perfect for professionals who are keen to enhance their skills in handling big data efficiently.

Plus, there'll be plenty of networking, cool swag, and delicious food. Hope to see you there.

Agenda:

Apache Iceberg REST Catalog: Making Catalog Interoperability Happen with Alex Merced, Senior Evangelist @Dremio (co-author of "Apache Iceberg: The Definitive Guide")
Apache Polaris: an Open Source iceberg REST Catalog with Yufei Gu, Senior Software Engineer at Snowflake & Iceberg PMC member, Hadoop PMC member
Charting the Course: The Evolution and Future of Apache Iceberg and Polaris (incubating) with Jean-Baptistse Onofre, Board Member @Apache Software Foundation.

Our expert speakers will delve into topics like data modeling, ETL processes, data pipelines, and database architecture. Whether you are a seasoned data engineer or just starting in the field, this workshop will provide valuable insights and practical tips to streamline your data engineering projects. Don't miss out on this opportunity to network with fellow enthusiasts and take your data engineering skills to the next level!

Open Source Data Deep Dives

AI and Deep Learning for Enterprise #19 2024-10-08 · 18:00

Join us at Civo Tech Junction on October 8th for an evening of talks, food, and conversation with ML and AI industry pros.

Please note you will be unable to enter the venue before 6.30pm.

RSVPs will close 24 hours before the event, you may be unable to register after this time but you can still watch online.

If you can't join us in person you can watch remotely via our YouTube channel.

Agenda

06:30pm - Doors open, food and drink served

07:00pm - Welcome

07:05pm - A short talk from our hosts Daemon

07:10pm - Thomas Wood, Director of Fast Data Science "Project Harmony: a free online tool using LLMs for research in psychology and social sciences"

Thomas Wood will present our work on Harmony, harmonydata.ac.uk, which is a free online tool that uses generative AI and LLMs to help researchers compare items in questionnaires such as GAD-7 (used to measure anxiety), even when they are written in different languages. Harmony is open source under MIT License and is written in Python, and uses HuggingFace Sentence Transformers to find similarities between questionnaires. Harmony will soon allow researchers to discover datasets using a vector search.

07:50pm - Break

08:00 - Vikram Haridas, Lead Product Manager at Groupon "Implementing AI-Driven Product Innovations: Strategic Insights and Practical Applications"

Vikram Haridas, from Groupon, will reveal how AI can supercharge product roadmaps. Learn how to balance excitement with realism as you scale AI features and discover practical use cases. Get insights into Groupon's success with AI-powered deal optimization and automated merchant onboarding, and learn how to implement these strategies in your own business.

08:40 - Shubhangi Goyal, Data Analyst @ ICS.AI Ltd and Nidhi Agrawal Director @UBS, "Generative AI and its use cases"

Our session will explore the transformative potential of Generative AI, focusing on its use cases in matching algorithms and its applications in the financial industry. We'll dive into how AI models enhance person-matching processes by analyzing large datasets for customer service, and personalization. Additionally, we’ll examine how Generative AI is revolutionising the financial sector.

09:10pm - Wrap up, drinks at Angel London

Our hosts may require that we provide a list of all attendees, please ensure that you register with a name that matches your government issued ID or bank card: if you do not we cannot guarantee you entry to the building.

Please RSVP for the event well in advance if you plan to attend in person and unRSVP if you can no longer attend as limited spaces are available.

AI and Deep Learning for Enterprise #19

AI tools for software engineers, but without the hype – with Simon Willison (co-creator of Django) 2024-09-25 · 14:06

Simon Willison – co-creator of the Django Web Framework; founder/creator of Datasette @ Django (Web Framework) and Datasette (open-source project)

The first episode of The Pragmatic Engineer Podcast is out. Expect similar episodes every other Wednesday. You can add the podcast in your favorite podcast player, and have future episodes downloaded automatically. Listen now on Apple, Spotify, and YouTube. Brought to you by: • Codeium: Join the 700K+ developers using the IT-approved AI-powered code assistant. • TLDR: Keep up with tech in 5 minutes — On the first episode of the Pragmatic Engineer Podcast, I am joined by Simon Willison. Simon is one of the best-known software engineers experimenting with LLMs to boost his own productivity: he’s been doing this for more than three years, blogging about it in the open. Simon is the creator of Datasette, an open-source tool for exploring and publishing data. He works full-time developing open-source tools for data journalism, centered on Datasette and SQLite. Previously, he was an engineering director at Eventbrite, joining through the acquisition of Lanyrd, a Y Combinator startup he co-founded in 2010. Simon is also a co-creator of the Django Web Framework. He has been blogging about web development since the early 2000s. In today’s conversation, we dive deep into the realm of Gen AI and talk about the following: • Simon’s initial experiments with LLMs and coding tools • Why fine-tuning is generally a waste of time—and when it’s not • RAG: an overview • Interacting with GPTs voice mode • Simon’s day-to-day LLM stack • Common misconceptions about LLMs and ethical gray areas • How Simon’s productivity has increased and his generally optimistic view on these tools • Tips, tricks, and hacks for interacting with GenAI tools • And more! I hope you enjoy this episode. — In this episode, we cover: (02:15) Welcome (05:28) Simon’s ‘scary’ experience with ChatGPT (10:58) Simon’s initial experiments with LLMs and coding tools (12:21) The languages that LLMs excel at (14:50) To start LLMs by understanding the theory, or by playing around? (16:35) Fine-tuning: what it is, and why it’s mostly a waste of time (18:03) Where fine-tuning works (18:31) RAG: an explanation (21:34) The expense of running testing on AI (23:15) Simon’s current AI stack (29:55) Common misconceptions about using LLM tools (30:09) Simon’s stack – continued (32:51) Learnings from running local models (33:56) The impact of Firebug and the introduction of open-source (39:42) How Simon’s productivity has increased using LLM tools (41:55) Why most people should limit themselves to 3-4 programming languages (45:18) Addressing ethical issues and resistance to using generative AI (49:11) Are LLMs are plateauing? Is AGI overhyped? (55:45) Coding vs. professional coding, looking ahead (57:27) The importance of systems thinking for software engineers (1:01:00) Simon’s advice for experienced engineers (1:06:29) Rapid-fire questions — Where to find Simon Willison: • X: https://x.com/simonw • LinkedIn: https://www.linkedin.com/in/simonwillison/ • Website: https://simonwillison.net/ • Mastodon: https://fedi.simonwillison.net/@simon — Referenced: • Simon’s LLM project: https://github.com/simonw/llm • Jeremy Howard’s Fast Ai: https://www.fast.ai/ • jq programming language: https://en.wikipedia.org/wiki/Jq_(programming_language) • Datasette: https://datasette.io/ • GPT Code Interpreter: https://platform.openai.com/docs/assistants/tools/code-interpreter • Open Ai Playground: https://platform.openai.com/playground/chat • Advent of Code: https://adventofcode.com/ • Rust programming language: https://www.rust-lang.org/ • Applied AI Software Engineering: RAG: https://newsletter.pragmaticengineer.com/p/rag • Claude: https://claude.ai/ • Claude 3.5 sonnet: https://www.anthropic.com/news/claude-3-5-sonnet • ChatGPT can now see, hear, and speak: https://openai.com/index/chatgpt-can-now-see-hear-and-speak/ • GitHub Copilot: https://github.com/features/copilot • What are Artifacts and how do I use them?: https://support.anthropic.com/en/articles/9487310-what-are-artifacts-and-how-do-i-use-them • Large Language Models on the command line: https://simonwillison.net/2024/Jun/17/cli-language-models/ • Llama: https://www.llama.com/ • MLC chat on the app store: https://apps.apple.com/us/app/mlc-chat/id6448482937 • Firebug: https://en.wikipedia.org/wiki/Firebug_(software)# • NPM: https://www.npmjs.com/ • Django: https://www.djangoproject.com/ • Sourceforge: https://sourceforge.net/ • CPAN: https://www.cpan.org/ • OOP: https://en.wikipedia.org/wiki/Object-oriented_programming • Prolog: https://en.wikipedia.org/wiki/Prolog • SML: https://en.wikipedia.org/wiki/Standard_ML • Stabile Diffusion: https://stability.ai/ • Chain of thought prompting: https://www.promptingguide.ai/techniques/cot • Cognition AI: https://www.cognition.ai/ • In the Race to Artificial General Intelligence, Where’s the Finish Line?: https://www.scientificamerican.com/article/what-does-artificial-general-intelligence-actually-mean/ • Black swan theory: https://en.wikipedia.org/wiki/Black_swan_theory • Copilot workspace: https://githubnext.com/projects/copilot-workspace • Designing Data-Intensive Applications: The Big Ideas Behind Reliable, Scalable, and Maintainable Systems: https://www.amazon.com/Designing-Data-Intensive-Applications-Reliable-Maintainable/dp/1449373321 • Bluesky Global: https://www.blueskyglobal.org/ • The Atrocity Archives (Laundry Files #1): https://www.amazon.com/Atrocity-Archives-Laundry-Files/dp/0441013651 • Rivers of London: https://www.amazon.com/Rivers-London-Ben-Aaronovitch/dp/1625676158/ • Vanilla JavaScript: http://vanilla-js.com/ • jQuery: https://jquery.com/ • Fly.io: https://fly.io/ — Production and marketing by https://penname.co/. For inquiries about sponsoring the podcast, email [email protected].

Get full access to The Pragmatic Engineer at newsletter.pragmaticengineer.com/subscribe

AI/ML C#/.NET GenAI GitHub JavaScript LLM Marketing RAG Rust

The Pragmatic Engineer

Listen

From data to insights: Clojure for data deep dive (by Kira McLean) 2024-04-30 · 17:30

THIS IS AN ONLINE EVENT

[Connection details will be shared 1h before the start time]

The London Clojurians are happy to present: Title: From data to insights: Clojure for data deep dive Speaker: Kira McLean Time: 2024-04-30 @ 18:30 (London time) Local time: click here for local time

Kira McLean (https://github.com/kiramclean/) will be presenting: "From data to insights: Clojure for data deep dive"

In this session, participants will dive into the lesser-known corners of Clojure's data ecosystem. Learn how to extract meaningful insights from example datasets, uncovering the versatility of libraries like tablecloth, tech.ml.dataset, and fastmath to confidently tackle realistic and complex data challenges. Participants will leave equipped with the tools and techniques to effectively leverage Clojure's robust data science toolkit for insightful real-world data exploration and analysis.

Kira has been writing software since 2015, focusing on Clojure for the last 4 years. With a desire to pave the way for Clojure's broader recognition and adoption in the data science community, she's actively developing tools and guides aimed at showcasing the strengths of Clojure's data science toolkit. Her efforts are driven by a vision to broaden Clojure's adoption in the data world by improving the usability and effectiveness it's core libraries. An advocate for Clojure's potential in the world of data science, she's spending this year working exclusively on open source contributions to support and grow the Clojure data science ecosystem, supported by Clojurists Together and other generous sponsors.

If you missed this event, you can watch the recording on our YouTube channel: https://www.youtube.com/@LondonClojurians (The recording will be uploaded a couple of days after the event.)

Please, consider supporting the London Clojurians with a small donation: https://opencollective.com/london-clojurians/

Your contributions will enable the sustainability of the London Clojurians community and support our varied set of online and in-person events:

ClojureBridge London: supports under-represented groups discover Clojure
re:Clojure: our free to attend annual community conference
monthly meetup events with speakers from all over the world
subscription and admin costs such as domain name & StreamYard subscription

Thank you to our sponsors:

https://juxt.pro/
https://flexiana.com/
And many individual sponsors

From data to insights: Clojure for data deep dive (by Kira McLean)

PyData London - 84th Meetup 2024-04-02 · 18:00

Venue: Riverbank House, 2 Swan Ln, London EC4R 3AD - IMPORTANT: LOCATION UPDATED! Please note:

🚨🚨🚨A valid photo ID is required by building security. 🚨🚨🚨
This event follows the NumFOCUS Code of Conduct, please familiarise yourself with it before the event.

If your RSVP status says "You're going" you will be able to get in. No need to show your RSVP confirmation when signing in.

If you can no longer make it, please unRSVP as soon as you know.

Code of Conduct: This event follows the NumFOCUS Code of Conduct. Please get in touch with the organisers with any questions or concerns regarding the Code of Conduct.

As always, there'll be free food & drinks, generously provided by our host, Man Group.

Main Talks

Building Retrieval Augmented Generation (RAG) powered applications - Aniket Maurya

RAG extends the capability and knowledge base of large language models (LLMs) by augmenting prompts with proprietary and domain-specific knowledge without the need to retrain the LLM. It ensures information stays current and reduces hallucination by attributing the source. In this talk, the audience will get an overview of building RAG powered applications using open-source tools.

Getting python out of the way when taking ML models from research to production. A deep dive into the Open Neural Network Exchange (ONNX) - Aditya Goel

The Python data science ecosystem is unparalleled when it comes to model development and training. When moving models from research to production, Python creates many challenges from latency through to managing environments and dependencies. The Open Neural Network Exchange (ONNX) enables data practitioners to export their model to a self-contained, target independent protobuf representation. When paired with highly performant runtime and compiler technology, this leads to exceptionally high performance inference across many hardware targets, while massively simplifying to process of getting models from research into production. This talk will explore how data practitioners and software engineers can exploit ONNX to rapidly speed up the transfer of models from research to production.

⚡ Lightning Talks

Using Google Location Data and Reverse Geocoding to Explore your Personal Travel History - Jessica Walkenhorst

In this talk I will demonstrate how you can use reverse geocoding on your Google location data to gain a detailed understanding of your past travel history. Apart from being a fun exercise and bringing back great memories, the results of this analysis can be used to understand times spent abroad, information that is often required in the process of applying for residency permits and foreign passports.

2. Community Lightning Talk - Bring your own!

This is an opportunity for guests to bring their own lightning talk and spontaneously present on the evening!

Logistics Doors open at 6.30 pm (get there early as you have to sign-in via building security), talks start at 7 pm, drinks from 9 pm in the bar. We will have reduced capacity for this event but there will be plenty of people to discuss data science questions with!

Please unRSVP in good time if you realise you can't make it. We're limited by building security on the number of attendees, so please free up your place for your fellow community members!

Follow @pydatalondon (https://twitter.com/pydatalondon) for updates and early announcements.

PyData London - 84th Meetup

AI Meetup: ML and LLMs Infrastructure 2024-03-27 · 18:00

*** RSVP: https://www.aicamp.ai/event/eventdetails/W2024032710 (Due to limited room capacity, you must pre-register at the link for admission).

Welcome to the AI meetup in London. Join us for deep dive tech talks on AI, GenAI, LLMs and machine learning, food/drink, networking with speakers and fellow developers.

Agenda: * 6:00pm\~7:00pm: Checkin, Food/drink and Networking * 7:00pm\~9:00pm: Tech talks and Q&A * 9:00pm: Open discussion and Mixer

Tech Talk: Building GenAI and ML systems with OSS Metaflow Speaker: Hugo Bowne-Anderson (Outerbounds) Abstract: This talk explores a framework for how data scientists can deliver value with Generative AI: How can you embed LLMs and foundation models into your pre-existing software stack? How can you do so using Open Source Python? What changes about the production machine learning stack and what remains the same? This talk is aimed squarely at (data) scientists and ML engineers who want to focus on the science, data, and modeling, but want to be able to access all their infrastructural, platform, and software needs with ease!

Tech Talk: Harmony, Open source AI tool for psychology research Speaker: Thomas Wood (Fast Data Science) Abstract: In this talk, I will discuss AI for social sciences research and how to build a research tool with NLP and AI with open source tool Harmony, funded by Wellcome.

Speakers/Topics: Stay tuned as we are updating speakers and schedules. If you have a keen interest in speaking to our community, we invite you to submit topics for consideration: Submit Topics

Sponsors: We are actively seeking sponsors to support AI developers community. Whether it is by offering venue spaces, providing food, or cash sponsorship. Sponsors will not only speak at the meetups, receive prominent recognition, but also gain exposure to our extensive membership base of 10,000+ AI developers in London or 300K+ worldwide.

Community on Slack/Discord

Event chat: chat and connect with speakers and attendees
Sharing blogs, events, job openings, projects collaborations
Join Slack/Discord (link is at the bottom of the page) *

AI Meetup: ML and LLMs Infrastructure

PyData London - 82nd Meetup 2024-02-06 · 19:00

Venue: Riverbank House, 2 Swan Ln, London EC4R 3AD - IMPORTANT: LOCATION UPDATED! Please note:

🚨🚨🚨A valid photo ID is required by building security. 🚨🚨🚨
This event follows the NumFOCUS Code of Conduct, please familiarise yourself with it before the event.

If your RSVP status says "You're going" you will be able to get in. No need to show your RSVP confirmation when signing in.

If you can no longer make it, please unRSVP as soon as you know.

Code of Conduct: This event follows the NumFOCUS Code of Conduct. Please get in touch with the organisers with any questions or concerns regarding the Code of Conduct.

As always, there'll be free food & drinks, generously provided by our host, Man Group.

Main Talks

Toolbox of a not-so Data Scientist - Tambe Tabitha Achere

This talk is about building data science solutions in scenarios where demos cannot be done on a notebook and dashboards do not suffice as a final deliverable. By the end of this session, the audience will have an idea of how data scientists can build the logic behind full-stack applications without the need to learn a backend framework.

I will do a deep dive into one of my projects and there will be lots of code samples accompanied by explanations that led to design decisions. The project I'll be diving into is one in which the data could not be pulled in so if you've ever had to build for data you couldn't see, this session is for you too. I'll highlight the tools, packages and processes that enabled it to be built.

Boosting Similarity Search With Real-time Stream Processing - Fawaz Ghali

The goal of similarity search and vector databases is to find similar results to the search query for unstructured data, such as text, images, and videos. The unstructured data first is vectorized, and stored in a vector format. There are publicly available tools to create vectors from unstructured data; similarly, there are vector databases to store and perform similarity searches. This is important because of the rising popularity of Large Language Models (LLMs) and their combination with vector databases. Here, we present a hybrid approach by taking the strengths of vector databases and boosting them with traditional search and filtering techniques based on real-time stream processing. Vector databases are good for building high-performance vector search applications. On the other hand, stream processing can be used for real-time fast data storage for structured data (filters, tags, and contextual data). In this work, we're adding context and memory to vector databases to ingest, enrich, predict, and act on your data in a simplified but efficient approach. In this talk, we’ll focus on how Real-time compute APIs help leverage the processing capabilities of a distributed cluster, so you aren’t leaving large potential performance gains on the table. The combination of Real-time storage and computing provides a unique synergy that enables applications to address real-time use cases at any scale.

⚡ Lightning Talks

Open-Source Science (OSSci) - Tim Bonnemann

Open-Source Science (OSSci) is a new NumFOCUS initiative – launched in July 2022 in partnership with IBM – that aims to accelerate scientific research by improving the ways open source software in science gets done (built, used, funded, sustained, recognized, etc.). OSSci connects scientists, OSS developers and other stakeholders to share best practices, identify common pain points, and explore solutions together. The five OSSci interest groups to date cover domain-specific topics (chemistry/materials, life sciences/healthcare, climate/sustainability) as well as cross-domain topics (reproducibility, map of science), with more to be rolled out in 2024. This lightning talk will provide a brief overview of OSSci’s activities to date, our plans for 2024, and how you can get involved.

(Maybe) faster Pandas with CuDF on the GPU (perhaps) - Ian Ozsvald

NVIDIA's CuDF promises 100-1000x GPU speed ups with 100% compatibility, with a bit of effort it can be made to work. This talk shows what could work and which bits (including setup!) can be painful

Logistics Doors open at 6.30 pm (get there early as you have to sign-in via building security), talks start at 7 pm, drinks from 9 pm in the bar. We will have reduced capacity for this event but there will be plenty of people to discuss data science questions with!

Please unRSVP in good time if you realise you can't make it. We're limited by building security on the number of attendees, so please free up your place for your fellow community members!

Follow @pydatalondon (https://twitter.com/pydatalondon) for updates and early announcements.

PyData London - 82nd Meetup

AI Deep Dive: LLMs and Vector Databases 2023-09-06 · 17:00

** Raffle at the end: two Designing Machine Learning Systems, O'Reilly

GDG Cloud London is thrilled to be collaborating with AICamp (https://www.meetup.com/London-AI-Tech-Talk/) for a deep dive into the AI world.

The event will be held at Cathedral View, Carlisle Place, St. Vincents Centre, London, SW1P 1NL.

This meet-up is a unique opportunity to connect with fellow AI enthusiasts, industry practitioners, and researchers in a dynamic and interactive setting. Whether you are a seasoned AI professional or just curious about the latest advancements in Generative AI, LLMs and Vector Databases, this meet-up is for you! Join us for an insightful and thought-provoking discussion on the forefront of AI innovation and practices.

This event is sponsored by Weativate.

At the end of the event, we'll give away two O'Reilly books: Designing Machine Learning Systems (paperback).

Don't miss out, RSVP now!

Agenda

6:00 PM: Arrivals and Check In

6:30 PM: Welcome / Community Update

6:45 PM: Zain Hasan - Using Vector Databases with Multimodal Embeddings and Search At Scale

Many real-world problems are inherently multimodal, from the communicative modalities humans use such as spoken language and gestures to the force, proprioception, and visual sensors ubiquitous in robotics. In order for machine learning models to address these problems and interact more naturally and wholistically with the world around them and ultimately be more general and powerful reasoning engines we need them to understand data across all of its corresponding image, video, text, audio, and tactile representations. In this talk we will discuss how we can use open-source multimodal models, that can see, hear, read, and feel data(!), to perform cross-modal search(searching audio with images, videos with text etc.) at the billion-object scale with the help of open source vector databases. I will also demonstrate, with live code demos and large-scale datasets, how being able to perform this cross-modal retrieval in real-time can help users add natural search interfaces to their apps. This talk will revolve around how we scaled the usage of multimodal embedding models in production and how you can add cross-modal search into your apps.

7:15 PM: JP Hwang - Bringing LLMs to Your Data

In this talk, JP explains how Weaviate redefines what you thought was possible in a database. JP will begin by showing how you can use Weaviate to effectively search data, before moving on to show you how you can use generative search (retrieval augmented generation) with Weaviate to transform your data at retrieval time with LLMs.

7:45 PM: Raffle and Networking

Speakers

Zain Hasan - Weaviate (Developer Relations)

Joon-Pil Hwang - Weaviate (Developer Relations)

Hosted By

Amanda Cavallaro, GDG Organizer

I'm an Aikidoka, Developer Advocate, Software Developer, Google Developers Expert, Linkedin Learning Author and a Full Stack Web Development Specialist.

Saverio Terracciano, GDG Organizer

Stefano Le Pera, GDG Organizer

Lorenzo Turrino, GDG Organizer

Alessandro Puccetti, GDG Organizer

My name is Alessandro Puccetti, I am Italian 🇮🇹 but I am in fact a citizen of the world 🌎. I love travelling and meeting new people from different cultures, and I enjoy having a particular focus on their food 😉.

Kubra Harmankaya, GDG Organizer

Natalie Godec, GDG Organizer

Bruno Ripa, GDG Organizer

I am an italian software architect, in the industry since 2006. I have been in entrepreneurship in Italy, for 6 years, and then continued my career in United Kingdom, in London (2012), a city (or, better, the City) which I consider as my second home. I have worked in several industries (gaming, fintech, digital asset management) and in many companies, with a 3 years parenthesis in Spain (2017-2020), precisely in Barcelona, where I have worked as a contractor for a few USA startups and an european company working in IoT. In March 2020 I made my way back in London, working for Erlang Solutions. Actually I am a contractor and Consultant at BBC

Arianna Capizzi, GDG Organizer

Jen Kwon, GDG Organizer

Complete your event RSVP here: https://gdg.community.dev/events/details/google-gdg-cloud-london-presents-ai-deep-dive-llms-and-vector-databases/.

AI Deep Dive: LLMs and Vector Databases

AI Deep Dive: LLMs and Vector Databases 2023-09-06 · 17:00

Welcome to our in-person AI meetup, in collaboration with Google Developers Group. Join us for deep dive tech talks on AI/ML, food/drink, networking with speakers&peers developers, and win lucky draw prizes.

Pre-registration is required: https://www.aicamp.ai/event/eventdetails/W2023090610

[RSVP instructions]

Pre-register at the event website. (venue security may not let you in if you don't pre-register)
Contact us to submit topics and/or sponsor the meetup on venue/food/swags/prizes. https://forms.gle/JkMt91CZRtoJBSFUA
Community on Slack for events chat, speakers office hour and learning resources, job openings and projects collaboration. join slack (search and join #london channel) *

Description: This meet-up is a unique opportunity to connect with fellow AI enthusiasts, industry practitioners, and researchers in a dynamic and interactive setting. Whether you are a seasoned AI professional or just curious about the latest advancements in Generative AI, LLMs and Vector Databases, this meet-up is for you! Join us for an insightful and thought-provoking discussion on the forefront of AI innovation and practices.

Agenda (BST): * 6:00pm\~6:30pm: Checkin, Food/Snacks/Drink and networking * 6:30pm\~6:45pm: Welcome/community update * 6:45pm\~7:45pm: Tech talks * 7:45pm: Open discussion & Mixer

Tech Talk 1: Using Vector Databases with Multimodal Embeddings and Search At Scale Speaker: Zain Hasan @Weaviate Abstract: Many real-world problems are inherently multimodal, from the communicative modalities humans use such as spoken language and gestures to the force, proprioception, and visual sensors ubiquitous in robotics. In order for machine learning models to address these problems and interact more naturally and holistically with the world around them and ultimately be more general and powerful reasoning engines we need them to understand data across all of its corresponding image, video, text, audio, and tactile representations.

In this talk we will discuss how we can use open-source multimodal models, that can see, hear, read, and feel data(!), to perform cross-modal search(searching audio with images, videos with text etc.) at the billion-object scale with the help of open source vector databases. I will also demonstrate, with live code demos and large-scale datasets, how being able to perform this cross-modal retrieval in real-time can help users add natural search interfaces to their apps. This talk will revolve around how we scaled the usage of multimodal embedding models in production and how you can add cross-modal search into your apps.

Tech Talk 2: Bringing LLMs to Your Data Speaker: JP Huang @Weaviate Abstract: In this talk, JP explains how Weaviate redefines what you thought was possible in a database. JP will begin by showing how you can use Weaviate to effectively search data, before moving on to show you how you can use generative search (retrieval augmented generation) with Weaviate to transform your data at retrieval time with LLMs.

AI Deep Dive: LLMs and Vector Databases

AI Deep Dive: Creating your own ChatGPT with Apache Airflow 2023-07-13 · 17:00

** Big give-away at the end: two Designing Machine Learning Systems, O'ReillyGDG Cloud London is thrilled to be collaborating with AICamp (https://www.meetup.com/London-AI-Tech-Talk/) for a deep dive into the AI world.

The event will be held at GoCardless London office.

This meet-up is a unique opportunity to connect with fellow AI enthusiasts, industry practitioners, and researchers in a dynamic and interactive setting. Whether you are a seasoned AI professional or just curious about the latest advancements in NLP, LLMs and Airflow, this meet-up is for you! Join us for an insightful and thought-provoking discussion on the forefront of AI innovation.

At the end of the event, we'll give away two O'Reilly books: Designing Machine Learning Systems (paperback).

Don't miss out, RSVP now!

A big shout-out to our sponsor, Transparent (https://heytransparent.io).

Agenda

6:00 PM: Arrivals and Check In

6:30 PM: Welcome / Community Update

A quick intro from GDG Cloud London, AICamp and Transparent.io.

6:45 PM: Tatiana Al-Chueyr - Creating your own ChatGPT with Apache Airflow

Apache Airflow is an orchestration tool which allows users to build all sorts of pipelines, automating steps and allowing them to run on schedule reliably. Thousands of companies use Airflow to process ETL and machine learning pipelines worldwide. This talk will illustrate creating an Airflow pipeline to process data and train a custom ChatGPT.

7:15 PM: Marty Pitt - Using AI to create data pipelines and Service Orchestration with Orbital and Google Cloud

7:45 PM: Wrap up, Networking and Raffle!

We are going to give away two Designing Machine Learning Systems, O'Reilly books after the last talk!

Speakers

Tatiana Al-Chueyr - Astronomer (Staff Software Engineer)

Tatiana is a Staff Software Engineer at Astronomer and builds open-source authoring tools on top of Apache Airflow. She Graduated in Computer Engineering and has worked for over 18 years building highly scalable software for multiple organisations, including the Ministry of Science and Technology in Brazil, TV Globo and the BBC.

Marty Pitt - Orbital (Founder)

Hosted By

Amanda Cavallaro, GDG Organizer

I'm an Aikidoka, Developer Advocate, Software Developer, Google Developers Expert, Linkedin Learning Author and a Full Stack Web Development Specialist.

Saverio Terracciano, GDG Organizer

Stefano Le Pera, GDG Organizer

Lorenzo Turrino, GDG Organizer

Alessandro Puccetti, GDG Organizer

My name is Alessandro Puccetti, I am Italian 🇮🇹 but I am in fact a citizen of the world 🌎. I love travelling and meeting new people from different cultures, and I enjoy having a particular focus on their food 😉.

Kubra Harmankaya, GDG Organizer

Natalie Godec, GDG Organizer

Nodir Siddikov, GDG Organizer

Bruno Ripa, GDG Organizer

I am an italian software architect, in the industry since 2006. I have been in entrepreneurship in Italy, for 6 years, and then continued my career in United Kingdom, in London (2012), a city (or, better, the City) which I consider as my second home. I have worked in several industries (gaming, fintech, digital asset management) and in many companies, with a 3 years parenthesis in Spain (2017-2020), precisely in Barcelona, where I have worked as a contractor for a few USA startups and an european company working in IoT. In March 2020 I made my way back in London, working for Erlang Solutions. Actually I am a contractor and Consultant at BBC

Arianna Capizzi, GDG Organizer

Complete your event RSVP here: https://gdg.community.dev/events/details/google-gdg-cloud-london-presents-ai-deep-dive-creating-your-own-chatgpt-with-apache-airflow/.

AI Deep Dive: Creating your own ChatGPT with Apache Airflow

AI Deep Dive: AI Audio Pipelines & LLMs-powered Applications 2023-06-15 · 17:00

Complete your event RSVP here: https://gdg.community.dev/events/details/google-gdg-cloud-london-presents-ai-deep-dive-ai-audio-pipelines-llms-powered-applications/

Welcome to our in-person meet-up about Artificial Intelligence (AI)!

GDG Cloud London is thrilled to be collaborating with AICamp (https://www.meetup.com/London-AI-Tech-Talk/) for a deep dive into the AI world.

Join us for an engaging discussion on two cutting-edge topics: Haystack, an open source NLP framework by Deepset, and how to get the most out of your Audio/Video Content.

The event will be held at RightMove London office on 33 Soho Square.

This meet-up is a unique opportunity to connect with fellow AI enthusiasts, industry practitioners, and researchers in a dynamic and interactive setting. Whether you are a seasoned AI professional or just curious about the latest advancements in NLP, LLMs and AI Audio Pipelines, this meet-up is for you! Join us for an insightful and thought-provoking discussion on the forefront of AI innovation.

Don't miss out, RSVP now!

A big shout-out to our sponsor, Transparent (https://heytransparent.io).

Agenda

6:00 PM: Arrivals and Check In

6:30 PM: Welcome / Community Update

A quick intro from GDG Cloud London, AICamp and Transparent.io.

6:45 PM: Adam MacVeigh - AI Audio Pipelines

How to get the most out of your audio and video content with transcription and Natural Language Processing.

7:15 PM: Tuana Celik - Building LLM Powered NLP Applications with Haystack

In this talk we will take a look at Haystack, an open source NLP framework by deepset, and the current state of building NLP applications. We will look at challenges we frequently face using LLMs (such as hallucinations), and what we can do to mitigate them. We will also look at different applications of language models with various prompting implementations such as Agents, chat applications and more

7:45 PM: Wrap up and Networking

Speakers

Adam MacVeigh - News UK (Senior Data Scientist)

Tuana Celik - Deepset (Developer Advocate)

Tuana is a developer advocate at deepset, where she focuses on the open source NLP framework: Haystack. With a degree in Computer Science from the University of Bristol, she first started her career as a software engineer. Later, she returned to the world of machine learning as a developer advocate and now dedicates her time to helping the open source NLP community. headshot is attached.…

Hosted By

Amanda Cavallaro, GDG Organizer

I'm an Aikidoka, Developer Advocate, Software Developer, Google Developers Expert, Linkedin Learning Author and a Full Stack Web Development Specialist.

Saverio Terracciano, GDG Organizer

Stefano Le Pera, GDG Organizer

Lorenzo Turrino, GDG Organizer

Alessandro Puccetti, GDG Organizer

My name is Alessandro Puccetti, I am Italian 🇮🇹 but I am in fact a citizen of the world 🌎. I love travelling and meeting new people from different cultures, and I enjoy having a particular focus on their food 😉.

Kubra Harmankaya, GDG Organizer

Natalie Godec, GDG Organizer

Nodir Siddikov, GDG Organizer

Bruno Ripa, GDG Organizer

I am an italian software architect, in the industry since 2006. I have been in entrepreneurship in Italy, for 6 years, and then continued my career in United Kingdom, in London (2012), a city (or, better, the City) which I consider as my second home. I have worked in several industries (gaming, fintech, digital asset management) and in many companies, with a 3 years parenthesis in Spain (2017-2020), precisely in Barcelona, where I have worked as a contractor for a few USA startups and an european company working in IoT. In March 2020 I made my way back in London, working for Erlang Solutions. Actually I am a contractor and Consultant at BBC

Arianna Capizzi, GDG Organizer

Partners

AICamp (https://www.meetup.com/London-AI-Tech-Talk/)

AICamp is a global online learning platform for developers, engineers, data scientists to learn and practice AI/ML technology. * Online live tech talks, workshops, bootcamps, courses * 100k+ developers members from 100+ countries * Learning groups in 50+ major tech hub cities around the world.

Transparent (https://heytransparent.io)

A tech recruitment start-up

Complete your event RSVP here: https://gdg.community.dev/events/details/google-gdg-cloud-london-presents-ai-deep-dive-ai-audio-pipelines-llms-powered-applications/.

AI Deep Dive: AI Audio Pipelines & LLMs-powered Applications

AI Deep Dive: AI Audio Pipelines&LLMs-powered Applications 2023-06-15 · 17:00

Welcome to our in-person ML monthly meetup, in collaboration with GDG (Google Developers Group) Cloud London. Join us for deep dive tech talks on AI/ML, food/drink, networking with speakers&peers developers, and win lucky draw prizes.

Pre-registration is required here: https://www.aicamp.ai/event/eventdetails/W2023061510

[RSVP instructions]

Register at the event website. (full name and email is required for badges and check in)
Contact us to submit topics and/or sponsor the meetup on venue/food/swags/prizes. https://forms.gle/JkMt91CZRtoJBSFUA
Community on Slack for events chat, speakers office hour, sharing learning resources, job openings, etc... join slack (search and join the #london channel) *

Description: Join us for an engaging discussion on two cutting-edge topics: LLMs and how to get the most out of your Audio/Video Content.

This meet-up is a unique opportunity to connect with fellow AI enthusiasts, industry practitioners, and researchers in a dynamic and interactive setting. Whether you are a seasoned AI professional or just curious about the latest advancements in NLP, LLMs and AI Audio Pipelines, this meet-up is for you! Join us for an insightful and thought-provoking discussion on the forefront of AI innovation.

A big shout-out to our sponsor, Transparent (https://heytransparent.io).

Agenda (BST): * 6:00pm\~6:30pm: Checkin, Food/Snacks/Drink and networking * 6:30pm\~6:45pm: Welcome/community update * 6:45pm\~7:45pm: Tech talks * 7:45pm: Open discussion & Mixer

Tech Talk 1: AI Audio Pipelines Speaker: Adam MacVeigh, Data Scientist @News UK Abstract: How to get the most out of your audio and video content with transcription and Natural Language Processing.

Tech Talk 2: Building LLM Powered NLP Applications with Haystack Speaker: Tuana Celik, Developer Advocate @Deepset Abstract: In this talk we will take a look at Haystack, an open source NLP framework by deepset, and the current state of building NLP applications. We will look at challenges we frequently face using LLMs (such as hallucinations), and what we can do to mitigate them. We will also look at different applications of language models with various prompting implementations such as Agents, chat applications and more.

AI Deep Dive: AI Audio Pipelines&LLMs-powered Applications

talk-data.com

People (6 results)

Activities & events