talk-data.com

Topic: llms

102 activities tagged

Activity Trend (2020-Q1 to 2026-Q1): peak of 19 per quarter

Top Events

AI Builders Summit 2025 | ODSC & Google Cloud event · 9
Breaking Out of DemoLand : Ship It NYC (Event Not full - Join waitlist) · 3
Prompting for Production: Ensuring the Quality of LLM Outputs in Product Feature · 2
AI and Deep Learning for Enterprise #15 · 2
Virtual Summit: Generative AI and Intelligent Agents · 2
How We Build High-Quality, User-Oriented LLM Features at Grammarly · 2
Google I/O Extended 2023 North America · 2
AI Meetup (June): GenAI, LLMs and ML · 1
[AI Alliance] Better Expert Agents with Dana, Agent-Native Programming Language · 1
Virtual Summit: LLMs and the Generative AI Revolution · 1
AI Meetup (October): GenAI, LLMs and Agents · 1
London Reactor Meetup · 1

Top Speakers

Martin Brodbeck (Priceline) · 2
Vinh Luong (Aitomatic) · 2
Yulia Khalus (Grammarly) · 2
Kelly Vanee (Capital One) · 2
Hussein Mehanna (Cruise) · 2
Noel Kenehan (Google) · 2
Dr. Murat Baday (Magnimind Academy) · 2
Dr. Yasin Ceran (KAIST) · 2
Davor Bonaci (DataStax) · 2
Vinay Chella (Netflix) · 2
Scott Johnston (Docker) · 2
David Xue (Astronomer) · 1

Activities

102 activities · Newest first


Plongée au coeur des LLM à long contexte (A Deep Dive into Long-Context LLMs)

2025-06-17 · Soirée LLM / Agents Intelligents

talk

by Laurent Picard (Google Cloud)

RAG retrieval augmented generation

Large language models (LLMs) have revolutionized natural-language problem solving, but do you know their limits? LLM context sizes vary considerably, from a few thousand to several million tokens, but what does that mean in practice?

In this session, we will cover the following points through illustrated examples and demos:

What is an LLM's context window?

How do data, tokens, performance, and cost relate to one another?

In practice, how can LLMs be pushed to their limits?

Which use cases can only be solved with a long context?

How does this differ from a RAG (Retrieval Augmented Generation) approach?

LLMs for Data People

2025-05-22 · The finally warmer Data Berlin Meetup

talk

by Francesco Mucio (Untitled Data Company)

Building BAML: A new programming language for LLMs — React for prompting

2025-05-08 · Breaking Out of DemoLand : Ship It NYC (Event Not full - Join waitlist)

talk

by Vaibhav Gupta (Boundary)

live coding programming languages prompting

Deep dive into building BAML, a programming language for LLM prompting, with live coding and production use cases across chatbots, healthcare ERP, and finance.

Leading LLMs for automating government RFP writing

2025-05-08 · Breaking Out of DemoLand : Ship It NYC (Event Not full - Join waitlist)

talk

by Gabe Villasana (GovEagle)

public sector rfp automation

Deep-dive into GovEagle's approach to automating government RFP writing with LLMs, including real-world challenges and trade-offs.

Shipping LLM-powered SEC/FINRA compliance tools for firms

2025-05-08 · Breaking Out of DemoLand : Ship It NYC (Event Not full - Join waitlist)

talk

by Allen Calderwood (Hadrius)

compliance tooling regtech

Live dive into how Hadrius builds LLM-powered compliance tools for firms managing ~$2T, including architecture, data flow, and deployment considerations.

Building Quality Linguistic Features in the Age of LLMs

2025-04-24 · The Power of AI in Communication and Data

talk

by Olena Nahorna (Grammarly)

NLP machine learning

The talk explores how large language models (LLMs) have accelerated the development of linguistic features. It focuses on how to adapt feature development processes to match this rapid pace and highlights key considerations for maintaining high-quality output in a fast-evolving AI landscape.

Can Compressing Foundation Models be as Easy as Image Compression?

2025-04-22 · #16: Compressing Foundation Models as Easy as Image Compression? by M. Genzel

talk

by Dr. Martin Genzel (Merantix Momentum)

acip compression foundation models iterative pruning quantization

Abstract: The talk introduces Any Compression via Iterative Pruning (ACIP), a novel approach designed to give users intuitive control over the compression-performance trade-off. ACIP uses a single gradient descent run of iterative pruning to establish a global parameter ranking, enabling immediate materialization of models of any target size. It demonstrates strong predictive performance on downstream tasks without costly fine-tuning and achieves state-of-the-art compression for open-weight LLMs, often complementing common quantization techniques.
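To illustrate why a single global parameter ranking makes it possible to materialize a model of any target size, here is a minimal sketch. It is not the ACIP method from the talk: the importance scores are made-up placeholders (ACIP derives its ranking from a gradient descent run of iterative pruning), and the names `global_ranking`, `materialize_mask`, `layer_a`, and `layer_b` are hypothetical.

```python
from typing import Dict, List, Tuple

Scores = Dict[str, List[float]]


def global_ranking(scores: Scores) -> List[Tuple[str, int]]:
    """Order every (layer, index) pair from most to least important."""
    entries = [(name, i) for name, vals in scores.items() for i in range(len(vals))]
    return sorted(entries, key=lambda e: scores[e[0]][e[1]], reverse=True)


def materialize_mask(
    ranking: List[Tuple[str, int]], scores: Scores, keep_ratio: float
) -> Dict[str, List[bool]]:
    """Keep the top `keep_ratio` fraction of parameters according to the ranking."""
    n_keep = int(len(ranking) * keep_ratio)
    mask = {name: [False] * len(vals) for name, vals in scores.items()}
    for name, i in ranking[:n_keep]:
        mask[name][i] = True
    return mask


# Hypothetical importance scores; a real method would compute these once.
scores = {
    "layer_a": [0.9, 0.1, 0.5],
    "layer_b": [0.7, 0.3, 0.2, 0.8, 0.05],
}

ranking = global_ranking(scores)                   # computed a single time
half = materialize_mask(ranking, scores, 0.5)      # keeps 4 of 8 parameters
quarter = materialize_mask(ranking, scores, 0.25)  # keeps 2 of 8 parameters
```

Because both masks are read off the same ranking, the smaller model's kept parameters are always a subset of the larger model's, and changing the target size requires no additional ranking pass.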

Building a state-of-the-art AI web researcher

2025-04-17 · AI Meetup (April): Agentic AI

talk

by Boris Toledano (Linkup)

ai agents genai search infrastructure web retrieval

In this session, we'll discuss the next-generation search infrastructure that gives AI agents seamless access to web information and hard-to-find intelligence. Traditional methods can't handle these new workflows, and legacy search engines, designed for human attention, aren't built for these emerging AI use cases. We will address: a) the power of web search for LLM-based applications; b) the need to avoid scraping legacy search engines; c) how we're building a new category of "searcher" models; and d) what you can power with a web retrieval engine, including demos.

Scaling AI in the Cloud: LLMs, Smart Search and Automated Tuning

2025-04-10 · Cloud Native Westminister - Meetup #3

talk

by Fawaz Ghali (Snowflake)

automated tuning smart search

Discover how to efficiently scale AI solutions while maintaining security and cost efficiency. Learn about integrating LLMs, intelligent search, and automated tuning into your business workflows to optimise performance and impact.

Shipping features across platforms at Grammarly

2025-03-20 · Tackling the Challenges of Cross-Platform Features at Grammarly

talk

by Andrew Garkavyi (Grammarly), Lesha Levzhynskyi (Grammarly)

ai frontend full-stack hybrid multiplatform development native web

Andrew Garkavyi and Lesha Levzhynskyi discuss the history and present state of shipping features across Grammarly's multiple platforms, recounting challenges and approaches from fully native to web to hybrid, and addressing overlays, assistant and chat modes in the age of LLMs.