Join Kostia Omelianchuk and Lukas Beisteiner as they unpack the full scope of Grammatical Error Correction (GEC), from task framing, evaluation, and training to inference optimization and serving high-performance production systems at Grammarly. They will discuss the modern GEC recipe (the shift from heavily human-annotated corpora to semi-synthetic data generation), LLM-as-a-judge techniques for scalable evaluation, and techniques for making deployment fast and affordable, including speculative decoding.
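The speculative decoding mentioned above can be illustrated with a toy sketch (not Grammarly's implementation): a cheap draft model proposes a block of tokens greedily, and the expensive target model verifies them, keeping the longest agreeing prefix plus one corrected token. All model names and phrases here are invented for illustration.

```python
def draft_model(context):
    # Hypothetical cheap model: deterministically continues a fixed phrase.
    phrase = ["the", "cat", "sat", "on", "the", "mat"]
    return phrase[len(context) % len(phrase)]

def target_model(context):
    # Hypothetical expensive model: agrees until position 3, then diverges.
    phrase = ["the", "cat", "sat", "under", "the", "rug"]
    return phrase[len(context) % len(phrase)]

def speculative_step(context, k=4):
    """Draft k tokens, then keep the prefix the target model agrees with,
    plus one corrected token from the target. Returns the accepted tokens."""
    drafts, ctx = [], list(context)
    for _ in range(k):
        tok = draft_model(ctx)
        drafts.append(tok)
        ctx.append(tok)
    accepted, ctx = [], list(context)
    for tok in drafts:
        target_tok = target_model(ctx)
        if tok == target_tok:
            accepted.append(tok)          # draft verified: accepted cheaply
            ctx.append(tok)
        else:
            accepted.append(target_tok)   # mismatch: take target's token, stop
            break
    return accepted

print(speculative_step([]))  # → ['the', 'cat', 'sat', 'under']
```

The speedup comes from the target model checking all k drafted positions in one parallel pass, rather than generating them one at a time.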
talk-data.com
Topic: inference (5 tagged)
Top Events
Ever wondered what actually happens when you call an LLM API? This talk breaks down the inference pipeline from tokenization to text generation, explaining what's really going on under the hood. The speaker will walk through the key sampling strategies and their parameters: temperature, top-p, top-k, and beam search. The talk also covers performance tricks like quantization, KV caching, and prompt caching that can speed things up significantly, and, time permitting, use-case-specific techniques like pass@k and majority voting.
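The sampling knobs named in that abstract can be sketched in a few lines. This is a minimal illustrative implementation of temperature scaling, top-k truncation, and top-p (nucleus) filtering over a toy next-token distribution; the function name and example vocabulary are assumptions, not from the talk.

```python
import math
import random

def sample_token(logits, temperature=1.0, top_k=None, top_p=None, rng=random):
    # Temperature: divide logits by T; lower T sharpens the distribution.
    items = sorted(((tok, l / temperature) for tok, l in logits.items()),
                   key=lambda x: x[1], reverse=True)
    # Top-k: keep only the k highest-scoring tokens.
    if top_k is not None:
        items = items[:top_k]
    # Softmax over the surviving tokens (shifted by the max for stability).
    m = max(l for _, l in items)
    probs = [(tok, math.exp(l - m)) for tok, l in items]
    z = sum(p for _, p in probs)
    probs = [(tok, p / z) for tok, p in probs]
    # Top-p (nucleus): keep the smallest prefix whose mass reaches p, renormalize.
    if top_p is not None:
        kept, mass = [], 0.0
        for tok, p in probs:
            kept.append((tok, p))
            mass += p
            if mass >= top_p:
                break
        z = sum(p for _, p in kept)
        probs = [(tok, p / z) for tok, p in kept]
    # Draw one token from the filtered, renormalized distribution.
    r, acc = rng.random(), 0.0
    for tok, p in probs:
        acc += p
        if r <= acc:
            return tok
    return probs[-1][0]

logits = {"cat": 2.0, "dog": 1.0, "fish": 0.1}
# As temperature approaches 0, sampling becomes effectively greedy:
print(sample_token(logits, temperature=1e-6))  # → "cat"
```

Beam search is a different beast (it keeps multiple candidate sequences rather than sampling one token at a time), so it is not shown here.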
Finetuning and Inference
Finetuning and inference workflows for geospatial foundation models using TerraTorch.
We will dive deeper into inference, from Apple Silicon to huge production clusters, and cover tricks for making it faster.