talk-data.com talk-data.com

A

Speaker

Aditya Gautam

1

talks

Machine Learning Lead Meta

Aditya is a seasoned Machine learning practitioner, currently working on LLM (llama) application to enhance recommendation and ranking algorithms at scale. He has led several critical Machine learning projects in Facebook reels including user interest exploration, trend detection, quality improvement and safeguarding policy by detection violation and mitigating misinformation. He holds a master’s degree from Carnegie Mellon University, has worked in Machine learning at Google and has been a founding engineer of an AI startup at Area 120 (Google Incubator). Aditya has been quite active in the Generative AI community and is actively contributing through different speaking, panel and research engagement at various conferences.

Bio from: Data + AI Summit 2025

Filter by Event / Source

Talks & appearances

1 activities · Newest first

Search activities →
Optimize Cost and User Value Through Model Routing AI Agent

Each LLM has unique strengths and weaknesses, and there is no one-size-fits-all solution. Companies strive to balance cost reduction with maximizing the value of their use cases by considering various factors such as latency, multi-modality, API costs, user need, and prompt complexity. Model routing helps in optimizing performance and cost along with enhanced scalability and user satisfaction. Overview of cost-effective models training using AI gateway logs, user feedback, prompt, and model features to design an intelligent model-routing AI agent. Covers different strategies for model routing, deployment in Mosaic AI, re-training, and evaluation through A/B testing and end-to-end Databricks workflows. Additionally, it will delve into the details of training data collection, feature engineering, prompt formatting, custom loss functions, architectural modifications, addressing cold-start problems, query embedding generation and clustering through VectorDB, and RL policy-based exploration.