talk-data.com talk-data.com

E

Speaker

Eric Tramel

1

talks

Principal Research Scientist NVIDIA

Eric W. Tramel is a Principal Research Scientist at NVIDIA, where he drives research in synthetic data generation for AI systems since the acquisition of his previous company, Gretel.ai. Previously, he directed ML for disease-progression prediction and synthetic patient record generation at Unlearn.AI, advanced on-device wake-word models for Alexa at Amazon, and spent four years leading federated-learning research across multi-hospital collaborations at Owkin. Eric’s academic roots include post-doctoral work at École Normale Supérieure on the statistical physics of machine learning and a Ph.D. in information theory focused on compressed sensing and high-dimensional signal recovery.

Bio from: Data + AI Summit 2025

Filter by Event / Source

Talks & appearances

1 activities · Newest first

Search activities →
Generating Zero-Shot Hard-Case Hallucinations: A Synthetic and Open Data Approach

We present a novel framework for designing and inducing controlled hallucinations in long-form content generation by LLMs across diverse domains. The purpose is to create fully-synthetic benchmarks and mine hard cases for iterative refinement of zero-shot hallucination detectors. We will first demonstrate how Gretel Data Designer (now part of NVIDIA) can be used to design realistic, high-quality long-context datasets across various domains. Second, we will describe our reasoning-based approach to hard-case mining. Specifically, our methodology relies on chain-of-thought-based generation of both faithful and deceptive question-answer pairs based upon long-context samples. Subsequently, a consensus labeling & detector framework is employed to filter synthetic examples to zero-shot hard cases. The result of this process is a fully-automated system, operating under open data licenses such as Apache-2.0, for the generation of hallucinations at the edge-of-capabilities for a target LLM to detect.