In this talk, the speaker presents NuExtract, the first LLM specialized in extracting structured information (JSON output), and NuMarkdown, the first reasoning OCR LLM (RAG-ready Markdown output). The talk demonstrates low-hallucination open-source models that outclass frontier LLMs like GPT-5 and Gemini 2.5 while being orders of magnitude smaller, enabling private usage. It will demonstrate the abilities of these LLMs, show how to use them at scale, and discuss what’s coming next in information extraction.
talk-data.com
Topic
open-source llms
4
tagged
Activity Trend
1
peak/qtr
2020-Q1
2026-Q1
Day 2 self-paced session on Meta Llama and open-source LLMs.
Self-paced session exploring Meta Llama and open-source LLMs.
In this talk, I will share our journey, which began with local applications leveraging Open-Source LLMs and has now evolved to serve our internal teams. I will dive into the practical lessons we've learned along the way, offering insights that can help you navigate the complexities of LLM implementation.