talk-data.com

Topic

ocr

Activity Trend: peak of 2 activities per quarter, 2020-Q1 to 2026-Q2

Top Events

Data Science Retreat Demo Day #37 1

[AI Alliance] Workshop: Hands-on with Docling 1

Building AI Agents with Multimodal Models: NVIDIA DLI Workshop for Academia 1

Data Science Retreat Demo Day #41 1

Building AI Agents with Multimodal Models: NVIDIA DLI Workshop for Academia 1

Outclassing Frontier LLMs at Extracting Information 1

[AI Alliance] Workshop: Hands-on with Docling 1

Top Speakers

Etienne Bernard (NuMind) 1

Activities

7 activities · Newest first

Outclassing Frontier LLMs at Extracting Information

2025-12-22 · Outclassing Frontier LLMs at Extracting Information

talk

by Etienne Bernard (NuMind)

RAG information extraction json output nuextract numarkdown open-source llms

The speaker presents NuExtract, the first LLM specialized in extracting structured information (JSON output), and NuMarkdown, the first reasoning OCR LLM (RAG-ready Markdown output). Both are low-hallucination open-source models that outclass frontier LLMs such as GPT-5 and Gemini 2.5 while being orders of magnitude smaller, which makes private usage possible. The talk demonstrates what these models can do, shows how to use them at scale, and discusses what is coming next in information extraction.

Part 3: Cross-modal Projection

2025-12-20 · Building AI Agents with Multimodal Models: NVIDIA DLI Workshop for Academia

workshop

pdf processing vision-language models

Transform an LLM into a Vision Language Model (VLM). Process PDFs like a pro with OCR tools.

Part 3: Cross-modal Projection

2025-09-27 · Building AI Agents with Multimodal Models: NVIDIA DLI Workshop for Academia

workshop

LLM pdf processing vision language model (vlm)

Transform an LLM into a Vision Language Model (VLM). Process PDFs like a pro with OCR tools.

EasyLens – Smart AI Assistant for Everyday Tasks in Germany

2025-04-15 · Data Science Retreat Demo Day #41

talk

ai computer vision core ml ios

EasyLens is an iOS app that uses AI and computer vision to simplify daily tasks like waste sorting, understanding German documents, and identifying tourist spots. Users simply point their camera and select a feature; the app does the rest using Core ML and OCR.

Docling Hands-on Workshop

2025-03-13 · [AI Alliance] Workshop: Hands-on with Docling

Hands-on workshop

HTML Python docling docx google colab pdf

Hands-on session exploring how to use Docling for data extraction and cleanup across PDFs, HTML, and DOCX. Includes getting started with Docling, extracting content from documents, handling table and image data, and extracting content from scanned PDF documents using OCR.

Getting started with Docling

2025-03-13 · [AI Alliance] Workshop: Hands-on with Docling

Hands-on workshop

HTML Python docling docx google colab pdf

Hands-on workshop on using Docling to extract and clean data from PDF and HTML documents, with OCR for scanned PDFs. Key activities: getting started with Docling; extracting content from PDFs and HTML; handling table and image data; extracting content from scanned PDFs using OCR.

Building a web scraping library

2024-04-04 · Data Science Retreat Demo Day #37

talk

LLM automation scripts web scraping

The project aims to build a web scraping library that uses OCR, an LLM, and automation scripts to retrieve data from highly protected websites that have no APIs.