Join our chill space, unwind, chat about Feminist AI and contribute to the PyData London DIY collage zine.
talk-data.com
Event
PyData London 2025
2025-06-06 – 2025-06-08
PyData
Activities tracked
2
Filtering by:
Ines Montani
Sessions & talks
Showing 1–2 of 2 · Newest first
Conquering PDFs: document understanding beyond plain text
2025-06-07
NLP and data science could be so easy if all of our data came as clean and plain text. But in practice, a lot of it is hidden away in PDFs, Word documents, scans and other formats that have been a nightmare to work with. In this talk, I'll present a new and modular approach for building robust document understanding systems, using state-of-the-art models and the awesome Python ecosystem. I'll show you how you can go from PDFs to structured data and even build fully custom information extraction pipelines for your specific use case.