Hands-on workshop to explore IBM Data Prep Kit for data preparation, including getting started, extracting content from PDFs, DOCX, and HTML, cleaning markup, deduplicating data, and removing low-quality or spam documents. The session will be run in Google Colab and is suitable for LLM app developers, data scientists, and data engineers. Prerequisites: comfortable with Python.
talk-data.com
Topic
google colab
1
tagged
Activity Trend
1
peak/qtr
2020-Q1
2026-Q1
Top Events
April 5-6: FREE 2-Day Deep Learning Fundamentals NVIDIA DLI Certification Course
2
[AI Alliance] Workshop: Preparing High Quality Datasets with Data Prep Kit
1
[AI Alliance] Workshop: Hands-on with Docling
1
[AI Alliance] Workshop: Hands-on with Data Prep Kit
1
[AI Alliance] Workshop: Preparing High Quality Datasets with Data Prep Kit
1
[AI Alliance] Workshop: Hands-on with Data Prep Kit
1
Einstieg in die Piwik PRO API für Reporting || FREE Community Training
1
[AI Alliance] Workshop: Hands-on with Data Prep Kit
1
[AI Alliance] Workshop: Hands-on with Docling
1
[AI Alliance] Workshop: Hands-on with Docling
1
April 5-6: FREE 2-Day Deep Learning Fundamentals NVIDIA DLI Certification Course
1
[AI Alliance] Workshop: Preparing High Quality Datasets with Data Prep Kit
1
Filtering by:
[AI Alliance] Workshop: Hands-on with Data Prep Kit
×