Day 2 focuses on visual dataset curation with FiftyOne and iterative improvement of image classification models.
Focus on building and training neural networks with PyTorch.
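For a flavor of the material covered, a minimal PyTorch training loop might look like the sketch below. The model, data, and hyperparameters are placeholders for illustration, not the workshop's actual code.

```python
import torch
import torch.nn as nn
from torch.utils.data import DataLoader, TensorDataset

# Toy tensors standing in for a real image dataset
X = torch.randn(256, 32)
y = torch.randint(0, 10, (256,))
loader = DataLoader(TensorDataset(X, y), batch_size=32, shuffle=True)

# Small feed-forward classifier as a placeholder model
model = nn.Sequential(nn.Linear(32, 64), nn.ReLU(), nn.Linear(64, 10))
optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)
criterion = nn.CrossEntropyLoss()

for epoch in range(5):
    for xb, yb in loader:
        optimizer.zero_grad()
        loss = criterion(model(xb), yb)
        loss.backward()
        optimizer.step()
    print(f"epoch {epoch}: loss {loss.item():.4f}")
```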
Focus on visual dataset curation with FiftyOne and iterative improvement of image classification models.
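A minimal sketch of what dataset curation in FiftyOne can look like, using a public zoo dataset as a stand-in for the workshop data:

```python
import fiftyone as fo
import fiftyone.zoo as foz

# Load a small public dataset as a stand-in for the workshop data
dataset = foz.load_zoo_dataset("cifar10", split="test", max_samples=500)

# Tag a random slice of samples for manual review
view = dataset.take(50, seed=51)
view.tag_samples("needs_review")

# Open the interactive curation UI; wait() keeps it alive when run as a script
session = fo.launch_app(dataset)
session.wait()
```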
Hands-on workshop on cleaning and preparing high-quality datasets using Data Prep Kit. Topics include extracting content from PDFs and HTML, cleaning up markup, detecting and removing SPAM content, scoring and removing low-quality documents, identifying and removing PII data, and detecting and removing HAP (Hate Abuse Profanity) speech. More about Data Prep Kit: https://github.com/IBM/data-prep-kit
Hands-on workshop on using Data Prep Kit to extract content from PDFs/HTML, clean up data, remove SPAM, score and remove low-quality documents, identify and remove PII data, and detect and remove HAP (Hate Abuse Profanity) speech to improve dataset quality. Code will be run in Google Colab using Python.
Hands-on workshop on using Data Prep Kit to clean and prepare high-quality datasets: extract content from PDFs/HTML, clean up markup, remove SPAM, score and filter out low-quality documents, identify and remove PII data, and detect hate and abusive language. Prerequisites: comfortable with Python; the workshop runs in Google Colab.
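Data Prep Kit ships these steps as ready-made transforms. Purely as a conceptual illustration of the PII-removal step, here is a plain-Python, regex-based sketch; the patterns and function name are illustrative and are not Data Prep Kit's API.

```python
import re

# Illustrative patterns only; DPK's PII transform is far more thorough
EMAIL_RE = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")
PHONE_RE = re.compile(r"(?:\+?\d{1,3}[ -]?)?(?:\(\d{3}\)[ -]?)?\d{3}[ -]?\d{4}\b")

def redact_pii(text: str) -> str:
    """Replace obvious email addresses and phone numbers with placeholders."""
    text = EMAIL_RE.sub("[EMAIL]", text)
    text = PHONE_RE.sub("[PHONE]", text)
    return text

doc = "Contact Jane at jane.doe@example.com or +1 (555) 123-4567."
print(redact_pii(doc))
# -> Contact Jane at [EMAIL] or [PHONE].
```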
Hands-on session to explore Data Prep Kit and accelerate data preparation for building robust LLM applications. Topics include getting started with Data Prep Kit, extracting content from PDFs, DOCX, and HTML, cleanup of excess markup, detecting/removing duplicate documents, and removing low-quality and spam documents. Attendees should be comfortable with Python; workshop code will run in Google Colab.
Hands-on workshop to explore IBM Data Prep Kit for data preparation, including getting started, extracting content from PDFs, DOCX, and HTML, cleaning markup, deduplicating data, and removing low-quality or spam documents. The session will be run in Google Colab and is suitable for LLM app developers, data scientists, and data engineers. Prerequisites: comfortable with Python.
Hands-on session to explore Data Prep Kit and how to accelerate data preparation for LLM applications. The workshop covers getting started with Data Prep Kit, extracting content from PDFs, DOCX, and HTML, cleaning markup, deduplicating content, and detecting/removing low-quality or spam documents.
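Again, Data Prep Kit provides these steps as configurable transforms. Purely to illustrate what the exact-deduplication step does, a hash-based sketch in plain Python (not DPK's actual API) could be:

```python
import hashlib

def dedupe_exact(documents: list[str]) -> list[str]:
    """Keep the first occurrence of each document, dropping exact duplicates."""
    seen: set[str] = set()
    unique = []
    for doc in documents:
        digest = hashlib.sha256(doc.strip().encode("utf-8")).hexdigest()
        if digest not in seen:
            seen.add(digest)
            unique.append(doc)
    return unique

docs = ["Hello world", "Hello world", "Another document"]
print(dedupe_exact(docs))  # ['Hello world', 'Another document']
```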
Hands-on workshop exploring Docling for data wrangling and document extraction. Topics include getting started with Docling, extracting content from PDFs and HTML, handling tables and images, and extracting content from scanned PDFs using OCR.
Hands-on session exploring how to use Docling for data extraction and cleanup across PDFs, HTML, and DOCX. Includes getting started with Docling, extracting content from documents, handling table and image data, and extracting content from scanned PDF documents using OCR.
Hands-on workshop on using Docling to extract and clean data from documents, including PDFs, HTML, and OCR for scanned PDFs. Key activities: getting started with Docling; extracting content from PDFs/HTML; handling table and image data; extracting content from scanned PDFs using OCR.
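A minimal sketch of Docling's basic conversion flow, assuming a local PDF path as a placeholder (OCR and table handling are configured separately and are not shown):

```python
from docling.document_converter import DocumentConverter

# Convert a document (PDF, DOCX, HTML, ...) into Docling's unified representation
converter = DocumentConverter()
result = converter.convert("report.pdf")  # placeholder path

# Export the parsed content, e.g. as Markdown for downstream processing
markdown = result.document.export_to_markdown()
print(markdown[:500])
```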
Contents: Piwik PRO offers plenty of APIs for every conceivable purpose, including targeted retrieval of consolidated figures, raw data, and reports. We will look at what is needed to use the API, how to retrieve data, and how to put that data to use for different purposes. We will use Python and Google Colab notebooks as the foundation and start from scratch, so that anyone who wishes can retrace the individual steps with their own data, in parallel or later, and build on them. Programming skills are not strictly required; that, too, is an advantage of the tool stack we will take a closer look at in this training.
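As a rough sketch of the pattern the training walks through: authenticate against the Piwik PRO API and pull a report with Python's requests library. The account domain, credentials, endpoint path, and query body below are placeholders and assumptions; consult the Piwik PRO API documentation for the exact parameters.

```python
import requests

ACCOUNT = "https://example.piwik.pro"   # placeholder account domain
CLIENT_ID = "YOUR_CLIENT_ID"            # placeholder credentials
CLIENT_SECRET = "YOUR_CLIENT_SECRET"

# 1) Obtain an access token (assumed OAuth2 client-credentials flow)
token_resp = requests.post(
    f"{ACCOUNT}/auth/token",
    data={
        "grant_type": "client_credentials",
        "client_id": CLIENT_ID,
        "client_secret": CLIENT_SECRET,
    },
)
token = token_resp.json()["access_token"]

# 2) Query the Analytics API (endpoint and body are illustrative)
query = {
    "website_id": "YOUR_SITE_ID",
    "columns": [{"column_id": "sessions"}],
    "date_from": "2024-01-01",
    "date_to": "2024-01-31",
}
report = requests.post(
    f"{ACCOUNT}/api/analytics/v1/query/",
    headers={"Authorization": f"Bearer {token}"},
    json=query,
)
print(report.json())
```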