The BIOSCAN-5M dataset features five million specimens from 47 countries with paired high-resolution images and DNA barcodes for every sample. The dataset’s hierarchical taxonomic labels, geographic data, and long-tail distribution of rare species offer valuable resources for ecological research and AI model training. BIOSCAN-5M represents a significant advancement in biodiversity informatics, facilitated by the International Barcode of Life and the BIOSCAN project, and is publicly available for download via Hugging Face and PyPI.
talk-data.com
S
Speaker
Scott C. Lowe
1
talks
machine learning researcher
Vector Institute
Scott C. Lowe is a British machine learning researcher based at the Vector Institute in Toronto, Canada. His work is multidisciplinary, spanning several topics. Recently he has focused on biodiversity monitoring applications for both insects (BIOSCAN) and ocean habitats (BenthicNet), self-supervised learning, reasoning capabilities of LLMs, and symbolic music generation. Previously, he completed his PhD in Neuroinformatics from the University of Edinburgh.
Bio from: Feb 20 - Virtual AI, ML and Computer Vision Meetup
Filtering by:
Feb 20 - Virtual AI, ML and Computer Vision Meetup
×
Filter by Event / Source
Talks & appearances
Showing 1 of 4 activities