talk-data.com talk-data.com

PyData talk 2023-04-19 at 08:50

Shrinking gigabyte sized scikit-learn models for deployment

Description

We present an open source library to shrink pickled scikit-learn and lightgbm models. We will provide insights of how pickling ML models work and how to improve the disk representation. With this approach, we can reduce the deployment size of machine learning applications up to 6x.