We design and apply quantization algorithms for PyTorch DNNs across modern architectures, using PyTorch internals mechanisms to automatically balance quality and speed. We then compile the quantized checkpoints to deliver real-world speedup on different hardware.
talk-data.com
Company
TheStage AI
Speakers
1
Activities
1
Speakers from TheStage AI
Talks & appearances
1 activities from TheStage AI speakers