We design and apply quantization algorithms for PyTorch DNNs across modern architectures, using PyTorch internals mechanisms to automatically balance quality and speed. We then compile the quantized checkpoints to deliver real-world speedup on different hardware.
talk-data.com
Topic
dnns
1
tagged
Activity Trend
1
peak/qtr
2020-Q1
2026-Q1