In the video presentation embedded below, our friends over at Neural Magic present a compelling workshop: How to Optimize Deep Learning Models for Production. Topics covered:
- What model pruning is, including benefits and downsides
- SOTA pruning algorithms and techniques that you can implement today
- SparseML, an open-source tool that makes pruning easy and successful
- Guaranteed ways to get production performance out of a pruned model.
After watching this video, you’ll be able to optimize your NLP and/or computer vision model, apply your own data with a few lines of code, and deploy it on commodity CPUs at GPU-level speeds.
Sign up for the free insideAI News newsletter.
Join us on Twitter: @InsideBigData1 – https://twitter.com/InsideBigData1
Speak Your Mind