avatar

Ep. 567 w/ Brian Stevens CEO at Neural Magic

Building The Future Show - Radio / TV / Podcast
Building The Future Show - Radio / TV / Podcast
Episode • Apr 23, 2024 • 46m

Together with our community, we engineer sparse LLM, CV, and NLP models that are more efficient and performant in production. Why does this matter? Sparse models are more flexible and can achieve unrivaled latency and throughput performance on your private CPU and GPU infrastructure. Check us out on GitHub and join the Neural Magic Slack Community to get started with software-delivered AI.

http://neuralmagic.com/

Building The Future Show - Radio / TV / Podcast • Ep. 567 w/ Brian Stevens CEO at Neural Magic • Listen on Fountain