Ep. 567 w/ Brian Stevens, CEO at Neural Magic

Building The Future Show - Radio / TV / Podcast

Episode • Apr 23, 2024 • 46m

Together with our community, we engineer sparse LLM, CV, and NLP models that are more efficient and performant in production. Why does this matter? Sparse models are more flexible and can achieve unrivaled latency and throughput performance on your private CPU and GPU infrastructure. Check us out on GitHub and join the Neural Magic Slack Community to get started with software-delivered AI.

http://neuralmagic.com/
