avatar

Scaling Large ML Models to Small Devices with Atila Orhon

Software Engineering Daily
Software Engineering Daily
Episode • May 7, 2024 • 56m

The size of ML models is growing into the many billions of parameters. This poses a challenge for running inference on non-dedicated hardware like phones and laptops. Argmax is a startup focused on developing methods to run large models on commodity hardware. A key observation behind their strategy is that the largest models are getting

The post Scaling Large ML Models to Small Devices with Atila Orhon appeared first on Software Engineering Daily.

Switch to the Fountain App