Scaling Large ML Models to Small Devices with Atila Orhon
Software Engineering Daily - A podcast by Software Engineering Daily
The size of ML models is growing into the many billions of parameters. This poses a challenge for running inference on non-dedicated hardware like phones and laptops. Argmax is a startup focused on developing methods to run large models on commodity hardware. A key observation behind their strategy is that the largest models are getting