[AI] Small Language Models and Training PubMedGPT - Naveen Rao
The Swyx Mixtape - En podcast af Swyx
Naveen Rao is the CEO and Co-Founder of the machine learning (ML) training platform, MosaicML, and the former CEO and Co-Founder of Nervana Systems. Naveen shares insight into the thesis behind Mosaic and the practical applications of Large Language Models (LLMs), as well as the Generative Pre-trained Transformer-2 (GPT-2) to GPT-3 transition and the challenge of training models with the constraint of data limits.