Some months ago, SemiAnalysis published a flashy article with the premise that organizations with GPUs in the magnitude of tens of thousands had so many resources that the rest of the startups and researchers with few GPUs were wasting their time doing things such as local fine-tuning and over-quantization.
Good stuff
Glad you liked it!