Discussion about this post

User's avatar
JP's avatar

The painter analogy for batch size tradeoffs clicked for me in a way the usual GPU utilisation charts don't. Your point about model labs backfilling idle capacity with training runs is exactly what I found when looking into who actually captures the money in inference. The companies that build and run models get to amortise hardware across workloads nobody else has. I dug into the broader economics here: https://medium.datadriveninvestor.com/who-profits-when-ai-models-are-free-b71ae03f4167

Paul's avatar
Feb 24Edited

thank you, llm inference is always interesting!

1 more comment...

No posts

Ready for more?