The reasoning is simple, if you add the cost of producing something to the cost of using it, then training an llm is equivalent to producing a movie, and llm inference is equivalent to watching said movie.
The reasoning is simple, if you add the cost of producing something to the cost of using it, then training an llm is equivalent to producing a movie, and llm inference is equivalent to watching said movie.
github.com/mifi/lossles...
github.com/mifi/lossles...