Rafid Mahmood
rafidrm.bsky.social
Rafid Mahmood
@rafidrm.bsky.social
Assistant Prof @ Telfer School of Management + Sr. Research Scientist @ NVIDIA working on data-centric ML & operations of AI systems

https://rafidrm.github.io
You can read more in our paper here:

arxiv.org/abs/2411.02661

We will also be presenting this work at Neurips in a few weeks. Feel free to swing by our poster on Friday Dec 13!

16/16
Pricing and Competition for Generative AI
Compared to classical machine learning (ML) models, generative models offer a new usage paradigm where (i) a single model can be used for many different tasks out-of-the-box; (ii) users interact with ...
arxiv.org
November 25, 2024 at 5:55 PM
Finally, we assume everyone knows how good all models are. This is defensible since many models are available on the Chatbot Arena, but the problem becomes much harder if you have to price while forecasting how good competitors will get in the near future.

15/n
November 25, 2024 at 5:55 PM
Moreover, AI technology is subject to the well-known AI flywheel effect where companies are incentivized to release a product sooner in order to collect data and reap performance rewards later. These time dependent dynamics may require rethinking pricing decisions.

14/n
November 25, 2024 at 5:55 PM
A few caveats: Our analysis is theoretical and focuses only on pricing rather than the costs of building the models. If your model performance is limited, then the optimal price may be so low that you cannot net any profit from the product.

13/n
November 25, 2024 at 5:55 PM
Note that what matters is relative competitive ratios. Your model could be worse than competitors, but if it is a lot worse on translation & slightly worse on coding, then you can still get revenue by setting a low price & take the coding market.

12/n
November 25, 2024 at 5:55 PM
If competitive ratios are different, competitors cant outprice everything. If a model is very good at coding and okay for translation versus competitors, then you can set a price to incentivize competitors to focus on translation, since underpricing on coding cuts their revenue

11/n
November 25, 2024 at 5:55 PM
This brings us back to the issue of rising costs for gen AI development when optimizing for every benchmark.

However, there may be an alternative: specializing gen AI models to a few applications.

10/n
November 25, 2024 at 5:55 PM
The most dangerous regime is if the ratio of competitive ratios is close to 1, i.e., if all AI models perform similarly on all tasks. Here, competitors can always underprice their models to capture the entire market, leading to commoditization.

9/n
November 25, 2024 at 5:55 PM
Due to competition, Gen AI models must continue to improve or face being underpriced.

Given 2 tasks, we show pricing reduces to 3 regimes based on the ratio of competitive ratios versus the ratio of user demands:

1) high prices,
2) low prices,
3) 0 revenue!

8/n
November 25, 2024 at 5:55 PM
Competing companies go the other direction by pricing for the tasks where your model is least competitive. However, if your model is too pricey, then the competitors could be more cost-effective to users for all tasks.

7/n
November 25, 2024 at 5:55 PM
To set token prices, companies can rank tasks on the relative # of tokens needed for a satisfactory output between different models.

Given user demand, companies can maximize their revenue if they focus on only the most competitive tasks and ignore the ones where they are uncompetitive.

6/n
November 25, 2024 at 5:55 PM
We focus on per-token pricing. Here, when choosing between gen AI models w/ different performances, users should prefer to minimize their cost

= price/token × # of tokens needed for a satisfactory output.

Users can choose the best model for each task.

5/n
November 25, 2024 at 5:55 PM
3️⃣ Users rarely stop after a bad response—they can keep prompting it with more info until they get a good output. Some tasks need many prompts!

With these 3 factors, we see two popular pricing models today: subscription-based (for chatbot users) and per-token (for API users)

4/n
November 25, 2024 at 5:55 PM
2️⃣ Models are best evaluated by user satisfaction to responses (e.g., Chatbot Arena).

For instance on Github Copilot, the ‘acceptance rate’ of AI suggestions is the largest predictor of user productivity.

dl.acm.org/doi/pdf/10.1...

3/n
November 25, 2024 at 5:55 PM
Pricing gen AI is harder than other ML products for a few reasons.

1️⃣ A single model can handle many tasks, e.g., coding, translation, image gen, but users interact differently with each.

2/n
November 25, 2024 at 5:55 PM