https://rafidrm.github.io
arxiv.org/abs/2411.02661
We will also be presenting this work at Neurips in a few weeks. Feel free to swing by our poster on Friday Dec 13!
16/16
arxiv.org/abs/2411.02661
We will also be presenting this work at Neurips in a few weeks. Feel free to swing by our poster on Friday Dec 13!
16/16
15/n
15/n
14/n
14/n
13/n
13/n
12/n
12/n
11/n
11/n
However, there may be an alternative: specializing gen AI models to a few applications.
10/n
However, there may be an alternative: specializing gen AI models to a few applications.
10/n
9/n
9/n
Given 2 tasks, we show pricing reduces to 3 regimes based on the ratio of competitive ratios versus the ratio of user demands:
1) high prices,
2) low prices,
3) 0 revenue!
8/n
Given 2 tasks, we show pricing reduces to 3 regimes based on the ratio of competitive ratios versus the ratio of user demands:
1) high prices,
2) low prices,
3) 0 revenue!
8/n
7/n
7/n
Given user demand, companies can maximize their revenue if they focus on only the most competitive tasks and ignore the ones where they are uncompetitive.
6/n
Given user demand, companies can maximize their revenue if they focus on only the most competitive tasks and ignore the ones where they are uncompetitive.
6/n
= price/token × # of tokens needed for a satisfactory output.
Users can choose the best model for each task.
5/n
= price/token × # of tokens needed for a satisfactory output.
Users can choose the best model for each task.
5/n
With these 3 factors, we see two popular pricing models today: subscription-based (for chatbot users) and per-token (for API users)
4/n
With these 3 factors, we see two popular pricing models today: subscription-based (for chatbot users) and per-token (for API users)
4/n
For instance on Github Copilot, the ‘acceptance rate’ of AI suggestions is the largest predictor of user productivity.
dl.acm.org/doi/pdf/10.1...
3/n
For instance on Github Copilot, the ‘acceptance rate’ of AI suggestions is the largest predictor of user productivity.
dl.acm.org/doi/pdf/10.1...
3/n
1️⃣ A single model can handle many tasks, e.g., coding, translation, image gen, but users interact differently with each.
2/n
1️⃣ A single model can handle many tasks, e.g., coding, translation, image gen, but users interact differently with each.
2/n