4.5 is huge but the "big model smell" is elusive. The RLHF holds it back, making it boring and predictable. Sonnet is less creative with words but more steerable. I'd befriend Sonnet and avoid GPT IRL. Unless OAI overhauls their RLHF, even a 100T model wouldn't change this.
4.5 is huge but the "big model smell" is elusive. The RLHF holds it back, making it boring and predictable. Sonnet is less creative with words but more steerable. I'd befriend Sonnet and avoid GPT IRL. Unless OAI overhauls their RLHF, even a 100T model wouldn't change this.