Zero-shot Benchmarking: Flexible and Scalable Automatic Evaluation of LLMs

0
View Event
Zero-shot Benchmarking: Flexible and Scalable Automatic Evaluation of LLMs — Lisbon Events | bushdrum