Maintain and develop: MTEB, ScandEval, tomsup, DaCy, etc.
#NLPProc
The research includes efficient post-training, alignment, evaluation, and preference optimization, but we are very open to reinterpretation. So if you think you might be a partial fit, do apply!
international.au.dk/about/profil...
This work implements >500 evaluation tasks across >1000 languages and covers a wide range of use cases and domains 🩺👩‍💻⚖️
If you have an opinion on how it should define zero-shot, let us know:
github.com/embeddings-b...
Asking as we would like to track and detect dataset contamination in MTEB:
github.com/embeddings-b...
Listen here 👇
YouTube: youtu.be/IpEla8mZHnU?...
Spotify: open.spotify.com/episode/41WT...
Is there anything I should see while I am there?