Max Bartolo
@maxbartolo.bsky.social
300 followers 27 following 15 posts
Building robust LLMs @Cohere
Reposted by Max Bartolo
lisaalaz.bsky.social
Thrilled to share our new preprint on Reinforcement Learning for Reverse Engineering (RLRE) 🚀

We demonstrate that human preferences can be reverse engineered effectively by pipelining LLMs to optimise upstream preambles via reinforcement learning 🧵⬇️
maxbartolo.bsky.social
Massive shoutout to all our fantastic contributors, collaborators and partners who made this possible! 🙏
maxbartolo.bsky.social
Model weights are available for research purposes at:
🔗 Command A: huggingface.co/CohereForAI/...
🔗 Command R7B: huggingface.co/CohereForAI/...
maxbartolo.bsky.social
📄 You can find the full tech report at cohere.com/research/pap...
maxbartolo.bsky.social
I'm excited to share the tech report for our @cohere.com @cohereforai.bsky.social Command A and Command R7B models. We highlight our novel approach to model training including self-refinement algorithms and model merging techniques at scale. Read more below! ⬇️
maxbartolo.bsky.social
I really enjoyed my MLST chat with Tim @neuripsconf.bsky.social about the research we've been doing on reasoning, robustness and human feedback. If you have an hour to spare and are interested in AI robustness, it may be worth a listen 🎧

Check it out at youtu.be/DL7qwmWWk88?...
maxbartolo.bsky.social
That's very cool! There's definitely a lot happening in the space and most people are doing some version of this, but I haven't come across a well-organised collection of tools like this yet -- could be quite impactful!
maxbartolo.bsky.social
Check out @lisaalaz.bsky.social's internship work with us @cohere.com questioning the rationale behind rationales 🔥
maxbartolo.bsky.social
Super excited to see PRISM recognised as a #NeurIPS2024 best paper. This was an incredible large-scale effort by @hannahrosekirk.bsky.social and fantastic collaborators. If you're interested in human feedback, check it out, there are 100+ pages of detailed insights! 🔥
Reposted by Max Bartolo
adinawilliams.bsky.social
Our PRISM alignment paper won a best paper award at #NeurIPS2024!

All credits to @hannahrosekirk.bsky.social, A. Whitefield, P. Röttger, A. M. Bean, K. Margatina, R. Mosquera-Gomez, J. Ciro, @maxbartolo.bsky.social, H. He, B. Vidgen, S. Hale

Catch Hannah tomorrow at neurips.cc/virtual/2024/poster/97804
Reposted by Max Bartolo
handle.invalid
Excited to reveal Genie 2, our most capable foundation world model that, given a single prompt image, can generate an endless variety of action-controllable, playable 3D worlds. Fantastic cross-team effort by the Open-Endedness Team and many other teams at Google DeepMind! 🧞
jparkerholder.bsky.social
Introducing 🧞Genie 2 🧞 - our most capable large-scale foundation world model, which can generate a diverse array of consistent worlds, playable for up to a minute. We believe Genie 2 could unlock the next wave of capabilities for embodied agents 🧠.
maxbartolo.bsky.social
Looking forward to @neuripsconf.bsky.social #NeurIPS #NeurIPS2024 in Vancouver next week! ❄️

Reach out (or pop by the @cohere.com booth) if you want to chat about human feedback, robustness and reasoning, prompt optimisation, adversarial data, glitch tokens, evaluation, or anything else!
ALT: an advertisement for vancouver in british columbia canada
maxbartolo.bsky.social
Couldn't agree with you more, Laura is incredible!
maxbartolo.bsky.social
Sparks of multi-hop reasoning ✨
soheeyang.bsky.social
🚨 New Paper 🚨
Can LLMs perform latent multi-hop reasoning without exploiting shortcuts? We find the answer is yes – they can recall and compose facts never seen together in training, without merely guessing the answer, but success depends heavily on the type of bridge entity (80% for country, 6% for year)! 1/N
maxbartolo.bsky.social
Fun to see Douwe's Dynabench plot continue to inspire new groundbreaking benchmarking work!
handle.invalid
Excited to announce "BALROG: a Benchmark for Agentic LLM and VLM Reasoning On Games" led by UCL DARK's @dpaglieri.bsky.social! Douwe Kiela's plot below is maybe the scariest for AI progress — LLM benchmarks are saturating at an accelerating rate. BALROG to the rescue. This will keep us busy for years.
maxbartolo.bsky.social
@mariaa.bsky.social I'm new here so apologies if this is a noob question, but is there a way I can recommend folks to be added to starter packs?
maxbartolo.bsky.social
🚨 LLMs can learn to reason from procedural knowledge in pretraining data! 🚨 I particularly enjoy research where the evidence contradicts our initial hypothesis. If you're interested in LLM reasoning, check out the 60+ pages of in-depth work at arxiv.org/abs/2411.12580
lauraruis.bsky.social
How do LLMs learn to reason from data? Are they ~retrieving the answers from parametric knowledge🦜? In our new preprint, we look at the pretraining data and find evidence against this:

Procedural knowledge in pretraining drives LLM reasoning ⚙️🔢

🧵⬇️
Reposted by Max Bartolo
atla-ai.bsky.social
We launched Judge Arena with @huggingface.bsky.social
@clefourrier.bsky.social - a platform that lets you easily compare models as judges side-by-side and vote for the best evaluation

Check out the live leaderboard and start voting now 🤗