Lightnews — Scholar-powered news

Jakob Foerster @jfoerst.bsky.social · May 2

Late to the party (since I just took some time to spend with our two little ones) but luckily good science is timeless ;)

Mattie Fellows @mattieml.bsky.social · Mar 19

PQN Blog 1/3: TD methods are the bread and butter of RL, yet can have convergence issues when used in practice. This has always annoyed me. Find out below why TD is so unstable and how we can understand this instability better using the TD Jacobian. @flair-ox.bsky.social @jfoerst.bsky.social

Fixing TD Pt I: Why is Temporal Difference Learning so Unstable?

blog.foersterlab.com

Jakob Foerster @jfoerst.bsky.social · Mar 20

PQN puts Q-learning back on the map and now comes with a blog post + Colab demo! Also, congrats to the team for the spotlight at #ICLR2025

Mattie Fellows @mattieml.bsky.social · Mar 20

PQN blog 3/3 👉 Take a look at Matteo's 5-minute blog covering PQN’s key features, plus a Colab demo with JAX & PyTorch implementations: mttga.github.io/posts/pqn/

🔎 For a deeper dive into the theory:
blog.foersterlab.com/fixing-td-pa...
blog.foersterlab.com/fixing-td-pa...

See you in Singapore! 🇸🇬

Simplifying Deep Temporal Difference Learning

A modern implementation of Deep Q-Network without target networks and replay buffers.

mttga.github.io

Jakob Foerster @jfoerst.bsky.social · Mar 12

My group @FLAIR_Ox is recruiting a postdoc and looking for someone who can get started by the end of April. Deadline to apply is in one week (!), 19th of March at noon, so please help spread the word: my.corehr.com/pls/uoxrecru...

Job Details

my.corehr.com

Reposted by Jakob Foerster

Christian Wolf @chriswolfvision.bsky.social · Feb 9

This is the first time I have seen a video by chess.com cited in an accepted ICLR paper, specifically on handshakes vs. fist bumps during a chess competition ...

By Oxford, @jfoerst.bsky.social

Paper: openreview.net/forum?id=wFg...

Video: www.youtube.com/watch?v=6fS7...
@danielrensch.chess.com

Reposted by Jakob Foerster

Amine El Ouassouli @aelouass.bsky.social · Feb 18

@jfoerst.bsky.social's take on how the community sees the ARC Challenge, and on how we evaluate models and use benchmarks nowadays, is 👌.

#more_science_less_hype (please).

PS: Amazing discussion and good brain food, as usual with MLST.

ImageNet Moment for Reinforcement Learning?

YouTube video by Machine Learning Street Talk

www.youtube.com

Reposted by Jakob Foerster

Pablo Samuel Castro @pcastr.bsky.social · Dec 11

Second #runconference @neuripsconf.bsky.social #NeurIPS2024!
@jfoerst.bsky.social @ferranalet.bsky.social @adamjelley.bsky.social @enjeeneer.io

Same deal for tomorrow: 7am at goo.gl/maps/8Z8eMrd...

Join us!

Jakob Foerster @jfoerst.bsky.social · Nov 29

Apply here and list me as the _first_ supervisor: ox.ac.uk/admissions/g...
More information at foersterlab.com. Thanks a lot and happy applying!

DPhil in Engineering Science | University of Oxford

About the course: The DPhil in Engineering Science will offer you the opportunity to develop in-depth knowledge, understanding and expertise in your chosen field of engineering research. To support

ox.ac.uk

Jakob Foerster @jfoerst.bsky.social · Nov 29

🚨 PSA 🚨 The deadline to apply for your dream PhD in ML @FLAIR_Ox is coming up on the 2nd of December AOE. We work on compute-only scaling of LLMs, (meta/multi-agent) RL at the Hyperscale, Human-AI coordination, opponent-shaping for vaccine design, GenAI for finance & much more.

Jakob Foerster @jfoerst.bsky.social · Nov 23

correct -- this runs on top of an open-source protocol and the UI is a Twitter clone. How hard can this be?

Jakob Foerster @jfoerst.bsky.social · Nov 23

wth did we not go to an open-source and not-for-profit alternative? en.wikipedia.org/wiki/Bluesky

Jakob Foerster @jfoerst.bsky.social · Nov 23

Sad times. Joking aside, have you tried pufferlib? I am really curious how it compares with the JAX RL line of work, and I haven't seen much direct comparison.

Jakob Foerster @jfoerst.bsky.social · Nov 23

Let's try @josephsuarez.bsky.social

Jakob Foerster @jfoerst.bsky.social · Nov 23

Candidates also need to apply for an Engineering DPhil by 2nd of Dec AOE (if they haven’t already), listing me as the supervisor: www.ox.ac.uk/admissions/g... The student should have an outstanding track record of academic excellence and relevant research experience.

Jakob Foerster @jfoerst.bsky.social · Nov 23

Apply by emailing a CV, personal statement, and research proposal to “[email protected]” by 2nd of Dec AOE. Joint interviews will be held in January. Shortlisted candidates will also be invited to apply to FAIR.

Jakob Foerster @jfoerst.bsky.social · Nov 23

The goal is to improve the generalisation abilities and data efficiency of GenAI, e.g. using RL and curriculum learning to train LLMs at the frontier of learnability.
For more details about our work, check out foersterlab.com and joao.science

Home

FLAIR is a research group in the Department of Engineering Science at the University of Oxford, specialising in Reinforcement Learning.

foersterlab.com

Jakob Foerster @jfoerst.bsky.social · Nov 23

Hello BlueSky! Joao Henriques (joao.science) and I are hiring a fully funded PhD student (UK/international) for the FAIR-Oxford program. The student will spend 50% of their time @UniofOxford and 50% @MetaAI (FAIR) in London, while completing a DPhil (Oxford PhD). Deadline: 2nd of Dec AOE!!

João F. Henriques

Research of Joao F. Henriques

joao.science

Jakob Foerster @jfoerst.bsky.social · Nov 15

I got summoned 🫡

Jakob Foerster @jfoerst.bsky.social · Nov 14

Hello world!
