Sadiq Jaffer
@sadiq.toao.com
220 followers 45 following 7 posts
Researcher @ Cambridge CL, OCaml hacker, fmr CEO at Opsian
Posts Media Videos Starter Packs
sadiq.toao.com
A good point. Was being generated but not linked anywhere. Fixed now. Thanks!
Reposted by Sadiq Jaffer
anil.recoil.org
Some fun OCaml GC projects here with @sadiq.toao.com and @kcsrk.info if any students are looking for projects involving programming languages toao.com/blog/ocaml-0...
Last three months in OCaml (July 2025) - Sadiq Jaffer
toao.com
Reposted by Sadiq Jaffer
anil.recoil.org
The most incredibly fun part of this Nature comment on evidence synthesis we published today is that the cartoonist (David Parkins) also did Beano and Dennis the Menace (!) A true legend. www.nature.com/articles/d41...
Reposted by Sadiq Jaffer
cst.cam.ac.uk
The rapid rise in AI-generated fraudulent academic papers is "poisoning" scientific literature, say Cambridge researchers in Nature magazine today. But though AI is the problem, it could also help in ensuring the integrity of scientific discovery... buff.ly/AuSNcGd
@anil.recoil.org @sadiq.toao.com
Reposted by Sadiq Jaffer
yminsky.bsky.social
I'm pleased to announce OxCaml!

OxCaml is Jane Street's branch of OCaml. We've given it a new name and a snazzy logo, and done a bunch of work to make it easy for people to try.
sadiq.toao.com
One thing I probably should highlight more in the post is that the proprietary models (like Claude and Gemini) that most students currently have access to can already ace the assignments.
sadiq.toao.com
This is a thorny question and mostly comes down to what we're trying to teach. I wonder if a progressive approach where at early stages of teaching there is no automatic tooling but as critical skills are learnt more can be automated. It's a bit of a moving target at the moment though.
sadiq.toao.com
Just how good are locally hostable code models on Cambridge first year OCaml assignments? @anil.recoil.org , @jon.recoil.org and I wanted to find out, so ran some tests. TL;DR Qwen3 means we might need new assignments. toao.com/blog/ocaml-l...
Qwen3 Leads the Pack: Evaluating how Local LLMs tackle First Year CS OCaml exercises - Sadiq Jaffer
toao.com
sadiq.toao.com
If you are using llama.cpp, here's a workaround using grammars for getting JSON structured output from Deepseek R1 and distills: toao.com/blog/json-ou...
JSON output from Deepseek R1 and distills with llama.cpp - Sadiq Jaffer
toao.com
Reposted by Sadiq Jaffer
lawrennd.bsky.social
Working to surface challenges faced by folks at the coal face.

Data in research contributions from @orbenamy.bsky.social @sadiq.toao.com @scotthosking.bsky.social Stefan Scholtes, Vasco Carvalho, Mireia Crispin and a foreward with Jess Montgomery @dianecoyle1859.bsky.social @ginasue.bsky.social
ai.cam.ac.uk
ai@cam @ai.cam.ac.uk · Nov 29
🚨 New report now live! 🚨

In partnership with @mctd.bsky.social & @bennettinstitute.bsky.social, our new report presents six case studies which show innovative uses of data for research in areas that are critically important to #science and #society.

⬇️ Read more
ai.cam.ac.uk/assets/uploa...
Reposted by Sadiq Jaffer
anil.recoil.org
New preprint from our work on using LLMs to accelerate conservation evidence synthesis across millions of papers. We crosscheck 3 retrieval strategies against 10 LLMs and benchmark against human experts and find quite a bit of variance https://www.researchsquare.com/article/rs-5409185/v1