Book: https://a.co/d/bC2kSj1
Substack: https://www.oneusefulthing.org/
Web: https://mgmt.wharton.upenn.edu/profile/emollick
Claude: Here's one based on David Weber's space operas
Me: Not that Weber
C: Here's a game based on sociologist Max Weber
Me: Not that one
C: The operas of Carl Maria von Weber?
Me: No
C: Here is one using Weber grills!
Big gains in ability to do practical work (like make a PowerPoint from an Excel) and the best results ever (& in one shot) in my Lem poetry test, plus good results in Claude Code
Success in Circuit lies
Too bright for our infirm Delight
The Truth's superb surprise
This paper finds poetry is a universal single shot jailbreak for LLMs. Systems built to stop prosaic attacks fail when the request is phrased in verse arxiv.org/abs/2511.15304
Not absolutely perfect, but I can’t believe how coherent the through-line is, how clear the text is, and that parts of it are actually funny?
Gemini: "Here is F.L.O.O.R. (First-person Lino Observation & Ornamental Review)."
Pretty good!
It isn’t clear how to interpret the sycophancy score, but the MASK score for deception is quite high compared to big models.
Sycophancy leads to higher LMArena scores…
"This approach allowed the threat actor to achieve operational scale typically associated with nation-state campaigns while maintaining minimal direct involvement" www.anthropic.com/news/disrupt...
When Cursor added agentic coding in 2024, adopters produced 39% more code merges, with no sign of a decrease in quality (revert rates were the same, bugs dropped) and no sign that the scope of the work shrank. papers.ssrn.com/sol3/papers....
Anyone who wants to use AI seriously for real work will need to assess it themselves. www.oneusefulthing.org/p/giving-you...
Our systems are very much not ready for the revelation that this is no longer true, as this planning objection AI shows
But giving them access to an LLM for guidance significantly closes the gap. mgcuna.github.io/website/JMP_...
Now this paper confirms that cover letters have lost their value as a predictor
It also featured the best version of “I spoke to a local farmer about a data center”
From Walter Benjamin (the painting in the reply)
When we are given answers, we think we learn, but we don’t. Learning is work. However, things like the “learning modes” from the AI providers help, as does using AI for tutoring rather than for answers
We live in a strange time (not the penguin pillar. That has always been there)
It is a time when CEO vision matters a lot, and you can see the contrast between Amazon and Walmart