https://www.seidellmarton.us/gremio
https://www.linkedin.com/in/gregory-marton/
Ever asked an LLM-as-Marilyn Monroe who the US president was in 2000? 🤔 Should the LLM answer at all? We call these clashes Concept Incongruence. Read on! ⬇️
1/n 🧵
Ever asked an LLM-as-Marilyn Monroe who the US president was in 2000? 🤔 Should the LLM answer at all? We call these clashes Concept Incongruence. Read on! ⬇️
1/n 🧵
They find that,
While LLMs achieve broad categorical alignment with human judgment, they falter in capturing fine-grained semantic nuances such as typicality and, critically, exhibit vastly different representational efficiency profiles.
They find that,
While LLMs achieve broad categorical alignment with human judgment, they falter in capturing fine-grained semantic nuances such as typicality and, critically, exhibit vastly different representational efficiency profiles.
www.nytimes.com/interactive/...
www.nytimes.com/interactive/...
Gemini is the first public release of a full multimodal LLM that can directly make images. This allows the systems to do detail work
Gemini is the first public release of a full multimodal LLM that can directly make images. This allows the systems to do detail work
An interesting histogram of numeracy scores for U.S. vs. some other countries,
using Alberto Cairo's [weepeople font](github.com/propublica/w...) to show the people involved in these distributions.
Src: bit.ly/3FrDq0v
👇
An interesting histogram of numeracy scores for U.S. vs. some other countries,
using Alberto Cairo's [weepeople font](github.com/propublica/w...) to show the people involved in these distributions.
Src: bit.ly/3FrDq0v
👇
www.abqjournal.com/opinion/arti...
@sciencehomecoming.bsky.social
@cantlonlab.bsky.social
@spiantado.bsky.social
www.abqjournal.com/opinion/arti...
@sciencehomecoming.bsky.social
@cantlonlab.bsky.social
@spiantado.bsky.social
We have universities for literally the same reason that we have roads and armies.
There is no society in the history of humanity that has successfully built a good university system without massive govt subsidies. US had already pushed the idea very far.
We have universities for literally the same reason that we have roads and armies.
We propose a method to automatically find position-aware circuits, improving faithfulness while keeping circuits compact. 🧵👇
We propose a method to automatically find position-aware circuits, improving faithfulness while keeping circuits compact. 🧵👇
When LLMs learn from previous incorrect answers, they typically observe corrective feedback in the form of rationales explaining each mistake. In our new preprint, we find these rationales do not help, in fact they hurt performance!
🧵
Love the homage to Richard Muller, too!
Love the homage to Richard Muller, too!
2.0 Flash & Flash-Lite set new standards in the quality/cost Pareto frontier.
More details:
blog.google/technology/g...
2.0 Flash & Flash-Lite set new standards in the quality/cost Pareto frontier.
More details:
blog.google/technology/g...
as posted by @aymeric-roucher.bsky.social
An entirely open agent that can: navigate the web autonomously, scroll and search through pages, download and manipulate files, run calculation on data...
as posted by @aymeric-roucher.bsky.social
An entirely open agent that can: navigate the web autonomously, scroll and search through pages, download and manipulate files, run calculation on data...
Exa & Deepseek Chat App is a free and open-source chat app that uses Exa's API for web search and Deepseek R1 LLM for reasoning.
github.com/exa-labs/exa...
Exa & Deepseek Chat App is a free and open-source chat app that uses Exa's API for web search and Deepseek R1 LLM for reasoning.
github.com/exa-labs/exa...
This is an attempt to consolidate the dizzying rate of AI developments since Christmas. If you're into AI but not deep enough, this should get you oriented again.
timkellogg.me/blog/2025/01...
This is an attempt to consolidate the dizzying rate of AI developments since Christmas. If you're into AI but not deep enough, this should get you oriented again.
timkellogg.me/blog/2025/01...