Towards Critical Artificial Intelligence Literacies doi.org/10.5281/zeno...
1/
"[AI agents] can... infer a researcher's latent hypotheses and produce data that artificially confirms them."
...
"We can no longer trust that survey responses are coming from real people"
Yes, yes, yes, and yes.
It’s what I do all day long.
Wherein I analyse HCAI & demonstrate through 3 triplets my new tripartite definition of AI (Table 1) that properly centres the human. 1/n
(The METR measure is just one of many benchmarks and, like all benchmarks, has flaws, but it also has the advantage of having neither a ceiling nor a floor effect)
metr.org/blog/2025-03...
Does threatening an AI really make it perform better (the way Google founder Brin claimed)? How about offering to tip the AI? We find that neither threats nor tips improve average performance (though results vary at the question level).
www.youtube.com/watch?v=sVhP...
huggingface.co/spaces/mteb/...
www.nature.com/articles/s41...
GPT-4 (now obsolete) went from 30% accuracy to 87% accuracy in clinical oncology decisions when given access to tools www.nature.com/articles/s43...
erictopol.substack.com/p/predicting...
It reclaims women's agency as a central factor in the diffusion and adaptation of key technologies during that era.
#AIEthics #cybersecurity
www.technologyreview.com/2025/04/04/1...
Some fun results: comparisons of the same frame when expressed in images vs texts. When the "crime" frame is expressed in the article text, there are more political words in the text, but when the frame is expressed in the article image, more police words.
www.nature.com/articles/s41...
We analyse cog neuro theories showing how vicious regress, e.g. the homunculus fallacy, is (sadly) alive and well — and importantly how to avoid it. 1/
“Our findings reveal widespread adoption of large language models across diverse writing domains, ranging consumers, firms and international organizations.”
“Early adopters may have already reached a saturation point”
arxiv.org/abs/2502.09747
By September 2024, 18% of financial consumer complaints, 24% of press releases, 15% of job postings & 14% of UN press releases showed signs of LLM writing. And the method undercounts true use.
Government workers are 12% more productive when randomly assigned to work from home. They're more efficient where it's quiet.
Most people aren't shirking from home. They're escaping distractions and long commutes.
rupress.org/jem/article/...