...And also this cheeky riddle that I wrote on page 23 (original content).
internationalaisafetyreport.org/publication/...
...And also this cheeky riddle that I wrote on page 23 (original content).
internationalaisafetyreport.org/publication/...
Max Kamachee and I just updated our "Video Deepfake Abuse" paper with this new fig:
🔗 papers.ssrn.com/sol3/papers....
Max Kamachee and I just updated our "Video Deepfake Abuse" paper with this new fig:
🔗 papers.ssrn.com/sol3/papers....
papers.ssrn.com/sol3/papers....
papers.ssrn.com/sol3/papers....
@nickacaputo
.
We hear a lot about what important concepts and methods from AI research that lawyers need to understand. But it's really a two-way street...
🧵🧵🧵
@nickacaputo
.
We hear a lot about what important concepts and methods from AI research that lawyers need to understand. But it's really a two-way street...
🧵🧵🧵
t.co/3qWCNzoZrh
t.co/3qWCNzoZrh
Here's what I learned from our investigation of over 50 platforms, sites, apps, Discords, etc., while writing this paper.
papers.ssrn.com/sol3/papers...
Here's what I learned from our investigation of over 50 platforms, sites, apps, Discords, etc., while writing this paper.
papers.ssrn.com/sol3/papers...
In most (non-adversarial) cases, I expect the opposite will often apply...
In most (non-adversarial) cases, I expect the opposite will often apply...
papers.ssrn.com/sol3/papers....
papers.ssrn.com/sol3/papers....
www.aisi.gov.uk/careers
www.aisi.gov.uk/careers
This new paper studies how a small number of models power the non-consensual AI video deepfake ecosystem and why their developers could have predicted and mitigated this.
This new paper studies how a small number of models power the non-consensual AI video deepfake ecosystem and why their developers could have predicted and mitigated this.
Shamelessly copied from a slack message.
Shamelessly copied from a slack message.
t.co/CVkAKNXZme
t.co/CVkAKNXZme
They tested filtration of species/genus data against adv. fine-tuning. It didn't work well. This suggests filtering may work better if applied to entire tasks/domains rather than specific instances.
arxiv.org/abs/2510.27629
They tested filtration of species/genus data against adv. fine-tuning. It didn't work well. This suggests filtering may work better if applied to entire tasks/domains rather than specific instances.
arxiv.org/abs/2510.27629
We showed that filtering biothreat-related pretraining data is SOTA for making models resist adversarial fine-tuning. We proposed an amendment to the hypothesis from papers 1 and 2 above.
deepignorance.ai
We showed that filtering biothreat-related pretraining data is SOTA for making models resist adversarial fine-tuning. We proposed an amendment to the hypothesis from papers 1 and 2 above.
deepignorance.ai
They reported an instance where filtering biothreat data didn't have a big impact. But without more info on how and how much they filtered, it's hard to draw strong conclusions.
arxiv.org/abs/2508.03153
They reported an instance where filtering biothreat data didn't have a big impact. But without more info on how and how much they filtered, it's hard to draw strong conclusions.
arxiv.org/abs/2508.03153