Bálint Gyevnár
@gbalint.bsky.social
110 followers
210 following
37 posts
Postdoc at CMU • Safeguarding scientific integrity in the age of AI scientists • PhD at University of Edinburgh • gbalint.me • 🇭🇺🏴
Posts
Media
Videos
Starter Packs
Pinned
Reposted by Bálint Gyevnár
Reposted by Bálint Gyevnár
Bálint Gyevnár
@gbalint.bsky.social
· Aug 16
Bálint Gyevnár
@gbalint.bsky.social
· Aug 16
Bálint Gyevnár
@gbalint.bsky.social
· Aug 16
Integrating Counterfactual Simulations with Language Models for Explaining Multi-Agent Behaviour
Autonomous multi-agent systems (MAS) are useful for automating complex tasks but raise trust concerns due to risks like miscoordination and goal misalignment. Explainability is vital for trust calibra...
arxiv.org
Bálint Gyevnár
@gbalint.bsky.social
· Jul 11
Measuring the Impact of Early-2025 AI on Experienced Open-Source Developer Productivity
We conduct a randomized controlled trial to understand how early-2025 AI tools affect the productivity of experienced open-source developers working on their own repositories. Surprisingly, we find th...
metr.org
Bálint Gyevnár
@gbalint.bsky.social
· Jul 10
Reposted by Bálint Gyevnár
Bálint Gyevnár
@gbalint.bsky.social
· Jun 9
Bálint Gyevnár
@gbalint.bsky.social
· Feb 7
Objective Metrics for Human-Subjects Evaluation in Explainable Reinforcement Learning
Explanation is a fundamentally human process. Understanding the goal and audience of the explanation is vital, yet existing work on explainable reinforcement learning (XRL) routinely does not consult ...
arxiv.org
Reposted by Bálint Gyevnár
Atoosa Kasirzadeh
@atoosakz.bsky.social
· May 23
Reposted by Bálint Gyevnár
Bálint Gyevnár
@gbalint.bsky.social
· Apr 30
Bálint Gyevnár
@gbalint.bsky.social
· Apr 24