Peng Qi
@qi2peng2.bsky.social
290 followers
43 following
79 posts
Multimodal Agents Research @ Orby AI. Ex-AWS AI, JD AI. PhD from @stanfordnlp.bsky.social, UG Tsinghua U. He/him. Opinions my own.
Posts
Media
Videos
Starter Packs
Peng Qi
@qi2peng2.bsky.social
· Jul 25
Peng Qi
@qi2peng2.bsky.social
· Jul 16
Peng Qi
@qi2peng2.bsky.social
· Jul 16
Peng Qi
@qi2peng2.bsky.social
· Jul 16
Peng Qi
@qi2peng2.bsky.social
· Jul 16
Peng Qi
@qi2peng2.bsky.social
· Jul 16
Peng Qi
@qi2peng2.bsky.social
· Jul 16
Peng Qi
@qi2peng2.bsky.social
· Jul 2
Why You Should Stop Using HotpotQA for AI Agents Evaluation in 2025 | Peng Qi
We published HotpotQA, a groundbreaking multi-step question answering dataset in 2018, which has since motivated and facilitated numerous AI agent research works. But you should probably reconsider…
qipeng.me
Peng Qi
@qi2peng2.bsky.social
· Jul 2
Why You Should Stop Using HotpotQA for AI Agents Evaluation in 2025 | Peng Qi
We published HotpotQA, a groundbreaking multi-step question answering dataset in 2018, which has since motivated and facilitated numerous AI agent research works. But you should probably reconsider…
qipeng.me
Peng Qi
@qi2peng2.bsky.social
· Jul 2
Why You Should Stop Using HotpotQA for AI Agents Evaluation in 2025 | Peng Qi
We published HotpotQA, a groundbreaking multi-step question answering dataset in 2018, which has since motivated and facilitated numerous AI agent research works. But you should probably reconsider…
qipeng.me
Peng Qi
@qi2peng2.bsky.social
· May 22
Peng Qi
@qi2peng2.bsky.social
· May 22
Peng Qi
@qi2peng2.bsky.social
· May 22
Peng Qi
@qi2peng2.bsky.social
· May 22
Peng Qi
@qi2peng2.bsky.social
· May 6
Peng Qi
@qi2peng2.bsky.social
· May 6
Peng Qi
@qi2peng2.bsky.social
· May 6
Peng Qi
@qi2peng2.bsky.social
· May 6
Peng Qi
@qi2peng2.bsky.social
· May 6
Peng Qi
@qi2peng2.bsky.social
· May 6
Peng Qi
@qi2peng2.bsky.social
· May 6
Peng Qi
@qi2peng2.bsky.social
· Mar 28
Reposted by Peng Qi