Shang Qu
@lindsayttsq.bsky.social
AI4Biomed & LLMs @ Tsinghua University
📝We've released the MedXpertQA dataset!
huggingface.co/datasets/Tsi...
📚Check out more details:
Preprint: arxiv.org/pdf/2501.18362
Github: github.com/TsinghuaC3I/...
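A minimal sketch of loading the benchmark with the Hugging Face `datasets` library. The dataset ID, subset name, and split below are assumptions based on the links above; check the dataset card for the exact configuration.

```python
from datasets import load_dataset

# Assumed dataset ID and config name ("Text" subset); verify on the dataset card.
ds = load_dataset("TsinghuaC3I/MedXpertQA", "Text", split="test")

# Inspect one example (question, answer options, label, etc.)
print(ds[0])
```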
February 9, 2025 at 2:19 AM
📈How far are leading models from mastering realistic medical tasks? MedXpertQA, our new text & multimodal medical benchmark, reveals gaps in model abilities.
📌Percentage scores on our Text subset:
o3-mini: 37.30
R1: 37.76 - frontrunner among open-source models
o1: 44.67 - still room for improvement!
February 4, 2025 at 1:29 PM