Zara Siddique
@zarasiddique.bsky.social
140 followers
660 following
32 posts
Working on ethics and bias in NLP @CardiffNLP #NLP #NLProc
Posts
Media
Videos
Starter Packs
Zara Siddique
@zarasiddique.bsky.social
· May 26
Zara Siddique
@zarasiddique.bsky.social
· May 14
Zara Siddique
@zarasiddique.bsky.social
· Mar 25
Reposted by Zara Siddique
Dustin Wright
@dustinbwright.com
· Mar 25
Zara Siddique
@zarasiddique.bsky.social
· Mar 25
Zara Siddique
@zarasiddique.bsky.social
· Mar 13
Zara Siddique
@zarasiddique.bsky.social
· Mar 13
Zara Siddique
@zarasiddique.bsky.social
· Mar 13
Shifting Perspectives: Steering Vector Ensembles for Robust Bias Mitigation in LLMs
We present a novel approach to bias mitigation in large language models (LLMs) by applying steering vectors to modify model activations in forward passes. We employ Bayesian optimization to systematic...
arxiv.org
Reposted by Zara Siddique
Cardiff NLP
@cardiffnlp.bsky.social
· Mar 5
Reposted by Zara Siddique