Abhishek Sharma
@abhishekshar.bsky.social
43 followers
200 following
7 posts
CS PhD @Harvard w/ Finale Doshi-Velez | Research in {Reinforcement Learning | Healthcare | Representation Learning}
🌐 https://abhishekshar.com/
Abhishek Sharma
@abhishekshar.bsky.social
· Jan 23
Decision-Point Guided Safe Policy Improvement
Within batch reinforcement learning, safe policy improvement (SPI) seeks to ensure that the learnt policy performs at least as well as the behavior policy that generated the dataset. The core challeng...
arxiv.org