Sharad Duwal
sharad461.bsky.social
Sharad Duwal
@sharad461.bsky.social
This would be novella territory because any longer and it gets oppressive, but Coetzee (Life and Times of Michael K, Youth and Disgrace) and Kafka (The Castle) might be close.
October 24, 2025 at 12:11 PM
Discord is also down and with it all the research servers. If Bsky also goes down, it's been nice to know everyone.
September 5, 2025 at 3:21 PM
Paper: arxiv.org/abs/2412.13860

This work has been accepted in Challenges in Processing South Asian Languages (CHiPSAL) at COLING'25.

Work done in collaboration with Suraj Prasai and Dr. Suresh Manandhar.

Models:
- huggingface.co/sharad461/Op... Base
- huggingface.co/sharad461/Op... Mixed SFT
Domain-adaptative Continual Learning for Low-resource Tasks: Evaluation on Nepali
Continual learning has emerged as an important research direction due to the infeasibility of retraining large language models (LLMs) from scratch in the event of new data availability. Of great inter...
arxiv.org
December 27, 2024 at 10:55 AM
This work has been accepted in Challenges in Processing South Asian Languages (CHiPSAL) at COLING'25.

Work done in collaboration with Suraj Prasai and Dr. Suresh Manandhar.

Check out a short blog and the paper:
Blog: medium.com/@sharad.duwa...
Paper: arxiv.org/abs/2412.13860
Making Llama speak Nepali using Continual Learning
This essay discusses our work on making Meta’s Llama 3 8B understand and generate Nepali. Read the paper here.
medium.com
December 27, 2024 at 10:51 AM