Arif Perdana
@arifperdana.net
1.5K followers
4.5K following
550 posts
Author | Educator | Speaker | Digital Strategy | Data Science and Analytics | Interested in Philosophy, Photography, Music, Movie, and Tech | An Experienced Academic in Multiple Countries | No Scammers | Posts and Comments are on my own | arifperdana.net
Posts
Media
Videos
Starter Packs
Arif Perdana
@arifperdana.net
· Aug 17
Arif Perdana
@arifperdana.net
· Aug 17
Arif Perdana
@arifperdana.net
· Aug 17
Arif Perdana
@arifperdana.net
· Jun 9
Arif Perdana
@arifperdana.net
· Jun 9
Arif Perdana
@arifperdana.net
· Jun 9
Arif Perdana
@arifperdana.net
· May 7
All Roads Lead to Likelihood: The Value of Reinforcement Learning in Fine-Tuning
From a first-principles perspective, it may seem odd that the strongest results in foundation model fine-tuning (FT) are achieved via a relatively complex, two-stage training procedure. Specifically, ...
arxiv.org
Arif Perdana
@arifperdana.net
· May 7
Arif Perdana
@arifperdana.net
· May 7
Arif Perdana
@arifperdana.net
· May 4
Arif Perdana
@arifperdana.net
· May 4
Arif Perdana
@arifperdana.net
· May 4
Arif Perdana
@arifperdana.net
· May 4
Arif Perdana
@arifperdana.net
· May 4
Arif Perdana
@arifperdana.net
· May 4
Arif Perdana
@arifperdana.net
· May 4
Arif Perdana
@arifperdana.net
· Apr 7
Arif Perdana
@arifperdana.net
· Apr 7