Akash Sharma
akashsharma02.bsky.social
Akash Sharma
@akashsharma02.bsky.social
550 followers 200 following 31 posts
Ph.D. student at CMU Robotics Institute | Visiting Researcher at FAIR Meta Opinions expressed are my own 📍Pittsburgh, USA 🔗 akashsharma02.github.io
Posts Media Videos Starter Packs
Pinned
Robots need touch for human-like hands to reach the goal of general manipulation. However, approaches today don’t use tactile sensing or use specific architectures per tactile task.

Can 1 model improve many tactile tasks?
🌟Introducing Sparsh-skin: tinyurl.com/y935wz5c

1/6
And we also show improvement in many tactile perception tasks such as force estimation, pose estimation and full-hand joystick state estimation.
5/6
With this we see a 75% improvement in real-world tactile plug insertion over end-to-end using vision and tactile:
4/6
We pretrain Sparsh-skin with 4 hours of unlabeled data via self-distillation, and make several changes to get highly performant reps:

Decorrelate signals by tokenizing 1s window of tactile data.
Condition the encoder on robot hand configurations via sensor positions as input
3/6
Sparsh-skin is an approach to pretrain encoders for magnetic skin sensors on a dexterous robot hand.

It improves tactile tasks by over 56% in end-to-end methods and by over 41% in prior work.
It is trained via self-supervision for the Xela sensor, so no labeled data needed!
2/6
Robots need touch for human-like hands to reach the goal of general manipulation. However, approaches today don’t use tactile sensing or use specific architectures per tactile task.

Can 1 model improve many tactile tasks?
🌟Introducing Sparsh-skin: tinyurl.com/y935wz5c

1/6
I might sound salty, but I never got how 'outstanding reviewers' are chosen. Til then, part of the 'mediocre reviewers' gang it is 🤣
Check out my work to know more:

1. Sparsh: tactile reps for vision based sensors sparsh-ssl.github.io

2. [Releasing soon] Sparsh-skin: Tactile reps for full hand magnetic skins

3. [Coming soon] Reps for multimodal touch fusing tactile-images, audio, motion and pressure
Sparsh | Self-supervised touch representations for vision-based tactile sensing
Sparsh: Self-supervised touch representations for vision-based tactile sensing
sparsh-ssl.github.io
Last week I passed my thesis proposal, and I'm now officially a Ph.D. candidate!

I'm grateful to my committee, and everyone who supported me.

My proposed thesis "Self supervised perception for tactile dexterity" will explore ways to improve dexterous manipulation using tactile reps.
We took a matter of fact approach for a robotics conference, and it backfired too.
Reposted by Akash Sharma
I asked "on the other platform" what were the most important improvements to the original 2017 transformer.

That was quite popular and here is a synthesis of the responses:
Reposted by Akash Sharma
⏰ Heads up! The deadline for two #CVPR2025 Autonomous Grand Challenge tracks is May 10th, 2025:

1️⃣ NAVSIM v2 Challenge: huggingface.co/spaces/AGC20...

2️⃣ World Model Challenge by 1X: huggingface.co/spaces/1x-te...
Reposted by Akash Sharma
I love situations like this: in the pre-deep era (and following classical learning theory), people would have stopped training the white model at the red arrow, as the validation error increases. But, no, the model first seems to learns unwanted short cuts (overfitting wildly) but finds a way out.
Some pictures of the Pittsburgh spring to reduce the spiciness of the bsky feed!

a6700 w/ 17-70mm Tamron lens
Reposted by Akash Sharma
A new #CosmicDistanceLadder post on why lunar and solar eclipses tend to come in pairs (for instance, the solar eclipse next week is paired with the lunar eclipse from last week). www.instagram.com/p/DHkS3EcA40L
Reposted by Akash Sharma
What would you love to know about #robot learning and decision making?

Later this season, I'll be chatting to Prof. Lerrel Pinto (@lerrelpinto.com) from NYU about using machine learning to train robots to adapt to new environments.

Send me your questions for Lerrel: robottalk.org/ask-a-question/
Reposted by Akash Sharma
We are looking for a student researcher to work on video understanding plus 3D, in Google DeepMind London. DM/Email me or pass it to someone if you feel it may be a good fit!
Reposted by Akash Sharma
The first measles death in the US in a decade -- the tragic, preventable death of a child whose parents chose not to protect them with vaccination -- should spark an immediate nation-wide campaign to ensure all children are protected against preventable diseases. Anything less is unconscionable.
Congrats Eric, really cool stuff!
Reposted by Akash Sharma
Gearing up for our workshop on 4D Vision at @CVPR this June! Check out our line up of speakers and submit your work by Mar 28. Spread the word!
Really excited to put together this #CVPR2025 workshop on "4D Vision: Modeling the Dynamic World" -- one of the most fascinating areas in computer vision today!

We've invited incredible researchers who are leading fantastic work at various related fields.

4dvisionworkshop.github.io
At first I thought poisson or log-normal, but after a bit of searching, maybe Gumbel distribution: en.m.wikipedia.org/wiki/Gumbel_...
Gumbel distribution - Wikipedia
en.m.wikipedia.org
Reposted by Akash Sharma
Last night I found out that the NSF math postdoctoral fellowship I applied for is being deleted because it does not comply with Trump’s executive orders on DEI in the federal government. I’m going to answer some FAQs and share some thoughts about this ordeal in this thread 1/n
Now, see how life changes when you swap control and caps lock! 😆
Seeing some of the early results from DexterityGen were definitely a wow moment for me!

It doesn't take a lot to realize all the new opportunities a strong teleop system like this enables! 🚀

X thread: x.com/zhaohengyin/...
Link: zhaohengyin.github.io/dexteritygen/
DexGen
zhaohengyin.github.io
Reposted by Akash Sharma
Not one VC would ever fund a startup to do the kind of hardcore optimization work that DeepSeek did.

Every VC firm should be asking themselves why.