Lightnews — Scholar-powered news

Reposted by Shikhar Murty

Hao Zhu 朱昊

@zhuhao.me

Ever dreamed of AI agents learning through unsupervised interaction with the open world? Our latest preprint introduces NNetNav-Live, which collects training data through exploration on real websites and hindsight labeling, producing a SOTA OSS agent.

February 6, 2025 at 7:22 PM

Shikhar Murty

@shikharmurty.bsky.social

Want to make a browser agent for *any* domain like banking or healthcare?
We propose methods for training LLMs with open-ended, unsupervised interaction on live websites:
✅ OSS SoTA on WebVoyager
✅ world's smallest high-performing web-agent
Try it here: nnetnav.dev

February 6, 2025 at 5:43 PM

Shikhar Murty

@shikharmurty.bsky.social

going to stay off twitter for my own mental health. something has gone horribly wrong with that platform.

December 28, 2024 at 10:07 PM

Shikhar Murty

@shikharmurty.bsky.social

Couldn't make it to NeurIPS due to work, but do check out our workshop happening in West Ballroom B. Lots of cool things to come, including a very fun panel!

Nouha Dziri @nouhadziri.bsky.social · Dec 15

Super excited today for the System 2 Reasoning at Scale workshop, come join us to discover how to equip AI systems with reasoning that's optimized for renewable energy and not fossil fuel 🔥🚀

⏰When? today, 9am-5:30pm
📍West Ballroom B

s2r-at-scale-workshop.github.io
#NeurIPS2024

December 15, 2024 at 8:29 PM

Reposted by Shikhar Murty

Robert Csordas

@robertcsordas.bsky.social

Come visit our poster "MoEUT: Mixture-of-Experts Universal Transformers" on Friday at 4:30 in East Exhibit Hall A-C #1907 at #NeurIPS2024. With Kazuki Irie, Jürgen Schmidhuber, Christopher Potts and @chrmanning.bsky.social.

December 12, 2024 at 10:46 PM

Reposted by Shikhar Murty

Stanford NLP Group

@stanfordnlp.bsky.social

The extraordinary recent takeover of ML/AI by #NLP is well-known but insufficiently reflected on.

Look at the @neuripsconf.bsky.social tutorials in 2024!

neurips.cc/virtual/2024...

14 tutorials; 6 have "LLM" in the title; 4 more cover foundation models, with large NLP coverage. That's > 70% 😲

December 9, 2024 at 7:29 PM

Reposted by Shikhar Murty

Paul Soulos

@paulsoulos.bsky.social

🚨 Thrilled to share that Compositional Generalization Across Distributional Shifts with Sparse Tree Operations received a spotlight award at #NeurIPS2024! 🌟 I'll present a poster on Tuesday and give an invited lightning talk at the System 2 Reasoning Workshop on Sunday. 🧵👇

December 9, 2024 at 3:06 PM

Reposted by Shikhar Murty

Alexandre Lacoste

@alex-lacoste.bsky.social

🧵-1
We are thrilled to release #AgentLab, a new open-source package for developing and evaluating web agents. This builds on the new #BrowserGym package which supports 10 different benchmarks, including #WebArena.

AgentLab diagram.

The image describes AgentLab, a framework for efficient parallel experiments with agents. It highlights:

Core Agent Features: Dynamic Prompting and a Unified LLM API for interacting with large language models.
BrowserGym Platform: A tool for testing agents on benchmarks like WebArena, WorkArena, MiniWoB, and others.
Key Features: Reproducibility, a Unified Leaderboard, an analysis tool called Xray, and a Dataset for sharing agent traces.

Blue elements represent AgentLab components.

December 3, 2024 at 9:02 PM

Shikhar Murty

@shikharmurty.bsky.social

Folks, I'm not going to be at NeurIPS this year, but we have an *awesome* workshop that I'm super proud of.

Go attend, and use the link below to ask all of your burning questions about LLM reasoning, agents and compositionality!

Nouha Dziri @nouhadziri.bsky.social · Dec 3

🎊Excited for #neurips2024 and our "System 2 Reasoning at Scale" workshop. We have an exciting lineup of speakers who will answer your most burning questions about AI and reasoning 🚀

🔥Got spicy questions? Submit & vote here:
app.sli.do/event/dJNU63...

December 3, 2024 at 7:45 PM

Reposted by Shikhar Murty

Nouha Dziri

@nouhadziri.bsky.social

🎊Excited for #neurips2024 and our "System 2 Reasoning at Scale" workshop. We have an exciting lineup of speakers who will answer your most burning questions about AI and reasoning 🚀

🔥Got spicy questions? Submit & vote here:
app.sli.do/event/dJNU63...

December 3, 2024 at 5:43 PM

Shikhar Murty

@shikharmurty.bsky.social

I also wear the AI agents researcher hat. Can't say i'm similarly impressed by reviewers in that community...

Shikhar Murty @shikharmurty.bsky.social · Nov 27

ACL syntax track reviewers >> almost any other conference.

These folks care about their sub-field and i learn something new every time!

November 27, 2024 at 11:32 PM

Shikhar Murty

@shikharmurty.bsky.social

ACL syntax track reviewers >> almost any other conference.

These folks care about their sub-field and i learn something new every time!

November 27, 2024 at 7:44 PM

Shikhar Murty

@shikharmurty.bsky.social

What is a probing task that is purely about semantics?
Context: I have a probe trained to predict dependency relations, and would like to train another one on a semantics-only task (for research purposes).

November 24, 2024 at 5:00 AM

Shikhar Murty

@shikharmurty.bsky.social

Asked GPT-4o to draw parse trees in two languages:

November 21, 2024 at 5:49 AM

Shikhar Murty

@shikharmurty.bsky.social

Hot take (since it's still just friends on this platform):

It's crazy how the classic "sample and rerank" baseline from machine translation and IR got re-branded as "scaling up inference-time compute".

November 21, 2024 at 5:06 AM
