Tom McCoy
@rtommccoy.bsky.social
1.9K followers 320 following 160 posts
Assistant professor at Yale Linguistics. Studying computational linguistics, cognitive science, and AI. He/him.
rtommccoy.bsky.social
Beginning a Grand Tour of California!
- Oct 6: Colloquium at Berkeley Linguistics
- Oct 9: Workshop at Google Mountain View
- Oct 14: Talk at UC Irvine Center for Lg, Intelligence & Computation
- Oct 16: NLP / Text-as-Data talk at NYU

Say hi if you'll be around!
Reposted by Tom McCoy
begus.bsky.social
Exciting talk in the linguistics department at UC Berkeley tomorrow!
@rtommccoy.bsky.social
rtommccoy.bsky.social
Yes!! An excellent point!!
rtommccoy.bsky.social
🤖 🧠 NEW BLOG POST 🧠 🤖

What skills do you need to be a successful researcher?

The list seems long: collaborating, writing, presenting, reviewing, etc.

But I argue that many of these skills can be unified under a single overarching ability: theory of mind

rtmccoy.com/posts/theory...
Illustration of the blog post's main argument, summarized as: "Theory of Mind as a Central Skill for Researchers: Research involves many skills. If each skill is viewed separately, each one takes a long time to learn. These skills can instead be connected via theory of mind – the ability to reason about the mental states of others. This allows you to transfer your abilities across areas, making it easier to gain new skills."
rtommccoy.bsky.social
Totally. I think one key question is whether you want to model the whole developmental process or just the end state. If just the end state, LLMs have a lot to offer; but if the whole developmental process (which is what we ultimately should aim for!), there are many issues with how LLMs get there
rtommccoy.bsky.social
The conversation that frequently plays out is:

A: "LLMs do lots of compositional things!"
B: "But they also make lots of mistakes!"
A: "But so do humans!"

I don't find that very productive, so would love to see the field move toward more detailed/contentful comparisons.
rtommccoy.bsky.social
They're definitely not fully systematic, so currently it kinda comes down to personal opinion about how systematic is systematic enough. And one thing I would love to see is more systematic head-to-head comparisons of humans and neural networks so that we don't need to rely on intuitions.
rtommccoy.bsky.social
Yeah, I think that's a good definition! I also believe that some LLM behaviors qualify as this - they routinely generate sentences with a syntactic structure that never appeared in the training set.
rtommccoy.bsky.social
"Hello world!" sounds like a word followed by a crossword clue for that word: "Hell = Low world"
rtommccoy.bsky.social
And although models still make lots of mistakes on compositionality, those mistakes alone aren't enough to show they're non-human-like, because humans make mistakes too. So, if we want to make claims about models being human-like or not, what we really need are finer-grained characterizations of what human-like compositionality is.
rtommccoy.bsky.social
Broadly agreed with these points! But though being less “bad at compositionality” isn’t the same as being compositional in the way humans are, it does mean that we can no longer say "models completely fail at compositionality and are thus non-human-like" (because they no longer completely fail).
rtommccoy.bsky.social
I agree that garden paths & agreement attraction could be explained with fairly superficial statistics. For priming, what I had in mind was syntactic priming, which I do think requires some sort of structural abstraction.
rtommccoy.bsky.social
What would you view as evidence for true productivity?
rtommccoy.bsky.social
Definitely true that LLM-style models can't go gather new data (they're restricted to focusing on a subset of their input). But it doesn't feel outside the spirit of ML to let the system seek out new data, which it then applies statistical learning to, as long as the seeking is also statistically driven
rtommccoy.bsky.social
E.g., in ML, datapoint importance is determined by some inscrutable statistics, while in more nativist approaches it's determined by a desire to build a high-level causal model of the world?
rtommccoy.bsky.social
It feels like a false dichotomy to me? In ML models, some training examples are more influential than others, so you could say an ML model can "decide" to ignore some data. In that sense both model types decide which data to learn from, but they differ in what criteria they use to do so.
rtommccoy.bsky.social
Yes, this is a great point! I do think language (which is the domain I mainly study) gets around these concerns a bit: for language, human children primarily have to rely on being fed data, and that data is symbolic in nature. But I agree these properties don't hold for all cognitive domains!
rtommccoy.bsky.social
In other words, our argument is very much based on the available evidence. New, stricter evidence could very well push the needle back toward needing symbols at the algorithmic level - and that would be exciting if so!
rtommccoy.bsky.social
One key next step, then, is stricter diagnostics of symbolic behavior that go beyond “can humans/models be compositional” into “in what specific ways are we compositional”, “what types of errors are made”, etc., and then comparing humans & models head-to-head

(cont.)
rtommccoy.bsky.social
A broader comment: LLMs are definitely far from perfect. But there has been important progress. For a while, we could say “neural nets are so bad at compositionality that they’re obviously different from humans.” I’m no LLM fanboy, but I do think such sweeping arguments no longer apply

(cont.)
rtommccoy.bsky.social
FWIW, the types of productivity that we look at go beyond n-grams; there’s also novelty in syntactic tree structures, and in things like “using a word as the subject of a sentence when the LLM has only ever seen it as the direct object”
rtommccoy.bsky.social
I completely agree that the differences in training/learning between LLMs and humans are a major shortcoming of LLMs as cognitive models - probably the biggest edge that symbolic models have over neural networks. Meta-learning seems promising here, but is still in early stages.
rtommccoy.bsky.social
Maybe no one talks about AlphaGo as a cognitive model because there’s no history of research on “here are the particularly informative behavioral quirks that humans show when playing Go”, such that it’s not clear what evidence we would look for to argue a model is playing Go in a human-like way?
rtommccoy.bsky.social
E.g., for psycholinguistics, LLMs show garden path effects, agreement attraction, & priming. I would also put compositionality, productivity, & rapid learning in this category of “particularly informative cognitive phenomena.”
rtommccoy.bsky.social
I completely agree that matching human performance is not all that matters - models should be human-like, not just human-level. That said, certain behaviors are viewed in CogSci as particularly illuminating about the mind, and one exciting thing about LLMs is that they display many such properties