Nathaniel Haines
banner
natehaines.bsky.social
Nathaniel Haines
@natehaines.bsky.social
Paid to do

p(b | a) p(a)
p(a | b) = —————————
p(b)

Data Scientist | Computational Psychologist | Devout Bayesian

https://bayesianbeginnings.com/
oops, alpha = cronbach's alpha, which is a very common measure of reliability in the psychonetric lit en.m.wikipedia.org/wiki/Cronbac...
Cronbach's alpha - Wikipedia
en.m.wikipedia.org
September 29, 2025 at 1:36 AM
because alpha is math equivalent to the average (length adjusted) split half reliability, it fits this use case well. e.g. if you score 4 of 8 items (call this test A score) and then also the left out 4 (test B score), the correlation of test A and B (across test takers) is what alpha measures
September 29, 2025 at 1:32 AM
yeah sounds like OP is after something similar to the concept of reliability (in the psychometric sense, although not sure what the construct is here)

the whole simulation thing doesn't seem necessary if that's the case—the standard reliability measures will do
September 29, 2025 at 1:31 AM
Congrats dude!!
August 20, 2025 at 2:12 PM
Ah yeah most of the work im familiar with skews toward generative models. e.g. like this classic piece, which captures various interactions between Stroop conditions, learning effects, etc.

psycnet.apa.org/record/1990-...
APA PsycNet
psycnet.apa.org
July 2, 2025 at 12:49 PM
Probably not what you are referring to, but the Stroop task is our main example here: psycnet.apa.org/fulltext/202...
APA PsycNet
share.google
July 2, 2025 at 12:28 PM
yeah this is a great set
May 13, 2025 at 12:34 AM
Wish I could have made it in person!
May 5, 2025 at 9:41 PM
i have now vented on reddit and bsky so maybe i will feel better about it now
April 23, 2025 at 2:47 AM
tariffs amirite
April 21, 2025 at 3:51 PM
Thanks for the endorsement! Awesome to hear it's been influential to your work 😁

And yes the KL finding is super interesting by itself, happy to hear someone found it buried in the supplement ha
April 17, 2025 at 7:46 PM
Thank you! Glad you have found it useful 🤓
April 17, 2025 at 7:44 PM
it was horrible, there were 10+ papers based on the initial preprint/blog that were in print before this one eventually made it there 😅
April 17, 2025 at 3:22 PM
Yeah I've found LLM tech good for stuff that is mostly boilerplate (e.g. some general software development stuff), but when it comes to modeling work that is necessarily bespoke, they are quite bad
April 14, 2025 at 3:59 PM