Erick Scott
@erickscott.bsky.social
41 followers 93 following 50 posts
Scientist, building cstructure.
Posts Media Videos Starter Packs
erickscott.bsky.social
I'd love to see someone specify that function...where does Terence Tao sit on the curve?
erickscott.bsky.social
I have been surprised that the first generation is usually more thoughtful with the sycophant warning as a system prompt.

I won't speculate on what's actually happening with matmul/reasoning, but I have found it helpful to counteract the vendor's ingratiating base prompt.
erickscott.bsky.social
Try adding: 'Don't be a sycophant' to your system prompt.

Gemini is more stubborn...
erickscott.bsky.social
I don't think they are useless and deepmind is definitely moving forward with principled quantitative approaches. Moving stochastic outputs into structured models + rapid human error correction is what @travisgerke.bsky.social and I are working on at cStructure. Happy to chat anytime
erickscott.bsky.social
The code that supposedly underpinned the analysis used a fake propensity score (0.1*covariate1 + 0.2*covariate2...) with a comment that a real propensity model should be implemented.

This happens all the time: code syntax was fine, semantics wrong - assoc. text was plausible. User beware.
2/2
erickscott.bsky.social
I have many similar stories. For example, I asked for a propensity score analysis of Lalonde assuming this canonical example is a best case scenario. I provided the dataset. The generated text provided a correct and nuanced description of the estimator and the ATE. 1/2
erickscott.bsky.social
cStructure is proud to collaborate with @BeeKeeperAI_Inc and DREAM on The COVID Causal Diagram DREAM Challenge.

Privacy-preserving compute + collaborative causal modeling -> the future of responsible AI development.

Learn more at cstructure.net

@travisgerke.bsky.social
Reposted by Erick Scott
rickylongthread.bsky.social
Tonight is the 250th anniversary of Paul Revere’s midnight ride. May his memory remind us all to resist the tyranny forming in our government.
erickscott.bsky.social
Bayesian posterior distributions. So much information packed into the density. If two people disagree on what threshold should be used to make a decision, it's easy to calculate the support for either.
erickscott.bsky.social
Reminds me of the difference b/w Efficacy (ITT if properly used, e.g. abstinence for teen pregnancy) vs Effectiveness (PP, outcomes when abstinence is used in practice).

In practice, I think the effectiveness of a causal method is important as unmeasured confounding is ever present in real data.
Reposted by Erick Scott
chrismurphyct.bsky.social
The Trump-Supreme Court battle is not really the crisis.

The crisis is here now. Trump is enacting an insidious coordinated attack on our institutions of democratic accountability, designed to crater democracy before next fall.

1/ A long 🧵to explain the plan & how we stop it.
erickscott.bsky.social
I really do believe Gordon et al. offered a best effort assessment. The scale, diversity, and quality of their ground truth is quite impressive.

What would you have done differently to reduce assumption violation?
erickscott.bsky.social
I see simulations as a useful tool to assess method performance under various degrees of assumption violation.

I also think the simulations should approximate the magnitude and direction of bias seen in high quality empirical studies.
erickscott.bsky.social
Am I the only one in industry, that looks at this thread and remembers junior hires showing up to their first stakeholder meeting after throwing "all the x's" into sci-kit learn and then getting absolutely thrashed by the domain experts?
erickscott.bsky.social
It's like we learned absolutely nothing from the reproducibility crisis, kitchen-sink machine learning models for covid, population/environmental stratification in genomics, a/b testing at scale...sigh.
erickscott.bsky.social
I just named several industries that in practice don't blindly use LASSO. A/B testing is used by any industry with a website/app and small to large companies employ (data) scientists to design and analyze the experiments. Healthcare is a pretty large industry, Computing is a pretty large industry??
erickscott.bsky.social
Then I am puzzled by the idea that in practice scientists just expect LASSO to select the right variables. Here's SHAP docs describing why that is a bad assumption **in practice**
shap.readthedocs.io/en/latest/ex...
erickscott.bsky.social
You should encourage him to explore causal inference.

Practical applications that share the same concern about LASSO: A/B testing, drug development, electrical engineering, physics
Reposted by Erick Scott
chrismurphyct.bsky.social
Bone chilling.

A court ordered Kilmar Abrego Garcia to stay in the United States.

The Supreme Court ruled 9-0 that he was illegally removed. Trump is pretending he won the ruling 9-0.

1/ You may not think this case means anything to you. But let me tell you why it does.
erickscott.bsky.social
There are several excellent technical books on this subject.

WhatIf by Hernán and Robins: miguelhernan.org/whatifbook

Causal Inference in Statistics by Pearl: www.amazon.com/Causal-Infer...

Causality by Pearl: bayes.cs.ucla.edu/BOOK-2K/
Reposted by Erick Scott
chrismurphyct.bsky.social
Those trying to understand the tariffs as economic policy are dangerously naive.

No, the tariffs are a tool to collapse our democracy. A means to compel loyalty from every business that will need to petition Trump for relief.

1/ A 🧵 to explain his plan and how we fight back.
erickscott.bsky.social
Rectangle is amazing for organizing windows
rectangleapp.com

Spaces are also really helpful to keep work streams partitioned. 3 finger up/down

Dbeaver is the best free database GUI for mac

Homebrew for package management is a must
Rectangle
Move and resize windows in macOS using keyboard shortcuts or snap areas. The official page for Rectangle.
rectangleapp.com
Reposted by Erick Scott
akshaykagrawal.bsky.social
🚀 The @marimo.io YouTube channel crossed 1k subscribers today — just two weeks after its launch!

marimo is best understand by seeing it in action. In his latest video, the one and only @koaning.bsky.social gives a bird's eye overview of what sets marimo apart:

www.youtube.com/watch?v=3N6l...
An overview of marimo
YouTube video by marimo
www.youtube.com