Amin Shoarinejad
banner
aminsn.bsky.social
Amin Shoarinejad
@aminsn.bsky.social
PhD Candidate, Statistics & Machine Learning. Bayesian Enthusiast, Transformers, R 🤝 Python

🌐 https://aminsn.github.io/
Instead of embeddings/density clusters that miss nuance, 𝗱𝘂𝗸𝘁𝗿 uses 𝗟𝗟𝗠 𝗿𝗲𝗮𝘀𝗼𝗻𝗶𝗻𝗴 to deduce subtle concepts & connect the dots. Minimal dependencies. Supports OpenAI/Gemini/HuggingFace or any LLM of your choice.

Check it out!
Github: github.com/Aminsn/duktr
GitHub - Aminsn/duktr: An LLM-powered Python package for dynamic concept mining and mixed-membership clustering over text.
An LLM-powered Python package for dynamic concept mining and mixed-membership clustering over text. - Aminsn/duktr
github.com
January 1, 2026 at 9:51 PM
GPs: defining distributions over functions instead of points.

It’s an interesting stretch to think about functions as random variables.
October 31, 2023 at 4:04 AM
No problem. Exactly. I think that example is misleading and shouldn’t be used against R2 to undermine its credibility for predictive performance evaluation.
October 28, 2023 at 5:24 PM
You won’t get different r2 if you only change the unit of y (e.g. height). Your regression coefficients will change and the r2 remains unchanged.
October 28, 2023 at 3:45 PM
Goodness of prediction and goodness of detection of the detectable signal are two different things. You can do a good job in finding the true underlaying function and yet fail miserably in having a good prediction model.
October 28, 2023 at 3:03 PM
I disagree, that shows why r2 is better than mse. The second example has significantly weaker signal and r2 shows that while mse doesn’t. 1 cm error in predicting a human’s height is very good while it’s rubbish for predicting an ant’s height (same mse, different context). R2 reflects the context
October 28, 2023 at 2:58 PM
How is it not good measure of predictive accuracy?
October 28, 2023 at 9:57 AM