Max kuhn
@topepo.bsky.social
4.8K followers 290 following 140 posts
Writing modeling packages at @posit.co (née RStudio). Opinions are my own. https://max-kuhn.org/
Posts Media Videos Starter Packs
Pinned
topepo.bsky.social
I last posted here about 6 months ago. Here's what I've been working on and/or thinking about. #rstats, #statistics, #ml

package upkeep: we are doing major preventive maintenance on the tidymodels packages ("upkeep week!"). It's rote but very rewarding work. Error messages are 100x better.

1/3
Applied Machine Learning for Tabular Data
aml4td.org
topepo.bsky.social
The kangaroos are nice, but those handles are 💯
topepo.bsky.social
I hesitate to inflict bag envy on this whole site but here they are. Thanks to @visnut.bsky.social and @robjhyndman.com
Reposted by Max kuhn
kellybodwin.com
Shannon's slides are always so unbelievably clear and helpful!!!

github.com/shannonpileg...

I'm having "Ohhhhh that's what that means" moments every 10 seconds here.
#positconf2025
Reposted by Max kuhn
kevinbaer.bsky.social
I'm all in on @topepo.bsky.social and co's new {important} and other variable importance/feature selection tools in tidymodels! #rstats
Reposted by Max kuhn
Reposted by Max kuhn
lauretig.bsky.social
Simon Wood, the GOAT of generalized additive models & creator of the mgcv #rstats package, has an Annual Review of Statistics essay on GAMs, available open access #statssky #mlsky

www.annualreviews.org/content/jour...
output from a GAM in the linked essay
Reposted by Max kuhn
ddgutierrez.bsky.social
ML success ≠ Kaggle leaderboard. The real world rewards:
- Clear explanations
- Thoughtful metrics
- Collaboration with domain experts

A 0.01 lift in F1 score won’t save you if no one understands your model.

#DataSciene #MachineLearning #AI #RStats
Reposted by Max kuhn
posit.co
Posit @posit.co · Sep 5
Announcing a new blog series on LLMs from @veerle.hypebright.nl!

In Part 2, “Talking to LLMs: From Prompt to Response”, we get hands-on with LLM-powered apps. This guide is for #Python & #RStats users who want to go beyond the basics.

Check it out here: shiny.posit.co/blog/posts/s...
topepo.bsky.social
The new version of tune (out before posit::conf🤞) has support for both.

My testing so far shows that future(_lappy) and mirai(_map) are very similar in speed-ups most of the time. The times when they differ split in either direction, so I haven’t seen any clear signal.
topepo.bsky.social
Slides from my #rstats talk “Measuring LLM Effectiveness” at #dataconfAI with @simonpcouch.com.

topepo.github.io/2025_NYR/

Video in about a month.

Great conference!
Reposted by Max kuhn
dataconf.ai
🧠📊 3 days. 2 workshops. 20 talks. 1 amazing community.
#dataconfAI is officially wrapped!

Thanks for showing up with insights, ideas, inspiration, and curiosity. And to all who made it unforgettable—speakers, attendees, sponsors, and volunteers.

See you at the next one! 🚀
Reposted by Max kuhn
simonpcouch.com
In working on an eval for an experimental tidymodels AI assistant, I realized that today's frontier LLMs know much more about #rstats tidymodels than I thought.

www.simonpcouch.com/blog/2025-08...
A ggplot2 bar plot. On the x axis are two LLMs, Claude Sonnet 4 and Gemini Pro 2.5. Both bars are dodged by three configurations: Predictive, Databot, and `run_r_code()` tool. On the y axis is relative performance, ranging from 0 (baseline model) to 100 (best human modelers). The bar heights for Claude Sonnet 4 are all about the same (around 80) and, for Gemini 2.5 Pro, the height for Predictive, an app designed specifically as a tidymodels assistant, is much lower (around 20).
topepo.bsky.social
It's a lot of fun! Everyone gets something out of it.

Plus, @davisvaughan.bsky.social always finds a great barista!
hadley.nz
We still have spots available for tidyverse dev day on Sept 19: www.tidyverse.org/blog/2025/07.... Please come along to contribute to the tidyverse and have a bunch of fun along the way! It's open to all, but is most convenient if you're coming to posit::conf or live near Atlanta #rstats
Tidyverse developer day 2025
Join us in Atlanta for tidyverse developer day on September 19, 2025!
www.tidyverse.org
Reposted by Max kuhn
dataconf.ai
Start off The NY Data Science & AI Conference w/ hands-on workshops on Aug 25 in NYC or online:

📊 Machine Learning in R w/ Max Kuhn
🤖 Intro to LLMs/AI w/ Daniel Chen

🎟️ Learn more & register at dataconf.ai/nyc

#RStats #AI #Workshops #databs @topepo.bsky.social @chendaniely.bsky.social
Reposted by Max kuhn
emilhvitfeldt.bsky.social
Excited to share my newest quarto revealjs plugin: imagemover

Easily reposition and resize images directly in your quarto revealjs slides for a much smoother slidecrafting experience

github.com/EmilHvitfeld...
#quarto
Reposted by Max kuhn
transport-talk.bsky.social
Time to convert this into an LLM powered snippet using {chores} by @simonpcouch.com. #useR2025 #rstats
transport-talk.bsky.social
CreateBranding shiny app is now available here: umair.shinyapps.io/create_brand...

You can now download the palettes, scales and a theme for ggplot2. See a demo here: youtu.be/C7-rhLPrA3o

Try it out and let me know your suggestions for improvements. #rstats
Reposted by Max kuhn
kellybodwin.com
Welp, {chores} by @simonpcouch.com is an immediate install for sure.

Basically it's {usethis} plus llm bundled into RStudio/Positron key encoding.

Excited!!! 🧹🧺

#useR2025 #rstats #couchverse?
Reposted by Max kuhn
landeranalytics.com
Don't miss out learning from the best, Max Kuhn! @topepo.bsky.social

#dataBS #Tidymodels #MachineLearning
dataconf.ai
📊 Want to level up your R modeling skills? Max Kuhn’s Machine Learning in R workshop is an intro to tidymodels, covering data prep, resampling, tuning & evaluation using real workflows!

📍Aug 25 in NYC or online
🎟️ & info: dataconf.ai/nyc

#RStats #Tidymodels #MachineLearning @topepo.bsky.social
Reposted by Max kuhn
dataconf.ai
📊 Want to level up your R modeling skills? Max Kuhn’s Machine Learning in R workshop is an intro to tidymodels, covering data prep, resampling, tuning & evaluation using real workflows!

📍Aug 25 in NYC or online
🎟️ & info: dataconf.ai/nyc

#RStats #Tidymodels #MachineLearning @topepo.bsky.social
topepo.bsky.social
We are super excited to have you join us for the day!
posit.co
Posit @posit.co · Jul 29
Check out our Modeling & ML with #RStats workshops at posit::conf!

🔢 Intro to ML w/ tidymodels @simonpcouch.com
🏗️ Feature Engineering & Tuning @topepo.bsky.social @emilhvitfeldt.bsky.social
↔️ Causal Inference @malcolmbarrett.malco.io @lucystats.bsky.social

Learn more: pos.it/conf-2025-workshops
A promotional image for "Introduction to Machine Learning in R with tidymodels" at posit conf (2025). The image features Simon Couch, from Posit, smiling outdoors in a headshot. The text "Atlanta" and "Sept. 16-18" are also visible, alongside abstract cubes in orange, green, and blue. A promotional image for "Causal Inference in R" at posit conf (2025). The image features two individuals: Malcolm Barrett from Stanford and Lucy D'Agostino McGowan from Wake Forest University. Malcolm Barrett is shown in a headshot with a blurred background, and Lucy D'Agostino McGowan is shown smiling with a blackboard in the background. The text "Atlanta" and "Sept. 16-18" are also visible, along with abstract cubes in orange, green, and blue. A promotional image for "Getting More Out of Feature Engineering and Tuning for Machine Learning" at posit conf (2025). The image features Max Kuhn and Emil Hvitfeldt, both from Posit. Max Kuhn is shown in a headshot with a blurred background, and Emil Hvitfeldt is shown smiling with plants in the background. The text "Atlanta" and "Sept. 16-18" are also visible, along with abstract cubes in orange, green, and blue.
topepo.bsky.social
I’m sorry it wasn’t a good experience for you.

I’m surprised by the first two issues; those have both been in positron for a while and work pretty well (for me).

The visual cues are not as isolated as they are in Rstudio. We are aware of that and are working on it. It’s a vscode constraint.
topepo.bsky.social
Positron is definitely visually more than RStudio, and this is a helpful overview.
posit.co
Posit @posit.co · Jul 28
Take a quick tour of Positron, Posit's next-generation data science IDE, built by the creators of RStudio.

Read the blog to learn more: posit.co/blog/a-quick...
A screenshot of the Positron interface, labeled with its components: "Activity bar," "Primary side bar," "Editor," "Secondary side bar," and "Panel." The text "A quick tour of Positron" is on the left.
Reposted by Max kuhn
frick.ws
The call for papers for LatinR 2025 (online) is now open! You can present in English, Spanish, or Portuguese 🗣️ #RStats latinr.org/en/blog/en/2...
Call for papers – LatinR 2024
latinr.org
topepo.bsky.social
We've released 4 new chapters of Applied Machine Learning for Tabular Data.

Includes: Bayesian optimization, feature selection, model comparisons, classification metrics, calibration, #rstats computing sections, and more

blog.aml4td.org/posts/2025-0...
Part 3 is Finished, Part 4 Started – Applied Predictive Modeling Blog
blog.aml4td.org