Dan
@danwalkerdatasci.bsky.social
90 followers 540 following 26 posts
Data person Former fish squeezer Python - R - Rust
Posts Media Videos Starter Packs
Reposted by Dan
henrikbengtsson.bsky.social
A #Slurm user just confirmed that "yay it works. Pretty sick!"

Thanks to excellent feedback from several users, it'll soon be even easier to distribute #rstats code via #HPC job schedulers using future.batchtools

#parallel #futureverse
henrikbengtsson.bsky.social
If anyone else is following this, we've moved over to github.com/futureverse/..., where progress has already been made
Reposted by Dan
kylewalker.bsky.social
AI is powerful, but it's no free lunch - and again, it's no substitute for YOUR expertise.
R and QGIS, name a better combo

#databs
smachlis.bsky.social
“I used R the statistical programming language to analyse each of the 3-hourly netCDFs — a file format for storing multidimensional scientific data — and create a geoJSON file where the data was greater than 35C. These files were then loaded into Qgis and styled….” - @sdbernard.bsky.social #RStats
Reposted by Dan
smachlis.bsky.social
“I used R the statistical programming language to analyse each of the 3-hourly netCDFs — a file format for storing multidimensional scientific data — and create a geoJSON file where the data was greater than 35C. These files were then loaded into Qgis and styled….” - @sdbernard.bsky.social #RStats
Reposted by Dan
posit.co
Posit @posit.co · Jun 18
Data science junkies, get ready! 🚀 "The Test Set" #podcast trailer is here for your viewing pleasure.

Tune in July 1st and every Tuesday after for new episodes with hosts @mchow.com, @hadley.nz, and @wesmckinney.com as they welcome thought leaders in #DataScience.

Subscribe now: pos.it/thetestset
Reposted by Dan
shikokuchuo.net
Bleeding edge update for the #tidyverse purrr package with even more seamless #rstats parallel maps.

Introducing our shiniest new adverb: `in_parallel()`. Just wrap your function to take advantage of blazing fast parallel processing via mirai.

pak::pak("tidyverse/purrr")

purrr.tidyverse.org/dev/
Functional Programming Tools
A complete and consistent functional programming toolkit for R.
purrr.tidyverse.org
Reposted by Dan
emilhvitfeldt.bsky.social
Being able to productionize a ML model is often the goal, however there are many things to keep track of when you do. The orbital package lets you translate your fitted scikit-learn or tidymodels model into SQL that that when run produces predictions.

posit.co/blog/databri... #python #rstats
Posit
Accelerate model deployment with Databricks and Orbital for R and Python Scikit-learn/Tidymodels projects.
posit.co
Claude 4 is pretty impressive 🤖
Reposted by Dan
kevinschaul.bsky.social
How reliable are LLMs at extracting data from pdfs? Inspired by @simonwillison.net's PyCon talk, I added extracting FEMA's daily operation briefing to my LLM evals suite.

Just one model extracted the data from the pdf correctly: Gemini 2.5 Pro Preview. Full results -> kschaul.com/llm-evals/ev...
Screenshot of the Declaration Requests in Process table
Reposted by Dan
nhsrcommunity.bsky.social
☕ Coffee and Coding ☕

Do you have an interesting piece of code/work to showcase, an opportunity for collaboration or a code dilemma you would like help with?

We would love to hear from you at Coffee and Coding – join the NHS-R Community Slack for more info (postcard.nhsrcommunity.com)!

#rstats
NHS-R Community
postcard.nhsrcommunity.com
Reposted by Dan
bigbookofr.com
Statistical Rethinking with brms, ggplot2, and the tidyverse Second edition by A Solomon Kurz
#RStats
https://bigbookofr.com/chapters/statistics.html#statistical-rethinking-with-brms-ggplot2-and-the-tidyverse-second-edition
Reposted by Dan
peck.phd
Evan Peck @peck.phd · Nov 20
Trying something new:
A 🧵 on a topic I find many students struggle with: "why do their 📊 look more professional than my 📊?"

It's *lots* of tiny decisions that aren't the defaults in many libraries, so let's break down 1 simple graph by @jburnmurdoch.bsky.social

🔗 www.ft.com/content/73a1...
Reposted by Dan
posit.co
Posit @posit.co · Mar 20
We're delighted to announce Jonathan McPherson – software architect at Posit – as keynote speaker at posit::conf(2025)!

If you're curious about how thoughtful design principles can improve the data science tools you use, you won't want to miss this!

Join us Sep 16-18 in Atlanta. pos.it/conf
Reposted by Dan
tanho.ca
Tan @tanho.ca · Feb 27
R+Docker, we use an R pkg project structure (R/, man/, tests/, inst/) plus additional top-level folders like `exec/` for docker-executable scripts, `dev/` for devel/sandbox scripts, `reports/` for one-time reports, and `local/` for gitignored large files.

app.R, plumber.R etc go at the top level.
Reposted by Dan
jonthegeek.com
So many of my #RStats checks got cleaner when I learned that nrow() always returns a number if you scream it loud enough:
```
nrow(NULL)
#> NULL
NROW(NULL)
#> [1] 0
```
Reposted by Dan
rmcelreath.bsky.social
People just now finding out that much our digital infrastructure runs on COBOL. The SABRE airline/hotel reservation system e.g. runs on virtual 1980s mainframes executing 1970s COBOL. See also many banks.

The Old Ones dwell in the dark, undiminished. We cannot kill them without crippling ourselves.
Greetings fellow stats textbook collectors lol
Terrible news, thanks for sharing the alternative
Success in wartime intelligence through statistics!

#databs
Reposted by Dan
tterence.bsky.social
Illuminated contours of the Gulf of México.

#rayshader adventures, an #rstats tale
A visualisation of the Gulf of México as illuminated contours
Thank you! Looking forward to reading this one
Reposted by Dan
thomasp85.com
If you have used #ggplot2 in the last couple of years you owe a great deal to @teunbrand.bsky.social who is behind most of the new features and fixes.

Read about his journey to become a part of the ggplot2 core team here:
Joining the ggplot2 team - Tidyverse
I joined the ggplot2 team and would like to share the experience.
www.tidyverse.org
Baffled by the decisions they made adapting the Wheel of Time to a series