Dewey Dunnington
@paleolimbot.bsky.social
190 followers 69 following 35 posts
Scaling spatial computing at Wherobots, Inc. ApacheArrow PMC, #gischat, #rstats, ex Voltron Data.
Posts Media Videos Starter Packs
Reposted by Dewey Dunnington
pitrou.net
Great news for Arrow (and more work for us 😄).
Also, those work items also implicitly apply to #ApacheParquet. @julien.ledem.net
Reposted by Dewey Dunnington
kylebarron.dev
New blog post! kylebarron.dev/blog/fast-py...

This dives into performance pitfalls of Python bindings, why projects like Lonboard are so fast, and why I care about Apache Arrow so much. The post is high level and you don't need to know C/C++/Rust to understand it!
paleolimbot.bsky.social
FGB and COG are using the same mechanism that hyparquet, fsspec, objstore, and DuckDB's httpfs are using to read from http (i.e., range requests)
paleolimbot.bsky.social
Definitely depends what you're up to! People are definitely experimenting with serving features directly from Parquet files for visualization (e.g., medium.com/center-for-c... )
Optimal GeoParquet Partitioning Strategy
My quest for one-size-fits-all format
medium.com
paleolimbot.bsky.social
- FGB is great! Lookups with small ranges work well but full scans are much slower (particularly if zipped!)
- SedonaDB, DuckDB, and pyarrow with fsspec can all do partial Parquet reads from http (give geoarrow.org/data a try!)
Data – GeoArrow
geoarrow.org
paleolimbot.bsky.social
Interesting...I think of GeoParquet and GeoArrow as tech that allows more people to avoid the cloud (i.e., use your laptop to do things that everyone thought they needed "the cloud" for five years ago). When your 14 GB .fgb files become 2GB GeoParquets, you can fit a whole lot more of them locally!
Reposted by Dewey Dunnington
kylewalker.bsky.social
R-Universe by @rOpenSci is an extremely useful project.

My latest R package, {pmtiles}, is not a good fit for CRAN as it bundles a pre-compiled Go library.

With R-Universe, I was able to get installable cross-platform binaries built in hours!

#rstats

walkerke.r-universe....
paleolimbot.bsky.social
I cannot say enough about DataFusion...in order to build an engine that considers spatial types at every level we needed to customize types, functions, optimizer rules, joins, Parquet pruning, and more. DataFusion not only made this possible but documented even the most obscure bits. So cool!
Reposted by Dewey Dunnington
Reposted by Dewey Dunnington
Reposted by Dewey Dunnington
kylebarron.dev
Published arro3-core v0.6 with wider dtype support for converting numpy ndarrays to Arrow. It now additionally supports:
- datetime64
- timedelta64
- fixed-width strings/bytes
- variable-width strings
- object arrays of str/bytes

And the Python wheel is still just 2.5MB!
github.com/kylebarron/a...
Release py-v0.6.0 · kylebarron/arro3
arro3-core Breaking Changes 🔧 No breaking changes. New features ✨ Expanded dtype support in Array.from_numpy. It now additionally supports datetime64, timedelta64, fixed-width strings and bytes, v...
github.com
Reposted by Dewey Dunnington
columnar.tech
In September the @columnar.tech crew are headed to PyData Paris 2025 and the first ever Apache Arrow Summit. The organizer @quantstack.bsky.social is a dedicated supporter of @arrow.apache.org. We’re delighted to be sponsoring the event.
pydataparis.bsky.social
Welcome to our new sponsor Columnar !
Reposted by Dewey Dunnington
jakubnowosad.com
Check out "Scaling the r-spatial ecosystem" by Dewey Dunnington 🌍📦
An exploration of how R’s spatial tools can be used for big(ger) data.

Video: youtu.be/tjNEoIYr_ag?...
Slides: dewey.dunnington.ca/slides/rspat...

#RStats #rspatial #GIS #SpatialData
Reposted by Dewey Dunnington
yutannihilation-en.bsky.social
Glad to see another proof that Rust-powered R packages work fine on webR!

For those who are wondering what "savvy" is, it's a framework to use Rust in an R package. #rstats

yutannihilation.github.io/savvy/guide/...
Reposted by Dewey Dunnington
etiennebacher.bsky.social
polars can be used via webR, thanks to the work of @gws.phd both on webR and on the Rust component of polars.

It also uses savvy, developed by @yutannihilation-en.bsky.social‬, to connect Rust and R.
Reposted by Dewey Dunnington
etiennebacher.bsky.social
#rstats polars 1.0.0 is now available!

polars provides bindings to the eponymous Rust library, a very efficient dataframe library.

This is the outcome of a 1-year long rewrite led by @eitsupi to refactor the internals and match the API of Python polars.

pola-rs.github.io/r-polars/

Details ⬇️
Polars R Package
pola-rs.github.io
paleolimbot.bsky.social
A new version of @arrow.apache.org is out! I'm personally excited about the Parquet support for Geometry/Geography but there's a lot of other great features in this release...check out the release post for all the details! arrow.apache.org/blog/2025/07...
Reposted by Dewey Dunnington
yutannihilation-en.bsky.social
Voting started! I hope my proposal of a talk about GeoArrow is attractive enough 🙏
foss4g.org
Time to select what looks delectable to you 🧁

Wander through inspiring workshop, talk, & presentation submissions

Pause & vote up or vote down in ranking

Together = a collectively curated conference

Grab yourself a cool or hot drink and dive in:
https://2025.foss4g.org/attend/call-for-papers
Reposted by Dewey Dunnington
arrow.apache.org
Just a reminder that this call closes July 26! Get your proposal(s) in by then. If you have any questions or need help, feel free to DM.
arrow.apache.org
Apache Arrow Summit 25 is happening! Join us in person on October 2nd, in Paris (hosted by @pydataparis.bsky.social ). The Call For Proposal is open, submit your talks before July 26th:
sessionize.com/arrow-summit...
Arrow Summit 2025: Call for Speakers
sessionize.com