TEGNicholas.bsky.social
@tegnicholas.bsky.social
330 followers 490 following 72 posts
Open-Source Software for science at Earthmover.io, built on Pangeo.io. One of many xarray.dev core devs. https://tom-nicholas.com/ Previously dabbled in oceanography at [C]Worthy and Columbia Uni., originally did fusion plasma physics.
Posts Media Videos Starter Packs
Pinned
tegnicholas.bsky.social
I've been thinking a lot recently about how one thing science needs generally is a **social network for sharing big data**, so I wrote this.

hackmd.io/wKKm4cIDR6a9...

🧵
Science needs a social network for sharing big data - HackMD
hackmd.io
Reposted by TEGNicholas.bsky.social
ketanjoshi.co
"[Matt Yglesias] can write whole essays claiming that fracking is good and we need fossil fuel friendly energy policies, dismissing progressives as childish, while never engaging with the scientific literature on the consequences of climate change"
Matt Yglesias Is Confidently Wrong About Everything
The Biden administration’s favorite centrist pundit produces smug pseudo-analysis that cannot be considered serious thought. He ought to be permanently disregarded.
www.currentaffairs.org
tegnicholas.bsky.social
This reads like the writings of climate denialists - constant strawmanning, shifting of goalposts, selection bias in research quoted, and conspiracy theorizing.
Reposted by TEGNicholas.bsky.social
fernandoperez.org
Open source, open science for earth, climate and geospatial science? Coming to #AGU25? Build tools in #Python @jupyter.org?

Submit an abstract for this session and come meet us and like minded scientists!
wilsonsauthoff.bsky.social
📣 Call for submissions: AGU25 IN029 "Open Source Geospatial Workflows in the Cloud" 🌍

Please submit your work using cutting-edge practices and tools that are shaping the future of geospatial science.

More info: events.geojupyter.org/conferences/...

#opensource #geospatial #jupyter #AGU25
AGU session advertisement with session title and invited speakers.
Reposted by TEGNicholas.bsky.social
emorwee.bsky.social
Will Alpine co-wrote Microsoft's manifesto on how AI will be a powerful force for good for climate change

In a new interview, Alpine disavows the manifesto, saying he believes Microsoft used his work to distract from the much larger climate harms the company enables through contracts with Big Oil
He helped Microsoft build AI to help the climate. Then Microsoft sold it to Big Oil.
A former Microsoft project manager reveals how the tech giant is using AI to help Big Oil drill—and how he and his partner are now pushing for change.
heated.world
Reposted by TEGNicholas.bsky.social
henrifdrake.bsky.social
This is fucking insane. Closing these NOAA labs would obliterate our ability to observe, understand, and forecast the Earth System, from weather systems tomorrow to sea levels 50 years from now.
zacklabe.com
What a tragedy this is even being proposed on paper... 💔

NOAA FY2026 Congressional Justification: www.noaa.gov/sites/defaul...
Termination of OAR’s Climate Laboratories and Cooperative Institutes (-$102,292, -204 FTE/ -216 Positions) –NOAA will continue to support high priority ocean and weather research programs in NOS and NWS.

In coordination with the requested terminations for Weather Laboratories and Cooperative Institutes (see OAR-10) and Ocean Laboratories and Cooperative Institutes (see OAR-19), NOAA will close the Atlantic Oceanographic & Meteorological Laboratory (AOML) in Miami, FL; the Air Resources Laboratory (ARL) in College Park, MD, Idaho Falls, ID, and Oak Ridge, TN, as well as a nation-wide network of soil moisture sensors; the Chemical Sciences Laboratory (CSL) in Boulder, CO; the Geophysical Fluid Dynamics Laboratory (GFDL) in Princeton, NJ; the Global Monitoring Laboratory (GML) in Boulder, CO, Utqiaġvik, AK, Mauna Loa, HI, Hilo, HI, Big Island, HI, American Samoa, and the South Pole; the Pacific Marine Environmental Laboratory (PMEL) in Seattle, WA; and the Physical Sciences Laboratory (PSL) in Boulder, CO.
Reposted by TEGNicholas.bsky.social
aussiastronomer.bsky.social
NASA is being told to cancel 19 *active* missions to save $6B, which looks to be less than the ICE *hiring/retention* budget going forward.

I need people to let that sentence sink into their bones for a minute.
reichlinmelnick.bsky.social
If the GOP reconciliation bill passes, ICE gets through FY2029:

- $45 billion for detention, on top of the current annual budget of $3.4 billion
- $14.4 billion for transportation and removal, on top of the current annual budget of $750 million
- $8 billion for hiring/retention
- Billions more.
Reposted by TEGNicholas.bsky.social
micheletobias.bsky.social
I've been adding new accounts to the Open Source Geospatial starter pack. Who else wants on or off?
#gischat #geosky

go.bsky.app/PGYLmPG
tegnicholas.bsky.social
I would have suggested Max Jones, Aimee Barciauskas, or Lindsey Nield, but they don't seem to be on BlueSky, so you could instead add @jhamman.bsky.social , @rabernat.bsky.social , or myself.
Reposted by TEGNicholas.bsky.social
adamsobel.bsky.social
It's outrageous that NASA GISS, one of the best earth & space science labs in the world, is being kicked out of its Columbia home. The outstanding scientists who work there can't say that publicly, but I can. And so can you --- call your reps, esp. (but not only) if you live in NYC or NY state.
Reposted by TEGNicholas.bsky.social
earthmover.io
𝐻𝑜𝑤 𝑑𝑜𝑒𝑠 𝐼𝑐𝑒𝑐ℎ𝑢𝑛𝑘 𝑎𝑣𝑜𝑖𝑑 𝑟𝑒𝑑𝑢𝑛𝑑𝑎𝑛𝑡 𝑠𝑡𝑜𝑟𝑎𝑔𝑒 𝑏𝑒𝑡𝑤𝑒𝑒𝑛 𝑑𝑎𝑡𝑎 𝑣𝑒𝑟𝑠𝑖𝑜𝑛𝑠?

Icechunk stores only new or changed chunks for each version —no redundant copies or rewrites. You get instant time travel, branching, and efficient updates, all with negligible storage overhead.

More: bit.ly/3F1XFST
Icechunk: Efficient storage of versioned array data - Earthmover
We recently got an interesting question in Icechunk’s community Slack channel (thank you Iury Simoes-Sousa for motivating this post): I’m new to Icechunk. How is the storage managed for redundant info...
earthmover.io
tegnicholas.bsky.social
Excellent post by Brian Davis laying out why doing "Open Science" for data-driven workflows is almost impossible in practice, at least without much better data pipeline tools.
earthmover.io
Our latest blog post dives into the chaos of the status quo - where every tweak means regenerating the 𝑤ℎ𝑜𝑙𝑒 𝑑𝑎𝑡𝑎𝑠𝑒𝑡 and collaboration and experimentation is often stifled by silos and secret knowledge. Check out the full post: earthmover.io/blog/tensoro...
TensorOps: Scientific Data Doesn't Have to Hurt - Earthmover
Curious how your team scores on the "Data Pain Survey"? Wondering why your teams are building Rube Goldberg machines just to put some data on a map? Or just want to see our plan to bring order to your...
earthmover.io
tegnicholas.bsky.social
It's fun to work with real hardcore software engineers like @functionth.bsky.social who can teach you about database consistency and transactions and all that

Scientific data infrastructure should be built on solid foundations like this instead of on piles of janky code written by postdocs...
earthmover.io
1/ 🚨 New Blog Post Alert: "𝐿𝑒𝑎𝑟𝑛𝑖𝑛𝑔 𝐴𝑏𝑜𝑢𝑡 𝐼𝑐𝑒𝑐ℎ𝑢𝑛𝑘 𝐶𝑜𝑛𝑠𝑖𝑠𝑡𝑒𝑛𝑐𝑦 𝑤𝑖𝑡ℎ 𝑎 𝐶𝑙𝑖𝑐ℎ𝑒́𝑑 𝑏𝑢𝑡 𝐼𝑛𝑠𝑡𝑟𝑢𝑐𝑡𝑖𝑣𝑒 𝐸𝑥𝑎𝑚𝑝𝑙𝑒" 🏦🔁

👉 Read it here: earthmover.io/blog/learnin...
Learning about Icechunk consistency with a clichéd but instructive example - Earthmover
In this post we’ll show what can happen when more than one process write to the same Icechunk repository concurrently, and how Icechunk uses transactions and conflict resolution to guarantee consisten...
earthmover.io
Reposted by TEGNicholas.bsky.social
mikemahoney218.com
the fact that I've never once thought about making a range request, and yet make them constantly for extremely targeted data pulls, is absolutely an invisible technical miracle
tegnicholas.bsky.social
Of course- I really wrote this article for my past self! I wish someone had explained this cloud science stuff to me earlier.
tegnicholas.bsky.social
It’s also important for understanding what problem VirtualiZarr solves.

I’ve given this explanation to many people in the past (including at @cworthy.bsky.social), so I hope that this article can serve as a useful reference the next time someone wonders what @zarr.dev actually is.
tegnicholas.bsky.social
I wrote the article I wish I could have read back when I first heard of Zarr and cloud-native science back in 2018.

This explains how object storage and conventional filesystems are different, and the key properties that make @zarr.dev work so well in cloud object storage.
tegnicholas.bsky.social
5/ Almost all organisations working with scientific array data have this kind of data delivery issue, even if it's just internally.

Whilst the Flux integrations today are established geospatial standards, you also see similar patterns in other fields such as Neuroscience.
tegnicholas.bsky.social
4/ Flux's architecture is auto-scaling, so once turned on there is no need to worry about how many users are hitting the data.

As it's not a stateful server like THREDDS, it won't catch fire under pressure.

This is what "Cloud-Native" architectures for scientific data look like.
tegnicholas.bsky.social
3/ Your downstream scientists, GIS users, analysts, and external users can all now forget about file formats!

They just keep using the same GUI or tool or script that they prefer, and don't need any other services or copies of the data made bespoke for them - Flux does that on-demand!
tegnicholas.bsky.social
2/ Flux bridges this chasm.

It sits in between your data and the consumers, springing up at a moment's notice to provide subsets of data however your users prefer it.