Mike Mahoney
banner
mikemahoney218.com
Mike Mahoney
@mikemahoney218.com
Helped build NYS's Forest Carbon Assessment. Currently helping build public water data infrastructure. PhD in Environmental Science.

#rstats, ML, Boston and spatial data.
Opinions my own. RTs imply causation.
https://mm218.dev
github.com/mikemahoney218
it's fine but I'm spoiled by largely flying JetBlue's Airbus lines, which are ABSOLUTELY nicer
January 20, 2026 at 7:55 PM
on the way back I get an A319 instead of the E175 which I think makes for a crappy plane hat trick
January 20, 2026 at 6:09 PM
of course these days I don't need tires, my wheels are 711mm of solid steel
January 15, 2026 at 7:08 PM
I absolutely relate to "forgetting the license plate number you've had for 7 years" though
January 15, 2026 at 6:02 PM
I'm just really baffled by the Google Photos mention here. Did you forget that you sometimes leave your home? Did the idea that "weather happens" really require a computer vision algorithm?
January 15, 2026 at 6:01 PM
I still think that's a better introduction than your sophomore year track times, though
January 6, 2026 at 3:46 PM
Massachusetts state agencies, personally. The MBTA and DOT in particular only post alerts on Twitter!
January 2, 2026 at 7:08 PM
...I can see the vision of "with enough layers the model will figure out the optimal preprocessing itself!"

That said, I don't usually have that many layers, and if you already know preprocessing that more or less works then it's WAY more efficient to just do it!
December 31, 2025 at 7:04 PM
I will say, back when I was doing more ML work we often found that we could skip a lot of preprocessing steps and only have relatively small performance impacts. A long way downstream from raw sensor data, and it rarely saved enough cycles to be worth the hit. But still...

1/
December 31, 2025 at 7:02 PM
separately, I also remain an "AI-ready" skeptic -- I don't think it's reasonable to expect producers to maintain use case specific (rather than domain-specific) metadata and I think users have a responsibility to build their own cleaning and ingest pipes
December 31, 2025 at 6:25 PM
At AGU a speaker referred to ARD as being somehow less rigorous than "AI-ready data" and that's just simply incorrect! We've still got so many competing specifications for "AI-ready" data, while ARD means making your data rigorously interoperable!
December 31, 2025 at 6:23 PM
yeah sorry that's what I was trying to point at -- but I just woke up and did NOT make that very clear 😅
December 31, 2025 at 1:24 PM