Mehdi Ouazza
mehdio.com
Mehdi Ouazza
@mehdio.com
🧡 Data Engineer at heart
🥑 DevRel @motherduck.com (Cloud-based @duckdb.org)
🎥 YouTube : https://www.youtube.com/@mehdio
✒️ Blog http://mehdio.com/blog
There’s a big shift coming in dev tools: their main users will soon be AI agents, not humans.
Neon already sees 4x more DBs spun up by AIs. The question now:
How do you make your tool AI-consumable? MCP is one answer—but new platforms & integrations will matter too.
June 9, 2025 at 1:37 PM
Having more fun building a metadata registry for my personal website with all my content.

3 goal for this :

1. Bring public visibility to all my content (written / video) in one place
2. Make it easier to search through all of it (WIP)
June 7, 2025 at 11:12 AM
The best part of being active online is turning connections into real-life moments.
Just hosted a sold-out DATA+AI meetup in Zurich at Google!
June 4, 2025 at 12:19 PM
I just had an epiphany, what Iceberg, Delta Lake, and Duck Lake have in common?

Their logos are all blue. Like a lake...

You’re welcome.
June 3, 2025 at 2:59 PM
Since I teach #dataengineering, I figured my personal website should reflect that—it’s all about pipelines.

I’ve automated most of the publishing workflow so the site acts as a central metadata hub, no matter where I post content online.
May 16, 2025 at 11:30 AM
Today is a good day to be on Google Cloud.
April 8, 2025 at 11:48 AM
Be careful what features you wish for.
Before you know it, you’re stuck in an over-engineered data project.
Been there, done that—because hey, it’s fun.
April 3, 2025 at 2:40 PM
My goal has always been simple: to educate.

To lower the technical barrier. To make data more inclusive.

Even for ducks. 🦆
April 1, 2025 at 12:53 PM
You’re not a real data engineer until you’ve broken production
March 26, 2025 at 8:09 AM
I hereby declare today the official Data Engineer Day.
Say thank you to this unsung hero of your data team
Working in the shadows, always ready.
March 18, 2025 at 2:21 PM
Irony: build a tool to curate weekly data content online, and only to have your own content surfaced 🤦

Latest data subTLDR is live : www.subtldr.com/newsletters/...
March 16, 2025 at 7:42 PM
Me: “Hey AI, can you code this feature?”
AI: “Go learn it yourself, you lazy human.”
Turns out AI isn’t here to destroy humanity—it’s trying to save it.
March 13, 2025 at 10:47 AM
What if you could get a weekly TL;DR of your favorite subreddit—delivered straight to you?

That’s exactly what I’m building with subtldr.com! 🚀

The waiting list is open, and there’s a demo live on the site—check it out! 👆
March 9, 2025 at 8:49 PM
Today is one of those special days that reminds me why I should continue contributing to the tech community—regardless of my employer or whether it’s free.

After a few folks translated my content into Portuguese for the Brazilian community, some of my work is now being translated into Chinese!
March 4, 2025 at 2:44 PM
smallpond relies on ray and it distributes the workload at the partition level (manually specified). You have multiple of options for storage but the above benchmark is done on 3FS (DeepSeek's custom distributed storage framework)
February 28, 2025 at 4:02 PM
There are different levels of distributed compute compared to daft or apache spark
February 28, 2025 at 4:02 PM
Based on my highly sophisticated (and totally unbiased) survey, I can confidently conclude: clicking is painful.

Stay tuned for more #databs questions.
February 21, 2025 at 4:23 PM
According to the AWS Redshift team, 94% of workloads don’t exceed 10 TB.

I look at my 4 TB SSD and wonder:
Do we really need massive data centers for this?
Do we really need distributed compute?

Our idea of “big” #databs has changed, but most design decisions remain a decade old.
February 20, 2025 at 1:32 PM
62% of my network enjoys living on the edge—parsing CSVs without a schema.
31% have spent one too many nights debugging inconsistent delimiters.
7% are still waiting for their XML parser to finish loading.
February 12, 2025 at 1:50 PM
Normal people on weekend : do laundry, vacuum, wipe down surfaces, take out the trash, and tidy up clutter.
Me :
@duckdb.org
February 9, 2025 at 3:00 PM
Hey #duckdb folks, I'm currently looking for the next guests for Quack&Code (www.youtube.com/playlist?lis...)

If you have a nice DuckDB (and/or MotherDuck) story you want to share, feel free to quack in my DM!
February 7, 2025 at 2:54 PM
DuckCon 6 AMS @duckdb.org conference highlights
February 3, 2025 at 10:37 AM
Folks are being really creative to add Delta write support to @duckdb.org
January 31, 2025 at 4:21 PM
I'm already quacking in Amsterdam, who's coming to the DuckCon organized by @duckdb.org ? #databs
January 31, 2025 at 9:48 AM
Are GitHub stars just vanity metrics? Most of the time.

1️⃣ Stars ≠ users. I’ve starred projects but never used them—bad READMEs lose users.
2️⃣ Stars = bookmarks. I’ve starred & forgotten, yet the count stays.
3️⃣ Buzz ≠ growth. Stars can be hyped or gamed

For >1yr or 5k stars repo, they lose meaning.
January 9, 2025 at 3:09 PM