Mehdi Ouazza
@mehdio.com
🧡 Data Engineer at heart
🥑 DevRel @motherduck.com (Cloud-based @duckdb.org)
🎥 YouTube : https://www.youtube.com/@mehdio
✒️ Blog http://mehdio.com/blog
🥑 DevRel @motherduck.com (Cloud-based @duckdb.org)
🎥 YouTube : https://www.youtube.com/@mehdio
✒️ Blog http://mehdio.com/blog
There’s a big shift coming in dev tools: their main users will soon be AI agents, not humans.
Neon already sees 4x more DBs spun up by AIs. The question now:
How do you make your tool AI-consumable? MCP is one answer—but new platforms & integrations will matter too.
Neon already sees 4x more DBs spun up by AIs. The question now:
How do you make your tool AI-consumable? MCP is one answer—but new platforms & integrations will matter too.
June 9, 2025 at 1:37 PM
There’s a big shift coming in dev tools: their main users will soon be AI agents, not humans.
Neon already sees 4x more DBs spun up by AIs. The question now:
How do you make your tool AI-consumable? MCP is one answer—but new platforms & integrations will matter too.
Neon already sees 4x more DBs spun up by AIs. The question now:
How do you make your tool AI-consumable? MCP is one answer—but new platforms & integrations will matter too.
Having more fun building a metadata registry for my personal website with all my content.
3 goal for this :
1. Bring public visibility to all my content (written / video) in one place
2. Make it easier to search through all of it (WIP)
3 goal for this :
1. Bring public visibility to all my content (written / video) in one place
2. Make it easier to search through all of it (WIP)
June 7, 2025 at 11:12 AM
Having more fun building a metadata registry for my personal website with all my content.
3 goal for this :
1. Bring public visibility to all my content (written / video) in one place
2. Make it easier to search through all of it (WIP)
3 goal for this :
1. Bring public visibility to all my content (written / video) in one place
2. Make it easier to search through all of it (WIP)
The best part of being active online is turning connections into real-life moments.
Just hosted a sold-out DATA+AI meetup in Zurich at Google!
Just hosted a sold-out DATA+AI meetup in Zurich at Google!
June 4, 2025 at 12:19 PM
The best part of being active online is turning connections into real-life moments.
Just hosted a sold-out DATA+AI meetup in Zurich at Google!
Just hosted a sold-out DATA+AI meetup in Zurich at Google!
I just had an epiphany, what Iceberg, Delta Lake, and Duck Lake have in common?
Their logos are all blue. Like a lake...
You’re welcome.
Their logos are all blue. Like a lake...
You’re welcome.
June 3, 2025 at 2:59 PM
I just had an epiphany, what Iceberg, Delta Lake, and Duck Lake have in common?
Their logos are all blue. Like a lake...
You’re welcome.
Their logos are all blue. Like a lake...
You’re welcome.
Since I teach #dataengineering, I figured my personal website should reflect that—it’s all about pipelines.
I’ve automated most of the publishing workflow so the site acts as a central metadata hub, no matter where I post content online.
I’ve automated most of the publishing workflow so the site acts as a central metadata hub, no matter where I post content online.
May 16, 2025 at 11:30 AM
Since I teach #dataengineering, I figured my personal website should reflect that—it’s all about pipelines.
I’ve automated most of the publishing workflow so the site acts as a central metadata hub, no matter where I post content online.
I’ve automated most of the publishing workflow so the site acts as a central metadata hub, no matter where I post content online.
Today is a good day to be on Google Cloud.
April 8, 2025 at 11:48 AM
Today is a good day to be on Google Cloud.
Be careful what features you wish for.
Before you know it, you’re stuck in an over-engineered data project.
Been there, done that—because hey, it’s fun.
Before you know it, you’re stuck in an over-engineered data project.
Been there, done that—because hey, it’s fun.
April 3, 2025 at 2:40 PM
Be careful what features you wish for.
Before you know it, you’re stuck in an over-engineered data project.
Been there, done that—because hey, it’s fun.
Before you know it, you’re stuck in an over-engineered data project.
Been there, done that—because hey, it’s fun.
My goal has always been simple: to educate.
To lower the technical barrier. To make data more inclusive.
Even for ducks. 🦆
To lower the technical barrier. To make data more inclusive.
Even for ducks. 🦆
April 1, 2025 at 12:53 PM
My goal has always been simple: to educate.
To lower the technical barrier. To make data more inclusive.
Even for ducks. 🦆
To lower the technical barrier. To make data more inclusive.
Even for ducks. 🦆
You’re not a real data engineer until you’ve broken production
March 26, 2025 at 8:09 AM
You’re not a real data engineer until you’ve broken production
I hereby declare today the official Data Engineer Day.
Say thank you to this unsung hero of your data team
Working in the shadows, always ready.
Say thank you to this unsung hero of your data team
Working in the shadows, always ready.
March 18, 2025 at 2:21 PM
I hereby declare today the official Data Engineer Day.
Say thank you to this unsung hero of your data team
Working in the shadows, always ready.
Say thank you to this unsung hero of your data team
Working in the shadows, always ready.
Irony: build a tool to curate weekly data content online, and only to have your own content surfaced 🤦
Latest data subTLDR is live : www.subtldr.com/newsletters/...
Latest data subTLDR is live : www.subtldr.com/newsletters/...
March 16, 2025 at 7:42 PM
Irony: build a tool to curate weekly data content online, and only to have your own content surfaced 🤦
Latest data subTLDR is live : www.subtldr.com/newsletters/...
Latest data subTLDR is live : www.subtldr.com/newsletters/...
Me: “Hey AI, can you code this feature?”
AI: “Go learn it yourself, you lazy human.”
Turns out AI isn’t here to destroy humanity—it’s trying to save it.
AI: “Go learn it yourself, you lazy human.”
Turns out AI isn’t here to destroy humanity—it’s trying to save it.
March 13, 2025 at 10:47 AM
Me: “Hey AI, can you code this feature?”
AI: “Go learn it yourself, you lazy human.”
Turns out AI isn’t here to destroy humanity—it’s trying to save it.
AI: “Go learn it yourself, you lazy human.”
Turns out AI isn’t here to destroy humanity—it’s trying to save it.
What if you could get a weekly TL;DR of your favorite subreddit—delivered straight to you?
That’s exactly what I’m building with subtldr.com! 🚀
The waiting list is open, and there’s a demo live on the site—check it out! 👆
That’s exactly what I’m building with subtldr.com! 🚀
The waiting list is open, and there’s a demo live on the site—check it out! 👆
March 9, 2025 at 8:49 PM
What if you could get a weekly TL;DR of your favorite subreddit—delivered straight to you?
That’s exactly what I’m building with subtldr.com! 🚀
The waiting list is open, and there’s a demo live on the site—check it out! 👆
That’s exactly what I’m building with subtldr.com! 🚀
The waiting list is open, and there’s a demo live on the site—check it out! 👆
Today is one of those special days that reminds me why I should continue contributing to the tech community—regardless of my employer or whether it’s free.
After a few folks translated my content into Portuguese for the Brazilian community, some of my work is now being translated into Chinese!
After a few folks translated my content into Portuguese for the Brazilian community, some of my work is now being translated into Chinese!
March 4, 2025 at 2:44 PM
Today is one of those special days that reminds me why I should continue contributing to the tech community—regardless of my employer or whether it’s free.
After a few folks translated my content into Portuguese for the Brazilian community, some of my work is now being translated into Chinese!
After a few folks translated my content into Portuguese for the Brazilian community, some of my work is now being translated into Chinese!
smallpond relies on ray and it distributes the workload at the partition level (manually specified). You have multiple of options for storage but the above benchmark is done on 3FS (DeepSeek's custom distributed storage framework)
February 28, 2025 at 4:02 PM
smallpond relies on ray and it distributes the workload at the partition level (manually specified). You have multiple of options for storage but the above benchmark is done on 3FS (DeepSeek's custom distributed storage framework)
There are different levels of distributed compute compared to daft or apache spark
February 28, 2025 at 4:02 PM
There are different levels of distributed compute compared to daft or apache spark
Based on my highly sophisticated (and totally unbiased) survey, I can confidently conclude: clicking is painful.
Stay tuned for more #databs questions.
Stay tuned for more #databs questions.
February 21, 2025 at 4:23 PM
Based on my highly sophisticated (and totally unbiased) survey, I can confidently conclude: clicking is painful.
Stay tuned for more #databs questions.
Stay tuned for more #databs questions.
According to the AWS Redshift team, 94% of workloads don’t exceed 10 TB.
I look at my 4 TB SSD and wonder:
Do we really need massive data centers for this?
Do we really need distributed compute?
Our idea of “big” #databs has changed, but most design decisions remain a decade old.
I look at my 4 TB SSD and wonder:
Do we really need massive data centers for this?
Do we really need distributed compute?
Our idea of “big” #databs has changed, but most design decisions remain a decade old.
February 20, 2025 at 1:32 PM
According to the AWS Redshift team, 94% of workloads don’t exceed 10 TB.
I look at my 4 TB SSD and wonder:
Do we really need massive data centers for this?
Do we really need distributed compute?
Our idea of “big” #databs has changed, but most design decisions remain a decade old.
I look at my 4 TB SSD and wonder:
Do we really need massive data centers for this?
Do we really need distributed compute?
Our idea of “big” #databs has changed, but most design decisions remain a decade old.
62% of my network enjoys living on the edge—parsing CSVs without a schema.
31% have spent one too many nights debugging inconsistent delimiters.
7% are still waiting for their XML parser to finish loading.
31% have spent one too many nights debugging inconsistent delimiters.
7% are still waiting for their XML parser to finish loading.
February 12, 2025 at 1:50 PM
62% of my network enjoys living on the edge—parsing CSVs without a schema.
31% have spent one too many nights debugging inconsistent delimiters.
7% are still waiting for their XML parser to finish loading.
31% have spent one too many nights debugging inconsistent delimiters.
7% are still waiting for their XML parser to finish loading.
Normal people on weekend : do laundry, vacuum, wipe down surfaces, take out the trash, and tidy up clutter.
Me :
@duckdb.org
Me :
@duckdb.org
February 9, 2025 at 3:00 PM
Normal people on weekend : do laundry, vacuum, wipe down surfaces, take out the trash, and tidy up clutter.
Me :
@duckdb.org
Me :
@duckdb.org
Hey #duckdb folks, I'm currently looking for the next guests for Quack&Code (www.youtube.com/playlist?lis...)
If you have a nice DuckDB (and/or MotherDuck) story you want to share, feel free to quack in my DM!
If you have a nice DuckDB (and/or MotherDuck) story you want to share, feel free to quack in my DM!
February 7, 2025 at 2:54 PM
Hey #duckdb folks, I'm currently looking for the next guests for Quack&Code (www.youtube.com/playlist?lis...)
If you have a nice DuckDB (and/or MotherDuck) story you want to share, feel free to quack in my DM!
If you have a nice DuckDB (and/or MotherDuck) story you want to share, feel free to quack in my DM!
DuckCon 6 AMS @duckdb.org conference highlights
February 3, 2025 at 10:37 AM
DuckCon 6 AMS @duckdb.org conference highlights
Folks are being really creative to add Delta write support to @duckdb.org
January 31, 2025 at 4:21 PM
Folks are being really creative to add Delta write support to @duckdb.org
I'm already quacking in Amsterdam, who's coming to the DuckCon organized by @duckdb.org ? #databs
January 31, 2025 at 9:48 AM
I'm already quacking in Amsterdam, who's coming to the DuckCon organized by @duckdb.org ? #databs
Are GitHub stars just vanity metrics? Most of the time.
1️⃣ Stars ≠ users. I’ve starred projects but never used them—bad READMEs lose users.
2️⃣ Stars = bookmarks. I’ve starred & forgotten, yet the count stays.
3️⃣ Buzz ≠ growth. Stars can be hyped or gamed
For >1yr or 5k stars repo, they lose meaning.
1️⃣ Stars ≠ users. I’ve starred projects but never used them—bad READMEs lose users.
2️⃣ Stars = bookmarks. I’ve starred & forgotten, yet the count stays.
3️⃣ Buzz ≠ growth. Stars can be hyped or gamed
For >1yr or 5k stars repo, they lose meaning.
January 9, 2025 at 3:09 PM
Are GitHub stars just vanity metrics? Most of the time.
1️⃣ Stars ≠ users. I’ve starred projects but never used them—bad READMEs lose users.
2️⃣ Stars = bookmarks. I’ve starred & forgotten, yet the count stays.
3️⃣ Buzz ≠ growth. Stars can be hyped or gamed
For >1yr or 5k stars repo, they lose meaning.
1️⃣ Stars ≠ users. I’ve starred projects but never used them—bad READMEs lose users.
2️⃣ Stars = bookmarks. I’ve starred & forgotten, yet the count stays.
3️⃣ Buzz ≠ growth. Stars can be hyped or gamed
For >1yr or 5k stars repo, they lose meaning.