Julien Le Dem
@julien.ledem.net
Principal Engineer, Founder, Angel, Advisor, OSS.
LFAI&data: OpenLineage, Marquez, ASF: Parquet, Arrow, Iceberg, 🐖
he/him.
Me: https://julien.ledem.net/
Blog: https://sympathetic.ink
LFAI&data: OpenLineage, Marquez, ASF: Parquet, Arrow, Iceberg, 🐖
he/him.
Me: https://julien.ledem.net/
Blog: https://sympathetic.ink
Good evening golden gate.
November 9, 2025 at 1:08 AM
Good evening golden gate.
Higher latency but higher throughput on improving the overall data ecosystem.
"if you want to go fast, go alone; If you want to go far, go together"
New Apache Parquet Community page is up: parquet.apache.org/community/
New Apache Parquet Community page is up: parquet.apache.org/community/
November 7, 2025 at 9:17 PM
Higher latency but higher throughput on improving the overall data ecosystem.
Reposted by Julien Le Dem
"if you want to go fast, go alone; If you want to go far, go together"
New Apache Parquet Community page is up: parquet.apache.org/community/
New Apache Parquet Community page is up: parquet.apache.org/community/
November 7, 2025 at 8:06 PM
"if you want to go fast, go alone; If you want to go far, go together"
New Apache Parquet Community page is up: parquet.apache.org/community/
New Apache Parquet Community page is up: parquet.apache.org/community/
Reposted by Julien Le Dem
The future of data connectivity is columnar. Today we launched
@columnar.tech to accelerate the shift from slow, row-oriented APIs like ODBC and JDBC to >10x faster alternatives powered by @arrow.apache.org. Learn more 👇
@columnar.tech to accelerate the shift from slow, row-oriented APIs like ODBC and JDBC to >10x faster alternatives powered by @arrow.apache.org. Learn more 👇
Announcing Columnar
Back to the future of data connectivity
columnar.tech
October 29, 2025 at 10:51 PM
The future of data connectivity is columnar. Today we launched
@columnar.tech to accelerate the shift from slow, row-oriented APIs like ODBC and JDBC to >10x faster alternatives powered by @arrow.apache.org. Learn more 👇
@columnar.tech to accelerate the shift from slow, row-oriented APIs like ODBC and JDBC to >10x faster alternatives powered by @arrow.apache.org. Learn more 👇
Experimenting with the laser cutter to make 3d objects.
November 3, 2025 at 12:21 AM
Experimenting with the laser cutter to make 3d objects.
This just went in the oven
October 31, 2025 at 12:40 AM
This just went in the oven
If you missed my talk: "Data Observability and OpenLineage" at the Datadog summit in SF, here is your chance to catch up on the recording.
www.youtube.com/watch?v=uhNo...
www.youtube.com/watch?v=uhNo...
Data Observability and OpenLineage
YouTube video by Datadog
www.youtube.com
October 28, 2025 at 10:30 PM
If you missed my talk: "Data Observability and OpenLineage" at the Datadog summit in SF, here is your chance to catch up on the recording.
www.youtube.com/watch?v=uhNo...
www.youtube.com/watch?v=uhNo...
Great time seeing Garbage at the Warfield tonight.
October 25, 2025 at 5:52 AM
Great time seeing Garbage at the Warfield tonight.
I'm speaking tomorrow at "Embracing AI in Open Source"
If you've been wondering why we see a flurry of new columnar formats, come see me present "Column Storage for the AI Era" I'll talk about what has changed, new advances in data encoding and how that's pushing Parquet to evolve.
luma.com/pxikwty3
If you've been wondering why we see a flurry of new columnar formats, come see me present "Column Storage for the AI Era" I'll talk about what has changed, new advances in data encoding and how that's pushing Parquet to evolve.
luma.com/pxikwty3
Next-Gen Data Engineering: Embracing AI in Open Source · Luma
Next-Gen Data Engineering: Embracing AI in Open Source
Join us October 23rd at the Silicon Valley AI Hub in Snowflake’s Menlo Park campus for an evening…
luma.com
October 22, 2025 at 9:47 PM
I'm speaking tomorrow at "Embracing AI in Open Source"
If you've been wondering why we see a flurry of new columnar formats, come see me present "Column Storage for the AI Era" I'll talk about what has changed, new advances in data encoding and how that's pushing Parquet to evolve.
luma.com/pxikwty3
If you've been wondering why we see a flurry of new columnar formats, come see me present "Column Storage for the AI Era" I'll talk about what has changed, new advances in data encoding and how that's pushing Parquet to evolve.
luma.com/pxikwty3
Who’s solving the Louvre robbery?
Right answers only
Right answers only
October 20, 2025 at 5:12 AM
Who’s solving the Louvre robbery?
Right answers only
Right answers only
First attempt at making prints with the laser cutter. I don’t know what I’m doing but happy to have made something.
October 19, 2025 at 8:28 PM
First attempt at making prints with the laser cutter. I don’t know what I’m doing but happy to have made something.
And then he said:
« Bring me to your leader! »
« Bring me to your leader! »
October 17, 2025 at 1:29 AM
And then he said:
« Bring me to your leader! »
« Bring me to your leader! »
Felt it
The epicenter of the quake, which was reported at 9:23 a.m., was in the center of campus, according to preliminary information from the U.S. Geological Survey.
Magnitude 3.1 earthquake shakes UC Berkeley campus an hour before planned quake drill
The epicenter of the quake, which was reported at 9:23 a.m., was in the center of campus, according to preliminary information from the U.S. Geological Survey.
www.berkeleyside.org
October 16, 2025 at 4:45 PM
Felt it
Reposted by Julien Le Dem
"It is not 100% clear to me how a new file format (or three) will drive additional ecosystem adoption :thinking:"
However, I absolutely think this adds to the pressure for Parquet to evolve.
Speaking of, anyone interested in helping add new encodings to parquet?
lists.apache.org/thread/djnbb...
However, I absolutely think this adds to the pressure for Parquet to evolve.
Speaking of, anyone interested in helping add new encodings to parquet?
lists.apache.org/thread/djnbb...
October 1, 2025 at 7:21 PM
"It is not 100% clear to me how a new file format (or three) will drive additional ecosystem adoption :thinking:"
However, I absolutely think this adds to the pressure for Parquet to evolve.
Speaking of, anyone interested in helping add new encodings to parquet?
lists.apache.org/thread/djnbb...
However, I absolutely think this adds to the pressure for Parquet to evolve.
Speaking of, anyone interested in helping add new encodings to parquet?
lists.apache.org/thread/djnbb...
"Why Datadog Chose Airflow 3: Multi-Tenancy, Observability, and the Future of Event-Driven Workflows"
Zach and I will be talking about how Datadog adopted Airflow 3 at the Airflow summit next week. Come say hi!
airflowsummit.org/sessions/202...
Zach and I will be talking about how Datadog adopted Airflow 3 at the Airflow summit next week. Come say hi!
airflowsummit.org/sessions/202...
October 1, 2025 at 3:39 PM
"Why Datadog Chose Airflow 3: Multi-Tenancy, Observability, and the Future of Event-Driven Workflows"
Zach and I will be talking about how Datadog adopted Airflow 3 at the Airflow summit next week. Come say hi!
airflowsummit.org/sessions/202...
Zach and I will be talking about how Datadog adopted Airflow 3 at the Airflow summit next week. Come say hi!
airflowsummit.org/sessions/202...
Columnar file formats are hot!
Our SIGMOD paper with our friends at Tsinghua + @wesmckinney.com + @pateljm.bsky.social on creating a next generation open-source data file format is out. F3 is a future-proof file format avoids the mistakes of Parquet.
📄 Paper: db.cs.cmu.edu/papers/2025/...
📁 Code: github.com/future-file-...
📄 Paper: db.cs.cmu.edu/papers/2025/...
📁 Code: github.com/future-file-...
October 1, 2025 at 3:08 PM
Columnar file formats are hot!
I'm trying to understand a bit better real life deployments of open source Clickhouse.
If you're using it, what does your deployment look like?
If you're using it, what does your deployment look like?
September 23, 2025 at 11:05 PM
I'm trying to understand a bit better real life deployments of open source Clickhouse.
If you're using it, what does your deployment look like?
If you're using it, what does your deployment look like?
Reposted by Julien Le Dem
Berkeley just experienced a small earthquake.
Check USGS for the official earthquake magnitude: earthquake.usgs.gov/earthquakes/...
Remember to drop, cover, and hold on during earthquake shaking.
Check USGS for the official earthquake magnitude: earthquake.usgs.gov/earthquakes/...
Remember to drop, cover, and hold on during earthquake shaking.
Latest Earthquakes
earthquake.usgs.gov
September 22, 2025 at 10:03 AM
Berkeley just experienced a small earthquake.
Check USGS for the official earthquake magnitude: earthquake.usgs.gov/earthquakes/...
Remember to drop, cover, and hold on during earthquake shaking.
Check USGS for the official earthquake magnitude: earthquake.usgs.gov/earthquakes/...
Remember to drop, cover, and hold on during earthquake shaking.
Woken up by the earth quake!
September 22, 2025 at 9:59 AM
Woken up by the earth quake!
Catching up in person at the Community over Code conference.
Nice to see you all!
Nice to see you all!
September 12, 2025 at 5:46 PM
Catching up in person at the Community over Code conference.
Nice to see you all!
Nice to see you all!
I’ll be at the Community over Code Conference in Minneapolis on Thursday and Friday. Come say hi if you’re around.
I’m speaking about the deconstructed database Thursday at noon.
communityovercode.org/schedule/
I’m speaking about the deconstructed database Thursday at noon.
communityovercode.org/schedule/
Sessions Schedule
Please note: All session times are in Central Daylight Time (UTC -5). You must be registered for Community Over Code NA 2025 to participate in the sessions. If you have not yet registered but would…
communityovercode.org
September 10, 2025 at 3:06 AM
I’ll be at the Community over Code Conference in Minneapolis on Thursday and Friday. Come say hi if you’re around.
I’m speaking about the deconstructed database Thursday at noon.
communityovercode.org/schedule/
I’m speaking about the deconstructed database Thursday at noon.
communityovercode.org/schedule/
A cool blog post by Qi Zhu, Jigao Luo and @andrewlamb1111.bsky.social on embedding custom indices in Parquet files while staying compatible with the standard.
datafusion.apache.org/blog/2025/07...
datafusion.apache.org/blog/2025/07...
Embedding User-Defined Indexes in Apache Parquet Files - Apache DataFusion Blog
datafusion.apache.org
July 17, 2025 at 12:59 AM
A cool blog post by Qi Zhu, Jigao Luo and @andrewlamb1111.bsky.social on embedding custom indices in Parquet files while staying compatible with the standard.
datafusion.apache.org/blog/2025/07...
datafusion.apache.org/blog/2025/07...