Peter Boncz
peterabcz.bsky.social
Peter Boncz
@peterabcz.bsky.social
Professor Analytical Data Systems @cwi_da and @VUamsterdam.
researcher, systems architect, educator, entrepreneur
@duckdb.org 1.4.0 is feature-packed: MERGE INTO, compressed in-mem DBs, Iceberg writes..

PhD students also contributed:
- Laurens Kuiper: new k-way parallel mergesort duckdb.org/2025/09/24/sorting-again.html
- Lotte Felius @ccfelius.bsky.social: on-disk DB encryption
- Denis Hirn: materialized CTEs
September 26, 2025 at 12:55 PM
Laurens Kuiper from @duckdb.org presented DuckDB's new memory assignment policy to run multi-join pipelines out-of-core with gracious performance degradation when join hash tables increasingly do not fit in RAM.

A well-attended and -delivered talk!

paper: vldb.org/pvldb/vol18/p2748-kuiper.pdf
September 4, 2025 at 2:01 PM
Tobias Schmidt (TUM) @vldb.bsky.social at VLDB2025 presented SQLStorm, which uses LLMs to generate a huge amount of complex queries

SQLStorm now has 18K different complex queries and runs on a large real-world dataset (stackoverflow)

paper: vldb.org/pvldb/vol18/...
code: github.com/SQL-Storm/SQ...
September 4, 2025 at 1:55 PM
Very honored to receive the @vldb.bsky.social 2025 Test of Time Award for the Join Order Benchmark (JOB)

Kudos to my very talented TUM co-authors, specifically Viktor Leis who was the driving force & gave a great award talk.

paper: www.vldb.org/pvldb/vol18/p5531-viktor.pdf
JOB: event.cwi.nl/da/job
September 3, 2025 at 4:09 PM
Azim Afroozeh gave a great talk at @vldb.bsky.social VLDB2025 in London on the FastLanes file format.

FastLanes compresses 1.4x better than Parquet/snappy and allows 40x faster reads on the PublicBI dataset!

Paper: vldb.org/pvldb/vol18/p4629-afroozeh.pdf
Code: github.com/cwida/FastLanes
September 3, 2025 at 3:59 PM
@sigmod2025.bsky.social Berlin is a wrap. Many 🙏 to the organizers!

Next stop is @vldb.bsky.social London to present
- github.com/cwida/FastLanes v0.1 of a new big data format
- spilling multi-operator joins (via @duckdb.org)
- the SQLStorm benchmark of 30k LLM-generated complex queries (via TUM)
June 27, 2025 at 1:54 PM
Some pics of Leonardo Kuffo presenting his SIGMOD2025 paper on PDX.

PDX is a vertical layout that can accelerate vector search in principle in any vector index technique (it makes the distance calculation faster, using better SIMD + pruning).

ir.cwi.nl/pub/35044/3504…
github.com/cwida/P
DX
June 25, 2025 at 2:50 PM
And.. Azim Afroozeh put a lot of effort in open-sourcing the ALP floating point compressor (github.com/cwida/ALP). Leonardo Kuffo had written with him the SIGMOD2024 paper which now won a reproducibility award!

+ 🙏🙏 to the reproducibility and artifacts committee - this is a ton of work.
June 25, 2025 at 1:29 PM
But @cwi_da has no reason to complain, here in Berlin.

Leonardo Kuffo at the preceding DaMoN2025 workshop won the Best Paper Award for a study that showed that for vector databases, it matters a lot which AWS CPU you pick.

Congratulations to him!
June 25, 2025 at 1:26 PM
The opening talk of #systemsdistributed, organized by our friends @tigerbeetle.com in the Eye Film museum in Amsterdam, was given by @hannes.muehleisen.org of
@duckdb.org about:

DuckLake (ducklake.select)

and this was very well received

movie poster refers back to CIDR2025 😄
June 20, 2025 at 9:10 AM
CIDR2025 is a wrap!

Lived the many interesting papers & discussions, Gong Show, @duckdb reception..

ACM president Yannis Ioannidis gave an inspiring talk on open science.

Proceedings are in ACM DL & VLDB (see cidrdb.org).

🙏 all in+outside @cwi-amsterdam.bsky.social who helped organize!!
January 22, 2025 at 5:58 PM
In five days the CIDR2025 conference (cidrdb.org) will start, and we are expecting around 170 attendees from all over the world.

On an unrelated note, the exotic "goldeneye" duck was just spotted in The Netherlands!

See: bit.ly/duck-goldeneye
January 14, 2025 at 9:37 PM
Amsterdam is once again hosting my favorite event, the Conference on Innovative Data Systems (CIDR2025).

Check its exciting program: www.cidrdb.org/cidr2025/pro...

It will be held January 19-22 in the Amsterdam Mövenpick hotel.

Plan your trip quickly, because registration closes on Thursday!
December 16, 2024 at 8:51 PM
Happy to see @motherduck.com opening shop in my hometown Amsterdam: bit.ly/motherduck-a...

In reality, they have already been renting offices for 1.5 years close to the Database Architectures research group at CWI, but with a Dutch legal entity, and soon an own office, things are solidifying.
December 3, 2024 at 5:55 PM
Thanks Andy. That is a big honor and it was an honor to have you in Amsterdam. Enjoy the thxgiving break!
November 29, 2024 at 10:27 PM
But there's more! Last Friday, the day after the Dijkstra Award, it was our honor to host the 20th Dutch-Belgian Database Day (DBDBD2024) at CWI.

It was well-attended and lively. 🙏 to all who made it possible!

DBDBD2024 pictures are on cwida.github.io/dbdbd2024 (scroll down):
November 29, 2024 at 3:55 PM
Some pictures of the CWI lectures on database architecture, more here: cwida.github.io/dbdbd2024/di...

🙏 to Marcin Zukowski
+
@andypavlo.bsky.social
@madelonhulsebos.bsky.social
@hannes.muehleisen.org, Viktor Leis & Allison Lee for their contributions, but also to all who attended!
November 29, 2024 at 3:51 PM
At cwi.nl/en/events/dijkstra-awards/cwi-lectures-dijkstra-fellowship we awarded the Dijkstra Fellowship to Marcin Zukowski for his contributions to DB architecture in a full room & great speakers: Allison Lee, Viktor Leis, @andypavlo.bsky.social @hannes.muehleisen.org @madelonhulsebos.bsky.social
November 22, 2024 at 8:58 AM