Finn Völkel
fivo.bsky.social
Finn Völkel
@fivo.bsky.social
Programming, Databases, Distributed systems, Beachvolley. Currently working on @xtdb.com
In section "3.1.2 A batching optimization to reduce memory" of the paper you mentioned, you seem to restrict the size of the extensions `propose` returns, but this then means batches might be quite different in size.
December 16, 2025 at 4:31 PM
> benefit from imposing a limit on the tuples to produce at once (currently: 100M)
^from you post. When doing the joining of `a`, you don't know how many tuples you get if all variables have been unified. I was mainly wondering how you decide to cut your batches when being at the `a` level in GC.
December 16, 2025 at 4:14 PM
Thanks. I also "stole" the GC interface (giving credit of course) from you old post. www.frankmcsherry.org/dataflow/rel...
Worst-case optimal joins, in dataflow
I’ve gotten timely dataflow in Rust up and running exciting computations! I’m going to explain one that I think is especially cool, and that I’m going to tr...
www.frankmcsherry.org
December 16, 2025 at 4:05 PM
I meant (2.). If the variable order is (a,b,c), naive GJ will produce all a's before looking at b's, so I was wondering what you do in datatoad (or elsewhere) besides potential cardinality estimation.
December 16, 2025 at 4:03 PM
Hey. Love that you are blogging on WCOJ again. I was wondering how you deal with the limit on the tuples produced in GenericJoin. In Leapfrog this seems quite easy as you are producing the tuples one at a time at the bottom, but in GenericJoin you need (?) to take some decisions higher up or not?
December 16, 2025 at 8:40 AM
Reposted by Finn Völkel
The 3rd post in the WCOJ series. Implementing some logical datalog connectors in the context of GenericJoin. finnvolkel.com/wcoj-datalog...
WCOJ - Datalog and GenericJoin · Finn Völkel
finnvolkel.com
December 15, 2025 at 11:26 AM
The 3rd post in the WCOJ series. Implementing some logical datalog connectors in the context of GenericJoin. finnvolkel.com/wcoj-datalog...
WCOJ - Datalog and GenericJoin · Finn Völkel
finnvolkel.com
December 15, 2025 at 11:26 AM
Reposted by Finn Völkel
The second post in the series is on an actual implementation of a WCOJ algorithm. finnvolkel.com/wcoj-generic...
WCOJ - Generic Join · Finn Völkel
finnvolkel.com
December 11, 2025 at 10:04 AM
The second post in the series is on an actual implementation of a WCOJ algorithm. finnvolkel.com/wcoj-generic...
WCOJ - Generic Join · Finn Völkel
finnvolkel.com
December 11, 2025 at 10:04 AM
I know a bit old your posts. I also found this blog a really good resource for understanding WCOJ www.frankmcsherry.org/dataflow/rel.... It implements the one from arxiv.org/abs/1310.3314.
Worst-case optimal joins, in dataflow
I’ve gotten timely dataflow in Rust up and running exciting computations! I’m going to explain one that I think is especially cool, and that I’m going to tr...
www.frankmcsherry.org
October 6, 2025 at 1:13 PM
I confusingly said it would be next Wednesday. It's Wednesday in a week (23.7) from now.
July 14, 2025 at 7:25 AM
I found this to be a really good article harper.blog/2025/02/16/m...
My LLM codegen workflow atm
A detailed walkthrough of my current workflow for using LLms to build software, from brainstorming through planning and execution.
harper.blog
March 9, 2025 at 3:39 PM
Just wanted to mention that the links to the companies mentioned in the report do not work. Keep the great work up.
December 9, 2024 at 9:14 AM