Ian Cook
ian.columnar.tech
Ian Cook
@ian.columnar.tech
I work on Apache Arrow and obsess about interoperability and performance in data analytics systems
Also:

Oct 20: @columnar.tech

Thanks @andypavlo.bsky.social for hosting this series!
September 18, 2025 at 3:27 AM
There are some great books in this vein by Joseph Stiglitz and David Graeber
December 30, 2024 at 1:53 PM
You might like to try Ibis: ibis-project.org. It was created by Wes McKinney (also the author of pandas) as a solution to some problems pandas couldn't solve well. It's had a lot of recent growth and improvement. There's a tutorial for users coming from dplyr: ibis-project.org/tutorials/ib...
November 19, 2024 at 4:29 AM
Yes—this came up for me because I’m playing with the Swift implementation of Apache Arrow, which is maintained in a directory of the Arrow monorepo, but Swift’s package manager assumes a 1:1 relationship of repo:package
November 14, 2024 at 2:14 PM
Pretty funny that the easiest way to do it was with an SVN command, until GitHub dropped support for that earlier this year.
November 14, 2024 at 1:53 AM
There’s some great news coming soon on this. Unfortunately I can’t share it publicly now, but DM me if you’re curious
November 3, 2024 at 6:58 PM
There’s also ADBC which aims to replace ODBC and JDBC in analytics applications with a much faster Arrow-based alternative.

DM me if you’re interested to discuss
October 30, 2024 at 6:45 PM
Arrow IPC stream format works great for sending over HTTP APIs. There’s also Arrow Flight which is a framework for sending Arrow IPC data through RPC APIs.

Arrow is much more efficient than JSON for data transport in OLAP applications because you avoid transposing columns to and from rows.
October 30, 2024 at 6:39 PM