In this case, the input is a URL-encoded form data with small keys and no values.
In this case, the input is a URL-encoded form data with small keys and no values.
A full fetch of the list takes a few hours, because the API scales pretty poorly. This is what the rate of discovery of new repositories looked like the first time I ran this:
A full fetch of the list takes a few hours, because the API scales pretty poorly. This is what the rate of discovery of new repositories looked like the first time I ran this:
On the tail end, there is a large amount of very small collections, most of them appearing spamish.
On the tail end, there is a large amount of very small collections, most of them appearing spamish.
In decreasing order of size: likes (a majority of the data), then posts, then reposts and finally follows.
In decreasing order of size: likes (a majority of the data), then posts, then reposts and finally follows.
Surprising given the amount of redundancy in the data (i.e. structs are stored as maps).
Surprising given the amount of redundancy in the data (i.e. structs are stored as maps).
Definitely some interesting outliers here.
Definitely some interesting outliers here.
You have to wait until the 97th centile to reach 1.04MB.
You have to wait until the 97th centile to reach 1.04MB.