Erald David
eraldds.bsky.social
Analytics @BukuWarung
Avid reader. Random book quotes one at a time
Flipping lower order bit
January 10, 2025 at 3:56 PM
I love how every article about analytics keeps screaming "Raise your technical skills. Learn platform!" but the OG book from Kimball says (in the first 20 pages of the book): "Yeah no, you're a hybrid DBA and MBA"
January 8, 2025 at 5:17 PM
Unexplored Topic: Most startups (at least in Indonesia) currently take a Business-Market Fit-first approach, keeping all processes manual until they generate revenue, before committing precious resources (like engineering bandwidth) to properly productize their solutions.
December 18, 2024 at 4:10 AM
Reposted by Erald David
Last post of the year! S3, Parquet, Iceberg, and @duckdb.org are a great way to get customers their data.
S3 Is the New SFTP
Customers want their data. A customer data lake is a great way to give it to them.
materializedview.io
December 16, 2024 at 4:10 PM
Reposted by Erald David
Book Club is happening - find all the details here: jennajordan.me/book-club/

#databs #datasky
December 15, 2024 at 1:05 AM
Reposted by Erald David
The first book would have to be Data & Reality by Bill Kent (2nd ed of course).

After that there are many potential options but I’ve wanted to have a book club for Data & Reality for so long that it has to be the first one. We could call it the “Data Philosophizing” book club or something.
I want to make this happen (seriously the idea of lots of ppl reading books I wanna read and then talking about it together is like… the dream!) but we are coming up on holiday season so the earliest it would happen would be January. Maybe async convos here and biweekly video calls… sound like fun?
Apparently I’m supposed to lead a book club now 🤷‍♀️

I mean, twist my arm, it’s not like I’ve ever thought about that before or have a whole list of books to start with…
November 5, 2024 at 5:59 PM
Reposted by Erald David
For my last class this semester, I tried to cram our Advanced Database course into one lecture. We cover the following database systems in 60min: youtu.be/fr5lIchF6pw
• Google Dremel / BigQuery
• Snowflake
• Amazon Redshift
• Yellowbrick
• Databricks Photon
• @duckdb.org
• TabDB
#25 - BigQuery + Snowflake + Redshift + Databricks + DuckDB (CMU Intro to Database Systems)
YouTube video by CMU Database Group
youtu.be
December 5, 2024 at 10:39 PM
Reposted by Erald David
Fantastic opportunity for ppl in #mlsky #databs #stats etc to write their own perspectives on how machine learning and engineering have changed over the course of their careers (see original link in the thread), especially last 5 years. Would love to read all of these posts!
The post refers to the 2015-era deep learning architecture days as classical. Would like to see a wider perspective from people who started in the eighties before the statistical revolution. What did rule-based folks think? What about people working with non-parametric probabilistic models in the 2000s?
December 6, 2024 at 1:55 PM
I just completed "Historian Hysteria" - Day 1 - Advent of Code 2024 #AdventOfCode adventofcode.com/2024/day/1
December 1, 2024 at 11:01 AM
It's crazy to think that Chicago became the center of modern finance because of its geographic advantage and... cows?

What analogy fits this case?

Off the top of my head: it's like learning that your local IKEA becomes a 3-Michelin-star restaurant in the future... because they can cook meatballs
November 29, 2024 at 8:08 PM
Reposted by Erald David
FYI, here's the entire code to create a dataset of every single bsky message in real time:

```
from atproto import FirehoseSubscribeReposClient, parse_subscribe_repos_message

def f(m):  # called once per firehose frame
    print(m.header, parse_subscribe_repos_message(m))

FirehoseSubscribeReposClient().start(f)
```
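To actually keep a dataset rather than print it, the callback could append each message to a file. A minimal sketch, assuming the `atproto` Python SDK is installed; `make_jsonl_handler`, `run`, and the file name `firehose.jsonl` are hypothetical names for illustration, and the `repo` and `seq` fields are assumed to be present on commit events (other event types may lack them, so they fall back to `None`):

```python
import json
from typing import Any, Callable, TextIO


def make_jsonl_handler(out: TextIO, parse: Callable[[Any], Any]) -> Callable[[Any], None]:
    """Return a firehose callback that appends one JSON line per message."""

    def handler(message: Any) -> None:
        event = parse(message)
        # Keep only a few scalar fields. getattr with a default avoids
        # failing on event types that do not carry them (assumed field names).
        row = {
            "type": type(event).__name__,
            "repo": getattr(event, "repo", None),
            "seq": getattr(event, "seq", None),
        }
        out.write(json.dumps(row) + "\n")

    return handler


def run(path: str = "firehose.jsonl") -> None:
    """Stream the firehose into a JSONL file until interrupted."""
    # Imported here so the helper above is usable without the SDK installed.
    from atproto import FirehoseSubscribeReposClient, parse_subscribe_repos_message

    with open(path, "a", encoding="utf-8") as out:
        FirehoseSubscribeReposClient().start(
            make_jsonl_handler(out, parse_subscribe_repos_message)
        )
```

Calling `run()` blocks and keeps appending rows; stop it with Ctrl+C. Passing the parser in as an argument keeps the file-writing logic testable without a network connection.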
November 28, 2024 at 9:56 AM
Woahh idk about this. Very useful
Different classes of end-user programming, table from: web.media.mit.edu/~lieber/Publ...
November 29, 2024 at 2:14 PM
Reposted by Erald David
I recently shared some of my reflections on how to use probabilistic classifiers for optimal decision-making under uncertainty at @pydataparis.bsky.social 2024.

Here is the recording of the presentation:

www.youtube.com/watch?v=-gYn...
November 27, 2024 at 2:17 PM
Reposted by Erald David
Interesting critique of the problems with peer review. Unfortunately it fails to offer a proposal for what might be better. I'm reminded of the old adage “democracy is the worst form of government, except for all the others.” Likewise for peer review. www.experimental-history.com/p/the-rise-a...
The rise and fall of peer review
Why the greatest scientific experiment in history failed, and why that's a great thing
www.experimental-history.com
November 26, 2024 at 11:11 AM
Reposted by Erald David
Finished reading the book, last two chapters on the philosophical aspects of technology worship are great, and the material provides vivid conceptual clarity on bubble dynamics. However, overall, the book could have been much shorter.
Boom time!
November 26, 2024 at 5:36 PM
Reposted by Erald David
I'm glad @hf.co is doing this. It brings down the barriers to allow more people to benefit from AI, rather than keeping it exclusively in the realm of deep pocketed giant companies.

AI can help open the gates, to allow regular people to do things they couldn't do before. (Which can be threatening!)
November 27, 2024 at 12:53 AM
The number of banger quotes in 'Boom' by Byrne Hobart is staggering

A few of my favorites (a note to myself)
November 24, 2024 at 8:14 AM
Reposted by Erald David
One thing I’ve learnt about The Practice is that it takes time for other controllable levers to show up — and sometimes you’re limited by how rich your understanding of your causal model of your process is.

This is limited by experience, instrumentation, and creativity.
Just listened to @cedricchin.bsky.social on the Analytics Engineering pod with Tristan (roundup.getdbt.com/p/data-as-an...)

I have some thoughts and it basically is about heuristics. As a now content marketer (ish?), I'm thinking a lot about The Practice: the core loop that you do to get better.
November 21, 2024 at 8:57 PM