Arik Friedman
arikf.net
Arik Friedman
@arikf.net
Data Scientist @ Atlassian
Spent some time looking into software teams' effectiveness, and how software teams can use data to reflect and improve how they work. These days I'm trying to learn more about Statistical Process Control.
Pinned
Hi, I'm Arik. I'm a data scientist at Atlassian, living in Sydney, Australia. I did some software engineering and product management in the past, but a PhD on Privacy Preserving Data Mining eventually led me to work with data.
Oh, that's mean! 🤔
when i think about all the important moments in my life,
and try to guess which will flash before my eyes at the end,
my best guess is that it will be kurtosis🤔
October 14, 2025 at 12:03 PM
Reposted by Arik Friedman
I was tagged the other day by someone kindly sharing my Python Rgonomics post and realized that my thesis of "tooling keeps improving" held up too well and some of the recos have changed

Hence, here's the 2025 update: www.emilyriederer.com/post/py-rgo-...

Now feat uv (vs pyenv, pdm) and Positron
Python Rgonomics - 2025 Update | Emily Riederer
Switching languages is about switching mindsets - not just syntax. New developments in python data science toolings, like polars and seaborn’s object interface, can capture the ‘feel’ that converts fr...
www.emilyriederer.com
January 27, 2025 at 3:12 AM
Interviewer: can you explain this gap in your resume?

Data scientist: it's a confidence interval.
Interviewer: Can you explain this gap in your resume?

DevOps: It is running on us-east-1
Interviewer: Can you explain this gap in your resume?

Database Admin: That's a normal maintenance window
December 21, 2024 at 1:09 AM
Reposted by Arik Friedman
KNN + topic detection getting a big glow-up www.anthropic.com/research/clio
Clio: Privacy-preserving insights into real-world AI use
A blog post describing Anthropic’s new system, Clio, for analyzing how people use AI while maintaining their privacy
www.anthropic.com
December 13, 2024 at 12:06 PM
Spent hours yesterday to get a SQL cell in a databricks job use a variable that was set in a prior python cell. Amazing how something that seems so trivial can be so convoluted. Is that what "business impact" looks like?
November 30, 2024 at 1:35 AM
Reposted by Arik Friedman
November 23, 2024 at 1:07 PM
Reposted by Arik Friedman
Don't Do This - PostgreSQL wiki
wiki.postgresql.org
November 22, 2024 at 2:43 PM
Great stuff, and an excuse to recommend the book Storytelling with Data by Cole Nussbaumer Knaflic for anyone who wants to learn more on this.
Trying something new:
A 🧵 on a topic I find many students struggle with: "why do their 📊 look more professional than my 📊?"

It's *lots* of tiny decisions that aren't the defaults in many libraries, so let's break down 1 simple graph by @jburnmurdoch.bsky.social

🔗 www.ft.com/content/73a1...
November 21, 2024 at 2:41 AM
Reposted by Arik Friedman
One of my favourite pieces on the role of the data analyst comes from @rdpeng.org , who in turn quotes Tukey's "The Future of Data Analysis" from 1962 (!)
simplystatistics.org/posts/2019-0...
#databs
November 16, 2024 at 10:01 PM
Reposted by Arik Friedman
one of the many things to like about atproto is that everything is authenticated and public

you don't have to trust bluesky, you can read from the relay yourself

it's not an API... more like a spinal tap into the central feed

and this is an amazing example of what you can do with that power
Sky Zoo
Stats on Bluesky, At Protocol, ...
skyzoo.blue
November 16, 2024 at 9:16 PM
One of my favourite pieces on the role of the data analyst comes from @rdpeng.org , who in turn quotes Tukey's "The Future of Data Analysis" from 1962 (!)
simplystatistics.org/posts/2019-0...
#databs
November 16, 2024 at 10:01 PM
Reposted by Arik Friedman
A number of artists and creators have made their home on Bluesky, and we hear their concerns with other platforms training on their data. We do not use any of your content to train generative AI, and have no intention of doing so.
November 15, 2024 at 5:22 PM
Reposted by Arik Friedman
This is probably the one of the clearest explanations I’ve seen of zero-shot, etc

arxiv.org/abs/2309.024...
November 15, 2024 at 3:37 PM
Can confirm
I feel like this site's demographics can be described as "still mad about the loss of Google Reader"
November 12, 2024 at 8:42 PM
I noticed my app habits changed over the last couple of weeks. I now head to bluesky for #dataBS, and head to X when I want to see temu ads.
November 9, 2024 at 1:22 AM
Reposted by Arik Friedman
This is a fantastic read. But depressing, but fantastic.

hellooperator.substack.com/p/what-it-me...
What It Means to Be Data-Driven
Stop hiding behind the numbers and start using them instead.
hellooperator.substack.com
November 4, 2024 at 3:49 AM
Reposted by Arik Friedman
Yeah, I think the immune response of "AI will replace us" was just so misguided. "AI will help us" is so much healthier. I write more code because of Cursor, not less.
November 1, 2024 at 9:38 PM
I forgot to hashtag my #dataBS. Apparently this site doesn't have an edit button.
My takeaway: LLMs are risky for generating code unless: (a) it's simple enough to review quickly, (b) the output is easy to verify, or (c) mistakes are low-stakes and tolerable.
November 2, 2024 at 1:07 AM
The talk about "automating analysts' work" reminded me of my past attempt to run an LLM through our Data Science Interview. medium.com/atlassiandat...
A colleague of mine actually repeated that recently - it's clear that LLMs got better, but they still need a human in the loop to help them along.
Will Generative AI make data scientists obsolete?
Would ChatGPT and its Code Interpret plugin pass Atlassian’s Data Science interview? How can data scientists leverage generative AI?
medium.com
November 2, 2024 at 1:02 AM
Reposted by Arik Friedman
Another Bluesky tip, there's a "Quiet posters" feed that tries to surface people who don't post very often and would otherwise get buried amongst highly online people

bsky.app/profile/did:...
October 29, 2024 at 9:43 PM
Reposted by Arik Friedman
I got your "Self Service Dashboard" right 'ere
October 28, 2024 at 3:36 PM
Reposted by Arik Friedman
“We shall spend an incredible amount of money building a data warehouse and a data science team to generate business insights. Everyone will do this.”

—Sir, I assume these data teams will be trusted and their data used for important decisions

“No, decisions will be made primarily on vibes”
October 28, 2024 at 1:17 AM
Reposted by Arik Friedman
I made a data people starter pack: go.bsky.app/8TdEfdK
October 25, 2024 at 1:06 AM