📍 VT, ⛷️ & ⛵️
sarahsnewsletter.substack.com
Switched to prefect and never looked back
I wish it was more widespread in the industry
Switched to prefect and never looked back
I wish it was more widespread in the industry
If you're trying to run a data team: analytics (learning to work with stakeholders)
Or, use data as a gateway to learning and evolve your career once again
If you're trying to run a data team: analytics (learning to work with stakeholders)
Or, use data as a gateway to learning and evolve your career once again
I recently watched the airflow summit 2023 video on it - isn't it just an Airflow plugin for dags that relies on manual hooks and lacks deep integration with data or infra assets? I'd also expect some UI around lineage.
If I'm making naive assumptions correct me
I recently watched the airflow summit 2023 video on it - isn't it just an Airflow plugin for dags that relies on manual hooks and lacks deep integration with data or infra assets? I'd also expect some UI around lineage.
If I'm making naive assumptions correct me
But what if you're running a python ETL process pre-warehouse and your infra dies? The output of that job would be out of date.
That's also lineage, and not in SQL. So we need to solve for that too.
But what if you're running a python ETL process pre-warehouse and your infra dies? The output of that job would be out of date.
That's also lineage, and not in SQL. So we need to solve for that too.
sarahsnewsletter.substack.com/p/everyone-s...
sarahsnewsletter.substack.com/p/everyone-s...
I'm hearing this is a problem when data eng / data platform become different teams.
Who's encountered this?
#dataBS
I'm hearing this is a problem when data eng / data platform become different teams.
Who's encountered this?
#dataBS
PS ignore the trolls, only way forward
PS ignore the trolls, only way forward
But what if an event happens but it's throttled to only run a thing every 5 min? Then it's not realtime
I think realtime is about the SLA of the output the event is triggering
So there's a venn diagram with an overlapping middle
But what if an event happens but it's throttled to only run a thing every 5 min? Then it's not realtime
I think realtime is about the SLA of the output the event is triggering
So there's a venn diagram with an overlapping middle