BadMerge
badmerge.bsky.social
BadMerge
@badmerge.bsky.social
Just interesting data-related stuff that I find
Reposted by BadMerge
Learn how to define and analyze the impact radius of data model changes — Dave Flynn presents a clear, hands-on walkthrough of a change-aware data-validation workflow. towardsdatascience.com/change-aware...
Change-Aware Data Validation with Column-Level Lineage | Towards Data Science
Data transformation tools like dbt make constructing SQL data pipelines easy and systematic. But even with the added structure and clearly defined data models, pipelines can still become complex,…
towardsdatascience.com
July 6, 2025 at 12:34 AM
Learn how to build modern, scalable data pipelines using Python and AI-assisted tools:

www.youtube.com/watch?v=T23B...
Data Engineering with Python and AI/LLMs – Data Loading Tutorial
YouTube video by freeCodeCamp.org
www.youtube.com
June 19, 2025 at 1:07 AM
Soda acquires NannyML:

siliconcanals.com/brussels-sod...

Are you using AI in your data transformation or data quality layer yet?

#data #ai #dataquality #datamonitoring
Brussels’ Soda acquires AI monitoring company NannyML
Soda’s acquisition of NannyML brings AI capabilities into its platform, enhancing data quality workflows and reducing blind spots and false positives.
siliconcanals.com
June 10, 2025 at 2:56 AM
SQL Workbench is a browser-based SQL IDE that lets you write, run, and visualize queries without any setup.

Follow @sql-workbench.com and developer @tobilg.com for updates:

sql-workbench.com

#SQL #data #analytics #visualization
SQL Workbench - Rapid prototyping SQL Queries & Data Visualizations
An online SQL Workbench based on DuckDB that can query and visualize remote CSV, JSON, Parquet and Arrow data, as well as local files.
sql-workbench.com
June 9, 2025 at 1:15 AM
dbt recently emphasizes their commitment to dbt core by providing a roadmap, but is lip service enough?

Will dbt core become neglected, will the data community fork it? Are there better alternatives?

What's your opinion?

www.reddit.com/r/dataengine...

#dbt #dataengineering #analytics
From the dataengineering community on Reddit
Explore this post and more from the dataengineering community
www.reddit.com
June 6, 2025 at 3:42 AM
Open-sourcing circuit tracing tools

Anthropic:
"Today, we’re open-sourcing the method...we introduced a new method to trace the thoughts of a large language model...so that anyone can build on our research"

www.anthropic.com/research/ope...

#LLM #GenAI #AI #Gemma #Llama
Open-sourcing circuit-tracing tools
Anthropic is an AI safety and research company that's working to build reliable, interpretable, and steerable AI systems.
www.anthropic.com
June 3, 2025 at 7:46 AM
Reposted by BadMerge
The 221st edition of Data Engineering Weekly is out with a fresh set of articles.

www.dataengineeringw...
May 26, 2025 at 1:01 AM
Raspberry Pi duckdb TPC-H Benchmark

"I used DuckDB and python to run TPC-H benchmark queries on a Raspberry Pi Zero 2W"

"The benchmark is built to test analytical workloads... It took 20 min to run all 22 queries"

by Jonas Hertz: www.linkedin.com/posts/jonas-...

github.com/jonas-bispec...
GitHub - jonas-bispecialist/raspberry_duckdb_tpch_test: This repository helps you setup and run TPC-H analytics test on a Raspberry Pi
This repository helps you setup and run TPC-H analytics test on a Raspberry Pi - jonas-bispecialist/raspberry_duckdb_tpch_test
github.com
May 26, 2025 at 12:43 AM
Reposted by BadMerge
This looks like a great resource for getting started with CrewAI. This course, by Tyler Reed, focuses on developing AI Agents using the CrewAI framework. CrewAI is a lean, lightning-fast Python framework built entirely from scratch.

www.youtube.com/watch?v=ONKO...

#python #crewai #ai
How To Use AI Agents To Do ALL Your Work - Full CrewAI Course for Beginners
YouTube video by Tyler AI
www.youtube.com
May 22, 2025 at 4:05 AM
Reposted by BadMerge
We just dropped a 4h knowledge bomb in collaboration with
@freecodecamp.bsky.social and #DataTalksClub

It's designed for people already in the data field who want to upskill to senior DE knowledge in data loading best practices.

www.youtube.com/watch?v=T23B...

#dataengineering #databs
Data Engineering with Python and AI/LLMs – Data Loading Tutorial
YouTube video by freeCodeCamp.org
www.youtube.com
April 19, 2025 at 3:27 AM
Data Talks Club:

LLM Zoomcamp 2025

Pre-Course Live Q&A

lu.ma/a5hk6hxy

#LLM #onlinecourse #data
LLM Zoomcamp 2025
Pre-Course Live Q&A · Luma
Join us for an interactive Q&A session about the LLM Zoomcamp, our free online course about real-life applications of large language models (LLMs). You will…
lu.ma
May 19, 2025 at 7:29 AM
SQLBolt

A series of interactive lessons and exercises designed to help you quickly learn SQL right in your browser

sqlbolt.com

#SQL #learn #data #resources
SQLBolt - Learn SQL - Introduction to SQL
SQLBolt provides a set of interactive lessons and exercises to help you learn SQL
sqlbolt.com
May 15, 2025 at 11:30 PM
Key Moments from AI Codecon (@oreilly.bsky.social):

My LLM Codegen Workflow at the Moment - Harper Reed

AI-driven design-to-code workflow involving multiple models for Q&A, spec generation, planning, and code prompting

www.youtube.com/watch?v=h2gi...

#LLM #codegen #vibecoding
Key Moments from AI Codecon: My LLM Codegen Workflow at the Moment - Harper Reed
YouTube video by O'Reilly
www.youtube.com
May 15, 2025 at 2:46 AM