If your dataset is 72% "no-churn," a model that just always says "no-churn" gets 72% accuracy. Your "fancy" model might just be slightly better than a dumb baseline! 🫠
#DataTalksClub
If your dataset is 72% "no-churn," a model that just always says "no-churn" gets 72% accuracy. Your "fancy" model might just be slightly better than a dumb baseline! 🫠
#DataTalksClub
It measures a model's ability to separate positive and negative classes, independent of any specific threshold.
🎯 1.0 = Perfect classifier 🤷 0.5 = Random guessing
#MLZoomcamp #DataTalksClub #LearningInPublic
It measures a model's ability to separate positive and negative classes, independent of any specific threshold.
🎯 1.0 = Perfect classifier 🤷 0.5 = Random guessing
#MLZoomcamp #DataTalksClub #LearningInPublic
1. Split data into K folds.
2. Train on K-1, test on 1.
3.Repeat K times.
Get the mean & standard deviation of the scores.
A low std dev means your model's performance is consistent! 😌 #MLZoomcamp #DataTalksClub #LearningInPublic
1. Split data into K folds.
2. Train on K-1, test on 1.
3.Repeat K times.
Get the mean & standard deviation of the scores.
A low std dev means your model's performance is consistent! 😌 #MLZoomcamp #DataTalksClub #LearningInPublic
A linear model scores a customer, then the sigmoid function squashes that score into a 0-1 probability. High probability = high churn risk! 🎯
Learn more from Alexey Grigorev!
#MLZoomcamp #DataTalksClub #MachineLearning
A linear model scores a customer, then the sigmoid function squashes that score into a 0-1 probability. High probability = high churn risk! 🎯
Learn more from Alexey Grigorev!
#MLZoomcamp #DataTalksClub #MachineLearning
📈 Business Goal -> 📊 Data -> 🧹 Prep -> 🤖 Model -> ✅ Evaluate -> 🚀 Deploy.
It's a cycle, not a straight line! Loving this structured approach.
#MachineLearning #Process #MLZoomcamp #DataTalksClub
📈 Business Goal -> 📊 Data -> 🧹 Prep -> 🤖 Model -> ✅ Evaluate -> 🚀 Deploy.
It's a cycle, not a straight line! Loving this structured approach.
#MachineLearning #Process #MLZoomcamp #DataTalksClub
We covered the entire offline evaluation pipeline: from generating a ground truth dataset to assessing performance with metrics like MRR, Hit Rate, and Cosine Similarity, plus the fascinating LLM-as-a-Judge approach.
#LLMZoomcamp #RAG #LLM #Evaluation
We covered the entire offline evaluation pipeline: from generating a ground truth dataset to assessing performance with metrics like MRR, Hit Rate, and Cosine Similarity, plus the fascinating LLM-as-a-Judge approach.
#LLMZoomcamp #RAG #LLM #Evaluation