1. Definition

  • Calibration = the alignment between predicted probabilities and actual observed frequencies.
  • A well-calibrated model produces probabilities that can be directly interpreted as real-world likelihoods.

Example:

  • If a fraud detection model outputs 70% probability, and among all such predictions about 70% are truly fraud, the model is well calibrated.
  • If in reality only 40% are fraud, the model is overconfident.

2. Why It Matters


3. How to Assess Calibration


4. How to Improve Calibration


5. Example Use Cases

  • Credit Fraud Detection:
    • Critical: calibrated probabilities directly guide risk thresholds and fraud investigation prioritization.
  • Yelp Sentiment Analysis:
    • Less critical: classification accuracy is more important than calibrated probabilities.
  • Energy Forecasting:
    • Not relevant: regression metrics (RMSE, MAPE) matter more than probability calibration.

Summary:

  • Calibration ensures predicted probabilities reflect true outcome frequencies.
  • Important for domains where probabilities drive decisions (fraud detection, medical diagnosis, insurance).
  • Key methods: Platt Scaling, Isotonic Regression, Temperature Scaling.