Prediction Drift Analysis - AI March Madness 2026

Name: Prediction Drift Analysis - NCAA Tournament 2026
Creator: AI March Madness 2026
License: https://creativecommons.org/licenses/by/4.0/

Prediction drift measures how AI model picks change between the T-24h, T-6h, and T-1h collection windows before each NCAA Tournament game. A flip occurs when a model switches its predicted winner between windows. Confidence shifts track changes in stated confidence without a winner change.

Models with high flip rates in the T-24h to T-6h window tend to have lower overall accuracy. The most reliable predictors maintain stable picks at T-24h and T-6h, then update decisively at T-1h when late-breaking information like injury reports becomes available.

The drift chart visualizes each model's prediction stability across all games played. GPT-4o, Gemini 2.5, and Perplexity Sonar Pro each show distinct drift patterns based on their underlying source dependencies and reasoning approaches.

Flip Rate: The percentage of games where a model changed its predicted winner between collection windows. Lower flip rates generally correlate with higher accuracy.
Confidence Shift: A change of 10 or more percentage points in stated confidence between collection windows without changing the predicted winner.
Prediction Stability: The inverse of flip rate. Higher stability indicates more consistent predictions across collection windows.

PREDICTION DRIFT

How picks shift from T-24h to T-4h to T-1h · Flipped picks highlighted in orange

DOES CONFIDENCE INCREASE NEAR TIP-OFF?

We tracked whether each model's confidence language becomes more or less certain as the game approaches. High flip-rate models tend to start overconfident and then reverse - a pattern that erodes reliability.

Models with stable confidence trajectories adjust proportionally to new information rather than swinging based on framing.

Prediction Drift Analysis - AI March Madness 2026

PREDICTION DRIFT

DOES CONFIDENCE INCREASE NEAR TIP-OFF?

GAME-BY-GAME DRIFT