Skip to content
All threads

Thread

The Fabrication Problem

Most AI numbers are fabricated. Source material fixes it. Self-checking fails. Trust signals are backwards.

Answered Source material collapses unsourced numbers to single digits. Prohibition outperforms monitoring 5x. Self-checking fails because the same process generates and evaluates. Trust signals (citations, confidence, specificity) are higher in fabricated output than sourced output.

Open Does the fix work beyond reformulation tasks? Reasoning shows improvement (75% vs 38%), but strategy and creative untested.