My institution recently began using an AI detector that gives a percentage score for AI-generated content. If the score is high, the work is immediately questioned. There is no discussion of uncertainty or margin of error. When did these scores start being treated as proof rather than estimates?

