85Trust
Verified
π Web Verified
Eric TopolonX / Twitter5/9/2026
Expressing uncertainty is a major weak spot of LLMs in medicine @NEJM "Can AI Say I Don't Know?"
Good lines: "Contemporary LLMs have passed many
Turing tests, but will they pass this modern test of not knowing? We donβt know."
nejm.org/doi/full/10.10β¦ https://t.co/CXL0fpzP6v
Trust Metrics
85
82
85
88
Accuracy85%
Framing82%
Context85%
Tone88%
Analysis Summary
A New England Journal of Medicine article highlights a critical gap in large language models: they struggle to express uncertainty appropriately, especially in medicine where clinicians are expected to acknowledge the limits of their knowledge. The piece uses the metaphor of passing Turing tests (demonstrating human-like conversation) versus passing a more demanding modern testβknowing when not to know. This matters because AI tools deployed in clinical settings that confidently assert incorrect answers pose real patient safety risks. The broader pattern documented across multiple sources shows LLMs routinely hallucinate facts and overstate confidence, a limitation that persists despite their impressive performance on many benchmarks.
Claims Analysis (2)
βContemporary LLMs have passed many Turing tests, but may not pass a modern test of not knowing whether they can express uncertainty appropriatelyβ
NEJM article directly addresses thisβLLMs lack appropriate uncertainty expression in medical contexts, a documented limitation
βExpressing uncertainty is a major weak spot of LLMs in medicineβ
Core thesis of the NEJM article. Multiple independent sources (Gizmodo, Hackaday) document LLM hallucination and confidence failures
Verify Yourself
Was this analysis helpful?
Try ClearFeed free β