CF
ClearFeed
Trust Analysis
85Trust
Verified
πŸ” Web Verified
Eric TopolonX / Twitter5/9/2026
Expressing uncertainty is a major weak spot of LLMs in medicine @NEJM "Can AI Say I Don't Know?" Good lines: "Contemporary LLMs have passed many Turing tests, but will they pass this modern test of not knowing? We don’t know." nejm.org/doi/full/10.10… https://t.co/CXL0fpzP6v
Trust Metrics
85
Accuracy
82
Framing
85
Context
88
Tone
Accuracy85%
Framing82%
Context85%
Tone88%
Analysis Summary
A New England Journal of Medicine article highlights a critical gap in large language models: they struggle to express uncertainty appropriately, especially in medicine where clinicians are expected to acknowledge the limits of their knowledge. The piece uses the metaphor of passing Turing tests (demonstrating human-like conversation) versus passing a more demanding modern testβ€”knowing when not to know. This matters because AI tools deployed in clinical settings that confidently assert incorrect answers pose real patient safety risks. The broader pattern documented across multiple sources shows LLMs routinely hallucinate facts and overstate confidence, a limitation that persists despite their impressive performance on many benchmarks.
Claims Analysis (2)
β€œContemporary LLMs have passed many Turing tests, but may not pass a modern test of not knowing whether they can express uncertainty appropriately”
NEJM article directly addresses thisβ€”LLMs lack appropriate uncertainty expression in medical contexts, a documented limitation
βœ“ Verified
β€œExpressing uncertainty is a major weak spot of LLMs in medicine”
Core thesis of the NEJM article. Multiple independent sources (Gizmodo, Hackaday) document LLM hallucination and confidence failures
βœ“ Verified
Was this analysis helpful?
Try ClearFeed free β†’
clearfeed.app β€” Trust scores for your social feed