Lenz Research
2 stories
-
techAI Models Disagree on Two-Thirds of Fact-Checks, Lenz Research Study Finds
Lenz Research tests five AI models on 1,000 real claims, finding 67% disagreement, with implications for AI reliability in markets.
-
techAI Models Disagree on Basic Facts Two-Thirds of the Time, Study Finds
Five frontier AI systems agreed on only 328 out of 1,000 real-world fact-check claims, revealing deep reliability issues in the models' understanding of basic factual information.