1 story tagged #BullshitBench

  1. AI Models Fail Critical 'Bullshit' Detection Test
    tech

    A new benchmark reveals most AI models confidently answer nonsensical questions, highlighting significant flaws in their reasoning and a need for improved AI safety.

    3mo ago 2 min read