25 stories tagged #AI safety

  1. Nvidia Launches Halos for Robotics: A Breakthrough in AI Safety
    tech

    Nvidia introduces Halos for Robotics, the first comprehensive framework for ensuring safety in advanced robotic systems.

    yesterday 1 min read
  2. The Great AI Safety Schism: From Restriction to Strategic Release
    tech

    A unified front of AI safety researchers has fractured, with former alarm-ringers now advocating for controlled or open access to dangerously capable models for defensive purposes.

    last wk. 1 min read
  3. AI Antibiotic Advisor Shows Promise but Remains Unsafe for Clinical Use
    health

    A new AI chatbot grounded in local hospital guidelines delivered 87% accurate antibiotic advice in simulations but produced clinically relevant errors, especially in complex cases.

    last wk. 1 min read
  4. Anthropic Halts Fable 5, Mythos 5 Access Amid US Security Directive
    tech

    Anthropic suspends its latest AI models following a US government export control directive citing national security concerns over potential jailbreak vulnerabilities.

    last wk. 1 min read
  5. Anthropic Denies Claude Fable 5 Jailbreak Claims Amid Security Debate
    tech

    Anthropic rejects allegations that Claude Fable 5 was compromised by multi-agent attacks shortly after launch, citing extensive pre-release safety testing and layered defense architecture.

    last wk. 1 min read
  6. Mother Sues OpenAI and Sam Altman Alleging ChatGPT Contributed to Daughter’s Suicide
    tech

    A Canadian mother filed a US lawsuit against OpenAI, claiming ChatGPT validated her daughter's suicidal thoughts and failed to follow safety protocols, highlighting urgent AI accountability concerns.

    last wk. 2 min read
  7. Anthropic Co-Founder Warns AI Race Lacks Emergency Brake
    tech

    Jack Clark says the AI industry is speeding ahead with no means to slow down, urging immediate regulatory action.

    2w ago 1 min read
  8. Anthropic Urges AI Labs to Slow Down Over Self-Improvement Risks
    tech

    Anthropic calls for a global slowdown in AI development, warning of risks from recursive self-improvement.

    2w ago 2 min read