SWE-Bench Pro
2 stories
-
techAnthropic's Claude Fable 5 Shatters AI Benchmarks, Surpassing OpenAI's Leading Model
Anthropic's new 'Mythos-class' Claude Fable 5 achieves a 161 on the Epoch Capabilities Index and dominates in software engineering tests, signaling a major shift in the enterprise AI landscape.
-
techKimi WebBridge Lets AI Agents Drive Your Browser Locally
Moonshot AI launches Kimi WebBridge, a browser extension for local AI agent automation, keeping data private.