SWE-bench Pro
3 stories
-
techAnthropic's Claude Sonnet 5 Challenges Opus Model at a Lower Price Point
Anthropic's new Claude Sonnet 5 model matches the performance of its top-tier Opus 4.8 on key benchmarks, offering developers near-equivalent intelligence at a significantly reduced cost.
-
techAnthropic's Claude Fable 5 Shatters AI Benchmarks, Surpassing OpenAI's Leading Model
Anthropic's new 'Mythos-class' Claude Fable 5 achieves a 161 on the Epoch Capabilities Index and dominates in software engineering tests, signaling a major shift in the enterprise AI landscape.
-
techKimi WebBridge Lets AI Agents Drive Your Browser Locally
Moonshot AI launches Kimi WebBridge, a browser extension for local AI agent automation, keeping data private.