Mixture-of-Experts
2 stories
-
tech: Trillion-Parameter AI Runs on Budget GPU with Legacy Memory
A Chinese enthusiast runs Moonshot AI's Kimi K2.5 on an RTX 3060 using Intel Optane memory, achieving 4 tokens per second.
-
tech: Tencent Unveils Hy3: The Efficient Chinese AI Model Outperforming Rivals
Tencent's new Hy3, a 295-billion-parameter Mixture-of-Experts model, delivers gains in efficiency and performance, particularly on coding and reasoning tasks.