AI inference
-
techXcena Raises $135M to Build Memory-Centric AI Chip MX1
South Korean startup Xcena has secured $135 million to develop a memory-focused AI chip designed to solve data bottlenecks in AI inference.
-
techQuantum Leap in AI: IBM Computer Boosts LLM Accuracy
Scientists use an IBM quantum computer to train an AI model, reducing perplexity and improving answer accuracy over the base version.
-
techQualcomm Secures Major Hyperscale Customer for Custom Data Center Chips
Qualcomm wins a major data center customer for custom AI inference chips, marking its return to the server market with shipments due by December 2026.
-
techUK Chip Startup Fractile Raises $220M to Speed Up AI Inference
Fractile secures $220M in Series B to accelerate token consumption with novel chip design.
-
techTokenization and AI Inference Reshaping Inflation and Asset Prices
Jordi Visser explains how tokenization will impact inflation, AI inference drives asset price parabolas, and the tech sector's race against obsolescence.
-
techAranya Launches Cluster OS, Partnering with Hydra Host for Bare-Metal AI
Aranya debuts its ClusteredOS, simplifying AI infrastructure deployment and partnering with Hydra Host for bare-metal AI solutions.
-
techCNCF Tackles AI Inference Chaos with Cloud-Native Solutions
The CNCF is advancing cloud-native infrastructure to handle the unpredictability and scale of AI inference, driving efficiency and adoption.
-
techGimlet Labs Secures $80M for Multi-Silicon Inference Cloud
Gimlet Labs raises $80M led by Menlo Ventures to solve AI inference bottlenecks with its unique multi-silicon approach.