MatX CEO Reiner Pope explains how batch size and KV cache dictate AI latency and cost, and why efficient inference is crucial.