How eval engineering manages AI agent behavior by addressing cost and latency challenges for production governance.