Most AI agent failures don't happen during the demo. They happen when APIs fail, context windows explode, costs spiral, and nobody can explain why the agent made a decision. Here are five questions that separate production-ready platforms from expensive experiments.