Simulation and evals for AI agents. 