Example code for agentifying Tau-Bench using A2A and MCP standards.
src/
├── green_agent/ # Assessment manager agent
├── white_agent/ # Target agent being tested
└── launcher.py # Evaluation coordinator
uv sync
First, configure .env with OPENAI_API_KEY=..., then run:
# Launch complete evaluation
uv run python main.py launch
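The launch step above depends on .env defining OPENAI_API_KEY. As a minimal sketch, a pre-flight check could look like the following. The helper names read_env_value and has_api_key are hypothetical and not part of this repository, and the parser only handles simple KEY=value lines.

```python
from pathlib import Path
from typing import Optional


def read_env_value(env_text: str, key: str) -> Optional[str]:
    """Return the value assigned to `key` in .env-style text, or None if absent."""
    for raw_line in env_text.splitlines():
        line = raw_line.strip()
        # Skip blank lines and comments.
        if not line or line.startswith("#"):
            continue
        name, sep, value = line.partition("=")
        if sep and name.strip() == key:
            # Drop surrounding whitespace and optional quotes.
            return value.strip().strip("'\"")
    return None


def has_api_key(env_path: str = ".env") -> bool:
    """True if the file exists and sets a non-empty OPENAI_API_KEY (hypothetical helper)."""
    path = Path(env_path)
    if not path.is_file():
        return False
    return bool(read_env_value(path.read_text(), "OPENAI_API_KEY"))


if __name__ == "__main__":
    print("OPENAI_API_KEY set:", has_api_key())
```

Running this from the project root before the launch command reports whether the key is present, without printing the key itself.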