Agentify Example: Tau-Bench

Example code for agentifying Tau-Bench using the A2A (Agent2Agent) and MCP (Model Context Protocol) standards.

Project Structure

src/
├── green_agent/    # Assessment manager agent
├── white_agent/    # Target agent being tested
└── launcher.py     # Evaluation coordinator

Installation

uv sync

First, create a .env file containing OPENAI_API_KEY=..., then run:

# Launch complete evaluation
uv run python main.py launch
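
Before launching, it can help to confirm that the .env file actually defines the key. The following is a minimal sketch, not part of this repository: the helper names parse_env and missing_keys are hypothetical, and it assumes .env uses plain KEY=VALUE lines with optional quotes and # comments.

```python
from pathlib import Path


def parse_env(text: str) -> dict[str, str]:
    """Parse KEY=VALUE lines, skipping blank lines and # comments."""
    env: dict[str, str] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        env[key.strip()] = value.strip().strip('"').strip("'")
    return env


def missing_keys(
    env: dict[str, str],
    required: tuple[str, ...] = ("OPENAI_API_KEY",),
) -> list[str]:
    """Return the required keys that are absent or empty."""
    return [key for key in required if not env.get(key)]


if __name__ == "__main__":
    path = Path(".env")  # assumed to sit in the project root
    env = parse_env(path.read_text()) if path.exists() else {}
    missing = missing_keys(env)
    if missing:
        print("Missing in .env:", ", ".join(missing))
    else:
        print("Environment looks ready.")
```

Run from the project root, this only reports whether the key is present; it does not validate the key against the OpenAI API.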
