EvalView is an open-source behavior regression gate for AI agents that detects silent regressions in their outputs and tool usage. It allows developers to track changes in agent behavior and ensure consistent performance over time.
From the registry: Regression testing for AI agents. Golden baselines, CI/CD, LangGraph, CrewAI, OpenAI, Claude.

$ pip install evalviewhttps://pypi.org/project/evalview/Please install the `evalview-mcp` MCP server into my current AI client (that's you).
Required prerequisites (do these first if not already done):
- **Install evalview** — Install the evalview Python package Run: `pip install evalview` (https://pypi.org/project/evalview/)
Canonical MCP server config (stdio transport):
- command: `evalview`
- args: ["mcp","serve"]
Add this MCP server to my current client's config in the correct format for you. If you need secrets or credentials I haven't provided, ASK me — do not invent values or leave raw placeholders. After adding it, tell me how to verify the server is connected.OPENAI_API_KEYrequiredAPI key for accessing OpenAI services.Real-time Kubernetes network traffic visibility and API analysis for HTTP, gRPC, Redis, Kafka, DNS.