Evaluation library for LLM applications. Create custom evaluators or use pre-built ones for hallucination detection, relevance scoring, and other evaluation tasks.
Model Context Protocol (MCP) server for Phoenix. Provides access to prompts, datasets, and experiments through the MCP standard for integration with Claude Desktop, Cursor, and other MCP-compatible tools.