Phoenix Evals automatically traces all evaluation executions, providing complete transparency into how your evaluators make decisions. This visibility is essential for achieving human alignment and building trust in your evaluation results.
Phoenix Evals follows the Transparency pillar: nothing is abstracted away. You can inspect every aspect of the evaluation process, from the raw prompts to the model’s step-by-step reasoning. This transparency enables you to:
Tune evaluation prompts for better human alignment
Identify systematic biases or errors in evaluation logic
Provide evidence-based justification for evaluation results
Continuously improve evaluator performance through data-driven insights
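As a rough illustration of the data-driven checks listed above, the sketch below compares evaluator labels against human labels and counts where they disagree. It is plain Python and does not use any Phoenix API: the `records` list, its field names (`evaluator`, `human`), and both helper functions are hypothetical stand-ins for whatever you assemble from your own evaluation traces and human annotations.

```python
from collections import Counter

# Hypothetical records, e.g. assembled by hand from evaluation traces
# plus human annotations. Field names are illustrative, not a Phoenix schema.
records = [
    {"id": "a", "evaluator": "correct", "human": "correct"},
    {"id": "b", "evaluator": "correct", "human": "incorrect"},
    {"id": "c", "evaluator": "incorrect", "human": "incorrect"},
    {"id": "d", "evaluator": "correct", "human": "incorrect"},
]


def agreement_rate(rows):
    """Fraction of rows where the evaluator label matches the human label."""
    if not rows:
        return 0.0
    matches = sum(1 for r in rows if r["evaluator"] == r["human"])
    return matches / len(rows)


def disagreement_counts(rows):
    """Count (human, evaluator) label pairs for rows where the two differ."""
    return Counter(
        (r["human"], r["evaluator"])
        for r in rows
        if r["evaluator"] != r["human"]
    )


rate = agreement_rate(records)
disagreements = disagreement_counts(records)

print(f"agreement: {rate:.0%}")
for (human, evaluator), n in disagreements.most_common():
    print(f"human={human!r} evaluator={evaluator!r}: {n}")
```

In this toy data every disagreement runs in one direction (the evaluator says `correct` where the human says `incorrect`), which is the kind of one-sided pattern that would suggest a systematic bias worth investigating in the evaluation prompt.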
Use Phoenix’s trace viewer to explore evaluation traces and ensure your evaluators are making decisions that align with human judgment.