1. Increase max_tokens: Update the model configuration as shown in the first sketch below.
2. Update Phoenix: Use version ≥0.17.4, which removes token limits for OpenAI and increases defaults for other APIs.
3. Check logs: Look for finish_reason="length" to confirm token limits caused the issue (second sketch below).
4. If the above doesn't work, the LLM-as-a-judge output might not fit into the rails defined for that particular custom Phoenix eval. Double-check that the prompt output matches the rail expectations (third sketch below).
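For step 1, a minimal sketch assuming the phoenix.evals OpenAIModel API; the model name and token value are illustrative, and parameter names can vary between Phoenix versions:

```python
from phoenix.evals import OpenAIModel

# Raise max_tokens so the judge's full response isn't truncated.
# Model name and value are illustrative; adjust for your setup.
model = OpenAIModel(
    model="gpt-4o",
    temperature=0.0,
    max_tokens=1024,  # low defaults can cut off verbose judge output
)
```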
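For step 3, if you want to see finish_reason directly, you can reproduce the truncation with the OpenAI Python SDK; this sketch is illustrative and not Phoenix-specific:

```python
from openai import OpenAI

client = OpenAI()  # assumes OPENAI_API_KEY is set in the environment

resp = client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Explain why the sky is blue."}],
    max_tokens=8,  # deliberately tiny to force truncation
)

# finish_reason == "length" means the reply hit max_tokens mid-generation,
# the same failure mode a truncated judge response shows in the logs.
print(resp.choices[0].finish_reason)
```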
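For step 4, a sketch of how the rails and the prompt must agree in a custom eval, assuming the phoenix.evals llm_classify API; the template, labels, and dataframe are hypothetical:

```python
import pandas as pd
from phoenix.evals import OpenAIModel, llm_classify

# Hypothetical custom eval: the template must instruct the judge to answer
# with exactly one of the rail strings, or the row can't be parsed.
TEMPLATE = """You are judging whether a response is helpful.

Response: {response}

Answer with a single word, "helpful" or "unhelpful", and nothing else."""

rails = ["helpful", "unhelpful"]  # must match the labels the prompt asks for

df = pd.DataFrame({"response": ["Here are the steps...", "I don't know."]})

results = llm_classify(
    dataframe=df,
    model=OpenAIModel(model="gpt-4o"),
    template=TEMPLATE,
    rails=rails,
)
print(results["label"])  # outputs that miss the rails show up as NOT_PARSABLE
```

If the judge's answers never match a rail string (for example, it writes a full sentence instead of a single label), tighten the template instructions or widen the rails until they line up.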

