Demo
Function Calling Eval Template
We are continually iterating our templates, view the most up-to-date template on GitHub.
Running an Agent Eval using the Function Calling Template
df- a dataframe of cases to evaluate. The dataframe must have these columns to match the default template:question- the query made to the model. If you’ve exported spans from Phoenix to evaluate, this will thellm.input_messagescolumn in your exported data.tool_call- information on the tool called and parameters included. If you’ve exported spans from Phoenix to evaluate, this will be thellm.function_callcolumn in your exported data.

