AgentTrajectoryEvaluator#

class langchain.evaluation.schema.AgentTrajectoryEvaluator[source]#

Interface for evaluating agent trajectories.

Attributes

`requires_input`	Whether this evaluator requires an input string.
`requires_reference`	Whether this evaluator requires a reference label.

Methods

`aevaluate_agent_trajectory`(*, prediction, ...)	Asynchronously evaluate a trajectory.
`evaluate_agent_trajectory`(*, prediction, ...)	Evaluate a trajectory.

async aevaluate_agent_trajectory(

*,

prediction: str,

agent_trajectory: Sequence[tuple[AgentAction, str]],

input: str,

reference: str | None = None,

**kwargs: Any,

) → dict[source]#

Asynchronously evaluate a trajectory.

Parameters:

prediction (str) – The final predicted response.
agent_trajectory (Sequence[tuple[AgentAction, str]]) – The intermediate steps forming the agent trajectory.
input (str) – The input to the agent.
reference (str | None) – The reference answer.
**kwargs (Any) – Additional keyword arguments.

Returns:

The evaluation result.

Return type:

dict

evaluate_agent_trajectory(

*,

prediction: str,

agent_trajectory: Sequence[tuple[AgentAction, str]],

input: str,

reference: str | None = None,

**kwargs: Any,

) → dict[source]#

Evaluate a trajectory.

Parameters:

prediction (str) – The final predicted response.
agent_trajectory (Sequence[tuple[AgentAction, str]]) – The intermediate steps forming the agent trajectory.
input (str) – The input to the agent.
reference (str | None) – The reference answer.
**kwargs (Any) – Additional keyword arguments.

Returns:

The evaluation result.

Return type:

dict