class Riffer::Evals::EvaluatorRunner

Orchestrates running evaluators against an agent across multiple scenarios.

Accepts an agent class, a list of scenarios, and evaluator classes. Generates agent output for each scenario and runs all evaluators, returning a RunResult with per-scenario details and aggregate scores.

result = Riffer::Evals::EvaluatorRunner.run( agent: MyAgent, scenarios: [ { input: “What is Ruby?”, ground_truth: “A programming language” }, { input: “What is Python?” } ], evaluators: [AnswerRelevancyEvaluator] )

result.scores # => { AnswerRelevancyEvaluator => 0.85 }