Learn how to use client-side evaluators for both tracing and experiments
`enrich_span` is used to add metrics on particular steps, `get_relevant_docs` and `generate_response`, while `enrich_session` is used to set metrics that apply to the entire session or pipeline run.

An evaluator can take one parameter (`outputs`), two parameters (`outputs`, `inputs`), or three parameters (`outputs`, `inputs`, `ground_truths`), depending on what data your evaluation logic requires.

The example uses two evaluators: `consistency_evaluator` for the main `rag_pipeline` function, and `retrieval_relevance_evaluator` for the document retrieval step. The first is passed directly to `evaluate()`, while the second is enriched within the retrieval step itself.
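
The one-, two-, and three-parameter evaluator shapes described above can be sketched in plain Python. This is a minimal illustration, not the library's implementation: the evaluator names (`length_evaluator`, `mentions_query_evaluator`, `exact_match_evaluator`), the dictionary keys, and the `call_evaluator` helper are all hypothetical, and the helper only mimics the idea of passing an evaluator the arguments it declares. How `evaluate()` actually dispatches arguments is not shown here.

```python
import inspect


# Hypothetical evaluators: names and scoring rules are illustrative only.
def length_evaluator(outputs):
    """One parameter: score the output on its own (word count)."""
    return len(str(outputs).split())


def mentions_query_evaluator(outputs, inputs):
    """Two parameters: check whether the output contains the query text."""
    return inputs["query"].lower() in str(outputs).lower()


def exact_match_evaluator(outputs, inputs, ground_truths):
    """Three parameters: compare the output against the expected answer."""
    return str(outputs).strip() == ground_truths["answer"].strip()


def call_evaluator(fn, outputs, inputs=None, ground_truths=None):
    """Stand-in dispatcher (an assumption, not the library's mechanism):
    pass only as many positional arguments as the evaluator declares."""
    n_params = len(inspect.signature(fn).parameters)
    args = (outputs, inputs, ground_truths)[:n_params]
    return fn(*args)


# Made-up sample data for a single datapoint.
outputs = "Paris is the capital of France."
inputs = {"query": "capital of France"}
ground_truths = {"answer": "Paris is the capital of France."}

results = {
    fn.__name__: call_evaluator(fn, outputs, inputs, ground_truths)
    for fn in (length_evaluator, mentions_query_evaluator, exact_match_evaluator)
}
print(results)
```

Declaring only the parameters an evaluator needs keeps each check small: a format or length check needs only `outputs`, while a correctness check also needs `ground_truths`.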