AnswerRelevancy

org.llm4s.rag.evaluation.metrics.AnswerRelevancy
See the AnswerRelevancy companion object.
class AnswerRelevancy(llmClient: LLMClient, embeddingClient: EmbeddingClient, modelConfig: EmbeddingModelConfig, numGeneratedQuestions: Int) extends RAGASMetric

Answer Relevancy metric: measures how well the answer addresses the question.

Algorithm:

  1. Generate N questions that the provided answer would address
  2. Compute embedding for the original question
  3. Compute embeddings for the generated questions
  4. Calculate cosine similarity between original and generated question embeddings
  5. Score = average similarity across generated questions

The intuition: if the answer is relevant to the question, then questions generated from the answer should be semantically similar to the original question.
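Steps 2–5 reduce to averaging cosine similarities between embedding vectors. A minimal sketch of that scoring step, assuming embeddings are plain vectors of doubles (the actual embedding client types may differ):

def cosine(a: Vector[Double], b: Vector[Double]): Double = {
  // Dot product divided by the product of the vector norms.
  val dot   = a.zip(b).map { case (x, y) => x * y }.sum
  val normA = math.sqrt(a.map(x => x * x).sum)
  val normB = math.sqrt(b.map(x => x * x).sum)
  if (normA == 0.0 || normB == 0.0) 0.0 else dot / (normA * normB)
}

// Score = mean similarity of the original question to each generated question.
def relevancyScore(original: Vector[Double], generated: Seq[Vector[Double]]): Double =
  if (generated.isEmpty) 0.0
  else generated.map(cosine(original, _)).sum / generated.size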

Value parameters

llmClient: LLM client for generating questions from the answer
embeddingClient: Client for computing embeddings
modelConfig: Embedding model configuration
numGeneratedQuestions: Number of questions to generate (default: 3)

Example
val metric = AnswerRelevancy(llmClient, embeddingClient, modelConfig)
val sample = EvalSample(
 question = "What is machine learning?",
 answer = "Machine learning is a subset of AI that enables systems to learn from data.",
 contexts = Seq("...") // contexts not used for this metric
)
val result = metric.evaluate(sample)
// High score if generated questions are similar to "What is machine learning?"
Companion: object AnswerRelevancy
Supertypes
trait RAGASMetric
class Object
trait Matchable
class Any

Members list

Value members

Concrete methods

override def evaluate(sample: EvalSample): Result[MetricResult]

Evaluate a single sample.

Value parameters

sample: The evaluation sample containing question, answer, and contexts

Returns: Score between 0.0 and 1.0, with optional details
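
A hedged usage sketch for consuming the result, assuming Result is an Either-style type and that MetricResult exposes a score field (both assumptions, not confirmed by this page):

// Assumes Result[A] unwraps like Either and MetricResult carries a `score`.
metric.evaluate(sample) match {
  case Right(metricResult) => println(s"answer_relevancy = ${metricResult.score}")
  case Left(error)         => println(s"evaluation failed: $error")
}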

Inherited methods

def canEvaluate(sample: EvalSample): Boolean

Check if this metric can be evaluated for a given sample.

Inherited from: RAGASMetric

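For example, a caller can guard evaluation on this check (a sketch using only members shown on this page):

// Evaluate only when the sample carries the inputs this metric needs.
val maybeResult =
  if (metric.canEvaluate(sample)) Some(metric.evaluate(sample))
  else None
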
def evaluateBatch(samples: Seq[EvalSample]): Result[Seq[MetricResult]]

Evaluate multiple samples.

Default implementation evaluates sequentially. Override for batch optimizations (e.g., batched LLM calls).

Value parameters

samples: The evaluation samples

Returns: Results for each sample in order

Inherited from: RAGASMetric
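
The sequential default described above could look roughly like the following fold; an illustrative sketch only, assuming Result is a right-biased Either-style alias:

// Illustrative: evaluate each sample in turn, collecting results in order and
// short-circuiting on the first error (Either-style semantics assumed).
def evaluateAll(metric: RAGASMetric, samples: Seq[EvalSample]): Result[Seq[MetricResult]] =
  samples.foldLeft[Result[Seq[MetricResult]]](Right(Vector.empty)) { (acc, sample) =>
    for {
      done <- acc
      next <- metric.evaluate(sample)
    } yield done :+ next
  }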

Concrete fields

override val description: String

Human-readable description of what this metric measures.

override val name: String

Unique name of this metric (e.g., "faithfulness", "answer_relevancy"). Used as an identifier in results and configuration.

override val requiredInputs: Set[RequiredInput]

Which inputs this metric requires from an EvalSample. Used to skip metrics when required inputs are missing.
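
For example, a runner might filter metrics by comparing these required inputs against what a sample actually provides; a sketch in which availableInputs is a hypothetical helper and no concrete RequiredInput values are assumed:

// Hypothetical helper mapping an EvalSample to the inputs it actually carries.
def availableInputs(sample: EvalSample): Set[RequiredInput] = ???

// Run only the metrics whose declared required inputs the sample satisfies.
def runnableMetrics(metrics: Seq[RAGASMetric], sample: EvalSample): Seq[RAGASMetric] =
  metrics.filter(m => m.requiredInputs.subsetOf(availableInputs(sample)))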