SourceAttributionGuardrail

org.llm4s.agent.guardrails.rag.SourceAttributionGuardrail

See the SourceAttributionGuardrail companion object

class SourceAttributionGuardrail(val llmClient: LLMClient, val requireAttributions: Boolean, val minAttributionScore: Double, val onFail: GuardrailAction) extends RAGGuardrail

LLM-based guardrail to validate that responses properly cite their sources.

SourceAttributionGuardrail ensures that RAG responses include proper citations to the source documents from which information was derived. This is important for transparency, verifiability, and trust.

Evaluation criteria:

Does the response cite sources for factual claims?
Are the citations accurate (pointing to the right chunks)?
Are all major claims properly attributed?

Use cases:

Ensure transparency in RAG responses
Enable users to verify information
Comply with requirements for attributing sources
Detect when responses fail to cite available sources

Example usage:

val guardrail = SourceAttributionGuardrail(llmClient)

val context = RAGContext.withSources(
 query = "What causes climate change?",
 chunks = Seq("Human activities release greenhouse gases..."),
 sources = Seq("IPCC Report 2023.pdf")
)

// Response should cite sources
val response = "According to the IPCC Report, human activities release greenhouse gases..."
guardrail.validateWithContext(response, context)

Value parameters

llmClient: The LLM client for evaluation
minAttributionScore: Minimum attribution quality score (default: 0.5)
onFail: Action to take when attribution is insufficient (default: Block)
requireAttributions: Whether citations are required (default: true)
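
The interaction between these parameters can be pictured with a minimal, self-contained sketch. Everything in it is an assumption made for illustration: the `Action`, `Settings`, and `AttributionCheck` names are hypothetical stand-ins rather than llm4s types, the score is assumed to lie in [0.0, 1.0] and to pass when it is at least `minAttributionScore`, and a Fix-style action is assumed to let the response through so citations can be added later.

```scala
// Hypothetical stand-ins for illustration only; these are NOT the llm4s types.
sealed trait Action
object Action {
  case object Block extends Action // reject the response
  case object Fix extends Action   // assumed: pass it on so citations can be added later
}

final case class Settings(
  requireAttributions: Boolean = true,
  minAttributionScore: Double = 0.5,
  onFail: Action = Action.Block
)

object AttributionCheck {
  // `score` is assumed to be an attribution quality score in [0.0, 1.0]
  // produced by the LLM evaluation; here it is simply passed in.
  def decide(response: String, score: Double, s: Settings): Either[String, String] = {
    // Assumption: when citations are not required, the score is not enforced.
    val sufficient = !s.requireAttributions || score >= s.minAttributionScore
    if (sufficient) Right(response)
    else s.onFail match {
      case Action.Block =>
        Left(s"attribution score $score is below minimum ${s.minAttributionScore}")
      case Action.Fix =>
        Right(response) // left for a later transform step to repair
    }
  }
}
```

With the defaults shown, a response scored 0.8 passes and one scored 0.2 is rejected; the real guardrail may weigh these parameters differently.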

Attributes

Companion: object
Supertypes: trait RAGGuardrail, trait OutputGuardrail, trait Guardrail[String], class Object, trait Matchable, class Any

Members list

Value members

Concrete methods

Transform response to add citations if in Fix mode.

Attributes

Definition Classes: RAGGuardrail

Standard validate without context.

Attributes

Definition Classes: Guardrail

Validate that response properly attributes sources.

Attributes

Definition Classes: RAGGuardrail

Inherited methods

Compose this guardrail with another sequentially.

The second guardrail runs only if this one passes.

Value parameters

other: The guardrail to run after this one

Attributes

Returns: A composite guardrail that runs both in sequence
Inherited from: Guardrail
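
The "second runs only if the first passes" behaviour described above can be sketched with a small, self-contained model. The `Check` trait and its `andThen` method are hypothetical names chosen for this illustration, and results are modelled as `Either[String, A]`; the actual llm4s method name and result type may differ.

```scala
// Hypothetical minimal guardrail model; not the llm4s Guardrail trait.
trait Check[A] { self =>
  def validate(value: A): Either[String, A]

  // Sequential composition: `other` runs only when this check succeeds,
  // because flatMap on a Left short-circuits.
  def andThen(other: Check[A]): Check[A] = new Check[A] {
    def validate(value: A): Either[String, A] =
      self.validate(value).flatMap(other.validate)
  }
}

object CheckDemo {
  val nonEmpty: Check[String] = new Check[String] {
    def validate(v: String): Either[String, String] =
      if (v.trim.nonEmpty) Right(v) else Left("empty response")
  }

  // A deliberately naive citation test, used only to make the demo concrete.
  val mentionsSource: Check[String] = new Check[String] {
    def validate(v: String): Either[String, String] =
      if (v.contains("According to")) Right(v) else Left("no citation found")
  }

  val combined: Check[String] = nonEmpty.andThen(mentionsSource)
}
```

In this model a blank response fails with the first check's error and the citation check is never invoked.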

Optional: Transform the output after validation. Default is identity (no transformation).

Value parameters

output: The validated output

Attributes

Returns: The transformed output
Inherited from: OutputGuardrail
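
As a rough illustration of an identity default that a subclass may override, the sketch below uses hypothetical names (`OutputCheck`, `PassThrough`, `AppendSources`) that are not part of llm4s. The source-appending format is likewise an assumption and does not describe how SourceAttributionGuardrail adds citations in Fix mode.

```scala
// Hypothetical model of an output guardrail with an identity default.
trait OutputCheck {
  // Default: return the validated output unchanged.
  def transform(output: String): String = output
}

// Uses the inherited identity transform.
object PassThrough extends OutputCheck

// Assumed example override: append a plain list of source names.
final class AppendSources(sources: Seq[String]) extends OutputCheck {
  override def transform(output: String): String =
    if (sources.isEmpty) output
    else output + "\n\nSources: " + sources.mkString("; ")
}
```

Keeping the default as identity means a guardrail that only validates does not need to override anything.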

Concrete fields

Optional description of what this guardrail validates.

Name of this guardrail for logging and error messages.
