org.llm4s.rag.loader.SourceBackedLoader
See theSourceBackedLoader companion object
final case class SourceBackedLoader(source: DocumentSource, extractor: DocumentExtractor, additionalMetadata: Map[String, String], defaultHints: Option[DocumentHints]) extends DocumentLoader
Bridge between DocumentSource and DocumentLoader.
SourceBackedLoader converts any DocumentSource into a DocumentLoader, enabling documents from S3, GCS, databases, or any custom source to be used with the RAG pipeline.
The loader:
- Lists documents from the source
- Reads document content (bytes)
- Extracts text using DocumentExtractor
- Creates Document objects with appropriate metadata and hints
Usage:
// From S3
val s3Source = S3DocumentSource("my-bucket", "docs/")
val loader = SourceBackedLoader(s3Source)
rag.sync(loader)
// With custom extractor
val loader = SourceBackedLoader(source, customExtractor)
Value parameters
- additionalMetadata
-
Extra metadata to add to all documents
- defaultHints
-
Default processing hints for documents
- extractor
-
Document extractor for text extraction (default: DefaultDocumentExtractor)
- source
-
The document source to load from
Attributes
- Companion
- object
- Graph
-
- Supertypes
-
trait Serializabletrait Producttrait Equalstrait DocumentLoaderclass Objecttrait Matchableclass Any
Members list
In this article