ExtractedDocument

org.llm4s.rag.extract.ExtractedDocument
final case class ExtractedDocument(text: String, metadata: Map[String, String], format: DocumentFormat)

Extracted document content with metadata.

Represents the result of extracting text from a document, including any metadata that could be extracted (title, author, etc.)

Value parameters

format

The detected document format

metadata

Document metadata (title, author, pageCount, etc.)

text

The extracted text content

Attributes

Graph
Supertypes
trait Serializable
trait Product
trait Equals
class Object
trait Matchable
class Any
Show all

Members list

Value members

Inherited methods

def productElementNames: Iterator[String]

Attributes

Inherited from:
Product
def productIterator: Iterator[Any]

Attributes

Inherited from:
Product