org.llm4s.chunking.MarkdownChunker
See theMarkdownChunker companion object
class MarkdownChunker extends DocumentChunker
Markdown-aware document chunker.
Preserves markdown structure by:
- Respecting heading boundaries (# through ######)
- Keeping code blocks intact when possible
- Tracking heading hierarchy in chunk metadata
- Preserving list structure
This chunker produces higher quality chunks for markdown content because it understands document structure.
Usage:
val chunker = MarkdownChunker()
val chunks = chunker.chunk(markdownText, ChunkingConfig(targetSize = 800))
chunks.foreach { c =>
val headingPath = c.metadata.headings.mkString(" > ")
println(s"[$$headingPath] $${c.content.take(50)}...")
}
Attributes
- Companion
- object
- Graph
-
- Supertypes
Members list
In this article