GlobPatternMatcher

org.llm4s.rag.loader.internal.GlobPatternMatcher

Utility for matching URLs against glob-style patterns.

Supports:

  • * matches any string (non-greedy within path segments)
  • ** matches any string, including path separators
  • ? matches a single character
  • Any other character is matched literally

Examples:

  • *.example.com/* matches any subdomain and path
  • example.com/docs/** matches any path under /docs/
  • example.com/page?.html matches page1.html, page2.html, etc.

Attributes

Supertypes
class Object
trait Matchable
class Any

Members list

Value members

Concrete methods

def clearCache(): Unit

Clear the pattern cache (for testing or memory management).
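
What the cache stores is not stated here. One plausible reading is that compiled patterns are memoized so repeated matches skip recompilation. The sketch below illustrates that idea under this assumption; PatternCacheSketch, compiled, and size are hypothetical names, not library members.

```scala
import java.util.concurrent.ConcurrentHashMap
import java.util.regex.Pattern

// Hypothetical illustration of a clearable pattern cache; not part of llm4s.
object PatternCacheSketch {
  // Thread-safe map from regex source text to its compiled form.
  private val cache = new ConcurrentHashMap[String, Pattern]()

  // Compile on first use, then reuse the cached instance.
  def compiled(regex: String): Pattern =
    cache.computeIfAbsent(regex, (r: String) => Pattern.compile(r))

  // Number of cached patterns, exposed here only for inspection.
  def size: Int = cache.size()

  // Drop every cached pattern, e.g. between tests or to release memory.
  def clearCache(): Unit = cache.clear()
}
```

Clearing between tests keeps one test's compiled patterns from leaking into the next, which matches the "for testing or memory management" note above.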

def filter(urls: Seq[String], includePatterns: Seq[String], excludePatterns: Seq[String]): Seq[String]

Filter a list of URLs by patterns.

Value parameters

  • urls: URLs to filter
  • includePatterns: patterns to include (empty = include all)
  • excludePatterns: patterns to exclude

Returns

URLs that match the include patterns (or all URLs if none are given) and do not match any exclude pattern
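
The include/exclude rule can be sketched as follows. This is an illustration of the documented semantics, not the library's code: FilterSketch is a hypothetical name, and the matcher is passed in as a function so the sketch does not depend on any particular glob implementation.

```scala
// Hypothetical illustration of include/exclude filtering; not part of llm4s.
object FilterSketch {
  def filter(
    urls: Seq[String],
    includePatterns: Seq[String],
    excludePatterns: Seq[String],
    matches: (String, String) => Boolean // (url, pattern) => does it match?
  ): Seq[String] =
    urls.filter { url =>
      // An empty include list means "include everything".
      val included =
        includePatterns.isEmpty || includePatterns.exists(p => matches(url, p))
      // Any matching exclude pattern removes the URL.
      val excluded = excludePatterns.exists(p => matches(url, p))
      included && !excluded
    }
}
```

The explicit isEmpty check matters because exists on an empty sequence is false; without it, an empty include list would reject every URL instead of accepting all of them.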

def matches(url: String, pattern: String): Boolean

Check if a URL matches a glob pattern.

Value parameters

  • url: URL to check
  • pattern: glob pattern

Returns

true if the URL matches the pattern

def matchesAny(url: String, patterns: Seq[String]): Boolean

Check if a URL matches any of the given patterns.

Value parameters

  • url: URL to check
  • patterns: glob patterns to match against

Returns

true if the URL matches any of the patterns