BenchmarkSuite

org.llm4s.rag.benchmark.BenchmarkSuite
See theBenchmarkSuite companion class

Attributes

Companion
class
Graph
Supertypes
trait Product
trait Mirror
class Object
trait Matchable
class Any
Self type

Members list

Type members

Inherited and Abstract types

type MirroredElemLabels <: Tuple

The names of the product elements

The names of the product elements

Attributes

Inherited from:
Mirror
type MirroredLabel <: String

The name of the type

The name of the type

Attributes

Inherited from:
Mirror

Value members

Concrete methods

def chunkingSuite(datasetPath: String): BenchmarkSuite

Suite comparing different chunking strategies.

Suite comparing different chunking strategies.

Tests: Simple, Sentence, Markdown chunkers with default parameters. Semantic chunking is excluded as it requires embedding client setup.

Attributes

def comprehensiveSuite(datasetPath: String): BenchmarkSuite

Comprehensive suite testing all major dimensions.

Comprehensive suite testing all major dimensions.

Combines chunking, fusion, and basic embedding comparisons. Use for thorough evaluation when time permits.

Attributes

def custom(name: String, description: String, experiments: Seq[RAGExperimentConfig], datasetPath: String): BenchmarkSuite

Create a custom suite from configurations.

Create a custom suite from configurations.

Attributes

def embeddingSuite(datasetPath: String): BenchmarkSuite

Suite comparing different embedding providers.

Suite comparing different embedding providers.

Tests: OpenAI (small/large), Voyage, Ollama embeddings. Note: Requires API keys for cloud providers and local Ollama for local models.

Attributes

def fusionSuite(datasetPath: String): BenchmarkSuite

Suite comparing different fusion strategies.

Suite comparing different fusion strategies.

Tests: RRF with different k values, weighted scoring, and single-source modes.

Attributes

def quickSuite(datasetPath: String, sampleCount: Int): BenchmarkSuite

Quick test suite with minimal experiments for fast validation.

Quick test suite with minimal experiments for fast validation.

Tests just two configurations with a small sample subset.

Attributes