BenchmarkResults

org.llm4s.rag.benchmark.BenchmarkResults

See the BenchmarkResults companion object

final case class BenchmarkResults(suite: BenchmarkSuite, results: Seq[ExperimentResult], startTime: Long, endTime: Long)

Results from running a complete benchmark suite.

Companion: object
Supertypes: trait Serializable, trait Product, trait Equals, class Object, trait Matchable, class Any

Get average scores across all experiments for each metric.
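As a rough illustration of per-metric averaging, the sketch below works on a plain map of experiment name -> metric name -> score rather than on the real `ExperimentResult` type, whose fields are not shown on this page. The object name `AverageSketch` and the function `averageScores` are hypothetical, and the sketch assumes a metric missing from an experiment is simply left out of that metric's average.

```scala
object AverageSketch {
  // Hypothetical helper, not the library's implementation.
  // Input: experiment name -> (metric name -> score).
  def averageScores(table: Map[String, Map[String, Double]]): Map[String, Double] =
    table.values.toSeq
      .flatMap(_.toSeq)   // flatten to (metric, score) pairs
      .groupBy(_._1)      // group the pairs by metric name
      .map { case (metric, pairs) =>
        // Average only over the experiments that reported this metric.
        metric -> (pairs.map(_._2).sum / pairs.size)
      }
}
```

Whether the real method averages over all experiments or only successful ones is not stated here, so treat the handling of missing metrics as an assumption.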

Compare two experiments by name. Returns a pair: (difference in RAGAS score, comparison details).
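A minimal sketch of a name-based comparison follows. The stand-in type `Scored`, the function `compare`, the `Option` wrapper for unknown names, the sign convention (first minus second), and the format of the details string are all assumptions made for illustration. The page only states that a score difference and comparison details are returned.

```scala
object CompareSketch {
  // Hypothetical stand-in for an experiment result with a RAGAS score.
  final case class Scored(name: String, ragasScore: Double)

  // Returns None when either name is unknown (assumed behaviour).
  def compare(results: Seq[Scored], a: String, b: String): Option[(Double, String)] =
    for {
      ra <- results.find(_.name == a)
      rb <- results.find(_.name == b)
    } yield {
      val diff = ra.ragasScore - rb.ragasScore // positive when `a` scores higher
      val details = s"$a (${ra.ragasScore}) vs $b (${rb.ragasScore})"
      (diff, details)
    }
}
```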

All failed results

Number of failed experiments

Get result for a specific experiment.

Get the metric comparison table. Returns a map of experiment name -> (map of metric name -> score).

Get results ranked by RAGAS score (highest first).
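Ranking highest-first can be sketched as a descending sort on the score. The stand-in type `Scored` and the function `rankedByRagas` are hypothetical. Modelling the score as an `Option` and dropping unscored (for example, failed) experiments from the ranking is an assumption, not documented behaviour.

```scala
object RankingSketch {
  // Hypothetical stand-in; a failed experiment is assumed to carry no score.
  final case class Scored(name: String, ragasScore: Option[Double])

  def rankedByRagas(results: Seq[Scored]): Seq[Scored] =
    results
      .collect { case r @ Scored(_, Some(score)) => (score, r) } // keep scored results only
      .sortBy { case (score, _) => -score }                      // highest score first
      .map(_._2)
}
```

Under these assumptions, `rankedByRagas(results).headOption` would give the top-scoring experiment, or `None` if nothing was scored.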

Number of successful experiments

All successful results
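The successful and failed views, along with their counts, can be thought of as one partition of the result list. In the sketch below, `Outcome`, its `success` flag, `Summary`, and `summarize` are hypothetical names. How the real `ExperimentResult` marks success is not shown on this page.

```scala
object PartitionSketch {
  // Hypothetical stand-in for an experiment result with a success flag.
  final case class Outcome(name: String, success: Boolean)

  final case class Summary(successful: Seq[Outcome], failed: Seq[Outcome]) {
    def successCount: Int = successful.size
    def failureCount: Int = failed.size
  }

  // Split once; every result lands in exactly one of the two groups.
  def summarize(results: Seq[Outcome]): Summary = {
    val (ok, bad) = results.partition(_.success)
    Summary(ok, bad)
  }
}
```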

Total benchmark duration in milliseconds

Total benchmark duration in seconds
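Both durations presumably derive from the `startTime` and `endTime` fields in the signature. The sketch below assumes these are epoch timestamps in milliseconds (as produced by `System.currentTimeMillis()`). The stand-in type `Timing` and its member names are hypothetical.

```scala
object DurationSketch {
  // Hypothetical stand-in keeping only the two timestamp fields.
  final case class Timing(startTime: Long, endTime: Long) {
    // Assumes both timestamps are in milliseconds.
    def durationMs: Long = endTime - startTime
    // Floating-point division keeps the fractional seconds.
    def durationSeconds: Double = durationMs / 1000.0
  }
}
```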

Get the best performing experiment.
