WhisperSpeechToText

org.llm4s.speech.stt.WhisperSpeechToText
final class WhisperSpeechToText(command: Seq[String], model: String, outputFormat: String) extends SpeechToText

Speech-to-text provider that runs Whisper through a command-line tool (whisper.cpp or openai-whisper). The Whisper model and the output format are selected with the model and outputFormat constructor parameters.

Attributes

Supertypes
trait SpeechToText
class Object
trait Matchable
class Any

Members list

Value members

Concrete methods

override def transcribe(input: AudioInput, options: STTOptions): Result[Transcription]

Transcribe audio to text.

Value parameters

input

Audio data to transcribe

options

Configuration for transcription

Attributes

Returns

Result containing Transcription or STTError

Throws
STTError

if transcription fails (wrapped in Result)

Definition Classes
SpeechToText
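
Because transcribe reports failure through its Result return value, callers would normally handle both outcomes explicitly. The following is a minimal sketch of that pattern. It assumes Result behaves like an Either with the error on the left, which this page does not confirm. The AudioInput, STTOptions, STTError and Transcription definitions are simplified, hypothetical stand-ins, and FakeStt is an invented provider so the sketch runs without a Whisper binary.

```scala
object TranscribeSketch {
  // Hypothetical stand-ins for the llm4s types; the real definitions may differ.
  final case class STTError(message: String)
  final case class Transcription(text: String)
  final case class AudioInput(path: String)
  final case class STTOptions(language: Option[String] = None)

  // Assumption: Result is an Either with the error on the left.
  type Result[A] = Either[STTError, A]

  trait SpeechToText {
    def transcribe(input: AudioInput, options: STTOptions): Result[Transcription]
  }

  // Invented provider used only for illustration; it never calls Whisper.
  object FakeStt extends SpeechToText {
    def transcribe(input: AudioInput, options: STTOptions): Result[Transcription] =
      if (input.path.endsWith(".wav")) Right(Transcription("hello world"))
      else Left(STTError(s"unsupported input: ${input.path}"))
  }

  // Handle success and failure explicitly instead of relying on exceptions.
  def describe(stt: SpeechToText, path: String): String =
    stt.transcribe(AudioInput(path), STTOptions()) match {
      case Right(t)  => s"ok: ${t.text}"
      case Left(err) => s"failed: ${err.message}"
    }
}
```

With the real class, the same match would be applied to the value returned by WhisperSpeechToText.transcribe.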

Inherited methods

def isAvailable: Result[Boolean]

Check if this provider is available/healthy. Useful for failover logic and availability checks.

Attributes

Inherited from:
SpeechToText
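
The failover use mentioned for isAvailable could look roughly like the sketch below, which picks the first provider that reports itself as available. It again assumes Result is Either-like; the trait shown is a reduced, hypothetical stand-in for SpeechToText, and Stub and firstAvailable are names invented for this example.

```scala
object FailoverSketch {
  // Hypothetical stand-ins; the real llm4s definitions may differ.
  final case class STTError(message: String)
  type Result[A] = Either[STTError, A] // assumption: error on the left

  trait SpeechToText {
    def name: String
    def isAvailable: Result[Boolean]
  }

  // Invented test double whose availability is fixed at construction time.
  final case class Stub(name: String, status: Result[Boolean]) extends SpeechToText {
    def isAvailable: Result[Boolean] = status
  }

  // Return the first provider reporting Right(true).
  // A Left (the check itself failed) is treated the same as "not available".
  def firstAvailable(providers: List[SpeechToText]): Option[SpeechToText] =
    providers.find(p => p.isAvailable.contains(true))
}
```

Treating a failed check as "unavailable" is one possible policy; a caller that needs to distinguish the two cases would inspect the Left value instead.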

Concrete fields

override val name: String

Unique identifier/name of this provider

override val supportedFormats: List[String]

List supported audio formats (e.g., "audio/wav", "audio/mp3")
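
A caller might consult supportedFormats before attempting a transcription. The helper below is a hypothetical sketch of such a check, not part of the library. It assumes the list holds MIME-type strings like the ones in the description above and compares them case-insensitively.

```scala
object FormatCheckSketch {
  // Case-insensitive membership test against a provider's supportedFormats list.
  // isSupported is an illustrative helper, not an llm4s API.
  def isSupported(supportedFormats: List[String], mimeType: String): Boolean =
    supportedFormats.exists(_.equalsIgnoreCase(mimeType.trim))
}
```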
