org.llm4s.speech.stt
package org.llm4s.speech.stt
Members list
Type members
Classlikes
Attributes
object STTError
final case class STTOptions(language: Option[String], prompt: Option[String], enableTimestamps: Boolean, diarization: Boolean)
Models for speech-to-text.
Models for speech-to-text.
Attributes
- Supertypes
-
trait Serializabletrait Producttrait Equalsclass Objecttrait Matchableclass AnyShow all
trait SpeechToText
Abstraction for speech-to-text providers.
Abstraction for speech-to-text providers.
Attributes
- Supertypes
-
class Objecttrait Matchableclass Any
- Known subtypes
-
class VoskSpeechToTextclass WhisperSpeechToText
final case class Transcription(text: String, language: Option[String], confidence: Option[Double], timestamps: List[WordTimestamp], meta: Option[AudioMeta])
Attributes
- Supertypes
-
trait Serializabletrait Producttrait Equalsclass Objecttrait Matchableclass AnyShow all
final class VoskSpeechToText(modelPath: Option[String]) extends SpeechToText
Vosk-based speech-to-text implementation. Replaces Sphinx4 as it's more actively maintained and has better performance.
Vosk-based speech-to-text implementation. Replaces Sphinx4 as it's more actively maintained and has better performance.
Attributes
- Supertypes
final class WhisperSpeechToText(command: Seq[String], model: String, outputFormat: String) extends SpeechToText
Enhanced Whisper integration via CLI (whisper.cpp or openai-whisper). Supports various Whisper models and output formats.
Enhanced Whisper integration via CLI (whisper.cpp or openai-whisper). Supports various Whisper models and output formats.
Attributes
- Supertypes
Attributes
- Supertypes
-
trait Serializabletrait Producttrait Equalsclass Objecttrait Matchableclass AnyShow all
In this article