TokenizerId

org.llm4s.identity.TokenizerId

See the TokenizerId companion object

case class TokenizerId(name: String)

Identifies a BPE tokenizer vocabulary by its canonical name.

Tokenizer IDs are used by context-window estimation logic to select the correct byte-pair-encoding vocabulary for a given model, so that prompt and completion token counts are accurate without calling the provider API. The mapping from ModelId to TokenizerId is maintained by the context package.

Value parameters

name: Tokenizer vocabulary name as used by tiktoken and related libraries (e.g. "cl100k_base" for GPT-4 / GPT-3.5).
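
The following is a minimal sketch of how a TokenizerId might be constructed and looked up, not the library's actual mapping. It declares a local stand-in case class with the same signature as the one documented here so that it runs without llm4s on the classpath. The `TokenizerIdSketch` object, the `tokenizerFor` helper, and the lookup table keyed by plain model-name strings are all assumptions made for illustration; the real ModelId-to-TokenizerId mapping is maintained by the context package.

```scala
// Local stand-in mirroring the documented signature (assumption: used here
// only so the sketch compiles on its own, without the llm4s dependency).
case class TokenizerId(name: String)

object TokenizerIdSketch {
  // Hypothetical lookup table keyed by model name. The entries follow the
  // example in the parameter description ("cl100k_base" for GPT-4 / GPT-3.5).
  private val byModelName: Map[String, TokenizerId] = Map(
    "gpt-4"         -> TokenizerId("cl100k_base"),
    "gpt-3.5-turbo" -> TokenizerId("cl100k_base")
  )

  // Returns None when no vocabulary is registered for the model, so callers
  // must decide on a fallback explicitly instead of miscounting tokens.
  def tokenizerFor(modelName: String): Option[TokenizerId] =
    byModelName.get(modelName)

  def main(args: Array[String]): Unit = {
    val id = TokenizerId("cl100k_base")

    // The wrapped vocabulary name is available through the `name` field.
    println(id.name)

    // Case classes compare structurally, so two IDs with the same name are equal.
    println(id == TokenizerId("cl100k_base"))

    // Lookup of a known and an unknown model name.
    println(tokenizerFor("gpt-4"))
    println(tokenizerFor("unknown-model"))
  }
}
```

Wrapping the vocabulary name in a dedicated type, instead of passing a bare String, lets the compiler distinguish tokenizer names from model names and other string identifiers.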

Attributes

Companion: object
Supertypes: trait Serializable, trait Product, trait Equals, class Object, trait Matchable, class Any

Members list

Value members

Inherited methods

Attributes

Inherited from: Product

Attributes

Inherited from: Product
