Skip to content

tokenization_utils

Initialization for Hugging Face tokenization utilities.

Modules:

Name Description
llama

Hugging Face chat templates for Llama models.

noise_tokenizer

A module for augmenting text data with masks for noise injection.

serialization

Utilities for serializing and deserializing Hugging Face tokenizers.

universal

Universal schema mappers for building LLM prompts.