Converting text into discrete units (tokens) for modeling; subword tokenizers balance vocabulary size and coverage.
Why It Matters
Tokenization is a critical step in natural language processing, as it prepares text data for analysis and modeling. The choice of tokenization method can greatly affect the performance of AI models, influencing their ability to understand and generate language. Effective tokenization is essential for applications like chatbots, translation services, and text analysis.
Tokenization is the process of converting text into discrete units, known as tokens, which can be processed by machine learning models. This process is fundamental in natural language processing, as it transforms raw text into a structured format suitable for modeling. Various tokenization strategies exist, including word-level, character-level, and subword tokenization methods such as Byte Pair Encoding (BPE). Subword tokenization strikes a balance between vocabulary size and coverage, allowing models to handle rare words and morphological variations effectively. The choice of tokenization method can significantly impact model performance, as it influences the representation of input data and the model's ability to generalize across different contexts. Tokenization is often the first step in preparing text data for training and inference in NLP tasks.
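The Byte Pair Encoding step mentioned above can be sketched in a few lines. This is a minimal illustration, not a production tokenizer: it learns merge rules by repeatedly fusing the most frequent adjacent symbol pair, starting from characters with a `</w>` end-of-word marker (the toy corpus and merge count are invented for the example):

```python
import re
from collections import Counter

def get_pair_counts(words):
    # Count adjacent symbol pairs across the corpus, weighted by word frequency.
    pairs = Counter()
    for word, freq in words.items():
        symbols = word.split()
        for a, b in zip(symbols, symbols[1:]):
            pairs[(a, b)] += freq
    return pairs

def merge_pair(pair, words):
    # Fuse every occurrence of the pair into a single symbol.
    # Lookarounds ensure we only match whole symbols, not substrings of symbols.
    pattern = re.compile(r"(?<!\S)" + re.escape(" ".join(pair)) + r"(?!\S)")
    return {pattern.sub("".join(pair), word): freq for word, freq in words.items()}

def learn_bpe(corpus, num_merges):
    # Start from character-level symbols with an end-of-word marker.
    words = Counter(" ".join(word) + " </w>" for word in corpus)
    merges = []
    for _ in range(num_merges):
        pairs = get_pair_counts(words)
        if not pairs:
            break
        best = max(pairs, key=pairs.get)  # most frequent adjacent pair
        words = merge_pair(best, words)
        merges.append(best)
    return merges

corpus = ["low", "low", "lower", "newest", "newest", "newest", "widest"]
print(learn_bpe(corpus, 10))
```

Frequent fragments like "est" emerge as reusable subwords, which is how BPE covers rare words and morphological variants without an enormous vocabulary.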
Tokenization is like breaking a sentence into smaller pieces, called tokens, so that a computer can understand it better. For example, the sentence 'I love AI' can be split into three tokens: 'I', 'love', and 'AI'. This process helps models analyze and process text more effectively. There are different ways to tokenize text, such as breaking it down by words or even smaller parts, which helps the model deal with new or rare words. Tokenization is an important first step in teaching computers to understand language.
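The 'I love AI' example corresponds to the simplest possible tokenizer, splitting on whitespace. A one-line sketch in Python (real word-level tokenizers also handle punctuation, casing, and contractions):

```python
sentence = "I love AI"
tokens = sentence.split()  # naive whitespace tokenization
print(tokens)  # → ['I', 'love', 'AI']
```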