Results for "text"
Text-to-Speech
Intermediate
Generating speech audio from text, with control over prosody, speaker identity, and style.
Tokenization
Intermediate
Converting text into discrete units (tokens) for modeling; subword tokenizers balance vocabulary size and coverage.
Prompt
Intermediate
The text (and possibly other modalities) given to an LLM to condition its output behavior.
Adversarial Example
Intermediate
Inputs crafted to cause model errors or unsafe behavior, often imperceptible in vision or subtle in text.
Speech Recognition
Intermediate
Converting audio speech into text, often using encoder-decoder or transducer architectures.
CLIP
Intermediate
Joint vision-language model aligning images and text.
Speech Synthesis
Intermediate
Generating human-like speech from text.