Results for "text"
Context Window
Intermediate
Maximum number of tokens the model can attend to in one forward pass; constrains long-document reasoning.
The context window is like a pair of glasses that lets a language model see only a fixed number of words at a time. If the model can see just a few words, it may miss important details that come later in a long sentence or paragraph. For example, if you’re reading a book and can only see one page...
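The constraint above can be sketched in a few lines: when a sequence outgrows the window, a common (illustrative) strategy is to keep only the most recent tokens. The function name and window size here are assumptions for demonstration, not any particular library's API.

```python
# Minimal sketch: enforcing a context window by keeping only the most
# recent tokens. The window size of 8 is illustrative.

def truncate_to_window(tokens, window=8):
    """Keep only the last `window` tokens the model can attend to."""
    return tokens[-window:]

history = list(range(20))                    # 20 token ids
visible = truncate_to_window(history, window=8)
print(visible)                               # the model "sees" only the last 8
```

Real systems may instead summarize or retrieve older context rather than simply dropping it.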
Generating speech audio from text, with control over prosody, speaker identity, and style.
Generating human-like speech from text.
Converting text into discrete units (tokens) for modeling; subword tokenizers balance vocabulary size and coverage.
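The subword idea in this definition can be illustrated with a toy greedy longest-match tokenizer. The vocabulary here is invented for the example; production tokenizers (BPE, WordPiece, SentencePiece) learn their vocabularies from data.

```python
# Toy greedy longest-match subword tokenizer (illustrative vocabulary).

VOCAB = {"token", "tok", "iz", "ation", "er", "a", "t", "i", "o", "n", "z"}

def tokenize(word):
    pieces, i = [], 0
    while i < len(word):
        # take the longest vocabulary entry matching at position i
        for j in range(len(word), i, -1):
            if word[i:j] in VOCAB:
                pieces.append(word[i:j])
                i = j
                break
        else:
            pieces.append(word[i])  # fall back to a single character
            i += 1
    return pieces

print(tokenize("tokenization"))  # ['token', 'iz', 'ation']
```

A larger vocabulary means fewer, longer pieces per word but a bigger embedding table; that is the size/coverage balance the definition mentions.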
Joint vision-language model aligning images and text.
A model that assigns probabilities to sequences of tokens; often trained by next-token prediction.
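"Assigns probabilities to sequences" usually means factoring the joint probability by the chain rule, P(w1..wn) = Π P(wi | w1..wi-1). A sketch with invented conditional probabilities:

```python
# Sketch: sequence probability as a product of next-token conditionals.
# The three probabilities below are made up for illustration.
import math

cond_probs = [0.5, 0.2, 0.1]   # P(w1), P(w2|w1), P(w3|w1,w2)
log_prob = sum(math.log(p) for p in cond_probs)   # sum logs for stability
print(math.exp(log_prob))      # joint probability: 0.5 * 0.2 * 0.1 = 0.01
```

Working in log space, as here, is standard practice because products of many small probabilities underflow.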
A high-capacity language model trained on massive corpora, exhibiting broad generalization and emergent behaviors.
Generates sequences one token at a time, conditioning on past tokens.
The text (and possibly other modalities) given to an LLM to condition its output behavior.
Converting audio speech into text, often using encoder-decoder or transducer architectures.
Mechanism that computes context-aware mixtures of representations; scales well and captures long-range dependencies.
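The "context-aware mixture" in this definition is concretely a softmax over scaled dot products. A minimal single-head sketch on toy 2-d vectors, with no learned projections (those are omitted for brevity):

```python
# Minimal scaled dot-product self-attention (single head, toy vectors).
import math

def attention(q, k, v):
    d = len(q[0])
    out = []
    for qi in q:
        # similarity of this query to every key, scaled by sqrt(d)
        scores = [sum(a * b for a, b in zip(qi, kj)) / math.sqrt(d) for kj in k]
        m = max(scores)
        exps = [math.exp(s - m) for s in scores]
        weights = [e / sum(exps) for e in exps]          # softmax
        # output = attention-weighted mixture of value rows
        out.append([sum(w * vj[t] for w, vj in zip(weights, v))
                    for t in range(len(v[0]))])
    return out

x = [[1.0, 0.0], [0.0, 1.0]]
print(attention(x, x, x))   # each row is a weighted mix of all value rows
```

Because every position attends to every other in one step, long-range dependencies cost the same as short-range ones, at the price of quadratic work in sequence length.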
Architecture based on self-attention and feedforward layers; foundation of modern LLMs and many multimodal models.
Training objective where the model predicts the next token given previous tokens (causal modeling).
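This objective is typically optimized as cross-entropy on the true next token at each position. A sketch with invented per-position probabilities:

```python
# Sketch of the next-token objective: average negative log-probability
# the model assigns to each true "next" token. Probabilities are invented.
import math

# P(correct next token | prefix) at each of 3 positions
p_true = [0.9, 0.6, 0.3]
loss = -sum(math.log(p) for p in p_true) / len(p_true)
print(round(loss, 4))
```

Driving this loss down pushes each conditional probability toward 1, which is exactly "predict the next token given the previous ones."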
Human or automated process of assigning targets; quality, consistency, and guidelines matter heavily.
Expanding training data via transformations (flips, noise, paraphrases) to improve robustness.
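For text, one of the paraphrase-style transformations mentioned above can be sketched as random synonym substitution. The synonym table and swap probability are invented for illustration; real pipelines use paraphrasing models, noise injection, and back-translation.

```python
# Toy text augmentation: randomly swap words for listed synonyms.
import random

SYNONYMS = {"quick": "fast", "happy": "glad"}

def augment(sentence, rng):
    words = sentence.split()
    return " ".join(
        SYNONYMS[w] if w in SYNONYMS and rng.random() < 0.5 else w
        for w in words
    )

rng = random.Random(0)          # seeded for reproducibility
print(augment("the quick happy fox", rng))
```

Each call can yield a slightly different sentence with the same meaning, which is what makes the technique useful for robustness.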
Inputs crafted to cause model errors or unsafe behavior, often imperceptible in vision or subtle in text.
Attacks that manipulate model instructions (especially via retrieved content) to override system goals or exfiltrate data.
Models that process or generate multiple modalities, enabling vision-language tasks, speech, video understanding, etc.
Prevents attention to future tokens during training/inference.
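The mask described above is just a lower-triangular boolean matrix: position i may attend only to positions j ≤ i.

```python
# Causal (look-ahead) mask: True means attention is allowed.

def causal_mask(n):
    return [[j <= i for j in range(n)] for i in range(n)]

for row in causal_mask(4):
    print(row)
```

In practice the disallowed positions get their attention scores set to -inf before the softmax, so they receive zero weight.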
AI subfield dealing with understanding and generating human language, including syntax, semantics, and pragmatics.
Stores past attention states to speed up autoregressive decoding.
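The speedup comes from computing each token's key and value once and reusing them at every later decoding step. A structural sketch (the class and toy key/value shapes are illustrative, not a real framework's cache):

```python
# Sketch of a key/value cache for autoregressive decoding.

class KVCache:
    def __init__(self):
        self.keys, self.values = [], []

    def step(self, k, v):
        # append this step's key/value; attention then reads the full cache
        self.keys.append(k)
        self.values.append(v)
        return self.keys, self.values

cache = KVCache()
for t in range(3):
    ks, vs = cache.step(k=[t], v=[t * 2])
print(ks)   # keys for all 3 past tokens, each computed only once
```

Without the cache, every step would recompute keys and values for the entire prefix, making generation quadratic instead of linear per token.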
Models that learn to generate samples resembling training data.
Combining signals from multiple modalities.
Attention between different modalities.
Generates audio waveforms from spectrograms.
AI supporting legal research, drafting, and analysis.