Recovering 3D structure from images.
Monte Carlo method for state estimation.
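The Monte Carlo state-estimation method described here is typically realized as a bootstrap particle filter. A minimal 1-D sketch (the random-walk motion model, noise levels, and particle count are illustrative assumptions, not a definitive implementation):

```python
import numpy as np

rng = np.random.default_rng(0)

def particle_filter(observations, n_particles=1000,
                    process_noise=0.5, obs_noise=1.0):
    """Bootstrap particle filter for a 1-D random-walk state."""
    particles = rng.normal(0.0, 1.0, n_particles)  # initial belief
    estimates = []
    for z in observations:
        # Predict: propagate each particle through the motion model.
        particles = particles + rng.normal(0.0, process_noise, n_particles)
        # Update: weight each particle by the observation likelihood.
        weights = np.exp(-0.5 * ((z - particles) / obs_noise) ** 2)
        weights /= weights.sum()
        # Resample: draw particles in proportion to their weights.
        idx = rng.choice(n_particles, n_particles, p=weights)
        particles = particles[idx]
        estimates.append(particles.mean())
    return estimates

# Track a hidden state that drifts upward under noisy observations.
true_states = np.cumsum(rng.normal(0.1, 0.3, 50))
obs = true_states + rng.normal(0.0, 1.0, 50)
est = particle_filter(obs)
```

The resampling step is what keeps the particle set concentrated on plausible states; without it, most weights collapse toward zero.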
Devices measuring physical quantities (vision, lidar, force, IMU, etc.).
Software pipeline converting raw sensor data into structured representations.
Artificial sensor data generated in simulation.
Perceived actions an environment allows.
Interpreting human gestures.
Penalizes confident wrong predictions heavily; standard for classification and language modeling.
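This is the cross-entropy loss. A minimal sketch of why confident wrong predictions are penalized so heavily (the probabilities are made up for illustration):

```python
import math

def cross_entropy(probs, true_class):
    """Negative log-probability assigned to the correct class."""
    return -math.log(probs[true_class])

# True class is 0. Compare a confidently wrong model to a hedging one:
confident_wrong = [0.01, 0.99]   # puts 1% on the right answer
uncertain       = [0.40, 0.60]   # hedges between the two

print(cross_entropy(confident_wrong, 0))  # ≈ 4.61
print(cross_entropy(uncertain, 0))        # ≈ 0.92
```

The loss grows without bound as the probability on the true class approaches zero, which is exactly the penalty on confident errors.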
Maximum number of tokens the model can attend to in one forward pass; constrains long-document reasoning.
Letting an LLM call external functions/APIs to fetch data, compute, or take actions, improving reliability.
Model-generated content that is fluent but unsupported by evidence or incorrect; mitigated by grounding and verification.
Fine-tuning on (prompt, response) pairs to align a model with instruction-following behaviors.
Automated detection/prevention of disallowed outputs (toxicity, self-harm, illegal instructions, etc.).
Exponential of average negative log-likelihood; lower means better predictive fit, not necessarily better utility.
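The relationship to average negative log-likelihood can be sketched directly (the per-token probabilities here are invented, not from any real model):

```python
import math

def perplexity(token_probs):
    """Exponential of the average negative log-likelihood
    of the probabilities assigned to the observed tokens."""
    nll = [-math.log(p) for p in token_probs]
    return math.exp(sum(nll) / len(nll))

# Probabilities a model assigned to each actual next token.
good_fit = [0.5, 0.6, 0.4, 0.7]
poor_fit = [0.1, 0.2, 0.05, 0.1]

print(perplexity(good_fit))  # ≈ 1.86  (lower: better predictive fit)
print(perplexity(poor_fit))  # = 10.0
```

Equivalently, perplexity is the reciprocal of the geometric mean of the assigned probabilities, which is why the second sequence (geometric mean 0.1) comes out at exactly 10.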
Models trained to decide when to call tools.
Aligns transcripts with audio timestamps.
Temporal and pitch characteristics of speech.
Task instruction without examples.
Assigning a role or identity to the model.
AI supporting legal research, drafting, and analysis.
The field of building systems that perform tasks associated with human intelligence—perception, reasoning, language, planning, and decision-making—via algorithms.
Updating a pretrained model’s weights on task-specific data to improve performance or adapt style/behavior.
Using markers to isolate context segments.
Attention where queries/keys/values come from the same sequence, enabling token-to-token interactions.
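A minimal single-head self-attention sketch in NumPy; the random projection weights are an illustrative assumption, and masking and multi-head machinery are omitted:

```python
import numpy as np

rng = np.random.default_rng(0)

def self_attention(x, d_k=8):
    """Single-head self-attention: Q, K, V all derive from the same sequence x."""
    d_model = x.shape[-1]
    W_q = rng.normal(size=(d_model, d_k))
    W_k = rng.normal(size=(d_model, d_k))
    W_v = rng.normal(size=(d_model, d_k))
    Q, K, V = x @ W_q, x @ W_k, x @ W_v
    scores = Q @ K.T / np.sqrt(d_k)                 # token-to-token similarities
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)  # softmax over keys
    return weights @ V                              # each token: weighted mix of values

x = rng.normal(size=(5, 16))   # 5 tokens, 16-dim embeddings
out = self_attention(x)
print(out.shape)  # (5, 8)
```

Because queries, keys, and values all come from `x`, every output row is a mixture of information from every token in the same sequence.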
Networks with recurrent connections for sequences; largely supplanted by Transformers for many tasks.
Injects sequence order into Transformers, since attention alone is permutation-invariant.
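The fixed sinusoidal scheme from the original Transformer paper is one common choice; a sketch:

```python
import numpy as np

def sinusoidal_positional_encoding(seq_len, d_model):
    """Fixed sin/cos position encodings ('Attention Is All You Need')."""
    positions = np.arange(seq_len)[:, None]     # (seq_len, 1)
    dims = np.arange(0, d_model, 2)[None, :]    # even embedding dimensions
    angles = positions / (10000 ** (dims / d_model))
    pe = np.zeros((seq_len, d_model))
    pe[:, 0::2] = np.sin(angles)                # even dims get sine
    pe[:, 1::2] = np.cos(angles)                # odd dims get cosine
    return pe

pe = sinusoidal_positional_encoding(seq_len=50, d_model=16)
# Added to token embeddings so attention can distinguish positions.
print(pe.shape)  # (50, 16)
```

Learned positional embeddings are an equally common alternative; either way, some position signal must be injected, since attention by itself treats the input as an unordered set.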
An RNN variant using gates to mitigate vanishing gradients and capture longer context.
Converting text into discrete units (tokens) for modeling; subword tokenizers balance vocabulary size and coverage.
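A toy greedy longest-match tokenizer illustrates the subword idea (the tiny hand-made vocabulary is an assumption; real tokenizers such as BPE or WordPiece learn their vocabularies from data):

```python
def tokenize(text, vocab):
    """Greedy longest-match subword tokenization
    (a toy stand-in for BPE/WordPiece)."""
    tokens = []
    i = 0
    while i < len(text):
        # Try the longest substring starting at i that is in the vocabulary.
        for j in range(len(text), i, -1):
            piece = text[i:j]
            if piece in vocab:
                tokens.append(piece)
                i = j
                break
        else:
            tokens.append("<unk>")  # no vocabulary entry covers this character
            i += 1
    return tokens

vocab = {"token", "ization", "un", "break", "able", "s", " "}
print(tokenize("tokenization", vocab))   # ['token', 'ization']
print(tokenize("unbreakables", vocab))   # ['un', 'break', 'able', 's']
```

A larger vocabulary means fewer tokens per text but a bigger embedding table; subword schemes sit between character-level and word-level extremes to balance the two.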
Architecture based on self-attention and feedforward layers; foundation of modern LLMs and many multimodal models.
Training objective where the model predicts the next token given previous tokens (causal modeling).
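The causal objective amounts to shifting the sequence by one position: each token's training target is the token that follows it. A sketch (the token IDs are arbitrary):

```python
def next_token_pairs(token_ids):
    """Build (input, target) pairs for causal language modeling:
    at each position t, the model conditions on tokens[:t+1]
    and is trained to predict tokens[t+1]."""
    inputs = token_ids[:-1]
    targets = token_ids[1:]
    return list(zip(inputs, targets))

seq = [101, 7, 42, 9, 102]  # arbitrary token IDs
print(next_token_pairs(seq))
# [(101, 7), (7, 42), (42, 9), (9, 102)]
```

In a Transformer this shift is combined with a causal attention mask, so every position is trained in parallel while only attending to earlier tokens.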