Results for "modality alignment"
Robust Alignment
Advanced
Maintaining alignment under new conditions.
Alignment
Intermediate
Ensuring model behavior matches human goals, norms, and constraints, including reducing harmful or deceptive outputs.
Forced Alignment
Intermediate
Aligns transcripts with audio timestamps.
Alignment Problem
Advanced
Ensuring AI systems pursue intended human goals.
Outer Alignment
Advanced
Correctly specifying goals.
Inner Alignment
Advanced
Ensuring learned behavior matches intended objective.
Deceptive Alignment
Advanced
Model behaves well during training but not deployment.
Alignment Tax
Advanced
Tradeoff between safety and performance.
Alignment Research
Intermediate
Research ensuring AI remains safe.