Results for "learned objectives"
Configuration choices not learned directly (or not typically learned) that govern training or architecture.
Model exploits poorly specified objectives.
Model optimizes objectives misaligned with human values.
Agents have opposing objectives.
The internal space where learned representations live; operations here often correlate with semantics or generative factors.
A parameterized mapping from inputs to outputs; includes architecture + learned parameters.
The learned numeric values of a model adjusted during training to minimize a loss function.
A theoretical framework analyzing what classes of functions can be learned, how efficiently, and with what guarantees.
Early architecture using learned gates for skip connections.
Ensuring learned behavior matches intended objective.
Learned subsystem that optimizes its own objective.
Applying learned patterns incorrectly.
RL using learned or known environment models.
Learned model of environment dynamics.