Results for "transfer problem"
Ensuring learned behavior matches intended objective.
Sampling multiple outputs and selecting consensus.
Asking model to review and improve output.
Prompt augmented with retrieved documents.
Enables external computation or lookup.
Finding control policies minimizing cumulative cost.
Optimal control for linear systems with quadratic cost.
Inferring reward function from observed behavior.
Imagined future trajectories.
Collective behavior without central control.
AI capable of performing most intellectual tasks humans can.
Awareness and regulation of internal processes.