Results for "step optimization"
Recovering 3D structure from images.
Predicting future values from past observations.
Using production outcomes to improve models.
Cost of model training.
Measures similarity and projection between vectors.
Sensitivity of a function to input perturbations.
Matrix of first-order derivatives for vector-valued functions.
Direction of steepest ascent of a function.
Measures joint variability between variables.
Maximizing reward without fulfilling real goal.
Learned subsystem that optimizes its own objective.
Using limited human feedback to guide large models.
Explicit output constraints (format, tone).
Asking model to review and improve output.
Coordinating models, tools, and logic.
Requirement to provide explanations.
Limiting inference usage.
Maximum system processing rate.
Mathematical framework for controlling dynamic systems.
Optimizes future actions using a model of dynamics.
Control that remains stable under model uncertainty.
Computing joint angles for desired end-effector pose.
High-fidelity virtual model of a physical system.
Randomizing simulation parameters to improve real-world transfer.
Directly optimizing control policies.
Sampling-based motion planner.
Modeling environment evolution in latent space.
Acting to minimize surprise or free energy.
Ensuring robots do not harm humans.
Learning without catastrophic forgetting.