Results for "direct optimization"
Using limited human feedback to guide large models.
Explicit output constraints (format, tone).
Asking model to review and improve output.
Breaking tasks into sub-steps.
Requirement to provide explanations.
Coordinating models, tools, and logic.
Limiting inference usage.
Maximum system processing rate.
Mathematical framework for controlling dynamic systems.
Optimizes future actions using a model of dynamics.
Control that remains stable under model uncertainty.
Computing joint angles for desired end-effector pose.
High-fidelity virtual model of a physical system.
Randomizing simulation parameters to improve real-world transfer.
Sampling-based motion planner.
Modeling environment evolution in latent space.
Acting to minimize surprise or free energy.
Ensuring robots do not harm humans.
Learning without catastrophic forgetting.
Fabrication of cases or statutes by LLMs.
AI-driven buying/selling of financial assets.
AI applied to scientific problems.
Finding mathematical equations from data.
Designing efficient marketplaces.
Collective behavior without central control.
Stored compute or algorithms enabling rapid jumps.
Tradeoff between safety and performance.
Regulating access to large-scale compute.
The field of building systems that perform tasks associated with human intelligence—perception, reasoning, language, planning, and decision-making—via algori...
Updating a pretrained model’s weights on task-specific data to improve performance or adapt style/behavior.