Results for "collective behavior"
Maximizing reward without fulfilling real goal.
Model optimizes objectives misaligned with human values.
Assigning a role or identity to the model.
Small prompt changes cause large output changes.
Required descriptions of model behavior and limits.
Mechanism to disable AI system.
AI used without governance approval.
Optimizes future actions using a model of dynamics.
Control that remains stable under model uncertainty.
Motion considering forces and mass.
Mathematical representation of friction forces.
Artificial environment for training/testing agents.
Differences between simulated and real physics.
Artificial sensor data generated in simulation.
Learning physical parameters from data.
Directly optimizing control policies.
Modifying reward to accelerate learning.
Learning policies from expert demonstrations.
Acting to minimize surprise or free energy.
Imagined future trajectories.
Humans assist or override autonomous behavior.
Inferring human goals from behavior.
Closed loop linking sensing and acting.
Mathematical guarantees of system behavior.
Fabrication of cases or statutes by LLMs.
Identifying suspicious transactions.
AI discovering new compounds/materials.
Risk of incorrect financial models.
Rules governing auctions.
Agents fail to coordinate optimally.