Results for "collective behavior"
Collective behavior without central control.
Agents optimize collective outcomes.
The text (and possibly other modalities) given to an LLM to condition its output behavior.
Crafting prompts to elicit desired behavior, often using role, structure, constraints, and examples.
A high-priority instruction layer setting overarching behavior constraints for a chat model.
Updating a pretrained model’s weights on task-specific data to improve performance or adapt style/behavior.
Ensuring model behavior matches human goals, norms, and constraints, including reducing harmful or deceptive outputs.
Rules and controls around generation (filters, validators, structured outputs) to reduce unsafe or invalid behavior.
Local surrogate explanation method approximating model behavior near a specific input.
Inputs crafted to cause model errors or unsafe behavior, often imperceptible in vision or subtle in text.
Hidden behavior activated by specific triggers, causing targeted mispredictions or undesired outputs.
Coordinating tools, models, and steps (retrieval, calls, validation) to deliver reliable end-to-end behavior.
Ensuring learned behavior matches intended objective.
Required descriptions of model behavior and limits.
Inferring reward function from observed behavior.
Human-like understanding of physical behavior.
Humans assist or override autonomous behavior.
Inferring human goals from behavior.
Mathematical guarantees of system behavior.
System-level behavior arising from interactions.
Signals indicating dangerous behavior.
Learning action mapping directly from demonstrations.
Agents copy others’ actions.
Tendency to gain control/resources.