Results for "collective behavior"

24 results

Swarm Dynamics Advanced

Collective behavior without central control.

Dynamics & Physics
Cooperative Game Advanced

Agents optimize collective outcomes.

Agents & Autonomy
Prompt Intermediate

The text (and possibly other modalities) given to an LLM to condition its output behavior.

Prompting & Instructions
Prompt Engineering Intermediate

Crafting prompts to elicit desired behavior, often using role, structure, constraints, and examples.

Prompting & Instructions
System Prompt Intermediate

A high-priority instruction layer setting overarching behavior constraints for a chat model.

Reinforcement Learning
Fine-Tuning Intermediate

Updating a pretrained model’s weights on task-specific data to improve performance or adapt style/behavior.

Large Language Models
Alignment Intermediate

Ensuring model behavior matches human goals, norms, and constraints, including reducing harmful or deceptive outputs.

Foundations & Theory
Guardrails Intermediate

Rules and controls around generation (filters, validators, structured outputs) to reduce unsafe or invalid behavior.

Reinforcement Learning
LIME Intermediate

Local surrogate explanation method approximating model behavior near a specific input.

Foundations & Theory
Adversarial Example Intermediate

Inputs crafted to cause model errors or unsafe behavior, often using perturbations that are imperceptible in images or subtle in text.

Foundations & Theory
Backdoor / Trojan Intermediate

Hidden behavior activated by specific triggers, causing targeted mispredictions or undesired outputs.

Foundations & Theory
Orchestration Intermediate

Coordinating tools, models, and steps (retrieval, calls, validation) to deliver reliable end-to-end behavior.

Foundations & Theory
Inner Alignment Advanced

Ensuring the learned behavior matches the intended objective.

AI Safety & Alignment
Model Documentation Intermediate

Required descriptions of model behavior and limits.

Governance & Ethics
Inverse Reinforcement Learning Advanced

Inferring a reward function from observed behavior.

Reinforcement Learning
Commonsense Physics Frontier

Human-like understanding of physical behavior.

World Models & Cognition
Human-in-the-Loop Control Frontier

Humans assist or override autonomous behavior.

World Models & Cognition
Intent Recognition Frontier

Inferring human goals from behavior.

World Models & Cognition
Formal Verification Advanced

Mathematical guarantees of system behavior.

Agents & Autonomy
Emergence Advanced

System-level behavior arising from interactions.

Dynamics & Physics
Tripwire Advanced

Signals indicating dangerous behavior.

AI Safety & Alignment
Behavior Cloning Advanced

Learning a mapping from observations to actions directly from demonstrations.

Reinforcement Learning
Herding Behavior Advanced

Agents copy others’ actions.

Dynamics & Physics
Power-Seeking Behavior Advanced

Tendency to acquire control or resources.

AI Safety & Alignment