Results for "trigger-based behavior"
Detects trigger phrases in audio streams.
Signals indicating dangerous behavior.
Detecting unauthorized model outputs or data leaks.
Agent calls external tools dynamically.
AI reinforcing market trends.
Tendency to seek control or resources.
Agents copy others’ actions.
Ensuring learned behavior matches intended objective.
Distributed agents producing emergent intelligence.
The text (and possibly other modalities) given to an LLM to condition its output behavior.
Learning a state-to-action mapping directly from demonstrations.
Collective behavior without central control.
Emergence of conventions among agents.
A high-priority instruction layer setting overarching behavior constraints for a chat model.
Ensuring model behavior matches human goals, norms, and constraints, including reducing harmful or deceptive outputs.
Inferring a reward function from observed behavior.
Rules and controls around generation (filters, validators, structured outputs) to reduce unsafe or invalid behavior.
Human-like understanding of physical behavior.
Decisions dependent on others’ actions.
RL using learned or known environment models.
Local surrogate explanation method approximating model behavior near a specific input.
Inputs crafted to cause model errors or unsafe behavior, often imperceptible in vision or subtle in text.
Hidden behavior activated by specific triggers, causing targeted mispredictions or undesired outputs.
Mathematical framework for controlling dynamic systems.
Model behaves well during training but not in deployment.
The physical system being controlled.
Equations governing how system states change over time.
Modeling interactions with environment.
Mechanics of price formation.
Modeling chemical systems computationally.