Results for "policy consistency"
Systematic review of model/data processes to ensure performance, fairness, security, and policy compliance.
All possible configurations an agent may encounter.
Formal framework for sequential decision-making under uncertainty.
Fundamental recursive relationship defining optimal value functions.
Expected return of taking an action in a state and following the policy thereafter.
Formal model linking causal mechanisms and variables.
Expected causal effect of a treatment.
Ensuring learned behavior matches the intended objective.
Randomizing simulation parameters to improve real-world transfer.
RL using learned or known environment models.
Learning policies from expert demonstrations.
Inferring a reward function from observed behavior.
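
Example (illustrative sketch, not part of the listed results): the recursive relationship for optimal value functions and the expected return of an action in a state, both defined above, can be made concrete with value iteration on a tiny decision process. The two-state environment, its rewards, the discount `GAMMA`, and the names `P`, `q_value`, and `value_iteration` are all assumptions invented for this illustration.

```python
# Hypothetical two-state, two-action decision process (made-up numbers).
GAMMA = 0.9  # discount factor, assumed for illustration

# P[state][action] is a list of (probability, next_state, reward) triples.
P = {
    0: {0: [(1.0, 0, 0.0)], 1: [(1.0, 1, 1.0)]},
    1: {0: [(1.0, 0, 0.0)], 1: [(1.0, 1, 2.0)]},
}


def q_value(V, s, a):
    """Expected return of taking action a in state s, then following V."""
    return sum(p * (r + GAMMA * V[s2]) for p, s2, r in P[s][a])


def value_iteration(tol=1e-10):
    """Apply the Bellman optimality backup until the values stop changing."""
    V = {s: 0.0 for s in P}
    while True:
        # V(s) <- max over actions of the expected one-step return.
        new_V = {s: max(q_value(V, s, a) for a in P[s]) for s in P}
        if max(abs(new_V[s] - V[s]) for s in P) < tol:
            return new_V
        V = new_V


V_star = value_iteration()
# Greedy policy: in each state, pick the action with the highest Q-value.
policy = {s: max(P[s], key=lambda a: q_value(V_star, s, a)) for s in P}
```

At the fixed point, each state's value equals the best action value available in that state, which is the recursive consistency condition the optimal value function must satisfy.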