Results for "state-action value"

25 results

Value Function Intermediate

Expected cumulative reward from a state or state-action pair.

AI Economics & Strategy
Dynamics Model Advanced

Predicts next state given current state and action.

Reinforcement Learning
Q-Function Intermediate

Expected return from taking a given action in a given state and following the policy thereafter.

AI Economics & Strategy
State Estimation Advanced

Inferring the agent’s internal state from noisy sensor data.

Robotics & Embodied AI
Agent Loop Intermediate

Continuous cycle of observation, reasoning, action, and feedback.

AI Economics & Strategy
Trajectory Optimization Advanced

Optimizing continuous action sequences.

Reinforcement Learning
Behavior Cloning Advanced

Learning a state-to-action mapping directly from demonstrations.

Reinforcement Learning
Bellman Equation Intermediate

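Fundamental recursive relationship expressing a state's value as immediate reward plus the discounted value of successor states; its optimality form defines optimal value functions.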

AI Economics & Strategy
Actor-Critic Intermediate

Combines value estimation (critic) with policy learning (actor).

AI Economics & Strategy
Expectation Advanced

Probability-weighted average value of a random variable under its distribution.

Probability & Statistics
Law of Large Numbers Advanced

Sample mean converges to the expected value as the number of samples grows.

Probability & Statistics
Model Registry Intermediate

Central system to store model versions, metadata, approvals, and deployment state.

Foundations & Theory
Observability Intermediate

The capability to infer internal system state from telemetry; crucial for AI services and agents.

Evaluation & Benchmarking
Agent Intermediate

A system that perceives state, selects actions, and pursues goals—often combining LLM reasoning with tools and memory.

Agents & Autonomy
Particle Filter Intermediate

Sequential Monte Carlo method for state estimation that approximates the state distribution with weighted samples.

Time Series
Blackboard System Advanced

Agents communicate via shared state.

Agents & Autonomy
Control Loop Advanced

Continuous loop adjusting actions based on state feedback.

Robotics & Embodied AI
Action Space Intermediate

Set of all actions available to the agent.

AI Economics & Strategy
Key-Value Cache Intermediate

Stores the attention keys and values of previously processed tokens to speed up autoregressive decoding.

AI Economics & Strategy
Singular Value Decomposition Advanced

Decomposes a matrix into orthogonal components; used in embeddings and compression.

Mathematics
Value Misalignment Advanced

Model optimizes objectives misaligned with human values.

AI Safety & Alignment
Value at Risk Intermediate

Loss threshold not expected to be exceeded over a given time horizon at a given confidence level.

AI Economics & Strategy
Value Learning Intermediate

Inferring and aligning with human preferences.

Governance & Ethics
State Space Intermediate

All possible configurations an agent may encounter.

AI Economics & Strategy
State Space Model Intermediate

Models time evolution via hidden states.

Time Series