Results for "state-action value"

110 results

Value Function Intermediate

Expected cumulative reward from a state or state-action pair.

AI Economics & Strategy
Q-Function Intermediate

Expected return from taking a given action in a given state and following the policy thereafter.

AI Economics & Strategy
Action Space Intermediate

Set of all actions available to the agent.

AI Economics & Strategy
Bellman Equation Intermediate

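Recursive relationship expressing a state's value as immediate reward plus the discounted value of successor states; its optimality form defines optimal value functions.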

AI Economics & Strategy
Agent Loop Intermediate

Continuous cycle of observation, reasoning, action, and feedback.

AI Economics & Strategy
Markov Decision Process Intermediate

Formal framework for sequential decision-making under uncertainty.

AI Economics & Strategy
Dynamics Model Advanced

Predicts next state given current state and action.

Reinforcement Learning
State Space Model Intermediate

Models time evolution via hidden states.

Time Series
Policy Intermediate

Strategy mapping states to actions.

AI Economics & Strategy
State Estimation Advanced

Inferring the agent’s internal state from noisy sensor data.

Robotics & Embodied AI
Policy Gradient Intermediate

Optimizing policies directly via gradient ascent on expected reward.

AI Economics & Strategy
Control Loop Advanced

Continuous loop adjusting actions based on state feedback.

Robotics & Embodied AI
Actor-Critic Intermediate

Combines value estimation (critic) with policy learning (actor).

AI Economics & Strategy
State Space Intermediate

All possible configurations an agent may encounter.

AI Economics & Strategy
Policy Search Advanced

Directly optimizing control policies.

Reinforcement Learning
Reflex Agent Advanced

Simple agent responding directly to inputs.

Agents & Autonomy
Behavior Cloning Advanced

Learning action mapping directly from demonstrations.

Reinforcement Learning
On-Policy Learning Intermediate

Learning only from data generated by the current policy.

AI Economics & Strategy
Key-Value Cache Intermediate

Stores the attention keys and values of past tokens to speed up autoregressive decoding.

AI Economics & Strategy
Value at Risk Intermediate

Loss threshold not expected to be exceeded over a given time horizon at a given confidence level.

AI Economics & Strategy
Reinforcement Learning Intermediate

A learning paradigm where an agent interacts with an environment and learns to choose actions to maximize cumulative reward.

Reinforcement Learning
Kalman Filter Intermediate

Minimum mean-square-error estimator for linear dynamic systems with Gaussian noise.

Time Series
ReAct Pattern Advanced

Interleaving reasoning and tool use.

Agents & Autonomy
Law of Large Numbers Advanced

Sample mean converges to the expected value as the sample size grows.

Probability & Statistics
Monte Carlo Estimation Advanced

Approximating expectations via random sampling.

Probability & Statistics
Value Misalignment Advanced

Model optimizes objectives misaligned with human values.

AI Safety & Alignment
Value Learning Intermediate

Inferring and aligning with human preferences.

Governance & Ethics
Particle Filter Intermediate

Monte Carlo method for state estimation.

Time Series
Model-Based RL Advanced

RL using learned or known environment models.

Reinforcement Learning
Sparse Reward Advanced

Reward given only rarely, for example only upon task completion.

Reinforcement Learning
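
Several of the entries above (Q-Function, Bellman Equation, Policy, Markov Decision Process) fit together in one small computation. The sketch below is a minimal illustration, not taken from the glossary: it assumes a hypothetical two-state, two-action MDP with made-up rewards and a discount factor of 0.9, and repeatedly applies the Bellman optimality update to estimate state-action values. The names `transitions`, `GAMMA`, and `q_value_iteration` are invented for this example.

```python
# Hypothetical 2-state, 2-action MDP, invented purely for illustration.
# transitions[state][action] = list of (probability, next_state, reward)
GAMMA = 0.9  # assumed discount factor

transitions = {
    0: {0: [(1.0, 0, 0.0)], 1: [(1.0, 1, 1.0)]},
    1: {0: [(1.0, 0, 0.0)], 1: [(1.0, 1, 2.0)]},
}


def q_value_iteration(transitions, gamma, sweeps=500):
    """Estimate optimal state-action values by repeated Bellman optimality updates."""
    # Start with Q(s, a) = 0 for every state-action pair.
    q = {s: {a: 0.0 for a in acts} for s, acts in transitions.items()}
    for _ in range(sweeps):
        new_q = {}
        for s, acts in transitions.items():
            new_q[s] = {}
            for a, outcomes in acts.items():
                # Q(s, a) = sum over outcomes of p * (r + gamma * max_a' Q(s', a'))
                new_q[s][a] = sum(
                    p * (r + gamma * max(q[s2].values()))
                    for p, s2, r in outcomes
                )
        q = new_q
    return q


q = q_value_iteration(transitions, GAMMA)

# A greedy policy picks the highest-valued action in each state.
policy = {s: max(q[s], key=q[s].get) for s in q}

# The state value under that policy is the best state-action value.
values = {s: max(q[s].values()) for s in q}
```

In this toy setup the greedy policy is read directly off the state-action values, which is why a Q-function is often more convenient than a plain state value function when no dynamics model is available at decision time.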