Results for "reasoning + action"
Continuous cycle of observation, reasoning, action, and feedback.
Expected cumulative reward from a state or state-action pair.
Expected return of taking action in a state.
Predicts next state given current state and action.
Optimizing continuous action sequences.
Learning action mapping directly from demonstrations.
The field of building systems that perform tasks associated with human intelligence—perception, reasoning, language, planning, and decision-making—via algori...
Maximum number of tokens the model can attend to in one forward pass; constrains long-document reasoning.
Stepwise reasoning patterns that can improve multi-step tasks; often handled implicitly or summarized for safety/privacy.
Framework for reasoning about cause-effect relationships beyond correlation, often using structural assumptions and experiments.
A system that perceives state, selects actions, and pursues goals—often combining LLM reasoning with tools and memory.
Interleaving reasoning and tool use.
Agent reasoning about future outcomes.
Temporary reasoning space (often hidden).
Set of all actions available to the agent.