Policy Gradient
Intermediate
Optimizing policies directly via gradient ascent on expected reward.
Full Definition
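Policy gradient methods optimize a parameterized policy directly, by gradient ascent on the expected reward, rather than deriving the policy from a learned value function. The policy's parameters are adjusted so that actions followed by higher reward become more probable.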
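As a rough illustration of the definition, the sketch below applies a REINFORCE-style update to a two-armed bandit with a softmax policy. The bandit setting, the running-average baseline, and the names `softmax`, `train_bandit`, and `arm_means` are assumptions made for this example and do not come from the entry. It relies on the standard identity that the gradient of the expected reward equals the expected value of reward times the gradient of log pi(action).

```python
import math
import random


def softmax(prefs):
    """Turn a list of action preferences into a probability distribution."""
    m = max(prefs)  # subtract the max for numerical stability
    exps = [math.exp(p - m) for p in prefs]
    total = sum(exps)
    return [e / total for e in exps]


def train_bandit(arm_means, steps=5000, lr=0.1, seed=0):
    """REINFORCE-style gradient ascent on a hypothetical Gaussian bandit.

    arm_means: assumed mean reward of each arm (illustrative only).
    Returns the learned action probabilities.
    """
    rng = random.Random(seed)
    theta = [0.0] * len(arm_means)  # policy parameters, one preference per arm
    baseline = 0.0                  # running average reward, reduces variance

    for t in range(1, steps + 1):
        probs = softmax(theta)
        action = rng.choices(range(len(theta)), weights=probs)[0]
        reward = arm_means[action] + rng.gauss(0.0, 1.0)

        # The baseline only uses earlier rewards, so the estimate stays unbiased.
        advantage = reward - baseline

        # For a softmax policy: d/d theta_k log pi(action) = 1[k == action] - pi(k)
        for k in range(len(theta)):
            grad_log = (1.0 if k == action else 0.0) - probs[k]
            theta[k] += lr * advantage * grad_log  # ascent step

        baseline += (reward - baseline) / t

    return softmax(theta)


if __name__ == "__main__":
    print(train_bandit([0.0, 1.0]))
```

With the assumed means `[0.0, 1.0]`, the learned policy should put most of its probability on the second arm. Practical methods apply the same update to multi-step returns and neural-network policies.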