Policy Gradient

Intermediate

Optimizing policies directly via gradient ascent on expected reward.

Full Definition

Optimizing policies directly via gradient ascent on expected reward.

Keywords

Domains

Related Terms

Concept Map

See how Policy Gradient connects to other concepts.

Open Knowledge Graph