Leemon Baird
Tags
Active Learning, Markov Decision Processes, Reinforcement Learning
Papers
-
Multi-Value-Functions: Efficient Automatic Action Hierarchies for Multiple Goal MDPs
(1999)
An efficient procedure to approximately compute all policies for all possible goal states.