Malcolm Strens
Postdoctoral Fellow
Tags
Markov Decision Processes, Memory-based Learning, Optimization, Reinforcement Learning
Papers
- Direct Policy Search using Paired Statistical Tests (2001)
If you're going to choose the best policy by roll-outs, how can statistical tests and "racing" help?