Deep RL Success Stories
DQNMnihet al, NIPS 2013 / Nature 2015
MCTSGuoet al, NIPS 2014; TRPOSchulman, Levine, Moritz, Jordan, Abbeel, ICML 2015; A3CMnihet al,
ICML 2016; Dueling DQN Wang et al ICML 2016; Double DQN van Hasselt et al, AAAI 2016; Prioritized
Experience Replay Schaulet al, ICLR 2016; Bootstrapped DQN Osbandet al, 2016; Q-EnsemblesChen et al,
2017; RainbowHessel et al, 2017; AcceleratedStookeand Abbeel, 2018; ...
Pieter Abbeel --UC Berkeley | Gradescope| Covariant.AI