Deep Learning and Robotics

AI with Objectives/Goals n Robotics n Marketing / Advertising n Dialogue n Optimizing operations / logistics n Queue management ...

From Pixels to Actions? Pong Enduro Beamrider Q*bert Pieter Abbeel --UC Berkeley | Gradescope| Covariant.AI ...

[Source: Mnihet al., Nature 2015 (DeepMind) ] Deep Q-Network (DQN): From Pixels to Joystick Commands 32 8x8 filters with stride ...

[ Source: Mnihet al., Nature 2015 (DeepMind) ] Pieter Abbeel --UC Berkeley | Gradescope| Covariant.AI ...

Deep RL Success Stories DQNMnihet al, NIPS 2013 / Nature 2015 MCTSGuoet al, NIPS 2014; TRPOSchulman, Levine, Moritz, Jordan, Abb ...

Deep RL Success Stories AlphaGoSilver et al, Nature 2015 AlphaGoZeroSilver et al, Nature 2017 Tian et al, 2016; Maddison et al, ...

n Super-human agent on a competitive game, enabled by n Reinforcement learning n Self-play n Enough computation n Cooperation em ...

Learning Locomotion [Schulman, Moritz, Levine, Jordan, Abbeel, ICLR 2016] Pieter Abbeel --UC Berkeley | Gradescope| Covariant.AI ...

Deep RL: Learn to Pass/Protect [Bansal et al, 2017] Pieter Abbeel --UC Berkeley | Gradescope| Covariant.AI ...

Deep RL: Learn Soccer [Bansal et al, 2017] Pieter Abbeel --UC Berkeley | Gradescope| Covariant.AI ...

Deep RL: Virtual Stuntman [Peng, Abbeel, Levine, van de Panne, 2018] Pieter Abbeel --UC Berkeley | Gradescope| Covariant.AI ...

Deep RL: Dynamic Animation for Motion Picture [Peng, Abbeel, Levine, van de Panne, 2018] Pieter Abbeel --UC Berkeley | Gradescop ...

BRETT: Berkeley Robot for the Elimination of Tedious Tasks [Levine, Finn, Darrell, Abbeel, JMLR 2016] Pieter Abbeel --UC Berkele ...

Unsupervised Learning for Interaction? [Levine et al, 2016] Pieter Abbeel --UC Berkeley | Gradescope| Covariant.AI ...

Application: Google Datacenter Cooling 40% reduction in cooling cost https://deepmind.com/blog/deepmind-ai-reduces-google-data-c ...

Deep Reinforcement Learning -- NASA SUPERball [Geng, Zhang, Bruce*, Caluwaerts, Vespignani, Sunspiral, Abbeel, Levine, ICRA 2017 ...

How About a Hand? [OpenAIRobotics Team] Pieter Abbeel --UC Berkeley | Gradescope| Covariant.AI ...

Speed Up Deep RL through Imitation Pieter Abbeel --UC Berkeley | Gradescope| Covariant.AI ...

Human Demonstrations [Zhang, McCarthy, Jow, Lee, Chen, Goldberg, Abbeel, ICRA 2018] Pieter Abbeel --UC Berkeley | Gradescope| Co ...

n Deep learning successes n Supervised learning = pattern recognition, if enough data (input -> output pairs), then neural ne ...

«
1
2
3
4
5
6
»

Free download pdf

Get our desktop app

Company

Features

Documentation

Resources