Deep Learning for Robotics

Proprietary and confidential. Do not distribute.
Deep Learning for Robotics
Yinyin Liu, PhD
MAKING MACHINES
SMARTER.™
now part of

Nervana Systems Proprietary
2
neon deep
learning
framework
train deployexplore
nervana
engine
2-3x speedup on
NVIDIA GPUs
cloudn

3
Back-propagation
End-to-end
Resnet
ImageNet
NLP
Regularization
Convolution
Unrolling
RNN
Generalization
hyperparameters
Video recognition
dropout
Pooling
LSTM
AlexNet
Auto-encoder
neon
https://github.com/NervanaSystems/neon
Nervana’s deep learning
tutorials:
https://www.nervanasys.com/deep-learning-tutorials/
We are hiring!
https://www.nervanasys.com/careers/

4
• What is Deep Learning and What Can It Do Today?
• How DL helps Robotics?
• Deep Reinforcement Learning
• Finding the Right Frameworks For You

6
https://www.nervanasys.com/industry-focus-serving-the-automotive-industry-with-the-nervana-platform/

9
http://www.nervanasys.com/deep-reinforcement-learning-with-neon/
https://youtu.be/KkIf0Ok5GCE

10
Historical perspective:
• Input → designed features → output
• Input → designed features → SVM → output
• Input → learned features → SVM → output
• Input → levels of learned features → output

11
~60 million parameters
Positive/
negative
End-to-end learning
Raw image input Output

12
A method for extracting features at multiple
levels of abstraction
• Features are discovered from data
• Performance improves with more data
• Network can express complex transformations
• High degree of representational power

13
(Zeiler and Fergus, 2013)

Source: ImageNet
ImageNet top 5 error rate
0%
10%
20%
30%
2010 2011 2012 2013 2014 2015
human
performance
• No free lunch
• lots of data
• flexible and fast
frameworks
• powerful computing
resources
14

15
Healthcare: Tumor detection
Automotive: Speech interfaces Finance: Time-series search engine
Positive:
Negative:
Agricultural Robotics Oil & Gas
Positive:
Negative:
Proteomics: Sequence analysis
Query:
Results:

16

17
Image classification Object localization
Image segmentation

18
pepper jibo
Robot base FURo-i Cubic Budgee Branto
echoroomba
Consumer robots for companionship and home service

19
https://www.autonomous.ai/personal-robot
DL-based computer vision solutions help robot to navigate around a home and understand the scene and localize everyday
objects.

20
DL-based NLP/NLU solutions help robot to understand verbal commands and interact with users

21
• But most of the consumer robots either do not move or move around on a base
• Home robots are still far from providing home service, e.g. cooking, cleaning, taking care of people.
• Robot movement is a difficult
• It is challenging for robot to know how to interact with objects, not to mention having the level of dexterity of human

22
• Trying to tackle the problem of robotic grasping
• 14 Separate robots to collect data in parallel,
800k grasp attempts collected, over 7 months
• Each grasp consists of T time steps. At the end
of the T, grasp success is evaluated. Then T
samples of (image, current pose, success label)
data are collected
• No human labelling needed!
Levine et.al (2016)
https://research.googleblog.com/2016/03/deep-
learning-for-robots-learning-from.html

23
Prediction network: CNN learn to predict the outcome of a grasp, given
• An image before grasp begins
• An image at current time
• A motor command - 3D translation vector
https://arxiv.org/pdf/1603.02199v4.pdf

24
Servoing mechanism:
• User the predictor network
• Choose the motor commands from a pool of samples with the best score
Prediction network
score
score
score
score

25
• End-to-end learning
what are objects vs. gripper
what is the right orientation to grasp
what is the right motor command
• Learn from repetitively trials
• A useful training paradigm is RL

26

27
• RL – defines the goal, reward, training paradigm
• DL – gives the mechanics
• RL + DL = AI*
http://icml.cc/2016/tutorials/deep_rl_tutorial.pdf
* By David Silver

28
End-to-end learningRaw perception Output
https://storage.googleapis.com/deepmind-data/assets/papers/DeepMindNature14236Paper.pdf

29
https://www.nervanasys.com/demystif
ying-deep-reinforcement-learning/
https://www.nervanasys.com/deep-
reinforcement-learning-with-neon/

30
https://github.com/tambetm/simple_dqn

31
https://github.com/tambetm/simple_dqn

32
• As one network approximating the Q value, and output layer represents values for each action, the
algorithm deals with discrete and small finite-set of actions only.
• Apply actor-critic architecture to continuous action space
• Add BatchNorm – help to generalize to different problems
• High-dimensional tasks simulated in MuJoCo.
• Race game simulated using Torcs.
Lillicrap et. al. (Deepmind, ICLR 2016)

33

34
To make progress on robotics:
• Need a lot of data to improve on executing tasks
• Need interaction with the environment
- costly for real world experiments
- need simulator for a variety of tasks
• Need benchmarks
- ImageNet drove a lot of progress for the vision problems in supervised learning
- lack of standardized environment, tasks, or metrics for RL publications and
comparison

35

36
https://www.nervanasys.com/openai/

Layers
Linear, Convolution, Pooling, Deconvolution, Dropout, Recurrent, Long Short-
Term Memory, Gated Recurrent Unit, BatchNorm, LookupTable,
Local Response Normalization, Bidirectional-RNN, Bidirectional-LSTM
Backend NervanaGPU, NervanaCPU, NervanaMGPU
Datasets
MNIST, CIFAR-10, Imagenet 1K, PASCAL VOC, Mini-Places2, IMDB, Penn Treebank,
Shakespeare Text, bAbI, Hutter-prize, UCF101, flickr8k, flickr30k, COCO
Initializers Constant, Uniform, Gaussian, Glorot Uniform, Xavier, Kaiming, IdentityInit, Orthonormal
Optimizers Gradient Descent with Momentum, RMSProp, AdaDelta, Adam, Adagrad,MultiOptimizer
Activations Rectified Linear, Softmax, Tanh, Logistic, Identity, ExpLin
Costs Binary Cross Entropy, Multiclass Cross Entropy, Sum of Squares Error
Metrics Misclassification (Top1, TopK), LogLoss, Accuracy, PrecisionRecall, ObjectDetection
37

38
neon Theano Caffe Torch TensorFlow
Academic Research
Bleeding-edge
Curated models
Iteration Time
Inference speed
Package ecosystem
Support

39
Third-party
(Facebook)
benchmarking

40

41

42
• github.com/NervanaSystems/ModelZoo
• model files, parameters

43
neon
https://github.com/NervanaSystems/neon
Nervana’s deep learning tutorials:
https://www.nervanasys.com/deep-learning-tutorials/
We are hiring!
https://www.nervanasys.com/careers/

Deep Learning for Robotics