Convolutional Encoder-Decoder Networks for Robust Image-to-Motion Prediction

May 1, 2019·

Barry Ridge

Rok Pahič

Aleš Ude

Jun Morimoto

· 0 min read

PDF Cite Code Project

Writing digits with a robot using image-to-motion encoder-decoder network prediction.

Abstract

A deep encoder-decoder network was previously proposed for learning a mapping from raw images to dynamic movement primitives in order to enable a robot to draw sketches of numeric digits when shown images of same. In this paper, the network architecture, which was previously constructed entirely with fully-connected linear layers, is modified to include convolutional layers in order to improve the image encoder component and make the network more robust to noise. The convolutional layers are pre-trained as part of an MNIST digit classifier and adapted for use in the encoder-decoder network, before the network is trained using a dataset composed of digit images and corresponding writing trajectories. This architecture was tested on several challenging noisy digit datasets and the use of convolutional layers is shown to provide a robust improvement in results.

Type

Conference paper

Publication

Proceedings of the 28th International Conference on Robotics in Alpe-Adria-Danube Region (RAAD 2019)

Last updated on May 1, 2019

← Cut & Recombine: Reuse of Robot Action Components Based on Simple Language Instructions Sep 1, 2019

Learning to Write Anywhere with Spatial Transformer Image-to-Motion Encoder-Decoder Networks May 1, 2019 →