Deep autoencoder for robotic task learning in latent space
This repository contains a simple deep autoencoder network designed to demonstrate its applicability to robotic task learning. The robot's task was to accurately throw a ball into a target using reinforcement learning. A Mitsubishi PA-10 robot was used to execute the throw. Demo:
The robot's motion trajectories were represented using dynamic movement primitives (DMPs). To reduce the dimensionality of the learning problem, we ran the reinforcement learning algorithm in the latent space of the deep autoencoder network.
The data set for training the autoencoder was generated in simulation. It is a set of DMP weight vectors defining the robot movements that produce ball trajectories for a grid of targets; see the figure below:
Reinforcement learning was shown to be faster in the extracted latent space, and it generated more natural movements of the robotic arm; see [1].
By running the Python script `train-ae.py` in the `scripts` directory, we can train the deep autoencoder network on the DMP weight vectors. The weight vectors have to be in CSV format (examples in the `data` directory). The script then saves the transition matrices for encoding to and decoding from the latent space.
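To illustrate the idea, here is a minimal sketch of extracting encoding/decoding matrices with a *linear* autoencoder trained by gradient descent. The repository's actual network is deeper and nonlinear, and the toy data below stands in for real DMP weight vectors; this is only an illustration of the principle.

```python
import numpy as np

def train_linear_autoencoder(X, latent_dim, lr=1e-2, epochs=2000, seed=0):
    """Train a single-layer linear autoencoder x -> z = x @ W_enc -> z @ W_dec.

    X is an (n_samples, n_features) array of DMP weight vectors.
    Returns the encoding and decoding matrices (simplified stand-ins for
    the transition matrices saved by the training script).
    """
    rng = np.random.default_rng(seed)
    n, d = X.shape
    W_enc = rng.normal(scale=0.1, size=(d, latent_dim))
    W_dec = rng.normal(scale=0.1, size=(latent_dim, d))
    for _ in range(epochs):
        Z = X @ W_enc                 # encode to latent space
        X_hat = Z @ W_dec             # decode back
        E = X_hat - X                 # reconstruction error
        # gradients of the mean squared reconstruction loss
        grad_dec = Z.T @ E / n
        grad_enc = X.T @ (E @ W_dec.T) / n
        W_enc -= lr * grad_enc
        W_dec -= lr * grad_dec
    return W_enc, W_dec

# toy "DMP weight vectors" lying near a 2-D subspace of R^10
rng = np.random.default_rng(1)
X = rng.normal(size=(200, 2)) @ rng.normal(size=(2, 10))
W_enc, W_dec = train_linear_autoencoder(X, latent_dim=2)
err = np.mean((X - X @ W_enc @ W_dec) ** 2)
```

Because the toy data lies in a 2-D subspace, a 2-D latent space suffices to reconstruct it almost exactly; real DMP weight data is only approximately low-dimensional.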
The reinforcement learning algorithm (PoWER) can then be applied in the latent space to modify the transformed DMP weights of an initial throw until the ball reaches the target.
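The latent-space update can be sketched as follows. This is a simplified PoWER-style step (a reward-weighted average over the best Gaussian explorations), with a toy reward function standing in for the throwing task; the parameter values are illustrative assumptions, not the repository's settings.

```python
import numpy as np

def power_optimize(reward_fn, z0, sigma=0.3, n_rollouts=10, n_iters=50,
                   n_best=5, seed=0):
    """Minimal PoWER-style optimization of latent parameters (illustrative).

    Each iteration perturbs the current latent vector z with Gaussian
    exploration noise, then re-estimates z as the reward-weighted average
    of the highest-reward perturbations. Rewards must be non-negative.
    """
    rng = np.random.default_rng(seed)
    z = np.array(z0, dtype=float)
    for _ in range(n_iters):
        eps = rng.normal(scale=sigma, size=(n_rollouts, z.size))
        rewards = np.array([reward_fn(z + e) for e in eps])
        best = np.argsort(rewards)[-n_best:]            # top rollouts
        w = rewards[best]
        z = z + (w @ eps[best]) / (w.sum() + 1e-12)     # weighted update
    return z

# toy task: move a 2-D latent point onto a "target" configuration
target = np.array([1.5, -0.7])
reward = lambda z: np.exp(-np.sum((z - target) ** 2))
z_final = power_optimize(reward, z0=np.zeros(2))
```

In the actual task, evaluating `reward_fn` would mean decoding the latent vector back to DMP weights, executing the throw, and scoring the ball's landing point.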
The simulation and execution were written in Matlab for this specific robot and this specific task, but the same approach can be applied to different tasks in general.
We can apply the same approach to a simple task of writing numbers:
For the implementation of DMPs we can use the open-source Python library pydmps:

```
pip install pydmps
```
For the simulation of a robotic arm we can use the simulation framework control. First we need to generate a set of DMP weight vectors that define the robot movements producing the number trajectories. The figure below shows the set of generated trajectories for the number 2:
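The imitation step that turns a demonstrated trajectory into a DMP weight vector can be sketched as below. pydmps provides this functionality (via `DMPs_discrete.imitate_path`); the compact re-implementation here handles a single degree of freedom, and the gains and basis-function placement are common defaults rather than the repository's exact settings.

```python
import numpy as np

def learn_dmp_weights(y_demo, n_bfs=20, alpha=25.0, beta=6.25, ax=1.0):
    """Fit DMP forcing-term weights to a demonstrated 1-D trajectory.

    Returns one weight vector of length n_bfs; for a multi-DoF trajectory,
    each dimension is fitted independently and the vectors are stacked.
    """
    T = len(y_demo)
    dt = 1.0 / T
    # demonstrated velocity and acceleration by finite differences
    dy = np.gradient(y_demo, dt)
    ddy = np.gradient(dy, dt)
    y0, g = y_demo[0], y_demo[-1]
    x = np.exp(-ax * np.linspace(0, 1, T))        # canonical system decay
    # forcing term the weights must reproduce
    f_target = ddy - alpha * (beta * (g - y_demo) - dy)
    # Gaussian basis functions spaced along the canonical variable
    c = np.exp(-ax * np.linspace(0, 1, n_bfs))
    h = n_bfs ** 1.5 / c
    psi = np.exp(-h * (x[:, None] - c[None, :]) ** 2)
    # locally weighted regression for each basis weight
    num = np.sum(x[:, None] * psi * f_target[:, None], axis=0)
    den = np.sum(x[:, None] ** 2 * psi, axis=0)
    return num / (den + 1e-10)

# fit weights to one demonstrated stroke
w = learn_dmp_weights(np.sin(np.linspace(0, np.pi, 100)))
```

Repeating this for each demonstrated number trajectory yields the set of DMP weight vectors used to train the autoencoder.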
Then we train the deep autoencoder network to extract the latent space for learning. We can start with a simple initial movement of drawing a line, transform its DMP weights into the latent space, and apply the reinforcement learning algorithm to reach the target drawing of the number:
In this example, the Euclidean distance between the generated curve and the target curve representing the number was used as the reward function for the PoWER reinforcement learning algorithm.
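One possible form of such a reward is sketched below. The source specifies only that Euclidean distance was used; mapping the distance through an exponential to get a positive reward (which PoWER requires) is our assumption.

```python
import numpy as np

def curve_reward(curve, target_curve):
    """Reward for the number-writing example: larger when the generated
    curve is pointwise closer to the target curve.

    Both inputs are (n_points, 2) arrays sampled at matching points; the
    mean Euclidean point distance is mapped to a positive reward.
    """
    d = np.mean(np.linalg.norm(curve - target_curve, axis=1))
    return np.exp(-d)

# a perfect match gives the maximum reward of 1.0
c = np.zeros((5, 2))
r_exact = curve_reward(c, c)
r_offset = curve_reward(c + 1.0, c)
```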
[1]: R. Pahič, Z. Lončarević, A. Gams and A. Ude, "Robot skill learning in latent space of a deep autoencoder neural network", Robotics and Autonomous Systems 135 (2021), doi: https://doi.org/10.1016/j.robot.2020.103690 (https://www.sciencedirect.com/science/article/pii/S0921889020305303)