Quantum Reinforcement Learning for Sensor-Assisted Robot Navigation Tasks

Cobussen, Joyce

Quantum Reinforcement Learning for Sensor-Assisted Robot Navigation Tasks

Mark

Cobussen, Joyce ^LU (2023) FYSM33 20232
Department of Physics

Abstract: Quantum computing has advanced rapidly throughout the past decade, both from a hardware and software point of view. A variety of algorithms have been developed that are suitable for the current generation of quantum devices, which are referred to as noisy intermediate-scale quantum devices. Amongst them is the variational quantum algorithm, a hybrid quantum-classical algorithm that optimizes the parameters of a parameterized quantum circuit using optimization methods from classical machine learning. Previous research has shown that this approach requires fewer trainable parameters and fewer time steps to find a solution to certain problems compared to various classical machine learning algorithms. This is particularly interesting in the... (More); Quantum computing has advanced rapidly throughout the past decade, both from a hardware and software point of view. A variety of algorithms have been developed that are suitable for the current generation of quantum devices, which are referred to as noisy intermediate-scale quantum devices. Amongst them is the variational quantum algorithm, a hybrid quantum-classical algorithm that optimizes the parameters of a parameterized quantum circuit using optimization methods from classical machine learning. Previous research has shown that this approach requires fewer trainable parameters and fewer time steps to find a solution to certain problems compared to various classical machine learning algorithms. This is particularly interesting in the case of reinforcement
learning, which is a powerful type of machine learning, although its algorithms are notoriously difficult to train. They often require many training steps to converge to a solution, leading to large computational costs. Therefore, this work investigates the effect of replacing the deep neural networks of the Deep Q-learning algorithm with various parameterized quantum circuits. For the first time, a simulated TurtleBot equipped with a LiDAR sensor is trained to move through three environments containing fixed obstacles, each with a different size and complexity. The influence of different configurations of input data on the performance of both classical and quantum models is investigated. Furthermore, the expressibility, entanglement capability and effective dimension of the quantum circuits were computed to investigate the correlation between these metrics and the performance of the quantum models. Finally, the effect of depolarizing noise on the performance of one of the quantum models was investigated. Results showed that in the smallest environment, the quantum algorithm had a higher chance to converge and required fewer time steps to do so compared to a classical network containing a similar number of trainable parameters. In addition, the best quantum model performed equally well compared to the largest classical model, even though the latter has an order of magnitude more trainable parameters. In general, the data encoding strategy had a strong impact on the performance of the quantum models. Furthermore, there seemed to be no strong correlation between the quantum metrics and the performance of the quantum models. Finally, depolarizing noise at the a relatively low error rate did not influence the performance of the tested model. These results constitute a useful and practical base for further research into training sensor-assisted agents using quantum reinforcement learning. Further investigation into the the
performance of the models in larger and more complex environments, as well as their performance in non-static environments, is required. Additionally, the correlation between the quantum metrics, especially the normalized effective dimension, and the quantum model performances requires further analysis. Lastly, noise models incorporating two-qubit gate errors, as well as different types of noise, should be further investigated. (Less)
Popular Abstract: Can Quantum Computing bring R2D2 to Life?
Nearly everyone knows R2D2, the beloved droid from the Star Wars universe. Such a sophisticated droid may not exist in real life yet, but scientists have started developing techniques to train robots for simple but essential tasks, such as walking through a room filled with obstacles. This is an easy task for most humans, as our eyes help us to avoid the obstacles and our sense of orientation will lead us to the goal. A robot, however, does not possess these natural abilities, so how can we teach them to do the same?

A sophisticated, yet intuitive method is called reinforcement learning. This machine learning technique lets the robot learn information about its environment through... (More); Can Quantum Computing bring R2D2 to Life?
Nearly everyone knows R2D2, the beloved droid from the Star Wars universe. Such a sophisticated droid may not exist in real life yet, but scientists have started developing techniques to train robots for simple but essential tasks, such as walking through a room filled with obstacles. This is an easy task for most humans, as our eyes help us to avoid the obstacles and our sense of orientation will lead us to the goal. A robot, however, does not possess these natural abilities, so how can we teach them to do the same?

A sophisticated, yet intuitive method is called reinforcement learning. This machine learning technique lets the robot learn information about its environment through trial-and-error. Simply put, the robot (or a simulation of it) is put in the obstacle-filled room and has some way to observe it (e.g., using distance sensors). Next, it is allowed to move through the room and collect rewards by avoiding obstacles and moving closer to the goal. As time passes, the robot learns how collect as many rewards as possible, which ultimately leads it to the goal.

To process the observations and rewards, the robot uses a deep neural network. This works a bit like a simplified, artificial brain, that gets better through training. Zooming in, the neural network contains many ‘trainable parameters’ that are connected to each other – somewhat like brain cells. During the training process, the parameters are updated in a coordinated manner, considering the observations and rewards of the robot. When the training ends, the parameters are optimized such that the robot can exactly predict the outcome of each step it takes. As a result, the robot can reach the goal by simply by choosing the steps that lead to the best outcome.

Optimizing the parameters of the neural network can be done with a regular computer, but recent research showed that quantum computers, a new type of computers based on different physical properties, might also be good at this task. Quantum computers use qubits, instead of bits, to process information. Simply put, a bit is the smallest piece of information on a regular computer which has a value of 1 or 0. In contrast, a qubit is the quantum version of a bit and can both be 1 and 0 at the same time with a certain probability. In addition, qubits can be ‘entangled’ with each other, which means that manipulating one qubit may instantly influence one or more of the other qubits. These two properties make quantum computers potentially very powerful.

This brings us to the purpose of this master’s thesis. A simple simulated robot equipped
with some distance sensors was trained to walk through a room filled with obstacles. The
neural network, functioning as the brain of the robot, was trained using qubits instead of regular bits. It turned out that the best quantum-based neural network needed twenty times fewer trainable parameters to learn how to navigate, compared to neural networks trained with regular computer. This is a promising result, as more complex tasks easily require millions of trainable parameters, which can be difficult to optimize.

However, many questions still remained unanswered. Firstly, the arrangement of the qubits influenced the performance of the quantum-based neural network substantially, although it was unclear which differences contributed the most. In addition, the obstacle-filled room was relatively small, so it is unclear how well the quantum-based neural network performs in larger spaces. Finally, the quantum-noise that is present in quantum computers may also impact the results, but that has not been investigated accurately enough yet. Taking all of this into consideration, we probably won’t have a quantum-R2D2 anytime soon, but future perspectives look promising nevertheless! (Less)

Please use this url to cite or link to this publication: http://lup.lub.lu.se/student-papers/record/9141398

author

Cobussen, Joyce ^LU

supervisor

organization

Department of Physics

course

FYSM33 20232

year

2023

type

H2 - Master's Degree (Two Years)

subject

Physics and Astronomy

language

English

id

9141398

date added to LUP

2023-11-13 09:14:17

date last changed

2023-11-13 09:14:17

@misc{9141398,
abstract = {{Quantum computing has advanced rapidly throughout the past decade, both from a hardware and software point of view. A variety of algorithms have been developed that are suitable for the current generation of quantum devices, which are referred to as noisy intermediate-scale quantum devices. Amongst them is the variational quantum algorithm, a hybrid quantum-classical algorithm that optimizes the parameters of a parameterized quantum circuit using optimization methods from classical machine learning. Previous research has shown that this approach requires fewer trainable parameters and fewer time steps to find a solution to certain problems compared to various classical machine learning algorithms. This is particularly interesting in the case of reinforcement
learning, which is a powerful type of machine learning, although its algorithms are notoriously difficult to train. They often require many training steps to converge to a solution, leading to large computational costs. Therefore, this work investigates the effect of replacing the deep neural networks of the Deep Q-learning algorithm with various parameterized quantum circuits. For the first time, a simulated TurtleBot equipped with a LiDAR sensor is trained to move through three environments containing fixed obstacles, each with a different size and complexity. The influence of different configurations of input data on the performance of both classical and quantum models is investigated. Furthermore, the expressibility, entanglement capability and effective dimension of the quantum circuits were computed to investigate the correlation between these metrics and the performance of the quantum models. Finally, the effect of depolarizing noise on the performance of one of the quantum models was investigated. Results showed that in the smallest environment, the quantum algorithm had a higher chance to converge and required fewer time steps to do so compared to a classical network containing a similar number of trainable parameters. In addition, the best quantum model performed equally well compared to the largest classical model, even though the latter has an order of magnitude more trainable parameters. In general, the data encoding strategy had a strong impact on the performance of the quantum models. Furthermore, there seemed to be no strong correlation between the quantum metrics and the performance of the quantum models. Finally, depolarizing noise at the a relatively low error rate did not influence the performance of the tested model. These results constitute a useful and practical base for further research into training sensor-assisted agents using quantum reinforcement learning. Further investigation into the the
performance of the models in larger and more complex environments, as well as their performance in non-static environments, is required. Additionally, the correlation between the quantum metrics, especially the normalized effective dimension, and the quantum model performances requires further analysis. Lastly, noise models incorporating two-qubit gate errors, as well as different types of noise, should be further investigated.}},
author = {{Cobussen, Joyce}},
language = {{eng}},
note = {{Student Paper}},
title = {{Quantum Reinforcement Learning for Sensor-Assisted Robot Navigation Tasks}},
year = {{2023}},
}

LUP Student Papers

LUND UNIVERSITY LIBRARIES

Quantum Reinforcement Learning for Sensor-Assisted Robot Navigation Tasks