Highway Lane change under uncertainty with Deep Reinforcement Learning based motion planner

Sakib, Nazmus

doi:doi:10.7939/r3-qm5k-s682

This decommissioned ERA site remains active temporarily to support our final migration steps to https://ualberta.scholaris.ca, ERA's new home. All new collections and items, including Spring 2025 theses, are at that site. For assistance, please contact erahelp@ualberta.ca.

View

Download

Communities and Collections

Graduate and Postdoctoral Studies (GPS), Faculty of / Theses and Dissertations

Usage

290 views
376 downloads

Highway Lane change under uncertainty with Deep Reinforcement Learning based motion planner

Author / Creator

Sakib, Nazmus
Motion Planning is a fundamental component of a mobile robot to reach its
goal safely avoiding collision. For a self-driving car on a highway, the presence of non-communicating vehicles, specially those whose intent is unknown,
creates a lot of uncertainty for the motion planner in generating a safe trajectory. State-of-the-art planning methods do not work well in case of adversary driving scenarios, where the other vehicles may make mistakes or have
a competing or malicious intent. We use reinforcement learning framework
to improve safety under those scenarios. In most recent deep reinforcement
learning applications, there is a neural network that maps an input state to
an optimal policy over actions. However, learning a policy over such original
or primitive actions is very slow and inefficient and is therefore not suitable
for many robotics tasks. On the other hand, the knowledge already learned in
classical planning methods should be inherited and reused. In this thesis, in
order to take advantage of reinforcement learning good at exploring the action
space for optimal solution and classical planning skill models good at handling
most driving scenarios, we propose to learn a policy over an action space of
primitive actions augmented with classical planning methods. By doing so, we
show that our agent outperforms the primitive-action reinforcement learning
agent and the classical planning methods in terms of collision rate
Subjects / Keywords
Graduation date

Spring 2020
Type of Item

Thesis
Degree

Master of Science
DOI

https://doi.org/10.7939/r3-qm5k-s682
License

Permission is hereby granted to the University of Alberta Libraries to reproduce single copies of this thesis and to lend or sell such copies for private, scholarly or scientific research purposes only. Where the thesis is converted to, or otherwise made available in digital form, the University of Alberta will advise potential users of the thesis of these terms. The author reserves all other publication and other rights in association with the copyright in the thesis and, except as herein before provided, neither the thesis nor any substantial portion thereof may be printed or otherwise reproduced in any material form whatsoever without the author's prior written permission.

Language

English
Institution

University of Alberta
Degree level

Master's
Department
- Department of Computing Science
Supervisor / co-supervisor and their department(s)
- Zhang, Hong