Michigan Tech Publications

Reinforcement learning-based multi-AUV adaptive trajectory planning for under-ice field estimation

Chaofeng Wang, Michigan Technological UniversityFollow
Li Wei, Michigan Technological UniversityFollow
Zhaohui Wang, Michigan Technological UniversityFollow
Min Song, Stevens Institute of Technology
Nina Mahmoudian, Michigan Technological UniversityFollow

Document Type

Article

Publication Date

11-9-2018

Department

Department of Electrical and Computer Engineering; Department of Mechanical Engineering-Engineering Mechanics

Abstract

This work studies online learning-based trajectory planning for multiple autonomous underwater vehicles (AUVs) to estimate a water parameter field of interest in the under-ice environment. A centralized system is considered, where several fixed access points on the ice layer are introduced as gateways for communications between the AUVs and a remote data fusion center. We model the water parameter field of interest as a Gaussian process with unknown hyper-parameters. The AUV trajectories for sampling are determined on an epoch-by-epoch basis. At the end of each epoch, the access points relay the observed field samples from all the AUVs to the fusion center, which computes the posterior distribution of the field based on the Gaussian process regression and estimates the field hyper-parameters. The optimal trajectories of all the AUVs in the next epoch are determined to maximize a long-term reward that is defined based on the field uncertainty reduction and the AUV mobility cost, subject to the kinematics constraint, the communication constraint and the sensing area constraint. We formulate the adaptive trajectory planning problem as a Markov decision process (MDP). A reinforcement learning-based online learning algorithm is designed to determine the optimal AUV trajectories in a constrained continuous space. Simulation results show that the proposed learning-based trajectory planning algorithm has performance similar to a benchmark method that assumes perfect knowledge of the field hyper-parameters.

Publisher's Statement

Publication Title

Sensors

Recommended Citation

Wang, C., Wei, L., Wang, Z., Song, M., & Mahmoudian, N. (2018). Reinforcement learning-based multi-AUV adaptive trajectory planning for under-ice field estimation. Sensors, 18(11), 1-19. http://doi.org/10.3390/s18113859
Retrieved from: https://digitalcommons.mtu.edu/michigantech-p/172

Creative Commons License

This work is licensed under a Creative Commons Attribution 4.0 International License.

Version

Publisher's PDF

Download

Included in

Electrical and Computer Engineering Commons

COinS

Michigan Tech Publications

Reinforcement learning-based multi-AUV adaptive trajectory planning for under-ice field estimation

Document Type

Publication Date

Department

Abstract

Publisher's Statement

Publication Title

Recommended Citation

Creative Commons License

Version

Included in

LINKS

Browse

Search

Author Corner

Links

Michigan Tech Publications

Reinforcement learning-based multi-AUV adaptive trajectory planning for under-ice field estimation

Authors

Document Type

Publication Date

Department

Abstract

Publisher's Statement

Publication Title

Recommended Citation

Creative Commons License

Version

Included in

Share

LINKS

Browse

Search

Author Corner

Links