Optimal tracking controllers with Off-policy Reinforcement Learning Algorithm in Quadrotor

Dinh Duong Pham; Thanh Trung Cao; Tat Chung Nguyen; Phuong Nam Dao

Optimal tracking controllers with Off-policy Reinforcement Learning Algorithm in Quadrotor

Dinh Duong Pham, Thanh Trung Cao, Tat Chung Nguyen, Phuong Nam Dao

DOI: http://dx.doi.org/10.15439/2022R52

Citation: Proceedings of the 2022 Seventh International Conference on Research in Intelligent and Computing in Engineering, Vu Dinh Khoa, Shivani Agarwal, Gloria Jeanette Rincon Aponte, Nguyen Thi Hong Nga, Vijender Kumar Solanki, Ewa Ziemba (eds). ACSIS, Vol. 33, pages 29–32 (2022)

Full text

Abstract. In this study, the optimal tracking control problem for the quadrotor which is a highly coupling system with completely unknown dynamics is addressed based on data by introducing the reinforcement learning (RL) technique. The proposed Off-policy RL algorithm does not need any knowledge of quadrotor model. By collecting data, which is the states of quadrotor system then using an actor-critic networks (NNs) to solve the optimal tracking trajectory problem. Finally, simulation results are provided to illustrate the effectiveness of proposed method.

References

E. Altug, J. Ostrowski, and R. Mahony. Control of a quadrotor helicopter using visual feedback. In Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292), volume 1, pages 72–77 vol.1, 2002.
A. Das, F. Lewis, and K. Subbarao. Backstepping approach for controlling a quadrotor using lagrange form dynamics. Journal of Intelligent and Robotic Systems, 56:127–151, 09 2009.
Y. Li and S. Song. A survey of control algorithms for quadrotor unmanned helicopter. In 2012 IEEE Fifth International Conference on Advanced Computational Intelligence (ICACI), pages 365–369, 2012.
C. Mu, C. Sun, and W. Xu. Fast sliding mode control on air-breathing hypersonic vehicles with transient response analysis. Proceedings of the Institution of Mechanical Engineers, Part I: Journal of Systems and Control Engineering, 230, 11 2015.
A. Tayebi and S. McGilvray. Attitude stabilization of a vtol quadrotor aircraft. IEEE Transactions on Control Systems Technology, 14(3):562–571, 2006.
A. Tayebi and S. McGilvray. Attitude stabilization of a vtol quadrotor aircraft. IEEE Transactions on Control Systems Technology, 14(3):562–571, 2006.
W. Wang, H. Ma, and C.-Y. Sun. Control system design for multi-rotor mav. Journal of Theoretical and Applied Mechanics, 51:1027–1038, 01 2013.
G. Xiao, H. Zhang, Y. Luo, and H. Jiang. Data-driven optimal tracking control for a class of affine non-linear continuous-time systems with completely unknown dynamics. Iet Control Theory and Applications, 10:700–710, 2016.
W. Zhao, H. Liu, F. L. Lewis, and X. Wang. Data-driven optimal formation control for quadrotor team with unknown dynamics. IEEE Transactions on Cybernetics, pages 1–10, 2021.