n Р adjustment[D]. Chengdu: Southwest Jiaotong University, 2016. Р [14] SILVER D, LEVER G, HEESS N, et al. Deterministic policy gradient algorithms [C] //Proceedings of the International Р Conference on Machine Learning. Beijing: ACM, 2014:387-395. Р [15] LILLICRAP T P, HUNT J J, PRITZEL A, et al. Continuous control with deep reinforcement learning[EB/OL]. 2015: arXiv: Р 1509.02971[cs.LG]. https://arxiv.org/abs/1509.02971. Р [16] 杨尚彤.面向确定性策略的深度强化学习探索方法研究[D].合肥:中国科学技术大学,2021. Р YANG Shangtong. Exploration strategy of deterministic policy in deep reinforcement learning[D]. Hefei: University of Science Р and Technology of China, 2021.