全文预览

基于DDPG算法的列车节能控制策略研究 武晓春

上传者:科技星球 |  格式:pdf  |  页数:13 |  大小:0KB

文档介绍
n Р adjustment[D]. Chengdu: Southwest Jiaotong University, 2016. Р [14] SILVER D, LEVER G, HEESS N, et al. Deterministic policy gradient algorithms [C] //Proceedings of the International Р Conference on Machine Learning. Beijing: ACM, 2014:387-395. Р [15] LILLICRAP T P, HUNT J J, PRITZEL A, et al. Continuous control with deep reinforcement learning[EB/OL]. 2015: arXiv: Р 1509.02971[cs.LG]. https://arxiv.org/abs/1509.02971. Р [16] 杨尚彤.面向确定性策略的深度强化学习探索方法研究[D].合肥:中国科学技术大学,2021. Р YANG Shangtong. Exploration strategy of deterministic policy in deep reinforcement learning[D]. Hefei: University of Science Р and Technology of China, 2021.

收藏

分享

举报
下载此文档