浏览全部资源
扫码关注微信
华南师范大学 软件学院, 广东 佛山 538200
Published:16 September 2024,
Received:10 August 2023,
移动端阅览
周娴玮,张锟,叶鑫.基于鲁棒交叉熵与梯度优化的安全强化学习方法[J].软件导刊,2024,23(09):143-149.
ZHOU Xianwei,ZHANG Kun,YE Xin.Safe Reinforcement Learning Method Based on Robust Cross-Entropy and Gradient Optimization[J].Software Guide,2024,23(09):143-149.
周娴玮,张锟,叶鑫.基于鲁棒交叉熵与梯度优化的安全强化学习方法[J].软件导刊,2024,23(09):143-149. DOI: 10.11907/rjdk.231853.
ZHOU Xianwei,ZHANG Kun,YE Xin.Safe Reinforcement Learning Method Based on Robust Cross-Entropy and Gradient Optimization[J].Software Guide,2024,23(09):143-149. DOI: 10.11907/rjdk.231853.
0
Views
下载量
CSCD
Publicity Resources
Related Articles
Related Author
Related Institution