Safe Reinforcement Learning Method Based on Robust Cross-Entropy and Gradient Optimization
Updated: 2024-09-25
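    • In the area of safe task execution by agents, researchers propose a new algorithm that uses a model predictive control framework and optimization methods to effectively improve the agent's safety and efficiency.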
    • Software Guide   Vol. 23, Issue 9, Pages: 143-149(2024)
    • DOI:10.11907/rjdk.231853    

      CLC: TP391.4
    • Published:16 September 2024

      Received:10 August 2023

  • ZHOU Xianwei,ZHANG Kun,YE Xin.Safe Reinforcement Learning Method Based on Robust Cross-Entropy and Gradient Optimization[J].Software Guide,2024,23(09):143-149. DOI: 10.11907/rjdk.231853.


