Safe Reinforcement Learning Method Based on Robust Cross-Entropy and Gradient Optimization

Updated: 2024-09-25

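- “To help agents carry out tasks safely, researchers have proposed a new algorithm that uses a model predictive control framework and optimization methods to effectively improve agent safety and efficiency.”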
- Software Guide, Vol. 23, Issue 9, Pages 143-149 (2024)
- Author affiliation: School of Software, South China Normal University, Foshan, Guangdong 538200
- DOI: 10.11907/rjdk.231853
  CLC: TP391.4
- Published: 16 September 2024
  Received: 10 August 2023
ZHOU Xianwei (周娴玮), ZHANG Kun (张锟), YE Xin (叶鑫). Safe Reinforcement Learning Method Based on Robust Cross-Entropy and Gradient Optimization [J]. Software Guide (软件导刊), 2024, 23(09): 143-149. DOI: 10.11907/rjdk.231853.
