Averaged Soft Actor-Critic for Deep Reinforcement Learning

Complexity 2021:1-16 (2021)
  Copy   BIBTEX

Abstract

With the advent of the era of artificial intelligence, deep reinforcement learning has achieved unprecedented success in high-dimensional and large-scale artificial intelligence tasks. However, the insecurity and instability of the DRL algorithm have an important impact on its performance. The Soft Actor-Critic algorithm uses advanced functions to update the policy and value network to alleviate some of these problems. However, SAC still has some problems. In order to reduce the error caused by the overestimation of SAC, we propose a new SAC algorithm called Averaged-SAC. By averaging the previously learned action-state estimates, it reduces the overestimation problem of soft Q-learning, thereby contributing to a more stable training process and improving performance. We evaluate the performance of Averaged-SAC through some games in the MuJoCo environment. The experimental results show that the Averaged-SAC algorithm effectively improves the performance of the SAC algorithm and the stability of the training process.

Other Versions

No versions found

Links

PhilArchive



    Upload a copy of this work     Papers currently archived: 100,937

External links

Setup an account with your affiliations in order to access resources via your University's proxy server

Through your library

Similar books and articles

Ga により探索空間の動的生成を行う Q 学習.Matsuno Fumitoshi Ito Kazuyuki - 2001 - Transactions of the Japanese Society for Artificial Intelligence 16:510-520.
On the correction of errors in English grammar by deep learning.Xiaorui Yue & Yanghui Zhong - 2022 - Journal of Intelligent Systems 31 (1):260-270.
Qdsega による多足ロボットの歩行運動の獲得.Matsuno Fumitoshi Ito Kazuyuki - 2002 - Transactions of the Japanese Society for Artificial Intelligence 17:363-372.

Analytics

Added to PP
2021-04-02

Downloads
15 (#1,232,057)

6 months
4 (#1,247,093)

Historical graph of downloads
How can I increase my downloads?

Author Profiles

Jing Gao
Lan Zhou University
Li Peng
Abilene Christian University

Citations of this work

No citations found.

Add more citations

References found in this work

No references found.

Add more references