Deep reinforcement learning (DRL) integrates the feature representation ability of deep learning with the decision-making ability of reinforcement learning so that it can achieve powerful end-to-end learning control capabilities. In addition to value-based and policy-based DRL algorithms, the advances in maximum entropy-based DRL are summarized.
This research advances how AI systems learn, reason, and solve problems — with direct implications for automation and scientific discovery.
Read the full paper
Access the original peer-reviewed research via OpenAlex.
| Category | 🤖 Artificial Intelligence |
| Published | Sep 28, 2022 |
| Journal | IEEE Transactions on Neural Networks and Learning Systems |
| Authors | Xu Wang, Sen Wang, Xingxing Liang, Dawei Zhao, Jincai Huang |
| DOI | 10.1109/tnnls.2022.3207346 |
| Citations | 763 |
| Source | OpenAlex |