The overall framework of the TPDEB.

To address the inefficiencies in sample utilization and policy instability in asynchronous distributed reinforcement learning, we propose TPDEB, a dual experience replay framework that integrates prioritized sampling and temporal diversity. While recent distributed RL systems have...

Bibliographic Details
Main Author: Teh Noranis Mohd Aris (22600931) (author)
Other Authors: Ningning Chen (509273) (author), Norwati Mustapha (17029699) (author), Maslina Zolkepli (22600934) (author)
Published: 2025