Effect of hyper-parameters on mean and variance of rewards in the first 100 episodes.

<p>(a) Results on different discount factors and batch sizes, (b) Results on different learning rates and batch sizes.</p>

Saved in:
Bibliographic Details
Main Author: Shoudao Sun (21439645) (author)
Other Authors: Yi Lu (6211) (author), Di Wu (23906) (author), Guangyan Zhang (2072143) (author)
Published: 2025
Subjects:
Tags: Add Tag
No Tags, Be the first to tag this record!