Resulting strategies of learning agents, showing the top strategy learned and the proportion of runs this was the resulting strategy for the parameters LR=0.9, DR=0.1.
<p>For expectation, means the Beneficiary does not signal, and the Donor keeps the resource. Valid for any value of <i>p</i>.</p>
Saved in:
| Main Author: | |
|---|---|
| Other Authors: | |
| Published: |
2025
|
| Subjects: | |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|