Showing 1 - 20 results of 3,757 for search '(((( learning task decrease ) OR ( _ point decrease ))) OR ( _ largest decrease ))', query time: 0.39s
  11.

    Deep reinforcement learning agents trained using a curriculum solve navigation tasks with delayed rewards. by William L. Tong (22238845)

    Published 2025
    “…C: ADP outperforms INC and RAND (each teacher-student interaction is a step). The agent does not learn the task without a curriculum. Results are plotted from 5 repeats. …”