Choices on individual test tasks were not explained by model-free perseveration.

<p>A model-free algorithm learns the optimal policies on training tasks but has no way to evaluate their success on test tasks. The result is that the algorithm reuses the optimal training policies in an unselective fashion. Each plot shows the proportion of choices (y-axis) that lead to each...

وصف كامل

محفوظ في:

التفاصيل البيبلوغرافية
المؤلف الرئيسي:	Sam Hall-McMaster (10343795) (author)
مؤلفون آخرون:	Momchil S. Tomov (8677314) (author), Samuel J. Gershman (8677326) (author), Nicolas W. Schuck (6260720) (author)
منشور في:	2025
الموضوعات:	Neuroscience Science Policy Mental Health Environmental Sciences not elsewhere classified Biological Sciences not elsewhere classified Information Systems not elsewhere classified human participants (< computational process based based control using successor representation known possible neural implementation increased behavioral reuse humans reuse strategies learned optimal solutions generalized policy improvement div >< p novel test tasks test tasks successor features neural evidence training tasks earlier tasks selective manner prefrontal cortex past experience new task n </ intelligent systems important feature gpi solutions gpi algorithm gpi ). functional connection free perseveration findings point evaluate solutions adaptive algorithm
الوسوم:	إضافة وسم لا توجد وسوم, كن أول من يضع وسما على هذه التسجيلة!

Choices on individual test tasks were not explained by model-free perseveration.

مواد مشابهة