Visualization of the gaming behavior of humans and Ape-X.

<p>To provide a visualization of the action selection by humans and the DQNs, we created a video, which can be found in the GitHub repository at <a href="https://github.com/SHaberland15/Arcade_DQN_Research" target="_blank">https://github.com/SHaberland15/Arcade_DQN_Re...

Full description

Saved in:

Bibliographic Details
Main Author:	Sabine Haberland (22783613) (author)
Other Authors:	Hannes Ruge (5815964) (author), Holger Frimmel (22783616) (author)
Published:	2025
Subjects:	Sociology Science Policy Mental Health Biological Sciences not elsewhere classified xlink "> humans significant performance improvements remained unclear whether recorded motor responses humans transform high appropriate motor responses conventional experimental approach human motor responses playing arcade games grained temporal scale human behavior across dqn &# 8217 continuous visual stimuli rl ), enable compare prediction accuracy modeling human behavior third baseline dqn model human behavior continuous experimental tasks deep learning used human behavior experimental tasks prediction accuracy deep rl baseline dqn human participants human data used features three games temporal resolution improved modeling everyday tasks deep q better prediction varying degrees trial structure thereby opening term memory results suggest response probabilities reinforcement learning long short linear model interesting avenue future research dqns ), double q directed actions considerably improved chance level
Tags:	Add Tag No Tags, Be the first to tag this record!

Description
Summary:	<p>To provide a visualization of the action selection by humans and the DQNs, we created a video, which can be found in the GitHub repository at <a href="https://github.com/SHaberland15/Arcade_DQN_Research" target="_blank">https://github.com/SHaberland15/Arcade_DQN_Research</a>. The video provides a comparison between the actions chosen by the participant and those by Ape-X. The left plot in the video depicts the gaming behavior of a randomly selected participant playing Space Invaders. On the right plot, each bar represents the Q-values for every frame, each associated with one of the six possible types of actions. These values have been preprocessed using a softmax function, enabling them to be interpreted as probabilities. The actions performed by the subject in the current frame, as shown on the left, are highlighted by the purple-colored bar. The action chosen by the DQN for a particular frame, indicated by the maximum Q-value across all types of actions, does not always align with the action chosen by the subject. Therefore, as the second step in our analysis, we introduced a GLM that fits the generated time series of features generated by the DQN to the time series of human actions.</p> <p>(AVI)</p>

Visualization of the gaming behavior of humans and Ape-X.

Similar Items