Search alternatives:
policy optimization » topology optimization (Expand Search), process optimization (Expand Search)
wolf optimization » whale optimization (Expand Search), swarm optimization (Expand Search), _ optimization (Expand Search)
primary role » primary care (Expand Search), primary goal (Expand Search)
binary basic » binary mask (Expand Search)
role policy » crime policy (Expand Search), broad policy (Expand Search)
policy optimization » topology optimization (Expand Search), process optimization (Expand Search)
wolf optimization » whale optimization (Expand Search), swarm optimization (Expand Search), _ optimization (Expand Search)
primary role » primary care (Expand Search), primary goal (Expand Search)
binary basic » binary mask (Expand Search)
role policy » crime policy (Expand Search), broad policy (Expand Search)
-
1
-
2
-
3
Mean scores of the last 100 episodes using PPO and DQN in Unity environment and ViZDoom.
Published 2025Subjects: -
4
-
5
-
6
-
7
The images shown correspond the downscaled images an agent receives in Unity environment.
Published 2025Subjects: -
8
Mean reward of using only pixels and pixels with raw audio samples in ViZDoom environment.
Published 2025Subjects: -
9
-
10
Average success rate (%) of the test experiment on ViZDoom and Unity environments.
Published 2025Subjects: -
11
The mean reward results of using visual information and audio samples in the ViZDoom environment.
Published 2025Subjects: -
12
-
13