Effect of hyper-parameters on mean and variance of rewards in the first 100 episodes.