AIspace

In a stationary MDP, if an agent has a 20% chance of dying at each timestep, what is the optimal value for the discount factor?

The discount factor can be interpretted as the probability that the agent keeps going. Therefore the optimal value of the discount factor would be 0.8.