
In a stationary MDP, if an agent has a 20% chance of dying at each timestep, what is the optimal value for the discount factor?

Main Tools: Graph Searching  Consistency for CSP  SLS for CSP  Deduction  Belief and Decision Networks  Decision Trees  Neural Networks  STRIPS to CSP 