Abstract
Accelerating procedures for solving discounted Markov decision processes problems are developed based on a one-step lookahead analysis of the value iteration algorithm. We apply the criteria of minimum difference and minimum variance to obtain good adaptive relaxation factors that speed up the convergence of the algorithm. Several problems (including Howard's automobile replacement) are tested and a preliminary numerical evaluation reveals considerable reductions in computation time when compared to existing value iteration schemes.
Original language | English |
---|---|
Pages (from-to) | 940-946 |
Number of pages | 7 |
Journal | Operations Research |
Volume | 42 |
Issue number | 5 |
DOIs | |
State | Published - 1994 |
Externally published | Yes |