Criteria for selecting the relaxation factor of the value iteration algorithm for undiscounted Markov and semi-Markov decision processes

Meir Herzberg, Uri Yechiali

Research output: Contribution to journalArticlepeer-review

5 Scopus citations

Abstract

We present two criteria for selecting the adaptive relaxation factor being used in speeding-up the value iteration algorithm for undiscounted Markov decision processes. The criteria are: 1. Minimum Ratio, 2. Minimum Variance. For the problems tested it was found that the criterion of minimum variance is most effective for Markov models while an hybrid use of both criteria yields best results when solving semi-Markov decision problems. The advantage of using these criteria increases with the dimension of the models.

Original languageEnglish
Pages (from-to)193-202
Number of pages10
JournalOperations Research Letters
Volume10
Issue number4
DOIs
StatePublished - Jun 1991
Externally publishedYes

Keywords

  • Markov and semi-Markov decision processes
  • adaptive relaxation factor
  • value iteration algorithm

Fingerprint

Dive into the research topics of 'Criteria for selecting the relaxation factor of the value iteration algorithm for undiscounted Markov and semi-Markov decision processes'. Together they form a unique fingerprint.

Cite this