节点文献
半Markov决策过程折扣模型与平均模型之间的关系
Relations between discounted models and average models for semi-Markov decision processes
【摘要】 首先分别在折扣代价与平均代价性能准则下,讨论了一类半M arkov决策问题.基于性能势方法,导出了由最优平稳策略所满足的最优性方程.然后讨论了两种模型之间的关系,表明了平均模型的有关结论,可以通过对折扣模型相应结论取折扣因子趋于零时的极限来得到.
【Abstract】 The semi-Markov decision problems are discussed for discounted-cost and average-cost performance criteria,respectively.Based on a potential approach,the optimality equations satisfied by the optimal stationary policies are derived.Then the relation between the discounted model and average model is studied.It shows that the related conclusions for the average model can be obtained by taking the limits of results about the discounted model as the discounted factor tends to zero.
【关键词】 半Markov决策过程;
折扣模型;
平均模型;
最优性方程;
最优平稳策略;
【Key words】 semi-Markov decision processes; discounted model; average model; optimality equation; optimal stationary policy;
【Key words】 semi-Markov decision processes; discounted model; average model; optimality equation; optimal stationary policy;
【基金】 国家自然科学基金资助项目(60274012,60574065);安徽省自然科学基金资助项目(050420301)
- 【文献出处】 控制理论与应用 ,Control Theory & Applications , 编辑部邮箱 ,2006年01期
- 【分类号】C934
- 【被引频次】7
- 【下载频次】162