Relations between discounted models and average models for semi-Markov decision processes


【Authors】 YIN Bao-qun (殷保群)¹, LI Yan-jie (李衍杰)¹, TANG Hao (唐昊)², DAI Gui-ping (代桂平)¹, XI Hong-sheng (奚宏生)¹

【Affiliations】 1. Department of Automation, University of Science and Technology of China, Hefei, Anhui 230026, China; 2. Department of Computer, Hefei University of Technology, Hefei, Anhui 230009, China

【Abstract】 A class of semi-Markov decision problems is discussed under the discounted-cost and average-cost performance criteria, respectively. Based on the performance potential approach, the optimality equations satisfied by the optimal stationary policies are derived. The relation between the two models is then studied, showing that the conclusions for the average model can be obtained as limits of the corresponding conclusions for the discounted model as the discount factor tends to zero.
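The limiting relation stated in the abstract can be illustrated numerically. The sketch below is not taken from the paper: it assumes a two-state continuous-time Markov chain under a fixed policy (a simple special case of a semi-Markov process), with hypothetical transition rates `a`, `b` and cost rates `f0`, `f1`. It uses the standard fact that the discounted cost multiplied by the discount factor approaches the average cost as the discount factor goes to zero; the paper's own derivation via performance potentials may be formulated differently.

```python
# Illustrative sketch only: a two-state continuous-time Markov chain under a
# fixed policy, used as a simple special case of a semi-Markov process.
# The rates a, b and cost rates f0, f1 are hypothetical, not from the paper.

def discounted_values(a, b, f0, f1, alpha):
    """Solve (alpha*I - A) v = f for the generator A = [[-a, a], [b, -b]].

    v[i] is the expected total discounted cost starting from state i,
    with costs discounted continuously at rate alpha > 0.
    """
    det = alpha * (alpha + a + b)  # determinant of alpha*I - A
    v0 = ((alpha + b) * f0 + a * f1) / det
    v1 = (b * f0 + (alpha + a) * f1) / det
    return v0, v1


def average_cost(a, b, f0, f1):
    """Long-run average cost: stationary distribution (b, a)/(a+b) times f."""
    return (b * f0 + a * f1) / (a + b)


if __name__ == "__main__":
    a, b, f0, f1 = 2.0, 3.0, 1.0, 4.0  # hypothetical example values
    eta = average_cost(a, b, f0, f1)
    for alpha in (1.0, 0.1, 0.01, 0.001):
        v0, v1 = discounted_values(a, b, f0, f1, alpha)
        # alpha * v approaches eta in both states as alpha shrinks
        print(alpha, alpha * v0, alpha * v1, eta)
```

Running the script prints, for each discount factor, the scaled discounted costs of both states next to the average cost, so the convergence can be read off directly.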

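【Funding】 National Natural Science Foundation of China (60274012, 60574065); Natural Science Foundation of Anhui Province (050420301)
  • 【Source】 控制理论与应用 (Control Theory & Applications), 2006, Issue 01
  • 【Classification code】 C934
  • 【Citation count】 7
  • 【Download count】 162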