节点文献
基于分层强化学习的低过载比拦截制导律
Intercept Guidance Law with a Low Acceleration Ratio Based on Hierarchical Reinforcement Learning
【摘要】 为解决低过载比和纯角度量测等约束下的三维机动目标拦截制导问题,提出了一种基于分层强化学习的拦截制导律。首先将问题建模为马尔科夫决策过程模型,并考虑拦截能量消耗与弹目视线角速率,设计了一种启发式奖赏函数。其次通过构建具有双层结构的策略网络,并利用上层策略规划阶段性子目标来指导下层策略生成所需的制导指令,实现了拦截交战过程中的视线角速率收敛,以保证能成功拦截机动目标。仿真结果验证了所提出的方法较增强比例导引具有更高的拦截精度和拦截概率,且拦截过程的需用过载更低。
【Abstract】 This paper has proposed an intercept guidance law based on hierarchical reinforcement learning to solve the three-dimensional maneuvering target intercept guidance problem with constraints of low acceleration ratio and bearingsonly measurement. The aforementioned problem was initially modelled using a Markov decision process model, where a heuristic reward function was applied considering both the energy consumption and the missile-to-target line of sight(LOS) angular rate. Besides, the policy of two levels was built up with the lower-level policy generating the required guidance command and being supervised by subgoals that were instructed by the higher levels, allowing the convergence of the LOS angular rate and guaranteeing the successful interception against a maneuvering target. Simulation results have validated the superiority of the proposed method compared with the augmented proportional navigation guidance law in terms of intercept accuracy and hit probability, and its required acceleration ratio is much lower.
【Key words】 guidance law; maneuvering target intercept; low acceleration ratio; hierarchical reinforcement learning;
- 【文献出处】 空天防御 ,Air & Space Defense , 编辑部邮箱 ,2024年01期
- 【分类号】TJ765.3;TP18
- 【下载频次】86