节点文献
高性能计算机容错技术综述
A survey on Fault Tolerance Technology in HPC systems
【Author】 Zhang Lu -fei~1,Cheng Hua~(1,2) (1.Jiangnan Institute of Computing Technology,Wuxi 214083,China;2.Institute of Computing Technology,Chinese Academy of Sciences/Graduate School of the Chinese Academy of Sciences,Beijing 100080,China.)
【机构】 江南计算技术研究所; 中国科学院计算技术研究所; 中国科学院研究生院;
【摘要】 随着高性能计算机系统规模的不断增大,系统的故障愈加频发,这对系统的性能发挥造成严重的威胁。本文首先介绍了高性能计算机系统可用性的概念,并对系统失效原因作了简单的分析;其次介绍了一些传统的容错技术,并对这些技术面临的挑战作了分析;然后介绍了一些新的容错技术和他们在高性能计算机系统上的应用;最后做了总结。
【Abstract】 As Green Operating System plays the most important integral role of Green Computing,it should follow the development of computer architecture and application running on which,to be more energy efficient and environment friendly.This paper explores the role,the functional characteristics and implementation method of Green Operating System in heterogeneous many-core platforms,power-efficient clusters and cloud computing.Then some key ideas in operating system power management are introduced.Finally,several challenges and open issues in Green Operating System are summarized.
【Key words】 Green Computing; Green Computing system; heterogeneous many-core platforms; power-efficient clusters; cloud computing; power management;
- 【会议录名称】 2010通信理论与技术新发展——第十五届全国青年通信学术会议论文集(下册)
- 【会议名称】2010通信理论与技术新发展——第十五届全国青年通信学术会议
- 【会议时间】2010-08-06
- 【会议地点】中国云南昆明
- 【分类号】TP302.8
- 【主办单位】中国通信学会