Complexity KNN Rules Based Prediction of Protein Subcellular Locations

【Authors】 LI Bin (1), LI Yibing (2), HE Hongbo (2)

【Affiliations】 (1) School of Information Science and Engineering, Central South University, Changsha 410083; (2) School of Physics Science and Technology, Central South University, Changsha 410083

【Abstract】 This paper proposes a method for predicting protein subcellular location types based on the Lempel-Ziv (LZ) complexity similarity of symbolic sequences and the K-nearest-neighbor (KNN) rule. Unlike many other feature parameters, the LZ complexity similarity between protein sequences can be computed without in-depth biological domain knowledge and without auxiliary data beyond the sequences themselves. In addition, because KNN is a lazy learner, the method accommodates a dynamically growing set of proteins whose subcellular locations are known. In 10-fold cross-validation on the standard RH dataset, the overall prediction accuracy of the method exceeds that of four comparison methods.
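The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not give the exact similarity formula, so the normalized LZ-based distance below is an assumption, and the names `lz_complexity`, `lz_distance`, and `knn_predict` are hypothetical. The sketch counts phrases in a Lempel-Ziv (1976) style parsing, turns the counts into a distance between two sequences, and classifies a query by majority vote among its k nearest labeled sequences.

```python
from collections import Counter


def lz_complexity(s: str) -> int:
    """Count the phrases in a Lempel-Ziv (1976) style parsing of s."""
    i, count, n = 0, 0, len(s)
    while i < n:
        length = 1
        # Grow the phrase while it can still be copied from earlier text
        # (the copy may overlap the phrase itself, but must start before i).
        while i + length <= n and s[i:i + length] in s[:i + length - 1]:
            length += 1
        count += 1
        i += length
    return count


def lz_distance(s: str, q: str) -> float:
    """Assumed normalized LZ distance; the paper's own measure may differ."""
    if not s or not q:
        raise ValueError("sequences must be non-empty")
    cs, cq = lz_complexity(s), lz_complexity(q)
    # How many new phrases each sequence adds when appended to the other.
    extra = max(lz_complexity(s + q) - cs, lz_complexity(q + s) - cq)
    return extra / max(cs, cq)


def knn_predict(query: str, training: list[tuple[str, str]], k: int = 3) -> str:
    """Majority vote over the k labeled sequences closest to the query."""
    nearest = sorted(training, key=lambda pair: lz_distance(query, pair[0]))[:k]
    votes = Counter(label for _, label in nearest)
    return votes.most_common(1)[0][0]
```

Because KNN stores the labeled sequences and defers all computation to prediction time, adding a newly annotated protein only means appending one `(sequence, label)` pair to `training`; nothing has to be retrained.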

  • 【Source】 Computer Engineering (计算机工程), 2007, Issue 07
  • 【Classification No.】 TP183
  • 【Citations】 3
  • 【Downloads】 133