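GoPipe: Streamlined Gene Ontology Annotation for Batch Anonymous Sequences With Statistics
(Chinese title, translated: GoPipe: Gene Ontology Annotation and Statistical Analysis of Batch Sequences; article published in English)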
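【Abstract (translated from the Chinese)】 With the arrival of the post-genomic era, batch sequencing, especially EST sequencing, has become routine work in ordinary laboratories. These new sequences often require batch Gene Ontology (GO) annotation followed by statistical analysis. At present, however, no software other than Goblet is suited to batch GO annotation of unknown sequences, and Goblet still has notable shortcomings because it limits upload size and relies on BLAST alone as its prediction tool. We developed a software package, GoPipe, which annotates sequences by integrating BLAST and InterProScan results and provides tools for further statistical comparison. The main program accepts any number of BLAST and InterProScan result files and carries out text parsing, data integration, redundancy removal, statistical analysis and display in sequence. Statistical tools are also provided to compare the GO distributions of different inputs in order to uncover biological meaning. In addition, in the intersection mode the program takes the intersection of the InterProScan and BLAST results; on the test dataset its specificity reached 99.1%, far exceeding that of InterProScan's own GO predictions, while sensitivity decreased only slightly. High specificity, speed and flexibility make it an ideal tool for batch Gene Ontology annotation of unknown sequences. The package is freely available at http://gopipe.fishgenome.org/ or by contacting the authors.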
【Abstract】 The accelerating availability of new sequences, especially ESTs, calls for computational methods that link sequences to Gene Ontology (GO) terms in batch mode. Currently the only program for this purpose is Goblet, an online tool that uses BLAST to annotate query sequences with GO terms but restricts uploaded sequence files to less than 100 kilobytes. GoPipe is a standalone package that integrates BLAST and InterProScan results to obtain Gene Ontology annotation, with built-in statistical options. GoPipe takes any number of BLAST and/or InterProScan output files at once and launches jobs sequentially to perform parsing, data integration, redundancy removal, calculation of GO distributions and graphic display. When the program was run in the "intersection" mode, which intersects the BLAST and InterProScan results, it achieved an annotation specificity of 99.1% on a test dataset, outperforming the specificity (81.1%) obtained from InterProScan alone. Statistical tools are also provided to compare GO distributions between different inputs, so that different sets of batch sequences can be compared and differentially represented GO terms can be easily displayed. High specificity, speed and flexibility make GoPipe an ideal tool for streamlined GO annotation of batch sequences. The package is freely available at http://gopipe.fishgenome.org/ or by contacting the authors.
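The "intersection" mode and the GO distribution step described in the abstract can be illustrated with a minimal Python sketch. This is not GoPipe's own code: the function names `merge_annotations` and `go_distribution`, the dictionary layout and the example sequence IDs are all hypothetical, and the sketch assumes that the BLAST and InterProScan outputs have already been parsed into per-sequence lists of GO identifiers.

```python
from collections import Counter

def merge_annotations(blast, interpro, mode="intersection"):
    """Combine per-sequence GO terms from two sources (hypothetical helper).

    blast, interpro: dicts mapping a sequence ID to an iterable of GO IDs.
    mode: "intersection" keeps only terms both sources agree on;
          "union" keeps terms reported by either source.
    """
    if mode not in ("intersection", "union"):
        raise ValueError("mode must be 'intersection' or 'union'")
    merged = {}
    for seq_id in set(blast) | set(interpro):
        # Converting to sets also removes redundant (repeated) terms.
        b = set(blast.get(seq_id, ()))
        i = set(interpro.get(seq_id, ()))
        terms = b & i if mode == "intersection" else b | i
        if terms:
            merged[seq_id] = terms
    return merged

def go_distribution(annotations):
    """Count how many sequences carry each GO term."""
    return Counter(term for terms in annotations.values() for term in terms)

# Illustrative input only; these are not results from the paper.
blast_hits = {
    "seq1": ["GO:0005524", "GO:0004672", "GO:0005524"],
    "seq2": ["GO:0003677"],
}
interpro_hits = {
    "seq1": ["GO:0005524", "GO:0006468"],
    "seq3": ["GO:0016020"],
}

strict = merge_annotations(blast_hits, interpro_hits, "intersection")
loose = merge_annotations(blast_hits, interpro_hits, "union")
print(strict)
print(sorted(go_distribution(loose).items()))
```

In this toy input only `seq1` keeps an annotation under the intersection mode, because it is the only sequence for which both sources report the same term. That is the trade-off the abstract describes: fewer annotations, but each one is supported by two independent predictions.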
【Key words】 Gene Ontology; functional genomics; EST; BLAST; InterProScan; GOA;
- 【Source】 Progress in Biochemistry and Biophysics (生物化学与生物物理进展), 2005, Issue 02
- 【Classification No.】Q7-3
- 【Times cited】36
- 【Downloads】656