节点文献

粗糙脉孢菌基因组中的微卫星序列的组成和分布

Frequency and Distribution of Microsatellites in the Genome of Filamentous Fungus, Neurospora crassa

  • 推荐 CAJ下载
  • PDF下载
  • 不支持迅雷等下载工具,请取消加速工具后下载。

【作者】 李成云李进斌周晓罡张绍松许明辉

【Author】 LI Cheng-yun1, LI Jin-bin2, ZHOU Xiao-gang1, ZHANG Shao-song1,XU Ming-hui1 (1Biotechnology Research Institute of Yunnan Academy of Agricultural Science/Key Laboratory of Plateau Agrotechnology of Agriculture, Ministry of Agriculture, Kunming 650223 ; Plant Protection Research Institute, Yunnan Academy of Agricultural Sciences, Kunming 650205) 2

【机构】 云南省农业科学院生物技术研究所/农业部南方高原农业生物技术重点实验室云南省农业科学院植物保护研究所云南省农业科学院生物技术研究所/农业部南方高原农业生物技术重点实验室 昆明650223昆明650205昆明650223昆明650223

【摘要】 利用已经公布的粗糙脉孢菌基因组测序结果,对该真菌基因组中的微卫星(SSR)序列进行了系统分析。结果表明,在已经公布的38.0 Mb 的基因组序列中,共有14 788 个以1~6个核苷酸为基序的 SSR序列(长度大于15 bp,匹配值大于80%),其碱基总数占整个基因组碱基数的 0.95%,平均2.57 kb 就分布有一个大于15 bp的SSR。其中数量最多的三碱基 SSR,数量达到 4 729个,其次为六碱基 SSR (2 940个)和单碱基 SSR(2 489 个),这3 类SSR 总数达10 158 个,占 SSR总数的68.7%。数量最少的是二碱基SSR,只有691 个。在可读框(ORF)中的SSR总数为 4 094个,共分布于2 373个 ORF中,其中只有1 个SSR 的ORF为 1 056个。与其它生物内 SSR的分布类似,在基因编码区中,以三碱基SSR和六碱基SSR占绝对优势,分别为基因组中三碱基和六碱基SSR总数的54.1%和48.8%,由于ORFs和编码区的碱基总数分别为该菌基因组碱基总数的约46%和38.3%, 所以这两种长度的SSR在编码区中的密度高于基因组中的平均密度。ORF上下游300 bp调控区域内是各类SSR相对的富集区。尤其是上游区域中的五碱基SSR,为平均密度的3倍,二碱基SSR和四碱基SSR的密度也是基因组中平均密度的2倍多。在下游调控序列中,五碱基、四碱基、二碱基、单碱基 SSR的密度,也大大超过了在基因?

【Abstract】 A total of 38.0 Mb of publicly available DNA sequence in Neurospora crassa was researched for mono- to hexanucleotide simple sequence repeat (SSR or microsatellite) to determine the type, size and frequency. A total of 14 788 SSRs were observed in the whole genomic DNA sequence, about one every 2.57 kb, with the criteria of SSR length >15 bp and 80% matches. The most abundant microsatellite were trinucleotide repeat, the number was 4 729, followed by hexanucleotide and mononucleotide repeats, the numbers were 2 940 and 2 489 respectively, and the least abundance was dinucleotide repeat, only 691 were found. Among the 10 082 ORFs, 4 094 SSRs were harbored in 2 373 ORFs no intron) ( of the organism. Onethousand and fifty six ORFs kept only one SSR. Similar with other organisms, tri- and hexanucleotide repeats were predominant in ORFs, 54.1% and 48.8% of tri- and hexanucleotide repeats were distributed in ORF region. The density of these two motifs was overpresented in coding regions, because ORF region and coding region constitutes only 46% and 38.3% of genomic sequence respectively. Upstream and downstream 300 bp of regulatory regions were high density regions of SSRs, particularly density of pentanucleotide SSR in upstream region was as high asfive times of average density in genomic DNA, density of di- and tetranucleotide SSR was also more than two times of average density. The density of penta-, tetra-, di- and mononucleotide SSRs was relatively higher than average density, also. There were 47 SSRs in mitochondria 64 840 bp DNA sequence, their distribution is similar with genomic DNA sequence. These results suggested that SSRs were clustered in regulatory regions of genomic DNA.

【基金】 云南省自然科学基金重点资助项目(1999C0008Z );国家自然科学基金资助项目(30360061 )
  • 【文献出处】 中国农业科学 ,Scientia Agricultura Sinica , 编辑部邮箱 ,2004年06期
  • 【分类号】S188
  • 【被引频次】26
  • 【下载频次】164
节点文献中: 

本文链接的文献网络图示:

本文的引文网络