节点文献

中国对虾(Fenneropenaeus chinensis)基因组微卫星特征分析

ANALYSIS OF MICROSATELLITE SEQUENCES IN CHINESE SHRIMP FENNEROPENAEUS CHINENSIS GENOME

  • 推荐 CAJ下载
  • PDF下载
  • 不支持迅雷等下载工具,请取消加速工具后下载。

【作者】 高焕刘萍孟宪红王伟继孔杰

【Author】 GAO Huan, LIU Ping , MENG Xian-Hong , WANG Wei-Ji , KONG Jie (Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Qingdao, 266071; Institute of Oceanology, Chinese Academy of Sciences, Qingdao, 266071; Graduate School, Chinese Academy of Sciences, Beijing, 100039)  (Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, Qingdao, 266071)

【机构】 中国水产科学研究院黄海水产研究所,中国水产科学研究院黄海水产研究所,中国水产科学研究院黄海水产研究所,中国水产科学研究院黄海水产研究所,中国水产科学研究院黄海水产研究所 青岛266071中国科学院海洋研究所青岛266071中国科学院研究生院北京100039,青岛266071,青岛266071,青岛266071,青岛266071

【摘要】 对中国对虾基因组随机测序 ,获得了总长度约为 641 0 0 0个碱基的基因组DNA序列 ,从中找到 1 362个重复序列。其中 ,两碱基重复类型的重复数目最多 ( 985个 ) ,占重复序列总数目的 72 .32 % ;其次是三碱基 ( 1 49个 )和四碱基 ( 1 0 2个 ) ,分别占重复序列总数目的 1 0 .94%和7.49%。另外 ,六碱基重复 34个 ,单碱基重复 5 0个 ,五碱基重复 5个 ,分别占重复序列总数目的 2 .5 0 %、3.67%、0 .37%。在单碱基重复类型中 ,重复拷贝类别为A的重复数目最多 ;两碱基重复类型中 ,AT重复数目最多 ,其次是AC和AG ;三碱基重复类型中以AAT重复拷贝类别最多 ,其次是AAG和ATC ;四碱基重复类型中 ,AGAT重复数目最多 ;五碱基重复类型只发现了AGAGA、GAGGC、TCTTC和TTTCT四种重复拷贝类别 ;六碱基重复中以ATTATC重复数目最多。其中一些序列已经提交GeneBank注册 ,注册号为AY5 4 5 898 AY5 4 5 91 3。中国对虾基因组二碱基重复类型中以不完全 (Imperfect)形式的微卫星序列为主 ,其中GC重复拷贝类别的重复数目很少。利用 8对微卫星引物对 60个个体遗传多样性分析 ,共获得了 60个等位基因 ,因此认为微卫星技术在中国对虾基因组研究中具有较好的应用前景。

【Abstract】 By sequencing randomly, 3699 clones of sequences in the genome of Fenneropenaeus chinensis were obtained. Then, using software DNASTAR (Version 5.0) to assembly all of the clones, 1520 clones independent of each other, were made in which the length of DNA sequences is about 641,000 bp in total. With the help of the bio-soft Tandem Repeats Finder (Version 2.02), 1362 microsatellite repeat sequences are found in the sequences. Criterions to distinguish the repeat sequences of mono-, di-, tri, tetra-, pentra-, hexanucleotide are that the copy numbers of the motif composed of the mono-, di-, tri, tetra-, pentra-, hexanucleotide are ≥ 14, 7, 5, 4, 3 and 3, respectively. In the 1362 repeat sequences, the numbers of the dinucleotide repeats are 985, and most (72.32%) among all of the repeat sequences; the second are the trinucleotide repeats, 149 (10.94%); the third is the tetranucleotide repeats, 102 (7.49%); the forth is the mononucleotide repeats, 50 (3.67%); the fifth is the hexanucleotide repeats, 34 (2.50%); the sixth is the petranucleotide repeats, 5 (0.37%). Numbers of repeat sequences that composed of the motif of A are 43, accounting for 86%, and most among the mononucleotide repeats. In dinucleotide repeat, the numbers of AT repeats are 418, the most, accounting for 42.44%; and the second and third are AC and AG repeats, 339 (34.42%) and 228 (23.15%) respectively. Seven classes of repeat sequences that respectively composed of the motif AAT, AAG, ATC, AGG, AAC, ACT and ACC, are found in the trinucleotide repeats, in which the numbers of AAT repeats are 75, the most; the second are AAG(24); the others are ATC(22), AGG(14), AAC(8), ACT(5) and ACC(1) in turn. AGAT and ATTATC repeats are the most ones in tetranucleotide and hexanucleotide respectively. Both classes and copy number of repeat units are few in pentranucleotide; and there are together four classes: AGAGA, GAGGC, TCTTC and TTTCT. Some of the above sequences are referred to the GeneBank, and the numbers of accession are AY545898—AY545913.The reason of fewer GC dinucleotide repeats are also discussed in the article. Two possible answers are that: one is methylation of C in CpG islands resulting in the mutation of C-T; and another is that it is difficult to sequence the GC repeat sequences.Distributions of copy numbers in different types of repeat sequences are as follows: copy numbers of mononucleotide repeats are mainly between 20 and 29, accounting for 64%; copy numbers of dinucleotide are mainly between 10 and 29, accounting for 60.71%; copy numbers of trinucleotide repeats are mainly between 5 and 19, accounting for 79.19%; copy numbers of tetre-, pentra- and hexanucleotide repeats together are mainly between 3 and 10. In general, the lengths of microsatellite repeat sequences are mainly between 20 to 60 bp. Among the sequences, the numbers of imperfect sequences are predominance. Based on the above point, it is believed that the nucleotide mutation of microsatellite locations are accumulated largely in a long term of evolution; and there would be abundant polymorphism in these locations. In fact, we get 60 alleles in 8 microsatellite locations, using 8 pairs of microsatellite primers to amplify the genome of 60 individuals by PCR technology. Therefore, it would be very practical to use microsatellite to study the genome of F. chinensis.

【关键词】 微卫星中国对虾基因组
【Key words】 MicrosatellitesFenneropenaeus chinensisGenome
【基金】 国家重点基础研究发展规划 (973)资助项目 ,G19990 12 0 0 7号 ;国家高技术研究发展计划 (86 3)资助项目 ,2 0 0 3AA6 0 30 2 1号
  • 【文献出处】 海洋与湖沼 ,Oceanologia Et Limnologia Sinica , 编辑部邮箱 ,2004年05期
  • 【分类号】Q953
  • 【被引频次】87
  • 【下载频次】415
节点文献中: 

本文链接的文献网络图示:

本文的引文网络