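The present invention relates to a method for site-directed genome knockout.
Background Art:
The emergence and development of CRISPR-Cas9 (clustered regularly interspaced short palindromic repeats-CRISPR-associated protein 9) technology has provided a powerful means of genome editing that is widely applied in many tissues and cells. The CRISPR/Cas9 protein-RNA complex is directed to the target site by a guide RNA and cleaves it to produce a DNA double-strand break (DSB), after which the organism activates its DNA repair machinery to repair the DSB. There are generally two repair mechanisms: non-homologous end joining (NHEJ) and homology-directed repair (HDR). NHEJ usually predominates: the cell rejoins the broken ends while randomly inserting or deleting a variable number of bases, which causes a frameshift or a premature stop codon in the coding sequence, thereby altering the protein structure, impairing gene function, and achieving site-directed gene knockout.
To improve working efficiency and reduce cost, raising site-directed knockout efficiency has long been an important technical goal in plant genome editing. In addition, Streptococcus pyogenes Cas9 (SpCas9), which is commonly used in plants, has a certain off-target effect during genome editing. Unlike in animals, off-target sites in plants can be removed in the progeny by genetic segregation; however, some potential off-target sites may be unknown, which makes them difficult to remove deliberately in the progeny. Reducing off-target effects has therefore also long been an important technical goal in plants.
Summary of the Invention:
The object of the present invention is to provide a method for site-directed genome knockout.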
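The present invention provides a gene editing system comprising a Cas9 nuclease and an sgRNA;
the Cas9 nuclease is as shown in sequence 4 of the sequence listing;
the nucleotide sequence of the sgRNA is as shown in positions 445-530 from the 5' end of sequence 2 of the sequence listing.
The present invention also protects a recombinant expression vector, expression cassette, recombinant cell, or recombinant bacterium expressing the gene editing system.
The expression cassette expressing the Cas9 nuclease may specifically be expression cassette A.
When the target sequences are as shown in Table 1, the expression cassette expressing the sgRNA may specifically be expression cassette B.
The present invention also protects a recombinant expression vector for gene knockout, comprising expression cassette A and expression cassette B; expression cassette A expresses the Cas9 nuclease; expression cassette B comprises n elements B; each element B comprises the sgRNA and a target sequence; the recombinant expression vector can target n different target sequences for gene knockout.
Element B further comprises the nucleotide sequence of a pre-tRNA; the nucleotide sequence of the pre-tRNA is as shown in positions 344-420 from the 5' end of sequence 2 of the sequence listing.
From the 5' end, element B consists in order of the pre-tRNA nucleotide sequence, the target sequence, and the sgRNA nucleotide sequence.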
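The 5'-to-3' order of element B, and its repetition for n different targets, can be sketched as follows. This is a minimal illustration rather than the construction procedure itself: the function names are hypothetical, the two targets are made-up 20-nt strings, and the pre-tRNA and sgRNA arguments are placeholders of matching length (77 nt and 86 nt, the lengths implied by positions 344-420 and 445-530 of sequence 2), not the real sequences.

```python
import re


def build_element_b(pre_trna: str, target: str, sgrna: str) -> str:
    """Concatenate one element B in 5'->3' order: pre-tRNA, target, sgRNA."""
    if not re.fullmatch(r"[acgt]+", target):
        raise ValueError(f"target must contain only a/c/g/t: {target!r}")
    return pre_trna + target + sgrna


def build_array(pre_trna: str, sgrna: str, targets: list[str]) -> str:
    """Join n elements B, one per target, into a single tandem array."""
    if len(set(targets)) != len(targets):
        raise ValueError("the n targets are expected to be different")
    return "".join(build_element_b(pre_trna, t, sgrna) for t in targets)


# Placeholders only: correct lengths, NOT the real pre-tRNA / sgRNA sequences.
PRE_TRNA = "n" * 77
SGRNA = "n" * 86
targets = ["acgt" * 5, "ttgca" * 4]  # hypothetical 20-nt targets

array = build_array(PRE_TRNA, SGRNA, targets)
print(len(array))  # 2 * (77 + 20 + 86) = 366
```

In expression cassette B, an array built this way would sit between promoter B and terminator B.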
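In any of the above, expression cassette A uses promoter A to drive expression of the gene encoding the Cas9 nuclease. From the 5' end, expression cassette A comprises in order promoter A, the gene encoding the Cas9 nuclease, and terminator A. Promoter A may specifically be the OsUBQ3 promoter, whose nucleotide sequence is as shown in positions 1-1714 from the 5' end of sequence 3 of the sequence listing. Terminator A may specifically be the CaMV 35S terminator, whose nucleotide sequence is as shown in positions 5999-6193 from the 5' end of sequence 3 of the sequence listing. The gene encoding the Cas9 nuclease is as shown in positions 1721-5992 from the 5' end of sequence 3 of the sequence listing. Expression cassette A may specifically be as shown in sequence 3 of the sequence listing.
In any of the above, expression cassette B uses promoter B to drive expression of element B. From the 5' end, expression cassette B comprises in order promoter B, element B, and terminator B. Promoter B may specifically be the OsU3 promoter, whose nucleotide sequence is as shown in positions 1-337 from the 5' end of sequence 2 of the sequence listing. Terminator B may specifically be the OsU3 terminator, whose nucleotide sequence is as shown in positions 1263-1553 from the 5' end of sequence 2 of the sequence listing. When the target sequences are as shown in Table 1, expression cassette B may specifically be as shown in sequence 2 of the sequence listing.
When the target sequences are as shown in Table 1, any of the above recombinant vectors may specifically be the circular plasmid obtained by replacing positions 131-1633 from the 5' end of sequence 1 of the sequence listing with sequence 2 of the sequence listing, and replacing positions 1640-7832 from the 5' end of sequence 1 of the sequence listing with sequence 3 of the sequence listing.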
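The present invention also protects a method for site-directed knockout in a plant genome, comprising the following step: carrying out site-directed knockout of the plant genome using the gene editing system.
The present invention also protects a method for site-directed knockout in a plant genome, comprising the following step: introducing any of the above recombinant expression vectors into a target plant to achieve site-directed knockout of the plant genome.
The present invention also protects the use of the gene editing system, or of any of the above recombinant expression vectors, expression cassettes, recombinant cells, or recombinant bacteria, in site-directed knockout of plant genomes.
In any of the above, the plant may specifically be rice, and more specifically Nipponbare rice.
The inventors used eSpCas9(1.1) in combination with sgRNA(modified) in plants to increase knockout efficiency at target sites and reduce off-target effects. Specifically: SpCas9+sgRNA(modified) gave significantly higher knockout efficiency than SpCas9+sgRNA(wt), but a higher off-target rate; eSpCas9(1.1)+sgRNA(wt) gave an off-target rate lower than or comparable to that of SpCas9+sgRNA(wt), but significantly lower knockout efficiency at some targets; with eSpCas9(1.1) and sgRNA(modified) used together, knockout efficiency at most targets was significantly higher than with SpCas9+sgRNA(wt), rising to a level comparable to or slightly below that of SpCas9+sgRNA(modified), while the off-target rate was comparable to or somewhat lower than that of SpCas9+sgRNA(wt). An increase in knockout efficiency may be accompanied by an increase in off-target rate; eSpCas9(1.1)+sgRNA(modified) balances this trade-off, raising knockout efficiency without increasing the off-target rate, and even reducing it to some extent.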
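Description of the Drawings
Figure 1 shows the knockout efficiency at the targets for SpCas9+sgRNA(wt) and eSpCas9(1.1)+sgRNA(wt).
Figure 2 shows the knockout efficiency at the targets for eSpCas9(1.1)+sgRNA(modified) and eSpCas9(1.1)+sgRNA(wt).
Figure 3 shows the knockout efficiency at the targets for SpCas9+sgRNA(modified) and eSpCas9(1.1)+sgRNA(modified).
Figure 4 shows the results of off-target rate detection.
Detailed Description of the Embodiments
The following examples facilitate a better understanding of the present invention but do not limit it. Unless otherwise specified, the experimental methods in the following examples are conventional methods, and the test materials used were purchased from conventional biochemical reagent suppliers.
Nipponbare rice: reference: Liang Weihong, Wang Gaohua, Du Jingyao, et al. Effects of sodium nitroprusside and its photolysis products on seedling growth and expression of five hormone marker genes in Nipponbare rice [J]. Journal of Henan Normal University (Natural Science Edition), 2017(2): 48-52; available to the public from the Beijing Academy of Agriculture and Forestry Sciences.
The target genes, target names, and sequences used in the following examples are shown in Table 1.
Table 1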
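Example 1. eSpCas9(1.1)+sgRNA(modified) increases target knockout efficiency
I. Construction of genome editing vectors
1. SpCas9+sgRNA(wt): the artificially synthesized circular plasmid shown in sequence 1 of the sequence listing.
Sequence 1 of the sequence listing includes the following three expression cassettes:
Positions 131-1633 from the 5' end of sequence 1 are expression cassette I, in which positions 131-467 are the OsU3 promoter, positions 474-550 a pre-tRNA, positions 555-574 the OsHd1 target, positions 575-650 sgRNA(wt), positions 651-727 a pre-tRNA, positions 728-747 the OsGhd7 target, positions 748-823 sgRNA(wt), positions 824-900 a pre-tRNA, positions 901-920 the OsGL7 target, positions 921-996 sgRNA(wt), positions 997-1073 a pre-tRNA, positions 1074-1093 the GW8-T1 target, positions 1094-1169 sgRNA(wt), positions 1170-1246 a pre-tRNA, positions 1247-1266 the GW8-T2 target, positions 1267-1342 sgRNA(wt), and positions 1343-1633 the OsU3 terminator.
Positions 1640-7832 from the 5' end of sequence 1 are expression cassette II, in which positions 1640-3353 are the OsUBQ3 promoter, positions 3360-7631 the SpCas9 coding sequence, and positions 7638-7832 the CaMV 35S terminator.
Positions 7907-11173 from the 5' end of sequence 1 are expression cassette III, in which positions 7907-9899 are the ZmUbi1 promoter, positions 9906-10931 the hygromycin resistance gene, and positions 10958-11173 the CaMV 35S polyA.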
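2. SpCas9+sgRNA(modified): the circular plasmid obtained by replacing positions 131-1633 from the 5' end of sequence 1 of the sequence listing (expression cassette I) with sequence 2 of the sequence listing. SpCas9+sgRNA(modified) differs from SpCas9+sgRNA(wt) only in that sgRNA(modified) replaces sgRNA(wt).
3. eSpCas9(1.1)+sgRNA(wt): the circular plasmid obtained by replacing positions 1640-7832 from the 5' end of sequence 1 of the sequence listing (expression cassette II) with sequence 3 of the sequence listing. eSpCas9(1.1)+sgRNA(wt) differs from SpCas9+sgRNA(wt) only in that eSpCas9(1.1) replaces SpCas9.
4. eSpCas9(1.1)+sgRNA(modified): the circular plasmid obtained by replacing positions 131-1633 from the 5' end of sequence 1 of the sequence listing (expression cassette I) with sequence 2 of the sequence listing, and replacing positions 1640-7832 from the 5' end of sequence 1 of the sequence listing (expression cassette II) with sequence 3 of the sequence listing. eSpCas9(1.1)+sgRNA(modified) differs from SpCas9+sgRNA(wt) in that sgRNA(modified) replaces sgRNA(wt) and eSpCas9(1.1) replaces SpCas9.
In sequence 2 of the sequence listing, positions 1-337 from the 5' end are the OsU3 promoter, positions 344-420 a pre-tRNA, positions 425-444 the OsHd1 target, positions 445-530 sgRNA(modified), positions 531-607 a pre-tRNA, positions 608-627 the OsGhd7 target, positions 628-713 sgRNA(modified), positions 714-790 a pre-tRNA, positions 791-810 the OsGL7 target, positions 811-896 sgRNA(modified), positions 897-973 a pre-tRNA, positions 974-993 the GW8-T1 target, positions 994-1079 sgRNA(modified), positions 1080-1156 a pre-tRNA, positions 1157-1176 the GW8-T2 target, positions 1177-1262 sgRNA(modified), and positions 1263-1553 the OsU3 terminator.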
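A coordinate layout like the one given for sequence 2 is easy to mistype, so a quick consistency check may be useful. The sketch below simply recomputes feature lengths and reports any positions left unannotated between consecutive features; the function names are illustrative, and the coordinates are copied from the description of sequence 2 (1-based, inclusive).

```python
# (name, start, end) as listed for sequence 2; 1-based, inclusive.
FEATURES = [
    ("OsU3 promoter", 1, 337),
    ("pre-tRNA", 344, 420),
    ("OsHd1 target", 425, 444),
    ("sgRNA(modified)", 445, 530),
    ("pre-tRNA", 531, 607),
    ("OsGhd7 target", 608, 627),
    ("sgRNA(modified)", 628, 713),
    ("pre-tRNA", 714, 790),
    ("OsGL7 target", 791, 810),
    ("sgRNA(modified)", 811, 896),
    ("pre-tRNA", 897, 973),
    ("GW8-T1 target", 974, 993),
    ("sgRNA(modified)", 994, 1079),
    ("pre-tRNA", 1080, 1156),
    ("GW8-T2 target", 1157, 1176),
    ("sgRNA(modified)", 1177, 1262),
    ("OsU3 terminator", 1263, 1553),
]


def feature_lengths(features):
    """Map each feature name to the set of lengths it takes in the layout."""
    lengths = {}
    for name, start, end in features:
        lengths.setdefault(name, set()).add(end - start + 1)
    return lengths


def unannotated_gaps(features):
    """List (previous, next, gap) wherever consecutive features are not adjacent."""
    gaps = []
    for (prev, _, prev_end), (nxt, nxt_start, _) in zip(features, features[1:]):
        gap = nxt_start - prev_end - 1
        if gap != 0:
            gaps.append((prev, nxt, gap))
    return gaps


lengths = feature_lengths(FEATURES)
gaps = unannotated_gaps(FEATURES)
for name, sizes in lengths.items():
    print(name, sorted(sizes))
print(gaps)
```

With these coordinates every pre-tRNA is 77 nt, every target 20 nt, and every sgRNA(modified) 86 nt; the only unannotated positions are 338-343 and 421-424.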
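In sequence 3 of the sequence listing, positions 1-1714 from the 5' end are the OsUBQ3 promoter, positions 1721-5992 the eSpCas9(1.1) coding sequence, and positions 5999-6193 the CaMV 35S terminator.
eSpCas9(1.1) is as shown in sequence 4 of the sequence listing.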
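II. Gene editing in rice callus
Each of the four vectors constructed in step I was processed according to steps 1-5 below:
1. The vector was introduced into Agrobacterium LBA4404 (Weidi Biotechnology, Shanghai, cat#: AC1030) to obtain a recombinant strain, which was cultured in YEP medium to give an infection suspension with an OD600nm of 0.2.
2. Nipponbare rice seeds were selected, dehulled, sterilized and washed, then placed evenly on N6 medium and cultured in the dark at 28°C for 4-6 weeks to induce callus formation.
3. The rice callus obtained in step 2 was immersed in the infection suspension from step 1 for 10 min, then transferred onto a Petri dish containing two layers of filter paper and cultured in the dark at 25°C for 3 days (the medium was N6 medium containing 100 mg/L timentin). The callus was then cultured on selection medium (N6 medium containing 50 mg/L hygromycin, pH 5.7) in the dark at 28°C for 2 weeks and transferred to freshly prepared selection medium for a further 2 weeks of selection, yielding resistant callus.
4. Genomic DNA was extracted from the resistant callus obtained in step 3 and amplified by PCR with the primer pair consisting of primer F (5'-attatgtagcttgtgcgtttcg-3') and primer R (5'-gatgaagagcttatcgacgt-3'). The amplification products were subjected to agarose gel electrophoresis; a band of 1150 bp indicates that the corresponding callus is a positive resistant callus.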
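5. For each vector, DNA from 15 positive resistant calli obtained in step 4 was amplified by PCR with the OsGhd7 target primers (Ghd7-F and Ghd7-R), the OsGL7 target primers (GL7-F and GL7-R), the OsHd1 target primers (Hd1-F and Hd1-R), the GW8-T1 target primers (GW8-F and GW8-R), and the GW8-T2 target primers (the same as the GW8-T1 target primers). The amplification products were then sequenced, and any callus showing double peaks in the target region was scored as carrying a knockout mutation. The number of positive resistant calli carrying a knockout mutation was counted separately for each of the five targets; knockout efficiency is the proportion of the 15 positive resistant calli that carried a knockout mutation.
Ghd7-F: 5'-tgatcgagctcaagtgacctc-3',
Ghd7-R: 5'-aagaactggaactcgtgcacc-3';
GL7-F: 5'-acgaccaccccaagttcac-3',
GL7-R: 5'-aagatctggcgtcagcagc-3';
Hd1-F: 5'-ctcctctccaaagattccgac-3',
Hd1-R: 5'-cagctagtaatagatgaactcacgc-3';
GW8-F: 5'-catttcgttggctccacctc-3',
GW8-R: 5'-ccagagatgagaggctgcg-3'.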
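The 1150 bp band expected in step 4 can be checked in silico once the plasmid sequence is at hand. The sketch below is an assumption-laden illustration: `amplicon_length` is a hypothetical helper that only handles exact primer matches on a linear template, and the template here is a toy string built around the two primers (with filler sized so that the product is 1150 bp), not the actual plasmid.

```python
_COMPLEMENT = str.maketrans("acgt", "tgca")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a lowercase DNA string."""
    return seq.translate(_COMPLEMENT)[::-1]


def amplicon_length(template: str, fwd: str, rev: str):
    """Length of the product for exact primer matches, or None if a primer is absent.

    The forward primer is searched on the given strand; the reverse primer
    binds the opposite strand, so its reverse complement is searched downstream.
    """
    start = template.find(fwd)
    if start == -1:
        return None
    rev_site = reverse_complement(rev)
    end = template.find(rev_site, start + len(fwd))
    if end == -1:
        return None
    return end + len(rev_site) - start


PRIMER_F = "attatgtagcttgtgcgtttcg"
PRIMER_R = "gatgaagagcttatcgacgt"

# Toy template, NOT the plasmid: the filler length is chosen to give 1150 bp.
filler = "n" * (1150 - len(PRIMER_F) - len(PRIMER_R))
toy_template = "nnnn" + PRIMER_F + filler + reverse_complement(PRIMER_R) + "nnnn"

product = amplicon_length(toy_template, PRIMER_F, PRIMER_R)
print(product)
```

Run against sequence 1 itself, primer F appears to match within the OsUBQ3 promoter and the reverse complement of primer R within the Cas9 coding region, which would be consistent with the stated band size; that reading should be confirmed on the actual plasmid file.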
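The results are shown in Figures 1 to 3. Relative to SpCas9+sgRNA(wt), eSpCas9(1.1)+sgRNA(wt) gave lower or comparable knockout efficiency at most targets (Figure 1). Relative to eSpCas9(1.1)+sgRNA(wt), eSpCas9(1.1)+sgRNA(modified) gave significantly higher knockout efficiency at the targets (Figure 2); at the three targets OsGL7, GW8-T1, and GW8-T2, knockout efficiency rose to a level comparable to or slightly below that of SpCas9+sgRNA(modified) (Figure 3), and at most targets it was significantly higher than with SpCas9+sgRNA(wt) (Figures 1 and 3).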
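The knockout efficiencies compared here follow the definition in step 5: the fraction of the 15 positive resistant calli scored as mutated at a given target. A minimal sketch of that calculation follows; the function name and the example scores are hypothetical and are not data from the figures.

```python
def knockout_efficiency(mutated_flags):
    """Fraction of positive resistant calli scored as mutated at one target.

    mutated_flags: one boolean per callus, True if sequencing showed double
    peaks in the target region.
    """
    if not mutated_flags:
        raise ValueError("at least one callus is required")
    return sum(mutated_flags) / len(mutated_flags)


# Hypothetical scoring of 15 calli at a single target (not real data).
calls = [True] * 9 + [False] * 6
efficiency = knockout_efficiency(calls)
print(f"{efficiency:.1%}")  # 60.0%
```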
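Example 2. Comparative analysis of off-target effects
For each vector, 8 of the positive resistant calli obtained in step 4 of part II of Example 1 were selected (calli found in step 5 to carry a knockout mutation were chosen preferentially; if fewer than 8 such calli were available, positive resistant calli in which step 5 detected no knockout mutation were chosen at random to make up 8), and their DNA was pooled. The GW8-T2 target sequence was entered on the Cas-OFFinder website (http://www.rgenome.net/cas-offinder/) to predict the corresponding off-target sites in the genome; the four potential off-target sites with the highest predicted off-target likelihood were selected, and off-target rates were verified by next-generation sequencing. The potential off-target sites and their sequences are:
GW8-OT1: 5'-tctctctcttctgtcagttccgg-3',
GW8-OT2: 5'-tctctctctgctgtcaactccgg-3',
GW8-OT3: 5'-tctctctcttctgtcatccccgg-3',
GW8-OT4: 5'-tttctctctcctctcagctctgg-3'.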
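To see how close these four sites are to the intended target, each can be compared position by position with the GW8-T2 protospacer. The sketch below assumes the GW8-T2 target is the 20-nt sequence at positions 1247-1266 of sequence 1 (read from the sequence listing, since Table 1 is not reproduced here) and that each off-target entry is a 20-nt protospacer followed by a 3-nt PAM; the helper names are illustrative.

```python
# Assumption: GW8-T2 protospacer as read from sequence 1, positions 1247-1266.
TARGET = "tctctctcttctgtcagctc"

OFF_TARGETS = {
    "GW8-OT1": "tctctctcttctgtcagttccgg",
    "GW8-OT2": "tctctctctgctgtcaactccgg",
    "GW8-OT3": "tctctctcttctgtcatccccgg",
    "GW8-OT4": "tttctctctcctctcagctctgg",
}


def split_site(site: str, spacer_len: int = 20):
    """Split a 23-nt site into (protospacer, PAM)."""
    return site[:spacer_len], site[spacer_len:]


def mismatch_positions(target: str, protospacer: str):
    """1-based positions (5'->3') at which the protospacer differs from the target."""
    if len(target) != len(protospacer):
        raise ValueError("sequences must have the same length")
    return [i + 1 for i, (a, b) in enumerate(zip(target, protospacer)) if a != b]


report = {}
for name, site in OFF_TARGETS.items():
    protospacer, pam = split_site(site)
    report[name] = mismatch_positions(TARGET, protospacer)
    print(name, "PAM:", pam, "mismatches at:", report[name])
```

Under these assumptions the sites carry one to three mismatches, with GW8-OT1 differing from the target at a single position.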
Primer pairs were designed for the four off-target sites as follows:
OT1-F: 5'-agggacggcgattggcag-3',
OT1-R: 5'-ttcatctcattgtcattggagctca-3',
OT2-F: 5'-cttctcttcggctacaaagaaagtg-3',
OT2-R: 5'-ccgtaattctcttcctcacgtcg-3',
OT3-F: 5'-atgagggcgatcagataagcttc-3',
OT3-R: 5'-catggggatttggtggtggt-3',
OT4-F: 5'-caaatactttgttcttgttgcagcc-3',
OT4-R: 5'-aaacaagcatgtgaaatcttttgcc-3'.
Mutations were examined in each of these off-target site regions, and any mutation within the site region was counted as an off-target event.
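The counting rule above (any change within the site region counts) can be sketched as a simple read tally. This is a deliberately naive illustration: it assumes each read has already been trimmed to the site region, it treats every difference from the reference as a mutation without modelling sequencing error, and the read counts are invented for the example.

```python
def off_target_rate(reads, reference):
    """Percentage of reads whose site-region sequence differs from the reference."""
    if not reads:
        raise ValueError("no reads supplied")
    mutated = sum(1 for read in reads if read != reference)
    return 100.0 * mutated / len(reads)


# Reference: the GW8-OT1 site. Read counts below are hypothetical, not real data.
reference = "tctctctcttctgtcagttccgg"
reads = (
    [reference] * 196
    + ["tctctctcttctgtcagtccgg"] * 3   # 1-nt deletion
    + ["tctctctcttctgtcagtttccgg"]     # 1-nt insertion
)

rate = off_target_rate(reads, reference)
print(f"{rate:.2f}%")  # 2.00%
```

A real analysis would also need an untreated control to separate editing events from background sequencing and PCR errors.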
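The results are shown in Figure 4. For the three off-target sites GW8-OT2, GW8-OT3, and GW8-OT4, no significant off-target effect was detected with any of the four combinations. For the GW8-OT1 site, the off-target rate was 1.5% for SpCas9+sgRNA(wt), close to 0 for eSpCas9(1.1)+sgRNA(wt), 4.45% for SpCas9+sgRNA(modified), and 1.06% for eSpCas9(1.1)+sgRNA(modified); that is, the off-target rate of eSpCas9(1.1)+sgRNA(modified) was lower than that of SpCas9+sgRNA(modified) and comparable to or somewhat lower than that of SpCas9+sgRNA(wt).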
Sequence Listing
<110> Beijing Academy of Agriculture and Forestry Sciences
<120> A method for site-directed genome knockout
<160> 4
<170> SIPOSequenceListing 1.0
<210> 1
<211> 17579
<212> DNA
<213> Artificial Sequence
<400> 1
ggtggcaggatatattgtggtgtaaacatggcactagcctcaccgtcttcgcagacgagg60
ccgctaagtcgcagctacgctctcaacggcactgactaggtagtttaaacgtgcacttaa120
ttaaggtaccgaagcaacttaaagttatcaggcatgcatggatcttggaggaatcagatg180
tgcagtcagggaccatagcacaagacaggcgtcttctactggtgctaccagcaaatgctg240
gaagccgggaacactgggtacgttggaaaccacgtgatgtgaagaagtaagataaactgt300
aggagaaaagcatttcgtagtgggccatgaagcctttcaggacatgtattgcagtatggg360
ccggcccattacgcaattggacgacaacaaagactagtattagtaccacctcggctatcc420
acatagatcaaagctgatttaaaagagttgtgcagatgatccgtggcggatccaacaaag480
caccagtggtctagtggtagaatagtaccctgccacggtacagacccgggttcgattccc540
ggctggtgcaaaacctgagtgagcagcagcataggttttagagctagaaatagcaagtta600
aaataaggctagtccgttatcaacttgaaaaagtggcaccgagtcggtgcaacaaagcac660
cagtggtctagtggtagaatagtaccctgccacggtacagacccgggttcgattcccggc720
tggtgcagcacaggccacatccttctcgttttagagctagaaatagcaagttaaaataag780
gctagtccgttatcaacttgaaaaagtggcaccgagtcggtgcaacaaagcaccagtggt840
ctagtggtagaatagtaccctgccacggtacagacccgggttcgattcccggctggtgca900
actagagtggctagggatcagttttagagctagaaatagcaagttaaaataaggctagtc960
cgttatcaacttgaaaaagtggcaccgagtcggtgcaacaaagcaccagtggtctagtgg1020
tagaatagtaccctgccacggtacagacccgggttcgattcccggctggtgcagcaagat1080
gttctccgatggtgttttagagctagaaatagcaagttaaaataaggctagtccgttatc1140
aacttgaaaaagtggcaccgagtcggtgcaacaaagcaccagtggtctagtggtagaata1200
gtaccctgccacggtacagacccgggttcgattcccggctggtgcatctctctcttctgt1260
cagctcgttttagagctagaaatagcaagttaaaataaggctagtccgttatcaacttga1320
aaaagtggcaccgagtcggtgcttttttttttcgttttgcattgagttttctccgtcgca1380
tgtttgcagttttattttccgttttgcattgaaatttctccgtctcatgtttgcagcgtg1440
ttcaaaaagtacgcagctgtatttcacttatttacggcgccacattttcatgccgtttgt1500
gccaactatcccgagctagtgaatacagcttggcttcacacaacactggtgacccgctga1560
cctgctcgtacctcgtaccgtcgtacggcacagcatttggaattaaagggtgtgatcgat1620
actgcttgctgctaagcttacaaattcgggtcaaggcggaagccagcgcgccaccccacg1680
tcagcaaatacggaggcgcggggttgacggcgtcacccggtcctaacggcgaccaacaaa1740
ccagccagaagaaattacagtaaaaaaaaagtaaattgcactttgatccaccttttatta1800
cctaagtctcaatttggatcacccttaaacctatcttttcaatttgggccgggttgtggt1860
ttggactaccatgaacaacttttcgtcatgtctaacttccctttcagcaaacatatgaac1920
catatatagaggagatcggccgtatactagagctgatgtgtttaaggtcgttgattgcac1980
gagaaaaaaaaatccaaatcgcaacaatagcaaatttatctggttcaaagtgaaaagata2040
tgtttaaaggtagtccaaagtaaaacttatagataataaaatgtggtccaaagcgtaatt2100
cactcaaaaaaaatcaacgagacgtgtaccaaacggagacaaacggcatcttctcgaaat2160
ttcccaaccgctcgctcgcccgcctcgtcttcccggaaaccgcggtggtttcagcgtggc2220
ggattctccaagcagacggagacgtcacggcacgggactcctcccaccacccaaccgcca2280
taaataccagccccctcatctcctctcctcgcatcagctccacccccgaaaaatttctcc2340
ccaatctcgcgaggctctcgtcgtcgaatcgaatcctctcgcgtcctcaaggtacgctgc2400
ttctcctctcctcgcttcgtttcgattcgatttcggacgggtgaggttgttttgttgcta2460
gatccgattggtggttagggttgtcgatgtgattatcgtgagatgtttaggggttgtaga2520
tctgatggttgtgatttgggcacggttggttcgataggtggaatcgtggttaggttttgg2580
gattggatgttggttctgatgattggggggaatttttacggttagatgaattgttggatg2640
attcgattggggaaatcggtgtagatctgttggggaattgtggaactagtcatgcctgag2700
tgattggtgcgatttgtagcgtgttccatcttgtaggccttgttgcgagcatgttcagat2760
ctactgttccgctcttgattgagttattggtgccatgggttggtgcaaacacaggcttta2820
atatgttatatctgttttgtgtttgatgtagatctgtagggtagttcttcttagacatgg2880
ttcaattatgtagcttgtgcgtttcgatttgatttcatatgttcacagattagataatga2940
tgaactcttttaattaattgtcaatggtaaataggaagtcttgtcgctatatctgtcata3000
atgatctcatgttactatctgccagtaatttatgctaagaactatattagaatatcatgt3060
tacaatctgtagtaatatcatgttacaatctgtagttcatctatataatctattgtggta3120
atttctttttactatctgtgtgaagattattgccactagttcattctacttatttctgaa3180
gttcaggatacgtgtgctgttactacctatctgaatacatgtgtgatgtgcctgttacta3240
tctttttgaatacatgtatgttctgttggaatatgtttgctgtttgatccgttgttgtgt3300
ccttaatcttgtgctagttcttaccctatctgtttggtgattatttcttgcagtacgtaa3360
tggactacaaggaccacgacggggattacaaagaccacgacatagactacaaggatgacg3420
atgacaaaatggcaccgaagaaaaaaaggaaggtcggaatccatggcgttccagctgccg3480
ataagaaatattccatcggactcgacattggcacgaatagcgtcggatgggctgttatta3540
ctgatgagtacaaagttccgtctaagaagttcaaggtgctgggcaacacagaccgccaca3600
gcataaagaaaaatctcatcggtgcactccttttcgatagtggggagactgcagaagcga3660
caagattgaaaaggactgcgagaaggcgctatacacggcgtaagaatagaatctgctacc3720
ttcaggagattttctctaacgaaatggctaaggtcgatgacagtttctttcatagacttg3780
aggaatcgttcttggttgaggaggataagaaacatgagaggcacccgatatttggaaaca3840
tcgtggatgaggtcgcatatcatgaaaagtaccccacaatctaccacctgagaaagaaac3900
tcgttgattccaccgacaaagcggatttgagactcatctacctcgctcttgcccatatga3960
taaagttccgcggacactttctgatcgagggcgacctcaaccctgataatagcgacgtcg4020
ataagctcttcatccagttggttcaaacctacaatcagctctttgaggaaaacccaatta4080
atgctagtggagtggatgcaaaagcgatactgtcggccagactctccaagagcagaaggt4140
tggagaacctgatcgctcaacttcctggagaaaagaaaaacggtctttttgggaatttga4200
ttgccttgtctctgggcctcacaccaaacttcaagtcaaattttgacctcgctgaggatg4260
ccaaacttcagttgtctaaggatacctatgatgacgatcttgacaatttgctggcacaaa4320
ttggcgaccagtacgcggatctgttcctcgcagcgaagaatctgagtgatgctattctcc4380
tttcggacatactcagggttaacactgagatcacaaaagcacctttgagtgcgtcgatga4440
ttaagcgctatgatgaacatcaccaagacctcactttgctgaaggcccttgtgcggcagc4500
aattgccagagaagtacaaagaaatcttctttgaccaatctaagaacggatacgctggct4560
atattgatggaggagcttctcaggaggaattctataagtttatcaaacctatacttgaga4620
agatggatggtacagaggaactccttgttaaattgaacagagaagatttgctgcgcaagc4680
aacggacctttgacaacggatcaattccgcatcagatacacctcggcgagcttcatgcca4740
tccttcgccggcaggaagatttctacccctttttgaaggacaaccgcgagaagatagaaa4800
aaatccttacgttccggattccttactatgtgggtccattggcaagggggaattcccgct4860
ttgcgtggatgactcggaaaagcgaggaaactatcacaccgtggaacttcgaggaagttg4920
tggacaagggagcttctgcccaatcattcattgagaggatgactaacttcgataagaacc4980
tgccgaacgagaaagttctccccaagcactccctcctttacgagtatttcaccgtgtata5040
acgaacttacgaaggttaaatacgtgactgagggtatgaggaagccagcattcttgagcg5100
gggaacaaaagaaagcgattgttgatttgctgtttaaaactaatcgcaaggtgacagtca5160
agcagctcaaagaggattatttcaagaaaattgaatgtttcgactctgtggagatatcag5220
gagtcgaagataggtttaacgcttcccttggcacataccatgacctccttaagatcatta5280
aggacaaagatttcctggataacgaggaaaatgaggacatcctcgaagatattgttctta5340
ccttgacgctgtttgaggatcgcgaaatgatcgaggaacggcttaagacgtatgctcact5400
tgttcgacgataaggttatgaagcagctcaagcgtagaaggtacactggatggggccgtc5460
tgtctagaaagctcatcaacggaatacgtgataaacaaagtggcaagacaattttggatt5520
ttctgaagtcggacggattcgccaacagaaattttatgcagctgattcatgacgatagtc5580
tcaccttcaaagaggacatacagaaggctcaagtgagtggtcaaggggattcgctgcatg5640
aacacatcgcaaacctcgcgggttcaccggccataaagaaaggaatccttcaaactgtta5700
aggtcgttgatgagttggttaaagtgatgggtaggcacaagcccgaaaacatagtgatcg5760
agatggctcgcgaaaatcagactacacaaaaagggcagaagaactctcgcgagcggatga5820
aaaggattgaggaaggaatcaaggaactgggctcacagattctcaaagagcatccagtcg5880
aaaacacacagctgcaaaatgagaagctctatctttactatctccaaaatggccgggaca5940
tgtatgttgatcaggagcttgacatcaaccgtttgtccgactatgatgtggaccacattg6000
tcccgcaatctttccttaaggacgattcaatcgataataaggtgttgacccggagcgata6060
aaaaccgtggaaagtctgacaatgtcccttcagaggaagtggttaagaagatgaagaact6120
actggagacaattgctgaatgcaaaactgatcacacagagaaagttcgacaacctcacca6180
aagcagagagaggtgggctcagtgaacttgataaagcgggcttcattaagcgtcagctcg6240
ttgagactagacagatcacgaagcatgtcgcgcagattttggattcgcggatgaacacga6300
agtacgacgagaatgataaactgatacgtgaagtcaaggttatcactcttaagtccaaat6360
tggtgagcgatttcagaaaggacttccaattctataaggtcagggagatcaacaattatc6420
atcacgctcacgatgcctaccttaatgctgttgtggggaccgcccttattaagaaatacc6480
ctaaattggagtctgaattcgtttacggggattataaggtctacgacgttaggaaaatga6540
tagctaagagtgagcaggagatcggtaaagcaactgcgaagtatttcttttactcgaaca6600
tcatgaatttctttaagaccgagataacgctggcaaatggcgaaattagaaagaggcctc6660
tcatagagactaacggtgagacaggggaaatcgtctgggataagggtagggactttgcga6720
cagtgcgcaaggtcctctctatgccgcaagttaatattgtgaagaaaaccgaggtgcaga6780
cgggaggcttctccaaggaaagcatacttcccaaacggaactctgataagttgatcgctc6840
gtaagaaagattgggaccctaagaaatatggtgggttcgattccccaactgttgcttaca6900
gcgtgctggtcgttgccaaggtcgagaagggtaaatccaagaaactcaaaagcgttaagg6960
aactccttgggattactatcatggagagatcttcattcgaaaagaatcctatcgactttc7020
ttgaggccaaaggatataaggaagttaagaaagatctgataatcaaactcccaaagtact7080
cattgtttgagctggaaaacggcaggaagcgcatgcttgcttccgccggagagttgcaga7140
aagggaacgagttggctctgccttctaagtatgttaacttcctctatcttgcctctcatt7200
acgagaagctcaaaggctcaccagaggacaacgaacagaaacaactttttgtcgagcaac7260
ataagcactatttggatgagattatagaacagatcagtgaattctcgaaaagggttatcc7320
ttgcagatgcgaatcttgacaaggtgttgtctgcatacaacaaacatagagataagccga7380
tcagggagcaagcggaaaatatcattcacctcttcactcttacaaacttgggtgctcccg7440
ctgccttcaagtattttgataccacgattgaccggaaacgttacacctcaacgaaggagg7500
tgctggatgccaccctcatccaccaatctattaccggactctacgagactagaatcgatc7560
tctcacagctcggcggggataaaagaccagcagcgacgaaaaaggcaggacaggctaaga7620
agaagaaatagactagtctgaaatcaccagtctctctctacaaatctatctctctctata7680
ataatgtgtgagtagttcccagataagggaattagggttcttatagggtttcgctcatgt7740
gttgagcatataagaaacccttagtatgtatttgtatttgtaaaatacttctatcaataa7800
aatttctaattcctaaaaccaaaatccagtggggcgcccgacctgtactcgcgaaggtta7860
acttacagagagtgtccgggcgcgcctggtggatcgtccgcctaggctgcagtgcagcgt7920
gacccggtcgtgcccctctctagagataatgagcattgcatgtctaagttataaaaaatt7980
accacatattttttttgtcacacttgtttgaagtgcagtttatctatctttatacatata8040
tttaaactttactctacgaataatataatctatagtactacaataatatcagtgttttag8100
agaatcatataaatgaacagttagacatggtctaaaggacaattgagtattttgacaaca8160
ggactctacagttttatctttttagtgtgcatgtgttctcctttttttttgcaaatagct8220
tcacctatataatacttcatccattttattagtacatccatttagggtttagggttaatg8280
gtttttatagactaatttttttagtacatctattttattctattttagcctctaaattaa8340
gaaaactaaaactctattttagtttttttatttaataatttagatataaaatagaataaa8400
ataaagtgactaaaaattaaacaaataccctttaagaaattaaaaaaactaaggaaacat8460
ttttcttgtttcgagtagataatgccagcctgttaaacgccgtcgacgagtctaacggac8520
accaaccagcgaaccagcagcgtcgcgtcgggccaagcgaagcagacggcacggcatctc8580
tgtcgctgcctctggacccctctcgagagttccgctccaccgttggacttgctccgctgt8640
cggcatccagaaattgcgtggcggagcggcagacgtgagccggcacggcaggcggcctcc8700
tcctcctctcacggcaccggcagctacgggggattcctttcccaccgctccttcgctttc8760
ccttcctcgcccgccgtaataaatagacaccccctccacaccctctttccccaacctcgt8820
gttgttcggagcgcacacacacacaaccagatctcccccaaatccacccgtcggcacctc8880
cgcttcaaggtacgccgctcgtcctccccccccccccctctctaccttctctagatcggc8940
gttccggtccatggttagggcccggtagttctacttctgttcatgtttgtgttagatccg9000
tgtttgtgttagatccgtgctgctagcgttcgtacacggatgcgacctgtacgtcagaca9060
cgttctgattgctaacttgccagtgtttctctttggggaatcctgggatggctctagccg9120
ttccgcagacgggatcgatttcatgattttttttgtttcgttgcatagggtttggtttgc9180
ccttttcctttatttcaatatatgccgtgcacttgtttgtcgggtcatcttttcatgctt9240
ttttttgtcttggttgtgatgatgtggtctggttgggcggtcgttctagatcggagtaga9300
attctgtttcaaactacctggtggatttattaattttggatctgtatgtgtgtgccatac9360
atattcatagttacgaattgaagatgatggatggaaatatcgatctaggataggtataca9420
tgttgatgcgggttttactgatgcatatacagagatgctttttgttcgcttggttgtgat9480
gatgtggtgtggttgggcggtcgttcattcgttctagatcggagtagaatactgtttcaa9540
actacctggtgtatttattaattttggaactgtatgtgtgtgtcatacatcttcatagtt9600
acgagtttaagatggatggaaatatcgatctaggataggtatacatgttgatgtgggttt9660
tactgatgcatatacatgatggcatatgcagcatctattcatatgctctaaccttgagta9720
cctatctattataataaacaagtatgttttataattattttgatcttgatatacttggat9780
gatggcatatgcagcagctatatgtggatttttttagccctgccttcatacgctatttat9840
ttgcttggtactgtttcttttgtcgatgctcaccctgttgtttggtgttacttctgcagg9900
agctcatgaaaaagcctgaactcaccgcgacgtctgtcgagaagtttctgatcgaaaagt9960
tcgacagcgtctccgacctgatgcagctctcggagggcgaagaatctcgtgctttcagct10020
tcgatgtaggagggcgtggatatgtcctgcgggtaaatagctgcgccgatggtttctaca10080
aagatcgttatgtttatcggcactttgcatcggccgcgctcccgattccggaagtgcttg10140
acattggggagtttagcgagagcctgacctattgcatctcccgccgttcacagggtgtca10200
cgttgcaagacctgcctgaaaccgaactgcccgctgttctacaaccggtcgcggaggcta10260
tggatgcgatcgctgcggccgatcttagccagacgagcgggttcggcccattcggaccgc10320
aaggaatcggtcaatacactacatggcgtgatttcatatgcgcgattgctgatccccatg10380
tgtatcactggcaaactgtgatggacgacaccgtcagtgcgtccgtcgcgcaggctctcg10440
atgagctgatgctttgggccgaggactgccccgaagtccggcacctcgtgcacgcggatt10500
tcggctccaacaatgtcctgacggacaatggccgcataacagcggtcattgactggagcg10560
aggcgatgttcggggattcccaatacgaggtcgccaacatcttcttctggaggccgtggt10620
tggcttgtatggagcagcagacgcgctacttcgagcggaggcatccggagcttgcaggat10680
cgccacgactccgggcgtatatgctccgcattggtcttgaccaactctatcagagcttgg10740
ttgacggcaatttcgatgatgcagcttgggcgcagggtcgatgcgacgcaatcgtccgat10800
ccggagccgggactgtcgggcgtacacaaatcgcccgcagaagcgcggccgtctggaccg10860
atggctgtgtagaagtactcgccgatagtggaaaccgacgccccagcactcgtccgaggg10920
caaagaaatagagtagatgccgaccgggatctgtcgatcgacaagctcgagtttctccat10980
aataatgtgtgagtagttcccagataagggaattagggttcctatagggtttcgctcatg11040
tgttgagcatataagaaacccttagtatgtatttgtatttgtaaaatacttctatcaata11100
aaatttctaattcctaaaaccaaaatccagtactaaaatccagatcccccgaattaattc11160
ggcgttaattcagcctgcaggacgcgtttaattaagtgcacgcggccgcctacttagtca11220
agagcctcgcacgcgactgtcacgcggccaggatcgcctcgtgagcctcgcaatctgtac11280
ctagtgtttaaactatcagtgtttgacaggatatattggcgggtaaacctaagagaaaag11340
agcgtttattagaataacggatatttaaaagggcgtgaaaaggtttatccgttcgtccat11400
ttgtatgtgcatgccaaccacagggttcccctcgggatcaaagtactttgatccaacccc11460
tccgctgctatagtgcagtcggcttctgacgttcagtgcagccgtcttctgaaaacgaca11520
tgtcgcacaagtcctaagttacgcgacaggctgccgccctgcccttttcctggcgttttc11580
ttgtcgcgtgttttagtcgcataaagtagaatacttgcgactagaaccggagacattacg11640
ccatgaacaagagcgccgccgctggcctgctgggctatgcccgcgtcagcaccgacgacc11700
aggacttgaccaaccaacgggccgaactgcacgcggccggctgcaccaagctgttttccg11760
agaagatcaccggcaccaggcgcgaccgcccggagctggccaggatgcttgaccacctac11820
gccctggcgacgttgtgacagtgaccaggctagaccgcctggcccgcagcacccgcgacc11880
tactggacattgccgagcgcatccaggaggccggcgcgggcctgcgtagcctggcagagc11940
cgtgggccgacaccaccacgccggccggccgcatggtgttgaccgtgttcgccggcattg12000
ccgagttcgagcgttccctaatcatcgaccgcacccggagcgggcgcgaggccgccaagg12060
cccgaggcgtgaagtttggcccccgccctaccctcaccccggcacagatcgcgcacgccc12120
gcgagctgatcgaccaggaaggccgcaccgtgaaagaggcggctgcactgcttggcgtgc12180
atcgctcgaccctgtaccgcgcacttgagcgcagcgaggaagtgacgcccaccgaggcca12240
ggcggcgcggtgccttccgtgaggacgcattgaccgaggccgacgccctggcggccgccg12300
agaatgaacgccaagaggaacaagcatgaaaccgcaccaggacggccaggacgaaccgtt12360
tttcattaccgaagagatcgaggcggagatgatcgcggccgggtacgtgttcgagccgcc12420
cgcgcacgtctcaaccgtgcggctgcatgaaatcctggccggtttgtctgatgccaagct12480
ggcggcctggccggccagcttggccgctgaagaaaccgagcgccgccgtctaaaaaggtg12540
atgtgtatttgagtaaaacagcttgcgtcatgcggtcgctgcgtatatgatgcgatgagt12600
aaataaacaaatacgcaaggggaacgcatgaaggttatcgctgtacttaaccagaaaggc12660
gggtcaggcaagacgaccatcgcaacccatctagcccgcgccctgcaactcgccggggcc12720
gatgttctgttagtcgattccgatccccagggcagtgcccgcgattgggcggccgtgcgg12780
gaagatcaaccgctaaccgttgtcggcatcgaccgcccgacgattgaccgcgacgtgaag12840
gccatcggccggcgcgacttcgtagtgatcgacggagcgccccaggcggcggacttggct12900
gtgtccgcgatcaaggcagccgacttcgtgctgattccggtgcagccaagcccttacgac12960
atatgggccaccgccgacctggtggagctggttaagcagcgcattgaggtcacggatgga13020
aggctacaagcggcctttgtcgtgtcgcgggcgatcaaaggcacgcgcatcggcggtgag13080
gttgccgaggcgctggccgggtacgagctgcccattcttgagtcccgtatcacgcagcgc13140
gtgagctacccaggcactgccgccgccggcacaaccgttcttgaatcagaacccgagggc13200
gacgctgcccgcgaggtccaggcgctggccgctgaaattaaatcaaaactcatttgagtt13260
aatgaggtaaagagaaaatgagcaaaagcacaaacacgctaagtgccggccgtccgagcg13320
cacgcagcagcaaggctgcaacgttggccagcctggcagacacgccagccatgaagcggg13380
tcaactttcagttgccggcggaggatcacaccaagctgaagatgtacgcggtacgccaag13440
gcaagaccattaccgagctgctatctgaatacatcgcgcagctaccagagtaaatgagca13500
aatgaataaatgagtagatgaattttagcggctaaaggaggcggcatggaaaatcaagaa13560
caaccaggcaccgacgccgtggaatgccccatgtgtggaggaacgggcggttggccaggc13620
gtaagcggctgggttgtctgccggccctgcaatggcactggaacccccaagcccgaggaa13680
tcggcgtgacggtcgcaaaccatccggcccggtacaaatcggcgcggcgctgggtgatga13740
cctggtggagaagttgaaggccgcgcaggccgcccagcggcaacgcatcgaggcagaagc13800
acgccccggtgaatcgtggcaagcggccgctgatcgaatccgcaaagaatcccggcaacc13860
gccggcagccggtgcgccgtcgattaggaagccgcccaagggcgacgagcaaccagattt13920
tttcgttccgatgctctatgacgtgggcacccgcgatagtcgcagcatcatggacgtggc13980
cgttttccgtctgtcgaagcgtgaccgacgagctggcgaggtgatccgctacgagcttcc14040
agacgggcacgtagaggtttccgcagggccggccggcatggccagtgtgtgggattacga14100
cctggtactgatggcggtttcccatctaaccgaatccatgaaccgataccgggaagggaa14160
gggagacaagcccggccgcgtgttccgtccacacgttgcggacgtactcaagttctgccg14220
gcgagccgatggcggaaagcagaaagacgacctggtagaaacctgcattcggttaaacac14280
cacgcacgttgccatgcagcgtacgaagaaggccaagaacggccgcctggtgacggtatc14340
cgagggtgaagccttgattagccgctacaagatcgtaaagagcgaaaccgggcggccgga14400
gtacatcgagatcgagctagctgattggatgtaccgcgagatcacagaaggcaagaaccc14460
ggacgtgctgacggttcaccccgattactttttgatcgatcccggcatcggccgttttct14520
ctaccgcctggcacgccgcgccgcaggcaaggcagaagccagatggttgttcaagacgat14580
ctacgaacgcagtggcagcgccggagagttcaagaagttctgtttcaccgtgcgcaagct14640
gatcgggtcaaatgacctgccggagtacgatttgaaggaggaggcggggcaggctggccc14700
gatcctagtcatgcgctaccgcaacctgatcgagggcgaagcatccgccggttcctaatg14760
tacggagcagatgctagggcaaattgccctagcaggggaaaaaggtcgaaaaggtctctt14820
tcctgtggatagcacgtacattgggaacccaaagccgtacattgggaaccggaacccgta14880
cattgggaacccaaagccgtacattgggaaccggtcacacatgtaagtgactgatataaa14940
agagaaaaaaggcgatttttccgcctaaaactctttaaaacttattaaaactcttaaaac15000
ccgcctggcctgtgcataactgtctggccagcgcacagccgaagagctgcaaaaagcgcc15060
tacccttcggtcgctgcgctccctacgccccgccgcttcgcgtcggcctatcgcggccgc15120
tggccgctcaaaaatggctggcctacggccaggcaatctaccagggcgcggacaagccgc15180
gccgtcgccactcgaccgccggcgcccacatcaaggcaccctgcctcgcgcgtttcggtg15240
atgacggtgaaaacctctgacacatgcagctcccggagacggtcacagcttgtctgtaag15300
cggatgccgggagcagacaagcccgtcagggcgcgtcagcgggtgttggcgggtgtcggg15360
gcgcagccatgacccagtcacgtagcgatagcggagtgtatactggcttaactatgcggc15420
atcagagcagattgtactgagagtgcaccatatgcggtgtgaaataccgcacagatgcgt15480
aaggagaaaataccgcatcaggcgctcttccgcttcctcgctcactgactcgctgcgctc15540
ggtcgttcggctgcggcgagcggtatcagctcactcaaaggcggtaatacggttatccac15600
agaatcaggggataacgcaggaaagaacatgtgagcaaaaggccagcaaaaggccaggaa15660
ccgtaaaaaggccgcgttgctggcgtttttccataggctccgcccccctgacgagcatca15720
caaaaatcgacgctcaagtcagaggtggcgaaacccgacaggactataaagataccaggc15780
gtttccccctggaagctccctcgtgcgctctcctgttccgaccctgccgcttaccggata15840
cctgtccgcctttctcccttcgggaagcgtggcgctttctcatagctcacgctgtaggta15900
tctcagttcggtgtaggtcgttcgctccaagctgggctgtgtgcacgaaccccccgttca15960
gcccgaccgctgcgccttatccggtaactatcgtcttgagtccaacccggtaagacacga16020
cttatcgccactggcagcagccactggtaacaggattagcagagcgaggtatgtaggcgg16080
tgctacagagttcttgaagtggtggcctaactacggctacactagaaggacagtatttgg16140
tatctgcgctctgctgaagccagttaccttcggaaaaagagttggtagctcttgatccgg16200
caaacaaaccaccgctggtagcggtggtttttttgtttgcaagcagcagattacgcgcag16260
aaaaaaaggatctcaagaagatcctttgatcttttctacggggtctgacgctcagtggaa16320
cgaaaactcacgttaagggattttggtcatgcattctaggtactaaaacaattcatccag16380
taaaatataatattttattttctcccaatcaggcttgatccccagtaagtcaaaaaatag16440
ctcgacatactgttcttccccgatatcctccctgatcgaccggacgcagaaggcaatgtc16500
ataccacttgtccgccctgccgcttctcccaagatcaataaagccacttactttgccatc16560
tttcacaaagatgttgctgtctcccaggtcgccgtgggaaaagacaagttcctcttcggg16620
cttttccgtctttaaaaaatcatacagctcgcgcggatctttaaatggagtgtcttcttc16680
ccagttttcgcaatccacatcggccagatcgttattcagtaagtaatccaattcggctaa16740
gcggctgtctaagctattcgtatagggacaatccgatatgtcgatggagtgaaagagcct16800
gatgcactccgcatacagctcgataatcttttcagggctttgttcatcttcatactcttc16860
cgagcaaaggacgccatcggcctcactcatgagcagattgctccagccatcatgccgttc16920
aaagtgcaggacctttggaacaggcagctttccttccagccatagcatcatgtccttttc16980
ccgttccacatcataggtggtccctttataccggctgtccgtcatttttaaatataggtt17040
ttcattttctcccaccagcttatataccttagcaggagacattccttccgtatcttttac17100
gcagcggtatttttcgatcagttttttcaattccggtgatattctcattttagccattta17160
ttatttccttcctcttttctacagtatttaaagataccccaagaagctaattataacaag17220
acgaactccaattcactgttccttgcattctaaaaccttaaataccagaaaacagctttt17280
tcaaagttgttttcaaagttggcgtataacatagtatcgacggagccgattttgaaaccg17340
cggtgatcacaggcagcaacgctctgtcatcgttacaatcaacatgctaccctccgcgag17400
atcatccgtgtttcaaacccggcagcttagttgccgttcttccgaatagcatcggtaaca17460
tgagcaaagtctgccgccttacaacggctctcccgctgacgccgtcccggactgatgggc17520
tgcctgtatcgagtggtgattttgtgccgagctgccggtcggggagctgttggctggct17579
<210> 2
<211> 1553
<212> DNA
<213> Artificial Sequence
<400> 2
gaagcaacttaaagttatcaggcatgcatggatcttggaggaatcagatgtgcagtcagg60
gaccatagcacaagacaggcgtcttctactggtgctaccagcaaatgctggaagccggga120
acactgggtacgttggaaaccacgtgatgtgaagaagtaagataaactgtaggagaaaag180
catttcgtagtgggccatgaagcctttcaggacatgtattgcagtatgggccggcccatt240
acgcaattggacgacaacaaagactagtattagtaccacctcggctatccacatagatca300
aagctgatttaaaagagttgtgcagatgatccgtggcggatccaacaaagcaccagtggt360
ctagtggtagaatagtaccctgccacggtacagacccgggttcgattcccggctggtgca420
aaacctgagtgagcagcagcataggtttcagagctatgctggaaacagcatagcaagttg480
aaataaggctagtccgttatcaacttgaaaaagtggcaccgagtcggtgcaacaaagcac540
cagtggtctagtggtagaatagtaccctgccacggtacagacccgggttcgattcccggc600
tggtgcagcacaggccacatccttctcgtttcagagctatgctggaaacagcatagcaag660
ttgaaataaggctagtccgttatcaacttgaaaaagtggcaccgagtcggtgcaacaaag720
caccagtggtctagtggtagaatagtaccctgccacggtacagacccgggttcgattccc780
ggctggtgcaactagagtggctagggatcagtttcagagctatgctggaaacagcatagc840
aagttgaaataaggctagtccgttatcaacttgaaaaagtggcaccgagtcggtgcaaca900
aagcaccagtggtctagtggtagaatagtaccctgccacggtacagacccgggttcgatt960
cccggctggtgcagcaagatgttctccgatggtgtttcagagctatgctggaaacagcat1020
agcaagttgaaataaggctagtccgttatcaacttgaaaaagtggcaccgagtcggtgca1080
acaaagcaccagtggtctagtggtagaatagtaccctgccacggtacagacccgggttcg1140
attcccggctggtgcatctctctcttctgtcagctcgtttcagagctatgctggaaacag1200
catagcaagttgaaataaggctagtccgttatcaacttgaaaaagtggcaccgagtcggt1260
gcttttttttttcgttttgcattgagttttctccgtcgcatgtttgcagttttattttcc1320
gttttgcattgaaatttctccgtctcatgtttgcagcgtgttcaaaaagtacgcagctgt1380
atttcacttatttacggcgccacattttcatgccgtttgtgccaactatcccgagctagt1440
gaatacagcttggcttcacacaacactggtgacccgctgacctgctcgtacctcgtaccg1500
tcgtacggcacagcatttggaattaaagggtgtgatcgatactgcttgctgct1553
<210> 3
<211> 6193
<212> DNA
<213> Artificial Sequence
<400> 3
acaaattcgggtcaaggcggaagccagcgcgccaccccacgtcagcaaatacggaggcgc60
ggggttgacggcgtcacccggtcctaacggcgaccaacaaaccagccagaagaaattaca120
gtaaaaaaaaagtaaattgcactttgatccaccttttattacctaagtctcaatttggat180
cacccttaaacctatcttttcaatttgggccgggttgtggtttggactaccatgaacaac240
ttttcgtcatgtctaacttccctttcagcaaacatatgaaccatatatagaggagatcgg300
ccgtatactagagctgatgtgtttaaggtcgttgattgcacgagaaaaaaaaatccaaat360
cgcaacaatagcaaatttatctggttcaaagtgaaaagatatgtttaaaggtagtccaaa420
gtaaaacttatagataataaaatgtggtccaaagcgtaattcactcaaaaaaaatcaacg480
agacgtgtaccaaacggagacaaacggcatcttctcgaaatttcccaaccgctcgctcgc540
ccgcctcgtcttcccggaaaccgcggtggtttcagcgtggcggattctccaagcagacgg600
agacgtcacggcacgggactcctcccaccacccaaccgccataaataccagccccctcat660
ctcctctcctcgcatcagctccacccccgaaaaatttctccccaatctcgcgaggctctc720
gtcgtcgaatcgaatcctctcgcgtcctcaaggtacgctgcttctcctctcctcgcttcg780
tttcgattcgatttcggacgggtgaggttgttttgttgctagatccgattggtggttagg840
gttgtcgatgtgattatcgtgagatgtttaggggttgtagatctgatggttgtgatttgg900
gcacggttggttcgataggtggaatcgtggttaggttttgggattggatgttggttctga960
tgattggggggaatttttacggttagatgaattgttggatgattcgattggggaaatcgg1020
tgtagatctgttggggaattgtggaactagtcatgcctgagtgattggtgcgatttgtag1080
cgtgttccatcttgtaggccttgttgcgagcatgttcagatctactgttccgctcttgat1140
tgagttattggtgccatgggttggtgcaaacacaggctttaatatgttatatctgttttg1200
tgtttgatgtagatctgtagggtagttcttcttagacatggttcaattatgtagcttgtg1260
cgtttcgatttgatttcatatgttcacagattagataatgatgaactcttttaattaatt1320
gtcaatggtaaataggaagtcttgtcgctatatctgtcataatgatctcatgttactatc1380
tgccagtaatttatgctaagaactatattagaatatcatgttacaatctgtagtaatatc1440
atgttacaatctgtagttcatctatataatctattgtggtaatttctttttactatctgt1500
gtgaagattattgccactagttcattctacttatttctgaagttcaggatacgtgtgctg1560
ttactacctatctgaatacatgtgtgatgtgcctgttactatctttttgaatacatgtat1620
gttctgttggaatatgtttgctgtttgatccgttgttgtgtccttaatcttgtgctagtt1680
cttaccctatctgtttggtgattatttcttgcagtacgtaatggactacaaggaccacga1740
cggggattacaaagaccacgacatagactacaaggatgacgatgacaaaatggcaccgaa1800
gaaaaaaaggaaggtcggaatccatggcgttccagctgccgataagaaatattccatcgg1860
actcgacattggcacgaatagcgtcggatgggctgttattactgatgagtacaaagttcc1920
gtctaagaagttcaaggtgctgggcaacacagaccgccacagcataaagaaaaatctcat1980
cggtgcactccttttcgatagtggggagactgcagaagcgacaagattgaaaaggactgc2040
gagaaggcgctatacacggcgtaagaatagaatctgctaccttcaggagattttctctaa2100
cgaaatggctaaggtcgatgacagtttctttcatagacttgaggaatcgttcttggttga2160
ggaggataagaaacatgagaggcacccgatatttggaaacatcgtggatgaggtcgcata2220
tcatgaaaagtaccccacaatctaccacctgagaaagaaactcgttgattccaccgacaa2280
agcggatttgagactcatctacctcgctcttgcccatatgataaagttccgcggacactt2340
tctgatcgagggcgacctcaaccctgataatagcgacgtcgataagctcttcatccagtt2400
ggttcaaacctacaatcagctctttgaggaaaacccaattaatgctagtggagtggatgc2460
aaaagcgatactgtcggccagactctccaagagcagaaggttggagaacctgatcgctca2520
acttcctggagaaaagaaaaacggtctttttgggaatttgattgccttgtctctgggcct2580
cacaccaaacttcaagtcaaattttgacctcgctgaggatgccaaacttcagttgtctaa2640
ggatacctatgatgacgatcttgacaatttgctggcacaaattggcgaccagtacgcgga2700
tctgttcctcgcagcgaagaatctgagtgatgctattctcctttcggacatactcagggt2760
taacactgagatcacaaaagcacctttgagtgcgtcgatgattaagcgctatgatgaaca2820
tcaccaagacctcactttgctgaaggcccttgtgcggcagcaattgccagagaagtacaa2880
agaaatcttctttgaccaatctaagaacggatacgctggctatattgatggaggagcttc2940
tcaggaggaattctataagtttatcaaacctatacttgagaagatggatggtacagagga3000
actccttgttaaattgaacagagaagatttgctgcgcaagcaacggacctttgacaacgg3060
atcaattccgcatcagatacacctcggcgagcttcatgccatccttcgccggcaggaaga3120
tttctacccctttttgaaggacaaccgcgagaagatagaaaaaatccttacgttccggat3180
tccttactatgtgggtccattggcaagggggaattcccgctttgcgtggatgactcggaa3240
aagcgaggaaactatcacaccgtggaacttcgaggaagttgtggacaagggagcttctgc3300
ccaatcattcattgagaggatgactaacttcgataagaacctgccgaacgagaaagttct3360
ccccaagcactccctcctttacgagtatttcaccgtgtataacgaacttacgaaggttaa3420
atacgtgactgagggtatgaggaagccagcattcttgagcggggaacaaaagaaagcgat3480
tgttgatttgctgtttaaaactaatcgcaaggtgacagtcaagcagctcaaagaggatta3540
tttcaagaaaattgaatgtttcgactctgtggagatatcaggagtcgaagataggtttaa3600
cgcttcccttggcacataccatgacctccttaagatcattaaggacaaagatttcctgga3660
taacgaggaaaatgaggacatcctcgaagatattgttcttaccttgacgctgtttgagga3720
tcgcgaaatgatcgaggaacggcttaagacgtatgctcacttgttcgacgataaggttat3780
gaagcagctcaagcgtagaaggtacactggatggggccgtctgtctagaaagctcatcaa3840
cggaatacgtgataaacaaagtggcaagacaattttggattttctgaagtcggacggatt3900
cgccaacagaaattttatgcagctgattcatgacgatagtctcaccttcaaagaggacat3960
acagaaggctcaagtgagtggtcaaggggattcgctgcatgaacacatcgcaaacctcgc4020
gggttcaccggccataaagaaaggaatccttcaaactgttaaggtcgttgatgagttggt4080
taaagtgatgggtaggcacaagcccgaaaacatagtgatcgagatggctcgcgaaaatca4140
gactacacaaaaagggcagaagaactctcgcgagcggatgaaaaggattgaggaaggaat4200
caaggaactgggctcacagattctcaaagagcatccagtcgaaaacacacagctgcaaaa4260
tgagaagctctatctttactatctccaaaatggccgggacatgtatgttgatcaggagct4320
tgacatcaaccgtttgtccgactatgatgtggaccacattgtcccgcaatctttccttgc4380
agacgattcaatcgataataaggtgttgacccggagcgataaaaaccgtggaaagtctga4440
caatgtcccttcagaggaagtggttaagaagatgaagaactactggagacaattgctgaa4500
tgcaaaactgatcacacagagaaagttcgacaacctcaccaaagcagagagaggtgggct4560
cagtgaacttgataaagcgggcttcattaagcgtcagctcgttgagactagacagatcac4620
gaagcatgtcgcgcagattttggattcgcggatgaacacgaagtacgacgagaatgataa4680
actgatacgtgaagtcaaggttatcactcttaagtccaaattggtgagcgatttcagaaa4740
ggacttccaattctataaggtcagggagatcaacaattatcatcacgctcacgatgccta4800
ccttaatgctgttgtggggaccgcccttattaagaaataccctgcattggagtctgaatt4860
cgtttacggggattataaggtctacgacgttaggaaaatgatagctaagagtgagcagga4920
gatcggtaaagcaactgcgaagtatttcttttactcgaacatcatgaatttctttaagac4980
cgagataacgctggcaaatggcgaaattagaaaggcacctctcatagagactaacggtga5040
gacaggggaaatcgtctgggataagggtagggactttgcgacagtgcgcaaggtcctctc5100
tatgccgcaagttaatattgtgaagaaaaccgaggtgcagacgggaggcttctccaagga5160
aagcatacttcccaaacggaactctgataagttgatcgctcgtaagaaagattgggaccc5220
taagaaatatggtgggttcgattccccaactgttgcttacagcgtgctggtcgttgccaa5280
ggtcgagaagggtaaatccaagaaactcaaaagcgttaaggaactccttgggattactat5340
catggagagatcttcattcgaaaagaatcctatcgactttcttgaggccaaaggatataa5400
ggaagttaagaaagatctgataatcaaactcccaaagtactcattgtttgagctggaaaa5460
cggcaggaagcgcatgcttgcttccgccggagagttgcagaaagggaacgagttggctct5520
gccttctaagtatgttaacttcctctatcttgcctctcattacgagaagctcaaaggctc5580
accagaggacaacgaacagaaacaactttttgtcgagcaacataagcactatttggatga5640
gattatagaacagatcagtgaattctcgaaaagggttatccttgcagatgcgaatcttga5700
caaggtgttgtctgcatacaacaaacatagagataagccgatcagggagcaagcggaaaa5760
tatcattcacctcttcactcttacaaacttgggtgctcccgctgccttcaagtattttga5820
taccacgattgaccggaaacgttacacctcaacgaaggaggtgctggatgccaccctcat5880
ccaccaatctattaccggactctacgagactagaatcgatctctcacagctcggcgggga5940
taaaagaccagcagcgacgaaaaaggcaggacaggctaagaagaagaaatagactagtct6000
gaaatcaccagtctctctctacaaatctatctctctctataataatgtgtgagtagttcc6060
cagataagggaattagggttcttatagggtttcgctcatgtgttgagcatataagaaacc6120
cttagtatgtatttgtatttgtaaaatacttctatcaataaaatttctaattcctaaaac6180
caaaatccagtgg6193
<210>4
<211>1423
<212>prt
<213>Artificial Sequence
<400>4
metasptyrlysasphisaspglyasptyrlysasphisaspileasp
151015
tyrlysaspaspaspasplysmetalaprolyslyslysarglysval
202530
glyilehisglyvalproalaalaasplyslystyrserileglyleu
354045
aspileglythrasnservalglytrpalavalilethraspglutyr
505560
lysvalproserlyslysphelysvalleuglyasnthrasparghis
65707580
serilelyslysasnleuileglyalaleuleupheaspserglyglu
859095
thralaglualathrargleulysargthralaargargargtyrthr
100105110
argarglysasnargilecystyrleuglngluilepheserasnglu
115120125
metalalysvalaspaspserphephehisargleuglugluserphe
130135140
leuvalglugluasplyslyshisgluarghisproilepheglyasn
145150155160
ilevalaspgluvalalatyrhisglulystyrprothriletyrhis
165170175
leuarglyslysleuvalaspserthrasplysalaaspleuargleu
180185190
iletyrleualaleualahismetilelyspheargglyhispheleu
195200205
ilegluglyaspleuasnproaspasnseraspvalasplysleuphe
210215220
ileglnleuvalglnthrtyrasnglnleupheglugluasnproile
225230235240
asnalaserglyvalaspalalysalaileleuseralaargleuser
245250255
lysserargargleugluasnleuilealaglnleuproglyglulys
260265270
lysasnglyleupheglyasnleuilealaleuserleuglyleuthr
275280285
proasnphelysserasnpheaspleualagluaspalalysleugln
290295300
leuserlysaspthrtyraspaspaspleuaspasnleuleualagln
305310315320
ileglyaspglntyralaaspleupheleualaalalysasnleuser
325330335
aspalaileleuleuseraspileleuargvalasnthrgluilethr
340345350
lysalaproleuseralasermetilelysargtyraspgluhishis
355360365
glnaspleuthrleuleulysalaleuvalargglnglnleuproglu
370375380
lystyrlysgluilephepheaspglnserlysasnglytyralagly
385390395400
tyrileaspglyglyalaserglnglugluphetyrlyspheilelys
405410415
proileleuglulysmetaspglythrglugluleuleuvallysleu
420425430
asnarggluaspleuleuarglysglnargthrpheaspasnglyser
435440445
ileprohisglnilehisleuglygluleuhisalaileleuargarg
450455460
glngluaspphetyrpropheleulysaspasnargglulysileglu
465470475480
lysileleuthrpheargileprotyrtyrvalglyproleualaarg
485490495
glyasnserargphealatrpmetthrarglysserglugluthrile
500505510
thrprotrpasnpheglugluvalvalasplysglyalaseralagln
515520525
serpheilegluargmetthrasnpheasplysasnleuproasnglu
530535540
lysvalleuprolyshisserleuleutyrglutyrphethrvaltyr
545550555560
asngluleuthrlysvallystyrvalthrgluglymetarglyspro
565570575
alapheleuserglygluglnlyslysalailevalaspleuleuphe
580585590
lysthrasnarglysvalthrvallysglnleulysgluasptyrphe
595600605
lyslysileglucyspheaspservalgluileserglyvalgluasp
610615620
argpheasnalaserleuglythrtyrhisaspleuleulysileile
625630635640
lysasplysasppheleuaspasnglugluasngluaspileleuglu
645650655
aspilevalleuthrleuthrleuphegluaspargglumetileglu
660665670
gluargleulysthrtyralahisleupheaspasplysvalmetlys
675680685
glnleulysargargargtyrthrglytrpglyargleuserarglys
690695700
leuileasnglyileargasplysglnserglylysthrileleuasp
705710715720
pheleulysseraspglyphealaasnargasnphemetglnleuile
725730735
hisaspaspserleuthrphelysgluaspileglnlysalaglnval
740745750
serglyglnglyaspserleuhisgluhisilealaasnleualagly
755760765
serproalailelyslysglyileleuglnthrvallysvalvalasp
770775780
gluleuvallysvalmetglyarghislysprogluasnilevalile
785790795800
glumetalaarggluasnglnthrthrglnlysglyglnlysasnser
805810815
arggluargmetlysargileglugluglyilelysgluleuglyser
820825830
glnileleulysgluhisprovalgluasnthrglnleuglnasnglu
835840845
lysleutyrleutyrtyrleuglnasnglyargaspmettyrvalasp
850855860
glngluleuaspileasnargleuserasptyraspvalasphisile
865870875880
valproglnserpheleualaaspaspserileaspasnlysvalleu
885890895
thrargserasplysasnargglylysseraspasnvalproserglu
900905910
gluvalvallyslysmetlysasntyrtrpargglnleuleuasnala
915920925
lysleuilethrglnarglyspheaspasnleuthrlysalagluarg
930935940
glyglyleusergluleuasplysalaglypheilelysargglnleu
945950955960
valgluthrargglnilethrlyshisvalalaglnileleuaspser
965970975
argmetasnthrlystyraspgluasnasplysleuilearggluval
980985990
lysvalilethrleulysserlysleuvalseraspphearglysasp
99510001005
pheglnphetyrlysvalarggluileasnasntyrhishisalahis
101010151020
aspalatyrleuasnalavalvalglythralaleuilelyslystyr
1025103010351040
proalaleuglusergluphevaltyrglyasptyrlysvaltyrasp
104510501055
valarglysmetilealalyssergluglngluileglylysalathr
106010651070
alalystyrphephetyrserasnilemetasnphephelysthrglu
107510801085
ilethrleualaasnglygluilearglysalaproleuilegluthr
109010951100
asnglygluthrglygluilevaltrpasplysglyargasppheala
1105111011151120
thrvalarglysvalleusermetproglnvalasnilevallyslys
112511301135
thrgluvalglnthrglyglypheserlysgluserileleuprolys
114011451150
argasnserasplysleuilealaarglyslysasptrpaspprolys
115511601165
lystyrglyglypheaspserprothrvalalatyrservalleuval
117011751180
valalalysvalglulysglylysserlyslysleulysservallys
1185119011951200
gluleuleuglyilethrilemetgluargserserpheglulysasn
120512101215
proileasppheleuglualalysglytyrlysgluvallyslysasp
122012251230
leuileilelysleuprolystyrserleuphegluleugluasngly
123512401245
arglysargmetleualaseralaglygluleuglnlysglyasnglu
125012551260
leualaleuproserlystyrvalasnpheleutyrleualaserhis
1265127012751280
tyrglulysleulysglyserprogluaspasngluglnlysglnleu
128512901295
phevalgluglnhislyshistyrleuaspgluileilegluglnile
130013051310
serglupheserlysargvalileleualaaspalaasnleuasplys
131513201325
valleuseralatyrasnlyshisargasplysproileargglugln
133013351340
alagluasnileilehisleuphethrleuthrasnleuglyalapro
1345135013551360
alaalaphelystyrpheaspthrthrileasparglysargtyrthr
136513701375
serthrlysgluvalleuaspalathrleuilehisglnserilethr
138013851390
glyleutyrgluthrargileaspleuserglnleuglyglyasplys
139514001405
argproalaalathrlyslysalaglyglnalalyslyslyslys
141014151420