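The present invention relates to gene knockout strategies and, more specifically, to a base editing-based gene knockout method and applications thereof.
Background Art:
Conventional targeted gene manipulation in eukaryotes is achieved by homologous recombination in totipotent embryonic stem cells followed by blastocyst injection. Because this route requires established totipotent embryonic stem cell lines, gene targeting by homologous recombination has been accomplished mainly in mice (with some reports in rats) [Capecchi, 2005]. Another route to gene targeting is cloning, that is, genetic modification of somatic cells followed by nuclear transfer. Cloning, however, has several drawbacks [Carter et al., 2002; Zhu et al., 2004]: (1) after cloning, terminally differentiated somatic cells rarely dedifferentiate completely into undifferentiated cells, which impairs embryonic development and causes developmental defects; (2) all of the genetic material is of maternal origin only; and (3) the success rate is low. Conventional gene targeting techniques have therefore constrained gene knockout.
Programmable endonuclease technologies include zinc-finger nucleases (ZFNs), transcription activator-like effector nucleases (TALENs), and the clustered regularly interspaced short palindromic repeats/CRISPR-associated system (CRISPR/Cas9) [Kim and Kim, 2014]. The invention and spread of these technologies removed the dependence on totipotent embryonic stem cells and made gene manipulation possible in different species. The CRISPR/Cas9 system in particular, being simple, efficient and inexpensive, spread worldwide immediately after it appeared. Although it is the newest technology in the gene editing field, it has developed the fastest and is the most widely applied, and it has triggered a revolution in gene editing. The CRISPR/Cas9 system has now been used successfully for DNA knockout, DNA knock-in, DNA replacement, DNA modification, RNA modification, DNA labeling, regulation of gene transcription and so on [Hsu et al., 2014; Komor et al., 2017], and has been applied successfully to gene editing in many species [Barrangou R & Doudna JA, 2016; Komor et al., 2017].
CRISPR/Cas9-mediated specific gene editing uses an sgRNA (single guide RNA) that, by complementarity with the target sequence, guides the Cas9 protein to cleave double-stranded DNA at a defined position, producing a double-strand break (DSB). In the absence of a template, the break is repaired by non-homologous end joining (NHEJ), which causes a frameshift mutation and results in gene knockout; in the presence of a template, the break is repaired by homology-directed repair (HDR), which achieves gene knock-in [Hsu et al., 2014; Kim and Kim, 2014; Komor et al., 2017]. Because HDR is inefficient (integration rarely occurs) and NHEJ readily produces random insertions and deletions (indels), new bases may be introduced at random near the break point, resulting in imprecise gene editing. In addition, CRISPR/Cas9-mediated gene editing always carries some off-target effects [Gorski et al., 2017].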
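Summary of the Invention:
One object of the present invention is to provide an efficient and precise gene knockout strategy.
Recent studies have shown that Cas9 fusion proteins built on CRISPR/Cas9 technology can serve as "base editors (BE)". These fusion proteins comprise dCas9 or a Cas9 nickase together with the rat cytidine deaminase APOBEC1, which converts cytosine (C) to uracil by deamination without cutting the DNA. The uracil is subsequently converted to thymine (T) through DNA replication or repair. Similarly, a single base G can be converted to A (when the deaminated C lies on the complementary strand). In particular, BE3, which consists of a Cas9 nickase and APOBEC1, raises base editing efficiency markedly, to 15-75%. Because no DSB is produced by cutting the DNA, indels form at a rate below 1% and the resulting gene editing is more precise [Komor et al., 2016]; moreover, this approach reduces the off-target rate to less than 10 times the natural background, so the resulting gene editing is safer [Nishida et al., 2017]. BE3 has been used successfully for in vivo base editing, achieving C→T mutation in mice with an efficiency of 44-57% [Kim et al., 2017].
Building on the BE-mediated single-base mutation described above, and in particular on the precision and specificity of BE3-mediated single-base editing, the inventors devised a gene knockout strategy: a stop codon is introduced by C→T mutation, for example by mutating CAA, CAG or CGA to the stop codon TAA, TAG or TGA respectively, or by G→A mutation, by mutating TGG to the stop codon TAA, TGA or TAG. This terminates translation of the coding gene and thereby achieves gene knockout.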
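The codon conversions named above can be checked with a few lines of code. The following is only an illustrative sketch; the function names c_to_t_stop and g_to_a_stops are hypothetical and are not part of the invention.

```python
# Illustrative sketch: which stop codons result from the single-base changes
# described above. Function names are hypothetical helpers.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def c_to_t_stop(codon):
    """Convert the leading C of CAA, CAG or CGA to T."""
    codon = codon.upper()
    if codon not in {"CAA", "CAG", "CGA"}:
        raise ValueError("expected CAA, CAG or CGA")
    return "T" + codon[1:]


def g_to_a_stops(codon="TGG"):
    """Return every codon obtained from TGG by changing one or both G to A."""
    codon = codon.upper()
    if codon != "TGG":
        raise ValueError("expected TGG")
    variants = {codon[0] + a + b for a in "GA" for b in "GA"}
    return variants - {codon}


if __name__ == "__main__":
    for c in ("CAA", "CAG", "CGA"):
        print(c, "->", c_to_t_stop(c))
    print("TGG ->", sorted(g_to_a_stops()))
```

Every codon produced by either route is one of the three standard stop codons, which is the property the knockout strategy relies on.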
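According to a first aspect of the present invention, there is provided a gene knockout method comprising:
selecting, in the coding region (CDS) of the gene to be knocked out, a 20 bp-NGG target sequence (NGG being the PAM sequence) that contains a complete target codon CAA, CAG or CGA;
using an sgRNA sequence to direct BE3 to the target sequence so that the target single base C in the target codon is changed to T, thereby introducing the corresponding stop codon TAA, TAG or TGA and achieving gene knockout,
wherein the target single base C is located at positions 1-8, preferably positions 4-8, of the target sequence (counted from the left end), the target codon is separated from the NGG by 12-14 bp, preferably 14 bp, and the base immediately upstream of the target codon (H) must not be G;
the sgRNA sequence being a 20 bp sequence complementary to and corresponding to the target sequence.
Alternatively, in the above method, a CCN-20 bp target sequence (CCN being the PAM sequence) may be selected in the coding region of the gene to be knocked out such that it contains a complete target codon TGG, where the base immediately downstream of the target codon (D) must not be C; correspondingly, the target single base G is located at positions 1-8, preferably positions 4-8, of the target sequence (counted from the right end), and the target codon is separated from the CCN by 12-14 bp, preferably 14 bp.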
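According to the present invention, BE3 may be selected from: rAPOBEC1-SaCas9-NLS-UGI-NLS; 3xUGI-rAPOBEC1-SaCas9-NLS-UGI-NLS; rAPOBEC1-SpCas9-NLS-UGI-NLS; and 3xUGI-rAPOBEC1-SpCas9-NLS-UGI-NLS, the latter two being preferred.
The method according to the present invention can be used to knock out the following eight target genes: human PD1, LAG3, TIGIT, VISTA, 2B4 and CD160, and mouse TIM3 and LAG3. The corresponding sgRNA sequences are complementary to the target gene sequences shown in Sequences I to VIII, respectively.
According to a second aspect of the present invention, there is provided the use of the above method for knocking out the human PD1, LAG3, TIGIT, VISTA, 2B4 and CD160 genes in the HEK293T cell line.
According to a third aspect of the present invention, there is provided the use of the above method for knocking out the human PD1, LAG3, TIGIT, VISTA, 2B4 and CD160 genes in human T cells.
According to a fourth aspect of the present invention, there are provided isolated T cells or cell lines obtained according to the above uses, or subcultures thereof.
According to a fifth aspect of the present invention, there is provided a kit for gene knockout, comprising the above sgRNA (corresponding to the gene to be knocked out), BE3, and corresponding amplification reagents.
The present invention uses base editing technology developed on the basis of CRISPR/Cas9 to create stop codons through precise C→T or G→A single-base mutations, thereby establishing a gene knockout strategy that is more efficient and more precise than CRISPR/Cas9 and has fewer off-target effects.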
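Brief Description of the Drawings
Fig. 1 is a schematic diagram of knocking out a target gene by C→T mutation according to the present invention;
Figs. 2-5 are schematic diagrams of the structures of the different BE3 constructs.
Detailed Description
First, different BE3 constructs are built. As shown in Figs. 2-5, different Cas9 nickases are fused with cytidine deaminase (APOBEC1) to form the following four BE3 constructs:
(1) rAPOBEC1-SaCas9-NLS-UGI-NLS, Fig. 2, SEQ ID NO: 1;
(2) 3xUGI-rAPOBEC1-SaCas9-NLS-UGI-NLS, Fig. 3, SEQ ID NO: 2;
(3) rAPOBEC1-SpCas9-NLS-UGI-NLS, Fig. 4, SEQ ID NO: 3;
(4) 3xUGI-rAPOBEC1-SpCas9-NLS-UGI-NLS, Fig. 5, SEQ ID NO: 4.
Any of the above BE3 constructs may be used for the gene knockouts described below, with (3) or (4) preferred.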
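Next, the sgRNA is designed. Site-directed base editing uses an sgRNA to position BE3 at, or target it to, a specific target site, so the selection and design of target gene-specific sgRNAs is the key point of the present invention. The sgRNA is selected and designed as follows:
select, in the coding region of the gene to be knocked out, a 20 bp-NGG target sequence (NGG being the PAM sequence) that contains a complete target codon CAA, CAG or CGA;
the target single base C is preferably located at positions 4-8 of the target sequence (counted from the left end), the target codon is preferably separated from the NGG by 14 bp, and the base immediately upstream of the target codon (H) must not be G;
prepare a 20 bp sgRNA sequence complementary to and corresponding to the target sequence.
In the alternative scheme, a CCN-20 bp target sequence (CCN being the PAM sequence) is selected in the coding region of the gene to be knocked out such that it contains a complete target codon TGG, where the base immediately downstream of the target codon (D) must not be C; correspondingly, the target single base G is preferably located at positions 4-8 of the target sequence (counted from the right end), and the target codon is preferably separated from the CCN by 14 bp.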
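The selection rules above lend themselves to a simple scan of a coding sequence. The sketch below is illustrative only and rests on several assumptions that the text does not state explicitly: the input is a contiguous coding-strand sequence that begins in frame; the "spacing" is taken as the number of bases between the target codon and the PAM; for TGG, the position is counted for the codon's last G; and the function names and the dictionary keys are hypothetical. With the default ranges (base at positions 4-8, spacing 12-14 bp) only positions 4-6 can satisfy both conditions at once.

```python
# Illustrative sketch of the target-selection rules; names are hypothetical.
C_TO_T = {"CAA": "TAA", "CAG": "TAG", "CGA": "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    return seq.translate(COMPLEMENT)[::-1]


def find_ngg_targets(cds, positions=range(4, 9), gaps=range(12, 15)):
    """Scan in-frame CAA/CAG/CGA codons for a usable 20 bp-NGG window."""
    cds = cds.upper()
    hits = []
    for i in range(3, len(cds) - 2, 3):        # in-frame codon starts after codon 1
        codon = cds[i:i + 3]
        if codon not in C_TO_T or cds[i - 1] == "G":
            continue                            # upstream base (H) must not be G
        for p in positions:                     # position of the target C from the left
            start, gap = i - (p - 1), 18 - p    # gap: bases between codon and PAM
            if start < 0 or start + 23 > len(cds) or gap not in gaps:
                continue
            if cds[start + 21:start + 23] == "GG":
                hits.append({"codon": codon, "stop": C_TO_T[codon],
                             "c_position": p, "gap": gap,
                             "target": cds[start:start + 20],
                             "pam": cds[start + 20:start + 23]})
    return hits


def find_ccn_targets(cds, positions=range(4, 9), gaps=range(12, 15)):
    """Scan in-frame TGG codons for a usable CCN-20 bp window."""
    cds = cds.upper()
    hits = []
    for i in range(3, len(cds) - 3, 3):
        if cds[i:i + 3] != "TGG" or cds[i + 3] == "C":
            continue                            # downstream base (D) must not be C
        for q in positions:                     # position of the last G from the right
            start, gap = i + q - 18, 18 - q     # gap: bases between CCN and codon
            if start < 3 or start + 20 > len(cds) or gap not in gaps:
                continue
            if cds[start - 3:start - 1] == "CC":
                target = cds[start:start + 20]
                hits.append({"codon": "TGG", "g_position": q, "gap": gap,
                             "pam": cds[start - 3:start], "target": target,
                             # same window read 5'->3' on the opposite strand
                             "opposite_strand": reverse_complement(target)})
    return hits


if __name__ == "__main__":
    demo = "ATGGCTCAG" + "A" * 14 + "TGGA"      # made-up sequence, not a real gene
    for hit in find_ngg_targets(demo):
        print(hit)
```

The demo sequence is invented for illustration. For a real gene, candidate windows that span an exon boundary would also have to be excluded, which this sketch does not attempt.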
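For the eight target genes (human PD1, LAG3, TIGIT, VISTA, 2B4 and CD160, and mouse TIM3 and LAG3), the present invention selects the following target gene sequences for designing the corresponding sgRNAs (bold underline indicates the PAM; italic underline indicates the candidate mutation codon):
I. hPD-1
sg-1:
sg-2:
sg-3:
II. hLAG3
sg-1:
sg-2:
sg-3:
III. hTIGIT
sg-1:
IV. hVISTA
sg-1:
V. 2B4
sg-1:
VI. hCD160
sg-1:
VII. mTIM3
sg-1:
VIII. mLAG3
sg-1:
For the target gene sequences selected above, namely human PD1 (3 sequences), LAG3 (3 sequences), TIGIT, VISTA, 2B4 and CD160 and mouse TIM3 and LAG3, the corresponding sgRNA expression vectors are constructed by cloning each sgRNA separately into pGL3-U6-sgRNA.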
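Example 1
BE3-mediated base editing is performed in a cell line to introduce a stop codon and achieve gene knockout. Gene knockout in the cell line is carried out by conventional procedures (electroporation or liposome transfection); liposome transfection is taken as the example here.
(1) Taking HEK293T cells as an example, eukaryotic cells are cultured and transfected as follows: HEK293T cells are seeded and cultured in high-glucose DMEM (HyClone, SH30022.01B) supplemented with 10% FBS and containing penicillin (100 U/ml) and streptomycin (100 μg/ml).
(2) Before transfection, the cells are split into 6-well plates and transfected when they reach 70%-80% confluence.
(3) Liposome transfection is taken as the example. Following the manual for Lipofectamine™ 2000 Transfection Reagent (Invitrogen, 11668-019), and taking the SpCas9 nickase as the example, 2 μg of BE3 plasmid is mixed with 2 μg of pGL3-U6-sgRNA plasmid and co-transfected into the cells of each well. The medium is changed after 6-8 hours and the cells are harvested after 72 hours.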
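(4) Genotype analysis
a. A portion of the cells is collected and lysed and digested with 100 μg/ml proteinase K in lysis buffer (10 μM Tris-HCl, 0.4 M NaCl, 2 μM EDTA, 1% SDS); after phenol-chloroform extraction, the DNA is dissolved in 50 μl of deionized water.
b. PCR amplification is performed with a pair of primers, N-For and N-Rev, and the PCR product is recovered by purification with AxyPrep PCR Cleanup. 200 ng is diluted to a uniform 20 μl and subjected to denaturation and annealing with a program such as: 95 °C, 5 min; 95-85 °C at -2 °C/s; 85-25 °C at -0.1 °C/s; hold at 4 °C.
c. The recovered PCR product is A-tailed using rTaq. The A-tailing reaction contains:
700-800 ng recovered PCR product
5 μl 10x buffer (Mg2+ plus)
4 μl dNTP
0.5 μl rTaq (TaKaRa, R001AM)
water to a final volume of 50 μl.
After incubation at 37 °C for 30 minutes, 1 μl of the product is ligated into the pMD19-T vector (TaKaRa, 3271) and transformed into DH5α competent cells (TransGen, CD201).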
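The volume of water needed to bring the A-tailing reaction to 50 μl depends on the concentration of the recovered PCR product, which the protocol does not specify. The following sketch computes the pipetting volumes under that assumption; the function name a_tailing_mix and the example concentration of 50 ng/μl are hypothetical.

```python
# Illustrative sketch: pipetting volumes for the 50 ul A-tailing reaction.
# The product concentration passed in is an assumed, user-measured value.
def a_tailing_mix(product_ng, conc_ng_per_ul, total_ul=50.0):
    fixed = {"10x buffer (Mg2+ plus)": 5.0, "dNTP": 4.0, "rTaq": 0.5}
    product_ul = product_ng / conc_ng_per_ul
    water_ul = total_ul - sum(fixed.values()) - product_ul
    if water_ul < 0:
        raise ValueError("PCR product is too dilute for a 50 ul reaction")
    return {"PCR product": product_ul, **fixed, "water": water_ul}


if __name__ == "__main__":
    # 750 ng of product at an assumed 50 ng/ul
    for component, volume in a_tailing_mix(750, 50).items():
        print(f"{component}: {volume:.1f} ul")
```

If the product is too dilute for the required mass to fit in the reaction, the function raises an error rather than returning a negative water volume.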
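d. Single clones are picked and the mutations in each target gene are sequenced with the universal primer M13-F. The sequencing results are as follows (bold underline indicates the PAM; italics indicate the mutated codon; italic underline indicates the mutated base):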
1.hpd-1
sg-1:
mut:
sg-2:
mut:
sg-3:
mut:
2.hlag3
sg-1:
mut:
sg-2:
mut:
sg-3:
mut:
3.htigit
sg-1:
mut:
4.hvista
sg-1:
mut:
5.2b4
sg-1:
mut:
6.hcd160
sg-1:
mut:
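The results show that the sgRNA-targeted base mutations occurred in the target genes and introduced stop codons; the PD1, LAG3, TIGIT, VISTA, 2B4 and CD160 genes were knocked out successfully.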
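A sequencing read can be checked for the intended edit by listing its base substitutions relative to the reference and locating the first in-frame stop codon. The sketch below is illustrative only: the function names are hypothetical, both sequences are assumed to be aligned, of equal length and in frame, and the demo sequences are invented rather than taken from the sequencing results above.

```python
# Illustrative sketch: confirm that a read carries a premature stop codon.
STOP_CODONS = {"TAA", "TAG", "TGA"}


def substitutions(ref, mut):
    """List (1-based position, reference base, read base) for each mismatch."""
    if len(ref) != len(mut):
        raise ValueError("sequences must be aligned and of equal length")
    pairs = zip(ref.upper(), mut.upper())
    return [(i + 1, r, m) for i, (r, m) in enumerate(pairs) if r != m]


def first_stop_codon(cds):
    """Return the 1-based number of the first in-frame stop codon, or None."""
    cds = cds.upper()
    for i in range(0, len(cds) - 2, 3):
        if cds[i:i + 3] in STOP_CODONS:
            return i // 3 + 1
    return None


if __name__ == "__main__":
    ref = "ATGGCTCAGAAATGA"   # made-up reference: ATG GCT CAG AAA TGA
    mut = "ATGGCTTAGAAATGA"   # made-up read with CAG -> TAG
    print(substitutions(ref, mut))
    print(first_stop_codon(ref), first_stop_codon(mut))
```

A read counts as a knockout allele under this check when its first stop codon occurs earlier than in the reference and the only substitutions are the intended C→T or G→A changes.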
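Example 2
BE3-mediated base editing is performed in primary cells to introduce a stop codon and achieve gene knockout.
Following conventional procedures, gene knockout in primary cells is carried out in human T cells (by electroporation or liposome transfection); electroporation is taken as the example.
(1) Isolation and purification of PBMCs:
a. Collect peripheral blood in anticoagulant tubes, shaking during collection so that the blood mixes thoroughly with the anticoagulant;
b. Mix the peripheral blood with an equal volume of lymphocyte separation medium, centrifuge, and collect the buffy coat cells after centrifugation;
c. Mix the buffy coat cells with PBS or serum-free 1640 cell culture medium and centrifuge; the pellet is the PBMCs.
Repeat three times.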
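(2) Enrichment of CD3-positive cells
a. Adjust the PBMC concentration to 50 × 10^6 cells/ml.
b. Add 50 μl of CD3+ enrichment antibody cocktail per 1 ml, mix, and let stand at room temperature for 5 minutes.
c. Add 150 μl of the magnetic reagent ("magnet") per 1 ml, mix, and let stand at room temperature for 10 minutes.
d. Place the centrifuge tube on a magnetic rack for 5 minutes, then transfer the upper cell suspension to a new 15 ml centrifuge tube.
e. Repeat this operation once.
f. Centrifuge at room temperature at 300 × g for 10 minutes and collect the cells.
g. Count the cells.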
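The per-millilitre reagent amounts in steps a to c scale with the number of PBMCs. As an illustrative sketch, the volumes for a given cell count can be computed as follows; the function name cd3_enrichment_volumes and its output keys are hypothetical.

```python
# Illustrative sketch: scale the enrichment reagents to the PBMC count.
def cd3_enrichment_volumes(total_cells):
    sample_ml = total_cells / 50e6            # step a: 50 x 10^6 cells/ml
    return {
        "sample_ml": sample_ml,
        "antibody_cocktail_ul": 50.0 * sample_ml,   # step b: 50 ul per ml
        "magnetic_reagent_ul": 150.0 * sample_ml,   # step c: 150 ul per ml
    }


if __name__ == "__main__":
    print(cd3_enrichment_volumes(100e6))
```

For example, 100 × 10^6 PBMCs correspond to a 2 ml sample, 100 μl of antibody cocktail and 300 μl of the magnetic reagent.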
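(3) Electroporation of CD3-positive cells
a. Prepare the electroporation mix
Add 8 μg of BE3 plasmid and 8 μg of pGL3-U6-sgRNA plasmid to a 1.5 ml centrifuge tube and, as required by the Lonza Amaxa electroporation kit instructions, add 82 μl of electroporation buffer and 18 μl of Supplement 1; mix.
b. Collect 20 × 10^6 cells into a 15 ml centrifuge tube, centrifuge at 300 × g for 10 minutes, and discard the supernatant.
c. Resuspend the cells in the plasmid and electroporation buffer mixture prepared in step a and transfer to an electroporation cuvette.
d. Electroporate using the Lonza 2b instrument with program U-014.
e. Quickly transfer the electroporated cells to pre-warmed AIM-V medium supplemented with 10% FBS and culture in a 37 °C, 5% CO2 incubator for 2 hours.
f. Replace all of the medium, resuspend the electroporated cells at a density of 1 × 10^6 cells/ml, and culture overnight.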
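(4) Activation and culture of T cells
a. After 24 hours of culture following electroporation, add 100 U/ml IL-2 to the medium and add CD3/CD28 Dynabeads at a 1:1 ratio to activate the T cells.
b. Every two days, replace half of the medium or supplement IL-2, keeping the cell density at 1 × 10^6 cells/ml throughout.
c. After 5 days of activation, collect the T cells into a 15 ml centrifuge tube, place the tube on a magnetic rack, and slowly transfer the supernatant to another clean 15 ml centrifuge tube; repeat this step once.
d. Centrifuge at room temperature at 300 × g for 10 minutes, discard the supernatant, and resuspend the cells in AIM-V medium containing 10% FBS and 300 U/ml IL-2, with the density adjusted to 1 × 10^6 cells/ml.
e. Every two days, replace half of the medium or supplement IL-2, and count the cells, keeping the cell density at 1 × 10^6 cells/ml throughout.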
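Keeping the culture at 1 × 10^6 cells/ml amounts to a simple dilution calculation after each count. The sketch below is illustrative only; the function name medium_to_add_ml is hypothetical, and it assumes the cells are diluted by adding fresh medium rather than being split.

```python
# Illustrative sketch: medium needed to bring a culture to the target density.
def medium_to_add_ml(total_cells, current_volume_ml, target_per_ml=1e6):
    required_ml = total_cells / target_per_ml
    return max(0.0, required_ml - current_volume_ml)


if __name__ == "__main__":
    # 20 x 10^6 electroporated cells resuspended from a pellet
    print(medium_to_add_ml(20e6, 0.0))
    # 30 x 10^6 cells counted in a 20 ml culture
    print(medium_to_add_ml(30e6, 20.0))
```

When the culture is already at or below the target density the function returns zero, in which case only the half-medium change or IL-2 supplement described above applies.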
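(5) Genotype analysis
a. A portion of the cells is collected and lysed and digested with 100 μg/ml proteinase K in lysis buffer (10 μM Tris-HCl, 0.4 M NaCl, 2 μM EDTA, 1% SDS); after phenol-chloroform extraction, the DNA is dissolved in 50 μl of deionized water.
b. PCR amplification is performed with a pair of primers, N-For and N-Rev, and the PCR product is recovered by purification with AxyPrep PCR Cleanup. 200 ng is diluted to a uniform 20 μl and subjected to denaturation and annealing with a program such as: 95 °C, 5 min; 95-85 °C at -2 °C/s; 85-25 °C at -0.1 °C/s; hold at 4 °C.
c. The recovered PCR product is A-tailed using rTaq. The A-tailing reaction contains:
700-800 ng recovered PCR product
5 μl 10x buffer (Mg2+ plus)
4 μl dNTP
0.5 μl rTaq (TaKaRa, R001AM)
water to a final volume of 50 μl.
After incubation at 37 °C for 30 minutes, 1 μl of the product is ligated into the pMD19-T vector (TaKaRa, 3271) and transformed into DH5α competent cells (TransGen, CD201).
d. Single clones are picked and the mutations in each target gene of the T cells are sequenced with the universal primer M13-F. The sequencing results are as follows (bold underline indicates the PAM; italics indicate the candidate mutation codon; italic underline indicates the mutated base):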
1.hpd-1
sg-1:
mut:
sg-2:
mut:
sg-3:
mut:
2.hlag3
sg-1:
mut:
sg-2:
mut:
sg-3:
mut:
3.htigit
sg-1:
mut:
4.hvista
sg-1:
mut:
5.2b4
sg-1:
mut:
6.hcd160
sg-1:
mut:
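The results show that the sgRNA-targeted base mutations occurred in the target genes and introduced stop codons; the PD1, LAG3, TIGIT, VISTA, 2B4 and CD160 genes were knocked out successfully.
Example 3
Construction of BE3-mediated gene knockout mice
Mouse embryo collection, microinjection, embryo culture and embryo transfer are performed by conventional procedures. Knockout mice are constructed taking the TIM3 and LAG3 genes as examples.
(1) Microinjection: fertilized eggs are injected with BE3 mRNA and TIM3-specific sgRNA (corresponding to Sequence VII above), or with BE3 mRNA and LAG3-specific sgRNA (corresponding to Sequence VIII above). Embryo transfer is performed conventionally;
(2) Genotype analysis: genomic DNA is extracted from mouse tail clips by conventional methods, the coding regions are PCR-amplified separately and Sanger sequenced. The sequencing results are as follows (bold underline indicates the PAM; italics indicate the mutated codon; italic underline indicates the mutated base):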
7.mtim3
sg-1:
mut:
8.mlag3
sg-1:
mut:
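The above results confirm the C→T mutations in TIM3 and LAG3 and the introduction of stop codons. The TIM3 and LAG3 knockout mice were constructed successfully.
Sequence Listing
<110>Huang Xingxu (黄行许)
<120>Base editing-based gene knockout method and applications thereof
<160>4
<210>1
<211>7686
<212>DNA
<213>Artificial sequence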
<400>1
atatgccaagtacgccccctattgacgtcaatgacggtaaatggcccgcctggcattatg60
cccagtacatgaccttatgggactttcctacttggcagtacatctacgtattagtcatcg120
ctattaccatggtgatgcggttttggcagtacatcaatgggcgtggatagcggtttgact180
cacggggatttccaagtctccaccccattgacgtcaatgggagtttgttttggcaccaaa240
atcaacgggactttccaaaatgtcgtaacaactccgccccattgacgcaaatgggcggta300
ggcgtgtacggtgggaggtctatataagcagagctggtttagtgaaccgtcagatccgct360
agagatccgcggccgctaatacgactcactatagggagagccgccaccatgagctcagag420
actggcccagtggctgtggaccccacattgagacggcggatcgagccccatgagtttgag480
gtattcttcgatccgagagagctccgcaaggagacctgcctgctttacgaaattaattgg540
gggggccggcactccatttggcgacatacatcacagaacactaacaagcacgtcgaagtc600
aacttcatcgagaagttcacgacagaaagatatttctgtccgaacacaaggtgcagcatt660
acctggtttctcagctggagcccatgcggcgaatgtagtagggccatcactgaattcctg720
tcaaggtatccccacgtcactctgtttatttacatcgcaaggctgtaccaccacgctgac780
ccccgcaatcgacaaggcctgcgggatttgatctcttcaggtgtgactatccaaattatg840
actgagcaggagtcaggatactgctggagaaactttgtgaattatagcccgagtaatgaa900
gcccactggcctaggtatccccatctgtgggtacgactgtacgttcttgaactgtactgc960
atcatactgggcctgcctccttgtctcaacattctgagaaggaagcagccacagctgaca1020
ttctttaccatcgctcttcagtcttgtcattaccagcgactgcccccacacattctctgg1080
gccaccgggttgaaaagcggcagcgagactcccgggacctcagagtccgccacacccgaa1140
agtaagcggaactatatcctcgggctggctattggcatcacatctgtcggctatggtata1200
atagactatgaaacaagggacgtgattgacgcaggtgtgaggctgttcaaggaggcaaac1260
gtcgagaacaacgaaggtcggagaagcaagaggggtgcccggaggctgaagaggaggaga1320
aggcacagaatacagcgggtcaagaaactcctgttcgactataacctgctgaccgatcat1380
tccgaactgtcaggcatcaatccttacgaagccagagtcaagggtctgtctcaaaaactc1440
tctgaggaagagttttccgcagccctgctgcacctggctaagaggagaggagtccacaac1500
gtcaatgaggttgaggaggatacagggaacgaactgtctacaaaggaacagatcagccgg1560
aatagcaaggccctggaagagaagtacgttgctgaactgcagctggaaaggctcaagaaa1620
gatggagaggttcggggttccatcaacaggttcaagacatctgactatgtgaaggaagcc1680
aagcaactgctcaaggtgcagaaggcctaccatcagctcgaccagagcttcattgatact1740
tacatagacctgctggagactaggagaacttactacgaagggcctggcgagggcagccct1800
ttcggctggaaagatatcaaggagtggtacgagatgctcatggggcattgcacctacttc1860
cccgaagaactgaggtcagtcaagtacgcctacaacgcagacctgtacaacgccctgaat1920
gatctcaacaatctcgtcataactcgggatgaaaacgagaagctggaatattatgagaag1980
ttccagattattgaaaatgtgttcaaacagaagaagaaacctaccctgaaacaaattgcc2040
aaagagatcctggtgaatgaggaggatatcaagggatatcgggttacttctaccggcaaa2100
ccagagttcacaaatctgaaagtttaccatgacatcaaagatattaccgcaagaaaggag2160
atcatcgagaacgctgagctcctggaccagatcgctaagattctcactatctatcagtcc2220
agcgaggatattcaggaagagctgaccaacctgaactcagagctgactcaggaagaaatc2280
gaacaaatctccaatctgaaaggatacactggtacccataatctctcactcaaggctatc2340
aatctgatcctggatgaactgtggcatactaacgacaatcagatcgccatcttcaatcgg2400
ctcaaactggtgcccaaaaaagtggacctgagccaacagaaagagattcctacaaccctg2460
gtggacgatttcattctgagcccagtggttaagcggagcttcatccaatccatcaaggtg2520
atcaacgctatcatcaagaagtatggcctgcctaatgacataatcattgaactcgcaagg2580
gaaaagaatagcaaagatgcccagaagatgataaacgagatgcagaaacggaacagacag2640
actaacgaaagaatcgaggaaataatacggaccactggtaaggagaacgctaagtatctg2700
atcgagaaaatcaagctgcacgatatgcaggaaggcaagtgcctgtattctctggaggct2760
atacccctggaggatctgctcaataatcctttcaattacgaggtggatcacatcatacca2820
agatccgtgagctttgacaatagctttaataataaggtgctcgtgaagcaggaggaaaac2880
tcaaagaaaggcaacaggaccccattccagtacctgtccagctctgacagcaagattagc2940
tacgaaaccttcaagaaacacatcctgaatctggccaagggcaagggaagaataagcaaa3000
acaaagaaagagtatctcctggaggaaagggacatcaacaggttttcagtgcagaaagat3060
tttatcaatcggaatctcgttgacacaagatatgctaccagagggctcatgaatctgctc3120
aggtcatactttagggtgaacaacctggatgtgaaggtcaaatccataaatggagggttc3180
acttcctttctcaggagaaaatggaagtttaagaaagagagaaataagggttacaaacat3240
cacgccgaggacgcactgatcattgccaacgctgactttatctttaaggaatggaagaag3300
ctggacaaagcaaagaaggtgatggagaatcagatgtttgaggaaaagcaggccgagtct3360
atgcctgagattgaaacagagcaggaatacaaagagatcttcattactccacatcagatt3420
aagcacatcaaggactttaaggactataaatactcacatagggtggataagaaacctaat3480
agagagctgatcaacgatacactctactcaacaaggaaagacgacaaaggaaacaccctg3540
attgttaataatctcaatgggctgtatgacaaagataatgacaagctgaagaaactcatc3600
aacaagtccccagaaaagctgctgatgtatcaccacgatccccaaacatatcagaagctg3660
aagctgattatggagcagtatggtgatgagaagaaccctctgtacaagtactatgaagag3720
acagggaactacctcactaagtacagcaagaaagacaacggacccgttatcaagaagatc3780
aagtactacggcaataagctgaacgcccacctggatatcacagatgactatccaaactct3840
aggaacaaagtggtgaaactgtccctgaagccatacagatttgatgtgtatctggataac3900
ggagtctataagttcgtcacagtcaagaacctggacgtcatcaagaaggagaattactat3960
gaagtgaacagcaaatgctacgaggaagccaagaagctcaagaagatttctaaccaggca4020
gagtttatcgcctctttctacaataacgatctgatcaagatcaacggagaactgtacaga4080
gtgatcggcgtgaataatgacctcctgaataggatcgaggttaacatgatcgatatcaca4140
tatcgggagtacctggagaatatgaatgacaagaggcctcccagaattatcaagactatt4200
gccagcaaaacccaatctataaaaaagtactcaacagatatcctggggaacctgtatgag4260
gtgaagtcaaagaagcatccccagattatcaagaaaggcggcagccccaagaagaagagg4320
aaggtgagcagcgactacaaggaccacgacggcgactacaaggaccacgacatcgactac4380
aaggacgacgacgacaagtctggtggttctactaatctgtcagatattattgaaaaggag4440
accggtaagcaactggttatccaggaatccatcctcatgctcccagaggaggtggaagaa4500
gtcattgggaacaagccggaaagcgatatactcgtgcacaccgcctacgacgagagcacc4560
gacgagaatgtcatgcttctgactagcgacgcccctgaatacaagccttgggctctggtc4620
atacaggatagcaacggtgagaacaagattaagatgctctctggtggttctcccaagaag4680
aagaggaaagtctaaccggtcatcatcaccatcaccattgagtttaaacccgctgatcag4740
cctcgactgtgccttctagttgccagccatctgttgtttgcccctcccccgtgccttcct4800
tgaccctggaaggtgccactcccactgtcctttcctaataaaatgaggaaattgcatcgc4860
attgtctgagtaggtgtcattctattctggggggtggggtggggcaggacagcaaggggg4920
aggattgggaagacaatagcaggcatgctggggatgcggtgggctctatggcttctgagg4980
cggaaagaaccagctggggctcgataccgtcgacctctagctagagcttggcgtaatcat5040
ggtcatagctgtttcctgtgtgaaattgttatccgctcacaattccacacaacatacgag5100
ccggaagcataaagtgtaaagcctagggtgcctaatgagtgagctaactcacattaattg5160
cgttgcgctcactgcccgctttccagtcgggaaacctgtcgtgccagctgcattaatgaa5220
tcggccaacgcgcggggagaggcggtttgcgtattgggcgctcttccgcttcctcgctca5280
ctgactcgctgcgctcggtcgttcggctgcggcgagcggtatcagctcactcaaaggcgg5340
taatacggttatccacagaatcaggggataacgcaggaaagaacatgtgagcaaaaggcc5400
agcaaaaggccaggaaccgtaaaaaggccgcgttgctggcgtttttccataggctccgcc5460
cccctgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaacccgacaggac5520
tataaagataccaggcgtttccccctggaagctccctcgtgcgctctcctgttccgaccc5580
tgccgcttaccggatacctgtccgcctttctcccttcgggaagcgtggcgctttctcata5640
gctcacgctgtaggtatctcagttcggtgtaggtcgttcgctccaagctgggctgtgtgc5700
acgaaccccccgttcagcccgaccgctgcgccttatccggtaactatcgtcttgagtcca5760
acccggtaagacacgacttatcgccactggcagcagccactggtaacaggattagcagag5820
cgaggtatgtaggcggtgctacagagttcttgaagtggtggcctaactacggctacacta5880
gaagaacagtatttggtatctgcgctctgctgaagccagttaccttcggaaaaagagttg5940
gtagctcttgatccggcaaacaaaccaccgctggtagcggtggtttttttgtttgcaagc6000
agcagattacgcgcagaaaaaaaggatctcaagaagatcctttgatcttttctacggggt6060
ctgacgctcagtggaacgaaaactcacgttaagggattttggtcatgagattatcaaaaa6120
ggatcttcacctagatccttttaaattaaaaatgaagttttaaatcaatctaaagtatat6180
atgagtaaacttggtctgacagttaccaatgcttaatcagtgaggcacctatctcagcga6240
tctgtctatttcgttcatccatagttgcctgactccccgtcgtgtagataactacgatac6300
gggagggcttaccatctggccccagtgctgcaatgataccgcgagacccacgctcaccgg6360
ctccagatttatcagcaataaaccagccagccggaagggccgagcgcagaagtggtcctg6420
caactttatccgcctccatccagtctattaattgttgccgggaagctagagtaagtagtt6480
cgccagttaatagtttgcgcaacgttgttgccattgctacaggcatcgtggtgtcacgct6540
cgtcgtttggtatggcttcattcagctccggttcccaacgatcaaggcgagttacatgat6600
cccccatgttgtgcaaaaaagcggttagctccttcggtcctccgatcgttgtcagaagta6660
agttggccgcagtgttatcactcatggttatggcagcactgcataattctcttactgtca6720
tgccatccgtaagatgcttttctgtgactggtgagtactcaaccaagtcattctgagaat6780
agtgtatgcggcgaccgagttgctcttgcccggcgtcaatacgggataataccgcgccac6840
atagcagaactttaaaagtgctcatcattggaaaacgttcttcggggcgaaaactctcaa6900
ggatcttaccgctgttgagatccagttcgatgtaacccactcgtgcacccaactgatctt6960
cagcatcttttactttcaccagcgtttctgggtgagcaaaaacaggaaggcaaaatgccg7020
caaaaaagggaataagggcgacacggaaatgttgaatactcatactcttcctttttcaat7080
attattgaagcatttatcagggttattgtctcatgagcggatacatatttgaatgtattt7140
agaaaaataaacaaataggggttccgcgcacatttccccgaaaagtgccacctgacgtcg7200
acggatcgggagatcgatctcccgatcccctagggtcgactctcagtacaatctgctctg7260
atgccgcatagttaagccagtatctgctccctgcttgtgtgttggaggtcgctgagtagt7320
gcgcgagcaaaatttaagctacaacaaggcaaggcttgaccgacaattgcatgaagaatc7380
tgcttagggttaggcgttttgcgctgcttcgcgatgtacgggccagatatacgcgttgac7440
attgattattgactagttattaatagtaatcaattacggggtcattagttcatagcccat7500
atatggagttccgcgttacataacttacggtaaatggcccgcctggctgaccgcccaacg7560
acccccgcccattgacgtcaataatgacgtatgttcccatagtaacgccaatagggactt7620
tccattgacgtcaatgggtggagtatttacggtaaactgcccacttggcagtacatcaag7680
tgtatc7686
<210>2
<211>8670
<212>dna
<213>Artificial sequence
<400>2
atatgccaagtacgccccctattgacgtcaatgacggtaaatggcccgcctggcattatg60
cccagtacatgaccttatgggactttcctacttggcagtacatctacgtattagtcatcg120
ctattaccatggtgatgcggttttggcagtacatcaatgggcgtggatagcggtttgact180
cacggggatttccaagtctccaccccattgacgtcaatgggagtttgttttggcaccaaa240
atcaacgggactttccaaaatgtcgtaacaactccgccccattgacgcaaatgggcggta300
ggcgtgtacggtgggaggtctatataagcagagctggtttagtgaaccgtcagatccgct360
agagatccgcggccgctaatacgactcactatagggagagccgccaccgggcccatgact420
aatctgtcagatattattgaaaaggagaccggtaagcaactggttatccaggaatccatc480
ctcatgctcccagaggaggtggaagaagtcattgggaacaagccggaaagcgatatactc540
gtgcacaccgcctacgacgagagcaccgacgagaatgtcatgcttctgactagcgacgcc600
cctgaatacaagccttgggctctggtcatacaggatagcaacggtgagaacaagattaag660
atgctccccaagaagaagaggaaagtcgagggcagaggaagtctgctaacatgcggtgac720
gtcgaggagaatcctggcccaaccaacctgtccgatatcattgagaaagagaccggcaaa780
cagctggtgatccaggagagcatcctgatgctgcccgaagaggtggaggaagtgatcggc840
aacaagcccgagtccgacatcctggtgcacacagcctatgatgaatccaccgacgagaac900
gtgatgctgctgacctccgatgctcccgagtataaaccctgggcactggtgatccaggac960
tctaatggagagaacaagatcaagatgctgcccaagaagaagaggaaagtcgctactaac1020
ttcagcctgctgaagcaggctggagacgtggaggagaaccctggacctacaaacctcagt1080
gacattatcgagaaggaaacaggaaaacagctcgtcattcaagaatctattcttatgttg1140
cctgaggaagtcgaagaggttattggcaataaacctgaatctgatattcttgtccatacc1200
gcttacgatgagtccacagatgaaaatgttatgctgctcacatctgacgcaccagagtac1260
aaaccatgggcgctcgttattcaagattccaacggcgaaaacaaaatcaaaatgcttccc1320
aagaagaagaggaaagtcgaaggacggggctccctcctgacctgtggcgatgtggaagag1380
aaccccggccccatgagctcagagactggcccagtggctgtggaccccacattgagacgg1440
cggatcgagccccatgagtttgaggtattcttcgatccgagagagctccgcaaggagacc1500
tgcctgctttacgaaattaattgggggggccggcactccatttggcgacatacatcacag1560
aacactaacaagcacgtcgaagtcaacttcatcgagaagttcacgacagaaagatatttc1620
tgtccgaacacaaggtgcagcattacctggtttctcagctggagcccatgcggcgaatgt1680
agtagggccatcactgaattcctgtcaaggtatccccacgtcactctgtttatttacatc1740
gcaaggctgtaccaccacgctgacccccgcaatcgacaaggcctgcgggatttgatctct1800
tcaggtgtgactatccaaattatgactgagcaggagtcaggatactgctggagaaacttt1860
gtgaattatagcccgagtaatgaagcccactggcctaggtatccccatctgtgggtacga1920
ctgtacgttcttgaactgtactgcatcatactgggcctgcctccttgtctcaacattctg1980
agaaggaagcagccacagctgacattctttaccatcgctcttcagtcttgtcattaccag2040
cgactgcccccacacattctctgggccaccgggttgaaaagcggcagcgagactcccggg2100
acctcagagtccgccacacccgaaagtaagcggaactatatcctcgggctggctattggc2160
atcacatctgtcggctatggtataatagactatgaaacaagggacgtgattgacgcaggt2220
gtgaggctgttcaaggaggcaaacgtcgagaacaacgaaggtcggagaagcaagaggggt2280
gcccggaggctgaagaggaggagaaggcacagaatacagcgggtcaagaaactcctgttc2340
gactataacctgctgaccgatcattccgaactgtcaggcatcaatccttacgaagccaga2400
gtcaagggtctgtctcaaaaactctctgaggaagagttttccgcagccctgctgcacctg2460
gctaagaggagaggagtccacaacgtcaatgaggttgaggaggatacagggaacgaactg2520
tctacaaaggaacagatcagccggaatagcaaggccctggaagagaagtacgttgctgaa2580
ctgcagctggaaaggctcaagaaagatggagaggttcggggttccatcaacaggttcaag2640
acatctgactatgtgaaggaagccaagcaactgctcaaggtgcagaaggcctaccatcag2700
ctcgaccagagcttcattgatacttacatagacctgctggagactaggagaacttactac2760
gaagggcctggcgagggcagccctttcggctggaaagatatcaaggagtggtacgagatg2820
ctcatggggcattgcacctacttccccgaagaactgaggtcagtcaagtacgcctacaac2880
gcagacctgtacaacgccctgaatgatctcaacaatctcgtcataactcgggatgaaaac2940
gagaagctggaatattatgagaagttccagattattgaaaatgtgttcaaacagaagaag3000
aaacctaccctgaaacaaattgccaaagagatcctggtgaatgaggaggatatcaaggga3060
tatcgggttacttctaccggcaaaccagagttcacaaatctgaaagtttaccatgacatc3120
aaagatattaccgcaagaaaggagatcatcgagaacgctgagctcctggaccagatcgct3180
aagattctcactatctatcagtccagcgaggatattcaggaagagctgaccaacctgaac3240
tcagagctgactcaggaagaaatcgaacaaatctccaatctgaaaggatacactggtacc3300
cataatctctcactcaaggctatcaatctgatcctggatgaactgtggcatactaacgac3360
aatcagatcgccatcttcaatcggctcaaactggtgcccaaaaaagtggacctgagccaa3420
cagaaagagattcctacaaccctggtggacgatttcattctgagcccagtggttaagcgg3480
agcttcatccaatccatcaaggtgatcaacgctatcatcaagaagtatggcctgcctaat3540
gacataatcattgaactcgcaagggaaaagaatagcaaagatgcccagaagatgataaac3600
gagatgcagaaacggaacagacagactaacgaaagaatcgaggaaataatacggaccact3660
ggtaaggagaacgctaagtatctgatcgagaaaatcaagctgcacgatatgcaggaaggc3720
aagtgcctgtattctctggaggctatacccctggaggatctgctcaataatcctttcaat3780
tacgaggtggatcacatcataccaagatccgtgagctttgacaatagctttaataataag3840
gtgctcgtgaagcaggaggaaaactcaaagaaaggcaacaggaccccattccagtacctg3900
tccagctctgacagcaagattagctacgaaaccttcaagaaacacatcctgaatctggcc3960
aagggcaagggaagaataagcaaaacaaagaaagagtatctcctggaggaaagggacatc4020
aacaggttttcagtgcagaaagattttatcaatcggaatctcgttgacacaagatatgct4080
accagagggctcatgaatctgctcaggtcatactttagggtgaacaacctggatgtgaag4140
gtcaaatccataaatggagggttcacttcctttctcaggagaaaatggaagtttaagaaa4200
gagagaaataagggttacaaacatcacgccgaggacgcactgatcattgccaacgctgac4260
tttatctttaaggaatggaagaagctggacaaagcaaagaaggtgatggagaatcagatg4320
tttgaggaaaagcaggccgagtctatgcctgagattgaaacagagcaggaatacaaagag4380
atcttcattactccacatcagattaagcacatcaaggactttaaggactataaatactca4440
catagggtggataagaaacctaatagagagctgatcaacgatacactctactcaacaagg4500
aaagacgacaaaggaaacaccctgattgttaataatctcaatgggctgtatgacaaagat4560
aatgacaagctgaagaaactcatcaacaagtccccagaaaagctgctgatgtatcaccac4620
gatccccaaacatatcagaagctgaagctgattatggagcagtatggtgatgagaagaac4680
cctctgtacaagtactatgaagagacagggaactacctcactaagtacagcaagaaagac4740
aacggacccgttatcaagaagatcaagtactacggcaataagctgaacgcccacctggat4800
atcacagatgactatccaaactctaggaacaaagtggtgaaactgtccctgaagccatac4860
agatttgatgtgtatctggataacggagtctataagttcgtcacagtcaagaacctggac4920
gtcatcaagaaggagaattactatgaagtgaacagcaaatgctacgaggaagccaagaag4980
ctcaagaagatttctaaccaggcagagtttatcgcctctttctacaataacgatctgatc5040
aagatcaacggagaactgtacagagtgatcggcgtgaataatgacctcctgaataggatc5100
gaggttaacatgatcgatatcacatatcgggagtacctggagaatatgaatgacaagagg5160
cctcccagaattatcaagactattgccagcaaaacccaatctataaaaaagtactcaaca5220
gatatcctggggaacctgtatgaggtgaagtcaaagaagcatccccagattatcaagaaa5280
ggcggcagccccaagaagaagaggaaggtgagcagcgactacaaggaccacgacggcgac5340
tacaaggaccacgacatcgactacaaggacgacgacgacaagtctggtggttctactaat5400
ctgtcagatattattgaaaaggagaccggtaagcaactggttatccaggaatccatcctc5460
atgctcccagaggaggtggaagaagtcattgggaacaagccggaaagcgatatactcgtg5520
cacaccgcctacgacgagagcaccgacgagaatgtcatgcttctgactagcgacgcccct5580
gaatacaagccttgggctctggtcatacaggatagcaacggtgagaacaagattaagatg5640
ctctctggtggttctcccaagaagaagaggaaagtctaaccggtcatcatcaccatcacc5700
attgagtttaaacccgctgatcagcctcgactgtgccttctagttgccagccatctgttg5760
tttgcccctcccccgtgccttccttgaccctggaaggtgccactcccactgtcctttcct5820
aataaaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctggggggtg5880
gggtggggcaggacagcaagggggaggattgggaagacaatagcaggcatgctggggatg5940
cggtgggctctatggcttctgaggcggaaagaaccagctggggctcgataccgtcgacct6000
ctagctagagcttggcgtaatcatggtcatagctgtttcctgtgtgaaattgttatccgc6060
tcacaattccacacaacatacgagccggaagcataaagtgtaaagcctagggtgcctaat6120
gagtgagctaactcacattaattgcgttgcgctcactgcccgctttccagtcgggaaacc6180
tgtcgtgccagctgcattaatgaatcggccaacgcgcggggagaggcggtttgcgtattg6240
ggcgctcttccgcttcctcgctcactgactcgctgcgctcggtcgttcggctgcggcgag6300
cggtatcagctcactcaaaggcggtaatacggttatccacagaatcaggggataacgcag6360
gaaagaacatgtgagcaaaaggccagcaaaaggccaggaaccgtaaaaaggccgcgttgc6420
tggcgtttttccataggctccgcccccctgacgagcatcacaaaaatcgacgctcaagtc6480
agaggtggcgaaacccgacaggactataaagataccaggcgtttccccctggaagctccc6540
tcgtgcgctctcctgttccgaccctgccgcttaccggatacctgtccgcctttctccctt6600
cgggaagcgtggcgctttctcatagctcacgctgtaggtatctcagttcggtgtaggtcg6660
ttcgctccaagctgggctgtgtgcacgaaccccccgttcagcccgaccgctgcgccttat6720
ccggtaactatcgtcttgagtccaacccggtaagacacgacttatcgccactggcagcag6780
ccactggtaacaggattagcagagcgaggtatgtaggcggtgctacagagttcttgaagt6840
ggtggcctaactacggctacactagaagaacagtatttggtatctgcgctctgctgaagc6900
cagttaccttcggaaaaagagttggtagctcttgatccggcaaacaaaccaccgctggta6960
gcggtggtttttttgtttgcaagcagcagattacgcgcagaaaaaaaggatctcaagaag7020
atcctttgatcttttctacggggtctgacgctcagtggaacgaaaactcacgttaaggga7080
ttttggtcatgagattatcaaaaaggatcttcacctagatccttttaaattaaaaatgaa7140
gttttaaatcaatctaaagtatatatgagtaaacttggtctgacagttaccaatgcttaa7200
tcagtgaggcacctatctcagcgatctgtctatttcgttcatccatagttgcctgactcc7260
ccgtcgtgtagataactacgatacgggagggcttaccatctggccccagtgctgcaatga7320
taccgcgagacccacgctcaccggctccagatttatcagcaataaaccagccagccggaa7380
gggccgagcgcagaagtggtcctgcaactttatccgcctccatccagtctattaattgtt7440
gccgggaagctagagtaagtagttcgccagttaatagtttgcgcaacgttgttgccattg7500
ctacaggcatcgtggtgtcacgctcgtcgtttggtatggcttcattcagctccggttccc7560
aacgatcaaggcgagttacatgatcccccatgttgtgcaaaaaagcggttagctccttcg7620
gtcctccgatcgttgtcagaagtaagttggccgcagtgttatcactcatggttatggcag7680
cactgcataattctcttactgtcatgccatccgtaagatgcttttctgtgactggtgagt7740
actcaaccaagtcattctgagaatagtgtatgcggcgaccgagttgctcttgcccggcgt7800
caatacgggataataccgcgccacatagcagaactttaaaagtgctcatcattggaaaac7860
gttcttcggggcgaaaactctcaaggatcttaccgctgttgagatccagttcgatgtaac7920
ccactcgtgcacccaactgatcttcagcatcttttactttcaccagcgtttctgggtgag7980
caaaaacaggaaggcaaaatgccgcaaaaaagggaataagggcgacacggaaatgttgaa8040
tactcatactcttcctttttcaatattattgaagcatttatcagggttattgtctcatga8100
gcggatacatatttgaatgtatttagaaaaataaacaaataggggttccgcgcacatttc8160
cccgaaaagtgccacctgacgtcgacggatcgggagatcgatctcccgatcccctagggt8220
cgactctcagtacaatctgctctgatgccgcatagttaagccagtatctgctccctgctt8280
gtgtgttggaggtcgctgagtagtgcgcgagcaaaatttaagctacaacaaggcaaggct8340
tgaccgacaattgcatgaagaatctgcttagggttaggcgttttgcgctgcttcgcgatg8400
tacgggccagatatacgcgttgacattgattattgactagttattaatagtaatcaatta8460
cggggtcattagttcatagcccatatatggagttccgcgttacataacttacggtaaatg8520
gcccgcctggctgaccgcccaacgacccccgcccattgacgtcaataatgacgtatgttc8580
ccatagtaacgccaatagggactttccattgacgtcaatgggtggagtatttacggtaaa8640
ctgcccacttggcagtacatcaagtgtatc8670
<210>3
<211>8532
<212>dna
<213>Artificial sequence
<400>3
atatgccaagtacgccccctattgacgtcaatgacggtaaatggcccgcctggcattatg60
cccagtacatgaccttatgggactttcctacttggcagtacatctacgtattagtcatcg120
ctattaccatggtgatgcggttttggcagtacatcaatgggcgtggatagcggtttgact180
cacggggatttccaagtctccaccccattgacgtcaatgggagtttgttttggcaccaaa240
atcaacgggactttccaaaatgtcgtaacaactccgccccattgacgcaaatgggcggta300
ggcgtgtacggtgggaggtctatataagcagagctggtttagtgaaccgtcagatccgct360
agagatccgcggccgctaatacgactcactatagggagagccgccaccatgagctcagag420
actggcccagtggctgtggaccccacattgagacggcggatcgagccccatgagtttgag480
gtattcttcgatccgagagagctccgcaaggagacctgcctgctttacgaaattaattgg540
gggggccggcactccatttggcgacatacatcacagaacactaacaagcacgtcgaagtc600
aacttcatcgagaagttcacgacagaaagatatttctgtccgaacacaaggtgcagcatt660
acctggtttctcagctggagcccatgcggcgaatgtagtagggccatcactgaattcctg720
tcaaggtatccccacgtcactctgtttatttacatcgcaaggctgtaccaccacgctgac780
ccccgcaatcgacaaggcctgcgggatttgatctcttcaggtgtgactatccaaattatg840
actgagcaggagtcaggatactgctggagaaactttgtgaattatagcccgagtaatgaa900
gcccactggcctaggtatccccatctgtgggtacgactgtacgttcttgaactgtactgc960
atcatactgggcctgcctccttgtctcaacattctgagaaggaagcagccacagctgaca1020
ttctttaccatcgctcttcagtcttgtcattaccagcgactgcccccacacattctctgg1080
gccaccgggttgaaaagcggcagcgagactcccgggacctcagagtccgccacacccgaa1140
agtgataaaaagtattctattggtttagccatcggcactaattccgttggatgggctgtc1200
ataaccgatgaatacaaagtaccttcaaagaaatttaaggtgttggggaacacagaccgt1260
cattcgattaaaaagaatcttatcggtgccctcctattcgatagtggcgaaacggcagag1320
gcgactcgcctgaaacgaaccgctcggagaaggtatacacgtcgcaagaaccgaatatgt1380
tacttacaagaaatttttagcaatgagatggccaaagttgacgattctttctttcaccgt1440
ttggaagagtccttccttgtcgaagaggacaagaaacatgaacggcaccccatctttgga1500
aacatagtagatgaggtggcatatcatgaaaagtacccaacgatttatcacctcagaaaa1560
aagctagttgactcaactgataaagcggacctgaggttaatctacttggctcttgcccat1620
atgataaagttccgtgggcactttctcattgagggtgatctaaatccggacaactcggat1680
gtcgacaaactgttcatccagttagtacaaacctataatcagttgtttgaagagaaccct1740
ataaatgcaagtggcgtggatgcgaaggctattcttagcgcccgcctctctaaatcccga1800
cggctagaaaacctgatcgcacaattacccggagagaagaaaaatgggttgttcggtaac1860
cttatagcgctctcactaggcctgacaccaaattttaagtcgaacttcgacttagctgaa1920
gatgccaaattgcagcttagtaaggacacgtacgatgacgatctcgacaatctactggca1980
caaattggagatcagtatgcggacttatttttggctgccaaaaaccttagcgatgcaatc2040
ctcctatctgacatactgagagttaatactgagattaccaaggcgccgttatccgcttca2100
atgatcaaaaggtacgatgaacatcaccaagacttgacacttctcaaggccctagtccgt2160
cagcaactgcctgagaaatataaggaaatattctttgatcagtcgaaaaacgggtacgca2220
ggttatattgacggcggagcgagtcaagaggaattctacaagtttatcaaacccatatta2280
gagaagatggatgggacggaagagttgcttgtaaaactcaatcgcgaagatctactgcga2340
aagcagcggactttcgacaacggtagcattccacatcaaatccacttaggcgaattgcat2400
gctatacttagaaggcaggaggatttttatccgttcctcaaagacaatcgtgaaaagatt2460
gagaaaatcctaacctttcgcataccttactatgtgggacccctggcccgagggaactct2520
cggttcgcatggatgacaagaaagtccgaagaaacgattactccatggaattttgaggaa2580
gttgtcgataaaggtgcgtcagctcaatcgttcatcgagaggatgaccaactttgacaag2640
aatttaccgaacgaaaaagtattgcctaagcacagtttactttacgagtatttcacagtg2700
tacaatgaactcacgaaagttaagtatgtcactgagggcatgcgtaaacccgcctttcta2760
agcggagaacagaagaaagcaatagtagatctgttattcaagaccaaccgcaaagtgaca2820
gttaagcaattgaaagaggactactttaagaaaattgaatgcttcgattctgtcgagatc2880
tccggggtagaagatcgatttaatgcgtcacttggtacgtatcatgacctcctaaagata2940
attaaagataaggacttcctggataacgaagagaatgaagatatcttagaagatatagtg3000
ttgactcttaccctctttgaagatcgggaaatgattgaggaaagactaaaaacatacgct3060
cacctgttcgacgataaggttatgaaacagttaaagaggcgtcgctatacgggctgggga3120
cgattgtcgcggaaacttatcaacgggataagagacaagcaaagtggtaaaactattctc3180
gattttctaaagagcgacggcttcgccaataggaactttatgcagctgatccatgatgac3240
tctttaaccttcaaagaggatatacaaaaggcacaggtttccggacaaggggactcattg3300
cacgaacatattgcgaatcttgctggttcgccagccatcaaaaagggcatactccagaca3360
gtcaaagtagtggatgagctagttaaggtcatgggacgtcacaaaccggaaaacattgta3420
atcgagatggcacgcgaaaatcaaacgactcagaaggggcaaaaaaacagtcgagagcgg3480
atgaagagaatagaagagggtattaaagaactgggcagccagatcttaaaggagcatcct3540
gtggaaaatacccaattgcagaacgagaaactttacctctattacctacaaaatggaagg3600
gacatgtatgttgatcaggaactggacataaaccgtttatctgattacgacgtcgatcac3660
attgtaccccaatcctttttgaaggacgattcaatcgacaataaagtgcttacacgctcg3720
gataagaaccgagggaaaagtgacaatgttccaagcgaggaagtcgtaaagaaaatgaag3780
aactattggcggcagctcctaaatgcgaaactgataacgcaaagaaagttcgataactta3840
actaaagctgagaggggtggcttgtctgaacttgacaaggccggatttattaaacgtcag3900
ctcgtggaaacccgccaaatcacaaagcatgttgcacagatactagattcccgaatgaat3960
acgaaatacgacgagaacgataagctgattcgggaagtcaaagtaatcactttaaagtca4020
aaattggtgtcggacttcagaaaggattttcaattctataaagttagggagataaataac4080
taccaccatgcgcacgacgcttatcttaatgccgtcgtagggaccgcactcattaagaaa4140
tacccgaagctagaaagtgagtttgtgtatggtgattacaaagtttatgacgtccgtaag4200
atgatcgcgaaaagcgaacaggagataggcaaggctacagccaaatacttcttttattct4260
aacattatgaatttctttaagacggaaatcactctggcaaacggagagatacgcaaacga4320
cctttaattgaaaccaatggggagacaggtgaaatcgtatgggataagggccgggacttc4380
gcgacggtgagaaaagttttgtccatgccccaagtcaacatagtaaagaaaactgaggtg4440
cagaccggagggttttcaaaggaatcgattcttccaaaaaggaatagtgataagctcatc4500
gctcgtaaaaaggactgggacccgaaaaagtacggtggcttcgatagccctacagttgcc4560
tattctgtcctagtagtggcaaaagttgagaagggaaaatccaagaaactgaagtcagtc4620
aaagaattattggggataacgattatggagcgctcgtcttttgaaaagaaccccatcgac4680
ttccttgaggcgaaaggttacaaggaagtaaaaaaggatctcataattaaactaccaaag4740
tatagtctgtttgagttagaaaatggccgaaaacggatgttggctagcgccggagagctt4800
caaaaggggaacgaactcgcactaccgtctaaatacgtgaatttcctgtatttagcgtcc4860
cattacgagaagttgaaaggttcacctgaagataacgaacagaagcaactttttgttgag4920
cagcacaaacattatctcgacgaaatcatagagcaaatttcggaattcagtaagagagtc4980
atcctagctgatgccaatctggacaaagtattaagcgcatacaacaagcacagggataaa5040
cccatacgtgagcaggcggaaaatattatccatttgtttactcttaccaacctcggcgct5100
ccagccgcattcaagtattttgacacaacgatagatcgcaaacgatacacttctaccaag5160
gaggtgctagacgcgacactgattcaccaatccatcacgggattatatgaaactcggata5220
gatttgtcacagcttgggggtgactctggtggttctactaatctgtcagatattattgaa5280
aaggagaccggtaagcaactggttatccaggaatccatcctcatgctcccagaggaggtg5340
gaagaagtcattgggaacaagccggaaagcgatatactcgtgcacaccgcctacgacgag5400
agcaccgacgagaatgtcatgcttctgactagcgacgcccctgaatacaagccttgggct5460
ctggtcatacaggatagcaacggtgagaacaagattaagatgctctctggtggttctccc5520
aagaagaagaggaaagtctaaccggtcatcatcaccatcaccattgagtttaaacccgct5580
gatcagcctcgactgtgccttctagttgccagccatctgttgtttgcccctcccccgtgc5640
cttccttgaccctggaaggtgccactcccactgtcctttcctaataaaatgaggaaattg5700
catcgcattgtctgagtaggtgtcattctattctggggggtggggtggggcaggacagca5760
agggggaggattgggaagacaatagcaggcatgctggggatgcggtgggctctatggctt5820
ctgaggcggaaagaaccagctggggctcgataccgtcgacctctagctagagcttggcgt5880
aatcatggtcatagctgtttcctgtgtgaaattgttatccgctcacaattccacacaaca5940
tacgagccggaagcataaagtgtaaagcctagggtgcctaatgagtgagctaactcacat6000
taattgcgttgcgctcactgcccgctttccagtcgggaaacctgtcgtgccagctgcatt6060
aatgaatcggccaacgcgcggggagaggcggtttgcgtattgggcgctcttccgcttcct6120
cgctcactgactcgctgcgctcggtcgttcggctgcggcgagcggtatcagctcactcaa6180
aggcggtaatacggttatccacagaatcaggggataacgcaggaaagaacatgtgagcaa6240
aaggccagcaaaaggccaggaaccgtaaaaaggccgcgttgctggcgtttttccataggc6300
tccgcccccctgacgagcatcacaaaaatcgacgctcaagtcagaggtggcgaaacccga6360
caggactataaagataccaggcgtttccccctggaagctccctcgtgcgctctcctgttc6420
cgaccctgccgcttaccggatacctgtccgcctttctcccttcgggaagcgtggcgcttt6480
ctcatagctcacgctgtaggtatctcagttcggtgtaggtcgttcgctccaagctgggct6540
gtgtgcacgaaccccccgttcagcccgaccgctgcgccttatccggtaactatcgtcttg6600
agtccaacccggtaagacacgacttatcgccactggcagcagccactggtaacaggatta6660
gcagagcgaggtatgtaggcggtgctacagagttcttgaagtggtggcctaactacggct6720
acactagaagaacagtatttggtatctgcgctctgctgaagccagttaccttcggaaaaa6780
gagttggtagctcttgatccggcaaacaaaccaccgctggtagcggtggtttttttgttt6840
gcaagcagcagattacgcgcagaaaaaaaggatctcaagaagatcctttgatcttttcta6900
cggggtctgacgctcagtggaacgaaaactcacgttaagggattttggtcatgagattat6960
caaaaaggatcttcacctagatccttttaaattaaaaatgaagttttaaatcaatctaaa7020
gtatatatgagtaaacttggtctgacagttaccaatgcttaatcagtgaggcacctatct7080
cagcgatctgtctatttcgttcatccatagttgcctgactccccgtcgtgtagataacta7140
cgatacgggagggcttaccatctggccccagtgctgcaatgataccgcgagacccacgct7200
caccggctccagatttatcagcaataaaccagccagccggaagggccgagcgcagaagtg7260
gtcctgcaactttatccgcctccatccagtctattaattgttgccgggaagctagagtaa7320
gtagttcgccagttaatagtttgcgcaacgttgttgccattgctacaggcatcgtggtgt7380
cacgctcgtcgtttggtatggcttcattcagctccggttcccaacgatcaaggcgagtta7440
catgatcccccatgttgtgcaaaaaagcggttagctccttcggtcctccgatcgttgtca7500
gaagtaagttggccgcagtgttatcactcatggttatggcagcactgcataattctctta7560
ctgtcatgccatccgtaagatgcttttctgtgactggtgagtactcaaccaagtcattct7620
gagaatagtgtatgcggcgaccgagttgctcttgcccggcgtcaatacgggataataccg7680
cgccacatagcagaactttaaaagtgctcatcattggaaaacgttcttcggggcgaaaac7740
tctcaaggatcttaccgctgttgagatccagttcgatgtaacccactcgtgcacccaact7800
gatcttcagcatcttttactttcaccagcgtttctgggtgagcaaaaacaggaaggcaaa7860
atgccgcaaaaaagggaataagggcgacacggaaatgttgaatactcatactcttccttt7920
ttcaatattattgaagcatttatcagggttattgtctcatgagcggatacatatttgaat7980
gtatttagaaaaataaacaaataggggttccgcgcacatttccccgaaaagtgccacctg8040
acgtcgacggatcgggagatcgatctcccgatcccctagggtcgactctcagtacaatct8100
gctctgatgccgcatagttaagccagtatctgctccctgcttgtgtgttggaggtcgctg8160
agtagtgcgcgagcaaaatttaagctacaacaaggcaaggcttgaccgacaattgcatga8220
agaatctgcttagggttaggcgttttgcgctgcttcgcgatgtacgggccagatatacgc8280
gttgacattgattattgactagttattaatagtaatcaattacggggtcattagttcata8340
gcccatatatggagttccgcgttacataacttacggtaaatggcccgcctggctgaccgc8400
ccaacgacccccgcccattgacgtcaataatgacgtatgttcccatagtaacgccaatag8460
ggactttccattgacgtcaatgggtggagtatttacggtaaactgcccacttggcagtac8520
atcaagtgtatc8532
<210>4
<211>9516
<212>DNA
<213>Artificial Sequence
<400>4
atatgccaagtacgccccctattgacgtcaatgacggtaaatggcccgcctggcattatg60
cccagtacatgaccttatgggactttcctacttggcagtacatctacgtattagtcatcg120
ctattaccatggtgatgcggttttggcagtacatcaatgggcgtggatagcggtttgact180
cacggggatttccaagtctccaccccattgacgtcaatgggagtttgttttggcaccaaa240
atcaacgggactttccaaaatgtcgtaacaactccgccccattgacgcaaatgggcggta300
ggcgtgtacggtgggaggtctatataagcagagctggtttagtgaaccgtcagatccgct360
agagatccgcggccgctaatacgactcactatagggagagccgccaccgggcccatgact420
aatctgtcagatattattgaaaaggagaccggtaagcaactggttatccaggaatccatc480
ctcatgctcccagaggaggtggaagaagtcattgggaacaagccggaaagcgatatactc540
gtgcacaccgcctacgacgagagcaccgacgagaatgtcatgcttctgactagcgacgcc600
cctgaatacaagccttgggctctggtcatacaggatagcaacggtgagaacaagattaag660
atgctccccaagaagaagaggaaagtcgagggcagaggaagtctgctaacatgcggtgac720
gtcgaggagaatcctggcccaaccaacctgtccgatatcattgagaaagagaccggcaaa780
cagctggtgatccaggagagcatcctgatgctgcccgaagaggtggaggaagtgatcggc840
aacaagcccgagtccgacatcctggtgcacacagcctatgatgaatccaccgacgagaac900
gtgatgctgctgacctccgatgctcccgagtataaaccctgggcactggtgatccaggac960
tctaatggagagaacaagatcaagatgctgcccaagaagaagaggaaagtcgctactaac1020
ttcagcctgctgaagcaggctggagacgtggaggagaaccctggacctacaaacctcagt1080
gacattatcgagaaggaaacaggaaaacagctcgtcattcaagaatctattcttatgttg1140
cctgaggaagtcgaagaggttattggcaataaacctgaatctgatattcttgtccatacc1200
gcttacgatgagtccacagatgaaaatgttatgctgctcacatctgacgcaccagagtac1260
aaaccatgggcgctcgttattcaagattccaacggcgaaaacaaaatcaaaatgcttccc1320
aagaagaagaggaaagtcgaaggacggggctccctcctgacctgtggcgatgtggaagag1380
aaccccggccccatgagctcagagactggcccagtggctgtggaccccacattgagacgg1440
cggatcgagccccatgagtttgaggtattcttcgatccgagagagctccgcaaggagacc1500
tgcctgctttacgaaattaattgggggggccggcactccatttggcgacatacatcacag1560
aacactaacaagcacgtcgaagtcaacttcatcgagaagttcacgacagaaagatatttc1620
tgtccgaacacaaggtgcagcattacctggtttctcagctggagcccatgcggcgaatgt1680
agtagggccatcactgaattcctgtcaaggtatccccacgtcactctgtttatttacatc1740
gcaaggctgtaccaccacgctgacccccgcaatcgacaaggcctgcgggatttgatctct1800
tcaggtgtgactatccaaattatgactgagcaggagtcaggatactgctggagaaacttt1860
gtgaattatagcccgagtaatgaagcccactggcctaggtatccccatctgtgggtacga1920
ctgtacgttcttgaactgtactgcatcatactgggcctgcctccttgtctcaacattctg1980
agaaggaagcagccacagctgacattctttaccatcgctcttcagtcttgtcattaccag2040
cgactgcccccacacattctctgggccaccgggttgaaaagcggcagcgagactcccggg2100
acctcagagtccgccacacccgaaagtgataaaaagtattctattggtttagccatcggc2160
actaattccgttggatgggctgtcataaccgatgaatacaaagtaccttcaaagaaattt2220
aaggtgttggggaacacagaccgtcattcgattaaaaagaatcttatcggtgccctccta2280
ttcgatagtggcgaaacggcagaggcgactcgcctgaaacgaaccgctcggagaaggtat2340
acacgtcgcaagaaccgaatatgttacttacaagaaatttttagcaatgagatggccaaa2400
gttgacgattctttctttcaccgtttggaagagtccttccttgtcgaagaggacaagaaa2460
catgaacggcaccccatctttggaaacatagtagatgaggtggcatatcatgaaaagtac2520
ccaacgatttatcacctcagaaaaaagctagttgactcaactgataaagcggacctgagg2580
ttaatctacttggctcttgcccatatgataaagttccgtgggcactttctcattgagggt2640
gatctaaatccggacaactcggatgtcgacaaactgttcatccagttagtacaaacctat2700
aatcagttgtttgaagagaaccctataaatgcaagtggcgtggatgcgaaggctattctt2760
agcgcccgcctctctaaatcccgacggctagaaaacctgatcgcacaattacccggagag2820
aagaaaaatgggttgttcggtaaccttatagcgctctcactaggcctgacaccaaatttt2880
aagtcgaacttcgacttagctgaagatgccaaattgcagcttagtaaggacacgtacgat2940
gacgatctcgacaatctactggcacaaattggagatcagtatgcggacttatttttggct3000
gccaaaaaccttagcgatgcaatcctcctatctgacatactgagagttaatactgagatt3060
accaaggcgccgttatccgcttcaatgatcaaaaggtacgatgaacatcaccaagacttg3120
acacttctcaaggccctagtccgtcagcaactgcctgagaaatataaggaaatattcttt3180
gatcagtcgaaaaacgggtacgcaggttatattgacggcggagcgagtcaagaggaattc3240
tacaagtttatcaaacccatattagagaagatggatgggacggaagagttgcttgtaaaa3300
ctcaatcgcgaagatctactgcgaaagcagcggactttcgacaacggtagcattccacat3360
caaatccacttaggcgaattgcatgctatacttagaaggcaggaggatttttatccgttc3420
ctcaaagacaatcgtgaaaagattgagaaaatcctaacctttcgcataccttactatgtg3480
ggacccctggcccgagggaactctcggttcgcatggatgacaagaaagtccgaagaaacg3540
attactccatggaattttgaggaagttgtcgataaaggtgcgtcagctcaatcgttcatc3600
gagaggatgaccaactttgacaagaatttaccgaacgaaaaagtattgcctaagcacagt3660
ttactttacgagtatttcacagtgtacaatgaactcacgaaagttaagtatgtcactgag3720
ggcatgcgtaaacccgcctttctaagcggagaacagaagaaagcaatagtagatctgtta3780
ttcaagaccaaccgcaaagtgacagttaagcaattgaaagaggactactttaagaaaatt3840
gaatgcttcgattctgtcgagatctccggggtagaagatcgatttaatgcgtcacttggt3900
acgtatcatgacctcctaaagataattaaagataaggacttcctggataacgaagagaat3960
gaagatatcttagaagatatagtgttgactcttaccctctttgaagatcgggaaatgatt4020
gaggaaagactaaaaacatacgctcacctgttcgacgataaggttatgaaacagttaaag4080
aggcgtcgctatacgggctggggacgattgtcgcggaaacttatcaacgggataagagac4140
aagcaaagtggtaaaactattctcgattttctaaagagcgacggcttcgccaataggaac4200
tttatgcagctgatccatgatgactctttaaccttcaaagaggatatacaaaaggcacag4260
gtttccggacaaggggactcattgcacgaacatattgcgaatcttgctggttcgccagcc4320
atcaaaaagggcatactccagacagtcaaagtagtggatgagctagttaaggtcatggga4380
cgtcacaaaccggaaaacattgtaatcgagatggcacgcgaaaatcaaacgactcagaag4440
gggcaaaaaaacagtcgagagcggatgaagagaatagaagagggtattaaagaactgggc4500
agccagatcttaaaggagcatcctgtggaaaatacccaattgcagaacgagaaactttac4560
ctctattacctacaaaatggaagggacatgtatgttgatcaggaactggacataaaccgt4620
ttatctgattacgacgtcgatcacattgtaccccaatcctttttgaaggacgattcaatc4680
gacaataaagtgcttacacgctcggataagaaccgagggaaaagtgacaatgttccaagc4740
gaggaagtcgtaaagaaaatgaagaactattggcggcagctcctaaatgcgaaactgata4800
acgcaaagaaagttcgataacttaactaaagctgagaggggtggcttgtctgaacttgac4860
aaggccggatttattaaacgtcagctcgtggaaacccgccaaatcacaaagcatgttgca4920
cagatactagattcccgaatgaatacgaaatacgacgagaacgataagctgattcgggaa4980
gtcaaagtaatcactttaaagtcaaaattggtgtcggacttcagaaaggattttcaattc5040
tataaagttagggagataaataactaccaccatgcgcacgacgcttatcttaatgccgtc5100
gtagggaccgcactcattaagaaatacccgaagctagaaagtgagtttgtgtatggtgat5160
tacaaagtttatgacgtccgtaagatgatcgcgaaaagcgaacaggagataggcaaggct5220
acagccaaatacttcttttattctaacattatgaatttctttaagacggaaatcactctg5280
gcaaacggagagatacgcaaacgacctttaattgaaaccaatggggagacaggtgaaatc5340
gtatgggataagggccgggacttcgcgacggtgagaaaagttttgtccatgccccaagtc5400
aacatagtaaagaaaactgaggtgcagaccggagggttttcaaaggaatcgattcttcca5460
aaaaggaatagtgataagctcatcgctcgtaaaaaggactgggacccgaaaaagtacggt5520
ggcttcgatagccctacagttgcctattctgtcctagtagtggcaaaagttgagaaggga5580
aaatccaagaaactgaagtcagtcaaagaattattggggataacgattatggagcgctcg5640
tcttttgaaaagaaccccatcgacttccttgaggcgaaaggttacaaggaagtaaaaaag5700
gatctcataattaaactaccaaagtatagtctgtttgagttagaaaatggccgaaaacgg5760
atgttggctagcgccggagagcttcaaaaggggaacgaactcgcactaccgtctaaatac5820
gtgaatttcctgtatttagcgtcccattacgagaagttgaaaggttcacctgaagataac5880
gaacagaagcaactttttgttgagcagcacaaacattatctcgacgaaatcatagagcaa5940
atttcggaattcagtaagagagtcatcctagctgatgccaatctggacaaagtattaagc6000
gcatacaacaagcacagggataaacccatacgtgagcaggcggaaaatattatccatttg6060
tttactcttaccaacctcggcgctccagccgcattcaagtattttgacacaacgatagat6120
cgcaaacgatacacttctaccaaggaggtgctagacgcgacactgattcaccaatccatc6180
acgggattatatgaaactcggatagatttgtcacagcttgggggtgactctggtggttct6240
actaatctgtcagatattattgaaaaggagaccggtaagcaactggttatccaggaatcc6300
atcctcatgctcccagaggaggtggaagaagtcattgggaacaagccggaaagcgatata6360
ctcgtgcacaccgcctacgacgagagcaccgacgagaatgtcatgcttctgactagcgac6420
gcccctgaatacaagccttgggctctggtcatacaggatagcaacggtgagaacaagatt6480
aagatgctctctggtggttctcccaagaagaagaggaaagtctaaccggtcatcatcacc6540
atcaccattgagtttaaacccgctgatcagcctcgactgtgccttctagttgccagccat6600
ctgttgtttgcccctcccccgtgccttccttgaccctggaaggtgccactcccactgtcc6660
tttcctaataaaatgaggaaattgcatcgcattgtctgagtaggtgtcattctattctgg6720
ggggtggggtggggcaggacagcaagggggaggattgggaagacaatagcaggcatgctg6780
gggatgcggtgggctctatggcttctgaggcggaaagaaccagctggggctcgataccgt6840
cgacctctagctagagcttggcgtaatcatggtcatagctgtttcctgtgtgaaattgtt6900
atccgctcacaattccacacaacatacgagccggaagcataaagtgtaaagcctagggtg6960
cctaatgagtgagctaactcacattaattgcgttgcgctcactgcccgctttccagtcgg7020
gaaacctgtcgtgccagctgcattaatgaatcggccaacgcgcggggagaggcggtttgc7080
gtattgggcgctcttccgcttcctcgctcactgactcgctgcgctcggtcgttcggctgc7140
ggcgagcggtatcagctcactcaaaggcggtaatacggttatccacagaatcaggggata7200
acgcaggaaagaacatgtgagcaaaaggccagcaaaaggccaggaaccgtaaaaaggccg7260
cgttgctggcgtttttccataggctccgcccccctgacgagcatcacaaaaatcgacgct7320
caagtcagaggtggcgaaacccgacaggactataaagataccaggcgtttccccctggaa7380
gctccctcgtgcgctctcctgttccgaccctgccgcttaccggatacctgtccgcctttc7440
tcccttcgggaagcgtggcgctttctcatagctcacgctgtaggtatctcagttcggtgt7500
aggtcgttcgctccaagctgggctgtgtgcacgaaccccccgttcagcccgaccgctgcg7560
ccttatccggtaactatcgtcttgagtccaacccggtaagacacgacttatcgccactgg7620
cagcagccactggtaacaggattagcagagcgaggtatgtaggcggtgctacagagttct7680
tgaagtggtggcctaactacggctacactagaagaacagtatttggtatctgcgctctgc7740
tgaagccagttaccttcggaaaaagagttggtagctcttgatccggcaaacaaaccaccg7800
ctggtagcggtggtttttttgtttgcaagcagcagattacgcgcagaaaaaaaggatctc7860
aagaagatcctttgatcttttctacggggtctgacgctcagtggaacgaaaactcacgtt7920
aagggattttggtcatgagattatcaaaaaggatcttcacctagatccttttaaattaaa7980
aatgaagttttaaatcaatctaaagtatatatgagtaaacttggtctgacagttaccaat8040
gcttaatcagtgaggcacctatctcagcgatctgtctatttcgttcatccatagttgcct8100
gactccccgtcgtgtagataactacgatacgggagggcttaccatctggccccagtgctg8160
caatgataccgcgagacccacgctcaccggctccagatttatcagcaataaaccagccag8220
ccggaagggccgagcgcagaagtggtcctgcaactttatccgcctccatccagtctatta8280
attgttgccgggaagctagagtaagtagttcgccagttaatagtttgcgcaacgttgttg8340
ccattgctacaggcatcgtggtgtcacgctcgtcgtttggtatggcttcattcagctccg8400
gttcccaacgatcaaggcgagttacatgatcccccatgttgtgcaaaaaagcggttagct8460
ccttcggtcctccgatcgttgtcagaagtaagttggccgcagtgttatcactcatggtta8520
tggcagcactgcataattctcttactgtcatgccatccgtaagatgcttttctgtgactg8580
gtgagtactcaaccaagtcattctgagaatagtgtatgcggcgaccgagttgctcttgcc8640
cggcgtcaatacgggataataccgcgccacatagcagaactttaaaagtgctcatcattg8700
gaaaacgttcttcggggcgaaaactctcaaggatcttaccgctgttgagatccagttcga8760
tgtaacccactcgtgcacccaactgatcttcagcatcttttactttcaccagcgtttctg8820
ggtgagcaaaaacaggaaggcaaaatgccgcaaaaaagggaataagggcgacacggaaat8880
gttgaatactcatactcttcctttttcaatattattgaagcatttatcagggttattgtc8940
tcatgagcggatacatatttgaatgtatttagaaaaataaacaaataggggttccgcgca9000
catttccccgaaaagtgccacctgacgtcgacggatcgggagatcgatctcccgatcccc9060
tagggtcgactctcagtacaatctgctctgatgccgcatagttaagccagtatctgctcc9120
ctgcttgtgtgttggaggtcgctgagtagtgcgcgagcaaaatttaagctacaacaaggc9180
aaggcttgaccgacaattgcatgaagaatctgcttagggttaggcgttttgcgctgcttc9240
gcgatgtacgggccagatatacgcgttgacattgattattgactagttattaatagtaat9300
caattacggggtcattagttcatagcccatatatggagttccgcgttacataacttacgg9360
taaatggcccgcctggctgaccgcccaacgacccccgcccattgacgtcaataatgacgt9420
atgttcccatagtaacgccaatagggactttccattgacgtcaatgggtggagtatttac9480
ggtaaactgcccacttggcagtacatcaagtgtatc9516