高温双链dna噬菌体gbsv1的全基因组序列的制作方法

文档序号:3477069阅读:361来源:国知局
专利名称:高温双链dna噬菌体gbsv1的全基因组序列的制作方法
技术领域
本发明属于生物技术领域,特别是涉及一种高温双链DNA噬菌体GBSV1的全基因组序列。
背景技术
近年来,感染嗜热微生物的高温噬菌体或病毒引起了越来越多科研工作者的关注。因为,它们可以作为一种模式系统来研究热液区生物的生物化学及分子生物学特性,而且它们还影响生物地球化学和生态系统进化过程,包括营养循环、生物分类、生物种群结构、遗传转换、生物进化等。另外,高温噬菌体在构建人工载体和克隆各种分子生物学工具酶方面具有广阔的应用前景。

发明内容
本发明的目的是采取梯度离心纯化、基因组提取和核酸序列测定等技术,对分离、纯化的高温双链DNA噬菌体GBSV1进行全基因组序列测定及序列分析,测定了34579bp的核苷酸序列,为进一步深入研究、构建人工载体和克隆各种分子生物学工具酶奠定基础。
本发明提供高温双链DNA噬菌体GBSV1的全基因组核苷酸序列及氨基酸序列功能基因氨基酸序列,如表1高温双链DNA噬菌体GBSV1的全基因组序列,SEQ NO.1将噬菌体GBSV1基因组文库中的DNA结合、重组、复制等功能基因进行体外克隆表达,可用于基因重组、基因治疗。
根据噬菌体GBSV1的全基因组核苷酸序列,可以构建人工载体,建立高温表达系统,研究热液区生物的生物化学及分子生物学特性。
本发明的优点和积极效果在于分离、提取并测定了中国第一株高温双链DNA噬菌体GBSV1基因组的核苷酸序列。本发明可根据全基因组核苷酸序列构建人工载体和克隆各种分子生物学工具酶以及基因工程的研究和开发。


图1是高温双链DNA噬菌体GBSV1基因组图谱。
具体实施例方式
1、噬菌体的分离与纯化1)200ml噬菌体裂解液加入酶抑制剂苯甲基磺酰氟PMSF使其中浓度为1mM,15000×g,20min离心。
2)取上清加入NaCl,使其终浓度为1M;加入聚乙二醇PEG6000,使其终浓度为10%,4℃放置1-3h。
3)15000×g,4℃离心1h。
4)去上清,用2mL缓冲剂SM贮液重悬沉淀,3000×g 15min离心去沉淀。
5)取上清,在氯化铯溶液(1.5g/ml)中,220,000×g,4℃离心24h6)用一次性针筒收集噬菌体。
2、噬菌体基因组的提取1)取噬菌体颗粒的缓冲剂TNE溶解液,按照10uL/mL加入10mg/mlRNase,37℃温浴30min,75℃温浴10min。
2)加入蛋白酶Protein K使其终浓度为50ug/mL,10%十二烷基肌氨酸终浓度为0.5%,10%SDS终浓度为0.5%,56℃温浴1h。
3)加入同体积的苯酚,混匀,12000r/min离心5min;取上清,加入同体积的苯酚-氯仿,混匀,12000r/min离心5min;取上清,加入同体积的氯仿-异戊醇,混匀,12000r/min离心5min;取上清,加入2倍体积的无水乙醇,静置30min。
4)18000×g离心10min,去上清,风干,用30-100uL无菌水溶解基因组。
3、噬菌体基因组核苷酸序列的测定、拼接及分析1)基因组核苷酸序列采用Mega-BACE 1000测序仪进行测序,具体操作由杭州华大基因研发中心完成。
2)序列拼接采用的是PHRED/PHRAP软件包。首先用PHRED(Q13标准)去除序列两端的低质量序列,然后用cross_match屏蔽载体序列,选取将屏蔽载体后有效长度大于100bp的序列作为有效序列,将有效序列用PHRAP进行拼接。用Consed进行引物设计,用GCskewl.1.pl、PlasimdCircle50.pl画图。
3)用拼接得到的全序列(fasta格式)跑glimmer程序得到ORF序列。
4)用ORF序列跟NR库跑blastx进行功能分析。
序列表表1高温双链DNA噬菌体GBSV1的全基因组序列,SEQ NO.11 AACACAAGGG AGGTAATCAC TGATGAATCT TGATCGATTT GCTTACGGGT TGCGCGATCC61 ACAATCATAT CCAACAGTCG GTGAGTGCCG TCACTGCGGT GCTGAACTAT ATAAGGGATG121TGAAGCAATT CAATTTGAAG GTGATTTGTT CTGTGACACC GTCTGCTTGG GTGAGCATCT181TATCGAAACT ACTGATTTTG ACGAGGTGAT CCTATGAAGC CGATCATCCT AGCAGAGACG241GCGAACATGG ATCATCTTGA ATGGTTGCGG TTACGTCAGA AAGGCATCGG CGGCTCGGAT301GCGGCCGCCA TTGCCGGCCT TAACAAATAT AAAAGTCCGA TCCAGGTGTA TTACGAAAAA361GTGGAAGGTG TGAAGGAATC GGTGCCGAGT GAAGCAGCCT ACTGGGGGAC GATCCTGGAG421GATATCGTCG CACAGGAATT CAGCCGGCGA ACAGGCCTGA AAGTCCGTCG TCGGAACGCG481ATCTTGCGAC ATCCCGAATA TCCGTTCATG ATTGCCAATA TCGATCGGCT CGTTGTTGGC541GAAGATGCCG GATTGGAATG CAAAACGGCG AGCGAATACC TCAAGGACGA ATGGGTGGAA601GGCGAAAAGA TTCCGGATCA ATATTTCATC CAATGCCAAC ACTATATGGC CGTCACGGGG661CGCTCGAGGT GGTACATCGC GGTCTTAATT GGAGGCAATA AGTTTCGCTG GGATGTCATT721GAAAGGGATG AGGACATCAT CCGATATCTC ATCGAGATTG AGTCAGAATT TTGGCAGCGC781GTCATAGAAA AACGGCCTCC GGAAGTGGAC GGAAGCGAAG CAAGCACAAA TCTATTGAAC841CTCTTGTATC CGGTGGAGTC TGTCGTGGAT GACGAAGCCG AACTGCCTAG AGAAGTGGAG901GCGCTCATCG CAGAATTAGA GGCAGTGAAC GCGGAGATCA AGCAGAAGAG CGAGAGAAAG961GCGGAGATTG AGAACAAAAT CAAAGCTCTG TTGGGGGAAC GTGAGCGTGG CCGTACGAGT1021 GAATACGTTG TGAAATGGTC GGTCGTGAAT TCGAACCAGT TCGATTCCAA GAGGTTTGCA1081 AAAGAACACC CGGATTTGTA TCAGCAATAC ATCCAAACGT CGACGTATCG ACGTTTCAGC1141 ATCTCTAAAA ACAAATCGAG AAAGGCGGTA TCACAATGAC TCAAGCGGAG AAACTGAAAA1201 ACCAATTGGC AGCGAAAGCG AACGGAAACG GCCAGGCGGC CAAGAAGCAG GACGGCGGCA1261 AAGTCACCGT CGCCGATCTC CTGCAAAAGA TGAAGCCGGA ACTCGAACGG GCGTTGCCGA1321 AGCATCTTGA TGCCGATCGG TTGATTCGCA TCGCCATGAC GGAAATGCGC CGAAATCCTG1381 AACTGCTGTC ATGCGAAATC AAGAGCCTGC TCGGCGCGAT CATGCAGGCG GCTCAATTGG1441 GCCTTGAGCC GGGCTTGCTC GGACATTGCT ACCTCATCCC ATTCAAAAAT CGGAAAAACG1501 GAACAAAAGA AGTGCAATTC GTGATCGGCT ATAAAGGGCT GATCGACTTA GTTCGCCGAT1561 CCGGCGAGGT TGAAACGATT AAAGCCGAAG CAGTCTATGA AAACGATGAA TTTGAATTTG1621 AATACGGACT TGATGAACGT TTACGTCACA AGCCGTTGCT GTTCGGTGAC CGCGGAAAAT1681 TAATTGGGTT CTATGCGTAC GCCAAGTTCA AAGACGGCGG ACATGCGTTT CACGTTATGT1741 CGGTCGAAGA AATCAATCGC CTTCGTGACA AATACTCTAG GGCGAAAGAG TCCGGGCCGT1801 GGCGTGAGGA ATACGAAGCA ATGGCCAAGA AAACGGTCAT CCGGCAGTTA ATCAAGTATT1861 TGCCGATCTC GATTGAAATT CAGCGCAACA TCTCGCTCGA TGAAACGGTT CGCAAGGACA1921 TCCATGACGA ACCGGAGCAA GTGGACTATA TTGACATGGA AGTTGAGGCG ATTGAAGGAG1981 AGGTTATCGA TGGGGATTCG AGCGAACAGG AGACAGCGGC GGAACAGCAG GAGCTTTTTA2041 ACTAGGGGGC ATGATGCTCC CTAGTGGCTC TGGGAGGTGA TTTGTTATGA CGATATATCG2101 CATCGAAAAA AAAGAAAACT ATGTGGTGCT AGATAAGGGA TTCCTTCACG ATCGGGAACT2161 ATCTTGGCAA GCAAAAGGAT TGCTGGCTTT TATGCTATCT ATGCCGAACG ATTGGGTGTT2221 TAACATGAAA GACTTGCAAA ATCGCAGCAA AAATGGCCGA GATGCGACAT ACCGCATTAT2281 GAAAGAGTTA ATCGAGGCTG GCTATGTTAC TCGCGTCGAA AACCGGGACG GGGGCAAATT2341 CGGAAAAGTG GAGTATGTCG TTCACGAAGT GAAGCAATCA CCGCATACTG AAAGTCCGGA2401 TACGGTTCCA CCGTGTACTG AAAATCCGTA TCCGGGAAAT CCGTATCCGG GAAATCCGTA2461 TCCGGAAAAT CCGCCACTAC TAAATAATAA TAATACTAAT TATAAAAATA CTAATAATGA
2521TGATGATAAT AAGGACCGTC CCAAAACCAA CAGCCTTAAT GCTTTTCGGT TTTATGAAGA2581AAACTTTCAG CCGACGCTTT CTTCTGTCGA TATTGAAATC CTTAATTATT GGTTGGATCG2641TTTTCCGGAA GAAATTGTTC TCTGTGCCAT GAGAAAAGCG CTTGAACAAA ATGTCCGGAG2701AATCAAATAC ATCGATAGAA TCCTTGCTAA TTGGGAAATG CAAAAGGTTC AAACGTTAGA2761AGATGTGGCC CGCTTGGACA GGCAGTATGA ACTTGAAAAA CAAGCGAGAC AAAAACGCGG2821GGGTGTTGTC AATGGCTCGG TTCACCAGCA TCGCGGAAGT AATGACCGAT CTACAAAAGA2881AGATGAAAGA ATTTCGCATT ACGAACCAGG GAAATGGGAC GACGTCGACA TCTCCCTCGA2941TGGACTTCTA TAAATGCGAA AAATGTAAAG ATCTTGGCGT GATCTTCATC GATAACAATA3001CAGCGCGTGT TTGCGATTGC ATCGTGCAGC GGAGAATCGA ACGGCTTTTT AAATCGTCCG3061AGATCACCGA AGCCTTCCGT GGTCTGAACT TCTCGAACTT CTCAATTGAA GACCGTCCTC3121AGATCGTCCG GGACGCCTTT AGATGCGCTG TAAAGTACGT CAAATCGTTT CCGGGTATCA3181GACATAGCCG GAGGAACTCT ATGGCGCTCC TTGGCGCTCC TGGAGCCGGA AAAACACATT3241TACTGACGGC GGTGGCCAAT AATTTGATCA GAAATCAAGT TGAGGTTCTC TATTTTCCGT3301ACCGGGAAGG ATTCGACGAA ATCAAAGACG ATCTTGAGAG CTTGGAACAA AAATGCGAAC3361GGATGAAAGC TGTTGAGGTC TTGTTTATCG ATGATCTCTT CAAACGTGGC GCGACCGAAT3421TCGAGATCAA AACCATGTAT TCTGTCGTGA ACTATCGGTA TCTCAATCAC AAGCCGATTA3481TGGTATCGTC GGAGTGCCTA GAGGATGACT TGTTGAATAT CGATGAAGCG CTCGGCTCTC3541GCATTATCGA AATGTGTCGG GATTATCTCG TTGAAATCGT CGGTGACCGG AAACTATTAA3601ACTATCGATT AGCCAGATAA AAAATGGGAG GTGGAAACCT TGATTGAATT CGTCGGGTTC3661GACGGTCCTG CGCCGTTCTC CATTATTTTT TGGCAACGGA ACGGGGAGTA TGTTTCCAAG3721ACGATTTGTT ACACGAAAGA GGAGGAACGT GAAGCGGTCA GTCGTTTTTA TACGCCGAAC3781GATTTTTACG GTGCTTTTGA TGCAGCGATC ATCTCTAGGA GAGAAGGGAC TGTGAAAACG3841GTTTGGAGGT GGGCGTGATG CGTTGTCCAA TCTGTGGTGG TCGGACAAAG GTATTGGATG3901TTCGTGAAAA GCGAGATAAG GTGAAGCGCA GACGGGAATG TTCGGATTGT TTAACGCGAT3961TCAATACTGA AGAAAAGCTT GTCATTGACT CGTTAGATGA ATACCTCATC CGTCGTCTTC4021ATAGTTTTCG GGCTCTATAG AGGGGGAGGG ATGTTTCGGT GAGAATGGGT TTGAATTTGG4081CGACCCTCTT TGACATGCAG GAGGCGCTGG ATAATCATAT CCGGCGCAAA CAAAACATTC4141CGGCGGATAT GGATCTTGTT CCGTATTTAG TGGATGCGCT TGATGTGGAA ATCGCGGAAT4201TACAGAACGA AGTCCGGTAT TTCAAGTTTT GGAGTGCTGA TCGGAATATG CGACGTGAAG4261CGGCGCTCAT GGAATACGTT GACGGATTGC ATTTCTTTTT AAGTCTAGGG CTGGCGTTTG4321GCATTCCTCG TGAATTTGAA CCAGTGCCAC ATCGCTATGA TCACATTCGG AAACAATTCC4381GATCACTGAA ACGTTATGTC TACGTTATGG AGGGTCCGAT GCAGTGGTAT ATCGCATTCA4441ATCTTTTTCT TGGACTAGGA GAATTTTTGG GGTTCAGTTG GCCGGAAATT GTGCGAGCAT4501ACATGGAAAA GAACCAGGTC AATCATAAGC GGCAAGAGGC GGGGTATTGA GGTGATAGCG4561TGGGGATCAG CGCAGCCAAT CGCGGCATGG CATTTGAACA TCTGGTCGAA TACACAAATC4621GTTGCTACAG GATGAAGGGG ATGGCTCACA TTGAAAAGGT GCCGACTCCT TGGAAGGTGA4681TTCGGCAGGG GCAGCGCATC ATTAGTGCTT TCCCAGAAAA GAAGGGAATG GTCGACTTCA4741TTGGGATTGC TCATGGACGC GTGATTGCCT TTGATGCGAA ATCGACGAGG GAACGCACCC4801GGTTCCCTCT CGATAACATC GAGGACCATC AAATGGCTTT TTTGAAATCT TGGCGCGACC4861AAGGAGCGAT CACCTTTTTC CTGGTGGAGT TCGCAAAAAA GCATGAGGTG TATATCCTCC4921GTTTTTCGGA CGCGGAGAAA TGGTGGGTGC AAGCGAAGCA AGGTGGCCGG AAGTCGATCC4981CGTATCAATG GTTTGTGAAG CATTGCGACC TTGTCAAATC AAGCAGGGGG ATTCCGTTAG5041ATTATTTGAG GTGCTTAGAG AGATGAAAAT GATTACAGTA TCTATTCGCG ATGACCGGCC5101GGAATGGATG AAAAAAGAAG ATGCGCTGAT GTATTGTTGG TGGTATTGCC CAGTGTTTTC5161CAAATGTATG ACAAGGTGCG GTGCTGACTG TCGGCGTTTT GGTGGTGAAA TCATTCCAAA5221GTTGAGGGGG TGAGATCATG CTCGTGAGAT TCTCGCTTTT ATACAAAAGT GGCTATGAAG5281ATGTCATCGA GCAAGTTTGT CTGCCTAGTG AAGTGGGGAA CATCATACAA ACGATCCGAG
5341GTTCGTTTTA TGAGGACTCT CCTGGCTATA TATCCTTCGG TGATGAAAAT CTGAAAGGGT5401ATATCATCAA TGTCCAAGAA GTCGCTCGTG TGAAGATGGA AATCTTAGAA GGCGGTGATT5461CATGATGGAC GAACTGAAAT TTCAACGGTC TGTTATTCGC GCTGAATATG TAACGGAGGC5521GGATATCGAT GTTCGTGCGC GTGAAATACT TAGAGTTTGG CAAAGAAAAG GGCTATACCA5581TGTGGGCCAA AAGTAAGGAG GAAGTCATCG CCAATCTGCG GCAAGTTGGC TGTTCTCCCG5641ACATGGTTAG ATCGCTTGAA ATATGCAAGC CTGGAGAAAA CGAATTCAAA CTATACAATC5701CTAAATTCTT ATGGTGATGT AGGGGACTGA AGAATAAAAA GACCGGGCTT CTCCCGGCCG5761ATAAATGGGG GTCAAGATAC CCATATTATA CCACGGGAGG GGTTCGAGTG GTAAAGTACA5821AACAAATATC CTTTGTACGC GACGTGGATG GGGAGAAGAC AAAAGAGGCT GTGGAGGCAG5881CGCTTGAAAA ATACCGGATA TACATGCTGA CCGTTCCAGA CGAGCATCTC CCGCGAGTGA5941CACAAACGTA CTCTCTCGTG CCGCCGTCGA AGACGAATGC ATTTTACTCA TCTACAGAAA6001GCGCGGCGAT CCGCAAGGTA GATTTTGAAC GCGAGCGTGA CGAGTACATG GAGAGAATAC6061GCCGCGCGGT GAATAAGCTC AATAGAATGG AAAGGGAATT GATTATTAAG CGATATCTGA6121CACTGGAAGA GCCTTATGAT TACGAGGTAT ATAACGAAAT GGGAATCAGT GAATCGAGGT6181TCTACCGCGT TCGCGAAAAG GCGTTCTACA AGCTGGCCTT TGCACTGCGG ATTGAGGTTT6241ACAAAGAAGA GGCACCTGTG TAGGTGTCTT TTTTAATAAT CATTTTAAAC CTTAGTTCAA6301AATTGACATA TAATATGGAA AAATATTAAG CTATTAGTTA AAATTTATCT ATAGTAGTTA6361TTGTTGCTTA CTATGGTTAT TTAATTATGA TCTGAATTTT CATAATTTAG TTAACTAGGT6421ACTTTGCGAA AGGGTAGGAT TCCATATGGC TTTATATAAA GACATTATCC AAAAAGGAAA6481TATGGAAAAG ATAAAGCCAT ATTGTTATTA TGATGATATA AAAAAACTTG TCTTAGACAT6541TTGTAAGTTA CATAACGATT TCTCGAAAGA ATGGGTTCGT TCAGGGAATT ACGCTCCTGT6601GAATTTGAAA AAGCAGTTAA TCAGGGATGT TTCTTTAGTG GTTGATGATC GTGAGGTTTT6661TATACTTGAT AACTCTTTCT ATTCGCTGAT AAAGAATTAT GTTTATAAGC TGGAAGAATT6721GGTGATTGAT ATTGACTTTG AATTTACATA TCAGCATTTG GATTGTCGTT TGCGTGTTAA6781ACAAAATGAA TCAATTGTGA ATAAATTGAA GTATTATCGA GTAGGTAAGA AGGATAAAGG6841GCTTTATCCG TTGAATAAGT GTTTAAATGA TCTGTTAGGA TTCAGGATCA TTGTTGATGA6901TTTTGATCAT GATTGTGTAT ATTTCGACGC TTTATGTGAA AACATAAAAA CTGAATACAA6961AATTCGTAAG ATGAATGCGT CTAAAAATGG ATACAAGGCT ACACACATAT ATTTTTATGG7021AGAGAGCAAT ACATATTTTC CTTGGGAGTT GCAAATATGG AACTCAAGCG ATGAGGAACA7081AAATGTACGT TCCCATCGTA TACATAAACA AGAGTATGTA AAATGGGCGG CTATTTATAA7141AAATTCGAGC GAATTTGAAG GGGGTGTTTA GCTATGGCGT TTCATTTTAT AGCCATTGTC7201AGTCATTATG CAGAGGGCAA AAGGATCGGA TGGCATTACG CTGACGAGGA AAAACTTGAT7261AAAAAAATCA TCCATGAGTT TTTGGCACAA GTGAAGAAAA AGTGTGGAGA TGTACAGTTA7321GGCATTCATA AATTATCTAC TGATTCTGTA TCATGGGATT CTGTAGTTCA GAGGGATGCT7381TTTTTTAAAG ACGTAATTAT CACAAGTGAC ATAGAAACCT TCATTGAACT AGTTTCACAA7441GATCAGGAGT TAACCGCATA TGACGTGGCA AAATTTATCT TATCAGTTAT ACCATCATCG7501CATTTGAAGT TACAAAAATT ATTATACTTT GCATATGCGG AGTTTCTATT ACGTACTGGC7561GAAAAATTAT TTAGGGAACC GATACTGGCG TTTAAATATG GGCCTGTGGT GGAAAGTGTT7621TTTCATAAAT ACAAAGTTCA TGGCTCTTCT ATAATTGACT ATAAAGAGGA CGAAACAGTT7681TGTATATATC CTGAGGAACT GGCGGTTACC CCTTCTTTCA TGAAAATTGC ATCATCGGAG7741CATGGTTTGA TAGCGTTAGA CTGTCTAATA GATGTCTTTG AAAAATATGG GCATTTGAGT7801GCTCAGGAAC TCGTTTATAA AACACATCGA CCAGGCGGAC CTTGGGATAG AGTGTATAAA7861CCGGGAAAAA ATGCTGTAAT AACGGATGAT TTGATCCTTC AATACCACCA TGTAATTCAA7921TAGAGTTTTG AAACAAAAGG GTTTTCGTTA AGCAATGATT AATAAGACGC AACGACAGAA7981AACTGACGGA AAAGTGAAGG AAAAATGACA GAAAGACGAC AGAATATTTC TGTTTAGACG8041TGCTATGATG GTATCATCAG GCACAGTGAC AAGCCGGATC CTCCTTCCAT GAGCGTCACC8101CGATCGGGTG GCGTTTTTTA TTTTCCATGA ATTAGAAAGA AAAAATAGGG CACAATAAGC
8161TCATGTAATG GAAGCTACGG TGAGATCATG AAGTGAGGTT TATGGCTATG TATCCATACA8221GCGACAAAGA AAAAGCCCAA CGTGCCTACG AACGCCATAA GTGCAAAAAC TGTATATGGG8281CGAAGTGGCA AGTGCCATAT CTTGTTTCCT GTCCATTTGT GCGGTGTGTG CGAATGTCGG8341AATTTGACGA AAAAGTAATG TAGGAAAATC CATTCTGTTT TGAAACTAAC TAAAAGTGGA8401GAACGATAAA ACGTCAAAAA CATGTCAGTT TTGGCACGTT TTTTTGTGAT AAAATGATAC8461TGAACAAATG AATACCCTCT ACTATGTATG CGCGTGAGTG GAGCAGACTA ATCGAGAAGA8521GCCCAACAAA TCCCTTCCCC GCAAAGAGGC TGTCGCTTAT TGAGCGGCGG CTTCTTTATT8581TTAGCGATGA GTACGAATGT CGGAATTTGA CGGAAAAACG ATGCAGGAAA ATTCCTCCTT8641TTGCCGAATT GAGTAGGCGG AAGGGGGATG AGATAATGAG TGAATTAAAT TATGAACAGT8701TTGGGAAATG GGTGAAAAGC GAATCATTTC GACCAGACAC TGGTTATAAG TTAAGGCTTG8761CATATTTTAA AATTAACGAG CTTTTACCAA CAAATGAAAT TAGAGAGTTC TATCCTAAAA8821ACATAGTAAA TGACAAAGAA GATATTGAGT TATTCGCATT TTTAGATAGC AAATTAATCG8881TTATTTTACC GAAAGAAGAA GAAACCATTT TTAAAGTGCT GTATTTAAAA GATATTTCGC8941ACGTTGATTT AGAAATTAAG AAAAATGTAG GATTCGAAAG CTGGATTTTA AAAATTTATT9001TCACAAACAA CACAGTTATT GAATTAAACA GCGTCGAAGA TAGTAATGAA TATTGGCGAT9061ATAAATATGA AGATGCAATT AAAAGAATAT ATAAGTCATT ATTGGTTGAA AAATCGCTCG9121TGTAATTACG GGTGATTTTT TATTTGGGGT TGATAGCATG AGTTTCTATA AAACAAAACA9181ATGGAAAAGG AAACGTGAAG CAATACTGCG GCGTGATGAG TATTTGTGTC AAGAGTGCAA9241GCGATATGGA AAGACAACGC CAGCGAAGAT AGTGCATCAC ATTATACCGT TTGAACAAAG9301ACCGAATTTG AAGTTGAATA ACGATAACTT AGTCAGCTTA TGTTTTCAAT GTCATGAGCA9361GATGCATAAC AAGATGACGA ACGAGCTGAC GGAAAAAGGA TTGGAGTGGG TGGAACGAGT9421GGAACGGAAG TTGAGATGAT CCCCCCCCCT CCTTTCTGTT TTTGAACAGC CGACAGGGGA9481CCGGCCAGGG GGCAGGCGTT TCCAATAGCG CGGAGTTTTT TCGAAAAGGG GTGATAGCGC9541ATGGTCAAAA CGAAAAAGGC GTTCATCGCG GAGATCAAAC GACAAATGAA ATCACTGGGC9601ACATACAAAA AGGAATATGA TCGCATGATT GAAATTTTCG CCGGGATGCT TCACCAGTAT9661TACATGTTCG AGGAACAGTT TGCCCAGAGT GGGTACAAGA TCACGGAGAT GTACACCAAT9721AAAGCCGGCG CGACGAACGA ACGAAAGACG CCCCTATACA CAGCTATGGA AAGCCTGCGG9781AAAGACATTG CAACGTACTC TGATCGACTA TGCTTAAACC CGAAGGCGAT GGAGGCCATC9841ACCATTGAGC AGCAAAACAA ATCAAAACTA GCTCAAGTGT TGAGCGAGTT ATCATGAGGA9901AGTATAAAAA CTACGACGCG GTGATGGAGT ACGCCAAAAG CATAGTCGAT GGCCGGAAGG9961TAGCCTGTCG CGAACTGATC CAGGCGGCCA AGCGATTTTT CAAAGACTTG GAGAATCCAA10021 AATATGATTT CAATCCAAGG GAAGCGGAAT TTGTTATTCA AATCATTGAA AAAACGTTCG10081 TTCACGACCA GGGTGAACGA CTAGACGGCA CACCATTGCG CGGTGAGCCT TTTTTATTGG10141 AACCGTGGCA AAAATTCATT ATCTACAATT TGCTTGGTTT CTATCTCAAA GGAACGAAGA10201 TCAGACGCTT CAAAGAAGCG TTTATTTTTA TACCTCGGAA AAACGGAAAA ACCCGGCTTA10261 TTGCAGCGTT GGCCTGGGCT TTGGCTCTGC TTGAAAGGCA ATCCGGGTCA AAAATCTATA10321 TCACAAGTGC GGCGCTTCAG CAGTCGTTGC AGTCGTTTGA ATTTATCTTG TTTAACCTAC10381 GGCAAATGGG CGAGGAACAA AACTTTCGGA TATTGAACAA TAACCAAGAA AACAGCATCA10441 GCGGCGAATT CAGCGATGGT TCCATTTACA TTCGGGCCCT GGCAGCCAAT CCAGATAAGC10501 AAGACTCCCT GAACTGTAAT GTCGGGATCG CCGATGAAAT GCACGCCTAC AAAACACCTA10561 AGCAGTATAA CATCATCAAA GAGGCGATGA AGGCGTACAC GAACAAGCTG ATGATCGGGA10621 TTACAACGGC CGGCGACGAT ATGACCTCAT TCTGTTATCA ACGCTTGCAA TATTGTAAAA10681 AGATTCTCGA TGAAACGGTG ACGGATGAGG CTTATTTCGT TTTCATTTGC AAAGCTGATG10741 AAGACGAAAA CGGCGAAGTG GATTACACAA ACCCGATTGA ACATGAGAAA GCCAATCCGA10801 ATTATGGCGT GACGATTCGG CCGGACGACA TAATGAACGA CGCGCTCCAG GCGCAAAATG10861 ACCCACAGCA GCGGAAAGAT TTCTTGGCGA AGTCATTGAA CATCTATACG TCGCCGATGA10921 AGGCATATTT CAACATCGAC GAGTTTAAGA AGTCTGATCG GAAATATGAT TGGACGATTG
10981 AACAGCTTGC GAAACTTGGT ATCGACTGGT TCGGTGGTGC GGACCTTTCT AAATTGCATG11041 ACTTGACCGC AGCAGCGCTC TATGGAAACT ATAACGGCGT TGATATAGCC ATTACTCATG11101 CGTGGTTCCC GATCGTCGCG GCAACGAGAA AAGCAGAGGA AGACAATATT CCGTTGTTTG11161 GTTGGCGTGA TGACGGCTGG CTCACGATGA CCAATACACC GACAGTCAAC TTTTCCGATG11221 TGGTTAAGTG GTTCGAAACA ATGCGAGCAA AAGGGTTTAA AATTAAGCAA GTCGGATTTG11281 ACCGGAAGTT TGGCCGGGAA TTTTTCATGG CGATGAAACA GAAGCGATTC AATATTGTCG11341 ATCAGCCACA ATACTATTAC AAGAAATCAG AAGGTTTTAG ACGGATTGAA AAGCAAGTGA11401 AAGACGGCAA ATTCTATTAC TTGCATTCCC AGGCGTTTGA ATACTGCGTT CAGAACGTGC11461 ATGCGATTGA AAAAACTGAC GACATGATCC AATTTGAGAA AATTGAGGAC AAGCACCGTA11521 TTGATATTTT TGACGCTACT GTATTCGCGG CAATCAGGAT GCTTGAAAAT ATGGAGAAGG11581 CTGCGACGGC TACGAAATGG CTGAAAGGAG GTTGATGGAA TGGGACTATT CGATAGGTGG11641 AGGCGAACGA AGCGTAAAAG TAAAATTAGA GCGGACACAG GGTATGTAGG TCTATTCATG11701 AGTGGTGAAG ATGTATCCTT TCTTGTTCCC GGATACGTAA GATTGAGCGA TAACCCGGAA11761 GTGCGCATGG CGGTGCATAA GATTGCTGAT CTCATCTCGT CGATGACAAT TTATCTCATG11821 CAGAATACAG AGGACGGGGA TATCCGCATC CGTAATGAGC TGTCGCGCAA AATTGATATC11881 ACTCCATATT CGCTCATGAC AAGAAAGTCA TGGATGTACA ACATTGTGTA TACGATGCTG11941 TTAGATGGTG AAGGAAATAG TGTGGTGTTT CCGAAGTATA CAGCGGATGG GCTGATTGAT12001 GAGCTGGTAC CGTTAACGCC GTCGAAGGTA AACTTCTTGG ACACACCGGA TGGATACCAG12061 GTTTTATACG GCGGGCAGAC ATTCAACTAT GATGAGGTTT TACACTTCAT CTACAATCCG12121 GACCCGGAAC GCCCGTACAT CGGACGAGGG TATCGGGTGG TATTGAAGGA TATTGCGGAT12181 AACCTAAAGC AAGCAACAGC GACAAAGAAA AGTTTTATGA GTGGGAAATA CATGCCTTCG12241 CTCATCGTGA AGGTTGACGC GGCCACGGCA GAGCTTTCGA GTGAAGAAGG CCGGAATGCA12301 GTGTTCAAAA AATACCTTCA AGCCACGGAG GCGGGGCAGC CTTGGATCAT CCCGGCCGAG12361 CTGTTAGAGG TGGAACAAGT CAAACCATTA TCCCTTAAAG ATATTGCGAT CAACGAGGCG12421 GTCGAACTGG ATAAGCGAAC CGTAGCTGGC ATGTTTGGTG TGCCGGCTTT TTTATTGGGC12481 ATCGGGGAGT TCAACCGGGA TGAGTACAAC AACTTTATCA ATTCGACTAT CTTGCCGATT12541 GCAAAAGGAA TCGAGCAAGA ATTGACGAGA AAGCTGCTTA TCAGCCCGGA TCTTTATTTC12601 AAGTTCAATC CACGAAGCCT ATACGCTTAT GACCTTAAAG AGCTGGCGGA AGTCGGTTCA12661 AATATGTATG TCCGTGGCAT TATGGAGGGA AATGAGGTCC GCGATTGGCT TGGGCTTTCG12721 CCGAAAGAAG GATTGAGTGA GCTAGTTATC CTAGAAAACT ACATTCCTCT CGATAAAATT12781 GGCGATCAAA GCAAACTGAA AGGTGGTGAT AATAGTGGAG CGGACGGTCA AACAGACTAG12841 AAGCCTACAA ACGAACATCA CCGCGACGCG GGCGGAACAG GATGACGAGA TGTATATCGA12901 GGGATACTTC GCAGTGTTCA ACCGTGAAAC CGAACTATTT CCAGGAGCTT TTGAAGAAAT12961 CGCACCGGAA GCGTTTAACG GCACGTTGAG CAACGACATT CGCGCGCTAA TCAATCACGA13021 TGCATCCCTA GTACTCGGCC GCAACAAAGC CGGAACGCTA GAGCTAAAAG TGGACAGCCG13081 GGGCTTGTGG GGACGGATTA AGATCAACCC TCGCGATAGT GATGCGGTCA ATCTGTACGA13141 GCGTGTCAAG CGTGGTGATG TCGATCAATG TTCGTTCGGA TTTAACATCA TTGAAGAAGA13201 AACGGAGTTC CGGGATGACG GAACAATCAA ATGGACACTG AAAAAAGTGG ATTTACATGA13261 GGTTTCAGTG GTCACGTTCC CGGCATATCC GGATACAAGT GTCCATGCTC GCATGAAAGA13321 ATACGAACAA CATAAGAAGC GGCAACTAGA ACAAAGAAAA TTACAGTTAA AGGAGCGTGT13381 GAGAAATGGC GTTGCGTCAA TTAATGTTGA CCAAAAAAAT TGAACAACGT AAAGCTGCAT13441 TGGACGAGCT CGTGAAGCGC GAGCAGGAAC TGCAAGCGAA AGCAGCGGAG TTGGAACAAG13501 CGATCGAGGA AGCACAAACC GAAGAAGAAG TTTCAGCGGT GGAGGAAGAA GTCGCAAAGT13561 TGGAAGATGA GCGTAACGAA CTGAACGAGA AAAAATCGAA ACTCGAAGGC GAGATCGCTC13621 AATTAGAGGA TGAACTCGAA CAGATCAACA GCAAACAGCC TTCGAATCAA TCCCGCCAAA13681 AAATGCAAGG GTCGAAAGGA GATGTGGTAG AAATGAATCG TTTACAAGTG CGCGAGATGT13741 TAAAAACCGG TGAATACTAC AAACGGAGTG AGGTCGTTGA GTTTTACGAG AAATTCCGCA
13801 ACCTCCGTGC GGTGGCTGGC GGAGAATTAA CGATCCCGGA AGTGGTTGTC AATCGCATCA13861 TGGACATTAT GGGCGATTAC ACGACACTTT ATCCGTTGGT TGATAAAATC CGAGTTAAGG13921 GGACGACTCG TATTTTAGTC GATACGGATA CTTCTCCGGC TACTTGGATT GAACAATCCG13981 GCGCGCTCCC GACTGGCGAT GTTGGAACGA TTGCAAGCAT TGATTTTGAT GGATTTAAAG14041 TCGGTAAAGT AACGTTCGTG GATAACTACT TGCTGCAAGA CAGCATCATT AATCTTGATG14101 ATTATGTGAC GAAGAAGATT GCTCGCGCTA TTGCCAAGGC GTTAGATTTG GCAATCGTTA14161 AGGGTACAGG TGCTGCAAAT AAACAGCCGC TAGGTATCAT TCCGAGCTTA CCTCCAGAAA14221 ATCAAGTAAC CGTTGAAGCC GATAATAACT TGCTTAAAAA TCTTGTGAAG CAAATCGGGT14281 TGATTGATAC GGGCGACGAC AGCGTCGGAG AAATTGTGGC TGTGATGAAA CGTTCTACTT14341 ACTACAATCG CTTGGTAGAG TTTAGTATTC AAGTCGATTC TAACGGCAAT GTGGTAGGTA14401 AACTTCCGAA TTTACGCACG CCGGACCTTC TCGGATTGCG CGTTGTGTTT AATAACTTCC14461 TTGACGATGA TACTGTATTG TTTGGTGAGT TCGAACAATA TACGTTGGTC GAACGTGAGA14521 ATATCACGAT TGACAGCTCG ACTCACGTGA AGTTTACCGA AGACCAAACC GCATTCCGTG14581 GTAAAGGGCG CTTCGATGGA AAACCAGTGA AACCAGAAGC GTTTGTTCTT GTAACGATTA14641 CGGATCCGGT TCAAGGAGCG TAATTTTGAA GAGGTGATAT AAATGCCTAA ATATGTAGTG14701 ATTAAGGTTT TTAAGGATTT GCAGGATCGC CAACATATCT ATCGCGTTGG CGATACGTAT14761 CCACGGAAAG GGTATAAGCC GTCAAAAAAG CGTATCGAGG AATTGCTGGG GAATGAAAAT14821 AGGATCGGTG AACCTTTAAT CGCAGAAGTG GATGAGGAAG AAGGCAATGA ATGATGGACA14881 TCGCTACGAT TGTTAGTTTA GTGAAAGAGC GGCTCGGTAT CCGTACGACG GTGCGTGACA14941 CGTATATCAC TGCGATCGTC GACGGCGTAA TTAAAGAGCT TGAAGACGAA AAAGGGTTGG15001 TGCTGGATGG CGCCAACCCT TATCATTTGA TGTTTGTTGT CGATTATGCG ACATGGCGCT15061 ATCAGAGCCG AGATAGCGAC GGGGCAATGC CTAGACATTT GCAATATCGG CTGCATAATC15121 TCATGCTCCA TGCAGGCGGT GGAACCGCGT GACATACGAT AATGAACTTG TATTGATTGC15181 ACAGGAATTT GTGGAGGATG AAATCGGCAA TCAGATACCA ATCGAAACGC GAAAAACTGT15241 CCTATGTAAC GTGAAATCGG TTGGCAGAAA TGAATTCTAC AGCGCTGCCA CGTCCGGGCT15301 GCGCCCGTCT GTTGTGTTTG TCGTCCACAG ATATGAATAC AGCGGTGAAT CGGAAGTGGA15361 GTTTGAAGGG ATAAGGTACC GGGTAATCCG GACATATGCT GTTGATTTTG AGGAAGTAGA15421 ACTCACTTGC GAGAGGGTGC TCGCCTATGG CTAAGATAAA AATCGGCAGG CTGGCGGATG15481 AGATCACAAG TCAGTTACGA AAATATTCGC AAGTCATAGC CGACGATGTA GAACAAATCA15541 TGGATGATGT AACAAAAGAA GCGGTTGGCC GGCTAAAGAG TAAAATCCAA GAAGTTGGAT15601 TGGTACAGAC GGGCGACTAT ATGCGCGGAT GGACACGGAA GCGAGTGCCG AACGGGTGGG15661 TGATCCATAA CAAGACCGAA TATCGGCTGG CTCATTTGCT GGAATACGGA CATGCCACAG15721 TGGATGGCGG CCGGGTTCCG GGTACGCCGC ATATCCGGCC GATTGAGGAC TGGCTTGAAA15781 AAGAATTTGA AGATCGTGTC GAGAAGGCGA TCAAACAATG AAGCTGACAG AGTTGGACGA15841 TCTGCTCAAG GCGACAGGTT TACCAGTTGC CTATTCGCAT TTTTCAAAAC CGCAAAAGCC15901 ACCGTTCATC ACGTATATGG TCGCTTATTC ATCGAACTTT ACTGCTGATG ATCAAGTATA15961 TCAGGAGATT GAAAACGTTC AGATCGAGCT GTATACGTTG AAAAAGGATT TTGAAGCAGA16021 GGAAAAGGTG AAAGCCGTTT TGGATGCCAA CAATCTCGTT TATGAGACAT CCGAAACGTA16081 TATTCCATCC GAAAAACTAT ACCAAAAAGT ATATGAGGTG AGGTTATTAT GACAGCTCCA16141 AGCACCAATA AAATCAAATA TGGTCTTCGC AATGTCCACT ATGCGGTGAT TACGGAAGAC16201 CCGGTTACCG GAGCGATCAC ATACGGGACG CCGAAGCGCA TTCCGGGGGC CGTTTCGTTA16261 ACATTGGAAC CTGCTGGCGA AACGTTTGAT TTTTATGCCG ATGATTCTGC TTATTATAGC16321 GAAGCGACCA ATAATGGATA TGATGGGGAA CTGGAAGTGG CCAACCTAAC CGATGAGTTT16381 CGTATCGACG TCCTTGGCGA CACGTACGAA AATGGCGTGA TGTACGAAAA CGCGGACCAA16441 GTCACGAAGC CATTTGCTTT GTTGTTTGAG TTTCAGGGGG ATAAGAAAGC GAAACGACAT16501 GTCCTGTATT ACTGCAAAGC GAGCCGTCCG ACTGTCGCAG GGCAAACCAA AGCAGAAAAC16561 ACAGAGCCGC AAACGTCTAC ATTGAATTTC ACGGCTCGTC CTCGCCCGGA CACGAAAGAG
16621 GTAAAAGCTG ATACGACTCC AGAGATTGAC CAAGCATTGT ACGATAGCTG GTATACACAA16681 GTGCACGTAA AAGGTGCGAC AACGGGGGCA TGATGAATGG AGAAAACACT AGTGATTGAC16741 GGCAAACAGG TTCGCTTCAA GTCCAATGCG GCAACGCCGT TGCGCTTTAA AGCACAATTC16801 GGGAAAGACT TTTTCAAAGA AATCTACAAA TTGAATGCAA TTGGGGAATT GACGGATAAG16861 GATGGCCATT TCAACTATGA AGTGCTGGAA AAGCTGGACT TTGACTTTTT TTACAATATC16921 ATCTGGACGT TGGCTAAAAC AGCGAATCCG TCTATTCCCG ACCCGATTAC GTGGCTAGAC16981 CAGTTCGATG AATTTCCTCT CATGGAGATT ATTCCGGAGT TGCAGGATCT CATTATTGCT17041 AGTATCCAAT CTAAAAAAAA GTAGATGATA ACGAAGCGTC CGGTGGTGAT GAATTCACTA17101 CCGAGACGTT TCTTTTTTTG TGCAAACGCT GCGGATTACA AAAGGATGAC TTGGAAGAAA17161 TGACGATCGG CATGTGTATC GACTACATCG ACGAATACAT TCGTGAAATG ACCAATCCAA17221 AAGAACGTGT TCGGGAAGCC ACACAGGCCG ATTTTGATGC GTTTTAGAGG GAGGTGAGGC17281 ACGTGGCGGA CCGTATCAAA GGGATTACAA TCGAAATCGG CGGGGATACG ATTGGGTTAC17341 AAAAGGCGTT GCAGGATGTG AACGCCAAAA GCCGTGAGTT GTCGAAAGAA TTGCGCGATG17401 TGGAACGCTT GCTCAAGTTT GATCCGGGCA ACGTCGAGGC CGTGGCTCAA AAACAACGAA17461 TTCTCGCCCA ACAGATTGAG GCAACGACCG AAAAGCTGAA CCAATTGCGC TCCGCACAAA17521 GCCAAGTGGA GGAACAGTTC CGCAAGGGAG AAATCGGAGA ACAGCAATAT CGGAACTTTC17581 GACGTGAAAT CGAATTTACG GAAGCGCAAC TAAGCAAATA CAAGCAATCA TTGCAGGCGA17641 TACAGGATGA GCAGCAAGCG GCGGAGCAGT CCACCAAACG GTTGGAATCG CTGTTTAAAG17701 CAACTGGCAA AAGTGTGGAT GATTTTGCTG ATGTTTTAGG AAACAAGCTA GTCAATGCGA17761 TCAAGGACGG AAAAGCATCG TCTGCACAGT TAGAGGATGC TTTAGGTAAA ATCGGGCAGG17821 CCGTACTTGG TGCCGGCGTC GATTTAGATA AGATGCGCCA GGCGCTCGAC CAGCTTGCGA17881 GCGGCGCTAA ATTGGATAAA GTCAAAAAGG ATCTTGACGA GATTGCCAAA GCGGCGAATA17941 GTGCCGAAAA GGATGTGCAG GGACTAGGCG AGACGCTTTC CGGTGTTGCT GGCGGACTTG18001 CGGCCGGCGG AGGGCTAGCA GGGGCGATCA ACCAGGCGTT GGATACCTCC AGACTAAACA18061 CAAAAATTGA TATTTCGTTC AATGTGCCGG AGGAATCGAA AAAGGCCGTT AAAGATGCTG18121 TCAACACAAT CAAGGCATAT GGTATTGATG CGGAAACGGC GCTTGAGGGC GTGCGGCGTC18181 AGTGGGCGCT GAATGCTGAC GCCAGCGATG AGGCAAACCA AAAGATTATT GAAGGTGCCG18241 GCATGATTGC GTCTGCCTAT TCAGGCATTG ATTTCACCGA ACTGATCCAA GAAATCAATG18301 AAATTGGCAG CGAGCTCAAG ATTTCAGATG AACAGGCGCT CGGCCTAGTC AATAGCCTTT18361 TGCAGATCGG ATTTCCGCCT GATCAATTGG ATATCATCGC AGAATATGGC CAGCAGTTGC18421 AACGAGCTGG CTACGATGCT CAGGAAATTC AGGCGATTTT TGCGGCAGGC ATTGAAACTG18481 GTACATGGAA TATTGACAAC TTGCTCGACG GGCTGAAGGA AGGACGGATT CGTCTAGCAG18541 AATTCGGACA GGGCGTTGAT AAGGCGACGG CCGAACTCCT TGAAGGGACG GGCATATCGA18601 CGGCACAGCT TCAAGAGTGG GGCCGAGCGG TCGCGGCAGG CGGGGAACAA GGGCAAAAAG18661 CTATGTTTGA AGTCGCACAG GCGCTTGCCG GTATCAAAGA TGCGACAACA CAGAATGCGC18721 TCGGTGTGAA AATTTTCGGT ACAATGTGGG AAGACCAGGG CACGAACATC ACCGAGACCA18781 TTCTCAATAT GAATAAGCAC TTGGCGGACG CGAAAAACAA CCAAGATTTG CTCAATGATT18841 CAATTTCAAA AATTAATGCT GACCCGGCCG TAAAGTTTCA AAAGGCGATT GGCGACCTAA18901 AAACGGCACT AGAGCCTCTG ATGTCAGTCA TCGCTTCTGT TGTCGGGGCT ATCGCTAGCT18961 GGATGTCAGC CAATCCGCAG CTATCTGCAA CAATCACTGC GATTGTAGGC GCTGTCGGCA19021 TTTTTTCGGG CGCGCTCATG GCATTGGCGC CCATTTTATA TTCAATACAG AATGTCCTCC19081 CAATTATCAC GAAGATGCTA CCAATGCTAG GGAATGCGTT TAAAGCGATG ACGGGGCCGA19141 TTGGACTTGC CATTACTGTG CTGACGCTAC TTGTGCCCGT CATCATTAAA AATTGGGAGC19201 CAATTAAAGA ATTTTTCTCT AAGTTATGGG ATGGAATTAA AAAAATCTTT GAAACAACTG19261 TTAATGCAAT CGGATCGTTT TTGAGCAGCG CGTGGGAGGG CATAAGAGCG GCTGCACAAG19321 CGGTGTGGAA TGGAATTAAA GCCTATTTTG AAACGGTTCT CAATGTATAT AAAACGATTT19381 TTACGGTCGC ATGGAATGCA ATCCGAACGG CGGTAGTCGC GATTTGGAAT GGACTGAAGA
19441 CCACAGCAAT CACTGTTTTT GACGCATTGA AAAACGCCAT TTCTAACGCC TGGAACGCGG19501 TTAAGACGGT CACAACGACT GTATGGAACG CGATCAAAAG CATGGTAACG TCGGTGATGA19561 ATGGTATTCA GCAAGCAATC AGCACAGCCA TCAACGCTGT CAAAAACATC GTCACGGCGG19621 CATGGAATGC AATCAAATCA CTGACATCGA GCGTATGGAA CAGTATCAAA AGTTTTGTGA19681 TGACCCATGT AAACGCGATT CGCGACGCTG TTCCAACGGC TTTCGAGGCG ATGAAAAACC19741 GAATCGCCAG CGTGTGGGAG GGAGTAAAAA ACGTGATTAA AGCCCCATTG AACGCGGTCA19801 TTTCAATCAT TAATAGTTTT ATTGGCCGTT TGAACACACT TAAAATCCCC GACTGGGTGC19861 CGGGGGTAGG CGGAAAAGGA ATCAACATCC CGAAAATCCC GATGCTGGCG AAAGGAACGG19921 ACTATTTCCG CGGCGGCTAC GCGATTGTAG GCGAGCAGGG GCCGGAGCTG GTACAACTGC19981 CACGCGGTTC GAAGGTGTAT CCGAATACCG AGACAATGGG CATGTTGGGC GGAAACATTT20041 CGATCAACAT CCAAAATATG ACCGTTCGAG ATGAAAGGGA TATTGAAAAA ATTTCGCGTG20101 AACTATACGT GCTGATTGAA CGGTCAAAAC GAAGCAGGGG ATCGCGATGA TGTGGTTTGA20161 GTTTAATGGT ATCCATTCGA ACGAACTAAA CATTCGTGTT TTGCGTTTTA GAAAACCTGT20221 TTTCCCTGAG TTTAGCGATC ATTATGAATT GATCCCTGGT CATGACGGAG GCATTCTTTT20281 TCCTCAATCA TTCGGAATGC GCTCGATTGA AATAGATTGT TTACTTTTAC ATGACCAAGA20341 CAATCGTACA GAAGCAATTC GGCAGCTTTC TCGAATATTA ACCAAAACAG AAGCATCGTT20401 AATGATTAGT ACTGAACCTG ATGTCTATTA CATCGGGAAG CTAGCTGGTA CTTTTGTTCC20461 CGATATGCAC AGAACATTGT CAAGTTTTAC GCTTGCGTTT ATCTGCCAAC CATTCGCATA20521 TAACGTTAGT AAGACAAGTG TATCGAAGCA AATAACTGCA AGCGACAAAC AAATTACTCT20581 CGTCAATAAC GGAACATACG ATGTGCTTCC GATCATAAAA ATATCGAATG CAAACACAAA20641 TTCACTGTCT TTGACGCTTG ATGATGACAA ACTAACCATT TCAAACACAC TTCAAACAAA20701 CGATGTATTA ACGATTGATT GTGATGAAAT GACCGTTTTG TTAAACGATA CGAACATTTT20761 AGACAAAACG ACGGGCACGT TTTTGACGTT GCGACCGGGA ACAAATGTAA TGACCATCGA20821 GGCACAGAAT ACGCTAAACG TATCGGTGGA ATGGCGCGAA CGGTTTTTGT AGGGGGTGAG20881 AATGTTGAGC TTTTCAAGAA AAATCGAACA TCAGATGGTG CTGTACGACC TGGCCGGGAA20941 GCCACTTGGT GTTCTGAAAA ACGCATACAA CATTGAACAT GAAGAAACAT TGAATGACGC21001 AGAAGTGCTC ACATTCTCCC TTCCACGCGA TGATCGCCTT GCTCGAGTGA TGATGAACGA21061 TATGGAAATT ATCTATATGG GCAAGCGTTT TTTTATTGCC GAGATGAATG ACGGGCGCGA21121 TGCGAACGGC AAACCGATTT TCGATATTGT CTGTCCATCG TATTTCGTGA AATTGCTCGA21181 TACGTTTTTA ATCGAGATAA CGATTGATAG AAAGACACCG AAAGCAGGAC TAGAGCAAAT21241 TATTTCGCGT ACAGGTTGGG TTGTCGGTCG TGTAGAGGCG CTGTCGTCAC AAGAAACGCA21301 GCATTCGATG AGTGAAAAGA GGAAGTCGGC GCTTTGGGCA ATCAGGCAAT GGGCGAAAAT21361 CACAGGGCAT GAAATTCAAT TTGATACGGT AAAAAAAGAA ATCAACCTTG TCAAACAGAT21421 CGGAACGAAT CGGGGCTATG GTTTTCGATA CAGGAAAAAC TTGAAAGAAA TCAAACGGAC21481 GATTCGTGCG CCAGAGGCGA CAGTCTTGTA CCCGTACGGA AAGAATGGAC TGTCCATCGA21541 AAGCGTCAAT GACAACAAAC CGTATGTCGA GGACTACTCT TGGTATACGA GTTTAGGAAT21601 CCCATTGATT GAGGCAAAGC AAAAATATCG CAAAGAGTAT GTATGGGAAG ATGAGCGTTT21661 CTTGTTGGCT GGCGATTTAA TGCGTGCGGC ACAAGAAAAG CTGAAAGTAT TATCACAGCC21721 TGTCATTTCA TATCAATGCA AAGTCATCGA CTTATCTGCA TTGACAGGAA ATTCACAATA21781 TGAGTTTTCT GTCGGGGATT ATGTGAATGT GTTCGACGAT GAACTCGGAA TCAACGTTCA21841 AACACGCATT GTTCGCATGC GACGTTTTCC CGATGAGCCG TATCGAAATG AAGTGGAATT21901 GTCGTATATG ATCCCGGGGA TTCATACGCA AGAACAAGAC CAGCTCACTT CATCGGATGT21961 TTCGTTGTCC CAGCCGTCTT TTATTATGGG GACCAACGAA AAAACGCTTT CTATTGGAAC22021 ATCTGTTCAA ACAGCTCTAT CGCTTGTCAT TACGAATTTC AGTTCGGCGA ACGCACAAGT22081 TGGTTTGTCG TTGATTGGAC AAGCATCGAC AACAATGACG GTTGAAATTT CATTTATGTA22141 TGGCGGGAAG CCAACATTTA ATACGATTAA ACAAACGTGC CAAGCTGGAT TTGTAACAAT22201 TGGCGTCCCT TTCTTGTTGT TGCAAATGCC TCCTGGTTCG GCATTTTTAG ATGTGCAAAT
22261 GAAAACAAGC GCTGGAACGC TAACCATTGA CCCGCGCGGA CTACAAGTGT TTGTATATGC22321 AGCGAACTTG CTCGGCGGCA TTTCTGACCG TCTACCGCGT GCGAATGTGA CGGAAGAAAT22381 TCAATGGAAA AATGTTATTT CTCCATACAC AAGCCGTTTT CAATCAGTCG TTAATGGGCA22441 AATTGTCAGC ATGCAAGTCG TTGTTCCTGT ATCTTCAAGC ATCGTTGAGT CTATATCGCA22501 ATGGCGAACA CATGATCAAG TTCGTCCGTT TCCGGCAGTG GCAAGTACGG TACAAATCAC22561 ATTGAATTGA GGTGAGGTGA GAAATGTTGG AGTTTGATAT GGAACGTGCC TATCATTCAT22621 TGGCTAGAAA GGAAAATTTC GTGACTGGTG AAGTCATAGA GATGTTAAAA CATAAAGTGA22681 GCTCGATTCC AATTCGCGGC TTTACGAGAG TGGAGCTTTT TGACGAAAAG CGATTTGGGA22741 AAAAAGTTGA AGAAGTTACG GCAGAAAACT TTATTTCGAT TAATATGAAA GATTATCTTG22801 AATACATTTT GATGAACGAG TATTCAAAAA TAGGGGTCGG ATTAAACGGG CCGGATATGA22861 TGCCGTTGAC TTATAGAGAA TTCCCATTTA ACGTGCTTGC GCTGACAACG GATTCAAAAC22921 CTGAAAACCC TCAAATCGAA CGAGTAGTAA TGGGAGACAT TATTGGCTAC GCTTTTAAAA22981 ATCAGTATGT TGGTTCTGAT ACGCGAAGAG GGACGCTTAA TTCTACTGAA AGTTATCGGA23041 ACCGTTCAAT CGCGCATTTT GTATTTGATT TTGCGACTAA CGCTGCAAAT GGTACGTTTT23101 CATCGGTTAT TTGGTATTCC GATGTTGGGT CATCGTCTAT AACAGCCCAA AGATATTACC23161 AAAAAAATTA TGAGTGGTAT GTTGATTTAA ATGGAAAAAA TGGATTAGCA AGTTTAGGAG23221 GTGCGCGGGG TGGGATGTGT TTTGATGGAT CAAGCTTTTG GTCCATGGAA ACACTGGATA23281 GTGGAGTGAA AAAAATAGTG GAGTTGTTAA TCACCCCTGG TCAAAAAGGA AATGTGGCAT23341 CATTTACATT GGGGAGAGTT TGGGATCATC CTTCTTACTT GGGTTCAAGT GATGTAGCCT23401 ACGATATGAC ATATGACTCA GATTACATTT ATTACGTAGT AAACACTGGA AGTTACAACA23461 CAATACATCG GGTCAAAAAG TCAGATGGAA CAAGAAGCAC TATCACACTT TCGGGATTTA23521 GGTATTTATA TGGAGTTGAG AGAGTAGGAG CATATTTCTA TGTATTAGGA CAGTCAGCCA23581 CGCAAGTAAA TGGACAATAC CCATTACGAT GGGCAAAGTA TGACAGTAAT TTCAATGTTA23641 TTGAAGCAAA AAACATCTTT GATACTAGCG TCGTAGCGTA TGGGATGGCG TACAATCAAC23701 AGAAAAATGA AATAGCTGTT CGTACAAGCA TCGGTTTGTT TATTTTTGAT ACGCAAATGA23761 ATAGAACATC GTATGCCATT GACCAGCCAA GTTCAATGGG ATCATATCCT GGTATAGCTG23821 TTAAAGATGG TGAATATTTC ATCCGTTCAG ATAATACCAT TTTCATGGTT GAATTAGGCT23881 CGTTAGGAGC AAGAAACCTT CTTCCAACAC CTGTGACAAA GACAAGCACA AACACAATGA23941 AAGTGACGTA TGATTTTATG TTTGTATAGG AGGGCGGACA TGGAACGATT TGACATCATT24001 TACAAAATCG GCGCGGCTAC GCTTGGGGCT GTGGTTGGGT ATTTGTTTGG AGAGTCAACA24061 GGACTGTTGC TCGTGTTGTT TTGGATGGTC ATCATTGATT ATGTGAGCGG ATTAGCAGCT24121 GGCTACACAG AAAAAACTTT ATCCAGCAAG ATCGGGTTCA AAGGGATCAT CAAGAAGGTC24181 ATGATTTTTG TCATGGTTGC ACTGGCTCAT TTGGTCGATA GCGCTCTTGG AACGAAAAAT24241 ATGTTCCGAG ATGCGACTAT CGTTTTTTAT ATGGCTAATG AGTTGTTAAG TATCTTTGAA24301 AATGCGGGAA GAATGGGAGT GCCAGTTCCG GAACGACTTA CACAGGCGGT GGAAGTATTG24361 AAAGGAAAAA GCAAGGAGGC GGAGAAAAAA TGAAGATCGT TTTAGACGCC GGTCATGGCG24421 GCCACGATCC CGGTGCTGTT GCAAACGGAC TGAAGGAGAA AGATCTGACA CTCGCTATCG24481 TAAAACATAT CGGCAAGATG CTCGGGGAGT ATGAAGGGGC GGAAGTACAC TACACACGGA24541 CGGATGACCG TTTTCTTGAA CTTTCCGAAC GTGCGGCGAT TGCAAATAAA TTGAAAGCTG24601 ATTTACTCAT TTCTGTTCAC ATTAACGCTG GCGGGGGAAC CGGATTCGAA AGTTACATCT24661 ATAACGGTAA CGTCAGCCCG GCGACGATCG CTTATCAAAA CGTGATTCAC CAGGAGCTCA24721 TGAAAGCAAT CGGCAATGTG ACTGACCGCG GCAAGAAACG CGCGAACTAT GCCGTATTGC24781 GTGAAACGAA CATGCCAGCG ATTCTCACGG AAAACTTGTT TATTGATAAC GCTAATGACG24841 CAGCAAAATT GAAATCCGAA CAGTTCCTGC AACAAGTCGC ATACGGCCAC GTGCAAGGAA24901 TTGTCAAAGC TTTCGGTCTC AAGAAAAAAG TGAACTCCCA ACCGGAACAA AAACCGTCTG24961 ACGGAAAATT GTATCGCGTC CAAGTCGGGG CGTTCATTGA CAGGAAAAAC GCCGAACGAC25021 TGGCGGAGGA GTTGAAGAAA AAAGGGTATC CATCCTTTGT TGTAGACTAA CCAAAAAAGA
25081 GCCTACTCGT TGGGAGTAGG CTCTAATCCT CAAACACCTC ATTCATCAAC TCTTCGATCT25141 CTTTTCGAAG TTTTTCTGTT TCTTTTTTAT CAATGGTCAC CTCAATAATA ACAACATCTC25201 CTACGTCAGC CTCTTTAGGA AAAATATCCT TCGGAAAGTC AAGAGTTTTT CTACCGATCT25261 CAACTACTGC AATATCGCCT TCAAAACGAT CGACGATTCC TTGGACTTTT CTTTTCATTT25321 ATCGTCTCTC CGTTTTCACC GAAAGTGTTG AACCATTACT TGTGAACACA ATCGTCCCTT25381 TTTGATCAGT TCTATAGACA GTAACTTTCG CGGCTTTGAG GCGATTAAGT ACCTCTTTTG25441 TTGGATGACC GTATGAGTTT TTCCCAACAG AGATTACTGC ATATTTTGGT TTAACAGCGT25501 TCAAGAAAGC AGTACTTGTA GAAGTTTTAG CCCCATGATG GCCAACCTTA AGAACATCCG25561 CTTTCAAAGG CTGTTTAGCT TTTATCATGT CAGATTCAGC TTTAAATTCA GCATCCCCAG25621 TAAACAAGAA GGTATTTTTC CCGTAAGTCA ATCGTAAGAC AGCACTCCAG TCATTTGTAT25681 CACTTTTGCT ATATGTTTTT ACTGGCCCGA CAAACTTAGC TGTTACTCCT TTAATTGGAA25741 GGGTAACGTT TGCTTGTGCT GTTTTGATCG TTAGTTTTTC ATTCTTTACT GCAATGAGAA25801 AATCCTTATA CGCTTGGGAA GTATGGCTTA CTTTTGGAGC ATACACACTT TTTACCGGAA25861 AAGCCTTAAG AACTTCGTCA AGTCCGCCGA TGTGGTCAGC GTCTGGATGA GTCGCGATCA25921 TGATTTCGAT GTCTTTTACC TTTTGCTTTT TAAGATAGGC GACGATATCG CTGCCATCTT25981 TATTTCCCCC GTCAATGAGA ATGTCCTCAC CATTTGGTGC CTTGATATAA ATACTATCTC26041 CCTGTCCTAC ATTGATAAAG TGAACATACA GTGTTTTGGT GGCGGCATTA GATATTGAAC26101 TGGTTGGAAC GATCATAAAA GTTAAAGCGA CAATGACGGA AAGTAGAACT TTTAATTGCC26161 TCATACAAAT TCCCCCCCTT TTTGATACAA GAAAATTATA CATATATTTA CATTGGAAAG26221 GAAGGAGGAA ATTGACTTGA TGAAGTATTT ATAAGATGAG TTATTATCTT GAAAGGAGAA26281 ATGACATGTT CCCGATCATC GGCCGTCTGC GCTGCCCCAT CTGTTCAGAG CCGGTTCGAC26341 CGGACGAAAA AGTGTTCATA GACATCATCA ATACCATTAT GCACCAAAAA TGCTACTACA26401 AGTTCCCGCA ACGCCGTCTT CCCATAAAGG ATGAAGGCTC ATTTCAAAAA ATGCTTCTGA26461 AGCATCCTTT CTTTCATCGA GCAGATGATG AGGAATAACG AAAAAAGGTG AATCGCTCTA26521 CCAAACAATC CGAGCTTACC CGTTTTGGGT AGGCTCTTTT TTTATCCATA TCTCTCCGAT26581 TGGCCGCTTC AATTCCGTAC AAATCGCATA AGTAACATCG AACGACGGAA GCTGTTTATC26641 ATTGACAATC GCACTTAGGG TGCCAGGATT AATATTGATC CGTTTTGCGA ATTCCCCGTG26701 TTTAATACCT TCTTCAGCCA AAATCACCTT TAACCTGCAT TTATACCCTG TCTGACTCAT26761 CGAATCCCCT CCGGTTTTTC ATTTCGCCAT GAAGTCGTTC CGATCCTCCC GTAAAGGATA26821 AGCTCTATCA GAATTTTTTT CGATGGACAA GATAAAAGGA CAATCAGGGC CACATAGGCT26881 AATAACACGA AAGCCCCACG GTAGCAAAAC TGCCCCTACA CTGTATTAGA CGTCCTGGCA26941 TCACTTTTTC AACATGATTC TGAAAGTGTT GCATGTTCGC TCCGAACACT TTCGCTACAC27001 TTTATCCCTA GTGCACTTGT ACAGGCAGAA AGGCTTATGC GACAAGGAAG TCAGCGTGTG27061 TTAAGGAGCG AAGCGACGAG GCGGAAAGAA GGGGGAGCGC GAGGGGGAAG AAAACCACAA27121 ACAAAAGAGG TGATAGGAGT GCTTTTCAGA CGGCAGCCGG AAACCATCCC GTTTCGAGAT27181 TTTATTTACG GAAAACAGAC AGCTGGAAAA CGGGCGAAGA CAGGAGTAGT TGTCCCGGCG27241 TTGTTCCCGG TCATTACTCC GGAGAACTTA TTTCCTATCC ACGATCACGA CTTTTCGTTA27301 TTAATGATAG GGGTTGGATC AATTACACTG GCAGCCTTTT TAGAAAGAGG ATTGGTTATG27361 ATAGGAATGA CCGATATGGC TGAAAAAGTC GCCGGTTGTG GCCGCATTGT ATTTCCGATT27421 GCAGTTTACG GCGCTGTGTT GTGGCTATTT TTCAGTCTGG GAGGGCTTTG ATGTGATAAA27481 AGAATGGCTG CAGAAACAAC GCGCAAAATC CCAACTACGG AAAGCGTTCC AGGCGGCTGG27541 GTTGTATGTG GCATATAAAA GCGGTGATAA GGAAATGAAA GTATTTCCAA AGGTCCATAG27601 CGTTAGGATC GGAGATGATC AGACGGAATA TGTGTTCACT CTTATCAATG GAATGGACCC27661 GAAGGAAGTA AAGAAAAAGG AATACGTGTT CCAGCAAGTC TTTGGTCGAA ATACAGAAGT27721 CAAAGGCGAT TTAAAAAAGT TTGTATTGAC GGTTCACAAA AACGGGTTGC CGAAAAAAGT27781 GAAATATCGT TATTCGGAAA TATATCCGCT AATAAAAGGG CTTTTACTAC CGATTGTTTG27841 TGGAAAGGAT AGGCATGGTA AATGGCTCGT TTACGATGCT GTTAATAATC CCAATTGTTT
27901 GCTGTTTGGG CAGCCTGGTT CTGGCAAGAG TTCCATGCTT CACAATATTC TTGTGACGCT27961 GATCCAATAC TATACTGCCG ACGAGCTTCA TCTGTATTTG GGGGATTTGA AAATGTCGGA28021 GTTCGGGATA TACGAAGGAG TTGATCATGT TAAAAGTCTT TGCTTCCAAG CGAATGAACT28081 TGGTCCAGCA CTTGAATATC TCAAAAAGAA AGAACTCAAA AAGCGTGGGG AACTGTTAAA28141 GAAATATCGT GTCCGACATA TTAGCAAAGT ACCGAAAAGT GAACGCCCGC CTTTTATCGT28201 TGTGTGTGTT GATGAATTTG TGATGATTAA AGACGATGAA ATAATGACGA ACTTACTGCA28261 AATTGCATCA CTGGGTCGTG CTTGCGGAAT TTTCGTTATC TTATCAATGC AACGCCCTTC28321 ACATACGATT CTAAACACGG ATGTTAGAGC AGTCTTATCT GTCCGCATGG GCTTTCGTAC28381 AGTTGATTTA CGCAATGCAA TGATCGGTGA AACGCCTGGC AGCGAAAAGA TCAGCCTTGA28441 TACTCCAGGT AGGTTCTTGC TGCGCTTAGA CGACTTGATT GAATTACAGG CTCCACACGT28501 TACGGAAGAC ATAGCCGAAA AGATACTCAA AAAATACAAA TCTGATGGTT GGAAAAATCA28561 TTCGTTCATT GTTACGCAGG TGCTTGAAAA TAAAATGCAG GGCAGTGAAG AGGAGTTAGA28621 TCGGGACAAA ATTTTAGGGG TGTTGGACGG TGCTGACAAA GCGAGATAAA GCCATTATTG28681 CCGATCTAAG GCGATTCCGC GTGATGAGTC GAGACGACAT CGCCGATATC CATTTCAAAG28741 GGCTAAAGAG ACCACAGGAA AGCGCCAATA ATGTCCTATT GCGGTTGGTT CGCGACGGTC28801 ACATACAGCG ATCCACGGCA TTTGTTCCGT ACGTGTATTT TTGTTCCGAC AGCAACATCA28861 AAAAGAACTC GCAAAAAATC CCGCACTACC TCGAGATTGT CAAGGTGTAC AGGGAAATTA28921 TTTCAATCGG ACCGGTCGAG CAGTTTATCG TCGAGCCGAA ATATCGGAAG GGGCTGGCCG28981 AACCGGACGC ATTTTTTATT TATCAGCGAA CGCCGTTTTT TCTTGAATGC CAGCGCACTT29041 TCTATAGCGA AAAAATGATT GAGGAAAAAC TGAACCGATA TGTCGCGCTG TACGAGAAGG29101 GATTGATCGC CGATGAGCCT TGGCAGCCGT CCGGGAAAGT AGTATTTCCA TATATTTTGA29161 TCATATCAGA TACAAGGTAC GCTCTAAATC GGCAGTATCC TTTCCGAGTG TTCCAGGCAC29221 CATCGTTTTT GAGTTTTCTG CGGTCGTTGA AACAGCCGCA GCAGTTCCAG CAGTCATCCT29281 ACTCTGACAT AAAAGTGACC GGCGCGCGAT TGAAACTCCG GGATCAATGA GGAGGGGAAA29341 CTGTGATGGA TAAATTAATA GTAAAACTGC TTGTTCTACA TGCTTTTATA GCCGATCAGC29401 GGAACGAATA TGCTAAAATG GAAACAGAAG ATGTTGTGGA GCAGGCTTTT GCAGAGGGAA29461 TCGTCGCAGC GTGCGAGTTT TTCGAAGAGG CGCTTGAAGA AATGTGGAAT GAAAGCGTTG29521 TTTGAAAACA ACGGAAGAAG AAAAGAAGGC GGAGTGATGA CTCCGCCTTT GTGACCACGT29581 GACCACATTT TGACCACATA TTTAATAAAA ATCTATGATT TTTTATTAGA TTTTGCAGGC29641 GGAGAAGCTT TAAAATCGGC TTAAAATCAA TGATTTATGG ACATACGAAT AATTTGATAT29701 AAATCAATAG ATGATGGGCG GCATGATGTA ATCATCGCCT CCAATCCTTT GGAACACTGG29761 GGTTCTCCCC GGTGTTCTTT TTGTTTTGAC CACATTTTGA CCACAAAAAT CACAAATTTT29821 TGATGAAGTT GCTCATAAGC TCGCTAAATT TCTGCGAAGC TCTTTCTTCC AAATCTTTTG29881 TTAGATGCGC ATAAATGTTC ATGGTTGTTT CGATATCTGC ATGGCCGAGG CGCTGCTGGA29941 TTTCTTTGAT TCCCACTCCT GCTTCAATAA GTAAACTAGT ATGAGTGTGC CGAAGGGAGT30001 GAGGTGTTAC ATGTTTATGA ATGTTTGCGA TTCTCAGTAG TCGCTTCATT CTGATTTCAA30061 TTTTTTTAGG CACTTCAGGG AATCCACTTG GCCGCGCAAA CACGAAACCC AGATCGTGAT30121 ATAACTCTCC CATTCGCAAT TTGATTTCAT TTTGCTCAGC TTTGTGCTTC TTTAGAAGTG30181 AAATGATGTT TGGATCGACT TTGATGGTTC GAATGGATCC CTGTGTTTTA GGGGGCAGTA30241 ATTGATAATA TCTTTCGTTG TTTCTAGGGC TGTACAACGT TTTTGTAATC GCAATGGTAT30301 GTTCTTTAAA GTTAATATCC TTCCACTGAA GGGCCAGTAA TTCACCTAGC CGCATACCAG30361 TGTATGCCAA GGTTGAAAAC ACGACATAAT CCATAGCCAG CCCGTGTTCT TGGGCTGTTT30421 TCAAAAAGAG GGCTAACTCA TGCTTTTCTA AAAATTTCAT GTCTTTATTG CCGCTTTCAA30481 TGTCCTCCAC TGTTTTTTTC TGCTTAGGCA CCTTGGCGTA TTCCGTAGGA TTAGAGTTTA30541 TCAATTCCAA TTCCATCGCT TTTTTAAAAA TCATTCTCCC TGTTGTATGA ATTCCGTCTA30601 GGGTGTTGTC AGCATATCCT TTTTTCTTGA GATCCAAGAG CATGTCTTGA TACATTTTTC30661 GTGTAACATC CTTTAGCTTC AAGTTGCCGA AGTAACGCAT CAAGTGCCCC AGTTCATGTT
30721 TCCTGGCGCG AATGGTGCTT ATTTTTGCGG TCTCGCTGTA TATTGCTAAC CATTCTTGTG30781 CGAAATCCTT GAATAGTATA TTCGTGTCCT TCACGTAGCC TCCGTTATTT ATTTCGTCGT30841 ACAATTTGGC CGCAGCCAAC TGCGCTTCTT TTTTCGTCTT AAAGCCTCGA CGTGTCGTCG30901 TTTTCCGTTT CCCGGTCGCC GGATCGATGC CGACATCCAT TTTAAACATC CATTTCTCGC30961 CGTCTTTAGT TTTATACTTT TGGAACGAAG CCAATGTTAT CGCTCCTTCT CGTCTAAGTT31021 TTCGATGATG ATTTTCTGAC CGTTTTCTAA AACGCCAATG AGTTTGCCAT CCTCTACGGT31081 GATTCGTGAA AATACAGGAA TACTGGCCAT TGAGCAGCTG TTTGTTTTCT TTTTCTTCTT31141 TTTACGATAG TCGTCCAGGC GAATAATCAA TTGGATCACC TCCTTTTTGT GGGAATGTAT31201 GTTTGGTTTT GTGATTAAAA CAAATTACAG ATTGCGCTGA GTGTTCCGCG CAGGCTAAAA31261 AAAGGGCAAC AATCGCGTTG CTCTTTTAAA TTTTTAACTC CCTCTTCAAA TGAACCACCT31321 CTTTAGGTAT GCCATAGGTT GCCGCAACCT CATAAATGGT CGCATCGGTA CTACGATATG31381 TATAAAGCAC CTCATCCGAC AGAAGCAATT CCACGGCAAA CTCATTCGCT TCCCTTTCCA31441 CTTTGTCCAT ACAGAAAAGT GTGTTTTTTC GCAAAAATGA AGTGCTAAGT TCGGGATGCA31501 AAACCGCATG CCCCAGCTCG TGCGCGCAAA CGAAGCGTTG CGTTGGCTCG TCCAACTCTG31561 AATTGATATG AATGATCTGA ATCCGACGGA ACGTATGATG ATACCCGTAT ATCCCGCCCA31621 GCGGCTCAAA CAACAGCACA ATGCCTTTCT GTGATGCGAT CTCAAAGGGG TTGTTCGTGC31681 CGTGCTTTCG GATCATCTTC TCTACAATTT GTTTGATCTT TTCAGCCATA GCGAACCCCC31741 TGGAGATGGT TATTCTTTTC GATATTTCTT CGGCGTGAAT TTTTGCTTTG CGATGCGCTT31801 GGCGAGGCGG AGAGAGTTTT CCAAAGATGC GATCAGCAGT TCCCGATCCT CTTCATCGAG31861 TTCGTCGATG TCCACTCCGC CGAATGCAGC AAATCCACTT CCTGTCTTGA GTCCATTGAT31921 GATCTTTTCC AGCTCCTTTT GGATGTCGCG CTCGTCTTTC TCGGTGAGGG CGGGGAGCTT31981 GGTGTCGGTG GGTTGGGTAT TATCCCGTCC TAATAGGTAG TCAATACTTA CATCCAATGC32041 ATCAGCCAAA GCTTTTAAAG TATCCATGTC TGGTGTTCTA TTTCCACTCT CATATCCCGA32101 AATTGATACT TTAGTAACAT TAACCTTTCG TCCCAGTTCT TCTTGTGTTA AACCTTTTGC32161 TTTTCTTAAC ATTCTTAGTC TTTGAGGGAA ACTCATTTGT AAACCTCCCG TTAATTTAAT32221 AACTCTACAG ATATTATAAG TTAACAAAAA GCTAACGTAA ATCTTGTTAA CAAATTAGAA32281 ATTTTTTATT GACAGTTAAC TATAAGTTAA TCTATAATGT AGTTAACAAG AGGTTAACTT32341 AAGAAAGGAG GAAACGGGTT GAACAAAAGG AGAGAAAAAT TAATTAGCAC AAGAAAAAAA32401 GAAGGGATGA CGTTACAGGA AGTGGCAGAT AAAGTAGGGA TCAGCAAGCC ATATTATTGG32461 CAGATTGAAC AAGGAAAAAG AGGGCTTTCA TATGAAATGG CTGTAAAGAT TGCTCGTGTT32521 TTTAACAAAA GACCAGACGA TATTTTTTTG GTCGGGGAGT TAACTTATGA GGAACAAAAG32581 GAGGGAGTTA AATGAACCAA TTACAGATTT TTAATCATCC GATGTTCGGC GATGTTCGAT32641 TCGTCGAAAT TAACAACAAT CCACACGCCG TTGGTAATGA TGTTGCAAAA GCATTGGGGT32701 ACAGCCGGCC GCACGAAGCA ATTTCAAGTC ACTGCAGGGG GCGGTAACTT ACCGCATCCT32761 TACTAATGGA GGAGAGCAAA CGGTGAAAGT TATCCCAGAA GGGGATATAT ACCGCTTGAT32821 CATTAAGGCG GCTGATCAGA GCAAAAATCC GGAAATCAGA CAAAAGGCGG AGGAATTTGA32881 AAAGTGGATA TTTGAAGTAG TCCTCCCAAC CATCCGTCGA ACCGGCGGCT ACGTCGCGAA32941 CGAGGACATG TTCATCAATA CGTATCTTCC GTTCGCGGAC GAACAAACGA AAATGATGTT33001 CCGCGGCATG TTGGAAACGG TGCGGCGACA AAACGAACAG ATCGCGGCGA TGAAGCCGAA33061 AGTGGAGTAT TTCGATGCGC TGGTTGACCG GAACTTGCTG ACGAATTTCC GCGACACGGC33121 GAAAGAACTG AAAATCAAGG AACGGTACTT CATCAACTGG CTTTTGGAGA ACAAATTTGT33181 GTATCGCGAT CAGAAAGGGA AGCTCAAGCC ATACGCGGCG TATGTTCCCG AGCTATTCGA33241 GCTGAAAGAG TGGGAGCGAA ACGGCAAGGC AGACGTGCAG ACGCTCATCA CGCCGAAAGG33301 GCGAGAGACG TTCCGGTTAT TGCTGAAGAA AGAAACGGCG TGAAAAAGGG GGGAGTGATA33361 TGAAAAAGAG GCCACCGAAA AATCGGCAGC CGAAAGGAAT GACGGATGGT AAGCATCTTG33421 ATATTTGATT TTGCGAACCA TGTTTATTTT AGCTGAACTG CTTTTAATGG TGAAATCGAT33481 AAATTGTGTT AGTCCTACAC TTGGTCTAGT AGCGGTAGTC ATTGCACTCA CCTCCCTTCC
33541 CTGGTCACTA TTCGACAAGA AGGGAGGAAA TTCCTACGAA AAAGGAAGGG AAGTCGTCGT33601 GCAAATGGTG GTGCAGGTTC CCGACATTGA TAAATACGTG AAGGATCTCG TCCGCCAAGC33661 GTATGAGTTG GGGGTGGAGG AAGGGCGAAA GAGATATAGC TATCCTCCAG TCCTAACTAG33721 GAAAGACTTG GCAGAAATCT TCCAGGTCCA GCTTTCCACG GTAAGCAATT TAACCGATAT33781 TCCAGGCTTT CCAAAGCTCA CGCACATTCG CGCCCGATAT CCTCGCGATC AGGTGTTTCG33841 CTGGATTGAA GAAAATTCGA CCTATTTGGA TCAAGTGGCG CCTAATCGAA AAATTGGGAG33901 AAGGAGTGAA TGATGATGCA AATCAAAAAC ATCTCGCTCA ATGAGTTGCC CAGCGGCGTC33961 AGAAAAGTAG CAGATCGGGC GATAGCGGAA TGGAAAGTCA GAAATGTTTT TCGAGTCACT34021 GAATTGGATT TCGGTGATGG CCGGGTGTAC TACGAGATCA GCGCGATCAG TGACAGCTTC34081 ATTCTTGAGC TGAGTGTCAG CGAACTGGGA GTTGAACACG TCAACCGCAT CGGAGTGGAT34141 ACGGTTCGCG ACGCGATCAA AGCGCATCCA GAACGCTTCG GTCTCGAGTG AAGGAGGTGA34201 AATGATGTTT TTGCAAAACA GAGAAACATT TTGCAAAAAA CGATATAAGC AGTTGCTACG34261 AGAGGAGATC TTTTTAATCG GCTTGGCGGA AGTAATTGAA GCCAGCGGGG ATATAGCGGG34321 AGCTAAACAG GTTTGGTCGC GTGTGTGGAA AACTCGCGAA GCGAGAAAGA GTTTGTGCGG34381 CCGGATGCCG GTATGATGAT CAAAGTTCGC GATTGGTTGC AGATGTCTTG GGAAGAACGT34441 TTTTGGTTGC TTGAAAACGA AGCCTACCGG CAGTGGAAAA ACAGAAAACG GTTCTGCTTG34501 GACAACAGAA CCGTTCGACC ATGAAAGAGG TCGGTGAAAA TTTATCTTGC TAATTCCATT34561 TTATCTCACC GACCTCCAA表2高温双链DNA噬菌体GBSV1的氨基酸序列,SEQ NO.21 NTREVITDES SICLRVARST IISNSRVPSL RCTIGMSNSI RFVLHRLLGA SYRNYFRGDP61 MKPIILAETA NMDHLEWLRL RQKGIGGSDA AAIAGLNKYK SPIQVYYEKV EGVKESVPSE121AAYWGTILED IVAQEFSRRT GLKVRRRNAI LRHPEYPFMI ANIDRLVVGE DAGLECKTAS181EYLKDEWVEG EKIPDQYFIQ CQHYMAVTGR SRWYIAVLIG GNKFRWDVIE RDEDIIRYLI241EIESEFWQRV IEKRPPEVDG SEASTNLLNL LYPVESVVDD EAELPREVEA LIAELEAVNA301EIKQKSERKA EIENKIKALL GERERGRTSE YVVKWSVVNS NQFDSKRFAK EHPDLYQQYI361QTSTYRRFSI SKNKSRKAVS QLKRRNKTNW QRKRTETARR PRSRTAAKSP SPISCKRSRN421SNGRCRSILM PIGFASPRKC AEILNCCHAK SRACSARSCR RLNWALSRAC SDIATSSHSK481IGKTEQKKCN SSAIKGSTFA DPARLKRLKP KQSMKTMNLN LNTDLMNVYV TSRCCSVTAE541NLGSMRTPSS KTADMRFTLC RSKKSIAFVT NTLGRKSPGR GVRNTKQWPR KRSSGSSSIC601RSRLKFSATS RSMKRFARTS MTNRSKWTIL TWKLRRLKER LSMGIRANRR QRRNSRSFLT661RGHDAPWLWE VICYDDISHR KKRKLCGARG IPSRSGTILA SKRIAGFYAI YAERLGVHER721LAKSQQKWPR CDIPHYERVN RGWLCYSRRK PGRGQIRKSG VCRSRSEAIT AYKSGYGSTV781YKSVSGKSVS GKSVSGKSAT TKYLKYGPSQ NQQPCFSVLR KLSADAFFCR YNPLLVGSFS841GRNCSLCHEK SATKCPENQI HRNPCLGNAK GSNVRRCGPL GQAVTKTSET KTRGCCQWLG901SPASRKPIYK RRKNFALRTR EMGRRRHLPR WTSINAKNVK ILASSSITIQ RVFAIASCSG961ESNGFLNRPR SPKPSVVTSR TSQLKTVLRS SGTPLDALST SNRFRVSDIA GGTLWRSLAL1021 LEPEKHIYRR WPIISEIKLR FSIFRTGKDS TKSKTILRAW NKNANGKLLR SCLSMISSNV1081 ARPNSRSKPC ILSTIGISIT SRLWYRRSAR MTCISMKRSA LALSKCVGII SLKSSVTGNY1141 TIDPDKKWEV ETLIEFVGFD GPAPFSIIFW QRNGEYVSKT ICYTKEEERE AVSRFYTPND1201 FYGAFDAAII SRREGTVKTV WRWACVVQSV VVGQRYWMFV KSEIRSADGN VRIVRDSILK1261 KSLSLTRMNT SSVVFIVFGL YRGGGMFREW VIWRPSLTCR RRWIIISGAN KTFRRIWILF1321 RIWMRLMWKS RNYRTKSGIS SFGVLIGICD VKRRSWNTLT DCISFVGWRL AFLVNLNQCH1381 IAMITFGNNS DHNVMSTLWR VRCSGISHSI FFLDENFWGS VGRKLCEHTW KRTRSIISGK1441 RRGIEVIAWG SAQPIAAWHL NIWSNTQIVA TGRGWLTLKR CRLLGRFGRG SASLVLSQKR
1501REWSTSLGLL MDALPLMRNR RGNAPGSLSI TSRTIKWLFN LGATKERSPF SWWSSQKSMR1561CISSVFRTRR NGGCKRSKVA GSRSRINGLS IATLSNQAGG FRIIGARDEN DYSIYSRPAG1621MDEKRRCADV LLVVLPSVFQ MYDKVRCLSA FWWNHSKVEG VRSCSDSRFY TKVAMKMSSS1681KFVCLVKWGT SYKRSEVRFM RTLLAIYPSV MKIKGISSMS KKSLVRWKSK AVIHDGRTEI1741STVCYSRICN GGGYRCSCAN TSLAKKRAIP CGPKVRRKSS PICGKLAVLP TWLDRLKYAS1801LEKTNSNYTI LNSYGDVGDR IKRPGFSRPI NGGQDTHIIP REGFEWSTNK YPLYATWMGR1861RQKRLWRQRL KNTGYTCPFQ TSISREHKRT LSCRRRRRMH FTHLQKARRS ARILNASVTS1921TWREYAARIS SIEWKGNLLS DIHWKSLMIT RYITKWESVN RGSTAFAKRR STSWPLHCGL1981RFTKKRHLCR CLFSFTLVQN HIIWKNIKLL VKIYLLLLLT MVILSEFSFS LGTLRKGRIP2041YGFIRHYPKR KYGKDKAILL LYKKTCLRHL VTRFLERMGS FRELRSCEFE KAVNQGCFFS2101GSGFYTLFLF ADKELCLAGR IGDYLIYISA FGLSFACTKI NCEIEVLSSR EGRALSVEVF2161KSVRIQDHCF SLCIFRRFMK HKNIQNSDEC VKWIQGYTHI FLWREQYIFS LGVANMELKR2221GTKCTFPSYT TRVCKMGGYL KFERIRGCLA MAFHFIAIVS HYAEGKRIGW HYADEEKLDK2281KIIHEFLAQV KKKCGDVQLG IHKLSTDSVS WDSVVQRDAF FKDVIITSDI ETFIELVSQD2341QELTAYDVAK FILSVIPSSH LKLQKLLYFA YAEFLLRTGE KLFREPILAF KYGPVVESVF2401HKYKVHGSSI IDYKEDETVC IYPEELAVTP SFMKIASSEH GLIALDCLID VFEKYGHLSA2461QELVYKTHRP GGPWDRVYKP GKNAVITDDL ILQYHHVIQS FETKGFSLSN DDATTENRKS2521EGKMTERRQN ISVTCYDGII RHSDKPDPPS MSVTRSGGVF YFPIRKKKGT ISSCNGSYGE2581IMKGLWLCIH TATKKKPNVP TNAISAKTVY GRSGKCHILF PVHLCGVCEC RNLTKKCRKI2641HSVLKLTKSG ERNVKNMSVL ARFFVIKYTN EYPLLCMRVS GADSRRAQQI PSPQRGCRLL2701SGGFFILAMS TNVGIRKNDA GKFLLLPNVG GRGMRVNIMN SLGNGKANHF DQTLVISGLH2761ILKLTSFYQQ MKLESSILKT MTKKILSYSH FIANSLFYRK KKKPFLKCCI KIFRTLIKLR2821KMDSKAGFKF ISQTTQLLNT ASKIVMNIGD INMKMQLKEY ISHYWLKNRS CNYGFFIWGH2881EFLNKTMEKE TSNTAAVFVS RVQAIWKDNA SEDSASHYTV TKTEFEVERL SQLMFSMSAD2941AQDDERADGK RIGVGGTSGT EVEMIPPPPF CFTADRGPAR GQAFPIARSF FEKGRMVKTK3001KAFIAEIKRQ MKSLGTYKKE YDRMIEIFAG MLHQYYMFEE QFAQSGYKIT EMYTNKAGAT3061NERKTPLYTA MESLRKDIAT YSDRLCLNPK AMEAITIEQQ NKSKLAQVLS ELSGSIKTTT3121RWSTPKASMA GRPVANSRRP SDFSKTWRIQ NMISIQGKRN LLFKSLKKRS FTTRVNDTAH3181HCAVSLFYWN RGKNSLSTIC LVSISKERRS DASKKRLFLY LGKTEKPGLL QRWPGLWLCL3241KGNPGQKSIS QVRRFSSRCS RLNLSCLTYG KWARNKTFGY TITKKTASAA NSAMVPFTFG3301PWQPIQISKT PTVMSGSPMK CTPTKHLSSI TSSKRRRRTR TSSGLQRPAT IPHSVINACN3361IVKRFSMKRR MRLISFSFAK LMKTKTAKWI TQTRLNMRKP IRIMARFGRT TTTRSRRKMT3421HSSGKISWRS HTSIRRRRHI STSTSLRSLI GNMIGRLNSL RNLVSTGSVV RTFLNCMTPQ3481QRSMETITAL IPLLMRGSRS SRQREKQRKT IFRCLVGVMT AGSRPIHRQS TFPMWLSGSK3541QCEQKGLKLS KSDLTGSLAG NFSWRNRSDS ILSISHNTIT RNQKVLDGLK SKKTANSITC3601IPRRLNTAFR TCMRLKKLTT SNLRKLRTST VLIFLTLLYS RQSGCLKIWR RLRRLRNGKE3661VDGMGLFDRW RRTKRKSKIR ADTGYVGLFM SGEDVSFLVP GYVRLSDNPE VRMAVHKIAD3721LISSMTIYLM QNTEDGDIRI RNELSRKIDI TPYSLMTRKS WMYNIVYTML LDGEGNSVVF3781PKYTADGLID ELVPLTPSKV NFLDTPDGYQ VLYGGQTFNY DEVLHFIYNP DPERPYIGRG3841YRVVLKDIAD NLKQATATKK SFMSGKYMPS LIVKVDAATA ELSSEEGRNA VFKKYLQATE3901AGQPWIIPAE LLEVEQVKPL SLKDIAINEA VELDKRTVAG MFGVPAFLLG IGEFNRDEYN3961NFINSTILPI AKGIEQELTR KLLISPDLYF KFNPRSLYAY DLKELAEVGS NMYVRGIMEG4021NEVRDWLGLS PKEGLSELVI LENYIPLDKI GDQSKLKGGD NSGADGQTDK PTNEHHRDAG4081GTGRDVYRGI LRSVQPNRTI SRSFRNRTGS VRHVEQRHSR ANQSRCIPST RPQQSRNARA4141KSGQPGLVGT DDQPSRCGQS VRACQAWCRS MFVRIHHRRN GVPGRNNQMD TEKSGFTGFS4201GHVPGISGYK CPCSHERIRT TEAATRTKKI TVKGACEKWR CVNCPKKLNN VKLHWTSSSA4261SRNCKRKQRS WNKRSRKHKP KKKFQRWRKK SQSWKMSVTN TRKNRNSKAR SLNRMNSNRS
4321TANSLRINPA KKCKGRKEMW KIVYKCARCK PVNTTNGVRS LSFTRNSATS VRWLAENRSR4381KWLSIASWTL WAITRHFIRW LIKSELRGRL VFSIRILLRL LGLNNPARSR LAMLERLQAL4441ILMDLKSVKR SWITTCCKTA SLILMIMRRR LLALLPRRIW QSLRVQVLQI NSRVSFRAYL4501QKIKPLKPII TCLKILSKSG LIRATTASEK LWLNVLLTTI AWSLVFKSIL TAMWVNFRIY4561ARRTFSDCAL CLITSLTMIL YCLVSSNNIR WSNVRISRLT ARLTSLPKTK PHSVVKGASM4621ENQNQKRLFL RLRIRFKERN FEEVIMPKYV VIKVFKDLQD RQHIYRVGDT YPRKGYKPSK4681KRIEELLGNE NRIGEPLIAE VDEEEGNEWT SLRLLVKSGS VSVRRCVTRI SLRSSTALKS4741LKTKKGWCWM APTLIICLLS IMRHGAIRAE IATGQCLDIC NIGCIISCSM QAVEPRDIRT4801CIDCTGICGG NRQSDTNRNA KNCPMREIGW QKILQRCHVR AAPVCCVCRP QIIQRIGSGV4861RDKVPGNPDI CCFGSRTHLR EGARLWLRKS AGWRMRSQVS YENIRKSPTM NKSWMMQKKR4921LAGRVKSKKL DWYRRATICA DGHGSECRTG GSITRPNIGW LICWNTDMPQ WMAAGFRVRR4981ISGRLRTGLK KNLKIVSRRR SNNEADRVGR SAQGDRFTSC LFAFFKTAKA TVHHVYGRLF5041IELYCSSISG DKRSDRAVYV EKGFSRGKGE SRFGCQQSRL DIRNVYSIRK TIPKSIGEVI5101MTAPSTNKIK YGLRNVHYAV ITEDPVTGAI TYGTPKRIPG AVSLTLEPAG ETFDFYADDS5161AYYSEATNNG YDGELEVANL TDEFRIDVLG DTYENGVMYE NADQVTKPFA LLFEFQGDKK5221AKRHVLYYCK ASRPTVAGQT KAENTEPQTS TLNFTARPRP DTKEVKADTT PEIDQALYDS5281WYTQVHVKGA TTGAMEKTLV IDGKQVRFKS NAATPLRFKA QFGKDFFKEI YKLNAIGELT5341DKDGHFNYEV LEKLDFDFFY NIIWTLAKTA NPSIPDPITW LDQFDEFPLM EIIPELQDLI5401IASIQSKKKM ITKRPVVMNS LPRRFFFCAN AADYKRMTWK KRSACVSTTS TNTFVKPIQK5461NVFGKPHRPI LMRFRGRGTW RTVSKGLQSK SAGIRLGYKR RCRMTPKAVS CRKNCAMWNA5521CSSLIRATSR PWLKNNEFSP NRLRQRPKST NCAPHKAKWR NSSAREKSEN SNIGTFDVKS5581NLRKRNANTS NHCRRYRMSS KRRSSPPNGW NRCLKQLAKV WMILLMFETS SMRSRTEKHR5641LHSRMLVKSG RPYLVPASII RCARRSTSLR AALNWIKSKR ILTRLPKRRI VPKRMCRDAR5701RFPVLLADLR PAEGQGRSTR RWIPPDTQKL IFRSMCRRNR KRPLKMLSTQ SRHMVLMRKR5761RLRACGVSGR MLTPAMRQTK RLLKVPALRL PIQALISPNS KKSMKLAASS RFQMNRRSAS5821IAFCRSDFRL INWISSQNMA SSCNELATML RKFRRFLRQA LKLVHGILTT CSTGRKDGFV5881QNSDRALIRR RPNSLKGRAY RRHSFKSGAE RSRQAGNKGK KLCLKSHRRL PVSKMRQHRM5941RSVKFSVQCG KTRARTSPRP FSIISTWRTR KTTKICSMIQ FQKLMLTRPS FKRRLATKRH6001SLCQSSLLLS GLSLAGCQPI RSYLQQSLRL ALSAFFRARS WHWRPFYIQY RMSSQLSRRC6061YQCGMRLKRR GRLDLPLLCR YLCPSSLKIG SQLKNFSLSY GMELKKSLKQ LLMQSDRFAA6121RGRAERLHKR CGMELKPILK RFSMYIKRFL RSHGMQSERR SRFGMDRPQQ SLFLTHKTPF6181LTPGTRLRRS QRLYGTRSKA WRRMVFSKQS AQPSTLSKTS SRRHGMQSNH HRAYGTVSKV6241LPMTRFATLF QRLSRRKTES PACGREKTLK PHTRSFQSLI VLLAVTHLKS PTGCRGAEKE6301STSRKSRCWR KERTISAAAT RLASRGRSWY NCHAVRRCIR IPRQWACWAE TFRSTSKIPF6361EMKGILKKFR VNYTCLNGQN EAGDRDDVVV WYPFERTKHS CFAFKTCFPV RSLIDPWSRR6421HSFSSIIRNA LDNRLFTFTP RQSYRSNSAA FSNINQNRSI VNDYTCLLHR EASWYFCSRY6481AQNIVKFYAC VYLPTIRIRD KCIEANNCKR QTNYSRQRNI RCASDHKNIE CKHKFTVFDA6541QTNHFKHTSN KRCINDLNDR FVKRYEHFRQ NDGHVFDVAT GNKCNDHRGT EYAKRIGGMA6601RTVFVGGENV ELFKKNRTSD GAVRPGREAT WCSEKRIQHT RNIERRSAHI LPSTRSPCSS6661DDERYGNYLY GQAFFYCRDE RARCERQTDP RYCLSIVFRE IARYVFNRDN DKDTESRTRA6721NYFAYRLGCR SCRGAVVTRN AAFDEKEEVG ALGNQAMGEN HRANSIYGKK RNQPCQTDRN6781ESGLWFSIQE KLERNQTDDS CARGDSLVPV RKEWTVHRKR QQQTVCRGLL LVYEFRNPID6841GKAKISQRVC MGRAFLVGWR FNACGTRKAE SIITACHFIS MQSHRLICID RKFTIVFCRG6901LCECVRRTRN QRSNTHCSHA TFSRAVSKSG IVVYDPGDSY ARTRPAHFIG CFVVPAVFYY6961GDQRKNAFYW NICSNSSIAC HYEFQFGERT SWFVVDWTSI DNNDGNFIYV WREANIYDTN7021VPSWICNNWR PFLVVANASW FGIFRCANEN KRWNANHPAR TTSVCICSEL ARRHFPSTAC7081ECDGRNSMEK CYFSIHKPFS ISRWANCQHA SRCSCIFKHR VYIAMANTSS SSVSGSGKYG
7141TNHIELRGEK CWSLIWNVPI IHWLERKISL VKSRCNIKAR FQFAALREWS FLTKSDLGKK7201LKKLRQKTLF RLIKIILNTF TSIQKGSDTG RICRLIENSH LTCLRQRIQN LKTLKSNEWE7261TLLATLLKIS MLVLIREEGR LILLKVIGTV QSRILYLILR LTLQMVRFHR LFGIPMLGHR7321LQPKDITKKI MSGMLIMEKM DQVEVRGVGC VLMDQAFGPW KHWIVEKKWS CSPLVKKEMW7381HHLHWGEFGI ILLTWVQVMP TIHMTQITFI TTLEVTTQYI GSKSQMEQEA LSHFRDLGIY7441MELREEHISM YDSQPRKMDN THYDGQSMTV ISMLLKQKTS LILASRMGWR TINRKMKLFV7501QASVCLFLIR KIEHRMPLTS QVQWDHILVL LKMVNISSVQ IIPFSWLNAR EQETFFQHLQ7561RQAQTQKRMI LCLYRRADME RFDIIYKIGA ATLGAVVGYL FGESTGLLLV LFWMVIIDYV7621SGLAAGYTEK TLSSKIGFKG IIKKVMIFVM VALAHLVDSA LGTKNMFRDA TIVFYMANEL7681LSIFENAGRM GVPVPERLTQ AVEVLKGKSK EAEKKRSFTP VMAATIPVLL QTDRRKIHSL7741SNISARCSGS MKGRKYTTHG RMTVFLNFPN VRRLQINKLI YSFLFTLTLA GEPDSKVTSI7801TVTSARRRSL IKTFTRSSKQ SAMLTAARNA RTMPYCVKRT CQRFSRKTCL LITLMTQQNN7861PNSSCNKSHT ATCKELSKLS VSRKKTPNRN KNRLTENCIA SKSGRSLTGK TPNDWRRSRK7921KGIHPLLTNQ KRAYSLGVGS NPQTPHSSTL RSLFEVFLFL FYQWSPQQHL LRQPLEKYPS7981ESQEFFYRSQ LLQYRLQNDR RFLGLFFSFI VSPFSPKVLN HYLTQSSLFD QFYRQLSRLG8041DVPLLLDDRM SFSQQRLLHI LVQRSRKQYL KFPHDGQPEH PLSKAVLLSC QIQLIQHPQT8101RRYFSRKSIV RQHSSHLYHF CYMFLLARQT LLLLLEGRLL VLFSLVFHSL LQENPYTLGK8161YGLLLEHTHF LPEKPELRQV RRCGQRLDES RSFRCLLPFA FDRRRYRCHL YFPRQECPHH8221LVPYKYYLPV LHSEHTVFWW RHILNWLERS KLKRQRKVEL LIASYKFPPF LIQENYTYIY8281IGKEGGNLDE VFIRVIILKG EMTCSRSSAV CAAPSVQSRF DRTKKCSTSS IPLCTKNATT8341SSRNAVFPRM KAHFKKCFSI LSFIEQMMRN NEKRIALPNN PSLPVLGRLF FYPYLSDWPL8401QFRTNRISNI ERRKLFIIDN RTGARINIDP FCEFPVFNTF FSQNHLPAFI PCLTHRIPSG8461FSFRHEVVPI LPRISSIRIF FDGQDKRTIR ATANNTKAPR QNCPYTVLDV LASLFQHDSE8521SVACSLRTLS LHFIPSALVQ AERLMRQGSQ RVLRSEATRR KEGGARGGRK PQTKEVIGVL8581FRRQPETIPF RDFIYGKQTA GKRAKTGVVV PALFPVITPE NLFPIHDHDF SLLMIGVGSI8641TLAAFLERGL VMIGMTDMAE KVAGCGRIVF PIAVYGAVLW LFFSLGGLCD KRMAAETTRK8701IPTTESVPGG WVVCGIKRGN ESISKGPRDR RSDGICVHSY QWNGPEGSKE KGIRVPASLW8761SKYRSQRRFK KVCIDGSQKR VAEKSEISLF GNISANKRAF TTDCLWKGAW MARLRCCSQL8821FAVWAAWFWQ EFHASQYSCD ADPILYCRRA SSVFGGFENV GVRDIRRSSC KSLLPSETWS8881STISQKERTQ KAWGTVKEIS CPTYQSTEKT PAFYRCVCIC DDRRNNDELT ANCITGSCLR8941NFRYLINATP FTYDSKHGCS SLICPHGLSY SFTQCNDRNA WQRKDQPYSR VLAALRRLDI9001TGSTRYGRHS RKDTQKIQIW LEKSFVHCYA GAKNAGQRGV RSGQNFRGVG RCQSEIKPLL9061PIGDSAVETT SPISISKGRD HRKAPIMSYC GWFATVTYSD PRHLFRTCIF VPTATSKRTR9121KKSRTTSRLS RCTGKLFQSD RSSSLSSSRN IGRGWPNRTH FLFISERRFF LNASALSIAK9181KLRKNTDMSR CTRRDS PMSL GSRPGKYFHI FSYQIQGTLI GSILSECSRH HRFVFCGRNS9241RSSSSSHPTL TKPARDNSGI NEEGKLWINN CLFYMLLPIS GTNMLKWKQK MLWSRLLQRE9301SSQRASFSKR RLKKCGMKAL FENNGRRKEG GVMTPPLPRD HILTTYLIKI YDFLLDFAGG9361EALKSANQFM DIRIIYKSID DGRHDVIIAS NPLEHWGSPR CSFCFDHILT TKITNFSCSA9421RISAKLFLPN LLLDAHKCSW LFRYLHGRGA AGFLFPLLLQ VNYECAEGSE VLHVYECLRF9481SVVASFFQFF ALQGIHLAAQ TRNPDRDITL PFAIFHFAQL CASLEVKCLD RLWFEWIPVF9541GAVIDNIFRC FGCTTFLSQW YVLSYPSTEG PVIHLAAYQC MPRLKTRHNP PARVLGLFSK9601RGLTHAFLKI SCLYCRFQCP PLFFSAAPWR IPDSLSIPIP SLFKSFSLLY EFRLGCCQHI9661LFSDPRACLD TFFVHPLASS CRSNASSAPV HVSWREWCLF LRSRCILLTI LVRNPIVYSC9721PSRSLRYLFR RTIWPQPTAL LFSSSLDVSS FSVSRSPDRC RHPFTSISRR LFYTFGTKPM9781LSLLLVVFDD DFLTVFNANE FAILYGDSKY RNTGHAAVCF LFLLFTIVVQ ANNQLDHLLF9841VGMYVWFCDN KLQIALSVPR RLKKGQQSRC SFKFLTPSSN EPPLVCHRLP QPHKWSHRYY9901DMYKAPHPTE AIPRQTHSLP FPLCPYRKVC FFAKMKCVRD AKPHAPARAR KRSVALARPT
9961 LNYESESDGT YDDTRISRPA AQTTAQCLSV MRSQRGCSCR AFGSSSLQFV SFQPRTPWRW10021 LFFSIFLRRE FLLCDALGEA ERVFQRCDQQ FPILFIEFVD VHSAECSKST SCLESIDDLF10081 QLLLDVALVF LGEGGELGVG GLGIIPSVVN TYIQCISQSF SIHVWCSIST LISRNYFSNI10141 NLSSQFFLCT FCFSHSSLRE THLTSRFNNS TDIISQKANV NLVNKLEIFY QLTISSIMLT10201 RGLKKGGNGL NKRREKLIST RKKEGMTLQE VADKVGISKP YYWQIEQGKR GLSYEMAVKI10261 ARVFNKRPDD IFLVGELTYE EQKEGVKTNY RFLIIRCSAM FDSSKLTTIH TPLVMMLQKH10321 WGTAGRTKQF QVTAGGGNLP HPYWRRANGE SYPRRGYIPL DHGGSEQKSG NQTKGGGIKV10381 DISSPPNHPS NRRLRRERGH VHQYVSSVRG RTNENDVPRH VGNGAATKRT DRGDEAESGV10441 FRCAGPELAD EFPRHGERTE NQGTVLHQLA FGEQICVSRS EREAQAIRGV CSRAIRAERV10501 GAKRQGRRAD AHHAERARDV PVIAEERNGV KKGGVIKRGH RKIGSRKERM VSILIFDFAN10561 HVYFSTAFNG EIDKLCSYTW SSSGSHCTHL PSLVTIRQEG RKFLRKRKGS RRANGGAGSR10621 HIREGSRPPS VVGGGGRAKE ILSSSPNERL GRNLPGPAFH GKQFNRYSRL SKAHAHSRPI10681 SSRSGVSLDR KFDLFGSSGA SKNWEKEMMM QIKNISLNEL PSGVRKVADR AIAEWKVRNV10741 FRVTELDFGD GRVYYEISAI SDSFILELSV SELGVEHVNR IGVDTVRDAI KAHPERFGLE10801 RRNDVFAKQR NILQKTIAVA TRGDLFNRLG GSNSQRGYSG STGLVACVEN SRSEKEFVRP10861 DAGMMIKVRD WLQMSWEERF WLLENEAYRQ WKNRKRFCLD NRTVRPKRSV KIYLANSILS10921 HRPP表3高温双链DNA噬菌体GBSV1的功能基因氨基酸序列及其功能VP289(1176-2045)DNA结合蛋白MTQAEKLKNQLAAKANGNGQAAKKQDGGKVTVADLLQKMKPELERALPKHLDADRLIRIAMTENRRNPELLSCEIKSLLGAIMQAAQLGLEPGLLGHCYLIPFKNRKNGTKEVQFVIGYKGLIDLVRRSGEVETIKAEAVYENDEFEFEYGLDERLRHKPLLFGDRGKLIGFYAYAKFKDGGHAFHVMSVEEINRLRDKYSRAKESGPWREEYEAMAKKTVIRQLIKYLPISIEIQRNISLDETVRKDIHDEPEQVDYIDMEVEAIEGEVIDGDSSEQETAAEQQELFNVP288(2087-2953)噬菌体复制蛋白MTIYRIEKKENYVVLDKGFLHDRELSWQAKGLLAFMLSMPNDWVFNMKDLQNRSKNGRDATYRIMKELIEAGYVTRVENRDGGKFGKVEYVVHEVKQSPHTESPDTVPPCTENPYPGNPYPGNPYPENPPLLNNNNTNYKNTNNDDDNKDRPKTNSLNAFRFYEENFQPTLSSVDIEILNYWLDRFPEEIVLCAMRKALEQNVRRIKYIDRILANWEMQKVQTLEDVARLDRQYELEKQARQKRGGVVNGSVHQHRGSNDRSTKEDERISHYEPGKWDDVDISLDGLLVP133(4149-4550)ATP结合蛋白MDLVPYLVDALDVEIAELQNEVRYFKFWSADRNMRREAALMEYVDGLHFFLSLGLAFGIPREFEPVPHRYDHIRKQFRSLKRYVYVMEGPMQWYIAFNLFLGLGEFLGFSWPEIVRAYMEKNQVNHKRQEAGYVP141(4641-5066)青霉素结合蛋白MAHIEKVPTPWKVIRQGQRIISAFPEKKGMVDFIGIAHGRVIAFDAKSTRERTRFPLDNIEDHQMAFLKSWRDQGAITFFLVEFAKKHEVYILRFSDAEKWWVQAKQGGRKSIPYQWFVKHCDLVKSSRGIPLDYLRCLERVP213(8484-9125)重组蛋白MYAREWSRLIEKSPTNPFPAKRLSLIERRLLYFSDEYECRNLTEKRCRKIPPFAELSRRKGDEIMSELNYEQFGKWVKSESFRPDTGYKLRLAYFKINELLPTNEIREFYPKNIVNDKEDIELFAFLDSKLIVILPKEEETIFKVLYLKDISHVDLEIKKNVGFESWILKIYFTNNTVIELNSVEDSNEYWRYKYEDAIKRIYKSLLVEKSLVVP118(9541-9897)核酸内切酶MVKTKKAFIAEIKRQMKSLGTYKKEYDRMIEIFAGMLHQYYMFEEQFAQSGYKITEMYTNKAGATNERKTPLYTAMESLRKDIATYSDRLCLNPKAMEAITIEQQNKSKLAQVLSELSVP573(9894-11615)噬菌体末端酶MRKYKNYDAVMEYAKSIVDGRKVACRELIQAAKRFFKDLENPKYDFNPREAEFVIQIIEKTFVHDQGERLDGTPLRGEPFLLEPWQKFIIYNLLGFYLKGTKIRRFKEAFIFIPRKNGKTRLIAALAWALALLERQSGSKIYITSAALQQSLQsFEFILF
NLRQMGEEQNFRILNNNQENSISGEFSDGSIYIRALAANPDKQDSLNCNVGIADEMHAYKTPKQYNIIKEAMKAYTNKLMIGITTAGDDMTSFCYQRLQYCKKILDETVTDEAYFVFICKADEDENGEVDYTNPIEHEKANPNYGVTIRPDDIMNDALQAQNDPQQRKDFLAKSLNIYTSPMKAYFNIDEFKKSDRKYDWTIEQLAKLGIDWFGGADLSKLHDLTAAALYGNYNGVDIAITHAWFPIVAATRKAEEDNIPLFGWRDDGWLTMTNTPTVNFSDVVKWFETMRAKGFKIKQVGFDRKFGREFFMAMKQKRFNIVDQPQYYYKKSEGFRRIEKQVKDGKFYYLHSQAFEYCVQNVHAIEKTDDMIQFEKIEDKHRIDIFDATVFAAIRMLENMEKAATATKWLKGGVP202(12815-13423)蛋白酶VERTVKQTRSLQTNITATRAEQDDEMYIEGYFAVFNRETELFPGAFEEIAPEAFNGTLSNDIRALINHDASLVLGRNKAGTLELKVDSRGLWGRIKINPRDSDAVNLYERVKRGDVDQCSFGFNIIEEETEFRDDGTIKWTLKKVDLHEVSVVTFPAYPDTSVHARMKEYEQHKKRQLEQRKLQLKERVRNGVASINVDQKNVP425(13383-14663)噬菌体结构蛋白MALRQLMLTKKIEQRKAALDELVKREQELQAKAAELEQAIEEAQTEEEVSAVEEEVAKLEDERNELNEKKSKLEGEIAQLEDELEQINSKQPSNQSRQKMQGSKGDVVEMNRLQVREMLKTGEYYKRSEVVEFYEKFRNLRAVAGGELTIPEVVVNRIMDIMGDYTTLYPLVDKIRVKGTTRILVDTDTSPATWIEQSGALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQDSIINLDDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPSLPPENQVTVEADNNLLKNLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQVDSNGNVVGKLPNLRTPDLLGLRVVFNNFLDDDTVLFGEFEQYTLVERENITIDSSTHVKFTEDQTAFRGKGRFDGKPVKPEAFVLVTITDPVQGAVP104(15818-16132)氨基肽酶MKLTELDDLLKATGLPVAYSHFSKPQKPPFITYMVAYSSNFTADDQVYQEIENVQIELYTLKKDFEAEEKVKAVLDANNLVYETSETYIPSEKLYQKVYEVRLLVP71(25318-25103)酰胺酶MKRKVQGIVDRFEGDIAVVEIGRKTLDFPKDIFPKEADVGDVVIIEVTIDKKETEKLRKEIEELMNEVFEDVP394(30994-29810)DNA重组/插入蛋白LASFQKYKTKDGEKWMFKMDVGIDPATGKRKTTTRRGFKTKKEAQLAAAKLYDEINNGGYVKDTNILFKDFAQEWLAIYSETAKISTIRARKHELGHLMRYFGNLKLKDVTRKMYQDMLLDLKKKGYADNTLDGIHTTGRMIFKKAMELELINSNPTEYAKVPKQKKTVEDIESGNKDMKFLEKHELALFLKTAQEHGLAMDYVVFSTLAYTGMRLGELLALQWKDINFKEHTIAITKTLYSPRNNERYYQLLPPKTQGSIRTIKVDPNIISLLKKHKAEQNEIKLRMGELYHDLGFVFARPSGFPEVPKKIEIRMKRLLRIANIHKHVTPHSLRHTHTSLLIEAGVGIKEIQQRLGHADIETTMNIYAHLTKDLEERASQKFSELMSNFIKNLVP60(31179-30997)噬菌体整合酶VIQLIIRLDDYRKKKKKKTNSCSMASIPVFSRITVEDGKLIGVLENGQKIIIENLDEKERVP148(32196-31750)免疫抑制蛋白MSFPQRLRMLRKAKGLTQEELGRKVNVTKVSISGYESGNRTPDMDTLKALADALDVSIDYLLGRDNTQPTDTKLPALTEKDERDIQKELEKIINGLKTGSGFAAFGGVDIDELDEEDRELLIASLENSLRLAKRIAKQKFTPKKYRKEVP186(32783-33343)噬菌体抗抑制物VKVIPEGDIYRLIIKAADQSKNPEIRQKAEEFEKWIFEVVLPTIRRTGGYVANEDMFINTYLPFADEQTKMMFRGMLETVRRQNEQIAAMKPKVEYFDALVDRNLLTNFRDTAKELKIKERYFINWLLENKFVYRDQKGKLKPYAAYVPELFELKEWERNGKADVQTLITPKGRETFRLLLKKETA
权利要求
1.一种高温双链DNA噬菌体GBSV1,其特征在于具有如SEQ NO.1所示的核苷酸序列。
2.根据权利要求1的高温双链DNA噬菌体GBSV1,其特征在于具有如SEQ NO.2所示的氨基酸序列。
3.根据权利要求2的高温双链DNA噬菌体GBSV1,其特征在于在其氨基酸序列SEQ NO.2的第1176-2045位为DNA结合蛋白。
4.根据权利要求2的高温双链DNA噬菌体GBSV1,其特征在于在其氨基酸序列SEQ NO.2的第2087-2953位为噬菌体复制蛋白。
5.根据权利要求2的高温双链DNA噬菌体GBSV1,其特征在于在其氨基酸序列SEQ NO.2的第4149-4550位为ATP结合蛋白。
全文摘要
本发明提供了一种高温双链DNA噬菌体GBSV1的全基因组序列;本发明采取梯度离心纯化、基因组提取和核酸序列测定等技术,对分离、纯化的高温双链DNA噬菌体GBSV1进行全基因组序列测定及序列分析,测定了34579bp的核苷酸序列。将噬菌体GBSV1基因组文库中的DNA结合、重组、复制等功能基因进行体外克隆表达,可用于基因重组、基因治疗。根据噬菌体GBSV1的全基因组核苷酸序列,可以构建人工载体,建立高温表达系统,研究热液区生物的生物化学及分子生物学特性。
文档编号C07K14/005GK101089184SQ20061009143
公开日2007年12月19日 申请日期2006年6月12日 优先权日2006年6月12日
发明者章晓波, 刘斌 申请人:国家海洋局第三海洋研究所
网友询问留言 已有0条留言
  • 还没有人留言评论。精彩留言会获得点赞!
1