本申请涉及琥珀酸酐类化合物,还涉及与其相关的基因和蛋白以及其制备方法。
背景技术:
琥珀酸酐(也叫丁二酸酐)类化合物,是一种常用的化学物质,可作为食品添加剂;塑料工业用于制造玻璃纤维增强塑料。有机工业用作合成有机化合物的中间体,如聚乳酸等。琥珀酸酐类化合物的下游产品涵盖了n-羟基丁二酰亚胺、丁二酸单乙酯酰氯、红霉素琥珀酸乙酯、芬布芬、青蒿琥酯、恶丙嗪、曲匹布通等重要产品;同时,琥珀酸酐能与其他化合物组装,提升药物的活性等,表明琥珀酸酐类化合物为重要产品的合成前体和原料。
因此开发新型的琥珀酸酐类衍生物为发现新型材料和医药农药等重要产品提供新型的原材料。
技术实现要素:
本申请之一提供了一种琥珀酸酐类化合物,其分子式如结构式i所示:
本申请之二提供了如本申请之一所述的琥珀酸酐类化合物在制备n-羟基丁二酰亚胺、丁二酸单乙酯酰氯、红霉素琥珀酸乙酯、芬布芬、青蒿琥酯、恶丙嗪和曲匹布通中的至少一种中作为前体,用于制造玻璃纤维增强塑料,作为食品添加剂,以及提高药物活性中的应用。
本申请之三提供了一种制备如本申请之一所述的琥珀酸酐类化合物的方法,所述方法包括发酵含有编i型pks/nrps酶的基因,以及含有编码er酶的基因的异源生物,得到发酵液;通过在异源生物中表达i型pks/nrps酶和er酶来合成如本申请之一所述的琥珀酸酐类化合物;其中,所述i型pks/nrps酶具有如下(i)或(ii)的氨基酸序列:(i)来源于嗜热属(thermomyces)的i型pks/nrps酶的氨基酸序列,优选地,如seqidno.1所示的氨基酸序列;(ii)与(i)中的氨基酸序列的一致性在95%以上,且与(i)中的氨基酸序列具有相同功能的氨基酸序列;优选其氨基酸序列为与(i)中的氨基酸序列的一致性在99%以上,且与(i)中的氨基酸序列具有相同功能的氨基酸序列;所述er酶具有如下(iii)或(iv)的氨基酸序列:(iii)来源于嗜热属(thermomyces)的er酶的氨基酸序列,优选地,如seqidno.2所示的氨基酸序列;(iv)与(iii)中的氨基酸序列的一致性在95%以上,且与(iii)中的氨基酸序列具有相同功能的氨基酸序列;优选其氨基酸序列为与(iii)中的氨基酸序列的一致性在99%以上,且与(iii)中的氨基酸序列具有相同功能的氨基酸序列。其中所述的异源生物的表述是相对于i型pks/nrps酶或er酶的来源生物而言的。如seqidno.1所示的氨基酸序列和如seqidno.2所示的氨基酸序列来源于嗜热踝节菌thermomycesdupontii(talaromycesthermophilusnrrl2155,该菌株保存时该菌株名为talaromycesthermophilus,2014年后国际真菌分类学将此菌株改为thermomycesdupontii,见文献thermophilicfungiinthenewageoffungaltaxonomy,extremophiles,doi:10.1007/s00792-014-0707-0,https://www.researchgate.net/publication/268392182),而表达这两个基因的生物可以是异源生物酵母。
在一个具体实施方式中,编码所述i型pks/nrps酶的基因连接在能够用于所述异源生物的第一表达载体上;编码所述er酶的基因连接在能够用于所述异源生物的第二表达载体上;所述第一表达载体与所述第二表达载体不相同。
在一个具体实施方式中,所述第一表达载体在连接上所述i型pks/nrps酶的基因之前选自pgadt7或pgbkt7。
在一个具体实施方式中,所述第二表达载体在连接上所述er酶的基因之前选自pgadt7或pgbkt7。
在一个具体实施方式中,编码所述i型pks/nrps酶的基因具有如seqidno.3所示的核苷酸序列;和/或编码所述er酶的基因具有如seqidno.4所示的核苷酸序列。i型pks/nrps基因由如下方法得到:通过柱式法提取t.dupontiinrrl2155基因组,操作方法按照天根植物提取试剂盒使用说明。将pks/nrps基因分为三段(每段约4000bp),设计引物进行pcr,通过酵母同源重组的方法连在捕获质粒prs426上,得到含有目的基因的载体prs426-talth1_004980-t1,通过测序确证目的基因正确无突变;或者也可以分段人工合成得到。er基因如下方法得到:通过柱式法提取t.dupontiinrrl2155基因组,操作方法按照天根植物提取试剂盒使用说明。设计引物进行pcr并纯化片段进行测序,通过测序确证目的基因正确无突变;或者也可以人工合成得到。
在一个具体实施方式中,所述异源生物为酵母菌(saccharomyces);优选地,所述异源生物为酿酒酵母(saccharomycescerevisiae)。例如,酿酒酵母fy834菌株。
在一个具体实施方式中,所述方法包括如下步骤:
1)将所述i型pks/nrps酶的基因连接到pgadt7载体上,得到第一表达载体;
2)将所述er酶的基因连接到pgbkt7上,得到第二表达载体:
3)将所述第一表达载体和第二表达载体共同转化至酿酒酵母中,获得工程菌株
4)将所述工程菌株接种至培养酿酒酵母的液体培养基(例如,酿酒酵母fy834菌株使用的氨基酸缺陷型液体培养基sd/-leu/-trp)中,在30±2℃200±50rpm培养10-14小时(例如12小时);按照2%-10%的接种率扩大培养,在30±2℃200±50rpm的条件下发酵4-6天,得到发酵液。
在一个具体实施方式中,所述方法还包括从发酵液中分离提纯得到本申请之一所述的琥珀酸酐类化合物。
在一个具体实施方式中,分离提纯的步骤如下:
a)将所述发酵液浓缩,得到浓缩物;
b)将所述浓缩物与丙酮水溶液混合并进行超声处理,减压浓缩后得到粗提物;
c)所述粗提物与丙酮混合并进行超声处理,减压浓缩后得到褐色的油状丙酮提取浸膏;
d)所述丙酮提取浸膏用凝胶柱色谱分离,过程中的流出液最多每7ml更换一次接样瓶,将接样瓶中组分相同的流出液样品混合,得到凝胶柱色谱分离液;然后对所述凝胶柱色谱分离液进行柱层析,过程中的流出液最多每7ml更换一次接样瓶,将接样瓶中组分相同的流出液样品混合,得到柱层析分离液;然后对所述柱层析分离液进行硅胶柱层析分离,得到橘色粉末状化合物,所述橘色粉末状化合物即为如本申请之一所述的琥珀酸酐类化合物;
在一个具体实施方式中,分离提纯的步骤如下:
a)将所述发酵液旋转蒸发浓缩,得到浓缩物;
b)将所述浓缩物与65-75%的丙酮水溶液混合并在80-120khz下超声处理2-5次,每次10-30min,减压浓缩后得到粗提物;
c)所述粗提物与丙酮混合并在80-120khz下超声处理2-5次,每次30-80min超声处理,减压浓缩后得到褐色的油状丙酮提取浸膏;
d)所述丙酮提取浸膏在流动性为甲醇的条件下进行sephadexlh-20凝胶柱色谱分离,过程中的流出液最多每5ml更换一次接样瓶,将接样瓶中tlc检测rf值为0.35-0.45的流出液样品混合,得到凝胶柱分离液;然后在流动性为5%-15%甲醇水溶液的条件下对所述凝胶柱分离液进行中压rp18柱层析,过程中的流出液最多每5ml更换一次接样瓶,将接样瓶中tlc检测rf值为0.35-0.45的流出液样品混合,得到柱层析分离液,接着对所述柱层析分离液在以体积比为11-9:1的氯仿和丙酮的混合液作为流动性的条件下进行硅胶柱层析分离,得到橘色粉末状化合物,所述橘色粉末状化合物即为如本申请之一所述的琥珀酸酐类化合物。
在一个具体实施方式中,分离提纯的步骤如下:
a)将所述发酵液旋转蒸发浓缩,得到浓缩物;
b)将所述浓缩物与65-75%的丙酮水溶液混合并在80-120khz下超声处理2-3次,每次15-25min,减压浓缩后得到粗提物;
c)所述粗提物与丙酮(100%的纯丙酮)混合并在80-120khz下超声处理2-3次,每次55-70min超声处理,减压浓缩后得到褐色的油状丙酮提取浸膏;
d)所述丙酮提取浸膏在流动性为甲醇的条件下进行sephadexlh-20凝胶柱色谱分离,过程中的流出液每4ml至5ml更换一次接样瓶,将接样瓶中tlc检测rf值为0.35-0.45的流出液样品混合,得到凝胶柱分离液,然后在流动性为5%-15%甲醇水溶液的条件下对所述凝胶柱分离液进行中压rp18柱层析,过程中的流出液每4ml至5ml更换一次接样瓶,将接样瓶中tlc检测rf值为0.35-0.45的流出液样品混合,得到柱层析分离液,接着对所述柱层析分离液在以体积比为11-9:1的氯仿和丙酮的混合液作为流动性的条件下进行硅胶柱层析分离,得到橘色粉末状化合物,所述橘色粉末状化合物即为如本申请之一所述的琥珀酸酐类化合物。
在一个具体实施方式中,tlc展开体系为氯仿:丙酮9:1。
本申请之四提供了一种er酶,所述er酶具有如下(iii)或(iv)的氨基酸序列:
(iii)来源于嗜热属(thermomyces)的er酶的氨基酸序列,优选地,如seqidno.2所示的氨基酸序列;
(iv)与(iii)中的氨基酸序列的一致性在98%以上,且与(iii)中的氨基酸序列具有相同功能的氨基酸序列;优选其氨基酸序列为与(iii)中的氨基酸序列的一致性在99.5%以上,且与(iii)中的氨基酸序列具有相同功能的氨基酸序列。
本申请之五提供了一种能够编码如本申请之四所述的er酶的基因。
在一个具体实施方式中,优选所述基因具有如seqidno.4所示的核苷酸序列。
本申请之六提供了根据本申请之四所述的er酶,或本申请之五所述的基因在制备如本申请之一所述的琥珀酸酐类化合物中的应用。
本申请的有益效果:
现有技术认为er酶参与pks聚酮类化合物生物合成,而没有其参与琥珀酸酐类化合物的生物合成的报道。
本申请首次发现了通过将来源于嗜热属菌的i型pks/nrps酶和er酶在酵母中进行共同异源表达,可以得到含有氨基的新型琥珀酸酐类化合物。首次发现i型pks/nrps和er基因参与产生含有伯氨的琥珀酸酐类化合物的新功能,以及琥珀酸酐类化合物可以通过在酵母异源表达高温真菌中的i型pks/nrps酶和er酶来合成制备的新方法。该生物合成制备方法具有污染小、安全可靠以及发酵周期短等方面的优点。
附图说明
图1为pks/nrps(talth1_004980-t1)3段基因pcr琼脂糖凝胶电泳图,其中泳道m为dl5000dnamarker,泳道1、2和3分别为3段基因(每段约4000bp)。
图2为以prs426-talth1_004980-t1扩增talth1_004980-t1的pcr琼脂糖凝胶电泳图,其中泳道为dl15000dnamarker,泳道1为目的条带(11844bp)。
图3为er(talth1_004946_t1)的pcr琼脂糖凝胶电泳图,其中泳道为dl2000dnamarker,泳道1为目的条带(1230bp)。
图4为pgadt7-talth1_004980-t1表达载体图谱。
图5为pgbkt7-talth1_004946_t1表达载体图谱。
具体实施方式
以下为结合具体实例对本申请进行进一步描述,但本申请的保护范围并不仅限于此。
在没有特别说明的情况下,本申请中使用的试剂和原料均可以市售获得,或者通过常规配制得到。
pdb培养基配方:将200g马铃薯用水煮沸30min后,取上清加入20g无水葡萄糖定容至1l,121℃灭菌20min。
氨基酸缺陷型固体及液体培养基sd/-leu/-trp来源:购自takara公司。
pgadt7质粒载体来源:购自takara公司。
pgbkt7质粒载体来源:购自takara公司。
实施例1t.dupontiinrrl2155pks/nrps基因的获得
采用柱式法提取t.dupontiinrrl2155(菌种由云南省生物资源保护与利用国家重点实验室提供)基因组,将t.dupontiinrrl2155于pdb培养基中45℃培养7天,过滤收集菌丝,加入液氮充分研磨至菌体呈粉末状;基因组的提取方法参照天根植物基因组提取试剂盒说明书。将编码总长为11844bp的pks/nrps基因talth1_004980-t1(如seqidno.3所示,其编码如seqidno.1所示的氨基酸序列)分为3段(每段约4000bp)设计引物进行pcr,其中第一对引物为pks/nrps-f1(如seqidno.5所示)和pks/nrps-r1(如seqidno.6所示),第二对引物为pks/nrps-f2(如seqidno.7所示)和pks/nrps-r2(如seqidno.8所示),第三对引物为pks/nrps-f3(如seqidno.9所示)和pks/nrps-r3(如seqidno.10所示)。pcr体系为50μl:5×primestargxlbuffer10μl,dntpmixture(各2.5mm)4μl,primerf1μl,primerr1μl,模板1μl,primestargxldnapolymerase1μl,ddh2o32μl。pcr仪为eppendorf生产,pcr反应温度为:98℃2min,98℃10sec,55℃15sec,68℃4min共计35个循环,68℃5min。pcr产物分别%的琼脂糖凝胶电泳,可见3个约4000bp的pcr产物(见图1)。最后通过酵母的同源重组将3个片段依次连接到质粒prs426上,得到含有完整目的片段的prs426-talth1_004980-t1质粒。
实施例2构建表达载体pgadt7-talth1_004980-t1与pgbkt7-talth1_004946_t1
根据实施例1的测序结果设计构建talth1_004980-t1基因表达载体的引物分别为p1-f(如seqidno.11所示)和引物p1-r(如seqidno.12所示)。
以t.dupontiinrrl2155基因组为模板,设计构建talth1_004946_t1表达载体的引物分别为p2-f(如seqidno.13所示)p2-r(如seqidno.14所示)。
利用长片段高保真酶primestargxl进行扩增,pcr体系为(50μl):1.talth1_004980-t1:5×primestargxlbuffer10μl,dntpmixture(各2.5mm)4μl,primerf0.5μl,primerr0.5μl,模板0.5μl,primestargxldnapolymerase2μl,ddh2o32.5μl;2.talth1_004946_t1:5×primestargxlbuffer10μl,dntpmixture(各2.5mm)4μl,primerf1μl,primerr1μl,模板1μl,primestargxldnapolymerase1μl,ddh2o32μl。pcr反应温度为:1.talth1_004980-t1:98℃2min,98℃10sec,55℃15sec,68℃3min共计35个循环,68℃10min;2.talth1_004946_t1:98℃2min,98℃10sec,55℃15sec,68℃1min共计35个循环,68℃5min。
pcr产物分别%的琼脂糖凝胶电泳可见talth1_004980-t1条带在11844bp位置(如图2),talth1_004946_t1条带在1230bp位置(如图3)。将两个目的片段纯化后,通过takara的infusion酶分别与经过ecori和bamhi双酶切的酵母表达载体pgadt7、pgbkt7相连,构建表达载体pgadt7-talth1_004980-t1与pgbkt7-talth1_004946_t1。并测序检测基因序列的正确性。
实施例3构建工程菌株fy834/pgadt7-talth1_004980-t1/pgbkt7-talth1_004946_t1
将施实例2中构建的两个表达质粒pgadt7-talth1_004980-t1与pgbkt7-talth1_004946_t1共同转化到酿酒酵母fy834中,涂布于氨基酸缺陷型固体培养基sd/-leu/-trp上,30℃倒置培养2-4天。随机挑去转化子于5ml液体sd/-leu/-trp培养基中扩大培养并提取质粒进行pcr验证,并选取pcr验证为阳性的转化子进行测序验证。结果表明,表达载体pgadt7-talth1_004980-t1与pgbkt7-talth1_004946_t1成功转化至酿酒酵母fy834中。
实施例4工程菌株fy834/pgadt7-talth1_004980-t1/pgbkt7-talth1_004946_t1的表达
将实施例3构建的工程菌株fy834/pgadt7-talth1_004980-t1/pgbkt7-talth1_004946_t1接种至5ml液体sd/-leu/-trp培养基中,30℃200rpm过夜培养;按照5%的接种率扩大培养,30℃200rpm发酵5天。
实施例5琥珀酸酐含氨基衍生物化合物的提取分离应用
以实施例4发酵液为样品组,转化了空载质粒pgadt7和pgbkt7的fy834菌株为对照,对照组与样品组选用相同培养以及相同的处理方法。将发酵液用旋转蒸发仪浓缩,浓缩产物用70%丙酮/水室温100khz超声提取2次,每次20min,减压浓缩得70%丙酮/水粗提物;粗提物用纯丙酮100khz超声提取两次,每次60min,减压浓缩得到褐色油状丙酮提取物浸膏;将样品组与对照组用tlc检测,10%的浓硫酸乙醇溶液进行显色,在样品组中得到一个与对照组存在差异的粉色点。丙酮提取物浸膏经凝胶柱色谱(sephadexlh-20,甲醇为流动相冲洗),每5ml流出液更换一次接样瓶,并用tlc检测流出液成分,将组分相同的样品进行合并;随后通过中压rp18柱层析(5%-15%甲醇/水为流动相梯度洗脱),tlc检测流出液成分,合并相同组分;之后合并组份利用硅胶柱层析(氯仿-丙酮10:1)为流动相,得到粉末状化合物。经过对一位核磁共振谱、二维核磁共振谱以及质谱的分析,鉴定化合物为一个新型的琥珀酸酐含氨基衍生物,结构如下:
本申请的琥珀酸酐含氨基衍生物的理化性质:
物化特征为:
外观:无色油脂状物质。
溶解性:溶于氯仿、丙酮、二甲亚砜等,不溶于己烷和水等。
新化合物琥珀酸酐含氨基衍生物的分子式为:c7h11no3,分子量为157;高分辨正离子esi-ms质谱:实测值为158.0817[m+h]+,计算值为158.0821(forc7h12no3)。氢谱和碳谱数据(其溶解溶剂为cd3cocd3,400mhz,δ:ppm):碳谱为:175.4(s,c-1),76.9(s,c-2),40.6(t,c-3),172.0(s,c-4),35.2(d,c-5),16.5(q,c-6),15.7(q,c-7);氢谱为2.90和2.73(3-h2,d,19.2hz),1.96(h-5,m),0.99(7-h3,d,7.1hz),0.91(6-h3,d,7.1hz)。
序列表
<110>云南大学
<120>琥珀酸酐类化合物、与其相关的基因和蛋白及其制备方法
<130>lha1760721
<160>14
<170>siposequencelisting1.0
<210>1
<211>3947
<212>prt
<213>嗜热踝节菌(thermomycesdupontii)
<400>1
metalaserlysvalargprogluproilevalileileglythrgly
151015
cysargpheproglyaspileargserthrserglnleutrpgluval
202530
ilecysserglnlysaspleuleualaargileproserserargphe
354045
serserlysglyphetyrhisserasnglygluarghisglyserthr
505560
asnvalasphisalatyrleuleuserasnaspvalglyalapheasp
65707580
alaaspphepheglyileasnhisargglualaglualaileasppro
859095
glnglnargileleuleugluthrvaltyrglualailegluaspala
100105110
glyleuthrileserglyleulysglyserasnthralavaltyrval
115120125
glyleumetthrglyasptyrhisglumetglnvalargaspproglu
130135140
aspmetprothrtyrmetalathrglythralaargserilevalser
145150155160
asnargilesertyrphepheasptrplysglyprosermetthrile
165170175
aspthralacysserserserleuvalalaleuhisasnalavalgln
180185190
alaleuargalaglyglucyshisilealavalalaalaglyalaasn
195200205
leuileleuglyproglumetmetilealagluserasnleuargmet
210215220
leuserprothralaargserargmettrpaspalaalaalaaspgly
225230235240
tyralaargglygluglyphealaalavalileleulysthrleuser
245250255
glnalaleualaaspglyaspprovalglutyrileilearggluthr
260265270
glyvalasnglnaspglyargthrglnglyilethrmetproasnala
275280285
alaserglnthralaleuileargglnvaltyrglnargalaglyleu
290295300
aspcysthrargprogluaspargcysglnphepheglualahisgly
305310315320
thrglythrproargglyaspproileglualaargalailehisasp
325330335
alaphephevalasphisserthralagluglupromettyrvalgly
340345350
servallysthrvalileglyhisleugluglycysalaglyleuala
355360365
glyleuleulysalaalaglualaileargargalagluilepropro
370375380
asnmethisphethrglumetasnprogluilevalprophealagln
385390395400
hisleuargvalprothrlysthrleuprotrpproglyglnthrlys
405410415
ileargargalaservalasnserpheglypheglyglythrasnala
420425430
hisvalileileglusertyraspleualaserserproserargser
435440445
leuserserprothrmetleuproileproleuthrpheseralaala
450455460
lysgluthrserleuthrargpheilegluglutyrilethrleuile
465470475480
lysserhisaspileleuproleuhisglnilealaserileleuser
485490495
serargargserglnhisserthrargalavalpheserglythrasp
500505510
lysasnargleuleuglnlysleugluthralaleualaalaserpro
515520525
leuglygluarglysglulysalaproileseraspargileleugly
530535540
ilephethrglyglnglyalaglntrpalacysmetglyarggluleu
545550555560
ileargserserprometalaarggluthrleuargalaleuglnala
565570575
serleuaspgluleuproaspglyproasntrpthrilegluthrgln
580585590
leuthrgluvalglugluproserargileglnglualaalaleuser
595600605
glnproleucysthralavalglnvalmetleuvalaspleuleuarg
610615620
alaalahisvalserphehisthrvalvalglyhisserserglyglu
625630635640
ilealaalaalatyralaalaglymetileseralaargaspalaile
645650655
argilealatyrtyrargglyleutyralalysphealaargglygln
660665670
glucysalalysglyalametmetalavalglyileserpheaspglu
675680685
alaglnaspleuilealaalalyspheargglyargilealavalala
690695700
alaserasnalaproargservalthrleuserglyaspgluaspala
705710715720
ilealaglualalysalametphegluglnasngluthrphecysarg
725730735
leuleulysvalaspthralatyrhisserhisglnmetleuprocys
740745750
leuaspprotyrleualaalaleuargalaalaasnileserproser
755760765
lysproseraspglyleuthrtrpvalserservalhisgluargglu
770775780
metvalthrglugluaspilegluserleuargalaasptyrtrpgly
785790795800
aspasnmetalaglnthrvalargpheserglnalavalglnlysala
805810815
serargleuhisglyprophealavalglyvalgluvalglyprohis
820825830
proalaleulysglyproalathrglnthrilelysaspglucysgly
835840845
glnthrileprotyrserglythrleualaargphegluhisaspval
850855860
glualapheseraspalaleuglypheleutrplysgluileglypro
865870875880
serservalaspleuargalatyralaalaalaalapheaspleuser
885890895
pheglnglupropheserasnhisleuserleuproargtyrprotrp
900905910
asphisserglnargphetrplysgluserargleuserargargtyr
915920925
argglnargargileasnarghisaspleuleuglythrargcysser
930935940
aspasphisasphisglumethistrpargasnileleuargvalser
945950955960
gluserprotrpleuserglyhislysvalglnglyglnvalilephe
965970975
proalaalaglytyrleuvalmetalametglnalaalaleugluleu
980985990
alaglyaspargargilephemetilehisleuseraspvalasnile
99510001005
aspargalailealaleuproglutyrlysglyvalgluvalmetphe
101010151020
hisleuargprolysservalthrserserleuilelysalagluphe
1025103010351040
alacystyrservalthrseraspgluasnglyserproserglnarg
104510501055
hisalaserglyilevalglnvalhisleuaspproalaaspglnleu
106010651070
glnleuproproserglnglugluprovalserleuvalservalasn
107510801085
metgluthrphetyralaserleusergluileglyleuglutyrthr
109010951100
glyleupheargargleuaspargvalgluargargalaglyargala
1105111011151120
thrglytyralaargaspileproseraspserglumetprovalval
112511301135
ilehisproalaleuleuaspalaalapheglnthrilephealaala
114011451150
phecystrpproglyaspglythrleuhisglyprotyrvalprothr
115511601165
hisleuglnserleuargilevalprovalthrglnpheglualagln
117011751180
lysmetthrileglucysthrilethrgluserargproglnthrval
1185119011951200
thralaaspvalaspvalphealaglnasncysproargvalglnleu
120512101215
gluglyleuthrcysthrmetleuasnalaprothrprogluaspasp
122012251230
cysgluleuphealagluthrvaltrpargalaaspalaglyalaasp
123512401245
leuglyseralaaspleuvalseraspcysalaaspaspleulysleu
125012551260
valaspleucysgluargleusertyrsertyrleuargglnleuasn
1265127012751280
alaalaileaspargsergluileaspserphevaltrpasnhisgln
128512901295
argilepheglupheileasptyrleupheproleuileglusergly
130013051310
glnhisprothrileargproglutrplysglyaspserhisglutrp
131513201325
leuleualaglnalaargglnhisproaspvalvalaspleuglnleu
133013351340
ileseralavalglygluhisleualaaspvalvalargglylysthr
1345135013551360
thrileleugluhismetilealaasnasnthrleuaspargphetyr
136513701375
lystyrglyleuglypheglnargalaasnlysalaleuserleuval
138013851390
alaalaglnilealahisargtyrproargmetlysileleugluile
139514001405
glyalaglythrglyglyalathrlysglyileleugluhisleuasp
141014151420
asplysphegluglntyrvalphethraspileserthrglyphephe
1425143014351440
gluasnalaglnglnglnphealaargtrpalaserargmetserphe
144514501455
argproleuasnilegluglugluvalserserglnglyphegluasp
146014651470
glyiletyraspmetileilealaserasnvalleuhisalathrlys
147514801485
lysleuglutyrthrmetglnasnvalargargleuleulysprogly
149014951500
glypheleuleuleuleugluvalthrseraspileleuargvallys
1505151015151520
phemetmetserglyleuproglytrptrpleuglyalaaspaspgly
152515301535
argargphealaprothrileseralaproglntrphisaspleuleu
154015451550
ileargthrglypheserglyvalaspglnargvalthrasppheglu
155515601565
aspvalserlyshismetthrservalmetleuserglnalavalasp
157015751580
proaspvallysleuleuargasnproleualalysseraspproser
1585159015951600
leuthrvalaspargvalileleuvalglyglyglnthralaglyval
160516101615
histhrleualagluglnvalserthrleuleuargargtrpserile
162016251630
aspserprovalilevalproargleugluaspilevalhisserasp
163516401645
leuglyseralavalalavalvalcysleualaaspleugluglnpro
165016551660
valvaltrpaspmetasnaspgluargleuthrglyleulyslysleu
1665167016751680
leuasnileserargglnleuleutrpvalthrserglyalaargasp
168516901695
thrasnprotyralaasnmetserileglyleuglyargserleumet
170017051710
tyrglutyrthrhisileargmetglnhisleuaspphevalserasp
171517201725
cysaspasnlysalavaltyrilealathralaleualaargleuile
173017351740
leuvalasplysleuaspleuproserlysargleuleutrpserval
1745175017551760
gluprogluvalalapheglngluglyargtrpleuileproargile
176517701775
leuproasnaspproleuasnargargleuasnalaserargargarg
178017851790
valthrgluaspvalleumetasnasphisprovalgluvalilegly
179518001805
aspglyalaaspvaltrpcysglulysthraspleuproargglnasp
181018151820
aspthrleuleuleuvalarglysglntyralaleuleuhisglyleu
1825183018351840
pheleuasnglyasnglyproleutyrleuserileglylysvalglu
184518501855
aspseraspargsertyrserleuprothrglythrthrvalleuala
186018651870
metserhisglnileargserleusertrpileproproserhisala
187518801885
valproileaspserserthralathrproasptyrleumetleuthr
189018951900
alaleualaleuvalvalasnservalvalaspglnvalsersergln
1905191019151920
glyhisileleuleuvalserproaspglnalaileargargleuval
192519301935
gluaspargalaprovalargasnleulysvalthrilevalhisphe
194019451950
serproglyhisgluglyalailetyrileproserleuleuprolys
195519601965
arghisileserglyargleuproserasnvalaspleuileleuasp
197019751980
cysseralagluthrhisvalleuglygluleuleuilecysasphis
1985199019952000
thrileargleuargaspilepheargileproserglythrtyrser
200520102015
serlysalaglyileserglnasngluleuvalglualaleuargval
202020252030
alaserthrsersertyrgluilethralaargleuleuproleuser
203520402045
gluvalserglyalaglyhisphealaaspilealaservalileasp
205020552060
phethralavalthrthrvalglnthrleuvalargprovalaspala
2065207020752080
glyserleupheargseraspargsertyrleuleuvalglycysthr
208520902095
glyglyleuglyglnserleuthrargtrpmetvalleuasnglyval
210021052110
arghisleuileleuthrserargasnserlysasnvalasnargval
211521202125
trpleuglugluleulysargmetglyalaglnvalhisleupheglu
213021352140
leuasnilealaasplysglnalaleuhisalamettyraspglnval
2145215021552160
glnargglnleuproproilealaglyvalalaasnalaalametval
216521702175
leuseraspcysleupheasnaspmetthrvalgluaspleuglnlys
218021852190
valleuaspprolysvalalaglythralatyrleuaspgluleuphe
219522002205
serserprothrleuaspphephevalleupheserserleualaser
221022152220
ilevalglyasnargglyglnserasntyrglyalaalaasnleuphe
2225223022352240
metthrserleualaalaargarglysargglnglyleualaglyser
224522502255
valleuaspileglymetvalleuglyileglytyrvalserglnthr
226022652270
glyiletyrgluserthrleuarglyspheasntyrmetproileser
227522802285
gluglnlysphehisvalmetphethrglualaileilealaglyarg
229022952300
proaspglnarggluvalseralagluileilethrglyleuhisarg
2305231023152320
valalagluserseraspglyserglyasnglnalaphetrpsergly
232523302335
asnproargpheserhistyralavalargglulysglyglyserglu
234023452350
glnalaalathralavalvalalaleulyslysglnleuglugluala
235523602365
gluaspleuthralaileasnglnvalleuleuasnalaphealaasp
237023752380
lysleuglyargileleuglnvalproprogluglnileasnthrthr
2385239023952400
glnproleuileasnleuglyileaspserleumetalavalgluval
240524102415
argsertrppheleulysgluvalasnmetaspvalprovalleuarg
242024252430
ileleuglyaspalaserproalametleucysglnglualaalaasp
243524402445
argtyrmetglnleuglnasnprosermetglnalaalailethrser
245024552460
gluthrserserserseralaserglnleuleuaspserthrthrala
2465247024752480
thrthrsergluiletyrprothrserseralaserserglnglyile
248524902495
glnthrproprogluthrthraspphevalgluthrsercysaspglu
250025052510
gluglualagluleugluvalvalglualacysglnleuserpheala
251525202525
glngluargleutrppheleuargglupheleugluaspargserthr
253025352540
tyrasnvalthrmetvaltyrargvalcysglyprothrvalserala
2545255025552560
leuasnglualapheasnalavalvalserarghishisvalleuarg
256525702575
seralapheleuvalasplysgluserglyleuprotyrglnasnile
258025852590
leuhisglnserpropheargleuthrglnargglulysgluthrala
259526002605
thraspgluaspileaspargglupheglnargleucyshishisthr
261026152620
tyraspleugluhisglyglucysmetalaalavalleupheserhis
2625263026352640
alaproaspthrhisthrleuileleuglyphehishisilevalphe
264526502655
aspglypheseralaglnilephevallysaspleualathralaleu
266026652670
serglyargtyrleuproproleuasncysglntyrthrasppheala
267526802685
argargglnargalaglnvalglnasnglumetalagluaspleuala
269026952700
tyrtrplysglnglupheserthrleuproserprovalproleuphe
2705271027152720
gluphecysglnvalalathrargargthrleuthrglutyralathr
272527302735
hisglyileglnlysthrileproalaserthrvaltyralaphearg
274027452750
sermetalaargargpheglnalathrprophehisglyhisleuala
275527602765
ileleuargleuleuleualaargleuleuaspleuthrgluvalcys
277027752780
ileglyilethraspalaasnargthraspserasppheleugluthr
2785279027952800
ileglyphephevalasnleuleuproleuargphegluleuglygln
280528102815
hisaspserleugluglnleumetglnasnthrargaspvalthrtyr
282028252830
argalaleuglnhissercysvalpropheaspvalleuleuaspala
283528402845
leuvalvalproargserthrthrgluserproleupheglnileleu
285028552860
metasntyrargmetglyserthrserlysilelysthrasnglyphe
2865287028752880
glualagluleuleuargpheglnaspalaargasnprotyraspleu
288528902895
ilepheasnvalglugluglnaspaspglythrthrleuvalaspval
290029052910
glnserglnsertyrleutyrthrglnaspaspleuglumetleuleu
291529202925
aspalatyrilecysleuleuthrsercysserthrasnproglytrp
293029352940
proleuhislystyrthriletyrasngluglnaspvalasnleuala
2945295029552960
leugluleuglyargglyproglnleuasnpheglygluseralathr
296529702975
leucysargargileaspglumetvalalaalaglnproaspgluthr
298029852990
alavallysasphisasnglyglnpheleuthrtyrlysglnleuleu
299530003005
serhisvalglupheilealaalathrleuglualahisglyvalarg
301030153020
serglyasptyrvalalavalphecysgluprothriletyrserval
3025303030353040
cystyrleuleualailetrpargleuasnalailetyrvalproleu
304530503055
aspproglnasnproalaalaargleuglnleuileleuaspaspcys
306030653070
glnprolysvalleuiletyrhisglualathrglugluthrmetarg
307530803085
lysphehisleuserthrthrgluprovalthrpheserasppheser
309030953100
serphealaserleuprovalproaspargsergluphethrserpro
3105311031153120
alacysalaleutyrthrserglyserthrglyvalprolysglyile
312531303135
leuleuthrhisaspserphevalasnglnileleuglyilearghis
314031453150
glnpheservalglyarggluthrvalleuglnglnserserleugly
315531603165
pheaspvalserleuaspglnmetleuglnproleuvalglyglygly
317031753180
thrleuvalvalalaserargglnleuargtyraspalathrgluleu
3185319031953200
alaargleumetvalarggluhisilethrtyrthrtyralathrpro
320532103215
serglutyralaalaleuleuargtyrglyglyaspvalleuglnarg
322032253230
serserphetrpargleualaphevalglyglyglualaleuproala
323532403245
hisleuileargserphehisalaleuglnargproglyleuargleu
325032553260
ileasnargtyrglyprothrgluilethrileserserasncysleu
3265327032753280
serileaspthrtrpasnproalavalileserleuserargvalser
328532903295
valglyproserleuproasntyrvalthrtyrilemetaspserasn
330033053310
glyargproleuproileglyhisvalglygluilevalileglygly
331533203325
alaglyvalserglnglytyrleuargargglugluleuthrargglu
333033353340
argpheleuvalasplystyrglylysseraspalaglyleuvalglu
3345335033553360
cysglyargmettyrargthrglyasplysglyargleuleuprothr
336533703375
glygluleuphetyrleuglyargmetaspglyaspthrglnilelys
338033853390
leuargglypheargilegluleugluaspilealaglnthrileleu
339534003405
argalaalahisglyargleualaaspalavalvalservalarggly
341034153420
valhisaspglugluglyaspargargpheleuvalalaphealaval
3425343034353440
proalaargglnseraspglyserglyaspileglnalapheleuasp
344534503455
glnleuvalhisthrleuproleuproglntyrmetileproargarg
346034653470
valvalvalvalasphisleuproargasnproasnglylysleuasp
347534803485
argargalaleuaspserleuproleuproserleuserproaspala
349034953500
proserhislysleuthralaalaglnalaalavalvalcysvaltrp
3505351035153520
lysargcysleuaspproserasnleuproaspsertrpserprothr
352535303535
alaaspphephegluleuglyglyasnserleuleuleuvalargval
354035453550
hisalaleuleuserglualapheaspargglnvalproleuhisglu
355535603565
leupheleuserserthrvalglnglymetalaalaargphealapro
357035753580
glugluvalalaglnservalasnmetileasptrpglusergluser
3585359035953600
thrprosergluleugluargglnalametgluserlysproproval
360536103615
argalaargargseraspmetlyslysilegluvalcysleuthrgly
362036253630
serthrglypheleuglysergluleuleuargargleualahisasp
363536403645
proargvalserargilehiscysvalalavalargserserasnala
365036553660
asnargproargthrleualavalaspserglulysilevalvaltyr
3665367036753680
thrglyaspleuthrgluproargleuglyleuproproaspthrtrp
368536903695
aspalaleuglygluargvalaspalaileilehisileglyalaasp
370037053710
valserpheleulysthrtyrhisserleuargasnalaasnvalhis
371537203725
serthrarggluleualaleuleualaleuargargargileproleu
373037353740
histyrileserthrglyglyvalalaglnleuvalglyvalgluthr
3745375037553760
leuthrproglnservalalaargpheproproproasnaspglyser
376537703775
metglytyrilealaserlystrpalasergluvaltyrleugluser
378037853790
cysalathrglnphehisleuprocysvalilehisargproserasn
379538003805
ileileglygluglyvalproserthraspleuileglnthrileleu
381038153820
glntyrservalargileglnglyvalprovalleugluasntrpser
3825383038353840
glyserpheaspphevalprovalgluaspvalalaleuargvalcys
384538503855
glugluleuvalargserileaspalaphealaargproaspthrpro
386038653870
alaglusermetleuargphevalhishiscysglyalaglulysile
387538803885
provalglyaspileglyvaltyrleuglulyslyshisglyvalmet
389038953900
leuargserileaspileglysertrpleuasnalaalaargalaala
3905391039153920
glyleuproalaalametgluasnleuvalthralathrleuthrglu
392539303935
lysglyhishisvalleuproserleusergln
39403945
<210>2
<211>409
<212>prt
<213>嗜热踝节菌(thermomycesdupontii)
<400>2
metserleuvalvalproargtyrpheserglncysproglnglnarg
151015
cyspheglnvalpropheasnalaserasnalacysargasptyrlys
202530
tyrgluvalargargthrgluserargvalpheileglyasnserser
354045
serargilealavalmetseralaglnthrilethrpheglnglnhis
505560
serthrgluproserargvalileargvalhishishisgluserile
65707580
glyaspargproleuproproaspservalleuleuargpheleuala
859095
alaproileasnproglnaspleuleuvalilealaglyargtyrpro
100105110
valglnprohistyrlystyralaaspgluproileproglytyrasp
115120125
glyvalalaargvalgluargvalglyalaasnvalthrthrleugln
130135140
proglyasphisvalileproarghishisglyleuglythrtrparg
145150155160
serglualavalvalproalathrservalleulysvalserasnarg
165170175
leugluprothrthralaalaleuleulysleuglycysserthrala
180185190
tyrleuleuleugluserserasnalaleuglnproglyaspleuval
195200205
alaileasnalaalaserglytrpilealaargmetvalvalglnphe
210215220
alaargleuargglycysglyserilecysileileargaspargasp
225230235240
asnvalgluthrthrargglnserleuleualahisglyalahisval
245250255
valleuthrgluserglnleualaglngluglyvalalaalaalaarg
260265270
thrglyglyargargvalmetleualaleuaspalavalpheglyglu
275280285
serglyglnargleuvalserleuleuserthrglyglythrtyrile
290295300
asntyrglyserleuglyglyalaalaglyglnileileleuthrgln
305310315320
gluleuleuphetrplysglnilethrpheargasnpheargleuser
325330335
glnalaleualaargtyrthrgluglualaglnileglnleuleuthr
340345350
trppheglygluleuphegluglnglyglnleuvalalaproproval
355360365
lysileilelystrplysglyaspglyserleuglulysargvalarg
370375380
glualaleuserglnilelysgluserseralaglyvalvalglyasn
385390395400
leulysprovalpheglnphegluser
405
<210>3
<211>11844
<212>dna
<213>嗜热踝节菌(thermomycesdupontii)
<400>3
atggcttccaaagttcgtccggagccgattgtcataattggcaccggctgtcgatttccc60
ggtgacattcgcagcacgtctcagctctgggaggtgatctgcagccagaaagaccttctc120
gcccgcattccttcgtcccggttcagcagcaaaggattctaccactccaacggcgagcga180
cacggcagcacgaatgtcgaccatgcctatttactcagcaatgatgtcggcgcctttgac240
gccgacttcttcggcatcaatcacagagaagcggaagcaatcgatcctcagcagcggatt300
ctccttgaaaccgtttatgaggcaattgaagatgctggccttaccatttccggactgaag360
ggatcgaatactgccgtatatgtcggcttgatgactggcgattaccacgaaatgcaggtc420
cgcgacccggaggatatgcccacgtacatggctacggggactgctcgtagcatcgtttcg480
aacaggatctcctacttcttcgactggaaaggcccgtccatgaccattgataccgcctgt540
tcctcgagtctggttgctttgcataatgctgtccaagctctccgagcaggggaatgtcac600
attgctgtcgctgccggggcaaatctcattctgggaccggagatgatgattgccgagtcg660
aatcttcgtatgctttcgcctactgcacgatctcgaatgtgggacgctgcagcggatggg720
tatgcccgtggtgaaggctttgccgcggtcattctcaaaacactctcccaagctttggcc780
gacggggatcctgtggagtatatcattcgagaaacaggcgtcaatcaggacggcagaacg840
cagggcatcaccatgccaaacgcagcatcacaaacggctctgattcggcaagtctatcaa900
agagcgggtcttgactgcacgcgtcctgaagatcgttgccaattcttcgaggcgcatgga960
accggaaccccccgtggcgatcccatcgaggcgcgcgccatccatgatgcattcttcgtc1020
gaccactcaacggcagaggagccgatgtatgttggctcagtcaagactgttatcggccac1080
ttggaaggatgcgcagggttggccggtcttctgaaagccgctgaggcaattcgtcgtgct1140
gaaatccctcccaacatgcactttacggaaatgaaccccgaaattgttccttttgcgcag1200
catttgcgggtcccaaccaaaaccctgccctggcctggacagactaagattcgtcgggcg1260
agtgtcaattcttttggattcggaggaacaaatgcccacgttatcatcgagagctatgat1320
cttgcatccagtccaagccggagtctatcgtcaccaaccatgctaccaatcccactgaca1380
ttctcggcggcaaaggagacatcactgactcgattcattgaggagtacatcactttgatt1440
aaatctcatgacattctgcctctccatcagatagcgtcgattctgtcctctcggcgatcc1500
caacactcaacccgggccgttttctctggtacagacaagaacaggcttctgcaaaaactt1560
gagaccgctctcgccgcgtctccacttggagagcgaaaggagaaagcacccatttcggat1620
cggattctgggaatctttaccggccaaggcgctcaatgggcgtgtatgggacgggagtta1680
attaggtcctctccgatggcgcgggaaacattgagagcgcttcaggcgtctctggacgag1740
cttccggatggaccaaactggactatagaaacgcagttgaccgaggttgaggaaccctct1800
cggattcaagaagccgcactctcgcagccactatgcaccgcagtgcaggtgatgctcgtt1860
gatctcttgcgggctgcacatgtctcattccacactgtcgttggtcactcatctggagag1920
atcgctgctgcttatgccgcgggaatgatttctgcacgggacgccattcgcattgcatac1980
taccgcggcctgtacgcaaagttcgccagaggccaagaatgcgcaaagggagccatgatg2040
gcagtaggtatctctttcgatgaggcacaggacttgattgccgcgaaattccgaggccgc2100
atcgccgttgctgccagcaatgcaccccggtctgtcaccctttctggagacgaagatgcc2160
atcgccgaagccaaggccatgtttgagcagaatgaaaccttttgccgcctcctaaaagtg2220
gacactgcttatcactcccatcaaatgcttccgtgtcttgacccatatcttgcggctttg2280
cgggccgcaaacatcagtccctcgaagccctccgacgggctcacctgggtgtcaagtgtg2340
catgaacgagaaatggtcactgaggaagatatcgagtcactccgggccgattattggggc2400
gacaacatggcgcagacggtgcgcttttctcaggctgttcagaaagcgtcgcgtctacat2460
ggtccttttgctgtcggagtggaagttggaccgcatccagcattgaaaggaccggccacc2520
cagacgataaaagacgagtgtggccagacaatcccatacagtggcaccctcgcccggttt2580
gaacacgatgttgaggcattctccgacgcattgggcttcttatggaaagaaattggtcct2640
agctcggtcgaccttcgcgcatacgccgccgcagccttcgatctgtcgttccaagaacct2700
ttctcgaaccacttatcgctgccgcgatacccgtgggatcacagtcagcgattctggaag2760
gagtcccgactatctagaaggtatcgccaacgtcggatcaacaggcatgaccttctcggg2820
acacgctgttctgacgaccatgaccacgaaatgcactggaggaacatcctacgcgtctct2880
gaatccccgtggctctcgggccataaggttcaagggcaggttatcttccccgcagcaggt2940
tatctcgtaatggcgatgcaggccgcgctcgaactcgcgggcgatcgtcgtattttcatg3000
atacatctctcggatgtcaatattgatcgggcgatcgcgctcccggaatacaagggagtc3060
gaagtcatgtttcatcttcgaccaaagtccgtcacttcatccttgattaaggctgagttc3120
gcatgttactcggttacgtctgatgaaaatggttcaccttcacaacgacatgcctctggt3180
attgtgcaagttcatctggatccagcagatcagctccagctcccgccctcgcaggaagaa3240
ccggtctctttggtgtcggtgaacatggaaacattctacgcatcactcagcgaaatcggc3300
ttggagtacacaggactcttccgtcgtctcgatcgagtggagcgccgagctggtcgggcc3360
actggttacgcacgggacatcccatccgatagcgaaatgcccgtcgttatccatccagca3420
ttgttagatgctgcgtttcaaacaatttttgctgcattttgctggccaggagacggcact3480
ctgcacggcccctacgtgcccacgcatcttcaatcactccgtatcgtgccggtaactcag3540
tttgaagctcagaagatgacaattgagtgcacaatcaccgagtctcgcccacagacagtc3600
acagcggatgtcgatgttttcgcgcagaactgtccacgtgtgcaattggagggcctcact3660
tgcacgatgttgaacgcaccaacccctgaggacgactgcgagctcttcgcggaaactgta3720
tggcgtgctgatgctggggcagatctcgggtcagccgacttggtctctgactgcgctgat3780
gacttgaaactcgtggacctctgcgaaaggctctcctactcttatctacgccagctcaat3840
gctgcgatcgatcgcagcgagatagattcgtttgtctggaatcatcaacgcatctttgaa3900
ttcattgactatctcttccctctgatcgaaagcggtcagcaccccaccatccggccggaa3960
tggaaaggtgattcgcacgagtggttgcttgctcaggctcggcaacatccagatgtagtg4020
gatctgcagctgatctccgccgttggcgaacaccttgcggacgtggtccggggaaagact4080
actatactggagcatatgatcgccaataataccctggacagattctacaagtacgggctg4140
ggttttcagcgggccaacaaagctctcagtcttgtagctgcacagatcgctcatcggtac4200
ccgcgcatgaagattctggagattggcgctggcacaggcggagcaaccaagggaatcttg4260
gagcatctcgacgacaagttcgagcaatatgtgtttacggacatatccaccggcttcttc4320
gagaacgcccaacaacagttcgcaaggtgggcatcgcggatgtcatttaggcccctcaac4380
atcgaagaggaggtatcttctcaagggtttgaggacggcatttatgacatgatcatcgct4440
tcgaatgtgctgcacgcgacaaagaagctggagtatacaatgcaaaatgtacgcagattg4500
ctaaagccaggcggattcttactcctcctcgaggtgacgagcgatattctcagagtcaag4560
ttcatgatgtcaggcctacctggatggtggctgggtgctgatgatggccgacgatttgct4620
ccaacaataagtgcccctcaatggcacgaccttcttatccgtaccggtttttctggagtg4680
gatcagagggtgactgattttgaagacgtgtccaagcatatgacttcggtgatgctatcc4740
caggcagtggatcctgatgtcaagctgctgcgaaacccactggccaaatccgacccgtca4800
ctcacggtggatcgtgtcatacttgtcggcggccaaactgctggcgtacacacattggca4860
gagcaggtttcgactcttctacgacgttggagtattgactcacctgtcattgtgcctcgg4920
ttggaggatattgttcactccgatctaggctctgccgtcgcagttgtgtgcctagctgat4980
ctggaacagccagtggtgtgggacatgaacgatgaacgactcaccgggttgaagaaactc5040
ctaaatataagccgccagttgctgtgggtgacgtctggcgcccgtgataccaatccctac5100
gccaacatgagtattgggctcggacgatctctgatgtacgaatacactcacattcgcatg5160
cagcaccttgattttgtcagtgattgcgacaataaggcagtgtatattgcaactgcgctg5220
gcacgcctcattttggtggataagttggatcttccatccaaacggttgttgtggagtgtc5280
gaacctgaagtcgccttccaggagggccgctggttgatcccccgaattttgcccaacgac5340
cctctgaaccgtcgcctcaacgccagcaggagaagagtcacagaagatgttctcatgaat5400
gatcatccagtggaagtaattggcgatggcgctgatgtatggtgtgagaagaccgatctt5460
cctcgtcaagacgatactctgctcttggtccggaagcagtacgctcttctccatggcctg5520
ttcctgaacgggaatggtccactatacctgtccattgggaaggtggaggattctgatcga5580
agctactcattgcccactggcacgactgttctggcaatgagtcaccaaattcgatcactt5640
tcgtggattcctccaagccatgctgttccaattgacagttccactgcgacgccagactat5700
ctcatgctcactgctctcgcactcgtcgtcaattcagttgtcgaccaggtatcctctcag5760
ggacatatccttctcgtgagtccagatcaagctatccgtcgacttgttgaagatcgcgcc5820
cctgtgcgaaatctgaaggtgactatcgtgcatttctctccagggcatgaaggtgccatt5880
tatattcctagcctgcttcccaaacggcacatcagtggtcgtcttccaagcaatgttgat5940
ctcatactcgattgcagcgccgaaacacatgtcctgggcgagctactgatctgtgaccac6000
accattcgtttgcgcgacatcttcagaatcccgtccggaacatattcatctaaggcaggc6060
atctcgcaaaacgagcttgtggaggccttgagagtcgcctcaacatcatcttacgagatt6120
actgctcggctcctacctctctccgaggtcagtggcgcgggacatttcgctgatatcgcc6180
tccgtgattgacttcacggccgtgacaaccgttcagacgctcgtgcgacccgttgatgcc6240
ggcagcctcttccgatctgatcgatcttatctgctcgtcggctgcactggcgggttgggg6300
caatcgcttactcgttggatggtgctgaatggtgttcgccacctcatactgaccagtcgc6360
aactcaaagaatgtcaatcgagtgtggttggaagaattgaaacgcatgggagctcaagtt6420
catctgttcgaactaaacattgccgacaagcaagctcttcacgccatgtatgatcaggtc6480
cagcgacaactacctcccattgccggagtagccaatgcggcaatggtgctctcggattgt6540
cttttcaacgatatgacagtggaggatctccagaaagtgcttgatccaaaggttgccggc6600
actgcttatctagatgaactattctcgtcgccaacgctggatttcttcgttttgttttct6660
tcgctggcaagcatcgtggggaatcgtggtcaatccaactatggcgcagcaaatctgttc6720
atgacaagtctggctgctcgacggaaacggcaaggccttgcaggttctgtgcttgacatt6780
ggcatggttctgggaataggatacgtctcacaaacgggcatttacgaatccacgttgcgc6840
aagttcaactacatgcctatctcggaacaaaagtttcacgtcatgttcacagaggccata6900
attgcgggtcgtcctgaccaacgagaagtctccgctgagatcatcacgggtctgcatcgc6960
gtcgccgaatcatcggatggtagtggaaaccaggccttctggtctggtaatccacgtttc7020
tcccactatgctgtccgcgagaaaggtggttctgagcaggccgcaactgccgttgtcgca7080
ctcaaaaagcagttagaagaggccgaggatctgaccgcgatcaatcaggtacttctgaac7140
gcctttgcagacaagctcggcagaattcttcaggttcccccggagcaaatcaatactact7200
cagccgctcatcaaccttggaattgactcgttgatggctgtcgaagtgcggtcttggttc7260
ctgaaggaggtcaatatggacgtgccggttttgcgcatcctcggcgacgcctcaccggcc7320
atgctctgccaggaggccgctgatcgctacatgcaactgcagaatccttcaatgcaggcg7380
gcgattacttcagagacatcatcttcgagcgcgagtcaactgcttgactctaccacggcg7440
acgacctcggaaatataccccacgagttcagcttcatcccaagggattcaaacccctcca7500
gaaacgaccgacttcgtggaaacttcgtgtgatgaagaggaagcggagttagaggtggtc7560
gaggcatgccaactgtcttttgcccaggaacgactgtggttcctaagagagttcctagaa7620
gatcggtcaacatacaacgtgaccatggtgtaccgggtgtgtggtcctacagtcagtgcg7680
ctgaatgaggctttcaatgctgttgtgagtcgccatcatgttcttcgctcggcgttccta7740
gtggataaagagagtggtctgccgtatcagaacattctgcaccaatctcccttccggctt7800
actcaaagagaaaaagagacagcgacagatgaagatattgatcgagagtttcagagactc7860
tgccatcacacatatgacctggaacatggagaatgtatggccgcagtgttgttctcgcat7920
gcgccagatactcacactctgattctgggctttcatcacattgtcttcgacggattcagt7980
gcacaaatctttgtcaaggatttggccacagcactctctggccggtatctccctcctttg8040
aattgtcagtacactgattttgcgcgacgccaacgagcacaagtgcaaaatgagatggca8100
gaggatctcgcgtactggaagcaggagttttcgacgctgccatccccggtgcctctattt8160
gagttctgccaggttgccacgcgacggacattgactgaatatgctacacacggaatacaa8220
aagacgatccccgcgtcgactgtttatgccttcagaagcatggcgcgacggttccaagca8280
acccctttccacggccatctcgctattctacggcttctccttgctcggttgctggacctg8340
acagaggtttgcatcggcatcacggacgccaaccgcacagactcggacttcctcgagacg8400
ataggctttttcgtgaatctccttccactccgttttgagctcggtcaacatgactctttg8460
gagcagcttatgcaaaacacgcgcgatgtcacctaccgggctctccagcattcctgtgtc8520
ccgttcgacgttcttttggatgcgcttgttgtcccgcggtcgactacggaaagccccctg8580
tttcagattctcatgaattaccggatgggatcaaccagcaaaattaagacaaatggattt8640
gaggcagagctgctacggttccaggatgcccgcaatccttacgatttgatctttaatgtc8700
gaggaacaagatgatggaaccactttggttgatgtccagtcacagtcctacctctacact8760
caagacgatttggagatgttgttggacgcctacatctgccttctgacatcatgctcaacc8820
aaccctggttggcctctccacaagtataccatctacaatgaacaggatgtcaacctcgcg8880
ttggagttgggaagggggccccaattgaattttggtgagagtgcgactctttgccgtcga8940
attgatgaaatggtggccgcacaaccggatgagacagctgtcaaggatcacaacggccaa9000
ttcctcacatacaaacagctgctcagccatgttgagttcattgcagctacgctggaggca9060
catggagtgcgctcaggggactatgtcgccgtattctgtgagcctacaatctattctgtt9120
tgctatctgcttgctatttggcgcctgaatgcgatctatgtccccttggatccgcagaac9180
cccgcagctcgattgcagcttattcttgacgattgccaacccaaggtcctcatctaccat9240
gaagcgacggaagagacgatgcgcaagttccatttgtcgaccactgagccagtcaccttc9300
tctgacttttcttcctttgcctccttgcctgttcctgacaggtcagaattcacgtcaccg9360
gcctgcgctctgtacactagtggctcaacaggtgtaccaaaggggattcttctgacccat9420
gacagctttgtcaaccaaattctaggcattcgacatcagttctcagtcggccgtgagact9480
gttctgcagcaaagttcgcttgggtttgatgtgtccttggaccaaatgttacagcctctt9540
gttggtggtggaacgttggtggtggcatcccggcagctgcgttacgacgccactgagctg9600
gcgcgattgatggtacgggagcatatcacctacacctacgcgactccatcagagtacgcg9660
gcacttctccgatacggtggggatgtgttacagcgtagctcgttctggcggctcgccttt9720
gttggaggtgaagcccttcctgcgcatttgatcagatccttccatgcccttcaacgtccg9780
ggtctccgcctaatcaaccgatatggcccaacggagattacgatatcaagtaattgtcta9840
tcaatcgatacctggaatcctgccgtgatctcactgtcacgagttagtgtgggcccctcg9900
ttgccaaactatgtcacctatatcatggattccaacggacgtcctcttcccatcgggcat9960
gttggagagattgtcatcggaggcgctggggtctctcagggctatctccggagagaagaa10020
ctcacccgcgagcgattcctggtagataaatatggaaaatcagatgcagggctcgtggag10080
tgcggccgcatgtatcggaccggcgataagggtcggctgcttccgacgggagaactgttc10140
tacttgggccgaatggacggagacacccagataaagctacgaggcttccgcatcgaactg10200
gaggacattgcgcaaaccatcctgcgtgccgctcatgggcgtctggcagatgccgtggtg10260
tcagttcgaggagtccacgatgaggagggagatcgacgattcttggtggcttttgctgtt10320
cctgcaagacagagtgatggatcgggtgacattcaagcgttcctggaccaactcgtgcac10380
accctccctctaccccaatacatgattccccggcgagttgtagtcgtggaccatcttccg10440
cggaatcccaatggcaagctggaccgacgtgcgctggattcgctgccgctaccttctctg10500
tccccggatgctccttcacataaactcactgctgcacaagcagcggtggtttgtgtatgg10560
aagcggtgcctagatccatccaaccttcctgactcttggagcccaactgctgattttttc10620
gagctgggagggaactcgctgttgctggttcgcgtgcatgccttgttgtccgaggccttt10680
gaccgtcaagtaccacttcatgaattgttcttatcgagtaccgtgcaagggatggccgct10740
cgctttgctccagaggaggtcgcacagtccgtcaacatgattgactgggagtcagaatcg10800
actccaagcgaactggaaagacaggcgatggagtctaagccgccagtcagggctcgacgc10860
agcgacatgaagaagattgaagtgtgtctgacaggatcaactggattcttgggctccgag10920
cttcttcggcgattagcccatgaccctcgcgtttcccgtattcactgcgtcgccgttcgg10980
tcctcaaacgcaaatcgtccgcggacattggccgttgattccgaaaagatcgtggtgtat11040
accggtgatctgacagaaccccgcttggggcttcctccagatacgtgggatgccttgggg11100
gagcgagtggacgccatcattcacattggggccgatgtgtcgttcctgaagacttatcat11160
tcactgcgtaatgctaatgtccattcaacgcgcgagttggcgctccttgccctccgtcgt11220
cgtatcccactgcattacatttccacgggcggagtggcccagctggttggtgttgaaacc11280
ctcacaccacaatcagtggcgcgcttccctcctccaaacgatggctcaatgggctacata11340
gcttcgaagtgggctagtgaggtttatctggagtcatgtgcaacacaattccatctgccg11400
tgtgtcatccaccgaccctcaaacatcatcggcgagggggttccgtcgaccgatctgatt11460
cagactattcttcagtactccgtccgaattcagggcgttcctgtgttggagaattggtct11520
ggatccttcgacttcgttcccgtggaagatgtcgcgctccgtgtgtgtgaggagctggtt11580
cgcagtattgatgcctttgcgcgacctgatacgccggccgagtcgatgctgcgatttgtt11640
caccattgtggagcggaaaagattcccgtgggcgatattggcgtctacttggagaaaaag11700
cacggagttatgctccgctcgatagatattggctcatggcttaatgctgcacgagccgct11760
gggttacctgcggccatggagaacctggtgacggcgacgttgacggagaagggccatcat11820
gtactcccatcgctgtcacaatag11844
<210>4
<211>1230
<212>dna
<213>嗜热踝节菌(thermomycesdupontii)
<400>4
atgtcccttgtggtacctaggtacttctcacagtgtccgcaacagcgatgcttccaagtt60
ccattcaacgcaagcaatgcgtgtcgtgattataaatatgaagtacggcggactgagtca120
cgggtgttcattgggaactccagttctcgaatcgccgtcatgtctgcgcagacgatcacc180
ttccaacagcactcgacggagccatcccgggtgattcgcgtccatcaccatgagtctata240
ggagaccgtccacttccccccgacagtgtgctgctgcgctttctggcagctccgatcaac300
ccacaagacctgctggttattgccggacgataccccgtgcagccacactacaagtacgca360
gacgaacccattccgggctacgacggcgtcgcgcgcgtggagcgtgtcggagctaatgtg420
acgacccttcagccgggagaccatgtcattcctcgccatcacggcttgggcacctggcgg480
tcggaagcagtcgtgccggcgacgtcggtgctgaaggtgtcaaaccgcctggagcccacc540
accgccgcactgctgaagttgggttgttcgaccgcctaccttttgctagagagcagcaac600
gccctccagccgggggacctggtcgcgatcaacgcggcgagcggctggatcgcccgaatg660
gtggtccagttcgctcggcttcgcggctgcggcagcatctgtatcatccgcgaccgtgac720
aacgtcgagacaacgaggcagtcactcctcgctcacggcgctcacgtggtgctcaccgag780
tcgcagctggcacaagagggcgtggccgctgcacgcacgggcggccggcgggtcatgcta840
gccctggacgcggtgtttggggagtccgggcagcggctggtatcgctgctctccaccggt900
gggacatatatcaactacggatcgctggggggtgcagccggacagatcattctgacgcaa960
gagctgctcttctggaagcaaatcacctttcgcaacttccggctgtctcaggccctggca1020
cggtacacagaggaggcgcagatccagctcctgacctggttcggggagctctttgagcag1080
ggacagctggttgcgcctccggtgaagatcattaaatggaaaggagacggttcgctggag1140
aaacgagtccgggaggctctatctcagatcaaggagagttctgcaggggtggtggggaat1200
ctcaagcccgtctttcaatttgagtcttga1230
<210>5
<211>50
<212>dna
<213>人工序列(non)
<400>5
ctatagggcgaattgggtacggcgcgccatggcttccaaagttcgtccgg50
<210>6
<211>45
<212>dna
<213>人工序列(non)
<400>6
caggaattcgatatcaagctggtacccgtccgcaaggtgttcgcc45
<210>7
<211>38
<212>dna
<213>人工序列(non)
<400>7
ggcgaacaccttgcggacgtggtccggggaaagactac38
<210>8
<211>44
<212>dna
<213>人工序列(non)
<400>8
cagagagtgctgtggccaaatccttgacaaagatttgtgcactg44
<210>9
<211>47
<212>dna
<213>人工序列(non)
<400>9
cgaattcctgcagcccgggggctagctttggccacagcactctctgg47
<210>10
<211>49
<212>dna
<213>人工序列(non)
<400>10
ctaaagggaacaaaagctggggcgcgccctattgtgacagcgatgggag49
<210>11
<211>52
<212>dna
<213>人工序列(non)
<400>11
attacgctcatatggccatggaggccagtgatggcttccaaagttcgtccgg52
<210>12
<211>57
<212>dna
<213>人工序列(non)
<400>12
ctacgattcatctgcagctcgagctcgatgctattgtgacagcgatgggagtacatg57
<210>13
<211>47
<212>dna
<213>人工序列(non)
<400>13
atggccatggaggccgaaatgtcccttgtggtacctaggtacttctc47
<210>14
<211>43
<212>dna
<213>人工序列(non)
<400>14
ggccgctgcaggtcgacgtcaagactcaaattgaaagacgggc43