来自ulkenia的PUFA-PKS基因的制作方法

文档序号:439987阅读:919来源:国知局
专利名称:来自ulkenia的PUFA-PKS基因的制作方法
技术领域
本发明描述了对于多酮化合物合酶(polyketide synthase)(PKS)特异性的基因编码序列。由它们合成的PKS的特征是具有产生PUFAs(多不饱和脂肪酸)的酶学能力。本发明另外包括相应DNA序列的鉴定以及所述核苷酸序列对于生产重组和/或转基因生物的用途。
术语PUFAs(多不饱和脂肪酸)表示具有链长度>C12和至少两个双键的多重不饱和长链脂肪酸。有两个PUFA的主要家族,其根据相对于烷基末端,在ω-3和在ω-6脂肪酸中的第一个双键的位置而区别。它们是细胞膜的重要组分,在那里它们以脂质,特别是磷脂的形式存在。PUFAs还作为在人和在动物中的重要分子,例如,前列腺素,白三烯和环前列腺的初级阶段而起作用(A.P.Simopoulos,essential fattyacids in health and chronic disease,Am.J.Clin.Nutr.1999(70),pp.560-569)。ω-3脂肪酸族的重要代表是DHA(二十二碳六烯酸)和EPA(二十碳五烯酸),其可以在鱼油和在海洋微生物中发现。ω-6脂肪酸的重要代表是ARA(花生四烯酸),其出现在,例如,丝状真菌中,但是也可以从动物组织如肝和肾中分离。DHA和ARA在人母乳中彼此相接出现。
PUFAs对于人来说在适当发育方面,特别是对于发育脑,组织形成及其修复是必需的。因而,DHA是人细胞膜的重要组分,特别是神经的细胞膜。它在脑功能的成熟中发挥重要作用并且对于视力的发育是必需的。ω-3 PUFAs如DHA和EPA被用作营养添加剂,因为具有DHA充分供应的平衡营养对于某些疾病的预防有利(A.P.Simopoulos,Essential fatty acids in health and chronic disease,AmericanJournal of Clinical Nutrition 1999(70),pp.560-569)。例如,患有非胰岛素依赖型糖尿病的成人呈现与后来出现的心脏问题相关的DHA平衡的缺陷或者至少是失衡的DHA平衡。同样地,神经元疾病如,例如,阿尔茨海默病或精神分裂症伴随着低的DHA水平。
有大量的DHA商业提取物的来源,例如,来自海洋冷水鱼的油,蛋黄部分或海洋微生物。适于提取n-3 PUFA的微生物发现于,例如,弧菌属(Vibrio)的细菌中(例如,海产弧菌(Vibrio marinus))或腰鞭毛虫(Dinophyta)中,其中特别是Crypthecodinium属,如C.cohnii或在Stramenopiles(或Labyrinthulomycota)中,如Pinguiophyceae如,例如,Glossomastix,Phaeomonas,Pinguiochrysis,Pinguiococcus和Polypodochrysis。其它生产PUFA的优选微生物特别属于Thraustochytriales目,(Thraustchytriidea)具有Japonochytrium属,Schizochytrium属,Thraustochytrium属,Althornia属,Labyrinthuloides属,Aplanochytrium属和Ulkenia属。
提取自商业上已知的PUFA来源如植物或动物的油的特征经常是非常不均匀的组成。以这种方式提取的油必须进行昂贵的纯化处理以便能够富集一种或几种PUFAs。另外,来自这些来源的PUFA的供应也会发生不可控制的波动。因而,疾病和天气影响能够减少动物也能够减少植物的产量。从鱼中提取PUFA出现季节波动并且甚至能够由于过度捕捞或气候变化(例如,厄尔尼诺现象)而暂时性地停止。动物油,特别是鱼油,可以通过食物链从环境中积聚有害物质。已知动物受有机氯化物,例如,多氯化联苯高度胁迫,特别是在商业性鱼场中,其抵消了鱼类消费的健康方面(Hites等,2004,Global assessmentof organic contaminants in farmed salmon,Science 303,pp.226-229)。鱼产品质量的所得损失导致消费者对于鱼和鱼油作为ω-3 PUFA来源的接受度下降。另外,从鱼浓缩DHA因为高度技术需要而相对昂贵。另一方面,DHA存在于少数海洋微生物,占细胞总脂肪组分的大约50%,并且它们能够在大的发酵罐中进行相对经济地培养。微生物的另一个优点是提取自它们的油的组成限于少数几种组分。
对于长链PUFA如二十二碳六烯酸(DHA;22:6,n-3)和二十碳五烯酸(EPA;20:5,n-3)的生物合成已知多种生物催化途径。在真核生物中生产长链PUFA的常规生物合成途径起始于亚油酸(LA;18:2,n-6)和α亚油酸的δ-6去饱和作用。它导致由亚油酸合成γ亚油酸(GLA;18:3,n-6)以及由α亚油酸合成十八碳四烯酸(OTA;18:4,n-3)。对于n-6以及n-3脂肪酸来说,此去饱和作用后接延伸步骤以及δ-5去饱和作用,致成花生四烯酸(ARA;20:4,n-6)和二十碳五烯酸(EPA;20:5,n-3)。起始自二十碳五烯酸(EPA;20:5,n-3)的二十二碳六烯酸(DHA;22:6,n-3)的合成随后能够通过两种不同的生物合成途径发生。在所谓的线性生物合成途径中,发生二十碳五烯酸(EPA;20:5,n-3)延伸另外两个碳单位,随后发生δ-4去饱和作用以形成二十二碳六烯酸(DHA;22:6,n-3)。这种生物合成途径的存在能够通过生物如破囊壶菌属(Thraustochytrium)和裸藻属(Euglena)中δ-4去饱和酶的存在而确证(Qiu,等,Identification of a delta 4 fatty acid desaturase fromThraustochytrium sp.involved in the biosynthesis of docosahexaenoic acidby heterologous expression in Saccharomyces cerevisiae and Brassicajuncea.,J.Biol.Chem.276(2001),pp.31561-31,566和Meyer等,Biosynthesis of docosahexaenoic acid in Euglena gracilisBiochemicaland molecular evidence for the involvement of a delta 4 fatty acyl groupdesaturase.Biochemistry 42(2003),pp.9779-9788)。起始自二十碳五烯酸(EPA;20:5,n-3)的二十二碳六烯酸(DHA;22:6,n-3)合成的第二条途径,所谓的Sprecher途径,独立于δ-4去饱和作用。它由两个连续延伸步骤,每步延伸2个碳单位至二十四碳五烯酸(24:5,n-3)以及随后δ-6去饱和作用至二十四碳六烯酸(24:6,n-3)组成。随后通过过氧化物酶体β氧化作用缩短两个碳单位而接着发生二十二碳六烯酸的形成(H.Sprecher,Metabolism of highly unsaturated n-3 and n-6 fatty acids.Biochimica et Biophysica Acta 1486(2000),pp.219-231)。这一第二生物合成途径是在哺乳动物中占优势的DHA合成途径(Leonard等,Identification and expression of mammalian long-chain PUFA elongationenzymes.Lipids 37(2002),pp.733-740)。对于C20 PUFA形成的备选生物合成途径存在于少数缺δ-6变性酶活性的生物中。这些生物包括,例如,原生生物Acanthamoeba sp.和Euglena gracilis。在备选的C20PUFA合成中的第一步在于C18脂肪酸,亚油酸(LA;18:2,n-6)和α亚油酸(ALA;18:3,n-3)延伸两个碳单位。随后通过δ8去饱和作用和接下来的δ5去饱和作用将得到的脂肪酸二十碳二烯酸(20:2,n-6)和二十碳三烯酸(20:3,n-3)转化成花生四烯酸(ARA;20:4,n-6)和/或二十碳五烯酸(EPA;20:5,n-3)(Sayanova和Napier,Eicosapentaenoic acidBiosynthetic routes and the potential for synthesis in transgenic plants.Phytochemistry 65(2004),pp.147-158;Wallis和Browse;The delta-8desaturase of Euglena gracilisAn alternate pathway for synthesis of20-carbon polyunsaturated fatty acids.Arch.Biochem.Biophys.362(1999),pp.307-316)。
高等植物不具有由初级阶段合成C20 PUFA的能力。它们通过各种去饱和酶起始自硬脂酸(18:0),形成油酸(C18:1;δ-9去饱和酶),亚油酸(18:2,n-6,δ12去饱和酶)和α亚油酸(18:3,n-3;δ15去饱和酶)。
不过,某些海洋微生物采取完全不同的生物合成途径来产生EPA和DHA。这些产生PUFA的微生物包括γ蛋白细菌的海洋代表以及少数几种cytophaga flavobacterium bacteroides族和到目前为止的真核性原生生物,Schizochytrium sp.ATCC 20888(Metz等,2001,Productionof polyunsaturated fatty acids by polyketide synthases in both prokaryotesand eukaryotes.Science 293290-293)。它们通过所谓的多酮化合物合酶(PKS)来合成长链PUFA。这些PKSs代表催化由酮化合物(ketide)单位组成的次级代谢产物合成的大酶(G.W.Wallis,J.L.Watts和J.Browse,Polyunsaturated fatty acid synthesiswhat will they think of next?Trendsin Biochemical Sciences 27(9)(2000)pp.467-473)。多酮化合物的合成包含许多与脂肪酸合成类似的酶反应(Hopwood & Sherman Annu.Rev.Genet.24(1990)pp.37-66;Katz & Donadio Annu.Rev.of Microbiol.47(1993)pp.875-912)。
已知不同PUFA-PKSs(PUFA-合成的PKSs)的基因序列。由此,从海洋细菌Shewanella sp.分离出38kb基因组片段含有生产EPA的信息。随后对这一片段的测序导致鉴定了8个开放阅读框(ORFs)(H.Takeyama等,Microbiology 143(1997)pp.2725-2731)。来自Shewanella的这些开放阅读框,其中五个与多酮化合物合酶基因密切相关。同样,美国专利号5,798,259描述了来自Shewanella putrefaciens SCRC-2874的EPA基因簇。PUFA-PKS基因也发现于海洋原核生物Photobacteriumprofundum株SS9中(Allen和Bartlett,Microbiology 2002,148 pp.1903-1913)和Moritella marina株MP-1,早期的Vibrio marinus(Tanaka等,Biotechnol.Letters 1999,21,pp.939-945)。类似的产生PUFA的PKS样ORFs也能够在真核性原生生物Schizochytrium中鉴定(Metz等,Science 293(2001)pp.290-293,US专利No.6,556,583及WO02/083870A2)。在Schizochytrium中确定了三种ORFs,其与来自Shewanella的EPA基因簇呈现部分同一性。在少数原核生物和真核生物Schizochytrium中存在这些保守性PKS基因给出了暗示,PUFA-PKS基因可能在原核生物和真核生物之间进行了水平转移。
即使是使用正常情况下不产生PUFAs的微生物中分离的基因簇对PUFAs进行转基因生产也已经能够得以显示了。因而,存在于来自Shewanella sp.SCRC-2738的簇中的上述五种ORFs(开放阅读框)足以在非IPA生产者大肠杆菌(E.coli)和Synechoccus sp.中生产可测量量的EPA(Yazawa,Lipids 1996,31,pp.297-300和Takayama等,Microbiology 1997,143,pp.2725-2731)。
通常,对于大规模生产PUFAs的新的PUFA生产者总是存在需要。首先这种生产是否发生在,例如,原核生物,原生生物或在植物中并不重要。目标始终是尽可能经济地和以尽可能保护环境的方式大量生产高质量的PUFAs。本发明追求这一目标,因为它介绍了来自特别有效的PUFA生产者Ulkenia sp.的合适的PUFA-PKS基因。
考虑到技术状态,所以本发明的任务是从生产DHA的微生物Ulkenia sp.中鉴定和分离另外的PUFA-PKS基因,其极适于生产PUFAs。此外,应当获得关于这些基因的位置和排列以及它们的调控元件的知识。由此获得的知识,特别是由此获得的核酸物质,应当使得PUFA-PKS基因在同系生物以及在转基因生物中的加强表达成为可能。
通过本发明的权利要求书中所定义的主题解决了这些任务以及其它未曾被明确地说明但可以从本文件初始讨论的联系中轻易得到或总结的其它任务。
1.PUFA-PKS,其特征是它们a.包括在SEQ ID No.6(ORF 1),7(ORF 2),8和/或80(ORF 3)中所示氨基酸序列的至少其中一种,以及具有与它们有至少70%,优选80%,特别优选至少90%和更加特别优选至少99%和最优选100%序列同源性的同源序列,其具有PUFA-PKS的至少一个结构域的生物学活性,或b.包括在SEQ ID No.32,34,45,58,59,60,61,72,74和/或77中所示氨基酸序列的至少其中一种,以及具有与它们有至少70%,优选80%,特别优选至少90%和更加特别优选至少99%和最优选100%序列同源性的同源序列,其具有PUFA-PKS的至少一个结构域的生物学活性。
2.具有10个或更多ACP结构域的根据权利要求1的分离的PUFA-PKS。
另外,本发明在优选的方面涉及这样一种PUFA-PKS,其包含与序列SEQ ID No.6(ORF 1),7(ORF 2)和/或8和/或80(ORF 3)的至少500个直接连续氨基酸具有至少70%,优选至少80%,特别优选至少90%和更加特别优选至少99%同一性的至少一种氨基酸序列。
在另一个优选的方面,本发明涉及分离的DNA分子,其编码根据任一项在前权利要求的PUFA-PKS。
后者优选特征为它编码与序列SEQ ID No.6(ORF 1),7(ORF 2)和/或8和/或80(ORF 3)的至少500个直接连续氨基酸具有至少70%同一性的氨基酸序列。
另外,本发明涉及这样的分离DNA分子,其与来自序列SEQ ID No.3,4,5和/或9的至少500个连续核苷酸具有至少70%,优选至少80%,特别优选至少90%和更加特别优选至少95%的同一性。
在另一个优选的方面,本发明涉及一种重组DNA分子,其包含与控制转录的至少一种DNA序列功能性连接的先前所述DNA分子的其中之一,优选选自SEQ ID No.3,4和5和/或9或其至少500个核苷酸的部分以及它们的功能性变体。
在又一个优选的方面,本发明涉及包含前述重组DNA分子的重组宿主细胞。
在又一个优选的视点下,本发明涉及内源性表达具有至少10个ACP结构域的根据本发明的PUFA-PKS的重组宿主细胞。
另外,在又一个优选的方面,本发明涉及一种生产含有PUFA的油的方法,包括培养这种重组宿主细胞,以及涉及以此方式生产的油。
另外,在又一个优选的方面,本发明涉及一种生产含有PUFA,优选DHA的生物质量的方法,包括培养这种重组宿主细胞,以及涉及以此方式生产的生物质量。
所以,在又一个优选的方面,本发明还涉及根据权利要求15的重组生物质量,其包含根据权利要求8的核酸和/或根据权利要求1的氨基酸序列或与它同源的至少500个连续氨基酸的部分。
本发明在又一个优选的方面还涉及SEQ ID No.32,33,34,45,58,59,60,61,72,74和/或77中所示、来自包含SEQ ID No.6,7,8和/或80的PUFA-PKS的个别酶结构域的用途,用于生产人工多酮化合物,例如,多酮化合物抗生素和/或新的,变化的脂肪酸。
根据本发明,有关核酸的同一性指示在待比较链的特定位置上的相同碱基对。不过,缺口是有可能的。以%计算同一性值的可能性由程序blastn和fasta代表。
就氨基酸而言,概念同源性也包含,例如氨基酸序列的保守性交换,其丝毫不影响蛋白质的功能和/或结构。甚至是这些同源性值也通过本领域熟练技术人员已知的程序,例如,blastp,Matrix PAM30,GapPenalties9,Extension1进行计算(Altschul等,NAR 25,3389-3402)。
来自Ulkenia sp.的PUFA-PKS基因的序列信息可由SEQ ID No.3-5和/或9中所定义的核酸序列和氨基酸序列获得。SEQ ID No.1和2代表目前分离的两种粘粒上的完整基因组DNA序列(见实施例2和3)。后者对其部分包含PUFA合成所必需的三种相关开放阅读框ORFs1-3的信息以及它们的侧翼调控序列。另外,作为其结果提出了能够源自基因组序列的蛋白质序列。
本发明另外包括用根据本发明的核酸对宿主生物进行同源和异源转化用来生产高纯PUFAs的方法。分离的开放阅读框优选导致在同系生物和转基因生物中生产PUFA,特别是DHA,EPA和DPA。
由此生产的PUFAs优选作为生物质量的组分或作为油而存在。
在本发明之前,只有真核生物,原生生物Schizochytrium的PUFA-PKS基因是已知的(美国专利号6,566,583,WO02/083870)。随后测定的序列数据部分源自cDNA和源自染色体DNA。在本发明中首次从染色体DNA完全描述了对于PUFA合成必需的真核性原生生物的所有PUFA-PKS基因。这不仅导致确定了以往未知的来自Ulkenia sp.的PUFA-PKS编码基因信息,还另外提供了关于侧翼调控元件如转录启动子和终止子的数据。此外,染色体序列信息使得深入了解个别PUFA-PKS基因的位置和排列成为可能。
这里完全令人吃惊的是簇同样地不再存在,因为以往知道它是来自原核性PUFA-PKS代表如Shewanella,Photobacterium或Moritella。鉴定的粘粒(Seq ID No.1)一开始显示个别ORFs的线性排列在Ulkenia中被打乱并且还显示个别ORFs的阅读方向是反向的(

图1)。这可能是大段基因转座的结果。作为转座的结果,个别ORFs还清楚地呈现彼此的更大间隔。因而,两个ORFs 1和2具有大约13kb的间隔。第三个ORF直到在另一个粘粒上才能够在此情况下得以鉴定(SeqID No.2)并且在两种粘粒之间(Seq ID No.1和2)没能发现部分同一性(图1)。这意味着来自Ulkenia sp.的ORF在空间上不再位于两种ORFs1和2附近。这作出结论,即PUFA基因簇,已知来自上述原核代表,不再存在于真核生物Ulkenia sp.中。已经部分测定了原生生物Schizochytrium的个别PUFA-PKS基因在基因组上的位置和排列(WO02/083870)并且还显示了两种ORFs A和B的相反方向。不过,它们彼此仅仅分离4224个碱基对。在专利申请WO 02/083870中将这一序列片段讨论为具有双向启动子元件的基因间隔区。至少对于Ulkenia在同源性ORFs 1和2之间的双向启动子元件似乎是不可能的,这是因为对于Ulkenia测定的12.95kb的间隔区。没有其它明显ORFs存在于来自Ulkenia的ORFs 1和ORF2之间的12.95kb区域之内。表明区域中发生了大的重组和/或转座事件。转座酶样事件也能够基于少数重复序列重复而发生。
更加令人特别吃惊的是与EPA生产者Shewanella(6xACP)和Photobacterium(5xACP)的PUFA-PKS以及DHA生产者Moritella(5xACP)和Schizochytrium(9xACP)的PUFA-PKS相比,来自Ulkeniasp.的PUFA-PKS具有最大数目的酰基载体蛋白的重复,有10个ACP结构域(图3)。这意味着分离自Ulkenia sp.的PUFA-PKS相对于来自亲缘性原生生物Schizochytrium的PUFA-PKS不仅具有偏移性氨基酸序列,而且在结构上也是独特的。另一种特性是这样的事实,即来自Ulkenia sp.的第三个ORF相对于来自Schizochytrium的ORF C短了38个氨基酸并且另包含了丙氨酸富集的结构域,该结构域并不以此方式存在于Schizochytrium中(图6)中。令人感兴趣的是,这种序列类似存在于来自ORF 1的个别ACT结构域之间的区域并且可能代表连接区。所述相似性在于序列长度以及丙氨酸连续仅被个别脯氨酸和缬氨酸打乱的事实。相对于Schizochytrium ORF C缺失的ORF 3中氨基酸的最大部分是删除的结果,有30个氨基酸长,位于脱水酶/异构酶结构域之间(图6)。作为结果,这些结构域位于相应的蛋白质上,彼此相距短的间隔,这能够对于酶学活性具有影响。对于ORF 3来说,即使其它的5’位置上的ATG密码子也可作为起始密码子,从而在理论上甚至是最大为1848个氨基酸长的ORF也能够存在(Seq ID No.9和80)。在此情况下甚至同时出现ORF 3的变体也是可能的。
特别地,来自Ulkenia sp.的ORF 1(Seq ID No.3和6)在一方面包含所谓的β酮酰基合成酶结构域(Seq ID No.14和32),其特征是靶标(motive)(DXAC)(Seq ID No.12和30)。Ulkenia ORF 1中酶学结构域的活性中心的靶标能够以优选的形式扩展到17个氨基酸的范围(GMNCVVDAACASSLIAV)Seq ID No.11和29)。完整的β酮酰基合成酶结构域可以分成N末端(Seq ID No.10和28)和分成C末端(Seq ID No.13和31)部分。β酮酰基合成酶结构域的生物学功能是催化脂肪酸和/或PKS合成的缩合反应。进行延伸的酰基基团通过硫酯键结合到酶学结构域的活性中心的半胱氨酸基团并且以几个步骤转移到酰基载体蛋白上的丙二酰基团的碳原子2上,释放CO2。β酮酰基合成酶结构域后接丙二酰CoA-ACP转移酶结构域(Seq ID No.15和33)。此结构域催化丙二酰CoA转移到酰基载体蛋白(ACP)上的4’-phosphopantetheine基团。丙二酰CoA-ACP转移酶结构域也将甲基或乙基丙二酸酯转移到ACP上,期间它们将分枝导入其它的线性碳链上。随后将连接区域后接富含丙氨酸序列的部分(Seq ID No.16和34),该部分包含10个重复的酰基载体蛋白结构域(ACP结构域)(17-26和35-44)。这些ACP结构域对于它们的部分彼此通过连接区域相互分离,所述连接区域主要由丙氨酸和脯氨酸组成。每个ACP结构域的特征是4’-phosphopantetheine分子(LGXDS(L/I))的结合靶标。所述4’-phosphopantetheine分子在这里结合到靶标内的保守丝氨酸上。ACP结构域通过4’-phosphopantetheine基团作为载体起作用来生长脂肪酸和/或多酮化合物链。与酮还原酶具有部分同一性的序列(Seq ID No.27和45)随后接上。这些结构域的生物学功能在于3-酮酰基-ACP化合物的NADPH依赖型还原作用。它代表脂肪酸生物合成中的第一次还原反应。这种反应在多酮化合物合成中也经常发生(还参见图3)。
来自Ulkenia sp.的ORF 2(Seq ID No.4和7)也以β酮酰基合成酶结构域(Seq ID No.50和58)起始,其特征是靶标(DXAC)(Seq IDNo.48和56)。Ulkenia ORF 2中酶学结构域的活性中心的这种靶标能够以优选的形式扩展到17个氨基酸的范围(PLHYSVDAACATALYVL)Seq ID No.47和55)。完整的β酮酰基合成酶结构域可以分成N末端(Seq ID No.46和54)和C末端(Seq ID No.49和57)部分。此结构域的生物学活性对应于ORF1中所述的β酮酰基合成酶结构域。Kethosynthases在延伸循环中发挥关键作用并且显示了比脂肪酸合成的其它酶更高的底物特异性。这再次后接与β酮酰基合成酶结构域具有较小部分同一性的序列片段。另外,这一结构域缺少用于活性中心的靶标DXAC。它具有来自II型PKS类似系统的所谓链长因子(CLF)的特性(Seq ID No.51和59)。CLF氨基酸序列与酮合成酶具有部分同一性,但是没有具有相应的半胱氨酸基团的特征性活性中心。PKS系统中的CLFs的部分目前正以争论方式进行讨论。最近的结果指出CLF结构的部分在于丙二酰ACP的脱羧作用。产生的乙酰基随后可以结合到β酮酰基合成酶结构域的活性中心上并且因而代表了起始缩合反应的所谓引动分子(priming molecule)。还发现CLF同源性序列作为分子PKS系统中的负载结构域。具有CLF序列特性的结构域存在于所有先前已知的PUFA-PKS系统。这后接酰基转移酶结构域(Seq ID No.52和60)。这种结构域催化许多酰基转移如从酰基转移到辅酶A或转移到ACP结构域。来自ORF 2的终止结构域显示与氧化还原酶的部分同一性(Seq ID No.53和61)并且很可能代表了一种烯酰基还原酶结构域。烯酰基还原酶结构域的生物学活性存在于脂肪酸合成的第二次还原反应中。它催化脂肪酸酰基ACP的反式双键的还原(也参见图2)。
来自Ulkenia sp.的ORF 3(Seq ID No.5和8)由两种脱水酶/异构酶结构域(Seq ID No.66,68,72和74)组成。两种结构域都包含“活性位点”组氨酸,直接相邻半胱氨酸(Seq ID No.67和73以及Seq ID No.69和75)。这些结构域的生物学功能是反式双键插入到脂肪酸或多酮化合物分子中,伴随着H2O的分解和双键随后转化成顺式异构形式。第二种脱水酶/异构酶结构域并入丙氨酸富集区(Seq ID No.70和76),所述丙氨酸富集区没有已知的功能但是可能代表连接区。这后接烯酰基还原酶结构域(Seq ID No.71和77),其与来自Ulkenia的已经存在于ORF 2中的烯酰基还原酶结构域具有高度部分同一性。它的生物学功能对应于上面已经介绍过的烯酰基还原酶结构域(也参见图2)。
优选在来自Ulkenia sp.的ORF 1起始ATG密码子前面给出2000bp(Sequence ID No.62)作为启动子序列。它们特别优选1500bp,更加特别优选1000bp在起始密码子之前。
优选可以在终止密码子TAA之后给出2000bp(Sequence ID No.63)作为ORF 1的终止序列。特别优选1500bp,更加特别优选1000bp在终止密码子之后。具有碱基序列AATAAA的ORF 1的mRNA合成的潜在终止信号存在于终止密码子TAA之后的412bp。
优选在来自Ulkenia sp.的ORF 2起始ATG密码子前面给出2000bp(Sequence ID No.64)作为启动子序列。它们特别优选1500bp,更加特别优选1000bp在起始密码子之前。
优选可以在终止密码子TAA之后给出2000bp(Sequence ID No.65)作为ORF 2的终止序列。具有碱基序列AATAAA的ORF 2的mRNA合成的潜在终止信号存在于终止密码子TAA之后的1650bp。
优选在来自Ulkenia sp.的ORF 3起始ATG密码子前面给出2000bp(Sequence ID No.78)作为启动子序列。它们特别优选1500bp,更加特别优选1000bp在起始密码子之前。
优选可以在终止密码子TAA之后给出2000bp(Sequence ID No.79)作为ORF 3的终止序列。具有碱基序列AATAAA的ORF 3的mRNA合成的潜在终止信号存在于终止密码子TAA之后的4229bp。
PUFA,例如,DHA可以在Ulkenia sp.中进行同源生产,此外还可以在宿主,例如,大肠杆菌中利用本发明测定的序列信息进行异源生产。根据本发明的核酸序列可以用来提高PUFA的产量,其中它们被用来,例如,提高生产PUFA的生物中PUFA-PKS基因的数目。自然地,甚至是个别核酸片段,例如,编码ACP结构域的序列片段也可在同源或异源生产生物中进行扩增。特别地,ACP结构域呈现自己提高生产,因为辅因子4-phosphapantheteine的结合位点对于PUFA合成是必需的。自然地,即使是不同调控元件,例如,启动子,终止子和增强子元件的使用也能够导致经遗传修饰的PUFA生产者内产量的提高。在个别序列片段中的遗传修饰能够导致获得产物结构的变化并且因而导致不同PUFAs的生产。另外,PUFA合成酶与多酮化合物合酶的相似性使得混合系统的构建成为可能。这种所谓的组合性生物合成允许新的人工生物活性物质的生产。例如,通过PKS-和PUFA-PKS单位的混合系统在转基因微生物中生产的新型多酮化合物抗生素是有可能的。
适于这里给出的PUFA基因的异源表达的宿主除了大肠杆菌之外为,例如,酵母如酿酒酵母(Saccharomyces cerevisiae)和毕赤酵母(Pichia Pastoris)或者丝状真菌,例如,构巢曲霉(Aspergillus nidulans)和Acremonium chrysogenum。通过将根据本发明的基因导入,例如,大豆,油菜,向日葵,亚麻或其它的,优选富含油的植物中来生成生产PUFA的植物。为了PUFA基因的有效异源表达,甚至也可以使用其它的附属基因,例如,4-phosphopantheteine转移酶。另外,可以使用宿主特异性启动子/操纵系统进行加强的或可诱导的基因表达。
可以使用多种原核表达系统进行PUFA的异源生产。可以构建除了相应的PUFA基因之外还包含启动子,核糖体结合位点和转录终止子的表达载体。将大肠杆菌色氨酸生物合成的启动子/操纵子区和λ噬菌体的启动子引证作为大肠杆菌中这些调控元件的例子。同样地,可以将可选择的标记,例如,对氨苄青霉素、四环素或氯霉素的抗性用于合适的载体上。对于大肠杆菌的转化非常合适的载体为pBR322,pCQV2和pUC质粒以及它们的衍生物。这些质粒可包含病毒以及细菌元件。可以使用每种源自大肠杆菌K12的菌株,例如,JM101,JM109,RR1,HB101,DH1或AG1作为大肠杆菌宿主菌株。自然地,所有其它惯用的原核表达系统也可以用于异源PUFA生产(还参见Sambrook等)。还可以使用生油(oil-building)细菌作为宿主系统。
可以将哺乳动物、植物和昆虫细胞以及真菌,例如,酵母用作真核表达系统。对于酵母系统来说,可以使用来自于糖酵解酶基因的转录起始元件。这包括乙醇脱氢酶,甘油醛-3-磷酸脱氢酶,phosphoglukoisomerase,磷酸甘油酯激酶等的调控元件。不过,即使是来自基因如来自酸性磷酸酶,乳糖酶,金属硫蛋白或葡糖淀粉酶基因的调控元件也可以使用。这里还使用允许加强的或可诱导的表达的启动子。可由半乳糖诱导的启动子(GAL1,GAL7和GAL10)也是令人特别感兴趣的(Lue等,1987 Mol.Cell.Biol.7,p.3446 ff.和Johnston1987 Mircobiol.Rev.51,p.458 ff.)。3’终止序列还优选源自酵母。由于紧邻起始密码子(ATG)的核苷酸序列影响酵母中基因的表达,还优选来自酵母的有效翻译起始序列。在使用酵母质粒的情况下,它们包含来自酵母的复制起点并且包含选择标记。这种选择标记优选是营养缺陷型标记,例如,LEU,TRP或HIS。这种酵母质粒是所谓的YRps(酵母复制性质粒),YCps(酵母着丝点质粒)和YEps(酵母游离质粒)。没有复制起点的质粒是Yips(酵母整合质粒),其用于整合转化的DNA至基因组中。特别感兴趣的是质粒pYES2和pYX424以及pPICZ质粒。
如果将丝状真菌,例如,构巢曲霉用作异源PUFA生产者,也可以使用来自对应生物的启动子。可以将用于加强表达的gpdA启动子和用于可诱导表达的alcA启动子用作实例。优选使用酵母质粒如pHELP(D.J.Balance和G.Turner(1985)Development of ahigh-frequency transforming vector for Aspergillus nidulans.Gene 36,321-331)和可选择标记如ura,bio或paba用于转化丝状真菌。甚至优选来自丝状真菌的3’调控元件。
通过杆状病毒表达系统可以在昆虫细胞中生产PUFA。这些表达系统可由,例如Clonetech或Invitrogen商购。
可以将载体,例如,来自土壤杆菌的Ti质粒或完整病毒如菜花样花叶病毒(CaMV),双粒病毒,番茄金黄花叶病毒或烟草花叶病毒(TMV)用于植物的转化。优选的启动子为,例如,CaMV的35S启动子。对于植物转化的其它可能性为磷酸钙法,聚乙二醇法,微注射,电穿孔或原生质体的脂染。还优选通过用DNA带电微粒轰击(基因枪)进行的转化。植物中备选的PUFA生产源自叶绿体的转化。例如,N末端引导肽使得蛋白质在叶绿体中的转运成为可能。优选的引导肽源自核酮糖双磷酸酯羧化酶的小亚基但是也可以使用其它chloroplastidary蛋白的引导肽。叶绿体基因组的稳定转化提供了另一种可能性。对此尤其可以考虑生物导弹法还可以考虑其它方法(Blowers等Plant Cell 1989 1 pp.123-132,Kline等.Nature 1987 327 pp.70-73和Schrier等Embo J.4 pp.25-32)。
对于哺乳动物细胞还可以使用可以商购的表达系统。其中,可以使用病毒性或非病毒性转化和表达系统,例如,慢病毒或腺病毒系统或Invitrogen的T-Rex系统等作为例子。同样,来自Invitrogen的Flp-In系统,可以用于哺乳动物细胞中DNA的目的性整合。
下面利用几个实施例介绍构成了根据本发明方法基础的核酸和氨基酸。不过,所述序列和本发明并不限于这些实施例。
附图简述图1描述了来自Ulkenia sp.的PUFA-PKS基因在基因组上的位置。另外,显示了由这些基因编码的PUFA-PKS的个别结构域。KS酮合成酶,MAT丙二酰-CoA:ACP酰基转移酶,ACP酰基载体蛋白,KR酮还原酶,CLF链长因子,AT酰基转移酶,ER烯酰基还原酶和DH脱水酶/异构酶。
图2显示来自Ulkenia sp.的ORF2和ORF3与来自Moritellamarina(GenBank编号AB025342.1),Photobacterium profundum SS9(GenBank编号AF409100),Shewanella sp.SCRC-2783(GenBank编号U73935.1)和Schizochytrium(GenBank编号AF378327,AF378328,AF378329)的相应同源性ORFs的比较。在进化过程中个别ORFs之中和之间的基因转座也在结构域结构旁边指出。
图3显示来自Ulkenia sp.的ORF1与来自Moritella marina(GenBank编号AB025342.1),Photobacterium profundum SS9(GenBank编号AF409100),Shewanella sp.SCRC-2783(GenBank编号U73935.1)和Schizochytrium(GenBank编号AF378327,AF378328,AF378329)的相应同源性ORFs的比较。强调了ACP结构域和氨基酸连续LGIDSIKRVEIL重复的数目。
图4包含了来自Ulkenia sp.的ORF1与来自Schizochytrium的ORF A的序列比较。两种序列的部分同一性的程度为大约81.5%。
图5包含了来自Ulkenia sp.的ORF 2与来自Schizochytrium的ORF B的序列比较。两种序列的部分同一性的程度为大约75.9%。
图6包含了来自Ulkenia sp.的ORF 3与来自Schizochytrium的ORF C的序列比较。两种序列的部分同一性的程度为大约80.0%。
图7描述了由FASTAX进行的,实施例1中所述PCR产物与数据库序列(Swiss-PROT全文库)的序列比较。
图8显示了用于生产来自实施例2的粘粒库的Cosmid SuperCosI(Stragagene)的载体图(card)。
图9描述了由BLASTX进行的,实施例3中所述PCR产物与数据库序列(Swiss-PROT全文库)的序列比较。
实施例实施例1从分离自Ulkenia sp.SAM2179的DNA扩增PUFA-PKS特异性序列1.1包含编码PUFA-PKS的基因的基因组DNA的分离在250ml带有阻流板的Erlenmeyer烧瓶中用Ulkenia sp.SAM2179接种50ml DH1培养基(50g/l葡萄糖;12.5g/l酵母提取物;16.65g/l Tropic Marin;pH6.0)并于28℃和150rpm培养48h。随后用灭菌自来水洗涤细胞,离心下去并将细胞沉淀物冷冻于-85℃中。为了进一步的检查(workup),随后将细胞沉淀物转移入研钵中并以研棒在液氮下粉碎成精细粉末。随后,将大约1/10研成粉末的细胞材料与2ml裂解缓冲液(50mM tris/Cl pH7.2;50mM EDTA;3%(v/v)SDA;0.01%(v/v)2-巯基乙醇)混合并于68℃温育1h。随后加入2ml苯酚/氯仿/异戊醇(25∶24∶1),搅动并于100000rpm离心20min。在除去上层水相后,将后者转移入两个新的反应容器中,每个600μl,并且分别再次与600μl苯酚/氯仿/异戊醇(25∶24∶1)混合,搅动并于13000rpm离心15min。随后将特定上层相每个400μl转移入新的反应容器中并在每种情况下加入1ml乙醇(100%)后倒转两到三次。随后,将沉淀的DNA缠绕在玻璃棒上,用70%乙醇洗涤,干燥并溶于50μl蒸馏水中。将以此方式提取的DNA与2μl RNase A混合并保存于4℃待用。
1.2利用靶标特异性寡核苷酸进行PCR反应将PCR引物MOF1和MOR1用作靶标特异性寡核苷酸。
MOF15’-CTC GGC ATT GAC TCC ATC-3’(Seq ID No.81)MOR15’-GAG AAT CTC GAC ACG CTT-3’(Seq ID No.82)。将在上面1.1段中所述的来自Ulkenia sp.SAM2179的基因组DNA稀释1∶100。随后将2μl的这种稀释液转移入50μl体积的PCR反应混合物中(1x缓冲液(Sigma);dNTPs(每种200μM);MOF1(20pmol),MOR1(20pmol)和2.5U Taq-DNA聚合酶(Sigma))。在下列条件下实施PCR起始变性94℃ 3min,随后为30个循环,每个循环于94℃ 1min,55℃ 1min,72℃ 1min,和最后8min 72℃。随后通过凝胶电泳分析PCR产物并通过T/A克隆(Invitrogen)将具有合适大小的片段插入载体pCR2.1 TOPO中。在转化大肠杆菌TOP 10F’之后,分离质粒DNA(Qiaprep Spin,QUAGEN)并进行测序。
将获得的序列数据与官方EMBL核苷酸序列数据库(http://www.ebi.ac.uk/embl/)相比较并进行评估。用FASTAX获得的序列比较对于来自Ulkenia sp.SAM 2179的PCR主要产物与来自Schizochytrium sp.ATCC 20888的PUFA-PKS(ORF A;ORF开放阅读框)的酰基载体蛋白产生部分同一性,其在氨基酸水平上为大约90%(图7)。令人吃惊的是,为了确定在Ulkenia sp.SAM 2179中的这种PUFA-PKS,仅须实施单次PCR实验。这说明所用寡核苷酸的特别高的效力。
实施例2由来自Ulkenia sp.SAM 2179的基因组DNA生产基因组文库在500μl体积中以2.5U Sau3AI于37℃ 2min将来自Ulkenia sp.SAM 2179的50μg基因组DNA部分裂解并且接下着立即用相同体积的苯酚/氯仿进行沉淀,随后用乙醇沉淀并溶解于蒸馏水中。随后根据生产商的说明书用SAP(虾碱性磷酸酶;Roche)将Sau3AI裂解的基因组DNA去磷酸化。随后通过将该反应加热20分钟至65℃来进行酶的灭活。将粘粒Supercos I(Stratagene,图8)用作载体。将10μgSupercos I用XbaI于37℃完全裂解几小时。随后将酶于65℃加热灭活20min并且根据生产商的说明书用SAP(Roche)将剪切的粘粒去磷酸化。在这里也通过将该反应于65℃加热20分钟进行酶的灭活。随后用BamHI于37℃将XbaI裂解的和去磷酸化的Supercos I粘粒完全裂解几小时。随后将剪切的粘粒DNA用苯酚/氯仿进行沉淀,用乙醇沉淀并接下来溶解于蒸馏水中。为了进行连接,将1μg用XbaI和BamHI裂解的粘粒DNA,和3.5μl Sau3AI裂解的基因组DNA组合于20μl的体积中并用T4连接酶(Biolabs)根据生产商的说明书连接几小时。随后根据生产商的说明书利用Gigapack III XL Packaging Extract(Stratagene)将大约1/7的连接物包装在噬菌体中。随后将后者用于转染大肠杆菌XL1-Blue MR。随后以PCR筛选的形式由QIAGEN公司(Hilden,Germany)由基因文库中进行PUFA-PKS特异性粘粒的分离,所述PCR筛选利用Ulkenia-PKS-特异性寡核苷酸PSF25’-ATT ACT CCT CTCTGC ATC CGT-3’(Seq ID No.83)和PSR25’-GCC GAA GACAGC ATC AAA CTC-3’(Seq ID No.84)。随后对由此确定的粘粒克隆C19F09的粘粒DNA进行分离和测序(Seq ID No.1)。
实施例3来自Ulkenia sp.的ORF3的鉴定为了鉴定来自Ulkenia sp.SAM 2179的ORF,寡核苷酸源自不同PUFA-PKS的高度保守的序列片段。令人感兴趣的是,对于PCR扩增似乎合适的非常高的部分同一性出现在个别物种之间编码脱水酶/异构酶的序列片段区域。
3.1包含编码PUFA-PKS的基因的基因组DNA的分离参见实施例1.13.2利用PUFA-PKS-特异性寡核苷酸进行的PCR反应将下列PCR引物用作PUFA-PKS-特异性寡核苷酸CFOR15’-GTC GAG AGT GGC CAG TGC GAT-3’(Seq No.85)CREV35’-AAA GTG GCA GGG AAA GTA CCA-3’(Seq IDNo.86).
将在上述3.1段所述的来自Ulkenia sp.2179的基因组DNA稀释到1∶10的比例。随后将2μl这种稀释液转移入50μl体积的PCR反应混合物中(1x缓冲液(Sigma);dNTPs(每种200μM);CFOR1(20pmol),CREV3(20pmol)和2.5U Taq-DNA聚合酶(Sigma)。在下列条件下进行PCR94℃初始变性3min,随后30个循环,每个循环于94℃1min,60℃ 1min,72℃ 1min,和最后8min 72℃。随后通过凝胶电泳分析PCR产物并通过T/A克隆(Invitrogen)将合适大小的片段插入载体pCR2.1 TOPO中。在转化大肠杆菌E.coli TOP10F’之后,分离质粒DNA(Qiaprep Spin,QUAGEN)并进行部分测序。
将获得的序列数据与官方EMBL核苷酸序列数据库(http:∥www.ebi.ac.uk/embl/)相比较并进行评估。用FASTAX获得的序列比较对于来自Ulkenia sp.SAM 2179的PCR主要产物与来自Schizochytrium sp.ATCC 20888的PUFA-PKS合成酶的ORF C产生部分同一性,其在氨基酸水平上为大约80%(图9)。令人吃惊的是,为了确定在Ulkenia sp.SAM 2179中的这种PUFA-PKS,仅须实施单次PCR实验。这说明所用寡核苷酸的特别高的效力。随后以PCR筛选的形式通过QIAGEN公司(Hilden,Germany)由实施例2中所述基因文库中分离PUFA-PKS特异性粘粒,所述PCR筛选利用已经用于PCR的寡核苷酸CFOR15’-GTC GAG AGT GGC CAG TGC GAT-3’(Seq ID No.85)和CREV35’-AAA GTG GCA GGG AAA GTA CCA-3’(Seq IDNo.86)。随后对由此确定的粘粒克隆058G09的粘粒DNA进行分离和测序(Seq ID No.2)。
序列表<110>努特诺瓦营养产品及食品成分有限公司(Nutrinova Nutrition Specialties and Food Ingredients GmbH)<120>来自ulkenia的PUFA-PKS基因(PUFA-PKS Gene aus Ulkenia)<130>SCT064799-47<160>86<170>PatentIn version 3.1<210>1<211>43372<212>DNA<213>Ulkenia sp.
<400>1ggatccacag cgttcattta ctcaagatca cactcgtgtg cagtccttga accttgggaa60agctcatgtc tctaggtatt gctgtcatgg tttgaaattt tgtcctcaaa agaatcgctt120gtaatttttc acttggtggg gtgcacaatg gtctctcaga accatctgct ctaaggagtc180ctactgacac ctacctacca cccttccttc atacccatgc ctactaacca acctattgat240aactctaacc agggttctat gataggcaaa tcagccaatc tcccgtggaa attagtcttt300tcaatcgttg gccagcaagc accatcgcaa cgacagcgct gcatcagcag gaactcgagt360acgcttcacc gtcatcgtca tcggtatcac cactattcat gaaatcagaa cctagtcacc420cagttacttt ttacgaggca gttgattctg tggagagatg ctcctgatca atggatatgt480ctattttatc tacaggtcac acataatcaa tcattcgggg tcatgatttt ccgccatggc540gatagtccaa aaaaactcag gaggcaaaat cattgttcaa tttacaacta cccacggagt600aaattaatgt aagagctcca atttacaggc aggtatatca tcacggtgtg ctgcagtagg660ttctgggtta tcatcctcaa tcattcataa acataacatt cattcataaa cataacattc720attcattcat aaacataaca ttcattcatt cattcactca ttcactcatt cattcattca780ctcattaatc cgcttaattt aactttaaat tgattgattg attgattgat ggcagaacca840cctattagca attggttact ccttgtattg aaaggcctga ataagtaagc aagcaagcca900ttggtaaacc ttcctcgccg cgactcgagc gacctcgaga gcggtctgag tgagtctctc960acgcaggccc cccgcctcct gagccgtctg tctcgctcaa ctgaagctcc gacaagccaa1020gctcacagct gcaagcttgc aagcaagctc gcttctgtct actcgtcctg catcgaatca1080acaaccttct cttacgccat gacggacgcc tcttccgaga tgcgcaagcg taagcgctac1140gcataccgca tcctcactga tgagtcatcc tcctcccatg caccctctgc tgaggatggt1200tccgtgcagg actctcgtat gctccgccat gccggcagca tctgggatgc cgaagagcgc1260
cgccgcgctg gcaaaatgtc ctcttccgca actgcagcca tgtccagtgt acctcctgga1320gaggaactct ggcttgtgtc tatccctgcg gacttcgacg cccatgacct caatggcctt1380cgcctgtctg ggaagaagcc cctcgcggac caagaaatcc aaattggcgc tacccacacg1440ctcactgctg acctgctctc gggctcttct caggtgcggt gcctgcgccc tactagctcc1500tatgtcaacg gcctgaggct tacaccgcct gccgcgcgtg ttttccacgt cgtagagcgt1560gatgccgctg atgatgaggc cagtgaagcg ggaggcagtg cccaagagga ggaggagcgc1620ctgcgcaagg ctgaagaggt cgtcaagaga cttttgccga agccgcgtga gcaaattgaa1680tttaggactt tttctatggc cgacaaagag gaactgctga agcgcatgca aaaggcaaag1740gcgcgtggag agaagaagag gggcagaaac gcgattaagg aagaagcaga agacgaggag1800gacaaggagg aagagaagtt ggtggccaag acagcaaaga aggacaagaa gaagggcaag1860aaggaaaagg agaaaaggcg caagtctgtg gcctgagctg gaaacccctt taaagtgaat1920aaaggctgtc ttgacatgtt caagaacgct tattcgatac atgaagacgt gctctggggt1980tatttcgatg aagcctgatc taaatactag tctgcttcag aatcatgcac agtgttcaaa2040ttgattctta actacagcct acgctgaagt tcagcttcaa attttggtct attttgaagt2100tcttcaccga aagtcatttc tagagtcccg ccccaaagtc tgatctacac tctctactcc2160attaccgcta atatccttta caactcttat ctttttcgac ttcttcaagc gctaaggagc2220ggaccactaa actgatgcaa gcttgcatca actctacgac cttttttatg tcaacacaag2280ttctggcctt acgctgaact cgtctctgat acacaatatg caacgaacac cgccaagacg2340gtcgctcatg cacatacgca cacatatata caaccaaaca tacaaataaa cacataagca2400ttggtcaagc cagctacagg accaatattc catcttttgc tgcttttctg caatttgggc2460cgctttttta tgtttggctg tatatatttt tcttggcatg caacctaaca agacacatga2520gcagaaaaaa taaatacggt caaagtcttg tctctgatgc tcatgtcttt cttctaatct2580taccagcgag aagacctttc taaagaataa tatcacatat actcaattgt ccaaattgct2640ttcaataagc attctttact ggatagctct cgccaaactg tcattcttag gaacactgct2700aatacgtggc tgaaagcact cccaacatgc acttttattc ctatgcattt tcttcttgga2760gctcaatttg acaaaatgcc ggtcgataag ctcgcggtct tgactttgat gcttacttcc2820ttgtttaact cgaaaacctt ctcatggctc attggaaaat catcaaatgg attatctatc2880atcttcactt aacccaattt ttgtttctct aaaacagccc caactatttt ttaaagaaat2940ttgtgtgctc tatcttctgt ttgcaactca aactaacaag ccacatcaac aaacatttat3000ttttttcaaa cttgataact ttagaccaac tttgcatcct cgatgctcgg gactccatct3060
taccccttgt caggtatgaa gcatctgatg aagcttgcag tattattacc ttttccagaa3120cactactgct accttcaaag atttgttcat ttcttttctt tgggggaaac aatgaatgct3180gattacccga agcgtaatat ggttgttgca tatattcaaa tattttaaac cttctaagta3240tttatatgat aggtatatgt tatttttaaa gacctttaat gcagttattt catatcaata3300accaagctct cgcagttttg cgctgtactg gcagtggtgg aggacccgtt gatctttata3360aaataggatc actggaggaa ggtgagacca ggaaactaag actatataag tttgtgggtt3420tctgtcattg tcactgacaa ggatcaaagt tatcctaatg cagagcatcc aacctttgtc3480tcagggaccc acccaatcca ctcttcaagt tttcactttc aatttcaggc caatttaaga3540caggaataca actcaaacta aatcaggatt cttctttttt aactcccagt catgcgatct3600ttaaaattga tcacattgcc ggcataataa ccatgggttt cgcaacttcc tccctggttt3660ctttgccaaa taaaacttcc acacactcga gagcaaactc cattgccgtg ccaggccctc3720tagacgtcac aattttggcc tcgtgctcaa ccaccacgcg atcctctgaa catcctcctt3780gagcgctctc aagatctttc gcaaatgccg gatggcacgt ggctctgcgc cctttcacaa3840tacccaaagg tgctagcacc accgctggag cggcgcaaat tgcggcaacc caggctccgc3900gcgagttctg tgccaacaat agcgagcgga gaggctcgct cgcggcgaga tttgaagcgc3960cgggcatccc gccaggaacg attacgaggt cgaaagatgg agatgtagaa ctggcatcca4020ggaggtcatc caaacgaaca tcagcctcaa tacgcacgcc tcgagaacag gtgagggttt4080ttccgctatc gcaaacagcg gcaacaatca cggaggcacg ggctctgcgc aggacatcaa4140tcggaatcac gctttccatc tcttcactgc catccgccat gacaaccagc acggacggtg4200gggaggaagg ggaagaagac atcgtcgaat tatgggaaac gtcgagactg gagcaagcgg4260gggcgattgt ttaagcgagc acaaagtgac gaggaattga gttacaatgt gaatctatag4320ataaataggt acctgtgcct tgcgacgaca gaaagatatt ttctcataat aggcctatct4380aaaaccaata attttgaaca ttttcatcat tgacgaaaag ctcctgcctt ccaaattgga4440agtgactatc cttaatatag tgcaataacg cattggacca aacagaatcc tcctggaggt4500gaccaccatg ttaggacctt gaacttcgca attgattggt ttcgaccttt tctccctcct4560tttataaaat aagcggctca aattaattag cctatcacgg tttctctagt ttttgggggt4620ttcgctatta tttggttatt atgaacaaat gtacagcttc ttacttacca gcctcctcgt4680tcagcatggt gaatgcatga aataaggaat caacttcatg actcatgctc tgcgtacaac4740attagattat ttttgcatgt ggtgttgaaa gtaagtcttc aagtcttttt cgtcaggata4800aaaactttct ttcatttgaa gttgtatgca agtcgcacca agatgtgatg actattttgc4860ttttcattaa ctttcctttg cagcaaaaaa gctctgtgcc tatgaaagcg ttagaactta4920
cttatataac ctccaaatgg tagtgactat tccacctaaa ttacatatca taatgattta4980agtctttgtt aaaaagtgga tgtttggtaa gaaactggaa taactaaggg accactaagc5040tccagacact acaagtgaag caaatcttca atttaaatta tcaaagtact tcaaccaaaa5100ttttagcgtc tcaacaagta cccttcgtgt gctatcccgg aggcaatcac atgtgcacaa5160gtaacgatgt tgaacgtacc tatggctctg gtttattttg gcagccatga gcaacgcaac5220actgaccgta tctttctcta cgctacaatg tcctccgcca agcaaaaaga gaatatccca5280gctcatttgc aaagccgaga ttttattcct gccagtggtg tcaactggtc atttacggag5340aggattgcac ttcaaagcca tgcaatgaat gtggtattat ccacgacaat cttggaaaat5400ccaagctttt aaaatgcccc aaaaccatgc aaacacgtag ccgatcgtga tatccacgcc5460ctccagctgc gccacctatc caaggacatg gtttaagaat tgtcgtttgg tcatatgtta5520gttttcaacc cgcaattggg ccttagtcca ccttgttacc ataggaaatg caagctttgc5580aaattttgta ggctaatctc taagtgtagc ttttgtcatt gtaaagacac aattcattga5640catgaggttg aaagctgttc tcatatgtaa caatccgcaa cattgactac gtcacatgtt5700cgtgcataga gggaacactt atcttgcata gtatgccctc acaactctcc tcccccgtac5760agcaatcgca cgcaccatca tttattcaaa tgagacaata cttgctatcg tcccgattgc5820tctttagttg gacatagaac taaatgcgcg tcgcgatgcg accggaaagg tttaccagca5880gactgttctg caatcgttcc gtaccctatt tcacaacatt agtcgatcga tcagaacaaa5940tcaagataga acctgcagga ggggtcgcgc aaagtttagg cacccaggca cagccgctct6000gtaagtggat tttcattcaa ttgtggtcct gtgcattcat tgtttgctcg tgtagcaaat6060agaaccacaa ggggttttgc agaaagaaaa caaggatcat ggggcgaaac cgaggccaga6120cggcgggacc actcgaccgc cagtcgaggt tcatgaccaa ggttctgcgg caccgcgcgg6180cagacatggg tcttgaaatg cgttcagatg ggtttgtgcg cgtagaagac cttctgaaac6240ttcagcaact taaagacatt ggccttgagg atgtcaaagc tattgttgct gctgataaca6300aacagcgatt tggccttcag caggaagagg accagacctg gtggattcgt gccaaccaag6360gtcactctat ggctagtgtc gagacagaag atcttcttga ggaggttgac ctcgatggga6420tttctctctg tttgcacggc acctatttgc ggttctggcc attgatagta cgcgatggtt6480taaagcgtat gcaacgtaac catatccact ttgcaacagg ccttcccggg gacgatggtg6540tccttagtgg atttcgcaac tctgctgagg tgcttattta tcttgatacc gtgcaggcga6600aaaaagctgg actcaaaatg tatcgctctg caaaccaggt gctcctaagt ccaggtcttg6660gcgacagtgg agtaatccct gtcaccttgt ttgctaaggc tgtcgagcgc cgctctggaa6720agctactttg gccaatagag gaaggtaaag agtcgcaacc ccctacagcg cctacttcag6780
accaccaacc tcgacaagga caactagcaa gtaagcgaaa agctggtggc cacaacaaga6840aactatcgca catgcttagc cgtgtcctgc ggcactctgc agttgatgaa ggaatcacca6900ttcgtgaaga tggcttcgtg cgccttgaag atctccaaac caaactcaag cgtttcgaaa6960atgtaactct tgatgacgtt caagctgtgg tgcgtgacaa tgacaaacaa cgcttcacac7020tacgccagga gtcagacggg tcctggatta ttcgcgcaaa ccaaggtcat tccatggctg7080ttgtcaaaga atcttttctc ttgcgggaac ttgaccctac cacaattgat gtgtgtcttc7140atggtactta caaagaagct tgggcaaaga ttcgaaaaac tggtctctcg cgcatgaacc7200gaaaccatat tcactttgct cgtggattgc cctccgactc caatggtgtt atcagtggca7260tgcggaaatc atgcgaagta catctctata ttgatgcctc tgcagcaggc aaagatggga7320ttaaattctt tgaatctgac aacggtgtta tcttaagtcc tggtaatggt gatggcatta7380tccctcctaa atactttaag tctgtcacag atcgccaagg cgcttcctta gaaaacctaa7440aatgacaaat tatgtagatc ttagttgttg aggacttcat gtcctttttg ttgtttgatt7500ccttgtatag cttatacacc ctggttatgt acattgtcat tcttgttaga ggcaattctt7560catctttgat tgatattcta tagaacttcc tcatgggtgt acctatacac aattatttat7620tataccgtgt gatattgtga ggttctaaag ttagcatcgc ctctgacacc tatgatggat7680gcagagtgac gccaatcctt cctctatatt gtgcgtgcct gctcgagaat caaatgatgt7740taaaagtcgt cttcattcat tatataacag agcataatgg aataataaaa ggaggcagga7800gacaagggta cttctgttgt gtaaaattcc attactatgt tcgtgtatag tagtattcct7860tgcctttagg atagtaggga agatattctc tgtgactttc acctacttca ctcttatgca7920agctcttatg caatcacaga tggatgtaga ttccgcttct tcattctcac tacgagaaca7980gcgcaactac aaatcttaag gactgtcaac tggcctgaaa tagtgaccaa ttatatattc8040caaaataaat ttatttgtat aaaattgtaa agatgcagca tgatagctta ggtacacata8100aacaacggtt aagtgtatag ggatacgcaa acgcaagcga gaacatgcaa gcgagaccat8160cgcctttcac cataatgtta taaatgtcta ttcttctgcc aagagcacga tacactcaac8220gttggtctaa gcactaaaga cagcatgtat ttatgtaagg acaacaacaa gcacctatac8280ctcaaaactt agtaataggc ttactaaaca ttctaacact atgatcttca tgtgaaaata8340ctcagcagca tggatgttga agctccacaa atggaataca gaaaacacaa tctagcaaga8400cgatgaaaat tgttcttagg tttcaggatc agaataacca aaatgcgcac cacacctgtt8460tctgatgctg tagctgtcat gttatggtaa aaacgtgcac agggcaccac tagcctgtta8520ttgtgtcgat tttgatacag tttatcacac gagagcttac tgactatgtt gtagaatgta8580aataccctat tcaaataacc ttgtggacac actcatccaa catactctac tcaactctta8640
ctaaaacaac caaaagattc cgctgaacta gaccaaaata atttgagtga tatgctgcaa8700ttcgtttgaa cacaatacat gtattgatgg ctgagatatg acttgccaaa gattgttcgt8760tgcaattaaa gtttactctc tgagtgcata tactcaatac aatgcagctt tatcgtggaa8820atccgggcta agcatgccat taggacccta tagcaggctc tgggcacgat ctttatatct8880tagcgatagt ttgtgcagca aaataatgga taaatcaaac ttcaacgagt cttaattcat8940agtttcgaat ccctacgagg ctatatatat aaagaaggtg tgagtcgaca gcacagttat9000gtaggaaaag ttataattat gtggaaaata accttagttg tcgaatcgtg gtgaataaaa9060gcttcattta agcgttttca gagatgccgg agcccatacc aaatattaat ttgctcaaag9120tcatcaattt cttatttgat agaatctaaa acagctttat attatatgaa gagcatatat9180attttaagct agtttagact tcaaccaagg ggatccaatt ttcgctcgtc actctgcgtc9240aaggtcgttt gcaaaaacat caaatctggt gcaagctcaa atgactaggg tcaataagga9300ctcctactaa ttatagttgt cactattatt tccactagga accgataaaa cagatgtaat9360taactctctt ggcgcttacc ttgtatagca agagtaaaga gtaaatgatg cggcaaaaac9420tatctctgtt acttatatgt tatagagtgc attggctgcg ccatgccata tgatagtagg9480taaactttgg aagttgaaag gggcgagaaa gggatcacag gtgatctata tataaaatgc9540aaatgaaaat tttaaagttt ggaaagttta tatgcgacac ataaaattat aatttgcata9600tgtggattaa gtgaatggaa tgagtctagc tataactact acctatccct atcataatca9660tgggaacaga tcaggagcaa attgggctta caggcgctca gtgggcacgt agatgtcatc9720aatctcggca gcaacctgct tggcgttagc cttcagcggg gcattacgga cagcttcgag9780gcggcgcaag aagcaggcac cacggaggat ctgcaagttg atttgcacaa catcggggta9840ctcgttggca acggcggggt caaggtaggt acccttgatg aagtcgttga aagatccaat9900cgctgggcca caccaaacct ggtagtccat ggcacggtcc gggatgccag cgtttgccca9960gaagctcgcc aaaccaaggt accagcggaa gcacaaggac atcttaagct tggggtcacg10020ctccgcgcgc tcaatcttct ccgggttctg caacctgttg atgtagaagt ccttggtctc10080ttcccaaact tctgacagag acttcttgaa aatgcgcttc tccacacgtt ccagctctcc10140aggagccatg gactcaaagg agtcatactt gacgaagagc tcatagagct tgttggcacg10200cgaggggaac atagttccct tcttgagcac ctggagcttg acaccttcct caaacatgtc10260agctgctggg gccatgcaga tgtcggagta ggtggcttgt gagagctgct tgcgaacggt10320gtcacaggtt ccagcttgct tactcatctg gtttacggta ccagtgacga tgaaggccgc10380gcccatgttg aaggtggcaa tggcggcctg agggcatcca atgccaccac cagcaccaac10440gcgaacgcga aggtgggcag ggtagccgca ctccttgtgc agacgatcac ggaggttgac10500
aatgagaggg aggatgacgt ggatggggcg gttatcggtg tggccaccgg agtccgcctc10560aacggcaatg tcgtctgcca caggcactgt gcgtgcgaga gcagcctgct cttgggtgat10620ctcgccggac ttcagcagct tctcgaggag attctcgggc gcgggacgga taaacattgc10680ggcaagctct gtgcgagaaa ccttaccgat gacgcggttc ttaataaccg tggagccatc10740agcagcgcga gagagacctg cagcacggta gcgcacgagc tgcggggtca aggtcataaa10800ggcggaggct tcaacgacag tgacgccctt ctcgaggaag aggtcgacgt tacccttctc10860gaggttgctg tcgaagggag agtggatgag gttgacagcg taagggccct tgggcagttc10920agcctggata gcttcgagag ccttgcgtac ggtggcgata ggaagaccac cagcaccgag10980agaaccaagg atgccgcgct ttccggcagc gataaccatc tcagcggatg caatgccctt11040tgccatggcg ccggtgtaca tgggggcgga tacaccatat gtctccatga aggcacggct11100gccaagatcc ttgatatcgc acttgggcac aacaatagat gcttcacttg ggcttgcttc11160aacgagatca ccgttggcgt tgacaccaag catcaaagtg ctgttgagct ccaaaagttt11220ggcacggaga gcctcagagg aagccacaac agctccggag acgctgcggg ccggagcagc11280aggggcaagg gcaggagcag cggaaggttt gttatccttg ttgagaatag ggtcgcgtgt11340ggcgtcttgc tcctgaatgt cgagacgctc catgaacttg ggggcaatag gctgcatctt11400gcgagcctgg ataagagcct cgatcttggg gtccgcagga ggaagcttag ctagcacctg11460gggcggcacg agctgctttt tggggtcata gcgaccattg accacaatct tacgcaagaa11520cttgttctta gtaggcttct tgccagccac catatcgttg taactctgcg tagcctcctc11580aacagtctcg gggtggtaca gaggggagac cttcacgcca ggcacgcggt gggcttggag11640agaggcaacc agcttgacca tggttgtcca agcattctcg ttctggcggt ccatggatcc11700ggtgacaaaa ggcttgctat ttccaagggt ggcgcgaatt gcggcgctac ggtgggcgtt11760gggaccagtc tcaacaaaga cgtcaaagtt cttgtcgcta acggtcttgg cgatcttagg11820aaagtctgcc tgaacagtgt acagctgtgc tgcgtattca ccaaagctgg gtgcgtactc11880gtcgctggct ccagtggact tgttaacaag cttcttctgg ttgacgctcg tgtacaggtc11940aaggccggca acctcgggaa tctcgaggac gctatggatc tcagcgatct gcttgccgta12000cggctcgacc acggggcagt ggccacacat accaaggtcc acgggcaaag cagggaggtt12060gctgctcagg cgagcaatgg cagccttgca atcttcaggc ttgccactga tgagagcact12120gttggcatcg ttgacaatgg tcaagtgcac gtacttattg ttggggccga tggccgcttc12180aacggcctcg cgggttccac gtaccacgta tccttgccag aactcgctga caggggtatc12240ttggggaata ttccaggcct tgcggagggc gtcaaactca acagcgaggg ccttacgcca12300
gacctccgag ttgcggagtt tagttgtcag ctcctcagag acaaggccgt tcttctcaga12360aaaggcaaaa accatggaaa tctctccaag gctcagtccg aaagcagcct tgggctggat12420gccaagcacg tcgcgagcga tgtgggtgaa gcacatggac atgagaatac cgagtcggaa12480catctccacc tggttgcggt tgaactcatc ttcctgcgcc ttaagctcct ccttcgtcga12540ggcgcgcggg atcaaccatc tgtcgccttg atcccaaagc ttgttggtct tggcgtttac12600aaactcgtga agttcgggcc agatgcggtg aatgtcaagg ccgataccat agtaagggct12660tcggccttcg ccgtacataa acgcaacgcg atcgcttgac agtggcttgg gtgcaaagtg12720gctgcccgag ggtgatgtcc agtcgcggcc catcttaaga ctccgcggga tgcccttgga12780ggcgagttca agctccttct ggagcttact aggagaggtc accaggcaca gagcgaaggc12840cggcaacggg gtcttggtct cctgggcaat gctctcgccg agcaactcca taaaagcaag12900acgtacatta gcgctaggct gggcgaggcg ctcgcggagc ttgtcaacac gctgcgtgat12960agcgtcatgg gagtctccgc ggattacgag gagtttgacg gcatcgtcat cgagcgaaat13020gcggctcttg gtctcgtggt ggccctccac atcagagagc agcaccgtgt agcatgaacg13080ggtctcggaa acacctgaga cagctgcgtg gcggcgagct ccagggttct tcaaccaggc13140ccgcgaggac tggcacgcgt acagagactt gccccactgt gtctcaggtg caggctcctc13200ccaggaggcg ccgtttgagg gcaagtagcg gttgtacaga cagagagccg tcttgatgag13260actggcagct cctgaggcgt agccggtgtc accgacagtg gacttgacgc tgctgacagc13320gacgttgtgg ggctccacag cttcgttgct agagcgctgg ctgagaatgg cctcaatgcc13380gcggatttcc tcctcagcag tgagttcctt aggcagaacg gaggggttct tgaggtggcg13440ggcagagtca gcggagagct cgagcatctc aacgtccttg gggttgacgc gagcctgggc13500gagagcctcc tccatgcagg ctgccggcat gttgccgggc acgatagcgt ccatgcaggc13560gtaaatgcgt tcgtccttgg tgcagtcgct ctcgcgcttg aggacgaggg caccacatcc13620ctcaccaaca aagtagccgt cagcgccgga gtcgaagctg gcccgcgggc tctcctgctc13680cgagaccttg aaacgacgcg acttcacgta gagattctca gcgctggcgc aaagatccac13740accggcgatc actacggcct cgacctcgcc agtctcgagc aagtacttgc ccaactctgc13800gcaacggtag acggagttgt tgccctctgt gatggtgaaa gaaggaccct cgaaacccca13860ttgtgaagac acgcgggtgg ccacgaggtt gccgatgtag gatgtgtacg aggtagcggt13920accgcaatcg ttgatgtagg acatcatatc attgagggct gaagcggctt cgggacgagc13980acgctccttg agggcaacgc gggcgcggtg acggtagagc tcaaggtcag tgccaaggcc14040gacgaagaca gcgaccttac ctcccttctt gaggccagag ttgagaatgg cacggtcgat14100ggttgtgaca gcaagtagct gcatggggcg caacatgtcg tctggcgtca tgggcgtgcg14160
caggcggcta aagtccacct cgacgtcctc aatgtagcat ccgtggggca cctccttgac14220accgcacagg tccaaaaagt ccttgtcttt accaaggaaa cgccagcgct tctcaggcaa14280tggcacagca ccatgttggc cattgtagat ggcacgctca aaggcgtcca ggcccttgag14340ggagccgaag gtggcatcca taccggtaat agcaatgcgc atgttgccct ccccgccaca14400acgtgagctg agggaactga tgctatcgtg ggtggcacag gcagccttgg agcggtcaaa14460ctcctcaaag actgcgtggg cgttggtgcc accaaagccg aaagcggaga gaccagcgcg14520cttgggctcg ccctcagtgt cgggccatgg gatgggctca gagaccacaa gcgggtccat14580ttgggaagat ccatcgacac caggagtggg cgggatcaca ccatgcttca tggcaaggag14640taccttgcac atgcctgcga aaccagctgc aacgagtgtg tggccaaagt tacccttgga14700gcttccaaag cgaggcacct tgccctcgaa gcaagccttg acggcatcaa tctcaacgcg14760gtctccctgg ggagtacccg ttgcgtggca ctcgacgtac tggatcttgt gcgggtgcac14820gttgacgcgc ttgtaggtat caatgaggca ggacttctcg ctgggcaagt gcggcttgag14880gggaagacca cagccagcat tgctgatggt agcaccgagc agagtaccgt aaatgtggtc14940tccatcgcga atagcgtcgt caaggcgctt gagaaccata atggcaccac cttcaccagg15000ggtgagaccc tgactgtcct tgtgaagcgg gtacgagatg ccgtctcccg atacaggcat15060ggcctggaaa gtggagaatc cggagagaat gaaaaagggc tccgggaagc aagttgcacc15120agcgagcatg acatcagcag caccggaaac gaggtggtcc tgggcgaggc gaaggacgta15180aagggcggtg gcacaggcag catcgacaga gtagtgaaga ggaccgaggt tgagctcttc15240tgctacgaag gatgccgggt ccataaagat gcggcggtca ccagcctcgg ggttctgcga15300ctgctcacgc tcggaccact tggaggcatc cttgaagacg cgagcgccga gtttcttttc15360gacgtggttt tggtacacat tgaggagttc gccctggagg ttgtccatgg gaaaggacag15420gcatccgctc acaataccgc accttgtaga gtcggagacc gatgtctcgg agagagcctt15480cttggagagc ttaaggagaa gctcgtgttc gttatcgacg gagtcatcga cgcagccgta15540gttctcgttg caaaaggtat ctgcaaattt gctacgctct gctttgaagt gctcggctcg15600cttgttggat ccgaggcgtt tatcgctaat cttagtccat gcagcctcac cgcccatgac15660tactttccag aactcttcct tgtctttgca gcccgcgtat tgcacggcca tgcccaccac15720ggcaatgcgc ttctcgtcgt gcatttcgtg agcagcgctc acattcttgc gagaggccat15780ctttttgctt tcttgttgct gcttactgta aacaaaaaaa agagcttgcg tgtcacctga15840ccggcacttt tagatcgatc aaaaagcggt cgtgtagatg gtttgctttg gaggagatgt15900ataaatgatg tgattgacta ccttgagcaa gtgattacag ggatgccaga gcaatcaaat15960aatcaatcag ttaatcaacg ccgtaataaa ggctatcaat caatcaatca atcaatcagc16020
caactagcta gccgaagctg cgatggactg gcgtttggac agcgcgaagc tgtaggaact16080ggcgccgcac gagctgcgag gctgccaagc tagaggctgt ctgcctttgt ctcactcctt16140ttccgaggaa ggagagagag agagagagag agagagagag tggggggatg aaagtttgga16200tgcacgatgc gtgctttgtg gtttgtttcc ttgtttcttt ctttgcttgt tttttctctc16260tttttctttg ttattttgtc tctcttgaag caaatagaaa gaacctcgaa ctagacgctc16320caaagggtct tcaagaggtc tcgaaggcta ggctggcgaa agcgcgcacg ctggtcaagc16380aagcaagcaa agcaagcagg caagcaagca agcaagcaag caagcaaagc aagcaagggg16440tggattccac gaatgcgaga agtcaaaact ctgcttcaaa cagagaacaa atgggcaaac16500gaatgaggat aaatgagcaa ctaagtgaag tttacatttt caaaactcaa caaaacgatt16560acccaatcaa ctatgagacg cgcagacgtc tgcggcagca tctcttttat gattttcaaa16620aacaaaaaca aaaaccaaaa caaaataatt tgcaacaaat taatgaaaag cgaaacaaca16680aacagaaaca ttgtttaaac taaaaagtca tttttattga aaatctgttc ttttcatctg16740tacgtatgta tgtttgtatg tacacacttt gcttcatcgg tttattcgag tgctcttcat16800tcttgaaatt gccttagttc ttgctgttat aactgtcaaa caaacctcgc gaccttgaca16860agcagctcca cctcaccttc gggcctgctc gtttgccttt ctcgcttttt tcgcgatctt16920ctgccatcct tgcctactct gtccttatct catcaggctg ctgcggcctc ttgacctagc16980agttcaagta taattaattt gaaaataaac aaaaaaacac tgccacttat tatgcagatg17040gcactctctc agtgttgcaa aagtagagtg aaattctggt ttacaaaaaa tatttattta17100ataaacaaat aaaataaata taaattcatg ttatgttaga tcattttatt ttgttttctg17160agggcgcgat aaacgcttac ttgagaacca agaaaagcaa gaaaagcaaa ggtgcgaaag17220aagcaaacac attgatttcc ctagttccca ccacttcttt ctttctttgt ttgtatattt17280gtttgtttct ttctttcctg ctttgttttg tttgttttgt ttgttttgtt tgtttgtctg17340tttgtctgtt tatctgtttg ttagtttgtt agttactaga ctgctaattg atttgaaaac17400caagccaaac ccacgcaatg aatacgcaga aagcacagct aaaaagaaga agaagaggag17460gaattccgaa tcaggcgaga aagtctcgaa agcagtgcac caaaatcctc atttggaatc17520aaagccctcc ttcccagcga ctacggaggc ccacgacgac gacgacgccg acgacgccgc17580ccgcccgccc atcctcctct ctctccgcct gctcctcgtc ttctccctcc ctccctccct17640ccctcgcgca cgccgctccg aatggaatga catgactgac gcaagcgcgc aatggccgcc17700gtgcgatggc tcgaagcagc atcgcatcgc attgcattgg cattattcat tgattcattc17760attgattcat tcattaattt attcatttta attcattcat tcattcaatc attcatttat17820tcattcattc attaatttat tcattttaat tcattcatta atttattcat taatttattc17880
atttttattc attcatactc ccgagcgcta cccggcgcta ggtgggtgct aggcgtggat17940ggagcggacc tctctgccag cagaaagagg aatgaatcta tctggatact gcgcgcagct18000tcttgcttgc tttgcttcaa cttgcttgca aacagccagg aggccgaacg gcttcgaccg18060ctcagcgtgt tcgccagcaa agaaccacct ccgccctcgc agtcgccgga tggatgaacg18120agcgaatgcg aatcctcctc cgatcttgaa cctcgaacct tcaatcaact tgccttaatt18180ttactttcat gactctcact attttaaata tacatgtatg tatgtatgta tgtatgtatg18240tatgtatgaa tgcacctcat actgataggg acctgcgggg gactgatacc acctgtctga18300atcaatttgc gagaccgcga gactgagtgg caggtagtag ctagctaagt agctgcctaa18360gagtctatcg gcatgcatga atcaaaaact atcatgtcaa tgttcctttg aggcttcgaa18420gtccgtcatt tgtcacgaaa ggttttgggt gaacgatcca ctgtttcgag agagatggtg18480tgaatgtata ggtgatagtt gccgagctgg cgagccgtcc caagcggtgc cggcactcac18540ccggctgaag cttcttacat gctctccgtt cataatcgtc caaattgatc ctgattcatg18600attcatgatt catgattcat gatgacacga gttggagttg gacgataagt cagcgctcgc18660tcaaccaaac tacctctgct cgcctagctg ctgttaggta gtgctactga ggcaggaccc18720aacttgaagc tacctactgc ctaggtattc ctacgctgtt tcgctgattt gcaatctctt18780cgttaccaag agataaaatt aacgagttat gacattgcgt atgcagacta cataataaag18840attgtgtcat ttatttataa gtggaaaggt gtaagatcaa gaactaagca ctaggtagca18900attaggcgtt atttgttagc gcgtggaaga aaatgcctct ggacagatag ctattaatag18960ctattaatag ccggtgttgt atttacaacc ttctgaaaga atttctccat agaggaaagt19020aaagaaacat cttattctgt gaaaagagat aaacaacttt ctagaaaatg gatgacagag19080caaagaaggt cgatcgtctt caaccgcaga tctgggaatg ctaaggttgg cgccaggctt19140acattatgcg tcatgctgac caaagggcgt aaagtgccga tgggcatccg atatatgcgc19200gttcaaggtg aggaattcaa gatcatcaag tttgtttgaa tttcgaggtt gaaaacacag19260agttttgaca atcgatcaat caatcaatca atcaatcaat caatttaaaa ccaatttaaa19320accaaatgaa tgagtgaatg agtgaatgac tgaatcaatt taaactaaat gaatgaatga19380atttaaaacc aaatgaatga gtccttagcg atttcaagtt ctgcagtgaa atctacaaat19440ctacgacgaa agtagtgaga tcgtatcaac gtgtatagac agacaatgat gctgcggata19500cctaagtgct tgcgtggagg gactacgatg cagatcccga gttttaggtc ctagttcctc19560cgttctctgg taaaaaagaa agcctctcct tcttgacgcc attcagcgac gtggaacaag19620cgagacagag gcacaagttt tggagtcatt gagtcgggtc tgctctgctt tgaggatgaa19680ccaacgacct tcggagtctt gcagatagat ggtccattct tcaaacgaca cagagatcgt19740
cgtctcgcgt aagttggcag tgggtctaga gctagctaaa aacatctgac agagagcaca19800tacagagcta aagaggagtg tactcggcaa aatagcgtgg acggatgaca tcatcaatcg19860ctcagctttt tcgtttctta ccaaaaaatt gacaaaccag agaaataaat agattgactc19920aacaaattaa attaaaacaa taaattaaaa aagatctctt aaagaagttt tctgaaagaa19980accaaaaaca ataaactctg cgacaagaac ttgaggccag aagggatgaa gaaggtacgt20040atctagatgg tgactgggga cacaaagaag caaggtctga attctcagaa gccagctgca20100gccagccagc tactaggagt gtctgccagc tccgtcgtca tgccacgagt gtccctgcca20160acgcttcaag cgtacttgca acttttattt gattactaca ttactacatt ataacttcat20220ctatagcttt aaaaaggaaa taaaggaaat aataaaataa atcaaataat ggtaaaaagt20280tataaataat caacgactaa aaaggaattt tattcgaagg tcctcggcag gaaataagtg20340gaatcaaaga gaaggcggga acggtaggga ccatacatga tagtcccaaa ctgaggaact20400acgaattgcg gggctaagca aattcatagg atcccagtta gggacagacc ctcgaggtcc20460gagttggtat cctgggccaa agcttgcgca agggtgctct agagctacaa ctcaatacca20520gtagttgcat ggccatctct gatagctttc ttcatgaata tggggtgagc ttagagacaa20580gcagtagaca ctctgtgacc tacgagctat atttgctgtc gcagagcatc tcctcaaaat20640aattcatcga agaaagacgg attgaaagtt ttgccttatt tgaacaaagt taatatttta20700actctcggta gttaaaccat gatagctcat ttatagcgta ggctgacaca gaagcgtagg20760ggcttagacg tcatgatgat tcgtgatgaa ataaatcaag gattctcgaa cgttgacacg20820cgcaatggag cgtgccaatg tcaaaagggt attgctgtat catcaacgta ggtaggtagt20880caaacgggct acagctctgt cctattcact cactaagaca aaatgttttc tctcaaacgg20940ccagctcgaa agtaatattg ggagcaagaa tgaaaatcat tctccagtac acttgcagtg21000agatcaagtt tcaagaccat caaacgatac gatacaggag gtactatctt tgctgaagtc21060agtagcagca gcattacgag cctggtagat ataaattgat aaaaagacaa gaggtatatc21120atatttcaga gtagagtaca tactgagctg gaaacataaa actagtgcac gcaatcgacg21180gttcaacttt tctcaagacg cttccagtcg tttcttaatt agctcagatg gtagcaaaag21240tgatatgcgc atcagacttt cgtaaacgta aaactcggca tctgtagatg ttgagtcatt21300gttttcttca ataatttact tctcgcagca gtgcacttgg aaaggtttgt caagtttgac21360ccagctaatg aaacacaaca tcatcaggcg gggctcgaaa agtagatctg aaagtctata21420aagaatgaaa gttactctca acacagaaag caatttgtgc aaacataaga gagaatggcg21480tctatgctgc aagagaaaat tcgacggtcg catcatagtc gtctacactg ctgtgcatgg21540gcaatttata atatcatgtc tgatcacggt ttctgagaac atttaaacga aataagtcaa21600
aacgaatgcg ctctgtcgcg attatagttt tgttctgaca gtaactccta accaaagggc21660caaataagga cgagagaata aaatagattg ctctctcact tcggacccag gaatcccgaa21720tttatataat ttcaatgtac tcacgtaaca ctgacaagct atgcggcgtc aataactcat21780ccacgttggg agaatctcga aacaacgcaa cgagttattt tatcctgatt aataatctag21840cttgaaccgt ttgttgtaac tagaacccaa gctgcaaaga gctacaacca aggtttgatt21900tcgttccaag ctaacatgaa actctcaaac ttcgtcgatt tttttaatgt ttgtcaaaaa21960cctagtacag cggtcctagg taccgatttg agaagcaggc aacccgctta taaataaaag22020aaaaagagtc tttattattt tataaataga aaaaacttta attgggacaa tattctttat22080gtgttctctg tcttcttcct tcatgtatga cgtaatgatc atgctccttt catctccttc22140cttccaaaaa gttcattttt cctactaggt ctttttcaaa attaaaaata taattaagta22200agaaagaaag aaggaaagaa agaaaaacct gggtactaat cagtgtgata tgaggtgaat22260ggtggttttg ttttacttct cggaagtgtc gagtcctata aggagcacta tacctatcct22320agacgctttt ggtaccaagc cctgcgcggc aggcatacgt cagcaagcta cgatagcagt22380acacgctact cagaaaggcc tagtgaggta ggcgagcagg aagtagtgct cttgcgtcat22440gcttatgatg gcatcagcca cgcgagaacc tcattcgaat agtccttttg caattcattc22500acgcatgcat gcattgatgc ctgctacaga gtagctagtg agagagtatg atacttagtt22560agtgctactt atgcgttgtc acctatgcaa tagcattgga tagaaggaat cagattcacc22620gctgactctc gctgagagta agggccatac gcagtgctcc tgagttgttt cattaaacgg22680acttcaagct gagttctggc taggcacctg gtagctgggg ctagagggta cctacctacc22740tacctactga tagctaactt tcaaatgagg aaagattgga gattgaatag aaagaaagtg22800atacatactg tcagccgtat cgaaactccg aagtggcacg cggatggcgt cagcaaactg22860ccgtagcaag tgaataacgc acatctcaat tgggacgtcc atgaaaacaa aaaacaaaaa22920agcaaaaaaa agttgcaatc gatcatgaat cgtgctgatt catgggttgc ttgcttagtt22980gttatgctgg agggtgtcga gacttggatc tggtgagcag tgcgctctcc actcaagttg23040gaccctttgg tatcagggga gtgcgagtgg gcacactacc atagtatcct aaattacctc23100tacgttttga ttgcctttga tcacagcaga taattttcaa tttaaataaa aatcataaaa23160agaagaagaa gaagaagaaa gaaagtgaag gtggcgtttc tgatgtcatc attttcgcag23220tgcttcccag cgaagattta ctgtgaacta ctacgcatgt gagtatggca agcactgggt23280aagtaggtac ctaccactac catgttgtaa aacaaaacaa ggaatatgtt agctagaaca23340gagcgaatcc ggtgtgagtg ggagtcatca tcagatattg aaagttgtcc tctcaattaa23400tataaatatt tctaactaaa gcaattaaac atatatttat taatttaatt ataaattaaa23460
taaatatgct gggtgggtcc gagtcattct gactatcatc tatgatgttt aataataaaa23520tattgaaagc agtcaaggtt atttggaatt atgggatgat cgtgatctgt gtatcattct23580gcatcattgt ggatgctggc ctacgaaact acgacggcat tgcaattgcc acctggcggt23640gcgatcgcgt gcactcctgc aattgcgagt gtcttccgcc ggcttcaagt tgaggtgctg23700cgacagtgcg ggcccagagc tcctaacatt tcgtggatga ccgactgact cagacagagg23760tctctcaagc ttagaaagtg cgctgcaaaa aagggcgcta gctagataag atacgagtga23820gtgagtgagt gagtgagtga gtgagtgagg ttctagctag tgctcctccc aaatcttgga23880gtgccgatgc tcgagaatac atacatactt caagacacga agaacttgaa cccgaagacg23940aatgccgtct tcgacgtcat ctttgccgtc gtcatggccc actgcagcaa cgatccagtg24000cgtgcgagca gcagggccag cccacgatca cgcagctcgt cgggctggac ttggctcaat24060gaatgaatga atcaatcaat gaaagaatga ctcaatgaat caatgaatca gcaagttgcc24120accaaagccc atcgcaacga cgggtcctgc ctgcgtgcgc cattcttagg atccagagca24180agcaagatct tcttcaccta tcgctcagca agcgagaacg caacctccct ctgcatcatg24240atgcaggata agtaagataa atccatcttg gacctcgagc tcaaatcgac gcttgctgca24300tctatctatc tttgtatcta tctatgtatc tatctttgta tctatgtgtc tatctatctc24360tctgcgtgcc tcgtcgtgtt tttgaaaagg agtttcgatc gtggcccaat cggaagagaa24420ggctctctct ccctctctct ctctctctct ctctctgcat cgcacagacc aatgagcctt24480gcggcaacac agcttcaact tcattgcagg atccaatcca tccaaggcat cgcttgggct24540ctcagtgaat gaattcgacc aaagctcgtt ggcaggcaga caaggcctgg acaacataaa24600gcaagggggc acgaaggcaa gatggcaagg aggcagagca ggcaccagcg actgcgatgc24660tggcgagaga agatcaaggc aaagcagagg ctgcaagcaa gctctgcagt agccacctcc24720tcagcagatt cgtcaagatc gggcaaactt cgtctgtggc tgccacgcca gagcagagca24780tgcctgcttc atgatccatg ctcaagaaag aaagacagac aagacagaca agacagatag24840atggatgaca gcgaacttac atttgcagac ttcgaaggtg cctgacgggt attggtgcca24900ctaagacgag aaggagcact tgcttccaga tcgctcacgc cgctcacatc accatgctac24960gtcttcaata cgcctggtcc ggttcgcaag agccgcgcgc cggcgattgg gcgaaaggcg25020gaggagtcga ggtacgcgtt atcagcagaa tgtaggaaca ccgcgacgcg gccgacgacg25080ctggtgagga ggaagaaaga cctggcgcct gtacgtacgt acctacgttc tagcagtagc25140ttgaagtgga ctgtgggtcc cctccatctt cttcaagacc ttcaagttgc ttgctgacgg25200catcgctgtt tgtttgtggc tgttaggtag gtaggtagct agctagctat agctgtgtcc25260tagctgcaca gggagcactc agcctctttc ctagtttctt tggttctgtg cttgtttttc25320
tagcgagtcg tgcaaataac ctgcggcggc cacgagaagt ccgcgttgag gcgatcttgc25380gccagtgcgg cagttgccat cactcgtgca gacagagttg agttgcttct caatcgttac25440caatcgctcc aagcaggcct agacatagat tttccttctc tggaccatct actaaaatga25500tcaagttaga taggtagata gatagataga tagatagcta gggagatact aggcaccttc25560tatgccggca cgtctcgaac aaagcgaaga aagagctgtg ggcaagagca ctcattttga25620tcgtagatga tcgtagacgc gctgtagagg agagctctta gtggcggcta ctgtgatgga25680ctatgagagg ggacttcgca agacctgtct cggtcgcacg tagctgtggg aagcgagaac25740ccgcagagga ctgattctga ttagtgcgga taacttggtc gaggaagagc ggggacccgc25800agggaacccg catagcagcg acgttggcac ccgacgacgc tagggcaaag acgcagcatg25860cgtgcgaggt gcctataagc tgcgcaattc agagaattaa gacagcagcg ctgggaagga25920aggaggagat ttgaaggctc ggcgggagct gtcgagatgg aggcaggcag gcaagcaagc25980aagcaagcga aagaggcggc cagggctcgc gtcgaagccg ctgatggacg agagaatcgc26040acgaagaaga atacggagtg tttgttttca aagccaaaga aagccaaagc caaagccaat26100tcgttcgttc gtgagttaac ttattattta atttaattga catcttcatt tactactgtt26160gttatctatt atttatttat ttatttattt atttatttat ttatttattt atttatttat26220ttatttattt atttatttat ttatttattt atttatttat tgtttatatt tttttaaatt26280aaaaaaattc aaaattcaaa attcaaaatt cacgaataaa ttgcacttga aggagatgaa26340gcaaagcttt gtttcttcta aaaagagtat aaataataca aagtgatgac ggaaagaagc26400atcattctga tggtaagcac ttcggcaaga tgcacgcact agcacttgtc gccttgcttg26460cgatccgcgg aggtaatagt ggaggcgaaa gaaggagttc attcctgtta tttcgcgctg26520gggttacagc agtgccaaga tttcgaatat ttgaattttt gaatttttga atttttggat26580cttcgttccc cttcttcctg aactgttcaa acgactcgga ggttgtcgat cggatcactc26640aatctctcaa tctctcactc actcactcac tcactttttc tcagctgcct gatccttcgc26700aatgctcgcg aagcgcgagg gatatgcgtg ggcgagcacg caccatcttc tctccacgcg26760taaagaagag cagagccaga ggcaggtagg tatctccacc catctcaggc tgtgacttct26820ttgtttcttt ctttctttgc ttgttttctg ttctctctct gtgctctgtc cacacgagaa26880agagaaagag agagagaaag aaccacgggt ttatagagcg cactcgtcct tcctgcttca26940gcagaaagca ctgcgtagga gaactacggg ggaggaggaa gcacgcacgg aggaggcgtg27000gaaggaagga ggagacagag agagagagac actgagggac agagggggag aggcagaggg27060agaggcatct gatgtttgcg agaaaccaat aagttttgaa agtgatttga tttagctgat27120tgactgatct atggcctgaa agaaagcttt taaagcggag ggagatagat gacgagggca27180
gctgcgatgg cgtacggcgc atccgtctct ctctgtgtct ctctctcttt ctctctcgtc27240agggcgtgga gacctcggaa gctgcacgcg gcgcggtgag gaggcagggc agcagaggga27300gaggagagat cccagagtcg aagagcattg attgattgca gatgatcttg ggcaacgcgc27360gtcagcttga gcgaggaatg ctttggactt caggttcttc gcttctgtgt ttcattcttt27420ctcgaagaaa gaaagaatga aagaaagaga gaaagaaaga aagaaagaaa gaaagaaaga27480aagaaagaaa gaatgaatga atgaaagaaa gagagaaaga aagaacgaat gaaagaaaga27540gagaaagaat caaagagaaa gcgcattcgc agttcttctt cgtgaaagaa aaggaaaaga27600gaggcgatgg taggctctga tctcatcatt tctggtttct ctgttgtacc tgtactctgt27660gcttgtggcc ttgcgaaggc tgaagacgcc atgcagacaa ccacgcctcc gcagagactt27720tgcgggaaag cagagggctt ctcgccactc tcgaagaaac gagctcgcca gttttcgggg27780ttgttctcag aattgcgagt gttggcttta tatgggatga tggtatggca cttcgtcatc27840gttactctcg ctcgcttgct tacgaagatt ttcaaaaggg cgaaagaagt gctcagcttt27900taaaataaag tcacaccaaa gactaggccg catagcagaa agctaaagta aacccaatct27960gtctgaagag agtgtcgtgg ttagatactt acgcaagagt ttaaaagctg taaatagtac28020aggaacaaaa acaaataaat atatatatat tcttttttat tagtaaaaca tgaaaccaaa28080aaactccttt aaaataaaat aaaataaaat aaaataaaat aaaataaaat aaatttacta28140ctatatatac atatatatat acaataaata aaaacaactt tttcagacca gaaaaagact28200gagaaaaaag gaaactaatg actctcgagc accgagagcg atataagagt ggattatatt28260tgctaggccc accacgagtg agtcccctag gaggaagcgc cctctgagac aggagcagag28320gcgtcgctgg tgctccaaaa agcgacggcg aatggaaagc aaaacccttt cgagggaggc28380ttgtggccgt gactattcaa atctccagca tctcagctcc agcacagcag aagctacctc28440gcttctcagc tctagctatc acatcgatcg cagcatctag ctcgtagaca gctagcgccg28500caccttcccc caaatcaact tgggcaactt aactcttttt tcaccagaac tcctcttttc28560ctttaatctt cgaaaagaag acgaataaaa gagataatcc tctgccgcag cacattctaa28620aagaaaagcg gcatactggc gtaggcaaga ctttcaagct cttcctcgcc tccaccccgt28680atttccctgt tcatctttgt gaaacgagga aacaagaaat tttataggac aagatggctc28740aacgtgagaa ccgtctcgag gccaacatgg atacccgcat cgctgtgatc ggcatgtccg28800ccatcctccc ctgcggtacc accgttcgtg agtcttggga ggctatccgc gatggtatcg28860actgcctcag tgatctcccc gaggaccgcg tcgatgtgac cgcctacttc gacccggtca28920agaccaccaa ggataagatc tactgcaaac gtggtggatt catccctgag tacgacttcg28980acgcccgtga gttcggcctc aacatgtttc agatggagga ctccgacgca aaccaaaccg29040
tcaccctcct caaggtcaag gaggccctcg aggacgctgg catcgaagcc ctcagcaagg29100aaaagaagaa cattggatgt gttctcggta tcggtggtgg ccagaagtcc agccacgagt29160tctactcccg cttaaactat gttgtcgttg agaaggtcct tcgcaagatg ggcatgcctg29220aggaggatgt tcaagctgct gttgagaagt acaaggccaa cttccctgag tggcgccttg29280actccttccc cggtttcctc ggcaacgtta ctgccggtcg ctgtaccaac accttcaacc29340tcgatggtat gaactgtgtc gtcgatgctg cctgtgctag ttctctcatc gccgttaagg29400ttgccattga tgagcttctc cacggagact gtgacatgat gatcactggt gctacctgca29460cggataactc catcggtatg tacatggcct tctccaagac cccggtgttc tctaccgacc29520ctagcgtccg cgcatacgat gagaagacca agggtatgct tattggcgaa ggctctgcca29580tgcttgtgct taaacgttac gccgacgctg ttcgtgatgg tgacgagatt cacgctgtca29640ttcgcggctg cgcctcttcc tctgacggta aggcctccgg tatttacacc ccgaccatct29700ctggtcaaga ggaggctctt cgccgtgcct acatgcgcgc taacgtcgat cccgccaccg29760tcactcttgt tgagggccac ggtaccggta cccccgttgg tgaccgtatt gagctcaccg29820ctctccgtaa cctcttcgac agtgcctacg gcaacgagaa ggagaaggtc gctgttggca29880gcattaagtc caacatcggt cacctcaagg ctgtcgccgg tcttgccggt atgatcaagg29940tcatcatggc cctcaagcat aagactcttc cggccaccat caacgttgat gagcccccta30000agctttacga caacactccc atcaccgact catcgctgta cattaacacg atgaaccgtc30060cgtggttccc tgctccgggt gtgccccgtc gcgctggtat ctccagtttc ggttttggtg30120gtgccaacta ccacgccgtt cttgaggaag ccgagcccga gcaccagaag gcttaccgtc30180tcaacaaacg cccccagccg gtgcttctga tggcatcttc aacccaggct cttgcttccc30240tctgtgaagc ccagcttaag gaattcgaga aggctatcga ggagaacaag accgtcaaga30300acactgctta catcaagtgc gtcgacttct gtgagaagtt caagttccct ggatctatcc30360cgagctctaa cgctcgcctc ggttttcttg tcaaggaggc cgatgatgcc accgagaccc30420tccgtgccat cgttgcccag ttccaaaagt cagctggcaa ggattcttgg caccttcccc30480gccagggtgt gagctttcgt gctcagggca tcaacaccac tggtggtgtc gctgccctct30540tctctggcca gggtgctcag tacacccaca tgttcagcga ggtcgccatg aactggcctc30600agttccgtga gagcatctct gacatggatc gtgcccaggc taaggttgct ggcgctgaca30660aggactacga gcgtgtctcc caagtcctct acccgcgtaa gccttataac tctgagcccg30720agcaggacca caagaagatc tccctgacct catactctca gccctctacc ctcgcctgcg30780ctcttggtgc ctacgagatc ttcaagcagg ctggtttcaa gcccgacttc gctgccggtc30840actctctcgg tgagtttgcg gccctctacg ctgctgactg cgtcaaccgt gacgacctct30900
ttgagctcgt gtgccgtcgt gcccgcatca tgggtggcaa ggatgcacct gctaccccca30960agggatgcat ggctgctgtc attggaccca atgccgagaa gatccagatt cgcactgctg31020atgtctggct cggcaactgc aactcccctt cgcagactgt catcaccggc tctgttgagg31080gtatcaagaa ggagtccgag cttctccaga gtgagggctt ccgtgttgtc cccctcgcct31140gcgagagtgc cttccactca ccgcagatgc aaaacgcctc ctctgccttc aaggatgttc31200tctccaaggt tgccttccgt cagcctagcg cccagaccaa gctcttcagc aacgtgtctg31260gcgagaccta ctccaacaat gcccaggacc tccttaagga gcacatgacc agcagtgtta31320agttcatctc tcaggttcgc aacatgcact ctgctggtgc tcgcatcttt gtcgagtttg31380gccccaagca ggtgctctct aagcttgttt ccgagaccct caaggacgat ccttccatta31440tcactatctc tgtcaaccct tcctctggca aggatgccga tattcagctt cgcgaggctg31500ctgtgcagct cgttgttgct ggagtcaacc ttcagggctt cgacaagtgg gacgcacctg31560acgccacccg ccttcagccg attaagaaga agaagactac tcttcgtctc tcggctgcca31620cttacgtgtc tgacaagacc aagaaggctc gcgaggctgc catgaacgac ggccgcatgc31680tcagctgtgt cagcaaggtc atcgcccccc ctgacgccaa gcccattgtg gacaccaagg31740ctcaggagga ggttgctcgt ctccagaagc agcttcagga tgcccaggcc cagatccaga31800aggccaaggc cgatgctgct gaggctgaca agaagcttgc cgctgctaag gatgaggcca31860agcgtgccgc cgcttctgca cctgtgcaga agcaggttga caccaccatt gttgataagc31920accgtgctat cctcaagtct atgcttgctg agcttgactg ctactccact cctggtgctg31980tgtccagctc tttccaggca cctgttgctg ctacccctgc tccggtcgct gcgcctgttg32040cagctgctcc tgctccggct gtcaacaatg ctctccttgc caaggctgag tctgttgtca32100tggaggttct tgccgccaag actggttacg agactgacat gatcgagccc gacatggagc32160tcgagactga gctcggcatt gactctatca agcgtgtcga gattctctct gaggtccagg32220cccagctcaa cgtcgaggcc aaggatgttg atgctcttag ccgcacccgc accgtcggtg32280aggttgtcaa cgccatgaag gctgagatcg ctggcagctc tggtgctgcc gctgctgccc32340cggccccggt tgctgctgct cccgctgccc ctgcccctgc tgtcaacagc gctcttcttg32400ccaaggctga gactgttgtc atggaggttc ttgccgccaa gactggttac gagactgaca32460tgattgagcc cgacatggag ctcgagactg agctcggcat tgactccatc aagcgtgtcg32520agattctctc tgaggttcag gcccagctca acgttgaggc caaggatgtt gatgctctta32580gccgcacccg caccgttggt gaggttgtca acgccatgaa ggctgagatc gctggcagct32640ctggtgctgc cgctgctgcc ccggcccctg ttgctgctgc tccggcgccc gtcgctgccg32700ctgcccctgc tgtcagcagc gctctccttg agaaggctga gtctgttgtc atggaggttc32760
ttgccgccaa gactggttac gagactgaca tgattgaggc cgacatggag ctcgagactg32820agctcggcat tgactccatc aagcgtgtcg agattctctc tgaggtccag gcccagctca32880acgtcgaggc caaggatgtc gatgctctta gccgcacccg caccgttggt gaggttgtca32940acgccatgaa ggctgagatc gctggcagct ctggtgctgc tgccccggcc ccggtcgctg33000cggcccctgc tccggtcgct gccgctgccc ctgctgtcaa cagcgctctt cttgagaagg33060ctgagactgt tgtcatggag gttcttgccg ccaagactgg ttacgagact gacatgatcg33120agcccgacat ggagctcgag actgagctcg gcattgactc tatcaagcgt gtcgagattc33180tctctgaggt ccaggcccag ctcaacgttg aggccaagga tgttgatgct cttagccgca33240cccgcaccgt tggtgaggtt gtcaacgcca tgaaggctga gatcgctggc agctctggtg33300ctgccgctgc tgccccggcc ccggttgctg ctgctcccgc tcccgtcgct gcccctgctg33360tcagcagcgc tctccttgag aaggctgagt ctgtcgtcat ggaggttctt gccgccaaga33420ctggttacga gactgacatg attgaggccg acatggagct cgagactgag ctcggcattg33480actccatcaa gcgtgtcgag attctctctg aggtccaggc ccagctcaac gttgaggcca33540aggatgtcga tgctcttagc cgcacccgca ccgttggtga ggttgtcaac gccatgaagg33600ctgagatcgc tggcagctct ggtgctgccg ctgctgcccc ggcccctgtt gctgcctctc33660ccgctcccgt cgctgccgct gcccctgctg tcagcagcgc tctccttgag aaggccgaat33720ctgttgtcat ggaggttctc gccgccaaga ctggttacga gactgacatg attgaggctg33780acatggagct cgagactgag ctcggcattg actctatcaa gcgtgtcgag attctctctg33840aggtccaggc tatgcttaac gttgaggcca aggatgttga tgctcttagc cgcacccgca33900ccgttggtga ggttgtcaac gccatgaagg ctgagatcgc tggcagctct ggtgccgccg33960ctgctgcccc ggccccggtt gctgctgctc cggcgcccgt cactgccgct gcccctgctg34020tcagcagcgc tctccttgag aaggccgaat ctgttgtcat ggaggttctc gccgccaaga34080ctggttacga gactgacatg attgaggccg acatggagct cgagactgag cttggcattg34140actccatcaa gcgtgtcgag attctctctg aggtccaggc tatgcttaac gtcgaggcca34200aggatgttga tgctcttagc cgcacccgca ccgttggtga ggttgtcaac gccatgaagg34260ctgagattgc tagcagctct ggtgctgctg cccctgctcc ggctgctgcc gttgcaccgg34320cccctgctgc tgcccctgct gtcagcagcg ctctccttga gaaggccgaa tctgttgtca34380tggaggttct cgccgccaag actggttacg agactgacat gattgaggcc gacatggagc34440tcgagactga gctcggcatt gactctatca agcgtgtcga gattctctct gaggtccagg34500ctatgcttaa cgttgaggcc aaggatgttg atgctcttag ccgcacccgc accgttggtg34560aggttgtcaa cgccatgaag gctgagattg ctagcagctc tggtgctgct gcccctgctc34620
ctgctgctgc cgctgcaccg gcccctgctg ctgcccctgc tgtcagcagc gctcttcttg34680agaaggctga gtctgttgtc atggaggttc tcgccgccaa gactggttac gagactgaca34740tgattgaggc cgacatggag ctcgagactg agcttggcat tgactccatc aagcgtgtcg34800agattctctc tgaggtccag gctatgctta acgttgaggc caaggatgtt gatgctctta34860gccgcacccg caccgttggt gaggttgtca acgccatgaa ggctgagatt gctagcagct34920ctggtgctgc tgcccctgct cctgctgctg ccgctgcacc ggcccctgct gctgcccctg34980ctgtcagcag cgctcttctt gagaaggctg agtctgttgt catggaggtt ctcgccgcca35040agactggtta cgagactgac atgattgagg ccgacatgga gctcgagact gagcttggca35100ttgactccat caagcgtgtc gagattctct ctgaggtcca ggctatgctt aacgttgagg35160ccaaggatgt tgatgctctt agccgcaccc gcaccgttgg tgaggttgtc aacgccatga35220aggctgagat cgctggcagc tctggtgctg ctactgcctc tgcccctgct gctgcagctg35280ccgcccctgc tatcaagatc tccactgttc acggtgctga ctgcgatgac ctctctgtga35340tgtctgctga gcttgtcgac attcgtcgcg ctgatgagct ccttcttgag cgccctgaga35400accgcccggt ccttattgtc gatgatggta ccgagctcac ctctgctctg gttcgtgttc35460ttggtgctgg tgctgtagtt cttacctttg acggtcttca gttggctcag cgtgctggtg35520ctgctgttcg ccatgtccag gtgaaggacc tctccgctga gagtgccgag aaggctatca35580aggaggctga gcaacgcttc ggccagcttg gaggcttcat ctctcagcag gctgagcgct35640ttgcccctgc tgacattctt ggtttcaccc tcatgtgcgc taagtttgcc aaggcttccc35700tctgcacccc tgtgcagggt ggccgtgcct tcttcattgg tgtggcccgt cttgacggtc35760gccttggttt cacctcccag ggatctactg actccctcac acgtgcccag cgtggtgcta35820tcttcggcct ctgcaagacc attggccttg agtggtctgc taacgaagtg ttcgcccgcg35880gtattgatat tgctcgtgag gtccaccctg aagatgctgc cgtcgccatc actcgcgaaa35940tgtcctgcgc tgacaaccgt atccgcgagg tcggcattgg cctcaaccag aagcgctgca36000ccatccgtgc tgtggacctc aagccgggtg cccccaagat ccagatcagc caggatgacg36060ttctccttgt gtctggtggt gctcgtggta ttactcctct ctgcatccgt gagatcaccc36120gtcaggtccg cggtggtaag tacattctcc tcggtcgctc caaggtccct gctggtgagc36180ctgcttggtg caacggtgtt tctgatgacg atcttggcaa ggctgctatg caggagctga36240agcgtgcttt ctccgccggt gagggcccca agcccacccc gatgacccac aagaagctcg36300ttggcactat tgctggtgcc cgtgaggttc gttcctcaat tgctaacatt gaggctctcg36360gtggcaaggc aatctactcc tcttgtgatg tgaactctgc tgctgatgtc gccaaggctg36420ttcgcgaggc tgaggctcag cttggcgccc gtgtaactgg tgtcgtccac gcttctggtg36480
tccttcgtga ccgcctcatt gagcagaagc gccccgatga gtttgatgct gtcttcggca36540ccaaggtgac tggtctcgag aacctctttg gtgccattga catggccaac cttaagcacc36600tcgtcctctt cagctctctt gctggtttcc acggcaacat tggtcagtct gactacgcca36660tggctaacga ggccctcaac aagatgggtc ttgagctctc tgaccgtgtg tccgtgaagt36720ctatttgctt cggcccctgg gatggtggca tggttacccc ccagctcaag aagcagttcc36780agtctatggg tgttcagatc atcccccgtg agggtggtgc cgatactgtg gctcgcattg36840tcctcggctc ctcccctgct gagatccttg ttggcaactg gaccactccc accaagaagg36900ttggcagtga gcccgttgtg atccaccgca agatcagcgc tgcatccaac ccttttctta36960aggaccacgt catccagggt cgctgtgtgc tccccatgac cattgctgtg ggctgccttg37020ctgagacctg cctgggtcag ttccctggat actccctctg ggctattgag gatgctcaac37080tcttcaaggg tgtcaccgtt gacggtgatg tcaactgtga gatcactctc aagccttccc37140agggtactgc cggccgcgtt atgattcagg ccaccctgaa gaccttcgct agcggcaagc37200ttgttccggc ttaccgtgcc gtgatcgttc tctccactca gggaaagccc cctgctgcta37260ctacttccca gaccccctct ctccaggctg atcctgctgc ccgtggcaac ccttacgacg37320gcaagaccct cttccacggc cctgccttcc agggtcttaa ggagatcatc tcttgcaaca37380agtctcagct tgtcgccgag tgcaccttca ttccgtcttc cgagagcgct ggtgagttcg37440cttctgacta cgagtcccac aaccctttcg tcaacgacat tgctttccag gccatgctcg37500tctggattcg ccgcaccctc ggccaggctg ccctccccaa ctctatccag cgcattgtgc37560agcaccgtgc tcttccccag gacaagccct tctacttgac cctcaagagc aacagcgcga37620gtggccactc tcagcacaag acctccgttc agtttcacaa cgagcagggt gacctcttcg37680tggacatcca ggcttccgtc acctcttctg actcccttgc cttctaaagt tgtgaggctg37740tcttgtcttg tcagtcgcga aagtgtaagc aagaactttg tcatacaaag aagcaaccaa37800cttccgaacc aacacacctt gtaggattac aaccacaact ttctataaat agtgcgcaag37860aataaccagt aagctatcct tcgtgtacct gttacaacaa cgacattttt acttgatctt37920cctacttgtg atgggtagtc ccggcttgta ctgacagtga tgccacagca gagtagatca37980ctgtgaataa gtaaataagc ctacttatta tattcccaaa gtactcgctg ggatattatt38040agtatcacga aaagtgatat gttttataac tcgcttgtct tgccaagatc taaccttttt38100tttttaaatg gccaaaaagt cgccagaaca catcttacaa taaacaaaaa tttagattat38160atcgtatgta taatgtataa tatattatat tattatatac atacgatata atctaaagcc38220attccagact tattcggtga tgaaaaatgc tttcccagct ttatacaaac tattcaaaaa38280gttgcatgac ccattttcag atatatttaa tagtataaga ttatgtccat ttgttttcaa38340
agttattcaa gagtttacat cttgaagttt catcccttta ctactacact gtttttcgtt38400tgggtttttt ctctaacggc gaaagaaaca agtcaccaag cttaactagt aggcatcttt38460gtggtgacga aattaaagtt gaatatataa attatagtta gtcattatgg aatctcagtt38520tgaacgaagc taagctattt ataaaaatca ctgcatggag ataatacttg aattttgatg38580atagtgttta tgaagaagtt taatcttgct ttttattaat gttattctct aatatagaaa38640tatttcaata aaaaaatcat atgaagggat aataaataca gagaatgatc gttatcattt38700gatatgtcga acgctaatct atcatcttat ctaggaaaca aaggtggaaa taaaggaaag38760ccctacacga gttaattcct caaacgaact actttggatt atcaaatcca actgctgaca38820ctggatacat gcatgtattt agtgggtgtt actgtacttc cttatttcct ttaattcaat38880tgtcttgatt tttacttcgg agattctact tgaaaatcat ctcccttcac ttccggttat38940acagaaagac ccttcaattc gaatgctggc caggtacaat aactatcagc gattcccctc39000cactagacat gaccgactgt aagcacctca acccgatttc aagcaacaca tgatgactag39060ctgtttccgc aaaacaacaa ataagagagg tagtggaaaa cacccagttc gctcgagctc39120ccctagtaga ttcgacattc actttctatt tgattgctaa ttgtgggtcc ggctatttaa39180ggaaagaact gatgaaagtc cacctcacgc aatcaaatcg cggtctagtt ggaagctaca39240atggccgacg tatgcgcgcc tctatctttt aggattgtag aacagggcgg caatctgcta39300acataaattt aataccttgc tcaagctgct ttccatactt ttcaatccat ttgtgataat39360cttgcaatgg accaatctcc aaatctgtag aagcaataac aaggacatcg cagggtcccg39420gttcgtttgc atgctcgtct tctggtgcca caacaatgct gcctgttatt atctcatgag39480agtctttata ctgcggatcc gtggctatag cgtgaataaa cgttgtgcgc aagcctatat39540cctcgcgatg gagatactgg cctgctacag tttgcgttcg tctgcctacg acaacgcatg39600gaacattctt tggtgtgcga gtgggccgta gcgttcgacc ctgggcaagg aagccatgca39660gacgtgattc cgagaggcca tctcgcgtgt aagacttatc ccaattttct ggatcctcta39720atttccagct agccataagc tcagtcaaca gaccaagcgt tcttgatctt ctttctaggt39780caaatacatc ttgatggaag cctgcagtaa tttctttgta agatttggaa acgacgttct39840tgaaatgaac acaaactgat attgcattca tgggtgcagg tgacagttgc aaatgaactg39900aaatgtctgg agaaaagttg aggaagcgtg gtttataaag cggccaagct gtcctcgcat39960gcgcaagacc tagtatatta ctaatgactc tgcgaccaca atcctccatg cgttcaaact40020tgctatgcgg aattccacga atgatgttac cttgaggatt tggggctctc caaaggagct40080gttgcagttg ctgtacgtat tcgcggtgtt cgcggacctg atctcgaagt cgggcatttt40140cctctgagca aggccctaca ggtggaaatc tgcacagcat attgtatgtt ctctctagat40200
gtactgcccg ttgccgcaaa tgagctacat ccatctccag tttatttact gtgtcttcga40260gcgcaaacct ttcacagcgc ctgcgtttgc gttcatttct cgaaatctct cgccgccgct40320gcctgattcg ttctgcgcga tcaactcggt catcccctgt gtagcttggt gatgacgtgg40380atccatcttg tgaggcgtca aagccagaca ctgcctttac ttctaaatct cgccattcat40440ctgcaaaatc cctatatcct tccccataag tgtaatcgtc actacctatc aattctgtag40500atgccgcatc tacagtccta attatttgag gatttccttg cattgtaaag caaagatact40560cggaggctgg atttgtcaca aaaggtacga cagccctatt gatcaaattg aaggaagggg40620attgctttta ccagtacacg atgttactgt tgttgctatt gttgttgttc ccaatttctt40680cagacgtagc gtgccgcttc tgacattgcc aatagctgct tgtctttggt cttctttggg40740gaatgggcca gtaaaagaaa ccctaggcag ttcgattatc tactaatcta aagaacctgt40800ggcccctttc ccctcaaccc acgcccttcg ttgctctctt cggtcggtga agcgtttaga40860tgcgaggttt cctccactac gtgcttcttc aatgctaaac gcccaagtca actgaggaca40920ctgaaagcct gcacggagca gaagacccac acagacggtc gcaggatcaa ccctacctac40980gcctcgttgc cacgatggtc gctgccgatc ctcgatctct cgtcgattat tggtctcctg41040ttgcgctctt ggccacgcgg ccactcagac tctgcttctg tggcttctca ctgacgtgat41100gtagaaagaa atagaaagca cagagccact ttaaaaggaa aaggggaaag cagagaggaa41160agggaaaaag aagacctcag attgactcag agattgactc aatcgacgag agaatggaag41220ggaatggacg ccacggagac agaggcgcag cgagacggag cgagacggag gtaggcagag41280gcagaggcag aggtggaggc gaggggccgg gttgtcggca ctggcagagg gagagagaga41340gaaggagagg cggaccagtt tgaaaactct cgccagcttc gatagccgta ctcggtatgt41400atgtatgtat gtatgtatgt atgtatgtat gtatgtatgt atgtatgtat gcactcttct41460acttgtttcc aatgtgctgt tctatgcttt acagtgtttt ccgcgctcgc tacttgctac41520tttcatcagt ctgtctgcct gaggcggcgg tgatgcagaa tgcacctagg tacctatttg41580tcgccaactt tggatttgcg tggcggcagg attcctcttc tcctgcactt tgtttcgact41640cgccttagaa gggttgttgg aagacgccta aacgggtatt gcccggagat aggtgctgct41700ggtagctcat gtagatagtt cgttaggtag ttacactgga acagacagac gctctgtgtt41760tcgtggtgtt gcaggtcatg gactcagagg ggctgcgtga gttttgtgtt cgagagcaga41820gtgttgatat tcttttatgg gcaggacaca ttgcaacttg aagtaccgtg gttgtaacta41880caggacctcc atctgaagcg cggcatcacg tgaaaaagaa atgaaatgaa gagggaaagg41940acacccaaag gttcataatg tttggtttgc aaaggttatt cgaaagacac cttcttcgtg42000gtagatggtg attctgtcga aactgccgag attttgctga gagtgaacca aagcagggtt42060
ttgagataga agaatcaatc gtgcatggac aacctattcg taggattgtt atagctgttg42120tttgttatag gtcaaacttt atagcttcaa cccctcgctg gcaagtacga agggaaagtg42180taaatataca ttcttggttt aacgcataat ctcaagagct tccatgctga aaagttagat42240agtatattct tctgatttta catatttaaa ccaagtaaac aagttccacc aagggactta42300cttggcaact taaccatggt catcataatt tgcgcatcac ttagatcact acgttaacat42360tcgttcttga tctcttcgag cgcctaaata agcaaactgg cagcgaatta ggtcaccata42420tttttccaag gaggaaaaac tgtattgtgc tacccgttgt ggtgtaaaac ttgtaattct42480tcgcatctct aattcctatc gttaaacttg tcatcttact ttctggaagg aagcttggta42540tctcagaaaa tcgaactttg caataatacg aaagcacaag taagggttta tggcagcata42600acattgtctt aagaaattga atttaaaagc agaccgaatg caccgcagaa tacattgtaa42660attggtgcca aatattatga gtagcaatca tcaatctaac gcacgatttt ttgaagaagt42720acaatacaaa tttccccgtc gtagagaatc aaatggtttt acacatctat ttcaacactt42780ttcttggatt gtgatttcat atcaagacaa ggcttaaatg atcttggctt tctctgcaag42840agcggttctc caaatttcct ctcctgtttc tggattcatg tcaaaacata gtttaacaat42900agaaagaagg tgaccaggta ggtacgcaat aatagtttcc gcaatgaatt ggggcttgta42960gcgtgcagag aaatgcatga gatatagggc ctggcagttg tccaatgcac ctcgttttgc43020aaacctcgcg agctcttcaa tgtggatgtg gccacgctct ctagcaaagg agatatcgcc43080atcaaaaaat gtaagctcca tgcaaagtgt ggcagcctga agaaatagag cctcagggat43140atccagggcg tctataattg tgtcacctgt atatgcaaat tcaatcgttt ctctgtacac43200gaaatcctca ggcataggag gtgacttttg aaagcgcttt cgtttctctt ctggactgag43260gttggcaagc tctgacctaa gctcttttcg tttcgtcttc accgcatagc ctacagaggg43320aactctatgc atcgtcttgc acaccacaac acttgcatct cctcctagat cc43372<210>2<211>39976<212>DNA<213>Ulkenia sp.
<220>
<221>misc_feature<222>(32086)..(32086)<223>n steht für irgendeine Base<220>
<221>misc_feature<222>(32086)..(32086)<223>n steht für irgendeine Base<220>
<221>misc_feature
<222>(32084)..(32084)<223>n steht für irgendeine Base<400>2tcaagaattc gcggccgcaa ttaaccctca ctaaagggat ctgatgaact tggagcaaga60ataagaaatc catccattca agtcagcaca cccgatggca tcatcaatct tcgtcaactc120tttgtgcagg cagattggtg cttcgggcaa tcaatcggtt gacggattga ttgatcaatc180gctttgcttg cttgcttgct tgcttgcttg caattgatcg gcaaaagagg ccatccatcg240tagagcgtgc aatcttcaat gctctagcta gaggcgccat caggtagtta gttagctagc300tcgttagtta gttgctcttc ctgaaactaa caatgtatga catcagcatc atcgttcttt360cttctttatc catccaggat ccttcttttc aattcgtttg ttttgttttg tcttgttttg420tctttttctt tcaatgcaag catctcttaa ttcaacaaac caaacgaacc aagagatgaa480actcaaaaaa cgttttaaaa taaacaaaca attaaaatca aatagaaaat gaaattgaaa540gcacttttgt tttcgcctct ctagagagct agctatagct acctactatt cgttctcgct600cttcgtcgtc gggactgctg catcctgtca ttatcgggcc ctaagagtgc cctagtctta660gaaattgatg gcgataagat ggcggtcttt cttatccttc ttctcgttgc tgctgctgtg720ctctttgcct ctcggatcct tttgtttaca gctggccagt cagtcagaca gtcagttaat780cgattaacag gcaagcaagc aagcaagcaa gcacgcaagt cagccagctg gatagacagt840tagatagatc gtggcgtcgt cgttggcttc gtcgctgttt tggtgcttga ggattcgaag900tgcacgaggt tccttctacc tacagctctt cctttcactc ttcacctatt attatgcgct960gcaagttctt ttcgaaaggc tttttcttct ttcattctct ttcttttggc ctttgcgtta1020cagagcggag acgcctagtt ttatagatct aaataaacaa gagggaggac aacagaggcg1080gaaaacaagc aagttcaaga cggcaagaaa gcagcgcctt tgtttctttg tttcttttgt1140ttcttttcaa aagagccctt cctcggaaag ctttctttct ctcttgagcc aacttgaatt1200cgaatctgat cttcaaagcg agttagttcc tcaggcgcca ggcacctctc tccctccctc1260cctccctcta tcgcaggcag gccagcgtga cacctgtgac agcaggcagc tcaggcgtgc1320atgcaacgaa ggcgttgact catgcattgg cgctcactca ctcactcact cactcactca1380ctcgcgtacg tacgcacgca cactcacgca ctcacgcact caatcactca atcactcact1440cactcactca ctcactcacg ccagcattct cgaggagagg ccatgcgtag gtgaggtacg1500aaggaaagga gtccatagtt tggaggcgat gatggcgaat tgcagagcat aacagtgcag1560agggagaaac ttacatccat tcatacgtag ggaggcgcat acttacgtaa ctaagtgcaa1620tcggtggatc aagaaagaag gaatgaaaga atgaatgaag gaatgaatga aagaaagaaa1680gaaagaataa atgaataaat gaatgaatga atgaatgaat gaatgaatag ataaatgaat1740
gaaagaaaga gccccgctta tttggtatcg atctcattgc aaatgttcct gaaagttgct1800tatttgcctc acaactatga gtaggtagtg atgataataa tagtaattgc tattgctatt1860acttgaattt gaatttgaat ttgaattcag gtagacaata aaataagatt agcaaaacat1920tttgagagga agcagaggat atgcagtgca aaaggaggtc ccgagtttcg atcttctttg1980cacctgctac gtatctagtg cacgtagagc aagaaagaat gaaagaaaga acgaaagaaa2040gaaagagaga gagagagaga gagagagaga gaaagcgaag atgatagcgg agagaactct2100tcttcgcagt cactctgttt ctcagtcagt cccgcaacca ataacaactc gaactcgcag2160cagtgttctt cggagtgcca gcgctcgctc gcactgcgtc ggcacagcag cagcagcagc2220aggccccgcg ctcgctgcac tcagcccggg caggagcaac agctgctgag cagctgaggc2280cagctggctg gcggctcgcc tcgcctcgcc tcgcgtcgcg tcgcgagaga aagcgatcga2340ccaactgtca atcgattatt cgagtccttc gagcgcttta tagggcactg attgatcact2400cattgattca ttgactcatt tattctttgc gtggtcagcc aaacggcgtt agcattgggc2460aaagcgggtc tttgctttgc tctaaaatag atttgctcgc gagagtacgt acttgcagga2520gtaggtaggc tctgcctagt acctgggcat ttgaatattt gaacttcgaa cttcgttgag2580tatctgaata tttgaatatc tgaatatttg aatttcgaaa gtttgaatat ttgaatattt2640gaattttgga atattggaat agctgggttt ggagataaga cttactaagc taagcgccga2700cgtaagagcg gcgagtaaat ccacacacaa gagagaggca gagagagagg gagggagaca2760actcgcgcag gcaagctgag cccactggac gcacggggcg cgtcccccct gacgggcgct2820ctggtggtgg cgtgtttggg agggttttgc atgcttgtga taggggctct ggcgcgggct2880ctgtacggtg cttggagatg cacgggcagg gcgagagagg ggacgggttc ccgggaggcg2940ctgcttggag gtgctgagag ggagggagaa ggcgtgcttt gcgatgcgcg gggcgaccta3000ggcgctgctg cgcggtgcag cagcagggac ctcggacgtg agtcgaagcc gtctgcagag3060gagatggtag aagggccgcg gattggtagc agagaagagg aaatagaaga agaagaagaa3120atagaagaag aagaaataga agaagaagaa atagaagaag aagaggagga cgggcaggcg3180ggaaagatgg agaaaggact cgcggcggga aaacaagaga atgtgaactt gggcttgaac3240tttggtttga atttgaatgt ggagaacgag gggttgaatt tgagtttgaa tttgaaagaa3300aacttacgga aagaaagttt agttgaaagt gagaaagaaa aaaatgagaa agaaaaagag3360aaagaaaaag agaaagaaaa agagaaagaa aaagagaaag aaaaagagaa agaaaaagag3420aaagaaaaag agaaagaaaa agagaaagaa aaagagaaag aaaaagagaa agaaaaagag3480aaagaaaaag aagaagaaaa agaagaagaa aaagagaaag aaaaagagaa agaaaaagag3540aaagaaaaag aagaaggaga tttaaaaagt tgtttagttg aaaaaggaga aggaggaaga3600
agcagcgaca gcggcagaag aagaagtagt tgttgtaaga ggggaacgga ggcagtagca3660gtggagcagg cggaggcgac agcaaacctc gaactcgacc ccgtcgagcc gcagcaagaa3720caagagcccg accaggtgga cgaggacgag gtccgcttgt tgtcaggaac aacagaagtt3780gcaggactag ccgagagtgc taccactgca attcttagat ccacagacgc aagagcagaa3840aacttacaac tgctcgccac aacacaagaa ccaccttcag atacaaccag gttcgagaac3900tccacaagtc tagaagcagc aacagctcta gcagataatc aaacaggtcc agaaaaagct3960acgactagaa gagaaattat cgagtcgcaa cttgcaacca tggccactcg cgtgaagacc4020aacaagaaac catgctggga gatgaccaag gaggagctca ccagcggcaa gaacgtcgtt4080ttcgactatg acgagctcct tgagttcgcc gagggtgaca tcagcaaggt cttcggcccc4140gaattcagcc agatcgacca gtacaagcgt cgcgttcgtc tccccgcccg cgagtacctc4200ctcgtcaccc gcgtcaccct catggacgcc gaggtcaaca actaccgcgt cggtgcccgc4260atggtcactg agtacgacct ccccgtcaac ggtgagctct ctgagggtgg tgactgcccc4320tgggccgtgc tcgtcgagag tggtcagtgt gatctcatgc tcatctccta catgggtatt4380gacttccaga acaagagcga ccgcgtctac cgtctgctca acaccaccct caccttctac4440ggtgttgccc aggagggcga gaccctggag tacgacatcc gcgtgaccgg cttcgccaag4500cgtctcgacg gtgacatctc catgttcttc ttcgagtacg actgctacgt caacggccgt4560ctcctcatcg agatgcgcga cggctgtgcc ggtttcttca ccaacgagga gctcgccgcc4620ggcaagggtg tcgtctttac ccgcgctgat ctcctcgccc gcgagaagac caagaagcag4680gacatcaccc cgtacgccat tgccccgcgt cttaacaaga ccgttctcaa cgagactgag4740atgcagtccc tcgtggacaa gaactggacc aaggttttcg gccccgagaa cggcatggac4800cagatcaact acaaactctg cgcccgtaag atgctcatga ttgaccgcgt caccaagatt4860gactacaccg gtggccccta cggccttggt cttctcgttg gtgagaagat cctcgagcgc4920gaccactggt actttccgtg ccacttcgtc ggagaccagg tcatggctgg atccctcgtg4980tctgacggct gcagccagct cctcaagatg tacatgctct ggctcggcct ccaccttaag5040accggtccct tcgacttccg ccccgtcaac ggccacccca acaaggtccg ctgccgtggc5100cagatctccc cgcacaaggg taagctcgta tacgtcatgg agatcaagga gatgggctac5160gacgaggctg gtgacccgta cgccatcgcc gatgtcaaca ttctcgacat tgacttcgag5220aagggccaga ctttcgacct tgccaacctc cacgagtacg gcaagggcga cctcaacaag5280aagatcgtcg tcgacttcaa gggtattgcc ctcaagctcc agaagcgctc tggccctgcc5340gttgtcgctc ccgagaagcc cctcgctctc aacaaggacc tttgcgcccc ggctgttgag5400gccatccctg agcacatcct caagggcgat gctcttgccc ctaaccagat gacctggcac5460
ccgatgtcca agatcgctgg caaccccacg ccctcgttct ctccctcggc ctaccctccc5520cgtcccatca ccttcacccc gttccccggc aacaagaacg acaacaacca cgtgcccggc5580gagatgccgc tctcgtggta caacatggct gagttcatgg ccggcaaggt cagcctctgc5640ctcggccctg agttcgccaa gttcgatgac tccaacacca gccgcagccc tgcatgggac5700cttgctcttg tgactcgtgt ggtctccgtt tctgacatgg agtgggtcca gtggaagaac5760gtggactgca acccgtccaa gggaaccatg gttggcgagt tcgactgccc catcgacgcc5820tggttcttcc agggatcttg taacgacggc cacatgccgt actccatcct catggagatc5880gccctccaga cctctggtgt cctcacctct gtgctcaagg ccccgctcac catggagaag5940aaggacattc tcttccgcaa ccttgacgcc aacgccgaga tggttcgctc tgatattgac6000ctccgcggca agaccatcca caacctcacc aagtgtaccg gctacagcat gctcggagac6060atgggtgtcc accgcttcag cttcgagctc tctgttgatg gtgtagtctt ctacaagggt6120accacctcct tcggctggtt cgtccctgag gtcttcatct cccagactgg tctcgacaac6180ggtcgccgca cccagccctg gcacattgag tccaaggtgc cttccgccca ggtcctcacc6240tacgacgtta cccccaacgg tgccggtcgc acccagctct acgccaacgc ccccaagggc6300gctcagctca ctcgccgctg gaaccagtgc cagtaccttg acaccatcga ccttgtggtc6360gccggtggct ccgccggtct tggctacggt catggccgca agcaggtgaa ccccaaggac6420tggttcttct cgtgccactt ctggttcgac tccgtcatgc ccggctcgct cggtgtggag6480tctatgttcc agctcgtcga gtccatcgct gtcaagcagg acctcgccgg caagtacggc6540atcaccaacc cgaccttcgc tcatgctccg ggcaagatct cctggaagta ccgtggtcag6600ctcaccccca cctccaagtt catggactcc gaggcccaca ttgtctccat cgaggcccac6660gacggcgtcg tcgacatcgt tgccaatggt aacctctggg ctgatggcct ccgcgtctac6720aacgtcagca acatccgtgt gcgcattgtt gctggcgccg cccctgctgc tgctgctgct6780gctgctgctg ttgctgctcc ggctgccgcc cctgctccgg ttgctgcatc tggccctgcc6840cagaccatca ccctcaagca gctcaaggct gagcttcttg acgttgagaa gcctctctac6900atctcctcca gcaacggcca ggtcaagaag cacgccgatg tggctggtgg ccaggccacc6960attgtgcagg cttgcagcct cagtgacctc ggtgatgaag gcttcatgaa gacctacggt7020gttgtggctc ctctctacac cggtgccatg gccaagggta ttgcctctgc tgaccttgtg7080attgccactg gtaagcgcaa gatcctcggt tccttcggtg ctggcggtct ccccatgcac7140attgtccgtg ccgctgttga gaagatccag gctgagctcc cgaacggccc cttcgccgtc7200aacctcatcc actccccctt cgatagcaac cttgagaagg gcaacgttga cctcttcctc7260gagaagggcg ttactgtcgt cgaggcctcc gccttcatga ccttgacccc gcaagtcgtc7320
cgctaccgtg ctgctggtct ttcccgtaac gctgatggct ccattaacat caagaaccgc7380atcatcggta aggtctcccg taccgagctc gctgagatgt tcatccgccc tgccccgcag7440aacctcctcg acaagctcat ccagtctggt gagattacca aggagcaggc tgagcttgcc7500aagctcgtcc ccgtcgccga cgacatcgcc gtcgaggccg actctggtgg ccacaccgac7560aaccgcccca tccacgtcat cctccccctt atcatcaacc tccgcaaccg cctccacaag7620gagtgcggct accccgctca cctccgcgtg cgcgttggag ctggtggtgg tgttggatgc7680ccccaggccg ctgccgctgc tctcgctatg ggtgctgcct tccttgttac cggcactgtc7740aaccaggtcg ccaagcagtc cggcacctgc gacaatgtcc gcaagcagct ctgcatggcc7800acctactctg acgtctgcat ggctcccgct gctgacatgt tcgaggaggg cgtcaagctc7860caggtcctca agaagggaac catgttcccg tccagggcta acaagctcta cgagctcttc7920tgcaagtacg actccttcga gtccatgcct gccacagagc tcgagcgtgt tgagaagcgc7980atcttccagt gccctcttgc tgatgtctgg gctgagacct ccgacttcta catcaaccgc8040ctccacaacc cggagaagat cacccgtgcc gagcgtgacc ccaagctcaa gatgtctctc8100tgcttccgct ggtaccttgg tcttgcctct cgctgggcca acaccggtga ggctggacgc8160gtcatggact accaggtctg gtgtggccct gccattggag ccttcaacga cttcatcaag8220ggctcctacc ttgacccggc cgtctctggt gagtacccgg acgtcgtgca gatcaacttg8280cagatccttc gcggtgcctg ctacctccgc cgtctcaatg tcatccgcaa cgacccgcgt8340gtcagcattg aggtcgagga tgctgagttc gtctacgagc ccaccaacgc cctctaagcg8400agttatatct gtctagaaaa cttggcatgg ctagcaattt atgtctagct attccataca8460cacggtaatg ccagtagcct gttagttata gctcttttgg ttgttgtctc acaatacact8520gacatcagca gaacaaaatg aaaggggcct tggctaccat gaaatcaata cttcaaaagg8580tctcttggtt tctttactcg catgtcgcta tttacttaca ttcctcgagt acataacata8640tcatacatca aagaaattaa aaagaaaaca aacattcaaa tatgcattac tttccctact8700gtactagtaa gtacgtttct ggtattaagt tgttttttct caaaagaaca atgtgcttac8760ttgtaaaatc cacagctgct tacttgtaag cctcaactag ttagtgatgt gattatcata8820aaatgttcga cactgtacct cctttccagc tatcttccta cacctcctct gacgcaggtt8880gacggaggag gcgtgggggt tgattgaagt gcaacacaac gttttgttta agatattcct8940tgccttggcc gactccaaat ggatagcaca gaagcctaat gataatttga attaatttta9000tttcgagctt atttaatgct cttatcagag tccgtaggta tctcttttcc tactaattgt9060tgaaaaagga tgttttggac atagcaggtc atcatactat ttggttccat caaattcata9120tccatttctt tcgttcaagt gcttcccttc ctacttatta tatatattat atatccataa9180
atgtaaaaga gacgattacg aatactttgc atacatgtat agcgaaacag agatggtagc9240aaaagttcac cttcactaat ctaagaatct ctccacgtgg gtaaaaactt cagcagtaag9300attgtaaatg atgtccaaga acaaaacgtc atgctagtcc aggggttact gagctaacga9360ttaataatgt ttcgtagtct tcctaattgc accatcaaaa cttgtctgca caagttttaa9420agtattggag cctttactga agaatcagag gacatagatg gggcacgttc gccttgaaaa9480aaatagtctt ctttacctgc atggtgttac aaacaaaaac gagttgaaaa tagctgtgca9540aggaggcaaa catgattgga aaagaaaaac gaggggaccc ttatacagga gggcgccaca9600tagtagaatg agtagattgt tagagtaggg tacgctttat gtgattgatt gaatgggcga9660gtgaaagttg ctgtcaaggt tctaaacaaa aggatgtttg agtttgtgag tattgtttgc9720ggcaaaaaga ttcagtagag agaaatgcac aaaaagataa tacgtgtgta gggcgattat9780ggaggcatgc atttggggga aatcatcgca tgcgcatgag tttctccatc tgccgaatct9840ttgcaaaggc attttcaagc tccatttgca tagcgtaggc ttgctgctca aactgagcgc9900gctgatgcgc cagattttct tcatgtcttt tgttcaaact acgctcaaga ccctcaagag9960ccgcaacctt gagcttgcgt tccttttgct gaatctccat aactcttcgt ttcacctgga10020gctcaatttc tgcagcatcc gtggtctttg cagcggcctg tgcgtcttgt gcggcctgtg10080cgttgtttgc gagctccttt cgcagctcct ccatctccgc gttctttttc tcctccatcc10140atttggcacc gagtttggca gcttgatcga tgcggccctt gagaacttct tcgttctcct10200caagttctgc gatacgcgcg tgtaagccga ggatctcctc cgagacagcc tcgccattga10260tcattatttc acttcccgag tcttgaatga caacatcagc cttggtgcca ggttcaccgg10320tatctcgctc gcaaccctgc tggcgcatag acagcataag gcgcgcatta tcctcacgca10380gatcatccac ctgttctgat aaaagtttga ctgcctgctc aagattacgg gggttcactt10440cgtgaaaaat ttcttgaagg tctcgaagct cagaaagctt ggcagagcaa gtgtgcatcg10500ctctgcactt tttaagacgt gcaagtgcat catcaagttt ggcattattt accttcatgg10560aggcttcagc tacttcggct tcttcgatta caattttctg cagctctaca acatcatggc10620caattaactt gcgatgcagc tcggcaatca ccccatgcat cttttcggta tggcctggac10680gcgcctcatc ctgcgttctt cggatctcct cctctagttc tcgatttaga cgaagggctg10740gtccaagggg cgggtaatta gcctgagtca agccaagctc tgttgctagt ccaaggcagt10800cggaaagtcg cagccggtcc ctatcagaaa cagccttttg caagtctacg ctcaaacgca10860cttcttgagc cttgcgcacc atcttcggtt ctgcctgtcg cagaagtttc gagtcgtagc10920cagcttgcca cgctagcacg atggcacgcg caagtgacct cagttgaccg ctgttcatgg10980cagacttgag caacattttg atttgcacaa atacctcatc tgattcatca tcttcagctt11040
cctcaagctc tgcaggtgtc ttgcgctctc cagagacttg aagagcaggg ttcaaaccgc11100cctccaggac ctcgctcgca agcgcctcct ctgtctcagc tttgcgcaat agcgcagcag11160cattctccgc cattgtgttt gtcactcacg agattaatat cgttgccaga gtatacggta11220atgcgagtta aggattcaca gaatctctca aattaatctt ttcacctaat gatatccaca11280aaacgttgca atcgctcagc ccaacgacaa gcgtgcttct tgttttaaga ctgcaactgc11340tcctttttct attagtcaat atggaccgtc ctccaaacgt ccagaaaata gcacagaatt11400taccagcagc cgctgcagac aagaagtgca agagagcagg caagcaagtg agggtttgag11460caaataggcc aacctctcca cgcagaattc tagggtcgca accggaactc acagtcctta11520gaaaccgtgc gaagccctgg gctcaacttc aatttgtcca cgggaccttc agcaagcacc11580aagctcagca gcgtgaaggc aggcgctgac cacagtttga gctcagaggg cttggtgtgc11640ctcgcgattg atattgaagt caattgcgca ggacggcagc aacggaccag gtggtgaaga11700aggtaatctc cagcggagtg atgatggagc tcgaccgact actccggaat cgaccagggg11760aggtgcgggc gcccttcaca agcgggcgag aggcagggga gagaaggctc gactccacgt11820cttgaagcgt gtacgtgtgc gcgctcacgc gtgcgacacg ccggcaaggg cgccttagtg11880gcctgctgct gctgctggtc gccacgctgc gagcccaaga gatttgaatt gaactcgaag11940aaaataacta tcatttatca attccaatca atcaatgcat tatgaagcac ctctgaagtg12000aactattctc ctctccaata tacaacaaaa aacacacaca gtgggtttta ccctataacc12060tattgttccg cgagcgatca actactctat agagcgaatg accagttttt ctttctttct12120ttctttcttt ctttctttct ttctttcttt ctttctttct ttctttcttt ctttctgttt12180tcctatctaa taaccccttt aatcgaggaa acctttcgat ttaaaaggaa agctctgtct12240gtatatatct gttacagata ctgctatcat gccatgcaga aagaaacaca aaagaaaaac12300aaaagaaaga gagaaagaga gaaagaaaga gagaaagaaa gaaagaaaga aagaaagaag12360agcttttctc aatcggtttc ctcatcgacc gctcacatat ctacgattgt ggcaaagaaa12420gaaagaaaga aagaaggaaa gcctcagcag agtccgcacg aaagccttca ttgagccacc12480atgtcgtggt ccgctgcagt cagtgccgcc tctctgtgaa ttgagtgagt gagtgagtga12540gtgagttggt tggttagtta gttagtgcct cttcagctca aagcctttca cggtcgctct12600tcgagcgttt gctttttcat aaacaaataa acaaaccatc gaacgaacca tcgaacgaac12660gaacaatggt accccagaat agacggaatt aattgctaag taaaccagta acagtaagtt12720agtgtttctg acctgagccg ttttctttat ttattcctct cagctctgtg aagagaattt12780gggatgaaaa gaaacgtttt tatttattta aaagtttagt aacaagaaaa acatggtccc12840tcttcttcct tcatgtaaaa ataagtaagt aaaaaaaaga aaagaaaaaa aaaaaagctt12900
ttaaagtagt aaagcgaggt agagataaaa gttctttctc agggctccta gtaggcactt12960aggaggtacg tctaagaccg cctcgtggga agaaaagaga aaacaagaag agaaaagaga13020gagagaaaca gcgctgaccc gagaggctca tgcgcagagc ccaaatctgc ccaactttgg13080caaaatgcag cgccgcctct gcggcggaga cggtcatgtg aatccgcaga gctgcacgca13140cgcgtcacag gctacagctg gatatttttt atacgagccc gcgcgagacc gcggcggaga13200aacggggtcc cgcgcgaagg gcctctgaaa agcaggcagc gaaccaggcc tgcaccagcg13260ccgacctccg cgagacttcc ttcgatctca ggaaggacct tctgaagagt ggctcaaagc13320agcgcaggcg gaggcagcgg cggagggcac gcccagcgag ggcatcggct cgaggctcca13380gggctgccag gtcgcgaggc atgcacggcc tcggttcgtg atcttggccc tgccgggtgt13440gccgggatcc aatatggtgc gcaccgtttt tgaagctgtc gctcttttct cgcgtcgcac13500attacgatgc gcagaactga gtgagtggac aaacgaagag ggcgatcgat ggcttggaat13560gcgaactccg tccatcgaca tcgacatcga tcaacccatc gacccatcca ctccgtgcac13620aagctgcact ccgtgcacaa gctggagacg agcgaccgaa gaggtgacga ttcgctctcg13680ctcgggatgc ttggatgatt ggatgattgg gtgcacgagc tgccacttgt tgttcttgtg13740ttgttcttgt tgctgttctt cttcttcttg gcggtcgttg agcgaatgcg ctgtttgtcg13800agaaccatga aatgagcgtc ttgaatatgg gtggcctcgg gaatccgcag aacgatggta13860tcgcattcgc atccctggtt gcaagaaggc ttgcgatgag gtaagcacat gccgactcgc13920cgatcgacca gcgcgggcct ctgtgccgaa ggagcgacag cttggacgca ggggaatggg13980gcctcgaagt tcttgtggtc actcaggaca gaaactcttg ttttaatttt tctagttgct14040tagctcaagt tagttagcca gttggctagt ttgcttttaa ttaaaaatga agaaaactaa14100aattgagttc tcaagtctga aagaacaagc aaacaaaagc gaaggatgtg ctgtgcatgc14160acgagcttcg gctcaggcag aggaagattg ccagctcgca tgaccttgga tcttccatac14220tgcgtaatgc tgagcgtcag agaaagatgc gggccaggtg ccggaagata taccttcatg14280gactttccgc agaggtgaag atcagcgatg atcatgtgga agtgacacga cgcacctcga14340gcatcccagg aattgcagtg tttgcccagg caggcagtga gtgcctggtc aattatggaa14400tagtcaatct agtaatatga gtgagtggaa ggcagaaaat aatttccatt ccttcattcc14460atgactagct gcatcaacat catgatgttg cttcagctcg tcagcagggt gaacaacgtg14520cgggctagaa gaattagaaa agaacaatga gtgtctatga atgcatgaga atcgagtgta14580atgcaataca gaaacgtgag aaattgcagg attgattaga aagtattagt agggcaagaa14640cagagagatt agagaagtga aaagggatga cggtgaaacc agtgtagtcg tagtaaagag14700tggcttgcaa ataggtgcac cgcatccatc aattggtcaa cgagcaaatt agtgcagcca14760
gcgtactagc tatttactgc gacgatgtaa cgaagtcctc caaggacgcg tacacggtgg14820ccggcaagtc ttcattggcc ttgagcttgt ccaagataat gcggggaaac tggattgcct14880ggtcaatcac agccttacgg gcctgctcat cgttgacaag tgtagggtcg cgctcgaggt14940ggtcggtgaa gcgagagtgc aaatcaagag ctacagcaat gatgcgctca gccttgagct15000catcacggtc ccactcaact tcctgttgca taagcgaacg gatgcgcttt acaaggcagt15060ctcggacctt gggcaggtca ttaataccat cgtaataaat actcatgacc ttccacatgt15120tgggctcact cttacactta gaacgcattt tgtcaaagag ttcctccaat tgtgcgcgca15180aactagacac aagttcaggg ctcatctcct tacgaggggc ctccggcgtg tgcgggtctt15240cactagaggt ctgagaatcc gacttacgga cagagaccaa ggcatctaca actacaagga15300gactctgtaa gtcaaccatc tcagcaatgg cctcacgagt gccagcgcgc acatccacaa15360ggttgctcat accatcaatg gccatacccc actgatgagt gttgacagcc aacataatgt15420agttctccca aatacgccag ttggaacgtg actgacgaac ggcctcaata acagccttca15480aggcagcagg gtagtcgttg agctgaatca aaatcgaact caagttagcc caagcatcac15540cgctatccgg atcctggcga gtcacatgcg caaaggctgt acgggccaag gtccactgtt15600caaggcgcat agcgcatgag ccaaggcgga accacgactc cgggtacaac gggttgatct15660taagggcatc ctgaagatgg tcaatactct cctgcaaatc accgcggtca aatgccatga15720gggccaattc acgcttagca cgcgcatggc gcttgccaga aaactcccac gccttgctga15780accagtcctc atcctgaagc aaagagccca aaacgcacat aaggtgtgca gtgggctcaa15840ccgcaaggcg ctcacggatc aacttctcag cacgagcgcg cttgtccatg atcacaaggc15900agtccacagc ctcttcccaa agacgaacct cctcaaagat ctgcaacgca ctaccagcag15960cgcccacttc atagtacaat tgcgcgaggc cgcgcttgag ctcccaaact gcgggccagg16020atagcgcgtg cagaaaggcg agacgctcgg tcactggggc agcgttgtcc acgtcacgct16080ggcgaggctg tgttggtgtg agccggtcag tctgctggtc aacgaggact tgcatctgta16140aaatggcacg ctccttggtc ttgttgcgct caaactcaag ctgagactta attagaagtg16200cagtggagta taccatccag ttttcaggac tctggaggac acgctccacg taagcaagca16260tctcctctgc agtgagagcc tccatagcgt agctgttctt cacatccata cacaagccga16320gcacgataca ttggtccaaa aggctcaacg ttccgcggcg catgcgagcc tcctcatcac16380tggtctcctt tgcgtactgg atctcctcgt gaagtggagt ctcagcgtca acttcttcga16440ggcgcacctt ggggataccg agaacagaag tggatcctac aatctcgcca ccgttctctt16500cctcagttgc atcattatca tcctctgcca caatctcgtt gggtacctcg gtctcgatgg16560ccttgactgc aggggaatta gaattcttag gagcttcagt gtcttgctcg cgattttctg16620
ggtctttggt ggtagctgac gatgcaagaa gaataagctg ggtcttctct tgtttctgaa16680acttggtacg ctttcccatt acaccagtca tctgaatatt cagctgtgct gtttcctttg16740cctttgcaaa agcacgctta gctccatctg cagccttgaa cttgtgccgt gctacaccgc16800attcaaccca cacgagggac tccagaagtt tatcatctgg gtacatgatc tgcactgctc16860ggaccgtgcg agcaaatcct ctctctgcct cctgctcgag gctaggagcc gctgcctcct16920tggcttcaag ggtctcctgg tgaacaacag cagatcgtgc tgcccaccag ctaggtgtca16980gcaaatgccg aagagctccg cggactgtgt tggagatctg ttcatgaggg ttagcgctca17040tgccgccgtt ggtctttccg ggaagatcgt cctcgccatc ttcttcatct tcatcagcgc17100caatgacagc cccgttctcg tccacaaggc gagcctcagc gagcatatca gttgggtcca17160cagcaccagg ggttactgtc ggattagcga cgacgcgaag aatgacgcga gcaacaagca17220agaagtgaag gtacttggca tcacggtaaa cctcctcacc gttcgcgcca agcataagag17280ctgttcttgc gtggagctcc ggatacgcgt ccaactgtga ctcgtggaag ctggcatcct17340tgccagacac agcggcagca gccttagcat cggaggcgag agcggagaga gcagtctcca17400catacctgag acccggctca gaggcagact tggagctagt atctacagaa ttcttggggg17460cattggggtt gtcgtagacg ccgcgaacaa aagggagggg gtagaacttg tcaataccat17520ggctagagac aggaggacct gtccagttgg cctggacgaa gatgtgaaga caagcaacac17580ctgcgaacat gcaggccata gctcgcagag cacgttggtt tacgtggtcc tcaggactcg17640agacactcgg gtccttgaag gagctgcgac ctgtgcaggg gccaatacca gtctcgatgt17700gctcaacaac acgctcatga agaaaacgac cttcacggtt tttaacgcga cttacagagt17760atcccttctt aagatcttct gcggcaaaga ggccttgcgc cgcaggagag gcgaggacct17820cgaagaaatc gccttgggca agagcgcacg ccatacgaag gacctcaagt tggagctctt17880tcacttcagg ctcgtcacgg agctcctcgc ggtcatctgc agcagccgat gcagccacaa17940gaacatcttc gagaccctcg gcattgctct cgagagcgag acgctcgaca agtcgcagcg18000agtaaagagg tttcgcaact ccagagccct ttttggagtt gaagacacca gcgccgttaa18060gatcaccatc gtcagcctcg tcgatctcat catcaggacc gtcagggagc tcaaagtcct18120gtggaaggcc taggaactcg ttcaggtctg cgtcagagct cgaatccgac gcgtagtccg18180ccatcctggc ctacaggacc gccgaaacag gttgcggcag ccgcccaaag tctaagctgc18240aagagtcaac cctcaatcgc gagcttgcgg cacaacgtcg ccgcaggatc tcgcgccaag18300acgtctccaa atgcaagtct ggtgctcaag tcatcctggc cacccgcgcc tttgcccctt18360aagctaggtc acctacctta aaccagagtt gccccgcggt gtcatattgt aaacatttta18420taacaatata cgtcatatta aaaacctaga tgtggggaca atgttataaa taagtaacaa18480
atatagacta catcgagaag aaagaattct tcggcactcc gtgtgagttt gggcgaaact18540gcaatcacga agccatgcaa agtcttcgta tatctgagtg gagcctcgct ggagagaaga18600ccccatgtga atgggtgtag aacgacgaat ctacgcagcg ttgtctccgt tgagacgctc18660tgtccagata tgaggtccct cactattctc gtatttgatc atgccaagca tctccagttc18720caacaatgga gttttctatt gaaagaacat agacatgttt ggaacggttc ctttcagagg18780ggaaaaacta atcaaaaatc aattgaggaa tgcagggggg ttatttgctg cagttttagc18840aataaaataa aaatcctttg ttgatgtgat ttcattcgtt cctttgacat tcaatcattg18900aattgctctt caccggagct tttcaaggtg cccaactgcg atctccgctg cggctgctcg18960cggccgggct ctgagctcta tctccgtgtg ggaggcggga agccagcagg tgcggcgacc19020ctctccaaat agaggccgcg gcgaccttga ggcactcgcg tggcgggcgg attggcgatt19080ctgtgttcaa ccgagatatt tcatacatat tatttgctaa ttattagcaa atagaaataa19140atatacagac tttgcaagct cagtagagaa agtgaagatc caaaatgtcg gcctcttcct19200cgcaatctac ttcggagcag cgcaagtcac gcgtggcgta cttttacaaa cctgagattg19260gcagctacta ctatgggtaa gttagtatgg gaaaattggc gacagaaaaa tataataaaa19320aaagcaactg tatcgccacc gtttattcac ggtagttaga aggtatttgc ttcctgcgca19380cactcgatct gcaggatgta catgtcttga gtggcattgt ccaacgatcg ttctgtttgg19440cggaacattg cttttaaaca aaaacgagat agtgaatata ttctacccaa ctaccaccat19500ccggtttaag gagacaaata aatctgtctt tcgacccagg ataaggaggc ttgcatggga19560atcttttata atctagtctt tatgtcaaat tttcgcaggt tccagcctac catctctcat19620gctatttgtg attgcacaag atgatatgaa agtaaagaaa caaggcaaag gatataagat19680gcataaggat gtgcagaaaa ctaactagaa acattcatgt gatgaaacct tcctcttgaa19740aactcacctc ggtttgtttt ggatcttggt ttgtctttgc tcactttttt tcattattta19800cagcccgtcc catccgatga agcctcaccg cctgaaactg actcacaacc tgcttcttac19860atacggactc ttccgacaca tggaagttct gcgcccgcac gacgcgactg cggaagacat19920ggagcgtttc cactcgcacg aatatgttga ctttctaaag cgcatttctc ccgacaccga19980gcaagagttc gagaagcaaa tgacccgttt caacgttggt ccctattctg attgccctat20040ttttgacggc ttatacaatt ttatgtctag ctgctccggc gcatcgttgg atgccgcaat20100taagatcaac cacggacagg ccgatgtttg tgtcaactgg tctggtggtc ttcaccacgc20160aaagaagggt gaagcttctg gtttttgcta catcaacgat attgttctct gtattgttga20220gctcctcaag tatcaccctc gtgtactcta tgtggatatc gacattcacc atggtgacgg20280agttgaggaa gcgttttaca caaccaatcg tgtgatgacc tgctcttttc acaagtatgg20340
tgacttcttt cccggtagtg gtgcctacac agataccggc gctcgcgctg gtaagaacta20400cgccgtaaac tttccgctca aggatggtct tgacgatgcc agctttgaga gcatcttcaa20460gcctgttctt gatggcatca tgaagcactt tcagcccggt gctgtggtga tgtgctgtgg20520tgctgattcc atctctggtg atcgccttgg gtgctggaac atgtcattgc gaggccatgg20580ctacgctgta cagtacgtga aatcctttgg cgtacctgtt gtgcttcttg gtggtggagg20640ttacaccccg cgtaacgtgg ctcgctgctg ggcttacgaa accggcattg cactcggcaa20700gcatgaggat atgcagaatg atattccatg gaacaactac cacaactact ttggccctaa20760ccatcttctt cacattactc ctgacccgca gatgaagaac gccaattcac gcacctacat20820ggacaagtac accaacatta ttctcgagaa cctttcgaag cttgaagcgg tgcccagtgt20880acagttccaa gatcgcccta acgactttgc aaacccagat gagcgtgctc gtattgctct20940tgacaacgct gaccctgatg aaaaggatta cattcaacgt cctcagcacg aggccgaata21000ttacgaagac gagaaacacc aagactcgga ccgtcccaat ccggctgatg gtggtgccga21060ctcaaaggta aagtctgaaa aatcctcagg cgatggagct gcggacgaag cggagaccgg21120atccagaaag ccttacaaaa agggcactga atgcggtggt ctacttgaaa ttgacgaggc21180tgtcatggaa gtggactcca atgaagcgcc caaggagact gctcctgctt cagattctgc21240tatcaagact gaggatgctc ctgctgctga gtctgctgcc tccccctcgg atgccaaggc21300ctaaacatga agactttgtt ttaatgcaat agacgtgctc ttttgctgct cgagtagcgg21360caaccctagt gccatgtcct ccttttttct tactcacttc tctctctacc tttgaaagag21420accaagtgga accaagcagc catttctgtg ttccacattg caatagatta tcttttaaca21480attctcatac atacatattt tcttcatttt tcttttctat gtatttttaa aataaaatat21540aacaacaaag tagtagtttg tatgaatttc ggccatgcag gtgacaaaag gtgaaagtaa21600tgagcgtcat tttggatcac attaccagcg aatccactca acgactcttc tcttctcgag21660ctttagaagc tgactgtgag ataatagaac agagcacggt ccatcaatca aaatacataa21720ttagctcgca atagcttcgc ctcacagtga tcgtttcacc tcatgatacc cttgttgggc21780gctcgctctt aggctctccc ttgttgttat atgatgcaac gatcatctaa gtgctgtccg21840cagtcatcaa gacatcctat tctgtagcaa gcaagcaagc aagcaagcta gctagtttag21900ctggctagct agtttagctg gctgagttcg cagtgaataa acaattaaca cctcaagtct21960tgaaggagca ggaaacttgg ctcctatgat atgccatcct ggaaggccat gttttggggg22020gtatgagaga caggtctttc cttttctact ctggttcggt ggatgacgag acaacaacca22080gacgtcccgc ctagtacctg ggtggtcgat ctgtcctccg ttcactccga gtgcagggct22140tgtgggacga ctcgctctgt tgaattgagg tccttcacgc gagcctatct gggcatcgat22200
cgacctcatc catcaacaca cacacatatg ttcaatccgc gccaccctcg ctgactccca22260gactgcccag cgaaactttg aaaacttccc catctcgaaa cagcactccc aaaagacgca22320cacaagcaac gcttgagcct aggcaggctc tccgctggac gcacaaacca cctcgcagcc22380atccactctc tgactcccca agcatgcatg gccttctccc tcgatttggc gcttcgcgtt22440gctgtcttcg aagtcctcaa acacgaactt ttcactaatc atcctcgacc tcagcaggat22500gccccccctc ctaagctctg tttgctatgt atttattaga ggaaggacgg caagctgggg22560gtctgcggaa cgcattttgg gggtttgaaa attttcgaat tttcaaactc cccgaaacgg22620ccatggtttc ttccgagaag cggtagttag gtggggaaat gagagcacgg cggagttggc22680gagaagcata aatctgggcg ggcaagcaaa ccccaaacta tcctgcaatc aacaaaacac22740acgcactccg caatcaactt gcaccgtaag tctttggaat tgattatggt atctgcttcg22800ccgtcttcaa ctttaacttt gcgcctcgca acgagacttt gttttgtaat gtgcctttag22860atttgacgaa acatctttaa gcgagatagt acagcagcgc gttggtacca agagagatag22920atcctgggac cttttgaaat aaataaactg tgtgatgaac ggtcgactaa ctgggcttgt22980aattgatata ttgatgatac tcttggtcca catgggagtg agcacagtcc acaaacaact23040tgctaaccca cacaaaaacc tcccaaactt gcagacccgt tctgcattct tgtaaacaca23100taatcacaca gcacacataa tcacaatgac ctacggcaca gcacacaact acgtgcagga23160gcagattgag ttggacgaat gcttcaacaa ctttggcgaa gaagtgagca gctctgttga23220gcctcggtgg cagcgcaagg ccttggccgc tcgcactccc aagtctagcc gcaagcgtag23280ccgcaccggc aagaccccga gcaagggcaa gtctacgccc cagcacgacc gattcatccc23340caaccgtggc gccatggacc tcgctaacgc tcacttcaac ctcatgaagg agaacagcag23400ctccgcctct aaccagtgcg agtcccctac tcgtgctgaa ttcaacaagg ctttggcgtc23460cagcatgggt gcgggtgagt cccgtgtttt ggccttcaag aagaaggctc cggcaccgcc23520tgagggatat gaaaactccc tcaaggtttt gtacacgcag aacaaggaga agatggcgcg23580cactcagaag cccgttcgtc acattccttc ggcaccggag cgtatcctcg acgcacccga23640cctcttggac gactactacc tcaaccttgt cgactggggc gcctccaaca tgctcgccgt23700ggcccttggc cagacggtgt acttgtggaa cgccgagacc ggcggcattg aggagctctg23760ccagtgtgat gccgaggatg actacatcac ctcggttaag tttgttcagg agggcggtgg23820ctacttggct gtgggcacga acttcagcga gaccaagctc tttgatgtgg agacctgcaa23880gcttctccgc aacatggacg gtcacagctc tcgcgtgtcc tcgctctcgt ggaaccagca23940catcctttcc agtggcagcc gcgactcgac tattgtgcac cacgacgttc gcgtggccag24000ccacaaggtc ggtgttcttg agggtcacgt gcaggaggtc tgtgggcttt catggtcccc24060
ggatggccag accttggcct ccggaggcaa cgacaacctg ctgtgcctct gggacgctcg24120ttactctggc gacggtcgct cccagcagac cgtgcagacc ccgcgtctta agatcgctga24180ccacctcgct gctgtgaagg ctcttgcctg gtgcccgcac cagcgcaatg tccttgccag24240cggaggtggt actgccgatc gcacgattaa gatctggaac gctgccaatg gcgcctgcct24300caacagcgtc gacactggat cccaggtgtg ctccctcctc tggaacccac acgagaagga24360gcttctgtct tctcacggct tcagtgagaa ccagctcagt ctctggaagt tcccttccat24420ggctcgtgtc aaggatcttc gcagccactc cgctcgcgtt ctccacttgg cgatgtctcc24480ggacggaacc actgtctgct ccgctgctgc tgacgagacc cttcgattct ggaaggtctt24540cgaggcagct aacccggtca agcgcaacaa gcgcgccgct ggagctgcca ctgcctctca24600cggtggcctc gcccgcatga gcatccggta agtttccccc cttcccttgt ccggttaatt24660cactttcgac tactgtctta cacagaagca aagcatggtt atgcaagcaa acttgctggc24720atgctctctt ttgtctcttc agtagcgaga ggccgtggtc aaggggctca tgcgggagct24780ccaatgtaat ctaccaccac ccggcctctc atgtatacat atatatatat ctatttatat24840gctgatcatg atgcaaaaaa atcccacgcc gtcatactaa agcgcgtcag tgtttacaat24900actgttggcg tatagttcgg tagtgaaaat taaaatcctt cagggtttgt acctatagct24960tttggtgatg aatgtgatct actactactg acgtgacaga agcaacaatt cttgtgaatc25020tgacttcttt tttgtgtatt ctatttcgca tgactgcctg attgtatgat atgggtctga25080tttggtcgac tgtactctat tttgcatgcc atgtaacttt ttgttcgatt atactatgaa25140tctgtggcaa cttttgctga gaagaaggga tggcagacag tttgattttc ttgatcaatg25200tgtttcgctg tcccgctgtg ttgaaagaat gcagtaaatg acccgagtat cggactggag25260tgcgtatgtt tcacgctgcc ttatgaatcc ccaggggttc gcagcagcac tttccctcgt25320ctgtctctgt gtttgctgtt tgttcgctcg taaatgtgtt ttgcctgtat catatgcatg25380taggatagaa agttattacg cagtgtgtat tatagattta tggaagatca ggtggactcg25440tatatgctga ctggtgggta tgcttcacgg gatactcgca ttaagttcaa attcgaggca25500atggttgctg ctgaagtcgc tgacgaagga gagctcattg ttcttgtcgc caatttgtaa25560gtaggtggca cctgattcct ctttcctctg ggaagagatg cagcgctctt gggatcagtt25620tctctctcaa tcacgcttgc cgagcagttt ttagtagcaa gcaataggtc tttaatgact25680tctagaacta gatgagcagg tatttgcatc atgcaaggct ggcatgtttg gtggctttgc25740aatttctctg tcttgaactt agctggatag atagcgagag agtgaagttg gtacaaacat25800aaccgacagc atgtagccgc tgccttcgct cgcagctcta gcgctcgcct gcagagacgg25860aagagtgtat aattgcccag tgtcaacttt tgggtggtgg gtctgactca caatcaatgg25920
taccgttcag gtatctttcg gtagattatg acactggcca cttttctgaa gtgatttgag25980atttggtatc gatgatgaag agtgagagaa ttttgaaaga aatacctcat taacttccaa26040tagtcagtat cttgatgaaa aacgctgacc tgaaagctgc gcgtgttttg ttgacacggt26100ccttttattt tgttttttga tgatctattg gtacttatac ctgcgatttt tcttttgcaa26160gctaaggcac attcgacttt gtctagaagg aaagtgatca tcacgcttcg gcacacatct26220gttttcctca gttaagtttt cttcttggtt caggtatggt attacatgca ggaagaaagg26280ggatgcgggg acagccgtat agatgccacc aactttaaca tggtttgtgt tttggggaaa26340caaggaaaga gagcatacgc tatgagctac ttaaactagt gacacaagaa gcaacttatc26400ataccggaga tcacaatgga gtgattaggt tctatcagat agtagaagca gagtatgcga26460cctgcggtgg ctacgtacat gggtgaaaat aatagaacac ctcgcgtagc gtcgaaaacc26520gcctcgtaga ctctgtgtca ggtatgaacc acccactttt tttgtcctct ttatctccac26580actatttcct tcatggagac aaactcattc tcgaaagaca aacaatcaaa tcaatccatt26640accctcatgt tctcatgatg ggtatgttat acatatatgt ctcagacata tgtttatcct26700ttttaaaaca catacttaat aggcacttag cactgttact gctatagaaa actcatccat26760tcaagaggag ggagagaaca gagttggcaa aatcttggaa gggcaaagtt tatagcaagt26820aagtagtagc acagagagag tattatgtat gtgttcatct agcaaaatct aaatagaaga26880gccgatcgac tcagtcagtt gtaattagga ctagtcgtta atcatgacat ggctcataaa26940caactagtca gtttcttgat ttacttggca ctcaggaaca aagtatgttg ccatccctgg27000gcaatagatt tgatcccgtg cgttgagata aagcttgcca aggtcgggtc atgtaactgc27060agaggcactg ggcgtagatt ccagtcccag acataaggaa cagcaagatc ctcaccaacc27120acgcaaatgc cctcagttcc aattgtaact tcaagctgag gagtcttgtg ctcggcggaa27180agctcgaaag gggtaaaaac aggtacaggg tcaaggactg tgcgagctgt ggccttgtat27240ttgttggtgg acttccaaaa tccctcctcc atgaatggtt caatctgctt ggtcacagcc27300tcggagcttg aagtttcctt gtcggacatg agaccccact ggtaaagctt gcagccgtgg27360ccctgagaat ctttaactaa agcgacataa ctctgcgggc ctgcccaaat gtcaagcacc27420gggcccgcct cagggccgaa ctcgacctct cttggtgaaa cctggtcctc gtagttgctt27480ttggcctcgt ccaactggcg agattgcatc ttgccccaca caaagacacg accatccttg27540agcaaggctg cgctgctgtt catgccagca gcaaccttga tggccggtcc aggtagatct27600ctgacctctt gcatgacgaa gaagtcgtca ataccgcgca gaccgattcc gagttgaccg27660cgctgtccct tgccccacgc gaagactttg ccgctcactt tcgtagccac aactccgtgt27720ctgaatccca acgcaacgct ggccacggca tcatcatctt caggaagacc aattgtagtc27780
cttgggtccc agaagtacga gtctgtggtt cccgtcgcgc actgtccata gacattctcg27840ccaaatacaa aaagcgtgtc cgtttccttc gtaatgaaag ctgtcacacc ggcaccacac27900acaacttccc gaataggttc tgtcgagtaa ccctcaaact ttgtctcaag accccttttc27960cgtgagtcct gctcaatatc gtcctcacca aggtccttgt acaccttgta gctcaatacc28020tcctttggct caatcgcatc cacagatgtc tccacaccca tcatccgcat gacatactgc28080atcaccatgc ttgatttcgc atagcgaccc agacgcacag taagccgggt atcgtgggtg28140cgaccaaaga gataaacgcg accttgggcg tcaagaacgg cgctgtggcc aaagcccgct28200gcaagtttta caggctgggc ctgcttggtg tcgaggtcgc cgtggatctg tgtagggctg28260tcagcgttat cgagactacc tgtaccgagg gcaccgttga taccaatgcc tcgagcccat28320acgccgcgaa gggcggttcc ggcagaagag ctaagcatcc gcttggcacc tgttagggag28380cccagcgccg tcatggtggt ggtctgtatg tcaatgtatc tgtagaaagg cagccagcta28440actaaccagc tgtactgtga accacagaag aggcttttgc aaaagatgct cgagagcaaa28500atggatgatc ggtggagatg cggagaagcg cacagcacga tccgagtccg aacttgattg28560aactcaagtt cggagtttgc aatttttcta caactaggta taccttcgta gtatcacgta28620gtaggtggta gtactagtag tcctttgaat tgcggcaggg aatttacgac agcaactctg28680gtaaattaat ttaggacgcc tcttttgtac taaagtcctt ctctttagaa cggaaagaac28740atatgatatt gagacatcat gaggacatgg gaaagggttg tgcatctttg gaactgtatt28800gcccagtatg gctggacttc accttggact tattcataga atgaccacag ctattcctgg28860ggtagatgga ggtctgacaa tgctcgagct aaccctgccc atccatgatc aagacgcacc28920caagcactat ggccgcaagt ttcagttcat ggagagcaga gctgctcaaa tttagcttct28980gcggtcgatt ggtcttggca caaccgctct taagagtcat ctacgacagg ctaccatcca29040ctcaagataa aaatggactc acagatagat agatagatag atagatagat agatagatgg29100caggcgacca atcgcagcgc actctcgctc tcaagatatg cccgcccatc gaaacacggc29160cttctcatgc ggcctgtttc gtctcaagct cgagcaggcg tcggcccatg ctccagcgca29220acgggcccgc aactttcagt ttcgagcttg gtcttgcttt tgagtttgct tttgcttttg29280agtttgagtt tgagtttgag tttgagttca aaattcaaat tcttcaaatt caaattcttc29340gaattcaaac tcaaattgga gaatccatct tttcaaaaac tcaattcacg ctctcgaaga29400agttcaaact ccgcagtcgc atccagctga ggcacgcact ccccatcgca tcgccggcgc29460tctctcctcg ctcctgccgc gtctaagcgt gctcgcgtct ctgtcctgct gctgcttgct29520tgccagtatc tccacttctc gcgagcagaa ggaggacgag cagaagaaga aggaaggatc29580aagaatcatc aagaaggaac actctctttg tttctgtggt tcgtcattag tttgttgtag29640
cttgaaggag aaggagaaga cggagaagat ggagaagaag ggaatgaaca gcagtggcgt29700ttatctgtct ctagctagct aggtacctta cctaccaggt agagttagga ggagaggata29760gccgagacta aggaagcaag ccgtagtttt attttactat gtctgttgtt ctttctctcg29820actaccttct ctcgctaccc ccgtgggaag gaggtctctt gtgtcgagtc tgatccacgt29880ggacgcctcg aggatcttcc ctcgcacccc gggcccggtc gctgccggtg caaaacctcc29940tcagtggcct tgctcgcgct gtgtgctttc gttcctgcgt ctggaacgtc agatagcaga30000taaagagata taagatagtt agttgacgga agcagtcaaa gcaaacctcg aacggattga30060agcgaagcga ggacgctctc gcctctttgc tgactgctcc gcctattgct gctctggccc30120tcactctgag atattactat gtctgaacct gccgcagccg caccgccggc cgagcccaaa30180tcgtcgtggg cggatgaagt cgataatgac acggagggag acgctgtggc cgctctgagc30240gaacatgcgg ctaagttgga cctcgacgtc cacggagctc cagacctgca cagcggtgct30300cttgtagtac gcgaggccgg gtgccccgtg gacgagccca agacgcaggc agtgacaagt30360ttctcagccc ttgcgattga tgacgacctc aagaagtcta tcgcgaacgt caagggctgg30420agcactatgt ctaagatcca gcaaattgga cttccgcttg tgatcagcga ccctccacga30480aaccttatcg ggcaggctca agccggcacg ggtaagaccg gtacctttgt catctctatg30540cttgcaagga tctctgcaga taagaagccc agcacgcctc aggccattat cttggctgta30600actcaggagc tgtgcacgca gattgcacag gaggtcaacg cactgggatc cgacaagggc30660attaaagcac gcagagttat gtctgctagg tccaaaaatg gacccctcgc ggaagggagc30720gcggcggcgc cgtgggcact tagtgaaggt gaagactttg atgagcaggt cgttgtggga30780acacctggaa tggtcaagaa ctacctcaaa aatgccatgg gacgcaagaa gcgcaagccc30840atgatcgatc cgtctgagtg ccgcgttctt attcttgatg aagctgacaa gatggtgcag30900cagccacctc acggatttgg acaggacgtt caggagattc gcgacattat tctcaagaag30960cgcaaggaca agccgtgcca aattttgctc ttttcggcca ccttcaccga aaatgtacga31020cagattgccc gccagttcgt tggtggacat gacatggacg agtccaagta ccacgagatc31080acgctgcgca aggaggatgt cactctcgac aaagtcgtca acttcgttgt ctatattgga31140gacgagaatg agcgcaacga agaggaaatc tataagaaga agtttgaggc cattaatgag31200atctgggaga acctctctca gctcagcgag gggcagtccg ttatcttttg caatcgtaaa31260gatcgtgtac aacgcctcgc ggattatctt cgcgggctaa acttcccggt cggtcagatc31320catggtgaca tggataaggc cgagcgtgac attgtgctca gtgagttcaa gcgcggtgag31380cgcaaggctc tcgtttctac tgatgtcacc tcgcgcggta ttgacaaccc caatgtgact31440ttagttatta atgtcgacct tcctgttaac cgcgagcagg aagctgaccc ggagaatttt31500
gtgcacagga taggccgctc gggacgttgg actaagaagg gtgcttctgt ttctcttgtg31560gctcgcagcc ctgccttccg tgaccttggc ctcatgaagg acattgagcg tgcactcttc31620gctaatgcag aggtaaaccg tccgcttatc cccgtcgatg atctctccaa ccttgagagc31680aagatcattt ctgctcttga agcatacaac taagtgccta cctaccttaa tcagccctta31740tcacttgcat tgcgagcccg ggtttccgca gcgcttgccc tgtgttgcta gagactgggc31800aagctggctc gcctgtctct ttctcgcatt caacaatgca ttcaccgttt ctcctagctg31860cacccgccct ctctcttgcg cccacgacaa gaaaaataca gttcatatca gcatcccccc31920caaaacaacc ataacaatta cgtaaatgaa ggccgtttat tctaccgtgc atcatgagca31980ctgcaccttt tctctcctcc atcgcgcctt ataccgataa acaaaaaata gataacacct32040ttttgtagag caaccaccac cattgtttcc cttccctccc tccncnctcc ctcccaaaat32100aacttgcttt gtttgtacgg cgttccttct atctactttt tctttaatct tcaatcatgt32160ctgacggttc ctttacttat tatgcgttgt tttattcggt cacaaggagg tacagccttg32220atggtcctgc gatagatgcc gtactttatt gtcatatgtt tataactttt aaaaaattaa32280ttttttagta cttatattca aaattcaaaa ttcaaaatat aaaattcaaa attcaaaaat32340tcaaaaattc gaaattcgaa attcgaaatt caatttagat tgtaatctga ttatctttga32400atccgtcacc ttctttttat tattttttaa aataatttat ttttaatgtt tttagttaag32460ctaattttgt aaaaacaatt atattgttat aataacctta tcacctgaat aataagatag32520aaaacgaaga tgcatcctta cctcagcata agaccaaaca gactaaaacg aaacatcttg32580gattgcattt tgtctcgact atatcccatc tcaagagagc aataaaagtt attactgagc32640cttttcaagt cagaaatgtg tagtcgtgtt caaatttgaa ctttagtttt cgctaaataa32700catataagat ctgaattttg caacgactgt gacacaacac tttggttctc aagagaacac32760aagttcttgg ttggccagtg cttgttattc cgtatagtat tttgggataa tggacaagga32820tccaaaccaa gcacaattga gaagcataat tgcaacacca aacctgaaaa gtaactattt32880tgaagacatt accttgtggt gcagtttgat cgatacgaga gcaacgaacg gagcattgag32940gttaagcgag gggagtcaaa gaaagttatg ggacaggcac tcaactccac gatgaatgcc33000atgcatgtat ccaaggctgg ctgctcctct gggtggatgg gtgtcggggc acatgattat33060gtagaggaca aagatgtccc ttctcttgag ccttctgagc atagccaggc accttttcgt33120tgttcttgcg tacaatctcg ggttgtaggc cccaaaagtc acgttgaaaa ggtaatgggc33180tcacgatgtt gtcaaagccc tcgatgtagc gcgggcaaag gcacgcttgc agaactcgac33240gaggtcatgg acaacaaagt ccgaatttct ctagacgttg gcgaagacgt cgatgtcggc33300catgaagtcg gcaaagaaga taagacgagg ggcaaggcga ctcttcatga tggaggtgtt33360
agagacaaac tcagaatcgc tgatggtgtt attggctcta attatgttga tctcaaggcc33420aaagaaagtc atcaagcacc aggctcggtt atccaatcgc agcggcactc tcgagccgag33480aagtaccgcc gagcagacgc tgttcgcgaa gctcttcgcg atttgtcttc tttctcgcca33540agtacttcaa tgaatacttt tcctgactct tcgagccgaa caacacctgc atgctcccct33600gaatcagaaa ctagccttga tgaggagaag gagaatatag ggctggtaaa taacgttcta33660cttgaggaag aacacgttag tcgcccacga tcaatgacgt ttgatgcttc actttcgatg33720acggagctgg aaacccaaaa cgaagtggag cacgctgtgt tgacttcgtc tgtcatgtat33780gcagccgaga aaactctaag ttttattaag gagaattccg gagaattggg caaacatatc33840ggaaccgaag gcggaagtaa tatcaaagac attgttgaag aacatgcaaa tcaaaaatcg33900caagaaagtg ataatgaaat gtttatgagg ttgcttgaag atctgcctac tcaggcccaa33960caagtagttt ccgaaagttt gggaacacct actaccaaac atcattactt ttccagcgcc34020aacacgagca gtggagcatc gcgaagcttg cagtcaggtc gatcaagcac cccaaactgt34080gtcacggtat ctccatgcac agagctgggc tctcctcgtt gcgggcttga ctctgtactt34140ggtaaccaaa ttgatgaaaa acatggtgaa gggcttgacg atcaccatag gatcccgcag34200tttgatctct tacaacatga gcttttacaa gatagcaact ctattacagc acacagagat34260ggtgaaacga cttcgtcccc agttgcctgg gctggagatc ttcaagatga tcttacgcgc34320tctctgttga cagaagttga acatcctttc atctgtcgag aaacaaatat accaccggtc34380cattcaaaag ggaacgaggg tttgagaaca tgcaatggtt cgtcgcatag atctagtctg34440ggagcaattt tgcacgagat tctcgaaacc aagggagact ttcgtaaaaa cggtgaactg34500atcaccgacc tcgacatctt cctaggcgat aaattgccaa aaggcaaaac attttggtcg34560ctcttgacaa gtagcgagct aggtgagctt ggtgaaagag ttgaactcga aataatgagc34620cgccccctcg cgcaccagcc ttaccgagaa tcactctggt gtgttgcatt tcagacaatc34680cagctcactc cctatcgcca aagattggcg ctcagctgtc gcgatagact tttgcctcac34740gagcgggctt taagcgggtt ctccattgct caactaggtc gtgcgtgttt tgtacttcgg34800caaaggctcg tagactgctt ccaccacaac ggcaggataa agttcaaatg ttacaggcga34860acatgcaagt tgctggaagc aaggatgtgg caatgagcct caaaacatag gcttggcaca34920gggtgttgaa gcgcctttct gagacccatg aaactcctag tttgtttgct ttgcatcgct34980ctgtatcaat cgtgccgcat gcaaatgcaa taagctaaca ctcaaatcat ggtacagtct35040tttaatttgg accgagtcta gggcacccga ggcatttcga tgcaaacatc tttctcatca35100aagacttatt taggcgagtt aggcattgga gctcaccttc cctggcaggt cgcctttacg35160tggtaagtta tataagtcaa gaggaaaacc cgagcgacgc tggtctctat aagattgaca35220
gatccctgga ggtgataaag gttgtatcgt acaacttgtt ctacgagaat caaatcttgt35280acgctccaag ccagcagctt gaaattggca gatgagttgt atctgcgtca ggagttatca35340gagagcttac tggactatca aatggtagac atgttgacac tgcgcacctg aaaagctctg35400ccaagcacct ccgctcccca gaaagcctgg tttacatgaa gtgtgatgta gtctgcagtt35460caagatctaa tctcatcaga gagcgcttag tacccattgg tgatctgtca cattttgagg35520ctacgcacag tttggatgac gctcttcgcg ctgtatgcaa cacatccgac gaacgagatg35580aacctacttc caaagactcg tgtgctggtt ggcgcgcggc ctagacctgg tcggggcact35640ggcgcatgct atgagattgc tggacgcgaa aaatgtggcg aagctgtgta cgcagtgaac35700tggggtgcca aatcaatgat tctaagagtg tttgccccaa agtatggctt aaaatgtttc35760aaactaccca agggttcccc gacatgaggc cacatgtggg aagtgtattt gccccccatt35820tgagaagttg ggacagagcg cttcgtcagg gatgatcatg aagcatgttc tatgaacttg35880caccacttgt ttagaacgga agtgtggctg gaatgaaacc tatatgtcag catatctgcg35940ggtaatcccc aactacataa tatttgctgg tatgcttgct ttaagcagca atcaagtttc36000tagcaacagg gtaataacca ggtcaccggt caatcgcaca atggcctttt tagttcggaa36060aatttgacaa cctgtggatg tttggggagt ccatggataa atgtggagct gtttggtgta36120acagaacatt gcaaagggtg acgccttaga tccttttctc atgacaggct tcgatcacaa36180agttgtacac tttcaaggtt gtaggtgcgt attgaacttg gcatttctgg aacaaacaga36240cactatatct cgaatctggg tctgcctgcc cctctagctc aggccctgat agtttgacta36300gagcatcgcc gtctcgtgta ttctctccga atctttctgc acattgagtt agacttctcg36360tcgtgtttgg agcatgtgta aatacatcag cgatattttt ttactcctaa aaatggcaaa36420ttcgcattta cctactgcaa ataatgaatc aaaatgagga aacaatgtgc tatatgaacc36480gtgctctttg gaacacaaat aaaaaataaa taaagtcaaa gatcgtgcca aatccgccca36540acttgagaga aaggcttggc tggtgacctg ccctgttgtg gcatcatcct atcttggctg36600ccgccctcca aagagaaatg tgagcctcgg aagagcgggc taggctggta accaatgaga36660gctatgtaaa tagcaaagga agagagaata aatctttggg aataaacctg tcagcaaggc36720tccaaagctt gctttctggg caaggcttac atgttgcttg atatgatttc acagaagcat36780ttggacacgc caaactctgc tactttgact gtgcctaggt ggtaaaccaa gcaactgcta36840tctttgacgc caccatgcag gtttccatca aaatagagat agaggagaag ttaccatatt36900tgaatccacc aattcttcaa gtgtgtggag acgctcgagt aatgagcata cttgaggaag36960atgctcatgg accttccgtg tgtttttctc ccgaggtatt acacgatatt ttcgtatttg37020caatgttgca gagtcttgat atcgtgtgac agtggaaaca aatgctacag ttgattcctt37080
gatccccttc atcgcaaaga gcttgttatt ctctataata agagctagtt accggcaccg37140tagtcgcttt tgctcagcaa gtggcccttt tccagcatga gataagacct cctaattttg37200gctcgttttc tgattacaaa tgaaggtcct tgccaactac accatggtca cagctttctc37260tgccgagctc agggatgcaa ctgtcggctt agacaccaag tcagcgtcgg ttgcaagtgc37320tgcttctgag agctgactgc tgtagtgtgt gggtttgctc cacctatgag tgggtatgag37380taggtctgct ccacctatga ggaccaccaa gtttgctctc catgtgctac agcgcctgcg37440tctcttgtgc ggtgagacat attttttgag cttggtcttt acgaaatgaa ggcctgcgac37500agacaacgat cgcaacaatt ctgcctcgaa ggcgcttatc cctacgtaga cgtaggtctc37560tgttcccact aaagccactc ctgcgtcaat agaacaaaag caaaagctct tatggctgct37620gtacaaatag agtaaaactt cacctttcta ctcgtaacac tacagttata agtagcaagt37680caatcagagc aagacctttg cgagtaaacc tgcattgctc tatcgcagtc ttccagcatc37740ttcgcgaggc ggtctcgcac aacttcagtc agtctgtaat aacaggagct ttagcaccag37800ccaaagcagt tgcgttgcaa ccagcagaag acttggcatc atgctcattc ccgctgtgga37860cgtggccgtg ggcggtgctg tggcgtcctc tgagaagttt gatctcctca aacgcctgag37920ttggtgcggc ccccttcgca tcatcccttc acttgactct gtctccgcac caagtgtggg37980tgcccctgag gagaaggact tctggaaatc tgctgttcgc aagtggggca aagctttgtg38040ttcgtaccct tgccaagttg gtcccatcgc cgctacaagc gttgaggaag tgacgcaatg38100gctcaacgaa ggcgctgtcc aagtcattgt tgagggttct ttcgacgacc tcgaggacat38160tgcttcgcag cttcctcgtg aacgtcttgt tgccagattt tccgagaagg tccttgaaga38220cgacggtctc ctgagcaaac tttctggcag cgttgggggc gtttcaatta tttctgaggc38280caaaaattct gaagaagtcg tcaaggtcgc agagagggca tggcagcttt tgggaaaacg38340ccttgctatc gcattagagg tccccgagat cgaggccgga ggcgaggcgc agaagattaa38400caaccagctt gttggtaagc tccatggact ccactccaca gactttcctg tgaacgttgt38460gtctgagaac gtttccatgc caacagaagg gtctcttgcg acagatactg actcagaagc38520tgccttttgc gtggcaaggt cttttgtagc gtgccttcgc accgaccgta cagatggtct38580ctttgcgacg gtcgtcaccg atgagaatgg cgtggcactt ggcctcgtgt actccagcga38640acagtctgtg gttgcctcgt tggcgtgtgg ccgcggcgtg tactggtcaa gatccaggca38700gagtctgtgg cgcaagggcg acacaagcgg tgcctttcag gagcttgtgt ctatcgcatt38760tgactgtgat gccgacgcga tgaggttcaa ggtgcgccag cgtggaaacc ctcctgcatt38820ttgccatcaa cagacccgca catgctgggg ttatgacggt ggcatccccc acctctttcg38880cactcttgag tcccgcaagc ttaacgcccc agaaggatca tacacaaaac gtctttttga38940
ggacaaggca ttgctgcgta acaagctcat tgaggaggca caagaggtaa ttgaggctat39000tgaggagaat gacccagagc atgttgcccg cgaggtcgca gacctcgcat acttcctctt39060tgccgcgtgc acgtgcggaa atgcgtcgct cgaggacgtt acacggcagc ttgacatgcg39120ttccctcaag gtcaagcgga ggccaggcaa tgcaaaggca gatcgcatcg ctgctggtga39180ggcagttctc caggctcagc agcagaaaaa gtctgcagag gagcccccag cagctcccaa39240ggaccaggcc taaattgcat gcttattatt acacccaaat cctgcttatt gtgacttgtc39300tgcacccttt tcacattgaa gaagcgtgtt ttcttacccg tcacaccacc actaagtctc39360atcctttctt tcttaccttt ttactagtcc gaacgatata aactttatct ttgcaaggct39420cttgttatac tgcaattgtt atttagtttg ttttctattg ataggcaaac cagacgtaat39480cgtctgagag tgtttgaaga ggataaaaca aagaatcatt aacaggtttt gtgtttctgt39540acacttgaat agttttatgc ctatctactt ctagagcctg ggcggagttg gcatttgtat39600aatctcaaca ttcgataaca aattgcttca aatgaagaac aaaaacagga aatgatttga39660attaaaatct aatatttgta gaaaagaaaa agcgagctga catcattcca tcaaattgac39720caattgactc cttagcacag tagatatttc ctaaacgact tcaactcatt cctcattatc39780ctcgctgttc ctgcttccgt gagtaccctt gctgattcgt acttccaaat cgccgccatc39840ctcccggtca tcatcatctt cgtcatcttc gtcttcatca tcagcccctg acgaggagta39900aatgtcaagg taaggtttgg gattctcgag ctttcgcaat tctccaatac ttattggttg39960gccacagacc ggatcc39976<210>3<211>8994<212>DNA<213>Ulkenia sp.
<400>3atggctcaac gtgagaaccg tctcgaggcc aacatggata cccgcatcgc tgtgatcggc60atgtccgcca tcctcccctg cggtaccacc gttcgtgagt cttgggaggc tatccgcgat120ggtatcgact gcctcagtga tctccccgag gaccgcgtcg atgtgaccgc ctacttcgac180ccggtcaaga ccaccaagga taagatctac tgcaaacgtg gtggattcat ccctgagtac240gacttcgacg cccgtgagtt cggcctcaac atgtttcaga tggaggactc cgacgcaaac300caaaccgtca ccctcctcaa ggtcaaggag gccctcgagg acgctggcat cgaagccctc360agcaaggaaa agaagaacat tggatgtgtt ctcggtatcg gtggtggcca gaagtccagc420cacgagttct actcccgctt aaactatgtt gtcgttgaga aggtccttcg caagatgggc480atgcctgagg aggatgttca agctgctgtt gagaagtaca aggccaactt ccctgagtgg540
cgccttgact ccttccccgg tttcctcggc aacgttactg ccggtcgctg taccaacacc600ttcaacctcg atggtatgaa ctgtgtcgtc gatgctgcct gtgctagttc tctcatcgcc660gttaaggttg ccattgatga gcttctccac ggagactgtg acatgatgat cactggtgct720acctgcacgg ataactccat cggtatgtac atggccttct ccaagacccc ggtgttctct780accgacccta gcgtccgcgc atacgatgag aagaccaagg gtatgcttat tggcgaaggc840tctgccatgc ttgtgcttaa acgttacgcc gacgctgttc gtgatggtga cgagattcac900gctgtcattc gcggctgcgc ctcttcctct gacggtaagg cctccggtat ttacaccccg960accatctctg gtcaagagga ggctcttcgc cgtgcctaca tgcgcgctaa cgtcgatccc1020gccaccgtca ctcttgttga gggccacggt accggtaccc ccgttggtga ccgtattgag1080ctcaccgctc tccgtaacct cttcgacagt gcctacggca acgagaagga gaaggtcgct1140gttggcagca ttaagtccaa catcggtcac ctcaaggctg tcgccggtct tgccggtatg1200atcaaggtca tcatggccct caagcataag actcttccgg ccaccatcaa cgttgatgag1260ccccctaagc tttacgacaa cactcccatc accgactcat cgctgtacat taacacgatg1320aaccgtccgt ggttccctgc tccgggtgtg ccccgtcgcg ctggtatctc cagtttcggt1380tttggtggtg ccaactacca cgccgttctt gaggaagccg agcccgagca ccagaaggct1440taccgtctca acaaacgccc ccagccggtg cttctgatgg catcttcaac ccaggctctt1500gcttccctct gtgaagccca gcttaaggaa ttcgagaagg ctatcgagga gaacaagacc1560gtcaagaaca ctgcttacat caagtgcgtc gacttctgtg agaagttcaa gttccctgga1620tctatcccga gctctaacgc tcgcctcggt tttcttgtca aggaggccga tgatgccacc1680gagaccctcc gtgccatcgt tgcccagttc caaaagtcag ctggcaagga ttcttggcac1740cttccccgcc agggtgtgag ctttcgtgct cagggcatca acaccactgg tggtgtcgct1800gccctcttct ctggccaggg tgctcagtac acccacatgt tcagcgaggt cgccatgaac1860tggcctcagt tccgtgagag catctctgac atggatcgtg cccaggctaa ggttgctggc1920gctgacaagg actacgagcg tgtctcccaa gtcctctacc cgcgtaagcc ttataactct1980gagcccgagc aggaccacaa gaagatctcc ctgacctcat actctcagcc ctctaccctc2040gcctgcgctc ttggtgccta cgagatcttc aagcaggctg gtttcaagcc cgacttcgct2100gccggtcact ctctcggtga gtttgcggcc ctctacgctg ctgactgcgt caaccgtgac2160gacctctttg agctcgtgtg ccgtcgtgcc cgcatcatgg gtggcaagga tgcacctgct2220acccccaagg gatgcatggc tgctgtcatt ggacccaatg ccgagaagat ccagattcgc2280actgctgatg tctggctcgg caactgcaac tccccttcgc agactgtcat caccggctct2340gttgagggta tcaagaagga gtccgagctt ctccagagtg agggcttccg tgttgtcccc2400
ctcgcctgcg agagtgcctt ccactcaccg cagatgcaaa acgcctcctc tgccttcaag2460gatgttctct ccaaggttgc cttccgtcag cctagcgccc agaccaagct cttcagcaac2520gtgtctggcg agacctactc caacaatgcc caggacctcc ttaaggagca catgaccagc2580agtgttaagt tcatctctca ggttcgcaac atgcactctg ctggtgctcg catctttgtc2640gagtttggcc ccaagcaggt gctctctaag cttgtttccg agaccctcaa ggacgatcct2700tccattatca ctatctctgt caacccttcc tctggcaagg atgccgatat tcagcttcgc2760gaggctgctg tgcagctcgt tgttgctgga gtcaaccttc agggcttcga caagtgggac2820gcacctgacg ccacccgcct tcagccgatt aagaagaaga agactactct tcgtctctcg2880gctgccactt acgtgtctga caagaccaag aaggctcgcg aggctgccat gaacgacggc2940cgcatgctca gctgtgtcag caaggtcatc gccccccctg acgccaagcc cattgtggac3000accaaggctc aggaggaggt tgctcgtctc cagaagcagc ttcaggatgc ccaggcccag3060atccagaagg ccaaggccga tgctgctgag gctgacaaga agcttgccgc tgctaaggat3120gaggccaagc gtgccgccgc ttctgcacct gtgcagaagc aggttgacac caccattgtt3180gataagcacc gtgctatcct caagtctatg cttgctgagc ttgactgcta ctccactcct3240ggtgctgtgt ccagctcttt ccaggcacct gttgctgcta cccctgctcc ggtcgctgcg3300cctgttgcag ctgctcctgc tccggctgtc aacaatgctc tccttgccaa ggctgagtct3360gttgtcatgg aggttcttgc cgccaagact ggttacgaga ctgacatgat cgagcccgac3420atggagctcg agactgagct cggcattgac tctatcaagc gtgtcgagat tctctctgag3480gtccaggccc agctcaacgt cgaggccaag gatgttgatg ctcttagccg cacccgcacc3540gtcggtgagg ttgtcaacgc catgaaggct gagatcgctg gcagctctgg tgctgccgct3600gctgccccgg ccccggttgc tgctgctccc gctgcccctg cccctgctgt caacagcgct3660cttcttgcca aggctgagac tgttgtcatg gaggttcttg ccgccaagac tggttacgag3720actgacatga ttgagcccga catggagctc gagactgagc tcggcattga ctccatcaag3780cgtgtcgaga ttctctctga ggttcaggcc cagctcaacg ttgaggccaa ggatgttgat3840gctcttagcc gcacccgcac cgttggtgag gttgtcaacg ccatgaaggc tgagatcgct3900ggcagctctg gtgctgccgc tgctgccccg gcccctgttg ctgctgctcc ggcgcccgtc3960gctgccgctg cccctgctgt cagcagcgct ctccttgaga aggctgagtc tgttgtcatg4020gaggttcttg ccgccaagac tggttacgag actgacatga ttgaggccga catggagctc4080gagactgagc tcggcattga ctccatcaag cgtgtcgaga ttctctctga ggtccaggcc4140cagctcaacg tcgaggccaa ggatgtcgat gctcttagcc gcacccgcac cgttggtgag4200gttgtcaacg ccatgaaggc tgagatcgct ggcagctctg gtgctgctgc cccggccccg4260
gtcgctgcgg cccctgctcc ggtcgctgcc gctgcccctg ctgtcaacag cgctcttctt4320gagaaggctg agactgttgt catggaggtt cttgccgcca agactggtta cgagactgac4380atgatcgagc ccgacatgga gctcgagact gagctcggca ttgactctat caagcgtgtc4440gagattctct ctgaggtcca ggcccagctc aacgttgagg ccaaggatgt tgatgctctt4500agccgcaccc gcaccgttgg tgaggttgtc aacgccatga aggctgagat cgctggcagc4560tctggtgctg ccgctgctgc cccggccccg gttgctgctg ctcccgctcc cgtcgctgcc4620cctgctgtca gcagcgctct ccttgagaag gctgagtctg tcgtcatgga ggttcttgcc4680gccaagactg gttacgagac tgacatgatt gaggccgaca tggagctcga gactgagctc4740ggcattgact ccatcaagcg tgtcgagatt ctctctgagg tccaggccca gctcaacgtt4800gaggccaagg atgtcgatgc tcttagccgc acccgcaccg ttggtgaggt tgtcaacgcc4860atgaaggctg agatcgctgg cagctctggt gctgccgctg ctgccccggc ccctgttgct4920gcctctcccg ctcccgtcgc tgccgctgcc cctgctgtca gcagcgctct ccttgagaag4980gccgaatctg ttgtcatgga ggttctcgcc gccaagactg gttacgagac tgacatgatt5040gaggctgaca tggagctcga gactgagctc ggcattgact ctatcaagcg tgtcgagatt5100ctctctgagg tccaggctat gcttaacgtt gaggccaagg atgttgatgc tcttagccgc5160acccgcaccg ttggtgaggt tgtcaacgcc atgaaggctg agatcgctgg cagctctggt5220gccgccgctg ctgccccggc cccggttgct gctgctccgg cgcccgtcac tgccgctgcc5280cctgctgtca gcagcgctct ccttgagaag gccgaatctg ttgtcatgga ggttctcgcc5340gccaagactg gttacgagac tgacatgatt gaggccgaca tggagctcga gactgagctt5400ggcattgact ccatcaagcg tgtcgagatt ctctctgagg tccaggctat gcttaacgtc5460gaggccaagg atgttgatgc tcttagccgc acccgcaccg ttggtgaggt tgtcaacgcc5520atgaaggctg agattgctag cagctctggt gctgctgccc ctgctccggc tgctgccgtt5580gcaccggccc ctgctgctgc ccctgctgtc agcagcgctc tccttgagaa ggccgaatct5640gttgtcatgg aggttctcgc cgccaagact ggttacgaga ctgacatgat tgaggccgac5700atggagctcg agactgagct cggcattgac tctatcaagc gtgtcgagat tctctctgag5760gtccaggcta tgcttaacgt tgaggccaag gatgttgatg ctcttagccg cacccgcacc5820gttggtgagg ttgtcaacgc catgaaggct gagattgcta gcagctctgg tgctgctgcc5880cctgctcctg ctgctgccgc tgcaccggcc cctgctgctg cccctgctgt cagcagcgct5940cttcttgaga aggctgagtc tgttgtcatg gaggttctcg ccgccaagac tggttacgag6000actgacatga ttgaggccga catggagctc gagactgagc ttggcattga ctccatcaag6060cgtgtcgaga ttctctctga ggtccaggct atgcttaacg ttgaggccaa ggatgttgat6120
gctcttagcc gcacccgcac cgttggtgag gttgtcaacg ccatgaaggc tgagattgct6180agcagctctg gtgctgctgc ccctgctcct gctgctgccg ctgcaccggc ccctgctgct6240gcccctgctg tcagcagcgc tcttcttgag aaggctgagt ctgttgtcat ggaggttctc6300gccgccaaga ctggttacga gactgacatg attgaggccg acatggagct cgagactgag6360cttggcattg actccatcaa gcgtgtcgag attctctctg aggtccaggc tatgcttaac6420gttgaggcca aggatgttga tgctcttagc cgcacccgca ccgttggtga ggttgtcaac6480gccatgaagg ctgagatcgc tggcagctct ggtgctgcta ctgcctctgc ccctgctgct6540gcagctgccg cccctgctat caagatctcc actgttcacg gtgctgactg cgatgacctc6600tctgtgatgt ctgctgagct tgtcgacatt cgtcgcgctg atgagctcct tcttgagcgc6660cctgagaacc gcccggtcct tattgtcgat gatggtaccg agctcacctc tgctctggtt6720cgtgttcttg gtgctggtgc tgtagttctt acctttgacg gtcttcagtt ggctcagcgt6780gctggtgctg ctgttcgcca tgtccaggtg aaggacctct ccgctgagag tgccgagaag6840gctatcaagg aggctgagca acgcttcggc cagcttggag gcttcatctc tcagcaggct6900gagcgctttg cccctgctga cattcttggt ttcaccctca tgtgcgctaa gtttgccaag6960gcttccctct gcacccctgt gcagggtggc cgtgccttct tcattggtgt ggcccgtctt7020gacggtcgcc ttggtttcac ctcccaggga tctactgact ccctcacacg tgcccagcgt7080ggtgctatct tcggcctctg caagaccatt ggccttgagt ggtctgctaa cgaagtgttc7140gcccgcggta ttgatattgc tcgtgaggtc caccctgaag atgctgccgt cgccatcact7200cgcgaaatgt cctgcgctga caaccgtatc cgcgaggtcg gcattggcct caaccagaag7260cgctgcacca tccgtgctgt ggacctcaag ccgggtgccc ccaagatcca gatcagccag7320gatgacgttc tccttgtgtc tggtggtgct cgtggtatta ctcctctctg catccgtgag7380atcacccgtc aggtccgcgg tggtaagtac attctcctcg gtcgctccaa ggtccctgct7440ggtgagcctg cttggtgcaa cggtgtttct gatgacgatc ttggcaaggc tgctatgcag7500gagctgaagc gtgctttctc cgccggtgag ggccccaagc ccaccccgat gacccacaag7560aagctcgttg gcactattgc tggtgcccgt gaggttcgtt cctcaattgc taacattgag7620gctctcggtg gcaaggcaat ctactcctct tgtgatgtga actctgctgc tgatgtcgcc7680aaggctgttc gcgaggctga ggctcagctt ggcgcccgtg taactggtgt cgtccacgct7740tctggtgtcc ttcgtgaccg cctcattgag cagaagcgcc ccgatgagtt tgatgctgtc7800ttcggcacca aggtgactgg tctcgagaac ctctttggtg ccattgacat ggccaacctt7860aagcacctcg tcctcttcag ctctcttgct ggtttccacg gcaacattgg tcagtctgac7920tacgccatgg ctaacgaggc cctcaacaag atgggtcttg agctctctga ccgtgtgtcc7980
gtgaagtcta tttgcttcgg cccctgggat ggtggcatgg ttacccccca gctcaagaag8040cagttccagt ctatgggtgt tcagatcatc ccccgtgagg gtggtgccga tactgtggct8100cgcattgtcc tcggctcctc ccctgctgag atccttgttg gcaactggac cactcccacc8160aagaaggttg gcagtgagcc cgttgtgatc caccgcaaga tcagcgctgc atccaaccct8220tttcttaagg accacgtcat ccagggtcgc tgtgtgctcc ccatgaccat tgctgtgggc8280tgccttgctg agacctgcct gggtcagttc cctggatact ccctctgggc tattgaggat8340gctcaactct tcaagggtgt caccgttgac ggtgatgtca actgtgagat cactctcaag8400ccttcccagg gtactgccgg ccgcgttatg attcaggcca ccctgaagac cttcgctagc8460ggcaagcttg ttccggctta ccgtgccgtg atcgttctct ccactcaggg aaagccccct8520gctgctacta cttcccagac cccctctctc caggctgatc ctgctgcccg tggcaaccct8580tacgacggca agaccctctt ccacggccct gccttccagg gtcttaagga gatcatctct8640tgcaacaagt ctcagcttgt cgccgagtgc accttcattc cgtcttccga gagcgctggt8700gagttcgctt ctgactacga gtcccacaac cctttcgtca acgacattgc tttccaggcc8760atgctcgtct ggattcgccg caccctcggc caggctgccc tccccaactc tatccagcgc8820attgtgcagc accgtgctct tccccaggac aagcccttct acttgaccct caagagcaac8880agcgcgagtg gccactctca gcacaagacc tccgttcagt ttcacaacga gcagggtgac8940ctcttcgtgg acatccaggc ttccgtcacc tcttctgact cccttgcctt ctaa 8994<210>4<211>6093<212>DNA<213>Ulkenia sp.
<400>4atggcctctc gcaagaatgt gagcgctgct cacgaaatgc acgacgagaa gcgcattgcc60gtggtgggca tggccgtgca atacgcgggc tgcaaagaca aggaagagtt ctggaaagta120gtcatgggcg gtgaggctgc atggactaag attagcgata aacgcctcgg atccaacaag180cgagccgagc acttcaaagc agagcgtagc aaatttgcag ataccttttg caacgagaac240tacggctgcg tcgatgactc cgtcgataac gaacacgagc ttctccttaa gctctccaag300aaggctctct ccgagacatc ggtctccgac tctacaaggt gcggtattgt gagcggatgc360ctgtcctttc ccatggacaa cctccagggc gaactcctca atgtgtacca aaaccacgtc420gaaaagaaac tcggcgctcg cgtcttcaag gatgcctcca agtggtccga gcgtgagcag480tcgcagaacc ccgaggctgg tgaccgccgc atctttatgg acccggcatc cttcgtagca540gaagagctca acctcggtcc tcttcactac tctgtcgatg ctgcctgtgc caccgccctt600
tacgtccttc gcctcgccca ggaccacctc gtttccggtg ctgctgatgt catgctcgct660ggtgcaactt gcttcccgga gccctttttc attctctccg gattctccac tttccaggcc720atgcctgtat cgggagacgg catctcgtac ccgcttcaca aggacagtca gggtctcacc780cctggtgaag gtggtgccat tatggttctc aagcgccttg acgacgctat tcgcgatgga840gaccacattt acggtactct gctcggtgct accatcagca atgctggctg tggtcttccc900ctcaagccgc acttgcccag cgagaagtcc tgcctcattg atacctacaa gcgcgtcaac960gtgcacccgc acaagatcca gtacgtcgag tgccacgcaa cgggtactcc ccagggagac1020cgcgttgaga ttgatgccgt caaggcttgc ttcgagggca aggtgcctcg ctttggaagc1080tccaagggta actttggcca cacactcgtt gcagctggtt tcgcaggcat gtgcaaggta1140ctccttgcca tgaagcatgg tgtgatcccg cccactcctg gtgtcgatgg atcttcccaa1200atggacccgc ttgtggtctc tgagcccatc ccatggcccg acactgaggg cgagcccaag1260cgcgctggtc tctccgcttt cggctttggt ggcaccaacg cccacgcagt ctttgaggag1320tttgaccgct ccaaggctgc ctgtgccacc cacgatagca tcagttccct cagctcacgt1380tgtggcgggg agggcaacat gcgcattgct attaccggta tggatgccac cttcggctcc1440ctcaagggcc tggacgcctt tgagcgtgcc atctacaatg gccaacatgg tgctgtgcca1500ttgcctgaga agcgctggcg tttccttggt aaagacaagg actttttgga cctgtgcggt1560gtcaaggagg tgccccacgg atgctacatt gaggacgtcg aggtggactt tagccgcctg1620cgcacgccca tgacgccaga cgacatgttg cgccccatgc agctacttgc tgtcacaacc1680atcgaccgtg ccattctcaa ctctggcctc aagaagggag gtaaggtcgc tgtcttcgtc1740ggccttggca ctgaccttga gctctaccgt caccgcgccc gcgttgccct caaggagcgt1800gctcgtcccg aagccgcttc agccctcaat gatatgatgt cctacatcaa cgattgcggt1860accgctacct cgtacacatc ctacatcggc aacctcgtgg ccacccgcgt gtcttcacaa1920tggggtttcg agggtccttc tttcaccatc acagagggca acaactccgt ctaccgttgc1980gcagagttgg gcaagtactt gctcgagact ggcgaggtcg aggccgtagt gatcgccggt2040gtggatcttt gcgccagcgc tgagaatctc tacgtgaagt cgcgtcgttt caaggtctcg2100gagcaggaga gcccgcgggc cagcttcgac tccggcgctg acggctactt tgttggtgag2160ggatgtggtg ccctcgtcct caagcgcgag agcgactgca ccaaggacga acgcatttac2220gcctgcatgg acgctatcgt gcccggcaac atgccggcag cctgcatgga ggaggctctc2280gcccaggctc gcgtcaaccc caaggacgtt gagatgctcg agctctccgc tgactctgcc2340cgccacctca agaacccctc cgttctgcct aaggaactca ctgctgagga ggaaatccgc2400ggcattgagg ccattctcag ccagcgctct agcaacgaag ctgtggagcc ccacaacgtc2460
gctgtcagca gcgtcaagtc cactgtcggt gacaccggct acgcctcagg agctgccagt2520ctcatcaaga cggctctctg tctgtacaac cgctacttgc cctcaaacgg cgcctcctgg2580gaggagcctg cacctgagac acagtggggc aagtctctgt acgcgtgcca gtcctcgcgg2640gcctggttga agaaccctgg agctcgccgc cacgcagctg tctcaggtgt ttccgagacc2700cgttcatgct acacggtgct gctctctgat gtggagggcc accacgagac caagagccgc2760atttcgctcg atgacgatgc cgtcaaactc ctcgtaatcc gcggagactc ccatgacgct2820atcacgcagc gtgttgacaa gctccgcgag cgcctcgccc agcctagcgc taatgtacgt2880cttgctttta tggagttgct cggcgagagc attgcccagg agaccaagac cccgttgccg2940gccttcgctc tgtgcctggt gacctctcct agtaagctcc agaaggagct tgaactcgcc3000tccaagggca tcccgcggag tcttaagatg ggccgcgact ggacatcacc ctcgggcagc3060cactttgcac ccaagccact gtcaagcgat cgcgttgcgt ttatgtacgg cgaaggccga3120agcccttact atggtatcgg ccttgacatt caccgcatct ggcccgaact tcacgagttt3180gtaaacgcca agaccaacaa gctttgggat caaggcgaca gatggttgat cccgcgcgcc3240tcgacgaagg aggagcttaa ggcgcaggaa gatgagttca accgcaacca ggtggagatg3300ttccgactcg gtattctcat gtccatgtgc ttcacccaca tcgctcgcga cgtgcttggc3360atccagccca aggctgcttt cggactgagc cttggagaga tttccatggt ttttgccttt3420tctgagaaga acggccttgt ctctgaggag ctgacaacta aactccgcaa ctcggaggtc3480tggcgtaagg ccctcgctgt tgagtttgac gccctccgca aggcctggaa tattccccaa3540gatacccctg tcagcgagtt ctggcaagga tacgtggtac gtggaacccg cgaggccgtt3600gaagcggcca tcggccccaa caataagtac gtgcacttga ccattgtcaa cgatgccaac3660agtgctctca tcagtggcaa gcctgaagat tgcaaggctg ccattgctcg cctgagcagc3720aacctccctg ctttgcccgt ggaccttggt atgtgtggcc actgccccgt ggtcgagccg3780tacggcaagc agatcgctga gatccatagc gtcctcgaga ttcccgaggt tgccggcctt3840gacctgtaca cgagcgtcaa ccagaagaag cttgttaaca agtccactgg agccagcgac3900gagtacgcac ccagctttgg tgaatacgca gcacagctgt acactgttca ggcagacttt3960cctaagatcg ccaagaccgt tagcgacaag aactttgacg tctttgttga gactggtccc4020aacgcccacc gtagcgccgc aattcgcgcc acccttggaa atagcaagcc ttttgtcacc4080ggatccatgg accgccagaa cgagaatgct tggacaacca tggtcaagct ggttgcctct4140ctccaagccc accgcgtgcc tggcgtgaag gtctcccctc tgtaccaccc cgagactgtt4200gaggaggcta cgcagagtta caacgatatg gtggctggca agaagcctac taagaacaag4260ttcttgcgta agattgtggt caatggtcgc tatgacccca aaaagcagct cgtgccgccc4320
caggtgctag ctaagcttcc tcctgcggac cccaagatcg aggctcttat ccaggctcgc4380aagatgcagc ctattgcccc caagttcatg gagcgtctcg acattcagga gcaagacgcc4440acacgcgacc ctattctcaa caaggataac aaaccttccg ctgctcctgc ccttgcccct4500gctgctccgg cccgcagcgt ctccggagct gttgtggctt cctctgaggc tctccgtgcc4560aaacttttgg agctcaacag cactttgatg cttggtgtca acgccaacgg tgatctcgtt4620gaagcaagcc caagtgaagc atctattgtt gtgcccaagt gcgatatcaa ggatcttggc4680agccgtgcct tcatggagac atatggtgta tccgccccca tgtacaccgg cgccatggca4740aagggcattg catccgctga gatggttatc gctgccggaa agcgcggcat ccttggttct4800ctcggtgctg gtggtcttcc tatcgccacc gtacgcaagg ctctcgaagc tatccaggct4860gaactgccca agggccctta cgctgtcaac ctcatccact ctcccttcga cagcaacctc4920gagaagggta acgtcgacct cttcctcgag aagggcgtca ctgtcgttga agcctccgcc4980tttatgacct tgaccccgca gctcgtgcgc taccgtgctg caggtctctc tcgcgctgct5040gatggctcca cggttattaa gaaccgcgtc atcggtaagg tttctcgcac agagcttgcc5100gcaatgttta tccgtcccgc gcccgagaat ctcctcgaga agctgctgaa gtccggcgag5160atcacccaag agcaggctgc tctcgcacgc acagtgcctg tggcagacga cattgccgtt5220gaggcggact ccggtggcca caccgataac cgccccatcc acgtcatcct ccctctcatt5280gtcaacctcc gtgatcgtct gcacaaggag tgcggctacc ctgcccacct tcgcgttcgc5340gttggtgctg gtggtggcat tggatgccct caggccgcca ttgccacctt caacatgggc5400gcggccttca tcgtcactgg taccgtaaac cagatgagta agcaagctgg aacctgtgac5460accgttcgca agcagctctc acaagccacc tactccgaca tctgcatggc cccagcagct5520gacatgtttg aggaaggtgt caagctccag gtgctcaaga agggaactat gttcccctcg5580cgtgccaaca agctctatga gctcttcgtc aagtatgact cctttgagtc catggctcct5640ggagagctgg aacgtgtgga gaagcgcatt ttcaagaagt ctctgtcaga agtttgggaa5700gagaccaagg acttctacat caacaggttg cagaacccgg agaagattga gcgcgcggag5760cgtgacccca agcttaagat gtccttgtgc ttccgctggt accttggttt ggcgagcttc5820tgggcaaacg ctggcatccc ggaccgtgcc atggactacc aggtttggtg tggcccagcg5880attggatctt tcaacgactt catcaagggt acctaccttg accccgccgt tgccaacgag5940taccccgatg ttgtgcaaat caacttgcag atcctccgtg gtgcctgctt cttgcgccgc6000ctcgaagctg tccgtaatgc cccgctgaag gctaacgcca agcaggttgc tgccgagatt6060gatgacatct acgtgcccac tgagcgcctg taa 6093
<210>5<211>4398<212>DNA<213>Ulkenia sp.
<400>5atggccactc gcgtgaagac caacaagaaa ccatgctggg agatgaccaa ggaggagctc60accagcggca agaacgtcgt tttcgactat gacgagctcc ttgagttcgc cgagggtgac120atcagcaagg tcttcggccc cgaattcagc cagatcgacc agtacaagcg tcgcgttcgt180ctccccgccc gcgagtacct cctcgtcacc cgcgtcaccc tcatggacgc cgaggtcaac240aactaccgcg tcggtgcccg catggtcact gagtacgacc tccccgtcaa cggtgagctc300tctgagggtg gtgactgccc ctgggccgtg ctcgtcgaga gtggtcagtg tgatctcatg360ctcatctcct acatgggtat tgacttccag aacaagagcg accgcgtcta ccgtctgctc420aacaccaccc tcaccttcta cggtgttgcc caggagggcg agaccctgga gtacgacatc480cgcgtgaccg gcttcgccaa gcgtctcgac ggtgacatct ccatgttctt cttcgagtac540gactgctacg tcaacggccg tctcctcatc gagatgcgcg acggctgtgc cggtttcttc600accaacgagg agctcgccgc cggcaagggt gtcgtcttta cccgcgctga tctcctcgcc660cgcgagaaga ccaagaagca ggacatcacc ccgtacgcca ttgccccgcg tcttaacaag720accgttctca acgagactga gatgcagtcc ctcgtggaca agaactggac caaggttttc780ggccccgaga acggcatgga ccagatcaac tacaaactct gcgcccgtaa gatgctcatg840attgaccgcg tcaccaagat tgactacacc ggtggcccct acggccttgg tcttctcgtt900ggtgagaaga tcctcgagcg cgaccactgg tactttccgt gccacttcgt cggagaccag960gtcatggctg gatccctcgt gtctgacggc tgcagccagc tcctcaagat gtacatgctc1020tggctcggcc tccaccttaa gaccggtccc ttcgacttcc gccccgtcaa cggccacccc1080aacaaggtcc gctgccgtgg ccagatctcc ccgcacaagg gtaagctcgt atacgtcatg1140gagatcaagg agatgggcta cgacgaggct ggtgacccgt acgccatcgc cgatgtcaac1200attctcgaca ttgacttcga gaagggccag actttcgacc ttgccaacct ccacgagtac1260ggcaagggcg acctcaacaa gaagatcgtc gtcgacttca agggtattgc cctcaagctc1320cagaagcgct ctggccctgc cgttgtcgct cccgagaagc ccctcgctct caacaaggac1380ctttgcgccc cggctgttga ggccatccct gagcacatcc tcaagggcga tgctcttgcc1440cctaaccaga tgacctggca cccgatgtcc aagatcgctg gcaaccccac gccctcgttc1500tctccctcgg cctaccctcc ccgtcccatc accttcaccc cgttccccgg caacaagaac1560gacaacaacc acgtgcccgg cgagatgccg ctctcgtggt acaacatggc tgagttcatg1620gccggcaagg tcagcctctg cctcggccct gagttcgcca agttcgatga ctccaacacc1680
agccgcagcc ctgcatggga ccttgctctt gtgactcgtg tggtctccgt ttctgacatg1740gagtgggtcc agtggaagaa cgtggactgc aacccgtcca agggaaccat ggttggcgag1800ttcgactgcc ccatcgacgc ctggttcttc cagggatctt gtaacgacgg ccacatgccg1860tactccatcc tcatggagat cgccctccag acctctggtg tcctcacctc tgtgctcaag1920gccccgctca ccatggagaa gaaggacatt ctcttccgca accttgacgc caacgccgag1980atggttcgct ctgatattga cctccgcggc aagaccatcc acaacctcac caagtgtacc2040ggctacagca tgctcggaga catgggtgtc caccgcttca gcttcgagct ctctgttgat2100ggtgtagtct tctacaaggg taccacctcc ttcggctggt tcgtccctga ggtcttcatc2160tcccagactg gtctcgacaa cggtcgccgc acccagccct ggcacattga gtccaaggtg2220ccttccgccc aggtcctcac ctacgacgtt acccccaacg gtgccggtcg cacccagctc2280tacgccaacg cccccaaggg cgctcagctc actcgccgct ggaaccagtg ccagtacctt2340gacaccatcg accttgtggt cgccggtggc tccgccggtc ttggctacgg tcatggccgc2400aagcaggtga accccaagga ctggttcttc tcgtgccact tctggttcga ctccgtcatg2460cccggctcgc tcggtgtgga gtctatgttc cagctcgtcg agtccatcgc tgtcaagcag2520gacctcgccg gcaagtacgg catcaccaac ccgaccttcg ctcatgctcc gggcaagatc2580tcctggaagt accgtggtca gctcaccccc acctccaagt tcatggactc cgaggcccac2640attgtctcca tcgaggccca cgacggcgtc gtcgacatcg ttgccaatgg taacctctgg2700gctgatggcc tccgcgtcta caacgtcagc aacatccgtg tgcgcattgt tgctggcgcc2760gcccctgctg ctgctgctgc tgctgctgct gttgctgctc cggctgccgc ccctgctccg2820gttgctgcat ctggccctgc ccagaccatc accctcaagc agctcaaggc tgagcttctt2880gacgttgaga agcctctcta catctcctcc agcaacggcc aggtcaagaa gcacgccgat2940gtggctggtg gccaggccac cattgtgcag gcttgcagcc tcagtgacct cggtgatgaa3000ggcttcatga agacctacgg tgttgtggct cctctctaca ccggtgccat ggccaagggt3060attgcctctg ctgaccttgt gattgccact ggtaagcgca agatcctcgg ttccttcggt3120gctggcggtc tccccatgca cattgtccgt gccgctgttg agaagatcca ggctgagctc3180ccgaacggcc ccttcgccgt caacctcatc cactccccct tcgatagcaa ccttgagaag3240ggcaacgttg acctcttcct cgagaagggc gttactgtcg tcgaggcctc cgccttcatg3300accttgaccc cgcaagtcgt ccgctaccgt gctgctggtc tttcccgtaa cgctgatggc3360tccattaaca tcaagaaccg catcatcggt aaggtctccc gtaccgagct cgctgagatg3420ttcatccgcc ctgccccgca gaacctcctc gacaagctca tccagtctgg tgagattacc3480aaggagcagg ctgagcttgc caagctcgtc cccgtcgccg acgacatcgc cgtcgaggcc3540
gactctggtg gccacaccga caaccgcccc atccacgtca tcctccccct tatcatcaac3600ctccgcaacc gcctccacaa ggagtgcggc taccccgctc acctccgcgt gcgcgttgga3660gctggtggtg gtgttggatg cccccaggcc gctgccgctg ctctcgctat gggtgctgcc3720ttccttgtta ccggcactgt caaccaggtc gccaagcagt ccggcacctg cgacaatgtc3780cgcaagcagc tctgcatggc cacctactct gacgtctgca tggctcccgc tgctgacatg3840ttcgaggagg gcgtcaagct ccaggtcctc aagaagggaa ccatgttccc gtccagggct3900aacaagctct acgagctctt ctgcaagtac gactccttcg agtccatgcc tgccacagag3960ctcgagcgtg ttgagaagcg catcttccag tgccctcttg ctgatgtctg ggctgagacc4020tccgacttct acatcaaccg cctccacaac ccggagaaga tcacccgtgc cgagcgtgac4080cccaagctca agatgtctct ctgcttccgc tggtaccttg gtcttgcctc tcgctgggcc4140aacaccggtg aggctggacg cgtcatggac taccaggtct ggtgtggccc tgccattgga4200gccttcaacg acttcatcaa gggctcctac cttgacccgg ccgtctctgg tgagtacccg4260gacgtcgtgc agatcaactt gcagatcctt cgcggtgcct gctacctccg ccgtctcaat4320gtcatccgca acgacccgcg tgtcagcatt gaggtcgagg atgctgagtt cgtctacgag4380cccaccaacg ccctctaa 4398<210>6<211>2997<212>PRT<213>Ulkenia sp.
<400>6Met Ala Gln Arg Glu Asn Arg Leu Glu Ala Asn Met Asp Thr Arg Ile1 5 1015Ala Val Ile Gly Met Ser Ala Ile Leu Pro Cys Gly Thr Thr Val Arg20 25 30Glu Ser Trp Glu Ala Ile Arg Asp Gly Ile Asp Cys Leu Ser Asp Leu35 40 45Pro Glu Asp Arg Val Asp Val Thr Ala Tyr Phe Asp Pro Val Lys Thr5055 60Thr Lys Asp Lys Ile Tyr Cys Lys Arg Gly Gly Phe Ile Pro Glu Tyr65 70 75 80Asp Phe Asp Ala Arg Glu Phe Gly Leu Asn Met Phe Gln Met Glu Asp85 90 95Ser Asp Ala Asn Gln Thr Val Thr Leu Leu Lys Val Lys Glu Ala Leu100 105 110Glu Asp Ala Gly Ile Glu Ala Leu Ser Lys Glu Lys Lys Asn Ile Gly115 120 125
Cys Val Leu Gly Ile Gly Gly Gly Gln Lys Ser Ser His Glu Phe Tyr130 135 140Ser Arg Leu Asn Tyr Val Val Val Glu Lys Val Leu Arg Lys Met Gly145 150 155 160Met Pro Glu Glu Asp Val Gln Ala Ala Val Glu Lys Tyr Lys Ala Asn165 170 175Phe Pro Glu Trp Arg Leu Asp Ser Phe Pro Gly Phe Leu Gly Asn Val180 185 190Thr Ala Gly Arg Cys Thr Asn Thr Phe Asn Leu Asp Gly Met Asn Cys195 200 205Val Val Asp Ala Ala Cys Ala Ser Ser Leu Ile Ala Val Lys Val Ala210 215 220Ile Asp Glu Leu Leu His Gly Asp Cys Asp Met Met Ile Thr Gly Ala225 230 235 240Thr Cys Thr Asp Asn Ser Ile Gly Met Tyr Met Ala Phe Ser Lys Thr245 250 255Pro Val Phe Ser Thr Asp Pro Ser Val Arg Ala Tyr Asp Glu Lys Thr260 265 270Lys Gly Met Leu Ile Gly Glu Gly Ser Ala Met Leu Val Leu Lys Arg275 280 285Tyr Ala Asp Ala Val Arg Asp Gly Asp Glu Ile His Ala Val Ile Arg290 295 300Gly Cys Ala Ser Ser Ser Asp Gly Lys Ala Ser Gly Ile Tyr Thr Pro305 310 315 320Thr Ile Ser Gly Gln Glu Glu Ala Leu Arg Arg Ala Tyr Met Arg Ala325 330 335Asn Val Asp Pro Ala Thr Val Thr Leu Val Glu Gly His Gly Thr Gly340 345 350Thr Pro Val Gly Asp Arg Ile Glu Leu Thr Ala Leu Arg Asn Leu Phe355 360 365Asp Ser Ala Tyr Gly Asn Glu Lys Glu Lys Val Ala Val Gly Ser Ile370 375 380Lys Ser Asn Ile Gly His Leu Lys Ala Val Ala Gly Leu Ala Gly Met385 390 395 400Ile Lys Val Ile Met Ala Leu Lys His Lys Thr Leu Pro Ala Thr Ile405 410 415Asn Val Asp Glu Pro Pro Lys Leu Tyr Asp Asn Thr Pro Ile Thr Asp420 425 430Ser Ser Leu Tyr Ile Asrn Thr Met Asn Arg Pro Trp Phe Pro Ala Pro435440 445Gly Val Pro Arg Arg Ala Gly Ile Ser Ser Phe Gly Phe Gly Gly Ala
450 455 460Asn Tyr His Ala Val Leu Glu Glu Ala Glu Pro Glu His Gln Lys Ala465 470 475 480Tyr Arg Leu Asn Lys Arg Pro Gln Pro Val Leu Leu Met Ala Ser Ser485 490 495Thr Gln Ala Leu Ala Ser Leu Cys Glu Ala Gln Leu Lys Glu Phe Glu500 505 510Lys Ala Ile Glu Glu Asn Lys Thr Val Lys Asn Thr Ala Tyr Ile Lys515 520 525Cys Val Asp Phe Cys Glu Lys Phe Lys Phe Pro Gly Ser Ile Pro Ser530 535 540Ser Asn Ala Arg Leu Gly Phe Leu Val Lys Glu Ala Asp Asp Ala Thr545 550 555 560Glu Thr Leu Arg Ala Ile Val Ala Gln Phe Gln Lys Ser Ala Gly Lys565 570 575Asp Ser Trp His Leu Pro Arg Gln Gly Val Ser Phe Arg Ala Gln Gly580 585 590Ile Asn Thr Thr Gly Gly Val Ala Ala Leu Phe Ser Gly Gln Gly Ala595 600 605Gln Tyr Thr His Met Phe Ser Glu Val Ala Met Asn Trp Pro Gln Phe610 615 620Arg Glu Ser Ile Ser Asp Met Asp Arg Ala Gln Ala Lys Val Ala Gly625 630 635 640Ala Asp Lys Asp Tyr Glu Arg Val Ser Gln Val Leu Tyr Pro Arg Lys645 650 655Pro Tyr Asn Ser Glu Pro Glu Gln Asp His Lys Lys Ile Ser Leu Thr660 665 670Ser Tyr Ser Gln Pro Ser Thr Leu Ala Cys Ala Leu Gly Ala Tyr Glu675 680 685Ile Phe Lys Gln Ala Gly Phe Lys Pro Asp Phe Ala Ala Gly His Ser690 695 700Leu Gly Glu Phe Ala Ala Leu Tyr Ala Ala Asp Cys Val Asn Arg Asp705 710 715 720Asp Leu Phe Glu Leu Val Cys Arg Arg Ala Arg Ile Met Gly Gly Lys725 730 735Asp Ala Pro Ala Thr Pro Lys Gly Cys Met Ala Ala Val Ile Gly Pro740 745 750Asn Ala Glu Lys Ile Gln Ile Arg Thr Ala Asp Val Trp Leu Gly Asn755 760 765Cys Asn Ser Pro Ser Gln Thr Val Ile Thr Gly Ser Val Glu Gly Ile770 775 780
Lys Lys Glu Ser Glu Leu Leu Gln Ser Glu Gly Phe Arg Val Val Pro785 790 795 800Leu Ala Cys Glu Ser Ala Phe His Ser Pro Gln Met Gln Asn Ala Ser805 810 815Ser Ala Phe Lys Asp Val Leu Ser Lys Val Ala Phe Arg Gln Pro Ser820 825 830Ala Gln Thr Lys Leu Phe Ser Asn Val Ser Gly Glu Thr Tyr Ser Asn835 840 845Asn Ala Gln Asp Leu Leu Lys Glu His Met Thr Ser Ser Val Lys Phe850 855 860Ile Ser Gln Val Arg Asn Met His Ser Ala Gly Ala Arg Ile Phe Val865 870 875 880Glu Phe Gly Pro Lys Gln Val Leu Ser Lys Leu Val Ser Glu Thr Leu885 890 895Lys Asp Asp Pro Ser Ile Ile Thr Ile Ser Val Asn Pro Ser Ser Gly900 905 910Lys Asp Ala Asp Ile Gln Leu Arg Glu Ala Ala Val Gln Leu Val Val915 920 925Ala Gly Val Asn Leu Gln Gly Phe Asp Lys Trp Asp Ala Pro Asp Ala930 935 940Thr Arg Leu Gln Pro Ile Lys Lys Lys Lys Thr Thr Leu Arg Leu Ser945 950 955 960Ala Ala Thr Tyr Val Ser Asp Lys Thr Lys Lys Ala Arg Glu Ala Ala965 970 975Met Asn Asp Gly Arg Met Leu Ser Cys Val Ser Lys Val Ile Ala Pro980 985 990Pro Asp Ala Lys Pro Ile Val Asp Thr Lys Ala Gln Glu Glu Val Ala995 1000 1005Arg Leu Gln Lys Gln Leu Gln Asp Ala Gln Ala Gln Ile Gln Lys1010 1015 1020Ala Lys Ala Asp Ala Ala Glu Ala Asp Lys Lys Leu Ala Ala Ala1025 1030 1035Lys Asp Glu Ala Lys Arg Ala Ala Ala Ser Ala Pro Val Gln Lys1040 1045 1050Gln Val Asp Thr Thr Ile Val Asp Lys His Arg Ala Ile Leu Lys1055 1060 1065Ser Met Leu Ala Glu Leu Asp Cys Tyr Ser Thr Pro Gly Ala Val1070 1075 1080Ser Ser Ser Phe Gln Ala Pro Val Ala Ala Thr Pro Ala Pro Val1085 1090 1095Ala Ala Pro Val Ala Ala Ala Pro Ala Pro Ala Val Asn Asn Ala
1100 1105 1110Leu Leu Ala Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala1115 1120 1125Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro Asp Met Glu Leu1130 1135 1140Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu1145 1150 1155Ser Glu Val Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp1160 1165 1170Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala Met1175 1180 1185Lys Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro1190 1195 1200Ala Pro Val Ala Ala Ala Pro Ala Ala Pro Ala Pro Ala Val Asn1205 1210 1215Ser Ala Leu Leu Ala Lys Ala Glu Thr Val Val Met Glu Val Leu1220 1225 1230Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro Asp Met1235 1240 1245Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu1250 1255 1260Ile Leu Ser Glu Val Gln Ala Gln Leu Asn Val Glu Ala Lys Asp1265 1270 1275Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn1280 1285 1290Ala Met Lys Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala Ala Ala1295 1300 1305Ala Pro Ala Pro Val Ala Ala Ala Pro Ala Pro Val Ala Ala Ala1310 1315 1320Ala Pro Ala Val Ser Ser Ala Leu Leu Glu Lys Ala Glu Ser Val1325 1330 1335Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met1340 1345 1350Ile Glu Ala Asp Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser1355 1360 1365Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala Gln Leu Asn1370 1375 1380Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val1385 1390 1395Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala Gly Ser Ser1400 1405 1410
Gly Ala Ala Ala Pro Ala Pro Val Ala Ala Ala Pro Ala Pro Val1415 1420 1425Ala Ala Ala Ala Pro Ala Val Asn Ser Ala Leu Leu Glu Lys Ala1430 1435 1440Glu Thr Val Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu1445 1450 1455Thr Asp Met Ile Glu Pro Asp Met Glu Leu Glu Thr Glu Leu Gly1460 1465 1470Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala1475 1480 1485Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr1490 1495 1500Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala1505 1510 1515Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro Ala Pro Val Ala Ala1520 1525 1530Ala Pro Ala Pro Val Ala Ala Pro Ala Val Ser Ser Ala Leu Leu1535 1540 1545Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys Thr1550 1555 1560Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr1565 1570 1575Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu1580 1585 1590Val Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu1595 1600 1605Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala1610 1615 1620Glu Ile Ala Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro Ala Pro1625 1630 1635Val Ala Ala Ser Pro Ala Pro Val Ala Ala Ala Ala Pro Ala Val1640 1645 1650Ser Ser Ala Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val1655 1660 1665Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp1670 1675 1680Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val1685 1690 1695Glu Ile Leu Ser Glu Val Gln Ala Met Leu Asn Val Glu Ala Lys1700 1705 1710Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val
1715 1720 1725Asn Ala Met Lys Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala Ala1730 1735 1740Ala Ala Pro Ala Pro Val Ala Ala Ala Pro Ala Pro Val Thr Ala1745 1750 1755Ala Ala Pro Ala Val Ser Ser Ala Leu Leu Glu Lys Ala Glu Ser1760 1765 1770Val Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp1775 1780 1785Met Ile Glu Ala Asp Met Glu Leu Glu Thr Glu Leu Gly Ile Asp1790 1795 1800Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala Met Leu1805 1810 1815Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg Thr1820 1825 1830Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala Ser Ser1835 1840 1845Ser Gly Ala Ala Ala Pro Ala Pro Ala Ala Ala Val Ala Pro Ala1850 1855 1860Pro Ala Ala Ala Pro Ala Val Ser Ser Ala Leu Leu Glu Lys Ala1865 1870 1875Glu Ser Val Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu1880 1885 1890Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr Glu Leu Gly1895 1900 1905Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala1910 1915 1920Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr1925 1930 1935Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala1940 1945 1950Ser Ser Ser Gly Ala Ala Ala Pro Ala Pro Ala Ala Ala Ala Ala1955 1960 1965Pro Ala Pro Ala Ala Ala Pro Ala Val Ser Ser Ala Leu Leu Glu1970 1975 1980Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys Thr Gly1985 1990 1995Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr Glu2000 2005 2010Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val2015 2020 2025
Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser2030 2035 2040Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu2045 2050 2055Ile Ala Ser Ser Ser Gly Ala Ala Ala Pro Ala Pro Ala Ala Ala2060 2065 2070Ala Ala Pro Ala Pro Ala Ala Ala Pro Ala Val Ser Ser Ala Leu2075 2080 2085Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys2090 2095 2100Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu2105 2110 2115Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser2120 2125 2130Glu Val Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala2135 2140 2145Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala Met Lys2150 2155 2160Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala Thr Ala Ser Ala Pro2165 2170 2175Ala Ala Ala Ala Ala Ala Pro Ala Ile Lys Ile Ser Thr Val His2180 2185 2190Gly Ala Asp Cys Asp Asp Leu Ser Val Met Ser Ala Glu Leu Val2195 2200 2205Asp Ile Arg Arg Ala Asp Glu Leu Leu Leu Glu Arg Pro Glu Asn2210 2215 2220Arg Pro Val Leu Ile Val Asp Asp Gly Thr Glu Leu Thr Ser Ala2225 2230 2235Leu Val Arg Val Leu Gly Ala Gly Ala Val Val Leu Thr Phe Asp2240 2245 2250Gly Leu Gln Leu Ala Gln Arg Ala Gly Ala Ala Val Arg His Val2255 2260 2265Gln Val Lys Asp Leu Ser Ala Glu Ser Ala Glu Lys Ala Ile Lys2270 2275 2280Glu Ala Glu Gln Arg Phe Gly Gln Leu Gly Gly Phe Ile Ser Gln2285 2290 2295Gln Ala Glu Arg Phe Ala Pro Ala Asp Ile Leu Gly Phe Thr Leu2300 2305 2310Met Cys Ala Lys Phe Ala Lys Ala Ser Leu Cys Thr Pro Val Gln2315 2320 2325Gly Gly Arg Ala Phe Phe Ile Gly Val Ala Arg Leu Asp Gly Arg
2330 2335 2340Leu Gly Phe Thr Ser Gln Gly Ser Thr Asp Ser Leu Thr Arg Ala2345 2350 2355Gln Arg Gly Ala Ile Phe Gly Leu Cys Lys Thr Ile Gly Leu Glu2360 2365 2370Trp Ser Ala Asn Glu Val Phe Ala Arg Gly Ile Asp Ile Ala Arg2375 2380 2385Glu Val His Pro Glu Asp Ala Ala Val Ala Ile Thr Arg Glu Met2390 2395 2400Ser Cys Ala Asp Asn Arg Ile Arg Glu Val Gly Ile Gly Leu Asn2405 2410 2415Gln Lys Arg Cys Thr Ile Arg Ala Val Asp Leu Lys Pro Gly Ala2420 2425 2430Pro Lys Ile Gln Ile Ser Gln Asp Asp Val Leu Leu Val Ser Gly2435 2440 2445Gly Ala Arg Gly Ile Thr Pro Leu Cys Ile Arg Glu Ile Thr Arg2450 2455 2460Gln Val Arg Gly Gly Lys Tyr Ile Leu Leu Gly Arg Ser Lys Val2465 2470 2475Pro Ala Gly Glu Pro Ala Trp Cys Asn Gly Val Ser Asp Asp Asp2480 2485 2490Leu Gly Lys Ala Ala Met Gln Glu Leu Lys Arg Ala Phe Ser Ala2495 2500 2505Gly Glu Gly Pro Lys Pro Thr Pro Met Thr His Lys Lys Leu Val2510 2515 2520Gly Thr Ile Ala Gly Ala Arg Glu Val Arg Ser Ser Ile Ala Asn2525 2530 2535Ile Glu Ala Leu Gly Gly Lys Ala Ile Tyr Ser Ser Cys Asp Val2540 2545 2550Asn Ser Ala Ala Asp Val Ala Lys Ala Val Arg Glu Ala Glu Ala2555 2560 2565Gln Leu Gly Ala Arg Val Thr Gly Val Val His Ala Ser Gly Val2570 2575 2580Leu Arg Asp Arg Leu Ile Glu Gln Lys Arg Pro Asp Glu Phe Asp2585 2590 2595Ala Val Phe Gly Thr Lys Val Thr Gly Leu Glu Asn Leu Phe Gly2600 2605 2610Ala Ile Asp Met Ala Asn Leu Lys His Leu Val Leu Phe Ser Ser2615 2620 2625Leu Ala Gly Phe His Gly Asn Ile Gly Gln Ser Asp Tyr Ala Met2630 2635 2640
Ala Asn Glu Ala Leu Asn Lys Met Gly Leu Glu Leu Ser Asp Arg2645 2650 2655Val Ser Val Lys Ser Ile Cys Phe Gly Pro Trp Asp Gly Gly Met2660 2665 2670Val Thr Pro Gln Leu Lys Lys Gln Phe Gln Ser Met Gly Val Gln2675 2680 2685Ile Ile Pro Arg Glu Gly Gly Ala Asp Thr Val Ala Arg Ile Val2690 2695 2700Leu Gly Ser Ser Pro Ala Glu Ile Leu Val Gly Asn Trp Thr Thr2705 2710 2715Pro Thr Lys Lys Val Gly Ser Glu Pro Val Val Ile His Arg Lys2720 2725 2730Ile Ser Ala Ala Ser Asn Pro Phe Leu Lys Asp His Val Ile Gln2735 2740 2745Gly Arg Cys Val Leu Pro Met Thr Ile Ala Val Gly Cys Leu Ala2750 2755 2760Glu Thr Cys Leu Gly Gln Phe Pro Gly Tyr Ser Leu Trp Ala Ile2765 2770 2775Glu Asp Ala Gln Leu Phe Lys Gly Val Thr Val Asp Gly Asp Val2780 2785 2790Asn Cys Glu Ile Thr Leu Lys Pro Ser Gln Gly Thr Ala Gly Arg2795 2800 2805Val Met Ile Gln Ala Thr Leu Lys Thr Phe Ala Ser Gly Lys Leu2810 2815 2820Val Pro Ala Tyr Arg Ala Val Ile Val Leu Ser Thr Gln Gly Lys2825 2830 2835Pro Pro Ala Ala Thr Thr Ser Gln Thr Pro Ser Leu Gln Ala Asp2840 2845 2850Pro Ala Ala Arg Gly Asn Pro Tyr Asp Gly Lys Thr Leu Phe His2855 2860 2865Gly Pro Ala Phe Gln Gly Leu Lys Glu Ile Ile Ser Cys Asn Lys2870 2875 2880Ser Gln Leu Val Ala Glu Cys Thr Phe Ile Pro Ser Ser Glu Ser2885 2890 2895Ala Gly Glu Phe Ala Ser Asp Tyr Glu Ser His Asn Pro Phe Val2900 2905 2910Asn Asp Ile Ala Phe Gln Ala Met Leu Val Trp Ile Arg Arg Thr2915 2920 2925Leu Gly Gln Ala Ala Leu Pro Asn Ser Ile Gln Arg Ile Val Gln2930 2935 2940His Arg Ala Leu Pro Gln Asp Lys Pro Phe Tyr Leu Thr Leu Lys
2945 2950 2955Ser Asn Ser Ala Ser Gly His Ser Gln His Lys Thr Ser Val Gln2960 2965 2970Phe His Asn Glu Gln Gly Asp Leu Phe Val Asp Ile Gln Ala Ser2975 2980 2985Val Thr Ser Ser Asp Ser Leu Ala Phe2990 2995<210>7<211>2030<212>PRT<213>Ulkenia sp.
<400>7Met Ala Ser Arg Lys Asn Val Ser Ala Ala His Glu Met His Asp Glu1 510 15Lys Arg Ile Ala Val Val Gly Met Ala Val Gln Tyr Ala Gly Cys Lys20 25 30Asp Lys Glu Glu Phe Trp Lys Val Val Met Gly Gly Glu Ala Ala Trp35 40 45Thr Lys Ile Ser Asp Lys Arg Leu Gly Ser Asn Lys Arg Ala Glu His50 55 60Phe Lys Ala Glu Arg Ser Lys Phe Ala Asp Thr Phe Cys Asn Glu Asn65 70 75 80Tyr Gly Cys Val Asp Asp Ser Val Asp Asn Glu His Glu Leu Leu Leu85 90 95Lys Leu Ser Lys Lys Ala Leu Ser Glu Thr Ser Val Ser Asp Ser Thr100 105 110Arg Cys Gly Ile Val Ser Gly Cys Leu Ser Phe Pro Met Asp Asn Leu115 120 125Gln Gly Glu Leu Leu Asn Val Tyr Gln Asn His Val Glu Lys Lys Leu130 135 140Gly Ala Arg Val Phe Lys Asp Ala Ser Lys Trp Ser Glu Arg Glu Gln145 150 155 160Ser Gln Asn Pro Glu Ala Gly Asp Arg Arg Ile Phe Met Asp Pro Ala165 170 175Ser Phe Val Ala Glu Glu Leu Asn Leu Gly Pro Leu His Tyr Ser Val180 185 190Asp Ala Ala Cys Ala Thr Ala Leu Tyr Val Leu Arg Leu Ala Gln Asp195 200 205His Leu Val Ser Gly Ala Ala Asp Val Met Leu Ala Gly Ala Thr Cys210 215 220Phe Pro Glu Pro Phe Phe Ile Leu Ser Gly Phe Ser Thr Phe Gln Ala
225 230 235 240Met Pro Val Ser Gly Asp Gly Ile Ser Tyr Pro Leu His Lys Asp Ser245 250 255Gln Gly Leu Thr Pro Gly Glu Gly Gly Ala Ile Met Val Leu Lys Arg260 265 270Leu Asp Asp Ala Ile Arg Asp Gly Asp His Ile Tyr Gly Thr Leu Leu275 280 285Gly Ala Thr Ile Ser Asn Ala Gly Cys Gly Leu Pro Leu Lys Pro His290 295 300Leu Pro Ser Glu Lys Ser Cys Leu Ile Asp Thr Tyr Lys Arg Val Asn305 310 315 320Val His Pro His Lys Ile Gln Tyr Val Glu Cys His Ala Thr Gly Thr325 330 335Pro Gln Gly Asp Arg Val Glu Ile Asp Ala Val Lys Ala Cys Phe Glu340 345 350Gly Lys Val Pro Arg Phe Gly Ser Ser Lys Gly Asn Phe Gly His Thr355 360 365Leu Val Ala Ala Gly Phe Ala Gly Met Cys Lys Val Leu Leu Ala Met370 375 380Lys His Gly Val Ile Pro Pro Thr Pro Gly Val Asp Gly Ser Ser Gln385 390 395 400Met Asp Pro Leu Val Val Ser Glu Pro Ile Pro Trp Pro Asp Thr Glu405 410 415Gly Glu Pro Lys Arg Ala Gly Leu Ser Ala Phe Gly Phe Gly Gly Thr420 425 430Asn Ala His Ala Val Phe Glu Glu Phe Asp Arg Ser Lys Ala Ala Cys435 440 445Ala Thr His Asp Ser Ile Ser Ser Leu Ser Ser Arg Cys Gly Gly Glu450 455 460Gly Asn Met Arg Ile Ala Ile Thr Gly Met Asp Ala Thr Phe Gly Ser465 470 475 480Leu Lys Gly Leu Asp Ala Phe Glu Arg Ala Ile Tyr Asn Gly Gln His485 490 495Gly Ala Val Pro Leu Pro Glu Lys Arg Trp Arg Phe Leu Gly Lys Asp500 505 510Lys Asp Phe Leu Asp Leu Cys Gly Val Lys Glu Val Pro His Gly Cys515 520 525Tyr Ile Glu Asp Val Glu Val Asp Phe Ser Arg Leu Arg Thr Pro Met530 535 540Thr Pro Asp Asp Met Leu Arg Pro Met Gln Leu Leu Ala Val Thr Thr545 550 555 560
Ile Asp Arg Ala Ile Leu Asn Ser Gly Leu Lys Lys Gly Gly Lys Val565 570 575Ala Val Phe Val Gly Leu Gly Thr Asp Leu Glu Leu Tyr Arg His Arg580 585 590Ala Arg Val Ala Leu Lys Glu Arg Ala Arg Pro Glu Ala Ala Ser Ala595 600 605Leu Asn Asp Met Met Ser Tyr Ile Asn Asp Cys Gly Thr Ala Thr Ser610 615 620Tyr Thr Ser Tyr Ile Gly Asn Leu Val Ala Thr Arg Val Ser Ser Gln625 630 635 640Trp Gly Phe Glu Gly Pro Ser Phe Thr Ile Thr Glu Gly Asn Asn Ser645 650 655Val Tyr Arg Cys Ala Glu Leu Gly Lys Tyr Leu Leu Glu Thr Gly Glu660 665 670Val Glu Ala Val Val Ile Ala Gly Val Asp Leu Cys Ala Ser Ala Glu675 680 685Asn Leu Tyr Val Lys Ser Arg Arg Phe Lys Val Ser Glu Gln Glu Ser690 695 700Pro Arg Ala Ser Phe Asp Ser Gly Ala Asp Gly Tyr Phe Val Gly Glu705 710 715 720Gly Cys Gly Ala Leu Val Leu Lys Arg Glu Ser Asp Cys Thr Lys Asp725 730 735Glu Arg Ile Tyr Ala Cys Met Asp Ala Ile Val Pro Gly Asn Met Pro740 745 750Ala Ala Cys Met Glu Glu Ala Leu Ala Gln Ala Arg Val Asn Pro Lys755 760 765Asp Val Glu Met Leu Glu Leu Ser Ala Asp Ser Ala Arg His Leu Lys770 775 780Asn Pro Ser Val Leu Pro Lys Glu Leu Thr Ala Glu Glu Glu Ile Arg785 790 795 800Gly Ile Glu Ala Ile Leu Ser Gln Arg Ser Ser Asn Glu Ala Val Glu805 810 815Pro His Asn Val Ala Val Ser Ser Val Lys Ser Thr Val Gly Asp Thr820 825 830Gly Tyr Ala Ser Gly Ala Ala Ser Leu Ile Lys Thr Ala Leu Cys Leu835 840 845Tyr Asn Arg Tyr Leu Pro Ser Asn Gly Ala Ser Trp Glu Glu Pro Ala850 855 860Pro Glu Thr Gln Trp Gly Lys Ser Leu Tyr Ala Cys Gln Ser Ser Arg865 870 875 880Ala Trp Leu Lys Asn Pro Gly Ala Arg Arg His Ala Ala Val Ser Gly
885 890 895Val Ser Glu Thr Arg Ser Cys Tyr Thr Val Leu Leu Ser Asp Val Glu900 905 910Gly His His Glu Thr Lys Ser Arg Ile Ser Leu Asp Asp Asp Ala Val915 920 925Lys Leu Leu Val Ile Arg Gly Asp Ser His Asp Ala Ile Thr Gln Arg930 935 940Val Asp Lys Leu Arg Glu Arg Leu Ala Gln Pro Ser Ala Asn Val Arg945 950 955 960Leu Ala Phe Met Glu Leu Leu Gly Glu Ser Ile Ala Gln Glu Thr Lys965 970 975Thr Pro Leu Pro Ala Phe Ala Leu Cys Leu Val Thr Ser Pro Ser Lys980 985 990Leu Gln Lys Glu Leu Glu Leu Ala Ser Lys Gly Ile Pro Arg Ser Leu995 1000 1005Lys Met Gly Arg Asp Trp Thr Ser Pro Ser Gly Ser His Phe Ala1010 1015 1020Pro Lys Pro Leu Ser Ser Asp Arg Val Ala Phe Met Tyr Gly Glu1025 1030 1035Gly Arg Ser Pro Tyr Tyr Gly Ile Gly Leu Asp Ile His Arg Ile1040 1045 1050Trp Pro Glu Leu His Glu Phe Val Asn Ala Lys Thr Asn Lys Leu1055 1060 1065Trp Asp Gln Gly Asp Arg Trp Leu Ile Pro Arg Ala Ser Thr Lys1070 1075 1080Glu Glu Leu Lys Ala Gln Glu Asp Glu Phe Asn Arg Asn Gln Val1085 1090 1095Glu Met Phe Arg Leu Gly Ile Leu Met Ser Met Cys Phe Thr His1100 1105 1110Ile Ala Arg Asp Val Leu Gly Ile Gln Pro Lys Ala Ala Phe Gly1115 1120 1125Leu Ser Leu Gly Glu Ile Ser Met Val Phe Ala Phe Ser Glu Lys1130 1135 1140Asn Gly Leu Val Ser Glu Glu Leu Thr Thr Lys Leu Arg Asn Ser1145 1150 1155Glu Val Trp Arg Lys Ala Leu Ala Val Glu Phe Asp Ala Leu Arg1160 1165 1170Lys Ala Trp Asn Ile Pro Gln Asp Thr Pro Val Ser Glu Phe Trp1175 1180 1185Gln Gly Tyr Val Val Arg Gly Thr Arg Glu Ala Val Glu Ala Ala1190 1195 1200
Ile Gly Pro Asn Asn Lys Tyr Val His Leu Thr Ile Val Asn Asp1205 1210 1215Ala Asn Ser Ala Leu Ile Ser Gly Lys Pro Glu Asp Cys Lys Ala1220 1225 1230Ala Ile Ala Arg Leu Ser Ser Asn Leu Pro Ala Leu Pro Val Asp1235 1240 1245Leu Gly Met Cys Gly His Cys Pro Val Val Glu Pro Tyr Gly Lys1250 1255 1260Gln Ile Ala Glu Ile His Ser Val Leu Glu Ile Pro Glu Val Ala1265 1270 1275Gly Leu Asp Leu Tyr Thr Ser Val Asn Gln Lys Lys Leu Val Asn1280 1285 1290Lys Ser Thr Gly Ala Ser Asp Glu Tyr Ala Pro Ser Phe Gly Glu1295 1300 1305Tyr Ala Ala Gln Leu Tyr Thr Val Gln Ala Asp Phe Pro Lys Ile1310 1315 1320Ala Lys Thr Val Ser Asp Lys Asn Phe Asp Val Phe Val Glu Thr1325 1330 1335Gly Pro Asn Ala His Arg Ser Ala Ala Ile Arg Ala Thr Leu Gly1340 1345 1350Asn Ser Lys Pro Phe Val Thr Gly Ser Met Asp Arg Gln Asn Glu1355 1360 1365Asn Ala Trp Thr Thr Met Val Lys Leu Val Ala Ser Leu Gln Ala1370 1375 1380His Arg Val Pro Gly Val Lys Val Ser Pro Leu Tyr His Pro Glu1385 1390 1395Thr Val Glu Glu Ala Thr Gln Ser Tyr Asn Asp Met Val Ala Gly1400 1405 1410Lys Lys Pro Thr Lys Asn Lys Phe Leu Arg Lys Ile Val Val Asn1415 1420 1425Gly Arg Tyr Asp Pro Lys Lys Gln Leu Val Pro Pro Gln Val Leu1430 1435 1440Ala Lys Leu Pro Pro Ala Asp Pro Lys Ile Glu Ala Leu Ile Gln1445 1450 1455Ala Arg Lys Met Gln Pro Ile Ala Pro Lys Phe Met Glu Arg Leu1460 1465 1470Asp Ile Gln Glu Gln Asp Ala Thr Arg Asp Pro Ile Leu Asn Lys1475 1480 1485Asp Asn Lys Pro Ser Ala Ala Pro Ala Leu Ala Pro Ala Ala Pro1490 1495 1500Ala Arg Ser Val Ser Gly Ala Val Val Ala Ser Ser Glu Ala Leu
1505 1510 1515Arq Ala Lys Leu Leu Glu Leu Asn Ser Thr Leu Met Leu Gly Val1520 1525 1530Asn Ala Asn Gly Asp Leu Val Glu Ala Ser Pro Ser Glu Ala Ser1535 1540 1545Ile Val Val Pro Lys Cys Asp Ile Lys Asp Leu Gly Ser Arg Ala1550 1555 1560Phe Met Glu Thr Tyr Gly Val Ser Ala Pro Met Tyr Thr Gly Ala1565 1570 1575Met Ala Lys Gly Ile Ala Ser Ala Glu Met Val Ile Ala Ala Gly1580 1585 1590Lys Arg Gly Ile Leu Gly Ser Leu Gly Ala Gly Gly Leu Pro Ile1595 1600 1605Ala Thr Val Arg Lys Ala Leu Glu Ala Ile Gln Ala Glu Leu Pro1610 1615 1620Lys Gly Pro Tyr Ala Val Asn Leu Ile His Ser Pro Phe Asp Ser1625 1630 1635Asn Leu Glu Lys Gly Asn Val Asp Leu Phe Leu Glu Lys Gly Val1640 1645 1650Thr Val Val Glu Ala Ser Ala Phe Met Thr Leu Thr Pro Gln Leu1655 1660 1665Val Arg Tyr Arg Ala Ala Gly Leu Ser Arg Ala Ala Asp Gly Ser1670 1675 1680Thr Val Ile Lys Asn Arg Val Ile Gly Lys Val Ser Arg Thr Glu1685 1690 1695Leu Ala Ala Met Phe Ile Arg Pro Ala Pro Glu Asn Leu Leu Glu1700 1705 1710Lys Leu Leu Lys Ser Gly Glu Ile Thr Gln Glu Gln Ala Ala Leu1715 1720 1725Ala Arg Thr Val Pro Val Ala Asp Asp Ile Ala Val Glu Ala Asp1730 1735 1740Ser Gly Gly His Thr Asp Asn Arg Pro Ile His Val Ile Leu Pro1745 1750 1755Leu Ile Val Asn Leu Arg Asp Arg Leu His Lys Glu Cys Gly Tyr1760 1765 1770Pro Ala His Leu Arg Val Arg Val Gly Ala Gly Gly Gly Ile Gly1775 1780 1785Cys Pro Gln Ala Ala Ile Ala Thr Phe Asn Met Gly Ala Ala Phe1790 1795 1800Ile Val Thr Gly Thr Val Asn Gln Met Ser Lys Gln Ala Gly Thr1805 1810 1815
Cys Asp Thr Val Arg Lys Gln Leu Ser Gln Ala Thr Tyr Ser Asp1820 1825 1830Ile Cys Met Ala Pro Ala Ala Asp Met Phe Glu Glu Gly Val Lys1835 1840 1845Leu Gln Val Leu Lys Lys Gly Thr Met Phe Pro Ser Arg Ala Asn1850 1855 1860Lys Leu Tyr Glu Leu Phe Val Lys Tyr Asp Ser Phe Glu Ser Met1865 1870 1875Ala Pro Gly Glu Leu Glu Arg Val Glu Lys Arg Ile Phe Lys Lys1880 1885 1890Ser Leu Ser Glu Val Trp Glu Glu Thr Lys Asp Phe Tyr Ile Asn1895 1900 1905Arg Leu Gln Asn Pro Glu Lys Ile Glu Arg Ala Glu Arg Asp Pro1910 1915 1920Lys Leu Lys Met Ser Leu Cys Phe Arg Trp Tyr Leu Gly Leu Ala1925 1930 1935Ser Phe Trp Ala Asn Ala Gly Ile Pro Asp Arg Ala Met Asp Tyr1940 1945 1950Gln Val Trp Cys Gly Pro Ala Ile Gly Ser Phe Asn Asp Phe Ile1955 1960 1965Lys Gly Thr Tyr Leu Asp Pro Ala Val Ala Asn Glu Tyr Pro Asp1970 1975 1980Val Val Gln Ile Asn Leu Gln Ile Leu Arg Gly Ala Cys Phe Leu1985 1990 1995Arg Arg Leu Glu Ala Val Arg Asn Ala Pro Leu Lys Ala Asn Ala2000 2005 2010Lys Gln Val Ala Ala Glu Ile Asp Asp Ile Tyr Val Pro Thr Glu2015 2020 2025Arg Leu2030<210>8<211>1465<212>PRT<213>Ulkenia sp.
<400>8Met Ala Thr Arg Val Lys Thr Asn Lys Lys Pro Cys Trp Glu Met Thr1 510 15Lys Glu Glu Leu Thr Ser Gly Lys Asn Val Val Phe Asp Tyr Asp Glu20 25 30Leu Leu Glu Phe Ala Glu Gly Asp Ile Ser Lys Val Phe Gly Pro Glu35 40 45
Phe Ser Gln Ile Asp Gln Tyr Lys Arg Arg Val Arg Leu Pro Ala Arg50 5560Glu Tyr Leu Leu Val Thr Arg Val Thr Leu Met Asp Ala Glu Val Asn65 70 75 80Asn Tyr Arg Val Gly Ala Arg Met Val Thr Glu Tyr Asp Leu Pro Val85 90 95Asn Gly Glu Leu Ser Glu Gly Gly Asp Cys Pro Trp Ala Val Leu Val100 105 110Glu Ser Gly Gln Cys Asp Leu Met Leu Ile Ser Tyr Met Gly Ile Asp115 120 125Phe Gln Asn Lys Ser Asp Arg Val Tyr Arg Leu Leu Asn Thr Thr Leu130 135 140Thr Phe Tyr Gly Val Ala Gln Glu Gly Glu Thr Leu Glu Tyr Asp Ile145 150 155 160Arg Val Thr Gly Phe Ala Lys Arg Leu Asp Gly Asp Ile Ser Met Phe165 170 175Phe Phe Glu Tyr Asp Cys Tyr Val Asn Gly Arg Leu Leu Ile Glu Met180 185 190Arg Asp Gly Cys Ala Gly Phe Phe Thr Asn Glu Glu Leu Ala Ala Gly195 200 205Lys Gly Val Val Phe Thr Arg Ala Asp Leu Leu Ala Arg Glu Lys Thr210 215 220Lys Lys Gln Asp Ile Thr Pro Tyr Ala Ile Ala Pro Arg Leu Asn Lys225 230 235 240Thr Val Leu Asn Glu Thr Glu Met Gln Ser Leu Val Asp Lys Asn Trp245 250 255Thr Lys Val Phe Gly Pro Glu Asn Gly Met Asp Gln Ile Asn Tyr Lys260 265 270Leu Cys Ala Arg Lys Met Leu Met Ile Asp Arg Val Thr Lys Ile Asp275 280 285Tyr Thr Gly Gly Pro Tyr Gly Leu Gly Leu Leu Val Gly Glu Lys Ile290 295 300Leu Glu Arg Asp His Trp Tyr Phe Pro Cys His Phe Val Gly Asp Gln305 310 315 320Val Met Ala Gly Ser Leu Val Ser Asp Gly Cys Ser Gln Leu Leu Lys325 330 335Met Tyr Met Leu Trp Leu Gly Leu His Leu Lys Thr Gly Pro Phe Asp340 345 350Phe Arg Pro Val Asn Gly His Pro Asn Lys Val Arg Cys Arg Gly Gln355 360 365Ile Ser Pro His Lys Gly Lys Leu Val Tyr Val Met Glu Ile Lys Glu
370 375 380Met Gly Tyr Asp Glu Ala Gly Asp Pro Tyr Ala Ile Ala Asp Val Asn385 390 395 400Ile Leu Asp Ile Asp Phe Glu Lys Gly Gln Thr Phe Asp Leu Ala Asn405 410 415Leu His Glu Tyr Gly Lys Gly Asp Leu Asn Lys Lys Ile Val Val Asp420 425 430Phe Lys Gly Ile Ala Leu Lys Leu Gln Lys Arg Ser Gly Pro Ala Val435 440 445Val Ala Pro Glu Lys Pro Leu Ala Leu Asn Lys Asp Leu Cys Ala Pro450 455 460Ala Val Glu Ala Ile Pro Glu His Ile Leu Lys Gly Asp Ala Leu Ala465 470 475 480Pro Asn Gln Met Thr Trp His Pro Met Ser Lys Ile Ala Gly Asn Pro485 490 495Thr Pro Ser Phe Ser Pro Ser Ala Tyr Pro Pro Arg Pro Ile Thr Phe500 505 510Thr Pro Phe Pro Gly Asn Lys Asn Asp Asn Asn His Val Pro Gly Glu515 520 525Met Pro Leu Ser Trp Tyr Asn Met Ala Glu Phe Met Ala Gly Lys Val530 535 540Ser Leu Cys Leu Gly Pro Glu Phe Ala Lys Phe Asp Asp Ser Asn Thr545 550 555 560Ser Arg Ser Pro Ala Trp Asp Leu Ala Leu Val Thr Arg Val Val Ser565 570 575Val Ser Asp Met Glu Trp Val Gln Trp Lys Asn Val Asp Cys Asn Pro580 585 590Ser Lys Gly Thr Met Val Gly Glu Phe Asp Cys Pro Ile Asp Ala Trp595 600 605Phe Phe Gln Gly Ser Cys Asn Asp Gly His Met Pro Tyr Ser Ile Leu610 615 620Met Glu Ile Ala Leu Gln Thr Ser Gly Val Leu Thr Ser Val Leu Lys625 630 635 640Ala Pro Leu Thr Met Glu Lys Lys Asp Ile Leu Phe Arg Asn Leu Asp645 650 655Ala Asn Ala Glu Met Val Arg Ser Asp Ile Asp Leu Arg Gly Lys Thr660 665 670Ile His Asn Leu Thr Lys Cys Thr Gly Tyr Ser Met Leu Gly Asp Met675 680 685Gly Val His Arg Phe Ser Phe Glu Leu Ser Val Asp Gly Val Val Phe690 695 700
Tyr Lys Gly Thr Thr Ser Phe Gly Trp Phe Val Pro Glu Val Phe Ile705 710 715 720Ser Gln Thr Gly Leu Asp Asn Gly Arg Arg Thr Gln Pro Trp His Ile725 730 735Glu Ser Lys Val Pro Ser Ala Gln Val Leu Thr Tyr Asp Val Thr Pro740 745 750Asn Gly Ala Gly Arg Thr Gln Leu Tyr Ala Asn Ala Pro Lys Gly Ala755 760 765Gln Leu Thr Arg Arg Trp Asn Gln Cys Gln Tyr Leu Asp Thr Ile Asp770 775 780Leu Val Val Ala Gly Gly Ser Ala Gly Leu Gly Tyr Gly His Gly Arg785 790 795 800Lys Gln Val Asn Pro Lys Asp Trp Phe Phe Ser Cys His Phe Trp Phe805 810 815Asp Ser Val Met Pro Gly Ser Leu Gly Val Glu Ser Met Phe Gln Leu820 825 830Val Glu Ser Ile Ala Val Lys Gln Asp Leu Ala Gly Lys Tyr Gly Ile835 840 845Thr Asn Pro Thr Phe Ala His Ala Pro Gly Lys Ile Ser Trp Lys Tyr850 855 860Arg Gly Gln Leu Thr Pro Thr Ser Lys Phe Met Asp Ser Glu Ala His865 870 875 880Ile Val Ser Ile Glu Ala His Asp Gly Val Val Asp Ile Val Ala Asn885 890 895Gly Asn Leu Trp Ala Asp Gly Leu Arg Val Tyr Asn Val Ser Asn Ile900 905 910Arg Val Arg Ile Val Ala Gly Ala Ala Pro Ala Ala Ala Ala Ala Ala915 920 925Ala Ala Val Ala Ala Pro Ala Ala Ala Pro Ala Pro Val Ala Ala Ser930 935 940Gly Pro Ala Gln Thr Ile Thr Leu Lys Gln Leu Lys Ala Glu Leu Leu945 950 955 960Asp Val Glu Lys Pro Leu Tyr Ile Ser Ser Ser Asn Gly Gln Val Lys965 970 975Lys His Ala Asp Val Ala Gly Gly Gln Ala Thr Ile Val Gln Ala Cys980 985 990Ser Leu Ser Asp Leu Gly Asp Glu Gly Phe Met Lys Thr Tyr Gly Val995 1000 1005Val Ala Pro Leu Tyr Thr Gly Ala Met Ala Lys Gly Ile Ala Ser1010 1015 1020Ala Asp Leu Val Ile Ala Thr Gly Lys Arg Lys Ile Leu Gly Ser
1025 1030 1035Phe Gly Ala Gly Gly Leu Pro Met His Ile Val Arg Ala Ala Val1040 1045 1050Glu Lys Ile Gln Ala Glu Leu Pro Asn Gly Pro Phe Ala Val Asn1055 1060 1065Leu Ile His Ser Pro Phe Asp Ser Asn Leu Glu Lys Gly Asn Val1070 1075 1080Asp Leu Phe Leu Glu Lys Gly Val Thr Val Val Glu Ala Ser Ala1085 1090 1095Phe Met Thr Leu Thr Pro Gln Val Val Arg Tyr Arg Ala Ala Gly1100 1105 1110Leu Ser Arg Asn Ala Asp Gly Ser Ile Asn Ile Lys Asn Arg Ile1115 1120 1125Ile Gly Lys Val Ser Arg Thr Glu Leu Ala Glu Met Phe Ile Arg1130 1135 1140Pro Ala Pro Gln Asn Leu Leu Asp Lys Leu Ile Gln Ser Gly Glu1145 1150 1155Ile Thr Lys Glu Gln Ala Glu Leu Ala Lys Leu Val Pro Val Ala1160 1165 1170Asp Asp Ile Ala Val Glu Ala Asp Ser Gly Gly His Thr Asp Asn1175 1180 1185Arg Pro Ile His Val Ile Leu Pro Leu Ile Ile Asn Leu Arg Asn1190 1195 1200Arg Leu His Lys Glu Cys Gly Tyr Pro Ala His Leu Arg Val Arg1205 1210 1215Val Gly Ala Gly Gly Gly Val Gly Cys Pro Gln Ala Ala Ala Ala1220 1225 1230Ala Leu Ala Met Gly Ala Ala Phe Leu Val Thr Gly Thr Val Asn1235 1240 1245Gln Val Ala Lys Gln Ser Gly Thr Cys Asp Asn Val Arg Lys Gln1250 1255 1260Leu Cys Met Ala Thr Tyr Ser Asp Val Cys Met Ala Pro Ala Ala1265 1270 1275Asp Met Phe Glu Glu Gly Val Lys Leu Gln Val Leu Lys Lys Gly1280 1285 1290Thr Met Phe Pro Ser Arg Ala Asn Lys Leu Tyr Glu Leu Phe Cys1295 1300 1305Lys Tyr Asp Ser Phe Glu Ser Met Pro Ala Thr Glu Leu Glu Arg1310 1315 1320Val Glu Lys Arg Ile Phe Gln Cys Pro Leu Ala Asp Val Trp Ala1325 1330 1335
Glu Thr Ser Asp Phe Tyr Ile Asn Arg Leu His Asn Pro Glu Lys1340 1345 1350Ile Thr Arg Ala Glu Arg Asp Pro Lys Leu Lys Met Ser Leu Cys1355 1360 1365Phe Arg Trp Tyr Leu Gly Leu Ala Ser Arg Trp Ala Asn Thr Gly1370 1375 1380Glu Ala Gly Arg Val Met Asp Tyr Gln Val Trp Cys Gly Pro Ala1385 1390 1395Ile Gly Ala Phe Asn Asp Phe Ile Lys Gly Ser Tyr Leu Asp Pro1400 1405 1410Ala Val Ser Gly Glu Tyr Pro Asp Val Val Gln Ile Asn Leu Gln1415 1420 1425Ile Leu Arg Gly Ala Cys Tyr Leu Arg Arg Leu Asn Val Ile Arg1430 1435 1440Asn Asp Pro Arg Val Ser Ile Glu Val Glu Asp Ala Glu Phe Val1445 1450 1455Tyr Glu Pro Thr Asn Ala Leu1460 1465<210>9<211>5547<212>DNA<213>Ulkenia sp.
<400>9atgcttgtga taggggctct ggcgcgggct ctgtacggtg cttggagatg cacgggcagg60gcgagagagg ggacgggttc ccgggaggcg ctgcttggag gtgctgagag ggagggagaa120ggcgtgcttt gcgatgcgcg gggcgaccta ggcgctgctg cgcggtgcag cagcagggac180ctcggacgtg agtcgaagcc gtctgcagag gagatggtag aagggccgcg gattggtagc240agagaagagg aaatagaaga agaagaagaa atagaagaag aagaaataga agaagaagaa300atagaagaag aagaggagga cgggcaggcg ggaaagatgg agaaaggact cgcggcggga360aaacaagaga atgtgaactt gggcttgaac tttggtttga atttgaatgt ggagaacgag420gggttgaatt tgagtttgaa tttgaaagaa aacttacgga aagaaagttt agttgaaagt480gagaaagaaa aaaatgagaa agaaaaagag aaagaaaaag agaaagaaaa agagaaagaa540aaagagaaag aaaaagagaa agaaaaagag aaagaaaaag agaaagaaaa agagaaagaa600aaagagaaag aaaaagagaa agaaaaagag aaagaaaaag aagaagaaaa agaagaagaa660aaagagaaag aaaaagagaa agaaaaagag aaagaaaaag aagaaggaga tttaaaaagt720tgtttagttg aaaaaggaga aggaggaaga agcagcgaca gcggcagaag aagaagtagt780tgttgtaaga ggggaacgga ggcagtagca gtggagcagg cggaggcgac agcaaacctc840
gaactcgacc ccgtcgagcc gcagcaagaa caagagcccg accaggtgga cgaggacgag900gtccgcttgt tgtcaggaac aacagaagtt gcaggactag ccgagagtgc taccactgca960attcttagat ccacagacgc aagagcagaa aacttacaac tgctcgccac aacacaagaa1020ccaccttcag atacaaccag gttcgagaac tccacaagtc tagaagcagc aacagctcta1080gcagataatc aaacaggtcc agaaaaagct acgactagaa gagaaattat cgagtcgcaa1140cttgcaacca tggccactcg cgtgaagacc aacaagaaac catgctggga gatgaccaag1200gaggagctca ccagcggcaa gaacgtcgtt ttcgactatg acgagctcct tgagttcgcc1260gagggtgaca tcagcaaggt cttcggcccc gaattcagcc agatcgacca gtacaagcgt1320cgcgttcgtc tccccgcccg cgagtacctc ctcgtcaccc gcgtcaccct catggacgcc1380gaggtcaaca actaccgcgt cggtgcccgc atggtcactg agtacgacct ccccgtcaac1440ggtgagctct ctgagggtgg tgactgcccc tgggccgtgc tcgtcgagag tggtcagtgt1500gatctcatgc tcatctccta catgggtatt gacttccaga acaagagcga ccgcgtctac1560cgtctgctca acaccaccct caccttctac ggtgttgccc aggagggcga gaccctggag1620tacgacatcc gcgtgaccgg cttcgccaag cgtctcgacg gtgacatctc catgttcttc1680ttcgagtacg actgctacgt caacggccgt ctcctcatcg agatgcgcga cggctgtgcc1740ggtttcttca ccaacgagga gctcgccgcc ggcaagggtg tcgtctttac ccgcgctgat1800ctcctcgccc gcgagaagac caagaagcag gacatcaccc cgtacgccat tgccccgcgt1860cttaacaaga ccgttctcaa cgagactgag atgcagtccc tcgtggacaa gaactggacc1920aaggttttcg gccccgagaa cggcatggac cagatcaact acaaactctg cgcccgtaag1980atgctcatga ttgaccgcgt caccaagatt gactacaccg gtggccccta cggccttggt2040cttctcgttg gtgagaagat cctcgagcgc gaccactggt actttccgtg ccacttcgtc2100ggagaccagg tcatggctgg atccctcgtg tctgacggct gcagccagct cctcaagatg2160tacatgctct ggctcggcct ccaccttaag accggtccct tcgacttccg ccccgtcaac2220ggccacccca acaaggtccg ctgccgtggc cagatctccc cgcacaaggg taagctcgta2280tacgtcatgg agatcaagga gatgggctac gacgaggctg gtgacccgta cgccatcgcc2340gatgtcaaca ttctcgacat tgacttcgag aagggccaga ctttcgacct tgccaacctc2400cacgagtacg gcaagggcga cctcaacaag aagatcgtcg tcgacttcaa gggtattgcc2460ctcaagctcc agaagcgctc tggccctgcc gttgtcgctc ccgagaagcc cctcgctctc2520aacaaggacc tttgcgcccc ggctgttgag gccatccctg agcacatcct caagggcgat2580gctcttgccc ctaaccagat gacctggcac ccgatgtcca agatcgctgg caaccccacg2640ccctcgttct ctccctcggc ctaccctccc cgtcccatca ccttcacccc gttccccggc2700
aacaagaacg acaacaacca cgtgcccggc gagatgccgc tctcgtggta caacatggct2760gagttcatgg ccggcaaggt cagcctctgc ctcggccctg agttcgccaa gttcgatgac2820tccaacacca gccgcagccc tgcatgggac cttgctcttg tgactcgtgt ggtctccgtt2880tctgacatgg agtgggtcca gtggaagaac gtggactgca acccgtccaa gggaaccatg2940gttggcgagt tcgactgccc catcgacgcc tggttcttcc agggatcttg taacgacggc3000cacatgccgt actccatcct catggagatc gccctccaga cctctggtgt cctcacctct3060gtgctcaagg ccccgctcac catggagaag aaggacattc tcttccgcaa ccttgacgcc3120aacgccgaga tggttcgctc tgatattgac ctccgcggca agaccatcca caacctcacc3180aagtgtaccg gctacagcat gctcggagac atgggtgtcc accgcttcag cttcgagctc3240tctgttgatg gtgtagtctt ctacaagggt accacctcct tcggctggtt cgtccctgag3300gtcttcatct cccagactgg tctcgacaac ggtcgccgca cccagccctg gcacattgag3360tccaaggtgc cttccgccca ggtcctcacc tacgacgtta cccccaacgg tgccggtcgc3420acccagctct acgccaacgc ccccaagggc gctcagctca ctcgccgctg gaaccagtgc3480cagtaccttg acaccatcga ccttgtggtc gccggtggct ccgccggtct tggctacggt3540catggccgca agcaggtgaa ccccaaggac tggttcttct cgtgccactt ctggttcgac3600tccgtcatgc ccggctcgct cggtgtggag tctatgttcc agctcgtcga gtccatcgct3660gtcaagcagg acctcgccgg caagtacggc atcaccaacc cgaccttcgc tcatgctccg3720ggcaagatct cctggaagta ccgtggtcag ctcaccccca cctccaagtt catggactcc3780gaggcccaca ttgtctccat cgaggcccac gacggcgtcg tcgacatcgt tgccaatggt3840aacctctggg ctgatggcct ccgcgtctac aacgtcagca acatccgtgt gcgcattgtt3900gctggcgccg cccctgctgc tgctgctgct gctgctgctg ttgctgctcc ggctgccgcc3960cctgctccgg ttgctgcatc tggccctgcc cagaccatca ccctcaagca gctcaaggct4020gagcttcttg acgttgagaa gcctctctac atctcctcca gcaacggcca ggtcaagaag4080cacgccgatg tggctggtgg ccaggccacc attgtgcagg cttgcagcct cagtgacctc4140ggtgatgaag gcttcatgaa gacctacggt gttgtggctc ctctctacac cggtgccatg4200gccaagggta ttgcctctgc tgaccttgtg attgccactg gtaagcgcaa gatcctcggt4260tccttcggtg ctggcggtct ccccatgcac attgtccgtg ccgctgttga gaagatccag4320gctgagctcc cgaacggccc cttcgccgtc aacctcatcc actccccctt cgatagcaac4380cttgagaagg gcaacgttga cctcttcctc gagaagggcg ttactgtcgt cgaggcctcc4440gccttcatga ccttgacccc gcaagtcgtc cgctaccgtg ctgctggtct ttcccgtaac4500gctgatggct ccattaacat caagaaccgc atcatcggta aggtctcccg taccgagctc4560
gctgagatgt tcatccgccc tgccccgcag aacctcctcg acaagctcat ccagtctggt4620gagattacca aggagcaggc tgagcttgcc aagctcgtcc ccgtcgccga cgacatcgcc4680gtcgaggccg actctggtgg ccacaccgac aaccgcccca tccacgtcat cctccccctt4740atcatcaacc tccgcaaccg cctccacaag gagtgcggct accccgctca cctccgcgtg4800cgcgttggag ctggtggtgg tgttggatgc ccccaggccg ctgccgctgc tctcgctatg4860ggtgctgcct tccttgttac cggcactgtc aaccaggtcg ccaagcagtc cggcacctgc4920gacaatgtcc gcaagcagct ctgcatggcc acctactctg acgtctgcat ggctcccgct4980gctgacatgt tcgaggaggg cgtcaagctc caggtcctca agaagggaac catgttcccg5040tccagggcta acaagctcta cgagctcttc tgcaagtacg actccttcga gtccatgcct5100gccacagagc tcgagcgtgt tgagaagcgc atcttccagt gccctcttgc tgatgtctgg5160gctgagacct ccgacttcta catcaaccgc ctccacaacc cggagaagat cacccgtgcc5220gagcgtgacc ccaagctcaa gatgtctctc tgcttccgct ggtaccttgg tcttgcctct5280cgctgggcca acaccggtga ggctggacgc gtcatggact accaggtctg gtgtggccct5340gccattggag ccttcaacga cttcatcaag ggctcctacc ttgacccggc cgtctctggt5400gagtacccgg acgtcgtgca gatcaacttg cagatccttc gcggtgcctg ctacctccgc5460cgtctcaatg tcatccgcaa cgacccgcgt gtcagcattg aggtcgagga tgctgagttc5520gtctacgagc ccaccaacgc cctctaa5547<210>10<211>837<212>DNA<213>Ulkania sp.
<400>10acccgcatcg ctgtgatcgg catgtccgcc atcctcccct gcggtaccac cgttcgtgag60tcttgggagg ctatccgcga tggtatcgac tgcctcagtg atctccccga ggaccgcgtc120gatgtgaccg cctacttcga cccggtcaag accaccaagg ataagatcta ctgcaaacgt180ggtggattca tccctgagta cgacttcgac gcccgtgagt tcggcctcaa catgtttcag240atggaggact ccgacgcaaa ccaaaccgtc accctcctca aggtcaagga ggccctcgag300gacgctggca tcgaagccct cagcaaggaa aagaagaaca ttggatgtgt tctcggtatc360ggtggtggcc agaagtccag ccacgagttc tactcccgct taaactatgt tgtcgttgag420aaggtccttc gcaagatggg catgcctgag gaggatgttc aagctgctgt tgagaagtac480aaggccaact tccctgagtg gcgccttgac tccttccccg gtttcctcgg caacgttact540gccggtcgct gtaccaacac cttcaacctc gatggtatga actgtgtcgt cgatgctgcc600
tgtgctagtt ctctcatcgc cgttaaggtt gccattgatg agcttctcca cggagactgt660gacatgatga tcactggtgc tacctgcacg gataactcca tcggtatgta catggccttc720tccaagaccc cggtgttctc taccgaccct agcgtccgcg catacgatga gaagaccaag780ggtatgctta ttggcgaagg ctctgccatg cttgtgctta aacgttacgc cgacgct 837<210>11<211>51<212>DNA<213>Ulkenia sp.
<400>11ggtatgaact gtgtcgtcga tgctgcctgt gctagttctc tcatcgccgtt 51<210>12<211>12<212>DNA<213>Ulkenia sp.
<400>12gatgctgcct gt12<210>13<211>522<212>DNA<213>Ulkenia sp.
<400>13cacgctgtca ttcgcggctg cgcctcttcc tctgacggta aggcctccgg tatttacacc60ccgaccatct ctggtcaaga ggaggctctt cgccgtgcct acatgcgcgc taacgtcgat120cccgccaccg tcactcttgt tgagggccac ggtaccggta cccccgttgg tgaccgtatt180gagctcaccg ctctccgtaa cctcttcgac agtgcctacg gcaacgagaa ggagaaggtc240gctgttggca gcattaagtc caacatcggt cacctcaagg ctgtcgccgg tcttgccggt300atgatcaagg tcatcatggc cctcaagcat aagactcttc cggccaccat caacgttgat360gagcccccta agctttacga caacactccc atcaccgact catcgctgta cattaacacg420atgaaccgtc cgtggttccc tgctccgggt gtgccccgtc gcgctggtat ctccagtttc480ggttttggtg gtgccaacta ccacgccgtt cttgaggaag cc 522<210>14<211>1380<212>DNA<213>Ulkenia sp.
<400>14acccgcatcg ctgtgatcgg catgtccgcc atcctcccct gcggtaccac cgttcgtgag60tcttgggagg ctatccgcga tggtatcgac tgcctcagtg atctccccga ggaccgcgtc120
gatgtgaccg cctacttcga cccggtcaag accaccaagg ataagatcta ctgcaaacgt180ggtggattca tccctgagta cgacttcgac gcccgtgagt tcggcctcaa catgtttcag240atggaggact ccgacgcaaa ccaaaccgtc accctcctca aggtcaagga ggccctcgag300gacgctggca tcgaagccct cagcaaggaa aagaagaaca ttggatgtgt tctcggtatc360ggtggtggcc agaagtccag ccacgagttc tactcccgct taaactatgt tgtcgttgag420aaggtccttc gcaagatggg catgcctgag gaggatgttc aagctgctgt tgagaagtac480aaggccaact tccctgagtg gcgccttgac tccttccccg gtttcctcgg caacgttact540gccggtcgct gtaccaacac cttcaacctc gatggtatga actgtgtcgt cgatgctgcc600tgtgctagtt ctctcatcgc cgttaaggtt gccattgatg agcttctcca cggagactgt660gacatgatga tcactggtgc tacctgcacg gataactcca tcggtatgta catggccttc720tccaagaccc cggtgttctc taccgaccct agcgtccgcg catacgatga gaagaccaag780ggtatgctta ttggcgaagg ctctgccatg cttgtgctta aacgttacgc cgacgctgtt840cgtgatggtg acgagattca cgctgtcatt cgcggctgcg cctcttcctc tgacggtaag900gcctccggta tttacacccc gaccatctct ggtcaagagg aggctcttcg ccgtgcctac960atgcgcgcta acgtcgatcc cgccaccgtc actcttgttg agggccacgg taccggtacc1020cccgttggtg accgtattga gctcaccgct ctccgtaacc tcttcgacag tgcctacggc1080aacgagaagg agaaggtcgc tgttggcagc attaagtcca acatcggtca cctcaaggct1140gtcgccggtc ttgccggtat gatcaaggtc atcatggccc tcaagcataa gactcttccg1200gccaccatca acgttgatga gccccctaag ctttacgaca acactcccat caccgactca1260tcgctgtaca ttaacacgat gaaccgtccg tggttccctg ctccgggtgt gccccgtcgc1320gctggtatct ccagtttcgg ttttggtggt gccaactacc acgccgttct tgaggaagcc1380<210>15<211>996<212>DNA<213>Ulkenia sp.
<400>15ctcttctctg gccagggtgc tcagtacacc cacatgttca gcgaggtcgc catgaactgg60cctcagttcc gtgagagcat ctctgacatg gatcgtgccc aggctaaggt tgctggcgct120gacaaggact acgagcgtgt ctcccaagtc ctctacccgc gtaagcctta taactctgag180cccgagcagg accacaagaa gatctccctg acctcatact ctcagccctc taccctcgcc240tgcgctcttg gtgcctacga gatcttcaag caggctggtt tcaagcccga cttcgctgcc300ggtcactctc tcggtgagtt tgcggccctc tacgctgctg actgcgtcaa ccgtgacgac360
ctctttgagc tcgtgtgccg tcgtgcccgc atcatgggtg gcaaggatgc acctgctacc420cccaagggat gcatggctgc tgtcattgga cccaatgccg agaagatcca gattcgcact480gctgatgtct ggctcggcaa ctgcaactcc ccttcgcaga ctgtcatcac cggctctgtt540gagggtatca agaaggagtc cgagcttctc cagagtgagg gcttccgtgt tgtccccctc600gcctgcgaga gtgccttcca ctcaccgcag atgcaaaacg cctcctctgc cttcaaggat660gttctctcca aggttgcctt ccgtcagcct agcgcccaga ccaagctctt cagcaacgtg720tctggcgaga cctactccaa caatgcccag gacctcctta aggagcacat gaccagcagt780gttaagttca tctctcaggt tcgcaacatg cactctgctg gtgctcgcat ctttgtcgag840tttggcccca agcaggtgct ctctaagctt gtttccgaga ccctcaagga cgatccttcc900attatcacta tctctgtcaa cccttcctct ggcaaggatg ccgatattca gcttcgcgag960gctgctgtgc agctcgttgt tgctggagtc aacctt 996<210>16<211>3510<212>DNA<213>Ulkenia sp.
<400>16gcccaggccc agatccagaa ggccaaggcc gatgctgctg aggctgacaa gaagcttgcc60gctgctaagg atgaggccaa gcgtgccgcc gcttctgcac ctgtgcagaa gcaggttgac120accaccattg ttgataagca ccgtgctatc ctcaagtcta tgcttgctga gcttgactgc180tactccactc ctggtgctgt gtccagctct ttccaggcac ctgttgctgc tacccctgct240ccggtcgctg cgcctgttgc agctgctcct gctccggctg tcaacaatgc tctccttgcc300aaggctgagt ctgttgtcat ggaggttctt gccgccaaga ctggttacga gactgacatg360atcgagcccg acatggagct cgagactgag ctcggcattg actctatcaa gcgtgtcgag420attctctctg aggtccaggc ccagctcaac gtcgaggcca aggatgttga tgctcttagc480cgcacccgca ccgtcggtga ggttgtcaac gccatgaagg ctgagatcgc tggcagctct540ggtgctgccg ctgctgcccc ggccccggtt gctgctgctc ccgctgcccc tgcccctgct600gtcaacagcg ctcttcttgc caaggctgag actgttgtca tggaggttct tgccgccaag660actggttacg agactgacat gattgagccc gacatggagc tcgagactga gctcggcatt720gactccatca agcgtgtcga gattctctct gaggttcagg cccagctcaa cgttgaggcc780aaggatgttg atgctcttag ccgcacccgc accgttggtg aggttgtcaa cgccatgaag840gctgagatcg ctggcagctc tggtgctgcc gctgctgccc cggcccctgt tgctgctgct900ccggcgcccg tcgctgccgc tgcccctgct gtcagcagcg ctctccttga gaaggctgag960
tctgttgtca tggaggttct tgccgccaag actggttacg agactgacat gattgaggcc1020gacatggagc tcgagactga gctcggcatt gactccatca agcgtgtcga gattctctct1080gaggtccagg cccagctcaa cgtcgaggcc aaggatgtcg atgctcttag ccgcacccgc1140accgttggtg aggttgtcaa cgccatgaag gctgagatcg ctggcagctc tggtgctgct1200gccccggccc cggtcgctgc ggcccctgct ccggtcgctg ccgctgcccc tgctgtcaac1260agcgctcttc ttgagaaggc tgagactgtt gtcatggagg ttcttgccgc caagactggt1320tacgagactg acatgatcga gcccgacatg gagctcgaga ctgagctcgg cattgactct1380atcaagcgtg tcgagattct ctctgaggtc caggcccagc tcaacgttga ggccaaggat1440gttgatgctc ttagccgcac ccgcaccgtt ggtgaggttg tcaacgccat gaaggctgag1500atcgctggca gctctggtgc tgccgctgct gccccggccc cggttgctgc tgctcccgct1560cccgtcgctg cccctgctgt cagcagcgct ctccttgaga aggctgagtc tgtcgtcatg1620gaggttcttg ccgccaagac tggttacgag actgacatga ttgaggccga catggagctc1680gagactgagc tcggcattga ctccatcaag cgtgtcgaga ttctctctga ggtccaggcc1740cagctcaacg ttgaggccaa ggatgtcgat gctcttagcc gcacccgcac cgttggtgag1800gttgtcaacg ccatgaaggc tgagatcgct ggcagctctg gtgctgccgc tgctgccccg1860gcccctgttg ctgcctctcc cgctcccgtc gctgccgctg cccctgctgt cagcagcgct1920ctccttgaga aggccgaatc tgttgtcatg gaggttctcg ccgccaagac tggttacgag1980actgacatga ttgaggctga catggagctc gagactgagc tcggcattga ctctatcaag2040cgtgtcgaga ttctctctga ggtccaggct atgcttaacg ttgaggccaa ggatgttgat2100gctcttagcc gcacccgcac cgttggtgag gttgtcaacg ccatgaaggc tgagatcgct2160ggcagctctg gtgccgccgc tgctgccccg gccccggttg ctgctgctcc ggcgcccgtc2220actgccgctg cccctgctgt cagcagcgct ctccttgaga aggccgaatc tgttgtcatg2280gaggttctcg ccgccaagac tggttacgag actgacatga ttgaggccga catggagctc2340gagactgagc ttggcattga ctccatcaag cgtgtcgaga ttctctctga ggtccaggct2400atgcttaacg tcgaggccaa ggatgttgat gctcttagcc gcacccgcac cgttggtgag2460gttgtcaacg ccatgaaggc tgagattgct agcagctctg gtgctgctgc ccctgctccg2520gctgctgccg ttgcaccggc ccctgctgct gcccctgctg tcagcagcgc tctccttgag2580aaggccgaat ctgttgtcat ggaggttctc gccgccaaga ctggttacga gactgacatg2640attgaggccg acatggagct cgagactgag ctcggcattg actctatcaa gcgtgtcgag2700attctctctg aggtccaggc tatgcttaac gttgaggcca aggatgttga tgctcttagc2760cgcacccgca ccgttggtga ggttgtcaac gccatgaagg ctgagattgc tagcagctct2820
ggtgctgctg cccctgctcc tgctgctgcc gctgcaccgg cccctgctgc tgcccctgct2880gtcagcagcg ctcttcttga gaaggctgag tctgttgtca tggaggttct cgccgccaag2940actggttacg agactgacat gattgaggcc gacatggagc tcgagactga gcttggcatt3000gactccatca agcgtgtcga gattctctct gaggtccagg ctatgcttaa cgttgaggcc3060aaggatgttg atgctcttag ccgcacccgc accgttggtg aggttgtcaa cgccatgaag3120gctgagattg ctagcagctc tggtgctgct gcccctgctc ctgctgctgc cgctgcaccg3180gcccctgctg ctgcccctgc tgtcagcagc gctcttcttg agaaggctga gtctgttgtc3240atggaggttc tcgccgccaa gactggttac gagactgaca tgattgaggc cgacatggag3300ctcgagactg agcttggcat tgactccatc aagcgtgtcg agattctctc tgaggtccag3360gctatgctta acgttgaggc caaggatgtt gatgctctta gccgcacccg caccgttggt3420gaggttgtca acgccatgaa ggctgagatc gctggcagct ctggtgctgc tactgcctct3480gcccctgctg ctgcagctgc cgcccctgct 3510<210>17<211>219<212>DNA<213>Ulkenia sp.
<400>17ctccttgcca aggctgagtc tgttgtcatg gaggttcttg ccgccaagac tggttacgag60actgacatga tcgagcccga catggagctc gagactgagc tcggcattga ctctatcaag120cgtgtcgaga ttctctctga ggtccaggcc cagctcaacg tcgaggccaa ggatgttgat180gctcttagcc gcacccgcac cgtcggtgag gttgtcaac 219<210>18<211>219<212>DNA<213>Ulkenia sp.
<400>18cttcttgcca aggctgagac tgttgtcatg gaggttcttg ccgccaagac tggttacgag60actgacatga ttgagcccga catggagctc gagactgagc tcggcattga ctccatcaag120cgtgtcgaga ttctctctga ggttcaggcc cagctcaacg ttgaggccaa ggatgttgat180gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219<210>19<211>219<212>DNA<213>Ulkenia sp.
<400>19
ctccttgaga aggctgagtc tgttgtcatg gaggttcttg ccgccaagac tggttacgag60actgacatga ttgaggccga catggagctc gagactgagc tcggcattga ctccatcaag120cgtgtcgaga ttctctctga ggtccaggcc cagctcaacg tcgaggccaa ggatgtcgat180gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219<210>20<211>219<212>DNA<213>Ulkenia sp.
<400>20cttcttgaga aggctgagac tgttgtcatg gaggttcttg ccgccaagac tggttacgag60actgacatga tcgagcccga catggagctc gagactgagc tcggcattga ctctatcaag120cgtgtcgaga ttctctctga ggtccaggcc cagctcaacg ttgaggccaa ggatgttgat180gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219<210>21<211>219<212>DNA<213>Ulkenia sp.
<400>21ctccttgaga aggctgagtc tgtcgtcatg gaggttcttg ccgccaagac tggttacgag60actgacatga ttgaggccga catggagctc gagactgagc tcggcattga ctccatcaag120cgtgtcgaga ttctctctga ggtccaggcc cagctcaacg ttgaggccaa ggatgtcgat180gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219<210>22<211>219<212>DNA<213>Ulkenia sp.
<400>22ctccttgaga aggccgaatc tgttgtcatg gaggttctcg ccgccaagac tggttacgag60actgacatga ttgaggctga catggagctc gagactgagc tcggcattga ctctatcaag120cgtgtcgaga ttctctctga ggtccaggct atgcttaacg ttgaggccaa ggatgttgat180gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219<210>23<211>219<212>DNA<213>Ulkenia sp.
<400>23ctccttgaga aggccgaatc tgttgtcatg gaggttctcg ccgccaagac tggttacgag 60
actgacatga ttgaggccga catggagctc gagactgagc ttggcattga ctccatcaag120cgtgtcgaga ttctctctga ggtccaggct atgcttaacg tcgaggccaa ggatgttgat180gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219<210>24<211>219<212>DNA<213>Ulkenia sp.
<400>24ctccttgaga aggccgaatc tgttgtcatg gaggttctcg ccgccaagac tggttacgag60actgacatga ttgaggccga catggagctc gagactgagc tcggcattga ctctatcaag120cgtgtcgaga ttctctctga ggtccaggct atgcttaacg ttgaggccaa ggatgttgat180gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219<210>25<211>219<212>DNA<213>Ulkenia sp.
<400>25cttcttgaga aggctgagtc tgttgtcatg gaggttctcg ccgccaagac tggttacgag60actgacatga ttgaggccga catggagctc gagactgagc ttggcattga ctccatcaag120cgtgtcgaga ttctctctga ggtccaggct atgcttaacg ttgaggccaa ggatgttgat180gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219<210>26<211>219<212>DNA<213>Ulkenia sp.
<400>26cttcttgaga aggctgagtc tgttgtcatg gaggttctcg ccgccaagac tggttacgag60actgacatga ttgaggccga catggagctc gagactgagc ttggcattga ctccatcaag120cgtgtcgaga ttctctctga ggtccaggct atgcttaacg ttgaggccaa ggatgttgat180gctcttagcc gcacccgcac cgttggtgag gttgtcaac 219<210>27<211>609<212>DNA<213>Ulkenia sp.
<400>27aagaagctcg ttggcactat tgctggtgcc cgtgaggttc gttcctcaat tgctaacatt60gaggctctcg gtggcaaggc aatctactcc tcttgtgatg tgaactctgc tgctgatgtc120
gccaaggctg ttcgcgaggc tgaggctcag cttggcgccc gtgtaactgg tgtcgtccac180gcttctggtg tccttcgtga ccgcctcatt gagcagaagc gccccgatga gtttgatgct240gtcttcggca ccaaggtgac tggtctcgag aacctctttg gtgccattga catggccaac300cttaagcacc tcgtcctctt cagctctctt gctggtttcc acggcaacat tggtcagtct360gactacgcca tggctaacga ggccctcaac aagatgggtc ttgagctctc tgaccgtgtg420tccgtgaagt ctatttgctt cggcccctgg gatggtggca tggttacccc ccagctcaag480aagcagttcc agtctatggg tgttcagatc atcccccgtg agggtggtgc cgatactgtg540gctcgcattg tcctcggctc ctcccctgct gagatccttg ttggcaactg gaccactccc600accaagaag609<210>28<211>279<212>PRT<213>Ulkenia sp.
<400>28Thr Arg Ile Ala Val Ile Gly Met Ser Ala Ile Leu Pro Cys Gly Thr1 510 15Thr Val Arg Glu Ser Trp Glu Ala Ile Arg Asp Gly Ile Asp Cys Leu20 25 30Ser Asp Leu Pro Glu Asp Arg Val Asp Val Thr Ala Tyr Phe Asp Pro35 40 45Val Lys Thr Thr Lys Asp Lys Ile Tyr Cys Lys Arg Gly Gly Phe Ile50 5560Pro Glu Tyr Asp Phe Asp Ala Arg Glu Phe Gly Leu Asn Met Phe Gln65 70 75 80Met Glu Asp Ser Asp Ala Asn Gln Thr Val Thr Leu Leu Lys Val Lys8590 95Glu Ala Leu Glu Asp Ala Gly Ile Glu Ala Leu Ser Lys Glu Lys Lys100 105 110Asn Ile Gly Cys Val Leu Gly Ile Gly Gly Gly Gln Lys Ser Ser His115 120 125Glu Phe Tyr Ser Arg Leu Asn Tyr Val Val Val Glu Lys Val Leu Arg130 135 140Lys Met Gly Met Pro Glu Glu Asp Val Gln Ala Ala Val Glu Lys Tyr145 150 155 160Lys Ala Asn Phe pro Glu Trp Arg Leu Asp Ser Phe Pro Gly Phe Leu165 170 175Gly Asn Val Thr Ala Gly Arg Cys Thr Asn Thr Phe Asn Leu Asp Gly180 185 190
Met Asn Cys Val Val Asp Ala Ala Cys Ala Ser Ser Leu Ile Ala Val195 200 205Lys Val Ala Ile Asp Glu Leu Leu His Gly Asp Cys Asp Met Met Ile210 215 220Thr Gly Ala Thr Cys Thr Asp Asn Ser Ile Gly Met Tyr Met Ala Phe225 230 235 240Ser Lys Thr Pro Val Phe Ser Thr Asp Pro Ser Val Arg Ala Tyr Asp245 250 255Glu Lys Thr Lys Gly Met Leu Ile Gly Glu Gly Ser Ala Met Leu Val260 265 270Leu Lys Arg Tyr Ala Asp Ala275<210>29<211>17<212>PRT<213>Ulkenia sp.
<400>29Gly Met Asn Cys Val Val Asp Ala Ala Cys Ala Ser Ser Leu Ile Ala1 510 15Val<210>30<211>4<212>PRT<213>Ulkenia sp.
<400>30Asp Ala Ala Cys1<210>31<211>174<212>PRT<213>Ulkenia sp.
<400>31His Ala Val Ile Arg Gly Cys Ala Ser Ser Ser Asp Gly Lys Ala Ser1 510 15Gly Ile Tyr Thr Pro Thr Ile Ser Gly Gln Glu Glu Ala Leu Arg Arg20 25 30Ala Tyr Met Arg Ala Asn Val Asp Pro Ala Thr Val Thr Leu Val Glu35 40 45Gly His Gly Thr Gly Thr Pro Val Gly Asp Arg Ile Glu Leu Thr Ala50 5560Leu Arg Asn Leu Phe Asp Ser Ala Tyr Gly Asn Glu Lys Glu Lys Val65 70 75 80
Ala Val Gly Ser Ile Lys Ser Asrn Ile Gly His Leu Lys Ala Val Ala85 9095Gly Leu Ala Gly Met Ile Lys Val Ile Met Ala Leu Lys His Lys Thr100 105 110Leu Pro Ala Thr Ile Asn Val Asp Glu Pro Pro Lys Leu Tyr Asp Asn115 120 125Thr Pro Ile Thr Asp Ser Ser Leu Tyr Ile Asn Thr Met Asn Arg Pro130 135 140Trp Phe Pro Ala Pro Gly Val Pro Arg Arg Ala Gly Ile Ser Ser Phe145 150 155 160Gly Phe Gly Gly Ala Asn Tyr His Ala Val Leu Glu Glu Ala165 170<210>32<211>460<212>PRT<213>Ulkenia sp.
<400>32Thr Arg Ile Ala Val Ile Gly Met Ser Ala Ile Leu Pro Cys Gly Thr1 510 15Thr Val Arg Glu Ser Trp Glu Ala Ile Arg Asp Gly Ile Asp Cys Leu20 25 30Ser Asp Leu Pro Glu Asp Arg Val Asp Val Thr Ala Tyr Phe Asp pro35 40 45Val Lys Thr Thr Lys Asp Lys Ile Tyr Cys Lys Arg Gly Gly Phe Ile50 5560Pro Glu Tyr Asp Phe Asp Ala Arg Glu Phe Gly Leu Asn Met Phe Gln65 70 75 80Met Glu Asp Ser Asp Ala Asn Gln Thr Val Thr Leu Leu Lys Val Lys8590 95Glu Ala Leu Glu Asp Ala Gly Ile Glu Ala Leu Ser Lys Glu Lys Lys100 105 110Asn Ile Gly Cys Val Leu Gly Ile Gly Gly Gly Gln Lys Ser Ser His115 120 125Glu Phe Tyr Ser Arg Leu Asn Tyr Val Val Val Glu Lys Val Leu Arg130 135 140Lys Met Gly Met Pro Glu Glu Asp Val Gln Ala Ala Val Glu Lys Tyr145 150 155 160Lys Ala Asn Phe Pro Glu Trp Arg Leu Asp Ser Phe Pro Gly Phe Leu165 170 175Gly Asn Val Thr Ala Gly Arg Cys Thr Asn Thr Phe Asn Leu Asp Gly180 185 190
Met Asn Cys Val Val Asp Ala Ala Cys Ala Ser Ser Leu Ile Ala Val195 200 205Lys Val Ala Ile Asp Glu Leu Leu His Gly Asp Cys Asp Met Met Ile210 215 220Thr Gly Ala Thr Cys Thr Asp Asn Ser Ile Gly Met Tyr Met Ala Phe225 230 235 240Ser Lys Thr Pro Val Phe Ser Thr Asp Pro Ser Val Arg Ala Tyr Asp245 250 255Glu Lys Thr Lys Gly Met Leu Ile Gly Glu Gly Ser Ala Met Leu Val260 265 270Leu Lys Arg Tyr Ala Asp Ala Val Arg Asp Gly Asp Glu Ile His Ala275 280 285Val Ile Arg Gly Cys Ala Ser Ser Ser Asp Gly Lys Ala Ser Gly Ile290 295 300Tyr Thr Pro Thr Ile Ser Gly Gln Glu Glu Ala Leu Arg Arg Ala Tyr305 310 315 320Met Arg Ala Asn Val Asp Pro Ala Thr Val Thr Leu Val Glu Gly His325 330 335Gly Thr Gly Thr Pro Val Gly Asp Arg Ile Glu Leu Thr Ala Leu Arg340 345 350Asn Leu Phe Asp Ser Ala Tyr Gly Asn Glu Lys Glu Lys Val Ala Val355 360 365Gly Ser Ile Lys Ser Asn Ile Gly His Leu Lys Ala Val Ala Gly Leu370 375 380Ala Gly Met Ile Lys Val Ile Met Ala Leu Lys His Lys Thr Leu Pro385 390 395 400Ala Thr Ile Asn Val Asp Glu Pro Pro Lys Leu Tyr Asp Asn Thr Pro405 410 415Ile Thr Asp Ser Ser Leu Tyr Ile Asn Thr Met Asn Arg Pro Trp Phe420 425 430Pro Ala Pro Gly Val Pro Arg Arg Ala Gly Ile Ser Ser Phe Gly Phe435 440 445Gly Gly Ala Asn Tyr His Ala Val Leu Glu Glu Ala450 455 460<210>33<211>332<212>PRT<213>Ulkenia sp.
<400>33Leu Phe Ser Gly Gln Gly Ala Gln Tyr Thr His Met Phe Ser Glu Val1 5 10 15
Ala Met Asn Trp Pro Gln Phe Arg Glu Ser Ile Ser Asp Met Asp Arg20 25 30Ala Gln Ala Lys Val Ala Gly Ala Asp Lys Asp Tyr Glu Arg Val Ser35 40 45Gln Val Leu Tyr Pro Arg Lys Pro Tyr Asn Ser Glu Pro Glu Gln Asp50 5560His Lys Lys Ile Ser Leu Thr Ser Tyr Ser Gln Pro Ser Thr Leu Ala65 70 75 80Cys Ala Leu Gly Ala Tyr Glu Ile Phe Lys Gln Ala Gly Phe Lys Pro85 9095Asp Phe Ala Ala Gly His Ser Leu Gly Glu Phe Ala Ala Leu Tyr Ala100 105 110Ala Asp Cys Val Asn Arg Asp Asp Leu Phe Glu Leu Val Cys Arg Arg115 120 125Ala Arg Ile Met Gly Gly Lys Asp Ala Pro Ala Thr Pro Lys Gly Cys130 135 140Met Ala Ala Val Ile Gly Pro Asn Ala Glu Lys Ile Gln Ile Arg Thr145 150 155 160Ala Asp Val Trp Leu Gly Asn Cys Asn Ser Pro Ser Gln Thr Val Ile165 170 175Thr Gly Ser Val Glu Gly Ile Lys Lys Glu Ser Glu Leu Leu Gln Ser180 185 190Glu Gly Phe Arg Val Val Pro Leu Ala Cys Glu Ser Ala Phe His Ser195 200 205Pro Gln Met Gln Asn Ala Ser Ser Ala Phe Lys Asp Val Leu Ser Lys210 215 220Val Ala Phe Arq Gln Pro Ser Ala Gln Thr Lys Leu Phe Ser Asn Val225 230 235 240Ser Gly Glu Thr Tyr Ser Asn Asn Ala Gln Asp Leu Leu Lys Glu His245 250 255Met Thr Ser Ser Val Lys Phe Ile Ser Gln Val Arg Asn Met His Ser260 265 270Ala Gly Ala Arg Ile Phe Val Glu Phe Gly Pro Lys Gln Val Leu Ser275 280 285Lys Leu Val Ser Glu Thr Leu Lys Asp Asp Pro Ser Ile Ile Thr Ile290 295 300Ser Val Asn Pro Ser Ser Gly Lys Asp Ala Asp Ile Gln Leu Arg Glu305 310 315 320Ala Ala Val Gln Leu Val Val Ala Gly Val Asn Leu325 330
<210>34<211>1170<212>PRT<213>Ulkenia sp.
<400>34Ala Gln Ala Gln Ile Gln Lys Ala Lys Ala Asp Ala Ala Glu Ala Asp1 510 15Lys Lys Leu Ala Ala Ala Lys Asp Glu Ala Lys Arg Ala Ala Ala Ser20 25 30Ala Pro Val Gln Lys Gln Val Asp Thr Thr Ile Val Asp Lys His Arg35 40 45Ala Ile Leu Lys Ser Met Leu Ala Glu Leu Asp Cys Tyr Ser Thr Pro50 5560Gly Ala Val Ser Ser Ser Phe Gln Ala Pro Val Ala Ala Thr Pro Ala65 70 75 80Pro Val Ala Ala Pro Val Ala Ala Ala Pro Ala Pro Ala Val Asn Asn85 9095Ala Leu Leu Ala Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala100 105 110Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro Asp Met Glu Leu Glu115 120 125Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu130 135 140Val Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser145 150 155 160Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile165 170 175Ala Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro Ala Pro Val Ala Ala180 185 190Ala Pro Ala Ala Pro Ala Pro Ala Val Asn Ser Ala Leu Leu Ala Lys195 200 205Ala Glu Thr Val Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu210 215 220Thr Asp Met Ile Glu Pro Asp Met Glu Leu Glu Thr Glu Leu Gly Ile225 230 235 240Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala Gln Leu245 250 255Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val260 265 270Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala Gly Ser Ser Gly275 280 285Ala Ala Ala Ala Ala Pro Ala Pro Val Ala Ala Ala Pro Ala Pro Val
290 295 300Ala Ala Ala Ala Pro Ala Val Ser Ser Ala Leu Leu Glu Lys Ala Glu305 310 315 320Ser Val Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp325 330 335Met Ile Glu Ala Asp Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser340 345 350Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala Gln Leu Asn Val355 360 365Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu370 375 380Val Val Asn Ala Met Lys Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala385 390 395 400Ala Pro Ala Pro Val Ala Ala Ala Pro Ala Pro Val Ala Ala Ala Ala405 410 415Pro Ala Val Asn Ser Ala Leu Leu Glu Lys Ala Glu Thr Val Val Met420 425 430Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro435 440 445Asp Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val450 455 460Glu Ile Leu Ser Glu Val Gln Ala Gln Leu Asn Val Glu Ala Lys Asp465 470 475 480Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala485 490 495Met Lys Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro500 505 5l0Ala Pro Val Ala Ala Ala Pro Ala Pro Val Ala Ala Pro Ala Val Ser515 520 525Ser Ala Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala530 535 540Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu545 550 555 560Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser565 570 575Glu Val Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu580 585 590Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu595 600 605Ile Ala Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro Ala Pro Val Ala610 615 620
Ala Ser Pro Ala Pro Val Ala Ala Ala Ala Pro Ala Val Ser Ser Ala625 630 635 640Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys645 650 655Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr660 665 670Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val675 680 685Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg690 695 700Thr Arg Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala705 710 715 720Gly Ser Ser Gly Ala Ala Ala Ala Ala Pro Ala Pro Val Ala Ala Ala725 730 735Pro Ala Pro Val Thr Ala Ala Ala Pro Ala Val Ser Ser Ala Leu Leu740 745 750Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys Thr Gly755 760 765Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr Glu Leu770 775 780Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala785 790 795 800Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg805 810 815Thr Val Gly Glu Val Val Asn Ala Met Lys Ala Glu Ile Ala Ser Ser820 825 830Ser Gly Ala Ala Ala Pro Ala Pro Ala Ala Ala Val Ala Pro Ala Pro835 840 845Ala Ala Ala Pro Ala Val Ser Ser Ala Leu Leu Glu Lys Ala Glu Ser850 855 860Val Val Met Glu Val Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met865 870 875 880Ile Glu Ala Asp Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile885 890 895Lys Arg Val Glu Ile Leu Ser Glu Val Gln Ala Met Leu Asn Val Glu900 905 910Ala Lys Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val915 920 925Val Asn Ala Met Lys Ala Glu Ile Ala Ser Ser Ser Gly Ala Ala Ala930 935 940Pro Ala Pro Ala Ala Ala Ala Ala Pro Ala Pro Ala Ala Ala Pro Ala
945 950 955 960Val Ser Ser Ala Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val965 970 975Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met980 985 990Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile995 1000 1005Leu Ser Glu Val Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val1010 1015 1020Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val Asn Ala1025 1030 1035Met Lys Ala Glu Ile Ala Ser Ser Ser Gly Ala Ala Ala Pro Ala1040 1045 1050Pro Ala Ala Ala Ala Ala Pro Ala Pro Ala Ala Ala Pro Ala Val1055 1060 1065Ser Ser Ala Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val1070 1075 1080Leu Ala Ala Lys Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp1085 1090 1095Met Glu Leu Glu Thr Glu Leu Gly Ile Asp Ser Ile Lys Arg Val1100 1105 1110Glu Ile Leu Ser Glu Val Gln Ala Met Leu Asn Val Glu Ala Lys1115 1120 1125Asp Val Asp Ala Leu Ser Arg Thr Arg Thr Val Gly Glu Val Val1130 1135 1140Asn Ala Met Lys Ala Glu Ile Ala Gly Ser Ser Gly Ala Ala Thr1145 1150 1155Ala Ser Ala Pro Ala Ala Ala Ala Ala Ala Pro Ala1160 1165 1170<210>35<211>73<212>PRT<213>Ulkenia sp.
<400>35Leu Leu Ala Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys1 510 15Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro Asp Met Glu Leu Glu Thr20 25 30Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val35 40 45Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg
5055 60Thr Arg Thr Val Gly Glu Val Val Asn65 70<210>36<211>73<212>PRT<213>Ulkenia sp.
<400>36Leu Leu Ala Lys Ala Glu Thr Val Val Met Glu Val Leu Ala Ala Lys1 510 15Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro Asp Met Glu Leu Glu Thr20 25 30Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val35 40 45Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg5055 60Thr Arg Thr Val Gly Glu Val Val Asn65 70<210>37<211>73<212>PRT<213>Ulkenia sp.
<400>37Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys1 510 15Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr20 25 30Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val35 40 45Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg50 5560Thr Arg Thr Val Gly Glu Val Val Asn65 70<210>38<211>73<212>PRT<213>Ulkenia sp.
<400>38Leu Leu Glu Lys Ala Glu Thr Val Val Met Glu Val Leu Ala Ala Lys1 510 15Thr Gly Tyr Glu Thr Asp Met Ile Glu Pro Asp Met Glu Leu Glu Thr20 25 30
Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val35 40 45Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg50 5560Thr Arg Thr Val Gly Glu Val Val Asn65 70<210>39<211>73<212>PRT<213>Ulkenia sp.
<400>39Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys1 5 10 15Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr20 25 30Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val35 40 45Gln Ala Gln Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg50 5560Thr Arg Thr Val Gly Glu Val Val Asn65 70<210>40<211>73<212>PRT<213>Ulkenia sp.
<400>40Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys1 5 1015Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr20 25 30Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val35 40 45Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg50 5560Thr Arg Thr Val Gly Glu Val Val Asn65 70<210>41<211>73<212>PRT<213>Ulkenia sp.
<400>41Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys
1 510 15Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr20 25 30Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val35 40 45Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg50 5560Thr Arg Thr Val Gly Glu Val Val Asn65 70<210>42<211>73<212>PRT<213>Ulkenia sp.
<400>42Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys1 510 15Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr20 25 30Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val3540 45Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg50 5560Thr Arg Thr Val Gly Glu Val Val Asn65 70<210>43<211>73<212>PRT<213>Ulkenia sp.
<400>43Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys1 5 1015Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr20 25 30Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val35 4045Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg50 5560Thr Arg Thr Val Gly Glu Val Val Asn65 70<210>44<211>73<212>PRT
<213>Ulkenia sp.
<400>44Leu Leu Glu Lys Ala Glu Ser Val Val Met Glu Val Leu Ala Ala Lys1 510 15Thr Gly Tyr Glu Thr Asp Met Ile Glu Ala Asp Met Glu Leu Glu Thr20 25 30Glu Leu Gly Ile Asp Ser Ile Lys Arg Val Glu Ile Leu Ser Glu Val35 40 45Gln Ala Met Leu Asn Val Glu Ala Lys Asp Val Asp Ala Leu Ser Arg50 5560Thr Arg Thr Val Gly Glu Val Val Asn65 70<210>45<211>203<212>PRT<213>Ulkenia sp.
<400>45Lys Lys Leu Val Gly Thr Ile Ala Gly Ala Arg Glu Val Arg Ser Ser1 510 15Ile Ala Asn Ile Glu Ala Leu Gly Gly Lys Ala Ile Tyr Ser Ser Cys20 25 30Asp Val Asn Ser Ala Ala Asp Val Ala Lys Ala Val Arg Glu Ala Glu35 40 45Ala Gln Leu Gly Ala Arg Val Thr Gly Val Val His Ala Ser Gly Val5055 60Leu Arg Asp Arg Leu Ile Glu Gln Lys Arg Pro Asp Glu Phe Asp Ala65 70 75 80Val Phe Gly Thr Lys Val Thr Gly Leu Glu Asn Leu Phe Gly Ala Ile85 9095Asp Met Ala Asn Leu Lys His Leu Val Leu Phe Ser Ser Leu Ala Gly100 105 110Phe His Gly Asn Ile Gly Gln Ser Asp Tyr Ala Met Ala Asn Glu Ala115 120 125Leu Asn Lys Met Gly Leu Glu Leu Ser Asp Arg Val Ser Val Lys Ser130 135 140Ile Cys Phe Gly Pro Trp Asp Gly Gly Met Val Thr Pro Gln Leu Lys145 150 155 160Lys Gln Phe Gln Ser Met Gly Val Gln Ile Ile Pro Arg Glu Gly Gly165 170 175Ala Asp Thr Val Ala Arg Ile Val Leu Gly Ser Ser Pro Ala Glu Ile180 185190
Leu Val Gly Asn Trp Thr Thr Pro Thr Lys Lys195 200<210>46<211>780<212>DNA<213>Ulkenia sp.
<400>46aagcgcattg ccgtggtggg catggccgtg caatacgcgg gctgcaaaga caaggaagag60ttctggaaag tagtcatggg cggtgaggct gcatggacta agattagcga taaacgcctc120ggatccaaca agcgagccga gcacttcaaa gcagagcgta gcaaatttgc agataccttt180tgcaacgaga actacggctg cgtcgatgac tccgtcgata acgaacacga gcttctcctt240aagctctcca agaaggctct ctccgagaca tcggtctccg actctacaag gtgcggtatt300gtgagcggat gcctgtcctt tcccatggac aacctccagg gcgaactcct caatgtgtac360caaaaccacg tcgaaaagaa actcggcgct cgcgtcttca aggatgcctc caagtggtcc420gagcgtgagc agtcgcagaa ccccgaggct ggtgaccgcc gcatctttat ggacccggca480tccttcgtag cagaagagct caacctcggt cctcttcact actctgtcga tgctgcctgt540gccaccgccc tttacgtcct tcgcctcgcc caggaccacc tcgtttccgg tgctgctgat600gtcatgctcg ctggtgcaac ttgcttcccg gagccctttt tcattctctc cggattctcc660actttccagg ccatgcctgt atcgggagac ggcatctcgt acccgcttca caaggacagt720cagggtctca cccctggtga aggtggtgcc attatggttc tcaagcgcct tgacgacgct780<210>47<211>51<212>DNA<213>Ulkenia sp.
<400>47cctcttcact actctgtcga tgctgcctgt gccaccgccc tttacgtcct t51<210>48<211>12<212>DNA<213>Ulkenia sp.
<400>48gatgctgcct gt12<210>49<211>477<212>DNA<213>Ulkenia sp.
<400>49
tacggtactc tgctcggtgc taccatcagc aatgctggct gtggtcttcc cctcaagccg60cacttgccca gcgagaagtc ctgcctcatt gatacctaca agcgcgtcaa cgtgcacccg120cacaagatcc agtacgtcga gtgccacgca acgggtactc cccagggaga ccgcgttgag180attgatgccg tcaaggcttg cttcgagggc aaggtgcctc gctttggaag ctccaagggt240aactttggcc acacactcgt tgcagctggt ttcgcaggca tgtgcaaggt actccttgcc300atgaagcatg gtgtgatccc gcccactcct ggtgtcgatg gatcttccca aatggacccg360cttgtggtct ctgagcccat cccatggccc gacactgagg gcgagcccaa gcgcgctggt420ctctccgctt tcggctttgg tggcaccaac gcccacgcag tctttgagga gtttgac 477<210>50<211>1278<212>DNA<213>Ulkenia sp.
<400>50aagcgcattg ccgtggtggg catggccgtg caatacgcgg gctgcaaaga caaggaagag60ttctggaaag tagtcatggg cggtgaggct gcatggacta agattagcga taaacgcctc120ggatccaaca agcgagccga gcacttcaaa gcagagcgta gcaaatttgc agataccttt180tgcaacgaga actacggctg cgtcgatgac tccgtcgata acgaacacga gcttctcctt240aagctctcca agaaggctct ctccgagaca tcggtctccg actctacaag gtgcggtatt300gtgagcggat gcctgtcctt tcccatggac aacctccagg gcgaactcct caatgtgtac360caaaaccacg tcgaaaagaa actcggcgct cgcgtcttca aggatgcctc caagtggtcc420gagcgtgagc agtcgcagaa ccccgaggct ggtgaccgcc gcatctttat ggacccggca480tccttcgtag cagaagagct caacctcggt cctcttcact actctgtcga tgctgcctgt540gccaccgccc tttacgtcct tcgcctcgcc caggaccacc tcgtttccgg tgctgctgat600gtcatgctcg ctggtgcaac ttgcttcccg gagccctttt tcattctctc cggattctcc660actttccagg ccatgcctgt atcgggagac ggcatctcgt acccgcttca caaggacagt720cagggtctca cccctggtga aggtggtgcc attatggttc tcaagcgcct tgacgacgct780attcgcgatg gagaccacat ttacggtact ctgctcggtg ctaccatcag caatgctggc840tgtggtcttc ccctcaagcc gcacttgccc agcgagaagt cctgcctcat tgatacctac900aagcgcgtca acgtgcaccc gcacaagatc cagtacgtcg agtgccacgc aacgggtact960ccccagggag accgcgttga gattgatgcc gtcaaggctt gcttcgaggg caaggtgcct1020cgctttggaa gctccaaggg taactttggc cacacactcg ttgcagctgg tttcgcaggc1080atgtgcaagg tactccttgc catgaagcat ggtgtgatcc cgcccactcc tggtgtcgat1140
ggatcttccc aaatggaccc gcttgtggtc tctgagccca tcccatggcc cgacactgag1200ggcgagccca agcgcgctgg tctctccgct ttcggctttg gtggcaccaa cgcccacgca1260gtctttgagg agtttgac 1278<210>51<211>801<212>DNA<213>Ulkenia sp.
<400>51atgcgcattg ctattaccgg tatggatgcc accttcggct ccctcaaggg cctggacgcc60tttgagcgtg ccatctacaa tggccaacat ggtgctgtgc cattgcctga gaagcgctgg120cgtttccttg gtaaagacaa ggactttttg gacctgtgcg gtgtcaagga ggtgccccac180ggatgctaca ttgaggacgt cgaggtggac tttagccgcc tgcgcacgcc catgacgcca240gacgacatgt tgcgccccat gcagctactt gctgtcacaa ccatcgaccg tgccattctc300aactctggcc tcaagaaggg aggtaaggtc gctgtcttcg tcggccttgg cactgacctt360gagctctacc gtcaccgcgc ccgcgttgcc ctcaaggagc gtgctcgtcc cgaagccgct420tcagccctca atgatatgat gtcctacatc aacgattgcg gtaccgctac ctcgtacaca480tcctacatcg gcaacctcgt ggccacccgc gtgtcttcac aatggggttt cgagggtcct540tctttcacca tcacagaggg caacaactcc gtctaccgtt gcgcagagtt gggcaagtac600ttgctcgaga ctggcgaggt cgaggccgta gtgatcgccg gtgtggatct ttgcgccagc660gctgagaatc tctacgtgaa gtcgcgtcgt ttcaaggtct cggagcagga gagcccgcgg720gccagcttcg actccggcgc tgacggctac tttgttggtg agggatgtgg tgccctcgtc780ctcaagcgcg agagcgactg c 801<210>52<211>792<212>DNA<213>Ulkenia sp.
<400>52gctgctttcg gactgagcct tggagagatt tccatggttt ttgccttttc tgagaagaac60ggccttgtct ctgaggagct gacaactaaa ctccgcaact cggaggtctg gcgtaaggcc120ctcgctgttg agtttgacgc cctccgcaag gcctggaata ttccccaaga tacccctgtc180agcgagttct ggcaaggata cgtggtacgt ggaacccgcg aggccgttga agcggccatc240ggccccaaca ataagtacgt gcacttgacc attgtcaacg atgccaacag tgctctcatc300agtggcaagc ctgaagattg caaggctgcc attgctcgcc tgagcagcaa cctccctgct360ttgcccgtgg accttggtat gtgtggccac tgccccgtgg tcgagccgta cggcaagcag420
atcgctgaga tccatagcgt cctcgagatt cccgaggttg ccggccttga cctgtacacg480agcgtcaacc agaagaagct tgttaacaag tccactggag ccagcgacga gtacgcaccc540agctttggtg aatacgcagc acagctgtac actgttcagg cagactttcc taagatcgcc600aagaccgtta gcgacaagaa ctttgacgtc tttgttgaga ctggtcccaa cgcccaccgt660agcgccgcaa ttcgcgccac ccttggaaat agcaagcctt ttgtcaccgg atccatggac720cgccagaacg agaatgcttg gacaaccatg gtcaagctgg ttgcctctct ccaagcccac780cgcgtgcctg gc792<210>53<211>1302<212>DNA<213>Ulkenia sp.
<400>53agccgtgcct tcatggagac atatggtgta tccgccccca tgtacaccgg cgccatggca60aagggcattg catccgctga gatggttatc gctgccggaa agcgcggcat ccttggttct120ctcggtgctg gtggtcttcc tatcgccacc gtacgcaagg ctctcgaagc tatccaggct180gaactgccca agggccctta cgctgtcaac ctcatccact ctcccttcga cagcaacctc240gagaagggta acgtcgacct cttcctcgag aagggcgtca ctgtcgttga agcctccgcc300tttatgacct tgaccccgca gctcgtgcgc taccgtgctg caggtctctc tcgcgctgct360gatggctcca cggttattaa gaaccgcgtc atcggtaagg tttctcgcac agagcttgcc420gcaatgttta tccgtcccgc gcccgagaat ctcctcgaga agctgctgaa gtccggcgag480atcacccaag agcaggctgc tctcgcacgc acagtgcctg tggcagacga cattgccgtt540gaggcggact ccggtggcca caccgataac cgccccatcc acgtcatcct ccctctcatt600gtcaacctcc gtgatcgtct gcacaaggag tgcggctacc ctgcccacct tcgcgttcgc660gttggtgctg gtggtggcat tggatgccct caggccgcca ttgccacctt caacatgggc720gcggccttca tcgtcactgg taccgtaaac cagatgagta agcaagctgg aacctgtgac780accgttcgca agcagctctc acaagccacc tactccgaca tctgcatggc cccagcagct840gacatgtttg aggaaggtgt caagctccag gtgctcaaga agggaactat gttcccctcg900cgtgccaaca agctctatga gctcttcgtc aagtatgact cctttgagtc catggctcct960ggagagctgg aacgtgtgga gaagcgcatt ttcaagaagt ctctgtcaga agtttgggaa1020gagaccaagg acttctacat caacaggttg cagaacccgg agaagattga gcgcgcggag1080cgtgacccca agcttaagat gtccttgtgc ttccgctggt accttggttt ggcgagcttc1140tgggcaaacg ctggcatccc ggaccgtgcc atggactacc aggtttggtg tggcccagcg1200
attggatctt tcaacgactt catcaagggt acctaccttg accccgccgt tgccaacgag1260taccccgatg ttgtgcaaat caacttgcag atcctccgtg gt1302<210>54<211>260<212>PRT<213>Ulkenia sp.
<400>54Lys Arg Ile Ala Val Val Gly Met Ala Val Gln Tyr Ala Gly Cys Lys1 510 15Asp Lys Glu Glu Phe Trp Lys Val Val Met Gly Gly Glu Ala Ala Trp20 25 30Thr Lys Ile Ser Asp Lys Arg Leu Gly Ser Asn Lys Arg Ala Glu His3540 45Phe Lys Ala Glu Arg Ser Lys Phe Ala Asp Thr Phe Cys Asn Glu Asn50 5560Tyr Gly Cys Val Asp Asp Ser Val Asp Asn Glu His Glu Leu Leu Leu65 70 75 80Lys Leu Ser Lys Lys Ala Leu Ser Glu Thr Ser Val Ser Asp Ser Thr85 9095Arg Cys Gly Ile Val Ser Gly Cys Leu Ser Phe Pro Met Asp Asn Leu100 105 110Gln Gly Glu Leu Leu Asn Val Tyr Gln Asn His Val Glu Lys Lys Leu115 120 125Gly Ala Arg Val Phe Lys Asp Ala Ser Lys Trp Ser Glu Arg Glu Gln130 135 140Ser Gln Asn Pro Glu Ala Gly Asp Arg Arg Ile Phe Met Asp Pro Ala145 150 155 160Ser Phe Val Ala Glu Glu Leu Asn Leu Gly Pro Leu His Tyr Ser Val165 170 175Asp Ala Ala Cys Ala Thr Ala Leu Tyr Val Leu Arg Leu Ala Gln Asp180 185 190His Leu Val Ser Gly Ala Ala Asp Val Met Leu Ala Gly Ala Thr Cys195 200 205Phe Pro Glu Pro Phe Phe Ile Leu Ser Gly Phe Ser Thr Phe Gln Ala210 215 220Met Pro Val Ser Gly Asp Gly Ile Ser Tyr Pro Leu His Lys Asp Ser225 230 235 240Gln Gly Leu Thr Pro Gly Glu Gly Gly Ala Ile Met Val Leu Lys Arg245 250 255Leu Asp Asp Ala260
<210>55<211>17<212>PRT<213>Ulkenia sp.
<400>55Pro Leu His Tyr Ser Val Asp Ala Ala Cys Ala Thr Ala Leu Tyr Val1 510 15Leu<210>56<211> 4<212>PRT<213>Ulkenia sp.
<400>56Asp Ala Ala Cys1<210>57<211>159<212>PRT<213>Ulkenia sp.
<400>57Tyr Gly Thr Leu Leu Gly Ala Thr Ile Ser Asn Ala Gly Cys Gly Leu1 5 1015Pro Leu Lys Pro His Leu Pro Ser Glu Lys Ser Cys Leu Ile Asp Thr20 25 30Tyr Lys Arg Val Asn Val His Pro His Lys Ile Gln Tyr Val Glu Cys35 40 45His Ala Thr Gly Thr Pro Gln Gly Asp Arg Val Glu Ile Asp Ala Val5055 60Lys Ala Cys Phe Glu Gly Lys Val Pro Arg Phe Gly Ser Ser Lys Gly65 70 7580Asn Phe Gly His Thr Leu Val Ala Ala Gly Phe Ala Gly Met Cys Lys85 90 95Val Leu Leu Ala Met Lys His Gly Val Ile Pro Pro Thr Pro Gly Val100 105 110Asp Gly Ser Ser Gln Met Asp Pro Leu Val Val Ser Glu Pro Ile Pro115 120 125Trp Pro Asp Thr Glu Gly Glu Pro Lys Arg Ala Gly Leu Ser Ala Phe130 135 140Gly Phe Gly Gly Thr Asn Ala His Ala Val Phe Glu Glu Phe Asp145 150 155<210>58<211>426
<212>PRT<213>Ulkenia sp.
<400>58Lys Arg Ile Ala Val Val Gly Met Ala Val Gln Tyr Ala Gly Cys Lys1 5 1015Asp Lys Glu Glu Phe Trp Lys Val Val Met Gly Gly Glu Ala Ala Trp20 25 30Thr Lys Ile Ser Asp Lys Arg Leu Gly Ser Asn Lys Arg Ala Glu His35 40 45Phe Lys Ala Glu Arg Ser Lys Phe Ala Asp Thr Phe Cys Asn Glu Asn50 55 60Tyr Gly Cys Val Asp Asp Ser Val Asp Asn Glu His Glu Leu Leu Leu65 70 75 80Lys Leu Ser Lys Lys Ala Leu Ser Glu Thr Ser Val Ser Asp Ser Thr85 90 95Arg Cys Gly Ile Val Ser Gly Cys Leu Ser Phe Pro Met Asp Asn Leu100 105 110Gln Gly Glu Leu Leu Asn Val Tyr Gln Asn His Val Glu Lys Lys Leu115 120 125Gly Ala Arg Val Phe Lys Asp Ala Ser Lys Trp Ser Glu Arg Glu Gln130 135 140Ser Gln Asn Pro Glu Ala Gly Asp Arg Arg Ile Phe Met Asp Pro Ala145 150 155 160Ser Phe Val Ala Glu Glu Leu Asn Leu Gly Pro Leu His Tyr Ser Val165 170 175Asp Ala Ala Cys Ala Thr Ala Leu Tyr Val Leu Arg Leu Ala Gln Asp180 185 190His Leu Val Ser Gly Ala Ala Asp Val Met Leu Ala Gly Ala Thr Cys195 200 205Phe Pro Glu Pro Phe Phe Ile Leu Ser Gly Phe Ser Thr Phe Gln Ala210 215 220Met Pro Val Ser Gly Asp Gly Ile Ser Tyr Pro Leu His Lys Asp Ser225 230 235 240Gln Gly Leu Thr Pro Gly Glu Gly Gly Ala Ile Met Val Leu Lys Arg245 250 255Leu Asp Asp Ala Ile Arg Asp Gly Asp His Ile Tyr Gly Thr Leu Leu260 265 270Gly Ala Thr Ile Ser Asn Ala Gly Cys Gly Leu Pro Leu Lys Pro His275 280 285Leu Pro Ser Glu Lys Ser Cys Leu Ile Asp Thr Tyr Lys Arg Val Asn290 295 300
Val His Pro His Lys Ile Gln Tyr Val Glu Cys His Ala Thr Gly Thr305 310 315 320Pro Gln Gly Asp Arg Val Glu Ile Asp Ala Val Lys Ala Cys Phe Glu325 330 335Gly Lys Val Pro Arg Phe Gly Ser Ser Lys Gly Asn Phe Gly His Thr340 345 350Leu Val Ala Ala Gly Phe Ala Gly Met Cys Lys Val Leu Leu Ala Met355 360 365Lys His Gly Val Ile Pro Pro Thr Pro Gly Val Asp Gly Ser Ser Gln370 375 380Met Asp Pro Leu Val Val Ser Glu Pro Ile Pro Trp Pro Asp Thr Glu385 390 395 400Gly Glu Pro Lys Arg Ala Gly Leu Ser Ala Phe Gly Phe Gly Gly Thr405 410 415Asn Ala His Ala Val Phe Glu Glu Phe Asp420 425<210>59<211>267<212>PRT<213>Ulkenia sp.
<400>59Met Arg Ile Ala Ile Thr Gly Met Asp Ala Thr Phe Gly Ser Leu Lys1 5 1015Gly Leu Asp Ala Phe Glu Arg Ala Ile Tyr Asn Gly Gln His Gly Ala20 25 30Val Pro Leu Pro Glu Lys Arg Trp Arg Phe Leu Gly Lys Asp Lys Asp35 40 45Phe Leu Asp Leu Cys Gly Val Lys Glu Val Pro His Gly Cys Tyr Ile50 5560Glu Asp Val Glu Val Asp Phe Ser Arg Leu Arg Thr Pro Met Thr Pro65 70 75 80Asp Asp Met Leu Arg Pro Met Gln Leu Leu Ala Val Thr Thr Ile Asp8590 95Arg Ala Ile Leu Asn Ser Gly Leu Lys Lys Gly Gly Lys Val Ala Val100 105 110Phe Val Gly Leu Gly Thr Asp Leu Glu Leu Tyr Arg His Arg Ala Arg115 120 125Val Ala Leu Lys Glu Arg Ala Arg Pro Glu Ala Ala Ser Ala Leu Asn130 135 140Asp Met Met Ser Tyr Ile Asn Asp Cys Gly Thr Ala Thr Ser Tyr Thr145 150 155160
Ser Tyr Ile Gly Asn Leu Val Ala Thr Arg Val Ser Ser Gln Trp Gly165 170 175Phe Glu Gly Pro Ser Phe Thr Ile Thr Glu Gly Asn Asn Ser Val Tyr180 185 190Arg Cys Ala Glu Leu Gly Lys Tyr Leu Leu Glu Thr Gly Glu Val Glu195 200 205Ala Val Val Ile Ala Gly Val Asp Leu Cys Ala Ser Ala Glu Asn Leu210 215 220Tyr Val Lys Ser Arg Arg Phe Lys Val Ser Glu Gln Glu Ser Pro Arg225 230 235 240Ala Ser Phe Asp Ser Gly Ala Asp Gly Tyr Phe Val Gly Glu Gly Cys245 250 255Gly Ala Leu Val Leu Lys Arg Glu Ser Asp Cys260 265<210>60<211>264<212>PRT<213>Ulkenia sp.
<400>60Ala Ala Phe Gly Leu Ser Leu Gly Glu Ile Ser Met Val Phe Ala Phe1 5 1015Ser Glu Lys Asn Gly Leu Val Ser Glu Glu Leu Thr Thr Lys Leu Arg20 25 30Asn Ser Glu Val Trp Arg Lys Ala Leu Ala Val Glu Phe Asp Ala Leu35 40 45Arg Lys Ala Trp Asn Ile Pro Gln Asp Thr Pro Val Ser Glu Phe Trp50 5560Gln Gly Tyr Val Val Arg Gly Thr Arg Glu Ala Val Glu Ala Ala Ile65 70 75 80Gly Pro Asn Asn Lys Tyr Val His Leu Thr Ile Val Asn Asp Ala Asn85 90 95Ser Ala Leu Ile Ser Gly Lys Pro Glu Asp Cys Lys Ala Ala Ile Ala100 105 110Arq Leu Ser Ser Asn Leu Pro Ala Leu Pro Val Asp Leu Gly Met Cys115 120 125Gly His Cys Pro Val Val Glu Pro Tyr Gly Lys Gln Ile Ala Glu Ile130 135 140His Ser Val Leu Glu Ile Pro Glu Val Ala Gly Leu Asp Leu Tyr Thr145 150 155 160Ser Val Asn Gln Lys Lys Leu Val Asn Lys Ser Thr Gly Ala Ser Asp165 170 175
Glu Tyr Ala Pro Ser Phe Gly Glu Tyr Ala Ala Gln Leu Tyr Thr Val180 185 190Gln Ala Asp Phe Pro Lys Ile Ala Lys Thr Val Ser Asp Lys Asn Phe195 200 205Asp Val Phe Val Glu Thr Gly Pro Asn Ala His Arg Ser Ala Ala Ile210 215 220Arg Ala Thr Leu Gly Asn Ser Lys Pro Phe Val Thr Gly Ser Met Asp225 230 235 240Arg Gln Asn Glu Asn Ala Trp Thr Thr Met Val Lys Leu Val Ala Ser245 250 255Leu Gln Ala His Arg Val Pro Gly260<210>61<211>434<212>PRT<213>Ulkenia sp.
<400>61Ser Arg Ala Phe Met Glu Thr Tyr Gly Val Ser Ala Pro Met Tyr Thr1 5 1015Gly Ala Met Ala Lys Gly Ile Ala Ser Ala Glu Met Val Ile Ala Ala20 25 30Gly Lys Arg Gly Ile Leu Gly Ser Leu Gly Ala Gly Gly Leu Pro Ile35 4045Ala Thr Val Arg Lys Ala Leu Glu Ala Ile Gln Ala Glu Leu Pro Lys50 5560Gly Pro Tyr Ala Val Asn Leu Ile His Ser Pro Phe Asp Ser Asn Leu65 70 75 80Glu Lys Gly Asn Val Asp Leu Phe Leu Glu Lys Gly Val Thr Val Val85 90 95Glu Ala Ser Ala Phe Met Thr Leu Thr Pro Gln Leu Val Arg Tyr Arg100 105 110Ala Ala Gly Leu Ser Arg Ala Ala Asp Gly Ser Thr Val Ile Lys Asn115 120 125Arg Val Ile Gly Lys Val Ser Arg Thr Glu Leu Ala Ala Met Phe Ile130 135 140Arq Pro Ala Pro Glu Asn Leu Leu Glu Lys Leu Leu Lys Ser Gly Glu145 150 155 160Ile Thr Gln Glu Gln Ala Ala Leu Ala Arg Thr Val Pro Val Ala Asp165 170 175Asp Ile Ala Val Glu Ala Asp Ser Gly Gly His Thr Asp Asn Arg Pro180 185 190
Ile His Val Ile Leu Pro Leu Ile Val Asn Leu Arg Asp Arg Leu His195 200 205Lys Glu Cys Gly Tyr Pro Ala His Leu Arg Val Arg Val Gly Ala Gly210 215 220Gly Gly Ile Gly Cys Pro Gln Ala Ala Ile Ala Thr Phe Asn Met Gly225 230 235 240Ala Ala Phe Ile Val Thr Gly Thr Val Asn Gln Met Ser Lys Gln Ala245 250 255Gly Thr Cys Asp Thr Val Arg Lys Gln Leu Ser Gln Ala Thr Tyr Ser260 265 270Asp Ile Cys Met Ala Pro Ala Ala Asp Met Phe Glu Glu Gly Val Lys275 280 285Leu Gln Val Leu Lys Lys Gly Thr Met Phe Pro Ser Arg Ala Asn Lys290 295 300Leu Tyr Glu Leu Phe Val Lys Tyr Asp Ser Phe Glu Ser Met Ala Pro305 310 315 320Gly Glu Leu Glu Arg Val Glu Lys Arg Ile Phe Lys Lys Ser Leu Ser325 330 335Glu Val Trp Glu Glu Thr Lys Asp Phe Tyr Ile Asn Arg Leu Gln Asn340 345 350Pro Glu Lys Ile Glu Arg Ala Glu Arg Asp Pro Lys Leu Lys Met Ser355 360 365Leu Cys Phe Arg Trp Tyr Leu Gly Leu Ala Ser Phe Trp Ala Asn Ala370 375 380Gly Ile Pro Asp Arg Ala Met Asp Tyr Gln Val Trp Cys Gly Pro Ala385 390 395 400Ile Gly Ser Phe Asn Asp Phe Ile Lys Gly Thr Tyr Leu Asp Pro Ala405 410 415Val Ala Asn Glu Tyr Pro Asp Val Val Gln Ile Asn Leu Gln Ile Leu420 425 430Arg Gly<210>62<211>2000<212>DNA<213>Ulkenia sp.
<400>62gagcacgcac catcttctct ccacgcgtaa agaagagcag agccagaggc aggtaggtat60ctccacccat ctcaggctgt gacttctttg tttctttctt tctttgcttg ttttctgttc120tctctctgtg ctctgtccac acgagaaaga gaaagagaga gagaaagaac cacgggttta180tagagcgcac tcgtccttcc tgcttcagca gaaagcactg cgtaggagaa ctacggggga240
ggaggaagca cgcacggagg aggcgtggaa ggaaggagga gacagagaga gagagacact300gagggacaga gggggagagg cagagggaga ggcatctgat gtttgcgaga aaccaataag360ttttgaaagt gatttgattt agctgattga ctgatctatg gcctgaaaga aagcttttaa420agcggaggga gatagatgac gagggcagct gcgatggcgt acggcgcatc cgtctctctc480tgtgtctctc tctctttctc tctcgtcagg gcgtggagac ctcggaagct gcacgcggcg540cggtgaggag gcagggcagc agagggagag gagagatccc agagtcgaag agcattgatt600gattgcagat gatcttgggc aacgcgcgtc agcttgagcg aggaatgctt tggacttcag660gttcttcgct tctgtgtttc attctttctc gaagaaagaa agaatgaaag aaagagagaa720agaaagaaag aaagaaagaa agaaagaaag aaagaaagaa tgaatgaatg aaagaaagag780agaaagaaag aacgaatgaa agaaagagag aaagaatcaa agagaaagcg cattcgcagt840tcttcttcgt gaaagaaaag gaaaagagag gcgatggtag gctctgatct catcatttct900ggtttctctg ttgtacctgt actctgtgct tgtggccttg cgaaggctga agacgccatg960cagacaacca cgcctccgca gagactttgc gggaaagcag agggcttctc gccactctcg1020aagaaacgag ctcgccagtt ttcggggttg ttctcagaat tgcgagtgtt ggctttatat1080gggatgatgg tatggcactt cgtcatcgtt actctcgctc gcttgcttac gaagattttc1140aaaagggcga aagaagtgct cagcttttaa aataaagtca caccaaagac taggccgcat1200agcagaaagc taaagtaaac ccaatctgtc tgaagagagt gtcgtggtta gatacttacg1260caagagttta aaagctgtaa atagtacagg aacaaaaaca aataaatata tatatattct1320tttttattag taaaacatga aaccaaaaaa ctcctttaaa ataaaataaa ataaaataaa1380ataaaataaa ataaaataaa tttactacta tatatacata tatatataca ataaataaaa1440acaacttttt cagaccagaa aaagactgag aaaaaaggaa actaatgact ctcgagcacc1500gagagcgata taagagtgga ttatatttgc taggcccacc acgagtgagt cccctaggag1560gaagcgccct ctgagacagg agcagaggcg tcgctggtgc tccaaaaagc gacggcgaat1620ggaaagcaaa accctttcga gggaggcttg tggccgtgac tattcaaatc tccagcatct1680cagctccagc acagcagaag ctacctcgct tctcagctct agctatcaca tcgatcgcag1740catctagctc gtagacagct agcgccgcac cttcccccaa atcaacttgg gcaacttaac1800tcttttttca ccagaactcc tcttttcctt taatcttcga aaagaagacg aataaaagag1860ataatcctct gccgcagcac attctaaaag aaaagcggca tactggcgta ggcaagactt1920tcaagctctt cctcgcctcc accccgtatt tccctgttca tctttgtgaa acgaggaaac1980aagaaatttt ataggacaag2000
<210>63<211>2000<212>DNA<213>Ulkenia sp.
<400>63agttgtgagg ctgtcttgtc ttgtcagtcg cgaaagtgta agcaagaact ttgtcataca60aagaagcaac caacttccga accaacacac cttgtaggat tacaaccaca actttctata120aatagtgcgc aagaataacc agtaagctat ccttcgtgta cctgttacaa caacgacatt180tttacttgat cttcctactt gtgatgggta gtcccggctt gtactgacag tgatgccaca240gcagagtaga tcactgtgaa taagtaaata agcctactta ttatattccc aaagtactcg300ctgggatatt attagtatca cgaaaagtga tatgttttat aactcgcttg tcttgccaag360atctaacctt ttttttttaa atggccaaaa agtcgccaga acacatctta caataaacaa420aaatttagat tatatcgtat gtataatgta taatatatta tattattata tacatacgat480ataatctaaa gccattccag acttattcgg tgatgaaaaa tgctttccca gctttataca540aactattcaa aaagttgcat gacccatttt cagatatatt taatagtata agattatgtc600catttgtttt caaagttatt caagagttta catcttgaag tttcatccct ttactactac660actgtttttc gtttgggttt tttctctaac ggcgaaagaa acaagtcacc aagcttaact720agtaggcatc tttgtggtga cgaaattaaa gttgaatata taaattatag ttagtcatta780tggaatctca gtttgaacga agctaagcta tttataaaaa tcactgcatg gagataatac840ttgaattttg atgatagtgt ttatgaagaa gtttaatctt gctttttatt aatgttattc900tctaatatag aaatatttca ataaaaaaat catatgaagg gataataaat acagagaatg960atcgttatca tttgatatgt cgaacgctaa tctatcatct tatctaggaa acaaaggtgg1020aaataaagga aagccctaca cgagttaatt cctcaaacga actactttgg attatcaaat1080ccaactgctg acactggata catgcatgta tttagtgggt gttactgtac ttccttattt1140cctttaattc aattgtcttg atttttactt cggagattct acttgaaaat catctccctt1200cacttccggt tatacagaaa gacccttcaa ttcgaatgct ggccaggtac aataactatc1260agcgattccc ctccactaga catgaccgac tgtaagcacc tcaacccgat ttcaagcaac1320acatgatgac tagctgtttc cgcaaaacaa caaataagag aggtagtgga aaacacccag1380ttcgctcgag ctcccctagt agattcgaca ttcactttct atttgattgc taattgtggg1440tccggctatt taaggaaaga actgatgaaa gtccacctca cgcaatcaaa tcgcggtcta1500gttggaagct acaatggccg acgtatgcgc gcctctatct tttaggattg tagaacaggg1560cggcaatctg ctaacataaa tttaatacct tgctcaagct gctttccata cttttcaatc1620catttgtgat aatcttgcaa tggaccaatc tccaaatctg tagaagcaat aacaaggaca1680
tcgcagggtc ccggttcgtt tgcatgctcg tcttctggtg ccacaacaat gctgcctgtt1740attatctcat gagagtcttt atactgcgga tccgtggcta tagcgtgaat aaacgttgtg1800cgcaagccta tatcctcgcg atggagatac tggcctgcta cagtttgcgt tcgtctgcct1860acgacaacgc atggaacatt ctttggtgtg cgagtgggcc gtagcgttcg accctgggca1920aggaagccat gcagacgtga ttccgagagg ccatctcgcg tgtaagactt atcccaattt1980tctggatcct ctaatttcca2000<210>64<211>2000<212>DNA<213>Ulkenia sp.
<400>64aaattaatga atgaatcaat gaatgaatca atgaataatg ccaatgcaat gcgatgcgat60gctgcttcga gccatcgcac ggcggccatt gcgcgcttgc gtcagtcatg tcattccatt120cggagcggcg tgcgcgaggg agggagggag ggagggagaa gacgaggagc aggcggagag180agaggaggat gggcgggcgg gcggcgtcgt cggcgtcgtc gtcgtcgtgg gcctccgtag240tcgctgggaa ggagggcttt gattccaaat gaggattttg gtgcactgct ttcgagactt300tctcgcctga ttcggaattc ctcctcttct tcttcttttt agctgtgctt tctgcgtatt360cattgcgtgg gtttggcttg gttttcaaat caattagcag tctagtaact aacaaactaa420caaacagata aacagacaaa cagacaaaca aacaaaacaa acaaaacaaa caaaacaaag480caggaaagaa agaaacaaac aaatatacaa acaaagaaag aaagaagtgg tgggaactag540ggaaatcaat gtgtttgctt ctttcgcacc tttgcttttc ttgcttttct tggttctcaa600gtaagcgttt atcgcgccct cagaaaacaa aataaaatga tctaacataa catgaattta660tatttatttt atttgtttat taaataaata ttttttgtaa accagaattt cactctactt720ttgcaacact gagagagtgc catctgcata ataagtggca gtgttttttt gtttattttc780aaattaatta tacttgaact gctaggtcaa gaggccgcag cagcctgatg agataaggac840agagtaggca aggatggcag aagatcgcga aaaaagcgag aaaggcaaac gagcaggccc900gaaggtgagg tggagctgct tgtcaaggtc gcgaggtttg tttgacagtt ataacagcaa960gaactaaggc aatttcaaga atgaagagca ctcgaataaa ccgatgaagc aaagtgtgta1020catacaaaca tacatacgta cagatgaaaa gaacagattt tcaataaaaa tgacttttta1080gtttaaacaa tgtttctgtt tgttgtttcg cttttcatta atttgttgca aattattttg1140ttttggtttt tgtttttgtt tttgaaaatc ataaaagaga tgctgccgca gacgtctgcg1200cgtctcatag ttgattgggt aatcgttttg ttgagttttg aaaatgtaaa cttcacttag1260
ttgctcattt atcctcattc gtttgcccat ttgttctctg tttgaagcag agttttgact1320tctcgcattc gtggaatcca ccccttgctt gctttgcttg cttgcttgct tgcttgcttg1380cctgcttgct ttgcttgctt gcttgaccag cgtgcgcgct ttcgccagcc tagccttcga1440gacctcttga agaccctttg gagcgtctag ttcgaggttc tttctatttg cttcaagaga1500gacaaaataa caaagaaaaa gagagaaaaa acaagcaaag aaagaaacaa ggaaacaaac1560cacaaagcac gcatcgtgca tccaaacttt catcccccca ctctctctct ctctctctct1620ctctctctcc ttcctcggaa aaggagtgag acaaaggcag acagcctcta gcttggcagc1680ctcgcagctc gtgcggcgcc agttcctaca gcttcgcgct gtccaaacgc cagtccatcg1740cagcttcggc tagctagttg gctgattgat tgattgattg attgatagcc tttattacgg1800cgttgattaa ctgattgatt atttgattgc tctggcatcc ctgtaatcac ttgctcaagg1860tagtcaatca catcatttat acatctcctc caaagcaaac catctacacg accgcttttt1920gatcgatcta aaagtgccgg tcaggtgaca cgcaagctct tttttttgtt tacagtaagc1980agcaacaaga aagcaaaaag2000<210>65<211>2000<212>DNA<213>Ulkenia sp.
<400>65gcccaatttg ctcctgatct gttcccatga ttatgatagg gataggtagt agttatagct60agactcattc cattcactta atccacatat gcaaattata attttatgtg tcgcatataa120actttccaaa ctttaaaatt ttcatttgca ttttatatat agatcacctg tgatcccttt180ctcgcccctt tcaacttcca aagtttacct actatcatat ggcatggcgc agccaatgca240ctctataaca tataagtaac agagatagtt tttgccgcat catttactct ttactcttgc300tatacaaggt aagcgccaag agagttaatt acatctgttt tatcggttcc tagtggaaat360aatagtgaca actataatta gtaggagtcc ttattgaccc tagtcatttg agcttgcacc420agatttgatg tttttgcaaa cgaccttgac gcagagtgac gagcgaaaat tggatcccct480tggttgaagt ctaaactagc ttaaaatata tatgctcttc atataatata aagctgtttt540agattctatc aaataagaaa ttgatgactt tgagcaaatt aatatttggt atgggctccg600gcatctctga aaacgcttaa atgaagcttt tattcaccac gattcgacaa ctaaggttat660tttccacata attataactt ttcctacata actgtgctgt cgactcacac cttctttata720tatatagcct cgtagggatt cgaaactatg aattaagact cgttgaagtt tgatttatcc780attattttgc tgcacaaact atcgctaaga tataaagatc gtgcccagag cctgctatag840
ggtcctaatg gcatgcttag cccggatttc cacgataaag ctgcattgta ttgagtatat900gcactcagag agtaaacttt aattgcaacg aacaatcttt ggcaagtcat atctcagcca960tcaatacatg tattgtgttc aaacgaattg cagcatatca ctcaaattat tttggtctag1020ttcagcggaa tcttttggtt gttttagtaa gagttgagta gagtatgttg gatgagtgtg1080tccacaaggt tatttgaata gggtatttac attctacaac atagtcagta agctctcgtg1140tgataaactg tatcaaaatc gacacaataa caggctagtg gtgccctgtg cacgttttta1200ccataacatg acagctacag catcagaaac aggtgtggtg cgcattttgg ttattctgat1260cctgaaacct aagaacaatt ttcatcgtct tgctagattg tgttttctgt attccatttg1320tggagcttca acatccatgc tgctgagtat tttcacatga agatcatagt gttagaatgt1380ttagtaagcc tattactaag ttttgaggta taggtgcttg ttgttgtcct tacataaata1440catgctgtct ttagtgctta gaccaacgtt gagtgtatcg tgctcttggc agaagaatag1500acatttataa cattatggtg aaaggcgatg gtctcgcttg catgttctcg cttgcgtttg1560cgtatcccta tacacttaac cgttgtttat gtgtacctaa gctatcatgc tgcatcttta1620caattttata caaataaatt tattttggaa tatataattg gtcactattt caggccagtt1680gacagtcctt aagatttgta gttgcgctgt tctcgtagtg agaatgaaga agcggaatct1740acatccatct gtgattgcat aagagcttgc ataagagtga agtaggtgaa agtcacagag1800aatatcttcc ctactatcct aaaggcaagg aatactacta tacacgaaca tagtaatgga1860attttacaca acagaagtac ccttgtctcc tgcctccttt tattattcca ttatgctctg1920ttatataatg aatgaagacg acttttaaca tcatttgatt ctcgagcagg cacgcacaat1980atagaggaag gattggcgtc2000<210>66<211>1212<212>DNA<213>Ulkenia sp.
<400>66ggcaagaacg tcgttttcga ctatgacgag ctccttgagt tcgccgaggg tgacatcagc60aaggtcttcg gccccgaatt cagccagatc gaccagtaca agcgtcgcgt tcgtctcccc120gcccgcgagt acctcctcgt cacccgcgtc accctcatgg acgccgaggt caacaactac180cgcgtcggtg cccgcatggt cactgagtac gacctccccg tcaacggtga gctctctgag240ggtggtgact gcccctgggc cgtgctcgtc gagagtggtc agtgtgatct catgctcatc300tcctacatgg gtattgactt ccagaacaag agcgaccgcg tctaccgtct gctcaacacc360accctcacct tctacggtgt tgcccaggag ggcgagaccc tggagtacga catccgcgtg420
accggcttcg ccaagcgtct cgacggtgac atctccatgt tcttcttcga gtacgactgc480tacgtcaacg gccgtctcct catcgagatg cgcgacggct gtgccggttt cttcaccaac540gaggagctcg ccgccggcaa gggtgtcgtc tttacccgcg ctgatctcct cgcccgcgag600aagaccaaga agcaggacat caccccgtac gccattgccc cgcgtcttaa caagaccgtt660ctcaacgaga ctgagatgca gtccctcgtg gacaagaact ggaccaaggt tttcggcccc720gagaacggca tggaccagat caactacaaa ctctgcgccc gtaagatgct catgattgac780cgcgtcacca agattgacta caccggtggc ccctacggcc ttggtcttct cgttggtgag840aagatcctcg agcgcgacca ctggtacttt ccgtgccact tcgtcggaga ccaggtcatg900gctggatccc tcgtgtctga cggctgcagc cagctcctca agatgtacat gctctggctc960ggcctccacc ttaagaccgg tcccttcgac ttccgccccg tcaacggcca ccccaacaag1020gtccgctgcc gtggccagat ctccccgcac aagggtaagc tcgtatacgt catggagatc1080aaggagatgg gctacgacga ggctggtgac ccgtacgcca tcgccgatgt caacattctc1140gacattgact tcgagaaggg ccagactttc gaccttgcca acctccacga gtacggcaag1200ggcgacctca ac1212<210>67<211>21<212>DNA<213>Ulkenia sp.
<400>67tggtactttc cgtgccacttc 21<210>68<211>1197<212>DNA<213>Ulkenia sp.
<400>68gtgcccggcg agatgccgct ctcgtggtac aacatggctg agttcatggc cggcaaggtc60agcctctgcc tcggccctga gttcgccaag ttcgatgact ccaacaccag ccgcagccct120gcatgggacc ttgctcttgt gactcgtgtg gtctccgttt ctgacatgga gtgggtccag180tggaagaacg tggactgcaa cccgtccaag ggaaccatgg ttggcgagtt cgactgcccc240atcgacgcct ggttcttcca gggatcttgt aacgacggcc acatgccgta ctccatcctc300atggagatcg ccctccagac ctctggtgtc ctcacctctg tgctcaaggc cccgctcacc360atggagaaga aggacattct cttccgcaac cttgacgcca acgccgagat ggttcgctct420gatattgacc tccgcggcaa gaccatccac aacctcacca agtgtaccgg ctacagcatg480ctcggagaca tgggtgtcca ccgcttcagc ttcgagctct ctgttgatgg tgtagtcttc540
tacaagggta ccacctcctt cggctggttc gtccctgagg tcttcatctc ccagactggt600ctcgacaacg gtcgccgcac ccagccctgg cacattgagt ccaaggtgcc ttccgcccag660gtcctcacct acgacgttac ccccaacggt gccggtcgca cccagctcta cgccaacgcc720cccaagggcg ctcagctcac tcgccgctgg aaccagtgcc agtaccttga caccatcgac780cttgtggtcg ccggtggctc cgccggtctt ggctacggtc atggccgcaa gcaggtgaac840cccaaggact ggttcttctc gtgccacttc tggttcgact ccgtcatgcc cggctcgctc900ggtgtggagt ctatgttcca gctcgtcgag tccatcgctg tcaagcagga cctcgccggc960aagtacggca tcaccaaccc gaccttcgct catgctccgg gcaagatctc ctggaagtac1020cgtggtcagc tcacccccac ctccaagttc atggactccg aggcccacat tgtctccatc1080gaggcccacg acggcgtcgt cgacatcgtt gccaatggta acctctgggc tgatggcctc1140cgcgtctaca acgtcagcaa catccgtgtg cgcattgttg ctggcgccgc ccctgct 1197<210>69<211>21<212>DNA<213>Ulkenia sp.
<400>69tggttcttct cgtgccactt c 21<210>70<211>90<212>DNA<213>Ulkenia sp.
<400>70gctggcgccg cccctgctgc tgctgctgct gctgctgctg ttgctgctcc ggctgccgcc60cctgctccgg ttgctgcatc tggccctgcc 90<210>71<211>1299<212>DNA<213>Ulkenia sp.
<400>71gaaggcttca tgaagaccta cggtgttgtg gctcctctct acaccggtgc catggccaag60ggtattgcct ctgctgacct tgtgattgcc actggtaagc gcaagatcct cggttccttc120ggtgctggcg gtctccccat gcacattgtc cgtgccgctg ttgagaagat ccaggctgag180ctcccgaacg gccccttcgc cgtcaacctc atccactccc ccttcgatag caaccttgag240aagggcaacg ttgacctctt cctcgagaag ggcgttactg tcgtcgaggc ctccgccttc300atgaccttga ccccgcaagt cgtccgctac cgtgctgctg gtctttcccg taacgctgat360
ggctccatta acatcaagaa ccgcatcatc ggtaaggtct cccgtaccga gctcgctgag420atgttcatcc gccctgcccc gcagaacctc ctcgacaagc tcatccagtc tggtgagatt480accaaggagc aggctgagct tgccaagctc gtccccgtcg ccgacgacat cgccgtcgag540gccgactctg gtggccacac cgacaaccgc cccatccacg tcatcctccc ccttatcatc600aacctccgca accgcctcca caaggagtgc ggctaccccg ctcacctccg cgtgcgcgtt660ggagctggtg gtggtgttgg atgcccccag gccgctgccg ctgctctcgc tatgggtgct720gccttccttg ttaccggcac tgtcaaccag gtcgccaagc agtccggcac ctgcgacaat780gtccgcaagc agctctgcat ggccacctac tctgacgtct gcatggctcc cgctgctgac840atgttcgagg agggcgtcaa gctccaggtc ctcaagaagg gaaccatgtt cccgtccagg900gctaacaagc tctacgagct cttctgcaag tacgactcct tcgagtccat gcctgccaca960gagctcgagc gtgttgagaa gcgcatcttc cagtgccctc ttgctgatgt ctgggctgag1020acctccgact tctacatcaa ccgcctccac aacccggaga agatcacccg tgccgagcgt1080gaccccaagc tcaagatgtc tctctgcttc cgctggtacc ttggtcttgc ctctcgctgg1140gccaacaccg gtgaggctgg acgcgtcatg gactaccagg tctggtgtgg ccctgccatt1200ggagccttca acgacttcat caagggctcc taccttgacc cggccgtctc tggtgagtac1260ccggacgtcg tgcagatcaa cttgcagatc cttcgcggt 1299<210>72<211>404<212>PRT<213>Ulkenia sp.
<400>72Gly Lys Asn Val Val Phe Asp Tyr Asp Glu Leu Leu Glu Phe Ala Glu1 510 15Gly Asp Ile Ser Lys Val Phe Gly Pro Glu Phe Ser Gln Ile Asp Gln20 25 30Tyr Lys Arg Arg Val Arg Leu Pro Ala Arg Glu Tyr Leu Leu Val Thr35 40 45Arg Val Thr Leu Met Asp Ala Glu Val Asn Asn Tyr Arg Val Gly Ala50 5560Arg Met Val Thr Glu Tyr Asp Leu Pro Val Asn Gly Glu Leu Ser Glu65 70 75 80Gly Gly Asp Cys Pro Trp Ala Val Leu Val Glu Ser Gly Gln Cys Asp8590 95Leu Met Leu Ile Ser Tyr Met Gly Ile Asp Phe Gln Asn Lys Ser Asp100 105 110Arg Val Tyr Arg Leu Leu Asn Thr Thr Leu Thr Phe Tyr Gly Val Ala
115 120 125Gln Glu Gly Glu Thr Leu Glu Tyr Asp Ile Arg Val Thr Gly Phe Ala130 135 140Lys Arg Leu Asp Gly Asp Ile Ser Met Phe Phe Phe Glu Tyr Asp Cys145 150 155 160Tyr Val Asn Gly Arg Leu Leu Ile Glu Met Arg Asp Gly Cys Ala Gly165 170 175Phe Phe Thr Asn Glu Glu Leu Ala Ala Gly Lys Gly Val Val Phe Thr180 185 190Arg Ala Asp Leu Leu Ala Arg Glu Lys Thr Lys Lys Gln Asp Ile Thr195 200 205Pro Tyr Ala Ile Ala Pro Arg Leu Asn Lys Thr Val Leu Asn Glu Thr210 215 220Glu Met Gln Ser Leu Val Asp Lys Asn Trp Thr Lys Val Phe Gly Pro225 230 235 240Glu Asn Gly Met Asp Gln Ile Asn Tyr Lys Leu Cys Ala Arg Lys Met245 250 255Leu Met Ile Asp Arg Val Thr Lys Ile Asp Tyr Thr Gly Gly Pro Tyr260 265 270Gly Leu Gly Leu Leu Val Gly Glu Lys Ile Leu Glu Arg Asp His Trp275 280 285Tyr Phe Pro Cys His Phe Val Gly Asp Gln Val Met Ala Gly Ser Leu290 295 300Val Ser Asp Gly Cys Ser Gln Leu Leu Lys Met Tyr Met Leu Trp Leu305 310 315 320Gly Leu His Leu Lys Thr Gly Pro Phe Asp Phe Arg Pro Val Asn Gly325 330 335His Pro Asn Lys Val Arg Cys Arg Gly Gln Ile Ser Pro His Lys Gly340 345 350Lys Leu Val Tyr Val Met Glu Ile Lys Glu Met Gly Tyr Asp Glu Ala355 360 365Gly Asp Pro Tyr Ala Ile Ala Asp Val Asn Ile Leu Asp Ile Asp Phe370 375 380Glu Lys Gly Gln Thr Phe Asp Leu Ala Asn Leu His Glu Tyr Gly Lys385 390 395 400Gly Asp Leu Asn<210>73<211>7<212>PRT<213>Ulkenia sp.
<400>73
Trp Tyr Phe Pro Cys His Phe1 5<210>74<211>399<212>PRT<213>Ulkenia sp.
<400>74Val Pro Gly Glu Met Pro Leu Ser Trp Tyr Asn Met Ala Glu Phe Met1 510 15Ala Gly Lys Val Ser Leu Cys Leu Gly Pro Glu Phe Ala Lys Phe Asp20 25 30Asp Ser Asn Thr Ser Arg Ser Pro Ala Trp Asp Leu Ala Leu Val Thr35 40 45Arg Val Val Ser Val Ser Asp Met Glu Trp Val Gln Trp Lys Asn Val50 5560Asp Cys Asn Pro Ser Lys Gly Thr Met Val Gly Glu Phe Asp Cys Pro65 70 75 80Ile Asp Ala Trp Phe Phe Gln Gly Ser Cys Asn Asp Gly His Met Pro85 90 95Tyr Ser Ile Leu Met Glu Ile Ala Leu Gln Thr Ser Gly Val Leu Thr100 105 110Ser Val Leu Lys Ala Pro Leu Thr Met Glu Lys Lys Asp Ile Leu Phe115 120 125Arg Asn Leu Asp Ala Asn Ala Glu Met Val Arg Ser Asp Ile Asp Leu130 135 140Arg Gly Lys Thr Ile His Asn Leu Thr Lys Cys Thr Gly Tyr Ser Met145 150 155 160Leu Gly Asp Met Gly Val His Arg Phe Ser Phe Glu Leu Ser Val Asp165 170 175Gly Val Val Phe Tyr Lys Gly Thr Thr Ser Phe Gly Trp Phe Val Pro180 185 190Glu Val Phe Ile Ser Gln Thr Gly Leu Asp Asn Gly Arg Arg Thr Gln195 200 205Pro Trp His Ile Glu Ser Lys Val Pro Ser Ala Gln Val Leu Thr Tyr210 215 220Asp Val Thr Pro Asn Gly Ala Gly Arg Thr Gln Leu Tyr Ala Asn Ala225 230 235 240Pro Lys Gly Ala Gln Leu Thr Arg Arg Trp Asn Gln Cys Gln Tyr Leu245 250 255Asp Thr Ile Asp Leu Val Val Ala Gly Gly Ser Ala Gly Leu Gly Tyr260 265 270
Gly His Gly Arg Lys Gln Val Asn Pro Lys Asp Trp Phe Phe Ser Cys275 280 285His Phe Trp Phe Asp Ser Val Met Pro Gly Ser Leu Gly Val Glu Ser290 295 300Met Phe Gln Leu Val Glu Ser Ile Ala Val Lys Gln Asp Leu Ala Gly305 310 315 320Lys Tyr Gly Ile Thr Asn Pro Thr Phe Ala His Ala Pro Gly Lys Ile325 330 335Ser Trp Lys Tyr Arg Gly Gln Leu Thr Pro Thr Ser Lys Phe Met Asp340 345 350Ser Glu Ala His Ile Val Ser Ile Glu Ala His Asp Gly Val Val Asp355 360 365Ile Val Ala Asn Gly Asn Leu Trp Ala Asp Gly Leu Arg Val Tyr Asn370 375 380Val Ser Asn Ile Arg Val Arg Ile Val Ala Gly Ala Ala Pro Ala385 390 395<210>75<211>7<212>PRT<213>Ulkenia sp.
<400>75Trp Phe Phe Ser Cys His Phe1 5<210>76<211>30<212>PRT<213>Ulkenia sp.
<400>76Ala Gly Ala Ala Pro Ala Ala Ala Ala Ala Ala Ala Ala Val Ala Ala1 510 15Pro Ala Ala Ala Pro Ala Pro Val Ala Ala Ser Gly Pro Ala20 25 30<210>77<211>433<212>PRT<213>Ulkenia sp.
<400>77Glu Gly Phe Met Lys Thr Tyr Gly Val Val Ala Pro Leu Tyr Thr Gly1 5 1015Ala Met Ala Lys Gly Ile Ala Ser Ala Asp Leu Val Ile Ala Thr Gly20 25 30Lys Arg Lys Ile Leu Gly Ser Phe Gly Ala Gly Gly Leu Pro Met His
3540 45Ile Val Arg Ala Ala Val Glu Lys Ile Gln Ala Glu Leu Pro Asn Gly50 5560Pro Phe Ala Val Asn Leu Ile His Ser Pro Phe Asp Ser Asn Leu Glu65 70 75 80Lys Gly Asn Val Asp Leu Phe Leu Glu Lys Gly Val Thr Val Val Glu85 90 95Ala Ser Ala Phe Met Thr Leu Thr Pro Gln Val Val Arg Tyr Arg Ala100 105 110Ala Gly Leu Ser Arg Asn Ala Asp Gly Ser Ile Asn Ile Lys Asn Arg115 120 125Ile Ile Gly Lys Val Ser Arg Thr Glu Leu Ala Glu Met Phe Ile Arg130 135 140Pro Ala Pro Gln Asn Leu Leu Asp Lys Leu Ile Gln Ser Gly Glu Ile145 150 155 160Thr Lys Glu Gln Ala Glu Leu Ala Lys Leu Val Pro Val Ala Asp Asp165 170 175Ile Ala Val Glu Ala Asp Ser Gly Gly His Thr Asp Asn Arg Pro Ile180 185 190His Val Ile Leu Pro Leu Ile Ile Asn Leu Arg Asn Arg Leu His Lys195 200 205Glu Cys Gly Tyr Pro Ala His Leu Arg Val Arg Val Gly Ala Gly Gly210 215 220Gly Val Gly Cys Pro Gln Ala Ala Ala Ala Ala Leu Ala Met Gly Ala225 230 235 240Ala Phe Leu Val Thr Gly Thr Val Asn Gln Val Ala Lys Gln Ser Gly245 250 255Thr Cys Asp Asn Val Arg Lys Gln Leu Cys Met Ala Thr Tyr Ser Asp260 265 270Val Cys Met Ala Pro Ala Ala Asp Met Phe Glu Glu Gly Val Lys Leu275 280 285Gln Val Leu Lys Lys Gly Thr Met Phe Pro Ser Arg Ala Asn Lys Leu290 295 300Tyr Glu Leu Phe Cys Lys Tyr Asp Ser Phe Glu Ser Met Pro Ala Thr305 310 315 320Glu Leu Glu Arg Val Glu Lys Arg Ile Phe Gln Cys Pro Leu Ala Asp325 330 335Val Trp Ala Glu Thr Ser Asp Phe Tyr Ile Asn Arg Leu His Asn Pro340 345 350Glu Lys Ile Thr Arg Ala Glu Arg Asp Pro Lys Leu Lys Met Ser Leu355 360 365
Cys Phe Arg Trp Tyr Leu Gly Leu Ala Ser Arg Trp Ala Asn Thr Gly370 375 380Glu Ala Gly Arg Val Met Asp Tyr Gln Val Trp Cys Gly Pro Ala Ile385 390 395 400Gly Ala Phe Asn Asp Phe Ile Lys Gly Ser Tyr Leu Asp Pro Ala Val405 410 415Ser Gly Glu Tyr Pro Asp Val Val Gln Ile Asn Leu Gln Ile Leu Arg420 425 430Gly<210>78<211>2000<212>DNA<213>Ulkenia sp.
<400>78gcacgtagag caagaaagaa tgaaagaaag aacgaaagaa agaaagagag agagagagag60agagagagag agaaagcgaa gatgatagcg gagagaactc ttcttcgcag tcactctgtt120tctcagtcag tcccgcaacc aataacaact cgaactcgca gcagtgttct tcggagtgcc180agcgctcgct cgcactgcgt cggcacagca gcagcagcag caggccccgc gctcgctgca240ctcagcccgg gcaggagcaa cagctgctga gcagctgagg ccagctggct ggcggctcgc300ctcgcctcgc ctcgcgtcgc gtcgcgagag aaagcgatcg accaactgtc aatcgattat360tcgagtcctt cgagcgcttt atagggcact gattgatcac tcattgattc attgactcat420ttattctttg cgtggtcagc caaacggcgt tagcattggg caaagcgggt ctttgctttg480ctctaaaata gatttgctcg cgagagtacg tacttgcagg agtaggtagg ctctgcctag540tacctgggca tttgaatatt tgaacttcga acttcgttga gtatctgaat atttgaatat600ctgaatattt gaatttcgaa agtttgaata tttgaatatt tgaattttgg aatattggaa660tagctgggtt tggagataag acttactaag ctaagcgccg acgtaagagc ggcgagtaaa720tccacacaca agagagaggc agagagagag ggagggagac aactcgcgca ggcaagctga780gcccactgga cgcacggggc gcgtcccccc tgacgggcgc tctggtggtg gcgtgtttgg840gagggttttg catgcttgtg ataggggctc tggcgcgggc tctgtacggt gcttggagat900gcacgggcag ggcgagagag gggacgggtt cccgggaggc gctgcttgga ggtgctgaga960gggagggaga aggcgtgctt tgcgatgcgc ggggcgacct aggcgctgct gcgcggtgca1020gcagcaggga cctcggacgt gagtcgaagc cgtctgcaga ggagatggta gaagggccgc1080ggattggtag cagagaagag gaaatagaag aagaagaaga aatagaagaa gaagaaatag1140aagaagaaga aatagaagaa gaagaggagg acgggcaggc gggaaagatg gagaaaggac1200
tcgcggcggg aaaacaagag aatgtgaact tgggcttgaa ctttggtttg aatttgaatg1260tggagaacga ggggttgaat ttgagtttga atttgaaaga aaacttacgg aaagaaagtt1320tagttgaaag tgagaaagaa aaaaatgaga aagaaaaaga gaaagaaaaa gagaaagaaa1380aagagaaaga aaaagagaaa gaaaaagaga aagaaaaaga gaaagaaaaa gagaaagaaa1440aagagaaaga aaaagagaaa gaaaaagaga aagaaaaaga gaaagaaaaa gaagaagaaa1500aagaagaaga aaaagagaaa gaaaaagaga aagaaaaaga gaaagaaaaa gaagaaggag1560atttaaaaag ttgtttagtt gaaaaaggag aaggaggaag aagcagcgac agcggcagaa1620gaagaagtag ttgttgtaag aggggaacgg aggcagtagc agtggagcag gcggaggcga1680cagcaaacct cgaactcgac cccgtcgagc cgcagcaaga acaagagccc gaccaggtgg1740acgaggacga ggtccgcttg ttgtcaggaa caacagaagt tgcaggacta gccgagagtg1800ctaccactgc aattcttaga tccacagacg caagagcaga aaacttacaa ctgctcgcca1860caacacaaga accaccttca gatacaacca ggttcgagaa ctccacaagt ctagaagcag1920caacagctct agcagataat caaacaggtc cagaaaaagc tacgactaga agagaaatta1980tcgagtcgca acttgcaacc2000<210>79<211>4683<212>DNA<213>Ulkenia sp.
<400>79gcgagttata tctgtctaga aaacttggca tggctagcaa tttatgtcta gctattccat60acacacggta atgccagtag cctgttagtt atagctcttt tggttgttgt ctcacaatac120actgacatca gcagaacaaa atgaaagggg ccttggctac catgaaatca atacttcaaa180aggtctcttg gtttctttac tcgcatgtcg ctatttactt acattcctcg agtacataac240atatcataca tcaaagaaat taaaaagaaa acaaacattc aaatatgcat tactttccct300actgtactag taagtacgtt tctggtatta agttgttttt tctcaaaaga acaatgtgct360tacttgtaaa atccacagct gcttacttgt aagcctcaac tagttagtga tgtgattatc420ataaaatgtt cgacactgta cctcctttcc agctatcttc ctacacctcc tctgacgcag480gttgacggag gaggcgtggg ggttgattga agtgcaacac aacgttttgt ttaagatatt540ccttgccttg gccgactcca aatggatagc acagaagcct aatgataatt tgaattaatt600ttatttcgag cttatttaat gctcttatca gagtccgtag gtatctcttt tcctactaat660tgttgaaaaa ggatgttttg gacatagcag gtcatcatac tatttggttc catcaaattc720atatccattt ctttcgttca agtgcttccc ttcctactta ttatatatat tatatatcca780
taaatgtaaa agagacgatt acgaatactt tgcatacatg tatagcgaaa cagagatggt840agcaaaagtt caccttcact aatctaagaa tctctccacg tgggtaaaaa cttcagcagt900aagattgtaa atgatgtcca agaacaaaac gtcatgctag tccaggggtt actgagctaa960cgattaataa tgtttcgtag tcttcctaat tgcaccatca aaacttgtct gcacaagttt1020taaagtattg gagcctttac tgaagaatca gaggacatag atggggcacg ttcgccttga1080aaaaaatagt cttctttacc tgcatggtgt tacaaacaaa aacgagttga aaatagctgt1140gcaaggaggc aaacatgatt ggaaaagaaa aacgagggga cccttataca ggagggcgcc1200acatagtaga atgagtagat tgttagagta gggtacgctt tatgtgattg attgaatggg1260cgagtgaaag ttgctgtcaa ggttctaaac aaaaggatgt ttgagtttgt gagtattgtt1320tgcggcaaaa agattcagta gagagaaatg cacaaaaaga taatacgtgt gtagggcgat1380tatggaggca tgcatttggg ggaaatcatc gcatgcgcat gagtttctcc atctgccgaa1440tctttgcaaa ggcattttca agctccattt gcatagcgta ggcttgctgc tcaaactgag1500cgcgctgatg cgccagattt tcttcatgtc ttttgttcaa actacgctca agaccctcaa1560gagccgcaac cttgagcttg cgttcctttt gctgaatctc cataactctt cgtttcacct1620ggagctcaat ttctgcagca tccgtggtct ttgcagcggc ctgtgcgtct tgtgcggcct1680gtgcgttgtt tgcgagctcc tttcgcagct cctccatctc cgcgttcttt ttctcctcca1740tccatttggc accgagtttg gcagcttgat cgatgcggcc cttgagaact tcttcgttct1800cctcaagttc tgcgatacgc gcgtgtaagc cgaggatctc ctccgagaca gcctcgccat1860tgatcattat ttcacttccc gagtcttgaa tgacaacatc agccttggtg ccaggttcac1920cggtatctcg ctcgcaaccc tgctggcgca tagacagcat aaggcgcgca ttatcctcac1980gcagatcatc cacctgttct gataaaagtt tgactgcctg ctcaagatta cgggggttca2040cttcgtgaaa aatttcttga aggtctcgaa gctcagaaag cttggcagag caagtgtgca2100tcgctctgca ctttttaaga cgtgcaagtg catcatcaag tttggcatta tttaccttca2160tggaggcttc agctacttcg gcttcttcga ttacaatttt ctgcagctct acaacatcat2220ggccaattaa cttgcgatgc agctcggcaa tcaccccatg catcttttcg gtatggcctg2280gacgcgcctc atcctgcgtt cttcggatct cctcctctag ttctcgattt agacgaaggg2340ctggtccaag gggcgggtaa ttagcctgag tcaagccaag ctctgttgct agtccaaggc2400agtcggaaag tcgcagccgg tccctatcag aaacagcctt ttgcaagtct acgctcaaac2460gcacttcttg agccttgcgc accatcttcg gttctgcctg tcgcagaagt ttcgagtcgt2520agccagcttg ccacgctagc acgatggcac gcgcaagtga cctcagttga ccgctgttca2580tggcagactt gagcaacatt ttgatttgca caaatacctc atctgattca tcatcttcag2640
cttcctcaag ctctgcaggt gtcttgcgct ctccagagac ttgaagagca gggttcaaac2700cgccctccag gacctcgctc gcaagcgcct cctctgtctc agctttgcgc aatagcgcag2760cagcattctc cgccattgtg tttgtcactc acgagattaa tatcgttgcc agagtatacg2820gtaatgcgag ttaaggattc acagaatctc tcaaattaat cttttcacct aatgatatcc2880acaaaacgtt gcaatcgctc agcccaacga caagcgtgct tcttgtttta agactgcaac2940tgctcctttt tctattagtc aatatggacc gtcctccaaa cgtccagaaa atagcacaga3000atttaccagc agccgctgca gacaagaagt gcaagagagc aggcaagcaa gtgagggttt3060gagcaaatag gccaacctct ccacgcagaa ttctagggtc gcaaccggaa ctcacagtcc3120ttagaaaccg tgcgaagccc tgggctcaac ttcaatttgt ccacgggacc ttcagcaagc3180accaagctca gcagcgtgaa ggcaggcgct gaccacagtt tgagctcaga gggcttggtg3240tgcctcgcga ttgatattga agtcaattgc gcaggacggc agcaacggac caggtggtga3300agaaggtaat ctccagcgga gtgatgatgg agctcgaccg actactccgg aatcgaccag3360gggaggtgcg ggcgcccttc acaagcgggc gagaggcagg ggagagaagg ctcgactcca3420cgtcttgaag cgtgtacgtg tgcgcgctca cgcgtgcgac acgccggcaa gggcgcctta3480gtggcctgct gctgctgctg gtcgccacgc tgcgagccca agagatttga attgaactcg3540aagaaaataa ctatcattta tcaattccaa tcaatcaatg cattatgaag cacctctgaa3600gtgaactatt ctcctctcca atatacaaca aaaaacacac acagtgggtt ttaccctata3660acctattgtt ccgcgagcga tcaactactc tatagagcga atgaccagtt tttctttctt3720tctttctttc tttctttctt tctttctttc tttctttctt tctttctttc tttctttctg3780ttttcctatc taataacccc tttaatcgag gaaacctttc gatttaaaag gaaagctctg3840tctgtatata tctgttacag atactgctat catgccatgc agaaagaaac acaaaagaaa3900aacaaaagaa agagagaaag agagaaagaa agagagaaag aaagaaagaa agaaagaaag3960aagagctttt ctcaatcggt ttcctcatcg accgctcaca tatctacgat tgtggcaaag4020aaagaaagaa agaaagaagg aaagcctcag cagagtccgc acgaaagcct tcattgagcc4080accatgtcgt ggtccgctgc agtcagtgcc gcctctctgt gaattgagtg agtgagtgag4140tgagtgagtt ggttggttag ttagttagtg cctcttcagc tcaaagcctt tcacggtcgc4200tcttcgagcg tttgcttttt cataaacaaa taaacaaacc atcgaacgaa ccatcgaacg4260aacgaacaat ggtaccccag aatagacgga attaattgct aagtaaacca gtaacagtaa4320gttagtgttt ctgacctgag ccgttttctt tatttattcc tctcagctct gtgaagagaa4380tttgggatga aaagaaacgt ttttatttat ttaaaagttt agtaacaaga aaaacatggt4440ccctcttctt ccttcatgta aaaataagta agtaaaaaaa agaaaagaaa aaaaaaaaag4500
cttttaaagt agtaaagcga ggtagagata aaagttcttt ctcagggctc ctagtaggca4560cttaggaggt acgtctaaga ccgcctcgtg ggaagaaaag agaaaacaag aagagaaaag4620agagagagaa acagcgctga cccgagaggc tcatgcgcag agcccaaatc tgcccaactt4680tgg 4683<210>80<211>1848<212>PRT<213>Ulkenia sp.
<400>80Met Leu Val Ile Gly Ala Leu Ala Arg Ala Leu Tyr Gly Ala Trp Arg1 510 15Cys Thr Gly Arg Ala Arg Glu Gly Thr Gly Ser Arg Glu Ala Leu Leu20 25 30Gly Gly Ala Glu Arg Glu Gly Glu Gly Val Leu Cys Asp Ala Arg Gly35 40 45Asp Leu Gly Ala Ala Ala Arg Cys Ser Ser Arg Asp Leu Gly Arg Glu5055 60Ser Lys Pro Ser Ala Glu Glu Met Val Glu Gly Pro Arg Ile Gly Ser65 70 75 80Arg Glu Glu Glu Ile Glu Glu Glu Glu Glu Ile Glu Glu Glu Glu Ile85 90 95Glu Glu Glu Glu Ile Glu Glu Glu Glu Glu Asp Gly Gln Ala Gly Lys100 105 110Met Glu Lys Gly Leu Ala Ala Gly Lys Gln Glu Asn Val Asn Leu Gly115 120 125Leu Asn Phe Gly Leu Asn Leu Asn Val Glu Asn Glu Gly Leu Asn Leu130 135 140Ser Leu Asn Leu Lys Glu Asn Leu Arg Lys Glu Ser Leu Val Glu Ser145 150 155 160Glu Lys Glu Lys Asn Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu165 170 175Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu180 185 190Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu195 200 205Lys Glu Lys Glu Lys Glu Glu Glu Lys Glu Glu Glu Lys Glu Lys Glu210 215 220Lys Glu Lys Glu Lys Glu Lys Glu Lys Glu Glu Gly Asp Leu Lys Ser225 230 235 240Cys Leu Val Glu Lys Gly Glu Gly Gly Arg Ser Ser Asp Ser Gly Arg
245 250 255Arg Arg Ser Ser Cys Cys Lys Arg Gly Thr Glu Ala Val Ala Val Glu260 265 270Gln Ala Glu Ala Thr Ala Asn Leu Glu Leu Asp Pro Val Glu Pro Gln275 280 285Gln Glu Gln Glu Pro Asp Gln Val Asp Glu Asp Glu Val Arg Leu Leu290 295 300Ser Gly Thr Thr Glu Val Ala Gly Leu Ala Glu Ser Ala Thr Thr Ala305 310 315 320Ile Leu Arg Ser Thr Asp Ala Arg Ala Glu Asn Leu Gln Leu Leu Ala325 330 335Thr Thr Gln Glu Pro Pro Ser Asp Thr Thr Arg Phe Glu Asn Ser Thr340 345 350Ser Leu Glu Ala Ala Thr Ala Leu Ala Asp Asn Gln Thr Gly Pro Glu355 360 365Lys Ala Thr Thr Arg Arg Glu Ile Ile Glu Ser Gln Leu Ala Thr Met370 375 380Ala Thr Arg Val Lys Thr Asn Lys Lys Pro Cys Trp Glu Met Thr Lys385 390 395 400Glu Glu Leu Thr Ser Gly Lys Asn Val Val Phe Asp Tyr Asp Glu Leu405 410 415Leu Glu Phe Ala Glu Gly Asp Ile Ser Lys Val Phe Gly Pro Glu Phe420 425 430Ser Gln Ile Asp Gln Tyr Lys Arg Arg Val Arg Leu Pro Ala Arg Glu435 440 445Tyr Leu Leu Val Thr Arg Val Thr Leu Met Asp Ala Glu Val Asn Asn450 455 460Tyr Arg Val Gly Ala Arg Met Val Thr Glu Tyr Asp Leu Pro Val Asn465 470 475 480Gly Glu Leu Ser Glu Gly Gly Asp Cys Pro Trp Ala Val Leu Val Glu485 490 495Ser Gly Gln Cys Asp Leu Met Leu Ile Ser Tyr Met Gly Ile Asp Phe500 505 510Gln Asn Lys Ser Asp Arg Val Tyr Arg Leu Leu Asn Thr Thr Leu Thr515 520 525Phe Tyr Gly Val Ala Gln Glu Gly Glu Thr Leu Glu Tyr Asp Ile Arg530 535 540Val Thr Gly Phe Ala Lys Arg Leu Asp Gly Asp Ile Ser Met Phe Phe545 550 555 560Phe Glu Tyr Asp Cys Tyr Val Asn Gly Arg Leu Leu Ile Glu Met Arg565 570 575
Asp Gly Cys Ala Gly Phe Phe Thr Asn Glu Glu Leu Ala Ala Gly Lys580 585 590Gly Val Val Phe Thr Arg Ala Asp Leu Leu Ala Arg Glu Lys Thr Lys595 600 605Lys Gln Asp Ile Thr Pro Tyr Ala Ile Ala Pro Arg Leu Asn Lys Thr610 615 620Val Leu Asn Glu Thr Glu Met Gln Ser Leu Val Asp Lys Asn Trp Thr625 630 635 640Lys Val Phe Gly Pro Glu Asn Gly Met Asp Gln Ile Asn Tyr Lys Leu645 650 655Cys Ala Arg Lys Met Leu Met Ile Asp Arg Val Thr Lys Ile Asp Tyr660 665 670Thr Gly Gly Pro Tyr Gly Leu Gly Leu Leu Val Gly Glu Lys Ile Leu675 680 685Glu Arg Asp His Trp Tyr Phe Pro Cys His Phe Val Gly Asp Gln Val690 695 700Met Ala Gly Ser Leu Val Ser Asp Gly Cys Ser Gln Leu Leu Lys Met705 710 715 720Tyr Met Leu Trp Leu Gly Leu His Leu Lys Thr Gly Pro Phe Asp Phe725 730 735Arg Pro Val Asn Gly His Pro Asn Lys Val Arg Cys Arg Gly Gln Ile740 745 750Ser Pro His Lys Gly Lys Leu Val Tyr Val Met Glu Ile Lys Glu Met755 760 765Gly Tyr Asp Glu Ala Gly Asp Pro Tyr Ala Ile Ala Asp Val Asn Ile770 775 780Leu Asp Ile Asp Phe Glu Lys Gly Gln Thr Phe Asp Leu Ala Asn Leu785 790 795 800His Glu Tyr Gly Lys Gly Asp Leu Asn Lys Lys Ile Val Val Asp Phe805 810 815Lys Gly Ile Ala Leu Lys Leu Gln Lys Arg Ser Gly Pro Ala Val Val820 825 830Ala Pro Glu Lys Pro Leu Ala Leu Asn Lys Asp Leu Cys Ala Pro Ala835 840 845Val Glu Ala Ile Pro Glu His Ile Leu Lys Gly Asp Ala Leu Ala Pro850 855 860Asn Gln Met Thr Trp His Pro Met Ser Lys Ile Ala Gly Asn Pro Thr865 870 875 880Pro Ser Phe Ser Pro Ser Ala Tyr Pro Pro Arg Pro Ile Thr Phe Thr885 890 895Pro Phe Pro Gly Asn Lys Asn Asp Asn Asn His Val Pro Gly Glu Met
900 905 910Pro Leu Ser Trp Tyr Asn Met Ala Glu Phe Met Ala Gly Lys Val Ser915 920 925Leu Cys Leu Gly Pro Glu Phe Ala Lys Phe Asp Asp Ser Asn Thr Ser930 935 940Arg Ser Pro Ala Trp Asp Leu Ala Leu Val Thr Arg Val Val Ser Val945 950 955 960Ser Asp Met Glu Trp Val Gln Trp Lys Asn Val Asp Cys Asn Pro Ser965 970 975Lys Gly Thr Met Val Gly Glu Phe Asp Cys Pro Ile Asp Ala Trp Phe980 985 990Phe Gln Gly Ser Cys Asn Asp Gly His Met Pro Tyr Ser Ile Leu Met995 1000 1005Glu Ile Ala Leu Gln Thr Ser Gly Val Leu Thr Ser Val Leu Lys1010 1015 1020Ala Pro Leu Thr Met Glu Lys Lys Asp Ile Leu Phe Arg Asn Leu1025 1030 1035Asp Ala Asn Ala Glu Met Val Arg Ser Asp Ile Asp Leu Arg Gly1040 1045 1050Lys Thr Ile His Asn Leu Thr Lys Cys Thr Gly Tyr Ser Met Leu1055 1060 1065Gly Asp Met Gly Val His Arg Phe Ser Phe Glu Leu Ser Val Asp1070 1075 1080Gly Val Val Phe Tyr Lys Gly Thr Thr Ser Phe Gly Trp Phe Val1085 1090 1095Pro Glu Val Phe Ile Ser Gln Thr Gly Leu Asp Asn Gly Arg Arg1100 1105 1110Thr Gln Pro Trp His Ile Glu Ser Lys Val Pro Ser Ala Gln Val1115 1120 1125Leu Thr Tyr Asp Val Thr Pro Asn Gly Ala Gly Arg Thr Gln Leu1130 1135 1140Tyr Ala Asn Ala Pro Lys Gly Ala Gln Leu Thr Arg Arg Trp Asn1145 1150 1155Gln Cys Gln Tyr Leu Asp Thr Ile Asp Leu Val Val Ala Gly Gly1160 1165 1170Ser Ala Gly Leu Gly Tyr Gly His Gly Arg Lys Gln Val Asn Pro1175 1180 1185Lys Asp Trp Phe Phe Ser Cys His Phe Trp Phe Asp Ser Val Met1190 1195 1200Pro Gly Ser Leu Gly Val Glu Ser Met Phe Gln Leu Val Glu Ser1205 1210 1215
Ile Ala Val Lys Gln Asp Leu Ala Gly Lys Tyr Gly Ile Thr Asn1220 1225 1230Pro Thr Phe Ala His Ala Pro Gly Lys Ile Ser Trp Lys Tyr Arg1235 1240 1245Gly Gln Leu Thr Pro Thr Ser Lys Phe Met Asp Ser Glu Ala His1250 1255 1260Ile Val Ser Ile Glu Ala His Asp Gly Val Val Asp Ile Val Ala1265 1270 1275Asn Gly Asn Leu Trp Ala Asp Gly Leu Arg Val Tyr Asn Val Ser1280 1285 1290Asn Ile Arg Val Arg Ile Val Ala Gly Ala Ala Pro Ala Ala Ala1295 1300 1305Ala Ala Ala Ala Ala Val Ala Ala Pro Ala Ala Ala Pro Ala Pro1310 1315 1320Val Ala Ala Ser Gly Pro Ala Gln Thr Ile Thr Leu Lys Gln Leu1325 1330 1335Lys Ala Glu Leu Leu Asp Val Glu Lys Pro Leu Tyr Ile Ser Ser1340 1345 1350Ser Asn Gly Gln Val Lys Lys His Ala Asp Val Ala Gly Gly Gln1355 1360 1365Ala Thr Ile Val Gln Ala Cys Ser Leu Ser Asp Leu Gly Asp Glu1370 1375 1380Gly Phe Met Lys Thr Tyr Gly Val Val Ala Pro Leu Tyr Thr Gly1385 1390 1395Ala Met Ala Lys Gly Ile Ala Ser Ala Asp Leu Val Ile Ala Thr1400 1405 1410Gly Lys Arg Lys Ile Leu Gly Ser Phe Gly Ala Gly Gly Leu Pro1415 1420 1425Met His Ile Val Arg Ala Ala Val Glu Lys Ile Gln Ala Glu Leu1430 1435 1440Pro Asn Gly Pro Phe Ala Val Asn Leu Ile His Ser Pro Phe Asp1445 1450 1455Ser Asn Leu Glu Lys Gly Asn Val Asp Leu Phe Leu Glu Lys Gly1460 1465 1470Val Thr Val Val Glu Ala Ser Ala Phe Met Thr Leu Thr Pro Gln1475 1480 1485Val Val Arg Tyr Arg Ala Ala Gly Leu Ser Arg Asn Ala Asp Gly149 1495 1500Ser Ile Asn Ile Lys Asn Arg Ile Ile Gly Lys Val Ser Arg Thr1505 1510 1515Glu Leu Ala Glu Met Phe Ile Arg Pro Ala Pro Gln Asn Leu Leu
1520 1525 1530Asp Lys Leu Ile Gln Ser Gly Glu Ile Thr Lys Glu Gln Ala Glu1535 1540 1545Leu Ala Lys Leu Val Pro Val Ala Asp Asp Ile Ala Val Glu Ala1550 1555 1560Asp Ser Gly Gly His Thr Asp Asn Arg Pro Ile His Val Ile Leu1565 1570 1575Pro Leu Ile Ile Asn Leu Arg Asn Arg Leu His Lys Glu Cys Gly1580 1585 1590Tyr Pro Ala His Leu Arg Val Arg Val Gly Ala Gly Gly Gly Val1595 1600 1605Gly Cys Pro Gln Ala Ala Ala Ala Ala Leu Ala Met Gly Ala Ala1610 1615 1620Phe Leu Val Thr Gly Thr Val Asn Gln Val Ala Lys Gln Ser Gly1625 1630 1635Thr Cys Asp Asn Val Arg Lys Gln Leu Cys Met Ala Thr Tyr Ser1640 1645 1650Asp Val Cys Met Ala Pro Ala Ala Asp Met Phe Glu Glu Gly Val1655 1660 1665Lys Leu Gln Val Leu Lys Lys Gly Thr Met Phe Pro Ser Arg Ala1670 1675 1680Asn Lys Leu Tyr Glu Leu Phe Cys Lys Tyr Asp Ser Phe Glu Ser1685 1690 1695Met Pro Ala Thr Glu Leu Glu Arg Val Glu Lys Arg Ile Phe Gln1700 1705 1710Cys Pro Leu Ala Asp Val Trp Ala Glu Thr Ser Asp Phe Tyr Ile1715 1720 1725Asn Arg Leu His Asn Pro Glu Lys Ile Thr Arg Ala Glu Arg Asp1730 1735 1740Pro Lys Leu Lys Met Ser Leu Cys Phe Arg Trp Tyr Leu Gly Leu1745 1750 1755Ala Ser Arg Trp Ala Asn Thr Gly Glu Ala Gly Arg Val Met Asp1760 1765 1770Tyr Gln Val Trp Cys Gly Pro Ala Ile Gly Ala Phe Asn Asp Phe1775 1780 1785Ile Lys Gly Ser Tyr Leu Asp Pro Ala Val Ser Gly Glu Tyr Pro1790 1795 1800Asp Val Val Gln Ile Asn Leu Gln Ile Leu Arg Gly Ala Cys Tyr1805 1810 1815Leu Arg Arg Leu Asn Val Ile Arg Asn Asp Pro Arg Val Ser Ile1820 1825 1830
Glu Val Glu Asp Ala Glu Phe Val Tyr Glu Pro Thr Asn Ala Leu1835 1840 1845<210>81<211>18<212>DNA<213>Künstliche Sequenz<400>81ctcggcattg actccatc 18<210>82<211>18<212>DNA<213>Künstliche Sequenz<400>82GAGAATCTCG ACACGCTT 18<210>83<211>21<212>DNA<213>Künstliche Sequenz<400>83ATTACTCCTC TCTGCATCCG T 21<210>84<211>21<212>DNA<213>Künstliche Sequenz<400>84GCCGAAGACA GCATCAAACT C 21<210>85<211>21<212>DNA<213> Künstliche Sequenz<400>85GTCGAGAGTG GCCAGTGCGA T 21<210>86<211>21<212>DNA<213> Künstliche Sequenz<400>86AAAGTGGCAG GGAAAGTACC A 2权利要求
1.PUFA-PKS,其特征是它们a.包括在SEQ ID No.6(ORF 1),7(ORF 2),8和/或80(ORF 3)中所示氨基酸序列的至少其中一种,以及具有与它们有至少70%,优选80%,特别优选至少90%和更加特别优选至少99%和最优选100%序列同源性的同源序列,所述同源序列具有PUFA-PKS的至少一个结构域的生物学活性,或b.包括在SEQ ID No.32,34,45,58,59,60,61,72,74和/或77中所示氨基酸序列的至少其中一种,以及具有与它们有至少70%,优选80%,特别优选至少90%和更加特别优选至少99%和最优选100%序列同源性的同源序列,所述同源序列具有PUFA-PKS的至少一个结构域的生物学活性。
2.具有10个或更多ACP结构域的根据权利要求1的分离的PUFA-PKS。
3.根据任何一项在前权利要求,其特征是它包含与序列SEQ IDNo.6(ORF 1),7(ORF 2)和/或8和/或80(ORF 3)的至少500个直接连续氨基酸具有至少70%,优选至少80%,特别优选至少90%和更加特别优选至少99%序列同源性的至少一种氨基酸序列,并且具有PUFA-PKS的至少一个结构域的生物学活性。
4.一种氨基酸序列,它与SEQ ID No.6(ORF 1),7(ORF 2)和/或8和/或80(ORF 3)的至少500个直接连续氨基酸具有至少70%,优选至少80%,特别优选至少90%和更加特别优选至少99%的同一性,并且具有PUFA-PKS的至少一个结构域的生物学活性。
5.一种分离的DNA分子,其编码根据任一项在前权利要求的氨基酸序列和与它完全互补的DNA。
6.根据权利要求5的分离的DNA分子,其特征是它与来自SEQ IDNo.3,4和5和/或9的至少500个直接连续核苷酸具有至少70%,优选至少80%,特别优选至少90%和更加特别优选至少95%的同一性。
7.根据权利要求5或6的DNA分子,其特征是它编码与序列SEQID No.6(ORF 1),7(ORF 2)和/或8和/或80(ORF 3)的至少500个直接连续氨基酸具有至少70%同源性的氨基酸序列。
8.包含根据权利要求5,6和/或7其中之一DNA分子的重组DNA分子,其与至少一种控制转录的DNA序列功能性连接,所述DNA序列优选选自SEQ ID No.XX-YY(终止子/启动子),或其来自至少500个核苷酸的部分以及它们的功能性变体。
9.包含根据权利要求8的重组DNA分子的重组宿主细胞。
10.根据权利要求9的重组宿主细胞,其内源性表达具有至少另一种PUFA-PKS结构域活性的根据权利要求1的PUFA-PKS。
11.包含重组DNA构建体的重组宿主细胞,其中控制翻译的元件选自SEQ ID No.XX-YY(终止子/启动子),或其来自至少500个核苷酸的部分以及它们的功能性变体。
12.一种生产含有PUFA,优选DHA的油的方法,包括培养根据权利要求9或10的宿主细胞。
13.根据权利要求12的方法生产的油。
14.一种生产含有PUFA,优选DHA的生物质量的方法,包括培养根据权利要求9或10的宿主细胞。
15.根据权利要求14的方法生产的生物质量。
16.根据权利要求15的重组生物质量,其包含根据权利要求8的核酸和/或根据权利要求1的氨基酸序列或与它们同源的至少50个连续氨基酸的部分。
17.包含PUFA-PKS的来自SEQ ID No.6,7,8和/或80的个别酶结构域用于生产人工多酮化合物的用途。
全文摘要
本发明涉及编码特异于多酮化合物合酶(PKS)的序列的基因。由此合成的PKS特征在于其产生PUFAs(多不饱和脂肪酸)的酶能力。本发明还涉及鉴定相应的DNA序列,以及所述核苷酸序列用于产生重组和/或转基因生物的应用。
文档编号C12P7/64GK101087882SQ200580018878
公开日2007年12月12日 申请日期2005年4月8日 优先权日2004年4月8日
发明者托马斯·克伊, 马库斯·卢伊, 马西亚斯·鲁辛 申请人:努特诺瓦营养产品及食品成分有限公司
网友询问留言 已有0条留言
  • 还没有人留言评论。精彩留言会获得点赞!
1