原钙粘着蛋白蛋白质及其应用的制作方法

文档序号:3548932阅读:525来源:国知局
专利名称:原钙粘着蛋白蛋白质及其应用的制作方法
本申请是1993年12月23日提交的国际专利申请No.PCT/US93/12588的后续部分,前者本身又是1992年12月29日提交的美国专利申请序号No.07/998,003的后续部分。
发明的领域本申请总地涉及与细胞-细胞粘着相关的材料和方法。更具体地说,本发明涉及称为原钙粘着蛋白的新的粘着蛋白质,和编码原钙粘着蛋白的多核苷酸序列。本发明还涉及抑制原钙粘着蛋白与其天然配体/抗配体结合的方法。
发明的背景在体内,细胞间粘着在形态发生和器官形成、白细胞外侵、肿瘤转移和侵染,以及细胞接合的形成等过程中起着重要作用。此外,细胞-细胞粘着对于组织整合性的维持是很关键的。
细胞间粘着是由特异性细胞表面粘着分子介导的。已将细胞粘着分子分为至少四个家族,其中包括免疫球蛋白超家族、整联蛋白超家族、选择蛋白超家族和钙粘着蛋白超家族。所有形成实体组织的细胞类型均表达钙粘着蛋白超家族的某些成员,此提示钙粘着蛋白参予大多数细胞类型的选择性粘着。
已将钙粘着蛋白总地描述为糖基化整合膜蛋白质,其具有由以钙粘着蛋白所独有的序列为特征的5个亚区域组成的N末端细胞外区域(似乎直接参予结合的是该区域的N末端113个氨基酸)、疏水跨膜区域及通过连环蛋白及其他细胞骨架相关蛋白质与细胞骨架作用的C末端胞浆区域。但某些钙粘着蛋白缺少胞浆区域并且似乎通过与具有胞浆区域之钙粘着蛋白不同机制在细胞-细胞粘着中发挥功能。在具有胞浆区域的钙粘着蛋白中,胞浆区域是细胞外区域的粘着功能所必需的。在不同细胞上表达的钙粘着蛋白家族成员之间的结合是同嗜性的(即钙粘着蛋白家族的成员与其自身的钙粘着蛋白或密切相关的亚类结合)和Ca2+依赖性的。有关钙粘着蛋白的最新综述可参见Takeichi,Annu.Rev.Biochem.,59237-252(1990)和Takeichi,Science,2511451-1455(1991)。
已根据其参予Ca2+依赖性细胞粘着和其独特的免疫学特性及组织定位鉴定了将要描述的第一类钙粘着蛋白(小鼠上皮细胞中的E-钙粘着蛋白、鸟肝脏中的L-CAM、小鼠胚泡中的桑椹胚粘着蛋白,及人上皮细胞中的CAM120/80)。随着后来对N-钙粘着蛋白的免疫学鉴定,发现其具有与E-钙粘着蛋白不同的组织分布,已发现了一个Ca2+依赖性细胞-细胞粘着分子的新家族这一事实变得更加明确了。
编码E-钙粘着蛋白(参见Nagafuchi et al.,Nature,329341-343(1987))、N-钙粘着蛋白(Hatta et al.,J.Cell,Biol.,106873-881(1988)),和P-钙粘着蛋白(Nose etal.,EMBO J.,63655-3661(1987))之基因的分子克隆为包括一个细胞粘着分子家族的钙粘着蛋白提供了结构证据。L-CAM(Gallin et al.,Proc.Natl.Acad.Sci USA,842808-2812(1987))和桑椹胚粘着蛋白(Ringwald et al.,EMBO J.,63647-3653(1986))的克隆则揭示了它们与E-钙粘着蛋白是相同的。比较E-、N-和P-钙粘着蛋白的氨基酸序列显示这三个亚类间氨基酸相似性水平约为45-58%。Liaw等人(EMBO J.,92701-2708(1990))描述了使用基于E-、N-和P-钙粘着蛋白之保守区域的简并寡核苷酸,以PCR方法从牛微血管上皮细胞cDNA扩增N-和P-钙粘着蛋白。
Suzuki et al.,Cell Regulation,2261-270(1991)中报导了用PCR技术分离另外八种钙粘着蛋白。继后,有人描述了包括R-钙粘着蛋白(Inuzuka et al.,Neuron,769-79(1991))、M-钙粘着蛋白(Donalies,Proc.Natl.Acad.SciUSA,888024-8028(1991))、B-钙粘着蛋白(Napolitano,J.Cell Biol.,113893-905(1991))和T-钙粘着蛋白(Ranscht,Neuron,7391-402(1991))在内的其他几种钙粘着蛋白。
此外,还描述了显然与钙粘着蛋白相关的蛋白质如桥粒芯蛋白(Goodwin et al.,Biochem.Biophys.Res.Commun.,1731224-1230(1990)和Koch et al.,Eur.J.Cell Biol.,531-12(1990))和桥粒胶粘蛋白(desmocollins)(Molton et al.,J.Cell Science,97239-246(1990))。这些分子的细胞外区域在结构上与典型钙粘着蛋白的细胞外区域相关,但又各有独特的胞浆区域。Mahoney等人(Cell,67853-868(1991))描述了称为fat的,也编码钙粘着蛋白相关蛋白质的果蝇属肿瘤抑制基因。fat肿瘤抑制基因包括34个钙粘着蛋白样亚区域,然后是4个EGF样重复区、跨膜区域、及一个新的胞浆区域。对这些钙粘着蛋白相关蛋白质的鉴定证明了一个以存在钙粘着蛋白细胞外区域基序为特征的大的超家族。
有关各种钙粘着蛋白相关蛋白质之组织表达的研究揭示,各亚类分子均具有独特的组织分布模式。例如,E-钙粘着蛋白见于上皮细胞中,而N-钙粘着蛋白则见于神经和肌肉细胞中。因为个别蛋白质在特定发育阶段似乎是由特异细胞和组织表达的,所以钙粘着蛋白相关蛋白质的表达似乎也是空间的并且在发育期间受到暂时调节的(如参见Takeichi(1991),文献出处同上)。钙粘着蛋白相关蛋白质的异位表达和对钙粘着蛋白相关蛋白质固有表达的抑制阻止了正常组织结构的形成(Detrick et al.,Neuron,4493-506(1990);Fujimori et al.,Development,110;97-104(1990);Kintner,Cell,69225-236(1992))。
当考虑到蛋白质的各个亚类可能在体内正常生理过程(如小肠上皮细胞屏障的维持)和异常病理过程(如肿瘤转移或炎症)中发挥作用时,不同的钙粘着蛋白和钙粘着蛋白相关蛋白质的独特的暂时和组织表达特征是特别重要的。钙粘着蛋白相关蛋白质的不同亚类或亚类组合很可能负责其中可期望进行治疗检测和/或干预的不同的细胞-细胞粘着作用。例如,寻常天疱疮(一种由于缺少细胞粘着能力而引起的以水疱形成为特征的自身免疫性皮肤病)病人的自身抗体可与对钙粘着蛋白的体内粘着功能提供直接支持的钙粘着蛋白相关蛋白质反应(Amagai et al.,Cell,67869-877(1991))。研究还显示,钙粘着蛋白和钙粘着蛋白相关蛋白质除有粘着活性外,还可能有调节功能。据Matsunaga等人报导(Nature,33462-64(1988)),N-钙粘着蛋白具有轴突过度生长促进活性。果蝇fat肿瘤抑制基因似乎可如哺乳动物E-钙粘着蛋白一样调节细胞生长并抑制肿瘤侵染(参见Mahoney et al.,文献同上;Frixen et al.,J.Cell.Biol.,113173-185(1991);Chen et al.,J.Cell.Biol.,114319-327(1991);Vleminckx et al.,Cell,66107-119(1991))。因此,可以期望对在特定组织中表达的钙粘着蛋白相关蛋白质的调节活性进行治疗干预。
因此本领域仍要求鉴定并定性那些参与细胞-细胞粘着和/或调节过程的其他钙粘着蛋白相关蛋白质。此外,钙粘着蛋白相关蛋白质可在一定程度上形成开发治疗和诊断剂的基础,而克隆编码这些蛋白质的基因是必要的。有关编码钙粘着蛋白相关蛋白质之DNA序列及氨基酸序列的资料将得以用重组技术大批量生产这些蛋白质,并得以鉴定天然产生这些蛋白质的组织/细胞。这些序列资料也将允许制备与钙粘着蛋白相关蛋白质发生特异反应的抗体物质或其他新的结合分子,这些物质或分子则可用于调节所说的蛋白质参予的天然配体/抗配体结合反应。
发明的概要本发明提供与钙粘着蛋白相关的材料及与细胞-细胞粘着相关的方法。一个方面,本发明提供纯化的和分离的编码新的细胞粘着分子的多核苷酸(如可以是正义和反义链的DNA和RNA),所说的新的细胞粘着分子本文中称为原钙粘着蛋白并包括原钙粘着蛋白-42、原钙粘着蛋白-43、原钙粘着蛋白pc3、原钙粘着蛋白pc4和原钙粘着蛋白pc5。本发明优选的多核苷酸序列包括基因组和cDNA序列以及完全或部分合成的DNA序列,及其生物学复制品(即体外制得的序列的拷贝)。还期望提供包含这些多核苷酸序列的生物学活性载体。
本发明的特别举例的原钙粘着蛋白多核苷酸序列是质粒pRC/RSV-pc42和pRC/RSV-pc43中的插入片段,所说的质粒已于1992年12月16日保存于美国典型培养物保藏中心(ATCC,12301ParklawnDrive,Rockville,Maryland 20852),ATCC保藏登记号分别为No.69162和69163。
通过公开本发明的DNA和氨基酸序列所提供的信息的科学价值是明显的。例如,编码原钙粘着蛋白的部分或完整DNA的序列知识有可能以标准DNA/DNA杂交或PCR技术分离编码该蛋白质(或其变异体)的全长cDNA或基因组DNA序列,并且就基因组DNA序列来说,还可分离用于指定原钙粘着蛋白特异性调节序列如启动子、增强子等的基因组DNA序列。或者,可用常规技术化学合成本发明的DNA序列。杂交和PCR技术也允许分离编码与本文特别举例的原钙粘着蛋白同源之异源原钙粘着蛋白的DNA。
根据本发明的另一个方面,宿主细胞,特别是真核和原核细胞,是以允许在细胞中表达原钙粘着蛋白多肽的方式用本发明的多核苷酸序列稳定转化或转染的。当表达原钙粘着蛋白多肽产物的宿主细胞生长于适当的培养基中时,特别适于大批量产生原钙粘着蛋白多肽、其片段及变异体,从而能够从细胞中或从培养细胞的培养基中分离所需的多肽产物。
本发明的新的原钙粘着蛋白产物可以作为分离物从天然组织来源的得到,但更好是以涉及本发明之宿主细胞的重组方法产生。这些产物可以依据所选用的宿主细胞或重组生产和/或分离后加工方法的不同,以完全或部分糖基化、部分或完全去糖基化、或非糖基化的形式得到。
根据本发明的原钙粘着蛋白变异体可以包括其中缺失或被置换一个或多个特定氨基酸的多肽类似物,或其中加入了一个或多个非天然编码之氨基酸的多肽类似物(1)没有丢失,并且更好是提高了一种或多种原钙粘着蛋白特异的生物学活性或免疫学性质;或者(2)特异性损害特定配体/抗配体结合功能。本发明还预期包括对本发明原钙粘着蛋白特异的抗体物质(如单克隆和多克隆抗体、嵌合和人源化抗体、包括Fab、Fab′、F(ab′)2、FV或单一可变区在内的抗体区域,以及单链抗体)。可使用分离的天然、重组或合成原钙粘着蛋白多肽产物或在其表面上表达这些产物的宿主细胞开发抗体物质。可以利用抗体物质纯化本发明的原钙粘着蛋白多肽、检测多肽的组织表达及作为原钙粘着蛋白之配体/抗配体结合活性的拮抗剂。本发明特别举例说明的单克隆抗体是由定名为38I2C的杂交瘤细胞系产生的原钙粘着蛋白-43特异性单克隆抗体,所说的杂交瘤细胞系于1992年12月2日保存于ATCC,入藏登记号为ATCC No.HB11207。
在参照附图阅读下面的详细描述之后将会明确了解本发明的许多其他方面和优点,其中附

图1A-C显示了排成行的本发明原钙粘着蛋白氨基酸序列与N-钙粘着蛋白和果蝇fat肿瘤抑制基因的氨基酸序列。
发明的详细描述以下述实施例举例说明本发明,其中实施例1、2和3描述用PCR技术分离原钙粘着蛋白多核苷酸序列。实施例3还描述本发明的几种原钙粘着蛋白基因的染色体定位。实施例4描述用DNA/DNA杂交技术分离本发明的其他原钙粘着蛋白多核苷酸序列。实施例5介绍包括编码原钙粘着蛋白-42或原钙粘着蛋白-43之多核苷酸的表达质粒的构建以及用这些质粒转染L细胞的方法。实施例6描述抗原钙粘着蛋白-42及原钙粘着蛋白-43抗体的产生。实施例7介绍对表达原钙粘着蛋白-42或原钙粘着蛋白-43之被转染的L细胞进行免疫检测的结果。实施例8描述了经原钙粘着蛋白-42、原钙粘着蛋白-43或嵌合原钙粘着蛋白-43/E-钙粘着蛋白分子转染的L细胞的细胞集合特性。实施例9描述了pc43的钙结合性质。实施例10、11和12分别说明以Northern印迹法、Western印迹法和原位杂交法检测各种表达原钙粘着蛋白-42和原钙粘着蛋白-43之组织和细胞系的结果。实施例13描述鉴定与原钙粘着蛋白-43共沉淀之120kDa蛋白质的免疫沉淀实验。
实施例1使用聚合酶链反应(PCR)分离编码钙粘着蛋白相关多肽的新的大鼠cDNA片段。
PCR引物的设计将已公布的L-CAM(Gallin et al.,文献同上)、E-钙粘着蛋白(Nagafuchi et al.,文献同上)、小鼠P-钙粘着蛋白(Nose et al.,文献同上)、桑椹胚粘着蛋白(Ringwald etal.,文献同上)、鸡N-钙粘着蛋白(Hatta et al.,文献同上)、小鼠N-钙粘着蛋白(Miyatani et al.,Science,245631-635(1989))和人P-钙粘着蛋白(Shimoyama et al.,J.Cell.Biol.1091787-1794(1989))的氨基酸序列进行比较,以鉴定保守氨基酸序列的两个区域一个来自钙粘着蛋白细胞外第三亚区域(EC-3)的中部,另一个来自细胞外第四亚区域(EC-4)的C末端,并设计分别在下面列出的以IUPAC-IUB生物化学命名法命名的相应简并寡核苷酸,以用作PCR引物。
引物1(SEQ ID NO1)5′AARSSNNTNGAYTRY6A 3′引物2(SEQ ID NO2)3′TTRCTRTTRCGNGGNNN 5′使用Applied Biosystems 380B型DNA合成仪(Foster City,California)合成简并寡核苷酸。
经PCR克隆cDNA序列基于大鼠脑cDNA制剂,以与Suzuki等人(Cell Regulation,2261-270(1991))所述者相似的方法进行PCR。用Maniatis等人(Molecular CloningA Laboratory Manual,pp.196,ColdSpring Harbor,New YorkCold Spring Harbor Laboratory(1982))所述的异硫氰酸鈲/氯化铯方法从大鼠脑中制备总RNA。然后使用FastTrack试剂盒(Invitrogen,San Diego,California),分离脑poly(A)+RNAs并使用cDNA合成试剂盒(BoehringerMannheim Biochemicals,Indianapolis,Indiana)制备cDNA。向100ng模板cDNA和10μg各引物中加入2.5单位Taq DNA聚合酶(Boehringer Mannheim Biochemicals)以开始PCR反应,然后进行35次反应循环94℃变性1.5分钟、45℃退火2分钟、72C聚合3分钟。对PCR反应产物进行琼脂糖凝胶电泳,发现大小约为450碱基对(bp)和130bp的两条主带。450bp带相当于两个引物位点之间所期望的长度,这两个引物位点分别对应于钙粘着蛋白细胞外第三亚区域(EC-3)的中部和钙粘着蛋白细胞外第四亚区域(EC-4)的羧基末端;但130bp带则不能从以前鉴定的任何钙粘着蛋白序列推断出来。用冻融法提取450bp和130bp带。用T4多核苷酸激酶使所得片段在5′末端磷酸化并以序列分析的平端连接法平端连接到M13mp18(Boehringer Mannheim Biochemicals)的SmaI位点,以进行亚克隆。使用Sequenase试剂盒(United States Biochemicals,Cleveland,Ohio)以二脱氧核苷酸链终止法测定片段的序列。使用Beckman Microgenie程序(Fullerton,California)分析DNA和氨基酸序列。
cDNA序列的分析分离19个新的部分cDNA克隆。克隆的DNA序列和由之推测的氨基酸序列(包括相当于PCR引物的序列)如下所示RAT-123(分别为SEQ ID Nos3和4)、RAT-212(SEQ ID NOs5和6),RAT-214(SEQ ID NOs7和8),RAT-216(SEQ ID NOs9和10),RAT-218(SEQ ID NOs11和12),RAT-224(SEQ ID NOs13和14),RAT-312(SEQ ID NOs15和16),RAT-313(SEQ ID NOs17和18),RAT-314(SEQ ID NOs19和20),RAT-315(SEQ ID NOs21和22),RAT-316(SEQ ID NOs23和24),RAT-317(SEQID NOs25和26),RAT-321(SEQ ID NOs27和28),RAT-323(SEQ IDNOs29和30),RAT-336(SEQ ID NOs31和32),RAT-352(SEQ ID NOs33和34),RAT-411(SEQ ID NOs35和36),RAT-413(SEQID NOs37和38),和RAT-551(SEQ ID NOs39和40)。
推测的cDNA克隆的氨基酸序列与已知钙粘着蛋白的序列同源但又不同。因此所描述的钙粘着蛋白在细胞外第三亚区域(EC-3)中具有高度保守的、短的氨基酸序列,包括位于该亚区域之中间区的共有序列D-Y-E或D-F-E及在其末端的共有序列D-X-N-E-X-P-X-F(SEQ ID NO41)或D-X-D-E-X-P-X-F(SEQ ID NO42)(Hatta et al.,文献同上)而除细胞外第五亚区域(EC-5)外的其他亚区域的相应序列则分别是D-R-E和D-X-N-D-N-X-P-X-F(SEQ ID NO43)。相反,新克隆相当于钙粘着蛋白细胞外亚区域的推测的氨基酸序列则在一个末端包括序列D-Y-E或D-F-E,但在另一端则具有代替D-X-N-E-X-P-X-F或D-X-D-E-X-P-X-E的序列D-X-N-D-N-X-P-X-F。由部分克隆编码的多肽是与以前鉴定的钙粘着蛋白同源的,但与Genbank中的任何其他序列则没有显示明显的同源性。因此,该部分cDNAs似乎包含钙粘着蛋白相关分子的一个新的亚类。
实施例2结构上与实施例1中所述的大鼠cDNAs相似的各种cDNA片段是使用实施例1中所述的引物1和2,以PCR方法从人、小鼠和非洲爪蟮脑cDNA制剂,以及从果蝇和C.elegans全身cDNA制剂中分离的。所得PCR片段的DNA序列和推测的氨基酸序列(包括相当于PCR引物的序列)如下MOUSE-321(SEQ ID NOs44和45),MOUSE-322(SEQID NOs46和47)MOUSE-324(SEQ ID NOs48和49),MOUSE-326(SEQ ID NOs50和51),HUMAN-11(SEQ ID NOs52和53),HUMAN-13(SEQ ID NOs54和55),HUMAN-21(SEQ ID NOs56和57),HUMAN-24(SEQ ID NOs58和59),HUMAN-32(SEQ ID NOs60和61),HUMAN-42(SEQ ID NOs62和63),HUMAN-43(SEQ ID NOs64和65),HUMAN-212(SEQ ID NOs66和67),HUMAN-213(SEQ ID NOs68和69),HUMAN-215(SEQ ID NOs70和71),HUMAN-223(SEQ ID NOs72和73),HUMAN-410(SEQ ID NOs74和75),HUMAN-443(SEQ ID NOs76和77),XENOPUS-21(SEQ IDNOs78和79),XENOPUS-23(SEQ ID NOs80和81),XENOPUS-25(SEQID NOs82和83),XENOPUS-31(SEQ ID NOs84和85),DROSOPHILA-12(SEQ ID NOs86和87),DROSOPHILA-13(SEQ ID NOs88和89),DROSOPHILA-14(SEQ ID NOs90和91)和C.ELEGANS-41(SEQ IDNOs92和93)。比较所推测的氨基酸序列表明这些不同组克隆间有显著的相似性。具体地说,有三组克隆似乎是种间同源的RAT-218、MOUSE-322和HUMAN-43;RAT-314、MOUSE-321和HUMAN-11;以及MOUSE-326和HUMAN-42。
实施例3为了确定由PCR产物限定的新蛋白质的完整结构,分离两个相当于部分cDNA HUMAN-42和HUMAN-43的全长人cDNAs。
全长人cDNAs的分离用32P标记的HUMAN-42和HUMAN-43DNA片段,以噬斑杂交方法(Ausubel et al.,Eds.,Current Protocols in MolecularBiology,Sections 6.1.1-6.1.4和6.2.1-6.2.3,John Wiley& Sons,New York(1987))筛选入ZapII载体中的人胎脑cDNA文库(Stratagene,La Jolla,California)。噬斑纯化阳性克隆,并使用辅助病毒,以Bluescript SK(+)质粒形式用体内切除方法切掉插入段。然后将此插入段序列亚克隆到M13载体(BoehringerMannheim,Biochemicals)中进行测序。用各含有两个cDNAs的探针分离几个重叠cDNA克隆,所说的cDNA含有两种称为原钙粘着蛋白-42(pc42)和原钙粘着蛋白-43(pc43)的两种新蛋白质之推测的完整编码序列。SEQ ID NOs94和95中分别列出pc42的DNA及其推测的氨基酸序列,同时SEQ ID NOs96和97中则分别列出pc43的DNA及其推测的氨基酸序列。
Sano等人在优先权申请之后,在The EMBO Journal,12(6)2249-2256(1993)中发表了本发明原钙粘着蛋白序列的克隆方法。在1991年12月9日美国细胞生物学学会的会议上已提出了pc43的推测的氨基酸序列。报告的摘要是作为Suzuki et al.,J.Cell.Biol.,11572a(Abstract416)(December9,1991)发表的。
全长人克隆的分析将pc42和pc43的全长cDNA序列与原来以PCR方法得到的各种DNA片段的序列相比较,表明MOUSE-326和HUMAN-42相当于pc42之细胞外第四亚区域(EC-4)的-部分,并且RAT-314、MOUSE-321,和HUMAN-11相当于pc43之细胞外第三亚区域(EC-3)的一部分,而RAT-218、MOUSE-322和HUMAN-43则相当于pc43之细胞外第五亚区域(EC-5)的一部分。
pc42和pc43的总体结构与典型钙粘着蛋白相似,但这些新的分子也有不同的特征。两种原钙粘着蛋白cDNA序列都含有推测的翻译起始位点并且翻译的氨基酸序列随典型的信号序列开始,但这些克隆却没有存在于所有已知钙粘着蛋白前体中的前序列。该cDNAs编码具有一个大的N末端细胞外区域和一个由跨膜序列连接之相对短的C末端胞浆区域的蛋白质。pc42和pc43的细胞外区域长度不同,并且pc42含有七个很类似于典型钙粘着蛋白细胞外亚区域的亚区域,而pc43有六个这样的亚区域。原钙粘着蛋白胞浆区域的大小相似于典型钙粘着蛋白的相应区域,但序列并没有显示与已知钙粘着蛋白或钙粘着蛋白相关蛋白质的序列有明显的同源性。
下列表1中给出了人pc42和pc43、以及小鼠N-钙粘着蛋白(SEQID NO98)(作为“典型”钙粘着蛋白的一个例子给出的)的细胞外亚区域,和果蝇fat肿瘤抑制基因的细胞外第18个亚区域(EC-18,SEQ ID NO99)(fat的细胞外第18个亚区域是一个原型fat亚区域)之间的氨基酸同一性确定结果,例如其中“N-EC-1×pc42”表示N-钙粘着蛋白的细胞外第一亚区域与水平轴上指示的pc42的细胞外亚区域可相比。
表1EC-1 EC-2 EC-3 EC-4 FEC-5 E5C-6 EC-7N-EC-1×pc4220272626 3129 17N-EC-1×pc4331232326 3124N-EC-2×pc4228303230 3731 19N-EC-2×pc4330283036 2930N-EC-3×pc4221263029 3130 22N-EC-3×pc4325182628 2825N-EC-4×pc4228282625 2927 17N-EC-4×pc4321252828 2924N-EC-5×pc4224212524 2419 12N-EC-5×pc4315212020 2516fat EC-18×pc42 22353234 4235 19fat EC-18×pc43 32303636 3329pc42和pc43、N-钙粘着蛋白EC-1至EC-5及果蝇fat EC-18的细胞外亚区域之间的氨基酸同一性值基本上小于40%。这些同一性值与其他钙粘着蛋白亚类的亚区域之间的同一性值差不多。然而,较高的同一性值表示pc42和pc43与fat比与N-钙粘着蛋白关系更为密切。
下列表2中给出人pc42和pc43的细胞外亚区域间的氨基酸同一性确定结果。
表2pc42pc43EC-1 FC-2 EC-3 EC-4 EC-5 EC-6 EC-7EC-133272926252625EC-226382933342821EC-326324130323122EC-425343041393118EC-523322927363416EC-625252625282326各个EC-1、EC-2、EC-3、EC-4、EC-5亚区域和pc42和pc43之最后亚区域间的同一性值一般高于原钙粘着蛋白与N-钙粘着蛋白相比较所得到的值。这些结果提示,pc42和pc43相互间的关系比它们与典型钙粘着蛋白的关系更为密切。
图1A-C显示pc42(EC-1至EC-7)、pc43(EC-1至EC-6)、小鼠N-钙粘着蛋白(EC-1至EC-5)和果蝇fat EC-18的细胞外亚区域之排成行的推测的氨基酸序列。图1A中一行上的序列延续到图1B和1C中同一行上。对最大同源性部分引入缺口。“基序”行中由大写字母描述的氨基酸残基是以大于N-钙粘着蛋白、pc42、pc43和果蝇fat亚区域之一半的形式给出的。基序行中由小写字母描述的氨基酸残基在人pc42、pc43和果蝇fat中较不保守。图1A-C显示,其他钙粘着蛋白细胞外区域重复的许多氨基酸特征在pc42和pc43序列中是保守的,其中包括钙粘着蛋白序列基序DXD、DRE和DXNDNXPXF(SEQ ID NO43)、两个甘氨酸残基,和一个谷氨酸残基。另外,pc42和pc43与N-钙粘着蛋白相比,它们共享独特的特征。在pc42和pc43之间,特定位点上的更多氨基酸是保守的,如接近pc42和pc43亚区域之氨基末端的DXDXGXN(SEQ ID NO100)原钙粘着蛋白序列基序和接近亚区域之羧基末端的AXDXGXP(SEQ ID NO101)序列基序。再者,两原钙粘着蛋白在接近EC-1之羧基末端、EC-2和EC-4的中间,及最后一个重复序列的羧基末端共享一个与典型钙粘着蛋白基序(N-钙粘着蛋白的)并不显著同源的区域。一个半胱氨酸残基位于pc42和pc43之EC-4中部的相似位置上。一般说来,pc42和pc43的细胞外亚区域比N-钙粘着蛋白的细胞外亚区域更相似于fat的EC-18。
可能变换的剪接对各种重叠原钙粘着蛋白cDNA克隆的序列分析表明,有些克隆在3′末端含有独特的序列,但5′末端序列是与其他克隆相同的。形成3′末端区域之边界的序列与mRNA剪接的共有序列一致,提示这些克隆可能相应于变换剪接的mRNAs。SEQ ID NOs102和103中列出变换剪接pc42mRNA的一个可能产物的DNA序列和由之推测的氨基酸序列。SEQ ID NO104和105,以及SEQ ID NOs106和107中分别列出变换剪接pc43mRNA之两个可能产物的DNA序列及由之推测的氨基酸序列。
染色体定位用常规方法确定原钙粘着蛋白413基因(SEQ ID NO37)和pc42及pc43基因的染色体定位。
简单地说,按以前已描述的方法(Seldin,et al.,J.Exp.Med.,167688-693(1988))喂养C3H/HeJ-gld和Mus spretus(西班牙)小鼠和[(C3H/HeJ-gld×Nus spretus)F1×C3H/HeJ-gld]种间回交小鼠。交配中选择Mus spretus作为第二亲代,因为与使用常规纯系实验品系进行交配相比,相对更易可确定信息性限制片段长度变异体(RFLVs)。以分离分析法确定基因连锁。
用限制性核酸内切酶消化以标准技术从小鼠器官中分离的基因组DNA,并在0.9%琼脂糖凝胶中电泳分离10μg样品。将DNA转移到Nytran膜(Schleicher&Schull,Inc.,Keene,NH)上,于65℃与适当的探针杂交并在严格条件下洗膜(各操作均参见Maniatis等人上述文献)。为了定位pc42基因,可使用相当于SEQ ID NO94之核苷酸1419至1906的小鼠序列探针;为了定位pc43基因则使用相当于SEQ ID NO96之核苷酸1060至1811的大鼠序列探针。为了定位原钙粘着蛋白413基因可使用包含SEQ ID NO37中所列序列的探针。目前研究用作探针的其他克隆和用以确定无名DNA座位的RFLV是前已描述过的(染色体7、DNA片段、Washington12(D7Was12);甲状旁腺激素(Pth);降钙素(Calc);血红蛋白,β链(Hbb);金属硫蛋白-I(Mt-1)腺嘌呤磷酸核糖基转移酶(Aprt);生长激素受体(Ghr);前列腺素E受体EP2亚型(Ptgerep2);二氢叶酸还原酶-2(Dhfr2)成纤维细胞生长因子a(Fgfa)和糖皮质醇受体-1(Grl-1))。
将原钙粘着蛋白基因的单元型分布与就整个小鼠基因组座位所确定的分布相比较,得以分别对其进行图谱分析以确定它在小鼠染色体上的特定区域定位。连锁的可能性大小99%,并表明pc42基因和pc43基因排布为染色体18。原钙粘着蛋白413基因的排布为染色体7。pc42和pc43基因图谱限定的染色体18的区域相当于ataxia(ax)座位(Burt,Anat.Rec.,19661-69(1980);Lyon,J,Hered.,4677-80(1955))和twirler(Tw)座位(Lyon,J.Embryol.Exp.Morphol.,6105-116(1958)),而原钙粘着蛋白413基因图谱限定的染色体7的区域相当于Shaker(sh-1)座位(Kikuchi et al.,Acta Oto-Laryngol.,60287-303(1965)和Lord et al.,Am.Nat.,63453-442(1929))。这些基因座位已被卷入与之相关的小鼠遗传性神经疾病中。这一结果与显示pc42和pc43在脑中,特别是大脑中被强表达的原位杂交实验结果(见实施例12)相一致。
实施例4使用实施例1中所述的大鼠原钙粘着蛋白片段作为探针,分离两种另外的新的人原钙粘着蛋白cDNA和一种另外的新的大鼠原钙粘着蛋白cDNA。
首先,使用大鼠克隆RAT-214(SEQ ID NO7)作为探针筛选大鼠脑cDNA文库(Stratagene,La Jolla,CA)。最后的洗涤步骤是于50℃在0.1×SSC中用0.1%SDS洗两次,每次15分钟。鉴定含有编码相关原钙粘着蛋白氨基酸序列之部分cDNA插入段的各种不同的克隆。SEQ ID NO108中列出了一个定名为#6-2的新的大鼠克隆的核苷酸序列。SEQ ID NO108的前15个核苷酸是接头序列而不是大鼠#6-2克隆的一部分。
用克隆#6-2之0.7kbp PstI片段筛选从Stratagene得到的人胎脑cDNA文库。该片段似乎编码大鼠原钙粘着蛋白的EC-2和EC-3。筛选大约2×106个噬斑后,分离11个阳性克隆。对克隆进行序列测定以鉴定称为人pc3的新的人原钙粘着蛋白全长cDNA。SEQID NO109和110中列出了人pc3的核昔酸和推断的氨基酸序列。
还使用大鼠克隆#6-2的0.7kbp PstI片段重筛选大鼠全长cDNA克隆的Stratagene大鼠脑cDNA文库。分离含有编码新的原钙粘着蛋白全长cDNA之插入段的克隆。SEQ ID NO111和NO112中列出了该插入段的DNA和由之推断的氨基酸序列。因为基于序列比较,全长大鼠cDNA似乎不是与人pc3克隆同源的,故将其定名为pc5。
同时,使用定名为#43(SEQ ID NO113)的大鼠部分cDNA的0.8kbp EcoRI-PstI片段(该片段是用相当于人pc43胞浆区域的探针筛选Stratagene大鼠脑cDNA文库而得到的)作为探针探测Stratagene人cDNA文库以筛选人原钙粘着蛋白全长cDNA。该片段似乎编码克隆#43的EC-3至EC-6的开始部分。鉴定出的一个部分克隆编码称为人pc4的新的人原钙粘着蛋白。SEQ ID NO114和115中列出了人pc4克隆的核苷酸序列和由之推断的氨基酸序列。由pc4克隆编码的氨基酸序列似乎在pc4之EC-2的中间开始并通过原钙粘着蛋白的胞浆尾部延续。
实施例5使用pRC/RSV表达载体(Invitrogen,San Diego,California)在L细胞(ATCC CCLl)中表达编码pc42和pc43的全长人cDNA。经用SspI消化然后用DNA聚合酶修成平末端并用XbaI消化(pc42),或者用SpeI和EcoRV双消化(pc43)以从实施例2所述的Bluescript SK(+)克隆中分离cDNA。用HindIII消化pRC/RSV表达载体,然后修成平端并再次用XbaI消化以插入pc42序列,或者用XbaI消化然后修成平端并再用SpeI消化以插入pc43序列。将分离出的原钙粘着蛋白DNA连接到线性化的pRC/RSV载体中。用CsCl梯度离心法纯化所得到的称为pRC/RSV-pc42的pc42表达质粒(ATCC69162)和称为pRC/RSV-pc43的pc43表达质粒(ATCC69163),并用磷酸钙法转染到L细胞中。
pc42和pc43转染体形态上相似于亲本细胞。对经pc42或pc43DNA序列转染的L细胞的Northern印迹分析显示,被转染的细胞表达了有预期编码特定原钙粘着蛋白之大小的mRNA。
实施例6产生对pc42和pc43特异的兔多克隆抗体及对pc43特异的小鼠单克隆抗体。
对pc42和pc43特异的多克隆抗体的制备将编码pc42和pc42之细胞外区域部分的DNA序列分别与麦芽糖结合蛋白编码序列融合并在细菌中表达之。具体地说,用PCR方法制备相当于之pc42之EC-4至EC-7和pc43之EC-3至EC-5的DNA并以正确读码亚克隆到pMAL表达载体(New England Biolabs.Beverly,Massachusetts)的多克隆位点,所说的表达载体含有编码直接在该多克隆位点之上游的麦芽糖结合蛋白质的序列。然后用单步骤转化法将所得质粒导入E.Coli NM522细胞(Invitrogen,SanDiego,California)中。并按制造商所述加入IPTG诱导融合蛋白质的表达,用直链淀粉树脂亲和层析(New England Biolabs)从细胞提取物中纯化融合蛋白质。不经进一步纯化即可使用该融合蛋白质免疫兔。
用加在弗氏完全佐剂中的500μg纯化的融合蛋白质在四个皮下位点免疫兔以制备多克隆抗体。继后用加在弗氏不完全佐剂中的100μg融合蛋白质进行免疫。使免疫血清穿过偶联到麦芽糖结合蛋白质上的琼脂糖凝胶(New England Biolabs),并使用由纯化的融合蛋白质与CNBr Sepharose(Pharmacia)反应而制得的Sepharose亲和层析柱从免疫血清中纯化多克隆抗体。进一步证实了多克隆血清与纯化的pc42融合蛋白质和pc42转染的细胞提取物(如实施例5中所述的)的反应性。
对pc43特异的单克隆抗体的制备按照Kennett(Methods in Enzymol.,58345-359(1978))的方法,使用pc43融合蛋白质(含有pc43的EC-3至EC-5亚区域)在小鼠体内产生单克隆抗体。简单地说,在两个皮下位点用pc43融合蛋白质(100μg)免疫小鼠。将得自有最高滴度之小鼠的脾细胞与NS1骨髓瘤细胞系融合。以ELISA检测法筛选与pc43融合蛋白质及与麦芽糖结合蛋白质有反应性的所得杂交瘤的上清液。亚克隆与pc43细胞外区域有最高反应性的融合孔。称为38I2C的杂交瘤细胞系(ATCC HB11207)产生对pc43特异的IgG1亚型单克隆抗体。经免疫印迹实施例5中所述的pc43L细胞转染体进一步证实由杂交瘤细胞系38I2C产生的单克隆抗体与pc43的反应性。38I2C单克隆抗体是对人pc43特异的。
实施例7用免疫印迹法和免疫荧光显微镜检法检测用实施例5中制备的编码pc42和pc43的DNA序列转染的L细胞对原钙粘着蛋白的表达。
免疫印迹分析对pc42和pc43转染体的细胞提取物进行SDS-PAGE分离,然后电转印到PVDF膜(Millipore,Bedford,Massachusetts)上。在Tris缓冲盐水(TBS)中将该膜与5%脱脂乳一起保温2小时,然后分别与pc42多克隆血清或pc43单克隆抗体保温1小时。用含有0.05%吐温20的TBS将膜洗3次(各5分钟),并在同样缓冲液中分别与结合有碱性磷酸酶的抗兔IgG抗体或抗小鼠IgG抗体(Promega,Madison,Wisconsin)保温1小时。用含有0.05%吐温20的TBS洗膜后,使用Western Blue溶液(Promega)显现肉眼可见的反应性带。
抗pc42多克隆抗体着染pc42转染的细胞中的,而不是亲本L细胞中的分子量约为170kDa的带。pc43特异性单克隆抗体(38I2C)和多克隆抗体着染pc43转染之细胞中的两条分子量约为150kDa的相邻带。pc43抗体并不着染亲本L细胞中的带。经用pc42和pc43抗体着染电泳带,所指示的分子量明显大于由推断的氨基酸序列预测的分子量。这种分子量的差异在各种钙粘着蛋白相关蛋白质间是常见的,并且可能归因于糖基化作用和/或钙粘着蛋白特异性的结构特性。pc42抗体也着染可能是蛋白水解降解产物的较小带。
当用胰酶处理被转染的细胞并制备细胞提取物,在SDS/PAGE上电泳分离并用适当的抗体进行免疫印迹分析时,发现由被转染的细胞表达的pc42和pc43多肽对蛋白水解高度敏感,并且容易通过0.01%胰蛋白酶处理消化。但与典型的钙粘着蛋白相反,这些蛋白质在1-5mM Ca2+存在下不能兔于被消化。
免疫荧光显微镜检使被转染的细胞生长在被粘连蛋白预包被的盖片上,并于室温下用4%多聚甲醛固定5分钟,或者在冰上用冷甲醇处理10分钟,然后再用4%多聚甲醛固定。用TBS洗涤后,将细胞与含有1%BSA的TBS保温30分钟,再于室温下在含有1%BSA的TBS中与抗pc42多克隆抗体或抗pc43单克隆抗体保温1小时。然后用含有0.01%BSA的TBS洗盖片并在室温下分别与结合有FITC的抗兔抗体或抗小鼠抗体(Cappel,Durham,North Carolina)一起保温60分钟。再次用含有0.01%BSA的TBS洗细胞并进行荧光显微镜检。pc42特异性和pc43特异性多克隆抗体都着染表达原钙粘着蛋白蛋白质之被转染细胞的细胞边缘区,主要是在细胞-细胞接触部位。这些抗体不着染亲本L细胞,兔预免疫血清均不着染pc42和pc43转染体。
实施例8检验被转染的表达原钙粘着蛋白蛋白质之L细胞的细胞聚集特性。在含有10%胎牛血清的Dulbecco氏改动的Eagle氏培养基(DMEM)(Gibco,Grand Island,New York)中于37℃和5%CO2环境下培养被转染的L细胞。在旋转振荡器上于37℃用存在有1mMEGTA的0.01%胰蛋白酶将生长至接近汇合的细胞处理25分钟,然后离心收集之。加入大豆胰蛋白酶抑制剂后用无Ca2+的HEPES缓冲盐水(HBS)将细胞洗3次,并重新悬浮于含有1%BSA的HBS中。在旋转振荡器上,于DMEM和含1%BSA、2mM CaCl2及20μg/ml脱氧核糖核酸酶之HBS的1∶1混合物中将重新悬浮的细胞于37℃保温30分钟至6小时,以完成细胞聚集试验(Urushihara et al.,Dev.Biol.,70206-216(1979))。
在保温不到1小时的期间内,pc42和pc43转染体没有显示任何明显的细胞聚集活性。这一结果正好与典型钙粘着蛋白在相似实验中所发生的细胞聚集相反(Nagafuchi et al.,文献同上;Hatta etal.,文献同上)。但延长被转染细胞的保温时间(超过1-2小时)则可见细胞逐渐再聚集成小的聚集体。当在Ca2+存在下经用胰蛋白酶处理制备被转染细胞的单细胞悬液时可得到相似的结果。当试验未被转染的L细胞或只用pRC/RSV载体转染的L细胞时,在相同条件下未观察到再聚集现象。当在细胞聚集试验中将用DiO(MolecularProbes,Eugene,OR)标记的pc43转染体与未被标记的pc42转染体一起保温时,标记的和未标记的细胞的聚集几乎是互不相容的,此表明原钙粘着蛋白结合是同嗜性的。
鉴于原钙粘着蛋白胞浆区域没有表现出与钙粘着蛋白区域有明显的同源性这一事实,故进行相应实验以确定是否胞浆区域的差异可以说明在钙粘着蛋白和原钙粘着蛋白转染体中观察到的细胞聚集活性差异的原因。用E-钙粘着蛋白的胞浆区域取代pc43的胞浆区域并分析用嵌合构建体转染之细胞的聚集。
用EcoRV消化实施例2中所述的含有pc43之完整编码序列的Bluescript SK(+)克隆,然后再用XbaI部分消化以除去相当于胞浆区域的序列,并经琼脂糖凝胶电泳纯化质粒DNA。使用从小鼠肺mRNA制得的小鼠cDNA作为模板,并使用相当于接近胞浆区域序列之N末端的区域或含有小鼠E-钙粘着蛋白之终止密码子的区域作为特异性引物(Nagafuchi et al.,文献同上),以PCR法合成相当于小鼠E-钙粘着蛋白之胞浆区域的cDNA。XbaI序列被包括在上游引物的5′末端。然后将E-钙粘着蛋白胞浆区域cDNA亚克隆到线性化的pc43Bluescript克隆中。用SpeI和EcoRV切下含有所得完整嵌合序列的DNA并亚克隆到表达载体pRC/RSV载体的SpeI切平头的XbaI位点中。最后,以磷酸钙法用所得构建体转染L细胞。用G418筛选约10天后,用FITC标记的38I2C抗pc43抗体着染转化体并进行FACS分析。分离一部分高标记的细胞并克隆之。转染体显示有与亲本L细胞相似的形态,并使用pc43抗体以免疫荧光显微镜检测,在细胞周边定位所表达的蛋白质。
按下述方法分析嵌合转染体的细胞聚集活性。室温下用DiO将嵌合pc43转染体标记20分钟。在1mM EGTA存在下用胰蛋白酶处理所得到的细胞并制成单细胞悬液。然后,将细胞与未标记的其他类型转染体混合并在旋转振荡器上保温2小时。用荧光和相差显微镜检查实验结果。在标准检测培养基中,于多克隆抗pc43抗体(100ng/ml)存在下保温转染体以检查抗体对细胞聚集的抑制作用。
在细胞聚集试验中,在40分钟保温时间内嵌合pc43转染体显示了明显的Ca2+依赖性细胞聚集作用。加入pc43特异性多克隆抗体后可抑制细胞聚集。
实施例9使用Maruyama等人(J.Biochem.,95511-519(1984))的方法,在加或不加钙-45条件下经Western印迹分析确定pc43的钙结合特性。将实施例6中所述的含有pc43亚区域EC-3至EC-5的pc43融合蛋白质与钙结合蛋白质钙调蛋白相比较。对纯化的pc43融合蛋白质的样品进行SDS/PAGE并电泳转移到PVDF膜上。使用放射自显影响术检测45Ca2+与pc43融合蛋白质的结合,并确定该结合几乎与45Ca2+对钙调蛋白的结合一样有效。相反,钙没有与纯化的缺少pc43细胞外区域的麦芽糖结合蛋白质结合。pc43亚区域EC-3至EC-5含有与见于E-钙粘着蛋白中的推断Ca2±结合基序高度同源的序列(参见(Ringwald et al.,EMBO J.,63647-3653(1987))。
实施例10用Northern印迹法检测各种组织和细胞系中编码pc42和pc43之mRNA的表达。
用异硫氰酸鈲法制备总RNA并使用FastTrack试剂盒(Invitrogen)分离poly(A)+RNAs。在变性条件下用0.8%琼脂糖凝胶电泳分离RNA制剂并使用毛细管法将之转移到硝酸纤维素滤膜上。按照Thomas(Proc.Natl.Acad.Sci.USA,775201-5205(1980))的方法进行Northern印迹分析。最后于65℃在含有0.1%十二烷基硫酸钠的0.2×标准柠檬酸盐盐水中将滤膜洗10分钟。
原钙粘着蛋白mRNA在成年大鼠组织中的表达在变性条件下电泳分离包括脑、心脏、肝脏、肺、皮肤、肾脏及肌肉在内的大鼠组织中的总mRNA制剂(10μg mRNA/泳道),并转移到硝酸纤维素滤膜上。将所得滤膜与32P标记的eDNA片段MOUSE-326(其相当于人pc42的EC-4)和RAT-218(相当于人pc43的EC-5)杂交。两种原钙粘着蛋白的mRNA在脑组织中均呈高度表达。pc42探针检测到-条大小为7kb的主带和一条大小为4kb的付带,可能代表替换剪接的产物。pc43探针与5kb大小的主带以及较小的付带杂交。
原钙粘着蛋白mRNA在大鼠脑中的发育表达为了检查对原钙粘着蛋白之mRNA表达的发育调节作用,制备胚胎第17和20天、新生第5和11天的大鼠以及成年大鼠的脑mRNA,并对其进行如上所述的用于检查其他大鼠组织的Northern印迹分析。使用β-肌动蛋白作为内部标准。当与β-肌动蛋白表达比较时,可见在脑的胚胎发育期间增加了表达pc42和pc43蛋白质的mRNA水平。
人的细胞系中的原钙粘着蛋白mRNA表达使用32P标记的用于表达的pc42和pc43mRNA,以Northern印迹法检测几种神经元和胶原细胞系(包括人SK-N-SH成神经细胞瘤、人U251神经胶质瘤和小鼠Neuro-2a成神经细胞瘤细胞系)。用HUMAN-42(相当于人pc42的EC-4)和HUMAN-43(相当于人pc43的EC-5)cDNA片段探查人细胞系,同时用MOUSE-326(相当于人pc42的EC-4)和RAT-322(相当于人pc43的EC-5)cDNA片段探查小鼠细胞系。发现SK-N-SH人成神经细胞瘤细胞和U251人神经胶质瘤细胞表达pc43mRNA并发现Neuro-2a小鼠成神经细胞瘤细胞表达pc42mRNA。
实施例11用Western印迹法和免疫荧光显微镜检术检测pc43蛋白质在各种组织、提取物及细胞中的表达。
在大鼠心肌提取物中的表达切除后在液氮中冷冻心脏,在研钵中研成粒末、在加于0.5%Nonidet P40(在10mM PIPES(pH6.8)、50mMNaCl、250mM(NH4)2SO4、300mM蔗糖、3mM MgCl2中)内的Polytron中简单研磨并微量离心15分钟,以制备大鼠心脏无离子去污剂提取物。经SDS/PAGE分离样品并电泳转移到硝酸纤维素膜上(Towbin et al.,PNAS 764350-4354(1979))。按实施例7中所述的免疫印迹法,使用兔抗pc43多克隆抗体检测出两条分子量为150kDa和140kDa的pc43蛋白质带。
在组织切片和细胞中的表达为了确定原钙粘着蛋白在各种组织中的定位,切下人和成年大鼠组织,在溶于PBS中的30%蔗糖中4℃保温30分钟,在加于冷冻模中的OCT化合物(Tissue-Tek,Elkhart,Indiana)中包埋并快速冷冻。切出6微米切片并放在玻片上。用PBS洗玻片并在3%多聚甲醛中固定5分钟。为了通透组织切片,将玻片在-20℃丙酮中浸泡10分钟并风干。用溶于PBS中的2%羊血精和1%BSA将切片封闭30分钟后,室温下与兔抗pc43多克隆抗血清保温1小时。在含0.1%BSA的PBS中将切片淋洗3次并与含1%BSA(溶于PBS)的生物素化的抗兔血清(Vector Laboratories,Burlingame,California)一起保温30分钟。洗3次后,加入与FITC(Vetor Laboratories)结合的链霉抗生物素蛋白保留30分钟,然后再洗3次。为进行共定位研究,将适当的第一抗体与结合有TRITC的第二抗体一起使用。
A.肌肉pc43在大鼠心肌中的免疫定位显示pc43以重复方式定位,而这是与pc43同肌节相关联的情况是一致的。肌节是骨骼肌和心肌中粘连筋膜(fascia adherens)间的重复性收缩单位。与细胞骨架蛋白质共定位显示pc43存在于Z线中肌节的末端上,其中Z线是与结蛋白和肌动蛋白结合蛋白质粘着斑蛋白,以及α-辅肌动蛋白相联系的。F-肌动蛋白的薄微丝是与Z线间的厚肌球蛋白丝相联系的。相反,N-钙粘着蛋白则定位于肌细胞肌细胞接触部位的粘连筋膜接合处的心肌细胞末端上。pc43在心肌中的定位提示,pc43可能在收缩器向浆膜固着的肌肉收缩中起作用。
在大鼠骨赂肌中观察到了pc43的相似定位。超结构研究显示肌营养不良蛋白(Duchenne肌营养不良症中缺少的基因产物)是肌膜的一种成分(Porter et al.,J.Cell.Biol.,117997-1005(1992))。肌膜是与pc43所在的M和Z线上的收缩器相连的。
B.脑大鼠和人小脑冷冻切片上抗pc43多克隆抗体及单克隆抗体38I2C的反应性分别显示,pc43表达的主要部位是在含有许多小神经元的Purkinje细胞和颗粒细胞层中。
C.胎盘在胎盘发育的早期阶段(妊娠的5-7周)还观察到单克隆抗体38I2C与人合胞体滋养层的强反应性。显示表达随胎盘发育进程而逐步降低一现象表明,pc43可能参予受精卵植入脑盘的过程。
D.成神经细胞瘤和星状细胞瘤细胞使用抗pc43抗体进行的pc43在SK-N-SH成神经细胞瘤细胞和UW28星状细胞瘤细胞中的免疫细胞化学定位揭示了pc43的点状细胞表面分布,并且在某些细胞中有一个在神经元足部隆起之延伸顶端处的定位。在UW28星状细胞瘤细胞的细胞-细胞接触部位,pc43组织成一系列的平行线。这些线开始在接触部位并延伸约5微米。用罗丹明-鬼笔环肽(Molecular Probes,Eugene,Oregon,按制造前所述方法)鉴定F-肌动蛋白微丝显示细胞中的微丝似乎终止在pc43线性结构处(线性结构从细胞接触部位的细胞边缘开始延伸)。
用pc43特异性抗体进行的免疫印迹研究显示,其可识别人SK-N-SH成神经细胞瘤细胞和UW28星状细胞瘤细胞中的分子量为140kDa的蛋白质。
E.成骨细胞使用单克隆抗体38I2C进行的pc43在两个人成骨肉瘤细胞系(SaOS(ATCC HTB 85)和MG-63(ATCC CRL 1427))中和正常人小梁成骨细胞培养物(Civitelli et al.,J.Clin.Invest.,911888-1896(1993)中所述的培养系统)中的免疫细胞化学定位研究显示,pc43以与见于UW28星状细胞瘤细胞中的相似方式在成骨细胞中表达。在细胞-细胞接触部位,pc43被组织成似乎相当于肌动蛋白张力纤维的一系列平行线。另外,在某些细胞中,pc43似乎定位在接触细胞隆起的顶端。Northern印迹分析为pc43在正常人小梁成骨细胞中表达提供了进一步的证据。pc43特异性DNA探针与从正常人小梁成骨细胞分离之poly-A mRNA样品中的5kb主带杂交。
实施例12在大鼠组织的冰冻切片上使用原钙粘着蛋白特异性RNA探针进行原位杂交实验。
使用Promega公司(Madison,Wisconsin)介绍的标准方法制备正义和反义35S-核糖探针(riboprobes)。使用MOUSE-326cDNA克隆的约400bp EcoRI-XbaI片段作为pc42特异性探针。该片段编码pc42之EC-3的中部至EC-4的末端。使用RAT-218cDNA克隆的约700bp SmaI片段作为pc43特异性探针。该片段编码pc43之EC-3的末端至EC-5的末端。
收获成年大鼠组织并直接用加在冰冻模中的OCT Compound(Tissue-TeK)包埋,并在95%乙醇/干冰溶中快速冷冻。将冷冻块保存于-80℃下以备切片。使用恒冷切片机(Reichert-Jung,Model#2800 Frigocut N,Leica,Inc.,Gilroy,California)切出6微米组织切片。将切出的组织切片保存于-80℃。
所使用的原位方法是Angerer等人(Methods in Enzymology,152649-660,(1987))所述方法改动方法。用焦碳酸二乙酯(DEPC,Sigma,St.Louis,Missouri)处理所有溶液以除去RNase污染。首先将组织切片置于4%多聚甲醛中4℃固定20分钟。为除去多余的多聚甲醛并停止组织固定,在PBS(磷酸盐缓冲盐水)中洗玻片,在一系列不同浓度乙醇(70、95、100%)中变性处理并干燥之。为了防止在原位处理期间组织从玻片上脱离,在聚-L-赖氨酸溶液(Sigma)中将组织切片室温处理10分钟。为使组织中的所有RNA变性,将切片在70%甲酰胺/2×SSC(0.15M NaCl/0.3M柠檬酸钠,pH7.0)溶液中70℃放置2分钟,然后在冷却的2×SSC中漂洗之,在一系列不同浓度的乙醇中脱水并干燥之。干燥后,将切片置于杂交缓冲液(50%甲酰胺/50mM DTT(二硫苏糖醇)/0.3MNaCl/20mM Tris,pH8.0/5mM EDTA/1×Denhardt氏溶液(0.02%Ficoll 400型/0.02%聚乙烯吡咯烷酮/0.02%BSA)/10%葡聚糖硫酸酯)中于最终杂交温度下预杂交约4小时。预杂交后,向各切片上加大约1×106cpm的适当核糖探针。一般将切片于45℃杂交过夜(12-16小时)。为确保所看到的杂交是特异的,在某些实验中通过将杂交温度升高到50℃以增加杂交的严格性。鉴于45℃和50℃实验均给出差不多的结果,故将所使用的标准杂交温度定为45℃。
为了除去过多的未杂交的探针,对切片进行一系列洗涤。首先在4×SSC中淋洗切片以除去大部分杂交溶液和探针。然后于室温下在4×SSC/50mMDTT中洗15分钟。还在逐步增加的严格条件下进行洗涤。于60℃在50%甲酰胺/2×SSC/50mM DTT中洗40分钟。最后在室温下洗涤4次,每次10分钟两次在2×SSC中,两次在0.1×SSC中。在一系列渐增浓度的乙醇中使洗过的切片脱水并干燥之。
为了看到杂交的探针,将玻片浸于已在dH2O中1∶1稀释的Kodak NTB2核乳液(International Biotechnology,New Haven,Connecticut)中。干燥后,4℃下将玻片在不透光盒中储存适当的曝光时间。由两个人单独观察原位玻片并记录阳性或阴性杂交信号。
所有原位杂交研究均在大鼠组织上进行。因为Northern印迹实验(参见实施例9)的结果表明pc42和pc43均在成年脑组织中表达,所以进行原位杂交研究以将这些分子的表达定位于特定类型的脑细胞上。在正常成年大鼠脑中看到的杂交是特异的(没有看到与正文探针的本底杂交),并定位于脑的特定区域。所看到的pc42和pc43表达的总体特征是很相似的,其主要差异在于表达水平的不同。pc43的表达水平似乎低于pc42。两种分子均在海马的生发细胞和锥体细胞中、小脑及灰质的神经元的Purkinje细胞中表达。另外,pc42在白质的胶质细胞中表达,但与pc43在胶质瘤细胞系中的表达(如实施例9中所述)相反,没有观察到pc42在正常胶质细胞中的表达。在脊髓中,两种原钙粘着蛋白在灰质的运动神经元中表达,并且pc42还在白质的胶质细胞中表达。
当分析两种原钙粘着蛋白分子在具有EAE(实验性应变性脑脊髓炎)(Vandenbark et al.,Cell.Immunol.,1285-93(1974))之大鼠的脑和脊髓中的表达时,发现如上所述的同样结构是阳性的。另外,观察到pc42在EAE组织的白细胞浸润液中的表达。经两个白细胞细胞系RBL-1/和y3的原位杂交分析进一步证实pc42在白细胞中的表达。
在所试验的所有胚胎学发育日(E15-E19)均观察到原钙粘着蛋白-42和-43在大鼠胚胎的发育阶段脑组织中的表达。另外,还在所试验的所有胚胎学发育日(E13-E19)观察到原钙粘着蛋白-43在发育阶段大鼠心脏中的表达。这一发现是与显示原钙粘着蛋白-43在成年心脏中表达的免疫组织化学分析结果一致的。
为了确定原钙粘着蛋白在神经系统发育中的可能作用,还用原位杂交方法检查原钙粘着蛋白成员在发育阶段的大鼠脑和成年大鼠脑中的表达分布图。将处于出生后第0、6、14、30天(P0至P30)和3个月(年青成年期)之大鼠脑的一系列冠状、矢状和水平断面切片与相当于本发明的包括pc42、pc43、RAT-212、RAT-411及RAT-418在内的各种原钙粘着蛋白的经标记cDNA探针杂交。在发育的脑中,RAT-411在嗅球的神经元即僧谓细胞和球旁细胞中以高水平表达。RAT-411mRNA的表达是瞬时的;表达出现在P0、P6达到降值,至P14减少,在P30时峰和成年大鼠脑中测不到表达。在成年大鼠中,发现pc43mRNA主要是在小脑的Purkinje细胞中表达。从大约在P6时Purkinje细胞分化之初即观察到pc43mRNA在Purkinje细胞中的表达。其他原钙粘着蛋白成员以很低的水平在发育中的和成年的大鼠脑的各个部位表达。这些结果表明在中枢神经系统发育期间原钙粘着蛋白成员被得以不同地表达,并且提示RAT-411和pc43在嗅球神经元和Purkinje细胞的发育期间分别具有特定的作用。
实施例13使用pc43特异性多克隆抗体和单克隆抗体38I2C进行常规免疫沉淀,以鉴定与L细胞转化体中pc43相互反应的蛋白质。
经在含有[35S]甲硫氨酸(50μCi/ml)的Dulbecco氏改动的Eagle氏培养基中保温细胞过夜,代谢上标记pc43和嵌合pc43转染体。洗涤后,用含有Triton X-100的PBS裂解转染体细胞并与抗pc43抗体一起保温。然后使用蛋白A-Sepharose小珠收集免疫复合物。室温下用洗涤缓冲液(50mM Tris-HCl、pH8.0,含有0.5M NaCl,0.1%卵白蛋白、0.5%NP-40、0.5%Triton X-100和1mM EDTA)将得到小珠洗5次。经SDS-PAGE分离蛋白质并进行放射自显影。
因为抗α-连环蛋白和抗β-连环蛋白抗体着染差不多大小的带,所以嵌合pc43与很可能相当于α-和β-连环蛋白的105kDa及95kDa带共沉淀。另一方面,pc43与120kDa带共沉淀。
虽然已通过特定方法和组成描述了本发明,但应明确的是本领域技术人员可以对其进行各种改动和修饰。因此,仅该对本发明作出如出现于权利要求中的这些限制。
序列表(1)一般资料(i)申请人Suzuki,Shintaro(ii)发明名称Protocadherin Materials and Methods(iii)序列数目115(iv)通讯地址(A)地址Marshall,O′Toole,Gerstein,Murray,&Borun(B)街道6300 Sears Tower,233 S.Wacker Drive(C)城市Chicago(D)州 Illinois(E)国家USA(F)邮政编码60606(v)计算机可读形式(A)介质类型Floppy disk(B)计算机IBM PC compatible(C)操作系统PC-DOS/MS-DOS(D)软件PatentIn Release #1.0,Version#1.25(vi)目前申请信息(A)申请号(B)提交日(C)分类(vii)在先申请信息(A)申请号PCT/US93/12588(B)申请日23 DEC 1993(vii)在先申请信息(A)申请号US07/998,003(B)申请日29 DEC 1992(viii)律师/代理人信息(A)姓名Noland,Greta E.
(B)登记号35,302(C)参考/文件号32149(ix)电讯信息(A)电话312/474-6300(B)传真312/474-0448(C)电传25-3856(2)SEQ ID NO1的信息(i)序列特征(A)长度17碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型DNA(xi)序列描述SEQ ID NO1AARSSNNTNG AYTRYGA 17(2)SEQ ID NO2的信息(i)序列特征(A)长度17碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(i i)分子类型DNA(xi)序列描述SEQ ID NO2TTRCTRTTRC GNGGNNN 17(2)SEQ ID NO3的信息(i)序列特征(A)长度131碱基对
(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO3AAGGGAGTGG ACTTTGAGGA GCAGCCTGAG CTTAGTCTCA TCCTCACGGC TTTGGATGGA 60GGGACTCCAT CCAGGTCTGG GACTGCATTG GTTCAAGTGG AAGTCATAGA TGCCAATGAC120AACGCACCGT A 131(2)SEQ ID NO4的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO4Lys Gly Val Asp Phe Glu Glu Gln Pro Glu Leu Ser Leu Ile Leu Thr1 5 10 15Ala Leu Asp Gly Gly Thr Pro Ser Arg Ser Gly Thr Ala Leu Val Gln20 25 30Val Glu Val Ile Asp Ala Asn Asp Asn Ala Pro35 40(2)SEQ ID NO5的信息(i)序列特征(A)长度131碱基对(B)类型核酸(C)链型单股
(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO5AAACGCATGG ATTTCGAGGA GTCTTCCTCC TACCAGATCT ATGTGCAAGC TACTGACCGG 60GGACCAGTAC CCATGGCGGG TCATTGCAAG GTGTTGGTGG ACATTATAGA TGTGAACGAC120AACGCACCTA A 131(2)SEQ ID NO6的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO6Lys Ala Met Asp Phe Glu Glu Ser Ser Ser Tyr Gln Ile Tyr Val Gln1 5 10 15Ala Thr Asp Arg Gly Pro Val Pro Met Ala Gly His Cys Lys Val Leu20 25 30Val Asp Ile Ile Asp Val Asn Asp Asn Ala Pro35 40(2)SEQ ID NO7的信息(i)序列特征(A)长度131碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA
(xi)序列描述SEQ ID NO7AAGCGACTGG ACTTTGAGAC CCTGCAGACC TTCGAGTTCA GCGTGGGTGC CACAGACCAT 60GGCTCCCCCT CGCTCCGCAG TCAGGCTCTG GTGCGCGTGG TGGTGCTGGA CCACAATGAC120AATGCCCCCA A 131(2)SEQ ID NO8的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO8Lys Arg Leu Asp Phe Glu Thr Leu Gln Thr Phe Glu Phe Ser Val Gly1 5 10 15Ala Thr Asp His Gly Ser Pro Ser Leu Arg Ser Gln Ala Leu Val Arg20 25 30Val Val Val Leu Asp His Asn Asp Asn Ala Pro35 40(2)SEQ ID NO9的信息(i)序列特征(A)长度131碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO9AAGGGCCTGG ATTACGAGGC ACTGCAGTCC TTCGAGTTCT ACGTGGGCGC TACAGATGGA 60GGCTCACCCG CGCTCAGCAG CCAGACTCTG GTGCGGATGG TGGTGCTGGA TGACAACGAC120AACGCCCCTA A 131(2)SEQ ID NO10的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO10Lys Gly Leu Asp Tyr Glu Ala Leu Gln Ser Phe Glu Phe Tyr Val Gly1 5 10 15Ala Thr Asp Gly Gly Ser Pro Ala Leu Ser Ser Gln Thr Leu Val Arg20 25 30Met Val Val Leu Asp Asp Asn Asp Asn Ala Pro35 40(2)SEQ ID NO11的信息(i)序列特征(A)长度131碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO11AAGGCGTTTG ATTTTGAGGA TCAGAGAGAG TTCCAGCTAA CCGCTCATAT AAACGACGGA 60GGTACCCCGG TTTTGGCCAC CAACATCAGC GTGAACATAT TTGTTACTGA CCGCAATGAC120AACGCCCCGC A 131(2)SEQ ID NO12的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO12Lys Ala Phe Asp Phe Glu Asp Gln Arg Glu Phe Gln Leu Thr Ala His1 5 10 15Ile Asn Asp Gly Gly Thr Pro Val Leu Ala Thr Asn Ile Ser Val Asn20 25 30Ile Phe Val Thr Asp Arg Asn Asp Asn Ala Pro35 40(2)SEQ ID NO13的信息(i)序列特征(A)长度131碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO13AAGGCGGTGG ATTACGAAAT CACCAAGTCC TATGAGATAG ATGTTCAAGC CCAAGATCTG 60GGTCCCAATT CTATTCCTGC TCATTGCAAA ATTATAATTA AGGTCGTGGA TGTCAACGAC120AACGCTCCCA A 131(2)SEQ ID NO14的信息(i)序列特征
(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO14Lys Ala Val Asp Tyr Glu Ile Thr Lys Ser Tyr Glu Ile Asp Val Gln1 5 10 15Ala Gln Asp Leu Gly Pro Asn Ser Ile Pro Ala His Cys Lys Ile Ile20 25 30Ile Lys Val Val Asp Val Asn Asp Asn Ala Pro35 40(2)SEQ ID NO15的信息(i)序列特征(A)长度135碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO15TATGACCATG ATTACGAGAC AACCAAAGAA TATACACTGC GGATCCGGGC CCAGGATGGT 60GGCCGGACTC CACTTTCCAA CGTCTCCGGT CTAGTAACCG TGCAGGTCCT AGACATCAAC120GACAATGCCC CCCCA 135(2)SEQ ID NO16的信息(i)序列特征(A)长度44氨基酸(B)类型氨基酸
(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO16Tyr Asp His Asp Tyr Glu Thr Thr Lys Glu Tyr Thr Leu Arg Ile Arg1 5 10 15Ala Gln Asp Gly Gly Arg Thr Pro Leu Ser Asn Val Ser Gly Leu Val20 25 30Thr Val Gln Val Leu Asp Ile Asn Asp Asn Ala Pro35 40(2)SEQ ID NO17的信息(i)序列特征(A)长度129碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO17GGGGGGTCGA TTACGAGGAG AACGGCATGT TAGAGATCGA CGTGCAGGCC AGAGACCTAG 60GACCTAACCC AATTCCAGCC CATTGCAAGG TCACAGTCAA GCTCATCGAC CGCAATGATA120ACGCCCCCA129(2)SEQ ID NO18的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性
(ii)分子类型蛋白质(xi)序列描述SEQ ID NO18Arg Gly Val Asp Tyr Glu Glu Asn Gly Met Leu Glu Ile Asp Val Gln1 5 10 15Ala Arg Asp Leu Gly Pro Asn Pro Ile Pro Ala His Cys Lys Val Thr20 25 30Val Lys Leu Ile Asp Arg Asn Asp Asn Ala Pro35 40(2)SEQ ID NO19的信息(i)序列特征(A)长度131碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO19AAGGGGTTGG ACTACGAAGA CACCAAACTC CATGAGATTT ACATCCAGGC CAAAGACAAA 60GGTGCCAATC CGGAAGGAGC GCATTGCAAA GTACTGGTAG AGGTTGTGGA CGTTAACGAC120AATGCCCCTC A 131(2)SEQ ID NO20的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO20Lys Gly Leu Asp Tyr Glu Asp Thr Lys Leu His Glu Ile Tyr Ile Gln1 5 10 15Ala Lys Asp Lys Gly Ala Asn Pro Glu Gly Ala His Cys Lye Val Leu20 25 30Val Glu Val Val Asp Val Asn Asp Asn Ala Pro35 40(2)SEQ ID NO21的信息(i)序列特征(A)长度131碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO21AAGGGTTTGG ACTTTGAGCA AGTAGATGTC TACAAAATCC GCGTTGACGC GACGGACAAA 60GGACACCCTC CGATGGCAGG CCATTGCACT GTTTTAGTGA GGGTATTGGA TGAAAACGAC120AATGCGCCTC T 131(2)SEQ ID NO22的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO22Lys Gly Leu Asp Phe Glu Gln Val Asp Val Tyr Lys Ile Arg Val Asp1 5 10 15
Ala Tnr Asp Lys Gly His Pro Pro Met Ala Gly His Cys Tnr Val Leu20 25 30Val Arg Val Leu Asp Glu Asn Asp Asn Ala Pro35 40(2)SEQ ID NO23的信息(i)序列特征(A)长度134碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO23AAGGGTATAG ACTTCGAGCA GATCAAGGAC TTCAGCTTTC AAGTGGAAGC CCGGGACGCC 60GGCAGTCCCC AGGCGCTGTC CGGCAACTGC ACTGTCAACA TCTTGATAGT GGATCAGAAC120GACAACGCCC CTAA 134(2)SEQ ID NO24的信息(i)序列特征(A)长度44氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO24Lys Gly Ile Asp Phe Glu Gln Ile Lys Asp Phe Ser Phe Gln Val Glu1 5 10 15Ala Arg Asp Ala Gly Ser Pro Gln Ala Leu Ala Gly Asn Thr Thr Val20 25 30Asn Ile Leu Ile Val Asp Gln Asn Asp Asn Ala Pro35 40(2)SEQ ID NO25的信息(i)序列特征(A)长度134碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO25AAGCCGTTCG ACTATGAGCA AACCGCCAAC ACGCTGGCAC AGATTGACGC CGTGCTGGAA 60AAACAGGGCA GCAATAAATC GAGCATTCTG GATGCCACCA TTTTCCTGGC CGATAAAAAC120GACAATGCGC CAGA 134(2)SEQ ID NO26的信息(i)序列特征(A)长度44氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO26Lys Pro Phe Asp Tyr Glu Gln Thr Ala Asn Thr Leu Ala Gln Ile Asp1 5 10 15Ala Val Leu Glu Lys Gln Gly Ser Asn Lys Ser Ser Ile Leu Asp Ala20 25 30Thr Ile Phe Leu Ala Asp Lys Asn Asp Asn Ala Pro35 40(2)SEQ ID NO27的信息(i)序列特征
(A)长度131碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO27AAGCGGCTGG ATTTCGAACA GTTCCAGCAG CACAAGCTGC TCGTAAGGGC TGTTGATGGA 60GGAATGCCGC CACTGAGCAG CGATGTGGTC GTCACTGTGG ATGTCACCGA CCTCAACGAT120AACGCGCCCT A 131(2)SEQ ID NO28的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO28Lys Arg Leu Asp Phe Glu Gln Phe Gln Gln His Lys Leu Leu Val Arg1 5 10 15Ala Val Asp Gly Gly Met Pro Pro Leu Ser Ser Asp Val Val Val Thr20 25 30Val Asp Val Thr Asp Leu Asn Asp Asn Ala Pro35 40(2)SEQ ID NO29的信息(i)序列特征(A)长度131碱基对(B)类型核酸
(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO29AAGGGGATAG ACTTTGAGAG TGAGAATTAC TATGAATTTG ATGTGCGGGC TCGCGATGGG 60GGTTCTCCAG CCATGGAGCA ACATTGCAGC CTTCGAGTGG ATCTGCTGGA CGTAAATGAC120AACGCCCCAC T 131(2)SEQ ID NO30的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO30Lys Gly Ile Asp Phe Glu Ser Glu Asn Tyr Tyr Glu Phe Asp Val Arg1 5 10 15Ala Arg Asp Gly Gly Ser Pro Ala Met Glu Gln His Cys Ser Leu Arg20 25 30Val Asp Leu Leu Asp Val Asn Asp Asn Ala Pro35 40(2)SEQ ID NO31的信息(i)序列特征(A)长度131碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性
(ii)分子类型cDNA(xi)序列描述SEQ ID NO31AAGGCATTGG ACTTTGAGGC CCGGCGACTG TATTCGCTGA CAGTTCAGGC CACGGACCGA 60GGCGTGCCCT CGCTCACCGG GCGTGCCGAA GCGCTTATCC AGCTGCTAGA TGTCAACGAC120AACGCACCCA T 131(2)SEQ ID NO32的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO32Lys Ala Leu Asp Phe Glu Ala Arg Arg Leu Tyr Ser Leu Thr Val Gln1 5 10 15Ala Thr Asp Arg Gly Val Pro Ler Leu Thr Gly Arg Ala Glu Ala Leu20 25 30Ile Gln Leu Leu Asp Val Asn Asp Asn Ala Pro35 40(2)SEQ ID NO33的信息(i)序列特征(A)长度125碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO33AAGCCAATTG ATTACGAGGC AACTCCATAC TATAACATGG AAATTGTAGC CACAGACAGC 60GGAGGTCTTT CGGGAAAATG CACTGTGTCT ATACAGGTGG TGGATGTGAA CGACAACGCC120CCCAA125(2)SEQ ID NO34的信息(i)序列特征(A)长度41氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO34Lys Pro Ile Asp Tyr Glu Ala Thr Pro Tyr Tyr Asn Met Glu Ile Val1 5 10 15Ala Thr Asp Ser Gly Gly Leu Ser Gly Lys Cys Thr Val Ser Ile Gln20 25 30Val Val Asp Val Ash Asp Asn Ala Pro35 40(2)SEQ ID NO35的信息(i)序列特征(A)长度446碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO35AAGCGGGTAG ACTTCGAAAT GTGCAAAAGA TTTTACCTTG TGGTGGAAGC TAAAGACGGA 60GGCACCCCAG CCCTCAGCAC GGCAGCCACT GTCAGCATCG ACCTCACAGA TGTGAATGAT120AACCCTCCTC GGTTCAGCCA AGATGTCTAC AGTGCTGTCA TCAGTGAGGA TGCCTTAGAG 180GGGGACTCTG TCATTCTGCT GATAGCAGAA GATGTGGATA GCAAGCCTAA TGGACAGATT 240CGGTTTTCCA TCGTGGGTGG AGATAGGGAC AATGAATTTG CTGTCGATCC AATCTTGGGA 300CTTGTGAAAG TTAAGAAGAA ACTGGACCGG GAGCGGGTGT CAGGATACTC CCTGCTCATC 360CAGGCAGTAG ATAGTGGCAT TCCTGCAATG TCCTCAACGA CAACTGTCAA CATTGATATT 420TCTGATGTGA ACGACAACGC CCCCCT446(2)SEQ ID NO36的信息(i)序列特征(A)长度148氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO36Lys Arg Val Asp Phe Glu Met Cys Lys Arg Phe Tyr Leu Val Val Glu1 5 10 15Ala Lys Asp Gly Gly Thr Pro Ala Leu Ser Thr Ala Ala Thr Val Ser20 25 30Ile Asp Leu Thr Asp Val Asn Asp Asn Pro Pro Arg Phe Ser Gln Asp35 40 45Val Tyr Asp Ala Val Ile Ser Glu Asp Ala Leu Glu Gly Asp Ser Val50 55 60Ile Leu Leu Ile Ala Glu Asp Val Asp Ser Lye Pro Aan Gly Gln Ile65 70 75 80Arg Phe Ser Ile Val Gly Gly Asp Arg Asp Asn Glu Phe Ala Val Asp85 90 95Pro Ile Leu Gly Leu Val Lys Val Lys Lys Lys Leu Asp Arg Glu Arg100 105 110Val Ser Gly Tyr Ser Leu Leu Ile Gln Ala Val Asp Ser Gly Ile Pro115 120 125Ala Met Ser Ser Thr Thr Thr Val Asn Ile Asp Ile Ser Asp Val Asn130 135 140Asp Asn Ala Pro145(2)SEQ ID NO37的信息(i)序列特征(A)长度440碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO37AAGGGGGTTG ATTATGAGAC AAACCCACGG CTACGACTGG TGCTACAGGC AGAGAGTGGA60GGAGCCTTTG CTTTCTCGGT GCTGACCCTG ACCCTTCAAG ATGCCAATGA CAATGCTCCC 120CGTTTCCTGC AGCCTCACTA CGTGGCTTTC CTGCCAGAGT CCCGACCCTT GGAAGGGCCC 180CTGCTGCAGG TGGAAGCAGA CGACCTGGAT CAAGGCTCTG GAGGACAGAT CTCCTACAGT 240CTGGCTGCAT CCCAGCCAGC ACGGGGCTTG TTCCATGTAG ACCCAGCCAC AGGCACTATC 300ACTACCACAG CCATCCTGGA CCGGGAAATC TGGGCTGAAA CACGGCTGGT ACTGATGGCC 360ACAGACAGAG GAAGCCCAGC ATTGGTGGGC TCAGCTACCC TGACAGTGAT GGTCATCGAT 420ACCAACGACA ATGCTCCCCT 440(2)SEQ ID NO38的信息(i)序列特征(A)长度146氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO38Lys Gly Val Asp Tyr Glu Thr Asn pro Arg Leu Arg Leu Val Leu Gln1 5 10 15
Ala Glu Ser Gly Gly Ala Phe Ala Phe Ser Val Leu Thr Leu Thr Leu20 25 30Gln Asp Ala Asn Asp Asn Ala Pro Arg Phe Leu Gln Pro His Tyr Val35 40 45Ala Phe Leu Pro Glu Ser Arg Pro Leu Glu Gly Pro Leu Leu Gln Val50 55 60Glu Ala Asn Asp Leu Asp Gln Gly Ser Gly Gly Gln Ile Ser Tyr Ser65 70 75 80Leu Ala Ala Ser Gln Pro Ala Arg Gly Leu Phe His Val Asp Pro Ala85 90 95Thr Gly Thr Ile Thr Thr Thr Ala Ile Leu Asp Arg Glu Ile Trp Ala100 105 110Glu Thr Arg Leu Val Leu Met Ala Thr Asp Arg Gly Ser Pro Ala Leu115 120 125Val Gly Ser Ala Thr Leu Thr Val Met Val Ile Asp Thr Asn Asp Asn130 135 140Ala Pro145(2)SEQ ID NO39的信息(i)序列特征(A)长度124碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO39AAGGTCTCGA TTATGAGGCA ACTCCATATT ATAACGTGGA AATTGTAGCC ACAGATGGTG 60GGGGCCTTTC AGGAAAATGC ACTGTGGCTA TAGAAGTGGT GGATGTGAAC GACGGCGCTC120CAAT 124(2)SEQ ID NO40的信息(i)序列特征(A)长度41氨基酸
(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO40Lys Gly Leu Asp Tyr Glu Ala Thr Pro Tyr Tyr Asn Val Glu Ile Val1 5 10 15Ala Thr Asp Gly Gly Ala Phe Asp Glu Asn Cys Thr Val Ala Ile Glu20 25 30Val Val Asp Val Asn Asp Asn Ala Pro35 40(2)SEQ ID NO41的信息(i)序列特征(A)长度8氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型DNA(xi)序列描述SEQ ID NO41Asp Xaa Asn Glu Xaa Pro Xaa Phe1 5(2)SEQ ID NO42的信息(i)序列特征(A)长度8氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性
(ii)分子类型DNA(xi)序列描述SEQ ID NO42Asp Xaa Asp Glu Xaa Pro Xaa phe1 5(2)SEQ ID NO43的信息(i)序列特征(A)长度9氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型DNA(xi)序列描述SEQ ID NO43Asp Xaa Asn Asp Asn Xaa Pro Xaa Phe1 5(2)SEQ ID NO44的信息(i)序列特征(A)长度131碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO44AAGCGGATGG ATTTTGAAGA CACCAAACTC CATGAGATTT ACATCCAGGC CAAAGACAAA 60GGTGCCAATC CCGAAGGAGC GCATTGCAAA GTACTTGTAG AGGTTGTAGA CGTAAACGAC120AACGCCCCAG T 131(2)SEQ ID NO45的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO45Leu Arg Met Asp Phe Glu Asp Thr Lys Leu His Glu Ile Tyr Ile Gln1 5 10 15Ala Lys Asp Lys Gly Ala Asn Pro Glu Gly Ala His Cys Lys Val Leu20 25 30Val Glu Val Val Asp Val Asn Asp Asn Ala Pro35 40(2)SEQ ID NO46的信息(i)序列特征(A)长度131碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO46AAGGCTTTGG ATTACGAGGA TCAGAGAGAG TTCCAACTAA CAGCTCATAT AAACGACGGA 60GGTACCCCAG TCTTAGCCAC CAACATCAGC GTGAACGTAT TTGTTACTGA CCGCAATGAT120AACGCCCCCT A 131(2)SEQ ID NO47的信息(i)序列特征
(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO47Lys Ala Leu Asp Tyr Glu Asp Gln Arg Glu Phe Gln Leu Thr Ala His1 5 10 15Ile Asn Asp Gly Gly Thr Pro Val Leu Ala Thr Asn Ile Ser Val Asn20 25 30Val Phe Val Thr Asp Arg Asn Asp Asn Ala Pro35 40(2)SEQ ID NO48的信息(i)序列特征(A)长度131碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO48AAGCGCTTGG ACTACGAGGA GAGTAACAAT TATGAAATTC ACGTGGATGC TACAGATAAA 60GGATACCCAC CTATGGTTGC TCACTGCACC GTACTCGTGG GAATCTTGGA TGAAAATGAC120AACGCACCCA T 131(2)SEQ ID NO49的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸
(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO49Lys Arg Leu Asp Tyr Glu Glu Ser Asn Asn Tyr Glu Ile His Val Asp1 5 10 15Ala Thr Asp Lys Gly Tyr Pro Pro Met Val Ala His Cys Thr Val Leu20 25 30Val Gly Ile Leu Asp Glu Asn Asp Asn Ala Pro35 40(2)SEQ ID NO50的信息(i)序列特征(A)长度131碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO50AAACCGGTGG ACTACGAGAA AGTCAAAGAC TATACCATCG AGATCGTGGC TGTGGATTCC 60GGCAACCCTC CACTCTCTAG CACCAACTCC CTCAAGGTGC AGGTGGTAGA CGTCAACGAT120AACGCCCCTC T 131(2)SEQ ID NO51的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性
(ii)分子类型蛋白质(xi)序列描述SEQ ID NO51Lys Pro Val Asp Tyr Glu Lys Val Lys Asp Tyr Thr Ile Glu Ile Val1 5 10 15Ala Val Asp Ser Gly Asn Pro Pro Leu Ser Ser Thr Asn Ser Leu Lys20 25 30Val Gln Val Val Asp Val Asn Asp Asn Ala Pro35 40(2)SEQ ID NO52的信息(i)序列特征(A)长度131碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO52AAGCCTTTTG ATTTCGAGGA CACCAAACTC CATGAGATTT ACATCCAGGC CAAAGACAAG 60GGCGCCAATC CCGAAGGAGC ACATTGCAAA GTGTTGGTGG AGGTTGTGGA TGTGAACGAC120AATGCCCCTC A 131(2)SEQ ID NO53的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO53Lys Pro Phe Asp Phe Glu Asp Thr Lys Leu His Glu Ile Tyr Ile Gln1 5 10 15Ala Lys Asp Lys Gly Ala Asn Pro Glu Gly Ala His Cys Lys Val Leu20 25 30Val Glu Val Val Asp Val Asn Asp Asn Ala Pro35 40(2)SEQ ID NO54的信息(i)序列特征(A)长度122碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO54AAAGGTGTCG ATTACGAGGT GAGTCCACGG CTGCGACTGG TGCTGCAGGC AGAGAGTCGA 60GGAGCCTTTG CCTTCACTGT GCTGACCCTG ACCCTGCAAG ATGCCAACGA CAACGCCCCG120AG 122(2)SEQ ID NO55的信息(i)序列特征(A)长度40氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO55Lys Gly Val Asp Tyr Glu Val Ser Pro Arg Leu Arg Leu Val Leu Gln1 5 10 15
Ala Glu Ser Arg Gly Ala Phe Ala Phe Thr Val Leu Thr Leu Thr Leu20 25 30Gln Asp Ala Asn Asp Asn Ala Pro35 40(2)SEQ ID NO56的信息(i)序列特征(A)长度131碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO56AAAGGGATTG ATTACGAGCA GTTGAGAGAC CTACAGCTGT GGGTGACAGC CAGCGACAGC 60GGGGACCCGC CTCTTAGCAG CAACGTGTCA CTGAGCCTGT TTGTGCTGGA CCAGAACGAC120AACGCCCCCC T 131(2)SEQ ID NO57的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO57Lys Gly Ile Asp Tyr Glu Gln Leu Arg Asp Leu Gln Leu Trp Val Thr1 5 10 15Ala Ser Asp Ser Gly Asp Pro Pro Leu Ser Ser Asn Val Ser Leu Ser20 25 30Leu Phe Val Leu Asp Gln Asn Asp Asn Ala Pro35 40(2)SEQ ID NO58的信息(i)序列特征(A)长度125碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO58AAGGCGGTCG ATTTTGAGCG CACATCCTCT TATCAACTCA TCATTCAGGC CACCAATATG 60GCAGGAATGG CTTCCAATGC TACAGTCAAT ATTCAGATTG TTGATGAAAA CGACAACGCC120CCCCA125(2)SEQ ID NO59的信息(i)序列特征(A)长度41氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO59Lys Ala Val Asp Phe Glu Arg Thr Ser Ser Tyr Gln Leu Ile Ile Gln1 5 10 15Ala Thr Asn Met Ala Gly Met Ala Ser Asn Ala Thr Val Asn Ile Gln20 25 30Ile Val Asp Glu Asn Asp Asn Ala Pro35 40(2)SEQ ID NO60的信息(i)序列特征
(A)长度131碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO60AAACGGCTAG ACTTTGAAAA GATACAAAAA TATGTTGTAT GGATAGACGC CAGAGATGGT 60GGTTTCCCTC CTTTCTCCTC TTACGAGAAA CTTGATATAA CAGTATTAGA TGTCAACGAT120AACGCGCCTA A 131(2)SEQ ID NO61的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO61Lys Arg Leu Asp Phe Glu Lys Ile Gln Lys Tyr Val Val Trp Ile Glu1 5 10 15Ala Arg Asp Gly Gly Phe Pro Pro Phe Ser Ser Tyr Glu Lys Leu Asp20 25 30Ile Thr Val Leu Asp Val Asn Asp Asn Ala Pro35 40(2)SEQ ID NO62的信息(i)序列特征(A)长度131碱基对(B)类型核酸
(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO62AAGGGGATCG ATTATGAGAA GGTCAAAGAC TACACCATTG AGATTGTGGC TGTGGACTCT 60GGCAACCCCC CACTCTCCAG CACTAACTCC CTCAAGGTGC AGGTGGTGGA CGTCAATGACl20AACGCACCGT G 131(2)SEQ ID NO63的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO63Lys Gly Ile Asp Tyr Glu Lys Val Lys Asp Tyr Thr Ile Glu Ile Val1 5 10 15Ala Val Asp Ser Gly Asn Pro Pro Leu Ser Ser Thr Asn Ser Leu Lys20 25 30Val Gln Val Val Asp Val Asn Asp Asn Ala Pro35 40(2)SEQ ID NO64的信息(i)序列特征(A)长度131碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性
(ii)分子类型cDNA(xi)序列描述SEQ ID NO64AAGGGACTCG ACTACGAGGA TCGGCGGGAA TTTGAATTAA CAGCTCATAT CAGCGATGGG 60GGCACCCCGG TCCTAGCCAC CAACATCAGC GTGAACATAT TTGTCACTGA TCGCAACGAT120AATGCCCCCG T 131(2)SEQ ID NO65的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO65Lys Gly Leu Asp Tyr Glu Asp Arg Arg Glu Phe Glu Leu Thr Ala His1 5 10 15Ile Ser Asp Gly Gly Thr Pro Val Leu Ala Thr Asn Ile Ser Val Asn20 25 30Ile phe Val Thr Asp Arg Asn Asp Asn Ala Pro35 40(2)SEQ ID NO66的信息(i)序列特征(A)长度470碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO66AAGGGTTTGG ACTACGAGAC CACACAGGCC TACCAGCTCA CGGTCAACGC CACAGATCAA 60GACAACACCA GGCCTCTGTC CACCCTGGCC AACTTGGCCA TCATCATCAC AGATGTCCAG120GACATGGACC CCATCTTCAT CAACCTGCCT TACAGCACCA ACATCTACGA GCATTCTCCT180CCGGGCACGA CGGTGCGCAT CATCACCGCC ATAGACCAGG ATCAAGGACG TCCCCGGGGC240ATTGGCTACA CCATCGTTTC AGGGAATACC AACAGCATCT TTGCCCTGGA CTACATCAGC300GGAGTGCTGA CCTTGAATGG CCTGCTGGAC CGGGAGAACC CCCTGTACAG CCATGGCTTC360ATCCTGACTG TGAAGGGCAC GGAGCTGAAC GATGACCGCA CCCCATCTGA CGCTACAGTC420ACCACGACCT TCAATATCCT GGTTATTGAC ATCAACGACA ACGCCCCACT 470(2)SEQ ID NO67的信息(i)序列特征(A)长度156氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO67Lys Gly Leu Asp Tyr Glu Thr Thr Gln Ala Tyr Gln Leu Thr Val Asn1 5 10 15Ala Thr Asp Gln Asp Asn Thr Arg Pro Leu Ser Thr Leu Ala Asn Leu20 25 30Ala Ile Ile Ile Thr Asp Val Gln Asp Met Asp Pro Ile Phe Ile Asn35 40 45Leu Pro Tyr Ser Thr Asn Ile Tyr Glu His Ser Pro Pro Gly Thr Thr50 55 60Val Arg Ile Ile Thr Ala Ile Asp Gln Asp Gln Gly Arg Pro Arg Gly65 70 75 80Ile Gly Tyr Thr Ile Val Ser Gly Asn Thr Asn Ser Ile Phe Ala Leu85 90 95Asp Tyr Ile Ser Gly Val Leu Thr Leu Asn Gly Leu Leu Asp Arg Glu100 105 110Asn Pro Leu Tyr Ser Gly Gly Phe Ile Leu Thr Val Lys Gly Thr Glu115 120 125
Leu Asn Asp Asp Arg Thr Pro Ser Asp Ala Thr Val Thr Thr Thr Phe130 135 140Asn Ile Leu Val Ile Asp Ile Asn Asp Asn Ala Pro145 150 155(2)SEQ ID NO68的信息(i)序列特征(A)长度131碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO68AAGGGGGTCG ATTACGAGGT ACTACAGGCC TTTGAGTTCC ACGTGAGCGC CACAGACCGA 60GGCTCACCGG GGCTCAGCAG CCAGGCTCTG GTGCGCGTGG TGGTGCTGGA CGACAATGAC120AACGCTCCCG T 131(2)SEQ ID NO69的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO69Lys Gly Val Asp Tyr Glu Val Leu Gln Ala Phe Glu Phe His Val Ser1 5 10 15Ala Thr Asp Arg Gly Ser Pro Gly Leu Ser Ser Gln Ala Leu Val Arg20 25 30val Val Val Leu Asp Asp Asn Asp Asn Ala Pro35 40(2)SEQ ID NO70的信息(i)序列特征(A)长度131碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO70AAGGGGCTGG ATTATGAGCA GTTCCAGACC CTACAACTGG GAGTGACCGC TAGTGACAGT 60GGAAACCCAC CATTAAGAAG CAATATTTCA CTGACCCTTT TCGTGCTGGA CCAGAATGAT120AACGCCCCAA A 131(2)SEQ ID NO71的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO71Lys Gly Leu Asp Tyr Glu Gln phe Gln Thr Leu Gln Leu Gly Val Thr1 5 10 15Ala Ser Asp Ser Gly Asn Pro Pro Leu Arg Ser Asn Ile Ser Leu Thr20 25 30Leu Phe Val Leu Asp Gln Asn Asp Asn Ala pro35 40(2)SEQ ID NO72的信息(i)序列特征
(A)长度131碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO72AAGCGGGTTG ATTACGAGGA TGTCCAGAAA TACTCGCTGA GCATTAAGGC CCAGGATGGG 60CGGCCCCCGC TCATCAATTC TTCAGGGGTG GTGTCTGTGC AGGTGCTGGA TGTCAACGAC120AATGCCCCGG A 131(2)SEQ ID NO73的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型肽(xi)序列描述SEQ ID NO 73Lys Arg Val Asp Tyr Glu Asp Val Gln Lys Tyr Ser Leu Ser Ile Lys1 5 10 15Ala Gln Asp Gly Arg Pro Pro Leu Ile Asn Ser Ser Gly Val Val Ser20 25 30Val Gln Val Leu Asp Val Asn Asp Asn Ala Pro35 40(2)SEQ ID NO74的信息(i)序列特征(A)长度125碱基对(B)类型核酸
(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO74AAACCGGTAG ACTTTGAGCT ACAGCAGTTC TATGAAGTAG CTGTGGTGGC TTGGAACTCT 60GAGGGATTTC ATGTCAAAAG GGTCATTAAA GTGCAACTTT TAGATGACAA CGACAATGCC120CCGAT125(2)SEQ ID NO75的信息(i)序列特征(A)长度41氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO75Lys Pro Val Asp Phe Glu Leu Gln Gln Phe Tyr Glu Val Ala Val Val1 5 10 15Ala Trp Asn Ser Glu Gly Phe His Val Lys Arg Val Ile Lys Val Gln20 25 30Leu Leu Asp Asp Asn Asp Asn Ala Pro35 40(2)SEQ ID NO76的信息(i)序列特征(A)长度125碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性
(ii)分子类型cDNA(xi)序列描述SEQ ID NO76AAGGGATTAG ATTTTGAAAC TTTGCCCATT TACACATTGA TAATACAAGG AACTAACATG 60GCTGGTTTGT CCACTAATAC AACGGTTCTA GTTCACTTGC AGGATGAGAA TGATAACGCC120CCAAA125(2)SEQ ID NO77的信息(i)序列特征(A)长度41氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO77Lys Gly Leu Asp Phe Glu Thr Leu Pro Ile Tyr Thr Leu Ile Ile Gln1 5 10 15Gly Thr Asn Met Ala Gly Leu Ser Thr Asn Thr Thr Val Leu Val His20 25 30Leu Gln Asp Glu Asn Asp Asn Ala Pro35 40(2)SEQ ID NO78的信息(i)序列特征(A)长度134碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO78AAGCGGGCGG ATTTCGAGGC GATCCGGGAG TACAGTCTGA GGATCAAAGC GCAGGACGGG60GGGCGGCCTC CCCTCAGCAA CACCACGGGC ATGGTCACAG TGCAGGTCGT GGACGTCAAT 120GACAACGCAC CCCT 134(2)SEQ ID NO79的信息(i)序列特征(A)长度44氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO79Lys Arg Ala Asp Phe Glu Ala Ile Arg Glu Tyr Ser Leu Arg Ile Lys1 5 10 15Ala Gln Asp Gly Gly Arg Pro Pro Leu Ser Asn Thr Thr Gly Met Val20 25 30Thr Val Gln Val Val Asp Val Asn Asp Asn Ala Pro35 40(2)SEQ ID NO80的信息(i)序列特征(A)长度131碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO80AAGCGGTTGG ATTACGAAAA GGCATCGGAA TATGAAATCT ATGTTCAAGC CGCTGACAAA60GGCGCTGTCC CTATGGCTGG CCATTGCAAA GTGTTGCTGG AGATCGTGGA TGTCAACGAC 120AACGCCCCCT T131(2)SEQ ID NO81的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO81Lys Arg Leu Asp Tyr Glu Lys Ala Ser Glu Tyr Glu Ile Tyr Val Gln1 5 10 15Ala Ala Asp Lys Gly Ala Val Pro Met Ala Gly His Cys Lys Val Leu20 25 30Leu Glu Ile Val Asp Val Asn Asp Asn Ala Pro35 40(2)SEQ ID NO82的信息(i)序列特征(A)长度131碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO82AAGGGGATCG ATTATGAGGA TCAGGTCTCT TACACATTAG CAGTAACAGC ACATGACTAT 60GGCATCCCTC AAAAATCAGA CACTACCTAT TTGGAAATCT TAGTAATTGA TGTTAACGAC120AACGCGCCCC A 131(2)SEQ ID NO83的信息(i)序列特征
(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO83Lys Gly Ile Asp Tyr Glu Asp G1n Val Ser Tyr Thr Leu Ala Val Thr1 5 10 15Ala His Asp Tyr Gly Ile Pro Gln Lys Ser Asp Thr Thr Tyr Leu Glu20 25 30Ile Leu Val Ile Asp Val Asn Asp Asn Ala Pro35 40(2)SEQ ID NO84的信息(i)序列特征(A)长度131碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO84AAAGGGTTAG ATTTCGAGGG CACTAAAGAT TCAGCGTTTA AAATAGTGGC AGCTGACACA 60GGGAAGCCCA GCCTCAACCA GACAGCCCTG GTGAGAGTAG AGCTGGAGGA TGAGAACGAC120AACGCCCCAA T 131(2)SEQ ID NO85的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸
(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO85Lys Gly Leu Asp Phe Glu Gly Thr Lys Asp Ser Ala Phe Lys Ile Val1 5 10 15Ala Ala Asp Thr Gly Lys pro Ser Leu Asn Gln Thr Ala Leu Val Arg20 25 30Val Glu Leu Glu Asp Glu Asn Asp Asn Ala Pro35 40(2)SEQ ID NO86的信息(i)序列特征(A)长度130碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO86AAGGGTGTGG ATTTTGAAAG TGTGCGTAGC TACAGGCTGG TTATTCGTGC TCAAGATGGA 60GGCAGCCCCT CCAGAAGTAA CACCACCCAG CTCTTGGTCA ACGTCATCGA TCGAATGACA120ATGCGCCGCT 130(2)SEQ ID NO87的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性
(ii)分子类型蛋白质(xi)序列描述SEQ ID NO87Lys Gly Val Asp Phe Glu Ser Val Arg Ser Tyr Arg Leu Val Ile Arg1 5 10 15Ala Gln Asp Gly Gly Ser Pro Ser Arg Ser Asn Thr Thr Gln Leu Leu20 25 30Val Asn Val Ile Asp Val Asn Asp Asn Ala Pro35 40(2)SEQ ID NO88的信息(i)序列特征(A)长度131碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO88AAGGGTGTGG ACTTCGAGCT GACACATCTG TATGAGATTT GGATTGAGGC TGCCGATGGA 60GACACGCCAA GTCTGCGTAG TGTAACTCTT ATAACGCTCA ACGTAACGGA TGCCAATGAC120AATGCTCCCA A 131(2)SEQ ID NO89的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO89
Lys Gly Val Asp Phe Glu Leu Thr His Leu Tyr Glu Ile Trp Ile Glu1 5 10 15Ala Ala Asp Gly Asp Thr Pro Ser Leu Arg Ser Val Thr Leu Ile Thr20 25 30Leu Asn Val Thr Asp Ala Asn Asp Asn Ala Pro35 40(2)SEQ ID NO90的信息(i)序列特征(A)长度441碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO90CAAGGCGTTT GATTTTGAAG AGACAAGTAG ATATGTGTTG AGTGTGGAAG CTAAGGATGG 60AGGAGTACAC ACAGCTCACT GTAATGTTCA AATAGAAATT GTTGACGAGA ATGACAATGC 120CCCAGAGGTG ACATTCATGT CCTTCTCTAA CCAGATTCCA GAGGATTCAG ACCTTGGAAC 180TGTAATAGCC CTCATAAAAG TGCGAGACAA GGATTCTGGG CAAAATGGCA TGGTGACATG 240CTATACTCAG GAAGAAGTTC CTTTCAAATT AGAATCCACC TCGAAGAATT ATTACAAGCT 300GGTGATTGCT GGAGCCCTAA ACCGGGAGCA GACAGCAGAC TACAACGTCA CAATCATAGC 360CACCGACAAG GGCAAACCAG CCCTTTCCTC CAGGACAAGC ATCACCGTGC ACATCTCCGA 420CATCAACGAT AATGCCCCCG T 441(2)SEQ ID NO91的信息(i)序列特征(A)长度146氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性
(ii)分子类型蛋白质(xi)序列描述SEQ ID NO91Lys Ala Phe Asp Phe Glu Glu Thr Ser Arg Tyr Val Leu Ser Val Glu1 5 10 15Ala Lys Asp Gly Gly Val His Thr Ala His Cys Asn Val Gln Ile Glu20 25 30Ile Val Asp Glu Asn Asp Asn Ala Pro Glu Val Thr Phe Met Ser Phe35 40 45Ser Asn Gln Ile Pro Glu Asp Ser Asp Leu Gly Thr Val Ile Ala Leu50 55 60Ile Lys Val Arg Asp Lys Asp Ser Gly Gln Asn Gly Met Val Thr Cys65 70 75 80Tyr Thr Gln Glu Glu Val Pro Phe Lys Leu Glu Ser Thr Ser Lys Asn85 90 95Tyr Tyr Lys Leu Val Ile Ala Gly Ala Leu Asn Arg Glu Gln Thr Ala100 105 110Asp Tyr Asn Val Thr Ile Ile Ala Thr Asp Lys Gly Lys Pro Ala Leu115 120 125Ser Ser Arg Thr Ser Ile Thr Leu His Ile Ser Asp Ile Asn Asp Asn130 135 140Ala Pro145(2)SEQ ID NO92的信息(i)序列特征(A)长度131碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO92AAGCGAGTGG ATTACGAGGC CACTCGGAAT TATAAGCTGA GAGTTAAGGC TACTGATCTT 60GGGATTCCAC CGAGATCTTC TAACATGACA CTGTTCATTC ATGTCCTTGA TGTTAACGAC 120AACGCTCCCT T 131(2)SEQ ID NO93的信息(i)序列特征(A)长度43氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO93Lys Arg val Asp Tyr Glu Ala Thr Arg Asn Tyr Lys Leu Arg Val Lys1 5 10 15Ala Thr Asp Leu Gly Ile Pro Pro Arg Ser Ser Asn Met Thr Leu Phe20 25 30Ile His Val Leu Asp Val Asn Asp Asn Ala Pro35 40(2)SEQ ID NO94的信息(i)序列特征(A)长度4104碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(ix)特征(A)名称/键CDS(B)定位495..3572(xi)序列描述SEQ ID NO94CCTCTATTCG ACATTCTCTT TGGATTGTTT TGCTATAACT TGAAATTTGG GATGTCACAA 60ACGAAACTGT CATCTGTTTC CGCCAAACTG TGGTTCTGCT AATCTCCCAG GCTGGCAGCA 120TTGGAGACTT GCTGACTTCT TTCATCCCCC ACTCTTTTCA CCTGAAATTC CTTTCCTTGG 180TTTTGCTCTA AGTCCTATGC TTCAGTCAGG GGCCAACCAA ATCTCACTGC CTCCTTTTTA 240TCATGAAGCC TTTGATCACT GATAGTTCTT TTTATATCTT GAAAAATCAC CCTTCCCAGT 300ACAGTTAATA TTTAGTATCT CTACTCATCT TGGCACTTAC TCACAGCTCC ATAATTCAGT 360CGTTTTCGTA CCTCTTCATG GTGATGGGGA GCCCTTTGGA GGTGGTGACT GTGCTTTATA 420CTCCTCATGA TGCTTCACAT GTGGCAGGCG TGGAGTGCCC GGAGGCGGCC CTCCTGATTC 480TGGGGCCTCC CAGG ATG GAG CCC CTG AGG CAC AGC CCA GGC CCT GGG GGG530Met Glu Pro Leu Arg His Ser Pro Gly Pro Gly Gly1 5 10CAA CGG CTA CTG CTG CCC TCC ATG CTG CTA GCA CTG CTG CTC CTG CTG578Gln Arg Leu Leu Leu Pro Ser Met Leu Leu Ala Leu Leu Leu Leu Leu15 20 25GCT CCA TCC CCA GGC CAC GCC ACT CGG GTA GTG TAC AAG GTG CCG GAG626Ala Pro Ser Pro Gly His Ala Thr Arg Val Val Tyr Lys Val Pro Glu30 35 40GAA CAG CCA CCC AAC ACC CTC ATT GGG AGC CTC GCA GCC GAC TAT GGT674Glu Gln Pro Pro Asn Thr Leu Ile Gly Ser Leu Ala Ala Asp Tyr Gly45 50 55 60TTT CCA GAT GTG GGG CAC CTG TAC AAG CTA GAG GTG GGT GCC CCG TAC722Phe Pro Asp Val Gly His Leu Tyr Lys Leu Glu Val Gly Ala Pro Tyr65 70 75CTT CGC GTG GAT GGC AAG ACA GGT GAC ATT TTC ACC ACC GAG ACC TCC770Leu Arg Val Asp Gly Lys Thr Gly Asp Ile Phe Thr Thr Glu Thr Ser80 85 90ATC GAC CGT GAG GGG CTC CGT GAA TGC CAG AAC CAG CTC CCT GGT GAT818Ile Asp Arg Glu Gly Leu Arg Glu Cys Gln Asn Gln Leu Pro Gly Asp95 100 105CCC TGC ATC CTG GAG TTT GAG GTA TCT ATC ACA GAC CTC GTG CAG AAT866Pro Cys Ile Leu Glu Phe Glu Val Ser Ile Thr Asp Leu Val Gln Asn110 115 120GCG AGC CCC CGG CTG CTA GAG GGC CAG ATA GAA GTA CAA GAC ATC AAT914Ala Ser Pro Arg Leu Leu Glu Gly Gln Ile Glu Val Gln Asp Ile Asn125 130 135 140GAC AAC ACA CCC AAC TTC GCC TCA CCA GTC ATC ACT CTG GCC ATC CCT962Asp Asn Thr Pro Asn Phe Ala Ser Pro Val Ile Thr Leu Ala Ile Pro145 150 155GAG AAC ACC AAC ATC GGC TCA CTC TTC CCC ATC CCG CTG GCT TCA GAC 1010Glu Asn Thr Asn Ile Gly Ser Leu Phe Pro Ile Pro Leu Ala Ser Asp160 165 170CGT GAT GCT GGT CCC AAC GGT GTG GCA TCC TAT GAG CTG CAG GTG GCA 1058Arg Asp Ala Gly Pro Asn Gly Val Ala Ser Tyr Glu Leu Gln Val Ala175 180 185GAG GAC CAG GAG GAG AAG CAA CCA CAG CTC ATT GTG ATG GGC AAC CTG 1106Glu Asp Gin Glu Glu Lys Gln Pro Gln Leu Ile Val Met Gly Asn Leu190 195 200GAC CGT GAG CGC TGG GAC TCC TAT GAC CTC ACC ATC AAG GTG CAG GAT 1154Asp Arg Glu Arg Trp Asp Ser Tyr Asp Leu Thr Ile Lys Val Gln Asp205 210 215 220GGC GGC AGC CCC CCA CGC GCC ACG AGT GCC CTG CTG CGT GTC ACC GTG 1202Gly Gly Ser Pro Pro Arg Ala Thr Ser Ala Leu Leu Arg Val Thr Val225 230 235CTT GAC ACC AAT GAC AAC GCC CCC AAG TTT GAG CGG CCC TCC TAT GAG 1250Leu Asp Thr Asn Asp Asn Ala Pro Lys Phe Glu Arg Pro Ser Tyr Glu240 245 250GCC GAA CTA TCT GAG AAT AGC CCC ATA GGC CAC TCG GTC ATC CAG GTG 1298Ala Glu Leu Ser Glu Asn Ser Pro Ile Gly His Ser Val Ile Gln Val255 260 265AAG GCC AAT GAC TCA GAC CAA GGT GCC AAT GCA GAA ATC GAA TAC ACA 1346Lys Ala Asn Asp Ser Asp Gln Gly Ala Asn Ala Glu Ile Glu Tyr Thr270 275 280TTC CAC CAG GCG CCC GAA GTT GTG AGG CGT CTT CTT CGA CTG GAC AGG 1394Phe His Gln Ala Pro Glu Val Val Arg Arg Leu Leu Arg Leu Asp Arg285 290 295 300AAC ACT GGA CTT ATC ACT GTT CAG GGC CCG GTG GAC CGT GAG GAC CTA 1442Asn Thr Gly Leu Ile Thr Val Gln Gly Pro Val Asp Arg Glu Asp Leu305 310 315AGC ACC CTG CGC TTC TCA GTG CTT GCT AAG GAC CGA GGC ACC AAC CCC 1490Ser Thr Leu Arg Phe Ser Val Leu Ala Lys Asp Arg Gly Thr Asn Pro320 325 330AAG AGT GCC CGT GCC CAG GTG GTT GTG ACC GTG AAG GAC ATG AAT GAC 1538Lys Ser Ala Arg Ala Gln Val Val Val Thr Val Lys Asp Met Asn Asp335 340 345AAT GCC CCC ACC ATT GAG ATC CGG GGC ATA GGG CTA GTG ACT CAT CAA 1586Asn Ala Pro Thr Ile Glu Ile Arg Gly Ile Gly Leu Val Thr His Gln350 355 360GAT GGG ATG GCT AAC ATC TCA GAG GAT GTG GCA GAG GAG ACA GCT GTG 1634Asp Gly Met Ala Asn Ile Ser Glu Asp Val Ala Glu Glu Thr Ala Val365 370 375 380GCC CTG GTG CAG GTG TCT GAC CGA GAT GAG GGA GAG AAT GCA GCT GTC 1682Ala Leu Val Gln Val Ser Asp Arg Asp Glu Gly Glu Asn Ala Ala Val385 390 395ACC TGT GTG GTG GCA GGT GAT GTG CCC TTC CAG CTG CGC CAG GCC AGT 1730Thr Cys Val Val Ala Gly Asp Val Pro Phe Gln Leu Arg Gln Ala Ser400 405 410GAG ACA GGC AGT GAC AGC AAG AAG AAG TAT TTC CTG CAG ACT ACC ACC 1778Glu Thr Gly Ser Asp Ser Lys Lys Lys Tyr Phe Leu Gln Thr Thr Thr415 420 425CCG CTA GAC TAC GAG AAG GTC AAA GAC TAC ACC ATT GAG ATT GTG GCT 1826Pro Leu Asp Tyr Glu Lys Val Lys Asp Tyr Thr Ile Glu Ile Val Ala430 435 440GTG GAC TCT GGC AAC CCC CCA CTC TCC AGC ACT AAC TCC CTC AAG GTG 1874Val Asp Ser Gly Asn Pro Pro Leu Ser Ser Thr Asn Ser Leu Lys Val445 450 455 460CAG GTG GTG GAC GTC AAT GAC AAC GCA CCT GTC TTC ACT CAG AGT GTC 1922Gln Val Val Asp Val Asn Asp Asn Ala Pro Val Phe Thr Gln Ser Val465 470 475ACT GAG GTC GCC TTC CCG GAA AAC AAC AAG CCT GGT GAA GTG ATT GCT 1970Thr Glu Val Ala Phe Pro Glu Asn Asn Lys Pro Gly Glu Val Ile Ala480 485 490GAG ATC ACT GCC AGT GAT GCT GAC TCT GGC TCT AAT GCT GAG CTG GTT 2018Glu Ile Thr Ala Ser Asp Ala Asp Ser Gly Ser Asn Ala Glu Leu Val495 500 505TAC TCT CTG GAG CCT GAG CCG GCT GCT AAG GGC CTC TTC ACC ATC TCA 2066Tyr Ser Leu Glu Pro Glu Pro Ala Ala Lys Gly Leu Phe Thr Ile Ser510 515 520CCC GAG ACT GGA GAG ATC CAG GTG AAG ACA TCT CTG GAT CGG GAA CAG 2114Pro Glu Thr Gly Glu Ile Gln Val Lys Thr Ser Leu Asp Arg Glu Gln525 530 535 540CGG GAG AGC TAT GAG TTG AAG GTG GTG GCA GCT GAC CGG GGC AGT CCT 2162Arg Glu Ser Tyr Glu Leu Lys Val Val Ala Ala Asp Arg Gly Ser Pro545 550 555AGC CTC CAG GGC ACA GCC ACT GTC CTT GTC AAT GTG CTG GAC TGC AAT 2210Ser Leu Gln Gly Thr Ala Thr Val Leu Val Asn Val Leu Asp Cys Asn560 565 570GAC AAT GAC CCC AAA TTT ATG CTG AGT GGC TAC AAC TTC TCA GTG ATG 2258Asp Asn Asp Pro Lys Phe Met Leu Ser Gly Tyr Asn Phe Ser Val Met575 580 585GAG AAC ATG CCA GCA CTG AGT CCA GTG GGC ATG GTG ACT GTC ATT GAT 2306Glu Asn Met Pro Ala Leu Ser Pro Val Gly Met Val Thr Val Ile Asp590 595 600GGA GAC AAG GGG GAG AAT GCC CAG GTG CAG CTC TCA GTG GAG CAG GAC 2354Gly Asp Lys Gly Glu Asn Ala Gln Val Gln Leu Ser Val Glu Gln Asp605 610 615 620AAC GGT GAC TTT GTT ATC CAG AAT GGC ACA GGC ACC ATC CTA TCC AGC 2402Asn Gly Asp Phe Val Ils Gln Asn Gly Thr Gly Thr Ile Leu Ser Ser625 630 635CTG AGC TTT GAT CGA GAG CAA CAA AGC ACC TAC ACC TTC CAG CTG AAG 2450Leu Ser Phe Asp Arg Glu Gln Gln Ser Thr Tyr Thr Phe Gln Leu Lys640 645 650GCA GTG GAT GGT GGC GTC CCA CCT CGC TCA GCT TAC GTT GGT GTC ACC2498Ala Val Asp Gly Gly Val Pro Pro Arg Ser Ala Tyr Val Gly Val Thr655 660 665ATC AAT GTG CTG GAC GAG AAT GAC AAC GCA CCC TAT ATC ACT GCC CCT2546Ile Asn Val Leu Asp Glu Asn Asp Asn Ala Pro Tyr Ile Thr Ala Pro670 675 680TCT AAC ACC TCT CAC AAG CTG CTG ACC CCC CAG ACA CGT CTT GGT GAG2594Ser Asn Thr Ser His Lys Leu Leu Thr Pro Gln Thr Arg Leu Gly Glu685 690 695 700ACG GTC AGC CAG GTG GCA GCC GAG GAC TTT GAC TCT GGT GTC AAT GCC2642Thr Val Ser Gln Val Ala Ala Glu Asp Phe Asp Ser Gly Val Asn Ala705 710 715GAG CTG ATC TAC AGC ATT GCA GGT GGC AAC CCT TAT GGA CTC TTC CAG2690Glu Leu Ile Tyr Ser Ile Ala Gly Gly Asn Pro Tyr Gly Leu Phe Gln720 725 730ATT GGG TCA CAT TCA GGT GCC ATC ACC CTG GAG AAG GAG ATT GAG CGG2738Ile Gly Ser His Ser Gly Ala Ile Thr Leu Glu Lys Glu Ile Glu Arg735 740 745CGC CAC CAT GGG CTA CAC CGC CTG GTG GTG AAG GTC AGT GAC CGC GGC2786Arg His His Gly Leu His Arg Leu Val Val Lys Val Ser Asp Arg Gly750 755 760AAG CCC CCA CGC TAT GGC ACA GCC TTG GTC CAT CTT TAT GTC AAT GAG2834Lys Pro Pro Arg Tyr Gly Thr Ala Leu Val His Leu Tyr Val Asn Glu765 770 775 780ACT CTG GCC AAC CGC ACG CTG CTG GAG ACC CTC CTG GGC CAC AGC CTG2882Thr Leu Ala Asn Arg Thr Leu Leu Glu Thr Leu Leu Gly His Ser Leu785 790 795GAC ACG CCG CTG GAT ATT GAC ATT GCT GGG GAT CCA GAA TAT GAG CGC2930Aap Thr Pro Leu Asp Ile Asp Ile Ala Gly Asp Pro Glu Tyr Glu Arg800 805 810TCC AAG CAG CGT GGC AAC ATT CTC TTT GGT GTG GTG GCT GGT GTG GTG2978Ser Lys Gln Arg Gly Asn Ile Leu Phe Gly Val Val Ala Gly Val Val815 820 825GCC GTG GCC TTG CTC ATC GCC CTG GCG GTT CTT GTG CGC TAC TGC AGA3026Ala Val Ala Leu Leu Ile Ala Leu Ala Val Leu Val Arg Tyr Cys Arg830 835 840CAG CGG GAG GCC AAA AGT GGT TAC CAG GCT GGT AAG AAG GAG ACC AAG3074Gln Arg Glu Ala Lys Ser Gly Tyr Gln Ala Gly Lys Lys Glu Thr Lys845 850 855 860GAC CTG TAT GCC CCC AAG CCC AGT GGC AAG GCC TCC AAG GGA AAC AAA3122Asp Leu Tyr Ala Pro Lys Pro Ser Gly Lys Ala Ser Lys Gly Asn Lys865 870 875AGC AAA GGC AAG AAG AGC AAG TCC CCA AAG CCC GTG AAG CCA GTG GAG3170Ser Lys Gly Lys Lys Ser Lys Ser Pro Lys Pro Val Lys Pro Val Glu880 885 890GAC GAG GAT GAG GCC GGG CTG CAG AAG TCC CTC AAG TTC AAC CTG ATG3218Asp Glu Asp Glu Ala Gly Leu Gln Lys Ser Leu Lys Phe Aen Leu Met895 900 905AGC GAT GCC CCT GGG GAC AGT CCC CGC ATC CAC CTG CCC CTC AAC TAC3266Ser Asp Ala Pro Gly Asp Ser Pro Arg Ile His Leu Pro Leu Asn Tyr910 915 920CCA CCA GGC AGC CCT GAC CTG GGC CGC CAC TAT CGC TCT AAC TCC CCA3314Pro Pro Gly Ser Pro Asp Leu Gly Arg His Tyr Arg Ser Asn Ser Pro925 930 935 940CTG CCT TCC ATC CAG CTG CAG CCC CAG TCA CCC TCA GCC TCC AAG AAG3362Leu Pro Ser Ile Gln Leu Gln Pro Gln Ser Pro Ser Ala Ser Lys Lys945 950 955CAC CAG GTG GTA CAG GAC CTG CCA CCT GCA AAC ACA TTC GTG GGC ACC3410His Gln Val Val Gln Asp Leu Pro Pro Ala Asn Thr Phe Val Gly Thr960 965 970GGG GAC ACC ACG TCC ACG GGC TCT GAG CAG TAC TCC GAC TAC AGC TAC3458Gly Asp Thr Thr Ser Thr Gly Ser Glu Gln Tyr Ser Asp Tyr Ser Tyr975 980 985CGC ACC AAC CCC CCC AAA TAC CCC AGC AAG CAG GTA GGC CAG CCC TTT3506Arg Thr Asn Pro Pro Lys Tyr Pro Ser Lys Gln Val Gly Gln Pro Phe990 995 1000CAG CTC AGC ACA CCC CAG CCC CTA CCC CAC CCC TAC CAC GGA GCC ATC3554Gln Leu Ser Thr Pro Gln Pro Leu Pro His Pro Tyr His Gly Ala Ile1005101010151020TGG ACC GAG GTG TGG GAG TGATGGAGCA GGTTTACTGT GCCTGCCCGT 3602Trp Thr Glu Val Trp Glu1025GTTGGGGGCC AGCCTGAGCC AGCAGTGGGA GGTGGGGCCT TAGTGCCTCA CCGGGCACAC 3662GGATTAGGCT GAGTGAAGAT TAAGGGAGGG TGTGCTCTGT GGTCTCCTCC CTGCCCTCTC 3722CCCACTGGGG AGAGACCTGT GATTTGCCAA GTCCCTGGAC CCTGGACCAG CTACTGGGCC 3782TTATGGGTTG GGGGTGGTAG GCAGGTGAGC GTAAGTGGGG AGGGAAATGG GTAAGAAGTC 3842TACTCCAAAC CTAGGTCTCT ATGTCAGACC AGACCTAGGT GCTTCTCTAG GAGGGAAACA 3902GGGAGACCTG GGGTCCTGTG GATAACTGAG TGGGGAGTCT GCCAGGGGAG GGCACCTTCC 3962CATTGTGCCT TCTGTGTGTA TTGTGCATTA ACCTCTTCCT CACCACTAGG CTTCTGGGGC 4022TGGGTCCCAC ATGCCCTTGA CCCTGACAAT AAAGTTCTCT ATTTTTGGAA AAAAAAAAAA 4082AAAAAAAAAA AAAAAAAAAA AA 4104(2)SEQ ID NO95的信息(i)序列特征(A)长度1026氨基酸(B)类型氨基酸(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO95Met Glu Pro Leu Arg His Ser Pro Gly Pro Gly Gly Gln Arg Leu Leu1 5 10 15Leu Pro Ser Met Leu Leu Ala Leu Leu Leu Leu Leu Ala Pro Ser Pro20 25 30Gly His Ala Thr Arg Val Val Tyr Lys Val Pro Glu Glu Gln Pro Pro35 40 45Asn Thr Leu Ile Gly Ser Leu Ala Ala Asp Tyr Gly Phe Pro Asp Val50 55 60Gly His Leu Tyr Lys Leu Glu Val Gly Ala Pro Tyr Leu Arg val Asp65 70 75 80Gly Lys Thr Gly Asp Ile Phe Thr Thr Glu Thr Ser Ile Asp Arg Glu85 90 95Gly Leu Arg Glu Cys Gln Asn Gln Leu Pro Gly Asp Pro Cys Ile Leu100 105 110Glu Phe Glu Val Ser Ile Thr Asp Leu Val Gln Asn Ala Ser Pro Arg115 120 125Leu Leu Glu Gly Gln Ile Glu Val Gln Asp Ile Asn Asp Asn Thr Pro130 135 140Asn Phe Ala Ser Pro Val Ile Thr Leu Ala Ile Pro Glu Asn Thr Asn145 150 155 160Ile Gly Ser Leu Phe Pro Ile Pro Leu Ala Ser Asp Arg Asp Ala Gly165 170 175Pro Asn Gly Val Ala Ser Tyr Glu Leu Gln Val Ala Glu Asp Gln Glu180 185 190Glu Lys Gln Pro Gln Leu Ile Val Met Gly Asn Leu Asp Arg Glu Arg195 200 205Trp Asp Ser Tyr Asp Leu Thr Ile Lys Val Gln Asp Gly Gly Ser Pro210 215 220Pro Arg Ala Thr Ser Ala Leu Leu Arg Val Thr Val Leu Asp Thr Asn225 230 235 240Asp Asn Ala Pro Lys Phe Glu Arg Pro Ser Tyr Glu Ala Glu Leu Ser245 250 255Glu Asn Ser Pro Ile Gly His Ser Val Ile Gln Val Lye Ala Asn Asp260 265 270
Ser Asp Gln Gly Ala Asn Ala Glu Ile Glu Tyr Thr Phe His Gln Ala275 280 285Pro Glu Val Val Arg Arg Leu Leu Arg Leu Asp Arg Asn Thr Gly Leu290 295 300Ile Thr Val Gln Gly Pro Val Asp Arg Glu Asp Leu Ser Thr Leu Arg305 310 315 320Phe Ser Val Leu Ala Lys Asp Arg Gly Thr Asn Pro Lys Ser Ala Arg325 330 335Ala Gln Val Val Val Thr Val Lys Asp Met Asn Asp Asn Ala Pro Thr340 345 350Ile Glu Ile Arg Gly Ile Gly Leu Val Thr His Gln Asp Gly Met Ala355 360 365Asn Ile Ser Glu Aep Val Ala Glu Glu Thr Ala Val Ala Leu Val Gln370 375 380Val Ser Asp Arg Asp Glu Gly Glu Asn Ala Ala Val Thr Cys Val Val385 390 395 400Ala Gly Asp Val Pro Phe Gln Leu Arg Gln Ala Ser Glu Thr Gly Ser405 410 415Asp Ser Lys Lys Lys Tyr Phe Leu Gln Thr Thr Thr Pro Leu Asp Tyr420 425 430Glu Lys Val Lys Asp Tyr Thr Ile Glu Ile Val Ala Val Asp Ser Gly435 440 445Asn Pro Pro Leu Ser Ser Thr Asn Ser Leu Lys Val Gln Val Val Asp450 455 460Val Asn Asp Asn Ala Pro Val Phe Thr Gln Ser Val Thr Glu Val Ala465 470 475 480Phe Pro Glu Asn Asn Lys Pro Gly Glu Val Ile Ala Glu Ile Thr Ala485 490 495Ser Asp Ala Asp Ser Gly Ser Asn Ala Glu Leu Val Tyr Ser Leu Glu500 505 510Pro Glu Pro Ala Ala Lys Gly Leu Phe Thr Ile Ser Pro Glu Thr Gly515 520 525Glu Ile Gln Val Lys Thr Ser Leu Asp Arg Glu Gln Arg Glu Ser Tyr530 535 540Glu Leu Lys Val Val Ala Ala Asp Arg Gly Ser Pro Ser Leu Gln Gly545 550 555 560Thr Ala Thr Val Leu Val Asn Val Leu Asp Cys Asn Asp Asn Asp Pro565 570 575Lys Phe Met Leu Ser Gly Tyr Asn Phe Ser Val Met Glu Asn Met Pro580 585 590
Ala Leu Ser Pro Val Gly Met Val Thr Val Ile Asp Gly Asp Lys Gly595 600 605Glu Asn Ala Gln Val Gln Leu Ser Val Glu Gln Asp Asn Gly Asp Phe610 615 620Val Ile Gln Asn Gly Thr Gly Thr Ile Leu Ser Ser Leu Ser Phe Asp625 630 635 640Arg Glu Gln Gln Ser Thr Tyr Thr Phe Gln Leu Lys Ala Val Asp Gly645 650 655Gly Val Pro Pro Arg Ser Ala Tyr Val Gly Val Thr Ile Asn Val Leu660 665 670Asp Glu Asn Asp Asn Ala Pro Tyr Ile Thr Ala Pro Ser Asn Thr Ser675 680 685His Lys Leu Leu Thr Pro Gln Thr Arg Leu Gly Glu Thr Val Ser Gln690 695 700Val Ala Ala Glu Asp Phe Asp Ser Gly Val Asn Ala Glu Leu Ile Tyr705 710 715 720Ser Ile Ala Gly Gly Asn Pro Tyr Gly Leu Phe Gln Ile Gly Ser Mis725 730 735Ser Gly Ala Ile Thr Leu Glu Lys Glu Ile Glu Arg Arg His His Gly740 745 750Leu His Arg Leu Val Val Lys Val Ser Asp Arg Gly Lys Pro Pro Arg755 760 765Tyr Gly Thr Ala Leu Val His Leu Tyr Val Asn Glu Thr Leu Ala Asn770 775 780Arg Thr Leu Leu Glu Thr Leu Leu Gly His Ser Leu Asp Thr Pro Leu785 790 795 800Asp Ile Asp Ile Ala Gly Asp Pro Glu Tyr Glu Arg Ser Lys Gln Arg805 810 815Gly Asn Ile Leu Phe Gly Val val Ala Gly Val Val Ala Val Ala Leu820 825 830Leu Ile Ala Leu Ala Val Leu Val Arg Tyr Cys Arg Gln Arg Glu Ala835 840 845Lys Ser Gly Tyr Gln Ala Gly Lys Lys Glu Thr Lys Asp Leu Tyr Ala850 855 860Pro Lys Pro Ser Gly Lys Ala Ser Lys Gly Asn Lys Ser Lys Gly Lys865 870 875 880Lys Ser Lys Ser Pro Lys Pro Val Lys Pro Val Glu Asp Glu Asp Glu885 890 895Ala Gly Leu Gln Lys Ser Leu Lys Phe Asn Leu Met Ser Asp Ala Pro900 905 910
Gly Asp Ser Pro Arg Ile His Leu Pro Leu Asn Tyr Pro Pro Gly Ser915 920 925Pro Asp Leu Gly Arg Hia Tyr Arg Ser Asn Ser Pro Leu Pro Ser Ile930 935 940Gln Leu Gln Pro Gln Ser Pro Ser Ala Ser Lys Lys His Gln Val Val945 950 955 960Gln Asp Leu Pro Pro Ala Asn Thr Phe Val Gly Thr Gly Asp Thr Thr965 970 975Ser Thr Gly Ser Glu Gln Tyr Ser Asp Tyr Ser Tyr Arg Thr Asn Pro980 985 990Pro Lys Tyr Pro Ser Lys Gln Val Gly Gln Pro Phe Gln Leu Ser Thr995 10001005Pro Gln Pro Leu Pro His Pro Tyr His Gly Ala Ile Trp Thr Glu Val101010151020Trp Glu1025(2)SEQ ID NO96的信息(i)序列特征(A)长度4705碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(ix)特征(A)名称/键CDS(B)定位115..2827(xi)序列描述SEQ ID NO96CGAAAGCCAT GTCGGACTCG TCGCCCAGCG CCCAAGCGCT AACCCGCTGA AAGTTTCTCA 60GCGAAATCTC AGGGACGATC TGGACCCCGC TGAGAGGAAC TGCTTTTGAG TGAG ATG117Met1GTC CCA GAG GCC TGG AGG AGC GGA CTG GTA AGC ACC GGG AGG GTA GTG165Val Pro Glu Ala Trp Arg Ser Gly Leu Val Ser Thr Gly Arg Val Val5 10 15GGA GTT TTG CTT CTG CTT GGT GCC TTG AAC AAG GCT TCC ACG GTC ATT213Gly Val Leu Leu Leu Leu Gly Ala Leu Asn Lys Ala Ser Thr Val Ile20 25 30CAC TAT GAG ATC CCG GAG GAA AGA GAG AAG GGT TTC GCT GTG GGC AAC261His Tyr Glu Ile Pro Glu Glu Arg Glu Lys Gly Phe Ala Val Gly Asn35 40 45GTG GTG GCG AAC CTT GGT TTG GAT CTC GGT AGC CTC TCA GCC CGC AGG309Va1 Val Ala Asn Leu Gly Leu Asp Leu Gly Ser Leu Ser Ala Arg Arg50 55 60 65TTC CCG GTG GTG TCT GGA GCT AGC CGA AGA TTC TTT GAG GTG AAC CGG357Phe Pro Val Val Ser Gly Ala Ser Arg Arg Phe Phe Glu Val Asn Arg70 75 80GAG ACC GGA GAG ATG TTT GTG AAC GAC CGT CTG GAT CGA GAG GAG CTG405Glu Thr Gly Glu Met Phe Val Asn Asp Arg Leu Asp Arg Glu Glu Leu85 90 95TGT GGG ACA CTG CCC TCT TGC ACT GTA ACT CTG GAG TTG GTA GTG GAG453Cys Gly Thr Leu Pro Ser Cys Thr Val Thr Leu Glu Leu Val Val Glu100 105 110AAC CCG CTG GAG CTG TTC AGC GTG GAA GTG GTG ATC CAG GAC ATC AAC501Ash Pro Leu Glu Leu Phe Ser Val Glu Val Val Ile Gln Asp Ile Asn115 120 125GAC AAC AAT CCT GCT TTC CCT ACC CAG GAA ATG AAA TTG GAG ATT AGC549Asp Asn Asn Pro Ala Phe Pro Thr Gln Glu Met Lys Leu Glu Ile Ser130 135 140 145GAG GCC GTG GCT CCG GGG ACG CGC TTT CCG CTC GAG AGC GCG CAC GAT597Glu Ala Val Ala Pro Gly Thr Arg Phe Pro Leu Glu Ser Ala His Asp150 155 160CCC GAT CTG GGA AGC AAC TCT TTA CAA ACC TAT GAG CTG AGC CGA AAT645Pro Asp Leu Gly Ser Asn Ser Leu Gln Thr Tyr Glu Leu Ser Arg Asn165 170 175GAA TAC TTT GCG CTT CGC GTG CAG ACG CGG GAG GAC AGC ACC AAG TAC693Glu Tyr Phe Ala Leu Arg Val Gln Thr Arg Glu Asp Ser Thr Lys Tyr180 185 190GCG GAG CTG GTG TTG GAG CGC GCC CTG GAC CGA GAA CGG GAG CCT AGT741Ala Glu Leu Val Leu Glu Arg Ala Leu Asp Arg Glu Arg Glu Pro Ser195 200 205CTC CAG TTA GTG CTG ACG GCG TTG GAC GGA GGG ACC CCA GCT CTC TCC789Leu Gln Leu Val Leu Thr Ala Leu Asp Gly Gly Thr Pro Ala Leu Ser210 215 220 225GCC AGC CTG CCT ATT CAC ATC AAG GTG CTG GAC GCG AAT GAC AAT GCG837Ala Ser Leu Pro Ile His Ile Lys Val Leu Asp Ala Asn Asp Asn Ala230 235 240CCT GTC TTC AAC CAG TCC TTG TAC CGG GCG CGC GTT CCT GGA GGA TGC885Pro Val Phe Asn Gln Ser Leu Tyr Arg Ala Arg Val Pro Gly Gly Cys245 250 255ACC TCC GGC ACG CGC GTG GTA CAA GTC CTT GCA ACG GAT CTG GAT GAA933Thr Ser Gly Thr Arg Val Val Gln Val Leu Ala Thr Asp Leu Asp Glu260 265 270GGC CCC AAC GGT GAA ATT ATT TAC TCC TTC GGC AGC CAC AAC CGC GCC981Gly Pro Asn Gly Glu Ile Ile Tyr Ser Phe Gly Ser His Asn Arg Ala275 280 285GGC GTG CGG CAA CTA TTC GCC TTA GAC CTT GTA ACC GGG ATG CTG ACA1029Gly Val Arg Gln Leu Phe Ala Leu Asp Leu Val Thr Gly Met Leu Thr290 295 300 305ATC AAG GGT CGG CTG GAC TTC GAG GAC ACC AAA CTC CAT GAG ATT TAC1077Ile Lys Gly Arg Leu Asp Phe Glu Asp Thr Lys Leu His Glu Ile Tyr310 315 320ATC CAG GCC AAA GAC AAG GGC GCC AAT CCC GAA GGA GCA CAT TGC AAA1125Ile Gln Ala Lye Asp Lys Gly Ala Asn Pro Glu Gly Ala His Cys Lys325 330 335GTG TTG GTG GAG GTT GTG GAT GTG AAT GAC AAC GCC CCG GAG ATC ACA1173Val Leu Val Glu Val Val Asp Val Asn Asp Asn Ala Pro Glu Ile Thr340 345 350GTC ACC TCC GTG TAC AGC CCA GTA CCC GAG GAT GCC TCT GGG ACT GTC1221Val Thr Ser Val Tyr Ser Pro Val Pro Glu Asp Ala Ser Gly Thr Val355 360 365ATC GCT TTG CTC AGT GTG ACT GAC CTG GAT GCT GGC GAG AAC GGG CTG1269Ile Ala Leu Leu Ser Val Thr Asp Leu Asp Ala Gly Glu Asn Gly Leu370 375 380 385GTG ACC TGC GAA GTT CCA CCG GGT CTC CCT TTC AGC CTT ACT TCT TCC1317Val Thr Cys Glu Val Pro Pro Gly Leu Pro Phe Ser Leu Thr Ser Ser390 395 400CTC AAG AAT TAC TTC ACT TTG AAA ACC AGT GCA GAC CTG GAT CGG GAG1365Leu Lys Ash Tyr Phe Thr Leu Lys Thr Ser Ala Asp Leu Asp Arg Glu405 410 415ACT GTG CCA GAA TAC AAC CTC AGC ATC ACC GCC CGA GAC GCC GGA ACC1413Thr Val Pro Glu Tyr Asn Leu Ser Ile Thr Ala Arg Asp Ala Gly Thr420 425 430CCT TCC CTC TCA GCC CTT ACA ATA GTG CGT GTT CAA GTG TCC GAC ATC1461Pro Ser Leu Ser Ala Leu Thr Ile Val Arg Val Gln Val Ser Asp Ile435 440 445AAT GAC AAC CCT CCA CAA TCT TCT CAA TCT TCC TAC GAC GTT TAC ATT1509Asn Asp Asn Pro Pro Gln Ser Ser Gln Ser Ser Tyr Asp Val Tyr Ile450 455 460 465GAA GAA AAC AAC CTC CCC GGG GCT CCA ATA CTA AAC CTA AGT GTC TGG1557Glu Glu Asn Asn Leu Pro Gly Ala Pro Ile Leu Asn Leu Ser Val Trp470 475 480GAC CCC GAC GCC CCG CAG AAT GCT CGG CTT TCT TTC TTT CTC TTG GAG1605Asp Pro Asp Ala Pro Gln Asn Ala Arg Leu Ser Phe Phe Leu Leu Glu485 490 495CAA GGA GCT GAA ACC GGG CTA GTG GGT CGC TAT TTC ACA ATA AAT CGT1653Gln Gly Ala Glu Thr Gly Leu Val Gly Arg Tyr Phe Thr Ile Asn Arg500 505 510GAC AAT GGC ATA GTG TCA TCC TTA GTG CCC CTA GAC TAT GAG GAT CGG1701Asp Asn Gly Ile Val Ser Ser Leu Val Pro Leu Asp Tyr Glu Asp Arg515 520 525CGG GAA TTT GAA TTA ACA GCT CAT ATC AGC GAT GGG GGC ACC CCG GTC 1749Arg Glu Phs Glu Leu Thr Ala His Ils Ser Asp Gly Gly Thr Pro Val530 535 540 545CTA GCC ACC AAC ATC AGC GTG AAC ATA TTT GTC ACT GAT CGC AAT GAC 1797Leu Ala Thr Asn Ils Ser Val Asn Ils Phs Val Thr Asp Arg Asn Asp550 555 560AAT GCC CCC CAG GTC CTA TAT CCT CGG CCA GGT GGG AGC TCG GTG GAG 1845Asn Ala Pro Gln Val Leu Tyr Pro Arg Pro Gly Gly Ser Ser Val Glu565 570 575ATG CTG CCT CGA GGT ACC TCA GCT GGC CAC CTA GTG TCA CGG GTG GTA 1893Met Leu Pro Arg Gly Thr Ser Ala Gly His Leu Val Ser Arg Val Val580 585 590GGC TGG GAC GCG GAT GCA GGG CAC AAT GCC TGG CTC TCC TAC AGT CTC 1941Gly Trp Asp Ala Asp Ala Gly His Asn Ala Trp Leu Ser Tyr Ser Leu595 600 605TTT GGA TCC CCT AAC CAG AGC CTT TTT GCC ATA GGG CTG CAC ACT GGT 1989Phs Gly Ser Pro Asn Gln Ser Leu Phs Ala Ile Gly Leu His Thr Gly610 615 620 625CAA ATC AGT ACT GCC CGT CCA GTC CAA GAC ACA GAT TCA CCC AGG CAG 2037Gln Ils Ser Thr Ala Arg Pro Val Gln Asp Thr Asp Ser Pro Arg Gln630 635 640ACT CTC ACT GTC TTG ATC AAA GAC AAT GGG GAG CCT TCG CTC TCC ACC 2085Thr Leu Thr Val Leu Ile Lys Asp Asn Gly Glu Pro Ser Leu Ser Thr645 650 655ACT GCT ACC CTC ACT GTG TCA GTA ACC GAG GAC TCT CCT GAA GCC CGA 2133Thr Ala Thr Leu Thr Val Ser Val Thr Glu Asp Ser Pro Glu Ala Arg660 665 670GCC GAG TTC CCC TCT GGC TCT GCC CCC CGG GAG CAG AAA AAA AAT CTC 2181Ala Glu Phs Pro Ser Gly Ser Ala Pro Arg Glu Gln Lys Lys Asn Leu675 680 685ACC TTT TAT CTA CTT CTT TCT CTA ATC CTG GTT TCT GTG GGC TTC GTG 2229Thr Phs Tyr Leu Leu Leu Ser Leu Ils Leu Val Ser Val Gly Phs Val690 695 700 705GTC ACA GTG TTC GGA GTA ATC ATA TTC AAA GTT TAC AAG TGG AAG CAG 2277Val Thr Val Phs Gly Val Ils Ils Phs Lys Val Tyr Lys Trp Lys Gln710 715 720TCT AGA GAC CTA TAC CGA GCC CCG GTG AGC TCA CTG TAC CGA ACA CCA 2325Ser Arg Asp Leu Tyr Arg Ala Pro Val Ser Ser Leu Tyr Arg Thr Pro725 730 735GGG CCC TCC TTG CAC GCG GAC GCC GTG CGG GGA GGC CTG ATG TCG CCG 2373Gly Pro Ser Leu His Ala Asp Ala Val Arg Gly Gly Leu Met Ser Pro740 745 750CAC CTT TAC CAT CAG GTG TAT CTC ACC ACG GAC TCC CGC CGC AGC GAC 2421His Leu Tyr His Gln Val Tyr Leu Thr Thr Asp Ser Arg Arg Ser Asp755 760 765CCG CTG CTG AAG AAA CCT GGT GCA GCC AGT CCA CTG GCC AGC CGC CAG 2469Pro Leu Leu Lys Lys Pro Gly Ala Ala Ser Pro Leu Ala Ser Arg Gln770 775 780 785AAC ACG CTG CGG AGC TGT GAT CCG GTG TTC TAT AGG CAG GTG TTG GGT 2517Asn Thr Leu Arg Ser Cys Asp Pro Val Phe Tyr Arg Gln Val Leu Gly790 795 800GCA GAG AGC GCC CCT CCC GGA CAG CAA GCC CCG CCC AAC ACG GAC TGG 2565Ala Glu Ser Ala Pro Pro Gly Gln Gln Ala Pro Pro Asn Thr Asp Trp805 810 815CGT TTC TCT CAG GCC CAG AGA CCC GGC ACC AGC GGC TCC CAA AAT GGC 2613Arg Phe Ser Gln Ala Gln Arg Pro Gly Thr Ser Gly Ser Gln Asn Gly820 825 830GAT GAC ACC GGC ACC TGG CCC AAC AAC CAG TTT GAC ACA GAG ATG CTG 2661Asp Asp Thr Gly Thr Trp Pro Asn Asn Gln Phe Asp Thr Glu Met Leu835 840 845CAA GCC ATG ATC TTG GCG TCC GCC AGT GAA GCT GCT GAT GGG AGC TCC 2709Gln Ala Met Ile Leu Ala Ser Ala Ser Glu Ala Ala Asp Gly Ser Ser850 855 860 865ACC CTG GGA GGG GGT GCC GGC ACC ATG GGA TTG AGC GCC CGC TAC GGA 2757Thr Leu Gly Gly Gly Ala Gly Thr Met Gly Leu Ser Ala Arg Tyr Gly870 875 880CCC CAG TTC ACC CTG CAG CAC GTG CCC GAC TAC CGC CAG AAT GTC TAC 2805Pro Gln Phe Thr Leu Gln His Val Pro Asp Tyr Arg Gln Asn Val Tyr885 890 895ATC CCA GGC AGC AAT GCA CAC T GACCAACGCA GCTGGCAAGC GGATGGCAAG2857Ile Pro Gly Ser Asn Ala His900GCCCAGCAGG TGGCAATGGC AACAAGAAGA AGTCGGCAAG AAGGAGAAGA AGTAACATGG 2917AGGCCAGGCC AAGAGCCACA GGGCAGCCTC TCCCCGAACC AGCCCAGCTT CTCCTTACCT 2977GCACCCAGGC CTCAGAGTTT CAGGGCTAAC CCCCAGAATA CTGGTAGGGG CCAAGGCATC 3037TCCCTTGGAA ACAGAAACAA GTGCCATCAC ACCATCCCTT CCCCAGGTGT AATATCCAAA 3097GCAGTTCCGC TGGGAACCCC ATCCAATCAG TGGCTGTACC CATTTGGGTA GTGGGGTTCA 3157TGTAGACACC AAGAACCATT TGCCACACCC CGTTTAGTTA CAGCTGAACC CTCCATCTTC 3217CAAATCAATC AGGCCCATCC ATCCCATGCC TCCCTCCTCC CCACCCCACT CCAACAGTTC 3277CTCTTTCCCG AGTAAGGTGG TTGGGGTGTT GAAGTACCAA GTAACCTACA AGCCTCCTAG 3337TTCTGAAAAG TTGGAAGGGC ATCATGACCT CTTGGCCTCT CCTTTGATTC TCAATCTTCC 3397CCCAAAGCAT GGTTTGGTGC CAGCCCCTTC ACCTCCTTCC AGAGCCCAAG ATCAATGCTC 3457AAGTTTTGGA GGACATGATC ACCATCCCCA TGGTACTGAT GCTTGCTGGA TTTAGGGAGG 3517GCATTTTGCT ACCAAGCCTC TTCCCAACGC CCTGGGACCA GTCTTCTGTT TTGTTTTTCA 3577TTGTTTGAGC TTTCCACTGC ATGCCTTGAC TTCCCCCACC TCCTCCTCAA ACAAGAGACT 3637CCACTGCATG TTCCAAGACA GTATGGGGTG GTAAGATAAG GAAGGGAAGT GTGTGGATGT 3697GGATGGTGGG GGCATGGACA AAGCTTGACA CATCAAGTTA TCAAGGCCTT GGAGGAGGCT 3757CTGTATGTCC TCAGGGGACT GACAACATCC TCCAGATTCC AGCCATAAAC CAATAACTAG 3817GCTGGACCCT TCCCACTACA TAATAGGGCT CAGCCAGGCA GCCAGCTTTG GGCTGAGCTA 3877ACAGGACCAA TGGATTAACT GGCATTTCAG TCCAAGGAAG CTCGAAGCAG GTTTAGGACC 3937AGGTCCCCTT GAGAGGTCAG AGGGGCCTCT GTGGGTGCTG GGTACTCCAG AGGTGCCACT 3997GGTGGAAGGG TCAGCGGAGC CCCAGCAGGA AGGGTGGGCC AGCCAGGCCA TTCTTAGTCC 4057CTGGGTTGGG GAGGCAGGGA GCTAGGGCAG GGACCAAATG AACAGAAAGT CTCAGCCCAG 4117GATGGGGCTT CTTCAACAGG CCCCTGCCCT CCTGAAGCCT CAGTCCTTCA CCTTGCCAGG 4177TGCCGTTTCT CTTCCGTGAA GGCCACTGCC CAGGTCCCCA GTGCGCCCCC TAGTGGCCAT 4237AGCCTGGTTA AAGTTCCCCA GTGCCTCCTT GTGATAGACC TTCTTCTCCC ACCCCCTTCT 4297GCCCCTGGGT CCCCGGCCAT CCAGCGGGGC TGCCAGAGAA CCCCAGACCT GCCCTTACAG 4357TAGTGTAGCG CCCCCTCCCT CTTTCGGCTG GTGTAGAATA GCCAGTAGTG TAGTGCGGTG 4417TGCTTTTACG TGATGGCGGG TGGGCAGCGG GCGGCGGCGT CCGCGCAGCC GTCTGTCCTT 4477GATCTGCCCG CGGCGGCCCG TGTTGTGTTT TGTGCTGTGT CCAGCGCTAA GGCGACCCCC 4537TCCCCCGTAC TGACTTCTCC TATAAGCGCT TCTCTTCGCA TAGTCACGTA GCTCCCACCC 4597CACCCTCTTC CTGTGTCTCA CGCAAGTTTT ATACTCTAAT ATTTATATGG CTTTTTTTCT 4657TCGACAAAAA AATAATAAAA CGTTTCTTCT GAAAAAAAAA AAAAAAAA 4705(2)SEQ ID NO97的信息(i)序列特征(A)长度904氨基酸(B)类型氨基酸(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO97Met Val Pro Glu Ala Trp Arg Ser Gly Leu Val Ser Thr Gly Arg Val1 5 10 15Val Gly Val Leu Leu Leu Leu Gly Ala Leu Asn Lys Ala Ser Thr Val20 25 30Ile His Tyr Glu Ile Pro Glu Glu Arg Glu Lys Gly Phe Ala Val Gly35 40 45Asn Val Val Ala Asn Leu Gly Leu Asp Leu Gly Ser Leu Ser Ala Arg50 55 60
Arg Phe Pro Val Val Ser Gly Ala Ser Arg Arg Phe Phe Glu Val Asn65 70 75 80Arg Glu Thr Gly Glu Met Phe Vai Asn Asp Arg Leu Asp Arg Glu Glu85 90 95Leu Cys Gly Thr Leu Pro Ser Cys Thr Val Thr Leu Glu Leu Val Val100 105 110Glu Asn Pro Leu Glu Leu Phe Ser Val Glu Val Val Ile Gln Asp Ile115 120 125Asn Asp Asn Asn Pro Ala Phe Pro Thr Gln Glu Met Lys Leu Glu Ile130 135 140Ser Glu Ala Val Ala Pro Gly Thr Arg Phe Pro Leu Glu Ser Ala His145 150 155 160Aap Pro Asp Leu Gly Ser Asn Ser Leu Gln Thr Tyr Glu Leu Ser Arg165 170 175Asn Glu Tyr Phe Ala Leu Arg Val Gln Thr Arg Glu Asp Ser Thr Lys180 185 190Tyr Ala Glu Leu Val Leu Glu Arg Ala Leu Asp Arg Glu Arg Glu Prol95 200 205Ser Leu Gln Leu Val Leu Thr Ala Leu Asp Gly Gly Thr Pro Ala Leu210 215 220Ser Ala Ser Leu Pro Ile His Ile Lys Val Leu Asp Ala Asn Asp Asn225 230 235 240Ala Pro Val Phe Asn Gln Ser Leu Tyr Arg Ala Arg Val Pro Gly Gly245 250 255Cys Thr Ser Gly Thr Arg Val Val Gln Val Leu Ala Thr Asp Leu Asp260 265 270Glu Gly Pro Asn Gly Glu Ile Ile Tyr Ser Phe Gly Ser His Asn Arg275 280 285Ala Gly Val Arg Gln Leu Phe Ala Leu Asp Leu Val Thr Gly Met Leu290 295 300Thr Ile Lys Gly Arg Leu Asp Phe Glu Asp Thr Lys Leu His Glu Ile305 310 315 320Tyr Ile Gln Ala Lys Asp Lys Gly Ala Asn Pro Glu Gly Ala His Cys325 330 335Lys Val Leu Val Glu Val Val Asp Val Asn Asp Asn Ala Pro Glu Ile340 345 350Thr Val Thr Ser Val Tyr Ser Pro Val Pro Glu Asp Ala Ser Gly Thr355 360 365Val Ile Ala Leu Leu Ser Val Thr Asp Leu Asp Ala Gly Glu Asn Gly370 375 380
Leu Val Thr Cys Glu Val Pro Pro Gly Leu Pro Phe Ser Leu Thr Ser385 390 395 400Ser Leu Lys Asn Tyr Phe Thr Leu Lys Thr Ser Ala Asp Leu Asp Arg405 410 415Glu Thr val Pro Glu Tyr Asn Leu Ser Ile Thr Ala Arg Asp Ala Gly420 425 430Thr Pro Ser Leu Ser Ala Leu Thr Ile Val Arg Val Gln Val Ser Asp435 440 445Ile Asn Asp Asn Pro Pro Gln Ser Ser Gln Ser Ser Tyr Asp Val Tyr450 455 460Ile Glu Glu Asn Asn Leu Pro Gly Ala Pro Ile Leu Asn Leu Ser Val465 470 475 480Trp Asp Pro Asp Ala Pro Gln Asn Ala Arg Leu Ser Phe Phe Leu Leu485 490 495Glu Gln Gly Ala Glu Thr Gly Leu Val Gly Arg Tyr Phe Thr Ile Asn500 505 510Arg Asp Asn Gly Ile Val Ser Ser Leu Val Pro Leu Asp Tyr Glu Asp515 520 525Arg Arg Glu Phe Glu Leu Thr Ala His Ile Ser Asp Gly Gly Thr Pro530 535 540Val Leu Ala Thr Asn Ile Ser Val Asn Ile Phe Val Thr Asp Arg Asn545 550 555 560Asp Asn Ala Pro Gln Val Leu Tyr Pro Arg Pro Gly Gly Ser Ser Val565 570 575Glu Met Leu Pro Arg Gly Thr Ser Ala Gly His Leu Val Ser Arg Val580 585 590Val Gly Trp Asp Ala Asp Ala Gly His Asn Ala Trp Leu Ser Tyr Ser595 600 605Leu Phe Gly Ser Pro Asn Gln Ser Leu Phe Ala Ile Gly Leu His Thr610 615 620Gly Gln Ile Ser Thr Ala Arg Pro Val Gln Asp Thr Asp Ser Pro Arg625 630 635 640Gln Thr Leu Thr Val Leu Ile Lys Asp Asn Gly Glu Pro Ser Leu Ser645 650 655Thr Thr Ala Thr Leu Thr Val Ser Val Thr Glu Asp Ser Pro Glu Ala660 665 670Arg Ala Glu Phe Pro Ser Gly Ser Ala Pro Arg Glu Gln Lys Lys Asn675 680 685Leu Thr Phe Tyr Leu Leu Leu Ser Leu Ile Leu Val Ser Val Gly Phe690 695 700
Val Val Thr Val Phe Gly Val Ile Ile Phe Lys Val Tyr Lys Trp Lys705 710 715 720Gln Sar Arg Asp Leu Tyr Arg Ala Pro Val Ser Ser Leu Tyr Arg Thr725 730 735Pro Gly Pro Ser Leu His Ala Asp Ala Val Arg Gly Gly Leu Met Ser740 745 750Pro His Leu Tyr His Gln Val Tyr Leu Thr Thr Asp Ser Arg Arg Ser755 760 765Asp Pro Leu Leu Lys Lys Pro Gly Ala Ala Ser Pro Leu Ala Ser Arg770 775 780Gln Asn Thr Leu Arg Ser Cys Asp Pro Val Phe Tyr Arg Gln Val Leu785 790 795 800Gly Ala Glu Ser Ala Pro Pro Gly Gln Gln Ala Pro Pro Asn Thr Asp805 810 815Trp Arg Phe Ser Gln Ala Gln Arg Pro Gly Thr Ser Gly Ser Gln Asn820 825 830Gly Asp Asp Thr Gly Thr Trp Pro Asn Asn Gln Phe Asp Thr Glu Met835 840 845Leu Gln Ala Met Ile Leu Ala Ser Ala Ser Glu Ala Ala Asp Gly Ser850 855 860Ser Thr Leu Gly Gly Gly Ala Gly Thr Met Gly Leu Ser Ala Arg Tyr865 870 875 880Gly Pro Gln Phe Thr Leu Gln His Val Pro Asp Tyr Arg Gln Asn Val885 890 895Tyr Ile Pro Gly Ser Asn Ala His900(2)SEQ ID NO98的信息(i)序列特征(A)长度441氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO98Asp Trp Val Ile Pro Pro Ile Asn Leu Pro Glu Asn Ser Arg Gly Pro1 5 10 15Phe Pro Gln Glu Leu Val Arg Ile Arg Ser Asp Arg Asp Lys Asn Leu20 25 30
Ser Leu Arg Tyr Thr Val Thr Gly Pro Gly Ala Asp Gln Pro Pro Thr35 40 45Gly Ile Phe Ile Ile Asn Pro Ile Ser Gly Gln Leu Ser Val Thr Lys50 55 60Pro Leu Asp Arg Glu Gln Ile Ala Arg Phe His Leu Arg Ala His Ala65 70 75 80Val Asp Ile Asn Gly Asn Gln Val Glu Asn Pro Ile Asp Ile Val Ile85 90 95Asn Val Ile Asp Met Asn Asp Asn Arg Pro Glu Phe Thr Ala Met Thr100 105 110Phe Tyr Gly Glu Val Pro Glu Asn Arg Val Asp Ile Ile Val Ala Asn115 120 125Leu Thr Val Thr Asp Lys Asp Gln Pro His Thr Pro Ala Trp Asn Ala130 135 140Val Thr Arg Ile Ser Gly Gly Asp Pro Thr Gly Arg Phe Ala Ile Gln145 150 155 160Thr Asp Pro Asn Ser Asn Asp Gly Leu Val Thr Val Val Lys Pro Ile165 170 175Asp Phe Glu Thr Asn Arg Met Phe Val Leu Thr Val Ala Ala Glu Asn180 185 190Gln Val Pro Leu Ala Lys Gly Ile Gln His Pro Pro Gln Ser Thr Ala195 200 205Thr Val Ser Val Thr Val Ile Asp Val Asn Glu Asn Pro Tyr Phe Ala210 215 220Pro Asn Pro Lys Ile Ile Arg Gln Glu Glu Gly Leu His Ala Gly Thr225 230 235 240Met Leu Thr Thr Phe Thr Ala Gly Asp Pro Asp Arg Tyr Met Gln Gln245 250 255Asn Ile Arg Tyr Thr Lys Leu Ser Asp Pro Ala Asn Trp Leu Lys Ile260 265 270Asp Pro Val Asn Gly Gln Ile Thr Thr Ile Ala Val Leu Asp Arg Glu275 280 285Ser Pro Ash Val Lys Ash Asn Ile Tyr Asn Ala Thr Phe Leu Ala Ser290 295 300Asp Asn Gly Ile Pro Pro Met Ser Gly Thr Gly Thr Leu Gln Ile Tyr305 310 315 320Leu Leu Asp Ile Asn Asp Asn Ala Pro Gln Val Leu Pro Gln Glu Ala325 330 335Glu Thr Cys Glu Thr Pro Asp Pro Asn Ser Ile Asn Ile Thr Thr Ala340 345 350
Leu Asp Tyr Asp Ile Asp Pro Asn Ala Gly Pro Phe Ala Tyr Asp Leu355 360 365Pro Leu Ser Pro Val Thr Ile Lya Arg Aan Trp Thr Ile Thr Arg Leu370 375 380Asn Gly Asp Phe Ala Gln Leu Asn Leu Lya Ile Lys Phe Leu Glu Ala385 390 395 400Gly Ile Tyr Glu Val Pro Ile Ile Ile Thr Asp Ser Gly Asn Pro Pro405 410 415Lys Ser Asn Lys Ser Ile Leu Arg Val Arg Val Cys Gln Cys Asp Phe420 425 430Asn Gly Asp Cys Thr Asp Val Asp Arg435 440(2)SEQ ID NO99的信息(i)序列特征(A)长度105氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO99Glu Asp Thr Val Tyr Ser Phe Asp Ile Pro Glu Asn Ala Gln Arg Gly1 5 10 15Tyr Gln Val Gly Gln Ile Val Ala Arg Asp Ala Asp Leu Gly Gln Asn20 25 30Ala Gln Leu Ser Tyr Gly Val Val Ser Asp Trp Ala Asn Asp Val Phe35 40 45Ser Leu Asn Pro Gln Thr Gly Met Leu Thr Leu Thr Ala Arg Leu Asp50 55 60Tyr Glu Glu Val Gln His Tyr Ile Leu Ile Val Gln Ala Gln Asp Asn65 70 75 80Gly Gln Pro Ser Leu Ser Thr Thr Ile Thr Val Tyr Cys Asn Val Leu85 90 95Asp Leu Asn Asp Asn Ala Pro Ile Phe100 105(2)SEQ ID NO100的信息(i)序列特征(A)长度7氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO100Asp Xaa Asp Xaa Gly Xaa Asn1 5(2)SEQ ID NO101的信息(i)序列特征(A)长度7氨基酸(B)类型氨基酸(C)链型单股(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO101Ala Xaa Asp Xaa Gly Xaa Pro1 5(2)SEQ ID NO102的信息(i)序列特征(A)长度4650碱基对(B)类型核酸(C)链型单股
(D)拓扑结构线性(ii)分子类型cDNA(ix)特征(A)名称/键CDS(B)定位495..4103(xi)序列描述SEQ ID NO102CCTCTATTCG ACATTCTCTT TGGATTGTTT TGCTATAACT TGAAATTTGG GATGTCACAA 60ACGAAACTGT CATCTGTTTC CGCCAAACTG TGGTTCTGCT AATCTCCCAG GCTGGCAGCA 120TTGGAGACTT GCTGACTTCT TTCATCCCCC ACTCTTTTCA CCTGAAATTC CTTTCCTTGG 180TTTTGCTCTA AGTCCTATGC TTCAGTCAGG GGCCAACCAA ATCTCACTGC CTCCTTTTTA 240TCATGAAGCC TTTGATCACT GATAGTTCTT TTTATATCTT GAAAAATCAC CCTTCCCAGT 300ACAGTTAATA TTTAGTATCT CTACTCATCT TGGCACTTAC TCACAGCTCC ATAATTCAGT 360CGTTTTCGTA CCTCTTCATG GTGATGGGGA GCCCTTTGGA GGTGGTGACT GTGCTTTATA 420CTCCTCATGA TGCTTCACAT GTGGCAGGCG TGGAGTGCCC GGAGGCGGCC CTCCTGATTC 480TGGGGCCTCC CAGG ATG GAG CCC CTG AGG CAC AGC CCA GGC CCT GGG GGG530Met Glu Pro Leu Arg His Ser Pro Gly Pro Gly Gly1 5 10CAA CGG CTA CTG CTG CCC TCC ATG CTG CTA GCA CTG CTG CTC CTG CTG578Gln Arg Leu Leu Leu Pro Ser Met Leu Leu Ala Leu Leu Leu Leu Leu15 20 25GCT CCA TCC CCA GGC CAC GCC ACT CGG GTA GTG TAC AAG GTG CCG GAG626Ala Pro Ser Pro Gly His Ala Thr Arg Val Val Tyr Lys Val Pro Glu30 35 40GAA CAG CCA CCC AAC ACC CTC ATT GGG AGC CTC GCA GCC GAC TAT GGT674Glu Gln Pro Pro Asn Thr Leu Ile Gly Ser Leu Ala Ala Asp Tyr Gly45 50 55 60TTT CCA GAT GTG GGG CAC CTG TAC AAG CTA GAG GTG GGT GCC CCG TAC722Phe Pro Asp Val Gly His Leu Tyr Lys Leu Glu Val Gly Ala Pro Tyr65 70 75CTT CGC GTG GAT GGC AAG ACA GGT GAC ATT TTC ACC ACC GAG ACC TCC770Leu Arg Val Asp Gly Lys Thr Gly Asp Ile Phe Thr Thr Glu Thr Ser80 85 90ATC GAC CGT GAG GGG CTC CGT GAA TGC CAG AAC CAG CTC CCT GGT GAT818Ile Asp Arg Glu Gly Leu Arg Glu Cys Gln Asn Gln Leu Pro Gly Asp95 100 105CCC TGC ATC CTG GAG TTT GAG GTA TCT ATC ACA GAC CTC GTG CAG AAT866Pro Cys Ile Leu Glu Phe Glu Val Ser Ile Thr Asp Leu Val Gln Asn110 115 120GCG AGC CCC CGG CTG CTA GAG GGC CAG ATA GAA GTA CAA GAC ATC AAT 914Ala Ser Pro Arg Leu Leu Glu Gly Gln Ile Glu Val Gln Asp Ile Asn125 130 135 140GAC AAC ACA CCC AAC TTC GCC TCA CCA GTC ATC ACT CTG GCC ATC CCT 962Asp Asn Thr Pro Asn Phe Ala Ser Pro Val Ile Thr Leu Ala Ile Pro145 150 155GAG AAC ACC AAC ATC GGC TCA CTC TTC CCC ATC CCG CTG GCT TCA GAC 1010Glu Asn Thr Asn Ile Gly Ser Leu Phe Pro Ile Pro Leu Ala Ser Asp160 165 170CGT GAT GCT GGT CCC AAC GGT GTG GCA TCC TAT GAG CTG CAG GTG GCA 1058Arg Asp Ala Gly Pro Asn Gly Val Ala Ser Tyr Glu Leu Gln Val Ala175 180 185GAG GAC CAG GAG GAG AAG CAA CCA CAG CTC ATT GTG ATG GGC AAC CTG 1106Glu Asp Gln Glu Glu Lys Gln Pro Gln Leu Ile Val Met Gly Asn Leu190 195 200GAC CGT GAG CGC TGG GAC TCC TAT GAC CTC ACC ATC AAG GTG CAG GAT 1154Asp Arg Glu Arg Trp Asp Ser Tyr Asp Leu Thr Ile Lys Val Gln Asp205 210 215 220GGC GGC AGC CCC CCA CGC GCC ACG AGT GCC CTG CTG CGT GTC ACC GTG 1202Gly Gly Ser Pro Pro Arg Ala Thr Ser Ala Leu Leu Arg Val Thr Val225 230 235CTT GAC ACC AAT GAC AAC GCC CCC AAG TTT GAG CGG CCC TCC TAT GAG 1250Leu Asp Thr Asn Asp Asn Ala Pro Lys Phe Glu Arg Pro Ser Tyr Glu240 245 250GCC GAA CTA TCT GAG AAT AGC CCC ATA GGC CAC TCG GTC ATC CAG GTG 1298Ala Glu Leu Ser Glu Asn Ser Pro Ile Gly His Ser Val Ile Gln Val255 260 265AAG GCC AAT GAC TCA GAC CAA GGT GCC AAT GCA GAA ATC GAA TAC ACA 1346Lys Ala Asn Asp Ser Asp Gln Gly Ala Asn Ala Glu Ile Glu Tyr Thr270 275 280TTC CAC CAG GCG CCC GAA GTT GTG AGG CGT CTT CTT CGA CTG GAC AGG 1394Phe His Gln Ala Pro Glu Val Val Arg Arg Leu Leu Arg Leu Asp Arg285 290 295 300AAC ACT GGA CTT ATC ACT GTT CAG GGC CCG GTG GAC CGT GAG GAC CTA 1442Asn Thr Gly Leu Ile Thr Val Gln Gly Pro Val Asp Arg Glu Asp Leu305 310 315AGC ACC CTG CGC TTC TCA GTG CTT GCT AAG GAC CGA GGC ACC AAC CCC 1490Ser Thr Leu Arg Phe Ser Val Leu Ala Lys Asp Arg Gly Thr Asn Pro320 325 330AAG AGT GCC CGT GCC CAG GTG GTT GTG ACC GTG AAG GAC ATG AAT GAC 1538Lys Ser Ala Arg Ala Gln Val Val Val Thr Val Lye Asp Met Asn Asp335 340 345AAT GCC CCC ACC ATT GAG ATC CGG GGC ATA GGG CTA GTG ACT CAT CAA 1586Asn Ala Pro Thr Ile Glu Ile Arg Gly Ile Gly Leu Val Thr His Gln350 355 360GAT GGG ATG GCT AAC ATC TCA GAG GAT GTG GCA GAG GAG ACA GCT GTG1634Asp Gly Met Ala Asn Ile Ser Glu Asp Val Ala Glu Glu Thr Ala Val365 370 375 380GCC CTG GTG CAG GTG TCT GAC CGA GAT GAG GGA GAG AAT GCA GCT GTC1682Ala Leu Val Gln Val Ser Asp Arg Asp Glu Gly Glu Asn Ala Ala Val385 390 395ACC TGT GTG GTG GCA GGT GAT GTG CCC TTC CAG CTG CGC CAG GCC AGT1730Thr Cys Val Val Ala Gly Asp Val Pro Phe Gln Leu Arg Gln Ala Ser400 405 410GAG ACA GGC AGT GAC AGC AAG AAG AAG TAT TTC CTG CAG ACT ACC ACC1778Glu Thr Gly Ser Asp Ser Lys Lys Lys Tyr Phe Leu Gln Thr Thr Thr415 420 425CCG CTA GAC TAC GAG AAG GTC AAA GAC TAC ACC ATT GAG ATT GTG GCT1826Pro Leu Asp Tyr Glu Lys Val Lys Asp Tyr Thr Ile Glu Ile Val Ala430 435 440GTG GAC TCT GGC AAC CCC CCA CTC TCC AGC ACT AAC TCC CTC AAG GTG1874Val Asp Ser Gly Asn Pro Pro Leu Ser Ser Thr Asn Ser Leu Lys Val445 450 455 460CAG GTG GTG GAC GTC AAT GAC AAC GCA CCT GTC TTC ACT CAG AGT GTC1922Gln Val Val Asp Val Asn Asp Asn Ala Pro Val Phe Thr Gln Ser Val465 470 475ACT GAG GTC GCC TTC CCG GAA AAC AAC AAG CCT GGT GAA GTG ATT GCT1970Thr Glu Val Ala Phe Pro Glu Asn Asn Lys Pro Gly Glu Val Ile Ala480 485 490GAG ATC ACT GCC AGT GAT GCT GAC TCT GGC TCT AAT GCT GAG CTG GTT2018Glu Ile Thr Ala Ser Asp Ala Asp Ser Gly Ser Asn Ala Glu Leu Val495 500 505TAC TCT CTG GAG CCT GAG CCG GCT GCT AAG GGC CTC TTC ACC ATC TCA2066Tyr Ser Leu Glu Pro Glu Pro Ala Ala Lye Gly Leu Phe Thr Ile Ser510 515 520CCC GAG ACT GGA GAG ATC CAG GTG AAG ACA TCT CTG GAT CGG GAA CAG2114Pro Glu Thr Gly Glu Ile Gln Val Lys Thr Ser Leu Asp Arg Glu Gln525 530 535 540CGG GAG AGC TAT GAG TTG AAG GTG GTG GCA GCT GAC CGG GGC AGT CCT2162Arg Glu Ser Tyr Glu Leu Lys Val Val Ala Ala Asp Arg Gly Ser Pro545 550 555AGC CTC CAG GGC ACA GCC ACT GTC CTT GTC AAT GTG CTG GAC TGC AAT2210Ser Leu Gln Gly Thr Ala Thr Val Leu Val Asn Val Leu Asp Cys Asn560 565 570GAC AAT GAC CCC AAA TTT ATG CTG AGT GGC TAC AAC TTC TCA GTG ATG2258Asp Asn Asp Pro Lys Phe Met Leu Ser Gly Tyr Asn Phe Ser Val Met575 580 585GAG AAC ATG CCA GCA CTG AGT CCA GTG GGC ATG GTG ACT GTC ATT GAT2306Glu Aan Met Pro Ala Leu Ser Pro Val Gly Met Val Thr Val Ile Asp590 595 600GGA GAC AAG GGG GAG AAT GCC CAG GTG CAG CTC TCA GTG GAG CAG GAC 2354Gly Asp Lys Gly Glu Asn Ala Gln Val Gln Leu Ser Val Glu Gln Asp605 610 615 620AAC GGT GAC TTT GTT ATC CAG AAT GGC ACA GGC ACC ATC CTA TCC AGC 2402Asn Gly Asp Phe Val Ile Gln Asn Gly Thr Gly Thr Ile Leu Ser Ser625 630 635CTG AGC TTT GAT CGA GAG CAA CAA AGC ACC TAC ACC TTC CAG CTG AAG 2450Leu Ser Phe Asp Arg Glu Gln Gln Ser Thr Tyr Thr Phe Gln Leu Lys640 645 650GCA GTG GAT GGT GGC GTC CCA CCT CGC TCA GCT TAC GTT GGT GTC ACC 2498Ala Val Asp Gly Gly Val Pro Pro Arg Ser Ala Tyr Val Gly Val Thr655 660 665ATC AAT GTG CTG GAC GAG AAT GAC AAC GCA CCC TAT ATC ACT GCC CCT 2546Ile Asn Val Leu Asp Glu Asn Asp Asn Ala Pro Tyr Ile Thr Ala Pro670 675 680TCT AAC ACC TCT CAC AAG CTG CTG ACC CCC CAG ACA CGT CTT GGT GAG 2594Ser Aen Thr Ser His Lys Leu Leu Thr Pro Gln Thr Arg Leu Gly Glu685 690 695 700ACG GTC AGC CAG GTG GCA GCC GAG GAC TTT GAC TCT GGT GTC AAT GCC 2642Thr Val Ser Gln Val Ala Ala Glu Asp Phe Asp Ser Gly Val Asn Ala705 710 715GAG CTG ATC TAC AGC ATT GCA GGT GGC AAC CCT TAT GGA CTC TTC CAG 2690Glu Leu Ile Tyr Ser Ile Ala Gly Gly Asn Pro Tyr Gly Leu Phe Gln720 725 730ATT GGG TCA CAT TCA GGT GCC ATC ACC CTG GAG AAG GAG ATT GAG CGG 2738Ile Gly Ser His Ser Gly Ala Ile Thr Leu Glu Lys Glu Ile Glu Arg735 740 745CGC CAC CAT GGG CTA CAC CGC CTG GTG GTG AAG GTC AGT GAC CGC GGC 2786Arg His His Gly Leu His Arg Leu Val Val Lys Val Ser Asp Arg Gly750 755 760AAG CCC CCA CGC TAT GGC ACA GCC TTG GTC CAT CTT TAT GTC AAT GAG 2834Lys Pro Pro Arg Tyr Gly Thr Ala Leu Val His Leu Tyr Val Asn Glu765 770 775 780ACT CTG GCC AAC CGC ACG CTG CTG GAG ACC CTC CTG GGC CAC AGC CTG 2882Thr Leu Ala Asn Arg Thr Leu Leu Glu Thr Leu Leu Gly His Ser Leu785 790 795GAC ACG CCG CTG GAT ATT GAC ATT GCT GGG GAT CCA GAA TAT GAG CGC 2930Asp Thr Pro Leu Asp Ile Asp Ile Ala Gly Asp Pro Glu Tyr Glu Arg800 805 810TCC AAG CAG CGT GGC AAC ATT CTC TTT GGT GTG GTG GCT GGT GTG GTG 2978Ser Lys Gln Arg Gly Asn Ile Leu Phe Gly Val Val Ala Gly Val Val815 820 825GCC GTG GCC TTG CTC ATC GCC CTG GCG GTT CTT GTG CGC TAC TGC AGA 3026Ala Val Ala Leu Leu Ile Ala Leu Ala Val Leu Val Arg Tyr Cys Arg830 835 840CAG CGG GAG GCC AAA AGT GGT TAC CAG GCT GGT AAG AAG GAG ACC AAG 3074Gln Arg Glu Ala Lys Ser Gly Tyr Gln Ala Gly Lys Lys Glu Thr Lys845 850 855 860GAC CTG TAT GCC CCC AAG CCC AGT GGC AAG GCC TCC AAG GGA AAC AAA 3122Aap Leu Tyr Ala Pro Lys Pro Ser Gly Lys Ala Ser Lys Gly Asn Lys865 870 875AGC AAA GGC AAG AAG AGC AAG TCC CCA AAG CCC GTG AAG CCA GTG GAG 3170Ser Lys Gly Lys Lys Ser Lys Ser Pro Lys Pro Val Lys Pro Val Glu880 885 890GAC GAG GAT GAG GCC GGG CTG CAG AAG TCC CTC AAG TTC AAC CTG ATG 3218Asp Glu Asp Glu Ala Gly Leu Gln Lys Ser Leu Lys Phe Asn Leu Met895 900 905AGC GAT GCC CCT GGG GAC AGT CCC CGC ATC CAC CTG CCC CTC AAC TAC 3266Ser Asp Ala Pro Gly Asp Ser Pro Arg Ile His Leu Pro Leu Asn Tyr910 915 920CCA CCA GGC AGC CCT GAC CTG GGC CGC CAC TAT CGC TCT AAC TCC CCA 3314Pro Pro Gly Ser Pro Asp Leu Gly Arg His Tyr Arg Ser Asn Ser Pro925 930 935 940CTG CCT TCC ATC CAG CTG CAG CCC CAG TCA CCC TCA GCC TCC AAG AAG 3362Leu Pro Ser Ile Gln Leu Gln Pro Gln Ser Pro Ser Ala Ser Lys Lys945 950 955CAC CAG GTG GTA CAG GAC CTG CCA CCT GCA AAC ACA TTC GTG GGC ACC 3410His Gln Val Val Gln Asp Leu Pro Pro Ala Asn Thr Phe Val Gly Thr960 965 970GGG GAC ACC ACG TCC ACG GGC TCT GAG CAG TAC TCC GAC TAC AGC TAC 3458Gly Asp Thr Thr Ser Thr Gly Ser Glu Gln Tyr Ser Asp Tyr Ser Tyr975 980 985CGC ACC AAC CCC CCC AAA TAC CCC AGC AAG CAG TTA CCT CAC CGC CGC 3506Arg Thr Asn Pro Pro Lys Tyr Pro Ser Lys Gln Leu Pro His Arg Arg990 995 1000GTC ACC TTC TCG GCC ACC AGC CAG GCC CAG GAG CTG CAG GAC CCA TCC 3554Val Thr Phe Ser Ala Thr Ser Gln Ala Gln Glu Leu Gln Asp Pro Ser1005101010151020CAG CAC AGT TAC TAT GAC AGT GGC CTG GAG GAG TCT GAG ACG CCG TCC 3602Gln His Ser Tyr Tyr Asp Ser Gly Leu Glu Glu Ser Glu Thr Pro Ser102510301035AGC AAG TCA TCC TCA GGG CCT CGA CTC GGT CCC CTG GCC CTG CCT GAG 3650Ser Lys Ser Ser Ser Gly Pro Arg Leu Gly Pro Leu Ala Leu Pro Glu104010451050GAT CAC TAT GAG CGC ACC ACC CCT GAT GGC AGC ATA GGA GAG ATG GAG 3698Asp His Tyr Glu Arg Thr Thr Pro Asp Gly Ser Ile Gly Glu Met Glu105510601065CAC CCC GAG AAT GAC CTT CGC CCT TTG CCT GAT GTC GCC ATG ACA GGC 3746His Pro Glu Asn Asp Leu Arg Pro Leu Pro Asp Val Ala Met Thr Gly107010751080ACA TGT ACC CGG GAG TGC AGT GAG TTT GGC CAC TCT GAC ACA TGC TGG 3794Thr Cys Thr Arg Glu Cys Ser Glu Phe Gly His Ser Asp Thr Cys Trp1085109010951100ATG CCT GGC CAG TCA TCT CCC AGC CGC CGG ACC AAG AGC AGC GCC CTC 3842Mer Pro Gly Gln Ser Ser Pro Ser Arg Arg Thr Lys Ser Ser Ala Leu110511101115AAA CTC TCC ACC TTC ATG CCT TAC CAG GAC CGA GGA GGG CAG GAG CCT 3890Lys Leu Ser Thr Phe Met Pro Tyr Gln Asp Arg Gly Gly Gln Glu Pro112011251130GCG GGC GCC GGC AGC CCC AGC CCC CCG GAA GAC CGG AAC ACC AAA ACG 3938Ala Gly Ala Gly Ser Pro Ser Pro Pro Glu Asp Arg Asn Thr Lys Thr113511401145GCC CCC GTG CGC CTC CTG CCC TCC TAC AGT GCC TTC TCC CAC AGT AGC 3986Ala Pro Val Arg Leu Leu Pro Ser Tyr Ser Ala Phe Ser His Ser Ser115011551160CAT GAT TCC TGC AAG GAC TCG GCC ACC TTG GAG GAA ATC CCC CTG ACC 4034His Asp Ser Cys Lys Asp Ser Ala Thr Leu Glu Glu Ile Pro Leu Thr1165117011751180CAG ACC TCG GAC TTC CCA CCC GCA GCC ACA CCG GCA TCT GCC CAG ACG 4082Gln Thr Ser Asp Phe Pro Pro Ala Ala Thr Pro Ala Ser Ala Gln Thr118511901195GCC AAG CGC GAG ATC TAC CTG TGAGCCCCCT ACTGGCCGGC CCCCCTCCCC 4133AlaLys Arg Glu Ile Tyr Leu1200CAGCGCCGGC CAGCTCCCAA ATGCCCATTC CAGGGCCTCA CTCTCCACCC CTTCAGCGTG 4193GACTTCCTGC CAGGGCCCAA GTGGGGGTAT CACTGACCTC ATGACCACGC TGGCCCTTCT 4253CCCATGCAGG GTCCAGGTCC TCTCCCCTCA TTTCCATCTC CCAGCCCAGG GGCCCCTTCC 4313CCTTTATGGG GCTTCCCCCA GCTGATGCCC AAGAGGGCTC CTCTGCAATG ACTGGGCTCC 4373TTCCCTTGAC TTCCAGGGAG CACCCCCTCG ATTTGGGCAG ATGGTGGAGT CAAGGGTGGG 4433CAGCGTACTT CTAACTCATT GTTTCCCTCA TGGCCGACCA GGGCGGGGAT AGCATGCCCA 4493ATTTTAGCCC TGAAGCAGGG CTGAACTGGG GAGCCCCTTT CCCTGGGAGC TCCCAGAGGA 4553AACTCTTGAC CACCAGTGGC TCCCTGAAGG GCTTTTGTTA CCAAAGGTGG GGTAGGGACG 4613GGGGTGGGAG TGGAGCGGAG GCCTTGTTTT CCCGTGG 4650(2)SEQ ID NO103的信息(i)序列特征(A)长度1203氨基酸
(B)类型氨基酸(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO103Met Glu Pro Leu Arg His Ser Pro Gly Pro Gly Gly Gln Arg Leu Leu1 5 10 15Leu Pro Ser Met Leu Leu Ala Leu Leu Leu Leu Leu Ala Pro Ser Pro20 25 30Gly His Ala Thr Arg Val Val Tyr Lys Val Pro Glu Glu Gln Pro Pro35 40 45Asn Thr Leu Ile Gly Ser Leu Ala Ala Asp Tyr Gly Phe Pro Asp Val50 55 60Gly His Leu Tyr Lys Leu Glu Val Gly Ala Pro Tyr Leu Arg Val Asp65 70 75 80Gly Lys Thr Gly Asp Ile Phe Thr Thr Glu Thr Ser Ile Asp Arg Glu85 90 95Gly Leu Arg Glu Cys Gln Asn Gln Leu Pro Gly Asp Pro Cys Ile Leu100 105 110Glu Phe Glu Val Ser Ile Thr Asp Leu Val Gln Asn Ala Ser Pro Arg115 120 125Leu Leu Glu Gly Gln Ile Glu Val Gln Asp Ile Asn Asp Asn Thr Pro130 135 140Asn Phe Ala Ser Pro Val Ile Thr Leu Ala Ile Pro Glu Asn Thr Asn145 150 155 160Ile Gly Ser Leu Phe Pro Ile Pro Leu Ala Ser Asp Arg Asp Ala Gly165 170 175Pro Asn Gly Val Ala Ser Tyr Glu Leu Gln Val Ala Glu Asp Gln Glu180 185 190Glu Lys Gln Pro Gln Leu Ile Val Met Gly Asn Leu Asp Arg Glu Arg195 200 205Trp Asp Ser Tyr Asp Leu Thr Ile Lys Val Gln Asp Gly Gly Ser Pro210 215220Pro Arg Ala Thr Ser Ala Leu Leu Arg Val Thr Val Leu Asp Thr Asn225 230 235 240Asp Asn Ala Pro Lys Phe Glu Arg Pro Ser Tyr Glu Ala Glu Leu Ser245 250 255Glu Asn Ser Pro Ile Gly Mis Ser Val Ile Gln Val Lys Ala Asn Asp260 265 270Ser Asp Gln Gly Ala Asn Ala Glu Ile Glu Tyr Thr Phe His Gln Ala275 280 285
Pro Glu Val Val Arg Arg Leu Leu Arg Leu Asp Arg Asn Thr Gly Leu290 295 300Ile Thr Val Gln Gly Pro Val Asp Arg Glu Asp Leu Ser Thr Leu Arg305 310 315 320Phe Ser Val Leu Ala Lys Asp Arg Gly Thr Asn Pro Lys Ser Ala Arg325 330 335Ala Gln Val Val Val Thr Val Lys Asp Met Asn Asp Asn Ala Pro Thr340 345 350Ile Glu Ile Arg Gly Ile Gly Leu Val Thr His Gln Asp Gly Met Ala355 360 365Aen Ile Ser Glu Asp Val Ala Glu Glu Thr Ala Val Ala Leu Val Gln370 375 380Val Ser Asp Arg Asp Glu Gly Glu Asn Ala Ala Val Thr Cys Val Val385 390 395 400Ala Gly Asp Val Pro Phe Gln Leu Arg Gln Ala Ser Glu Thr Gly Ser405 410 415Asp Ser Lys Lys Lys Tyr Phe Leu Gln Thr Thr Thr Pro Leu Asp Tyr420 425 430Glu Lys Val Lys Asp Tyr Thr Ile Glu Ile Val Ala Val Asp Ser Gly435 440 445Asn Pro Pro Leu Ser Ser Thr Asn Ser Leu Lys Val Gln Val Val Asp450 455 460Val Asn Asp Asn Ala Pro Val Phe Thr Gln Ser Val Thr Glu Val Ala465 470 475 480Phe Pro Glu Asn Asn Lys Pro Gly Glu Val Ile Ala Glu Ile Thr Ala485 490 495Ser Asp Ala Asp Ser Gly Ser Asn Ala Glu Leu Val Tyr Ser Leu Glu500 505 510Pro Glu Pro Ala Ala Lys Gly Leu Phe Thr Ile Ser Pro Glu Thr Gly515 520 525Glu Ile Gln Val Lys Thr Ser Leu Asp Arg Glu Gln Arg Glu Ser Tyr530 535 540Glu Leu Lys Val Val Ala Ala Asp Arg Gly Ser Pro Ser Leu Gln Gly545 550 555 560Thr Ala Thr Val Leu Val Asn val Leu Asp Cys Asn Asp Asn Asp Pro565 570 575Lys Phe Met Leu Ser Gly Tyr Asn Phe Ser Val Met Glu Asn Met Pro580 585 590Ala Leu Ser Pro Val Gly Met Val Thr Val Ile Asp Gly Asp Lys Gly595 600 605Glu Asn Ala Gln Val Gln Leu Ser Val Glu Gln Asp Asn Gly Asp Phe610 615 620
Val Ile Gln Asn Gly Thr Gly Thr Ile Leu Ser Ser Leu Ser Phe Asp625 630 635 640Arg Glu Gln Gln Ser Thr Tyr Thr Phe Gln Leu Lys Ala Val Asp Gly645 650 655Gly Val Pro Pro Arg Ser Ala Tyr Val Gly Val Thr Ile Asn Val Leu660 665 670Asp Glu Asn Asp Asn Ala Pro Tyr Ile Thr Ala Pro Ser Asn Thr Ser675 680 685His Lys Leu Leu Thr Pro Gln Thr Arg Leu Gly Glu Thr Val Ser Gln690 695 700Val Ala Ala Glu Asp Phe Asp Ser Gly Val Asn Ala Glu Leu Ile Tyr705 710 715 720Ser Ile Ala Gly Gly Asn Pro Tyr Gly Leu Phe Gln Ile Gly Ser His725 730 735Ser Gly Ala Ile Thr Leu Glu Lys Glu Ile Glu Arg Arg His His Gly740 745 750Leu His Arg Leu Val Val Lys Val Ser Asp Arg Gly Lys Pro Pro Arg755 760 765Tyr Gly Thr Ala Leu Val His Leu Tyr Val Asn Glu Thr Leu Ala Asn770 775 780Arg Thr Leu Leu Glu Thr Leu Leu Gly His Ser Leu Asp Thr Pro Leu785 790 795 800Asp Ile Asp Ile Ala Gly Asp Pro Glu Tyr Glu Arg Ser Lye Gln Arg805 810 815Gly Asn Ile Leu Phe Gly Val Val Ala Gly Val Val Ala Val Ala Leu820 825 830Leu Ile Ala Leu Ala Val Leu Val Arg Tyr Cys Arg Gln Arg Glu Ala835 840 845Lys Ser Gly Tyr Gln Ala Gly Lys Lys Glu Thr Lys Asp Leu Tyr Ala850 855 860Pro Lys Pro Ser Gly Lys Ala Ser Lys Gly Asn Lys Ser Lys Gly Lys865 870 875 880Lys Ser Lys Ser Pro Lys Pro Val Lys Pro Val Glu Asp Glu Asp Glu885 890 895Ala Gly Leu Gln Lys Ser Leu Lys Phe Asn Leu Met Ser Asp Ala Pro900 905 910Gly Asp Ser Pro Arg Ile His Leu Pro Leu Asn Tyr Pro Pro Gly Ser915 920 925Pro Asp Leu Gly Arg His Tyr Arg Ser Asn Ser Pro Leu Pro Ser Ile930 935 940Gln Leu Gln Pro Gln Ser Pro Ser Ala Ser Lys Lys His Gln Val Val945 950 955 960
Gln Asp Leu Pro Pro Ala Asn Thr Phe Val Gly Thr Gly Asp Thr Thr965 970 975Ser Thr Gly Ser Glu Gln Tyr Ser Asp Tyr Ser Tyr Arg Thr Ash Pro980 985 990Pro Lys Tyr Pro Ser Lys Gln Leu Pro His Arg Arg Val Thr Phe Ser995 10001005Ala Thr Ser Gln Ala Gln Glu Leu Gln Asp Pro Ser Gln His Ser Tyr101010151020Tyr Asp Ser Gly Leu Glu Glu Ser Glu Thr Pro Ser Ser Lys Ser Ser1025103010351040Ser Gly Pro Arg Leu Gly Pro Leu Ala Leu Pro Glu Asp His Tyr Glu104510501055Arg Thr Thr Pro Asp Gly Ser Ile Gly Glu Met Glu His Pro Glu Asn106010651070Asp Leu Arg Pro Leu Pro Asp Val Ala Met Thr Gly Thr Cys Thr Arg107510801085Glu Cys Ser Glu Phe Gly His Ser Asp Thr Cys Trp Met Pro Gly Gln109010951100Ser Ser Pro Ser Arg Arg Thr Lys Ser Ser Ala Leu Lys Leu Ser Thr1105111011151120Phe Met Pro Tyr Gln Asp Arg Gly Gly Gln Glu Pro Ala Gly Ala Gly112511301135Ser Pro Ser Pro Pro Glu Asp Arg Asn Thr Lys Thr Ala Pro Val Arg114011451150Leu Leu Pro Ser Tyr Ser Ala Phe Ser His Ser Ser His Aep Ser Cys115511601165Lys Asp Ser Ala Thr Leu Glu Glu Ile Pro Leu Thr Gln Thr Ser Asp117011751180Phe Pro Pro Ala Ala Thr Pro Ala Ser Ala Gln Thr Ala Lys Arg Glu1185119011951200Ile Tyr Leu(2)SEQ ID NO104的信息(i)序列特征(A)长度2789碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性
(ii)分子类型cDNA(ix)特征(A)名称/键CDS(B)定位115..2622(xi)序列描述SEQ ID NO104CGAAAGCCAT GTCGGACTCG TCGCCCAGCG CCCAAGCGCT AACCCGCTGA AAGTTTCTCA 60GCGAAATCTC AGGGACGATC TGGACCCCGC TGAGAGGAAC TGCTTTTGAG TGAG ATG 117Met1GTC CCA GAG GCC TGG AGG AGC GGA CTG GTA AGC ACC GGG AGG GTA GTG 165Val Pro Glu Ala Trp Arg Ser Gly Leu Val Ser Thr Gly Arg Val Val5 10 15GGA GTT TTG CTT CTG CTT GGT GCC TTG AAC AAG GCT TCC ACG GTC ATT 213Gly Val Leu Leu Leu Leu Gly Ala Leu Asn Lys Ala Ser Thr Val Ile20 25 30CAC TAT GAG ATC CCG GAG GAA AGA GAG AAG GGT TTC GCT GTG GGC AAC 261His Tyr Glu Ile Pro Glu Glu Arg Glu Lys Gly Phe Ala Val Gly Asn35 40 45GTG GTC GCG AAC CTT GGT TTG GAT CTC GGT AGC CTC TCA GCC CGC AGG 309Val Val Ala Asn Leu Gly Leu Asp Leu Gly Ser Leu Ser Ala Arg Arg50 55 60 65TTC CCG GTG GTG TCT GGA GCT AGC CGA AGA TTC TTT GAG GTG AAC CGG 357Phe Pro Val Val Ser Gly Ala Ser Arg Arg Phe Phe Glu Val Asn Arg70 75 80GAG ACC GGA GAG ATG TTT GTG AAC GAC CGT CTG GAT CGA GAG GAG CTG 405Glu Thr Gly Glu Met Phe Val Asn Asp Arg Leu Asp Arg Glu Glu Leu85 90 95TGT GGG ACA CTG CCC TCT TGC ACT GTA ACT CTG GAG TTG GTA GTG GAG 453Cys Gly Thr Leu Pro Ser Cys Thr Val Thr Leu Glu Leu Val Val Glu100 105 110AAC CCG CTG GAG CTG TTC AGC GTG GAA GTG GTG ATC CAG GAC ATC AAC 501Asn Pro Leu Glu Leu Phe Ser Val Glu Val Val Ile Gln Asp Ile Asn115 120 125GAC AAC AAT CCT GCT TTC CCT ACC CAG GAA ATG AAA TTG GAG ATT AGC 549Asp Asn Asn Pro Ala Phe Pro Thr Gln Glu Met Lys Leu Glu Ile Ser130 135 140 145GAG GCC GTG GCT CCG GGG ACG CGC TTT CCG CTC GAG AGC GCG CAC GAT 597Glu Ala Val Ala Pro Gly Thr Arg Phe Pro Leu Glu Ser Ala His Aspl50 155 160CCC GAT CTG GGA AGC AAC TCT TTA CAA ACC TAT GAG CTG AGC CGA AAT 645Pro Asp Leu Gly Ser Asn Ser Leu Gln Thr Tyr Glu Leu Ser Arg Asn165 170 175GAA TAC TTT GCG CTT CGC GTG CAG ACG CGG GAG GAC AGC ACC AAG TAC 693Glu Tyr Phe Ala Leu Arg Val Gln Thr Arg Glu Asp Ser Thr Lys Tyr180 185 190GCG GAG CTG GTG TTG GAG CGC GCC CTG GAC CGA GAA CGG GAG CCT AGT 741Ala Glu Leu Val Leu Glu Arg Ala Leu Asp Arg Glu Arg Glu Pro Ser195 200 205CTC CAG TTA GTG CTG ACG GCG TTG GAC GGA GGG ACC CCA GCT CTC TCC 789Leu Gln Leu Val Leu Thr Ala Leu Asp Gly Gly Thr Pro Ala Leu Ser210 215 220 225GCC AGC CTG CCT ATT CAC ATC AAG GTG CTG GAC GCG AAT GAC AAT GCG 837Ala Ser Leu Pro Ile His Ile Lys Val Leu Asp Ala Asn Asp Asn Ala230 235 240CCT GTC TTC AAC CAG TCC TTG TAC CGG GCG CGC GTT CCT GGA GGA TGC 885Pro Val Phe Asn Gln Ser Leu Tyr Arg Ala Arg Val Pro Gly Gly Cys245 250 255ACC TCC GGC ACG CGC GTG GTA CAA GTC CTT GCA ACG GAT CTG GAT GAA 933Thr Ser Gly Thr Arg Val Val Gln Val Leu Ala Thr Asp Leu Asp Glu260 265 270GGC CCC AAC GGT GAA ATT ATT TAC TCC TTC GGC AGC CAC AAC CGC GCC 981Gly Pro Asn Gly Glu Ile Ile Tyr Ser Phe Gly Ser His Asn Arg Ala275 280 285GGC GTG CGG CAA CTA TTC GCC TTA GAC CTT GTA ACC GGG ATG CTG ACA1029Gly Val Arg Gln Leu Phe Ala Leu Asp Leu Val Thr Gly Met Leu Thr290 295 300 305ATC AAG GGT CGG CTG GAC TTC GAG GAC ACC AAA CTC CAT GAG ATT TAC1077Ile Lys Gly Arg Leu Asp Phe Glu Asp Thr Lys Leu His Glu Ile Tyr310 315 320ATC CAG GCC AAA GAC AAG GGC GCC AAT CCC GAA GGA GCA CAT TGC AAA1125IleGln Ala Lys Asp Lys Gly Ala Asn Pro Glu Gly Ala His Cys Lys325 330 335GTG TTG GTG GAG GTT GTG GAT GTG AAT GAC AAC GCC CCG GAG ATC ACA1173Val Leu Val Glu Val Val Asp Val Asn Asp Asn Ala Pro Glu Ile Thr340 345 350GTC ACC TCC GTG TAC AGC CCA GTA CCC GAG GAT GCC TCT GGG ACT GTC1221Val Thr Ser Val Tyr Ser Pro Val Pro Glu Asp Ala Ser Gly Thr Val355 360 365ATC GCT TTG CTC AGT GTG ACT GAC CTG GAT GCT GGC GAG AAC GGG CTG1269Ile Ala Leu Leu Ser Val Thr Asp Leu Asp Ala Gly Glu Asn Gly Leu370 375 380 385GTG ACC TGC GAA GTT CCA CCG GGT CTC CCT TTC AGC CTT ACT TCT TCC1317Val Thr Cys Glu Val Pro Pro Gly Leu Pro Phe Ser Leu Thr Ser Ser390 395 400CTC AAG AAT TAC TTC ACT TTG AAA ACC AGT GCA GAC CTG GAT CGG GAG1365Leu Lys Asn Tyr Phe Thr Leu Lys Thr Ser Ala Asp Leu Asp Arg Glu405 410 415ACT GTG CCA GAA TAC AAC CTC AGC ATC ACC GCC CGA GAC GCC GGA ACC 1413Thr Val Pro Glu Tyr Asn Leu Ser Ile Thr Ala Arg Asp Ala Gly Thr420 425 430CCT TCC CTC TCA GCC CTT ACA ATA GTG CGT GTT CAA GTG TCC GAC ATC 1461Pro Ser Leu Ser Ala Leu Thr Ile Val Arg Val Gln Val Ser Asp Ile435 440 445AAT GAC AAC CCT CCA CAA TCT TCT CAA TCT TCC TAC GAC GTT TAC ATT 1509Asn Asp Asn Pro Pro Gln Ser Ser Gln Ser Ser Tyr Asp Val Tyr Ile450 455 460 465GAA GAA AAC AAC CTC CCC GGG GCT CCA ATA CTA AAC CTA AGT GTC TGG 1557Glu Glu Asn Asn Leu Pro Gly Ala Pro Ile Leu Asn Leu Ser Val Trp470 475 480GAC CCC GAC GCC CCG CAG AAT GCT CGG CTT TCT TTC TTT CTC TTG GAG 1605Asp Pro Asp Ala Pro Gln Asn Ala Arg Leu Ser Phe Phe Leu Leu Glu485 490 495CAA GGA GCT GAA ACC GGG CTA GTG GGT CGC TAT TTC ACA ATA AAT CGT 1653Gln Gly Ala Glu Thr Gly Leu Val Gly Arg Tyr Phe Thr Ile Asn Arg500 505 510GAC AAT GGC ATA GTG TCA TCC TTA GTG CCC CTA GAC TAT GAG GAT CGG 1701Asp Asn Gly Ile Val Ser Ser Leu Val Pro Leu Asp Tyr Glu Asp Arg515 520 525CGG GAA TTT GAA TTA ACA GCT CAT ATC AGC GAT GGG GGC ACC CCG GTC 1749Arg Glu Phe Glu Leu Thr Ala His Ile Ser Asp Gly Gly Thr Pro Val530 535 540 545CTA GCC ACC AAC ATC AGC GTG AAC ATA TTT GTC ACT GAT CGC AAT GAC 1797Leu Ala Thr Asn Ile Ser Val Asn Ile Phe Val Thr Asp Arg Asn Asp550 555 560AAT GCC CCC CAG GTC CTA TAT CCT CGG CCA GGT GGG AGC TCG GTG GAG 1845Asn Ala Pro Gln Val Leu Tyr Pro Arg Pro Gly Gly Ser Ser Val Glu565 S70 575ATG CTG CCT CGA GGT ACC TCA GCT GGC CAC CTA GTG TCA CGG GTG GTA 1893Met Leu Pro Arg Gly Thr Ser Ala Gly His Leu Val Ser Arg Val Val580 585 590GGC TGG GAC GCG GAT GCA GGG CAC AAT GCC TGG CTC TCC TAC AGT CTC 1941Gly Trp Asp Ala Asp Ala Gly His Asn Ala Trp Leu Ser Tyr Ser Leu595 600 605TTT GGA TCC CCT AAC CAG AGC CTT TTT GCC ATA GGG CTG CAC ACT GGT 1989Phe Gly Ser Pro Asn Gln Ser Leu Phe Ala Ile Gly Leu His Thr Gly610 615 620 625CAA ATC AGT ACT GCC CGT CCA GTC CAA GAC ACA GAT TCA CCC AGG CAG 2037Gln Ile Ser Thr Ala Arg Pro Val Gln Asp Thr Asp Ser Pro Arg Gln630 635 640ACT CTC ACT GTC TTG ATC AAA GAC AAT GGG GAG CCT TCG CTC TCC ACC 2085Thr Leu Thr Val Leu Ile Lye Asp Asn Gly Glu Pro Ser Leu Ser Thr645 650 655ACT GCT ACC CTC ACT GTG TCA GTA ACC GAG GAC TCT CCT GAA GCC CGA2133Thr Ala Thr Leu Thr Val Ser Val Thr Glu Asp Ser Pro Glu Ala Arg660 665 670GCC GAG TTC CCC TCT GGC TCT GCC CCC CGG GAG CAG AAA AAA AAT CTC2181Ala Glu Phe Pro Ser Gly Ser Ala Pro Arg Glu Gln Lye Lye Asn Leu675 680 685ACC TTT TAT CTA CTT CTT TCT CTA ATC CTG GTT TCT GTG GGC TTC GTG2229Thr Phe Tyr Leu Leu Leu Ser Leu Ile Leu Val Ser Val Gly Phe Val690 695 700 705GTC ACA GTG TTC GGA GTA ATC ATA TTC AAA GTT TAC AAG TGG AAG CAG2277Val Thr Val Phe Gly Val Ile Ile Phe Lys Val Tyr Lys Trp Lys Gln710 715 720TCT AGA GAC CTA TAC CGA GCC CCG GTG AGC TCA CTG TAC CGA ACA CCA2325Ser Arg Asp Leu Tyr Arg Ala Pro Val Ser Ser Leu Tyr Arg Thr Pro725 730 735GGG CCC TCC TTG CAC GCG GAC GCC GTG CGG GGA GGC CTG ATG TCG CCG2373Gly Pro Ser Leu His Ala Asp Ala Val Arg Gly Gly Leu Met Ser Pro740 745 750CAC CTT TAC CAT CAG GTG TAT CTC ACC ACG GAC TCC CGC CGC AGC GAC2421His Leu Tyr His Gln Val Tyr Leu Thr Thr Asp Ser Arg Arg Ser Asp755 760 765CCG CTG CTG AAG AAA CCT GGT GCA GCC AGT CCA CTG GCC AGC CGC CAG2469Pro Leu Leu Lys Lys Pro Gly Ala Ala Ser Pro Leu Ala Ser Arg Gln770 775 780 785AAC ACG CTG CGG AGC TGT GAT CCG GTG TTC TAT AGG CAG GTG TTG GGT2517Asn Thr Leu Arg Ser Cys Asp Pro Val Phe Tyr Arg Gln Val Leu Gly790 795 800GCA GAG AGC GCC CCT CCC GGA CAG GTA AGG TTT AGC AAG TCA TGC TTG2565Ala Glu Ser Ala Pro Pro Gly Gln Val Arg Phe Ser Lys Ser Cys Leu805 810 815ACC CTG TTA GTG CCT TTT TAT TCC TAC ATC ATA TTG AGA AGG CTG GAG2613Thr Leu Leu Val Pro Phe Tyr Ser Tyr Ile Ile Leu Arg Arg Leu Glu820 825 830CTG TTT TTT TAGTGATGAA GATGTTTTCC TGGTGATGCA TTCACACTTT2662Leu Phe Phe835CAACTGGCTC TTCCTAGATC AAAGTTAGTG CCTTTGTGAG ATGGTGGCCT GCCAGAGTGT 2722GGTTTGTGGT CCCATTTCAG GGGGAAGATA CTTGACTCAT CTGTGGACCT AATTCACATC 2782CTCAGCG2789(2)SEQ ID NO105的信息(i)序列特征(A)长度836氨基酸
(B)类型氨基酸(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO105Met Val Pro Glu Ala Trp Arg Ser Gly Leu Val Ser Thr Gly Arg Val1 5 10 15Val Gly Val Leu Leu Leu Leu Gly Ala Leu Asn Lys Ala Ser Thr Val20 25 30Ile His Tyr Glu Ile Pro Glu Glu Arg Glu Lys Gly Phe Ala Val Gly35 40 45Asn Val Val Ala Asn Leu Gly Leu Asp Leu Gly Ser Leu Ser Ala Arg50 55 60Arg Phe Pro Val Val Ser Gly Ala Ser Arg Arg Phe Phe Glu Val Asn65 70 75 80Arg Glu Thr Gly Glu Met Phe Val Asn Asp Arg Leu Asp Arg Glu Glu85 90 95Leu Cys Gly Thr Leu Pro Ser Cys Thr Val Thr Leu Glu Leu Val Val100 105 110Glu Asn Pro Leu Glu Leu Phe Ser Val Glu Val Val Ile Gln Asp Ile115 120 125Asn Asp Asn Asn Pro Ala Phe Pro Thr Gln Glu Met Lys Leu Glu Ile130 135 140Ser Glu Ala Val Ala Pro Gly Thr Arg Phe Pro Leu Glu Ser Ala His145 150 155 160Asp Pro Asp Leu Gly Ser Asn Ser Leu Gln Thr Tyr Glu Leu Ser Arg165 170 175Asn Glu Tyr Phe Ala Leu Arg Val Gln Thr Arg Glu Asp Ser Thr Lys180 185 190Tyr Ala Glu Leu Val Leu Glu Arg Ala Leu Asp Arg Glu Arg Glu Pro195 200 205Ser Leu Gln Leu Val Leu Thr Ala Leu Asp Gly Gly Thr Pro Ala Leu210 215 220Ser Ala Ser Leu Pro Ile His Ile Lys Val Leu Asp Ala Asn Asp Asn225 230 235 240Ala Pro Val Phe Asn Gln Ser Leu Tyr Arg Ala Arg Val Pro Gly Gly245 250 255Cys Thr Ser Gly Thr Arg Val Val Gln Val Leu Ala Thr Asp Leu Asp260 265 270Glu Gly Pro Asn Gly Glu Ile Ile Tyr Ser Phe Gly Ser His Asn Arg275 280 285
Ala Gly Val Arg Gln Leu Phe Ala Leu Asp Leu Val Thr Gly Met Leu290 295 300Thr Ile Lys Gly Arg Leu Asp Phe Glu Asp Thr Lys Leu His Glu Ile305 310 315 320Tyr Ile Gln Ala Lye Asp Lys Gly Ala Asn Pro Glu Gly Ala His Cys325 330 335Lys Val Leu Val Glu Val Val Asp Val Asn Asp Asn Ala Pro Glu Ile340 345 350Thr Val Thr Ser Val Tyr Ser Pro Val Pro Glu Asp Ala Ser Gly Thr355 360 365Val Ile Ala Leu Leu Ser Val Thr Asp Leu Asp Ala Gly Glu Asn Gly370 375 380Leu Val Thr Cys Glu Val Pro Pro Gly Leu Pro Phe Ser Leu Thr Ser385 390 395 400Ser Leu Lys Asn Tyr Phe Thr Leu Lys Thr Ser Ala Asp Leu Asp Arg405 410 415Glu Thr Val Pro Glu Tyr Asn Leu Ser Ile Thr Ala Arg Asp Ala Gly420 425 430Thr Pro Ser Leu Ser Ala Leu Thr Ile Val Arg Val Gln Val Ser Asp435 440 445Ile Asn Asp Asn Pro Pro Gln Ser Ser Gln Ser Ser Tyr Asp Val Tyr450 455 460Ile Glu Glu Asn Asn Leu Pro Gly Ala Pro Ile Leu Asn Leu Ser Val465 470 475 480Trp Asp Pro Asp Ala Pro Gln Asn Ala Arg Leu Ser Phe Phe Leu Leu485 490 495Glu Gln Gly Ala Glu Thr Gly Leu Val Gly Arg Tyr Phe Thr Ile Asn500 505 510Arg Asp Asn Gly Ile Val Ser Ser Leu Val Pro Leu Asp Tyr Glu Asp515 520 525Arg Arg Glu Phe Glu Leu Thr Ala His Ile Ser Asp Gly Gly Thr Pro530 535 540Val Leu Ala Thr Asn Ile Ser Val Asn Ile Phe Val Thr Asp Arg Asn545 550 555 560Asp Asn Ala Pro Gln Val Leu Tyr Pro Arg Pro Gly Gly Ser Ser Val565 570 575Glu Met Leu Pro Arg Gly Thr Ser Ala Gly His Leu Val Ser Arg Val580 585 590Val Gly Trp Asp Ala Asp Ala Gly His Asn Ala Trp Leu Ser Tyr Ser595 600 605Leu Phe Gly Ser Pro Asn Gln Ser Leu Phe Ala Ile Gly Leu His Thr610 615 620
Gly Gln Ile Ser Thr Ala Arg Pro Val Gln Asp Thr Asp Ser Pro Arg625 630 635 640Gln Thr Leu Thr Val Leu Ile Lys Asp Asn Gly Glu Pro Ser Leu Ser645 650 655Thr Thr Ala Thr Leu Thr Val Ser Val Thr Glu Asp Ser Pro Glu Ala660 665 670Arg Ala Glu Phe Pro Ser Gly Ser Ala Pro Arg Glu Gln Lys Lys Asn675 680 685Leu Thr Phe Tyr Leu Leu Leu Ser Leu Ile Leu Val Ser Val Gly Phe690 695 700Val Val Thr Val Phe Gly Val Ile Ile Phe Lys Val Tyr Lys Trp Lys705 710 715 720Gln Ser Arg Asp Leu Tyr Arg Ala Pro Val Ser Ser Leu Tyr Arg Thr725 730 735Pro Gly Pro Ser Leu His Ala Asp Ala Val Arg Gly Gly Leu Met Ser740 745 750Pro His Leu Tyr His Gln Val Tyr Leu Thr Thr Asp Ser Arg Arg Ser755 760 765Asp Pro Leu Leu Lys Lys Pro Gly Ala Ala Ser Pro Leu Ala Ser Arg770 775 780Gln Asn Thr Leu Arg Ser Cys Asp Pro Val Phe Tyr Arg Gln Val Leu785 790 795 800Gly Ala Glu Ser Ala Pro Pro Gly Gln Val Arg Phe Ser Lys Ser Cys805 810 815Leu Thr Leu Leu Val Pro Phe Tyr Ser Tyr Ile Ile Leu Arg Arg Leu820 825 830Glu Leu Phe Phe835(2)SEQ ID NO106的信息(i)序列特征(A)长度2751碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(ix)特征(A)名称/键CDS
(B)定位115..2160(xi)序列描述SEQ ID NO106CGAAAGCCAT GTCGGACTCG TCGCCCAGCG CCCAAGCGCT AACCCGCTGA AAGTTTCTCA 60GCGAAATCTC AGGGACGATC TGGACCCCGC TGAGAGGAAC TGCTTTTGAG TGAG ATG 117Met1GTC CCA GAG GCC TGG AGG AGC GGA CTG GTA AGC ACC GGG AGG GTA GTG 165Val Pro Glu Ala Trp Arg Ser Gly Leu Val Ser Thr Gly Arg Val Val5 10 15GGA GTT TTG CTT CTG CTT GGT GCC TTG AAC AAG GCT TCC ACG GTC ATT 213Gly Val Leu Leu Leu Leu Gly Ala Leu Asn Lys Ala Ser Thr Val Ile20 25 30CAC TAT GAG ATC CCG GAG GAA AGA GAG AAG GGT TTC GCT GTG GGC AAC 261His Tyr Glu Ile Pro Glu Glu Arg Glu Lys Gly Phe Ala Val Gly Asn35 40 45GTG GTC GCG AAC CTT GGT TTG GAT CTC GGT AGC CTC TCA GCC CGC AGG 309Val Val Ala Ash Leu Gly Leu Asp Leu Gly Ser Leu Ser Ala Arg Arg50 55 60 65TTC CCG GTG GTG TCT GGA GCT AGC CGA AGA TTC TTT GAG GTG AAC CGG 357Phe Pro Val Val Ser Gly Ala Ser Arg Arg Phe Phe Glu Val Asn Arg70 75 80GAG ACC GGA GAG ATG TTT GTG AAC GAC CGT CTG GAT CGA GAG GAG CTG 405Glu Thr Gly Glu Met Phe Val Asn Asp Arg Leu Asp Arg Glu Glu Leu85 90 95TGT GGG ACA CTG CCC TCT TGC ACT GTA ACT CTG GAG TTG GTA GTG GAG 453Cys Gly Thr Leu Pro Ser Cys Thr Val Thr Leu Glu Leu Val Val Glu100 105 110AAC CCG CTG GAG CTG TTC AGC GTG GAA GTG GTG ATC CAG GAC ATC AAC 501Asn Pro Leu Glu Leu Phe Ser Val Glu Val Val Ile Gln Asp Ile Asn115 120 125GAC AAC AAT CCT GCT TTC CCT ACC CAG GAA ATG AAA TTG GAG ATT AGC 549Asp Asn Asn Pro Ala Phe Pro Thr Gln Glu Met Lys Leu Glu Ile Ser130 135 140 145GAG GCC GTG GCT CCG GGG ACG CGC TTT CCG CTC GAG AGC GCG CAC GAT 597Glu Ala Val Ala Pro Gly Thr Arg Phe Pro Leu Glu Ser Ala His Asp150 155 160CCC GAT CTG GGA AGC AAC TCT TTA CAA ACC TAT GAG CTG AGC CGA AAT 645Pro Asp Leu Gly Ser Asn Ser Leu Gln Thr Tyr Glu Leu Ser Arg Asn165 170 175GAA TAC TTT GCG CTT CGC GTG CAG ACG CGG GAG GAC AGC ACC AAG TAC 693Glu Tyr Phe Ala Leu Arg Val Gln Thr Arg Glu Asp Ser Thr Lys Tyr180 185 190GCG GAG CTG GTG TTG GAG CGC GCC CTG GAC CGA GAA CGG GAG CCT AGT 741Ala Glu Leu Val Leu Glu Arg Ala Leu Asp Arg Glu Arg Glu Pro Ser195 200 205CTC CAG TTA GTG CTG ACG GCG TTG GAC GGA GGG ACC CCA GCT CTC TCC 789Leu Gln Leu Val Leu Thr Ala Leu Asp Gly Gly Thr Pro Ala Leu Ser210 215 220 225GCC AGC CTG CCT ATT CAC ATC AAG GTG CTG GAC GCG AAT GAC AAT GCG 837Ala Ser Leu Pro Ile His Ile Lys Val Leu Asp Ala Asn Asp Asn Ala230 235 240CCT GTC TTC AAC CAG TCC TTG TAC CGG GCG CGC GTT CCT GGA GGA TGC 885Pro Val Phe Asn Gln Ser Leu Tyr Arg Ala Arg Val Pro Gly Gly Cys245 250 255ACC TCC GGC ACG CGC GTG GTA CAA GTC CTT GCA ACG GAT CTG GAT GAA 933Thr Ser Gly Thr Arg Val Val Gln Val Leu Ala Thr Asp Leu Asp Glu260 265 270GGC CCC AAC GGT GAA ATT ATT TAC TCC TTC GGC AGC CAC AAC CGC GCC 981Gly Pro Asn Gly Glu Ile Ile Tyr Ser Phe Gly Ser His Asn Arg Ala275 280 285GGC GTG CGG CAA CTA TTC GCC TTA GAC CTT GTA ACC GGG ATG CTG ACA1029Gly Val Arg Gln Leu Phe Ala Leu Asp Leu Val Thr Gly Met Leu Thr290 295 300 305ATC AAG GGT CGG CTG GAC TTC GAG GAC ACC AAA CTC CAT GAG ATT TAC1077Ile Lys Gly Arg Leu Asp Phe Glu Asp Thr Lys Leu His Glu Ile Tyr310 315 320ATC CAG GCC AAA GAC AAG GGC GCC AAT CCC GAA GGA GCA CAT TGC AAA1125Ile Gln Ala Lys Asp Lys Gly Ala Asn Pro Glu Gly Ala His Cys Lys325 330 335GTG TTG GTG GAG GTT GTG GAT GTG AAT GAC AAC GCC CCG GAG ATC ACA1173Val Leu Val Glu Val Val Asp Val Asn Asp Asn Ala Pro Glu Ile Thr340 345 350GTC ACC TCC GTG TAC AGC CCA GTA CCC GAG GAT GCC TCT GGG ACT GTC1221Val Thr Ser Val Tyr Ser Pro Val Pro Glu Asp Ala Ser Gly Thr Val355 360 365ATC GCT TTG CTC AGT GTG ACT GAC CTG GAT GCT GGC GAG AAC GGG CTG1269Ile Ala Leu Leu Ser Val Thr Asp Leu Asp Ala Gly Glu Asn Gly Leu370 375 380 385GTG ACC TGC GAA GTT CCA CCG GGT CTC CCT TTC AGC CTT ACT TCT TCC1317Val Thr Cys Glu Val Pro Pro Gly Leu Pro Phe Ser Leu Thr Ser Ser390 395 400CTC AAG AAT TAC TTC ACT TTG AAA ACC AGT GCA GAC CTG GAT CGG GAG1365Leu Lys Asn Tyr Phe Thr Leu Lys Thr Ser Ala Asp Leu Asp Arg Glu405 410 415ACT GTG CCA GAA TAC AAC CTC AGC ATC ACC GCC CGA GAC GCC GGA ACC1413Thr Val Pro Glu Tyr Asn Leu Ser Ile Thr Ala Arg Asp Ala Gly Thr420 425 430CCT TCC CTC TCA GCC CTT ACA ATA GTG CGT GTT CAA GTG TCC GAC ATC1461Pro Ser Leu Ser Ala Leu Thr Ile Val Arg Val Gln Val Ser Asp Ile435 440 445AAT GAC AAC CCT CCA CAA TCT TCT CAA TCT TCC TAC GAC GTT TAC ATT 1509Asn Asp Asn Pro Pro Gln Ser Ser Gln Ser Ser Tyr Asp Val Tyr Ile450 455 460 465GAA GAA AAC AAC CTC CCC GGG GCT CCA ATA CTA AAC CTA AGT GTC TGG 1557Glu Glu Asn Asn Leu Pro Gly Ala Pro Ile Leu Asn Leu Ser Val Trp470 475 480GAC CCC GAC GCC CCG CAG AAT GCT CGG CTT TCT TTC TTT CTC TTG GAG 1605Asp Pro Asp Ala Pro Gln Asn Ala Arg Leu Ser Phe Phe Leu Leu Glu485 490 495CAA GGA GCT GAA ACC GGG CTA GTG GGT CGC TAT TTC ACA ATA AAT CGT 1653Gln Gly Ala Glu Thr Gly Leu Val Gly Arg Tyr Phe Thr Ile Asn Arg500 505 510GAC AAT GGC ATA GTG TCA TCC TTA GTG CCC CTA GAC TAT GAG GAT CGG 1701Asp Asn Gly Ile Val Ser Ser Leu Val Pro Leu Asp Tyr Glu Asp Arg515 520 525CGG GAA TTT GAA TTA ACA GCT CAT ATC AGC GAT GGG GGC ACC CCG GTC 1749Arg Glu Phe Glu Leu Thr Ala His Ile Ser Asp Gly Gly Thr Pro Val530 535 540 545CTA GCC ACC AAC ATC AGC GTG AAC ATA TTT GTC ACT GAT CGC AAT GAC 1797Leu Ala Thr Asn Ile Ser Val Asn Ile Phe Val Thr Asp Arg Asn Asp550 555 560AAT GCC CCC CAG GTC CTA TAT CCT CGG CCA GGT GGG AGC TCG GTG GAG 1845Asn Ala Pro Gln Val Leu Tyr Pro Arg Pro Gly Gly Ser Ser Val Glu565 570 575ATG CTG CCT CGA GGT ACC TCA GCT GGC CAC CTA GTG TCA CGG GTG GTA 1893Met Leu Pro Arg Gly Thr Ser Ala Gly His Leu Val Ser Arg Val Val580 585 590GGC TGG GAC GCG GAT GCA GGG CAC AAT GCC TGG CTC TCC TAC AGT CTC 1941Gly Trp Asp Ala Asp Ala Gly His Ash Ala Trp Leu Ser Tyr ser Leu595 600 605TTT GGA TCC CCT AAC CAG AGC CTT TTT GCC ATA GGG CTG CAC ACT GGT 1989Phe Gly Ser Pro Asn Gln Ser Leu Phe Ala Ile Gly Leu His Thr Gly610 615 620 625CAA ATC AGT ACT GCC CGT CCA GTC CAA GAC ACA GAT TCA CCC AGG CAG 2037Gln Ile Ser Thr Ala Arg Pro Val Gln Asp Thr Asp Ser Pro Arg Gln630 635 640ACT CTC ACT GTC TTG ATC AAA GAC AAT GGG GAG CCT TCG CTC TCC ACC 2085Thr Leu Thr Val Leu Ile Lys Asp Asn Gly Glu Pro Ser Leu Ser Thr645 650 655ACT GCT ACC CTC ACT GTG TCA GTA ACC GAG GAC TCT CCT GAA GCC CGA 2133Thr Ala Thr Leu Thr Val Ser Val Thr Glu Asp Ser Pro Glu Ala Arg660 665 670GCC GAG TTC CCC TCT GGC TCT GCC AGT TAAACCTTCT TTAATTATGG 2180Ala Glu Phe Pro Ser Gly Ser Ala Ser675 680ATTAGCCATT AACATTTTTG AAACGTGGAC CATTTAACCT CGGCCTACCC CCTCCAACTG 2240TCCTGGTGAT GAGTTCATTA GCTAAGTTAA ATTAATTGAA CTTTGATCTA AACCAAAACA 2300AATCAGGAAA ATAAAGCTGT AAAGGAACTT ATCAAGCATT CCAAAACCAA CTAGAAATTA 2360CTTGAAGTTT CGAGTGAGCA TTGCCTGTGC CAGTATTCTT CATTATAGGA TTATAAACTC 2420GTTTTTTTCC CAAAGCGCAT GTCTACGCCA GGCAGAGGAG TAATTATTCA GCCAATTTCA 2480TGGATGTAAC GATGGATATA AATAATTGAT AGCACCTAGA GGCTTCCAGT TTGGGTGGAA 2540GGCTAAAAGT AGAGGGGAAC TCACTCACTT GAGAAATGAT ATTTAAGTGA ATAAATAGTT 2600CTCTTCTATG AAACTATTAC TATTTAGTTC TCTGGAAAAC TTAAGTGTAT TAATGATTAG 2660AACATCAAAT CCTAAGTAAA GAAATGACAT TTTAAATATA AAAAGCCAAA CTTTAAATAA 2720ATCATAGAGA CCTCAGACAT AATATAGGAA A 2751(2)SEQ ID NO107的信息(i)序列特征(A)长度682氨基酸(B)类型氨基酸(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO107Met Val Pro Glu Ala Trp Arg Ser Gly Leu Val Ser Thr Gly Arg Val1 5 10 15Val Gly Val Leu Leu Leu Leu Gly Ala Leu Asn Lys Ala Ser Thr Val20 25 30Ile His Tyr Glu Ile Pro Glu Glu Arg Glu Lys Gly Phe Ala Val Gly35 40 45Asn Val Val Ala Asn Leu Gly Leu Asp Leu Gly Ser Leu Ser Ala Arg50 55 60Arg Phe Pro Val Val Ser Gly Ala Ser Arg Arg Phe Phe Glu Val Asn65 70 75 80Arg Glu Thr Gly Glu Met Phe Val Asn Asp Arg Leu Asp Arg Glu Glu85 90 95Leu Cys Gly Thr Leu Pro Ser Cys Thr Val Thr Leu Glu Leu Val Val100 105 110Glu Asn Pro Leu Glu Leu Phe Ser Val Glu Val Val Ile Gln Asp Ile115 120 125Asn Asp Asn Asn Pro Ala Phe Pro Thr Gln Glu Met Lys Leu Glu Ile130 135 140Ser Glu Ala Val Ala Pro Gly Thr Arg Phe Pro Leu Glu Ser Ala His145 150 155 160Asp Pro Asp Leu Gly Ser Asn Ser Leu Gln Thr Tyr Glu Leu Ser Arg165 170 175Asn Glu Tyr Phe Ala Leu Arg Val Gln Thr Arg Glu Asp Ser Thr Lye180 185 190Tyr Ala Glu Leu Val Leu Glu Arg Ala Leu Asp Arg Glu Arg Glu Pro195 200 205Ser Leu Gln Leu Val Leu Thr Ala Leu Asp Gly Gly Thr Pro Ala Leu210 215 220Ser Ala Ser Leu Pro Ile His Ile Lys Val Leu Asp Ala Asn Asp Asn225 230 235 240Ala Pro Val Phe Asn Gln Ser Leu Tyr Arg Ala Arg Val Pro Gly Gly245 250 255Cys Thr Ser Gly Thr Arg Val Val Gln Val Leu Ala Thr Asp Leu Asp260 265 270Glu Gly Pro Asn Gly Glu Ile Ile Tyr Ser Phe Gly Ser His Asn Arg275 280 285Ala Gly Val Arg Gln Leu Phe Ala Leu Asp Leu Val Thr Gly Met Leu290 295 300Thr Ile Lys Gly Arg Leu Asp Phe Glu Asp Thr Lys Leu His Glu Ile305 310 315 320Tyr Ile Gln Ala Lys Asp Lys Gly Ala Asn Pro Glu Gly Ala His Cys325 330 335Lys Val Leu Val Glu Val Val Asp Val Asn Asp Asn Ala Pro Glu Ile340 345 350Thr Val Thr Ser Val Tyr Ser Pro Val Pro Glu Asp Ala Ser Gly Thr355 360 365Val Ile Ala Leu Leu Ser Val Thr Asp Leu Asp Ala Gly Glu Asn Gly370 375 380Leu Val Thr Cys Glu Val Pro Pro Gly Leu Pro Phe Ser Leu Thr Ser385 390 395 400Ser Leu Lys Asn Tyr Phe Thr Leu Lys Thr Ser Ala Asp Leu Asp Arg405 410 415Glu Thr Val Pro Glu Tyr Asn Leu Ser Ile Thr Ala Arg Asp Ala Gly420 425 430Thr Pro Ser Leu Ser Ala Leu Thr Ile Val Arg Val Gln Val Ser Asp435 440 445Ile Asn Asp Asn Pro Pro Gln Ser Ser Gln Ser Ser Tyr Asp Val Tyr450 455 460Ile Glu Glu Asn Asn Leu Pro Gly Ala Pro Ile Leu Asn Leu Ser Val465 470 475 480Trp Asp Pro Asp Ala Pro Gln Asn Ala Arg Leu Ser Phe Phe Leu Leu485 490 495Glu Gln Gly Ala Glu Thr Gly Leu Val Gly Arg Tyr Phe Thr Ile Asn500 505 510Arg Asp Asn Gly Ile Val Ser Ser Leu Val Pro Leu Asp Tyr Glu Asp515 520 525Arg Arg Glu Phe Glu Leu Thr Ala His Ile Ser Asp Gly Gly Thr Pro530 535 540Val Leu Ala Thr Asn Ile Ser Val Asn Ile Phe Val Thr Asp Arg Asn545 550 555 560Asp Asn Ala Pro Gln Val Leu Tyr Pro Arg Pro Gly Gly Ser Ser Val565 570 575Glu Met Leu Pro Arg Gly Thr Ser Ala Gly His Leu Val Ser Arg Val580 585 590Val Gly Trp Asp Ala Asp Ala Gly His Asn Ala Trp Leu Ser Tyr Ser595 600 605Leu Phe Gly Ser Pro Asn Gln Ser Leu Phe Ala Ile Gly Leu His Thr610 615 620Gly Gin Ile Ser Thr Ala Arg Pro Val Gln Asp Thr Asp Ser Pro Arg625 630 635 640Gln Thr Leu Thr Val Leu Ile Lys Asp Asn Gly Glu Pro Ser Leu Ser645 650 655Thr Thr Ala Thr Leu Thr Val Ser Val Thr Glu Asp Ser Pro Glu Ala660 665 670Arg Ala Glu Phe Pro Ser Gly Ser Ala Ser675 680(2)SEQ ID NO108的信息(i)序列特征(A)长度2831碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性
(ii)分子类型cDNA(xi)序列描述SEQ ID NO108GAATTCGGCA CGAGGCTGAA CTGAGGGTGA CGGACATAAA CGACTATTCT CCAGTGTTCA60GTGAAAGAGA AATGATACTG AGGATACCAG AAAACAGTGC TCGGGGAAAT ACATTCCCTT 120TAAACAATGC TCTGGACTCA GACGTAGATA TCAACAATAT CCAGACCTAT AGGCTCAGCT 180CAAACTCTCA TTTCCTGGTT GTAACCCGCA ACCGCAGTGA TGGCAGGAAG TACCCAGAGC 240TGGTGCTGGA GAAAGAACTG GATCGAGAGG AGGAACCTGA GCTGAGGTTA ACGCTGACAG 300CTTTGGATGG TGGCTCTCCT CCCCGGTCTG GGACGACACA GGTCCTCATT GAAGTAGTGG 360ACACCAACGA TAATGCACCC GAGTTTCAGC AGCCAACATA CCAAGTGCAA ACTCCCGAGA 420ACAGTCCCAC CGGCTCTCTG GTACTCACAG TCTCAGCCAA TGACTTAGAC AGTGGAGACT 480ATGGGAAAGT CTTGTACGCA CTTTCGCAAC CCTCAGAAGA TATTAGCAAA ACATTCGAGG 540TAAACCCTGT AACCGGGGAA ATTCGCCTAC GAAAAGAGGT GAATTTTGAA ACTATTCCTT 600CGTATGAAGT GGTTATCAAG GGGACGGACG GGGGAGGTCT CTCAGGAAAA TGCACTCTGT 660TACTGCAGGT GGTGGACGTG AATGACAATG CCCCAGAAGT GATGCTATCT GCGCTAACCA 720ACCCAGTCCC AGAAAATTCC CCCGATGAGG TAGTGGCTGT TTTCAGTGTT AGAGATCCTG 780ACTCTGGGAA CAACGGAAAA GTGATTGCAT CCATCGAGGA AGACCTGCCC TTTCTTCTAA 840AATCTTCAGG AAAGAACTTT TACACTTTAG TAACCAAGGG AGCACTTGAC AGGGAAGAAA 900GAGAGCAATT GAACATCACC ATCACAGTCA CTGACCTGGG CATACCCAGG CTCACCACCC 960AACACACCAT AACAGTGCAG GTGGCAGACA TCAACGACAA TGCCCCCTCC TTCACCCAAA 1020CCTCCTACAC CATGTTTGTC CGCGAGAACA ACAGCCCCGC CCTGCACATA GGCACCATCA 1080GCGCCACAGA CTCAGACTCA GGATCCAATG CCCACATCAC CTACTCGCTG CTACCGCCCC 1140AAGACCCACA GCTGGCCCTC GACTCGCTCA TCTCCATCAA TGTAGACAAC GGGCAGCTGT 1200TCGCGCTCAG GGCGCTAGAC TATGAGGCTC TGCAGGGCTT CGAGTTCCAT GTGGGCGCCA 1260CAGACCAAGG CTCGCCCGCG CTCAGCAGCC AGGCTCTGGT GCACGTGGTG GTGTTGGACG 1320ACAATGACAA TGCGCCCTTC GTGCTCTACC CGCTGCAAAA CGCCTCTGCA CCCTTCACTG 1380AGCTGCTGCC CAGGGCGGCA GAGCCTGGAT ACCTGGTTAC CAAGGTGGTA GCTGTGGACC 1440GCGACTCTGG CCAGAATGCC TGGCTGTCAT TCCAGCTGCT CAAGGCCACG GAGCCCGGGC 1500TGTTCAACGT ATGGGCGCAC AATGGCGAGG TACGCACCTC CAGGCTGCTG AGCGAGCGCG 1560ACGCACCCAA GCACAAGCTG CTGCTGTTGG TCAAGGACAA TGGAGATCCT CCACGCTCTG 1620CCAGTGTTAC TCTGCACGTG CTAGTGGTGG ATGCCTTCTC TCAGCCCTAC CTGCCTCTGC 1680CAGAGGTGGC GCACGACCCT GCACAAGAAG AAGATGCGCT AACACTCTAC CTGGTCATAG 1740CTTTGGCATC TGTGTCTTCT CTCTTCCTCT TGTCTGTGCT GCTGTTCGTG GGGGTGAGGC 1800TCTGCAGGAG GGCCAGGGCA GCCTCTCTGA GTGCCTATTC TGTGCCTGAA GGCCACTTTC 1860CTGGCCAGCT GGTGGATGTC AGAGGTATGG GGACCCTGTC CCAGAGCTAC CAGTATGATG 1920TATGTCTGAT GGGGGATTCT TCTGGGACCA GCGAATTTAA CTTCTTAAAG CCAGTTCTGC 1980CTAGCTCTCT GCACCAGTGC TCTGGGAAAG AAATAGAGGA AAATTCCACA CTCCAGAATA 2040GTTTTGGGTT TCATCATTAA TAGAAAACTA CTTTACAGAT ATTTAATTCC AAATATCATC 2100TTGTTGATTA ACTAAAGTCT GTTCACATGT AGCTAGCTAG CAACGATTTT AATGTTCACT 2160TTACCCATCT TTTTTCAGGG TCATGTCTAA AGCTACAAGT TTGNCTTTAC TTATACTTGT 2220CGCACAGAAT NNNNNNNNNN TGGTGTATAA GTCACAGTCA TGGGATACTG GCACAAGATG 2280GCAGCTTGAT TGCTCAGTTA TGGCTGCAAA GGGGNGCTTG AGTTTAGGGA ATGTGTTAGA 2340GCTGGAATAA GTTTTCTGAG AAATGTGTAA GACAAATTTC TTTTGCACAT TCCCTGTGTT 2400CCTGTACCCC TGTTTCCAGA ACTACGAAAT GTGTCATCAG AAGGCATGCT CACATTTTCC 2460CCTTTGTTTG CGTGACCCGG GTGCCAGAAA TTAAATAAAA TTAGCATGGA GTTCAATGCA 2520GCATTAAAAC AAAGTTACTT CTACAAACCT TTTATTCGAC GGTTAAAATT GTAACTTCCC 2580CACCCATGAG GCTGGCTGTA AGAACCAGTA TGAATGGGTG TCTATCGCAA CCTTATTTTC 2640AAAAATCAAA CAAAAGGAGA AATGAGAGAC CAAACAACAC GCTACAGGAA AGATTTCATA 2700AGGATGTATG TATGGACACA AAAACTGGGA TACAGACATT TTAAATCTGT TGGTACCACA 2760TGGTGGCGCT GCAGGCTAAA GAAATGCAAG GGAAATTAAA AAGAGGCTGA GCTAGAAGTC 2820AAAAAAAAAA A 2831(2)SEQ ID NO109的信息(i)序列特征(A)长度3353碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(ix)特征(A)名称/键CDS(B)定位763..3123(xi)序列描述SEQ ID NO109GTATTTTTCC ACAGTTTAAA ATTTTCATAA AATCATAACT CTCTGACTTT ATGTAGAAAG60GATACCACAC TGGAATTAAC GTGTAGCTTT TTCTTGATGT AATCCAACCA ATGGGAGCAC 120AATTCTGGTA CATAGGCTGT CTAGAATTTG AAAGAAATTA AAGAATTCAT TTTGTTTTGC 180TGATAAATTT TTAAGAAATC ACGTGGCTTT ATGTTATTAT TATTACAAGA TGACTGATCA 240CTATTATGTC TTCTTTCACT TCTCAATTTC CCTCAGAACA CTACACCCAG ACTACAGGCT 300CTGGAGGGTG GGGACCATGT CTGGGTTGTT TACTGATGTA TTTCATAATT TGGCACATAG 360AGACCAATAA TACTCCTTTA AATGAAGAAA TTAATAATTA CCATTGCGTG ATATTGTGAT 420TACATCATTT CCTCCCAATT TCCAAACTCC TAATAGAATA GAGAATAGAT CAATTGTAGC 480AATTCGTTTC GAAGCAAAGA CAACGCATGG TGGCGCTGCA GGCTAAGGCT TCAAAAAAAG 540GAAAAGGAAA AAGCCCATGA AATGCTACTA GCTACTTCAG ACCTCTTTCA GCCTAAGAGG 600AAAGCCTGTT AGCAGAGCAC GGACCAGTGT CTCCGGAGAA TGCTATTCTC CTACATTTCC 660GAACAGGTTA TCAACGCACA GATCGATCAC TGCCTCTGTC CCATCGCTCC CTGAAGTAGC 720TCTGACTCCG GTTCCTTGAA AGGGGCGTGT ACAGAAGTAA AG ATG GAG CCT GCA 774Met Glu Pro Ala1GGG GAG CGC TTT CCC GAA CAA AGG CAA GTC CTG ATT CTC CTT CTT TTA 822Gly Glu Arg Phe Pro Glu Gln Arg Gln Val Leu Ile Leu Leu Leu Leu5 10 15 20CTG GAA GTG ACT CTG GCA GGC TGG GAA CCC CGT CGC TAT TCT GTG ATG 870Leu Glu Val Thr Leu Ala Gly Trp Glu Pro Arg Arg Tyr Ser Val Met25 30 35GAG GAA ACA GAG AGA GGT TCT TTT GTA GCC AAC CTG GCC AAT GAC CTA 918Glu Glu Thr Glu Arg Gly Ser Phe Val Ala Asn Leu Ala Asn Asp Leu40 45 50GGG CTG GGA GTG GGG GAG CTA GCC GAG CGG GGA GCC CGG GTA GTT TCT 966Gly Leu Gly Val Gly Glu Leu Ala Glu Arg Gly Ala Arg Val Va1 Ser55 60 65GAG GAT AAC GAA CAA GGC TTG CAG CTT GAT CTG CAG ACC GGG CAG TTG1014Glu Asp Asn Glu Gln Gly Leu Gln Leu Asp Leu Gln Thr Gly Gln Leu70 75 80ATA TTA AAT GAG AAG CTG GAC CGG GAG AAG CTG TGT GGC CCT ACT GAG1062Ile Leu Asn Glu Lys Leu Asp Arg Glu Lys Leu Cys Gly Pro Thr Glu85 90 95 100CCC TGT ATA ATG CAT TTC CAA GTG TTA CTG AAA AAA CCT TTG GAA GTA1110Pro Cys Ile Met His Phe Gln Val Leu Leu Lys Lys Pro Leu Glu Val105 110 115TTT CGA GCT GAA CTA CTA GTG ACA GAC ATA AAC GAT CAT TCT CCT GAG1158Phe Arg Ala Glu Leu Leu Val Thr Asp Ile Asn Asp His Ser Pro Glu120 125 130TTT CCT GAA AGA GAA ATG ACC CTG AAA ATC CCA GAA ACT AGC TCC CTT1206Phe Pro Glu Arg Glu Met Thr Leu Lys Ile Pro Glu Thr Ser Ser Leu135 140 145GGG ACT GTG TTT CCT CTG AAA AAA GCT CGG GAC TTG GAC GTG GGC AGC1254Gly Thr Val Phe Pro Leu Lys Lys Ala Arg Asp Leu Asp Val Gly Ser150 155 160AAT AAT GTT CAA AAC TAC AAT ATT TCT CCC AAT TCT CAT TTC CAT GTT1302Asn Asn Val Gln Asn Tyr Asn Ile Ser Pro Asn Ser His Phe His Val165 170 175 180TCC ACT CGC ACC CGA GGG GAT GGC AGG AAA TAC CCA GAG CTG GTG CTG1350Ser Thr Arg Thr Arg Gly Asp Gly Arg Lys Tyr Pro Glu Leu Val Leu185 190 195GAC ACA GAA CTG GAT CGC GAG GAG CAG GCC GAG CTC AGA TTA ACC TTG1398Asp Thr Glu Leu Asp Arg Glu Glu Gln Ala Glu Leu Arg Leu Thr Leu200 205 210ACA GCG GTG GAC GGT GGC TCT CCA CCC CGA TCT GGC ACC GTC CAG ATC1446Thr Ala Val Asp Gly Gly Ser Pro Pro Arg Ser Gly Thr Val Gln Ile215 220 225CTC ATC TTG GTC TTG GAC GCC AAT GAC AAT GCC CCG GAG TTT GTG CAG1494Leu Ile Leu Val Leu Asp Ala Asn Asp Asn Ala Pro Glu Phe Val Gln230 235 240GCG CTC TAC GAG GTG CAG GTC CCA GAG AAC AGC CCA GTA GGC TCC CTA1542Ala Leu Tyr Glu Val Gln Val Pro Glu Asn Ser Pro Val Gly Ser Leu245 250 255 260GTT GTC AAG GTC TCT GCT AGG GAT TTA GAC ACT GGG ACA AAT GGA GAG 1590Val Val Lys Val Ser Ala Arg Asp Leu Asp Thr Gly Thr Asn Gly Glu265 270 275ATA TCA TAC TCC CTT TAT TAC AGC TCT CAG GAG ATA GAC AAA CCT TTT 1638Ile Ser Tyr Ser Leu Tyr Tyr Ser Ser Gln Glu Ile Asp Lys Pro Phe280 285 290GAG CTA AGC AGC CTT TCA GGA GAA ATT CGA CTA ATT AAA AAA CTA GAT 1686Glu Leu Ser Ser Leu Ser Gly Glu Ile Arg Leu Ile Lys Lys Leu Asp295 300 305TTT GAG ACA ATG TCT TCA TAT GAT CTA GAT ATA GAG GCA TCT GAT GGC 1734Phe Glu Thr Met Ser Ser Tyr Asp Leu Asp Ile Glu Ala Ser Asp Gly310 315 320GGG GGA CTT TCT GGA AAA TGC TCT GTC TCT GTT AAG GTG CTG GAT GTT 1782Gly Gly Leu Ser Gly Lys Cys Ser Val Ser Val Lys Val Leu Asp Val325 330 335 340AAC GAT AAC TTC CCG GAA CTA AGT ATT TCA TCA CTT ACC AGC CCT ATT 1830Asn Asp Asn Phe Pro Glu Leu Ser Ile Ser Ser Leu Thr Ser Pro Ile345 350 355CCC GAG AAT TCT CCA GAG ACA GAA GTG GCC CTG TTT AGG ATT AGA GAC 1878Pro Glu Asn Ser Pro Glu Thr Glu Val Ala Leu Phe Arg Ile Arg Asp360 365 370CGA GAC TCT GGA GAA AAT GGA AAA ATG ATT TGC TCA ATT CAG GAT GAT 1926Arg Asp Ser Gly Glu Asn Gly Lys Met Ile Cys Ser Ile Gln Asp Asp375 380 385GTT CCT TTT AAG CTA AAA CCT TCT GTT GAG AAT TTC TAC AGG CTG GTA 1974Val Pro Phe Lys Leu Lys Pro Ser Val Glu Asn Phe Tyr Arg Leu Val390 395 400ACA GAA GGG GCG CTG GAC AGA GAG ACC AGA GCC GAG TAC AAC ATC ACC 2022Thr Glu Gly Ala Leu Asp Arg Glu Thr Arg Ala Glu Tyr Asn Ile Thr405 410 415 420ATC ACC ATC ACA GAC TTG GGG ACT CCA AGG CTG AAA ACC GAG CAG AGC 2070Ile Thr Ile Thr Asp Leu Gly Thr Pro Arg Leu Lys Thr Glu Gln Ser425 430 435ATA ACC GTG CTG GTG TCG GAC GTC AAT GAC AAC GCC CCC GCC TTC ACC 2118Ile Thr Val Leu Val Ser Asp Val Asn Asp Asn Ala Pro Ala Phe Thr440 445 450CAA ACC TCC TAC ACC CTG TTC GTC CGC GAG AAC AAC AGC CCC GCC CTG 2166Gln Thr Ser Tyr Thr Leu Phe Val Arg Glu Asn Asn Ser Pro Ala Leu455 460 465CAC ATC GGC AGT GTC AGC GCC ACA GAC AGA GAC TCG GGC ACC AAC GCC 2214His Ile Gly Ser Val Ser Ala Thr Asp Arg Asp Ser Gly Thr Asn Ala470 475 480CAG GTC ACC TAC TCG CTG CTG CCG CCC CAG GAC CCG CAC CTG CCC CTA 2262Gln Val Thr Tyr Ser Leu Leu Pro Pro Gln Asp Pro His Leu Pro Leu485 490 495 500ACC TCC CTG GTC TCC ATT AAC ACG GAC AAC GGC CAC CTG TTC GCT CTC 2310Thr Ser Leu Val Ser Ile Asn Thr Asp Asn Gly His Leu Phe Ala Leu505 510 515CAG TCG CTG GAC TAC GAG GCC CTG CAG GCT TTC GAG TTC CGC GTG GGC 2358Gln Ser Leu Asp Tyr Glu Ala Leu Gln Ala Phe Glu Phe Arg Val Gly520 525 530GCC ACA GAC CGC GGC TTC CCG GCG CTG AGC AGC GAG GCG CTG GTG CGA 2406Ala Thr Aep Arg Gly Phe Pro Ala Leu Ser Ser Glu Ala Leu Val Arg535 540 545GTG CTG GTG CTG GAC GCC AAC GAC AAC TCG CCC TTC GTG CTG TAC CCG 2454Val Leu Val Leu Asp Ala Asn Asp Asn Ser Pro Phe Val Leu Tyr Pro550 555 560CTG CAG AAC GGC TCC GCG CCC TGC ACC GAG CTG GTG CCC CGG GCG GCC 2502Leu Gln Asn Gly Ser Ala Pro Cys Thr Glu Leu Val Pro Arg Ala Ala565 570 575 580GAG CCG GGC TAC CTG GTG ACC AAG GTG GTG GCG GTG GAC GGC GAC TCG 2550Glu Pro Gly Tyr Leu Val Thr Lys Val Val Ala Val Asp Gly Asp Ser585 590 595GGC CAG AAC GCC TGG CTG TCG TAC CAG CTG CTC AAG GCC ACG GAG CCC 2598Gly Gln Asn Ala Trp Leu Ser Tyr Gln Leu Leu Lys Ala Thr Glu Pro600 605 610GGG CTG TTC GGC GTG TGG GCG CAC AAT GGC GAG GTG CGC ACC GCC AGG 2646Gly Leu Phe Gly Val Trp Ala His Ash Gly Glu Val Arg Thr Ala Arg615 620 625CTG CTG AGC GAG CGC GAC GTG GCC AAG CAC AGG CTA GTG GTG CTG GTC 2694Leu Leu Ser Glu Arg Asp Val Ala Lys His Arg Leu Val Val Leu Val630 635 640AAG GAC AAT GGC GAG CCT CCG CGC TCG GCC ACA GCC ACG CTG CAA GTG 2742Lys Asp Asn Gly Glu Pro Pro Arg Ser Ala Thr Ala Thr Leu Gln Val645 650 655 660CTC CTG GTG GAC GGC TTC TCT CAG CCC TAC CTG CCG CTC CCA GAG GCG 2790Leu Leu Val Asp Gly Phe Ser Gln Pro Tyr Leu Pro Leu Pro Glu Ala665 670 675GCC CCG GCC CAA GCC CAG GCC GAC TCG CTT ACC GTC TAC CTG GTG GTG 2838Ala Pro Ala Gln Ala Gln Ala Asp Ser Leu Thr Val Tyr Leu Val Val680 685 690GCA TTG GCC TCG GTG TCT TCG CTC TTC CTC TTC TCG GTG TTC CTG TTC 2886Ala Leu Ala Ser Val Ser Ser Leu Phe Leu Phe Ser Val Phe Leu Phe695 700 705GTG GCA GTG CGG CTG TGC AGG AGG AGC AGG GCG GCC TCA GTG GGT CGC 2934Val Ala Val Arg Leu Cys Arg Arg Ser Arg Ala Ala Ser Val Gly Arg710715 720TGC TCG GTG CCC GAG GGC CCC TTT CCA GGG CAT CTG GTG GAC GTG AGC 2982Cys Ser Val Pro Glu Gly Pro Phe Pro Gly His Leu Val Asp Val Ser725 730 735 740GGC ACC GGG ACC CrT TCC CAG AGC TAC CAG TAC GAG GTG TGT CTG ACG3030Gly Thr Gly Thr Leu Ser Gln Ser Tyr Gln Tyr Glu Val Cys Leu Thr745 750 755GGA GGC TCT GAA AGT AAT GAT TTC AAG TTC TTG AAG CCT ATA TTC CCA3078Gly Gly Ser Glu Ser Asn Asp Phe Lys Phe Leu Lys Pro Ile Phe Pro760 765 770AAT ATT GTA AGC CAG GAC TCT AGG AGG AAA TCA GAA TTT CTA GAA3123Asn Ile Val Ser Gln Asp Ser Arg Arg Lys Ser Glu Phe Leu Glu775 780 785TAATGTAGGT ATCTGTAGCT TTCCGACCGT CTGTTAATTT TGTCTTCCTC ACTTTTCACC 3183TTAGTTTTTT TTAACCCTTT AGTAATCTTG AATTCTACTT TTTTTTAAAT TTCTACTGTT 3243GTCTTTAGTA ATGTTACTCA TTTCCTTTGT CTGATTGTTA GTTTTCAAAT TATTGTATTA 3303TTATAAATAT TTTATATCAG GAAAGTTCAT ATTTCTGAAT AAATTAATAG 3353(2)SEQ ID NO110的信息(i)序列特征(A)长度787氨基酸(B)类型氨基酸(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO110Met Glu Pro Ala Gly Glu Arg Phe Pro Glu Gln Arg Gln Val Leu Ile1 5 10 15Leu Leu Leu Leu Leu Glu Val Thr Leu Ala Gly Trp Glu Pro Arg Arg20 25 30Tyr Ser Val Mer Glu Glu Thr Glu Arg Gly Ser Phe Val Ala Asn Leu35 40 45Ala Asn Asp Leu Gly Leu Gly Val Gly Glu Leu Ala Glu Arg Gly AlaS0 55 60Arg Val Val Ser Glu Asp Asn Glu Gln Gly Leu Gln Leu Asp Leu Gln65 70 75 80Thr Gly Gln Leu Ile Leu Asn Glu Lys Leu Asp Arg Glu Lys Leu Cys85 90 95Gly Pro Thr Glu Pro Cys Ile Met His Phe Gln Val Leu Leu Lys Lys100 105 110Pro Leu Glu Val Phe Arg Ala Glu Leu Leu Val Thr Asp Ile Asn Asp115 120 125
His Ser Pro Glu Phe Pro Glu Arg Glu Met Thr Leu Lys Ile Pro Glu130 135 140Thr Ser Ser Leu Gly Thr Val Phe Pro Leu Lys Lys Ala Arg Asp Leu145 150 155 160Asp Val Gly Ser Asn Asn Val Gln Asn Tyr Asn Ile Ser Pro Asn Ser165 170 175His Phe His Val Ser Thr Arg Thr Arg Gly Asp Gly Arg Lys Tyr Pro180 185 190Glu Leu Val Leu Asp Thr Glu Leu Asp Arg Glu Glu Gln Ala Glu Leu195 200 205Arg Leu Thr Leu Thr Ala Val Asp Gly Gly Ser Pro Pro Arg Ser Gly210 215 220Thr Val Gln Ile Leu Ile Leu Val Leu Asp Ala Asn Asp Asn Ala Pro225 230 23S 240Glu Phe Val Gln Ala Leu Tyr Glu Val Gln Val Pro Glu Asn Ser Pro245 250 255Val Gly Ser Leu Val Val Lys Val Ser Ala Arg Asp Leu Asp Thr Gly260 265 270Thr Asn Gly Glu Ile Ser Tyr Ser Leu Tyr Tyr Ser Ser Gln Glu Ile275 280 285Asp Lys Pro Phe Glu Leu Ser Ser Leu Ser Gly Glu Ile Arg Leu Ile290 295 300Lys Lys Leu Asp Phe Glu Thr Met Ser Ser Tyr Asp Leu Asp Ile Glu305 310 315 320Ala Ser Asp Gly Gly Gly Leu Ser Gly Lys Cys Ser Val Ser Val Lys325 330 335Val Leu Asp Val Asn Asp Asn Phe Pro Glu Leu Ser Ile Ser Ser Leu340 345 350Thr Ser Pro Ile Pro Glu Ash Ser Pro Glu Thr Glu Val Ala Leu Phe355 360 365Arg Ile Arg Asp Arg Asp Ser Gly Glu Asn Gly Lys Met Ile Cys Ser370 375 380Ile Gln Asp Asp Val Pro Phe Lys Leu Lys Pro Ser Val Glu Asn Phe385 390 395 400Tyr Arg Leu Val Thr Glu Gly Ala Leu Asp Arg Glu Thr Arg Ala Glu405 410 415Tyr Asn Ile Thr Ile Thr Ile Thr Asp Leu Gly Thr Pro Arg Leu Lys420 425 430Thr Glu Gln Ser Ile Thr Val Leu Val Ser Asp Val Asn Asp Asn Ala435 440 445
Pro Ala Phe Thr Gln Thr Ser Tyr Thr Leu Phe Val Arg Glu Asn Asn450 455 460Ser Pro Ala Leu His Ile Gly Ser Val Ser Ala Thr Asp Arg Asp Ser465 470 475 480Gly Thr Asn Ala Gln Val Thr Tyr Ser Leu Leu Pro Pro Gln Asp Pro485 490 495His Leu Pro Leu Thr Ser Leu Val Ser Ile Asn Thr Asp Asn Gly His500 505 510Leu Phe Ala Leu Gln Ser Leu Asp Tyr Glu Ala Leu Gln Ala Phe Glu515 520 525Phe Arg Val Gly Ala Thr Asp Arg Gly Phe Pro Ala Leu Ser Ser Glu530 535 540Ala Leu Val Arg Val Leu Val Leu Asp Ala Asn Asp Asn Ser Pro Phe545 550 555 560Val Leu Tyr Pro Leu Gln Asn Gly Ser Ala Pro Cys Thr Glu Leu Val565 570 575Pro Arg Ala Ala Glu Pro Gly Tyr Leu Val Thr Lys Val Val Ala Val580 585 590Asp Gly Asp Ser Gly Gln Asn Ala Trp Leu Ser Tyr Gln Leu Leu Lys595 600 605Ala Thr Glu Pro Gly Leu Phe Gly Val Trp Ala His Asn Gly Glu Val610 615 620Arg Thr Ala Arg Leu Leu Ser Glu Arg Asp Val Ala Lys His Arg Leu625 630 635 640Val Val Leu Val Lys Asp Asn Gly Glu Pro Pro Arg Ser Ala Thr Ala645 650 655Thr Leu Gln Val Leu Leu Val Asp Gly Phe Ser Gln Pro Tyr Leu Pro660 665 670Leu Pro Glu Ala Ala Pro Ala Gln Ala Gln Ala Asp Ser Leu Thr Val675 680 685Tyr Leu Val Val Ala Leu Ala Ser Val Ser Ser Leu Phe Leu Phe Ser690 695 700Val Phe Leu Phe Val Ala Val Arg Leu Cys Arg Arg Ser Arg Ala Ala705 710 715 720
Ser Val Gly Arg Cys Ser Val Pro Glu Gly Pro Phe Pro Gly His Leu725 730 735Val Asp Val Ser Gly Thr Gly Thr Leu Ser Gln Ser Tyr Gln Tyr Glu740 745 750Val Cys Leu Thr Gly Gly Ser Glu Ser Asn Asp Phe Lys Phe Leu Lys755 760 765Pro Ile Phe Pro Asn Ile Val Ser Gln Asp Ser Arg Arg Lys Ser Glu770 775 780Phe Leu Glu785(2)SEQ ID NO111的信息(i)序列特征(A)长度3033碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(ix)特征(A)名称/键CDS(B)定位138..2528(xi)序列描述SEQ ID NO111GTGATTGGAC GTGTTTTTGT GACTATTTGG GAAGAAGACA CCTTCCTAAT CAGATTTACT 60CCAATATCTT CCCGGACCCT CATGAGTGGA TTGCAATTGA CTTGAAGAAG CAGCACCCTC120AGGACTGAAT CTGAACA ATG GAG ACA GCA CTA GCA AAA ATA CCA CAG CAA 170Met Glu Thr Ala Leu Ala Lys Ile Pro Gln Gln1 5 10AGG CAA GTC TTT TTT CTT ACT ATA TTG TCG TTA TTG TGG AAG TCT AGC 218Arg Gln Val Phe Phe Leu Thr Ile Leu Ser Leu Leu Trp Lys Ser Ser15 20 25TCT GAG GCC ATT AGA TAT TCC ATG CCA GAA GAA ACA GAG AGT GGC TAT 266Ser Glu Ala Ile Arg Tyr Ser Met Pro Glu Glu Thr Glu Ser Gly Tyr30 35 40ATG GTG GCT AAC CTG GCG AAA GAT CTG GGG ATC AGG GTT GGA GAA CTG 314Met Val Ala Asn Leu Ala Lys Asp Leu Gly Ile Arg Val Gly Glu Leu45 50 55TCC TCT AGA GGA GCT CAA ATC CAT TAC AAA GGA AAC AAA GAA CTT TTG 362Ser Ser Arg Gly Ala Gln Ile His Tyr Lys Gly Asn Lys Glu Leu Leu60 65 70 75CAG CTG GAT GCA GAG ACT GGG AAT TTG TTC TTA AAG GAA AAA CTA GAC 410Gln Leu Asp Ala Glu Thr Gly Asn Leu Phe Leu Lys Glu Lys Leu Asp80 85 90AGA GAA CTG CTG TGT GGA GAG ACA GAA CCC TGT GTG CTG AAC TTC CAG 458Arg Glu Leu Leu Cys Gly Glu Thr Glu Pro Cys Val Leu Asn Phe Gln95 100 105ATC ATA CTG GAA AAC CCT ATG CAG TTC TTC CAA ACT GAA CTG CAG CTC 506Ile Ile Leu Glu Asn Pro Mer Gln Phe Phe Gln Thr Glu Leu Gln Leu110 115 120ACA GAT ATA AAC GAC CAT TCT CCA GAG TTC CCC AAC AAG AAA ATG CTT 554Thr Asp Ile Asn Asp His Ser Pro Glu Phe Pro Asn Lys Lys Met Leu125 130 135CTA ACA ATT CCT GAG AGT GCC CAT CCA GGG ACT GTG TTT CCT CTG AAG 602Leu Thr Ile Pro Glu Ser Ala His Pro Gly Thr Val Phe Pro Leu Lys140 145 150 155GCA GCT CGG GAC TCT GAC ATA GGG AGC AAC GCT GTT CAG AAC TAC ACA 650Ala Ala Arg Asp Ser Asp Ile Gly Ser Asn Ala Val Gln Asn Tyr Thr160 165 170GTC AAT CCC AAC CTC CAT TTC CAC GTC GTT ACT CAC AGT CGC ACA GAT 698Val Asn Pro Asn Leu His Phs His Val Val Thr His Ser Arg Thr Asp175 180 185GGC AGG AAA TAC CCA GAG CTG GTG CTG GAC AGA GCC CTG GAT AGG GAG 746Gly Arg Lye Tyr Pro Glu Leu Val Leu Asp Arg Ala Leu Asp Arg Glu190 195 200GAG CAG CCT GAG CTC ACT TTA ATC CTC ACT GCT CTG GAT GGT GGA GCT 794Glu Gln Pro Glu Leu Thr Leu Ile Leu Thr Ala Leu Asp Gly Gly Ala205 210 215CCT TCC AGG TCA GGA ACC ACC ACA GTT CAC ATA GAA GTT GTG GAC ATC 842Pro Ser Arg Ser Gly Thr Thr Thr Val His Ile Glu Val Val Asp Ile220 225230 235AAT GAT AAC TCC CCC CAG TTT GTA CAG TCA CTC TAT AAG GTG CAA GTT 890Asn Asp Asn Ser Pro Gln Phe Val Gln Ser Leu Tyr Lys Val Gln Val240 245 250CCT GAG AAT AAT CCC CTC AAT GCC TTT GTT GTC ACG GTC TCT GCC ACG 938Pro Glu Asn Asn Pro Leu Asn Ala Phe Val Val Thr Val Ser Ala Thr255 260 265GAT TTA GAT GCT GGG GTA TAT GGC AAT GTG ACC TAT TCT CTG TTT CAA 986Asp Leu Asp Ala Gly Val Tyr Gly Asn Val Thr Tyr Ser Leu Phe Gln270 275 280GGG TAT GGG GTA TTT CAA CCA TTT GTA ATA GAC GAA ATC ACT GGA GAA1034Gly Tyr Gly Val Phe Gln Pro Phe Val Ile Asp Glu Ile Thr Gly Glu285 290 295ATC CAT CTG AGC AAA GAG CTG GAT TTT GAG GAA ATT AGC AAT CAT AAC1082Ile His Leu Ser Lys Glu Leu Asp Phe Glu Glu Ile Ser Asn His Asn300 305 310 315ATA GAA ATC GCA GCC ACA GAT GGA GGA GGC CTT TCA GGA AAA TGC ACT1130Ile Glu Ile Ala Ala Thr Asp Gly Gly Gly Leu Ser Gly Lys Cys Thr320 325 330GTG GCT GTA CAG GTG TTG GAT GTG AAT GAC AAC GCC CCA GAG TTG ACA1178Val Ala Val Gln Val Leu Asp Val Asn Asp Asn Ala Pro Glu Leu Thr335 340 345ATT AGG AAG CTC ACA GTC CTG GTC CCA GAA AAT TCC GCA GAG ACT GTA1226Ile Arg Lys Leu Thr Val Leu Val Pro Glu Asn Ser Ala Glu Thr Val350 355 360GTT GCT GTT TTT AGT GTT TCT GAT TCT GAT TCG GGG GAC AAT GGA AGG 1274Val Ala Val Phe Ser Val Ser Asp Ser Asp Ser Gly Asp Asn Gly Arg365 370 375ATG GTG TGT TCT ATT CCG AAC AAT ATC CCA TTT CTC CTG AAA CCC ACA 1322Met Val Cys Ser Ile Pro Asn Asn Ile Pro Phe Leu Leu Lys Pro Thr380 385 390 395TTT GAG AAT TAT TAC ACG TTA GTG ACT GAG GGG CCA CTT GAT AGA GAG 1370Phe Glu Asn Tyr Tyr Thr Leu Val Thr Glu Gly Pro Leu Asp Arg Glu400 405 410AAC AGA GCT GAG TAC AAC ATC ACC ATC ACG GTC TCA GAT CTG GGC ACA 1418Asn Arg Ala Glu Tyr Asn Ile Thr Ile Thr Val Ser Asp Leu Gly Thr415 420 425CCC AGG CTC ACA ACC CAG CAC ACC ATA ACA GTG CAA GTG TCC GAC ATC 1466Pro Arg Leu Thr Thr Gln His Thr Ile Thr Val Gln Val Ser Asp Ile430 435 440AAC GAC AAC GCC CCT GCC TTC ACC CAA ACC TCC TAC ACC ATG TTT GTC 1514Asn Asp Asn Ala Pro Ala Phe Thr Gln Thr Ser Tyr Thr Met Phe Val445 450 455CAC GAG AAC AAC AGC CCC GCC CTG CAC ATA GGC ACC ATC AGT GCC ACA 1562His Glu Asn Asn Ser Pro Ala Leu His Ile Gly Thr Ile Ser Ala Thr460 465 470 475GAC TCA GAC TCA GGC TCC AAT GCC CAC ATC ACC TAC TCG CTG CTG CCG 1610Asp Ser Asp Ser Gly Ser Asn Ala His Ile Thr Tyr Ser Leu Leu Pro480 485 490CCT GAT GAC CCG CAG CTG GCC CTC GAC TCA CTC ATC TCC ATC AAT GTT 1658Pro Asp Asp Pro Gln Leu Ala Leu Asp Ser Leu Ile Ser Ile Asn Val495 500 505GAC AAT GGG CAG CTG TTC GCG CTC AGA GCT CTA GAC TAT GAG GCA CTG 1706Asp Asn Gly Gln Leu Phe Ala Leu Arg Ala Leu Asp Tyr Glu Ala Leu510 515 520CAG TCC TTC GAG TTC TAC GTG GGC GCT ACA GAT GGA GGC TCA CCC GCG 1754Gln Ser Phe Glu Phe Tyr Val Gly Ala Thr Asp Gly Gly Ser Pro Ala525 530 535CTC AGC AGC CAG ACT CTG GTG CGG ATG GTG GTG CTG GAT GAC AAT GAC 1802Leu Ser Ser Gln Thr Leu Val Arg Met Val Val Leu Asp Asp Asn Asp540 545 550 555AAT GCC CCC TTC GTG CTC TAC CCA CTG CAG AAT GCC TCA GCA CCC TGT 1850Asn Ala Pro Phe Val Leu Tyr Pro Leu Gln Asn Ala Ser Ala Pro Cys560 565 570ACT GAG CTA CTG CCT AGG GCA GCA GAG CCC GGC TAC CTG ATC ACC AAA 1898Thr Glu Leu Leu Pro Arg Ala Ala Glu Pro Gly Tyr Leu Ile Thr Lys575 580 585GTG GTG GCT GTG GAT CGC GAC TCT GGA CAG AAT GCT TGG CTG TCG TTC 1946Val Val Ala Val Asp Arg Asp Ser Gly Gln Asn Ala Trp Leu Ser Phe590 595 600CAG CTA CTT AAA GCT ACA GAG CCA GGG CTG TTC AGT GTA TGG GCA CAC 1994Gln Leu Leu Lys Ala Thr Glu Pro Gly Leu Phe Ser Val Trp Ala His605 610 615AAT GGT GAA GTG CGC ACC ACT AGG CTG CTG AGT GAG CGA GAT GCT CAG2042Asn Gly Glu Val Arg Thr Thr Arg Leu Leu Ser Glu Arg Asp Ala Gln620 625 630 635AAG CAC AAG CTA CTG CTG CTG GTC AAG GAC AAT GGC GAT CCT CTG CGC2090Lys His Lys Leu Leu Leu Leu Val Lys Asp Asn Gly Asp Pro Leu Arg640 645 650TCT GCC AAT GTC ACT CTT CAC GTG CTA GTG GTG GAT GGC TTC TCG CAG2138Ser Ala Asn Val Thr Leu His Val Leu Val Val Asp Gly Phe Ser Gln655 660 665CCT TAC CTA CCA TTG GCT GAG GTG GCA CAG GAT TCC ATG CAA GAT AAT2186Pro Tyr Leu Pro Leu Ala Glu Val Ala Gln Asp Ser Met Gln Asp Asn670 675 680TAC GAC GTT CTC ACA CTG TAC CTA GTC ATT GCC TTG GCA TCT GTA TCT2234Tyr Asp Val Leu Thr Leu Tyr Leu Val Ile Ala Leu Ala Ser Val Ser685 690 695TCT CTC TTC CTC TTG TCT GTA GTG CTG TTT GTG GGG GTG AGG CTG TGC2282Ser Leu Phe Leu Leu Ser Val Val Leu Phe Val Gly Val Arg Leu Cys700 705 710 715AGG AGG GCC AGG GAG GCC TCC TTG GGT GAC TAC TCT GTG CCT GAG GGA2330Arg Arg Ala Arg Glu Ala Ser Leu Gly Asp Tyr Ser Val Pro Glu Gly720 725 730CAC TTT CCT AGC CAC TTG GTG GAT GTC AGC GGT GCC GGG ACC CTG TCC2378His Phe Pro Ser His Leu Val Asp Val Ser Gly Ala Gly Thr Leu Ser735 740 745CAG AGT TAT CAA TAT GAG GTG TGT CTT AAT GGA GGT ACT AGA ACA AAT2426Gln Ser Tyr Gln Tyr Glu Val Cys Leu Asn Gly Gly Thr Arg Thr Asn750 755 760GAG TTT AAC TTT CTT AAA CCA TTG TTT CCT ATC CTT CCG ACC CAG GCT2474Glu Phe Asn Phe Leu Lys Pro Leu Phe Pro Ile Leu Pro Thr Gln Ala765 770 775GCT GCT GCT GAA GAA AGA GAA AAC GCT GTT GTG CAC AAT AGC GTT GGA2522Ala Ala Ala Glu Glu Arg Glu Asn Ala Val Val His Asn Ser Val Gly780 785 790 795TTC TAT TAGAGCACTG ATTTTGAAGT GGTGGTTACC TCATTTTTCC TTAACTATCC 2578Phe TyrCTGATGTAGA ATGGTGTAGT GCCGTGAATC AACTCCTGAG ATATATGTTC ATTTTATCCT 2638TTGTTTTGAA TCAAACTATT CAGATGTGAT CCTACTCTAG AGAATTTGGT TCTACTCCAT 2698TGTGTTTGTT TAGATTTCTA CGCCATACCA GTGCATGCTG GGTTGTTTTT TTTTTTACAA 2758TTATTATAAC TTTGCTTTGG AGGGGAACTC ATATTCGCTG TAACGAATTG GAACCACTTT 2818CATTGTTAGA GATGCCTTGC TTTGTTGTGT TATTTCAGAC AGGGTCTTAA ATTGTAGCCC 2878TGGGTGACCT GAAATGACTA TGTACAGACT GACTTTGAAT TTGTGGCAGT CCATCTGCCT 2938CTGTTGTCCT ATGTTGGGAT TGTGAGCATG CATGAGTAGG CTCAGCTGTG GTGAGCGACC 2998TTAATAAAAA TCAAATACTA AAAAAAAAAA AAAAA 3033(2)SEQ ID NO112的信息(i)序列特征(A)长度797氨基酸(B)类型氨基酸(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO112Met Glu Thr Ala Leu Ala Lys Ile Pro Gln Gln Arg Gln Val Phe Phe1 5 10 15Leu Thr Ile Leu Ser Leu Leu Trp Lys Ser Ser Ser Glu Ala Ile Arg20 25 30Tyr Ser Met Pro Glu Glu Thr Glu Ser Gly Tyr Met Val Ala Asn Leu35 40 45Ala Lys Asp Leu Gly Ile Arg Val Gly Glu Leu Ser Ser Arg Gly Ala50 55 60Gln Ile His Tyr Lys Gly Asn Lys Glu Leu Leu Gln Leu Asp Ala Glu65 70 75 80Thr Gly Asn Leu Phe Leu Lys Glu Lys Leu Asp Arg Glu Leu Leu Cys85 90 95Gly Glu Thr Glu Pro Cys Val Leu Asn Phe Gln Ile Ile Leu Glu Asn100 105 110Pro Met Gln Phe Phe Gln Thr Glu Leu Gln Leu Thr Asp Ile Asn Asp115 120 125His Ser Pro Glu Phe Pro Asn Lys Lys Met Leu Leu Thr Ile Pro Glu130 135 140Ser Ala His Pro Gly Thr val Phe Pro Leu Lys Ala Ala Arg Asp Ser145 150 155 160Asp Ile Gly Ser Asn Ala Val Gln Asn Tyr Thr Val Asn Pro Asn Leu165 170 175His Phe His Val Val Thr His Ser Arg Thr Asp Gly Arg Lys Tyr Pro180 185 190Glu Leu Val Leu Asp Arg Ala Leu Asp Arg Glu Glu Gln Pro Glu Leu195 200 205Thr Leu Ile Leu Thr Ala Leu Asp Gly Gly Ala Pro Ser Arg Ser Gly210 215 220Thr Thr Thr Val His Ile Glu Val Val Asp Ile Asn Asp Asn Ser Pro225 230 235 240
Gln Phe Val Gln Ser Leu Tyr Lys Val Gln Val Pro Glu Asn Asn Pro245 250 255Leu Asn Ala Phe Val Val Thr Val Ser Ala Thr Asp Leu Asp Ala Gly260 265 270Val Tyr Gly Asn Val Thr Tyr Ser Leu Phe Gln Gly Tyr Gly Val Phe275 280 285Gln Pro Phe Val Ile Asp Glu Ile Thr Gly Glu Ile His Leu Ser Lys290 295 300Glu Leu Asp Phe Glu Glu Ile Ser Ash His Asn Ile Glu Ile Ala Ala305 310 315 320Thr Asp Gly Gly Gly Leu Ser Gly Lys Cys Thr Val Ala Val Gln Val325 330 335Leu Asp Val Asn Asp Asn Ala Pro Glu Leu Thr Ile Arg Lys Leu Thr340 345 350Val Leu Val Pro Glu Asn Ser Ala Glu Thr Val Val Ala Val Phe Ser355 360 365Val Ser Asp Ser Asp Ser Gly Asp Asn Gly Arg Met Val Cye Ser Ile370 375 380Pro Asn Asn Ile Pro Phe Leu Leu Lys Pro Thr Phe Glu Asn Tyr Tyr385 390 395 400Thr Leu Val Thr Glu Gly Pro Leu Asp Arg Glu Asn Arg Ala Glu Tyr405 410 415Asn Ile Thr Ile Thr Val Ser Asp Leu Gly Thr Pro Arg Leu Thr Thr420 425 430Gln His Thr Ile Thr Val Gln Val Ser Asp Ile Asn Asp Asn Ala Pro435 440 445Ala Phe Thr Gln Thr Ser Tyr Thr Met Phe Val His Glu Asn Asn Ser450 455 460Pro Ala Leu His Ile Gly Thr Ile Ser Ala Thr Asp Ser Asp Ser Gly465 470 475 480Ser Asn Ala His Ile Thr Tyr Ser Leu Leu Pro Pro Asp Asp Pro Gln485 490 495Leu Ala Leu Asp Ser Leu Ile Ser Ile Asn Val Asp Asn Gly Gln Leu500 505 510Phe Ala Leu Arg Ala Leu Asp Tyr Glu Ala Leu Gln Ser Phe Glu Phe515 520 525Tyr Val Gly Ala Thr Asp Gly Gly Ser Pro Ala Leu Ser Ser Gln Thr530 535 540Leu Val Arg Met Val Val Leu Asp Asp Asn Asp Asn Ala Pro Phe Val545 550 555 560Leu Tyr Pro Leu Gln Asn Ala Ser Ala Pro Cys Thr Glu Leu Leu Pro565 570 575
Arg Ala Ala Glu Pro Gly Tyr Leu Ile Thr Lys Val Val Ala Val Asp580 585 590Arg Asp Ser Gly Gln Asn Ala Trp Leu Ser Phe Gln Leu Leu Lys Ala595 600 605Thr Glu Pro Gly Leu Phe Ser Val Trp Ala His Asn Gly Glu Val Arg610 615 620Thr Thr Arg Leu Leu Ser Glu Arg Asp Ala Gln Lys His Lys Leu Leu625 630 635 640Leu Leu Val Lys Asp Asn Gly Asp Pro Leu Arg Ser Ala Asn Val Thr645 650 655Leu His Val Leu Val Val Asp Gly Phe Ser Gln Pro Tyr Leu Pro Leu660 665 670Ala Glu Val Ala Gln Asp Ser Met Gln Asp Asn Tyr Asp Val Leu Thr675 680 685Leu Tyr Leu Val Ile Ala Leu Ala Ser Val Ser Ser Leu Phe Leu Leu690 695 700Ser Val Val Leu Phe Val Gly Val Arg Leu Cys Arg Arg Ala Arg Glu705 710 715 720Ala Ser Leu Gly Asp Tyr Ser Val Pro Glu Gly His Phe Pro Ser His725 730 735Leu Val Asp Val Ser Gly Ala Gly Thr Leu Ser Gln Ser Tyr Gln Tyr740 745 750Glu Val Cye Leu Asn Gly Gly Thr Arg Thr Asn Glu Phe Asn Phe Leu755 760 765Lys Pro Leu Phe Pro Ile Leu Pro Thr Gln Ala Ala Ala Ala Glu Glu770 775 780Arg Glu Asn Ala Val Val His Asn Ser Val Gly Phe Tyr785 790 795(2)SEQ ID NO113的信息(i)序列特征(A)长度2347碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(xi)序列描述SEQ ID NO113AAAACACGGG GGAAATGACA GTAGCAAAGA ATCTGGACTA TGAAGAATGC TCATTGTATG60AAATGGAAAT ACAGGCTGAA GATGTGGGGG CGCTTCTGGG GAGGAGCAAA GTGCTAATTA 120TGGTAGAAGA TGTAAATGAC AATCGGCCAG AAGTGACCAT TACATCCTTG TTTAACCCGG 180TATTGGAAAA TTCTCTTCCC GGGACAGTAA TTGCCTTCTT GAATGTGCAT GACCGAGACT 240CTGGAAAGAA CGGCCAAGTT GTCTGTTACA CGCATGATAA CTTACCTTTT AAATTAGAAA 300AGTCAATAGA TAATTATTAT AGATTGGTGA CATGGAAATA TTTGGACCGA GAAAAAGTCT 360CCATCTACAA TATCACAGTG ATAGCCTCAG ATCTAGGAGC CCACTCTGTC ACTGAAACTT 420ACATTGCCCT GATTGTGGCA GACACTAATG ACAACCCTCC TCGTTTTCCT CACACCTCCT 480ACACAGCCTA TATTCCAGAG AACAACCTGA GGGGCGCCTC CATCTTCTCA CTGACTGCAC 540ATGATCCTGA CAGTCAGGAA AATGCACAGG TCACTTACTC TGTGTCTGAG GACACCATAC 600AGGGAGTGCC TTTGTCCTCT TATATCTCCA TCAACTCAGA TACTGGTGTC CTGTATGCAC 660TGCACTCTTT TGACTTCGAG AAGATACAAG ACTTGCAGCT ACTGGTTGTT GCCACTGACA 720GTGGAAGCCC ACCTCTCAGC AGCAATGTGT CATTGAGCTT GTTTGTGTTG GACCAGAACG 780ACAACGCACC TGAGATTCTA TATCCTAGCT TCCCCACAGA TGGCTCCACT GGTGTGGAAC 840TAGCACCCCG CTCTGCACAG CCTGGATACC TAGTGACCAA AGTGGTGGCA GTGGACAAAG 900ACTCAGGACA GAATGCTTGG CTGTCCTACC GTCTGCTGAA GGCCAGCGAA CCTGGGCTCT 960TCTCTGTAGG ACTTCACACG GGTGAGGTGC GTACAGCGAG GGCCCTGCTG GACAGAGATG 1020CTCTCAAACA GAATCTGGTG ATGGCCGTGC AGGACCATGG CCAACCCCCT CTCTCGGCCA 1080CTGTAACTCT CACTGTGGCA GTGGCTAACA GCATCCCTGA GGTGTTGGCT GACTTGAGCA 1140GCATTAGGAC CCCTGGGGTA CCAGAGGATT CTGATATCAC GCTCCACCTG GTGGTGGCAG 1200TGGCTGTGGT CTCCTGTGTC TTCCTTGTCT TTGTCATTGT CCTCCTAGCT CTCAGGCTTC 1260AGCGCTGGCA GAAGTCTCGC CAGCTCCAGG GCTCCAAAGG TGGATTGGCT CCTGCACCTC 1320CATCACATTT TGTGGGCATC GACGGGGTAC AGGCTTTTCT ACAAACCTAT TCTCATGAAG 1380TCTCGCTCAC TTCAGGCTCC CAGACAAGCC ACATTATCTT TCCTCAGCCC AACTATGCAG 1440ACATGCTCAT TAACCAAGAA GGCTGTGAGA AAAATGATTC CTTATTAACA TCCATAGATT 1500TTCATGAGAG TAACCGTGAA GATGCTTGCG CCCCGCAAGC CCCGCCCAAC ACTGACTGGC 1560GTTTCTCTCA AGCCCAGAGA CCCGGCACGA GCGGATCCCA AAATGGGGAT GAAACCGGCA 1620CCTGGCCCAA CAACCAGTTC GATACAGAGA TGCTGCAAGC CATGATCTTG GCCTCTGCCA 1680GTGAAGCCGC TGATGGGAGC TCCACTCTGG GAGGGGGCAC TGGCACTATG GGTTTGAGCG 1740CTCGATATGG ACCCCAGTTT ACCCTGCAGC ACGTGCCTGA CTACCGCCAG AACGTGTACA 1800TCCCTGGCAG CAATGCCACA CTGACCAACG CAGCTGGCAA ACGAGATGGC AAGGCTCCGG 1860CAGGCGGCAA TGGCAACAAC AACAAGTCGG GCAAGAAAGA GAAGAAGTAA TATGGAGGCC 1920AGGCCTTGAG CCACAGGGCA GCCTCCCTCC CCAGCCAGTC CAGCTTGTCC TTACTTGTAC 1980CCAGGCCTCA GAATTTCAGG GCTCACCCCA GGATTCTGGT AGGAGCCACA GCCAGGCCAT 2040GCTCCCCGTT GGGAAACAGA AACAAGTGCC CAAGCCAACA CCCCCTCTTT GTACCCTAGG 2100GGGGTTGAAT ATGCAAAGAG AGTTCTGCTG GGACCCCCTA TCCAATCAGT GATTGTACCC 2160ACATAGGTAG CAGGGTTAGT GTGGATACAC ACACACACAC ACACACACAC ACACACACAA 2220CCCTTGTCCT CCGCAGTGCC TGCCACTTTC TGGGACTTTC TCATCCCCCT ACGCCCTTCC 2280TTTATCCTCT CCCACCCAGA CACAGCTGCT GGAGAATAAA TTTGGGGATG CTGATGCTAA 2340AAAAAAA2347(2)SEQ ID NO114的信息(i)序列特征(A)长度2972碱基对(B)类型核酸(C)链型单股(D)拓扑结构线性(ii)分子类型cDNA(ix)特征(A)名称/键CDS(B)定位2..1849(xi)序列描述SEQ ID NOl14A GAG GCT GCT CAC CAC CTG GTC CTC ACG GCC TCG GAT GGC GGC AAG 46Glu Ala Ala His His Leu Val Leu Thr Ala Ser Asp Gly Gly Lys1 5 10 15CCG CCT CGC TCT AGC ACA GTG CGC ATC CAC CTG ACA GTG TTG GAT ACA94Pro Pro Arg Ser Ser Thr Val Arg Ile His Val Thr Val Leu Asp Thr20 25 30AAT GAC AAT GCC CCG GTT TTT CCT CAC CCG ATT TAC CGA GTG AAA GTC 142Asn Asp Asn Ala Pro Val Phe Pro His Pro Ile Tyr Arg Val Lys Val35 40 45CTT GAG AAC ATG CCC CCA GGC ACG CGG CTG CTT ACT GTA ACA GCC AGC 190Leu Glu Asn Met Pro Pro Gly Thr Arg Leu Leu Thr Val Thr Ala Ser50 55 60GAC CCG GAT GAG GGA ATC AAC GGA AAA GTG GCA TAC AAA TTC CGG AAA 238Asp Pro Asp Glu Gly Ile Asn Gly Lys Val Ala Tyr Lys Phe Arg Lys65 70 75ATT AAT GAA AAA CAA ACT CCG TTA TTC CAG CTT AAT GAA AAT ACT GGG 286Ile Asn Glu Lys Gln Thr Pro Leu Phe Gln Leu Asn Glu Asn Thr Gly80 85 90 95GAA ATA TCA ATA GCA AAA AGT CTA GAT TAT GAA GAA TGT TCA TTT TAT 334Glu Ile Ser Ile Ala Lys Ser Leu Asp Tyr Glu Glu Cys Ser Phe Tyr100 105 110GAA ATG GAA ATA CAA GCC GAA GAT GTG GGG GCA CTT CTG GGG AGG ACC 382Glu Met Glu Ile Gln Ala Glu Asp Val Gly Ala Leu Leu Gly Arg Thr115 120 125AAA TTG CTC ATT TCT GTG GAA GAT GTA AAT GAC AAT AGA CCA GAA GTG 430Lys Leu Leu Ile Ser Val Glu Asp Val Asn Asp Asn Arg Pro Glu Val130 135 140ATC ATT ACG TCT TTG TTT AGC CCA GTG TTA GAA AAT TCT CTT CCC GGG 478Ile Ile Thr Ser Leu Phe Ser Pro Val Leu Glu Asn Ser Leu Pro Gly145 150 155ACA GTA ATT GCC TTC TTG AGT GTG CAT GAC CAA GAC TCT GGA AAG AAT 526Thr Val Ile Ala Phe Leu Ser Val His Asp Gln Asp Ser Gly Lys Asn160 165 170 175GGT CAA GTT GTC TGT TAC ACA CGT GAT AAT TTA CCT TTT AAA TTA GAA 574Gly Gln Val Val Cys Tyr Thr Arg Asp Asn Leu Pro Phe Lys Leu Glu180 185 190AAG TCA ATA GGT AAT TAT TAT AGA TTA GTG ACA AGG AAA TAT TTG GAC 622Lys Ser Ile Gly Asn Tyr Tyr Arg Leu Val Thr Arg Lys Tyr Leu Asp195 200 205CGA GAA AAT GTC TCT ATC TAC AAT ATC ACA GTG ATG GCC TCA GAT CTA 670Arg Glu Asn Val Ser Ile Tyr Asn Ile Thr Val Met Ala Ser Asp Leu210 215 220GGA ACA CCA CCT CTG TCC ACT GAA ACT CAA ATC GCT CTG CAC GTG GCA 718Gly Thr Pro Pro Leu Ser Thr Glu Thr Gln Ile Ala Leu His Val Ala225 230 235GAC ATT AAC GAC AAC CCT CCT ACT TTC CCT CAT GCC TCC TAC TCA GCG 766Asp Ile Asn Asp Asn Pro Pro Thr Phe Pro His Ala Ser Tyr Ser Ala240 245 250 255TAT ATC CTA GAG AAC AAC CTG AGA GGA GCC TCC ATC TTT TCC TTG ACT 814Tyr Ile Leu Glu Asn Asn Leu Arg Gly Ala Ser Ile Phe Ser Leu Thr260 265 270GCA CAC GAC CCC GAC AGC CAG GAG AAT GCC CAG GTC ACT TAC TCT GTG 862Ala His Asp Pro Asp Ser Gln Glu Asn Ala Gln Val Thr Tyr Ser Val275 280 285ACC GAG GAC ACG CTG C AG GGG GCG CCC CTG TCC TCG TAT ATC TCC ATC 910Thr Glu Asp Thr Leu Gln Gly Ala Pro Leu Ser Ser Tyr Ile Ser Ile290 295 300AAC TCT GAC ACC GGT GTC CTG TAT GCG CTG CAA TCT TTC GAC TAT GAG 958Asn Ser Asp Thr Gly Val Leu Tyr Ala Leu Gln Ser Phe Asp Tyr Glu305 310 315CAG ATC CGA GAC CTG CAG CTA CTG GTA ACA GCC AGC GAC AGC GGG GAC 1006Gln Ile Arg Asp Leu Gln Leu Leu Val Thr Ala Ser Asp Ser Gly Asp320 325 330 335CCG CCC CTC AGC AGC AAC ATG TCA CTG AGC CTG TTC GTG CTG GAC CAG 1054Pro Pro Leu Ser Ser Asn Met Ser Leu Ser Leu Phe Val Leu Asp Gln340 345 350AAT GAC AAC GCG CCC GAG ATC CTG TAC CCC GCC CTC CCC ACA GAC GGT 1102Asn Asp Asn Ala Pro Glu Ile Leu Tyr Pro Ala Leu Pro Thr Asp Gly355 360 365TCC ACT GGC GTG GAG CTG GCG CCC CGC TCC GCA GAG CGT GGC TAC CTG 1150Ser Thr Gly Val Glu Leu Ala Pro Arg Ser Ala Glu Arg Gly Tyr Leu370 375 380GTG ACC AAG GTG GTG GCG GTG GAC AGA GAC TCG GGC CAG AAC GCC TGG 1198Val Thr Lys Val Val Ala Val Asp Arg Asp Ser Gly Gln Asn Ala Trp385 390 395CTG TCC TAC CGC CTG CTC AAG GCC AGC GAG CCG GGA CTC TTC TCG GTG 1246Leu Ser Tyr Arg Leu Leu Lys Ala Ser Glu Pro Gly Leu Phe Ser Val400 405 410 415GGT CTG CAC ACG GGC GAG GTG CGC ACG GCG CGA GCC CTG CTG GAC AGA 1294Gly Leu His Thr Gly Glu Val Arg Thr Ala Arg Ala Leu Leu Asp Arg420 425 430GAC GCG CTC AAG CAG AGC CTC GTG GTG GCC GTC CAG GAC CAT GGC CAG 1342Asp Ala Leu Lys Gln Ser Leu Val Val Ala Val Gln Asp His Gly Gln435 440 445CCC CCT CTC TCC GCC ACT GTC ACG CTC ACC GTA GCC GTG GCT GAC AGC 1390Pro Pro Leu Ser Ala Thr Val Thr Leu Thr Val Ala Val Ala Asp Ser450 455 460ATC CCC GAA GTC CTG ACC GAG TTG GGC AGT CTG AAG CCT TCG GTC GAC 1438Ile Pro Glu Val Leu Thr Glu Leu Gly Ser Leu Lys Pro Ser Val Asp465 470 475CCG AAC GAT TCG AGC CTT ACA CTC TAT CTC GTG GTG GCA GTG GCT GCC 1486Pro Asn Asp Ser Ser Leu Thr Leu Tyr Leu Val Val Ala Val Ala Ala480 485 490 495ATC TCC TGT GTC TTC CTC GCC TTT GTC GCT GTG CTT CTG GGG CTC AGG 1534Ile Ser Cys Val Phe Leu Ala Phe Val Ala Val Leu Leu Gly Leu Arg500 505 510CTG AGG CGC TGG CAC AAG TCA CGC CTG CTC CAG GAT TCC GGT GGC AGA 1582Leu Arg Arg Trp His Lys Ser Arg Leu Leu Gln Asp Ser Gly Gly Arg515 520 525TTG GTA GGC GTG CCT GCC TCA CAT TTT GTG GGT GTT GAG GAG GTA CAG1630Leu Val Gly Val Pro Ala Ser His Phe Val Gly Val Glu Glu Val Gln530 535 540GCT TTC CTG CAG ACC TAT TCC CAG GAA GTC TCC CTC ACC GCC GAC TCG1678Ala Phe Leu Gln Thr Tyr Ser Gln Glu Val Ser Leu Thr Ala Asp Ser545 550 555CGG AAG AGT CAC CTG ATC TTT CCC CAG CCC AAC TAC GCA GAC ATG CTC1726Arg Lys Ser His Leu Ile Phe Pro Gln Pro Asn Tyr Ala Asp Met Leu560 565 570 575ATC AGT CAG GAG GGC TGT GAG AAA AAT GAT TCT TTG TTA ACA TCC GTA1774Ile Ser Gln Glu Gly Cys Glu Lye Asn Asp Ser Leu Leu Thr Ser Val580 585 590GAT TTT CAT GAA TAT AAG AAT GAA GCT GAT CAT GGT CAG GTG AGT TTA1822Asp Phe His Glu Tyr Lys Asn Glu Ala Asp His Gly Gln Val Ser Leu595 600 605GTT CTT TGC TTG CTT TTA ATT TCC AGA TGAATTTTAT TTGGCATAAA 1869Val Leu Cys Leu Leu Leu Ile Ser Arg610 615TTATGTTTTG AAAAACATTG TGAAGATAGT TGAAAATAAT TTTTAAGGTG TATCACAGAG 1929TTTTGGGTTT ATTTTGGTGG TGTTACCAAA AAATTGAACT CTAATAGTCA TAGGTTATTG 1989TTTCATTTGC TTTTAAACGA CTTGGAAAAG ATTGTTCCAC CATTTTAAAC CTTCCAGTAT 2049TTTATTCCTA TTATCACTCA TTCACTTAAG AAGTAGCTAC CCGTCCATAC TGGTAATTTT 2109GCTATTGTTT GTTTGTGTGT GTGTGTGTGT GTGTGTGTGT GTGTGTGTAT CCCAAACTAG 2169AACTTCAGAA AATTATCAAG AAGTCTAAAG CCTTGTTATT AGCTTAGCAA AAGTAAAATA 2229TATCTCAGAA TTTTTAGGGT TATGTTTAGC ATTTGAACCT GTAACTAGGC TCTTGTATAT 2289TTCTTCACTT TAAACCTCTT TTCTGAGCCC TGTTTCTGTA CCAGTGCCCT TCAAAACTTT 2349AATACTTCTT ACCATCCTTC AAAACATGAA CAAACTTTAA AGATGGATCT TGGTGGGAGA 2409TGAGACTGGT TACTAAATAT TAAGTATGTG AGTCAGTGGT CACCTGGGCT CCATCCCCAT 2469GGAGACATGA AATCTAAAGC CTAGAATGTC CATTGCTCCC CCAAACAAAA AACAAAAGCA 2529AAAACATTAG ATCTGAATTA AAATGTAATT TTAAACTGTT GAAAGTGACT TTTGTAAAAT 2589ATGTAAGAAC ATATTTCAAT ACAATTCCAA TTAGCTGTTT CGGTTGTGCA TTGATGTGAA 2649GTGGTGAGAA TGTTGATATT AAGAACCAAT GTTTCAGGTA CACAAGTTCT AAATAAGCTG 2709ATCAATTCAA TTAAAGTTAT TCAGTCTTGG CTGGACACAG TGCCTCATGT CTGAAATCCC 2769AGCACTTTGG GAGGCTGGGG CAGGAGGACC GCTTGAGCCC CGGGGGTTTG AAACTGCAGT 2829GAGCTATGAT CATGCCACTG CACTCCAGCC TAGGTGGCAG AACTAGACCC TGTCTCTAAA 2889AAAACTATTA TTAGGCCGCG TGCGGTGGCT CACGCCTGTA ATCCCAGCAC TTTGGGAGAC 2949TGAGGTGGGT GGATCACCTG AGC 2972(2)SEQ ID NO115的信息(i)序列特征(A)长度616氨基酸(B)类型氨基酸(D)拓扑结构线性(ii)分子类型蛋白质(xi)序列描述SEQ ID NO115Glu Ala Ala His His Leu Val Leu Thr Ala Ser Asp Gly Gly Lys Pro1 5 10 15Pro Arg Ser Ser Thr Val Arg Ile His Val Thr Val Leu Asp Thr Asn20 25 30Asp Asn Ala Pro Val Phe Pro His Pro Ile Tyr Arg Val Lys Val Leu35 40 45Glu Asn Met Pro Pro Gly Thr Arg Leu Leu Thr Val Thr Ala Ser Asp50 55 60Pro Asp Glu Gly Ile Asn Gly Lys Val Ala Tyr Lys Phe Arg Lye Ile65 70 75 80Asn Glu Lys Gln Thr Pro Leu Phe Gln Leu Asn Glu Asn Thr Gly Glu85 90 95Ile Ser Ile Ala Lys Ser Leu Asp Tyr Glu Glu Cys Ser Phe Tyr Glu100 105 110Met Glu Ile Gln Ala Glu Asp Val Gly Ala Leu Leu Gly Arg Thr Lys115 120 125Leu Leu Ile Ser Val Glu Asp Val Asn Asp Asn Arg Pro Glu Val Ile130 135 140Ile Thr Ser Leu Phe Ser Pro Val Leu Glu Asn Ser Leu Pro Gly Thr145 150 155 160Val Ile Ala Phe Leu Ser Val His Asp Gln Asp Ser Gly Lys Asn Gly165 170 175Gln Val Val Cys Tyr Thr Arg Asp Asn Leu Pro Phe Lys Leu Glu Lys180 185 190Ser Ile Gly Asn Tyr Tyr Arg Leu Val Thr Arg Lys Tyr Leu Asp Arg195 200 205Glu Asn Val Ser Ile Tyr Asn Ile Thr Val Met Ala Ser Asp Leu Gly210 215 220Thr Pro Pro Leu Ser Thr Glu Thr Gln Ile Ala Leu His Val Ala Asp225 230 235 240
Ile Asn Asp Asn Pro Pro Thr Phe Pro His Ala Ser Tyr Ser Ala Tyr245 250 255Ile Leu Glu Aan Asn Leu Arg Gly Ala Ser Ile Phe Ser Leu Thr Ala260 265 270His Asp Pro Asp Ser Gln Glu Asn Ala Gln Val Thr Tyr Ser Val Thr275 280 28SGlu Asp Thr Leu Gln Gly Ala Pro Leu Ser Ser Tyr Ile Ser Ile Asn290 295 300Ser Asp Thr Gly Val Leu Tyr Ala Leu Gln Ser Phe Asp Tyr Glu Gln305 310 315 320Ile Arg Asp Leu Gln Leu Leu Val Thr Ala Ser Asp Ser Gly Asp Pro325 330 335Pro Leu Ser Ser Asn Met Ser Leu Ser Leu Phe Val Leu Asp Gln Asn340 345 350Asp Asn Ala Pro Glu Ile Leu Tyr Pro Ala Leu Pro Thr Asp Gly Ser355 360 365Thr Gly Val Glu Leu Ala Pro Arg Ser Ala Glu Arg Gly Tyr Leu Val370 375 380Thr Lys Val Val Ala Val Asp Arg Asp Ser Gly Gln Asn Ala Trp Leu385 390 395 400Ser Tyr Arg Leu Leu Lys Ala Ser Glu Pro Gly Leu Phe Ser Val Gly405 410 415Leu His Thr Gly Glu Val Arg Thr Ala Arg Ala Leu Leu Asp Arg Asp420 425 430Ala Leu Lys Gln Ser Leu Val Val Ala Val Gln Asp His Gly Gln Pro435 440 445Pro Leu Ser Ala Thr Val Thr Leu Thr Val Ala Val Ala Asp Ser Ile450 455 460Pro Glu Val Leu Thr Glu Leu Gly Ser Leu Lys Pro Ser Val Asp Pro465 470 475 480Asn Asp Ser Ser Leu Thr Leu Tyr Leu Val Val Ala Val Ala Ala Ile485 490 495Ser Cys Val Phe Leu Ala Phe Val Ala Val Leu Leu Gly Leu Arg Leu500 505 510Arg Arg Trp His Lys Ser Arg Leu Leu Gln Asp Ser Gly Gly Arg Leu515 520 525Val Gly Val Pro Ala Ser His Phe Val Gly Val Glu Glu Val Gln Ala530 535 540
Phe Leu Gln Thr Tyr Ser Gln Glu Val Ser Leu Thr Ala Asp Ser Arg545 550 555 560Lys Ser Hia Leu Ile Phe Pro Gln Pro Asn Tyr Ala Asp Met Leu Ile565 570 575Ser Gln Glu Gly Cys Glu Lys Asn Asp Ser Leu Leu Thr Ser Val Asp580 585 590Phe His Glu Tyr Lys Asn Glu Ala Asp His Gly Gln Val Ser Leu Val595 600 605Leu Cys Leu Leu Leu Ile Ser Arg610 61权利要求
1.纯化并分离的编码人原钙粘着蛋白pc3的多核苷酸序列。
2.纯化并分离的编码人原钙粘着蛋白pc4的多核苷酸序列。
3.纯化并分离的编码大鼠原钙粘着蛋白pc5的多核苷酸序列。
4.根据权利要求1、2或3的多核苷酸序列,其为DNA序列。
5.根据权利要求4的DNA序列,其为cDNA序列。
6.根据权利要求4的DNA序列,其为基因组DNA序列。
7.根据权利要求4的DNA序列,其为完全或部分地化学合成的DNA序列。
8.根据权利要求1的多核苷酸序列,其包含SEQ ID NO109的人原钙粘着蛋白pc3编码序列。
9.根据权利要求2的多核苷酸序列,其包括SEQ ID NO111的人原钙粘着蛋白pc4编码序列。
10.根据权利要求3的多核苷酸序列,其包括SEQ ID NO114的大鼠原钙粘着蛋白pc5编码序列。
11.包含根据权利要求4之DNA序列的生物学功能性DNA载体。
12.根据权利要求11的载体,其中所说的DNA序列可操作地连接到表达控制DNA序列上。
13.以允许在所说的宿主细胞中表达原钙粘着蛋白多肽的方式用根据权利要求4的DNA序列转化或转染的宿主细胞。
14.生产原钙粘着蛋白多肽的方法,其包括在适当的营养培养基中培养根据权利要求13的宿主细胞以及从所说的细胞中或从其赖以生长的培养基中分离原钙粘着蛋白多肽。
15.纯化并分离的人原钙粘着蛋白pc3多肽。
16.纯化并分离的人原钙粘着蛋白pc4多肽。
17.纯化并分离的大鼠原钙粘着蛋白pc5多肽。
18.对人原钙粘着蛋白pc3特异的抗体物质。
19.对人原钙粘着蛋白pc4特异的抗体物质。
20.对大鼠原钙粘着蛋白pc5特异的抗体物质。
21.根据权利要求18、19或20的抗体物质,其为单克隆抗体。
22.产生根据权利要求21的单克隆抗体的杂交瘤细胞系。
23.调节人原钙粘着蛋白pc3的结合活性的方法,其包括使所说的原钙粘着蛋白与根据权利要求18的对所说的原钙粘着蛋白特异的抗体物质接触。
24.调节人原钙粘着蛋白pc3的结合活性的方法,其包括使所说的原钙粘着蛋白与所说原钙粘着蛋白的肽配体接触。
25.调节人原钙粘着蛋白pc4的结合活性的方法,其包括使所说的原钙粘着蛋白与根据权利要求19的对所说的原钙粘着蛋白特异的抗体物质接触。
26.调节人原钙粘着蛋白pc4的结合活性的方法,其包括使所说的原钙粘着蛋白与所说原钙粘着蛋白的肽配体接触。
27.调节大鼠原钙粘着蛋白pc5的结合活性的方法,其包括使所说的原钙粘着蛋白与根据权利要求20对所说的原钙粘着蛋白特异的抗体物质接触。
28.调节大鼠原钙粘着蛋白pc5的结合活性的方法,其包括使所说的原钙粘着蛋白与所说原钙粘着蛋白的肽配体接触。
全文摘要
本发明提供了编码称为原钙粘着蛋白的钙粘着蛋白样多肽的多核苷酸序列及其变异体,并提供了重组生产它们的方法和材料。本发明还公开了对原钙粘着蛋白特异的抗体物质,这些抗体物质可用于调节原钙粘着蛋白的天然结合和/或调节活性。
文档编号C07K16/00GK1134172SQ95190790
公开日1996年10月23日 申请日期1995年6月26日 优先权日1994年6月27日
发明者S·铃木 申请人:多亨尼眼科研究院
网友询问留言 已有0条留言
  • 还没有人留言评论。精彩留言会获得点赞!
1