具有抑癌功能的新的人蛋白及其编码序列的制作方法

文档序号:3486341阅读:307来源:国知局
专利名称:具有抑癌功能的新的人蛋白及其编码序列的制作方法
技术领域
本发明属于生物技术领域,具体地说,本发明涉及新的编码具有抑癌功能的人蛋白的多核苷酸和此多核苷酸编码的多肽。本发明还涉及此多核苷酸和多肽的用途和制备。
背景技术
人基因组学研究目前是国际上的热点,除人染色体DNA大规模测序,表达序列测序(EST)的方法外,还缺少从功能开始的筛选具有功能基因的高通量的方法。
癌症是危害人类健康的主要疾病之一。为了有效地治疗和预防肿瘤,目前人们已越来越关注肿瘤的基因治疗。因此,本领域迫切需要开发研究具有抑癌功能的人蛋白及其激动剂/抑制剂。

发明内容
本发明的目的是提供一类新的具有抑癌功能的人蛋白多肽以及其片段、类似物和衍生物。
本发明的另一目的是提供编码这些多肽的多核苷酸。
本发明的另一目的是提供生产这些多肽的方法以及该多肽和编码序列的用途。
在本发明的第一方面,提供新颖的分离出的具有抑癌功能的蛋白多肽,它包含具有选自下组的氨基酸序列的多肽SEQ ID NO3、6、9、12、15、18、21、24;或其保守性变异多肽、或其活性片段、或其活性衍生物。
较佳地,该多肽是具有选自下组的氨基酸序列的多肽SEQ ID NO3、6、9、12、15、18、21、24。
在本发明的第二方面,提供了一种分离的多核苷酸,它包含一核苷酸序列,该核苷酸序列与选自下组的一种核苷酸序列有至少85%相同性(a)编码上述的具有抑癌功能的蛋白多肽的多核苷酸;(b)与多核苷酸(a)互补的多核苷酸。较佳地,该多核苷酸编码的多肽具有选自下组的氨基酸序列SEQ ID NO3、6、9、12、15、18、21、24。更佳地,该多核苷酸的序列选自下组SEQ ID NO2、5、8、11、14、17、20、23的编码区序列或全长序列。
在本发明的第三方面,提供了含有上述多核苷酸的载体,以及被该载体转化或转导的宿主细胞或者被上述多核苷酸直接转化或转导的宿主细胞。
在本发明的第四方面,提供了制备具有抑癌功能的蛋白活性的多肽的制备方法,该方法包含(a)在适合表达具有抑癌功能的蛋白的条件下,培养上述被转化或转导的宿主细胞;(b)从培养物中分离出具有抑癌功能的蛋白活性的多肽。
在本发明的第五方面,提供了与上述的具有抑癌功能的蛋白多肽特异性结合的抗体。还提供了可用于检测的核酸分子,它含有上述的多核苷酸中连续10个核苷酸至全长核苷酸,较佳地它含有连续的约10-800个核苷酸。
在本发明的第六方面,提供了一种药物组合物,它含有安全有效量的本发明的具有抑癌功能的蛋白多肽以及药学上可接受的载体。这些药物组合物可治疗癌症以及细胞异常增殖等病症。
本发明的其它方面由于本文的公开内容,对本领域的技术人员而言是显而易见的。
具体实施例方式
3T3细胞是一种小鼠成纤维细胞(J.Cell.Biol.,17299,1963)(也称为NIH/3T3细胞)。在癌症研究领域中,常将外源基因(尤其是人基因)引入3T3细胞,观察其对3T3细胞生长的影响情况。通常认为,对3T3细胞生长有影响的基因是癌症相关基因,其中对3T3细胞生长有抑制作用的基因大多是抑癌基因,而对3T3细胞生长有促进作用的基因大多是(原)癌基因。
本发明采用大规模cDNA克隆转染小鼠胚胎成纤维细胞,在获得具有抑癌作用的基础上,经测序证明为新的基因,进一步得到全长cDNA克隆。DNA转染试验证明,本发明的具有抑癌功能的蛋白对3T3细胞具有抑制克隆形成的作用,其抑制率≥50%。
如本文所用,“分离的”是指物质从其原始环境中分离出来(如果是天然的物质,原始环境即是天然环境)。如活体细胞内的天然状态下的多聚核苷酸和多肽是没有分离纯化的,但同样的多聚核苷酸或多肽如从天然状态中同存在的其他物质中分开,则为分离纯化的。
如本文所用,“分离的具有抑癌功能的蛋白或多肽”是指具有抑癌功能的蛋白多肽基本上不含天然与其相关的其它蛋白、脂类、糖类或其它物质。本领域的技术人员能用标准的蛋白质纯化技术纯化具有抑癌功能的蛋白。基本上纯的多肽在非还原聚丙烯酰胺凝胶上能产生单一的主带。
本发明的多肽可以是重组多肽、天然多肽、合成多肽,优选重组多肽。本发明的多肽可以是天然纯化的产物,或是化学合成的产物,或使用重组技术从原核或真核宿主(例如,细菌、酵母、高等植物、昆虫和哺乳动物细胞)中产生。根据重组生产方案所用的宿主,本发明的多肽可以是糖基化的,或可以是非糖基化的。本发明的多肽还可包括或不包括起始的甲硫氨酸残基。
本发明还包括具有抑癌功能的人蛋白的片段、衍生物和类似物。如本文所用,术语“片段”、“衍生物”和“类似物”是指基本上保持本发明的天然具有抑癌功能的人蛋白相同的生物学功能或活性的多肽。本发明的多肽片段、衍生物或类似物可以是(i)有一个或多个保守或非保守性氨基酸残基(优选保守性氨基酸残基)被取代的多肽,而这样的取代的氨基酸残基可以是也可以不是由遗传密码编码的,或(ii)在一个或多个氨基酸残基中具有取代基团的多肽,或(iii)成熟多肽与另一个化合物(比如延长多肽半衰期的化合物,例如聚乙二醇)融合所形成的多肽,或(iv)附加的氨基酸序列融合到此多肽序列而形成的多肽(如前导序列或分泌序列或用来纯化此多肽的序列或蛋白原序列)。根据本文的教导,这些片段、衍生物和类似物属于本领域熟练技术人员公知的范围。
本发明的多核苷酸可以是DNA形式或RNA形式。DNA形式包括cDNA、基因组DNA或人工合成的DNA。DNA可以是单链的或是双链的。DNA可以是编码链或非编码链。以PP11303蛋白(在本申请中,蛋白质的命名采用其克隆编号)为例,编码成熟多肽的编码区序列可以与SEQ ID NO2所示的编码区序列相同或者是简并的变异体。如本文所用,“简并的变异体”对于PP11303而言是指编码具有SEQ ID NO3的蛋白质,但与SEQ ID NO2所示的编码区序列有差别的核酸序列。再以PP12899蛋白为例,编码成熟多肽的编码区序列可以与SEQ ID NO5所示的编码区序列相同或者是简并的变异体;“简并的变异体”对于PP12899而言是指编码具有SEQ ID NO6的蛋白质,但与SEQ ID NO5所示的编码区序列有差别的核酸序列。对于本发明的其他具有抑癌功能的蛋白,可依此类推。
编码成熟多肽的多核苷酸包括只编码成熟多肽的编码序列;成熟多肽的编码序列和各种附加编码序列;成熟多肽的编码序列(和任选的附加编码序列)以及非编码序列。
术语“编码多肽的多核苷酸”可以是包括编码此多肽的多核苷酸,也可以是还包括附加编码和/或非编码序列的多核苷酸。
本发明还涉及上述多核苷酸的变异体,其编码与本发明有相同的氨基酸序列的多肽或多肽的片段、类似物和衍生物。此多核苷酸的变异体可以是天然发生的等位变异体或非天然发生的变异体。这些核苷酸变异体包括取代变异体、缺失变异体和插入变异体。如本领域所知的,等位变异体是一个多核苷酸的替换形式,它可能是一个或多个核苷酸的取代、缺失或插入,但不会从实质上改变其编码的多肽的功能。
本发明还涉及与上述的序列杂交且两个序列之间具有至少50%,较佳地至少70%,更佳地至少80%相同性的多核苷酸。本发明特别涉及在严格条件下与本发明所述多核苷酸可杂交的多核苷酸。在本发明中,“严格条件”是指(1)在较低离子强度和较高温度下的杂交和洗脱,如0.2×SSC,0.1%SDS,60℃;或(2)杂交时加有变性剂,如50%(v/v)甲酰胺,0.1%小牛血清/0.1%Ficoll,42℃等;或(3)仅在两条序列之间的相同性至少在95%以上,更好是97%以上时才发生杂交。并且,可杂交的多核苷酸编码的多肽与SEQ IDNO3所示的成熟多肽有相同的生物学功能(以PP11303蛋白为例)和活性。
本发明还涉及与上述的序列杂交的核酸片段。如本文所用,“核酸片段”的长度至少含15个核苷酸,较好是至少30个核苷酸,更好是至少50个核苷酸,最好是至少100个核苷酸以上。核酸片段可用于核酸的扩增技术(如PCR)以确定和/或分离编码具有抑癌功能的蛋白的多聚核苷酸。
本发明中的多肽和多核苷酸优选以分离的形式提供,更佳地被纯化至均质。
本发明的DNA序列能用几种方法获得。例如,用本领域熟知的杂交技术分离DNA。这些技术包括但不局限于1)用探针与基因组或cDNA文库杂交以检出同源性核苷酸序列,和2)表达文库的抗体筛选以检出具有共同结构特征的克隆的DNA片段。
编码具有抑癌功能的蛋白的特异DNA片段序列产生也能用下列方法获得1)从基因组DNA分离双链DNA序列;2)化学合成DNA序列以获得所需多肽的双链DNA。
当需要的多肽产物的整个氨基酸序列已知时,DNA序列的直接化学合成是经常选用的方法。如果所需的氨基酸的整个序列不清楚时,DNA序列的直接化学合成是不可能的,选用的方法是cDNA序列的分离。分离感兴趣的cDNA的标准方法是从高表达该基因的供体细胞分离mRNA并进行逆转录,形成质粒或噬菌体cDNA文库。提取mRNA的方法已有多种成熟的技术,试剂盒也可从商业途径获得(Qiagene)。而构建cDNA文库也是通常的方法(Sambrook,et al.,Molecular Cloning,A Laboratory Manual,Cold SpringHarbor Laboratory.New York,1989)。还可得到商业供应的cDNA文库,如Clontech公司的不同cDNA文库。当结合使用聚合酶反应技术时,即使极少的表达产物也能克隆。
可用常规方法从这些cDNA文库中筛选本发明的基因。这些方法包括(但不限于)(1)DNA-DNA或DNA-RNA杂交;(2)标志基因的功能出现或丧失;(3)测定具有抑癌功能的蛋白的转录本的水平;(4)通过免疫学技术或测定生物学活性,来检测基因表达的蛋白产物。上述方法可单用,也可多种方法联合应用。
在第(1)种方法中,杂交所用的探针是与本发明的多核苷酸的任何一部分同源,其长度至少15个核苷酸,较好是至少30个核苷酸,更好是至少50个核苷酸,最好是至少100个核苷酸。此外,探针的长度通常在2kb之内,较佳地为1kb之内。此处所用的探针通常是在本发明的基因DNA序列信息的基础上化学合成的DNA序列。本发明的基因本身或者片段当然可以用作探针。DNA探针的标记可用放射性同位素,荧光素或酶(如碱性磷酸酶)等。
在第(4)种方法中,检测具有抑癌功能的蛋白基因表达的蛋白产物可用免疫学技术如Western印迹法,放射免疫沉淀法,酶联免疫吸附法(ELISA)等。
应用PCR技术扩增DNA/RNA的方法(Saiki,et al.Science 1985;2301350-1354)被优选用于获得本发明的基因。特别是很难从文库中得到全长的cDNA时,可优选使用RACE法(RACE-cDNA末端快速扩增法),用于PCR的引物可根据本文所公开的本发明的序列信息适当地选择,并可用常规方法合成。可用常规方法如通过凝胶电泳分离和纯化扩增的DNA/RNA片段。
如上所述得到的本发明的基因,或者各种DNA片段等的核苷酸序列的测定可用常规方法如双脱氧链终止法(Sanger et al.PNAS,1977,745463-5467)。这类核苷酸序列测定也可用商业测序试剂盒等。为了获得全长的cDNA序列,测序需反复进行。有时需要测定多个克隆的cDNA序列,才能拼接成全长的cDNA序列。
本发明也涉及包含本发明多核苷酸的载体,以及用本发明载体或具有抑癌功能的蛋白编码序列经基因工程产生的宿主细胞,以及经重组技术产生本发明所述多肽的方法。
通过常规的重组DNA技术(Science,19842241431),可利用本发明的多聚核苷酸序列可用来表达或生产重组的具有抑癌功能的蛋白多肽。一般来说有以下步骤(1).用本发明的编码具有抑癌功能的人蛋白的多核苷酸(或变异体),或用含有该多核苷酸的重组表达载体转化或转导合适的宿主细胞;(2).在合适的培养基中培养的宿主细胞;(3).从培养基或细胞中分离、纯化蛋白质。
本发明中,具有抑癌功能的人蛋白多核苷酸序列可插入到重组表达载体中。术语“重组表达载体”指本领域熟知的细菌质粒、噬菌体、酵母质粒、植物细胞病毒、哺乳动物细胞病毒如腺病毒、逆转录病毒或其他载体。在本发明中适用的载体包括但不限于在细菌中表达的基于T7的表达载体(Rosenberg,et al.Gene,1987,56125);在哺乳动物细胞中表达的pMSXND表达载体(Lee and Nathans,J Bio Chem.2633521,1988)和在昆虫细胞中表达的来源于杆状病毒的载体。总之,只要能在宿主体内复制和稳定,任何质粒和载体都可以用。表达载体的一个重要特征是通常含有复制起点、启动子、标记基因和翻译控制元件。
本领域的技术人员熟知的方法能用于构建含具有抑癌功能的人蛋白编码DNA序列和合适的转录/翻译控制信号的表达载体。这些方法包括体外重组DNA技术、DNA合成技术、体内重组技术等(Sambroook,et al.)。所述的DNA序列可有效连接到表达载体中的适当启动子上,以指导mRNA合成。这些启动子的代表性例子有大肠杆菌的lac或trp启动子;λ噬菌体PL启动子;真核启动子包括CMV立即早期启动子、早期和晚期SV40启动子、反转录病毒的LTRs和其他一些已知的可控制基因在原核或真核细胞或其病毒中表达的启动子。表达载体还包括翻译起始用的核糖体结合位点和转录终止子。
此外,表达载体优选地包含一个或多个选择性标记基因,以提供用于选择转化的宿主细胞的表型性状,如真核细胞培养用的二氢叶酸还原酶、新霉素抗性以及绿色荧光蛋白(GFP),或用于大肠杆菌的四环素或氨苄青霉素抗性。
包含上述的适当DNA序列以及适当启动子或者控制序列的载体,可以用于转化适当的宿主细胞,以使其能够表达蛋白质。
宿主细胞可以是原核细胞,如细菌细胞;或是低等真核细胞,如酵母细胞;或是高等真核细胞,如哺乳动物细胞。代表性例子有大肠杆菌,链霉菌属;鼠伤寒沙门氏菌的细菌细胞;真菌细胞如酵母;植物细胞;果蝇S2或Sf9的昆虫细胞;CHO、COS或Bowes黑素瘤细胞的动物细胞等。
本发明的多核苷酸在高等真核细胞中表达时,如果在载体中插入增强子序列时将会使转录得到增强。增强子是DNA的顺式作用因子,通常大约有10到300个碱基对,作用于启动子以增强基因的转录。可举的例子包括在复制起始点晚期一侧的100到270个碱基对的SV40增强子、在复制起始点晚期一侧的多瘤增强子以及腺病毒增强子等。
本领域一般技术人员都清楚如何选择适当的载体、启动子、增强子和宿主细胞。
用重组DNA转化宿主细胞可用本领域技术人员熟知的常规技术进行。当宿主为原核生物如大肠杆菌时,能吸收DNA的感受态细胞可在指数生长期后收获,用CaCl2法处理,所用的步骤在本领域众所周知。可供选择的是用MgCl2。如果需要,转化也可用电穿孔的方法进行。当宿主是真核生物,可选用如下的DNA转染方法磷酸钙共沉淀法,常规机械方法如显微注射、电穿孔、脂质体包装等。
获得的转化子可以用常规方法培养,表达本发明的基因所编码的多肽。根据所用的宿主细胞,培养中所用的培养基可选自各种常规培养基。在适于宿主细胞生长的条件下进行培养。当宿主细胞生长到适当的细胞密度后,用合适的方法(如温度转换或化学诱导)诱导选择的启动子,将细胞再培养一段时间。
在上面的方法中的重组多肽可包被于细胞内、细胞外或在细胞膜上表达或分泌到细胞外。如果需要,可利用其物理的、化学的和其它特性通过各种分离方法分离和纯化重组的蛋白。这些方法是本领域技术人员所熟知的。这些方法的例子包括但并不限于常规的复性处理、用蛋白沉淀剂处理(盐析方法)、离心、渗透破菌、超处理、超离心、分子筛层析(凝胶过滤)、吸附层析、离子交换层析、高效液相层析(HPLC)和其它各种液相层析技术及这些方法的结合。
重组的具有抑癌功能的人蛋白或多肽有多方面的用途。这些用途包括(但不限于)直接做为药物治疗具有抑癌功能的蛋白功能低下或丧失所致的疾病,和用于筛选促进或对抗具有抑癌功能的蛋白功能的抗体、多肽或其它配体。例如,抗体可用于激活或抑制具有抑癌功能的人蛋白的功能。用表达的重组具有抑癌功能的人蛋白筛选多肽库可用于寻找有治疗价值的能抑制或刺激具有抑癌功能的人蛋白功能的多肽分子。
本发明也提供了筛选药物以鉴定提高(激动剂)或阻遏(拮抗剂)具有抑癌功能的人蛋白的药剂的方法。激动剂提高具有抑癌功能的人蛋白刺激细胞增殖等生物功能,而拮抗剂阻止和治疗与细胞过度增殖有关的紊乱如各种癌症。例如,能在药物的存在下,将哺乳动物细胞或表达具有抑癌功能的人蛋白的膜制剂与标记的具有抑癌功能的人蛋白一起培养。然后测定药物提高或阻遏此相互作用的能力。
具有抑癌功能的人蛋白的拮抗剂包括筛选出的抗体、化合物、受体缺失物和类似物等。具有抑癌功能的人蛋白的拮抗剂可以与具有抑癌功能的人蛋白结合并消除其功能,或是抑制具有抑癌功能的人蛋白的产生,或是与多肽的活性位点结合使多肽不能发挥生物学功能。具有抑癌功能的人蛋白的拮抗剂可用于治疗用途。
在筛选作为拮抗剂的化合物时,可以将本发明蛋白加入生物分析测定中,通过测定化合物影响具有抑癌功能的蛋白和其受体之间的相互作用来确定化合物是否是拮抗剂。用上述筛选化合物的同样方法,可以筛选出起拮抗剂作用的受体缺失物和类似物。
本发明的多肽可直接用于疾病治疗,例如,各种恶性肿瘤、和细胞异常增殖等。
本发明的多肽,及其片段、衍生物、类似物或它们的细胞可以用来作为抗原以生产抗体。这些抗体可以是多克隆或单克隆抗体。多克隆抗体可以通过将此多肽直接注射动物的方法得到。制备单克隆抗体的技术包括杂交瘤技术,三瘤技术,人B-细胞杂交瘤技术,EBV-杂交瘤技术等。
可以将本发明的多肽和拮抗剂与合适的药物载体组合后使用。这些载体可以是水、葡萄糖、乙醇、盐类、缓冲液、甘油以及它们的组合。组合物包含安全有效量的多肽或拮抗剂以及不影响药物效果的载体和赋形剂。这些组合物可以作为药物用于疾病治疗。
本发明还提供含有一种或多种容器的药盒或试剂盒,容器中装有一种或多种本发明的药用组合物成分。与这些容器一起,可以有由制造、使用或销售药品或生物制品的政府管理机构所给出的指示性提示,该提示反映出生产、使用或销售的政府管理机构许可其在人体上施用。此外,本发明的多肽可以与其它的治疗化合物结合使用。
药物组合物可以以方便的方式给药,如通过局部、静脉内、腹膜内、肌内、皮下、鼻内或皮内的给药途径。具有抑癌功能的蛋白以有效地治疗和/或预防具体的适应症的量来给药。施用于患者的具有抑癌功能的蛋白的量和剂量范围将取决于许多因素,如给药方式、待治疗者的健康条件和诊断医生的判断。
具有抑癌功能的人蛋白的多聚核苷酸也可用于多种治疗目的。基因治疗技术可用于治疗由于具有抑癌功能的蛋白的无表达或异常/无活性的具有抑癌功能的蛋白的表达所致的细胞增殖、发育或代谢异常。重组的基因治疗载体可用于治疗具有抑癌功能的蛋白表达或活性异常所致的疾病。来源于病毒的表达载体如逆转录病毒、腺病毒、腺病毒相关病毒、单纯疱疹病毒、细小病毒等可用于将具有抑癌功能的蛋白基因转移至细胞内。构建携带具有抑癌功能的蛋白基因的重组病毒载体的方法可见于已有文献(Sambrook,etal.)。另外重组具有抑癌功能的人蛋白基因可包装到脂质体中转移至细胞内。
抑制具有抑癌功能的人蛋白mRNA的寡聚核苷酸(包括反义RNA和DNA)以及核酶也在本发明的范围之内。核酶是一种能特异性分解特定RNA的酶样RNA分子,其作用机制是核酶分子与互补的靶RNA特异性杂交后进行核酸内切作用。反义的RNA和DNA及核酶可用已有的任何RNA或DNA合成技术获得,如固相磷酸酰胺化学合成法合成寡核苷酸的技术已广泛应用。反义RNA分子可通过编码该RNA的DNA序列在体外或体内转录获得。这种DNA序列已整合到载体的RNA聚合酶启动子的下游。为了增加核酸分子的稳定性,可用多种方法对其进行修饰,如增加两侧的序列长度,核糖核苷之间的连接应用磷酸硫酯键或肽键而非磷酸二酯键。
多聚核苷酸导入组织或细胞内的方法包括将多聚核苷酸直接注入到体内组织中;或在体外通过载体(如病毒、噬菌体或质粒等)先将多聚核苷酸导入细胞中,再将细胞移植到体内等。
本发明的多肽还可用作肽谱分析,例如,多肽可用物理的、化学或酶进行特异性切割,并进行一维或二维或三维的凝胶电泳分析。
本发明还提供了针对具有抑癌功能的人蛋白抗原决定簇的抗体。这些抗体包括(但不限于)多克隆抗体、单克隆抗体、嵌合抗体、单链抗体、Fab片段和Fab表达文库产生的片段。这些抗体可用常规方法制备。抗具有抑癌功能的人蛋白的抗体可用于免疫组织化学技术中,检测活检标本中的具有抑癌功能的人蛋白。
与具有抑癌功能的人蛋白结合的单克隆抗体也可用放射性同位素标记,注入体内可跟踪其位置和分布。本发明中的抗体可用于治疗或预防与具有抑癌功能的人蛋白相关的疾病。给予适当剂量的抗体可以刺激或阻断具有抑癌功能的人蛋白的产生或活性。
抗体也可用于设计针对体内某一特殊部位的免疫毒素。如具有抑癌功能的人蛋白高亲和性的单克隆抗体可与细菌或植物毒素(如白喉毒素,蓖麻蛋白,红豆碱等)共价结合。
多克隆抗体的生产可用具有抑癌功能的人蛋白或多肽免疫动物,如家兔,小鼠,大鼠等。多种佐剂可用于增强免疫反应,包括但不限于弗氏佐剂等。
具有抑癌功能的人蛋白单克隆抗体可用杂交瘤技术生产(Kohler and Milstein.Nature,1975,256495-497)。将人恒定区和非人源的可变区结合的嵌合抗体可用已有的技术生产(Morrison et al,PNAS,1985,816851)。而已有的生产单链抗体的技术(U.S.PatNo.4946778)也可用于生产抗具有抑癌功能的人蛋白的单链抗体。
能与本发明蛋白结合的多肽分子可通过筛选由各种可能组合的氨基酸结合于固相物组成的随机多肽库而获得。筛选时,必须对具有抑癌功能的人蛋白分子进行标记。
本发明还涉及定量和定位检测具有抑癌功能的人蛋白水平的诊断试验方法。这些试验是本领域所熟知的,且包括FISH测定和放射免疫测定。试验中所检测的具有抑癌功能的人蛋白水平,可以用作解释具有抑癌功能的人蛋白在各种疾病中的重要性和用于诊断具有抑癌功能的蛋白起作用的疾病。
具有抑癌功能的蛋白的多聚核苷酸可用于具有抑癌功能的蛋白相关疾病的诊断和治疗。在诊断方面,具有抑癌功能的蛋白的多聚核苷酸可用于检测具有抑癌功能的蛋白的表达与否或在疾病状态下具有抑癌功能的蛋白的异常表达。如具有抑癌功能的蛋白DNA序列可用于对活检标本的杂交以判断具有抑癌功能的蛋白的表达异常。杂交技术包括Southern印迹法,Northern印迹法、原位杂交等。这些技术方法都是公开的成熟技术,相关的试剂盒都可从商业途径得到。本发明的多核苷酸的一部分或全部可作为探针固定在微阵列(Microarray)或DNA芯片(又称为“基因芯片”)上,用于分析组织中基因的差异表达分析和基因诊断。用具有抑癌功能的蛋白特异的引物进行RNA-聚合酶链反应(RT-PCR)体外扩增也可检测具有抑癌功能的蛋白的转录产物。
检测具有抑癌功能的蛋白基因的突变也可用于诊断具有抑癌功能的蛋白相关的疾病。具有抑癌功能的蛋白突变的形式包括与正常野生型具有抑癌功能的蛋白DNA序列相比的点突变、易位、缺失、重组和其它任何异常等。可用已有的技术如Southern印迹法、DNA序列分析、PCR和原位杂交检测突变。另外,突变有可能影响蛋白的表达,因此用Northern印迹法、Western印迹法可间接判断基因有无突变。
本发明的序列对染色体鉴定也是有价值的。这些序列会特异性地针对某条人染色体具体位置且并可以与其杂交。目前,需要鉴定染色体上的各基因的具体位点。然而现在只有很少的基于实际序列数据(重复多态性)的染色体标记物可用于标记染色体位置。为了将这些序列与疾病相关基因相关联。第一步就是将本发明DNA序列定位于染色体上。
简而言之,根据cDNA制备PCR引物(优选15-35bp),可以将序列定位于染色体上。然后,将这些引物用于PCR筛选含各条人染色体的体细胞杂合细胞。只有那些含有相应于引物的人基因的杂合细胞会产生扩增的片段。
体细胞杂合细胞的PCR定位法,是将DNA定位到具体染色体的快捷方法。使用本发明的的寡核苷酸引物,通过类似方法,可利用一组来自特定染色体的片段或大量基因组克隆而实现亚定位。可用于染色体定位的其它类似策略包括原位杂交、用标记的流式分选的染色体预筛选和杂交预选,从而构建染色体特异的cDNA库。
将cDNA克隆与中期染色体进行荧光原位杂交(FISH),可以在一个步骤中精确地进行染色体定位。此技术的综述,参见Verma等,Human Chromosomesa Manual of BasicTechniques,Pergamon Press,New York(1988)。
一旦序列被定位到准确的染色体位置,此序列在染色体上的物理位置就可以与基因图数据相关联。这些数据可见于例如,V.Mckusick,Mendelian Inheritance in Man(可通过与Johns Hopkins University Welch Medical Library联机获得)。然后可通过连锁分析,确定基因与业已定位到染色体区域上的疾病之间的关系。
接着,需要测定患病和未患病个体间的cDNA或基因组序列差异。如果在一些或所有的患病个体中观察到某突变,而该突变在任何正常个体中未观察到,则该突变可能是疾病的病因。比较患病和未患病个体,通常涉及首先寻找染色体中结构的变化,如从染色体水平可见的或用基于cDNA序列的PCR可检测的缺失或易位。
本发明的具有抑癌功能的蛋白核苷酸全长序列或其片段通常可以用PCR扩增法、重组法或人工合成的方法获得。对于PCR扩增法,可根据本发明所公开的有关核苷酸序列,尤其是开放阅读框序列来设计引物,并用市售的cDNA库或按本领域技术人员已知的常规方法所制备的cDNA库作为模板,扩增而得有关序列。当序列较长时,常常需要进行两次或多次PCR扩增,然后再将各次扩增出的片段按正确次序拼接在一起。
一旦获得了有关的序列,就可以用重组法来大批量地获得有关序列。这通常是将其克隆入载体,再转入细胞,然后通过常规方法从增殖后的宿主细胞中分离得到有关序列。
此外,还可用人工合成的方法来合成有关序列,尤其是片段长度较短时。通常,通过先合成多个小片段,然后再进行连接可获得序列很长的片段。
目前,已经可以完全通过化学合成来编码本发明蛋白(或其片段,或其衍生物)的DNA序列。然后可将该DNA序列引入本领域中的各种DNA分子(如载体)和细胞中。此外,还可通过化学合成将突变引入本发明蛋白序列中。
此外,由于本发明的具有抑癌功能的蛋白具有源自人的天然氨基酸序列,因此,与来源于其他物种的同族蛋白相比,预计在施用于人时将具有更高的活性和/或更低的副作用(例如在人体内的免疫原性更低或没有)。
下面结合具体实施例,进一步阐述本发明。应理解,这些实施例仅用于说明本发明而不用于限制本发明的范围。下列实施例中未注明具体条件的实验方法,通常按照常规条件如Sambrook等人,分子克隆实验室手册(New YorkCold Spring Harbor LaboratoryPress,1989)中所述的条件,或按照制造厂商所建议的条件。
实施例1cDNA基因的获得及对小鼠NIH/3T3细胞克隆形成的抑制作用PP11303、PP12899和PP14183是通过用常规方法构建人胎盘cDNA文库获得的;FP504、FP972、FP6628、FP6651和FP7162是通过用常规方法构建人胎儿cDNA文库获得的。取3、6、9月龄的胎盘组织(PP克隆)或胎儿组织(FP克隆),用Trizol试剂(GIBCO BRL公司)按厂方说明书提取总RNA,用mRNA提纯试剂盒(Pharmacia公司)提取mRNA。用pCMV-script TMXR cDNA文库构建试剂盒(Stratagene公司)构建上述mRNA的cDNA文库。其中反转录酶改用MMLV-RT-Superscript II(GIBCO BRL),反转录反应在42℃进行。转化XL 10-Gold感受细胞,获得了1×106cfu/μg滴度的cDNA文库。第一轮随机挑取cDNA克隆,其后以高丰度cDNA克隆和已证明有抑制癌细胞生长功能的cDNA克隆为探针,杂交筛选cDNA文库,挑取弱阳性及阴性克隆。用Qiagen 96孔板质粒抽提试剂盒,按厂家说明书进行质粒DNA的提取。质粒DNA和空载体同时转染小鼠NIH/3T3细胞。100ng DNA酒精沉淀干燥后,加6μl H2O溶解,待转染。每份DNA样品中加0.74μl脂质体及9.3μl无血清培液,混匀后,室温放置10分钟。每管中加150μl无血清培液,均分加入3孔生长于96孔板的小鼠NIH/3T3细胞中,37℃放置2小时,每孔再加50μl无血清培液,37℃24小时。每孔换100μl全培液,37℃24小时,换含G418的全培液100μl,37℃24-48小时,边观察,边换G418浓度不等的培液。约2-3次后,直到镜检细胞有克隆形成,计数。发现上述克隆有抑制NIH/3T3细胞克隆形成作用,结果如下表所示。
cDNA克隆转染细胞(3T3)克隆形成情况

对cDNA克隆采用双脱氧终止法,在ABI377 DNA自动测序仪上测定其一端近500bp的核苷酸序列。分析后,确定为新基因克隆,进行另一端测序,仍未获得全长cDNA序列,设计引物,再次进行测序,直到获得全长序列(SEQ ID NO1、4、7、10、13、16、19、22)。
实施例2从胎盘或胎儿cDNA中PCR获得全长基因取3、6、9月龄的胎盘组织(PP克隆)或胎儿组织(FP克隆),用Trizol试剂(GIBCOBRL公司)按厂方说明书提取总RNA,用mRNA提纯试剂盒(Pharmacia公司)提取mRNA。用MMLV-RT-Superscript II(GIBCO BRL),反转录酶在42℃进行反转录反应,获得胎盘或胎儿cDNA。利用各个基因的特异引物(如下表所示),按97℃ 3′,1个循环。94℃ 30″,60℃ 30″,72℃ 1′,共35个循环;72℃ 10′,1个循环进行PCR扩增,获得含有完整开放阅读框序列的各蛋白基因的扩增产物。扩增产物经测序验证,与实施例1测得的序列相符,随后用常规技术将扩增产物转入宿主细胞,获得重组蛋白(SEQ ID NOSEQ ID NO3、6、9、12、15、18、21、24)。
基因特异引物

实施例3cDNA克隆序列分析1.PP11303A核苷酸序列(SEQ ID NO1)长度2662个碱基1 GTGACAGTCC ACGGCCCCGC TGGGATGGAG CCCTGCTGGG TGCCCGCACC GTGCTCAGTG61 TGGCATGCGG CCCGGGTGTG GAGGGAGACG GTGGAGCATC CCGTGCCTAG CGTGGTGCCA121 GCCAAGGGCG GGTGGCTGGG GAGCTGTGCT GGGAGCTGTC GTAAACCCGT GGTGGCTTTG181 ATCCTAGGGC CGTCTTTCTG CTCCACTTCC CGGGCACTGT TCCGAGGGAG GCTCAGGTGG241 GGAAGCGAGT GAGCCTAAAG CCCAGGCTTG TCTCCTTGGT GCCAGGCCCT GCTTGCTGGA301 ATCTGGTGAT CTTAGGAGGT CACTGTTGCA AGGGAGGGGA CCCAGGAGCC ACCTAGTCGA361 ACCTCTTTGT GGTACAGATG GAGAAACCAA GGCCCAAACA GTGGCCCCCT TGATCAGCCA421 GAAGCAGAGC TGGGTGGGGC AGCAGGGGAT CCCCACCACC AGGCTCAGAC TCCTTCAGGA481 TGCTTTTGCT TCGCAGATGA GGAGACCGAG GCTTAGAGAA GAGTAGAGAC TTGCTACCCG541 TTGAGGTGGT AACAGCCAGG CTAGAATCTC CTGAAACGGG GCAGGGTGGG GAGGTCTGGT601 TGGGCTACCT GGGGCCGGGC GCCTTTCCCC CAGGATGGGG TGTACTGCCC GCCCTCCCCC661 AGTCATGGTG CTGGTGCCAG CTGGTGCAGG GGGAGGGCTC TGCAGGCCTT AGCACTGAGG721 CAGGTGGCGA GCAGCAGGGG AAGGGTCTTC TCCACCCACC CCAACTGCCC AAGGTTCCGT781 GGCTCCTCCT TAGACAGCAG TGAGGGTTGG GGGTGACAGG CAAGCCACTG AGCCTCAGCA841 CCGCGACTCA CCCCTCCCAC TCAGCAGTCC AGCCAGGGTC ATCCCCAGCC TCAGAGGAGC901 CTGGGAACAA GGGCAGCGGC AGGGCCGGCG GGGGCCTGGA GGGTGAGCAG GGGCCTTTCT961 TCCTGCAGAC AGCCCTCAGC GCCTTTTTCA GGAGACCAAC ATCCCCTACA GCCACCATCA1021 CCACCAGATG GTAAGTGTCC CCGGAGTCCC CAGTTCTGGA TTGGGCGGAA GGAGGCCGAG1081 CTAGTTCTGT GTATAAGCAG CCCCTGGCCC CGGTGTACGA GGGCGCTGGT GCAGGCGGGG1141 CTCGACCTCT TTGGAGATGG GTCAGCAGGA GTCCCGGCTC CATGGGTCCT GCACTTAATC1201 TTGCCTGTGC CAGCTCCCCC TGAGACCTGG GGGGCGCTGG CCTCTGGGGC AATGAAGCTC1261 CTTACCCTAC AGCCCCCGGG GATGCTGTGG CTGATGGAAA GGGGTGGGCT GGGAAAGCCT1321 CGTGGCCCCA GGCACCGTGG GCTCCTGAGA GTGAGGCTGG GTCGGTTCAT CTCAAGGCTT1381 CTCCTCTGGG AACCCCTGGG CGGCGGACAG GCTTGGGGAT CTGGGGAAGG AACACAGAGC1441 CTTCCGAGAA TGGGCCAGCC ACGCATCTCC CCTTGGGAGG CAGTGGGGGC CCCTCCAGGA1501 AGGGGTGCTC ACCCCATCTC TCCTCTCTTC CCCTCACAGA TGTGCACCCC CGCCAATACC1561 CCTGCTACAC CCCCCAACTT CCCTGACGCT CTCACCATGT TCTCCCGTCT CAAGGCCTCC1621 GAGAGCTTCC ACAGCGGTGG CAGCGGCAGC CCGATGGCCG CGACAGCCAC GTCACCCCCG1681 CCACACTTCC CCCATGCCGC CACCAGCAGC TCTGCGGCCT CCAGCTGGCC CACGGCGGCC1741 TCGCCCCCGG GGGGCCCACA GCACCACCAG CCACAGCCGC CCCTGTGGAC TCCAACACCC1801 CCTTCTCCGG CTTCAGACTG GCCACCCCTG GCCCCCCAAC AGGCCACCTC AGAACCCAGG1861 GCCCACCCTG CCATGGAGGC AGAGAGATAA GGGAGGCCCC TCCCCCCTCC CGGAGGCCAG1921 GACCCGTGGG GCGGGGGAGA GGACGTCTCT GCGGGCCCCC TTCACCCCTT TTCTGTCTGC1981 ACCCCTTGTT CCCCGGAGCC CTGGAGGGGA GAGCGCGGAC TCTAGCCAGG CAGGGACACG2041 TCTGGTGCCA GAACACGCAG CTGCCCACAC GCAAGGTCAT GGCCCCAGCG GCCCCGGCAC2101 ATGGAGTGGT TCAGAGCGGC CTGGGTGCCT GGCGGACAGA ACTTCAGAGA CCACGCAGCC2161 TTCCTTCGAA GACGCACCTG CCCAGCCCAG CCCAGGGGTG CCGTGGAGGA CCACCCTGGC2221 GGAGACATTG CTGATCCCTG GCTTGGAGCT CCTTGGGGGC CGGCAGGCCT CGAACCCCCA2281 CCCTAGGGAA TGCAGAGCCT CTCCGCATGT GTGCGCGTGG CCGTGTCTGT GTATTTCTAC2341 GTGTGTCGCT CTTCAGAAGC AACCTAGTTC CTGGGGCAGC TGGACTTTGC ATGTTAGTGT2401 GAGCCCCCAG CCCCCTGCCC GCCGCCCCCT CCCCAGGGCC CTGCCTCCTC CCCACCCCCT2461 CGTCAGCCAG CGTTGCTGTT CCTTGCAGAG AAAAGGATTG TGGGAAACTC CAGGACTCTT2521 CCCACCGCCT CCCAGCGCCT GCCTGCTGGG GCTGCCTGCA TGCCTCCCCT GCACCTGGGG2581 GTACCCGCAT CCACTTCCTT TCCCCCTTTT AACAAAAGAG AAGAACGAAT TCCAAAAAAA2641 AAAAAAAAAA AAAAAAAAAA AAB核苷酸序列(SEQ ID NO3)长度212个氨基酸1 MKLLTLQPPG MLWLMERGGL GKPRGPRHRG LLRVRLGRFI SRLLLWEPLG GGQAWGSGEG61 TQSLPRMGQP RISPWEAVGA PPGRGAHPIS PLFPSQMCTP ANTPATPPNF PDALTMFSRL121 KASESFHSGG SGSPMAATAT SPPPHFPHAA TSSSAASSWP TAASPPGGPQ HHQPQPPLWT181 PTPPSPASDW PPLAPQQATS EPRAHPAMEA ERC.核苷酸及氨基酸组合序列(SEQ ID NO2)克隆号和蛋白名称PP11303起始编码子1252 ATG 终止编码子1888 TAA 蛋白质分子量22392.191 GTG ACA GTC CAC GGC CCC GCT GGG ATG GAG CCC TGC TGG GTG CCC GCA CCG TGC TCA GTG 6061 TGG CAT GCG GCC CGG GTG TGG AGG GAG ACG GTG GAG CAT CCC GTG CCT AGC GTG GTG CCA 120121 GCC AAG GGC GGG TGG CTG GGG AGC TGT GCT GGG AGC TGT CGT AAA CCC GTG GTG GCT TTG 180181 ATC CTA GGG CCG TCT TTC TGC TCC ACT TCC CGG GCA CTG TTC CGA GGG AGG CTC AGG TGG 240241 GGA AGC GAG TGA GCC TAA AGC CCA GGC TTG TCT CCT TGG TGC CAG GCC CTG CTT GCT GGA 300301 ATC TGG TGA TCT TAG GAG GTC ACT GTT GCA AGG GAG GGG ACC CAG GAG CCA CCT AGT CGA 360361 ACC TCT TTG TGG TAC AGA TGG AGA AAC CAA GGC CCA AAC AGT GGC CCC CTT GAT CAG CCA 420421 GAA GCA GAG CTG GGT GGG GCA GCA GGG GAT CCC CAC CAC CAG GCT CAG ACT CCT TCA GGA 480481 TGC TTT TGC TTC GCA GAT GAG GAG ACC GAG GCT TAG AGA AGA GTA GAG ACT TGC TAC CCG 540541 TTG AGG TGG TAA CAG CCA GGC TAG AAT CTC CTG AAA CGG GGC AGG GTG GGG AGG TCT GGT 600601 TGG GCT ACC TGG GGC CGG GCG CCT TTC CCC CAG GAT GGG GTG TAC TGC CCG CCC TCC CCC 660661 AGT CAT GGT GCT GGT GCC AGC TGG TGC AGG GGG AGG GCT CTG CAG GCC TTA GCA CTG AGG 720721 CAG GTG GCG AGC AGC AGG GGA AGG GTC TTC TCC ACC CAC CCC AAC TGC CCA AGG TTC CGT 780781 GGC TCC TCC TTA GAC AGC AGT GAG GGT TGG GGG TGA CAG GCA AGC CAC TGA GCC TCA GCA 840841 CCG CGA CTC ACC CCT CCC ACT CAG CAG TCC AGC CAG GGT CAT CCC CAG CCT CAG AGG AGC 900901 CTG GGA ACA AGG GCA GCG GCA GGG CCG GCG GGG GCC TGG AGG GTG AGC AGG GGC CTT TCT 960961 TCC TGC AGA CAG CCC TCA GCG CCT TTT TCA GGA GAC CAA CAT CCC CTA CAG CCA CCA TCA10201021 CCA CCA GAT GGT AAG TGT CCC CGG AGT CCC CAG TTC TGG ATT GGG CGG AAG GAG GCC GAG10801081 CTA GTT CTG TGT ATA AGC AGC CCC TGG CCC CGG TGT ACG AGG GCG CTG GTG CAG GCG GGG11401141 CTC GAC CTC TTT GGA GAT GGG TCA GCA GGA GTC CCG GCT CCA TGG GTC CTG CAC TTA ATC12001201 TTG CCT GTG CCA GCT CCC CCT GAG ACC TGG GGG GCG CTG GCC TCT GGG GCA ATG AAG CTC12601 Met Lys Leu 31261 CTT ACC CTA CAG CCC CCG GGG ATG CTG TGG CTG ATG GAA AGG GGT GGG CTG GGA AAG CCT13204 Leu Thr Leu Gln Pro Pro Gly Met Leu Trp Leu Met Glu Arg Gly Gly Leu Gly Lys Pro 231321 CGT GGC CCC AGG CAC CGT GGG CTC CTG AGA GTG AGG CTG GGT CGG TTC ATC TCA AGG CTT138024 Arg Gly Pro Arg His Arg Gly Leu Leu Arg Val Arg Leu Gly Arg Phe Ile Ser Arg Leu 431381 CTC CTC TGG GAA CCC CTG GGC GGC GGA CAG GCT TGG GGA TCT GGG GAA GGA ACA CAG AGC144044 Leu Leu Trp Glu Pro Leu Gly Gly Gly Gln Ala Trp Gly Ser Gly Glu Gly Thr Gln Ser 631441 CTT CCG AGA ATG GGC CAG CCA CGC ATC TCC CCT TGG GAG GCA GTG GGG GCC CCT CCA GGA150064 Leu Pro Arg Met Gly Gln Pro Arg Ile Ser Pro Trp Glu Ala Val Gly Ala Pro Pro Gly 831501 AGG GGT GCT CAC CCC ATC TCT CCT CTC TTC CCC TCA CAG ATG TGC ACC CCC GCC AAT ACC156084 Arg Gly Ala His Pro Ile Ser Pro Leu Phe Pro Ser Gln Met Cys Thr Pro Ala Asn Thr 1031561 CCT GCT ACA CCC CCC AAC TTC CCT GAC GCT CTC ACC ATG TTC TCC CGT CTC AAG GCC TCC1620104 Pro Ala Thr Pro Pro Asn Phe Pro Asp Ala Leu Thr Met Phe Ser Arg Leu Lys Ala Ser 1231621 GAG AGC TTC CAC AGC GGT GGC AGC GGC AGC CCG ATG GCC GCG ACA GCC ACG TCA CCC CCG1680124 Glu Ser Phe His Ser Gly Gly Ser Gly Ser Pro Met Ala Ala Thr Ala Thr Ser Pro Pro 1431681 CCA CAC TTC CCC CAT GCC GCC ACC AGC AGC TCT GCG GCC TCC AGC TGG CCC ACG GCG GCC1740144 Pro His Phe Pro His Ala Ala Thr Ser Ser Ser Ala Ala Ser Ser Trp Pro Thr Ala Ala 1631741 TCG CCC CCG GGG GGC CCA CAG CAC CAC CAG CCA CAG CCG CCC CTG TGG ACT CCA ACA CCC1800164 Ser Pro Pro Gly Gly Pro Gln His His Gln Pro Gln Pro Pro Leu Trp Thr Pro Thr Pro 1831801 CCT TCT CCG GCT TCA GAC TGG CCA CCC CTG GCC CCC CAA CAG GCC ACC TCA GAA CCC AGG1860184 Pro Ser Pro Ala Ser Asp Trp Pro Pro Leu Ala Pro Gln Gln Ala Thr Ser Glu Pro Arg 2031861 GCC CAC CCT GCC ATG GAG GCA GAG AGA TAA GGG AGG CCC CTC CCC CCT CCC GGA GGC CAG1920204 Ala His Pro Ala Met Glu Ala Glu Arg *** 2131921 GAC CCG TGG GGC GGG GGA GAG GAC GTC TCT GCG GGC CCC CTT CAC CCC TTT TCT GTC TGC19801981 ACC CCT TGT TCC CCG GAG CCC TGG AGG GGA GAG CGC GGA CTC TAG CCA GGC AGG GAC ACG20402041 TCT GGT GCC AGA ACA CGC AGC TGC CCA CAC GCA AGG TCA TGG CCC CAG CGG CCC CGG CAC21002101 ATG GAG TGG TTC AGA GCG GCC TGG GTG CCT GGC GGA CAG AAC TTC AGA GAC CAC GCA GCC21602161 TTC CTT CGA AGA CGC ACC TGC CCA GCC CAG CCC AGG GGT GCC GTG GAG GAC CAC CCT GGC22202221 GGA GAC ATT GCT GAT CCC TGG CTT GGA GCT CCT TGG GGG CCG GCA GGC CTC GAA CCC CCA22802281 CCC TAG GGA ATG CAG AGC CTC TCC GCA TGT GTG CGC GTG GCC GTG TCT GTG TAT TTC TAC23402341 GTG TGT CGC TCT TCA GAA GCA ACC TAG TTC CTG GGG CAG CTG GAC TTT GCA TGT TAG TGT24002401 GAG CCC CCA GCC CCC TGC CCG CCG CCC CCT CCC CAG GGC CCT GCC TCC TCC CCA CCC CCT24602461 CGT CAG CCA GCG TTG CTG TTC CTT GCA GAG AAA AGG ATT GTG GGA AAC TCC AGG ACT CTT25202521 CCC ACC GCC TCC CAG CGC CTG CCT GCT GGG GCT GCC TGC ATG CCT CCC CTG CAC CTG GGG25802581 GTA CCC GCA TCC ACT TCC TTT CCC CCT TTT AAC AAA AGA GAA GAA CGA ATT CCA AAA AAA26402641 AAA AAA AAA AAA AAA AAA AAA A 26622.PP12899A核苷酸序列(SEQ ID NO4)长度3325个碱基1 GGCCGCGCGA GGGTGGTGGG CATCGAGGTC CCAGCAGCGG ACGAGGGAGG TGCCGCCGTC61 GCCCAGGATG GGCTGGGAAT GAAGCGATGT AGCCTTTTAA GAGATTTGCT CTGACCCATC121 TGAAGTCCAT ATGGCTCTGT ATGATGAAGA CCTCCTGAAA AATCCTTTCT ATCTGGCTCT181 GCAAAAGTGC CGCCCTGACT TGTGCAGCAA AGTGGCCCAA ATCCATGGCA TTGTCTTAGT241 ACCCTGCAAA GGAAGCCTGT CGAGCAGCAT CCAGTCTACT TGTCAGTTTG AGTCCTACAT301 TTTGATACCT GTGGAAGAGC ATTTTCAGAC CTTAAATGGA AAGGATGTCT TTATTCAAGG361 GAACAGGATT AAATTAGGAG CTGGTTTTGC CTGTCTTCTC TCAGTGCCCA TTCTCTTTGA421 AGAAACTTTC TACAATGAAA AAGAAGAGAG TTTCAGCATC CTGTGTATAG CCCATCCTTT481 GGAAAAGAGA GAGAGTTCAG AAGAGCCTTT GGCACCCTCA GATCCCTTTT CCCTGAAAAC541 CATTGAAGAT GTGAGAGAGT TCTTGGGAAG ACACTCCGAG CGATTTGACA GGAACATCGC601 CTCTTTCCTA ATCGAACATT CCGAGAATGC GAGAGAAAGA GCCTCCGTCA CCACATAGAC661 TCAGCGAATG CTCTCTACAC CAAATGCCTC CAGCAGCTTC TGAGGGACTC TCACCTGAAA721 ATGCTCGCCA AGCAGGAGGC CCAGATGAAC CTGATGAAGC AGGCAGTGGA GATATACGTC781 CATCATGAAA TTTACAACCT GATCTTTAAA TACGTGGGGA CCATGGAGGC AAGTGAGGAT841 GCGGCCTTTA ACAAAAATCA CAAGAAGCCT TCAAGATCTT CAGCAGAAAG ATATTGGTGT901 GAAACCGGAG TTCAGCTTTA ACATACCTCG TGCCAAAAGA GAGCTGGCTC AGCTGAACAA961 ATGCACCTCC CCACAGCAGA AGCTTGTCTG CTTGCGAAAA GTGGTGCAGC TCATTACACA1021 GTCTCCAAGC CAGAGAGTGA ACCTGGAGAC CATGTGTGCT GATGATCTGC TATCAGTCCT1081 GTTATACTTG CTTGTGAAAA CGGAGATCCC TAATTGGATG GCAAATTTGA GTTACATCAA1141 AAACTTCAGG TTTAGCAGCT TGGCAAAGGA TGAACTGGGA TACTGCCTGA CCTCATTCGA1201 AGCTGCCATT GAATATATTC GGCAAGGAAG CCTCTCTGCT AAACCCCCTG TAAGATCTCA1261 CCCCTGCCCT GGCCTTCCTT TGTGGGCATC ATGGTTCCCT TGATAGGGTG CTGGGGTTGG1321 TATGTGGGCA GACGGATTCT TAAATTGCCT CCCAGGAATG GGGCCTCAGC TGTTTGAGGG1381 CTGTGAGTCT TAAAAATCAC TCAGTGAAGA GAACACCAAG CCCCCAATTG GTGGTAAAAA1441 TTGGTGGGTT ATCATTGGGA TTTACATTGT TAATATCCTA CTTCATTAGT CCCCATCCTC1501 TCCAAAGACA TGTGGGTGCA AAGGGAAGCC AGAAGTAGGG AATTTGGATT TCTTGACCTT1561 GATAGTCAAG AAGTGATGTC ACGGGATCCC TGGACTGTCG CTTTTCCAGC CGGAAACCTC1621 TGTGGCTGGT GGCTCCTTTG CCTGAGTTTT GTTCGGGCCT GCTGGGCTCA TTTCACGCTC1681 TTGGCCTGGC AGGCTGCGCT CGGCTTGTGC TACTGGCCTG GATCCCATGC CTGCCAAGGG1741 CGAGCCAGGT GTGGAGTGGC GAGGGGTATG TGAGCAAGTG CAGGGTCTGG CCACTGCACA1801 CAACCAGGTG TGCCGACTGA GGTGGGGTGG GCAGCTCCAA GTTGCTTGTA CAGGGTCCTG1861 CTCCATGCAA GGCTGCAGCT AGAGCAGGCG TACTGTAGGC CGCTTCCACG GTGGGCACTG1921 GGGAACACAG TGGGGCCTGG AAGCTTGGAG ACACCAGGAA CTGCAGAGCC CCAAAGAGGG1981 TGTCATAGCC CTGGCTCGGG GAACTCCTAG GTTGGGCTCC CTGAAGGGCC AGAGCTCTTC2041 TCTCCTCTCG TCACCTGCAA TGTAGTGAGT CGGGAGCATG TTTTAGCTCT CTTTATGTTA2101 CAGCTCTTTC AGTCCTGCCA TTTGGTGGGT CCCGAGTTCT TGTCCCATGT CGAGGAAGAA2161 TGAGGTACGT AGACTAGTGG AGGGTGAGCA AGGCAGAGAG GAGCTTTACT GAACGGCAGA2221 ATAGCTCTCA GGAGACCCAC AGTGGGCAGC TTCTTTCCAC AGGCAGGTCG TCCTGACGAG2281 TTAAAGAGGC CTGACGTAGG TAGCTCCTTC CTGCAGTTGG TAGTCCCGAC ATCTGTCTGA2341 GTCTGGCTGA GTCCGGGGTT TTTTATGGCT CAGAAGGGAG GGAGTATGTG CTGATTGGTC2401 CATAGGTGGG CCTGGAGAAA GCACCATGAG TTCTCAGTCT GGGCCGTGGA CTCCACTTGG2461 AACTGACAGC CCAGCCCCCA GGCTTTAGGC TGTCCCTGTC TTGAAGGTGG GGCTTCACTG2521 GGCACCTGCA CCTTTCCACC CAGAAGCGTG TCTGCCTTCT GCCACCATCA ACATGCTGGC2581 CAGTGCATCC AGGCTGTTTG TGCCAAGGGG CATCTGCAGG CCTGCACTGA GCTGCCCTCA2641 GCCCCTACCT TGACTACTCT CCCATGCTCA TCAGCGCCCA AAATCTTGGA GGGGCTGAGG2701 CATCAGGAGG CTGGTGTGTC AGTGTCACAC CAAGCATGTG CACACATGGC TGGGTTGCAA2761 CAGTACCCGG GCTTGGCCTC AGCTTTGCTC TGAAATTGAA GTCGGTGCCA GGAGTGGGGA2821 GGAGCGGGAG CAGGCACTTA CGAGCCTGCG GCGGCAGGGA TGCTTCCTGG GCCCCTGAGA2881 GTGCAGAGAT TCCTGGATCC AGAGCTGCGG CTGGGCGGCT GCAGCTGCGC CTGGGAGTGC2941 AGGGCTCCCG CCCTGCCAGC TCAGTAGGAG ATGGGGGCTC CTGCCTATTC CTGGCTCCTG3001 TTGGCCCTGC AGAGTGCACA ACCCTGGCCG CGCTTCCTCC ACTGCAGCTT ACGTCTTTGC3061 AGCAGCCACT CCCGATGGGC TGCCACTGCC ATCTGTGAGA CAATTAATGT GTGCAATTTG3121 AGGACTCAGT GGCCTTGCCA TTGTTTCCCT TGGTTTTTAT TGAGCATTGG CTGGGGTCGG3181 CGAGGGGATG TGATTATATT TCTATGTGAA TCGTGAGAAT CTTGAACCAT AGTTGTCCTG3241 CTGGCCTGTT TTACTACATA CCAATGAGTA AAATGTGATC ATACAGAAAT CACAAAGTTG3301 AAATCCTAAA AAAAAAAAAA AAAAAB核苷酸序列(SEQ ID NO6)长度175个氨基酸1 MALYDEDLLK NPFYLALQKC RPDLCSKVAQ IHGIVLVPCK GSLSSSIQST CQFESYILIP61 VEEHFQTLNG KDVFIQGNRI KLGAGFACLL SVPILFEETF YNEKEESFSI LCIAHPLEKR121 ESSEEPLAPS DPFSLKTIED VREFLGRHSE RFDRNIASFL IEHSENARER ASVTTC.核苷酸及氨基酸组合序列(SEQ ID NO5)克隆号和蛋白名称PP12899起始编码子131 ATG 终止编码子656 TAG 蛋白质分子量19828.541 G GCC GCG CGA GGG TGG TGG GCA TCG AGG TCC CAG CAG CGG ACG AGG GAG GTG CCG CCG 5859 TCG CCC AGG ATG GGC TGG GAA TGA AGC GAT GTA GCC TTT TAA GAG ATT TGC TCT GAC CCA118119 TCT GAA GTC CAT ATG GCT CTG TAT GAT GAA GAC CTC CTG AAA AAT CCT TTC TAT CTG GCT1781 Met Ala Leu Tyr Asp Glu Asp Leu Leu Lys Asn Pro Phe Tyr Leu Ala 16179 CTG CAA AAG TGC CGC CCT GAC TTG TGC AGC AAA GTG GCC CAA ATC CAT GGC ATT GTC TTA23817 Leu Gln Lys Cys Arg Pro Asp Leu Cys Ser Lys Val Ala Gln Ile His Gly Ile Val Leu 36239 GTA CCC TGC AAA GGA AGC CTG TCG AGC AGC ATC CAG TCT ACT TGT CAG TTT GAG TCC TAC29837 Val Pro Cys Lys Gly Ser Leu Ser Ser Ser Ile Gln Ser Thr Cys Gln Phe Glu Ser Tyr 56299 ATT TTG ATA CCT GTG GAA GAG CAT TTT CAG ACC TTA AAT GGA AAG GAT GTC TTT ATT CAA35857 Ile Leu Ile Pro Val Glu Glu His Phe Gln Thr Leu Asn Gly Lys Asp Val Phe Ile Gln 76359 GGG AAC AGG ATT AAA TTA GGA GCT GGT TTT GCC TGT CTT CTC TCA GTG CCC ATT CTC TTT 41877 Gly Asn Arg Ile Lys Leu Gly Ala Gly Phe Ala Cys Leu Leu Ser Val Pro Ile Leu Phe 96419 GAA GAA ACT TTC TAC AAT GAA AAA GAA GAG AGT TTC AGC ATC CTG TGT ATA GCC CAT CCT 47897 Glu Glu Thr Phe Tyr Asn Glu Lys Glu Glu Ser Phe Ser Ile Leu Cys Ile Ala His Pro 116479 TTG GAA AAG AGA GAG AGT TCA GAA GAG CCT TTG GCA CCC TCA GAT CCC TTT TCC CTG AAA 538117 Leu Glu Lys Arg Glu Ser Ser Glu Glu Pro Leu Ala Pro Ser Asp Pro Phe Ser Leu Lys 136539 ACC ATT GAA GAT GTG AGA GAG TTC TTG GGA AGA CAC TCC GAG CGA TTT GAC AGG AAC ATC 598137 Thr Ile Glu Asp Val Arg Glu Phe Leu Gly Arg His Ser Glu Arg Phe Asp Arg Asn Ile 156599 GCC TCT TTC CTA ATC GAA CAT TCC GAG AAT GCG AGA GAA AGA GCC TCC GTC ACC ACA TAG 658157 Ala Ser Phe Leu Ile Glu His Ser Glu Asn Ala Arg Glu Arg Ala Ser Val Thr Thr *** 176659 ACT CAG CGA ATG CTC TCT ACA CCA AAT GCC TCC AGC AGC TTC TGA GGG ACT CTC ACC TGA 718719 AAA TGC TCG CCA AGC AGG AGG CCC AGA TGA ACC TGA TGA AGC AGG CAG TGG AGA TAT ACG 778779 TCC ATC ATG AAA TTT ACA ACC TGA TCT TTA AAT ACG TGG GGA CCA TGG AGG CAA GTG AGG 838839 ATG CGG CCT TTA ACA AAA ATC ACA AGA AGC CTT CAA GAT CTT CAG CAG AAA GAT ATT GGT 898899 GTG AAA CCG GAG TTC AGC TTT AAC ATA CCT CGT GCC AAA AGA GAG CTG GCT CAG CTG AAC 958959 AAA TGC ACC TCC CCA CAG CAG AAG CTT GTC TGC TTG CGA AAA GTG GTG CAG CTC ATT ACA10181019 CAG TCT CCA AGC CAG AGA GTG AAC CTG GAG ACC ATG TGT GCT GAT GAT CTG CTA TCA GTC10781079 CTG TTA TAC TTG CTT GTG AAA ACG GAG ATC CCT AAT TGG ATG GCA AAT TTG AGT TAC ATC11381139 AAA AAC TTC AGG TTT AGC AGC TTG GCA AAG GAT GAA CTG GGA TAC TGC CTG ACC TCA TTC11981199 GAA GCT GCC ATT GAA TAT ATT CGG CAA GGA AGC CTC TCT GCT AAA CCC CCT GTA AGA TCT12581259 CAC CCC TGC CCT GGC CTT CCT TTG TGG GCA TCA TGG TTC CCT TGA TAG GGT GCT GGG GTT13181319 GGT ATG TGG GCA GAC GGA TTC TTA AAT TGC CTC CCA GGA ATG GGG CCT CAG CTG TTT GAG13781379 GGC TGT GAG TCT TAA AAA TCA CTC AGT GAA GAG AAC ACC AAG CCC CCA ATT GGT GGT AAA14381439 AAT TGG TGG GTT ATC ATT GGG ATT TAC ATT GTT AAT ATC CTA CTT CAT TAG TCC CCA TCC14981499 TCT CCA AAG ACA TGT GGG TGC AAA GGG AAG CCA GAA GTA GGG AAT TTG GAT TTC TTG ACC15581559 TTG ATA GTC AAG AAG TGA TGT CAC GGG ATC CCT GGA CTG TCG CTT TTC CAG CCG GAA ACC16181619 TCT GTG GCT GGT GGC TCC TTT GCC TGA GTT TTG TTC GGG CCT GCT GGG CTC ATT TCA CGC16781679 TCT TGG CCT GGC AGG CTG CGC TCG GCT TGT GCT ACT GGC CTG GAT CCC ATG CCT GCC AAG17381739 GGC GAG CCA GGT GTG GAG TGG CGA GGG GTA TGT GAG CAA GTG CAG GGT CTG GCC ACT GCA17981799 CAC AAC CAG GTG TGC CGA CTG AGG TGG GGT GGG CAG CTC CAA GTT GCT TGT ACA GGG TCC18581859 TGC TCC ATG CAA GGC TGC AGC TAG AGC AGG CGT ACT GTA GGC CGC TTC CAC GGT GGG CAC19181919 TGG GGA ACA CAG TGG GGC CTG GAA GCT TGG AGA CAC CAG GAA CTG CAG AGC CCC AAA GAG19781979 GGT GTC ATA GCC CTG GCT CGG GGA ACT CCT AGG TTG GGC TCC CTG AAG GGC CAG AGC TCT20382039 TCT CTC CTC TCG TCA CCT GCA ATG TAG TGA GTC GGG AGC ATG TTT TAG CTC TCT TTA TGT20982099 TAC AGC TCT TTC AGT CCT GCC ATT TGG TGG GTC CCG AGT TCT TGT CCC ATG TCG AGG AAG21582159 AAT GAG GTA CGT AGA CTA GTG GAG GGT GAG CAA GGC AGA GAG GAG CTT TAC TGA ACG GCA22182219 GAA TAG CTC TCA GGA GAC CCA CAG TGG GCA GCT TCT TTC CAC AGG CAG GTC GTC CTG ACG22782279 AGT TAA AGA GGC CTG ACG TAG GTA GCT CCT TCC TGC AGT TGG TAG TCC CGA CAT CTG TCT23382339 GAG TCT GGC TGA GTC CGG GGT TTT TTA TGG CTC AGA AGG GAG GGA GTA TGT GCT GAT TGG23982399 TCC ATA GGT GGG CCT GGA GAA AGC ACC ATG AGT TCT CAG TCT GGG CCG TGG ACT CCA CTT24582459 GGA ACT GAC AGC CCA GCC CCC AGG CTT TAG GCT GTC CCT GTC TTG AAG GTG GGG CTT CAC25182519 TGG GCA CCT GCA CCT TTC CAC CCA GAA GCG TGT CTG CCT TCT GCC ACC ATC AAC ATG CTG25782579 GCC AGT GCA TCC AGG CTG TTT GTG CCA AGG GGC ATC TGC AGG CCT GCA CTG AGC TGC CCT26382639 CAG CCC CTA CCT TGA CTA CTC TCC CAT GCT CAT CAG CGC CCA AAA TCT TGG AGG GGC TGA26982699 GGC ATC AGG AGG CTG GTG TGT CAG TGT CAC ACC AAG CAT GTG CAC ACA TGG CTG GGT TGC27582759 AAC AGT ACC CGG GCT TGG CCT CAG CTT TGC TCT GAA ATT GAA GTC GGT GCC AGG AGT GGG28182819 GAG GAG CGG GAG CAG GCA CTT ACG AGC CTG CGG CGG CAG GGA TGC TTC CTG GGC CCC TGA 28782879 GAG TGC AGA GAT TCC TGG ATC CAG AGC TGC GGC TGG GCG GCT GCA GCT GCG CCT GGG AGT 29382939 GCA GGG CTC CCG CCC TGC CAG CTC AGT AGG AGA TGG GGG CTC CTG CCT ATT CCT GGC TCC 29982999 TGT TGG CCC TGC AGA GTG CAC AAC CCT GGC CGC GCT TCC TCC ACT GCA GCT TAC GTC TTT 30583059 GCA GCA GCC ACT CCC GAT GGG CTG CCA CTG CCA TCT GTG AGA CAA TTA ATG TGT GCA ATT 31183119 TGA GGA CTC AGT GGC CTT GCC ATT GTT TCC CTT GGT TTT TAT TGA GCA TTG GCT GGG GTC 31783179 GGC GAG GGG ATG TGA TTA TAT TTC TAT GTG AAT CGT GAG AAT CTT GAA CCA TAG TTG TCC 32383239 TGC TGG CCT GTT TTA CTA CAT ACC AAT GAG TAA AAT GTG ATC ATA CAG AAA TCA CAA AGT 32983299 TGA AAT CCT AAA AAA AAA AAA AAA AAA 33253.PP14183A核苷酸序列(SEQ ID NO7)长度2154个碱基1 GGGGGAATCT CACAGCCCTC ACCTACCTCA ACCTCAGCCG AAACCAGCTG TCGCTGCTGC61 CACCCTACAT CTGCCAGCTG CCCCTGAGGG TCCTCATCGT CAGCAACAAC AAGCTGGGAG121 CCCTGCCCCC TGACATCGGC ACCCTGGGAA GCCTGCGACA GCTTGACGTG AGCAGCAACG181 AGCTCCAATC CCTGCCCTCG GAACTGTGTG GCCTCTCTTC CCTGCGGGAC CTCAATGTCC241 GGAGGAACCA GCTCAGTACG CTGCCCGAAG AGCTGGGGGA CCTCCCTCTG GTCCCCTGGA301 TTTCTCCTGT AACCGCGTCT CCCGAATCCC AGTCTCCTTC TGCCGCCTGA GGCACCTGCA361 GGTCATTCTG CTGGACAGCA ACCCTCTGCA GAGTCCACCT GCCCAGGTCT GCCTGAAGGG421 GAAACTTCAC ATCTTCAAGT ATTTGTCCAC AGAGGCCGGG CAGCGTGGGT CGGCCCTGGG481 GGACCTGGCC CCTTCTCGGC CCCCGAGTTT CAGTCCCTGC CCTGCAGAGG ATCTATTTCC541 GGGACATCGG TACGATGGTG GGCTGGACTC AGGCTTCCAC AGCGTTGATA GTGGCAGCAA601 GAGGTGGTCT GGAAATGAGT CAACAGATGA ATTTTCAGAG CTGTCATTCC GGATCTCAGA661 GCTGGCCCGG GAGCCCCGGG GACCCAGAGA ACGCAAGGAG GATGGCTCAG CGGACGGAGA721 CCCTGTGCAG ATTGACTTCA TCGACAGCCA TGTCCCCGGG GAGGATGAAG AGCGAGGCAC781 TGTGGAGGAG CAGCGACCAC CCGAATTAAG CCCTGGGGCA GGGGACAGGG AGAGGGCACC841 AAGCAGCAGG CGGGAGGAGC CGGCAGGGGA GGAGCGGCGG CGCCCGGACA CCTTGCAGCT901 GTGGCAGGAG CGGGAACGGC GGCAGCAGCA GCAGAGCGGG GCGTGGGGGG CCCCGAGGAA961 GGATAGCGGC TCGCCTAAGT CCAGTGCCTC CCAAGCAGGG GCTGCAGCGG GGCAGGGAGC1021 CCCCGCCCCT GCCCCTGCCT CCCAAGAGCC CCTTCCCATA GCTGGACCAG CGACAGCACC1081 CTGCTCCACG GCCACTTGGC TCCATTCAGA GACCAAACAG CTTCCTCTTC CGTTCCTCCT1141 CTCAGAGTGG CTCAGGCCCT TCCTCACCAG ACTCTGTCCT GAGACCTCGG CGGTACCCCC1201 AGGTTCCAGA TGAGAAGGAC TTAATGACTC AGCTGCGCCA GGTCCTTGAG TCCCGGCTGC1261 AGCGGCCCCT GCCTGAGGAC CTGGCGAGGC TCTGGCCAAG TGGGGTCATC CTGTGCCAGC1321 TGGCCAACCA GCTACGGCCG CGCTCCGTGC CCTTCATCCA TGTGCCCTCC CCTGCTGTGC1381 CAAAACTCAG TGCCCTCAAG GCTCGGAAGA ATGTGGAGAG TTTTCTAGAA GCCTGTCGAA1441 AAATGGGGGT GCCTGAGGCT GACCTGTGCT CGCCCTCGGA TCTCCTCCAG GGCACTGCCC1501 GGGGGCTGCG GACCGCGCTG GAGGCCGTGA AGCGGGTGGG GGGCAAGGCC CTACCGCCCC1561 TCTGGCCCCC CTCTGGTCTG GGCGGCTTCG TCGTCTTCTA CGTGGTCCTC ATGCTGCTGC1621 TCTATGTCAC CTACACTCGG CTCCTGGGTT CCTAGGCCCC AAAATCGGCC CTCCCTCACC1681 CCTTTCCCTT CCTCTCTATT TATAAGGTCC CTGCTCCACC CGACCCCACC TGCGGTGCCT1741 TCAGCCCCAA CCAAAGACAC TAGTGCACCC CCTTCACAGA CACTGACCTC AGAGGCCCCA1801 CTCTGGTGCC CCCAGACCCT GGGCCCCCAG CCTCTGGCCT CCCTCCAGTA GCCCCACGAG1861 TCCCCACCTC TCAGTGCTGA CGGTGCCTTC ATGTCCCCGC CGGCCCTGCC CCTGCCCTCT1921 GTACCCCGTG AGGGGTGGCA GGAGCTGGAG TCTCCCCCTT CCTCCTGTGC CCTCCCCTTC1981 CCCCCCCAAC AGCTGCTATG GGGGGGCTAA ATTATCTCTA TTTTGTAGAG AGGATCTATA2041 TTTGTAGGGG TTCGGGGCCC AGGCCGGGTC CCTATCTCTG TGTATAAACT GTACAGACCG2101 TGAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAB核苷酸序列(SEQ ID NO9)长度143个氨基酸
1 MTQLRQVLES RLQRPLPEDL ARLWPSGVIL CQLANQLRPR SVPFIHVPSP AVPKLSALKA61 RKNVESFLEA CRKMGVPEAD LCSPSDLLQG TARGLRTALE AVKRVGGKAL PPLWPPSGLG121 GFVVFYVVLM LLLYVTYTRL LGSC.核苷酸及氨基酸组合序列(SEQ ID NO8) 克隆号和蛋白名称PP14183起始编码子1224 ATG 终止编码子1653 TAG 蛋白质分子量15712.831GG GGG AAT CTC ACA GCC CTC ACC TAC CTC AAC CTC AGC CGA AAC CAG CTG TCG CTG CTG 5960 CCA CCC TAC ATC TGC CAG CTG CCC CTG AGG GTC CTC ATC GTC AGC AAC AAC AAG CTG GGA 119120 GCC CTG CCC CCT GAC ATC GGC ACC CTG GGA AGC CTG CGA CAG CTT GAC GTG AGC AGC AAC 179180 GAG CTC CAA TCC CTG CCC TCG GAA CTG TGT GGC CTC TCT TCC CTG CGG GAC CTC AAT GTC 239240 CGG AGG AAC CAG CTC AGT ACG CTG CCC GAA GAG CTG GGG GAC CTC CCT CTG GTC CCC TGG 299300 ATT TCT CCT GTA ACC GCG TCT CCC GAA TCC CAG TCT CCT TCT GCC GCC TGA GGC ACC TGC 359360 AGG TCA TTC TGC TGG ACA GCA ACC CTC TGC AGA GTC CAC CTG CCC AGG TCT GCC TGA AGG 419420 GGA AAC TTC ACA TCT TCA AGT ATT TGT CCA CAG AGG CCG GGC AGC GTG GGT CGG CCC TGG 479480 GGG ACC TGG CCC CTT CTC GGC CCC CGA GTT TCA GTC CCT GCC CTG CAG AGG ATC TAT TTC 539540 CGG GAC ATC GGT ACG ATG GTG GGC TGG ACT CAG GCT TCC ACA GCG TTG ATA GTG GCA GCA 599600 AGA GGT GGT CTG GAA ATG AGT CAA CAG ATG AAT TTT CAG AGC TGT CAT TCC GGA TCT CAG 659660 AGC TGG CCC GGG AGC CCC GGG GAC CCA GAG AAC GCA AGG AGG ATG GCT CAG CGG ACG GAG 719720 ACC CTG TGC AGA TTG ACT TCA TCG ACA GCC ATG TCC CCG GGG AGG ATG AAG AGC GAG GCA 779780 CTG TGG AGG AGC AGC GAC CAC CCG AAT TAA GCC CTG GGG CAG GGG ACA GGG AGA GGG CAC 839840 CAA GCA GCA GGC GGG AGG AGC CGG CAG GGG AGG AGC GGC GGC GCC CGG ACA CCT TGC AGC 899900 TGT GGC AGG AGC GGG AAC GGC GGC AGC AGC AGC AGA GCG GGG CGT GGG GGG CCC CGA GGA 959960 AGG ATA GCG GCT CGC CTA AGT CCA GTG CCT CCC AAG CAG GGG CTG CAG CGG GGC AGG GAG10191020 CCC CCG CCC CTG CCC CTG CCT CCC AAG AGC CCC TTC CCA TAG CTG GAC CAG CGA CAG CAC10791080 CCT GCT CCA CGG CCA CTT GGC TCC ATT CAG AGA CCA AAC AGC TTC CTC TTC CGT TCC TCC11391140 TCT CAG AGT GGC TCA GGC CCT TCC TCA CCA GAC TCT GTC CTG AGA CCT CGG CGG TAG CCC11991200 CAG GTT CCA GAT GAG AAG GAC TTA ATG ACT CAG CTG CGC CAG GTC CTT GAG TCC CGG CTG12591 Met Thr Gln Leu Arg Gln Val Leu Glu Ser Arg Leu 121260 CAG CGG CCC CTG CCT GAG GAC CTG GCG AGG CTC TGG CCA AGT GGG GTC ATC CTG TGC CAG131913 Gln Arg Pro Leu Pro Glu Asp Leu Ala Arg Leu Trp Pro Ser Gly Val Ile Leu Cys Gln 321220 CTG GCC AAC CAG CTA CGG CCG CGC TCC GTG CCC TTC ATC CAT GTG CCC TCC CCT GCT GTG137933 Leu Ala Asn Gln Leu Arg Pro Arg Ser Val Pro Phe Ile His Val Pro Ser Pro Ala Val 521380 CCA AAA CTC AGT GCC CTC AAG GCT CGG AAG AAT GTG GAG AGT TTT CTA GAA GCC TGT CGA143953 Pro Lys Leu Ser Ala Leu Lys Ala Arg Lys Asn Val Glu Ser Phe Leu Glu Ala Cys Arg 721440 AAA ATG GGG GTG CCT GAG GCT GAC CTG TGC TCG CCC TCG GAT CTC CTC CAG GGC ACT GCC149973 Lys Met Gly Val Pro Glu Ala Asp Leu Cys Ser Pro Ser Asp Leu Leu Gln Gly Thr Ala 921500 CGG GGG CTG CGG ACC GCG CTG GAG GCC GTG AAG CGG GTG GGG GGC AAG GCC CTA CCG CCC155993 Arg Gly Leu Arg Thr Ala Leu Glu Ala Val Lys Arg Val Gly Gly Lys Ala Leu Pro Pro 1121560 CTC TGG CCC CCC TCT GGT CTG GGC GGC TTC GTC GTC TTC TAC GTG GTC CTC ATG CTG CTG1619113 Leu Trp Pro Pro Ser Gly Leu Gly Gly Phe Val Val Phe Tyr Val Val Leu Met Leu Leu 1321620 CTC TAT GTC ACC TAC ACT CGG CTC CTG GGT TCC TAG GCC CCA AAA TCG GCC CTC CCT CAC1679133 Leu Tyr Val Thr Tyr Thr Arg Leu Leu Gly Ser *** 1441680 CCC TTT CCC TTC CTC TCT ATT TAT AAG GTC CCT GCT CCA CCC GAC CCC ACC TGC GGT GCC17391740 TTC AGC CCC AAC CAA AGA CAC TAG TGC ACC CCC TTC ACA GAC ACT GAC CTC AGA GGC CCC 17991800 ACT CTG GTG CCC CCA GAC CCT GGG CCC CCA GCC TCT GGC CTC CCT CCA GTA GCC CCA CGA 18591860 GTC CCC ACC TCT CAG TGC TGA CGG TGC CTT CAT GTC CCC GCC GGC CCT GCC CCT GCC CTC 19191920 TGT ACC CCG TGA GGG GTG GCA GGA GCT GGA GTC TCC CCC TTC CTC CTG TGC CCT CCC CTT 19791980 CCC CCC CCA ACA GCT GCT ATG GGG GGG CTA AAT TAT CTC TAT TTT GTA GAG AGG ATC TAT 20392040 ATT TGT AGG GGT TCG GGG CCC AGG CCG GGT CCC TAT CTC TGT GTA TAA ACT GTA CAG ACC 20992100 GTG AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA A 21544.FP504A核苷酸序列(SEQ ID NO10)长度4952个碱基1 GCTAAGCAGT AAACTAAAGG ATTATATATT ATTAGTCTCA GTGGTTTTCA GATTTATTTT61 TAAAGGGGAA AACAGGGAAA ACCCATCGTA TTTGTAAAGC ACTTTAGGAT TTTGCCGTTT121 GTTTCTGATT GTTTGAAGAT TAGGGCTTTT TGGTGCGTGG TCACCTTTCA CCTCTCCTTT181 TAGGATTTAG TCCTTTCCAG TCTGCTCTTT TTGTGCGTGT CACAACCATA TTCTTGTGGT241 TCTGGCTCAT ATTGTAGAAC TGCTGAACAT AAGGAGAGGT AGCCAGCTGT ATGGTCGGAT301 TTAATATATA ATGTTATATG TTGGGATATC TTAGTGGTTT GTTTTCTGAG GTAAGTTTCT361 TAGTGTTGTG TTTGAGACAT TGTGTTTGCG TTTATGGCGA CACTGTCATT CATGCACTTG421 GCCATCTGAG CGTGGATACA GCGGGCACTC GGGTCTCTCT GCCAGATGGA TGAAAGCAGT481 GTACATTCCA GTGTGGGAGA CAGACATGTG GACAGGTAAA TTACAAGGCA GTGTGATAAA541 GAGTAGAGAG TTGGTTGAGA GAGATCTTAG ACACCATCCT CATTGTGTAG ATGAGAAAGC601 AAAAGTCACC AAGGCAGCCT GGCAGCTGGG ACCCAAGAAG CCTAGGGTGC CAGTCCTTGG661 GCAGTGCGGG GTTAGGCACA CCCAGGGCCC TCCTGGTTCC TGGCTGACTC TTGGACTCTT721 TGTCTCTAAT TGGAGGCCAT GATGCCCAGC TGTAAGGTGG TCAGCTTCAT TTGAGACACT781 ATATCCTTTA GCACAGCGGG GTAATTTCTT CCCTCCTGTT TCATTCATTT ACCAAATGGC841 CTCCTAAATG ATCTAAAATC ACTTGGATCT TTTGTCTTTG TGGACCTAAC ACCTGGCTTT901 TAAAGTTTAA CTTTCTGTCC CCTCTTCAGC TTGCTAAAAT TGAAAAGTGT TGCAGCCCAA961 CCTCCACAAA TCTTGTCTCA GGAAATAAGA GACATTTGTT AACATTTGTT TTGTACCTCT1021 CAGCAGCTTA GTTGACAAGG GCACCGTGTG GGATTTCCTG TTCTTGCTCA TTTGGAAAGA1081 GAATGTTCTT TGTTCTTAGA CCCTCAGCTC TCATGTGAGA GCCATAGAAT GTTGCGAGGT1141 GGAGTTCTGT GGATACAGAA GGAATGTTTT CAAGTTAGAC TTACTGCCAA TGTTAGGATT1201 TGGGACTTTG CATGATTGGG AGGGAGAGGG AGTGCTGGAG AACAGGTTAA AAGTTGTCCC1261 GCTGAGCTTG GAGCATCTCC TGCCAACCCG GAGTGCTTCC CAGGAACCCT GCCAGTGTCA1321 CTTGGGGTTA TGTTTTCTGA TTTGGAAACA TTAAGCCGTA TGCAGGTCTC TTCAGAACTG1381 GTTCTTCAGC CGGATTGCCC TGGAAAGCAG AGATTGCAGC TCTTCTAAAA CTGCCTCTCA1441 CAGAAGTTCC AAGGCCAGGC TAAATATTGA ATGCAGTACT CAGCAGCTGG GACACCTGAT1501 GCTTTGGTGG CCATCCCTTT CTTCCATCCA AAAGGGCCCC CACTGGAAGG CATCTGTTGT1561 TTTAAAAATA TTTTAGGACT CATTTTTACT TCCCCCACTC CCTCAAGATC ACATACACTC1621 CCCAGTGGGG GTTACAGCCT CTTAGGAGGA ATCGCTGCTT CATGACTTCC TCAGGCATTT1681 ACATTTTTCT CTTCTGTTGA CTTAAATCAT GAACTAAAAT TTATCCCTAG AGGAAAAAAG1741 AATGCTTCCT CCATTCTGGG CTCTTCTCAC TGTACCCAGA CTATGTCTTC AGGACTCTCA1801 TCTCTTGTCA GTTCTGTTGT GCTAGAAAGA CTGGTTTGAA AAAATTCAGC TCGTGTAAAC1861 CTGTGCCCTC CACCCTGTGG GGAACCCATG TGGGGAGCCT TTGAAAATAT CACTTATCAG1921 CTGGGCGCAG TGGCTCATGC CTGTAATCCC AGCACGTTGG GAGGATGAGG TGGGCGGATC1981 ATGAGGTCAG GAGTTCAAGA CCAGCCTGGC CAACATAGTG AAACCCCGTC TCTACTAAAA2041 ATACAAAAAA AAAAAAAAAA AAAACCGAGA CTAGTTCTCT CTCTGTCTCC TGCCTGAACC2101 CTCCTCCTCT TTTTGTTCTG ATCTTTGAGC TCCCTAGAGC CCATAATTCT TTAGAGCAGG2161 TATGTCCCGA GTCTGAAACA TGCCCTTATT TGTCCCAAGC TCTGGACATT TCTCACCCCA2221 AGGCGGATCA ATCATGATTA AATCACTCCA ATTAAACTTT AGGCTCCAGT CAGACCTTCA2281 GCCAAATGGA AAAAAAAACT AGGGGATAAG GGAGGTAGTT GGAGCAAGAA AATGTTATTA2341 GTTGAAACCT TACGGGACCT TCCTCCCTTA GTGAGTCTGT TGGCTAAAGG TTCTCTGGCT2401 TCGTGAATTA GAATTGGATA CTGTTTCCAA GTTAGCAAAA CCAACTCTAC CCCAGCACCC2461 CACGAGGAAG AATGTGGAAG GATCTCCCAT TGGCCGGTTG GGGCAAAAGC CTGAGGCAAT2521 CTTTCATCCC CTTTTGCCAA GGCGAGACTT TCCCAGTGAC GGTGATGTAG TTGGCCACTC2581 TGACTATGGG TGGACTCGGG TGTAGACCTC TGAAGCTGAG ATCACACGAA AACCTGGCCT2641 CCCCGCCATG TAGCTGTTGG AGAGTAGAAA AATAGAGCAC GCCTGATGTT TCTAAATGAG2701 AAGACTTTCA ATAGTAATGA AGAATCCATG GCACTCTCCT CACCCTCAAA CACATGGCAG2761 TCATTCACAT ACAGGCCCCA AAGCCACTGT TAGTGCTGCA GTAGCTCCTG TGGACATTGG2821 AAAGCCCGGA GAGGGCGTGG AAGAAATCAG CTGGCCCCCG GCAGGTTCTC TGGGGTTTTG2881 TGCCCAAGGC TCCTGGAGCC CTAAAAACTT TCAAAAGTTA ACTCCCCACG TCCCCATCCT2941 GCTTGGGTTT CTGGACTTTT CTGAGGCACC GGCAGAGGGG TCTCGTTGCT CCCTTGAGTG3001 TAGGGGCAGC CCTTTAACCT GGCTCCTTGA GTCCCTGCTT TTTCTGCTTC TGTTGCCTTC3061 TTCCTCGTCT TCCTCTCTCT CAATATCTCC CTCTCTTTGT CCCTCCCCAG TTCCTGACCT3121 GGCCATCCCG GGGTGCCCTT GACCAGCCCC GTGCCTCCTC AGGGTGTCCC AGCACCAGCC3181 TGGCACAGAG TGGGGCTCAG TTAGAGTATG TGGGATGTTG GTTTCGCCAG GTGAGTGAAT3241 GAAAGGACTC GACCACCACA GCTGAGCCAC TAGCTGGGCC ATGCGAAGAG TTCTAGGTGC3301 AAAGGCTGGA GGGTGGAATT CATTTTTGAG AGGTGTGTGA GCAGCTTCCG ACCCCTGCCC3361 CATTTGAACG GGGGCCTTGC TGGTCGCGTC CCTGCATTCA CCTGCGCGGC CATCCCGTCA3421 TCCAACAGTT GATCCTAACT GAGCACGCCC ACGGCCCTGG TCTGGCCTGG GCACCGGCCA3481 CCGTAGCCCA TCCCTTGATG GCCTCTGTGT CCCCAGGAGG GCGGGCCGGG GGGTTGCCCA3541 GGGGCTGGAG CAGTGGACTG TGGCTCCATA GAGGTAGGCT GGAGGGTGTG AGGGCAGATT3601 CAAGCTATCC CCAGGGCTCT GCTCTGGTCG GAGCCAGCCC CTTCTCCCTC TCTGCCTTCC3661 CCGCCCCATT CCTGATGCTG AACTGTTCTG GACCCCTGGC CCTGAGTCTC TCAGGACCAA3721 AGTGGGCACG GGAACAGCTG TAGTGTGTGC CCCCCCGGGC TTTGGCACAG GTCTCCCTCT3781 CGAGGTGTGG TTGTGACTGC GACCCTTCCC TTGCCGTGAT GCCTTCCTCC CCCGGGGCTT3841 GGTCCAGCTC CTTCACTCTC TAGCAGCTGC TGGGGCCCAC CTCCCATGCC GAGGACCAGC3901 AGGGGAAACC TCCAGGGAGC ATCTGCAGGC TCTGCTTCTG CCCGGCTGCT GGCTTGCTCT3961 CCCTGGTGGC TCTCCAGCGG CCAGCTTCCT CACCCACCCG GCACTCTGCT TTGCTCTGTC4021 TCCTGAGGTG GGCCTGACCA ACCTCCCCTT CTCTGCCTCA GTCCCTGGGC TCCAGGGCTC4081 AGCTCCACAG CCCTCTGCCT AGCAGGCTGG TTCTCCCTGC CAAGCCCATA CCTGTGGTCA4141 CCTGGCCCTC CTGTGGTCTG AGTACCACTC CCCTGCCCCA GGAGCCACTC CCACTCCAGC4201 TGCCTGTTTC CAGCAGGTTC CCAGTGTCCC CGACAAGCCC CTGCTGGTGT CTCCATCTCC4261 TGCCAAGCAT CCTCCAGTGC CTCCTCCTGT GGGCCTGGCC TCAGGGCTAT GGACAGACTC4321 CTGTCCCATC CCAGAGACCC CTCGTGATCG TGCCCTGGCA CGTGGGCCGT GGCCCGGCTG4381 GGTCGGCTGA AGAACTGCGG ATGGAAGCTG CGGAAGAGGC CCTGATGGGG CCCACCATCC4441 CGGACCCAAG TCTTCTTCCT GGCGGGCCTC TCGTCTCCTT CCTGGTTTGG GCGGAAGCCA4501 TCACCTGGAT GCCTACGTGG GAAGGGACCT CGAATGTGGG ACCCCAGCCC CTCTCCAGCT4561 CGAAATCCCT CCACAGCCAC GGGGACACCC TGCACCTATT CCCACGGGAC AGGCTGGACC4621 CAAAGACTCT GGACCCGGGG CCTCCCCTTG AGTAGAGACC CGCCCTCTGA CTGATGGACG4681 CCGCTGACCT GGGGTCAGAC CCGTGGGCTG GACCCCTGCC CACCCCGCAG GAACCCTGAG4741 GCCTAGGGGA GCTGTTGAGC CTTCAGTGTC TGCATGTGGG AAGTGGGCTC CTTCACCTAC4801 CTCACAGGGC TGTTGTGAGG GGCGCTGTGA TGCGGTTCCA AAGCACAGGG CTTGGCGCAC4861 CCCCCTGTGC TCTCAATAAA TGTGTTTCCT GTCTTAAAAA AAAAAAAAAA AAAAAAAAAA4921 AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAB核苷酸序列(SEQ ID NO12)长度148个氨基酸1 MRRLSIVMKN PWHSPHPQTH GSHSHTGPKA TVSAAVAPVD IGKPGEGVEE ISWPPAGSLG61 FCAQGSWSPK NFQKLTPHVP ILLGFLDFSE APAEGSRCSL ECRGSPLTWL LESLLFLLLL121 PSSSSSSLSI SPSLCPSPVP DLAIPGCPC.核苷酸及氨基酸组合序列(SEQ ID NO11) 克隆号和蛋白名称FP504起始编码子2696 ATG 终止编码子3140 TGA 蛋白质分子量15611.031 G CTA AGC AGT AAA CTA AAG GAT TAT ATA TTA TTA GTC TCA GTG GTT TTC AGA TTT ATT 5859 TTT AAA GGG GAA AAC AGG GAA AAC CCA TCG TAT TTG TAA AGC ACT TTA GGA TTT TGC CGT 118119 TTG TTT CTG ATT GTT TGA AGA TTA GGG CTT TTT GGT GCG TGG TCA CCT TTC ACC TCT CCT 178179 TTT AGG ATT TAG TCC TTT CCA GTC TGC TCT TTT TGT GCG TGT CAC AAC CAT ATT CTT GTG 238239 GTT CTG GCT CAT ATT GTA GAA CTG CTG AAC ATA AGG AGA GGT AGC CAG CTG TAT GGT CGG 298299 ATT TAA TAT ATA ATG TTA TAT GTT GGG ATA TCT TAG TGG TTT GTT TTC TGA GGT AAG TTT 358359 CTT AGT GTT GTG TTT GAG ACA TTG TGT TTG CGT TTA TGG CGA CAC TGT CAT TCA TGC ACT 418419 TGG CCA TCT GAG CGT GGA TAC AGC GGG CAC TCG GGT CTC TCT GCC AGA TGG ATG AAA GCA 478479 GTG TAC ATT CCA GTG TGG GAG ACA GAC ATG TGG ACA GGT AAA TTA CAA GGC AGT GTG ATA 538539 AAG AGT AGA GAG TTG GTT GAG AGA GAT CTT AGA CAC CAT CCT CAT TGT GTA GAT GAG AAA 598599 GCA AAA GTC ACC AAG GCA GCC TGG CAG CTG GGA CCC AAG AAG CCT AGG GTG CCA GTC CTT 658659 GGG CAG TGC GGG GTT AGG CAC ACC CAG GGC CCT CCT GGT TCC TGG CTG ACT CTT GGA CTC 718719 TTT GTC TCT AAT TGG AGG CCA TGA TGC CCA GCT GTA AGG TGG TCA GCT TCA TTT GAG ACA 778779 CTA TAT CCT TTA GCA CAG CGG GGT AAT TTC TTC CCT CCT GTT TCA TTC ATT TAC CAA ATG 838839 GCC TCC TAA ATG ATC TAA AAT CAC TTG GAT CTT TTG TCT TTG TGG ACC TAA CAC CTG GCT 898899 TTT AAA GTT TAA CTT TCT GTC CCC TCT TCA GCT TGC TAA AAT TGA AAA GTG TTG CAG CCC 958959 AAC CTC CAC AAA TCT TGT CTC AGG AAA TAA GAG ACA TTT GTT AAC ATT TGT TTT GTA CCT10181019 CTC AGC AGC TTA GTT GAC AAG GGC ACC GTG TGG GAT TTC CTG TTC TTG CTC ATT TGG AAA10781079 GAG AAT GTT CTT TGT TCT TAG ACC CTC AGC TCT CAT GTG AGA GCC ATA GAA TGT TGC GAG11381139 GTG GAG TTC TGT GGA TAC AGA AGG AAT GTT TTC AAG TTA GAC TTA CTG CCA ATG TTA GGA11981199 TTT GGG ACT TTG CAT GAT TGG GAG GGA GAG GGA GTG CTG GAG AAC AGG TTA AAA GTT GTC12581259 CCG CTG AGC TTG GAG CAT CTC CTG CCA ACC CGG AGT GCT TCC CAG GAA CCC TGC CAG TGT13181319 CAC TTG GGG TTA TGT TTT CTG ATT TGG AAA CAT TAA GCC GTA TGC AGG TCT CTT CAG AAC13781379 TGG TTC TTC AGC CGG ATT GCC CTG GAA AGC AGA GAT TGC AGC TCT TCT AAA ACT GCC TCT14381439 CAC AGA AGT TCC AAG GCC AGG CTA AAT ATT GAA TGC AGT ACT CAG CAG CTG GGA CAC CTG14981499 ATG CTT TGG TGG CCA TCC CTT TCT TCC ATC CAA AAG GGC CCC CAC TGG AAG GCA TCT GTT15581559 GTT TTA AAA ATA TTT TAG GAC TCA TTT TTA CTT CCC CCA CTC CCT CAA GAT CAC ATA CAC16181619 TCC CCA GTG GGG GTT ACA GCC TCT TAG GAG GAA TCG CTG CTT CAT GAC TTC CTC AGG CAT16781679 TTA CAT TTT TCT CTT CTG TTG ACT TAA ATC ATG AAC TAA AAT TTA TCC CTA GAG GAA AAA17381739 AGA ATG CTT CCT CCA TTC TGG GCT CTT CTC ACT GTA CCC AGA CTA TGT CTT CAG GAC TCT17981799 CAT CTC TTG TCA GTT CTG TTG TGC TAG AAA GAC TGG TTT GAA AAA ATT CAG CTC GTG TAA18581859 ACC TGT GCC CTC CAC CCT GTG GGG AAC CCA TGT GGG GAG CCT TTG AAA ATA TCA CTT ATC19181919 AGC TGG GCG CAG TGG CTC ATG CCT GTA ATC CCA GCA CGT TGG GAG GAT GAG GTG GGC GGA19781979 TCA TGA GGT CAG GAG TTC AAG ACC AGC CTG GCC AAC ATA GTG AAA CCC CGT CTC TAC TAA20382039 AAA TAC AAA AAA AAA AAA AAA AAA AAC CGA GAC TAG TTC TCT CTC TGT CTC CTG CCT GAA20982099 CCC TCC TCC TCT TTT TGT TCT GAT CTT TGA GCT CCC TAG AGC CCA TAA TTC TTT AGA GCA21582159 GGT ATG TCC CGA GTC TGA AAC ATG CCC TTA TTT GTC CCA AGC TCT GGA CAT TTC TCA CCC22182219 CAA GGC GGA TCA ATC ATG ATT AAA TCA CTC CAA TTA AAC TTT AGG CTC CAG TCA GAC CTT22782279 CAG CCA AAT GGA AAA AAA AAC TAG GGG ATA AGG GAG GTA GTT GGA GCA AGA AAA TGT TAT23382339 TAG TTG AAA CCT TAC GGG ACC TTC CTC CCT TAG TGA GTC TGT TGG CTA AAG GTT CTC TGG23982399 CTT CGT GAA TTA GAA TTG GAT ACT GTT TCC AAG TTA GCA AAA CCA ACT CTA CCC CAG CAC24582459 CCC ACG AGG AAG AAT GTG GAA GGA TCT CCC ATT GGC CGG TTG GGG CAA AAG CCT GAG GCA25182519 ATC TTT CAT CCC CTT TTG CCA AGG CGA GAC TTT CCC AGT GAC GGT GAT GTA GTT GGC CAC25782579 TCT GAC TAT GGG TGG ACT CGG GTG TAG ACC TCT GAA GCT GAG ATC ACA CGA AAA CCT GGC26382639 CTC CCC GCC ATG TAG CTG TTG GAG AGT AGA AAA ATA GAG CAC GCC TGA TGT TTC TAA ATG26981 Met 12699 AGA AGA CTT TCA ATA GTA ATG AAG AAT CCA TGG CAC TCT CCT CAC CCT CAA ACA CAT GGC27582 Arg Arg Leu Ser Ile Val Met Lys Asn Pro Trp His Ser Pro His Pro Gln Thr His Gly 212759 AGT CAT TCA CAT ACA GGC CCC AAA GCC ACT GTT AGT GCT GCA GTA GCT CCT GTG GAC ATT281822 Ser His Ser His Thr Gly Pro Lys Ala Thr Val Ser Ala Ala Val Ala Pro Val Asp Ile 412819 GGA AAG CCC GGA GAG GGC GTG GAA GAA ATC AGC TGG CCC CCG GCA GGT TCT CTG GGG TTT287842 Gly Lys Pro Gly Glu Gly Val Glu Glu Ile Ser Trp Pro Pro Ala Gly Ser Leu Gly Phe 612879 TGT GCC CAA GGC TCC TGG AGC CCT AAA AAC TTT CAA AAG TTA ACT CCC CAC GTC CCC ATC293862 Cys Ala Gln Gly Ser Trp Ser Pro Lys Asn Phe Gln Lys Leu Thr Pro His Val Pro Ile 812939 CTG CTT GGG TTT CTG GAC TTT TCT GAG GCA CCG GCA GAG GGG TCT CGT TGC TCC CTT GAG299882 Leu Leu Gly Phe Leu Asp Phe Ser Glu Ala Pro Ala Glu Gly Ser Arg Cys Ser Leu Glu 1012999 TGT AGG GGC AGC CCT TTA ACC TGG CTC CTT GAG TCC CTG CTT TTT CTG CTT CTG TTG CCT3058102 Cys Arg Gly Ser Pro Leu Thr Trp Leu Leu Glu Ser Leu Leu Phe Leu Leu Leu Leu Pro 1213059 TCT TCC TCG TCT TCC TCT CTC TCA ATA TCT CCC TCT CTT TGT CCC TCC CCA GTT CCT GAC3118122 Ser Ser Ser Ser Ser Ser Leu Ser Ile Ser Pro Ser Leu Cys Pro Ser Pro Val Pro Asp 1413119 CTG GCC ATC CCG GGG TGC CCT TGA CCA GCC CCG TGC CTC CTC AGG GTG TCC CAG CAC CAG3178142 Leu Ala Ile Pro Gly Cys Pro *** 1493179 CCT GGC ACA GAG TGG GGC TCA GTT AGA GTA TGT GGG ATG TTG GTT TCG CCA GGT GAG TGA32383239 ATG AAA GGA CTC GAC CAC CAC AGC TGA GCC ACT AGC TGG GCC ATG CGA AGA GTT CTA GGT32983299 GCA AAG GCT GGA GGG TGG AAT TCA TTT TTG AGA GGT GTG TGA GCA GCT TCC GAC CCC TGC33583359 CCC ATT TGA ACG GGG GCC TTG CTG GTC GCG TCC CTG CAT TCA CCT GCG CGG CCA TCC CGT34183419 CAT CCA ACA GTT GAT CCT AAC TGA GCA CGC CCA CGG CCC TGG TCT GGC CTG GGC ACC GGC34783479 CAC CGT AGC CCA TCC CTT GAT GGC CTC TGT GTC CCC AGG AGG GCG GGC CGG GGG GTT GCC35383539 CAG GGG CTG GAG CAG TGG ACT GTG GCT CCA TAG AGG TAG GCT GGA GGG TGT GAG GGC AGA35983599 TTC AAG CTA TCC CCA GGG CTC TGC TCT GGT CGG AGC CAG CCC CTT CTC CCT CTC TGC CTT36583659 CCC CGC CCC ATT CCT GAT GCT GAA CTG TTC TGG ACC CCT GGC CCT GAG TCT CTC AGG ACC37183719 AAA GTG GGC ACG GGA ACA GCT GTA GTG TGT GCC CCC CCG GGC TTT GGC ACA GGT CTC CCT37783779 CTC GAG GTG TGG TTG TGA CTG CGA CCC TTC CCT TGC CGT GAT GCC TTC CTC CCC CGG GGC38383839 TTG GTC CAG CTC CTT CAC TCT CTA GCA GCT GCT GGG GCC CAC CTC CCA TGC CGA GGA CCA38983899 GCA GGG GAA ACC TCC AGG GAG CAT CTG CAG GCT CTG CTT CTG CCC GGC TGC TGG CTT GCT39583959 CTC CCT GGT GGC TCT CCA GCG GCC AGC TTC CTC ACC CAC CCG GCA CTC TGC TTT GCT CTG40184019 TCT CCT GAG GTG GGC CTG ACC AAC CTC CCC TTC TCT GCC TCA GTC CCT GGG CTC CAG GGC40784079 TCA GCT CCA CAG CCC TCT GCC TAG CAG GCT GGT TCT CCC TGC CAA GCC CAT ACC TGT GGT41384139 CAC CTG GCC CTC CTG TGG TCT GAG TAC CAC TCC CCT GCC CCA GGA GCC ACT CCC ACT CCA41984199 GCT GCC TGT TTC CAG CAG GTT CCC AGT GTC CCC GAC AAG CCC CTG CTG GTG TCT CCA TCT42584259 CCT GCC AAG CAT CCT CCA GTG CCT CCT CCT GTG GGC CTG GCC TCA GGG CTA TGG ACA GAC43184319 TCC TGT CCC ATC CCA GAG ACC CCT CGT GAT CGT GCC CTG GCA CGT GGG CCG TGG CCC GGC43784379 TGG GTC GGC TGA AGA ACT GCG GAT GGA AGC TGC GGA AGA GGC CCT GAT GGG GCC CAC CAT44384439 CCC GGA CCC AAG TCT TCT TCC TGG CGG GCC TCT CGT CTC CTT CCT GGT TTG GGC GGA AGC44984499 CAT CAC CTG GAT GCC TAC GTG GGA AGG GAC CTC GAA TGT GGG ACC CCA GCC CCT CTC CAG45584559 CTC GAA ATC CCT CCA CAG CCA CGG GGA CAC CCT GCA CCT ATT CCC ACG GGA CAG GCT GGA46184619 CCC AAA GAC TCT GGA CCC GGG GCC TCC CCT TGA GTA GAG ACC CGC CCT CTG ACT GAT GGA46784679 CGC CGC TGA CCT GGG GTC AGA CCC GTG GGC TGG ACC CCT GCC CAC CCC GCA GGA ACC CTG47384739 AGG CCT AGG GGA GCT GTT GAG CCT TCA GTG TCT GCA TGT GGG AAG TGG GCT CCT TCA CCT47984799 ACC TCA CAG GGC TGT TGT GAG GGG CGC TGT GAT GCG GTT CCA AAG CAC AGG GCT TGG CGC48584859 ACC CCC CTG TGC TCT CAA TAA ATG TGT TTC CTG TCT TAA AAA AAA AAA AAA AAA AAA AAA49184919 AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA A 49525.FP972A核苷酸序列(SEQ ID NO13)长度3112个碱基1 GCGACGGCGA GAGCTAGAGC GGGCGCAGCG TTAGGGTGGC CGTGCAAGGG GAGCCGTGGC61 CCGGGCCCGG GGCGTGCGAG ACGGCGGAAG CAGCCCAGGG CCTTGCTGCC GCCATGACTG121 AGGAATCAGA GGAGACAGTC CTGTACATTG AGCACCGCTA TGTCTGCTCT GAGTGCAACC181 AGCTGTATGG ATCACTGGAA GAGGTGCTTA TGCACCAAAA CTCCCACGTG CCCCAGCAGC241 ACTTTGAGCT GGTGGGCGTG GCTGATCCCG GAGTCACTGT GGCCACAGAC ACAGCTTCAG301 GCACGGGCCT CTATCAGACC CTTGTGCAGG AGAGCCAGTA CCAGTGCCTG GAGTGTGGTC361 AACTGCTGAT GTCACCCAGC CAGCTCCTGG AGCACCAGGA GCTGCACCTG AAGATGATGG421 CACCCCAGGA GGCAGTGCCA GCTGAGCCAT CACCTAAGGC ACCACCCCTG AGCTCCAGCA481 CCATCCACTA CGAGTGTGTG GATTGCAAGG CTCTCTTTGC CAGCCAGGAG CTCTGGCTGA541 ACCACCGGCA GACGCACCTC CGGGCCACAC CCACCAAGGC TCCTGCCCCT GTTGTCCTGG601 GGTCCCCAGT TGTTCTAGGG CCTCCTGTGG GCCAGGCCCG AGTGGCTGTG GAGCACTCAT661 ACCGAAAGGC AGAAGAGGGT GGGGAAGGGG CGACTGTCCC ATCTGCCGCT GCCACCACCA721 CTGAGGTAGT GACTGAGGTG GAGCTGCTCC TCTACAAGTG CTCTGAGTGC TCCCAGCTCT781 TCCAGCTGCC GGCGGATTTC CTGGAGCACC AGGCCACTCA CTTCCCTGCT CCTGTACCCG841 AGTCTCAGGA GCCTGCCTTA CAGCAGGAGG TGCAGGCCTC GTCACCTGCA GAGGTGCCTG901 TGTCTCAGCC TGACCCCTTG CCAGCTTCTG ACCACAGTTA CGAGCTGCGC AATGGTGAAG961 CCATTGGGCG GGATCGCCGG GGGCGCAGGG CCCGGAGGAA CAACAGTGGA GAAGCAGGCG1021 GGGCAGCCAC ACAGGAGCTC TTCTGCTCAG CCTGTGACCA GCTCTTTCTC TCACCCCACC1081 AGCTACAGCA GCACCTGCGG AGTCACCGGG AGGGCGTCTT TAAGTGCCCC CTGTGCAGTC1141 GTGTCTTCCC TAGCCCTTCC AGTCTGGACC AGCACCTTGG AGACCATAGC AGCGAGTCAC1201 ACTTCCTGTG TGTAGACTGT GGCCTGGCCT TCGGCACAGA GGCCCTCCTC CTGGCCCACC1261 GGCGAGCCCA CACCCCGAAT CCTCTGCATT CATGTCCATG TGGGAAGACC TTTGTCAACC1321 TTACCAAGTT CCTTTATCAC CGGCGTACTC ATGGGGTAGG GGGGGTGTCC CTCTGCCCAC1381 AACACCAGTC CCACCAGAGG AACCTGTCAT TGGTTTCCCT GAGCCAGCCC CAGCAGAGAC1441 TGGAGAGCCA GAGGCCCCTG AGCCCCCTGT GTCTGAGGAG ACCTCAGCAG GGCCCGCTGC1501 CCCAGGCACC TACCGCTGCC TCCTGTGCAG CCGTGAATTT GGAAAGGCCT TGCAGCTGAC1561 CCGGCACCAA CGTTTTGTGC ATCGGCTGGA GCGGCGCCAT AAATGCAGCA TTTGTGGCAA1621 GATGTTCAAG AAGAAGTCTC ACGTGCGTAA CCACCTGCGC ACACACACAG GGGAGCGGCC1681 CTTCCCCTGC CCTGACTGCT CCAAGCCCTT CAACTCACCT GCCAACCTGG CCCGCCACCG1741 GCTCACACAC ACAGGAGAGC GGCCCTACCG GTGTGGGGAC TGTGGCAAGG CTTTCACGCA1801 AAGCTCCACA CTGAGGCAGC ACCGCTTGGT GCATGCCCAG CACTTTCCCT ACCGCTGCCA1861 GGAATGTGGG GTGCGTTTTC ACCGTCCTTA CCGGCTGCTC ATGCACCGCT ACCATCACAC1921 AGGTGAATAC CCCTACAAGT GTCGCGAGTG CCCCCGCTCC TTCTTGCTGC GTCGGCTGCT1981 GGAGGTGCAC CAGCTCGTGG TCCATGCCGG GCGCCAGCCC CACCGCTGCC CATCCTGTGG2041 GGCTGCCTTC CCCTCCTCAC TGCGGCTCCG GGAGCACCGC TGTGCAGCCG CTGCTGCCCA2101 GGCCCCACGG CGCTTTGAGT GTGGCACCTG TGGCAAGAAA GTGGGCTCAG CTGCTCGACT2161 GCAGGCACAC GAGGCGGCCC ATGCAGCTGC TGGGCCTGGA GAGGTCCTGG CTAAGGAGCC2221 CCCTGCCCCT CGAGCCCCAC GGGCCACTCG TGCACCAGTT GCCTCTCCAG CAGCCCTTGG2281 AAGCACTGCT ACAGCATCCC CTGCGGCCCC TGCCCGCCGC CGGGGTCTAG AGTGCAGCGA2341 GTGCAAGAAG CTGTTCAGCA CAGAGACGTC ACTGCAGGTG CACCGGCGCA TCCACACAGG2401 TGAGCGGCCA TACCCATGTC CAGACTGTGG CAAAGCGTTC CGTCAGAGTA CCCACCTGAA2461 AGACACCGGC GCCTGCACAC AGGTGAGCGG CCCTTTGCCT GTGAAGTGTG TGGCAAGGCC2521 TTTGCCATCT CCATGCGCCT GGCAGAACAT CGCCGCATCC ACACAGGCGA ACGACCCTAC2581 TCCTGCCCTG ACTGTGGCAA GAGCTACCGC TCCTTCTCCA ACCTCTGGAA GCACCGCAAG2641 ACCCATCAGC AGCAGCATCA GGCAGCTGTG CGGCAGCAGC TGGCAGAGGC GGAGGCTGCC2701 GTTGGCCTGG CCGTCATGGA GACTGCTGTG GAGGCGCTAC CCCTGGTGGA AGCCATTGAG2761 ATCTACCCTC TGGCCGAGGC TGAGGGGGTC CAGATCAGTG GCTGACTCTG CCCGACTTCC2821 TCTTTGGCAC CTCCATTCCC TGTTGCTGAA GGCCCTCCAG CATCCCCTTA AGCATCTGTA2881 CATACTGTGT CCCTTCCTCT TCCCATCCCC ACCACCTTGT AAGTTCTAAA TTGGATTTAT2941 TCTCTCGTGA GGGGGGTGCT CTGGGGTCCT TGACACACAT AAAGGTGCCC CCCCACCTTC3001 CACCTCTTAG CACTGGTGAC CCCAAAAATG AAACCATCAA TAAAGACTGA GTTGCCAGCA3061 GTGTGTAGAG TGGAAAAAAA AAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAB核苷酸序列(SEQ ID N015)长度545个氨基酸1 MSMWEDLCQP YQVPLSPAYS WGRGGVPLPT TPVPPEEPVI GFPEPAPAET GEPEAPEPPV61 SEETSAGPAA PGTYRCLLCS REFGKALQLT RHQRFVHRLE RRHKCSICGK MFKKKSHVRN121 HLRTHTGERP FPCPDCSKPF NSPANLARHR LTHTGERPYR CGDCGKAFTQ SSTLRQHRLV181 HAQHFPYRCQ ECGVRFHRPY RLLMHRYHHT GEYPYKCREC PRSFLLRRLL EVHQLVVHAG241 RQPHRCPSCG AAFPSSLRLR EHRCAAAAAQ APRRFECGTC GKKVGSAARL QAHEAAHAAA301 GPGEVLAKEP PAPRAPRATR APVASPAALG STATASPAAP ARRRGLECSE CKKLFSTETS361 LQVHRRIHTG ERPYPCPDCG KAFRQSTHLK DTGACTQVSG PLPVKCVARP LPSPCAWQNI421 AASTQANDPT PALTVARATA PSPTSGSTAR PISSSIRQLC GSSWQRRRLP LAWPSWRLLW481 RRYPWWKPLR STLWPRLRGS RSVADSARLP LWHLHSLLLK ALQHPLKHLY ILCPFLFPSP541 PPCKFC.核苷酸及氨基酸组合序列(SEQ ID NO14) 克隆号和蛋白名称FP972起始编码子1292 ATG 终止编码子2927 TAA 蛋白质分子量60743.811 G CGA CGG CGA GAG CTA GAG CGG GCG CAG CGT TAG GGT GGC CGT GCA AGG GGA GCC GTG 5859 GCC CGG GCC CGG GGC GTG CGA GAC GGC GGA AGC AGC CCA GGG CCT TGC TGC CGC CAT GAC118119 TGA GGA ATC AGA GGA GAC AGT CCT GTA CAT TGA GCA CCG CTA TGT CTG CTC TGA GTG CAA178179 CCA GCT GTA TGG ATC ACT GGA AGA GGT GCT TAT GCA CCA AAA CTC CCA CGT GCC CCA GCA238239 GCA CTT TGA GCT GGT GGG CGT GGC TGA TCC CGG AGT CAC TGT GGC CAC AGA CAC AGC TTC298299 AGG CAC GGG CCT CTA TCA GAC CCT TGT GCA GGA GAG CCA GTA CCA GTG CCT GGA GTG TGG358359 TCA ACT GCT GAT GTC ACC CAG CCA GCT CCT GGA GCA CCA GGA GCT GCA CCT GAA GAT GAT418419 GGC ACC CCA GGA GGC AGT GCC AGC TGA GCC ATC ACC TAA GGC ACC ACC CCT GAG CTC CAG478479 CAC CAT CCA CTA CGA GTG TGT GGA TTG CAA GGC TCT CTT TGC CAG CCA GGA GCT CTG GCT538539 GAA CCA CCG GCA GAC GCA CCT CCG GGC CAC ACC CAC CAA GGC TCC TGC CCC TGT TGT CCT598599 GGG GTC CCC AGT TGT TCT AGG GCC TCC TGT GGG CCA GGC CCG AGT GGC TGT GGA GCA CTC658659 ATA CCG AAA GGC AGA AGA GGG TGG GGA AGG GGC GAC TGT CCC ATC TGC CGC TGC CAC CAC718719 CAC TGA GGT AGT GAC TGA GGT GGA GCT GCT CCT CTA CAA GTG CTC TGA GTG CTC CCA GCT778779 CTT CCA GCT GCC GGC GGA TTT CCT GGA GCA CCA GGC CAC TCA CTT CCC TGC TCC TGT ACC838839 CGA GTC TCA GGA GCC TGC CTT ACA GCA GGA GGT GCA GGC CTC GTC ACC TGC AGA GGT GCC898899 TGT GTC TCA GCC TGA CCC CTT GCC AGC TTC TGA CCA CAG TTA CGA GCT GCG CAA TGG TGA958959 AGC CAT TGG GCG GGA TCG CCG GGG GCG CAG GGC CCG GAG GAA CAA CAG TGG AGA AGC AGG 10181019 CGG GGC AGC CAC ACA GGA GCT CTT CTG CTC AGC CTG TGA CCA GCT CTT TCT CTC ACC CCA 10781079 CCA GCT ACA GCA GCA CCT GCG GAG TCA CCG GGA GGG CGT CTT TAA GTG CCC CCT GTG CAG 11381139 TCG TGT CTT CCC TAG CCC TTC CAG TCT GGA CCA GCA CCT TGG AGA CCA TAG CAG CGA GTC 11981199 ACA CTT CCT GTG TGT AGA CTG TGG CCT GGC CTT CGG CAC AGA GGC CCT CCT CCT GGC CCA 12581259 CCG GCG AGC CCA CAC CCC GAA TCC TCT GCA TTC ATG TCC ATG TGG GAA GAC CTT TGT CAA 13181 Met Ser Met Trp Glu Asp Leu Cys Gln 91319 CCT TAC CAA GTT CCT TTA TCA CCG GCG TAC TCA TGG GGT AGG GGG GGT GTC CCT CTG CCC 137810 Pro Tyr Gln Val Pro Leu Ser Pro Ala Tyr Ser Trp Gly Arg Gly Gly Val Pro Leu Pro 291379 ACA ACA CCA GTC CCA CCA GAG GAA CCT GTC ATT GGT TTC CCT GAG CCA GCC CCA GCA GAG 143830 Thr Thr Pro Val Pro Pro Glu Glu Pro Val Ile Gly Phe Pro Glu Pro Ala Pro Ala Glu 491439 ACT GGA GAG CCA GAG GCC CCT GAG CCC CCT GTG TCT GAG GAG ACC TCA GCA GGG CCC GCT 149850 Thr Gly Glu Pro Glu Ala Pro Glu Pro Pro Val Ser Glu Glu Thr Ser Ala Gly Pro Ala 691499 GCC CCA GGC ACC TAC CGC TGC CTC CTG TGC AGC CGT GAA TTT GGA AAG GCC TTG CAG CTG155870 Ala Pro Gly Thr Tyr Arg Cys Leu Leu Cys Ser Arg Glu Phe Gly Lys Ala Leu Gln Leu 891559 ACC CGG CAC CAA CGT TTT GTG CAT CGG CTG GAG CGG CGC CAT AAA TGC AGC ATT TGT GGC161890 Thr Arg His Gln Arg Phe Val His Arg Leu Glu Arg Arg His Lys Cys Ser Ile Cys Gly 1091619 AAG ATG TTC AAG AAG AAG TCT CAC GTG CGT AAC CAC CTG CGC ACA CAC ACA GGG GAG CGG1678110 Lys Met Phe Lys Lys Lys Ser His Val Arg Asn His Leu Arg Thr His Thr Gly Glu Arg 1291679 CCC TTC CCC TGC CCT GAC TGC TCC AAG CCC TTC AAC TCA CCT GCC AAC CTG GCC CGC CAC1738130 Pro Phe Pro Cys Pro Asp Cys Ser Lys Pro Phe Asn Ser Pro Ala Asn Leu Ala Arg His 1491739 CGG CTC ACA CAC ACA GGA GAG CGG CCC TAC CGG TGT GGG GAC TGT GGC AAG GCT TTC ACG1798150 Arg Leu Thr His Thr Gly Glu Arg Pro Tyr Arg Cys Gly Asp Cys Gly Lys Ala Phe Thr 1691799 CAA AGC TCC ACA CTG AGG CAG CAC CGC TTG GTG CAT GCC CAG CAC TTT CCC TAC CGC TGC1858170 Gln Ser Ser Thr Leu Arg Gln His Arg Leu Val His Ala Gln His Phe Pro Tyr Arg Cys 1891859 CAG GAA TGT GGG GTG CGT TTT CAC CGT CCT TAC CGG CTG CTC ATG CAC CGC TAC CAT CAC1918190 Gln Glu Cys Gly Val Arg Phe His Arg Pro Tyr Arg Leu Leu Met His Arg Tyr His His 2091919 ACA GGT GAA TAC CCC TAC AAG TGT CGC GAG TGC CCC CGC TCC TTC TTG CTG CGT CGG CTG1978210 Thr Gly Glu Tyr Pro Tyr Lys Cys Arg Glu Cys Pro Arg Ser Phe Leu Leu Arg Arg Leu 2291979 CTG GAG GTG CAC CAG CTC GTG GTC CAT GCC GGG CGC CAG CCC CAC CGC TGC CCA TCC TGT2038230 Leu Glu Val His Gln Leu Val Val His Ala Gly Arg Gln Pro His Arg Cys Pro Ser Cys 2492039 GGG GCT GCC TTC CCC TCC TCA CTG CGG CTC CGG GAG CAC CGC TGT GCA GCC GCT GCT GCC2098250 Gly Ala Ala Phe Pro Ser Ser Leu Arg Leu Arg Glu His Arg Cys Ala Ala Ala Ala Ala 2692099 CAG GCC CCA CGG CGC TTT GAG TGT GGC ACC TGT GGC AAG AAA GTG GGC TCA GCT GCT CGA2158270 Gln Ala Pro Arg Arg Phe Glu Cys Gly Thr Cys Gly Lys Lys Val Gly Ser Ala Ala Arg 2892159 CTG CAG GCA CAC GAG GCG GCC CAT GCA GCT GCT GGG CCT GGA GAG GTC CTG GCT AAG GAG2218290 Leu Gln Ala His Glu Ala Ala His Ala Ala Ala Gly Pro Gly Glu Val Leu Ala Lys Glu 3092219 CCC CCT GCC CCT CGA GCC CCA CGG GCC ACT CGT GCA CCA GTT GCC TCT CCA GCA GCC CTT2278310 Pro Pro Ala Pro Arg Ala Pro Arg Ala Thr Arg Ala Pro Val Ala Ser Pro Ala Ala Leu 3292279 GGA AGC ACT GCT ACA GCA TCC CCT GCG GCC CCT GCC CGC CGC CGG GGT CTA GAG TGC AGC2338330 Gly Ser Thr Ala Thr Ala Ser Pro Ala Ala Pro Ala Arg Arg Arg Gly Leu Glu Cys Ser 3492339 GAG TGC AAG AAG CTG TTC AGC ACA GAG ACG TCA CTG CAG GTG CAC CGG CGC ATC CAC ACA2398350 Glu Cys Lys Lys Leu Phe Ser Thr Glu Thr Ser Leu Gln Val His Arg Arg Ile His Thr 3692399 GGT GAG CGG CCA TAC CCA TGT CCA GAC TGT GGC AAA GCG TTC CGT CAG AGT ACC CAC CTG2458370 Gly Glu Arg Pro Tyr Pro Cys Pro Asp Cys Gly Lys Ala Phe Arg Gln Ser Thr His Leu 3892459 AAA GAC ACC GGC GCC TGC ACA CAG GTG AGC GGC CCT TTG CCT GTG AAG TGT GTG GCA AGG2518390 Lys Asp Thr Gly Ala Cys Thr Gln Val Ser Gly Pro Leu Pro Val Lys Cys Val Ala Arg 4092519 CCT TTG CCA TCT CCA TGC GCC TGG CAG AAC ATC GCC GCA TCC ACA CAG GCG AAC GAC CCT2578410 Pro Leu Pro Ser Pro Cys Ala Trp Gln Asn Ile Ala Ala Ser Thr Gln Ala Asn Asp Pro 4292579 ACT CCT GCC CTG ACT GTG GCA AGA GCT ACC GCT CCT TCT CCA ACC TCT GGA AGC ACC GCA2638430 Thr Pro Ala Leu Thr Val Ala Arg Ala Thr Ala Pro Ser Pro Thr Ser Gly Ser Thr Ala 4492639 AGA CCC ATC AGC AGC AGC ATC AGG CAG CTG TGC GGC AGC AGC TGG CAG AGG CGG AGG CTG2698450 Arg Pro Ile Ser Ser Ser Ile Arg Gln Leu Cys Gly Ser Ser Trp Gln Arg Arg Arg Leu 4692699 CCG TTG GCC TGG CCG TCA TGG AGA CTG CTG TGG AGG CGC TAC CCC TGG TGG AAG CCA TTG2758470 Pro Leu Ala Trp Pro Ser Trp Arg Leu Leu Trp Arg Arg Tyr Pro Trp Trp Lys Pro Leu 4892759 AGA TCT ACC CTC TGG CCG AGG CTG AGG GGG TCC AGA TCA GTG GCT GAC TCT GCC CGA CTT2818490 Arg Ser Thr Leu Trp Pro Arg Leu Arg Gly Ser Arg Ser Val Ala Asp Ser Ala Arg Leu 5092819 CCT CTT TGG CAC CTC CAT TCC CTG TTG CTG AAG GCC CTC CAG CAT CCC CTT AAG CAT CTG2878510 Pro Leu Trp His Leu His Ser Leu Leu Leu Lys Ala Leu Gln His Pro Leu Lys His Leu 5292879 TAC ATA CTG TGT CCC TTC CTC TTC CCA TCC CCA CCA CCT TGT AAG TTC TAA ATT GGA TTT2938530 Tyr Ile Leu Cys Pro Phe Leu Phe Pro Ser Pro Pro Pro Cys Lys Phe *** 5462939 ATT CTC TCG TGA GGG GGG TGC TCT GGG GTC CTT GAC ACA CAT AAA GGT GCC CCC CCA CCT29982999 TCC ACC TCT TAG CAC TGG TGA CCC CAA AAA TGA AAC CAT CAA TAA AGA CTG AGT TGC CAG30583059 CAG TGT GTA GAG TGG AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA31126.FP6628A核苷酸序列(SEQ ID NO16)长度3102个碱基1 GGGCAGAGGT TGCAGTAACC CAAGATCATG CCACCATACT ACAGACTGTG TGACAGAGCG61 AGACTCTGTC TCAAAACAAC AACAAAAAAA CAAACTCACC ATTGTACCTG TGCTTATGCA121 AGGTTTAGTA GGAACGTAAA TTGGTTTAAC CTTTGTGGAC AGAAGTTTTA AAAATATATA181 TTAAAATTAA AAGTATGCTC TGAAGGAGGA ACTCCACTTC TGGTAATTTA TCTCAAGAGA241 ATAACTGGGC CAGCACAAAG GCTGCTGTTT AACAATGTGT AATGATGCAG TGACAGCTAC301 AATTGCAAAA ATAACCTAGA CATTCACCAA TGAGGACTGG TTAAATGAAC TAGTATAACC361 ATACTGCAGA ATATCATAAA GATAACAAAA AAATGATATG GATCTGTTTC TTGGCATAAA421 TATATCCATA AGTTTTAAGA AGAGATGCTA TATATACGGT GGTCCCATTG ATGTATAACT481 GTTAGGACTA AAAATAGTAC CTTCCTCATA ATGATGTTTT GAGGAATTAA TGAGTTTATT541 CATGCAAAAT GCTTAGAATG GTACCTGGCA CACAGACAAT GTTTAAGAAA TGTTTGTTAT601 TGTTATTACT ATGTCTCTGT ATATATGCAT AGGAAAAATC TGGAAGGATA AAATAAAAAA661 TGAATATTTT TGGGTGGTGA GACTAAATTT TTGTCTAATT TTATGGATAA GTTTTATGAT721 TTATATTTAT AATAAAAATA AAGCTATAAA AATTAATTAT GATGTTTCTT GCTCATGTCA781 GCTACTTCAC TACATACTGA GTTCCCATCC CCATTTGTTA CAGGAGCAAC TCCTGGTTAA841 GTACCTTTTT TGTAACTGTG AAATTCCCTT GACATTCATC ATATACTGAT GACTTTTCCT901 AATACATGGA AACAAACAGG ATTGTGATTT TTCTCTCATT TTGTACACTA AGTTCTATGC961 CAGCCGATTT CAGAGAGACA CTCTGCAAAG TTCCTATGAA AAGTCTTCAA AAATGTATTA1021 CCTTGCTGTT TAATACCAAT ACCAAAATTC AAATGGACTT ATCAATTAAA CTCACCTCAA1081 ACACAGTAAT GCACTCACAG TTATGAGCAG TGCTCACTAC TGCCAATCAT TTCTGCTTCC1141 AGAATGGTTA AAGGAGCCAC AAACTCTGCC CTTATCAGAA GCAGTAGCCT GATAACAGGT1201 AAGAATAGGA ATGTTCCGTT TCTCCCCAAA TTAAGAGTGG TATCAATAAT CTGACTTTTC1261 CAGGCATTTA TCTCACAGAA ATGTTTATGA GACATGCTAA GATCAACATG GTAATATCTG1321 ACTATTGTTT TTATTAGAAA TAAGGGGGCC AGCCAGGCAC AGTAGCTTAC ACCTGTAATC1381 CCAGCACCTG GGGAGGTTGA GGTGGGAGGA TTGCTTGAGC CCAGGAGTTT GAGACAAGCC1441 TGGGCAACAC AGGGAGACAC CAGCTCTATT AAAAAAAAAA AAAGTAAGGG GGCTATAATG1501 TAACCCTTAT TGACTGATCT TTGAGGCTAC TGTTGTGAGA TTTCTACATC CCTCTTTATT1561 ATAAAAGATC CCAAATGCGG CTTTACTTGG AAAGGAAGCA ATTTGACAGT GATGAGGAAT1621 GATGTGCAGA ATGGAGATTC AGAACCCTAA CAGACTCTGG TATTGATATC TAGTGCTCAT1681 ATTTCTGGGA GTCTGCTAGG GTTATGGGAG TTTGCATTTA AATTGTAGGT TGTTGCAGAA1741 AACAGAATTT ATATGTGGAA AATTGTAACG AATCCACTAA AAAACTATTA GAACTAATAA1801 TCAAGTTTGG CAAGGTTGTA AGACATAAGT CAGTATACAA AAATCAACTG TATTTCTATA1861 CATTTGTGAC AATCTGAAAA TGAAATTAGG AAAACAAATC CATTTACGAT AGCAACAAGA1921 AGTATAAAAT ACTTAGGAAG AAGTATAACA AAAGATGTGC ACAATTTATA TTCTGAAAAC1981 TACAAATAGT GTTTAAAGAA ATTAAAGAAT ATTAAAATAA ATGGAAAAAT ATCCCATGTT2041 CATGGACTGG AAGAATTACT CTTAAGATGT CAATACTCCT CAAATTGATC TACATATTTG2101 ATACAATCCT TGTAAGAACC CGAACTGACT TCTTTGTAGA AATTGACAAA TTGATTCTAA2161 GATTCATACA GGATTGCCAT AGATCCAGAA TAGCCACATC AATTTTAAAA AAAGAAGAAA2221 GTACAAAGAC TCACATTACC TGATTTAAAA ACATACCATA AAGCAATGTT AGGACAGTGT2281 GGTATTGACA TAAGGATAGA CACATAGATC AATGAAAAGG AAAGGGAGCC CAGAAGTAAA2341 ACCACATCAA CTGATTTTCA ACAAAGATGC CAAGACCATT CAATTGAGGA AAGAATAGTC2401 CCTTCAACAA ATGGTGCTGC AACCAGACAG TCATATGCAA AAGAATGAAA TTTAACCTTT2461 ACAAAATTTA ACCATATATA AAAATTAATT CAAATGGATC AAAGACATAT AAGGGCTGAA2521 ACTATAAAAT TGTTAAGAGA ACATAGGAAT AAATATTCAT GACCTTGGAT TTGGCAGTGG2581 ATTCTTAGCT ATAACATCAA AGCACAAGTA AGAAAAGAGA GATAAATTGG ATTTCATGAA2641 AATTAAAAAC CTGTGCTTCA AAGACACTAT CAAGAAAGTG ACAAGGCAAC CCACAGAATG2701 GGAAAAACTG CAGATTATCT GATAAGGGAC TTCTATCTAG AATATATAAA AATCTCTCAC2761 AACTCAGAAA TAAGACAATC CAGTTAAAAT AAGGGTAAAG GAGCCGGGCA TGGTGGCTCA2821 CGCCTGTAAT CCCAGAGCTT TGGGAGGTGG AGGTGGGCAG ATCACCTGAG GTCAGGAGTT2881 CACGACCAGC CTGGCCAACA TGGTAAAACC CCATCTCTAC TAAAAATACA AAAATTAGCC2941 GGGTGTGGTG GTGCATGCCT GTAATCCCAG CTACTTGGGA GGCTGAGGCA GAAGAATCAC3001 TTGAACCTGG GAGGTGGAGG TTGCAGTGAG CCGAGATCGC GCCACTGCAC TCCAGCCTGG3061 GCGACAGAGC GAGAATCTGT CTCGAAAAAA AAAAAAAAAA AAB核苷酸序列(SEQ ID NO18)长度99个氨基酸1 MFVIVITMSL YICIGKIWKD KIKNEYFWVV RLNFCLILWI SFMIYIYNKN KAIKINYDVS61 CSCQLLHYIL SSHPHLLQEQ LLVKYLFCNC EIPLTFIIYC.核苷酸及氨基酸组合序列(SEQ ID NO17) 克隆号和蛋白名称FP6628起始编码子590 ATG 终止编码子887 TGA 蛋白质分子量11979.941 G GGC AGA GGT TGC AGT AAC CCA AGA TCA TGC CAC CAT ACT ACA GAC TGT GTG ACA GAG 5859 CGA GAC TCT GTC TCA AAA CAA CAA CAA AAA AAC AAA CTC ACC ATT GTA CCT GTG CTT ATG118119 CAA GGT TTA GTA GGA ACG TAA ATT GGT TTA ACC TTT GTG GAC AGA AGT TTT AAA AAT ATA178179 TAT TAA AAT TAA AAG TAT GCT CTG AAG GAG GAA CTC CAC TTC TGG TAA TTT ATC TCA AGA238239 GAA TAA CTG GGC CAG CAC AAA GGC TGC TGT TTA ACA ATG TGT AAT GAT GCA GTG ACA GCT298299 ACA ATT GCA AAA ATA ACC TAG ACA TTC ACC AAT GAG GAC TGG TTA AAT GAA CTA GTA TAA358359 CCA TAC TGC AGA ATA TCA TAA AGA TAA CAA AAA AAT GAT ATG GAT CTG TTT CTT GGC ATA418419 AAT ATA TCC ATA AGT TTT AAG AAG AGA TGC TAT ATA TAC GGT GGT CCC ATT GAT GTA TAA478479 CTG TTA GGA CTA AAA ATA GTA CCT TCC TCA TAA TGA TGT TTT GAG GAA TTA ATG AGT TTA538539 TTC ATG CAA AAT GCT TAG AAT GGT ACC TGG CAC ACA GAC AAT GTT TAA GAA ATG TTT GTT5981 Met Phe Val 3599 ATT GTT ATT ACT ATG TCT CTG TAT ATA TGC ATA GGA AAA ATC TGG AAG GAT AAA ATA AAA6584 Ile Val Ile Thr Met Ser Leu Tyr Ile Cys Ile Gly Lys Ile Trp Lys Asp Lys Ile Lys 23659 AAT GAA TAT TTT TGG GTG GTG AGA CTA AAT TTT TGT CTA ATT TTA TGG ATA AGT TTT ATG 71824 Asn Glu Tyr Phe Trp Val Val Arg Leu Asn Phe Cys Leu Ile Leu Trp Ile Ser Phe Met 43719 ATT TAT ATT TAT AAT AAA AAT AAA GCT ATA AAA ATT AAT TAT GAT GTT TCT TGC TCA TGT 77844 Ile Tyr Ile Tyr Asn Lys Asn Lys Ala Ile Lys Ile Asn Tyr Asp Val Ser Cys Ser Cys 63779 CAG CTA CTT CAC TAC ATA CTG AGT TCC CAT CCC CAT TTG TTA CAG GAG CAA CTC CTG GTT 83864 Gln Leu Leu His Tyr Ile Leu Ser Ser His Pro His Leu Leu Gln Glu Gln Leu Leu Val 83839 AAG TAC CTT TTT TGT AAC TGT GAA ATT CCC TTG ACA TTC ATC ATA TAC TGA TGA CTT TTC 89884 Lys Tyr Leu Phe Cys Asn Cys Glu Ile Pro Leu Thr Phe Ile Ile Tyr *** 100899 CTA ATA CAT GGA AAC AAA CAG GAT TGT GAT TTT TCT CTC ATT TTG TAC ACT AAG TTC TAT 958959 GCC AGC CGA TTT CAG AGA GAC ACT CTG CAA AGT TCC TAT GAA AAG TCT TCA AAA ATG TAT10181019 TAC CTT GCT GTT TAA TAC CAA TAC CAA AAT TCA AAT GGA CTT ATC AAT TAA ACT CAC CTC10781079 AAA CAC AGT AAT GCA CTC ACA GTT ATG AGC AGT GCT CAC TAC TGC CAA TCA TTT CTG CTT11381139 CCA GAA TGG TTA AAG GAG CCA CAA ACT CTG CCC TTA TCA GAA GCA GTA GCC TGA TAA CAG11981199 GTA AGA ATA GGA ATG TTC CGT TTC TCC CCA AAT TAA GAG TGG TAT CAA TAA TCT GAC TTT12581259 TCC AGG CAT TTA TCT CAC AGA AAT GTT TAT GAG ACA TGC TAA GAT CAA CAT GGT AAT ATC13181319 TGA CTA TTG TTT TTA TTA GAA ATA AGG GGG CCA GCC AGG CAC AGT AGC TTA CAC CTG TAA13781379 TCC CAG CAC CTG GGG AGG TTG AGG TGG GAG GAT TGC TTG AGC CCA GGA GTT TGA GAC AAG14381439 CCT GGG CAA CAC AGG GAG ACA CCA GCT CTA TTA AAA AAA AAA AAA GTA AGG GGG CTA TAA14981499 TGT AAC CCT TAT TGA CTG ATC TTT GAG GCT ACT GTT GTG AGA TTT CTA CAT CCC TCT TTA15581559 TTA TAA AAG ATC CCA AAT GCG GCT TTA CTT GGA AAG GAA GCA ATT TGA CAG TGA TGA GGA16181619 ATG ATG TGC AGA ATG GAG ATT CAG AAC CCT AAC AGA CTC TGG TAT TGA TAT CTA GTG CTC16781679 ATA TTT CTG GGA GTC TGC TAG GGT TAT GGG AGT TTG CAT TTA AAT TGT AGG TTG TTG CAG17381739 AAA ACA GAA TTT ATA TGT GGA AAA TTG TAA CGA ATC CAC TAA AAA ACT ATT AGA ACT AAT17981799 AAT CAA GTT TGG CAA GGT TGT AAG ACA TAA GTC AGT ATA CAA AAA TCA ACT GTA TTT CTA18581859 TAC ATT TGT GAC AAT CTG AAA ATG AAA TTA GGA AAA CAA ATC CAT TTA CGA TAG CAA CAA19181919 GAA GTA TAA AAT ACT TAG GAA GAA GTA TAA CAA AAG ATG TGC ACA ATT TAT ATT CTG AAA19781979 ACT ACA AAT AGT GTT TAA AGA AAT TAA AGA ATA TTA AAA TAA ATG GAA AAA TAT CCC ATG20382039 TTC ATG GAC TGG AAG AAT TAC TCT TAA GAT GTC AAT ACT CCT CAA ATT GAT CTA CAT ATT20982099 TGA TAC AAT CCT TGT AAG AAC CCG AAC TGA CTT CTT TGT AGA AAT TGA CAA ATT GAT TCT21582159 AAG ATT CAT ACA GGA TTG CCA TAG ATC CAG AAT AGC CAC ATC AAT TTT AAA AAA AGA AGA22182219 AAG TAC AAA GAC TCA CAT TAC CTG ATT TAA AAA CAT ACC ATA AAG CAA TGT TAG GAC AGT22782279 GTG GTA TTG ACA TAA GGA TAG ACA CAT AGA TCA ATG AAA AGG AAA GGG AGC CCA GAA GTA23382339 AAA CCA CAT CAA CTG ATT TTC AAC AAA GAT GCC AAG ACC ATT CAA TTG AGG AAA GAA TAG23982399 TCC CTT CAA CAA ATG GTG CTG CAA CCA GAC AGT CAT ATG CAA AAG AAT GAA ATT TAA CCT24582459 TTA CAA AAT TTA ACC ATA TAT AAA AAT TAA TTC AAA TGG ATC AAA GAC ATA TAA GGG CTG25182519 AAA CTA TAA AAT TGT TAA GAG AAC ATA GGA ATA AAT ATT CAT GAC CTT GGA TTT GGC AGT25782579 GGA TTC TTA GCT ATA ACA TCA AAG CAC AAG TAA GAA AAG AGA GAT AAA TTG GAT TTC ATG26382639 AAA ATT AAA AAC CTG TGC TTC AAA GAC ACT ATC AAG AAA GTG ACA AGG CAA CCC ACA GAA26982699 TGG GAA AAA CTG CAG ATT ATC TGA TAA GGG ACT TCT ATC TAG AAT ATA TAA AAA TCT CTC27582759 ACA ACT CAG AAA TAA GAC AAT CCA GTT AAA ATA AGG GTA AAG GAG CCG GGC ATG GTG GCT28182819 CAC GCC TGT AAT CCC AGA GCT TTG GGA GGT GGA GGT GGG CAG ATC ACC TGA GGT CAG GAG28782879 TTC ACG ACC AGC CTG GCC AAC ATG GTA AAA CCC CAT CTC TAC TAA AAA TAC AAA AAT TAG29382939 CCG GGT GTG GTG GTG CAT GCC TGT AAT CCC AGC TAC TTG GGA GGC TGA GGC AGA AGA ATC29982999 ACT TGA ACC TGG GAG GTG GAG GTT GCA GTG AGC CGA GAT CGC GCC ACT GCA CTC CAG CCT30583059 GGG CGA CAG AGC GAG AAT CTG TCT CGA AAA AAA AAA AAA AAA AA 31027.FP6651A核苷酸序列(SEQ ID NO19)长度2455个碱基1 GTTCTAGGTA GTAGAAAGCA AAGGGTGCTA TGAAGAGCGT GTACACAGAC TCCCAACTGT61 TTTGGGAGTT AAGGAAGGTT TCTTGGAGGA AGTGGCATTC AAGCTATAAG ACCTGATGAT121 CAGGTGGAGT TAGCTGGAGA GCAGGGACAG AGAGAATAGC CTGTGCAAAA GGCCTATTCT181 TCAGGAGAGA ATGACACATG AATGGGACTG AAGAAGTAAA CTGGTATCTC ATATGAAGGA241 CCTTTTATAT CTTGTTAAGG ATTTTGAACT TCCTCCTTTT TTTTTTTTTG AGACAGAGTT301 TCTCTCTGTC ACCCAGGCTG AAGTGCATTG GCGTGATCTC GGCTCATGGC AGCCTCCACC361 TACCAGGTTC AAGCTATTCT CCTGCCTCAG CTTCCCAGAT AGCTGGGATT ACAGTCATGT421 GCCACCACGC CGGGCTAATT TTTGTATTTT TAGTAGAGAC AGGGTTTCAC CGTGTTGGCC481 AGGCTGGTCT CGATTTCCTG ACCTCAAGTG ATCTGCCTGC CTTGGCCTGC CCCAGTGCCG541 GAATTACAGG AGTGAGCCCC CGCGCCTGGC CTGGACTTCT GCTTAAAGGC AATAAGGAAG601 CCTTTACTAG ATTTAAAATA GGAGCTCAGT TTAAATTAGT AAGGATTTGT ATTTCATCAA661 GAGCTCTCTT TGGCCCTAGT CTGGTAGAAG ATGAGTCGAA GTAGAGAGAC TAGTTACAAA721 GCTGTTCCCA ATAATCCAGG TGAAAAATAG TGGTGACCCT AGATTAAGGT AGTATTGGTG781 TGGGTAGGGA GAAGTGGACA GTCATATTTG AGAGGTACCT AGGGAATAGA ATTGCAAAGA841 CCTGGGAGTA GATTGGATAT TCAGTGGGAG GAAGGGAGAG AAGTAATCTC TCAAGTGTTG901 CTCAAGCCAT AACCTTGGAT GGTACTGTCC ACTGATACAG TAGGAGGAAA ATGTTTGAGG961 GAAAGTAGTG ATGAATTTGT GGTGCACTAA CATGGCCAAC ACTAAATATT AGAAAGATTA1021 ATGTGGTCAT GTAGAAGATG AATGAAAAGA AGATACCTCA GAAGTGGAGA GATAGTTAAA1081 TGGCTTTTGT AGGAATCTCA GCTAGAAGTG TCAGTATTCT TAAGTGCAGA ACTAACAGGT1141 GTGGGAAAGT AATGGGAAGT AGACACCAAA CAAATAGTTC CCCAAAGATG GTATCAAATA1201 TCCCAGTGAC AGCTTGCAGC CTGCTCAGCT TTATGATATG CCCCTGAGAT CATTTTTCAG1261 GACAAAAAGT AGTGAAACTA CCTTTATTTA CTTCTCAAAT TTACCTTTAT TTACTTCTCA1321 AATATACATA GAAAGTAATA TTGTAAAAAG CAGCTCTGGC TGGGCGCTGT GGCTCAAGCC1381 TATAGTCCCA GCACTTTGGG AGGCTGAGGG GGGCAGATGA CTTGAGGTCA AGAGTTCAAG1441 ACCATCCTGG CCAACATGGC AAAACCGCAT TTCTACTAAA AATACAAAAA TTCGCGTGGC1501 AGCACGTGCC TGTAACCCCA GCTACTCTGG AGGCTGAGGT ACAAGAGTCG CTTGAATTTG1561 GGAGGTGGAG ACTGCAGCGA GCCGAGATCC TACCACTGCA CTCCAGCTTG GGGGACAGTG1621 CGAGACTCTG TCTTAAAAAA CAGTGGCCTG GCGCACTGGC TCACGCTTGT AATCCCAGCA1681 CTTTGGGAGG CCGAGGTGGG CGGGGGTGGA TCATTGAGGT CAGGAGATCA AGACCATCCC1741 GGCCAACGTG GTGCAACCCC GTCTCTACTA CAAATACAAA AATTAGCTGG ACATGGTGGT1801 GTACGCCTGT AGTCCCAGCT ACTCGGGAGA GTAAGACGGG AATCGCTTGA ACCTGGGAGG1861 TGGGAGGTTG CAGTGAGCCA AGATTGTGCC ACTGCACTCC AGCCTGGCGA CAGAGCAAGA1921 CTGTCTTAAA AAAAAAAAAA AAAAAAAAAG GATATTTTCA CTCTTGGGAC TTGATAAAGC1981 TAGTTTATTT TGATTATCTC CTATATCCTA TACATATTTA ATTGGCCCCT ATGAACAATG2041 TTACCTCTTT ATGAGGGGAC CCAAAGAAGT AGCTGCTGGT GTGAGAGTGA GAGATCATCC2101 ATCTTTTTTA TTGTGCTTTT TGTTGTTTCT TTGTCCTGCT ATGTGTTATA AGTAAGGCCG2161 GGCACGGTGG CTCATGCCTG TAATCCCAGC ACTTAGGGAG GCCAAGGCCA GATCCCTGAG2221 GTCAAGAGTT TGAGACCAGC CTAGCCAACA TGGTGAAACC TTGTCTTTAC TGAAAATACA2281 AAAAAATTAG CTGGGCAGGG TGGCATGCGC CTGTAGTCCC AGCTACTCGC AGAGGCTGAG2341 GCAGGAGAAT TGCTTGAACC TGGGAGGCGG AGGTTGCGGT GAGCCAAGAT CCTGCCACTG2401 CACTCCAGCC TGGGCAACAG AGGGAGACTC CATCTCAAAA AAAAAAAAAA AAAAAB核苷酸序列(SEQ ID NO21)长度95个氨基酸1 MCHHAGLIFV FLVETGFHRV GQAGLDFLTS SDLPALACPS AGITGVSPRA WPGLLLKGNK61 EAFTRFKIGA QFKLVRICIS SRALFGPSLV EDESKC.核苷酸及氨基酸组合序列(SEQ ID NO20) 克隆号和蛋白名称FP6651起始编码子417 ATG 终止编码子702 TAG 蛋白质分子量10202.371 GT TCT AGG TAG TAG AAA GCA AAG GGT GCT ATG AAG AGC GTG TAC ACA GAG TCC CAA CTG 5960 TTT TGG GAG TTA AGG AAG GTT TCT TGG AGG AAG TGG CAT TCA AGC TAT AAG ACC TGA TGA 119120 TCA GGT GGA GTT AGC TGG AGA GCA GGG ACA GAG AGA ATA GCC TGT GCA AAA GGC CTA TTC 179180 TTC AGG AGA GAA TGA CAC ATG AAT GGG ACT GAA GAA GTA AAC TGG TAT CTC ATA TGA AGG 239240 ACC TTT TAT ATC TTG TTA AGG ATT TTG AAC TTC CTC CTT TTT TTT TTT TTG AGA CAG AGT 299300 TTC TCT CTG TCA CCC AGG CTG AAG TGC ATT GGC GTG ATC TCG GCT CAT GGC AGC CTC CAC 359360 CTA CCA GGT TCA AGC TAT TCT CCT GCC TCA GCT TCC CAG ATA GCT GGG ATT ACA GTC ATG 4191 Met 1420 TGC CAC CAC GCC GGG CTA ATT TTT GTA TTT TTA GTA GAG ACA GGG TTT CAC CGT GTT GGC 4792 Cys His His Ala Gly Leu Ile Phe Val Phe Leu Val Glu Thr Gly Phe His Arg Val Gly 21480 CAG GCT GGT CTC GAT TTC GTG ACC TCA AGT GAT CTG CCT GCC TTG GCC TGC CCC AGT GCC 53922 Gln Ala Gly Leu Asp Phe Leu Thr Ser Ser Asp Leu Pro Ala Leu Ala Cys Pro Ser Ala 41540 GGA ATT ACA GGA GTG AGC CCC CGC GCC TGG CCT GGA CTT CTG CTT AAA GGC AAT AAG GAA 59942 Gly lle Thr Gly Val Ser Pro Arg Ala Trp Pro Gly Leu Leu Leu Lys Gly Asn Lys Glu 61600 GCC TTT ACT AGA TTT AAA ATA GGA GCT CAG TTT AAA TTA GTA AGG ATT TGT ATT TCA TCA 65962 Ala Phe Thr Arg Phe Lys Ile Gly Ala Gln Phe Lys Leu Val Arg Ile Cys Ile Ser Ser 81660 AGA GCT CTC TTT GGC CCT AGT CTG GTA GAA GAT GAG TCG AAG TAG AGA GAC TAG TTA CAA 71982 Arg Ala Leu Phe Gly Pro Ser Leu Val Glu Asp Glu Ser Lys *** 96720 AGC TGT TCC CAA TAA TCC AGG TGA AAA ATA GTG GTG ACC CTA GAT TAA GGT AGT ATT GGT 779780 GTG GGT AGG GAG AAG TGG ACA GTC ATA TTT GAG AGG TAC CTA GGG AAT AGA ATT GCA AAG 839840 ACC TGG GAG TAG ATT GGA TAT TCA GTG GGA GGA AGG GAG AGA AGT AAT CTC TCA AGT GTT 899900 GCT CAA GCC ATA ACC TTG GAT GGT ACT GTC CAC TGA TAC AGT AGG AGG AAA ATG TTT GAG 959960 GGA AAG TAG TGA TGA ATT TGT GGT GCA CTA ACA TGG CCA ACA CTA AAT ATT AGA AAG ATT10191020 AAT GTG GTC ATG TAG AAG ATG AAT GAA AAG AAG ATA CCT CAG AAG TGG AGA GAT AGT TAA10791080 ATG GCT TTT GTA GGA ATC TCA GCT AGA AGT GTC AGT ATT CTT AAG TGC AGA ACT AAC AGG11391140 TGT GGG AAA GTA ATG GGA AGT AGA CAC CAA ACA AAT AGT TCC CCA AAG ATG GTA TCA AAT11991200 ATC CCA GTG ACA GCT TGC AGC CTG CTC AGC TTT ATG ATA TGC CCC TGA GAT CAT TTT TCA12591260 GGA CAA AAA GTA GTG AAA CTA CCT TTA TTT ACT TCT CAA ATT TAC CTT TAT TTA CTT CTC13191320 AAA TAT ACA TAG AAA GTA ATA TTG TAA AAA GCA GCT CTG GCT GGG CGC TGT GGC TCA AGC13791380 CTA TAG TCC CAG CAC TTT GGG AGG CTG AGG GGG GCA GAT GAC TTG AGG TCA AGA GTT CAA14391440 GAC CAT CCT GGC CAA CAT GGC AAA ACC GCA TTT CTA CTA AAA ATA CAA AAA TTC GCG TGG14991500 CAG CAC GTG CCT GTA ACC CCA GCT ACT CTG GAG GCT GAG GTA CAA GAG TCG CTT GAA TTT15591560 GGG AGG TGG AGA CTG CAG CGA GCC GAG ATC CTA CCA CTG CAC TCC AGC TTG GGG GAC AGT16191620 GCG AGA CTC TGT CTT AAA AAA CAG TGG CCT GGC GCA CTG GCT CAC GCT TGT AAT CCC AGC16791680 ACT TTG GGA GGC CGA GGT GGG CGG GGG TGG ATC ATT GAG GTC AGG AGA TCA AGA CCA TCC17391740 CGG CCA ACG TGG TGC AAC CCC GTC TCT ACT ACA AAT ACA AAA ATT AGC TGG ACA TGG TGG17991800 TGT ACG CCT GTA GTC CCA GCT ACT CGG GAG AGT AAG ACG GGA ATC GCT TGA ACC TGG GAG18591860 GTG GGA GGT TGC AGT GAG CCA AGA TTG TGC CAC TGC ACT CCA GCC TGG CGA CAG AGC AAG19191920 ACT GTC TTA AAA AAA AAA AAA AAA AAA AAA GGA TAT TTT CAC TCT TGG GAC TTG ATA AAG19791980 CTA GTT TAT TTT GAT TAT CTC CTA TAT CCT ATA CAT ATT TAA TTG GCC CCT ATG AAC AAT20392040 GTT ACC TCT TTA TGA GGG GAC CCA AAG AAG TAG CTG CTG GTG TGA GAG TGA GAG ATC ATC20992100 CAT CTT TTT TAT TGT GCT TTT TGT TGT TTC TTT GTC CTG CTA TGT GTT ATA AGT AAG GCC21592160 GGG CAC GGT GGC TCA TGC CTG TAA TCC CAG CAC TTA GGG AGG CCA AGG CCA GAT CCC TGA22192220 GGT CAA GAG TTT GAG ACC AGC CTA GCC AAC ATG GTG AAA CCT TGT CTT TAC TGA AAA TAC22792280 AAA AAA ATT AGC TGG GCA GGG TGG CAT GCG CCT GTA GTC CCA GCT ACT CGC AGA GGC TGA23392340 GGC AGG AGA ATT GCT TGA ACC TGG GAG GCG GAG GTT GCG GTG AGC CAA GAT CCT GCC ACT 23992400 GCA CTC CAG CCT GGG CAA CAG AGG GAG ACT CCA TCT CAA AAA AAA AAA AAA AAA AA 24558.FP7162A核苷酸序列(SEQ ID NO22)长度2572个碱基1 GCGGGGTTTC ACTATGTTGG CCAGGCTGGT CTAGAACTCC TGACCTCAAG TGATCTGCCC61 GCCTCGGCCT CCCAAAGTGC TGGGATTGCA GGCGTGAGAC ACTGCACCCG GACAATTTTC121 CTTTTCTTAC AAGAACACTG CTCACACTGC ATTCAGGGCC AACCCTAACC CAGTATCGCC181 TCATCCTGGT TTGATTATAT CGGCACAGAC CTTGCTTCCG AGCGAGGCCA CTTTCTCAGG241 TACTGGTGGA CATGAGTCTT CGGAGACGCT GCTCAACCCA CAGTGCTCCT CCAGCTTGGT301 TTCTGTGACT TGCCTTCCCC AGAGGAGGGG TGCCCTGAGA GGTCTCCACT CCCTGACCGG361 CTCCTTGGTG CCGCGCACTC TGAGAGGCTT CCCAGGGAAC AGAGCACACA GGACCGCCCT421 CCTGGGTAGA CCAATCAGCA TCTGAGCTCA CAATTTCCCA GCAGGGCAGT GGGGTGGAGA481 GAGAAGCCTG GGCTGGGCTG GGCTGGGCTG GGCTGGGGAA GCTTCTCCGG GCGGGGGGAC541 GTCAGAGCAG GATCTGGGGC TGATAAAAGC CCGCCCCTGG GTGGGGGCTG AGTGGTGCGG601 AAGCTGAGCC CGACACGTGG GGATGGAGGA CAGGCTGTGG GAGGGTGTGA ACCGGATACT661 GCTTGAAGGG GTGCTGGGGA CTTTGAGAGA GGGCGGCTGG CCCTGTCTGG TCGGGGATGC721 TGGCCCAGAC ACAGGCCATG GCTGGGATGG GGTTCAGAAA CAGGACCGCT GTCTCTCCCG781 GGCCAGGGCC CTCCCCAGCT GCTCCTGGCT TTCTGGTTCT TGGGGTCAGG GGCAGGCCTG841 TGCCATGACC CCGCCACTGA GGCTGTGAGG AGGCTGTCGG TGCCCAAGGG CACCAAGGCA901 CACCCCTACT CTTGCACCCC ATGTGTGGGC CCGAGCACCT GCTCTGCTGC CCCAAAGATC961 TGGCGATGTT TCCCAGGCAA CTGTCTCTCA CAGCCTGTCT GCCTGGCACT CCCGTATCCC1021 ATAAATGCCA CCACATCTGG CTATGGGTGG GCGTGCCTGC CTGGCATCCA CGGGCCAGCA1081 GGTGTGGTGG AGCACAGCCC AGTTCCTGGC TGCGTCAGAA GGCTGCCCGG GCCTTTTGGC1141 TGTCCTTGCC AGCAGGTGAG CACTGCCAGG GCACCGTGTG TGGGTGCTGG GCCATTTAGC1201 CACATGGGAA GGGGTGGAGG CAGCCCAGTG CCTTCAGCAT GTGCCCAGGG TGCCTGTCGG1261 CCACAGGTCT CATTTGGAAA TTGGGAGGGT GCACGGCCAC CGGGCTGCTT AGGCCTGCCA1321 GCCTCAGGGC CCGTCACCGC TGTCTTAGCC TGATTTGCAG GGTGTCAACG CTGGGCAGAG1381 ATGAACATTT GGGTGACTCT GAGGATGCCA GTGGCTGGGA CACTTGTTCT TCCGCGGTGG1441 AAGGAGTTGG AGAGGCCTGG CTCCCTGACC TACGGCCAGC CTGGCTTCTG AAACCAGCTC1501 AGTGGGCTGG GGCCTGATTC ATCATCCATA AATGTGTCCT TTTTTGCCAC AGAGGGTAAG1561 GGGCCTCCTA GCCCACCGGT CTGCAGGTGC GGGAGTAGGA GATGGGTGGC TCTGATGCCC1621 CCACCCACTC GATCACCTTC TGCTCTGCCT GGGATGCAAA CTCCCACAGC TGAAACGTTC1681 TTTTGTAAAC ATGAATTTTG GCTTAGAAAA AACTCATTTC CACTGTGCAC GTGTCAGTCC1741 CAACCAGAAA TTATTTTCCA ATAAAGCAAA ACTCCGTCAC CACAGCAGCA GATGGCTCCG1801 AAGAAGTGGA GCGTTTTCAT CAGGTTCAAC TTTGAAACCT CCACCATCAC CATCACCAGC1861 ACCGCTGTGT CATGCTGATA ACTTGAGGAC AGGCAGGACA AGGCCTTCTG GCGGCCGCCC1921 CTGGTTTCTC CTGGGGGGTG ATGAGCGGGA GCGGCTCTGG GCCGAGCTAC TGCGCACGGT1981 GAGCCCGGAG CTGATCCTGG ATCACGAGGT GCCTTCACTG CCCGCCTTCC CAGGACAGGA2041 GCCCAGGTGC GGCCCGGAGC CCACTGAAGT CTTCACTGTC GGACCCAAGA CCTTTTCCTG2101 GACACCCTTT CCGCCGGACC TGTGGGGCCC GGGCCGTTCC TACCGGCTGC TTCACGGGGC2161 AGGAGGGCAC CTGGAATCCC CCGCCAGGTC CCTGCCCCAG CGCCCGGCAC CTGATCCCTG2221 CAGGGCCCCC AGGGTGGAGC AGCAACCGTC TGTGGAGGGT GCCGCGGCCC TGCGCAACTG2281 CCCCATGTGC CAGAAGGAGT TTGCCCCCAG GCTGACCCAG CTGGATGTTG ACAGCCACCT2341 GGCCCAGTGC TTGGCCGAAA GCACAAAAAA CGTGACGTGG TGAGCGCCAT CCAAGAGCCC2401 TGCGCAGAGT GCAGCGCCCG GACACGCTTT CCCCCGCCAG CAGCCCCGCC TCTCGGCTCC2461 CCCGCCAACA GCCCCGCCTT TCGGCTCCCC CGCATGGGCA TTAAAACAGG GCGGGCTCCT2521 GTCTGTCTCT GTGTTGTGAT GAAAAAAAAA AAAAAAAAAA AAAAAAAAAA AAB核苷酸序列(SEQ ID NO24) 长度230个氨基酸
1 MNFGLEKTHF HCARVSPNQK LFSNKAKLRH HSSRWLRRSG AFSSGSTLKP PPSPSPAPLC61 HADNLRTGRT RPSGGRPWFL LGGDERERLW AELLRTVSPE LILDHEVPSL PAFPGQEPRC121 GPEPTEVFTV GPKTFSWTPF PPDLWGPGRS YRLLHGAGGH LESPARSLPQ RPAPDPCRAP181 RVEQQPSVEG AAALRNCPMC QKEFAPRLTQ LDVDSHLAQC LAESTKNVTWC.核苷酸及氨基酸组合序列(SEQ ID NO23) 克隆号和蛋白名称FP7162起始编码子1691 ATG 终止编码子2381 TGA 蛋白质分子量25496.601 G CGG GGT TTC ACT ATG TTG GCC AGG CTG GTC TAG AAC TCC TGA CCT CAA GTG ATC TGC 5859 CCG CCT CGG CCT CCC AAA GTG CTG GGA TTG CAG GCG TGA GAC ACT GCA CCC GGA CAA TTT 118119 TCC TTT TCT TAC AAG AAC ACT GCT CAC ACT GCA TTC AGG GCC AAC CCT AAC CCA GTA TCG 178179 CCT CAT CCT GGT TTG ATT ATA TCG GCA CAG ACC TTG CTT CCG AGC GAG GCC ACT TTC TCA 238239 GGT ACT GGT GGA CAT GAG TCT TCG GAG ACG CTG CTC AAC CCA CAG TGC TCC TCC AGC TTG 298299 GTT TCT GTG ACT TGC CTT CCC CAG AGG AGG GGT GCC CTG AGA GGT CTC CAC TCC CTG ACC 358359 GGC TCC TTG GTG CCG CGC ACT CTG AGA GGC TTC CCA GGG AAC AGA GCA CAC AGG ACC GCC 418419 CTC CTG GGT AGA CCA ATC AGC ATC TGA GCT CAC AAT TTC CCA GCA GGG CAG TGG GGT GGA 478479 GAG AGA AGC CTG GGC TGG GCT GGG CTG GGC TGG GCT GGG GAA GCT TCT CCG GGC GGG GGG 538539 ACG TCA GAG CAG GAT CTG GGG CTG ATA AAA GCC CGC CCC TGG GTG GGG GCT GAG TGG TGC 598599 GGA AGC TGA GCC CGA CAC GTG GGG ATG GAG GAC AGG CTG TGG GAG GGT GTG AAC CGG ATA 658659 CTG CTT GAA GGG GTG CTG GGG ACT TTG AGA GAG GGC GGC TGG CCC TGT CTG GTC GGG GAT 718719 GCT GGC CCA GAC ACA GGC CAT GGC TGG GAT GGG GTT CAG AAA CAG GAC CGC TGT CTC TCC 778779 CGG GCC AGG GCC CTC CCC AGC TGC TCC TGG CTT TCT GGT TCT TGG GGT CAG GGG CAG GCC 838839 TGT GCC ATG ACC CCG CCA CTG AGG CTG TGA GGA GGC TGT CGG TGC CCA AGG GCA CCA AGG 898899 CAC ACC CCT ACT CTT GCA CCC CAT GTG TGG GCC CGA GCA CCT GCT CTG CTG CCC CAA AGA 958959 TCT GGC GAT GTT TCC CAG GCA ACT GTC TCT CAC AGC CTG TCT GCC TGG CAC TCC CGT ATC10181019 CCA TAA ATG CCA CCA CAT CTG GCT ATG GGT GGG CGT GCC TGC CTG GCA TCC ACG GGC CAG10781079 CAG GTG TGG TGG AGC ACA GCC CAG TTC CTG GCT GCG TCA GAA GGC TGC CCG GGC CTT TTG11381139 GCT GTC CTT GCC AGC AGG TGA GCA CTG CCA GGG CAC CGT GTG TGG GTG CTG GGC CAT TTA11981199 GCC ACA TGG GAA GGG GTG GAG GCA GCC CAG TGC CTT CAG CAT GTG CCC AGG GTG CCT GTC12581259 GGC CAC AGG TCT CAT TTG GAA ATT GGG AGG GTG CAC GGC CAC CGG GCT GCT TAG GCC TGC13181319 CAG CCT CAG GGC CCG TCA CCG CTG TCT TAG CCT GAT TTG CAG GGT GTC AAC GCT GGG CAG13781379 AGA TGA ACA TTT GGG TGA CTC TGA GGA TGC CAG TGG CTG GGA CAC TTG TTC TTC CGC GGT14381439 GGA AGG AGT TGG AGA GGC CTG GCT CCC TGA CCT ACG GCC AGC CTG GCT TCT GAA ACC AGC14981499 TCA GTG GGC TGG GGC CTG ATT CAT CAT CCA TAA ATG TGT CCT TTT TTG CCA CAG AGG GTA15581559 AGG GGC CTC CTA GCC CAC CGG TCT GCA GGT GCG GGA GTA GGA GAT GGG TGG CTC TGA TGC16181619 CCC CAC CCA CTC GAT CAC CTT CTG CTC TGC CTG GGA TGC AAA CTC CCA CAG CTG AAA CGT16781679 TCT TTT GTA AAC ATG AAT TTT GGC TTA GAA AAA ACT CAT TTC CAC TGT GCA CGT GTC AGT17381 Met Asn Phe Gly Leu Glu Lys Thr His Phe His Cys Ala Arg Val Ser 161739 CCC AAC CAG AAA TTA TTT TCC AAT AAA GCA AAA CTC CGT CAC CAC AGC AGC AGA TGG CTC179817 Pro Asn Gln Lys Leu Phe Ser Asn Lys Ala Lys Leu Arg His His Ser Ser Arg Trp Leu 361799 CGA AGA AGT GGA GCG TTT TCA TCA GGT TCA ACT TTG AAA CCT CCA CCA TCA CCA TCA CCA185837 Arg Arg Ser Gly Ala Phe Ser Ser Gly Ser Thr Leu Lys Pro Pro Pro Ser Pro Ser Pro 561859 GCA CCG CTG TGT CAT GCT GAT AAC TTG AGG ACA GGC AGG ACA AGG CCT TCT GGC GGC CGC191857 Ala Pro Leu Cys His Ala Asp Asn Leu Arg Thr Gly Arg Thr Arg Pro Ser Gly Gly Arg 761919 CCC TGG TTT CTC CTG GGG GGT GAT GAG CGG GAG CGG CTC TGG GCC GAG CTA CTG CGC ACG197877 Pro Trp Phe Leu Leu Gly Gly Asp Glu Arg Glu Arg Leu Trp Ala Glu Leu Leu Arg Thr 961979 GTG AGC CCG GAG CTG ATC CTG GAT CAC GAG GTG CCT TCA CTG CCC GCC TTC CCA GGA CAG203897 Val Ser Pro Glu Leu Ile Leu Asp His Glu Val Pro Ser Leu Pro Ala Phe Pro Gly Gln 1162039 GAG CCC AGG TGC GGC CCG GAG CCC ACT GAA GTC TTC ACT GTC GGA CCC AAG ACC TTT TCC2098117 Glu Pro Arg Cys Gly Pro Glu Pro Thr Glu Val Phe Thr Val Gly Pro Lys Thr Phe Ser 1362099 TGG ACA CCC TTT CCG CCG GAC CTG TGG GGC CCG GGC CGT TCC TAC CGG CTG CTT CAC GGG2158137 Trp Thr Pro Phe Pro Pro Asp Leu Trp Gly Pro Gly Arg Ser Tyr Arg Leu Leu His Gly 1562159 GCA GGA GGG CAC CTG GAA TCC CCC GCC AGG TCC CTG CCC CAG CGC CCG GCA CCT GAT CCC2218157 Ala Gly Gly His Leu Glu Ser Pro Ala Arg Ser Leu Pro Gln Arg Pro Ala Pro Asp Pro 1762219 TGC AGG GCC CCC AGG GTG GAG CAG CAA CCG TCT GTG GAG GGT GCC GCG GCC CTG CGC AAC2278177 Cys Arg Ala Pro Arg Val Glu Gln Gln Pro Ser Val Glu Gly Ala Ala Ala Leu Arg Asn 1962279 TGC CCC ATG TGC CAG AAG GAG TTT GCC CCC AGG CTG ACC CAG CTG GAT GTT GAC AGC CAC2338197 Cys Pro Met Cys Gln Lys Glu Phe Ala Pro Arg Leu Thr Gln Leu Asp Val Asp Ser His 2162339 CTG GCC CAG TGC TTG GCC GAA AGC ACA AAA AAC GTG ACG TGG TGA GCG CCA TCC AAG AGC2398217 Leu Ala Gln Cys Leu Ala Glu Ser Thr Lys Asn Val Thr Trp *** 2312399 CCT GCG CAG AGT GCA GCG CCC GGA CAC GCT TTC CCC CGC CAG CAG CCC CGC CTC TCG GCT24582459 CCC CCG CCA ACA GCC CCG CCT TTC GGC TCC CCC GCA TGG GCA TTA AAA CAG GGC GGG CTC25182519 CTG TCT GTC TCT GTG TTG TGA TGA AAA AAA AAA AAA AAA AAA AAA AAA AAA AAA2572
序列表<110>上海新世界基因技术开发有限公司<120>具有抑癌功能的新的人蛋白及其编码序列<130>017519<160>24<170>PatentIn version 3.0<210>1<211>2662<212>DNA<213>智人(Homo sapiens)<400>1gtgacagtcc acggccccgc tgggatggag ccctgctggg tgcccgcacc gtgctcagtg 60tggcatgcgg cccgggtgtg gagggagacg gtggagcatc ccgtgcctag cgtggtgcca120gccaagggcg ggtggctggg gagctgtgct gggagctgtc gtaaacccgt ggtggctttg180atcctagggc cgtctttctg ctccacttcc cgggcactgt tccgagggag gctcaggtgg240ggaagcgagt gagcctaaag cccaggcttg tctccttggt gccaggccct gcttgctgga300atctggtgat cttaggaggt cactgttgca agggagggga cccaggagcc acctagtcga360acctctttgt ggtacagatg gagaaaccaa ggcccaaaca gtggccccct tgatcagcca420gaagcagagc tgggtggggc agcaggggat ccccaccacc aggctcagac tccttcagga480tgcttttgct tcgcagatga ggagaccgag gcttagagaa gagtagagac ttgctacccg540ttgaggtggt aacagccagg ctagaatctc ctgaaacggg gcagggtggg gaggtctggt600tgggctacct ggggccgggc gcctttcccc caggatgggg tgtactgccc gccctccccc660agtcatggtg ctggtgccag ctggtgcagg gggagggctc tgcaggcctt agcactgagg720caggtggcga gcagcagggg aagggtcttc tccacccacc ccaactgccc aaggttccgt780ggctcctcct tagacagcag tgagggttgg gggtgacagg caagccactg agcctcagca840ccgcgactca cccctcccac tcagcagtcc agccagggtc atccccagcc tcagaggagc900ctgggaacaa gggcagcggc agggccggcg ggggcctgga gggtgagcag gggcctttct960tcctgcagac agccctcagc gcctttttca ggagaccaac atcccctaca gccaccatca 1020ccaccagatg gtaagtgtcc ccggagtccc cagttctgga ttgggcggaa ggaggccgag 1080ctagttctgt gtataagcag cccctggccc cggtgtacga gggcgctggt gcaggcgggg 1140ctcgacctct ttggagatgg gtcagcagga gtcccggctc catgggtcct gcacttaatc 1200ttgcctgtgc cagctccccc tgagacctgg ggggcgctgg cctctggggc aatgaagctc 1260cttaccctac agcccccggg gatgctgtgg ctgatggaaa ggggtgggct gggaaagcct 1320cgtggcccca ggcaccgtgg gctcctgaga gtgaggctgg gtcggttcat ctcaaggctt 1380ctcctctggg aacccctggg cggcggacag gcttggggat ctggggaagg aacacagagc 1440cttccgagaa tgggccagcc acgcatctcc ccttgggagg cagtgggggc ccctccagga 1500aggggtgctc accccatctc tcctctcttc ccctcacaga tgtgcacccc cgccaatacc 1560cctgctacac cccccaactt ccctgacgct ctcaccatgt tctcccgtct caaggcctcc 1620gagagcttcc acagcggtgg cagcggcagc ccgatggccg cgacagccac gtcacccccg 1680ccacacttcc cccatgccgc caccagcagc tctgcggcct ccagctggcc cacggcggcc 1740tcgcccccgg ggggcccaca gcaccaccag ccacagccgc ccctgtggac tccaacaccc 1800ccttctccgg cttcagactg gccacccctg gccccccaac aggccacctc agaacccagg 1860gcccaccctg ccatggaggc agagagataa gggaggcccc tcccccctcc cggaggccag 1920gacccgtggg gcgggggaga ggacgtctct gcgggccccc ttcacccctt ttctgtctgc 1980accccttgtt ccccggagcc ctggagggga gagcgcggac tctagccagg cagggacacg 2040tctggtgcca gaacacgcag ctgcccacac gcaaggtcat ggccccagcg gccccggcac 2100atggagtggt tcagagcggc ctgggtgcct ggcggacaga acttcagaga ccacgcagcc 2160ttccttcgaa gacgcacctg cccagcccag cccaggggtg ccgtggagga ccaccctggc 2220ggagacattg ctgatccctg gcttggagct ccttgggggc cggcaggcct cgaaccccca 2280ccctagggaa tgcagagcct ctccgcatgt gtgcgcgtgg ccgtgtctgt gtatttctac 2340gtgtgtcgct cttcagaagc aacctagttc ctggggcagc tggactttgc atgttagtgt 2400gagcccccag ccccctgccc gccgccccct ccccagggcc ctgcctcctc cccaccccct 2460cgtcagccag cgttgctgtt ccttgcagag aaaaggattg tgggaaactc caggactctt 2520cccaccgcct cccagcgcct gcctgctggg gctgcctgca tgcctcccct gcacctgggg 2580gtacccgcat ccacttcctt tccccctttt aacaaaagag aagaacgaat tccaaaaaaa 2640aaaaaaaaaa aaaaaaaaaa aa2662<210>2<211>2662<212>DNA<213>智人(Homo sapiens)<220><221>CDS<222>(1252)..(1887)<400>2gtgacagtcc acggccccgc tgggatggag ccctgctggg tgcccgcacc gtgctcagtg 60tggcatgcgg cccgggtgtg gagggagacg gtggagcatc ccgtgcctag cgtggtgcca120gccaagggcg ggtggctggg gagctgtgct gggagctgtc gtaaacccgt ggtggctttg180atcctagggc cgtctttctg ctccacttcc cgggcactgt tccgagggag gctcaggtgg240ggaagcgagt gagcctaaag cccaggcttg tctccttggt gccaggccct gcttgctgga300atctggtgat cttaggaggt cactgttgca agggagggga cccaggagcc acctagtcga360acctctttgt ggtacagatg gagaaaccaa ggcccaaaca gtggccccct tgatcagcca420gaagcagagc tgggtggggc agcaggggat ccccaccacc aggctcagac tccttcagga480tgcttttgct tcgcagatga ggagaccgag gcttagagaa gagtagagac ttgctacccg540ttgaggtggt aacagccagg ctagaatctc ctgaaacggg gcagggtggg gaggtctggt600tgggctacct ggggccgggc gcctttcccc caggatgggg tgtactgccc gccctccccc660agtcatggtg ctggtgccag ctggtgcagg gggagggctc tgcaggcctt agcactgagg720caggtggcga gcagcagggg aagggtcttc tccacccacc ccaactgccc aaggttccgt780ggctcctcct tagacagcag tgagggttgg gggtgacagg caagccactg agcctcagca840ccgcgactca cccctcccac tcagcagtcc agccagggtc atccccagcc tcagaggagc900ctgggaacaa gggcagcggc agggccggcg ggggcctgga gggtgagcag gggcctttct960tcctgcagac agccctcagc gcctttttca ggagaccaac atcccctaca gccaccatca 1020ccaccagatg gtaagtgtcc ccggagtccc cagttctgga ttgggcggaa ggaggccgag 1080ctagttctgt gtataagcag cccctggccc cggtgtacga gggcgctggt gcaggcgggg 1140ctcgacctct ttggagatgg gtcagcagga gtcccggctc catgggtcct gcacttaatc 1200ttgcctgtgc cagctccccc tgagacctgg ggggcgctgg cctctggggc a atg aag1257Met Lys1ctc ctt acc cta cag ccc ccg ggg atg ctg tgg ctg atg gaa agg ggt 1305Leu Leu Thr Leu Gln Pro Pro Gly Met Leu Trp Leu Met Glu Arg Gly5 10 15ggg ctg gga aag cct cgt ggc ccc agg cac cgt ggg ctc ctg aga gtg 1353Gly Leu Gly Lys Pro Arg Gly Pro Arg His Arg Gly Leu Leu Arg Val20 25 30agg ctg ggt cgg ttc atc tca agg ctt ctc ctc tgg gaa ccc ctg ggc 1401Arg Leu Gly Arg Phe Ile Ser Arg Leu Leu Leu Trp Glu Pro Leu Gly35 40 45 50ggc gga cag gct tgg gga tct ggg gaa gga aca cag agc ctt ccg aga 1449Gly Gly Gln Ala Trp Gly Ser Gly Glu Gly Thr Gln Ser Leu Pro Arg55 60 65atg ggc cag cca cgc atc tcc cct tgg gag gca gtg ggg gcc cct cca 1497Met Gly Gln Pro Arg Ile Ser Pro Trp Glu Ala Val Gly Ala Pro Pro70 75 80gga agg ggt gct cac ccc atc tct cct ctc ttc ccc tca cag atg tgc 1545Gly Arg Gly Ala His Pro Ile Ser Pro Leu Phe Pro Ser Gln Met Cys85 90 95acc ccc gcc aat acc cct gct aca ccc ccc aac ttc cct gac gct ctc 1593Thr Pro Ala Asn Thr Pro Ala Thr Pro Pro Asn Phe Pro Asp Ala Leu100 105 110acc atg ttc tcc cgt ctc aag gcc tcc gag agc ttc cac agc ggt ggc 1641Thr Met Phe Ser Arg Leu Lys Ala Ser Glu Ser Phe His Ser Gly Gly115 120 125 130agc ggc agc ccg atg gcc gcg aca gcc acg tca ccc ccg cca cac ttc 1689Ser Gly Ser Pro Met Ala Ala Thr Ala Thr Ser Pro Pro Pro His Phe135 140 145ccc cat gcc gcc acc agc agc tct gcg gcc tcc agc tgg ccc acg gcg 1737Pro His Ala Ala Thr Ser Ser Ser Ala Ala Ser Ser Trp Pro Thr Ala150 155 160gcc tcg ccc ccg ggg ggc cca cag cac cac cag cca cag ccg ccc ctg 1785Ala Ser Pro Pro Gly Gly Pro Gln His His Gln Pro Gln Pro Pro Leu165 170 175tgg act cca aca ccc cct tct ccg gct tca gac tgg cca ccc ctg gcc 1833Trp Thr Pro Thr Pro Pro Ser Pro Ala Ser Asp Trp Pro Pro Leu Ala180 185 190ccc caa cag gcc acc tca gaa ccc agg gcc cac cct gcc atg gag gca 1881Pro Gln Gln Ala Thr Ser Glu Pro Arg Ala His Pro Ala Met Glu Ala195 200 205 210gag aga taagggaggc ccctcccccc tcccggaggc caggacccgt ggggcggggg 1937Glu Argagaggacgtc tctgcgggcc cccttcaccc cttttctgtc tgcacccctt gttccccgga 1997gccctggagg ggagagcgcg gactctagcc aggcagggac acgtctggtg ccagaacacg 2057cagctgccca cacgcaaggt catggcccca gcggccccgg cacatggagt ggttcagagc 2117ggcctgggtg cctggcggac agaacttcag agaccacgca gccttccttc gaagacgcac 2177ctgcccagcc cagcccaggg gtgccgtgga ggaccaccct ggcggagaca ttgctgatcc 2237ctggcttgga gctccttggg ggccggcagg cctcgaaccc ccaccctagg gaatgcagag 2297cctctccgca tgtgtgcgcg tggccgtgtc tgtgtatttc tacgtgtgtc gctcttcaga 2357agcaacctag ttcctggggc agctggactt tgcatgttag tgtgagcccc cagccccctg 2417cccgccgccc cctccccagg gccctgcctc ctccccaccc cctcgtcagc cagcgttgct 2477gttccttgca gagaaaagga ttgtgggaaa ctccaggact cttcccaccg cctcccagcg 2537cctgcctgct ggggctgcct gcatgcctcc cctgcacctg ggggtacccg catccacttc 2597ctttccccct tttaacaaaa gagaagaacg aattccaaaa aaaaaaaaaa aaaaaaaaaa 2657aaaaa 2662<210>3<211>212<212>PRT<213>智人(Homo sapiens)<400>3Met Lys Leu Leu Thr Leu Gln Pro Pro Gly Met Leu Trp Leu Met Glu1 5 10 15Arg Gly Gly Leu Gly Lys Pro Arg Gly Pro Arg His Arg Gly Leu Leu20 25 30Arg Val Arg Leu Gly Arg Phe Ile Ser Arg Leu Leu Leu Trp Glu Pro35 40 45Leu Gly Gly Gly Gln Ala Trp Gly Ser Gly Glu Gly Thr Gln Ser Leu50 55 60Pro Arg Met Gly Gln Pro Arg Ile Ser Pro Trp Glu Ala Val Gly Ala65 70 75 80Pro Pro Gly Arg Gly Ala His Pro Ile Ser Pro Leu Phe Pro Ser Gln85 90 95Met Cys Thr Pro Ala Asn Thr Pro Ala Thr Pro Pro Asn Phe Pro Asp100 105 110Ala Leu Thr Met Phe Ser Arg Leu Lys Ala Ser Glu Ser Phe His Ser115 120 125Gly Gly Ser Gly Ser Pro Met Ala Ala Thr Ala Thr Ser Pro Pro Pro130 135 140His Phe Pro His Ala Ala Thr Ser Ser Ser Ala Ala Ser Ser Trp Pro145 150 155 160Thr Ala Ala Ser Pro Pro Gly Gly Pro Gln His His Gln Pro Gln Pro165 170 175Pro Leu Trp Thr Pro Thr Pro Pro Ser Pro Ala Ser Asp Trp Pro Pro180 185 190Leu Ala Pro Gln Gln Ala Thr Ser Glu Pro Arg Ala His Pro Ala Met195 200 205Glu Ala Glu Arg210<210>4<21l>3325<212>DNA<213>智人(Homo sapiens)<400>4ggccgcgcga gggtggtggg catcgaggtc ccagcagcgg acgagggagg tgccgccgtc 60gcccaggatg ggctgggaat gaagcgatgt agccttttaa gagatttgct ctgacccatc120tgaagtccat atggctctgt atgatgaaga cctcctgaaa aatcctttct atctggctct180gcaaaagtgc cgccctgact tgtgcagcaa agtggcccaa atccatggca ttgtcttagt240accctgcaaa ggaagcctgt cgagcagcat ccagtctact tgtcagtttg agtcctacat300tttgatacct gtggaagagc attttcagac cttaaatgga aaggatgtct ttattcaagg360gaacaggatt aaattaggag ctggttttgc ctgtcttctc tcagtgccca ttctctttga420agaaactttc tacaatgaaa aagaagagag tttcagcatc ctgtgtatag cccatccttt480ggaaaagaga gagagttcag aagagccttt ggcaccctca gatccctttt ccctgaaaac540cattgaagat gtgagagagt tcttgggaag acactccgag cgatttgaca ggaacatcgc600ctctttccta atcgaacatt ccgagaatgc gagagaaaga gcctccgtca ccacatagac660tcagcgaatg ctctctacac caaatgcctc cagcagcttc tgagggactc tcacctgaaa720atgctcgcca agcaggaggc ccagatgaac ctgatgaagc aggcagtgga gatatacgtc780catcatgaaa tttacaacct gatctttaaa tacgtgggga ccatggaggc aagtgaggat840gcggccttta acaaaaatca caagaagcct tcaagatctt cagcagaaag atattggtgt900gaaaccggag ttcagcttta acatacctcg tgccaaaaga gagctggctc agctgaacaa960atgcacctcc ccacagcaga agcttgtctg cttgcgaaaa gtggtgcagc tcattacaca 1020gtctccaagc cagagagtga acctggagac catgtgtgct gatgatctgc tatcagtcct 1080gttatacttg cttgtgaaaa cggagatccc taattggatg gcaaatttga gttacatcaa 1140aaacttcagg tttagcagct tggcaaagga tgaactggga tactgcctga cctcattcga 1200agctgccatt gaatatattc ggcaaggaag cctctctgct aaaccccctg taagatctca 1260cccctgccct ggccttcctt tgtgggcatc atggttccct tgatagggtg ctggggttgg 1320tatgtgggca gacggattct taaattgcct cccaggaatg gggcctcagc tgtttgaggg 1380ctgtgagtct taaaaatcac tcagtgaaga gaacaccaag cccccaattg gtggtaaaaa 1440ttggtgggtt atcattggga tttacattgt taatatccta cttcattagt ccccatcctc 1500tccaaagaca tgtgggtgca aagggaagcc agaagtaggg aatttggatt tcttgacctt 1560gatagtcaag aagtgatgtc acgggatccc tggactgtcg cttttccagc cggaaacctc 1620tgtggctggt ggctcctttg cctgagtttt gttcgggcct gctgggctca tttcacgctc 1680ttggcctggc aggctgcgct cggcttgtgc tactggcctg gatcccatgc ctgccaaggg 1740cgagccaggt gtggagtggc gaggggtatg tgagcaagtg cagggtctgg ccactgcaca 1800caaccaggtg tgccgactga ggtggggtgg gcagctccaa gttgcttgta cagggtcctg 1860ctccatgcaa ggctgcagct agagcaggcg tactgtaggc cgcttccacg gtgggcactg 1920gggaacacag tggggcctgg aagcttggag acaccaggaa ctgcagagcc ccaaagaggg 1980tgtcatagcc ctggctcggg gaactcctag gttgggctcc ctgaagggcc agagctcttc 2040tctcctctcg tcacctgcaa tgtagtgagt cgggagcatg ttttagctct ctttatgtta 2100cagctctttc agtcctgcca tttggtgggt cccgagttct tgtcccatgt cgaggaagaa 2160tgaggtacgt agactagtgg agggtgagca aggcagagag gagctttact gaacggcaga 2220atagctctca ggagacccac agtgggcagc ttctttccac aggcaggtcg tcctgacgag 2280ttaaagaggc ctgacgtagg tagctccttc ctgcagttgg tagtcccgac atctgtctga 2340gtctggctga gtccggggtt ttttatggct cagaagggag ggagtatgtg ctgattggtc 2400cataggtggg cctggagaaa gcaccatgag ttctcagtct gggccgtgga ctccacttgg 2460aactgacagc ccagccccca ggctttaggc tgtccctgtc ttgaaggtgg ggcttcactg 2520ggcacctgca cctttccacc cagaagcgtg tctgccttct gccaccatca acatgctggc 2580cagtgcatcc aggctgtttg tgccaagggg catctgcagg cctgcactga gctgccctca 2640gcccctacct tgactactct cccatgctca tcagcgccca aaatcttgga ggggctgagg 2700catcaggagg ctggtgtgtc agtgtcacac caagcatgtg cacacatggc tgggttgcaa 2760cagtacccgg gcttggcctc agctttgctc tgaaattgaa gtcggtgcca ggagtgggga 2820ggagcgggag caggcactta cgagcctgcg gcggcaggga tgcttcctgg gcccctgaga 2880gtgcagagat tcctggatcc agagctgcgg ctgggcggct gcagctgcgc ctgggagtgc 2940agggctcccg ccctgccagc tcagtaggag atgggggctc ctgcctattc ctggctcctg 3000ttggccctgc agagtgcaca accctggccg cgcttcctcc actgcagctt acgtctttgc 3060agcagccact cccgatgggc tgccactgcc atctgtgaga caattaatgt gtgcaatttg 3120aggactcagt ggccttgcca ttgtttccct tggtttttat tgagcattgg ctggggtcgg 3180cgaggggatg tgattatatt tctatgtgaa tcgtgagaat cttgaaccat agttgtcctg 3240ctggcctgtt ttactacata ccaatgagta aaatgtgatc atacagaaat cacaaagttg 3300aaatcctaaa aaaaaaaaaa aaaaa 3325<210>5<211>3325<212>DNA<213>智人(Homo sapiens)<220><221>CDS<222>(131)..(655)<400>5ggccgcgcga gggtggtggg catcgaggtc ccagcagcgg acgagggagg tgccgccgtc 60gcccaggatg ggctgggaat gaagcgatgt agccttttaa gagatttgct ctgacccatc120tgaagtccat atg gct ctg tat gat gaa gac ctc ctg aaa aat cct ttc 169Met Ala Leu Tyr Asp Glu Asp Leu Leu Lys Asn Pro Phe1 5 10tat ctg gct ctg caa aag tgc cgc cct gac ttg tgc agc aaa gtg gcc 217Tyr Leu Ala Leu Gln Lys Cys Arg Pro Asp Leu Cys Ser Lys Val Ala15 20 25caa atc cat ggc att gtc tta gta ccc tgc aaa gga agc ctg tcg agc 265Gln Ile His Gly Ile Val Leu Val Pro Cys Lys Gly Ser Leu Ser Ser30 35 40 45agc atc cag tct act tgt cag ttt gag tcc tac att ttg ata cct gtg 313Ser Ile Gln Ser Thr Cys Gln Phe Glu Ser Tyr Ile Leu Ile Pro Val50 55 60gaa gag cat ttt cag acc tta aat gga aag gat gtc ttt att caa ggg 361Glu Glu His Phe Gln Thr Leu Asn Gly Lys Asp Val Phe Ile Gln Gly65 70 75aac agg att aaa tta gga gct ggt ttt gcc tgt ctt ctc tca gtg ccc 409Asn Arg Ile Lys Leu Gly Ala Gly Phe Ala Cys Leu Leu Ser Val Pro80 85 90att ctc ttt gaa gaa act ttc tac aat gaa aaa gaa gag agt ttc agc 457Ile Leu Phe Glu Glu Thr Phe Tyr Asn Glu Lys Glu Glu Ser Phe Ser95 100 105atc ctg tgt ata gcc cat cct ttg gaa aag aga gag agt tca gaa gag 505Ile Leu Cys Ile Ala His Pro Leu Glu Lys Arg Glu Ser Ser Glu Glu110 115 120 125cct ttg gca ccc tca gat ccc ttt tcc ctg aaa acc att gaa gat gtg 553Pro Leu Ala Pro Ser Asp Pro Phe Ser Leu Lys Thr Ile Glu Asp Val130 135 140aga gag ttc ttg gga aga cac tcc gag cga ttt gac agg aac atc gcc 601Arg Glu Phe Leu Gly Arg His Ser Glu Arg Phe Asp Arg Asn Ile Ala145 150 155tct ttc cta atc gaa cat tcc gag aat gcg aga gaa aga gcc tcc gtc 649Ser Phe Leu Ile Glu His Ser Glu Asn Ala Arg Glu Arg Ala Ser Val160 165 170acc aca tagactcagc gaatgctctc tacaccaaat gcctccagca gcttctgagg 705Thr Thr175gactctcacc tgaaaatgct cgccaagcag gaggcccaga tgaacctgat gaagcaggca765gtggagatat acgtccatca tgaaatttac aacctgatct ttaaatacgt ggggaccatg825gaggcaagtg aggatgcggc ctttaacaaa aatcacaaga agccttcaag atcttcagca885gaaagatatt ggtgtgaaac cggagttcag ctttaacata cctcgtgcca aaagagagct945ggctcagctg aacaaatgca cctccccaca gcagaagctt gtctgcttgc gaaaagtggt 1005gcagctcatt acacagtctc caagccagag agtgaacctg gagaccatgt gtgctgatga 1065tctgctatca gtcctgttat acttgcttgt gaaaacggag atccctaatt ggatggcaaa 1125tttgagttac atcaaaaact tcaggtttag cagcttggca aaggatgaac tgggatactg 1185cctgacctca ttcgaagctg ccattgaata tattcggcaa ggaagcctct ctgctaaacc 1245ccctgtaaga tctcacccct gccctggcct tcctttgtgg gcatcatggt tcccttgata 1305gggtgctggg gttggtatgt gggcagacgg attcttaaat tgcctcccag gaatggggcc 1365tcagctgttt gagggctgtg agtcttaaaa atcactcagt gaagagaaca ccaagccccc 1425aattggtggt aaaaattggt gggttatcat tgggatttac attgttaata tcctacttca 1485ttagtcccca tcctctccaa agacatgtgg gtgcaaaggg aagccagaag tagggaattt 1545ggatttcttg accttgatag tcaagaagtg atgtcacggg atccctggac tgtcgctttt 1605ccagccggaa acctctgtgg ctggtggctc ctttgcctga gttttgttcg ggcctgctgg 1665gctcatttca cgctcttggc ctggcaggct gcgctcggct tgtgctactg gcctggatcc 1725catgcctgcc aagggcgagc caggtgtgga gtggcgaggg gtatgtgagc aagtgcaggg 1785tctggccact gcacacaacc aggtgtgccg actgaggtgg ggtgggcagc tccaagttgc 1845ttgtacaggg tcctgctcca tgcaaggctg cagctagagc aggcgtactg taggccgctt 1905ccacggtggg cactggggaa cacagtgggg cctggaagct tggagacacc aggaactgca 1965gagccccaaa gagggtgtca tagccctggc tcggggaact cctaggttgg gctccctgaa 2025gggccagagc tcttctctcc tctcgtcacc tgcaatgtag tgagtcggga gcatgtttta 2085gctctcttta tgttacagct ctttcagtcc tgccatttgg tgggtcccga gttcttgtcc 2145catgtcgagg aagaatgagg tacgtagact agtggagggt gagcaaggca gagaggagct 2205ttactgaacg gcagaatagc tctcaggaga cccacagtgg gcagcttctt tccacaggca 2265ggtcgtcctg acgagttaaa gaggcctgac gtaggtagct ccttcctgca gttggtagtc 2325ccgacatctg tctgagtctg gctgagtccg gggtttttta tggctcagaa gggagggagt 2385atgtgctgat tggtccatag gtgggcctgg agaaagcacc atgagttctc agtctgggcc 2445gtggactcca cttggaactg acagcccagc ccccaggctt taggctgtcc ctgtcttgaa 2505ggtggggctt cactgggcac ctgcaccttt ccacccagaa gcgtgtctgc cttctgccac 2565catcaacatg ctggccagtg catccaggct gtttgtgcca aggggcatct gcaggcctgc 2625actgagctgc cctcagcccc taccttgact actctcccat gctcatcagc gcccaaaatc 2685ttggaggggc tgaggcatca ggaggctggt gtgtcagtgt cacaccaagc atgtgcacac 2745atggctgggt tgcaacagta cccgggcttg gcctcagctt tgctctgaaa ttgaagtcgg 2805tgccaggagt ggggaggagc gggagcaggc acttacgagc ctgcggcggc agggatgctt 2865cctgggcccc tgagagtgca gagattcctg gatccagagc tgcggctggg cggctgcagc 2925tgcgcctggg agtgcagggc tcccgccctg ccagctcagt aggagatggg ggctcctgcc 2985tattcctggc tcctgttggc cctgcagagt gcacaaccct ggccgcgctt cctccactgc 3045agcttacgtc tttgcagcag ccactcccga tgggctgcca ctgccatctg tgagacaatt 3105aatgtgtgca atttgaggac tcagtggcct tgccattgtt tcccttggtt tttattgagc 3165attggctggg gtcggcgagg ggatgtgatt atatttctat gtgaatcgtg agaatcttga 3225accatagttg tcctgctggc ctgttttact acataccaat gagtaaaatg tgatcataca 3285gaaatcacaa agttgaaatc ctaaaaaaaa aaaaaaaaaa 3325<210>6<211>175<212>PRT<213>智人(Homo sapiens)<400>6Met Ala Leu Tyr Asp Glu Asp Leu Leu Lys Asn Pro Phe Tyr Leu Ala1 5 10 15Leu Gln Lys Cys Arg Pro Asp Leu Cys Ser Lys Val Ala Gln Ile His20 25 30Gly Ile Val Leu Val Pro Cys Lys Gly Ser Leu Ser Ser Ser Ile Gln35 40 45Ser Thr Cys Gln Phe Glu Ser Tyr Ile Leu Ile Pro Val Glu Glu His50 55 60Phe Gln Thr Leu Asn Gly Lys Asp Val Phe Ile Gln Gly Asn Arg Ile65 70 75 80Lys Leu Gly Ala Gly Phe Ala Cys Leu Leu Ser Val Pro Ile Leu Phe85 90 95Glu Glu Thr Phe Tyr Asn Glu Lys Glu Glu Ser Phe Ser Ile Leu Cys100 105 110Ile Ala His Pro Leu Glu Lys Arg Glu Ser Ser Glu Glu Pro Leu Ala115 120 125Pro Ser Asp Pro Phe Ser Leu Lys Thr Ile Glu Asp Val Arg Glu Phe130 135 140Leu Gly Arg His Ser Glu Arg Phe Asp Arg Asn Ile Ala Ser Phe Leu145 150 155 160Ile Glu His Ser Glu Asn Ala Arg Glu Arg Ala Ser Val Thr Thr165 170 175<210>7<211>2154<212>DNA<213>智人(Homo sapiens)<400>7gggggaatct cacagccctc acctacctca acctcagccg aaaccagctg tcgctgctgc 60caccctacat ctgccagctg cccctgaggg tcctcatcgt cagcaacaac aagctgggag120ccctgccccc tgacatcggc accctgggaa gcctgcgaca gcttgacgtg agcagcaacg180agctccaatc cctgccctcg gaactgtgtg gcctctcttc cctgcgggac ctcaatgtcc240ggaggaacca gctcagtacg ctgcccgaag agctggggga cctccctctg gtcccctgga300tttctcctgt aaccgcgtct cccgaatccc agtctccttc tgccgcctga ggcacctgca360ggtcattctg ctggacagca accctctgca gagtccacct gcccaggtct gcctgaaggg420gaaacttcac atcttcaagt atttgtccac agaggccggg cagcgtgggt cggccctggg480ggacctggcc ccttctcggc ccccgagttt cagtccctgc cctgcagagg atctatttcc540gggacatcgg tacgatggtg ggctggactc aggcttccac agcgttgata gtggcagcaa600gaggtggtct ggaaatgagt caacagatga attttcagag ctgtcattcc ggatctcaga660gctggcccgg gagccccggg gacccagaga acgcaaggag gatggctcag cggacggaga720ccctgtgcag attgacttca tcgacagcca tgtccccggg gaggatgaag agcgaggcac780tgtggaggag cagcgaccac ccgaattaag ccctggggca ggggacaggg agagggcacc840aagcagcagg cgggaggagc cggcagggga ggagcggcgg cgcccggaca ccttgcagct900gtggcaggag cgggaacggc ggcagcagca gcagagcggg gcgtgggggg ccccgaggaa960ggatagcggc tcgcctaagt ccagtgcctc ccaagcaggg gctgcagcgg ggcagggagc 1020ccccgcccct gcccctgcct cccaagagcc ccttcccata gctggaccag cgacagcacc 1080ctgctccacg gccacttggc tccattcaga gaccaaacag cttcctcttc cgttcctcct 1140ctcagagtgg ctcaggccct tcctcaccag actctgtcct gagacctcgg cggtaccccc 1200aggttccaga tgagaaggac ttaatgactc agctgcgcca ggtccttgag tcccggctgc 1260agcggcccct gcctgaggac ctggcgaggc tctggccaag tggggtcatc ctgtgccagc 1320tggccaacca gctacggccg cgctccgtgc ccttcatcca tgtgccctcc cctgctgtgc 1380caaaactcag tgccctcaag gctcggaaga atgtggagag ttttctagaa gcctgtcgaa 1440aaatgggggt gcctgaggct gacctgtgct cgccctcgga tctcctccag ggcactgccc 1500gggggctgcg gaccgcgctg gaggccgtga agcgggtggg gggcaaggcc ctaccgcccc 1560tctggccccc ctctggtctg ggcggcttcg tcgtcttcta cgtggtcctc atgctgctgc 1620tctatgtcac ctacactcgg ctcctgggtt cctaggcccc aaaatcggcc ctccctcacc 1680cctttccctt cctctctatt tataaggtcc ctgctccacc cgaccccacc tgcggtgcct 1740tcagccccaa ccaaagacac tagtgcaccc ccttcacaga cactgacctc agaggcccca 1800ctctggtgcc cccagaccct gggcccccag cctctggcct ccctccagta gccccacgag 1860tccccacctc tcagtgctga cggtgccttc atgtccccgc cggccctgcc cctgccctct 1920gtaccccgtg aggggtggca ggagctggag tctccccctt cctcctgtgc cctccccttc 1980cccccccaac agctgctatg ggggggctaa attatctcta ttttgtagag aggatctata 2040tttgtagggg ttcggggccc aggccgggtc cctatctctg tgtataaact gtacagaccg 2100tgaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaa 2154<210>8<211>2154<212>DNA<213>智人(Homo sapiens)<220><221>CDS<222>(1224)..(1652)<400>8gggggaatct cacagccctc acctacctca acctcagccg aaaccagctg tcgctgctgc 60caccctacat ctgccagctg cccctgaggg tcctcatcgt cagcaacaac aagctgggag120ccctgccccc tgacatcggc accctgggaa gcctgcgaca gcttgacgtg agcagcaacg180agctccaatc cctgccctcg gaactgtgtg gcctctcttc cctgcgggac ctcaatgtcc240ggaggaacca gctcagtacg ctgcccgaag agctggggga cctccctctg gtcccctgga300tttctcctgt aaccgcgtct cccgaatccc agtctccttc tgccgcctga ggcacctgca360ggtcattctg ctggacagca accctctgca gagtccacct gcccaggtct gcctgaaggg420gaaacttcac atcttcaagt atttgtccac agaggccggg cagcgtgggt cggccctggg480ggacctggcc ccttctcggc ccccgagttt cagtccctgc cctgcagagg atctatttcc540gggacatcgg tacgatggtg ggctggactc aggcttccac agcgttgata gtggcagcaa600gaggtggtct ggaaatgagt caacagatga attttcagag ctgtcattcc ggatctcaga660gctggcccgg gagccccggg gacccagaga acgcaaggag gatggctcag cggacggaga720ccctgtgcag attgacttca tcgacagcca tgtccccggg gaggatgaag agcgaggcac780tgtggaggag cagcgaccac ccgaattaag ccctggggca ggggacaggg agagggcacc840aagcagcagg cgggaggagc cggcagggga ggagcggcgg cgcccggaca ccttgcagct900gtggcaggag cgggaacggc ggcagcagca gcagagcggg gcgtgggggg ccccgaggaa960ggatagcggc tcgcctaagt ccagtgcctc ccaagcaggg gctgcagcgg ggcagggagc 1020ccccgcccct gcccctgcct cccaagagcc ccttcccata gctggaccag cgacagcacc 1080ctgctccacg gccacttggc tccattcaga gaccaaacag cttcctcttc cgttcctcct 1140ctcagagtgg ctcaggccct tcctcaccag actctgtcct gagacctcgg cggtaccccc 1200aggttccaga tgagaaggac tta atg act cag ctg cgc cag gtc ctt gag tcc 1253Met Thr Gln Leu Arg Gln Val Leu Glu Ser1 5 10cgg ctg cag cgg ccc ctg cct gag gac ctg gcg agg ctc tgg cca agt 1301Arg Leu Gln Arg Pro Leu Pro Glu Asp Leu Ala Arg Leu Trp Pro Ser15 20 25ggg gtc atc ctg tgc cag ctg gcc aac cag cta cgg ccg cgc tcc gtg 1349Gly Val Ile Leu Cys Gln Leu Ala Asn Gln Leu Arg Pro Arg Ser Val30 35 40ccc ttc atc cat gtg ccc tcc cct gct gtg cca aaa ctc agt gcc ctc 1397Pro Phe Ile His Val Pro Ser Pro Ala Val Pro Lys Leu Ser Ala Leu45 50 55aag gct cgg aag aat gtg gag agt ttt cta gaa gcc tgt cga aaa atg 1445Lys Ala Arg Lys Asn Val Glu Ser Phe Leu Glu Ala Cys Arg Lys Met60 65 70ggg gtg cct gag gct gac ctg tgc tcg ccc tcg gat ctc ctc cag ggc 1493Gly Val Pro Glu Ala Asp Leu Cys Ser Pro Ser Asp Leu Leu Gln Gly75 80 85 90act gcc cgg ggg ctg cgg acc gcg ctg gag gcc gtg aag cgg gtg ggg 1541Thr Ala Arg Gly Leu Arg Thr Ala Leu Glu Ala Val Lys Arg Val Gly
95 100 105ggc aag gcc cta ccg ccc ctc tgg ccc ccc tct ggt ctg ggc ggc ttc 1589Gly Lys Ala Leu Pro Pro Leu Trp Pro Pro Ser Gly Leu Gly Gly Phe110 115 120gtc gtc ttc tac gtg gtc ctc atg ctg ctg ctc tat gtc acc tac act 1637Val Val Phe Tyr Val Val Leu Met Leu Leu Leu Tyr Val Thr Tyr Thr125 130 135cgg ctc ctg ggt tcc taggccccaa aatcggccct ccctcacccc tttcccttcc 1692Arg Leu Leu Gly Ser140tctctattta taaggtccct gctccacccg accccacctg cggtgccttc agccccaacc 1752aaagacacta gtgcaccccc ttcacagaca ctgacctcag aggccccact ctggtgcccc 1812cagaccctgg gcccccagcc tctggcctcc ctccagtagc cccacgagtc cccacctctc 1872agtgctgacg gtgccttcat gtccccgccg gccctgcccc tgccctctgt accccgtgag 1932gggtggcagg agctggagtc tcccccttcc tcctgtgccc tccccttccc cccccaacag 1992ctgctatggg ggggctaaat tatctctatt ttgtagagag gatctatatt tgtaggggtt 2052cggggcccag gccgggtccc tatctctgtg tataaactgt acagaccgtg aaaaaaaaaa 2112aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 2154<210>9<211>143<212>PRT<213>智人(Homo sapiens)<400>9Met Thr Gln Leu Arg Gln Val Leu Glu Ser Arg Leu Gln Arg Pro Leu1 5 10 15Pro Glu Asp Leu Ala Arg Leu Trp Pro Ser Gly Val Ile Leu Cys Gln20 25 30Leu Ala Asn Gln Leu Arg Pro Arg Ser Val Pro Phe Ile His Val Pro35 40 45Ser Pro Ala Val Pro Lys Leu Ser Ala Leu Lys Ala Arg Lys Asn Val50 55 60Glu Ser Phe Leu Glu Ala Cys Arg Lys Met Gly Val Pro Glu Ala Asp65 70 75 80Leu Cys Ser Pro Ser Asp Leu Leu Gln Gly Thr Ala Arg Gly Leu Arg85 90 95Thr Ala Leu Glu Ala Val Lys Arg Val Gly Gly Lys Ala Leu Pro Pro100 105 110Leu Trp Pro Pro Ser Gly Leu Gly Gly Phe Val Val Phe Tyr Val Val115 120 125Leu Met Leu Leu Leu Tyr Val Thr Tyr Thr Arg Leu Leu Gly Ser130 135 140<210>10<211>4952<212>DNA<213>智人(Homo sapiens)<400>10gctaagcagt aaactaaagg attatatatt attagtctca gtggttttca gatttatttt 60taaaggggaa aacagggaaa acccatcgta tttgtaaagc actttaggat tttgccgttt120gtttctgatt gtttgaagat tagggctttt tggtgcgtgg tcacctttca cctctccttt180taggatttag tcctttccag tctgctcttt ttgtgcgtgt cacaaccata ttcttgtggt240tctggctcat attgtagaac tgctgaacat aaggagaggt agccagctgt atggtcggat300ttaatatata atgttatatg ttgggatatc ttagtggttt gttttctgag gtaagtttct360tagtgttgtg tttgagacat tgtgtttgcg tttatggcga cactgtcatt catgcacttg420gccatctgag cgtggataca gcgggcactc gggtctctct gccagatgga tgaaagcagt480gtacattcca gtgtgggaga cagacatgtg gacaggtaaa ttacaaggca gtgtgataaa540gagtagagag ttggttgaga gagatcttag acaccatcct cattgtgtag atgagaaagc600aaaagtcacc aaggcagcct ggcagctggg acccaagaag cctagggtgc cagtccttgg660gcagtgcggg gttaggcaca cccagggccc tcctggttcc tggctgactc ttggactctt720tgtctctaat tggaggccat gatgcccagc tgtaaggtgg tcagcttcat ttgagacact780atatccttta gcacagcggg gtaatttctt ccctcctgtt tcattcattt accaaatggc840ctcctaaatg atctaaaatc acttggatct tttgtctttg tggacctaac acctggcttt900taaagtttaa ctttctgtcc cctcttcagc ttgctaaaat tgaaaagtgt tgcagcccaa960cctccacaaa tcttgtctca ggaaataaga gacatttgtt aacatttgtt ttgtacctct 1020cagcagctta gttgacaagg gcaccgtgtg ggatttcctg ttcttgctca tttggaaaga 1080gaatgttctt tgttcttaga ccctcagctc tcatgtgaga gccatagaat gttgcgaggt 1140ggagttctgt ggatacagaa ggaatgtttt caagttagac ttactgccaa tgttaggatt 1200tgggactttg catgattggg agggagaggg agtgctggag aacaggttaa aagttgtccc 1260gctgagcttg gagcatctcc tgccaacccg gagtgcttcc caggaaccct gccagtgtca 1320cttggggtta tgttttctga tttggaaaca ttaagccgta tgcaggtctc ttcagaactg 1380gttcttcagc cggattgccc tggaaagcag agattgcagc tcttctaaaa ctgcctctca 1440cagaagttcc aaggccaggc taaatattga atgcagtact cagcagctgg gacacctgat 1500gctttggtgg ccatcccttt cttccatcca aaagggcccc cactggaagg catctgttgt 1560tttaaaaata ttttaggact catttttact tcccccactc cctcaagatc acatacactc 1620cccagtgggg gttacagcct cttaggagga atcgctgctt catgacttcc tcaggcattt 1680acatttttct cttctgttga cttaaatcat gaactaaaat ttatccctag aggaaaaaag 1740aatgcttcct ccattctggg ctcttctcac tgtacccaga ctatgtcttc aggactctca 1800tctcttgtca gttctgttgt gctagaaaga ctggtttgaa aaaattcagc tcgtgtaaac 1860ctgtgccctc caccctgtgg ggaacccatg tggggagcct ttgaaaatat cacttatcag 1920ctgggcgcag tggctcatgc ctgtaatccc agcacgttgg gaggatgagg tgggcggatc 1980atgaggtcag gagttcaaga ccagcctggc caacatagtg aaaccccgtc tctactaaaa 2040atacaaaaaa aaaaaaaaaa aaaaccgaga ctagttctct ctctgtctcc tgcctgaacc 2100ctcctcctct ttttgttctg atctttgagc tccctagagc ccataattct ttagagcagg 2160tatgtcccga gtctgaaaca tgcccttatt tgtcccaagc tctggacatt tctcacccca 2220aggcggatca atcatgatta aatcactcca attaaacttt aggctccagt cagaccttca 2280gccaaatgga aaaaaaaact aggggataag ggaggtagtt ggagcaagaa aatgttatta 2340gttgaaacct tacgggacct tcctccctta gtgagtctgt tggctaaagg ttctctggct 2400tcgtgaatta gaattggata ctgtttccaa gttagcaaaa ccaactctac cccagcaccc 2460cacgaggaag aatgtggaag gatctcccat tggccggttg gggcaaaagc ctgaggcaat 2520ctttcatccc cttttgccaa ggcgagactt tcccagtgac ggtgatgtag ttggccactc 2580tgactatggg tggactcggg tgtagacctc tgaagctgag atcacacgaa aacctggcct 2640ccccgccatg tagctgttgg agagtagaaa aatagagcac gcctgatgtt tctaaatgag 2700aagactttca atagtaatga agaatccatg gcactctcct caccctcaaa cacatggcag 2760tcattcacat acaggcccca aagccactgt tagtgctgca gtagctcctg tggacattgg 2820aaagcccgga gagggcgtgg aagaaatcag ctggcccccg gcaggttctc tggggttttg 2880tgcccaaggc tcctggagcc ctaaaaactt tcaaaagtta actccccacg tccccatcct 2940gcttgggttt ctggactttt ctgaggcacc ggcagagggg tctcgttgct cccttgagtg 3000taggggcagc cctttaacct ggctccttga gtccctgctt tttctgcttc tgttgccttc 3060ttcctcgtct tcctctctct caatatctcc ctctctttgt ccctccccag ttcctgacct 3120ggccatcccg gggtgccctt gaccagcccc gtgcctcctc agggtgtccc agcaccagcc 3180tggcacagag tggggctcag ttagagtatg tgggatgttg gtttcgccag gtgagtgaat 3240gaaaggactc gaccaccaca gctgagccac tagctgggcc atgcgaagag ttctaggtgc 3300aaaggctgga gggtggaatt catttttgag aggtgtgtga gcagcttccg acccctgccc 3360catttgaacg ggggccttgc tggtcgcgtc cctgcattca cctgcgcggc catcccgtca 3420tccaacagtt gatcctaact gagcacgccc acggccctgg tctggcctgg gcaccggcca 3480ccgtagccca tcccttgatg gcctctgtgt ccccaggagg gcgggccggg gggttgccca 3540ggggctggag cagtggactg tggctccata gaggtaggct ggagggtgtg agggcagatt 3600caagctatcc ccagggctct gctctggtcg gagccagccc cttctccctc tctgccttcc 3660ccgccccatt cctgatgctg aactgttctg gacccctggc cctgagtctc tcaggaccaa 3720agtgggcacg ggaacagctg tagtgtgtgc ccccccgggc tttggcacag gtctccctct 3780cgaggtgtgg ttgtgactgc gacccttccc ttgccgtgat gccttcctcc cccggggctt 3840ggtccagctc cttcactctc tagcagctgc tggggcccac ctcccatgcc gaggaccagc 3900aggggaaacc tccagggagc atctgcaggc tctgcttctg cccggctgct ggcttgctct 3960ccctggtggc tctccagcgg ccagcttcct cacccacccg gcactctgct ttgctctgtc 4020tcctgaggtg ggcctgacca acctcccctt ctctgcctca gtccctgggc tccagggctc 4080agctccacag ccctctgcct agcaggctgg ttctccctgc caagcccata cctgtggtca 4140cctggccctc ctgtggtctg agtaccactc ccctgcccca ggagccactc ccactccagc 4200tgcctgtttc cagcaggttc ccagtgtccc cgacaagccc ctgctggtgt ctccatctcc 4260tgccaagcat cctccagtgc ctcctcctgt gggcctggcc tcagggctat ggacagactc 4320ctgtcccatc ccagagaccc ctcgtgatcg tgccctggca cgtgggccgt ggcccggctg 4380ggtcggctga agaactgcgg atggaagctg cggaagaggc cctgatgggg cccaccatcc 4440cggacccaag tcttcttcct ggcgggcctc tcgtctcctt cctggtttgg gcggaagcca 4500tcacctggat gcctacgtgg gaagggacct cgaatgtggg accccagccc ctctccagct 4560cgaaatccct ccacagccac ggggacaccc tgcacctatt cccacgggac aggctggacc 4620caaagactct ggacccgggg cctccccttg agtagagacc cgccctctga ctgatggacg 4680ccgctgacct ggggtcagac ccgtgggctg gacccctgcc caccccgcag gaaccctgag 4740gcctagggga gctgttgagc cttcagtgtc tgcatgtggg aagtgggctc cttcacctac 4800ctcacagggc tgttgtgagg ggcgctgtga tgcggttcca aagcacaggg cttggcgcac 4860ccccctgtgc tctcaataaa tgtgtttcct gtcttaaaaa aaaaaaaaaa aaaaaaaaaa 4920aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 4952<210>11<211>4952<212>DNA<213>智人(Homo sapiens)<220><221>CDS<222>(2696)..(3139)<400>11gctaagcagt aaactaaagg attatatatt attagtctca gtggttttca gatttatttt 60taaaggggaa aacagggaaa acccatcgta tttgtaaagc actttaggat tttgccgttt120gtttctgatt gtttgaagat tagggctttt tggtgcgtgg tcacctttca cctctccttt180taggatttag tcctttccag tctgctcttt ttgtgcgtgt cacaaccata ttcttgtggt240tctggctcat attgtagaac tgctgaacat aaggagaggt agccagctgt atggtcggat300ttaatatata atgttatatg ttgggatatc ttagtggttt gttttctgag gtaagtttct360tagtgttgtg tttgagacat tgtgtttgcg tttatggcga cactgtcatt catgcacttg420gccatctgag cgtggataca gcgggcactc gggtctctct gccagatgga tgaaagcagt480gtacattcca gtgtgggaga cagacatgtg gacaggtaaa ttacaaggca gtgtgataaa540gagtagagag ttggttgaga gagatcttag acaccatcct cattgtgtag atgagaaagc600aaaagtcacc aaggcagcct ggcagctggg acccaagaag cctagggtgc cagtccttgg660gcagtgcggg gttaggcaca cccagggccc tcctggttcc tggctgactc ttggactctt720tgtctctaat tggaggccat gatgcccagc tgtaaggtgg tcagcttcat ttgagacact780atatccttta gcacagcggg gtaatttctt ccctcctgtt tcattcattt accaaatggc840ctcctaaatg atctaaaatc acttggatct tttgtctttg tggacctaac acctggcttt900taaagtttaa ctttctgtcc cctcttcagc ttgctaaaat tgaaaagtgt tgcagcccaa960cctccacaaa tcttgtctca ggaaataaga gacatttgtt aacatttgtt ttgtacctct 1020cagcagctta gttgacaagg gcaccgtgtg ggatttcctg ttcttgctca tttggaaaga 1080gaatgttctt tgttcttaga ccctcagctc tcatgtgaga gccatagaat gttgcgaggt 1140ggagttctgt ggatacagaa ggaatgtttt caagttagac ttactgccaa tgttaggatt 1200tgggactttg catgattggg agggagaggg agtgctggag aacaggttaa aagttgtccc 1260gctgagcttg gagcatctcc tgccaacccg gagtgcttcc caggaaccct gccagtgtca 1320cttggggtta tgttttctga tttggaaaca ttaagccgta tgcaggtctc ttcagaactg 1380gttcttcagc cggattgccc tggaaagcag agattgcagc tcttctaaaa ctgcctctca 1440cagaagttcc aaggccaggc taaatattga atgcagtact cagcagctgg gacacctgat 1500gctttggtgg ccatcccttt cttccatcca aaagggcccc cactggaagg catctgttgt 1560tttaaaaata ttttaggact catttttact tcccccactc cctcaagatc acatacactc 1620cccagtgggg gttacagcct cttaggagga atcgctgctt catgacttcc tcaggcattt 1680acatttttct cttctgttga cttaaatcat gaactaaaat ttatccctag aggaaaaaag 1740aatgcttcct ccattctggg ctcttctcac tgtacccaga ctatgtcttc aggactctca 1800tctcttgtca gttctgttgt gctagaaaga ctggtttgaa aaaattcagc tcgtgtaaac 1860ctgtgccctc caccctgtgg ggaacccatg tggggagcct ttgaaaatat cacttatcag 1920ctgggcgcag tggctcatgc ctgtaatccc agcacgttgg gaggatgagg tgggcggatc 1980atgaggtcag gagttcaaga ccagcctggc caacatagtg aaaccccgtc tctactaaaa 2040atacaaaaaa aaaaaaaaaa aaaaccgaga ctagttctct ctctgtctcc tgcctgaacc 2100ctcctcctct ttttgttctg atctttgagc tccctagagc ccataattct ttagagcagg 2160tatgtcccga gtctgaaaca tgcccttatt tgtcccaagc tctggacatt tctcacccca 2220aggcggatca atcatgatta aatcactcca attaaacttt aggctccagt cagaccttca 2280gccaaatgga aaaaaaaact aggggataag ggaggtagtt ggagcaagaa aatgttatta 2340gttgaaacct tacgggacct tcctccctta gtgagtctgt tggctaaagg ttctctggct 2400tcgtgaatta gaattggata ctgtttccaa gttagcaaaa ccaactctac cccagcaccc 2460cacgaggaag aatgtggaag gatctcccat tggccggttg gggcaaaagc ctgaggcaat 2520ctttcatccc cttttgccaa ggcgagactt tcccagtgac ggtgatgtag ttggccactc 2580tgactatggg tggactcggg tgtagacctc tgaagctgag atcacacgaa aacctggcct 2640ccccgccatg tagctgttgg agagtagaaa aatagagcac gcctgatgtt tctaa atg2698Met1aga aga ctt tca ata gta atg aag aat cca tgg cac tct cct cac cct 2746Arg Arg Leu Ser Ile Val Met Lys Asn Pro Trp His Ser Pro His Pro5 10 15caa aca cat ggc agt cat tca cat aca ggc ccc aaa gcc act gtt agt 2794Gln Thr His Gly Ser His Ser His Thr Gly Pro Lys Ala Thr Val Ser20 25 30gct gca gta gct cct gtg gac att gga aag ccc gga gag ggc gtg gaa 2842Ala Ala Val Ala Pro Val Asp Ile Gly Lys Pro Gly Glu Gly Val Glu35 40 45gaa atc agc tgg ccc ccg gca ggt tct ctg ggg ttt tgt gcc caa ggc 2890Glu Ile Ser Trp Pro Pro Ala Gly Ser Leu Gly Phe Cys Ala Gln Gly50 55 60 65tcc tgg agc cct aaa aac ttt caa aag tta act ccc cac gtc ccc atc 2938Ser Trp Ser Pro Lys Asn Phe Gln Lys Leu Thr Pro His Val Pro Ile70 75 80ctg ctt ggg ttt ctg gac ttt tct gag gca ccg gca gag ggg tct cgt 2986Leu Leu Gly Phe Leu Asp Phe Ser Glu Ala Pro Ala Glu Gly Ser Arg85 90 95tgc tcc ctt gag tgt agg ggc agc cct tta acc tgg ctc ctt gag tcc 3034Cys Ser Leu Glu Cys Arg Gly Ser Pro Leu Thr Trp Leu Leu Glu Ser100 105 110ctg ctt ttt ctg ctt ctg ttg cct tct tcc tcg tct tcc tct ctc tca 3082Leu Leu Phe Leu Leu Leu Leu Pro Ser Ser Ser Ser Ser Ser Leu Ser115 120 125ata tct ccc tct ctt tgt ccc tcc cca gtt cct gac ctg gcc atc ccg 3130Ile Ser Pro Ser Leu Cys Pro Ser Pro Val Pro Asp Leu Ala Ile Pro130 135 140 145ggg tgc cct tgaccagccc cgtgcctcct cagggtgtcc cagcaccagc 3179Gly Cys Proctggcacaga gtggggctca gttagagtat gtgggatgtt ggtttcgcca ggtgagtgaa 3239tgaaaggact cgaccaccac agctgagcca ctagctgggc catgcgaaga gttctaggtg 3299caaaggctgg agggtggaat tcatttttga gaggtgtgtg agcagcttcc gacccctgcc 3359ccatttgaac gggggccttg ctggtcgcgt ccctgcattc acctgcgcgg ccatcccgtc 3419atccaacagt tgatcctaac tgagcacgcc cacggccctg gtctggcctg ggcaccggcc 3479accgtagccc atcccttgat ggcctctgtg tccccaggag ggcgggccgg ggggttgccc 3539aggggctgga gcagtggact gtggctccat agaggtaggc tggagggtgt gagggcagat 3599tcaagctatc cccagggctc tgctctggtc ggagccagcc ccttctccct ctctgccttc 3659cccgccccat tcctgatgct gaactgttct ggacccctgg ccctgagtct ctcaggacca 3719aagtgggcac gggaacagct gtagtgtgtg cccccccggg ctttggcaca ggtctccctc 3779tcgaggtgtg gttgtgactg cgacccttcc cttgccgtga tgccttcctc ccccggggct 3839tggtccagct ccttcactct ctagcagctg ctggggccca cctcccatgc cgaggaccag 3899caggggaaac ctccagggag catctgcagg ctctgcttct gcccggctgc tggcttgctc 3959tccctggtgg ctctccagcg gccagcttcc tcacccaccc ggcactctgc tttgctctgt 4019ctcctgaggt gggcctgacc aacctcccct tctctgcctc agtccctggg ctccagggct 4079cagctccaca gccctctgcc tagcaggctg gttctccctg ccaagcccat acctgtggtc 4139acctggccct cctgtggtct gagtaccact cccctgcccc aggagccact cccactccag 4199ctgcctgttt ccagcaggtt cccagtgtcc ccgacaagcc cctgctggtg tctccatctc 4259ctgccaagca tcctccagtg cctcctcctg tgggcctggc ctcagggcta tggacagact 4319cctgtcccat cccagagacc cctcgtgatc gtgccctggc acgtgggccg tggcccggct 4379gggtcggctg aagaactgcg gatggaagct gcggaagagg ccctgatggg gcccaccatc 4439ccggacccaa gtcttcttcc tggcgggcct ctcgtctcct tcctggtttg ggcggaagcc 4499atcacctgga tgcctacgtg ggaagggacc tcgaatgtgg gaccccagcc cctctccagc 4559tcgaaatccc tccacagcca cggggacacc ctgcacctat tcccacggga caggctggac 4619ccaaagactc tggacccggg gcctcccctt gagtagagac ccgccctctg actgatggac 4679gccgctgacc tggggtcaga cccgtgggct ggacccctgc ccaccccgca ggaaccctga 4739ggcctagggg agctgttgag ccttcagtgt ctgcatgtgg gaagtgggct ccttcaccta 4799cctcacaggg ctgttgtgag gggcgctgtg atgcggttcc aaagcacagg gcttggcgca 4859cccccctgtg ctctcaataa atgtgtttcc tgtcttaaaa aaaaaaaaaa aaaaaaaaaa 4919aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa4952<210>12<211>148<212>PRT<213>智人(Homo sapiens)<400>12Met Arg Arg Leu Ser Ile Val Met Lys Asn Pro Trp His Ser Pro His1 5 10 15Pro Gln Thr His Gly Ser His Ser His Thr Gly Pro Lys Ala Thr Val20 25 30Ser Ala Ala Val Ala Pro Val Asp Ile Gly Lys Pro Gly Glu Gly Val35 40 45Glu Glu Ile Ser Trp Pro Pro Ala Gly Ser Leu Gly Phe Cys Ala Gln50 55 60Gly Ser Trp Ser Pro Lys Asn Phe Gln Lys Leu Thr Pro His Val Pro65 70 75 80Ile Leu Leu Gly Phe Leu Asp Phe Ser Glu Ala Pro Ala Glu GIy Ser85 90 95Arg Cys Ser Leu Glu Cys Arg Gly Ser Pro Leu Thr Trp Leu Leu Glu100 105 110Ser Leu Leu Phe Leu Leu Leu Leu Pro Ser Ser Ser Ser Ser Ser Leu115 120 125Ser Ile Ser Pro Ser Leu Cys Pro Ser Pro Val Pro Asp Leu Ala Ile130 135 140Pro Gly Cys Pro145<210>13<211>3112<212>DNA<213>智人(Homo sapiens)<400>13gcgacggcga gagctagagc gggcgcagcg ttagggtggc cgtgcaaggg gagccgtggc 60ccgggcccgg ggcgtgcgag acggcggaag cagcccaggg ccttgctgcc gccatgactg120aggaatcaga ggagacagtc ctgtacattg agcaccgcta tgtctgctct gagtgcaacc180agctgtatgg atcactggaa gaggtgctta tgcaccaaaa ctcccacgtg ccccagcagc240actttgagct ggtgggcgtg gctgatcccg gagtcactgt ggccacagac acagcttcag300gcacgggcct ctatcagacc cttgtgcagg agagccagta ccagtgcctg gagtgtggtc360aactgctgat gtcacccagc cagctcctgg agcaccagga gctgcacctg aagatgatgg420caccccagga ggcagtgcca gctgagccat cacctaaggc accacccctg agctccagca480ccatccacta cgagtgtgtg gattgcaagg ctctctttgc cagccaggag ctctggctga540accaccggca gacgcacctc cgggccacac ccaccaaggc tcctgcccct gttgtcctgg600ggtccccagt tgttctaggg cctcctgtgg gccaggcccg agtggctgtg gagcactcat660accgaaaggc agaagagggt ggggaagggg cgactgtccc atctgccgct gccaccacca720ctgaggtagt gactgaggtg gagctgctcc tctacaagtg ctctgagtgc tcccagctct780tccagctgcc ggcggatttc ctggagcacc aggccactca cttccctgct cctgtacccg840agtctcagga gcctgcctta cagcaggagg tgcaggcctc gtcacctgca gaggtgcctg900tgtctcagcc tgaccccttg ccagcttctg accacagtta cgagctgcgc aatggtgaag960ccattgggcg ggatcgccgg gggcgcaggg cccggaggaa caacagtgga gaagcaggcg 1020gggcagccac acaggagctc ttctgctcag cctgtgacca gctctttctc tcaccccacc 1080agctacagca gcacctgcgg agtcaccggg agggcgtctt taagtgcccc ctgtgcagtc 1140gtgtcttccc tagcccttcc agtctggacc agcaccttgg agaccatagc agcgagtcac 1200acttcctgtg tgtagactgt ggcctggcct tcggcacaga ggccctcctc ctggcccacc 1260ggcgagccca caccccgaat cctctgcatt catgtccatg tgggaagacc tttgtcaacc 1320ttaccaagtt cctttatcac cggcgtactc atggggtagg gggggtgtcc ctctgcccac 1380aacaccagtc ccaccagagg aacctgtcat tggtttccct gagccagccc cagcagagac 1440tggagagcca gaggcccctg agccccctgt gtctgaggag acctcagcag ggcccgctgc 1500cccaggcacc taccgctgcc tcctgtgcag ccgtgaattt ggaaaggcct tgcagctgac 1560ccggcaccaa cgttttgtgc atcggctgga gcggcgccat aaatgcagca tttgtggcaa 1620gatgttcaag aagaagtctc acgtgcgtaa ccacctgcgc acacacacag gggagcggcc 1680cttcccctgc cctgactgct ccaagccctt caactcacct gccaacctgg cccgccaccg 1740gctcacacac acaggagagc ggccctaccg gtgtggggac tgtggcaagg ctttcacgca 1800aagctccaca ctgaggcagc accgcttggt gcatgcccag cactttccct accgctgcca 1860ggaatgtggg gtgcgttttc accgtcctta ccggctgctc atgcaccgct accatcacac 1920aggtgaatac ccctacaagt gtcgcgagtg cccccgctcc ttcttgctgc gtcggctgct 1980ggaggtgcac cagctcgtgg tccatgccgg gcgccagccc caccgctgcc catcctgtgg 2040ggctgccttc ccctcctcac tgcggctccg ggagcaccgc tgtgcagccg ctgctgccca 2100ggccccacgg cgctttgagt gtggcacctg tggcaagaaa gtgggctcag ctgctcgact 2160gcaggcacac gaggcggccc atgcagctgc tgggcctgga gaggtcctgg ctaaggagcc 2220ccctgcccct cgagccccac gggccactcg tgcaccagtt gcctctccag cagcccttgg 2280aagcactgct acagcatccc ctgcggcccc tgcccgccgc cggggtctag agtgcagcga 2340gtgcaagaag ctgttcagca cagagacgtc actgcaggtg caccggcgca tccacacagg 2400tgagcggcca tacccatgtc cagactgtgg caaagcgttc cgtcagagta cccacctgaa 2460agacaccggc gcctgcacac aggtgagcgg ccctttgcct gtgaagtgtg tggcaaggcc 2520tttgccatct ccatgcgcct ggcagaacat cgccgcatcc acacaggcga acgaccctac 2580tcctgccctg actgtggcaa gagctaccgc tccttctcca acctctggaa gcaccgcaag 2640acccatcagc agcagcatca ggcagctgtg cggcagcagc tggcagaggc ggaggctgcc 2700gttggcctgg ccgtcatgga gactgctgtg gaggcgctac ccctggtgga agccattgag 2760atctaccctc tggccgaggc tgagggggtc cagatcagtg gctgactctg cccgacttcc 2820tctttggcac ctccattccc tgttgctgaa ggccctccag catcccctta agcatctgta 2880catactgtgt cccttcctct tcccatcccc accaccttgt aagttctaaa ttggatttat 2940tctctcgtga ggggggtgct ctggggtcct tgacacacat aaaggtgccc ccccaccttc 3000cacctcttag cactggtgac cccaaaaatg aaaccatcaa taaagactga gttgccagca 3060gtgtgtagag tggaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 3112<210>14<211>3112<212>DNA<213>智人(Homo sapiens)<220><221>CDS<222>(1292)..(2926)<400>14gcgacggcga gagctagagc gggcgcagcg ttagggtggc cgtgcaaggg gagccgtggc 60ccgggcccgg ggcgtgcgag acggcggaag cagcccaggg ccttgctgcc gccatgactg120aggaatcaga ggagacagtc ctgtacattg agcaccgcta tgtctgctct gagtgcaacc180agctgtatgg atcactggaa gaggtgctta tgcaccaaaa ctcccacgtg ccccagcagc240actttgagct ggtgggcgtg gctgatcccg gagtcactgt ggccacagac acagcttcag300gcacgggcct ctatcagacc cttgtgcagg agagccagta ccagtgcctg gagtgtggtc360aactgctgat gtcacccagc cagctcctgg agcaccagga gctgcacctg aagatgatgg420caccccagga ggcagtgcca gctgagccat cacctaaggc accacccctg agctccagca480ccatccacta cgagtgtgtg gattgcaagg ctctctttgc cagccaggag ctctggctga540accaccggca gacgcacctc cgggccacac ccaccaaggc tcctgcccct gttgtcctgg600ggtccccagt tgttctaggg cctcctgtgg gccaggcccg agtggctgtg gagcactcat660accgaaaggc agaagagggt ggggaagggg cgactgtccc atctgccgct gccaccacca720ctgaggtagt gactgaggtg gagctgctcc tctacaagtg ctctgagtgc tcccagctct780tccagctgcc ggcggatttc ctggagcacc aggccactca cttccctgct cctgtacccg840agtctcagga gcctgcctta cagcaggagg tgcaggcctc gtcacctgca gaggtgcctg900tgtctcagcc tgaccccttg ccagcttctg accacagtta cgagctgcgc aatggtgaag960ccattgggcg ggatcgccgg gggcgcaggg cccggaggaa caacagtgga gaagcaggcg 1020gggcagccac acaggagctc ttctgctcag cctgtgacca gctctttctc tcaccccacc 1080agctacagca gcacctgcgg agtcaccggg agggcgtctt taagtgcccc ctgtgcagtc 1140gtgtcttccc tagcccttcc agtctggacc agcaccttgg agaccatagc agcgagtcac 1200acttcctgtg tgtagactgt ggcctggcct tcggcacaga ggccctcctc ctggcccacc 1260ggcgagccca caccccgaat cctctgcatt c atg tcc atg tgg gaa gac ctt 1312Met Ser Met Trp Glu Asp Leu1 5tgt caa cct tac caa gtt cct tta tca ccg gcg tac tca tgg ggt agg 1360Cys Gln Pro Tyr Gln Val Pro Leu Ser Pro Ala Tyr Ser Trp Gly Arg10 15 20ggg ggt gtc cct ctg ccc aca aca cca gtc cca cca gag gaa cct gtc 1408Gly Gly Val Pro Leu Pro Thr Thr Pro Val Pro Pro Glu Glu Pro Val25 30 35att ggt ttc cct gag cca gcc cca gca gag act gga gag cca gag gcc 1456Ile Gly Phe Pro Glu Pro Ala Pro Ala Glu Thr Gly Glu Pro Glu Ala40 45 50 55cct gag ccc cct gtg tct gag gag acc tca gca ggg ccc gct gcc cca 1504Pro Glu Pro Pro Val Ser Glu Glu Thr Ser Ala Gly Pro Ala Ala Pro60 65 70ggc acc tac cgc tgc ctc ctg tgc agc cgt gaa ttt gga aag gcc ttg 1552Gly Thr Tyr Arg Cys Leu Leu Cys Ser Arg Glu Phe Gly Lys Ala Leu75 80 85cag ctg acc cgg cac caa cgt ttt gtg cat cgg ctg gag cgg cgc cat 1600Gln Leu Thr Arg His Gln Arg Phe Val His Arg Leu Glu Arg Arg His90 95 100aaa tgc agc att tgt ggc aag atg ttc aag aag aag tct cac gtg cgt 1648Lys Cys Ser Ile Cys Gly Lys Met Phe Lys Lys Lys Ser His Val Arg105 110 115aac cac ctg cgc aca cac aca ggg gag cgg ccc ttc ccc tgc cct gac 1696Asn His Leu Arg Thr His Thr Gly Glu Arg Pro Phe Pro Cys Pro Asp120 125 130 135tgc tcc aag ccc ttc aac tca cct gcc aac ctg gcc cgc cac cgg ctc 1744Cys Ser Lys Pro Phe Asn Ser Pro Ala Asn Leu Ala Arg His Arg Leu140 145 150aca cac aca gga gag cgg ccc tac cgg tgt ggg gac tgt ggc aag gct 1792Thr His Thr Gly Glu Arg Pro Tyr Arg Cys Gly Asp Cys Gly Lys Ala155 160 165ttc acg caa agc tcc aca ctg agg cag cac cgc ttg gtg cat gcc cag 1840Phe Thr Gln Ser Ser Thr Leu Arg Gln His Arg Leu Val His Ala Gln170 175 180cac ttt ccc tac cgc tgc cag gaa tgt ggg gtg cgt ttt cac cgt cct 1888His Phe Pro Tyr Arg Cys Gln Glu Cys Gly Val Arg Phe His Arg Pro185 190 195tac cgg ctg ctc atg cac cgc tac cat cac aca ggt gaa tac ccc tac 1936Tyr Arg Leu Leu Met His Arg Tyr His His Thr Gly Glu Tyr Pro Tyr200 205 210 215aag tgt cgc gag tgc ccc cgc tcc ttc ttg ctg cgt cgg ctg ctg gag 1984Lys Cys Arg Glu Cys Pro Arg Ser Phe Leu Leu Arg Arg Leu Leu Glu220 225 230gtg cac cag ctc gtg gtc cat gcc ggg cgc cag ccc cac cgc tgc cca 2032Val His Gln Leu Val Val His Ala Gly Arg Gln Pro His Arg Cys Pro235 240 245tcc tgt ggg gct gcc ttc ccc tcc tca ctg cgg ctc cgg gag cac cgc 2080Ser Cys Gly Ala Ala Phe Pro Ser Ser Leu Arg Leu Arg Glu His Arg250 255 260tgt gca gcc gct gct gcc cag gcc cca cgg cgc ttt gag tgt ggc acc 2128Cys Ala Ala Ala Ala Ala Gln Ala Pro Arg Arg Phe Glu Cys Gly Thr265 270 275tgt ggc aag aaa gtg ggc tca gct gct cga ctg cag gca cac gag gcg 2176Cys Gly Lys Lys Val Gly Ser Ala Ala Arg Leu Gln Ala His Glu Ala280 285 290 295gcc cat gca gct gct ggg cct gga gag gtc ctg gct aag gag ccc cct 2224Ala His Ala Ala Ala Gly Pro Gly Glu Val Leu Ala Lys Glu Pro Pro300 305 310gcc cct cga gcc cca cgg gcc act cgt gca cca gtt gcc tct cca gca 2272Ala Pro Arg Ala Pro Arg Ala Thr Arg Ala Pro Val Ala Ser Pro Ala315 320 325gcc ctt gga agc act gct aca gca tcc cct gcg gcc cct gcc cgc cgc 2320Ala Leu Gly Ser Thr Ala Thr Ala Ser Pro Ala Ala Pro Ala Arg Arg330 335 340cgg ggt cta gag tgc agc gag tgc aag aag ctg ttc agc aca gag acg 2368Arg Gly Leu Glu Cys Ser Glu Cys Lys Lys Leu Phe Ser Thr Glu Thr345 350 355tca ctg cag gtg cac cgg cgc atc cac aca ggt gag cgg cca tac cca 2416Ser Leu Gln Val His Arg Arg Ile His Thr Gly Glu Arg Pro Tyr Pro360 365 370 375tgt cca gac tgt ggc aaa gcg ttc cgt cag agt acc cac ctg aaa gac 2464Cys Pro Asp Cys Gly Lys Ala Phe Arg Gln Ser Thr His Leu Lys Asp380 385 390acc ggc gcc tgc aca cag gtg agc ggc cct ttg cct gtg aag tgt gtg 2512Thr Gly Ala Cys Thr Gln Val Ser Gly Pro Leu Pro Val Lys Cys Val395 400 405gca agg cct ttg cca tct cca tgc gcc tgg cag aac atc gcc gca tcc 2560Ala Arg Pro Leu Pro Ser Pro Cys Ala Trp Gln Asn Ile Ala Ala Ser410 415 420aca cag gcg aac gac cct act cct gcc ctg act gtg gca aga gct acc 2608Thr Gln Ala Asn Asp Pro Thr Pro Ala Leu Thr Val Ala Arg Ala Thr425 430 435gct cct tct cca acc tct gga agc acc gca aga ccc atc agc agc agc 2656Ala Pro Ser Pro Thr Ser Gly Ser Thr Ala Arg Pro Ile Ser Ser Ser440 445 450 455atc agg cag ctg tgc ggc agc agc tgg cag agg cgg agg ctg ccg ttg 2704Ile Arg Gln Leu Cys Gly Ser Ser Trp Gln Arg Arg Arg Leu Pro Leu460 465 470gcc tgg ccg tca tgg aga ctg ctg tgg agg cgc tac ccc tgg tgg aag 2752Ala Trp Pro Ser Trp Arg Leu Leu Trp Arg Arg Tyr Pro Trp Trp Lys475 480 485cca ttg aga tct acc ctc tgg ccg agg ctg agg ggg tcc aga tca gtg 2800Pro Leu Arg Ser Thr Leu Trp Pro Arg Leu Arg Gly Ser Arg Ser Val
490 495 500gct gac tct gcc cga ctt cct ctt tgg cac ctc cat tcc ctg ttg ctg 2848Ala Asp Ser Ala Arg Leu Pro Leu Trp His Leu His Ser Leu Leu Leu505 510 515aag gcc ctc cag cat ccc ctt aag cat ctg tac ata ctg tgt ccc ttc 2896Lys Ala Leu Gln His Pro Leu Lys His Leu Tyr Ile Leu Cys Pro Phe520 525 530 535ctc ttc cca tcc cca cca cct tgt aag ttc taaattggat ttattctctc 2946Leu Phe Pro Ser Pro Pro Pro Cys Lys Phe540 545gtgagggggg tgctctgggg tccttgacac acataaaggt gcccccccac cttccacctc 3006ttagcactgg tgaccccaaa aatgaaacca tcaataaaga ctgagttgcc agcagtgtgt 3066agagtggaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaa 3112<210>15<211>545<212>PRT<213>智人(Homo sapiens)<400>15Met Ser Met Trp Glu Asp Leu Cys Gln Pro Tyr Gln Val Pro Leu Ser1 5 10 15Pro Ala Tyr Ser Trp Gly Arg Gly Gly Val Pro Leu Pro Thr Thr Pro20 25 30Val Pro Pro Glu Glu Pro Val Ile Gly Phe Pro Glu Pro Ala Pro Ala35 40 45Glu Thr Gly Glu Pro Glu Ala Pro Glu Pro Pro Val Ser Glu Glu Thr50 55 60Ser Ala Gly Pro Ala Ala Pro Gly Thr Tyr Arg Cys Leu Leu Cys Ser65 70 75 80Arg Glu Phe Gly Lys Ala Leu Gln Leu Thr Arg His Gln Arg Phe Val85 90 95His Arg Leu Glu Arg Arg His Lys Cys Ser Ile Cys Gly Lys Met Phe100 105 110Lys Lys Lys Ser His Val Arg Asn His Leu Arg Thr His Thr Gly Glu115 120 125Arg Pro Phe Pro Cys Pro Asp Cys Ser Lys Pro Phe Asn Ser Pro Ala130 135 140Asn Leu Ala Arg His Arg Leu Thr His Thr Gly Glu Arg Pro Tyr Arg145 150 155 160Cys Gly Asp Cys Gly Lys Ala Phe Thr Gln Ser Ser Thr Leu Arg Gln165 170 175His Arg Leu Val His Ala Gln His Phe Pro Tyr Arg Cys Gln Glu Cys180 185 190Gly Val Arg Phe His Arg Pro Tyr Arg Leu Leu Met His Arg Tyr His195 200 205His Thr Gly Glu Tyr Pro Tyr Lys Cys Arg Glu Cys Pro Arg Ser Phe210 215 220Leu Leu Arg Arg Leu Leu Glu Val His Gln Leu Val Val His Ala Gly225 230 235 240Arg Gln Pro His Arg Cys Pro Ser Cys Gly Ala Ala Phe Pro Ser Ser
245 250 255Leu Arg Leu Arg Glu His Arg Cys Ala Ala Ala Ala Ala Gln Ala Pro260 265 270Arg Arg Phe Glu Cys Gly Thr Cys Gly Lys Lys Val Gly Ser Ala Ala275 280 285Arg Leu Gln Ala His Glu Ala Ala His Ala Ala Ala Gly Pro Gly Glu290 295 300Val Leu Ala Lys Glu Pro Pro Ala Pro Arg Ala Pro Arg Ala Thr Arg305 310 315 320Ala Pro Val Ala Ser Pro Ala Ala Leu Gly Ser Thr Ala Thr Ala Ser325 330 335Pro Ala Ala Pro Ala Arg Arg Arg Gly Leu Glu Cys Ser Glu Cys Lys340 345 350Lys Leu Phe Ser Thr Glu Thr Ser Leu Gln Val His Arg Arg Ile His355 360 365Thr Gly Glu Arg Pro Tyr Pro Cys Pro Asp Cys Gly Lys Ala Phe Arg370 375 380Gln Ser Thr His Leu Lys Asp Thr Gly Ala Cys Thr Gln Val Ser Gly385 390 395 400Pro Leu Pro Val Lys Cys Val Ala Arg Pro Leu Pro Ser Pro Cys Ala405 410 415Trp Gln Asn Ile Ala Ala Ser Thr Gln Ala Asn Asp Pro Thr Pro Ala420 425 430Leu Thr Val Ala Arg Ala Thr Ala Pro Ser Pro Thr Ser Gly Ser Thr435 440 445Ala Arg Pro Ile Ser Ser Ser Ile Arg Gln Leu Cys Gly Ser Ser Trp450 455 460Gln Arg Arg Arg Leu Pro Leu Ala Trp Pro Ser Trp Arg Leu Leu Trp465 470 475 480Arg Arg Tyr Pro Trp Trp Lys Pro Leu Arg Ser Thr Leu Trp Pro Arg485 490 495Leu Arg Gly Ser Arg Ser Val Ala Asp Ser Ala Arg Leu Pro Leu Trp500 505 510His Leu His Ser Leu Leu Leu Lys Ala Leu Gln His Pro Leu Lys His515 520 525Leu Tyr Ile Leu Cys Pro Phe Leu Phe Pro Ser Pro Pro Pro Cys Lys530 535 540Phe545<210>16<211>3102<212>DNA<213>智人(Homo sapiens)<400>16gggcagaggt tgcagtaacc caagatcatg ccaccatact acagactgtg tgacagagcg 60agactctgtc tcaaaacaac aacaaaaaaa caaactcacc attgtacctg tgcttatgca120aggtttagta ggaacgtaaa ttggtttaac ctttgtggac agaagtttta aaaatatata180ttaaaattaa aagtatgctc tgaaggagga actccacttc tggtaattta tctcaagaga240ataactgggc cagcacaaag gctgctgttt aacaatgtgt aatgatgcag tgacagctac300aattgcaaaa ataacctaga cattcaccaa tgaggactgg ttaaatgaac tagtataacc360atactgcaga atatcataaa gataacaaaa aaatgatatg gatctgtttc ttggcataaa420tatatccata agttttaaga agagatgcta tatatacggt ggtcccattg atgtataact480gttaggacta aaaatagtac cttcctcata atgatgtttt gaggaattaa tgagtttatt540catgcaaaat gcttagaatg gtacctggca cacagacaat gtttaagaaa tgtttgttat600tgttattact atgtctctgt atatatgcat aggaaaaatc tggaaggata aaataaaaaa660tgaatatttt tgggtggtga gactaaattt ttgtctaatt ttatggataa gttttatgat720ttatatttat aataaaaata aagctataaa aattaattat gatgtttctt gctcatgtca780gctacttcac tacatactga gttcccatcc ccatttgtta caggagcaac tcctggttaa840gtaccttttt tgtaactgtg aaattccctt gacattcatc atatactgat gacttttcct900aatacatgga aacaaacagg attgtgattt ttctctcatt ttgtacacta agttctatgc960cagccgattt cagagagaca ctctgcaaag ttcctatgaa aagtcttcaa aaatgtatta 1020ccttgctgtt taataccaat accaaaattc aaatggactt atcaattaaa ctcacctcaa 1080acacagtaat gcactcacag ttatgagcag tgctcactac tgccaatcat ttctgcttcc 1140agaatggtta aaggagccac aaactctgcc cttatcagaa gcagtagcct gataacaggt 1200aagaatagga atgttccgtt tctccccaaa ttaagagtgg tatcaataat ctgacttttc 1260caggcattta tctcacagaa atgtttatga gacatgctaa gatcaacatg gtaatatctg 1320actattgttt ttattagaaa taagggggcc agccaggcac agtagcttac acctgtaatc 1380ccagcacctg gggaggttga ggtgggagga ttgcttgagc ccaggagttt gagacaagcc 1440tgggcaacac agggagacac cagctctatt aaaaaaaaaa aaagtaaggg ggctataatg 1500taacccttat tgactgatct ttgaggctac tgttgtgaga tttctacatc cctctttatt 1560ataaaagatc ccaaatgcgg ctttacttgg aaaggaagca atttgacagt gatgaggaat 1620gatgtgcaga atggagattc agaaccctaa cagactctgg tattgatatc tagtgctcat 1680atttctggga gtctgctagg gttatgggag tttgcattta aattgtaggt tgttgcagaa 1740aacagaattt atatgtggaa aattgtaacg aatccactaa aaaactatta gaactaataa 1800tcaagtttgg caaggttgta agacataagt cagtatacaa aaatcaactg tatttctata 1860catttgtgac aatctgaaaa tgaaattagg aaaacaaatc catttacgat agcaacaaga 1920agtataaaat acttaggaag aagtataaca aaagatgtgc acaatttata ttctgaaaac 1980tacaaatagt gtttaaagaa attaaagaat attaaaataa atggaaaaat atcccatgtt 2040catggactgg aagaattact cttaagatgt caatactcct caaattgatc tacatatttg 2100atacaatcct tgtaagaacc cgaactgact tctttgtaga aattgacaaa ttgattctaa 2160gattcataca ggattgccat agatccagaa tagccacatc aattttaaaa aaagaagaaa 2220gtacaaagac tcacattacc tgatttaaaa acataccata aagcaatgtt aggacagtgt 2280ggtattgaca taaggataga cacatagatc aatgaaaagg aaagggagcc cagaagtaaa 2340accacatcaa ctgattttca acaaagatgc caagaccatt caattgagga aagaatagtc 2400ccttcaacaa atggtgctgc aaccagacag tcatatgcaa aagaatgaaa tttaaccttt 2460acaaaattta accatatata aaaattaatt caaatggatc aaagacatat aagggctgaa 2520actataaaat tgttaagaga acataggaat aaatattcat gaccttggat ttggcagtgg 2580attcttagct ataacatcaa agcacaagta agaaaagaga gataaattgg atttcatgaa 2640aattaaaaac ctgtgcttca aagacactat caagaaagtg acaaggcaac ccacagaatg 2700ggaaaaactg cagattatct gataagggac ttctatctag aatatataaa aatctctcac 2760aactcagaaa taagacaatc cagttaaaat aagggtaaag gagccgggca tggtggctca 2820cgcctgtaat cccagagctt tgggaggtgg aggtgggcag atcacctgag gtcaggagtt 2880cacgaccagc ctggccaaca tggtaaaacc ccatctctac taaaaataca aaaattagcc 2940gggtgtggtg gtgcatgcct gtaatcccag ctacttggga ggctgaggca gaagaatcac 3000ttgaacctgg gaggtggagg ttgcagtgag ccgagatcgc gccactgcac tccagcctgg 3060gcgacagagc gagaatctgt ctcgaaaaaa aaaaaaaaaa aa 3102<210>17<211>3102<212>DNA<213>智人(Homo sapiens)<220><221>CDS<222>(590)..(886)<400>17gggcagaggt tgcagtaacc caagatcatg ccaccatact acagactgtg tgacagagcg 60agactctgtc tcaaaacaac aacaaaaaaa caaactcacc attgtacctg tgcttatgca120aggtttagta ggaacgtaaa ttggtttaac ctttgtggac agaagtttta aaaatatata180ttaaaattaa aagtatgctc tgaaggagga actccacttc tggtaattta tctcaagaga240ataactgggc cagcacaaag gctgctgttt aacaatgtgt aatgatgcag tgacagctac300aattgcaaaa ataacctaga cattcaccaa tgaggactgg ttaaatgaac tagtataacc360atactgcaga atatcataaa gataacaaaa aaatgatatg gatctgtttc ttggcataaa420tatatccata agttttaaga agagatgcta tatatacggt ggtcccattg atgtataact480gttaggacta aaaatagtac cttcctcata atgatgtttt gaggaattaa tgagtttatt540catgcaaaat gcttagaatg gtacctggca cacagacaat gtttaagaa atg ttt gtt598Met Phe Val1att gtt att act atg tct ctg tat ata tgc ata gga aaa atc tgg aag 646Ile Val Ile Thr Met Ser Leu Tyr Ile Cys Ile Gly Lys Ile Trp Lys5 10 15gat aaa ata aaa aat gaa tat ttt tgg gtg gtg aga cta aat ttt tgt 694Asp Lys Ile Lys Asn Glu Tyr Phe Trp Val Val Arg Leu Asn Phe Cys20 25 30 35cta att tta tgg ata agt ttt atg att tat att tat aat aaa aat aaa 742Leu Ile Leu Trp Ile Ser Phe Met Ile Tyr Ile Tyr Asn Lys Asn Lys40 45 50gct ata aaa att aat tat gat gtt tct tgc tca tgt cag cta ctt cac 790Ala Ile Lys Ile Asn Tyr Asp Val Ser Cys Ser Cys Gln Leu Leu His55 60 65tac ata ctg agt tcc cat ccc cat ttg tta cag gag caa ctc ctg gtt 838Tyr Ile Leu Ser Ser His Pro His Leu Leu Gln Glu Gln Leu Leu Val70 75 80aag tac ctt ttt tgt aac tgt gaa att ccc ttg aca ttc atc ata tac 886Lys Tyr Leu Phe Cys Asn Cys Glu Ile Pro Leu Thr Phe Ile Ile Tyr85 90 95tgatgacttt tcctaataca tggaaacaaa caggattgtg atttttctct cattttgtac946actaagttct atgccagccg atttcagaga gacactctgc aaagttccta tgaaaagtct 1006tcaaaaatgt attaccttgc tgtttaatac caataccaaa attcaaatgg acttatcaat 1066taaactcacc tcaaacacag taatgcactc acagttatga gcagtgctca ctactgccaa 1126tcatttctgc ttccagaatg gttaaaggag ccacaaactc tgcccttatc agaagcagta 1186gcctgataac aggtaagaat aggaatgttc cgtttctccc caaattaaga gtggtatcaa 1246taatctgact tttccaggca tttatctcac agaaatgttt atgagacatg ctaagatcaa 1306catggtaata tctgactatt gtttttatta gaaataaggg ggccagccag gcacagtagc 1366ttacacctgt aatcccagca cctggggagg ttgaggtggg aggattgctt gagcccagga 1426gtttgagaca agcctgggca acacagggag acaccagctc tattaaaaaa aaaaaaagta 1486agggggctat aatgtaaccc ttattgactg atctttgagg ctactgttgt gagatttcta 1546catccctctt tattataaaa gatcccaaat gcggctttac ttggaaagga agcaatttga 1606cagtgatgag gaatgatgtg cagaatggag attcagaacc ctaacagact ctggtattga 1666tatctagtgc tcatatttct gggagtctgc tagggttatg ggagtttgca tttaaattgt 1726aggttgttgc agaaaacaga atttatatgt ggaaaattgt aacgaatcca ctaaaaaact 1786attagaacta ataatcaagt ttggcaaggt tgtaagacat aagtcagtat acaaaaatca 1846actgtatttc tatacatttg tgacaatctg aaaatgaaat taggaaaaca aatccattta 1906cgatagcaac aagaagtata aaatacttag gaagaagtat aacaaaagat gtgcacaatt 1966tatattctga aaactacaaa tagtgtttaa agaaattaaa gaatattaaa ataaatggaa 2026aaatatccca tgttcatgga ctggaagaat tactcttaag atgtcaatac tcctcaaatt 2086gatctacata tttgatacaa tccttgtaag aacccgaact gacttctttg tagaaattga 2146caaattgatt ctaagattca tacaggattg ccatagatcc agaatagcca catcaatttt 2206aaaaaaagaa gaaagtacaa agactcacat tacctgattt aaaaacatac cataaagcaa 2266tgttaggaca gtgtggtatt gacataagga tagacacata gatcaatgaa aaggaaaggg 2326agcccagaag taaaaccaca tcaactgatt ttcaacaaag atgccaagac cattcaattg 2386aggaaagaat agtcccttca acaaatggtg ctgcaaccag acagtcatat gcaaaagaat 2446gaaatttaac ctttacaaaa tttaaccata tataaaaatt aattcaaatg gatcaaagac 2506atataagggc tgaaactata aaattgttaa gagaacatag gaataaatat tcatgacctt 2566ggatttggca gtggattctt agctataaca tcaaagcaca agtaagaaaa gagagataaa 2626ttggatttca tgaaaattaa aaacctgtgc ttcaaagaca ctatcaagaa agtgacaagg 2686caacccacag aatgggaaaa actgcagatt atctgataag ggacttctat ctagaatata 2746taaaaatctc tcacaactca gaaataagac aatccagtta aaataagggt aaaggagccg 2806ggcatggtgg ctcacgcctg taatcccaga gctttgggag gtggaggtgg gcagatcacc 2866tgaggtcagg agttcacgac cagcctggcc aacatggtaa aaccccatct ctactaaaaa 2926tacaaaaatt agccgggtgt ggtggtgcat gcctgtaatc ccagctactt gggaggctga 2986ggcagaagaa tcacttgaac ctgggaggtg gaggttgcag tgagccgaga tcgcgccact 3046gcactccagc ctgggcgaca gagcgagaat ctgtctcgaa aaaaaaaaaa aaaaaa 3102<210>18<211>99<212>PRT<213>智人(Homo sapiens)<400>18Met Phe Val Ile Val Ile Thr Met Ser Leu Tyr Ile Cys Ile Gly Lys1 5 10 15Ile Trp Lys Asp Lys Ile Lys Asn Glu Tyr Phe Trp Val Val Arg Leu20 25 30Asn Phe Cys Leu Ile Leu Trp Ile Ser Phe Met Ile Tyr Ile Tyr Asn35 40 45Lys Asn Lys Ala Ile Lys Ile Asn Tyr Asp Val Ser Cys Ser Cys Gln50 55 60Leu Leu His Tyr Ile Leu Ser Ser His Pro His Leu Leu Gln Glu Gln65 70 75 80Leu Leu Val Lys Tyr Leu Phe Cys Asn Cys Glu Ile Pro Leu Thr Phe85 90 95Ile Ile Tyr<210>19<211>2455<212>DNA<213>智人(Homo sapiens)<400>19gttctaggta gtagaaagca aagggtgcta tgaagagcgt gtacacagac tcccaactgt 60tttgggagtt aaggaaggtt tcttggagga agtggcattc aagctataag acctgatgat120caggtggagt tagctggaga gcagggacag agagaatagc ctgtgcaaaa ggcctattct180tcaggagaga atgacacatg aatgggactg aagaagtaaa ctggtatctc atatgaagga240ccttttatat cttgttaagg attttgaact tcctcctttt tttttttttg agacagagtt300tctctctgtc acccaggctg aagtgcattg gcgtgatctc ggctcatggc agcctccacc360taccaggttc aagctattct cctgcctcag cttcccagat agctgggatt acagtcatgt420gccaccacgc cgggctaatt tttgtatttt tagtagagac agggtttcac cgtgttggcc480aggctggtct cgatttcctg acctcaagtg atctgcctgc cttggcctgc cccagtgccg540gaattacagg agtgagcccc cgcgcctggc ctggacttct gcttaaaggc aataaggaag600cctttactag atttaaaata ggagctcagt ttaaattagt aaggatttgt atttcatcaa660gagctctctt tggccctagt ctggtagaag atgagtcgaa gtagagagac tagttacaaa720gctgttccca ataatccagg tgaaaaatag tggtgaccct agattaaggt agtattggtg780tgggtaggga gaagtggaca gtcatatttg agaggtacct agggaataga attgcaaaga840cctgggagta gattggatat tcagtgggag gaagggagag aagtaatctc tcaagtgttg900ctcaagccat aaccttggat ggtactgtcc actgatacag taggaggaaa atgtttgagg960gaaagtagtg atgaatttgt ggtgcactaa catggccaac actaaatatt agaaagatta 1020atgtggtcat gtagaagatg aatgaaaaga agatacctca gaagtggaga gatagttaaa 1080tggcttttgt aggaatctca gctagaagtg tcagtattct taagtgcaga actaacaggt 1140gtgggaaagt aatgggaagt agacaccaaa caaatagttc cccaaagatg gtatcaaata 1200tcccagtgac agcttgcagc ctgctcagct ttatgatatg cccctgagat catttttcag 1260gacaaaaagt agtgaaacta cctttattta cttctcaaat ttacctttat ttacttctca 1320aatatacata gaaagtaata ttgtaaaaag cagctctggc tgggcgctgt ggctcaagcc 1380tatagtccca gcactttggg aggctgaggg gggcagatga cttgaggtca agagttcaag 1440accatcctgg ccaacatggc aaaaccgcat ttctactaaa aatacaaaaa ttcgcgtggc 1500agcacgtgcc tgtaacccca gctactctgg aggctgaggt acaagagtcg cttgaatttg 1560ggaggtggag actgcagcga gccgagatcc taccactgca ctccagcttg ggggacagtg 1620cgagactctg tcttaaaaaa cagtggcctg gcgcactggc tcacgcttgt aatcccagca 1680ctttgggagg ccgaggtggg cgggggtgga tcattgaggt caggagatca agaccatccc 1740ggccaacgtg gtgcaacccc gtctctacta caaatacaaa aattagctgg acatggtggt 1800gtacgcctgt agtcccagct actcgggaga gtaagacggg aatcgcttga acctgggagg 1860tgggaggttg cagtgagcca agattgtgcc actgcactcc agcctggcga cagagcaaga 1920ctgtcttaaa aaaaaaaaaa aaaaaaaaag gatattttca ctcttgggac ttgataaagc 1980tagtttattt tgattatctc ctatatccta tacatattta attggcccct atgaacaatg 2040ttacctcttt atgaggggac ccaaagaagt agctgctggt gtgagagtga gagatcatcc 2100atctttttta ttgtgctttt tgttgtttct ttgtcctgct atgtgttata agtaaggccg 2160ggcacggtgg ctcatgcctg taatcccagc acttagggag gccaaggcca gatccctgag 2220gtcaagagtt tgagaccagc ctagccaaca tggtgaaacc ttgtctttac tgaaaataca 2280aaaaaattag ctgggcaggg tggcatgcgc ctgtagtccc agctactcgc agaggctgag 2340gcaggagaat tgcttgaacc tgggaggcgg aggttgcggt gagccaagat cctgccactg 2400cactccagcc tgggcaacag agggagactc catctcaaaa aaaaaaaaaa aaaaa2455<210>20<211>2455<212>DNA<213>智人(Homo sapiens)<220><221>CDS<222>(417)..(701)<400>20gttctaggta gtagaaagca aagggtgcta tgaagagcgt gtacacagac tcccaactgt 60tttgggagtt aaggaaggtt tcttggagga agtggcattc aagctataag acctgatgat120caggtggagt tagctggaga gcagggacag agagaatagc ctgtgcaaaa ggcctattct180tcaggagaga atgacacatg aatgggactg aagaagtaaa ctggtatctc atatgaagga240ccttttatat cttgttaagg attttgaact tcctcctttt tttttttttg agacagagtt300tctctctgtc acccaggctg aagtgcattg gcgtgatctc ggctcatggc agcctccacc360taccaggttc aagctattct cctgcctcag cttcccagat agctgggatt acagtc atg419Met1tgc cac cac gcc ggg cta att ttt gta ttt tta gta gag aca ggg ttt 467Cys His His Ala Gly Leu Ile Phe Val Phe Leu Val Glu Thr Gly Phe5 10 15cac cgt gtt ggc cag gct ggt ctc gat ttc ctg acc tca agt gat ctg 515His Arg Val Gly Gln Ala Gly Leu Asp Phe Leu Thr Ser Ser Asp Leu20 25 30cct gcc ttg gcc tgc ccc agt gcc gga att aca gga gtg agc ccc cgc 563Pro Ala Leu Ala Cys Pro Ser Ala Gly Ile Thr Gly Val Ser Pro Arg35 40 45gcc tgg cct gga ctt ctg ctt aaa ggc aat aag gaa gcc ttt act aga 611Ala Trp Pro Gly Leu Leu Leu Lys Gly Asn Lys Glu Ala Phe Thr Arg50 55 60 65ttt aaa ata gga gct cag ttt aaa tta gta agg att tgt att tca tca 659Phe Lys Ile Gly Ala Gln Phe Lys Leu Val Arg Ile Cys Ile Ser Ser70 75 80aga gct ctc ttt ggc cct agt ctg gta gaa gat gag tcg aag 701Arg Ala Leu Phe Gly Pro Ser Leu Val Glu Asp Glu Ser Lys85 90 95tagagagact agttacaaag ctgttcccaa taatccaggt gaaaaatagt ggtgacccta761gattaaggta gtattggtgt gggtagggag aagtggacag tcatatttga gaggtaccta821gggaatagaa ttgcaaagac ctgggagtag attggatatt cagtgggagg aagggagaga881agtaatctct caagtgttgc tcaagccata accttggatg gtactgtcca ctgatacagt941aggaggaaaa tgtttgaggg aaagtagtga tgaatttgtg gtgcactaac atggccaaca 1001ctaaatatta gaaagattaa tgtggtcatg tagaagatga atgaaaagaa gatacctcag 1061aagtggagag atagttaaat ggcttttgta ggaatctcag ctagaagtgt cagtattctt 1121aagtgcagaa ctaacaggtg tgggaaagta atgggaagta gacaccaaac aaatagttcc 1181ccaaagatgg tatcaaatat cccagtgaca gcttgcagcc tgctcagctt tatgatatgc 1241ccctgagatc atttttcagg acaaaaagta gtgaaactac ctttatttac ttctcaaatt 1301tacctttatt tacttctcaa atatacatag aaagtaatat tgtaaaaagc agctctggct 1361gggcgctgtg gctcaagcct atagtcccag cactttggga ggctgagggg ggcagatgac 1421ttgaggtcaa gagttcaaga ccatcctggc caacatggca aaaccgcatt tctactaaaa 1481atacaaaaat tcgcgtggca gcacgtgcct gtaaccccag ctactctgga ggctgaggta 1541caagagtcgc ttgaatttgg gaggtggaga ctgcagcgag ccgagatcct accactgcac 1601tccagcttgg gggacagtgc gagactctgt cttaaaaaac agtggcctgg cgcactggct 1661cacgcttgta atcccagcac tttgggaggc cgaggtgggc gggggtggat cattgaggtc 1721aggagatcaa gaccatcccg gccaacgtgg tgcaaccccg tctctactac aaatacaaaa 1781attagctgga catggtggtg tacgcctgta gtcccagcta ctcgggagag taagacggga 1841atcgcttgaa cctgggaggt gggaggttgc agtgagccaa gattgtgcca ctgcactcca 1901gcctggcgac agagcaagac tgtcttaaaa aaaaaaaaaa aaaaaaaagg atattttcac 1961tcttgggact tgataaagct agtttatttt gattatctcc tatatcctat acatatttaa 2021ttggccccta tgaacaatgt tacctcttta tgaggggacc caaagaagta gctgctggtg 2081tgagagtgag agatcatcca tcttttttat tgtgcttttt gttgtttctt tgtcctgcta 2141tgtgttataa gtaaggccgg gcacggtggc tcatgcctgt aatcccagca cttagggagg 2201ccaaggccag atccctgagg tcaagagttt gagaccagcc tagccaacat ggtgaaacct 2261tgtctttact gaaaatacaa aaaaattagc tgggcagggt ggcatgcgcc tgtagtccca 2321gctactcgca gaggctgagg caggagaatt gcttgaacct gggaggcgga ggttgcggtg 2381agccaagatc ctgccactgc actccagcct gggcaacaga gggagactcc atctcaaaaa 2441aaaaaaaaaa aaaa 2455<210>21<211>95<212>PRT<213>智人(Homo sapiens)<400>21Met Cys His His Ala Gly Leu Ile Phe Val Phe Leu Val Glu Thr Gly1 5 10 15Phe His Arg Val Gly Gln Ala Gly Leu Asp Phe Leu Thr Ser Ser Asp20 25 30Leu Pro Ala Leu Ala Cys Pro Ser Ala Gly Ile Thr Gly Val Ser Pro35 40 45Arg Ala Trp Pro Gly Leu Leu Leu Lys Gly Asn Lys Glu Ala Phe Thr50 55 60Arg Phe Lys Ile Gly Ala Gln Phe Lys Leu Val Arg Ile Cys Ile Ser65 70 75 80Ser Arg Ala Leu Phe Gly Pro Ser Leu Val Glu Asp Glu Ser Lys85 90 95<210>22<211>2572<212>DNA<213>智人(Homo sapiens)<400>22gcggggtttc actatgttgg ccaggctggt ctagaactcc tgacctcaag tgatctgccc 60gcctcggcct cccaaagtgc tgggattgca ggcgtgagac actgcacccg gacaattttc120cttttcttac aagaacactg ctcacactgc attcagggcc aaccctaacc cagtatcgcc180tcatcctggt ttgattatat cggcacagac cttgcttccg agcgaggcca ctttctcagg240tactggtgga catgagtctt cggagacgct gctcaaccca cagtgctcct ccagcttggt300ttctgtgact tgccttcccc agaggagggg tgccctgaga ggtctccact ccctgaccgg360ctccttggtg ccgcgcactc tgagaggctt cccagggaac agagcacaca ggaccgccct420cctgggtaga ccaatcagca tctgagctca caatttccca gcagggcagt ggggtggaga480gagaagcctg ggctgggctg ggctgggctg ggctggggaa gcttctccgg gcggggggac540gtcagagcag gatctggggc tgataaaagc ccgcccctgg gtgggggctg agtggtgcgg600aagctgagcc cgacacgtgg ggatggagga caggctgtgg gagggtgtga accggatact660gcttgaaggg gtgctgggga ctttgagaga gggcggctgg ccctgtctgg tcggggatgc720tggcccagac acaggccatg gctgggatgg ggttcagaaa caggaccgct gtctctcccg780ggccagggcc ctccccagct gctcctggct ttctggttct tggggtcagg ggcaggcctg840tgccatgacc ccgccactga ggctgtgagg aggctgtcgg tgcccaaggg caccaaggca900cacccctact cttgcacccc atgtgtgggc ccgagcacct gctctgctgc cccaaagatc960tggcgatgtt tcccaggcaa ctgtctctca cagcctgtct gcctggcact cccgtatccc 1020ataaatgcca ccacatctgg ctatgggtgg gcgtgcctgc ctggcatcca cgggccagca 1080ggtgtggtgg agcacagccc agttcctggc tgcgtcagaa ggctgcccgg gccttttggc 1140tgtccttgcc agcaggtgag cactgccagg gcaccgtgtg tgggtgctgg gccatttagc 1200cacatgggaa ggggtggagg cagcccagtg ccttcagcat gtgcccaggg tgcctgtcgg 1260ccacaggtct catttggaaa ttgggagggt gcacggccac cgggctgctt aggcctgcca 1320gcctcagggc ccgtcaccgc tgtcttagcc tgatttgcag ggtgtcaacg ctgggcagag 1380atgaacattt gggtgactct gaggatgcca gtggctggga cacttgttct tccgcggtgg 1440aaggagttgg agaggcctgg ctccctgacc tacggccagc ctggcttctg aaaccagctc 1500agtgggctgg ggcctgattc atcatccata aatgtgtcct tttttgccac agagggtaag 1560gggcctccta gcccaccggt ctgcaggtgc gggagtagga gatgggtggc tctgatgccc 1620ccacccactc gatcaccttc tgctctgcct gggatgcaaa ctcccacagc tgaaacgttc 1680ttttgtaaac atgaattttg gcttagaaaa aactcatttc cactgtgcac gtgtcagtcc 1740caaccagaaa ttattttcca ataaagcaaa actccgtcac cacagcagca gatggctccg 1800aagaagtgga gcgttttcat caggttcaac tttgaaacct ccaccatcac catcaccagc 1860accgctgtgt catgctgata acttgaggac aggcaggaca aggccttctg gcggccgccc 1920ctggtttctc ctggggggtg atgagcggga gcggctctgg gccgagctac tgcgcacggt 1980gagcccggag ctgatcctgg atcacgaggt gccttcactg cccgccttcc caggacagga 2040gcccaggtgc ggcccggagc ccactgaagt cttcactgtc ggacccaaga ccttttcctg 2100gacacccttt ccgccggacc tgtggggccc gggccgttcc taccggctgc ttcacggggc 2160aggagggcac ctggaatccc ccgccaggtc cctgccccag cgcccggcac ctgatccctg 2220cagggccccc agggtggagc agcaaccgtc tgtggagggt gccgcggccc tgcgcaactg 2280ccccatgtgc cagaaggagt ttgcccccag gctgacccag ctggatgttg acagccacct 2340ggcccagtgc ttggccgaaa gcacaaaaaa cgtgacgtgg tgagcgccat ccaagagccc 2400tgcgcagagt gcagcgcccg gacacgcttt cccccgccag cagccccgcc tctcggctcc 2460cccgccaaca gccccgcctt tcggctcccc cgcatgggca ttaaaacagg gcgggctcct 2520gtctgtctct gtgttgtgat gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 2572<210>23<211>2572<212>DNA<213>智人(Homo sapiens)<220><221>CDS<222>(1691)..(2380)<400>23gcggggtttc actatgttgg ccaggctggt ctagaactcc tgacctcaag tgatctgccc 60gcctcggcct cccaaagtgc tgggattgca ggcgtgagac actgcacccg gacaattttc120cttttcttac aagaacactg ctcacactgc attcagggcc aaccctaacc cagtatcgcc180tcatcctggt ttgattatat cggcacagac cttgcttccg agcgaggcca ctttctcagg240tactggtgga catgagtctt cggagacgct gctcaaccca cagtgctcct ccagcttggt300ttctgtgact tgccttcccc agaggagggg tgccctgaga ggtctccact ccctgaccgg360ctccttggtg ccgcgcactc tgagaggctt cccagggaac agagcacaca ggaccgccct420cctgggtaga ccaatcagca tctgagctca caatttccca gcagggcagt ggggtggaga480gagaagcctg ggctgggctg ggctgggctg ggctggggaa gcttctccgg gcggggggac540gtcagagcag gatctggggc tgataaaagc ccgcccctgg gtgggggctg agtggtgcgg600aagctgagcc cgacacgtgg ggatggagga caggctgtgg gagggtgtga accggatact660gcttgaaggg gtgctgggga ctttgagaga gggcggctgg ccctgtctgg tcggggatgc720tggcccagac acaggccatg gctgggatgg ggttcagaaa caggaccgct gtctctcccg780ggccagggcc ctccccagct gctcctggct ttctggttct tggggtcagg ggcaggcctg840tgccatgacc ccgccactga ggctgtgagg aggctgtcgg tgcccaaggg caccaaggca900cacccctact cttgcacccc atgtgtgggc ccgagcacct gctctgctgc cccaaagatc960tggcgatgtt tcccaggcaa ctgtctctca cagcctgtct gcctggcact cccgtatccc 1020ataaatgcca ccacatctgg ctatgggtgg gcgtgcctgc ctggcatcca cgggccagca 1080ggtgtggtgg agcacagccc agttcctggc tgcgtcagaa ggctgcccgg gccttttggc 1140tgtccttgcc agcaggtgag cactgccagg gcaccgtgtg tgggtgctgg gccatttagc 1200cacatgggaa ggggtggagg cagcccagtg ccttcagcat gtgcccaggg tgcctgtcgg 1260ccacaggtct catttggaaa ttgggagggt gcacggccac cgggctgctt aggcctgcca 1320gcctcagggc ccgtcaccgc tgtcttagcc tgatttgcag ggtgtcaacg ctgggcagag 1380atgaacattt gggtgactct gaggatgcca gtggctggga cacttgttct tccgcggtgg 1440aaggagttgg agaggcctgg ctccctgacc tacggccagc ctggcttctg aaaccagctc 1500agtgggctgg ggcctgattc atcatccata aatgtgtcct tttttgccac agagggtaag 1560gggcctccta gcccaccggt ctgcaggtgc gggagtagga gatgggtggc tctgatgccc 1620ccacccactc gatcaccttc tgctctgcct gggatgcaaa ctcccacagc tgaaacgttc 1680ttttgtaaac atg aat ttt ggc tta gaa aaa act cat ttc cac tgt gca 1729Met Asn Phe Gly Leu Glu Lys Thr His Phe His Cys Ala1 5 10cgt gtc agt ccc aac cag aaa tta ttt tcc aat aaa gca aaa ctc cgt 1777Arg Val Ser Pro Asn Gln Lys Leu Phe Ser Asn Lys Ala Lys Leu Arg15 20 25cac cac agc agc aga tgg ctc cga aga agt gga gcg ttt tca tca ggt 1825His His Ser Ser Arg Trp Leu Arg Arg Ser Gly Ala Phe Ser Ser Gly30 35 40 45tca act ttg aaa cct cca cca tca cca tca cca gca ccg ctg tgt cat 1873Ser Thr Leu Lys Pro Pro Pro Ser Pro Ser Pro Ala Pro Leu Cys His50 55 60gct gat aac ttg agg aca ggc agg aca agg cct tct ggc ggc cgc ccc 1921Ala Asp Asn Leu Arg Thr Gly Arg Thr Arg Pro Ser Gly Gly Arg Pro65 70 75tgg ttt ctc ctg ggg ggt gat gag cgg gag cgg ctc tgg gcc gag cta 1969Trp Phe Leu Leu Gly Gly Asp Glu Arg Glu Arg Leu Trp Ala Glu Leu80 85 90ctg cgc acg gtg agc ccg gag ctg atc ctg gat cac gag gtg cct tca 2017Leu Arg Thr Val Ser Pro Glu Leu Ile Leu Asp His Glu Val Pro Ser95 100 105ctg ccc gcc ttc cca gga cag gag ccc agg tgc ggc ccg gag ccc act 2065Leu Pro Ala Phe Pro Gly Gln Glu Pro Arg Cys Gly Pro Glu Pro Thr110 115 120 125gaa gtc ttc act gtc gga ccc aag acc ttt tcc tgg aca ccc ttt ccg 2113Glu Val Phe Thr Val Gly Pro Lys Thr Phe Ser Trp Thr Pro Phe Pro130 135 140ccg gac ctg tgg ggc ccg ggc cgt tcc tac cgg ctg ctt cac ggg gca 2161Pro Asp Leu Trp Gly Pro Gly Arg Ser Tyr Arg Leu Leu His Gly Ala145150 155gga ggg cac ctg gaa tcc ccc gcc agg tcc ctg ccc cag cgc ccg gca 2209Gly Gly His Leu Glu Ser Pro Ala Arg Ser Leu Pro Gln Arg Pro Ala160 165 170cct gat ccc tgc agg gcc ccc agg gtg gag cag caa ccg tct gtg gag 2257Pro Asp Pro Cys Arg Ala Pro Arg Val Glu Gln Gln Pro Ser Val Glu175 180 185ggt gcc gcg gcc ctg cgc aac tgc ccc atg tgc cag aag gag ttt gcc 2305Gly Ala Ala Ala Leu Arg Asn Cys Pro Met Cys Gln Lys Glu Phe Ala190195 200 205ccc agg ctg acc cag ctg gat gtt gac agc cac ctg gcc cag tgc ttg 2353Pro Arg Leu Thr Gln Leu Asp Val Asp Ser His Leu Ala Gln Cys Leu210 215 220gcc gaa agc aca aaa aac gtg acg tgg tgagcgccat ccaagagccc 2400Ala Glu Ser Thr Lys Asn Val Thr Trp225 230tgcgcagagt gcagcgcccg gacacgcttt cccccgccag cagccccgcc tctcggctcc 2460cccgccaaca gccccgcctt tcggctcccc cgcatgggca ttaaaacagg gcgggctcct 2520gtctgtctct gtgttgtgat gaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aa 2572<210>24<211>230<212>PRT<213>智人(Homo sapiens)<400>24Met Asn Phe Gly Leu Glu Lys Thr His Phe His Cys Ala Arg Val Ser1 5 10 15Pro Asn Gln Lys Leu Phe Ser Asn Lys Ala Lys Leu Arg His His Ser20 25 30Ser Arg Trp Leu Arg Arg Ser Gly Ala Phe Ser Ser Gly Ser Thr Leu35 40 45Lys Pro Pro Pro Ser Pro Ser Pro Ala Pro Leu Cys His Ala Asp Asn50 55 60Leu Arg Thr Gly Arg Thr Arg Pro Ser Gly Gly Arg Pro Trp Phe Leu65 70 75 80Leu Gly Gly Asp Glu Arg Glu Arg Leu Trp Ala Glu Leu Leu Arg Thr85 90 95Val Ser Pro Glu Leu Ile Leu Asp His Glu Val Pro Ser Leu Pro Ala100 105 110Phe Pro Gly Gln Glu Pro Arg Cys Gly Pro Glu Pro Thr Glu Val Phe115 120 125Thr Val Gly Pro Lys Thr Phe Ser Trp Thr Pro Phe Pro Pro Asp Leu130 135 140Trp Gly Pro Gly Arg Ser Tyr Arg Leu Leu His Gly Ala Gly Gly His145 150 155 160Leu Glu Ser Pro Ala Arg Ser Leu Pro Gln Arg Pro Ala Pro Asp Pro165 170 175Cys Arg Ala Pro Arg Val Glu Gln Gln Pro Ser Val Glu Gly Ala Ala180 185 190Ala Leu Arg Asn Cys Pro Met Cys Gln Lys Glu Phe Ala Pro Arg Leu195 200 205Thr Gln Leu Asp Val Asp Ser His Leu Ala Gln Cys Leu Ala Glu Ser210 215 220Thr Lys Asn Val Thr Trp225 230
权利要求
1.一种分离的具有抑癌功能的人蛋白,其特征在于,它包含具有选自下组的氨基酸序列的多肽SEQ ID NO3、6、9、12、15、18、21、24;或其保守性变异多肽、或其活性片段、或其活性衍生物。
2.如权利要求1所述的多肽,其特征在于,该多肽是具有选自下组的氨基酸序列的多肽SEQ ID NO3、6、9、12、15、18、21、24。
3.一种分离的多核苷酸,其特征在于,它包含一核苷酸序列,该核苷酸序列与选自下组的一种核苷酸序列有至少85%相同性(a)编码如权利要求1和2所述多肽的多核苷酸;(b)与多核苷酸(a)互补的多核苷酸。
4.如权利要求3所述的多核苷酸,其特征在于,该多核苷酸编码的多肽具有选自下组的氨基酸序列SEQ ID NO3、6、9、12、15、18、21、24。
5.如权利要求3所述的多核苷酸,其特征在于,该多核苷酸的序列选自下组SEQ ID NO2、5、8、11、14、17、20、23的编码区序列或全长序列。
6.一种载体,其特征在于,它含有权利要求3所述的多核苷酸。
7.一种遗传工程化的宿主细胞,其特征在于,它是选自下组的一种宿主细胞(a)用权利要求6所述的载体转化或转导的宿主细胞;(b)用权利要求3所述的多核苷酸转化或转导的宿主细胞。
8.一种具有抑癌功能的人蛋白活性的多肽的制备方法,其特征在于,该方法包含(a)在适合表达具有抑癌功能的人蛋白的条件下,培养权利要求7所述的宿主细胞;(b)从培养物中分离出具有抑癌功能的人蛋白活性的多肽。
9.一种能与权利要求1所述的具有抑癌功能的人蛋白特异性结合的抗体。
10.一种药物组合物,其特征在于,它含有安全有效量的权利要求1所述的多肽以及药学上可接受的载体。
全文摘要
本发明公开了一类新的具有抑癌功能的人蛋白,编码此多肽的多核苷酸和经重组技术产生该多肽的方法。本发明还公开了此多肽用于治疗多种疾病如癌症等的方法。本发明还公开了抗此多肽的拮抗剂及其治疗作用。本发明还公开了编码这类新的具有抑癌功能的人蛋白的多核苷酸的用途。
文档编号C07K14/47GK1429839SQ01145279
公开日2003年7月16日 申请日期2001年12月30日 优先权日2001年12月30日
发明者顾健人, 杨胜利 申请人:上海新世界基因技术开发有限公司
网友询问留言 已有0条留言
  • 还没有人留言评论。精彩留言会获得点赞!
1