非内源的被组成型活化的人g蛋白偶联的受体的制作方法

文档序号:3551614阅读:487来源:国知局
专利名称:非内源的被组成型活化的人g蛋白偶联的受体的制作方法
特此声明,本发明要求下列在先申请的优先权美国专利申请号09/170,496(1998年10月13日提出申请)、美国专利申请号08/839,449(1997年4月14日提出申请,现在已放弃)、美国专利申请号09/060,188(1998年4月14日提出申请)、美国临时申请号60/090,783(1998年6月26日提出申请)和美国临时申请号60/095,677(1998年8月7日提出申请)。前述的每个申请都以全文引入本申请作参考。本发明的领域本专利申请文件所公开的发明涉及跨膜受体,具体地讲,涉及被改造的人G蛋白偶联受体(GPCR),它可被组成型活化。在最优选情况下,被改造的人GPCR可用于筛选药用化合物。本发明的背景尽管在人体内有很多种类的受体,但到目前为止最丰富和最与治疗有关的是G蛋白偶联受体(GPCR)。据估计,在人类基因组内有大约100,000个基因,它们中的大约2%即2,000个基因被估计用来编码GPCR。已识别出与其中的约100个GPCR结合的内源配体。由于在发现内源GPCR和发现它的内源配体之间有显著的时间延迟,因此可预测,其余的1900种GPCR将在它们的内源配体被识别之前的很长时间内被识别和鉴定。其实,人类基因组计划正在快速测序人类的100,000个基因,这表明在今后几年内,其余的人GPCR将被完全测序。然而,尽管付出了对人类基因组测序的努力,但仍不清楚科学家如何能够快速、有力和有效率地利用这样的信息来提高和增强人类的健康状态。本发明正是指向这个重要目的。
包括GPCR在内,其内源配体已被认识的受体被称为“已知”受体,内源配体尚不知晓的受体被称为“孤儿”受体。这种区别并不仅是语义上的,尤其是对GPCR来说。GPCR代表着药物产品开发的一个重要领域60%的处方药物开发自100个已知GPCR中的大约20个。因此,孤儿GPCR将是推进药物工业增长、扩张、增强和发展的机会,就像是金子对于19世纪晚期的加利福尼亚一样。然而,孤儿受体在涉及新药物发现时有一个严重缺陷。这是因为,发现和开发药物的传统途径既需要利用受体又需要利用它的内源配体。因此,迄今为止,孤儿GPCR带给本领域的只是用于发现新药的具有诱惑力且未开发的资源。
在探索潜在治疗药物的传统途径下,一般是受体先被识别。在探索药物的努力开始之前,一般会启动精细、费时和昂贵的程序以识别、分离和产生受体的内源配体---对于每个受体,此过程可花费3到10年,成本大约为每个受体500万美元。在探索药物的传统工作可开始之前,必须先消耗这些时间和资金。这是因为,传统的药物探索技术依赖于所谓的“竞争性结合检测”,其中,假定的治疗剂是通过受体被“筛选”的,其目的是找到这样的化合物,它或者能阻止内源配体与受体结合(拮抗剂)、或者能促进或模仿与受体结合的配体的作用(激活剂)。其总体目标是要识别出这样的化合物,它们在配体与受体结合时能阻止细胞活化(拮抗剂),或者当配体与受体适当地结合时能促进或增强细胞活性(激活剂)。由定义可知,孤儿GPCR的内源配体尚未被识别,因此不可能应用传统药物发现技术去发现针对这些受体的独特的新治疗药物。正如下面将揭示的,本发明能够克服传统药物发现技术所导致的这些和其他严格限制。
GPCR都具有一个相同的基元(motif)。所有这些受体具有七个由22到24个疏水氨基酸组成的序列,它们组成七个α螺旋,每个α螺旋都跨过膜(每个跨度都以数字表示,例如,跨膜-1(TM-1)、跨膜-2(TM-2)等)。跨膜螺旋通过氨基酸链连接,在细胞膜的外部即“细胞外”一边的氨基酸链分别在跨膜-2和跨膜-3、跨膜-4和跨膜-5、跨膜育一小时。接着加入麦胚凝集素小珠(25μl,Amersham),组合物在室温下再温育30分钟,然后试管在1500×g、室温下离心5分钟,并在闪烁计数器上记数。
另一个花费更少但同样可适用的方法已被识别,它也可满足大规模筛选的需要。Flash platesTM和WallacTM闪烁带可被用来格式化高处理量的[35S]GTPγS结合测定。进一步,利用此技术,本方法可用于已知的GPCR,它在通过[35S]GTPγS的结合监测化合物效应的同时,同时监测与受体结合的由氚标记的配体。这之所以可能的是因为Wallacβ计数器可以把能量窗口切换成监测氚和35S标记的探针。本方法也可用于侦查导致受体活化的其他类型的膜活化事件。例如,本方法可用于监测许多受体的32P磷酸化(针对G蛋白偶联受体和酪氨酸激酶受体)。当膜被离心到孔的底部时,结合的[35S]GTPγS或32P磷酸化的受体将要活化包被在孔上的闪烁剂。Scinti_带(Wallac)已被用来展示这个原理。另外,本方法也可通过应用放射标记的配体用来测量与受体结合的配体。以相似的方式,当放射标记的结合的配体被离心到孔底时,闪烁带标记在位于标记的配体附近,这导致活化并被检测到。
基于前述的程序,比较空白对照(pCMV)、内源APJ和非内源APJ的代表性结果被图示于图6。2.腺苷酸环化酶设计用来进行基于细胞的测定的Flash PlateTM腺苷酸环化酶试剂盒(New England Nuclear;目录号SMP004A)被改进以应用于未加工的胞浆膜。闪烁板的孔含有闪烁剂包被层,其中含有识别cAMP的特异抗体。在孔中产生的cAMP通过直接和放射性cAMP示踪物竞争与cAMP抗体结合而被定量。下面是对测量在表达受体的膜上cAMP水平变化程序的简短描述。
在转染后大约3天收获转染细胞。通过在含有20mM pH 7.4的育一小时。接着加入麦胚凝集素小珠(25μl,Amersham),组合物在室温下再温育30分钟,然后试管在1500×g、室温下离心5分钟,并在闪烁计数器上记数。
另一个花费更少但同样可适用的方法已被识别,它也可满足大规模筛选的需要。Flash platesTM和WallacTM闪烁带可被用来格式化高处理量的[35S]GTPγS结合测定。进一步,利用此技术,本方法可用于已知的GPCR,它在通过[35S]GTPγS的结合监测化合物效应的同时,同时监测与受体结合的由氚标记的配体。这之所以可能的是因为Wallacβ计数器可以把能量窗口切换成监测氚和35S标记的探针。本方法也可用于侦查导致受体活化的其他类型的膜活化事件。例如,本方法可用于监测许多受体的32P磷酸化(针对G蛋白偶联受体和酪氨酸激酶受体)。当膜被离心到孔的底部时,结合的[35S]GTPγS或32P磷酸化的受体将要活化包被在孔上的闪烁剂。Scinti_带(Wallac)已被用来展示这个原理。另外,本方法也可通过应用放射标记的配体用来测量与受体结合的配体。以相似的方式,当放射标记的结合的配体被离心到孔底时,闪烁带标记在位于标记的配体附近,这导致活化并被检测到。
基于前述的程序,比较空白对照(pCMV)、内源APJ和非内源APJ的代表性结果被图示于图6。2.腺苷酸环化酶设计用来进行基于细胞的测定的Flash PlateTM腺苷酸环化酶试剂盒(New England Nuclear;目录号SMP004A)被改进以应用于未加工的胞浆膜。闪烁板的孔含有闪烁剂包被层,其中含有识别cAMP的特异抗体。在孔中产生的cAMP通过直接和放射性cAMP示踪物竞争与cAMP抗体结合而被定量。下面是对测量在表达受体的膜上cAMP水平变化程序的简短描述。
在转染后大约3天收获转染细胞。通过在含有20mM pH 7.4的筛选化合物的传统“信条”需要受体的配体已知。由定义可知,这种方法对于孤儿受体来说是没有用的。因此,在坚持用这个教条的方法去发现药物时实质上在教导本领域去废弃孤儿受体的应用,除非和直到受体的内源配体被发现。考虑到大约有2000个G蛋白偶联受体且其中的大多数是孤儿受体,如此信条就会与具有创造性的唯一且独特的药物探索方法相对立。
关于不同种类GPCR的核酸和/或氨基酸序列的信息总结于下表A。因为本发明在此公开的一个重要焦点直接指向孤儿GPCR,所以很多下面所引述的参考都是涉及孤儿GPCR的。然而,本列表并非要在法律或在其他意义上进行以下暗示或解释,即在此公开的本发明仅可应用于GPCR或者下面所特别列举的GPCR。此外,一些被分离的受体本身并非是本申请的主题;例如,参考国际互联网上列举GPCR的G蛋白偶联受体数据库(署名的发明人和受让人都与这个站点没有任何关系)。其他的GPCR是由本发明受让人所有的发明申请的主题,它们并没有在下面列出(包括GPR3、GPR6和GPR12;参见美国临时申请号60/094879)表A
正如下面详细公开的那样,应用突变盒修饰人GPCR内源序列会导致人GPCR的组成型活化。这些非内源性的可组成型活化的人GPCR尤其可用于筛选候选化合物,以直接识别例如与药物相关的化合物。本发明的概述在此公开的是非内源的人G蛋白偶联受体,它包含作为(a)最优选的氨基酸序列区域(从C-末端到N-末端走向)和/或(b)最优选核酸序列区域(3’到5’走向)的横跨GPCR的跨膜-6(TM6)和细胞内环-3(IC3)区域(a)P1AA15X其中(1)P1是位于GPCR的TM6区域内的一个氨基酸残基,其中P1是从(i)内源的GPCR的脯氨酸残基和(ii)除脯氨酸之外的非内源的氨基酸残基中选择而来;(2)AA15是15个氨基酸,它们是从(a)内源的GPCR的氨基酸、(b)非内源的氨基酸残基和(c)内源的GPCR氨基酸和非内源氨基酸的组合中选择而来,除非位于GPCR的TM6区域内的15个氨基酸残基都不是脯氨酸;和(3)X是位于所述GPCR的IC3区域内的非内源氨基酸残基,优选是从赖氨酸、组氨酸和精氨酸中选择而来,最优选地是赖氨酸,但是如果在X位置的内源氨基酸是赖氨酸,那么此时X是非赖氨酸的氨基酸,优选是丙氨酸;和/或(b)P密码子(AA-密码子)15X密码子其中(1)P密码子是位于GPCR的TM6区域内的一个核苷酸序列,其中P密码子编码从(i)内源的GPCR的脯氨酸残基和(ii)除脯氨酸之外的非内源的氨基酸残基中选择出来的氨基酸;(2)(AA-密码子)15是编码15个氨基酸的15个密码子,这些氨基酸是从(a)内源的GPCR的氨基酸、(b)非内源的氨基酸残基和(c)内源GPCR的氨基酸和非内源氨基酸的组合中选择而来,除非在位于GPCR的TM6区域内的15个内源密码子都不编码脯氨酸;(3)X密码子是编码位于所述GPCR的IC3区域内的氨基酸残基的核苷酸,其中X密码子编码非内源氨基酸,它们优选是从由赖氨酸、组氨酸和精氨酸中选择而来,最优选地是赖氨酸,但是当在X密码子位置的内源编码区域编码赖氨酸,那么此时X密码子编码除赖氨酸以外的氨基酸,优选是丙氨酸。
针对这些序列盒所使用的内源和非内源术语是相对于内源GPCR而言。例如,一旦内源脯氨酸残基位于特定GPCR的TM6区域之内且从它算起的第16个氨基酸被识别以用于发生突变来组成型活化该受体,那么也有可能突变内源脯氨酸残基(即,一旦标记物被定位且将发生突变的第16个氨基酸被确定,就可能突变标记物本身),尽管最优选脯氨酸残基不被突变。相似地是,尽管最优选AA15保持它们的内源形式,但这些氨基酸也可被突变。在人GPCR的非内源形式中唯一必须被突变的氨基酸是X,即,由从P1开始的第16个残基构成的内源氨基酸不能保持其内源形式而必须被突变,这正如在此进一步公开的那样。重述一遍,尽管优选在人GPCR的非内源形式中,P1和AA15保持它们的内源形式(即,与它们的野生型形式一样),但一旦X被识别和被突变,任何和/或所有的P1和AA15都可以被突变。这对核苷酸序列也同样适用。那么,当在位置X的内源氨基酸是赖氨酸时,在此GPCR的非内源形式中的X是非赖氨酸的氨基酸,优选是内氨酸。
因此,作为假设的情形,如果内源GPCR在上述位置具有如下的内源氨基酸序列P-AACCTTGGRRRDDDE-Q那么下面的任何一个情形和假设中的序列盒都将落在公开的范围之内(非内源氨基酸以粗体字表示)P-AACCTTGGRRRDDDE-KP-AACCTTHIGPRDDDE-K
P-ADEETTGGPRRDDDE-AP-LLKFMSTWZLVAAPQ-KA-LLKFMSTWZLVAAPQ-K也可能在AA15内加入氨基酸残基,但此方法并没有特别的进展。其实,在最优选的实施方案中,在非内源的人GPCR与内源的GPCR中唯一的氨基酸差别就是在X位置的氨基酸;此氨基酸自身的突变导致该受体的组成型活化。
因此,在特别优选的实施方案中,P1和P密码子分别是内源脯氨酸和编码脯氨酸的内源核苷酸编码区;X和X密码子分别是非内源赖氨酸或丙氨酸和编码赖氨酸或丙氨酸的非内源核苷酸编码区,其中最优选是赖氨酸。因为最优选带有这些突变的非内源人GPCR包含在哺乳动物细胞内并用于筛选候选化合物,因此,尽管分离和纯化的非内源人GPCR是在本发明公开的范围之内,但带有突变的非内源的人GPCR本身并不需要纯化和分离(即它们被包含在哺乳动物细胞的细胞膜内)。涉及非内源人GPCR的基因-靶向和转基因非人哺乳动物(优选是大鼠和小鼠)也在本发明范围之内;特别是,基因靶向的哺乳动物是最优选的,这是因为可把人GPCR的非内源形式导入这些动物,以代替非人哺乳动物中的内源GPCR编码区(产生这样的非人哺乳动物的技术是周知的,利用人编码区替代这些非人哺乳动物的蛋白质编码区;例如,参见美国专利号5,777,194)。
已经发现内源人GPCR的这些变化可使GPCR组成型活化,以致于非内源的被组成型活化的人GPCR尤其可被用来直接筛选候选化合物,而不需要内源配体,这正如在此将要进一步揭示的那样。因此,应用这些材料的方法和用这些方法识别的产品也在如下的公开范围之内。图示的简短说明

图1表示与G蛋白偶联的受体的大体结构,标出的数字表示跨膜的螺旋、细胞内环和细胞外环。
图2示意典型的G蛋白偶联受体的活化和非活化两种状态,和活性状态与第二信使传导途径的偶联。
图3是优选的载体pCMV的序列图示,其中包括限制酶切位点的位置。
图4是进行如下比较所测得的信号图示pCMV、组成型活化的非内源GPR30对于GPR6介导的CRE-Luc报告基因活化的抑制作用、内源GPR30对于GPR6介导的CRE-Luc报告基因活化的抑制作用。
图5是进行如下比较所测得的信号图示pCMV、组成型活化的非内源GPR17对于GPR3介导的CRE-Luc报告基因活化抑制作用,内源GPR17对于GPR3介导的CRE-Luc报告基因活化的抑制作用。
图6是比较pCMV对照、内源APJ和非内源APJ所测得的信号图示。
图7提供了与其内源型进行比较的非内源人5-HT2A受体产生IP3的图解说明。
图8是GPR1(8A)、GPR30(8B)和APJ(8C)的点印迹结果。详细描述本科学文献涉及受体并采用一些术语来描述对受体具有不同作用的配体。为了清楚和前后一致,在本发明文献中将由始至终使用下列定义。在这些定义与这些词语的其他定义冲突时,选择下列定义激活剂 意味着激活细胞内反应的化合物,此时它们结合受体或促进GTP与膜结合。
在此应用的氨基酸缩写列于下表
部分激活剂 意味着这样的化合物,它们与受体结合时,激活细胞内反应或者促进GTP与膜结合的程度低于激活剂。
拮抗剂 意味着这样的化合物,它和激活剂在同一位点与受体竞争性地结合,但不激活由受体的活性形式引起的细胞内反应,并可因此抑制由激活剂或部分激活剂促进的细胞内反应。拮抗剂在没有激活剂或部分激活剂的情形下并不削弱基本细胞内反应。
候选化合物 意味着一个将经受筛选技术检验的分子(例如但不限于化学化合物)。优选的“候选化合物”并不包括对公众来说已知选自受体的反激活剂、激活剂或拮抗剂的化合物,它们以前已通过非直接的识别方法被确定(“非直接识别的化合物”);更优选不包括先前已经确定至少在一种哺乳动物中具有治疗效果的已被非直接识别的化合物;并且,最优选不包括先前已经确定的在人体中具有治疗用途的已被非直接识别的化合物。
密码子 意味着一组三个核苷酸(或者核苷酸的等价物),它们一般包括一个与磷酸基团偶联的核苷(腺苷(A)、鸟苷(G)、胞苷(C)、尿苷(U)和胸苷(T)),当被翻译时,它们编码一个氨基酸。
化合物效应 意味着一个化合物抑制或者刺激受体功能的能力的量度,它与受体结合亲和力相对。一个优选的测定化合物效应的方法是通过测量[35S]GTPγS的结合,这一点将在本发明的实施例部分中进一步公开。
被组成型活化的受体 意味着易受组成型受体活化的受体。与本发明在此公开的相一致,一个非内源的被组成型活化的人G蛋白偶联受体是突变后包括氨基酸盒P1AA15X的受体,这正如在下面详细描述的那样。
组成型受体活化 意味着不利用它的内源配体或其化学等价物与受体结合的方法而使在活性状态下的受体稳定。优选地是,被本发明的组成型受体活化的G蛋白偶联受体与GPCR的内源形式相比,对组成型活化所测得的信号的反应有至少10%的差异(增高或者降低,如具体情况可能的那样),更优选在如此比较的反应中有大约25%的差异,最优选在如此比较的反应中有大约50%的差异。当用于直接识别候选化合物的目的时,最优选信号差异至少为50%,以使在内源信号和非内源信号之间有足够的差异,从而在被选择的候选化合物之间产生区别。在最多的情形下,“差异”将是信号的增加;然而,正如下面详细将要叙述的那样,对Gs-偶联的GPCR来说,测量的“差异”优选是降低。
接触 意味着把至少两部分放在一起,无论是在体外系统还是在体内系统中。
直接识别或被直接识别,与术语“候选化合物”相联系,意味着筛选针对组成型活化的G蛋白偶联受体的候选化合物。本术语在任何情形下都不应被解释或被理解为被包括或包括术语“非直接地识别”或“非直接地被识别”。
内源 意味着由物种的基因组天然产生的物质。关于内源的GPCR,意味着由人体、昆虫、植物、细菌或病毒天然产生的物质,这些只作为例证但却不是限制。与之相对比,术语“非内源”在本文中 意味着不是由物种的基因组天然产生的物质。例如在其内源形式下并非组成型活化的受体,当应用在此公开的盒使之突变并因而使之组成型活化时,此受体被最优选地指称为“非内源的被组成型活化的受体”,这只作为例证而不是限制。两个用语都可被用来描述“体内”和“体外”系统。在筛选过程中,内源的或非内源的受体可被用于体外筛选系统,其中受体在哺乳动物的细胞表面表达,这也只作为例证而不是限制。作为进一步的例子而不是限制,当操作哺乳动物的基因组以包括非内源组成型活化受体时,可以通过体内系统筛选候选化合物。
宿主细胞 意味着能在其中插入质粒和/或载体的细胞。在原核宿主细胞情形下,当宿主细胞复制时质粒典型地以自主分子方式复制(在一般情况下,质粒在复制后被分离出来以被引入真核宿主细胞中);在真核宿主细胞情形下,质粒被整合进宿主细胞的细胞DNA中,因而,当真核细胞复制时,质粒复制。为在此公开的本发明的目的,宿主细胞优选是真核细胞,更优选是哺乳动物细胞,最优选地是从293、293T和COS-7细胞中选择出来的细胞。
非直接地识别或非直接地被识别 意味着发现药物的传统方法,该方法涉及对内源受体特异的内源配体的识别、筛选针对受体的候选化合物、确定那些干扰或竞争配体-受体相互反应的化合物、测量化合物对至少一个与活化受体相关的第二信使途径影响的效率。
抑制,与用语“反应”相联系,意味着在一个化合物存在时一个反应被降低或阻止,这正好与该化合物不存在时相反。
反激活剂 意味着这样的化合物,它们与内源受体或受体的组成型活化形式结合,并且将由受体的活性形式引发的基本细胞内反应抑制到正常基础水平以下,该活性水平是在没有激活剂或部分激活剂的情况下观察的,或者它们降低GTP与膜的结合。与在没有反激活剂情况下的基本反应相比,基本细胞内反应在反激活剂的存在下优选被抑制至少30%、更优选至少50%、最优选至少75%。。
已知受体 意味着其特异的内源配体已被识别的内源受体。
配体 意味着对内源的天然产生的受体特异的内源的天然产生的分子。
关于内源受体的核苷酸和/或氨基酸序列的突变 意味着这些内源序列的特定改造,从而使内源的非组成型活化受体的突变型能造成受体的组成型活化。对于特定序列的等价物,人受体的后续突变型被认为是人受体的首次突变的等价物,如果(a)后续突变型受体的组成型活化水平与受体的首次突变所表明的在本质上一样;和(b)在后续突变型受体和受体的首次突变之间的序列同源性的百分数是至少80%,更优选地是至少90%,最优选地是至少95%。在理想的情况下,考虑到在此公开的用于进行组成型活化的最优选的盒包括在内源和非内源型GPCR之间发生变化的单一氨基酸和/或密码子(即X或X密码子),序列同源性的百分数应是至少98%。
孤儿受体 意味着这样的内源受体,其特异的内源配体尚未被识别或尚未知。
药物组合物 意味着包括至少一种活性成分的组合物,借助此活性成分可以研究该组合物可在哺乳动物(例如但不限于人体)中特定的效果。本领域的那些普通技术人员将能够理解和正确评价那些适于确定活性成分是否具有基于技术人员需要的预期效果的技术。
质粒 意味着载体和cDNA的结合体。一般,为cDNA复制和/或表达蛋白质的目的将质粒引进宿主细胞。
刺激,与术语“反应”相联系,意味着当一种化合物存在时比当它不存在时反应增强。
横跨,涉及被定义的核苷酸序列或者被定义的氨基酸序列,意味着该序列位于至少两个不同且明确限定的区域之内。例如,在一个长度为10个氨基酸的氨基酸序列中,其中10个中的3个是在GPCR的TM6区,其余的7个是在GPCR的IC3区,这10个氨基酸就可被描述为横跨GPCR的TM6和IC3区。
针对cDNA的载体 意味着能够将至少一个cDNA掺入其中且能导入到宿主细胞中的环形DNA。
下面部分的顺序安排是为了表达效果,而不能被解释为对下面的公开或权利要求的限制。A.引言受体的传统研究一直是基于这样的前置假定(基于历史),即内源配体必须首先被识别,然后才能发现可以作用于受体的拮抗剂和其他分子。甚至在拮抗剂被首先发现的情况下,搜索的目光也立即延伸到查找内源配体上去。即使在发现组成型活化受体之后,这种思维模式也一直在受体研究中持续。在此之前没有被认识到的是,是受体的活性状态对发现受体的激活剂、部分激活剂和反激活剂是最有用的。对于那些因为受体的过度活化和不够活化而导致的疾病来说,希望得到的治疗药物是能分别用来减少受体的活性状态或增强受体活性的化合物,而并不需要是对抗内源配体的拮抗剂。这是因为,一个降低或增强活化态受体活性的化合物并不需要结合在和内源配体一样的位点上。因而,正如本发明的一个方法所说的那样,对治疗性化合物的任何搜索可通过筛选针对配体非依赖性活性态的化合物而开始。
筛选针对非内源的组成型活化的GPCR的候选化合物,这可直接识别与这些细胞表面受体作用的候选化合物,而根本无需先了解或使用此受体的内源配体。通过确定表达和/或过度表达这些GPCR的内源形式的身体内部区域,有可能确定与这些受体的表达和/或过度表达相关的疾病/紊乱状态;这种方法在本发明中得到公开。B.疾病/紊乱识别和/或选择最优选用本发明的材料识别针对非内源的组成型活化的GPCR的反激活剂。如此的反激活剂是治疗与这些受体有关的疾病的药物探索中先导化合物的理想候选者。因为可直接识别针对这些受体的反激活剂、部分激活剂或激活剂,因此有可能开发和搜索针对与这些受体有关的疾病和紊乱的药物组合物。例如,检查患病和正常组织样品中这些受体的存在,现在不仅仅是学术研究的问题,也是在孤儿受体的情形下通过识别来寻找内源配体的研究道路上所致力解决的问题。可在健康和患病组织的宽广范围内进行组织检查。如此的组织检查提供了把特异受体与疾病/紊乱相联系的优选第一步骤。
优选内源GPCR的DNA序列被用来制作探针,用于在组织样品中GPCR表达的放射标记cDNA或RT-PCR识别。在疾病组织中受体的存在,或者与正常组织相比在疾病组织中受体的浓度提高或降低,可被优选地用来识别与那种疾病的关联。用这种方法也可很好地把受体定位于器官的区域。基于受体被定位于其中的特定组织的已知功能,受体假想的功能性角色可被推导出来。C.“人GPCR脯氨酸标记物”算法规则和非内源的组成型活化的人GPCR的形成在生物技术领域所面临的许多挑战中,包括从一个物种搜集遗传信息并把该信息和其他物种的信息相联系的不可预测性---在本领域中没有比编码核酸和蛋白质的遗传序列这个问题更能困扰人的。因此,为了一致性并考虑到本领域的高不可预测性,下述发明用的哺乳动物术语局限于人GPCR---把本发明应用于其他哺乳动物物种,尽管具有潜在可能,但并不仅仅只是生搬硬套式的应用。
一般来说,当企图把从一种相关的蛋白质序列或物种中得到的普遍“规则”应用于其他序列或物种时,本领域一般是求助于序列的对齐比较,即把序列线性化并希望在两个或更多的序列之间发现相同的区域。尽管很有用,但此方法并不经常能产生有意义的信息。在GPCR情形下,尽管所有GPCR的一般结构基元是相同的,但TM、EC和IC在长度上的不同导致这样的对齐方法在从一种GPCR到另一种时变得很困难。因此,尽管可以期望应用一种普遍方法,例如从一种GPCR到另一种的组成型活化方法,但由于从一种到另一种GPCR在序列长度、一致性等上存在很大的不同,一种可普遍适用的和实际上成功的突变对齐方法在本质上是不可能的。作个类比,如此的一种方法与这样一种景况相似让一位旅行者从A点开始旅程,给他很多指向B点的不同的地图,但在任何一个地图上却没有任何比例尺或距离的标记物,然后让旅行者仅利用这些地图找出通向目的地B的最短和最有效的路径。在这样的情况下,通过以下手段被简化任务拥有(a)在每个地图上都有一个共同的“地方标记物”,和(b)测量从每一个地方标记物到目的地B的距离的能力,然后,这将容许旅行者选择从开始点A通向目的地B的最有效的路径。
在本质上,本发明的一个特点是提供在人GPCR中的这样一个坐标,它可以容许形成组成型活化的人GPCR。
正如此技术领域中所评价的那样,细胞的跨膜区域是高度疏水的;因此,应用通常的疏水测绘技术,本领域的技术人员就可确定GPCR的TM区,特别是TM6(同样的方法也可用来确定GPCR的EC区和IC区)。已经发现,在人GPCR的TM6区,一个共同的脯氨酸残基(一般靠近TM6的中间)是组成型活化的“标记物”。通过从脯氨酸标记物数过15个氨基酸,第16个氨基酸(定位在IC3环上)在从其内源形式突变为非内源形式时导致受体的组成型活化。为方便起见,我们把这称为“人GPCR脯氨酸标记物”算法规则。尽管在此位置的非内源氨基酸可以是任何一种氨基酸,但最优选的非内源氨基酸还是赖氨酸。尽管并不希望被任何理论所束缚,我们还是相信该位置的本身是独特的,并且,本位置上的突变会影响受体发生细成性的活化。
我们注意到,例如,当在第16位置上的内源氨基酸已经是赖氨酸时(如在GPR4和GPR32中的那样),那么为了让X是一个非内源氨基酸,它必不是赖氨酸;因此,在内源GPCR的第16位位置上是内源赖氨酸残基的情形下,非内源GPCR在该位置上优选不是赖氨酸的氨基酸,优选是丙氨酸、组氨酸和精氨酸。进一步注意到,确定了GPR4看起来被与Gs相联并在其内源形式下活化(数据没有列出)。
因为仅有20种天然氨基酸(尽管可以利用非天然形成的氨基酸),为此第16位置的替代而选择特别的非内源氨基酸是可行的,并且容许有效选择适合研究者需要的非内源氨基酸。然而,正如提示,在第16位更优选的非内源氨基酸是赖氨酸、组氨酸、精氨酸和丙氨酸,其中赖氨酸是最优选的。可认为本领域的普通技术人员有能力用熟练的方法改造密码子的序列以产生所期望的突变。
也发现,在偶然而并非是经常的情形下,脯氨酸残基标记物在TM6中位于W2之后(即,W2P1AA15X),其中W是色氨酸,2是任何氨基酸残基。
我们的发现否定了对于本领域常常应用不可预测且复杂的序列对齐方法的需求。其实,尽管在本质上是一个规则,但我们的发现的重要性就在于它可以容易的方法应用于人GPCR上,其可被本领域技术人员灵巧地简化,并得到独特和高度有用的终产品,即被组成型活化的人GPCR。因为需要很多年和很多资金来确定人GPCR的内源配体(正由人类基因组计划揭示),本发明不仅降低积极探索这种序列信息所必要的时间,也能显著地节约成本。本方法能够真正证实人类基因组计划的重要性,因为它不仅准许应用基因信息来理解GPCR在疾病等中的角色,也能提供提高人类健康状况的可能。D.候选化合物的筛选1.GPCR的筛选测定技术当一种G蛋白受体变为组成型活化时,它与G蛋白(例如,Gq、Gs、Gi、Go)偶联并刺激释放GTP,其后GTP与G蛋白结合。接着,借助受体在正常情况下失活,G蛋白作为GTP酶慢慢地把GTP水解为GDP,然而,包括本发明的非内源的组成型活化的人GPCR在内,组成型活化的受体继续把GDP转化为GTP。GTP非可水解的类似物[35S]GTPγS,可被用来监测与表达组成型活化受体的膜上的G蛋白加强了的结合。据报道,[35S]GTPγS可被用来监测在配体存在或不存在的情形下G蛋白与膜的偶联。在本领域中著名和可行的其他例证中有此种监测的一个例证,它由Traynor和Nahorski在1995年所报道。本测定系统的一个优选的应用是为了初步筛选候选化合物,因为本系统对所有蛋白-偶联受体一般可行,而不考虑与受体的细胞内结构域相互作用的那一种特别的G蛋白。B2.特定的GPCR筛选测定技术C 一旦应用“一般”G蛋白偶联的受体测定方法(即筛选是激活剂、部分激活剂或反激活剂的化合物的方法)识别出候选化合物,优选进一步筛选以确认作用在受体位点的化合物。例如,应用“一般”测定方法识别的化合物可以不与受体结合,但也可以仅仅从细胞内结构域与G蛋白“解偶联”。a.Gs和GiGs刺激腺苷酸环化酶。另一方面,Gi(和Go)抑制该酶。腺苷酸环化酶催化ATP向cAMP的转化;因此,与Gs蛋白偶联的组成型活化的GPCR与升高的细胞内cAMP水平相关联。在另一方面,与Gi(或Go)蛋白偶联的组成型活化的GPCR与降低的细胞内cAMP水平相关联。一般情况参见“突触传导的非直接机制(Indirect Mechanisms ofSynaptic Transmission)”,第8章,丛神经到大脑(From Neuron ToBrain)(第三版),Nichols,J.G.等编,Sinauer Associates,Inc.(1992)。因此,检测cAMP的方法可被用来确定一个竞争性的化合物是否是受体的反激活剂(即这样的一个化合物将能降低cAMP的水平)等。本领域已知的测定cAMP的不同方法可以被利用;最优选的方法依赖于在基于ELISA的方法中应用抗-cAMP的抗体。可被应用的另一类测定方法是一种全细胞第二信使报告基因系统测定法。基因上的启动子驱动由一个特别的基因所编码的蛋白质的表达。环AMP通过以下步骤促进基因的表达,即它响应促进cAMP的DNA结合蛋白或转录因子(CREB)的结合,转录因子接着在被称为cAMP效应元件的特别位点与肩动子结合并驱动基因表达。报告基因系统可被构建为具有一个启动子,该启动子在报告基因的前面含有多个cAMP效应兀件,例如β-半乳糖苷酶或荧光素酶。因而,一个被组成型活化的连接Gs的受体引起cAMP的积累,cAMP接着激活报告蛋白质的基因和表达。β-半乳糖苷酶或荧光素酶等报告蛋白质可用标准生化方法检测到(Chen等,1995)。对于偶联Gi(或Go)的GPCR来说,它能降低cAMP水平,一种筛选反激活剂等的方法是基于应用与Gs相连接的受体(并因而降低cAMP水平),该筛选方法在实施例部分被公开,其针对GPR17和GPR30。b.Go和GqGo和Gq与磷脂酶C的活化相联系,磷脂酶随后水解磷酸酯PIP2,并释放两种细胞内信使二酰甘油(DAG)和肌醇-1,4,5-三磷酸(IP3)。积累增加的IP3与Gq-和Go-关联的受体相关联。一般情况参见“突触传导的非直接机制(Indirect Mechanisms of Synaptic Transmission)”,第8章,从神经到大脑(From Neuron To Brain)(第三版),Nichols,J.G.等编,Sinauer Associates,Inc.(1992)。测定IP3积累的方法可被用来确定一个候选化合物是否是例如针对Gq-或Go-关联受体等的反激活剂(即如此的化合物能降低IP3的水平)。Gq关联受体也可用AP1报告基因测定方法来检测,因为Gq依赖的磷脂酶C引起含有AP1元件的基因活化;因而,活化的Gq关联受体将导致如此基因的表达增高,而其反激活剂将导致如此表达的降低,激活剂将导致如此表达的升高。进行如此测定的商业可得的方法是可得的。E.药物化学在一般但并非经常的情况下对候选化合物直接识别与通过组合化学技术产生的化合物联合使用,其中随机制备几千种化合物用于此分析。如此筛选的结果一般将是具有独特中心结构的化合物;其后,这些化合物围绕着一个优选的中心结构而被优选进行额外的化学修饰,以进一步加强其药用性质。这样的技术在该领域中是已知的,并不需要在本专利文件中详细描述。F.药物细合物为进一步开发而选择出的候选化合物可应用本领域周知的技术制剂成药物组合物。适宜的药物可接受的载体在本领域中是可得的;例如,参见Remington’s Pharmaceuctical Sciences,第16版,1980,MackPublishing Co.(Oslo等编)。G.其他应用尽管公开的非内源人GPCR的一个优选的应用是为了直接识别作为反激活剂、激活剂或部分激活剂(优选地作为药物使用)的候选化合物,这些受体也可被用于研究之用。例如,导入这些受体的体外或体内系统可被用来阐释和理解受体在正常和患病的人体状况中的作用,也可理解当它应用于理解信号级联反应时组成型活化的角色。这些非内源受体的一个价值由于其独特的特点是它们作为研究工具的用途被强化,公开的受体可被用来理解特殊受体在人体中的作用,即使在其内源配体被识别之前。公开的受体的其他应用对于本领域的技术人员将是明显的,特别是当他们阅读了本申请文件之后。实施例下面的实施例是为了说明而非限制本发明的目的。根据本发明文件的叙述,可以在基于与TM6中的脯氨酸残基有关的位置的人GPCR的IC3环中应用突变盒以组成型活化受体,并且,尽管在此公开了特定的核苷酸和氨基酸序列,但可认为本领域中的那些普通技术人员拥有这样的能力,即通过对这些序列的简单的修饰就可得到与下面所报道的相同或基本相似的结果。基于技术人员特殊需要的特殊的序列突变方法是在技术人员的了解范围之内。实施例1制备人内源GPCR不同的GPCR应用在如下的实施例中。一些人内源GPCR在表达载体中提供(如下面致谢),其他的人内源GPCR是用公众可得的序列信息从头合成的。1.GPR1(GenBank入藏登记号U13666)GPR1的人cDNA序列由Brian O’Dowd(多伦多大学)在pRcCMV中提供。GPR1 cDNA(1.4kb片段)作为一个NdeI-XbaI片段被从pRcCMV载体中切下,并被亚克隆进pCMV载体的NdeI-XbaI位点(参见图3)。人GPR1的核苷酸(SEQ ID NO1)和氨基酸(SEQ ID NO2)序列其后被确定和证实。2.GPR4(GenBank入藏登记号L36148,U35399,U21051)GPR4的人cDNA序列由Brian O’Dowd(多伦多大学)在pRcCMV中提供。GPR1 cDNA(1.4kb片段)作为一个ApaI(平端)-XbaI片段被从pRcCMV载体中切下,并被亚克隆进(把大部分5’未翻译区除去)pCMV载体的HindIII(平端)-XbaI位点。人GPR4的核苷酸序列(SEQ ID NO3)和氨基酸(SEQ ID NO4)序列其后被确定和证实。3.GPR5(GenBank入藏登记号L36149)按如下步骤产生人GPR5的cDNA并将其克隆进pCMV表达载体以基因组DNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中用每个引物0.25μM、4种核苷酸中每种0.2mM。循环条件是进行30个循环94℃1分钟;64℃ 1分钟;72℃ 1.5分钟。5’PCR引物包括一个EcoR I位点,其序列为5’-TATGAATTCAGATGCTCTAAACGTCCCTGC-3’(SEQ IDNO5)3’引物包括BamH I位点,其序列为5’-TCCGGATCCACCTGCACCTGCGCCTGCACC-3’(SEQ IDNO6)1.1kb PCR片段被用EcoR I和BamH I消化并被克隆进pCMV表达载体的EcoR I-BamH I位点。人GPR5的核酸(SEQ ID NO7)和氨基酸(SEQ ID NO8)序列其后被确定和证实。4.GPR7(GenBank入藏登记号U22491)按如下步骤产生人GPR7的cDNA并把其克隆进pCMV表达载体PCR条件-以基因组DNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸中每种0.2mM。循环条件是进行30个循环94℃ 1分钟;62℃ 1分钟;72℃ 1分钟20秒。5’PCR引物包括一个Hind III位点,且其序列为5’-GCAAGCTTGGGGGACGCCAGGTCGCCGGCT-3’(SEQ IDNO9)3’引物包括BamH I位点,其序列为5’-GCGGATCCGGACGCTGGGGGAGTCAGGCTGC-3’(SEQ IDNO10)1.1kb PCR片段被用Hind III和BamH I消化并被克隆进pCMV表达载体的Hind III-BamH I位点。人GPR7的核酸(SEQ ID NO11)和氨基酸(SEQ ID NO12)序列其后被确定和证实。5.GPR8(GenBank入藏登记号U22492)按如下步骤产生人GPR8的cDNA并把其克隆进pCMV表达载体以基因组DNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸中每种0.2mM。循环条件是进行30个循环94℃ 1分钟;62℃ 1分钟;72℃ 1分钟20秒。5’PCR引物包括一个EcoR I位点,其序列为5’-CGGAATTCGTCAACGGTCCCAGCTACAATG-3’(SEQ IDNO13)3’引物包括BamH I位点,其序列为5’-ATGGATCCCAGGCCCTTCAGCACCGCAATAT-3’(SEQ IDNO14)1.1kb PCR片段被用EcoR I和BamH I消化并被克隆进pCMV表达载体的EcoR I-BamH I位点。所有测序的4个cDNA克隆包含可能的多态现象,它涉及第206个氨基酸从Arg转变为Gln。暂且不论这个差别,人GPR8的核酸(SEQ ID NO15)和氨基酸(SEQ ID NO16)序列其后被确定和证实。6.GPR9(GenBank入藏登记号X95876)按如下步骤产生人GPR9的cDNA并把其克隆进pCMV表达载体以克隆为模板(由Brian O’Dowd提供),用pfu聚合酶(Stratagene)和制造商提供的加有10%DMSO的缓冲系统进行PCR,其中使用每种引物0.25μM、4种核苷酸每种0.5mM。循环条件是进行25个循环94℃ 1分钟;56℃ 1分钟;72℃ 2.5分钟。5’PCR引物包括一个EcoRI位点,其序列为5’-ACGAATTCAGCCATGGTCCTTGAGGTGAGTGACCACCAAGTGCTAAAT-3’(SEQ ID NO17)3’引物包括BamH I位点,其序列为5’-GAGGATCCTGGAATGCGGGGAAGTCAG-3’(SEQ ID NO18)1.2kb PCR片段被用EcoR I消化并被克隆进pCMV表达载体的EcoR I-Sam I位点。人GPR9的核酸(SEQ ID NO19)和氨基酸(SEQ IDNO20)序列其后被确定和证实。7.GPR9-6(GenBank入藏登记号U45982)按如下步骤产生人GPR9-6的cDNA并把其克隆进pCMV表达载体以基因组DNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸中每种0.2mM。循环条件是进行30个循环94℃ 1分钟;62℃ 1分钟;72℃ 1分钟20秒。5’PCR引物被用如下序列激酶化5’-TTAAGCTTGACCTAATGCCATCTTGTGTCC-3’(SEQ IDNO21)3’引物包括BamH I位点,其序列为
5’-TTGGATCCAAAAGAACCATGCACCTCAGAG-3’(SEQ IDNO22)1.2kb PCR片段被用BamH I消化并被克隆进pCMV表达载体的EcoRV-BamH I位点。人GPR9-6的核酸(SEQ ID NO23)和氨基酸(SEQID NO24)序列其后被确定和证实。8.GPR10(GenBank入藏登记号U32672)GPR10的人cDNA序列由Brian O’Dowd(多伦多大学)在pRcCMV中提供。GPR10 cDNA(1.3kb片段)作为一个EcoRI-XbaI片段被从pRcCMV载体中切下,并被亚克隆进pCMV载体的EcoRI-XbaI位点。人GPR10的核酸(SEQ ID NO25)和氨基酸(SEQ ID NO26)序列其后被确定和证实。9.GPR15(GenBank入藏登记号U34806)GPR15的人cDNA序列由Brian O’Dowd(多伦多大学)在pCDNA3中提供。GPR15 cDNA(1.5kb片段)作为一个HindIII-Bam片段被从pCDNA3载体中切下,并被亚克隆进pCMV载体的HindIII-Bam位点。人GPR15的核酸(SEQ ID NO27)和氨基酸(SEQ ID NO28)序列其后被确定和证实。10.GPR17(GenBank入藏登记号Z94154)按如下步骤产生人GPR17的cDNA并把其克隆进pCMV表达载体以基因组DNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸每种0.2mM。循环条件是进行30个循环94℃ 1分钟;56℃ 1分钟72℃ 1分钟20秒。5’PCR引物包括一个EcoR I位点,其序列为5’-CTAGAATTCTGACTCCAGCCAAAGCATGAAT-3’(SEQ IDNO29)3’引物包括BamH I位点,其序列为
5’-GCTGGATCCTAAACAGTCTGCGCTCGGCCT-3’(SEQ IDNO30)1.1kb PCR片段被用EcoR I和BamH I消化并被克隆进pCMV表达载体的EcoR I-BamH I位点。人GPR17的核酸(SEQ ID NO31)和氨基酸(SEQ ID NO32)序列其后被确定和证实。11.GPR18(GenBank入藏登记号L42324)按如下步骤产生人GPR18的cDNA并把其克隆进pCMV表达载体以基因组DNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸中每种0.2mM。循环条件是进行30个循环94℃ 1分钟;54℃ 1分钟;72℃ 1分钟20秒。5’PCR引物被用如下序列激酶化5’-ATAAGATGATCACCCTGAACAATCAAGAT-3’(SEQ IDNO33)3’引物包括EcoR I位点,其序列为5’-TCCGAATTCATAACATTTCACTGTTTATATTGC-3’(SEQ IDNO34)1.0kb PCR片段被用EcoR I消化并被克隆进pCMV表达载体的平端-EcoR I位点。所有8个被测序的cDNA克隆含有4种可能的多态,其中涉及第12位的氨基酸从Thr变为Pro,第86位的氨基酸从Ala变为Glu,第97位的氨基酸从Ile变为Leu,第310位的氨基酸从Leu变为Met。暂且不论这些改变,人GPR18的核酸(SEQ ID NO35)和氨基酸(SEQ ID NO36)序列其后被确定和证实。12.GPR20(GenBank入藏登记号U66579)按如下步骤产生人GPR20的cDNA并把其克隆进pCMV表达载体以基因组DNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸中每个0.2mM。循环条件是30个循环94℃ 1分钟;62℃ 1分钟;72℃ 1分钟20秒。5’PCR引物被用如下序列激酶化5’-CCAAGCTTCCAGGCCTGGGGTGTGCTGG-3’(SEQ IDNO37)3’引物包括BamH I位点,其序列为5’-ATGGATCCTGACCTTCGGCCCCTGGCAGA-3’(SEQ IDNO38)1.2kb PCR片段被用BamH I消化并被克隆进pCMV表达载体的EcoRV-BamH I位点。人GPR20的核酸(SEQ ID NO39)和氨基酸(SEQID NO40)序列其后被确定和证实。13.GPR21(GenBank入藏登记号U66580)按如下步骤产生人GPR21的cDNA并把其克隆进pCMV表达载体以基因组DNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中每个引物0.25μM、4种核苷酸中每种0.2mM。循环条件是进行30个循环94℃ 1分钟;62℃ 1分钟;72℃ 1分钟20秒。5’PCR引物被用如下序列激酶化5’-GAGAATTCACTCCTGAGCTCAAGATGAACT-3’(SEQ IDNO41)3’引物包括BamH I位点,其序列为5’-CGGGATCCCCGTAACTGAGCCACTTCAGAT-3’(SEQ IDNO42)1.1kb PCR片段被用BamH I消化并被克隆进pCMV表达载体的EcoRV-BamH I位点。人GPR21的核酸(SEQ ID NO43)和氨基酸(SEQID NO44)序列其后被确定和证实。14.GPR22(GenBank入藏登记号U66581)按如下步骤产生人GPR22的cDNA并把其克隆进pCMV表达载体以基因组DNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸中每种0.2mM。循环条件是进行30个循环94℃ 1分钟;50℃ 1分钟;72℃ 1.5分钟。5’PCR引物被用如下序列激酶化5’-TCCCCCGGGAAAAAAACCAACTGCTCCAAA-3’(SEQ IDNO45)3’引物包括BamH I位点,其序列为5’-TAGGATCCATTTGAATGTGGATTTGGTGAAA-3’(SEQ IDNO46)1.38kb PCR片段被用BamH I消化并被克隆进pCMV表达载体的EcoRV-BamH I位点。人GPR22的核酸(SEQ ID NO47)和氨基酸(SEQID NO48)序列其后被确定和证实。15.GPR24(GenBank入藏登记号U71092)按如下步骤产生人GPR24的cDNA并把其克隆进pCMV表达载体以基因组DNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸中每种0.2mM。循环条件是进行30个循环94℃ 1分钟;56℃ 1分钟;72℃ 1分钟20秒。5’PCR引物含有具有如下序列的Hind III位点5’-GTGAAGCTTGCCTCTGGTGCCTGCAGGAGG-3’(SEQ IDNO49)3’引物包括EcoR I位点,其序列为5’-GCAGAATTCCCGGTGGCGTGTTGTGGTGCCC-3’(SEQ IDNO50)1.3kb PCR片段被用Hind III和EcoR I消化并被克隆进pCMV表达载体的Hind III和EcoR I位点。人GPR24的核酸(SEQ ID NO51)和氨基酸(SEQ ID NO52)序列其后被确定和证实。16.GPR30(GenBank入藏登记号U63917)按如下步骤产生人GPR30的cDNA并克隆化从基因组DNA中扩增GPR30(1128bp的长度)的编码序列,并应用下面的引物5’GGCGGATCCATGGATGTGACTTCCCAA-3’(SEQ ID NO53)和5’GGCGGATCCCTACACGGCACTGCTGAA-3’(SEQ IDNO54)。
然后用“TOPO-TA克隆试剂盒”(Invitrogen,#K4500-01)跟随制造商的指令把扩增的产品克隆进商业可得的载体PCR2.1(Invitrogen)中。用BamH I消化来释放全长GPR30插入子,用琼脂糖凝胶电泳使之从载体中分离,用Sephaglas BandprepTM试剂盒(Pharmacia,#27-9285-01)按照制造商的指令纯化。人GPR30的核酸(SEQ ID NO55)和氨基酸(SEQ ID NO56)序列其后被确定和证实。17.GPR31(GenBank入藏登记号U65402)按如下步骤产生人GPR31的cDNA并把其克隆进pCMV表达载体以基因组DNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸中每种0.2mM。循环条件是进行30个循环94℃ 1分钟;58℃ 1分钟;72℃ 2分钟。5’PCR引物含有具有如下序列的EcoR I位点5’-AAGGAATTCACGGCCGGGTGATGCCATTCCC-3’(SEQ IDNO57)3’引物包括BamH I位点,其序列为5’-GGTGGATCCATAAACACGGGCGTTGAGGAC-3’(SEQ IDNO58)1.0kb PCR片段被用EcoR I和BamH I消化并被克隆进pCMV表达载体的EcoR I-BamH I位点。人GPR31的核酸(SEQ ID NO59)和氨基酸(SEQ ID NO60)序列其后被确定和证实。18.GPR32(GenBank入藏登记号AF045764)按如下步骤产生人GPR32的cDNA并把其克隆进pCMV表达载体以基因组DNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸中每种0.2mM。循环条件是进行30个循环94℃ 1分钟;56℃ 1分钟;72℃ 1分钟20秒。5’PCR引物含有具有如下序列的EcoR I位点5’-TAAGAATTCCATAAAAATTATGGAATGG-3’(SEQ IDNO243)3’引物包括BamH I位点,其序列为5’-CCAGGATCCAGCTGAAGTCTTCCATCATTC-3’(SEQ IDNO244)1.1kb PCR片段被用EcoR I和BamH I消化并被克隆进pCMV表达载体的EcoR I-BamH I位点。人GPR32的核酸(SEQ ID NO245)和氨基酸(SEQ ID NO246)序列其后被确定和证实。19.GPR40(GenBank入藏登记号AF024687)按如下步骤产生人GPR40的cDNA并把其克隆进pCMV表达载体以基因组DNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸中每种0.2mM。循环条件是进行30个循环94℃ 1分钟;65℃ 1分钟;72℃ 1分钟10秒。5’PCR引物含有具有如下序列的EcoR I位点5’-GCAGAATTCGGCGGCCCCATGGACCTGCCCCC-3’(SEQ IDNO247)3’引物包括BamH I位点,其序列为5’-GCTGGATCCCCCGAGCAGTGGCGTTACTTC-3’(SEQ IDNO248)1kb PCR片段被用EcoR I和BamH I消化并被克隆进pCMV表达载体的EcoR I-BamH I位点。人GPR40的核酸(SEQ ID NO249)和氨基酸(SEQ ID NO250)序列其后被确定和证实。20.GPR41(GenBank入藏登记号AF024688)按如下步骤产生人GPR41的cDNA并把其克隆进pCMV表达载体以基因组DNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸中每种0.2mM。循环条件是进行30个循环94℃ 1分钟;65℃ 1分钟;72℃ 1分钟10秒。5’PCR引物含有具有如下序列的Hind III位点5’-CTCAAGCTTACTCTCTCTCACCAGTGGCCAC-3’(SEQ IDNO251)3’引物被用下列序列激酶化5’-CCCTCCTCCCCCGGAGGACCTAGC-3’(SEQ ID NO252)1kb PCR片段被用Hind III消化并被克隆进pCMV表达载体的Hind III-平端位点。人GPR41的核酸(SEQ ID NO253)和氨基酸(SEQ IDNO254)序列其后被确定和证实。21.GPR43(GenBank入藏登记号AF024690)按如下步骤产生人GPR43的cDNA并把其克隆进pCMV表达载体以基因组DNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸中每种0.2mM。循环条件是进行30个循环94℃ 1分钟;65℃ 1分钟;72℃ 1分钟10秒。5’PCR引物含有具有如下序列的Hind III位点5’-TTTAAGCTTCCCCTCCAGGATGCTGCCGGAC-3’(SEQ IDNO255)3’引物包括EcoR I位点,其序列为5’-GGCGAATTCTGAAGGTCCAGGGAAACTGCTA-3’(SEQ IDNO256)1kb PCR片段被用Hind III和EcoR I消化并被克隆进pCMV表达载体的Hind III-EcoR I位点。人GPR43的核酸(SEQ ID NO257)和氨基酸(SEQ ID NO258)序列其后被确定和证实。22.APJ(GenBank入藏登记号U03642)人APJ的cDNA(在pRcCMV载体中)由Brian O’Dowd(多伦多大学)提供。人APJ的cDNA作为一个EcoR I-XbaI(平端)片段被从pRcCMV载体中切下,并被亚克隆进pCMV载体的EcoR I-Smal位点。人APJ的核苷酸(SEQ ID NO61)和氨基酸(SEQ ID NO62)序列其后被确定和证实。23.BLR1(GenBank入藏登记号X68149)按如下步骤产生人BLR1的cDNA并把其克隆进pCMV表达载体以胸腺cDNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸每种0.2mM。循环条件是进行30个循环94℃ 1分钟;62℃ 1分钟;72℃ 1分钟20秒。5’PCR引物含有具有如下序列的EcoR I位点5’-TGAGAATTCTGGTGACTCACAGCCGGCACAG-3’(SEQ IDNO63)3’引物包括BamH I位点,其序列为5’-GCCGGATCCAAGGAAAAGCAGCAATAAAAGG-3’(SEQ IDNO64)1.2kb PCR片段被用EcoR I和BamH I消化并被克隆进pCMV表达载体的EcoR I-BamH I位点。人BLR1的核酸(SEQ ID NO65)和氨基酸(SEQ ID NO66)序列其后被确定和证实。24.CEPR(GenBank入藏登记号U77827)按如下步骤产生人CEPR的cDNA并把其克隆进pCMV表达载体以基因组cDNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸中每种0.2mM。循环条件是进行30个循环94℃ 1分钟;65℃ 1分钟;72℃ 1分钟20秒。5’PCR引物被如下序列激酶化5’-CAAAGCTTGAAAGCTGCACGGTGCAGAGAC-3’(SEQ IDNO67)3’引物包括BamH I位点,其序列为5’-GCGGATCCCGAGTCACACCCTGGCTGGGCC-3’(SEQ IDNO68)1.2kb PCR片段被用BamH I消化并被克隆进pCMV表达载体的EcoRV-BamH I位点。人CEPR的核酸(SEQ ID NO69)和氨基酸(SEQ IDNO70)序列其后被确定和证实。25.EBIl(GenBank入藏登记号L31581)按如下步骤产生人EBI1的cDNA并把其克隆进pCMV表达载体以胸腺cDNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸中每种0.2mM。循环条件是进行30个循环94℃ 1分钟;62℃ 1分钟;72℃ 1分钟20秒。5’PCR引物包括EcoR I位点,其序列为5’-ACAGAATTCCTGTGTGGTTTTACCGCCCAG-3’(SEQ IDNO71)3’引物包括BamH I位点,其序列为5’-CTCGGATCCAGGCAGAAGAGTCGCCTATGG-3’(SEQ IDNO72)
1.2kb PCR片段被用EcoR I和BamH I消化并被克隆进pCMV表达载体的EcoR I-BamH I位点。人EBI1的核酸(SEQ ID NO73)和氨基酸(SEQ ID NO74)序列其后被确定和证实。26.EBI2(GenBank入藏登记号L08177)按如下步骤产生人EBI2的cDNA并把其克隆进pCMV表达载体以cDNA克隆为模板(由Kevin Lynch提供,University of VirginiaHealth Sciences Center;应用的载体未被本资料识别),用pfu聚合酶(Stratagene)和制造商提供的缓冲系统并辅以10%DMSO进行PCR,其中使用每个引物0.25μM、4种核苷酸中每种0.5mM。循环条件是进行30个循环94℃ 1分钟;60℃ 1分钟;72℃ 1分钟20秒。5’PCR引物包括EcoR I位点,其序列为5’-CTGGAATTCACCTGGACCACCACCAATGGATA-3’(SEQ IDNO75)3’引物包括BamH I位点,其序列为5’-CTCGGATCCTGCAAAGTTTGTCATACAGTT-3’(SEQ IDNO76)1.2kb PCR片段被用EcoR I和BamH I消化并被克隆进pCMV表达载体的EcoR I-BamH I位点。人EBI2的核酸(SEQ ID NO77)和氨基酸(SEQ ID NO78)序列其后被确定和证实。27.ETBR-LP2(GenBank入藏登记号D38449)按如下步骤产生人ETBR-LP2的cDNA并把其克隆进pCMV表达载体以脑cDNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸中每种0.2mM。循环条件是进行30个循环94℃ 1分钟;65℃ 1分钟;72℃ 1.5分钟。5’PCR引物包括EcoR I位点,其序列为5’-CTGGAATTCTCCTGCTCATCCAGCCATGCGG-3’(SEQ IDNO79)3’引物包括BamH I位点,其序列为5’-CCTGGATCCCCACCCCTACTGGGGCCTCAG-3’(SEQ IDNO80)1.5kb PCR片段被用EcoR I和BamH I消化并被克隆进pCMV表达载体的EcoR I-BamH I位点。人ETBR-LP2的核酸(SEQ ID NO81)和氨基酸(SEQ ID NO82)序列其后被确定和证实。28.GHSR(GenBank入藏登记号U60179)按如下步骤产生人GHSR的cDNA并把其克隆进pCMV表达载体以海马cDNA为模板,用TaqPlus Precision聚合酶(Stratagene)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸中每种0.2mM。循环条件是进行30个循环94℃ 1分钟;68℃ 1分钟;72℃ 1分钟10秒。对于第一轮PCR,5’PCR引物序列为5’-ATGTGGAACGCGACGCCCAGCG-3’(SEQ ID NO83)3’引物序列为5’-TCATGTATTAATACTAGATTCT-3’(SEQ ID NO84)2毫升第一轮PCR产物被用作第二轮PCR模板,其中5’引物被如下序列所激酶化5’-TACCATGTGGAACGCGACGCCCAGCGAAGAGCCGGGGT-3’(SEQ ID NO85)3’PCR引物包括EcoR I位点,其序列为5’-CGGAATTCATGTATTAATACTAGATTCTGTCCAGGCCCG-3’(SEQ ID NO86)1.1kb PCR片段被用EcoR I消化并被克隆进pCMV表达载体的平端-EcoR I位点。人GHSR的核酸(SEQ ID NO87)和氨基酸(SEQ IDNO88)序列其后被确定和证实。29.GPCR-CNS(GenBank入藏登记号AFO17262)按如下步骤产生人GPCR-CNS的cDNA并把其克隆进pCMV表达载体以脑cDNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸中每种0.2mM。循环条件是进行30个循环94℃ 1分钟65℃ 1分钟;72℃ 2分钟。5’PCR引物包括Hind III位点,其序列为5’-GCAAGCTTGTGCCCTCACCAAGCCATGCGAGCC-3’(SEQID NO89)3’引物包括EcoR I位点,其序列为5’-CGGAATTCAGCAATGAGTTCCGACAGAAGC-3’(SEQ IDNO90)1.9kb PCR片段被用Hind III和EcoR I消化并被克隆进pCMV表达载体的Hind III-EcoR I位点。所有9个被测序的克隆包含涉及一个S284C变化的潜在多态现象。暂且不论这个差别,人GPCR-CNS的核酸(SEQ ID NO91)和氨基酸(SEQ ID NO92)序列其后被确定和证实。30.GPR-NGA(GenBank入藏登记号U55312)按如下步骤产生人GPR-NGA的cDNA并把其克隆进pCMV表达载体以基因组cDNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸中每种0.2mM。循环条件是进行30个循环94℃ 1分钟;56℃ 1分钟;72℃ 1.5分钟。5’PCR引物包括EcoR I位点,其序列为5’-CAGAATTCAGAGAAAAAAAGTGAATATGGTTTTT-3’(SEQID NO93)
3’引物包括BamH I位点,其序列为5’-TTGGATCCCTGGTGCATAACAATTGAAAGAAT-3’(SEQ IDNO94)1.3kb PCR片段被用EcoR I和BamH I消化并被克隆进pCMV表达载体的EcoR I-BamH I位点。人GPR-NGA的核酸(SEQ ID NO95)和氨基酸(SEQ ID NO96)序列其后被确定和证实。31.H9(GenBank入藏登记号U52219)按如下步骤产生人HB954的cDNA并把其克隆进pCMV表达载体以垂体cDNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸中每种0.2mM。循环条件是进行30个循环94℃ 1分钟;62℃ 1分钟;72℃ 2分钟。5’PCR引物包括Hind III位点,其序列为5’-GGAAAGCTTAACGATCCCCAGGAGCAACAT-3’(SEQ IDNO97)3’引物包括BamH I位点,其序列为5’-CTGGGATCCTACGAGAGCATTTTTCACACAG-3’(SEQ IDNO98)1.9kb PCR片段被用Hind III和BamH I消化并被克隆进pCMV表达载体的Hind III-BamH I位点。与公开的序列相比较,还识别出一个不同的同种型并称其为“H9b”,其在细胞质尾部有一个12个bp的框插入。两个同种型都含有涉及氨基酸P320S和G448A改变的潜在多态现象。同种型H9a含有另一个涉及氨基酸S493N改变的潜在多态现象,而同种型H9b含有另两个额外的涉及氨基酸I502T和A532T(相当于同种型H9a的氨基酸528)改变的潜在多态现象。人H9的核酸(SEQ ID NO99)和氨基酸(SEQ ID NO100)序列其后被确定和证实(在下面的部分,两个同种型都依据人GPCR脯氨酸标记物规则进行突变)。32.HB954(GenBank入藏登记号D38449)按如下步骤产生人HB954的cDNA并把其克隆进pCMV表达载体以脑cDNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸每种0.2mM。循环条件是进行30个循环94℃ 1分钟;58℃ 1分钟;72℃ 2分钟。5’PCR引物包括Hind III位点,其序列为5’-TCCAAGCTTCGCCATGGGACATAACGGGAGCT-3’(SEQ IDNO101)3’引物包括EcoR I位点,其序列为5’-CGTGAATTCCAAGAATTTACAATCCTTGCT-3’(SEQ IDNO102)1.6kb PCR片段被用Hind III和EcoR I消化并被克隆进pCMV表达载体的Hind III-EcoR I位点。人HB954的核酸(SEQ ID NO103)和氨基酸(SEQ ID NO104)序列其后被确定和证实。33.HG38(GenBank入藏登记号AF062006)按如下步骤产生人HB38的cDNA并把其克隆进pCMV表达载体以脑cDNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸中每种0.2mM。循环条件是进行30个循环94℃ 1分钟;56℃ 1分钟;72℃ 1分钟30秒。进行两次PCR反应以分别获得5’和3’片段。对于5’片段,5’PCR引物包括Hind III位点,其序列为5’-CCCAAGCTTCGGGCACCATGGACACCTCCC-3’(SEQ IDNO259)
3’引物包括BamH I位点,其序列为5’-ACAGGATCCAAATGCACAGCACTGGTAAGC-3’(SEQ IDNO260)这个5’的1.5kb PCR片段被用Hind III和BamH I消化并被克隆进pCMV表达载体的Hind III-BamH I位点。对于3’引物,5’PCR引物被如下序列激酶化5’-CTATAACTGGGTTACATGGTTTAAC-3’(SEQ ID NO261)3’引物包括EcoR I位点,其序列为5’-TTTGAATTCACATATTAATTAGAGACATGG-3’(SEQ IDNO262)1.4kb的3’PCR片段被用EcoR I消化并被亚克隆进pCMV表达载体的平端-EcoR I位点。接着5’片段和3’片段通过一个共同的EcoRV位点被连接在一起,得到全长cDNA克隆。人HG38的核酸(SEQ IDNO263)和氨基酸(SEQ ID NO264)序列其后被确定和证实。34.HM74(GenBank入藏登记号D10923)按如下步骤产生人HM74的cDNA并把其克隆进pCMV表达载体以基因组cDNA或者胸腺cDNA为模板,用rTth聚合酶(PerkinElmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸每种0.2mM。循环条件是进行30个循环94℃ 1分钟;65℃ 1分钟;72℃ 1分钟20秒。5’PCR引物包括EcoR I位点,其序列为5’-GGAGAATTCACTAGGCGAGGCGCTCCATC-3’(SEQ IDNO105)3’引物被如下序列激酶化5’-GGAGGATCCAGGAAACCTTAGGCCGAGTCC-3’(SEQ IDNO106)1.3kb PCR片段被用EcoR I消化并被克隆进pCMV表达载体的EcoR I-Sma I位点。测序的克隆揭示了涉及一个N94K改造的潜在多态现象。暂且不论这个差别,人HM74的核酸(SEQ ID NO107)和氨基酸(SEQ ID NO108)序列其后被确定和证实。35.MIG(GenBank入藏登记号AFO44600和AFO44601)按如下步骤产生人MIG的cDNA并把其克隆进pCMV表达载体以基因组cDNA为模板,用TaqPlus Precision聚合酶(Stratagene)(第一轮PCR)或pfu聚合酶(Stratagene)(第二轮PCR)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸每种0.2mM(TaqPlus Precision)或0.5mM(pfu)。当用pfu时,在缓冲液中包括10%的DMSO。循环条件是进行30个循环94℃ 1分钟;65℃ 1分钟;在72℃(a)第一轮PCR 1分钟和(b)第二轮PCR 2分钟。因为在编码区有一个内含子,分别应用两套引物以产生重叠的5’和3’片段。5’片段PCR引物是5’-ACCATGGCTTGCAATGGCAGTGCGGCCAGGGGGCACT-3’(外部有义)(SEQ ID NO109)和5’-CGACCAGGACAAACAGCATCTTGGTCACTTGTCTCCGGC-3’(内部反义)(SEQ ID NO110)。
3’片段PCR引物为5’-GACCAAGATGCTGTTTGTCCTGGTCGTGGTGTTTGGCAT-3’(内部有义)(SEQ ID NO111)和5’-CGGAATTCAGGATGGATCGGTCTCTTGCTGCGCCT-3’(具有EcoRI位点的外部反义)(SEQ ID NO112)。
通过下列方法把5’和3’片段连接在一起应用第一轮PCR做模板,并用激酶化的外部有义引物和外部反义引物进行第二轮PCR。1.2kbPCR片段被用EcoR I消化并被克隆进pCMV表达载体的平端-EcoR I位点。人MIG的核酸(SEQ ID NO113)和氨基酸(SEQ ID NO114)序列其后被确定和证实。36.OGR1(GenBank入藏登记号U48405)按如下步骤产生人OGR1的cDNA并把其克隆进pCMV表达载体以基因组cDNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸中每种0.2mM。循环条件是进行30个循环94℃ 1分钟;65℃ 1分钟;72℃ 1分钟20秒。5’PCR引物被如下序列激酶化5’-GGAAGCTTCAGGCCCAAAGATGGGGAACAT-3’(SEQ IDNO115)3’引物包括BamH I位点,其序列为5’-GTGGATCCACCCGCGGAGGACCCAGGCTAG-3’(SEQ IDNO116)1.1kb PCR片段被用BamH I消化并被克隆进pCMV表达载体的EcoRV-BamH I位点。人OGR1的核酸(SEQ ID NO117)和氨基酸(SEQID NO118)序列其后被确定和证实。37. 5-羟色胺5HT2A编码人内源5HT2A受体的cDNA通过RT-PCR而得到应用人脑poly-A+RNA;来自5′未翻译区的5′引物,其具有如下Xho I限制位点5′-GACCTCGAGTCCTTCTACACCTCATC-3′(SEQ ID NO119)来自3′未翻译区具有如下Xba I位点的3′引物5′-TGCTCTAGATTCCAGATAGGTGAAAACTTG-3′(SEQ IDNO120)
用TaqPlusTMPrecision聚合酶(Stratagene)或rTthTM聚合酶(PerkinElmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸每种0.2mM。循环条件是进行30个循环94℃ 1分钟;57℃ 1分钟;72℃ 2分钟。1.5kb的PCR片段被用Xba I消化并被亚克隆进pBluescript的EcoRV-Xba I位点。得到的cDNA克隆被完全测序,发现它编码被公布序列的两种氨基酸的变化。第一个是在N-末端细胞外结构域的一个T25N突变;第二个是一个H452Y突变。因为使用两个不同商业来源(从Stratagene得到TaqPlusTM和从PerkinElmer得到的rTthTM)的Taq多聚酶通过两个独立的PCR反应得到cDNA克隆且cDNA克隆含有相同的两个突变,所以这些突变好象是代表序列多态现象而不是PCR错误。除了这些例外,人5HT2A的核酸(SEQ IDNO121)和氨基酸(SEQ ID NO122)序列其后被确定和证实。38. 5-羟色胺5HT2C编码人内源5HT2C受体的cDNA通过RT-PCR而得到其中应用人脑poly-A+RNA。从5′和3’未翻译区得到5′和3’引物,其序列为5′-GACCTCGAGGTTGCTTAAGACTGAAGC-3′(SEQ ID NO123)5′-ATTTCTAGACATATGTAGCTTGTACCG-3′(SEQ ID NO124)人5HT2C的核酸(SEQ ID NO125)和氨基酸(SEQ ID NO126)序列其后被确定和证实。39.V28(GenBank入藏登记号U20350)按如下步骤产生人V28的cDNA并把其克隆进pCMV表达载体以脑cDNA为模板,用rTth聚合酶(Perkin Elmer)和制造商提供的缓冲系统进行PCR,其中使用每个引物0.25μM、4种核苷酸每种0.2mM。循环条件是进行30个循环94℃ 1分钟;65℃ 1分钟;72℃ 1分钟20秒。5’PCR引物引物包括Hind III位点,其序列为5’-GGTAAGCTTGGCAGTCCACGCCAGGCCTTC-3’(SEQ IDNO127)3’引物包括EcoR I位点,其序列为5’-TCCGAATTCTCTGTAGACACAAGGCTTTGG-3’(SEQ IDNO128)1.1kb PCR片段被用Hind III和EcoR I消化并被克隆进pCMV表达载体的Hind III-EcoR I位点。人V28的核酸(SEQ ID NO129)和氨基酸(SEQ ID NO130)序列其后被确定和证实。实施例2制备非内源的人GPCR1.定点诱变把建立在此处公开的人GPCR脯氨酸标记物方法基础上的诱变使用于前述的内源人GPCR,应用Transformer Site-Directed Mutagenesis试剂盒(Clontech)并按照生产商的说明进行。对于此诱变方法,使用一个突变探针和一个选择标记物探针(除非另外说明,SEQ ID NO132的探针始终是同样的),用于特定序列的这些探针序列列于下表B(括号中的数字是SEQ ID NO)。为方便起见,引入人GPCR的密码子突变也被以标准的形式标出表B








然后,非内源的人GPCR被测序,得到且被证实的核酸和氨基酸序列被列于本发明所附的“序列列表”,如下的表C为其摘要表C


2.利用脯氨酸标记物算法规则的其他突变方法APJ、5-羟色胺5HT2A、5-羟色胺5HT2C和GPR30尽管上述定点诱变方法是特别优选地,但其他方法也可以用来创造如此的突变;那些熟悉本领域的人员知道用所选择的方法突变GPCR以适合技术人员的特别需要。a.APJ制备非内源的人APJ受体通过突变L247K而完成。合成两个含有此突变的寡核苷酸5’-GGCTTAAGAGCATCATCGTGGTGCTGGTG-3’(SEQ ID NO233)5’-GTCACCACCAGCACCACGATGATGCTCTTAAGCC-3’(SEQID NO234)两个寡核苷酸被退火,并被用来取代人的内源APJ的NaeI-BstEII片段,以产生非内源的人APJ。b.5-羟色胺5HT2A包含点突变C322K的cDNA通过利用包括氨基酸322的限制酶位点Sph I构建。含有C322K突变的引物5’-CAAAGAAAGTACTGGGCATCGTCTTCTTCCT-3’(SEQ IDNO235)与从受体的3’未翻译区得到的引物一起应用5’-TGCTCTAGATTCCAGATAGGTGAAAACTTG-3’(SEQ IDNO236)以进行PCR(在上述条件下)。得到的PCR片段然后被用来经T4多聚酶补平的Sph I位点去替代内源5HT2AcDNA的3’末端。c.5-羟色胺5HT2C包含S310K突变的cDNA通过用编码目的突变的合成双链寡核苷酸去替代包括氨基酸310的Sty I限制片段而构建。应用的意义链具有如下序列5’-CTAGGGGCACCATGCAGGCTATCAACAATGAAAGAAAAGCTAAGAAAGTC-3’(SEQ ID NO237)应用的反义链具有如下序列5’-CAAGGACTTTCTTAGCTTTTCTTTCATTGTTGATAGCCTGCATGGTGCCC-3’(SEQ ID NO238)d.GPR30在产生非内源GPR30之前,几个独立的pCR2.1/GPR30分离物被整体测序以识别不发生经PCR产生的突变的克隆。没有突变的克隆用EcoR I消化,并通过用EcoR I消化pCI-Neo并把从pCR2.1/GPR30得到的EcoR I-释放的GPR30片段亚克隆,将内源GPR30 cDNA片段转移进由CMV驱动表达的质粒pCI-Neo(Promega),以产生pCI/GPR30。其后,按照制造商的说明,用Quick-ChangeTM定点诱变试剂盒(Stratagene,#200518)把位于密码子258上的亮氨酸突变为赖氨酸,引物如下5’-CGGCGGCAGAAGGCGAAACGCATGATCCTCGCGGT-3’(SEQ ID NO239)和5’-ACCGCGAGGATCATGCGTTTCGCCTTCTGCCGCCG-3’(SEQID NO240)实施例3(内源和突变的)受体表达尽管在本领域中有多种细胞可用于蛋白质的表达,但最优选应用的是哺乳动物细胞。据预测,其基本原因是实用性,即例如表达GPCR的酵母细胞的应用,有可能把一种非哺乳动物细胞引入到程序中,此细胞可能不(其实,对于酵母来说,是不)包括偶联受体、遗传机制和分泌途径,而这些是经过进化用于哺乳动物系统的。因此,在非哺乳动物细胞中得到的结果,尽管是可能有用的,但并不如从哺乳动物细胞中得到的结果优选。在哺乳动物细胞中,COS-7、293和293T细胞是特别优选的,尽管应用的特定哺乳动物细胞可按技术人员的特别需要而被判定。
除非在此说明,应用如下的程序表达内源和非内源人GPCR。表D列举用于表达GPCR的哺乳动物细胞和数量。
表D

第一天,哺乳动物细胞被接种到板上。第二天,准备两支试管(比例是每板用于一支试管)通过混合20μg DNA(例如pCMV载体、带有内源受体cDNA的pCMV载体、带有非内源受体cDNA的pCMV载体)在1.2ml无血清的DMEM(Irvine Scientific,Irvine,CA)来制备试管A;通过混合120μl lipofectamine(Gibco BRL)在1.2ml无血清DMEM中制备试管B。把试管A和B互倾混合(几次),然后在室温下温育30-45分钟。组合物被称为“转染组合物”。植出的细胞用1XPBS洗涤,然后加入10ml无血清的DMEM。把2.4ml转染组合物加入到细胞中去,然后在37℃/5% CO2下温育4小时。接着通过抽吸移去转染组合物,然后加入25ml的DMEM/10%胎牛血清。接着细胞在37℃/5% CO2温育。72小时后收获细胞并用来进行分析。1.Gi偶联受体与Gs偶联受体共转染对于GPR320来说,已经确定本受体与G蛋白Gi偶联。已知Gi抑制腺苷酸环化酶,这对ATP向cAMP的转化催化是必须的。因此,非内源的、被组成型活化的GPR30将被期望与cAMP水平的下降相关。尽管可通过测定下降的cAMP水平对非内源的、可被组成型活化的GPR30进行测定证实,但它可通过联合应用Gs偶联受体而优选地被测定。例如,Gs偶联的受体将刺激腺苷酸环化酶,并因而与cAMP的升高相关联。本专利申请的受让人已经发现孤儿受体GPR6是一个内源的、被组成型活化的GPCR。GPR6与Gs偶联。因此,当被共转染时,可以很容易证实由一个假想的GPR30突变导致其组成型活化即当与内源的、被组成型活化的GPR6/非内源的、被组成型活化的GPR30(这导致一个相对更低水平的cAMP)相比时,内源的、被组成型活化的GPR6/内源的非组成型活化的GPR30细胞将导致cAMP水平的升高。探查cAMP的测定方法可用来确定一个候选化合物是否是例如针对Gs关联受体的反激活剂(即这样的一个化合物将降低cAMP的水平)或是Gi关联受体(或Go关联受体)的反向激活剂(即此候选化合物将增加cAMP的水平)。在本领域中已知有多种测量cAMP的方法可被应用;一个优选的方法依赖于抗-cAMP抗体的应用。另一个最优选的方法是利用全细胞第二信使报告基因系统试验。基因上的启动子引发特定基因编码的蛋白质表达。环AMP促进基因表达的过程,是它先促进响应cAMP的DNA结合蛋白质或转录因子(CREB)的结合,其后它们在被称为是cAMP效应元件的特殊位点与启动子结合并促进基因的表达。报告基因系统可被构建为含有一个启动子,该启动子在报告基因如β-半乳糖苷酶或荧光素酶前含有多个cAMP效应元件。因而,一个被活化的受体例如GPR6就引起cAMP的积聚,这然后又活化基因和促进报告蛋白质的表达。最优选293细胞与GPR6(或另外一个Gs偶联受体)和GPR30(或另外一个Gi偶联受体)质粒共转染,优选比例为1∶1,最优选比例为1∶4。因为GPR6是一个内源的、被组成型活化的受体并可刺激cAMP的产生,所以GPR6可强烈地激活报告基因和它的表达。报告蛋白质例如β-半乳糖苷酶或荧光素酶可接着被用标准生化测定方法检测到(Chen等,1995)。内源的、组成型活化的GPR6与内源的、非组成型活化的GPR30共转染可导致荧光素酶报告蛋白的增加;相反地,内源的、组成型活化的GPR6与非内源的、组成型活化的GPR30共转染可导致荧光素酶表达的急剧下降。在本领域中已知几个报告基因质粒并可用于测量第二信使试验。据认为,对于熟练的技术人员来说,很容易基于技术人员的特别需要为一个特别的基因表达确定合适的报告质粒。尽管可将多种可得的细胞用于表达,但哺乳动物细胞是最优选的,在这些类型中,293细胞是最优选的。293细胞被报告基因质粒pCRE-Luc/GPR6和非内源的、组成型活化的GPR30转染,其中使用Mammalian TransfectionTM试剂盒(Stratagene,#200285)CaPO4沉淀程序,并依照制造商的指令(关于公开的内源GPR6序列,参见28 Genomics 347(1995))。沉淀含有400ng报告基因、80ng CMV-表达质粒(具有1∶4的GPR6对内源GPR30或非内源GPR30的比例)和20ng CMV-SEAP(编码分泌的碱性磷酸酶的转染控制质粒)。把50%的沉淀分到96孔组织培养板(包含4×104细胞/孔)的3个孔中;其余的50%丢弃。第二天上午换培养液。转染开始48小时后,细胞被裂解,并按每个制造商的指令用LucliteTM试剂盒(Packard,Cat.#6016911)、Trilux 1450 MicrobetaTM液体闪烁和发光计数器(Wallac)检测荧光素酶的活性。用GraphPad Prism 2.0a(GraphPadSoftware Inc.)分析数据。
对于已被确定是偶联Gi的GPR17来说,可基于在本质上是应用另一个偶联Gs的内源受体GPR3,来对前述方法加以改进并使用。(参见23 Genomics 609(1994)和24 Genomics 391(1994))。最优选应用293细胞。这些细胞被植到96孔培养板上,密度为每孔2×104细胞,在第二天按照制造商的指令用Lipofectamine Reagent(BRL)转染。按如下程序为每6孔的转染制备一个DNA/脂的组合物在100μl DMEM中的260ng质粒DNA与在100μl DMEM中的脂轻轻混合(260ng质粒DNA含有200ng 8×CRE-Luc报告质粒(参见下面),50ng pCMV含有内源受体或非内源受体或仅有pCMV,10ng GPRS表达质粒(GPRS在pcDNA3(Invitrogen)中)。8×CRE-Luc报告基因质粒按如下方法制备通过在pβgal-Basic载体(Clontech)中的BglV-Hind III位点克隆大鼠生长激素释放抑制因子启动子(-71/+51),得到载体SRIF-β-gal。用腺病毒模板AdpCF126CCRE8(参见7 Human Gene Therapy 1883(1996))通过PCR得到cAMP效应元件的8个拷贝,把它克隆进SRIF-β-gal载体的Kpn-BglV位点,得到8×CRE-β-gal报告基因载体。8×CRE-Luc报告基因质粒是通过在HindIII-BamH I位点用从pGL3-基本载体(Promega)中得到的荧光素酶基因置换8×CRE-β-gal报告基因载体中的β-半乳糖苷酶基因而得到。在室温温育30分钟之后,DNA/脂组合物用400μlDMEM稀释,把100μl得到的被稀释的组合物加到每个孔中。在细胞培养温育箱中温育4小时之后,在每个孔中加入带有10%FCS的100μlDMEM。第二天上午,被转化的细胞被换成每孔使用带有10%FCS的200μl DMEM。8小时之后,在一次PBS洗涤之后,每孔被换成100μl无酚红的DMEM。下一天用LucLiteTM报告基因测定试剂盒(Packard),按照制造商的指令测定荧光素酶活性,在1450 MicrobetaTM闪烁发光计数器(Wallac)上记数。
图4表明,组成型活化的GPR30抑制GPR6介导的CRE-Luc报告基因在293细胞中的活化。在表达载体pCMV中,测定荧光素酶为大约4.1相对光单位。内源GPR30表达的荧光素酶是在大约8.5相对光单位,而非内源的、组成型活化的GPR30(L258K)分别表达大约3.8和3.1相对光单位的荧光素酶。用内源GPR30与内源GPR6以1∶4的比例共转染,可显著地增加荧光素酶的表达到大约104.1相对光单位。用非内源GPR30(L258K)与内源GPR6以相同的比例共转染,可显著地降低表达,它们分别在大约18.2和29.5相对光单位。对于GPR17,当与GPR3共转染时,也可观察到相似的结果,如在图5中所示的那样。实施例3确定非内源GPCR的组成性活性的测定方法A.细胞膜结合试验1.[35S]GTPγS试验当G蛋白偶联受体在其活性状态,并作为配体结合或者作为组成型活化的结果时,受体与G蛋白偶联并刺激GDP的释放和其后GTP与G蛋白的结合。G蛋白-受体复合物的α亚基作为GTP酶并慢慢地水解GTP为GDP,在此点受体通常发生失活。组成型活化受体继续把GDP转化为GTP。不可水解的GTP类似物[35S]GTPγS,可被用来展示[35S]GTPγS与表达组成型活化受体的膜的增强的结合。应用[35S]GTPγS结合测定组成型活化的优点是(a)它对所有G蛋白偶联受体是普遍适用的;(b)它邻近细胞膜表面,在此处较少可能拣到遇到影响细胞内级联反应的分子。
此试验利用G蛋白偶联受体的刺激[35S]GTPγS与表达相关受体的细胞膜结合的能力。因此本测定可用于直接识别法去筛选针对已知、孤儿和组成型活化G蛋白偶联受体的候选化合物。本测定是普遍的并可用于针对所有G蛋白偶联受体的药物发现。
GTPγS试验在20mM HEPES、1至大约20mM的MgCl2(尽管20mM是优选的,但这个剂量可针对结果的最优化进行调整)、pH7.4、含有在0.3和1.2nM之间的[35S]GTPγS(尽管1.2是优选的,但这个剂量可针对结果的最优进行调整)、12.5到75μg膜蛋白(例如,COS-7细胞表达受体,本剂量可为最优化进行调整,尽管75μg是优选的)和1μM GDP(这个剂量可针对结果的最优化进行改造)的结合缓冲液中温-6和跨膜-7之间(这些分别被称为“细胞外”区1、2和3(EC-1、EC-2和EC-3))。在细胞膜内部即“细胞内”一边,跨膜螺旋也通过氨基酸链进行连接,这些氨基酸链分别在跨膜-1和跨膜-2、跨膜-3和跨膜-4、跨膜-5和跨膜-6之间(这些分别被称为“细胞内”区1、2和3(IC-1、IC-2和IC-3))。受体的“羧基”(“C”)端是在细胞内的区域,受体的“氨基”(“N”)端在细胞外的区域。图1描绘了与G蛋白偶联的受体的一般结构。
一般来说,当内源配体与受体结合时(经常被称为受体的“活化”),细胞内区域的构象发生变化,以容许细胞内区域和细胞内“G-蛋白”进行偶联。尽管存在其他G蛋白,但当前已被识别的G-蛋白是Gq、Gs、Gi和Go。内源配体活化的GPCR与G-蛋白的偶联引发一个信号级联过程(被称为“信号传导”)。在通常情形下,信号传导最终导致细胞活化或细胞抑制。据认为,受体的IC-3环与羧基端都和G蛋白相互作用。本发明的一个重要焦点就涉及GPCR的跨膜-6(TM6)区域和细胞内-3(IC3)区域。
在生理条件下,GPCR存在于细胞膜上,并在“非活化”状态和“活化”状态这两种不同构象之间保持平衡。如在图2中所图示的那样,在非活性状态下的受体不能与细胞内信号传导途径相偶联以产生生物学反应。受体构象向活性状态的转变就使它与传导途径相偶联(通过G-蛋白)并产生生物学反应。
受体可被内源配体或药物等化合物稳定在活性状态。近来的发现提供了除内源配体或药物之外能够促进和稳定受体到活性状态构象的方法,这包括但不限于对受体的氨基酸序列的修饰。这些方法通过模仿与受体结合的内源配体的作用来有效地稳定活性状态的受体。通过如此的配体非依赖性方法形成的稳定被称为“组成型受体活化”。
如上所述,将孤儿受体用于筛选目的是不可能的。这是因为涉及HEPES和10mM MgCl2的缓冲液中均质化悬浮的细胞来制备细胞膜。均质化是在冰上用Brinkman PolytronTM进行大约10秒钟。得到的均质化物在4℃、49,000×g离心15分钟。得到的沉淀物接着在含有20mMpH 7.4的HEPES和0.1mM EDTA缓冲液中悬浮,均质化10秒钟,然后在4℃、49,000×g离心15分钟。得到的沉淀可被贮藏在-4℃备用。在测量的当天,膜沉淀物在室温下被缓慢解冻,在含有20mM pH 7.4的HEPES和10mM MgCl2的缓冲液中重新悬浮(这些数量可被优化,尽管在此列举的数值是优选的),得到最终蛋白质浓度为0.60mg/ml(重新悬浮的膜放置在冰上备用)。
按照制造商的指令制备和维持cAMP标准品和检测缓冲液(含有2μCi示踪物[125I cAMP(100μl)]的11ml检测缓冲液)。为筛选用的试验缓冲液被新鲜制备,它含有20Mm pH 7.4的HEPES、10mM MgCl2、20mM(Sigma)、0.1单位/ml肌酸磷酸激酶(Sigma)、50μM GTP(Sigma)和0.2mM ATP(Sigma);试验缓冲液可在冰上贮存备用。首先加入50μl试验缓冲液、接着加入50μl膜悬浮物到NEN Flash Plate,以开始试验。得到的测定组合物在室温下温育60分钟,然后加入100μl检测缓冲液。培养板接着再温育2-4小时,然后用Wallac MicroBeta液闪计数器记数。cAMP/孔的数值从标准cAMP曲线外推,该曲线包括在每个测定板之内。应用前述的分析MIG的测定方法。B.基于报告基因的测定1.CREB报告基因测定(Gs偶联受体)检测Gs刺激的方法依赖于转化因子CREB的已知性质,它是以cAMP依赖的方式被活化的。应用PathDetect CREB trans-ReportingSystem(Stratagene,Catalogue #219010)来检测在293和293T细胞中Gs偶联的活性。用上述系统的质粒成分和编码内源的或突变的受体的指明的表达质粒转染细胞,其使用哺乳动物细胞转染试剂盒(Stratagene,Catalogue #200285)并按照制造商的指令。简短而言,400ng pFR-Luc(荧光素酶报告基因质粒含有Gal4识别序列)、40ng pFA2-CREB(Gal4-CREB融合蛋白含有Gal4 DNA结合域)、80ng CMV-受体表达质粒(包括受体)和20ng CMV-SEAP(分泌的碱性磷酸酶表达质粒;碱性磷酸酶活性在转染细胞的培养基中测量,以控制在样品间转染效率的变化)在磷酸钙沉淀中按照试剂盒的指令进行混合。把沉淀的一半等量地分布在96孔培养板的3个孔中,保持细胞过夜,第二天上午置换新鲜培养基。转染后48小时,如上述GPR30系统所说的方法处理细胞并测定荧光素酶活性。此测定用于GHSR。2.AP1报告基因测定(Gq偶联的受体)测定Gq刺激依赖的方法依赖于Gq依赖的磷脂酶C已知的特性,即它可引起在其启动子含有AP1元件的基因活化。按照上述CREB报告基因测定所说的程序,使用Pathdetect AP-1 cis-Reporting System(Stratagene,Catalogue #219073),其中只是将磷酸钙沉淀的组分改为410ng pAPl-Luc、80ng受体表达质粒和20ng CMV-SEAP。本测定用于ETBR-LP2。C.细胞内IP3累积的测定在第一天,含有5-羟色胺受体(内源的和突变的)的细胞被接种于24孔培养板上,一般是1×105细胞/孔。在第二天转染细胞,首先混合在50μl/孔无血清DMEM中的0.25μg DNA和在50μl/孔无血清DMEM中的2μl lipofectamine。轻轻地混合溶液并在室温下温育15-30分钟。用0.5ml PBS洗涤细胞,把400μl无血清培养基与转染培养基混合并加到细胞中。然后在37℃/5% CO2下温育细胞3-4小时,再移去转染培养基,替换为1ml/孔常规培养基。在第三天,用3H-肌醇标记细胞。简短地说,移去培养基,细胞用0.5ml PBS洗涤,接着加入0.5ml/孔无肌醇/无血清培养基(GIBCO BRL)和0.25μ Ci/孔3H-肌醇,在37℃/5% CO2下温育细胞16-18小时。在第四天,用0.5ml PBS洗涤细胞,加入0.45ml试验培养基,其中含有无肌醇/无血清培养基10μM巴吉林10mM氯化锂或0.4ml试验培养基和50μl 10×ketaserin(ket)以得到10μM的终浓度。然后在37℃温育细胞30分钟。用0.5ml PBS洗涤细胞,加入200μl/孔新鲜的/冰冷的终止液(1M KOH、18mM硼酸钠、3.8mM EDTA)。溶液在冰上放置5-10分钟或直到细胞被溶解,然后用200μl新鲜的/冰冷的中和液(7.5%HCl)中和。然后把裂解物转移到1.5ml离心管中,加入1ml/管氯仿/甲醇(1∶2)。然后使溶液涡旋15秒钟,把上层上样至Biorad AGl-X8阴离子交换树脂(100-200目)。首先,树脂以1∶1.25 W/V的比例用水洗涤,向柱中加载0.9ml的上层溶液。用10ml 5mM肌醇和10ml 5mM的硼酸钠/60mM甲酸钠洗涤柱子。肌醇三磷酸酯被洗提入液闪管中,其中含有10ml液闪鸡尾,它有2ml 0.1M甲酸/1M甲酸铵。通过用10ml 0.1M甲酸/3M甲酸铵洗涤和用ddH20洗涤两次来再生交换柱,柱子贮存在4℃的水中。
图7图示了从包括C322K突变的人5-HT2A受体产生IP3。尽管这些结果表明脯氨酸突变算法规则可组成型活化本受体,但为了应用这样一个受体而筛选识别可能的治疗物,优选更大的差别。然而,因为活化的受体可被用来理解和阐释组成型活化的角色和用来识别可被进一步检查的化合物,我们相信这个差别本身在区分人5-HT2A受体的内源和非内源形式时是有用的。D.结果概要检测到的GPCR结果列于表E,其中百分数的增加表示,观察到的非内源GPCR结果与内源GPCR相比较时的百分数差异;这些结果后面的括号里面标明的是应用的检测方法。进一步,应用的测定系统在括号中列出(并且,在应用不同的宿主细胞时,两者都列出)。这些结果表明,可应用多种方法确定人GPCR的非内源形式的组成型活化。相信本领域的熟练技术人员在基于前述并参考本领域中的信息后可具有选择和/或最佳化适于研究者的特别需要的特别测定方法的能力。
表E


实施例6内源孤儿GPCR的组织分布应用商业可得的人组织斑点印迹系统,探查内源孤儿GPCR以确定这些受体被定位的区域。除非在下面指明,完整受体cDNA(放射标记的)被用作探针按照制造商的指令,应用Prime-It IITM随机引物标记试剂盒(Stratagene,#300385),用完全受体cDNA(从载体中切下)产生放射标记的探针。把人RNA Master BlotTM(Clontech,#7770-1)与GPCR放射标记的探针杂交,并按照制造商的指令在严格的条件下洗涤。印迹曝光于Kodak BioMax放射自显影底片,在-80℃过夜。
代表性的斑点印迹结果在表8中以GPR1(8A)、GPR30(8B)和APJ(8C)列出,针对所有受体的结果摘要列于表F。
表F

基于前述的信息,注意到可评定人GPCR在患病组织中的分布;然后,在“正常”和患病组织中的对比性评定可被用来确定在疾病状态下一个特别受体的过度表达和不足表达的可能性。当希望利用人GPCR的非内源形式进行筛选以直接识别可能与治疗相关的候选化合物时,注意到反激活剂可用于治疗由特定人GPCR过度表达引起的疾病和紊乱,而激活剂或部分激活剂对于治疗特定人GPCR不足表达引起的疾病和紊乱是有用的。
正如被期望的,利用本领域技术人员周知的技术(例如,原位杂交),更详细的受体细胞定位可被用于识别特殊细胞,其中感兴趣的受体在特殊细胞所在的组织中表达。
预期本发明文件提到的每一个专利、申请和印刷出版物,都以全文作为参考引入。
正如本领域的技术人员将认知的那样,可以对本发明的优选实施方案做多种变化和修饰而不背离本发明的精神。预期所有这些变化都落在本发明的范围之内。
虽然本领域的普通技术人员可以得到许多不同的载体为内源和非内源GPCR的目的使用,但最好是用pCMV载体。按照国际承认用于专利程序的微生物保存布达佩斯条约,该载体于1998年10月13日保存在美国典型培养物保藏中心(American Type Culture Collection)(ATCC)(10801 University Blvd,Manassas,VA20110-2209 USA7)。该载体由ATCC在1998年__月__日进行了检验并在1998年__年__日测定了其存活性。ATCC为pCMV给出了下列保藏号---。
序列表(1)一般资料(i)申请人多米尼克·P·比汉;德里克·T·查默斯;廖王蓁(ii)发明名称非内源的被组成型活化的人G蛋白偶联的受体(iii)序列数280(iv)通讯地址(A)收信人阿瑞那制药公司(B)Nancy Ridge大道6166号(C)城市圣地亚哥(D)州加利福尼亚州(E)国家美国(F)邮编92121(v)计算机可读形式(A)介质类型软盘(B)计算机IBM个人兼容机(C)操作系统PC-DOS/MS-DOS(D)软件PatentIn Release#1.0,#1.30版(vi)本申请资料(A)申请号(B)申请日(C)分类号(viii)代理人信息(A)姓名Burgoon,Richard P.(B)登记号34787(ix)电讯信息(A)电话(858)453-7200(B)电传(858)453-7210(2)SEQ ID NO1的资料(i)序列特征(A)长度1068个碱基对(B)类型核酸(C)链型单链
(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO1的序列描述ATGGAAGATT TGGAGGAAAC ATTATTTGAA GAATTTGAAA ACTATTCCTA TGACCTAGAC 60TATTACTCTC TGGAGTCTGA TTTGGAGGAG AAAGTCCAGC TGGGAGTTGT TCACTGGGTC 120TCCCTGGTGT TATATTGTTT GGCTTTTGTT CTGGGAATTC CAGGAAATGC CATCGTCATT 180TGGTTCACGG GGCTCAAGTG GAAGAAGACA GTCACCACTC TGTGGTTCCT CAATCTAGCC 240ATTGCGGATT TCATTTTTCT TCTCTTTCTG CCCCTGTACA TCTCCTATGT GGCCATGAAT 300TTCCACTGGC CCTTTGGCAT CTGGCTGTGC AAAGCCAATT CCTTCACTGC CCAGTTGAAC 360ATGTTTGCCA GTGTTTTTTT CCTGACAGTG ATCAGCCTGG ACCACTATAT CCACTTGATC 420CATCCTGTCT TATCTCATCG GCATCGAACC CTCAAGAACT CTCTGATTGT CATTATATTC 480ATCTGGCTTT TGGCTTCTCT AATTGGCGGT CCTGCCCTGT ACTTCCGGGA CACTGTGGAG 540TTCAATAATC ATACTCTTTG CTATAACAAT TTTCAGAAGC ATGATCCTGA CCTCACTTTG 600ATCAGGCACC ATGTTCTGAC TTGGGTGAAA TTTATCATTG GCTATCTCTT CCCTTTGCTA 660ACAATGAGTA TTTGCTACTT GTGTCTCATC TTCAAGGTGA AGAAGCGAAC AGTCCTGATC 720TCCAGTAGGC ATTTCTGGAC AATTCTGGTT GTGGTTGTGG CCTTTGTGGT TTGCTGGACT 780CCTTATCACC TGTTTAGCAT TTGGGAGCTC ACCATTCACC ACAATAGCTA TTCCCACCAT 840GTGATGCAGG CTGGAATCCC CCTCTCCACT GGTTTGGCAT TCCTCAATAG TTGCTTGAAC 900CCCATCCTTT ATGTCCTAAT TAGTAAGAAG TTCCAAGCTC GCTTCCGGTC CTCAGTTGCT 960GAGATACTCA AGTACACACT GTGGGAAGTC AGCTGTTCTG GCACAGTGAG TGAACAGCTC 1020AGGAACTCAG AAACCAAGAA TCTGTGTCTC CTGGAAACAG CTCAATAA 1068(3)SEQ ID NO2的资料(i)序列特征(A)长度355个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO2的序列描述Met Glu Asp Leu Glu Glu Thr Leu Phe Glu Glu Phe Glu Asn Tyr Ser1 5 10 15Tyr Asp Leu Asp Tyr Tyr Ser Leu Glu Ser Asp Leu Glu Glu Lys Val20 25 30Gln Leu Gly Val Val His Trp Val Ser Leu Val Leu Tyr Cys Leu Ala35 40 45Phe Val Leu Gly Ile Pro Gly Asn Ala Ile Val Ile Trp Phe Thr Gly50 55 60Leu Lys Trp Lys Lys Thr Val Thr Thr Leu Trp Phe Leu Asn Leu Ala65 70 75 80Ile Ala Asp Phe Ile Phe Leu Leu Phe Leu Pro Leu Tyr Ile Ser Tyr
85 90 95Val Ala Met Asn Phe His Trp Pro Phe Gly Ile Trp Leu Cys Lys Ala100 105 110Asn Ser Phe Thr Ala Gln Leu Asn Met Phe Ala Ser Val Phe Phe Leu115 120 125Thr Val Ile Ser Leu Asp His Tyr Ile His Leu Ile His Pro Val Leu130 135 140Ser His Arg His Arg Thr Leu Lys Asn Ser Leu Ile Val Ile Ile Phe145 150 155 160Ile Trp Leu Leu Ala Ser Leu Ile Gly Gly Pro Ala Leu Tyr Phe Arg165 170 175Asp Thr Val Glu Phe Asn Asn His Thr Leu Cys Tyr Asn Asn Phe Gln180 185 190Lys His Asp Pro Asp Leu Thr Leu Ile Arg His His Val Leu Thr Trp195 200 205Val Lys Phe Ile Ile Gly Tyr Leu Phe Pro Leu Leu Thr Met Ser Ile210 215 220Cys Tyr Leu Cys Leu Ile Phe Lys Val Lys Lys Arg Thr Val Leu Ile225 230 235 240Ser Ser Arg His Phe Trp Thr Ile Leu Val Val Val Val Ala Phe Val245 250 255Val Cys Trp Thr Pro Tyr His Leu Phe Ser Ile Trp Glu Leu Thr Ile260 265 270His His Asn Ser Tyr Ser His His Val Met Gln Ala Gly Ile Pro Leu275 280 285Ser Thr Gly Leu Ala Phe Leu Asn Ser Cys Leu Asn Pro Ile Leu Tyr290 295 300Val Leu Ile Ser Lys Lys Phe Gln Ala Arg Phe Arg Ser Ser Val Ala305 310 315 320Glu Ile Leu Lys Tyr Thr Leu Trp Glu Val Ser Cys Ser Gly Thr Val325 330 335Ser Glu Gln Leu Arg Asn Ser Glu Thr Lys Asn Leu Cys Leu Leu Glu340 345 350Thr Ala Gln
355(4)SEQ ID NO3的资料(i)序列特征(A)长度1089个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO3的序列描述ATGGGCAACC ACACGTGGGA GGGCTGCCAC GTGGACTCGC GCGTGGACCA CCTCTTTCCG 60CCATCCCTCT ACATCTTTGT CATCGGCGTG GGGCTGCCCA CCAACTGCCT GGCTCTGTGG 120GCGGCCTACC GCCAGGTGCA ACAGCGCAAC GAGCTGGGCG TCTACCTGAT GAACCTCAGC 180ATCGCCGACC TGCTGTACAT CTGCACGCTG CCGCTGTGGG TGGACTACTT CCTGCACCAC 240GACAACTGGA TCCACGGCCC CGGGTCCTGC AAGCTCTTTG GGTTCATCTT CTACACCAAT 300ATCTACATCA GCATCGCCTT CCTGTGCTGC ATCTCGGTGG ACCGCTACCT GGCTGTGGCC 360CACCCACTCC GCTTCGCCCG CCTGCGCCGC GTCAAGACCG CCGTGGCCGT GAGCTCCGTG 420GTCTGGGCCA CGGAGCTGGG CGCCAACTCG GCGCCCCTGT TCCATGACGA GCTCTTCCGA 480GACCGCTACA ACCACACCTT CTGCTTTGAG AAGTTCCCCA TGGAAGGCTG GGTGGCCTGG 540ATGAACCTCT ATCGGGTGTT CGTGGGCTTC CTCTTCCCGT GGGCGCTCAT GCTGCTGTCG 600TACCGGGGCA TCCTGCGGGC CGTGCGGGGC AGCGTGTCCA CCGAGCGCCA GGAGAAGGCC 660AAGATCAAGC GGCTGGCCCT CAGCCTCATC GCCATCGTGC TGGTCTGCTT TGCGCCCTAT 720CACGTGCTCT TGCTGTCCCG CAGCGCCATC TACCTGGGCC GCCCCTGGGA CTGCGGCTTC 780GAGGAGCGCG TCTTTTCTGC ATACCACAGC TCACTGGCTT TCACCAGCCT CAACTGTGTG 840GCGGACCCCA TCCTCTACTG CCTGGTCAAC GAGGGCGCCC GCAGCGATGT GGCCAAGGCC 900CTGCACAACC TGCTCCGCTT TCTGGCCAGC GACAAGCCCC AGGAGATGGC CAATGCCTCG 960CTCACCCTGG AGACCCCACT CACCTCCAAG AGGAACAGCA CAGCCAAAGC CATGACTGGC 1020AGCTGGGCGG CCACTCCGCC TTCCCAGGGG GACCAGGTGC AGCTGAAGAT GCTGCCGCCA 1080GCACAATGA 1089(5)SEQ ID NO4的资料(i)序列特征(A)长度362个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO4的序列描述Met Gly Asn His Thr Trp Glu Gly Cys His Val Asp Ser Arg Val Asp1 5 10 15His Leu Phe Pro Pro Ser Leu Tyr Ile Phe Val Ile Gly Val Gly Leu20 25 30Pro Thr Asn Cys Leu Ala Leu Trp Ala Ala Tyr Arg Gln Val Gln Gln35 40 45Arg Asn Glu Leu Gly Val Tyr Leu Met Asn Leu Ser Ile Ala Asp Leu50 55 60Leu Tyr Ile Cys Thr Leu Pro Leu Trp Val Asp Tyr Phe Leu His His65 70 75 80Asp Asn Trp Ile His Gly Pro Gly Ser Cys Lys Leu Phe Gly Phe Ile85 90 95Phe Tyr Thr Asn Ile Tyr Ile Ser Ile Ala Phe Leu Cys Cys Ile Ser100 105 110Val Asp Arg Tyr Leu Ala Val Ala His Pro Leu Arg Phe Ala Arg Leu115 120 125Arg Arg Val Lys Thr Ala Val Ala Val Ser Ser Val Val Trp Ala Thr130 135 140Glu Leu Gly Ala Asn Ser Ala Pro Leu Phe His Asp Glu Leu Phe Arg145 150 155 160Asp Arg Tyr Asn His Thr Phe Cys Phe Glu Lys Phe Pro Met Glu Gly165 170 175Trp Val Ala Trp Met Asn Leu Tyr Arg Val Phe Val Gly Phe Leu Phe180 185 190Pro Trp Ala Leu Met Leu Leu Ser Tyr Arg Gly Ile Leu Arg Ala Val195 200 205Arg Gly Ser Val Ser Thr Glu Arg Gln Glu Lys Ala Lys Ile Lys Arg210 215 220Leu Ala Leu Ser Leu Ile Ala Ile Val Leu Val Cys Phe Ala Pro Tyr225 230 235 240His Val Leu Leu Leu Ser Arg Ser Ala Ile Tyr Leu Gly Arg Pro Trp245 250 255Asp Cys Gly Phe Glu Glu Arg Val Phe Ser Ala Tyr His Ser Ser Leu260 265 270Ala Phe Thr Ser Leu Asn Cys Val Ala Asp Pro Ile Leu Tyr Cys Leu275 280 285Val Asn Glu Gly Ala Arg Ser Asp Val Ala Lys Ala Leu His Asn Leu290 295 300Leu Arg Phe Leu Ala Ser Asp Lys Pro Gln Glu Met Ala Asn Ala Ser305 310 315 320Leu Thr Leu Glu Thr Pro Leu Thr Ser Lys Arg Asn Ser Thr Ala Lys325 330 335Ala Met Thr Gly Ser Trp Ala Ala Thr Pro Pro Ser Gln Gly Asp Gln340 345 350Val Gln Leu Lys Met Leu Pro Pro Ala Gln355 360(6)SEQ ID NO5的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO5的序列描述TATGAATTCA GATGCTCTAA ACGTCCCTGC 30(7)SEQ ID NO6的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO6的序列描述TCCGGATCCA CCTGCACCTG CGCCTGCACC 30(8)SEQ ID NO7的资料(i)序列特征(A)长度1002个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO7的序列描述ATGGAGTCCT CAGGCAACCC AGAGAGCACC ACCTTTTTTT ACTATGACCT TCAGAGCCAG 60CCGTGTGAGA ACCAGGCCTG GGTCTTTGCT ACCCTCGCCA CCACTGTCCT GTACTGCCTG 120GTGTTTCTCC TCAGCCTAGT GGGCAACAGC CTGGTCCTGT GGGTCCTGGT GAAGTATGAG 180AGCCTGGAGT CCCTCACCAA CATCTTCATC CTCAACCTGT GCCTCTCAGA CCTGGTGTTC 240GCCTGCTTGT TGCCTGTGTG GATCTCCCCA TACCACTGGG GCTGGGTGCT GGGAGACTTC 300CTCTGCAAAC TCCTCAATAT GATCTTCTCC ATCAGCCTCT ACAGCAGCAT CTTCTTCCTG 360ACCATCATGA CCATCCACCG CTACCTGTCG GTAGTGAGCC CCCTCTCCAC CCTGCGCGTC 420CCCACCCTCC GCTGCCGGGT GCTGGTGACC ATGGCTGTGT GGGTAGCCAG CATCCTGTCC 480TCCATCCTCG ACACCATCTT CCACAAGGTG CTTTCTTCGG GCTGTGATTA TTCCGAACTC 540ACGTGGTACC TCACCTCCGT CTACCAGCAC AACCTCTTCT TCCTGCTGTC CCTGGGGATT 600ATCCTGTTCT GCTACGTGGA GATCCTCAGG ACCCTGTTCC GCTCACGCTC CAAGCGGCGC 660CACCGCACGG TCAAGCTCAT CTTCGCCATC GTGGTGGCCT ACTTCCTCAG CTGGGGTCCC 720TACAACTTCA CCCTGTTTCT GCAGACGCTG TTTCGGACCC AGATCATCCG GAGCTGCGAG 780GCCAAACAGC AGCTAGAATA CGCCCTGCTC ATCTGCCGCA ACCTCGCCTT CTCCCACTGC 840TGCTTTAACC CGGTGCTCTA TGTCTTCGTG GGGGTCAAGT TCCGCACACA CCTGAAACAT 900GTTCTCCGGC AGTTCTGGTT CTGCCGGCTG CAGGCACCCA GCCCAGCCTC GATCCCCCAC 960TCCCCTGGTG CCTTCGCCTA TGAGGGCGCC TCCTTCTACT GA1002(9)SEQ ID NO8的资料(i)序列特征(A)长度333个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO8的序列描述Met Glu Ser Ser Gly Asn Pro Glu Ser Thr Thr Phe Phe Tyr Tyr Asp1 5 10 15Leu Gln Ser Gln Pro Cys Glu Asn Gln Ala Trp Val Phe Ala Thr Leu20 25 30Ala Thr Thr Val Leu Tyr Cys Leu Val Phe Leu Leu Ser Leu Val Gly35 40 45Asn Ser Leu Val Leu Trp Val Leu Val Lys Tyr Glu Ser Leu Glu Ser50 55 60Leu Thr Asn Ile Phe Ile Leu Ash Leu Cys Leu Ser Asp Leu Val Phe65 70 75 80Ala Cys Leu Leu Pro Val Trp Ile Ser Pro Tyr His Trp Gly Trp Val85 90 95Leu Gly Asp Phe Leu Cys Lys Leu Leu Asn Met Ile Phe Ser Ile Ser100 105 110Leu Tyr Ser Ser Ile Phe Phe Leu Thr Ile Met Thr Ile His Arg Tyr115 120 125Leu Ser Val Val Ser Pro Leu Ser Thr Leu Arg Val Pro Thr Leu Arg130 135 140Cys Arg Val Leu Val Thr Met Ala Val Trp Val Ala Ser Ile Leu Ser145 150 155 160Ser Ile Leu Asp Thr Ile Phe His Lys Val Leu Ser Ser Gly Cys Asp165 170 175Tyr Ser Glu Leu Thr Trp Tyr Leu Thr Ser Val Tyr Gln His Asn Leu180 185 190Phe Phe Leu Leu Ser Leu Gly Ile Ile Leu Phe Cys Tyr Val Glu Ile195 200 205Leu Arg Thr Leu Phe Arg Ser Arg Ser Lys Arg Arg His Arg Thr Val210 215 220Lys Leu Ile Phe Ala Ile Val Val Ala Tyr Phe Leu Ser Trp Gly Pro225 230 235 240Tyr Asn Phe Thr Leu Phe Leu Gln Thr Leu Phe Arg Thr Gln Ile Ile245 250 255Arg Ser Cys Glu Ala Lys Gln Gln Leu Glu Tyr Ala Leu Leu Ile Cys260 265 270Arg Asn Leu Ala Phe Ser His Cys Cys Phe Asn Pro Val Leu Tyr Val275 280 285Phe Val Gly Val Lys Phe Arg Thr His Leu Lys His Val Leu Arg Gln290 295 300Phe Trp Phe Cys Arg Leu Gln Ala Pro Ser Pro Ala Ser Ile Pro His305 310 315 320Ser Pro Gly Ala Phe Ala Tyr Glu Gly Ala Ser Phe Tyr325 330(10)SEQ ID NO9的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO9的序列描述GCAAGCTTGG GGGACGCCAG GTCGCCGGCT 30(11)SEQ ID NO10的资料(i)序列特征(A)长度31个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO10的序列描述gcggatccgg acgctggggg agtcaggctg c 31(12) SEQ ID NO11的资料(i)序列特征(A)长度987个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO11的序列描述ATGGACAACG CCTCGTTCTC GGAGCCCTGG CCCGCCAACG CATCGGGCCC GGACCCGGCG 60CTGAGCTGCT CCAACGCGTC GACTCTGGCG CCGCTGCCGG CGCCGCTGGC GGTGGCTGTA 120CCAGTTGTCT ACGCGGTGAT CTGCGCCGTG GGTCTGGCGG GCAACTCCGC CGTGCTGTAC 180GTGTTGCTGC GGGCGCCCCG CATGAAGACC GTCACCAACC TGTTCATCCT CAACCTGGCC 240ATCGCCGACG AGCTCTTCAC GCTGGTGCTG CCCATCAACA TCGCCGACTT CCTGCTGCGG 300CAGTGGCCCT TCGGGGAGCT CATGTGCAAG CTCATCGTGG CTATCGACCA GTACAACACC 360TTCTCCAGCC TCTACTTCCT CACCGTCATG AGCGCCGACC GCTACCTGGT GGTGTTGGCC 420ACTGCGGAGT CGCGCCGGGT GGCCGGCCGC ACCTACAGCG CCGCGCGCGC GGTGAGCCTG 480GCCGTGTGGG GGATCGTCAC ACTCGTCGTG CTGCCCTTCG CAGTCTTCGC CCGGCTAGAC 540GACGAGCAGG GCCGGCGCCA GTGCGTGCTA GTCTTTCCGC AGCCCGAGGC CTTCTGGTGG 600CGCGCGAGCC GCCTCTACAC GCTCGTGCTG GGCTTCGCCA TCCCCGTGTC CACCATCTGT 660GTCCTCTATA CCACCCTGCT GTGCCGGCTG CATGCCATGC GGCTGGACAG CCACGCCAAG 720GCCCTGGAGC GCGCCAAGAA GCGGGTGACC TTCCTGGTGG TGGCAATCCT GGCGGTGTGC 780CTCCTCTGCT GGACGCCCTA CCACCTGAGC ACCGTGGTGG CGCTCACCAC CGACCTCCCG 840CAGACGCCGC TGGTCATCGC TATCTCCTAC TTCATCACCA GCCTGACGTA CGCCAACAGC 900TGCCTCAACC CCTTCCTCTA CGCCTTCCTG GACGCCAGCT TCCGCAGGAA CCTCCGCCAG 960CTGATAACTT GCCGCGCGGC AGCCTGA 987(13)SEQ ID NO12的资料(i)序列特征
(A)长度328个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO12的序列描述Met Asp Asn Ala Ser Phe Ser Glu Pro Trp Pro Ala Asn Ala Ser Gly1 5 10 15Pro Asp Pro Ala Leu Ser Cys Ser Asn Ala Ser Thr Leu Ala Pro Leu20 25 30Pro Ala Pro Leu Ala Val Ala Val Pro Val Val Tyr Ala Val Ile Cys35 40 45Ala Val Gly Leu Ala Gly Asn Ser Ala Val Leu Tyr Val Leu Leu Arg50 55 60Ala Pro Arg Met Lys Thr Val Thr Asn Leu Phe Ile Leu Asn Leu Ala65 70 75 80Ile Ala Asp Glu Leu Phe Thr Leu Val Leu Pro Ile Asn Ile Ala Asp85 90 95Phe Leu Leu Arg Gln Trp Pro Phe Gly Glu Leu Met Cys Lys Leu Ile100 105 110Val Ala Ile Asp Gln Tyr Asn Thr Phe Ser Ser Leu Tyr Phe Leu Thr115 120 125Val Met Ser Ala Asp Arg Tyr Leu Val Val Leu Ala Thr Ala Glu Ser130 135 140Arg Arg Val Ala Gly Arg Thr Tyr Ser Ala Ala Arg Ala Val Ser Leu145 150 155 160Ala Val Trp Gly Ile Val Thr Leu Val Val Leu Pro Phe Ala Val Phe165 170 175Ala Arg Leu Asp Asp Glu Gln Gly Arg Arg Gln Cys Val Leu Val Phe180 185 190Pro Gln Pro Glu Ala Phe Trp Trp Arg Ala Ser Arg Leu Tyr Thr Leu195 200 205Val Leu Gly Phe Ala Ile Pro Val Ser Thr Ile Cys Val Leu Tyr Thr210 215 220Thr Leu Leu Cys Arg Leu His Ala Met Arg Leu Asp Ser His Ala Lys225 230 235 240Ala Leu Glu Arg Ala Lys Lys Arg Val Thr Phe Leu Val Val Ala Ile245 250 255Leu Ala Val Cys Leu Leu Cys Trp Thr Pro Tyr His Leu Ser Thr Val260 265 270Val Ala Leu Thr Thr Asp Leu Pro Gln Thr Pro Leu Val Ile Ala Ile275 280 285Ser Tyr Phe Ile Thr Ser Leu Thr Tyr Ala Asn Ser Cys Leu Asn Pro290295 300Phe Leu Tyr Ala Phe Leu Asp Ala Ser Phe Arg Arg Asn Leu Arg Gln305 310 315 320Leu Ile Thr Cys Arg Ala Ala Ala325(14)SEQ ID NO13的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO13的序列描述CGGAATTCGT CAACGGTCCC AGCTACAATG 30(15)SEQ ID NO14的资料(i)序列特征(A)长度31个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO14的序列描述ATGGATCCCA GGCCCTTCAG CACCGCAATA T31(16)SEQ ID NO15的资料(i)序列特征
(A)长度1002个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO15的序列描述ATGCAGGCCG CTGGGCACCC AGAGCCCCTT GACAGCAGGG GCTCCTTCTC CCTCCCCACG 60ATGGGTGCCA ACGTCTCTCA GGACAATGGC ACTGGCCACA ATGCCACCTT CTCCGAGCCA 120CTGCCGTTCC TCTATGTGCT CCTGCCCGCC GTGTACTCCG GGATCTGTGC TGTGGGGCTG 180ACTGGCAACA CGGCCGTCAT CCTTGTAATC CTAAGGGCGC CCAAGATGAA GACGGTGACC 240AACGTGTTCA TCCTGAACCT GGCCGTCGCC GACGGGCTCT TCACGCTGGT ACTGCCCGTC 300AACATCGCGG AGCACCTGCT GCAGTACTGG CCCTTCGGGG AGCTGCTCTG CAAGCTGGTG 360CTGGCCGTCG ACCACTACAA CATCTTCTCC AGCATCTACT TCCTAGCCGT GATGAGCGTG 420GACCGATACC TGGTGGTGCT GGCCACCGTG AGGTCCCGCC ACATGCCCTG GCGCACCTAC 480CGGGGGGCGA AGGTCGCCAG CCTGTGTGTC TGGCTGGGCG TCACGGTCCT GGTTCTGCCC 540TTCTTCTCTT TCGCTGGCGT CTACAGCAAC GAGCTGCAGG TCCCAAGCTG TGGGCTGAGC 600TTCCCGTGGC CCGAGCGGGT CTGGTTCAAG GCCAGCCGTG TCTACACTTT GGTCCTGGGC 660TTCGTGCTGC CCGTGTGCAC CATCTGTGTG CTCTACACAG ACCTCCTGCG CAGGCTGCGG 720GCCGTGCGGC TCCGCTCTGG AGCCAAGGCT CTAGGCAAGG CCAGGCGGAA GGTGACCGTC 780CTGGTCCTCG TCGTGCTGGC CGTGTGCCTC CTCTGCTGGA CGCCCTTCCA CCTGGCCTCT 840GTCGTGGCCC TGACCACGGA CCTGCCCCAG ACCCCACTGG TCATCAGTAT GTCCTACGTC 900ATCACCAGCC TCACGTACGC CAACTCGTGC CTGAACCCCT TCCTCTACGC CTTTCTAGAT 960GACAACTTCC GGAAGAACTT CCGCAGCATA TTGCGGTGCT GA1002(17)SEQ ID NO16的资料(i)序列特征(A)长度333个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO16的序列描述Met Gln Ala Ala Gly His Pro Glu Pro Leu Asp Ser Arg Gly Ser Phe1 5 10 15Ser Leu Pro Thr Met Gly Ala Asn Val Ser Gln Asp Asn Gly Thr Gly20 25 30His Asn Ala Thr Phe Ser Glu Pro Leu Pro Phe Leu Tyr Val Leu Leu35 40 45Pro Ala Val Tyr Ser Gly Ile Cys Ala Val Gly Leu Thr Gly Asn Thr50 55 60Ala Val Ile Leu Val Ile Leu Arg Ala Pro Lys Met Lys Thr Val Thr65 70 75 80Asn Val Phe Ile Leu Asn Leu Ala Mal Ala Asp Gly Leu Phe Thr Leu85 90 95Val Leu Pro Val Asn Ile Ala Glu His Leu Leu Gln Tyr Trp Pro Phe100 105 110Gly Glu Leu Leu Cys Lys Leu Val Leu Ala Val Asp His Tyr Asn Ile115 120 125Phe Ser Ser Ile Tyr Phe Leu Ala Val Met Ser Val Asp Arg Tyr Leu130 135 140Val Val Leu Ala Thr Val Arg Ser Arg His Met Pro Trp Arg Thr Tyr145 150 155 160Arg Gly Ala Lys Val Ala Ser Leu Cys Val Trp Leu Gly Val Thr Val165 170 175Leu Val Leu Pro Phe Phe Ser Phe Ala Gly Val Tyr Ser Asn Glu Leu180 185 190Gln Val Pro Ser Cys Gly Leu Ser Phe Pro Trp Pro Glu Arg Val Trp195 200 205Phe Lys Ala Ser Arg Val Tyr Thr Leu Val Leu Gly Phe Val Leu Pro210 215 220Val Cys Thr Ile Cys Val Leu Tyr Thr Asp Leu Leu Arg Arg Leu Arg225 230 235 240Ala Val Arg Leu Arg Ser Gly Ala Lys Ala Leu Gly Lys Ala Arg Arg245 250 255Lys Val Thr Val Leu Val Leu Val Val Leu Ala Mal Cys Leu Leu Cys260 265 270Trp Thr Pro Phe His Leu Ala Ser Val Mal Ala Leu Thr Thr Asp Leu275 280 285Pro Gln Thr Pro Leu Val Ile Ser Met Ser Tyr Val Ile Thr Ser Leu290 295 300Thr Tyr Ala Asn Ser Cys Leu Asn Pro Phe Leu Tyr Ala Phe Leu Asp305 310 315 320Asp Asn Phe Arg Lys Asn Phe Arg Ser Ile Leu Arg Cys325 330(18)SEQ ID NO17的资料(i)序列特征(A)长度48个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO17的序列描述ACGAATTCAG CCATGGTCCT TGAGGTGAGT GACCACCAAG TGCTAAAT 48(19)SEQ ID NO18的资料(i)序列特征(A)长度27个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO18的序列描述GAGGATCCTG GAATGCGGGG AAGTCAG27(20)SEQ ID NO19的资料(i)序列特征(A)长度1107个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO19的序列描述ATGGTCCTTG AGGTGAGTGA CCACCAAGTG CTAAATGACG CCGAGGTTGC CGCCCTCCTG 60GAGAACTTCA GCTCTTCCTA TGACTATGGA GAAAACGAGA GTGACTCGTG CTGTACCTCC 120CCGCCCTGCC CACAGGACTT CAGCCTGAAC TTCGACCGGG CCTTCCTGCC AGCCCTCTAC 180AGCCTCCTCT TTCTGCTGGG GCTGCTGGGC AACGGCGCGG TGGCAGCCGT GCTGCTGAGC 240CGGCGGACAG CCCTGAGCAG CACCGACACC TTCCTGCTCC ACCTAGCTGT AGCAGACACG 300CTGCTGGTGC TGACACTGCC GCTCTGGGCA GTGGACGCTG CCGTCCAGTG GGTCTTTGGC 360TCTGGCCTCT GCAAAGTGGC AGGTGCCCTC TTCAACATCA ACTTCTACGC AGGAGCCCTC 420CTGCTGGCCT GCATCAGCTT TGACCGCTAC CTGAACATAG TTCATGCCAC CCAGCTCTAC 480CGCCGGGGGC CCCCGGCCCG CGTGACCCTC ACCTGCCTGG CTGTCTGGGG GCTCTGCCTG 540CTTTTCGCCC TCCCAGACTT CATCTTCCTG TCGGCCCACC ACGACGAGCG CCTCAACGCC 600ACCCACTGCC AATACAACTT CCCACAGGTG GGCCGCACGG CTCTGCGGGT GCTGCAGCTG 660GTGGCTGGCT TTCTGCTGCC CCTGCTGGTC ATGGCCTACT GCTATGCCCA CATCCTGGCC 720GTGCTGCTGG TTTCCAGGGG CCAGCGGCGC CTGCGGGCCA TGCGGCTGGT GGTGGTGGTC 780GTGGTGGCCT TTGCCCTCTG CTGGACCCCC TATCACCTGG TGGTGCTGGT GGACATCCTC 840ATGGACCTGG GCGCTTTGGC CCGCAACTGT GGCCGAGAAA GCAGGGTAGA CGTGGCCAAG 900TCGGTCACCT CAGGCCTGGG CTACATGCAC TGCTGCCTCA ACCCGCTGCT CTATGCCTTT 960GTAGGGGTCA AGTTCCGGGA GCGGATGTGG ATGCTGCTCT TGCGCCTGGG CTGCCCCAAC 1020CAGAGAGGGC TCCAGAGGCA GCCATCGTCT TCCCGCCGGG ATTCATCCTG GTCTGAGACC 1080TCAGAGGCCT CCTACTCGGG CTTGTGA 1107(21)SEQ ID NO20的资料(i)序列特征(A)长度368个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO20的序列描述Met Val Leu Glu Val Ser Asp His Gln Val Leu Asn Asp Ala Glu Val1 5 10 15Ala Ala Leu Leu Glu Asn Phe Ser Ser Ser Tyr Asp Tyr Gly Glu Asn20 25 30Glu Ser Asp Ser Cys Cys Thr Ser Pro Pro Cys Pro Gln Asp Phe Ser35 40 45Leu Asn Phe Asp Arg Ala Phe Leu Pro Ala Leu Tyr Ser Leu Leu Phe50 55 60Leu Leu Gly Leu Leu Gly Asn G1y Ala Val Ala Ala Val Leu Leu Ser65 70 75 80Arg Arg Thr Ala Leu Ser Ser Thr Asp Thr Phe Leu Leu His Leu Ala85 90 95Val Ala Asp Thr Leu Leu Val Leu Thr Leu Pro Leu Trp Ala Val Asp100 105 110Ala Ala Val Gln Trp Val Phe Gly Ser Gly Leu Cys Lys Val Ala Gly115 120 125Ala Leu Phe Asn Ile Asn Phe Tyr Ala Gly Ala Leu Leu Leu Ala Cys130 135 140Ile Ser Phe Asp Arg Tyr Leu Asn Ile Val His Ala Thr Gln Leu Tyr145 150 155 160Arg Arg Gly Pro Pro Ala Arg Val Thr Leu Thr Cys Leu Ala Val Trp165 170 175Gly Leu Cys Leu Leu Phe Ala Leu Pro Asp Phe Ile Phe Leu Ser Ala
180 185 190His His Asp Glu Arg Leu Asn Ala Thr His Cys Gln Tyr Asn Phe Prol95 200 205Gln Val Gly Arg Thr Ala Leu Arg Val Leu Gln Leu Val Ala Gly Phe210 215 220Leu Leu Pro Leu Leu Val Met Ala Tyr Cys Tyr Ala His Ile Leu Ala225 230 235 240Val Leu Leu Val Ser Arg Gly Gln Arg Arg Leu Arg Ala Met Arg Leu245 250 255Val Val Val Val Val Val Ala Phe Ala Leu Cys Trp Thr Pro Tyr His260 265 270Leu Val Val Leu Val Asp Ile Leu Met Asp Leu Gly Ala Leu Ala Arg275 280 285Asn Cys Gly Arg Glu Ser Arg Val Asp Val Ala Lys Ser Val Thr Ser290 295 300Gly Leu Gly Tyr Met His Cys Cys Leu Asn Pro Leu Leu Tyr Ala Phe305 310 315 320Val Gly Val Lys Phe Arg Glu Arg Met Trp Met Leu Leu Leu Arg Leu325 330 335Gly Cys Pro Asn Gln Arg Gly Leu Gln Arg Gln Pro Ser Ser Ser Arg340 345 350Arg Asp Ser Ser Trp Ser Glu Thr Ser Glu Ala Ser Tyr Ser Gly Leu355 360 365(22)SEQ ID NO21的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO21的序列描述TTAAGCTTGA CCTAATGCCA TCTTGTGTCC 30(23)SEQ ID NO22的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组)(xi)SEQ ID NO22的序列描述TTGGATCCAA AAGAACCATG CACCTCAGAG 30(24)SEQ ID NO23的资料(i)序列特征(A)长度1074个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO23的序列描述ATGGCTGATG ACTATGGCTC TGAATCCACA TCTTCCATGG AAGACTACGT TAACTTCAAC 60TTCACTGACT TCTACTGTGA GAAAAACAAT GTCAGGCAGT TTGCGAGCCA TTTCCTCCCA 120CCCTTGTACT GGCTCGTGTT CATCGTGGGT GCCTTGGGCA ACAGTCTTGT TATCCTTGTC 180TACTGGTACT GCACAAGAGT GAAGACCATG ACCGACATGT TCCTTTTGAA TTTGGCAATT 240GCTGACCTCC TCTTTCTTGT CACTCTTCCC TTCTGGGCCA TTGCTGCTGC TGACCAGTGG 300AAGTTCCAGA CCTTCATGTG CAAGGTGGTC AACAGCATGT ACAAGATGAA CTTCTACAGC 360TGTGTGTTGC TGATCATGTG CATCAGCGTG GACAGGTACA TTGCCATTGC CCAGGCCATG 420AGAGCACATA CTTGGAGGGA GAAAAGGCTT TTGTACAGCA AAATGGTTTG CTTTACCATC 480TGGGTATTGG CAGCTGCTCT CTGCATCCCA GAAATCTTAT ACAGCCAAAT CAAGGAGGAA 540TCCGGCATTG CTATCTGCAC CATGGTTTAC CCTAGCGATG AGAGCACCAA ACTGAAGTCA 600GCTGTCTTGA CCCTGAAGGT CATTCTGGGG TTCTTCCTTC CCTTCGTGGT CATGGCTTGC 660TGCTATACCA TCATCATTCA CACCCTGATA CAAGCCAAGA AGTCTTCCAA GCACAAAGCC 720CTAAAAGTGA CCATCACTGT CCTGACCGTC TTTGTCTTGT CTCAGTTTCC CTACAACTGC 780ATTTTGTTGG TGCAGACCAT TGACGCCTAT GCCATGTTCA TCTCCAACTG TGCCGTTTCC 840ACCAACATTG ACATCTGCTT CCAGGTCACC CAGACCATCG CCTTCTTCCA CAGTTGCCTG 900AACCCTGTTC TCTATGTTTT TGTGGGTGAG AGATTCCGCC GGGATCTCGT GAAAACCCTG 960AAGAACTTGG GTTGCATCAG CCAGGCCCAG TGGGTTTCAT TTACAAGGAG AGAGGGAAGC 1020TTGAAGCTGT CGTCTATGTT GCTGGAGACA ACCTCAGGAG CACTCTCCCT CTGA 1074(25)SEQ ID NO24的资料(i)序列特征(A)长度357个氨基酸(B)类型氨基酸(C)链型
(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO24的序列描述Met Ala Asp Asp Tyr Gly Ser Glu Ser Thr Ser Ser Met Glu Asp Tyr1 5 10 15Val Asn Phe Asn Phe Thr Asp Phe Tyr Cys Glu Lys Asn Asn Val Arg20 25 30Gln Phe Ala Ser His Phe Leu Pro Pro Leu Tyr Trp Leu Val Phe Ile35 40 45Val Gly Ala Leu Gly Asn Ser Leu Val Ile Leu Val Tyr Trp Tyr Cys50 55 60Thr Arg Val Lys Thr Met Thr Asp Met Phe Leu Leu Asn Leu Ala Ile65 70 75 80Ala Asp Leu Leu Phe Leu Val Thr Leu Pro Phe Trp Ala Ile Ala Ala85 90 95Ala Asp Gln Trp Lys Phe Gln Thr Phe Met Cys Lys Val Val Asn Ser100 105 110Met Tyr Lys Met Asn Phe Tyr Ser Cys Val Leu Leu Ile Met Cys Ile115 120 125Ser Val Asp Arg Tyr Ile Ala Ile Ala Gln Ala Met Arg Ala His Thr130 135 140Trp Arg Glu Lys Arg Leu Leu Tyr Ser Lys Met Val Cys Phe Thr Ile145 150 155 160Trp Val Leu Ala Ala Ala Leu Cys Ile Pro Glu Ile Leu Tyr Ser Gln165 170 175Ile Lys Glu Glu Ser Gly Ile Ala Ile Cys Thr Met Val Tyr Pro Ser180 185 190Asp Glu Ser Thr Lys Leu Lys Ser Ala Val Leu Thr Leu Lys Val Ile195 200 205Leu Gly Phe Phe Leu Pro Phe Val Val Met Ala Cys Cys Tyr Thr Ile210 215 220Ile Ile His Thr Leu Ile Gln Ala Lys Lys Ser Ser Lys His Lys Ala225 230 235 240Leu Lys Val Thr Ile Thr Val Leu Thr Val Phe Val Leu Ser Gln Phe245 250 255Pro Tyr Asn Cys Ile Leu Leu Val Gln Thr Ile Asp Ala Tyr Ala Met260 265 270Phe Ile Ser Asn Cys Ala Val Ser Thr Asn Ile Asp Ile Cys Phe Gln275 280 285Val Thr Gln Thr Ile Ala Phe Phe His Ser Cys Leu Asn Pro Val Leu290 295 300Tyr Val Phe Val Gly Glu Arg Phe Arg Arg Asp Leu Val Lys Thr Leu305 310 315 320Lys Asn Leu Gly Cys Ile Ser Gln Ala Gln Trp Val Ser Phe Thr Arg325 330 335Arg Glu Gly Ser Leu Lys Leu Ser Ser Met Leu Leu Glu Thr Thr Ser340 345 350Gly Ala Leu Ser Leu355(26)SEQ ID NO25的资料(i)序列特征(A)长度1110个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO25的序列描述A1(27)SEQ ID NO26的资料(i)序列特征(A)长度369个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO26的序列描述Met Ala Ser Ser Thr Thr Arg Gly Pro Arg Val Ser Asp Leu Phe Ser1 5 10 15Gly Leu Pro Pro Ala Val Thr Thr Pro Ala Asn Gln Ser Ala Glu Ala20 25 30Ser Ala Gly Asn Gly Ser Val Ala Gly Ala Asp Ala Pro Ala Val Thr35 40 45Pro Phe Gln Ser Leu Gln Leu Val His Gln Leu Lys Gly Leu Ile Val50 55 60Leu Leu Tyr Ser Val Val Val Val Val Gly Leu Val Gly Asn Cys Leu65 70 75 80Leu Val Leu Val Ile Ala Arg Val Pro Arg Leu His Asn Val Thr Asn85 90 95Phe Leu Ile Gly Asn Leu Ala Leu Ser Asp Val Leu Met Cys Thr Ala100 105 110Cys Val Pro Leu Thr Leu Ala Tyr Ala Phe Glu Pro Arg Gly Trp Val115 120 125Phe Gly Gly Gly Leu Cys His Leu Val Phe Phe Leu Gln Pro Val Thr130 135 140Val Tyr Val Ser Val Phe Thr Leu Thr Thr Ile Ala Val Asp Arg Tyr145 150 155 160Val Val Leu Val His Pro Leu Arg Arg Ala Ser Arg Cys Ala Ser Ala165 170 175Tyr Ala Val Leu Ala Ile Trp Ala Leu Ser Ala Val Leu Ala Leu Pro180 185 190Pro Ala Val His Thr Tyr His Val Glu Leu Lys Pro His Asp Val Arg195 200 205Leu Cys Glu Glu Phe Trp Gly Ser Gln Glu Arg Gln Arg Gln Leu Tyr210 215 220Ala Trp Gly Leu Leu Leu Val Thr Tyr Leu Leu Pro Leu Leu Val Ile225 230 235 240Leu Leu Ser Tyr Val Arg Val Ser Val Lys Leu Arg Asn Arg Val Val245 250 255Pro Gly Cys Val Thr Gln Ser Gln Ala Asp Trp Asp Arg Ala Arg Arg260 265 270Arg Arg Thr Phe Cys Leu Leu Val Val Val Val Val Val Phe Ala Val275 280 285Cys Trp Leu Pro Leu His Val Phe Asn Leu Leu Arg Asp Leu Asp Pro290 295 300His Ala Ile Asp Pro Tyr Ala Phe Gly Leu Val Gln Leu Leu Cys His305 310 315 320Trp Leu Ala Met Ser Ser Ala Cys Tyr Asn Pro Phe lle Tyr Ala Trp325 330 335Leu His Asp Ser Phe Arg Glu Glu Leu Arg Lys Leu Leu Val Ala Trp340 345 350Pro Arg Lys Ile Ala Pro His Gly Gln Asn Met Thr Val Ser Val Val355 360 365Ile(28)SEQ ID NO27的资料(i)序列特征(A)长度1083个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO27的序列描述ATGGACCCAG AAGAAACTTC AGTTTATTTG GATTATTACT ATGCTACGAG CCCAAACTCT 60GACATCAGGG AGACCCACTC CCATGTTCCT TACACCTCTG TCTTCCTTCC AGTCTTTTAC 120ACAGCTGTGT TCCTGACTGG AGTGCTGGGG AACCTTGTTC TCATGGGAGC GTTGCATTTC 180AAACCCGGCA GCCGAAGACT GATCGACATC TTTATCATCA ATCTGGCTGC CTCTGACTTC 240ATTTTTCTTG TCACATTGCC TCTCTGGGTG GATAAAGAAG CATCTCTAGG ACTGTGGAGG 300ACGGGCTCCT TCCTGTGCAA AGGGAGCTCC TACATGATCT CCGTCAATAT GCACTGCAGT 360GTCCTCCTGC TCACTTGCAT GAGTGTTGAC CGCTACCTGG CCATTGTGTG GCCAGTCGTA 420TCCAGGAAAT TCAGAAGGAC AGACTGTGCA TATGTAGTCT GTGCCAGCAT CTGGTTTATC 480TCCTGCCTGC TGGGGTTGCC TACTCTTCTG TCCAGGGAGC TCACGCTGAT TGATGATAAG 540CCATACTGTG CAGAGAAAAA GGCAACTCCA ATTAAACTCA TATGGTCCCT GGTGGCCTTA 600ATTTTCACCT TTTTTGTCCC TTTGTTGAGC ATTGTGACCT GCTACTGTTG CATTGCAAGG 660AAGCTGTGTG CCCATTACCA GCAATCAGGA AAGCACAACA AAAAGCTGAA GAAATCTATA 720AAGATCATCT TTATTGTCGT GGCAGCCTTT CTTGTCTCCT GGCTGCCCTT CAATACTTTC 780AAGTTCCTGG CCATTGTCTC TGGGTTGCGG CAAGAACACT ATTTACCCTC AGCTATTCTT 840CAGCTTGGTA TGGAGGTGAG TGGACCCTTG GCATTTGCCA ACAGCTGTGT CAACCCTTTC 900ATTTACTATA TCTTCGACAG CTACATCCGC CGGGCCATTG TCCACTGCTT GTGCCCTTGC 960CTGAAAAACT ATGACTTTGG GAGTAGCACT GAGACATCAG ATAGTCACCT CACTAAGGCT 1020CTCTCCACCT TCATTCATGC AGAAGATTTT GCCAGGAGGA GGAAGAGGTC TGTGTCACTC 1080TAA 1083(29)SEQ ID NO28的资料(i)序列特征(A)长度360个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO28的序列描述Met Asp Pro Glu Glu Thr Ser Val Tyr Leu Asp Tyr Tyr Tyr Ala Thr1 5 10 15Ser Pro Asn Ser Asp Ile Arg Glu Thr His Ser His Val Pro Tyr Thr20 25 30Ser Val Phe Leu Pro Val Phe Tyr Thr Ala Val Phe Leu Thr Gly Val35 40 45Leu Gly Asn Leu Val Leu Met Gly Ala Leu His Phe Lys Pro Gly Ser50 55 60Arg Arg Leu Ile Asp Ile Phe Ile Ile Asn Leu Ala Ala Ser Asp Phe65 70 75 80Ile Phe Leu Val Thr Leu Pro Leu Trp Val Asp Lys Glu Ala Ser Leu85 90 95Gly Leu Trp Arg Thr Gly Ser Phe Leu Cys Lys Gly Ser Ser Tyr Met100 105 110Ile Ser Val Asn Met His Cys Ser Val Leu Leu Leu Thr Cys Met Ser115 120 125Val Asp Arg Tyr Leu Ala Ile Val Trp Pro Val Val Ser Arg Lys Phe130 135 140Arg Arg Thr Asp Cys Ala Tyr Val Val Cys Ala Ser Ile Trp Phe Ile145 150 155 160Ser Cys Leu Leu Gly Leu Pro Thr Leu Leu Ser Arg Glu Leu Thr Leu165 170 175Ile Asp Asp Lys Pro Tyr Cys Ala Glu Lys Lys Ala Thr Pro Ile Lys180 185 190Leu Ile Trp Ser Leu Val Ala Leu Ile Phe Thr Phe Phe Val Pro Leu195 200 205Leu Ser Ile Val Thr Cys Tyr Cys Cys Ile Ala Arg Lys Leu Cys Ala210 215 220His Tyr Gln Gln Ser Gly Lys His Asn Lys Lys Leu Lys Lys Ser Ile225 230 235 240Lys Ile Ile Phe Ile Val Val Ala Ala Phe Leu Val Ser Trp Leu Pro245 250 255Phe Asn Thr Phe Lys Phe Leu Ala Ile Val Ser Gly Leu Arg Gln Glu260 265 270His Tyr Leu Pro Ser Ala Ile Leu Gln Leu Gly Met Glu Val Ser Gly275 280 285Pro Leu Ala Phe Ala Asn Ser Cys Val Asn Pro Phe Ile Tyr Tyr Ile290 295 300Phe Asp Ser Tyr Ile Arg Arg Ala Ile Val His Cys Leu Cys Pro Cys305 310 315 320Leu Lys Asn Tyr Asp Phe Gly Ser Ser Thr Glu Thr Ser Asp Ser His325 330 335Leu Thr Lys Ala Leu Ser Thr Phe Ile His Ala Glu Asp Phe Ala Arg340 345 350Arg Arg Lys Arg Ser Val Ser Leu355 360(30)SEQ ID NO29的资料(i)序列特征(A)长度31个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO29的序列描述CTAGAATTCT GACTCCAGCC AAAGCATGAA T 3l(31)SEQ ID NO30的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO30的序列描述GCTGGATCCT AAACAGTCTG CGCTCGGCCT 30(32)SEQ ID NO31的资料(i)序列特征(A)长度1020个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO31的序列描述ATGAATGGCC TTGAAGTGGC TCCCCCAGGT CTGATCACCA ACTTCTCCCT GGCCACGGCA 60GAGCAATGTG GCCAGGAGAC GCCACTGGAG AACATGCTGT TCGCCTCCTT CTACCTTCTG 120GATTTTATCC TGGCTTTAGT TGGCAATACC CTGGCTCTGT GGCTTTTCAT CCGAGACCAC 180AAGTCCGGGA CCCCGGCCAA CGTGTTCCTG ATGCATCTGG CCGTGGCCGA CTTGTCGTGC 240GTGCTGGTCC TGCCCACCCG CCTGGTCTAC CACTTCTCTG GGAACCACTG GCCATTTGGG 300GAAATCGCAT GCCGTCTCAC CGGCTTCCTC TTCTACCTCA ACATGTACGC CAGCATCTAC 360TTCCTCACCT GCATCAGCGC CGACCGTTTC CTGGCCATTG TGCACCCGGT CAAGTCCCTC 420AAGCTCCGCA GGCCCCTCTA CGCACACCTG GCCTGTGCCT TCCTGTGGGT GGTGGTGGCT 480GTGGCCATGG CCCCGCTGCT GGTGAGCCCA CAGACCGTGC AGACCAACCA CACGGTGGTC 540TGCCTGCAGC TGTACCGGGA GAAGGCCTCC CACCATGCCC TGGTGTCCCT GGCAGTGGCC 600TTCACCTTCC CGTTCATCAC CACGGTCACC TGCTACCTGC TGATCATCCG CAGCCTGCGG 660CAGGGCCTGC GTGTGGAGAA GCGCCTCAAG ACCAAGGCAG TGCGCATGAT CGCCATAGTG 720CTGGCCATCT TCCTGGTCTG CTTCGTGCCC TACCACGTCA ACCGCTCCGT CTACGTGCTG 780CACTACCGCA GCCATGGGGC CTCCTGCGCC ACCCAGCGCA TCCTGGCCCT GGCAAACCGC 840ATCACCTCCT GCCTCACCAG CCTCAACGGG GCACTCGACC CCATCATGTA TTTCTTCGTG 900GCTGAGAAGT TCCGCCACGC CCTGTGCAAC TTGCTCTGTG GCAAAAGGCT CAAGGGCCCG 960CCCCCCAGCT TCGAAGGGAA AACCAACGAG AGCTCGCTGA GTGCCAAGTC AGAGCTGTGA 1020(33)SEQ ID NO32的资料(i)序列特征(A)长度339个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO32的序列描述Met Asn Gly Leu Glu Val Ala Pro Pro Gly Leu Ile Thr Asn Phe Ser1 5 10 15Leu Ala Thr Ala Glu Gln Cys Gly Gln Glu Thr Pro Leu Glu Asn Met20 25 30Leu Phe Ala Ser Phe Tyr Leu Leu Asp Phe Ile Leu Ala Leu Val Gly35 40 45Asn Thr Leu Ala Leu Trp Leu Phe Ile Arg Asp His Lys Ser Gly Thr50 55 60Pro Ala Asn Val Phe Leu Met His Leu Ala Val Ala Asp Leu Ser Cys65 70 75 80Val Leu Val Leu Pro Thr Arg Leu Val Tyr His Phe Ser Gly Asn His85 90 95Trp Pro Phe Gly Glu Ile Ala Cys Arg Leu Thr Gly Phe Leu Phe Tyr100 105 110Leu Asn Met Tyr Ala Ser Ile Tyr Phe Leu Thr Cys Ile Ser Ala Asp115 120 125Arg Phe Leu Ala Ile Val His Pro Val Lys Ser Leu Lys Leu Arg Arg130 135 140Pro Leu Tyr Ala His Leu Ala Cys Ala Phe Leu Trp Val Val Val Ala145 150 155 160Val Ala Met Ala Pro Leu Leu Val Ser Pro Gln Thr Val Gln Thr Asn165 170 175His Thr Val Val Cys Leu Gln Leu Tyr Arg Glu Lys Ala Ser His His180 185 190Ala Leu Val Ser Leu Ala Val Ala Phe Thr Phe Pro Phe Ile Thr Thr195 200 205Val Thr Cys Tyr Leu Leu Ile Ile Arg Ser Leu Arg Gln Gly Leu Arg210 215 220Val Glu Lys Arg Leu Lys Thr Lys Ala Val Arg Met Ile Ala Ile Val225 230 235 240Leu Ala Ile Phe Leu Val Cys Phe Val Pro Tyr His Val Asn Arg Ser245 250 255Val Tyr Val Leu His Tyr Arg Ser His Gly Ala Ser Cys Ala Thr Gln260 265 270Arg Ile Leu Ala Leu Ala Asn Arg Ile Thr Ser Cys Leu Thr Ser Leu275 280 285Asn Gly Ala Leu Asp Pro Ile Met Tyr Phe Phe Val Ala Glu Lys Phe290 295 300Arg His Ala Leu Cys Asn Leu Leu Cys Gly Lys Arg Leu Lys Gly Pro305 310 315 320Pro Pro Ser Phe Glu Gly Lys Thr Asn Glu Ser Ser Leu Ser Ala Lys325 330 335Ser Glu Leu(34)SEQ ID NO33的资料(i)序列特征(A)长度29个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO33的序列描述ATAAGATGAT CACCCTGAAC AATCAAGAT 29(35)SEQ ID NO34的资料(i)序列特征(A)长度33个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO34的序列描述TCCGAATTCA TAACATTTCA CTGTTTATAT TGC 33(36)SEQ ID NO35的资料(i)序列特征(A)长度996个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO35的序列描述ATGATCACCC TGAACAATCA AGATCAACCT GTCACTTTTA ACAGCTCACA TCCAGATGAA 60TACAAAATTG CAGCCCTTGT CTTCTATAGC TGTATCTTCA TAATTGGATT ATTTGTTAAC 120ATCACTGCAT TATGGGTTTT CAGTTGTACC ACCAAGAAGA GAACCACGGT AACCATCTAT 180ATGATGAATG TGGCATTAGT GGACTTGATA TTTATAATGA CTTTACCCTT TCGAATGTTT 240TATTATGCAA AAGATGCATG GCCATTTGGA GAGTACTTCT GCCAGATTAT TGGAGCTCTC 300ACAGTGTTTT ACCCAAGCAT TGCTTTATGG CTTCTTGCCT TTATTAGTGC TGACAGATAC 360ATGGCCATTG TACAGCCGAA GTACGCCAAA GAACTTAAAA ACACGTGCAA AGCCGTGCTG 420GCGTGTGTGG GAGTCTGGAT AATGACCCTG ACCACGACCA CCCCTCTGCT ACTGCTCTAT 480AAAGACCCAG ATAAAGACTC CACTCCCGCC ACCTGCCTCA AGATTTCTGA CATCATCTAT 540CTAAAAGCTG TGAACGTGCT GAACCTCACT CGACTGACAT TTTTTTTCTT GATTCCTTTG 600TTCATCATGA TTGGGTGCTA CTTGGTCATT ATTCATAATC TCCTTCACGG CAGGACGTCT 660AAGCTGAAAC CCAAAGTCAA GGAGAAGTCC ATAAGGATCA TCATCACGCT GCTGGTGCAG 720GTGCTCGTCT GCTTTATGCC CTTCCACATC TGTTTCGCTT TCCTGATGCT GGGAACGGGG 780GAGAACAGTT ACAATCCCTG GGGAGCCTTT ACCACCTTCC TCATGAACCT CAGCACGTGT 840CTGGATGTGA TTCTCTACTA CATCGTTTCA AAACAATTTC AGGCTCGAGT CATTAGTGTC 900ATGCTATACC GTAATTACCT TCGAAGCCTG CGCAGAAAAA GTTTCCGATC TGGTAGTCTA 960AGGTCACTAA GCAATATAAA CAGTGAAATG TTATGA 996(37)SEQ ID NO36的资料(i)序列特征(A)长度331个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO36的序列描述Met Ile Thr Leu Asn Asn Gln Asp Gln Pro Val Thr Phe Asn Ser Ser1 5 10 15His Pro Asp Glu Tyr Lys Ile Ala Ala Leu Val Phe Tyr Ser Cys Ile20 25 30Phe Ile Ile Gly Leu Phe Val Asn Ile Thr Ala Leu Trp Val Phe Ser35 40 45Cys Thr Thr Lys Lys Arg Thr Thr Val Thr Ile Tyr Met Met Asn Val50 55 60Ala Leu Val Asp Leu Ile Phe Ile Met Thr Leu Pro Phe Arg Met Phe65 70 75 80Tyr Tyr Ala Lys Asp Ala Trp Pro Phe Gly Glu Tyr Phe Cys Gln Ile85 90 95Ile Gly Ala Leu Thr Val Phe Tyr Pro Ser Ile Ala Leu Trp Leu Leu100 105 110Ala Phe Ile Ser Ala Asp Arg Tyr Met Ala Ile Val Gln Pro Lys Tyr115 120 125Ala Lys Glu Leu Lys Asn Thr Cys Lys Ala Val Leu Ala Cys Val Gly
130 135 140Val Trp Ile Met Thr Leu Thr Thr Thr Thr Pro Leu Leu Leu Leu Tyr145 150 155 160Lys Asp Pro Asp Lys Asp Ser Thr Pro Ala Thr Cys Leu Lys Ile Ser165 170 175Asp Ile Ile Tyr Leu Lys Ala Val Asn Val Leu Asn Leu Thr Arg Leu180 185 190Thr Phe Phe Phe Leu Ile Pro Leu Phe Ile Met Ile Gly Cys Tyr Leu195 200 205Val Ile Ile His Asn Leu Leu His Gly Arg Thr Ser Lys Leu Lys Pro210 215220Lys Val Lys Glu Lys Ser Ile Arg Ile Ile Ile Thr Leu Leu Val Gln225 230 235 240Val Leu Val Cys Phe Met Pro Phe His Ile Cys Phe Ala Phe Leu Met245 250 255Leu Gly Thr Gly Glu Asn Ser Tyr Asn Pro Trp Gly Ala Phe Thr Thr260 265 270Phe Leu Met Asn Leu Ser Thr Cys Leu Asp Val Ile Leu Tyr Tyr Ile275 280 285Val Ser Lys Gln Phe Gln Ala Arg Val Ile Ser Val Met Leu Tyr Arg290 295 300Asn Tyr Leu Arg Ser Leu Arg Arg Lys Ser Phe Arg Ser Gly Ser Leu305 310 315 320Arg Ser Leu Ser Asn Ile Asn Ser Glu Met Leu330(38)SEQ ID NO37的资料(i)序列特征(A)长度28个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO37的序列描述CCAAGCTTCC AGGCCTGGGG TGTGCTGG28(39)SEQ ID NO38的资料(i)序列特征(A)长度29个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO38的序列描述ATGGATCCTG ACCTTCGGCC CCTGGCAGA 29(40)SEQ ID NO39的资料(i)序列特征(A)长度1077个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO39的序列描述ATGCCCTCTG TGTCTCCAGC GGGGCCCTCG GCCGGGGCAG TCCCCAATGC CACCGCAGTG 60ACAACAGTGC GGACCAATGC CAGCGGGCTG GAGGTGCCCC TGTTCCACCT GTTTGCCCGG 120CTGGACGAGG AGCTGCATGG CACCTTCCCA GGCCTGTGCG TGGCGCTGAT GGCGGTGCAC 180GGAGCCATCT TCCTGGCAGG GCTGGTGCTC AACGGGCTGG CGCTGTACGT CTTCTGCTGC 240CGCACCCGGG CCAAGACACC CTCAGTCATC TACACCATCA ACCTGGTGGT GACCGATCTA 300CTGGTAGGGC TGTCCCTGCC CACGCGCTTC GCTGTGTACT ACGGCGCCAG GGGCTGCCTG 360CGCTGTGCCT TCCCGCACGT CCTCGGTTAC TTCCTCAACA TGCACTGCTC CATCCTCTTC 420CTCACCTGCA TCTGCGTGGA CCGCTACCTG GCCATCGTGC GGCCCGAAGG CTCCCGCCGC 480TGCCGCCAGC CTGCCTGTGC CAGGGCCGTG TGCGCCTTCG TGTGGCTGGC CGCCGGTGCC 540GTCACCCTGT CGGTGCTGGG CGTGACAGGC AGCCGGCCCT GCTGCCGTGT CTTTGCGCTG 600ACTGTCCTGG AGTTCCTGCT GCCCCTGCTG GTCATCAGCG TGTTTACCGG CCGCATCATG 660TGTGCACTGT CGCGGCCGGG TCTGCTCCAC CAGGGTCGCC AGCGCCGCGT GCGGGCCATG 720CAGCTCCTGC TCACGGTGCT CATCATCTTT CTCGTCTGCT TCACGCCCTT CCACGCCCGC 780CAAGTGGCCG TGGCGCTGTG GCCCGACATG CCACACCACA CGAGCCTCGT GGTCTACCAC 840GTGGCCGTGA CCCTCAGCAG CCTCAACAGC TGCATGGACC CCATCGTCTA CTGCTTCGTC 900ACCAGTGGCT TCCAGGCCAC CGTCCGAGGC CTCTTCGGCC AGCACGGAGA GCGTGAGCCC 960AGCAGCGGTG ACGTGGTCAG CATGCACAGG AGCTCCAAGG GCTCAGGCCG TCATCACATC 1020CTCAGTGCCG GCCCTCACGC CCTCACCCAG GCCCTGGCTA ATGGGCCCGA GGCTTAG1077(41)SEQ ID NO40的资料(i)序列特征(A)长度358个氨基酸(B)类型氨基酸(C)链型
(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO40的序列描述Met Pro Ser Val Ser Pro Ala Gly Pro Ser Ala Gly Ala Val Pro Asn1 5 10 15Ala Thr Ala Val Thr Thr Val Arg Thr Asn Ala Ser Gly Leu Glu Val20 25 30Pro Leu Phe His Leu Phe Ala Arg Leu Asp Glu Glu Leu His Gly Thr35 40 45Phe Pro Gly Leu Cys Val Ala Leu Met Ala Val His Gly Ala Ile Phe50 55 60Leu Ala Gly Leu Val Leu Asn Gly Leu Ala Leu Tyr Val Phe Cys Cys65 70 75 80Arg Thr Arg Ala Lys Thr Pro Ser Val Ile Tyr Thr Ile Asn Leu Val85 90 95Val Thr Asp Leu Leu Val Gly Leu Ser Leu Pro Thr Arg Phe Ala Val100 105 110Tyr Tyr Gly Ala Arg Gly Cys Leu Arg Cys Ala Phe Pro His Val Leu115 120 125Gly Tyr Phe Leu Asn Met His Cys Ser Ile Leu Phe Leu Thr Cys Ile130 135 140Cys Val Asp Arg Tyr Leu Ala Ile Val Arg Pro Glu Ala Pro Ala Ala145 150 155 160Cys Arg Gln Pro Ala Cys Ala Arg Ala Val Cys Ala Phe Val Trp Leu165 170 175Ala Ala Gly Ala Val Thr Leu Ser Val Leu Gly Val Thr Gly Ser Arg180 185 190Pro Cys Cys Arg Val Phe Ala Leu Thr Val Leu Glu Phe Leu Leu Pro195 200 205Leu Leu Val Ile Ser Val Phe Thr Gly Arg Ile Met Cys Ala Leu Ser210 215 220Arg Pro Gly Leu Leu His Gln Gly Arg Gln Arg Arg Val Arg Ala Met225 230 235 240Gln Leu Leu Leu Thr Val Leu Ile Ile Phe Leu Val Cys Phe Thr Pro245 250 255Phe His Ala Arg Gln Val Ala Val Ala Leu Trp Pro Asp Met Pro His260 265 270His Thr Ser Leu Val Val Tyr His Val Ala Val Thr Leu Ser Ser Leu275 280 285Asn Ser Cys Met Asp Pro Ile Val Tyr Cys Phe Val Thr Ser Gly Phe290 295 300Gln Ala Thr Val Arg Gly Leu Phe Gly Gln His Gly Glu Arg Glu Pro305 310 315 320Ser Ser Gly Asp Val Val Ser Met His Arg Ser Ser Lys Gly Ser Gly325 330 335Arg His His Ile Leu Ser Ala Gly Pro His Ala Leu Thr Gln Ala Leu340 345 350Ala Asn Gly Pro Glu Ala355(42)SEQ ID NO41的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO41的序列描述GAGAATTCAC TCCTGAGCTC AAGATGAACT 30(43)SEQ ID NO42的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO42的序列描述CGGGATCCCC GTAACTGAGC CACTTCAGAT 30(44)SEQ ID NO43的资料(i)序列特征(A)长度1050个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO43的序列描述ATGAACTCCA CCTTGGATGG TAATCAGAGC AGCCACCCTT TTTGCCTCTT GGCATTTGGC 60TATTTGGAAA CTGTCAATTT TTGCCTTTTG GAAGTATTGA TTATTGTCTT TCTAACTGTA 120TTGATTATTT CTGGCAACAT CATTGTGATT TTTGTATTTC ACTGTGCACC TTTGTTGAAC 180CATCACACTA CAAGTTATTT TATCCAGACT ATGGCATATG CTGACCTTTT TGTTGGGGTG 240AGCTGCGTGG TCCCTTCTTT ATCACTCCTC CATCACCCCC TTCCAGTAGA GGAGTCCTTG 300ACTTGCCAGA TATTTGGTTT TGTAGTATCA GTTCTGAAGA GCGTCTCCAT GGCTTCTCTG 360GCCTGTATCA GCATTGATAG ATACATTGCC ATTACTAAAC CTTTAACCTA TAATACTCTG 420GTTACACCCT GGAGACTACG CCTGTGTATT TTCCTGATTT GGCTATACTC GACCCTGGTC 480TTCCTGCCTT CCTTTTTCCA CTGGGGCAAA CCTGGATATC ATGGAGATGT GTTTCAGTGG 540TGTGCGGAGT CCTGGCACAC CGACTCCTAC TTCACCCTGT TCATCGTGAT GATGTTATAT 600GCCCCAGCAG CCCTTATTGT CTGCTTCACC TATTTCAACA TCTTCCGCAT CTGCCAACAG 660CACACAAAGG ATATCAGCGA AAGGCAAGCC CGCTTCAGCA GCCAGAGTGG GGAGACTGGG 720GAAGTGCAGG CCTGTCCTGA TAAGCGCTAT GCCATGGTCC TGTTTCGAAT CACTAGTGTA 780TTTTACATCC TCTGGTTGCC ATATATCATC TACTTCTTGT TGGAAAGCTC CACTGGCCAC 840AGCAACCGCT TCGCATCCTT CTTGACCACC TGGCTTGCTA TTAGTAACAG TTTCTGCAAC 900TGTGTAATTT ATAGTCTCTC CAACAGTGTA TTCCAAAGAG GACTAAAGCG CCTCTCAGGG 960GCTATGTGTA CTTCTTGTGC AAGTCAGACT ACAGCCAACG ACCCTTACAC AGTTAGAAGC 1020AAAGGCCCTC TTAATGGATG TCATATCTGA 1050(45)SEQ ID NO44的资料(i)序列特征(A)长度349个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO44的序列描述Met Asn Ser Thr Leu Asp Gly Asn Gln Ser Ser His Pro Phe Cys Leu1 5 10 15Leu Ala Phe Gly Tyr Leu Glu Thr Val Asn Phe Cys Leu Leu Glu Val20 25 30Leu Ile Ile Val Phe Leu Thr Val Leu Ile Ile Ser Gly Asn Ile Ile35 40 45Val Ile Phe Val Phe His Cys Ala Pro Leu Leu Asn His His Thr Thr
50 55 60Ser Tyr Phe Ile Gln Thr Met Ala Tyr Ala Asp Leu Phe Val Gly Val65 70 75 80Ser Cys Val Val Pro Ser Leu Ser Leu Leu His His Pro Leu Pro Val85 90 95Glu G1u Ser Leu Thr Cys Gln Ile Phe Gly Phe Val Val Ser Val Leu100 105 110Lys Ser Val Ser Met Ala Ser Leu Ala Cys Ile Ser Ile Asp Arg Tyr115 120 125Ile Ala Ile Thr Lys Pro Leu Thr Tyr Asn Thr Leu Val Thr Pro Trp130 135 140Arg Leu Arg Leu Cys Ile Phe Leu Ile Trp Leu Tyr Ser Thr Leu Val145 150 155 160Phe Leu Pro Ser Phe Phe His Trp Gly Lys Pro Gly Tyr His Gly Asp165 170 175Val Phe Gln Trp Cys Ala Glu Ser Trp His Thr Asp Ser Tyr Phe Thr180 185 190Leu Phe Ile Val Met Met Leu Tyr Ala Pro Ala Ala Leu Ile Val Cys195 200 205Phe Thr Tyr Phe Asn Ile Phe Arg Ile Cys Gln Gln His Thr Lys Asp210 215 220Ile Ser Glu Arg Gln Ala Arg Phe Ser Ser Gln Ser Gly Glu Thr Gly225 230 235 240Glu Val Gln Ala Cys Pro Asp Lys Arg Tyr Ala Met Val Leu Phe Arg245 250 255Ile Thr Ser Val Phe Tyr Ile Leu Trp Leu Pro Tyr Ile Ile Tyr Phe260 265 270Leu Leu Glu Ser Ser Thr Gly His Ser Asn Arg Phe Ala Ser Phe Leu275 280 285Thr Thr Trp Leu Ala Ile Ser Asn Ser Phe Cys Asn Cys Val Ile Tyr290 295 300Ser Leu Ser Asn Ser Val Phe Gln Arg Gly Leu Lys Arg Leu Ser Gly305 310 315 320Ala Met Cys Thr Ser Cys Ala Ser Gln Thr Thr Ala Asn Asp Pro Tyr
325 330 335Thr Val Arg Ser Lys Gly Pro Leu Asn Gly Cys His Ile345(46)SEQ ID NO45的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO45的序列描述TCCCCCGGGA AAAAAACCAA CTGCTCCAAA 30(47)SEQ ID NO46的资料(i)序列特征(A)长度31个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO46的序列描述TAGGATCCAT TTGAATGTGG ATTTGGTGAA A 31(48)SEQ ID NO47的资料(i)序列特征(A)长度1302个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO47的序列描述WCHTRVHGNR AWCANMWRWM MWDNMAHMGS BDRVWNNYMK DDWBHHRTAN GKGBBKDWMH 60ATMWNWWNNN WMHASRTHHT MSNWRMANRG ARTMSNWRMA NRGARABACK SATGTGTTTT 120TCTCCCATTC TGGAAATCAA CATGCAGTCT GAATCTAACA TTACAGTGCG AGATGACATT 180GATGACATCA ACACCAATAT GTACCAACCA CTATCATATC CGTTAAGCTT TCAAGTGTCT 240CTCACCGGAT TTCTTATGTT AGAAATTGTG TTGGGACTTG GCAGCAACCT CACTGTATTG 300GTACTTTACT GCATGAAATC CAACTTAATC AACTCTGTCA GTAACATTAT TACAATGAAT 360CTTCATGTAC TTGATGTAAT AATTTGTGTG GGATGTATTC CTCTAACTAT AGTTATCCTT 420CTGCTTTCAC TGGAGAGTAA CACTGCTCTC ATTTGCTGTT TCCATGAGGC TTGTGTATCT 480TTTGCAAGTG TCTCAACAGC AATCAACGTT TTTGCTATCA CTTTGGACAG ATATGACATC 540TCTGTAAAAC CTGCAAACCG AATTCTGACA ATGGGCAGAG CTGTAATGTT AATGATATCC 600ATTTGGATTT TTTCTTTTTT CTCTTTCCTG ATTCCTTTTA TTGAGGTAAA TTTTTTCAGT 660CTTCAAAGTG GAAATACCTG GGAAAACAAG ACACTTTTAT GTGTCAGTAC AAATGAATAC 720TACACTGAAC TGGGAATGTA TTATCACCTG TTAGTACAGA TCCCAATATT CTTTTTCACT 780GTTGTAGTAA TGTTAATCAC ATACACCAAA ATACTTCAGG CTCTTAATAT TCGAATAGGC 840ACAAGATTTT CAACAGGGCA GAAGAAGAAA GCAAGAAAGA AAAAGACAAT TTCTCTAACC 900ACACAACATG AGGCTACAGA CATGTCACAA AGCAGTGGTG GGAGAAATGT AGTCTTTGGT 960GTAAGAACTT CAGTTTCTGT AATAATTGCC CTCCGGCGAG CTGTGAAACG ACACCGTGAA 1020CGACGAGAAA GACAAAAGAG AGTCTTCAGG ATGTCTTTAT TGATTATTTC TACATTTCTT 1080CTCTGCTGGA CACCAATTTC TGTTTTAAAT ACCACCATTT TATGTTTAGG CCCAAGTGAC 1140CTTTTAGTAA AATTAAGATT GTGTTTTTTA GTCATGGCTT ATGGAACAAC TATATTTCAC 1200CCTCTATTAT ATGCATTCAC TAGACAAAAA TTTCAAAAGG TCTTGAAAAG TAAAATGAAA 1260AAGCGAGTTG TTTCTATAGT AGAAGCTGAT CCCCTGCCTA ATAATGCTGT AATACACAAC 1320TCTTGGATAG ATCCCAAAAG AAACAAAAAA ATTACCTTTG AAGATAGTGA AATAAGAGAA 1380AAACGTTTAG TGCCTCAGGT TGTCACAGAC TAG 1413(49)SEQ ID NO48的资料(i)序列特征(A)长度433个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO48的序列描述Met Cys Phe Ser Pro Ile Leu Glu Ile Asn Met Gln Ser Glu Ser Asn1 5 10 15Ile Thr Val Arg Asp Asp Ile Asp Asp Ile Asn Thr Asn Met Tyr Gln20 25 30Pro Leu Ser Tyr Pro Leu Ser Phe Gln Val Ser Leu Thr Gly Phe Leu35 40 45Met Leu Glu Ile Val Leu Gly Leu Gly Ser Asn Leu Thr Val Leu Val50 55 60Leu Tyr Cys Met Lys Ser Asn Leu Ile Asn Ser Val Ser Asn Ile Ile65 70 75 80Thr Met Asn Leu His Val Leu Asp Val Ile Ile Cys Val Gly Cys Ile85 90 95Pro Leu Thr Ile Val Ile Leu Leu Leu Ser Leu Glu Ser Asn Thr Ala100 105 110Leu Ile Cys Cys Phe His Glu Ala Cys Val Ser Phe Ala Ser Val Ser115 120 125Thr Ala Ile Asn Val Phe Ala Ile Thr Leu Asp Arg Tyr Asp Ile Ser130 135 140Val Lys Pro Ala Asn Arg Ile Leu Thr Met Gly Arg Ala Val Met Leu145 150 155 160Met Ile Ser Ile Trp Ile Phe Ser Phe Phe Ser Phe Leu Ile Pro Phe165 170 175Ile Glu Val Asn Phe Phe Ser Leu Gln Ser Gly Asn Thr Trp Glu Asn180 185 190Lys Thr Leu Leu Cys Val Ser Thr Asn Glu Tyr Tyr Thr Glu Leu Gly195 200 205Met Tyr Tyr His Leu Leu Val Gln Ile Pro Ile Phe Phe Phe Thr Val210 215 220Val Val Met Leu Ile Thr Tyr Thr Lys Ile Leu Gln Ala Leu Asn Ile225 230 235 240Arg Ile Gly Thr Arg Phe Ser Thr Gly Gln Lys Lys Lys Ala Arg Lys245 250 255Lys Lys Thr Ile Ser Leu Thr Thr Gln His Glu Ala Thr Asp Met Ser260 265 270Gln Ser Ser Gly Gly Arg Asn Val Val Phe Gly Val Arg Thr Ser Val275 280 285Ser Val Ile Ile Ala Leu Arg Arg Ala Val Lys Arg His Arg Glu Arg290 295 300Arg Glu Arg Gln Lys Arg Val Phe Arg Met Ser Leu Leu Ile Ile Ser305 310 315 320Thr Phe Leu Leu Cys Trp Thr Pro Ile Ser Val Leu Asn Thr Thr Ile325 330 335Leu Cys Leu Gly Pro Ser Asp Leu Leu Val Lys Leu Arg Leu Cys Phe340 345 350Leu Val Met Ala Tyr Gly Thr Thr Ile Phe His Pro Leu Leu Tyr Ala355 360 365Phe Thr Arg Gln Lys Phe Gln Lys Val Leu Lys Ser Lys Met Lys Lys370 375 380Arg Val Val Ser Ile Val Glu Ala Asp Pro Leu Pro Asn Asn Ala Val385 390 395 400Ile His Asn Ser Trp Ile Asp Pro Lys Arg Asn Lys Lys Ile Thr Phe405 410 415Glu Asp Ser Glu Ile Arg Glu Lys Arg Leu Val Pro Gln Val Val Thr420 425 430Asp(50)SEQ ID NO49的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO49的序列描述GTGAAGCTTG CCTCTGGTGC CTGCAGGAGG 30(51)SEQ ID NO50的资料(i)序列特征(A)长度31个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO50的序列描述GCAGAATTCC CGGTGGCGTG TTGTGGTGCC C 31(52)SEQ ID NO51的资料(i)序列特征(A)长度1209个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO51的序列描述ATGTTGTGTC CTTCCAAGAC AGATGGCTCA GGGCACTCTG GTAGGATTCA CCAGGAAACT 60CATGGAGAAG GGAAAAGGGA CAAGATTAGC AACAGTGAAG GGAGGGAGAA TGGTGGGAGA 120GGATTCCAGA TGAACGGTGG GTCGCTGGAG GCTGAGCATG CCAGCAGGAT GTCAGTTCTC 180AGAGCAAAGC CCATGTCAAA CAGCCAACGC TTGCTCCTTC TGTCCCCAGG ATCACCTCCT 240CGCACGGGGA GCATCTCCTA CATCAACATC ATCATGCCTT CGGTGTTCGG CACCATCTGC 300CTCCTGGGCA TCATCGGGAA CTCCACGGTC ATCTTCGCGG TCGTGAAGAA GTCCAAGCTG 360CACTGGTGCA ACAACGTCCC CGACATCTTC ATCATCAACC TCTCGGTAGT AGATCTCCTC 420TTTCTCCTGG GCATGCCCTT CATGATCCAC CAGCTCATGG GCAATGGGGT GTGGCACTTT 480GGGGAGACCA TGTGCACCCT CATCACGGCC ATGGATGCCA ATAGTCAGTT CACCAGCACC 540TACATCCTGA CCGCCATGGC CATTGACCGC TACCTGGCCA CTGTCCACCC CATCTCTTCC 600ACGAAGTTCC GGAAGCCCTC TGTGGCCACC CTGGTGATCT GCCTCCTGTG GGCCCTCTCC 660TTCATCAGCA TCACCCCTGT GTGGCTGTAT GCCAGACTCA TCCCCTTCCC AGGAGGTGCA 720GTGGGCTGCG GCATACGCCT GCCCAACCCA GACACTGACC TCTACTGGTT CACCCTGTAC 780CAGTTTTTCC TGGCCTTTGC CCTGCCTTTT GTGGTCATCA CAGCCGCATA CGTGAGGATC 840CTGCAGCGCA TGACGTCCTC AGTGGCCCCC GCCTCCCAGC GCAGCATCCG GCTGCGGACA 900AAGAGGGTGA CCCGCACAGC CATCGCCATC TGTCTGGTCT TCTTTGTGTG CTGGGCACCC 960TACTATGTGC TACAGCTGAC CCAGTTGTCC ATCAGCCGCC CGACCCTCAC CTTTGTCTAC 1020TTATACAATG CGGCCATCAG CTTGGGCTAT GCCAACAGCT GCCTCAACCC CTTTGTGTAC 1080ATCGTGCTCT GTGAGACGTT CCGCAAACGC TTGGTCCTGT CGGTGAAGCC TGCAGCCCAG 1140GGGCAGCTTC GCGCTGTCAG CAACGCTCAG ACGGCTGACG AGGAGAGGAC AGAAAGCAAA 1200GGCACCTGA 1209(53)SEQ ID NO52的资料(i)序列特征(A)长度402个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO52的序列描述Met Leu Cys Pro Ser Lys Thr Asp Gly Ser Gly His Ser Gly Arg Ile1 5 10 15His Gln Glu Thr His Gly Glu Gly Lys Arg Asp Lys Ile Ser Asn Ser20 25 30Glu Gly Arg Glu Asn Gly Gly Arg Gly Phe Gln Met Asn Gly Gly Ser35 40 45Leu Glu Ala Glu His Ala Ser Arg Met Ser Val Leu Arg Ala Lys Pro50 55 60Met Ser Asn Ser Gln Arg Leu Leu Leu Leu Ser Pro Gly Ser Pro Pro65 70 75 80Arg Thr Gly Ser Ile Ser Tyr Ile Asn Ile Ile Met Pro Ser Val Phe85 90 95Gly Thr Ile Cys Leu Leu Gly Ile Ile Gly Asn Ser Thr Val Ile Phe100 105 110Ala Val Val Lys Lys Ser Lys Leu His Trp Cys Asn Asn Val Pro Asp115 120 125Ile Phe Ile Ile Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly130 135 140Met Pro Phe Met Ile His Gln Leu Met Gly Asn Gly Val Trp His Phe145 150 155 160Gly Glu Thr Met Cys Thr Leu Ile Thr Ala Met Asp Ala Asn Ser Gln165 170 175Phe Thr Ser Thr Tyr Ile Leu Thr Ala Met Ala Ile Asp Arg Tyr Leu180 185 190Ala Thr Val His Pro lle Ser Ser Thr Lys Phe Arg Lys Pro Ser Val195 200 205Ala Thr Leu Val Ile Cys Leu Leu Trp Ala Leu Ser Phe Ile Ser Ile210 215 220Thr Pro Val Trp Leu Tyr Ala Arg Leu Ile Pro Phe Pro Gly Gly Ala225 230 235 240Val Gly Cys Gly Ile Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp245 250 255Phe Thr Leu Tyr Gln Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val260 265 270Ile Thr Ala Ala Tyr Val Arg Ile Leu Gln Arg Met Thr Ser Ser Val275 280 285Ala Pro Ala Ser Gln Arg Ser Ile Arg Leu Arg Thr Lys Arg Val Thr290 295 300Arg Thr Ala Ile Ala Ile Cys Leu Val Phe Phe Val Cys Trp Ala Pro305 310 315 320Tyr Tyr Val Leu Gln Leu Thr Gln Leu Ser Ile Ser Arg Pro Thr Leu325 330 335Thr Phe Val Tyr Leu Tyr Asn Ala Ala Ile Ser Leu Gly Tyr Ala Asn340 345 350Ser Cys Leu Asn Pro Phe Val Tyr Ile Val Leu Cys Glu Thr Phe Arg355 360 365Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gln Gly Gln Leu Arg370 375 380Ala Val Ser Asn Ala Gln Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys385 390 395 400Gly Thr(54)SEQ ID NO53的资料(i)序列特征(A)长度27个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO53的序列描述GGCGGATCCA TGGATGTGAC TTCCCAA27(55)SEQ ID NO54的资料(i)序列特征(A)长度27个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO54的序列描述GGCGGATCCC TACACGGCAC TGCTGAA 27(56)SEQ ID NO55的资料(i)序列特征(A)长度1128个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO55的序列描述ATGGATGTGA CTTCCCAAGC CCGGGGCGTG GGCCTGGAGA TGTACCCAGG CACCGCGCAC 60GCTGCGGCCC CCAACACCAC CTCCCCCGAG CTCAACCTGT CCCACCCGCT CCTGGGCACC 120GCCCTGGCCA ATGGGACAGG TGAGCTCTCG GAGCACCAGC AGTACGTGAT CGGCCTGTTC 180CTCTCGTGCC TCTACACCAT CTTCCTCTTC CCCATCGGCT TTGTGGGCAA CATCCTGATC 240CTGGTGGTGA ACATCAGCTT CCGCGAGAAG ATGACCATCC CCGACCTGTA CTTCATCAAC 300CTGGCGGTGG CGGACCTCAT CCTGGTGGCC GACTCCCTCA TTGAGGTGTT CAACCTGCAC 360GAGCGGTACT ACGACATCGC CGTCCTGTGC ACCTTCATGT CGCTCTTCCT GCAGGTCAAC 420ATGTACAGCA GCGTCTTCTT CCTCACCTGG ATGAGCTTCG ACCGCTACAT CGCCCTGGCC 480AGGGCCATGC GCTGCAGCCT GTTCCGCACC AAGCACCACG CCCGGCTGAG CTGTGGCCTC 540ATCTGGATGG CATCCGTGTC AGCCACGCTG GTGCCCTTCA CCGCCGTGCA CCTGCAGCAC 600ACCGACGAGG CCTGCTTCTG TTTCGCGGAT GTCCGGGAGG TGCAGTGGCT CGAGGTCACG 660CTGGGCTTCA TCGTGCCCTT CGCCATCATC GGCCTGTGCT ACTCCCTCAT TGTCCGGGTG 720CTGGTCAGGG CGCACCGGCA CCGTGGGCTG CGGCCCCGGC GGCAGAAGGC GCTCCGCATG 780ATCCTCGCGG TGGTGCTGGT CTTCTTCGTC TGCTGGCTGC CGGAGAACGT CTTCATCAGC 840GTGCACCTCC TGCAGCGGAC GCAGCCTGGG GCCGCTCCCT GCAAGCAGTC TTTCCGCCAT 900GCCCACCCCC TCACGGGCCA CATTGTCAAC CTCGCCGCCT TCTCCAACAG CTGCCTAAAC 960CCCCTCATCT ACAGCTTTCT CGGGGAGACC TTCAGGGACA AGCTGAGGCT GTACATTGAG 1020CAGAAAACAA ATTTGCCGGC CCTGAACCGC TTCTGTCACG CTGCCCTGAA GGCCGTCATT 1080CCAGACAGCA CCGAGCAGTC GGATGTGAGG TTCAGCAGTG CCGTGTGA 1128(57)SEQ ID NO56的资料(i)序列特征(A)长度375个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO56的序列描述Met Asp Val Thr Ser Gln Ala Arg Gly Val Gly Leu Glu Met Tyr Pro1 5 10 15Gly Thr Ala His Ala Ala Ala Pro Asn Thr Thr Ser Pro Glu Leu Asn20 25 30Leu Ser His Pro Leu Leu Gly Thr Ala Leu Ala Asn Gly Thr Gly Glu35 40 45Leu Ser Glu His Gln Glr Tyr Val Ile Gly Leu Phe Leu Ser Cys Leu50 55 60Tyr Thr Ile Phe Leu Phe Pro Ile Gly Phe Val Gly Asn Ile Leu Ile65 70 75 80Leu Val Val Asn Ile Ser Phe Arg Glu Lys Met Thr Ile Pro Asp Leu85 90 95Tyr Phe Ile Asn Leu Ala Val Ala Asp Leu Ile Leu Val Ala Asp Ser100 105 110Leu Ile Glu Val Phe Asn Leu His Glu Arg Tyr Tyr Asp Ile Ala Val115 120 125Leu Cys Thr Phe Met Ser Leu Phe Leu Gln Val Asn Met Tyr Ser Ser130 135 140Val Phe Phe Leu Thr Trp Met Ser Phe Asp Arg Tyr Ile Ala Leu Ala145 150 155 160Arg Ala Met Arg Cys Ser Leu Phe Arg Thr Lys His His Ala Arg Leu165 170 175Ser Cys Gly Leu Ile Trp Met Ala Ser Val Ser Ala Thr Leu Val Pro180 185 190Phe Thr Ala Val His Leu Gln His Thr Asp Glu Ala Cys Phe Cys Phe195 200 205Ala Asp Val Arg Glu Val Gln Trp Leu Glu Val Thr Leu Gly Phe Ile210 215 220Val Pro Phe Ala Ile Ile Gly Leu Cys Tyr Ser Leu Ile Val Arg Val225 230 235 240Leu Val Arg Ala His Arg His Arg Gly Leu Arg Pro Arg Arg Gln Lys245 250 255Ala Leu Arg Met Ile Leu Ala Val Val Leu Val Phe Phe Val Cys Trp260 265 270Leu Pro Glu Asn Val Phe Ile Ser Val His Leu Leu Gln Arg Thr Gln275 280 285Pro Gly Ala Ala Pro Cys Lys Gln Ser Phe Arg His Ala His Pro Leu290 295 300Thr Gly His Ile Val Asn Leu Ala Ala Phe Ser Asn Ser Cys Leu Asn305 310 315 320Pro Leu Ile Tyr Ser Phe Leu Gly Glu Thr Phe Arg Asp Lys Leu Arg325 330 335Leu Tyr Ile Glu Gln Lys Thr Asn Leu Pro Ala Leu Asn Arg Phe Cys340 345 350His Ala Ala Leu Lys Ala Val Ile Pro Asp Ser Thr Glu Gln Ser Asp355 360 365Val Arg Phe Ser Ser Ala Val370 375(58)SEQ ID NO57的资料(i)序列特征(A)长度31个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO57的序列描述AAGGAATTCA CGGCCGGGTG ATGCCATTCC C31(59)SEQ ID NO58的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO58的序列描述GGTGGATCCA TAAACACGGG CGTTGAGGAC 30(60)SEQ ID NO59的资料(i)序列特征(A)长度960个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO59的序列描述atgccattcc caaactgctc agcccccagc actgtggtgg ccacagctgt gggtgtcttg 60ctggggctgg agtgtgggct gggtctgctg ggcaacgcgg tggcgctgtg gaccttcctg 120ttccgggtca gggtgtggaa gccgtacgct gtctacctgc tcaacctggc cctggctgac 180ctgctgttgg ctgcgtgcct gcctttcctg gccgccttct acctgagcct ccaggcttgg 240catctgggcc gtgtgggctg ctgggccctg cgcttcctgc tggacctcag ccgcagcgtg 300gggatggcct tcctggccgc cgtggctttg gaccggtacc tccgtgtggt ccaccctcgg 360cttaaggtca acctgctgtc tcctcaggcg gccctggggg tctcgggcct cgtctggctc 420ctgatggtcg ccctcacctg cccgggcttg ctcatctctg aggccgccca gaactccacc 480aggtgccaca gtttctactc cagggcagac ggctccttca gcatcatctg gcaggaagca 540ctctcctgcc ttcagtttgt cctccccttt ggcctcatcg tgttctgcaa tgcaggcatc 600atcagggctc tccagaaaag actccgggag cctgagaaac agcccaagct tcagcgggcc 660caggcactgg tcaccttggt ggtggtgctg tttgctctgt gctttctgcc ctgcttcctg 720gccagagtcc tgatgcacat cttccagaat ctggggagct gcagggccct ttgtgcagtg 780gctcatacct cggatgtcac gggcagcctc acctacctgc acagtgtcgt caaccccgtg 840gtatactgct tctccagccc caccttcagg agctcctatc ggagggtctt ccacaccctc 900cgaggcaaag ggcaggcagc agagccccca gatttcaacc ccagagactc ctattcctga 960(61)SEQ ID NO60的资料(i)序列特征(A)长度319个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO60的序列描述Met Pro Phe Pro Asn Cys Ser Ala Pro Ser Thr Val Val Ala Thr Ala1 5 10 15Val Gly Val Leu Leu Gly Leu Glu Cys Gly Leu Gly Leu Leu Gly Asn20 25 30Ala Val Ala Leu Trp Thr Phe Leu Phe Arg Val Arg Val Trp Lys Pro35 40 45Tyr Ala Val Tyr Leu Leu Asn Leu Ala Leu Ala Asp Leu Leu Leu Ala50 55 60Ala Cys Leu Pro Phe Leu Ala Ala Phe Tyr Leu Ser Leu Gln Ala Trp65 70 75 80HIs Leu Gly Arg Val Gly Cys Trp Ala Leu Arg Phe Leu Leu Asp Leu85 90 95Ser Arg Ser Val Gly Met Ala Phe Leu Ala Ala Val Ala Leu Asp Arg100 105 110Tyr Leu Arg Val Val His Pro Arg Leu Lys Val Asn Leu Leu Ser Pro115 120 125Gln Ala Ala Leu Gly Val Ser Gly Leu Val Trp Leu Leu Met Val Ala130 135 140Leu Thr Cys Pro Gly Leu Leu Ile Ser Glu Ala Ala Gln Asn Ser Thr145 150 155 160Arg Cys His Ser Phe Tyr Ser Arg Ala Asp Gly Ser Phe Ser Ile Ile165 170 175Trp Gln Glu Ala Leu Ser Cys Leu Gln Phe Val Leu Pro Phe Gly Leu180 185 190Ile Val Phe Cys Asn Ala Gly Ile Ile Arg Ala Leu Gln Lys Arg Leu195 200 205Arg Glu Pro Glu Lys Gln Pro Lys Leu Gln Arg Ala Gln Ala Leu Val210 215 220Thr Leu Val Val Val Leu Phe Ala Leu Cys Phe Leu Pro Cys Phe Leu225 230 235 240Ala Arg Val Leu Met His Ile Phe Gln Asn Leu Gly Ser Cys Arg Ala245 250 255Leu Cys Ala Val Ala His Thr Ser Asp Val Thr Gly Ser Leu Thr Tyr260 265 270Leu His Ser Val Val Asn Pro Val Val Tyr Cys Phe Ser Ser Pro Thr275 280 285Phe Arg Ser Ser Tyr Arg Arg Val Phe His Thr Leu Arg Gly Lys Gly290 295 300Gln Ala Ala Glu Pro Pro Asp Phe Asn Pro Arg Asp Ser Tyr Ser305 310 315(62)SEQ ID NO61的资料(i)序列特征(A)长度1143个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO61的序列描述ATGGAGGAAG GTGGTGATTT TGACAACTAC TATGGGGCAG ACAACCAGTC TGAGTGTGAG 60TACACAGACT GGAAATCCTC GGGGGCCCTC ATCCCTGCCA TCTACATGTT GGTCTTCCTC 120CTGGGCACCA CGGGAAACGG TCTGGTGCTC TGGACCGTGT TTCGGAGCAG CCGGGAGAAG 180AGGCGCTCAG CTGATATCTT CATTGCTAGC CTGGCGGTGG CTGACCTGAC CTTCGTGGTG 240ACGCTGCCCC TGTGGGCTAC CTACACGTAC CGGGACTATG ACTGGCCCTT TGGGACCTTC 300TTCTGCAAGC TCAGCAGCTA CCTCATCTTC GTCAACATGT ACGCCAGCGT CTTCTGCCTC 360ACCGGCCTCA GCTTCGACCG CTACCTGGCC ATCGTGAGGC CAGTGGCCAA TGCTCGGCTG 420AGGCTGCGGG TCAGCGGGGC CGTGGCCACG GCAGTTCTTT GGGTGCTGGC CGCCCTCCTG 480GCCATGCCTG TCATGGTGTT ACGCACCACC GGGGACTTGG AGAACACCAC TAAGGTGCAG 540TGCTACATGG ACTACTCCAT GGTGGCCACT GTGAGCTCAG AGTGGGCCTG GGAGGTGGGC 600CTTGGGGTCT CGTCCACCAC CGTGGGCTTT GTGGTGCCCT TCACCATCAT GCTGACCTGT 660TACTTCTTCA TCGCCCAAAC CATCGCTGGC CACTTCCGCA AGGAACGCAT CGAGGGCCTG 720CGGAAGCGGC GCCGGCTGCT CAGCATCATC GTGGTGCTGG TGGTGACCTT TGCCCTGTGC 780TGGATGCCCT ACCACCTGGT GAAGACGCTG TACATGCTGG GCAGCCTGCT GCACTGGCCC 840TGTGACTTTG ACCTCTTCCT CATGAACATC TTCCCCTACT GCACCTGCAT CAGCTACGTC 900AACAGCTGCC TCAACCCCTT CCTCTATGCC TTTTTCGACC CCCGCTTCCG CCAGGCCTGC 960ACCTCCATGC TCTGCTGTGG CCAGAGCAGG TGCGCAGGCA CCTCCCACAG CAGCAGTGGG 1020GAGAAGTCAG CCAGCTACTC TTCGGGGCAC AGCCAGGGGC CCGGCCCCAA CATGGGCAAG 1080GGTGGAGAAC AGATGCACGA GAAATCCATC CCCTACAGCC AGGAGACCCT TGTGGTTGAC 1140TAG 1143(63)SEQ ID NO62的资料(i)序列特征(A)长度380个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO62的序列描述Met Glu Glu Gly Gly Asp Phe Asp Asn Tyr Tyr Gly Ala Asp Asn Gln1 5 10 15Ser Glu Cys Glu Tyr Thr Asp Trp Lys Ser Ser Gly Ala Leu Ile Pro20 25 30Ala Ile Tyr Met Leu Val Phe Leu Leu Gly Thr Thr Gly Asn Gly Leu35 40 45Val Leu Trp Thr Val Phe Arg Ser Ser Arg Glu Lys Arg Arg Ser Ala50 55 60Asp Ile Phe Ile Ala Ser Leu Ala Val Ala Asp Leu Thr Phe Val Val65 70 75 80Thr Leu Pro Leu Trp Ala Thr Tyr Thr Tyr Arg Asp Tyr Asp Trp Pro85 90 95Phe Gly Thr Phe Phe Cys Lys Leu Ser Ser Tyr Leu Ile Phe Val Asn100 105 110Met Tyr Ala Ser Val Phe Cys Leu Thr Gly Leu Ser Phe Asp Arg Tyr115 120 125Leu Ala Ile Val Arg Pro Val Ala Asn Ala Arg Leu Arg Leu Arg Val130 135 140Ser Gly Ala Val Ala Thr Ala Val Leu Trp Val Leu Ala Ala Leu Leu145 150 155 160Ala Met Pro Val Met Val Leu Arg Thr Thr Gly Asp Leu Glu Asn Thr165 170 175Thr Lys Val Gln Cys Tyr Met Asp Tyr Ser Met Val Ala Thr Val Ser180 185 190Ser Glu Trp Ala Trp Glu Val Gly Leu Gly Val Ser Ser Thr Thr Val195 200 205Gly Phe Val Val Pro Phe Thr Ile Met Leu Thr Cys Tyr Phe Phe Ile210 215 220Ala Gln Thr Ile Ala Gly His Phe Arg Lys Glu Arg Ile Glu Gly Leu225 230 235 240Arg Lys Arg Arg Arg Leu Leu Ser Ile Ile Val Val Leu Val Val Thr245 250 255Phe Ala Leu Cys Trp Met Pro Tyr His Leu Val Lys Thr Leu Tyr Met260 265 270Leu Gly Ser Leu Leu His Trp Pro Cys Asp Phe Asp Leu Phe Leu Met275 280 285Asn Ile Phe Pro Tyr Cys Thr Cys Ile Ser Tyr Val Asn Ser Cys Leu290 295 300Asn Pro Phe Leu Tyr Ala Phe Phe Asp Pro Arg Phe Arg Gln Ala Cys305 310 315 320Thr Ser Met Leu Cys Cys Gly Gln Ser Arg Cys Ala Gly Thr Ser His325 330 335Ser Ser Ser Gly Glu Lys Ser Ala Ser Tyr Ser Ser Gly His Ser Gln340 345 350Gly Pro Gly Pro Asn Met Gly Lys Gly Gly Glu Gln Met His Glu Lys355 360 365Ser Ile Pro Tyr Ser Gln Glu Thr Leu Val Val Asp370 375 380(64)SEQ ID NO63的资料(i)序列特征(A)长度31个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO63的序列描述TGAGAATTCT GGTGACTCAC AGCCGGCACA G 31(65)SEQ ID NO64的资料(i)序列特征(A)长度31个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO64的序列描述GCCGGATCCA AGGAAAAGCA GCAATAAAAG G31(66)SEQ ID NO65的资料(i)序列特征(A)长度1119个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO65的序列描述ATGAACTACC CGCTAACGCT GGAAATGGAC CTCGAGAACC TGGAGGACCT GTTCTGGGAA 60CTGGACAGAT TGGACAACTA TAACGACACC TCCCTGGTGG AAAATCATCT CTGCCCTGCC 120ACAGAGGGTC CCCTCATGGC CTCCTTCAAG GCCGTGTTCG TGCCCGTGGC CTACAGCCTC 180ATCTTCCTCC TGGGCGTGAT CGGCAACGTC CTGGTGCTGG TGATCCTGGA GCGGCACCGG 240CAGACACGCA GTTCCACGGA GACCTTCCTG TTCCACCTGG CCGTGGCCGA CCTCCTGCTG 300GTCTTCATCT TGCCCTTTGC CGTGGCCGAG GGCTCTGTGG GCTGGGTCCT GGGGACCTTC 360CTCTGCAAAA CTGTGATTGC CCTGCACAAA GTCAACTTCT ACTGCAGCAG CCTGCTCCTG 420GCCTGCATCG CCGTGGACCG CTACCTGGCC ATTGTCCACG CCGTCCATGC CTACCGCCAC 480CGCCGCCTCC TCTCCATCCA CATCACCTGT GGGACCATCT GGCTGGTGGG CTTCCTCCTT 540GCCTTGCCAG AGATTCTCTT CGCCAAAGTC AGCCAAGGCC ATCACAACAA CTCCCTGCCA 600CGTTGCACCT TCTCCCAAGA GAACCAAGCA GAAACGCATG CCTGGTTCAC CTCCCGATTC 660CTCTACCATG TGGCGGGATT CCTGCTGCCC ATGCTGGTGA TGGGCTGGTG CTACGTGGGG 720GTAGTGCACA GGTTGCGCCA GGCCCAGCGG CGCCCTCAGC GGCAGAAGGC AGTCAGGGTG 780GCCATCCTGG TGACAAGCAT CTTCTTCCTC TGCTGGTCAC CCTACCACAT CGTCATCTTC 840CTGGACACCC TGGCGAGGCT GAAGGCCGTG GACAATACCT GCAAGCTGAA TGGCTCTCTC 900CCCGTGGCCA TCACCATGTG TGAGTTCCTG GGCCTGGCCC ACTGCTGCCT CAACCCCATG 960CTCTACACTT TCGCCGGCGT GAAGTTCCGC AGTGACCTGT CGCGGCTCCT GACCAAGCTG 1020GGCTGTACCG GCCCTGCCTC CCTGTGCCAG CTCTTCCCTA GCTGGCGCAG GAGCAGTCTC 1080TCTGAGTCAG AGAATGCCAC CTCTCTCACC ACGTTCTAG1119(67)SEQ ID NO66的资料(i)序列特征(A)长度372个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO66的序列描述Met Asn Tyr Pro Leu Thr Leu Glu Met Asp Leu Glu Asn Leu Glu Asp1 5 10 15Leu Phe Trp Glu Leu Asp Arg Leu Asp Asn Tyr Asn Asp Thr Ser Leu20 25 30Val Glu Asn His Leu Cys Pro Ala Thr Glu Gly Pro Leu Met Ala Ser35 40 45Phe Lys Ala Val Phe Val Pro Val Ala Tyr Ser Leu Ile Phe Leu Leu50 55 60Gly Val Ile Gly Asn Val Leu Val Leu Val Ile Leu Glu Arg His Arg65 70 75 80Gln Thr Arg Ser Ser Thr Glu Thr Phe Leu Phe His Leu Ala Val Ala85 90 95Asp Leu Leu Leu Val Phe Ile Leu Pro Phe Ala Val Ala Glu Gly Ser100 105 110Val Gly Trp Val Leu Gly Thr Phe Leu Cys Lys Thr Val Ile Ala Leu115 120 125His Lys Val Asn Phe Tyr Cys Ser Ser Leu Leu Leu Ala Cys Ile Ala130 135 140Val Asp Arg Tyr Leu Ala Ile Val His Ala Val His Ala Tyr Arg His145 150 155 160Arg Arg Leu Leu Ser Ile His Ile Thr Cys Gly Thr Ile Trp Leu Val165 170 175Gly Phe Leu Leu Ala Leu Pro Glu Ile Leu Phe Ala Lys Val Ser Gln180 185 190Gly His His Asn Asn Ser Leu Pro Arg Cys Thr Phe Ser Gln Glu Asn195 200 205Gln Ala Glu Thr His Ala Trp Phe Thr Ser Arg Phe Leu Tyr His Val
210 215 220Ala Gly Phe Leu Leu Pro Met Leu Val Met Gly Trp Cys Tyr Val Gly225 230 235 240Val Val His Arg Leu Arg Gln Ala Gln Arg Arg Pro Gln Arg Gln Lys245 250 255Ala Val Arg Val Ala Ile Leu Val Thr Ser Ile Phe Phe Leu Cys Trp260 265 270Ser Pro Tyr His Ile Val Ile Phe Leu Asp Thr Leu Ala Arg Leu Lys275 280 285Ala Val Asp Asn Thr Cys Lys Leu Asn Gly Ser Leu Pro Val Ala Ile290 295 300Thr Met Cys Glu Phe Leu Gly Leu Ala His Cys Cys Leu Asn Pro Met305 310 315 320Leu Tyr Thr Phe Ala Gly Val Lys Phe Arg Ser Asp Leu Ser Arg Leu325 330 335Leu Thr Lys Leu Gly Cys Thr Gly Pro Ala Ser Leu Cys Gln Leu Phe340 345 350Pro Ser Trp Arg Arg Ser Ser Leu Ser Glu Ser Glu Asn Ala Thr Ser355 360 365Leu Thr Thr Phe370(68)SEQ ID NO67的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO67的序列描述CAAAGCTTGA AAGCTGCACG GTGCAGAGAC 30(69)SEQ ID NO68的资料(i)序列特征(A)长度30个碱基对
(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组)(xi)SEQ ID NO68的序列描述GCGGATCCCG AGTCACACCC TGGCTGGGCC 30(70)SEQ ID NO69的资料(i)序列特征(A)长度1128个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO69的序列描述ATGGATGTGA CTTCCCAAGC CCGGGGCGTG GGCCTGGAGA TGTACCCAGG CACCGCGCAG 60CCTGCGGCCC CCAACACCAC CTCCCCCGAG CTCAACCTGT CCCACCCGCT CCTGGGCACC 120GCCCTGGCCA ATGGGACAGG TGAGCTCTCG GAGCACCAGC AGTACGTGAT CGGCCTGTTC 180CTCTCGTGCC TCTACACCAT CTTCCTCTTC CCCATCGGCT TTGTGGGCAA CATCCTGATC 240CTGGTGGTGA ACATCAGCTT CCGCGAGAAG ATGACCATCC CCGACCTGTA CTTCATCAAC 300CTGGCGGTGG CGGACCTCAT CCTGGTGGCC GACTCCCTCA TTGAGGTGTT CAACCTGCAC 360GAGCGGTACT ACGACATCGC CGTCCTGTGC ACCTTCATGT CGCTCTTCCT GCAGGTCAAC 420ATGTACAGCA GCGTCTTCTT CCTCACCTGG ATGAGCTTCG ACCGCTACAT CGCCCTGGCC 480AGGGCCATGC GCTGCAGCCT GTTCCGCACC AAGCACCACG CCCGGCTGAG CTGTGGCCTC 540ATCTGGATGG CATCCGTGTC AGCCACGCTG GTGCCCTTCA CCGCCGTGCA CCTGCAGCAC 600ACCGACGAGG CCTGCTTCTG TTTCGCGGAT GTCCGGGAGG TGCAGTGGCT CGAGGTCACG 660CTGGGCTTCA TCGTGCCCTT CGCCATCATC GGCCTGTGCT ACTCCCTCAT TGTCCGGGTG 720CTGGTCAGGG CGCACCGGCA CCGTGGGCTG CGGCCCCGGC GGCAGAAGGC GCTCCGCATG 780ATCCTCGCGG TGGTGCTGGT CTTCTTCGTC TGCTGGCTGC CGGAGAACGT CTTCATCAGC 840GTGCACCTCC TGCAGCGGAC GCAGCCTGGG GCCGCTCCCT GCAAGCAGTC TTTCCGCCAT 900GCCCACCCCC TCACGGGCCA CATTGTCAAC CTCACCGCCT TCTCCAACAG CTGCCTAAAC 960CCCCTCATCT ACAGCTTTCT CGGGGAGACC TTCAGGGACA AGCTGAGGCT GTACATTGAG 1020CAGAAAACAA ATTTGCCGGC CCTGAACCGC TTCTGTCACG CTGCCCTGAA GGCCGTCATT 1080CCAGACAGCA CCGAGCAGTC GGATGTGAGG TTCAGCAGTG CCGTGTAG 1128(71)SEQ ID NO70的资料(i)序列特征(A)长度375个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO70的序列描述Met Asp Val Thr Ser Gln Ala Arg Gly Val Gly Leu Glu Met Tyr Pro1 5 10 15Gly Thr Ala Gln Pro Ala Ala Pro Asn Thr Thr Ser Pro Glu Leu Asn20 25 30Leu Ser His Pro Leu Leu Gly Thr Ala Leu Ala Asn Gly Thr Gly Glu35 40 45Leu Ser Glu His Gln Gln Tyr Val Ile Gly Leu Phe Leu Ser Cys Leu50 55 60Tyr Thr Ile Phe Leu Phe Pro Ile Gly Phe Val Gly Asn Ile Leu Ile65 70 75 80Leu Val Val Asn Ile Ser Phe Arg Glu Lys Met Thr Ile Pro Asp Leu85 90 95Tyr Phe Ile Asn Leu Ala Val Ala Asp Leu Ile Leu Val Ala Asp Ser100 105 110Leu Ile Glu Val Phe Asn Leu His Glu Arg Tyr Tyr Asp Ile Ala Val115 120 125Leu Cys Thr Phe Met Ser Leu Phe Leu Gln Val Asn Met Tyr Ser Ser130 135 140Val Phe Phe Leu Thr Trp Met Ser Phe Asp Arg Tyr Ile Ala Leu Ala145 150 155 160Arg Ala Met Arg Cys Ser Leu Phe Arg Thr Lys His His Ala Arg Leu165 170 175Ser Cys Gly Leu Ile Trp Met Ala Ser Val Ser Ala Thr Leu Val Pro180 185 190Phe Thr Ala Val His Leu Gln His Thr Asp Glu Ala Cys Phe Cys Phe195 200 205Ala Asp Val Arg Glu Val Gln Trp Leu Glu Val Thr Leu Gly Phe Ile210 215 220Val Pro Phe Ala Ile Ile Gly Leu Cys Tyr Ser Leu Ile Val Arg Val225 230 235 240Leu Val Arg Ala His Arg His Arg Gly Leu Arg Pro Arg Arg Gln Lys245 250 255Ala Leu Arg Met Ile Leu Ala Val Val Leu Val Phe Phe Val Cys Trp
260 265 270Leu Pro Glu Asn Val Phe Ile Ser Val His Leu Leu Gln Arg Thr Gln275 280 285Pro Gly Ala Ala Pro Cys Lys Gln Ser Phe Arg His Ala His Pro Leu290 295 300Thr Gly His Ile Val Asn Leu Thr Ala Phe Ser Asn Ser Cys Leu Asn305 310 315 320Pro Leu Ile Tyr Ser Phe Leu Gly Glu Thr Phe Arg Asp Lys Leu Arg325 330 335Leu Tyr Ile Glu Gln Lys Thr Asn Leu Pro Ala Leu Asn Arg Phe Cys340 345 350His Ala Ala Leu Lys Ala Val Ile Pro Asp Ser Thr Glu Gln Ser Asp355 360 365Val Arg Phe Ser Ser Ala Val370 375(72)SEQ ID NO71的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO71的序列描述ACAGAATTCC TGTGTGGTTT TACCGCCCAG 30(73)SEQ ID NO72的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO72的序列描述CTCGGATCCA GGCAGAAGAG TCGCCTATGG 30(74)SEQ ID NO73的资料(i)序列特征(A)长度1137个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO73的序列描述ATGGACCTGG GGAAACCAAT GAAAAGCGTG CTGGTGGTGG CTCTCCTTGT CATTTTCCAG 60GTATGCCTGT GTCAAGATGA GGTCACGGAC GATTACATCG GAGACAACAC CACAGTGGAC 120TACACTTTGT TCGAGTCTTT GTGCTCCAAG AAGGACGTGC GGAACTTTAA AGCCTGGTTC 180CTCCCTATCA TGTACTCCAT CATTTGTTTC GTGGGCCTAC TGGGCAATGG GCTGGTCGTG 240TTGACCTATA TCTATTTCAA GAGGCTCAAG ACCATGACCG ATACCTACCT GCTCAACCTG 300GCGGTGGCAG ACATCCTCTT CCTCCTGACC CTTCCCTTCT GGGCCTACAG CGCGGCCAAG 360TCCTGGGTCT TCGGTGTCCA CTTTTGCAAG CTCATCTTTG CCATCTACAA GATGAGCTTC 420TTCAGTGGCA TGCTCCTACT TCTTTGCATC AGCATTGACC GCTACGTGGC CATCGTCCAG 480GCTGTCTCAG CTCACCGCCA CCGTGCCCGC GTCCTTCTCA TCAGCAAGCT GTCCTGTGTG 540GGCATCTGGA TACTAGCCAC AGTGCTCTCC ATCCCAGAGC TCCTGTACAG TGACCTCCAG 600AGGAGCAGCA GTGAGCAAGC GATGCGATGC TCTCTCATCA CAGAGCATGT GGAGGCCTTT 660ATCACCATCC AGGTGGCCCA GATGGTGATC GGCTTTCTGG TCCCCCTGCT GGCCATGAGC 720TTCTGTTACC TTGTCATCAT CCGCACCCTG CTCCAGGCAC GCAACTTTGA GCGCAACAAG 780GCCATCAAGG TGATCATCGC TGTGGTCGTG GTCTTCATAG TCTTCCAGCT GCCCTACAAT 840GGGGTGGTCC TGGCCCAGAC GGTGGCCAAC TTCAACATCA CCAGTAGCAC CTGTGAGCTC 900AGTAAGCAAC TCAACATCGC CTACGACGTC ACCTACAGCC TGGCCTGCGT CCGCTGCTGC 960GTCAACCCTT TCTTGTACGC CTTCATCGGC GTCAAGTTCC GCAACGATCT CTTCAAGCTC 1020TTCAAGGACC TGGGCTGCCT CAGCCAGGAG CAGCTCCGGC AGTGGTCTTC CTGTCGGCAC 1080ATCCGGCGCT CCTCCATGAG TGTGGAGGCC GAGACCACCA CCACCTTCTC CCCATAG1137(75)SEQ ID NO74的资料(i)序列特征(A)长度378个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO74的序列描述Met Asp Leu Gly Lys Pro Met Lys Ser Val Leu Val Val Ala Leu Leu1 5 10 15Val Ile Phe Gln Val Cys Leu Cys Gln Asp Glu Val Thr Asp Asp Tyr20 25 30Ile Gly Asp Asn Thr Thr Val Asp Tyr Thr Leu Phe Glu Ser Leu Cys
35 40 45Ser Lys Lys Asp Val Arg Asn Phe Lys Ala Trp Phe Leu Pro Ile Met50 55 60Tyr Ser Ile Ile Cys Phe Val Gly Leu Leu Gly Asn Gly Leu Val Val65 70 75 80Leu Thr Tyr Ile Tyr Phe Lys Arg Leu Lys Thr Met Thr Asp Thr Tyr85 90 95Leu Leu Asn Leu Ala Val Ala Asp Ile Leu Phe Leu Leu Thr Leu Pro100 105 110Phe Trp Ala Tyr Ser Ala Ala Lys Ser Trp Val Phe Gly Val His Phe115 120 125Cys Lys Leu Ile Phe Ala Ile Tyr Lys Met Ser Phe Phe Ser Gly Met130 135 140Leu Leu Leu Leu Cys Ile Ser Ile Asp Arg Tyr Val Ala Ile Val Gln145 150 155 160Ala Val Ser Ala His Arg His Arg Ala Arg Val Leu Leu Ile Ser Lys165 170 175Leu Ser Cys Val Gly Ile Trp Ile Leu Ala Thr Val Leu Ser Ile Pro180 185 190Glu Leu Leu Tyr Ser Asp Leu Gln Arg Ser Ser Ser Glu Gln Ala Met195 200 205Arg Cys Ser Leu Ile Thr Glu His Val Glu Ala Phe Ile Thr Ile Gln210 215 220Val Ala Gln Met Val Ile Gly Phe Leu Val Pro Leu Leu Ala Met Ser225 230 235 240Phe Cys Tyr Leu Val Ile Ile Arg Thr Leu Leu Gln Ala Arg Asn Phe245 250 255Glu Arg Asn Lys Ala Ile Lys Val Ile Ile Ala Val Val Val Val Phe260 265 270Ile Val Phe Gln Leu Pro Tyr Asn Gly Val Val Leu Ala Gln Thr Val275 280 285Ala Asn Phe Asn Ile Thr Ser Ser Thr Cys Glu Leu Ser Lys Gln Leu290 295 300Asn Ile Ala Tyr Asp Val Thr Tyr Ser Leu Ala Cys Val Arg Cys Cys305 310 315 320Val Asn Pro Phe Leu Tyr Ala Phe Ile Gly Val Lys Phe Arg Asn Asp325 330 335Leu Phe Lys Leu Phe Lys Asp Leu Gly Cys Leu Ser Gln Glu Gln Leu340 345 350Arg Gln Trp Ser Ser Cys Arg His Ile Arg Arg Ser Ser Met Ser Val355 360 365Glu Ala Glu Thr Thr Thr Thr Phe Ser Pro370 375(76)SEQ ID NO75的资料(i)序列特征(A)长度32个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO75的序列描述CTGGAATTCA CCTGGACCAC CACCAATGGA TA 32(77)SEQ ID NO76的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO76的序列描述CTCGGATCCT GCAAAGTTTG TCATACAGTT 30(78)SEQ ID NO77的资料(i)序列特征(A)长度1085个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO77的序列描述ATGGATATAC AAATGGCAAA CAATTTTACT CCGCCCTCTG CAACTCCTCA GGGAAATGAC 60TGTGACCTCT ATGCACATCA CAGCACGGCC AGGATAGTAA TGCCTCTGCA TTACAGCCTC 120GTCTTCATCA TTGGGCTCGT GGGAAACTTA CTAGCCTTGG TCGTCATTGT TCAAAACAGG 180AAAAAAATCA ACTCTACCAC CCTCTATTCA ACAAATTTGG TGATTTCTGA TATACTTTTT 240ACCACGGCTT TGCCTACACG AATAGCCTAC TATGCAATGG GCTTTGACTG GAGAATCGGA 300GATGCCTTGT GTAGGATAAC TGCGCTAGTG TTTTACATCA ACACATATGC AGGTGTGAAC 360TTTATGACCT GCCTGAGTAT TGACCGCTTC ATTGCTGTGG TGCACCCTCT ACGCTACAAC 420AAGATAAAAA GGATTGAACA TGCAAAAGGC GTGTGCATAT TTGTCTGGAT TCTAGTATTT 480GCTCAGACAC TCCCACTCCT CATCAACCCT ATGTCAAAGC AGGAGGCTGA AAGGATTACA 540TGCATGGAGT ATCCAAACTT TGAAGAAACT AAATCTCTTC CCTGGATTCT GCTTGGGGCA 600TGTTTCATAG GATATGTACT TCCACTTATA ATCATTCTCA TCTGCTATTC TCAGATCTGC 660TGCAAACTCT TCAGAACTGC CAAACAAAAC CCACTCACTG AGAAATCTGG TGTAAACAAA 720AAGGCTCTCA ACACAATTAT TCTTATTATT GTTGTGTTTG TTCTCTGTTT CACACCTTAC 780CATGTTGCAA TTATTCAACA TATGATTAAG AAGCTTCGTT TCTCTAATTT CCTGGAATGT 840AGCCAAAGAC ATTCGTTCCA GATTTCTCTG CACTTTACAG TATGCCTGAT GAACTTCAAT 900TGCTGCATGG ACCCTTTTAT CTACTTCTTT GCATGTAAAG GGTATAAGAG AAAGGTTATG 960AGGATGCTGA AACGGCAAGT CAGTGTATCG ATTTCTAGTG CTGTGAAGTC AGCCCCTGAA 1020GAAAATTCAC GTGAAATGAC AGAAACGCAG ATGATGATAC ATTCCAAGTC TTCAAATGGA 1080AAGTGA1086(79)SEQ ID NO78的资料(i)序列特征(A)长度361个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO78的序列描述Met Asp Ile Gln Met Ala Asn Asn Phe Thr Pro Pro Ser Ala Thr Pro1 5 10 15Gln Gly Asn Asp Cys Asp Leu Tyr Ala His His Ser Thr Ala Arg Ile20 25 30Val Met Pro Leu His Tyr Ser Leu Val Phe Ile Ile Gly Leu Val Gly35 40 45Asn Leu Leu Ala Leu Val Val Ile Val Gln Asn Arg Lys Lys Ile Asn50 55 60Ser Thr Thr Leu Tyr Ser Thr Asn Leu Val Ile Ser Asp Ile Leu Phe65 70 75 80Thr Thr Ala Leu Pro Thr Arg Ile Ala Tyr Tyr Ala Met Gly Phe Asp
85 90 95Trp Arg Ile Gly Asp Ala Leu Cys Arg Ile Thr Ala Leu Val Phe Tyr100 105 110Ile Asn Thr Tyr Ala Gly Val Asn Phe Met Thr Cys Leu Ser Ile Asp115 120 125Arg Phe Ile Ala Val Val His Pro Leu Arg Tyr Asn Lys Ile Lys Argl30 135 140Ile Glu His Ala Lys Gly Val Cys Ile Phe Val Trp Ile Leu Val Phe145 150 155 160Ala Gln Thr Leu Pro Leu Leu Ile Asn Pro Met Ser Lys Gln Glu Ala165 170 175Glu Arg Ile Thr Cys Met Glu Tyr Pro Asn Phe Glu Glu Thr Lys Ser180 185 190Leu Pro Trp Ile Leu Leu Gly Ala Cys Phe Ile Gly Tyr Val Leu Pro195 200 205Leu Ile Ile Ile Leu Ile Cys Tyr Ser Gln Ile Cys Cys Lys Leu Phe210 215 220Arg Thr Ala Lys Gln Asn Pro Leu Thr Glu Lys Ser Gly Val Asn Lys225 230 235 240Lys Ala Leu Asn Thr Ile Ile Leu Ile Ile Val Val Phe Val Leu Cys245 250 255Phe Thr Pro Tyr His Val Ala Ile Ile Gln His Met Ile Lys Lys Leu260 265 270Arg Phe Ser Asn Phe Leu Glu Cys Ser Gln Arg His Ser Phe Gln Ile275 280 285Ser Leu His Phe Thr Val Cys Leu Met Asn Phe Asn Cys Cys Met Asp290 295 300Pro Phe Ile Tyr Phe Phe Ala Cys Lys Gly Tyr Lys Arg Lys Val Met305 310 315 320Arg Met Leu Lys Arg Gln Val Ser Val Ser Ile Ser Ser Ala Val Lys325 330 335Ser Ala Pro Glu Glu Asn Ser Arg Glu Met Thr Glu Thr Gln Met Met340 345 350Ile His Ser Lys Ser Ser Asn Gly Lys
355 360(80)SEQ ID NO79的资料(i)序列特征(A)长度31个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO79的序列描述CTGGAATTCT CCTGCTCATC CAGCCATGCG G 31(81) SEQ ID NO80的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO80的序列描述CCTGGATCCC CACCCCTACT GGGGCCTCAG 30(82)SEQ ID NO81的资料(i)序列特征(A)长度1446个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO81的序列描述ATGCGGTGGC TGTGGCCCCT GGCTGTCTCT CTTGCTGTGA TTTTGGCTGT GGGGCTAAGC 60AGGGTCTCTG GGGGTGCCCC CCTGCACCTG GGCAGGCACA GAGCCGAGAC CCAGGAGCAG 120CAGAGCCGAT CCAAGAGGGG CACCGAGGAT GAGGAGGCCA AGGGCGTGCA GCAGTATGTG 180CCTGAGGAGT GGGCGGAGTA CCCCCGGCCC ATTCACCCTG CTGGCCTGCA GCCAACCAAG 240CCCTTGGTGG CCACCAGCCC TAACCCCGAC AAGGATGGGG GCACCCCAGA CAGTGGGCAG 300GAACTGAGGG GCAATCTGAC AGGGGCACCA GGGCAGAGGC TACAGATCCA GAACCCCCTG 360TATCCGGTGA CCGAGAGCTC CTACAGTGCC TATGCCATCA TGCTTCTGGC GCTGGTGGTG 420TTTGCGGTGG GCATTGTGGG CAACCTGTCG GTCATGTGCA TCGTGTGGCA CAGCTACTAC 480CTGAAGAGCG CCTGGAACTC CATCCTTGCC AGCCTGGCCC TCTGGGATTT TCTGGTCCTC 540TTTTTCTGCC TCCCTATTGT CATCTTCAAC GAGATCACCA AGCAGAGGCT ACTGGGTGAC 600GTTTCTTGTC GTGCCGTGCC CTTCATGGAG GTCTCCTCTC TGGGAGTCAC GACTTTCAGC 660CTCTGTGCCC TGGGCATTGA CCGCTTCCAC GTGGCCACCA GCACCCTGCC CAAGGTGAGG 720CCCATCGAGC GGTGCCAATC CATCCTGGCC AAGTTGGCTG TCATCTGGGT GGGCTCCATG 780ACGCTGGCTG TGCCTGAGCT CCTGCTGTGG CAGCTGGCAC AGGAGCCTGC CCCCACCATG 840GGCACCCTGG ACTCATGCAT CATGAAACCC TCAGCCAGCC TGCCCGAGTC CCTGTATTCA 900CTGGTGATGA CCTACCAGAA CGCCCGCATG TGGTGGTACT TTGGCTGCTA CTTCTGCCTG 960CCCATCCTCT TCACAGTCAC CTGCCAGCTG GTGACATGGC GGGTGCGAGG CCCTCCAGGG 1020AGGAAGTCAG AGTGCAGGGC CAGCAAGCAC GAGCAGTGTG AGAGCCAGCT CAACAGCACC 1080GTGGTGGGCC TGACCGTGGT CTACGCCTTC TGCACCCTCC CAGAGAACGT CTGCAACATC 1140GTGGTGGCCT ACCTCTCCAC CGAGCTGACC CGCCAGACCC TGGACCTCCT GGGCCTCATC 1200AACCAGTTCT CCACCTTCTT CAAGGGCGCC ATCACCCCAG TGCTGCTCCT TTGCATCTGC 1260AGGCCGCTGG GCCAGGCCTT CCTGGACTGC TGCTGCTGCT GCTGCTGTGA GGAGTGCGGC 1320GGGGCTTCGG AGGCCTCTGC TGCCAATGGG TCGGACAACA AGCTCAAGAC CGAGGTGTCC 1380TCTTCCATCT ACTTCCACAA GCCCAGGGAG TCACCCCCAC TCCTGCCCCT GGGCACACCT 1440TGCTGA1446(83)SEQ ID NO82的资料(i)序列特征(A)长度481个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO82的序列描述Met Arg Trp Leu Trp Pro Leu Ala Val Ser Leu Ala Val Ile Leu Ala1 5 10 15Val Gly Leu Ser Arg Val Ser Gly Gly Ala Pro Leu His Leu Gly Arg20 25 30His Arg Ala Glu Thr Gln Glu Gln Gln Ser Arg Ser Lys Arg Gly Thr35 40 45Glu Asp Glu Glu Ala Lys Gly Val Gln Gln Tyr Val Pro Glu Glu Trp50 55 60Ala Glu Tyr Pro Arg Pro Ile His Pro Ala Gly Leu Gln Pro Thr Lys65 70 75 80Pro Leu Val Ala Thr Ser Pro Asn Pro Asp Lys Asp Gly Gly Thr Pro85 90 95Asp Ser Gly Gln Glu Leu Arg Gly Asn Leu Thr Gly Ala Pro Gly Gln100 105 110Arg Leu Gln Ile Gln Asn Pro Leu Tyr Pro Val Thr Glu Ser Ser Tyrl15 120 125Ser Ala Tyr Ala Ile Met Leu Leu Ala Leu Val Val Phe Ala Val Gly130 135 140Ile Val Gly Asn Leu Ser Val Met Cys Ile Val Trp His Ser Tyr Tyr145 150 155 160Leu Lys Ser Ala Trp Asn Ser Ile Leu Ala Ser Leu Ala Leu Trp Aspl65 170 175Phe Leu Val Leu Phe Phe Cys Leu Pro Ile Val Ile Phe Asn Glu Ile180 185 190Thr Lys Gln Arg Leu Leu Gly Asp Val Ser Cys Arg Ala Val Pro Phe195 200 205Met Glu Va1 Ser Ser Leu Gly Val Thr Thr Phe Ser Leu Cys Ala Leu210 215 220Gly Ile Asp Arg Phe His Val Ala Thr Ser Thr Leu Pro Lys Val Arg225 230 235 240Pro Ile Glu Arg Cys Gln Ser Ile Leu Ala Lys Leu Ala Val Ile Trp245 250 255Val Gly Ser Met Thr Leu Ala Val Pro Glu Leu Leu Leu Trp Gln Leu260 265 270Ala Gln Glu Pro Ala Pro Thr Met Gly Thr Leu Asp Ser Cys Ile Met275 280 285Lys Pro Ser Ala Ser Leu Pro Glu Ser Leu Tyr Ser Leu Val Met Thr290 295 300Tyr Gln Asn Ala Arg Met Trp Trp Tyr Phe Gly Cys Tyr Phe Cys Leu305 310 315 320Pro Ile Leu Phe Thr Val Thr Cys Gln Leu Val Thr Trp Arg Val Arg325 330 335Gly Pro Pro Gly Arg Lys Ser Glu Cys Arg Ala Ser Lys His Glu Gln340 345 350Cys Glu Ser Gln Leu Asn Ser Thr Val Val Gly Leu Thr Val Val Tyr355 360 365Ala Phe Cys Thr Leu Pro Glu Asn Val Cys Asn Ile Val Val Ala Tyr370 375 380Leu Ser Thr Glu Leu Thr Arg Gln Thr Leu Asp Leu Leu Gly Leu Ile385 390 395 400Asn Gln Phe Ser Thr Phe Phe Lys Gly Ala Ile Thr Pro Val Leu Leu405 410 415Leu Cys Ile Cys Arg Pro Leu Gly Gln Ala Phe Leu Asp Cys Cys Cys420 425 430Cys Cys Cys Cys Glu Glu Cys Gly Gly Ala Ser Glu Ala Ser Ala Ala435 440 445Asn Gly Ser Asp Asn Lys Leu Lys Thr Glu Val Ser Ser Ser Ile Tyr450 455 460Phe His Lys Pro Arg Glu Ser Pro Pro Leu Leu Pro Leu Gly Thr Pro465 470 475 480Cys(84)SEQ ID NO83的资料(i)序列特征(A)长度22个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO83的序列描述ATGTGGAACG CGACGCCCAG CG 22(85)SEQ ID NO84的资料(i)序列特征(A)长度22个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO84的序列描述TCATGTATTA ATACTAGATT CT 22(86)SEQ ID NO85的资料(i)序列特征(A)长度38个碱基对
(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO85的序列描述TACCATGTGG AACGCGACGC CCAGCGAAGA GCCGGGGT38(87)SEQ ID NO86的资料(i)序列特征(A)长度39个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO86的序列描述CGGAATTCAT GTATTAATAC TAGATTCTGT CCAGGCCCG 39(88)SEQ ID NO87的资料(i)序列特征(A)长度1101个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi) SEQ ID NO87的序列描述ATGTGGAACG CGACGCCCAG CGAAGAGCCG GGGTTCAACC TCACACTGGC CGACCTGGAC 60TGGGATGCTT CCCCCGGCAA CGACTCGCTG GGCGACGAGC TGCTGCAGCT CTTCCCCGCG 120CCGCTGCTGG CGGGCGTCAC AGCCACCTGC GTGGCACTCT TCGTGGTGGG TATCGCTGGC 180AACCTGCTCA CCATGCTGGT GGTGTCGCGC TTCCGCGAGC TGCGCACCAC CACCAACCTC 240TACCTGTCCA GCATGGCCTT CTCCGATCTG CTCATCTTCC TCTGCATGCC CCTGGACCTC 300GTTCGCCTCT GGCAGTACCG GCCCTGGAAC TTCGGCGACC TCCTCTGCAA ACTCTTCCAA 360TTCGTCAGTG AGAGCTGCAC CTACGCCACG GTGCTCACCA TCACAGCGCT GAGCGTCGAG 420CGCTACTTCG CCATCTGCTT CCCACTCCGG GCCAAGGTGG TGGTCACCAA GGGGCGGGTG 480AAGCTGGTCA TCTTCGTCAT CTGGGCCGTG GCCTTCTGCA GCGCCGGGCC CATCTTCGTG 540CTAGTCGGGG TGGAGCACGA GAACGGCACC GACCCTTGGG ACACCAACGA GTGCCGCCCC 600ACCGAGTTTG CGGTGCGCTC TGGACTGCTC ACGGTCATGG TGTGGGTGTC CAGCATCTTC 660TTCTTCCTTC CTGTCTTCTG TCTCACGGTC CTCTACAGTC TCATCGGCAG GAAGCTGTGG 720CGGAGGAGGC GCGGCGATGC TGTCGTGGGT GCCTCGCTCA GGGACCAGAA CCACAAGCAA 780ACCGTGAAAA TGCTGGCTGT AGTGGTGTTT GCCTTCATCC TCTGCTGGCT CCCCTTCCAC 840GTAGGGCGAT ATTTATTTTC CAAATCCTTT GAGCCTGGCT CCTTGGAGAT TGCTCAGATC 900AGCCAGTACT GCAACCTCGT GTCCTTTGTC CTCTTCTACC TCAGTGCTGC CATCAACCCC 960ATTCTGTACA ACATCATGTC CAAGAAGTAC CGGGTGGCAG TGTTCAGACT TCTGGGATTC 1020GAACCCTTCT CCCAGAGAAA GCTCTCCACT CTGAAAGATG AAAGTTCTCG GGCCTGGACA 1080GAATCTAGTA TTAATACATG A 1101(89)SEQ ID NO88的资料(i)序列特征(A)长度366个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO88的序列描述Met Trp Asn Ala Thr Pro Ser Glu Glu Pro Gly Phe Asn Leu Thr Leu1 5 10 15Ala Asp Leu Asp Trp Asp Ala Ser Pro Gly Asn Asp Ser Leu Gly Asp20 25 30Glu Leu Leu Gln Leu Phe Pro Ala Pro Leu Leu Ala Gly Val Thr Ala35 40 45Thr Cys Val Ala Leu Phe Val Val Gly Ile Ala Gly Asn Leu Leu Thr50 55 60Met Leu Val Val Ser Arg Phe Arg Glu Leu Arg Thr Thr Thr Asn Leu65 70 75 80Tyr Leu Ser Ser Met Ala Phe Ser Asp Leu Leu Ile Phe Leu Cys Met85 90 95Pro Leu Asp Leu Val Arg Leu Trp Gln Tyr Arg Pro Trp Asn Phe Gly100 105 110Asp Leu Leu Cys Lys Leu Phe Gln Phe Val Ser Glu Ser Cys Thr Tyr115 120 125Ala Thr Val Leu Thr Ile Thr Ala Leu Ser Val Glu Arg Tyr Phe Ala130 135 140Ile Cys Phe Pro Leu Arg Ala Lys Val Val Val Thr Lys Gly Arg Val145 150 155 160Lys Leu Val Ile Phe Val Ile Trp Ala Val Ala Phe Cys Ser Ala Gly165 170 175Pro Ile Phe Val Leu Val Gly Val Glu His Glu Asn Gly Thr Asp Pro180 185 190Trp Asp Thr Asn Glu Cys Arg Pro Thr Glu Phe Ala Val Arg Ser Gly195 200 205Leu Leu Thr Val Met Val Trp Val Ser Ser Ile Phe Phe Phe Leu Pro210 215 220Val Phe Cys Leu Thr Val Leu Tyr Ser Leu Ile Gly Arg Lys Leu Trp225 230 235 240Arg Arg Arg Arg Gly Asp Ala Val Val Gly Ala Ser Leu Arg Asp Gln245 250 255Asn His Lys Gln Thr Val Lys Met Leu Ala Val Val Val Phe Ala Phe260 265 270Ile Leu Cys Trp Leu Pro Phe His Val Gly Arg Tyr Leu Phe Ser Lys275 280 285Ser Phe Glu Pro Gly Ser Leu Glu Ile Ala Gln Ile Ser Gln Tyr Cys290 295 300Asn Leu Val Ser Phe Val Leu Phe Tyr Leu Ser Ala Ala Ile Asn Pro305 310 315 320Ile Leu Tyr Asn Ile Met Ser Lys Lys Tyr Arg Val Ala Val Phe Arg325 330 335Leu Leu Gly Phe Glu Pro Phe Ser Gln Arg Lys Leu Ser Thr Leu Lys340 345 350Asp Glu Ser Ser Arg Ala Trp Thr Glu Ser Ser Ile Asn Thr355 360 365(90) SEQ ID NO89的资料(i)序列特征(A)长度33个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO89的序列描述GCAAGCTTGT GCCCTCACCA AGCCATGCGA GCC 33(91)SEQ ID NO90的资料(i)序列特征
(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi) SEQ ID NO90的序列描述CGGAATTCAG CAATGAGTTC CGACAGAAGC 30(92)SEQ ID NO91的资料(i)序列特征(A)长度1842个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO91的序列描述ATGCGAGCCC CGGGCGCGCT TCTCGCCCGC ATGTCGCGGC TACTGCTTCT GCTACTGCTC 60AAGGTGTCTG CCTCTTCTGC CCTCGGGGTC GCCCCTGCGT CCAGAAACGA AACTTGTCTG 120GGGGAGAGCT GTGCACCTAC AGTGATCCAG CGCCGCGGCA GGGACGCCTG GGGACCGGGA 180AATTCTGCAA GAGACGTTCT GCGAGCCCGA GCACCCAGGG AGGAGCAGGG GGCAGCGTTT 240CTTGCGGGAC CCTCCTGGGA CCTGCCGGCG GCCCCGGGCC GTGACCCGGC TGCAGGCAGA 300GGGGCGGAGG CGTCGGCAGC CGGACCCCCG GGACCTCCAA CCAGGCCACC TGGCCCCTGG 360AGGTGGAAAG GTGCTCGGGG TCAGGAGCCT TCTGAAACTT TGGGGAGAGG GAACCCCACG 420GCCCTCCAGC TCTTCCTTCA GATCTCAGAG GAGGAAGAGA AGGGTCCCAG AGGCGCTGGC 480ATTTCCGGGC GTAGCCAGGA GCAGAGTGTG AAGACAGTCC CCGGAGCCAG CGATCTTTTT 540TACTGGCCAA GGAGAGCCGG GAAACTCCAG GGTTCCCACC ACAAGCCCCT GTCCAAGACG 600GCCAATGGAC TGGCGGGGCA CGAAGGGTGG ACAATTGCAC TCCCGGGCCG GGCGCTGGCC 660CAGAATGGAT CCTTGGGTGA AGGAATCCAT GAGCCTGGGG GTCCCCGCCG GGGAAACAGC 720ACGAACCGGC GTGTGAGACT GAAGAACCCC TTCTACCCGC TGACCCAGGA GTCCTATGGA 780GCCTACGCGG TCATGTGTCT GTCCGTGGTG ATCTTCGGGA CCGGCATCAT TGGCAACCTG 840GCGGTGATGA GCATCGTGTG CCACAACTAC TACATGCGGA GCATCTCCAA CTCCCTCTTG 900GCCAACCTGG CCTTCTGGGA CTTTCTCATC ATCTTCTTCT GCCTTCCGCT GGTCATCTTC 960CACGAGCTGA CCAAGAAGTG GCTGCTGGAG GACTTCTCCT GCAAGATCGT GCCCTATATA 1020GAGGTCGCTT CTCTGGGAGT CACCACTTTC ACCTTATGTG CTCTGTGCAT AGACCGCTTC 1080CGTGCTGCCA CCAACGTACA GATGTACTAC GAAATGATCG AAAACTGTTC CTCAACAACT 1140GCCAAACTTG CTGTTATATG GGTGGGAGCT CTATTGTTAG CACTTCCAGA AGTTGTTCTC 1200CGCCAGCTGA GCAAGGAGGA TTTGGGGTTT AGTGGCCGAG CTCCGGCAGA AAGGTGCATT 1260ATTAAGATCT CTCCTGATTT ACCAGACACC ATCTATGTTC TAGCCCTCAC CTACGACAGT 1320GCGAGACTGT GGTGGTATTT TGGCTGTTAC TTTTGTTTGC CCACGCTTTT CACCATCACC 1380TGCTCTCTAG TGACTGCGAG GAAAATCCGC AAAGCAGAGA AAGCCTGTAC CCGAGGGAAT 1440AAACGGCAGA TTCAACTAGA GAGTCAGATG AACTGTACAG TAGTGGCACT GACCATTTTA 1500TATGGATTTT GCATTATTCC TGAAAATATC TGCAACATTG TTACTGCCTA CATGGCTACA 1560GGGGTTTCAC AGCAGACAAT GGACCTCCTT AATATCATCA GCCAGTTCCT TTTGTTCTTT 1620AAGTCCTGTG TCACCCCAGT CCTCCTTTTC TGTCTCTGCA AACCCTTCAG TCGGGCCTTC 1680ATGGAGTGCT GCTGCTGTTG CTGTGAGGAA TGCATTCAGA AGTCTTCAAC GGTGACCAGT 1740GATGACAATG ACAACGAGTA CACCACGGAA CTCGAACTCT CGCCTTTCAG TACCATACGC 1800CGTGAAATGT CCACTTTTGC TTCTGTCGGA ACTCATTGCT GA1842(93) SEQ ID NO92的资料(i)序列特征(A)长度613个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO92的序列描述Met Arg Ala Pro Gly Ala Leu Leu Ala Arg Met Ser Arg Leu Leu Leu1 5 10 15Leu Leu Leu Leu Lys Val Ser Ala Ser Ser Ala Leu Gly Val Ala Pro20 25 30Ala Ser Arg Asn Glu Thr Cys Leu Gly Glu Ser Cys Ala Pro Thr Val35 40 45Ile Gln Arg Arg Gly Arg Asp Ala Trp Gly Pro Gly Asn Ser Ala Arg50 55 60Asp Val Leu Arg Ala Arg Ala Pro Arg Glu Glu Gln Gly Ala Ala Phe65 70 75 80Leu Ala Gly Pro Ser Trp Asp Leu Pro Ala Ala Pro Gly Arg Asp Pro85 90 95Ala Ala Gly Arg Gly Ala Glu Ala Ser Ala Ala Gly Pro Pro Gly Pro100 105 110Pro Thr Arg Pro Pro Gly Pro Trp Arg Trp Lys Gly Ala Arg Gly Gln115 120 125Glu Pro Ser Glu Thr Leu Gly Arg Gly Asn Pro Thr Ala Leu Gln Leu130 135 140Phe Leu Gln Ile Ser Glu Glu Glu Glu Lys Gly Pro Arg Gly Ala Gly145 150 155 160Ile Ser Gly Arg Ser Gln Glu Gln Ser Val Lys Thr Val Pro Gly Ala165 170 175Ser Asp Leu Phe Tyr Trp Pro Arg Arg Ala Gly Lys Leu Gln Gly Ser180 185 190His His Lys Pro Leu Ser Lys Thr Ala Asn Gly Leu Ala Gly His Glu195 200 205Gly Trp Thr Ile Ala Leu Pro Gly Arg Ala Leu Ala Gln Asn Gly Ser210 215 220Leu Gly Glu Gly Ile His Glu Pro Gly Gly Pro Arg Arg Gly Asn Ser225 230 235 240Thr Asn Arg Arg Val Arg Leu Lys Asn Pro Phe Tyr Pro Leu Thr Gln245 250 255Glu Ser Tyr Gly Ala Tyr Ala Val Met Cys Leu Ser Val Val Ile Phe260 265 270Gly Thr Gly Ile Ile Gly Asn Leu Ala Val Met Ser Ile Val Cys His275 280 285Asn Tyr Tyr Met Arg Ser Ile Ser Asn Ser Leu Leu Ala Asn Leu Ala290 295 300Phe Trp Asp Phe Leu Ile Ile Phe Phe Cys Leu Pro Leu Val Ile Phe305 310 315 320His Glu Leu Thr Lys Lys Trp Leu Leu Glu Asp Phe Ser Cys Lys Ile325 330 335Val Pro Tyr Ile Glu Val Ala Ser Leu Gly Val Thr Thr Phe Thr Leu340 345 350Cys Ala Leu Cys Ile Asp Arg Phe Arg Ala Ala Thr Asn Val Gln Met355 360 365Tyr Tyr Glu Met Ile Glu Asn Cys Ser Ser Thr Thr Ala Lys Leu Ala370 375 380Val Ile Trp Val Gly Ala Leu Leu Leu Ala Leu Pro Glu Val Val Leu385 390 395 400Arg Gln Leu Ser Lys Glu Asp Leu Gly Phe Ser Gly Arg Ala Pro Ala405 410 415Glu Arg Cys Ile Ile Lys Ile Ser Pro Asp Leu Pro Asp Thr Ile Tyr420 425 430Val Leu Ala Leu Thr Tyr Asp Ser Ala Arg Leu Trp Trp Tyr Phe Gly435 440 445Cys Tyr Phe Cys Leu Pro Thr Leu Phe Thr Ile Thr Cys Ser Leu Val450 455 460Thr Ala Arg Lys Ile Arg Lys Ala Glu Lys Ala Cys Thr Arg Gly Asn465 470 475 480Lys Arg Gln Ile Gln Leu Glu Ser Gln Met Asn Cys Thr Val Val Ala485 490 495Leu Thr Ile Leu Tyr Gly Phe Cys Ile Ile Pro Glu Asn Ile Cys Asn500 505 510Ile Val Thr Ala Tyr Met Ala Thr Gly Val Ser Gln Gln Thr Met Asp515 520 525Leu Leu Asn Ile Ile Ser Gln Phe Leu Leu Phe Phe Lys Ser Cys Val530 535 540Thr Pro Val Leu Leu Phe Cys Leu Cys Lys Pro Phe Ser Arg Ala Phe545 550 555 560Met Glu Cys Cys Cys Cys Cys Cys Glu Glu Cys Ile Gln Lys Ser Ser565 570 575Thr Val Thr Ser Asp Asp Asn Asp Asn Glu Tyr Thr Thr Glu Leu Glu580 585 590Leu Ser Pro Phe Ser Thr Ile Arg Arg Glu Met Ser Thr Phe Ala Ser595 600 605Val Gly Thr His Cys610(94)SEQ ID NO93的资料(i)序列特征(A)长度34个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO93的序列描述CAGAATTCAG AGAAAAAAAG TGAATATGGT TTTT34(95)SEQ ID NO94的资料(i)序列特征(A)长度32个碱基对(B)类型核酸
(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO94的序列描述TTGGATCCCT GGTGCATAAC AATTGAAAGA AT 32(96)SEQ ID NO95的资料(i)序列特征(A)长度1248个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO95的序列描述ATGGTTTTTG CTCACAGAAT GGATAACAGC AAGCCACATT TGATTATTCC TACACTTCTG 60GTGCCCCTCC AAAACCGCAG CTGCACTGAA ACAGCCACAC CTCTGCCAAG CCAATACCTG 120ATGGAATTAA GTGAGGAGCA CAGTTGGATG AGCAACCAAA CAGACCTTCA CTATGTGCTG 180AAACCCGGGG AAGTGGCCAC AGCCAGCATC TTCTTTGGGA TTCTGTGGTT GTTTTCTATC 240TTCGGCAATT CCCTGGTTTG TTTGGTCATC CATAGGAGTA GGAGGACTCA GTCTACCACC 300AACTACTTTG TGGTCTCCAT GGCATGTGCT GACCTTCTCA TCAGCGTTGC CAGCACGCCT 360TTCGTCCTGC TCCAGTTCAC CACTGGAAGG TGGACGCTGG GTAGTGCAAC GTGCAAGGTT 420GTGCGATATT TTCAATATCT CACTCCAGGT GTCCAGATCT ACGTTCTCCT CTCCATCTGC 480ATAGACCGGT TCTACACCAT CGTCTATCCT CTGAGCTTCA AGGTGTCCAG AGAAAAAGCC 540AAGAAAATGA TTGCGGCATC GTGGATCTTT GATGCAGGCT TTGTGACCCC TGTGCTCTTT 600TTCTATGGCT CCAACTGGGA CAGTCATTGT AACTATTTCC TCCCCTCCTC TTGGGAAGGC 660ACTGCCTACA CTGTCATCCA CTTCTTGGTG GGCTTTGTGA TTCCATCTGT CCTCATAATT 720TTATTTTACC AAAAGGTCAT AAAATATATT TGGAGAATAG GCACAGATGG CCGAACGGTG 780AGGAGGACAA TGAACATTGT CCCTCGGACA AAAGTGAAAA CTATCAAGAT GTTCCTCATT 840TTAAATCTGT TGTTTTTGCT CTCCTGGCTG CCTTTTCATG TAGCTCAGCT ATGGCACCCC 900CATGAACAAG ACTATAAGAA AAGTTCCCTT GTTTTCACAG CTATCACATG GATATCCTTT 960AGTTCTTCAG CCTCTAAACC TACTCTGTAT TCAATTTATA ATGCCAATTT TCGGAGAGGG 1020ATGAAAGAGA CTTTTTGCAT GTCCTCTATG AAATGTTACC GAAGCAATGC CTATACTATC 1080ACAACAAGTT CAAGGATGGC CAAAAAAAAC TACGTTGGCA TTTCAGAAAT CCCTTCCATG 1140GCCAAAACTA TTACCAAAGA CTCGATCTAT GACTCATTTG ACAGAGAAGC CAAGGAAAAA 1200AAGCTTGCTT GGCCCATTAA CTCAAATCCA CCAAATACTT TTGTCTAA 1248(97)SEQ ID NO96的资料(i)序列特征(A)长度415个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO96的序列描述Met Val Phe Ala His Arg Met Asp Asn Ser Lys Pro His Leu Ile Ile1 5 10 15Pro Thr Leu Leu Val Pro Leu Gln Asn Arg Ser Cys Thr Glu Thr Ala20 25 30Thr Pro Leu Pro Ser Gln Tyr Leu Met Glu Leu Ser Glu Glu His Ser35 40 45Trp Met Ser Asn Gln Thr Asp Leu His Tyr Val Leu Lys Pro Gly Glu50 55 60Val Ala Thr Ala Ser Ile Phe Phe Gly Ile Leu Trp Leu Phe Ser Ile65 70 75 80Phe Gly Asn Ser Leu Val Cys Leu Val Ile His Arg Ser Arg Arg Thr85 90 95Gln Ser Thr Thr Asn Tyr Phe Val Val Ser Met Ala Cys Ala Asp Leu100 105 110Leu Ile Ser Val Ala Ser Thr Pro Phe Val Leu Leu Gln Phe Thr Thr115 120 125Gly Arg Trp Thr Leu Gly Ser Ala Thr Cys Lys Val Val Arg Tyr Phe130 135 140Gln Tyr Leu Thr Pro Gly Val Gln Ile Tyr Val Leu Leu Ser Ile Cys145 150 155 160Ile Asp Arg Phe Tyr Thr Ile Val Tyr Pro Leu Ser Phe Lys Val Ser165 170 175Arg Glu Lys Ala Lys Lys Met Ile Ala Ala Ser Trp Ile Phe Asp Ala180 185 190Gly Phe Val Thr Pro Val Leu Phe Phe Tyr Gly Ser Asn Trp Asp Ser195 200 205His Cys Asn Tyr Phe Leu Pro Ser Ser Trp Glu Gly Thr Ala Tyr Thr210 215 220Val Ile His Phe Leu Val Gly Phe Val Ile Pro Ser Val Leu Ile Ile225 230 235 240Leu Phe Tyr Gln Lys Val Ile Lys Tyr Ile Trp Arg Ile Gly Thr Asp245 250 255Gly Arg Thr Val Arg Arg Thr Met Asn Ile Val Pro Arg Thr Lys Val
260 265 270Lys Thr Ile Lys Met Phe Leu Ile Leu Asn Leu Leu Phe Leu Leu Ser275 280 285Trp Leu Pro Phe His Val Ala Gln Leu Trp His Pro His Glu Gln Asp290 295 300Tyr Lys Lys Ser Ser Leu Val Phe Thr Ala Ile Thr Trp Ile Ser Phe305 310 315 320Ser Ser Ser Ala Ser Lys Pro Thr Leu Tyr Ser Ile Tyr Asn Ala Asn325 330 335Phe Arg Arg Gly Met Lys Glu Thr Phe Cys Met Ser Ser Met Lys Cys340 345 350Tyr Arg Ser Asn Ala Tyr Thr Ile Thr Thr Ser Ser Arg Met Ala Lys355 360 365Lys Asn Tyr Val Gly Ile Ser Glu Ile Pro Ser Met Ala Lys Thr Ile370 375 380Thr Lys Asp Ser Ile Tyr Asp Ser Phe Asp Arg Glu Ala Lys Glu Lys385 390 395 400Lys Leu Ala Trp Pro Ile Asn Ser Asn Pro Pro Asn Thr Phe Val405 410 415(98)SEQ ID NO97的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO97的序列描述GGAAAGCTTA ACGATCCCCA GGAGCAACAT 30(99)SEQ ID NO98的资料(i)序列特征(A)长度31个碱基对(B)类型核酸(C)链型单链
(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO98的序列描述CTGGGATCCT ACGAGAGCAT TTTTCACACA G31(100)SEQ ID NO99的资料(i)序列特征(A)长度1842个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO99的序列描述ATGGGGCCCA CCCTAGCGGT TCCCACCCCC TATGGCTGTA TTGGCTGTAA GCTACCCCAG 60CCAGAATACC CACCGGCTCT AATCATCTTT ATGTTCTGCG CGATGGTTAT CACCATCGTT 120GTAGACCTAA TCGGCAACTC CATGGTCATT TTGGCTGTGA CGAAGAACAA GAAGCTCCGG 180AATTCTGGCA ACATCTTCGT GGTCAGTCTC TCTGTGGCCG ATATGCTGGT GGCCATCTAC 240CCATACCCTT TGATGCTGCA TGCCATGTCC ATTGGGGGCT GGGATCTGAG CCAGTTACAG 300TGCCAGATGG TCGGGTTCAT CACAGGGCTG AGTGTGGTCG GCTCCATCTT CAACATCGTG 360GCAATCGCTA TCAACCGTTA CTGCTACATC TGCCACAGCC TCCAGTACGA ACGGATCTTC 420AGTGTGCGCA ATACCTGCAT CTACCTGGTC ATCACCTGGA TCATGACCGT CCTGGCTGTC 480CTGCCCAACA TGTACATTGG CACCATCGAG TACGATCCTC GCACCTACAC CTGCATCTTC 540AACTATCTGA ACAACCCTGT CTTCACTGTT ACCATCGTCT GCATCCACTT CGTCCTCCCT 600CTCCTCATCG TGGGTTTCTG CTACGTGAGG ATCTGGACCA AAGTGCTGGC GGCCCGTGAC 660CCTGCAGGGC AGAATCCTGA CAACCAACTT GCTGAGGTTC GCAATTTTCT AACCATGTTT 720GTGATCTTCC TCCTCTTTGC AGTGTGCTGG TGCCCTATCA ACGTGCTCAC TGTCTTGGTG 780GCTGTCAGTC CGAAGGAGAT GGCAGGCAAG ATCCCCAACT GGCTTTATCT TGCAGCCTAC 840TTCATAGCCT ACTTCAACAG CTGCCTCAAC GCTGTGATCT ACGGGCTCCT CAATGAGAAT 900TTCCGAAGAG AATACTGGAC CATCTTCCAT GCTATGCGGC ACCCTATCAT ATTCTTCCCT 960GGCCTCATCA GTGATATTCG TGAGATGCAG GAGGCCCGTA CCCTGGCCCG CGCCCGTGCC 1020CATGCTCGCG ACCAAGCTCG TGAACAAGAC CGTGCCCATG CCTGTCCTGC TGTGGAGGAA 1080ACCCCGATGA ATGTCCGGAA TGTTCCATTA CCTGGTGATG CTGCAGCTGG CCACCCCGAC 1140CGTGCCTCTG GCCACCCTAA GCCCCATTCC AGATCCTCCT CTGCCTATCG CAAATCTGCC 1200TCTACCCACC ACAAGTCTGT CTTTAGCCAC TCCAAGGCTG CCTCTGGTCA CCTCAAGCCT 1260GTCTCTGGCC ACTCCAAGCC TGCCTCTGGT CACCCCAAGT CTGCCACTGT CTACCCTAAG 1320CCTGCCTCTG TCCATTTCAA GGGTGACTCT GTCCATTTCA AGGGTGACTC TGTCCATTTC 1380AAGCCTGACT CTGTTCATTT CAAGCCTGCT TCCAGCAACC CCAAGCCCAT CACTGGCCAC 1440CATGTCTCTG CTGGCAGCCA CTCCAAGTCT GCCTTCAGTG CTGCCACCAG CCACCCTAAA 1500CCCATCAAGC CAGCTACCAG CCATGCTGAG CCCACCACTG CTGACTATCC CAAGCCTGCC 1560ACTACCAGCC ACCCTAAGCC CGCTGCTGCT GACAACCCTG AGCTCTCTGC CTCCCATTGC 1620CCCGAGATCC CTGCCATTGC CCACCCTGTG TCTGACGACA GTGACCTCCC TGAGTCGGCC 1680TCTAGCCCTG CCGCTGGGCC CACCAAGCCT GCTGCCAGCC AGCTGGAGTC TGACACCATC 1740GCTGACCTTC CTGACCCTAC TGTAGTCACT ACCAGTACCA ATGATTACCA TGATGTCGTG 1800GTTGTTGATG TTGAAGATGA TCCTGATGAA ATGGCTGTGT GA1842(101)SEQ ID NO100的资料(i)序列特征(A)长度613个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO100的序列描述Met Gly Pro Thr Leu Ala Val Pro Thr Pro Tyr Gly Cys Ile Gly Cys1 5 10 15Lys Leu Pro Gln Pro Glu Tyr Pro Pro Ala Leu Ile Ile Phe Met Phe20 25 30Cys Ala Met Val Ile Thr Ile Val Val Asp Leu Ile Gly Asn Ser Met35 40 45Val Ile Leu Ala Val Thr Lys Asn Lys Lys Leu Arg Asn Ser Gly Asn50 55 60Ile Phe Val Val Ser Leu Ser Val Ala Asp Met Leu Val Ala Ile Tyr65 70 75 80Pro Tyr Pro Leu Met Leu His Ala Met Ser Ile Gly Gly Trp Asp Leu85 90 95Ser Gln Leu Gln Cys Gln Met Val Gly Phe Ile Thr Gly Leu Ser Val100 105 110Val Gly Ser Ile Phe Asn Ile Val Ala Ile Ala Ile Asn Arg Tyr Cys115 120 125Tyr Ile Cys His Ser Leu Gln Tyr Glu Arg Ile Phe Ser Val Arg Asn130 135 140Thr Cys Ile Tyr Leu Val Ile Thr Trp Ile Met Thr Val Leu Ala Val145 150 155 160Leu Pro Asn Met Tyr Ile Gly Thr Ile Glu Tyr Asp Pro Arg Thr Tyr165 170 175Thr Cys Ile Phe Asn Tyr Leu Asn Asn Pro Val Phe Thr Val Thr Ile180 185 190Val Cys Ile His Phe Val Leu Pro Leu Leu Ile Val Gly Phe Cys Tyr195 200 205Val Arg Ile Trp Thr Lys Val Leu Ala Ala Arg Asp Pro Ala Gly Gln
210 215 220Asn Pro Asp Asn Gln Leu Ala Glu Val Arg Asn Phe Leu Thr Met Phe225 230 235 240Val Ile Phe Leu Leu phe Ala Val Cys Trp Cys Pro Ile Asn Val Leu245 250 255Thr Val Leu Val Ala Val Ser Pro Lys Glu Met Ala Gly Lys Ile Pro260 265 270Asn Trp Leu Tyr Leu Ala Ala Tyr Phe Ile Ala Tyr Phe Asn Ser Cys275 280 285Leu Asn Ala Val Ile Tyr Gly Leu Leu Asn Glu Asn Phe Arg Arg Glu290 295 300Tyr Trp Thr Ile Phe His Ala Met Arg His Pro Ile Ile Phe Phe Pro305 310 315 320Gly Leu Ile Ser Asp Ile Arg Glu Met Gln Glu Ala Arg Thr Leu Ala325 330 335Arg Ala Arg Ala His Ala Arg Asp Gln Ala Arg Glu Gln Asp Arg Ala340 345 350His Ala Cys Pro Ala Val Glu Glu Thr Pro Met Asn Val Arg Asn Val355 360 365Pro Leu Pro Gly Asp Ala Ala Ala Gly His Pro Asp Arg Ala Ser Gly370 375 380His Pro Lys Pro His Ser Arg Ser Ser Ser Ala Tyr Arg Lys Ser Ala385 390 395 400Ser Thr His His Lys Ser Val Phe Ser His Ser Lys Ala Ala Ser Gly405 410 415His Leu Lys Pro Val Ser Gly His Ser Lys Pro Ala Ser Gly His Pro420 425 430Lys Ser Ala Thr Val Tyr Pro Lys Pro Ala Ser Val His Phe Lys Gly435 440 445Asp Ser Val His Phe Lys Gly Asp Ser Val His Phe Lys Pro Asp Ser450 455 460Val His Phe Lys Pro Ala Ser Ser Asn Pro Lys Pro Ile Thr Gly His465 470 475 480His Val Ser Ala Gly Ser His Ser Lys Ser Ala Phe Ser Ala Ala Thr
485 490 495Ser His Pro Lys Pro Ile Lys Pro Ala Thr Ser His Ala Glu Pro Thr500 505 510Thr Ala Asp Tyr Pro Lys Pro Ala Thr Thr Ser His Pro Lys Pro Ala515 520 525Ala Ala Asp Asn Pro Glu Leu Ser Ala Ser His Cys Pro Glu Ile Pro530 535 540Ala Ile Ala His Pro Val Ser Asp Asp Ser Asp Leu Pro Glu Ser Ala545 550 555560Ser Ser Pro Ala Ala Gly Pro Thr Lys Pro Ala Ala Ser Gln Leu Glu565 570 575Ser Asp Thr Ile Ala Asp Leu Pro Asp Pro Thr Val Val Thr Thr Ser580 585 590Thr Asn Asp Tyr His Asp Val Val Val Val Asp Val Glu Asp Asp Pro595 600 605Asp Glu Met Ala Val610(102)SEQ ID NO101的资料(i)序列特征(A)长度32个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO101的序列描述TCCAAGCTTC GCCATGGGAC ATAACGGGAG CT 32(103)SEQ ID NO102的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO102的序列描述CGTGAATTCC AAGAATTTAC AATCCTTGCT 30(104)SEQ ID NO103的资料(i)序列特征(A)长度1548个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO103的序列描述ATGGGACATA ACGGGAGCTG GATCTCTCCA AATGCCAGCG AGCCGCACAA CGCGTCCGGC 60GCCGAGGCTG CGGGTGTGAA CCGCAGCGCG CTCGGGGAGT TCGGCGAGGC GCAGCTGTAC 120CGCCAGTTCA CCACCACCGT GCAGGTCGTC ATCTTCATAG GCTCGCTGCT CGGAAACTTC 180ATGGTGTTAT GGTCAACTTG CCGCACAACC GTGTTCAAAT CTGTCACCAA CAGGTTCATT 240AAAAACCTGG CCTGCTCGGG GATTTGTGCC AGCCTGGTCT GTGTGCCCTT CGACATCATC 300CTCAGCACCA GTCCTCACTG TTGCTGGTGG ATCTACACCA TGCTCTTCTG CAAGGTCGTC 360AAATTTTTGC ACAAAGTATT CTGCTCTGTG ACCATCCTCA GCTTCCCTGC TATTGCTTTG 420GACAGGTACT ACTCAGTCCT CTATCCACTG GAGAGGAAAA TATCTGATGC CAAGTCCCGT 480GAACTGGTGA TGTACATCTG GGCCCATGCA GTGGTGGCCA GTGTCCCTGT GTTTGCAGTA 540ACCAATGTGG CTGACATCTA TGCCACGTCC ACCTGCACGG AAGTCTGGAG CAACTCCTTG 600GGCCACCTGG TGTACGTTCT GGTGTATAAC ATCACCACGG TCATTGTGCC TGTGGTGGTG 660GTGTTCCTCT TCTTGATACT GATCCGACGG GCCCTGAGTG CCAGCCAGAA GAAGAAGGTC 720ATCATAGCAG CGCTCCGGAC CCCACAGAAC ACCATCTCTA TTCCCTATGC CTCCCAGCGG 780GAGGCCGAGC TGCACGCCAC CCTGCTCTCC ATGGTGATGG TCTTCATCTT GTGTAGCGTG 840CCCTATGCCA CCCTGGTCGT CTACCAGACT GTGCTCAATG TCCCTGACAC TTCCGTCTTC 900TTGCTGCTCA CTGCTGTTTG GCTGCCCAAA GTCTCCCTGC TGGCAAACCC TGTTCTCTTT 960CTTACTGTGA ACAAATCTGT CCGCAAGTGC TTGATAGGGA CCCTGGTGCA ACTACACCAC 1020CGGTACAGTC GCCGTAATGT GGTCAGTACA GGGAGTGGCA TGGCTGAGGC CAGCCTGGAA 1080CCCAGCATAC GCTCGGGTAG CCAGCTCCTG GAGATGTTCC ACATTGGGCA GCAGCAGATC 1140TTTAAGCCCA CAGAGGATGA GGAAGAGAGT GAGGCCAAGT ACATTGGCTC AGCTGACTTC 1200CAGGCCAAGG AGATATTTAG CACCTGCCTG GAGGGAGAGC AGGGGCCACA GTTTGCGCCC 1260TCTGCCCCAC CCCTGAGCAC AGTGGACTCT GTATCCCAGG TGGCACCGGC AGCCCCTGTG 1320GAACCTGAAA CATTCCCTGA TAAGTATTCC CTGCAGTTTG GCTTTGGGCC TTTTGAGTTG 1380CCTCCTCAGT GGCTCTCAGA GACCCGAAAC AGCAAGAAGC GGCTGCTTCC CCCCTTGGGC 1440AACACCCCAG AAGAGCTGAT CCAGACAAAG GTGCCCAAGG TAGGCAGGGT GGAGCGGAAG 1500ATGAGCAGAA ACAATAAAGT GAGCATTTTT CCAAAGGTGG ATTCCTAG 1548(105)SEQ ID NO104的资料(i)序列特征(A)长度515个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO104的序列描述Met Gly His Asn Gly Ser Trp Ile Ser Pro Asn Ala Ser Glu Pro His1 5 10 15Asn Ala Ser Gly Ala Glu Ala Ala Gly Val Asn Arg Ser Ala Leu Gly20 25 30Glu Phe Gly Glu Ala Gln Leu Tyr Arg Gln Phe Thr Thr Thr Val Gln35 40 45Val Val Ile Phe Ile Gly Ser Leu Leu Gly Asn Phe Met Val Leu Trp50 55 60Ser Thr Cys Arg Thr Thr Val Phe Lys Ser Val Thr Asn Arg Phe Ile65 70 75 80Lys Asn Leu Ala Cys Ser Gly Ile Cys Ala Ser Leu Val Cys Val Pro85 90 95Phe Asp Ile Ile Leu Ser Thr Ser Pro His Cys Cys Trp Trp Ile Tyr100 105 110Thr Met Leu Phe Cys Lys Val Val Lys Phe Leu His Lys Val Phe Cys115 120 125Ser Val Thr Ile Leu Ser Phe Pro Ala Ile Ala Leu Asp Arg Tyr Tyr130 135 140Ser Val Leu Tyr Pro Leu Glu Arg Lys Ile Ser Asp Ala Lys Ser Arg145 150 155 160Glu Leu Val Met Tyr Ile Trp Ala His Ala Val Val Ala Ser Val Pro165 170 175Val Phe Ala Val Thr Asn Val Ala Asp Ile Tyr Ala Thr Ser Thr Cys180 185 190Thr Glu Val Trp Ser Asn Ser Leu Gly His Leu Val Tyr Val Leu Val195 200 205Tyr Asn Ile Thr Thr Val Ile Val Pro Val Val Val Val Phe Leu Phe210 215 220Leu Ile Leu Ile Arg Arg Ala Leu Ser Ala Ser Gln Lys Lys Lys Val225 230 235 240Ile Ile Ala Ala Leu Arg Thr Pro Gln Asn Thr Ile Ser Ile Pro Tyr245 250 255Ala Ser Gln Arg Glu Ala Glu Leu His Ala Thr Leu Leu Ser Met Val260 265 270Met Val Phe Ile Leu Cys Ser Val Pro Tyr Ala Thr Leu Val Val Tyr275 280 285Gln Thr Val Leu Asn Val Pro Asp Thr Ser Val Phe Leu Leu Leu Thr290 295 300Ala Val Trp Leu Pro Lys Val Ser Leu Leu Ala Asn Pro Val Leu Phe305 310 315 320Leu Thr Val Asn Lys Ser Val Arg Lys Cys Leu Ile Gly Thr Leu Val325 330 335Gln Leu His His Arg Tyr Ser Arg Arg Asn Val Val Ser Thr Gly Ser340 345 350Gly Met Ala Glu Ala Ser Leu Glu Pro Ser Ile Arg Ser Gly Ser Gln355 360 365Leu Leu Glu Met Phe His Ile Gly Gln Gln Gln Ile Phe Lys Pro Thr370 375 380Glu Asp Glu Glu Glu Ser Glu Ala Lys Tyr Ile Gly Ser Ala Asp Phe385 390 395 400Gln Ala Lys Glu Ile Phe Ser Thr Cys Leu Glu Gly Glu Gln Gly Pro405 410 415Gln Phe Ala Pro Ser Ala Pro Pro Leu Ser Thr Val Asp Ser Val Ser420 425 430Gln Val Ala Pro Ala Ala Pro Val Glu Pro Glu Thr Phe Pro Asp Lys435 440 445Tyr Ser Leu Gln Phe Gly Phe Gly Pro Phe Glu Leu Pro Pro Gln Trp450 455 460Leu Ser Glu Thr Arg Asn Ser Lys Lys Arg Leu Leu Pro Pro Leu Gly465 470 475 480Asn Thr Pro Glu Glu Leu Ile Gln Thr Lys Val Pro Lys Val Gly Arg485 490 495Val Glu Arg Lys Met Ser Arg Asn Asn Lys Val Ser Ile Phe Pro Lys500 505 510Val Asp Ser515(106)SEQ ID NO105的资料(i)序列特征(A)长度29个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO105的序列描述GGAGAATTCA CTAGGCGAGG CGCTCCATC 29(107)SEQ ID NO106的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO106的序列描述GGAGGATCCA GGAAACCTTA GGCCGAGTCC 30(108)SEQ ID NO107的资料(i)序列特征(A)长度1164个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO107的序列描述ATGAATCGGC ACCATCTGCA GGATCACTTT CTGGAAATAG ACAAGAAGAA CTGCTGTGTG 60TTCCGAGATG ACTTCATTGC CAAGGTGTTG CCGCCGGTGT TGGGGCTGGA GTTTATCTTT 120GGGCTTCTGG GCAATGGCCT TGCCCTGTGG ATTTTCTGTT TCCACCTCAA GTCCTGGAAA 180TCCAGCCGGA TTTTCCTGTT CAACCTGGCA GTAGCTGACT TTCTACTGAT CATCTGCCTG 240CCGTTCGTGA TGGACTACTA TGTGCGGCGT TCAGACTGGA ACTTTGGGGA CATCCCTTGC 300CGGCTGGTGC TCTTCATGTT TGCCATGAAC CGCCAGGGCA GCATCATCTT CCTCACGGTG 360GTGGCGGTAG ACAGGTATTT CCGGGTGGTC CATCCCCACC ACGCCCTGAA CAAGATCTCC 420AATTGGACAG CAGCCATCAT CTCTTGCCTT CTGTGGGGCA TCACTGTTGG CCTAACAGTC 480CACCTCCTGA AGAAGAAGTT GCTGATCCAG AATGGCCCTG CAAATGTGTG CATCAGCTTC 540AGCATCTGCC ATACCTTCCG GTGGCACGAA GCTATGTTCC TCCTGGAGTT CCTCCTGCCC 600CTGGGCATCA TCCTGTTCTG CTCAGCCAGA ATTATCTGGA GCCTGCGGCA GAGACAAATG 660GACCGGCATG CCAAGATCAA GAGAGCCATC ACCTTCATCA TGGTGGTGGC CATCGTCTTT 720GTCATCTGCT TCCTTCCCAG CGTGGTTGTG CGGATCCGCA TCTTCTGGCT CCTGCACACT 780TCGGGCACGC AGAATTGTGA AGTGTACCGC TCGGTGGACC TGGCGTTCTT TATCACTCTC 840AGCTTCACCT ACATGAACAG CATGCTGGAC CCCGTGGTGT ACTACTTCTC CAGCCCATCC 900TTTCCCAACT TCTTCTCCAC TTTGATCAAC CGCTGCCTCC AGAGGAAGAT GACAGGTGAG 960CCAGATAATA ACCGCAGCAC GAGCGTCGAG CTCACAGGGG ACCCCAACAA AACCAGAGGC 1020GCTCCAGAGG CGTTAATGGC CAACTCCGGT GAGCCATGGA GCCCCTCTTA TCTGGGCCCA 1080ACCTCAAATA ACCATTCCAA GAAGGGACAT TGTCACCAAG AACCAGCATC TCTGGAGAAA 1140CAGTTGGGCT GTTGCATCGA GTAA1164(109)SEQ ID NO108的资料(i)序列特征(A)长度387个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO108的序列描述Met Asn Arg His His Leu Gln Asp His Phe Leu Glu Ile Asp Lys Lys1 5 10 15Asn Cys Cys Val Phe Arg Asp Asp Phe Ile Ala Lys Val Leu Pro Pro20 25 30Val Leu Gly Leu Glu Phe Ile Phe Gly Leu Leu Gly Asn Gly Leu Ala35 40 45Leu Trp Ile Phe Cys Phe His Leu Lys Ser Trp Lys Ser Ser Arg Ile50 55 60Phe Leu Phe Asn Leu Ala Val Ala Asp Phe Leu Leu Ile Ile Cys Leu65 70 75 80Pro Phe Val Met Asp Tyr Tyr Val Arg Arg Ser Asp Trp Asn Phe Gly85 90 95Asp Ile Pro Cys Arg Leu Val Leu Phe Met Phe Ala Met Asn Arg Gln100 105 110Gly Ser Ile Ile Phe Leu Thr Val Val Ala Val Asp Arg Tyr Phe Arg115 120 125Val Val His Pro His His Ala Leu Asn Lys Ile Ser Asn Trp Thr Ala130 135 140Ala Ile Ile Ser Cys Leu Leu Trp Gly Ile Thr Val Gly Leu Thr Val145 150 155 160His Leu Leu Lys Lys Lys Leu Leu Ile Gln Asn Gly Pro Ala Asn Val165 170 175Cys Ile Ser Phe Ser Ile Cys His Thr Phe Arg Trp His Glu Ala Met180 185 190Phe Leu Leu Glu Phe Leu Leu Pro Leu Gly Ile Ile Leu Phe Cys Ser195 200 205Ala Arg Ile Ile Trp Ser Leu Arg Gln Arg Gln Met Asp Arg His Ala210 215 220Lys Ile Lys Arg Ala Ile Thr Phe Ile Met Val Val Ala Ile Val Phe225 230 235 240Val Ile Cys Phe Leu Pro Ser Val Val Val Arg Ile Arg Ile Phe Trp245 250 255Leu Leu His Thr Ser Gly Thr Gln Asn Cys Glu Val Tyr Arg Ser Val260 265 270Asp Leu Ala Phe Phe Ile Thr Leu Ser Phe Thr Tyr Met Asn Ser Met275 280 285Leu Asp Pro Val Val Tyr Tyr Phe Ser Ser Pro Ser Phe Pro Asn Phe290 295 300Phe Ser Thr Leu Ile Asn Arg Cys Leu Gln Arg Lys Met Thr Gly Glu305 310 315 320Pro Asp Asn Asn Arg Ser Thr Ser Val Glu Leu Thr Gly Asp Pro Asn325 330 335Lys Thr Arg Gly Ala Pro Glu Ala Leu Met Ala Asn Ser Gly Glu Pro340 345 350Trp Ser Pro Ser Tyr Leu Gly Pro Thr Ser Asn Asn His Ser Lys Lys355 360 365Gly His Cys His Gln Glu Pro Ala Ser Leu Glu Lys Gln Leu Gly Cys370 375 380Cys Ile Glu385(110)SEQ ID NO109的资料(i)序列特征(A)长度37碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(iv)反义否(xi)SEQ ID NO109的序列描述ACCATGGCTT GCAATGGCAG TGCGGCCAGG GGGCACT 37(111)SEQ ID NO110的资料(i)序列特征(A)长度39个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(iv)反义是(xi)SEQ ID NO110的序列描述CGACCAGGAC AAACAGCATC TTGGTCACTT GTCTCCGGC 39(112)SEQ ID NO111的资料(i)序列特征(A)长度39个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(iv)反义否(xi)SEQ ID NO111的序列描述GACCAAGATG CTGTTTGTCC TGGTCGTGGT GTTTGGCAT39(113)SEQ ID NO112的资料(i)序列特征(A)长度35个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(iv)反义是(xi)SEQ ID NO112的序列描述CGGAATTCAG GATGGATCGG TCTCTTCCTC CGCCT35(114)SEQ ID NO113的资料(i)序列特征(A)长度1212个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO113的序列描述ATGGCTTGCA ATGGCAGTGC GGCCAGGGGG CACTTTGACC CTGAGGACTT GAACCTGACT 60GACGAGGCAC TGAGACTCAA GTACCTGGGG CCCCAGCAGA CAGAGCTGTT CATGCCCATC 120TGTGCCACAT ACCTGCTGAT CTTCGTGGTG GGCGCTGTGG GCAATGGGCT GACCTGTCTG 180GTCATCCTGC GCCACAAGGC CATGCGCACG CCTACCAACT ACTACCTCTT CAGCCTGGCC 240GTGTCGGACC TGCTGGTGCT GCTGGTGGGC CTGCCCCTGG AGCTCTATGA GATGTGGCAC 300AACTACCCCT TCCTGCTGGG CGTTGGTGGC TGCTATTTCC GCACGCTACT GTTTGAGATG 360GTCTGCCTGG CCTCAGTGCT CAACGTCACT GCCCTGAGCG TGGAACGCTA TGTGGCCGTG 420GTGCACCCAC TCCAGGCCAG GTCCATGGTG ACGCGGGCCC ATGTGCGCCG AGTGCTTGGG 480GCCGTCTGGG GTCTTGCCAT GCTCTGCTCC CTGCCCAACA CCAGCCTGCA CGGCATCCGG 540CAGCTGCACG TGCCCTGCCG GGGCCCAGTG CCAGACTCAG CTGTTTGCAT GCTGGTCCGC 600CCACGGGCCC TCTACAACAT GGTAGTGCAG ACCACCGCGC TGCTCTTCTT CTGCCTGCCC 660ATGGCCATCA TGAGCGTGCT CTACCTGCTC ATTGGGCTGC GACTGCGGCG GGAGAGGCTG 720CTGCTCATGC AGGAGGCCAA GGGCAGGGGC TCTGCAGCAG CCAGGTCCAG ATACACCTGC 780AGGCTCCAGC AGCACGATCG GGGCCGGAGA CAAGTGACCA AGATGCTGTT TGTCCTGGTC 840GTGGTGTTTG GCATCTGCTG GGCCCCGTTC CACGCCGACC GCGTCATGTG GAGCGTCGTG 900TCACAGTGGA CAGATGGCCT GCACCTGGCC TTCCAGCACG TGCACGTCAT CTCCGGCATC 960TTCTTCTACC TGGGCTCGGC GGCCAACCCC GTGCTCTATA GCCTCATGTC CAGCCGCTTC 1020CGAGAGACCT TCCAGGAGGC CCTGTGCCTC GGGGCCTGCT GCCATCGCCT CAGACCCCGC 1080CACAGCTCCC ACAGCCTCAG CAGGATGACC ACAGGCAGCA CCCTGTGTGA TGTGGGCTCC 1140CTGGGCAGCT GGGTCCACCC CCTGGCTGGG AACGATGGCC CAGAGGCGCA GCAAGAGACC 1200GATCCATCCT GA 1212(115)SEQ ID NO114的资料(i)序列特征(A)长度403个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO114的序列描述Met Ala Cys Asn Gly Ser Ala Ala Arg Gly His Phe Asp Pro Glu Asp1 5 10 15Leu Asn Leu Thr Asp Glu Ala Leu Arg Leu Lys Tyr Leu Gly Pro Gln
20 25 30Gln Thr Glu Leu Phe Met Pro Ile Cys Ala Thr Tyr Leu Leu Ile Phe35 40 45Val Val Gly Ala Val Gly Asn Gly Leu Thr Cys Leu Val Ile Leu Arg50 55 60His Lys Ala Met Arg Thr Pro Thr Asn Tyr Tyr Leu Phe Ser Leu Ala65 70 75 80Val Ser Asp Leu Leu Val Leu Leu Val Gly Leu Pro Leu Glu Leu Tyr85 90 95Glu Met Trp His Asn Tyr Pro Phe Leu Leu Gly Val Gly Gly Cys Tyr100 105 110Phe Arg Thr Leu Leu Phe Glu Met Val Cys Leu Ala Ser Val Leu Asn115 120 125Val Thr Ala Leu Ser Val Glu Arg Tyr Val Ala Val Val His Pro Leu130 135140Gln Ala Arg Ser Met Val Thr Arg Ala His Val Arg Arg Val Leu Gly145 150 155 160Ala Val Trp Gly Leu Ala Met Leu Cys Ser Leu Pro Asn Thr Ser Leu165 170 175His Gly Ile Arg Gln Leu His Val Pro Cys Arg Gly Pro Val Pro Asp180 185 190Ser Ala Val Cys Met Leu Val Arg Pro Arg Ala Leu Tyr Asn Met Val195 200 205Val Gln Thr Thr Ala Leu Leu Phe Phe Cys Leu Pro Met Ala Ile Met210 215 220Ser Val Leu Tyr Leu Leu Ile Gly Leu Arg Leu Arg Arg Glu Arg Leu225 230 235 240Leu Leu Met Gln Glu Ala Lys Gly Arg Gly Ser Ala Ala Ala Arg Ser245 250 255Arg Tyr Thr Cys Arg Leu Gln Gln His Asp Arg Gly Arg Arg Gln Val260 265 270Thr Lys Met Leu Phe Val Leu Val Val Val Phe Gly Ile Cys Trp Ala275 280 285Pro Phe His Ala Asp Arg Val Met Trp Ser Val Val Ser Gln Trp Thr
290 295 300Asp Gly Leu His Leu Ala Phe Gln His Val His Val Ile Ser Gly Ile305 310 315 320Phe Phe Tyr Leu Gly Ser Ala Ala Asn Pro Val Leu Tyr Ser Leu Met325 330 335Ser Ser Arg Phe Arg Glu Thr Phe Gln Glu Ala Leu Cys Leu Gly Ala340 345 350Cys Cys His Arg Leu Arg Pro Arg His Ser Ser His Ser Leu Ser Arg355 360 365Met Thr Thr Gly Ser Thr Leu Cys Asp Val Gly Ser Leu Gly Ser Trp370 375 380Val His Pro Leu Ala Gly Asn Asp Gly Pro Glu Ala Gln Gln Glu Thr385 390 395 400Asp Pro Ser(116)SEQ ID NO115的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO115的序列描述GGAAGCTTCA GGCCCAAAGA TGGGGAACAT 30(117)SEQ ID NO116的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO116的序列描述GTGGATCCAC CCGCGGAGGA CCCAGGCTAG30(118)SEQ ID NO117的资料(i)序列特征(A)长度1098个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组)(xi)SEQ ID NO117的序列描述ATGGGGAACA TCACTGCAGA CAACTCCTCG ATGAGCTGTA CCATCGACCA TACCATCCAC 60CAGACGCTGG CCCCGGTGGT CTATGTTACC GTGCTGGTGG TGGGCTTCCC GGCCAACTGC 120CTGTCCCTCT ACTTCGGCTA CCTGCAGATC AAGGCCCGGA ACGAGCTGGG CGTGTACCTG 180TGCAACCTGA CGGTGGCCGA CCTCTTCTAC ATCTGCTCGC TGCCCTTCTG GCTGCAGTAC 240GTGCTGCAGC ACGACAACTG GTCTCACGGC GACCTGTCCT GCCAGGTGTG CGGCATCCTC 300CTGTACGAGA ACATCTACAT CAGCGTGGGC TTCCTCTGCT GCATCTCCGT GGACCGCTAC 360CTGGCTGTGG CCCATCCCTT CCGCTTCCAC CAGTTCCGGA CCCTGAAGGC GGCCGTCGGC 420GTCAGCGTGG TCATCTGGGC CAAGGAGCTG CTGACCAGCA TCTACTTCCT GATGCACGAG 480GAGGTCATCG AGGACGAGAA CCAGCACCGC GTGTGCTTTG AGCACTACCC CATCCAGGCA 540TGGCAGCGCG CCATCAACTA CTACCGCTTC CTGGTGGGCT TCCTCTTCCC CATCTGCCTG 600CTGCTGGCGT CCTACCAGGG CATCCTGCGC GCCGTGCGCC GGAGCCACGG CACCCAGAAG 660AGCCGCAAGG ACCAGATCCA GCGGCTGGTG CTCAGCACCG TGGTCATCTT CCTGGCCTGC 720TTCCTGCCCT ACCACGTGTT GCTGCTGGTG CGCAGCGTCT GGGAGGCCAG CTGCGACTTC 780GCCAAGGGCG TTTTCAACGC CTACCACTTC TCCCTCCTGC TCACCAGCTT CAACTGCGTC 840GCCGACCCCG TGCTCTACTG CTTCGTCAGC GAGACCACCC ACCGGGACCT GGCCCGCCTC 900CGCGGGGCCT GCCTGGCCTT CCTCACCTGC TCCAGGACCG GCCGGGCCAG GGAGGCCTAC 960CCGCTGGGTG CCCCCGAGGC CTCCGGGAAA AGCGGGGCCC AGGGTGAGGA GCCCGAGCTG 1020TTGACCAAGC TCCACCCGGC CTTCCAGACC CCTAACTCGC CAGGGTCGGG CGGGTTCCCC 1080ACGGGCAGGT TGGCCTAG 1098(119)SEQ ID NO118的资料(i)序列特征(A)长度365个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO118的序列描述Met Gly Asn Ile Thr Ala Asp Asn Ser Ser Met Ser Cys Thr Ile Asp1 5 10 15His Thr Ile His Gln Thr Leu Ala Pro Val Val Tyr Val Thr Val Leu20 25 30Val Val Gly Phe Pro Ala Asn Cys Leu Ser Leu Tyr Phe Gly Tyr Leu35 40 45Gln Ile Lys Ala Arg Asn Glu Leu Gly Val Tyr Leu Cys Asn Leu Thr50 55 60Val Ala Asp Leu Phe Tyr Ile Cys Ser Leu Pro Phe Trp Leu Gln Tyr65 70 75 80Val Leu Gln His Asp Asn Trp Ser His Gly Asp Leu Ser Cys Gln Val85 90 95Cys Gly Ile Leu Leu Tyr Glu Asn Ile Tyr Ile Ser Val Gly Phe Leu100 105 110Cys Cys Ile Ser Val Asp Arg Tyr Leu Ala Val Ala His Pro Phe Arg115 120 125Phe His Gln Phe Arg Thr Leu Lys Ala Ala Val Gly Val Ser Val Val130 135 140Ile Trp Ala Lys Glu Leu Leu Thr Ser Ile Tyr Phe Leu Met His Glu145 150 155 160Glu Val Ile Glu Asp Glu Asn Gln His Arg Val Cys Phe Glu His Tyr165 170 175Pro Ile Gln Ala Trp Gln Arg Ala Ile Asn Tyr Tyr Arg Phe Leu Val180 185 190Gly Phe Leu Phe Pro Ile Cys Leu Leu Leu Ala Ser Tyr Gln Gly Ile195 200 205Leu Arg Ala Val Arg Arg Ser His Gly Thr Gln Lys Ser Arg Lys Asp210 215 220Gln Ile Gln Arg Leu Val Leu Ser Thr Val Val Ile Phe Leu Ala Cys225 230 235 240Phe Leu Pro Tyr His Val Leu Leu Leu Val Arg Ser Val Trp Glu Ala245 250 255Ser Cys Asp Phe Ala Lys Gly Val Phe Asn Ala Tyr His Phe Ser Leu260 265 270Leu Leu Thr Ser Phe Asn Cys Val Ala Asp Pro Val Leu Tyr Cys Phe275 280 285Val Ser Glu Thr Thr His Arg Asp Leu Ala Arg Leu Arg Gly Ala Cys290 295 300Leu Ala Phe Leu Thr Cys Ser Arg Thr Gly Arg Ala Arg Glu Ala Tyr305 310 315 320Pro Leu Gly Ala Pro Glu Ala Ser Gly Lys Ser Gly Ala Gln Gly Glu325 330 335Glu Pro Glu Leu Leu Thr Lys Leu His Pro Ala Phe Gln Thr Pro Asn340 345 350Ser Pro Gly Ser Gly Gly Phe Pro Thr Gly Arg Leu Ala355 360 365(120)SEQ ID NO119的资料(i)序列特征(A)长度26个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组)(xi) SEQ ID NO119的序列描述GACCTCGAGT CCTTCTACAC CTCATC 26(121)SEQ ID NO120的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO120的序列描述TGCTCTAGAT TCCAGATAGG TGAAAACTTG 30(122)SEQ ID NO121的资料(i)序列特征(A)长度1416个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi) SEQ ID NO121的序列描述ATGGATATTC TTTGTGAAGA AAATACTTCT TTGAGCTCAA CTACGAACTC CCTAATGCAA 60TTAAATGATG ACAACAGGCT CTACAGTAAT GACTTTAACT CCGGAGAAGC TAACACTTCT 120GATGCATTTA ACTGGACAGT CGACTCTGAA AATCGAACCA ACCTTTCCTG TGAAGGGTGC 180CTCTCACCGT CGTGTCTCTC CTTACTTCAT CTCCAGGAAA AAAACTGGTC TGCTTTACTG 240ACAGCCGTAG TGATTATTCT AACTATTGCT GGAAACATAC TCGTCATCAT GGCAGTGTCC 300CTAGAGAAAA AGCTGCAGAA TGCCACCAAC TATTTCCTGA TGTCACTTGC CATAGCTGAT 360ATGCTGCTGG GTTTCCTTGT CATGCCCGTG TCCATGTTAA CCATCCTGTA TGGGTACCGG 420TGGCCTCTGC CGAGCAAGCT TTGTGCAGTC TGGATTTACC TGGACGTGCT CTTCTCCACG 480GCCTCCATCA TGCACCTCTG CGCCATCTCG CTGGACCGCT ACGTCGCCAT CCAGAATCCC 540ATCCACCACA GCCGCTTCAA CTCCAGAACT AAGGCATTTC TGAAAATCAT TGCTGTTTGG 600ACCATATCAG TAGGTATATC CATGCCAATA CCAGTCTTTG GGCTACAGGA CGATTCGAAG 660GTCTTTAAGG AGGGGAGTTG CTTACTCGCC GATGATAACT TTGTCCTGAT CGGCTCTTTT 720GTGTCATTTT TCATTCCCTT AACCATCATG GTGATCACCT ACTTTCTAAC TATCAAGTCA 780CTCCAGAAAG AAGCTACTTT GTGTGTAAGT GATCTTGGCA CACGGGCCAA ATTAGCTTCT 840TTCAGCTTCC TCCCTCAGAG TTCTTTGTCT TCAGAAAAGC TCTTCCAGCG GTCGATCCAT 900AGGGAGCCAG GGTCCTACAC AGGCAGGAGG ACTATGCAGT CCATCAGCAA TGAGCAAAAG 960GCATGCAAGG TGCTGGGCAT CGTCTTCTTC CTGTTTGTGG TGATGTGGTG CCCTTTCTTC 1020ATCACAAACA TCATGGCCGT CATCTGCAAA GAGTCCTGCA ATGAGGATGT CATTGGGGCC 1080CTGCTCAATG TGTTTGTTTG GATCGGTTAT CTCTCTTCAG CAGTCAACCC ACTAGTCTAC 1140ACACTGTTCA ACAAGACCTA TAGGTCAGCC TTTTCACGGT ATATTCAGTG TCAGTACAAG 1200GAAAACAAAA AACCATTGCA GTTAATTTTA GTGAACACAA TACCGGCTTT GGCCTACAAG 1260TCTAGCCAAC TTCAAATGGG ACAAAAAAAG AATTCAAAGC AAGATGCCAA GACAACAGAT 1320AATGACTGCT CAATGGTTGC TCTAGGAAAG CAGTATTCTG AAGAGGCTTC TAAAGACAAT 1380AGCGACGGAG TGAATGAAAA GGTGAGCTGT GTGTGA 1416(123)SEQ ID NO122的资料(i)序列特征(A)长度471个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO122的序列描述Met Asp Ile Leu Cys Glu Glu Asn Thr Ser Leu Ser Ser Thr Thr Asn1 5 10 15Ser Leu Met Gln Leu Asn Asp Asp Asn Arg Leu Tyr Ser Asn Asp Phe20 25 30Asn Ser Gly Glu Ala Asn Thr Ser Asp Ala Phe Asn Trp Thr Val Asp35 40 45Ser Glu Asn Arg Thr Asn Leu Ser Cys Glu Gly Cys Leu Ser Pro Ser50 55 60Cys Leu Ser Leu Leu His Leu Gln Glu Lys Asn Trp Ser Ala Leu Leu65 70 75 80Thr Ala Val Val Ile Ile Leu Thr Ile Ala Gly Asn Ile Leu Val Ile
85 90 95Met Ala Val Ser Leu Glu Lys Lys Leu Gln Asn Ala Thr Asn Tyr Phe100 105 110Leu Met Ser Leu Ala Ile Ala Asp Met Leu Leu Gly Phe Leu Val Met115 120 125Pro Val Ser Met Leu Thr Ile Leu Tyr Gly Tyr Arg Trp Pro Leu Pro130 135 140Ser Lys Leu Cys Ala Val Trp Ile Tyr Leu Asp Val Leu Phe Ser Thr145 150 155 160Ala Ser lle Met His Leu Cys Ala Ile Ser Leu Asp Arg Tyr Val Ala165 170 175Ile Gln Asn Pro Ile His His Ser Arg Phe Asn Ser Arg Thr Lys Ala180 185 190Phe Leu Lys Ile Ile Ala Val Trp Thr Ile Ser Val Gly Ile Ser Met195 200 205Pro Ile Pro Val Phe Gly Leu Gln Asp Asp Ser Lys Val Phe Lys Glu210 215 220Gly Ser Cys Leu Leu Ala Asp Asp Asn Phe Val Leu Ile Gly Ser Phe225 230 235 240Val Ser Phe Phe Ile Pro Leu Thr Ile Met Val Ile Thr Tyr Phe Leu245 250 255Thr Ile Lys Ser Leu Gln Lys Glu Ala Thr Leu Cys Val Ser Asp Leu260 265 270Gly Thr Arg Ala Lys Leu Ala Ser Phe Ser Phe Leu Pro Gln Ser Ser275 280 285Leu Ser Ser Glu Lys Leu Phe Gln Arg Ser Ile His Arg Glu Pro Gly290 295 300Ser Tyr Thr Gly Arg Arg Thr Met Gln Ser Ile Ser Asn Glu Gln Lys305 310 315 320Ala Cys Lys Val Leu Gly Ile Val Phe Phe Leu Phe Val Val Met Trp325 330 335Cys Pro Phe Phe Ile Thr Asn Ile Met Ala Val Ile Cys Lys Glu Ser340 345 350Cys Asn Glu Asp Val Ile Gly Ala Leu Leu Asn Val Phe Val Trp Ile
355 360 365Gly Tyr Leu Ser Ser Ala Val Asn Pro Leu Val Tyr Thr Leu Phe Asn370 375 380Lys Thr Tyr Arg Ser Ala Phe Ser Arg Tyr Ile Gln Cys Gln Tyr Lys385 390 395 400Glu Asn Lys Lys Pro Leu Gln Leu Ile Leu Val Asn Thr Ile Pro Ala405 410 415Leu Ala Tyr Lys Ser Ser Gln Leu Gln Met Gly Gln Lys Lys Asn Ser420 425 430Lys Gln Asp Ala Lys Thr Thr Asp Asn Asp Cys Ser Met Val Ala Leu435 440 445Gly Lys Gln Tyr Ser Glu Glu Ala Ser Lys Asp Asn Ser Asp Gly Val450 455 460Asn Glu Lys Val Ser Cys Val465 470(124)SEQ ID NO123的资料(i)序列特征(A)长度27个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO123的序列描述GACCTCGAGG TTGCTTAAGA CTGAAGC27(125)SEQ ID NO124的资料(i)序列特征(A)长度27个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO124的序列描述ATTTCTAGAC ATATGTAGCT TGTACCG 27(126)SEQ ID NO125的资料(i)序列特征(A)长度1377个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO125的序列描述ATGGTGAACC TGAGGAATGC GGTGCATTCA TTCCTTGTGC ACCTAATTGG CCTATTGGTT 60TGGCAATGTG ATATTTCTGT GAGCCCAGTA GCAGCTATAG TAACTGACAT TTTCAATACC 120TCCGATGGTG GACGCTTCAA ATTCCCAGAC GGGGTACAAA ACTGGCCAGC ACTTTCAATC 180GTCATCATAA TAATCATGAC AATAGGTGGC AACATCCTTG TGATCATGGC AGTAAGCATG 240GAAAAGAAAC TGCACAATGC CACCAATTAC TTCTTAATGT CCCTAGCCAT TGCTGATATG 300CTAGTGGGAC TACTTGTCAT GCCCCTGTCT CTCCTGGCAA TCCTTTATGA TTATGTCTGG 360CCACTACCTA GATATTTGTG CCCCGTCTGG ATTTCTTTAG ATGTTTTATT TTCAACAGCG 420TCCATCATGC ACCTCTGCGC TATATCGCTG GATCGGTATG TAGCAATACG TAATCCTATT 480GAGCATAGCC GTTTCAATTC GCGGACTAAG GCCATCATGA AGATTGCTAT TGTTTGGGCA 540ATTTCTATAG GTGTATCAGT TCCTATCCCT GTGATTGGAC TGAGGGACGA AGAAAAGGTG 600TTCGTGAACA ACACGACGTG CGTGCTCAAC GACCCAAATT TCGTTCTTAT TGGGTCCTTC 660GTAGCTTTCT TCATACCGCT GACGATTATG GTGATTACGT ATTGCCTGAC CATCTACGTT 720CTGCGCCGAC AAGCTTTGAT GTTACTGCAC GGCCACACCG AGGAACCGCC TGGACTAAGT 780CTGGATTTCC TGAAGTGCTG CAAGAGGAAT ACGGCCGAGG AAGAGAACTC TGCAAACCCT 840AACCAAGACC AGAACGCACG CCGAAGAAAG AAGAAGGAGA GACGTCCTAG GGGCACCATG 900CAGGCTATCA ACAATGAAAG AAAAGCTTCG AAAGTCCTTG GGATTGTTTT CTTTGTGTTT 960CTGATCATGT GGTGCCCATT TTTCATTACC AATATTCTGT CTGTTCTTTG TGAGAAGTCC 1020TGTAACCAAA AGCTCATGGA AAAGCTTCTG AATGTGTTTG TTTGGATTGG CTATGTTTGT 1080TCAGGAATCA ATCCTCTGGT GTATACTCTG TTCAACAAAA TTTACCGAAG GGCATTCTCC 1140AACTATTTGC GTTGCAATTA TAAGGTAGAG AAAAAGCCTC CTGTCAGGCA GATTCCAAGA 1200GTTGCCGCCA CTGCTTTGTC TGGGAGGGAG CTTAATGTTA ACATTTATCG GCATACCAAT 1260GAACCGGTGA TCGAGAAAGC CAGTGACAAT GAGCCCGGTA TAGAGATGCA AGTTGAGAAT 1320TTAGAGTTAC CAGTAAATCC CTCCAGTGTG GTTAGCGAAA GGATTAGCAG TGTGTGA1377(127)SEQ ID NO126的资料(i)序列特征(A)长度458个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO126的序列描述Met Val Asn Leu Arg Asn Ala Val His Ser Phe Leu Val His Leu Ile1 5 10 15Gly Leu Leu Val Trp Gln Cys Asp Ile Ser Val Ser Pro Val Ala Ala20 25 30Ile Val Thr Asp Ile Phe Asn Thr Ser Asp Gly Gly Arg Phe Lys Phe35 40 45Pro Asp Gly Val Gln Asn Trp Pro Ala Leu Ser Ile Val Ile Ile Ile50 55 60Ile Met Thr Ile Gly Gly Asn Ile Leu Val Ile Met Ala Val Ser Met65 70 75 80Glu Lys Lys Leu His Asn Ala Thr Asn Tyr Phe Leu Met Ser Leu Ala85 90 95Ile A1a Asp Met Leu Val Gly Leu Leu Val Met Pro Leu Ser Leu Leu100 105 110Ala Ile Leu Tyr Asp Tyr Val Trp Pro Leu Pro Arg Tyr Leu Cys Pro115 120 125Val Trp Ile Ser Leu Asp Val Leu Phe Ser Thr Ala Ser Ile Met His130 135 140Leu Cys Ala Ile Ser Leu Asp Arg Tyr Val Ala Ile Arg Asn Pro Ile145 150 155 160Glu His Ser Arg Phe Asn Ser Arg Thr Lys Ala Ile Met Lys Ile Ala165 170 175Ile Val Trp Ala Ile Ser Ile Gly Val Ser Val Pro Ile Pro Val Ile180 185 190Gly Leu Arg Asp Glu Glu Lys Val Phe Val Asn Asn Thr Thr Cys Val195 200 205Leu Asn Asp Pro Asn Phe Val Leu Ile Gly Ser Phe Val Ala Phe Phe210 215 220Ile Pro Leu Thr Ile Met Val Ile Thr Tyr Cys Leu Thr Ile Tyr Val225 230 235 240Leu Arg Arg Gln Ala Leu Met Leu Leu His Gly His Thr Glu Glu Pro245 250 255Pro Gly Leu Ser Leu Asp Phe Leu Lys Cys Cys Lys Arg Asn Thr Ala260 265 270Glu Glu Glu Asn Ser Ala Asn Pro Asn Gln Asp Gln Asn Ala Arg Arg275 280 285Arg Lys Lys Lys Glu Arg Arg Pro Arg Gly Thr Met Gln Ala Ile Asn290 295 300Asn Glu Arg Lys Ala Ser Lys Val Leu Gly Ile Val Phe Phe Val Phe305 310 315 320Leu Ile Met Trp Cys Pro Phe Phe Ile Thr Asn Ile Leu Ser Val Leu325 330 335Cys Glu Lys Ser Cys Asn Gln Lys Leu Met Glu Lys Leu Leu Asn Val340 345 350Phe Val Trp Ile Gly Tyr Val Cys Ser Gly Ile Asn Pro Leu Val Tyr355 360 365Thr Leu Phe Asn Lys Ile Tyr Arg Arg Ala Phe Ser Asn Tyr Leu Arg370 375 380Cys Asn Tyr Lys Val Glu Lys Lys Pro Pro Val Arg Gln Ile Pro Arg385 390 395 400Val Ala Ala Thr Ala Leu Ser Gly Arg Glu Leu Asn Val Asn Ile Tyr405 410 415Arg His Thr Asn Glu Pro Val Ile Glu Lys Ala Ser Asp Asn Glu Pro420 425 430Gly Ile Glu Met Gln Val Glu Asn Leu Glu Leu Pro Val Asn Pro Ser435 440 445Ser Val Val Ser Glu Arg Ile Ser Ser Val450 455(128)SEQ ID NO127的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO127的序列描述GGTAAGCTTG GCAGTCCACG CCAGGCCTTC 30(129)SEQ ID NO128的资料(i)序列特征
(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO128的序列描述TCCGAATTCT CTGTAGACAC AAGGCTTTGG 30(130) SEQ ID NO129的资料(i)序列特征(A)长度1068个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO129的序列描述ATGGATCAGT TCCCTGAATC AGTGACAGAA AACTTTGAGT ACGATGATTT GGCTGAGGCC 60TGTTATATTG GGGACATCGT GGTCTTTGGG ACTGTGTTCC TGTCCATATT CTACTCCGTC 120ATCTTTGCCA TTGGCCTGGT GGGAAATTTG TTGGTAGTGT TTGCCCTCAC CAACAGCAAG 180AAGCCCAAGA GTGTCACCGA CATTTACCTC CTGAACCTGG CCTTGTCTGA TCTGCTGTTT 240GTAGCCACTT TGCCCTTCTG GACTCACTAT TTGATAAATG AAAAGGGCCT CCACAATGCC 300ATGTGCAAAT TCACTACCGC CTTCTTCTTC ATCGGCTTTT TTGGAAGCAT ATTCTTCATC 360ACCGTCATCA GCATTGATAG GTACCTGGCC ATCGTCCTGG CCGCCAACTC CATGAACAAC 420CGGACCGTGC AGCATGGCGT CACCATCAGC CTAGGCGTCT GGGCAGCAGC CATTTTGGTG 480GCAGCACCCC AGTTCATGTT CACAAAGCAG AAAGAAAATG AATGCCTTGG TGACTACCCC 540GAGGTCCTCC AGGAAATCTG GCCCGTGCTC CGCAATGTGG AAACAAATTT TCTTGGCTTC 600CTACTCCCCC TGCTCATTAT GAGTTATTGC TACTTCAGAA TCATCCAGAC GCTGTTTTCC 660TGCAAGAACC ACAAGAAAGC CAAAGCCATT AAACTGATCC TTCTGGTGGT CATCGTGTTT 720TTCCTCTTCT GGACACCCTA CAACGTTATG ATTTTCCTGG AGACGCTTAA GCTCTATGAC 780TTCTTTCCCA GTTGTGACAT GAGGAAGGAT CTGAGGCTGG CCCTCAGTGT GACTGAGACG 840GTTGCATTTA GCCATTGTTG CCTGAATCCT CTCATCTATG CATTTGCTGG GGAGAAGTTC 900AGAAGATACC TTTACCACCT GTATGGGAAA TGCCTGGCTG TCCTGTGTGG GCGCTCAGTC 960CACGTTGATT TCTCCTCATC TGAATCACAA AGGAGCAGGC ATGGAAGTGT TCTGAGCAGC 1020AATTTTACTT ACCACACGAG TGATGGAGAT GCATTGCTCC TTCTCTGA 1068(131)SEQ ID NO130的资料(i)序列特征(A)长度355个氨基酸(B)类型氨基酸(C)链型
(D)拓扑学不相关(ii)分子类蛋白质(xi)SEQ ID NO130的序列描述Met Asp Gln Phe Pro Glu Ser Val Thr Glu Asn Phe Glu Tyr Asp Asp1 5 10 15Leu Ala Glu Ala Cys Tyr Ile Gly Asp Ile Val Val Phe Gly Thr Val20 25 30Phe Leu Ser Ile Phe Tyr Ser Val Ile Phe Ala Ile Gly Leu Val Gly35 40 45Asn Leu Leu Val Val Phe Ala Leu Thr Asn Ser Lys Lys Pro Lys Ser50 55 60Val Thr Asp Ile Tyr Leu Leu Asn Leu Ala Leu Ser Asp Leu Leu Phe65 70 75 80Val Ala Thr Leu Pro Phe Trp Thr His Tyr Leu Ile Asn Glu Lys Gly85 90 95Leu His Asn Ala Met Cys Lys Phe Thr Thr Ala Phe Phe Phe Ile Gly100 105 110Phe Phe Gly Ser Ile Phe Phe Ile Thr Val Ile Ser Ile Asp Arg Tyr115 120 125Leu Ala Ile Val Leu Ala Ala Asn Ser Met Asn Asn Arg Thr Val Gln130 135 140His Gly Val Thr Ile Ser Leu Gly Val Trp Ala Ala Ala Ile Leu Val145 150 155 160Ala Ala Pro Gln Phe Met Phe Thr Lys Gln Lys Glu Asn Glu Cys Leu165 170 175Gly Asp Tyr Pro Glu Val Leu Gln Glu Ile Trp Pro Val Leu Arg Asn180 185 190Val Glu Thr Asn Phe Leu Gly Phe Leu Leu Pro Leu Leu Ile Met Ser195 200 205Tyr Cys Tyr Phe Arg Ile I1e Gln Thr Leu Phe Ser Cys Lys Asn His210 215 220Lys Lys Ala Lys Ala Ile Lys Leu Ile Leu Leu Val Val Ile Val Phe225 230 235 240Phe Leu Phe Trp Thr Pro Tyr Asn Val Met Ile Phe Leu Glu Thr Leu245 250 255Lys Leu Tyr Asp Phe Phe Pro Ser Cys Asp Met Arg Lys Asp Leu Arg260 265 270Leu Ala Leu Ser Val Thr Glu Thr Val Ala Phe Ser His Cys Cys Leu275 280 285Asn Pro Leu Ile Tyr Ala Phe Ala Gly Glu Lys Phe Arg Arg Tyr Leu290 295 300Tyr His Leu Tyr Gly Lys Cys Leu Ala Val Leu Cys Gly Arg Ser Val305 310 315 320His Val Asp Phe Ser Ser Ser Glu Ser Gln Arg Ser Arg His Gly Ser325 330 335Val Leu Ser Ser Asn Phe Thr Tyr His Thr Ser Asp Gly Asp Ala Leu340 345 350Leu Leu Leu355(132)SEQ ID NO131的资料(i)序列特征(A)长度32个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO131的序列描述GATCTCCAGT AGGCATAAGT GGACAATTCT GG 32(133)SEQ ID NO132的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO132的序列描述CTCCTTCGGT CCTCCTATCG TTGTCAGAAG 30(134)SEQ ID NO133的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO133的序列描述AGAAGGCCAA GATCGCGCGG CTGGCCCTCA 30(135)SEQ ID NO134的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO134的序列描述CGGCGCCACC GCACGAAAAA GCTCATCTTC 30(136)SEQ ID NO135的资料(i)序列特征(A)长度33个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO135的序列描述GCCAAGAAGC GGGTGAAGTT CCTGGTGGTG GCA 33(137)SEQ ID NO136的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO136的序列描述CAGGCGGAAG GTGAAAGTCC TGGTCCTCGT 30(138)SEQ ID NO137的资料(i)序列特征(A)长度33个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO137的序列描述CGGCGCCTGC GGGCCAAGCG GCTGGTGGTG GTG 33(139)SEQ ID NO138的资料(i)序列特征(A)长度31个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO138的序列描述CCAAGCACAA AGCCAAGAAA GTGACCATCA C31(140)SEQ ID NO139的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO139的序列描述GCGCCGGCGC ACCAAATGCT TGCTGGTGGT 30(141)SEQ ID NO140的资料(i)序列特征
(A)长度41个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO140的序列描述CAAAAAGCTG AAGAAATCTA AGAAGATCAT CTTTATTGTC G 41(142)SEQ ID NO141的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO141的序列描述CAAGACCAAG GCAAAACGCA TGATCGCCAT 30(143)SEQ ID NO142的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO142的序列描述GTCAAGGAGA AGTCCAAAAG GATCATCATC 30(144)SEQ ID NO143的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO143的序列描述CGCCGCGTGC GGGCCAAGCA GCTCCTGCTC 30(145)SEQ ID NO144的资料(i)序列特征(A)长度33个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO144的序列描述CCTGATAAGC GCTATAAAAT GGTCCTGTTT CGA 33(146)SEQ ID NO145的资料(i)序列特征(A)长度36个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO145的序列描述GAAAGACAAA AGAGAGTCAA GAGGATGTCT TTATTG 36(147)SEQ ID NO146的资料(i)序列特征(A)长度33个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO146的序列描述CGGAGAAAGA GGGTGAAACG CACAGCCATC GCC 33(148)SEQ ID NO147的资料(i)序列特征(A)长度30个碱基对(B)类型核酸
(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO147的序列描述AAGCTTCAGC GGGCCAAGGC ACTGGTCACC 30(149)SEQ ID NO148的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO148的序列描述CAGCGGCAGA AGGCAAAAAG GGTGGCCATC 30(150)SEQ ID NO149的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO149的序列描述CGGCAGAAGG CGAAGCGCAT GATCCTCGCG 30(151)SEQ ID NO150的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO150的序列描述GAGCGCAACA AGGCCAAAAA GGTGATCATC 30(152)SEQ ID NO151的资料(i)序列特征(A)长度39个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO151的序列描述GGTGTAAACA AAAAGGCTAA AAACACAATT ATTCTTATT39(153) SEQ ID NO152的资料(i)序列特征(A)长度27个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO152的序列描述GAGAGCCAGC TCAAGAGCAC CGTGGTG 27(154)SEQ ID NO153的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO153的序列描述CCACAAGCAA ACCAAGAAAA TGCTGGCTGT 30(155)SEQ ID NO154的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO154的序列描述CATCAAGTGT ATCATGTGCC AAGTACGCCC 30(156)SEQ ID NO155的资料(i)序列特征(A)长度34个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO155的序列描述CTAGAGAGTC AGATGAAGTG TACAGTAGTG GCAC 34(157)SEQ ID NO156的资料(i)序列特征(A)长度36个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO156的序列描述CTAGAGAGTC AGATGAAGTG TACAGTAGTG GCAC 34(158)SEQ ID NO157的资料(i)序列特征(A)长度33个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO157的序列描述GCTGAGGTTC GCAATAAACT AACCATGTTT GTG 33(159)SEQ ID NO158的资料(i)序列特征
(A)长度29个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO158的序列描述GGGAGGCCGA GCTGAAAGCC ACCCTGCTC 29(160)SEQ ID NO159的资料(i)序列特征(A)长度31个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO159的序列描述CAAGATCAAG AGAGCCAAAA CCTTCATCAT G 31(161)SEQ ID NO160的资料(i)序列特征(A)长度31个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO160的序列描述CCGGAGACAA GTGAAGAAGA TGCTGTTTGT C 31(162)SEQ ID NO161的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO161的序列描述GCAAGGACCA GATCAAGCGG CTGGTGCTCA 30(163)SEQ ID NO162的资料(i)序列特征(A)长度34个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO162的序列描述CAAGAAAGCC AAAGCCAAGA AACTGATCCT TCTG34(164)SEQ ID NO163的资料(i)序列特征(A)长度1068个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO163的序列描述ATGGAAGATT TGGAGGAAAC ATTATTTGAA GAATTTGAAA ACTATTCCTA TGACCTAGAC 60TATTACTCTC TGGAGTCTGA TTTGGAGGAG AAAGTCCAGC TGGGAGTTGT TCACTGGGTC 120TCCCTGGTGT TATATTGTTT GGCTTTTGTT CTGGGAATTC CAGGAAATGC CATCGTCATT 180TGGTTCACGG GGCTCAAGTG GAAGAAGACA GTCACCACTC TGTGGTTCCT CAATCTAGCC 240ATTGCGGATT TCATTTTTCT TCTCTTTCTG CCCCTGTACA TCTCCTATGT GGCCATGAAT 300TTCCACTGGC CCTTTGGCAT CTGGCTGTGC AAAGCCAATT CCTTCACTGC CCAGTTGAAC 360ATGTTTGCCA GTGTTTTTTT CCTGACAGTG ATCAGCCTGG ACCACTATAT CCACTTGATC 420CATCCTGTCT TATCTCATCG GCATCGAACC CTCAAGAACT CTCTGATTGT CATTATATTC 480ATCTGGCTTT TGGCTTCTCT AATTGGCGGT CCTGCCCTGT ACTTCCGGGA CACTGTGGAG 540TTCAATAATC ATACTCTTTG CTATAACAAT TTTCAGAAGC ATGATCCTGA CCTCACTTTG 600ATCAGGCACC ATGTTCTGAC TTGGGTGAAA TTTATCATTG GCTATCTCTT CCCTTTGCTA 660ACAATGAGTA TTTGCTACTT GTGTCTCATC TTCAAGGTGA AGAAGCGAAC AGTCCTGATC 720TCCAGTAGGC ATAAGTGGAC AATTCTGGTT GTGGTTGTGG CCTTTGTGGT TTGCTGGACT 780CCTTATCACC TGTTTAGCAT TTGGGAGCTC ACCATTCACC ACAATAGCTA TTCCCACCAT 840GTGATGCAGG CTGGAATCCC CCTCTCCACT GGTTTGGCAT TCCTCAATAG TTGCTTGAAC 900CCCATCCTTT ATGTCCTAAT TAGTAAGAAG TTCCAAGCTC GCTTCCGGTC CTCAGTTGCT 960GAGATACTCA AGTACACACT GTGGGAAGTC AGCTGTTCTG GCACAGTGAG TGAACAGCTC 1020AGGAACTCAG AAACCAAGAA TCTGTGTCTC CTGGAAACAG CTCAATAA 1068(165)SEQ ID NO164的资料(i)序列特征
(A)长度355个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO164的序列描述Met Glu Asp Leu Glu Glu Thr Leu Phe Glu Glu Phe Glu Asn Tyr Ser1 5 10 15Tyr Asp Leu Asp Tyr Tyr Ser Leu Glu Ser Asp Leu Glu Glu Lys Val20 25 30Gln Leu Gly Val Val His Trp Val Ser Leu Val Leu Tyr Cys Leu Ala35 40 45Phe Val Leu Gly Ile Pro Gly Asn Ala Ile Val Ile Trp Phe Thr Gly50 55 60Leu Lys Trp Lys Lys Thr Val Thr Thr Leu Trp Phe Leu Asn Leu Ala65 70 75 80Ile Ala Asp Phe Ile Phe Leu Leu Phe Leu Pro Leu Tyr Ile Ser Tyr85 90 95Val Ala Met Asn Phe His Trp Pro Phe Gly Ile Trp Leu Cys Lys Ala100 105 110Asn Ser Phe Thr Ala Gln Leu Asn Met Phe Ala Ser Val Phe Phe Leu115 120 125Thr Val Ile Ser Leu Asp His Tyr Ile His Leu Ile His Pro Val Leu130 135 140Ser His Arg His Arg Thr Leu Lys Asn Ser Leu Ile Val Ile Ile Phe145 150 155 160Ile Trp Leu Leu Ala Ser Leu Ile Gly Gly Pro Ala Leu Tyr Phe Arg165 170 175Asp Thr Val Glu Phe Asn Asn His Thr Leu Cys Tyr Asn Asn Phe Gln180 185 190Lys His Asp Pro Asp Leu Thr Leu Ile Arg His His Val Leu Thr Trp195 200 205Val Lys Phe Ile Ile Gly Tyr Leu Phe Pro Leu Leu Thr Met Ser Ile210 215 220Cys Tyr Leu Cys Leu Ile Phe Lys Val Lys Lys Arg Thr Val Leu Ile225 230 235 240Ser Ser Arg His Lys Trp Thr Ile Leu Val Val Val Val Ala Phe Val245 250 255Val Cys Trp Thr Pro Tyr His Leu Phe Ser Ile Trp Glu Leu Thr Ile260 265 270His His Asn Ser Tyr Ser His His Val Met Gln Ala Gly Ile Pro Leu275 280 285Ser Thr Gly Leu Ala Phe Leu Asn Ser Cys Leu Asn Pro Ile Leu Tyr290 295 300Val Leu Ile Ser Lys Lys Phe Gln Ala Arg Phe Arg Ser Ser Val Ala305 310 315 320Glu Ile Leu Lys Tyr Thr Leu Trp Glu Val Ser Cys Ser Gly Thr Val325 330 335Ser Glu Gln Leu Arg Asn Ser Glu Thr Lys Asn Leu Cys Leu Leu Glu340 345 350Thr Ala Gln355(166)SEQ ID NO165的资料(i)序列特征(A)长度1089个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO165的序列描述ATGGGCAACC ACACGTGGGA GGGCTGCCAC GTGGACTCGC GCGTGGACCA CCTCTTTCCG 60CCATCCCTCT ACATCTTTGT CATCGGCGTG GGGCTGCCCA CCAACTGCCT GGCTCTGTGG 120GCGGCCTACC GCCAGGTGCA ACAGCGCAAC GAGCTGGGCG TCTACCTGAT GAACCTCAGC 180ATCGCCGACC TGCTGTACAT CTGCACGCTG CCGCTGTGGG TGGACTACTT CCTGCACCAC 240GACAACTGGA TCCACGGCCC CGGGTCCTGC AAGCTCTTTG GGTTCATCTT CTACACCAAT 300ATCTACATCA GCATCGCCTT CCTGTGCTGC ATCTCGGTGG ACCGCTACCT GGCTGTGGCC 360CACCCACTCC GCTTCGCCCG CCTGCGCCGC GTCAAGACCG CCGTGGCCGT GAGCTCCGTG 420GTCTGGGCCA CGGAGCTGGG CGCCAACTCG GCGCCCCTGT TCCATGACGA GCTCTTCCGA 480GACCGCTACA ACCACACCTT CTGCTTTGAG AAGTTCCCCA TGGAAGGCTG GGTGGCCTGG 540ATGAACCTCT ATCGGGTGTT CGTGGGCTTC CTCTTCCCGT GGGCGCTCAT GCTGCTGTCG 600TACCGGGGCA TCCTGCGGGC CGTGCGGGGC AGCGTGTCCA CCGAGCGCCA GGAGAAGGCC 660AAGATCGCGC GGCTGGCCCT CAGCCTCATC GCCATCGTGC TGGTCTGCTT TGCGCCCTAT 720CACGTGCTCT TGCTGTCCCG CAGCGCCATC TACCTGGGCC GCCCCTGGGA CTGCGGCTTC 780GAGGAGCGCG TCTTTTCTGC ATACCACAGC TCACTGGCTT TCACCAGCCT CAACTGTGTG 840GCGGACCCCA TCCTCTACTG CCTGGTCAAC GAGGGCGCCC GCAGCGATGT GGCCAAGGCC 900CTGCACAACC TGCTCCGCTT TCTGGCCAGC GACAAGCCCC AGGAGATGGC CAATGCCTCG 960CTCACCCTGG AGACCCCACT CACCTCCAAG AGGAACAGCA CAGCCAAAGC CATGACTGGC 1020AGCTGGGCGG CCACTCCGCC TTCCCAGGGG GACCAGGTGC AGCTGAAGAT GCTGCCGCCA 1080GCACAATGA 1089(167)SEQ ID NO166的资料(i)序列特征(A)长度362个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO166的序列描述Met Gly Asn His Thr Trp Glu Gly Cys His Val Asp Ser Arg Val Asp1 5 10 15His Leu Phe Pro Pro Ser Leu Tyr Ile Phe Val Ile Gly Val Gly Leu20 25 30Pro Thr Asn Cys Leu Ala Leu Trp Ala Ala Tyr Arg Gln Val Gln Gln35 40 45Arg Asn Glu Leu Gly Val Tyr Leu Met Asn Leu Ser Ile Ala Asp Leu50 55 60Leu Tyr Ile Cys Thr Leu Pro Leu Trp Val Asp Tyr Phe Leu His His65 70 75 80Asp Asn Trp Ile His Gly Pro Gly Ser Cys Lys Leu Phe Gly Phe Ile85 90 95Phe Tyr Thr Asn Ile Tyr Ile Ser Ile Ala Phe Leu Cys Cys Ile Ser100 105 110Val Asp Arg Tyr Leu Ala Val Ala His Pro Leu Arg Phe Ala Arg Leu115 120 125Arg Arg Val Lys Thr Ala Val Ala Val Ser Ser Val Val Trp Ala Thr130 135 140Glu Leu Gly Ala Asn Ser Ala Pro Leu Phe His Asp Glu Leu Phe Arg145 150 155 160Asp Arg Tyr Asn His Thr Phe Cys Phe Glu Lys Phe Pro Met Glu Gly165 170 175Trp Val Ala Trp Met Asn Leu Tyr Arg Val Phe Val Gly Phe Leu Phe
180 185 190Pro Trp Ala Leu Met Leu Leu Ser Tyr Arg Gly Ile Leu Arg Ala Val195 200 205Arg Gly Ser Val Ser Thr Glu Arg Gln Glu Lys Ala Lys Ile Ala Arg210 215 220Leu Ala Leu Ser Leu Ile Ala Ile Val Leu Val Cys Phe Ala Pro Tyr225 230 235 240His Val Leu Leu Leu Ser Arg Ser Ala Ile Tyr Leu Gly Arg Pro Trp245 250 255Asp Cys Gly Phe Glu Glu Arg Val Phe Ser Ala Tyr His Ser Ser Leu260 265 270Ala Phe Thr Ser Leu Asn Cys Val Ala Asp Pro Ile Leu Tyr Cys Leu275 280 285Val Asn Glu Gly Ala Arg Ser Asp Val Ala Lys Ala Leu His Asn Leu290 295 300Leu Arg Phe Leu Ala Ser Asp Lys Pro Gln Glu Met Ala Asn Ala Ser305 310 315 320Leu Thr Leu Glu Thr Pro Leu Thr Ser Lys Arg Asn Ser Thr Ala Lys325 330 335Ala Met Thr Gly Ser Trp Ala Ala Thr Pro Pro Ser Gln Gly Asp Gln340 345 350Val Gln Leu Lys Met Leu Pro Pro Ala Gln355 360(168)SEQ ID NO167的资料(i)序列特征(A)长度1002个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO167的序列描述ATGGAGTCCT CAGGCAACCC AGAGAGCACC ACCTTTTTTT ACTATGACCT TCAGAGCCAG 60CCGTGTGAGA ACCAGGCCTG GGTCTTTGCT ACCCTCGCCA CCACTGTCCT GTACTGCCTG 120GTGTTTCTCC TCAGCCTAGT GGGCAACAGC CTGGTCCTGT GGGTCCTGGT GAAGTATGAG 180AGCCTGGAGT CCCTCACCAA CATCTTCATC CTCAACCTGT GCCTCTCAGA CCTGGTGTTC 240GCCTGCTTGT TGCCTGTGTG GATCTCCCCA TACCACTGGG GCTGGGTGCT GGGAGACTTC 300CTCTGCAAAC TCCTCAATAT GATCTTCTCC ATCAGCCTCT ACAGCAGCAT CTTCTTCCTG 360ACCATCATGA CCATCCACCG CTACCTGTCG GTAGTGAGCC CCCTCTCCAC CCTGCGCGTC 420CCCACCCTCC GCTGCCGGGT GCTGGTGACC ATGGCTGTGT GGGTAGCCAG CATCCTGTCC 480TCCATCCTCG ACACCATCTT CCACAAGGTG CTTTCTTCGG GCTGTGATTA TTCCGAACTC 540ACGTGGTACC TCACCTCCGT CTACCAGCAC AACCTCTTCT TCCTGCTGTC CCTGGGGATT 600ATCCTGTTCT GCTACGTGGA GATCCTCAGG ACCCTGTTCC GCTCACGCTC CAAGCGGCGC 660CACCGCACGA AAAAGCTCAT CTTCGCCATC GTGGTGGCCT ACTTCCTCAG CTGGGGTCCC 720TACAACTTCA CCCTGTTTCT GCAGACGCTG TTTCGGACCC AGATCATCCG GAGCTGCGAG 780GCCAAACAGC AGCTAGAATA CGCCCTGCTC ATCTGCCGCA ACCTCGCCTT CTCCCACTGC 840TGCTTTAACC CGGTGCTCTA TGTCTTCGTG GGGGTCAAGT TCCGCACACA CCTGAAACAT 900GTTCTCCGGC AGTTCTGGTT CTGCCGGCTG CAGGCACCCA GCCCAGCCTC GATCCCCCAC 960TCCCCTGGTG CCTTCGCCTA TGAGGGCGCC TCCTTCTACT GA1002(169)SEQ ID NO168的资料(i)序列特征(A)长度333个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO168的序列描述Met Glu Ser Ser Gly Asn Pro Glu Ser Thr Thr Phe Phe Tyr Tyr Asp1 5 10 15Leu Gln Ser Gln Pro Cys Glu Asn Gln Ala Trp Val Phe Ala Thr Leu20 25 30Ala Thr Thr Val Leu Tyr Cys Leu Val Phe Leu Leu Ser Leu Val Gly35 40 45Asn Ser Leu Val Leu Trp Val Leu Val Lys Tyr Glu Ser Leu Glu Ser50 55 60Leu Thr Asn Ile Phe Ile Leu Asn Leu Cys Leu Ser Asp Leu Val Phe65 70 75 80Ala Cys Leu Leu Pro Val Trp Ile Ser Pro Tyr His Trp Gly Trp Val85 90 95Leu Gly Asp Phe Leu Cys Lys Leu Leu Asn Met Ile Phe Ser Ile Ser100 l05 110Leu Tyr Ser Ser Ile Phe Phe Leu Thr Ile Met Thr Ile His Arg Tyr115 120 125Leu Ser Val Val Ser Pro Leu Ser Thr Leu Arg Val Pro Thr Leu Arg
130 135 140Cys Arg Val Leu Val Thr Met Ala Val Trp Val Ala Ser Ile Leu Ser145 150 155 160Ser Ile Leu Asp Thr Ile Phe His Lys Val Leu Ser Ser Gly Cys Asp165 170 175Tyr Ser Glu Leu Thr Trp Tyr Leu Thr Ser Val Tyr Gln His Asn Leu180 185 190Phe Phe Leu Leu Ser Leu Gly Ile Ile Leu Phe Cys Tyr Val Glu Ile195 200 205Leu Arg Thr Leu Phe Arg Ser Arg Ser Lys Arg Arg His Arg Thr Lys210 215 220Lys Leu Ile Phe Ala Ile Val Val Ala Tyr Phe Leu Ser Trp Gly Pro225 230 235 240Tyr Asn Phe Thr Leu Phe Leu Gln Thr Leu Phe Arg Thr Gln Ile Ile245 250 255Arg Ser Cys Glu Ala Lys Gln Gln Leu Glu Tyr Ala Leu Leu Ile Cys260 265 270Arg Asn Leu Ala Phe Ser His Cys Cys Phe Asn Pro Val Leu Tyr Val275 280 285Phe Val Gly Val Lys Phe Arg Thr His Leu Lys His Val Leu Arg Gln290 295 300Phe Trp Phe Cys Arg Leu Gln Ala Pro Ser Pro Ala Ser Ile Pro His305 310 315 320Ser Pro Gly Ala Phe Ala Tyr Glu Gly Ala Ser Phe Tyr325 330(170)SEQ ID NO169的资料(i)序列特征(A)长度987个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO169的序列描述ATGGACAACG CCTCGTTCTC GGAGCCCTGG CCCGCCAACG CATCGGGCCC GGACCCGGCG 60CTGAGCTGCT CCAACGCGTC GACTCTGGCG CCGCTGCCGG CGCCGCTGGC GGTGGCTGTA 120CCAGTTGTCT ACGCGGTGAT CTGCGCCGTG GGTCTGGCGG GCAACTCCGC CGTGCTGTAC 180GTGTTGCTGC GGGCGCCCCG CATGAAGACC GTCACCAACC TGTTCATCCT CAACCTGGCC 240ATCGCCGACG AGCTCTTCAC GCTGGTGCTG CCCATCAACA TCGCCGACTT CCTGCTGCGG 300CAGTGGCCCT TCGGGGAGCT CATGTGCAAG CTCATCGTGG CTATCGACCA GTACAACACC 360TTCTCCAGCC TCTACTTCCT CACCGTCATG AGCGCCGACC GCTACCTGGT GGTGTTGGCC 420ACTGCGGAGT CGCGCCGGGT GGCCGGCCGC ACCTACAGCG CCGCGCGCGC GGTGAGCCTG 480GCCGTGTGGG GGATCGTCAC ACTCGTCGTG CTGCCCTTCG CAGTCTTCGC CCGGCTAGAC 540GACGAGCAGG GCCGGCGCCA GTGCGTGCTA GTCTTTCCGC AGCCCGAGGC CTTCTGGTGG 600CGCGCGAGCC GCCTCTACAC GCTCGTGCTG GGCTTCGCCA TCCCCGTGTC CACCATCTGT 660GTCCTCTATA CCACCCTGCT GTGCCGGCTG CATGCCATGC GGCTGGACAG CCACGCCAAG 720GCCCTGGAGC GCGCCAAGAA GCGGGTGAAG TTCCTGGTGG TGGCAATCCT GGCGGTGTGC 780CTCCTCTGCT GGACGCCCTA CCACCTGAGC ACCGTGGTGG CGCTCACCAC CGACCTCCCG 840CAGACGCCGC TGGTCATCGC TATCTCCTAC TTCATCACCA GCCTGACGTA CGCCAACAGC 900TGCCTCAACC CCTTCCTCTA CGCCTTCCTG GACGCCAGCT TCCGCAGGAA CCTCCGCCAG 960CTGATAACTT GCCGCGCGGC AGCCTGA 987(171)SEQ ID NO170的资料(i)序列特征(A)长度328个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO170的序列描述Met Asp Asn Ala Ser Phe Ser Glu Pro Trp Pro Ala Asn Ala Ser Gly1 5 10 15Pro Asp Pro Ala Leu Ser Cys Ser Asn Ala Ser Thr Leu Ala Pro Leu20 25 30Pro Ala Pro Leu Ala Val Ala Val Pro Val Val Tyr Ala Val Ile Cys35 40 45Ala Val Gly Leu Ala Gly Asn Ser Ala Val Leu Tyr Val Leu Leu Arg50 55 60Ala Pro Arg Met Lys Thr Val Thr Asn Leu Phe Ile Leu Asn Leu Ala65 70 75 80Ile Ala Asp Glu Leu Phe Thr Leu Val Leu Pro Ile Asn Ile Ala Asp85 90 95Phe Leu Leu Arg Gln Trp Pro Phe Gly Glu Leu Met Cys Lys Leu Ile100 105 110Val Ala Ile Asp Gln Tyr Asn Thr Phe Ser Ser Leu Tyr Phe Leu Thr
115 120 125Val Met Ser Ala Asp Arg Tyr Leu Val Val Leu Ala Thr Ala Glu Ser130 135 140Arg Arg Val Ala Gly Arg Thr Tyr Ser Ala Ala Arg Ala Val Ser Leu145 150 155 160Ala Val Trp Gly Ile Val Thr Leu Val Val Leu Pro Phe Ala Val Phe165 170 175Ala Arg Leu Asp Asp Glu Gln Gly Arg Arg Gln Cys Val Leu Val Phe180 185 190Pro Gln Pro Glu Ala Phe Trp Trp Arg Ala Ser Arg Leu Tyr Thr Leu195 200 205Val Leu Gly Phe Ala Ile Pro Val Ser Thr Ile Cys Val Leu Tyr Thr210 215 220Thr Leu Leu Cys Arg Leu His Ala Met Arg Leu Asp Ser His Ala Lys225 230 235 240Ala Leu Glu Arg Ala Lys Lys Arg Val Lys Phe Leu Val Val Ala Ile245 250 255Leu Ala Val Cys Leu Leu Cys Trp Thr Pro Tyr His Leu Ser Thr Val260 265 270Val Ala Leu Thr Thr Asp Leu Pro Gln Thr Pro Leu Val Ile Ala Ile275 280 285Ser Tyr Phe Ile Thr Ser Leu Thr Tyr Ala Asn Ser Cys Leu Ash Pro290 295 300Phe Leu Tyr Ala Phe Leu Asp Ala Ser Phe Arg Arg Asn Leu Arg Gln305 310 315 320Leu Ile Thr Cys Arg Ala Ala Ala325(172)SEQ ID NO171的资料(i)序列特征(A)长度1002个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO171的序列描述ATGCAGGCCG CTGGGCACCC AGAGCCCCTT GACAGCAGGG GCTCCTTCTC CCTCCCCACG 60ATGGGTGCCA ACGTCTCTCA GGACAATGGC ACTGGCCACA ATGCCACCTT CTCCGAGCCA 120CTGCCGTTCC TCTATGTGCT CCTGCCCGCC GTGTACTCCG GGATCTGTGC TGTGGGGCTG 180ACTGGCAACA CGGCCGTCAT CCTTGTAATC CTAAGGGCGC CCAAGATGAA GACGGTGACC 240AACGTGTTCA TCCTGAACCT GGCCGTCGCC GACGGGCTCT TCACGCTGGT ACTGCCTGTC 300AACATCGCGG AGCACCTGCT GCAGTACTGG CCCTTCGGGG AGCTGCTCTG CAAGCTGGTG 360CTGGCCGTCG ACCACTACAA CATCTTCTCC AGCATCTACT TCCTAGCCGT GATGAGCGTG 420GACCGATACC TGGTGGTGCT GGCCACCGTG AGGTCCCGCC ACATGCCCTG GCGCACCTAC 480CGGGGGGCGA AGGTCGCCAG CCTGTGTGTC TGGCTGGGCG TCACGGTCCT GGTTCTGCCC 540TTCTTCTCTT TCGCTGGCGT CTACAGCAAC GAGCTGCAGG TCCCAAGCTG TGGGCTGAGC 600TTCCCGTGGC CCGAGCAGGT CTGGTTCAAG GCCAGCCGTG TCTACACGTT GGTCCTGGGC 660TTCGTGCTGC CCGTGTGCAC CATCTGTGTG CTCTACACAG ACCTCCTGCG CAGGCTGCGG 720GCCGTGCGGC TCCGCTCTGG AGCCAAGGCT CTAGGCAAGG CCAGGCGGAA GGTGAAAGTC 780CTGGTCCTCG TCGTGCTGGC CGTGTGCCTC CTCTGCTGGA CGCCCTTCCA CCTGGCCTCT 840GTCGTGGCCC TGACCACGGA CCTGCCCCAG ACCCCACTGG TCATCAGTAT GTCCTACGTC 900ATCACCAGCC TCACGTACGC CAACTCGTGC CTGAACCCCT TCCTCTACGC CTTTCTAGAT 960GACAACTTCC GGAAGAACTT CCGCAGCATA TTGCGGTGCT GA1002(173)SEQ ID NO172的资料(i)序列特征(A)长度333个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO172的序列描述Met Gln Ala Ala Gly His Pro Glu Pro Leu Asp Ser Arg Gly Ser Phe1 5 10 15Ser Leu Pro Thr Met Gly Ala Asn Val Ser Gln Asp Asn Gly Thr Gly20 25 30His Asn Ala Thr Phe Ser Glu Pro Leu Pro Phe Leu Tyr Val Leu Leu35 40 45Pro Ala Val Tyr Ser Gly Ile Cys Ala Val Gly Leu Thr Gly Asn Thr50 55 60Ala Val Ile Leu Val Ile Leu Arg Ala Pro Lys Met Lys Thr Val Thr65 70 75 80Asn Val Phe Ile Leu Asn Leu Ala Val Ala Asp Gly Leu Phe Thr Leu85 90 95Val Leu Pro Val Asn Ile Ala Glu His Leu Leu Gln Tyr Trp Pro Phe100 105 110Gly Glu Leu Leu Cys Lys Leu Val Leu Ala Val Asp His Tyr Asn Ile115 120 125Phe Ser Ser Ile Tyr Phe Leu Ala Val Met Ser Val Asp Arg Tyr Leu130 135 140Val Val Leu Ala Thr Val Arg Ser Arg His Met Pro Trp Arg Thr Tyr145 150 155 160Arg Gly Ala Lys Val Ala Ser Leu Cys Val Trp Leu Gly Val Thr Val165 170 175Leu Val Leu Pro Phe Phe Ser Phe Ala Gly Val Tyr Ser Asn Glu Leu180 185 190Gln Val Pro Ser Cys Gly Leu Ser Phe Pro Trp Pro Glu Gln Val Trp195 200 205Phe Lys Ala Ser Arg Val Tyr Thr Leu Val Leu Gly Phe Val Leu Pro210 215 220Val Cys Thr Ile Cys Val Leu Tyr Thr Asp Leu Leu Arg Arg Leu Arg225 230 235 240Ala Val Arg Leu Arg Ser Gly Ala Lys Ala Leu Gly Lys Ala Arg Arg245 250 255Lys Val Lys Val Leu Val Leu Val Val Leu Ala Val Cys Leu Leu Cys260 265 270Trp Thr Pro Phe His Leu Ala Ser Val Val Ala Leu Thr Thr Asp Leu275 280 285Pro Gln Thr Pro Leu Val Ile Ser Met Ser Tyr Val Ile Thr Ser Leu290 295 300Thr Tyr Ala Asn Ser Cys Leu Asn Pro Phe Leu Tyr Ala Phe Leu Asp305 310 315 320Asp Asn Phe Arg Lys Asn Phe Arg Ser Ile Leu Arg Cys325 330(174)SEQ ID NO173的资料(i)序列特征(A)长度1107个碱基对(B)类型核酸(C)链型单链
(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO173的序列描述ATGGTCCTTG AGGTGAGTGA CCACCAAGTG CTAAATGACG CCGAGGTTGC CGCCCTCCTG 60GAGAACTTCA GCTCTTCCTA TGACTATGGA GAAAACGAGA GTGACTCGTG CTGTACCTCC 120CCGCCCTGCC CACAGGACTT CAGCCTGAAC TTCGACCGGG CCTTCCTGCC AGCCCTCTAC 180AGCCTCCTCT TTCTGCTGGG GCTGCTGGGC AACGGCGCGG TGGCAGCCGT GCTGCTGAGC 240CGGCGGACAG CCCTGAGCAG CACCGACACC TTCCTGCTCC ACCTAGCTGT AGCAGACACG 300CTGCTGGTGC TGACACTGCC GCTCTGGGCA GTGGACGCTG CCGTCCAGTG GGTCTTTGGC 360TCTGGCCTCT GCAAAGTGGC AGGTGCCCTC TTCAACATCA ACTTCTACGC AGGAGCCCTC 420CTGCTGGCCT GCATCAGCTT TGACCGCTAC CTGAACATAG TTCATGCCAC CCAGCTCTAC 480CGCCGGGGGC CCCCGGCCCG CGTGACCCTC ACCTGCCTGG CTGTCTGGGG GCTCTGCCTG 540CTTTTCGCCC TCCCAGACTT CATCTTCCTG TCGGCCCACC ACGACGAGCG CCTCAACGCC 600ACCCACTGCC AATACAACTT CCCACAGGTG GGCCGCACGG CTCTGCGGGT GCTGCAGCTG 660GTGGCTGGCT TTCTGCTGCC CCTGCTGGTC ATGGCCTACT GCTATGCCCA CATCCTGGCC 720GTGCTGCTGG TTTCCAGGGG CCAGCGGCGC CTGCGGGCCA AGCGGCTGGT GGTGGTGGTC 780GTGGTGGCCT TTGCCCTCTG CTGGACCCCC TATCACCTGG TGGTGCTGGT GGACATCCTC 840ATGGACCTGG GCGCTTTGGC CCGCAACTGT GGCCGAGAAA GCAGGGTAGA CGTGGCCAAG 900TCGGTCACCT CAGGCCTGGG CTACATGCAC TGCTGCCTCA ACCCGCTGCT CTATGCCTTT 960GTAGGGGTCA AGTTCCGGGA GCGGATGTGG ATGCTGCTCT TGCGCCTGGG CTGCCCCAAC 1020CAGAGAGGGC TCCAGAGGCA GCCATCGTCT TCCCGCCGGG ATTCATCCTG GTCTGAGACC 1080TCAGAGGCCT CCTACTCGGG CTTGTGA 1107(175)SEQ ID NO174的资料(i)序列特征(A)长度368个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO174的序列描述Met Val Leu Glu Val Ser Asp His Gln Val Leu Asn Asp Ala Glu Val1 5 10 15Ala Ala Leu Leu Glu Asn Phe Ser Ser Ser Tyr Asp Tyr Gly Glu Asn20 25 30Glu Ser Asp Ser Cys Cys Thr Ser Pro Pro Cys Pro Gln Asp Phe Ser35 40 45Leu Asn Phe Asp Arg Ala Phe Leu Pro Ala Leu Tyr Ser Leu Leu Phe50 55 60Leu Leu Gly Leu Leu Gly Asn Gly Ala Val Ala Ala Val Leu Leu Ser65 70 75 80Arg Arg Thr Ala Leu Ser Ser Thr Asp Thr Phe Leu Leu His Leu Ala
85 90 95Val Ala Asp Thr Leu Leu Val Leu Thr Leu Pro Leu Trp Ala Val Asp100 105 110Ala Ala Val Gln Trp Val Phe Gly Ser Gly Leu Cys Lys Val Ala Gly115 120 125Ala Leu Phe Asn Ile Asn Phe Tyr Ala Gly Ala Leu Leu Leu Ala Cys130 135 140Ile Ser Phe Asp Arg Tyr Leu Asn Ile Val His Ala Thr Gln Leu Tyr145 150 155 160Arg Arg Gly Pro Pro Ala Arg Val Thr Leu Thr Cys Leu Ala Val Trp165 170 175Gly Leu Cys Leu Leu Phe Ala Leu Pro Asp Phe Ile Phe Leu Ser Ala180 185 190His His Asp Glu Arg Leu Asn Ala Thr His Cys Gln Tyr Asn Phe Pro195 200 205Gln Val Gly Arg Thr Ala Leu Arg Val Leu Gln Leu Val Ala Gly Phe210 215 220Leu Leu Pro Leu Leu Val Met Ala Tyr Cys Tyr Ala His Ile Leu Ala225 230 235 240Val Leu Leu Val Ser Arg Gly Gln Arg Arg Leu Arg Ala Lys Arg Leu245 250 255Val Val Val Val Val Val Ala Phe Ala Leu Cys Trp Thr Pro Tyr His260 265 270Leu Val Val Leu Val Asp Ile Leu Met Asp Leu Gly Ala Leu Ala Arg275 280 285Asn Cys Gly Arg Glu Ser Arg Val Asp Val Ala Lys Ser Val Thr Ser290 295 300Gly Leu Gly Tyr Met His Cys Cys Leu Asn Pro Leu Leu Tyr Ala Phe305 310 315 320Val Gly Val Lys Phe Arg Glu Arg Met Trp Met Leu Leu Leu Arg Leu325 330 335Gly Cys Pro Asn Gln Arg Gly Leu Gln Arg Gln Pro Ser Ser Ser Arg340 345 350Arg Asp Ser Ser Trp Ser Glu Thr Ser Glu Ala Ser Tyr Ser Gly Leu
355 360 365(176)SEQ ID NO175的资料(i)序列特征(A)长度1074个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO175的序列描述ATGGCTGATG ACTATGGCTC TGAATCCACA TCTTCCATGG AAGACTACGT TAACTTCAAC 60TTCACTGACT TCTACTGTGA GAAAAACAAT GTCAGGCAGT TTGCGAGCCA TTTCCTCCCA 120CCCTTGTACT GGCTCGTGTT CATCGTGGGT GCCTTGGGCA ACAGTCTTGT TATCCTTGTC 180TACTGGTACT GCACAAGAGT GAAGACCATG ACCGACATGT TCCTTTTGAA TTTGGCAATT 240GCTGACCTCC TCTTTCTTGT CACTCTTCCC TTCTGGGCCA TTGCTGCTGC TGACCAGTGG 300AAGTTCCAGA CCTTCATGTG CAAGGTGGTC AACAGCATGT ACAAGATGAA CTTCTACAGC 360TGTGTGTTGC TGATCATGTG CATCAGCGTG GACAGGTACA TTGCCATTGC CCAGGCCATG 420AGAGCACATA CTTGGAGGGA GAAAAGGCTT TTGTACAGCA AAATGGTTTG CTTTACCATC 480TGGGTATTGG CAGCTGCTCT CTGCATCCCA GAAATCTTAT ACAGCCAAAT CAAGGAGGAA 540TCCGGCATTG CTATCTGCAC CATGGTTTAC CCTAGCGATG AGAGCACCAA ACTGAAGTCA 600GCTGTCTTGA CCCTGAAGGT CATTCTGGGG TTCTTCCTTC CCTTCGTGGT CATGGCTTGC 660TGCTATACCA TCATCATTCA CACCCTGATA CAAGCCAAGA AGTCTTCCAA GCACAAAGCC 720AAGAAAGTGA CCATCACTGT CCTGACCGTC TTTGTCTTGT CTCAGTTTCC CTACAACTGC 780ATTTTGTTGG TGCAGACCAT TGACGCCTAT GCCATGTTCA TCTCCAACTG TGCCGTTTCC 840ACCAACATTG ACATCTGCTT CCAGGTCACC CAGACCATCG CCTTCTTCCA CAGTTGCCTG 900AACCCTGTTC TCTATGTTTT TGTGGGTGAG AGATTCCGCC GGGATCTCGT GAAAACCCTG 960AAGAACTTGG GTTGCATCAG CCAGGCCCAG TGGGTTTCAT TTACAAGGAG AGAGGGAAGC 1020TTGAAGCTGT CGTCTATGTT GCTGGAGACA ACCTCAGGAG CACTCTCCCT CTGA 1074(177)SEQ ID NO176的资料(i)序列特征(A)长度357个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO176的序列描述Met Ala Asp Asp Tyr Gly Ser Glu Ser Thr Ser Ser Met Glu Asp Tyr1 5 10 15Val Asn Phe Asn Phe Thr Asp Phe Tyr Cys Glu Lys Asn Asn Val Arg20 25 30Gln Phe Ala Ser His Phe Leu Pro Pro Leu Tyr Trp Leu Val Phe Ile35 40 45Val Gly Ala Leu Gly Asn Ser Leu Val Ile Leu Val Tyr Trp Tyr Cys50 55 60Thr Arg Val Lys Thr Met Thr Asp Met Phe Leu Leu Asn Leu Ala Ile65 70 75 80Ala Asp Leu Leu Phe Leu Val Thr Leu Pro Phe Trp Ala Ile Ala Ala85 90 95Ala Asp Gln Trp Lys Phe Gln Thr Phe Met Cys Lys Val Val Asn Ser100 105 110Met Tyr Lys Met Asn Phe Tyr Ser Cys Val Leu Leu Ile Met Cys Ile115 120 125Ser Val Asp Arg Tyr Ile Ala Ile Ala Gln Ala Met Arg Ala His Thr130 135 140Trp Arg Glu Lys Arg Leu Leu Tyr Ser Lys Met Val Cys Phe Thr Ile145 150 155 160Trp Val Leu Ala Ala Ala Leu Cys Ile Pro Glu Ile Leu Tyr Ser Gln165 170 175Ile Lys Glu Glu Ser Gly Ile Ala Ile Cys Thr Met Val Tyr Pro Ser180 185 190Asp Glu Ser Thr Lys Leu Lys Ser Ala Val Leu Thr Leu Lys Val Ile195 200 205Leu Gly Phe Phe Leu Pro Phe Val Val Met Ala Cys Cys Tyr Thr Ile210 215 220Ile Ile His Thr Leu Ile Gln Ala Lys Lys Ser Ser Lys His Lys Ala225 230 235 240Lys Lys Val Thr Ile Thr Val Leu Thr Val Phe Val Leu Ser Gln Phe245 250 255Pro Tyr Asn Cys Ile Leu Leu Val Gln Thr Ile Asp Ala Tyr Ala Met260 265 270Phe Ile Ser Asn Cys Ala Val Ser Thr Asn Ile Asp Ile Cys Phe Gln275 280 285Val Thr Gln Thr Ile Ala Phe Phe His Ser Cys Leu Asn Pro Val Leu290 295 300Tyr Val Phe Val Gly Glu Arg Phe Arg Arg Asp Leu Val Lys Thr Leu305 310 315 320Lys Asn Leu Gly Cys Ile Ser Gln Ala Gln Trp Val Ser Phe Thr Arg325 330 335Arg Glu Gly Ser Leu Lys Leu Ser Ser Met Leu Leu Glu Thr Thr Ser340 345 350Gly Ala Leu Ser Leu355(178)SEQ ID NO177的资料(i)序列特征(A)长度1110个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO177的序列描述ATGGCCTCAT CGACCACTCG GGGCCCCAGG GTTTCTGACT TATTTTCTGG GCTGCCGCCG 60GCGGTCACAA CTCCCGCCAA CCAGAGCGCA GAGGCCTCGG CGGGCAACGG GTCGGTGGCT 120GGCGCGGACG CTCCAGCCGT CACGCCCTTC CAGAGCCTGC AGCTGGTGCA TCAGCTGAAG 180GGGCTGATCG TGCTGCTCTA CAGCGTCGTG GTGGTCGTGG GGCTGGTGGG CAACTGCCTG 240CTGGTGCTGG TGATCGCGCG GGTGCCGCGG CTGCACAACG TGACGAACTT CCTCATCGGC 300AACCTGGCCT TGTCCGACGT GCTCATGTGC ACCGCCTGCG TGCCGCTCAC GCTGGCCTAT 360GCCTTCGAGC CACGCGGCTG GGTGTTCGGC GGCGGCCTGT GCCACCTGGT CTTCTTCCTG 420CAGCCGGTCA CCGTCTATGT GTCGGTGTTC ACGCTCACCA CCATCGCAGT GGACCGCTAC 480GTCGTGCTGG TGCACCCGCT GAGGCGCGCA TCTCGCTGCG CCTCAGCCTA CGCTGTGCTG 540GCCATCTGGG CGCTGTCCGC GGTGCTGGCG CTGCCGCCCG CCGTGCACAC CTATCACGTG 600GAGCTCAAGC CGCACGACGT GCGCCTCTGC GAGGAGTTCT GGGGCTCCCA GGAGCGCCAG 660CGCCAGCTCT ACGCCTGGGG GCTGCTGCTG GTCACCTACC TGCTCCCTCT GCTGGTCATC 720CTCCTGTCTT ACGTCCGGGT GTCAGTGAAG CTCCGCAACC GCGTGGTGCC GGGCTGCGTG 780ACCCAGAGCC AGGCCGACTG GGACCGCGCT CGGCGCCGGC GCACCAAATG CTTGCTGGTG 840GTGGTCGTGG TGGTGTTCGC CGTCTGCTGG CTGCCGCTGC ACGTCTTCAA CCTGCTGCGG 900GACCTCGACC CCCACGCCAT CGACCCTTAC GCCTTTGGGC TGGTGCAGCT GCTCTGCCAC 960TGGCTCGCCA TGAGTTCGGC CTGCTACAAC CCCTTCATCT ACGCCTGGCT GCACGACAGC 1020TTCCGCGAGG AGCTGCGCAA ACTGTTGGTC GCTTGGCCCC GCAAGATAGC CCCCCATGGC 1080CAGAATATGA CCGTCAGCGT GGTCATCTGA 1110(179)SEQ ID NO178的资料(i)序列特征(A)长度369个氨基酸(B)类型氨基酸
(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO178的序列描述Met Ala Ser Ser Thr Thr Arg Gly Pro Arg Val Ser Asp Leu Phe Ser1 5 10 15Gly Leu Pro Pro Ala Val Thr Thr Pro Ala Asn Gln Ser Ala Glu Ala20 25 30Ser Ala Gly Asn Gly Ser Val Ala Gly Ala Asp Ala Pro Ala Val Thr35 40 45Pro Phe Gln Ser Leu Gln Leu Val His Gln Leu Lys Gly Leu Ile Val50 55 60Leu Leu Tyr Ser Val Val Val Val Val Gly Leu Val Gly Asn Cys Leu65 70 75 80Leu Val Leu Val Ile Ala Arg Val Pro Arg Leu His Asn Val Thr Asn85 90 95Phe Leu Ile Gly Asn Leu Ala Leu Ser Asp Val Leu Met Cys Thr Ala100 105 110Cys Val Pro Leu Thr Leu Ala Tyr Ala Phe Glu Pro Arg Gly Trp Val115 120 125Phe Gly Gly Gly Leu Cys His Leu Val Phe Phe Leu Gln Pro Val Thr130 135 140Val Tyr Val Ser Val Phe Thr Leu Thr Thr Ile Ala Val Asp Arg Tyr145 150 155 160Val Val Leu Val His Pro Leu Arg Arg Ala Ser Arg Cys Ala Ser Ala165 170 175Tyr Ala Val Leu Ala Ile Trp Ala Leu Ser Ala Val Leu Ala Leu Pro180 185 190Pro Ala Val His Thr Tyr His Val Glu Leu Lys Pro His Asp Val Arg195 200 205Leu Cys Glu Glu Phe Trp Gly Ser Gln Glu Arg Gln Arg Gln Leu Tyr210 215 220Ala Trp Gly Leu Leu Leu Val Thr Tyr Leu Leu Pro Leu Leu Val Ile225 230 235 240Leu Leu Ser Tyr Val Arg Val Ser Val Lys Leu Arg Asn Arg Val Val
245 250 255Pro Gly Cys Val Thr Gln Ser Gln Ala Asp Trp Asp Arg Ala Arg Arg260 265 270Arg Arg Thr Lys Cys Leu Leu Val Val Val Val Val Val Phe Ala Val275 280 285Cys Trp Leu Pro Leu His Val Phe Asn Leu Leu Arg Asp Leu Asp Pro290 295 300His Ala Ile Asp Pro Tyr Ala Phe Gly Leu Val Gln Leu Leu Cys His305 310 315 320Trp Leu Ala Met Ser Ser Ala Cys Tyr Asn Pro Phe Ile Tyr Ala Trp325 330 335Leu His Asp Ser Phe Arg Glu Glu Leu Arg Lys Leu Leu Val Ala Trp340 345 350Pro Arg Lys Ile Ala Pro His Gly Gln Asn Met Thr Val Ser Val Val355 360 365(180)SEQ ID NO179的资料(i)序列特征(A)长度1083个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO179的序列描述ATGGACCCAG AAGAAACTTC AGTTTATTTG GATTATTACT ATGCTACGAG CCCAAACTCT 60GACATCAGGG AGACCCACTC CCATGTTCCT TACACCTCTG TCTTCCTTCC AGTCTTTTAC 120ACAGCTGTGT TCCTGACTGG AGTGCTGGGG AACCTTGTTC TCATGGGAGC GTTGCATTTC 180AAACCCGGCA GCCGAAGACT GATCGACATC TTTATCATCA ATCTGGCTGC CTCTGACTTC 240ATTTTTCTTG TCACATTGCC TCTCTGGGTG GATAAAGAAG CATCTCTAGG ACTGTGGAGG 300ACGGGCTCCT TCCTGTGCAA AGGGAGCTCC TACATGATCT CCGTCAATAT GCACTGCAGT 360GTCCTCCTGC TCACTTGCAT GAGTGTTGAC CGCTACCTGG CCATTGTGTG GCCAGTCGTA 420TCCAGGAAAT TCAGAAGGAC AGACTGTGCA TATGTAGTCT GTGCCAGCAT CTGGTTTATC 480TCCTGCCTGC TGGGGTTGCC TACTCTTCTG TCCAGGGAGC TCACGCTGAT TGATGATAAG 540CCATACTGTG CAGAGAAAAA GGCAACTCCA ATTAAACTCA TATGGTCCCT GGTGGCCTTA 600ATTTTCACCT TTTTTGTCCC TTTGTTGAGC ATTGTGACCT GCTACTGTTG CATTGCAAGG 660AAGCTGTGTG CCCATTACCA GCAATCAGGA AAGCACAACA AAAAGCTGAA GAAATCTAAG 720AAGATCATCT TTATTGTCGT GGCAGCCTTT CTTGTCTCCT GGCTGCCCTT CAATACTTTC 780AAGTTCCTGG CCATTGTCTC TGGGTTGCGG CAAGAACACT ATTTACCCTC AGCTATTCTT 840CAGCTTGGTA TGGAGGTGAG TGGACCCTTG GCATTTGCCA ACAGCTGTGT CAACCCTTTC 900ATTTACTATA TCTTCGACAG CTACATCCGC CGGGCCATTG TCCACTGCTT GTGCCCTTGC 960CTGAAAAACT ATGACTTTGG GAGTAGCACT GAGACATCAG ATAGTCACCT CACTAAGGCT 1020CTCTCCACCT TCATTCATGC AGAAGATTTT GCCAGGAGGA GGAAGAGGTC TGTGTCACTC 1080TAA 1083(181)SEQ ID NO180的资料(i)序列特征(A)长度360个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO180的序列描述Met Asp Pro Glu Glu Thr Ser Val Tyr Leu Asp Tyr Tyr Tyr Ala Thr1 5 10 15Ser Pro Asn Ser Asp Ile Arg Glu Thr His Ser His Val Pro Tyr Thr20 25 30Ser Val Phe Leu Pro Val Phe Tyr Thr Ala Val Phe Leu Thr Gly Val35 40 45Leu Gly Asn Leu Val Leu Met Gly Ala Leu His Phe Lys Pro Gly Ser50 55 60Arg Arg Leu Ile Asp Ile Phe Ile Ile Asn Leu Ala Ala Ser Asp Phe65 70 75 80Ile Phe Leu Val Thr Leu Pro Leu Trp Val Asp Lys Glu Ala Ser Leu85 90 95Gly Leu Trp Arg Thr Gly Ser Phe Leu Cys Lys Gly Ser Ser Tyr Met100 105 110Ile Ser Val Asn Met His Cys Ser Val Leu Leu Leu Thr Cys Met Ser115 120 125Val Asp Arg Tyr Leu Ala Ile Val Trp Pro Val Val Ser Arg Lys Phe130 135 140Arg Arg Thr Asp Cys Ala Tyr Val Val Cys Ala Ser Ile Trp Phe Ile145 150 155 160Ser Cys Leu Leu Gly Leu Pro Thr Leu Leu Ser Arg Glu Leu Thr Leu165 170 175Ile Asp Asp Lys Pro Tyr Cys Ala Glu Lys Lys Ala Thr Pro Ile Lys180 185 190Leu Ile Trp Ser Leu Val Ala Leu Ile Phe Thr Phe Phe Val Pro Leu195 200 205Leu Ser Ile Val Thr Cys Tyr Cys Cys Ile Ala Arg Lys Leu Cys Ala210 215 220His Tyr Gln Gln Ser Gly Lys His Asn Lys Lys Leu Lys Lys Ser Lys225 230 235 240Lys Ile Ile Phe Ile Val Val Ala Ala Phe Leu Val Ser Trp Leu Pro245 250 255Phe Asn Thr Phe Lys Phe Leu Ala Ile Val Ser Gly Leu Arg Gln Glu260 265 270His Tyr Leu Pro Ser Ala Ile Leu Gln Leu Gly Met Glu Val Ser Gly275 280 285Pro Leu Ala Phe Ala Asn Ser Cys Val Asn Pro Phe Ile Tyr Tyr Ile290 295 300Phe Asp Ser Tyr Ile Arg Arg Ala Ile Val His Cys Leu Cys Pro Cys305 310 315 320Leu Lys Asn Tyr Asp Phe Gly Ser Ser Thr Glu Thr Ser Asp Ser His325 330 335Leu Thr Lys Ala Leu Ser Thr Phe Ile His Ala Glu Asp Phe Ala Arg340 345 350Arg Arg Lys Arg Ser Val Ser Leu355 360(182)SEQ ID NO181的资料(i)序列特征(A)长度1020个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO181的序列描述ATGAATGGCC TTGAAGTGGC TCCCCCAGGT CTGATCACCA ACTTCTCCCT GGCCACGGCA 60GAGCAATGTG GCCAGGAGAC GCCACTGGAG AACATGCTGT TCGCCTCCTT CTACCTTCTG 120GATTTTATCC TGGCTTTAGT TGGCAATACC CTGGCTCTGT GGCTTTTCAT CCGAGACCAC 180AAGTCCGGGA CCCCGGCCAA CGTGTTCCTG ATGCATCTGG CCGTGGCCGA CTTGTCGTGC 240GTGCTGGTCC TGCCCACCCG CCTGGTCTAC CACTTCTCTG GGAACCACTG GCCATTTGGG 300GAAATCGCAT GCCGTCTCAC CGGCTTCCTC TTCTACCTCA ACATGTACGC CAGCATCTAC 360TTCCTCACCT GCATCAGCGC CGACCGTTTC CTGGCCATTG TGCACCCGGT CAAGTCCCTC 420AAGCTCCGCA GGCCCCTCTA CGCACACCTG GCCTGTGCCT TCCTGTGGGT GGTGGTGGCT 480GTGGCCATGG CCCCGCTGCT GGTGAGCCCA CAGACCGTGC AGACCAACCA CACGGTGGTC 540TGCCTGCAGC TGTACCGGGA GAAGGCCTCC CACCATGCCC TGGTGTCCCT GGCAGTGGCC 600TTCACCTTCC CGTTCATCAC CACGGTCACC TGCTACCTGC TGATCATCCG CAGCCTGCGG 660CAGGGCCTGC GTGTGGAGAA GCGCCTCAAG ACCAAGGCAA AACGCATGAT CGCCATAGTG 720CTGGCCATCT TCCTGGTCTG CTTCGTGCCC TACCACGTCA ACCGCTCCGT CTACGTGCTG 780CACTACCGCA GCCATGGGGC CTCCTGCGCC ACCCAGCGCA TCCTGGCCCT GGCAAACCGC 840ATCACCTCCT GCCTCACCAG CCTCAACGGG GCACTCGACC CCATCATGTA TTTCTTCGTG 900GCTGAGAAGT TCCGCCACGC CCTGTGCAAC TTGCTCTGTG GCAAAAGGCT CAAGGGCCCG 960CCCCCCAGCT TCGAAGGGAA AACCAACGAG AGCTCGCTGA GTGCCAAGTC AGAGCTGTGA 1020(183)SEQ ID NO182的资料(i)序列特征(A)长度339个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO182的序列描述Met Asn Gly Leu Glu Val Ala Pro Pro Gly Leu Ile Thr Asn Phe Ser1 5 10 15Leu Ala Thr Ala Glu Gln Cys Gly Gln Glu Thr Pro Leu Glu Asn Met20 25 30Leu Phe Ala Ser Phe Tyr Leu Leu Asp Phe Ile Leu Ala Leu Val Gly35 40 45Asn Thr Leu Ala Leu Trp Leu Phe Ile Arg Asp His Lys Ser Gly Thr50 55 60Pro Ala Asn Val Phe Leu Met His Leu Ala Val Ala Asp Leu Ser Cys65 70 75 80Val Leu Val Leu Pro Thr Arg Leu Val Tyr His Phe Ser Gly Asn His85 90 95Trp Pro Phe Gly Glu Ile Ala Cys Arg Leu Thr Gly Phe Leu Phe Tyr100 105 110Leu Asn Met Tyr Ala Ser Ile Tyr Phe Leu Thr Cys Ile Ser Ala Asp115 120 125Arg Phe Leu Ala Ile Val His Pro Val Lys Ser Leu Lys Leu Arg Arg130 135 140Pro Leu Tyr Ala His Leu Ala Cys Ala Phe Leu Trp Val Val Val Ala145 150 l55 160Val Ala Met Ala Pro Leu Leu Val Ser Pro Gln Thr Val Gln Thr Asn165 170 175His Thr Val Val Cys Leu Gln Leu Tyr Arg Glu Lys Ala Ser His His180 185 190Ala Leu Val Ser Leu Ala Val Ala Phe Thr Phe Pro Phe Ile Thr Thr195 200 205Val Thr Cys Tyr Leu Leu Ile Ile Arg Ser Leu Arg Gln Gly Leu Arg210 215 220Val Glu Lys Arg Leu Lys Thr Lys Ala Lys Arg Met Ile Ala Ile Val225 230 235 240Leu Ala Ile Phe Leu Val Cys Phe Val Pro Tyr His Val Asn Arg Ser245 250 255Val Tyr Val Leu His Tyr Arg Ser His Gly Ala Ser Cys Ala Thr Gln260 265 270Arg Ile Leu Ala Leu Ala Asn Arg Ile Thr Ser Cys Leu Thr Ser Leu275 280 285Asn Gly Ala Leu Asp Pro Ile Met Tyr Phe Phe Val Ala Glu Lys Phe290 295 300Arg His Ala Leu Cys Asn Leu Leu Cys Gly Lys Arg Leu Lys Gly Pro305 310 315 320Pro Pro Ser Phe Glu Gly Lys Thr Asn Glu Ser Ser Leu Ser Ala Lys325 330 335Ser Glu Leu(184)SEQ ID NO183的资料(i)序列特征(A)长度996个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO183的序列描述ATGATCACCC TGAACAATCA AGATCAACCT GTCCCTTTTA ACAGCTCACA TCCAGATGAA 60TACAAAATTG CAGCCCTTGT CTTCTATAGC TGTATCTTCA TAATTGGATT ATTTGTTAAC 120ATCACTGCAT TATGGGTTTT CAGTTGTACC ACCAAGAAGA GAACCACGGT AACCATCTAT 180ATGATGAATG TGGCATTAGT GGACTTGATA TTTATAATGA CTTTACCCTT TCGAATGTTT 240TATTATGCAA AAGATGAATG GCCATTTGGA GAGTACTTCT GCCAGATTCT TGGAGCTCTC 300ACAGTGTTTT ACCCAAGCAT TGCTTTATGG CTTCTTGCCT TTATTAGTGC TGACAGATAC 360ATGGCCATTG TACAGCCGAA GTACGCCAAA GAACTTAAAA ACACGTGCAA AGCCGTGCTG 420GCGTGTGTGG GAGTCTGGAT AATGACCCTG ACCACGACCA CCCCTCTGCT ACTGCTCTAT 480AAAGACCCAG ATAAAGACTC CACTCCCGCC ACCTGCCTCA AGATTTCTGA CATCATCTAT 540CTAAAAGCTG TGAACGTGCT GAACCTCACT CGACTGACAT TTTTTTTCTT GATTCCTTTG 600TTCATCATGA TTGGGTGCTA CTTGGTCATT ATTCATAATC TCCTTCACGG CAGGACGTCT 660AAGCTGAAAC CCAAAGTCAA GGAGAAGTCC AAAAGGATCA TCATCACGCT GCTGGTGCAG 720GTGCTCGTCT GCTTTATGCC CTTCCACATC TGTTTCGCTT TCCTGATGCT GGGAACGGGG 780GAGAATAGTT ACAATCCCTG GGGAGCCTTT ACCACCTTCC TCATGAACCT CAGCACGTGT 840CTGGATGTGA TTCTCTACTA CATCGTTTCA AAACAATTTC AGGCTCGAGT CATTAGTGTC 900ATGCTATACC GTAATTACCT TCGAAGCATG CGCAGAAAAA GTTTCCGATC TGGTAGTCTA 960AGGTCACTAA GCAATATAAA CAGTGAAATG TTATGA 996(185)SEQ ID NO184的资料(i)序列特征(A)长度331个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO184的序列描述Met Ile Thr Leu Asn Asn Gln Asp Gln Pro Val Pro Phe Asn Ser Ser1 5 10 15His Pro Asp Glu Tyr Lys Ile Ala Ala Leu Val Phe Tyr Ser Cys Ile20 25 30Phe Ile Ile Gly Leu Phe Val Asn Ile Thr Ala Leu Trp Val Phe Ser35 40 45Cys Thr Thr Lys Lys Arg Thr Thr Val Thr Ile Tyr Met Met Asn Val50 55 60Ala Leu Val Asp Leu Ile Phe Ile Met Thr Leu Pro Phe Arg Met Phe65 70 75 80Tyr Tyr Ala Lys Asp Glu Trp Pro Phe Gly Glu Tyr Phe Cys Gln Ile85 90 95Leu Gly Ala Leu Thr Val Phe Tyr Pro Ser Ile Ala Leu Trp Leu Leu100 105 110Ala Phe Ile Ser Ala Asp Arg Tyr Met Ala Ile Val Gln Pro Lys Tyr115 120 125Ala Lys Glu Leu Lys Asn Thr Cys Lys Ala Val Leu Ala Cys Val Gly130 135 140Val Trp Ile Met Thr Leu Thr Thr Thr Thr Pro Leu Leu Leu Leu Tyr145 150 155 160Lys Asp Pro Asp Lys Asp Ser Thr Pro Ala Thr Cys Leu Lys Ile Ser165 170 175Asp Ile Ile Tyr Leu Lys Ala Val Asn Val Leu Asn Leu Thr Arg Leu180 185 190Thr Phe Phe Phe Leu Ile Pro Leu Phe Ile Met Ile Gly Cys Tyr Leu195 200 205Val Ile Ile His Asn Leu Leu His Gly Arg Thr Ser Lys Leu Lys Pro210 215 220Lys Val Lys Glu Lys Ser Lys Arg Ile Ile Ile Thr Leu Leu Val Gln225 230 235 240Val Leu Val Cys Phe Met Pro Phe His Ile Cys Phe Ala Phe Leu Met245 250 255Leu Gly Thr Gly Glu Asn Ser Tyr Asn Pro Trp Gly Ala Phe Thr Thr260 265 270Phe Leu Met Asn Leu Ser Thr Cys Leu Asp Val Ile Leu Tyr Tyr Ile275 280 285Val Ser Lys Gln Phe Gln Ala Arg Val Ile Ser Val Met Leu Tyr Arg290 295 300Asn Tyr Leu Arg Ser Met Arg Arg Lys Ser Phe Arg Ser Gly Ser Leu305 310 315 320Arg Ser Leu Ser Asn Ile Asn Ser Glu Met Leu325 330(186)SEQ ID NO185的资料(i)序列特征(A)长度1077个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO185的序列描述ATGCCCTCTG TGTCTCCAGC GGGGCCCTCG GCCGGGGCAG TCCCCAATGC CACCGCAGTG 60ACAACAGTGC GGACCAATGC CAGCGGGCTG GAGGTGCCCC TGTTCCACCT GTTTGCCCGG 120CTGGACGAGG AGCTGCATGG CACCTTCCCA GGCCTGTGCG TGGCGCTGAT GGCGGTGCAC 180GGAGCCATCT TCCTGGCAGG GCTGGTGCTC AACGGGCTGG CGCTGTACGT CTTCTGCTGC 240CGCACCCGGG CCAAGACACC CTCAGTCATC TACACCATCA ACCTGGTGGT GACCGATCTA 300CTGGTAGGGC TGTCCCTGCC CACGCGCTTC GCTGTGTACT ACGGCGCCAG GGGCTGCCTG 360CGCTGTGCCT TCCCGCACGT CCTCGGTTAC TTCCTCAACA TGCACTGCTC CATCCTCTTC 420CTCACCTGCA TCTGCGTGGA CCGCTACCTG GCCATCGTGC GGCCCGAAGG CTCCCGCCGC 480TGCCGCCAGC CTGCCTGTGC CAGGGCCGTG TGCGCCTTCG TGTGGCTGGC CGCCGGTGCC 540GTCACCCTGT CGGTGCTGGG CGTGACAGGC AGCCGGCCCT GCTGCCGTGT CTTTGCGCTG 600ACTGTCCTGG AGTTCCTGCT GCCCCTGCTG GTCATCAGCG TGTTTACCGG CCGCATCATG 660TGTGCACTGT CGCGGCCGGG TCTGCTCCAC CAGGGTCGCC AGCGCCGCGT GCGGGCCAAG 720CAGCTCCTGC TCACGGTGCT CATCATCTTT CTCGTCTGCT TCACGCCCTT CCACGCCCGC 780CAAGTGGCCG TGGCGCTGTG GCCCGACATG CCACACCACA CGAGCCTCGT GGTCTACCAC 840GTGGCCGTGA CCCTCAGCAG CCTCAACAGC TGCATGGACC CCATCGTCTA CTGCTTCGTC 900ACCAGTGGCT TCCAGGCCAC CGTCCGAGGC CTCTTCGGCC AGCACGGAGA GCGTGAGCCC 960AGCAGCGGTG ACGTGGTCAG CATGCACAGG AGCTCCAAGG GCTCAGGCCG TCATCACATC 1020CTCAGTGCCG GCCCTCACGC CCTCACCCAG GCCCTGGCTA ATGGGCCCGA GGCTTAG1077(187)SEQ ID NO186的资料(i)序列特征(A)长度358个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO186的序列描述Met Pro Ser Val Ser Pro Ala Gly Pro Ser Ala Gly Ala Val Pro Asn1 5 10 15Ala Thr Ala Val Thr Thr Val Arg Thr Asn Ala Ser Gly Leu Glu Val20 25 30Pro Leu Phe His Leu Phe Ala Arg Leu Asp Glu Glu Leu His Gly Thr35 40 45Phe Pro Gly Leu Cys Val Ala Leu Met Ala Val His Gly Ala Ile Phe50 55 60Leu Ala Gly Leu Val Leu Asn Gly Leu Ala Leu Tyr Val Phe Cys Cys65 70 75 80Arg Thr Arg Ala Lys Thr Pro Ser Val Ile Tyr Thr Ile Asn Leu Val85 90 95Val Thr Asp Leu Leu Val Gly Leu Ser Leu Pro Thr Arg Phe Ala Val100 105 110Tyr Tyr Gly Ala Arg Gly Cys Leu Arg Cys Ala Phe Pro His Val Leu
115 120 125Gly Tyr Phe Leu Asn Met His Cys Ser Ile Leu Phe Leu Thr Cys Ile130 135 140Cys Val Asp Arg Tyr Leu Ala Ile Val Arg Pro Glu Gly Ser Arg Ala145 150 155 160Cys Arg Gln Pro Ala Cys Ala Arg Ala Val Cys Ala Phe Val Trp Leu165 170 175Ala Ala Gly Ala Val Thr Leu Ser Val Leu Gly Val Thr Gly Ser Arg180 185 190Pro Cys Cys Arg Val Phe Ala Leu Thr Val Leu Glu Phe Leu Leu Pro195 200 205Leu Leu Val Ile Ser Val Phe Thr Gly Arg Ile Met Cys Ala Leu Ser210 215 220Arg Pro Gly Leu Leu His Gln Gly Arg Gln Arg Arg Val Arg Ala Lys225 230 235 240Gln Leu Leu Leu Thr Val Leu Ile Ile Phe Leu Val Cys Phe Thr Pro245 250 255Phe His Ala Arg Gln Val Ala Val Ala Leu Trp Pro Asp Met Pro His260 265 270His Thr Ser Leu Val Val Tyr His Val Ala Val Thr Leu Ser Ser Leu275 280 285Asn Ser Cys Met Asp Pro Ile Val Tyr Cys Phe Val Thr Ser Gly Phe290 295 300Gln Ala Thr Val Arg Gly Leu Phe Gly Gln His Gly Glu Arg Glu Pro305 310 315 320Ser Ser Gly Asp Val Val Ser Met His Arg Ser Ser Lys Gly Ser Gly325 330 335Arg His His Ile Leu Ser Ala Gly Pro His Ala Leu Thr Gln Ala Leu340 345 350Ala Asn Gly Pro Glu Ala355(188)SEQ ID NO187的资料(i)序列特征(A)长度1050个碱基对
(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO187的序列描述ATGAACTCCA CCTTGGATGG TAATCAGAGC AGCCACCCTT TTTGCCTCTT GGCATTTGGC 60TATTTGGAAA CTGTCAATTT TTGCCTTTTG GAAGTATTGA TTATTGTCTT TCTAACTGTA 120TTGATTATTT CTGGCAACAT CATTGTGATT TTTGTATTTC ACTGTGCACC TTTGTTGAAC 180CATCACACTA CAAGTTATTT TATCCAGACT ATGGCATATG CTGACCTTTT TGTTGGGGTG 240AGCTGCGTGG TCCCTTCTTT ATCACTCCTC CATCACCCCC TTCCAGTAGA GGAGTCCTTG 300ACTTGCCAGA TATTTGGTTT TGTAGTATCA GTTCTGAAGA GCGTCTCCAT GGCTTCTCTG 360GCCTGTATCA GCATTGATAG ATACATTGCC ATTACTAAAC CTTTAACCTA TAATACTCTG 420GTTACACCCT GGAGACTACG CCTGTGTATT TTCCTGATTT GGCTATACTC GACCCTGGTC 480TTCCTGCCTT CCTTTTTCCA CTGGGGCAAA CCTGGATATC ATGGAGATGT GTTTCAGTGG 540TGTGCGGAGT CCTGGCACAC CGACTCCTAC TTCACCCTGT TCATCGTGAT GATGTTATAT 600GCCCCAGCAG CCCTTATTGT CTGCTTCACC TATTTCAACA TCTTCCGCAT CTGCCAACAG 660CACACAAAGG ATATCAGCGA AAGGCAAGCC CGCTTCAGCA GCCAGAGTGG GGAGACTGGG 720GAAGTGCAGG CCTGTCCTGA TAAGCGCTAT AAAATGGTCC TGTTTCGAAT CACTAGTGTA 780TTTTACATCC TCTGGTTGCC ATATATCATC TACTTCTTGT TGGAAAGCTC CACTGGCCAC 840AGCAACCGCT TCGCATCCTT CTTGACCACC TGGCTTGCTA TTAGTAACAG TTTCTGCAAC 900TGTGTAATTT ATAGTCTCTC CAACAGTGTA TTCCAAAGAG GACTAAAGCG CCTCTCAGGG 960GCTATGTGTA CTTCTTGTGC AAGTCAGACT ACAGCCAACG ACCCTTACAC AGTTAGAAGC 1020AAAGGCCCTC TTAATGGATG TCATATCTGA 1050(189)SEQ ID NO188的资料(i)序列特征(A)长度349个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO188的序列描述Met Asn Ser Thr Leu Asp Gly Asn Gln Ser Ser Hi s Pro Phe Cys Leu1 5 10 15Leu Ala Phe Gly Tyr Leu Glu Thr Val Asn Phe Cys Leu Leu Glu Val20 25 30Leu Ile Ile Val Phe Leu Thr Val Leu Ile Ile Ser Gly Asn Ile Ile35 40 45Val Ile Phe Val Phe His Cys Ala Pro Leu Leu Asn His His Thr Thr50 55 60Ser Tyr Phe Ile Gln Thr Met Ala Tyr Ala Asp Leu Phe Val Gly Val65 70 75 80Ser Cys Val Val Pro Ser Leu Ser Leu Leu His His Pro Leu Pro Val85 90 95Glu Glu Ser Leu Thr Cys Gln Ile Phe Gly Phe Val Val Ser Val Leu100 105 110Lys Ser Val Ser Met Ala Ser Leu Ala Cys Ile Ser Ile Asp Arg Tyr115 120 125Ile Ala Ile Thr Lys Pro Leu Thr Tyr Asn Thr Leu Val Thr Pro Trp130 135 140Arg Leu Arg Leu Cys Ile Phe Leu Ile Trp Leu Tyr Ser Thr Leu Val145 150 155 160Phe Leu Pro Ser Phe Phe His Trp Gly Lys Pro Gly Tyr His Gly Asp165 170 175Val Phe Gln Trp Cys Ala Glu Ser Trp His Thr Asp Ser Tyr Phe Thr180 185 190Leu Phe Ile Val Met Met Leu Tyr Ala Pro Ala Ala Leu Ile Val Cys195 200 205Phe Thr Tyr Phe Asn Ile Phe Arg Ile Cys Gln Gln His Thr Lys Asp210 215 220Ile Ser Glu Arg Gln Ala Arg Phe Ser Ser Gln Ser Gly Glu Thr Gly225 230 235 240Glu Val Gln Ala Cys Pro Asp Lys Arg Tyr Lys Met Val Leu Phe Arg245 250 255Ile Thr Ser Val Phe Tyr Ile Leu Trp Leu Pro Tyr Ile Ile Tyr Phe260 265 270Leu Leu Glu Ser Ser Thr Gly His Ser Asn Arg Phe Ala Ser Phe Leu275 280 285Thr Thr Trp Leu Ala Ile Ser Asn Ser Phe Cys Asn Cys Val Ile Tyr290 295 300Ser Leu Ser Asn Ser Val Phe Gln Arg Gly Leu Lys Arg Leu Ser Gly305 310 315 320Ala Met Cys Thr Ser Cys Ala Ser Gln Thr Thr Ala Asn Asp Pro Tyr325 330 335Thr Val Arg Ser Lys Gly Pro Leu Asn Gly Cys His Ile
340 345(190)SEQ ID NO189的资料(i)序列特征(A)长度1302个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO189的序列描述ATGTGTTTTT CTCCCATTCT GGAAATCAAC ATGCAGTCTG AATCTAACAT TACAGTGCGA 60GATGACATTG ATGACATCAA CACCAATATG TACCAACCAC TATCATATCC GTTAAGCTTT 120CAAGTGTCTC TCACCGGATT TCTTATGTTA GAAATTGTGT TGGGACTTGG CAGCAACCTC 180ACTGTATTGG TACTTTACTG CATGAAATCC AACTTAATCA ACTCTGTCAG TAACATTATT 240ACAATGAATC TTCATGTACT TGATGTAATA ATTTGTGTGG GATGTATTCC TCTAACTATA 300GTTATCCTTC TGCTTTCACT GGAGAGTAAC ACTGCTCTCA TTTGCTGTTT CCATGAGGCT 360TGTGTATCTT TTGCAAGTGT CTCAACAGCA ATCAACGTTT TTGCTATCAC TTTGGACAGA 420TATGACATCT CTGTAAAACC TGCAAACCGA ATTCTGACAA TGGGCAGAGC TGTAATGTTA 480ATGATATCCA TTTGGATTTT TTCTTTTTTC TCTTTCCTGA TTCCTTTTAT TGAGGTAAAT 540TTTTTCAGTC TTCAAAGTGG AAATACCTGG GAAAACAAGA CACTTTTATG TGTCAGTACA 600AATGAATACT ACACTGAACT GGGAATGTAT TATCACCTGT TAGTACAGAT CCCAATATTC 660TTTTTCACTG TTGTAGTAAT GTTAATCACA TACACCAAAA TACTTCAGGC TCTTAATATT 720CGAATAGGCA CAAGATTTTC AACAGGGCAG AAGAAGAAAG CAAGAAAGAA AAAGACAATT 780TCTCTAACCA CACAACATGA GGCTACAGAC ATGTCACAAA GCAGTGGTGG GAGAAATGTA 840GTCTTTGGTG TAAGAACTTC AGTTTCTGTA ATAATTGCCC TCCGGCGAGC TGTGAAACGA 900CACCGTGAAC GACGAGAAAG ACAAAAGAGA GTCAAGAGGA TGTCTTTATT GATTATTTCT 960ACATTTCTTC TCTGCTGGAC ACCAATTTCT GTTTTAAATA CCACCATTTT ATGTTTAGGC 1020CCAAGTGACC TTTTAGTAAA ATTAAGATTG TGTTTTTTAG TCATGGCTTA TGGAACAACT 1080ATATTTCACC CTCTATTATA TGCATTCACT AGACAAAAAT TTCAAAAGGT CTTGAAAAGT 1140AAAATGAAAA AGCGAGTTGT TTCTATAGTA GAAGCTGATC CCCTGCCTAA TAATGCTGTA 1200ATACACAACT CTTGGATAGA TCCCAAAAGA AACAAAAAAA TTACCTTTGA AGATAGTGAA 1260ATAAGAGAAA AACGTTTAGT GCCTCAGGTT GTCACAGACT AG1302(191)SEQ ID NO190的资料(i)序列特征(A)长度433个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO190的序列描述Met Cys Phe Ser Pro Ile Leu Glu Ile Asn Met Gln Ser Glu Ser Asn1 5 10 15Ile Thr Val Vrg Asp Asp Ile Asp Asp Ile Asn Thr Asn Met Tyr Gln20 25 30Pro Leu Ser Tyr Pro Leu Ser Phe Gln Val Ser Leu Thr Gly Phe Leu35 40 45Met Leu Glu Ile Val Leu Gly Leu Gly Ser Asn Leu Thr Val Leu Val50 55 60Leu Tyr Cys Met Lys Ser Asn Leu Ile Asn Ser Val Ser Asn Ile Ile65 70 75 80Thr Met Asn Leu His Val Leu Asp Val Ile Ile Cys Val Gly Cys Ile85 90 95Pro Leu Thr Ile Val Ile Leu Leu Leu Ser Leu Glu Ser Asn Thr Ala100 l05 110Leu Ile Cys Cys Phe His Glu Ala Cys Val Ser Phe Ala Ser Val Ser115 120 125Thr Ala Ile Asn Val Phe Ala Ile Thr Leu Asp Arg Tyr Asp Ile Ser130 135 140Val Lys Pro Ala Asn Arg Ile Leu Thr Met Gly Arg Ala Val Met Leu145 150 155 160Met Ile Ser Ile Trp Ile Phe Ser Phe Phe Ser Phe Leu Ile Pro Phe165 170 175Ile Glu Val Asn Phe Phe Ser Leu Gln Ser Gly Asn Thr Trp Glu Asn180 185 190Lys Thr Leu Leu Cys Val Ser Thr Asn Glu Tyr Tyr Thr Glu Leu Gly195 200 205Met Tyr Tyr His Leu Leu Val Gln Ile Pro Ile Phe Phe Phe Thr Val210 215 220Val Val Met Leu Ile Thr Tyr Thr Lys Ile Leu Gln Ala Leu Asn Ile225 230 235 240Arg Ile Gly Thr Arg Phe Ser Thr Gly Gln Lys Lys Lys Ala Arg Lys245 250 255Lys Lys Thr Ile Ser Leu Thr Thr Gln His Glu Ala Thr Asp Met Ser260 265 270Gln Ser Ser Gly Gly Arg Asn Val Val Phe Gly Val Arg Thr Ser Val275 280 285Ser Val Ile Ile Ala Leu Arg Arg Ala Val Lys Arg His Arg Glu Arg290 295 300Arg Glu Arg Gln Lys Arg Val Lys Arg Met Ser Leu Leu Ile Ile Ser305 310 315 320Thr Phe Leu Leu Cys Trp Thr Pro Ile Ser Val Leu Asn Thr Thr Ile325 330 335Leu Cys Leu Gly Pro Ser Asp Leu Leu Val Lys Leu Arg Leu Cys Phe340 345 350Leu Val Met Ala Tyr Gly Thr Thr Ile Phe His Pro Leu Leu Tyr Ala355 360 365Phe Thr Arg Gln Lys Phe Gln Lys Val Leu Lys Ser Lys Met Lys Lys370 375 380Arg Val Val Ser Ile Val Glu Ala Asp Pro Leu Pro Asn Asn Ala Val385 390 395 400Ile His Asn Ser Trp Ile Asp Pro Lys Arg Asn Lys Lys Ile Thr Phe405 410 415Glu Asp Ser Glu Ile Arg Glu Lys Arg Leu Val Pro Gln Val Val Thr420 425 430Asp(192)SEQ ID NO191的资料(i)序列特征(A)长度1209个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO191的序列描述ATGTTGTGTC CTTCCAAGAC AGATGGCTCA GGGCACTCTG GTAGGATTCA CCAGGAAACT 60CATGGAGAAG GGAAAAGGGA CAAGATTAGC AACAGTGAAG GGAGGGAGAA TGGTGGGAGA 120GGATTCCAGA TGAACGGTGG GTCGCTGGAG GCTGAGCATG CCAGCAGGAT GTCAGTTCTC 180AGAGCAAAGC CCATGTCAAA CAGCCAACGC TTGCTCCTTC TGTCCCCAGG ATCACCTCCT 240CGCACGGGGA GCATCTCCTA CATCAACATC ATCATGCCTT CGGTGTTCGG CACCATCTGC 300CTCCTGGGCA TCATCGGGAA CTCCACGGTC ATCTTCGCGG TCGTGAAGAA GTCCAAGCTG 360CACTGGTGCA ACAACGTCCC CGACATCTTC ATCATCAACC TCTCGGTAGT AGATCTCCTC 420TTTCTCCTGG GCATGCCCTT CATGATCCAC CAGCTCATGG GCAATGGGGT GTGGCACTTT 480GGGGAGACCA TGTGCACCCT CATCACGGCC ATGGATGCCA ATAGTCAGTT CACCAGCACC 540TACATCCTGA CCGCCATGGC CATTGACCGC TACCTGGCCA CTGTCCACCC CATCTCTTCC 600ACGAAGTTCC GGAAGCCCTC TGTGGCCACC CTGGTGATCT GCCTCCTGTG GGCCCTCTCC 660TTCATCAGCA TCACCCCTGT GTGGCTGTAT GCCAGACTCA TCCCCTTCCC AGGAGGTGCA 720GTGGGCTGCG GCATACGCCT GCCCAACCCA GACACTGACC TCTACTGGTT CACCCTGTAC 780CAGTTTTTCC TGGCCTTTGC CCTGCCTTTT GTGGTCATCA CAGCCGCATA CGTGAGGATC 840CTGCAGCGCA TGACGTCCTC AGTGGCCCCC GCCTCCCAGC GCAGCATCCG GCTGCGGACA 900AAGAGGGTGA AACGCACAGC CATCGCCATC TGTCTGGTCT TCTTTGTGTG CTGGGCACCC 960TACTATGTGC TACAGCTGAC CCAGTTGTCC ATCAGCCGCC CGACCCTCAC CTTTGTCTAC 1020TTATACAATG CGGCCATCAG CTTGGGCTAT GCCAACAGCT GCCTCAACCC CTTTGTGTAC 1080ATCGTGCTCT GTGAGACGTT CCGCAAACGC TTGGTCCTGT CGGTGAAGCC TGCAGCCCAG 1140GGGCAGCTTC GCGCTGTCAG CAACGCTCAG ACGGCTGACG AGGAGAGGAC AGAAAGCAAA 1200GGCACCTGA 1209(193)SEQ ID NO192的资料(i)序列特征(A)长度402个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO192的序列描述Met Leu Cys Pro Ser Lys Thr Asp Gly Ser Gly His Ser Gly Arg Ile1 5 10 15His Gln Glu Thr His Gly Glu Gly Lys Arg Asp Lys Ile Ser Asn Ser20 25 30Glu Gly Arg Glu Asn Gly Gly Arg Gly Phe Gln Met Asn Gly Gly Ser35 40 45Leu Glu Ala Glu His Ala Ser Arg Met Ser Val Leu Arg Ala Lys Pro50 55 60Met Ser Asn Ser Gln Arg Leu Leu Leu Leu Ser Pro Gly Ser Pro Pro65 70 75 80Arg Thr Gly Ser Ile Ser Tyr Ile Asn Ile Ile Met Pro Ser Val Phe85 90 95Gly Thr Ile Cys Leu Leu Gly Ile Ile Gly Asn Ser Thr Val Ile Phe100 105 110Ala Val Val Lys Lys Ser Lys Leu His Trp Cys Asn Asn Val Pro Asp115 120 125Ile Phe Ile Ile Asn Leu Ser Val Val Asp Leu Leu Phe Leu Leu Gly130 135 140Met Pro Phe Met Ile His Gln Leu Met Gly Asn Gly Val Trp His Phe145 150 155 160Gly Glu Thr Met Cys Thr Leu Ile Thr Ala Met Asp Ala Asn Ser Gln165 170 175Phe Thr Ser Thr Tyr lle Leu Thr Ala Met Ala Ile Asp Arg Tyr Leu180 185 190Ala Thr Val His Pro Ile Ser Ser Thr Lys Phe Arg Lys Pro Ser Val195 200 205Ala Thr Leu Val Ile Cys Leu Leu Trp Ala Leu Ser Phe Ile Ser Ile210 215 220Thr Pro Val Trp Leu Tyr Ala Arg Leu Ile Pro Phe Pro Gly Gly Ala225 230 235 240Val Gly Cys Gly Ile Arg Leu Pro Asn Pro Asp Thr Asp Leu Tyr Trp245 250 255Phe Thr Leu Tyr Gln Phe Phe Leu Ala Phe Ala Leu Pro Phe Val Val260 265 270Ile Thr Ala Ala Tyr Val Arg Ile Leu Gln Arg Met Thr Ser Ser Val275 280 285Ala Pro Ala Ser Gln Arg Ser Ile Arg Leu Arg Thr Lys Arg Val Lys290 295 300Arg Thr Ala Ile Ala Ile Cys Leu Val Phe Phe Val Cys Trp Ala Pro305 310 315 320Tyr Tyr Val Leu Gln Leu Thr Gln Leu Ser Ile Ser Arg Pro Thr Leu325 330 335Thr Phe Val Tyr Leu Tyr Asn Ala Ala Ile Ser Leu Gly Tyr Ala Asn340 345 350Ser Cys Leu Asn Pro Phe Val Tyr Ile Val Leu Cys Glu Thr Phe Arg355 360 365Lys Arg Leu Val Leu Ser Val Lys Pro Ala Ala Gln Gly Gln Leu Arg370 375 380Ala Val Ser Asn Ala Gln Thr Ala Asp Glu Glu Arg Thr Glu Ser Lys385 390 395 400Gly Thr(194)SEQ ID NO193的资料(i)序列特征
(A)长度1128个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO193的序列描述ATGGATGTGA CTTCCCAAGC CCGGGGCGTG GGCCTGGAGA TGTACCCAGG CACCGCGCAC 60GCTGCGGCCC CCAACACCAC CTCCCCCGAG CTCAACCTGT CCCACCCGCT CCTGGGCACC 120GCCCTGGCCA ATGGGACAGG TGAGCTCTCG GAGCACCAGC AGTACGTGAT CGGCCTGTTC 180CTCTCGTGCC TCTACACCAT CTTCCTCTTC CCCATCGGCT TTGTGGGCAA CATCCTGATC 240CTGGTGGTGA ACATCAGCTT CCGCGAGAAG ATGACCATCC CCGACCTGTA CTTCATCAAC 300CTGGCGGTGG CGGACCTCAT CCTGGTGGCC GACTCCCTCA TTGAGGTGTT CAACCTGCAC 360GAGCGGTACT ACGACATCGC CGTCCTGTGC ACCTTCATGT CGCTCTTCCT GCAGGTCAAC 420ATGTACAGCA GCGTCTTCTT CCTCACCTGG ATGAGCTTCG ACCGCTACAT CGCCCTGGCC 480AGGGCCATGC GCTGCAGCCT GTTCCGCACC AAGCACCACG CCCGGCTGAG CTGTGGCCTC 540ATCTGGATGG CATCCGTGTC AGCCACGCTG GTGCCCTTCA CCGCCGTGCA CCTGCAGCAC 600ACCGACGAGG CCTGCTTCTG TTTCGCGGAT GTCCGGGAGG TGCAGTGGCT CGAGGTCACG 660CTGGGCTTCA TCGTGCCCTT CGCCATCATC GGCCTGTGCT ACTCCCTCAT TGTCCGGGTG 720CTGGTCAGGG CGCACCGGCA CCGTGGGCTG CGGCCCCGGC GGCAGAAGGC GAAACGCATG 780ATCCTCGCGG TGGTGCTGGT CTTCTTCGTC TGCTGGCTGC CGGAGAACGT CTTCATCAGC 840GTGCACCTCC TGCAGCGGAC GCAGCCTGGG GCCGCTCCCT GCAAGCAGTC TTTCCGCCAT 900GCCCACCCCC TCACGGGCCA CATTGTCAAC CTCGCCGCCT TCTCCAACAG CTGCCTAAAC 960CCCCTCATCT ACAGCTTTCT CGGGGAGACC TTCAGGGACA AGCTGAGGCT GTACATTGAG 1020CAGAAAACAA ATTTGCCGGC CCTGAACCGC TTCTGTCACG CTGCCCTGAA GGCCGTCATT 1080CCAGACAGCA CCGAGCAGTC GGATGTGAGG TTCAGCAGTG CCGTGTGA 1128(195)SEQ ID NO194的资料(i)序列特征(A)长度375个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO194的序列描述Met Asp Val Thr Ser Gln Ala Arg Gly Val Gly Leu Glu Met Tyr Pro1 5 10 15Gly Thr Ala His Ala Ala Ala Pro Asn Thr Thr Ser Pro Glu Leu Asn20 25 30Leu Ser His Pro Leu Leu Gly Thr Ala Leu Ala Asn Gly Thr Gly Glu35 40 45Leu Ser Glu His Gln Gln Tyr Val Ile Gly Leu Phe Leu Ser Cys Leu50 55 60Tyr Thr Ile Phe Leu Phe Pro Ile Gly Phe Val Gly Asn Ile Leu Ile65 70 75 80Leu Val Val Asn Ile Ser Phe Arg Glu Lys Met Thr Ile Pro Asp Leu85 90 95Tyr Phe Ile Asn Leu Ala Val Ala Asp Leu Ile Leu Val Ala Asp Ser100 105 110Leu Ile Glu Val Phe Asn Leu His Glu Arg Tyr Tyr Asp Ile Ala Val115 120 125Leu Cys Thr Phe Met Ser Leu Phe Leu Gln Val Asn Met Tyr Ser Ser130 135 140Val Phe Phe Leu Thr Trp Met Ser Phe Asp Arg Tyr Ile Ala Leu Ala145 150 155 160Arg Ala Met Arg Cys Ser Leu Phe Arg Thr Lys His His Ala Arg Leu165 170 175Ser Cys Gly Leu Ile Trp Met Ala Ser Val Ser Ala Thr Leu Val Pro180 185 190Phe Thr Ala Val His Leu Gln His Thr Asp Glu Ala Cys Phe Cys Phe195 200 205Ala Asp Val Arg Glu Val Gln Trp Leu Glu Val Thr Leu Gly Phe Ile210 215 220Val Pro Phe Ala Ile Ile Gly Leu Cys Tyr Ser Leu Ile Val Arg Val225 230 235 240Leu Val Arg Ala His Arg His Arg Gly Leu Arg Pro Arg Arg Gln Lys245 250 255Ala Lys Arg Met Ile Leu Ala Val Val Leu Val Phe Phe Val Cys Trp260 265 270Leu Pro Glu Asn Val Phe Ile Ser Val His Leu Leu Gln Arg Thr Gln275 280 285Pro Gly Ala Ala Pro Cys Lys Gln Ser Phe Arg His Ala His Pro Leu290 295 300Thr Gly His Ile Val Asn Leu Ala Ala Phe Ser Asn Ser Cys Leu Asn305 310 315 320Pro Leu Ile Tyr Ser Phe Leu Gly Glu Thr Phe Arg Asp Lys Leu Arg325 330 335Leu Tyr Ile Glu Gln Lys Thr Asn Leu Pro Ala Leu Asn Arg Phe Cys340 345 350His Ala Ala Leu Lys Ala Val Ile Pro Asp Ser Thr Glu Gln Ser Asp355 360 365Val Arg Phe Ser Ser Ala Val370 375(196)SEQ ID NO195的资料(i)序列特征(A)长度960个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO195的序列描述ATGCCATTCC CAAACTGCTC AGCCCCCAGC ACTGTGGTGG CCACAGCTGT GGGTGTCTTG 60CTGGGGCTGG AGTGTGGGCT GGGTCTGCTG GGCAACGCGG TGGCGCTGTG GACCTTCCTG 120TTCCGGGTCA GGGTGTGGAA GCCGTACGCT GTCTACCTGC TCAACCTGGC CCTGGCTGAC 180CTGCTGTTGG CTGCGTGCCT GCCTTTCCTG GCCGCCTTCT ACCTGAGCCT CCAGGCTTGG 240CATCTGGGCC GTGTGGGCTG CTGGGCCCTG CGCTTCCTGC TGGACCTCAG CCGCAGCGTG 300GGGATGGCCT TCCTGGCCGC CGTGGCTTTG GACCGGTACC TCCGTGTGGT CCACCCTCGG 360CTTAAGGTCA ACCTGCTGTC TCCTCAGGCG GCCCTGGGGG TCTCGGGCCT CGTCTGGCTC 420CTGATGGTCG CCCTCACCTG CCCGGGCTTG CTCATCTCTG AGGCCGCCCA GAACTCCACC 480AGGTGCCACA GTTTCTACTC CAGGGCAGAC GGCTCCTTCA GCATCATCTG GCAGGAAGCA 540CTCTCCTGCC TTCAGTTTGT CCTCCCCTTT GGCCTCATCG TGTTCTGCAA TGCAGGCATC 600ATCAGGGCTC TCCAGAAAAG ACTCCGGGAG CCTGAGAAAC AGCCCAAGCT TCAGCGGGCC 660AAGGCACTGG TCACCTTGGT GGTGGTGCTG TTTGCTCTGT GCTTTCTGCC CTGCTTCCTG 720GCCAGAGTCC TGATGCACAT CTTCCAGAAT CTGGGGAGCT GCAGGGCCCT TTGTGCAGTG 780GCTCATACCT CGGATGTCAC GGGCAGCCTC ACCTACCTGC ACAGTGTCGT CAACCCCGTG 840GTATACTGCT TCTCCAGCCC CACCTTCAGG AGCTCCTATC GGAGGGTCTT CCACACCCTC 900CGAGGCAAAG GGCAGGCAGC AGAGCCCCCA GATTTCAACC CCAGAGACTC CTATTCCTGA 960(197)SEQ ID NO196的资料(i)序列特征(A)长度319个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO196的序列描述Met Pro Phe Pro Asn Cys Ser Ala Pro Ser Thr Val Val Ala Thr Ala1 5 10 15Val Gly Val Leu Leu Gly Leu Glu Cys Gly Leu Gly Leu Leu Gly Asn20 25 30Ala Val Ala Leu Trp Thr Phe Leu Phe Arg Val Arg Val Trp Lys Pro35 40 45Tyr Ala Val Tyr Leu Leu Asn Leu Ala Leu Ala Asp Leu Leu Leu Ala50 55 60Ala Cys Leu Pro Phe Leu Ala Ala Phe Tyr Leu Ser Leu Gln Ala Trp65 70 75 80His Leu Gly Arg Val Gly Cys Trp Ala Leu Arg Phe Leu Leu Asp Leu85 90 95Ser Arg Ser Val Gly Met Ala Phe Leu Ala Ala Val Ala Leu Asp Arg100 105 110Tyr Leu Arg Val Val His Pro Arg Leu Lys Val Asn Leu Leu Ser Pro115 120 125Gln Ala Ala Leu Gly Val Ser Gly Leu Val Trp Leu Leu Met Val Ala130 135 140Leu Thr Cys Pro Gly Leu Leu Ile Ser Glu Ala Ala Gln Asn Ser Thr145 150 155 160Arg Cys His Ser Phe Tyr Ser Arg Ala Asp Gly Ser Phe Ser Ile Ile165 170 175Trp Gln Glu Ala Leu Ser Cys Leu Gln Phe Val Leu Pro Phe Gly Leu180 185 190Ile Val Phe Cys Asn Ala Gly Ile Ile Arg Ala Leu Gln Lys Arg Leu195 200 205Arg Glu Pro Glu Lys Gln Pro Lys Leu Gln Arg Ala Lys Ala Leu Val210 215 220Thr Leu Val Val Val Leu Phe Ala Leu Cys Phe Leu Pro Cys Phe Leu225 230 235 240Ala Arg Val Leu Met His Ile Phe Gln Asn Leu Gly Ser Cys Arg Ala245 250 255Leu Cys Ala Val Ala His Thr Ser Asp Val Thr Gly Ser Leu Thr Tyr260 265 270Leu His Ser Val Val Asn Pro Val Val Tyr Cys Phe Ser Ser Pro Thr
275 280 285Phe Arg Ser Ser Tyr Arg Arg Val Phe His Thr Leu Arg Gly Lys Gly290 295 300Gln Ala Ala Glu Pro Pro Asp Phe Asn Pro Arg Asp Ser Tyr Ser305 310 315(198)SEQ ID NO197的资料(i)序列特征(A)长度1143个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO197的序列描述ATGGAGGAAG GTGGTGATTT TGACAACTAC TATGGGGCAG ACAACCAGTC TGAGTGTGAG 60TACACAGACT GGAAATCCTC GGGGGCCCTC ATCCCTGCCA TCTACATGTT GGTCTTCCTC 120CTGGGCACCA CGGGAAACGG TCTGGTGCTC TGGACCGTGT TTCGGAGCAG CCGGGAGAAG 180AGGCGCTCAG CTGATATCTT CATTGCTAGC CTGGCGGTGG CTGACCTGAC CTTCGTGGTG 240ACGCTGCCCC TGTGGGCTAC CTACACGTAC CGGGACTATG ACTGGCCCTT TGGGACCTTC 300TTCTGCAAGC TCAGCAGCTA CCTCATCTTC GTCAACATGT ACGCCAGCGT CTTCTGCCTC 360ACCGGCCTCA GCTTCGACCG CTACCTGGCC ATCGTGAGGC CAGTGGCCAA TGCTCGGCTG 420AGGCTGCGGG TCAGCGGGGC CGTGGCCACG GCAGTTCTTT GGGTGCTGGC CGCCCTCCTG 480GCCATGCCTG TCATGGTGTT ACGCACCACC GGGGACTTGG AGAACACCAC TAAGGTGCAG 540TGCTACATGG ACTACTCCAT GGTGGCCACT GTGAGCTCAG AGTGGGCCTG GGAGGTGGGC 600CTTGGGGTCT CGTCCACCAC CGTGGGCTTT GTGGTGCCCT TCACCATCAT GCTGACCTGT 660TACTTCTTCA TCGCCCAAAC CATCGCTGGC CACTTCCGCA AGGAACGCAT CGAGGGCCTG 720CGGAAGCGGC GCCGGCTTAA GAGCATCATC GTGGTGCTGG TGGTGACCTT TGCCCTGTGC 780TGGATGCCCT ACCACCTGGT GAAGACGCTG TACATGCTGG GCAGCCTGCT GCACTGGCCC 840TGTGACTTTG ACCTCTTCCT CATGAACATC TTCCCCTACT GCACCTGCAT CAGCTACGTC 900AACAGCTGCC TCAACCCCTT CCTCTATGCC TTTTTCGACC CCCGCTTCCG CCAGGCCTGC 960ACCTCCATGC TCTGCTGTGG CCAGAGCAGG TGCGCAGGCA CCTCCCACAG CAGCAGTGGG 1020GAGAAGTCAG CCAGCTACTC TTCGGGGCAC AGCCAGGGGC CCGGCCCCAA CATGGGCAAG 1080GGTGGAGAAC AGATGCACGA GAAATCCATC CCCTACAGCC AGGAGACCCT TGTGGTTGAC 1140TAG 1143(199)SEQ ID NO198的资料(i)序列特征(A)长度380个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO198的序列描述Met Glu Glu Gly Gly Asp Phe Asp Asn Tyr Tyr Gly Ala Asp Asn Gln1 5 10 15Ser Glu Cys Glu Tyr Thr Asp Trp Lys Ser Ser Gly Ala Leu Ile Pro20 25 30Ala Ile Tyr Met Leu Val Phe Leu Leu Gly Thr Thr Gly Asn Gly Leu35 40 45Val Leu Trp Thr Val Phe Arg Ser Ser Arg Glu Lys Arg Arg Ser Ala50 55 60Asp Ile Phe Ile Ala Ser Leu Ala Val Ala Asp Leu Thr Phe Val Val65 70 75 80Thr Leu Pro Leu Trp Ala Thr Tyr Thr Tyr Arg Asp Tyr Asp Trp Pro85 90 95Phe Gly Thr Phe Phe Cys Lys Leu Ser Ser Tyr Leu Ile Phe Val Asn100 105 110Met Tyr Ala Ser Val Phe Cys Leu Thr Gly Leu Ser Phe Asp Arg Tyr115 120 125Leu Ala Ile Val Arg Pro Val Ala Asn Ala Arg Leu Arg Leu Arg Val130 135 140Ser Gly Ala Val Ala Thr Ala Val Leu Trp Val Leu Ala Ala Leu Leu145 150 155 160Ala Met Pro Val Met Val Leu Arg Thr Thr Gly Asp Leu Glu Asn Thr165 170 175Thr Lys Val Gln Cys Tyr Met Asp Tyr Ser Met Val Ala Thr Val Ser180 185 190Ser Glu Trp Ala Trp Glu Val Gly Leu Gly Val Ser Ser Thr Thr Val195 200 205Gly Phe Val Val Pro Phe Thr Ile Met Leu Thr Cys Tyr Phe Phe Ile210 215 220Ala Gln Thr Ile Ala Gly His Phe Arg Lys Glu Arg Ile Glu Gly Leu225 230 235 240Arg Lys Arg Arg Arg Leu Lys Ser Ile Ile Val Val Leu Val Val Thr245 250 255Phe Ala Leu Cys Trp Met Pro Tyr His Leu Val Lys Thr Leu Tyr Met260 265 270Leu Gly Ser Leu Leu His Trp Pro Cys Asp Phe Asp Leu Phe Leu Met275 280 285Asn Ile Phe Pro Tyr Cys Thr Cys Ile Ser Tyr Val Asn Ser Cys Leu290 295 300Asn Pro Phe Leu Tyr Ala Phe Phe Asp Pro Arg Phe Arg Gln Ala Cys305 310 315 320Thr Ser Met Leu Cys Cys Gly Gln Ser Arg Cys Ala Gly Thr Ser His325 330 335Ser Ser Ser Gly Glu Lys Ser Ala Ser Tyr Ser Ser Gly His Ser Gln340 345 350Gly Pro Gly Pro Asn Met Gly Lys Gly Gly Glu Gln Met His Glu Lys355 360 365Ser Ile Pro Tyr Ser Gln Glu Thr Leu Val Val Asp370 375 380(200)SEQ ID NO199的资料(i)序列特征(A)长度1119个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO199的序列描述ATGAACTACC CGCTAACGCT GGAAATGGAC CTCGAGAACC TGGAGGACCT GTTCTGGGAA 60CTGGACAGAT TGGACAACTA TAACGACACC TCCCTGGTGG AAAATCATCT CTGCCCTGCC 120ACAGAGGGTC CCCTCATGGC CTCCTTCAAG GCCGTGTTCG TGCCCGTGGC CTACAGCCTC 180ATCTTCCTCC TGGGCGTGAT CGGCAACGTC CTGGTGCTGG TGATCCTGGA GCGGCACCGG 240CAGACACGCA GTTCCACGGA GACCTTCCTG TTCCACCTGG CCGTGGCCGA CCTCCTGCTG 300GTCTTCATCT TGCCCTTTGC CGTGGCCGAG GGCTCTGTGG GCTGGGTCCT GGGGACCTTC 360CTCTGCAAAA CTGTGATTGC CCTGCACAAA GTCAACTTCT ACTGCAGCAG CCTGCTCCTG 420GCCTGCATCG CCGTGGACCG CTACCTGGCC ATTGTCCACG CCGTCCATGC CTACCGCCAC 480CGCCGCCTCC TCTCCATCCA CATCACCTGT GGGACCATCT GGCTGGTGGG CTTCCTCCTT 540GCCTTGCCAG AGATTCTCTT CGCCAAAGTC AGCCAAGGCC ATCACAACAA CTCCCTGCCA 600CGTTGCACCT TCTCCCAAGA GAACCAAGCA GAAACGCATG CCTGGTTCAC CTCCCGATTC 660CTCTACCATG TGGCGGGATT CCTGCTGCCC ATGCTGGTGA TGGGCTGGTG CTACGTGGGG 720GTAGTGCACA GGTTGCGCCA GGCCCAGCGG CGCCCTCAGC GGCAGAAGGC AAAAAGGGTG 780GCCATCCTGG TGACAAGCAT CTTCTTCCTC TGCTGGTCAC CCTACCACAT CGTCATCTTC 840CTGGACACCC TGGCGAGGCT GAAGGCCGTG GACAATACCT GCAAGCTGAA TGGCTCTCTC 900CCCGTGGCCA TCACCATGTG TGAGTTCCTG GGCCTGGCCC ACTGCTGCCT CAACCCCATG 960CTCTACACTT TCGCCGGCGT GAAGTTCCGC AGTGACCTGT CGCGGCTCCT GACCAAGCTG 1020GGCTGTACCG GCCCTGCCTC CCTGTGCCAG CTCTTCCCTA GCTGGCGCAG GAGCAGTCTC 1080TCTGAGTCAG AGAATGCCAC CTCTCTCACC ACGTTCTAG1119(201)SEQ ID NO200的资料(i)序列特征(A)长度372个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO200的序列描述Met Asn Tyr Pro Leu Thr Leu Glu Met Asp Leu Glu Asn Leu Glu Asp1 5 10 15Leu Phe Trp Glu Leu Asp Arg Leu Asp Asn Tyr Asn Asp Thr Ser Leu20 25 30Val Glu Asn His Leu Cys Pro Ala Thr Glu Gly Pro Leu Met Ala Ser35 40 45Phe Lys Ala Val Phe Val Pro Val Ala Tyr Ser Leu Ile Phe Leu Leu50 55 60Gly Val Ile Gly Asn Val Leu Val Leu Val Ile Leu Glu Arg His Arg65 70 75 80Gln Thr Arg Ser Ser Thr Glu Thr Phe Leu Phe His Leu Ala Val Ala85 90 95Asp Leu Leu Leu Val Phe Ile Leu Pro Phe Ala Val Ala Glu Gly Ser100 105 110Val Gly Trp Val Leu Gly Thr Phe Leu Cys Lys Thr Val Ile Ala Leu115 120 125His Lys Val Asn Phe Tyr Cys Ser Ser Leu Leu Leu Ala Cys Ile Ala130 135 140Val Asp Arg Tyr Leu Ala Ile Val His Ala Val His Ala Tyr Arg His145 150 155 160Arg Arg Leu Leu Ser Ile His Ile Thr Cys Gly Thr Ile Trp Leu Val165 170 175Gly Phe Leu Leu Ala Leu Pro Glu Ile Leu Phe Ala Lys Val Ser Gln180 185 190Gly His His Asn Asn Ser Leu Pro Arg Cys Thr Phe Ser Gln Glu Asn195 200 205Gln Ala Glu Thr His Ala Trp Phe Thr Ser Arg Phe Leu Tyr His Val210 215 220Ala Gly Phe Leu Leu Pro Met Leu Val Met Gly Trp Cys Tyr Val Gly225 230 235 240Val Val His Arg Leu Arg Gln Ala Gln Arg Arg Pro Gln Arg Gln Lys245 250 255Ala Lys Arg Val Ala Ile Leu Val Thr Ser Ile Phe Phe Leu Cys Trp260 265 270Ser Pro Tyr His Ile Val Ile Phe Leu Asp Thr Leu Ala Arg Leu Lys275 280 285Ala Val Asp Asn Thr Cys Lys Leu Asn Gly Ser Leu Pro Val Ala Ile290 295 300Thr Met Cys Glu Phe Leu Gly Leu Ala His Cys Cys Leu Asn Pro Met305 310 315 320Leu Tyr Thr Phe Ala Gly Val Lys Phe Arg Ser Asp Leu Ser Arg Leu325 330 335Leu Thr Lys Leu Gly Cys Thr Gly Pro Ala Ser Leu Cys Gln Leu Phe340 345 350Pro Ser Trp Arg Arg Ser Ser Leu Ser Glu Ser Glu Asn Ala Thr Ser355 360 365Leu Thr Thr Phe370(202)SEQ ID NO201的资料(i)序列特征(A)长度1128个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO201的序列描述ATGGATGTGA CTTCCCAAGC CCGGGGCGTG GGCCTGGAGA TGTACCCAGG CACCGCGCAG 60CCTGCGGCCC CCAACACCAC CTCCCCCGAG CTCAACCTGT CCCACCCGCT CCTGGGCACC 120GCCCTGGCCA ATGGGACAGG TGAGCTCTCG GAGCACCAGC AGTACGTGAT CGGCCTGTTC 180CTCTCGTGCC TCTACACCAT CTTCCTCTTC CCCATCGGCT TTGTGGGCAA CATCCTGATC 240CTGGTGGTGA ACATCAGCTT CCGCGAGAAG ATGACCATCC CCGACCTGTA CTTCATCAAC 300CTGGCGGTGG CGGACCTCAT CCTGGTGGCC GACTCCCTCA TTGAGGTGTT CAACCTGCAC 360GAGCGGTACT ACGACATCGC CGTCCTGTGC ACCTTCATGT CGCTCTTCCT GCAGGTCAAC 420ATGTACAGCA GCGTCTTCTT CCTCACCTGG ATGAGCTTCG ACCGCTACAT CGCCCTGGCC 480AGGGCCATGC GCTGCAGCCT GTTCCGCACC AAGCACCACG CCCGGCTGAG CTGTGGCCTC 540ATCTGGATGG CATCCGTGTC AGCCACGCTG GTGCCCTTCA CCGCCGTGCA CCTGCAGCAC 600ACCGACGAGG CCTGCTTCTG TTTCGCGGAT GTCCGGGAGG TGCAGTGGCT CGAGGTCACG 660CTGGGCTTCA TCGTGCCCTT CGCCATCATC GGCCTGTGCT ACTCCCTCAT TGTCCGGGTG 720CTGGTCAGGG CGCACCGGCA CCGTGGGCTG CGGCCCCGGC GGCAGAAGGC GAAGCGCATG 780ATCCTCGCGG TGGTGCTGGT CTTCTTCGTC TGCTGGCTGC CGGAGAACGT CTTCATCAGC 840GTGCACCTCC TGCAGCGGAC GCAGCCTGGG GCCGCTCCCT GCAAGCAGTC TTTCCGCCAT 900GCCCACCCCC TCACGGGCCA CATTGTCAAC CTCACCGCCT TCTCCAACAG CTGCCTAAAC 960CCCCTCATCT ACAGCTTTCT CGGGGAGACC TTCAGGGACA AGCTGAGGCT GTACATTGAG 1020CAGAAAACAA ATTTGCCGGC CCTGAACCGC TTCTGTCACG CTGCCCTGAA GGCCGTCATT 1080CCAGACAGCA CCGAGCAGTC GGATGTGAGG TTCAGCAGTG CCGTGTAG 1128(203)SEQ ID NO202的资料(i)序列特征(A)长度375个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO202的序列描述Met Asp Val Thr Ser Gln Ala Arg Gly Val Gly Leu Glu Met Tyr Pro1 5 10 15Gly Thr Ala Gln Pro Ala Ala Pro Asn Thr Thr Ser Pro Glu Leu Asn20 25 30Leu Ser His Pro Leu Leu Gly Thr Ala Leu Ala Asn Gly Thr Gly Glu35 40 45Leu Ser Glu His Gln Gln Tyr Val Ile Gly Leu Phe Leu Ser Cys Leu50 55 60Tyr Thr Ile Phe Leu Phe Pro Ile Gly Phe Val Gly Asn Ile Leu Ile65 70 75 80Leu Val Val Asn Ile Ser Phe Arg Glu Lys Met Thr Ile Pro Asp Leu85 90 95Tyr Phe Ile Asn Leu Ala Val Ala Asp Leu Ile Leu Val Ala Asp Ser100 105 110Leu Ile Glu Val Phe Asn Leu His Glu Arg Tyr Tyr Asp Ile Ala Val115 120 125Leu Cys Thr Phe Met Ser Leu Phe Leu Gln Val Ash Met Tyr Ser Ser130 135 140Val Phe Phe Leu Thr Trp Met Ser Phe Asp Arg Tyr Ile Ala Leu Ala145 150 155 160Arg Ala Met Arg Cys Ser Leu Phe Arg Thr Lys His His Ala Arg Leu165 170 175Ser Cys Gly Leu Ile Trp Met Ala Ser Val Ser Ala Thr Leu Val Pro180 185 190Phe Thr Ala Val His Leu Gln His Thr Asp Glu Ala Cys Phe Cys Phe195 200 205Ala Asp Val Arg Glu Val Gln Trp Leu Glu Val Thr Leu Gly Phe Ile210 215 220Val Pro Phe Ala Ile Ile Gly Leu Cys Tyr Ser Leu Ile Val Arg Val225 230 235 240Leu Val Arg Ala His Arg His Arg Gly Leu Arg Pro Arg Arg Gln Lys245 250 255Ala Lys Arg Met Ile Leu Ala Val Val Leu Val Phe Phe Val Cys Trp260 265 270Leu Pro Glu Asn Val Phe Ile Ser Val His Leu Leu Gln Arg Thr Gln275 280 285Pro Gly Ala Ala Pro Cys Lys Gln Ser Phe Arg His Ala His Pro Leu290 295 300Thr Gly His Ile Val Asn Leu Thr Ala Phe Ser Asn Ser Cys Leu Asn305 310 315 320Pro Leu Ile Tyr Ser Phe Leu Gly Glu Thr Phe Arg Asp Lys Leu Arg325 330 335Leu Tyr Ile Glu Gln Lys Thr Asn Leu Pro Ala Leu Asn Arg Phe Cys340 345 350His Ala Ala Leu Lys Ala Val Ile Pro Asp Ser Thr Glu Gln Ser Asp355 360 365Val Arg Phe Ser Ser Ala Val370 375(204)SEQ ID NO203的资料(i)序列特征(A)长度1137个碱基对(B)类型核酸
(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO203的序列描述ATGGACCTGG GGAAACCAAT GAAAAGCGTG CTGGTGGTGG CTCTCCTTGT CATTTTCCAG 60GTATGCCTGT GTCAAGATGA GGTCACGGAC GATTACATCG GAGACAACAC CACAGTGGAC 120TACACTTTGT TCGAGTCTTT GTGCTCCAAG AAGGACGTGC GGAACTTTAA AGCCTGGTTC 180CTCCCTATCA TGTACTCCAT CATTTGTTTC GTGGGCCTAC TGGGCAATGG GCTGGTCGTG 240TTGACCTATA TCTATTTCAA GAGGCTCAAG ACCATGACCG ATACCTACCT GCTCAACCTG 300GCGGTGGCAG ACATCCTCTT CCTCCTGACC CTTCCCTTCT GGGCCTACAG CGCGGCCAAG 360TCCTGGGTCT TCGGTGTCCA CTTTTGCAAG CTCATCTTTG CCATCTACAA GATGAGCTTC 420TTCAGTGGCA TGCTCCTACT TCTTTGCATC AGCATTGACC GCTACGTGGC CATCGTCCAG 480GCTGTCTCAG CTCACCGCCA CCGTGCCCGC GTCCTTCTCA TCAGCAAGCT GTCCTGTGTG 540GGCATCTGGA TACTAGCCAC AGTGCTCTCC ATCCCAGAGC TCCTGTACAG TGACCTCCAG 600AGGAGCAGCA GTGAGCAAGC GATGCGATGC TCTCTCATCA CAGAGCATGT GGAGGCCTTT 660ATCACCATCC AGGTGGCCCA GATGGTGATC GGCTTTCTGG TCCCCCTGCT GGCCATGAGC 720TTCTGTTACC TTGTCATCAT CCGCACCCTG CTCCAGGCAC GCAACTTTGA GCGCAACAAG 780GCCAAAAAGG TGATCATCGC TGTGGTCGTG GTCTTCATAG TCTTCCAGCT GCCCTACAAT 840GGGGTGGTCC TGGCCCAGAC GGTGGCCAAC TTCAACATCA CCAGTAGCAC CTGTGAGCTC 900AGTAAGCAAC TCAACATCGC CTACGACGTC ACCTACAGCC TGGCCTGCGT CCGCTGCTGC 960GTCAACCCTT TCTTGTACGC CTTCATCGGC GTCAAGTTCC GCAACGATCT CTTCAAGCTC 1020TTCAAGGACC TGGGCTGCCT CAGCCAGGAG CAGCTCCGGC AGTGGTCTTC CTGTCGGCAC 1080ATCCGGCGCT CCTCCATGAG TGTGGAGGCC GAGACCACCA CCACCTTCTC CCCATAG1137(205)SEQ ID NO204的资料(i)序列特征(A)长度378个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO204的序列描述Met Asp Leu Gly Lys Pro Met Lys Ser Val Leu Val Val Ala Leu Leu1 5 10 15Val Ile Phe Gln Val Cys Leu Cys Gln Asp Glu Val Thr Asp Asp Tyr20 25 30Ile Gly Asp Asn Thr Thr Val Asp Tyr Thr Leu Phe Glu Ser Leu Cys35 40 45Ser Lys Lys Asp Val Arg Asn Phe Lys Ala Trp Phe Leu Pro Ile Met50 55 60Tyr Ser Ile Ile Cys Phe Val Gly Leu Leu Gly Asn Gly Leu Val Val65 70 75 80Leu Thr Tyr Ile Tyr Phe Lys Arg Leu Lys Thr Met Thr Asp Thr Tyr85 90 95Leu Leu Asn Leu Ala Val Ala Asp Ile Leu Phe Leu Leu Thr Leu Pro100 105 110Phe Trp Ala Tyr Ser Ala Ala Lys Ser Trp Val Phe Gly Val His Phe115 120 125Cys Lys Leu Ile Phe Ala Ile Tyr Lys Met Ser Phe Phe Ser Gly Met130 135 140Leu Leu Leu Leu Cys Ile Ser Ile Asp Arg Tyr Val Ala Ile Val Gln145 150 155 160Ala Val Ser Ala His Arg His Arg Ala Arg Val Leu Leu Ile Ser Lys165 170 175Leu Ser Cys Val Gly Ile Trp Ile Leu Ala Thr Val Leu Ser Ile Pro180 185 190Glu Leu Leu Tyr Ser Asp Leu Gln Arg Ser Ser Ser Glu Gln Ala Met195 200 205Arg Cys Ser Leu Ile Thr Glu His Val Glu Ala Phe Ile Thr Ile Gln210 215 220Val Ala Gln Met Val Ile Gly Phe Leu Val Pro Leu Leu Ala Met Ser225 230 235 240Phe Cys Tyr Leu Val Ile Ile Arg Thr Leu Leu Gln Ala Arg Asn Phe245 250 255Glu Arg Asn Lys Ala Lys Lys Val Ile Ile Ala Val Val Val Val Phe260 265 270Ile Val Phe Gln Leu Pro Tyr Asn Gly Val Val Leu Ala Gln Thr Val275 280 285Ala Asn Phe Asn Ile Thr Ser Ser Thr Cys Glu Leu Ser Lys Gln Leu290 295 300Asn Ile Ala Tyr Asp Val Thr Tyr Ser Leu Ala Cys Val Arg Cys Cys305 310 315 320Val Asn Pro Phe Leu Tyr Ala Phe Ile Gly Val Lys Phe Arg Asn Asp325 330 335Leu Phe Lys Leu Phe Lys Asp Leu Gly Cys Leu Ser Gln Glu Gln Leu340 345 350Arg Gln Trp Ser Ser Cys Arg His Ile Arg Arg Ser Ser Met Ser Val355 360 365Glu Ala Glu Thr Thr Thr Thr Phe Ser Pro370 375(206)SEQ ID NO205的资料(i)序列特征(A)长度1086个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO205的序列描述ATGGATATAC AAATGGCAAA CAATTTTACT CCGCCCTCTG CAACTCCTCA GGGAAATGAC 60TGTGACCTCT ATGCACATCA CAGCACGGCC AGGATAGTAA TGCCTCTGCA TTACAGCCTC 120GTCTTCATCA TTGGGCTCGT GGGAAACTTA CTAGCCTTGG TCGTCATTGT TCAAAACAGG 180AAAAAAATCA ACTCTACCAC CCTCTATTCA ACAAATTTGG TGATTTCTGA TATACTTTTT 240ACCACGGCTT TGCCTACACG AATAGCCTAC TATGCAATGG GCTTTGACTG GAGAATCGGA 300GATGCCTTGT GTAGGATAAC TGCGCTAGTG TTTTACATCA ACACATATGC AGGTGTGAAC 360TTTATGACCT GCCTGAGTAT TGACCGCTTC ATTGCTGTGG TGCACCCTCT ACGCTACAAC 420AAGATAAAAA GGATTGAACA TGCAAAAGGC GTGTGCATAT TTGTCTGGAT TCTAGTATTT 480GCTCAGACAC TCCCACTCCT CATCAACCCT ATGTCAAAGC AGGAGGCTGA AAGGATTACA 540TGCATGGAGT ATCCAAACTT TGAAGAAACT AAATCTCTTC CCTGGATTCT GCTTGGGGCA 600TGTTTCATAG GATATGTACT TCCACTTATA ATCATTCTCA TCTGCTATTC TCAGATCTGC 660TGCAAACTCT TCAGAACTGC CAAACAAAAC CCACTCACTG AGAAATCTGG TGTAAACAAA 720AAGGCTAAAA ACACAATTAT TCTTATTATT GTTGTGTTTG TTCTCTGTTT CACACCTTAC 780CATGTTGCAA TTATTCAACA TATGATTAAG AAGCTTCGTT TCTCTAATTT CCTGGAATGT 840AGCCAAAGAC ATTCGTTCCA GATTTCTCTG CACTTTACAG TATGCCTGAT GAACTTCAAT 900TGCTGCATGG ACCCTTTTAT CTACTTCTTT GCATGTAAAG GGTATAAGAG AAAGGTTATG 960AGGATGCTGA AACGGCAAGT CAGTGTATCG ATTTCTAGTG CTGTGAAGTC AGCCCCTGAA 1020GAAAATTCAC GTGAAATGAC AGAAACGCAG ATGATGATAC ATTCCAAGTC TTCAAATGGA 1080AAGTGA 1086(207)SEQ ID NO206的资料(i)序列特征(A)长度361个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO206的序列描述Met Asp Ile Gln Met Ala Asn Asn Phe Thr Pro Pro Ser Ala Thr Pro1 5 10 15Gln Gly Asn Asp Cys Asp Leu Tyr Ala His His Ser Thr Ala Arg Ile20 25 30Val Met Pro Leu His Tyr Ser Leu Val Phe Ile Ile Gly Leu Val Gly35 40 45Asn Leu Leu Ala Leu Val Val Ile Val Gln Asn Arg Lys Lys Ile Asn50 55 60Ser Thr Thr Leu Tyr Ser Thr Asn Leu Val Ile Ser Asp Ile Leu Phe65 70 75 80Thr Thr Ala Leu Pro Thr Arg Ile Ala Tyr Tyr Ala Met Gly Phe Asp85 90 95Trp Arg Ile Gly Asp Ala Leu Cys Arg Ile Thr Ala Leu Val Phe Tyr100 105 110Ile Asn Thr Tyr Ala Gly Val Asn Phe Met Thr Cys Leu Ser Ile Asp115 120 125Arg Phe Ile Ala Val Val His Pro Leu Arg Tyr Asn Lys Ile Lys Arg130 135 140Ile Glu His Ala Lys Gly Val Cys Ile Phe Val Trp Ile Leu Val Phe145 150 155 160Ala Gln Thr Leu Pro Leu Leu Ile Asn Pro Met Ser Lys Gln Glu Ala165 170 175Glu Arg Ile Thr Cys Met Glu Tyr Pro Asn Phe Glu Glu Thr Lys Ser180 185 190Leu Pro Trp Ile Leu Leu Gly Ala Cys Phe Ile Gly Tyr Val Leu Pro195 200 205Leu Ile Ile Ile Leu Ile Cys Tyr Ser Gln Ile Cys Cys Lys Leu Phe210 215 220Arg Thr Ala Lys Gln Asn Pro Leu Thr Glu Lys Ser Gly Val Asn Lys225 230 235 240Lys Ala Lys Asn Thr Ile Ile Leu Ile Ile Val Val Phe Val Leu Cys245 250 255Phe Thr Pro Tyr His Val Ala Ile Ile Gln His Met Ile Lys Lys Leu260 265 270Arg Phe Ser Asn Phe Leu Glu Cys Ser Gln Arg His Ser Phe Gln Ile
275 280 285Ser Leu His Phe Thr Val Cys Leu Met Asn Phe Asn Cys Cys Met Asp290 295 300Pro Phe Ile Tyr Phe Phe Ala Cys Lys Gly Tyr Lys Arg Lys Val Met305 310 315 320Arg Met Leu Lys Arg Gln Val Ser Val Ser Ile Ser Ser Ala Val Lys325 330 335Ser Ala Pro Glu Glu Asn Ser Arg Glu Met Thr Glu Thr Gln Met Met340 345 350Ile His Ser Lys Ser Ser Asn Gly Lys355 360(208)SEQ ID NO207的资料(i)序列特征(A)长度1446个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO207的序列描述ATGCGGTGGC TGTGGCCCCT GGCTGTCTCT CTTGCTGTGA TTTTGGCTGT GGGGCTAAGC 60AGGGTCTCTG GGGGTGCCCC CCTGCACCTG GGCAGGCACA GAGCCGAGAC CCAGGAGCAG 120CAGAGCCGAT CCAAGAGGGG CACCGAGGAT GAGGAGGCCA AGGGCGTGCA GCAGTATGTG 180CCTGAGGAGT GGGCGGAGTA CCCCCGGCCC ATTCACCCTG CTGGCCTGCA GCCAACCAAG 240CCCTTGGTGG CCACCAGCCC TAACCCCGAC AAGGATGGGG GCACCCCAGA CAGTGGGCAG 300GAACTGAGGG GCAATCTGAC AGGGGCACCA GGGCAGAGGC TACAGATCCA GAACCCCCTG 360TATCCGGTGA CCGAGAGCTC CTACAGTGCC TATGCCATCA TGCTTCTGGC GCTGGTGGTG 420TTTGCGGTGG GCATTGTGGG CAACCTGTCG GTCATGTGCA TCGTGTGGCA CAGCTACTAC 480CTGAAGAGCG CCTGGAACTC CATCCTTGCC AGCCTGGCCC TCTGGGATTT TCTGGTCCTC 540TTTTTCTGCC TCCCTATTGT CATCTTCAAC GAGATCACCA AGCAGAGGCT ACTGGGTGAC 600GTTTCTTGTC GTGCCGTGCC CTTCATGGAG GTCTCCTCTC TGGGAGTCAC GACTTTCAGC 660CTCTGTGCCC TGGGCATTGA CCGCTTCCAC GTGGCCACCA GCACCCTGCC CAAGGTGAGG 720CCCATCGAGC GGTGCCAATC CATCCTGGCC AAGTTGGCTG TCATCTGGGT GGGCTCCATG 780ACGCTGGCTG TGCCTGAGCT CCTGCTGTGG CAGCTGGCAC AGGAGCCTGC CCCCACCATG 840GGCACCCTGG ACTCATGCAT CATGAAACCC TCAGCCAGCC TGCCCGAGTC CCTGTATTCA 900CTGGTGATGA CCTACCAGAA CGCCCGCATG TGGTGGTACT TTGGCTGCTA CTTCTGCCTG 960CCCATCCTCT TCACAGTCAC CTGCCAGCTG GTGACATGGC GGGTGCGAGG CCCTCCAGGG 1020AGGAAGTCAG AGTGCAGGGC CAGCAAGCAC GAGCAGTGTG AGAGCCAGCT CAAGAGCACC 1080GTGGTGGGCC TGACCGTGGT CTACGCCTTC TGCACCCTCC CAGAGAACGT CTGCAACATC 1140GTGGTGGCCT ACCTCTCCAC CGAGCTGACC CGCCAGACCC TGGACCTCCT GGGCCTCATC 1200AACCAGTTCT CCACCTTCTT CAAGGGCGCC ATCACCCCAG TGCTGCTCCT TTGCATCTGC 1260AGGCCGCTGG GCCAGGCCTT CCTGGACTGC TGCTGCTGCT GCTGCTGTGA GGAGTGCGGC 1320GGGGCTTCGG AGGCCTCTGC TGCCAATGGG TCGGACAACA AGCTCAAGAC CGAGGTGTCC 1380TCTTCCATCT ACTTCCACAA GCCCAGGGAG TCACCCCCAC TCCTGCCCCT GGGCACACCT 1440TGCTGA 1446(209)SEQ ID NO208的资料(i)序列特征(A)长度481个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO208的序列描述Met Arg Trp Leu Trp Pro Leu Ala Val Ser Leu Ala Val Ile Leu Ala1 5 10 15Val Gly Leu Ser Arg Val Ser Gly Gly Ala Pro Leu His Leu Gly Arg20 25 30His Arg Ala Glu Thr Gln Glu Gln Gln Ser Arg Ser Lys Arg Gly Thr35 40 45Glu Asp Glu Glu Ala Lys Gly Val Gln Gln Tyr Val Pro Glu Glu Trp50 55 60Ala Glu Tyr Pro Arg Pro Ile His Pro Ala Gly Leu Gln Pro Thr Lys65 70 75 80Pro Leu Val Ala Thr Ser Pro Asn Pro Asp Lys Asp Gly Gly Thr Pro85 90 95Asp Ser Gly Gln Glu Leu Arg Gly Asn Leu Thr Gly Ala Pro Gly Gln100 105 110Arg Leu Gln Ile Gln Asn Pro Leu Tyr Pro Val Thr Glu Ser Ser Tyr115 120 125Ser Ala Tyr Ala Ile Met Leu Leu Ala Leu Val Val Phe Ala Val Gly130 135 140Ile Val Gly Asn Leu Ser Val Met Cys Ile Val Trp His Ser Tyr Tyr145 150 155 160Leu Lys Ser Ala Trp Asn Ser Ile Leu Ala Ser Leu Ala Leu Trp Asp165 170 175Phe Leu Val Leu Phe Phe Cys Leu Pro Ile Val Ile Phe Asn Glu Ile180 185 190Thr Lys Gln Arg Leu Leu Gly Asp Val Ser Cys Arg Ala Val Pro Phe
195 200 205Met Glu Val Ser Ser Leu Gly Val Thr Thr Phe Ser Leu Cys Ala Leu210 215 220Gly Ile Asp Arg Phe His Val Ala Thr Ser Thr Leu Pro Lys Val Arg225 230 235 240Pro Ile Glu Arg Cys Gln Ser Ile Leu Ala Lys Leu Ala Val Ile Trp245 250 255Val Gly Ser Met Thr Leu Ala Val Pro Glu Leu Leu Leu Trp Gln Leu260 265 270Ala Gln Glu Pro Ala Pro Thr Met Gly Thr Leu Asp Ser Cys Ile Met275 280 285Lys Pro Ser Ala Ser Leu Pro Glu Ser Leu Tyr Ser Leu Val Met Thr290 295 300Tyr Gln Asn Ala Arg Met Trp Trp Tyr Phe Gly Cys Tyr Phe Cys Leu305 310 315 320Pro Ile Leu Phe Thr Val Thr Cys Gln Leu Val Thr Trp Arg Val Arg325 330 335Gly Pro Pro Gly Arg Lys Ser Glu Cys Arg Ala Ser Lys His Glu Gln340 345 350Cys Glu Ser Gln Leu Lys Ser Thr Val Val Gly Leu Thr Val Val Tyr355 360 365Ala Phe Cys Thr Leu Pro Glu Asn Val Cys Asn Ile Val Val Ala Tyr370 375 380Leu Ser Thr Glu Leu Thr Arg Gln Thr Leu Asp Leu Leu Gly Leu Ile385 390 395 400Asn Gln Phe Ser Thr Phe Phe Lys Gly Ala Ile Thr Pro Val Leu Leu405 410 415Leu Cys Ile Cys Arg Pro Leu Gly Gln Ala Phe Leu Asp Cys Cys Cys420 425 430Cys Cys Cys Cys Glu Glu Cys Gly Gly Ala Ser Glu Ala Ser Ala Ala435 440 445Asn Gly Ser Asp Asn Lys Leu Lys Thr Glu Val Ser Ser Ser Ile Tyr450 455 460Phe His Lys Pro Arg Glu Ser Pro Pro Leu Leu Pro Leu Gly Thr Pro465 470 475 480Cys(210)SEQ ID NO209的资料(i)序列特征(A)长度1101个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO209的序列描述ATGTGGAACG CGACGCCCAG CGAAGAGCCG GGGTTCAACC TCACACTGGC CGACCTGGAC 60TGGGATGCTT CCCCCGGCAA CGACTCGCTG GGCGACGAGC TGCTGCAGCT CTTCCCCGCG 120CCGCTGCTGG CGGGCGTCAC AGCCACCTGC GTGGCACTCT TCGTGGTGGG TATCGCTGGC 180AACCTGCTCA CCATGCTGGT GGTGTCGCGC TTCCGCGAGC TGCGCACCAC CACCAACCTC 240TACCTGTCCA GCATGGCCTT CTCCGATCTG CTCATCTTCC TCTGCATGCC CCTGGACCTC 300GTTCGCCTCT GGCAGTACCG GCCCTGGAAC TTCGGCGACC TCCTCTGCAA ACTCTTCCAA 360TTCGTCAGTG AGAGCTGCAC CTACGCCACG GTGCTCACCA TCACAGCGCT GAGCGTCGAG 420CGCTACTTCG CCATCTGCTT CCCACTCCGG GCCAAGGTGG TGGTCACCAA GGGGCGGGTG 480AAGCTGGTCA TCTTCGTCAT CTGGGCCGTG GCCTTCTGCA GCGCCGGGCC CATCTTCGTG 540CTAGTCGGGG TGGAGCACGA GAACGGCACC GACCCTTGGG ACACCAACGA GTGCCGCCCC 600ACCGAGTTTG CGGTGCGCTC TGGACTGCTC ACGGTCATGG TGTGGGTGTC CAGCATCTTC 660TTCTTCCTTC CTGTCTTCTG TCTCACGGTC CTCTACAGTC TCATCGGCAG GAAGCTGTGG 720CGGAGGAGGC GCGGCGATGC TGTCGTGGGT GCCTCGCTCA GGGACCAGAA CCACAAGCAA 780ACCAAGAAAA TGCTGGCTGT AGTGGTGTTT GCCTTCATCC TCTGCTGGCT CCCCTTCCAC 840GTAGGGCGAT ATTTATTTTC CAAATCCTTT GAGCCTGGCT CCTTGGAGAT TGCTCAGATC 900AGCCAGTACT GCAACCTCGT GTCCTTTGTC CTCTTCTACC TCAGTGCTGC CATCAACCCC 960ATTCTGTACA ACATCATGTC CAAGAAGTAC CGGGTGGCAG TGTTCAGACT TCTGGGATTC 1020GAACCCTTCT CCCAGAGAAA GCTCTCCACT CTGAAAGATG AAAGTTCTCG GGCCTGGACA 1080GAATCTAGTA TTAATACATG A 1101(211)SEQ ID NO210的资料(i)序列特征(A)长度366个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO210的序列描述Met Trp Asn Ala Thr Pro Ser Glu Glu Pro Gly Phe Asn Leu Thr Leu1 5 10 15Ala Asp Leu Asp Trp Asp Ala Ser Pro Gly Asn Asp Ser Leu Gly Asp
20 25 30Glu Leu Leu Gln Leu Phe Pro Ala Pro Leu Leu Ala Gly Val Thr Ala35 40 45Thr Cys Val Ala Leu Phe Val Val Gly Ile Ala Gly Asn Leu Leu Thr50 55 60Met Leu Val Val Ser Arg Phe Arg Glu Leu Arg Thr Thr Thr Asn Leu65 70 75 80Tyr Leu Ser Ser Met Ala Phe Ser Asp Leu Leu Ile Phe Leu Cys Met85 90 95Pro Leu Asp Leu Val Arg Leu Trp Gln Tyr Arg Pro Trp Asn Phe Gly100 105 110Asp Leu Leu Cys Lys Leu Phe Gln Phe Val Ser Glu Ser Cys Thr Tyr115 120 125Ala Thr Val Leu Thr Ile Thr Ala Leu Ser Val Glu Arg Tyr Phe Ala130 135 140Ile Cys Phe Pro Leu Arg Ala Lys Val Val Val Thr Lys Gly Arg Val145 150 155 160Lys Leu Val Ile Phe Val Ile Trp Ala Val Ala Phe Cys Ser Ala Gly165 170 175Pro Ile Phe Val Leu Val Gly Val Glu His Glu Asn Gly Thr Asp Pro180 185 190Trp Asp Thr Asn Glu Cys Arg Pro Thr Glu Phe Ala Val Arg Ser Gly195 200 205Leu Leu Thr Val Met Val Trp Val Ser Ser Ile Phe Phe Phe Leu Pro210 215 220Val Phe Cys Leu Thr Val Leu Tyr Ser Leu Ile Gly Arg Lys Leu Trp225 230 235 240Arg Arg Arg Arg Gly Asp Ala Val Val Gly Ala Ser Leu Arg Asp Gln245 250 255Asn His Lys Gln Thr Lys Lys Met Leu Ala Val Val Val Phe Ala Phe260 265 270Ile Leu Cys Trp Leu Pro Phe His Val Gly Arg Tyr Leu Phe Ser Lys275 280 285Ser Phe Glu Pro Gly Ser Leu Glu Ile Ala Gln Ile Ser Gln Tyr Cys
290 295 300Asn Leu Val Ser Phe Val Leu Phe Tyr Leu Ser Ala Ala Ile Asn Pro305 310 315 320Ile Leu Tyr Asn Ile Met Ser Lys Lys Tyr Arg Val Ala Val Phe Arg325 330 335Leu Leu Gly Phe Glu Pro Phe Ser Gln Arg Lys Leu Ser Thr Leu Lys340 345 350Asp Glu Ser Ser Arg Ala Trp Thr Glu Ser Ser Ile Asn Thr355 360 365(212)SEQ ID NO211的资料(i)序列特征(A)长度1842个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO211的序列描述ATGCGAGCCC CGGGCGCGCT TCTCGCCCGC ATGTCGCGGC TACTGCTTCT GCTACTGCTC 60AAGGTGTCTG CCTCTTCTGC CCTCGGGGTC GCCCCTGCGT CCAGAAACGA AACTTGTCTG 120GGGGAGAGCT GTGCACCTAC AGTGATCCAG CGCCGCGGCA GGGACGCCTG GGGACCGGGA 180AATTCTGCAA GAGACGTTCT GCGAGCCCGA GCACCCAGGG AGGAGCAGGG GGCAGCGTTT 240CTTGCGGGAC CCTCCTGGGA CCTGCCGGCG GCCCCGGGCC GTGACCCGGC TGCAGGCAGA 300GGGGCGGAGG CGTCGGCAGC CGGACCCCCG GGACCTCCAA CCAGGCCACC TGGCCCCTGG 360AGGTGGAAAG GTGCTCGGGG TCAGGAGCCT TCTGAAACTT TGGGGAGAGG GAACCCCACG 420GCCCTCCAGC TCTTCCTTCA GATCTCAGAG GAGGAAGAGA AGGGTCCCAG AGGCGCTGGC 480ATTTCCGGGC GTAGCCAGGA GCAGAGTGTG AAGACAGTCC CCGGAGCCAG CGATCTTTTT 540TACTGGCCAA GGAGAGCCGG GAAACTCCAG GGTTCCCACC ACAAGCCCCT GTCCAAGACG 600GCCAATGGAC TGGCGGGGCA CGAAGGGTGG ACAATTGCAC TCCCGGGCCG GGCGCTGGCC 660CAGAATGGAT CCTTGGGTGA AGGAATCCAT GAGCCTGGGG GTCCCCGCCG GGGAAACAGC 720ACGAACCGGC GTGTGAGACT GAAGAACCCC TTCTACCCGC TGACCCAGGA GTCCTATGGA 780GCCTACGCGG TCATGTGTCT GTCCGTGGTG ATCTTCGGGA CCGGCATCAT TGGCAACCTG 840GCGGTGATGT GCATCGTGTG CCACAACTAC TACATGCGGA GCATCTCCAA CTCCCTCTTG 900GCCAACCTGG CCTTCTGGGA CTTTCTCATC ATCTTCTTCT GCCTTCCGCT GGTCATCTTC 960CACGAGCTGA CCAAGAAGTG GCTGCTGGAG GACTTCTCCT GCAAGATCGT GCCCTATATA 1020GAGGTCGCCT CTCTGGGAGT CACCACTTTC ACCTTATGTG CTCTGTGCAT AGACCGCTTC 1080CGTGCTGCCA CCAACGTACA GATGTACTAC GAAATGATCG AAAATTGTTC CTCAACAACT 1140GCCAAACTTG CTGTTATATG GGTGGGAGCT CTATTGTTAG CACTTCCAGA AGTTGTTCTC 1200CGCCAGCTGA GCAAGGAGGA TTTGGGGTTT AGTGGCCGAG CTCCGGCAGA AAGGTGCATT 1260ATTAAGATCT CTCCTGATTT ACCAGACACC ATCTATGTTC TAGCCCTCAC CTACGACAGT 1320GCGAGACTGT GGTGGTATTT TGGCTGTTAC TTTTGTTTGC CCACGCTTTT CACCATCACC 1380TGCTCTCTAG TGACTGCGAG GAAAATCCGC AAAGCAGAGA AAGCCTGTAC CCGAGGGAAT 1440AAACGGCAGA TTCAACTAGA GAGTCAGATG AAGTGTACAG TAGTGGCACT GACCATTTTA 1500TATGGATTTT GCATTATTCC TGAAAATATC TGCAACATTG TTACTGCCTA CATGGCTACA 1560GGGGTTTCAC AGCAGACAAT GGACCTCCTT AATATCATCA GCCAGTTCCT TTTGTTCTTT 1620AAGTCCTGTG TCACCCCAGT CCTCCTTTTC TGTCTCTGCA AACCCTTCAG TCGGGCCTTC 1680ATGGAGTGCT GCTGCTGTTG CTGTGAGGAA TGCATTCAGA AGTCTTCAAC GGTGACCAGT 1740GATGACAATG ACAACGAGTA CACCACGGAA CTCGAACTCT CGCCTTTCAG TACCATACGC 1800CGTGAAATGT CCACTTTTGC TTCTGTCGGA ACTCATTGCT GA 1842(213)SEQ ID NO212的资料(i)序列特征(A)长度613个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO212的序列描述Met Arg Ala Pro Gly Ala Leu Leu Ala Arg Met Ser Arg Leu Leu Leul 5 10 15Leu Leu Leu Leu Lys Val Ser Ala Ser Ser Ala Leu Gly Val Ala Pro20 25 30Ala Ser Arg Asn Glu Thr Cys Leu Gly Glu Ser Cys Ala Pro Thr Val35 40 45Ile Gln Arg Arg Gly Arg Asp Ala Trp Gly Pro Gly Asn Ser Ala Arg50 55 60Asp Val Leu Arg Ala Arg Ala Pro Arg Glu Glu Gln Gly Ala Ala Phe65 70 75 80Leu Ala Gly Pro Ser Trp Asp Leu Pro Ala Ala Pro Gly Arg Asp Pro85 90 95Ala Ala Gly Arg Gly Ala Glu Ala Ser Ala Ala Gly Pro Pro Gly Pro100 105 1l0Pro Thr Arg Pro Pro Gly Pro Trp Arg Trp Lys Gly Ala Arg Gly Gln115 120 125Glu Pro Ser Glu Thr Leu Gly Arg Gly Asn Pro Thr Ala Leu Gln Leu130 135 140Phe Leu Gln Ile Ser Glu Glu Glu Glu Lys Gly Pro Arg Gly Ala Gly145 150 155 160Ile Ser Gly Arg Ser Gln Glu Gln Ser Val Lys Thr Val Pro Gly Ala165 170 175Ser Asp Leu Phe Tyr Trp Pro Arg Arg Ala Gly Lys Leu Gln Gly Ser
180 185 190His His Lys Pro Leu Ser Lys Thr Ala Asn Gly Leu Ala Gly His Glu195 200 205Gly Trp Thr Ile Ala Leu Pro Gly Arg Ala Leu Ala Gln Asn Gly Ser210 215 220Leu Gly Glu Gly Ile His Glu Pro Gly Gly Pro Arg Arg Gly Asn Ser225 230 235 240Thr Asn Arg Arg Val Arg Leu Lys Asn Pro Phe Tyr Pro Leu Thr Gln245 250 255Glu Ser Tyr Gly Ala Tyr Ala Val Met Cys Leu Ser Val Val Ile Phe260 265 270Gly Thr Gly Ile Ile Gly Asn Leu Ala Val Met Cys Ile Val Cys His275 280 285Asn Tyr Tyr Met Arg Ser Ile Ser Asn Ser Leu Leu Ala Asn Leu Ala290 295 300Phe Trp Asp Phe Leu Ile Ile Phe Phe Cys Leu Pro Leu Val Ile Phe305 310 315 320His Glu Leu Thr Lys Lys Trp Leu Leu Glu Asp Phe Ser Cys Lys Ile325 330 335Val Pro Tyr Ile Glu Val Ala Ser Leu Gly Val Thr Thr Phe Thr Leu340 345 350Cys Ala Leu Cys Ile Asp Arg Phe Arg Ala Ala Thr Asn Val Gln Met355 360 365Tyr Tyr Glu Met Ile Glu Asn Cys Ser Ser Thr Thr Ala Lys Leu Ala370 375 380Val Ile Trp Val Gly Ala Leu Leu Leu Ala Leu Pro Glu Val Val Leu385 390 395 400Arg Gln Leu Ser Lys Glu Asp Leu Gly Phe Ser Gly Arg Ala Pro Ala405 410 415Glu Arg Cys Ile Ile Lys Ile Ser Pro Asp Leu Pro Asp Thr Ile Tyr420 425 430Val Leu Ala Leu Thr Tyr Asp Ser Ala Arg Leu Trp Trp Tyr Phe Gly435 440 445Cys Tyr Phe Cys Leu Pro Thr Leu Phe Thr Ile Thr Cys Ser Leu Val
450 455 460Thr Ala Arg Lys Ile Arg Lys Ala Glu Lys Ala Cys Thr Arg Gly Asn465 470 475 480Lys Arg Gln Ile Gln Leu Glu Ser Gln Met Lys Cys Thr Val Val Ala485 490 495Leu Thr Ile Leu Tyr Gly Phe Cys Ile Ile Pro Glu Asn Ile Cys Asn500 505 510Ile Val Thr Ala Tyr Met Ala Thr Gly Val Ser Gln Gln Thr Met Asp515 520 525Leu Leu Asn Ile Ile Ser Gln Phe Leu Leu Phe Phe Lys Ser Cys Val530 535 540Thr Pro Val Leu Leu Phe Cys Leu Cys Lys Pro Phe Ser Arg Ala Phe545 550 555 560Met Glu Cys Cys Cys Cys Cys Cys Glu Glu Cys Ile Gln Lys Ser Ser565 570 575Thr Val Thr Ser Asp Asp Asn Asp Asn Glu Tyr Thr Thr Glu Leu Glu580 585 590Leu Ser Pro Phe Ser Thr Ile Arg Arg Glu Met Ser Thr Phe Ala Ser595 600 605Val Gly Thr His Cys610(214)SEQ ID NO213的资料(i)序列特征(A)长度1248个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO213的序列描述ATGGTTTTTG CTCACAGAAT GGATAACAGC AAGCCACATT TGATTATTCC TACACTTCTG 60GTGCCCCTCC AAAACCGCAG CTGCACTGAA ACAGCCACAC CTCTGCCAAG CCAATACCTG 120ATGGAATTAA GTGAGGAGCA CAGTTGGATG AGCAACCAAA CAGACCTTCA CTATGTGCTG 180AAACCCGGGG AAGTGGCCAC AGCCAGCATC TTCTTTGGGA TTCTGTGGTT GTTTTCTATC 240TTCGGCAATT CCCTGGTTTG TTTGGTCATC CATAGGAGTA GGAGGACTCA GTCTACCACC 300AACTACTTTG TGGTCTCCAT GGCATGTGCT GACCTTCTCA TCAGCGTTGC CAGCACGCCT 360TTCGTCCTGC TCCAGTTCAC CACTGGAAGG TGGACGCTGG GTAGTGCAAC GTGCAAGGTT 420GTGCGATATT TTCAATATCT CACTCCAGGT GTCCAGATCT ACGTTCTCCT CTCCATCTGC 480ATAGACCGGT TCTACACCAT CGTCTATCCT CTGAGCTTCA AGGTGTCCAG AGAAAAAGCC 540AAGAAAATGA TTGCGGCATC GTGGATCTTT GATGCAGGCT TTGTGACCCC TGTGCTCTTT 600TTCTATGGCT CCAACTGGGA CAGTCATTGT AACTATTTCC TCCCCTCCTC TTGGGAAGGC 660ACTGCCTACA CTGTCATCCA CTTCTTGGTG GGCTTTGTGA TTCCATCTGT CCTCATAATT 720TTATTTTACC AAAAGGTCAT AAAATATATT TGGAGAATAG GCACAGATGG CCGAACGGTG 780AGGAGGACAA TGAACATTGT CCCTCGGACA AAAGTGAAAA CTAAAAAGAT GTTCCTCATT 840TTAAATCTGT TGTTTTTGCT CTCCTGGCTG CCTTTTCATG TAGCTCAGCT ATGGCACCCC 900CATGAACAAG ACTATAAGAA AAGTTCCCTT GTTTTCACAG CTATCACATG GATATCCTTT 960AGTTCTTCAG CCTCTAAACC TACTCTGTAT TCAATTTATA ATGCCAATTT TCGGAGAGGG 1020ATGAAAGAGA CTTTTTGCAT GTCCTCTATG AAATGTTACC GAAGCAATGC CTATACTATC 1080ACAACAAGTT CAAGGATGGC CAAAAAAAAC TACGTTGGCA TTTCAGAAAT CCCTTCCATG 1140GCCAAAACTA TTACCAAAGA CTCGATCTAT GACTCATTTG ACAGAGAAGC CAAGGAAAAA 1200AAGCTTGCTT GGCCCATTAA CTCAAATCCA CCAAATACTT TTGTCTAA1248(215)SEQ ID NO214的资料(i)序列特征(A)长度415个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO214的序列描述Met Val Phe Ala His Arg Met Asp Asn Ser Lys Pro His Leu Ile Ile1 5 10 15Pro Thr Leu Leu Val Pro Leu Gln Asn Arg Ser Cys Thr Glu Thr Ala20 25 30Thr Pro Leu Pro Ser Gln Tyr Leu Met Glu Leu Ser Glu Glu His Ser35 40 45Trp Met Ser Asn Gln Thr Asp Leu His Tyr Val Leu Lys Pro Gly Glu50 55 60Val Ala Thr Ala Ser Ile Phe Phe Gly Ile Leu Trp Leu Phe Ser Ile65 70 75 80Phe Gly Asn Ser Leu Val Cys Leu Val Ile His Arg Ser Arg Arg Thr85 90 95Gln Ser Thr Thr Ash Tyr Phe Val Val Ser Met Ala Cys Ala Asp Leu100 105 110Leu Ile Ser Val Ala Ser Thr Pro Phe Val Leu Leu Gln Phe Thr Thr115 120 125Gly Arg Trp Thr Leu Gly Ser Ala Thr Cys Lys Val Val Arg Tyr Phe130 135 140Gln Tyr Leu Thr Pro Gly Val Gln Ile Tyr Val Leu Leu Ser Ile Cys145 150 155 160Ile Asp Arg Phe Tyr Thr Ile Val Tyr Pro Leu Ser Phe Lys Val Ser165 170 175Arg Glu Lys Ala Lys Lys Met Ile Ala Ala Ser Trp Ile Phe Asp Ala180 185 190Gly Phe Val Thr Pro Val Leu Phe Phe Tyr Gly Ser Asn Trp Asp Ser195 200 205His Cys Asn Tyr Phe Leu Pro Ser Ser Trp Glu Gly Thr Ala Tyr Thr210 215 220Val Ile His Phe Leu Val Gly Phe Val Ile Pro Ser Val Leu Ile Ile225 230 235 240Leu Phe Tyr Gln Lys Val Ile Lys Tyr Ile Trp Arg Ile Gly Thr Asp245 250 255Gly Arg Thr Val Arg Arg Thr Met Asn Ile Val Pro Arg Thr Lys Val260 265 270Lys Thr Lys Lys Met Phe Leu Ile Leu Asn Leu Leu Phe Leu Leu Ser275 280 285Trp Leu Pro Phe His Val Ala Gln Leu Trp His Pro His Glu Gln Asp290 295 300Tyr Lys Lys Ser Ser Leu Val Phe Thr Ala Ile Thr Trp Ile Ser Phe305 310 315 320Ser Ser Ser Ala Ser Lys Pro Thr Leu Tyr Ser Ile Tyr Asn Ala Asn325 330 335Phe Arg Arg Gly Met Lys Glu Thr Phe Cys Met Ser Ser Met Lys Cys340 345 350Tyr Arg Ser Asn Ala Tyr Thr Ile Thr Thr Ser Ser Arg Met Ala Lys355 360 365Lys Asn Tyr Val Gly Ile Ser Glu Ile Pro Ser Met Ala Lys Thr Ile370 375 380Thr Lys Asp Ser Ile Tyr Asp Ser Phe Asp Arg Glu Ala Lys Glu Lys385 390 395 400Lys Leu Ala Trp Pro Ile Asn Ser Asn Pro Pro Asn Thr Phe Val405 410 415(216) SEQ ID NO215的资料(i)序列特征(A)长度1842个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO215的序列描述ATGGGGCCCA CCCTAGCGGT TCCCACCCCC TATGGCTGTA TTGGCTGTAA GCTACCCCAG 60CCAGAATACC CACCGGCTCT AATCATCTTT ATGTTCTGCG CGATGGTTAT CACCATCGTT 120GTAGACCTAA TCGGCAACTC CATGGTCATT TTGGCTGTGA CGAAGAACAA GAAGCTCCGG 180AATTCTGGCA ACATCTTCGT GGTCAGTCTC TCTGTGGCCG ATATGCTGGT GGCCATCTAC 240CCATACCCTT TGATGCTGCA TGCCATGTCC ATTGGGGGCT GGGATCTGAG CCAGTTACAG 300TGCCAGATGG TCGGGTTCAT CACAGGGCTG AGTGTGGTCG GCTCCATCTT CAACATCGTG 360GCAATCGCTA TCAACCGTTA CTGCTACATC TGCCACAGCC TCCAGTACGA ACGGATCTTC 420AGTGTGCGCA ATACCTGCAT CTACCTGGTC ATCACCTGGA TCATGACCGT CCTGGCTGTC 480CTGCCCAACA TGTACATTGG CACCATCGAG TACGATCCTC GCACCTACAC CTGCATCTTC 540AACTATCTGA ACAACCCTGT CTTCACTGTT ACCATCGTCT GCATCCACTT CGTCCTCCCT 600CTCCTCATCG TGGGTTTCTG CTACGTGAGG ATCTGGACCA AAGTGCTGGC GGCCCGTGAC 660CCTGCAGGGC AGAATCCTGA CAACCAACTT GCTGAGGTTC GCAATAAACT AACCATGTTT 720GTGATCTTCC TCCTCTTTGC AGTGTGCTGG TGCCCTATCA ACGTGCTCAC TGTCTTGGTG 780GCTGTCAGTC CGAAGGAGAT GGCAGGCAAG ATCCCCAACT GGCTTTATCT TGCAGCCTAC 840TTCATAGCCT ACTTCAACAG CTGCCTCAAC GCTGTGATCT ACGGGCTCCT CAATGAGAAT 900TTCCGAAGAG AATACTGGAC CATCTTCCAT GCTATGCGGC ACCCTATCAT ATTCTTCTCT 960GGCCTCATCA GTGATATTCG TGAGATGCAG GAGGCCCGTA CCCTGGCCCG CGCCCGTGCC 1020CATGCTCGCG ACCAAGCTCG TGAACAAGAC CGTGCCCATG CCTGTCCTGC TGTGGAGGAA 1080ACCCCGATGA ATGTCCGGAA TGTTCCATTA CCTGGTGATG CTGCAGCTGG CCACCCCGAC 1140CGTGCCTCTG GCCACCCTAA GCCCCATTCC AGATCCTCCT CTGCCTATCG CAAATCTGCC 1200TCTACCCACC ACAAGTCTGT CTTTAGCCAC TCCAAGGCTG CCTCTGGTCA CCTCAAGCCT 1260GTCTCTGGCC ACTCCAAGCC TGCCTCTGGT CACCCCAAGT CTGCCACTGT CTACCCTAAG 1320CCTGCCTCTG TCCATTTCAA GGCTGACTCT GTCCATTTCA AGGGTGACTC TGTCCATTTC 1380AAGCCTGACT CTGTTCATTT CAAGCCTGCT TCCAGCAACC CCAAGCCCAT CACTGGCCAC 1440CATGTCTCTG CTGGCAGCCA CTCCAAGTCT GCCTTCAATG CTGCCACCAG CCACCCTAAA 1500CCCATCAAGC CAGCTACCAG CCATGCTGAG CCCACCACTG CTGACTATCC CAAGCCTGCC 1560ACTACCAGCC ACCCTAAGCC CGCTGCTGCT GACAACCCTG AGCTCTCTGC CTCCCATTGC 1620CCCGAGATCC CTGCCATTGC CCACCCTGTG TCTGACGACA GTGACCTCCC TGAGTCGGCC 1680TCTAGCCCTG CCGCTGGGCC CACCAAGCCT GCTGCCAGCC AGCTGGAGTC TGACACCATC 1740GCTGACCTTC CTGACCCTAC TGTAGTCACT ACCAGTACCA ATGATTACCA TGATGTCGTG 1800GTTGTTGATG TTGAAGATGA TCCTGATGAA ATGGCTGTGT GA 1842(217)SEQ ID NO216的资料(i)序列特征(A)长度613个氨基酸(B)类型氨基酸
(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO216的序列描述Met Gly Pro Thr Leu Ala Val Pro Thr Pro Tyr Gly Cys Ile Gly Cys1 5 10 15Lys Leu Pro Gln Pro Glu Tyr Pro Pro Ala Leu Ile Ile Phe Met Phe20 25 30Cys Ala Met Val Ile Thr Ile Val Val Asp Leu Ile Gly Asn Ser Met35 40 45Val Ile Leu Ala Val Thr Lys Asn Lys Lys Leu Arg Asn Ser Gly Asn50 55 60Ile Phe Val Val Ser Leu Ser Val Ala Asp Met Leu Val Ala Ile Tyr65 70 75 80Pro Tyr Pro Leu Met Leu His Ala Met Ser Ile Gly Gly Trp Asp Leu85 90 95Ser Gln Leu Gln Cys Gln Met Val Gly Phe Ile Thr Gly Leu Ser Val100 105 110Val Gly Ser Ile Phe Asn Ile Val Ala Ile Ala Ile Asn Arg Tyr Cys115 120 125Tyr Ile Cys His Ser Leu Gln Tyr Glu Arg Ile Phe Ser Val Arg Asn130 135 140Thr Cys Ile Tyr Leu Val Ile Thr Trp Ile Met Thr Val Leu Ala Val145 150 155 160Leu Pro Asn Met Tyr Ile Gly Thr Ile Glu Tyr Asp Pro Arg Thr Tyr165 170 175Thr Cys Ile Phe Asn Tyr Leu Asn Asn Pro Val Phe Thr Val Thr Ile180 185 190Val Cys Ile His Phe Val Leu Pro Leu Leu Ile Val Gly Phe Cys Tyr195 200 205Val Arg Ile Trp Thr Lys Val Leu Ala Ala Arg Asp Pro Ala Gly Gln210 215 220Asn Pro Asp Asn Gln Leu Ala Glu Val Arg Asn Lys Leu Thr Met Phe225 230 235 240Val Ile Phe Leu Leu Phe Ala Val Cys Trp Cys Pro Ile Asn Val Leu
245 250 255Thr Val Leu Val Ala Val Ser Pro Lys Glu Met Ala Gly Lys Ile Pro260 265 270Asn Trp Leu Tyr Leu Ala Ala Tyr Phe Ile Ala Tyr Phe Asn Ser Cys275 280 285Leu Asn Ala Val Ile Tyr Gly Leu Leu Asn Glu Asn Phe Arg Arg Glu290 295 300Tyr Trp Thr Ile Phe His Ala Met Arg His Pro Ile Ile Phe Phe Ser305 310 315 320Gly Leu Ile Ser Asp Ile Arg Glu Met Gln Glu Ala Arg Thr Leu Ala325 330 335Arg Ala Arg Ala His Ala Arg Asp Gln Ala Arg Glu Gln Asp Arg Ala340 345 350His Ala Cys Pro Ala Val Glu Glu Thr Pro Met Asn Val Arg Asn Val355 360 365Pro Leu Pro Gly Asp Ala Ala Ala Gly His Pro Asp Arg Ala Ser Gly370 375 380His Pro Lys Pro His Ser Arg Ser Ser Ser Ala Tyr Arg Lys Ser Ala385 390 395 400Ser Thr His His Lys Ser Val Phe Ser His Ser Lys Ala Ala Ser Gly405 410 415His Leu Lys Pro Val Ser Gly His Ser Lys Pro Ala Ser Gly His Pro420 425 430Lys Ser Ala Thr Val Tyr Pro Lys Pro Ala Ser Val His Phe Lys Ala435 440 445Asp Ser Val His Phe Lys Gly Asp Ser Val His Phe Lys Pro Asp Ser450 455 460Val His Phe Lys Pro Ala Ser Ser Asn Pro Lys Pro Ile Thr Gly His465 470 475 480His Val Ser Ala Gly Ser His Ser Lys Ser Ala Phe Asn Ala Ala Thr485 490 495Ser His Pro Lys Pro Ile Lys Pro Ala Thr Ser His Ala Glu Pro Thr500 505 510Thr Ala Asp Tyr Pro Lys Pro Ala Thr Thr Ser His Pro Lys Pro Ala
515 520 525Ala Ala Asp Asn Pro Glu Leu Ser Ala Ser His Cys Pro Glu Ile Pro530 535 540Ala Ile Ala His Pro Val Ser Asp Asp Ser Asp Leu Pro Glu Ser Ala545 550 555 560Ser Ser Pro Ala Ala Gly Pro Thr Lys Pro Ala Ala Ser Gln Leu Glu565 570 575Ser Asp Thr Ile Ala Asp Leu Pro Asp Pro Thr Val Val Thr Thr Ser580 585 590Thr Asn Asp Tyr His Asp Val Val Val Val Asp Val Glu Asp Asp Pro595 600 605Asp Glu Met Ala Val610(218)SEQ ID NO217的资料(i)序列特征(A)长度1854个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组)(xi)SEQ ID NO217的序列描述ATGGGGCCCA CCCTAGCGGT TCCCACCCCC TATGGCTGTA TTGGCTGTAA GCTACCCCAG 60CCAGAATACC CACCGGCTCT AATCATCTTT ATGTTCTGCG CGATGGTTAT CACCATCGTT 120GTAGACCTAA TCGGCAACTC CATGGTCATT TTGGCTGTGA CGAAGAACAA GAAGCTCCGG 180AATTCTGGCA ACATCTTCGT GGTCAGTCTC TCTGTGGCCG ATATGCTGGT GGCCATCTAC 240CCATACCCTT TGATGCTGCA TGCCATGTCC ATTGGGGGCT GGGATCTGAG CCAGTTACAG 300TGCCAGATGG TCGGGTTCAT CACAGGGCTG AGTGTGGTCG GCTCCATCTT CAACATCGTG 360GCAATCGCTA TCAACCGTTA CTGCTACATC TGCCACAGCC TCCAGTACGA ACGGATCTTC 420AGTGTGCGCA ATACCTGCAT CTACCTGGTC ATCACCTGGA TCATGACCGT CCTGGCTGTC 480CTGCCCAACA TGTACATTGG CACCATCGAG TACGATCCTC GCACCTACAC CTGCATCTTC 540AACTATCTGA ACAACCCTGT CTTCACTGTT ACCATCGTCT GCATCCACTT CGTCCTCCCT 600CTCCTCATCG TGGGTTTCTG CTACGTGAGG ATCTGGACCA AAGTGCTGGC GGCCCGTGAC 660CCTGCAGGGC AGAATCCTGA CAACCAACTT GCTGAGGTTC GCAATAAACT AACCATGTTT 720GTGATCTTCC TCCTCTTTGC AGTGTGCTGG TGCCCTATCA ACGTGCTCAC TGTCTTGGTG 780GCTGTCAGTC CGAAGGAGAT GGCAGGCAAG ATCCCCAACT GGCTTTATCT TGCAGCCTAC 840TTCATAGCCT ACTTCAACAG CTGCCTCAAC GCTGTGATCT ACGGGCTCCT CAATGAGAAT 900TTCCGAAGAG AATACTGGAC CATCTTCCAT GCTATGCGGC ACCCTATCAT ATTCTTCTCT 960GGCCTCATCA GTGATATTCG TGAGATGCAG GAGGCCCGTA CCCTGGCCCG CGCCCGTGCC 1020CATGCTCGCG ACCAAGCTCG TGAACAAGAC CGTGCCCATG CCTGTCCTGC TGTGGAGGAA 1080ACCCCGATGA ATGTCCGGAA TGTTCCATTA CCTGGTGATG CTGCAGCTGG CCACCCCGAC 1140CGTGCCTCTG GCCACCCTAA GCCCCATTCC AGATCCTCCT CTGCCTATCG CAAATCTGCC 1200TCTACCCACC ACAAGTCTGT CTTTAGCCAC TCCAAGGCTG CCTCTGGTCA CCTCAAGCCT 1260GTCTCTGGCC ACTCCAAGCC TGCCTCTGGT CACCCCAAGT CTGCCACTGT CTACCCTAAG 1320CCTGCCTCTG TCCATTTCAA GGCTGACTCT GTCCATTTCA AGGGTGACTC TGTCCATTTC 1380AAGCCTGACT CTGTTCATTT CAAGCCTGCT TCCAGCAACC CCAAGCCCAT CACTGGCCAC 1440CATGTCTCTG CTGGCAGCCA CTCCAAGTCT GCCTTCAGTG CTGCCACCAG CCACCCTAAA 1500CCCACCACTG GCCACATCAA GCCAGCTACC AGCCATGCTG AGCCCACCAC TGCTGACTAT 1560CCCAAGCCTG CCACTACCAG CCACCCTAAG CCCACTGCTG CTGACAACCC TGAGCTCTCT 1620GCCTCCCATT GCCCCGAGAT CCCTGCCATT GCCCACCCTG TGTCTGACGA CAGTGACCTC 1680CCTGAGTCGG CCTCTAGCCC TGCCGCTGGG CCCACCAAGC CTGCTGCCAG CCAGCTGGAG 1740TCTGACACCA TCGCTGACCT TCCTGACCCT ACTGTAGTCA CTACCAGTAC CAATGATTAC 1800CATGATGTCG TGGTTGTTGA TGTTGAAGAT GATCCTGATG AAATGGCTGT GTGA1854(219)SEQ ID NO218的资料(i)序列特征(A)长度617个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi) SEQ ID NO218的序列描述Met Gly Pro Thr Leu Ala Val Pro Thr Pro Tyr Gly Cys Ile Gly Cys1 5 10 15Lys Leu Pro Gln Pro Glu Tyr Pro Pro Ala Leu Ile Ile Phe Met Phe20 25 30Cys Ala Met Val Ile Thr Ile Val Val Asp Leu Ile Gly Asn Ser Met35 40 45Val Ile Leu Ala Val Thr Lys Asn Lys Lys Leu Arg Asn Ser Gly Asn50 55 60Ile Phe Val Val Ser Leu Ser Val Ala Asp Met Leu Val Ala Ile Tyr65 70 75 80Pro Tyr Pro Leu Met Leu His Ala Met Ser Ile Gly Gly Trp Asp Leu85 90 95Ser Gln Leu Gln Cys Gln Met Val Gly Phe Ile Thr Gly Leu Ser Val100 105 110Val Gly Ser Ile Phe Asn Ile Val Ala Ile Ala Ile Asn Arg Tyr Cys115 120 125Tyr Ile Cys His Ser Leu Gln Tyr Glu Arg Ile Phe Ser Val Arg Asn130 135 140Thr Cys Ile Tyr Leu Val Ile Thr Trp Ile Met Thr Val Leu Ala Val145 150 155 160Leu Pro Asn Met Tyr Ile Gly Thr Ile Glu Tyr Asp Pro Arg Thr Tyr165 170 175Thr Cys Ile Phe Asn Tyr Leu Asn Asn Pro Val Phe Thr Val Thr Ile180 185 190Val Cys Ile His Phe Val Leu Pro Leu Leu Ile Val Gly Phe Cys Tyr195 200 205Val Arg Ile Trp Thr Lys Val Leu Ala Ala Arg Asp Pro Ala Gly Gln210 215 220Asn Pro Asp Asn Gln Leu Ala Glu Val Arg Asn Lys Leu Thr Met Phe225 230 235 240Val Ile Phe Leu Leu Phe Ala Val Cys Trp Cys Pro Ile Asn Val Leu245 250 255Thr Val Leu Val Ala Val Ser Pro Lys Glu Met Ala Gly Lys Ile Pro260 265 270Asn Trp Leu Tyr Leu Ala Ala Tyr Phe Ile Ala Tyr Phe Asn Ser Cys275 280 285Leu Asn Ala Val Ile Tyr Gly Leu Leu Asn Glu Asn Phe Arg Arg Glu290 295 300Tyr Trp Thr Ile Phe His Ala Met Arg His Pro Ile Ile Phe Phe Ser305 310 315 320Gly Leu Ile Ser Asp Ile Arg Glu Met Gln Glu Ala Arg Thr Leu Ala325 330 335Arg Ala Arg Ala His Ala Arg Asp Gln Ala Arg Glu Gln Asp Arg Ala340 345 350His Ala Cys Pro Ala Val Glu Glu Thr Pro Met Asn Val Arg Asn Val355 360 365Pro Leu Pro Gly Asp Ala Ala Ala Gly His Pro Asp Arg Ala Ser Gly370 375 380His Pro Lys Pro His Ser Arg Ser Ser Ser Ala Tyr Arg Lys Ser Ala385 390 395 400Ser Thr His His Lys Ser Val Phe Ser His Ser Lys Ala Ala Ser Gly405 410 415His Leu Lys Pro Val Ser Gly His Ser Lys Pro Ala Ser Gly His Pro
420 425 430Lys Ser Ala Thr Val Tyr Pro Lys Pro Ala Ser Val His Phe Lys Ala435 440 445Asp Ser Val His Phe Lys Gly Asp Ser Val His Phe Lys Pro Asp Ser450 455 460Val His Phe Lys Pro Ala Ser Ser Asn Pro Lys Pro Ile Thr Gly His465 470 475 480His Val Ser Ala Gly Ser His Ser Lys Ser Ala Phe Ser Ala Ala Thr485 490 495Ser His Pro Lys Pro Thr Thr Gly His Ile Lys Pro Ala Thr Ser His500 505 510Ala Glu Pro Thr Thr Ala Asp Tyr Pro Lys Pro Ala Thr Thr Ser His515 520 525Pro Lys Pro Thr Ala Ala Asp Asn Pro Glu Leu Ser Ala Ser His Cys530 535 540Pro Glu Ile Pro Ala Ile Ala His Pro Val Ser Asp Asp Ser Asp Leu545 550 555 560Pro Glu Ser Ala Ser Ser Pro Ala Ala Gly Pro Thr Lys Pro Ala Ala565 570 575Ser Gln Leu Glu Ser Asp Thr Ile Ala Asp Leu Pro Asp Pro Thr Val580 585 590Val Thr Thr Ser Thr Asn Asp Tyr His Asp Val Val Val Val Asp Val595 600 605Glu Asp Asp Pro Asp Glu Met Ala Val610 615(220)SEQ ID NO219的资料(i)序列特征(A)长度1548个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组)(xi)SEQ ID NO219的序列描述ATGGGACATA ACGGGAGCTG GATCTCTCCA AATGCCAGCG AGCCGCACAA CGCGTCCGGC 60GCCGAGGCTG CGGGTGTGAA CCGCAGCGCG CTCGGGGAGT TCGGCGAGGC GCAGCTGTAC 120CGCCAGTTCA CCACCACCGT GCAGGTCGTC ATCTTCATAG GCTCGCTGCT CGGAAACTTC 180ATGGTGTTAT GGTCAACTTG CCGCACAACC GTGTTCAAAT CTGTCACCAA CAGGTTCATT 240AAAAACCTGG CCTGCTCGGG GATTTGTGCC AGCCTGGTCT GTGTGCCCTT CGACATCATC 300CTCAGCACCA GTCCTCACTG TTGCTGGTGG ATCTACACCA TGCTCTTCTG CAAGGTCGTC 360AAATTTTTGC ACAAAGTATT CTGCTCTGTG ACCATCCTCA GCTTCCCTGC TATTGCTTTG 420GACAGGTACT ACTCAGTCCT CTATCCACTG GAGAGGAAAA TATCTGATGC CAAGTCCCGT 480GAACTGGTGA TGTACATCTG GGCCCATGCA GTGGTGGCCA GTGTCCCTGT GTTTGCAGTA 540ACCAATGTGG CTGACATCTA TGCCACGTCC ACCTGCACGG AAGTCTGGAG CAACTCCTTG 600GGCCACCTGG TGTACGTTCT GGTGTATAAC ATCACCACGG TCATTGTGCC TGTGGTGGTG 660GTGTTCCTCT TCTTGATACT GATCCGACGG GCCCTGAGTG CCAGCCAGAA GAAGAAGGTC 720ATCATAGCAG CGCTCCGGAC CCCACAGAAC ACCATCTCTA TTCCCTATGC CTCCCAGCGG 780GAGGCCGAGC TGAAAGCCAC CCTGCTCTCC ATGGTGATGG TCTTCATCTT GTGTAGCGTG 840CCCTATGCCA CCCTGGTCGT CTACCAGACT GTGCTCAATG TCCCTGACAC TTCCGTCTTC 900TTGCTGCTCA CTGCTGTTTG GCTGCCCAAA GTCTCCCTGC TGGCAAACCC TGTTCTCTTT 960CTTACTGTGA ACAAATCTGT CCGCAAGTGC TTGATAGGGA CCCTGGTGCA ACTACACCAC 1020CGGTACAGTC GCCGTAATGT GGTCAGTACA GGGAGTGGCA TGGCTGAGGC CAGCCTGGAA 1080CCCAGCATAC GCTCGGGTAG CCAGCTCCTG GAGATGTTCC ACATTGGGCA GCAGCAGATC 1140TTTAAGCCCA CAGAGGATGA GGAAGAGAGT GAGGCCAAGT ACATTGGCTC AGCTGACTTC 1200CAGGCCAAGG AGATATTTAG CACCTGCCTG GAGGGAGAGC AGGGGCCACA GTTTGCGCCC 1260TCTGCCCCAC CCCTGAGCAC AGTGGACTCT GTATCCCAGG TGGCACCGGC AGCCCCTGTG 1320GAACCTGAAA CATTCCCTGA TAAGTATTCC CTGCAGTTTG GCTTTGGGCC TTTTGAGTTG 1380CCTCCTCAGT GGCTCTCAGA GACCCGAAAC AGCAAGAAGC GGCTGCTTCC CCCCTTGGGC 1440AACACCCCAG AAGAGCTGAT CCAGACAAAG GTGCCCAAGG TAGGCAGGGT GGAGCGGAAG 1500ATGAGCAGAA ACAATAAAGT GAGCATTTTT CCAAAGGTGG ATTCCTAG1548(221)SEQ ID NO220的资料(i)序列特征(A)长度515个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO220的序列描述Met Gly His Asn Gly Ser Trp Ile Ser Pro Asn Ala Ser Glu Pro His1 5 10 15Asn Ala Ser Gly Ala Glu Ala Ala Gly Val Asn Arg Ser Ala Leu Gly20 25 30Glu Phe Gly Glu Ala Gln Leu Tyr Arg Gln Phe Thr Thr Thr Val Gln35 40 45Val Val Ile Phe Ile Gly Ser Leu Leu Gly Asn Phe Met Val Leu Trp50 55 60Ser Thr Cys Arg Thr Thr Val Phe Lys Ser Val Thr Asn Arg Phe Ile65 70 75 80Lys Asn Leu Ala Cys Ser Gly Ile Cys Ala Ser Leu Val Cys Val Pro
85 90 95Phe Asp Ile Ile Leu Ser Thr Ser Pro His Cys Cys Trp Trp Ile Tyr100 105 110Thr Met Leu Phe Cys Lys Val Val Lys Phe Leu His Lys Val Phe Cys115 120 125Ser Val Thr Ile Leu Ser Phe Pro Ala Ile Ala Leu Asp Arg Tyr Tyr130 135 140Ser Val Leu Tyr Pro Leu Glu Arg Lys Ile Ser Asp Ala Lys Ser Arg145 150 155 160Glu Leu Val Met Tyr Ile Trp Ala His Ala Val Val Ala Ser Val Pro165 170 175Val Phe Ala Val Thr Asn Val Ala Asp Ile Tyr Ala Thr Ser Thr Cys180 185 190Thr Glu Val Trp Ser Asn Ser Leu Gly His Leu Val Tyr Val Leu Val195 200 205Tyr Asn Ile Thr Thr Val Ile Val Pro Val Val Val Val Phe Leu Phe210 215 220Leu Ile Leu Ile Arg Arg Ala Leu Ser Ala Ser Gln Lys Lys Lys Val225 230 235 240Ile Ile Ala Ala Leu Arg Thr Pro Gln Asn Thr Ile Ser Ile Pro Tyr245 250 255Ala Ser Gln Arg Glu Ala Gh Leu Lys Ala Thr Leu Leu Ser Met Val260265 270Met Val Phe Ile Leu Cys Ser Val Pro Tyr Ala Thr Leu Val Val Tyr275 280 285Gln Thr Val Leu Asn Val Pro Asp Thr Ser Val Phe Leu Leu Leu Thr290 295 300Ala Val Trp Leu Pro Lys Val Ser Leu Leu Ala Asn Pro Val Leu Phe305 310 315 320Leu Thr Val Asn Lys Ser Val Arg Lys Cys Leu Ile Gly Thr Leu Val325 330 335Gln Leu His His Arg Tyr Ser Arg Arg Asn Val Val Ser Thr Gly Ser340 345 350Gly Met Ala Glu Ala Ser Leu Glu Pro Ser Ile Arg Ser Gly Ser Gln
355 360 365Leu Leu Glu Met Phe His Ile Gly Gln Gln Gln Ile Phe Lys Pro Thr370 375 380Glu Asp Glu Glu Glu Ser Glu Ala Lys Tyr Ile Gly Ser Ala Asp Phe385 390 395 400Gln Ala Lys Glu Ile Phe Ser Thr Cys Leu Glu Gly Glu Gln Gly Pro405 410 415Gln Phe Ala Pro Ser Ala Pro Pro Leu Ser Thr Val Asp Ser Val Ser420 425 430Gln Val Ala Pro Ala Ala Pro Val Glu Pro Glu Thr Phe Pro Asp Lys435 440 445Tyr Ser Leu Gln Phe Gly Phe Gly Pro Phe Glu Leu Pro Pro Gln Trp450 455 460Leu Ser Glu Thr Arg Asn Ser Lys Lys Arg Leu Leu Pro Pro Leu Gly465 470 475 480Asn Thr Pro Glu Glu Leu Ile Gln Thr Lys Val Pro Lys Val Gly Arg485 490 495Val Glu Arg Lys Met SerArg Asn Asn Lys Val Ser Ile Phe Pro Lys500505 510Val Asp Ser515(222)SEQ ID NO221的资料(i)序列特征(A)长度1164个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO221的序列描述ATGAATCGGC ACCATCTGCA GGATCACTTT CTGGAAATAG ACAAGAAGAA CTGCTGTGTG 60TTCCGAGATG ACTTCATTGC CAAGGTGTTG CCGCCGGTGT TGGGGCTGGA GTTTATCTTT 120GGGCTTCTGG GCAATGGCCT TGCCCTGTGG ATTTTCTGTT TCCACCTCAA GTCCTGGAAA 180TCCAGCCGGA TTTTCCTGTT CAACCTGGCA GTAGCTGACT TTCTACTGAT CATCTGCCTG 240CCGTTCGTGA TGGACTACTA TGTGCGGCGT TCAGACTGGA AGTTTGGGGA CATCCCTTGC 300CGGCTGGTGC TCTTCATGTT TGCCATGAAC CGCCAGGGCA GCATCATCTT CCTCACGGTG 360GTGGCGGTAG ACAGGTATTT CCGGGTGGTC CATCCCCACC ACGCCCTGAA CAAGATCTCC 420AATTGGACAG CAGCCATCAT CTCTTGCCTT CTGTGGGGCA TCACTGTTGG CCTAACAGTC 480CACCTCCTGA AGAAGAAGTT GCTGATCCAG AATGGCCCTG CAAATGTGTG CATCAGCTTC 540AGCATCTGCC ATACCTTCCG GTGGCACGAA GCTATGTTCC TCCTGGAGTT CCTCCTGCCC 600CTGGGCATCA TCCTGTTCTG CTCAGCCAGA ATTATCTGGA GCCTGCGGCA GAGACAAATG 660GACCGGCATG CCAAGATCAA GAGAGCCAAA ACCTTCATCA TGGTGGTGGC CATCGTCTTT 720GTCATCTGCT TCCTTCCCAG CGTGGTTGTG CGGATCCGCA TCTTCTGGCT CCTGCACACT 780TCGGGCACGC AGAATTGTGA AGTGTACCGC TCGGTGGACC TGGCGTTCTT TATCACTCTC 840AGCTTCACCT ACATGAACAG CATGCTGGAC CCCGTGGTGT ACTACTTCTC CAGCCCATCC 900TTTCCCAACT TCTTCTCCAC TTTGATCAAC CGCTGCCTCC AGAGGAAGAT GACAGGTGAG 960CCAGATAATA ACCGCAGCAC GAGCGTCGAG CTCACAGGGG ACCCCAACAA AACCAGAGGC 1020GCTCCAGAGG CGTTAATGGC CAACTCCGGT GAGCCATGGA GCCCCTCTTA TCTGGGCCCA 1080ACCTCAAATA ACCATTCCAA GAAGGGACAT TGTCACCAAG AACCAGCATC TCTGGAGAAA 1140CAGTTGGGCT GTTGCATCGA GTAA1164(223)SEQ ID NO222的资料(i)序列特征(A)长度387个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO222的序列描述Met Asn Arg His His Leu Gln Asp His Phe Leu Glu Ile Asp Lys Lys1 5 10 15Asn Cys Cys Val Phe Arg Asp Asp Phe Ile Ala Lys Val Leu Pro Pro20 25 30Val Leu Gly Leu Glu Phe Ile Phe Gly Leu Leu Gly Asn Gly Leu Ala35 40 45Leu Trp Ile Phe Cys Phe His Leu Lys Ser Trp Lys Ser Ser Arg Ile50 55 60Phe Leu Phe Asn Leu Ala Val Ala Asp Phe Leu Leu Ile Ile Cys Leu65 70 75 80Pro Phe Val Met Asp Tyr Tyr Val Arg Arg Ser Asp Trp Lys Phe Gly85 90 95Asp Ile Pro Cys Arg Leu Val Leu Phe Met Phe Ala Met Asn Arg Gln100 105 110Gly Ser Ile Ile Phe Leu Thr Val Val Ala Val Asp Arg Tyr Phe Arg115 120 125Val Val His Pro His His Ala Leu Asn Lys Ile Ser Asn Trp Thr Ala130 135 140Ala Ile Ile Ser Cys Leu Leu Trp Gly Ile Thr Val Gly Leu Thr Val145 150 155 160His Leu Leu Lys Lys Lys Leu Leu Ile Gln Asn Gly Pro Ala Asn Val165 170 175Cys Ile Ser Phe Ser Ile Cys His Thr Phe Arg Trp His Glu Ala Met180 185 190Phe Leu Leu Glu Phe Leu Leu Pro Leu Gly Ile Ile Leu Phe Cys Ser195 200 205Ala Arg Ile Ile Trp Ser Leu Arg Gln Arg Gln Met Asp Arg His Ala210 215 220Lys Ile Lys Arg Ala Lys Thr Phe Ile Met Val Val Ala Ile Val Phe225 230 235 240Val Ile Cys Phe Leu Pro Ser Val Val Val Arg Ile Arg Ile Phe Trp245 250 255Leu Leu His Thr Ser Gly Thr Gln Asn Cys Glu Val Tyr Arg Ser Val260 265 270Asp Leu Ala Phe Phe Ile Thr Leu Ser Phe Thr Tyr Met Asn Ser Met275 280 285Leu Asp Pro Val Val Tyr Tyr Phe Ser Ser Pro Ser Phe Pro Asn Phe290 295 300Phe Ser Thr Leu Ile Asn Arg Cys Leu Gln Arg Lys Met Thr Gly Glu305 310 315 320Pro Asp Asn Asn Arg Ser Thr Ser Val Glu Leu Thr Gly Asp Pro Asn325 330 335Lys Thr Arg Gly Ala Pro Glu Ala Leu Met Ala Asn Ser Gly Glu Pro340 345 350Trp Ser Pro Ser Tyr Leu Gly Pro Thr Ser Asn Asn His Ser Lys Lys355 360 365Gly His Cys His Gln Glu Pro Ala Ser Leu Glu Lys Gln Leu Gly Cys370 375 380Cys Ile Glu385(224)SEQ ID NO223的资料(i)序列特征
(A) 长度1212个碱基对(B)类型核酸(C)链型单链(D)拓扑学不相关(ii)分子类型DNA(基因组的)(xi)SEQ ID NO223的序列描述ATGGCTTGCA ATGGCAGTGC GGCCAGGGGG CACTTTGACC CTGAGGACTT GAACCTGACT 60GACGAGGCAC TGAGACTCAA GTACCTGGGG CCCCAGCAGA CAGAGCTGTT CATGCCCATC 120TGTGCCACAT ACCTGCTGAT CTTCGTGGTG GGCGCTGTGG GCAATGGGCT GACCTGTCTG 180GTCATCCTGC GCCACAAGGC CATGCGCACG CCTACCAACT ACTACCTCTT CAGCCTGGCC 240GTGTCGGACC TGCTGGTGCT GCTGGTGGGC CTGCCCCTGG AGCTCTATGA GATGTGGCAC 300AACTACCCCT TCCTGCTGGG CGTTGGTGGC TGCTATTTCC GCACGCTACT GTTTGAGATG 360GTCTGCCTGG CCTCAGTGCT CAACGTCACT GCCCTGAGCG TGGAACGCTA TGTGGCCGTG 420GTGCACCCAC TCCAGGCCAG GTCCATGGTG ACGCGGGCCC ATGTGCGCCG AGTGCTTGGG 480GCCGTCTGGG GTCTTGCCAT GCTCTGCTCC CTGCCCAACA CCAGCCTGCA CGGCATCCGG 540CAGCTGCACG TGCCCTGCCG GGGCCCAGTG CCAGACTCAG CTGTTTGCAT GCTGGTCCGC 600CCACGGGCCC TCTACAACAT GGTAGTGCAG ACCACCGCGC TGCTCTTCTT CTGCCTGCCC 660ATGGCCATCA TGAGCGTGCT CTACCTGCTC ATTGGGCTGC GACTGCGGCG GGAGAGGCTG 720CTGCTCATGC AGGAGGCCAA GGGCAGGGGC TCTGCAGCAG CCAGGTCCAG ATACACCTGC 780AGGCTCCAGC AGCACGATCG GGGCCGGAGA CAAGTGAAGA AGATGCTGTT TGTCCTGGTC 840GTGGTGTTTG GCATCTGCTG GGCCCCGTTC CACGCCGACC GCGTCATGTG GAGCGTCGTG 900TCACAGTGGA CAGATGGCCT GCACCTGGCC TTCCAGCACG TGCACGTCAT CTCCGGCATC 960TTCTTCTACC TGGGCTCGGC GGCCAACCCC GTGCTCTATA GCCTCATGTC CAGCCGCTTC 1020CGAGAGACCT TCCAGGAGGC CCTGTGCCTC GGGGCCTGCT GCCATCGCCT CAGACCCCGC 1080CACAGCTCCC ACAGCCTCAG CAGGATGACC ACAGGCAGCA CCCTGTGTGA TGTGGGCTCC 1140CTGGGCAGCT GGGTCCACCC CCTGGCTGGG AACGATGGCC CAGAGGCGCA GCAAGAGACC 1200GATCCATCCT GA 1212(225)SEQ ID NO224的资料(i)序列特征(A)长度403个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi) SEQ ID NO224的序列描述Met Ala Cys Asn Gly Ser Ala Ala Arg Gly His Phe Asp Pro Glu Asp1 5 10 15Leu Asn Leu Thr Asp Glu Ala Leu Arg Leu Lys Tyr Leu Gly Pro Gln20 25 30Gln Thr Glu Leu Phe Met Pro Ile Cys Ala Thr Tyr Leu Leu Ile Phe35 40 45Val Val Gly Ala Val Gly Asn Gly Leu Thr Cys Leu Val Ile Leu Arg50 55 60His Lys Ala Met Arg Thr Pro Thr Asn Tyr Tyr Leu Phe Ser Leu Ala65 70 75 80Val Ser Asp Leu Leu Val Leu Leu Val Gly Leu Pro Leu Glu Leu Tyr85 90 95Glu Met Trp His Asn Tyr Pro Phe Leu Leu Gly Val Gly Gly Cys Tyr100 105 110Phe Arg Thr Leu Leu Phe Glu Met Val Cys Leu Ala Ser Val Leu Asn115 120 125Val Thr Ala Leu Ser Val Glu Arg Tyr Val Ala Val Val His Pro Leu130 135 140Gln Ala Arg Ser Met Val Thr Arg Ala His Val Arg Arg Val Leu Gly145 150 155 160Ala Val Trp Gly Leu Ala Met Leu Cys Ser Leu Pro Asn Thr Ser Leu165 170 175His Gly Ile Arg Gln Leu His Val Pro Cys Arg Gly Pro Val Pro Asp180 185 190Ser Ala Val Cys Met Leu Val Arg Pro Arg Ala Leu Tyr Asn Met Val195 200 205Val Gln Thr Thr Ala Leu Leu Phe Phe Cys Leu Pro Met Ala Ile Met210 215 220Ser Val Leu Tyr Leu Leu Ile Gly Leu Arg Leu Arg Arg Glu Arg Leu225 230 235 240Leu Leu Met Gln Glu Ala Lys Gly Arg Gly Ser Ala Ala Ala Arg Ser245 250 255Arg Tyr Thr Cys Arg Leu Gln Gln His Asp Arg Gly Arg Arg Gln Val260 265 270Lys Lys Met Leu Phe Val Leu Val Val Val Phe Gly Ile Cys Trp Ala275 280 285Pro Phe His Ala Asp Arg Val Met Trp Ser Val Val Ser Gln Trp Thr290 295 300Asp Gly Leu His Leu Ala Phe Gln His Val His Val Ile Ser Gly Ile305 310 315 320Phe Phe Tyr Leu Gly Ser Ala Ala Asn Pro Val Leu Tyr Ser Leu Met325 330 335Ser Ser Arg Phe Arg Glu Thr Phe Gln Glu Ala Leu Cys Leu Gly Ala340 345 350Cys Cys His Arg Leu Arg Pro Arg His Ser Ser His Ser Leu Ser Arg355 360 365Met Thr Thr Gly Ser Thr Leu Cys Asp Val Gly Ser Leu Gly Ser Trp370 375 380Val His Pro Leu Ala Gly Asn Asp Gly Pro Glu Ala Gln Gln Glu Thr385 390 395 400Asp Pro Ser(226)SEQ ID NO225的资料(i)序列特征(A)长度1098个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO225的序列描述ATGGGGAACA TCACTGCAGA CAACTCCTCG ATGAGCTGTA CCATCGACCA TACCATCCAC 60CAGACGCTGG CCCCGGTGGT CTATGTTACC GTGCTGGTGG TGGGCTTCCC GGCCAACTGC 120CTGTCCCTCT ACTTCGGCTA CCTGCAGATC AAGGCCCGGA ACGAGCTGGG CGTGTACCTG 180TGCAACCTGA CGGTGGCCGA CCTCTTCTAC ATCTGCTCGC TGCCCTTCTG GCTGCAGTAC 240GTGCTGCAGC ACGACAACTG GTCTCACGGC GACCTGTCCT GCCAGGTGTG CGGCATCCTC 300CTGTACGAGA ACATCTACAT CAGCGTGGGC TTCCTCTGCT GCATCTCCGT GGACCGCTAC 360CTGGCTGTGG CCCATCCCTT CCGCTTCCAC CAGTTCCGGA CCCTGAAGGC GGCCGTCGGC 420GTCAGCGTGG TCATCTGGGC CAAGGAGCTG CTGACCAGCA TCTACTTCCT GATGCACGAG 480GAGGTCATCG AGGACGAGAA CCAGCACCGC GTGTGCTTTG AGCACTACCC CATCCAGGCA 540TGGCAGCGCG CCATCAACTA CTACCGCTTC CTGGTGGGCT TCCTCTTCCC CATCTGCCTG 600CTGCTGGCGT CCTACCAGGG CATCCTGCGC GCCGTGCGCC GGAGCCACGG CACCCAGAAG 660AGCCGCAAGG ACCAGATCAA GCGGCTGGTG CTCAGCACCG TGGTCATCTT CCTGGCCTGC 720TTCCTGCCCT ACCACGTGTT GCTGCTGGTG CGCAGCGTCT GGGAGGCCAG CTGCGACTTC 780GCCAAGGGCG TTTTCAACGC CTACCACTTC TCCCTCCTGC TCACCAGCTT CAACTGCGTC 840GCCGACCCCG TGCTCTACTG CTTCGTCAGC GAGACCACCC ACCGGGACCT GGCCCGCCTC 900CGCGGGGCCT GCCTGGCCTT CCTCACCTGC TCCAGGACCG GCCGGGCCAG GGAGGCCTAC 960CCGCTGGGTG CCCCCGAGGC CTCCGGGAAA AGCGGGGCCC AGGGTGAGGA GCCCGAGCTG 1020TTGACCAAGC TCCACCCGGC CTTCCAGACC CCTAACTCGC CAGGGTCGGG CGGGTTCCCC 1080ACGGGCAGGT TGGCCTAG1098(227)SEQ ID NO226的资料(i)序列特征
(A)长度365个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO226的序列描述Met Gly Asn Ile Thr Ala Asp Asn Ser Ser Met Ser Cys Thr Ile Asp1 5 10 15His Thr Ile His Gln Thr Leu Ala Pro Val Val Tyr Val Thr Val Leu20 25 30Val Val Gly Phe Pro Ala Asn Cys Leu Ser Leu Tyr Phe Gly Tyr Leu35 40 45Gln Ile Lys Ala Arg Asn Glu Leu Gly Val Tyr Leu Cys Asn Leu Thr50 55 60Val Ala Asp Leu Phe Tyr Ile Cys Ser Leu Pro Phe Trp Leu Gln Tyr65 70 75 80Val Leu Gln His Asp Asn Trp Ser His Gly Asp Leu Ser Cys Gln Val85 90 95Cys Gly Ile Leu Leu Tyr Glu Asn Ile Tyr Ile Ser Val Gly Phe Leu100 105 110Cys Cys Ile Ser Val Asp Arg Tyr Leu Ala Val Ala His Pro Phe Arg115 120 125Phe His Gln Phe Arg Thr Leu Lys Ala Ala Val Gly Val Ser Val Val130 135 140Ile Trp Ala Lys Glu Leu Leu Thr Ser Ile Tyr Phe Leu Met His Glu145 150 155 160Glu Val Ile Glu Asp Glu Asn Gln His Arg Val Cys Phe Glu His Tyr165 170 175Pro Ile Gln Ala Trp Gln Arg Ala Ile Asn Tyr Tyr Arg Phe Leu Val180 185 190Gly Phe Leu Phe Pro Ile Cys Leu Leu Leu Ala Ser Tyr Gln Gly Ile195 200 205Leu Arg Ala Val Arg Arg Ser His Gly Thr Gln Lys Ser Arg Lys Asp210 215 220Gln Ile Lys Arg Leu Val Leu Ser Thr Val Val Ile Phe Leu Ala Cys225 230 235 240Phe Leu Pro Tyr His Val Leu Leu Leu Val Arg Ser Val Trp Glu Ala245 250 255Ser Cys Asp Phe Ala Lys Gly Val Phe Asn Ala Tyr His Phe Ser Leu260 265 270Leu Leu Thr Ser Phe Asn Cys Val Ala Asp Pro Val Leu Tyr Cys Phe275 280 285Val Ser Glu Thr Thr His Arg Asp Leu Ala Arg Leu Arg Gly Ala Cys290 295 300Leu Ala Phe Leu Thr Cys Ser Arg Thr Gly Arg Ala Arg Glu Ala Tyr305 310 315 320Pro Leu Gly Ala Pro Glu Ala Ser Gly Lys Ser Gly Ala Gln Gly Glu325 330 335Glu Pro Glu Leu Leu Thr Lys Leu His Pro Ala Phe Gln Thr Pro Asn340 345 350Ser Pro Gly Ser Gly Gly Phe Pro Thr Gly Arg Leu Ala355 360 365(228)SEQ ID NO227的资料(i)序列特征(A)长度1416个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO227的序列描述ATGGATATTC TTTGTGAAGA AAATACTTCT TTGAGCTCAA CTACGAACTC CCTAATGCAA 60TTAAATGATG ACAACAGGCT CTACAGTAAT GACTTTAACT CCGGAGAAGC TAACACTTCT 120GATGCATTTA ACTGGACAGT CGACTCTGAA AATCGAACCA ACCTTTCCTG TGAAGGGTGC 180CTCTCACCGT CGTGTCTCTC CTTACTTCAT CTCCAGGAAA AAAACTGGTC TGCTTTACTG 240ACAGCCGTAG TGATTATTCT AACTATTGCT GGAAACATAC TCGTCATCAT GGCAGTGTCC 300CTAGAGAAAA AGCTGCAGAA TGCCACCAAC TATTTCCTGA TGTCACTTGC CATAGCTGAT 360ATGCTGCTGG GTTTCCTTGT CATGCCCGTG TCCATGTTAA CCATCCTGTA TGGGTACCGG 420TGGCCTCTGC CGAGCAAGCT TTGTGCAGTC TGGATTTACC TGGACGTGCT CTTCTCCACG 480GCCTCCATCA TGCACCTCTG CGCCATCTCG CTGGACCGCT ACGTCGCCAT CCAGAATCCC 540ATCCACCACA GCCGCTTCAA CTCCAGAACT AAGGCATTTC TGAAAATCAT TGCTGTTTGG 600ACCATATCAG TAGGTATATC CATGCCAATA CCAGTCTTTG GGCTACAGGA CGATTCGAAG 660GTCTTTAAGG AGGGGAGTTG CTTACTCGCC GATGATAACT TTGTCCTGAT CGGCTCTTTT 720GTGTCATTTT TCATTCCCTT AACCATCATG GTGATCACCT ACTTTCTAAC TATCAAGTCA 780CTCCAGAAAG AAGCTACTTT GTGTGTAAGT GATCTTGGCA CACGGGCCAA ATTAGCTTCT 840TTCAGCTTCC TCCCTCAGAG TTCTTTGTCT TCAGAAAAGC TCTTCCAGCG GTCGATCCAT 900AGGGAGCCAG GGTCCTACAC AGGCAGGAGG ACTATGCAGT CCATCAGCAA TGAGCAAAAG 960GCAAAGAAGG TGCTGGGCAT CGTCTTCTTC CTGTTTGTGG TGATGTGGTG CCCTTTCTTC 1020ATCACAAACA TCATGGCCGT CATCTGCAAA GAGTCCTGCA ATGAGGATGT CATTGGGGCC 1080CTGCTCAATG TGTTTGTTTG GATCGGTTAT CTCTCTTCAG CAGTCAACCC ACTAGTCTAC 1140ACACTGTTCA ACAAGACCTA TAGGTCAGCC TTTTCACGGT ATATTCAGTG TCAGTACAAG 1200GAAAACAAAA AACCATTGCA GTTAATTTTA GTGAACACAA TACCGGCTTT GGCCTACAAG 1260TCTAGCCAAC TTCAAATGGG ACAAAAAAAG AATTCAAAGC AAGATGCCAA GACAACAGAT 1320AATGACTGCT CAATGGTTGC TCTAGGAAAG CAGTATTCTG AAGAGGCTTC TAAAGACAAT 1380AGCGACGGAG TGAATGAAAA GGTGAGCTGT GTGTGA 1416(229)SEQ ID NO228的资料(i)序列特征(A)长度470个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO228的序列描述Met Asp Ile Leu Cys Glu Glu Asn Thr Ser Leu Ser Ser Thr Thr Asn1 5 10 15Ser Leu Met Gln Leu Asn Asp Asp Asn Arg Leu Tyr Ser Asn Asp Phe20 25 30Asn Ser Gly Glu Ala Asn Thr Ser Asp Ala Phe Asn Trp Thr Val Asp35 40 45Ser Glu Asn Arg Thr Asn Leu Ser Cys Glu Gly Cys Leu Ser Pro Ser50 55 60Cys Leu Ser Leu Leu His Leu Gln Glu Lys Asn Trp Ser Ala Leu Leu65 70 75 80Thr Ala Val Val Ile Ile Leu Thr Ile Ala Gly Asn Ile Leu Val Ile85 90 95Met Ala Val Ser Leu Glu Lys Lys Leu Gln Asn Ala Thr Asn Tyr Phe100 105 110Leu Met Ser Leu Ala Ile Ala Asp Met Leu Leu Gly Phe Leu Val Met115 120 125Pro Val Ser Met Leu Thr Ile Leu Tyr Gly Tyr Arg Trp Pro Leu Pro130 135 140Ser Lys Leu Cys Ala Val Trp Ile Tyr Leu Asp Val Leu Phe Ser Thr145 150 155 160Ala Ser Ile Met His Leu Cys Ala Ile Ser Leu Asp Arg Tyr Val Ala165 170 175Ile Gln Asn Pro Ile His His Ser Arg Phe Asn Ser Arg Thr Lys Ala180 185 190Phe Leu Lys Ile Ile Ala Val Trp Thr Ile Ser Val Gly Ile Ser Met195 200 205Pro Ile Pro Val Phe Gly Leu Gln Asp Asp Ser Lys Val Phe Lys Glu210 215 220Gly Ser Cys Leu Leu Ala Asp Asp Asn Phe Val Leu Ile Gly Ser Phe225 230 235 240Val Ser Phe Phe Ile Pro Leu Thr Ile Met Val Ile Thr Tyr Phe Leu245 250 255Thr Ile Lys Ser Leu Gln Lys Glu Ala Thr Leu Cys Val Ser Asp Leu260 265 270Gly Thr Arg Ala Lys Leu Ala Ser Phe Ser Phe Leu Pro Gln Ser Ser275 280 285Leu Ser Ser Glu Lys Leu Phe Gln Arg Ser Ile His Arg Glu Pro Gly290 295 300Ser Tyr Thr Gly Arg Arg Thr Met Gln Ser Ile Ser Asn Glu Gln Lys305 310 315 320Ala Lys Lys Val Leu Gly Ile Val Phe Phe Leu Phe Val Val Met Trp325 330 335Cys Pro Phe Phe Ile Thr Asn Ile Met Ala Val Ile Cys Lys Glu Ser340 345 350Cys Asn Glu Asp Val Ile Gly Ala Leu Leu Asn Val Phe Val Trp Ile355 360 365Gly Tyr Leu Ser Ser Ala Val Asn Pro Leu Val Tyr Thr Leu Phe Asn370 375 380Lys Thr Tyr Arg Ser Ala Phe Ser Arg Tyr Ile Gln Cys Gln Tyr Lys385 390 395 400Glu Asn Lys Lys Pro Leu Gln Leu Ile Leu Val Asn Thr Ile Pro Ala405 410 415Leu Ala Tyr Lys Ser Ser Gln Leu Gln Met Gly Gln Lys Lys Asn Ser420 425 430Lys Gln Asp Ala Lys Thr Thr Asp Asn Asp Cys Ser Met Val Ala Leu435 440 445Gly Lys Gln Tyr Ser Glu Glu Ala Ser Lys Asp Asn Ser Asp Gly Val450 455 460Asn Glu Lys Val Ser Cys Val465 470(230)SEQ ID NO229的资料(i)序列特征(A)长度1377个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO229的序列描述ATGGTGAACC TGAGGAATGC GGTGCATTCA TTCCTTGTGC ACCTAATTGG CCTATTGGTT 60TGGCAATGTG ATATTTCTGT GAGCCCAGTA GCAGCTATAG TAACTGACAT TTTCAATACC 120TCCGATGGTG GACGCTTCAA ATTCCCAGAC GGGGTACAAA ACTGGCCAGC ACTTTCAATC 180GTCATCATAA TAATCATGAC AATAGGTGGC AACATCCTTG TGATCATGGC AGTAAGCATG 240GAAAAGAAAC TGCACAATGC CACCAATTAC TTCTTAATGT CCCTAGCCAT TGCTGATATG 300CTAGTGGGAC TACTTGTCAT GCCCCTGTCT CTCCTGGCAA TCCTTTATGA TTATGTCTGG 360CCACTACCTA GATATTTGTG CCCCGTCTGG ATTTCTTTAG ATGTTTTATT TTCAACAGCG 420TCCATCATGC ACCTCTGCGC TATATCGCTG GATCGGTATG TAGCAATACG TAATCCTATT 480GAGCATAGCC GTTTCAATTC GCGGACTAAG GCCATCATGA AGATTGCTAT TGTTTGGGCA 540ATTTCTATAG GTGTATCAGT TCCTATCCCT GTGATTGGAC TGAGGGACGA AGAAAAGGTG 600TTCGTGAACA ACACGACGTG CGTGCTCAAC GACCCAAATT TCGTTCTTAT TGGGTCCTTC 660GTAGCTTTCT TCATACCGCT GACGATTATG GTGATTACGT ATTGCCTGAC CATCTACGTT 720CTGCGCCGAC AAGCTTTGAT GTTACTGCAC GGCCACACCG AGGAACCGCC TGGACTAAGT 780CTGGATTTCC TGAAGTGCTG CAAGAGGAAT ACGGCCGAGG AAGAGAACTC TGCAAACCCT 840AACCAAGACC AGAACGCACG CCGAAGAAAG AAGAAGGAGA GACGTCCTAG GGGCACCATG 900CAGGCTATCA ACAATGAAAG AAAAGCTAAG AAAGTCCTTG GGATTGTTTT CTTTGTGTTT 960CTGATCATGT GGTGCCCATT TTTCATTACC AATATTCTGT CTGTTCTTTG TGAGAAGTCC 1020TGTAACCAAA AGCTCATGGA AAAGCTTCTG AATGTGTTTG TTTGGATTGG CTATGTTTGT 1080TCAGGAATCA ATCCTCTGGT GTATACTCTG TTCAACAAAA TTTACCGAAG GGCATTCTCC 1140AACTATTTGC GTTGCAATTA TAAGGTAGAG AAAAAGCCTC CTGTCAGGCA GATTCCAAGA 1200GTTGCCGCCA CTGCTTTGTC TGGGAGGGAG CTTAATGTTA ACATTTATCG GCATACCAAT 1260GAACCGGTGA TCGAGAAAGC CAGTGACAAT GAGCCCGGTA TAGAGATGCA AGTTGAGAAT 1320TTAGAGTTAC CAGTAAATCC CTCCAGTGTG GTTAGCGAAA GGATTAGCAG TGTGTGA 1377(231)SEQ ID NO230的资料(i)序列特征(A)长度458个氨基酸(B)类型氨基酸
(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO230 的序列描述Met Val Asn Leu Arg Asn Ala Val His Ser Phe Leu Val His Leu Ile1 5 10 15Gly Leu Leu Val Trp Gln Cys Asp Ile Ser Val Ser Pro Val Ala Ala20 25 30Ile Val Thr Asp Ile Phe Asn Thr Ser Asp Gly Gly Arg Phe Lys Phe35 40 45Pro Asp Gly Val Gln Asn Trp Pro Ala Leu Ser Ile Val Ile Ile Ile50 55 60Ile Met Thr Ile Gly Gly Asn Ile Leu Val Ile Met Ala Val Ser Met65 70 75 80Glu Lys Lys Leu His Asn Ala Thr Asn Tyr Phe Leu Met Ser Leu Ala85 90 95Ile Ala Asp Met Leu Val Gly Leu Leu Val Met Pro Leu Ser Leu Leu100 105 110Ala Ile Leu Tyr Asp Tyr Val Trp Pro Leu Pro Arg Tyr Leu Cys Pro115 120 125Val Trp Ile Ser Leu Asp Val Leu Phe Ser Thr Ala Ser Ile Met His130 135 140Leu Cys Ala Ile Ser Leu Asp Arg Tyr Val Ala Ile Arg Asn Pro Ile145 150 155 160Glu His Ser Arg Phe Asn Ser Arg Thr Lys Ala Ile Met Lys Ile Ala165 170 175Ile Val Trp Ala Ile Ser Ile Gly Val Ser Val Pro Ile Pro Val Ile180 185 190Gly Leu Arg Asp Glu Glu Lys Val Phe Val Asn Asn Thr Thr Cys Val195 200 205Leu Asn Asp Pro Asn Phe Val Leu Ile Gly Ser Phe Val Ala Phe Phe210 215 220Ile Pro Leu Thr Ile Met Val Ile Thr Tyr Cys Leu Thr Ile Tyr Val225 230 235 240Leu Arg Arg Gln Ala Leu Met Leu Leu His Gly His Thr Glu Glu Pro
245 250 255Pro Gly Leu Ser Leu Asp Phe Leu Lys Cys Cys Lys Arg Asn Thr Ala260 265 270Glu Glu Glu Asn Ser Ala Asn Pro Asn Gln Asp Gln Asn Ala Arg Arg275 280 285Arg Lys Lys Lys Glu Arg Arg Pro Arg Gly Thr Met Gln Ala Ile Asn290 295 300Asn Glu Arg Lys Ala Lys Lys Val Leu Gly Ile Val Phe Phe Val Phe305 310 315 320Leu Ile Met Trp Cys Pro Phe Phe Ile Thr Asn Ile Leu Ser Val Leu325 330 335Cys Glu Lys Ser Cys Asn Gln Lys Leu Met Glu Lys Leu Leu Asn Val340 345 350Phe Val Trp Ile Gly Tyr Val Cys Ser Gly Ile Asn Pro Leu Val Tyr355 360 365Thr Leu Phe Asn Lys Ile Tyr Arg Arg Ala Phe Ser Asn Tyr Leu Arg370 375 380Cys Asn Tyr Lys Val Glu Lys Lys Pro Pro Val Arg Gln Ile Pro Arg385 390 395 400Val Ala Ala Thr Ala Leu Ser Gly Arg Glu Leu Asn Val Asn Ile Tyr405 410 415Arg His Thr Asn Glu Pro Val Ile Glu Lys Ala Ser Asp Asn Glu Pro420 425 430Gly Ile Glu Met Gln Val Glu Asn Leu Glu Leu Pro Val Asn Pro Ser435 440 445Ser Val Val Ser Glu Arg Ile Ser Ser Val450 455(232)SEQ ID NO231的资料(i)序列特征(A) 长度1068个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO231的序列描述ATGGATCAGT TCCCTGAATC AGTGACAGAA AACTTTGAGT ACGATGATTT GGCTGAGGCC 60TGTTATATTG GGGACATCGT GGTCTTTGGG ACTGTGTTCC TGTCCATATT CTACTCCGTC 120ATCTTTGCCA TTGGCCTGGT GGGAAATTTG TTGGTAGTGT TTGCCCTCAC CAACAGCAAG 180AAGCCCAAGA GTGTCACCGA CATTTACCTC CTGAACCTGG CCTTGTCTGA TCTGCTGTTT 240GTAGCCACTT TGCCCTTCTG GACTCACTAT TTGATAAATG AAAAGGGCCT CCACAATGCC 300ATGTGCAAAT TCACTACCGC CTTCTTCTTC ATCGGCTTTT TTGGAAGCAT ATTCTTCATC 360ACCGTCATCA GCATTGATAG GTACCTGGCC ATCGTCCTGG CCGCCAACTC CATGAACAAC 420CGGACCGTGC AGCATGGCGT CACCATCAGC CTAGGCGTCT GGGCAGCAGC CATTTTGGTG 480GCAGCACCCC AGTTCATGTT CACAAAGCAG AAAGAAAATG AATGCCTTGG TGACTACCCC 540GAGGTCCTCC AGGAAATCTG GCCCGTGCTC CGCAATGTGG AAACAAATTT TCTTGGCTTC 600CTACTCCCCC TGCTCATTAT GAGTTATTGC TACTTCAGAA TCATCCAGAC GCTGTTTTCC 660TGCAAGAACC ACAAGAAAGC CAAAGCCAAG AAACTGATCC TTCTGGTGGT CATCGTGTTT 720TTCCTCTTCT GGACACCCTA CAACGTTATG ATTTTCCTGG AGACGCTTAA GCTCTATGAC 780TTCTTTCCCA GTTGTGACAT GAGGAAGGAT CTGAGGCTGG CCCTCAGTGT GACTGAGACG 840GTTGCATTTA GCCATTGTTG CCTGAATCCT CTCATCTATG CATTTGCTGG GGAGAAGTTC 900AGAAGATACC TTTACCACCT GTATGGGAAA TGCCTGGCTG TCCTGTGTGG GCGCTCAGTC 960CACGTTGATT TCTCCTCATC TGAATCACAA AGGAGCAGGC ATGGAAGTGT TCTGAGCAGC 1020AATTTTACTT ACCACACGAG TGATGGAGAT GCATTGCTCC TTCTCTGA1068(233)SEQ ID NO232的资料(i)序列特征(A)长度355个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO232的序列描述Met Asp Gln Phe Pro Glu Ser Val Thr Glu Asn Phe Glu Tyr Asp Asp1 5 10 15Leu Ala Glu Ala Cys Tyr Ile Gly Asp Ile Val Val Phe Gly Thr Val20 25 30Phe Leu Ser Ile Phe Tyr Ser Val Ile Phe Ala Ile Gly Leu Val Gly35 40 45Asn Leu Leu Val Val Phe Ala Leu Thr Asn Ser Lys Lys Pro Lys Ser50 55 60Val Thr Asp Ile Tyr Leu Leu Asn Leu Ala Leu Ser Asp Leu Leu Phe65 70 75 80Val Ala Thr Leu Pro Phe Trp Thr His Tyr Leu Ile Asn Glu Lys Gly85 90 95Leu His Asn Ala Met Cys Lys Phe Thr Thr Ala Phe Phe Phe Ile Gly
100105 110Phe Phe Gly Ser Ile Phe Phe Ile Thr Val Ile Ser Ile Asp Arg Tyr115 120 125Leu Ala Ile Val Leu Ala Ala Asn Ser Met Asn Asn Arg Thr Val Gln130 135 140His Gly Val Thr Ile Ser Leu Gly Val Trp Ala Ala Ala Ile Leu Val145 150 155 160Ala Ala Pro Gln Phe Met Phe Thr Lys G1n Lys Glu Asn Glu Cys Leu165 170 175Gly Asp Tyr Pro Glu Val Leu Gln Glu Ile Trp Pro Val Leu Arg Asn180 185 190Val Glu Thr Asn Phe Leu Gly Phe Leu Leu Pro Leu Leu Ile Met Ser195 200 205Tyr Cys Tyr Phe Arg Ile Ile Gln Thr Leu Phe Ser Cys Lys Asn His210 215 220Lys Lys Ala Lys Ala Lys Lys Leu Ile Leu Leu Val Val Ile Val Phe225 230 235 240Phe Leu Phe Trp Thr Pro Tyr Asn Val Met Ile Phe Leu Glu Thr Leu245 250 255Lys Leu Tyr Asp Phe Phe Pro Ser Cys Asp Met Arg Lys Asp Leu Arg260 265 270Leu Ala Leu Ser Val Thr Glu Thr Val Ala Phe Ser His Cys Cys Leu275 280 285Asn Pro Leu Ile Tyr Ala Phe Ala G1y Glu Lys Phe Arg Arg Tyr Leu290 295 300Tyr His Leu Tyr Gly Lys Cys Leu Ala Val Leu Cys Gly Arg Ser Val305 310 315 320His Val Asp Phe Ser Ser Ser Glu Ser Gln Arg Ser Arg His Gly Ser325 330 335Val Leu Ser Ser Asn Phe Thr Tyr His Thr Ser Asp Gly Asp Ala Leu340 345 350Leu Leu Leu355(234)SEQ ID NO233的资料(i)序列特征(A)长度29个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(iv)反义否(xi)SEQ ID NO233的序列描述GGCTTAAGAG CATCATCGTG GTGCTGGTG 29(235)SEQ ID NO234的资料(i)序列特征(A)长度34个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(iv)反义是(xi)SEQ ID NO234的序列描述GTCACCACCA GCACCACGAT GATGCTCTTA AGCC 34(236)SEQ ID NO235的资料(i)序列特征(A)长度31个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO235的序列描述CAAAGAAAGT ACTGGGCATC GTCTTCTTCC T 31(237)SEQ ID NO236的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO236的序列描述TGCTCTAGAT TCCAGATAGG TGAAAACTTG 30(238)SEQ ID NO237的资料(i)序列特征(A)长度50个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(iv)反义否(xi)SEQ ID NO237的序列描述CTAGGGGCAC CATGCAGGCT ATCAACAATG AAAGAAAAGC TAAGAAAGTC 50(239)SEQ ID NO238的资料(i)序列特征(A)长度50个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(iv)反义是(xi)SEQ ID NO238的序列描述CAAGGACTTT CTTAGCTTTT CTTTTCATTGT TGATAGCCTG CATGGTGCCC 50(240)SEQ ID NO239的资料(i)序列特征(A)长度35个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO239的序列描述CGGCGGCAGA AGGCGAAACG CATGATCCTC GCGGT 35(241)SEQ ID NO240的资料(i)序列特征(A)长度35个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO240的序列描述ACCGCGAGGA TCATGCGTTT CGCCTTCTGC CGCCG35(242)SEQ ID NO241的资料(i)序列特征(A)长度24个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO241的序列描述GAGACATATT ATCTGCCACG GAGG 24(243)SEQ ID NO242的资料(i)序列特征(A)长度24个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi) SEQ ID NO242的序列描述TTGGCATAGA AACCGGACCC AAGG 24(244)SEQ ID NO243的资料(i)序列特征(A)长度28个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO243的序列描述taagaattcc ataaaaatta tggaatgg 28(245)SEQ ID NO244的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO244的序列描述CCAGGATCCA GCTGAAGTCT TCCATCATTC 30(246)SEQ ID NO245的资料(i)序列特征(A)长度1071个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA (基因组的)(xi)SEQ ID NO245的序列描述ATGAATGGGG TCTCGGAGGG GACCAGAGGC TGCAGTGACA GGCAACCTGG GGTCCTGACA 60CGTGATCGCT CTTGTTCCAG GAAGATGAAC TCTTCCGGAT GCCTGTCTGA GGAGGTGGGG 120TCCCTCCGCC CACTGACTGT GGTTATCCTG TCTGCGTCCA TTGTCGTCGG AGTGCTGGGC 180AATGGGCTGG TGCTGTGGAT GACTGTCTTC CGTATGGCAC GCACGGTCTC CACCGTCTGC 240TTCTTCCACC TGGCCCTTGC CGATTTCATG CTCTCACTGT CTCTGCCCAT TGCCATGTAC 300TATATTGTCT CCAGGCAGTG GCTCCTCGGA GAGTGGGCCT GCAAACTCTA CATCACCTTT 360GTGTTCCTCA GCTACTTTGC CAGTAACTGC CTCCTTGTCT TCATCTCTGT GGACCGTTGC 420ATCTCTGTCC TCTACCCCGT CTGGGCCCTG AACCACCGCA CTGTGCAGCG GGCGAGCTGG 480CTGGCCTTTG GGGTGTGGCT CCTGGCCGCC GCCTTGTGCT CTGCGCACCT GAAATTCCGG 540ACAACCAGAA AATGGAATGG CTGTACGCAC TGCTACTTGG CGTTCAACTC TGACAATGAG 600ACTGCCCAGA TTTGGATTGA AGGGGTCGTG GAGGGACACA TTATAGGGAC CATTGGCCAC 660TTCCTGCTGG GCTTCCTGGG GCCCTTAGCA ATCATAGGCA CCTGCGCCCA CCTCATCCGG 720GCCAAGCTCT TGCGGGAGGG CTGGGTCCAT GCCAACCGGC CCGCGAGGCT GCTGCTGGTG 780CTGGTGAGCG CTTTCTTTAT CTTCTGGTCC CCGTTTAACG TGGTGCTGTT GGTCCATCTG 840TGGCGACGGG TGATGCTCAA GGAAATCTAC CACCCCCGGA TGCTGCTCAT CCTCCAGGCT 900AGCTTTGCCT TGGGCTGTGT CAACAGCAGC CTCAACCCCT TCCTCTACGT CTTCGTTGGC 960AGAGATTTCC AAGAAAAGTT TTTCCAGTCT TTGACTTCTG CCCTGGCGAG GGCGTTTGGA 1020GAGGAGGAGT TTCTGTCATC CTGTCCCCGT GGCAACGCCC CCCGGGAATG A1071(247)SEQ ID NO246的资料(i)序列特征(A)长度356个氨基酸
(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋质(xi)SEQ ID NO246的序列描述Met Asn Gly Val Ser Glu Gly Thr Arg Gly Cys Ser Asp Arg Gln Pro1 5 10 15Gly Val Leu Thr Arg Asp Arg Ser Cys Ser Arg Lys Met Asn Ser Ser20 25 30Gly Cys Leu Ser Glu Glu Val Gly Ser Leu Arg Pro Leu Thr Val Val35 40 45Ile Leu Ser Ala Ser Ile Val Val Gly Val Leu Gly Asn Gly Leu Val50 55 60Leu Trp Met Thr Val Phe Arg Met Ala Arg Thr Val Ser Thr Val Cys65 70 75 80Phe Phe His Leu Ala Leu Ala Asp Phe Met Leu Ser Leu Ser Leu Pro85 90 95Ile Ala Met Tyr Tyr Ile Val Ser Arg Gln Trp Leu Leu Gly Glu Trp100 105 110Ala Cys Lys Leu Tyr Ile Thr Phe Val Phe Leu Ser Tyr Phe Ala Ser115 120 125Asn Cys Leu Leu Val Phe Ile Ser Val Asp Arg Cys Ile Ser Val Leu130 135 140Tyr Pro Val Trp Ala Leu Asn His Arg Thr Val Gln Arg Ala Ser Trp145 150 155 160Leu Ala Phe Gly Val Trp Leu Leu Ala Ala Ala Leu Cys Ser Ala His165 170 175Leu Lys Phe Arg Thr Thr Arg Lys Trp Asn Gly Cys Thr His Cys Tyr180 185 190Leu Ala Phe Asn Ser Asp Ash Glu Thr Ala Gln Ile Trp Ile Glu Gly195 200 205Val Val Glu Gly His Ile Ile Gly Thr Ile Gly His Phe Leu Leu Gly210 215 220Phe Leu Gly Pro Leu Ala Ile Ile Gly Thr Cys Ala His Leu Ile Arg225 230 235 240Ala Lys Leu Leu Arg Glu Gly Trp Val His Ala Asn Arg Pro Ala Arg245 250 255Leu Leu Leu Val Leu Val Ser Ala Phe Phe Ile Phe Trp Ser Pro Phe260 265 270Asn Val Val Leu Leu Val His Leu Trp Arg Arg Val Met Leu Lys Glu275 280 285Ile Tyr His Pro Arg Met Leu Leu Ile Leu Gln Ala Ser Phe Ala Leu290 295 300Gly Cys Val Asn Ser Ser Leu Asn Pro Phe Leu Tyr Val Phe Val Gly305 310 315 320Arg Asp Phe Gln Glu Lys Phe Phe Gln Ser Leu Thr Ser Ala Leu Ala325 330 335Arg Ala Phe Gly Glu Glu Glu Phe Leu Ser Ser Cys Pro Arg Gly Asn340 345 350Ala Pro Arg Glu355(248) SEQ ID NO247的资料(i)序列特征(A)长度32个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO247的序列描述GCAGAATTCG GCGGCCCCAT GGACCTGCCC CC 32(249)SEQ ID NO248的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO248的序列描述GCTGGATCCC CCGAGCAGTG GCGTTACTTC30(250) SEQ ID NO249的资料(i)序列特征(A)长度903个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO249的序列描述ATGGACCTGC CCCCGCAGCT CTCCTTCGGC CTCTATGTGG CCGCCTTTGC GCTGGGCTTC 60CCGCTCAACG TCCTGGCCAT CCGAGGCGCG ACGGCCCACG CCCGGCTCCG TCTCACCCCT 120AGCCTGGTCT ACGCCCTGAA CCTGGGCTGC TCCGACCTGC TGCTGACAGT CTCTCTGCCC 180CTGAAGGCGG TGGAGGCGCT AGCCTCCGGG GCCTGGCCTC TGCCGGCCTC GCTGTGCCCC 240GTCTTCGCGG TGGCCCACTT CTTCCCACTC TATGCCGGCG GGGGCTTCCT GGCCGCCCTG 300AGTGCAGGCC GCTACCTGGG AGCAGCCTTC CCCTTGGGCT ACCAAGCCTT CCGGAGGCCG 360TGCTATTCCT GGGGGGTGTG CGCGGCCATC TGGGCCCTCG TCCTGTGTCA CCTGGGTCTG 420GTCTTTGGGT TGGAGGCTCC AGGAGGCTGG CTGGACCACA GCAACACCTC CCTGGGCATC 480AACACACCGG TCAACGGCTC TCCGGTCTGC CTGGAGGCCT GGGACCCGGC CTCTGCCGGC 540CCGGCCCGCT TCAGCCTCTC TCTCCTGCTC TTTTTTCTGC CCTTGGCCAT CACAGCCTTC 600TGCTACGTGG GCTGCCTCCG GGCACTGGCC CGCTCCGGCC TGACGCACAG GCGGAAGCTG 660CGGGCCGCCT GGGTGGCCGG CGGGGCCCTC CTCACGCTGC TGCTCTGCGT AGGACCCTAC 720AACGCCTCCA ACGTGGCCAG CTTCCTGTAC CCCAATCTAG GAGGCTCCTG GCGGAAGCTG 780GGGCTCATCA CGGGTGCCTG GAGTGTGGTG CTTAATCCGC TGGTGACCGG TTACTTGGGA 840AGGGGTCCTG GCCTGAAGAC AGTGTGTGCG GCAAGAACGC AAGGGGGCAA GTCCCAGAAG 900TAA 903(251) SEQ ID NO250的资料(i)序列特征(A)长度300个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO250的序列描述Met Asp Leu Pro Pro Gln Leu Ser Phe Gly Leu Tyr Val Ala Ala Phe1 5 10 15Ala Leu Gly Phe Pro Leu Asn Val Leu Ala Ile Arg Gly Ala Thr Ala20 25 30His Ala Arg Leu Arg Leu Thr Pro Ser Leu Val Tyr Ala Leu Asn Leu35 40 45Gly Cys Ser Asp Leu Leu Leu Thr Val Ser Leu Pro Leu Lys Ala Val
50 55 60Glu Ala Leu Ala Ser Gly Ala Trp Pro Leu Pro Ala Ser Leu Cys Pro65 70 75 80Val Phe Ala Val Ala His Phe Phe Pro Leu Tyr Ala Gly Gly Gly Phe85 90 95Leu Ala Ala Leu Ser Ala Gly Arg Tyr Leu Gly Ala Ala Phe Pro Leu100 105 110Gly Tyr Gln Ala Phe Arg Arg Pro Cys Tyr Ser Trp Gly Val Cys Ala115 120 125Ala Ile Trp Ala Leu Val Leu Cys His Leu Gly Leu Val Phe Gly Leu130 135 140Glu Ala Pro Gly Gly Trp Leu Asp His Ser Asn Thr Ser Leu Gly Ile145 150 155 160Asn Thr Pro Val Asn Gly Ser Pro Val Cys Leu Glu Ala Trp Asp Pro165 170 175Ala Ser Ala Gly Pro Ala Arg Phe Ser Leu Ser Leu Leu Leu Phe Phe180 185 190Leu Pro Leu Ala Ile Thr Ala Phe Cys Tyr Val Gly Cys Leu Arg Ala195 200 205Leu Ala Arg Ser Gly Leu Thr His Arg Arg Lys Leu Arg Ala Ala Trp210 215 220Val Ala Gly Gly Ala Leu Leu Thr Leu Leu Leu Cys Val Gly Pro Tyr225 230 235 240Asn Ala Ser Asn Val Ala Ser Phe Leu Tyr Pro Asn Leu Gly Gly Ser245 250 255Trp Arg Lys Leu Gly Leu Ile Thr Gly Ala Trp Ser Val Val Leu Asn260 265 270Pro Leu Val Thr Gly Tyr Leu Gly Arg Gly Pro Gly Leu Lys Thr Val275 280 285Cys Ala Ala Arg Thr Gln Gly Gly Lys Ser Gln Lys290 295 300(252)SEQ ID NO251的资料(i)序列特征(A)长度31个碱基对
(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO251的序列描述CTCAAGCTTA CTCTCTCTCA CCAGTGGCCA C 31(253)SEQ ID NO252的资料(i)序列特征(A)长度24个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO252的序列描述CCCTCCTCCC CCGGAGGACC TAGC 24(254)SEQ ID NO253的资料(i)序列特征(A)长度1041个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO253的序列描述ATGGATACAG GCCCCGACCA GTCCTACTTC TCCGGCAATC ACTGGTTCGT CTTCTCGGTG 60TACCTTCTCA CTTTCCTGGT GGGGCTCCCC CTCAACCTGC TGGCCCTGGT GGTCTTCGTG 120GGCAAGCTGC AGCGCCGCCC GGTGGCCGTG GACGTGCTCC TGCTCAACCT GACCGCCTCG 180GACCTGCTCC TGCTGCTGTT CCTGCCTTTC CGCATGGTGG AGGCAGCCAA TGGCATGCAC 240TGGCCCCTGC CCTTCATCCT CTGCCCACTC TCTGGATTCA TCTTCTTCAC CACCATCTAT 300CTCACCGCCC TCTTCCTGGC AGCTGTGAGC ATTGAACGCT TCCTGAGTGT GGCCCACCCA 360CTGTGGTACA AGACCCGGCC GAGGCTGGGG CAGGCAGGTC TGGTGAGTGT GGCCTGCTGG 420CTGTTGGCCT CTGCTCACTG CAGCGTGGTC TACGTCATAG AATTCTCAGG GGACATCTCC 480CACAGCCAGG GCACCAATGG GACCTGCTAC CTGGAGTTCC GGAAGGACCA GCTAGCCATC 540CTCCTGCCCG TGCGGCTGGA GATGGCTGTG GTCCTCTTTG TGGTCCCGCT GATCATCACC 600AGCTACTGCT ACAGCCGCCT GGTGTGGATC CTCGGCAGAG GGGGCAGCCA CCGCCGGCAG 660AGGAGGGTGG CGGGGCTGTT GGCGGCCACG CTGCTCAACT TCCTTGTCTG CTTTGGGCCC 720TACAACGTGT CCCATGTCGT GGGCTATATC TGCGGTGAAA GCCCGGCATG GAGGATCTAC 780GTGACGCTTC TCAGCACCCT GAACTCCTGT GTCGACCCCT TTGTCTACTA CTTCTCCTCC 840TCCGGGTTCC AAGCCGACTT TCATGAGCTG CTGAGGAGGT TGTGTGGGCT CTGGGGCCAG 900TGGCAGCAGG AGAGCAGCAT GGAGCTGAAG GAGCAGAAGG GAGGGGAGGA GCAGAGAGCG 960GACCGACCAG CTGAAAGAAA GACCAGTGAA CACTCACAGG GCTGTGGAAC TGGTGGCCAG 1020GTGGCCTGTG CTGAAAGCTA G 1041(255)SEQ ID NO254的资料(i)序列特征(A)长度346个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO254的序列描述Met Asp Thr Gly Pro Asp Gln Ser Tyr Phe Ser Gly Asn His Trp Phe1 5 10 15Val Phe Ser Val Tyr Leu Leu Thr Phe Leu Val Gly Leu Pro Leu Asn20 25 30Leu Leu Ala Leu Val Val Phe Val Gly Lys Leu Gln Arg Arg Pro Val35 40 45Ala Val Asp Val Leu Leu Leu Asn Leu Thr Ala Ser Asp Leu Leu Leu50 55 60Leu Leu Phe Leu Pro Phe Arg Met Val Glu Ala Ala Asn Gly Met His65 70 75 80Trp Pro Leu Pro Phe Ile Leu Cys Pro Leu Ser Gly Phe Ile Phe Phe85 90 95Thr Thr Ile Tyr Leu Thr Ala Leu Phe Leu Ala Ala Val Ser Ile Glu100 105 110Arg Phe Leu Ser Val Ala His Pro Leu Trp Tyr Lys Thr Arg Pro Arg115 120 125Leu Gly Gln Ala Gly Leu Val Ser Val Ala Cys Trp Leu Leu Ala Ser130 135 140Ala His Cys Ser Val Val Tyr Val Ile Glu Phe Ser Gly Asp Ile Ser145 150 155 160His Ser Gln Gly Thr Asn Gly Thr Cys Tyr Leu Glu Phe Arg Lys Asp165 170 175Gln Leu Ala Ile Leu Leu Pro Val Arg Leu Glu Met Ala Val Val Leu180 185 190Phe Val Val Pro Leu Ile Ile Thr Ser Tyr Cys Tyr Ser Arg Leu Val195 200 205Trp Ile Leu Gly Arg Gly Gly Ser His Arg Arg Gln Arg Arg Val Ala210 215 220Gly Leu Leu Ala Ala Thr Leu Leu Asn Phe Leu Val Cys Phe Gly Pro225 230 235 240Tyr Asn Val Ser His Val Val Gly Tyr Ile Cys Gly Glu Ser Pro Ala245 250 255Trp Arg Ile Tyr Val Thr Leu Leu Ser Thr Leu Asn Ser Cys Val Asp260 265 270Pro Phe Val Tyr Tyr Phe Ser Ser Ser Gly Phe Gln Ala Asp Phe His275 280 285Glu Leu Leu Arg Arg Leu Cys Gly Leu Trp Gly Gln Trp Gln Gln Glu290 295 300Ser Ser Met Glu Leu Lys Glu Gln Lys Gly Gly Glu Glu Gln Arg Ala305 310 315 320Asp Arg Pro Ala Glu Arg Lys Thr Ser Glu His Ser Gln Gly Cys Gly325 330 335Thr Gly Gly Gln Val Ala Cys Ala Glu Ser340 345(256)SEQ ID NO255的资料(i)序列特征(A)长度31个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO255的序列描述TTTAAGCTTC CCCTCCAGGA TGCTGCCGGA C 31(257)SEQ ID NO256的资料(i)序列特征(A)长度31个碱基对(B )类型核酸(C)链型单链(D)拓扑学不相关(ii)分子类型DNA(基因组的)(xi)SEQ ID NO256的序列描述GGCGAATTCT GAAGGTCCAG GGAAACTGCT A 31(258)SEQ ID NO257的资料(i)序列特征(A)长度993个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA (基因组的)(xi)SEQ ID NO257的序列描述ATGCTGCCGG ACTGGAAGAG CTCCTTGATC CTCATGGCTT ACATCATCAT CTTCCTCACT 60GGCCTCCCTG CCAACCTCCT GGCCCTGCGG GCCTTTGTGG GGCGGATCCG CCAGCCCCAG 120CCTGCACCTG TGCACATCCT CCTGCTGAGC CTGACGCTGG CCGACCTCCT CCTGCTGCTG 180CTGCTGCCCT TCAAGATCAT CGAGGCTGCG TCGAACTTCC GCTGGTACCT GCCCAAGGTC 240GTCTGCGCCC TCACGAGTTT TGGCTTCTAC AGCAGCATCT ACTGCAGCAC GTGGCTCCTG 300GCGGGCATCA GCATCGAGCG CTACCTGGGA GTGGCTTTCC CCGTGCAGTA CAAGCTCTCC 360CGCCGGCCTC TGTATGGAGT GATTGCAGCT CTGGTGGCCT GGGTTATGTC CTTTGGTCAC 420TGCACCATCG TGATCATCGT TCAATACTTG AACACGACTG AGCAGGTCAG AAGTGGCAAT 480GAAATTACCT GCTACGAGAA CTTCACCGAT AACCAGTTGG ACGTGGTGCT GCCCGTGCGG 540CTGGAGCTGT GCCTGGTGCT CTTCTTCATC CCCATGGCAG TCACCATCTT CTGCTACTGG 600CGTTTTGTGT GGATCATGCT CTCCCAGCCC CTTGTGGGGG CCCAGAGGCG GCGCCGAGCC 660GTGGGGCTGG CTGTGGTGAC GCTGCTCAAT TTCCTGGTGT GCTTCGGACC TTACAACGTG 720TCCCACCTGG TGGGGTATCA CCAGAGAAAA AGCCCCTGGT GGCGGTCAAT AGCCGTGGTG 780TTCAGTTCAC TCAACGCCAG TCTGGACCCC CTGCTCTTCT ATTTCTCTTC TTCAGTGGTG 840CGCAGGGCAT TTGGGAGAGG GCTGCAGGTG CTGCGGAATC AGGGCTCCTC CCTGTTGGGA 900CGCAGAGGCA AAGACACAGC AGAGGGGACA AATGAGGACA GGGGTGTGGG TCAAGGAGAA 960GGGATGCCAA GTTCGGACTT CACTACAGAG TAG 993(259)SEQ ID NO258的资料(i)序列特征(A)长度362个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi) SEQ ID NO258的序列描述Met Leu Pro Asp Trp Lys Ser Ser Leu Ile Leu Met Ala Tyr Ile Ile1 5 10 15Ile Phe Leu Thr Gly Leu Pro Ala Asn Leu Leu Ala Leu Arg Ala Phe20 25 30Val Gly Arg Ile Arg Gln Pro Gln Pro Ala Pro Val His Ile Leu Leu35 40 45Leu Ser Leu Thr Leu Ala Asp Leu Leu Leu Leu Leu Leu Leu Pro Phe50 55 60Lys Ile Ile Glu Ala Ala Ser Asn Phe Arg Trp Tyr Leu Pro Lys Val65 70 75 80Val Cys Ala Leu Thr Ser Phe Gly Phe Tyr Ser Ser Ile Tyr Cys Ser85 90 95Thr Trp Leu Leu Ala Gly Ile Ser Ile Glu Arg Tyr Leu Gly Val Ala100 105 110Phe Pro Val Gln Tyr Lys Leu Ser Arg Arg Pro Leu Tyr Gly Val Ile115 120 125Ala Ala Leu Val Ala Trp Val Met Ser Phe Gly His Cys Thr Ile Val130 135 140Ile Ile Val Gln Tyr Leu Asn Thr Thr Glu Gln Val Arg Ser Gly Asn145 150 155 160Glu Ile Thr Cys Tyr Glu Asn Phe Thr Asp Asn Gln Leu Asp Val Val165 170 175Leu Pro Val Arg Leu Glu Leu Cys Leu Val Leu Phe Phe Ile Pro Met180 185 190Ala Val Thr Ile Phe Cys Tyr Trp Arg Phe Val Trp Ile Met Leu Ser195 200 205Gln Pro Leu Val Gly Ala Gln Arg Arg Arg Arg Ala Val Gly Leu Ala210 215 220Val Val Thr Leu Leu Asn Phe Leu Val Cys Phe Gly Pro Tyr Asn Val225 230 235 240Ser His Leu Val Gly Tyr His Gln Arg Lys Ser Pro Trp Trp Arg Ser245 250 255Ile Ala Val Val Phe Ser Ser Leu Asn Ala Ser Leu Asp Pro Leu Leu260 265 270Phe Tyr Phe Ser Ser Ser Val Val Arg Arg Ala Phe Gly Arg Gly Leu275 280 285Gln Val Leu Arg Asn Gln Gly Ser Ser Leu Leu Gly Arg Arg Gly Lys290 295 300Asp Thr Ala Glu Gly Thr Asn Glu Asp Arg Gly Val Gly Gln Gly Glu305 310 315 320Gly Met Pro Ser Ser Asp Phe Thr Thr Glu325 330(260)SEQ ID NO259的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO259的列描述CCCAAGCTTC GGGCACCATG GACACCTCCC 30(261)SEQ ID NO260的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO260的序列描述ACAGGATCCA AATGCACAGC ACTGGTAAGC 30(262)SEQ ID NO261的资料(i)序列特征(A)长度25个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO261的序列描述CTATAACTGG GTTACATGGT TTAAC25(263)SEQ ID NO262的资料(i)序列特征
(A)长度30个碱基对(B)类型核酸(C)链型单链(D)扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO262的序列描述TTTGAATTCA CATATTAATT AGAGACATGG 30(264)SEQ ID NO263的资料(i)序列特征(A)长度2724个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO263的序列描述ATGGACACCT CCCGGCTCGG TGTGCTCCTG TCCTTGCCTG TGCTGCTGCA GCTGGCGACC 60GGGGGCAGCT CTCCCAGGTC TGGTGTGTTG CTGAGGGGCT GCCCCACACA CTGTCATTGC 120GAGCCCGACG GCAGGATGTT GCTCAGGGTG GACTGCTCCG ACCTGGGGCT CTCGGAGCTG 180CCTTCCAACC TCAGCGTCTT CACCTCCTAC CTAGACCTCA GTATGAACAA CATCAGTCAG 240CTGCTCCCGA ATCCCCTGCC CAGTCTCCGC TTCCTGGAGG AGTTACGTCT TGCGGGAAAC 300GCTCTGACAT ACATTCCCAA GGGAGCATTC ACTGGCCTTT ACAGTCTTAA AGTTCTTATG 360CTGCAGAATA ATCAGCTAAG ACACGTACCC ACAGAAGCTC TGCAGAATTT GCGAAGCCTT 420CAATCCCTGC GTCTGGATGC TAACCACATC AGCTATGTGC CCCCAAGCTG TTTCAGTGGC 480CTGCATTCCC TGAGGCACCT GTGGCTGGAT GACAATGCGT TAACAGAAAT CCCCGTCCAG 540GCTTTTAGAA GTTTATCGGC ATTGCAAGCC ATGACCTTGG CCCTGAACAA AATACACCAC 600ATACCAGACT ATGCCTTTGG AAACCTCTCC AGCTTGGTAG TTCTACATCT CCATAACAAT 660AGAATCCACT CCCTGGGAAA GAAATGCTTT GATGGGCTCC ACAGCCTAGA GACTTTAGAT 720TTAAATTACA ATAACCTTGA TGAATTCCCC ACTGCAATTA GGACACTCTC CAACCTTAAA 780GAACTAGGAT TTCATAGCAA CAATATCAGG TCGATACCTG AGAAAGCATT TGTAGGCAAC 840CCTTCTCTTA TTACAATACA TTTCTATGAC AATCCCATCC AATTTGTTGG GAGATCTGCT 900TTTCAACATT TACCTGAACT AAGAACACTG ACTCTGAATG GTGCCTCACA AATAACTGAA 960TTTCCTGATT TAACTGGAAC TGCAAACCTG GAGAGTCTGA CTTTAACTGG AGCACAGATC 1020TCATCTCTTC CTCAAACCGT CTGCAATCAG TTACCTAATC TCCAAGTGCT AGATCTGTCT 1080TACAACCTAT TAGAAGATTT ACCCAGTTTT TCAGTCTGCC AAAAGCTTCA GAAAATTGAC 1140CTAAGACATA ATGAAATCTA CGAAATTAAA GTTGACACTT TCCAGCAGTT GCTTAGCCTC 1200CGATCGCTGA ATTTGGCTTG GAACAAAATT GCTATTATTC ACCCCAATGC ATTTTCCACT 1260TTGCCATCCC TAATAAAGCT GGACCTATCG TCCAACCTCC TGTCGTCTTT TCCTATAACT 1320GGGTTACATG GTTTAACTCA CTTAAAATTA ACAGGAAATC ATGCCTTACA GAGCTTGATA 1380TCATCTGAAA ACTTTCCAGA ACTCAAGGTT ATAGAAATGC CTTATGCTTA CCAGTGCTGT 1440GCATTTGGAG TGTGTGAGAA TGCCTATAAG ATTTCTAATC AATGGAATAA AGGTGACAAC 1500AGCAGTATGG ACGACCTTCA TAAGAAAGAT GCTGGAATGT TTCAGGCTCA AGATGAACGT 1560GACCTTGAAG ATTTCCTGCT TGACTTTGAG GAAGACCTGA AAGCCCTTCA TTCAGTGCAG 1620TGTTCACCTT CCCCAGGCCC CTTCAAACCC TGTGAACACC TGCTTGATGG CTGGCTGATC 1680AGAATTGGAG TGTGGACCAT AGCAGTTCTG GCACTTACTT GTAATGCTTT GGTGACTTCA 1740ACAGTTTTCA GATCCCCTCT GTACATTTCC CCCATTAAAC TGTTAATTGG GGTCATCGCA 1800GCAGTGAACA TGCTCACGGG AGTCTCCAGT GCCGTGCTGG CTGGTGTGGA TGCGTTCACT 1860TTTGGCAGCT TTGCACGACA TGGTGCCTGG TGGGAGAATG GGGTTGGTTG CCATGTCATT 1920GGTTTTTTGT CCATTTTTGC TTCAGAATCA TCTGTTTTCC TGCTTACTCT GGCAGCCCTG 1980GAGCGTGGGT TCTCTGTGAA ATATTCTGCA AAATTTGAAA CGAAAGCTCC ATTTTCTAGC 2040CTGAAAGTAA TCATTTTGCT CTGTGCCCTG CTGGCCTTGA CCATGGCCGC AGTTCCCCTG 2100CTGGGTGGCA GCAAGTATGG CGCCTCCCCT CTCTGCCTGC CTTTGCCTTT TGGGGAGCCC 2160AGCACCATGG GCTACATGGT CGCTCTCATC TTGCTCAATT CCCTTTGCTT CCTCATGATG 2220ACCATTGCCT ACACCAAGCT CTACTGCAAT TTGGACAAGG GAGACCTGGA GAATATTTGG 2280GACTGCTCTA TGGTAAAACA CATTGCCCTG TTGCTCTTCA CCAACTGCAT CCTAAACTGC 2340CCTGTGGCTT TCTTGTCCTT CTCCTCTTTA ATAAACCTTA CATTTATCAG TCCTGAAGTA 2400ATTAAGTTTA TCCTTCTGGT GGTAGTCCCA CTTCCTGCAT GTCTCAATCC CCTTCTCTAC 2460ATCTTGTTCA ATCCTCACTT TAAGGAGGAT CTGGTGAGCC TGAGAAAGCA AACCTACGTC 2520TGGACAAGAT CAAAACACCC AAGCTTGATG TCAATTAACT CTGATGATGT CGAAAAACAG 2580TCCTGTGACT CAACTCAAGC CTTGGTAACC TTTACCAGCT CCAGCATCAC TTATGACCTG 2640CCTCCCAGTT CCGTGCCATC ACCAGCTTAT CCAGTGACTG AGAGCTGCCA TCTTTCCTCT 2700GTGGCATTTG TCCCATGTCT CTAA2724(265)SEQ ID NO264的资料(i)序列特征(A)长度907个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO264的序列描述Met Asp Thr Ser Arg Leu Gly Val Leu Leu Ser Leu Pro Val Leu Leu1 5 10 15Gln Leu Ala Thr Gly Gly Ser Ser Pro Arg Ser Gly Val Leu Leu Arg20 25 30Gly Cys Pro Thr His Cys His Cys Glu Pro Asp Gly Arg Met Leu Leu35 40 45Arg Val Asp Cys Ser Asp Leu Gly Leu Ser Glu Leu Pro Ser Asn Leu50 55 60Ser Val Phe Thr Ser Tyr Leu Asp Leu Ser Met Asn Asn Ile Ser Gln65 70 75 80Leu Leu Pro Asn Pro Leu Pro Ser Leu Arg Phe Leu Glu Glu Leu Arg85 90 95Leu Ala Gly Asn Ala Leu Thr Tyr Ile Pro Lys Gly Ala Phe Thr Gly100 105 110Leu Tyr Ser Leu Lys Val Leu Met Leu Gln Asn Asn Gln Leu Arg His115 120 125Val Pro Thr Glu Ala Leu Gln Asn Leu Arg Ser Leu Gln Ser Leu Arg130 135 140Leu Asp Ala Asn His Ile Ser Tyr Val Pro Pro Ser Cys Phe Ser Gly145 150 155 160Leu His Ser Leu Arg His Leu Trp Leu Asp Asp Asn Ala Leu Thr Glu165 170 175Ile Pro Val Gln Ala Phe Arg Ser Leu Ser Ala Leu Gln Ala Met Thr180 185 190Leu Ala Leu Asn Lys Ile His His Ile Pro Asp Tyr Ala Phe Gly Asn195 200 205Leu Ser Ser Leu Val Val Leu His Leu His Asn Asn Arg Ile His Ser210 215 220Leu Gly Lys Lys Cys Phe Asp Gly Leu His Ser Leu Glu Thr Leu Asp225 230 235 240Leu Asn Tyr Asn Asn Leu Asp Glu Phe Pro Thr Ala Ile Arg Thr Leu245 250 255Ser Asn Leu Lys Glu Leu Gly Phe His Ser Asn Asn Ile Arg Ser Ile260 265 270Pro Glu Lys Ala Phe Val Gly Asn Pro Ser Leu Ile Thr Ile His Phe275 280 285Tyr Asp Asn Pro Ile Gln Phe Val Gly Arg Ser Ala Phe Gln His Leu290 295 300Pro Glu Leu Arg Thr Leu Thr Leu Asn Gly Ala Ser Gln Ile Thr Glu305 310 315 320Phe Pro Asp Leu Thr Gly Thr Ala Asn Leu Glu Ser Leu Thr Leu Thr325 330 335Gly Ala Gln Ile Ser Ser Leu Pro Gln Thr Val Cys Asn Gln Leu Pro340 345 350Asn Leu Gln Val Leu Asp Leu Ser Tyr Asn Leu Leu Glu Asp Leu Pro355 360 365Ser Phe Ser Val Cys Gln Lys Leu Gln Lys Ile Asp Leu Arg His Asn370 375 380Glu Ile Tyr Glu Ile Lys Val Asp Thr Phe Gln Gln Leu Leu Ser Leu385 390 395 400Arg Ser Leu Asn Leu Ala Trp Asn Lys Ile Ala Ile Ile His Pro Asn405 410 415Ala Phe Ser Thr Leu Pro Ser Leu Ile Lys Leu Asp Leu Ser Ser Asn420 425 430Leu Leu Ser Ser Phe Pro Ile Thr Gly Leu His Gly Leu Thr His Leu435 440 445Lys Leu Thr Gly Asn His Ala Leu Gln Ser Leu Ile Ser Ser Glu Asn450 455 460Phe Pro Glu Leu Lys Val Ile Glu Met Pro Tyr Ala Tyr Gln Cys Cys465 470 475 480Ala Phe Gly Val Cys Glu Asn Ala Tyr Lys Ile Ser Asn Gln Trp Asn485 490 495Lys Gly Asp Asn Ser Ser Met Asp Asp Leu His Lys Lys Asp Ala Gly500 505 510Met Phe Gln Ala Gln Asp Glu Arg Asp Leu Glu Asp Phe Leu Leu Asp515 520 525Phe Glu Glu Asp Leu Lys Ala Leu His Ser Val Gln Cys Ser Pro Ser530 535 540Pro Gly Pro Phe Lys Pro Cys Glu His Leu Leu Asp Gly Trp Leu Ile545 550 555 560Arg Ile Gly Val Trp Thr Ile Ala Val Leu Ala Leu Thr Cys Asn Ala565 570 575Leu Val Thr Ser Thr Val Phe Arg Ser Pro Leu Tyr Ile Ser Pro Ile580 585 590Lys Leu Leu Ile Gly Val Ile Ala Ala Val Asn Met Leu Thr Gly Val595 600 605Ser Ser Ala Val Leu Ala Gly Val Asp Ala Phe Thr Phe Gly Ser Phe610 615 620Ala Arg His Gly Ala Trp Trp Glu Asn Gly Val Gly Cys His Val Ile625 630 635 640Gly Phe Leu Ser Ile Phe Ala Ser Glu Ser Ser Val Phe Leu Leu Thr645 650 655Leu Ala Ala Leu Glu Arg Gly Phe Ser Val Lys Tyr Ser Ala Lys Phe660 665 670Glu Thr Lys Ala Pro Phe Ser Ser Leu Lys Val Ile Ile Leu Leu Cys675 680 685Ala Leu Leu Ala Leu Thr Met Ala Ala Val Pro Leu Leu Gly Gly Ser690 695 700Lys Tyr Gly Ala Ser Pro Leu Cys Leu Pro Leu Pro Phe Gly Glu Pro705 710 715 720Ser Thr Met Gly Tyr Met Val Ala Leu Ile Leu Leu Asn Ser Leu Cys725 730 735Phe Leu Met Met Thr Ile Ala Tyr Thr Lys Leu Tyr Cys Asn Leu Asp740 745 750Lys Gly Asp Leu Glu Asn Ile Trp Asp Cys Ser Met Val Lys His Ile755 760 765Ala Leu Leu Leu Phe Thr Asn Cys Ile Leu Asn Cys Pro Val Ala Phe770 775 780Leu Ser Phe Ser Ser Leu Ile Asn Leu Thr Phe Ile Ser Pro Glu Val785 790 795 800Ile Lys Phe Ile Leu Leu Val Val Val Pro Leu Pro Ala Cys Leu Asn805 810 815Pro Leu Leu Tyr Ile Leu Phe Asn Pro His Phe Lys Glu Asp Leu Val820 825 830Ser Leu Arg Lys Gln Thr Tyr Val Trp Thr Arg Ser Lys His Pro Ser835 840 845Leu Met Ser Ile Asn Ser Asp Asp Val Glu Lys Gln Ser Cys Asp Ser850 855 860Thr Gln Ala Leu Val Thr Phe Thr Ser Ser Ser Ile Thr Tyr Asp Leu865 870 875 880Pro Pro Ser Ser Val Pro Ser Pro Ala Tyr Pro Val Thr Glu Ser Cys885 890 895His Leu Ser Ser Val Ala Phe Val Pro Cys Leu900 905(266)SEQ ID NO265的资料(i)序列特征(A)长度30个碱基对
(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO265的序列描述CGGAAGCTGC GGGCCAAATG GGTGGCCGGC 30(267)SEQ ID NO266的资料(i)序列特征(A)长度27个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO266的序列描述CAGAGGAGGG TGAAGGGGCT GTTGGCG 27(268)SEQ ID NO267的资料(i)序列特征(A)长度30个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO267的序列描述GGCGGCGCCG AGCCAAGGGG CTGGCTGTGG30(269)SEQ ID NO268的资料(i)序列特征(A)长度32个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO268的序列描述GGGACTGCTC TATGAAAAAA CACATTGCCC TG32(270)SEQ ID NO269的资料(i)序列特征(A)长度1071个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO269的序列描述ATGAATGGGG TCTCGGAGGG GACCAGAGGC TGCAGTGACA GGCAACCTGG GGTCCTGACA 60CGTGATCGCT CTTGTTCCAG GAAGATGAAC TCTTCCGGAT GCCTGTCTGA GGAGGTGGGG 120TCCCTCCGCC CACTGACTGT GGTTATCCTG TCTGCGTCCA TTGTCGTCGG AGTGCTGGGC 180AATGGGCTGG TGCTGTGGAT GACTGTCTTC CGTATGGCAC GCACGGTCTC CACCGTCTGC 240TTCTTCCACC TGGCCCTTGC CGATTTCATG CTCTCACTGT CTCTGCCCAT TGCCATGTAC 300TATATTGTCT CCAGGCAGTG GCTCCTCGGA GAGTGGGCCT GCAAACTCTA CATCACCTTT 60GTGTTCCTCA GCTACTTTGC CAGTAACTGC CTCCTTGTCT TCATCTCTGT GGACCGTTGC 420ATCTCTGTCC TCTACCCCGT CTGGGCCCTG AACCACCGCA CTGTGCAGCG GGCGAGCTGG 480CTGGCCTTTG GGGTGTGGCT CCTGGCCGCC GCCTTGTGCT CTGCGCACCT GAAATTCCGG 540ACAACCAGAA AATGGAATGG CTGTACGCAC TGCTACTTGG CGTTCAACTC TGACAATGAG 600ACTGCCCAGA TTTGGATTGA AGGGGTCGTG GAGGGACACA TTATAGGGAC CATTGGCCAC 660TTCCTGCTGG GCTTCCTGGG GCCCTTAGCA ATCATAGGCA CCTGCGCCCA CCTCATCCGG 720GCCAAGCTCT TGCGGGAGGG CTGGGTCCAT GCCAACCGGC CCAAGAGGCT GCTGCTGGTG 780CTGGTGAGCG CTTTCTTTAT CTTCTGGTCC CCGTTTAACG TGGTGCTGTT GGTCCATCTG 840TGGCGACGGG TGATCCTCAA GGAAATCTAC CACCCCCGGA TGCTGCTCAT CCTCCAGGCT 900AGCTTTGCCT TGGGCTGTGT CAACAGCAGC CTCAACCCCT TCCTCTACGT CTTCGTTGGC 960AGAGATTTCC AAGAAAAGTT TTTCCAGTCT TTGACTTCTG CCCTGGCGAG GGCGTTTGGA 1020GAGGAGGAGT TTCTGTCATC CTGTCCCCGT GGCAACGCCC CCCGGGAATG A 1071(271)SEQ ID NO270的资料(i)序列特征(A)长度356个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi) SEQ ID NO270的序列描述Met Asn Gly Val Ser Glu Gly Thr Arg Gly Cys Ser Asp Arg Gln Pro1 5 10 15Gly Val Leu Thr Arg Asp Arg Ser Cys Ser Arg Lys Met Asn Ser Ser20 25 30Gly Cys Leu Ser Glu Glu Val Gly Ser Leu Arg Pro Leu Thr Val Val
35 40 45Ile Leu Ser Ala Ser Ile Val Val Gly Val Leu Gly Asn Gly Leu Val50 55 60Leu Trp Met Thr Val Phe Arg Met Ala Arg Thr Val Ser Thr Val Cys65 70 75 80Phe Phe His Leu Ala Leu Ala Asp Phe Met Leu Ser Leu Ser Leu Pro85 90 95Ile Ala Met Tyr Tyr Ile Val Ser Arg Gln Trp Leu Leu Gly Glu Trp100 105 110Ala Cys Lys Leu Tyr Ile Thr Phe Val Phe Leu Ser Tyr Phe Ala Ser115 120 125Asn Cys Leu Leu Val Phe Ile Ser Val Asp Arg Cys Ile Ser Val Leu130 135 140Tyr Pro Val Trp Ala Leu Asn His Arg Thr Val Gln Arg Ala Ser Trp145 150 155 160Leu Ala Phe Gly Val Trp Leu Leu Ala Ala Ala Leu Cys Ser Ala His165 170 175Leu Lys Phe Arg Thr Thr Arg Lys Trp Asn Gly Cys Thr His Cys Tyr180 185 190Leu Ala Phe Asn Ser Asp Asn Glu Thr Ala Gln Ile Trp Ile Glu Gly195 200 205Val Val Glu Gly His Ile Ile Gly Thr Ile Gly His Phe Leu Leu Gly210 215 220Phe Leu Gly Pro Leu Ala Ile Ile Gly Thr Cys Ala His Leu Ile Arg225 230 235 240Ala Lys Leu Leu Arg Glu Gly Trp Val His Ala Asn Arg Pro Lys Arg245 250 255Leu Leu Leu Val Leu Val Ser Ala Phe Phe Ile Phe Trp Ser Pro Phe260 265 270Ash Val Val Leu Leu Val His Leu Trp Arg Arg Val Met Leu Lys Glu275 280 285Ile Tyr His Pro Arg Met Leu Leu Ile Leu Gln Ala Ser Phe Ala Leu290 295 300Gly Cys Val Asn Ser Ser Leu Asn Pro Phe Leu Tyr Val Phe Val Gly305 310 315 320Arg Asp Phe Gln Glu Lys Phe Phe Gln Ser Leu Thr Ser Ala Leu Ala325 330 335Arg Ala Phe Gly Glu Glu Glu Phe Leu Ser Ser Cys Pro Arg Gly Asn340 345 350Ala Pro Arg Glu355(272)SEQ ID NO271的资料(i)序列特征(A)长度903个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO271的序列描述ATGGACCTGC CCCCGCAGCT CTCCTTCGGC CTCTATGTGG CCGCCTTTGC GCTGGGCTTC 60CCGCTCAACG TCCTGGCCAT CCGAGGCGCG ACGGCCCACG CCCGGCTCCG TCTCACCCCT 120AGCCTGGTCT ACGCCCTGAA CCTGGGCTGC TCCGACCTGC TGCTGACAGT CTCTCTGCCC 180CTGAAGGCGG TGGAGGCGCT AGCCTCCGGG GCCTGGCCTC TGCCGGCCTC GCTGTGCCCC 240GTCTTCGCGG TGGCCCACTT CTTCCCACTC TATGCCGGCG GGGGCTTCCT GGCCGCCCTG 300AGTGCAGGCC GCTACCTGGG AGCAGCCTTC CCCTTGGGCT ACCAAGCCTT CCGGAGGCCG 360TGCTATTCCT GGGGGGTGTG CGCGGCCATC TGGGCCCTCG TCCTGTGTCA CCTGGGTCTG 420GTCTTTGGGT TGGAGGCTCC AGGAGGCTGG CTGGACCACA GCAACACCTC CCTGGGCATC 480AACACACCGG TCAACGGCTC TCCGGTCTGC CTGGAGGCCT GGGACCCGGC CTCTGCCGGC 540CCGGCCCGCT TCAGCCTCTC TCTCCTGCTC TTTTTTCTGC CCTTGGCCAT CACAGCCTTC 600TGCTACGTGG GCTGCCTCCG GGCACTGGCC CGCTCCGGCC TGACGCACAG GCGGAAGCTG 660CGGGCCAAAT GGGTGGCCGG CGGGGCCCTC CTCACGCTGC TGCTCTGCGT AGGACCCTAC 720AACGCCTCCA ACGTGGCCAG CTTCCTGTAC CCCAATCTAG GAGGCTCCTG GCGGAAGCTG 780GGGCTCATCA CGGGTGCCTG GAGTGTGGTG CTTAATCCGC TGGTGACCGG TTACTTGGGA 840AGGGGTCCTG GCCTGAAGAC AGTGTGTGCG GCAAGAACGC AAGGGGGCAA GTCCCAGAAG 900TAA 903(273)SEQ ID NO272的资料(i)序列特征(A)长度300个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO272的序列描述Met Asp Leu Pro Pro Gln Leu Ser Phe Gly Leu Tyr Val Ala Ala Phe1 5 10 15Ala Leu Gly Phe Pro Leu Asn Val Leu Ala Ile Arg Gly Ala Thr Ala20 25 30His Ala Arg Leu Arg Leu Thr Pro Ser Leu Val Tyr Ala Leu Asn Leu35 40 45Gly Cys Ser Asp Leu Leu Leu Thr Val Ser Leu Pro Leu Lys Ala Val50 55 60Glu Ala Leu Ala Ser Gly Ala Trp Pro Leu Pro Ala Ser Leu Cys Pro65 70 75 80Val Phe Ala Val Ala His Phe Phe Pro Leu Tyr Ala Gly Gly Gly Phe85 90 95Leu Ala Ala Leu Ser Ala Gly Arg Tyr Leu Gly Ala Ala Phe Pro Leu100 105 110Gly Tyr Gln Ala Phe Arg Arg Pro Cys Tyr Ser Trp Gly Val Cys Ala115 120 125Ala Ile Trp Ala Leu Val Leu Cys His Leu Gly Leu Val Phe Gly Leu130 135 140Glu Ala Pro Gly Gly Trp Leu Asp His Ser Asn Thr Ser Leu Gly Ile145 150 155 160Asn Thr Pro Val Asn Gly Ser Pro Val Cys Leu Glu Ala Trp Asp Pro165 170 175Ala Ser Ala Gly Pro Ala Arg Phe Ser Leu Ser Leu Leu Leu Phe Phe180 185 190Leu Pro Leu Ala Ile Thr Ala Phe Cys Tyr Val Gly Cys Leu Arg Ala195 200 205Leu Ala Arg Ser Gly Leu Thr His Arg Arg Lys Leu Arg Ala Lys Trp210 215 220Val Ala Gly Gly Ala Leu Leu Thr Leu Leu Leu Cys Val Gly Pro Tyr225 230 235 240Asn Ala Ser Asn Val Ala Ser Phe Leu Tyr Pro Asn Leu Gly Gly Ser245 250 255Trp Arg Lys Leu Gly Leu Ile Thr Gly Ala Trp Ser Val Val Leu Asn260 265 270Pro Leu Val Thr Gly Tyr Leu Gly Arg Gly Pro Gly Leu Lys Thr Val
275 280285Cys Ala Ala Arg Thr Gln Gly Gly Lys Ser Gln Lys290 295 300(274)SEQ ID NO273的资料(i)序列特征(A)长度1041个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO273的序列描述ATGGATACAG GCCCCGACCA GTCCTACTTC TCCGGCAATC ACTGGTTCGT CTTCTCGGTG 60TACCTTCTCA CTTTCCTGGT GGGGCTCCCC CTCAACCTGC TGGCCCTGGT GGTCTTCGTG 120GGCAAGCTGC AGCGCCGCCC GGTGGCCGTG GACGTGCTCC TGCTCAACCT GACCGCCTCG 180GACCTGCTCC TGCTGCTGTT CCTGCCTTTC CGCATGGTGG AGGCAGCCAA TGGCATGCAC 240TGGCCCCTGC CCTTCATCCT CTGCCCACTC TCTGGATTCA TCTTCTTCAC CACCATCTAT 300CTCACCGCCC TCTTCCTGGC AGCTGTGAGC ATTGAACGCT TCCTGAGTGT GGCCCACCCA 360CTGTGGTACA AGACCCGGCC GAGGCTGGGG CAGGCAGGTC TGGTGAGTGT GGCCTGCTGG 420CTGTTGGCCT CTGCTCACTG CAGCGTGGTC TACGTCATAG AATTCTCAGG GGACATCTCC 480CACAGCCAGG GCACCAATGG GACCTGCTAC CTGGAGTTCC GGAAGGACCA GCTAGCCATC 540CTCCTGCCCG TGCGGCTGGA GATGGCTGTG GTCCTCTTTG TGGTCCCGCT GATCATCACC 600AGCTACTGCT ACAGCCGCCT GGTGTGGATC CTCGGCAGAG GGGGCAGCCA CCGCCGGCAG 660AGGAGGGTGA AGGGGCTGTT GGCGGCCACG CTGCTCAACT TCCTTGTCTG CTTTGGGCCC 720TACAACGTGT CCCATGTCGT GGGCTATATC TGCGGTGAAA GCCCGGCATG GAGGATCTAC 780GTGACGCTTC TCAGCACCCT GAACTCCTGT GTCGACCCCT TTGTCTACTA CTTCTCCTCC 840TCCGGGTTCC AAGCCGACTT TCATGAGCTG CTGAGGAGGT TGTGTGGGCT CTGGGGCCAG 900TGGCAGCAGG AGAGCAGCAT GGAGCTGAAG GAGCAGAAGG GAGGGGAGGA GCAGAGAGCG 960GACCGACCAG CTGAAAGAAA GACCAGTGAA CACTCACAGG GCTGTGGAAC TGGTGGCCAG 1020GTGGCCTGTG CTGAAAGCTA G 1041(275)SEQ ID NO274的资料(i)序列特征(A)长度346个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO274的序列描述Met Asp Thr Gly Pro Asp Gln Ser Tyr Phe Ser Gly Asn His Trp Phe1 5 10 15Val Phe Ser Val Tyr Leu Leu Thr Phe Leu Val Gly Leu Pro Leu Asn
20 25 30Leu Leu Ala Leu Val Val Phe Val Gly Lys Leu Gln Arg Arg Pro Val35 40 45Ala Val Asp Val Leu Leu Leu Asn Leu Thr Ala Ser Asp Leu Leu Leu50 55 60Leu Leu Phe Leu Pro Phe Arg Met Val Glu Ala Ala Asn Gly Met His65 70 75 80Trp Pro Leu Pro Phe Ile Leu Cys Pro Leu Ser Gly Phe Ile Phe Phe85 90 95Thr Thr Ile Tyr Leu Thr Ala Leu Phe Leu Ala Ala Val Ser Ile Glu100 105 110Arg Phe Leu Ser Val Ala His Pro Leu Trp Tyr Lys Thr Arg Pro Arg115 120 125Leu Gly Gln Ala Gly Leu Val Ser Val Ala Cys Trp Leu Leu Ala Ser130 135 140Ala His Cys Ser Val Val Tyr Val Ile Glu Phe Ser Gly Asp Ile Ser145 150 155 160His Ser Gln Gly Thr Asn Gly Thr Cys Tyr Leu Glu Phe Arg Lys Asp165 170 175Gln Leu Ala Ile Leu Leu Pro Val Arg Leu Glu Met Ala Val Val Leu180 185 190Phe Val Val Pro Leu Ile Ile Thr Ser Tyr Cys Tyr Ser Arg Leu Val195 200 205Trp Ile Leu Gly Arg Gly Gly Ser His Arg Arg Gln Arg Arg Val Lys210 215 220Gly Leu Leu Ala Ala Thr Leu Leu Asn Phe Leu Val Cys Phe Gly Pro225 230 235 240Tyr Asn Val Ser His Val Val Gly Tyr Ile Cys Gly Glu Ser Pro Ala245 250 255Trp Arg Ile Tyr Val Thr Leu Leu Ser Thr Leu Asn Ser Cys Val Asp260 265 270Pro Phe Val Tyr Tyr Phe Ser Ser Ser Gly Phe Gln Ala Asp Phe His275 280 285Glu Leu Leu Arg Arg Leu Cys Gly Leu Trp Gly Gln Trp Gln Gln Glu
290 295 300Ser Ser Met Glu Leu Lys Glu Gln Lys Gly Gly Glu Glu Gln Arg Ala305 310 315 320Asp Arg Pro Ala Glu Arg Lys Thr Ser Glu His Ser Gln Gly Cys Gly325 330 335Thr Gly Gly Gln Val Ala Cys Ala Glu Ser340 345(276)SEQ ID NO275的资料(i)序列特(A)长度993个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO275的序列描述ATGCTGCCGG ACTGGAAGAG CTCCTTGATC CTCATGGCTT ACATCATCAT CTTCCTCACT 60GGCCTCCCTG CCAACCTCCT GGCCCTGCGG GCCTTTGTGG GGCGGATCCG CCAGCCCCAG 120CCTGCACCTG TGCACATCCT CCTGCTGAGC CTGACGCTGG CCGACCTCCT CCTGCTGCTG 180CTGCTGCCCT TCAAGATCAT CGAGGCTGCG TCGAACTTCC GCTGGTACCT GCCCAAGGTC 240GTCTGCGCCC TCACGAGTTT TGGCTTCTAC AGCAGCATCT ACTGCAGCAC GTGGCTCCTG 300GCGGGCATCA GCATCGAGCG CTACCTGGGA GTGGCTTTCC CCGTGCAGTA CAAGCTCTCC 360CGCCGGCCTC TGTATGGAGT GATTGCAGCT CTGGTGGCCT GGGTTATGTC CTTTGGTCAC 420TGCACCATCG TGATCATCGT TCAATACTTG AACACGACTG AGCAGGTCAG AAGTGGCAAT 480GAAATTACCT GCTACGAGAA CTTCACCGAT AACCAGTTGG ACGTGGTGCT GCCCGTGCGG 540CTGGAGCTGT GCCTGGTGCT CTTCTTCATC CCCATGGCAG TCACCATCTT CTGCTACTGG 600CGTTTTGTGT GGATCATGCT CTCCCAGCCC CTTGTGGGGG CCCAGAGGCG GCGCCGAGCC 660AAGGGGCTGG CTGTGGTGAC GCTGCTCAAT TTCCTGGTGT GCTTCGGACC TTACAACGTG 720TCCCACCTGG TGGGGTATCA CCAGAGAAAA AGCCCCTGGT GGCGGTCAAT AGCCGTGGTG 780TTCAGTTCAC TCAACGCCAG TCTGGACCCC CTGCTCTTCT ATTTCTCTTC TTCAGTGGTG 840CGCAGGGCAT TTGGGAGAGG GCTGCAGGTG CTGCGGAATC AGGGCTCCTC CCTGTTGGGA 900CGCAGAGGCA AAGACACAGC AGAGGGGACA AATGAGGACA GGGGTGTGGG TCAAGGAGAA 960GGGATGCCAA GTTCGGACTT CACTACAGAG TAG 993(277)SEQ ID NO276的资料(i)序列特征(A)长度330个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO276的序列描述Met Leu Pro Asp Trp Lys Ser Ser Leu Ile Leu Met Ala Tyr Ile Ile1 5 10 15Ile Phe Leu Thr Gly Leu Pro Ala Asn Leu Leu Ala Leu Arg Ala Phe20 25 30Val Gly Arg Ile Arg Gln Pro Gln Pro Ala Pro Val His Ile Leu Leu35 40 45Leu Ser Leu Thr Leu Ala Asp Leu Leu Leu Leu Leu Leu Leu Pro Phe50 55 60Lys Ile Ile Glu Ala Ala Ser Asn Phe Arg Trp Tyr Leu Pro Lys Val65 70 75 80Val Cys Ala Leu Thr Ser Phe Gly Phe Tyr Ser Ser Ile Tyr Cys Ser85 90 95Thr Trp Leu Leu Ala Gly Ile Ser Ile Glu Arg Tyr Leu Gly Val Ala100 105 110Phe Pro Val Gln Tyr Lys Leu Ser Arg Arg Pro Leu Tyr Gly Val Ile115 120 125Ala Ala Leu Val Ala Trp Val Met Ser Phe Gly His Cys Thr Ile Val130 135 140Ile Ile Val Gln Tyr Leu Asn Thr Thr Glu Gln Val Arg Ser Gly Asn145 150 155 160Glu Ile Thr Cys Tyr Glu Asn Phe Thr Asp Asn Gln Leu Asp Val Val165 170 175Leu Pro Val Arg Leu Glu Leu Cys Leu Val Leu Phe Phe Ile Pro Met180 185 190Ala Val Thr Ile Phe Cys Tyr Trp Arg Phe Val Trp Ile Met Leu Ser195 200 205Gln Pro Leu Val Gly Ala Gln Arg Arg Arg Arg Ala Lys Gly Leu Ala210 215 220Val Val Thr Leu Leu Asn Phe Leu Val Cys Phe Gly Pro Tyr Asn Val225 230 235 240Ser His Leu Val Gly Tyr His Gln Arg Lys Ser Pro Trp Trp Arg Ser245 250 255Ile Ala Val Val Phe Ser Ser Leu Asn Ala Ser Leu Asp Pro Leu Leu260 265 270Phe Tyr Phe Ser Ser Ser Val Val Arg Arg Ala Phe Gly Arg Gly Leu275 280 285Gln Val Leu Arg Asn Gln Gly Ser Ser Leu Leu Gly Arg Arg Gly Lys290 295 300Asp Thr Ala Glu Gly Thr Asn Glu Asp Arg Gly Val Gly Gln Gly Glu305 310 315 320Gly Met Pro Ser Ser Asp Phe Thr Thr Glu325 330(278)SEQ ID NO277的资料(i)序列特征(A)长度2724个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO277的序列描述ATGGACACCT CCCGGCTCGG TGTGCTCCTG TCCTTGCCTG TGCTGCTGCA GCTGGCGACC 60GGGGGCAGCT CTCCCAGGTC TGGTGTGTTG CTGAGGGGCT GCCCCACACA CTGTCATTGC 120GAGCCCGACG GCAGGATGTT GCTCAGGGTG GACTGCTCCG ACCTGGGGCT CTCGGAGCTG 180CCTTCCAACC TCAGCGTCTT CACCTCCTAC CTAGACCTCA GTATGAACAA CATCAGTCAG 240CTGCTCCCGA ATCCCCTGCC CAGTCTCCGC TTCCTGGAGG AGTTACGTCT TGCGGGAAAC 300GCTCTGACAT ACATTCCCAA GGGAGCATTC ACTGGCCTTT ACAGTCTTAA AGTTCTTATG 360CTGCAGAATA ATCAGCTAAG ACACGTACCC ACAGAAGCTC TGCAGAATTT GCGAAGCCTT 420CAATCCCTGC GTCTGGATGC TAACCACATC AGCTATGTGC CCCCAAGCTG TTTCAGTGGC 480CTGCATTCCC TGAGGCACCT GTGGCTGGAT GACAATGCGT TAACAGAAAT CCCCGTCCAG 540GCTTTTAGAA GTTTATCGGC ATTGCAAGCC ATGACCTTGG CCCTGAACAA AATACACCAC 600ATACCAGACT ATGCCTTTGG AAACCTCTCC AGCTTGGTAG TTCTACATCT CCATAACAAT 660AGAATCCACT CCCTGGGAAA GAAATGCTTT GATGGGCTCC ACAGCCTAGA GACTTTAGAT 720TTAAATTACA ATAACCTTGA TGAATTCCCC ACTGCAATTA GGACACTCTC CAACCTTAAA 780GAACTAGGAT TTCATAGCAA CAATATCAGG TCGATACCTG AGAAAGCATT TGTAGGCAAC 840CCTTCTCTTA TTACAATACA TTTCTATGAC AATCCCATCC AATTTGTTGG GAGATCTGCT 900TTTCAACATT TACCTGAACT AAGAACACTG ACTCTGAATG GTGCCTCACA AATAACTGAA 960TTTCCTGATT TAACTGGAAC TGCAAACCTG GAGAGTCTGA CTTTAACTGG AGCACAGATC 1020TCATCTCTTC CTCAAACCGT CTGCAATCAG TTACCTAATC TCCAAGTGCT AGATCTGTCT 1080TACAACCTAT TAGAAGATTT ACCCAGTTTT TCAGTCTGCC AAAAGCTTCA GAAAATTGAC 1140CTAAGACATA ATGAAATCTA CGAAATTAAA GTTGACACTT TCCAGCAGTT GCTTAGCCTC 1200CGATCGCTGA ATTTGGCTTG GAACAAAATT GCTATTATTC ACCCCAATGC ATTTTCCACT 1260TTGCCATCCC TAATAAAGCT GGACCTATCG TCCAACCTCC TGTCGTCTTT TCCTATAACT 1320GGGTTACATG GTTTAACTCA CTTAAAATTA ACAGGAAATC ATGCCTTACA GAGCTTGATA 1380TCATCTGAAA ACTTTCCAGA ACTCAAGGTT ATAGAAATGC CTTATGCTTA CCAGTGCTGT 1440GCATTTGGAG TGTGTGAGAA TGCCTATAAG ATTTCTAATC AATGGAATAA AGGTGACAAC 1500AGCAGTATGG ACGACCTTCA TAAGAAAGAT GCTGGAATGT TTCAGGCTCA AGATGAACGT 1560GACCTTGAAG ATTTCCTGCT TGACTTTGAG GAAGACCTGA AAGCCCTTCA TTCAGTGCAG 1620TGTTCACCTT CCCCAGGCCC CTTCAAACCC TGTGAACACC TGCTTGATGG CTGGCTGATC 1680AGAATTGGAG TGTGGACCAT AGCAGTTCTG GCACTTACTT GTAATGCTTT GGTGACTTCA 1740ACAGTTTTCA GATCCCCTCT GTACATTTCC CCCATTAAAC TGTTAATTGG GGTCATCGCA 1800GCAGTGAACA TGCTCACGGG AGTCTCCAGT GCCGTGCTGG CTGGTGTGGA TGCGTTCACT 1860TTTGGCAGCT TTGCACGACA TGGTGCCTGG TGGGAGAATG GGGTTGGTTG CCATGTCATT 1920GGTTTTTTGT CCATTTTTGC TTCAGAATCA TCTGTTTTCC TGCTTACTCT GGCAGCCCTG 1980GAGCGTGGGT TCTCTGTGAA ATATTCTGCA AAATTTGAAA CGAAAGCTCC ATTTTCTAGC 2040CTGAAAGTAA TCATTTTGCT CTGTGCCCTG CTGGCCTTGA CCATGGCCGC AGTTCCCCTG 2100CTGGGTGGCA GCAAGTATGG CGCCTCCCCT CTCTGCCTGC CTTTGCCTTT TGGGGAGCCC 2160AGCACCATGG GCTACATGGT CGCTCTCATC TTGCTCAATT CCCTTTGCTT CCTCATGATG 2220ACCATTGCCT ACACCAAGCT CTACTGCAAT TTGGACAAGG GAGACCTGGA GAATATTTGG 2280GACTGCTCTA TGAAAAAACA CATTGCCCTG TTGCTCTTCA CCAACTGCAT CCTAAACTGC 2340CCTGTGGCTT TCTTGTCCTT CTCCTCTTTA ATAAACCTTA CATTTATCAG TCCTGAAGTA 2400ATTAAGTTTA TCCTTCTGGT GGTAGTCCCA CTTCCTGCAT GTCTCAATCC CCTTCTCTAC 2460ATCTTGTTCA ATCCTCACTT TAAGGAGGAT CTGGTGAGCC TGAGAAAGCA AACCTACGTC 2520TGGACAAGAT CAAAACACCC AAGCTTGATG TCAATTAACT CTGATGATGT CGAAAAACAG 2580TCCTGTGACT CAACTCAAGC CTTGGTAACC TTTACCAGCT CCAGCATCAC TTATGACCTG 2640CCTCCCAGTT CCGTGCCATC ACCAGCTTAT CCAGTGACTG AGAGCTGCCA TCTTTCCTCT 2700GTGGCATTTG TCCCATGTCT CTAA 2724(279)SEQ ID NO278的资料(i)序列特征(A)长度907个氨基酸(B)类型氨基酸(C)链型(D)拓扑学不相关(ii)分子类型蛋白质(xi)SEQ ID NO278的序列描述Met Asp Thr Ser Arg Leu Gly Val Leu Leu Ser Leu Pro Val Leu Leu1 5 10 15Gln Leu Ala Thr Gly Gly Ser Ser Pro Arg Ser Gly Val Leu Leu Arg20 25 30Gly Cys Pro Thr His Cys His Cys Glu Pro Asp Gly Arg Met Leu Leu35 40 45Arg Val Asp Cys Ser Asp Leu Gly Leu Ser Glu Leu Pro Ser Asn Leu50 55 60Ser Val Phe Thr Ser Tyr Leu Asp Leu Ser Met Asn Asn Ile Ser Gln65 70 75 80Leu Leu Pro Asn Pro Leu Pro Ser Leu Arg Phe Leu Glu Glu Leu Arg85 90 95Leu Ala Gly Asn Ala Leu Thr Tyr Ile Pro Lys Gly Ala Phe Thr Gly100 105 110Leu Tyr Ser Leu Lys Val Leu Met Leu Gln Asn Asn Gln Leu Arg His115 120 125Val Pro Thr Glu Ala Leu Gln Asn Leu Arg Ser Leu Gln Ser Leu Arg130 135 140Leu Asp Ala Asn His Ile Ser Tyr Val Pro Pro Ser Cys Phe Ser Gly145 150 155 160Leu His Ser Leu Arg His Leu Trp Leu Asp Asp Asn Ala Leu Thr Glu165 170 175Ile Pro Val Gln Ala Phe Arg Ser Leu Ser Ala Leu Gln Ala Met Thr180 185 190Leu Ala Leu Asn Lys Ile His His Ile Pro Asp Tyr Ala Phe Gly Asn195 200 205Leu Ser Ser Leu Val Val Leu His Leu His Asn Asn Arg Ile His Ser210 215 220Leu Gly Lys Lys Cys Phe Asp Gly Leu His Ser Leu Glu Thr Leu Asp225 230 235 240Leu Asn Tyr Asn Asn Leu Asp Glu Phe Pro Thr Ala Ile Arg Thr Leu245 250 255Ser Asn Leu Lys Glu Leu Gly Phe His Ser Asn Asn Ile Arg Ser Ile260 265 270Pro Glu Lys Ala Phe Val Gly Asn Pro Ser Leu Ile Thr Ile His Phe275 280 285Tyr Asp Asn Pro Ile Gln Phe Val Gly Arg Ser Ala Phe Gln His Leu290 295 300Pro Glu Leu Arg Thr Leu Thr Leu Asn Gly Ala Ser Gln Ile Thr Glu305 310 315 320Phe Pro Asp Leu Thr Gly Thr Ala Asn Leu Glu Ser Leu Thr Leu Thr325330 335Gly Ala Gln Ile Ser Ser Leu Pro Gln Thr Val Cys Asn Gln Leu Pro340 345 350Asn Leu Gln Val Leu Asp Leu Ser Tyr Asn Leu Leu Glu Asp Leu Pro355 360 365Ser Phe Ser Val Cys Gln Lys Leu Gln Lys Ile Asp Leu Arg His Asn370 375 380Glu Ile Tyr Glu Ile Lys Val Asp Thr Phe Gln Gln Leu Leu Ser Leu385 390 395 400Arg Ser Leu Asn Leu Ala Trp Asn Lys Ile Ala Ile Ile His Pro Asn405 410 415Ala Phe Ser Thr Leu Pro Ser Leu Ile Lys Leu Asp Leu Ser Ser Asn420 425 430Leu Leu Ser Ser Phe Pro Ile Thr Gly Leu His Gly Leu Thr His Leu435 440 445Lys Leu Thr Gly Asn His Ala Leu Gln Ser Leu Ile Ser Ser Glu Asn450 455 460Phe Pro Glu Leu Lys Val Ile Glu Met Pro Tyr Ala Tyr Gln Cys Cys465 470 475 480Ala Phe Gly Val Cys Glu Asn Ala Tyr Lys Ile Ser Asn Gln Trp Asn485 490 495Lys Gly Asp Asn Ser Ser Met Asp Asp Leu His Lys Lys Asp Ala Gly500 505 510Met Phe Gln Ala Gln Asp Glu Arg Asp Leu Glu Asp Phe Leu Leu Asp515 520 525Phe Glu Glu Asp Leu Lys Ala Leu His Ser Val Gln Cys Ser Pro Ser530 535 540Pro Gly Pro Phe Lys Pro Cys Glu His Leu Leu Asp Gly Trp Leu Ile545 550 555 560Arg Ile Gly Val Trp Thr Ile Ala Val Leu Ala Leu Thr Cys Asn Ala565 570 575Leu Val Thr Ser Thr Val Phe Arg Ser Pro Leu Tyr Ile Ser Pro Ile580 585 590Lys Leu Leu Ile Gly Val Ile Ala Ala Val Asn Met Leu Thr Gly Val595 600 605Ser Ser Ala Val Leu Ala Gly Val Asp Ala Phe Thr Phe Gly Ser Phe610 615 620Ala Arg His Gly Ala Trp Trp Glu Asn Gly Val Gly Cys His Val Ile625 630 635 640Gly Phe Leu Ser Ile Phe Ala Ser Glu Ser Ser Val Phe Leu Leu Thr645 650 655Leu Ala Ala Leu Glu Arg Gly Phe Ser Val Lys Tyr Ser Ala Lys Phe660 665 670Glu Thr Lys Ala Pro Phe Ser Ser Leu Lys Val Ile Ile Leu Leu Cys675 680 685Ala Leu Leu Ala Leu Thr Met Ala Ala Val Pro Leu Leu Gly Gly Ser690 695 700Lys Tyr Gly Ala Ser Pro Leu Cys Leu Pro Leu Pro Phe Gly Glu Pro705 710 715 720Ser Thr Met Gly Tyr Met Val Ala Leu Ile Leu Leu Asn Ser Leu Cys725 730 735Phe Leu Met Met Thr Ile Ala Tyr Thr Lys Leu Tyr Cys Asn Leu Asp740 745 750Lys Gly Asp Leu Glu Asn Ile Trp Asp Cys Ser Met Lys Lys His Ile755 760 765Ala Leu Leu Leu Phe Thr Asn Cys Ile Leu Asn Cys Pro Val Ala Phe770 775 780Leu Ser Phe Ser Ser Leu Ile Asn Leu Thr Phe Ile Ser Pro Glu Val785 790 795 800Ile Lys Phe Ile Leu Leu Val Val Val Pro Leu Pro Ala Cys Leu Asn805 810 815Pro Leu Leu Tyr Ile Leu Phe Asn Pro His Phe Lys Glu Asp Leu Val820 825 830Ser Leu Arg Lys Gln Thr Tyr Val Trp Thr Arg Ser Lys His Pro Ser835 840 845Leu Met Ser Ile Asn Ser Asp Asp Val Glu Lys Gln Ser Cys Asp Ser850 855 860Thr Gln Ala Leu Val Thr Phe Thr Ser Ser Ser Ile Thr Tyr Asp Leu865 870 875 880Pro Pro Ser Ser Val Pro Ser Pro Ala Tyr Pro Val Thr Glu Ser Cys885 890 895His Leu Ser Ser Val Ala Phe Val Pro Cys Leu900 905(280)SEQ ID NO279的资料(i)序列特征
(A)长度32个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO279的序列描述CATGCCAACC GGCCCGCGAG GCTGCTGCTG GT 32(281)SEQ ID NO280的资料(i)序列特征(A)长度32个碱基对(B)类型核酸(C)链型单链(D)拓扑学线形(ii)分子类型DNA(基因组的)(xi)SEQ ID NO280的序列描述ACCAGCAGCA GCCTCGCGGG CCGGTTGGCA TG 3权利要求
1.一种内源的人孤儿G蛋白偶联受体(GPCR)的组成型活化的非内源形式,该受体含有下列氨基酸残基(从C-末端到N-末端走向),它们横跨非内源的GPCR的跨膜-6(TM6)和细胞内环-3(IC3)区域P1AA15X其中(1)P1是位于非内源GPCR的TM6区域内的一个氨基酸残基,其中,P1选自(i)内源的孤儿GPCR脯氨酸残基和(ii)除脯氨酸之外的非内源的氨基酸残基;(2)AA15是15个氨基酸残基,它们选自(a)内源的孤儿GPCR的15个内源氨基酸残基、(b)15个非内源的氨基酸残基和(c)15个组合的氨基酸残基,其中含有内源孤儿GPCR的至少一个内源氨基酸残基和至少一个非内源氨基酸残基的组合,除非在位于GPCR的TM6区域内的15个内源氨基酸残基都不是脯氨酸;和(3)X是位于所说的非内源GPCR的IC3区域内的非内源氨基酸残基。
2.权利要求1的非内源人GPCR,其中P1是内源脯氨酸残基。
3.权利要求1的非内源人GPCR,其中P1是除脯氨酸残基之外的非内源氨基酸残基。
4.权利要求1的非内源人GPCR,其中AA15是内源GPCR的15个内源氨基酸残基。
5.权利要求1的非内源人GPCR,其中X是从由赖氨酸、组氨酸、精氨酸和丙氨酸残基中选择而来,但是当在所说的内源人GPCR的X位置的内源氨基酸是赖氨酸时,X是从由组氨酸、精氨酸和丙氨酸中选择而来。
6.权利要求1的非内源人GPCR,其中X是赖氨酸残基,但是当在所说的内源人GPCR的X位置的内源氨基酸是赖氨酸时,X是除赖氨酸以外的氨基酸。
7.权利要求4的非内源人GPCR,其中X是赖氨酸残基,但是当在所说的内源人GPCR的X位置的内源氨基酸是赖氨酸时,X是除赖氨酸以外的氨基酸。
8.权利要求1的非内源人GPCR,其中P1是脯氨酸残基,X是赖氨酸残基,但是当在所说的内源人GPCR的X位置的内源氨基酸是赖氨酸时,X是除赖氨酸以外的氨基酸。
9.一种包含权利要求1的非内源人GPCR的宿主细胞。
10.权利要求9的材料,其中所说的宿主细胞是来自哺乳动物。
11.权利要求1的非内源人GPCR,其为经纯化和分离后的形式。
12.一种编码内源人孤儿G蛋白偶联受体(GPCR)的组成型活化的、非内源形式的核酸序列,包括下列核酸序列区域,它们横跨孤儿GPCR的跨膜-6(TM6)和细胞内环-3(IC3)区域3’-P密码子(AA-密码子)15X密码子-3’其中(1)P密码子是位于非内源的GPCR的TM6区内的一个核酸编码区,其中P密码子编码从(i)内源GPCR的脯氨酸残基和(ii)除脯氨酸之外的非内源氨基酸残基中选择出来的氨基酸;(2)(AA-密码子)15是编码15个氨基酸的15个密码子,这些氨基酸残基选自(a)内源孤儿GPCR的15个内源氨基酸残基,(b)15个非内源的氨基酸残基,和(c)15个氨基酸残基的组合,该组合包括内源孤儿GPCR的至少一个内源氨基酸残基和至少一个非内源氨基酸残基,除非位于GPCR的TM6区域内的15个内源氨基酸残基都不是脯氨酸;和(3)X密码子是编码位于所说的非内源人GPCR的IC3区域内氨基酸残基的核酸编码区,其中X密码子编码非内源氨基酸。
13.权利要求12的核酸序列,其中P密码子编码内源脯氨酸残基。
14.权利要求12的核酸序列,其中P密码子编码不是脯氨酸的非内源脯氨酸残基。
15.权利要求12的核酸序列,其中X密码子编码非内源氨基酸,该氨基酸是从赖氨酸、组氨酸、精氨酸和丙氨酸中选择而来,但是当在所说的内源人GPCR的X位置的内源氨基酸是赖氨酸时,X密码子编码从组氨酸、精氨酸和丙氨酸中选择而来的氨基酸。
16.权利要求13的核酸序列,其中X密码子编码非内源赖氨酸,但是当在所说的内源人GPCR的X位置的内源氨基酸是赖氨酸时,X密码子编码从组氨酸、精氨酸和丙氨酸中选择而来的氨基酸。
17.权利要求12的核酸序列,其中X密码子是从由AAA、AAG、GCA、GCG、GCC和GCU组成的一组中选择而来。
18.权利要求12的核酸序列,其中X密码子是从由AAA和AAG组成的一组中选择而来。
19.权利要求12的核酸序列,其中P密码子是从由CCA、CCC、CCG和CCU组成的一组中选择而来,而X密码子是从由AAA和AAG组成的一组中选择而来。
20.一种含有权利要求12的核酸序列的载体。
21.一种含有权利要求12的核酸序列的质粒。
22.一种含有权利要求21的核酸序列的宿主细胞。
23.权利要求12的核酸序列,其为经纯化和分离后的形式。
24.一种选择改变在人G蛋白偶联受体(“GPCR”)的第三个细胞内环内的内源氨基酸残基的方法,其中所说的受体包括一个跨膜6区和一个细胞内环3区,并且当此内源氨基酸被改造为非内源氨基酸残基时,组成型活化所说的人GPCR,该方法包括如下步骤(a)识别在人GPCR的跨膜6区的内源脯氨酸残基;(b)通过从所说的GPCR的羧基末端区域指向所说的GPCR的氨基末端区域的方向上移动,来识别距离从所说的脯氨酸残基起算为第16位的内源氨基酸残基;(c)把步骤(b)的内源残基改造为非内源的氨基酸残基以创造内源人GPCR的非内源形式;和(d)确定步骤(c)的非内源人GPCR是否是组成型活化的。
25.权利要求24的方法,其中按从羧基末端到氨基末端走向距离跨膜6区内所说的脯氨酸残基两个残基的氨基酸是色氨酸。
26.一种由权利要求24的方法生产的组成型活性的、非内源人GPCR。
27.一种由权利要求25的方法生产的组成型活性的、非内源人GPCR。
28.一种创造内源人G蛋白偶联受体(GPCR)的非内源的、组成型活化形式的算法规则,其中所说的内源GPCR包括一个跨膜6区和一个细胞内环3区,该算法规则包括如下步骤(a)选择在跨膜-6区含有脯氨酸残基的内源人GPCR;(b)通过在从羧基末端指向氨基末端方向上从步骤(a)所说的脯氨酸残基起数16个氨基酸残基而识别出内源氨基酸残基;(c)把步骤(b)识别的氨基酸残基改造为非内源的氨基酸残基以创造内源人GPCR的非内源形式;和(d)确定步骤(c)的内源人GPCR的非内源形式是否是组成型活化的。
29.权利要求28的算法规则,其中在从羧基末端指向氨基末端方向上距离跨膜6区的所说的脯氨酸残基两个残基的氨基酸残基是色氨酸。
30.一种由权利要求28的算法规则产生的组成型活性的、非内源人GPCR。
31.一种由权利要求29的算法规则产生的组成型活性的、非内源人GPCR。
32.一种直接识别选自非内源的、组成型活化的人G蛋白偶联受体的反激活剂、激活剂和部分激活剂的化合物的方法,其中所说的受体含有一个跨膜-6区和细胞内环-3区,该方法包括如下步骤(a)选择内源人GPCR;(b)识别在步骤(a)的GPCR的跨膜-6区内的脯氨酸残基;(c)在从羧基末端指向氨基末端方向上识别从步骤(b)的脯氨酸残基起算为第16位的内源氨基酸残基;(d)把步骤(c)的内源氨基酸改造为非内源的氨基酸;(e)证实步骤(d)的非内源人GPCR是组成型活化的;(f)用步骤(e)的非内源的、组成型活化的GPCR接触候选化合物;和(g)通过测量所说的被接触的受体的化合物效应,确定所说的化合物是否是所说受体的反激活剂、激活剂或部分激活剂。
33.权利要求32的方法,其中步骤(d)的非内源氨基酸是赖氨酸。
34.一种经权利要求32的方法直接识别的化合物。
35.权利要求32的方法,其中被直接识别的化合物是反激活剂。
36.权利要求32的方法,其中被直接识别的化合物是激活剂。
37.权利要求32的方法,其中被直接识别的化合物是部分激活剂。
38.一种含有权利要求35所述的反激活剂的组合物。
39.一种含有权利要求36所述的反激活剂的组合物。
40.一种含有权利要求37的部分激活剂的组合物。
41.一种直接识别针对非内源的、组成型活化的人G蛋白偶联受体(“GPCR”)的反激活剂的方法,其中所说的GPCR含有一个跨膜-6区和细胞内环-3区,该方法包括如下步骤(a)选择内源人GPCR;(b)识别在步骤(a)的GPCR的跨膜-6区内的脯氨酸残基;(c)识别在从羧基末端指向氨基末端方向上从步骤(b)的脯氨酸残基起算为第16位的内源氨基酸残基;(d)把步骤(c)的内源氨基酸改造为非内源的赖氨酸残基;(e)证实步骤(d)的非内源人GPCR是组成型活化的;(f)用步骤(e)的非内源的、组成型活化的GPCR接触候选化合物;和(g)通过测量所说的被接触的受体的化合物效应,确定所说的化合物是否是所说受体的反激活剂。
42.一种经权利要求37的方法直接识别的反激活剂。
43.一种含有权利要求38所述的反激活剂的组合物。
全文摘要
在此公开的是内源人G蛋白偶联受体(GPCR)的组成型活化的非内源形式,该受体含有(a)下列氨基酸区域(从C-末端到N-末端走向)和/或(b)横跨GPCR的跨膜-6(TM6)和细胞内环-3(IC3)区域的下列核酸序列区域(3’到5’走向),分别是(a)P
文档编号C07D231/12GK1398298SQ99812091
公开日2003年2月19日 申请日期1999年10月12日 优先权日1998年10月13日
发明者多米尼克·P·比汉, 德里克·T·查默斯, 廖王蓁 申请人:阿瑞那制药公司
网友询问留言 已有0条留言
  • 还没有人留言评论。精彩留言会获得点赞!
1