实现改进的多肽表达的方法

文档序号:438662阅读:829来源:国知局

专利名称::实现改进的多肽表达的方法
技术领域
:本发明涉及在宿主细胞中生产多肽的方法,其中根据宿主细胞的密码子使用(codon-usage)、尤其是所使用的密码子对(codon-pair)对编码多肽的核苷酸序列进行了修饰,从而获得改进的编码多肽的核苷酸序列的表达和/或改进的多肽生产。
背景技术
:本发明涉及用于生产多肽的改进方法。大量途径已被用于产生用于蛋白质过表达和/或生产的菌株。这包括但不限于,制造下述菌株和应用强启动子序列,所述菌株具有多个拷贝的编码感兴趣的蛋白质(POI)的基因。每个特定氨基酸最少由一个密码子、最多由六个密码子编码。先前的研究已显示不同物种间编码细胞的多肽的基因中密码子使用是有偏向的(Kanaya,S,Y.Yamada,Y.KudoandT.Ikemura(1999)StudiesofcodonusageandtRNAgenesat18unicellularorganismsandquantificationof5acz'〃ws■smZ"/^tRNAs:geneexpressionlevelandspecies-specificdiversityofcodonusagebasedonmultivariateanalysis.Gewe238:143-155)。先前的出版物公开了对给定宿主细胞中密码子使用的优化(optimization),以改进多肽生产(以WO97/11086为例)。更具体地,WO03/70957描述了丝状真菌中用于生产植物多肽的优化密码子使用。在所有这些"经典的"密码子优化情况中,天然的密码子已被来自基因的参考组的最常用的密码子所取代,而针对每个氨基酸的密码子翻译率被设计为高(优化的)。更近来,在WO03/85114中描述了对密码子使用的调和(harmonization),其影响宿主生物的基因中所有密码子的分布,假定这些影响蛋白质折叠。7最近数年中,许多生物如5fl"〃"ssw6rtfc(Kunstetal.1997)、"fl"7/w,Z/o/一咖c^w、^pe/^'〃1^1(Peletal.,2007,NatBiotech.21:221-231)、《—vera,ces/acto、5"accAflram少cescerevz、/ae(http:Vwww.yeastgenome.org/)、多种植物基因组、小鼠、大鼠和人的经完全测序的基因组的可用性提供了相对于基因序列的天然表达水平(mRNA或蛋白质水平)来分析其自身的不同方面的可能性。一个良好的例子是密码子使用(偏向性)分析和后续的单个密码子优化。应当注意单个密码子优化在本文中被理解为表示密码子优化或密码子调和技术,所述技术关注密码子作为单个独立实体的优化,这与本发明主题的密码子对优化相反。尽管单个密码子使用(偏向性)之前己经被广泛研究(综述见Gustafssonetal.,2004,TrendsBiotechnol.22:346-353),但是对关于密码子对选择和密码子对优化的报道很少。已经例如针对AGG-AGG密码子(SpanjaardandvanDuin,1988,Proc.Natl.Acad.Sci.USA^:7967-7971;Gurvichetal.,2005,J.Bacteriol.巡:4023-432)和针对UUU-YNN位点(SchwarzandCurran,1997,NucleicAcidsRes.^:2005-2011)研究了少量特定的密码子对对五.co//中核糖体移码的影响。Gutman和Hatfield(1989,Proc.Natl,Acad.SciUSA巡:3699-3703)分析了用于五.co/,的所有可能的密码子对的更大的序列集合,并且发现密码子对是有定向偏向的。另外,他们观察到高度不足量表现(underrepresented)的对几乎是高表达的基因中过量表现的(overrepresented)对使用频率的两倍,而在弱表达的基因中,过量表现的对被更频繁地使用。US5,082,767(HatfieldandGutman,1992)公开了下述方法,所述方法测定生物中相对天然的密码子对偏好,并根据所述密码子对偏好改变感兴趣的基因的密码子对从而以预定的方式改变所述基因的翻译动力学,有针对EC0/Z和cerev^ae的例子。然而,在他们的方法中,Hatfield和Gutman仅优化了个别的相邻密码子对。另外,在他们的专利(US5,082,767)中要求通过被修饰的序列提高至少一部分基因的翻译动力学,所述被修饰的序列中密码子8对被改变以增加下述密码子对的数量,所述密码子与随机密码子对使用相比是生物中更加大量并且更加不足量表现的密码子对。本发明公开了通过被修饰的序列提高翻译的一种方法,所述被修饰的序列中密码子对被改变,以增加下述密码子对的数量,所述密码子与随机密码子对使用相比是生物中更加过量表现的密码子对。Mouraetal.(2005,GenomeBiology,《:R28)分析了整个《S.cem^sa6ORF组,但是没有发现约47%密码子对的统计学显著的偏向性。各自的数值在物种间差异,导致可以被称作密码子对使用的"物种特异指纹"的"密码子背景图(codoncontextmap)"。Boychevaetal.(2003,BioinformaticsiH8):987-998)通过在具有高和弱表达的基因中寻找过量表现的和不足量表现的密码子对,鉴定了五.co"中被称作假定衰减的(hypotheticallyattenuating)和假定非衰减的两个密码子对集合。然而,他们未提出应用该发现的方法,也未给出对他们的假设的任何实验证明。应当注意,这些组被定义为与Gutman和Hatfield(1989,1992:上文)定义的组(他们提出了高表达的基因中高度不足量表现的对的非衰减效应)完全相反。Buchan、Aucott禾口Stanfield(2006,M/c/e/c爿c/山T^sewc/MP):1015-1027)分析了相对密码子对偏向性而言的tRNA特性。对于偏向性在密码子对利用中的牵连而言,Irwinetal.(1995,J.Biol.Chem.270:22801-22806)证明了在五.co/Z中当用高度过量表现的密码子对替换高度不足量表现的密码子对时,合成率实际上大量降低,当将轻微不足量表现的密码子对替换为更高度不足量表现的密码子对时,合成率提高。这是非常显著的,因为这与人们考虑到单个密码子偏向性对蛋白质水平的影响后所预期的有些相反。然而,上述领域均未公开如何考虑下述事实来优化全长密码子序列的密码子对使用,所述事实为密码子对通过定义重叠,和每个个体密码子对因此影响重叠的上游和下游密码子对的偏向性。另外,上述领域均未公开将单个密码子以及密码子对二者的优化组合的方法。考虑到下述密码子对优化与单个密码子优化的任选的组合和所述密码子对重叠的密码子对优化9会大幅改进编码感兴趣的多肽的核苷酸序列的表达和/或改进所述多肽的生产。因此,本领域仍然需要新颖的方法,所述方法用于在宿主细胞中优化编码序列以改进多肽的生产。发明概述本发明的一个目的是提供下述方法,所述方法针对有效的基因转录和蛋白质翻译来优化编码序列。为达到此目的,本发明提供了编码预定的氨基酸序列的核苷酸序列的优化方法,其中所述编码序列针对在预定的宿主细胞中的表达被优化,所述方法包括(a)产生至少一条编码预定的氨基酸序列的原始编码序列;(b)通过用同义密码子替换该至少一条原始编码序列中的一个或多个密码子,从该至少一条原始编码序列产生至少一条新产生的编码序列;(c)测定所述至少一条原始编码序列的适合度值和所述至少一条新产生的编码序列的适合度值,同时使用下述适合度函数,所述函数针对预定的宿主细胞至少测定单个密码子适合度和密码子对适合度之一;(d)根据预定的选择标准,在所述至少一条原始编码序列和所述至少一条新产生的编码序列中选择一条或多条选定的编码序列,所述适合度值越高,被选择的机会越高;和(e)重复动作b)到d),同时在动作b)到d)中将所述一条或多条选定的编码序列作为一条或多条原始编码序列处理,直至满足预定的迭代终止标准(iterationstopcriterion)。在一些实施方案中,本发明涉及例如单个密码子使用、密码子调和、二核苷酸使用的方面,并涉及密码子对偏向性。该方法可以通过在下述计算机上运行的计算机程序进行,所述计算机程序使用可以在MATLAB(http://Www.mathWOrkS.COm/)中完成的序列分析和序列最优化的数学算法。除了正的密码子最优化(例如以正的方式调控基因表达和蛋白质生产)以外,本发明还提供了使密码子适应"不良"密码子对的方法(即负的密码子对最优化)。后一方法适用于对照目的以及以负的方式调控基因表达。附图概述应当观察到,将参考下述若干附图来阐述本发明,所述附图仅旨在阐述本发明,而不限制本发明的范围,本发明的范围仅由附带的权利要求书及其等价物所定义。图l显示了计算机装置,本发明的方法可以在所述装置上进行。图2显示了本发明的实施方案的流程图。图3显示了不同生物中3,721个正义:正义密码子对的密码子对偏向性数值分布。每个直方图右上角的数字是观察到的分布的标准差;对所有生物而言均值(未显示)在-0.06和-0.01之间。图4显示了多种生物的密码子对偏向性的相关性。相关系数在每个小图的右上角显示。图5显示了A的密码子偏向性图谱。偏向性数值范围从-0.67到0.54,而在其它生物中它们甚至可以稍高于+-0.9(也见图3)。这些图表中最高强度的黑色代表0.9(图5A和5C为正值,原始为绿色)和-0.9(图5B和5D为负值,原始为红色)的数值。在图5A和B中,行和列根据它们密码子的字母顺序排列。在图5C和5D中,行根据下述的字母顺序排列第三位核苷酸作为首要的排列标准,中间位置核苷酸作为次要排列标准,第一位核苷酸作为第三排列标准。图6显示了及仰M/m的密码子偏向性图谱。偏向性数值范围从-0.97到0.87,而在其它生物中它们甚至可以稍高于+-0.9(也见图3)。这些图表中最高强度的黑色代表0.9(图6A为正值,原始为绿色)和-0.9(图6B为负值,原始为红色)的数值。图7显示了五.co/z'的密码子偏向性图谱。偏向性数值范围从-0.97到0.85,而在其它生物中它们甚至可以稍高于+-0.9(也见图3)。这些图表中最高强度的黑色代表0.9(图7A为正值,原始为绿色)和-0.9(图7B为负值,原始为红色)的数值。图8显示了与前文图5-7类似的Amgw的479个高度转录的基因的密码子偏向性图谱。这些图表中最高强度的黑色代表0.9(图8A为正值,原始为绿色)和-0.9(图8B为负值,原始为红色)的数值。该组中最大的偏ii向性数值为-1,即一些可能的密码子对完全不存在,尽管它们各自的密码子和所编码的氨基酸对存在。这可以是与全基因组中5,885,942个相比更小量的188,067个密码子对的结果。然而,主要的原因应当是这类对的真实不足量表现,其由高表达的基因中的选择引起。图9显示了A中一组479个高表达的基因的偏向性(垂直轴)对所有基因偏向性(水平轴)的散布图。不涉及终止密码子的所有3,721个密码子对被示出。从浅灰到黑的颜色根据整个基因组中z-评分的绝对值指定(即图中的亮点在所有的基因中不具有显著的偏向性),大小根据高表达的组中绝对z-评分指定,即非常小的点在其中不具有显著的偏向性(此处|2-评分|<1.9)。黑色实线指出两个偏向性数值相等的地方;虚线表示实际相关性的最佳线性近似(通过主成分分析确定);其斜率为2.1左右。图10:与其转录水平的对数相比,4,584个Am'gw基因的适合度值。相关系数为-0.62。图11显示了单个密码子优化与密码子对优化相比。野生型(/^c(gFUA^0.165,^一(gFUA)-0.033)不适合该图(其应该在右上远处)。清楚的是C/7Z'参数确定单个密码子和密码子对适合度之间的权衡。最优的基因始终是具有最低y^e和y^值的基因。考虑到点的位置仍然不清楚对哪个c;^值可获得最好的基因,因为我们还不知道单个密码子使用或密码子对使用哪个更重要。尽管实施例表明除单个密码子适合度以外密码子对适合度也非常重要的有力证据,这表示c声应当被选择为至少X)。图12展示了两幅图,其显示了上述FUA的(499个中)最初20个密码子的序列品质(也见实施例2)。黑点指出了期望的密码子比例,而x-标记显示实际的密码子比例(在整个基因中),二者通过虚线连接。然后单个密码子适合度可以被解释为这些虚线长度的平均值(注意对于位置4和5上预期的和实际的比例相等的密码子例如TGG(其没有同义密码子)而言,该"长度"为零;还注意"长度"不能为负)。黑色条反过来显示两个相邻密码子形成的对的权重。黑色点(中间,条下方)指出编码相同12的二肽的任何密码子对的最小权重。然后密码子对适合度是这些条的平均高度(注意此处使用的高度可以适当地为负)。图13指出使用本发明所述的遗传算法途径用于优化amyB基因的,c。油.的趋同(convergence),其得到SEQIDNO.6。图14为了解释指出单个密码子分布图表(如例如图15中所示的分布图表)的部分。两幅图指出编码苯丙氨酸的两个同义密码子UUU(上图)和UUC(下图)的单个密码子使用。两幅图的X轴和Y轴均从0%到100%。灰色直方图是一组250个高表达的A"/gw基因针对每个氨基酸(同义密码子组)标准化的密码子使用直方图,其中所述基因包括在具有0%、>0画<10%、10-<20%、…、90画<100%、100%的组中。例如,50%的高表达的基因落入具有0%UUU密码子使用和因此100%uuc密码子使用以编码苯丙氨酸的组中。白色条给出关于直方图的相似库(bin)中基因A(在该情况下为WTamyB)的密码子使用;因此对基因A而言100。%在库20-30%(20%是3/15的密码子为UUU)中,因而100%在库80-<90%(80%是12/15的密码子为UUC)中。黑色条给出基因B(在该情况下为amyB的单个密码子优化的变体)的统计学。可以用类似的方式产生16乘4的图,以显示所有64个密码子的统计学,见例如图15。图15(第1和第2部分)描述了单个密码子优化的amyB基因(黑色)与野生型amyB基因(白色)相比的三个密码子频率。灰色直方图描述了Am'ger中250个高表达的基因的统计学。显而易见,某些密码子如半胱氨酸(UGU/UGC)、组氨酸(CAU/CAC)、酪氨酸(UAU/UAC)和其它的密码子被用来进行了真正的改进。图16(第l和第2部分)描述了已经关于单个密码子和密码子对二者被优化的amyB基因(黑色)与野生型amyB(白色)相比的单个密码子频率。灰色直方图描述了中250个高表达的基因的统计学。显而易见,这些图高度类似图15中描述的单个密码子优化的基因的情况。图17描述了Am'ger的WTamyB基因单个密码子和密码子对统计学全图(图18)的部分。X轴上是在位置1上用起始密码子ATG起始的基因中后续的密码子。黑点"."指出该位置上的密码子相对于其同义密码子13的目标单个密码子比例。对ATG而言其为1.0(100%)。交叉"x"是所示基因中实际的密码子比例;虚线显示目标比例与实际比例之间的差异。密码子权重为-1和1之间的数值。条指出相邻密码子的实际密码子对权重,而五角星指出最优的能达到的同义密码子对的权重(不考虑相邻的对)。例如,第一条为-0.23,其为'ATG-GTC'的权重,第二条为0.66,其为'GTC-GCG'的权重。图18描述了SEQIDNO.2(WTAmyB)的单个密码子和密码子对统计学。图19描述了SEQIDNO.5(单个密码子优化的AmyB)的单个密码子和密码子对统计学。图20描述了SEQIDNO.6(单个密码子和密码子对优化的WTAmyB)的单个密码子和密码子对统计学。图21描述了表达载体pGBFINFUA-1的质粒图谱。图21还提供了质粒pGBFINFUA-2和pGBFINFUA-3的代表性图谱。所有的克隆来自pGBFIN-12(在W099/32617中描述)表达载体。其标示出了相对于a/w;;B启动子变体序列和编码ce-淀粉酶的Am'geramyBcDNA序列的g/"A侧翼区。可以在转化菌株之前通过用限制性酶消化去除五.co/z'DNA。图22描述了通过单次同源重组整合的图示。表达载体包含选择性amdS标记物和与amyB基因相连的g/aA启动子。这些特征侧翼是g/aA基因座的同源区(分别为3'g/aA和3"g/aA),以指导在基因组g/aA基因座处的整合。图23描述了表达三种不同构建体的Am'gw菌株培养液中a淀粉酶的活性。描述了表达天然am;^构建体的Am'gw菌株培养液中Q!淀粉酶的活性,其中(l)翻译起始序列和翻译终止序列被修饰(pGBFINFUA-l),和(2)翻译起始序列、翻译终止序列和单个密码子使用被修饰(pGBFINFUA-2),和(3)翻译起始序列、翻译终止序列和单个密码子使用与密码子对使用被修饰(pGBFINFUA-3),所述修饰根据本发明的方法进行。a-淀粉酶活性以相对单位[AU]表示,第4天FUA1组的10个菌株中6个单拷贝菌株的均值设定为100%。所示的每组10个转化体是被独立地分离和培养的转化体。图24(A和B)描述了5ac///^物种单个密码子优化的单个密码子频率。图14给出了对小图的解释。灰色直方图代表及^^"fo中50个最高表达的基因的密码子分布,见文本。黑色条指出了目标单个密码子频率。图25描述了针对SEQIDNO.14(1/3)、SEQIDNO.17(2/3)和SEQIDNO.14(3/3)的单个密码子和密码子对统计学,所述序列分别使用密码子对+单个密码子(1/3)、单个密码子(2/3)和负密码子对+单个密码子优化(3/3)被优化。关于图的解释见图17。图26.E.co/^g"cz7/^穿梭载体pBHA-12。其中指出了多克隆位点(MCS)1禾口2。图27.在五.co/z'/5fl"'〃w穿梭载体pBHA-12中克隆基因的例子。图显示了SEQIDNO.9被克隆的部分A和B(灰色箭头)。其中指出了1A部分的克隆位点iVrfel和5amHI,1B部分的克隆位点S/wal和^wl。使用PvwII切除五.co/z'部分。发明详述除单个密码子偏向性外,核苷酸序列中的其它结构也可能影响蛋白质表达,所述结构例如二核苷酸或某些短核苷酸序列的重复(密码子使用终究能够按照读码框成直线以三核苷酸序列的模式重复)。该工作呈现了鉴定某些密码子对优选级(preference)的方法,即密码子是否像它们是根据所鉴定的密码子使用比例被选择然后随机分布在基因(关于氨基酸序列)中那样来出现在基因中,或者,是否一些密码子更经常紧邻某些密码子出现和更不经常紧邻其它密码子出现。对密码子对(codonpair)的分析还包括其它方面,即读码框周围的二核苷酸使用,和对与密码子紧邻的某单个核苷酸可能的优选级。本发明公开了针对给定的宿主生物来产生密码子对偏向性表格的方法,藉此经测序的全基因组的所有经鉴定的ORF之任一被用作输入值或选定的基因(例如高表达的基因)组。本发明公开了一种方法,其中如此鉴定的密码子对偏15向性表格随后被用于在感兴趣的基因(GOI)中优化密码子对分布,以改善相应的感兴趣的蛋白质(POI)的表达。单个密码子优化提供了改进感兴趣的蛋白质的表达水平的一个良好起点。尽管其它人针对具有低丰度的tRNA插入tRNA基因的额外拷贝,尝试通过宿主细胞的适应克服由感兴趣的基因中存在被拒绝的密码子引起的缺点(例如StmtageneBL-21CodonPlusTM感受态细胞、NovagenRosettaTM宿主菌株,均为五.co/z'),但是本发明人专注于感兴趣的基因自身的适应。基因组中不想要的密码子已被替换为同义密码子,使得得到的序列的单个密码子分布尽可能地接近先前鉴定的期望的密码子比例。然而,这种密码子调和仍然具有非常大量可能的基因是同样"最适"的,因为优化的基因中总体密码子分布是选择标准,因此能够容易地考虑密码子序列的其它期望的特性,例如不存在某些酶的限制性位点或已知引起移码的密码子对。更进一步,人们能够将密码子对使用优化至有限的程度。但是当(例如朝向最丰富的密码子对使用)优化基因的密码子对时,得到的序列的单个密码子使用可能不接近最适度,因为可能存在由不足量表现的单个密码子组成的优选的密码子对,因此必须找到单个密码子和密码子对优化之间的平衡。本发明公开了允许平衡单个密码子和密码子对优化二者的方法。下述密码子对优化大幅地改进了编码感兴趣的多肽的核苷酸序列的表达和/或改进了所述多肽的生产,所述密码子对优化考虑了密码子对重叠和所述密码子对优化与单个密码子优化的任选组合。在本发明的上下文中,核苷酸编码序列或编码序列被定义为编码多肽的核苷酸序列。编码序列的边界一般由位于mRNA5'端开放读码框开端的起始密码子(在真核生物中通常为ATG,而在原核生物中其可以是ATG、CTG、GTG、TTG之一)和位于mRNA3'端开放读码框紧下游的终止密码子(一般为TAA、TGA、TAG之一,尽管存在该"通用"编码的例外)界定。编码序列可包括但不限于DNA、cDNA、RNA和重组核酸(DNA、cDNA、RNA)序列(注意本领域公知在RNA中尿嘧啶U代替脱氧核苷酸胸腺嘧啶T)。如果编码序列旨在用于在真核细胞中表达,则多聚腺苷酸化信号和转录终止序列通常会位于编码序列的3'。编码序列包括翻16任选的信号序列,和任选的一条或多条内含子序列。尽管术语"编码序列"和"基因"严格地讲不表示相同的实体,但是两个术语在本文中频繁地可交换地使用,技术人员会根据上下文明白该术语是表示全基因还是仅表示其编码序列。用于单个密码子和/或密码子对适应(codonpairadaptation)的方法和计算机设置对于高表达的基因的单个密码子使用特性而言,在所有基因和一组高表达基因中的单个密码子比例"手动"比较导致用于改善基因表达水平的一些"期望的密码子比例"。然后可以如下进行基因的单个密码子适应(l)计算基因中的实际比例,重复地(例如随机地)挑出其期望的比例比实际比例低的密码子并将其替换为具有过低比例的同义密码子;或(2)使用"期望的密码子比例"来计算每个密码子的期望数,制造多组同义密码子,并重复地针对基因中的每个位置从编码预定的氨基酸的同义组中(例如随机地)挑出密码子;使用方法(l)和/或(2)制造多重变体,并基于额外的选择标准挑出最相关的基因(例如想要的和不想要的限制性位点和/或折叠能量)。然而该途径不适合密码子对适应,首先因为考虑到其复杂性,针对所有密码子对的偏向性数据的视觉检查是不可能的,其次因为改变一个密码子对(这表示替换两个参与的密码子中至少一个)也会影响至少一个邻近的密码子对,因此"期望的密码子对比例"将是不可获得的。因为这带来的限制,确定性途径(deterministicapproach)被认为过于复杂并且不够有前途,然后选择了"遗传算法"途径。观察到术语"遗传算法"在其似乎涉及遗传工程的含义中可以是混淆的。然而,"遗传算法"是来自计算机科学的途径,其被用于多维优化问题的近似解(Michalewicz,Z.,GeneticAlgorithms+DataStructure=EvolutionPrograms,SpringerVerlag1994;DavidE.Goldberg.GeneticAlgorithmsinSearch,OptimizationandMachineLearning.Addison-Wesley,ReadingMA,1989;http:〃en.wikipedia.org/wiki/Genetic_algorithm)。在本发明中,该途径被用于解决选择可能的"最佳"基因的优化问题,所述"最佳"基因即感兴趣的聚体蛋白质的编码序列。在该途径中,基因中的每个位置(即每个密码子)能够被认为是一维,其数值集合是离散的并通过可获得的同义密码子测定。一般而言,在遗传算法中,首先通常随机地、或者通过对最初提供的解变更产生该问题的可能的"解"集,(尽管存在许多其它方法途径)。该集合被称作"种群";其元件为"个体"或"染色体",主要由包含每一维坐标的载体(数学含义上的载体)表示。因为遗传算法是在涉及自然选择的过程之后建模,所以许多术语来自于遗传学。然而,因为它们(在该情况下不同)主要应用于计算机科学的领域,但是也提出了遗传算法在生物科学问题中应用的一些例子,例如用于蛋白质二级结构预测(Armanoetal.2005BMCBioinformatics1(6)Suppl.4:S3);用于在计算机芯片上网络优化(Patiletal.2005BMCBioinformatics.23(6):308);基因表达数据聚类(DiGesuetal.2005BMCBioinformatics.7(6):289)。在该情况下,载体含有密码子。通过改变已有个体的某些位置("突变")或通过将一个个体的部分(即某些坐标)与来自另一个体的另一部分(即另一维的坐标)组合("交换"),来从该种群产生新的个体。然后检查这些个体有多良好(因为新的个体也可能解决最初的优化问题),并将更好("最适合")的个体再次作为初始种群,用于产生新的个体("下一世代";例如保留最好的10%、20%、30%、40%、50%、60%,但是存在许多其它可能性,其选择子集作为下一代用于获得朝向更适合个体的趋同,例如转轮选择(roulettewheelselecting),见Michalewicz,Z,1994)。当允许来自原始种群的最佳个体被带入下一世代时,确保对每个种群而言可能的解的品质变得更好或至少维持相同。然后假定,通过将该算法进行许多世代(=迭代;数百到数千,取决于问题的复杂性),人们会得到接近最优的解。在计算机科学中遗传算法已被精密地研究,包括性能如种群大小和世代数量的最适比例、如何预防该算法陷入局部最优等,但是这在本文中关系不大。如何针对实际的优化步骤设定这些参数的信息,参见实施例2中MATLAB中遗传算法完成的描述。这将参考图2来详细解释。图2显示了用于基因优化的遗传算法的流程图。这类遗传算法可以在适当编程的计算机上进行,其例子将在图1中展示并首先参考图1解释。图1显示了可以用于进行本发明方法的计算机设置的综述。该设置包括用于进行算术运算(arithmeticoperation)的处理器1。应当注意,遗传算法一般是非确定性的,因为它们涉及随机化的步骤(例如随机化的选择标准和/或随机化的算子选择和/或随机化的势解(potentialsolution)产生),然而,存在以确定性方式进行的例外。"遗传算法"是用于下述算法的通用工具,所述算法处理一组(称作种群)势解,其通过使用一个或多个对象(objective)筛选和/或选择和/或去除,和/或(再)引入趋向于最适解的(新)产生的解。考虑到该定义,被描述为进化编程、进化算法、经典遗传算法、实际编码(real-coded)的遗传算法、模拟退火、ant算法以及Monte-Carlo以及趋化性方法的方法也属于类似的算法种类,与下述算法相反,所述算法基于使用确定性算法的单个势解朝向最适解的趋同,例如线性编程和梯度算法。另外,技术人员从上下文会明白是否另一原始术语表示相同的算法种类。另外,尽管遗传算法是优选的方法,但是我们不排除遗传算法以外用于解决本发明所述的单个密码子和/或密码子对优化问题的任何其它方法。处理器1与大量存储组件连接,所述存储组件包括硬盘5、只读存储器(ROM)7、电可擦除可编程只读存储器(EEPROM)9和随机存取存储器(RAM)11。不是必须提供所有这些存储器类型。另外,这些存储组件不需要物理地位于接近处理器1处,而是可以位于远离处理器l处。处理器1还与使用者输入指令、数据等的手段连接,所述手段如键盘13和鼠标15。也可以提供本领域技术人员已知的其它输入手段,如触摸屏、轨迹球和/或声音转换器。本发明提供了与处理器1连接的读取单元17。读取单元17被设置为从数据运载体如软盘19或CDROM21中读取数据和可能地写入数据。其它数据运载体可以是如本领域技术人员已知的磁带、DVD、记忆棒等。处理器1还与用于打印输出数据的打印机23和显示器3(例如监视器或LCD屏(液晶显示器))或本领域技术人员已知的任何类型的显示器相连。处理器1可以通过I/O设备25与通信网络27连接,所述通信网络27例如公共交换电话网络(PSTN)、局域网(LAN)、广域网(WAN)等。处理器1可以被设置为通过网络27与其它通信设施通信。数据运载体19、21可包含数据和指令形式的计算机程序产物,其被设置为给处理器提供进行本发明的方法的能力。然而,这类计算机程序产物或者可通过电信网络27下载。处理器1可以作为独立系统运行,或者作为多个平行操作处理器(每个被设置为完成更大计算机程序的子任务)的大部分运行,或者作为具有若干个亚处理器的一个或多个主处理器运行。本发明的功能性部分甚至可以通过经过网络27与处理器1通信的远程处理器完成。现在将解释图2的算法,其可以在处理器1运行存储于其存储器中的计算机程序时在处理器1上进行。在动作32中,计算机产生一个或多个编码预定的蛋白质的基因。这可以通过从计算机存储器中存储的表格中取出这类数据来完成该处理。这类基因可以例如是ATG,GTT,GCA,TGG,TGG,TCT,…ATG,GTA,GCA,TGG,TGG,TCA,…就算法的目的而言,这些产生的基因被称作"原始基因"。在动作32之后,计算机程序通过将动作34-40进行一次或多次而进行一个或多个迭代循环(itemtionloop)。在动作34中,计算机程序通过将原始基因中的一个或多个密码子替换为同义密码子,从而使得新产生的基因仍然编码预定的蛋白质(交叉&突变过程)来产生新基因。为了能够达到该目的,计算机的存储器存储了密码子使用表格,该表格显示哪个密码子编码哪个氨基酸。(注意如果是特定的宿主生物的情况,则存在并考虑与"通用密码子"的偏差,见例如Laplazaetal"2006,EnzymeandMicrobialTechnology,38:741-747)。知道了蛋白质中的氨基酸序列,计算机程序就能够从如本领域所公知的表格中选择备选的密码子。使用动作32的例子,新产生的基因可以是(用粗体指出)-ATG,GTT,GCA,TGG,TGG,TCT,…ATG,GTA,GCA,TGG,TGG,TCA,…ATG,GTT,GCA,TGG,TGG,TCA,…ATG,GTA,GCA,TGGTGG,TCA,…ATG,GTA,GCC,TGG,TGG,TCA,...在动作36中,通过计算机程序使用适合度函数来测定所有基因的品质数值,所述适合度函数至少测定密码子适合度和密码子对适合度之一,所述所有基因包括原始基因和新产生的基因。这类适合度函数将在下文"进行密码子对优化"部分中详细解释。在动作38中,选择大量的根据适合度函数显示最佳适合度的基因参与"育种过程"(交叉和突变),并且选择大量的根据适合度函数显示最差适合度的基因从种群中去除。这些数量可以是预定的数量或取决于预定的适合度改进量。这些基因的选择应当是确定性的,但是当具有更高改变的"最适基因"被选择用于育种,而相反的被选择从种群中删除时,一般随后是随机过程。该方法被称作转轮选择。得到的被选择用于育种的基因可以例如是(非选择的基因用删除线表示)ATG,GTT,GCA,TGG,TGG,TCT,…ATG,GTT,GCA,TGG,TGG,TCA,…ATG,GTA,GCC,TGG,TGG,TCA,…在动作40中,计算机程序测试一个或多个终端标准是否被满足。通常终端标准之一为预定的迭代最大值。备选的标准是检验相对于原始基因的适合度而言,通过选择的基因获得的适合度是否被改进了至少最小的阈值;或检验相对于n迭代之前(优选地n为<10,100中选择的数值)具21有最佳适合度的基因的适合度而言,通过选择的基因获得的适合度是否被改进了至少最小的阈值。如果总体终端标准没有被满足,则计算机程序跳回动作34同时将选择的基因作为"原始基因"处理。如果在动作40中,计算机程序确立了改进低于最小阈值,则动作34-38没有很大意义,且计算机程序用动作42继续。应当理解,可在动作40中使用任何其它合适的迭代终止标准(如进行的迭代数)以留下迭代动作34-40并用动作42继续。在动作42中,在所有选择的基因中选择具有最佳适合度的基因,并例如通过监视器或借助于打印机通过打印输出提交给用户。在使用遗传算法的基因适应的情况下,必须确保交叉总是在读码框位置进行,因为否则的话得到的氨基酸序列可能被改变,组合了一个密码子的一个核苷酸与另一密码子的两个核苷酸。为了更好的趋同提出了经修饰的突变算子,对该突变算子而言只有同义密码子置换被允许,所述同义密码子置换导致至少最优的单个密码子使用或最优的密码子对使用之一。因此,目前密码子对优化的一个重要问题是如何测量个体的品质。该所谓的适合度函数可以被认为是遗传算法的中枢部分,因为它是要被优化的实际函数。在本发明中,一种优选的途径是对每个密码子对指定一个实数(称作权重)并在基因中利用该权重作为其"适合度",从而导致要被最小化的函数。在本说明书中,发明人将基因优化的过程描述为最小化问题。这是一个相当随意(arbitrarily)的途径。应注意,如果函数/要被最大化,人们能够同样寻找-/的最小值,因此这并非对一般性(generality)的限制。因此,必须鉴定测定密码子对权重的方法,其中被认为对表达水平有益的密码子对具有低权重,被认为对表达水平有害的密码子对具有高权重。用于基因适应的密码子对权重鉴定为了鉴定下述密码子对权重(其与更高的转录/表达水平相关并可作为输入值用于密码子对使用的适应),可以应用以下的方法,所述方法在本22文中通过Am'ger(其大部分被表达的基因的转录水平是已知的)和5.strife(可获得其转录水平的数据以及一组300个高表达的基因)例证。在Am'gw中(其中可以获得上述一组4,584个实际表达的基因的、从GeneChip数据中提取的完全分级(见实施例1)),计算每个基因的平均密码子对权重(即刀^(g)值的等价物)。然后根据适合度值(升序排列)和表达水平(降序排列)将基因分类。因为高表达的基因被假定为具有低密码子对适合度值,所以当使用理想的密码子对权重时这两种分级应当是相等的,因此,这两种分级的比较可以给出与适合度函数中使用的权重的品质相关的信息(其中与普通基因的分级相比,给予高表达的基因的"正确"分级稍微更多的注意力)。因此,计算了4,584个基因的分级和平均密码子对权重之间的相关系数(每个变量的协方差除以标准差)。在本发明的方法中可以使用若干个可能的权重组,包括选自下组的一个或多个(i)来自全基因组的偏向性值;(ii)来自一组高表达的基因的偏向性值;(iii)具有所有下述值的偏向性,所述值不具有调整到零的确定的最小z-分数(其中z-分数如实施例l丄4中所述被测定);(iv)上升至2或3、4、5或更高的幂次(power)的偏向性值(以给予高度优选的或拒绝的密码子更低/更高的影响);(v)z-分数自身;(vi)来自高表达的组和全基因组的偏向性数值/z-分数的差异;和(vii)(i)-(vi)中一个或多个的组合。对遗传算法而言,它们的求反(negation)已被使用,因为己用正值随意地识别优选的密码子对,然而遗传算法进行最小化。这适用于所有上述权重。可以如上所述,使用根据全基因组的密码子比值计算的期望值通过计算高表达的组中的密码子对"偏向性"获得更优选的权重矩阵。《"(q)仍然表示全基因组数据集合中^的单个密码子比例,"T((e^))表示高表达的组中对(c,,。)的发生,然后"组合的期望值""、,.,:))的i"f算对应于",((c,.,c,))^/(c,.)《"(c》.Z从而23,。'—m一n。)d。)))°其中vv((^。))被定义为密码子的序列g中密码子对(c,,。)的权重。注意因为优化函数会寻找最小的平均权重,所以与针对偏向性值的方程相比两个分子术语被反转,但是除了改变标志以外这不影响与表达水平的相关性。与测试的其它权重集合不同,下述密码子对在本文中略微具有缺点,所述密码子对涉及在高表达的组中更不足量表现的密码子。因此,这些权重是仅有的也反映高表达的组和所有基因的不同单个密码子偏向性的权重。使用这些权重带有拒绝下述一些密码子对的风险,所述密码子对实际上在高表达的组中具有正偏向性,但是由(高表达的组中)很少使用的密码子组成。然而,因为我们期望的单个密码子比例通常与具有高表达的基因组中的单个密码子比例并不相同,而是比它们更加"极端",单个密码子优化无论如何会代替这些不足量表现的密码子,因此我们能够认为上文所述的权重对于密码子对优化是非常方便的。因此,尽管密码子对权重也在有限的程度上反映了单个密码子偏向性,但是对优化而言,单个密码子使用被认为是独立的、另外的问题。使用遗传算法对单个密码子和密码子对的优化在本发明的方法中,优选使用被程序化以进行如上所述的遗传算法的计算机设置,来进行密码子对适应,或进行了组合的单个密码子和密码子对适应。将遗传算法应用于单个密码子适应也是可能的并且不从本发明中被排除,但是此处不想要的密码子可以被替换为同义密码子而不受相邻密码子的约束,因此使用遗传算法不是真正必需的。对密码子对而言,改变单个密码子通常会改变两个密码子对的权重,因此密码子对优化是受严重限制的,因为替换不想要的密码子对的单个密码子改变总是会改变另一密码子对,这不一定是针对更好的改变,随后在邻近的密码子对中改正针对更差的改变会再次改变另一对,依此类推。对于突变算子而言,只有密码子序列的下述变更被允许,所述变更不改变所编码的肽序列并且至少改进单个密码子适合度和密码子对适合度之一,即在改变之前,突变算子寻找下述同义密码子,其是不足量表现的(根据期望的单个密码子比例)或者是其所涉及的两个密码子对具有更好的权重的密码子。随机选择进行两种类型突变中的哪一种。在每个单个密码子上进行前一"突变"算子足够创建单个密码子优化的基因,而不使用任何遗传算法。考虑两个方面来测定基因的品质,即单个密码子"适合度"和密码子对"适合度"。后者简单地是密码子序列g(或基因)中所有密码子对的权重w((c(y^C("7))的均值。即当g再次表示密码子序列时,lgl为其长度(以密码子表示)而c(0为其A:-th密码子1lgl-lA,(力=H"T'2w((卓),+Ws。单个密码子适合度被定义为基因中实际密码子比例和目标密码子比例的差异,其针对每个密码子的出现数被标准化。单个密码子比例在本文的实施例l丄2中定义并如其中所述被测定。《w(c0t))是密码子Q的期望的比例(或频率),而rj(c^))如前文是基因g中的实际比例,然后单个密码子适合度被定义为Ac(g)=A£k,'(c(")—《(cW)l。因此,無c能够达到中的数值,其中最优的序列接近0,而無p受权重限制,所述权重在本文中也在[-l,l]中。为了针对两个方面进行优化,在一个实施方案中引入了组合的适合度函数qw+艮(g)其中代表"密码子对重要性"的是大于零的实数,并且决定两个适合度函数中的哪一个对组合适合度具有更大的影响。C戸'接近零时,当刀4c(g)更好时(即也接近零)分母接近零,因此/^c(g)中的小改变比^^(g)中的小改变更多地影响y^。^(g),然而在高c戸'下,y^p(g)的轻微改进比起無c(g)的中度改进可对Ac。油(g)具有更大的作用。注意,使用不同的C^'值获得的y^。油值是不能比较的(W接近于0可导致_/^。威接近于-100,然而对C^>0.2而言力UnW—般在0和-1之间)。在一个实施方案中,如果g含有某些不想要的序列(例如限制性位点或导致mRNA中不想要的二级结构的序列),则添加"罚分(penalty)"。构建合成基因时这可以是有用的,但是其自身与单个密码子和密码子对使用的优化无关。经修饰的适合度函数成为声二,(力=二:、+尸(g),其中尸(g)表示罚分函数(penaltyfonction),其在不想要的序列结构是基因g的部分的情况下创建了正权重。应当理解,在本发明的实施方案中,核苷酸和氨基酸序列可以是理论序列,其仅存在于例如纸上或另一优选的计算机可读数据运载体上,或它们可以作为切实地、物理地创建的具体化身存在。因此本发明第一方面涉及优化编码预定的氨基酸序列的核苷酸序列的方法,其中所述编码序列针对在预定的宿主细胞中的表达被优化。该方法优选地包括下述步骤(a)产生至少一条编码预定的氨基酸序列的原始编码序列;(b)通过用同义密码子替换该至少一条原始编码序列中的一个或多个密码子,从该至少一条原始编码序列产生至少一条新产生的编码序列;(c)测定所述至少一条原始编码序列的适合度值和所述至少一条新产生的编码序列的适合度值同时使用下述适合度函数,所述函数针对预定的宿主细胞至少测定单个密码子适合度和密码子对适合度之一;(d)根据预定的选择标准,在所述至少一条原始基因和所述至少一条新产生的编码序列中选择一条或多条选定的编码序列,所述适合度值越高,被选择的机会越高;和(e)重复动作b)到d),同时在动作b)到d)中将所述一条或多条选定的编码序列作为一条或多条原始编码序列处理,直至满足预定的迭代终止标准。根据本发明的一个实施方案,该方法优选地包括步骤(a)产生至少一条编码预定的氨基酸序列的原始编码序列;(b)通过用同义密码子替换该至少一条原始编码序列中的一个或多个密码子,从该至少一条原始编码序列26产生至少一条新产生的编码序列;(C)测定所述至少一条原始编码序列的适合度值和所述至少一条新产生的编码序列的适合度值同时使用下述适合度函数,所述函数针对预定的宿主细胞测定密码子对适合度;(d)根据预定的选择标准,在所述至少一条原始基因和所述至少一条新产生的编码序列中选择一条或多条选定的编码序列,所述适合度值越高,被选择的机会越高;和(e)重复动作b)到d),同时在动作b)到d)中将所述一条或多条选定的编码序列作为一条或多条原始编码序列处理,直至满足预定的迭代终止标准。根据本发明的另一实施方案,该方法优选地包括下述步骤(a)产生至少一条编码预定的氨基酸序列的原始编码序列;(b)通过用同义密码子替换该至少一条原始编码序列中的一个或多个密码子,从该至少一条原始编码序列产生至少一条新产生的编码序列;(c)测定所述至少一条原始编码序列的适合度值和所述至少一条新产生的编码序列的适合度值同时使用下述适合度函数,所述函数包括针对预定的宿主细胞测定单个密码子适合度和密码子对适合度;(d)根据预定的选择标准,在所述至少一条原始基因和所述至少一条新产生的编码序列中选择一条或多条选定的编码序列,所述适合度值越高,被选择的机会越高;和(e)重复动作b)到d),同时在动作b)到d)中将所述一条或多条选定的编码序列作为一条或多条原始编码序列处理,直至满足预定的迭代终止标准。在所述方法中,优选地,预定的选择标准是所述一条或多条选择的编码序列具有根据预定的标准最优的适合度值。根据本发明的方法还可在动作e)之后包括在所述一条或多条选定的编码序列中选择最佳个体编码序列,其中所述最佳个体编码序列具有比其它选定的编码序列更好的适合度值。在本发明的方法中,其中所述预定的迭代终止标准优选地至少为下述之一(a)测试是否至少一条所述选定的编码序列具有高于预定的阈值的最佳适合度值;(b)测试是否所述选定的编码序列均不具有低于所述预定的阈值的最佳适合度值;(c)测试是否至少一条所述选定的编码序列在所述原始编码序列中有至少30%的对预定的宿主细胞而言具有相关正密码子对权重的密码子对被转化为具有相关负权重的密码子对;和(d)测试是否至少一条所述选定的编码序列在所述原始编码序列中有至少10%、20%、30%、40%、50%、60%、70%、80%或90%的对预定的宿主细胞而言具有高于0的相关正权重的密码子对被转化为具有低于0的相关权重的密码子对。在本发明的方法中,适合度函数优选地借助于A(g)=1。0-A,Ek,(c柳-厂/(顺)来定义单个密码子适合度,其中g象征编码序列,lgl为长度,g(Q为k-th密码子,^'^'(c(yt))是期望的密码子c^)比值(附录2;CR载体),且r/(c(")为核苷酸编码序列g中的实际比值。在本发明的方法中,适合度函数优选地借助于A(g)=rfr.Sw(,,+1))定义密码子对适合度,其中w((c(Q,C(A+l))是编码序列g中密码子对的权重,lgl为所述核苷酸编码序列的长度,且C(Q为所述编码序列g中的k-th密码子。更优选地,在本发明的方法中,适合度函数借助于定义,其中艮(g)=rV2卜"'(柳-^(柳l水cW是大于零的真实值,y^p(g)是密码子对适合度函数,刀4c(g)是单密码子适合度函数,w((c附+i))是编码序列g中密码子对的权重(附录3,CPW矩阵),lgl是所述编码序列的长度,c(Q是所述密码子序列中的k-th密码子,CcW)是期望的密码子C的比例,《(c(")是编码序列g中的实际比例。优选地c戸'在0和10之间,更优选地在0和0.5之间,最优选地为约0.2。28在本发明的方法中,密码子对权重W(附录3)可以来自包含终止密码子的61x64密码子对矩阵。注意终止:正义对和终止:终止对的权重总是零。优选地根据基于计算机的方法,使用至少以下之一作为输入值来计算密码子对权重w:(a)预定的宿主细胞的基因组序列,其中至少5%、10%、20%或80%的蛋白质编码核苷酸序列被测序;(b)预定的宿主细胞的相关物种的基因组序列,其中至少5%、10%、20%或80%的蛋白质编码核苷酸序列被测序;(c)由预定的宿主细胞的至少200个编码序列组成的一组核苷酸序列;和(d)由预定的宿主细胞的相关物种的至少200个编码序列组成的一组核苷酸序列。相关物种在本文中应当被理解为表示下述物种,其小亚基核糖体RNA的核苷酸序列与预定的宿主细胞的小亚基核糖体RNA的核苷酸序列具有至少60%、70%、80%或90%的同一性(Wuytsetal.,2004,NucleicAcidsRes.12:D101-D103)。不需要针对所有可能的61x64密码子对(其包含作为终止密码子的终止信号)来测定密码子对权重w,而是可以仅针对其片段来测定,例如至少5%、10%、20%、50%和优选地100%的包含作为终止密码子的终止信号的可能的61x64密码子对。选择高表达的基因为了计算密码子对权重矩阵和单个密码子目标比例载体,人们可以应用来自特定的宿主细胞自身的核苷酸序列集合、来自相关物种的核苷酸序列集合,或二者的组合。核苷酸序列的A集合被称为"全体参考集合(referencesetall)"。最优选地,该集合含有被完全测序(>95%)的生物的开放读码框(ORF)的全集。在本发明一个优选的实施方案中,子集B被选择为含有下述子集,所述子集过量表现高表达的基因或编码高表达的蛋白质的基因。该集合可以使用下述测量方法和随后的分级测定,所述测量方法如使用阵列(例如来自Affymetrix、Nimblegen、Agilent或参考集合A的任何其它来源的阵列)技术的mRNA杂交。其它测量方法可以是RT-PCR、蛋白质凝胶、MS-MS分析或本领域技术人员已知的任何其它测量技术。除了在测量方法的基础上进行分级以外,还可以应用生物信息学工具来直接预测一组高29表达的基因,例如通过选择最偏向的基因(Carboneetal,2003)或通过选择已知在大范围生物中高表达的基因来预测。其中包括涉及初级代谢的核糖体蛋白质、糖酵解和TCA循环基因,涉及转录和翻译的基因。优选地,根据基于计算机的方法,使用在预定的宿主细胞中高表达的基因组作为输入值计算密码子对权重w。高表达的基因在本文中被理解为表示下述基因,所述基因的mRNA可以以每个细胞至少IO个、优选地20个、更优选地50个、更优选地100个、更优选地500个和最优选地至少1,000个拷贝的水平被检测。例如,Gygietal.测量了每个酵母细胞15,000mRNA分子。特定的mRNA分子的丰度被测定为在每个细胞0.1-470的范围内(Gygi,S.P.,Y.Rochon,B.R.FranzaandR.Aebersold(1999).CorrelationbetweenproteinandmRNAabundanceinyeast.Mol.Cel.Biol.19(3):1720-30)或低10倍的范围内每个细胞0.01-50(byAkashi,H.(2003).Translationalselectionandyeastproteomeevolution.Genetics164(4):1291-1303.)。或者,预定的宿主细胞中高表达基因的组可以是包含1000、500、400、300或200或100个最大量的mRNA或蛋白质的组。技术人员应当明白为了计算单个密码子比例,高表达基因的组大小可以是小的,因为至多仅64个目标值被明确。此时具有高表达基因的参考集合可以低至1个基因,但是通常认为1%的基因组大小是高表达基因的代表性集合,见例如Carbone,A.etal.(2003)(Codonadaptationindexasameasureofdominatingcodonbias.Bioinformatics.19(16):2005-15)。对密码子对权重矩阵的计算而言,通常200-500个参考基因的集合满足条件,其对应于细菌基因组(3000-15000个基因)的2-7%。另一种可能性是从文献中得出推定高表达的基因的子集。例如对于模式生物Sa"7/^wMto而言,存在许多针对单个密码子偏向性的文献。针对石acz7/usswZ^fo的本领域水平的良好综述由Kanaya"a/.(1999)的工作给出。在我们的途径中(见实施例4),我们根据通过Affymetrix技术测量的mRNA水平将数据归类在高表达组的子集中,并将这些序列与基因组ORF的全集比较。在文献中已经使用的其它选择是蛋白质表达数据,和功能性范畴的(预期的)基因组,所述基因如核糖体蛋白质、涉及翻译和转录、孢子形成、能量代谢和鞭毛体系的蛋白质(Kanayaetal.,1999;KarlinandMrazek,2000)。事实上人们常发现例如核糖体蛋白质以及其它名称的组中的高密码子偏向性。然而,一般地后一组中并非所有基因均显示这样的表现。我们也不知道核糖体蛋白质如何在低生长生产条件下反应。因此,推出高表达基因子集的直接测量技术似乎是合理的。然后我们可以选择转录体组(TX)和/或蛋白质组(PX)数据。对二者而言均存在pros和cons。TX给出全基因组中基因mRNA水平的相当全面的图谱,而PX数据可能由水溶性蛋白质的过量表现引起偏向。TX数据是可获得的进行翻译的mRNA的直接度量,而蛋白质是蓄积过程的部分,其中周转(tumover)也起到重要作用。总之,TX和PX数据显示与高表达的基因相关(Gygietal,1999)。另一个有趣的工作是通过与平均密码子使用的偏差和与下述蛋白质的相似性来预测高表达的(PHX)基因,所述蛋白质为核糖体蛋白质和涉及翻译和转录加工因子的蛋白质,和陪伴分子降解蛋白质(KarlinandMrazek,2000)。尤其是对于快速生长的生物如^^7/w、五.co/z'等而言,发现主要的糖酵解基因和三羧酸循环基因属于上述组。该预测方法在mRNA数据和蛋白质表达上与已知高表达的基因良好地比较。技术人员应当明白,单个密码子权重和密码子对权重w二者均可针对经修饰的宿主细胞被测定,所述宿主细胞关于其tRNA编码基因的含量和性质被修饰,即包含存在的tRNA基因的额外拷贝、下述新(外源)tRNA基因(包括非天然的tRNA基因)的宿主细胞,以及其中一个或多个tRNA基因被失活或删除的宿主细胞,所述非天然的tRNA基因包括编码tuna的基因,所述tima被修饰为包括非天然的氨基酸或其它化合物。在本发明的方法中,编码预定的氨基酸序列的原始编码核苷酸序列选自(a)编码预定的氨基酸序列的野生型核苷酸序列;(b)预定的氨基酸序列的逆翻译,其中预定的氨基酸序列中氨基酸位置上的密码子随机地选自编码该氨基酸的同义密码子;和(c)预定的氨基酸序列的逆翻译,其中根据预31定的宿主或与所述宿主细胞相关的物种的单个密码子偏向性来选择预定的氨基酸序列中氨基酸位置上的密码子。宿主细胞在本发明的方法中,预定的宿主细胞可以是适用于通过表达被优化的核苷酸编码序列来生产感兴趣的多肽的任何宿主细胞或生物。因此宿主细胞可以是原核或真核的宿主细胞。宿主细胞可以是适用于在液体或固体培关宜rh+立关66fe+如胎i姿fe+袖胎TiTI、1且"IC^细胞苴^b玄细跑细织或/和多细胞生物如(转基因)植物、动物或人的部分。宿主细胞可以是微生物或非微生物的。合适的非微生物宿主细胞包括例如哺乳动物宿主细胞如仓鼠细胞CHO(中国仓鼠卵巢)、BHK(幼仓鼠肾)细胞、小鼠细胞(例如NS0)、猴细胞如COS或Vero;人细胞如PER,C6tm或HEK-293细胞;或昆虫细胞如DrosophilaS2和SpodopteraSf9或Sf21细胞;或植物细胞如烟草、番茄、马铃薯、欧洲油菜(oilseedrape)、巻心菜、豌豆、小麦、玉米、水稻、7b:a^物种如7Vm^Zrev^/z'a、JraZu'fifo/w^物禾中如Jra&Jo/w^sAa/zVma,禾口7V/'co/zVma物禾中如Mcori朋atoZ^a/m。这类非微生物的细胞尤其适用于生产用于哺乳动物或人治疗中的哺乳动物或人蛋白质。宿主细胞也可以是微生物宿主细胞如细菌或真菌细胞。合适的细菌宿主细胞包括革兰氏阳性和革兰氏阴性细菌。合适的细菌宿主细胞包括来自于5ac/〃i/s、Jc">iom_yce^s、五sc/en'c/n'fl、5Vre//o附少ces属的细菌,以及字L酸菌如丄actoZjflcz'〃船、5V/^ptococcw51、丄actococctw、(9ewococcMs、Z^t/cowoWoc、尸Wococcw、Ca/7ZoZacten'wm、尸ra/z'om'Zac^7'訓、^Vz^"ococcws禾口5zy^o6acfen'wm。尤其优选的是5ac/〃ws仰ZZ/fc、5ac/〃wscoe/z'co/or、5Vre/tomycesc/avw/z'geras禾口丄acto6a".〃1^//awtan/m、或者,宿主细胞可以是真核微生物如酵母或丝状真菌。优选的作为宿主细胞的酵母属于iSflcc/jaramycas、A7i/_yv£ram_ycas、Ca涵Wa、尸ZcAz.g、32的属。尤其优选的Z)eZarom;;w宿主细胞包括Sacc/aram少cascere由'oe禾口根据一个更优选的实施方案,本发明的宿主细胞是丝状真菌的细胞。"丝状真菌"包括真菌门和卵菌门(Oomycota)亚类的所有丝状形式(如上文Hawksworthetal.、1995定义)。丝状真菌的特征是由甲壳质、纤维素、葡聚糖、壳聚糖、甘露聚糖和其它复合多糖组成的菌丝体壁。植物性生长通过菌丝延长进行,碳分解代谢是专性需氧的。在本发明中其菌株可以用作宿主细胞的丝状真菌属包括,但不限于下述属的菌株Fwsan'謹、//w附Zco/a、MagwaporAe、Mwcor、Afyce/z.o/滅ora、iVeocfl〃z'mas^'x、7Vei/ro5^ora、尸aecz7om少ces、尸ewz'cz'〃z'wm、尸zVom少ces、5^/'20/^少//"附、C7/7sos/on'w附、7"a/arom少cas1、TT^r/woascus、77'e/av/a、7b/少;oc/a^/m禾卩7WcAo&ma。优选地,丝状真菌属于选自下组的物种,所述组由J5pergi〃twWger、v45^erg/〃wso,ae、^45^ergz'〃ws呵'ae、7Wc/06fe/7wa或尸em'cz'〃z.wmc/j^sogewwm组成。合适的宿主菌株的例子包括^sperg"/^CBS513.88(Peletal.,2007,NatBiotech.21:221-231)、^sp^^〃Mo^yzaeATCC20423、IFO4177、ATCC1011、ATCC9576、ATCC14488-14491、ATCC11601、ATCC12892、尸.c/owge"wmCBS455.95、尸ew/cz'〃/wmc"n'wwmATCC38065、尸ewz'"'〃ZwwcA/^^ogewwmP2、ylcrewow'謂c/,ogew訓ATCC36225或ATCC48272、7Wc/ocfermflre^e/ATCC26921或ATCC56765或ATCC26921、A;wg"/MSscj/mATCC11906、CA^sos;o〃'wm/wc^zoweraeATCC44006及其衍生物。宿主细胞可以是野生型丝状真菌宿主细胞或变体、突变体或经遗传修饰的丝状真菌宿主细胞。这类经修饰的丝状真菌宿主细胞包括例如,具有降低的蛋白酶水平的宿主细胞,例如蛋白酶缺陷菌株如J^Wgz7/loo^aeJaL125(描述与WO97/35956或EP429490中);如WO96/14404中公开的三肽酰-氨肽酶-缺陷型Am'gw菌株,或具有降低的蛋白酶转录激活子生产的宿主细胞(parT;如WO01/68864、US2004/0191864A1和WO2006/040312中所述);宿主菌株如^spwgW/i^oo^aeBECh2,其中三个TAKA淀粉酶基因、两个蛋白酶基因以及形成代谢产物环并偶氮酸和曲酸的能力已经失活(BECh2在WO00/39322中公开);与野生型细胞相比包含提高的未折叠的蛋白质应答(UPR)以增强感兴趣的多肽的生产能力的丝状真菌宿主细胞(描述于US2004/0186070A1、US2001/0034045A1、WO01/72783A2和andWO2005/123763中);具有草酸盐缺陷表型的宿主细胞(在WO2004/070022A2和WO2000/50576中描述);具有多量内源多肽的降低的表达的宿主细胞,所述内源多肽如葡萄糖淀粉酶、中性a-淀粉酶A、中性a-淀粉酶B、ce-l,6-转葡萄糖基酶、蛋白酶、纤维二糖水解酶和/或草酸水解酶(如可根据US2004/0191864A1中所述技术通过遗传修饰获得);具有提高的同源重组效率的宿主细胞(如WO2005/095624中所述具有缺陷的或/^/B基因);和具有这些修饰的任何可能的组合的宿主细胞。在本发明的一种方法中,预定的氨基酸序列可以是与所述预定的宿主细胞异源的(感兴趣的多肽的)氨基酸序列,或其可以是与所述预定的宿主细胞同源的(感兴趣的多肽的)氨基酸序列。在关于核酸(DNA或RNA)或蛋白质的方面,使用术语"异源的"表示下述核酸或蛋白质,其不作为其存在的生物、细胞、基因组或DNA或RNA序列的部分天然存在,或其存在于与其天然存在的细胞或位置基因组或DNA或RNA序列中的位点不同的地方。异源核酸或蛋白质对其被引入的细胞而言不是内源的,但是得自另一细胞或被合成或重组地生产。一般地(尽管并非必须),这类核酸编码下述细胞通常不生产的蛋白质,所述核酸在所述细胞中被表达。在本文中术语异源核酸或蛋白质包括本领域技术人员会识别为对于下述细胞是异源或外源的任何核酸或蛋白质,所述核酸或蛋白质在所述细胞中被表达。术语异源的也适用于核酸或氨基酸序列的非天然的组合,即组合中至少两条组合的序列对彼此而言是外源的。当用于指出给定的(重组的)核酸或多肽分子与给定的宿主生物或宿主细胞之间的相互关系时,术语"同源的"应当被理解为表示该核酸或多肽分子天然地由相同物种(优选地相同变种或菌株)的宿主细胞或生物生产。预定的氨基酸序列可以是具有商业或工业应用性或实用性的任何感兴趣的多肽的序列。因此,感兴趣的多肽可以是抗体或其部分、抗原、凝固因子、酶、激素或激素变体、受体或其部分、调节蛋白、结构蛋白、受体、或运输蛋白、细胞内蛋白质、涉及分泌过程的蛋白质、涉及折叠过程的蛋白质、陪伴分子、肽氨基酸转运蛋白、糖基化因子、转录因子。优选地,感兴趣的多肽通过经典的分泌途径、通过非经典的分泌途径或通过备选的分泌途径(描述于WO2006/040340中)被分泌进宿主细胞的细胞外环境中。如果感兴趣的多肽是一种酶,则其可以是例如氧化还原酶、转移酶、水解酶、裂合酶、异构酶、连接酶、过氧化氢酶、纤维素酶、壳多糖酶、胶质酶、过氧核糖核酸酶、葡聚糖酶、酯酶。更优选的酶包括例如糖酶,例如纤维素酶如内切葡聚糖酶、j8-葡聚糖酶、纤维二糖水解酶或j8-葡糖苷酶,半纤维素酶或果胶分解酶如木聚糖酶、木糖苷酶、甘露聚糖酶、半乳聚糖酶、半乳糖苷酶、果胶甲基酯酶、果胶裂合酶、果胶酸裂合酶、多聚半乳糖醛酸内切酶、多聚半乳糖醛酸酶外切酶、鼠李半乳糖醛酸酶、阿拉伯聚糖酶、阿拉伯呋喃糖酶、阿拉伯木聚糖水解酶、半乳糖醛酸酶、裂合酶或淀粉酶;水解酶、异构酶或连接酶、磷酸镁如植酸酶、酯酶如脂酶、蛋白水解酶、氧化还原酶如氧化酶、转移酶,或异构酶、植酸酶、氨肽酶、羧肽酶、内切蛋白酶、金属蛋白酶、丝氨酸蛋白酶、过氧化氢酶、甲壳酶、角质酶、环糊精葡萄糖基转移酶、脱氧核糖核酸酶、a-半乳糖苷酶、/5-半乳糖苷酶、葡萄糖淀粉酶、a-葡糖苷酶、/3-葡糖苷酶、卤素过氧化氢酶、转化酶、漆酶、甘露糖苷酶、变构酶(mutanase)、过氧化物酶、磷脂酶、多酚氧化酶、核糖核酸酶、转谷氨酰胺酶、葡萄糖氧化酶、己糖氧化酶和单加氧酶。感兴趣的若干种治疗蛋白质包括例如抗体及其片段、人胰岛素及其类似物、人乳铁蛋白及其类似物、人生长激素、红细胞生成素、组织纤维蛋白溶酶原激活剂(tPA)或胰岛素调理素(insulinotropin)。多肽可以涉及代谢产物(优选地为柠檬酸)的合成。这类多肽例如包括乌头酸水合酶、顺乌头酸酶、6-果糖磷酸激酶、柠檬酸合成酶、羧基磷酰基烯醇丙酮酸盐磷酰基变位酶(carboxyphosphonoenolpyruvatephosphonomutase)、乙醇酸还原酶、葡萄糖氧化酶前体goxC、核苷二磷酸糖差向异构酶、葡萄糖氧化酶、锰超氧化物岐化酶、柠檬酸裂合酶、泛醌还原酶、载体蛋白、柠檬酸传递蛋白、线粒体呼吸蛋白质和金属传递蛋白。计算机、程序和数据运载体本发明又一方面涉及包含处理器和存储器的计算机,所述处理器被设置为从所述存储器读取和写入所述存储器,所述存储器包含数据和指令,所述数据和指令被设置为提供给处理器进行本发明方法的能力。本发明另一方面涉及计算机程序产品,其包含数据和指令并被设置为可以负载于计算机的存储器中,所述计算机也包含处理器,所述处理器被设置为从所述存储器读取和写入所述存储器,所述数据和指令被设置为提供给所述处理器进行本发明方法的能力。本发明还在另一方面涉及用上文所定义的计算机程序产品提供的数据运载体。核酸分子本发明又一方面涉及包含编码预定的氨基酸序列的编码序列的核酸分子。编码序列优选地是与天然存在的编码序列不类似的核苷酸序列。核酸分子中的编码序列不是天然存在的核苷酸序列,而是人工的(即工程操作的)人造核苷酸序列,其基于下述方法产生并随后作为有形的核酸分子合成,所述方法用于针对预定的宿主细胞根据本文定义的方法优化单个密码子和/或密码子对偏向性。优选地,编码序列具有针对预定宿主细胞至少低于0.2,或更优选地低于0.1和最优选地低于0.02的^&(gj。更优选地,编码序列具有针对预定宿主细胞至少低于0的y一fe)。最优选地,编码序列具有针对预定宿主细胞至少低于-o.i,或更优选地至少低于-o.2的y^p(g」。优选地,经优化的基因^中密码子对的数量含有至少60%、70%、75%、3680%、85%的密码子对和最优选地至少90%的密码子对针对特定的宿主生物具有相关的负密码子对。由编码序列编码的预定的氨基酸序列可以是本文如上定义的任何感兴趣的多肽,预定的宿主细胞也可以是本文如上定义的任何宿主细胞。在核酸分子中,编码序列优选地与表达控制序列可操作地连接,所述表达控制序列能够指导编码序列在预定的宿主细胞中的表达。在本发明的上下文中,控制序列被定义为当一起存在时与编码序列可操作地连接的核苷酸序列,其包括对编码要生产的多肽的核苷酸序列表达是必需的或有利的所有组件。对于编码要生产的多肽的核苷酸序列而言,每个控制序列可以是固有的或外来的。这类控制序列可包括但不限于,前导序列、多聚腺苷酸化序列、原肽序列(propeptide)、启动子、翻译起始因子序列、翻译起始因子编码序列、翻译转录终止子和转录终止子序列。例如为了引入特异限制性位点的目的,可以用连接子提供控制序列,以便于将控制序列与编码多肽的核苷酸的编码区连接。表达控制序列通常会最低限度地包含启动子。本文使用术语"启动子"是指一种核酸片段,其功能是控制一个或多个基因的转录,根据转录的方向位于基因的转录起点上游,并且结构上由DNA-依赖性RNA聚合酶、转录起点和任何其它DNA序列的存在识别,所述任何其它DNA序列包括但不限于转录因子结合位点、阻抑因子和激活因子蛋白结合位点和本领域技术人员已知的直接或间接地作用以调节来自启动子的转录量的任何其它核苷酸序列。"组成型"启动子是在大部分环境和发育条件下有活性的启动子。"诱导型"启动子是在环境或发育调节下有活性的启动子。当DNA区段(如表达调控序列)被置于与另一DNA区段的功能关系中时,其是"可操作地连接"的。例如,如果启动子或增强子刺激编码序列的转录,则其与该编码序列是可操作地连接的。如果信号序列的DNA被表达为参与多肽分泌的前蛋白,则该信号序列的DNA与编码该多肽的DNA可操作地连接。一般地,可操作地连接的DNA序列是连续的,并且在信号序列的情况下,不仅是连续的而且是处于读码状态的(inreadingphase)。然而,增强子不必须与它们调控其转录的编码序列连续。连接用37本领域已知的手段通过在便利的限制性位点处或接头、连接子或PCR片段处的连接完成。对适当的启动子序列的选择一般取决于被选择用于表达DNA区段的宿主细胞。合适的启动子序列的例子包括本领域公知的原核和真核启动子(见例如SambrookandRussell,2001,"MolecularCloning:ALaboratoryManual(3rdedition),ColdSpringHarborLaboratory,ColdSpringHarborLaboratoryPress,NewYork)。转录调节序列典型地包括被宿主识别的异源增强子或启动子。适当启动子的选择取决于宿主,但是启动子如trp、lac和噬菌体启动子、tRNA启动子和糖酵解酶启动子是已知和可获得的(见例如SambrookandRussell,2001,上文)。可以使用的优选的诱导型启动子的例子为淀粉-、铜-、油酸-诱导的启动子。丝状真菌宿主细胞优选的启动子例如包括Am'gw的葡萄糖淀粉酶启动子或Aoo^m的TAKA淀粉酶启动子和WO2005/100573中描述的启动子。本发明的核苷酸序列可还包含信号序列,或更确切地包含信号肽编码区。信号序列编码与多肽的氨基端连接的氨基酸序列,其能够指导被表达的多肽进入细胞的分泌途径。信号序列通常含有约4-15个氨基酸的疏水核心,其常紧邻地位于碱性氨基酸之前。信号肽的羧基端存在被单个插入氨基酸分开的一对小的、不带电的氨基酸,所述单个惨入氨基酸定义了信号肽切割位点(vonHeijne,G.(1990)J.MembraneBiol.115:195-201)。尽管它们整体的结构和功能相似,但是天然的信号肽不具有共有序列。合适的信号肽编码区可得自来自^^W^7/m物种的葡萄糖淀粉酶或淀粉酶基因、来自//^owwcor物种的脂酶或蛋白酶基因、来自fcccAwom;;cMcewv^^的o;-因子基因、来自5a"7/w物种的淀粉酶或蛋白酶基因、或小牛前-原-凝乳酶基因。然而,本发明中可以使用能够指导表达的蛋白质进入选择的宿主细胞的分泌途径的任何信号肽编码区。丝状真菌宿主细胞的优选的信号肽编码区是得自A戸^7tooo^eTAKA淀粉酶基因(EP238023)、^/ergz'〃mmgw中性淀粉酶基因、As^^^'〃mw&er葡萄糖淀粉酶、A/n'zo附wcor脂W^'天冬氨酸蛋白酶基因、//i/mko/a/awwg/"o犯纤维素酶基因、ifwmz.co/a/wso/ews乡千会佳素醇、//M/w/co/a^wo/ew角质酉每、Gflwcfe/a38awtorcfefl脂酶B基因或A/'zomMcorm〖e/ie!'脂酶基因的信号肽编码区及其突变体、截短的和杂种信号序列。在本发明的一个优选的实施方案中,编码信号序列的核苷酸序列是下述编码序列的一个完整部分,所述编码序列针对预定的宿主关于单个密码子和/或密码子对偏向性被优化。在本发明的核酸分子中,编码序列还优选地与翻译起始因子序列可操作地连接。在真核生物中,起始因子ATG-密码子之前的核苷酸共有序列(6-12个核苷酸)常由于Kozak在该客体上的初期工作而被称作Kozak共有序歹iJ(Kozak,M.(1987):ananalysisof5'腸noncodingsequencesfrom699vertebratemessengerRNAs.15(20):8125-47)。包含由Kozak推出的+4个核苷酸的原始Kozak共有序列与高等真核生物中的翻译起始相关联。对原核宿主细胞而言,相应的Shine-Delgamo序列(AGGAGG)优选地存在于原核mRNA的5'-非翻译区,作为核糖体的翻译起点作用。在本发明的上下文中,"翻译起始因子序列"被定义为编码多肽的DNA序列的开放读码框的起始因子或起始密码子上游紧邻的十个核苷酸。起始因子或起始密码子编码氨基酸甲硫氨酸。起始因子密码子典型地为ATG,但是也可以是任何功能性起始密码子如GTG、TTG或CTG。在本发明的一个尤其优选的实施方案中,核酸分子包含编码预定氨基酸序列的编码序列,所述预定的氨基酸序列要在真菌宿主细胞中被表达,即所述预定的宿主细胞优选地是真菌,最优选丝状真菌。包含编码序列(其根据本发明的方法针对在真菌中的表达被优化)的核酸分子可还包含一个或多个下述元件l)真菌共有翻译起始因子序列;2)真菌翻译起始因子编码序列;和3)真菌翻译终止序列。共有的真菌翻译起始因子序列优选地由以下序列定义5'-mwChkyCAmv-3',使用编码下述核苷酸的多义性(ambiguity):m(A/C);r(A/G);w(A/T);s(C/G);y(C/T);k(G/T);v(A/C/G);h(A/C/T);d(A/G/T);b(C/G/T);n(A/C/G/T)。根据一个更优选的实施方案,该序列为5'-mwChkyCAAA-3,;5'-mwChkyCACA-3,或5'-mwChkyCAAG-3,。最优选的,翻译起始共有序列为5'-CACCGTCAAA-3,或5'-CGCAGTCAAG-3'。在本发明的上下文中,术语"共有翻译起始因子编码序列"在本文中被定义为编码序列的开放读码框起始因子密码子上游紧邻的九个核苷酸(起始因子密码子典型地为ATG,但是也可以是任何功能性起始密码子如GTG)。一个优选的真菌共有翻译起始因子编码序列具有以下的核苷酸序列5'-GCTnCCyyC-3',使用编码核苷酸y(C/T)和n(A/C/G/T)的多义性。这导致翻译起始因子编码序列的16个变体,其中5'-GCTTCCTTC-3'是最优选的。使用共有翻译起始因子编码序列,在所述氨基酸位置允许以下的氨基酸被编码的多肽中位置+2处的丙氨酸,位置+3处的丙氨酸、丝氨酸、脯氨酸或苏氨酸,和位置+4处的苯丙氨酸、丝氨酸、亮氨酸或脯氨酸。优选的在本发明中,共有翻译起始因子编码序列对于编码要生产的多肽的核酸序列是外来的,但是共有翻译起始因子对真菌宿主细胞可以是固有的。在本发明的上下文中,术语"翻译终止序列"被定义为从开放读码框或编码序列3'端的翻译终止密码子开始的四个核苷酸。优选的真菌翻译终止序列包括5'-TAAG-3'、5'-TAGA-3'和5'-TAAA-3',其中最优选5'-TAAA-3,。编码要在真菌宿主细胞中被表达的预定氨基酸序列的编码序列还优选的关于单个密码子频率被优化,使得至少一个、两个、三个、四个或五个原始密码子,更优选的至少1%、2%、3%、4%、5%、10%、15%、20%、25%、50%、75%、80%、85%、90%或95%的原始密码子被交换为同义密码子,所述同义密码子编码与固有的密码子相同的氨基酸并且与原始密码子相比在表A中定义的密码子使用中具有更高的频率。表A:以%表示的最优的丝状真菌同义密码子的密码子频率。.T..C..A..G.T..Phe0Ser21Tyr0Cys0..TT.Phe100Ser44Tyr100Cys100..CT..Leu0Ser0终止100终止0..AT..LeuSer终止Trp.G4013140100LeuProHisArgi,,/:AV/49..TLeuProHisArgc..386410051..CLeuProGinArgc..0000..ALeuProGinArgc..3201000GlieThrAsnSerA.273000.TlieThrAsnSerA..737010021.ClieThrLysArgA..0000..AMetThrLysArgA..10001000..GValAlaAspGlyG..27383649..TValAlaAspGlyG..54516435..CValAlaGluGlyG..002616.AValAlaGluGlyG..1911740..G一条进一步更优选的编码预定的氨基酸序列(其要在真菌宿主细胞中被表达)的编码序列还优选地关于单个密码子频率被优化,使得至少一个、两个、三个、四个或五个原始密码子,更优选的至少1%、2%、3%、4%、5%、10%、15%、20%、25%、50%、75%、80%、85%、90%或95%的原始密码子被交换为同义密码子,所述同义密码子改变密码子频率使得所述频率中所述密码子的百分比和列出的最优百分比之间的绝对差值在修饰后变得更小,应用以下的最优百分比列表由TGC(100%)编码的半胱氨酸;由TTC(100%)编码的苯丙氨酸;由CAC(100%)编码的组氨酸;由AAG(100%)编码的赖氨酸;由AAC(100%)编码的天冬酰胺;由CAG(100%)编码的谷氨酰胺;由TAC(100%)编码的酪氨酸;由GCT(38.0。/cO、GCC(50.7%)或GCG(11.3%)编码的丙氨酸;由GAC(63.2%)编码的天冬氨酸;由GAG(74.2%)编码的谷氨酸;由GGT(49.0%)、GGC(35.9%)、GGA(15.1%)编码的甘氨酸;由ATT(26.7%)、ATC(73.3%)编码的异亮氨酸;由TTG(12.70/。)、CTT(17.4%)、CTC(38.7%)、CTG(31.2%)编码的亮氨酸;由CCT(35.6%)、CCC(64.4%)编码的脯氨酸;由CGT(49.1%)、CGC(50.9%)编码的精氨酸;由TCT(20.8%)、TCC(44.0%)、TCG(14.4%)、AGC(20.8%)编码的丝氨酸;由ACT(29.7%)、ACC(70.3%)编码的苏氨酸和/或由GTT(27.4%)、GTC(54.5%)、GTG(18.1%)编码的缬氨酸;所有其它可能的氨基酸编码密码子(0%)。上文定义的包含本发明的编码序列的核酸分子(用于在预定的宿主细胞中表达)可进一步包含通常存在于表达载体中的元件,如可选择的标记物、复制起点和/或(优选地通过基因组中预定位点的同源重组)促进整合的序列。这类其它元件是本领域公知的并且不需要在本文中进一步说明。本发明又一方面涉及包含本文如上定义的核酸分子的宿主细胞。宿主细胞优选地是本文如上定义的宿主细胞。本发明还在又一方面涉及生产具有预定的氨基酸序列的多肽的方法。该方法优选地包括在有助于所述多肽表达的条件下培养宿主细胞,所述宿主细胞包含本文如上定义的核酸分子,以及任选地,回收所述多肽。本发明再在又一方面涉及至少生产细胞内和细胞外代谢产物之一的方法。该方法包括在有助于生产代谢产物的条件下培养本文如上定义的宿主细胞。优选地,宿主中具有预定的氨基酸序列的多肽(其由如上所述的核酸分子编码)涉及代谢产物的生产。代谢产物(其为初级或刺激代谢产物或二者,其为细胞内、细胞外或二者)可以是可以在发酵过程中生产的任何发酵产物。这类发酵产物例如包括氨基酸如赖氨酸、谷氨酸、亮氨酸、苏氨酸、色氨酸;抗生素,包括例如氨苄青霉素、杆菌肽、先锋霉素、红霉素、莫能菌素、青霉素、链霉素、四环素、泰乐菌素、大环内酯和喹诺酮;优选的抗生素为先锋霉素和0-内酰胺;脂质和脂肪酸,包括例如多不饱和脂肪酸(PUFA);链烷醇如乙醇、丙醇和丁醇;多元醇如1,3-丙烷-二醇、丁二醇、甘油和木糖醇;酮如丙酮;胺、二胺、乙烯;类异戊二烯,如类胡萝卜素、胡萝卜素、虾青素、番茄红素、叶黄素;丙烯酸、甾醇如胆固醇和麦角固醇;维生素,包括例如维生素A、B2、B12、C、D、E和K,和有机酸,包括例如葡萄糖二酸、葡糖酸、戊二酸、己二酸、琥珀酸、酒石酸、草酸、乙酸、乳酸、甲酸、苹果酸、马来酸、丙二酸、柠檬酸、延胡索酸、衣康酸、乙酰丙酸、木质酸、乌头酸、抗坏血酸、曲酸和comeric酸;一种优选的有机酸为柠檬酸。在该文件及其权利要求书中,动词"包括"及其变化形式以其非限制性的含义被使用,表示该词语后的项目被包括,但是未明确提到的项目不排除在外。另外,涉及元件时,不定冠词"一个/种"("a"或"an")不排除存在多于一个/种元件的可能性,除非上下文清楚地要求有且仅有一个/种元件。不定冠词"一个/种"("a"或"an")因此通常表示"至少一个/种"。实施例1.实施例1:密码子对偏向性的分析1.1材料和方法U.l数据和软件可以对全基因组序列数据中的编码序列(CDS)以及来自它们的部分组(或部分基因组序列,例如cDNA/EST文库,或甚至来自相关生物的多个基因组的部分基因组数据)进行密码子对分析。本发明中使用的工具使用FASTA文件作为输入值阅读这些数据。所有计算中的大部分在MATLAB7.01(TheMathWorks,Inc.,www.mathworks.com〗中进行,但是对得到的结果的一些详细的分析而言,使用SpotfireDecisionSite8.0(Spotfire,Inc.,http:〃www邻otfire.com/products/decisionsite.cfm)。对于A"&er而言,使用针对CBS513.88(Peletal.,2007,NatBiotech.21:221-231)全基因组预测的cDNA序列和一组479个高表达的基因的FASTA文件。另外,因为在中试规模发酵条件下A"&w的>14,000个基因中通常少于一半同时被表达,所以来自使用这类条件获得的24个基因芯片的数据被用于选取第二组基因并根据观察到的mRNA水平(因为此时不能获得其它数据)将它们分级,从而能够容易地鉴定任何大小的一组(推定地)高表达的基因,所述第二组基因仅包括在多种实验中实际表达的基因(仅考虑具有至少18个"存在"口令的基因,使用AffymetrixMAS5.0阵列分析软件;该集合包含4,584个基因)。对该分析而言,我们使用了基因的转录水平。或者也可以使用定量的蛋白质表达数据,例如蛋白质的双向凝胶电泳和随后通过质谱法的鉴定。然而,与mRNA水平的测定(例如使用基因芯片)相比,产生针对大组蛋白质的蛋白质表达仍然是耗时的。因此,本文完成了在翻译实际发生之前研究密码子偏向性对翻译的影响。Gygi"(Yeast.Mol.Cel.Biol.19(3):1720-30)实际上发现了五.co/z'中"蛋白质和mRNA表达水平与密码子偏向性的关联",尽管mRNA和蛋白质表达水平的关联仅仅是相当初步的。因此,在本文上下文中,当实际上仅测定了对转录水平的影响时会使用术语"表达水平"。对含有约4,000个基因的生物而言,能够获得并分析了一组300个高表达的基因。在该研究中考虑到的所有生物基因组的基本特性(然而,它们并非均会被详细描述)的综述见表l.l。在每个分析中忽略其中在末端外的另一位置包含一个或多个终止密码子的(推定的)基因,和其长度不能被3整除的序列(即在测序期间可能发生移码的序列)。每个基因的最先五个密码子和最后五个密码子也不考虑在内,因为这些位点可能涉及蛋白质结合和释放效率,并因而承受与序列其它部分不同的选择压力,因此此处的密码子和密码子对偏向性可能不具代表性。比20个密码子更短的ORF(OPO^开放读码框)也从该分析中被省略。在表l.l中己经考虑。表1.1若干种生物的核苷酸含量,包括ORF数量和以兆碱基对(Mbp)表示的基因组大小。生物名称ORF数Mbp核苷酷A:含量cGT7,78210.6124%28%26%22%13,96218.4124%27%26%22%12,07416,2925%26%26%23%44生物名称ORF数Mbp核苷醇a:含量c|gT4,4493,5426%24%|27%23%4,1043.6630%20%|24%26%co/z'4,2894.0924%25%|27%24%尺/acfc5,3367.5232%19%|21%28%尸.cA,ogewwm13,16417.5424%27%|25%23%6,4499.0133%19%|20%28%7,8947.6214%37%|35%13%71.re&seZ8,33111.4523%30%|28%20%l丄2预期的密码子对出现(occurrence)为了分析密码子对使用,首先计数每个单个密码子和每个密码子对的出现,下文记为"由((c,,。)),其中ofo表示观察到的。双括号必须指出"观察到的数值"即w。^是仅有一个自变量的函数,其自身为对(在该情况下为密码子对,即(c,,。))。这同样适用于下文定义的针对密码子对的所有函数。指数/、以及A:可以是1到64,指出内部表征(intemalrepresentation)(根据其字母顺序)中的密码子编号。(。,c》表示密码子对,c,为左侧密码子(即6-核苷酸序列的5'三联体),而。为右侧密码子(即更接近3'端),以及针对每个密码子^的出现数《"(cj(其中下标w表示单个密码子,上标"//指出表示该数字涉及全基因组,与可被用于表示单个基因g中密码子比例的<")相反;密码子对的函数如"血((c,,。))始终表示全基因组或更大组基因中的数量)。然后计算单个密码子比例(注意在一些文章中,这些比例也被称作频率。然而,密码子频率也可表示密码子的出现数除以所有密码子的总数)ix"(。)其中^yw(^)表示与q编码相同的氨基酸并因而与Q同义的密码子集合。因此,分数线以下的总和数值等于整个蛋白质组中由c,编码的氨基酸的出现数。本文中使用的最重要的符号和式的简表见附录l。为了揭示是否某些所述的密码子对优选级仅仅是个体密码子优选级的结果,必须基于个体密码子频率计算每个密码子对的预期值。这些使用下式计算":((c,,c》)":"(c,.;Kf(c》.z"。j(,c"))上标OVW2用于将这些数值与使用下文提到的其它方法获得的数值区分。在该等式的最后一个因数中,计算所有同义密码子对的实际出现数的总和。因此,每个密码子对的预期量是个体密码子使用比例和各自氨基酸对出现数的乘积。GutmanandHatfield(1989,Proc.Natl.Acad.SciUSA巡:3699-3703)提出计算预期值的另一种方法。他们最初的途径是个别地计算每个基因的密码子频率(即基因g中的密码子量除以g中密码子的总量,所述总量表示为lgl),然后将这些值与该序列中的密码子对数(其为lgl-l)逐对相乘。lgllgl在该等式中"gW"表示GutmanandHatfield方法1(1989,上文)。这得到了针对每个基因预期的密码子对值(上文等式中求和算子(sumoperator)后的部分),所述预期的密码子对值随后被加在一起,得到最终的预期值,所述最终的预期值通过定义针对相同基因组中不同基因间单个密码子使用的可能偏差被调节,但是不考虑氨基酸对使用中可能的偏向性。这表示如果某些氨基酸倾向于比其它的更经常彼此紧邻,或者也就是说如果氨基酸对的的发生不与它们在具有相同氨基酸组成的随机化的序列中会发生的不相似,则预期值也应显著不同,因为编码很少使用的氨基酸对的密码子对会具有过高的预期值,而编码更常使用的氨基酸对的密码子对会具有过低的预期值。GutmanandHatfield(1989,上文)还提出了标准化他们关于氨基酸对偏向性的预期值。因此,他们将根据他们的方法得到的氨基酸对预期值与观察到的值简单地比较,并相应地按比例计算所有受影响的密码子对的预期值使得前者与后者匹配46i:《uce—。)在该等式中"gA2"表示GutmanandHatfield方法2(1989,上文)l丄3计算密码子对偏向性然后实际的密码子对偏向性&'"<&,"应当得自预期的和实际的(观察到的)密码子对数之间的差异(其中可使用针对预期值的任何这些方法)。最初的途径简单地通过一"邵((c,,。))计算。以这种方式,该偏向性值会指出事实上使用的密码子对比预期的多或少多少个百分比(即乘以100%)。对于被分析的基因集合中不存在的氨基酸对而言,根据该式的偏向性值对所有相应的密码子对而言应当是0/0。在该情况下,其被定义为0。因此偏向性值的下限应当是-1,而不存在清楚的上限。这被认为有些不实用,因此使用max(w血((c,.,。)),"哗((c,.,。.)))来代替,其中max(a,b)表示a和b两个数值中更大的一个,这总是导致在(-l,l)中的偏向性值。这表示偏向性值可以是-1,但是不是+1。前者发生于某一密码子对完全不被用于编码真实发生的氨基酸对;数值+1不能达到因为那时"^((。,e,))会是0,但是这仅在"J(。,。))也是0时是可能的。上文给出的解释对于O(这表示"。J(c,A))〈"^((c,,。)),因此两式具有相同的结果)的偏向性值仍然有效。如果"^(",。))>"^((。,。)),则偏向性值(此时其>0)指出预期的值比观察到的值低多少个百分比(即在该情况下基线被改变)。l丄4偏向性的统计学显著性GutmanandHatfield(1989,上文)使用^-检验测定他们的结果的统计学显著性。该检验被用于检査在特定的假设下某观察到的结果偶然发生47的可能性。当检査密码子对时,该假设应当是密码子对使用是独立地随机选择每个密码子的结果。为了检验该假设,计算^-值<formula>formulaseeoriginaldocumentpage48</formula>(其中CP表示不包括终止密码子的所有密码子对的集合)。此时自由度为3720(61*61-1)。如果密码子对选择是随机的,则会预期^-值在3720左右(等于自由度),标准差等于2*自由度的平方根。以这种方式,可以检验观察到的偏向性的总体统计学显著性。然而,也可以演绎个体密码子对偏向性的统计学显著性。如前文提出的计算预期值的方法一样,密码子对的出现数被认为是一系列的独立的是/否实验(是这两个密码子被选择用于编码各自的氨基酸对;否另一个密码子对被选择)的结果,因此其允许二项分布,如果被分析的基因集合足够大的话这可以被近似为正态分布。如果n*p>4的话这被认为是良好的逼近,其中n表示实验数,p表示"是"的概率,其也是预期值。因此,对于每个密码子对而言,标准差可以根据下式计算=V"邵((c',。)).(1—C(c》《"(。))。然后可以计算标准分数(也被称作z-分数)"、、_("。fa((C,,。))-"exP((C,,。))琳',。"-。z-分数的绝对值指出实际(观察到的)值偏离预期值多少标准差。假定为正态分布,则所有观察结果约95%应当偏离预期值在两倍的标准差内,>99%应当偏离预期值在三倍的标准差内。1.2结果1.2.1密码子对偏向性的存在使用上述的方法,我们发现存在显著的密码子对偏向性。对于所有被研究的生物而言,^-检验给出了^-值,其为自由度的若干倍高并因此也高于预期值多倍标准差。如针对个体密码子对的偏向性一样,可以证实Mouraetal.的发现在酵母中"约47%的密码子对上下文落入"偏离预期48值"-3到+3"个标准差"的区间中"(尽管他们以不同的方式计算预期值),所述预期值对应于我们分析中的z-分数。总之,如果密码子对使用是随机的,则存在比应当有的显著更多的具有相当高z-分数的密码子对。见表1.2:使用会导致近似正态分布的随机选择时,例如仅约5%的所有密码子对会具有大于2或小于-2的z-分数,但是在选择的四个生物的全基因组中,这实际上适用于多于三分之二。表1.2.不同生物中的Z-分数lz-分数l>1>2>3正态分布68.3%5.0%0.3%86.1%73.7%60.4%89.2%79.1%69.7%j.O,M88.4%76.7%65.1%88.1%76.4%64.0%86.1%72.0%59.3%五.尺"86.1%74.8%64.0%82.6%67.0%53.4%尸.c/,ogewww89.3%79.1%69.0%82.7%67.6%52.1%82.0%66.5%53.5%7!"65^89.0%79.8%71.00/0注意这些值与基因组大小有些相关(见表1.1的比较),即具有更大基因组的生物趋向于具有有更极端z-分数的密码子对。特别是当分析更小的基因(例如A中479个高表达的基因)组时,该数值更低(对该例子分别为65.1%、37.2%禾卩19.7),因为更小的出现数导致(与预期值相比)更高的标准差,并因此导致结果的更低的统计学显著性。这导致下述结论密码子对使用不是根据单个密码子比例的密码子随机选择的结果。偏向性值自身的分布在一个生物和另一个生物间差异。这可以根据图3解释,图3显示了不同生物中3,721个正义:正义密码子对的密码子对偏49向性值的分布。图3中每个直方图右上角的数字是观察到的分布的标准差;对所有生物而言平均值(未显示)在-0.06和0.01之间。在图3所示直方图中,可以看到在测试的十个生物中,细菌五.co/z'、及^Z^7m、及和&coe/z'co/or具有最极端的密码子对偏向性,而真菌爿.、^4.07zae、A/e/reus、j.wV/w/am1、尸.c/o^ogewwm禾口酵母51.ce"Ww'ae和K/acto中的偏向性较不极端。比较不同生物的密码子对偏向性时可以得到另一有趣的观察结果。来自相关生物的偏向性值显示比来自无关生物的这些更高的关联。这根据图4解释。图4显示了多种生物的密码子对偏向性之间的关联。相关系数在每个小图的右上角显示。在该分析中,可以在乂.m'ger对尸.c/io^oge"wm,和A对Aoo^ae之间观察到最高的相关,可以在5.和S.co^'co/or之间观察到最低的、即观察不到有效的相关。有趣的是,未观察到负相关。这表示尽管具有高GC-含量的生物(如5l"^'"/w)大部分偏好在富含AT的生物(如S.cwwW^,或尽管其并非极端富含AT的及w6"fo)中较少使用的这些密码子,但是不存在两种生物,其一优选的对在另一中可能被拒绝且反之亦然。这可表示尽管几乎每个单个密码子的偏向性都是生物依赖性的,但是存在在几乎每个生物中都被优选的和/或被拒绝的若干密码子对(例如因为它们引起移码或具有不匹配结构的tRNA的可能性)。1.2.2密码子对偏向性的模式为了显示观察到的密码子对偏向性,可以如Mouraetal.(2005)所做绘制所谓的图谱(他们将这些图谱称作"密码子上下文图谱")。这可以最容易地根据彩色图片解释,所述图片由针对每个密码子对的彩色矩形组成,行表示对的第一密码子,列表示对的第二密码子。红色表示负偏向性,绿色表示正偏向性。白色表示实际上具有等于0的偏向性的密码子对(其为例如ATG-ATG的情况,因为这是编码氨基酸对Met-Met的唯一方式)和掺合了终止密码子的对。然而,彩色的图片不能是专利申请公开内容的部分。为了进行黑白的显示,在该实施例中将图片拆分为两个图片。图5A展示了Am'ger的正密码子对,而图5B展示了的负密码子对(也见附录3,表Cl)。密码子对越偏向,则相应的矩形越黑。此处的偏向性值范围在-0.67和0.54之间,而在其它生物中他们甚至可能稍高于+/-0.9(也见图3)。这些矩形中最高的黑色强度(原始的绿色(顶部)和黑色(原始红色(底部)))分别表示0.9和-0.9的值(此处未达到;通常最大偏向性的绝对值稍低于最小偏向性的绝对值)。另外,我们在附录3中提到CPW矩阵-表,其含有密码子对的偏向性的数值,我们提到图5作为彩色图片的黑色和白色例子,从而技术人员能够使用来自附录3的表中的数值重建彩色的版本。这些密码子对图谱的第一条途径是根据其字母顺序(因为这是它们的内部表征)对行和列排列。该图谱中可以看出对角线似乎含有比红点稍多的绿点,这指出许多密码子具有与其邻居相同的密码子偏好。另外,大部分相邻的列有些相似,而相邻的行则不然(数据未显示),见图5A和5B和附录3,表C1。然而大部分行与隔着另外三行的行相似,即每四行存在一些相似性。因为每四行的普遍特性是所述对的第一个密码子的最末核苷酸,所以更优选地根据第三位的字母顺序作为第一排列标准、中位的字母顺序作为第二排列标准对行进行排列。然后可以在Am'gw图谱(图5C和D,和附录3,表Cl)中看到的是偏向性似乎的确主要与第一个(5')密码子的最末核苷酸和第二(3')密码子的第一个核苷酸相关,因为16*16密码子对的各块值中大部分具有相同的颜色。例如,在Aspergillus中可以鉴定的一个普遍规律是密码子对如xxT-Axx(x表示任何核苷酸,指出各位置上的核苷酸对于所说明的规律不重要)被拒绝(左下角的红色块),而模式xxA-Txx表征了优选的密码子(右上角的绿色块),再次指出密码子对偏向性是有方向性的。然而,并非所有偏向性可以仅用密码子对"中间"两个相邻的核苷酸中的模式解释。例如xxC-Axx密码子对(见最左侧从顶部起第二块)通常不被优选或拒绝,但是对xxC-AAx模式的对(注意刚提到的块左侧的四个绿色块)存在清楚的偏好。偏向性也可取决于不相邻的核苷酸(例如及s由zfc中对CxA-Gxx对的强拒绝;见图6A和6B和附录3,表C4)。不幸的是,密码子对偏向性不能总是归因于这类"简单"的模式(见例如图7A和B和附录3,表C5中针对S的相当混乱的图谱)一甚至当使用SpotfireDecisionSite8.0(http:〃www.spotfire.com/products/decisionsite.cfm)进《f聚类分丰斤日寸,也未发现普遍特性(数据未显示),即鉴定的聚类主要由不相关的密码子组成(即相同位置没有通用核苷酸)。1.2.3偏向性和表达水平的关系观察A的具有高表达水平(或更好地为假定高表达水平,因为它们仅通过观察转录水平被鉴定)的基因的偏向性图谱(见图8),更大组(即简图中的块)的存在不是同样明显的(或者,也就是说,如上所述的简单规律可能完全不存在)。然而因为所有密码子对中的三分之二在该组中出现36倍或更少倍,还因为如上所述平均低得多的z-分数,人们能够将这归因于大范围的随机变异。图9显示Am'gw的一组479个高表达基因的偏向性(垂直轴)对所有基因中偏向性(水平)的散布图。显示了不涉及终止密码子的所有3,721个密码子对。从浅灰到黑色的阴影处理根据全基因组中z-分数的绝对值指定(即图中的浅色点在所有的基因中不具有显著偏向性),大小根据高表达组中z-分数的绝对值指定,即非常小的点在其中不具有显著偏向性(此处lz-分数|<1.9)。黑色实线指出两个偏向性数值相等的地方;黑色虚线显示实际相关的最佳线性逼近(通过主成分分析鉴定的);其斜率在2.1左右。将每个密码子对在高表达组中和全基因组中的两个偏向性值进行比较时(见图9中的散布图),可以看出对大部分对而言,高转录组中的偏向性更加极端,即如果低于0则更低且如果为正则更高,但是存在一些对,其偏向性值相当不同,甚至具有不同的标志。然而,这些主要是在顶部组(topgroup)中小量发生的密码子对,其中偏向性高度显著的大部分对(蓝色,大圈)在两组中具有相似的偏向性(即它们接近蓝色线,所述蓝色线指出两个偏向性值相等的位置)。52未能(无论是针对Am'gw还是针对均及^/6"fc未能)找到分享三个核苷酸中两个的、涉及密码子的相似偏向性差异的特定模式,即在与上图相同的密码子差异图中,不存在具有相似的偏向性差异的更大组。1.3.为了基因适应鉴定密码子对权重的细节目前可以根据所述的方法来测定用于适应的密码子对权重(附录1:密码子对权重-方法一个序列组(或基因组))1.基于基因的全集;基于1的子集。2.被识别为高表达基因的部分。另外,我们开始搜索以鉴定与更高的转录水平明显相关的密码子对权重,其是适应密码子对使用的改进的方法所需要的,己经应用以下的方法在中计算每个基因的平均密码子对权重(即刀^(g)值的等价物),所述Am'gw中可以针对上述4,584个实际表达的基因集合获得取自GeneChip数据的完全分级(见"材料和方法"中的"数据")。然后根据适合度值(升序)和表达水平(降序)来排列基因。因为高表达的基因应该具有低密码子对适合度值,当使用理想的密码子对权重时这两个分级应当是相等的,因此这两个分级的比较能够给出关于适合度函数(其中对高表达基因的"正确"分级给予了比中等表达基因的分级稍微更多的关注)中使用的权重品质的信息。另外,计算了4,584个基因的分级和平均密码子对权重之间的相关系数(协方差除以每个变量的标准差)。已经检査了若干种可能的权重集合,包括i.来自全基因组的偏向性值,ii.高表达组的偏向性值,iii.具有所有值的偏向性,其不具有设置为零的确定的最小z-分数iv.上升至2的幂次(和一些其它值)的偏向性值,以给予高表达的或拒绝的密码子更低/更高的影响v.其组合vi.z-分数自身"1.来自高表达组和全基因组的偏向性值/2-分数的差异。53对遗传算法而言,它们的求反(negation)己被使用,因为已用正值(相当随意地)识别优选的密码子对,然而GA进行最小化。这适用于所有提到的权重。其中,"最佳"的权重矩阵最终是ii到iv项的组合,然而,使用基于全基因组的密码子比例计算的预期值通过计算高表达组中的密码子对"偏向性",如上所述可以获得进一步更好的矩阵。图10显示观察到的相关。与测试的所有其它权重结合不同,涉及在高表达组中更不足量表现的密码子的密码子对在本文中具有轻度的缺点。因此,这些权重是仅有的也反映高表达的组和所有基因的不同单个密码子偏向性的权重。使用这些权重带有拒绝下述一些密码子对的风险,所述密码子对实际上在高表达的组中具有正偏向性,但是由(高表达的组中)很少使用的密码子组成。然而,因为我们期望的单个密码子比例通常与具有高表达的基因组中的单个密码子比例并不相同,而是比它们更加"极端",单个密码子优化无论如何会代替这些不足量表现的密码子,因此我们能够认为上文所述的权重对于密码子对优化是非常方便的。总而言之,己经如上所述鉴定了用于基因适应的被潜在地改进的密码子对权重矩阵。方程在附录1中给出密码子对权重一方法高表达组与参考组(或基因组)。1.4.在计算机芯片上的单个密码子和密码子对优化1.4.1材料和方法用于分析和优化基因的被开发的MATLAB工具箱由若干个函数组成,这些函数根据其能力被组织在不同的目录中。因此为了使用它们,必须使得它们对MATLAB环境而言均是已知的。为此,从文件菜单中选择"设置途径(SetPath)"并点击"从子文件夹添加(Addst^/oWera)"并选择安装该工具箱的途径(通常被称作"Matlab-bio")。还添加FASTA和应当被分析的其它文件的位置。所有个体MATLAB函数简要描述于"contents.m"中(键入"helpMatlab-bio"从而在MATLAB环境中显示该文件,并且在函数名前使用"help"得到关于该函数的详细信息)。对于关注54于密码子对使用的基因优化而言,两个重要的函数是"follanalysis"和"geneopt"。如果你希望一个基因所适应的生物的全基因组位于该文件(即"Jm'gw—O^F/osto")中且其高表达基因的标识符在"a"-Z^g/.加"中,键入"fullanalysis('Aniger—ORF.fasta','an-high.txt','an');,,后,你会得到(i)全基因组的密码子对偏向性图谱,(ii)第二个文件中基因的组的密码子对偏向性图谱,和(iii)MATLAB工作空间中用于进一步用途的若干个变量(即临时存储的数据集合)。"follanalysis"的第三个参数仅确定这些变量如何被命名,并且如果同时只要分析一个基因组的话可以被省略。所提到的变量中有(i)全基因组的密码子对使用和偏向性数据(在该实施例中称作"^朋"),(ii)由第二参数说明的特定组基因的密码子对使用和偏向性数据(称作"c/^ra")和(iii)下述结构,其具有能够用于遗传算法的目标单个密码子比例和密码子对权重。"fiillanalysis('Xyz—ORF.fasta,);"仅会显示密码子对偏向性图谱并存储各自基因组的偏向性数据。尽管第二参数可以是包含基因标识符的任何文件(例如具有低表达的基因或具有某共有功能的基因的集合),但是其总是被对待为关于该(潜在的)参数的高表达基因的集合(在该实施例中称作"o;^;w画/ora"",这表示牛寺定生物的优"f七参数(^e0/"/w'za"o"_pflraw"erAes/e"yo/orgasm))。注意此处的单个密码子比例被简单地计算Cg"")=2^sA")-《""),这是可接受的近似值。同样可以通过其它方法鉴定目标比例从而进一步改进期望比例的规格,所述其它方法包括单个密码子分布的细节(见主文本)。另外,当未发现特异偏向性时目标比例可以保持为空,从而在寻找具有更高密码子对适合度时给予密码子对算法更多自由度。附录1中针对多种宿主生物给出了若干个这类预定的单个密码子目标载体。为了将预定的单个密码子目标比例用于遗传算法,如下改变参数的字段(field)"c,,键入"optparamforan.cr=[",然后粘贴单个密码子比例(例如从Excel表格中拷贝;注意它们应当按照密码子的字母顺序),如果该比例可以作为64-元行获得则键入"];",或如果它们从列中拷贝则键入55"],;",按回车(注意后一情况下方括号后额外的单引号或上撇号)。不重要的密码子比例(即其中不期望特定的目标比例)可以被指派"数值"NaN(不是数字),并且在计算单个密码子适合度时它们会被忽略。为了从优化的基因中排除某些短序列,以同样的方式设定参数'、",其中每条序列必须括在单引号内,且所有的序列必须被一起括在大括号内例如(无断行)"optparamforan.rs={'CTGCAG''GCGGCGCC'〉;"。最后,可以改变参数的字段cp',给予单个密码子优化或密码子对优化在组合的适合度函数中更高的重要性(见"结果和讨论"中的分段"进行密码子对优化")。默认值为0.2。如果用密码子对优化的基因的实验结果显示密码子对优化的基因与单个密码子优化的基因相比很少的改进,则将其设定为更低的值;在相反的情况下,更高的c^'可能更好。然后可以使用函数geneopt进行使用遗传算法的基因实际优化。只需要下述参数要被优化的序列和含有密码子对权重的结构、如上所述的目标比例和限制性位点,因此geneopt('MUVARNEQST*,,optparamforan);可以例如被用于优化给定的(相当短的)蛋白质序列从而在中高表达;'*,被用于表示得到的遗传序列应当在末端具有终止密码子(然而,因为中的最优终止信号被认为是四聚体TAAA,所以这不是必要的)。注意要被优化的序列也必须括在单引号内;如果该序列仅含有字母A、C、G、T或U且其长度为3的倍数,则其被自动地识别为核苷酸序列。然后将遗传算法进行1000世代,种群大小为200,其中80个被该世代保留(79个最优的和一个随机挑选的)并用于产生新个体,其中40%的新个体使用交换产生,60%的新个体使用突变算子产生。这些默认值证明对优化是非常便利的,即这些参数中的改变仅会(如果根本的话)导致非常轻微"更好"的基因,但是如果在优化上应当花费显著更多或更少的计算时间,则它们可同样被改变(在1.4GHzPentiumM处理器上用约500个密码子的基因平均运行geneopt耗时约15分钟)。geneopt(seq,optparamforan,[50750500.6])会例如使得遗传算法计算种群的750个世代,其中每个新世代保留50个个体并新产生250个个体(5*50;即在每代中检查300个个体),仅保留最优的(并且无随机挑选的)个体且60%的重组使用交换算子进行。关于如何明确这些参数的更多细节,键入helpgeneopt禾口helpgeneticalgorithm。注意尽管本文中针对A"&er和及犯6rifc显示和描述了通过分析相应的FASTA文件产生密码子对权重的步骤,但是只有对这两种生物这不是必要的,因为已针对先前的基因优化进行过这些计算。为了更简单的使用,已存储了遗传算法的各自参数(分别键入"loadgadata一for—an"或"loadgadata—for一bs";注意参数现在仅简单地称作anjaram和bs_param)。1.4.2结果图ll显示了五个优化版本的适合度值,各针对不同的c^值(见图ll中的图例)。蛋白质为针对宿主Am'ger(见实施例2)优化的真菌o;-淀粉酶(FUA;也称作AmyB)。另外,显示了"纯"单个密码子优化(右侧黑点)的结果和密码子对优化的结果。通过对400个种群大小将遗传算法进行IOOO世代左右获得优化的版本,每个世代在1.4GHzPentiumM上运行耗时约17分钟。注意纯单个密码子优化和纯密码子对优化仅耗费该时间的约60%。在图11中,野生型WUg—)=0.165,y^p(g—)=0.033)不适合该图(其应该在右上远处)。最优的基因总是具有最低刀4c和刀一值的基因。考虑到点的位置,不清楚针对哪个c^值能够获得最改进的基因,因为我们还不知道是单个密码子使用还是密码子对使用更加重要。然而,一种费用平衡(faretrade-off)似乎在cpz'=0.2的情况下发生。单个密码子和密码子对使用中的改进可以显示在该工作中提出的所谓序列品质绘图中。图12阐述了两幅简图,其显示上述FUA(也见实施例2)的(449个中)最初20个密码子的序列品质。注意这些序列品质简图不仅取决于序列自身,而且取决于权重和期望的单个密码子比例的设置,并因此取决于生物。注意对于具有低或无密码子偏向性的密码子而言,还可能将目标单个密码子比例定义为"不关注",即不考虑某密码子的使用与其同义密码子相比对于表达是正还是负。在该情况下,基因中各自密码子的实际比例仅57显示蓝色的X-标记,并且计算单个密码子适合度时该具体的位置被忽略(见1.4在计算机芯片上的单个密码子和密码子对优化)。1.5结论已经在大范围的生物中确立了密码子对使用和转录水平的显著相关。这证实了该偏向性不能仅通过开发读码位点周围的单核苷酸偏向性来解释。因为偏好或拒绝某密码子对的可能的解释均集中在翻译上,所以应当假定偏好或拒绝二者均由天然选择引起,所述天然选择同时作用于影响翻译的特征和影响转录的其它特征,从而最小化细胞生产酶或至少更重要的酶的作用。除了经典的单个密码子优化或密码子对调和外,可因此考虑在多肽编码序列中优化密码子对使用以达成改进的过表达,其中对于优化仅考虑单个密码子频率。对于在该实施例中被研究的真菌宿主种类和bacilli而言,相同基因的密码子对适应和单个密码子适应仅轻微地干扰,即二者可同时进行且结果会比野生型基因具有"更好"的单个密码子使用和"更好"的密码子对使用,且当忽略另一个时,两方面中的任何能够仅被轻微地改进。为了阅读FASTA文件,并进行分析和优化,已开发了用户友好的MATLAB函数。已介绍了展示单个基因的密码子对偏向性和密码子对使用的新方法,见实施例2和实施例4。针对优化设计的遗传算法允许有效处理邻近密码子对的相互依赖性带来的约束,同时特别设计的突变算子有助于克服由于遗传算法在最初几个世代后的重组步骤中产生许多不良的可能解的性状而通常伴随着遗传算法的无效(inefficiency),所述突变算子总能改进序列品质(单个密码子和密码子对适合度)的两个方面之一。合适的密码子对使用能影响对酶的生产,这会在以下的实施例中通过实验显示。已经制备了要在及^6"fo中表达的三个基因的密码子对优化的变体,其中一个将与仅适应了单个密码子使用的合成基因比较而另一个将与下述合成基因比较,其经历了优化过程(所述过程使用假定为正的权重的求反)但是仍然以与前文相同的方式(见实施例4和实施例5)针对单个密码子使用被优化。通过这种方式,也对在本文中被否决的Irwin58e"/.(1995)的观点进行检验,所述观点为不足量表现的密码子刺激翻译。对于A"Zgw而言,将检验上述amyB的密码子对优化的版本并与具有单个密码子调和的野生型和合成基因比较,见实施例2和3。2.实施例2:用于构建改进的DNA序列以改进Ape/^7tom'ggr真菌淀粉酶在Am'ger中生产的本发明的方法的用途下文中,本发明的方法被应用于设计Am'gw的AmyB(FUA)基因的新颖的核苷酸序列,该序列为了在Aw/gw中改进的表达而在单个密码子和/或密码子对使用上被优化。该方法可以以相同的方式应用于改进任何核苷酸序列的密码子使用。2.1介绍通过密码子调和的单个密码子优化的概念先前由本发明的申请人开发并在主文本中报道(也见实施例3)。在该实施例中我们显示了如何将本发明的方法应用于设计下述基因,所述基因针对单个密码子和密码子对使用被优化。在该特定的情况下应用权重矩阵,所述权重矩阵通过应用含有14,000个基因的Am'gw全基因组中2%和4%的高表达基因的两个子集产生。对于单个密码子使用而言,该算法使得解趋向于具有表B.l(=表2.1的第三列)定义的同义密码子-频率的基因,而对于密码子对使用而言,其应朝向下述最优密码子对的集合被优化,所述密码子对集合的高频率具有相关的负权重(在表C.2中),是对于其在4%高表达基因集合中的期望值而言过量表现的密码子对。注意万一没有特定宿主的高表达基因的确定列表,则人们也可以(i)引用类似的宿主生物的权重矩阵,例如尸.cA,oge"廳矩阵可以被用于^m'gw,或(ii)应用全基因组序列数据或其子集产生良好的、但是较不最优的权重矩阵。2.2材料和方法2.2.1编码Am'ggro;-淀粉酶Jmv^的野生型amyB编码序列编码a-淀粉酶蛋白质的am;;B基因的DNA序列公开于J.Biochem.Mol.Biol.37(4):429曙438(2004)(MatsubamT.,AmmarY.B.,AnindyawatiT.,YamamotoS.,ItoK.,IizukaM.,MinamiuraN."Molecularcloninganddeterminationofthenucleotidesequenceofrawstarchdigestingalpha-amylasefrom^pe/^7/usmvamoWKT-11.")中,其也可以以登录号AB083159得自EMBLNucleotideSequenceDatabase(http:〃www.ebi.ac.uk/embl/index.html)。天然Am'geram少B基因的基因组序列作为SEQIDNO.1显示。aw少B的相应编码或cDNA序列作为SEQIDNO.2显示。被翻译的SEQIDNO.2序列被指定为SEQIDNO.3,代表Am'gero;-淀粉酶蛋白质AmyB。该序列与Ao^^aece-淀粉酶蛋白质也具有100X的相似性(WirselS.,LachmundA.,WildhardtG.,RuttkowskiE.,"Threealpha-amylasegenesofAspergillusoryzaeexhibitidenticalin加n-exonorganization.";Mol.Microbiol.3:3-14(1989,UniProtaccessionnr.P10529,P11763orQ00250)。已对amyBcDNA序列进行了根据本发明方法的优化。2.3设计步骤经优化的编码核苷酸序列SEQIDNO6是运行所述软件方法的结果。应用的参数为种群大小=200;迭代数=1000;c戸'=0.20、CPW矩阵="表C.2.CPW:Aspergillusniger-高表达的序列"和CR矩阵二"表B.l第4列CR表ANS:Aspergillusniger-高表达的序列"。另外,对Pw/(CTGCAG)和Notl(GCGGCGCC)位点的每次发生,对_/^。油.加上+1的罚分值。朝向y^。油的最小值的解趋同在图13中显示。针对SEQIDNO.6获得的目标值(objectivevalue)在表2.2中与针对SEQIDNO.2和SEQIDNO.5的值一起给出,图14解释了如图15和16中所示的这些基因的单个密码子统计,表2.2给出三条序列中密码子的实际值。图18-20显示三个基因变体的单个密码子和密码子对二者的统计。该类图表在图17及其描述中详细解释。从这些图中明确了SEQIDNO.5和SEQIDNO.6的单个密码子统计是高度相似的。然而,本发明的方法导致下述基因,所述基因具有增加的具有相关负权重(w"g)《O)的密码子对数量(93%比74%),还导致了y^从-0.18到-0.34的进一步降低,指出具有更多负权重与之相关的密码子对的更加最优的使用。表2.1amyB的密码子优化。AA密码子最佳密码子分amyBamyB优化的优化的布w.t.w丄amyBamyB[#密码[%密scsc&cp子]码子[#密码[#密码子]/AAJ子]AAlaGCT3811.91618AlaGCC511535.72123AlaGCA01228.600AlaGCG111023.851CCysTGT0777.800CysTGC100222.299DAspGAT362047.61515AspGAC642252.42727EGluGAA2641.73GluGAG74758.399FPheTTT0320.000Phe—TTC1001280.01515GGlyGGT491023.32122GlyGGC351841.91515GlyGGA161023.376GlyGGG011.600HHisCAT042.900HisCAC100457.177IlieATT27725.077lie—ATC731967.92121Ile一ATA027.100KLys一AAA0735.000Lys一AAG1001365.02020LeuTTA012.700Leu一TTG131027.054LeuCTT17410.867LeuCTC381335.11415LeuCTA038.100LeuCTG32616.21211MMetATG10010100.01010NAsn_AAT0311.500Asn一AAC1002388.52626PProCCT36627.388ProCCC64836.41414ProCCA0313.600ProCCG0522.70061QGinCAA0525.000Gln一CAG1001575.02020RArgCGT49110.05ArgCGC51220.055ArgCGA0220.000Arg—CGG0220.000ArgAGA000.000Arg—AGG038.100SSerTCT21410.888SerTCC44924.31617SerTCA0410.800SerTCG141027.054Ser_AGT0410.800Ser_AGC21616.288TThrACT30922.51212Thr一ACC701332.52828Thr一ACA01025.000ThrACG0820.000VValGTT2716.189ValGTC541238.71717ValGTA0412.900ValGTG191032.365wTrpTGG10012100.01212YTyrTAT01131.400TyrTAC1002468.63535表2.2amyB的密码子优化序列类型力4p《0(,0.2)SEQIDNO.2WT0.16520.032937.3%0.090SEQIDNO.5sc优化的0.0046-0.176573.9%-0.862SEQIDNO.6sc+cp优化的0.0109-0.342092.6%-1.621表2.2中列出的所有三条序列是编码序列,其翻译的序列被指定为SEQIDNO.3。3.实施例3:测试用于构建改进的DNA序列以提供乂幼ergz7/wm'ger真菌淀粉酶在Am'g^中改进的生产的本发明的方法62本发明的方法在下文中用于改善A的AmyB基因的单个密码子和密码子对使用。该方法可以用相同的方式应用于任何核苷酸序列的密码子使用改善和改进的表达。3.1材料和方法3丄1菌株WT1:该菌株被用作野生型菌株。该菌株在保藏号CBS513.88下保藏于CBSInstitute。WT2:该Am'ger菌株是包含编码葡萄糖淀粉酶的基因(g/aA)缺失的WT1菌株。按照EP0635574Bl中所述,通过使用"MARKER-GENEFREE"途径,构建WT2。在该专利中详尽地描述了如何缺失CBS513.88基因组中的g/aA特异DNA序列。该步骤产生MARKER-GENEFREEAg/aA重组体Am'gwCBS513.88菌株,其最终完全不具有外来的DNA序列。WT3:该Am'gw菌株是含有突变的WT2菌株,所述突变导致草酸盐缺陷的Am'ger菌株。通过使用如EP1590444中所述的方法来构建WT3。在该专利申请中详尽地描述了如何筛选草酸盐缺陷的Am'gw菌株。根据EP1590444的实施例1和2的方法来构建菌株WT3,菌株WT3是EP1590444的突变体菌株22(在EP1590444中命名为FINAL)。WT4:该Am'ger菌株是在三个先后的步骤中包含编码a-淀粉酶的三个基因(awjB,am少BI禾naw少BII)缺失的WT3菌株。缺失载体的构建和这三个基因的基因组缺失在WO2005095624中详细描述。WO2005095624中描述的载体pDEL-AMYA、pDEL-AMYBI和pDEL-AMYBII已如EP0635574Bl中所述根据"MARKER-GENEFREE"途径使用。上述步骤产生草酸盐缺陷、MARKER-GENEFREEAg/aA、Aam;/A、Aflw少BI禾卩AamyBII淀粉酶-阴性重组体A"&wCBS513.88菌株,其最终完全不含有外来的DNA序列。因此,与WT1相比,WT4针对a-淀粉酶表达更加优化。3丄2Am'ger摇瓶发酵如W099/32617的实施例"A摇瓶发酵"章节中所述,在20ml预培养基中预培养Aw&er菌株。过夜生长后,将10ml该培养物转移63至发酵培养基l(FMl)中进行a-淀粉酶发酵。在含100ml发酵液的500ml带盖三角瓶中于34'C和170rpm下将发酵进行指定的天数,一般如W099/32617中所述。该FM1培养基每升含有52.570g葡萄糖、8.5g麦芽糖、25g水解酪蛋白、12.5g酵母提取物、1gKH2P04、2gK2S04、0.5gMgS04-7H20、0.03gZnCl2、80.02gCaCl2、0.01gMnS04,4H20、0.3gFeS04.7H20、10mlPen-Strep(Invitrogen,cat.nr.10378-016)、48gMES,用4N112304调节至pH5.6。3丄3真菌a-淀粉酶活性为了测定Aw&w培养液中的ce-淀粉酶活性,根据供应商的方案,使用Megazyme谷物a-淀粉酶试剂盒(Megazyme,CERALPHAa-淀粉酶测试试剂盒,产品目录号K-CERA,2000-2001年)。测量的活性基于存在过量的葡萄糖淀粉酶和a-葡萄糖苷酶时非-还原-端封闭的p-硝基苯基麦芽七糖苷(p-nitrophenylmaltoheptaoside)的水解。形成的p-硝基苯酚的量是样品中存在的a-淀粉酶的度量。3.2针对编码Am'gwa-淀粉酶JmvA的野生型amv丑编码序列构建Aspergillus表达构建体野生型amyB的DNA序列已在2.2.1中描述过。为了Am'gw"m少B构建体在^^wgz7/w物种中的表达分析,使用基于pGBFIN的表达构建体将强amyB启动子用于a-淀粉酶在Am'gw中的过表达(如W099/32617中所述)。包括PamyB的ATG起始密码子的"mj^启动子的翻译起始序列为5,-GGCATTTATGATG-3,或5,-GAAGGCATTTATG-3,,取决于选择哪个ATG作为起始密码子。在下文产生的所有后来的"m;;5表达构建体中,PamyB的该翻译起始序列已经被修饰为5'-CACCGTCAAAATG-3,。在两端引入适当的限制性位点,以允许在表达载体中克隆。天然的"mj;B基因含有'TGA,终止密码子。在下文制造的所有flm少B构建体中,5'-TGA-3,翻译终止序列被替换为5'-TAAA-3',随后是限制性位点的5,-TTAATTAA-3,。在5'-端引入Wol位点,在3'-端引入P"cl位点。因64此,包含经修饰的基因组amyB启动子和amyBcDNA的片段被完全合成、克隆,并通过序列分析验证该序列。用J^ol和消化包含Of-淀粉酶启动子(其带有经修饰的翻译起始序列)和amyBcDNA序列(其带有经修饰的翻译终止序列)的片段,并将其引入经和尸ad消化的pGBFIN-12载体(如W099/32617中所述构建和设计)中,得到pGBFINFUA-l(图21)。通过序列分析验证引入的PCR片段的序列,并将其序列表示在SEQIDNO.4中。3.3改善用于在Am'g^中表达的a-淀粉酶编码序列amvB的单个密码子使用下文应用单个密码子使用优化来改善Am'g^amyB基因的密码子使用。天然amyB的核苷酸编码序列显示为SEQIDNO.2。天然Am'gwamyB基因和合成的经优化的变体的密码子使用在下表2.1中给出。对于天然的和经单个密码子优化的合成的"m少B基因而言,给出了每个密码子的精确数以及每个氨基酸的分布。另外,第三列提供了提出的最优分布,其为优化的目标。对第1组氨基酸而言,仅有一种可能性。第1组由总是由ATG编码的甲硫氨酸和总是由TGG编码的色氨酸组成。第2组氨基酸根据0%或100%的极端频率进行优化,该策略是清楚的。第2组AA的所有密码子被特异地改变为两种可能的密码子的最优变体。更明确地,对于半胱氨酸,将密码子TGT替换为TGC;对于苯丙氨酸,将TTT替换为TTC;对于组氨酸,将CAT替换为CAC;对于赖氨酸,将AAA替换为AAG;对于天冬酰胺,将AAT替换为AAC;对于谷氨酰胺,将CAA替换为CAG;对于酪氨酸,将TAT替换为TAC。第3组氨基酸可以由表3.1中指出的若干种密码子编码;根据以下的方法来优化以优选的密码子频率存在的每个密码子(对于丙氨酸为GCT,GCC,GCA,GCG;对于天冬氨酸为GAT,GAC;对于谷氨酸为GAA,GAG;对于甘氨酸为GGT,GGC,GGA,GGG;对于异亮氨酸为ATT,ATC,ATA;对于亮氨酸为TTA,TTG,CTT,CTC,CTA,CTG;对于脯氨酸为CCT,CCC,CCA,CCG;对于精氨酸为CGT,CGC,CGA,CGG,AGA,AGG;对于丝氨酸为TCT,TCC,TCA,TCG,AGT,AGC;对于苏氨酸为ACT,ACC,ACA,ACG;对于缬氨酸为GTT,GTC,GTA,GTG):对第3组氨基酸及其编码密码子而言,给定的编码序列中每个可能的密码子的最优出现率的计算是根据以下的方法进行的i.对各自第3组AA,求和给定序列中编码的残基总数,见低Al列(表3.1),ii.针对每个AA和编码该AA的密码子,用该AA的总数乘以表2.1中的最优密码子分布,得到粗密码子分布,这一般可含有小数(decimalnumber),见第A2列(表3.2),iii.通过去除小数位(digit)近似(roundoff)粗密码子分布(ii)的值,得到近似的密码子分布,见第A3列(表3.2),iv.针对每个AA,求和近似密码子分布(iii)中表示的AA总数,见第A4列(表3.1),v.通过从给定序列中编码的残基总数(i)减去近似的密码子分布(iv)中表示的AA总数,计算近似的密码子分布中每个各自AA的残基的总丢失数,见第A5列(表3.1),vi.针对每个密码子,通过减法计算粗密码子分布(ii)和近似密码子分布(iii)之间的小数差,见第A6列(表3.2),vii.针对每个密码子,用小数差(vi)乘以表1中的最优密码子分布,得到每个密码子的权重值,见第A7列(表3.2),viii.针对每个各自的AA,选择丢失的残基量(v),具有最高权重值(vii)的密码子的各自量,见第A8列(表3.2),k.编码多肽的给定序列中最终最优密码子分布的计算如下进行针对每个密码子将近似密码子分布(iii沐选择的丢失残基量(viii)相加,见第A9列(表3.2)。表3.1AA(i)IAlA4A5Ala142402Asp24241166Glu312111Gly443421lie28271Leu637352Pro722211Arg81091Ser937352Thr1040400Val1131292表3.2密码子A2A3A6A7A8A9Ala_GCT15.96150.960.365116Ala—GCC21.42210.420.014121Ala一GCA0000.00000Ala—GCG4.6240.620.0680AspGAT15.12150.120.043015Asp—GAC26.88260.880.563127Glu—GAA3.1230.120.0310Glu—GAG8.8880.880.65119Gly_GGT21.07210.070.034021Gly_GGC15.05150.050.018015Gly一GGA6.8860.880.14117Gly_GGG0000.00000lie—ATT7.5670.560.15107lie—ATC20.44200.440.321121lie—ATA0000.00000Leu—TTA0000.0000067Leu—TTG4,8140.810.10515Leu_CTT6.2960.290.04906Leu—CTC14.06140.060.023014Leu—CTA000O扁00Leu—CTG11.84110.840.269112Pro—CCT7.9270.920.33118Pro—CCC14.08140.080.051014Pro_CCA0000.00000Pro_CCG0000.00000Arg—CGT4.940.90.44115Arg_CGC5.10.10.0510Arg一CGA000O細00Arg—CGG000O扁00Arg一AGA0000,00000Arg—AGG000O細00Ser—TCT7.7770.770.16218Ser—TCC16.28160.280.123016Ser—TCA000O扁00Ser—TCG5.180.180.0250Ser—AGT0000.00000Ser—AGC7.7770.770.16218Thr—ACT12120O扁012Thr_ACC282800.000028Thr_ACA000O扁00Thr_ACG000O扁00Val—GTT8.3780.370.10008Val—GTC16.74160.740.400117Val一GTA000O細00ValGTG5.8950.890.16916随后,通过随机分布原始amyB肽中每个氨基酸的提出量的同义密码子(表2.1)创建全新的核苷酸编码序列。得自上述过程的合成的"mj;B序列在SEQIDNO.5中指出。使用CloneManager7程序(Sci.Ed.Central:Scientific&Educational软件,7.02版)针对有害二级结构可能的发生来检验经修饰的编码序列中的二级结构。3.4针对a-淀粉酶编码序列"mvB在Am'g^中的表达根据本发明的组合的单个密码子和密码子对方法优化编码序列本发明的方法被用于改进Am'ger的amyB基因的编码序列。得自实施例2所述过程的经优化的am_yB序列在SEQIDNO.6中指出。使用CloneManager7程序(Sci.Ed.Central:Scientific&Educational软件,7.02版)针对有害二级结构可能的发生来检验经修饰的编码序列中的二级结构。3.5构建用于表达由实施例3.2和3.3中所述编码序列编码的Am'gera-淀粉酶AmvB的经修饰的amvB表达载体pGBFINFUA-1(图2)的Wol-尸acl片段的DNA序列如SEQIDNO.4所示并包含amyB启动子和带有经修饰的翻译起始序列和经修饰的翻译终止序列的野生型amyBcDNA序列。如实施例1.2中所述的DNA序列表示为SEQIDNO.7,所述DNA序列包含ce-淀粉酶启动子的翻译起始序列的变体,其与编码a-淀粉酶的amyB基因的密码子优化的编码序列组合。如实施例3.3中所述的DNA序列表示为SEQIDNO.8,所述DNA序列包含a-淀粉酶启动子的翻译起始序列的变体,其与编码a-淀粉酶的amyB基因的优化的编码序列组合,所述优化根据本发明的组合的单个密码子和密码子对方法完成。为了在表达载体中克隆这些经修饰的序列变体,用^ol和Pad消化两个合成的基因片段,并将其引入经Wol和尸acl消化的pGBFINFUA-1载体(图21)的更大片段中,产生变体表达载体。检验正确片段的整合后,将变体表达构建体命名为pGBFINFUA-2和pGBFINFUA-3,如下文表3.3中所述。表3.3:用于在Am'gw中表达a-淀粉酶的经修饰的表达构建体质粒名称SEQID翻译起始序列编码序列翻译终止序列NO69<table>tableseeoriginaldocumentpage70</column></row><table>质粒pGBFINFUA-l到pGBFINFUA-3的amyB编码序列的翻译的序列是按照SEQIDNO3中所示的氨基酸序列,其代表野生型Ag;-淀粉酶。3.6Am'gwo;-淀粉酶的经修饰的pGBFINFUA-表达构建体在Am'gw中的表达如下所述和根据图22中所示策略通过转化将如上所述制备的pGBFINFUA-l、pGBFINFUA-2和pGBFINFUA-3表达构建体引入A中。为了在WT4中引入三个pGBFINFUA-l、-2和-3载体(表3.3),如W098/46772禾卩W099/32617中所述进行转化和随后的转化体选择。简言之,分离pGBFINFUA-构建体的线性DNA并用于转化Am'ger。在乙酰胺培养基上选择转化体并根据标准步骤纯化菌落。诊断菌落在glaA基因座上的整合,并使用PCR诊断其拷贝数。选择了pGBFINFUA-l、-2和-3构建体每一个的具有相似的评估拷贝数(低拷贝1-3)的十个独立转化体,并使用转化质粒的名称编号,例如分别为FUA-1-1(对于第一个pGBFINFUA-l转化体而言)和FUA-3-l(对于第一个pGBFINFUA-3转化体而言)。所选择的FUA-菌株和AWT4被用于在100ml培养基中和如上所述的条件下进行摇瓶实验。3和4天的发酵后,取样。测量所有三个不同的AFUA-转化体中o;-淀粉酶的生产。如从图23中可以看出,根据本发明方法的编码序列的优化显示与称作单个密码子优化的测试的其它方法相比对AmyB表达的更高改进。这些图已概括于下表3.4中。表3.4.带有野生型构建体的转化体与带有经修饰的amj;B编码序列的转化体相比的相对平均a-淀粉酶活性(如从图23中所概括的)。菌株类型SEQIDNO编码序列a-淀粉酶活性FUA醒14w.t.100%FUA画27单个密码子优化的200%FUA隱38根据本发明修饰的400%这些结果清楚地指出本发明的方法可应用于促进宿主中的蛋白质表达,尽管表达构建体和宿主已具有若干种其它优化,例如强启动子、改进的翻译起始序列、改进的翻译终止序列、最优的单个密码子使用和/或针对蛋白质表达改进的宿主。4.实施例4:设计用于三种异源酶在Sacz7/船物种^cz7/mw&"fo禾口丑flcz'〃Mamz7o//gMe&cZe/xy中表达的改进的DNA序歹ll4丄介绍实施例4描述了本专利所述的本发明方法的实验设计和应用,该方法用于两种物种(在该实施例中更特定地为3fl"7/wsw6"fo和Sacz'〃wa/m'/o/zXac/ms)中异源蛋白质的(改进的)表达。一个优选的表达宿主为5acz'〃M5aw//o/z々t/^/2i"'ms。5ac/〃ww&z'to基因组于1997年公布,其它5acz7/w物种随后公布(Kunst,F.a/.1997.ThecompletegenomesequenceoftheGram-positivebacterium5ac"/wsw6"/z's.Nature390:249-56;Rey,M.W."a/.(2004).Completegenomesequenceoftheindustrialbacterium5a"7/船/z'c/iemybnm'sandcomparisonswithcloselyrelated5ac編sspecies.GenomeBiology5:R77;RaskoD.A.etal.(2005》Genomicsofthe5a"'〃uscem/sgroupoforganisms.FEMSMicrobiologyReviews29:303-329)。在该实施例中,选择及w&Zfo的全序列作为计算单个密码子频率和密码子对权重的基础。对GC-含量和tRNA的比较提供了所述^"7/w物种的类似图谱(见上文)。这表示相同的统计可应用于其它相关的5^z7/w物种。另外,从实施例1(也见图4)已经清楚相关的物种显示相似的密码子对频率。71在图4(也见实施例1)中,可以找到基于及对及flm少/o/Z^^^de朋的全基因组统计的密码子对比较图表。观察到两个数据集合之间的良好相关。另外,似乎及amj/o/^i/e/a"'e似更加通用(versatile),因为有在及awz7o/^we/ac/e/w中被良好接受而在及jw6/z'fo中具有高度负值的密码子对组合的子集;但未观察到相反情况。4.2.实验设计选择三条蛋白质序列用于在5a"'〃M禾口5a"'〃M,z7。/—咖C&"S二者中表达蛋白质1:来自万acz7/wWeara/Aermc^Az7^的木糖(葡萄糖)异构酶xylA(EC.5.3丄5);蛋白质2:来自^/^tomj;cMo/z'voc/iramogewM的木糖(葡萄糖)异构酶xylA0EC.5.3丄5);蛋白质3:来自T7^r附o朋aera6a"er的L-阿拉伯糖异构酶(EC5.3丄4)。表4.1概述基因构建体;选择蛋白质2以进一步探究密码子更广义的概念o基因蛋白质单个密码子优化单个密码子&正密码子对优化单个密码子&负密码子对优化蛋白质lSEQIDNO.9SEQIDNO.16SEQIDNO.13蛋白质2SEQIDNO.10SEQIDNO.17SEQIDNO.14SEQIDNO.18蛋白质3SEQIDNO.11SEQIDNO.12SEQIDNO.15表4.1提供了应用于上文所述3个基因的方法。对蛋白质l、蛋白质2和蛋白质3而言,除了之前开发的单个密码子优化外还应用本发明方法的密码子对优化。作为对照,通过包含蛋白质2的2个额外构建体实验,来检验单个密码子优化和负密码子对优化的作用。一个变体(SEQ.ID.18)被设计为其朝向不良密码子对被"优化"(即负密码子对优化),第二变体被设计为仅进行单个密码子优化(SEQ.ID.17)。选择蛋白质2是因为&r印tom;;c^物种限制高度差异的密码子对偏向性,见实施例1和图4。72所有设计的及a附j/o/we/acz'e朋基因避免MM(CATATG)和5awHI(GGATTC)限制性位点的发生。另外,它们含有去除克隆载体pBHA12的五."//部分的单个限制性位点。4.3.单个密码子优化蛋白质1和蛋白质2的单个密码子优化的变体根据实施例3.3中所述用于单个密码子优化的方法设计,分别得到SEQIDNO.16和SEQIDNO.17。应用的单个密码子分布表(表4.2)使用50个最高表达的基因测定,所述最高表达的基因使用及^M/m168的24AffymetrixGeneChips(其使用6个独立的发酵时间系列)测定。所有的GeneChips根据其算术平均值被标准化。表达列表排除了下述基因其在菌株工程操作中被有意地过表达,因此它们测量出的表达水平不能与其密码子使用相关联。单个密码子分布表4.2的测定通过视觉检査50、100、200、400个最高表达的序列和所有及wto'fc序列的密码子频率直方图完成。在清楚趋向于0%或100%的最高表达基因的情况下,分别进行0%和100%的指定。对于未指定的其它密码子而言,计算平均使用并通过省略指定的密码子对同义密码子集合进行标准化。得到的目标单个密码子频率在表4.2第3列中给出。表4.2合成基因设计的密码子使用分布,其基于50个最高表达的基因和单个密码子使用直方图例如图24的视觉检查;在密码子对优化期间可以应用不关注(Don'tcare)的项目,以允许这些密码子的选择自由化,从而不考虑这些密码子的单个密码子优化。单个密码子分布不关注=0%关注=1AAlaGCT500Ala—GCC01Ala一GCA500AlaGCG01CCysTGT510Cys—TGC490DAspGAT631AspGAC371EGluGAA100173GluGAG01FPheTTT550Phe—TTC450GGlyGGT311GlyGGC341GlyGGA351GlyGGG01HHisCAT710HisCAC290IlieATT600He—ATC400lie—ATA01KLys_AAA1001Lys_AAG01LeuTTA390LeuTTG240LeuCTT370Leu_CTC01LeuCTA01Ixu—CTG01MMetATG1001NAsnAAT450Asn—AAC550PProCCT350ProCCC01ProCCA220Pro—CCG430QGinCAA1001GinCAG01RArgCGT380ArgCGC340ArgCGA01ArgCGG01Arg—AGA280Arg—AGG01SSerTCT340SerTCC01SerTCA340Ser一TCG01SerAGT01SerAGC320TThrACT330Thr一ACC01Thr一ACA460Thr—ACG221VValGTT471Val一GTC01Val—GTA231ValGTG301WTrp—TGG1001YTyrTAT620TyrTAC380终止—TGA01终止—TAG01终止—TAA10014.4.密码子对优化根据本发明的方法进行密码子对优化。优化的编码核苷酸序列SEQIDNO.13-15运行所述的软件方法的结果。应用的参数为种群大小=200;迭代数=1000;=0.20、CPW矩阵="表C.4.CPW:Bacz'〃ww&Zfo-高表达的序列"和CR矩阵-"表B.l第5歹!j:CR表BAS:5acz7/wwMfo-高表达的序列"(也在表4.2中)和如表4.2中的"不关注"元素。另外,对A/^el(CATATG)和BamHI(GGATTC)限制性位点的每次发生,对y^。i加上+l的罚分值。经优化的编码核苷酸序列SEQIDNO.18是运行所述软件方法的结果。应用的参数为种群大小=200;迭代数=1000;—-0.20、CPW矩阵=-1倍的"表C.4.CPW:5"cz7/m^&z'fo-高表达的序列"(用于获得朝向不良密码子对的密码子对优化)和CR矩阵="表B.l第5列CR表BAS:Sa"7/wswMfo-高表达的序列"(也在表4.2中)和如表4.2中的"不关注"元素。另外,对A^I(CATATG)和5amHI(GGATTC)限制性位点的每次发生,对y^。油.加上+l的罚分值。对不显示密码子偏向性的这些密码子选择表4.2中的"不关注"元素。这通过视觉检査单个密码子偏向性图完成,见4.3。这类元素的使用对优化的密码子对部分提供了额外的自由度。所有的优化朝向y^。油的最小值集中。表4.2中与获得的SEQIDNO.11、SEQIDNO.16和SEQIDNO.17的目标值一起,给出获得的SEQIDNO.13-15和SEQIDN018的目标值。从该数据了解到对SEQIDNO.16和SEQIDNO.17而言与SEQIDNO.14和SEQIDNO.15相比单个密码子统计是高度类似的。然而,本发明的方法导致下述基因,所述基因具有增加的密码子对(其具有相关负权重)数量,指出具有更多正权重与之相关的密码子对的更加最优的使用,见表4.3。使用y^p最大值化的"优化"导致下述基因,所述基因具有增加的密码子对(其具有相关负权重)数量,指出具有更多正权重与之相关的密码子对的提高的使用,因此预期对翻译特征的不良影响。对SEQIDNO.18(Mv(g)SO)为24%而对SEQIDNO.14为85%,且^一也从1.20提高至-1.43。表4.3密码子优化;在及w6"'fo和5.amy/o/z々i/e/ac/e似中表达的基因的目标适合度值。序列类型SO(—=0.2)SEQIDNO.11WT0.0780.09741.1%0.350SEQIDNO.13sc+cp优化的0.004-0.29389.1%-1.439SEQIDNO.14sc+cp优化的0据-0.29284.8%-1.431SEQIDNO.15sc+cp优化的0扁-0.30389.2%-1.493SEQIDNO.16sc优化的0.002-0.02356.9%-0.114SEQIDNO.17sc优化的0扁0.08744.3%0.428SEQIDNO.18sc+负cp优化的0.0150.25723.5%1.1965.实施例5:针对三种异源酶在^zcz7/mw6"fo和Sfld〃t^awvfo/Z卵e&cZg似中的表达检验本发明的方法。5.1介绍76实施例5描述了具有这些序列变体的3个异源基因在B""7/m和^"7/mflm,7o/zXflc/ms宿主细胞二者中表达的实验和结果。变体根据本发明的方法制造,如实施例4中所述。5.2材料和方法5.2.1万a"7/船生长培养基2*TY(每L):胰蛋白胨16g、酵母提取物Difco10g、NaCl5g。5.2.2及sWz'fo的转化培养基2c&z'zz'zgw培养基28gK2HP04;12gKH2P04;4g(NH4)2S04;2.3g柠檬酸三钠.21120;0.4gMgS04.7H20;加H20至900ml并用4NNaOH调节至pH7.0-7.4。加仏0至l升。在120°C高压灭菌20分钟。1x5WzZzgw画//t^培养基向50ml2xSpizizen培养基中添加50mlmilliQ;lml50。/。葡萄糖和lOO)il水解酪蛋白氨基酸(20叫/ml终浓度)。来自非选择性2xTY琼脂平板的单个S""7/m菌落(或来自深层冷冻管中的等分式样)被接种在100ml摇瓶中的10ml2xTY发酵液中。细胞于37。C和士250rpm下在培养箱摇床中培养过夜。在600nm处测量OD值,并用lxSpizizen-plus培养基将培养物稀释至OD,i).l。在37。C和250-300rpm下培养细胞,直到培养物的006()。为0.4-0.6。用补充有0.5%葡萄糖的lxSpizizen培养基(饥饿培养基)对培养物进行l:l稀释,并将其在37。C和250-300rpm下孵育90分钟。在桌上离心机中于4500rpm下将培养物离心10分钟。去除90%的上清液并将沉淀物悬浮在剩余的体积中。将DNA(最大20nl体积中l-5pg)与0.5ml感受态细胞在万能管中混合,并在旋转摇动水浴中在37'C于稳定的摇动(j/6)下孵育l小时。将细胞涂布(20到200^1)在含25卞g/ml卡那霉素的选择性2xTY琼脂平板上并在37'C孵育过夜。5.2.3制备无细胞提取物将得自lml培养物的沉淀物重悬于含10mMThris-HCl(pH7.5)、10mMEDTA、F50mMNaCl、lmg/ml溶菌酶和蛋白酶抑制剂(无EDTA的77蛋白酶抑制剂完全混合物,Roche)的缓冲液A中。将重悬的沉淀物在37。C孵育30分钟,以进行原生质体化,并随后如下进行超声处理30秒,IO微米振幅(3个循环),循环之间冷却15秒。超声处理后通过离心(10分钟,4'C下13000rpm)将细胞碎片离心,澄清的溶胞产物被用于进一步分析。5.2.4选择葡萄糖异构酶和L-阿拉伯糖异构酶编码基因,并设计用于在Bflc/〃Ma,/o/z々"g/^c/ms1和丑acZ〃t/51s"6"fe中表达的合成基因所选择的三个酶为1.Sac/〃MWeara/Aermop/n7^木糖异构酶(P54272Swissprot);蛋白质序列SEQIDNO.9,2.S&e/to附jcaso/z'voc/iramogew&s木糖异构酶(P15587Swissprot);蛋白质SEQIDNO.10,3.77^厂附0朋^尸0^"^附^/^^"1^-阿拉伯糖异构酶(八1582623.1EMBL:以及US2003/012971A1),蛋白质SEQIDNO.11,核苷酸SEQIDNO.12。如从上文中看出的,所选择的酶具有不同的微生物来源。为了在5ac/〃iww6rifc或5acz'〃wam;;/o/z々t/e/acz'era中过量生产这些酶的目的,我们已经用使得其适合于在^^7/^物种中表达的方式优化了每个蛋白质的核苷酸序列,见实施例4。我们已优化了编码上述酶的核苷酸序列。序列在序列表中以SEQIDNO.13.(Ba"'〃wsWeara//iermo;/n7i葡萄糖(木糖)异构酶)、SEQIDNO.14.(5^e//o附jcaso"voc/^omogewes葡萄糖(木糖)异构酶)、SEQIDN0.15列出。作为对照,产生了具有单个密码子优化而无密码子对优化的一个变体SEQIDNO.16-17,和具有单个密码子优化与"负密码子对优化"的一个变体SEQIDNO.18,见实施例4和表4.1。5.3在五.co/,/gflc///^穿梭载体中克隆葡萄糖异构酶和L-阿拉伯糖异构酶编码基因并转化进^^'肌78为了在5acz犯中表达选择的基因,我们使用了pBHA12E.co///Bac7'〃i穿梭载体(图26)。该载体主要来自于表达载体pBHA-l(EP340878),其中来自5flcZ〃Mam;;/o/z々"e/acz'e朋的am少Q基因的启动子替换了启动子。pBHA12质粒含有两个多克隆位点(图26)。所有选择的和优化的基因被合成制造(DNA2.0,MenloPark,CA,U.S.A.)为两个片段(A和B)。对应于基因5'端的A片段被克隆在awj;Q启动子之后。两个片段均用特定的限制性内切核酸酶位点延长,从而允许在多克隆位点1和2中的直接克隆(见图27)。片段A的3'端和片段B的5'端通过特有的限制性内切核酸酶位点重叠,所述位点允许将载体的Eo^部分切除并在转化万a"7/w犯6rifo之前连接回去(CBS363.94)。在克隆和转化及犯6"fo的步骤期间,使用五.co"作为中间宿主。选择pBHA12中的两步克隆途径,从而避免在五.co/Z中克隆和繁殖表达载体时可能的问题。在表5.1中列出了添加至片段A和B的限制性酶识别位点,以及允许回复连接(backligation)和原样再建完整功能基因的特有限制性位点。所有的A片段5'端含有M/el位点(识别序列CATATG),其允许将基因作为片段克隆,所述片段精确地在其各自的起始密码子(ATG)处起始。表5丄对已被添加至基因片段以协助在pBHA12中克隆的限制性内切核酸酶(RE)克隆位点的概述。__基因/RE片段A5,顿3,顿片段Br,一山,,一山5顺3顿特有的RE位点(基因中的位置)5.WearaAe/7wop/n7t^GI扁I争I(496bp)S.o/z'voc/iramogewesGIIMM一IC/fll(372bp)71.ma/Zr纖',ARAA扁IMM一IC/al(708bp)已使用标准的分子生物学方法(Sambrook&Russell,Mo/eoJwC/om'wg:爿丄aZora/oryM朋認/,3n/CSHLPress,ColdSpringHarbor,NY,2001;andAusubel"a/.,Cw/re"/尸ratoco/sz'wMo/ecw/arWo/ogy,WileyInterScience,NY,1995)将5个基因的A和B片段在两个步骤中分别克隆进MCS1和2,如图27中的SEQIDNO.13所示。转化在Aco//TOP10(Invitrogen)中进行,或者在另一步骤中使用甲基化敏感的限制性内切核酸酶的情况下,转79化在INV110(Invitrogen)中进行。使用小量或中量质粒分离试剂盒(分别来自Macherey-Nagel禾卩Sigma),分离针对每个表达构建体的若干个五.氨苄青霉素抗性转化体。通过限制性分析证实pBHA12载体中相应的A和B片段的正确连接。在下一步中,用特有的限制性内切核酸酶(见表5.1)消化含有基因的A和B片段的pBHA12质粒,从而将载体的五.co//部分切除。使用凝胶提取试剂盒(Macherey-Nagel)从琼脂糖凝胶上分离含有断裂基因的载体的^""7/m部分,并回复连接。通过感受态细胞转化将连接混合物转化至及w^'fcCBS363.94菌株。使用小量或中量质粒分离试剂盒(分别来自Macherey-Nagel和Sigma)分离针对每个表达构建体的若干个及^Z^fo卡那霉素抗性转化体。通过限制性分析针对下述内容检验表达构建体切除五.co/Z部分后正确的模式和pBHA12载体的Sfl"7/w部分的回复连接。对于每个构建体而言,选择三个及^Mfo转化体用于分析无细胞的提取物。5.4检测在丑acz7/z'中过量生产的酶使用每个构建体的三个及s^rito转化体和三个及amy/o/^we/fldew转化体分析无细胞提取物中相应蛋白质(葡萄糖或L-阿拉伯糖异构酶)的存在。使用2xTY发酵培养基培养菌株。在(摇瓶中)发酵24小时时取样(lml)并制备无细胞提取物,其中提取缓冲液中含有蛋白酶抑制剂。在SDS-PAGE(Invitrogen)上分析13/^无细胞提取物。针对若干个转化体检测对应于过表达的蛋白质的预期Mw的清楚条带。条带的视觉比较在表5.2中给出。显然,本发明的方法通过使用密码子对方法改进了Sad//^他araAermo//n7ws木糖异构酶、*S/rep/o,ceso/ZvocAramogewes木糖异构酶和7Terwo朋am^fl"ermaAra""L-阿拉伯糖异构酶的蛋白质生产,即这导致了与WT对照基因或单个密码子优化的变体之任一相比改进的蛋白质生产。另外,如果与单个密码子优化一起应用负密码子对优化,则未检测到任何产物。表5.2:三个异源基因在万a"7/z'中的过表达。WT:野生型;SC:单个密码子优化;cp:密码子对优化;cp、负密码子对优化。_及SMMfo80<table>tableseeoriginaldocumentpage81</column></row><table>参考文献Boycheva,S.,Chkodrov,G.&Ivanov,I.(2003).CodonpairsinthegenomeofEscherichiacoli.Bioinformatics19(8》987-998Gurvich,O丄.,Baranov,P.V.,Gesteland,R.F.,Atkins,J.F.(2005).ExpressionlevelsinfluenceribosomalframeshiftingatthetandemrareargininecodonsAGG—AGGandAGA—AGA.J.Bacteriol.187:4023-4032.Gustafsson,C.,Govindarajan,S.&Minshull,J.(2004).Codonbiasandheterologousproteinexpression.TrendsBiotechnol.22(7):346-353Gutman,G.A.&Hatfield,G.W.(1989).NonrandomutilizationofcodonpairsinEscherichiacoli.PNAS86:3699-3703Gygi,S.P.,Rochon,Y.,Franza,B.R.,&Aebersold,R.(1999).CorrelationbetweenproteinandmRNAabundanceinYeast.Mol.Cel.Biol.19(3):1720-30Hatfield,G.W.&Gutman,G.A.(1992).Codonpairutilization.UnitedStatesPatentNo5,082,767Irwin,B.,Heck,D.&Hatfield,G.W.(1995).Codonpairutilizationbiasesinfluencetranslationalelongationsteptimes.JBiolChem270:22801-22806Karlinetal.(2001).Characterizationofhighlyexpressedgenesoffourfast-growingbacteria.J.ofBacteriology183(17):5025-39.Kunst,F.etal.(1997).ThecompletegenomesequenceoftheGram-positivebacteriumBacillussubtilis.Nature390:249-256Lithwick,G.&Margalit,H.(2003).Hierarchyofsequence-dependentfeaturesassociatedwithprokaryotictranslation.GenomeRes.13(12):2665-73.Makrides,S.C.(1996).Strategiesforachievinghigh-levelexpressionofgenesinEscherichiacoli.Microbiol.Rev.60:512-538Moura,G.etal.(2005).ComparativecontextanalysisofcodonpairsonanORFeomescale.GenomeBiology2005,6:R28Nevalainen,K.M.H.,Te,o,V.S丄&Bergquist,P丄.(2005).Heterologousproteinexpressioninfilamentousflmgi.TrendsBiotechnol.200523(9):468-47482Pel,H丄,etal.(2007).GenomesequencingandanalysisoftheversatilecellfactoryAy/erg"/^m'gerCBS513.88.NatBiotech.200721(2):221-231Punt,P丄,vanBiezen,N.,Conesa,A.,Albers,A.,Mangnus,J.&vandenHondel,C.(2005).FilamentousfUngiascellfactoriesforheterologousproteinproduction.TrendsBiotechnol.20(5):200-206Rocha,E.P.C.,A.DanchinandA.Viari(1999》TranslationinBacillussubtilis:rolesandtrendsofinitiationandtermination,insightsfromagenomeanalysis.NAR,27(17):3567画76.Boycheva,S.,Chkodrov,G.&Ivanov,I.(2003).CodonpairsinthegenomeofEscherichiacoli.Bioinformatics19(8):987-998Schwartz,S.&Curran,J.F.(1997》AnalysesofframeshiftingatUUU-pyrimidinesites.NAR25(10):2005匿2011Spanjaard,R.A.&vanDuin,J.(1988》TranslationofthesequenceAGG-AGGyields50%ribosomalframeshift.PNAS85:7967-7971附录l:符号和等式列表单个密码子编码相同氨基酸的密码子,(c,)密码子c,.的出现数密码子q的比例(与其同义的相比)密码子对(c',。)密码子对的出现率(观察到的数量)"血((c,,c,))该密码子对的预期数量":((c,,。X"(c,;Kf(。).z"。J(,0)相应的标准差((c,,。))—"邵((c,,c》H1-《"(c,)《"(。))相应的标准评分(z-评分)〃、、H。))-((c"。))'",,c,))密码子对的偏向性系数L.,,、、"。6,((C,,S))—"邵((C,,。))6zos((c,.,c,))=-^-^-max("血((c,.,。.)),"邵((c,.,。》)组合的"预期"值(对权重而言)《;6'((c,,cy))=od).Z"r((q,c,)〕密码子对权重-方法,一个序列组(或基因组)84龟》=,,》max^t((c,.,。)),《((c,,。)))密码子对权重-方法,高表达的组与参考组(或基因组)c)卜o(c,,。))-"0r((c,,。.))85附录2:CR载体表B.l:按列表示的以下生物的CR矩阵值(1)AN:Am'g^全基因组-方法统计分布;(2)ANS:Am'g^250个高表达基因-方法目检,(3)AN—d:Am'ge/"关注-不关注(O-l)载体;(4)BS:及w6rifc全基因组-方法统计分布;(5)BSS:及50个高表达基因-方法,目检,(6)BS—d:及关注-不关注(O-l)载体;(7)EC:五.co//全基因组4298seq;-方法统计分布;(8)ECS五.co//来自Carboneetal.(2003)的高表达组100seq-方法目检;(9)EC—d:五.co"关注-不关注(0-1)载体;(10)BA:及"mj/o/z々we/adms全基因组-方法统计分布;(11)BAS:及flm少o〃z々t/e/fl"'eAw50个高表达的基因-方法目检;(12)BS—d:及"w少o〃/《Me/ac/e"j关注-不关注(0-l)载体;(13)SC:&cereWWae全基因组-方法统计分布;(14)SCS:cewv&'ae200个高表达的基因-方法目检,(15)SC—d:5".cem^z'ae关注-不关注(0-l)载体;(16)SCO:&coe//co/oM3(2)全基因组-方法统计分布。注意对于真菌微生物(更特定的P.c/>sogemw7、AOryzae、A^re"51、X.w.c/w/"ra、爿.^m/ga加、Zreesez'、iV",/^c/en'0而言,使用J.m'gw序列衍生的CR载体适用。对于一般酵母(更特定的^cfc和;cw26e)而言,使用cem^'ae序列衍生的CR载体适用。对于^r印tomyces物种而言,使用&coe//co/orA3(2)的CR载体适用。1ANANSAN_d4BSBSS6BS一dEC8ECS9EC—d10BA11BAS12BA—d1AAA3301711001758106910012AAC58100143550731000665003AAG67100129012519031014AAT4201574506600765005ACA2101414601400293006ACC3570116014257121017ACG2201262212600384008ACT233011733018431133009AGA1301272805001420010AGC1821122320272812230011AGG1201100130070112AGT13011101160090113ATA14011401900150114ATC527313540041721424008615ATG100100110010011001000100100116ATT3427151600512814460017CAA4001531001361704260018CAC51100132290428104140019CAG6010014701648305840020CAT490168710581905960021CCA22012022020140110122CCC2964110011300170123CCG240141430498605260024CCT253612935017002040025CGA15011001700120126CGC2551119340383403240027CGG180115011000200128CGT1649118380366601540029CTA100150140040130CTC2438111011000170131CTG2532123014710002820032CTT171712337011002130033GAA42261691001698006640034GAC4964136371376414440035GAG587413101312003460036GAT5136164631633615660037GCA210129500223001830038GCC3251120012700260139GCG211112501342303830040GCT2638126500174701840041GGA241613235112002630042GGC3235133341394214140043GGG190116011500170144GGT2549119311345811630045GTA110121231162901625046GTC35541240121003225047GTG3019126301361902825048GTT2427129471265302325049TAA27100162100159100041100150TAC58100134380427615950051TAG310122013200210152TAT420166620582411650053TCA13012434013001630054TCC23441120114311160155TCG17141100114006140056TCT17211213401541110030057TGA420116019003920058TGC591001544905510001730059TGG100100110010011001000410160TGT4101465104500140161TTA601213901400590162TTC65100130450417716940063TTG1813116240130066100164TTT3501705505923131600ANANSAN—dBABASBA_dECECSEC_dBSBSSBS一d12346"789101112表B.l继续13141516171819202122232487SCSCSSC—dSCO1AAA5921152AAC40751%3AAG41791954AAT6025145ACA316126ACC21401657ACG1421318ACT3452129AGA47761110AGC11312511AGG2131412AGT1641313ATA2801214ATC264819615ATG100100110016ATT46581217CAA69901518CAC365919319CAG311019520CAT64411721CCA41741222CCC16514123CCG13015424CCT31231225CGA701326CGC6114727CGG4013928CGT14251629CTA1491030CTC6013631CTG11516032CTT1331233GAA708511534GAC355119535GAG301518536GAT65491537GCA3021438GCC223315839GCG11013640GCT37641241GGA2301742GGC20816443GGG12111944GGT459511045GTA2201346GTC203915547GTG19814048GTT39541249TAA-1001-50TAC437419551TAG-01-52TAT57261553TCA2181254TCC16321418855TCG10512856TCT26481157TGA-01-58TGC381319159TGG100100110060TGT62871961TTA28211062TTC417119863TTG28621264TTT592912SCSCS1SCO131415161718192021222324附录3:CPW矩阵表C丄CPW矩阵Azergz7^m'ger全基因组(左侧密码子在第2列中指出,右侧密码子在第2行中指出)。宿主细胞A序列数据全Am'ger基因组。123456789101112AAAAACAAGAATACAACCACGACTAGAAGCAGGAGT1AAA'0.620.370.380.34-0.160.070.180.29-0.19-0.03-0.140.112AAC-0.17-0.28-0.23-0.050.08-0.25-0.010.030.02-0.300.17-0.043AAG-0.02-0.20-0.24-0.05-0.02-0.04-0.120.03-0.14-0.05-0.18-0.034AAT0.240.450.440.220.160.160.150.060.19o.n0.230.085ACA0.000.090.250.04-0.250.040.140.20-0.25-0.01-0.220.106ACC-0.17-0.35-0.28-0.070.11-0.300.170.01-0.08-0.220.07-0.027ACG-0.180.01-0.06-0.050.000.21-0.140.10-0.250.15-0.360.008ACT0.360.570.530.400.130.170.17-0.220.030.270.150.209AGA-0.010.050.06-0.08-0.180.190.100.11-0.37-0.17-0.30-0.1510AGC-0.28-0.26-0.26-0.250.00-0.060.040.03-0.20-0.31-0.04-0.2111AGG-0.310.08-0.22-0.32-0.170.31-0.250.01-0.390.10-0.52-0.1812AGT0.210.350.470.170.240.370.280.130.210.280.270.0513ATA0.060.250.380.16-0.210.030.080.04-0.100.07-0.070.1714ATC-0.27-0.35-0.31-0.150.03-0.30-0.03-0.040.14-0.100.200.0215ATG0.02-0.06-0.010.090.050.01-0.100.04-0.13-0.14-0.080.0016ATT0.500.550.560.450.340.260.260.160.410.440.460.3817CAA0.270.210.250.10-0.15-0.010.140.20-0.07-0.090.050.0318CAC-0.29-0.25-0.26-0.18-0.05-0.220.01-0.030.11-0.250.32-0.0919CAG-0.17-0.08-0.13-0.11-0.080.09-0.09-0.06-0.08-0.010.02-0.0120CAT0.230.440.460.090.080.180.150.040.420.290.500.2221CCA0.100.160.240.01-0.240.010.120.02-0.120.13-0.110.1722CCC-0.28-0.36-0.37-0.150.09-0.150.03-0.04-0.12-0.170.05-0.0623CCG-0.140.070.06-0.09-0.120.08-0.140.00-0.040.32-0.060.1424CCT0.380.460.480.260.090.180.16-0.020.310.380.330.2325CGA0.170.190.280.12-0.190.060.160.17-0.03-0.140.010.0326CGC-0.24-0.25-0.26-0.22-0.07-0.16-0.160.030.01-0.320.18-0.2727CGG-0.220.130.01-0.15-0.260.15-0.31-0.11-0.240.07-0.19-0.1828CGT0.510.450.670.480.290.240.320.170.630.440.580.4029CTA0.240.260.430.25-0.030.030.330.270.240.120.180.2530CTC-0.23-0.30-0.20-0.110.03-0.240.09-0.020.29-0.170.33-0.0631CTG-0.16-0.12-0.130.040.120.070.020.090.120.010.040.0932CTT0.540.520.640.480.190.230.260.110.560.440.550.3833GAA0.460.270.240.09-0.090.060.090.17-0.23-0.16-0.14-0.1234GAC-0.18-0.21-0.28-0.190.01-0.13-0.030.000.01-0.310.05-0.2335GAG-0.07-0.07-0.23-0.19-0.050.10-0.22-0.03-0.19-0.09-0.31-0.2036GAT0.240.340.360.090.060.120.02-0.010.200.190.270.0137GCA0.080.090.150.00-0.160.070.050.04-0.190.07-0.230.0638GCC-0.28-0.36-0.28-0.230.10-0.11-0.04-0.05-0.05-0.25-0.01-0.2639GCG-0.070.11-0.06-0.08-0.160.15-0.260.00-0.180.28-0.310.0540GCT0.380.600.480.380.160.210.10-0.120.180.310.220.1341GGA0.03-0.09-0.11-0.19-0.210.07-0.09-0.05-0.37-0.20-0.30-0.2842GGC-0.12-0.13-0.19-0.04-0.10-0.12-0.11-0.040.10-0.250.15-0.2743GGG-0.330.21-0.17-0.29-0.150.34-0.26-0.02-0.310.20-0.52-0.2544GGT0.340.310.650.320.240.170.380.030.310.210.45-0.0245GTA0.220.300.380.09-0.130.090.080.120.180.380.070.3946GTC-0.20-0.34-0.33-0.240.14-0.180.00-0.180.18-0.190.15-0.1147GTG-0.010.03-0.060.06-0.010.09-0.22-0.120.030.16-0.270.2448GTT0.530.550.500.380.350.310.13-0.030.430.370.240.2849TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC-0.18-0.29-0.24-0.060.13-0.230.000.010.15-0.200.34-0.0651TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT0.330.440.410.240.180.140.050.050.320.330.310.2153TCA0.010.220.29-0.03-0.330.020.05-0.09-0.180.11-0.270.1454TCC-0.21-0.30-0.30-0.070.07-0.210.100.00-0.01-0.190.080.1355TCG-0.140.06-0.02-0.11-0.090.00-0.20-0.09-0.070.21-0.210.1056TCT0.440.550.560.370.030.150.17-0.130.220.350.300.3757TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC-0.26-0.19-0.25-0.23-0.10-0.10-0.15-0.07-0.09-0.10-0.04-0.2659TGG-0.090.020.05-0.03-0.030.15-0.15-0.04-0.28-0.01-0.29-0.2260TGT0.270.430.610.310.160.250.140.080.390.470.370.3261TTA0.310.390.410.20-0.050.130.110.090.040.24-0.010.1362TTC-0.18-0.30-0.27-0.080.14-0.240.05-0.060.19-0.090.200.0263TTG-0.17-0.12-0.26-0.24-0.17-0.08-0.34-0.29-0.080.08-0.35-0.1264TTT0.540.580.610.460.250.190.090.080.400.410.290.29AAAAACAAGAATACAACCACGACTAGAAGCAGGAGT146"789101112表C.l继续131415161718192021222324ATAATCATGATTCAACACCAGCATCCAcccCCGCCT1AAA-0.140.210.190.31-0.14-0.070.16-0.08-0.11-0.040.050.192AAC0.24-0.17-0.090.08-0.02-0.18-0.110.100.05-0.19-0.050.083AAG-0.04-0.13-0.08-0.010.060.00-0.060.080.03-0.090.000.054AAT0.060.080.140.040.060.160.150.020.070.050.100.05ACA-0.110.200.040.20-0.070.010.050.00-0.22-0.020.000.056ACC0.11-0.29-0.14-0.070.09-0.120.080.200.190.020.110.217ACG-0.160.12-0.050.09-0.06-0.02-0.19-0.07-0.030.10-0.150.048ACT0.120.180.260.09-0.01-0.010.04-0.02-0.02-0.16-0.07-0.199AGA-0.310.190.060.09-0.14-0.05-0.05-0.15-0.100.150.100.1410AGC-0.08-0.09-0.20-0.090.00-0.05-0.09-0.120.130.220.130.1611AGG-0.440.19-0.09-0.110.050.200.100.02-0.150.27-0.050.0112AGT-0.200.030.11-0.02-0.040.110.07-0.120.040.07-0.050.0213ATA-0.340.170.100.21-0.140.110.07-0.07-0.20-0.08-0.02-0.0914ATC0.21-0.25-0.190.000.04-0.18-0.020.170.16-0.090.210.1915ATG-0.090.010.000.030.04-0.04-0.030.04-0.010.01-0.020.0216ATT0.240.190.320.16-0.030.070.03-0.010.03-0.19-0.03-0.1617CAA-0.210.140.150.25-0.160.020.22-0.03-0.150.020.100.1418CAC0.21-0.16-0.13-0.02-0.14-0.180.000.140.07-0.010.060.1519CAG-0.09-0.07-0.09-0.070.010.04-0.06-0.04-0.110.040.00-0.0120CAT-0.040.160.150.01-0.060.140.15-0.05-0.07-0.04-0.03-0.1121CCA-0.020.240.090.08-0.050.110.210.08-0.300.08-0.030.1022CCC-0.06-0.29-0.20-0.120.060.160.020.150.190.560.060.2923CCG-0.140.09-0.040.05-0.080.02-0.07-0.10-0.080.18-0.190.0124CCT0.070.200.260.08-0.12-0.170.01-0.19-0.23-0.17-0.20-0.2525CGA0.070.280.180.25-0.15-0.11-0.08-0.04-0.270.05-0.23-0.0326CGC-0.01-0.23-0.20-0.140.070.07-0.04-0.070.230.200.230.3127CGG-0.330.18-0.06-0.030.070.300.170.05-0.230.12-0.14-0.0828CGT0.110.010.310.19-0.08-0.100.02-0.09-0.13-0.22-0.20-0.2029CTA0.030.250.240.32-0.19-0.15-0.04-0.10-0.26-0.08-0.05-0.0630CTC0.06-0.22-0.08-0.030.06-0.030.170.230.14-0.120.310.1331CTG0.050.00-0.020.060.02-0.06-0.120.050.150.090.150.0932CTT0.210.250.390.20-0.12-0.130.04-0.18-0.23-0.34-0.04-0.3333GAA-0.070.110.130.14-0.050.070.170.000.000.090.100.1634GAC0.100.00-0.10-0.03-0.04-0.15-0.12-0.030.16-0.12-0.02-0.0135GAG-0.05-0.01-0.09-0.140.010.03-0.09-0.07-0.090.02-0.09-0.0936GAT-0.010.060.11-0.080.100.220.09-0.020.10-0.02-0.040.0237GCA0.090.300.180.170.040.100.07-0.02-0.180.10-0.10-0.0238GCC0.13-0.27-0.24-0.280.210.000.040.140.280.200.150.2039GCG0.010.230.020.08-0.100.02-0.24-0.23-0.090.06-0.28-0.0740GCT0.120.180.24-0.070.040.04-0.02-0.060.04-0.13-0.01-0.2641GGA-0.110.210.020.070.000.17-0.01-0.01-0.080.16-0.050.0442GGC0.12-0.04-0.14-0.10-0.05-0.15-0.14-0.120.160.070.090.0043GGG-0.300.26-0.08-0.150.160.330.06-0.04-0.150.01-0.20-0.1844GGT0.15-0.130.26-0.030.040.060.11-0.020.11-0.020.06-0.1545GTA-0.040.300.220.20-0.17-0.02-0.04-0.16-0.29-0.04-0.11-0.2146GTC0.13-0.22-0.18-0.230.23-0.020.100.190.330.110.270.1547GTG0.020.19-0.01-0.060.010.03-0.21-0.080.140.10-0.15-0.0748GTT0.140.220.250.000.010.000.04-0.08-0.05-0.21-0.02-0.3249TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC0.18-0.17-0.110.040.02-0.15-0.050.130.17-0.020.170.1551TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT0.080.130.180.05-0.01o.n0.06-0.04-0.06-0.15-0.15-0.1553TCA-0.240.200.040.070.000.180.140.12-0.280.000.00-0.0254TCC-0.01-0.25-0.110.030.13-0.020.080.160.120.120.090.1855TCG-0.140.120.000.15-0.120.05-0.11-0.12-0.040.04-0.14-0.0956TCT0.080.210.330.21-0.08-0.010.02-0.09-0.15-0.21-0.13-0.2757TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC-0.05-0.01-0.14-0.170.04-0.04-0.06-0.050.260.170.060.1559TGG-0.090.090.00-0.090.050.00-0.030.00-0.020.06-0.070.0260TGT-0.020.160.230.12-0.060.150.09-0.01-0.13-0.11-0.27-0.2161TTA-0.250.270.090.18-0.18-0.010.00-0.20-0.32-0.01-0.12-0.1562TTC0.30-0.21-0.13-0.020.04-0.20-0.140.120.20-0.110.11-0.0263TTG-0.23-0.12-0.28-0.260.150.27-0.020.050.210.270.090.0864TTT0.280.210.290.130.100.170.190.070.00-0.09-0.01-0.07ATAATCATGATTCAACACCAGCATCCACCCCCGCCT131415161718192021222324表l继续252627282930313233343536CGACGCCGGCGTCTACTCCTGCTTGAAGACGAGGAT1AAA-0.140.130.000.25-0.120.050.150.18-0.15-0.180.07-0.042AAC0.02-0.18-0.06-0.110.11-0.20-0.080.050.110.020.090.293AAG0.130.010.080.070.04-0.06-0.10-0.040.100.09-0.040.04<table>tableseeoriginaldocumentpage92</column></row><table>59TGG0.150.080.170.17-0.140.09-0.05-0.070.010.12-0.01-0.1060TGT-0.23-0.16-0.29-0.09-0.21-0.16-0.19-0.23-0.110.01-0.04-0.1961TTA-0.230.25-0.140.19-0.33-0.01-0.06-0.200.040.130.160.1162TTC0.01-0.22-0.08-0.200.18-0.21-0.010.010.140.040.110.2463TTG0.120.200.090.270.160.250.060.04-0.030.14-0.14-0.1464TTT-0.070.110.090.180.010.000.110.01-0.13-0.20-0.22-0.21CGACGCCGGCGTCTACTCCTGCTTGAAGACGAGGAT252627282930313233343536表C.l继续373839404142434445464748GCAGCCGCGGCTGGAGGCGGGGGTGTAGTCGTGGTT1AAA-0.27-0.15-0.040.06-0.15-0.15-0.130.21-0.26-0.120.040.182AAC0.310.040.410.390.260.010.310.270.420.090.300.323AAG0.160.050.000.040.090.010.10-0.030.22-0.030.02-0.024AAT-0.23-0.26-0.35-0.23-0.23-0.26-0.29-0.02-0.13-0.30-0.23-0.255ACA-0.29-0.070.010.06-0.020.06-0.010.34-0.240.100.060.186ACC0.280.160.350.380.250.000.310.110.35-0.070.210.187ACG-0.080.01-0.30-0.010.020.01-0.210.07-0.090.06-0.250.038ACT-0.19-0.17-0.15-0.27-0.23-0.20-0.19-0.28-0.12-0.14-0.02-0.279AGA-0.160.150.010.04-0.220.09-0.190.08-0.350.190.080.1410AGC0.180.150.310.270.160.080.210.160.270.130.170.2011AGG0.000.32-0.08-0.03-0.020.28-0.250.08-0.210.25-0.13-0.1312AGT-0.09-0.05-0.23-0.17-0.29-0.17-0.32-0.30-0.31-0.16-0.31-0.3013ATA-0.160.140.140.130.160.280.120.37-0.280.220.190.2214ATC0.280.040.330.350.170.060.250.150.430.040.240.3515ATG0.020.02-0.04-0.010.01-0.050.050.02-0.040.010.000.0016ATT-0.23-0.30-0.30-0.28-0.28-0.23-0.28-0.20-0.16-0.32-0.26-0.3017CAA-0.23-0.100.04-0.02-0.18-0.20-0.160.11-0.28-0.110.060.0818CAC0.300.030.330.270.21-0.030.320.150.440.020.480.2119CAG0.070.140.000.020.080.080.250.000.060.030.030.0320CAT-0.22-0.11-0.25-0.16-0.12-0.16-0.16-0.04-0.20-0.23-0.10-0.2821CCA-0.250.010.11-0.07-0.110.02-0.030.20-0.270.050.040.1222CCC0.250.030.280.200.160.040.20-0.020.25-0.090.180.1623CCG0.030.13-0.13-0.030.000.16-0.120.10-0.070.17-0.13-0.0524CCT-0.160.01-0.08-0.28-0.17-0.07-0.11-0.22-0.09-0.060.02-0.2525CGA-0.230.08-0.070.08-0.12-0.05-0.010.23-0.250.030.010.1626CGC0.14-0.060.240.310.16-0.010.210.130.230.110.110.3127CGG0.020.17-0.24-0.060.060.250.180.14-0.230.19-0.15-0.0228CGT-0.05-0.23-0.15-0.26-0.22-0.19-0.01-0.44-0.15-0.28-0.13-0.2729CTA-0.26-0.080.110.00-0.19-0.130.060.19-0.25-0.050.180.1430CTC0.31-0.020.400.290.240.100.350.160.33-0.080.320.2431CTG-0.06-0.06-0.13-0.09-0.15-0.05-0.02-0.12-0.04-0.03-0.09-0.0132CTT-0.14-0.10-0.13-0.22-0.230.01-0.18-0.23-0.01-0.180.03-0.2733GAA-0.21-0.14-0.08-0.06-0.17-0.13-0.220.06-0.26-0.09-0.010.0134GAC0.320.170.380.390.290.130.210.260.360.280.250.3035GAG0.140.19-0.040.060.160.17-0.020.040.040.18-0.03-0.0236GAT-0.16-0.17-0.34-0.23-0.18-0.15-0.31-0.07-0.19-0.13-0.25-0.2937GCA-0.300.05-0.02-0.02-0.110.010.000.17-0.210.190.020.1338GCC0.300.100.340.280.270.170.270.000.230.010.080.1739GCG0.130.23-0.230.140.150.14-0.080.07-0.040.26-0.140.0940GCT-0.12-0.13-0.18-0.40-0.18-0.11-0.22-0.37-0.11-0.15-0.15-0.3041GGA-0.060.230.100.08-0.120.11-0.090.09-0.110.260.070.1742GGC0.120.090.240.180.250.050.270.140.250.200.150.2643GGG0.020.33-0.120.010.280.420.280.21-0.230.37-0.20-0.039344GGT-0.14-0.27-0.09-0.40-0.22-0.18-0.07-0.54-0.20-0.32-0.18-0.3945GTA-0.25-0.04-0.010.00-0.090.16-0.080.37-0.310.130.140.2346GTC0.250.100.310.220.170.000.180.040.390.090.250.2747GTG0.030.12-0.08-0.090.040.150.030.16-0.090.15-0.28-0.0948GTT-0.15-0.10-0.25-0.32-0.22-0.15-0.31-0.30-0.01-0.17-0.14-0.3449TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC0.350.030.340.330.17-0.070.280.200.420.130.190.3051TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT-0.17-0.24-0.38-0.19-0.17-0.17-0.18-0.03-0.11-0.23-0.26-0.2553TCA-0.240.020.140.030.000.17-0.070.31-0.270.090.120.1854TCC0.17-0.080.320.230.210.000.220.100.28-0.070.190.2155TCG0.070.09-0.080.030.070.12-0.060.23-0.120.09-0.110.0656TCT-0.26-0.18-0.19-0.35-0.24-0.14-0.18-0.20-0.10-0.08-0.05-0.2057TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC0.180.040.240.200.250.180.14-0.030.190.210.170.2959TGG0.020.11-0.12-0.04-0.070.070.18-0.14-0.200.24-0.12-0.0560TGT-0.06-0.15-0.29-0.22-0.17-0.03-0.21-0.26-0.22-0.21-0.21-0.3161TTA-0.110.150.130.030.000.17-0.150.19-0.280.200.110.0762TTC0.400.120.380.350.300.210.320.160.450.060.300.2963TTG0.090.17-0.15-0.05-0.010.21-0.160.08-0.010.13-0.14-0.1264TTT-0.29-0.33-0.42-0.36-0.29-0.28-0.40-0.27-0.15-0.30-0.33-0.31GCAGCCGCGGCTGGAGGCGGGGGTGTAGTCGTGGTT373839404142434445464748表C.l继续495051525354555657585960TAATACTAGTATTCATCCTCGTCTTGATGCTGGTGT1AAA0.00-0.250.00-0.18-0.32-0.22-0.04-0.040.00-0.37-0.34-0.292AAC0.00-0.250.000.060.15-0.09-0.110.120.00-0.19-0.120.16AAG0.000.140.000.140.110.130.040.130.000.270.240.224AAT0.000.300.000.140.040.160.140.100.000.090.190.115ACA0.00-0.140.00-0.21-0.25-0.12-0.06-0.180.00-0.14-0.22-0.146ACC0.00-0.270.000.170.17-0.22-0.010.120.00-0.23-0.110.127ACG0.000.230.000.060.190.360.090.190.000.330.310.348ACT0.000.310.000.21-0.070.02-0.04-0.070.000.030.17-0.019AGA0.00-0.050.00-0.25-0.050.250.150.090.00-0.18-0.18-0.1710AGC0.00-0.230.00-0.130.100.080.100.090.00-0.21-0.21-0.1411AGG0.000.440.00-0.02-0.050.430.120.190.000.390.210.0512AGT0.000.300.000.020.210.440.380.340.000.150.210.1613ATA0.00-0.200.00-0.31-0.51-0.32-0.27-0.370.00-0.21-0.26-0.2914ATC0.00-0.270.000.120.17-0.230.030.100.00-0.10-0.100.1015ATG0.00-0.010.000.020.080.070.020.000.00-0.010.000.0216ATT0.000.410.000.380.130.070.200.130.000.200.310.1317CAA0.00-0.070.00-0.13-0.19-0.100.070.060.00-0.22-0.20-0.2418CAC0.00-0.210.00-0.020.11-0.15-0.060.050.00-0.21-0.150.0819CAG0.000.130.000.00-0.080.140.020.060.000.220.160.1620CAT0.000.290.000.00-0.070.090.020.040.000.170.190.0621CCA0.00-0.070.00-0.09-0.38-0.15-0.13-0.170.00-0.11-0.18-0.1722CCC0.00-0.170.000.230.15-0.160.130.040.00-0.08-0.020.1023CCG0.000.130.00-0.16-0.060.19-0.080.010.000.260.060.2124CCT0.000.110.000.09-0.110.020.05-0.120.00-0.060.16-0.0925CGA0.00-0.060.00-0.16-0.330.07-0.170.030.00-0.16-0.16-0.1126CGC0.00-0.160.000.000.04-0.110.050.060.00-0.14-0.090.0727CGG0.000.120.00-0.20-0.150.18-0.24-0.010.000.200.090.0828CGT0.000.320.000.190.190.160.250.090.000.090.260.0494<table>tableseeoriginaldocumentpage95</column></row><table>14ATC0.17-0.210.180.1415ATG0.220.020.15-0.0316ATT0.230.270.190.4617CAA-0.36-0.11-0.03-0.0718CAC0.26-0.110.11-0.1019CAG0.120.060.140.0920CAT-0.040.090.040.1821CCA-0.320.01-0.20-0.1722CCC0.04-0.12-0.040.0923CCG-0.170.12-0.22-0.0924CCT-0.170.06-0.110.1325CGA-0.24-0.06-0.02-0.0726CGC0.03-0.190.160.0327CGG-0.27-0.13-0.19-0.2028CGT-0.050.140.240.4029CTA-0.28-0.040.13-0.1430CTC0.20-0.290.29隱O.Ol31CTG0.340.150.230.1732CTT0.00-0.010.070.2333GAA-0.190.00-0.08-0.0834GAC0.29-0.10-0.08-0.3335GAG0.230.08-0.03-0.0936GAT0.100.240.030.2537GCA-0.260.02-0.21-0.2038GCC0.13-0.04-0.08-0.1639GCG0.120.24-0.23-0.1140GCT-0.170.08-0.150.1041GGA-0.260.03-0.13-0.1542GGC0.00-0.10-0.09-0.0443GGG-0.330.00-0.37-0.3644GGT0.120.230.280.4345GTA-0.38-0.13-0.13-0.2546GTC0.23-0.190.04-0.1147GTG0.260.28-0.10-0.1748GTT0.220.200.060.3449TAA0.000.000.000.0050TAC0.25-0.150.21-0.1151TAG0.000.000.000.0052TAT0.180.210.150.2553TCA-0.39-0.08-0.13-0.1454TCC0.05-0.120.080.0655TCG-0.010.14-0.020.0356TCT-0.130.02-0.050.1057TGA0.000.000.000.0058TGC-0.10-0.07-0.13-0.1759TGG0.040.040.10-0.0660TGT-0.090.190.010.1561TTA-0.39-0.03-0.14-0.2162TTC0.19-0.240.00-0.0163TTG0.170.23-0.050.0364TTT0.280.290.150.56TTATTCTTGTTT6162636496表C2:CPW矩阵Am'g^高表达序列(左侧密码子在第2列中指出,右侧密码子在第2行中指出)。宿主细胞AWgW;序列数据全A基因组;高表达的组400条序列。123456789101112AAAAACAAGAATACAACCACGACTAGAAGCAGGAGT1AAA0.930.640.650.920.610.650.670.570.070.61-0.090.752AAC0.44-0.48-0.420.490.57-0.500.16-0.330.33-0.250.600.133AAG0.51-0.42-0.360.480.27-0.460.43-0.18-0.23-0.140.130.234AAT0.600.680.750.650.350.630.440.800.750.360.800.315ACA0.290.450.560.710.220.430.150.660.330.58-0.140.286ACC0.34-0.58-0.530.450.33-0.590.43-0.460.31-0.41-0.200.157ACG0.25-0.020.390.160.560.080.520.470.010.59-0.300.578ACT0.500.810.480.480.660.100.600.20-0.360.260.640.629AGA0.64-0.070.010.270.560.270.630.29-0.320.340.090.2710AGC0.34-0.39-0.440.080.40-0.340.33-0.380.42-0.39-0.300.2011AGG-0.200.24-0.290.17-0.44-0.33-0.440.17-0.040.580.720.3312AGT0.780.560.790.510.280.580.540.450.490.710.890.7513ATA0.020.860.620.580.440.610.530.740.410.680.690.5014ATC0.12-0.53-0.440.290.52-0.580.12-0.030.38-0.140.040.2115ATG0.41-0.21-0.130.440.56-0.250.28-0.16-0.08-0.360.310.2616ATT0.630.770.820.820.660.380.610.470.760.610.750.5817CAA0.700.520.580.01-0.250.440.030.57-0.450.19-0.060.5318CAC-0.13-0.47-0.47-0.040.41-0.55-0.29-0.050.02-0.28-0.310.1619CAG0.29-0.29-0.370.180.45-0.390.49-0.270.15-0.240.220.1320CAT0.740.740.840.550.490.480.610.700.770.770.840.5721CCA0.610.760.310.48-0.330.310.040.410.320.290.280.6622CCC0.57-0.56-0.620.120.38-0.440.25-0.31-0.06-0.51-0.400.2823CCG0.130.080.590.270.330.360.460.190.190.590.840.6124CCT0.530.240.550.43-0.13-0.220.51-0.160.80-0.240.57-0.1825CGA0.730.370.770.77-0.070.380.610.620.720.350.800.5226CGC0.34-0.49-0.56-0.030.51-0.400.26-0.29-0.36-0.47-0.10-0.0827CGG0.200.320.580.480.140.480.110.340.580.510.250.4028CGT0.77-0.210.210.590.11-0.480.090.080.660.480.780.5329CTA0.800.470.710.91-0.070.190.830.51-0.260.120.450.3830CTC0.20-0.46-0.490.140.40-0.440.39-0.070.28-0.320.49-0.2531CTG0.42-0.34-0.220.380.61-0.420.380.080.30-0.24-0.420.2432CTT0.320.400.740.810.42-0.040.43-0.020.700.480.840.5133GAA0.790.310.580.580.100.330.470.640.480.340.010.4934GAC0.37-0.42-0.500.130.34-0.470.46-0.180.23-0.390.50-0.2635GAG0.52-0.37-0.430.280.54-0.430.33-0.400.06-0.380.220.0936GAT0.660.420.660.42-0.160.190.570.310.590.520.230.4437GCA0.240.210.480.420.460.32-0.100.430.200.17-0.350.5838GCC0.38-0.61-0.540.250.19-0.470.11-0.37-0.08-0.440.52-0.1439GCG0.660.260.420.300.500.420.540.240.060.530.630.7340GCT0.380.660.320.780.49-0.200.50-0.27-0.210.190.540.4841GGA0.67-0.01-0.010.400.330.05-0.03-0.020.11-0.360.530.0242GGC0.11-0.38-0.510.330.52-0.300.26-0.05-0.13-0.48-0.23-0.1343GGG0.720.530.610.640.640.440.700.570.830.660.910.5744GGT0.48-0.290.420.480.62-0.530.57-0.350.580.010.780.1545GTA0.790.650.640.580.560.500.750.350.700.590.680.3446GTC0.25-0.60-0.560.390.66-0.550.32-0.220.33-0.33-0.26-0.0247GTG0.630.210.370.710.32-0.220.190.200.220.22-0.230.6748GTT0.600.510.540.640.720.010.560.050.830.410.690.759749TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC0.34-0.49-0.450.390.49-0.520.59-0.31-0.42-0.250.680.0851TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT0.840.720.810.740.520.480.730.221.000.62-0.170.8053TCA0.400.600.66-0.06-0.640.33-0.430.570.360.33-0.630.3654TCC0.42-0.55-0.570.350.69-0.470.48-0.280.05-0.30-0.280.1355TCG0.18-0.020.13-0.090.130.020.290.240.390.300.070.4356TCT0.700.750.640.740.500.020.21-0.050.220.290.700.8257TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC0.57-0.33-0.33-0.170.20-0.460.11-0.31-0.050.39-0.520.4159TGG0.19-0.20-0.060.430.63-0.280.65-0.34-0.500.030.09-0.1260TGT0.200.740.420.500.670.410.810.640.420.771.000.3861TTA0.850.850.450.510.490.690.360.690.410.530.160.5662TTC0.18-0.45-0.350.420.33-0.460.36-0.180.46-0.140.170.0963TTG-0.26-0.020.10-0.050.25-0.21-0.06-0.220.420.300.590.1764TTT0.700.820.820.770.530.490.540.62-0.230.52-0.270.76AAAAACAAGAATACAACCACGACTAGAAGCAGGAGT123456789101112表C.2继续131415161718192021222324ATAATCATGATTCAACACCAGCATCCACCCCCGCCTlAAA0.110.750.530.32-0.36-0.090.550.560.610.290.530.802AAC0.76-0.39-0.10-0.080.35-0.44-0.460.400.62-0.560.04-0.153AAG0.63-0.36-0.160.170.47-0.37-0.260.550.53-0.510.23-0.084AAT0.180.600.190.530.790.610.630.150.790.550.770.16ACA0.700.590.370.47-0.150.35-0.170.490.030.050.730.166ACC0.31-0.53-0.41-0.280.37-0.53-0.280.680.62-0.54-0.010.03"7ACG-0.240.580.520.470.530.270.340.270.610.530.530.398ACT0.700.140.370.550.38-0.25-0.220.170.31-0.510.57-0.319AGA0.12-0.390.100.62-0.14-0.260.060.250.500.300.310.0210AGC0.64-0.23-0.26-0.010.43-0.05-0.280.380.58-0.160.15-0.1111AGG-0.600.550.050.750.20-0.390.470.650.150.630.76-0.2112AGT0.550.220.390.450.490.210.230.540.270.290.530.1513ATA-0.150.740.560.620.060.74-0.040.84-0.38-0.070.880.4614ATC0.58-0.43-0.31-0.070.33-0.52-0.290.520.04-0.390.250.0915ATG-0.07-0.160.000.330.10-0.26-0.060.410.22-0.410.280.4316ATT0.840.480.570.280.070.240.340.420.63-0.100.180.0117CAA0.770.300.250.560.21-0.190.440.28-0.32-0.05-0.130.2218CAC0.93-0.49-0.20-0.10-0.37-0.52-0.180.570.38-0.550.39-0.2619CAG-0.08-0.32-0.120.080.20-0.17-0.300.240.49-0.290.43-0.0720CAT0.440.530.300.620.090.490.610.250.260.620.62-0.0821CCA0.620.220.650.570.390.660.680.660.170.26-0.33-0.0822CCC0.17-0.48-0.42-0.340.40-0.57-0.510.540.370.26-0.08-0.0523CCG0.200.420.110.230.340.490.350.570.100.520.260.6924CCT-0.280.340.320.290.25-0.46-0.220.17-0.45-0.410.14-0.3925CGA0.660.440.480.570.730.570.490.490.690.640.41-0.2526CGC0.73-0.09-0.34-0.070.26-0.40-0.450.570.52-0.570.23-0.2027CGG0.240.400.360.480.460.510.710.71-0.140.500.420.7628CGT-0.19-0.610.00-0.010.02-0.55-0.48-0.180.44-0.620.18-0.2929CTA0.330.800.470.57-0.16-0.380.480.37-0.30-0.210.480.0530CTC0.65-0.35-0.18-0.080.50-0.38-0.410.620.58-0.480.35-0.0131CTG0.71-0.40-0.25-0.050.34-0.35-0.200.320.62-0.300.310.2032CTT0.610.440.560.56-0.24-0.16-0.100.39-0.41-0.460.50-0.3798<table>tableseeoriginaldocumentpage99</column></row><table>17CAA0.590.600.420.060.460.400.370.280.230.310.450.2718CAC0.60-0.36-0.01-0.66-0.26-0.46-0.41-0.10-0.12-0.43-0.220.3519CAG0.50-0.330.45-0.510.66-0.44-0.28-0.150.28-0.21-0.35-0.0620CAT0.650.660.470.23-0.300.600.520.760.240.320.290.1921CCA0.780.390.330.570.350.650.100.490.190.450.330.5022CCC0.49-0.310.55-0.69-0.29-0.340.040.090.07-0.32-0.46-0.1723CCG0.660.420.530.710.500.490.24-0.050.51-0.090.450.4524CCT-0.22-0.340.61-0.640.43-0.35-0.41-0.330.14-0.31-0.130.2925CGA0.640.620.460.690.100.730.450.350.340.300.690.7126CGC-0.13-0.450.37-0.620.17-0.30-0.110.350.170.01-0.220.2027CGG0.190.630.760.600.530.780.620.580.560.360.460.2028CGT0.48-0.450.21-0.69-0.16-0.63-0.59-0.470.09-0.62-0.64-0.3329CTA0.45-0.51-0.01-0.25-0.36-0.300.520.550.730.290.390.1730CTC0.69-0.540.07-0.650.70-0.44-0.18-0.280.14-0.29-0.330.0231CTG0.54-0.220.52-0.350.69-0.18-0.010.040.13-0.10-0.180.1332CTT0.580.020.60-0.330.33-0.31-0.24-0.050.26-0.34-0.300.1033GAA0.140.490.630.220.790.250.490.420.24-0.080.300.2234GAC0.52-0.44-0.15-0.58-0.09-0.33-0.10-0.260.32-0.05-0.210.3435GAG0.40-0.320.39-0.580.57-0.42-0.29-0.340.34-0.11-0.340.0536GAT-0.090.440.660.300.610.060.02-0.070.50-0.29-0.210.1337GCA0.390.640.600.670.480.730.520.390.410.350.170.4738GCC0.72-0.340.49-0.560.72-0.36-0.19-0.230.42-0.38-0.25-0.1239GCG0.740.650.520.310.230.260.350.440.37-0.080.360.3440GCT0.59-0.530.48-0.650.62-0.47-0.36-0.370.200.13-0.440.0941GGA0.710.140.62-0.100.640.560.580.480.480.510.300.2542GGC0.33-0.350.60-0.600.080.170.080.290.330.17-0.180.3743GGG-0.130.810.810.150.450.350.720.620.590.410.640.6544GGT0.65-0.270.59-0.64-0.11-0.64-0.54-0.47-0.12-0.58-0.47-0.2645GTA0.770.160.910.660.780.560.620.460.750.710.700.4846GTC0.57-0.520.25-0.620.28-0.40-0.23-0.310.43-0.28-0.230.1547GTG0.66-0.030.520.450.780.060.150.070.360.190.150.5048GTT0.78-0.200.53-0.25-0.07-0.37-0.14-0.100.01-0.48-0.470.0749TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC0.57-0.420.23-0.540.38-0.36-0.24-0.210.18-0.28-0.330.2551TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT0.570.630.390.56-0.110.22-0.070.480.500.160.260.0953TCA-0.180.680.630.34-0.550.240.460.460.540.270.11-0.0354TCC0.77-0.480.14-0.52-0.21-0.40-0.30-0.010.34-0.54-0.380.0955TCG0.490.200.530.55-0.540.340.100.320.420.200.180.1956TCT-0.02-0.470.43-0.550.42-0.15-0.30-0.320.140.16-0.200.2657TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC0.820.050.73-0.59-0.22-0.02-0.040.340.090.02-0.200.1359TGG0.74-0.280.61-0.02-0.43-0.120.050.170.210.11-0.12-0.1060TGT0.410.71-0.19-0.460.09-0.05-0.11-0.090.240.320.16-0.3761TTA0.100.930.430.78-0.050.480.401.000.430.230.850.8162TTC0.67-0.370.43-0.610.04-0.26-0.35-0.150.31-0.17-0.210.2563TTG0.560.630.590.30-0.13-0.010.290.010.370.240.080.2264TTT0.820.640.770.68-0.500.640.680.450.03-0.350.100.45CGACGCCGGCGTCTACTCCTGCTTGAAGACGAGGAT252627282930313233343536表C.2继续373839404142434445464748GCAGCCGCGGCTGGAGGCGGGGGTGTAGTCGTGGTT1001AAA0.410.430.350.59-0.150.240.210.760.900.310.570.602AAC0.66-0.300.33-0.040.12-0.320.53-0.220.92-0.400.38-0.033AAG0.49-0.430.52-0.270.38-0.140.85-0.470.73-0.420.29-0.204AAT-0.06-0.110.24-0.130.340.110.360.270.57-0.070.140.15ACA0.530.450.570.540.640.460.450.530.060.550.370.696ACC0.30-0.140.53-0.140.19-0.170.69-0.460.77-0.450.29-0.317ACG0.24-0.120.500.180.410.410.700.290.040.570.180.128ACT-0.18-0.470.05-0.54-0.03-0.300.55-0.610.24-0.380.25-0.309AGA0.160.270.500.470.180.630.820.210.420.540.350.4510AGC0.590.140.58-0.080.53-0.050.71-0.350.730.050.17-0.1211AGG-0.340.090.820.440.570.490.810.39-0.180.210.510.4712AGT0.46-0.260.240.050.350.160.53-0.320.36-0.16-0.17-0.0913ATA0.600.700.790.770.160.860.830.83-0.540.730.520.6914ATC0.61-0.060.62-0.040.400.200.79-0.440.72-0.260.530.1515ATG0.30-0.240.38-0.080.330.060.36-0.330.58-0.230.30-0.1116ATT0.29-0.550.02-0.44-0.03-0.210.39-0.470.00-0.450.07-0.2317CAA-0.020.240.480.370.080.170.680.030.500.340.540.5518CAC0.20-0.340.32-0.27-0.04-0.150.80-0.440.45-0.430.71-0.3519CAG0.33-0.290.27-0.330.32-0.120.75-0.450.57-0.420.16-0.2920CAT-0.420.390.430.450.010.160.520.270.92-0.150.480.2021CCA0.450.570.350.090.35-0.07-0.290.38-0.250.570.540.5522CCC0.57-0.430.57-0.290.09-0.200.64-0.570.12-0.48-0.08-0.3123CCG0.140.220.09-0.170.560.490.640.530.680.430.420.1224CCT0.53-0.260.55-0.440.120.050.60-0.480.59-0.280.11-0.1725CGA0.680.660.750.600.47-0.170.530.700.780.130.470.5826CGC0.22-0.150.65-0.150.440.100.60-0.060.090.140.200.1227CGG0.520.430.390.300.630.580.740.420.250.550.110.2728CGT0.09-0.69-0.02-0.66-0.35-0.430.45-0.820.51-0.73-0.26-0.4629CTA0.53-0.13-0.23-0.32-0.39-0.220.660.730.440.560.75-0.2230CTC0.48-0.220.52-0.070.130.050.78-0.350.67-0.380.38-0.0131CTG0.24-0.280.20-0.260.07-0.070.65-0.320.35-0.350.11-0.0932CTT0.34-0.450.45-0.46-0.32-0.260.58-0.58-0.07-0.330.18-0.2833GAA0.160.030.210.380.300.050.490.310.70-0.160.340.3434GAC0.60-0.120.490.260.40-0.160.69-0.140.590.060.490.0235GAG0.51-0.290.44-0.370.34-0.150.77-0.490.69-0.310.37-0.3236GAT0.53-0.420.09-0.290.04-0.170.06-0.070.36-0.360.15-0.2737GCA0.290.660.420.510.230.300.390.50-0.220.510.370.5338GCC0.61-0.280.42-0.060.540.230.75-0.490.59-0.410.34-0.2839GCG0.460.280.440.480.510.510.740.320.160.450.450.5140GCT0.31-0.530.12-0.58-0.05-0.350.57-0.640.07-0.400.19-0.3641GGA0.140.380.500.420.450.460.800.310.780.500.630.2342GGC0.510.360.750.110.490.280.750.080.640.470.540.2843GGG0.610.780.840.460.600.830.890.780.730.730.740.7744GGT0.39-0.660.32-0.67-0.15-0.330.63-0.780.63-0.66-0.22-0.6945GTA-0.11-0.050.340.730.510.51-0.210.890.660.390.460.7046GTC0.43-0.160.66-0.200.22-0.140.47-0.530.67-0.360.41-0.0447GTG0.590.140.580.210.500.370.710.550.750.220.390.3848GTT-0.13-0.470.22-0.530.02-0.170.41-0.560.57-0.510.06-0.5049TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC0.66-0.320.52-0.240.41-0.220.06-0.220.77-0.16-0.100.1551TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT0.45-0.260.59-0.130.38-0.140.520.110.36-0.040.20-0.2553TCA0.550.540.160.310.430.820.620.780.440.220.490.5754TCC0.10-0.390.53-0.320.00-0.370.52-0.500.47-0.540.10-0.3655TCG0.210.200.400.130.410.390.660.350.570.370.270.3510156TCT0.20-0.440.26-0.510.08-0.420.06-0.370.33-0.090.33-0.1657TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC0.630.140.31-0.430.410.260.16-0.43-0.170.350.580.2359TGG0.350.000.00-0.190.420.120.68-0.45-0.150.49-0.17-0.3160TGT0.50-0.230.25-0.230.210.310.44-0.400.66-0.580.35-0.4661TTA0.680.700.700.450.920.95-0.020.810.360.79-0.141.0062TTC0.77-0.260.66-0.240.460.060.80-0.430.09-0.350.450.0163TTG0.550.290.450.270.470.470.780.000.230.340.330.0164TTT0.19-0.270.01-0.170.06-0.020.55-0.290.74-0.140.230.26GCAGCCGCGGCTGGAGGCGGGGGTGTAGTCGTGGTT373839404142434445464748表C.2继续495051525354555657585960TAATACTAGTATTCATCCTCGTCTTGATGCTGGTGT1AAA0.000.050.000.44-0.600.470.210.510.000.06-0.070.162AAC0.00-0.380.000.110.51-0.47-0.17-0.170.00-0.39-0.300.08AAG0.00-0.270.000.430.32-0.380.040.040.00-0.170.030.254AAT0.000.470.000.740.470.660.450.430.000.580.720.65ACA0.000.100.000.360.290.260.350.180.000.31-0.110.266ACC0.00-0.540.000.410.29-0.58-0.30-0.010.00-0.44-0.220.297ACG0.000.110.000.590.180.400.220.460.000.520.030.658ACT0.000.470.000.540.38-0.120.33-0.220.00-0.190.53-0.029AGA0.00-0.270.00-0.510.12-0.310.72-0.190.000.27-0.66-0.1710AGC0.00-0.450.000.29-0.12-0.280.22-0.220.00-0.11-0.40-0.0611AGG0.000.710.000.460.18-0,430.560.300.000.870.490.3612AGT0.000.280.000.330.310.550.700.470.000.300.650.6113ATA0.000.300.00-0.42-0.840.700.67-0.570.000.400.58-0.5414ATC0.00-0.410.000.350.64-0.590.23-0.050.00-0.17-0.32-0.1715ATG0.00-0.220.000.450.34-0.170.390.090.00-0.110.000.2116ATT0.000.550.000.470.690.410.480.480.000.510.620.2517CAA0.000.060.00-0.090.590.190.290.400.000.16-0.04-0.4218CAC0.00-0.470.000.350.17-0.54-0.01-0.330.00-0.50-0.35-0.0419CAG0.00-0.160.000.320.43-0.42-0.05-0.080.00-0.100.030.4520CAT0.000.510.000.450.540.420.380.230.000.660.640.8921CCA0.000.090.000.520.290.250.37-0.050.000.260.450.3822CCC0.00-0.500.000.510.76-0.590.33-0.170.00-0.42-0.330.3723CCG0.00-0.020.000.23-0.330.390.370.190.000.640.381.0024CCT0.000.100.000.520.67-0.120.17-0.060.00-0.45-0.07-0.0725CGA0.00-0.220.000.71-0.430.170.260.670.000.450.550.0826CGC0.00-0.350.000.430.26-0.53-0.27-0.360.00-0.610.000.4027CGG0.000.340.000.560.360.320.560.500.000.510.440.5328CGT0.00-0.390.000.500.26-0.280.220.140.00-0.08-0.020.4329CTA0.00-0.150.000.610.59-0.250.540.570.000.47-0.111.0030CTC0.00-0.410.000.240.11-0.460.27-0.130.00-0.42-0.44-0.1131CTG0.00-0.280.000.31-0.10-0.210.11-0.150.00-0.010.110.2332CTT0.000.200.000.590.25-0.180.03-0.300.00-0.510.320.3633GAA0.00-0.210.000.330.460.070.180.350.000.45-0.130.2634GAC0.00-0.470.000.420.61-0.44-0.12-0.320.00-0.43-0.330.0035GAG0.00-0.200.000.480.56-0.400.20-0.090.00-0.380.100.3636GAT0.000.300.000.560.680.200.440.290.000.620.520.2537GCA0.000.130.000.38-0.110.030.230.210.000.170.100.4538GCC0.00-0.590.000.380.57-0.560.17-0.130.00-0.25-0.33-0.2339GCG0.000.540.000.550.180.520.520.270.000.690.390.6840GCT0.000.440.000.270.49-0.320.26-0.130.00-0.260.27-0.1541GGA0.00-0.250.000.030.23-0.150.190.390.000.020.03-0.2642GGC0.00-0.260.000.310.46-0.370.38-0.280.00-0.22-0.40-0.0743GGG0.000.560.000.120.330.610.400.370.000.680.700.2944GGT0.00-0.160.000.490.75-0.320.510.060.00-0.100.420.3845GTA0.00-0.300.000.580.110.08-0.05-0.520.000.430.450.2246GTC0.00-0.440.000.250.51-0.550.22-0.300.00-0.50-0.300.2647GTG0.000.190.000.480.110.390.460.080.000.030.060.7048GTT0.000.080.000.570.02-0.270.430.160.000.240.420.5249TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC0.00-0.430.000.180.23-0.400.04-0.500.00-0.43-0.280.1351TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT0.000.610.000.710.840.430.600.270.000.780.650.5753TCA0.000.390.000.41-0.19-0.070.58-0.100.000.340.210.3254TCC0.00-0.470.000.310.27-0.520.47-0.220.00-0.48-0.250.3655TCG0.000.340.000.520.110.220.110.140.000.380.480.7456TCT0.000.170.000.140.22-0.480.02-0.340.00-0.400.120.2857TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC0.00-0.320.000.350.30-0.47-0.45-0.470.00-0.30-0.25-0.0859TGG0.00-0.190.000.38-0.24-0.130.280.170.000.110.00-0.1660TGT0.000.340.000.140.610.510.750.490.000.500.560.5061TTA0.000.540.000.36-0.200.560.410.490.000.830.550.7262TTC0.00-0.420.000.330.51-0.490.38-0.190.00-0.08-0.21-0.3563TTG0.000.200.000.090.430.310.430.250.000.790.540.2564TTT0.000.680.000.770.650.270.43-0.090.000.740.590.30TAATACTAGTATTCATCCTCGTCTTGATGCTGGTGT495051525354555657585960表C.2继续61TTA62TTC63TTG64TTT1AAA0.720.340.100.502AAC0.79-0.390.000.313AAG0.15-0.270.340.394AAT0.460.650.370.455ACA0.000.140.080.376ACC0.89-0.370.190.547ACG0.810.460.68-0.108ACT0.82-0.090.210.159AGA0.100.210.180.6610AGC0.55-0.12-0.010.2111AGG0.050.52-0.180.7112AGT0.420.370.200.6613ATA0.290.420.710.4214ATC0.50-0.410.240.2915ATG0.52-0.160.350.4516ATT0.850.540.200.5317CAA0.050.11-0.120.3418CAC0.73-0.370.050.0719CAG0.70-0.260.510.4620CAT0.790.340.330.7121CCA-0.070.310.360.2222CCC0.55-0.51-0.320.3223CCG0.410.390.250.4010324CCT0.570.010.370.6625CGA1.000.420.540.4326CGC-0.29-0.440.11-0.2027CGG0.610.300.030.5728CGT1.00-0.370.020.4629CTA-0.64-0.190.530.0030CTC0.65-0.440.100.1831CTG0.68-0.060.250.2532CTT0.490.000.070.4933GAA0.550.130.200.3934GAC0.71-0.410.19-0.1335GAG0.78-0.280.220.4336GAT0.870.440.370.7837GCA0.500.460.130.2638GCC0.82-0.170.030.1739GCG0.460.200.44-0.0740GCT0.49-0.360.110.4641GGA0.31-0.140.180.3642GGC0.76-0.120.120.4343GGG0.530.63-0.050.3544GGT0.94-0.440.520.7545GTA0.570.270.480.6346GTC0.58-0.430.260.2947GTG0.930.100.300.3548GTT0.590.220.390.5149TAA0.000.000.000.0050TAC0.87-0.370.270.1451TAG0.000.000.000.0052TAT0.790.560.580.6553TCA0.090.290.120.4954TCC0.65-0.320.190.3955TCG0.62-0.110.440.3356TCT-0.41-0.320.270.2757TGA0.000.000.000.0058TGC0.30-0.28-0.180.2059TGG0.77-0.140.110.3760TGT-0.140.330.440.3861TTA0.660.790.670.4062TTC0.75-0.370.130.3463TTG0.590.400.330.3964TTT0.770.630.500.80TTATTCTTGTTT61626364104表C.3:CPW矩阵^^7/m全基因组(左侧密码子在第2列中指出,右侧密码子在第2行中指出)。宿主细胞序列数据全及wZrito基因组。123456789101112aaaaacaagaatacaaccacgactagaagcaggagt1aaa0.02-0.28-0.110.04-0.28-0.13-0.230.440.16-0.39-0.020.042aac-0.04-0.220.01-0.160.090.100.060.10-0.17-0.63-0.38-0.543aag0.000.330.180.360.430.590.380.500.480.220.280.054aat-0.030.110.130.24-0.08-0.05-0.200.290.610.570.560.325aca-0.22-0.31-0.27-0.16-0.13-0.24-0.13-0.33-0.45-0.30-0.51-0.256acc0.400.190.43-0.130.420.100.61-0.120.050.370.160.027acg-0.060.22-0.200.04-0.180.170.01-0.31-0.370.26-0.22-0.028act0.570.690.540.490.450.530.680.180.360.750.320.469aga-0.31-0.31-0.32-0.18-0.09-0.30-0.370.22-0.44-0.34-0.59-0.2010agc-0.090.05-0.15-0.14-0.07-0.15-0.10-0.20-0.31-0.32-0.59-0.3811agg0.300.520.230.290.350.510.450.43-0.070.44-0.020.2612agt0.100.540.280.280.030.520.030.150.710.870.820.8013ata-0.40-0.15-0.37-0.080.24-0.230.12-0.14-0.110.28-0.430.0614atc-0.15-0.170.07-0.29-0.05-0.130.15-0.21-0.44-0.50-0.47-0.4815atg0.110.08-0.21-0.060.020.14-0.10-0.01-0.31-0.09-0.32-0.1916att0.200.210.310.31-0.020.03-0.030.180.690.850.660.6017caa-0.27-0.43-0.38-0.26-0.10-0.46-0.26-0.02-0.18-0.40-0.18-0.1718cac0.14-0.04-0.04-0.220.26-0.080.100.10-0.32-0.52-0.40-0.4419cag0.470.580.400.520.310.410.160.330.500.650.290.6320cat-0.060.010.020.15-0.120.01-0.090.110.610.660.620.4721cca-0.34-0.49-0.45-0.340.09-0.330.11-0.37-0.51-0.48-0.51-0.5022ccc0.500.350.59-0.090.550.250.69-0.110.140.300.830.1123ccg-0.020.18-0.13-0.12-0.31-0.09-0.03-0.51-0.220.370.390.0724cct0.310.490.330.390.490.450.670.280.550.800.540.5825cga-0.47-0.52-0.32-0.45-0.08-0.21-0.160.01-0.32-0.550.17-0.2226cgc0.310.130.10-0.100.15-0.060.140.27-0.13-0.33-0.15-0.3127cgg0.210.480.210.04-0.250.24-0.43-0.040.380.450.310.4628cgt0.420.610.510.500.240.590.170.360.890.900.900.8529cta-0.36-0.20-0.43-0.270.29-0.360.28-0.06-0.37-0.31-0.47-0.2330ctc0.370.340.27-0.020.34-0.090.50-0.11-0.39-0.38-0.04-0.3331ctg-0.100.03-0.20-0.13-0.26-0.18-0.23-0.410.090.450.100.3932ctt0.670.610.680.550.560.440.610.310.540.740.600.6833gaa-0.09-0.25-0.06-0.03-0.20-0.16-0.210.280.08-0.370.15-0.1534gac-0.05-0.16-0.05-0.210.130.010.18-0.07-0.35-0.65-0.52-0.5835gag0.200.400.170.320.360.570.160.350.340.230.500.1436gat-0.010.070.140.18-0.090.09-0.150.130.720.770.720.5637gca-0.29-0.29-0.24-0.19-0.21-0.32-0.07-0.38-0.46-0.27-0.39-0.3538gcc0.13-0.100.05-0.350.160.120.44-0.31-0.47-0.230.06-0.2539gcg-0.070.30-0.150.04-0.250.210.05-0.26-0.250.440.310.2640gct0.430.440.470.500.330.530.560.130.650.870.760.6141gga-0.33-0.38-0.11-0.24-0.21-0.23-0.36-0.14-0.29-0.45-0.20-0.2642ggc0.300.270.210.110.100.070.14-0.01-0.27-0.30-0.36-0.2743ggg-0.24-0.01-0.13-0.160.100.340.100.220.070.100.21-0.0244ggt0.320.520.510.510.230.620.230.080.810.880.850.8445gta-0.34-0.34-0.37-0.22-0.09-0.23-0.210.05-0.28-0.09-0.400.0046gtc0.000.010.43-0.260.01-0.260.03-0.19-0.55-0.56-0.38-0.4747gtg-0.210.05-0.29-0.16-0.25-0.05-0.170.020.220.550.190.4248gtt0.560.490.490.530.410.360.410.130.680.870.760.6649TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC0.01-0.320.10-0.20-0.04-0.03-0.030.02-0.36-0.60-0.49-0.5851TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT-0.060.110.090.24-0.050.20-0.120.240.650.750.680.5453TCA-0.19-0.22-0.19-0.22-0.16-0.27-0.04-0.44-0.32-0.23-0.36-0.1654TCC0.250.060.37-0.260.280.060.45-0.15-0.42-0.220.29-0.3055TCG-0.020.32-0.14-0.06-0.050.080.24-0.28-0.030.580.220.2056TCT0.150.270.240.240.230.310.51-0.130.700.860.560.5857TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC0.06-0.09-0.14-0.150.21-0.200.050.310.08-0.39-0.34-0.1259TGG0.130.06-0.23-0.04-0.020.29-0.190.15-0.28-0.200.04-0.1360TGT0.010.330.020.050.01-0.26-0.300.200.820.910.830.706iTTA-0.35-0.36-0.37-0.35-0.17-0.33-0.32-0.33-0.26-0.16-0.19-0.2162TTC-0.090.020.02-0.040.12-0.180.29-0.29-0.54-0.49-0.58-0.4663TTG-0.080.10-0.22-0.190.150.040.10-0.180.000.430.110.1664TTT-0.020.030.14-0.010.02-0.160.000.050.490.800.500.57AAAAACAAGAATACAACCACGACTAGAAGCAGGAGT123456789101112表C.3继续131415161718192021222324ATAATCATGATTCAACACCAGCATCCACCCCCGCCT1AAA-0.36-0.29-0.140.220.070.140.150.350.070.350.190.452AAC0.170.010.040.060.090.020.320.170.200.170.190.123AAG0.010.320.380.29-0.12-0.37-0.28-0.41-0.20-0.31-0.41-0.424AAT-0.25-0.06-0.030.05-0.230.04-0.02-0.14-0.26-0.27-0.110.065ACA0.160.230.040.360.020.09-0.050.220.080.27-0.030.056ACC-0.05-0.230.19-0.520.00-0.050.11-0.290.290.400.470.367ACG-0.35-0.19-0.35-0.370.050.180.300.24-0.320.05-0.24-0.338ACT0.510.780.640.66-0.34-0.39-0.08-0.330.050.140.29-0.099AGA-0.14-0.09-0.110.110.210.23-0.190.280.440.230.260.3610AGC-0.090.06-0.21-0.02-0.20-0.07-0.20-0.130.350.360.100.3411AGG-0.140.310.210.25-0.13-0.26-0.40-0.27-0.29-0.02-0.33-0.3112AGT-0.220.24-0.06-0.080.080.430.180.01-0.21-0.310.00-0.0713ATA0.020.680.560.72-0.190.04-0.090.01-0.05-0.13-0.170.0514ATC-0.11-0.24-0.19-0.260.310.380.380.240.200.110.280.3215ATG0.090.020.00-0.040.27-0.04-0.220.020.260.10-0.11-0.0216ATT0.220.030.030.07-0.23-0.25-0.10-0.13-0.21-0.29-0.12-0.0817CAA-0.21-0.33-0.30-0.140.640.760.700.740.690.730.870.8118CAC0.260.050.050.050.150.000.170.00-0.010.070.170.1319CAG0.450.420.450.19-0.31-0.44-0.48-0.43-0.40-0.44-0.46-0.4720CAT0.04-0.03-0.02-0.06-0.05-0.03-0.090.01-0.10-0.260.00-0.0221CCA-0.02-0.22-0.320.070.580.540.650.640.400,710.870.6522CCC0.420.150.360.06-0.03-0.350.34-0.110.130.690.44-0.0223CCG0.12-0.12-0.18-0.36-0.24-0.09-0.14-0.15-0.40-0.24-0.22-0.3524CCT0.430.570.520.46-0.08-0.02-0.09-0.12-0.050.080.230.0325CGA-0.15-0.37-0.38-0.230.680.600.590.570.640.790.710.8526CGC0.27-0.050.180.08-0.06-0.29-0.32-0.310.28-0.07-0.120.0127CGG-0.05-0.03-0.15-0.300.00-0.14-0.12-0.250.17-0.22-0.23-0.2028CGT0.330.220.340.210.140.240.320.23-0.24-0.43-0.32-0.2829CTA-0.170.200.120.380.540.650.600.480.750.760.800.7930CTC-0.04-0.10-0.11-0.240.210.370.440.170.22-0.090.150.1031CTG0.29-0.25-0.22-0.21-0.23-0.14-0.040.01-0.25-0.12-0.32-0.2632CTT0.470.530.580.48-0.25-0.42-0.23-0.30-0.20-0.29-0.18-0.32106<table>tableseeoriginaldocumentpage107</column></row><table>17CAA0.490.730.680.780.430.810.820.850.140.02-0.110.1918CAC-0.19-0.53-0.27-0.460.100.210.310.110.270.360.140.1219CAG-0.39-0.43-0.52-0.45-0.36-0.50-0.58-0.58-0.030.04-0.10-0.1920CAT0.100.250.130.130.030.100.260.26-0.06-0.14-0.17-0.0621CCA0.600.670.730.600.530.690.770.720.250.07-0.020.2522CCC-0.48-0.62-0.11-0.570.210.190.610.400.250.08-0.17-0.1723CCG-0.12-0.22-0.15-0.22-0.26-0.55-0.33-0.520.210.360.280.3124CCT0.450.380.630.040.150.290.650.24-0.33-0.32-0.34-0.4025CGA0.530.650.680.700.510.640.590.660.360.280.050.3026CGC0.09-0.51-0.23-0.510.100.09-0.060.140.580.290.240.3527CGG-0.010.360.160.210.12-0.37-0.51-0.41-0.04-0.04-0.14-0.0628CGT0.850.760.850.48-0.03-0.30-0.05-O.U-0.27-0.34-0.17-0.3329CTA0.850.820.760.720.150.720.780.700.190.24-0.110.1630CTC0.08-0.320.07-0.300.420.280.490.380.350.100.49-0.0631CTG0.10-0.12-0.18-0.130.09-0.32-0.44-0.310.300.390.050.2732CTT0.10-0.310.02-0.210.06-0.140.340.11-0.35-0.09-0.38-0.4433GAA-0.230.20-0.110.330.290.480.270.58-0.02-0.07-0.130.1634GAC-0.38-0.54-0.45-0.510.270.280.320.250.360.390.250.2535GAG-0.13-0.21-0.49-0.34-0.39-0.35-0.60-0.630.100.080.17-0.2336GAT0.160.490.330.390.040.060.170.10-0.18-0.18-0.09-0.1337GCA0.170.190.360.260.160.150.240.240.00-0.05-0.020.1438GCC-0.01-0.46-0.18-0.380.400.310.470.270.430.370.180.2039GCG0.100.210.10-0.01-0.11-0.39-0.37-0.480.130.240.140.2140GCT0.680.640.730.220.030.150.530.03-0.31-0.36-0.29-0.3541GGA-0.11-0.050.060.030.250.300.270.27-0.14-0.28-0.040.0642GGC0.00-0.08-0.16-0.310.330.460.320.140.410.300.180.2743GGG-0.09-0.13-0.160.02-0.17-0.36-0.53-0.560.050.210.270.1244GGT0.710.790.790.670.140.020.34-0.20-0.37-0.38-0.28-0.2845GTA0.320.320.300.370.140.220.300.51-0.06-0.12-0.180.1246GTC0.20-0.320.18-0.180.470.450.500.500.490.460.370.1847GTG0.180.12-0.070.00-0.10-0.37-0.55-0.420.180.270.140.2548GTT0.210.010.16-0.02-0.05-0.210.180.04-0.35-0.31-0.27-0.3549TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC-0.33-0.42-0.39-0.400.050.340.420.360.290.290.190.1651TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT-0.110.320.080.29-0.120.110.390.36-0.18-0.160.04-0.0653TCA0.240.280.260.39-0.140.080.340.160.03-0.07-0.140.0754TCC-0.07-0.45-0.11-0.410.120.270.510.170.550.460.340.3055TCG0.390.370.230.10-0.36-0.38-0.31-0.470.260.330.210.1856TCT0.400.600.650.35-0.19-0.160.41-0.12-0.25-0.20-0.15-0.3057TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC-0.49-0.53-0.45-0.470.130.27-0.020.350.500.430.140.3959TGG0.220.10-0.080.36-0.04-0.28-0.41-0.170.080.16-0.14-0.0860TGT0.840.830.640.77-0.31-0.210.000.03-0.29-0.36-0.37-0.3161TTA0.490.640.430.650.150.550.580.660.030.10-0.100.0762TTC-0.33-0.50-0.26-0.53-0.010.240.320.280.440.480.530.4463TTG-0.23-0.25-0.26-0.25-0.22-0.32-0.47-0.350.270.230.110.1564TTT0.430.420.380.43-0.08-0.060.370.39-0.22-0.18-0.07-0.17CGACGCCGGCGTCTACTCCTGCTTGAAGACGAGGAT252627282930313233343536表C.3继续373839404142434445464748GCAGCCGCGGCTGGAGGCGGGGGTGTAGTCGTGGTT108<table>tableseeoriginaldocumentpage109</column></row><table>56TCT-0.11-0.160.13-0.310.440.580.560.23-0.31-0.400.09-0.4957TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC0.23-0.010.000.36-0.49-0.38-0.35-0.170.330.390.370.5859TGG-0.060.29-0.190.08-0.230.090.000.320.060.33-0.300.0560TGT-0.29-0.05-0.230.080.690.890.800.63-0.46-0.44-0.36-0.0261TTA0.250.210.260.270.100.12-0.17-0.070.290.340.250.3262TTC0.20-0.050.450.16-0.33-0.44-0.33-0.400.330.250.560.0863TTG-0.150.31-0.03-0.130.390.480.340.570.480.330.05-0.0664TTT0.01-0.28-0.090.040.350.190.370.330.07-0.31-0.06-0.07GCAGCCGCGGCTGGAGGCGGGGGTGTAGTCGTGGTT373839404142434445464748表C.3继续495051525354555657585960TAATACTAGTATTCATCCTCGTCTTGATGCTGGTGT1AAA0.00-0.180.000.06-0.190.23-0.210.400.00-0.09-0.030.082AAC0.00-0.080.00-0.090.230.090.310.270.00-0.29-0.25-0.253AAG0.000.110.000.080.340.390.180.270.000.170.07-0.104AAT0.000.010.000.110.060.270.090.280.000.300.260.305ACA0.000.000.000.12-0.12-0.03-0.010.060.000.050.000.006ACC0.000.110.00-0.170.30-0.09-0.20-0.130.00-0.140.42隱O.Ol7ACG0.00-0.030.00-0.070.210.240.390.180.000.050.100.538ACT0.00-0.120.000.07-0.32-0.150.02-0.380.00-0.04-0.37-0.489AGA0.000.180.000.380.360.450.030.500.000.14-0.160.4410AGC0.00-0.130.00-0.180.230.24-0.020.180.00-0.31-0.39-0.2911AGG0.000.230.000.260.250.35-0.170.420.000.190.100.3012AGT0.000.440.000.16-0.220.21-0.050.140.000.820.480.4113ATA0.00-0.240.00-0.03-0.12-0.23-0.38-0.060.00-0.30-0.45-0.2114ATC0.000.380.000.250.350.330.450.400.000.220.280.2015ATG0.00-0.190.000.120.140.08-0.070.050.00-0.090.000.1216ATT0.00-0.150.00-0.16-0.16-0.250.00-0.320.00-0.03-0.01-0.0917CAA0.00-0.270.00-0.21-0.18-0.28-0.39-0.040.00-0.35-0.19-0.2118CAC0.000.280.000.090.24-0.070.130.100.00-0.21-0.070.0719CAG0.000.400.000.270.250.130.210.240.000.390.240.4720CAT0.00-0.080.00-0.07-0.08-0.27-0.02-0.060.000.170.04-0.0821CCA0.00-0.300.00-0.18-0.28-0.17-0.06-0.390.00-0.36-0.38-0.1622CCC0.000.420.000.000.13-0.160.280.230.00-0.130.710.2223CCG0.000.320.000.330.210.300.440.070.000.370.290.4324CCT0.00-0.150.00-0.30-0.21-0.40-0.24-0.350.00-0.19-0.20-0.3325CGA0.00-0.160.00-0.200.250.03-0.010.250.00-0.050.15-0.0126CGC0.00-0.250.00-0.27-0.06-0.280.040.000.00-0.54-0.47-0.5627CGG0.000.340.000.240.240.51-0.060.430.000.560.590.4228CGT0.00-0.140.00-0.25-0.42-0.42-0.50-0.450.000.590.630.3529CTA0.00-0.370.00-0.37-0.28-0.37-0.47-0.200.00-0.50-0.53-0.5130CTC0.000.260.000.170.220.280.340.210.000.280.28-0.1631CTG0.000.110.000.110.06-0.05-0.01-0.020.000.290.180.6032CTT0.00-0.060.00-0.13-0.09-0.29-0.07-0.440.00-0.150.11-0.1133GAA0.00-0.160.000.05-0.070.08-0.220.280.00-0.150.00-0.0334GAC0.000.280.000.180.500.340.360.380.00-0.24-0.21-0.1135GAG0.000.130.000.050.280.500.150.290.000.180.000.2936GAT0.00-0.080.00-0.12-0.12-0.15-0.110.010.000.200.150.0537GCA0.00-0.040.000.150.00-0.23-0.08-0.010.000.01-0.110.0038GCC0.000.080.00-0.080.310.160.330.070.00-0.390.11-0.2939GCG0.000.290.000.260.390.410.530.290.000.400.350.3411040GCT0.00-0.250.00-0.27-0.41-0.45-0.25-0.480.000.18-0.24-0.1041GGA0.000.100.000.10-0.050,790.070.420.000.130.270.3142GGC0.00-0.200.00-0.170.220.110.180.170.00-0.39-0.46-0.4743GGG0.000.290.000.240.420.600.390.410.000.280.460.4744GGT0.000.250.00-0.10-0.51-0.11-0.39-0.410.000.760.700.6345GTA0.00-0.350.00-0.08-0.29-0.32-0.39-0.270.000.01-0.36-0.3346GTC0.000.480.000.210.410.360.440.440.00-0.040.290.0447GTG0.00-0.100.00-0.020.230.260.230.210.000.360.110.4248GTT0.000.070.00-0.12-0.14-0.40-0.01-0.370.00-0.120.03-0.2149TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC0.000.000.000.050.290.080.050.180.00-0.10-0.05-0.2051TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT0.00-0.020.00-0.02-0.09-0.14-0.010.090.000.070.030.1153TCA0.000.180.000.300.08-0.090.13-0.100.000.260.100.3254TCC0.00-0.100.00-0.29-0.01-0.130.14-0.170.00-0.400.20-0.3255TCG0.000.240.000.160.290.300.510.140.000.380.250.5556TCT0.00-0.080.00-0.11-0.30-0.27-0.12-0.400.000.120.10-0.1757TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC0.00-0.080.00-0.050.090.19-0.280.250.00-0.31-0.32-0.4159TGG0.00-0.100.000.060.150.32-0.03-0.010.000.060.00-0.0760TGT0.000.270.00-0.03-0.30-0.17-0.300.070.000.860.560.4561TTA0.000.030.00-0.15-0.20-0.20-0.21-0.050.00-0.34-0.29-0.1062TTC0.000.130.000.210.01-0,050.04-0.070.00-0.09-0.09-0.1963TTG0.000.220.000.170.170.120.27-0.010.000.070.220.4964TTT0.00-0.060.00-0.09-0.11-0.10-0.05-0.170.000.140.05-0.01TAATACTAGTATTCATCCTCGTCTTGATGCTGGTGT495051525354555657585960表C.3继续61TTA62TTC63TTG64TTT1AAA0.34-0.270.310.262AAC0.22-0.270.24-0.063AAG0.63-0.100.72-0.124AAT-0.09-0.13-0.070.255ACA0.060.00-0.220.166ACC0.12-0.15-0.08-0.307ACG0.150.070.58-0.108ACT-0.110.13-0.030.199AGA0.480.440.290.4810AGC0.260.210.10-0.0111AGG0.460.290.680.2912AGT-0.270.38-0.260.1013ATA-0.45-0.39-0.48-0.1014ATC0.350.210.490.2815ATG0.74-0.020.760.0116ATT-0.47-0.32-0.450.0417CAA0.11-0.120.14-0.1718CAC0.090.05-0.110.1719CAG0.660.270.680.1620CAT-0.310.02-0.43-0.0921CCA-0.22-0.22-0.18-0.2822CCC0.380.420.100.2223CCG0.400.230.67-0.0411124CCT-0.270.150.010.0525CGA-0.24-0.34-0.22-0.2426CGC0.330.040.180.0527CGG0.410.410.720.0028CGT-0.36-0.40-0.37-0.4529CTA-0.49-0.30-0.57-0.2830CTC0.350.320.040.2331CTG0.190.100.49-0.1932CTT-0.050.25-0.330.4133GAA0.20-0.040.020.0334GAC0.120.200.070.1835GAG0.520.220.58-0.1136GAT-0.33-0.10-0.40-0.1037GCA-0.050.18-0.20-0.0238GCC0.350.260.140.0539GCG0.210.340.56-0.0940GCT-0.36-0.08-0.32-0.1941GGA-0.010.10-0.140.0342GGC0.270.080.110.1343GGG0.190.430.40-0.0244GGT-0.40-0.13-0.46-0.3445GTA-0.24-0.36-0.36-0.2546GTC0.530.320.420.2947GTG0.140.170.45-0.1548GTT-0.250.00-0.240.0949TAA0.000.000.000.0050TAC-0.070.08-0.050.1351TAG0.000.000.000.0052TAT-0.43-0.21-0.500.0353TCA-0.130.02-0.200.0154TCC0.09-0.110.02-0.2455TCG-0.170.100.56-0.2856TCT-0.330.09-0.090.0957TGA0.000.000.000.0058TGC0.230.130.090.1059TGG0.700.010.83-0.0160TGT-0.310.17-0.33-0.2261TTA-0.17-0.09-0.20-0.1562TTC-0.22-0.260.15-0.1463TTG0.24-0.050.58-0.2464TTT-0.44-0.25-0.360.30TTATTCTTGTTT61626364112<table>tableseeoriginaldocumentpage113</column></row><table>49TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC-0.21-0.66-0.040.14-0.430.420.23-0.46-0.31-0.510.50-0.6851TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT0.030.370.280.460.360.54-0.300.310.761.001.000.7653TCA-0.24-0.490.290.220.12-0.23-0.11-0.66-0.36-0.320.24-0.5354TCC0.59-0.030.31-0.340.410,360.49-0.42-0.54-0.341.00-0.2855TCG0.200.56-0.300.210.360.730.840.46-0.071.00-0.15-0.0556TCT-0.15-0.23-0.070.16-0.210.460.52-0.250.791.001.000.3157TCA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC-0.17-0.180.300.24-0.051.00-0.280.33-0.44-0.381.000.0059TGG0.18-0.22-0.300.220.130.55-0.33-0.04-0.35-0.470.04-0.3160TGT-0.09-0.160.430.07-0.21-0.180.28-0.181.001.001.001.0061TTA-0.33-0.52-0.32-0.14-0.190.40-0.32-0.48-0.31-0.110.460.0462TTC-0.38-0.39-0.060.49-0.380.33G.07-0.19-0.55-0.62-0.46-0.5463TTG-0.23-0.12-0.180.000.270.230.09-0.440.330.750.310.4664TTT0.16-0.120.340.110.13-0.270.310.160.590.880.750.73AAAAACAAGAATACAACCACGACTAGAAGCAGGAGT123456789101112表C.4继续131415161718192021222324ATAATCATGATTCAACACCAGCATCCACCCCCGCCT1AAA0.42-0.44-0.150.14-0.04-0.120.370.570.040.460.210.202AAC0.66-0.46-0.330.09-0.30-0,150.04-0.14-0.240.53-0.150.133AAG0.920.460.390.21-0.19-0.51-0.33-0.39-0.040.84-0.36-0.494AAT0.54-0.110.380.24-0.01-0.180.330.30-0.08-0.08-0.080.36ACA0.70-0.18-0.120.23-0.14-0.280.060.24-0.340.33-0.110.046ACC0.080.080.38-0.320.47-0.330.450.630.510.660.930.687ACG-0.42-0.27-0.26-0.390.310.220.160.40-0.510.590.00-0.348ACT0.540.950.580.74-0.47-0.62-0.34-0.36-0.610.660.580.049AGA0.41-0.100.390.240.61-0.290.200.150.481.00-0.160.4310AGC0.37-0.190.01-0.22-0.16-0.36-0.180.360.140.700.070.2411AGG0.350.570.610.600.700.570.380.160.531.000.180.3812AGT0.170.540.230.120.730.430.440.300.370.340.32-0.0313ATA0.810.800.930.710.150.330.050.130.160.64-0.080.1114ATC0.15-0.51-0.36-0.200.050.260.340.12-0.200.640.070.4415ATG0.55-0.130.00-0.030.19-0.21-0.170.13-0.110.440.11-0.1816ATT0.66-0.030.180.21-0.22-0.48-0.040.19-0.250.47-0.19-0.0617CAA0.21-0.49-0.32-0.140.590.900.820.610.560.810.890.8318CAC1.00-0.11-0.14-0.330.04-0.570.09-0.05-0.211.000.04-0.0419CAG0.700.500.490.31-0.26-0.42-0.53-0.42-0.500.04-0.46-0.4920CAT0.56-0.030.080.050.130.32-0.160.18-0.320.51-0.070.1821CCA0.75-0.50-0.230.08-0.02-0.370.79O.卯0.121.001.000.1322CCC1.001.000.631.000.35-0.211.000.170.071.001.000.3923CCG0.57-0.38-0.28-0.43-0.390.050.50-0.12-0.42-0.17-0.32-0.4724CCT1.000.560.600.64-0.330.21-0.15-0.260.421.000.50-0.1325CGA-0.20-0.20-0.24-0.070.411.000.700.800.541.000.801.0026CGC0.850.12-0.24-0.20-0.23-0.26-0.43-0.12-0.330.55-0.10-0.1127CGG0.040.10-0.09-0.270.370.490.260.000.161.000.640.2628CGT0.67-0.36-0.20-0.24-0.62-0.360.03-0.29-0.55-0.03-0.53-0.6329CTA1.000.18-0.01-0.010.421.001.000.560.671.000.861.0030CTC0.46-0.03-0.13-0.200.550.610.600.320.280.690.250.2431CTG0.24-0.40-0.27-0.22-0.110.540.190.19-0.300.86-0.240.0232CTT0.750.370.610.65-0.47-0.58-0.22-0.46-0.360.42-0.27-0.6011433GAA0.35-0.34-0.140.100.270.630.130.370.350.910.380.4634GAC0.53-0.080.080.07-0.29-0.340.38-0.16-0.150.460.230.0035GAG0.560.220.360.10-0.19-0.58-0.38-0.43-0.28-0.09-0.53-0.5636GAT0.30-0.07-0.04-0.080.130.23-0.100.14-0.050.49-0.15-0.0637GCA0.59-0.19-0.220.34-0.150.15-0.250.35-0.170.82-0.07-0.0738GCC0.45-0.080.23-0.190.230.110.430.070.530.750.500.6939GCG0.13-0.35-0.25-0.390.02-0.370.51-0.03-0.270.42-0.28-0.2540GCT0.450.420.490.46-0.42-0.330.220.00-0.150.790.15-0.4741GGA0.82-0.030.240.15-0.20-0.54-0.050.170.080.41-0.160.0942GGC0.35-0.190.03-0.02-0.04-0.060.220.130.240.640.010.3943GGG0.540.390.25-0.130.10-0.330.010.05-0.060.25-0.41-0.4744GGT0.19-0.43-0.41-0.070.110.08-0.020.610.060.670.12-0.1245GTA0.620.150.350.740.14-0.19-0.140.29-O.Ol0.520.08-0.0446GTC0.57-0.37-0.34-0.370.520.610.650.620.830.460.740.4447GTG0.57-0.11-0.060.140.350.220.020.16-0.020.46-0.46-0.3248GTT0.35-0.360.280.25-0.44-0.51-0.37-0.40-0.240.180.03-0.3949TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC-0.12-0.43-0.260.32-0.12-0.060.240.04-0.250.370.290.0251TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT0.30-0.050.190.10-0.040.05-0.01-0.03-0.260.49-0.170.2153TCA0.760.06-0.210.25-0.21-0.010.130.32-0.200.120.16-0.2754TCC0.56-0.26-0.20-0.350.78-0.380.430.550.240.460.560.8355TCG1.00-0.190.37-0.380.02-0.310.280.280.040.32-0.40-0.4856TCT0.17-0.140.140.31-0.53-0.460.29-0.32-0.611.000.11-0.4057TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC0.49-0.160.090.63-0.22-0.50-0.37-0.19-0.230.320.16-0.0659TGG-0.230.050.000.040.46-0.29-0.320.200.341.000.00-0.3560TGT-0.18-0.59-0.100.400.690.390.360.700.610.17-0.01-0.3561TTA0.90-0.38-0.200.070.370.550.120.220.500.640.780.3462TTC0.59-0.23-0.160.02-0.51-0.58-0.37-0.41-0.550.27-0.36-0.6563TTG0.60-0.280.06-0.220.00-0.13-0.150.360.250.10-0.200.2364TTT0.230.020.09-0.050.310.490.470.410.270.660.370.68ATAATCATGATTCAACACCAGCATCCACCCCCGCCT131415161718192021222324表C.4继续252627282930313233343536CGACGCCGGCGTCTACTCCTGCTTGAAGACGAGGAT1AAA-0.20-0.38-0.04-0.20-0.170.490.500.26-0.13-0.26-0.060.252AAC-0.22-0.550.04-0.70-0.610.17-0.31-0.460.13-0.100.390.183AAG0.30-0.030.07-0.50-0.39-0.31-0.66-0.680.25-0.120.38-0.054AAT0.330.530.73-0.200.330.290.250.03-0.19-0.27-0.020.12ACA1.000.040.42-0.020.360.370.17-0.08-0.200.050.050.206ACC0.190.430.750.130.340.560.45-0.140.570.370.330.21"7ACG0.520.440.560.23-0.36-0.17-0.23-0.490.270.550.680.258ACT0.19-0.471.00-0.830.020.570.73-0.02-0.42-0.67-0.45-0.439AGA-0.34-0.090.600.190.780.700.350.72-0.07-0.270.030.4410AGC0.49-0.60-0.08-0.330.090.30-0.04-0.240.340.310.400.5011AGG1.000.73-0.310.690.39-0.08-0.440.240.530.320.34-0.2612AGT0.450.490.320.710.010.780.07-0.12-0.51-0.54-0.51-0.4413ATA-0.030.361.000.630.61-0.040.590.670.430.560.460.1514ATC0.07-0.630.36-0.620.070.360.48-0.41-0.10-0.010.310.1715ATG0.86-0.130.41-0.320.670.18-0.40-0.430.02-0.12-0.040.0816ATT0.150.010.74-0.390.22-0.140.52-0.02-0.14-0.20-0.04-0.0811517CAA0.340.54O.卯0.380.590.940.910.860.040.21-0.300.1818CAC0.07-0.54-0.56-0.710.180.820.15-0.38-0.110.230.260.0819CAG0.32-0.52-0.58-0.54-0.46-0.49-0.44-0.680.050.180.25-0.2920CAT0.770.360.440.150.400.650.500.460.14-0.21-0.230.0521CCA0.45-0.020.66-0.321.000.640.580.660.080.05-0.260.1322CCC1.00-0.771.000.381.000.230.820.640.460.530.320.8223CCG-0.300.340.27-0.51-0.43-0.39-0.20-0.620.250.330.510.2824CCT1.000.330.33-0.350.190.290.610.05-0.40-0.39-0.33-0.4525CGA1.000.731.000.700.410.740.390.630.540.180.790.4426CGC0.20-0.66-0.12-0.76-0.27-0.32-0.38-0.080.43-0.130.100.2227CGG1.000.510.360.810.270.36-0.33-0.220.500.300.340.3728CGT1.000.290.44-0.64-0.55-0.40-0.05-0.60-0.52-0.62-0.63-0.1929CTA1.001.001.001.00-0.260.100.860.860.030.33-0.160.0230CTC0.70-0.110.26-0.230.700.740.570.500.530.110.67-0.1231CTG0.72-0.110.48-0.270.44-0.27-0.22-0.430.490.450.220.4832CTT-0.22-0.720.13-0.60-0,11-0.300.01-0.50-0.53-0.40-0.43-0.5433GAA-0.02-0.030.22-0.030.240.330.330.50-0.11-0.27-0.200.0834GAC-0.42-0.74-0.37-0.670.230.260.310.020.050.290.240.1735GAG0.22-0.53-0.21-0.61-0.48-0.46-0.50-0.700.300.220.430.1536GAT0.590.430.650.570.200.100.23-0.10-0.11-0.310.060.0737GCA0.87-0.380.61-0.190.260.190.36-0.060.12-0.070.200.2038GCC0.830.110.47-0.550.670.630.450.590.390.630.430.3139GCG0.32-0.170.67-0.40-0.04-0.25-0.37-0.590.140.120.190.2640GCT0.410.320.82-0.54-0.350.380.54-0.16-0.43-0.56-0.31-0.2741GGA0.26-0.510.42-0.480.610.430.41-0.04-0.11-0.270.15-0.0342GGC0.33-0.300.05-0.57-0.160.420.34-0.120.340.120.200.3543GGG0.390.150.260.240.500.23-0.16-0.450.400.330.660.3644GGT0.820.580.89-0.25-0.35-0.320.63-0.65-0.49-0.59-0.44-0.0545GTA0.58-0.320.36-0.19-0.430.140.23-0.17-0.11-0.15-0.340.0646GTC0.68-0.460.81-0.460.560.350.660.540.560.510.600.3147GTG0.370.040.61-0.090.41-0.43-0.51-0.470.470.280.380.3548GTT0.28-0.400.47-0.540.06-0.200.58-0.27-0.47-0.43-0.30-0.3549TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC-0.17-0.63-0,25-0.70-0.330.200.19-0.230.160.420.29-0.2351TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT0.350.220.680.100.200.070.500.30-0.15-0.170.040.1453TCA1.000.200.85-0.040.34-0.140.180.000.120.14-0.020.2154TCC1.00-0.600.18-0.40-0.170.650.580.090.690.560.340.2655TCG0.430.740.65-0.17-0.35-0.11-0.37-0.500.600.890.490.1956TCT0.72-0.151.00-0.39-0.430.450.43-0.29-0.40-0.49-0.31-0.3457TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC-0.15-0.63-0.30-0.600.210.30-0.32-0.130.610.30-0.190.5259TGG0.69-0.010.240.170.62-0.41-0.30-0.160.05-0.14-0.090.0960TGT1.001.001.001.000.040.150.60-0.28-0.20-0.16-0.43-0.4261TTA0.650.670.680.340.650.470.640.500.070.350.200.4062TTC-0.22-0.60-0.29-0.77-0.300.11-0.27-0.440.24-0.150.640.3663TTG1.00-0.040.04-0.24-0.24-0.31-0.33-0.450.420.300.370.3464TTT0.640.660.850.610.460.400.580.41-0.27-0.270.230.09CGACGCCGGCGTCTACTCCTGCTTGAAGACGAGGAT252627282930313233343536表C.4继续373839404142434445464748GCAGCCGCGGCTGGAGGCGGGGGTGTAGTCGTGGTT116<table>tableseeoriginaldocumentpage117</column></row><table>TCT-0.050.03-0.18-0.650.400.450.54-0.45-0.56-0.480.14-0.5657TGA0.000.000.000.000.000.000.000.000.000.000.000.00foJO/八oou丄or\,f-u.JJw乂,-u.jw-ar\,-0.15-0.280.470.470.5259TGG-0.180.13-0.100.260.05-0.340.500.41-0.180.54-0.270.0060TGT0.04-0.23-0.16-0.091.000.740.640.52-0.29-0.48-0.330.2261TTA0.010.550.190.510.300.260.29-0.07-0.080.180.610.1062TTC-0.050.010.160.04-0.38-0.49-0.39-0.65-0.170.040.57-0.3663TTG-0.360.490.29-0.050.620.530.750.710.270.620.450.2564TTT0.110.13-0.22-0.010.440.430.650.350.00-0.120.190.01GCAGCCGCGGCTGGAGGCGGGGGTGTAGTCGTGGTT373839404142434445464748表C.4继续495051525354555657585960TAATACTAGTATTCATCCTCGTCTTGATGCTGGTGTlAAA0.00-0.210.000.10-0.350.10-0.040.110.00-0.23-0.06-0.192AAC0.00-0.460.00-0.15-0.28-0.120.68-0.450.00-0.52-0.39-0.413AAG0.00-0.120.000.170.230.500.620.180.000.660.150.594AAT0.000.380.000.280.410.510.500.570.000.730.500.675ACA0.00-0.040.000.19-0.32-0.180.180.010.00-0.370.050.056ACC0.000.280.000.290.390.630.770.540.00-0.250.401.007ACG0.00-0.330.000.000.340.020.310.390.000.010.281.008ACT0.00-0.580.000.35-0.38-0.320.07-0.540.000.00-0.50-0.389AGA0.000.540.000.690.820.84-0.190.700.000.43-0.10-0.0310AGC0.00-0.390.00-0.17-0.050.490.52-0.030.000.27-0.32-0.3211AGG0.000.370.001.001.001.00-0.110.720.001.000.621.0012AGT0.000.840.000.39-0.181.00-0.53-0.280.001.000.710.3513ATA0.000.800.00-0.050.570.030.260.270.000.390.551.0014ATC0.00-0.050.000.330.280.220.250.020.000.590.40-0.1115ATG0.00-0.210.000.150.03-0.350.320.130.00-0.180.000.2616ATT0.00-0.470.000.17-0.10-0.230.27-0.440.00-0.34-0.30-0.1617CAA0.00-0.360.00-0.24-0.090.00-0.46-0.410.00-0.26-0.29-0.3918CAC0.00-0.010.00-0.200.070.041.000.100.000.45-0.12-0.0119CAG0.000.290.000.480.490.530.29-0.260.000.540.420.4420CAT0.000.000.000.12-0.270.170.70-0.100.00-0.180.070.0121CCA0.00-0.560.00-0.04-0.140.530.70-0.480.00-0.52-0.18-0.4022CCC0.000.660.001.000.180.50-0.210.690.001.000.571.0023CCG0.000.090.000.48-0.010.590.35-0.480.001.000.290.6424CCT0.00-0.530.00-0.120.060.530.41-0.430.00-0.45-0.32-0.5525CGA0.000.390.000.010.770.151.000.730.00-0.330.27-0.6326CGC0.00-0.420.00-0.27-0.301.001.00-0.460.00-0.68-0.61-0.5327CGG0.000.440.000.500.570.480.670.840.001.000.780.4428CGT0.00-0.590.00-0.53-0.670.09-0.51-0.800.001.000.801.0029CTA0.00-0.480.00-0.180.260.66-0.41-0.320.00-0.41-0.60-0.5130CTC0.000.140.000.310.021.000.810.440.000.250.181.0031CTG0.000.330.00-0.10-0.180.370.65-0.120.000.300.480.7932CTT0.00-0.590.00-0.16-0.21-0.50-0.19-0.660.00-0.520.230.3633GAA0.00-0.340.000.32-0.160.21-0.140.140.000.400.13-0.2534GAC0.00-0.200.000.180.310.310.270.140.00-0.51-0.380.1735GAG0.000.000.00-0.090.220.470.330.450.00-0.41-0.220.4936GAT0.00-0.280.000.19-0.06-0.190.590.230.000.380.350.1637GCA0.00-0.090.000.140.04-0.270.080.090.00-0.41-0.180.2138GCC0.000.530.000.280.570.710.250.690.00-0.050.280.3639GCG0.00-0.010.000.340.280.540.610.180.000.310.280.5011840GCT0.00-0.500.00-0.29-0.26-0.53-0.36-0.640.000.41-0.23-0.4941GGA0.00-0.100.000.120.040.720.380.210.000.480.340.6242GGC0.00-0.160.00-0.100.470.170.590.080.00-0.42-0.47-0.5943GGG0.000.260.000.350.630.870.830.490.000.400.580.2744GGT0.000.340.00-0.24-0.650.17-0.17-0.730.001.000.530.3645GTA0.00-0.640.00-0.17-0.38-0.18-0.07-0.550.00-0.32-0.11-0.3746GTC0.000.590.000.530.610.790.280.600.00-0.190.230.1047GTG0.00-0.080.000.410.180.500.460.290.000.510.150.7048GTT0.00-0.330.00-0.03-0.12-0.230.26-0.530.000.21-0.21-0.3349TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC0.00-0.130.000.210.41-0.070.09-0.440.00-0.17-0.09-0.4951TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT0.000.040.00-0.08-0.06-0.13-0.180.160.000.570.05-0.04530.00.0.310.000.5了0.320.2g0.38-0.190.00-0.29-0.030.4354TCC0.00-0.370.000.290.260.33-0.120.020.000.141.000.4855TCG0.000.500.000.640.690.720.640.290.000.450.401.0056TCT0.00-0.420.00-0.19-0.38-0.350.47-0.540.00-0.38-0.38-0.4957TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC0.000.130.00-0.200.11-0.18-0.03-0.010.00-0.551.00-0.2659TGG0.000.200.00-0.090.890.810.26-0.310.00-0.340.000.6360TGT0.000.650.00-0.12-0.380.02-0.200.080.001.00-0.551.0061TTA0.000.340.000.33-0.29-0.270.23-0.310.00-0.540.051.0062TTC0.00-0.570.00-0.13-0.460.07-0.24-0.570.00-0.18-0.52-0.3363TTG0.000.160.000.600.340.780.580.110.00-0.27-0.450.3364TTT0.000.250.000.270.250.350.590.110.000.230.510.07TAATACTAGTATTCATCCTCGTCTTGATGCTGGTGT495051525354555657585960表C.4继续61TTA62TTC63TTG64TTT1AAA0.32-0.440.420.382AAC0.12-0.580.11-0.103AAG0.70-0.020.66-0.034AAT0.32-0.140.520.665ACA0.01-0.33-0.320.046ACC0.07-0.040.360.347ACG0.30-0.340.43-0.068ACT-0.340.300.040.569AGA0.430.310.270.6810AGC0.42-0.290.33-0.1311AGG0.530.531.000.6712AGT-0.120.360.190.3913ATA-0.10-0.41-0.350.4214ATC0.05-0.360.270.3615ATG0.80-0.410.860.3316ATT-0.41-0.49-0.300.3517CAA0.16-0.330.10-0.0518CAC-0.40-0.43-0.250.1819CAG0.670.060.630.2720CAT-0.37-0.18-0.430.1921CCA-0.04-0.60-0.42-0.3022CCC0.781.000.440.6123CCG0.32-0.080.710.0111924CCT-0.090.220.040.3525CGA0.55-0.371.00-0.1526CGC-0.05-0.48-0.30-0.0927CGG0.630.860.880.1528CGT-0.48-0.71-0.51-0.1229CTA-0.47-0.69-0.49-0.1530CTC0.54-0.18-0.140.5331CTG0.39-0.310.50-0.0432CTT-0.220.02-0.190.6633GAA0.37-0.330.260.1634GAC0.08-0.380.000.3435GAG0.460.140.670.0936GAT-0.09-0.26-0.410.1437GCA-0.09-0.300.040.1338GCC0.530.15-0.070.2039GCG0.10-0.050.70-0.1640GCT-0.40-0.15-0.120.1641GGA0.17-0.23-0.010.3542GGC0.18-0.460.110.2543GGG0.870.530.590.5244GGT-0.37-0.50-0.52-0.2345GTA-0.37-0.69-0.34-0.1446GTC0.740.240.430.2947GTG0.360.300.620.0148GTT-0.140.05-0.150.3049TAA0.000.000.000.0050TAC0.08-0.280.03-0.1151TAG0.000.000.000.0052TAT-0.22-0.25-0.500.3253TCA-0.430.070.21-0.1254TCC0.390.09-0.300.5155TCG0.350.170.50-0.2056TCT-0.29-0.490.020.3957TGA0.000.000.000.0058TGC-0.17-0.470.490.1159TGG0.51-0.180.630.1160TGT0.510.23-0.360.2861TTA0.34-0.220.490.1662TTC-0.41-0.67-0.23-0.1863TTG0.33-0.490.71-0.1364TTT-0.29-0.05-0.050.58TTATTCTTGTTT61626364<table>tableseeoriginaldocumentpage121</column></row><table><table>tableseeoriginaldocumentpage122</column></row><table>33GAA-0.25-0.040.200.180.070.250.330.160.36-0.030.090.0234GAC0.630.190.290.370.45-0.07-0.080.040.380.35-0.090.4235GAG-0.32-0.22-0.310.06-0.43-0.33-0.29-0.29-0.420.65-0.24-0.1836GAT0.33-0.26-0.15-0.10-0.03-0.19-0.070.190.10-0.23-0.07-0.1137GCA-0.330.24-0.130.18-0.320.220.140.14-0.400.07-0.35-0.2938GCC0.29-0.020.36-0.190.660.170.270.180.540.040.600.3339GCG-0.270.36-0.31-0.13-0.42-0.13-0.17-0.18-0.430.31-0.130.0340GCT0.24-0.270.590.000.57-0.220.090.020.38-0.050.190.2841GGA-0.610.61-0.370.02-0.35-0.010.13-0.29-0.330.13-0.200.1042GGC0.340.270.190.010.820.540.530.530.290.240.010.4443GGG-0.47-0.28-0.43-0.28-0.730.10-0.45-0.50-0.670.72-0.52-0.0244GGT0.43-0.010.30-0.090.40-0.32-0.24-0.190.390.640.100.3345GTA-0.040.52-0.110.41-0.340.07-0.33-0.10-0.15-0.09-0.36-0.1246GTC0.260.020.30-0.130.860.420.450.160.760.180.620.3147GTG-0.05-0.01-0.35-0.20-0.130.36-0.20-0.18-0.190.28-0.21-0.0148GTT0.47-0.220.590.130.59-0.31-0.13-0.070.42-0.270.03-0.3049TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC0.44-0.200.130.170.650.020.170.240.350.100.150.3051TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT0.03-0.19-0.090.17-0.03-0.37-0.270.250.18-0.35-0.19-0.0353TCA-0.540.28-0.330.17-0.390.130.260.19-0.45-0.38-0.33-0.2854TCC0.08-0.280.22-0.100.730.220.390.410.35-0.160.42-0.0655TCG-0.380.29-0.43-0.17-0.490.110.01-0.01-0.62-0.34-0.31-0.2356TCT-0.08-0.210.41-0.120.47-0.320.330.120.14-0.140.07-0.1457TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC0.130.270.190.100.48-0.19-0.28-0.13-0.020.43-0.350.0959TGG-0.490.010.000.12-0.090.080.05-0.060.100.26-0.06-0.0960TGT-0.22-0.10-0.19-0.240.710.32-0.160.170.520.620.090.3861TTA-0.27-0.06-0.25-0.20-0.310.11-0.240.030.09-0.57-0.21-0.3662TTC0.680.300.570.530.84-0.35-0.50-0.080.26-0.27-0.45-0.2663TTG-0.39-0.05-0.33-0.40-0.45-0.17-0.54-0.27-0.37-0.45-0.27-0.4064TTT0.18-0.29-0.30-0.250.520.320.150.130.540.020.450.12ATAATCATGATTCAACACCAGCATCCAcccCCGCCT131415161718192021222324表C.5继续252627282930313233343536CGACGCCGGCGTCTACTCCTGCTTGAAGACGAGGAT1AAA0.270.060.160.210.20-0.060.050.200.14-0.10-0.46-0.142AAC0.10-0.170.02-0.160.66-0.15-0.090.000.03-0.140.180.023AAG-0.56-0.25-0.50-0.27-0.51-0.23-0.38-0.060.610.560.410.424AAT0.100.120.120.340.04-0.400.07-0.30-0.13-0.130.020.195ACA-0.180.210.020.38-0.47-0.150.25-0.23-0.190.26-0.100.206ACC-0.10-0.120.190.080.530.380.540.460.01-0.240.16-0.157ACG-0.46-0.03-0.30-0.21-0.72-0.42-0.53-0.470.160.540.490.458ACT0.49-0.060.810.170.350.140.420.31-0.40-0.490.070.019AGA-0.540.30-0.310.31-0.130.300.43-0.280.050.410.070.2810AGC0.14-0.09-0.09-0.120.660.160.060.32-0.25-0.16-0.12-0.2611AGG-0.630.20-0.300.21-0.70-0.28-0.18-0.620.010.44-0.19-0.1212AGT0.280.24-0.150.320.620.300.190.18-0.390.01-0.20-0.0613ATA-0.230.310.100.20-0.31-0.210.41-0.330.060.470.110.3314ATC0.69-0.210.550.060.84-0.590.31-0.050.08-0.420.300.1515ATG0.21-0.080.31-0.090.05-0.17-0.020.19-0.030.020.07-0.0116ATT0.50-0.190.54-0.080.62-0.440.07-0.30-0.120.25-0.11-0.0117CAA-0.380.10-0.31-0.09-0.16-0.30-0.21-0.390.440.590.110.5618CAC0.100.000.02-0.040.800.290.190.140.13-0.280.14-0.1619CAG-0.430.29-0.20-0.09-0.310.160.47-0.32-0.15-0.17-0.15-0.2620CAT0.24-0.15-0.190.13-0.06-0.26-0.10-0.42-0.12-0.08-0.020.3721CCA-0.52-0.20-0.380.20-0.360.210.180.18-0.14-0.06-0.100.2822CCC-0.220.25-0.160.310.570.610.740.420.50-0.050.54-0.3123CCG-0.11-0.170.55-0.20-0.52-0.06-0.300.10-0.130.160.210.1124CCT0.500.390.560.350.360.540.610.30-0.15-0.390.10-0.1825CGA-0.34-0.05-0.38-0.04-0.53-0.27-0.39-0.300.260.610.270.4526CGC0.370.020.270.010.810.510.120.450.13-0.210.01-0.2227CGG-0.560.21-0.64-0.06-0.560.34-0.25-0.25-0.260.640.080.2628CGT0.30-0.02-0.100.010.560.25-0.200.01-0.13-0.100.040.1729CTA-0.54-0.52-0.54-0.37-0.46-0.41-0.16-0.410.790.920.770.8430CTC0.680.770.330.790.930.660.810.620.00-0.680.47-0.5031CTG0.20-0.060.28-0.170.040.150.020.10-0.300.24-0.20-0.1232CTT0.690.520.580.560.500.220.690.25-0.17-0.01-0.53-0.4933GAA0.300.240.110.190.290.30-0.010.290.07-0.04-0.49-0.2734GAC-0.01-0.13-0.09-0.180.660.050.140.170.08-0.040.15-0.0835GAG-0.45-0.19-0.51-0.34-0.490.67-0.31-0.210.590.610.470.5336GAT0.54-0.050.230.250.28-0.490.15-0.15-0.05-0.14-0.070.1637GCA-0.330.27-0.080.27-0.330.310.010.17-0.260.26-0.220.1938GCC0.06-0.020.020.040.730.520.520.520.17-0.240.14-0.3039GCG-0.440.15-0.37-0.20-0.580.08-0.42-0.110.210.570.400.3340GCT0.550.180.660.210.450.510.470.45-0.39-0.470.05-0.2741GGA-0.63-0.34-0.54-0.35-0.290.420.04-0.260.190.510.090.4542GGC0.610.560.610.550.730.540.250.42-0.07-0.20-0.13-0.3543GGG-0.71-0.36-0.59-0.46-0.690.78-0.51-0.520.360.760.310.5244GGT0.450.010.25-0.170.640.84-0.040.31-0.130.100.050.1045GTA-0.08-0.20-0.04-0.34-0.040.00-0.32-0.25-0.030.400.000.3046GTC0.670.450.430.420.890.550.700.580.130.23-0.01-0.4547GTG0.06-0.12-0.27-0.25-0.20-0.03-0.40-0.040.250.550.230.2348GTT0.420.130.580.120.560.170.400.28-0.25-0.11-0.37-0.3349TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC0.31-0.02-0.060.100.760.070.220.310.06-0.330.180.0251TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT0.10-0.260.010.150.15-0.53-0.13-0.33-0.10-0.310.000.4753TCA-0.15-0.08-0.290.30-0.57-0.25-0.11-0.280.010.410.280.4054TCC-0.15-0.230.040.010.660.380.510.410.48-0.120.380.0055TCG-0.28-0.19-0.42-0.07-0.69-0.33-0.53-0.440.420.670.680.6256TCT0.680.100.750.240.260.220.20-0.09-0.17-0.500.420.0057TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC0.02-0.13-0.24-0.140.73-0.07-0.240.040.06-0.210.240.0059TGG-0.500.22-0.400.00-0.470.57-0.110.22-0.080.260.19-0.1360TGT-0.100.42-0.540.350.490.38-0.070.05-0.180.000.020.2061TTA0.12-0.29-0.12-0.200.00-0.28-0.04-0.210.660.740.620.5662TTC0.64-0.420.23-0.330.83-0.66-0.12-0.290.26-0.070.300.1563TTG-0.28-0.21-0.28-0.28-0.54-0.44-0.49-0.430.840.900.830.6964TTT0.640.220.540.340.63-0.290.51-0.14-0.19-0.06-0.12-0.04CGACGCCGGCGTCTACTCCTGCTTGAAGACGAGGAT252627282930313233343536表C.5继续373839404142434445464748GCAGCCGCGGCTGGAGGCGGGGGTGTAGTCGTGGTT1AAA0.00-0.15-0.210.310.11-0.16-0.28-0.010.29-0.08-0.220.182AAC0.41-0.360.270.140.19-0.210.36-0.200.12-0.360.07-0.103AAG0.280.520.180.25-0.050.410.170.68-0.040.190.100.114AAT0.20-0.340.13-0.050.35-0.040.560.100.35-0.220.290.135ACA-0.240.280.150.06-0.430.26-0.130.10-0.150.370.250.336ACC0.02-0.190.30-0.11-0.40-0.04-0.32-0.20-0.03-0.010.19-0.227ACG-0.320.70-0.190.220.200.480.240.43-0.450.17-0.270.068ACT-0.06-0.25-0.08-0.260.03-0.300.670.110.04-0.030.470.049AGA-0.300.45-0.020.09-0.540.27-0.140.33-0.21-0.020.140.0710AGC0.17-0.390.340.55-0.15-0.140.10-0.16-0.12-0.16-0.010.0111AGG-0.490.510.04-0.23-0.260.510.410.600.170.55-0.10-0.0612AGT0.12-0.140.030.010.040.200.290.090.450.400.370.3713ATA0.210.000.400.330.260.390.050.310.400.030.430.1914ATC0.45-0.430.45-0.190.64-0.130.44-0.130.53-0.380.34-0.1315ATG-0.050.52-0.290.12-0.14-0.10-0.360.430.290.08-0.210.1416ATT0.46-0.410.30-0.260.68-0.220.59-0.240.59-0.390.14-0.2117CAA0.610.430.540.54-0.040.34-0.270.270.600.610.580.5518CAC0.21-0.340.14-0.210.16-0.350.05-0.41-0.24-0.330.87-0.3619CAG-0.320.21-0.36-0.19-0.31-0.08-0.450.36-0.31-0.01-0.27-0.2820CAT0.35-0.110.10-0.040.320.290.520.280.25-0.150.04-0.1021CCA0.010.240.330.19-0.41-0.03-0.090.15-0.070.550.260.4422CCC-0.37-0.460.06-0.40-0.47-0.410.34-0.350.05-0.200.30-0.2923CCG-0.250.760.090.34-0.040.330.010.35-0.420.31-0.14-0.0624CCT-0.29-0.45-0.38-0.46-0.22-0.440.05-0.08-0.06-0.080.40-0.2225CGA0.090.360.110.11-0.110.570.170.54-0.190.24-0.070.4026CGC-0.04-0.280.22-0.07-0.06-0.210.01-0.14-0.27-0.12-0.15-0.1327CGG-0.520.23-0.49-0.21-0.450.520.160.55-0.66-0.19-0.55-0.2128CGT0.250.240.140.00-0.03-0.010.17-0.070.360.430.480.3129CTA0.760.730.750.780.730.710.590.740.830.720.810.7230CTC0.43-0.340.420.11-0.20-0.59-0.34-0.540.710.390.750.4731CTG-0.430.48-0.46-0.120.010.23-0.270.29-0.29-0.12-0.49-0.1532CTT0.29-0.330.33-0.210.17-0.39-0.23-0.080.640.310.730.3733GAA0.02-0.13-0.250.09-0.07-0.13-0.42-0.160.21-0.07-0.330.0234GAC0.29-0.160.24-0.020.15-0.230.02-0.280.16-0.060.18-0.1635GAG0.320.510.120.420.450.590.210.670.440.440.250.4036GAT0.26-0.250.01-0.130.34-0.040.590.100.29-0.19-0.01-0.0237GCA-0.130.410.180.10-0.25-0.03-0.21-0.12-0.160.420.250.3838GCC-0.17-0.150.28-0.18-0.680.87-0.56-0.090.000.050.31-0.1239GCG-0.330.70-0.190.33-0.060.260.050.19-0.500.28-0.320.0140GCT-0.20-0.31-0.13-0.300.01-0.150.21-0.02-0.230.080.43-0.1141GGA0.110.380.260.44-0.010.650.540.710.410.630.630.6242GGC-0.460.96-0.28-0.11-0.21-0.38-0.15-0.37-0.42-0.14-0.33-0.2743GGG-0.120.44-0.240.230.300.730.670.84-0.030.600.310.3244GGT-0.050.04-0.020.010.210.170.21-0.070.130.240.200.1245GTA0.240.460.140.410.260.42-0.070.200.280.390.070.4646GTC0.27-0.350.13-0.26-0.04-0.31-0.43-0.440.460.120.300.0447GTG-0.030.66-0.340.200.320.46-0.010.36-0.13-0.06-0.48-0.1648GTT0.21-0.370.23-0.370.44-0.260.06-0.170.450.080.49-0.0449TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC0.24-0.330.25-0.080.34-0.310.24-0.290.31-0.120.34-0.1151TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT0.24-0.330.180.160.370.020.600.150.29-0.350.11-0.1753TCA-0.170.150.190.35-0.280.410.210.30-0.210.200.270.3554TCC-0.16-0.070.38-0.20-0.09-0.39-0.36-0.47-0.20-0.100.30-0.1155TCG-0.270.64-0.140.220.450.740.470.61-0.520.02-0.420.03125<table>tableseeoriginaldocumentpage126</column></row><table>40GCT0.00-0.560.00-0.56-0.36-0.42-0.27-0.480.000.080.94-0.2241GGA0.000.310.00-0.150.130.72-0.490.470.00-0.070.21-0.1242GGC0.000.020.000.040.410.330.380.450.00-0.20-0.44-0.2243GGG0.000.700.000.310.520.840.420.680.000.790.780.4644GGT0.00-0.120.00-0.26-0.34-0.30-0.35-0.420.000.150.52-0.0445GTA0.000.440.000.440.060.17-0.140.180.000.38-0.02-0.0446GTC0.00-0.080.00-0.220.31-0.130.23-0.160.00-0.35-0.61-0.2947GTG0.000.670.000.400.590.790.270.610.000.390.370.4748GTT0.00-0.390.00-0.49-0.28-0.54-0.23-0.580.00-0.260.79-0.2249TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC0.00-0.300.000.320.29-0.270.230.040.00-0.33-0.510.1751TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT0.00-0.300.000.33-0.10-0.280.340.310.000.030.780.3153TCA0.000.590.000.54-0.550.07-0.40-0.170.000.030.000.4054TCC0.000.110.000.460.04-0.150.27-0.100.00-0.53-0.45-0.5055TCG0.000.740.000.58-0.050.490.020.380.000.480.400.6256TCT0.00-0.270.00-0.22-0.51-0.54-0.26-0.530.000.200.960.0357TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC0.00-0.140.000.09-0.030.130.19-0.040.00-0.15-0.33-0.1859TGG0.000.100.00-0.07-0.380.49-0.350.400.000.100.00-0.1160TGT0.000.110.00-0.04-0.50-0.31-0.30-0.260.000.370.610.0961TTA0.000.620.000.31-0.060.04-0.260.130.00-0.08-0.11-0.1362TTC0.00-0.130.000.120.02-0.40-0.18-0.240.00-0.25-0.52-0.0863TTG0.000.380.000.01-0.170.55-0.350.340.000.03-0.21-0.1764TTT0.000.100.00-0.070.08-0.030.350.080.000.100.800.25TAATACTAGTATTCATCCTCGTCTTGATGCTGGTGT495051525354555657585960表C.5继续61TTA62TTC63TTG64TTT1AAA0.15-0.030.400.232AAC0.46-0.300.580.193AAG-0.02-0.22-0.02-0.314AAT-0.23-0.040.060.205ACA-0.500.16-0.40-0.136ACC0.43-0.190.530.017ACG-0.240.41-0.340.278ACT0.24-0.340.32-0.099AGA-0.340.66-0.010.3010AGC0.650.040.670.2711AGG-0.240.61-0.440.3712AGT0.06-0.13-0.27-0.3213ATA-0.57-0.25-0.57-0.4914ATC0.66-0.160.690.3715ATG0.36-0.15-0.210.1316ATT-0.03-0.16-0.350.1217CAA-0.110.390.050.0518CAC0.53-0.160.680.0819CAG-0.07-0.16-0.25-0.0420CAT-0.240.03-0.210.0321CCA-0.320.19-0.260.1222CCC0.320.020.400.0423CCG-0.03-0.09-0.120.0024CCT-0.07-0.070.31-0.0525CGA-0.470.12-0.44-0.0926CGC0.380.100.48-0.0827CGG-0.360.78-0.440.3828CGT0.02-0.05-0.13-0.2429CTA-0.460.18-0.47-0.3330CTC0.770.260.850.1531CTG0.220.090.07-0.0832CTT0.120.150.300.0333GAA0.240.510.390.1634GAC0.39-0.050.590.3335GAG-0.25-0.38-0.35-0.4236GAT-0.39-0.18-0.18-0.0137GCA-0.410.06-0.35-0.1938GCC0.470.160.57-0.0339GCG-0.270.29-0.29-0.0740GCT0.21-0.040.27-0.1741GGA-0.66-0.18-0.45-0.3742GGC0.320.080.410.0343GGG-0.490.57-0.400.3444GGT-0.190.12-0.18-0.2345GTA-0.48-0.19-0.53-0.2746GTC0.62-0.100.71-0.1547GTG0.120.16-0.280.1448GTT0.160.080.250.1149TAA0.000.000.000.0050TAC0.66-0.280.720.2851TAG0.000.000.000.0052TAT-0.10-0.15-0.100.1453TCA-0.530.09-0.33-0.1854TCC0.480.000.580.2355TCG-0.230.12-0.39-0.0956TCT0.14-0.090.010.0257TGA0.000.000.000.0058TGC0.41-0.030.550.1059TGG0.090.220.01-0.1460TGT-0.150.130.07-0.1661TTA-0.420.19-0.430.0062TTC0.52-0.390.50-0.1563TTG-0.31-0.18-0.54-0.1564TTT-0.340.14-0.230.39TTATTCTTGTTT61626364128表C.6:CPW矩阵EycAgn'c/'co/z'K12高表达序列(左侧密码子在第2列中指出,右侧密码子在第2行中指出)。宿主细胞序列数据<table>tableseeoriginaldocumentpage129</column></row><table>49TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC-0.47-0.71-0.290.841.00-0.681.00-0.731.00-0.641.001.0051TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT0.520.500.821.001.000.471.000.801.000.721.001.0053TCA1.000.461.001.001.001.001.00-0.241.001.001.001.0054TCC-0.63-0.73-0.570.730.31-0.781.00-0.691.00-0.771.001.0055TCG0.511.000.681.001.001.001.000.491.001.001.001.0056TCT-0.56-0.760.311.000.29-0.771.00-0.831.001.001.001.0057TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC-0.22-0.281.001.001.00-0.481.000.271.00-0.231.000.2059TGG-0.23-0.371.000.711.00-0.200.32-0.391.000.281.001.0060TGT-0.25-0.571.001.00-0.150.661.00-0.781.001.001.001.0061TTA0.941.001.001.001.001.001.000.641.000.781.001.0062TTC-0.49-0.690.140.650.51-0.611.00-0.771.00-0.301.001.0063TTG0.810.831.001.000.531.001.000.631.000.561.001.0064TTT0.540.470.490.940.820.731.000.001.000.761.001.00AAAAACAAGAATACAACCACGACTAGAAGCAGGAGT123456"789101112表C.6继续131415161718192021222324ATAATCATGATTCAACACCAGCATCCACCCCCGCCT1AAA1.00-0.36-0.040.440.20-0.510.161.00-0.171.000.050.382AAC1.00-0.72-0.410.690.54-0.70-0.560.470.241.00-0.620.823AAG1.00-0.490.150.320.24-0.64-0.500.540.711.00-0.61-0.034AAT1.000.620.870.941.000.710.701.000.811.000.461.005ACA1.000.831.001.001.001.000.761.001.001.000.811.006ACC1.00-0.67-0.470.300.19-0.470.010.610.371.000.050.817ACG1.001.000.730.800.560.440.881.000.491.000.360.698ACT1.00-0.640.350.35-0.44-0.85-0.731.000.581.00-0.82-0.609AGA1.001.001.001.001.001.001.00-0.331.001.001.001.0010AGC1.00-0.370.380.631.00-0.51-0.170.781.001.00-0.291.0011AGG1.001.001.001.001.001.001.001.001.001.001.001.0012AGT1.000.841.000.860.531.001.001.001.001.000.481.0013ATA1.001.001.001.001.001.001.001.001.001.001.001.0014ATC1.00-0.70-0.530.040.45-0.68-0.420.50-0.180.73-0.57-0.3115ATG1.00-0.480.000.640.33-0.43-0.150.56-0.241.00-0.170.3016ATT1.000.230.820.881.00-0.09-0.240.720.861.000.021.0017CAA1.000.320.660.910.82-0.110.810.581.001.00-0.040.0018CAC1.00-0.73-0.380.271.00-0.72-0.520.55-0.331.00-0.641.0019CAG1.00-0.58-0.260.310.52-0.58-0.480.780.781.00-0.490.4820CAT1.000.530.460.761.000.09-0.101.001.001.000.400.3321CCA1.00-0.610.570.840.370.590.340.691.001.00-0.231.0022CCC1.000.701.001.001.001.001.001.001.001.001.001.0023CCG1.00-0.58-0.370.240.33-0.70-0.530.56-0.421.00-0.50-0.3624CCT1.000.530.491.001.001.000.800.631.001.001.001.0025CGA1.000.581.001.001.001.001.001.001.001.001.001.0026CGC1.000.540.100.351.000.300.440.870.781.000.360.2027CGG1.001.001.001.001.001.001.001.001.001.001.001.0028CGT1.00-0.74-0.400.140.55-0.73-0.690.020.291.00-0.66-0.6129CTA1.001.001.001.001.001.001.001.001.001.001.001.0030CTC1.000.140.810.640.630.470.611.001.001.000.821.0031CTG1.00-0.71-0.430.25-0.28-0.75-0.540.67-0.480.84-0.60-0.0232CTT1.000.781.001.001.001.001.001.001.001.000.631.0013033GAA1.00-0.56-0.110.660.49-0.280.120.67-0.221.00-0.06-0.0234GAC1.00-0.66-0.350.380.87-0.66-0.570.340.511.00-0.401.0035GAG1.00-0.300.270.780.48-0.64-0.520.511.001.00-0.480.6236GAT1.00-0.300.330.850.68-0.280.160.940.421.00-0.380.6537GCA1.00-0.56-0.410.470.76-0.47-0.360.690.410.55-0.62-0.0638GCC1.000.500.600.701.001.000.410.761.001.000.920.4539GCG1.000.07-0.010.170.72-0.52-0.040.630.650.73-0.271.0040GCT1.00-0.78-0.070.490.37-0.76-0.700.180.221.00-0.701.0041GGA1.001.000.881.001.001.001.001.001.001.000.231.0042GGC1.000.250.070.580.92-0.390.260.870.430.71-0.130.7743GGG1.000.711.000.841.000.070.881.000.491.000.271.0044GGT1.00-0.75-0.45-0.040.80-0.67-0.71-0.010.321.00-0.640.4645GTA1.00-0.52-0.560.510.31-0.68-0.650.33-0.471.00-0.63-0.5546GTC1.000.440.770.831.001.000.490.760.211.000.530.3747GTG1.00-0.100.060.760.710.820.931.000.541.000.560.6348GTT1.00-0.780.030.610.39-0.80-0.650.200.331.00-0.55-0.5349TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC1.00-0.74-0.550.331.00-0.66-0.55-0.020.231.00-0.481.0051TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT1.000.570.910.781.000.180.011.000.711.00-0.260.3053TCA1.001.000.721.001.001.000.691.001.001.000.681.0054TCC1.00-0.81-0.610.041.00-0.69-0.500.601.001.00-0.621.0055TCG1.000.840.110.741.001.001.001.000.291.000.751.0056TCT1.00-0.78-0.470.721.00-0.86-0.820.581.001.00-0.830.0957TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC1.00-0.24-0.071.001.00-0.58-0.450.550.051.00-0.581.0059TGG1.00-0.280.000.181.00-0.57-0.341.001.001.00-0.471.0060TGT1.00-0.730.101.00-0.540.251.000.44-0.161.000.571.0061TTA1.001.001.000.851.001.001.001.001.001.001.001.0062TTC1.00-0.74-0.300.731.00-0.79-0.710.410.231.00-0.73-0.2863TTG1.000.820.231.001.001.000.840.670.581.000.701.0064TTT1.000.190.320.861.001.000.901.001.001.001.001.00ATAATCATGATTCAACACCAGCATCCAcccCCGCCT131415161718192021222324表C.6继续252627282930313233343536CGACGCCGGCGTCTACTCCTGCTTGAAGACGAGGAT1AAA1.000.431.00-0.361.000.76-0.460.84-0.17-0.53-0.320.262AAC1.00-0.261.00-0.700.42-0.38-0.660.80-0.47-0.62-0.150.01AAG0.33-0.321.00-0.611.000.19-0.410.730.950.821.000.844AAT1.001.001.001.001.001.000.741.000.820.590.790.85ACA1.001.001.000.341.001.000.861.000.410.410.741.006ACC1.000.331.00-0.561.000.62-0.450.61-0.36-0.400.220.317ACG1.000.901.000.791.001.00-0.311.000.891.001.001.008ACT1.00-0.591.00-0.77-0.321.00-0.751.00-0.61-0.830.18-0.279AGA1.001.001.001.001.001.001.001.001.001.001.001.0010AGC1.000.071.00-0.691.001.00-0.430.45-0.20-0.190.750.3311AGG1.001.001.001.001.001.001.001.001.001.001.001.0012AGT1.001.001.001.001.001.000.901.000.790.791.000.8813ATA1.001.001.001.001.001.001.001.001.001.001.001.0014ATC1.00-0.341.00-0.631.00-0.67-0.590.13-0.57-0.72-0.08-0.0715ATG1.000.351.00-0.491.000.27-0.410.85-0.10-0.450.240.4916ATT1.000.551.00-0.121.000.64-0.131.000.530.570.860.80<table>tableseeoriginaldocumentpage132</column></row><table><table>tableseeoriginaldocumentpage133</column></row><table>56TCT-0.78-0.19-0.62-0.831.00-0.521.00-0.84-0.87-0.140.32-0.7457TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC-0.670.470.400.551.000.411.00-0.60-0.440.360.25-0.0759TGG-0.460.800.07-0.281.00-0.261.00-0.26-0.280.670.42-0.4860TGT0.570.67-0.21-0.410.300.071.00-0.25-0.701.000.060.3361TTA0.710.771.000.421.000.910.760.680.411.001.000.8262TTC-0.370.110.19-0.630.18-0.190.71-0.72-0.700.720.43-0.7163TTG1.000.880.910.801.001.001.000.890.701.001.000.8264TTT0.570.540.44-0.551.000.800.890.230.260.790.690.12GCAGCCGCGGCTGGAGGCGGGGGTGTAGTCGTGGTT373839404142434445464748表C.6继续495051525354555657585960TAATACTAGTATTCATCCTCGTCTTGATGCTGGTGT1AAA0.00-0.310.000.211.00-0.350.66-0.650.00-0.060.020.202AAC0.00-0.710.000.401.00-0.740.81-0.710.00-0.62-0.450.183AAG0.000.140.000.351.000.210.24-0.810.00-0.29-0.070.114AAT0.001.000.001.001.000.511.001.000.001.001.001.00ACA0.001.000.001.001.001.001.000.310.001.000.401.006ACC0.00-0.600.000.381.00-0.150.81-0.690.00-0.27-0.53-0.617ACG0.001.000.000.851.001.001.000.670.001.001.001.008ACT0.00-0.770.000.530.36-0.880.00-0.880.00-0.451.001.009AGA0.001.000.001.001.001.001.001.000.001.001.001.0010AGC0.00-0.400.000.491.00-0.541.00-0.470.00-0.280.700.4211AGG0.001.000.001.001.001.001.001.000.001.001.001.0012AGT0.000.690.001.001.000.321.000.300.001.001.001.0013ATA0.001.000.001.001.001.001.001.000.001.001.001.0014ATC0.00-0.690.000.281.00-0.840.84-0.640.00-0.41-0.58-0.5315ATG0.00-0.430.000.561.00-0.600.80-0.420.000.400.00-0.3316ATT0.000.300.000.701.000.030.87-0.530.000.531.000.7017CAA0.000.400.000.891.000.541.001.000.001.000.641.0018CAC0.00-0.770.000.61-0.27-0.401.00-0.830.00-0.60-0.57-0.0519CAG0.00-0.530.000.231.00-0.310.77-0.800.00-0.51-0.250.1420CAT0.001.000.000.711.001.001.001.000.000.371.001.0021CCA0.000.340.000.751.00-0.221.000.340.00-0.740.471.0022CCC0.001.000.001.001.001.001.001.000.001.001.001.0023CCG0.00-0.440.000.370.44-0.521.00-0.780.000.32-0.41-0.4124CCT0.00-0.770.000.700.05-0.681.00-0.580.001.001.001.0025CGA0.001.000.001.001.001.001.001.000.001.001.001.0026CGC0.000.010.000.501.00-o.u1.00-0.140.000.36-0.521.0027CGG0.001.000.001.001.001.001.001.000.001.001.001.0028CGT0.00-0.640.00-0.160.13-0.750.77-0.870.00-0.700.560.1529CTA0.001.000.001.001.001.001.001.000.001.001.001.0030CTC0.00-0.310.001.001.00-0.501.00-0.360.001.001.001.0031CTG0.00-0.670.000.240.74-0.600.70-0.800.00-0.48-0.44-0.4232CTT0.000.020.001.000.37-0.351.00-0.520.00-0.251.001.0033GAA0.00-0.540.000.641.00-0.560.77-0.570.000.400.28-0.3434GAC0.00-0.660.000.370.71-0.800.54-0.490.00-0.62-0.610.4035GAG0.00-0.070.000.651.00-0.531.00-0.600.00-0.25-0.380.4436GAT0.00-0.180.000.830.83-0.411.00-0.150.000.430.940.4637GCA0.00-0.420.000.680.69-0.430.76-0.700.000.50-0.62-0.4738GCC0.000.440.000.081.000.421.00-0.380.000.22-0.020.0239GCG0.000.410.001.001.000.110.72-0.610.000.410.69-O.IO13440GCT0.00-0.830.000.001.00-0.810.37-0.890.00-0.620.651.0041GGA0.000.730.001.000.461.001.001.000.001.001.001.0042GGC0.00-0.360.000.571.00-0.331.000.400.000.32-0.50-0.0743GGG0.001.000.001.001.001.001.000.360.001.001.001.0044GGT0.00-0.680.000.160.15-0.790.87-0.870.00-0.620.44-0.0345GTA0.00-0.400.000.641.00-0.821.00-0.840.00-0.46-0.71-0.6846GTC0.00-0.020.000.871.000.721.000.710.001.000.651.0047GTG0.000.110.000.771.000.35LOO0.330.00-0.020.171.0048GTT0.00-0.710.00-0.161.00-0.811.00-0.850.00-0.610.700.5449TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC0.00-0.780.000.611.00-0.760.67-0.770.00-0.71-0.571.0051TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT0.000.810.001.001.000.221.001.000.001.001.000.3453TCA0.000.240.001.001.001.001.000.140.001.000.321.0054TCC0.00-0.680.000.531.00-0.630.35-0.520.00-0.71-0.80-0.5455TCG0.001.000.001.001.001.001.00-0.260.001.001.001.0056TCT0.00-0.820.000.030.14-0.84-0.63-0.880.00-0.440.43-0.5557TCA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC0.00-0.760.001.001.00-0.580.23-0.760.00-0.54-0.44-0.2659TGG0.00-0.410.000.521.00-0.501.00-0.710.001.000.00-0.5660TGT0.001.000.001.001.00-0.011.00-0.040.001.001.001.0061TTA0.000.610.001.001.001.001.001.000.001.001.001.0062TTC0.00-0.740.000.111.00-0.790.72-0.860.00-0.69-0.57-0.1563TTG0.001.000.001.001.001.001.000.150.001.000.171.0064TTT0.000.820.000.871.000.561.001.000.001.001.001.00TAATACTAGTATTCATCCTCGTCTTGATGCTGGTGT495051525354555657585960表C.6继续61626364TTATTCTTGTTT1AAA0.87-0.430.870.692AAC1.00-0.720.830.573AAG0.78-0.530.780.424AAT0.801.001.000.935ACA1.001.001.001.006ACC0.84-0.591.000.287ACG0.750.430.740.798ACT1.00-0.640.580.319AGA1.001.001.001.0010AGC1.00-0.480.560.8811AGG1.001.001.001.0012AGT1.000.400.591.0013ATA1.001.001.001.0014ATC1.00-0.691.000.2915ATG0.76-0.520.760.8116ATT1.000.050.850.8817CAA1.001.001.000.8818CAC1.00-0.770.560.4919CAG1.00-0.660.750.5820CAT1.000.740.671.0021CCA1.001.001.001.0022CCC1.001.001.001.0023CCG0.80-0.700.410.4024CCT1.000.661.000.5025CGA1.001.001.001.0026CGC1.00-0.481.000.3627CGG1.001.001.001.0028CGT1.00-0.611.000.5129CTA1.001.001.001.0030CTC1.001.001.000.7331CTG0.57-0.720.460.4432CTT1.000.271.001.0033GAA0.95-0.350.950.7634GAC0.86-0.740.580.4035GAG0.88-0.631.000.4636GAT0.840.220.840.8737GCA1.00-0.700.830.2938GCC1.00-0.061.000.8439GCG1.00-0.441.000.7640GCT0.78-0.441.000.7341GGA1.001.001.001.0042GGC1.00-0.600.900.3343GGG1.000.841.001.0044GGT0.76-0.480.880.3645GTA1.00-0.630.450.0046GTC1.00-0.431.000.7647GTG1.00-0.550.770.7248GTT1.00-0.060.840.9049TAA0.000.000.000.0050TAC1.00-0.751.000.4251TAG0.000.000.000.0052TAT1.000.651.000.9153TCA1.001.001.001.0054TCC0.59-0.790.590.3355TCG1.000.421.000.7856TCT0.58-0.730.570.3057TGA0.000.000.000.0058TGC1.000.161.00-0.4759TGG0.51-0.451.000.6160TGT1.00-0.051.001.0061TTA1.001.001.001.0062TTC1.00-0.761.000.1663TTG1.000.700.581.0064TTT1.001.001.000.92TTATTCTTGTTT61626364<table>tableseeoriginaldocumentpage137</column></row><table>49TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC0.12-0.16-0.06-0.170.130.05-0.140.14-0.59-0.55-0.61-0.5751TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT-0.140.140.230.14-0.150.28-0.030.080.500.820.590.7053TCA-0.38-0.39-0.19-0.39-0.28-0.360.06-0.58-0.50-0.30-0.42-0.2654TCC0.290.350.46-0.180.250.110.54-0.33-0.440.350.37-0.0855TCG0.35-0.06-0.140.220.18-0.26-0.31-0.240.090.000.160.4156TCT-0.040.400.330.240.200.490.63-0.010.760.910.750.7457TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC0.21-0.16-0.170.090.27-0.15-0.100.380.10-0.27-0.39-0.2259TGG0.050.18-0.10-0.150.160.21-0.210.15-0.32-0.090.01-0.2760TGT-0.180.460.00-0.22-0.110.46-0.230.000.770.910.740.5961TTA-0.47-0.45-0.45-0.43-0.25-0.32-0.19-0.34-0.52-0.18-0.30-0.3362TTC0.19-0.010.060.040.25-0.200.20-0.25-0.62-0.45-0.63-0.4163TTG-0.25-0.09-0.16-0.28-0.22-0.19-0.13-0.35-0.270.35-0.200.1364TTT-0.180.060.13-0.08-0.15-0.060.06-0.040.580.930.670.66AAAAACAAGAATACAACCACGACTAGAAGCAGGAGT123456789101112表C.7继续131415161718192021222324ATAATCATGATTCAACACCAGCATCCACCCCCGCCTIAAA-0.41-0.20-0.080.26-0.090.320.260.200.180.450.260.342AAC0.240.090.090.150.170.250.320.100.190.400.240.133AAG0.140.180.200.200.00-0.28-0.30-0.40-0.30-0.28-0.43-0.374AAT-0.31-0.08-0.08-0.08-0.34-0.06-0.07-0.18-0.11-0.12-0.22-0.185ACA-0.120.340.060.35-0.150.140.050.170.230.110.080.076ACC-0.09-0.040.43-0.390.150.240.30-0.100.220.380.580.297ACG-0.01-0.13-0.30-0.210.25-0.13-0.120.170.03-0.23-0.27-0.168ACT0.030.740.570.46-0.53-0.290.02-0.41-0.030.050.14-0.429AGA-0.40-0.21-0.37-0.15-0.100.47-0.270.350.380.490.400.2910AGC0.12-0.01-0.12-0.11-0.030.10-0.12-0.160.500.160.110.4411AGG-0.350.370.01-0.110.00-0.24-0.37-0.25-0.13-0.31-0.240.2112AGT-0.210.23-0.14-0.06-0.250.38-0.06-0.26-0.29-0.01-0.03-0.0713ATA-0.240.590.420.61-0.37-0.04-0.13-0.12-0.200.120.03-0.2014ATC0.15-0.11-0.03-0.160.490.490.500.360.380.410.390.3915ATG0.130.070.00-0.100.43-0.15-0.220.140.490.13-0.09-0.0416ATT-0.090.02-0.08-0.01-0.36-0.23-0.25-0.33-0.19-0.35-0.30-0.2317CAA-0.31-0.10-0.210.000.550.690.700.720.530.600.820.5418CAC0.200.150.240.190.420.340.430.150.180.390.280.2419CAG0.220.000.170.100.07-0.37-0.42-0.27-0.18-0.30-0.32-0.3720CAT-0.17-0.05-0.15-0.16-0.20-0.19-0.27-0.11-0.24-0.17-0.17-0.1621CCA0.000.270.040.390.190.040.360.38-0.390.200.54-0.0422CCC0.300.500.530.300.470.400.620.23-0.280.530.690.2723CCG0.17-0.25-0.26-0.32-0.18-0.23-0.120.07-0.25-0.39-0.13-0.0724CCT0.240.650.600.43-0.22-0.010.02-0.120.21-0.060.33-0.1525CGA-0.50-0.20-0.040.310.12-0.380.090.01-0.450.310.410.0226CGC0.310.010.300.470.560.330.390.400.070.220.330.5527CGG-0.33-0.31-0.30-0.27-0.23-0.43-0.39-0.27-0.48-0.48-0.37-0.4028CGT0.120.310.420.38-0.02-0.23-0.090.05-0.40-0.24-0.07-0.1529CTA-0.380.320.380.47-0.29-0.010.66-0.21-0.57-0.090.37-0.4430CTC0.280.200.390.110.600.670.680.380.490.530.560.3131CTG0.27-0.29-0.30-0.17-0.25-0.27-0.14-0.010.28-0.34-0.38-0.1232CTT0.450.550.510.42-0.27-0.36-0.24-0.34-0.03-0.23-0.15-0.40138<table>tableseeoriginaldocumentpage139</column></row><table>17CAA0.240.570.560.56-0.180.750.640.760.160.090.100.1818CAC-0.20-0.32-0.14-0.190.390.200.230.180.24-0.11-0.230.0019CAG0.01-0.34-0.31-0.350.02-0.38-0.34-0.46-0.140.050.05-0.1720CAT-0.140.220.11-0.07-0.190.01-0.080.14-0.070.110.03-0.0121CCA-0.14-0.12-0.07-0.21-0.490.440.690.540.300.260.230.3522CCC-0.260.040.08-0.160.270.480.660.380.05-0.36-0.36-0.1723CCG-0.05-0.220.140.01-0.20-0.43-0.18-0.380.200.270.320.2724CCT-0.270.200.34-0.05-0.340.380.510.10-0.44-0.31-0.38-0.4525CGA-0.61-0.33-0.22-0.24-0.530.670.590.550.510.380.550.2526CGC-0.20-0.110.250.240.410.090.160.420.31-0.35-0.46-0.0327CGG-0.36-0.34-0.22-0.28-0.62-0.54-0.45-0.390.230.460.280.2328CGT-0.55-0.140.08-0.28-0.42-0.29-0.19-0.19-0.12-0.27-0.25-0.1429CTA-0.64-0.16-0.16-0.37-0.700.740.680.320.370.590.490.2630CTC0.500.350.26-0.100.320.560.710.490.28-0.46-0.30-0.1631CTG0.45-0.39-0.10-0.040.18-0.40-0.36-0.280.310.490.350.4232CTT-0.11-0.030.16-0.25-0.08-0.080.20-0.02-0.42-0.17-0.36-0.4533GAA0.290.480.020.370.330.600.270.51-0.13-0.06-0.06-0.0234GAC-0.01-0.07-0.20-0.270.360.380.320.230.30-0.07-0.130.0335GAG-0.11-0.43-0.52-0.36-0.23-0.32-0.54-0.580.220.200.27-0.0336GAT-0.260.150.280.09-0.25-0.07-0.120.04-0.190.040.08-0.0137GCA0.320.470.450.52-0.060.410.460.470.110.090.150.1738GCC0.070.070.260.010.440.560.610.440.11-0.26-0.41-0.2039GCG-0.24-0.34-0.050.04-0.21-0.57-0.37-0.360.220.370.350.3240GCT-0.56-0.14-0.05-0.28-0.320.220.45-0.18-0.39-0.19-0.11-0.3941GGA0.190.440.080.270.090.410.120.04-0.24-0.040.020.0742GGC0.06-0.10-0.12-0.050.620.310.240.340.32-0.15-0.230.0343GGG-0.28-0.20-0.32-0.23-0.04-0.39-0.45-0.530.270.520.420.4144GGT-0.120.390.310.150.020.04-0.07-0.39-0.32-0.29-0.21-0.3445GTA-0.060.350.150.170.170.460.380.550.130.150.130.3246GTC0.450.190.100.160.620.660.580.570.29-0.30-0.35-0.1147GTG0.44-0.36-0.150.130.39-0.52-0.57-0.430.180.520.330.3548GTT-0.370.050.07-0.20-0.13-0.110.18-0.14-0.35-0.17-0.12-0.2749TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC0.20-0.20-0.43-0.440.320.460.240.470.320.050.140.1651TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT0.390.540.240.41-0.230.050.070.29-0.24-0.060.06-0.0953TCA0.450.430.430.42-0.090.330.370.20-0.10-0.010.000.1554TCC0.170.020.07-0.190.500.510.580.320.340.060.140.0455TCG-0.24-0.45-0.130.11-0.28-0.50-0.29-0.260.400.360.360.4156TCT0.120.500.470.22-0.17-0.040.38-0.23-0.38-0.32-0.15-0.3457TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC-0.24-0.27-0.19-0.020.670.090.100.210.47-0.07-0.040.2759TGG0.410.06-0.04-0.020.01-0.23-0.280.020.060.04-0.11-0.0360TGT-0.100.490.530.34-0.04-0.21-0.33-0.14-0.42-0.24-0.15-0.1261TTA0.630.770.440.600.290.660.630.61-0.020.310.170.1262TTC0.09-0.30-0.24-0.440.370.370.360.440.460.11-0.050.3763TTG0.26-0.14-0.13-0.15-0.19-0.05-0.18-0.260.100.430.250.0964TTT0.380.470.420.34-0.27-0.040.240.16-0.28-0.120.14-0.18CGACGCCGGCGTCTACTCCTGCTTGAAGACGAGGAT252627282930313233343536表C.7继续373839404142434445464748GCAGCCGCGGCTGGAGGCGGGGGTGTAGTCGTGGTT140<table>tableseeoriginaldocumentpage141</column></row><table>56TCT-0.140.050.50-0.250.660.830.750.53-0.37-0.230.33-0.4257TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC0.21-0.20-0.270.03-0.28-0.34-0.22-0.330.23-0.07-0.010.3759TGG0.050.29-0.200.07-0.180.04-0.080.360.270.26-0.370.1660TGT0.120.420.310.400.780.870.840.67-0.13-0.10-0.17-0.1761TTA0.370.410.560.29-0.070.260.050.140.280.490.550.4162TTC0.17-0.26-0.14-0.05-0.45-0.55-0.32-0.540.31-0.230.200.1063TTG-0.150.220.09-0.050.160.490.270.270.280.430.250.1364TTT-0.04-0.010.230.140.670.770.770.57-0.04-0.160.160.05GCAGCCGCGGCTGGAGGCGGGGGTGTAGTCGTGGTT373839404142434445464748表C.7继续495051525354555657585960TAATACTAGTATTCATCCTCGTCTTGATGCTGGTGT1AAA0.00-0.100.00-0.03-0.260.150.230.380.000.00-0.07-0.132AAC0.000.100.000.030.380.410.390.270.00-0.21-0.17-0.053AAG0.000.120.000.170.260.440.170.320.000.160.160.024AAT0.000.030.00-0.11-0.11-0.050.210.050.000.330.19-0.065ACA0.00-0.200.00-0.13-0.17-0.18-0.06-0.190.00-0.15-0.38-0.126ACC0.000.250.000.050.370.260.190.080.00-0.150.36-0.297ACG0.000.220.000.130.260.300.020.320.000.360.330.488ACT0.00-0.280.00-0.26-0.50-0.40-0.26-0.580.00-0.13-0.27-0.589AGA0.00-0.130.00-0.19-0.030.11-0.060.490.000.34-0.400.0810AGC0.000.020.00-0.100.350.350.230.400.00-0.03-0.36-0.0911AGG0.000.490.000.390.010.640.100.190.000.48-0.300.1512AGT0.000.250.00-0.12-0.350.10-0.10-0.180.000.220.710.3813ATA0.00-0.290.00-0.06-0.21-0.34-0.17-0.130.00-0.16-0.47-0.4714ATC0.000.390.000.360.460.340.360.560.00-0.070.080.1515ATG0.00-0.030.000.030.160.09-0.010.180.000.000.000.0016ATT0.00-0.180.00-0.30-0.29-0.350.14-0.440.000.350.17-0.2417CAA0.00-0.370.00-0.35-0.39-0.41-0.35-0.280.00-0.31-0.42-0.2618CAC0.000.270.000.330.410.210.280.350.000.060.080.0619CAG0.000.250.000.450.310.400.190.370.000.200.480.4120CAT0.00-0.210.00-0.17-0.30-0.32-0.23-0.070.000.05-0.06-0.1821CCA0.00-0.320.00-0.28-0.27-0.540.01-0.540.00-0.57-0.41-0.4522CCC0.000.560.000.410.490.420.490.460.000.180.490.1923CCG0.000.170.000.150.080.06-0.110.050.000.190.060.4924CCT0.00-0.320.00-0.44-0.41-0.400.02-0.540.00-0.22-0.18-0.4125CGA0.00-0.160.000.22-0.18-0.58-0.49-0.500.00-0.680.38-0.3726CGC0.00-0.170.000.110.470.33-0.190.520.00-0.08-0.310.2627CGG0.000.350.000.110.400.620.220.440.000.360.640.5128CGT0.00-0.150.00-0.26-0.40-0.30-0.46-0.440.000.080.540.5329CTA0.00-0.410.00-0.44-0.41-0.54-0.48-0.620.00-0.72-0.56-0.6830CTC0.000.490.000.450.580.480.550.600.000.240.400.3531CTG0.00-0.100.000.14-0.07-0.01-0.290.000.000.040.070.5232CTT0.00-0.090.00-0.21-0.16-0.350.07-0.530.000.020.22-0.3733GAA0.00-0.200.000.00-0.09-0.09-0.170.270.000.01-0.11-0.2334GAC0.000.350.000.290.520.530.540.450.00-0.10-0.130.0435GAG0.000.210.000.210.430.500.310.390.000.180.230.2136GAT0.00-0.210.00-0.20-0.22-0.28-0.22-0.150.000.060.130.0337GCA0.00-0.350.00-0.07-0.35-0.41-0.40-0.380.00-0.32-0.48-0.1438GCC0.000.380.000.150.430.360.330.460.00-0.240.28-0.1339GCG0.000.210.000.400.480.37-0.190.350.000.270.220.5514240GCT0.00-0.450.00-0.47-0.50-0.54-0.33-0.530.00-0.05-0.06-0.1141GGA0.000.040.00-0.09-0.260.790.040.060.00-0.060.160.1642GGC0.00-0.110.000.000.240.35-0.100.360.00-0.28-0.35-0.3343GGG0.000.380.000.290.410.660.590.530.000.560.550.4244GGT0.000.010.00-0.26-0.60-0.28-0.43-0.500.000.490.550.6345GTA0.00-0.400.00-0.28-0.42-0.47-0.40-0.420.00-0.35-0.43-0.3546GTC0.000.440.000.230.550.450.490.480.00-0.080.170.0347GTG0.000.030.000.120.270.18-0.230.340.000.210.100.5148GTT0.00-0.140.00-0.24-0.24-0.47-0.02-0.470.000.260.12-0.2749TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC0.000.230.000.340.350.380.350.280.00-0.14-0.08-0.0651TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT0.00-0.190.00-0.17-0.25-0.24-0.01-0.080.000.150.06-0.0253TCA0.000.180.000.25-0.15-0.220.21-0.210.000.11-0.130.0954TCC0.000.360.00-0.100.31-0.100.45-0.010.00-0.290.25-0.2155TCG0.000.140.000.450.320.24-0.390.350.00-0.080.110.5856TCT0.00-0.280.00-0.47-0.41-0.50-0.02-0.550.000.140.27-0.1957TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC0.00-0.050.000.300.460.29-0.250.260.00-0.08-0.30-0.3159TGG0.000.010.00-0.010.160.33-0.210.100.00-0.030.000.0660TGT0.000.020.00-0.34-0.32-0.17-0.29-0.350.000.500.820.2661TTA0.00-0.260.00-0.27-0.38-0.230.00-0.210.00-0.04-0.44-0.3862TTC0.000.310.000.420.200.070.060.290.00-0.18-0.05-0.1363TTG0.000.260.000.060.10-0.140.110.020.000.170.080.4164TTT0.00-0.180.00-0.24-0.26-0.230.18-0.340.000.270.04-0.08TAATACTAGTATTCATCCTCGTCTTGATGCTGGTGT495051525354555657585960表C.7继续61TTA62TTC63TTG64TTT1AAA0.10-0.150.170.242AAC0.19-0.080.380.113AAG0.63-0.110.69-0.144AAT-0.37-0.07-0.020.015ACA-0.23-0.16-0.52-0.126ACC0.460.170.160.077ACG0.440.030.660.268ACT-0.45-0.10-0.45-0.389AGA0.170.30-0.200.3410AGC0.360.150.250.1911AGG0.180.190.520.0112AGT-0.400.06-0.23-0.2813ATA-0.48-0.32-0.61-0.2214ATC0.370.300.550.4015ATG0.660.100.81-0.0716ATT-0.62-0.26-0.52-0.1817CAA-0.14-0.27-0.25-0.3218CAC0.360.120.200.4019CAG0.760.190.490.3420CAT-0.36-0.14-0.43-0.1921CCA-0.29-0.19-0.42-0.1922CCC0.650.560.430.4823CCG0.430.070.62-0.13<table>tableseeoriginaldocumentpage144</column></row><table><table>tableseeoriginaldocumentpage0</column></row><table>48GTT0.490.600.750.280.420.360.650.170.771.000.600.7849TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC0.060.510.23-0.25-0.10-0.12-0.15-0.01-0.19-0.34-0.17-0.6751TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT-0.220.070.42-0.27-0.170.000.240.550.840.901.000.7053TCA-0.10-0.49-0.25-0.050.150.21-0.33-0.31-0.43-0.45-0.08-0.7454TCC0.330.650.25-0.550.760.12-0.040.02-0.47-0.091.00-0.2255TCG0.100.410.580.000.190.50-0.15-0.390.090.75-0.511.0056TCT-0.020.110.180.35-0.370.030.70-0.071.001.001.001.0057TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC0.75-0.400.28-0.221.00-0.56-0.441.000.22-0.41-0.63-0.1959TGGo.n-0.22-0.240.35-0.010.73-0.12-0.32-0.350.11-0.05-0.6160TGT-0.551.00-0.040.380.401.00-0.36-0.181.001.001.001.0061TTA-0.38-0.24-0.46-0.470.15-0.12-0.35-0.650.08-0.250.37-0.1662TTC-0.06-0.080.130.420.22-0.210.55-0.53-0.58-0.44-0.59-0.4263TTG-0.27-0.35-0.47-0.360.59-0.330.30-0.39-0.170.461.000.6164TTT-0.01-0.020.05-0.18-0.230.040.100.070.640.771.001.00AAAAACAAGAATACAACCACGACTAGAAGCAGGAGT123456789101112表C.8继续131415161718192021222324ATAATCATGATTCAACACCAGCATCCACCCCCGCCTlAAA-0.14-0.34-0.020.51-0.070.150.450.25-0.280.090.430.312AAC0.430.18-0.04-0.03-0.10-0.050.390.050.020.190.350.503AAG0.110.060.060.110.20-0.52-0.56-0.150.22-0.38-0.46-0.564AAT-0.590.050.05-0.07-0.200.43-0.18-0.26-0.40-0.42-0.420.525ACA0.510.27-0.220.61-0.420.06-0.110.500.120.28-0.170.506ACC-0.45-0.090.56-0.450.680.420.080.320.201.000.750.387ACG-0.33-0.24-0.28-0.350.40-0.080.24-0.22-0.12-0.20-0.400.278ACT0.500.810.840.500.30-0.22-0.44-0.430.110.270.32-0.419AGA-0.19-0.130.01-0.130.460.73-0.07-0.221.001.00-0.240.6310AGC-0.010.000.10-0.130.230.21-0.28-0.490.46-0.440.421.0011AGG-0.640.480.330.160.08-0.640.670.281.00-0.63-0.331.0012AGT0.02-0.090.760.020.630.62-0.070.70-0.84-0.23-0.31-0.1713ATA0.320.501.000.86-0.670.10-0.17-0.070.05-0.57-0.470.6414ATC0.24-0.28-0.03-0.030.640.240.330.470.300.570.570.1915ATG0.14-0.060.000.030.47-0.23-0.250.230.140.290.05-0.2416ATT-0.18-0.06-0.140.24-0.43-0.310.04-0.26-0.59-0.41-0.21-0.1417CAA0.15-0.10-0.25-0.030.671.000.600.730.371.001.000.7618CAC-0.420.28-0.07-0.110.050.290.54-0.110.551.000.10-0.1819CAG0.590.120.25-0.140.11-0.52-0.46-0.200.091.00-0.45-0.5620CAT0.32-0.070.06-0.02-0.110.21-0.27-0.20-0.30-0.15-0.170.1821CCA1.000.36-0.21-0.520.460.021.001.001.001.001.001.0022CCC1.001.001.000.720.110.200.680.051.001.001.00-0.1223CCG0.47-0.32-0.33-0.24-0.19-0.360.160.06-0.09-0.45-0.20-0.2124CCT1.000.380.880.47-0.520.44-0.050.121.00-0.710.37-0.0625CGA1.00-0.43-0.63-0.560.381.000.550.021.001.001.001.0026CGC0.710.030.030.010.150.05-0.03-0.171.001.000.180.4827CGG-0.56-0.14-0.18-0.200.26-0.10-0.320.12-0.24-0.770.20-0.3428CGT0.580.610.390.16-0.390.17-0.09-0.081.000.19-0.49-0.4729CTA1.00-0.62-0.271.001.001.001.001.001.001.000.411.0030CTC-0.610.100.02-0.040.740.430.540.401.001.000.460.4231CTG-0.48-0.25-0.24-0.19-0.02-0.170.100.22-0.42-0.29-0.31-0.0932CTT0.540.480.280.77-0.30-0.30-0.33-0.38-0.21-0.61-0.23-0.4133GAA-0.01-0.12-0.130.040.110.350.190.270.210.220.230.6434GAC0.380.120.210.130.350.660.290.120.110.510.140.4335GAG-0.200.300.380.00-0.25-0.38-0.30-0.47-0.58-0.48-0.44-0.3436GAT-0.320.12-0.15-0.24-0.39-0.180.01-0.260.000.39-0.310.0437GCA0.20-0.08-0.030.360.080.820.040.58-0.500.170.540.4238GCC-0.45-0.01-0.27-0.35-0.280.400.250.061.000.300.740.3539GCG0.53-0.16-0.17-0.200.22-0.12-0.230.02-0.150.03-0.15-0.4840GCT0.210.560.810.47-0.10-0.480.27-0.440.01-0.51-0.22-0.5241GGA0.150.08-0.12-0.06-0.44-0.210.02-0.091.000.590.02-0.4142GGC0.27-0.100.02-0.190.460.180.130.220.300.130.360.6043GGG-0.020.68-0.24-0.130.660.08-0.41-0.451.00-0.38-0.49-0.3444GGT0.160.270.45-0.15-0.490.050.700.250.190.33-0.19-0.5445GTA0.740.510.300.690.520.57-0.17-0.170.530.620.64-0.0746GTC0.21-0.38-0.32-0.170.350.600.740.380.720.530.700.7847GTG-0.090.010.01-0.090.080.69-0.43-0.170.01-0.47-0.53-0.4348GTT0.06-0.080.330.17-0.39-0.51-0.10-0.420.660.17-0.25-0.3549TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC-0.620.220.38-0.120.060.520.23-0.13-0.400.660.230.3651TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT-0.400.19-0.250.13-0.130.15-0.12-0.261.000.39-0.410.2953TCA-0.270.49-0.130.32-0.30-0.06-0.030.790.451.000.41-0.0554TCC-0.13-0.33-0.10-0.540.780.320.681.000.070.240.290.6455TCG0.35-0.080.060.090.51-0.60-0.200.00-0.04-0.42-0.21-0.5856TCT0.160.14-0.180.33-0.37-0.45-0.040.09-0.700.44-0.12-0.5757TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC1.00-0.440.440.57-0.09-0.37-0.25-0.21-0.011.000.231.0059TGG0.480.330.00-0.310.63-0.23-0.310.231.00-0.710.130.4560TGT-0.350.14-0.390.07-0.050.540.620.64-0.66-0.580.55-0.7561TTA-0.43-0.10-0.16-0.060.510.640.030.430.040.610.560.6362TTC1.00-0.160.150.130.75-0.08-0.20-0.181.001.000.03-0.0963TTG0.530.050.44-0.28-0.22-0.17-0.34-0.341.00-0.360.610.0364TTT0.340.06-0.10-0.190.16-0.22-0.260.45-0.23-0.650.010.33ATAATCATGATTCAACACCAGCATCCACCCCCGCCT131415161718192021222324表C.8继续252627282930313233343536CGACGCCGGCGTCTACTCCTGCTTGAAGACGAGGATiAAA-0.560.18-0.220.32-0.170.490.310.39-0.17-0.290.140.272AAC-0.40-0.26-0.39-0.091.000.00-0.170.130.200.50-0.070.333AAG1.00-0.13-0.030.360.15-0.52-0.63-0.640.240.010.480.174AAT-0.640.570.580.061.00-0.35-0.470.04-0.18-0.37-0.01-0.315ACA0.180.500.76-0.090.210.590.180.100.240.10-0.110.206ACC1.000.771.00-0.251.000.240.14-0.060.39-0.310.030.227ACG0.290.67-0.211.001.00-0.14-0.21-0.420.000.610.15-0.068ACT1.00-0.220.50-0.70-0.790.370.64-0.24-0.38-0.21-0.50-0.559AGA0.03-0.160.420.14-0.580.850.560.71-0.200.02-0.220.1810AGC-0.31-0.36-0.340.191.000.62-0.07-0.150.610.260.470.5711AGG1.00-0.01-0.751.001.00-0.36-0.82-0.56-0.430.63-0.540.3812AGT1.000.681.001.001.00-0.45-0.200.09-0.44-0.53-0.65-0.6013ATA-0.490.40-0.430.561.000.63-0.211.00-0.290.570.42-0.0814ATC-0.32-0.590.67-0.12-0.370.170.570.220.230.160.310.3715ATG0.160.360.130.160.46-0.04-0.37-0.230.000.05-0.01-0.0414716ATT-0.160.580.410.390.43-0.21-0.060.17-0.28-0.020.15-0.3817CAA0.370.610.430.150.131.000.870.860.21-0.19-0.050.1818CAC-0.72-0.270.47-0.50-0.400.780.010.460.110.450.190.3819CAG-0.46-0.45-0.39-0.250.37-0.36-0.54-0.47-0.100.04-0.06-0.0220CAT-0.29-0.230.580.69-0.240.140.310.25-0.13-0.37-0.01-0.0921CCA1.00-0.29-0.271.001.001.001.000.240.260.21-0.461.0022CCC1.00-0.65-0.701.001.000.000.430.370.39-0.42-0.340.1023CCG-0.64-0.22-0.040.380.28-0.460.100.190.130.360.360.1924CCT1.00-0.070.48-0.151.000.530.47-0.32-0.33-0.24-0.38-0.5425CGA1.000.32-0.251.001.000.290.190.560.80-0.341.000.1626CGC0.32-0.44-0.18-0.530.170.240.140.260.480.230.250.3527CGG1.000.59-0.371.001.00-0.66-0.31-0.160.170.40-0.150.2528CGT1.000.700.70-0.351.000.840.01-0.49-0.13-0.63-0.43-0.1829CTA1.001.001.001.001.001.001.000.351.001.00-0.21-0.2030CTC-0.440.050.73-0.281.000.320.690.410.41-0.190.27-0.0831CTG-0.01-0.41-0.270.100.40-0.52-0.46-0.290.270.410.110.4432CTT1.000.150.67-0.420.350.490.23-0.10-0.39-0.31-0.46-0.4633GAA0.290.45-0.220.410.570.500.380.56-0.060.02-0.160.0834GAC-0.63-0.34-0.59-0.47-0.020.140.360.010.250.140.030.2135GAG-0.25-0.59-0.46-0.40-0.38-0.42-0.55-0.640.250.130.24-0.2636GAT0.250.880.780.580.15-0.14-0.100.38-0.240.090.26-0.2637GCA1.000.470.79-0.070.240.120.610.570.280.010.06-0.0438GCC0.42-0.20-0.28-0.301.000.500.480.530.340.050.010.0639GCG0.19-0.23-0.360.460.56-0.58-0.29-0.40-0.070.420.120.2840GCT1.001.001.00-0.53-0.660.710.50-0.17-0.40-0.280.01-0.4541GGA0.490.220.24-0.201.000.420.200.28-0.25-0.04-0.190.2542GGC1.00-0.33-0.16-0.23-0.440.010.300.340.27-0.170.160.2643GGG-0.010.22-0.630.321.00-0.49-0.49-0.650.390.600.300.0144GGT0.160.490.500.07-0.520.190.30-0.41-0.22-0.51-0.36-0.0945GTA1.000.540.550.33-0.670.610.330.350.030.14-0.100.2846GTC0.54-0.040.18-0.24-0.170.760.510.710.480.05-0.19-0.1447GTG0.47-0.46-0.370.53-0.29-0.56-0.55-0.470.160.400.540.4148GTT-0.380.500.520.28-0.30-0.400.31-0.09-0.39-0.33-0.34-0.3349TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC-0.380.01-0.41-0.721.000.280.100.460.270.360.310.0251TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT-0.540.560.790.84-0.490.11-0.080.45-0.26-0.260.010.0053TCA1.00-0.311.000.510.020.490.420.370.06-0.04-0.26-0.0654TCC1.00-0.120.26-0.09-0.39-0.070.510.200.140.370.09-0.1455TCG1.00-0.06-0.60-0.19-0.73-0.58-0.27-0.170.380.57-0.160.1756TCT1.000.171.000.00-0.58-0.090.64-0.32-0.25-0.16-0.36-0.1457TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC-0.45-0.60-0.07-0.381.00-0.13-0.080.280.650.350.350.5959TGG1.000.56-0.21-0.21-0.490.24-0.510.05-0.010.100.03-0.0860TGT1.001.001.001.001.000.450.05-0.52-0.39-0.46-0.55-0.3661TTA0.150.480.240.811.000.610.710.760.030.380.300.1262TTC-0.70-0.260.34-0.591.000.320.540.430.480.220.040.5563TTG1.00-0.27-0.62-0.19-0.24-0.16-0.43-0.280.340.59-0.11-0.0264TTT1.000.750.500.45-0.50-0.39-0.070.57-0.30-0.350.15-0.09CGACGCCGGCGTCTACTCCTGCTTGAAGACGAGGAT252627282930313233343536表C.8继续373839404142434445464748148<table>tableseeoriginaldocumentpage149</column></row><table>55TCG-0.40-0.43-0.09-0.030.37-0.071.000.580.78-0.250.69-0.3056TCT-0.260.310.520.070.670.650.840.33-0.220.220.50-0.4357TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC-0.500.160.030.02-0.46-0.540.47-0.080.200.270.721.0059TGG-0.480.100.250.37-0.190.26-0.510.66-0.190.25-0.180.1160TGT1.000.59-0.410.530.811.001.001.00-0.66-0.43-0.510.1761TTA0.160.600.560.36-0.070.02-0.06-0.21-0.360.310.50-0.1162TTC0.58-0.060.39-0.03-0.59-0.530.06-0.50-0.180.200.220.3763TTG-0.190.170.06-0.250.800.710.000.500.810.210.21-0.1764TTT0.21-0.32-0.20-0.080.850.711.000.580.00-0.450.180.28GCAGCCGCGGCTGGAGGCGGGGGTGTAGTCGTGGTT373839404142434445464748表C.8继续495051525354555657585960TAATACTAGTATTCATCCTCGTCTTGATGCTGGTGT1AAA0.00-0.260.000.06-0.31-0.020.080.350.000.020.170.052AAC0.00-0.140.000.180.720.76-0.260.300.000.07-0.13-0.403AAG0.000.400.000.360.090.62-0.340.160.000.08-0.32-0.254AAT0.00-0.080.000.040.10-0.590.310.780.000.410.200.145ACA0.00-0.160.00-0.050.02-0.10-0.310.060.00-0.260.150.036ACC0.00-0.090.000.170.601.000.240.260.00-0.190.611.007ACG0.000.260.000.410.340.360.28-0.140.00-0.14-0.220.168ACT0.00-0.420.00-0.35-0.100.25-0.21-0.670.00-0.27-0.231.009AGA0.001.000.000.22-0.19-0.23-0.15-0.120.000.220.01-0.4110AGC0.00-0.150.000.070.350.78-0.190.200.000.430.020.5911AGG0.001.000.000.340.481.001.000.350.001.00-0.561.0012AGT0.000.320.000.09-0.34-0.22-0.65-0.280.001.000.051.0013ATA0.00-0.600.00-0.37-0.11-0.301.00-0.040.00-0.38-0.521.0014ATC0.000.460.000.400.370.110.500.550.00-0.04-0.130.3515ATG0.000.180.00-0.14-0.430.720.52-0.040.000.580.00-0.4516ATT0.00-0.190.00-0.22-0.23-0.540.04-0.420.000.200.38-0.3817CAA0.00-0.320.00-0.47-0.21-0.47-0.580.140.00-0.17-0.42-0.4318CAC0.000.460.00-0.300.460.70-0.020.110.00-0.110.18-0.2319CAG0.000.590.000.42-0.180.320.420.000.00-0.140.531.0020CAT0.000.470.00-0.29-0.50-0.310.460.470.000.11-0.120.2321CCA0.00-0.330.00-0.430.031.00-0.46-0.580.00-0.170.05-0.4322CCC0.000.190.000.640.210.331.000.510.000.011.001.0023CCG0.000.090.000.07-0.030.500.440.360.000.45-0.31-0.2524CCT0.00-0.250.00-0.01-0.46-0.36-0.05-0.370.001.000.64-0.7025CGA0.00-0.510.00-0.440.29-0.171.000.130.00-0.451.001.0026CGC0.00-0.350.000.040.780.820.18-0.320.00-0.40-0.46-0.6427CGG0.000.400.000.200.581.000.600.480.000.460.551.0028CGT0.00-0.440.00-0.16-0.420.20-0.58-0.680.001.001.001.0029CTA0.001.000.00-0.78-0.60-0.29-0.69-0.030.001.00-0.79-0.8230CTC0.000.170.000.560.240.63-0.190.730.000.490.37-0.3231CTG0.00-0.120.00-0.21-0.050.16-0.350.010.000.42-0.080.5832CTT0.00-0.130.000.45-0.260.31-0.03-0.500.000.040.60-0.5733GAA0.00-0.080.00-0.06-0.01-0.180.090.290.00-0.110.080.1034GAC0.000.320.000.510.760.600.400.320.00-0.34-0.09-0.1835GAG0.000.040.000.320.76-0.29-0.120.260.00-0.19-0.170.5536GAT0.00-0.350.00-0.16-0.11-0.470.120.100.000.580.08-0.0237GCA0.00-0.220.00-0.02-0.13-0.06-0.16-0.230.000.28-0.390.4838GCC0.000.710.000.050.640.550.490.120.00-0.53-0.07-0.55<table>tableseeoriginaldocumentpage151</column></row><table>23CCG-0.250.160.77-0.2124CCT-0.630.72-0.43-0.2225CGA0.33-0.65-0.72-0.0726CGC0.280.130.050.3527CGG1.000.37-0.06-0.4828CGT-0.26-0.42-0.37-0.4029CTA-0.501.00-0.24-0.7530CTC0.480.73-0.020.5931CTG0.780.070.32-0.2932CTT-0.270.06-0.480.6833GAA0.180.01-0.320.0334GAC0.430.330.170.3735GAG0.41-0.260.740.1736GAT-0.35-0.41-0.520.0137GCA-0.46-0.33-0.10-0.2638GCC0.210.660.170.2439GCG0.290.300.780.1740GCT-0.32-0.32-0.59-0.3241GGA-0.090.30-0.39-0.1842GGC0.440.390.490.3143GGG0.530.680.18-0.0544GGT-0.14-0.46-0.51-0.5345GTA-0.10-0.230.03-0.4046GTC0.480.690.800.5347GTG0.040.360.43-0.2448GTT-0.29-0.37-0.22-0.1149TAA0.000.000.000.0050TAC-0.390.310.280.2751TAG0.000.000.000.0052TAT-0.36-0.09-0.48-0.2753TCA-0.17-0.08-0.30-0.2854TCC0.390.820.190.4555TCG0.540.591.00-0.3656TCT-0.66-0.420.02-0.0157TGA0.000.000.000.0058TGC-0.31-0.051.000.8059TGG0.760.371.00-0.2260TGT-0.05-0.120.31-0.5061TTA-0.42-0.04-0.23-0.4062TTC-0.08-0.010.43-0.1063TTG0.68-0.03-0.06-0.4564TTT-0.59-0.15-0.230.22TTATTCTTGTTT61626364152<table>tableseeoriginaldocumentpage153</column></row><table>0.000.000.000.000.000.000.000.000.000.000.000.0050TAC-0.13-0.18-0.23-0.10-0.14-0.04-0.09-0.10-0.020.11-0.150.0451TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT0.120.150.220.100.060.13-0.140.170.210.300.070.2653TCA0.030.040.07-0.050.160.300.060.12-0.270.03-0.270.1054TCC-0.28-0.17-0.31-0.13-0.16-0.22-0.12-0.34-0.05-0.14-0.02-0.1455TCG-0.150.03-0.14-0.040.120.330.020.30-0.150.07-0.210.2156TCT0.210.280.270.190.050.030.11-0.070.350.330.240.3057TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC-0.12-0.03-0.14-0.10-0.04-0.110.04-0.020.09-0.15-0.170.0059TGG-0.040.020.06-0.01-0.03-0.02-0.110.09-0.170.15-0.150.1860TGT0.000.000.210.080.13-0.06-0.050.020.300.270.160.3661TTA0.140.050.060.10-0.12-0.07-0.10-0.06-0.08-0.03-0.09-0.0762TTC-0.35-0.32-0.34-0.280.03-0.28-0.07-0.19-0.18-0.17-0.17-0.1663TTG-0.28-0.25-0.29-0.150.07-0.18-0.06-0.05-0.13-0.10-0.09-0.0764TTT0.300.280.440.290.26-0.060.070.110.140.130.150.18AAAAACAAGAATACAACCACGACTAGAAGCAGGAGT123456789101112表C.9继续131415161718192021222324ATAATCATGATTCAACACCAGCATCCACCCCCGCCT1AAA-0.160.00-0.10-0.060.100.180.050.130.000.10-0.08-0.012AAC0.06-0.050.000.01-0.15-0.07-0.12-0.070.010.060.18-0.023AAG0.110.100.150.13-0.12-0.21-0.10-0.15-0.070.030.110.044AAT0.050.010.00-0.050.060.110.230.010.030.15-0.15-0.105ACA-0.100.27-0.130.070.09-0.02-0.05-0.13-0.050.040.04-0.176ACC-0.19-0.33-0.15-0.240.040.070.200.210.140.210.350.367ACG-0.160.25-0.050.30-0.12-0.19-0.26-0.230.02-0.28-0.35-0.248ACT0.130.040.260.09-0.070.13-0.110.09-0.110.130.050.099AGA-0.140.03-0.06-0.010.100.050.100.120.040.230.080.0410AGC-0.21-0.020.09-0.020.100.050.210.040.360.140.390.2311AGG-0.210.270.100.14-0.090.14-0.18-0.02-0.18-0.29-0.32-0.1912AGT-0.150.140.240.040.250.210.470.180.420.280.330.3013ATA-0.190.17-0.080.180.20-0.080.060.090.05-0.03-0.390.0914ATC-0.06-0.30-0.22-0.140.070.180.020.040.100.240.290.3015ATG0.010.080.00-0.05-0.070.070.16-0.040.07-0.210.070.0216ATT0.180.010.210.06-0.14-0.08-0.04-0.06-0.070.01-0.19-0.1017CAA-0.130.04-0.09-0.010.190.16-0.060.080.000.31-0.21-0.0118CAC0.03-0.05-0.07-0.06-0.07-0.09-0.090.100.050.190.230.3319CAG-0.050.100.220.16-0.05-0.15-0.40-0.220.06-0.03-0.270.0120CAT0.08-0.090.040.040.02-0.010.11-0.02-0.040.17-0.29-0.1621CCA-0.060.14-0.030.030.180.09-0.110.12-0.180.36-0.170.0022CCC-0.28-0.29-0.25-0.210.340.260.310.020.330.490.550.3823CCG-0.180.130.030.07-0.10-0.11-0.45-0.15-0.08-0.070.050.0024CCT0.170.090.210.13-0.18-0.100.02-0.13-0.180.00-0.09-0.0425CGA-0.320.13-0.240.070.110.25-0.20-0.230.160.120.530.0126CGC-0.210.030.050.18-0.09-0.33-0.13-0.300.02-0.25-0.130.0427CGG0.020.27-0.020.260.10-0.09-0.410.120.09-0.47-0.29-0.1128CGT0.160.010.20-0.08-0.02-0.08-0.01-0.140.240.050.110.1929CTA-0.100.10-0.010.19-0.020.00-0.10-0.03-0.07-0.08-0.25-0.0630CTC-0.26-0.240.03-0.160.420.080.250.090.19-0.060.360.1931CTG-0.220.030.030.080.01-0.11-0.15-0.050.22-0.28-0.060.1932CTT0.300.210.420.130.11-0.050.25-0.13-0.09-0.15-0.18-0.08154<table>tableseeoriginaldocumentpage155</column></row><table>17CAA-0.030.22-0.010.12-0.030.26-0.140.22-0.040.11-0.05-0.0318CAC0.12-0.20-0.15-0.320.41-0.060.180.020.06-0.040.080.0119CAG-0.13-0.26-0.35-0.19-0.13-0.11-0.19-0.040.14-0.100.020.0020CAT-0.48-0.19-0.04-0.200.08-0.010.160.01-0.050.000.000.0121CCA0.160.060.010.090.100.200.120.18-0.020.16-0.090.0422CCC0.24-0.010.320.090.19-0.110.290.170.270.190.300.2023CCG-0.14-0.22-0.340.18-0.45-0.03-0.41-0.170.01-0.08-0.13-0.0424CCT-0.38-0.210.06-0.290.200.090.010.01-0.06-0.09-0.08-0.1925CGA0.300.080.040.36-0.380.070.03-0.070.07-0.16-0.090.0026CGC-0.03-0.47-0.49-0.380.15-0.23-0.10-0.010.330.21-0.160.0127CGG0.16-0.22-0.540.29-0.31-0.42-0.43-0.360.130.14-0.010.2228CGT0.01-0.320.05-0.480.04-0.110.220.01-0.05-0.110.07-0.0929CTA-0.290.10-0.10-0.10-0.20-0.13-0.270.06-0.030.10-0.020.0830CTC-0.25-0.220.180.210.19-O.U0.150.050.290.090.08-0.0831CTG-0.42-0.19-0.390.16-0.16-0.11-0.22-0.01-0.08-0.18-0.14-0.0432CTT-0.490.05-0.29-0.04-0.11-0.36-0.03-0.270.150.000.28-0.0333GAA0.010.21-0.140.08-0.130.11-0.060.110.000.020.01-0.0234GAC0.05-0.30-0.02-0.420.180.010.06-0.030.110.110.170.1335GAG0.20-0.16-0.030.07-0.20-0.24-0.30-0.130.04-0.03-0.110.0336GAT0.070.010.35-0.110.230.010.220.02-0.06-0.05-0.07-0.0737GCA0.18-0.070.270.030.010.020.050.19-0.15-0.10-0.09-0.1338GCC0.570.160.43-0.160.13-0.050.21-0.090.330.330.430.3039GCG0.41-0.19-0.080.18-0.37-0.30-0.36-0.24-0.01-0.15-0.11-0.0540GCT0.240.02-0.10-0.130.140.120.31-0.09-0.120.000.05-0.0741GGA0.11-0.17-0.050.16-0.030.01-0.100.020.050.15-0.030.1342GGC0.29-0.330.03-0.380.05-0.20-0.02-0.050.080.040.110.0843GGG-0.16-0.10-0.10-0.17-0.39-0.43-0.42-0.390.180.230.030.3344GGT0.510.260.63-0.080.040.070.410.08-0.13-0.210.05-0.1145GTA-0.51-0.18-0.29-0.270.01-0.12-0.14-0.090.020.02-0.17-0.0746GTC0.49-0.070.37-0.070.26-0.040.17-0.200.230.040.320.2347GTG0.13-0.09-0.360.16-0.10-0.13-0.16-0.11-0.26-0.14-0.32-0.1048GTT0.23-0.110.20-0.310.19-0.170.18-0.250.060.020.150.0049TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC0.23-0.19-0.05-0.270.060.00-0.050.180.060.020.070.0851TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT-0.04-0.02-0.03-0.180.14-0.020.060.01-0.05-0.10-0.04-0.0153TCA-0.370.02-0.010.27-0.110.14-0.11-0.040.020.150.050.0554TCC-0.110.020.32-0.050.140.180.170.130.330.360.210.3055TCG-0.02-0.220.060.23-0.32-0.10-0.34-0.110.190.270.010.1556TCT-0.17-0.300.20-0.180.060.040.130.01-0.010.090.07-0.0557TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC-0.26-0.19-0.46-0.410.16-0.270.01-0.110.230.140.150.1759TGG0.170.510.360.52-0.09-0.09-0.180.050.04-0.10-0.080.0660TGT-0.17-0.150.05-0.230.08-0.190.03-0.03-0.07-0.09-0.20-0.0961TTA-0.22-0.19-0.150.01-0.19-0.08-0.19-0.05-0.020.03-0.020.0462TTC-0.08-0.22-0.01-0.250.09-0.140.150.060.270.230.280.2363TTG0.120.080.130.110.140.270.150.27-0.06-0.03-0.04-0.0264TTT-0.050.13-0.200.380.21-0.030.190.04-0.19-0.14-0.07-0.13CGACGCCGGCGTCTACTCCTGCTTGAAGACGAGGAT252627282930313233343536表C.9继续373839404142434445464748GCAGCCGCGGCTGGAGGCGGGGGTGTAGTCGTGGTT<table>tableseeoriginaldocumentpage157</column></row><table><table>tableseeoriginaldocumentpage158</column></row><table><table>tableseeoriginaldocumentpage159</column></row><table><table>tableseeoriginaldocumentpage160</column></row><table><table>tableseeoriginaldocumentpage161</column></row><table>49TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC-0.09-0.58-0.610.080.04-0.560.42-0.57-0.570.300.52-0.0951TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT0.600.060.430.610.670.420.660.300.590.520.640.6453TCA0.690.220.070.480.370.200.290.52-0.290.190.030.3954TCC0.04-0.68-0.74-0.25-0.17-0.66-0.13-0.59-0.60-0.090.10-0.0255TCG-0.010.17-0.050.380.680.800.590.12-0.230.650.220.2156TCT0.21-0.45-0.360.380.29-0.540.11-0.440.060.670.500.4157TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC0.260.620.120.420.110.020.500.700.340.290.760.5159TGG0.15-0.17-0.170.140.59-0.250.16-0.28-0.330.350.170.3360TGT0.09-0.43-0.300.000.45-0.520.23-0.30-0.260.130.400.7061TTA0.57-0.210.060.610.25-0.380.58-0.14-0.190.160.180.3562TTC-0.20-0.61-0.64-0.130.26-0.600.42-0.51-0.58-0.100.44-0.0763TTG-0.27-0.64-0.68-0.280.17-0.650.43-0.59-0.62-0.040.21-0.0664TTT0.680.330.470.610.540.070.470.340.320.390.430.53AAAAACAAGAATACAACCACGACTAGAAGCAGGAGT123456789101112表C.10继续131415161718192021222324ATAATCATGATTCAACACCAGCATCCACCCCCGCCT1AAA0.520.140.160.150.220.250.340.540.040.540.590.322AAC0.53-0.61-0.24-0.41-0.45-0.530.05-0.33-0.500.400.18-0.193AAG0.62-0.56-0.18-0.31-0.41-0.550.35-0.21-0.510.160.57-0.164AAT0.710.030.220.390.260.440.630.520.200.090.440.365ACA0.680.510.290.410.600.330.410.220.010.500.940.526ACC0.44-0.61-0.54-0.59-0.26-0.490.330.08-0.400.130.450.307ACG0.540.120.180.630.140.430.340.420.090.000.440.558ACT0.38-0.300.41-0.12-0.48-0.440.430.08-0.540.460.60-0.129AGA0.47-0.48-0.21-0.32-0.26-0.410.210.06-0.540.440.44-0.1110AGC0.360.020.590.540.090.130.390.510.540.561.000.1011AGG0.500.460.280.270.510.250.390.390.190.580.470.5012AGT0.630.520.540.190.520.630.790.420.610.900.620.4813ATA0.660.510.370.680.790.780.680.490.530.590.770.6514ATC0.41-0.64-0.47-0.46-0.44-0.390.380.09-0.390.350.43-0.0215ATG0.71-0.250.00-0.19-0.05-0.190.110.13-0.330.270.590.2616ATT0.62-0.240.28-0.13-0.30-0.510.210.17-0.480.300.39-0.0217CAA0.54-0.50-0.22-0.16-0.19-0.170.26-0.08-0.370.360.42-0.0518CAC0.69-0.52-0.45-0.46-0.27-0.61-0.170.28-0.210.780.320.1219CAG0.850.270.600.290.220.370.070.230.450.600.37-0.0220CAT0.480.090.460.230.070.130.410.26-0.270.580.62-0.1921CCA0.53-0.52-0.38-0.42-0.12-0.210.020.36-0.470.530.74-0.2222CCC0.58-0.360.360.190.350.120.090.670.060.651.000.2923CCG0.690.420.430.620.590.260.490.790.40-0.090.720.7824CCT0.780.130.450.13-0.33-0.540.28-0.28-0.220.210.100.1425CGA0.330.760.700.790.900.510.781.000.681.001.001.0026CGC0.47-0.010.100.75-0.10-0.43-0.380.350.430.511.000.0127CGG1.000.791.000.530.821.000.621.001.000.301.001.0028CGT0.43-0.35-0.18-0.32-0.56-0.600.540.03-0.38-0.030.470.0529CTA0.610.580.450.310.12-0.230.04-0.10-0.28-0.120.900.1430CTC0.580.460.470.330.740.590.660.890.650.820.770.7231CTG0.60-0.100.350.120.16-0.170.27-0.040.25-0.200.880.4632CTT0.850.630.530.540.300.260.690.590.460.270.590.34<table>tableseeoriginaldocumentpage163</column></row><table>17CAA0.910.490.85-0.160.180.510.110.56-0.21-0.100.10-0.0918CAC1.001,000.51-0.540.570.660.460.24-0.27-0.42-0.030.0119CAG0.44-0.110.04-0.050.240.470.370.710.26-0.070.480.3920CAT0.68-0.420.72-0.320.320.530.190.320.13-0.170.200.3221CCA0.730.360.54-0.13-0.080.520.310.31-0.43-0.240.07-0.1022CCC1.00-0.390.420.650.220.590.560.720.670.290.680.5223CCG0.571.000.261.000.56-0.04-0.100.180.410.260.430.0424GCT0.82.0.040.70-0.41.0.100.440.810.030.040.45-0.0725CGA1.001.001.001.000.830.191.000.640.730.880.740.8026CGC1.000.23-0.09-0.49-0.010.520.491.000.670.710.850.2227CGG-0.461.001.001.000.42-0.27-0.090.690.810.581.000.8928CGT1.001.001.00-0.75-0.110.80-0.390.08-0.48-0.400.15-0.2529CTA0.841.000.72-0.53-0.050.76-0.070.360.130.030.290.1130CTC0.620.081.000.800.761.000.850.750.690.040.850.7231CTG0.380.510.300.270.05-0.17-0.270.390.13-0.260.300.1732CTT0.830.79-0.320.390.090.240.120.490.420.100.700.5033GAA0.760.710.42-0.31-0.180.610.160.50-0.20-0.180.18-0.0534GAC0.57-0.020.51-0.700.420.630.270.55-0.12-0.210.110.1535GAG0.77-0.080.610.02-0.070.150.210.300.300.160.270.3236GAT0.920.630.87-0.390.270.530.430.40-0.09-0.210.330.1437GCA0.790.510.120.000.640.820.670.480.440.180.410.5038GCC1.000.500.76-0.380.170.340.56-0.030.14-0.120.470.1839GCG1.000.361.000.180.33-0.100.230.430.570.140.100.4540GCT0.830.500.86-0.490.020.460.180.20-0.48-0.460.08-0.2041GGA0.690.821.000.450.570.630.550.710.580.630.600.5742GGC0.82-0.211.00-0.500.460.360.370.420.360.450.410.2743GGG0.430.660.510.28-0.08-0.030.080.230.780.740.580.7844GGT0.840.810.87-0.57-0.270.490.300.14-0.48-0.54-0.05-0.2745GTA0.820.580.700.020.610.540.560.340.290.360.190.4646GTC0.810.091.00-0.590.330.600.15-0.03-0.22-0.440.360.0447GTG0.800.061.000.300.260.590.170.16-0.02-0.090.040.2848GTT1.000.050.83-0.560.250.270.500.04-0.24-0.410.410.0949TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC0.290.160.70-0.660.250.250.050.37-0.21-0.33-0.03-0.0851TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT0.870.840.54-0.020.400.500.240.550.10-0.110.260.3453TCA1.000.640.230.240.230.500.330.590.320.520.480.3654TCC0.800.031.00-0.600.190.430.520.29-0.20-0.29-0.06-0.0155TCG1.000.221.000.340.080.64-0.310.430.540.070.310.2856TCT0.63-0.021.00-0.63-0.290.320.310.29-0.45-0.480.34-0.0957TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC1.00-0.411.000.280.580.340.47-0.040.050.060.700.9459TGG1.000.420.580.510.230.54-0.180.79-0.15-0.290.410.2160TGT1.00-0.361.00-0.650.14-0.02-0.290.36-0.20-0.190.09-0.3261TTA0.750.700.860.070.020.820.280.260.150.250.380.2862TTC0.670.031.00-0.570.08-0.010.280.21-0.10-0.220.200.0363TTG0.830.700.72-0.39-0.290.640.190.59-0.52-0.49-0.08-0.2864TTT0.890.200.810.600.590.830.600.72-0.10-0.170.290.19CGACGCCGGCGTCTACTCCTGCTTGAAGACGAGGAT252627282930313233343536表C.10继续373839404142434445464748GCAGCCGCGGCTGGAGGCGGGGGTGTAGTCGTGGTT<table>tableseeoriginaldocumentpage165</column></row><table>56TCT0.15-0.610.21-0.590.590.350.43-0.610.34-0.56-0.29-0.4157TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC0.750.200.560.590.900.33-0.070.221.000.871.000.3959TGG0.46-0.200.44-0.260.620.48-0.21-0.310.58-0.060.53-0.3660TGT0.63-0.47-0.48-0.380.76-0.030.23-0.480.19-0.450.01-0.4661TTA0.490.160.36-0.050.480.290.54-0.250.52-0.220.040.0362TTC0.470.030.500.200.700.180.63-0.510.73-0.440.17-0.0163TTG0.18-0.580.08-0.620.480.050.50-0.590.32-0.58-0.16-0.5064TTT0.20-0.400.25-0.260.730.240.47-0.230.53-0.290.34-0.23GCAGCCGCGGCTGGAGGCGGGGGTGTAGTCGTGGTT373839404142434445464748表C.10继续495051525354555657585960TAATACTAGTATTCATCCTCGTCTTGATGCTGGTGT1AAA0.00-0.140.000.310.100.120.26-0.180.000.34-0.030.012AAC0.00-0.530.00-0.15-0.03-0.48-0.17-0.400.000.25-0.43-0.573AAG0.00-0.430.000.320.35-0.370.07-0.420.000.240.05-0.314AAT0.000.340.000.440.460.080.340.110.000.600.510.415ACA0.000.310.000.630.410.160.460.190.000.660.27-0.166ACC0.00-0.540.000.240.25-0.45-0.15-0.440.00-0.33-0.39-0.427ACG0.00-0.020.000.660.560.700.520.460.000.750.570.538ACT0.00-0.500.000.11-0.11-0.49-0.11-0.540.000.90-0.08-0.349AGA0.00-0.310.000.430.05-0.440.37-0.410.000.46-0.17-0.3410AGC0.000.230.000.600.390.010.560.170.000.200.100.5111AGG0.000.230.000.540.000.120.490.190.001.000.610.7112AGT0.000.140.000.630.410.510.570.470.000.450.440.4913ATA0.000.250.000.560.430.370.440.500.000.710.760.5314ATC0.00-0.490.00-0.06-0.06-0.570.03-0.530.000.37-0.44-0.4615ATG0.00-0.240.000.230.31-0.330.28-0.270.000.290.00-0.1516ATT0.00-0.210.000.180.34-0.450.14-0.390.00-0.06-0.02-0.1717CAA0.00-0.270.000.200.36-0.270.59-0.330.000.44-0.20-0.3018CAC0.00-0.310.00-0.090.52-0.60-0.14-0.600.000.45-0.31-0.3219CAG0.00-0.210.000.350.130.160.330.080.000.190.550.2520CAT0.00-0.060.000.290.300.050.42-0.160.000.080.250.0621CCA0.00-0.450.000.260.15-0.390.24-0.580.000.16-0.48-0.4522CCC0.000.470.000.140.360.040.30-0.090.000.280.900.1223CCG0.000.520.000.560.760.780.820.800.000.540.880.4424CCT0.00-0.360.000.240.34-0.39-0.18-0.090.00-0.090.400.5525CGA0.000.840.000.631.001.000.630.720.001.000.811.0026CGC0.000.030.000.411.00-0.28-0.250.160.000.170.541.0027CGG0.000.440.000.58-0.150.601.000.760.001.000.670.2628CGT0.00-0.680.00-0.470.30-0.53-0.25-0.500.00-0.44-0.51-0.6929CTA0.00-0.180.000.080.320.030.04-0.050.000.31-0.38-0.3030CTC0.000.010.000.550.60-0.350.820.210.000.670.590.0031CTG0.000.100.000.520.480.410.340.260.000.310.42-0.2232CTT0.00-0.250.000.150.17-0.39-0.10-0.250.000.560.39-0.0733GAA0.00-0.260.000.230.55-0.440.53-0.380.000.30-0.12-0.2634GAC0.00-0.480.000.070.31-0.570.00-0.580.00-0.01-0.25-0.5635GAG0.00-0.170.000.220.58-0.060.030.130.000.370.320.1536GAT0.000.090.000.280.390.02-0.110.000.000.700.180.2437GCA0.000.240.000.590.32-0.120.210.070.000.610.350.3338GCC0.00-0.240.000.450.06-0.56-0.10-0.400.000.680.08-0.2639GCG0.000.000.000.340.600.390.570.510.000.39-0.020.75<table>tableseeoriginaldocumentpage167</column></row><table><table>tableseeoriginaldocumentpage168</column></row><table><table>tableseeoriginaldocumentpage169</column></row><table>49TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC0.47-0.08-0.070.830.200.00-0.160.920.75-0.200.790.7051TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT1.000.910.901.000.390.820.711.000.440.740.670.0353TCA0.120.890.700.38-0.570.730.29-0.500.350.13-0.14-0.5454TCC0.30-0.17-0.020.280.01-0.07-0.12-0.280.05-0.04-0.04-0.0355TCG0.030.240.000.21-0.240.32-0.300.270.210.490.380.2056TCT1.000.911.00-0.05-0.320.620.461.00-0.550.580.781.0057TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC-0.10-0.07-0.07-0.26-0.210.02-0.120.400.040.290.42-0.0959TGG-0.36-0.010.030.34-0.200.15-0.23-0.200.47-0.020.510.0060TGT-0.460.970.81-0.561.000.51-0.10-0.500.120.920.310.6461TTA1.001.001.001.00-0.760.840.671.00-0.900.461.001.0062TTC-0.20-0.020.000.190.58-0.130.220.280.21-0.010.330.2063TTG-0.700.24-0.23-0.600.040.35-0.20-0.32-0.71-0.22-0.49-0.5464TTT-0.050.950.731.00-0.070.720.62-0.55-0.410.610.001.00AAAAACAAGAATACAACCACGACTAGAAGCAGGAGT123456789101112表C.ll继续131415161718192021222324ATAATCATGATTCAACACCAGCATCCACCCCCGCCT1AAA-0.500.740.22-0.69-0.42-0.21-0.24-0.69-0.67-0.47-0.58-0.572AAC0.58-0.05-0.030.210.37-0.020.000.470.550.23-0.030.223AAG0.46-0.05-0.010.230.50-0.01-0.010.370.360.060.040.334AAT1.000.750.83-0.39-0.74-0.10-0.18-0.74-0.82-0.62-0.72-0.705ACA1.000.850.68-0.07-0.67-0.24-0.44-0.76-0.89-0.73-0.75-0.806ACC0.590.110.16-0.100.150.090.04-0.210.400.380.180.157ACG0.18-0.26-0.30-0.35-0.15-0.050.02-0.270.05-0.15-0.10-0.298ACT0.310.820.86-0.32-0.700.13-0.37-0.78-0.87-0.75-0.75-0.909AGA-0.300.390.37-0.67-0.74-0.13-0.13-0.17-0.86-0.19-0.28-0.7910AGC1.000.130.30-0.170.12-0.12-0.07-0.480.040.080.12-0.3711AGG0.720.000.06-0.45-0.640.20-0.02-0.30-0.62-0.23-0.10-0.1812AGT0.510.700.85-0.52-0.33-0.22-0.20-0.69-0.65-0.30-0.37-0.7913ATA1.000.870.91-0.570.430.120.06-0.250.44-0.67-0.65-0.6614ATC0.82-0.05-0.04-0.120.440.00-0.010.180.500.180.080.4315ATG0.97-0.030.000.170.15-0.01-0.010.170.090.07-0.060.0716ATT1.000.730.80-0.19-0.83-0.44-0.24-0.58-0.91-0.79-0.81-0.9117CAA0.630.800.77-0.17-0.320.220.020.45-0.19-0.25-0.19-0.7218CAC-0.25-0.06-0.050.370.200.010.000.040.280.030.200.1419CAG-0.18-0.04-0.040.070.01-0.020.000.12-0.24-0.050.09-0.0220CAT0.190.790.650.57-0.68-0.08-0.06-0.37-0.59-0.60-0.65-0.6721CCA-0.580.960.69-0.83-0.700.03-0.37-0.42-0.93-0.67-0.73-0.8722CCC-0.180.300.25-0.280.320.560.440.210.290.820.480.0723CCG-0.29-0.21-0.20-0.18-0.18-0.30-0.230.07-0.35-0.28-0.18-0.1624CCT0.300.840.75-0.78-0.70-0.08-0.23-0.65-0.84-0.60-0.55-0.8425CGA-0.600.630.64-0.59-0.600.34-0.18-0.58-0.78-0.06-0.34-0.6526CGC-0.090.030.080.050.130.310.24-0.050.280.050.340.3927CGG-0.37-0.12-0.21-0.26-0.27-0.29-0.18-0.22-0.01-0.150.08-0.0628CGT0.350.470.610.11-0.610.21-0.04-0.56-0.67-0.49-0.46-0.7129CTA-0.440.750.83-0.48-0.800.14-0.34-0.67-0.88-0.54-0.47-0.8230CTC0.070.230.30-0.01-0.230.54-0.380.520.560.410.570.5431CTG0.16-0.14-0.170.28-0.48-0.270.560.320.25-0.32-0.070.1932CTT0.600.890.85-0.21-0.690.08-0.72-0.39-0.86-0.75-0.64-0.8233GAA0.500.800.710.40-0.35-0.15-0.150.20-0.64-0.38-0.48-0.4334GAC-0.06-0.05-0.040.360.340.040.010.010.460.240.000.3735GAG-0.17-0.13-0.110.200.220.010.030.320.260.210.080.4236GAT0.160.860.75-0.44-0.68-0.36-0.27-0.68-0.80-0.66-0.72-0.7437GCA0.260.830.81-0.42-0.69-0.33-0.47-0.58-0.84-0.70-0.72-0.7838GCC-0.040.220.170.220.200.140.11-0.090.440.330.300.2539GCG-0.49-0.32-0.29-0.180.12-0.11-0.030.080.05-0.130.010.0840GCT0.470.860.85-0.12-0.75-0.03-0.42-0.80-0.85-0.77-0.70-0.8641GGA-0.150.540.40-0.49-0.57-0.37-0.32-0.41-0.73-0.33-0.51-0.5742GGC-0.13-0.16-0.120.150.410.260.260.250.360.240.290.3643GGG-0.220.13-0.02-0.28-0.47-0.36-0.41-0.51-0.38-0.16-0.27-0.4244GGT0.720.620.680.16-0.06-0.040.01-0.46-0.38-0.30-0.38-0.5245GTA0.110.880.85-0.40-0.670.21-0.34-0.32-0.47-0.50-0.68-0.5346GTC0.100.050.150.190.430.250.140.110.710.330.370.5447GTG0.35-0.16-0.230.210.28-0.27-0.140.270.04-0.18-0.250.3048GTT0.580.810.70-0.44-0.55-0.17-0.39-0.77-0.86-0.75-0.72-0.8549TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC0.27-0.05-0.040.400.24-0.050.000.790.330.350.000.6351TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT0.130.750.80-0.46-0.60-0.22-0.181.00-0.81-0.69-0.80-0.6553TCA-0.290.820.87-0.74-0.530.13-0.40-0.56-0.87-0.65-0.74-0.8354TCC-0.120.090.110.120.340.230.140.310.280.280.180.2055TCG-0.43-0.28-0.38-0.310.19-0.12-0.070.130.04-0.10-0.050.0956TCT-0.720.800.87-0.220.40-0.25-0.35-0.79-0.37-0.70-0.62-0.9057TCA0.000.000.000.000.000.000.000.000.000.000.000.0058丁GC0.22-0.07-0.04-0.450.020.020.00-0.11-0.060.100.040.1859TGG-0.180.010.00-0.010.27-0.02-0.010.200.040.00-0.010.1960TGT1.000.910.41-0.29-0.420.020.01-0.69-0.67-0.31-0.44-0.4661TTA1.001.001.00-0.91-0.861.00-0.31-0.88-0.91-0.36-0.611.0062TTC0.41-0.02-0.010.200.47-0.01-0.020.350.700.050.030.3363TTG0.21-0.05-0.19-0.68-0.440.05-0.60-0.30-0.68-0.01-0.16-0.4364TTT1.000.720.85-0.84-0.79-0.34-0.21-0.84-0.89-0.83-0.71-0.94ATAATCATGATTCAACACCAGCATCCAcccCCGCCT131415161718192021222324表C.ll继续252627282930313233343536CGACGCCGGCGTCTACTCCTGCTTGAAGACGAGGAT1AAA-0.82-0.59-0.56-0.28-0.85-0.65-0.45-0.580.200.670.500.102AAC0.02-0.01-0.050.310.60-0.170.150.080.14-0.03-0.040.273AAG0.190.12-0.050.340.37-0.070.130.29-0.10-0.05-0.010.384AAT-0.680.030.24-0.43-0.74-0.56-0.38-0.80-0.440.480.46-0.655ACA-0.80-0.08-0.34-0.69-0.63-0.35-0.36-0.68-0.090.570.55-0.076ACC0.07-0.09-0.12-0.210.800.000.250.02-0.07-0.05-0.12-0.297ACG0.080.430.350.020.44-0.16-0.23-0.270.220.100.20-0.168ACT-0.58-0.18-0.05-0.40-0.59-0.44-0.08-0.86-0.160.500.49-0.499AGA-0.76-0.08-0.12-0.22-0.37-0.39-0.17-0.78-0.400.560.47-0.3510AGC-0.41-0.16-0.11-0.420.890.120.23-0.36-0.20-0.13-0.25-0.4911AGG-0.440.280.10-0.11-0.21-0.130.01-0.42-0.480.190.23-0.4112AGT-0.420.470.53-0.51-0.450.110.02-0.62-0.50-0.03-0.12-0.7113ATA-0.75-0.53-0.60-0.72-0.59-0.39-0.15-0.370.210.520.280.2814ATC0.47-0.060.190.080.66-0.220.18-0.180.22-0.03-0.060.1815ATG0.10-0.080.020.160.75-0.130.050.280.15-0.01-0.030.2316ATT-0.77-0.300.12-0.661.00-0.50-0.38-0.910.360.560.51-0.73m<table>tableseeoriginaldocumentpage172</column></row><table><table>tableseeoriginaldocumentpage173</column></row><table>56TCT-0.370.120.02-0.87-0.480.660.20-0.32-0.050.240.56-0.7157TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC0.190.12-0.100.06-0.130.05-0.23-0.19-0.060.14-0.04-0.0159TGG-0.030.14-0.190.300.05-0.090.200.210.260.16-0.200.2460TGT-0.10-0.23-0.32-0.500.050.710.560.37-0.08-0.23-0.49-0.6961TTA1.000.920.24-0.84-0.510.88-0.161.00-0.610.580.031.0062TTC0.39-0.090.090.410.31-0.070.080.050.25-0.090.090.1663TTG-0.340.430.21-0.360.130.570.280.150.480.480.18-0.1864TTT0.280.240.48-0.76-0.070.630.680.36-0.320.470.49-0.71GCAGCCGCGGCTGGAGGCGGGGGTGTAGTCGTGGTT373839404142434445464748表C.11继续495051525354555657585960TAATACTAGTATTCATCCTCGTCTTGATGCTGGTGT1AAA0.000.200.00-0.13-0.480.260.57-0.480.000.33-0.35-0.402AAC0.00-0.040.000.450.410.020.220.080.00-0.06-0.030.463AAG0.00-0.040.000.620.270.01-0.100.260.00-0.050.030.394AAT0.000.460.001.00-0.43-0.07-0.07-0.750.000.580.75-0.535ACA0.000.570.00-0.37-0.640.13-0.02-0.790.000.150.17-0.596ACC0.000.120.00-0.28-0.200.13-0.12-0.170.00-0.17-0.22-0.397ACG0.00-0.200.00-0.50-0.160.180.05-0.340.000.550.56-0.078ACT0.000.680.00-0.08-0.76-0.22-0.45-0.900.000.390.161.009AGA0.000.740.00-0.42-0.860.16-0.31-0.800.000.040.110.1210AGC0.000.220.00-0.540.350.430.42-0.170.000.05-0.07-0.2711AGG0.000.390.00-0.37-0.620.15-0.29-0.580.000.240.48-0.1812AGT0.000.120.00-0.51-0.59-0.27-0.56-0.780.00-0.270.08-0.5113ATA0.000.510.000.180.38-0.38-0.43-0.760.000.370.220.2114ATC0.00-0.030.000.180.490.010.240.290.00-0.05-0.010.3115ATG0.00-0.020.000.390.340.11-0.01-0.030.00-0.030.000.3616ATT0.000.300.00-0.43-0.75-0.240.02-0.700.000.830.45-0.4117CAA0.000.800.00-0.13-0.390.470.491.000.000.580.111.0018CAC0.00-0.050.00-0.050.07-0.020.210.420.00-0.08-0.060.2719CAG0.00-0.070.000.630.09-0.02-0.060.390.00-0.07-0.010.4020CAT0.000.640.000.260.15-0.060.24-0.480.000.660.790.6921CCA0.000.720.000.41-0.750.430.34-0.810.000.110.15-0.7222CCC0.000.120.00-0.15-0.170.00-0.11-0.510.00-0.29-0.25-0.4223CCG0.00-0.110.00-0.09-0.08-0.01-0.010.170.000.330.230.4024CCT0.000.580.00-0.61-0.83-0.53-0.50-0.860.000.130.45-0.7625CGA0.000.410.00-0.46-0.720.240.14-0.720.00-0.17-0.05-0.6526CGC0.00-0.190.00-0.43-0.18-0.110.060.050.00-0.21-0.31-0.3827CGG0.000.190.00-0.020.020.320.170.320.000.330.410.1128CGT0.000.340.00-0.11-0.66-0.42-0.42-0.640.000.380.58-0.4229CTA0.000.590.00-0.76-0.720.710.49-0.500.000.68-0.10-0.6830CTC0.00-0.160.000.040.52-0.230.420.310.000.400.430.3731CTG0.000.080.000.370.43-0.110.050.520.00-0.23-0.200.2732CTT0.000.580.00-0.53-0.62-0.37-0.18-0.830.000.720.750.3033GAA0.000.650.000.45-0.350.440.420.010.000.210.160.7534GAC0.00-0.050.000.390.330.020.090.510.00-0.06-0.040.2735GAG0.00-0.130.000.560.15-0.02-0.170.570.00-0.09-0.030.4536GAT0.000.600.000.37-0.78-0.08-0.17-0.680.000.640.880.5937GCA0.000.560.000.28-0.530.13-0.10-0.680.000.320.05-0.3238GCC0.000.000.00-0.16-0.04-0.12-0.11-0.090.00-0.18-0.19-0.4139GCG0.00-0.070.00-0.100.050.060.030.050.000.380.330.2440GCT0.000.590.000.00-0.75-0.53-0.52-0.870.000.440.52-0.5241GGA0.00-0.010.00-0.29-0.470.32-0.090.200.000.220.23-0.4642GGC0.000.010.00-0.320.130.120.040.240.00-0.11-0.21-0.3243GGG0.00-0.070.00-0.06-0.070.330.000.150.000.250.38-0.1244GGT0.000.230.000.26-0.56-0.26-0.49-0.700.000.570.800.0145GTA0.000.880.000.470.380.550.46-0.460.000.750.35-0.1346GTC0.00-0.100.00-0.070.40-0.220.160.170.00-0.040.140.2147GTG0.000.050.000.210.270.19-0.030.470.00-0.07-0.200.2448GTT0.000.640.00-0.24-0.83-0.22-0.33-0.490.000.680.810.2849TAA0.000.000.000.000.000.000.000.000.000.000.000.0050TAC0.00-0.080.000.940.260.10-0.110.800.00-0.09-0.030.6251TAG0.000.000.000.000.000.000.000.000.000.000.000.0052TAT0.000.730.000.61-0.460.570.041.000.000.620.640.4553TCA0.000.780.001.00-0.630.38-0.12-0.850.000.180.070.4154TCC0.00-0.010.000.280.10-0.07-0.24-0.020.00-0.25-0.22-0.0555TCG0.00-0.210.000.220.010.10-0.070.070.000.450.450.4256TCT0.000.760.00-0.46-0.62-0.52-0.55-0.900.000.410.481.0057TGA0.000.000.000.000.000.000.000.000.000.000.000.0058TGC0.00-0.020.00-0.250.030.06-0.20-0.030.00-0.03-0.07-0.0859TGG0.00-0.010.000.15-0.140.15-0.170.390.00-0.030.000.2760TGT0.000.430.00-0.26-0.72-0.31-0.45-0.840.000.430.76-0.4261TTA0.001.000.00-0.94-0.850.670.52-0.910.00-0.45-0.691.0062TTC0.00-0.020.000.380.40-0.120.140.400.00-0.03-0.010.2963TTG0.000.080.00-0.55-0.73-0.10-0.27-0.570.000.01-0.44-0.5264TTT0.000.470.00-0.48-0.48-0.02o.n-0.690.000.570.62-0.69TAATACTAGTATTCATCCTCGTCTTGATGCTGGTGT495051525354555657585960表C.ll继续61TTA62TTC63TTG64TTT1AAA1.000.43-0.28-0.732AAC0.08-0.020.780.573AAG1.00-0.020.27-0.024AAT1.000.350.44-0.655ACA1.000.65-0.14-0.516ACC0.62-0.120.280.017ACG-0.370.22-0.25-0.018ACT-0.930.25-0.57-0.829AGA1.000.48-0.14-0.7610AGC1.000.180.51-0.4111AGG-0.440.13-0.480.2012AGT1.000.150.60-0.8013ATA1.000.41-0.05-0.4314ATC1.00-0.010.760.1815ATG1.00-0.010.630.3816ATT1.000.25-0.70-0.9117CAA-0.900.77-0.69-0.8518CAC-0.19-0.040.490.1619CAG-0.46-0.04-0.740.3920CAT1.000.50-0.311.0021CCA-0.840.71-0.64-0.5922CCC-0.37-0.15-0.07-0.1523CCG0.040.09-0.58-0.0717524CCT1.000.34-0.41-0.6525CGA-0.880.41-0.39-0.6926CGC0.51-0.090.64-0.3027CGG-0.430.09-0.510.2128CGT-0.72-0.050.03-0.5629CTA-0.950.56-0.70-0.6730CTC0.32-0.090.710.0431CTG0.290.040.020.5432CTT-0.940.36-0.56-0.7933GAA-0.290.650.190.0034GAC1.00-0.030.620.3835GAG-0.12-0.11-0.520.2636GAT-0.570.40-0.390.3737GCA-0.590.68-0.52-0.5538GCC0.26-0.100.47-0.1239GCG-0.520.09-0.43-0.2040GCT-0.850.36-0.53-0.7041GGA-0.500.190.08-0.3342GGC0.65-0.040.63-0.1243GGG-0.480.05-0.43-0.1944GGT-0.350.050.31-0.5845GTA1.000.82-0.210.6546GTC0.61-0.180.840.5747GTG0.640.200.320.4148GTT1.000.26-0.53-0.2449TAA0.000.000.000.0050TAC0.65-0.020.600.3751TAG0.000.000.000.0052TAT-0.860.37-0.31-0.4153TCA-0.870.65-0.42-0.8154TCC0.27-0.240.12-0.1055TCG-0.060.25-0.520.2556TCT-0.920.33-0.72-0.6757TGA0.000.000.000.0058TGC-0.52-0.020.50-0.2159TGG-0.170.00-0.01-0.1160TGT1.000.31-0.25-0.2261TTA1.000.72-0.821.0062TTC0.16-0.020.630.5163TTG1.000.20-0.74-0.5564TTT1.000.45-0.86-0.47TTATTCTTGTTT61626364申请人或代理人文件参考编号25051WO国际申请号与被保藏的微生物相关的说明(PCTRule13to)A.下文说明涉及英文说明书第26页24行提到的微生物B.保藏证明还有其它保藏记载于附页上(是)保藏单位名称CENTRAALBUREAUVOORSCHIMMELCULTURES保藏单位地址(包括邮政编码和国家)Uppsalalaan8P.O.Box85167NL-3508ADUtrechtTheNetherlands保藏日期10-08-1988编号CBS513.88C.其它说明0^7^"适y^^,5^0该信息还有附页继续说明()我方通知贵方,根据Rulel3bisPCT,在国家专利授权公告公开之前,上述微生物,仅可由请求人指定的专家获得样品,如果该申请被驳回、撤回或视为撤回,这条规定将从申请日起算二十年内有效。_D.说明适用的指定国家和地区0^菜说剪不是^对,夯房家浙巡区游话JE.对说明的单独补充0^7^^^^^y像空S)下述说明将随后提交至国际局(指明说明的常见性质,例如"保藏号")接收局专用()该表随国际申请收到国际局专用()该表于下述日期被国际局收负责官员负责官员(签名)申请人或代理人文件参考编号25051WO国际申请号m与被保藏的微生物相关的说明(PCTRule13to)A.下文说明涉及英文说明书第26页26行提到的微生物_B.保藏证明_还有其它保藏记载于附页上(是)保藏单位名称CENTRAALBUREAUVOORSCHIMMELCULTURES_保藏单位地址(包括邮政编码和国家)Uppsalalaan8P.O.Box85167NL-3508ADUtrechtTheNetherlands保藏日期02-06-1995编号CBS455.95C.其它说明p^^^^^^y像s^^该信息还有附页继续说明()我方通知贵方,根据Rulel3bisPCT,在国家专利授权公告公开之前,上述微生物,仅可由请求人指定的专家获得样品,如果该申请被驳回、撤回或视为撤回,这条规定将从申请日起算二十年内有效。_D.说明适用的指定国家和地区0^菜说敏不^#对屏存嵐家浙邀^游话^E.对说明的单独补充Q^^"^^^y歡《aO_下述说明将随后提交至国际局(指明说明的常见性质,例如"保藏号")接收局专用()该表随国际申请收到国际局专用()该表于下述日期被国际局收负责官员负责官员(签名)178权利要求1.优化编码预定的氨基酸序列的核苷酸序列的方法,其中针对在预定的宿主细胞中的表达对所述编码序列进行优化,所述方法包括(a)产生至少一条编码所述预定的氨基酸序列的原始编码序列;(b)通过用同义密码子替换该至少一条原始编码序列中的一个或多个密码子,从该至少一条原始编码序列产生至少一条新产生的编码序列;(c)测定所述至少一条原始编码序列的适合度值和所述至少一条新产生的编码序列的适合度值,同时使用下述适合度函数,所述函数针对所述预定的宿主细胞至少测定单个密码子适合度和密码子对适合度之一;(d)根据预定的选择标准,在所述至少一条原始编码序列和所述至少一条新产生的编码序列中选择一条或多条选定的编码序列,所述适合度值越高,被选择的机会越高;(e)重复动作b)到d),同时在动作b)到d)中将所述一条或多条选定的编码序列作为一条或多条原始编码序列处理,直至满足预定的迭代终止标准。2.根据权利要求1的方法,其中所述预定的选择标准是这样的,其使得所述一条或多条选定的编码序列根据预定标准具有最优的适合度值。3.根据权利要求1或2的方法,其中所述方法在动作e)之后包括在所述一条或多条选定的编码序列中选择最佳个体编码序列,其中所述最佳个体编码序列具有比其它选定的编码序列更好的适合度值。4.根据权利要求1-3中任一项的方法,其中所述预定的迭代终止标准至少为下述之一测试是否至少一条所述选定的编码序列具有高于预定的阈值的最佳适合度值;测试是否所述选定的编码序列均不具有低于所述预定的阈值的最佳适合度值;测试是否至少一条所述选定的编码序列在所述原始编码序列中有至少30%的对预定的宿主细胞而言具有相关正密码子对权重的密码子对被转化为具有相关负权重的密码子对;和,测试是否至少一条所述选定的编码序列在所述原始编码序列中有至少30%的对预定的宿主细胞而言具有高于0的相关正权重的密码子对被转化为具有低于0的相关权重的密码子对。5.根据权利要求1-4中任一项的方法,其中所述适合度函数通过下式来定义单个密码子对适合度<formula>formulaseeoriginaldocumentpage3</formula>其中g象征编码序列,lgl为长度,g("为k-th密码子,",'(c(fc))是期望的密码子C^比值,且r/(C(&))为核苷酸编码序列g中的实际比值。6.根据权利要求1-4中任一项的方法,其中所述适合度函数通过下式来定义密码子对适合度<formula>formulaseeoriginaldocumentpage3</formula>其中vv((c(Ac(A+l))是编码序列g中密码子对的权重,lgl为所述核苷酸编码序列的长度,且c(Q为所述编码序列g中的k-th密码子。7.根据权利要求1-4中任一项的方法,其中所述适合度函数通过下式定义<formula>formulaseeoriginaldocumentpage3</formula>其中艮(g)=A'2>,(柳-《(柳cp/是大于零的真实值,y一(g)是密码子对适合度函数,y^(g)是单个密码子适合度函数,w((c(A),+l))是编码序列g中密码子对的权重,|g|是所述编码序列的长度,C(Q是所述密码子序列中的k-th密码子,C一(c(")是期望的密码子c(0比例,《(c(;t))是编码序列g中的实际比例。8.根据权利要求7的方法,其中c^在104和0.5之间。9.根据权利要求6-8中任一项的方法,其中所述密码子对权重w来自无终止密码子的61x61密码子对矩阵,或包括终止密码子的61x64密码子对矩阵,且其中根据基于计算机的方法,使用至少以下之一作为输入值来计算所述密码子对权重w:由预定的宿主的至少200个编码序列组成的核苷酸序列组;由所述预定的宿主所属的物种的至少200个编码序列组成的核苷酸序列组;由预定的宿主的基因组序列中至少5Q/^的蛋白质编码核苷酸组成的核苷酸序列组;禾口由与预定的宿主相关的属的基因组序列中至少5%的蛋白质编码核苷酸序列组成的核苷酸序列组。10.根据权利要求9的方法,其中针对可能的61x64密码子对中的至少5%、10%、20%、50%和优选地100%来测定所述密码子对权重w,所述密码子对包含作为密码子的终止子信号。11.根据权利要求6-8的方法,其中所述密码子对权重w来自无终止密码子的61x61密码子对矩阵,或包括终止密码子的61x64密码子对矩阵,且其中所述密码子对权重vv通过下式定义'。一maxW^(c,.,c》),"-'((c,,c》))其中所述组合的预期值"、"。))通过下式定义"n,。)x(c,.)《(。).s"r((q,。))其中OJ表示全基因组数据集中Ck的单个密码子比例,且"r((A,。))是高表达的组中对(c,,。)的出现,且其中所述高表达的组是其mRNA能够以至少每个细胞20个拷贝的水平被检测的多个基因。12.根据前述权利要求中任一项的方法,其中编码预定的氨基酸序列的所述原始编码核苷酸序列选自(a)编码所述预定的氨基酸序列的野生型核苷酸序列(b)预定的氨基酸序列的逆翻译,其中预定的氨基酸序列中氨基酸位置上的密码子随机地选自编码所述氨基酸的同义密码子;和(c)预定的氨基酸序列的逆翻译,其中根据预定的宿主或与所述宿主细胞相关的物种的单个密码子偏向性来选择预定的氨基酸序列中氨基酸位置上的密码子。13.根据权利要求1-12中任一项的方法,其中所述预定的宿主细胞是微生物细胞,优选地是选自以下的属的微生物Sflcz7/w、Jc""om;;cefo、■EscAen'c/n'a、》e/to附;;c&s、A/erg/〃ws、Pew'c/肌wm、《—veram3;ces、14.根据权利要求1-12中任一项的方法,其中所述预定的宿主细胞是动物或植物细胞,优选地是选自CHO、BHK、NS0、COS、Vero、PER.C6、HEK-293、DrosophilaS2、SpodopteraSf9和SpodopteraSf21的细胞系的细胞。15.计算机,其包含处理器和存储器,所述处理器被设置为从所述存储器读取和写入,所述存储器包含数据和指令,所述数据和指令被设置为提供给所述处理器进行权利要求1-14中任一项的方法的能力。16.计算机程序产品,其包含数据和指令,并被设置为可以负载于计算机的存储器中,所述计算机也包含处理器,所述处理器被设置为从所述存储器读取和写入,所述数据和指令被设置为提供给所述处理器进行权利要求1-14中任一项的方法的能力。17.如权利要求16中要求保护的计算机程序产品提供的数据运载体。18.包含编码预定的氨基酸序列的编码序列的核酸分子,其中所述编码序列不是天然存在的编码序列,并且其中所述编码序列具有针对预定的宿主细胞至少低于-0.1、优选地低于-0.2、更优选地低于-0.3的力^(g)。19.包含编码预定的氨基酸序列的编码序列的核酸分子,其中所述编码序列不是天然存在的编码序列,并且其中所述编码序列具有针对预定的宿主细胞至少低于-0.1、优选地针对预定的宿主细胞低于-0.2的和针对预定的宿主细胞至少低于o.i的y^(g,。20.根据权利要求18或19的核酸分子,其中所述编码序列与表达控制序列可操作地连接,所述表达控制序列能够指导所述编码序列在预定的宿主细胞中的表达。21.包含如权利要求20中定义的核酸分子的宿主细胞。22.生产具有预定的氨基酸序列的多肽的方法,所述方法包括在有助于所述多肽表达的条件下,培养如权利要求21中定义的宿主细胞,以及,任选地回收所述多肽。23.至少生产细胞内和细胞外代谢产物之一的方法,所述方法包括在有助于所述代谢产物生产的条件下培养如权利要求21中定义的宿主细胞,其中具有所述预定的氨基酸序列的所述多肽优选地涉及所述代谢产物的生产。全文摘要本发明涉及优化蛋白质编码序列以在给定的宿主细胞中表达的方法。该方法应用遗传算法来优化编码预定的氨基酸序列的序列的单个密码子适合度和/或密码子对适合度。在该算法中,重复进行产生新序列变体和随后选择适当变体的过程,直到变体的编码序列达到单个密码子适合度和/或密码子对适合度的最小值。本发明还涉及包含处理器和存储器的计算机,所述处理器被设置为从所述存储器读取和写入,所述存储器包含数据和指令,所述数据和指令被设置为向处理器提供进行下述遗传算法的能力,所述遗传算法用于优化单个密码子适合度和/或密码子对适合度。本发明还涉及包含预定的氨基酸序列的编码序列的核酸,所述编码序列针对本发明的方法中给定的宿主细胞针对单个密码子适合度和/或密码子对适合度进行了优化,本发明还涉及包含这类核酸的宿主细胞和使用这些宿主细胞生产多肽和其它发酵产物的方法。文档编号C12N15/67GK101490262SQ200780024670公开日2009年7月22日申请日期2007年6月15日优先权日2006年6月29日发明者约翰尼斯·安德列什·劳博斯,诺埃尔·尼古拉斯·玛丽亚·伊丽莎白·佩吉·范申请人:帝斯曼知识产权资产管理有限公司
网友询问留言 已有0条留言
  • 还没有人留言评论。精彩留言会获得点赞!
1