Method for Producing Human Liver Expressed Sequence Tags, Group F

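Document No.: 3531284 | Source: State Intellectual Property Office of China
Patent title: Method for Producing Human Liver Expressed Sequence Tags, Group F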
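Technical Field
The present invention relates to the field of biotechnology, specifically to a class of expressed sequence tags, and in particular to a class of human liver expressed sequence tags.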
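Background Art
The liver is the largest digestive gland in the human body and the central hub of metabolism. It is estimated that more than 500 chemical reactions take place in the liver. Experiments have shown that an animal whose liver has been completely removed survives for at most a little over 50 hours, even with appropriate treatment, which shows that the liver is indispensable for sustaining life. The liver is richly perfused: it receives about one quarter of cardiac output, or 1000-1200 ml of blood per minute. Its main functions are carbohydrate breakdown and glycogen storage; participation in the metabolism of proteins, fats, vitamins and hormones; detoxification; bile secretion; phagocytosis and defence; production of clotting factors; regulation of blood volume and water-electrolyte balance; and heat production. During the embryonic period the liver also has a haematopoietic function.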
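Liver diseases include hepatitis, cirrhosis, fatty liver and liver cancer. Modern medical experiments have shown that, after a hepatitis virus enters the body, it does not damage hepatocytes directly; it draws nutrients from the hepatocyte in order to survive, and replicates and multiplies inside it. Viral components produced during replication, such as the surface antigen (HBsAg) and the e antigen (HBeAg), are displayed on the hepatocyte membrane, and the immune system mounts a response against these antigens. It is this response that injures and kills hepatocytes. The strength of the immune response determines the degree of liver damage and the severity of the clinical symptoms. In about 25% of patients this virus-triggered immune attack on hepatocytes becomes persistent, and the liver damage worsens accordingly. The harm done by liver disease is by no means limited to the liver itself; it can give rise to many other diseases. Common ones include (1) diabetes; (2) pancreatitis; (3) biliary tract infection; (4) functional renal failure; (5) cholemic nephropathy; (6) glomerulonephritis; (7) renal tubular acidosis; (8) haemolytic anaemia; (9) aplastic anaemia; (10) myocarditis and pericarditis; (11) polyarteritis nodosa; (12) peptic ulcer; (13) spontaneous peritonitis; (14) disordered sex hormone metabolism; (15) altered thyroid function; (16) hepatic osteodystrophy, and so on. Liver disease not only endangers the patient's health and even life, it also takes a heavy psychological toll. Both patients with liver disease and virus carriers are seriously affected in daily life, social contact, employment and education.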
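Transcribed and expressed sequences (that is, genes) account for only 3-5% of the total sequence of a genome. Sequencing this fraction leads directly to the discovery of new genes and yields the genomic information most relevant to industrial application. In the 1980s, the advent of high-throughput automated sequencing made it possible to pick many cDNA clones at random from a plasmid complementary DNA (cDNA) library and determine a few hundred bases of non-vector DNA sequence from each end. These short DNA sequences are called "expressed sequence tags" (ESTs). The concept of the expressed sequence tag was first proposed by Adams et al. in 1992 (Nature, 355, 642-644). In 1992, in response to the pressing need for large numbers of messenger RNA (mRNA) sequences, Sikela and Matsubara (Sikela, et al. Nucleic Acids Res. 19, 1837-1843; Matsubara, et al. Nature Genetics, 2, 173-179) proposed a research strategy of large-scale cDNA sequencing. Venter subsequently established large-scale expressed sequence tag technology. In essence, many cDNA clones are picked at random from a cDNA library of the target tissue constructed in a plasmid vector, and a single round of DNA sequencing is performed on both ends of each cDNA using universal primers carried on the plasmid; the result is a short non-vector DNA sequence of a few hundred bases from the 3' or 5' end. In short, an expressed sequence tag is a short DNA sequence from the 3' or 5' end of an expressed gene fragment and represents part of the transcript of one expressed gene.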
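Expressed sequence tags can be used for cloning new genes, mapping the human genome, identifying the coding regions of genomic sequence, and so on. If an expressed sequence tag occurs only once in the genome, it can serve as a sequence-tagged site (STS). A physical map built from expressed sequence tags is called an expression map or transcript map. Using expressed sequence tags for gene mapping speeds up both the generation of sequence-tagged sites and the chromosomal localization of new genes. Expressed sequence tags can serve as gene-specific probes and therefore play an important role in studying tissue-specific gene expression. They can also be used to analyse the evolutionary relationships of new genes: expressed sequence tags can form a database covering the genes of all animals and plants, and comparing the sequences yields conserved sequence segments and, from these, an evolutionary map of the genes. Because of these advantages, expressed sequence tag sequencing has become a major focus of many genome research institutions.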
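Because the genes expressed in human liver that are the subject of the present invention are associated with certain liver diseases, studying the expressed sequence tags expressed in human liver is of great significance for exploring the pathogenesis of liver diseases and for developing drugs to treat them.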

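Summary of the Invention
The technical problem to be solved by the present invention is to provide a class of human liver expressed sequence tags.
This technical problem is solved by the following technical solution. The invention provides the sequences of a class of human liver expressed sequence tags, comprising (a) the sequences shown in SEQ ID No. 1 to SEQ ID No. 50; (b) the complementary sequence of each of the sequences shown in SEQ ID No. 1 to SEQ ID No. 50; (c) sequences having at least 70% homology with each of the sequences shown in SEQ ID No. 1 to SEQ ID No. 50; and (d) a combination of one or more of (a) to (c) above.
Preferably, the sequences comprise the sequences shown in SEQ ID No. 1 to SEQ ID No. 50.
The invention further provides a probe molecule containing about 8-100 contiguous nucleotides of the above sequences.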
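The sequence classes (b) and (c) and the probe definition above can be illustrated with a minimal Python sketch. It is not part of the patent: the function names are hypothetical, "homology" is approximated here by ungapped percent identity (an assumption, since the text does not define how homology is measured), and windows containing the ambiguous base `n` are skipped as a design choice of this sketch.

```python
# Illustrative helpers only; names and the identity measure are assumptions.
COMPLEMENT = str.maketrans("acgtnACGTN", "tgcanTGCAN")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence ('n' stays 'n')."""
    return seq.translate(COMPLEMENT)[::-1]


def percent_identity(a: str, b: str) -> float:
    """Ungapped percent identity of two equal-length sequences.

    Positions where either base is 'n' are ignored.
    """
    if len(a) != len(b):
        raise ValueError("this simple measure needs equal-length sequences")
    pairs = [(x, y) for x, y in zip(a.lower(), b.lower())
             if x != "n" and y != "n"]
    if not pairs:
        return 0.0
    matches = sum(x == y for x, y in pairs)
    return 100.0 * matches / len(pairs)


def candidate_probes(seq: str, length: int = 20):
    """Yield each contiguous window of `length` nt without ambiguous bases."""
    if not 8 <= length <= 100:
        raise ValueError("probe length should be within 8-100 nt")
    s = seq.lower()
    for i in range(len(s) - length + 1):
        window = s[i:i + length]
        if "n" not in window:
            yield window
```

For example, `reverse_complement("gaaaaagtat")` gives the complementary strand of the first ten bases of SEQ ID No. 1, read 5' to 3'. A real homology comparison would normally use a gapped alignment tool rather than this position-by-position count.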
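The expressed sequence tags of the present invention, which are expressed in human liver, make it easy to find the related genes expressed in human liver, and can therefore play an important role in studying the pathogenic mechanisms of liver diseases and in developing drugs to treat them.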
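Detailed Description of the Embodiments
The invention is further illustrated below with reference to specific examples. It should be understood that these examples serve only to illustrate the invention and do not limit its scope. Experimental methods for which no specific conditions are given in the following examples generally follow conventional conditions, such as those described in Sambrook et al., Molecular Cloning: A Laboratory Manual (New York: Cold Spring Harbor Laboratory Press, 1989), or the conditions recommended by the manufacturer.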
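Example 1: Isolation of mRNA from human liver tissue
Tissue isolation: Livers were obtained from five adult males. Immediately after hepatectomy, the liver tissue was frozen and stored in liquid nitrogen.
mRNA isolation: The liver tissue was taken out, ground in a mortar, and added to a 50 ml tube containing lysis buffer. After thorough shaking it was transferred to a glass homogenizer, homogenized, and moved to a fresh 50 ml tube, and total RNA was extracted (TRIzol Reagents, Gibco, NY, USA). The quality of the total RNA was checked by formaldehyde denaturing gel electrophoresis. mRNA was isolated from the total RNA on an oligo(dT)-cellulose column and quantified.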
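Example 2: Construction of the cDNA library
Double-stranded cDNA was synthesized using the mRNA as template. After the ends were blunted, adaptors containing an EcoRI site were added. The EcoRI ends were phosphorylated, and the cDNA was digested with the restriction endonuclease XhoI for 1.5 hours and then size-fractionated. Fragments longer than 500 bp were selected on a column, extracted with phenol-chloroform, precipitated with ethanol, dissolved in sterile water, and ligated into the Uni-ZAP XR vector (Stratagene, CA9203, USA). Packaging was carried out with the ZAP-cDNA Gigapack III Gold Cloning Kit (Stratagene, CA9203, USA), using XL1-Blue MRF' bacteria (Stratagene, CA9203, USA) as the host. The library was plated and its titre determined.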
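Example 3: Sequencing and database construction
Clones in the library that carried a foreign insert were picked and amplified, and the plasmids were extracted (Qiagen, Germany). With T3 and T7 as the universal primers for the 3' and 5' ends, large-scale EST sequencing was performed on an ABI 377 sequencer (Perkin-Elmer, USA) using fluorescently labelled terminators (Big-Dye, Perkin-Elmer, USA). Vector sequence was removed from the reads with the FACTURA software, and the results were transferred to a SUN Ultra 450 Server for further processing. All sequences were then searched against the existing databases (GenBank + EMBL) with the BLAST and FASTA programs of the GCG software package (Wisconsin group, USA). Sequences with no homology, or with less than 95% homology, to known entries were treated as novel genes and used to build the database.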
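The novelty filter at the end of Example 3 can be sketched as follows. This is only an illustration under stated assumptions: the search itself (BLAST/FASTA) is not reproduced, the hit list is assumed to have already been reduced to hypothetical `(est_id, percent_identity)` pairs, and percent identity stands in for the "homology" figure named in the text.

```python
from typing import Dict, Iterable, List, Tuple

NOVELTY_CUTOFF = 95.0  # percent; the threshold stated in Example 3


def select_novel(est_ids: Iterable[str],
                 hits: Iterable[Tuple[str, float]],
                 cutoff: float = NOVELTY_CUTOFF) -> List[str]:
    """Return ESTs with no database hit, or whose best hit is below `cutoff`.

    `hits` is an assumed, pre-parsed list of (est_id, percent_identity)
    pairs; an EST that never appears in `hits` counts as having no homology.
    """
    best: Dict[str, float] = {}
    for est_id, identity in hits:
        # Keep only the strongest hit seen for each EST.
        best[est_id] = max(identity, best.get(est_id, 0.0))
    return [e for e in est_ids if best.get(e, 0.0) < cutoff]
```

With ESTs `a`, `b`, `c` and hits `[("a", 99.0), ("b", 80.0), ("b", 94.9)]`, the function keeps `b` (best hit below 95%) and `c` (no hit) and drops `a`.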
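Example 4: Cloning of full-length cDNA
Starting from the sequence information of the novel gene fragments obtained, full-length cDNA cloning was carried out using the following approaches.
(1) "Electronic cloning". The novel gene fragment sequence was used as a probe to search the dbEST database. Expressed sequence tag (EST) sequences that overlapped the probe by more than 50 bp with at least 98% homology were regarded as belonging to the same sequence (consensus sequence); they were retrieved and joined with the AUTOASSEMBLER software, and some of these ESTs extended the probe sequence. The STRIDER software was then used to analyse whether the extended sequence contained a complete open reading frame (ORF), and BLAST searches of GenBank or SwissProt were used to determine whether the sequence showed homology to other species at the nucleotide and amino acid levels, to help judge how complete the full-length gene was. Electronic cloning can usually yield the full-length sequences of human liver-related genes.
(2) Rapid amplification of cDNA ends (RACE). If electronic cloning did not yield a complete full-length cDNA, primers were designed at the 5' or 3' end of the known sequence, and long-distance PCR was performed on a human liver Marathon-Ready cDNA library (Clontech Lab, Inc., USA). The PCR products were then cloned and sequenced. The AUTOASSEMBLER and STRIDER software were used to analyse whether the extended sequence contained a complete ORF; if not, the procedure was repeated until the full length was obtained.
(3) RT-PCR. Where the sequences at the 5' and 3' ends are known but a gap between them cannot be filled from the existing public databases or the in-house database, RT-PCR can be used. A primer was designed at the 5' end of the sequence, oligo(dT) was used as the 3' primer, and amplification was performed on liver total RNA. The products were then cloned and sequenced, and the sequences were assembled to give the full length.
By combining the three methods above, the full-length coding sequences of human liver-related proteins can be obtained.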
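The "electronic cloning" step in item (1) can be illustrated with a small Python sketch. It is a simplified stand-in, not the AUTOASSEMBLER or STRIDER algorithms: it assumes an ungapped suffix-to-prefix overlap on the forward strand, uses percent identity as a proxy for homology, and scans only the three forward reading frames. All function names are hypothetical.

```python
STOP_CODONS = {"taa", "tag", "tga"}


def overlap_extend(probe, est, min_overlap=51, min_identity=98.0):
    """Extend `probe` at its 3' end with `est`, or return None.

    Accepts the longest ungapped overlap (suffix of probe vs prefix of est)
    that is longer than 50 bp and at least `min_identity` percent identical,
    mirroring the thresholds given in item (1).
    """
    max_len = min(len(probe), len(est))
    for k in range(max_len, min_overlap - 1, -1):
        suffix, prefix = probe[-k:], est[:k]
        matches = sum(x == y for x, y in zip(suffix, prefix))
        if 100.0 * matches / k >= min_identity:
            # Keep the probe bases in the overlap; append the new EST tail.
            return probe + est[k:]
    return None


def longest_orf(seq):
    """Return (start, end) of the longest ATG..stop ORF, forward strand only.

    `end` is exclusive and includes the stop codon; None if no ORF is found.
    """
    s = seq.lower()
    best = None
    for frame in range(3):
        start = None
        for i in range(frame, len(s) - 2, 3):
            codon = s[i:i + 3]
            if start is None and codon == "atg":
                start = i
            elif start is not None and codon in STOP_CODONS:
                if best is None or (i + 3 - start) > (best[1] - best[0]):
                    best = (start, i + 3)
                start = None
    return best
```

In practice the extension would be repeated with further ESTs, on both strands and at both ends, until `longest_orf` reports a reading frame with both a start and a stop codon; the RACE and RT-PCR steps cover the cases where database sequences alone do not close the sequence.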
Sequence Listing
<110> Shanghai Human Genome Research Center (上海人类基因组研究中心)
<120> Human liver expressed sequence tags, group F
<130> NP-10044
<160> 50
<210> 1
<211> 562
<212> DNA
<213> Homo sapiens
<400> 1
1 gaaaaagtat attcaataaa ttttgtcttt aaaacagagc tcttgaccta taaagtataa
61 aaagtaatta caatgaaata ttcttcagta aatctgacac tttgggattc caggcaaaag
121 gatcgcttgg gtgccaagag ttcaagacca gcctggtcaa catagtgaga ttctatctct
181 gactctgcag ttcactctgc tgcttcagat gaaaattttc aggtctgtct gccactgtag
241 tgaaggactg ctttgggtag tgtctgtgga gaaacttttt aaaggacata gttaaaatat
301 tgtgctacaa accaattctt gaacaccaat cgttttgtcc tacctctctg tgacagtgaa
361 agatggctgt gctcttggca aaggtcatct ctctggtcat gattctggat ctccttttct
421 tctcctgatt atatgtcact cagttatccc cactggtctt ccacaacctt taacctcagc
481 ttccccaatc ctaggagact cttctcgcag gctcnaggct ctcctaccct cccatcaatc
541 ctcaatctac cccaaaaagc ca
<210> 2
<211> 437
<212> DNA
<213> Homo sapiens
<400> 2
1 acctgccctg acctccgaca gtgctgggat tacaggcata gcacccgtgc ctgaccaggg
61 ctcttttagc aaggaaaacg tgaggaatga atggctgttg gtgtgcaaca aatcatactt
121 gctacatgtt gtgaaacctg aagttatttg ttagtctgta tgaagaatgt accccagaga
181 tgcaccctgt atctgcctta tgtctcttac aatggaggtt ctctcctgtt atattgctga
241 ataaactatg tgaactgcta aattgactga agactagtta gtttatcact aactctttaa
301 tgaagaattt cactatccta caactcctcc taaaaagtga aaatttactt tngagagagt
361 gctttcgaga atagtattat gatcagtatt attgccccta aattttttaa ttattagaga
421 atgcctnatg ttacaag
<210>3<211>590<212>DNA<213>Homo sapiens<400>31 catatacttt atgctttact tcttacttca aacatacatc cttgattaaa gaacaaaaac61 aactcactga agggtgataa aataatcaaa atgtatttgc ccctgaaagt ggaattacta121 cctcaaaaag aatacaactc tttcattttc cccaaaataa tcatgtgagt tcatgggcat181 gctcatcact cgctgtctgt gtggaagaga agatcgaaga gggatttact ggactgaact241 ggcctaggaa gcctttgctg gcatctctca gactggactg cagcccagat ccttttactc301 agatgcatgc acttagaaca tgaaaacant aagataaacg ctggtggtat tattacctta361 tattgaggaa cattttatga cttcctaact ctgntttcaa tagaaacatc cccacttaat421 gaaattgtca ataaatgctg gctcaaacca ccctccccaa atactggaaa aacaggtaca481 tccggtttct cnggtacccc tggncaagtt gtctggcaat ggcttggcga tttttcnact541 gaggttagac aacttgngat ttttttcccc ttcttnaaca cnaatttaaa<210>4<211>587<212>DNA<213>Homo sapiens<400>41 ttttaaaggg gaaaaatctt taattgattt ctcaggtaat ttttttccag attgtacata61 aagtgttctt atgttctcta tttggatgtt tcaggagaca tacaaatgaa atacagtaca121 tagacaaatg aaatgctaat aagaagtagg atngatatta aaatatcctt actttgcctg181 tatggaacaa aggcagtcta ctccatcggg gaatcaaagc aaatgtgaat aagaggtttc241 caccttgcaa aactgtgagc ttcatttgcc ctggagagaa ctactaggca aggcttcaca301 tgacagtacc tngtagggat gtccatgggg ctgtgaagca ggtgtattgt atccgaacat361 gcagctgcaa tgaaggcgtg gagctcagga gaagcagcca ctgacaatgc cccctcctgg421 ccgcagggtc ttctgtgcag cctcccatgg atcaatatac ccagttagtt aaaacaccca481 tccagtantt tctggtccct tagcaaccga tggatgatan ttgccnccag ccgagtgtga541 tcngtgangc cgattttaaa agcagncccg gggttcnacc cggantg<210>5<211>508<212>DNA<213>Homo sapiens<400>51 tttttttttt ttttttatga tttcacaaat cacatggttt attgccctac agagatgata61 tttttcctag gttcacaatg cttanggagg gggagaacaa catgaacaaa tctcacaggg
121 aatgcgctgg gtagacaggg tgtattagtc acggggtgag tcgagttcca aagttactaa181 agtctacttg ctcaattagg atggaagcag gcctctgtgg agagcgaanc tcattaaaat241 gtggtgtgtc tcttttataa gagcagctct tgggagagaa gctgtccaat gcaacggaga301 ttcacagaga catatatngg tccctcaagg ctagagggtg gcactcggtg tttgtgggcg361 atgcttccaa gttcaaaggc ttagcagggt ggggccctgt cnaagagagg ctgacagagt421 gttagcttgc cacagcccaa gctctcaaag tgtctggtcg aaaccntaag gttttttaca481 aaagttttgn aagtttcagg tgggtttg<210>6<211>564<212>DNA<213>Homo sapiens<400>61 gttttgtctt cctttattac atttttgctt ttgagataca aaccaacctc tcataccaca61 caaagaacat ccattcttgg gatttttaaa gtgcatttcc ccttgaactc tgtgtacaaa121 aatatttatc ttttaaaaca tgcaaaaatt tcttgacaag gcacttttag gtataaaatg181 aagatgagtc cttggttcta cattcacact gaagtaatag tgaaacatca tcacagctgc241 actctcaaag ccctcagagg tccagcagtc tctaaaaact cgtcaacaag actaaaaaca301 ttcacggctt tacaatgtgg gttacagagc tttacaacca tggaccagga aaaactgctc361 gtaacaacag ctgtccttcc cagttccaca tggtgttggt ctcactggca gggaaattaa421 ggataagctg gccaaattgg tcnttggtgg tggaccccaa taagantccc atcatcccat481 ggtnaccttc ctggtccccc ganttccagt ggacatggtc aggagagacg ttgatcgggg541 actcccccag gnctacattc aggg<210>7<211>467<212>DNA<213>Homo sapiens<400>71 tttcataaag aaaagaggtt tatttggctc atgattctgg tcgctggaaa gtccaagact61 gggcagttgc atctggtgag ggactcacat tgctttgact catggtggac aggggacggg121 gagccagcat taatatagaa gagcagggnt cccctgcctt gaggaggacc aatgcaggaa181 aggccaaaga gcagttagaa atcagcatcc cgagagantg caagatnttn tgaccaaggt241 ntctaantat caaacgcaag tcaggcggta atgacttctc ggagggagct ggcttcggct301 caggccnttc cgggtggggg gagaggaaac tgaagacaaa actntgcttc tcgtaaaggt361 ctatttccta aaattaaaga cagagttagc ctagtgagtt tctgcaggga tgcttgtctn421 agttagagaa gttaccagag ccntgccnag gttcttgagt tagggca<210>8
<211>482<212>DNA<213>Homo sapiens<400>81 tttaatcagg tttttcacta tacagtgatg ggggtagaag atggtgactg ttttcaagca61 aattcacgag gcacatgttc gtccctgccc caaagtgaac agtctgggct tcccagaaca121 gaaaagtgct ttccttcctg ggggaatccc attcctgagc tgacaagaca gatttcagta181 agaatgagca caaaggatag ggcaaaatag tgaagggagc caggtgcata tttgaattct241 ttccagtgca gactggaact agacatgcag gtatccctcc taaaggcaac gccaaatccc301 agccattccc actagaggcc aaaccgcctg cccacagaga ttgacagcca atgttcatct361 cataactntc ctcccagcag tgcaccagta aactcaggtn gcctgagtgc tttntgggca421 ccacacaaca gntgcgggcn ttctnttcat tgggccctng ggtgctgntg gggtntgctt481 gg<210>9<211>348<212>DNA<213>Homo sapiens<400>91 nntaatanta antagacagn ctttattnng nanantnagn nnnagnncaa nagnanctat61 anngagtnca taaaagcttg tttttaatta cttataacaa agcacaaatt acttctaaca121 aggcaaatat ttattgacat ataagacatt ctgctaggca ttgtaaaaan nnnnnnnnnn181 aagtaatggc ttttgtatac ataggtactt aaggcagnca tccnaangng ggnaaaatgc241 ttaatgagtg anggtagcnc tgnnncntgt tgacttttac tctagggttt gtttttaggt301 gggggttagg tnttttcccc tacggtantg tacntgtctt taggcccc<210>10<211>1240<212>DNA<213>Homo sapiens<400>101 acatacatcg ggtgtgagca cggaatagct gggcggcggc tggtccacag acactgggag61 acagaagggg ccggtcccca ggcgcccgtg ctgcagcagt gggatccact ggggagaggc121 tggagaccac cggtgtgaag gcctcgcggg taaattcact gcagctggac ttgccaaaga181 tgacctggga gagtggattc atgagggacc catgggggat gccctacccc ccttccctgg241 ggcacagcca ctgaccggca gagcctggct ggggtcctcg cctgtcatgt actgcactcg301 cacggcaagg ttgcgcacgg agccctggcg gctgctgaag ttgaggctgt tataccttag361 accctcagtc atgccagtgc ctgctctgtg cctgctctgg gccctggcaa tggtgacccg421 gcctgcctca gcggccccca tgggcggccc agaactggca cagcatgagg agctgaccct
481 gctcttccat gggaccctgc agctgggcca ggccctcaac ggtgtgtaca ggaccacgga541 gggacggctg acaaaggcca ggaacagcct gggtctctat ggccgcacaa tagaactcct601 ggggcaggag gtcagccggg gccgggatgc agcccaggaa cttcgggcaa gcctgttgga661 gactcagatg gaggaggata ttctgcagct gcaggcagag gccacagctg aggtgctggg721 ggaggtggcc caggcacaga aggtgctacg ggacagcgtg cagcggctag aagtccagct781 gaggagcgcc tggctgggcc ctgcctaccg agaatttgag gtcttaaagg ctcacgctga841 caagcagagc cacatcctat gggccctcac aggccacgtg cagcggcaga ggcgggagat901 ggtggcacag cagcatcggc tgcgacagat ccaggagaga ctccacacag cggcgctccc961 agcctgaatc tgcctggatg gaactgagga ccaatcatgc tgcaaggaac acttccacgc1021 cccgtgaggc ccctgtgcag ggaggagctg cctgttcact gggatcagcc agggcgccgg1081 gccccacttc tgagcacaga gcagagacag acgcaggcgg ggacaaaggc agaggatgta1141 gccccattgg ggaggggtgg aggaaggaca tgtacccttt catgcctaca cacccctcat1201 taaagcagag tcgtggcatc tcaaaaaaaa aaaaaaaaaa<210>11<211>1109<212>DNA<213>Homo sapiens<400>111 actggaagac caggcagccc agctgaaggc agtaagctcg gctcacagtc gcaggagagt61 tctggggtac acgggcaaag gggcttgaga aggcccggag gcgaagccga agagaagcaa121 ctgtgccccg gagaagagaa gctcgcccat tccagactgg gaaccagctt tcagtgaaga181 tggcagggcc agaactgttg ctcgactcca acatccgcct ctgggtggtc ctacccatcg241 ttatcatcac tttcttcgta ggcatgatcc gccactacgt gtccatcctg ctgcagagcg301 acaagaagct cacccaggaa caagtatctg acagtcaagt cctaattcga agcagagtcc361 tcagggaaaa tggaaaatac attcccaaac agtctttctt gacacgaaaa tattatttca421 acaacccaga ggatggattt ttcaaaaaaa ctaaacggaa ggtagtgcca ccttctccta481 tgactgatcc tactatgttg acagacatga tgaaagggaa tgtaacaaat gtcctcccta541 tgattcttat tggtggatgg atcaacatga cattctcagg ctttgtcaca accaaggtcc601 catttccact gaccctccgt tttaagccta tgttacagca aggaatcgag ctactcacat661 tagatgcatc ctgggtgagt tctgcatcct ggtacttcct caatgtattt gggcttcgga721 gcatttactc tctgattctg ggccaagata atgccgctga ccaatcacga atgatgcagg781 agcagatgac gggagcagcc atggccatgc ccgcagacac aaacaaagct ttcaagacag841 agtgggaagc tttggagctg acggatcacc agtgggcact agatgatgtc gaagaagagc901 tcatggccaa agacctccac ttcgaaggca 
tgttcaaaaa ggaattacag acctctattt961 tttgaagacc gagcagggat tagctgtgtc aggaacttgg agttgcactt aaccttgtaa1021 ctttgtttgg agctggcacc tcttgaaata aaaaggagga tgcacgagct ggcaggcatg1081 caaaaaaaaa aaaaaaaaaa aaaaaaaaa<210>12<211>476
<212>DNA<213>Homo sapiens<400>121 tcatttttat tacaccaaat taaagttggg agttccaaga tcccattcca gttaatattt61 aaccaaggtc taaaatttga ttttttttaa atctttgaat cctcccttct gcccctcatg121 gatccttggt tttaatgata tggaaacatc taattcttag aattattccg tagcttttgc181 tgattactct gagatttcag ttaagacttg tttcaaaaga cagatagctg actggttcat241 aatacattgg aataagttgg atccaaacta ataagaataa gctgtacagg aactaagtgc301 tcaatataca ttgtataaat ttgtggnaat ctcctggatg tgaattgtta cttcaagtgg361 cctttattaa gattttctca gacttacctt gggaggttaa agcaaacccc aatgnggaat421 aattttggtt acagagcnct gcctttataa ttttggaata aaggttcaat acagac<210>13<211>500<212>DNA<213>Homo sapiens<400>131 ctgtctgaat caactgtacg agttaaacag ttttatttcg aaattaacca acataaacaa61 caacaacaac aacaacattc actacccatt ccctcgttag gttggaaaac cagtctgaaa121 aaataaatac aatctgtgac catgaaaagg agcaatcact tatttttccc atatgttttg181 aacattattt ttaaaaatag attgggaatc tggagggtaa aatgccgggt cccttcccga241 gcccttagag cccgaacgtt gtcgaatcgg tcaatgtcca ccccgctgct cacctctgtc301 ccagcagcgc cggggccagc gcgccctccc gccgcgtctg ggagctggcg gggaaaagca361 ggtccccggg gggtatcgag tgtccaggga tatccacgca gacatccttg tacttgcaaa421 tacaaagaaa cacacaggac agagatcagg ctcgcgcagg gctcgatctc ctcggggctc481 agggttggag ggccgtacan<210>14<211>474<212>DNA<213>Homo sapiens<400>141 gagacctttg tggatgcact caggatacat tcctccttgc cccatcccct ttgctgcctc61 ttattcattc aacaaatatc ctgaaggatg agaaaaaggc taggcttgca gagtgctatg121 gatacaggaa acagtatata aaaaagttcc agatggtggc tcacacctgt aatcccagca181 ctttgggagg ccgaggtggg cagatcacga ggtcaggagt tcaagaccag actggccaac241 atggtggaac ctgtctctac taaaaataca aaaattagct gggcctggtg gcgggcgcct301 gtaatcccag ctagtcggga ggctgaggca ggagaatcac ttgaacctgg gaggcggagg361 ttncagtgag ccaagatttg agtcactgca ctccagcctg ggtgacagag caagactcca
421 tctcgaaaaa agttccagan gtgggaaaga gcntgctacc ntctgggggc taan<210>15<211>611<212>DNA<213>Homo sapiens<400>151 cctttaataa taaaaaagaa aaccaaaacc tcctataatt tataagctat gtttgactat61 ctacattata tagaaaatat agactctgtt tactttataa cacacatctt tttcccttga121 taaataactc tttaaaatat ctagtataca tcgccttgac ttcttatata tataagtttg181 gttttataca tatacataca tatatactca tgtttgtaaa acacaataaa tatatactcg241 acaatgacag ccaaacaaat gccattttgg ttaaaaaaca caacaacaac aataaaaagc301 agatgaagta ctataaaagc accaggcagc agacagaaag ccacatttgc tagaaacttc361 tccctcccct gggcaaggca caggggcaga atactaaata cgtccaggtg cctgaaagag421 aaggaaggca gcaaaggaat gggtcagatc acaagctttt ggttttgttt cttagcctgg481 ggattagacc aagaatcaca agtaagtcat tgcgtttata agggaaaacc aggggctcat541 tcaacagctt agccctgtgg tggctgcngg gacagtagag tctgggaggg ggaaggagag601 agcngtgacc t<210>16<211>474<212>DNA<213>Homo sapiens<400>161 gaatttaaat catttatttt cacttattac taatcttact acagagaata atacaatact61 ataatgtatc atcctcagta aattaatctt cacttagaaa atgttactga taaaatacaa121 catttagaat tttcatttaa aaatacaaat tttatgaaaa tattaagaat ttttaggatg181 ggaattataa aagtcaatgc tgaacaaaat tctactttta gatagtatgt atcttaaaat241 attatgaaaa atattcttta ttctgaaaac aatgtcagga aagaatacct gagttctcta301 aaaccactag ttctaatttc aaattcgctg ttttggtaac ataaagaaaa cacatttaca361 atttaacact ggcagttacc tgtcaaataa tttaggaata gaataaatgt tgcatatacn421 ttagggttgg acnttatata tattttaaaa cncaggggtg ggtttcattt taat<210>17<211>582<212>DNA<213>Homo sapiens<400>17
1 nagctctatt aagttgcaat tttattttta taaagctttg taaagttatg aacatggaaa61 ccctaagaat cacctaagct ttttacataa gtaaatttca agttgacagt ctgtccattt121 tgaactattt ataattaaca ggacacttaa cacagtaata tcttagacaa ggaagagggc181 attaattctg caggcaggta gcaatgaaaa tgccaaacct gaacaatcag tcactatttc241 cttatacagt ttgagtaaaa gaggtagact acagaaagaa aaaaaaaagt gaatataaaa301 tgtatcttcc ccaaatacaa ctctacttca ataagtcttt gaagattacc ggatgacatc361 caacaaaaat ggaatggaac tcaggggaat ggagttcttt ctccccatcc aaaggaaaaa421 aacagcatgg ggtaattttt naatttggaa aaanaaaaga atgggcattt ggcaagcnaa481 ggaaancccg nttcccattc cacctatttn cattggtcca ttcccngggg ccaaaatnaa541 tattnccttn ggtaatttaa aaaaaaancc cttttttaaa cc<210>18<211>600<212>DNA<213>Homo sapiens<400>181 aacttattta tagaaatgag gtctcactat gttgcccagg ctggtctcat acttctggcc61 tcaagtgttc ctcctctttg gcctcccaaa gcactgggat tacaggtgtg agccaccatg121 ctttggcatg nggngctatc gcagtaggag agcaggggtg gtggtagtgt ctccatatgt181 actctcacta aaactggtgt gggcagatac tggatgccta cttatttagg ggcatctaaa241 ttcctgtgct aaattagaaa atagaggcag ctcttgaaag gagtgatggc atgaaggaga301 gatgcttgga ccaaccacca cacaaccctt ataaataatt gataaccttg gtttttgatg361 ccacgtcctc ttcaacagtt gttggttgaa atgacacatg attccaggtg agcttttcca421 gataaattgc cctgggaact tcagactttg tcgggtgaat cgatgtggaa tttcagaatg481 ggctggaatg tctaagatgg gcatgtccat attcagctca gtgtggggaa actggtggtg541 gccagggctg gttgcaagct gagttgtaag ggtgacagtt caccattggt tatttcnggg<210>19<211>598<212>DNA<213>Homo sapiens<400>191 tgataggcaa aaggttttaa ttgtatagat taaaattaac tttggacaaa aattaaaact61 caggcagaga atgtcttctt tttgcaacag cagacactag taaaaacaaa ggcacagtaa121 aaattgagac cccaaatttg cagtgtagag atatgaatat aataatagac acaggcaggg181 aggattaata aatgataaaa tgtttagagg atgatcatta gaatacagga tatttatact241 cttgaaaacc gctttcccaa gtacttcatt ataagtaagg tgtctctaaa agggacagat301 ctcctagacc cctccttaac caagtaacca gtcctgatat cataatggtg atggacaaac361 tagaccttct ctgcccgcag atgggctgag gttggaaact cacagcattg tctctgcagt421 gttcctgggc aaaacgttta ggctgaattt aatcatgaag acattttcag acaacttcag
481 aatggtagat cnctggagcc agacagctga cctggtcttc cataaaccag tccatgtcac541 caccatccnt ggaccaccac caaaagatga ggaattttgg ggttccaaat actaagga<210>20<211>534<212>DNA<213>Homo sapiens<400>201 aggggtgtat ggttttattt ttttaaaaat gtatatgctc aataacagtt tacttggaaa61 ttttcactag aagaaataac atgaaagaac accagcataa atacataata catactttaa121 ttgtggctat attaaatgta tcttaatgta ataccgctgt aacatggtat catctcattg181 ttaaaaatta aaaaaaaaaa aagaacatag ggaagtgaaa ctaaaaaact atagaatcct241 ggattctgtg gcagatggag tacccttatt taagctagaa aaggagccct gtaaggtaga301 aagagaaaag aggcaagcaa tctgaaacca tcatgttaac caactgtgag tttttaaaaa361 tgtggttaaa atgaatctaa gaatgccttt taacacagct cacactttaa tgaacaccag421 agaattcnca gcaagactgg taaagcctag gaatgtaaag ttattggtgg ggntttagcc481 acgaccctca cctcactcct acctatatgc caaaggagga angaaaagtc ctta<210>21<211>547<212>DNA<213>Homo sapiens<400>211 gaaaaataaa catctttatt tttttgccta ctttatttca ttttttcaaa taaaatttaa61 atctgtacaa agtatactgt tacagtatat attttgtaag aatcaatgcc taaaataatc121 acaatacttc aataagcagt acagcagacc tcgctagttt tcagctttga tattgaacaa181 actcaagccg gctgatgcac aacacgtttg cttggtttcc acatggtgat ttcccagcac241 tgagatggga gaacatgaca gcaaatatgg taatattaca gcccgacaca ctgcgtttct301 tcatgtgata ataactgcac atatttaata cagaatgctc aaatttactt tttaaattgc361 atttgcttac ttcttagttg ggcaaaattc agcattctta aatggggtcc ccaaacagtg421 gnaatttaat gntattcnaa ctggtatatt gggcacatca cccnataggg gttacatggt481 aactcncaag gatatcnaca acttagtccc ncaattttta cctaatatcn tctgggncca541 tttaacn<210>22<211>1231<212>DNA<213>Homo sapiens
<400>221 caaagaggaa atggaggttt tgcctgagcc ccctcctcca ataaatagga aaaaagacaa61 gagctatgct acagctatgg gaccctttct taggcaagag gcattaagtg gggagctctt121 agcctgcctg gtaatacaag attgacaggg caattgggta tataaaccca tttcttttaa181 catttataaa aagttaagaa aaagcattag aggctgaaac cgcgtggcca aatgggtggt241 aggcagaagg aaatacttgg cagcaaagga agctcgcaga cctggagcca gcaagtgctt301 ccaggaactc agcctgcaca gaggcagcag tgacaaggag ccaggcgcag cacaccagag361 ctggcctgat agggaggggc aggaggcgtg cacgaaggtc cgcccagcgg cagcagggtg421 agggagcagc acagacaaaa ggtggcacag acaaaaagca gcgcctgagc aagcgcaaca481 tgaccgccgc cccgggaccc gccggctaga ctctcttggc tccgcgggtg gcccatgggc541 aaaatttcat gtgttccttg tatacaagcg acatcccaga ttataattct ctgctaagat601 ttaagtacaa tttaagaatt taaaacacct ctttctaata atggccactg ttgcttttac661 actattcccc tggcaaagca agacagaaaa atttgcattt acaataccag ctatcaatat721 tgaaaggtca gcttgccgat ttcattggaa agtgcttcct caaggaatgc taaacagtgc781 tgccatgtgt cagtatcatg taaatcaagc tttgttcccc agtagaaaag aatttcctca841 ttgcaaggtt attcatttta tggatgatat tctactagca gccccagtgg agccagtgct901 tttaagttta tatacctctg tcataaagaa tgcacagcta agtggtttaa tcattgcacc961 tgaaaaagca caaatgtcct ctccttggaa gtatcttggg tacatacttc ctggtcagta1021 agacctcaaa aggttaaatt aaatactagc aacttacaca ccttaaatga ttatcagaaa1081 ttactaggta attttaactg gcttcacccc accttgggca ttcctactta taagctgcaa1141 aacctgtttt ctatcttaaa gggtaacaca gccctggatt ctcccagata tttaacccct1201 gcagccaaaa aaaaaaaaaa aaaaaaaaaa a<210>23<211>1408<212>DNA<213>Homo sapiens<400>231 aagatcaagg actgctacgg actgggctcc gggcagaatc atttcatcaa ggacagtcag61 tgggagcagc aagctgagat cttcaacgct tcctacaaga agtacctaga tagggagtgg121 gaggaagagc cactcagtac ggccaccttc tatttccttc ttcctagctg cctatttgca181 atgccaccgg aagtcaaggg cccctcagga atggcctgtg tccttggtat acactggacc241 agaagtcaca atttcttcct gtattcacta aaccgaactc taaaggataa agctgacccc301 gagggtgtgt ggccctgtgc tgcgcccatt gcagtctctc agctcagctg ctcctcctcc361 tacctggtgc tggcctgcga ggatggtgtg ctcacgctgt gggacctggc caaaggattc421 cctcttgggg tcgctgctct tcctcaggga 
tgtttctgcc aaagcattca cttcctaaaa481 tatttctcgg tccacaaagg acagaatatg tatcctgaag gtcaagtgaa atcccaaatg541 aaatgtgtgg tgctgtgcac agacgcctcc ctccatctgg tggaggctag cgggacccaa601 ggacccacca tcagtgtgct tgttgagagg cctgtaaagc acctggataa aaccatctgt661 gccgtggccc cagtcccagc cttacctggc atggtgctca tcttttccaa gaatggctct
721 gtgtgcctta tggatgtggc caagcgtgaa atctgtgcct ttgcccctcc gggagccttt781 cctctggagg tcccctggaa gccagtgttt gctgtgtctc cagaccatcc atgtttcctg841 ctccgaggag actattcaca tgaaactgcg tccaccgacg atgctggaat ccaatattct901 gttttctatt ttaattttga ggcctgccca ctcctggaaa atatctcaaa aaattgtacc961 attcctcaaa gggacttgga taacatggcc ttcccccaag cactgccact ggagaagaga1021 tgtgagcgtt tcctccagaa gagctatcgg aagctggaga agaacccaga gaaggaggag1081 gagcactggg cccggcttca gaggtactcc ttgtcgctcc agagagagaa cttcaagaag1141 tgaggctgcc accgccctgg gatctctgaa aaggaggttt cagccacgag gcagctgctc1201 ccaggacact gaggccaaga gaaatgtaac agagccacag ctccacaggc ctgcactcgg1261 agtctggggc ctctgcagag ccagcaaggg gaaaagtata atctggggga ccttcaacca1321 ctaagcctct tgtcagagcc ctcaggcagg cagatgtgtc acccaaataa acagtgatat1381 tgtctccaga aaaaaaaaaa aaaaaaaa<210>24<211>1026<212>DNA<213>Homo sapiens<400>241 gtttgttcct aaatctaatc gagctcccaa ggaactagta tcttatagaa cacacataga61 aaatagtgct ctacagtagt cactttcaca tttttttctt ggatccaact tcatgataag121 aaatacattt taactcataa tctattcaca tgtatatgaa taactgaaac agaagtttca181 caaaattatg attatgtttg ccacaaatgg tgtgctctaa tattttcctt ctgtttcatc241 agaagaaaaa tactggttac tatccagcta agctgatttt gtgacccatt aatgagtttc301 agttctcact ttgaaaacac tgttctagaa ttacagaaag gtctagggat gagaactaac361 accactgtgc atcacagtgt gtcagtctgt gccaggcagc ttataaatat tttttgcctc421 taatttttat acatttatga ggtaagtatc attttctagg taaggatgct aatctgtctc481 caagccaaat aacacacagt aaatcatggc accaggattt gaatctgggt ctttatacat541 catagcccat gctgttctca ctgtattttg ctttttccaa gtataacccc gttttcacac601 gaatggcccc ttcacatatt tgaagactac cgtcgtgtcc gtgctgaccc tttctccctg661 ccacacatgg ctggagtgca atggcgcgat ctcggctcac tgcaacctct gtctcccagg721 ttcaggaaaa tggctttgta aagaagcttg agcctaaatc tggctggatg acttttctag781 aagttacagg aaagatctgt gaaatgctct tctgtcctga agcaatactg ttgaccagaa841 aggacactcc atattgtgaa accggcctaa tttttctgac tcttacgaaa acgattgcca901 acacatactt ctacttttaa ataaacaact ttgatgatgt aacttgacct tccagagtta961 cagaaatttt gtccctattt aatgaataaa ttgtatgtat 
ttttaaaaaa aaaaaaaaaa1021 aaaaaa<210>25<211>1067
<212>DNA<213>Homo sapiens<400>251 atgacctttc ctctttatct tccttgttgt gcaggtaaag aaaccaagtg gaagagtgtt61 tcctcctctg gccgtaaagc agctgtcccc gccctactcc ggaccgcccc aaagactcca121 tgggatggac ctgagtcagc cgaatcctag ccccttccct tgggcctgct gtggtgctcg181 acatcagtga cagacggaag cagcagacca tcaaggctac gggaggcccg gggcgcttgc241 gaagatgaag tttggctgcc tctccttccg gcagccttat gctggctttg tcttaaatgg301 aatcaagact gtggagacgc gctggcgtcc tctgctgagc agccagcgga actgtaccat361 cgccgtccac attgctcaca gggactggga aggcgatgcc tgtcgggagc tgctggtgga421 gagactcggg atgactcctg ctcagattca ggccttgctc aggaaagggg aaaagtttgg481 tcgaggagtg atagcgggac tcgttgacat tggggaaact ttgcaatgcc ccgaagactt541 aactcccgat gaggttgtgg aactagaaaa tcaagctgca ctgaccaacc tgaagcagaa601 gtacctgact gtgatttcaa accccaggtg gttactggag cccataccta ggaaaggagg661 caaggatgta ttccaggtag acatcccaga gcacctgatc cctttggggc atgaagtgtg721 acaagtgtgg gctcctgaaa ggaatgttcc agagaaacca gctaaatcat ggcaccttca781 atttgccatc gtgacgcaga cctgtataaa ttaggttaaa gatgaatttc cactgctttg841 gagagtccca cccactaagc actgtgcatg taaacaggtt cctttgctca gatgaaggaa901 gtagggggtg gggctttcct tgtgtgatgc ctccttaggc acacaggcaa tgtctcaagt961 actttgacct tagggtagaa ggcaaagctg ccagtaaatg tctcagcatt gctgctaatt1021 ttggtcctgc tagtttctgg attgtacaaa taaatgtgtt gtagatg<210>26<211>770<212>DNA<213>Homo sapiens<400>261 gctcagcctg ccccatcccc tgctgatttg cctgttccta gagcacagcc ccctgccctg61 aagacttttt ataggctggt cacacccgga gcaggagtca gccccagtca ggacacagca121 cagacatgag ggcccccact cagctcctgg ggctcctggt gctctggctg ccaggtaagg181 aaggagaaca ctaggattat actcggtcag tgtgctcagt actgtctgga acttcaggga241 agtcctctga taacatgatt aattgcgaca atatttgttt ttatgtttcc aacttcaggt301 gccagatgtg acatccagat gacccagtct ccatcctccc tgtctgcatc tgtaggagac361 agagtcacca tcacttgccg ggcgagtcag ggcattagca ataatttaaa ttggtatcag421 cagaaaccag ggaaaactcc taagctcctg atctatgctg cacccagtct gcaaagtggg481 attccctctc ggttcagtga cagtggatct ggggcagatt acactctcac catccgcagc541 ctgcagcctg aagattttgc aacttattag tgtcaacaga gtgacagtac ccctcccaca601 
gcgttacaag tcataacata atccccaagg aagcagatgt gtgaggctgg gctgccccaa661 tgctccttct ggtgcctcta tctgctgagg gaagttctca aactcagtca ggtttggaaa721 gtcattggga gattttccta gaggaggcca gggaggttcc tctgaaccct
<210>27<211>1035<212>DNA<213>Homo sapiens<400>271 gtcacaaggt tgattgatta gttggggtga ggcaggaaca aatcacaaag gtggaatgtc61 atcttttgtg gttcttcagt tgctccaggc catctgggtg tatatgtgca ggtcacaggc121 ttagcttggg ctcagaggcc tgacaattta taaaatactt cattcaacaa tagcagacca181 cacattcttc tcaagttcac atggaacatt tgccatgaca gatcacattc tgggccataa241 aatacacctt agcacctttt caaagaatag aaatcatacc aagtactctt tcagaccaca301 gtagaattaa actagaaata aataaataac aaagttaact ggaaaatccc aaaatatttg361 gagattaaac aaaacactac taaataacac atgaaccaaa gaagtctcaa gaggcataaa421 aatattttaa gctaaacaaa aaatataact tatcaaatct tgtgggatgc aggaaaagca481 gtgcttagag ggaaatttat agtattgact acatagatta gaaaagaaga aagatccaaa541 atctacaagc ttcaacatta ggaaacgtaa caaaaaatta ataatcaaca ttagaacaga601 aattaatgaa attgaaaaca ggaaatcaga gaaaatttta aacatcaaaa gctggttctt661 tgaaaaaaaa aatcaataaa atcaataaaa ctttagccag gctaaacaag aggaaaagaa721 aagagacaca aattccaaac atcagaaata aaggaggggc atcactactg gtaacatggc781 taataaaaag attaaaagga atatcatgaa cagccctaca cccataaact tggtaactta841 gataacattg gccagtttct tgaaaaacgc tatctaccaa aactcaaaca aggagaaata901 taatctgggt gggcctacat caattaaaag aaatggaatc cataattaat actcttccaa961 aacggaaagc acaagactca gatgttttca ctggtgaatt ctaccaaaca tttaaggaaa1021 aaaaaaaaaa aaaaa<210>28<211>1057<212>DNA<213>Homo sapiens<400>281 acgctgccca gactcactac ccgccctctc ggtcccgcag ccacgaggag gacgctgccc61 agactcacta cccgccggct ccctcccccg cgtccctgtg gtggtgggcg aagatgacct121 gttgtgcatc agatgccttt gagacataca actggagatg tgagatagga aattgaatgt181 aaagactgga agctcggcag agcagaactc tgaacaacgt gcctcaggag tgcaagttgc241 tgatgaagtg tgtcgcattt tttatgacat gagagttcgt aaatgctcca caccagaaga301 aatcaagaaa agaacaaagg ctgtcatttt tttgtctcag tgcagacaaa aagtgcatca361 tgtagaaggc aaagagatct tggttggaga tgttggtgta actataagtg agcctttcaa421 gcattttgtg ggaatgcttc ctgaaaaaga ctgttgctat gctttgtatg atgcaagctt481 tgaaacaaaa gaatccagaa gaattgatgt tttcttgtgg gcatcagaac tagcaccttt541 gaaaagtaaa atgatctata caagctccaa ggatgcaatc aaaaagaaat ttcaaggcat601 aaaacatgaa tggcaaacaa 
acggaccaga agatctcaat cgggcttgta ctgctgaaaa
661 gttaggtgga tccttttttt tttttttgag atggagtctt gatctgtcgc ccaggctgga721 gtgcagtgac aggatctcgg ctcactgcaa cctctgcctc ccgggttcaa gtgattttcc781 tgcccagcct cctgagtagc tgggattata ggcgtgcgcc accacacctg gctaattttt841 gtattcttag tagagtcagg gtttcaccat attggtcagg ctggtctcga actcctgacc901 tcgtgatttg ccctccttgg cctcccaaag tgctgggatt acaggcgtga gccagcatgc961 ctggccagtg gatccttaat tgtagccttt gaaggatgcc ctgtgtagat catcattcag1021 tgccacaaat tgaaagcttc cacgtttaat gttatcc<210>29<211>1003<212>DNA<213>Homo sapiens<400>291 acgcgtccgg aaatgtctta cgatgatcat ttagaggttt attttgaaca actggcaatt61 ccaggaatga tggaataaag catacgaagt agaaggactg gaacctccag aaaaagtact121 ttaagttacc tacaggtgat cctagtcagg tatgaattga taagaaatgc ctgcaccttc181 cctccttcct atctttccct tgcctacaga aaattaaaag gcaaaacaat agacatctgc241 atattcttca ttcagatcaa ccagtggcta gcatttgcca ccttttgcag tttctttctc301 tttccataag tactttcttc tctgaatcat ttgaaagcaa atgaaaacag tagcctaaag361 tgtcagtttc aaccagaaaa taacagctct gatttctcat ggctcatact cgtctgaaac421 gactcaggta gaggctgagg aaggccgtgt tgtttgtcta cctgggacta cacctactga481 aagaagttct caagttctga ttgagttcta aaattttttg aagattggaa ttcttcatat541 gtaaaaagag aagagaaaga cattcttgat tttggtgact agagttgtag atgctggaag601 ctttgccact aacattgatg aaccagatgc atgtcaccag cccatcacgg gcaccatggg661 cccggcactg ccacgttttc caaggaacct gccggagctc ctgcggaagc tgctcctcag721 gcgatggagt cccttcgctg gtaggccctt cttcagtcac cacaggatgc ctgccattca781 tgaacaagaa ggagaggacg ggctttgaat aaaaaacagg atccaggaaa tatttgtgag841 gccatttgga cttcagtgtg aaatggtgtt aaaagatgaa gtcatttatt caagaagtaa901 acctctgcca cctggactgt gcgcagacat ttcattgatt ttgtttaata aacattttct961 ggcttaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaa<210>30<211>665<212>DNA<213>Homo sapiens<400>301 cttttttttt tttttttttt ctttttttag gcatatgtag taatattaga aacatttaat61 ttgggaaact ttgattcttg aaagagaaaa caaaagcatg tgaataaact ttgaagtgtt121 cacctcagtt tgggaccaaa ctgcttggat ctttgtaaaa accggttttg tatgtcaagg181 aggagtttaa ggcctttccg accaccttgn gttccccttt tctgcgcacc attgtatcac
241 gtggagttgc tccttaccac acctcacgtg cccctgagcc cnatttcctg atttcttctg301 ggctggactt ccccgttctc caccagcagc tccagtatcc caaactttct agtcctgctg361 atcctcccag caacggggtg gaaactggag ggcagtgtct ggtctgtttt ctaagaaact421 tatgaattct attatcntta caaatatgng aaaatttttt caanaatttt taataancct481 tttataaaat gaaaagaaac ccnatgatcg attaaggaag nggggtatgg cngggggggt541 cagggggttt ttnggggtcc cttttttttt cctggccttt aaccctaagc ggntaagtgg601 aagcatccca naagttnggg gggaaaancc cctaaaangg gccntgggct gcctctgggg661 ngggt<210>31<211>462<212>DNA<213>Homo sapiens<400>311 tttttaagat gaagacagta atataatatt ttggnacaaa aacagctgca ggangaggtg61 gaggggggcc tgtcattatg ttnccccccc accccccaac gaaaggaaaa ctaagactcc121 caacataaac agggccttga ggggggggat tacaggcact tgggcatgga gtcttcggct181 gcaggaagca ctccgcttat tcttcaggaa tgggaaaggc gtgacccaac gagagcatct241 gtctcagagc tccactcagg gtcacccctc tccagaggcc ggtatggggt ggcttcagac301 ttccactgca cgacctgggn gcaccaagac cacacaccac aataccaaat ttcacccaag361 aagaggtttc agcattgtgt aggttggagt aaaacttgca gagattncca ggggggtntt421 ccatggattt tctggggttt cagaacagta attntagtnt tt<210>32<211>440<212>DNA<213>Homo sapiens<400>321 ttgaaacacg ttccaaatac attataataa attgtgtgtt ttgctatttt tgggtgcagt61 ttttatatca aagtatttaa tcctactgct ttaaattttt ctattattat tgatattttg121 tatccttttt gctatcagtt attgaaaaag gtatattaaa gtctttatga ttgaagacgc181 ggctattgct ccttttagtt ctataaaaat tgtaaatgtg tgttaactgt gtgtgagtga241 atatacatac acaagtttga caacttcctc aatagttgat tcttttatat gggatgatgg301 gatacatctt cctttcaagt aataatttct gccttaaaaa tctcttttgt ctggatatta361 acatagctaa acaaattttc atttgggctt gtgtttggat taccctttca tatggcatat421 atggangggg nttttnaaat
<210>33<211>481<212>DNA<213>Homo sapiens<400>331 tttttttttt tttttttggg cttcaagttc atttattgat gcattggttg gacacaataa61 aaccacaatt gttatcaaaa agtttccctc ccctccccct ttttctggtt ctcccattgg121 aaatgtcttt aaaaaacaaa aaaaagatac agggggatac tggcagggtc ccatattgtg181 gggtggggag ggggcagtgc tgcagtggaa aggagtttgt gacattgaca ttgacagcat241 gaaaatgggt cttgccagtg catgtgagac tgcctcccaa cactcagctt cacagtggnc301 aggggctggt tgggagaagg gaaagctgaa tttagtgcta gggaccagtt tgggagtggg361 gaagctntag ggngaaagct cttcctttgg ggagctcttc ggggggtttc cggtttttna421 gctgagtncc ataattctga gggnagaccc acacccgccn tccaagattt tgggtttcaa481 g<210>34<211>454<212>DNA<213>Homo sapiens<400>341 tttttttttt tttttcattt gttattttct tttatttgtg aatatgtgtg gggttgggag61 actgggtgct agacatgagc tgaggtccag gctaggcaca gggagctgac agctgcagga121 catgctaatt ggtctgggaa ggcctttgga atctagggaa ttggctttct gagctgaggc181 gggtccaggc aaggaggcaa caggctgcag gaatcacaaa gggaccgggg acctccatgt241 acccaatatc taggtctcat cagtcggaga aacattctca gctcttgggt gaccttgtcc301 tccctcaaca cggtacttca ctgcatatgg gacttcattt gacaattgcc acaaaggctt361 tgggcttcag aggcactttt aagggatatt aacagtaggg aaaggggagg cttgggggga421 gnccagagag ttcacccaca caaagcccca cact<210>35<211>307<212>DNA<213>Homo sapiens<400>351 aggacttcat ctnatattca gattatagca agaaaatcaa tgtgctccat atatttnacc61 catgatgaaa cttgtcccat atccattact atatatatat atatagttta actgggtggg121 gttgggaggg agaaggcatg aattcaaaat tttaattaaa tattacataa tttaatgttc
181 tcccaagaat ttgagctgac attttcatat gaatacaatt tcacaaaggt tacaaatata241 tatcanctta caacagaagg natcatttag tattttacag tgtttttaaa aatctagctc301 cataatc<210>36<211>791<212>DNA<213>Homo sapiens<400>361 tttttttttt ttttttttga gacataatct tgctctgtcg cccaggctgg agcgcagtgg61 catgatctca gcttactgca acctccgtct cctgggttta agcaattatc ttgcctcagc121 ctcccaaggt gctaggacta caggcgtgtg ccaccacacc tggctaattt ttgtgttttc181 agcagagaca gggttttgcc atgttggcca ggttggtctt gaattcctga tctcatgtga241 tctgcctgcc ccagcctccc aaagtgttgg gattacaggt gtgagccacc atgcctggcc301 ttattttgtt gattctaaca cgtggtgata tctcctggtt taaagaagcg tttccctaat361 ggtatcaaac atcttttcat gtgcttattt gcaatctgta tatcttctcc agtgaaatgt421 tgctttgtat cttttgctca tgttctaatt gcattctttg ttttgttact attttaagac481 ttttaaatgt attctagata ctagtccttt gcaaatatag tcatgcatag cataaggaca541 ttttggtcaa tgacaaactg catataggat gcggtcccat aagattataa taatgggcct601 gaaaaaattc ctgttgccga gtgcaatgct actcatgtgt ctagtgatgc tcgtgntaag661 ctactgtgct gccacttgca ntaaagcaca gcgcacacca ttctgtacag tacacaatac721 ttgataatga tataaatgtg ttcctggata gcacagacac acattctgta catacacata781 cgtgatatga t<210>37<211>502<212>DNA<213>Homo sapiens<400>371 tccaaactcc cttatattcc tatctggaat gcgaaagctg gaaatacaag tgctctgtaa61 tcacaggcag tggttttctt ggagtctggt gtgatttctg tagcttgcag caatatgtgg121 gacacacttg cactgaagaa tttaatgact tgttttactg ctgtctgcca tcaatattta181 tctatattaa ctgaagaaag gcattactcg aaatggcatt tcacacctct gagcattata241 cagtaattct aaagtcatac gttgtaccaa attctctggg caaacacaga gatacagtat301 gaatggaaag aatccaagtt gaagttttta aaacttgaat tgtctgcagt aatctggaaa361 accaaagtca aaatgactcg aattgctatg gctgggaatg actggattaa aagcaatgga421 aacaatcgtt tctcggcttg gtttttcttg aattagcaag agcacatttc aagtatattt481 aatagaaaag aaaaactggg cc
<210>38<211>516<212>DNA<213>Homo sapiens<400>381 tttttttttt ttagtatttc ttgttatatt tcataaacag tctaacaaaa cactaacaaa61 gttattttgc ttattattct gatgctggaa taatttggtc aaatattcat aaatagttat121 caactttacc aaggcacaaa aatcaagaat acatgctttc tataaactca tttactactt181 taaagttttt aactgtttan cagttatagg aaattttttt aatcgaaaag aaaactattt241 acatttcaaa tacaaaagaa aattaaaatt attcttgatt gatctctaaa acccaaccac301 ttagatgctc attattatta ttttctttca ccaatccagt atactctatt caacatgtgc361 atcctgtgta cttactaagc atttatttgc acaaggattt tctgtgtcag gttaactttg421 ctgtgtagat tctgtgttag ctacctgaat tcgtccaana aaaagtctca cntttttttg481 ganaaccaat gctgctacnc tgtttccaga tctgcc<210>39<211>684<212>DNA<213>Homo sapiens<400>391 tttttttttt ttttttttca aaaagaaaga agtaaaatta tatttggttt gtagacgaca61 tgatcttctg tgcagaaaat cacaaagagt caaccaaaat actactggaa ctaagtaaca121 tattcagtgg atttgcacaa tacagtatca gcaccaaaaa ttagttgaat ttccatactt181 ttaacaataa acaactttaa aagaaaattt taaaacactt ccattggctg gagaacttaa241 agtaagaaat acttagaaat aaacgtaaaa aggtcagagg tttttacctt gaaatctaca301 aacattgatc aaaatgatta gaaacataga gctacaaccc atacttatgg tttggaaaaa361 ttaatattgt taaatgatta tataacccaa tgtgatttag attcaacacg gtacccataa421 aaatccttat cactggggct gggcgcattg gctcacgcct gtaatcccag cactttggga481 ggccgaggtg ggtggatcat ctgaggctgg gagttcaaga ccagcctaac caacatggag541 aaaccccatc tctactaaaa atacaaaatt atctgggcgt ggtggctcat gcctgtaatc601 ctagctactc tggaggctga ggcaggagaa ttgcttgaac ctgggaggcg gaggttgtac661 tgagctggga tcgctcattg cagt<210>40<211>738<212>DNA<213>Homo sapiens
<400>401 tttttttttt ttttttttga agtcagggca acttttattt acttgaacaa cagacatgac61 acacagcacg atctctcctc tccatcccat cccaagcttc tgtccacgac tacccagggt121 ggggaaaaag gcaatggccc agccccgatg acctggggga cttcagtgtg cctcctcagt181 cccacccaga aaaaagcctc ctcccatctg atttcaagat agtggagaaa agccaggcac241 ggtggctcac acctgagcca cctgtaattc cggcgcattg ggagactaag gtgggaggat301 ggcttgtggc caggagttcg agaacagcct gggcaacata gtgagacctg cccccccatc361 tctacaaaaa ttaaaattaa aaaaattagc caggtgtggt ggcgcatacc tgtagtccca421 gctctactgg ggaggctaag gcaggaggat cacttgtgcc caggaattca aggttgcaat481 gagccatgat cacaccagcc tggccaagat ggtgaaaccc cgtctctact aaaaatacaa541 agattagccg ggcatggtgg tgggtgcctg taagcccagc tactcaggag gctgaggcag601 gagaatcgct tgaacccagg aggcagaggt tgcagtgagc caagatcaca ccactgcact661 tcagcctggg caacagagtg agactccgtc tcanaaacan acaataaaag gccaggcacg721 gtggctcaca gctgtaat<210>41<211>551<212>DNA<213>Homo sapiens<400>411 gtgtacaaac aaaaatcact ttattcacat gccagaacag tgtttgagta gatagtttta61 agggcaatac aatcaatttc ttgaactcag atttgaattt ttaaaaaatt atattcaagc121 tctaagtatt ttacttacag aattttgcac atggctcact cctttctgca caaaggtaaa181 ataatatctt tatttgctta attccccatt gttttgttgt ttcaacatga ttttgtaccc241 tcagtctttc aacgttagtc aaacgttaca ttttctttca ctcttaccct tttttgagct301 tcatttcctc tctttatgtt ctcgagtata gcttcgaggt gtagggtatt cattcccaag361 atccacaaca attacctccc tagaaaatgt cactcctttt aaaggtcttt tttggctttc421 aacaactgaa ggagttctag tgacagatgt gtgtattttc tgcaaaccat catcctcttc481 ttgaccagtt tctccagcat tctcaggctc ttgcttctca tttatttggn tctcctcttc541 tgcctcccat g<210>42<211>747<212>DNA<213>Homo sapiens<400>421 tttttttttt ttttttttga gacagagtct cactctgttg cctaggctga gtgagaataa61 ggttgtgtca taaataaata aataaagtca tttacttaac atttacatag cactgattat
121 ctgacagaga ctgaggaagc agaaacgagt aagacaccac tccaggcccc agggatcttt181 cttgagggag attcacatcc tcgggaaaac aagatgtata gtgaatgaag gaggtagcag241 aggggtctgc agtaactgag taaggtatgg agctagagga ctaatacctc caggataaat301 gaatttatac aaagattgta tttgagagaa gaggtgagga accactcact agtagttttg361 agtgccctga aagtgaaata gattgaattt tgtcctggag gtcaatggga tgttccagga421 cttaagcccc ataggactga caatcaaaaa caggtctggc tgcactgtgg gaaatggatt481 gagggggcgc tggagtgaag gcaggaaggc tgtgagacgt gtcagacagg acaggttgaa541 cctggggctg agagaactga accaactgga aaggtctctt aaggtctctt acaaggtaaa601 aagagaccta aggcaggtgt gaagagcgat aaggcaagga tgagatccag gtttctggcg661 tgggtgtcac ggaaatgaga acacgatggg ggcagagctg gggaaaagct ggggagatca721 atttctcgcc gagcctgtgg tgtctga<210>43<211>699<212>DNA<213>Homo sapiens<400>431 tttttttttt ttttttttca ataaaagcat ttacatttat ttgcacacta aacacattga61 tttactactt attcccagtt ttcttattta cttatatagg atccattcat tcattttaca121 aatacagatt ttcagagtaa gaagagagaa taattcttct aaaaataaag gcattacaat181 agaatttgca aatctaggta ccacgtggaa agtctcatcc gttcccatat cattaactgc241 tcattttcac actagtgact tccaagtcct tattgtcagc atcctctatc ttcccatctc301 cactggagca tttccaacct cctgccacca actcaacaca ttcagggaga agagggctgg361 tctgggacca cagaaatcct tgttttaatc ttggaggcag tgctcagtat gaacttaatg421 aattgttaag tgaatttgtt ttttaagaaa gtcaacaagt aatgtatccc caggcaattt481 tatctgattt ttgacattta gattggtttt ctattacttg gcaaagagaa tggcgtacag541 tcactcctct gggggttctg catgaggaag tatccattat acatgccgtt aaggagtgat601 caggtaggga gggaaaggtt ggccacatca actctccttt gtgaagccaa cagctctcaa661 caagggagat atagtgctct gtagagcaga aacatgaat<210>44<211>728<212>DNA<213>Homo sapiens<400>441 tttttttttt ttttttttgc tgggggttag gattttaata taaagttttt tggtgggggt61 gggggagaca caatttgatc cgtgctgagt ccattttgtg ttgctctgac agaagacctg121 aggctgggtc tttataaagg aagaggttga tttggctcag attctgatgg ctggaaggtt
181 caagactggg caggtgcact cactgaggac ctccctcatc cctgaggctg ctgctgcttc241 tgggggaggg cagaaggatg cagctgtgtg cacagagatc acatggtgct gaaggaagca301 agggagagaa aaccaaattc ccaaattccc aagggtgagc actcactcac cccttaaggc361 aggcattaat ctatccatga ggactccacc tccaagacca cacagctcct atgaggcccc421 acatcccaac acctgccaca ctggggctcg aatctcaaca tgaattttag tgaggacaaa481 cgatatccaa accattgcca cccaaaagaa gtgggctgag gcctggcgcg gtggctcaca541 cctgtaatcc cagcaacttt gggaggctga gacagatgga tcgcctgagg tcaggagttc601 gagaccagct tggccaatat ggtgaaaccc cgtctctact aaaaatacaa aaattagctg661 ggtacggtgg ctcacatctg taatcccagc tactccggga agctgaggca ggggaatcgc721 ttgcacct<210>45<211>661<212>DNA<213>Homo sapiens<400>451 tttttttttt ttttttttgc agttagaaca tagtttattc tttaagtgta ggagtgcatg61 acataacact tgcctggcat gaccttagat cttacgaata atttgttatt gccacaaagg121 gtctattctg tcagccttat gatctttatt ttgacattaa tgctggtcag ttgttgtgtc181 taaaccataa aagagacgga gatataatga agcgtgtctg acctcccatc ccgtcatggc241 caggaactca gttttaaggt ttttctgggg tcactttggt caagaggggt tccattcagt301 caattagggg gcttaggatt ttagtttaca acttacttca ccatctctag ggtgttgtct361 ttatttatgt ggtccagaat gtctacatgt ggccaactaa aaattgatga ctataagaaa421 gggaaagttg ctttaaggac gaccccagta tctggcacac tctggcaact gaagtccaga481 gctgaagcat aaaaccaaag gaaacaaaaa gcagtctcag agcatatgag aaatcaaaga541 gtttcttata tactgcagat agcaatttta aaaatcagta tttttaagct cagagcattg601 gatgccaaca acccatagtt cctgcacaaa gacatggaaa aatccagcca caggatctgt661 c<210>46<211>677<212>DNA<213>Homo sapiens<400>461 tttttttttt ttttttttac gtgttgtaag attccattta tatgaagtgt ccagggcagg61 aaaatccaca gagacagaaa gtagattaat gggtgtgggg ctgaggagag gaaggcattg121 ggagaaacgg actttttctt ttagggtgat ggaaatgtgc tgtaattagc ggtaatggtt181 ccactctatt gtgaacacgc taaagctact gagttgtaca ctttaaaatg gttaatttta
241 tgttatgtaa attttctctc aaaaatttta aacaaaatgc acagggaacg agcgtgggcc301 ggtgatgccg gcaaccgggt gcctgttgct gcgctgctgt ggcgccccct gctggctgcc361 cgtgaattcg gggagggatg gggcctggat gcgccggtgt ctcctgcttt ctgttccgga421 gacttcccag gagggtggag accccaccca gagtgacttc ccaaaaggca gcactacccg481 acgagggacg ccacaggagg aggacttgga gggagggcag tgggcacggt agagtcttct541 ttcctgaggc gaccccagct tgttggacga ggcacagcan atgctgacta tagtaataga601 agagtgagtt atggcttcag ggattttttt aattattact tttacttcca gtttntctat661 agtgagtaca gattact<210>47<211>635<212>DNA<213>Homo sapiens<400>471 ggcccagatt caacttttct atgaaaaata gaaactgtca tgttgagtca taaaataaaa61 aaactagcaa atccagctct atgctcagag aattaccaga aaataaaatt acatgaagct121 tgaatatagg gagatggaaa gatattagac aaatattaaa gaaaatctgg gccaggtgtg181 gtggctcaca cctgcaatcc cagcactttg ggaggcccaa ggtgggaaga ttgcttgagg241 caaggggttt gagaccagcc cgggcaacat ggtgaaactc tgtctcttta aaaaagaaag301 aaaagaaaag aaagaaagaa aagaaaatct cagtgagtga tggtcagaat agaattcaac361 ataacaagct cattattaaa atatttgatc tcactgtgta caattctgaa gacactcatt421 catgtacttc attaaatatt tctagtttgc taaaaataga attacccttc aacccagcaa481 tcccattact gggtatctac caaaaggaaa aaaaaaatca ttctatgaaa agatgcctgc541 acttgtatgt tcatcacaga actatttaca gtagcaaaga cgtggaatca acccaggtgc601 ccatcaacag tggactggat aaagaaagtg tggtc<210>48<211>678<212>DNA<213>Homo sapiens<400>481 tggtagtgga cccggcctcg gaccgcgtgc tggccaccgg ccacgactgc agctgcgcgg61 acaaccccct cctgcacgcc gtcatggtgt gcgtggacct cgtggcgcgc ggcagggccg121 cggcacctac gacttcagac ccttccccgc ctgctccttc gccccggccg ctgcccccca181 ggccgtccgc gcaggcgccg tgcgtaaact ggacgcagac gaggacggcc tcccctacct241 gtgcactggc tacgacctgt acgtgacccg cgagccctgc gccatgtgcg ccatggccct301 ggtgcacgca cgcatcctgc gcgtcttcta cggtgcgccc tcgcccgacg gcgccctggg361 cacccgcttc cgcatccacg cacggcccga cctcaaccac cgcttccagg tgttccgcgg
421 ggtgctggag gagcagtgcc gctggctgga cccccgacac gtaggcgccg ccctcctgcc481 tccggaccct tcccgctccc ggccgtgggg cgcccctcct ggacttccgg gcctcggatt541 tcttccgcac aagcctgacc gtggatttca gggacacata ccgctccagc gggggagcac601 gggtgctgcc ttccgtgcgg atcgagcttt cctggactcg gtcattgggg ccaccccgtg661 ccaagcggtt gcccttct<210>49<211>837<212>DNA<213>Homo sapiens<400>491 tgcatttgaa agctgaagga attactttag atagatagga attgctacct ttattctgga61 aaggtttagt ttctgcctca aagcattttg aagtgtttta accattaaag ggtctaattt121 ttttttctta atgaaatcaa gcattttaat ttactgtggg aggcatcctg accacggaca181 tccataacag caaagcacaa atcgttttcg tctgtagtca tatcctgaaa cataggtgga241 caaattttta actgagagac aaaaatcaca tagttgaatt gagcagaaca cttaagtgct301 ttctgcatct atttaggagt ctatttctta ccaataaact tgacaacgca tttggaaaac361 tagtgaacac cttacagctt tcattctgct ttaatgtttc aattcaagcc ggtgtaaaaa421 taatttccaa ggcatttctg tttattcttt agtaatctca ctactggcta tgtcagcaat481 atctttttca atctggttcc ttttgtatat gatgtcactg tgacctcttt gaaatatagt541 gatggctttt acctactttg aaaagaattt tcacatagag tcagaaaaaa aaggaactat601 caaatcactt gcctttccac ttggagagca cgacagttgc caacaacaag gggtcaaggg661 acgcacagga gatgtgtggg ggcctggaca ctcccaactg gtgaccagag gcttgttcgc721 atttacattt gtctccactt gtaaaaaaca tagttcaggt cctaaattct tatcactgga781 tttacttagc aatcttcaat gacctgcagt cattgcctgt ataagtgaac acgaaaa<210>50<211>819<212>DNA<213>Homo sapiens<400>501 ctgcctgtct cggctcccaa agaactggga ttacaggcat gaggcaccgt gcctgtccca61 tgtgtactta atagatattt accatataca attatttttt aatttaaata aatcaatgaa121 cagtgttatt ttttaaatct tgtggcttta atacaccctt tcaattctca gggagtgttt181 tggtgataca ggctggggag ttaaataaat gaacgaataa acttagttgt tatccattca241 aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa aaaaaacaaa aaaaaaaaaa301 aaaaaaaaaa aaaaaaacaa aaaaaaagaa gagagagaga gaagagaaga aacaaataac361 aaatcaaaaa agaagaagca acaaaaaagc aaagcgtaaa caacacagag agcgcgggcg
421 cgagcagagg cgagcccccc ttgagtgaga acacattata aaataccgca gcgaagacga481 gggtagcgtt tacagaaaga cggagcagcg ttcctcgagc gggagggaca acttcaccta541 ttaacaacct gaatctagct tgtggaagaa cagaacaaca ccactcggtc ggagggagga601 cacaaacaaa tgacgtgata acagtaacat ctccatcctc tgcatcgaaa ggattaccac661 acccccgttc gggcggcgag acaaaaataa atgatagtga aagttacata ctacagcggt721 agcacaataa acgacacgtc agactactat gatcggctat aatagtcata aatgtttgcg781 agcacaagcc gccgagttta ttcgatcacg agacctctt
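The listing above arrives flattened: each record's numeric field tags (`<210>` sequence number, `<211>` declared length, `<212>` molecule type, `<213>` organism, `<400>` sequence body) run together, and the position counters (1, 61, 121, ...) are fused onto the residues. The following is a minimal sketch of how such text could be split back into records and checked against the declared length. The function name `parse_listing` and the dictionary keys are hypothetical and not part of the patent; the sketch assumes residues are lowercase letters and that every digit inside a `<400>` body is a counter rather than sequence data.

```python
import re

# One record: <210>id <211>length <212>type <213>organism <400>body,
# where the body runs until the next <210> tag or the end of the text.
RECORD_RE = re.compile(
    r"<210>\s*(\d+)\s*<211>\s*(\d+)\s*<212>\s*(\w+)\s*<213>\s*(.*?)\s*<400>\s*(.*?)(?=<210>|\Z)",
    re.DOTALL,
)


def parse_listing(text):
    """Split a flattened sequence listing into a list of record dicts.

    Hypothetical helper: assumes every digit in a <400> body is a
    position counter (or the repeated sequence number), never a residue.
    """
    records = []
    for match in RECORD_RE.finditer(text):
        declared_length = int(match.group(2))
        # Drop counters, spaces and line breaks; keep only residue letters.
        residues = re.sub(r"[^a-z]", "", match.group(5).lower())
        records.append(
            {
                "seq_id": int(match.group(1)),
                "declared_length": declared_length,
                "molecule": match.group(3),
                "organism": match.group(4),
                "sequence": residues,
                # True when the cleaned sequence matches the <211> value.
                "length_ok": len(residues) == declared_length,
            }
        )
    return records
```

A record whose `length_ok` flag is false most likely lost or gained characters during extraction and is worth checking against the official publication before use.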
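Claims
1. A class of human liver expressed sequence tag sequences, comprising: (a) the sequences shown in SEQ ID No. 1 to SEQ ID No. 50; (b) the complementary sequence of each of the sequences shown in SEQ ID No. 1 to SEQ ID No. 50; (c) sequences having at least 70% homology with each of the sequences shown in SEQ ID No. 1 to SEQ ID No. 50; and (d) a combination of one or more of (a) to (c) above.
2. The class of human liver expressed sequence tag sequences according to claim 1, characterized in that the sequences comprise those having the sequences shown in SEQ ID No. 1 to SEQ ID No. 50.
3. A probe molecule, characterized in that the probe molecule contains about 8-100 contiguous nucleotides of the sequences described in claim 1.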
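Two operations implied by the claims, taking the complementary sequence (item (b) of claim 1) and selecting runs of about 8-100 contiguous nucleotides as probes (claim 3), can be sketched as follows. This is an illustration under stated assumptions, not the applicants' procedure: the function names are hypothetical, "complementary sequence" is read as the conventional reverse complement, the "about 8-100" range is enforced as a strict bound, and windows containing the ambiguous base `n` are skipped by default.

```python
# Base-pairing table for lowercase DNA; 'n' (unknown base) maps to itself.
COMPLEMENT = str.maketrans("acgtn", "tgcan")


def reverse_complement(seq):
    """Return the reverse complement of a DNA string (assumed reading of
    'complementary sequence'; the patent does not specify orientation)."""
    return seq.lower().translate(COMPLEMENT)[::-1]


def candidate_probes(seq, length, allow_ambiguous=False):
    """Yield (start, window) for every contiguous window of `length` bases.

    Assumption: the claim's 'about 8-100' is treated as a hard 8..100 bound.
    Windows containing 'n' are skipped unless allow_ambiguous is True.
    """
    if not 8 <= length <= 100:
        raise ValueError("probe length must be between 8 and 100 nucleotides")
    seq = seq.lower()
    for start in range(len(seq) - length + 1):
        window = seq[start:start + length]
        if allow_ambiguous or "n" not in window:
            yield start, window
```

For example, `list(candidate_probes(sequence, 20))` would enumerate every unambiguous 20-mer of a tag, and applying `reverse_complement` to each window gives the corresponding probe for the opposite strand. Choosing among candidates (melting temperature, specificity) is outside what the claims describe and is not attempted here.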
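Abstract
The invention discloses a class of human liver expressed sequence tags. Using the expressed sequence tags of the invention, which are expressed in human liver, genes expressed in human liver can be readily identified, so that the tags can play an important role in studying the pathogenic mechanisms of liver diseases and in developing drugs to treat them.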
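Document No.: C07H21/00GK1955289SQ20051003082
Publication date: May 2, 2007. Filing date: October 28, 2005. Priority date: October 28, 2005.
Inventors: Huang Jian (黄健), Han Zeguang (韩泽广). Applicant: Shanghai Human Genome Research Center (上海人类基因组研究中心)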