Patent application title: NUCLEIC SEQUENCE AND DEDUCED PROTEIN SEQUENCE FAMILY WITH HUMAN ENDOGENOUS RETROVIRAL MOTIFS, AND THEIR USES

Inventors:  Patrick M. Alliel (Clamart, FR)  Jean-Pierre Perin (Le Plessis-Robinson, FR)  Francois Rieger (Boulogne, FR)
Assignees:  INSTITUT NAT'L DE LA SANTE ET DE LA RECHERCHE MED.
IPC8 Class: AC12Q100FI
USPC Class: 435 4
Class name: MEASURING OR TESTING PROCESS INVOLVING ENZYMES OR MICRO-ORGANISMS; COMPOSITION OR TEST STRIP THEREFORE; PROCESSES OF FORMING SUCH COMPOSITION OR TEST STRIP
Publication date: 10/22/2009
Patent application number: 20090263783







Abstract:

The present invention provides a translational product encoded by the nucleotide sequence of SEQ ID NO: 2, which corresponds to the gag gene of an endogenous human retrovirus named HERV-7q. The present invention also provides methods for diagnosing a neurological or autoimmune disease in a patient.

Claims:

1. A translational product encoded by the nucleotide sequence of SEQ ID NO: 2, corresponding to the gag gene of an endogenous human retrovirus named HERV-7q.

2. The translational product according to claim 1, which is selected from the group consisting of the sequences SEQ ID NOs: 29 to 34, 36, and 58.

3. A method for diagnosing a neurological or autoimmune disease in a patient, comprising the detection of a translation product according to claim 1 in a biological sample containing proteins obtained from said patient.

4. The method according to claim 3, wherein said neurological or autoimmune disease is multiple sclerosis.

5. The method according to claim 3, wherein said detection is an immunodetection.

6. A method for diagnosing a neurological or autoimmune disease in a patient, comprising comparative analysis of the expression of the translational product according to claim 1 between a biological sample containing proteins obtained from said patient and a sample from a normal individual used as a reference.

7. The method according to claim 6, wherein said neurological or autoimmune disease is multiple sclerosis.

8. A method for diagnosing a neurological or autoimmune disease in a patient, comprising comparing the protein sequences present in a biological sample from said patient with the sequence of the translation product according to claim 1.

9. The method according to claim 8, wherein said neurological or autoimmune disease is multiple sclerosis.

Description:

[0001]The present invention relates to a novel family of nucleic sequences and deduced protein sequences with complete or partial human endogenous retroviral motifs, to sequences flanking or adjacent to said sequences and controlled by the latter, and to the modification of the expression or impairment of the structure (polyadenylation, alternative splicing and the like) of said flanking sequences.

[0002]The invention also relates to the detection and/or use of said nucleic sequences and of said corresponding protein sequences in the context of diagnostic, prophylactic and therapeutic applications, in particular for neuropathological conditions with an autoimmune component such as multiple sclerosis.

[0003]The invention also relates to the production of antisense double-stranded and single-stranded nucleic probes and of ribozymes capable of modulating viral replication (T. R. Cech, Science, 1987, 236, 1532-1539; R. H. Symons, Trends Biochem. Sci., 1989, 14, 445-450), of the corresponding recombinant molecules, and of the associated antibodies.

[0004]Retroviruses are viruses which replicate solely by using the opposite route to the conventional processing of genetic information. This process, called reverse transcription, is mediated by an RNA-dependent DNA polymerase or reverse transcriptase, encoded by the pol gene. Retroviruses also contain at least two additional genes. The gag gene encodes the proteins of the skeleton, matrix, nucleocapsid and capsid. The env gene encodes the envelope glycoproteins. Retroviral transcription is regulated by promoter regions or "enhancers" situated in highly repeated regions, the LTRs (Long Terminal Repeats), which are present at both ends of the retroviral genome.

[0005]During the infection of a cell, the polymerase makes a DNA copy of the RNA genome; this copy may then integrate into the human genome. Retroviruses do not kill the cells which they infect, but on the contrary often enhance their rate of growth. Retroviruses can infect germ cells or embryos at an early stage; under these conditions, they can integrate into the germ line and be transmitted vertically in a Mendelian fashion, which constitutes the closest relationship between a host and its parasite. These endogenous viruses can degenerate over generations of the host organism and lose their initial properties. However, some of them may conserve all or part of their properties or of the properties of their constituent motifs, or acquire novel functional properties that are advantageous for the host organism, which would explain the preservation of their sequence.

[0006]The existence of endogenous motifs having long open reading frames and/or subjected to a strong selection pressure can therefore be an indication of a preserved or acquired biological function, which may correspond to a benefit for the host organism. These retroviral sequences can also undergo, over the generations, subtle modifications which may unlock some of their potential and generate or promote pathological processes. It has recently become necessary to survey and identify these sequences so as to be able to evaluate their functional impact.

[0007]Human endogenous retroviral sequences or HERVs represent a substantial part of the human genome. These retroviral regions exist in several forms:

[0008]complete endogenous retroviral structures combining gag, pol and env motifs, flanked by repeat nucleic sequences which exhibit a significant analogy with the LTR-gag-pol-env-LTR structure of infectious retroviruses,

[0009]truncated retroviral sequences; for example the retrotransposons lack their env domain and the retroposons do not possess the env and LTR regions.

[0010]Up until now, the study of these regions of the genome has been neglected in humans for essentially two reasons:

[0011]the existence of insertions/deletions which can shift the reading frame and of mutations which modify the sequence. These modifications cause impairment of the structure and consequently of the biological function of these motifs,

[0012]the absence of confirmed associations with human pathological conditions.

[0013]Recent knowledge of fragments which are significantly representative of the human genome, and an orientation of research toward the structure/function of endogenous retroviral motifs, have made it possible to clarify the importance of these regions. The involvement of truncated or complete endogenous sequences in pathological conditions in animals is documented; for example, their association with tumor processes has been clearly demonstrated (S. K. Chattopadhyay et al., 1982, Nature, 295, 25-31). Research aimed at clarifying the association or the influence of HERVs in human pathological conditions is therefore now justified.

[0014]A classification of the HERV elements has been proposed (Tonjes R. R. et al., AIDS & Hum. Retroviral., 1996, 13, p 261-p 267; A. M. Krieg et al., FASEB J., 1992, 6, 2537-2544). It is based on a homology of these sequences with retroviruses isolated in animals, with the aid of heterologous retroviral probes. Indeed, in general, the HERVs exhibit relatively little homology with known human infectious retroviruses.

[0015]The class I families exhibit a sequence homology with the type C mammalian retroviruses; there may be mentioned in particular the ERI superfamily, close to the MuLV virus (murine leukemia virus) and to the BaEV virus (baboon endogenous virus).

[0016]The class II families exhibit a sequence homology with the type B mammalian retroviruses such as MMTV (mouse mammary tumor virus) or the type D retroviruses such as SRV (squirrel monkey retrovirus).

[0017]Other families have also been described; among these, there may be mentioned HERVs which exceptionally exhibit partial homology with HTLV-1 (RTVL-H) or primate viruses; HRES-1, for example, exhibits sequence homology with HTLVs.

[0018]Large-scale programmes for sequencing the human genome now make a significant number of novel retroviral sequences available. Data processing software packages make it possible to identify and analyse these genes. In this context, a systematic search covering all the information available to date has been initiated in order to identify novel human endogenous retroviral sequences according to certain analytical criteria:

[0019]presence of long open reading frames conserved during evolution of the host organism and which may suggest a biological function,

[0020]analogy with sequences already characterized outside or inside the retrovirus domain,

[0021]location in regions of susceptibility for certain pathological conditions or close to essential genes, for example in the cancer domain, regulation of the immune system or in certain neuropathological conditions.

[0022]The work carried out by the inventors on sequence databases allowed them to identify a set of endogenous retroviral sequences or motifs whose normal or pathological expression can promote or disrupt a protective effect in relation to pathological processes, or play a role in the onset or worsening of pathological conditions.

[0023]The subject of the present invention is a purified nucleic acid fragment, characterized in that it comprises all or part of a sequence encoding a human endogenous retroviral sequence, which has at least env-type retroviral motifs, corresponding to the sequence SEQ ID NO: 1 or to a sequence exhibiting a level of homology with said sequence SEQ ID NO: 1 greater than or equal to 80% on more than 190 nucleotides or greater than or equal to 70% on more than 600 nucleotides for the env-type domains.

[0024]The expression homologous sequence is understood to mean both a sequence which exhibits complete or partial identity with the above-mentioned sequence SEQ ID NO: 1 and a sequence which exhibits partial similarity with said sequence SEQ ID NO: 1.

[0025]According to an advantageous embodiment of said fragment, it has retroviral motifs corresponding to an env domain and corresponding to the sequence SEQ ID NO: 1 and retroviral motifs corresponding to a gag domain and corresponding to the sequence SEQ ID NO: 2 or to a sequence exhibiting a level of homology greater than or equal to 80% on more than 190 nucleotides or greater than or equal to 70% on more than 600 nucleotides for the env-type domains and a level of homology greater than or equal to 90% on more than 700 nucleotides or greater than or equal to 70% on more than 1200 nucleotides for the gag-type domains, said motifs having no insertion or deletion of more than 200 nucleotides.
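The thresholds of paragraphs [0023] and [0025] can be restated as a small check. The following Python sketch is illustrative only: `Alignment` and `meets_criteria` are hypothetical names, and it assumes that the "level of homology" is measured as the fraction of identical positions over the aligned length, which the text does not specify.

```python
from dataclasses import dataclass

# Thresholds restated from the text:
# (minimum homology fraction, aligned length in nt that must be exceeded)
CRITERIA = {
    "env": [(0.80, 190), (0.70, 600)],
    "gag": [(0.90, 700), (0.70, 1200)],
}
MAX_INDEL = 200  # no insertion or deletion of more than 200 nucleotides


@dataclass
class Alignment:
    domain: str         # "env" or "gag"
    identity: float     # assumed measure of homology, 0.0 to 1.0
    length: int         # aligned length in nucleotides
    max_indel: int = 0  # longest single insertion or deletion


def meets_criteria(aln: Alignment) -> bool:
    """True if the alignment satisfies either threshold for its domain."""
    if aln.max_indel > MAX_INDEL:
        return False
    return any(
        aln.identity >= min_identity and aln.length > min_length
        for min_identity, min_length in CRITERIA[aln.domain]
    )
```

For instance, `meets_criteria(Alignment("env", 0.72, 650))` is true under the second env threshold, while the same identity over 250 nucleotides fails both.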

[0026]Said fragments constitute a novel family of human endogenous retroviral sequences (HERV-7q family) which exhibits sequence homology with the MSRV retroviruses, as described in International Application WO 97/06260; said fragments according to the present invention have:

[0027]two repeat nucleotide motifs of 711 bp (FIG. 3), having characteristic signals identified in LTRs (Long Terminal Repeats): transcription promoters of the TATAA or CCAAT box type. These repeat domains delimit three deduced motifs of the gag, pol and env type (FIG. 2),

[0028]an env-type motif (positions 6965 nt-9550 nt on the sequence SEQ ID NO: 3 or in FIG. 1) which contains a long open reading frame of 1620 nucleotides (positions 7874-9493 of the sequence SEQ ID NO: 3 and FIG. 1) encoding a protein having an unpublished sequence of 540 amino acids called enverin (FIG. 4 and SEQ ID NO: 26; underlined fragment in FIG. 18). Inside the transmembrane domain of this env domain there is a peptide motif of the CKS-25/CKS-17 type (FIG. 5), recognized as having immunosuppressive functions on the host lymphocytic cells (M. Mitani et al., 1987, Proc. Natl. Acad. Sci. USA, 84, 237-240). A zinc finger type domain HX3-4HX22-33CX2C (Kulkosky et al., 1992, Mol. Cell. Biol., 12, 2331-2338), which is present in integrase-type domains, is identified in another reading frame. This particular env domain is the characteristic signature of the novel endogenous retroviral motifs,

[0029]a gag-type motif (positions 3065 nt-4390 nt on the sequence SEQ ID NO: 3) encoding protein motifs according to FIG. 6 (SEQ ID NO: 58) (positions 3118-4198 of SEQ ID NO: 3), which was identified by virtue of analogies with known gag domains. The region of major homology QX3EX7R is for example present (Benit et al., 1997, J. Virol., 71, 5652-5657). The nucleic acid binding motif CX2CX3-4HX4C, situated at the C-terminal position, is identified in another reading frame (Covey et al., 1986, Nucleic Acids Res., 14, 623-633). Upstream of the gag domain, a motif of 182 nucleotides is detected which is repeated twice (FIG. 1),

[0030]a pol domain which exhibits the conventional consensus of a retrovirus pol region at the level of the protease, reverse transcriptase and RNase H domains. A motif close to the consensus LLDTGA is found in pol (Weber et al., 1988, Science, 243, 928-931). The motifs D and AF, LPQ and SP, and YVDD (Xiong and Eickbush, 1990, EMBO J., 9, 3353-3362) are respectively found in the 3rd, 4th and 5th homology boxes. The motifs YTDGSS and TDS are present in the RNase H region,

[0031]gag and pol regions which could be considered as being joined, with a passage from the gag region to the pol region by a reading frame shift.
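The motif shorthand used in the list above (for example HX3-4HX22-33CX2C, where Xn-m stands for n to m arbitrary residues) maps directly onto regular expressions. The following Python sketch shows one way to do the conversion; `motif_to_regex` is a hypothetical helper name, and the peptide used in the usage example is a made-up toy string, not a sequence from the application.

```python
import re


def motif_to_regex(motif: str) -> str:
    """Convert shorthand such as 'QX3EX7R' into a regular expression.

    'X3' becomes '.{3}' and 'X3-4' becomes '.{3,4}'; all other
    characters are kept as literal one-letter residue codes.
    """
    def expand(match: re.Match) -> str:
        low, high = match.group(1), match.group(2)
        return f".{{{low},{high}}}" if high else f".{{{low}}}"

    return re.sub(r"X(\d+)(?:-(\d+))?", expand, motif)


# Motifs named in the text
ZINC_FINGER = motif_to_regex("HX3-4HX22-33CX2C")   # integrase-type zinc finger
NA_BINDING = motif_to_regex("CX2CX3-4HX4C")        # nucleic acid binding motif
MAJOR_HOMOLOGY = motif_to_regex("QX3EX7R")         # gag major homology region

# Toy peptide (hypothetical) containing one QX3EX7R occurrence
toy_peptide = "MK" + "Q" + "LVN" + "E" + "AAAAAAA" + "R" + "T"
hit = re.search(MAJOR_HOMOLOGY, toy_peptide)
```

Because the motifs are reported in different reading frames, such a search would in practice be run on the translation of each frame separately.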

[0032]The present invention includes the sequences belonging to the HERV-7q family as defined above (presence of the SEQ ID NO: 1 sequence or of a homologous sequence or presence of both the sequences SEQ ID NO: 1 and SEQ ID NO: 2) and in particular the sequences SEQ ID NO: 3-22, 28 and 61; it also includes the complementary nucleic sequences and the reverse sequences complementary to the preceding sequences as well as fragments derived from the coding regions of the preceding sequences corresponding to a shifting frame greater than or equal to 14 nucleotides or their complementary sequences (SEQ ID NO: 37-57, 59-60 and 121-122).

[0033]These various fragments may be advantageously used as primers or as probes (reagents A); they hybridize specifically under high stringency conditions to a sequence of the HERV-7q family.

[0034]Among these fragments, the following fragments may be preferably mentioned:

[0035]a fragment of 182 nucleotides, repeated twice, situated upstream of the gag domain at positions 2502-2611/2613-2865 of SEQ ID NO: 3:

[0036]Primers and Probes Specific for the gag Region

[0037]a sense primer G1F located in the region upstream of the gag domain of HERV-7q: 5'GGACCATAGAGGACACTCCAGGACTA3' (SEQ ID NO: 37);

[0038]an antisense primer G1R located in the terminal 3' region of the gag domain: 5'CCTCAGTCCTGCTGCTGGATCATCT3' (SEQ ID NO: 38)

[0039]the fragment of 1505 nt amplified by the pair G1F-G1R is used in order to generate the probes capable of hybridizing with the various PCR amplification products:

a nested sense primer G2F: 5'CCTCCAAGCAGTGGGAGGAAGAGAATT3' (SEQ ID NO: 39)
a nested antisense primer G2R: 5'CCTTCCCTGTGTTATTGTGGACATCATT3' (SEQ ID NO: 40)
a nested sense primer G4F: 5'GGAAGAAGTCTATGAATTATTCAATGATGT3' (SEQ ID NO: 41)
a nested sense primer G3F: 5'GGGACACAGAATCAGAACATGGAGATT3' (SEQ ID NO: 42)
a nested antisense primer G4R: 5'GCCTTCAGAAGAGTCAGGTGACAGAGA3' (SEQ ID NO: 43)
a nested antisense primer G5R: 5'GAGCCTCCAAAGTCCACTTGCCTGA3' (SEQ ID NO: 44)

[0040]Primers and Probes Specific for the env Region

a sense primer E1F: 5'GATTTCAGTATCTACTAGTCTGGGTAGAT3' (SEQ ID NO: 45)
an antisense primer E1R: 5'CTAGGAAATCCAGCTAGTCCTGTCTCA3' (SEQ ID NO: 46)

[0041]the fragment of 2529 nt, amplified by the pair of primers E1F-E1R, is used to generate the probes capable of hybridizing with the various PCR amplification products:

a sense primer E2F: 5'CCAAGACAGCCAACTTAGTTGCAGACAT3' (SEQ ID NO: 47)
an antisense primer E2R: 5'GGACGCTGCATTCTCCATAGAAACTCTT3' (SEQ ID NO: 48)
a sense primer E3F: 5'GCAATACTACATACACAACCAACTCCCAA3' (SEQ ID NO: 49)
an antisense primer E3R: 5'GGGGGAGGCATATCCAACAGTTAGTA3' (SEQ ID NO: 50)
a sense primer E4F: 5'CCATCTACACTGAACAAGATTTATACACTT3' (SEQ ID NO: 51)
an antisense primer E4R: 5'AATGCCAGTACCTAGTGCACCTAGCACT3' (SEQ ID NO: 52)
a sense primer E5F: 5'CGAATACAACGTAGAGCAGAGGAGCTTCGAA3' (SEQ ID NO: 53)
a sense primer E6F: 5'AGCCCAAGATGCAGTCCAAGACTAAGAT3' (SEQ ID NO: 54)
a primer E5R: 5'GCGTAGTAGAGGTTGTGCAGCTGAGAT3' (SEQ ID NO: 55)
a primer ExF: CCCTTACCAAGAGTTTCTATGGAGAAT (SEQ ID NO: 56)
a primer ExR: ACCGCTCTAACTGCTTCCTGCTGAATT (SEQ ID NO: 57)

[0042]All the oligonucleotides are designed so that a sense primer and an antisense primer can be generated by shifting the sequence of the reference primer by 1 to 7 nucleotides toward the 5' side or toward the 3' side; this modification of the sequence may change the size of the primer by 1 to 7 nucleotides, as the case may be. The primers chosen may also be optimized, where appropriate, by shortening or extension affecting 1 to 9 nucleotides.
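The shifting described in paragraph [0042] amounts to sliding a window along the template. The Python sketch below is a minimal illustration under stated assumptions: the function names are hypothetical, the window length is kept constant (the text also allows the length to change), and the 10-nt flanks placed around G1F are invented placeholders because SEQ ID NO: 3 is not reproduced here.

```python
def reverse_complement(seq: str) -> str:
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def shifted_primers(template: str, start: int, length: int, max_shift: int = 7):
    """Return {shift: primer} for windows moved by 1..max_shift nt.

    Negative shifts move the window toward the 5' side, positive shifts
    toward the 3' side. Windows that would leave the template are skipped.
    """
    variants = {}
    for shift in range(-max_shift, max_shift + 1):
        new_start = start + shift
        if shift == 0 or new_start < 0 or new_start + length > len(template):
            continue
        variants[shift] = template[new_start:new_start + length]
    return variants


G1F = "GGACCATAGAGGACACTCCAGGACTA"              # SEQ ID NO: 37, from the text
# Hypothetical flanks: NOT the real sequence surrounding G1F in SEQ ID NO: 3
toy_template = "ACGTACGTAC" + G1F + "TGCATGCATG"
variants = shifted_primers(toy_template, start=10, length=len(G1F))
```

An antisense counterpart of any variant can be obtained with `reverse_complement(variants[shift])`.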

[0043]Preferably, the hybridization, cloning, subcloning, production, preparation and analysis of the nucleic acids, peptides and antibodies, the sequencing of the nucleic acids and peptides, the in situ hybridization and the immunohistochemistry are carried out under the conditions described in the following books:

[0044]Current Protocols in Molecular Biology, Eds. F. M. Ausubel, R. Brent & R. E. Kingston et al. Greene Publishing Associates and Wiley Interscience.

[0045]Molecular Cloning: a laboratory manual. Eds. J. Sambrook, E. F. Fritsch & T. Maniatis, Cold Spring Harbor Laboratory Press, Cold Spring Harbor.

[0046]The Practical Approach series. Eds. D. Rickwood & B. D. Hames, IRL Press and Oxford University Press. In particular Antibodies I & II; DNA cloning I, II, III; Nucleic acid and protein sequence analysis; Nucleic acid hybridization; Nucleic acid sequencing; Oligonucleotide synthesis; Protein purification applications; Protein purification methods; Protein sequencing; Transcription and translation; Gel electrophoresis of nucleic acids; Gel electrophoresis of proteins; Genome analysis; HPLC of macromolecules; Human genetic diseases; Microcomputing in biology; Molecular neurobiology; Mutagenicity testing; Essential molecular biology I & II.

[0047]Proteome research: New frontiers in functional genomics, Eds. M. R. Wilkins et al., Springer.

[0048]The human endogenous retroviral sequence (SEQ ID NO: 3) situated on the long arm of chromosome 7 corresponds to the HERV-7q sequence; it is 10.5 kb long (FIGS. 1 and 2) and satisfies the criteria defined above.

[0049]The search for domains exhibiting total or partial similarity with the gag and env regions of HERV-7q resulted in the identification of novel endogenous retroviral sequences. These sequences may have the structure of a complete endogenous retrovirus such as the endogenous retroviral sequence situated close to the gene for the alpha and delta subunits of the T cell receptor, and consequently called HERV-TcR; by way of example, FIG. 7 shows the comparison of the nucleic alignments of the respective gag domains of HERV-7q and HERV-TcR (sequence HG12, SEQ ID NO: 19). Partial retroviral structures also exist. These retroviral domains, similar to HERV-7q, are identified in independent nucleic sequences as shown by their chromosomal location. Nucleic motifs (called here HEx or HGx, and analogous to env or gag type domains, respectively) resembling the env or gag domains of HERV-7q were found, with the aid of the above-mentioned databases:

[0050]HE2: chromosome 17 (SEQ ID NO: 4),

[0051]HE3 and HG3: chromosome 6 (SEQ ID NO: 5 and 6),

[0052]HE4: chromosome X (SEQ ID NO: 7),

[0053]HE5: chromosome X q22 (SEQ ID NO: 8),

[0054]HE6 and HG6: chromosome 1 q23.3-q24.3 (SEQ ID NO: 9 and 10),

[0055]HE7: chromosome 7 p15 (SEQ ID NO: 11),

[0056]HE8 and HG8: chromosome 19 (SEQ ID NO: 12 and 13),

[0057]HE9: chromosome X (SEQ ID NO: 14),

[0058]HE10: chromosome X q13.1-21.1 (SEQ ID NO: 15),

[0059]HE11 and HG11: chromosome 7 q21-22 (SEQ ID NO: 16 and 17),

[0060]HE12 and HG12, in HERV-TcR: chromosome 14 q11.2 (SEQ ID NO: 18 and 19),

[0061]HE13 (SEQ ID NO: 61): chromosome 6 q24.1-24.3

[0062]The present invention also includes the coding and noncoding fragments for all or part of enverin comprising at least 14 nucleotides and in particular the fragments encoding the C-terminal part of enverin, either from amino acid 291, or from amino acid 321, starting from the first methionine.
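The nucleotide coordinates of these C-terminal fragments follow from the ORF positions given in paragraph [0028] (7874-9493 on SEQ ID NO: 3). The Python sketch below works through the arithmetic; it assumes that the first methionine is codon 1 at position 7874 and that the 1620 nucleotides contain exactly 540 sense codons, as the figures in the text imply. `codon_start` is a hypothetical helper name.

```python
ORF_START = 7874  # first nucleotide of the env ORF on SEQ ID NO: 3 (from the text)
ORF_END = 9493    # last nucleotide of the env ORF on SEQ ID NO: 3 (from the text)


def codon_start(aa_index: int, orf_start: int = ORF_START) -> int:
    """1-based position of the first base of codon number aa_index."""
    return orf_start + 3 * (aa_index - 1)


orf_length = ORF_END - ORF_START + 1   # length of the ORF in nucleotides
n_codons = orf_length // 3             # number of codons in the ORF

# Start positions of the fragments encoding the C-terminal part of enverin
c_terminal_fragments = {aa: (codon_start(aa), ORF_END) for aa in (291, 321)}
```

Under these assumptions the fragment starting at amino acid 291 spans positions 8744-9493 and the one starting at amino acid 321 spans positions 8834-9493.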

[0063]These fragments comprise in particular a critical zone where two inserts of 12 nucleotides were characterized:

[0064]A first insert (sequence A) was identified in individuals of both groups (patients and controls). This insert, situated between amino acids 487 and 488, introduces the tetrapeptide VLQM. A comparative analysis shows that this insert is also found in a homologous region of the sequence HE13, which belongs to the HERV-7q family. The amplification of the HE13 type sequence could indicate that there is an impairment of the enverin sequence of HERV-7q, which would promote the amplification of the sequence contained in HE13. This observation also makes it possible to use this insert as a specific element for amplification of sequences of the HE13 type.

[0065]A second insert (sequence B) was identified in a patient with MS. The insert of 12 nucleotides is situated at the level of amino acid 495 and encodes the tetrapeptide MQSM. It is remarkable to observe that this insert is also identified in a homologous region situated in HE13.

Sequence A (SEQ ID NO: 59): TAAACTACAAATGGTTCTTCAAATGGAGCCCA
Sequence B (SEQ ID NO: 60): GATGCAGTCCAAGATGCAGTCCATGACTAAGA
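The link between the two 12-nucleotide inserts and the tetrapeptides VLQM and MQSM can be checked by translating sequences A and B in each of the three forward reading frames. The Python sketch below uses the standard genetic code; the function names are hypothetical and the choice of reading frame is inferred from the translation, since the text does not state it.

```python
from itertools import product

# Standard genetic code, codons enumerated in T, C, A, G order
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}

SEQ_A = "TAAACTACAAATGGTTCTTCAAATGGAGCCCA"  # SEQ ID NO: 59, from the text
SEQ_B = "GATGCAGTCCAAGATGCAGTCCATGACTAAGA"  # SEQ ID NO: 60, from the text


def translate(seq: str, frame: int = 0) -> str:
    """Translate complete codons of seq starting at offset frame (0, 1 or 2)."""
    return "".join(
        CODON_TABLE[seq[i:i + 3]] for i in range(frame, len(seq) - 2, 3)
    )


def frames_encoding(seq: str, peptide: str) -> list:
    """Forward reading frames whose translation contains the peptide."""
    return [frame for frame in range(3) if peptide in translate(seq, frame)]
```

With these definitions, `translate(SEQ_A, 1)` gives `KLQMVLQMEP` and `translate(SEQ_B, 1)` gives `MQSKMQSMTK`, so each tetrapeptide is found in the second forward frame of its sequence only.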

[0066]These observations demonstrate modifications of the enverin sequence of the HERV-7q type which constitute the basis for a detection strategy by allele-specific amplification (AS-PCR), making it possible to detect these differences in a population and which could correspond either to a mutation/deletion associated with a degree of susceptibility, or to a polymorphism, or to a mutation/deletion associated with a pathological condition such as multiple sclerosis.

[0067]The alignments of the env (FIG. 8) and gag (FIG. 9) domains explain the levels of homology observed between the sequences described above and the homologous sequences in HERV-7q. The analogies can extend to the flanking retroviral motifs.

[0068]Analysis of the sequence tags available in databases shows that transcripts belonging to some members of this family, in particular HERV-7q, are essentially expressed in tissues of foetal or placental origin.

[0069]Polypeptide sequences generated by these transcripts can therefore be potentially produced and biological functions or activities can be envisaged, by analogy with biologically active polypeptides of viral or retroviral origin; for example, the peptide motifs of the CKS-17 type (Haraguchi et al., PNAS, 1995, 92, 5568-5571) (FIG. 5) or CKS-25 type (Huang S. S. and Huang J. S., J. Biol. Chem. 1998, 273, 4815-4818) which have immunomodulatory functions on the lymphocytic host cells. The differences in sequence which are observed and possible normal or pathological modifications are in particular responsible for modulation of the function.

[0070]HERV-7q represents the paradigm of the novel family of human endogenous retroviral sequences or of endogenous retroviral motifs.

[0071]HERV-7q and some of the endogenous retroviral sequences belonging to its family have a pol-type domain analogous to pol-type retroviral sequences such as for example the pol region identified in the MSRV retrovirus associated with multiple sclerosis and described by H. Perron et al. (1997, Proc. Natl. Acad. Sci. USA, 94, 7583-7588; International Application PCT WO 97/06260).

[0072]However, the sequences according to the present invention are distinguishable from the infectious exogenous retroviral sequences analogous to MSRV previously described in that the gag and env sequences according to the invention are significantly different according to the criteria defined above and as a function of certain specific characteristics, for example the long open reading frame of the env domain of HERV-7q; they could serve as a signature of a pathological condition when they have insertions, deletions, reading frame shifts or mutations.

[0073]Indeed, the differences observed between the human sequences of the HERV-7q type, which are isolated from individuals reputed to be normal, and the sequences derived from some samples of pathological origin are not randomly distributed. Comparisons carried out between the gag region obtained from infectious retroviral particles (EMBL accession No.: A60168, A60200, A60201, A60171 and the like) and the corresponding gag sequence of HERV-7q (FIG. 9) show that the mutations preferentially affect non-sense codons. For example, two non-sense codons in HERV-7q are replaced by an arginine codon in A60200, which makes it possible to obtain a deduced sequence of 109 amino acids for HERV-7q and of 166 amino acids for A60200. The base changes consequently make it possible to extend the reading frame and to potentially encode larger sized polypeptide structures (FIG. 10).
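The effect described in paragraph [0073], where replacing a non-sense codon by an arginine codon lengthens the deduced polypeptide, can be illustrated on a toy example. The two short sequences below are hypothetical and were invented for this sketch; they are not HERV-7q or A60200, and the function name is likewise an assumption.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def codons_before_stop(seq: str) -> int:
    """Count sense codons read from position 0 up to the first stop codon
    (or to the end of the sequence if no stop codon is met)."""
    count = 0
    for i in range(0, len(seq) - 2, 3):
        if seq[i:i + 3] in STOP_CODONS:
            break
        count += 1
    return count


# Hypothetical toy sequences differing by a single base (TGA -> CGA)
toy_with_stop = "ATGGCT" + "TGA" + "GGTCTG" + "TAA"      # stops after 2 codons
toy_with_arginine = "ATGGCT" + "CGA" + "GGTCTG" + "TAA"  # CGA encodes arginine

short_length = codons_before_stop(toy_with_stop)
long_length = codons_before_stop(toy_with_arginine)
```

Here the single base change turns a 2-codon reading frame into a 5-codon one, which is the same mechanism, on a much smaller scale, as the 109 versus 166 amino acids reported in the text.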

[0074]Likewise, an env-type sequence obtained from infectious retroviral particles exhibits a significant analogy with the env domain of HERV-7q (FIG. 11). These marked analogies between exogenous and endogenous retroviral sequences could be responsible for the triggering or worsening of certain pathological processes, in particular certain autoimmune diseases such as multiple sclerosis. In this regard, it is possible to note that certain endogenous retroviral sequences described in the invention are situated close to or in regions reputed to exhibit susceptibility for multiple sclerosis: for example HERV-7q and the 7q21-22 region of chromosome 7; HE12 and HG12 in HERV-TcR and the region of the gene encoding the alpha and delta chains of the T cell receptor; HE2 and chromosome 17; HE3, HE13 and HG3 and chromosome 6; the sequences HE11 and HG11, around the region 7q21-22; or HE4, HE5, HE6, HE9, HE10 or HG10 on the X chromosome. These sequences would therefore be capable of providing the means for locating or identifying the genes for predisposition.

[0075]No significant homology is observed with endogenous retroviral sequences already described; on the other hand, a limited homology may be noted, which makes it possible to identify a general structure of the env domain; however, said homology is less than the criteria defined according to the invention between the env domains of the sequence HERV-7q (SEQ ID NO: 1) and the sequence HERV-9 (FIG. 12). FIG. 11 shows extensive homologies between the sequence HERV-7q and an exogenous retroviral sequence (accession No. EMBL: A60170).

[0076]The human endogenous retroviral sequences belonging to the HERV-7q family can protect against attacks linked to the environment or can be beneficial for the individual. This beneficial effect could be one of the possible reasons for the selection pressure exerted on some of these sequences and the potentially functional character of the deduced protein structures identified: for example the long open reading frame capable of encoding a novel protein and corresponding to the env domain of HERV-7q.

[0077]The human endogenous retroviral sequences belonging to the HERV-7q family could be associated, for example, with pathological conditions related to processes linked to cancer, to neuropathological conditions with an autoimmune component or to any other pathological process in association or otherwise with endogenous or exogenous viruses or retroviruses. Their action could be related to the outbreak, the worsening, the modification of the time of appearance or the protection against the disease.

[0078]In the context of application to autoimmune pathological conditions (such as for example lupus, Sjogren's syndrome, rheumatoid arthritis, multiple sclerosis and the like), significant analogies may be detected between the endogenous retroviral motifs identified and motifs found in retroviral structures characterized in patients with autoimmune pathological conditions such as multiple sclerosis; for example, fragments of gag domain (recently available in databases) obtained from infectious retroviral particles or the complete sequence of the pol domain corresponding to the MSRV virus associated with multiple sclerosis. These retroviral motifs possess significant analogies with homologous endogenous sequences of the HERV-7q type, which makes it possible to envisage direct or indirect association with pathological processes, including multiple sclerosis, in association or otherwise with MSRV.

[0079]The importance of these sequences goes beyond the context of autoimmune diseases. Apart from the general importance of retroviral motifs in the triggering or worsening of a tumor process, which is well established in particular in murine models (H. Fan in The Retroviridae, 1994, ed. J. A. Levy, Plenum, New York, p. 313-353), these sequences could be present close to or inside important genes and could alter the expression thereof: for example HERV-TcR and the genes for the alpha and delta subunits of the receptor for the T cells involved in disruptions of the immune system.

[0080]The present invention includes, in addition, the use of sequences combined with the sequences of the HERV-7q family for the detection and/or prognosis of various autoimmune diseases (neuropathological conditions in particular); these sequences encode all or part of a factor whose function, the regulation/de-regulation or alteration (polyadenylation, alternative splicing), is associated with the normal or pathological expression or with the regulation/deregulation of the motifs belonging to the HERV-7q family and correspond to transcripts or cDNAs of the nucleotide sequences encoding genes situated in regions flanking or delimiting retroviral sequences of the HERV-7q family.

[0081]The expression flanking region is understood to mean any region situated close to (contained in or including) an endogenous retroviral sequence belonging to the HERV-7q family, as defined above, up to and including the genes immediately contiguous and/or situated at a distance which cannot exceed 120 kb.

[0082]The inventors have now found that the presence of the retroviral sequences as defined above disrupts the expression or impairs the structure of the flanking sequences defined below.

[0083]The transcripts of said flanking sequences (and fragments thereof, in particular those underlined or in italics in FIGS. 14-16, 22-26), as defined below:

[0084]at 1021 bp upstream of HERV-7q, there is identified an endogenous retroviral sequence called RH7 (SEQ ID NO: 62 and FIG. 22); this sequence is situated 5' of the HERV-7q sequence; in FIG. 22, the portion in italics corresponds to the beginning of the HERV-7q sequence; the RH7 sequence is underlined; two putative polyadenylation sites are in bold. This sequence SEQ ID NO: 62 exhibits significant homology, on more than 6 kb, with RGH-type endogenous retroviral sequences (FIG. 13). Sequences belonging to this family are expressed in particular in patients with rheumatoid osteoarthritis (Nakagawa et al., (1997), Arthritis Rheum., 40, 627-638). The present invention also includes fragments of the sequence SEQ ID NO: 62, comprising between 14 and 50 nucleotides (used as primers), preferably between 14 and 25 nucleotides, or at least 25 nucleotides (used as probe), which fragments have the following characteristics: the 4 nucleotides of the 3' end are different from the corresponding motifs of the sequence RGH2 (bottom sequence in FIG. 13, GenBank accession No.: D11018),

[0085]at less than 9 kb upstream of HERV-7q, there is identified the sequence RAM75 (SEQ ID NO: 63 and FIG. 14) containing the 24 coding exons (which cover close to 41 kb) of the gene for peroxisomal ATPase PEX1. PEX1, in combination with PEX6, is responsible for the import of peroxisomal proteins and for stabilizing the PEX5 receptor. A disruption/alteration affecting PEX1 is responsible for various neuropathological conditions such as Zellweger syndrome, neonatal adrenoleukodystrophy and the infantile form of Refsum's disease (Reuber et al., (1997), Nature Genet., 17, 445-448). It can be recalled that the main function of the peroxisomes is associated with the metabolism of fatty acids, in particular by β-oxidation processes. Impairment of the gene identified in the sequence RAM75, or of its expression, by modification of the function of the regulatory 5' and 3' regions or by modification of the splicings or of the polyadenylation processes, in particular under the influence of neighboring retroviral motifs, would be able to disrupt the expression and the structure of ATPase and consequently to disrupt one of the peroxisomal functions, in particular the metabolism of lipids, in particular myelin lipids, with consequences for certain pathological conditions, including neuropathological conditions such as multiple sclerosis; the underlined portions (FIG. 14) correspond to the 24 coding exons.

[0086]The present invention also includes the fragments of the sequence SEQ ID NO: 63, included in the abovementioned 24 coding exons and comprising at least 14 nucleotides.

[0087]Analysis of the expression profile (transcripts and proteins) of the sequence RAM75 (SEQ ID NO: 63) is a good indicator for the differential diagnosis of neuropathological conditions with an autoimmune component.

[0088]In FIG. 14, the coding exons are underlined. The initiation and non-sense codons as well as the putative polyadenylation sites are in bold and underlined;

[0089]at 0.7 kb downstream of the sequence HERV-7q and over nearly 17 kb (SEQ ID NO: 64 and FIG. 15), there is identified the nucleotide sequence RAV73, in which sequence tags and potential exons capable of producing one or more polypeptide sequences are detected; the invention also includes fragments of this sequence SEQ ID NO: 64 included in the sequence tags and the potential exons as they appear (underlined portions) in FIG. 15, which fragments comprise at least 14 nucleotides,

[0090]at 120 kb upstream of the sequence HG3, and over 15 kb, there is the nucleotide sequence RBP3 (SEQ ID NO: 65 and FIG. 23), which covers the 3' end of the gene encoding a transcription factor of the Blimp-1 family (SEQ ID NO: 119 and FIG. 25), a protein of 789 amino acids which is a repressor of the expression of the interferon-beta gene (Keller and Maniatis, Genes Dev., (1991), 5, 868-879), which is already associated with certain malignant pathological conditions (Mock et al., Genomics, (1996), 37, 24-28), and which could play a role in the differentiation and the pathogenesis of B cells. The possible association of the endogenous retroviral sequence containing the motifs HG3 and HE3 with Blimp-1 is of considerable interest in the case of pathological conditions, and in particular multiple sclerosis. Blimp-1 acts in particular on the B cells, whose contribution to the inflammatory processes associated with multiple sclerosis is known. Blimp-1 is capable of blocking the viral induction of the IFNβ promoter, IFNβ being known for its capacity to reduce the frequency of attacks and the progression of lesions in patients with MS. Disruption in the expression or the structure of Blimp-1, in relation to a retroviral element of the HERV-7q type, is consequently associated with neuropathological conditions or with diseases having an autoimmune character, such as multiple sclerosis; this nucleotide sequence RBP3 (SEQ ID NO: 65) contains nucleotide motifs identified in the nucleic sequence encoding the Blimp-1 gene; the invention also includes the detection of the mRNA sequences for the Blimp-1 protein (SEQ ID NO: 119),

[0091]the endogenous retroviral sequence of the HERV-7q type, containing HE3 and HG3, is situated in the HI3 region corresponding to an intron extending over more than 46 kb (SEQ ID NO: 66), of a gene which could encode the analogue of APS (FIG. 24), a protein of 275 amino acids specific to apoptosis, overexpressed in various cells in culture after triggering an apoptotic process (Hammond et al., FEBS Lett., (1998), 425, 391-395). The intron is situated at the level of amino acid 231 of APS. The end of HE3 is at more than 12 kb from the 5' end of the intron, whereas HG3 is situated at more than 28 kb from the 3' end of the intron. Apoptotic processes are associated with multiple sclerosis. In particular, there has been described an apoptotic process affecting astrocytes and oligodendrocytes in the presence of a purified fraction of cerebrospinal fluid of patients suffering from multiple sclerosis (Menard et al., J. Neurol. Sci., (1998), 154, 209-221).

[0092]Finally, it should be stressed that the nucleic region containing HE3, HG3, HI3 and RBP3 is located at the level of the short arm of chromosome 6, in 6p21, which is a proposed region of susceptibility to multiple sclerosis (The Multiple Sclerosis Genetic Group, Nature Genet., (1996), 13, 469-472).

[0093]The interaction between the HERV-7q type sequences and the flanking sequences and the importance of establishing a profile of expression including one or more of the abovementioned sequences in order to establish a differential diagnosis of a neuro-pathological condition is even more evident because it is observed that the sequences HG12 and HE12 are situated in an intron region of the gene encoding the alpha and delta subunits of the T cell receptors. The T cell receptors are involved in the immune regulation process and their influence has been proposed in the case of autoimmune diseases, including multiple sclerosis.

[0094]The subject of the invention is also transcripts generated from the abovementioned sequences as well as those optionally exhibiting modifications in the reference sequences described in the invention when they are expressed in certain patients.

[0095]Indeed, the systems for regulating the expression of the retroviral proteins of HERV-7q, which are present in the LTR type motifs, could influence the expression of genes situated in the close or distant chromosomal vicinity and could induce disruptions of an immunological and/or neurological character. For example, the endogenous retroviral sequence HERV-TcR exists in the immediate vicinity of the genes for the alpha and delta subunits of the T cell receptor previously described. The LTR-type motifs could also encode superantigens (Acha-Orbea and Palmer, 1991, Immunol. Today, 12, 356-361). In general, retroviral proteins of the HERV-7q or related type, or their truncated or partial forms could be involved in cytotoxicity or superantigenicity phenomena, such as for example those derived from the long open reading frame identified in the env domain (FIG. 4).

[0096]Sequences of the HERV-7q 5' and 3' LTR type, which are highly conserved, are involved in such regulatory effects. By way of example, LTX is described, which is a sequence comparable to that of an HERV-7q LTR (SEQ ID NO: 67 and FIG. 16), and which is present in the center of an intron of more than 49 kb, but at 2 kb from the donor 5' site of the FMR2 gene associated with fragile X and encoding a protein of 1311 amino acids (FIG. 26). The LTRs modulate the alternative splicing (Kapitonov and Jurka, (1999), J. Mol. Evol., 48, 248-251), the expression of the gene, the binding to nuclear proteins (Akopov et al., (1998), FEBS Lett., 421, 229-233), or allow the production of an alternative polyadenylation signal (Goodchild et al., (1992), Gene, 121, 287-294).

[0097]In general, there may be noted the existence of several endogenous retroviral sequences of the HERV-7q type (HE4, HE5, HE9, HE10), situated at the level of chromosome X which represents the chromosome associated with the largest number of pathological conditions.

[0098]In this regard, it is possible to note that retroviral motifs derived from defective regions are capable of having biological functions; for example, the envelope protein p15E, derived from defective retroviral motifs, possesses an anti-inflammatory and immunosuppressive activity (Snyderman and Cianciolo, 1984, Immunol. Today, 5, 240-244).

[0099]These structures are probably capable of causing breaks or of amplifying deregulations in the immune defense processes. Some of the motifs of the gag, env and LTR-type domains may be associated with a particular function or may contribute to the normal or pathological function of the flanking domains as defined above (SEQ ID NO: 62-67). Recombinations with an element of exogenous, retroviral origin or otherwise can give rise to the production of nucleic or protein motifs which could either protect or trigger or promote or worsen a pathological condition. Likewise, a retroviral structure containing endogenous retroviral elements according to the invention would be capable of causing a pathological process after passing through an exogenous transient cycle followed by reintegration into a sensitive or critical region of the human genome.

[0100]It is thus possible to obtain expression profiles (transcripts and optionally proteins) which correspond to the abovementioned neuropathological conditions.

[0101]Likewise, the combination of motifs belonging to the HERV-7q family, or of elements induced by motifs belonging to the HERV-7q family, with motifs of exogenous origin or induced exogenously would be capable of triggering or worsening a pathological process or on the contrary of promoting protection or partial remission or a complete and permanent cure.

[0102]The detection of the HERV-7q type domains thus made possible suggests applications at the prophylactic, prognostic and diagnostic level; for example, immunological approaches or gene amplification, which make it possible to compare patients with normal individuals serving as reference, could promote screening, improve early detection of the outbreak of the disease and/or allow monitoring of the progression of a pathological condition in patients who may exhibit a susceptibility, in patients in whom there has been an outbreak of the disease, or in individuals considered to be normal based on current clinical criteria.

[0103]The specific nucleic and immunological probes, as defined, in the present invention are capable of promoting the identification and detection of motifs which are abnormally expressed in the context of pathological conditions associated with cancer, or of neuropathological conditions, in particular autoimmune pathological conditions, at the forefront of which is multiple sclerosis.

[0104]The subject of the present invention is also hybrid nucleic sequences, characterized in that they comprise sequences or motifs belonging to the HERV-7q family, or of elements induced by motifs belonging to the HERV-7q family, with motifs of exogenous origin or induced exogenously (exogenous retroviral sequences); such hybrid sequences are probably capable of triggering or worsening a pathological process or on the contrary of promoting protection or partial remission or a complete and permanent cure.

[0105]The subject of the present invention is also a diagnostic reagent for the differential detection of complete or partial human endogenous nucleic sequences, having retroviral motifs, selected from the sequences SEQ ID NO: 1 and/or SEQ ID NO: 2, characterized in that it is selected from the group consisting of the sequences SEQ ID NO: 1-22, 28, 37-57, 59-61 and 121-122, the complementary nucleic sequences and the reverse sequences complementary to the preceding sequences, of nucleotide fragments capable of defining or of identifying the sequences SEQ ID NO: 1 and/or SEQ ID NO: 2 and any flanking sequence or any sequence overlapping them as well as of fragments derived from the coding regions of the sequences SEQ ID NO: 1-22 and 61, corresponding to a shifting frame greater than or equal to 14 nucleotides or their complementary sequences, optionally labeled with an appropriate marker as well as of sequences as defined in FIGS. 18-21.
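The notion of fragments "corresponding to a shifting frame greater than or equal to 14 nucleotides" can be illustrated by a short sketch that enumerates every window of a chosen length along a sequence, together with a helper for the reverse complementary sequence. The 20-nucleotide `example` string and the function names are hypothetical and serve only as an illustration; they are not sequences or procedures taken from the present application.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    complement = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(complement[base] for base in reversed(seq.upper()))


def sliding_fragments(seq: str, length: int = 14):
    """Yield (1-based start, fragment) for every window of `length` nucleotides."""
    seq = seq.upper()
    for start in range(len(seq) - length + 1):
        yield start + 1, seq[start:start + length]


# Hypothetical 20-nt stretch standing in for a coding region (assumption).
example = "ATGGCCCTTCCTTATCATAT"
fragments = list(sliding_fragments(example, 14))

for start, fragment in fragments:
    # Each fragment and its reverse complement is a candidate probe sequence.
    print(start, fragment, reverse_complement(fragment))
```

Longer fragments are obtained by passing a larger `length`; the window always advances by one nucleotide.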

[0106]The sequences of the nucleic, ribonucleic and oligonucleotide probes used will be chosen from the env and gag regions or their flanking regions; for example the oligonucleotide primers for HERV-7q will be chosen from the regions situated between nucleotides 3065 and 4390, nucleotides 6965 and 9550 or nucleotides 2502-2865 of SEQ ID NO: 3, as well as from any adjacent sequence (upstream or downstream) capable of allowing specific amplification (FIG. 1).

[0107]Among the appropriate markers, there may be mentioned radioactive isotopes, enzymes, fluorochromes, chemical markers (biotin), haptens (digoxigenin) and antibodies or appropriate base analogues.

[0108]Preferably:

[0109]said reagent is selected from the sequences SEQ ID NO: 37-57 and is capable of being used as a primer,

[0110]said reagent is selected from the following sequences:

[0111]a fragment of 1505 nt amplified by the pair of primers SEQ ID NO: 37 and SEQ ID NO: 38 (primers G1F and G1R),

[0112]a fragment of 2529 nt amplified by the pair of primers SEQ ID NO: 45 and SEQ ID NO: 46 (primers E1F and E1R),

[0113]a fragment of 182 nucleotides, repeated twice, situated upstream of the gag domain at positions 2502-2611/2613-2865,

[0114]fragments encoding or not encoding all or part of enverin, comprising at least 14 nucleotides and in particular the fragments encoding the C-terminal portion of enverin, either from amino acid 291, or from amino acid 321, starting from the first methionine,

[0115]and is capable of being used as a probe.

[0116]The subject of the present invention is also a method for the rapid and differential detection of the endogenous retroviral nucleic sequences of the env or env and gag type, their normal or pathological variants, by hybridization and/or gene amplification, carried out using a biological sample, which method is characterized in that it comprises:

[0117](a) a step in which a biological sample to be analysed is brought into contact with at least one probe as defined above, and

[0118](b) a step in which the product(s) resulting from the nucleotide sequence-probe interaction is detected by any appropriate means.

[0119]Said method may additionally comprise:

[0120]prior to step (a):

[0121]a step of preparing the relevant biological tissue or fluid,

[0122]a step of extracting the nucleic acid to be detected, and

[0123]at least one gene amplification cycle,

and subsequent to step (b):

[0124]a step of comparing the nucleic sequences obtained in said biological sample with the human endogenous retroviral sequences according to the invention by any appropriate means and in particular by sequencing, Southern blotting, restriction cleavage, SSCP or any other method which makes it possible to identify an insertion or a deletion or a single mutation between the various sequences compared.

[0125]In accordance with the invention, the human endogenous retroviral sequences according to the invention are thus compared with the nucleic sequences present in the biological sample to be analysed and allow the detection of homologous sequences from patients suffering from pathological conditions likely to involve a modification of their genome.

[0126]Advantageously, said gene comparisons are carried out using genomic DNA obtained from control individuals and from patients.

[0127]A conventional gene amplification by PCR will be carried out with the aid of 5'-sense and 3'-antisense primers delimiting or comprising the zone to be studied (env zone or gag zone).

[0128]Also advantageously, the sequences of the nucleic, ribonucleic and oligonucleotide probes used are chosen from the env and gag regions or their flanking regions; for example the oligonucleotides which are primers for HERV-7q will be chosen from the regions situated between nucleotides 3065 and 4390 and nucleotides 6965 and 9550, and from any adjacent sequence (upstream or downstream) capable of allowing specific amplification (FIG. 1), as specified above. They are preferably selected from the group consisting of

[0129]a fragment of 1505 nt amplified by the pair of primers SEQ ID NO: 37 and SEQ ID NO: 38 (primers G1F and G1R),

[0130]a fragment of 2529 nt amplified by the pair of primers SEQ ID NO: 45 and SEQ ID NO: 46 (primers E1F and E1R).
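The relationship between a primer pair and the fragment it delimits can be sketched in silico as follows. This is a minimal illustration under stated assumptions: exact primer matches only, a toy template, and toy 10-nucleotide primers that are hypothetical and are not the primers G1F/G1R or E1F/E1R of the application.

```python
def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    complement = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(complement[base] for base in reversed(seq.upper()))


def amplicon(template: str, forward: str, reverse: str):
    """Return the fragment delimited by a primer pair, or None if a primer
    has no exact match. The forward primer is searched on the given strand;
    the reverse primer anneals to it, so its reverse complement is searched."""
    template = template.upper()
    start = template.find(forward.upper())
    if start < 0:
        return None
    reverse_site = reverse_complement(reverse)
    end = template.find(reverse_site, start + len(forward))
    if end < 0:
        return None
    return template[start:end + len(reverse_site)]


# Toy template and primers (assumptions, for illustration only).
template = "TTTT" + "ACGTACGTAC" + "GGGGGGGGGG" + "CATTCAGGAT" + "AAAA"
forward_primer = "ACGTACGTAC"
reverse_primer = "ATCCTGAATG"  # reverse complement of CATTCAGGAT

product = amplicon(template, forward_primer, reverse_primer)
print(len(product), product)
```

Applied to a real template, the length of the returned fragment can be compared with the expected amplicon size (for example 1505 nt or 2529 nt) as a consistency check.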

[0131]The gene amplification step is in particular carried out with the aid of one of the following gene amplification techniques: amplification using Qβ-replicase, PCR, LCR, ERA, CPR or SDA.

[0132]The subject of the present invention is also chimeric sequences, characterized in that they consist of a fragment of 17 to 40 nucleotides of a flanking sequence as defined above combined with an endogenous retroviral motif of the HERV-7q type comprising between 17 and 40 nucleotides, as defined above.

[0133]The subject of the present invention is also a method of detecting transcripts as defined above, characterized in that it comprises: [0134]collecting messenger RNAs obtained from control biological samples (biological tissues, cells or fluids) and from a similar sample collected from patients, and [0135]the qualitative and/or quantitative analysis of said mRNAs by in situ hybridization, by dot-blot, Northern blotting, RNAse mapping or RT-PCR, with the aid of a diagnostic reagent as defined above.

[0136]The subject of the present invention is also a method for the detection and/or evaluation of an overexpression/underexpression or of a modification of at least one of the endogenous retroviral sequences or fragments of sequences of the HERV-7q type and/or of their associated flanking sequences, characterized in that it comprises:

[0137]depositing on an appropriate support, such as for example a nylon filter, a glass slide or their equivalent, cDNA or its equivalent obtained from clones, PCR products obtained from genomic DNA, RT-PCR products obtained from transcripts or from specific oligonucleotide sequences, said DNA sequences being endogenous retroviral sequences or fragments of sequences of the HERV-7q type and/or their flanking sequences, as defined above, consisting of transcripts and cDNAs of the genomic sequences, which encode all or part of a factor, whose function, regulation/deregulation or alteration is associated with the normal or pathological expression or with the regulation/deregulation of motifs belonging to said HERV-7q family, these sequences corresponding to nucleotide sequences encoding genes situated in flanking regions situated upstream and/or downstream of a retroviral sequence of said HERV-7q family and in which one of the ends cannot be at a distance exceeding 120 kb, and/or a chimeric sequence as defined above,

[0138]the hybridization of said support with at least one appropriately labeled probe obtained, for example, by reverse transcription of an RNA mixture obtained from biological cells, tissues or fluids obtained from controls reputed to be normal, from members of various ethnic populations, from patients suffering from pathological conditions often associated with expression of retroviruses, such as tumor processes, or such as autoimmune diseases, and

[0139]the detection of the hybrids formed.

[0140]According to an advantageous embodiment of said method, said transcript or cDNA is selected from the group consisting of the sequences SEQ ID NO: 62-67 and 119 and their fragments corresponding to a shifting frame greater than or equal to 14 nucleotides or their complementary sequences.

[0141]According to another advantageous embodiment of said method, said support comprises, in addition, any endogenous or exogenous retroviral sequence.

[0142]The method of DNA chips (Bowtell, (1999), Nature Genet., 21, 25-32) is used to evaluate the modification of the expression of all or part of some of the sequences of retroviral origin of the HERV-7q type and flanking sequences. Briefly, DNA obtained from clones, PCR products obtained from genomic DNA, RT-PCR products obtained from transcripts or specific oligonucleotide sequences are deposited on a support, such as for example a nylon filter, a glass slide or their equivalent. The deposited nucleic sequences cover the various retroviral domains described above, as well as the contiguous sequences and the flanking genes. In order to detect possible alternative splicing processes, specific DNAs are synthesized in steps of 500-600 nucleotides with an overlap of 250-300 nucleotides on either side. The alternative splicings already identified will be the subject of a specific synthesis. The hybridization is carried out with the aid of a probe obtained, for example, by reverse transcription of an RNA mixture obtained from biological cells, tissues or fluids obtained from controls reputed to be normal, members of the various ethnic populations, patients suffering from pathological conditions often associated with expression of retroviruses, such as tumor processes, or such as autoimmune diseases, including multiple sclerosis. In this case, a fraction of a μg up to a few μg of mRNA, or a few μg up to a few tens of μg of RNA, depending on the method used and the size of the DNA chip involved, are sufficient for the synthesis of the nucleic probe. The nucleic probe is suitably labeled so as to allow subsequent detection, such as for example by fluorescence or by an equivalent method.
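By way of illustration, the tiling scheme mentioned above (DNAs of 500-600 nucleotides overlapping their neighbors by 250-300 nucleotides) can be sketched as a computation of tile coordinates. This is one possible reading of the scheme, assuming a fixed tile length of 500 nucleotides and a fixed overlap of 250 nucleotides; the function name and the 1200-nucleotide region are hypothetical.

```python
def tile_coordinates(region_length: int, tile_length: int = 500, overlap: int = 250):
    """Return 1-based inclusive (start, end) coordinates of overlapping tiles
    covering a region; the last tile is truncated at the end of the region."""
    if not 0 <= overlap < tile_length:
        raise ValueError("overlap must be non-negative and smaller than tile_length")
    step = tile_length - overlap
    tiles = []
    start = 0
    while start < region_length:
        end = min(start + tile_length, region_length)
        tiles.append((start + 1, end))
        if end == region_length:
            break
        start += step
    return tiles


# Hypothetical 1200-nt region (assumption, for illustration only).
tiles = tile_coordinates(1200)
print(tiles)
```

With an overlap of half the tile length, every interior nucleotide is covered by two tiles, so an exon junction falling near the edge of one tile lies near the center of the next.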

[0143]The use of bi- or even multicolored probes makes it possible to specify the concerted expression of several genes in parallel, while taking advantage, furthermore, of a precise normalization. The results are acquired automatically, such as for example by a laser scanning system or its equivalent.

[0144]Two types of DNA chips are designed, on the one hand chips having an exhaustive set of sequences, and on the other hand specific DNA chips enabling targeting to a more specific application.

[0145]For example, a sequence that is critical in that it contains a difference such as a deletion or a mutation is detected with the aid of specific oligonucleotides (Wang et al., (1998), Science, 280, 1077-1082). The polymorphism associated with a base or with a mutation is detected with the aid of four oligonucleotides possessing one of the four sequence possibilities at the level of a base (A, C, G or T); for each point difference, the 4 oligonucleotides are deposited and the hybridization intensities are compared. Furthermore, an alternative splicing is detected using DNAs corresponding to a single effective or putative exon; the gene is therefore analyzed exon by exon. The DNA chips also relate, by extension, to any endogenous or exogenous retroviral sequence (such as for example ERV-9, ERV-K, ERV-L, ERV-H, ERV-4, ERV-6, ERV-8, ERV-10, ERV-15, ERV-16, ERV-17, ERV-18, ERV-21, ERV-24, ERV-33, ERV-34, ERV-36, ERV-40, ERV-42, ERV-MLN, ERV-FRD, ERV-FTD and the like), as well as to all the putative exon sequences (identified by the existence of sequence tags and corresponding transcripts) or effective exon sequences which are situated on either side, up to a distance of 120 kb, of the endogenous retroviral sequences of the HERV-7q type.
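The four-oligonucleotide principle described above can be sketched as follows: for a given position, four oligonucleotides are generated that differ only at that position, and the base is called from the strongest hybridization signal. The reference string, the flank length of 8 nucleotides and the intensity values are hypothetical assumptions chosen for illustration; real chip designs may use other oligonucleotide lengths.

```python
def allele_probes(reference: str, position: int, flank: int = 8):
    """Return {base: oligonucleotide} for the four possible bases at a
    1-based position, each oligonucleotide keeping `flank` reference
    nucleotides on either side of the interrogated base."""
    reference = reference.upper()
    index = position - 1
    if index - flank < 0 or index + flank >= len(reference):
        raise ValueError("not enough flanking sequence around the position")
    left = reference[index - flank:index]
    right = reference[index + 1:index + flank + 1]
    return {base: left + base + right for base in "ACGT"}


def call_base(intensities: dict) -> str:
    """Return the base whose oligonucleotide gives the highest intensity."""
    return max(intensities, key=intensities.get)


# Hypothetical reference and intensities (assumptions, for illustration only).
reference = "GATTACAGATTACAGATTACA"
probes = allele_probes(reference, position=11)
measured = {"A": 0.10, "C": 0.20, "G": 0.10, "T": 0.90}
print(probes)
print(call_base(measured))
```

In practice the comparison would also take into account the ratio between the best and second-best signals, which this sketch does not attempt.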

[0146]The comparative study is carried out between a control sample and the sample to be tested, in a prophylactic, diagnostic or therapeutic perspective, such as for example the early detection of a modification of the expression of one of the sequences, in a cell, a tissue or an organism, the identification of a sequence associated with a susceptibility or with any pathological condition, the monitoring of the progression of the pathological condition or the monitoring of a treatment and the evaluation of its efficacy.

[0147]Apart from the applications already mentioned, the method more generally makes it possible to assess the changes observed in an individual, which constitutes to a certain extent an identity card and facilitates an epidemiological approach in which novel correlations can be established between a particular observed profile and a pathological condition, without any a priori assumption regarding this pathological condition.

[0148]The subject of the present invention is also a kit for the detection and/or evaluation of an auto-immune disease and in particular of neuropathological conditions with an autoimmune etiology, characterized in that it comprises, in addition to the buffers necessary for carrying out the methods as defined above: [0149]diagnostic reagents A as defined above, and [0150]reagents B consisting of the transcripts and cDNAs of the genomic sequences, which encode all or part of a factor, whose function, regulation/de-regulation or alteration is associated with the normal or pathological expression or with the regulation/de-regulation of motifs belonging to said HERV-7q family, these sequences corresponding to nucleotide sequences encoding genes situated in flanking regions situated upstream and/or downstream of a retroviral sequence of said HERV-7q family, of which one of the ends cannot be at a distance exceeding 120 kb, [0151]which reagents are preferably attached to an appropriate support.

[0152]According to an advantageous embodiment of said kit, said reagents B are selected from the group consisting of the sequences SEQ ID NO: 62-67 and 119 and their fragments corresponding to a shifting frame greater than or equal to 14 nucleotides or their complementary sequences, as well as the sequences represented in FIGS. 13-17, 22-26.

[0153]The subject of the present invention is also products of translation, characterized in that they are encoded by a nucleotide sequence as defined above.

[0154]The subject of the present invention is also a peptide, characterized in that it is capable of being expressed with the aid of a nucleotide sequence selected from the group consisting of the sequences SEQ ID NO: 1-22, 28 and 61, as defined above, according to the combinations offered by the use of the various possible reading frames (see also FIGS. 18-21).

[0155]Said peptide also includes the derived peptides or polypeptides comprising between 5 and 540 amino acids (SEQ ID NO: 23-36 and SEQ ID NO: 58 and their fragments of at least 5 amino acids) and in particular a fragment of 538 amino acids, starting at the first methionine of the sequence SEQ ID NO: 26 (enverin).

[0156]According to an advantageous embodiment of said peptides they are in particular selected from the sequences SEQ ID NO: 23-36, 58, in particular the sequence SEQ ID NO: 26 and its C-terminal fragments, either from the amino acid 291, or from the amino acid 321, starting from the first methionine.

[0157]According to another advantageous embodiment of said peptides, they are obtained from nucleic sequences as defined above, in which at least one non-sense codon may be replaced with a codon encoding one of the following amino acids: Phe (F), Leu (L), Ser (S), Tyr (Y), Cys (C), Trp (W), Gln (Q), Arg (R), Lys (K), Glu (E) or Gly (G).
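As an illustration of the replacement of a non-sense codon, the following sketch enumerates, under the standard genetic code, the sense codons that differ from each stop codon (TAA, TAG, TGA) by a single nucleotide, and the amino acids they encode. It is an assumption of this sketch that only single-nucleotide changes are considered; under that assumption ten of the listed residues are reached, while Phe would require two changes.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def single_substitution_rescues(stop_codon: str) -> dict:
    """Return {codon: amino acid} for every sense codon that differs from
    `stop_codon` by exactly one nucleotide."""
    rescues = {}
    for position in range(3):
        for base in BASES:
            if base == stop_codon[position]:
                continue
            codon = stop_codon[:position] + base + stop_codon[position + 1:]
            amino_acid = CODON_TABLE[codon]
            if amino_acid != "*":
                rescues[codon] = amino_acid
    return rescues


reachable = set()
for stop in ("TAA", "TAG", "TGA"):
    rescues = single_substitution_rescues(stop)
    reachable |= set(rescues.values())
    print(stop, rescues)

print(sorted(reachable))
```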

[0158]The invention thus includes the deduced peptides or the deduced proteins corresponding to all or part of the nucleic sequences described in the invention, and optionally exhibiting modifications with the reference sequences described in the invention, when they are expressed in some patients. In particular, the invention includes the complete or partial sequences obtained according to the 3 sense reading frames and the 3 reverse and complementary reading frames (see FIGS. 18-21).
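The six reading frames referred to above (three sense frames and three reverse complementary frames) can be sketched as follows, assuming the standard genetic code. The 9-nucleotide input is a hypothetical toy sequence; stop codons are rendered as `*` and codons containing unexpected characters as `X`.

```python
from itertools import product

# Standard genetic code, codons ordered with bases T, C, A, G at each position.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): amino_acid
    for codon, amino_acid in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    complement = {"A": "T", "C": "G", "G": "C", "T": "A"}
    return "".join(complement[base] for base in reversed(seq.upper()))


def translate(seq: str) -> str:
    """Translate complete codons from the first nucleotide; '*' marks a stop."""
    return "".join(
        CODON_TABLE.get(seq[i:i + 3], "X") for i in range(0, len(seq) - 2, 3)
    )


def six_frames(seq: str) -> dict:
    """Return the translations of the 3 sense and 3 reverse complementary frames."""
    seq = seq.upper()
    antisense = reverse_complement(seq)
    frames = {}
    for offset in range(3):
        frames[f"+{offset + 1}"] = translate(seq[offset:])
        frames[f"-{offset + 1}"] = translate(antisense[offset:])
    return frames


# Hypothetical toy sequence (assumption, for illustration only).
frames = six_frames("ATGGCCTAA")
for name in ("+1", "+2", "+3", "-1", "-2", "-3"):
    print(name, frames[name])
```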

[0159]Advantageously, the analysis of the structure of the env domain of HERV-7q, called enverin, made it possible to demonstrate successively:

[0160]an N-terminal signal peptide (region 1-21) and two transmembrane domains (regions 320-340 and 455-477), responsible for interactions with membrane lipid or protein motifs,

[0161]an immunomodulatory motif of the CKS-17 (Haraguchi et al., (1995), 92, 5568-5571)/CKS-25 type. It is possible to note, in this regard, the presence of an RalD motif inside the peptide of the CKS-17/CKS-25 type of HERV-7q and a motif RvaD at position 363 which correspond to the consensus W/RxxD, proposed for the active site of the TGF-βs (Huang et al., J. Biol. Chem., 1997, 272, 27155-27159), potent factors associated with growth, with differentiation and with morphogenesis and which are associated with many human pathological conditions, such as tumor processes (Tang et al., (1998), Nat. Med., 4, 802-807) or neurodegenerative diseases (Flanders et al., (1998), Prog. Neurobiol., 54, 71-85). The peptides according to the invention containing these motifs can advantageously serve as antagonists by inhibiting the attachment of the TGF-βs to their natural receptors,

[0162]N-glycosylation motifs. The glycosylation of the envelope proteins of retroviruses appears to be directly associated with their functional properties, for example by influencing the number of determinants available in the T cells or by promoting recognition of antigens by the T cells. Glycosylation could play a role in the outbreak or the spread of a pathological condition with an autoimmune component. The glycosylations are necessary for maintaining the conformation of certain epitopes, in particular during the production of a recombinant envelope protein so as to develop a diagnostic reagent and to promote the efficacy of a possible vaccine. Positions 171, 210, 216, 236, 244, 283 and 411. Expected number at random: 3.2,

[0163]prenylation sites. Prenylation is an essential mechanism for attachment to the cell membrane and for the targeting of certain proteins. This targeting process could be essential for the production of specific therapeutic agents capable of interfering with the production and regulation of the traffic of cellular complexes calling into play proteins involved in the cell interactions, growth and movement. Positions 188 and 290. Expected number at random: 1.8,

[0164]targeting sites in the endoplasmic reticulum. These sites could make it possible to bring about the targeting toward the endoplasmic reticulum in order to carry out the modifications necessary for promoting membrane crossing. Positions 353 and 431. Expected number at random: 0.2.
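A search for short motifs of this kind in a protein sequence can be sketched with regular expressions. The W/RxxD consensus is taken from the text above; the N-glycosylation pattern used here, N-x-S/T with x other than proline, is a commonly used sequon and is an assumption of this sketch, since the text does not state which pattern was applied. The toy protein string is hypothetical and is not the enverin sequence.

```python
import re


def find_motif(protein: str, pattern: str):
    """Return (1-based position, matched text) for every match of `pattern`,
    including overlapping matches (a lookahead is used for that purpose)."""
    regex = re.compile(f"(?=({pattern}))")
    return [(m.start() + 1, m.group(1)) for m in regex.finditer(protein.upper())]


# W/RxxD consensus from the text; N-x-[ST] sequon with x != P is an assumption.
TGF_BETA_CONSENSUS = "[WR]..D"
N_GLYCOSYLATION = "N[^P][ST]"

# Hypothetical toy protein (assumption, for illustration only).
toy_protein = "MKRALDNGTWRVADNPS"

consensus_hits = find_motif(toy_protein, TGF_BETA_CONSENSUS)
glycosylation_hits = find_motif(toy_protein, N_GLYCOSYLATION)
print(consensus_hits)
print(glycosylation_hits)
```

The positions returned are those of the first residue of each match, which is the same convention as the locations used for the peptides of enverin.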

[0165]Moreover, the inventors have shown that a number of peptides derived from the env protein of HERV-7q (enverin) have a high affinity/half-life for the class I HLA alleles. CADD analysis has made it possible to select candidate peptides, for which the best scores are indicated in Table I:

TABLE I

Location  Sequence    HLA molecule  Score  Sequence No.
399       FLGEECCYYV  A-0201        7214   SEQ ID NO: 68
462       LLFGPCIFNL  A-0201        1792   SEQ ID NO: 69
189       CLPLNFRPYV  A-0201        1453   SEQ ID NO: 70
439       GLLSQWMPWI  A-0201        488    SEQ ID NO: 71
263       CLPSGIFFV   A-0201        5103   SEQ ID NO: 72
444       WMPWILPFL   A-0201        897    SEQ ID NO: 73
252       IRWVTPPTQI  B-2705        3000   SEQ ID NO: 74
432       LRNTGPWGLL  B-2705        2000   SEQ ID NO: 75
158       LRTHTRLVSL  B-2705        2000   SEQ ID NO: 76
316       KRVPILPFVI  B-2705        1800   SEQ ID NO: 77
25        CRCMTSSSPY  B-2705        1000   SEQ ID NO: 78
137       TRVHGTSSPY  B-2705        1000   SEQ ID NO: 79
124       AREKHVKEVI  B-2705        600    SEQ ID NO: 80
478       SRIEAVKLQM  B-2705        600    SEQ ID NO: 81
442       SQWMPWILPF  B-2705        500    SEQ ID NO: 82
405       CYYVNQSGI   Kd            2400   SEQ ID NO: 83
346       FYYKLSQEL   Kd            2400   SEQ ID NO: 84
244       TYTTNSQCI   Kd            2400   SEQ ID NO: 85
291       SFLVPPMTI   Kd            1600   SEQ ID NO: 86
406       YYVNQSGIV   Kd            1200   SEQ ID NO: 87
167       LFNTTLTGL   Kd            1152   SEQ ID NO: 88
463       LFGPCIFNL   Kd            960    SEQ ID NO: 89
253       RWVTPPTQI   Kd            480    SEQ ID NO: 90
449       LPFLGPLAAI  B-5102        2200   SEQ ID NO: 91
3         LPYHIFLFTV  B-5102        1210   SEQ ID NO: 92
331       GALGTGIGGI  B-5102        798    SEQ ID NO: 93
321       LPFVIGAGVL  B-5102        550    SEQ ID NO: 94
499       RRPLDRPAS   B-2705        600    SEQ ID NO: 95
194       FRPYVSIPV   B-2705        600    SEQ ID NO: 96
383       RRALDLLTA   B-2705        600    SEQ ID NO: 97
39        WRNQRPGNI   B-2705        600    SEQ ID NO: 98
423       DRIQRRAEEL  B14           1800   SEQ ID NO: 99
158       LRTHTRLVSL  B14           600    SEQ ID NO: 100
359       ERVADSLVTL  B14           540    SEQ ID NO: 101
463       LFGPCIFNLL  Kd            1658   SEQ ID NO: 102
345       QFYYKLSQEL  Kd            1152   SEQ ID NO: 103
443       QWMPWILPFL  Kd            691    SEQ ID NO: 104
405       CYYVNQSGIV  Kd            500    SEQ ID NO: 105
474       NFVSSRIEAV  Kd            480    SEQ ID NO: 106
221       GPLVSNLEI   B-5102        1320   SEQ ID NO: 107
190       LPLNFRPYV   B-5102        726    SEQ ID NO: 108
449       LPFLGPLAAI  B-5101        1144   SEQ ID NO: 109
488       EPKMQSKTKI  B-5101        968    SEQ ID NO: 110
3         LPYHFLFTV   B-5101        629    SEQ ID NO: 111
125       REKHVKEVI   Kk            1000   SEQ ID NO: 112
312       KPRNKRVPIL  B7            800    SEQ ID NO: 113
378       VVLQNRRAL   Db            792    SEQ ID NO: 114
377       AVVLQNRRAL  Db            660    SEQ ID NO: 115
321       LPFVIGAGV   B-5101        629    SEQ ID NO: 116
304       DLYSYVISK   A3            540    SEQ ID NO: 117
301       TEQDLYSYVI  Kk            500    SEQ ID NO: 118

[0166]Table I gives an estimate of the dissociation half-life of a peptide of enverin bound to an allele of the class I HLA system (calculated from the tables of Parker coefficients: J. Immunol. (1994), 152, 163-175). The location indicates the position, in the enverin sequence, of the first amino acid of each peptide tested. The one-letter code is used for the amino acid sequence. Peptides with scores around 500 or greater were selected. By way of comparison, an analysis was carried out on a concatenation of peptides (polypeptide of 4968 amino acids) reputed to bind the molecules of the class I major histocompatibility complex (Rammensee, Immunogenetics, (1995), 41, 178-228); the ten best scores recorded for nonapeptides and the HLA type A-0201 are respectively 4984, 4047, 2406, 1267, 800, 705, 607, 591, 591 and 577.
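By way of illustration only, the selection rule described above (retaining the peptides whose score is around 500 or greater) can be sketched as follows. The rows are copied from Table I; the default cutoff of 480 is an assumption chosen because it is the lowest score listed in Table I, and the function names are hypothetical.

```python
# A few rows copied from Table I: (location, peptide, HLA molecule, score).
TABLE_I_SAMPLE = [
    (399, "FLGEECCYYV", "A-0201", 7214),
    (439, "GLLSQWMPWI", "A-0201", 488),
    (252, "IRWVTPPTQI", "B-2705", 3000),
    (253, "RWVTPPTQI", "Kd", 480),
    (304, "DLYSYVISK", "A3", 540),
]


def select_by_score(rows, cutoff=480):
    """Keep the rows whose score is >= cutoff, highest score first.

    The cutoff is an assumed reading of "around 500 or greater".
    """
    kept = [row for row in rows if row[3] >= cutoff]
    return sorted(kept, key=lambda row: row[3], reverse=True)


def best_per_allele(rows):
    """Return, for each HLA molecule, the (location, peptide, score) with the top score."""
    best = {}
    for location, peptide, hla, score in rows:
        if hla not in best or score > best[hla][2]:
            best[hla] = (location, peptide, score)
    return best


if __name__ == "__main__":
    for row in select_by_score(TABLE_I_SAMPLE, cutoff=500):
        print(row)
    print(best_per_allele(TABLE_I_SAMPLE))
```

With a strict cutoff of 500, the peptides at positions 439 and 253 (scores 488 and 480) would be excluded, which is why a slightly lower cutoff is assumed here.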

[0167]It can be seen from Table I that some molecules of the class I major histocompatibility complex are capable of binding peptides derived from enverin, which are thus treated like peptides of viral or tumor origin, at the level of the endoplasmic reticulum. The complexes formed at the level of the endoplasmic reticulum are then transported to the cell surface, which causes the destruction of the target cell by the cytotoxic T lymphocytes. The peptides identified generally comprise 8 to 10 amino acids. Studies have shown that some alleles of the class I HLA system are thus associated with certain pathologies, in particular with an autoimmune character, such as HLA-B27 with rheumatoid spondylitis or HLA-B51 with Behcet's disease.

[0168]A peptide capable of binding a particular class I molecule is consequently capable of functioning as a T cell epitope.

[0169]Consequently, the present invention also includes the fragments 399-471 and 244-271 of enverin which advantageously group together several epitopes having high affinity for various haplotypes of the class I HLA system. The use of all or some of these polypeptides is consequently capable of promoting an increase in the T cell repertoire, by allowing better efficacy of the immune response in the context of the various immunotherapeutic, prophylactic or vaccine strategies. These polypeptides may be advantageously delivered for example by the use of viral vectors, viral or synthetic particles, lipopeptides, conventional adjuvants, naked nucleic acids or nucleic acids adsorbed on particles, or liposomes.
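By way of illustration only, the way in which the fragments 399-471 and 244-271 group together several peptides of Table I can be checked with a short sketch. It assumes that locations are 1-based and that fragment boundaries are inclusive, which is consistent with the peptide at position 462 (10 amino acids) ending at 471 and the peptide at position 263 (9 amino acids) ending at 271. The function names are hypothetical.

```python
# Fragment boundaries as given in the text (assumed 1-based, inclusive).
FRAGMENTS = {"399-471": (399, 471), "244-271": (244, 271)}

# (location, peptide) pairs copied from Table I.
PEPTIDES = [
    (399, "FLGEECCYYV"),
    (462, "LLFGPCIFNL"),
    (244, "TYTTNSQCI"),
    (263, "CLPSGIFFV"),
    (252, "IRWVTPPTQI"),
    (189, "CLPLNFRPYV"),
]


def span(location, peptide):
    """Return the (first, last) residue positions covered by a peptide."""
    return location, location + len(peptide) - 1


def fragment_of(location, peptide):
    """Return the name of the fragment that fully contains the peptide, or None."""
    first, last = span(location, peptide)
    for name, (low, high) in FRAGMENTS.items():
        if low <= first and last <= high:
            return name
    return None


if __name__ == "__main__":
    for location, peptide in PEPTIDES:
        print(location, peptide, span(location, peptide), fragment_of(location, peptide))
```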

[0170]For the purposes of the present invention, the peptides may be chemically or biochemically modified; some of the amino acids may be replaced with an analogous amino acid, according to conventional criteria for homologies (A or G; S or T; I, L or V; F, Y or W; N or Q; D or E).
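By way of illustration only, the homology criteria listed above can be expressed as a small lookup. The sketch treats two peptides of equal length as analogous when every position carries either the same amino acid or two amino acids of the same group; this position-by-position reading, the example variant peptide and the function names are assumptions and are not taken from the description.

```python
# Substitution groups copied from the text:
# A or G; S or T; I, L or V; F, Y or W; N or Q; D or E.
GROUPS = ["AG", "ST", "ILV", "FYW", "NQ", "DE"]
CLASS_OF = {aa: index for index, group in enumerate(GROUPS) for aa in group}


def is_analogous(a, b):
    """True if two residues are identical or belong to the same group."""
    if a == b:
        return True
    return a in CLASS_OF and CLASS_OF.get(a) == CLASS_OF.get(b)


def is_homologue(peptide, candidate):
    """True if the candidate differs from the peptide only by analogous residues."""
    if len(peptide) != len(candidate):
        return False
    return all(is_analogous(a, b) for a, b in zip(peptide, candidate))


if __name__ == "__main__":
    reference = "FLGEECCYYV"  # SEQ ID NO: 68 in Table I
    print(is_homologue(reference, "YIAEDCCFWL"))  # hypothetical variant
    print(is_homologue(reference, "FLGEECCYYK"))  # V to K is outside the groups
```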

[0171]The subject of the present invention is also immunogenic or vaccine compositions for protecting against autoimmune diseases, in particular in at-risk subjects, characterized in that they comprise at least one peptide comprising at least one motif of the CKS type and/or at least one peptide consisting of a motif having affinity for one of the haplotypes of the class I or class II HLA system, and a pharmaceutically acceptable vehicle.

[0172]According to an advantageous embodiment of said composition, said motif is selected from the group consisting of peptides, as defined in Table I above.

[0173]According to another advantageous embodiment of said composition, said peptide has the following sequence:

TABLE-US-00008 (SEQ ID NO: 120) sequence CKH: LQNRRALDLLTAERGGTclFLGEECCYYV

[0174]It is remarkable to note at the level of position 380 of the enverin protein, the contiguousness of the motifs of the CKS-17 type (underlined) and of the peptide having the highest score (in bold; see peptide at position 399 in Table I, SEQ ID NO: 68) in the sequence CKH.

[0175]The clonal activation of the subgroups of lymphocytes, for example of cytotoxic lymphocytes, by the peptides in Table I and by extension their homologues, is blocked by conventional immunotherapy means such as for example serotherapy and vaccination.

[0176]The combination of two sequences or of the sequences analogous to the CKH peptide (SEQ ID NO: 120), is capable of causing a synergistic process in the immune response, which could bring into play additional signaling and activation pathways capable of modulating the lymphocyte activation.

[0177]The vaccination relates to the production of antibodies directed against the peptides of Table I, according to the rules of the prior art and according to the methods of release controlled by artificial or cellular implants using a composition as defined above and by using gene therapy means, such as for example expression of nucleic sequences encoding the peptides of Table I. Consequently, the subject of the invention is also immunogenic or vaccine compositions, characterized in that they comprise a vector including at least one nucleic sequence encoding a peptide as defined in Table I, optionally combined with a sequence encoding a motif of the CKS-17 type.

[0178]The serotherapy relates to the use of neutralizing antibodies produced from the peptides of Table I and their homologues.

[0179]The protein products generated by the endogenous retroviral sequences or produced in parallel may be advantageously characterized by micro-methods of analysis and quantification of peptides and proteins: HPLC/FPLC or equivalent, capillary electrophoresis or equivalent, microsequencing techniques (Edman method or equivalent, mass spectrometry and the like).

[0180]The subject of the invention is also antibodies directed against one or more of the peptides described above and their use either for carrying out a method, in particular a differential method, of in vitro detection of the presence of such a sequence in an individual, or for the preparation of a composition capable of being used in serotherapy in neuropathological conditions with an autoimmune component.

[0181]Said antibodies are advantageously polyclonal or monoclonal antibodies obtained by an immunological reaction from a human, mammalian or avian organism or other species toward the proteins, as defined above.

[0182]The subject of the present invention is a method for the differential immunological screening of normal or pathological human endogenous retroviral sequences of the HERV-7q family, characterized in that it comprises bringing a biological sample into contact with an antibody according to the invention, the reading of the result being visualized by an appropriate means, in particular EIA, ELISA, RIA, fluorescence.

[0183]By way of illustration, such an in vitro diagnostic method according to the invention comprises bringing a biological sample collected from a patient into contact with antibodies according to the invention and detecting with the aid of any appropriate method, in particular with the aid of labeled anti-immunoglobulins, the immunological complexes formed between the proteins produced normally or pathologically and the antibodies.

[0184]Monoclonal or polyclonal antibodies, produced from antigens corresponding to synthetic peptides, or recombinant polypeptide or proteins make it possible to monitor the expression of the peptides or proteins produced normally or pathologically. The analysis is preferably carried out by ELISA or equivalent, Western blotting or equivalent, or by immunohistochemistry.

[0185]The peptides or proteins, derived from the endogenous retroviral sequences or whose expression is associated with the expression of these endogenous retroviral sequences, are tested for and identified.

[0186]The subject of the present invention is also a method for the identification and detection of endogenous retroviral motifs which are abnormally expressed in the context of pathological conditions associated with cancer, or of neuropathological conditions, in particular autoimmune neuropathological conditions, at the forefront of which is multiple sclerosis, characterized in that it comprises the comparative analysis of the sequences extracted from a biological sample and the sequences according to the invention.

[0187]The subject of the present invention is also the application of the nucleic sequences or of the protein sequences according to the invention to the diagnosis of, to the prognosis of, to the evaluation of genetic susceptibility to, any induced, congenital or acquired human diseases, in particular those with cancerous, autoimmune and/or neurological components, such as multiple sclerosis, the associated syndromes and the neurodegenerative diseases in which all or part of the nucleic sequences according to the invention and related endogenous or exogenous forms are involved.

[0188]The subject of the present invention is also hybrid nucleic sequences, characterized in that they comprise nucleic sequences or motifs according to the invention, combined with sequences or motifs of endogenous origin or of exogenous origin or induced exogenously.

[0189]The subject of the present invention is, in addition, a recombinant cloning or expression vector, characterized in that it comprises a nucleic sequence in accordance with the invention.

[0190]Therapeutic strategies may be envisaged by using some of the nucleic sequences contained in HERV-7q and the sequences of the same family or deduced polypeptide structures or by the use of peptides or proteins, or of specific antibodies.

[0191]In accordance with the invention, all or part of the endogenous retroviral nucleic sequences of the HERV-7q type may be used as a vector or as vector elements for therapeutic use, in particular the LTR sequences and the gag region (SEQ ID NO: 2, 21 and 22).

[0192]The advantage of such sequences lies in the safety of the vector thus formed, in the possibility of a targeted specific insertion in a well-defined region by a strategy similar to homologous recombination, in cellular targeting, which is optionally transient in the case of a placental expression in women. Another aspect relates to the possibility of combining with the genes of interest the biologically active retroviral motifs (immunomodulatory peptides, as represented in the sequences SEQ ID NO: 68-118, below, fusogenic peptide and the like).

[0193]The subject of the present invention is also transgenic animals, characterized in that they comprise all or part of a sequence of the HERV-7q type (SEQ ID NO: 1-22 and 61).

[0194]Table II below establishes the correspondences between the sequence numbers as they appear in the sequence listing and the name of the various sequences.

TABLE-US-00009 TABLE II

SEQ ID NO:  DESIGNATION
1           Nucleic acid: 7 env
2           Nucleic acid: gag
3           Nucleic acid: HERV-7q
4           Nucleic acid: HE2
5           Nucleic acid: HE3
6           Nucleic acid: HG3
7           Nucleic acid: HE4
8           Nucleic acid: HE5
9           Nucleic acid: HE6
10          Nucleic acid: HG6
11          Nucleic acid: HE7
12          Nucleic acid: HE8
13          Nucleic acid: HG8
14          Nucleic acid: HE9
15          Nucleic acid: HE10
16          Nucleic acid: HE11
17          Nucleic acid: HG11
18          Nucleic acid: HE12
19          Nucleic acid: HG12
20          Nucleic acid: R1
21          Nucleic acid: RIF
22          Nucleic acid + deduced env protein: HERV-7q
23          Fragment of deduced env protein according to SEQ ID NO: 22
24          Fragment of deduced env protein according to SEQ ID NO: 22
25          Fragment of deduced env protein according to SEQ ID NO: 22
26          Protein: enverin
27          Fragment of deduced env protein according to SEQ ID NO: 22
28          Nucleic acid + protein deduced from gag: HERV-7q
29          Fragment of deduced gag protein according to SEQ ID NO: 28
30          Fragment of deduced gag protein according to SEQ ID NO: 28
31          Fragment of deduced gag protein according to SEQ ID NO: 28
32          Fragment of deduced gag protein according to SEQ ID NO: 28
33          Fragment of deduced gag protein according to SEQ ID NO: 28
34          Fragment of deduced gag protein according to SEQ ID NO: 28
35          env protein: reading frame 1
36          gag protein
37          Nucleic acid: G1F (primer)
38          Nucleic acid: G1R (primer)
39          Nucleic acid: G2F (primer)
40          Nucleic acid: G2R (primer)
41          Nucleic acid: G4F (primer)
42          Nucleic acid: G3F (primer)
43          Nucleic acid: G4R (primer)
44          Nucleic acid: G5R (primer)
45          Nucleic acid: E1F (primer)
46          Nucleic acid: E1R (primer)
47          Nucleic acid: E2F (primer)
48          Nucleic acid: E2R (primer)
49          Nucleic acid: E3F (primer)
50          Nucleic acid: E3R (primer)
51          Nucleic acid: E4F (primer)
52          Nucleic acid: E4R (primer)
53          Nucleic acid: E5F (primer)
54          Nucleic acid: E6F (primer)
55          Nucleic acid: E5R (primer)
56          Nucleic acid: ExF (primer)
57          Nucleic acid: ExR (primer)
58          Protein gag
59          Nucleic acid: Sequence A (insertion sequence)
60          Nucleic acid: Sequence B (insertion sequence)
61          Nucleic acid: HE13
62          Nucleic acid: RH7
63          Nucleic acid: RAM75
64          Nucleic acid: RAV73
65          Nucleic acid: RBP3
66          Nucleic acid: HI3
67          Nucleic acid: LTX
68          Peptide Table I
69          Peptide Table I
70          Peptide Table I
71          Peptide Table I
72          Peptide Table I
73          Peptide Table I
74          Peptide Table I
75          Peptide Table I
76          Peptide Table I
77          Peptide Table I
78          Peptide Table I
79          Peptide Table I
80          Peptide Table I
81          Peptide Table I
82          Peptide Table I
83          Peptide Table I
84          Peptide Table I
85          Peptide Table I
86          Peptide Table I
87          Peptide Table I
88          Peptide Table I
89          Peptide Table I
90          Peptide Table I
91          Peptide Table I
92          Peptide Table I
93          Peptide Table I
94          Peptide Table I
95          Peptide Table I
96          Peptide Table I
97          Peptide Table I
98          Peptide Table I
99          Peptide Table I
100         Peptide Table I
101         Peptide Table I
102         Peptide Table I
103         Peptide Table I
104         Peptide Table I
105         Peptide Table I
106         Peptide Table I
107         Peptide Table I
108         Peptide Table I
109         Peptide Table I
110         Peptide Table I
111         Peptide Table I
112         Peptide Table I
113         Peptide Table I
114         Peptide Table I
115         Peptide Table I
116         Peptide Table I
117         Peptide Table I
118         Peptide Table I
119         Nucleic acid: BLIMP-1
120         Peptide: CKH
121         Nucleic acid: F645 (primer)
122         Nucleic acid: PS5D (primer)

[0195]In addition to the preceding arrangements, the invention also comprises other arrangements which will emerge from the description which follows, which refers to exemplary embodiments of the method which is the subject of the present invention as well as to the appended drawings, in which:

[0196]FIG. 1. Human nucleic sequence HERV-7q, whose analysis and treatment make it possible to characterize a novel endogenous retroviral structure. The repeat nucleic regions of type R1 and R2 and the gag, pol and env domains are underlined. The gag and env type domains are in italics. The region homologous to a noncoding 3' portion of Rab7 is double underlined.

[0197]FIG. 2. Map of the human endogenous retroviral region HERV-7q. The upper part of the figure corresponds to an anonymous region of the human genome situated on the long arm of chromosome 7. The repeat domains (1), gag (2), pol (3) and env (4) of HERV-7q can be identified. The C-terminal env region (4.3) is prolonged upstream in the form of a long open reading frame (4.2). The domain 4.1 corresponds to the N-terminal region of the env domain.

[0198]FIG. 3. Comparison of the repeat nucleic sequences situated at the boundaries of HERV-7q. The 5' (top) and 3' (bottom) repeat nucleic regions are compared and the identical bases are indicated by two dots.

[0199]FIG. 4. Deduced sequence having an open reading frame in the env-type domain of HERV-7q according to the longest open reading frame rule.

[0200]FIG. 5. Sequences around the CKS-17 domain identified in various deduced env domains of the HERV-7q family and comparison with reference CKS-17 motifs.

[0201]1) HE2-2) HERV-7q-3) GenBank accession No.: M85205-4) HE7-5) HE9-6) CKS-17; the peptide motif endowed with immunomodulatory properties is underlined-7) gp20 of retrovirus type D (SRV-Pc).

[0202]FIG. 6. Possible deduced sequence of the gag-type domain identified in HERV-7q established according to the longest open reading frame rule. X and / correspond to a non-sense codon and to a reading frame shift, respectively. The underlined sequence corresponds to the beginning of the pol domain.

[0203]FIG. 7. Comparison of the nucleic regions covering the gag region of HERV-7q (top) and HERV-TcR (bottom) and their flanking regions. The identical bases are specified by two dots.

[0204]FIG. 8. Example of nucleic alignments of the env-type domain of HERV-7q with similar env-type domains present in human endogenous retroviral sequences of the same family. The non-sense codons are underlined: 1) HERV-7q-2) HE2-3) HE3-4) HE4.

[0205]FIG. 9. Nucleic alignments between the gag domain of HERV-7q and the corresponding domains belonging to the same family. Comparison with fragments of gag domains isolated from infectious retroviral agents. Sequences of infectious retroviral origin: EMBL database accession No.: 1) A60168-2) A60201-3) A60200-4) A60171. Human endogenous retroviral sequences: 5) HERV-7q-6) HG11-7) HG3. The figures indicated in the endogenous sequences correspond to the number of nucleotides inserted in order to optimize the alignment with the gag-type sequences identified in retroviruses of infectious origin.

[0206]FIG. 10. Alignment of a deduced gag protein motif (top) belonging to an infectious retrovirus (EMBL accession No.: A60200) with the deduced gag protein motif (bottom) identified in HERV-7q. The non-sense codons are in bold and underlined. The identical amino acids are specified by 2 dashes. One dash indicates a deletion or a homologous amino acid.

[0207]FIG. 11. Alignment of an env motif (top) belonging to an infectious retrovirus (EMBL accession No.: A60170) with the env motif (bottom) identified in HERV-7q. The homologous nucleotides are specified by two dots and the deletions by a dash.

[0208]FIG. 12. Comparison between the env domain of HERV-7q (top) and the env domain of HERV-9 (bottom). The 66% homology is limited to the 3' region of the env domain of HERV-7q and HERV-9, respectively between nucleotides 8976 nt and 9500 nt of HERV-7q and nucleotides 2898 nt and 3465 nt of HERV-9 (GenBank accession No.: X57147). Numerous insertions/deletions are also observed.

[0209]FIG. 13. Homology between a portion of the sequence of the transcript encoding RH7 (top, SEQ ID NO: 62) and an RGH2 motif (bottom-GenBank accession No.: D11018).

[0210]FIG. 14. Identification of the sequence of the transcript encoding RAM75 (SEQ ID NO: 63), corresponding to the gene for an ATPase of PEX1 type.

[0211]The coding exons are underlined. The initiation and non-sense codons as well as the putative polyadenylation sites are in bold and underlined. The region in italics corresponds to the beginning of the endogenous retroviral sequence RH7.

[0212]FIG. 15. Sequence of the transcript encoding RAV73 (SEQ ID NO: 64), situated at 0.7 kb downstream of HERV-7q; the nucleic sequences capable of encoding one or more polypeptides are underlined.

[0213]FIG. 16. Comparison between the 3' LTR sequence (top) of HERV-7q and the intron sequence LTX (SEQ ID NO: 67), situated in the FMR2 gene, associated with fragile X (bottom).

[0214]FIG. 17. Detection of modifications in the nucleotide sequence (SEQ ID NO: 3), in patients suffering from MS. The bases modified in at least one patient are underlined. The primers used are in italics (sequences SEQ ID NO: 121 and 122). The initiation ATG and the non-sense codon are in bold.

[0215]FIG. 18. The env coding portion of the HERV-7q sequence (SEQ ID NO: 3), with 3 reading frames.

[0216]FIGS. 19, 20, 21. Separate presentation of the env protein according to the 3 reading frames.

[0217]FIG. 22. Nucleic sequence containing the retroviral sequence RH7 situated in 5' of the HERV-7q sequence. The sequence in italics corresponds to the beginning of the HERV-7q sequence. The RH7 sequence is underlined. Two putative polyadenylation sites are in bold.

[0218]FIG. 23. Sequence of the transcript encoding RBP3 containing nucleotide motifs identified in the nucleic sequence encoding the Blimp-1 gene.

[0219]FIG. 24. Sequence of the transcript encoding APS.

[0220]FIG. 25. Sequence of the transcript encoding Blimp-1; the coding portion is underlined; the initiation and termination codons are in bold.

[0221]FIG. 26. Sequence of the transcript encoding FMR2. The coding portion is underlined. The initiation and non-sense codons are in bold.

[0222]It should be clearly understood, however, that these examples are given solely by way of illustration of the subject of the invention and do not in any manner constitute a limitation thereto.

Example 1

Detection, by Gene Amplification, of a Nucleic Sequence Belonging to a Domain of the gag or env Type According to the Invention, in a Genomic DNA Sample of Human or Mammalian Origin

[0223]The gene amplification is carried out using genomic DNA isolated from blood. An anticoagulant treatment is carried out with 1 ml of a citrate solution (per liter: 4.8 g of citric acid, 13.2 g of sodium citrate, 14.7 g of glucose) per 6 ml of fresh blood. After centrifugation of 20 ml of blood for 15 min at 130 000 g, the supernatant is removed and the fraction enriched with white blood cells is transferred into a new tube and then recentrifuged under the same conditions as above. The fraction enriched with white blood cells is resuspended in an extraction buffer (10 mM Tris-HCl, 0.1 M EDTA, 20 μg/ml of pancreatic RNAse treated so as to eliminate the DNAses, 0.5% SDS, pH 8.0), and then incubated for 1 hour at 37° C. Proteinase K is added at a final concentration of 100 μg/ml. The suspension of lysed cells is incubated at 50° C. for 3 hours, with occasional stirring, and then treated with an equal volume of phenol equilibrated with 0.5 M Tris-HCl, pH 8.0. The emulsion formed is placed on a wheel for one hour and then centrifuged at 5 000 g for 15 min at room temperature. The aqueous solution is deproteinized by a triple phenol extraction in order to obtain a level of purification corresponding to a final A260/A280 absorbance ratio greater than 1.75. The aqueous fraction is precipitated with 0.2 vol. of 10 M sodium acetate and 2 vol. of ethanol. The DNA is then either collected with the tip of a bent Pasteur pipette, or centrifuged at 5 000 g for 5 min at room temperature. The DNA or the DNA pellet is washed twice with 70% ethanol and then taken up in 1 ml of TE, pH 8.0 so as to be eluted, with gentle stirring, for 12 to 24 hours.
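By way of illustration only, the proportions stated in the extraction protocol above (1 ml of citrate solution per 6 ml of blood, precipitation with 0.2 vol. of sodium acetate and 2 vol. of ethanol, and an A260/A280 ratio greater than 1.75) can be sketched as simple helper calculations. The function names and the absorbance readings in the usage example are hypothetical.

```python
def citrate_volume_ml(blood_ml):
    """Volume of citrate solution for a given blood volume (1 ml per 6 ml)."""
    return blood_ml / 6.0


def precipitation_volumes_ml(aqueous_ml):
    """Return (sodium acetate, ethanol) volumes: 0.2 vol. and 2 vol. of the aqueous fraction."""
    return 0.2 * aqueous_ml, 2.0 * aqueous_ml


def is_pure_enough(a260, a280, min_ratio=1.75):
    """True if the A260/A280 ratio exceeds the threshold given in the protocol."""
    if a280 <= 0:
        raise ValueError("A280 must be a positive absorbance reading")
    return a260 / a280 > min_ratio


if __name__ == "__main__":
    print(citrate_volume_ml(20))          # citrate needed for 20 ml of blood
    print(precipitation_volumes_ml(5.0))  # for a hypothetical 5 ml aqueous fraction
    print(is_pure_enough(0.90, 0.50))     # hypothetical readings, ratio 1.8
```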

[0224]Oligonucleotides specific for the endogenous sequences described according to the invention are chosen in order to amplify the gag or env region of the endogenous retroviral regions described according to the invention. The genomic DNA studied is obtained from patients having pathological conditions such as multiple sclerosis and from individuals reputed to be healthy.

[0225]The thermostable DNA polymerases used were chosen for their high accuracy during the amplification process, such as Vent DNA polymerase (Biolabs) and the like, and are used according to the conditions recommended by the supplier.

[0226]The amplification strategy uses, depending on the case, a simple PCR, or a nested or seminested PCR.

[0227]Oligonucleotides used to amplify the gag region:

[0228]primer G1F, sense, located in the region upstream of the gag domain of HERV-7q (SEQ ID NO: 37),

[0229]primer G1R, antisense, located in the 3' terminal region of the gag domain (SEQ ID NO: 38).

[0230]The fragment of 1505 nt amplified by the pair G1F-G1R is used to generate the probes capable of hybridizing with the various PCR amplification products.

[0231]primer G2F, sense nested (SEQ ID NO: 39),

[0232]primer G2R, antisense nested (SEQ ID NO: 40),

[0233]primer G4F, sense nested (SEQ ID NO: 41),

[0234]primer G3F, sense nested (SEQ ID NO: 42),

[0235]primer G4R, antisense nested (SEQ ID NO: 43),

[0236]primer G5R, antisense nested (SEQ ID NO: 44).

[0237]Oligonucleotides used to amplify the env region of HERV-7q:

[0238]primer E1F, sense (SEQ ID NO: 45),

[0239]primer E1R, antisense (SEQ ID NO: 46).

[0240]The fragment of 2529 nt amplified by the pair of primers E1F-E1R is used to generate the probes capable of hybridizing with the various PCR amplification products.

[0241]primer E2F, sense (SEQ ID NO: 47),

[0242]primer E2R, antisense (SEQ ID NO: 48),

[0243]primer E3F, sense (SEQ ID NO: 49),

[0244]primer E3R, antisense (SEQ ID NO: 50),

[0245]primer E4F, sense (SEQ ID NO: 51),

[0246]primer E4R, antisense (SEQ ID NO: 52),

[0247]primer E5F, sense (SEQ ID NO: 53),

[0248]primer E6F, sense (SEQ ID NO: 54),

[0249]primer E5R (SEQ ID NO: 55),

[0250]primer ExF (SEQ ID NO: 56),

[0251]primer ExR (SEQ ID NO: 57).

[0252]The PCR is carried out using 50 to 200 ng of genomic DNA, under the conditions recommended by the supplier. Each amplification is carried out in a volume of 50 μl with the following cycle: denaturation at 94° C. for 1 min, hybridization at 70° C. for 1 min, and extension at 72° C. for 1 to 2 min, depending on the amplified fragments. After 35 cycles, a final extension is carried out at 72° C. for 10 min. Automated sequencing of the amplified samples is carried out with the aid of an Applied Biosystems type ABI 377 sequencer or another comparable model, according to the protocols provided by the manufacturer.
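By way of illustration only, the cycling program described above can be written down as data and its nominal duration computed. The sketch counts only the hold times stated in the text (35 cycles of 1 min, 1 min and 1 to 2 min, followed by 10 min at 72° C.) and ignores temperature ramping, so the real run time on a given thermocycler would be longer. The function name is hypothetical.

```python
# Hold temperatures in degrees Celsius, as stated in the protocol.
DENATURATION_C = 94
HYBRIDIZATION_C = 70
EXTENSION_C = 72


def pcr_program_minutes(extension_min, cycles=35, denaturation_min=1.0,
                        hybridization_min=1.0, final_extension_min=10.0):
    """Nominal program duration in minutes, excluding ramp times."""
    if not 1.0 <= extension_min <= 2.0:
        raise ValueError("the protocol specifies an extension of 1 to 2 min")
    per_cycle = denaturation_min + hybridization_min + extension_min
    return cycles * per_cycle + final_extension_min


if __name__ == "__main__":
    print(pcr_program_minutes(1.0))  # shortest program described
    print(pcr_program_minutes(2.0))  # longest program described
```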

[0253]In the case of a nested or seminested PCR, the same experimental conditions are used, the only difference being that the genomic DNA sequence is replaced with 5 to 10 μl of the amplification product derived from the first PCR.

[0254]Two independent amplifications are carried out using the same sample. A control reaction is carried out by replacing the DNA sample with water in order to detect possible contaminants.

Example 2

Detection, by Gene Amplification, of a Nucleic Sequence According to the Invention in a Biological Sample of Genomic DNA Collected from Patients having an Existing Candidate Pathological Condition or Suspected of having this Pathological Condition

[0255]The amplification protocol is the same as in Example 1, apart from the origin of the sample which is obtained from patients having a candidate pathological condition. A genomic DNA sample reputed to be normal is systematically integrated into the set of amplified pathological samples and then analyzed.

[0256]The PCR products are separated on a 1.5% agarose gel and then transferred in the presence of 0.4 N sodium hydroxide onto a charged nylon membrane. Hybridization is carried out with a specific probe corresponding to the PCR fragments amplified either with the pair G1F-G1R or the pair E1F-E1R. The probe is labeled by incorporating dUTP-digoxigenin according to the supplier's protocol (Boehringer Mannheim). The hybridization is carried out in a hybridization buffer (5×SSC, 50% formamide, 0.1% lauroylsarcosine, 0.02% SDS, 2% blocking reagent Boehringer) overnight at 42° C. The Southern blot is washed twice for 5 min at room temperature in a 2×SSC solution containing 0.1% SDS. Next, a high stringency wash is carried out twice for 15 min at 55° C. in a 0.1×SSC solution containing 0.1% SDS. The hybridization is visualized according to the supplier's protocol (Boehringer Mannheim), in the presence of a chemiluminescent substrate for alkaline phosphatase, of the CSPD or CDP-STAR type. The filter is visualized after a 15 min exposure at 60° C.

[0257]SSCP (single strand conformation polymorphism) analysis makes it possible to detect discrete modifications of the sequence of the fragments amplified by PCR. The PCR is carried out in the presence of dCTP labeled with 32P. The sample to be analyzed is denatured at 95° C. for 10 min in the presence of loading buffer, and then immediately loaded onto a 10% polyacrylamide gel containing 7.5% glycerol. The migration is carried out at 4° C. at 8-10 W. The gel is dried and then autoradiographed.

[0258]The PCR fragments likely to exhibit an alteration of their nucleotide sequence are sequenced according to Example 1.

[0259]Hybridization with the aid of a specific oligonucleotide (17-mers to 20-mers) corresponding to the modified nucleotide region makes it possible to identify the samples having an identical modification (ASO method). Briefly, the Southern blot is hybridized with an oligonucleotide which is distally labeled either with 32P, or in the presence of digoxigenin (according to the Boehringer Mannheim protocol) and then washed under stringent conditions at 65° C. in a 6×SSC solution containing 0.05% sodium pyrophosphate.

[0260]For example, an automated nucleotide sequencing was carried out on six PCR fragments obtained from 5 patients suffering from MS and a control reputed to be normal, and which were amplified using the primers F645: CTTCAAACAACAACCAGGAGG (SEQ ID NO: 121) (situated 26 nucleotides upstream of the initiation methionine of enverin) and PS5D: TTGGGGAGGTTGGCCGACGA (SEQ ID NO: 122) (situated 6 nucleotides downstream of the non-sense codon of enverin). Modifications of the sequence of enverin were observed on the DNA from some patients (FIG. 17).
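By way of illustration only, basic properties of the two primers quoted above (length, G+C content, reverse complement) can be computed with a short sketch. The primer sequences are copied from the text; the function names are hypothetical, and no claim is made here about the orientation or the melting temperature of either primer.

```python
# Primer sequences copied from the text (SEQ ID NO: 121 and 122).
F645 = "CTTCAAACAACAACCAGGAGG"
PS5D = "TTGGGGAGGTTGGCCGACGA"

COMPLEMENT = {"A": "T", "T": "A", "G": "C", "C": "G"}


def gc_count(sequence):
    """Number of G and C bases in a DNA sequence."""
    return sum(1 for base in sequence.upper() if base in "GC")


def gc_fraction(sequence):
    """Fraction of G and C bases in a DNA sequence."""
    return gc_count(sequence) / len(sequence)


def reverse_complement(sequence):
    """Reverse complement of a DNA sequence written 5' to 3'."""
    return "".join(COMPLEMENT[base] for base in reversed(sequence.upper()))


if __name__ == "__main__":
    for name, primer in (("F645", F645), ("PS5D", PS5D)):
        print(name, len(primer), round(gc_fraction(primer), 2), reverse_complement(primer))
```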

Example 3

Detection of a Protein According to the Invention in a Biological Sample

[0261]Preparation of a Purified Protein Fraction of Cerebrospinal Fluid from Patients Suffering from MS

[0262]After a treatment at 56° C. for 30 min and removal of the immunoglobulins on a HiTrap protein G column (Pharmacia), the equivalent of 10 ml of CSF is deposited on a DEAE Sepharose CL-6B column (Pharmacia). The elution is carried out in 20 mM Tris-HCl, pH 8.8, with a gradient from 0 to 0.4 M NaCl, and then the fraction is dialyzed twice against a phosphate-NaCl buffer (PBS). After concentration on Ultrafree-MC (Millipore), the fraction is deposited on a Superose 12 column (FPLC Pharmacia) and eluted in the presence of PBS. After separation by polyacrylamide-SDS gel electrophoresis and electrotransfer onto an Immobilon-P membrane (Millipore), the protein bands are subjected to controlled trypsin hydrolysis.

[0263]Analysis of the Protein Fraction by Mass Spectrometry

[0264]The peptides digested in the presence of trypsin are analyzed by the MALDI-TOF method, which allows the analysis of peptides present in a mixture (COTTRELL J. S., Pept. Res., 1997, 7, 115-124). The peptides characterized according to their mass are compared with the proteins and with the associated proteins according to the invention.
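By way of illustration only, the comparison of measured peptide masses with the sequences according to the invention can be sketched as an in silico trypsin digestion followed by a mass lookup. The sketch assumes the conventional trypsin rule (cleavage after K or R, except before P), complete digestion, unmodified residues and standard monoisotopic residue masses; none of these details is specified in the description. The input peptide is SEQ ID NO: 113 from Table I, used only as a short toy sequence; the observed mass, the tolerance and the function names are hypothetical.

```python
# Standard monoisotopic residue masses (Da).
MONO = {
    "G": 57.02146, "A": 71.03711, "S": 87.03203, "P": 97.05276,
    "V": 99.06841, "T": 101.04768, "C": 103.00919, "L": 113.08406,
    "I": 113.08406, "N": 114.04293, "D": 115.02694, "Q": 128.05858,
    "K": 128.09496, "E": 129.04259, "M": 131.04049, "H": 137.05891,
    "F": 147.06841, "R": 156.10111, "Y": 163.06333, "W": 186.07931,
}
WATER = 18.01056


def tryptic_digest(sequence):
    """Cleave after K or R unless the next residue is P (no missed cleavages)."""
    fragments, start = [], 0
    for i, residue in enumerate(sequence):
        following = sequence[i + 1] if i + 1 < len(sequence) else ""
        if residue in "KR" and following != "P":
            fragments.append(sequence[start:i + 1])
            start = i + 1
    if start < len(sequence):
        fragments.append(sequence[start:])
    return fragments


def peptide_mass(peptide):
    """Monoisotopic mass of a neutral, unmodified peptide."""
    return sum(MONO[residue] for residue in peptide) + WATER


def match_masses(observed, peptides, tolerance=0.5):
    """Map each observed mass to the peptides whose mass lies within the tolerance."""
    return {
        mass: [p for p in peptides if abs(peptide_mass(p) - mass) <= tolerance]
        for mass in observed
    }


if __name__ == "__main__":
    fragments = tryptic_digest("KPRNKRVPIL")
    print(fragments)
    print(match_masses([174.11], fragments, tolerance=0.01))
```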

Example 4

Detection of Specific Antibodies to the env Domain of HERV-7q

[0265]The identification of a long open reading frame in the env sequence of HERV-7q made it possible to determine a deduced protein sequence (SEQ ID NO: 22 and 35 and FIGS. 18-20) for a region of said gene.

[0266]The protein sequences deduced from SEQ ID NO: 22 and 35 and FIGS. 18-20 are positioned as follows with respect to FIG. 1 or SEQ ID NO: 3:

[0267]SEQ ID NO: 22 (reading frame 1) and FIG. 19: beginning of the coding sequence: position 7874, end of the coding sequence 1st nonsense codon (position 9493)

[0268]SEQ ID NO: 35: beginning of the coding sequence: position 7874, end of the coding sequence 1st nonsense codon (position 9493) (reading frame 1)

[0269]FIG. 19: beginning of the coding sequence: position 6970, end of the coding sequence 1st nonsense codon (position 9493) (reading frame 1)

[0270]FIG. 20: beginning of the coding sequence: position 6971, the end of the reading frame is shifted depending on the case by 1, 2 or 3 codons

[0271]FIG. 21: beginning of the coding sequence: position 6972, the end of the reading frame is shifted depending on the case by 1, 2 or 3 codons
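By way of illustration only, the positioning described above (a coding sequence that begins at a given position and ends at the first nonsense codon in the same reading frame) can be sketched as a scan over successive codons. Starting positions that differ by one nucleotide, such as 6970, 6971 and 6972, correspond to the three reading frames. The toy sequence is hypothetical and is not taken from SEQ ID NO: 3; positions are assumed to be 1-based, and the position returned is that of the first base of the nonsense codon, which may not be the convention used in the description.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def first_stop(sequence, start_position):
    """Return the 1-based position of the first in-frame nonsense codon, or None.

    start_position is the 1-based position of the first base of the first codon.
    """
    index = start_position - 1
    while index + 3 <= len(sequence):
        if sequence[index:index + 3].upper() in STOP_CODONS:
            return index + 1
        index += 3
    return None


def codons_before_stop(sequence, start_position):
    """Number of sense codons read before the first in-frame nonsense codon."""
    stop = first_stop(sequence, start_position)
    if stop is None:
        return None
    return (stop - start_position) // 3


if __name__ == "__main__":
    toy = "ATGGCCTGATAAGGT"  # hypothetical sequence for demonstration
    for frame_start in (1, 2, 3):
        print(frame_start, first_stop(toy, frame_start))
```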

[0272]Various peptides corresponding to all or part of SEQ ID NO: 22 (see SEQ ID NO: 23-27 and 35) were synthesized by genetic engineering in order to test their antigenic specificity toward sera or tissues from patients suffering from MS, for example. Briefly, all or part of the env region of HERV-7q is subcloned into the vectors pQE30, 31 and 32. The vectors pQE30, 31 and 32 contain, in 5' of the multiple cloning site, the consensus sequences for transcription (the strong T5 bacteriophage promoter, 2 operators of the lactose operon) and translation (one synthetic ribosome binding site). Likewise, pQE30, 31 and 32 possess, in 3', the phage lambda transcription terminator as well as a stop codon for translation. The expression of the protein is carried out after transformation in E. coli M15. The plasmids pQE30, 31 and 32 possess, upstream of the multiple cloning site, the coding sequence for a succession of 6 histidines having affinity for nickel ions. This stretch allows the purification of the expressed chimeric protein by adsorption on a resin consisting of a chelating ligand, nitrilotriacetic acid (NTA), charged with 4 nickel ions (Ni-NTA resin, Qiagen).

[0273]The transformation is carried out by electroporation or treatment with calcium chloride. For example, an E. coli M15 colony is incubated in 100 ml of LB medium containing 250 μg of kanamycin, with stirring at 37° C. until an OD600 of 0.5 is obtained. After centrifugation for 5 minutes at 2000 g at 4° C., the bacterial pellet is taken up in 30 ml of TFB1 solution (100 mM rubidium chloride, 50 mM manganese chloride, 30 mM potassium acetate, 10 mM CaCl2, 15% glycerol, pH 5.8), at 4° C. for 90 minutes. After a centrifugation of 5 minutes at 2000 g at 4° C., the bacterial pellet is taken up in 4 ml of TFB2 solution (10 mM rubidium chloride, 10 mM MOPS, 75 mM CaCl2, 15% glycerol, pH 8). The cells may be kept at -70° C. in aliquots of 500 μl. 20 μl of the ligation and 125 μl of competent cells are mixed and placed on ice for 20 minutes. After a heat shock at 42° C. for 90 seconds, the cells are stirred for 90 minutes at 37° C. in 500 μl of Psi-broth medium (LB medium supplemented with 4 mM MgSO4, 10 mM potassium chloride). The transformed cells are plated on LB-agar dishes supplemented with 25 μg/ml of kanamycin and 100 μg/ml of ampicillin, and the dishes are incubated overnight at 37° C.

[0274]The potentially recombinant clones are sub-cultured in an orderly manner on a nylon filter deposited on an LB-agar dish supplemented with 25 μg/ml of kanamycin and 100 μg/ml of ampicillin. After one night at 37° C., the recombinant clones are located by hybridization of the plasmid DNA with the nucleotide probe amplified by PCR with the pair of primers according to SEQ ID NO: 45 and SEQ ID NO: 46.

[0275]An independent colony containing the insert is inoculated into 20 ml of LB medium supplemented with 25 μg/ml of kanamycin and 100 μg/ml of ampicillin. After one night at 37° C., with stirring, 500 ml of the same medium are inoculated at 1/50 with this preculture and incubated until an OD600 of 0.8 is obtained, and then IPTG is added to a final concentration of 1 to 2 mM. After 5 hours, the cells are centrifuged for 20 minutes at 4 000 g.
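
For illustration, the quantities implied by this induction step can be worked out with simple dilution arithmetic. The concentration of the IPTG stock is an assumption made for the example (the text does not specify one), and the helper name is hypothetical.

```python
# Illustrative arithmetic only; the IPTG stock concentration is an
# assumption, not a value given in the text.

def stock_volume_ml(final_conc: float, stock_conc: float,
                    final_volume_ml: float) -> float:
    """Volume of stock giving `final_conc` in `final_volume_ml`
    (C1 * V1 = C2 * V2, concentrations in the same unit)."""
    return final_conc * final_volume_ml / stock_conc


CULTURE_ML = 500.0

# 1/50 inoculation of the 500 ml culture with the overnight preculture.
preculture_ml = CULTURE_ML / 50

# IPTG at 1 to 2 mM final, from a hypothetical 1 M (1000 mM) stock.
IPTG_STOCK_MM = 1000.0  # assumption
iptg_low_ml = stock_volume_ml(1.0, IPTG_STOCK_MM, CULTURE_ML)
iptg_high_ml = stock_volume_ml(2.0, IPTG_STOCK_MM, CULTURE_ML)

# Antibiotic mass for the culture: ug/ml times ml, converted to mg.
kanamycin_mg = 25 * CULTURE_ML / 1000
ampicillin_mg = 100 * CULTURE_ML / 1000

print(f"preculture: {preculture_ml} ml")
print(f"IPTG stock: {iptg_low_ml} to {iptg_high_ml} ml")
print(f"kanamycin: {kanamycin_mg} mg, ampicillin: {ampicillin_mg} mg")
```

The 20 ml preculture described above therefore covers the 1/50 inoculation of a 500 ml culture.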

[0276]A portion of the cellular pellet is taken up in 5 ml of sonication buffer (50 mM of sodium phosphate, pH 7.8, 300 mM NaCl) and then placed on ice. After rapid sonication, the cells are centrifuged for 20 minutes at 10 000 g. A portion of the cellular pellet is taken up in 10 ml of a 30 mM Tris/HCl-20% sucrose solution pH 8. The cells are incubated for 5 to 10 minutes, with stirring, after addition of 1 mM EDTA. After a centrifugation of 10 minutes at 8 000 g at 4° C., the pellet is taken up in 10 ml of ice-cold 5 mM MgSO4. After 10 minutes on ice, with stirring, the cells are centrifuged for 10 minutes at 8 000 g at 4° C.

[0277]The pellet is taken up at 5 ml/g in buffer A (6 M GuHCl (guanidine hydrochloride), 0.1 M sodium phosphate, 0.01 M Tris/HCl, pH 8) for 1 hour at room temperature. The lysate is centrifuged for 15 minutes at 10 000 g at 4° C., and the supernatant is supplemented with 8 ml of Ni-NTA resin, pre-equilibrated in buffer A. After 45 minutes at room temperature, the resin is poured into a column, washed with 10 column volumes of buffer A and then with 5 column volumes of buffer B (8 M urea, 0.1 M sodium phosphate, 0.01 M Tris/HCl, pH 8). The column is washed with buffer C (8 M urea, 0.1 M sodium phosphate, 0.01 M Tris/HCl, pH 6.3) until A280 is less than 0.01. The recombinant protein is eluted with 10 to 20 ml of buffer D (8 M urea, 0.1 M sodium phosphate, 0.01 M Tris/HCl, pH 5.9), then with 10 to 20 ml of buffer E (8 M urea, 0.1 M sodium phosphate, 0.01 M Tris/HCl, pH 4.5), and then with 20 ml of buffer F (6 M GuHCl, 0.2 M acetic acid). After SDS-PAGE analysis, the purified fraction(s) containing the chimeric protein are used for the production of antibodies in rabbits. The antibodies obtained are tested by Western blotting after visualization with a secondary antibody coupled to alkaline phosphatase.
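
The text does not explain why the wash and elution buffers step down in pH. One common reading, offered here as an assumption rather than as part of the procedure, is that the histidine side chains become protonated as the pH falls and then no longer coordinate the nickel ions. The sketch below applies the Henderson-Hasselbalch relation with an assumed side-chain pKa of about 6.0; the actual pKa within a given protein may differ.

```python
# Illustrative sketch only. The pKa value is an assumption (a commonly
# quoted approximate value for the histidine side chain); it is not
# given in the text.

def fraction_protonated(ph: float, pka: float = 6.0) -> float:
    """Fraction of a titratable group in its protonated form at a given
    pH, from the Henderson-Hasselbalch relation."""
    return 1.0 / (1.0 + 10 ** (ph - pka))


# pH values of the urea buffers used on the column, as listed above.
BUFFER_PH = {"B": 8.0, "C": 6.3, "D": 5.9, "E": 4.5}

for name, ph in BUFFER_PH.items():
    print(f"buffer {name} (pH {ph}): "
          f"{fraction_protonated(ph):.2f} protonated")
```

Under this assumption the protonated fraction rises at each step from buffer B to buffer E, which is consistent with washing at pH 8 and 6.3 and eluting at pH 5.9 and 4.5.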

[0278]Antibodies are obtained in the same manner, using peptides synthesized chemically according to the Merrifield technique (G. Barany and B. Merrifield, 1980, in The peptides, 2, 1-284, E. Gross and J. Meienhofer, Academic Press, New York).

[0279]The specific antibodies obtained are used for detection of the serum or tissue expression of all or part of the endogenous retroviral sequences according to the invention, in normal and pathological cases.

[0280]The proteins of serum or tissue origin are separated on acrylamide-SDS gel and then transferred onto a nitrocellulose filter with the aid of a Novablot 2117-2250 apparatus (LKB). The transfer is carried out on a Hybond C-extra sheet (Amersham) using a 100 mM CAPS buffer pH 11, methanol, water (V/V/V:1/1/8) containing 1 mM CaCl2. After a transfer of 1 hour at 0.8 mA/cm2, the sheet is saturated for 1 hour at room temperature in PBS-0.5% gelatin. The sheet is brought into contact with the specific antibody at a dilution of 1/1 000 in PBS-0.25% gelatin. After 2 hours, the filter is washed 3 times for 15 minutes in PBS-0.1% Tween-20, and then the filter is incubated for 30 minutes in the presence of a secondary antibody coupled to alkaline phosphatase (Promega), diluted 1/7 500 in PBS-0.25% gelatin. After three washes in PBS-0.1% Tween-20, the filter is equilibrated in a buffer (100 mM Tris-HCl, pH 9.5, 100 mM NaCl, 5 mM MgCl2). The visualization is carried out in the presence of 45 μl of NBT at 75 mg/ml and 35 μl of BCIP at 50 mg/ml, per 10 ml of alkaline phosphatase buffer.
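
As a worked illustration of the dilutions and substrate quantities given above, the following sketch computes the volume of antibody needed for a given incubation volume and the approximate final concentrations of NBT and BCIP. The 10 ml incubation volume for the antibodies is an assumption made for the example, and the helper names are hypothetical.

```python
# Illustrative arithmetic only; the antibody incubation volume is an
# assumption, not a value given in the text.

def antibody_volume_ul(dilution_factor: float, final_volume_ml: float) -> float:
    """Volume of antibody (ul) for a 1/dilution_factor dilution."""
    return final_volume_ml * 1000.0 / dilution_factor


def final_conc_mg_per_ml(stock_mg_per_ml: float, stock_volume_ul: float,
                         buffer_volume_ml: float) -> float:
    """Approximate final concentration, neglecting the small volume
    contributed by the stock itself."""
    return stock_mg_per_ml * (stock_volume_ul / 1000.0) / buffer_volume_ml


INCUBATION_ML = 10.0  # assumption

primary_ul = antibody_volume_ul(1000, INCUBATION_ML)    # 1/1 000
secondary_ul = antibody_volume_ul(7500, INCUBATION_ML)  # 1/7 500

# Substrates per 10 ml of alkaline phosphatase buffer, as stated above.
nbt_mg_per_ml = final_conc_mg_per_ml(75, 45, 10)
bcip_mg_per_ml = final_conc_mg_per_ml(50, 35, 10)

print(f"primary antibody: {primary_ul:.1f} ul")
print(f"secondary antibody: {secondary_ul:.2f} ul")
print(f"NBT: {nbt_mg_per_ml:.4f} mg/ml, BCIP: {bcip_mg_per_ml:.3f} mg/ml")
```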

[0281]The chimeric proteins obtained by genetic engineering are also used for tests of biological activity, such as for example the test for biological activity of the CKS-17-type peptide identified in the env domain of HERV-7q (FIG. 5).

Example 5

Production of Ribonucleic Probes Encoding the env Sequences of HERV-7q

[0282]The PCR fragments obtained are subcloned into the plasmid pGEM-4Z (Promega), which possesses, on either side of its multiple cloning site, promoter sequences for the SP6 and T7 RNA polymerases.

[0283]The transformation method used is electroporation. The plasmid and the PCR fragment are mixed in a ratio of 50 ng of vector (SmaI cleavage) to 100 ng of PCR fragment (made blunt ended by treatment with the Klenow fragment of DNA polymerase). The incubation takes place overnight at 22° C. in ligation buffer (66 mM Tris-HCl, pH 7.5, 5 mM MgCl2, 1 mM dithioerythritol, 1 mM ATP) in the presence of 1 U of T4 DNA ligase and is then stopped by denaturation for 10 minutes at 65° C. In parallel, the E. coli JM 105 strain is cultured overnight at 37° C. in LB medium. This preculture is diluted 1/500 and placed at 37° C. until an OD600 equal to 1 is obtained. For the remainder of the procedure, the cells are always kept cold. After centrifugation for 5 minutes at 3 500 g at 4° C., the cellular pellet is resuspended in 1/4 vol. of ultra-pure ice-cold water. This step is repeated 5 to 6 times. The pellet is then resuspended in 1/4 000 vol. of water; 10% of sterile glycerol is added, allowing preservation of the electrocompetent cells, in aliquots of 10 μl at 20° C. 1 μl of the ligation is added to 50 μl of electrocompetent cells; the mixture is subjected to an electrical discharge of 12.5 kV/cm, applied for 5.8 ms. The cells are rapidly resuspended in the SOC medium, incubated for 1 hour at 37° C. and then plated in the presence of 2% X-Gal in dimethylformamide, and 10 mM IPTG, on an LB-agar dish supplemented with ampicillin (100 μg/ml). After one night at 37° C., the potentially recombinant white clones are subcultured in an orderly manner on an LB/ampicillin dish and in parallel on a nylon filter deposited on an LB/ampicillin dish. These two dishes are incubated overnight at 37° C. The recombinant clones are then located by hybridization with a nucleic probe amplified by PCR with the pair of primers according to SEQ ID NO: 45 and SEQ ID NO: 46 and labeled with digoxigenin.
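
For illustration, two quantities implied by this paragraph can be derived by simple arithmetic: the insert-to-vector molar ratio corresponding to the stated masses, and the voltage corresponding to the stated field strength. The vector length, insert length and cuvette gap used below are hypothetical placeholder values chosen for the example; none of them is given in the text.

```python
# Illustrative arithmetic only. Fragment lengths and cuvette gap are
# hypothetical placeholders, not values from the text.

def insert_to_vector_molar_ratio(insert_ng: float, insert_bp: int,
                                 vector_ng: float, vector_bp: int) -> float:
    """Molar ratio of insert to vector; for double-stranded DNA the
    number of moles is proportional to mass divided by length."""
    return (insert_ng / insert_bp) / (vector_ng / vector_bp)


def pulse_voltage_kv(field_kv_per_cm: float, gap_cm: float) -> float:
    """Voltage to set for a given field strength and electrode gap."""
    return field_kv_per_cm * gap_cm


# Masses from the text; lengths are placeholders.
ratio = insert_to_vector_molar_ratio(insert_ng=100, insert_bp=1000,
                                     vector_ng=50, vector_bp=3000)

# Field strength from the text; 0.2 cm gap is a placeholder.
voltage = pulse_voltage_kv(12.5, 0.2)

print(f"insert:vector molar ratio = {ratio:.1f}")
print(f"pulse voltage = {voltage:.2f} kV")
```

With a shorter insert the same masses give a higher molar excess of insert, so the ratio should be recomputed for the actual fragment lengths.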

[0284]The recombinant clones are cultured in 50 ml of LB/ampicillin medium (100 μg/ml), with stirring, overnight at 37° C. After centrifugation at 3 500 g for 15 minutes at 4° C., the bacterial pellet is taken up in 4 ml of P1 buffer (50 mM Tris-HCl, 10 mM EDTA, 400 μg/ml RNase A, pH 8) and 4 ml of P2 buffer (200 mM NaOH, 1% SDS). The medium is incubated at room temperature for 5 minutes. After addition of 4 ml of P3 buffer (2.55 M potassium acetate, pH 4.8), the mixture is centrifuged at 12 000 g for 30 minutes at 4° C. The supernatant is applied to a Qiagen type 100 column, pre-equilibrated with 2 ml of QBT buffer (750 mM NaCl, 50 mM MOPS, 15% ethanol, pH 7); the column is washed twice with 4 ml of QC buffer (1 M NaCl, 50 mM MOPS, 15% ethanol, pH 7) and the DNA is eluted with 2 ml of QF buffer (1.2 M NaCl, 50 mM MOPS, 15% ethanol, pH 8). The DNA is precipitated with 0.8 vol. of isopropanol and centrifuged at 12 000 g at 4° C. for 30 minutes. The pellet is washed with ice-cold 70% ethanol and then the plasmid DNA is taken up in twice 150 μl of TE buffer.

[0285]The ribonucleic probes are used as specific probes, in particular for the detection of the transcripts expressed by the endogenous retroviral sequences according to the invention.
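
As an illustration of the relationship between a cloned insert and a probe able to hybridize to its transcript, the sketch below derives the antisense RNA sequence (the reverse complement of the sense strand, with uracil in place of thymine). The function name and the toy sequence are hypothetical; which of the two promoters yields the antisense strand depends on the orientation of the insert in the plasmid, which is not specified here.

```python
# Illustrative sketch only: the helper name and toy sequence are
# hypothetical and are not taken from the sequence listing.

_COMPLEMENT = str.maketrans("acgt", "tgca")


def antisense_riboprobe(sense_dna: str) -> str:
    """Return the RNA sequence (5' to 3') complementary to the transcript
    of `sense_dna`, given as the sense strand written 5' to 3'."""
    reverse_complement = sense_dna.lower().translate(_COMPLEMENT)[::-1]
    return reverse_complement.replace("t", "u")


print(antisense_riboprobe("atggcc"))
```

A probe with the sense sequence (same strand as the transcript) would not hybridize to the transcript and is commonly used as a negative control.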

Example 6

Construction of a Transgenic Mouse Containing all or Part of the Gene for Enverin

[0286]A transgenic mouse containing all or part of the HERV-7q sequence (SEQ ID NO: 3) is constructed so as to identify the sequences responsible for the tissue specificity, and to evaluate the role of all or part of the endogenous retroviral motifs of the HERV-7q type, in particular all or part of the peptide motifs of enverin. The microinjection technique used is the conventional technique (Hogan et al., (1994), Manipulating the mouse embryo, Cold Spring Harbor, Cold Spring Harbor Laboratory Press) or an equivalent. Forms of the motifs of the HERV-7q type, including enverin, that are identical to the normal human molecule, or forms that are mutated, deleted, truncated or carry insertions, are tested in order to determine which motifs are critical from both the normal and the pathological point of view, more particularly during fetal development and during tumor processes.

[0303]As is evident from the above, the invention is not at all limited to its embodiments, implementations and applications which have just been described more explicitly; it embraces on the contrary all the variants which may occur to a specialist in this field, without departing from the framework or scope of the present invention.

Sequence CWU 1

14312599DNAHomo sapiens 1atcccctgcc ttaatcgcca agctccttca ggagaacaaa gaacaggcca ttaccctgga 60gaagactggc aactgatttt acccacaagc ccaaacctca gggatttcag tatctactag 120tctgggtaga tactttcacg ggttgggcag aggccttccc ctgtaggaca gaaaaggccc 180aagaggtaat aaaggcacta gttcatgaaa taattcccag attcggactt ccccgaggct 240tacagagtga caatagccct gctttccagg ccacagtaac ccagggagta tcccaggcgt 300taggtatacg atatcactta cactgcgcct gaaggccaca gtcctcaggg aaggtcgaga 360aaatgaatga aacactcaaa ggacatctaa aaaagcaaac ccaggaaacc cacctcacat 420ggcctgctct gttgcctata gccttaaaaa gaatctgcaa ctttccccaa aaagcaggac 480ttagcccata cgaaatgctg tatggaaggc ccttcataac caatgacctt gtgcttgacc 540caagacagcc aacttagttg cagacatcac ctccttagcc aaatatcaac aagttcttaa 600aacattacaa ggaacctatc cctgagaaga gggaaaagaa ctattccacc cttgtgacat 660ggtattagtc aagtcccttc cctctaattc cccatcccta gatacatcct gggaaggacc 720ctacccagtc attttatcta ccccaactgc ggttaaagtg gctggagtgg agtcttggat 780acatcacact tgagtcaaat cctggatact gccaaaggaa cctgaaaatc caggagacaa 840cgctagctat tcctgtgaac ctctagagga tttgcgcctg ctcttcaaac aacaaccagg 900aggaaagtaa ctaaaatcat aaatccccat ggccctccct tatcatattt ttctctttac 960tgttctttta ccctctttca ctctcactgc accccctcca tgccgctgta tgaccagtag 1020ctccccttac caagagtttc tatggagaat gcagcgtccc ggaaatattg atgccccatc 1080gtataggagt ctttctaagg gaacccccac cttcactgcc cacacccata tgccccgcaa 1140ctgctatcac tctgccactc tttgcatgca tgcaaatact cattattgga caggaaaaat 1200gattaatcct agttgtcctg gaggacttgg agtcactgtc tgttggactt acttcaccca 1260aactggtatg tctgatgggg gtggagttca agatcaggca agagaaaaac atgtaaaaga 1320agtaatctcc caactcaccc gggtacatgg cacctctagc ccctacaaag gactagatct 1380ctcaaaacta catgaaaccc tccgtaccca tactcgcctg gtaagcctat ttaataccac 1440cctcactggg ctccatgagg tctcggccca aaaccctact aactgttgga tatgcctccc 1500cctgaacttc aggccatatg tttcaatccc tgtacctgaa caatggaaca acttcagcac 1560agaaataaac accacttccg ttttagtagg acctcttgtt tccaatctgg aaataaccca 1620tacctcaaac ctcacctgtg taaaatttag caatactaca tacacaacca actcccaatg 1680catcaggtgg gtaactcctc 
ccacacaaat agtctgccta ccctcaggaa tattttttgt 1740ctgtggtacc tcagcctatc gttgtttgaa tggctcttca gaatctatgt gcttcctctc 1800attcttagtg ccccctatga ccatctacac tgaacaagat ttatacagtt atgtcatatc 1860taagccccgc aacaaaagag tacccattct tccttttgtt ataggagcag gagtgctagg 1920tgcactaggt actggcattg gcggtatcac aacctctact cagttctact acaaactatc 1980tcaagaacta aatggggaca tggaacgggt cgccgactcc ctggtcacct tgcaagatca 2040acttaactcc ctagcagcag tagtccttca aaatcgaaga gctttagact tgctaaccgc 2100tgaaagaggg ggaacctgtt tatttttagg ggaagaatgc tgttattatg ttaatcaatc 2160cggaatcgtc actgagaaag ttaaagaaat tcgagatcga atacaacgta gagcagagga 2220gcttcgaaac actggaccct ggggcctcct cagccaatgg atgccctgga ttctcccctt 2280cttaggacct ctagcagcta taatattgct actcctcttt ggaccctgta tctttaacct 2340ccttgttaac tttgtctctt ccagaatcga agctgtaaaa ctacaaatgg agcccaagat 2400gcagtccaag actaagatct accgcagacc cctggaccgg cctgctagcc cacgatctga 2460tgttaatgac atcaaaggca cccctcctga ggaaatctca gctgcacaac ctctactacg 2520ccccaattca gcaggaagca gttagagcgg tctcggccaa cctccccaac agcacttagg 2580ttttcctgtt gagatgggg 259921326DNAHomo sapiens 2gccgcctggc actcctgagg gaagtataaa ttataacacc atcttacagc tagacctctt 60ttgtagaaaa ggcaaatgga gtgaagtgcc ataagtacaa actttctttt cattaagaga 120caactcacaa ttatgtaaaa agtgtgattt atgccctaca ggaagccttc agagtctacc 180tccctatccc agcatccccg actccttccc caactaataa ggacccccct tcaacccaaa 240tggtccaaaa ggagatagac aaaagggtaa acagtgaacc aaagagtgcc aatattcccc 300aattatgacc cctccaagca gtgggaggaa gagaattcgg cccagccaga gtgcatgtgc 360ctttttctct cccagactta aagcaaataa aaacagactt aggtaaattc tcagataacc 420ctgatggcta tattgatgtt ttacaagggt taggacaatt ctttgatctg acatggagag 480atataatgtc actgctaaat cagacactaa ccccaaatga gagaagtgcc accataactg 540cagcctgaga gtttggcgat ctctggtatc tcagtcaggt caatgatagg atgacaacag 600aggaaagaga atgattcccc acaggccagc aggcagttcc cagtctagac cctcattggg 660acacagaatc agaacatgga gattggtgct gcagacattt gctaacttgt gtgctagaag 720gactaaggaa aactaggaag aagtctatga attactcaat gatgtccacc ataacacagg 780gaagggaaga aaatcctact 
gcctttctgg agagactaag ggaggcattg aggaagcgtg 840cctctctgtc acctgactct tctgaaggcc aactaatctt aaagcgtaag tttatcactc 900agtcagctgc agacattaga aaaaaacttc aaaagtctgc cgtaggcccg gagcaaaact 960tagaaaccct attgaacttg gcaacctcgg ttttttataa tagagatcag gaggagcagg 1020cggaacagga caaacgggat taaaaaaaag gccaccgctt tagtcatgac cctcaggcaa 1080gtggactttg gaggctctgg aaaagggaaa agctgggcaa attgaatgcc taatagggct 1140tgcttccagt gcggtctaca aggacacttt aaaaaagatt gtccaagtag aagtaagccg 1200ccccctcgtc catgcccctt atttcaaggg aatcactgga aggcccactg ccccagggga 1260caaaggtcct ctgagtcaga agccactaac cagatgatcc agcagcagga ctgagggtgc 1320ctgggg 1326310499DNAHomo sapiens 3ccctggggcg ggcttccttt ctgggatgag ggcaaaacgc ctggagatac agcaattatc 60ttgcaactga gagacaggac tagctggatt tcctaggccg actaagaatc cctaagccta 120gctgggaagg tgaccacgtc cacctttaaa cacggggctt gcaacttagc tcacacctga 180ccaatcagag agctcactaa aatgctaatt aggcaaagac aggaggtaaa gaaatagcca 240atcatctatt gcctgagagc acagcaggag ggacaacaat cgggatataa acccaggcat 300tcgagctggc aacagcagcc cccctttggg tcccttccct ttgtatggga gctgttttca 360tgctatttca ctctattaaa tcttgcaact gcactcttct ggtccatgtt tcttacggct 420cgagctgagc ttttgctcac cgtccaccac tgctgtttgc caccaccgca gacctgccgc 480tgactcccat ccctctggat cctgcagggt gtccgctgtg ctcctgatcc agcgaggcgc 540ccattgccgc tcccaattgg gctaaaggct tgccattgtt cctgcacggc taagtgcctg 600ggtttgttct aattgagctg aacactagtc actgggttcc atggttctct tctgtgaccc 660acggcttcta atagaactat aacacttacc acatggccca agattccatt ccttggaatc 720cgtgaggcca agaactccag gtcagagaat acgaggcttg ccaccatctt ggaagcggcc 780tgctaccatc ttggaagtgg ttcaccacca tcttgggagc tctgtgagca aggacccccc 840ggtaacattt tggcaaccac gaacggacat ccaaagtggt gagtaatatt ggaccacttt 900cacttgctat tctgtcctat ccttccttag aattggagga aaataccggg cacttgtcgg 960ccagttaaaa acgattagtg tggccaccgg acttaagact caggtgtgag gctatctggg 1020gaagggcttt ctaacaaccc ccaacccttc tgggttgggg acttggtttg cctcaagcca 1080gcttccactt tcagttttct tggggaagcc gagggccgac tagaggcaga aagctgtcgt 1140cctgaactcc cggcagtagc cggttgagat catggtgtag 
ccagaagtct caacagtcgc 1200ccatgcatgc acccctatct ttccttctga cccatacctc ctgggtccca accacaactt 1260tcttcaaagt gtagccccaa aattctcctt acctctgaat atacttcctc tgatccctgc 1320ctcctaggta ctattggttc agacttccat ttcctctagc aagttgtatc tccaaaggga 1380tctaaggaag ctctgcgctg cgtccttagg cacctaggct ataacccagg gagtcttatc 1440cctggtgtcc ctcccaattt aggcatacag ctcttgacat gggcagttat gtaggaccca 1500ctccccacca cccttgccag ggccccaagt ttgtaaatgg ctgagggaaa agagagacag 1560aggagagaga gagaaatgga ggagaaagag agagagacag agaggagaga gagacagtga 1620gagagacaga agagagagag agacaaagag gagagagaga gagtcaaaga gagaaagaaa 1680gagaaagaaa tagtaaaaaa cagtgtgccc tattccttta aaagccaggg taaatttaaa 1740acctgtactt gataattgaa ggtcttctct gtgaccctat agcactccaa tccactttgt 1800ggtcagtgta aataagagca taggccgaaa gcactgaggc cattgacaac ccgtagcttc 1860cctatcaaaa atccttaacc cagtaacccg cagatggacc aaatgcattc agtcggtagc 1920gcaactgctt tgctaaaagt agaaaagtaa cttttagagg aaacctcatt gtgagcacac 1980ctcacctgtt cagaattatt ctaataaaaa aagcaaaaag gtagcttact aactcaaaaa 2040tcttaaagta tggggctatt ctgttagaaa aaggtaatgt aactccaacc actgataatt 2100cccttaaccc agcagatttc ctaacgggat ttaaatctta attaccatac aaaggtccga 2160ccagacctag gcggaactcc cttcaggaca ggacgataga tggttcctcc caggtgattg 2220aggaaaaaaa ccacaatggg tattcagtaa ttgatacggg gactcttgtg gaagcagagt 2280tagaaaaatt gcctaataac tggtctcctc aaacgtgtga gctgtttgca ctcagccaag 2340ccttaaagta cttacagaat caaaagacta tctcaatcct gattcaaaag gttagctaca 2400ccctctctgt aatgcatttg cataagaact tgtttatggg aatgcatctt gatggggcag 2460ctgggttgtt ataaaatagg aacccagccc agctctagga ctcacccctg agcgcaaagg 2520caatgttggg catgctggta aaggaccact agaatccagc agcccagacc cctttctttg 2580tggtcaagaa aggcgggaaa aggggtgcag gactgctaca tcggtaagca taactaatcc 2640gataaacaga ggtccatggg tggttacgca ccctggaaag gaactcaccc ctgagcacaa 2700aggcaatgtt gggcacgctg gtaaaggacc actagaatcc agcagcctgg acccctttct 2760ttgtggtcaa gagaggcagg aaaacaggtg caggactgca acatcagtga gcataactaa 2820ttcgataagc agaggtccat gggtggtgat gcaccctgga aagaataagc attaggacca 2880tagaggacac 
tccaggacta aagctcatcg gaaaatgact agggttgctg gcatccctat 2940gttctttttt cagatgggaa acgttccccg caagacaaaa acgcccctaa gacgtattct 3000ggagaattgg gaccaatttg accctcagac actaagaaag aaacgactta tattcttctg 3060cagtgccgcc tggcactcct gagggaagta taaattataa caccatctta cagctagacc 3120tcttttgtag aaaaggcaaa tggagtgaag tgccataagt acaaactttc ttttcattaa 3180gagacaactc acaattatgt aaaaagtgtg atttatgccc tacaggaagc cttcagagtc 3240tacctcccta tcccagcatc cccgactcct tccccaacta ataaggaccc cccttcaacc 3300caaatggtcc aaaaggagat agacaaaagg gtaaacagtg aaccaaagag tgccaatatt 3360ccccaattat gacccctcca agcagtggga ggaagagaat tcggcccagc cagagtgcat 3420gtgccttttt ctctcccaga cttaaagcaa ataaaaacag acttaggtaa attctcagat 3480aaccctgatg gctatattga tgttttacaa gggttaggac aattctttga tctgacatgg 3540agagatataa tgtcactgct aaatcagaca ctaaccccaa atgagagaag tgccaccata 3600actgcagcct gagagtttgg cgatctctgg tatctcagtc aggtcaatga taggatgaca 3660acagaggaaa gagaatgatt ccccacaggc cagcaggcag ttcccagtct agaccctcat 3720tgggacacag aatcagaaca tggagattgg tgctgcagac atttgctaac ttgtgtgcta 3780gaaggactaa ggaaaactag gaagaagtct atgaattact caatgatgtc caccataaca 3840cagggaaggg aagaaaatcc tactgccttt ctggagagac taagggaggc attgaggaag 3900cgtgcctctc tgtcacctga ctcttctgaa ggccaactaa tcttaaagcg taagtttatc 3960actcagtcag ctgcagacat tagaaaaaaa cttcaaaagt ctgccgtagg cccggagcaa 4020aacttagaaa ccctattgaa cttggcaacc tcggtttttt ataatagaga tcaggaggag 4080caggcggaac aggacaaacg ggattaaaaa aaaggccacc gctttagtca tgaccctcag 4140gcaagtggac tttggaggct ctggaaaagg gaaaagctgg gcaaattgaa tgcctaatag 4200ggcttgcttc cagtgcggtc tacaaggaca ctttaaaaaa gattgtccaa gtagaagtaa 4260gccgccccct cgtccatgcc ccttatttca agggaatcac tggaaggccc actgccccag 4320gggacaaagg tcctctgagt cagaagccac taaccagatg atccagcagc aggactgagg 4380gtgcctgggg caagcgccat cccatgccat caccctcaca gagccctggg tatgcttgac 4440cattgagggc caggaggttg tctcctggac actggtgcgg tcttcttagt cttactcttc 4500tgtcccggac aactgtcctc cagatctgtc actatctgag ggggtcctaa gacgggcagt 4560cactagatac ttctcccagc cactaagtta tgactgggga 
gctttattct tttcacatgc 4620ttttctaatt atgcttgaaa gccccactac cttgttaggg agagacattc tagcaaaagc 4680aggggccatt atacacctga acataggaga aggaacaccc gtttgttgtc ccctgcttga 4740ggaaggaatt aatcctgaag tctgggcaac agaaggacaa tatggacgag caaagaatgc 4800ccgtcctgtt caagttaaac taaaggattc cacctccttt ccctaccaaa ggcagtaccc 4860cctcagaccc aaggcccaac aaggactcca aaagattgtt aaggacctaa aagcccaagg 4920cctagtaaaa ccatgcagta acccctgcag tactccaatt ttaggagtac agaaacccaa 4980cagacagtgg aggttagtgc aagatctcag gattatcaat gaggctgttg ttcctctata 5040gccagctgta cctagccctt atactctgct ttcccaaata ccagaggaag cagagtggtt 5100tacagtcctg gaccttcagg atgccttctt ctgcatccct gtacatcctg actctcaatt 5160cttgtttgcc tttgaagata cttcaaaccc aacatctcaa ctcacctgga ctattttacc 5220ccaagggttc agggatagtc cccatctatt tggccaggca ttagcccaag acttgagcca 5280atcctcatac ctggacactt gtccttcggt aggtggatga tttacttttg gccgcccatt 5340cagaaacctt gtgccatcaa gccacccaag cgctcttcaa tttcctcgct acctgtggct 5400acatggtttc caaaccaaag gctcaactct gctcacagca ggttacttag ggctaaaatt 5460atccaaaggc accagggccc tcagtgagga acacatccag cctatactgg cttatcctca 5520tcccaaaacc ctaaagcaac taaggggatt ccttggcgta ataggtttct gccgaaaatg 5580gattcccagg tatggcgaaa tagccaggtc attaaataca ctaattaagg aaactcagaa 5640agccaatacc catttagtaa gatggacaac tgaagtagaa gtggctttcc aggccctaac 5700ccaagcccca gtgttaagtt tgccaacagg gcaagacttt tcttcatatg tcacagaaaa 5760aacaggaata gctctaggag tccttacaca gatccgaggg atgagcttgc aacctgtggc 5820atacctgact aaggaaattg atgtagtggc aaagggttga cctcattgtt tacgggtagt 5880ggtggcagta gcagtcttag tatctgaagc agttaaaata atacagggaa gagatcttac 5940tgtgtggaca tctcatgatg tgaatggcat actcactgct aaaggagact tgtggctgtc 6000agacaactgt ttacttaaat gtcaggctct attacttgaa gggccagtgc tgcgactgtg 6060cacttgtgca actcttaacc cagccacatt tcttccagac aatgaagaaa agataaaaca 6120taactgtcaa caagtaattt ctcaaaccta tgccactcga ggggaccttt tagaggttcc 6180tttgactgat cccgacctca acttgtatac tgatggaagt tcctttgtag aaaaaggact 6240tcgaaaagtg gggtatgcag tggtcagtga taatggaata cttgaaagta atcccctcac 6300tccaggaact 
agtgctcagc tagcagaact aatagccctc acttgggcac tagaattagg 6360agaagaaaaa agggcaaata tatatacaga ctctaaatat gcttacctag tcctccatgc 6420ccatgcagca atatggaaag aaagggaatt cctaacttct gagagaacac ctatcaaaca 6480tcaggaagcc attaggaaat tattattggc tgtacagaaa cctaaagagg tggcagtctt 6540acactgccgg ggtcatcaga aaggaaagga aagggaaata gaagagaact gccaagcaga 6600tattgaagcc aaaagagctg caaggcagga ccctccatta gaaatgctta taaaacaacc 6660cctagtatag ggtaatcccc tccgggaaac caagccccag tactcagcag gagaaacaga 6720atggggaacc tcacgaggac agttttctcc cctcgggacg gctagccact gaagaaggga 6780aaatactttt gcctgcaact atccaatgga aattacttaa aacccttcat caaacctttc 6840acttaggcat cgatagcacc catcagatgg ccaaatcatt atttactgga ccaggccttt 6900tcaaaactat caagcagata gtcagggcct gtgaagtgtg ccagagaaat aatcccctgc 6960cttatcgcca agctccttca ggagaacaaa gaacaggcca ttaccctgga gaagactggc 7020aactgatttt acccacaagc ccaaacctca gggatttcag tatctactag tctgggtaga 7080tactttcacg ggttgggcag aggccttccc ctgtaggaca gaaaaggccc aagaggtaat 7140aaaggcacta gttcatgaaa taattcccag attcggactt ccccgaggct tacagagtga 7200caatagccct gctttccagg ccacagtaac ccagggagta tcccaggcgt taggtatacg 7260atatcactta cactgcgcct gaaggccaca gtcctcaggg aaggtcgaga aaatgaatga 7320aacactcaaa ggacatctaa aaaagcaaac ccaggaaacc cacctcacat ggcctgctct 7380gttgcctata gccttaaaaa gaatctgcaa ctttccccaa aaagcaggac ttagcccata 7440cgaaatgctg tatggaaggc ccttcataac caatgacctt gtgcttgacc caagacagcc 7500aacttagttg cagacatcac ctccttagcc aaatatcaac aagttcttaa aacattacaa 7560ggaacctatc cctgagaaga gggaaaagaa ctattccacc cttgtgacat ggtattagtc 7620aagtcccttc cctctaattc cccatcccta gatacatcct gggaaggacc ctacccagtc 7680attttatcta ccccaactgc ggttaaagtg gctggagtgg agtcttggat acatcacact 7740tgagtcaaat cctggatact gccaaaggaa cctgaaaatc caggagacaa cgctagctat 7800tcctgtgaac ctctagagga tttgcgcctg ctcttcaaac aacaaccagg aggaaagtaa 7860ctaaaatcat aaatccccat ggccctccct tatcatattt ttctctttac tgttctttta 7920ccctctttca ctctcactgc accccctcca tgccgctgta tgaccagtag ctccccttac 7980caagagtttc tatggagaat gcagcgtccc ggaaatattg 
atgccccatc gtataggagt 8040ctttctaagg gaacccccac cttcactgcc cacacccata tgccccgcaa ctgctatcac 8100tctgccactc tttgcatgca tgcaaatact cattattgga caggaaaaat gattaatcct 8160agttgtcctg gaggacttgg agtcactgtc tgttggactt acttcaccca aactggtatg 8220tctgatgggg gtggagttca agatcaggca agagaaaaac atgtaaaaga agtaatctcc 8280caactcaccc gggtacatgg cacctctagc ccctacaaag gactagatct ctcaaaacta 8340catgaaaccc tccgtaccca tactcgcctg gtaagcctat ttaataccac cctcactggg 8400ctccatgagg tctcggccca aaaccctact aactgttgga tatgcctccc cctgaacttc 8460aggccatatg tttcaatccc tgtacctgaa caatggaaca acttcagcac agaaataaac 8520accacttccg ttttagtagg acctcttgtt tccaatctgg aaataaccca tacctcaaac 8580ctcacctgtg taaaatttag caatactaca tacacaacca actcccaatg catcaggtgg 8640gtaactcctc ccacacaaat agtctgccta ccctcaggaa tattttttgt ctgtggtacc 8700tcagcctatc gttgtttgaa tggctcttca gaatctatgt gcttcctctc attcttagtg 8760ccccctatga ccatctacac tgaacaagat ttatacagtt atgtcatatc taagccccgc 8820aacaaaagag tacccattct tccttttgtt ataggagcag gagtgctagg tgcactaggt 8880actggcattg gcggtatcac aacctctact cagttctact acaaactatc tcaagaacta 8940aatggggaca tggaacgggt cgccgactcc ctggtcacct tgcaagatca acttaactcc 9000ctagcagcag tagtccttca aaatcgaaga gctttagact tgctaaccgc tgaaagaggg 9060ggaacctgtt tatttttagg ggaagaatgc tgttattatg ttaatcaatc cggaatcgtc 9120actgagaaag ttaaagaaat tcgagatcga atacaacgta gagcagagga gcttcgaaac 9180actggaccct ggggcctcct cagccaatgg atgccctgga ttctcccctt cttaggacct 9240ctagcagcta taatattgct actcctcttt ggaccctgta tctttaacct ccttgttaac 9300tttgtctctt ccagaatcga agctgtaaaa ctacaaatgg agcccaagat gcagtccaag 9360actaagatct accgcagacc cctggaccgg cctgctagcc cacgatctga tgttaatgac 9420atcaaaggca cccctcctga ggaaatctca gctgcacaac ctctactacg ccccaattca 9480gcaggaagca gttagagcgg tctcggccaa cctccccaac agcacttagg ttttcctgtt 9540gagatggggg actgagagac aggactagct ggatttccta ggctgactaa gaatccctaa 9600gcctagctgg gaaggtgacc acatccacct ttaaacacgg ggcttgcaac ttagctcaca 9660cctgaccaat cagagagctc actaaaatgc taattaggca aagacaggag gtaaagaaat 9720agccaatcat 
ctattgcctg agagcacagc aggagggaca atgatcggga tataaaccca 9780agtcttcgag ccggcaacgg caaccccctt tgggtcccct ccctttgtat gggagctctg 9840ttttcatgct atttcactct attaaatctt gcaactgcac tcttctggtc catgtttctt 9900acggcttgag ctgagctttc gctcgccatc caccactgct gtttgccgcc accgcagacc 9960cgccgctgac tcccatccct ctggatcatg cagggtgtcc gctgtgctcc tgatccagcg 10020aggcacccat tgccgctccc aatcgggcta aaggcttgcc attgttcctg catggctaag 10080tgcctgggtt catcctaatt gagctgaaca ctagtcactg ggttccatgg ttctcttctg 10140tgacccacag cttctaatag agctataaca ctcaccgcat ggcccaaggt tccattcctt 10200gaatccataa ggccaagaac cccaggtcag agaacacgag gcttgccacc atcttgggag 10260ctctgtgagc aaggaccccc aagtaacaca accatgaggg tgcaaatgca tgggccacta 10320atggtagagc aagaaaacag aagggccctg gttcctcgaa ggcatcagtg agctgaaatg 10380cctgccctgg atgtcctatt cctaggtgtt tttctgcctg aagcagatta aaccctttgt 10440tcacttctcc aagtagggct tctattacag cccaaatcaa tccccacccc agatgacat 1049942784DNAHomo sapiens 4ctccttcagg agaacaaaga acaggccact acccaagaga agactggcaa ctagatttta 60cccatatgcc caaatctcag ggatttcagt atctactagt ttgggtagat actttcactg 120gttgggcaga ggccttcccc tgtaggacag aaaaggccca agaggtaata aacgttcatg 180aaataattcc cagattcgga cttccccaag gcttacagag tgacaatggc cctgctttca 240aggctacagt aacccaagga gtatcccagg tgttaggtat acaatatcac tcacactgcg 300cctggaggcc acagtcctca ggaaaggtgg agaaaatgaa caaaacactc aaatgacatc 360taaaaaagct aatccaggaa acccacctcg catggcctgc tctgttgcct atagccttac 420taagaatccg aaactctccc caaaaagcag gacttagtcc

atacaaaatg ctgtatggac 480ggcccttcct aaccaatgaa cttgggcttg accgagagac agccaactta gttgcagaca 540tcatctcctt agccaaatat caacaggttc ttaaaacatt acagggagcc tgtccccaag 600aagagggaaa ggaactattc caccctggtg acatggtatt agtcaagtcc cttccctcta 660attccccatc cctagataca tcctgggaag gaaactaccc agccatttta tctaccctaa 720cggcagttaa agtggctgga gcggagtctt ggatacatca cactcaagtc aaaccctgga 780tactgccaaa ggaactcaaa aatccatgag acaatgctag ctattcctgt gaacctctag 840aggatctgcg cctgctcttc aaatgacaac cagggggaaa gtaactaaaa tcgtaaatcc 900cctggccctc ccttatcata tttttctctt tactgttctc ttaccccctt tcactctcac 960tgcaccccgt ccatgccact gcaccccgtc catgccccgt ccatgccagt agctcccctt 1020agcaagagtt tctatggaga atgcagcgtc ccggaaatat tgatgcccca ttgtatagga 1080gtttatctaa gggaaccccc accttcactg cccacaccca tatgccccac aactgctata 1140actctgccac tctttgcatg catgcaaata ctcattattg gacaggaaaa acgattaatc 1200ccagttgtcc tggaggactt ggaggactca cttcactcat accagtatgt ctgatggggg 1260tggagttcaa gatcaggcaa cagaaaaaca cataaaggaa gtaatctccc aactgacctg 1320ggtacatagc acccctggcc cctacaaagg actagatctc tcaaaactac atgaaaccct 1380ccatacccat actggcctgg taagcctatt taataccacc ctgactgggc tccatgaggt 1440ctcggcccaa aaccctacta actgttggat gtgcctcccc ctgcacttta ggccatacat 1500ttcaatccct atacctgaac aatggaacaa cttcagcaca gaaataaaca ccacttctgt 1560tttagtaggt cctctttcca atctggaaat aacccatacc tcaaacctca cctgtgtaaa 1620atttagcaat actatagaca cagccaactc ccaatgcatc aggtgggtaa ctcctcccac 1680acgaatagtc tgcctaccct caggaatatt ttttgtctgt ggtacctcag cctatcattg 1740tttgaatggc tcttcagaat ctgtgtgctt cctctcattc ttagtggccc ctatgcccat 1800ctacactgaa caagatttat acaatcatgt catacctaag ccccgcaaca aaagagtacc 1860cattcttcct tttgttattg gagcaggagt gctaggcgga gtagctactg gcattggcgg 1920tatcacaacc tctactcagt tctactacaa actgtctcaa gaactaaatg gtgacatgga 1980atgggtcgct gataccctgg tcaccttgca agatcaactt aactccctag cagcagtagt 2040ccttcaaaat cgaagagctt tagacttgct aaccgcggaa agcgggggaa cctttttatt 2100tttagaggaa aaatgctgtt gttatgttaa tcaatccgga atcatcaccg agaaagttaa 2160agaaattcaa ggtcgaatat 
aacgtagagc aaaggagctg caaaacactg gaccctgggg 2220cctcctcagc caatggatgc cctggattct ccccttctta ggacctctag cagctataat 2280attgttactc ctctttggac cctgtatctt taacctcctt gttaagtttg tcttttccag 2340aatcgaagca gtaaaactac aaatcgttct tcaaatggag ccccagatgc agtccatgag 2400taaaatctac cacggacccc tggaccggcc tgctagccca tgctctgatg ttaatgacat 2460caaaggcacc cctcccgagg aaatctcaac tgcacaacct ctactacgcc ccaattcagc 2520aggaagcagt tagagtggtt gttggccaac ctccccaaca gcagttgggt tttcctgttg 2580agagggggga ctgagagaca ggaataacta gatttcctag accaactaag aatccctaag 2640actagctggg aaggtgaccg cttccacctt taaacaccgg gcttgcaact tagctcacgc 2700ccaaccaatc agatactaaa gagagctcac taaaatgcta attaggcaaa aacaggagat 2760aaagaaatag ccaatcatct gttg 278451799DNAHomo sapiens 5gggattctta gtcggcctag gaaatccagc taatcctgtc tctcagtccc cccactcaac 60aggaaaaccc aagtgctgtt ggggaggttg gctgacgacc agtctaactg cttcctgcgg 120aattggggca tagtaggggt tgtgcagttg agatttcctc gggaggggtg cgttcgatat 180cattacaatt ggagcatggg ctagtaggcc ggtccagggg tccacggtag atcttagtca 240tggacttcat ctggggttcc atttgaagaa cgatttgtag ctttacaact ttgattctgg 300aagagacaaa cttaacaagg aggttaaaga tacagggtcc aaagaggagt atcaatatta 360gagctgctag agatcctaag aaggggagaa tccagggcat ccattggctg aggaggcccc 420agggtctggt gtttttgaag ctcctctgtt ctacgttgta ttcaatctcg aatttcttca 480actttctctg tgacaattca ggattgatta acataataac aacattcttc cgctaaaata 540acataataac aacattcttc ccctaaaaat aaacagcttc cccctctttc agaggttagc 600aagtctaaag ctcttcaatt ttgaaggact actgatgcta ggaagttaag ttgatcttgc 660aaggtgacca gggagtcggc aacccattcc atgtcaccat tgagttcttg agatagtttg 720tagtagaact gagtagaggt tgtggtaccg ccaatgccag aacctagtcc acctagcact 780cctgctccga taacaaaagg aagaatgagt actcttttgt tgtggggctt aggtacaaca 840taattgtata aatcttgttc agtgtaaatg gtcatggggg cactaagaat gagaggaagc 900acatagattc tgaagagcca ttcaaacaac gataggctaa ggtaccacag acaaaaaata 960ttcctgaggg taggcagact attcgtgtgg gaggagttac ccacctgatg cattgggagt 1020tggttgtgtc tacagtattg ctaaatttta cacaggtgag gtttgaggta tgggttattt 1080ccagattgga aacaagaggt 
cctactaaaa cggaagtggt gtttatttct gtgctgtagt 1140tgttccattg ttcaggtaca gggattgaaa tgcatggcct gaaatacagg gggaggcaca 1200accaacagtt agtagggttt tggaccgaga cctcatggag cccagtgagg gtggtattaa 1260ataggcttac caggcaagta tgggtatgga gggtttcatg tagttttaag agatctagtc 1320ctttgtaggg gctaggggtg ctatgtaccc gggtcagttg ggaggttact tcctttacat 1380gtttttctct tgcctgatct tgaactccac ccccctcaga cataccagta tgggtgaagt 1440aagtccgaca gacagtggct ccaagtcttc caggacaact aggattaatc attttccctg 1500tccaataatg agtatttgca tgcatgcaaa gagtggcaga gttatagcag ttgtggggca 1560tatgggtgtg ggcagtgaag gtggagtttc ctttaggtaa actcctattt gatggggcat 1620caatatttct gggaagccgc attcttcata gaaactcttg gtaaggggag ctgctggttg 1680tacagcagca tggagggggt gcagtgagag tgaaaggggg taagagaaca gtaaagagaa 1740aaatatgata agggagggcc atggggattt acgattttag ttactttcct cacggttgt 179961489DNAHomo sapiens 6tggtgcttgc cccgggcact ctcagtcctg ctgctggatc atctggttag tggcttctga 60ctcagaggac ctacgtcccc tggggcagtg ggccttacag tgattccctt gacacgaggt 120gcatggacga gggggcggct tatttctatt tggacaatct tttttaaagt gtccttgtag 180accgcactgg aagcaaaccc tattaggcat ttgatttgcc tagcttttcc cttttccagt 240gcctccaaag tccgcttgcc tgagggccat gactaaagcg gtggcctttt ttttatccca 300tttgtcccat tctgcctgct catcctgatc tctattataa aaaactgagg ttgccaagtt 360caatagggtt tctaagtttt gttccgggcc taaggcagac ttttgaagtt ttttcctaat 420gtctgtagct gactgagtga taaacttatc ctttaagatt agttggcctt cagtagagtc 480agttgacaga gagaggtatg cttcctcaat gcctccgtta gtcactccag aaaggcggta 540ggattttctt cctttccctg tgttatagtg gacatcattg aataactcac aggcttcttt 600ctagttttcc ttagtccttc tagcacgcaa gttagcaaat gtctgcggca ccaatctcca 660tgttctgatt ctgtgtccca gtgagggtct acactgggaa ctgcctgctg gcctgtgggg 720aatcgttctc tttcctctgt tgtcgaccta tcattgacct gactgagata ccagagatcg 780ccaaactctc aggctgcagt tacggcgaca cttctgtcat ttggggttag tgtctgattt 840agcagtaaca ttatatctct ccatatcaga tcaaaggatt gtcctaaacc ttgtaaaaca 900tcaatatagc cattagggtt atctgagaat ttacctaggt ctattttaat ttaaagtctg 960ggagagaaaa aggcacatgc actctggctg ggccgaattc tcttcctccc 
actgcgtctg 1020agagagaaaa aggtacgtgc actctggctg ggccgaattc tcctcccacc gcttggaggg 1080ggcataatcg gggaatattg gcattctttg gttagttgtt tacccctttg tctatctcct 1140tttggaccgt ttgggttgaa ggggggtcct tattatttgg ggaaggagtc tgggggatgc 1200tggggtaggg aggtagactc tgagggcttc ctgtagggca taaatcacac tttttacata 1260attgcgagtt gtctcttaat gaaaagaaag tttgtacgta tgacacttca caccatttgc 1320cttcttttct acaaaagagg tctagctgta agatggtgtt ataatttatg cttccctcag 1380gatgccaggt ttctccccct taaagagtat atcgttgcca ggcggtactg cagaagaata 1440tgtctttttt ttcttagcat ctgagagtca aattggtccc aattctcca 148971216DNAHomo sapiens 7taaagataca gggattgaaa tgtatggcct gaagtgcagg gtcatatagg tgtgggtggt 60gaaaatgggg tttcctttag aaaaactcct atacgatggg tcatcaatat ttccaggaag 120ccgcattctc catagaagct cttggtaatg ggagctactg gtagtacagt ggcatggagg 180gggtgcagtg agagtgaaag agggtaaaag aacagtaaag agaaaaatat gataagggag 240gggttcagtg agagtgaaag ggggtaagag aacagtaaag aaaaaaatat gacaaggagg 300gccatgagga tctacgattc tagttacttt cctcacggtt gtcgcttgaa gagcaggtgc 360agatcctcta gaggttcaca ggaatagcta gcgttgtctc ctggattttc gggttccttt 420ggcagtatac agagtttgac tcgagtgtga tgtattcaag actccactcc agccacttta 480accgcagttg gggtagataa aatgactggg tagggtcctt cccaggatgt atctaaggat 540ggggacttag aaggaaggga cttgactaat accatgtcac cagggtgcaa taattacttt 600ccctcttctc gggaacaggt tccctgtaat gttttaagaa cttgttgata tttggccaag 660gaggtgatgt ctgcaactaa gctggccatc tctcggtcaa gcacaaggtc cttggttagg 720aagggccatc catacagcat tttgtatggg ctaagtcctg ctttttgggg agagttttgg 780attcttagta aggctgtagg caacagagca ggccatgcaa ggtgggtttc ttgggttagc 840ttttttaaat gtcgtttgag tgcttcattc attttcttga cttttcctga ggattgtggc 900ctccacgcgc agtgtaagtg atattgtatg cctaatgcct gggatactcc ctgggttact 960gtagccttga aaacggggcc attgtcactc tgtaagcctc ggggaagtcc gaatctggga 1020attatttcat gaattagtgc ctttattaca tcttggtcct tttctgtcct acaaaggaag 1080gcctctgccc aaccagtgaa aatatctacc cagactagta gatactgaaa tccctgagat 1140ttgggcatgt gggtaaaatc tagttgccag tcttctcctg agtaatggcc tgttctttgt 1200tctcctgaag gagctt 
12168976DNAHomo sapiens 8agtgataatg gaatacttga aagtaatccc ctcactccag gaactagtgc tgagctggcc 60aaactaatag ccctcactcg ggcactagaa ttaggagaag agaaaagggt aaatatatat 120acagactata agtatgctta cctagtcctt catgcccatg cagcaatatg gagagaaagg 180gaattcctaa cttccaaagg aacacctatc aaacatcagg aagccattag gatattatta 240ttggtggtac agaaacctaa agaggtggca gtcctacact gctggggtca tcagaaaaaa 300aaggaaaggg aaatagaagg gaactaccaa gcagatattg aagccaaaag agccgcaagg 360caggaccctc cattagaaat gcttatagaa ggacccctag tgtggggtaa ccccctccag 420gaaagcaatc cccagtactc agcaggagaa ataaaatgga gaacctcacg aggacatact 480ttcctcccct caggatggct agccaccaaa gaaggaaaaa tgcttttgcc tgcagctaac 540caatggaaat tacttaaaac ccttcaccaa acctttcact taggattgat agcacccatc 600agatggccaa attattattt actggatcag gccttttcaa aactatcaag caggtagtca 660gggcctgtaa agtgtgccaa agaaataatc tcctgcactg caagccatac atttcaatcc 720ctgtatcttt aacctccttg ttaagtttgt ctcttccaga atcaaagctg taaaactaca 780aatggttctt caaatggagt ctcagatgca gtccatgact aagatatacc gcagccccct 840ggagggggcc tgctagccca tgctccaatg ttaatgacat cgaaggcacc cctcccgggg 900aaatctcaac tgcacaaccc ctactatgtc ccaattcagc aggaagcagt taaagcggtc 960atcggccaac ctcccc 9769942DNAHomo sapiens 9agaggagaac agcagcataa gcggctggca gaggtaggga aagaccagca agaagaaaag 60agagaaagag aaagagaaag tcagagaaag agacagagag aggaagagac aaagagacag 120aaagtcaaag aggtagtagt cagaaacaga gacaaaaaaa aggagtcaga aagagggaca 180gacacagaaa gtcaaaaaaa aagttaagaa gaaaggaaaa gacaaagaag aagtcgaaga 240ggagaaagag agagatagaa gtagtaaaga aaaaaacagc atatcccatt cctttaaagc 300cagggtaaat ttctatctac ccagccaagg catattctac ttatgtggat cttcaaccca 360tatctgcctc tcagacagtt tgcaagaaat aatgaaatct atccttactt tacaatccca 420aatagactct ttggcagcag tgactctcca aaactgcaga ggcctagacc tcctcactgc 480tgaaaaagga ggacactaca ccttcttagg ggaagaatgt tgtttttaca ctaaccagtc 540ggggatagta tgagatgctg cccggagttt acaggaaaag gcttctgaaa tcagacaacg 600cctttcaaat tcttatacca acttctggag ttaggcaaca tggcttctcc cctttctagg 660tcctgtggca gccatcttgc tgttactcgc ctttgggccc tgtattttta accttcttgt 
720caaatttgtt tcctctagaa tcgaggccat caagctacag atggtcttac aaatggaacc 780ccaaaagagt tcaactaaca acttctaccg aggacccctg gatcaaccca ctggcacttc 840ccctggccta gagagttccc ctctgaagga caccgcaact gcagggccct tctttgcccc 900atccagcagg agtagctaga gtggtcatcg gccaaattgc ca 942101375DNAHomo sapiens 10ccccaatatt ctctttctga tggggaaaaa tggccacctg agggaagcac aaattacaat 60actatcctgc agcttgatct tttctgtaag agggaaggca aatggagtga aataccttat 120gtccaagctt tcttttcatt gagggagaat acacaactat gcaaagcttg caatttacat 180cccacaggag gacccctcag cttaccccca tatcctagcc tccctatagc ttcccttcct 240attgatgata ctcctcctct aatctcccct gcccagaagg aaataagcaa agaaatctcc 300aaaggtccac aaaaaccccc gggctatcgg ttatgtcccc ttcaagctgt agggggaggg 360gaatttggcc caacccgggt gcatgtcccc ttctccctct ctgatttaaa gcagatcagg 420cagacctggg gaagttttca gatgatcctg ataggtacat agatgtccta cagggtctag 480ggcaaacctt tgacctcact tggagagacg tcatgctact gttagatcaa accctggcct 540ttaatgaaaa gaatgcggct ttagctgcag cctgagagtt tggagatacc tggtatccta 600gtcaagtaaa tgaaagaatg acagccgaag aaagggacaa cttccctact ggtcagcaag 660ccatccccag tatggatccc cactgggact ttgactcaga tcatggggac tggagtcgta 720aacatctgtt gatctgtgtt ctggaaggac taaggagaat tgggaaaaag cccatgaatt 780attcaatgat atccaccata acccagggaa aggaagaaaa tccttctgcc ttcctcgagc 840ggctacaaga ggccttaaga aaatatactc ccctgtcacc cgaatcactc gagggtcaat 900tgattctaaa agataagttt attacccaat cagccacaga tatcaggaga aagctccaaa 960agcaagccct gagccctgaa caaaatctag agacattatt aaacctggca accttggtgt 1020tctataatag ggaccaagag gaacaggccc aaaaggaaaa gcgagatcag agaaaggccg 1080cagccttagt catggccctc agacaaacaa accttggtgg ttcagagagg tcagaaaatg 1140gagcaggcca atcacctggt acggcttgtt atcagtgcgg tttactagga cactttaaaa 1200aagattgtcc aataagaaac aagctgcccc ctcatccgtg tccactatgc cgaggcaatc 1260actggaaggt gcactgcccc agaggatgaa ggttccctgg gttagaagcc cccaaccaga 1320tgatccaaca acaggactga gggtgcccgg ggcaagcacc agctcatgtc atcac 137511944DNAHomo sapiens 11acctaggagg aactgtcttc aggacaggac tatagatgct tcctcccagg cgattaaggg 60aaaaagacac aatgggtatt cagtaagtga 
taaggaaact cttgtagaag cagagttagg 120aaaattgcct aataattggt ctgctcaaat gtgcgagctg tttgcactca gccaaacctt 180aaaagtatta cagaatcagg aagaagccat ctataccaat tctaagttaa tatggactga 240acgagaactt attaatagca aagaataatt gaaatcccaa acttacaagg ttttcaacaa 300aagcacagtt tgctaaaagt taactgtgta acatgtatta tcctactacc acaaactctc 360aaatgatttc tcagacagtt tgcaagaaac aatgaaacct atccttactc tacaatccca 420aatagactct ttggcagcag tgactctcca aaaccaccaa ggcctagacc tcctcactgc 480tgagaaagga ggactctgca ccttcttagg ggaagattgt tgtttttaca ctaaccagtc 540agggatagtg tgagatgcca cccagcgttt acaggaaaag gcttctgaaa tcagacacaa 600tgcttttcaa accttatagc aacctctgga gttcggcgac tggcttttcc cctttctagg 660tcctgtgaca gccatcttgc tattactcgc cttcgggccc tgtattttta acctcctcgt 720caaatttgtt tcctctagga tcgaggccat caagctacag atggtcttac aaatggaacc 780ccaaatgagc tcgactaaca acttctactg aggacccctg gaccgaccca ctggcccttt 840aactggctta aagagtttcc ctctggagga cactacaact gcagggcccc ttctttgccc 900catccacagg aagttagcta gagcagtcat cacccaattc ccaa 94412963DNAHomo sapiens 12tacaggaacc ccataatacg tccttggcaa attctattca gctccaactg ctaggagtgg 60cccatttgtc ctgaaccctc aaatcatggg aatgagaaat gaatttagac tgaccacagc 120ccttatgagt tttcagctac aggggtgtat agaaccctga taaggagttt tctttgtgtg 180tggaagatcc ttctatattt gcctccccac caactggaca ggaacttgta ctttagccta 240catagtacct cctgtgactt atccttttca gaagaggcag tagctgtgcc cattcatgct 300aagcttcagc cgagagcaat ctcactactt cctctattgg ctggtttagg atttactacc 360acctaggaag tggactcaca gcctagatga aatctctctc caacttactc aaatccagga 420ccaaatagac tcattagcag ctgtggttct ccgaaccagt gagcactaga tctccaatct 480cctcactgcc gaaaggggag gaacatgcct ttttctgaac aaggaatgtt gtttttatgt 540caataaatca ggcatagtga gagatggaat taaatgactt caggatagag ctagcagact 600acatggtggg acaaccgaaa ctacctcagg gttctcacag cctgttctcc actggcttct 660tccattttta ggtcccttcc ttatgattat tctaggagta acctttggcc catgtctttt 720cagttccttc atcctttcgt ttcttcctga atagaatcaa tgaaactaga aatgttactg 780cagatggaac ctcagatgac ttcaaccagc acctattatc aaggacccct aaaccagcct 840gccggcccat acccggacgt 
tgacacccaa accacctctc acgaggaaac ctcagctaca 900gaaccccttc tatgccccta ttcagcagga agcaattaga gtggtcatcc tcccacaccc 960caa 963131362DNAHomo sapiens 13ccacaatatc ctcttccagg aggagaacga tggccacctg agggaagtat acactataat 60accatcctgc aactagatct gttttgtaaa caagaaggca agtggattta ggtaccatat 120gttcagacct ttttctcatt aagggatgat aacccacgat tgtgtaagac atgtaacctg 180caccccacag ggagtcctca aattctaccc ccatacccag tcctccccac ggctcctcct 240actaatgcca aaccctctct ggcttctaca gcccaaaagg gaacaaataa aagagccttc 300agagagccaa gagaccccac tggcccctgg ctatgtcctc ttcaggctgt aggaggggaa 360tttggcccaa cccgagtaca tgttcccttt tctctctctg atctaaagca aattaaggca 420gacttggatg aaagttctca gatgacccca atagatacgt agatggcctg ctgggtctgg 480gacaatcttt tgacctttcc tggagagaga tcatgttatt gcttgatcag acctaacctc 540taatgagaag aatgctgctt taacaggagc ccgagagttt ggggatacct ggtacctcag 600ttaagtaagt gatagaatga catcagaaga gagcagtttc ctactggcca gcaagcagtc 660cccagtatgg atccccactg ggaccctgac tcggatcatg gggactggag tcacaaacat 720ttactgacct gtatcctaga agggttaagg agaactagga aaaagcccat gaactattca 780atgatgtcta ctataaccca agggaaggaa gaaaacccta ttgccttcct caaaaggctg 840agggaggctt tgagaaaata tactcccctg tcaccagatt ccctcgaagg ccagttaatt 900ttaaaggaca aatttattac tcagtcagct gcagacatta ggaaaaagct ccaaaagtta 960gccttgggcc gagcaaaatt tggaggcatc attaaacctg gcaacctcag tgttctatca 1020tagggaccaa gaggaacagg ccgaaaagga aaagcaggat aagagaaagg ctgcagattt 1080agtcatgccc tcagacaaac cttggcggtt caaagaggag aaaaaatgga gcaggccaat 1140cacccagcag ggcttattat cagtgcagtt tacaaggaca ctttaaacaa gattgtccaa 1200agagaaataa gccgccctct cacccatgtc cactatgcca aggtgatcac tggaaggcac 1260actgtcccag aggacaaagg ttctctgggc cagaagtccc caaccagatg atccagcaac 1320aggatggagg gtgcccgggg caagcaccag ctcgtgttgt ca 136214945DNAHomo sapiens 14ttgcagatca atctcagact gctgtgctag caatgagtga ggcttcgtgg gcatgggacc 60ctctgagcca ggcatgggat ataatgtcct tgtgtgccat ttgctaagac tgttggaata 120gcacagtatt agggtgggag tggcccgatt ttccaggtgc tgtctgtcac cgcttccctt 180ggctaggaaa gagaattccc tgaccccttg ttcttcccag 
gtaaggcagt gcctcaccct 240gcttcagctc acactcaggt gactgcaccc actgtcctgc ccccactgtc ggacaagccc 300cagtgagatg aacctggtac ctcagttgga aatgcagaaa tcacctgtct tctgcgtcac 360tcacactggg agctgtagac tggagctgtt cctatttggc catcttggaa ccatctccca 420aatagactct ttggcagcag tgactctcca aaaccaccaa ggcctagacc tcctcattgc 480tgagaaagga ggactctgca ccttcttagg ggaggagtgt tgtttttata ctgaccagtc 540agggatggta cgagatgcca cccgatgttt acaggaaaag gcttctgaaa tcacacaaca 600cctttcaaac tcttatacca acctctggag ttgggcaaca tggcttctcc cctttctcgg 660tcccattgca gccatcttgc tattactcgc cttcaggctg tgtattttta acctccttgt 720caaatttgtt tcctctagaa ttgaggccgt caagctacag atggtcttac aaatgggacc 780ccaaatgagc tcaactaaca acttctgcca aggacccctg gaccaacctg ctggcccttt 840cactggcctt aagagttccc ctctggaggg cactacaact gcagggcccc ttctttgccc 900ctatccagca ggaagtagct agagcagtca tcacccaatt cccaa 94515939DNAHomo sapiens 15agagctacct tggcaagtac tctaggagta tgggaaaatg aaaacaacaa actcacacac 60cattttaaca tacacaatca ggtctgccca cccagcaagg tatattcttt gtatgtggaa 120catcgaccta tatctgcctc cccactaact agacagccac ctgaatctta gtctttctaa 180gtcccaacag taacattgcc ccaggaaatc agaccatatc agtatccctc aaagctcaag
240tctgtcagtg cagagccata caactaatac ccctacttat agggtaagga atggctactg 300ctacaggaac cagaatagct agtttgttta cttcattatc ctactaccac acactctcaa 360atgatttctc agacagtttg caagaaataa cgaaatctat ccttactcta caatcccaaa 420tagactcctt ggcagcagtg accctccaaa acggctgagg cctagacctc ctcactgcca 480agaaaggagg actctgcatt ttcttagggg aagagtgttt ttacactaac cagtcaggga 540cagtatgaga tgccactcgg agtttacagg aaaaggcttc tgaagtcaga caatgccttt 600caaactctat accaaactct ggagttgggc aacatggctt ctcccctttc taggtcccgt 660gacagccatc ttgctattat ttgcctttga gccctgtatt tttaatctcc ttttcaaatt 720tgtttcctct ggatcgaggc catcgagcta cagatggtct tcacaaatgg aaccccaaat 780gagctcaact aacaacttct actgaggacc cctggactaa cctgctgacc ctttcactgg 840cctgaagaat tcccctctgg aggacactac aactgcaggg ctccttcttt gcccctatcc 900agcaggaagt agctagagct gtcattgcct aattcctaa 93916979DNAHomo sapiens 16agtgataatg gaatacttga aagtaatccc ctcactcccc aggaactagt gctcagctgg 60cagaactaat agccctcact cgggtactag aatcaggaga aggaaaaagg gtaaatatat 120atacagactc taagtgtgct tacctagtcc tccatgccca tgcagcaata tggagagaaa 180gggaattcct aacttccgag ggaacaccta tcaaacatca ggaagccatt aggaaattat 240tattggctgt acagaaacct aaagaggtgg cagttttaca ctgccggggt catcagaaag 300gaaaggaaag ggaaatacaa gggagccacc aagttgatat tgaagtcaaa agagccacaa 360ggctggaccc tccattagaa atgcttatag gaggacccct agtatggggt aatcccctcc 420gggaagccaa gccccagtac tcagcaggag aaatagaata gggaacttca tgaggacata 480cttccctccc ctccagatgg ctagccacca ataaaggaaa aatacttttg cctgcagcta 540accaatagaa attacttaaa acccttcatc aaaccttcca cttaggcatt gatagcaccc 600atgagatggc caaattatta tttactggac caggcctttt caaaactatc aagcagatag 660tcagggcctg taaagtctgc caaagaaata atcccctgca ctgcaggcca tacatttcaa 720tccctgtatc tttaacctcc ttcttaaatt tgtctcttcc agaatcaaag ctgtaaaatt 780acaaatagtt cttcaaatgg agccacagat gcagtccatg actaagatcc accacagacc 840cctggaccag cctgctagcc catgctccaa tgttaatgac atcgaaggca ccccctcctg 900aggaaatctc aactgcacaa cccctactac gccccaattc agcagaaagc agttagagtg 960gtcatcagcc aacctcccc 979171774DNAHomo sapiens 17catgctggta 
aaggaccgct agaatccagc agccaggacc actttctttg tggtcaagaa 60aggtgggaaa acaggtgcag gactgctaca ctggtaagca taactaatcc gataagcaga 120ggtccatggg tggttacgca ccctggaaag gaataagcat taggactata gaggacactc 180taggactaat gctcatcgga aaatgactag gggtactggc atccctatgt tcttttttca 240gatgggaaat gttcccccca aggcagaaat gcccctaaga tgtattctgg agaaatggga 300ccaatctgac catcagacac taagaaagaa atgacttata ttcttctgca gtaccacctg 360gccacaatat cttcttcaag gggcagaaac ctggcctcct gagggaagta taaattataa 420caccatctta cagctagacc tcttttgtag aaaagaaggc aaatggagtg aagtgccata 480tgtacaaact ttcttttcat taagagataa ctcccaatta tgtaaaaagt gtgatttatg 540ccctacagga agccctcaga gtctacctcc cgaccccagc aagaccccaa ctccttctcc 600aactaataag gacccccctt caacccaaat ggtccaaaag gagatagaca aaggggtaaa 660caatgaacca aagagtgcca atattacacg attatactcg ctccaagcag tgggaggaga 720atttggccca gccagcgtgc atgtaccttt ttctctctca gatttaaagc aaattaaaat 780agacctaggt aaattctcag ataaccctga tggctatatt gatgttttac aagggttagg 840acaatccttt gatctgacat ggagagatat aatgttactg ctaaatcaga cactaacccc 900aaatgaaaaa agtgctgcca taacagcagc ctgagagttt ggcgaactct ggtatctcag 960tcaggtcaat gataggatga caacagatga aagagaatga ttccccacag gccagcaggc 1020agttcccagt gtagaccctc attaggacac agaatcagaa cttggagatt ggtgccacag 1080acatttgcta acttgcgtgc tagaaggact aaggaaaact aggaagaagc ccatgaatta 1140ttcaatgatg tcccctataa cacagggaaa ggaagaaaat cctactgcct ttctggagag 1200actaagggaa ggattgagga agcatacctc cctgtcacct gactctatta aaggccaact 1260aatcttaaag gataagttta tcactcagtc agctgcagag attaagaaaa aacttcaaaa 1320gtatgcctta ggcccagagc aaaacttaga aaccctactg aacttggcaa cctcagtttt 1380ttataataga gatcaggaag agcaggggaa tgggacaaat gggataaaaa aaaaaaaaaa 1440aggtgactgc tttagtcgtg gccctcaggc aaatggactt tggaggctcc agaaaaggga 1500aaagctgagc aaattgaatg cctaacaggg cttgcttcta gtgtggtcta caaggacact 1560ttaaaaaaga ttgtccaagt agaaacaagc tgcccccttg tccatgcccc ttatgtcaag 1620ggaatcactg gaaggcccac tgccccagga gatgaaggtc ctctgagtca gaagccacta 1680accagataat ccagcagcag gactgaggat gcccagggca agcgccagcc catgccatca 
1740ccctcacaga gccttgggta tgcttgacca ttga 177418938DNAHomo sapiens 18tgtaggaaga actcccttca ggacaggaca atagatggtt cctcccaggt gattaaggaa 60aaaagacaca gtattcagta agtgataagg aaactcttgt agaagcagag ttagaaaaat 120tgcctaataa ttggtctgct caaatgtgtg agttgtttgc actcagccaa atcttaaagt 180acttacagaa tcaggaagca gccatctata ccaattctaa gttaatatgg actaaacgag 240gttttattag tagcaaagaa aaattaaaat cccaaactta caaggttttc aactaaagtt 300tgccaaaagt taacagtgta acatgtatta tcctactatc acacactctc aaaggatttc 360tcagacagtt tgcaagaaat aacgtaatct atccttactc tacagtccca aatagactct 420ttggtagcag tgactctcca aaactgccga ggtctagacc tcctcaatgc tgagaaagga 480gaactctgca ccttcttagg ggaagagtgc tgtttttaca ctaaccagtc agggatagta 540tgagatactg cctgacgttt acaggaaaag gcttctgaaa tcagacaacg cctttcaagc 600tcttatacca acctctggag ttgggcaaca tggcttctcc ccttgctagg tcctgtggca 660gccatcttgc tattacttgc cttcgggccc tgtattttta acctccttgt caaatttgtt 720tcctctagga tcaaggccat caagctacag atggtcttac aaatggaacc ccaaatgagc 780tcaactaaca acttctactg aggacacctg gactgaccca ctggcccttt cactggccta 840aagagttccc ttctggagga cactacaact gcagggcccc gtcttcaccc ctatccagca 900ggaagtagct agatcagtca ttgcccaatt cccaacag 938191308DNAHomo sapiens 19gatgcttgcc ccaggcaccc tcagtcctgt tgttggatca tctggtcggg ggcttctggc 60ccaaagaacc tttgtcctct gaggcagtgc accttccagt gattgcctca gcattgtgga 120catgggcaag ggggcagctt gtttctcact ggacaatctt ttttaaggtg tccttccaaa 180ccacactggt aacaagccct accaggtgat tggcctgctc tattttctgt cctctctgaa 240ccaccaaggt ttgtctgtct gagggtcatg actaaggctg tggcctttct ctgatcttgc 300ttttcctttt tggcctgttc ctcttggtac ctattataga acactgaggt tgccaggttt 360aacaatggct ccagattttg ttcagggcac agggctcatt ttggagcttt ctcctgatat 420ctgcagctga ttgggtaata aacttatctt ttaggatcaa ttgactctca agagagttgg 480gtgacagggg agtatatttc cttgaggcct cccatagccg ctctaggaag gcagaaggat 540tttcttcctt tccctgagtt ataaaagaca tcattgaaca actcatggac tttttcccaa 600ttctccgtag tccttctaga acacaggtca gcagatgttt acgactccag tccccatgat 660ctgagtctag acaccagtgg ggatccatac tggggatggc ctgctgactg gtagggaatt 
720tgtccctttc tttggctgtc attctatcat ttacttgact aagataccaa gtatctccaa 780attctcaggc tgcagctaaa gctgcattct tttcattaaa ggccagggtt tgatctaata 840gcatgacatc tctccaagtg aggtcaaagg tttgccctag atccatagga catcagagaa 900ggagaagggg acatacacct gagttagcca aattcccctc cctctacagc ttgaagggga 960cataagcaat agcctgggga tttttgtggt cctttggaga tttctttgct tgtttccttc 1020tgggtggggg agattagagg aggcttatca gtaataggaa ggggagctat agggaggcta 1080ggatatgggg gtaagctgag aggtcatctt gtgggatgta aattgcaagc tttgcatagt 1140tgtggatttt ccttacaatg aaaataaagc ttggacataa ggtatttcac tccatttgcc 1200ttccctctta cagaaaaggt caagctgcag gatagtactg taatttatac ttccttcagg 1260tggccatttc ttcccatcag agagagaata ctggggctgg gccatagt 130820711DNAHomo sapiens 20actgagagac aggactagct ggatttccta ggccgactaa gaatccctaa gcctagctgg 60gaaggtgacc acgtccacct ttaaacacgg ggcttgcaac ttagctcaca cctgaccaat 120cagagagctc actaaaatgc taattaggca aagacaggag gtaaagaaat agccaatcat 180ctattgcctg agagcacagc aggagggaca acaatcggga tataaaccca ggcattcgag 240ctggcaacag cagcccccct ttgggtccct tccctttgta tgggagctgt tttcatgcta 300tttcactcta ttaaatcttg caactgcact cttctggtcc atgtttctta cggctcgagc 360tgagcttttg ctcaccgtcc accactgctg tttgccacca ccgcagacct gccgctgact 420cccatccctc tggatcctgc agggtgtccg ctgtgctcct gatccagcga ggcgcccatt 480gccgctccca attgggctaa aggcttgcca ttgttcctgc acggctaagt gcctgggttt 540gttctaattg agctgaacac tagtcactgg gttccatggt tctcttctgt gacccacggc 600ttctaataga actataacac ttaccacatg gcccaagatt ccattccttg gaatccgtga 660ggccaagaac tccaggtcag agaatacgag gcttgccacc atcttggaag c 71121711DNAHomo sapiens 21actgagagac aggactagct ggatttccta ggctgactaa gaatccctaa gcctagctgg 60gaaggtgacc acatccacct ttaaacacgg ggcttgcaac ttagctcaca cctgaccaat 120cagagagctc actaaaatgc taattaggca aagacaggag gtaaagaaat agccaatcat 180ctattgcctg agagcacagc aggagggaca atgatcggga tataaaccca agtcttcgag 240ccggcaacgg caaccccctt tgggtcccct ccctttgtat gggagctctg ttttcatgct 300atttcactct attaaatctt gcaactgcac tcttctggtc catgtttctt acggcttgag 360ctgagctttc gctcgccatc caccactgct 
gtttgccgcc accgcagacc cgccgctgac 420tcccatccct ctggatcatg cagggtgtcc gctgtgctcc tgatccagcg aggcacccat 480tgccgctccc aatcgggcta aaggcttgcc attgttcctg catggctaag tgcctgggtt 540catcctaatt gagctgaaca ctagtcactg ggttccatgg ttctcttctg tgacccacag 600cttctaatag agctataaca ctcaccgcat ggcccaaggt tccattcctt gaatccataa 660ggccaagaac cccaggtcag agaacacgag gcttgccacc atcttgggag c 711222055DNAHomo sapiensCDS(1)..(2055) 22ccc aag aca gcc aac tta gtt gca gac atc acc tcc tta gcc aaa tat 48Pro Lys Thr Ala Asn Leu Val Ala Asp Ile Thr Ser Leu Ala Lys Tyr1 5 10 15caa caa gtt ctt aaa aca tta caa gga acc tat ccc tga gaa gag gga 96Gln Gln Val Leu Lys Thr Leu Gln Gly Thr Tyr Pro Glu Glu Gly 20 25 30aaa gaa cta ttc cac cct tgt gac atg gta tta gtc aag tcc ctt ccc 144Lys Glu Leu Phe His Pro Cys Asp Met Val Leu Val Lys Ser Leu Pro 35 40 45tct aat tcc cca tcc cta gat aca tcc tgg gaa gga ccc tac cca gtc 192Ser Asn Ser Pro Ser Leu Asp Thr Ser Trp Glu Gly Pro Tyr Pro Val 50 55 60att tta tct acc cca act gcg gtt aaa gtg gct gga gtg gag tct tgg 240Ile Leu Ser Thr Pro Thr Ala Val Lys Val Ala Gly Val Glu Ser Trp 65 70 75ata cat cac act tga gtc aaa tcc tgg ata ctg cca aag gaa cct gaa 288Ile His His Thr Val Lys Ser Trp Ile Leu Pro Lys Glu Pro Glu80 85 90aat cca gga gac aac gct agc tat tcc tgt gaa cct cta gag gat ttg 336Asn Pro Gly Asp Asn Ala Ser Tyr Ser Cys Glu Pro Leu Glu Asp Leu95 100 105 110cgc ctg ctc ttc aaa caa caa cca gga gga aag taa cta aaa tca taa 384Arg Leu Leu Phe Lys Gln Gln Pro Gly Gly Lys Leu Lys Ser 115 120atc ccc atg gcc ctc cct tat cat att ttt ctc ttt act gtt ctt tta 432Ile Pro Met Ala Leu Pro Tyr His Ile Phe Leu Phe Thr Val Leu Leu125 130 135 140ccc tct ttc act ctc act gca ccc cct cca tgc cgc tgt atg acc agt 480Pro Ser Phe Thr Leu Thr Ala Pro Pro Pro Cys Arg Cys Met Thr Ser 145 150 155agc tcc cct tac caa gag ttt cta tgg aga atg cag cgt ccc gga aat 528Ser Ser Pro Tyr Gln Glu Phe Leu Trp Arg Met Gln Arg Pro Gly Asn 160 165 170att gat gcc cca tcg tat agg agt ctt tct aag gga acc ccc acc ttc 
576Ile Asp Ala Pro Ser Tyr Arg Ser Leu Ser Lys Gly Thr Pro Thr Phe 175 180 185act gcc cac acc cat atg ccc cgc aac tgc tat cac tct gcc act ctt 624Thr Ala His Thr His Met Pro Arg Asn Cys Tyr His Ser Ala Thr Leu 190 195 200tgc atg cat gca aat act cat tat tgg aca gga aaa atg att aat cct 672Cys Met His Ala Asn Thr His Tyr Trp Thr Gly Lys Met Ile Asn Pro205 210 215 220agt tgt cct gga gga ctt gga gtc act gtc tgt tgg act tac ttc acc 720Ser Cys Pro Gly Gly Leu Gly Val Thr Val Cys Trp Thr Tyr Phe Thr 225 230 235caa act ggt atg tct gat ggg ggt gga gtt caa gat cag gca aga gaa 768Gln Thr Gly Met Ser Asp Gly Gly Gly Val Gln Asp Gln Ala Arg Glu 240 245 250aaa cat gta aaa gaa gta atc tcc caa ctc acc cgg gta cat ggc acc 816Lys His Val Lys Glu Val Ile Ser Gln Leu Thr Arg Val His Gly Thr 255 260 265tct agc ccc tac aaa gga cta gat ctc tca aaa cta cat gaa acc ctc 864Ser Ser Pro Tyr Lys Gly Leu Asp Leu Ser Lys Leu His Glu Thr Leu 270 275 280cgt acc cat act cgc ctg gta agc cta ttt aat acc acc ctc act ggg 912Arg Thr His Thr Arg Leu Val Ser Leu Phe Asn Thr Thr Leu Thr Gly285 290 295 300ctc cat gag gtc tcg gcc caa aac cct act aac tgt tgg ata tgc ctc 960Leu His Glu Val Ser Ala Gln Asn Pro Thr Asn Cys Trp Ile Cys Leu 305 310 315ccc ctg aac ttc agg cca tat gtt tca atc cct gta cct gaa caa tgg 1008Pro Leu Asn Phe Arg Pro Tyr Val Ser Ile Pro Val Pro Glu Gln Trp 320 325 330aac aac ttc agc aca gaa ata aac acc act tcc gtt tta gta gga cct 1056Asn Asn Phe Ser Thr Glu Ile Asn Thr Thr Ser Val Leu Val Gly Pro 335 340 345ctt gtt tcc aat ctg gaa ata acc cat acc tca aac ctc acc tgt gta 1104Leu Val Ser Asn Leu Glu Ile Thr His Thr Ser Asn Leu Thr Cys Val 350 355 360 aaa ttt agc aat act aca tac aca acc aac tcc caa tgc atc agg tgg 1152Lys Phe Ser Asn Thr Thr Tyr Thr Thr Asn Ser Gln Cys Ile Arg Trp365 370 375 380gta act cct ccc aca caa ata gtc tgc cta ccc tca gga ata ttt ttt 1200Val Thr Pro Pro Thr Gln Ile Val Cys Leu Pro Ser Gly Ile Phe Phe 385 390 395gtc tgt ggt acc tca gcc tat cgt tgt ttg aat ggc tct tca gaa 
tct 1248Val Cys Gly Thr Ser Ala Tyr Arg Cys Leu Asn Gly Ser Ser Glu Ser 400 405 410atg tgc ttc ctc tca ttc tta gtg ccc cct atg acc atc tac act gaa 1296Met Cys Phe Leu Ser Phe Leu Val Pro Pro Met Thr Ile Tyr Thr Glu 415 420 425caa gat tta tac agt tat gtc ata tct aag ccc cgc aac aaa aga gta 1344Gln Asp Leu Tyr Ser Tyr Val Ile Ser Lys Pro Arg Asn Lys Arg Val 430 435 440ccc att ctt cct ttt gtt ata gga gca gga gtg cta ggt gca cta ggt 1392Pro Ile Leu Pro Phe Val Ile Gly Ala Gly Val Leu Gly Ala Leu Gly445 450 455 460act ggc att ggc ggt atc aca acc tct act cag ttc tac tac aaa cta 1440Thr Gly Ile Gly Gly Ile Thr Thr Ser Thr Gln Phe Tyr Tyr Lys Leu 465 470 475tct caa gaa cta aat ggg gac atg gaa cgg gtc gcc gac tcc ctg gtc 1488Ser Gln Glu Leu Asn Gly Asp Met Glu Arg Val Ala Asp Ser Leu Val 480 485 490acc ttg caa gat caa ctt aac tcc cta gca gca gta gtc ctt caa aat 1536Thr Leu Gln Asp Gln Leu Asn Ser Leu Ala Ala Val Val Leu Gln Asn 495 500 505cga aga gct tta gac ttg cta acc gct gaa aga ggg gga acc tgt tta 1584Arg Arg Ala Leu Asp Leu Leu Thr Ala Glu Arg Gly Gly Thr Cys Leu 510 515 520ttt tta ggg gaa gaa tgc tgt tat tat gtt aat caa tcc gga atc gtc 1632Phe Leu Gly Glu Glu Cys Cys Tyr Tyr Val Asn Gln Ser Gly Ile Val525 530 535 540act gag aaa gtt aaa gaa att cga gat cga ata caa cgt aga gca gag 1680Thr Glu Lys Val Lys Glu Ile Arg Asp Arg Ile Gln Arg Arg Ala Glu 545 550 555gag ctt cga aac act gga ccc tgg ggc ctc ctc agc caa tgg atg ccc 1728Glu Leu Arg Asn Thr Gly Pro Trp Gly Leu Leu Ser Gln Trp Met Pro 560 565 570tgg att ctc ccc ttc tta gga cct cta gca gct ata ata ttg cta ctc 1776Trp Ile Leu Pro Phe Leu Gly Pro Leu Ala Ala Ile Ile Leu Leu Leu 575 580 585ctc ttt gga ccc tgt atc ttt aac ctc ctt gtt aac ttt gtc tct tcc 1824Leu Phe Gly Pro Cys Ile Phe Asn Leu Leu Val Asn Phe Val Ser Ser 590 595 600aga atc gaa gct gta aaa cta caa atg gag ccc aag atg cag tcc aag 1872Arg Ile Glu Ala Val Lys Leu Gln Met Glu Pro Lys Met Gln Ser Lys605 610 615 620act aag atc tac cgc aga ccc ctg gac cgg cct gct 
agc cca cga tct 1920Thr Lys Ile Tyr Arg Arg Pro Leu Asp Arg Pro Ala Ser Pro Arg Ser 625 630 635gat gtt aat gac atc aaa ggc acc cct cct gag gaa atc tca gct gca 1968Asp Val Asn Asp Ile Lys Gly Thr Pro Pro Glu Glu Ile Ser Ala Ala 640 645 650caa cct cta cta cgc ccc aat tca gca gga agc agt tag agc ggt cgt 2016Gln Pro Leu Leu Arg Pro Asn Ser Ala Gly Ser Ser Ser Gly Arg 655 660 665cgg cca acc tcc cca aca gca ctt agg ttt tcc tgt tga 2055Arg Pro Thr Ser Pro Thr Ala Leu Arg Phe Ser Cys 670 6752328PRTHomo sapiens 23Pro Lys Thr Ala Asn Leu Val Ala Asp Ile Thr Ser Leu Ala Lys Tyr1 5 10 15Gln Gln Val Leu Lys Thr Leu Gln Gly Thr Tyr Pro 20 252455PRTHomo sapiens 24Glu Glu Gly Lys Glu Leu Phe His Pro Cys Asp Met Val Leu Val Lys1 5 10 15Ser Leu Pro Ser Asn Ser Pro Ser Leu Asp Thr Ser Trp Glu Gly Pro 20 25 30Tyr Pro Val Ile Leu Ser Thr Pro Thr Ala Val Lys Val Ala Gly Val 35 40 45Glu Ser Trp Ile His His Thr 50 552538PRTHomo sapiens 25Val Lys Ser Trp Ile Leu Pro Lys Glu Pro Glu Asn Pro Gly Asp Asn1 5 10 15Ala Ser Tyr Ser Cys Glu Pro Leu Glu Asp Leu Arg Leu Leu Phe Lys 20 25 30Gln Gln Pro Gly Gly Lys 3526540PRTHomo sapiens 26Ile Pro Met Ala Leu Pro Tyr His Ile Phe Leu Phe Thr Val Leu Leu1 5 10 15Pro Ser Phe Thr Leu Thr Ala Pro Pro Pro Cys Arg Cys Met Thr Ser
20 25 30Ser Ser Pro Tyr Gln Glu Phe Leu Trp Arg Met Gln Arg Pro Gly Asn 35 40 45Ile Asp Ala Pro Ser Tyr Arg Ser Leu Ser Lys Gly Thr Pro Thr Phe 50 55 60Thr Ala His Thr His Met Pro Arg Asn Cys Tyr His Ser Ala Thr Leu65 70 75 80Cys Met His Ala Asn Thr His Tyr Trp Thr Gly Lys Met Ile Asn Pro 85 90 95Ser Cys Pro Gly Gly Leu Gly Val Thr Val Cys Trp Thr Tyr Phe Thr 100 105 110Gln Thr Gly Met Ser Asp Gly Gly Gly Val Gln Asp Gln Ala Arg Glu 115 120 125Lys His Val Lys Glu Val Ile Ser Gln Leu Thr Arg Val His Gly Thr 130 135 140Ser Ser Pro Tyr Lys Gly Leu Asp Leu Ser Lys Leu His Glu Thr Leu145 150 155 160Arg Thr His Thr Arg Leu Val Ser Leu Phe Asn Thr Thr Leu Thr Gly 165 170 175Leu His Glu Val Ser Ala Gln Asn Pro Thr Asn Cys Trp Ile Cys Leu 180 185 190Pro Leu Asn Phe Arg Pro Tyr Val Ser Ile Pro Val Pro Glu Gln Trp 195 200 205Asn Asn Phe Ser Thr Glu Ile Asn Thr Thr Ser Val Leu Val Gly Pro 210 215 220Leu Val Ser Asn Leu Glu Ile Thr His Thr Ser Asn Leu Thr Cys Val225 230 235 240Lys Phe Ser Asn Thr Thr Tyr Thr Thr Asn Ser Gln Cys Ile Arg Trp 245 250 255Val Thr Pro Pro Thr Gln Ile Val Cys Leu Pro Ser Gly Ile Phe Phe 260 265 270Val Cys Gly Thr Ser Ala Tyr Arg Cys Leu Asn Gly Ser Ser Glu Ser 275 280 285Met Cys Phe Leu Ser Phe Leu Val Pro Pro Met Thr Ile Tyr Thr Glu 290 295 300Gln Asp Leu Tyr Ser Tyr Val Ile Ser Lys Pro Arg Asn Lys Arg Val305 310 315 320Pro Ile Leu Pro Phe Val Ile Gly Ala Gly Val Leu Gly Ala Leu Gly 325 330 335Thr Gly Ile Gly Gly Ile Thr Thr Ser Thr Gln Phe Tyr Tyr Lys Leu 340 345 350Ser Gln Glu Leu Asn Gly Asp Met Glu Arg Val Ala Asp Ser Leu Val 355 360 365Thr Leu Gln Asp Gln Leu Asn Ser Leu Ala Ala Val Val Leu Gln Asn 370 375 380Arg Arg Ala Leu Asp Leu Leu Thr Ala Glu Arg Gly Gly Thr Cys Leu385 390 395 400Phe Leu Gly Glu Glu Cys Cys Tyr Tyr Val Asn Gln Ser Gly Ile Val 405 410 415Thr Glu Lys Val Lys Glu Ile Arg Asp Arg Ile Gln Arg Arg Ala Glu 420 425 430Glu Leu Arg Asn Thr Gly Pro Trp Gly Leu Leu Ser Gln Trp Met Pro 435 440 445Trp Ile Leu Pro Phe Leu Gly Pro Leu Ala 
Ala Ile Ile Leu Leu Leu 450 455 460Leu Phe Gly Pro Cys Ile Phe Asn Leu Leu Val Asn Phe Val Ser Ser465 470 475 480Arg Ile Glu Ala Val Lys Leu Gln Met Glu Pro Lys Met Gln Ser Lys 485 490 495Thr Lys Ile Tyr Arg Arg Pro Leu Asp Arg Pro Ala Ser Pro Arg Ser 500 505 510Asp Val Asn Asp Ile Lys Gly Thr Pro Pro Glu Glu Ile Ser Ala Ala 515 520 525Gln Pro Leu Leu Arg Pro Asn Ser Ala Gly Ser Ser 530 535 5402715PRTHomo sapiens 27Ser Gly Arg Arg Pro Thr Ser Pro Thr Ala Leu Arg Phe Ser Cys1 5 10 15281080DNAHomo sapiensCDS(1)..(1080) 28acc tct ttt gta gaa aag gca aat gga gtg aag tgc cat aag tac aaa 48Thr Ser Phe Val Glu Lys Ala Asn Gly Val Lys Cys His Lys Tyr Lys1 5 10 15ctt tct ttt cat taa gag aca act cac aat tat gta aaa agt gtg att 96Leu Ser Phe His Glu Thr Thr His Asn Tyr Val Lys Ser Val Ile 20 25 30tat gcc cta cag gaa gcc ttc aga gtc tac ctc cct atc cca gca tcc 144Tyr Ala Leu Gln Glu Ala Phe Arg Val Tyr Leu Pro Ile Pro Ala Ser 35 40 45ccg act cct tcc cca act aat aag gac ccc cct tca acc caa atg gtc 192Pro Thr Pro Ser Pro Thr Asn Lys Asp Pro Pro Ser Thr Gln Met Val 50 55 60caa aag gag ata gac aaa agg gta aac agt gaa cca aag agt gcc aat 240Gln Lys Glu Ile Asp Lys Arg Val Asn Ser Glu Pro Lys Ser Ala Asn 65 70 75att ccc caa tta tga ccc ctc caa gca gtg gga gga aga gaa ttc ggc 288Ile Pro Gln Leu Pro Leu Gln Ala Val Gly Gly Arg Glu Phe Gly80 85 90cca gcc aga gtg cat gtg cct ttt tct ctc cca gac tta aag caa ata 336Pro Ala Arg Val His Val Pro Phe Ser Leu Pro Asp Leu Lys Gln Ile95 100 105 110aaa aca gac tta ggt aaa ttc tca gat aac cct gat ggc tat att gat 384Lys Thr Asp Leu Gly Lys Phe Ser Asp Asn Pro Asp Gly Tyr Ile Asp 115 120 125gtt tta caa ggg tta gga caa ttc ttt gat ctg aca tgg aga gat ata 432Val Leu Gln Gly Leu Gly Gln Phe Phe Asp Leu Thr Trp Arg Asp Ile 130 135 140atg tca ctg cta aat cag aca cta acc cca aat gag aga agt gcc acc 480Met Ser Leu Leu Asn Gln Thr Leu Thr Pro Asn Glu Arg Ser Ala Thr 145 150 155ata act gca gcc tga gag ttt ggc gat ctc tgg tat ctc agt cag gtc 528Ile Thr Ala 
Ala Glu Phe Gly Asp Leu Trp Tyr Leu Ser Gln Val 160 165 170aat gat agg atg aca aca gag gaa aga gaa tga ttc ccc aca ggc cag 576Asn Asp Arg Met Thr Thr Glu Glu Arg Glu Phe Pro Thr Gly Gln 175 180 185cag gca gtt ccc agt cta gac cct cat tgg gac aca gaa tca gaa cat 624Gln Ala Val Pro Ser Leu Asp Pro His Trp Asp Thr Glu Ser Glu His 190 195 200gga gat tgg tgc tgc aga cat ttg cta act tgt gtg cta gaa gga cta 672Gly Asp Trp Cys Cys Arg His Leu Leu Thr Cys Val Leu Glu Gly Leu205 210 215 220agg aaa act agg aag aag tct atg aat tac tca atg atg tcc acc ata 720Arg Lys Thr Arg Lys Lys Ser Met Asn Tyr Ser Met Met Ser Thr Ile 225 230 235aca cag gga agg gaa gaa aat cct act gcc ttt ctg gag aga cta agg 768Thr Gln Gly Arg Glu Glu Asn Pro Thr Ala Phe Leu Glu Arg Leu Arg 240 245 250gag gca ttg agg aag cgt gcc tct ctg tca cct gac tct tct gaa ggc 816Glu Ala Leu Arg Lys Arg Ala Ser Leu Ser Pro Asp Ser Ser Glu Gly 255 260 265caa cta atc tta aag cgt aag ttt atc act cag tca gct gca gac att 864Gln Leu Ile Leu Lys Arg Lys Phe Ile Thr Gln Ser Ala Ala Asp Ile 270 275 280aga aaa aaa ctt caa aag tct gcc gta ggc ccg gag caa aac tta gaa 912Arg Lys Lys Leu Gln Lys Ser Ala Val Gly Pro Glu Gln Asn Leu Glu285 290 295 300acc cta ttg aac ttg gca acc tcg gtt ttt tat aat aga gat cag gag 960Thr Leu Leu Asn Leu Ala Thr Ser Val Phe Tyr Asn Arg Asp Gln Glu 305 310 315gag cag gcg gaa cag gac aaa cgg gat taa aaa aaa ggc cac cgc ttt 1008Glu Gln Ala Glu Gln Asp Lys Arg Asp Lys Lys Gly His Arg Phe 320 325 330agt cat gac cct cag gca agt gga ctt tgg agg ctc tgg aaa agg gaa 1056Ser His Asp Pro Gln Ala Ser Gly Leu Trp Arg Leu Trp Lys Arg Glu 335 340 345aag ctg ggc aaa ttg aat gcc taa 1080Lys Leu Gly Lys Leu Asn Ala 350 2920PRTHomo sapiens 29Thr Ser Phe Val Glu Lys Ala Asn Gly Val Lys Cys His Lys Tyr Lys1 5 10 15Leu Ser Phe His 203063PRTHomo sapiens 30Glu Thr Thr His Asn Tyr Val Lys Ser Val Ile Tyr Ala Leu Gln Glu1 5 10 15Ala Phe Arg Val Tyr Leu Pro Ile Pro Ala Ser Pro Thr Pro Ser Pro 20 25 30Thr Asn Lys Asp Pro Pro Ser Thr 
Gln Met Val Gln Lys Glu Ile Asp 35 40 45Lys Arg Val Asn Ser Glu Pro Lys Ser Ala Asn Ile Pro Gln Leu 50 55 603179PRTHomo sapiens 31Pro Leu Gln Ala Val Gly Gly Arg Glu Phe Gly Pro Ala Arg Val His1 5 10 15Val Pro Phe Ser Leu Pro Asp Leu Lys Gln Ile Lys Thr Asp Leu Gly 20 25 30Lys Phe Ser Asp Asn Pro Asp Gly Tyr Ile Asp Val Leu Gln Gly Leu 35 40 45Gly Gln Phe Phe Asp Leu Thr Trp Arg Asp Ile Met Ser Leu Leu Asn 50 55 60Gln Thr Leu Thr Pro Asn Glu Arg Ser Ala Thr Ile Thr Ala Ala65 70 753221PRTHomo sapiens 32Glu Phe Gly Asp Leu Trp Tyr Leu Ser Gln Val Asn Asp Arg Met Thr1 5 10 15Thr Glu Glu Arg Glu 2033142PRTHomo sapiens 33Phe Pro Thr Gly Gln Gln Ala Val Pro Ser Leu Asp Pro His Trp Asp1 5 10 15Thr Glu Ser Glu His Gly Asp Trp Cys Cys Arg His Leu Leu Thr Cys 20 25 30 Val Leu Glu Gly Leu Arg Lys Thr Arg Lys Lys Ser Met Asn Tyr Ser 35 40 45Met Met Ser Thr Ile Thr Gln Gly Arg Glu Glu Asn Pro Thr Ala Phe 50 55 60Leu Glu Arg Leu Arg Glu Ala Leu Arg Lys Arg Ala Ser Leu Ser Pro65 70 75 80Asp Ser Ser Glu Gly Gln Leu Ile Leu Lys Arg Lys Phe Ile Thr Gln 85 90 95Ser Ala Ala Asp Ile Arg Lys Lys Leu Gln Lys Ser Ala Val Gly Pro 100 105 110Glu Gln Asn Leu Glu Thr Leu Leu Asn Leu Ala Thr Ser Val Phe Tyr 115 120 125Asn Arg Asp Gln Glu Glu Gln Ala Glu Gln Asp Lys Arg Asp 130 135 1403429PRTHomo sapiens 34Lys Lys Gly His Arg Phe Ser His Asp Pro Gln Ala Ser Gly Leu Trp1 5 10 15Arg Leu Trp Lys Arg Glu Lys Leu Gly Lys Leu Asn Ala 20 2535685PRTHomo sapiensmisc_feature(29)..(29)Xaa is any amino acid 35Pro Lys Thr Ala Asn Leu Val Ala Asp Ile Thr Ser Leu Ala Lys Tyr1 5 10 15Gln Gln Val Leu Lys Thr Leu Gln Gly Thr Tyr Pro Xaa Glu Glu Gly 20 25 30Lys Glu Leu Phe His Pro Cys Asp Met Val Leu Val Lys Ser Leu Pro 35 40 45Ser Asn Ser Pro Ser Leu Asp Thr Ser Trp Glu Gly Pro Tyr Pro Val 50 55 60Ile Leu Ser Thr Pro Thr Ala Val Lys Val Ala Gly Val Glu Ser Trp65 70 75 80Ile His His Thr Xaa Val Lys Ser Trp Ile Leu Pro Lys Glu Pro Glu 85 90 95Asn Pro Gly Asp Asn Ala Ser Tyr Ser Cys Glu Pro Leu Glu Asp Leu 100 105 
110Arg Leu Leu Phe Lys Gln Gln Pro Gly Gly Lys Xaa Leu Lys Ser Xaa 115 120 125Ile Pro Met Ala Leu Pro Tyr His Ile Phe Leu Phe Thr Val Leu Leu 130 135 140Pro Ser Phe Thr Leu Thr Ala Pro Pro Pro Cys Arg Cys Met Thr Ser145 150 155 160Ser Ser Pro Tyr Gln Glu Phe Leu Trp Arg Met Gln Arg Pro Gly Asn 165 170 175Ile Asp Ala Pro Ser Tyr Arg Ser Leu Ser Lys Gly Thr Pro Thr Phe 180 185 190Thr Ala His Thr His Met Pro Arg Asn Cys Tyr His Ser Ala Thr Leu 195 200 205Cys Met His Ala Asn Thr His Tyr Trp Thr Gly Lys Met Ile Asn Pro 210 215 220Ser Cys Pro Gly Gly Leu Gly Val Thr Val Cys Trp Thr Tyr Phe Thr225 230 235 240Gln Thr Gly Met Ser Asp Gly Gly Gly Val Gln Asp Gln Ala Arg Glu 245 250 255Lys His Val Lys Glu Val Ile Ser Gln Leu Thr Arg Val His Gly Thr 260 265 270Ser Ser Pro Tyr Lys Gly Leu Asp Leu Ser Lys Leu His Glu Thr Leu 275 280 285Arg Thr His Thr Arg Leu Val Ser Leu Phe Asn Thr Thr Leu Thr Gly 290 295 300Leu His Glu Val Ser Ala Gln Asn Pro Thr Asn Cys Trp Ile Cys Leu305 310 315 320Pro Leu Asn Phe Arg Pro Tyr Val Ser Ile Pro Val Pro Glu Gln Trp 325 330 335Asn Asn Phe Ser Thr Glu Ile Asn Thr Thr Ser Val Leu Val Gly Pro 340 345 350Leu Val Ser Asn Leu Glu Ile Thr His Thr Ser Asn Leu Thr Cys Val 355 360 365Lys Phe Ser Asn Thr Thr Tyr Thr Thr Asn Ser Gln Cys Ile Arg Trp 370 375 380Val Thr Pro Pro Thr Gln Ile Val Cys Leu Pro Ser Gly Ile Phe Phe385 390 395 400Val Cys Gly Thr Ser Ala Tyr Arg Cys Leu Asn Gly Ser Ser Glu Ser 405 410 415Met Cys Phe Leu Ser Phe Leu Val Pro Pro Met Thr Ile Tyr Thr Glu 420 425 430Gln Asp Leu Tyr Ser Tyr Val Ile Ser Lys Pro Arg Asn Lys Arg Val 435 440 445Pro Ile Leu Pro Phe Val Ile Gly Ala Gly Val Leu Gly Ala Leu Gly 450 455 460Thr Gly Ile Gly Gly Ile Thr Thr Ser Thr Gln Phe Tyr Tyr Lys Leu465 470 475 480Ser Gln Glu Leu Asn Gly Asp Met Glu Arg Val Ala Asp Ser Leu Val 485 490 495Thr Leu Gln Asp Gln Leu Asn Ser Leu Ala Ala Val Val Leu Gln Asn 500 505 510Arg Arg Ala Leu Asp Leu Leu Thr Ala Glu Arg Gly Gly Thr Cys Leu 515 520 525Phe Leu Gly Glu Glu Cys Cys Tyr 
Tyr Val Asn Gln Ser Gly Ile Val 530 535 540Thr Glu Lys Val Lys Glu Ile Arg Asp Arg Ile Gln Arg Arg Ala Glu545 550 555 560Glu Leu Arg Asn Thr Gly Pro Trp Gly Leu Leu Ser Gln Trp Met Pro 565 570 575Trp Ile Leu Pro Phe Leu Gly Pro Leu Ala Ala Ile Ile Leu Leu Leu 580 585 590Leu Phe Gly Pro Cys Ile Phe Asn Leu Leu Val Asn Phe Val Ser Ser 595 600 605Arg Ile Glu Ala Val Lys Leu Gln Met Glu Pro Lys Met Gln Ser Lys 610 615 620Thr Lys Ile Tyr Arg Arg Pro Leu Asp Arg Pro Ala Ser Pro Arg Ser625 630 635 640Asp Val Asn Asp Ile Lys Gly Thr Pro Pro Glu Glu Ile Ser Ala Ala 645 650 655Gln Pro Leu Leu Arg Pro Asn Ser Ala Gly Ser Ser Xaa Ser Gly Arg 660 665 670Arg Pro Thr Ser Pro Thr Ala Leu Arg Phe Ser Cys Xaa 675 680 68536360PRTHomo sapiensmisc_feature(21)..(21)Xaa is any amino acid 36Thr Ser Phe Val Glu Lys Ala Asn Gly Val Lys Cys His Lys Tyr Lys1 5 10 15Leu Ser Phe His Xaa Glu Thr Thr His Asn Tyr Val Lys Ser Val Ile 20 25 30Tyr Ala Leu Gln Glu Ala Phe Arg Val Tyr Leu Pro Ile Pro Ala Ser 35 40 45Pro Thr Pro Ser Pro Thr Asn Lys Asp Pro Pro Ser Thr Gln Met Val 50 55 60Gln Lys Glu Ile Asp Lys Arg Val Asn Ser Glu Pro Lys Ser Ala Asn65 70 75 80Ile Pro Gln Leu Xaa Pro Leu Gln Ala Val Gly Gly Arg Glu Phe Gly 85 90 95Pro Ala Arg Val His Val Pro Phe Ser Leu Pro Asp Leu Lys Gln Ile 100 105 110Lys Thr Asp Leu Gly Lys Phe Ser Asp Asn Pro Asp Gly Tyr Ile Asp 115 120 125Val Leu Gln Gly Leu Gly Gln Phe Phe Asp Leu Thr Trp Arg Asp Ile 130 135 140Met Ser Leu Leu Asn Gln Thr Leu Thr Pro Asn Glu Arg Ser Ala Thr145 150 155 160Ile Thr Ala Ala Xaa Glu Phe Gly Asp Leu Trp Tyr Leu Ser Gln Val 165 170 175Asn Asp Arg Met Thr Thr Glu Glu Arg Glu Xaa Phe Pro Thr Gly Gln 180 185 190Gln Ala Val Pro Ser Leu Asp Pro His Trp Asp Thr Glu Ser Glu His 195 200 205Gly Asp Trp Cys Cys Arg His Leu Leu Thr Cys Val Leu Glu Gly Leu 210 215 220Arg Lys Thr Arg Lys Lys Ser Met Asn Tyr Ser Met Met Ser Thr Ile225 230 235 240Thr Gln Gly Arg Glu Glu Asn Pro Thr Ala Phe Leu Glu Arg Leu Arg 245 250 255Glu Ala Leu Arg Lys Arg Ala Ser 
Leu Ser Pro Asp Ser Ser Glu Gly 260 265 270Gln Leu Ile Leu Lys Arg Lys Phe Ile Thr Gln Ser Ala Ala Asp Ile 275
280 285Arg Lys Lys Leu Gln Lys Ser Ala Val Gly Pro Glu Gln Asn Leu Glu 290 295 300Thr Leu Leu Asn Leu Ala Thr Ser Val Phe Tyr Asn Arg Asp Gln Glu305 310 315 320Glu Gln Ala Glu Gln Asp Lys Arg Asp Xaa Lys Lys Gly His Arg Phe 325 330 335Ser His Asp Pro Gln Ala Ser Gly Leu Trp Arg Leu Trp Lys Arg Glu 340 345 350Lys Leu Gly Lys Leu Asn Ala Xaa 355 3603726DNAHomo sapiens 37ggaccataga ggacactcca ggacta 263825DNAHomo sapiens 38cctcagtcct gctgctggat catct 253927DNAHomo sapiens 39cctccaagca gtgggaggaa gagaatt 274028DNAHomo sapiens 40ccttccctgt gttattgtgg acatcatt 284130DNAHomo sapiens 41ggaagaagtc tatgaattat tcaatgatgt 304227DNAHomo sapiens 42gggacacaga atcagaacat ggagatt 274327DNAHomo sapiens 43gccttcagaa gagtcaggtg acagaga 274425DNAHomo sapiens 44gagcctccaa agtccacttg cctga 254529DNAHomo sapiens 45gatttcagta tctactagtc tgggtagat 294627DNAHomo sapiens 46ctaggaaatc cagctagtcc tgtctca 274728DNAHomo sapiens 47ccaagacagc caacttagtt gcagacat 284828DNAHomo sapiens 48ggacgctgca ttctccatag aaactctt 284929DNAHomo sapiens 49gcaatactac atacacaacc aactcccaa 295026DNAHomo sapiens 50gggggaggca tatccaacag ttagta 265130DNAHomo sapiens 51ccatctacac tgaacaagat ttatacactt 305228DNAHomo sapiens 52aatgccagta cctagtgcac ctagcact 285331DNAHomo sapiens 53cgaatacaac gtagagcaga ggagcttcga a 315428DNAHomo sapiens 54agcccaagat gcagtccaag actaagat 285527DNAHomo sapiens 55gcgtagtaga ggttgtgcag ctgagat 275627DNAHomo sapiens 56cccttaccaa gagtttctat ggagaat 275727DNAHomo sapiens 57accgctctaa ctgcttcctg ctgaatt 2758420PRTHomo sapiensmisc_feature(21)..(21)Xaa is any amino acid 58Thr Ser Phe Val Glu Lys Ala Asn Gly Val Lys Cys His Lys Tyr Lys1 5 10 15Leu Ser Phe His Xaa Glu Thr Thr His Asn Tyr Val Lys Ser Val Ile 20 25 30Tyr Ala Leu Gln Glu Ala Phe Arg Val Tyr Leu Pro Ile Leu Pro Ala 35 40 45Ser Pro Thr Pro Ser Pro Thr Asn Lys Asp Pro Pro Ser Thr Gln Met 50 55 60Val Gln Lys Glu Ile Asp Lys Arg Val Asn Ser Glu Pro Lys Ser Ala65 70 75 80Asn Ile Pro Gln Leu Xaa Pro Leu Gln Ala Val Gly Gly Arg Glu Phe 85 90 
95Gly Pro Ala Arg Val His Val Pro Phe Ser Leu Pro Asp Leu Lys Gln 100 105 110Ile Lys Thr Asp Leu Gly Lys Phe Ser Asp Asn Pro Asp Gly Tyr Ile 115 120 125Asp Val Leu Gln Gly Leu Gly Gln Phe Phe Asp Leu Thr Trp Arg Asp 130 135 140Ile Met Ser Leu Leu Asn Gln Thr Leu Thr Pro Asn Glu Arg Ser Ala145 150 155 160Thr Ile Thr Ala Ala Xaa Glu Phe Gly Asp Leu Trp Tyr Leu Ser Gln 165 170 175Val Asn Asp Arg Met Thr Thr Glu Glu Arg Glu Xaa Phe Pro Thr Gly 180 185 190Gln Gln Ala Val Pro Ser Leu Asp Pro His Trp Asp Thr Glu Ser Glu 195 200 205His Gly Asp Trp Cys Cys Arg His Leu Leu Thr Cys Val Leu Glu Gly 210 215 220Leu Arg Lys Thr Arg Lys Lys Ser Met Asn Tyr Ser Met Met Ser Thr225 230 235 240Ile Thr Gln Gly Arg Glu Glu Asn Pro Thr Ala Phe Leu Glu Arg Leu 245 250 255Arg Glu Ala Leu Arg Lys Arg Ala Ser Leu Ser Pro Asp Ser Ser Glu 260 265 270Gly Gln Leu Ile Leu Lys Arg Lys Phe Ile Thr Gln Ser Ala Ala Asp 275 280 285Ile Arg Lys Lys Leu Gln Lys Ser Ala Val Gly Pro Glu Gln Asn Leu 290 295 300Glu Thr Leu Leu Asn Leu Ala Thr Ser Val Phe Tyr Asn Arg Asp Gln305 310 315 320Glu Glu Gln Ala Glu Gln Asp Lys Arg Asp Xaa Lys Lys Gly His Arg 325 330 335Phe Ser His Asp Pro Gln Ala Ser Gly Leu Trp Arg Leu Trp Lys Arg 340 345 350Glu Lys Leu Gly Lys Leu Asn Ala Xaa Xaa Gly Leu Leu Pro Val Arg 355 360 365Ser Thr Arg Thr Leu Xaa Lys Arg Leu Ser Lys Xaa Lys Xaa Ala Ala 370 375 380Pro Ser Ser Met Pro Leu Ile Ser Arg Glu Ser Leu Glu Gly Pro Leu385 390 395 400Pro Gln Gly Thr Lys Val Leu Xaa Val Arg Ser His Xaa Pro Asp Ser 405 410 415Ser Ser Arg Thr 4205932DNAHomo sapiens 59taaactacaa atggttcttc aaatggagcc ca 326032DNAHomo sapiens 60gatgcagtcc aagatgcagt ccatgactaa ga 32611740DNAHomo sapiens 61aggttggctg acaaccgctc ttaactgctt catgctgaat tggggcatag taggggtcgt 60gcagttgaga tttccttggg aggggtgcct tcaatgtcat caacattgga gcatgggcta 120gcaggccagt ccaggggtcc gcggtagatc ttagtcatgg actgcatctg gggctccatt 180tgaagaacca tttgtagttt tacagcttcg attctggaag agacaaacgt aacaaggagg 240ttaaagatac aaggattgaa atgtacggcc tgaagtgcag 
gggcatatga gtgtgggcgg 300tgcaagtggg gtttccttta gaaaaactcc gatacaatag ggcatcaata tttctaggaa 360gccacattct ccatagaagc tctcggtaag gggagctact ggtagtacag cagcatacag 420ggggtgcagt gagagtgaaa gggggtaaga gaacagtaaa aagaaaaata tgacaaggga 480gggccaagag gatctacgat tctagttact ttcctcacgg ttgtcgcctg aagagcaggc 540gcagatcctc tagaggttca caggaatagc tagcattgtc tgctggattt tcgggttcct 600ttggcagtat ccagggtttg gctcgagtgt gacttatcca agactccact ccagccactt 660aactgcggtt agggtagata aaatgactgg gtagggtcct tcccaggatg tgtgtaggga 720tggggaatta aaggggaagg gacttgacta ataccatgtc accagggtgg aataattcct 780ttccctcctc tcagggacag gttccctgta atgttttaag aactcgttga tatttggcta 840aggaggtgat gtctgcaact aagttggccg tctctcagtc aagcacaagg tcattggtta 900ggaagggctg tccatacagc atctcatatg gactaagtcc tgctttttgg ggacagtttc 960ggattcttag taaggctata ggcaacagag caggccatgc aaggtgggtt tcttgggtta 1020gcttttttag atgtcgtttg agtgtttcat tcattttctc aacttttcct gaggatcgtg 1080gcctccaggc acagtgtaag tgatattgta tacctaacgc ctgggatact ccctgcgtta 1140ctgcagcctt gaaattgggg ccattgtcac tctgtaaacc tcagggaagt ccgaatctgg 1200gaattatttc atgaattagt acttttatta cctcttgggc cttttctgtc ctacaaggga 1260aggcctccac ccaaccagtg aaagtaccca gattagtaga tactgaaatc tctgagattt 1320gggcatgtgg gtaaaatcta gttgctagtc ttctcctggg taatggcctg ttctttgttc 1380tcctgaagga gcttggcaat aaggcagggg attatttctt tggcacactt cacaggccct 1440gactatctgc ttgacagttt tgaaaaggcc tggtccagta aataatgatt tggccatctg 1500atgggtgctg tcaatgccta agtgaaaggt ctggtgaagg gttttaagta atttccattg 1560gttagctgca ggcaaaagta ttttttcttt ggtggctggc catcctgagg agaggaaact 1620atgtcctcgt gagtttcccc attccatttc ttctgctgag tactggagct tggtttccca 1680gaggggatta ccccatacta ggggtccttc tgtaagcatt tctaatggag agtcctgcct 1740627140DNAHomo sapiens 62ttggtcttaa gaacacaaat gatatggctc caatgactgg aggaacacca gggtccttgg 60tctcacgctg atttagataa aacgactgtc aggcctctga gcccaagcta agccatcctc 120ccctgtgacc tgcacgtata catccagatg gcctgaagta accaaagaat cacaaaagca 180gtgaaaatgg cctgttcctg ccttaactga tgacattcca ccattgtgat ttgttcctgc 
240cccatcttaa ctgagcgatt aaccttgtga aattccttct cctggctcaa aacctccccc 300actgagcacc ttgtgacccc cgcccctgcc cctaagagaa aacccccttt gattataatt 360ttccactacc cacccaaatc ctataaaatg gccccacccc tatctccctt cgctgactcc 420tttttcggac tcagcccgcc tgcacccagg tgaaataaac agccttgttg ctcacacaaa 480gcctgtttgg tggactctct tcacacggac gctcatgaca tttggtgcca aaacctggga 540taggaggact ccttcaggag accagtcccc tgtccttgcc ctcactctgt gaggacatcc 600acctacaacc ttgggtcctc agaccaacca gcccaaggaa cagctcacca atttcaaatc 660aggtaagcag tcttttcact ctcttctcca gcctctcttg ctacccttca aactccctct 720ctcactaccc ttcaatctcc ctgtccttcc aattccagtt ctttttcatc tctagtagag 780acaaaggaga cacattttat ccatggaccc aaaactccag caccagtcac ggacttggga 840agacagtctt cccttggtgt ttaatcactg cggggacgcc tgcctgatta ttcacccaca 900ctccattggt gtctgatcac ggtggggaca cctgccttgg tcactcaccc acattccctt 960ggtggtacgt caactgcaaa agcaggggac gcctgctttg gctgctcacc cacccccttc 1020tctgtgtctc tacctttctc tttaaactta cctccttcac tatgggcaaa cttctgccct 1080ccattccccc ttcttctccc ttagcctgtg ttcttaaaaa cctaaaacct cttcaactca 1140cacctgacct aaaacctaaa tgccttattt tcttctgcaa cactgcgtgg ctgcagtaca 1200aacttgataa tagctttaaa tggccagaat atggcacttt caatttctcc atcctacaag 1260atctagataa tttttgtgga aaaatggaaa aatggtctga gatgcctgac gtccaggcat 1320tcttttacac attggtccct ccctagtctc tgctcccaat gcgactcatc ccaaatcttt 1380cttctttctc tcctgtctgt tccttcagtc tccaccccaa gctctgagtc ctttgaatcc 1440tcctttgcta cagacccatc tgaactctcc cctcctcccc aggctgctcc tcaccaggcc 1500gagccaggtc ccaattcttc ctcagcctct gctcccccac cctataatcc ttttatcacc 1560tcctctcctc acactcagtc cggcttacag tttcgttctg tgactagccc tcccccatct 1620gcccaacaat ttcctcttaa agaggtggct ggagctaaag gcatagtcaa ggttaatgct 1680cctttttctt tatctgacct ctcccaaatc agttagcgtt tacgctcttt ttcatcaaat 1740ataaaaaccc agccagttca tggcccatct ggcaacaacc cttacaggct ttacagccct 1800agaccctgaa gggtcagaag gccgtcttat tctcaatatg cattttatta cccaatccgc 1860tcccaacatt aaataaagct ccaaaaatta aattctggcc ctcaaacccc acaacaggac 1920ttaattaacc tcacttcaag gtgtacaaga atagagtaga 
ggcagccaag tagcaacgta 1980tttgagttgc aattccttgc ctcaactctg agagaaaccc cagccacatc tccagcaaac 2040aagaacttca aaacacctga actgcagcag ccaggcgttc ctccaggacc acctccccca 2100ggatcttgct tcaagtgccg gaaatctgac cattgggcca aggaatgcct gcagcccagg 2160attcctccta agccacgtcc catttgtgca ggaccccact ggaaatcgga ctgtccaact 2220cacccggcag ccaatcccag agcccctgga actctggccc aaggctctct gactgactcc 2280ttcccagatc ttctcggctt agcagctgaa gactgacact gcccgatcac ttcagaagtc 2340ccctggacca tcacggatac tgagcttcag gtaactctca cagtggaggc taagtccatc 2400ccctgtttaa tcgatacagg ggctacccac tccacatcac cttcttttca agggcctgtt 2460tccctttccc ccataactgt tgtgggtatt gacggccaag cttcaaaacc ccttaaaact 2520cccccactct ggtgccaact tggacaacat tcttttatgc actctttttc agttatcctc 2580acctgcccag ttcccttatt aggccgagac attttaacca aattatctgc ttccccgact 2640attcctgggc tacagccaca tctccttgcc gcccttcttc ccaacccaaa gcctccttca 2700tatcttcctc tcatatcccc ccaccttaac ccacaagtat gggacacctc tactccctcc 2760ctggcaaccg atcacacgcc cattactatc ccattaaaac ctaatcaccc ttaccctgct 2820caatgccagt atcccatacc acaacaggct ttaaagggat tgaagcctgt tatcacttgc 2880ctgctacagc acgggcttct aaaacctata aactctccat acaattcccc cattttacct 2940gtctaaaaac cagataagtc ttacaggtta gttcagaatc tgcaccttat caaccaaatt 3000gttttgccta tccaccctgt agcacccaac tcgtacactc ttttgtcctc aatgccttcc 3060cccacaactc actattccgt tcttgatctt aaagatgctt ttttcactat tcccctgcac 3120ccctcatccc agcctctctt tgcttttacc tggactgacc ctgacaccca tcagtcccag 3180cagcttacct gggctgtact gccgcaaggc ttcagggaca gccctcatta cttcagccaa 3240gctctttctc atgatttact ttctttccac ctctctgctt ctcaccttat tcaatatatt 3300gatgaccttc tactttgtag cccctccttt aaatcttctc aacaagacac cctcctgctc 3360cttcaacatt tgttctccaa aggatatcgg gtatccccct ccaaagctca aatttcttct 3420ccatctgtta catacctcgg cataattctt catgaaaaca catgtgctct ccctgccaat 3480tgcgtctcca actgatctct caaatcccaa cctcttctac aaaacaacaa ctcctttccc 3540tcctaggcat ggttggatac ttttgccttt ggatacctgg ttttgccatc ctaacaaaat 3600cattatataa actcacaaaa ggaaacctag ctgaccccat agattctaaa tcctttcccc 3660actcctcttt 
ccattccttg aagacagctt tagagactgc tcccacacta gctctccctg 3720tctcatccca acccttttca ttacacacag ccgaagtgca gggctgtgca gtcggaattc 3780ttacacaagg accgggacca tgccctgtag cctttttgtc caaacaactt gaccttactg 3840ttttaggctc gccatcatgt ctccatgcgg tagcttccgc tgccctaata cttttagagg 3900ccctcaaaat cacaaactat gctcaactca ctctctacag ctctcacaac ttccaaaatc 3960tattttcttt ctcacacctg acgcatatac tttctgctcc ccggctcctt cagctgtatt 4020cactctttgt tgagtctccc acaattacca ttcttcctgg cccagacttc aatctggcct 4080cccacattat tctggatacc acacctgacc ctgatgattg tatgtctctg atctacctga 4140cattcacccc atttccccat atttccttct tttctgttcc tcatgttgat cacatttggt 4200ttactgacgg cagttccacc aggcctgatc gccactcacc agcaaaggca ggctatgcta 4260tagaatcttc cacatccatc attgaggcta ctgctctgcc cccctccact acctctcagc 4320aagccgaact gattgcctta actcgggcct tcactcttgc aaagggacta cacgtcaata 4380tttatactga ctctaaatat gccttccata tcttgcacca ccatgctgtt atatgggctg 4440aaagaggttt cctcactacg caagggtcct ccatcattaa tgcctcttta ataaaaactc 4500ttctcaaggc tgctttactt ccaaaggaag ctggagtcac acactgcaag ggccaccaaa 4560aggcgtcaga tcccattact ctaggaaatg cttatgctga taaggtagct aaagaagcac 4620ctagcgttcc aacttctgtc cctcatggcc agtttttctc cttcccatca gtcattccca 4680cctactcccc cattgaaact tccgcctatc aatctcttct cacacaaggc aaatggttct 4740tagaccaagg aaaatatctc cttccagcct cacaggccca ttctattctg tcatcatttc 4800ataacctctt ccatgtaggt tacaagccac tagtccacct cttagaacct ctcatttcct 4860tccatcgtgg aaacatatcc tcaaggaaat cacttctcag tgttccatct gctattctac 4920tacccctcag ggattgttca ggccccctcc cctccctaca catcaagctc ggggatttgc 4980ccctgcccag gactggcaaa ttgactttac tcacatgccc tgagtcagga aactaaaata 5040cctcttggtc tgggtagaca ctgtcactgg atgggtagag gcctttccca cagggtctga 5100gaaggccact gcagtcattt cttcccttct gtcagacata attccttggg ttggccttcc 5160cacctctata cagtccaata acggagcagc ctttattagt caaatcacct gagcagtttt 5220tcaggctctt ggtattcagt ggaaccttcg taccccttac tgtcctcaat cttcaggaaa 5280ggtagaatgg actaatggtc ttttaaaaac acaccccacc aaactcagcc tccaacttaa 5340aaaggaggat agagcccaaa aactcgcaac caagctagta 
attatgctga acccccttgg 5400gcactctcta attggatgtc ttaggtcctc ccaaatctta gtcctttaat atctgttttt 5460ctccttctct tattcggacc ttgtgtcttc cgtttagttt ttcaattcat acaaaaccgc 5520atccaggcca tcaccaatcg ttctatacaa taaatgctcc ttctaacaac cccacaatat 5580cgccccttac cacaaaatct tccttcagct taatctctcc cactctaggt tcccatgccg 5640cccataatcc ctctcgaagc agccctgaga aacatagccc attatctctc cataccaccc 5700ccaaaatttt tgctgcccca acacttcaac actattttac attatttttc ttattaatat 5760aagaagacag caatgtcagg cctctgagcc caagccatca tatcccctgt gacctgcaca 5820tatacatcca gatggcctga agtaactgaa gaatcacaaa agaagtgaaa atggcctgtt 5880cctgccttaa ccgatgacat tccaccactg tgatttgttc ctgccccacc ttaactgagc 5940aattaacctt gggaaattcc ttctcctggc tcaaaacctc ccccactgag caccttgtga 6000cccctgcccc tccactaccc acccaaatcc tataaaatgg ccccacccca tctcccttag 6060ctgactcctt ttttggactc agcccgcctg cacccaggtg aaataaacag ccttgttgct 6120cacacaaagc ctgtttggtg gactctcttc acagggacgg gggtgacaac aacacggaca 6180cacatggagt ggttttaagg agcagagagt ttaatacgca aaaaagaagg aagaggctcc 6240cctgtacaga cacagaggga gggggctcca agccgagaga aggaaacccc atgtgcagtg 6300gaaaagtggt tgattatact gggaggctgg aggaggcggt gtctgatttg cacagggccc 6360aggggattgg gttgaccagg tgtatcattc atgtaccccg caaaaaacct ggccctccca 6420cctcagccct ttaatatgca aatgtgggtt gccatgatgt tctgaaaaca catgaattat 6480ctggaggggg ccatgacact tggtacatgt gctgacaaga agagggtggg aatcgccatg 6540gtggccatgt tgggtggacc tagtttttaa tagcctgcat ttgcatatca aagtttgctg 6600gcctggctct ttaagctgtc ttttctgtta gaaaaggaat ggtttggaat gggtgagggt 6660tgcttcttat tacaagaaaa tttccaaaaa cctttactct ttctagctgc caaaaaacta 6720tttcttaata acttatgtat taccataatt aggcagcacc aaagatccct gcaggtcaga 6780ccactgcaat taacatgctg gctttactgc tgattatggt agctgcatcc acctagcctc 6840tcatattgca actgcctgac ctctgccacc ccacgagcca cttatcccca cttataatca 6900gcccatttcg attgtaacat ctgccactta ttcccgacgt tgtggtatat cctatagatg 6960aattcattca acatccattc caacaccacc tctcttgcct tcctatactc tctggagagt 7020gaattactga gtcacatgat cttcactgca gtcatttgtg gctatgtgac atagttctgg 7080acagtgaaca 
tagacagaag tccctggggc gggcttcctt tctgggatga gggcaaaacg 71406344100DNAHomo sapiens 63tgcctttatt tccgtaggct ggtcatatgg cgctagcact cacataaagc taccgaggag 60agcgaatgaa accaaaatca ctttaccttc acagcacgag gccgtcgtcc ctctcgatat 120ttggcccgtg tgtcgcatac cgccctctgg acgtggtgat caaataaact ccctagctcc 180ccgccgctcg acgccatctt gcctactttg atcctcgcag ggaggacaac atccgcccta 240ctgagctccc ttttatccaa taagagagcg ggatgagtta aggagtgcca ggattggctg 300gagaatcgac agcgtcggcc atcgtttcct gcgtgcgaag atttgatgaa cgaggtgccg 360cccccgagcg gctcggcgga gaggcgcggt gggtgacaga agctttcttg tcccacccac 420tacaggctta cggcaggatg cgcagcgggg agagggggcg gggccgcagg gggcggggcc 480gatcgatctc ctccggctcc gacgtcctcg gcctgccggg tcccgggtcc tttgcggcgc 540tagggtgggc gaacccagag cgacgctccg ggacgatgtg gggcagcgat cgcctggcgg 600gtgctggggg aggcggggcg gcagtgactg tggccttcac caacgctcgc gactgcttcc 660tccacctgcc gcggcgtctc gtggcccagc tgcatctgct gcaggtaacc tgccggcccc 720gagccacctg atcttcagcc tggggtcgga cgaggccgaa gcctctcagg gacgcggcgg 780gacaccggct gccacccggg cgccgccgaa gcgcgcagag atcagggtcc ctcgacggca 840gggcccttct gggtagtctc tggatcccac aagtccagtg cagccctggg ctcgtcttat 900cccaggtctt ttcacttggt gaaactgaac ctagaaacgt cctaatattc taccactgtt 960tttataaata ttccttattc caggctggaa aagctcctga gaagtggttt gtttttatta 1020ttttaaaagg tgttttcctt gccagccatt tccagttaac ctgcgctgct gccgtccggg 1080ccgcgagagc gggacgcaga gttgttggcg gagcccctgt cggttcccgg ggactaagca 1140ccgcgtccca tgagcgggaa aggttaatac aatgatggtt ctgccctgcg tcgctgacgc
1200ggaacacagc tgtagtgtgt taggaacaca taacgtagtt aagatcactt gaagctctgc 1260gatcagtcgc ccttctggac gttgtggtta ggatgtttca cagttctaac cactggtgga 1320gatacagcgt ccatattttc ataattaaaa atagaggcac atggtctcac gagtttgagt 1380gtacttatgg gggcaaaagg acggcgtatt tgaaatcctc ataaatcctg gatgcatggt 1440acccaccagt ggctaatcta tgcaatgaat agagtttgca ataatttcaa gcatcccttc 1500tttccacttg agttacttcc ccatacctag gggaagatat ttttggtcca ctgaaaacat 1560gagttcagca gaatcctcct atcatcgtcg ttattatttt ttaccactaa gtagacaatc 1620ttttggtttt tgatgggctt tatggctaga gacaaatcag tcactgtcac caagttccag 1680gtagaagttg gttcagtgct ctgtcagctt cgatgggatt tttcaacatg ttttcaaatc 1740tgcacttaat agtaggaatg ctttcttaca gtaactctaa tttgatccta agatgtagtt 1800gttaccttac attcatcact gtttaagaat ttagtggtct tgatctttgt tttaaatttt 1860gagccttcgg gaagtactta taagaattaa ttcatgcata tctttttgaa atgtaaatgt 1920ctttagccct ggaacaaatt gctgtttctg ttcagcccat attagcagaa taggtcaact 1980ttactttcta attatcaatg taataagttt attactttat agattccata aatctataca 2040tttattcctc gatgaattat ataaatttat agaatttatg ttttatagaa aatttggaaa 2100gcatggaaaa ttattaacaa gaaaataagt tacccataat cccagaactt agaggtgact 2160aatgttgaca gtttggatca aatcttccag ttttgtttct aatctttatt tttaacataa 2220atgaggtcct gtatacacac gtacagtttt gtgtcctggt gtttttattt aatgttatta 2280tgagtgtttt attttgttaa aaggtcatca ttttaagttg ttaattagta ttctagcaca 2340aatttgccat aatttattta attgtttact atgattgacc atttagattg tacttaattt 2400ttaggcatta gaagtgataa actatatttt aatcagacgt tgaaaataac acatctttgt 2460ttagaaaaca tcattttatt tctggttgtc taggatagat tcccagaatt cttgggttag 2520aggccataga taattatgaa agcagaaaga ttcacaagtt gggagttaat acttgaatta 2580ctttatttgg ggtgaagcat tgagtgcata atacagatca tgcagtaatg ggaagaaggg 2640ttggaacaat ggttttctgg cctatgtcag acttaccttg aagcttttaa gaatacagat 2700gttctgatca accctcagac ctattaaatc agacctaaaa tcttagggaa taggctttag 2760gcatctctaa ttttaaaaaa tttattcagg ctacttggat gcacaaaaga gttgagacct 2820actgtcctag aatcatagaa ttttaatgac gatagagacc ttaagcatct aggtcgtttc 2880tgtactttta catgtaagga aactggcatt 
cctaggccag taccattgcc atgcagctaa 2940tttgccctct tgtctatagc tcactctgca tcacccaacc taccgttctc actgtttctt 3000ctataaccaa tctccttccc acttctgttc tcttactcat gccattcttc cctcagtcat 3060ttttcttcct tccatacaaa ttccatgtct ttaaaaagga ataatcctac ctcctccaca 3120tagctttcca attctctgtt gcccacattt gtctcccttt caatacttct ctgttgtgtt 3180atgtgacaca tcacatttga tatactctgt actgtgtttc aagtattgta ttctcttgtt 3240tactcaagtc attatttcag gactgactac ccagtagatg ctttaagtca ggatttctca 3300accttggcac tgttgacatt ttgagctgga taattttttg ttttgggggc tctcctgtac 3360attttaagat gtttaacagc acccttggcc tctatccagt agacgcctgt actgcctccc 3420cctatctgtg acaaccaaaa aggtcttcag acattgtcag atgtctactg aaggacaaaa 3480tcacctctgg ttgagaacca ccgcttcaac taagttatct tctctgtact cagaacttga 3540tgtgattgca gcagggggag aggattcata tacacagtga atgcaaacga acctaaatca 3600ccattcggat atggccacac aattttcatt tcccttgtgt tagcaagaga taccctaggc 3660tttggacctg attattccta aggcattctg atgtatggtt ttacctgcag atttcctggt 3720aatactgata cctcagtttg ggtcaaagaa ggtcaattaa ttgattgatt tgatttgact 3780cctggaaaag acgctccttt ctagctgtct ctttcttctc tttacctgaa tagccagggc 3840tctgtggttc aagtgaagta ttttgacata aaaattaact tagaacattg gtctgcagag 3900tttgctcaat ataactgagc acatattgtg gctttatgga gctggttact actttttgac 3960caaataaata attagaagta tttttcctcc tcaataaggt tcatttttcc ttttttcagt 4020gagctggtag agtttccttt tttgatattt cagggcatct ttcatatttc catctcttaa 4080gtttcttcat atgaagtaga atttatctgg attatgtatt gctgactctg atgaaaaccc 4140atagaaagca tctggggctt gatcaccttc attcttgtaa tagctcacac ggttacagct 4200gatatggtaa cttaagactt ttgattccaa atctaggcaa aatacactca gttgaaagaa 4260tttgtcagcc agaacagttg gactgttctg tgaaaattgt gagaaaaatt acacaactaa 4320gtgatacatg atgatggctt tcttaaatat aaaattgtaa taacatggtt aatttccagt 4380acgttatatt gtcccagaag tggctccaac attgtttgaa atttgtctca tttaaagaaa 4440cataagctgg ctatggtggc tcacgcctgt aatcccagca ctttgggagg ctgaggcagg 4500cagatcacct gaggtcagga gttcgagacc agcctggcca acatggtaaa accccatctc 4560tactaaaaat acaaaaatta gccgggcatt tggtgggggc ctgtaatccc agctacttgg 
4620gaggctgagg caggagaatt gcttgaatct gggaggtgga ggttgcagtg agccgagatt 4680gtgccactgc cctccagcct gggtgacaga gtgagtctcc gtctcaagaa aaaaaaaaaa 4740aaaagcaaga aacataaaga ctgggcatgt tggctcatgc ctgtaatccc agcactttga 4800gagactgagg tgggaagatc acttgagccc aggaggttaa ggctgcagtg agccgtgatt 4860ttgccactgt actcgagcct gggcaacaca gtgagatcct gtctcaggaa aaaaaaaatt 4920gcatgtaaat gaatgaattt gatatttaat attttaaatt atgaaaactg ttctgtagag 4980atgtagatct tgccatgttg cccaggctgg ctttgaactt ctgggctcaa acaatcctcc 5040tgtctcagtc tcccaaagta taaagattac acatgtgagc cactgcacct ggcctaatat 5100ttttaactta atgaatttat tttgatataa ataaattaat aacactgaag cttcctgata 5160taataagtct ttttgtgtgt gtgacgggtt ctcactctgt tgcccagact ggagtgtaat 5220ggcactatca tggctcactg tagcctcaac ctccctgact caagtgatcc tcccacctcg 5280gcttcctgag tagatgggac cacaggcgta tgccaccaca cctggctgat ttttaaaatt 5340tattattgat acatattaat aaaattattt ttattttaaa aatgatatat gtggctgggc 5400atggtggctc atgcctgtaa tcccgacagt ttgggaggcc gaggtgggag gatcacttga 5460gaccaggagc ttaagaccag cctaagcaac atagtgagat cccatctcta tagaaaaaaa 5520aaatggctag gtgtggtggt gtatgcctat attcccagct actcaggaga ctgaggtgag 5580aggattgcta gagcccagga gtttcaagtt acagtgacct atgattgtgc cagtgcactc 5640cagcctgggc aacagagcaa aatcctgtct caaaaaaaaa aaaagttcga aaatgcttat 5700gatgcaatat aagtagtgga aaaggatatt aaattgtgcc tatatgaaca caactatatg 5760aaaaacttgc acatagagaa aaggattaac aagaaataga ccaaattgtt cacatggttg 5820tcttgtttgt ggagagaata tcagtagttc atttgtttcc ttccaagttt atatgttttc 5880cgaggtctct ataatgagtt tgtaattgtt taatcataga aaaccctttt ttggtccttg 5940gccacaaact tacatgtttt aatgtaattg cttttttaat gagaataaat gttatatttt 6000gcttttttaa aacctatatt cccatagtta tatgagccct tacaattatt aagaggctgc 6060ataatataac gtttctggaa gggtacagaa gaaacagcag taattacctc tgagaacaga 6120gacatggctt cacattttac ccttttgtac gttttgtgct tttgccacat gcatttatta 6180ttcttccaat aaataagtaa ataaatatgg attgtatact ccatctggtt ggtgtttcat 6240aattctaaaa ttatattgct acatttttaa agatgatatg tgtttctact tattaacgta 6300tatgttaaaa tagtaaattt atatcttatt 
taataatttc cctattgata gacatttaag 6360acagtctcaa gtgttcacta tcatagaaaa tactgcacag atagcttttg ctatagtttc 6420ttttttcttt gaatcgttaa ttgggaataa atgctcaaat agttatatgt ggctcaactg 6480ctatttaagt ttattgactg actgctgcca ttttgaattc tgaaggggtt gattaaattt 6540ataatgctgc cataagaata taagggtatt ggcttcatta gcatccacca gcattgggtg 6600ttggaaatga ttatagattt ttaaatgcta caacaaatgt agataacaga gaactatcta 6660tagaactctt tttggacatg tgaattgtaa taatagttta ttttcatgtg aatccagaaa 6720aatgtatacg aaaacctttt ttcctctcat ttcttatatg aatagaatca agctatagaa 6780gtggtctgga gtcaccagcc tgcattcttg agctgggtgg aaggcaggca ttttagtgat 6840gggggacagg taagcacatg tgatggcaat aactttcttc taatatcaca taatatagca 6900atagaaataa aattaaaagt ttagattttt tgttaaagga ggtgagatgt cacctaattt 6960gtatgctatt atgtaactag tctaggatat tgaagctgac tatactctgt ttttaggtca 7020ttatcttgta gtttaccata ctccctactt gcttcttatt ctactattta actcattttc 7080cacatcccct aattttggtt tcatgaaatt atttttcctt ctgaattact aggttctact 7140tactattatt aaactttatt tctgacatat tttataacct tccatggtct cacttgatta 7200aaaataaaaa attcagctgg gtgcggtggc tcacacctat aatcccagca ctttgggagg 7260ccaaggtggg cggataattt gaggtcagga gttggagacc agcctgccca acgtggtgaa 7320accccccctc tctactaaaa attcaaaaat tagctgggca tggtggcagg tgcctgtaat 7380cccagctact caggaggctg aggcaggaga attgcttgaa cctgggaggt ggaggttgca 7440gtgagctgag attgcactgc tgcacttcag ctgggtgaca agagcgaaac aatgtcttga 7500aaaaaaataa aaaataaaaa attctacaac acagggttat tatttttcca tttttgtttt 7560cccttatgag tttaatatgt ttagattata aacctgaaag cttgaatacc tatgtctatc 7620ttttgttttc ttatgtttat caagttattc ctttaaacat tttctaaact gtaagaataa 7680tgtgaggctg ggctcaatgg cttatgcctg taatcccagt gctttgggag gccaaggtgg 7740gaggaccact tgaggccacg agttcaagat tagcctggct aggcaacata gcaagaccct 7800atctctataa aaaaattaaa aaaattagct gggcatggta gcaaatgctt gtagtcccag 7860ctactcagca gactgaggta ggaggaatgc ttgagaccag gaatttgagt gacctatgat 7920tatgcactcc agcccgggca atagcaagac cctatctctt aaaagaagaa gatgtagtaa 7980taatacatat tcattataac tattttacca ttgaaagtaa aaaatgagtt tttacctttt 
8040cccagtccca tcctcagaat ggggatctca gtagaccttt aggattggaa gaatgagatc 8100attcatattt tctgcaatta ttaccccaca aaatatttca gatacctttc catgtattac 8160aaacaatgtg catttaacat gtctctctct ttctctctct ctctgtgtgc gtcttcatga 8220tcctctgttg cagccctgcc agtaagacac tatctcctga agaatcactg ataggaacag 8280aaagtggact ggctaggcca ggagtcctta gcttcttagg gggcaggagc tgctttgtgc 8340tttctcagaa tcagatatat atgtggactg aaacatttaa aaacagaata gccaagggtg 8400ctatacgttt aaaacttata tagatggggc tacattgctc tctattacta atttcccatg 8460acaatacacg agagtgccat gtctttttaa cttgttttga gcacagacta atcttgttta 8520tgcatgtttt ttgatgagaa taggctactc atgagaaatc tgtaaaccta acactagtcc 8580cttgcatact ctaaattgtt gctagaatct taaaatttta gcaccagacg gaccttagaa 8640atcattaact ttggtgcttt gttctacaat acaaggagat ggaatatttt acccaggatt 8700gcttagcagg ttacagttct gccctctgag tacccagcac ttccctgtgg gcaacatcaa 8760cttcctgatt ttcaagtctt aattagtact ctgaagaatc ctacttgttt ttaactccca 8820tttgctttga agtgacttta cctgattttt ttagatccct tattgcagca atgccactaa 8880gaaactgagt ctctagcttc ttggtgggca ggagctgctt tgtgcttgct cagaatcatc 8940cttttcagta agggagatat tgaagagaaa tctactgagg agtctggggg tgaggcactc 9000agggaaatcc tgctccagtc cacaaaagca gagaggaagg gttggttacc tagagtattt 9060aacatgcaga ggctttggat tttactcctt taatccttgg aaatgcctat ggaaggggaa 9120aggaagtaag atggtgactc cagcttatag acatactagt gttacatata tttaaactat 9180aataggaggg tattattagt tttacttaac tttcaactgt gaaggattat acttctcaat 9240atttgtctcc agtgtctatt tcagtgtatt tttcactttt cttgaagcag catgtctgtt 9300gcaaaacttc tagaaataat gagaatattt atatattaga tcaagccata acttgatgat 9360atagtcattt cttcttatat tttttactta catttttaca ttttaatgat tactttcatt 9420tttgaaaaac atgtcatgct gagatgtatt tttcttcatt ctgtaattag ttatgaaaca 9480gtttttccta aaatgctgag tatatcaagt cttggctaag aataagtaat aaatatttgc 9540cacatgaaag actacacata tagccaggtg cagtggcttg cacctgtttt cccagctacc 9600caggaggctg aggcaggagg attgcttgag cccagggttt ccaggctgca gtgaactatg 9660attgtaccac tctactccag aatgggtgac agagccaggc cccatctctc aaaacagaaa 9720agaaagatta catagactac atatacaccc 
ccatccaaaa catacacaca catctactta 9780acctaaaatg gtaagaagat aacttcttat tttctaatat atgacacaga aaagtttttt 9840taaagtagtt ttaaattttt aattttttct aggtatttct caagccatgt tcccatgtgg 9900tatcttgtca acaagttgag gtggaacccc tctcagcaga tgattgggag atactggtaa 9960agaaaaccaa ataagaacta tctcatttaa ggttaaatta cttcacaata tcaatgtctt 10020tagctttctc taagctttat tatatattct gagttggttt tgaattataa gaatgaattg 10080gggccaggca cagtagctca tgcctatagt cccagcactt tgggaggcca aggcaggtgg 10140attgcttgag tccaggagtt caagaccagg ctgggcaaca tggtgaaacc ccgtatctac 10200taaaaataca aaaattagcc aggcatggta gtgcatgcca ttagtcccag tcacttggga 10260ggctgaggca ggagaatcgc ttgagcccgt aaagtcaagg ctgcagtgag tcaggatctt 10320gccattgtac tccagtctgg aaaacagagt gagaccttgt ctcaaataaa aaaagaatga 10380attgatagag atctaatgta caacctgaca actataggta ataaaattgt attggggatt 10440catgttaaat gagtagattt taactactct taccacaaaa acacaaaagt gggtaactgt 10500gagatgatgt atatgttaat ttacttcact atagtaacca ttatactatc tatatgtagc 10560tcataacacc atgtcgtgta tattaaatat gcacattaaa atttgttttt taaaaaaaga 10620attgagattt tttttaacta gatatggagt ggacaaaatg taaagtgaat tgatcttttc 10680gtctgttggt tctaggagct gcatgctgtt tcccttgaac aacatcttct agatcaaatt 10740cgaatagttt ttccaaaagc catttttcct gtttgggttg atcaacaaac gtacatattt 10800atccaaattg gtaggtgcta ttgtaatatt tgctgtcata ttctacacta tagcattgag 10860tccaaagtag aaatgaatgt gcactaatga gctttatttt ctacacagtt gcactaatac 10920cagctgcctc ttatggaagg ctggaaactg acaccaaact ccttattcag ccaaagacac 10980gccgagccaa agagaataca ttttcaaaag ctgatgctga atataaaaaa cttcatagtt 11040atggaagaga ccagaaagga atgatgaaag aacttcaaac caagcaactt cagtcaaata 11100ctgtgggaat cactgaatct aatgaaaacg agtcagagat tccagttgac tcatcatcag 11160tagcaagttt atggactatg ataggaagca ttttttcctt tcaatctgag aagaaacaag 11220agacatcttg gggtttaact gaaatcaatg cattcaaaaa tatgcagtca aaggttgttc 11280ctctagacaa tattttcaga gtatgcaaat ctcaacctcc tagtatatat aacgcgtcag 11340caacctctgt ttttcataaa cactgtgcca ttcatgtatt tccatgggac caggaatatt 11400ttgatgtaga gcccagcttt actgtgacat atggaaagct 
agttaagcta ctttctccaa 11460agcaacagca aagtaaaaca aaacaaaatg tgttatcacc tgaaaaagag aagcagatgt 11520cagagccact agatcaaaaa aaaattaggt cagatcataa tgaagaagat gagaaggcct 11580gtgtgctaca agtagtctgg aatggacttg aagaattgaa caatgccatc aaatatacca 11640aaaatgtaga agttctccat cttgggaaag tctgggttag tataaatttt ataacttggg 11700agaaatttta tgtggcttaa acatccccaa attatgaatt agaatagtat ttcatatata 11760aattgaaaat caattaaaaa gaaacacagt gcctaaaggc acttggggga cacatttacg 11820ctttgcagta aagtccttgt ttggataaag attgtatgtt ttctggccaa gtaagcttga 11880ataggtacaa gcttagatag gttcaggcca gagaggtcaa aattacttgc ctgagattgc 11940atagctagtg ttacaactag gattcaaacc caggcagatt gacttggggg ttcatcagga 12000tggagtgccc tacaaagcct cccatcttta atgcttgcag atttgttccc cagttaccga 12060aagcaacttg ttaatattag ggaaaagggc cagtgtaggg agagatccat ggcatgaggt 12120aaccttcctg ctgcatgtgg tggcacctgg attggaatgc atccaggagc tgcttaccct 12180gccggtgtct gctctttaat ttgtgtataa cggagaggaa gtagacaggg caactagtgc 12240tccagcccct catcctggcc acaaatatta atgctacctt tatatgacat aagtcactag 12300tccatttatt ggaacctaaa tttgaaccac tgtaaagtaa gacttcatag tgataaagag 12360aggaacttgt taggaaagag aataaaatag aaagagaagg ttgtctcctt ttgtagattt 12420tttttttttc tccaacagtt ttacctgtga cctttataca aataactgac aaagcattaa 12480tctctttggc ctacatcatt ttcttttcta tttttttttt ccacaagatg gagtttcact 12540cttcttgccc aagctggagt gcagtggcat gatctggctc actgcaacct ccgcctccca 12600cgttcaagtg gttctcctgc ctcagcctcc tgagtagctg ggactacagg catgcaccac 12660cacgcctggc taattttttg tatttttagt agaaactggg tttcaccatg ttagccagcc 12720tggtctggaa ctcctgacct caggtgatct gcctgcctcg gcctcccaaa gtgctgggat 12780tacaggcatg agccactgct cctggccggc ctacatcatt ttctaaagct ccagaccatt 12840cttttctttt cttttctttt cttttctttt cttttctttt cttttctttt cttttttctc 12900ttctcttctc ttctcttctc ttctcttctc ttctcttctc ttttcttttc tttttttgag 12960ttagaagctt gctttgttgc ccaggctgga gtgcagtggc accacctcca ctcactacaa 13020cctccacctc ccaggttcaa atgattctcc tgcctcagcc ttcagagtag ctgggactac 13080aagtgtgcgc caccactcct ggctaatttt tgtattttta gtagggacga 
ggtttcacca 13140tgttggccag gctagtcttg aactcctggt ctcaagtgat ccgcctgcct cagtctccca 13200aggtgctggg attacaggcg tgagccactg tgcctggcct cagatcatta ttttctgtta 13260gctttaaact gtccgttcag gagatcccac tgcatcctca aattcaaaat atctaacact 13320gagcttatga tttagctggt tctgtcatta gatgggaata tccttttatt tccttgaaat 13380tatatggtga gaacagggag aagtgctgat ggtaaagtcc tgtgattaag atagcaataa 13440ggactccgcc cttcccactc cactgaaggt tgaagagcca tggacaatga gaagtcacag 13500taggtgaaat caggtactaa aatggacttg gcttgagaga tcaaaattga tcacttggtg 13560atacaactaa caaattcatg ttaacttgaa cctttattac cctgtgaagc atggtgatta 13620aaaaaaaaca acaaacaaac aggaaacttg attgttaaat tctctttaag tcagaatatg 13680taccttagag tttttattta tgcttttgtc taccattaat atgtctgcac ctgctcttta 13740gaagttaata gagagtaaag tcgtctttat gtctttcagt gcttacttat atttgggaag 13800ttgagaaaaa tttttaacat cattattgat atatatatat atatatatat atatatatat 13860atatatatat atatatatat atagataatt tttttttttt tcttgagacg gagtctcact 13920ctgtcgccca ggccggagtg tggtggcgat ctccactcaa tgcaagctct gcctcccagg 13980ttcaagcgat tctcttgcct cagcctcccg agtagctagg atacaggctc ccaccaccac 14040gcctggctaa tttttgtagt tttagtagag acgaggtttc accatattgg ccacgctggt 14100ctcaaactcc tgaccttgtg atccgcccac ctcggcctcc caaagtgctg ggattacagg 14160cgtgagccac tgcgcccggc tgaggtaaaa tttaaagtgt acaattcagt catttttagt 14220atatttatac tagttgtaca gccatcacca caatctaagt ttagaacatt ttcattaggg 14280ggtgggagaa attttactct gctttttaga ttaagtttct gtctggatct aatcatttaa 14340tcagacaatc aggcagattg tctgtgatta gttttggcca ttccagcttc ttcattggtt 14400gttaactttc acaaataaag gctgctcaaa gattagaaat aacatttaat ttgaatgtaa 14460atgtgccata gtttaaaaga tgggtttggt gaatacagtc aaatacatac atttaaagct 14520ctaattctga agattatgta aagaaaagga aagaaatgta gggagaggat tgaaatgttc 14580atggtataac aatatctgaa catccatctg gtcacaccgt tggtatttga atgttttgtc 14640ctcctcaaat tcatatgtcg aaatcccaac tcccaaggtg atcgtattag gaggtgtggt 14700ctttgggaag tgattaggtc atgaaggtga agccttcatg aatgggattc gtgctcttat 14760aaaagagaac tgtgagaaat aagtttctgt cgtttgttag ccacccagtt taggatattt 
14820tgatatagca gcctgcatgg actgagacaa ctatgagtta ttatgatagc ttctgttatt 14880tcacctaaat tcatagaagc taatatatca atatttatgc tatgaaatat ttcttaacca 14940agctttgaat atatttatat ttttgtttat ttttaaattt cagattccag atgacctgag 15000gaagagacta aatatagaaa tgcatgccgt agtcaggata actccagtgg aagttacccc 15060taaaattcca agatctctaa agttacaacc tagagagaat ttagtgagtt caaatatata 15120tgttacatca aaattctttt acacgttttg taagatttct agttgcttta gctaagtaat 15180aagaatgttg tattcctttt tgatacaaat ctttttttat tgtgttaaac tatatataac 15240ataaaatatg ccatgttcgc catttttaag tgtataattc aaaggcatta attacattca 15300taatattgta caaccatcac cactatctat atccagaact tttccatcac cccaaagaga 15360aacttggtac ccattaaaca ataattcccc gtccactcct ttccccagtc cctggtaatc 15420tctaatgtat attgtgtctc tatgaattta cttattctag atatttcata tataagtaga 15480agtatgcatt tgtcttatgt atctgactta tttcatttaa cataatgttt tcaaggctca 15540tctgtgttgt atgtatcaga atgttattcc ttttcatggc tgaatactat tccattgact 15600gcatatacca catttgttta tccattcatc tgttgatgga cacttgggtt gtttccacat 15660ttttggctgc tgtgaataat gctacagtga acattggtgt acaagtatct gtttgagttc 15720ctcttttcag ctcctttggg atatacctag gaattatgtt taactttttg agaagctgag 15780aaatctttaa taaatgataa cacaaatact tatatttgcc aatgcaaata tgaatatttt 15840tggcttttaa gagattgatc attttgccac gtggttgtaa ttaaaaaaaa ttgtcccatg 15900ttgtttcagt attaatattg tagcctaaaa gagtgctaga ctgttttact ttttactcag 15960ttaattcttt ggatactggt agagtcagga aatgagatat tgaacttaaa gatctttgca 16020ggtggggtcc agtggctcac acctgtaatc ctagcacttt gggaagctga ggtgggagga 16080ttgcttgagg ccaagagttt gagaatagcc tgggcaacat agcaagaccc catctctaca 16140aaaaaattaa aaaaaaaatt aagccaggcg tggtagctca cgcctgttat cccaacactt 16200cgggaggctg agatgggtgg atcacttgag gtcaggagtt ggagaccagc ctggccaaca
16260tggtgaaacc ccatctctac taaaaatacc aaaattatcg gggcgtggtg ctaatcctgt 16320aatctcagct actcaggagg ctgaggcagg agaaccactt gaactgagga ggtggaagtt 16380gcagtgagcc tagatctcac cactgcactc cagcctgggt aacagagcga gactctattt 16440caaaaaaagt aaaaataaaa attagacaca tgtggtggca catgcctgta gtcctagcta 16500ctcaggaggc tgactgaagt gggaggatct cttgagccca ggagttccac actgcagtga 16560gctatgattg tgccactgca ctccagccta ggcaatatct caaaaaaaat ttttttaaat 16620agattattag gccagacgtg gtggctcatg ccagtaatcc cagcactttg gaaggccaag 16680gcaggcggat cacctgaggc caggagtttg agaccagcct ggccaacatg gtgaaacccc 16740atgtctacca aaaatacaaa aattagctgc aatgtctata atcccagcta cttgggagcc 16800tgaggcaagc gaatcgcttg aacccgggag gcagaggttg cagtgagtgg agactgcgcc 16860actgcactcc agcctgggcg atacagcgag attctgtctc aaagaaaaag gaatttgttt 16920tcctgtcttt atcgtagagg gaggaaaggg agaatggggt tggaatggtt attgagtgag 16980ccacattatg gtagatgtat cactgggcat agagaaaagg agcatttaaa acttttccgc 17040ctaacagatg tttcttcagg ctacactgca ctcattgtgc taactgtaat gtcaaatccc 17100agacctgtgc ctatagaaca tgaacatcct tcattggatt tgtttggtca ggcttacact 17160ttattaggaa gatcagatgt taaaataagg gtgttaaagt taagttcaga tatgaggata 17220attcattact attccttttt ctggcagcct aaagacataa gtgaagaaga cataaaaact 17280gtattttatt catggctaca gcagtctact accaccatgc ttcctttggt aatatcagag 17340gaagaattta ttaagctgga aactaaagat ggtgagtaca tttgttattt tgactttttt 17400ttctatttaa atagttgtac atttttaatt gttcttgcaa cctgtcatac ctgtgaacag 17460tatgtgaata gtgaaatata attatgataa ttaaacagta gtttttatgt attgaaaaat 17520atctttggcc gggtgcagtg gctcatgcct gtaatcccag cactttggga ggccgaggca 17580ggcggatcac ttgaggccag gagttcgaga gcagcctgcc aacatggcgc aaccctatct 17640atacaaaaaa atacaaaaat tagcctgaca tagtggtgta tgcctgtagt cccagctact 17700tgggaggctg aggcagaagg atcacttgag cccaggaggt ctgtgttcct gccactgcac 17760tccagcctgg gcagcagagt gagaccctgt tggggggaaa aaaaaaaaag tctttaactt 17820aaataaattt gacatttaaa atcttaaatt atttcatctc tgtttcagta ctaactctgc 17880atttattact ttctttttaa taggactgaa ggaattttct ctgagtatag ttcattcttg 
17940ggaaaaagaa aaagataaaa atatttttct gttgagtccc aatttgctgc agaagactac 18000aatacaagta atagcatgtt attgaatatt taataaaata ctatttgtta catatgattg 18060ataataaagt atgaagttcc ttgtaacacc ttgcattgtg aagtgtatta aaaacctgct 18120aagagtaagg aataacttga tttaaaatat tttattctgt aatctcttta aattatctgt 18180acaaattatt gacttaacct aaatttaaaa atgaatgcct tagcacaatt aagttccaag 18240aatagagttg atcatgttaa ctggtaaatg gatcatgatt taaaattctt ctaggattga 18300aacaaatgaa aacgtagttt taagggtttg attttttaaa ttcctatttt tacatgcaat 18360tttactgcac aacccatctt attttgacag ttcttaaatt cgcaactctt cagaaatatt 18420atcagatcac ttttctttgc ttccataagt ttttttatta ttatattatt attttttttt 18480tttaaaagac ggtgtctcac tttgtcgccc aggctggagt gcagtggcat gatcatggct 18540cactgcagcc tcgacctccc aggctcaggt gattctccca cctcagcctc ccaagtagct 18600gggaccacag gcgaatgcca tgatgcctgg ctaatttttg tatgttttgt agagataggg 18660tttcaccatg ttgcccagaa ttgtcttgaa ctcctgggtt caagcagttg ttctgccttg 18720cccacccaaa gttgtgggat tacaagtgtg agccactgcg cccagctatt ctagaagtat 18780tttaagagtc atcttttttt tttttttgag atggagtctc actctgtcac ccaggctgga 18840gtgcagtggc acactctcgg ctcactgcaa cctccacctc ctgggttcaa gtgattctcc 18900tgcctcagct tccctagtag ctaggattac aggcgcatgc caccatgccc tgctattttt 18960tgtagtttta gtagagacga gatttcacca tgttggccag gctgctcttg aactcctgac 19020ctcaagtgat ctgccctcct cagcctccca aagtgctggg attctaagtg taaaccacca 19080cacccagcca agagtggtct ttttacaata ttattttttg attaggacat tcattcttgt 19140cataaaattg aagatactct agtcatttag aatttcattg ttttggaact agacattgtt 19200tctttatttt tgaaatgtta ttgaaggaat accatttgga gaagatacaa atgtaagaat 19260tgtgaaaagg ataattgtga cacaaatcaa aattatagat aaaaatatac ctgtaaaatg 19320tattaaggca ataacattct ttctgcttgt tgaccataaa tatttatatt ccctggatgg 19380gtacattgtt attgtcaagg gtgtttaaat aatgatcttg catgcataat ttattctctc 19440tggtataaca gaatcagcaa tttagttttc tgggacccga gaaaaacatg caaaagacat 19500actttgaaat gtaaaactga tttttccttg caactgtagg tccttctaga tcctatggta 19560aaagaagaaa acagtgagga aattgacttt attcttcctt ttttaaagct gagctctttg 
19620gggtaagaag ttatggccaa actagcatgt tagacatgtt tttaacacta tatctggcag 19680agttttcaat gtaaatatta aagtagatgt taatgtcaat aagtgatctt aataatgcat 19740cagtagatat tttttcaagg attgtctcta tcttcacgcc tagcttataa tttgccttgt 19800cgtctttttt tttttctctt tatttttatg tttttatcca tccctggtgg taggggataa 19860ccttgtcttc ttcgataaca agaagtctga agcttattag aaattttact ttgagaattg 19920atcgatgaga agaaagcaac tagatatcac gtggatcata tatgcttgaa taaaacaata 19980attcttagaa caaataaata cattttaaaa gttaaagcca aaaacattag ttgaatgttt 20040aaaaatattt caaattaagt tattccttca ctgtcttgta ttactgtaat aatttggatt 20100atttgtgttt ttctcaactt ttaaaacaaa tatttaaaaa attcctcttt tgattaagta 20160gggctagata aaatataaaa aatatttttt aaactcctct taatttccat atttcttata 20220taatatgaga atctcttata aacactacct cttagaagtc tccacagaag ctttggtaga 20280tgtagtagta gggatttgat ttcttagaat ggtataatct gtaaatgttt tagtaaaagg 20340attaaacgat aaagtcaaaa tgtttatagc acagtgttta ttaatataaa ataaaatctc 20400tttttttttt tttgagatgg actctcactt tgtcactcag gctggagtgc agtgttgcaa 20460tctcagctca ttgcaacctc cgcctcctgg gttcaagcaa tccttccgca tcagcctcct 20520aagtagctgg gattacaagc atgcaccacc acacctgcct aattttttgt atttttagta 20580gagatggggt ttcaccatgt tggccaggct ggtctcaagt gatccgcctg cctcagcctc 20640ccaaagtgct gggattacag gcgtgaacca ctgtgcccag cataaagtaa aatctcttca 20700gactctcatg tgatcatgta aagtggcagg cagtcacagt caagaagtag tttaaagttc 20760atgtttgtaa aatataatct acagattgat actggatttc ataggtaatg tttaagagaa 20820aataagtttt tagttatcct cagtacttca aaagcaccca tttatgatta tgttgattac 20880taaactaaat catttggggg ctagaggtgt ttttttatgt gttaagattc cttaaggagt 20940tctattaggg caaaactttt agtaactgca tattttaaaa gtaataaaac taattttaaa 21000agcttggagg ctgggcgcgg tggctcacac ctgtaattcc agcactttgg gaggccaagg 21060cgggtggatc acttgaggtc aggagtttga gacgagcctg agcaacatgg tgaaaccttg 21120tctctactaa aaatacagaa attagccagg tgtggtggtg ggcacctgta atcccagcta 21180ctcgggaggc taaggcagga gaattgctcg aacttgggag gcagaggttg cagtgagccg 21240agatcatgcc actgcactcc agcctgggtg acagagcaag actccgtctc aaaaaaaaaa 
21300aaaaaaaaaa gcttgaagtc agattcgaca ttaatcagta tactttctct caagtagggg 21360acaatttcta agattttagt cttttaaaat ttattaacta gtctgagcat ggtggcttgt 21420gtctataatc ccagcacttt gtggggccga ggcagatgga tcacttgagc ccaggagttg 21480gagactagcc tgggcaacat ggcaaaaccc cgtctctaca acaaatgcac acacaaaaaa 21540cccaatcagc tgggtgtggt gttacactcc tgaagtccca gctactcggg aggctgaggc 21600aggaggatca cctttgccag ggcgtttgag gctgcaggga gctgggttca caccactgcg 21660ctccagcctg gatgacacag caagcccctt tctcaaaaaa aaaaagataa aaaattaaat 21720taaattaatt aactacactg ggaaggcaaa attcagcatt tttttatagc taaattttat 21780cctgcttcag tcttttatca tgtaactatg tatatttttt acagaggagt gaattcctta 21840ggcgtatcct ccttggagca catcactcac agcctcctgg gacgcccttt gtctcggcag 21900ctgatgtctc ttgttgcagg acttaggaat ggagctcttt tactcacagg aggaaaggta 21960agtggttaag gtgtgttcat ttttctgtaa catttaataa cttttcattt atctttcttt 22020gggttttgac catctattat atagggtggg ttttgaccat ctattatata gggtttatac 22080gacatatgga aagcattcat ttattcacta atatttctgt gtgtctgctt ttaggtgttg 22140ggggagtgat gacgaataag actgatgttc tccatgccct ttttctgtgt cagttgatac 22200aattatatgg tttttctttt ttaggctatt aggtgttgat agggttgagt aacttacaaa 22260tgttgaacca gccttgcata cctgtgataa ataccacgta gttgtggtgt atcattcttt 22320ctacattgct gagttttatc tgctaatgtt ctgttgagct tttgtccatt taagtttgaa 22380agtgattagt ttgcagtttt ctgtttttgt gttgtctttg tctggttttg ctatccgtgt 22440aaatctggcc tcataaaatg agatgggaag tattctctcc tcttcttttg tttttttgga 22500agaggttgta taaaattgag gctgaatctt ggtggttgcc acaatgacag gaactatttc 22560tgtgactgaa tatattggga attcctataa agcaattatt ttctagggaa gtggaaaatc 22620aactttagcc aaagcaatct gtaaagaagc atttgacaaa ctggatgccc atgtggagag 22680agttgactgt aaagctttac gaggtatgag tatggtaaca ctctatataa atcccttttt 22740cattagaaag acaggaatgt tatacataat gctgtcaatc taataaatac acatatcatc 22800tagtctttaa cttttctgtt tatcatttag tcattaaaat ttctttggct ttctaatgtt 22860tttgataaaa tttctaaaac tctccatatt taatggaggc ctattttttt ttctagccag 22920aactttttgt agactacatt tctggaagtg ctcactgaca ccactctgaa aaattagtac 
22980ttagaatata ctctaattgg tataaatgat ctctgaattg ctatggaaaa ctgggagaat 23040ggttgcttca ggggagagaa agtaggaggc tgtggacagc aatgaggaga attacagttc 23100accatataac acttttgtac ttttaaagtc cttaacattt acattattat ctattcaatt 23160aaaaaatatt gggaagattt tactttgaac agttaatttt tcccccatgg gtaccgctgt 23220catatagttc caactaatca tgaacttgtg tatttcctgt tctttgtaaa tttaaacttt 23280gtaactcacc aggaagtttg aagccaaatt tgtgtttcaa atatagcaac tccaggatct 23340ctaggcagat gcatttgcat ttgattttaa atgaatcttg atcccttact ctcacttatg 23400ttttcccaca tcctactttt tttattttgt tgtaagccat ctaaaattct caatgggatg 23460aaactgggta taaatgaata catgcataca ggaattatag tagcatattc cttttctttt 23520ttcttttttt ttttttttga gacagagtct tgctctgtag cccaggctgg agtgcagtgg 23580tgcgatctcg gctcactata gcctccacct cccaggttca agcaattctc gtgcctcaac 23640ctcccgagta attgggacta caggtgcatg ccaccacacc tggctaattt ttgtattttt 23700tagtagagat ggggtttcac catgttggcc aggctgatct caaactcctg acctcaaagt 23760gatctgcctg ccttggtttc ccaaagtgct gggattacta gcataagcca ctgcacctgg 23820cctccttttc tgagttttat aaaatttgat actttactgc acgctttgag actgtattaa 23880ttgaaccatg ttgatgaaca agtttttgtg atgggtatat taataaaata tagatcaaat 23940ttttatagtt aaatcaatat cgagcttttc tagtgctttc aaaaggacaa cctgaatttt 24000cccagcactg aaatgatact gaaaccattt catatcttct gtattaagga aaaaggcttg 24060aaaacataca aaaaacccta gaggtggctt tctcagaggc agtgtggatg cagccatctg 24120ttgtcctgct ggatgacctt gacctcattg ctggactgcc tgctgtcccg gaacatgagc 24180acagtcctga tgcggtgcag agccagcggc ttgctcatgg taaatgcatc caccactggc 24240ttaaggtctt gttcttttgt cagtcagcat ttttagtctt aacaataaat ctactctctt 24300cagagaataa tatatgtgtt atgttaagtg ttgtgtttga ggcccctgat ggcattctac 24360agttgtccta tagactgtaa tagcaaaatt ggtagagtaa aaacagtgtg aaaattctgc 24420aacttcatgg ttagtccttt agggtttttc attctccctt acttattgtt taatttacag 24480atttactctt ttgttcattt gacaaatatt tgtcaaatgc ttgtgcacag tctgtattct 24540caaattctag gagaaaaaga agggtgaaca gtattagcgc agaacgatac taataatgat 24600ggctactgtg tatgagtagc cagccctttc ttggctttct tggattgctt tgtattctac 
24660atgaagatat tccctgggct ttacaggtca ataaatggaa attcagagag attaatttga 24720ccagggtgac caacaaggag atgacagcat acactatgcg agaagtatac acagagtagt 24780gtaggagcat ataacctaaa ctgggggtga ggtgggataa ggagttatca gggaaggctt 24840tttggaggag ttgacaactg agccgagttt tgatggaaga gtagaaatta gcatgaacca 24900atttcatgct aataaagaag caaaggaagc gtggtctaca ggcaaaagca cagaggtaca 24960ggaagtaatg atatgttggg gaataccctg ttgactggag cttagagtgc aaggagagga 25020gtgctaggga ggtgaggttg gagggtttgg cagcattgac ttgcttcaag gttcttaaga 25080gctgaaatag atataaaatg caactaagag tggcttggat tattattacc tagtgtgtta 25140atctcaaatt ttgaaatcta tagcatctat aggactggtg ttactaatct tacactcgat 25200ctgttactgt tcttatacta gatctattag tccagtgttt aagggagtgg tgcagatttc 25260taggtcagga caggactcag atgtacatta ttaatgccta tttcagttct gaccttctca 25320tatgaaacct tataagacct ggggtaggaa gagattgttc tggaagtcat aggaatatga 25380actgtatttt gtttaacaaa caatacagta tggaaattta tcacccttcc agaatattta 25440tttcagagac aaatttttat cattcgttca tttatttcat aagatccacg agtagggaac 25500ctcactagac attgctctga gtatatggtc tgagtttgca gtacctcttg tgtctccatt 25560agatttatta ggtcctcaat agataaatca gggaataact agatggattc attttttaaa 25620gacatgaaag agcgatacca tacatactgc accttaaagg tcaaccttag agtatcatta 25680tttttaatga atgtataatt tttaaatttc atgtttactt ttcctaagct tttgcactat 25740attgcttaat tccagctttg aatgatatga taaaagagtt tatctccatg ggaagtttgg 25800ttgcactgat tgccacaagt cagtctcagc aatctctaca tcctttactt gtttctgctc 25860aaggagttca catatttcag tgcgtccaac acattcagcc tcctaatcag gtaatacact 25920acttgtaagg attattgaat tatgtccctt ttatagaaat tatttttcaa ttttattagt 25980aattcgtggc tttaaattta tgcttctctt aatgatttta aggatatgta agtcaacatt 26040tggtgcatat tgtgctagag gcataaatta taatttatag ccacctgaaa tgttagtatg 26100cgctttccaa gaaaatgact tttttgaaaa tggtatttct ttgaatgaga aagaacagag 26160agaaatagat agatggcttt taaacacttc attaattaaa cttttttttt ccaccatcac 26220ataatggcac ttagtcccct ttgggaactc atgagggttt tagtggtagt gagctgaaag 26280aaatatgttc caggactggc aaacatattc taaattcttt aaaattttca cctagcatct 
26340accctaaata ttcagaccct gtgctagtta actgctattg aagaacaaag gtattatatc 26400tattattaag gataatagaa tggtatttga gatattggtc attgaatatg aatatgtttt 26460gagaaataag ttttatagga accaaaaaaa aattcttaaa ggaaccatat attactaaaa 26520atgcttctta ttggagaaag aaatgacaat catttattaa tgtgattttt tcacaacttt 26580attaagatat aatttaagta caacaaactc acataaagtg tacaatttga tcagttttaa 26640catatgtaga tgccatgaaa ccatcaccac aattaaggaa acaaacattt tcatcactcc 26700agaagtctcc tagccctttt actacccatt cctcccctgc tccatcccca gacaactacc 26760aatttgcttt ctgtcactat agatttgtca acctgatttt ctccaaatat acattcaaaa 26820atatacagtt gaatacaatt ggaaattcga attttgtgtt tttttcttta ggaacaaaga 26880tgtgaaattc tgtgtaatgt aataaaaaat aaattggact gtgatataaa caagttcacc 26940gatcttgacc tgcagcatgt agctaaagaa actggcgggt ttgtggctag agattttaca 27000gtacttgtgg atcgagccat acattctcga ctctctcgtc agagtatatc caccagagaa 27060agtatgtttt actattaaaa cctgaacttg gaatcttctt tctattgtgg agaaatgtaa 27120ttgtagtaag acaagaatta aatatattcc attgtagtat ttgaataagc agttatttga 27180gtagaaaatt agtgtttcca gctaagatga tggcatattt tgaaaattca tatagtgaat 27240ataactagta aaagaagttt tgtttatttt taaacagaat tagttttaac aacattggac 27300ttccaaaagg ctctccgcgg atttcttcct gcgtctttgc gaagtgtcaa cctgcataaa 27360cctagagacc tgggttggga caagattggt gggttacatg aagttaggca gatactcatg 27420gatactatcc agttacctgc caaggtatgt ttaaaaaaag aaaaagtgaa tacttactcc 27480cagaagaacc actgtattat tggctttggc tttatgtgtc agcttgccca atctccgtgt 27540gagtcaacaa gtgtttactg agttaccaaa taaatgtctt aacactattt taggtacttt 27600aacaaatttt aattttatta attaattttt tattagaatt gagacctcac tctgtcatct 27660aggctggagt acactcacag ctcactgcaa cctcaaactc ctgggctcaa gcaatcctcc 27720tgcctcagcc tccccagtag ctagaactac aggcatgaac caccatgccc ggccaactct 27780ttaattttct tagagacgga gtcttgctat gttgcccagg cagacagatt ttaatgtgta 27840tgatgcagtc tttgatgata agaaacttat aatggaaagc tgaggtgata gttacagtaa 27900atacattttg atgtataatt ctgtttgctt taatcattca aattgtagta aagcaagatg 27960aactgtctgc tgggatttga gcagaaatgg ataggaataa actaggaggt agaagagtta 
28020tcaaggttca caggactgat gggtgaagct agatttccag acccgggatg tcagtccttg 28080aaaagcagac ttggcaggca tagacgaggc agatagcagg ataaaggaga caaatgtaga 28140ttgttcttca gaagatcaga tggtagagtc taggaggtag tgtgttttaa tcagagatct 28200gagaggcaaa gatcattgca tgagatcagg gacccatgca aaggagtgag aaaaaaaact 28260gggttaagga gcctgctgca tggcaactcc tgggaacagt ggccactggg gcctgggaca 28320tgttgattgc agcccaggac tgttaaaacc agtgtgagag aacatgggta tggaagtact 28380agctagcagg atcatgaccc cgatgctggg atggggcatc aagcattagt acatggagat 28440tcagtacatc cagatgcagt acatggagac tatatgcgta actgctgact ttgggcttct 28500ttcagattgg agcagaggta gaggtgagtg ggaatattct caatagaggg aactaaatag 28560gcatacctaa taaaggagac caggatattg cagacagtag cctcatgttt ggctcacctg 28620ttcaaaaagt tctcttgttc ttgagcagtg gtgccttaaa aggtaacttg agaagcagtc 28680gattatttgt tcagcctgga gactcttggg atattttact atctttgatt gaatagattt 28740aaatgtacac agctctcata acttgcccca tgaagcatat ccatgaaagg cactatactt 28800gttaaaagat tggtttgtac tttttaaatg tagtactttt aataaaacag gaaaaataga 28860agttctgatg cagttatatg cattttatat agaatgtgtt cttaattgga aaaaatttgt 28920cgtagttcct ttgagttcat ttacagtttt tagtaggaat tgtattttct actgttgtac 28980ttgctgttac taaagaaaga tggtcgtgat taccatctga attttttttc tatacattga 29040tctttagctg ctacttagtc atttctgttt agacttgagc tctttttcat attttttttt 29100tttgtttctc agtatccaga attatttgca aacttgccca tacgacaaag aacaggaata 29160ctgttgtatg gtccgcctgg aacaggaaaa accttactag ctggggtaat tgcacgagag 29220agtagaatga attttataag tgtcaaggta tgttgtctac ttatcttctt tttttattta 29280ggtaaaatta acataaatgc agttagccat ttcaaagtgt aaattcactg gcatttagtg 29340cattcacaat gctatgcaac caccacctct ctctaatttc aaaacttttt cattccactc 29400ctcctcttgc ttatcccctg gcaaccattc atctgctttt tgtctctatg gatttgcctt 29460ttctgtatat ttcatataaa acaaatcatg caatatgtga ccttttttgt ctggcttctt 29520tcacttatgt aatgttttca tggttcatcc aggtagtagc atgtatcagt acttcattcc 29580tttgcatgac tgaataatgt taccatactt tgtttatcca cttatcagtg gtgaacattt 29640gaattgtttc taccttttga ctattatgaa taatgttgct gtaaatattc atgcacaaat 
29700ttctccacgg atatgttttc atttctcttg ggtataaact gaggagtaga attcttgggt 29760cttagggtaa ttctctaact tttcaaagaa ccaccaaact gtctttcaca ccaactgcac 29820cattcccact agcagtgtgg ggggttcctg attctccaca tctttaccaa caccattatg 29880tttctcaatt gtgggctagt ctcacatttg gaaagctagt gggagcagcg atccatctat 29940taaaagttgt atgaaattga gtaatgagcc acctctctct tgtagggctt attatgttct 30000tgcttaaggc aatcttcatg cattgtgaac agaattatac ataaatgctc agataaaagg 30060gcaaaccatt cttaaaggga gtagacaact agaggcagga gaccatactg aggcaggaag 30120ctggggtttt tatggttctg ttacttttga ctatatctca ccattgcttt tgtcaaagtg 30180agactaggtc taagtttttt tcaggtataa ggtgagtgtg gtaattaagg ggcatgctag 30240cagatcattt tgggtaatgc ttcacagtcc accactggtg tgtcattgtg gtcgcagatc 30300cagtatctta gctgtgtaat ttcagacatc agcaatatta gtttaacaaa gggcaattag 30360attccaagac aaaggaatcg tgtattattc tagccttatt caaacttgat ttataaatca 30420gtttagtaat ttatttattt gtttctgtat ttatttttat ttctttgaga tggagtctca 30480ctctattggc caggctggag tgtagtgatg caatcttggc ttactgcaac ctctgcctcc 30540tgggttcaag ctattctcct gcctcagcct cccgagtagc tgggattaca ggctaatttt 30600tgtattttta gtagagatgg ggtttcacca tgttggccag gctggtcttg aactcctgac 30660ctcgagtgat ctgcccgcct tggcctccca aagttctggg attacagacg tgagctaccg 30720tgcccagctc agtttagtaa tgtataactg ggttttaccc agttgtaaat tactcttttg 30780tcgtgttttt ttgagaactg gcaatgacgg agaaactaaa agtgccaggc tgttgccttg 30840ttcctgttat tttgccttag tttttttttt tttttttttt ttctctgaga ctgagtcttg 30900ttgtgttacc aggctagagt ggagtggcat gatctcggct cactgcaacc tctgcctcct 30960gggttcaagt gattcctgcc tcagcctccc gagtagctgg gattacaggc gcctgccacc 31020gcacccggtg aatttttgta tttttagtag agacgggatt ttaccatgtt ggccaggctg 31080gcctcgacct cctgacctca tgatccacca gcttcggcct cccaaagtgc tgggattaca 31140ggcgagaacc accgtgcccg gtcttgcctt agttatttct tgttccctcc tctagtccta 31200tagttctctg actgtattga ggaaatgtaa ttaaatatta ttatgttaat agatatttat 31260gtggttgaat attagaaatt ccttattttg gtcacatatc ctgatcagta gttggtcttc
31320tggagatagt gatttttcac tagagatgac tttaggacct attcaggttt tttttaagat 31380cccaatttaa ggaaagacta ttctcattat tgattttgct atatgcaggg aaatttattt 31440cgaaaggttt ttcagttggc ttttagggaa gattatatat tctctttttt tttttttggc 31500cttttcccac atgttctaaa aatgatatat tctttaactc ctatgaaaat acattgtttc 31560agtaattgaa gatgctgatt aaagtcatat ctctacacat tttttaaaat ttgagataga 31620tgggactttg tcccttctta caccattcac ttattcactt ggaaaaacta ttatccaata 31680cttatgtggc agacactgtt tctggcacaa gggattcagc agtgaacaaa actgcctttt 31740tggagtttac attctactag tggaaagcga caacaagcag atagacacat tcagtatata 31800attcactgtc agatggtggt ggtaagtcct atgtaggaag aaaagcaggg taaggaggct 31860tggagtaact ggagtgagtc atagatggac ttgtcaggaa agggtttctg aagaggtggt 31920atttgggcag agatctaaat aaaatgaagc aacaagccat gagaatatcc gggggaaaat 31980gttctgggca gaagcatcaa gcatagaact tgtggtatga tatttattct agcacacatt 32040aattttaaaa atgtataaaa gacatccatt taatcatatt aaagatttcc atgattcatt 32100tagacttagt cagaaaccaa atttatattt tctttttaaa taattttatc tcaactctta 32160ttttacccaa taggggccag agttactcag caaatacatt ggagcaagtg aacaagctgt 32220tcgggatatt tttattaggt tggtagccta tgaatgtttt taaagtaact gactctgtta 32280ttatttatca atcagtgctt tttttggtct tgttttttga agaactgata tttgaaacct 32340gtggtttatg tgaattatta ataagctaga ggacgtggat tctctatttc atcaaataat 32400acaaaacatt ttagatatta aattttggaa attatttggt tttgttttac aatagaaata 32460ctcctcaaag tggaatcgaa gtggttattc aaagaaatct cagagtagat tcttatatga 32520agcaaataat tgcccctaat ttatctctaa attttgtaag ttctaaattc ttttttcccc 32580cagtttctaa tttatctctt ataagtcaag agtccatctg gccaatttaa tttcagtgag 32640tgtaactatt ttgcatatat taaaaaactg tatatgaata cagaagatgg tatttaagga 32700tgaaaataat tattcaaatg tgatagcatt atggggagtt ttaaaataaa agttactgtt 32760ttattcttcc aaaaatttta ttataaagta tacagttaag agaatataca taaaatacat 32820atgcagctta aggaagaata ataaaatgaa tacttcatgt attcaccacc gagtttacca 32880ggaaaaagca taaacaaaat aaacctcttc cacgtaattc ctgggttaaa gagaagttat 32940agtggaaaat atttgggagc aaacgataat gaaaatacta tccattaaaa ttgttagatg 
33000ttgcaaaact gatttcaagg aaaatttata gtgttaaatg tttagaaaag aaaaaaggtt 33060agaagttaac cacttatgta tctatctcat gaaattagga aaattataga tataaactaa 33120aaaatatgtt aaaagggaaa taataaagat aagaatgaag tttaatgaaa cacaaaacag 33180agaagctcac aaagccaaga tttatttttt gaacaccgag tacaattgac aaatctctaa 33240caagtttgat taagaaaaaa gaaagcatga ataaacaatt ttagggataa aaagggaaac 33300atcgctaaag atatcccaga aatgtaaaag ataataaggg aatattatga aaatattcat 33360gccaatacat ttgaaaactt aggtgacata gacaaaaaca aaattgacca aaattgagca 33420aaaaagaaac aaaatctgag tagtcctgta acttagtaaa aattgagtta gaaaagttaa 33480agaagtcttt acacaaatca aacatcagac tcagttttct aggagagttt tgccaaacat 33540tcaagtagca gataattctg gtctattttt ggccccagaa gatatatttt acttgccatg 33600catttaatga gatagctgtt gatttttttc aatcaccgtg acaggtgttt tatattaggt 33660gttattcgcc agacatctag tccacctgtt gccagatatg gaattaatat tcacttattt 33720tgaattaaaa tttgttaata aattaataaa acaaagtcaa agttcaaatt attaaaaaag 33780taaaagaaat aaaatatatt ttatagagag cccttacaaa acagtaccaa cataatgagc 33840tttccaaatt ttgaatgggc aaaataaatg aataggcatt tcacaaaaga aggaagggtg 33900gccaataagt atatattaat ataaaaatgg ttacttgtaa taggaatcaa aagtgtttga 33960cttattgact aagagtcagt ttttgttttg atccctgtta gtctatccag aaggcatggg 34020tcttaataaa caccttgacc tcaacagttt actgaataca agggtaattt catatgcctt 34080gccttcttta agggtttgtt gtaaagatta aaataaatac ataaatatat ataaatacat 34140ttatatgtat ttatatgtaa ttacatacaa cttgccttct ttaagggttt gttgtaaaaa 34200ttaaaagaag tatataaata tatataaata cataaaataa atacattcat atatgtatat 34260gaaatcactt tgccaactat gaagcctgat tcaaatatga aatgttgttt gtttttccca 34320gagcacaggc tgcaaagccc tgcattcttt tctttgatga atttgaatcc attgctcctc 34380ggcggggtca tgataataca ggagttacag accgagtagt taaccagttg ctgactcagt 34440tggatggagt agaaggctta cagggtaata attataaata cagaaataga atgttataac 34500aaaatgtcat catgtcatca gattttggta aaaaaatgtt cttttttcct ctaggtgttt 34560atgtattggc tgctactagt cgccctgact tgattgaccc tgccctgctt aggcctggtc 34620gactagataa atgtgtatac tgtcctcctc ctgatcaggt gacaatttca tatttagagt 
34680ccaaaaccca acaaatgcta cactctttcc ttgtgagctt tacttctgcc aggtaatggc 34740aattgtcctt agaagaccag ctttcttagg gaaaagcttt agccactgtt tgctcaaagc 34800ataaaaagat tctgaattag atgcaaagcc tttttttggc ccagtgcaag tctgaaaact 34860ttgtaatcct tctgtgttgg ctgattgggg aaaaaaaaat gcaagaaacc taatgtatta 34920tattttcaca ttatcttctg ttcaaagatt acatacttcc attatcctgt caaaaaaaaa 34980actctgatac agaatcaagc atgtgaatcg taagcatgta agcaggtttc atagagataa 35040tttttcaact cttccttgtc ctgtgttgtt ccaactctta ttctccaatt tagaagcaaa 35100caaataaatg aatgaaagaa cagatagaca aatgaatagt caaaggtata aagtatctgt 35160atatatgtta catgtagcta ttatttaaat tatttagatt ttccttttga aataccttct 35220tggcacactt gcctaaatct agaaaataag cactgtgtga ataagaaatt atttacactg 35280aatattttgt aggtttttgg gtttttgttt ttcagacaag gtctcacttt gtcacccagg 35340ctggagtaca ctggtacgat cacaactcac tgcagcctct atggcccagg ctcaagcaat 35400ctccccacct cagcctcccg agtagctggg accacaggca cacgctacca tgcccagata 35460attttattat taatttttgt atagagatgg ggtctccctg tgttgcccag gctttcttga 35520actccagggc tcaagtgatc ctcccacctc aacctcccaa agtgttggga ttacaggcgt 35580gagccaccat gcccagcctt aagagtgttt gattttcatt cattttccta tatatattat 35640ttctgttggg gaaaaaattc caaggaagat aaatagtagg ctgttggtac atttctcaac 35700ttacttataa agctttttag atatataagg ttaatttatg aagaaaatca taagatacac 35760aatttaagat aatattttta attttatttt ttatttgtta aataaatttt tctcctttca 35820ggtgtcacgt cttgaaattt taaatgtcct cagtgactct ctacctctgg cagatgatgt 35880tgaccttcag catgtagcat cagtaactga ctcctttact ggagctgatc tgaaagcttt 35940actttacaat gcccaattgg aggccttaca tggaatgctg ctctcgagtg gactccaggc 36000aagttatatg aggaagttgt tatgacattt tatgagtgat aaaagaagta caatgtcaaa 36060atttccacct taaaaaatgc tattttttaa acaactttgg taaaactgta tagaaacata 36120aatttacctt tagttgaatg ttccatagtt ggaatatggg ttttgcagag aatttataat 36180tatgaagttt gatgtctgtt tctttaacat taccttaata ttggcaaaaa catgttggtg 36240tttgcaagga tattatttaa attgggatac catgaattaa atactacaaa caaaaataat 36300tagagttttt tgtttgtttg tactttaact tttaaaaaat aatcagttaa agttgttgtt 
36360ttgaagctca cattgttcca atctggccaa taggagcccc ttttgtatgg ctcctgtatc 36420tttatgacat gtcctcatca ttcttgaatc acttcctcac ttccagatac agtaagttat 36480tcttggccag gtgcagtggt tcacgcctgt aatcccagca ctttggcagg ccaaggcagg 36540aggatcattt gggcctagtt tgagaccaaa tcatggttgc acaaactgta cccactatgg 36600acaacagagt gggatcttgt ctctgtgaaa aatttaaaaa ttagctgggc atggtggcac 36660atacctgtag tcctagcttc ttgggagagg ctgtggcagg aggatcgctt gagtaaatcc 36720aggatgcagt gagccatgct tgtgccactg cactccagca tggatgacag aatgagaccc 36780tgcccccaaa aaagaaaaat attcttggtt tatcttgtac tttctgtatc ccagccctag 36840catcagcctt ttctctaaag acagtattat gattttaata tttacagtag atatttgaac 36900tgttacatta tagactttac catatatttt ctaggaagga ttattctatt actcttcttt 36960accacatttg tttggaatgt ctacagaacc tacagtttct aaatcagaaa ctccctaggt 37020ttttgctatt ttggcaagcc attgaagttc ttccctctcc ctttactacc agaaaggtgt 37080gtatttgtag agctctctat aatgagaaag cactctataa catggttgat tcatcatttt 37140ggagtagaaa agtatgaatg gaaagtcaga gacataaaaa taaagcccag aggtctgagt 37200cttagcttca ttacagactt tcttggggga tggttggtaa attatctaca cattctatct 37260tgtctttata attttaatag ttaaattttt accatgtgcc tcaaaaccgt tagagaatta 37320atgagctctt tgaaaaatgc ttctaagttt cttgtattgc tctaatagaa tgctatctat 37380gttattattt atttctgaga ctaaaattgt ttacatcttt aaactggttg tccttttgtg 37440tattttagga tggaagttcc agctctgata gtgacctaag tctgtcttca atggtctttc 37500ttaaccatag cagtggctct gacgattcag ctggagatgg agaatgtggc ttagatcagt 37560cccttgtttc tttagagatg tccgagatcc ttccagatga atcaaaattc aatatgtacc 37620ggctctactt tggaagctct tatgaatcag aacttggaaa tggaacctct tctgatttgg 37680tatcttgtgc agtcatcatt atacagttct gaaatataaa gctatatgtt ggtgtaaagt 37740tgcagtgatt tctctcctaa ccagccccac atattcttcc tggttggttg gttcttcagt 37800aaaatagtct tgtttcttgc ttacactaat tggtaatttg cattccttgt taagattttc 37860aagacagggc tgggagcaag gaaccaaagt agcgcgtggt tgtgattacc tttggtttct 37920ttgaggtttc tcttacctag tggctttaaa acatctttag gagcagttcc attttatagt 37980aaacttaaat tctgttatca tgaacagttg aggataatga ataatttgat acaataatgt 
38040aagaaattcc tgaaaacaaa gtgttatctg tgatactttt gctgcatagt aagcacaatg 38100aagtgtactg ataatgtttc aacaggaaag tgttttgatt aaatgtgggc agtatcactg 38160ttctactagc attcaacatc tcttctaaaa attaatagtg gttcactgta attttattgg 38220tacatgtaac atctgtacat gtgtttggtt atctatatgt ttcctggttt tttgtacatt 38280tgctttatta atttaggctt tttttttttt ttttttttga gacagtctca ctctatcatc 38340cagactagag tgcagtggca caattatggc tcactgcagc cttgacctcc tgggcttagg 38400tgattcttcc acctcagcct cctgagtagc tgggactaca ggcacatgcc accatgccca 38460gctaattttt gtatgttttg tagagacgag gtttcaccat attgcccagg ctggtctcaa 38520actcctgggc tcaagctatc tgcgtgcctt gacctcccaa agtgctagga ttacaggtgt 38580gagccactat gcctagccta actcagactt taaaaatata aaagcaattc atttttattc 38640ccaagaacag taaggtggtg gtttaatttt agtctttaat tctgttttta atttattcta 38700tttagaaatg tcccagaaac ttagtataac tttactttct gaaaatgaag aaacctgtcc 38760ttgggcatta gtgtgttgga tttaagcaac aaagttaaaa aaacctaccc tgtgttatgg 38820caattttcac ttgatggtgg ttctataaca caggtatcag tgaaccttta taaaagatga 38880acaacttttc agcttgctta atttcagtta attaacatgt atacttatct atgttaatgt 38940tttattgctt aaaatgttta atttttatat ttggtaaaca gatagttttt tctctccccc 39000tcttccttcc atctttcatt actacaattt accatgcaga gctcacaatg tctctctgca 39060ccaagctcca tgactcagga tttgcctgga gttcctggga aagaccagtt gttttcacag 39120cctccagtgt taaggacagc ttcacaagag ggttgccaag aacttacaca agaacaaaga 39180gatcaactga gggcagatat cagtattatc aaaggcagat accggagcca aagtggagta 39240tggctttttc cccctcatta taattgttaa aacttcttaa aaattgtttc acccttttga 39300tatatatttc tttgacttat aaacgagcta tatttataaa caagggacca gaacacatta 39360actcagtcat ggttatgtgc ttccttgctt tcaatgtttc attatcttat aaggaagaga 39420acgtatggtc tcttgaaaaa actgacaata agaagtaaca actggactac cacatttttt 39480tttacatcct taatttaact cttcgtcaat ttcttttttt acttaaggag gacgaatcca 39540tgaaccaacc aggaccaatc aaaaccagac tggctattag tcagtcacat ttaatgactg 39600cacttggtca cacaagacca tccattagtg aagatgactg gaagaatttt gctgagctgt 39660aagtaacaga ttctgttttg gaagtacagc tactattaca agtgacatag tattacactt 
39720aaacctttaa agttcgtgtt taaaataaaa atattttgaa tatttaaaag ctaattcaaa 39780aaatatgtgt cgtagctatg cattaaaaaa ccccaaaatg tcagaagtac agaagtcaaa 39840attgagtttt cattaaccag ttcatttgat tatatttgaa ttattcataa tggactcatt 39900taattttagt aactttgggc tgggtgctgt ggctcatgcc tgtaatccca gctctttggg 39960aggccaaggc aggtggatca cctgaggtca ggagttcgag gcaagcctaa ccaacacggg 40020gaaaccccat ctctactaaa aatacaaaaa ttagccaggt gtggtggcat gtgcctgtag 40080tcccagctac ttgggaggct gagacaggag aattgcttga acccaggagg tggaggttgc 40140agtgagccga gattgcacca ctgcactcca tccagcctgg gccacagagc gagactgtgt 40200ctcaaaaaaa aaaaaaaaaa atttagtaac ttcgaagaaa taagaaggaa aattaaaagt 40260tgaaagtgat tctaatgtat agtttataaa attttgttat aaaaatacct gttttgcctt 40320caaaataatt tatattaata ttttattgac ctcaagaaca tttaaataca ttcagattta 40380ttcatttgtg gaccacattt gttatacatt ggatttaaag gatccttgca attgagttta 40440tggccaccta tgcatctgag acccatggac tgggaaccat tctaggtcaa tgattcagtg 40500tgattcaatt taagagatgt ttattcctgg tctttagaag ctgctacctt ttgttatcta 40560attttgcagt actttgaagt atgtatgtat gtgtacatac gttagtgcta tgtatttatt 40620aaagaagaat cagaaaacag aggtaaggaa aaataaggaa acaaatttct gttaagccca 40680ccacctccca aagcatattt gtttatatgc ttatatatgt tttcctatta tggtaagaac 40740agtctgtaca tattgctata tagcagtccc cctttatcca catacatcct gaaaattgtt 40800ttacatttta aatgttaact actttattgt ttttaaatgt cattttatag tgtagctatg 40860ccacaatatc caatttttag acatttaaat tgctcccagg caatgtggta atgaacattc 40920ttgcagctga atatatgcac atatctaatt gtttcactag gatagaggtg gaattgtata 40980acagggagct cacatttttt aaggcttttg aaatgtattg ccaaattgcc tgccagatat 41040actgcaccat cactaacatt gtgtgttgca gtatttttct aaacttggcc cttttgattt 41100tagaaaaatg atatcaataa tttacatttc tttgattaaa gtgtagaagt tataattttt 41160catattattc attgtcattt gtattttatc ttttctaact tgtctcttca tcccctttgc 41220tccgttttct attggagtgc aactttattt gtaagaattc tttttaattt ctgtgactgg 41280aatttttttt tctagtttgt tatttcccgt tcatttctta aaatataatt gtgtttgcca 41340acaatccatt atcttttgtt ttgtaatggt agtatttata catattaaat tatctctttc 
41400ttttttcaga tatgaaagct ttcaaaatcc aaagaggaga aaaaatcaaa gtggaacaat 41460gtttcgacct ggacagaaag taactttagc ataaaatata cttctttttg atttggttct 41520gttaagtttt ttgatggctt ttccatatgt tgtaacagga aaaaaatggt gtctatgaat 41580ttcttcttaa tttaacaaat ttggttaatt tataaaatca cagattggta aatgctataa 41640ttatgtaatg atcaggattg agattaatac tgtagtataa attgggacat tataacagat 41700tccatatttt atttcctaaa atctaaattc agtctttaat gaaataatat tagccaaatg 41760gtggaactaa tttatttctt ttgaggaaaa gataataaag aatgtaatta aatttaaatt 41820tcttggaatt cccagttgta tattcatcac ctttgtagca tttgacaaat tttatgctta 41880gcagcttctt cactgttttg aaataaaata tcctattacc tactgataca attatctgtt 41940ctttgtatat caaaaaatgt gaaatttaca cataattcaa atacatttaa ttatccgctc 42000aaccagaaat gaaatcacat ccctctacta tactacatcc agctccaagc ccaagatatt 42060taaatgacat ccattcctct cctagttcca gttatgattt tatcttgata ttctctcata 42120tatgaactaa attataaagt tagccaccat caatacaatc tgcgtatcta atatcttaac 42180tatatagtaa tggggtaagg gaacagcaaa aaggagaaca ttaattaaaa tatacaagta 42240agcctgggca acatagtgag accccatctc ttaaaaaaaa aattagccat gcatgatggt 42300atgcctctag tcccagctac ttgggaggct gaggtaggag gatcacttgc tcccaggagg 42360ttcaaggttc taaaccagca aagctcagaa tcccagggga tagaaacaaa gacttagtgg 42420atcactagta ttaaactgag acacgtcacc ctgcattgca ctttgtttct cagttctttg 42480atgaaatcac tgagctgaca tacctgccct cttttcacca taaagtgagt ttcatgatca 42540gaagcaatgt ctatgggata gcctaacaaa caatgtaaaa accatttagt aagttcatga 42600agggtggtgg tggtaaaaat ttggagaaca tacaaaacaa atacaattcc aaggtgtgtc 42660ccctccagga aggacaaatt gctgcctgct ctgtgataga agaggatcag atgtaatcaa 42720cctgccgtca gacttgggct gttctctcct gggtgtggac ttgcctggtt ggtcactgct 42780gctgacaagt aggctgtcaa tatagctggg ttgtcatgtc agctgtggtg agggggaagt 42840ccacattgtg gaggccacat ccctgcactc ttggccaatt tgaccatgaa tcttaagcac 42900tggggtggct ggaaaagaca gccgattgac atccatacag aggtcatctt gaccacttga 42960ttagtataag cactgaaggc ttttaactga gcattcacat aggacacaaa tattctgatt 43020ctttgggccc attccaagaa ctctgggcat acttttcctc cagacctcat acccagttgt 
43080gttctttcca aatttctggt catctggtta tgttattagc cactatctgt gaatcagcat 43140agatttttat atcagacatc tctacctcct gacagaatgg aggagatatg ttacttaaca 43200attctgttcc cttggaagat ttcctgtctc cactgtttgt aagggctact ccctcaatgt 43260agcagtaatg ctttcactct gatgggaagt cacagtggaa ttctgggtct ccaagaatta 43320gtgttagtgc atacacagtg tctgataatc cccagagtgt ctggtgccct tggatcctgt 43380gaagaaggct tggagaaaag aagattcatg gcaagaactt gtgatgtgat gacagggcct 43440tttctctggc tcttcattct tagtctgacc taggtgtgag aattaggtca ggggccatga 43500ctatattgtg gtgactcaaa ccaggccttt gtttactaac tgggagattt ttacattgta 43560agaatcaagt aggatctttg cccatgtatt ttggtcttaa gaacacaaat gatatggctc 43620caatgactgg aggaacacca gggtccttgg tctcacgctg atttagataa aacgactgtc 43680aggcctctga gcccaagcta agccatcctc ccctgtgacc tgcacgtata catccagatg 43740gcctgaagta accaaagaat cacaaaagca gtgaaaatgg cctgttcctg ccttaactga 43800tgacattcca ccattgtgat ttgttcctgc cccatcttaa ctgagcgatt aaccttgtga 43860aattccttct cctggctcaa aacctccccc actgagcacc ttgtgacccc cgcccctgcc 43920cctaagagaa aacccccttt gattataatt ttccactacc cacccaaatc ctataaaatg 43980gccccacccc tatctccctt cgctgactcc tttttcggac tcagcccgcc tgcacccagg 44040tgaaataaac agccttgttg ctcacacaaa gcctgtttgg tggactctct tcacacggac 44100
64 16869 DNA Homo sapiens 64
aagctttagt agagatctca aaaatggttg gatggtagca aattactaag aactctcaaa 60gtttctaaag ccttagtttc agcttgctag aaaacctatg ttgagtatta tggctagttc 120catagttgag ttgggaaatg tctttgagga gacacttttt cactttgtat tcatctgtac 180attttctgtt acttgcattc tgtcatgctc aggctattag agcaggtaca tttttataac 240tggaatgttt atgtgtagtg aagctctgag aggactttgc attagatctc agcagcataa 300tcagaaggtt gtcctttgtc tcagcaattt ttaagctaat agtagcagaa attgcagtgg 360aaatagactg ctttgccaca acattcagaa aatcatttat ctttttattg cagttcttgt 420caccaaacaa tacattttag tacttctcaa attgcagaac tctcataggg ctgggaaaat 480gcctgtagac acatacatac tatgaatgtg ctaatgtttt ttgtattttc atagcccatc 540aaagctcctg agtcagtttc cactataatc actgcagaat caatcttcta caaggtaagc 600ttttgtagag ttactgaagg aagagttggg cctagtgggt aatgtgccac taaaatgttg 
660gattagtcta aaggtctctg ctactcttta tttgtataag gtgtgattat actttttgtt 720cccttcttag ctgttttccc ccataagtgg ctgttattaa aacatctcat ctagagctga 780agtgggagga gaaagtgcct actgacacat gatgtgagga tcttaagtat ttttttttag 840tgtagattgt aggaattatt cttaaaatgc tgattgtata gtgtggagcc atggaagact 900gagccgttag tgcgatggca ttgaagaatg agaaggacag agacaggatt tggactagta 960gaggttgtcg actgtggtgt caaatgggta gagtaggccc agagattcta aaatgccttt 1020aagtggagtt gagctgagta agggcagtag tgaggattaa cacctactag aaattcatag 1080tgagaggaat tccaagatgt tttgataaaa gaatgaggag gtcaggtttc ccagggccaa 1140agtccatgaa catctgatac ctcagtgaga gaagtgacag attgttgtgt ttaaaccaga 1200agtcttagga aaggaattag aacatagacc cccaaggctc ggcaggcctg gcacggcaca 1260ggcagcaacc attgaaggct atttggtgtt tcgggatctg aactgtcatt taggggacag 1320tggtgtgagt tagtacttta tacttgaccc aggtggactg agaaactcaa gtgatgatgc 1380ccttaagtat actttttttt aagcccacaa tctatatagt cgaagtctgt tcctcccaac 1440aggggtacac tggcattcct cagcagggct gggaaaaacc aacaacaaaa aaagtctgta 1500cacaggcaaa catctctctt atttttccaa catttaatac attgttaata aaatatctaa 1560agtttagcaa acagttgctg tgtatcagtg gctgagcatt ttgcatgctt tatttcattc 1620agttcactct atgaggtgga tactactatc cccattttct agatgagaac attgaggcac 1680agcgaggtta attaacttgt ccaagatcac atagccaaca agtcatggag tgaggcagtc 1740tcatgccaga gcttaagcct agagcatagt tcctggctct acagctttag caagtgactg 1800gctatgtgac gaggaccaac ctctctaatg tctcatctgt aaaataggaa ttgtaaatag 1860ttactacctc agtgggtcaa atgaaatcat atgtgttaag cacttagcag agtaagcact 1920caatgaatag taggagttat cacatcttcg tatttgtgca ttaccttcac agtttacaga 1980ttaaggccag aagcaacttg ttgagctacg ggtttagtgt actaacagtt tccatgtgtg 2040tctccatgga agggtgtgtg ggacctgtta ttgtgactgt ctgtactttc gtattgttgt 2100ctgccaccca tgtttattaa atgataagga caataatgca acaaagtagt caagtaatgt 2160tgcaaatgcc cagtattgta gtggctatca cagcagtgcc actggcaggc agcaccatgg 2220tggcaagttc aagaggtcac tgccagccac tgagctagag
cccagatcag gcatgcaaga 2280ggagcctgag tgggagccac tggggatcac ggccaagagt gtgaccaccc aagacccaga 2340atggctgagt ggcctccctg gagcatggca gtggcagaac aactccatga actcagatct 2400ggtgatgcct aaactagtgc tgttctcgtg tggacccctt ttctctacca gaaaccttga 2460atcctctcag caaatgagga gactactcag atcagtgact tagtcctgtt tggtgttata 2520tatgtgtaca caacacagca catattaata aatacctact atgtgccagg cactgcctac 2580cactggaatc tttcactaag acattgtttt tactttgcat ttctgccttt acactatgaa 2640agtagatgtt ttggattcat attcattcag catacatttg aatatgctgt gttatgcata 2700gtaagcctat gataagcaag tattctcatt tagaatttgg gaatattgat tatacatgtg 2760gacaaacaaa ccataaatgc aaactattta tatgataaat aactttggac tgatggctgg 2820gaggaaggac cagctattga tgggtaggaa ctagcaagta gcggactgtg gcctgcatag 2880accagaccca tccgtagtga tccagatgaa acagccaccc tcagacactt ggataaaggg 2940tccaccagga aaaaactcct ggcctatcag gtgctatgtt acagttcagt tactggaagt 3000atttcctcaa aagtgttttt atggttgagg tacacattcc tacagcttta cctgctgcca 3060agtccctgtt tcaagggaag cagcaatgaa ttacactgtt cccgtagtca aggacagtat 3120atcttaccaa gaactatacc cacttaagga ggtgctggat gtcataaaga tttggatcaa 3180ccattatggg tgttcagagg agagattatt tccagctcaa gacccaggga agaggacata 3240ggatggatac cagagtcata gggaggattt aacacaggac atgtacacat tagttagttg 3300ggtataaagt ggaacagaaa tgaatgagac acaaagcctt gaatgccaga aatactagta 3360gtcctgttgt ggaaggatat aaaactcaac tgggagtgga agagaaaggc agcagtgagt 3420ctaggagatg tacagtaggt tgaggtaaac atatcctgaa gactataatc caaagattat 3480ttttggtttg aatttgtttt ggtttgaatt catggtatct attttctttg agtggatggt 3540tggggagggt ggcatgtaga atgcattctt accaaatcag catgattttc aagacagtac 3600agagaaaaga ctgctgagct gatgtaggag ctttggctgc agtctctatg gctttcagca 3660agccgtttaa ccttactact gcttcatgac tgtggctaac aaagtaggga tagtacggag 3720cacagaggat ttttagggcg gtgaaactat taatactctc tttgtatgat actataatgg 3780tgggtacatg tcattataca tttgcccaac cccacagaat acacagcacc aagagtgaac 3840cctaatgtga actctggtct ttgatgatgc tatgtcagtg tacgttcatc cgtgtaacaa 3900gtgtaccact ctagtggtgg gaggggttat tgataatagg ggaggatgtg catgtgtggg 3960ggcaggaagt 
atatgggaaa tctctctact tctgctcaat tttgctgtaa acctaaaacc 4020tctgtaaaaa ataaagtcta ttttttaaaa agtggggatg gtattacggc aatataaaat 4080caaaatactt tatgaacaaa tcttttctcc agatgtaaac tgtcatatat gcaccctcgt 4140atgtgtatgt ataattttca ttcaaacgtg aaacaacttt agaattggca ccaaacatat 4200aaacactgat acattagact atctcgaaca ccttttactg accactttga aaacttgctt 4260acctattaag gttcattcat agctgtgatg ttctattttt attttcaatg tgggattatc 4320ttctgtttcc cccagggagt atattaccaa attggtgatg ttgtttctgt gattgatgaa 4380caagatggaa agccctacta tgctcaaatc agaggtttta tccaggacca gtattgcgag 4440aagagtgcag cactgacgtg gctcattcct accctctcta gccccagaga ccaatttgat 4500cccgcctcct atatcatagg taagtttgac aaatggcaca ggtttttttt taacttagtt 4560aactctccaa tattatgtaa aagagtgtgt tagtcagctt gggctgtcag gacaaaatat 4620cacagactga gtggcttaaa caacagaaag tcactttctc acagttgtgg aggctgaagt 4680ccaacatcaa ggtgctggca acacggattt ctggggaggc ttttcttcct ggcatataga 4740tggtcacctt cttgctgtgt cctcacatgg cctttcatgg agtgagagct ctttggtgta 4800tcttcttata aggacaccat ttctgtcaga tgagggcccc acccttatgg tttcatttaa 4860ccttaattgc ctccctaaag gtctcatctc caagtaccat cacattgggg attagggctt 4920caacatataa atttggaggg tggcgggggg ggatgcaatt cagtccataa caaaaaaagc 4980atgagtatta ttaagtacaa aaaaattaga gagctttata gaaaatatga ggcattttat 5040gtagctggag tgtgagtgct atcagttatt ttgagttaga gcaatgtgca tctactaaga 5100agtggtatgg ataagatttt tttggagtga cccagggtta aactgtacta caagaatgta 5160ttgctcagga actaggttat ttaggttact tatttataca aacctattca aaaataattt 5220aggaaagaac tatcccagtt atcccatact tgcaaattct caatatgtgt gcctctgcat 5280gctacacatg tcatcttagg cctttatagt ataaaggctg atagttgaaa tggcagctgc 5340tgtgcttttg ttaatttcaa agctgccaaa acagttgtga gatagactca caagaattta 5400ctgattaata caatttttaa agttttcaga tttttacagt tacttcagac tttttatctt 5460tctgcagtga gcatgcatca ttacttttgc atcctgagaa caagcataag tgtgtttttg 5520gagagaactc cagggacaaa taatatacca ctgttattct cacctatatg tcaagtttga 5580tacattacca aacaattcta gccttctgct tataagtata tagaattttt atttacctta 5640tctatggatc aggatctcag cagaggcagt gatgtatcag 
aatcaccttc gggattcctc 5700tactgcctcc tctttctaat ccccagattc tgatatgcat ccttgtccta cagcgaggca 5760gcatggcatg aggtcagaac accagttctg gagccagact gtctaggttc acagcctgcc 5820atttaccggc catgtgactt tggcaagttt cttagtctct cttgcctcac tttcctcata 5880tgtaaaatgg gaataataat agtgcctacc tcagaaggtt gatgtgagga atgaaggtat 5940tgatacatgt aaacttagag cagtgtgggt acaaaataaa catgatgcaa gtgttcaatc 6000actgtttttg ggagaatgcc atattcttta agccgttaaa gaagaaaaaa tgattaagaa 6060taatttcaaa gtaatgcatg tttcaagggc taatgccagg ttgctcccag agtggtctct 6120cccagtgtct agaaatttta acatcttatg aaaatgatat atatggtcaa aaatgtattt 6180aacctttccc ttggctgcct tccagggcca gaggaagatc ttccaaggaa gatggaatac 6240ttggaatttg tttgtcatgc accttctgag tatttcaagt cacggtcatc accatttccc 6300acagttccca ccagaccaga gaagggctac atatggactc atgttgggcc tactcctgca 6360ataacaatta aggaatcagt tgccaaccat ttgtagttca caaattaaaa ctgggtttcc 6420aggcctggtg tggtggctca cgcctgtagc cccagctatt gcaccactgc tctccaagct 6480gggcaatgga gtcagattct ctttcttaaa aaaccacaaa aaaactggat ttccagttct 6540ctaatattct tagtaccaca agatatgtca taggtatctt taaatgaaat tcttagctgg 6600aaaagtgact aaaaagtttt tctcctgcta cctagtaata aacaaatcat tgtttattac 6660tggtcactta gaaaattaaa agggataggg ccaggcacag tggcttatgc ctgtaattgc 6720agcactttta gaggccgagg caggcggatc acctgaggtc gggaagtgga tcgcctgagg 6780tcaggagttc gagaccagcc tggccaacat ggcgaaaccc cgtcgctact aaaaatacaa 6840aaattagcca ggtgtggtgg catgtgcctg taatcccagc tatttgggag gctgaggcag 6900gagaatcgcc taaacccagg aggtggaggt tgtagtgagc caagattgca ccgctgtgct 6960ccagcctggg caacagagtg agactcttgt ctcggaaaaa aaaaaaaaaa aaaaaggctg 7020ggcacagtgg ctcacgcctt taatcccagc actttgggag gctgaggcag atggatcgcc 7080tgaggttggg agttcgagac cagcctggcc agcatggtga aaccctgtct ctactaaaaa 7140tacaaaaatt agccaggtgt ggtggcgcac acctgtagtc ccagctactc gggaggctga 7200ggcaggagaa ttggttgaac ccaggaggcg gaggttgcag tgagcagaga tcgtgccact 7260gcactccagc ctgggtggac agagcaagac tccgtctcaa agaaacaaac aaaaaattaa 7320aagggataga atataatgaa atatattttg aacttaaatt atattctata tgtgtatctt 7380cctaggcaaa 
agctgtaatt tccagagaga ccattaggaa caggtagtat ctatttttct 7440ccattattta tttctagaaa ctcataaaat ggattgtatt tttctataag aacaaaatat 7500taattaaggt atagatgact gaccaagggc ttaatcaaat aaaatgacta acagcatcta 7560tcataaagcc acacaagcct tatgttctca tctcaaaaat gctgtgacag ctttttggct 7620gctttaacca taagaaaaat gattggtgga tgattttatt agcccaggct tttaaaaact 7680ttcatctagg ccacgtgcgg tggctcatgc ctgtaatccc ggcactttgg gaggcctgag 7740tggatggatc acttgaggtc aggagttcag gaccagcctg gccaacatga tgaaaccctg 7800tctctactaa atatacaaaa attagttggg tgttatggtg catgcctgta atcccagcta 7860ctcgggaggc tgaggcagga gaattgcttg aactcgggag gtggagattg cagtaagccg 7920agatcgtgcc actgcactcc agcctgggtg atagagcaag actgtctcaa aaaagaaaaa 7980aaagaaaaaa ttttaattta atccttctgt agaaacaggc attcagaacc attccattga 8040tcttaataaa gctgctcttt actgtttcta gtcaaaaatg agacttcgat caaaccataa 8100gattttatac tgcagatagt cagcttcacc aaagccgcag aggaaacatg tcgagatcag 8160gcttcctgct tgatagtctc ttgactacca ttaaaacgaa tattgggagg tcatgaaagt 8220cattggtagg ccattagcat tgatatcttt aaaacatcta ccctaaacca tctgctatgg 8280acccataata agaggcctgt tgtatatgaa attgtctaga attcaggtgc aggtctttgc 8340cggttaagta agggagcaac acgtaaaatg ggagaggagt ggggtgtact cacttgcctc 8400ctcttttgtc ctgatttaac cagcattttt caaccctggg aaaatttgca gaatctaagt 8460tgattgtaat gattttgagc tgcagcagct ttaactctta ccctttttcc acatagttat 8520ggtgtttgag ttggaaagaa acaactatag gtagctacac gtacataatt atctctttat 8580tcacaaaggg tatagtaaaa ttgattgtaa ataactttct aagtgccaat attcaaaact 8640tttggattaa aatgtatttt tcaccgtgca tttactttgg atgtatttat ttcatttaaa 8700caatttaaat ggggctcttt aaccaaaaat ggtatttaaa accaaaacag tatcgtactt 8760agaatttgga gtagaggccg ggcacagtgg ctcacgcctg taatcccagc actttggaag 8820gctgaggcag gcggatcacc tgaggtcagg agttcgagac cagcctggtc aacatgaaac 8880cccgtctcta ctaaaaatac aaaaattagc tgggcgtggt ggcgtgcgcc tataatccca 8940gctagtctac tcgggaggct gaggcaggag aatcgctgga actcaggagg cagagactgc 9000agtgagccga gatcgcgcca ctgcactcca gtctgggtga cggcatgact ccatctccaa 9060aaaaaaaaaa aaaagatttt ggagtagatt catcattaat 
aagtaacaga ttttaggaaa 9120atcaaaaaat ggctaataaa atgaacacaa tgtaaaacat ttattaaaat gtagactttt 9180aaaaatctat aaattgatca tctgtttata aattggcaga tggttgtgta ccatctttta 9240aaataaagat tgaatttcac ccagtgtgat ggttcccatt gcttatattt ctcctgctga 9300ggccggacct gatatggccc tggtctgtgt tcccagcctt gtttcctcat taccactaaa 9360atctttcccc tgtatgcccg cccaattttt ctggctctga gtccttgttc atactgttct 9420ctccaattct accttccaaa ggcctttctt aacaccttcg gattctttct ttgagaactt 9480tccagattcc catgcctttt tggaatcaat ctctatccta ttgtcatcac atttaagttt 9540ctacttccat catcctcact cctatccctt tggtcctggg atgacaggga tgctgtgttt 9600tatttactca tctttgtaac ttccacataa cctaaccccg gttcttgctt atgggagatg 9660ctgattgtag ggtctgagtt agatactgtt aactaaaatg cttgttgata ttttagttat 9720taattcatat taactttggc tgaaactttt aaattctatt gtgaatagtc aagtaaaatt 9780tagattgtta cattctgggt tagtattaga ttgtttttaa gattgtttta aacaagatgt 9840ttttaagatg agttttaaat agttctctta acacaaataa agcttaatat gagtatttga 9900aggaaattat cccaaaccat tccagttcct ggctgtgaaa ggcttttcca ggcctaataa 9960gttttccact tcagccgtaa gtaggtgaaa tcaaatgaac aatagaggga aatgtattta 10020tttgctttat acacatgcat gtgtgttgtg tctacatata aacattgcac acgcttagaa 10080tgaagtttct gtcatgccca gaaaagggag aggcattttt gtggattttg tctggctgcc 10140ctggggatgt ttgaagaact gtgctgttta cttcatacca ggtgtgtgag ccataccttt 10200ggtaggaggg tatacctcct acacccaaga aatataagcc aggagaaggt ctgtgccaag 10260agaaggaacc caaatgaccc acaagaggtg ggccattaat tattgggtca gatgcataaa 10320tgcacagtaa tttatttaag cacctcttaa tggtgaccca caaggaagat tgctcgtagt 10380agcggaaagg ttcacaataa ataagagaaa aaagcagaat gtagaactgt atgatagcaa 10440ttctgcaaac aagaagcatc ttttataaaa gatggaagga gcccaggcac agtagctcat 10500gcctgtaatc ccagcacttt aagaggctga ggtggaggat cacttgagct gcagtgaccc 10560atgattgtgc caccactcca gcctgggtga tagaagtgag accttctctc aaaaaaaaaa 10620aaaaaaaaaa aaagacggaa attcctccag aattttaaca tgtcaacaga ggttttctgc 10680agctactttt ttcagcttta tacttcgcag tattttccaa attttctcta acaagcagta 10740ttttccaaat tttttacaat aagcacacac acacacacac gtttgtttgc ataagtgccc 
10800aactggtggt gaacaaccgc tggcttttag tctatacata tctagaatat tttataaata 10860gtagttctta aacccttgaa agggagtgaa tgaccagctg agaaaataaa gtcagtgatt 10920tcattatttt cctatattca catcatgatt ctaggaaaga acttgggagt gacttccttc 10980agcttcagcc actcctgggc caggcgcatg cttagctctg tggtaaaggt caccagcttc 11040ttctgcaggg tgcctgtatc atctgaattg gaggtttggc gagggtaaga gactgatgta 11100ggttcaagtt tttctttcct gtcctccact tgaaatctgt cttcccttcc agactgcctg 11160cgctgctgac ttaaggcccc aacaccaaac acagaagcaa cagccttaca cagagtgttc 11220agcaagctcc aacaattgtg taaggtaaag tttcctttat agattccttt tctatatcgc 11280tcctagtggt tctgtttctc tgatcgaatt ctggctgata acagttgctg agactctgaa 11340agagaaggca aggaactact gtttctcatt ataaactgtt tagaattatt tggccatctt 11400tttgctatga atatgtagtg ctttgataca ttttttaaat caaaaagtaa tgaaagagat 11460cacataggga aagatagatt ggattatttt taaagtttat atactaaatt gaaaagcaaa 11520gaataaaatg ggagaaacag ctccctcatg tggctgttgg caggaagctt ccattcctct 11580ctgtgggcct ccacaggttt gctcacagca aatggtccgt gacagaaaga cgcaagggca 11640gttgcaccca agatggaagc caccatcttt tctataacct aatctgaaag aagggacata 11700ccagcacttc tgccatatgc tgttgggtca cacagaccaa ctctggtaca gtgtgaacac 11760aggaccacac aagggcgtga attccaaggg cagagaccac tagggaccac ctcagaggca 11820cagagggaca ccctatccag ctggtggcca atgtaaatta acatagcttt ttagaatagc 11880aatatgtatc tataatctta aaagtattaa aagtacttct tgatccagta atttcatttc 11940taagaatcca tgctaagagg atttaaaatg tggaccaaaa aatgggtata aaaagaagtt 12000gttaacagta tttaaagttg tgaaaaacca gaaacaatct aaaggtccaa caataggaaa 12060atgaattttg atatttttct aatagaattt tatgctgtca tcagaaatac catttacaaa 12120taatttttaa taacgcaaaa aaaagtttat aaaatgttta gtgtaaaacc tggacacaac 12180tacataatga ttctgatttt gtaaaaaaaa aaaacaaaaa cacacacata tacacatgca 12240tacatatgca tataaagaaa actggaacaa acaaaataac aagcatagtt ggaattacag 12300tcattttaat attctttatg cttttaaaaa ttttgaagtt tgtattacta gcatccacta 12360cttacgtagt caggaaaaaa atacaacttt aaaatagata tttaggtcca aagatggtaa 12420tctaaatggt gttacaggct gaatgtgtgc ctgatcccca tgccccaagt tcatatgtta 
12480aagccctggc ccccaaggca atggtattag gggagtaggg cctttgggag gtaatcagat 12540ttctacgagg tcatgagggt ggagcccgca tagtggaatt agtgtccttt taggaagagg 12600agaacagacc aaagccttcc tttctctcct cactatgtaa gaagacagcc agaaggtggc 12660cacagccagg aagagagctc tcaccagaac ccaaatctgc tagcaccttg ctcttgggtt 12720ctcagcatcc agaactgtga gaaatgaatg tgtgttgttt aaaccactca ggctacggta 12780ttttgttgca gcagcccaag ctgacagaga tagaaacaac acaaggaccc atcagcagac 12840gaatggatga tcaaaacgtg gtgaggtcgt gcagtgggat attattcagc cgtagaagga 12900atgaaattct gatacatgct ataatgatga accttgaaaa catgttaatg gaaataagcc 12960aaacttaaaa ggacaaatat tgtataattc cacttatatg agttagttac ctagaatagg 13020caaattatgt catagataca gaacattaga ggttaccagg gttgtgggaa gaggggtatt 13080gtgggtacaa attttcggtt tggagtgatt ttgaaaaaat tctggaaatg ggtagtgaca 13140gtagtcaaca tgatgaatgt acttaatgac actaaattgt acacttaaaa atggttaata 13200ctgggctggc gcagtggctc atggctgtaa atcccagaac tttgggaggc caagacaggc 13260ggatcatgag gtcaggagat tgagaccatt ctggctaaca tggtgaaacc ctgtctctac 13320taaaaaataa aaacaaataa aaaaaaaatt agccgggcat ggtggcaggc acctgtagtc 13380ccagctactc gggaggctga ggcaggagaa tggtgtgacc tgggagtcgg agcttgcagt 13440gagctgagat cgcgccactg cactccagcc tgggcaacag agccagattc cgtctcaaaa 13500aaaaaaaaaa aaaggttgat acctgggtgc ggtggctcat gcctgtaatt tcagcacttt 13560gggaggccaa ggcaggcaga tcagttgagg tcaagagtta aggaccagcc tggccaacgt 13620ggcgaaaccc catctctatt aaaaatacaa aaattagtcg agtgtggtgg tgggtgcctg 13680tagtcccagc tgctgggagg atgaggccta ggaattgctt gaacccagga ggcagaggtt 13740gcagtgagtt gagattgcgc cactgcactc cagcctgggg gacagagcga gacttagtct 13800caaaaaaaag gttaaaattg taagttttgt tatgcatatt ttaccataat ctttaaaaaa 13860tagatatata ggagataaag tcaacagaat ttaataacca gttgtaaata gagactgagt 13920gaggaggatg aattaaggaa gacattgagt acaacttttt ggtaggtgaa aaactcttaa 13980aaaaatacgt gggcaaagat cctacttgat tcttataatt taaaaatctc ccagttagta 14040aacaaggcta ggtggagatt tgcatgtgat gtgaggtgtg tgttctgttt tgtaatgtga 14100ggactgtgag ccatctcctg gacttgaata tccattagat aattgaaaat acggatttga 
14160gaactcagga gacgtgcaat gcagtaacaa aactctgcac ctagttgatt tctgtctcct 14220aatttaatgc ttttatggga caaactgtta ggcaggtggg caagatggac agccatattt 14280ttgtgggttt ctggcctgtg ggccagcctc agtgctcact ctgaggtcat gtccaaactt 14340agaacacatt caggcctacc acagtcaagg ctccctttct caactctagt cctctgcaca 14400aatatccgaa gcctagaaat aataatcatc tgtccttgtg tcttgcatta tgaaagccta 14460ggaaagggcc ttgggaatta agaagaatgg aaaaactggt ctaactgctg catgcttcag 14520cttgcagggg aatcactgaa atggggacag gccataaaag gacaaccaga agagtggctt 14580cagcaaaggc atcgtttttc agagcaagct agagaatcct gccagcgtcc tcaggcaggg 14640cccctgggca cagaggttag gcaagggagt gtcccagcat gttgatgccc tgagcatcag 14700aataatgcca tagaggagct tccaaagagt tcatttcagg ttttgtaagc cgaacatttc 14760taggcaaata aaatttgatt ttgtgaataa agcttgtttc ttcaactcca gtgcagattc 14820tcatagattg atagtggctt gtgatccaga taaagaaaac aatttttcaa agattcatat 14880tctttgtaga tgtacggatt tagagaccat ctaatctaac tccctcattc tacagatagg 14940aaaaatgagg cctaaagaag ttaagaaaat accatggaaa tgtcactgct gaactgccat 15000acgtaggatc cgaaagaaat tgggtaaatg ctactgtgag aaatacagta ctaggtccaa 15060agaatctaat acaaattaaa aatctaaatg ttatttctaa agcatccctg cacatggctg 15120aacttacata gtttcatttt ctttcttttc tgttgaagaa gaggcaattg gctgggtgca 15180gtggctcatg cctgtaatcc tggcactttg agaggccgag gcgggtggat cacctgaggt 15240caggagtttg agaccagcct ggccaacatg gtgaaacccc atctctacta aaaatacaaa 15300aattagctgg ctgtggtggc cgctgcctgt aatcccagct actccagagg ctgaggcagg 15360agaattactt gaatctggga ggtggaggtt gcagtgagcc aagatcacgc cattgcactc 15420tagcctggat gacaagaggg aaactccatc tcaaaaaaaa aaagaaaaaa agcaatcact 15480aacctgtgtt gtttattaaa catgacagac tggcatgaag taattaccaa actgtaaaca 15540aaaaagctac aatctgccag gcatggtggc tcatgcctgt aatcccccac cttgggaggc 15600caggttgggg gatcacctga ggcctggagt tcaagactag cctggtcaac atggtgaaac 15660ctcgtctcta ctaaaaatac aaaaattagc ccggcgtggt ggcacatccc tgtaatccca 15720gttactcagg aggctgaggc aggagaatca cttgaacctg ggcagtgggg aggttgcagt 15780gagccaagat cgcaccgttg tactccagtc tgggccgaca gagtgagact cggtctcaaa 
15840aaaaagaaaa aagaaaagct acaaccttaa tctcaacttc tcataacatc atctctactt 15900ctgattagaa gagtggaagt ggggaggttt attacaaaaa gactgttata ccttacacac 15960ttctccccat gaatagtgaa ggtgtgagtg aaaaagacag caattttatt ttttttttga 16020aacaggttct tgcactgtca cccgggctgg agtgcactgt tgtgatcact gctcactgca 16080gcctccacct cccaggctca agtgatcctc ctacctcagc ctcctgagta gctgggacca 16140cagttgtgca ctaccatgcc cagctatttt tttttaagag atggggtctc actatattgc 16200ttaggctagt tctcaaactc ctggcctcaa gcagtcctcc gaccttggcc tcccaaaggg 16260ttgtgattac aggcataagc caccacaccc agccagcagt tttagaataa agggtgaagg 16320tgctgttggg gaaatataat ttaaaaaaca aaatcttctc tcaacccaga aatcctctcc 16380atgaaggcag tagagaaaga taagctttat tattgaataa aaattaaatg agaatgtgat 16440gcacatcaca ggcactttgc taagagatca caaagacaga aggaaatttc accattttgt 16500acagccaagc aggtacagcc cattacatgt atgttttcga gataaatagt cctcaactaa 16560gagaacttga cagcaccact ggtcacacag ttcattctaa ctttacctga taattgatgt 16620gaccacttgt gttatctaag atatcaactt ttcgggggtg ggggagtgtg gaaacaggag 16680ttacttttat agcttggtgc aaggtactca ttaagattag gctgttaccc tcccacagaa 16740actggaagat aggtatgcta tctggtaatg tttacatttc ccagatcctt gagaaagaca 16800ttcctaggtc ataaagctga caaaaggctg attcagtttt taaatatata tatctgtata 16860tgtatttca 16869
65 15000 DNA Homo sapiens 65
gatctcttga tcccaggagg tcaaggctgc aatgagctaa gatcaagcca ctgcattcca 60gcctgagtga tagtgggaga ccttgtcttt aaaacacaca cacacacaca cacacacacg 120agggcctttg accactcttg agtagaagac tcgagaagaa caaagtagaa ggccagagaa 180gaacaaagtt acttgaaaga tctcttatta aagagaatgt acaagctatg aaaaaaaaaa 240aacacacaca cacacacaaa cctcatctgg aatgaaaaaa acataatgca tttggtttct 300ggttccttag gctgttatgg aacaaccaaa gaacattatt ttggtttctg aggtcagaac 360tattttattc ccctcaagca
cactatgctt atggtttgag ggagaatgag aaataggaaa 420ctaggaacag gctgaaatgg tctaatcttg accatctaat tctgcagtgt cttattctca 480ttctaaaaga gaatggttat attcgctgtt ctagcataaa aagtaatgat aaaaataaaa 540gatcccgtat taccagacaa taatccccta gactgtttta atgcttggtt gagtatttgc 600ttatgatctc agactttaaa agatggtctc cccctatggt gaagcttgtt aattatgtag 660gcatcattaa tgtctgttta cttatcaaaa ttttatcatt gttagttgta ttactacttg 720acagtccaat ttatttaatt gaaaagattg gttaacattt tatagtcaaa gtaattgttt 780cctgtgtttt ttcctgttta ggttattgga gtgatgagta aagaatacat accaaagggc 840acacgttttg gacccctaat aggtgaaatc tacaccaatg acacagttcc taagaacgcc 900aacaggaaat atttttggag ggtaagtaag ggaaatttct tcagacccat taaatgttag 960gaaaaaatgg agctaaaaga gctgggtggc tcacctttct catcctgtgc tgagaaatgc 1020tggggctcac ccataagtat ccagcatccc catggacaca gggaattctg aacaaatgtg 1080atgaaaccga tgaaatgtct ggcctgtagg tggttagtga tggagatacg ggctatatgt 1140gaatcttgat ttttgcaatt cattagagct ttgtaatgaa aggaaacagt ttgttgcttg 1200ctttaaggat aggttcattt gcatttctcc gcaaggaagt agtaatgagt taccaagcct 1260tagatttcac ccctttttga tttcttgctg acttaacttt aattgaatgg aagagttatc 1320acaaatgaat tatctttttg gttttttttt ttttgagatg gagtctcact ctgtcaccag 1380gctggagtgc aatggcatga tctcggctca ctgcaacctc cgcctcccag gttcaagcaa 1440ttgtcctgcc tcagcctccc gagtagctgg gactaaggtg cgcgccacca tgcccagtta 1500atttttgtat ttttagtaga gacggggttc cactatgttg gccatgatgg tctcgatctc 1560tggacctcgt gatccgccca ccttggcctc ccaaagtgct ggaattacag gcaagagcca 1620ccgcgcccag ccaggaatga caaatgaatt accttataag taaatgccat taaggaagga 1680tagctggaag atgggttgag gggaatggag gaccacagaa ctagtcctat ttaaatacat 1740gtgcatggta aaatgattcc atttgacaat aggttaatta tctcatagca taaggaaaat 1800gcttaacagt catatgcaag atgataagct ttcctatagc atccaaccaa aagatctagc 1860cagtacaatt tcctttgcta tattagggtt agaaaggccc ccagaggtga accaattaga 1920tggaatcctt gaataaaaca ctggattagc agtgaacaga aaaaagtcag attgctttcc 1980ttcttcccat agatgtctca gggatattta gtttcctcag aagataaaga atttagtaag 2040cgtttttttg tgcatactta catgaaatgt acattatttg aattctttaa aaagaaacag 
2100ctgcatgata acaaaaattg tgttatgctt gctttagctg gtatttttgc ctagaacgat 2160tatatcgttc ggacaagaag ctattcctaa gaaacaatat ttttaatcca ggaagttttt 2220catttttaga aatttatctt actatttccc aagcaaaaga gggtagttac agattcacta 2280agaatcatgt gctcacaatt tttatttaat aattattcct ccttaaaata tattaatcac 2340ctgacttaca atggtggaac catgagtgca tttttgcctt tattgtcaat aacgtcttct 2400cagaagtgag ccacaaaggt gcatagttct tggagttaaa ggtctgaatt aagacaatcc 2460agcataagtc tcattaatgt gtgattattt tgagaaaagg caagaagtac ctaagaatct 2520ccccctcact gtccagttcc ctgtttcatt taaagattca ctgtaagtaa ctgaaaggct 2580ttccttggga ggatttattt gaatcagtct ttcacatgca aaggatattg tagaacatct 2640cgtttttgct ggcaggaata tgaacatctg ttgtgaggaa agaaaaagtt tcatgcaaat 2700tacactgcca aagaagggat gttcaagttg agaaaccagt gacatttctt gtaactgtac 2760tatgaatcag cgcattttaa tcttctagat aatatatgga agtgcaggaa ggtggtagga 2820aacggtgttc attttacata tgcgttattt tattctgtgt gagtgacttc atggcaccga 2880cattgctgtt tttaaatgag gatacagtaa attgcagtcc gaggaaggct aactggaatc 2940aacatacccg tagctttaga aagcagtttc cgcaccagcg aagagtacaa gagcgatgga 3000accccatgtt cctggaagtt tgcacatcag agtaaacaaa cttgaaaacc cctcttgata 3060gcagaattca cccagccttg ttccattttc tcttaacaaa acacaccgca aaagctctca 3120caagctgctt tgatgaagcc acatgtattt cccccttcac aatttacagg aagttactct 3180taaaagaaag tgattctggt gtttaccgcc tgtgttaaag ggacagagtt cctttttatt 3240tctgataacg tttgagcgaa atacagaaac tatctgtaga ctagcatagt cggtacgtga 3300gtaaggaaaa gcaataacct gctgtccggt gagcacaaaa ttcctgctac gaacagtgcc 3360ttactgctgc ttggagactg caagtcgcag atcacactag gtattgactg attgtataag 3420gaaatttctt aaagtctaaa gtaaaggtgg tacctcctaa aaagagggga agagagaaaa 3480ctttgtgtgg aaggataagg agtgtgttta tagtttcagt aagagtgtac gttttaattt 3540ttcttcttcc tctgcctctt tgccaagtag cctgagtgca tctgttatcc agaagtagta 3600ttactctagg acaaacttca aattcttcat tctgcgttgc ctttaaggaa caacatactt 3660tcttcctgtt ctttttccaa aaacacacgc ctatggctct gtgtgtggtg ttttagccag 3720cctcctccca gataaggggt tcccttccct cctttgcatt gaaaggaaag tgcaagtctg 3780gacatgttta tcaagaggaa aagtgacttc 
tcagtaatag actgtcaaat tcgggctgct 3840gcccgagtgt tcgctttgtt atggcaggtg aagttcacct ttgccccacc cagtgtttcc 3900acaaaaaggc aaggttccaa gtattcatat gaacaagtgt tactttagga cttggagggt 3960tgggggtgga ggatgtttgc atagttgaag ccttgggcgg gggtgtagga aacggcgagt 4020acagaggcca tagaaaaagc taagactcag tttgacgtcg tcagccggct tggtcttcta 4080cccagtgact caaagcacta aaagtcagca taatcggaac tgaagtcagt agcatcgccc 4140atttgccatt cactgcagta gcaaaagtag tactctgtgg tgggttaatc ggtttgaggc 4200agctccttaa atgaacattt gtgtttcatt tttctgttat tttcccgaac atgaaaagac 4260gataaaactg aaatggaaaa ggtaactgac aaaagtgtgc cttacctgtt tccgccctga 4320tttctgctga ttcaagacta ttctggctaa actgattgga ttctttttct aactaggcag 4380taggggatca gaaatcacac acggtaccgg ctgtgtttat tctgagaggt gctggggagc 4440tttgggtctg acttcctttt acatgcctgt cttctctttt ggacagatct attccagagg 4500ggagcttcac cacttcattg acggctttaa tgaagagaaa agcaactgga tgcgctatgt 4560gaatccagca cactctcccc gggagcaaaa cctggctgcg tgtcagaacg ggatgaacat 4620ctacttctac accattaagc ccatccctgc caaccaggaa cttcttgtgt ggtattgtcg 4680ggactttgca gaaaggcttc actaccctta tcccggagag ctgacaatga tgaatctcag 4740taagtggatt acagaacaaa aaaataaaaa atgccagtaa tgtcggttct gcccctttga 4800actaataaca tgttgtttaa ttatacggct ttgtcatgtg ttggatgaag taggtggctt 4860aagctaggga ctaggaagag gaaaaacatt ttttgagtcc ctattaacta ttaggaaact 4920tgatcattta aaagtatata tatatatgag gagctacctt gagttttgaa ttcaggatgt 4980tacaggaaga aatatatgtc caattctaat ttatccaaaa gcagttggga gaattacagg 5040gattggtcca gacatgctgc gtatgcaagg tatagccctc atctgtggta ctttggcagg 5100gcttagactg catcaaaata tttatagatg tacatttgag tgtacagtta ggatctgatg 5160tggaacattg taagatcatt gctagaaaaa ctttgtcata atttttcaat attattctaa 5220gtgaataacc gtaaagattt tacatcttag cttccttcct tacagtaaaa aaactatctg 5280atctcttgat cagtattata gtagccacct atcactttat cttaacaaat tctcaattcc 5340ttaggtttat gtgcttttac ttcttttatt tgattaaaat tgctgtcatg acctctctct 5400gcagagggct gcatcatttt ggtcattctc aagtgatctc tttgagcaat ttaagaattg 5460ccataagatt ctaacctctg ctgtaactat ggttgtgtgt tcttggttag accactaaat 
5520cttattagca gttttaaaaa ttattccttt tggtttagaa gttaagacta aatgctgaag 5580tttttgtaac ttttggtttt gatatcattt caaacttaag aaaacatttg aagaaaagga 5640caaagaattt ccacttaccc tttacccagg tttaccagtt attgataagt atatccattt 5700gctttaccag aaggctaact tgttttagtt ctcattttca cctttgagac atttggaata 5760aatatcaatg ttaacataaa ttggaatttt gactttgatt ttaggaccaa tgaacaagcc 5820aagtacttac cctagtcata tataatccaa ctgtatggtt atttggtatt cattccacac 5880ttcattttac ttgatctccc ttaagattgc aagattgtgt ttgcagtttt tctgaaaatc 5940tggggctata aaagcatcag gacctccccc gtaggggagg tcgtgtgttt ggggtcctta 6000cacaacaggt tacccttgag cttcaggaaa agaactggct ctcagttccc cagttccagc 6060ttaatgggtc taattaggtc ctgaccaaaa aggtggcagt tcttttccct catgtctctt 6120cagcgctccc cgagactctg gagactctgt catatcccta gggctgagcc tcccaggaac 6180cattcggctg ttgtggcatc tgtgtatgcc atgcccagtg ctgaggacct agtaacaaac 6240gacaaatgca caggcacagt ggcatttttg tggaactcgt attccagctg tgcgtctcag 6300aagaagcgca cagctccctc ctggctttct taacatagtg agccacttcc acttaagggt 6360ctccttacat tccttgagtt taatcattca tggattcaga ggaaagtctt ttgatttttg 6420cttttcttta aacagttcat ttgaggtgac ctaccccagt gactttgcac caaccaccaa 6480gaaacttttt tgcatgcttc ccgcaccctg tgccaatcaa gggaagggtt taaaggcctg 6540gcgtttttat tcctcaaaga aaggttttgc acagtatttt aaggttcaag tgcttctact 6600ttgtgttcag aagcaactgt catatatact gtgaaatgac accttttatt tatccctttt 6660tatttatgca gtatgtcccc ttttattttg gcagaatttt ttctaaatgg tggtttaaca 6720ttttcaagca catttcattg tccaatattc atagtaaaga atgagagtta acaataacca 6780gtcacattaa aacaagattc ctgctgccag ttgtgaaacc ggttgtctta ggcgtggcag 6840ctgatgattg agactgtgat caggaaaatt tccactattt catcaggcct aataggtaga 6900ttgtgtctcc aaatgaactg tgttgggttt ccatgcttaa agcacaatag aggtggtgca 6960agaatctcca tgagggctta aatggcagtg atggttcagg cggtagagtt tggagaagaa 7020gggatttgaa acaaaccaaa ggaaagaaaa gtaagtagcc agaaatcaca aaatggcatt 7080tttctaaaaa caaaggaaaa ggaataaaag aactaataag tttgaaaccc ctacccctcc 7140caaatttggc agggggggag gtattttttt tctatctatc taactaaccc atctagaaaa 7200cagttgacca aattatagac ttctaaatgt 
taatctgctt tctcagtttc agttgaaaag 7260agactttgtt ttgcctactg cagaacttct aggttctttc ttatagtctt ggggttctta 7320ttatagatcg aaaatgtgag tcggcataat taagccattc ggagtcttca gaagcagttc 7380actcttgaaa tgactccgtc cgcctacagc catttaagat ttcagaacaa aaacagatct 7440tgattttctt tttcatgtta actcaagctg ttgctgagtg ggagagtcag aaatgacacc 7500agctccactg attactcagc tgctgaagga tgatttttta aaatgcacct ttactgtata 7560tggacttcct aatttccacc tgtagagcat cttagggagg ctaacatgtc actctggatg 7620ttcttttaga ataagatgca aatctatttt tctgaaggca ttagagatag caaacattta 7680ttgtgagttt actatatact aggcactgtg ctaagtgttt tgcatagaaa gtttaaaatt 7740ctggcttttt tgttggccca atcataagtt tcatatcagt tcaacattca aattatatta 7800aggtacttaa gaagaatccc tggctaaatg tgaggggcag tgccacagat ggactgaaac 7860tttatgctta ttgcacattt atgctattat tatttgttga attatagaac caagggagtg 7920tggaagccac tggaaaaaat atgagactta gatacataat ttgagtaaaa atggctcaaa 7980gtcatgaggg taaagttttt tgtatttcca ttttattcga gcggcatcgt ttttaaaaat 8040cattatgaat ttgaccctat atagatgttt ccaaataatt ctttttcacc ttcataaaat 8100tccttcctgt ggctgtgaga tgccttgcct atcagttttc aagcttagtt gtctttctca 8160tcctttacca ttttagcttt aaaaaacaaa agtgacaatt agaacttcct gcctgctggg 8220cctcactgaa agaccgatat tggcctgata aggagatatt tattttgttt tagtggcttc 8280agaaatccct ctccctcagc aagctttcca tcacggcccc cccgtcagca tcttccctga 8340tagcgttctt ctctgtgttt attctggggc ttcaggctcg cccaggagga actgataacc 8400gctggcagga gataacattc tctaaggggc tctcaaattg gaatcgaatc cctcaagcca 8460gtcagcctag agaatacatt taaagggttc agttctggag tttcacagag ttcatttcta 8520gacctatcag atagcaagtg tggagttctt tctcaactaa attcaagcag agacattttt 8580tagacgatga aggatatttg cacaaaggct tcagcatgat cccccaaacc tgctgcctct 8640gaaggcatct ccacacattg acagccaatg ccttcagtgc gttcctaggg caggtgtcct 8700ggcttgagtg actgtcctcc aataatcaga gctcaaacta aacatcgtat gttttacttt 8760tggtttccag gcaaggctga gcagggaatt ttcagttttc cctgcccaga tgggtgtttt 8820ttcctgaagg catcatttat tgtgtagcga ggagacaggg ctggctgtgg cagggatagt 8880ctagaactgt cctcattgct gctgttccta aatagtatct ttaccaagta ataacgtgcc 
8940gtctttggga ataagtgctt tcctcttagc ctgttctgtt ttcttgggtg cgctaagtaa 9000ttgaactggc tcaggaagta cctattgtgg tttggcagag gtgactgtca cgccttgtga 9060ctccaggggc cagcactgct gggatcctgg ctagaccaga cagagccttg gtgaagtgct 9120taggctgtct gcacatcgcg aggaaggtgg tattcacttc gctaagctcc ttggcatagg 9180cagtttgaac agggctttat caaattcgta ttcaacaaga gtagaagcga aaattgatga 9240ctgtgtatta cttgaaatga gtcttaatct ttcacattta gttctcaggg tatgctgatt 9300tcctttaggt aaaccatgaa catcagaaag acttttatta acctatgaca gggtccccac 9360cccagtattt ttccactcca ttaaaatgga agtttttttt ttttttttct tttttgagac 9420agagttttgc tcttgttgcc cagtctggag tgcaatggca caatctcggc tcaccacaac 9480ctccacctcc cagattcaag cgattcttct gcctcagcct cccaagtagc tgggattaca 9540ggtgtgcgcc accacgccca gctaattttg tatttttagt agagatgggg tttctccatg 9600ttggtcaggc tggtctcgaa cttccgacct caggtgatcc gcccacctcg gcctcccaaa 9660gtgctgggat tacaggcaag agccactgca tccagcttag gctatcttac tccagcctaa 9720acagcaattt tctatcataa ggtctgtact aatgaaaaca gaatcaccca aggctgctgt 9780ttgttctgtc tgtgctgcca ttgtccgcat tttgctgagg aggaaacgga actgcacttt 9840tgagtgagtg gcccagagcc ttctagaatg agagtgcgtt ggaagccaga tatgtggcga 9900ttgtgtcgcc agctgttact caggttttct caagaaggag gagcaacttt ggcagttttg 9960cttcagttct ctctagccct ctgtgtaatc gccccttttt ctttatttca gcacaaacac 10020agagcagtct aaagcaaccg agcactgaga aaaatgaact ctgcccaaag aatgtcccaa 10080agagagagta cagcgtgaaa gaaatcctaa aattggactc caacccctcc aaaggaaagg 10140acctctaccg ttctaacatt tcacccctca catcagaaaa ggacctcgat gactttagaa 10200gacgtgggag ccccgaaatg cccttctacc ctcgggtcgt ttaccccatc cgggcccctc 10260tgccagaaga ctttttgaaa gcttccctgg cctacgggat cgagagaccc acgtacatca 10320ctcgctcccc cattccatcc tccaccactc caagcccctc tgcaagaagc agccccgacc 10380aaagcctcaa gagctccagc cctcacagca gccctgggaa tacggtgtcc cctgtgggcc 10440ccggctctca agagcaccgg gactcctacg cttacttgaa cgcgtcctac ggcacggaag 10500gtttgggctc ctaccctggc tacgcacccc tgccccacct cccgccagct ttcatcccct 10560cgtacaacgc tcactacccc aagttcctct tgccccccta cggcatgaat tgtaatggcc 10620tgagcgctgt gagcagcatg 
aatggcatca acaactttgg cctcttcccg aggctgtgcc 10680ctgtctacag caatctcctc ggtgggggca gcctgcccca ccccatgctc aaccccactt 10740ctctcccgag ctcgctgccc tcagatggag cccggaggtt gctccagccg gagcatccca 10800gggaggtgct tgtcccggcg ccccacagtg ccttctcctt taccggggcc gccgccagca 10860tgaaggacaa ggcctgtagc cccacaagcg ggtctcccac ggcgggaaca gccgccacgg 10920cagaacatgt ggtgcagccc aaagctacct cagcagcgat ggcagccccc agcagcgacg 10980aagccatgaa tctcattaaa aacaaaagaa acatgaccgg ctacaagacc cttccctacc 11040cgctgaagaa gcagaacggc aagatcaagt acgaatgcaa cgtttgcgcc aagactttcg 11100gccagctctc caatctgaag gtaggccttg agagagagca gtccaagggg ctgtgagtgc 11160atgcttgtgt ttgtatttag cttgctttcc atggggtatc gattgcattt gcagtagtat 11220gagcccccgg ttggggatag tgggtatgga ttccgcctgg cttttgccac ttctagctct 11280ttgactttgg acaagtgact tcccttctcc tgattttctt ctgaataata aaaaaattag 11340gggtttggac tagaagatta ggtgaaactc cctgctagcc tgtgattttt gtgcttttaa 11400gaaaaacacc attctgaaaa catgaagatt tcttcttttt aagactgtct tgatgctttt 11460cttaagatat ttgcatcaac acttgagtct tggagcagaa atgttaggtc tcagagccag 11520cttgagagca gagctaacac atgtggcttc ttcccaggtc cacctgagag tgcacagtgg 11580agaacggcct ttcaaatgtc agacttgcaa caagggcttt actcagctcg cccacctgca 11640gaaacactac ctggtacaca cgggagaaaa gccacatgaa tgccaggtgc gcagtatttt 11700ctgggtagac cttctgacct ttgtagaaaa tgtctgtgag tcaccctccc atgtcctata 11760tagcccgtag ttaaagccaa caccagattc tgcgttgtcc catcctggac tgatggcact 11820atggtccttc ccagtacttt gtatctgctg atgacttgag atggcacagc cagcttccag 11880tgggtgggaa aatggtaggg gaaataaaca gcccctcgtg tgctgtgtgc ccacatcccc 11940ccgtttgctt aataccacac tggaggtgcc acaaggaggc ttctcacctc ctaggttgct 12000gggcgttggc cggtaagcct gcccctcccg ttggcaactc ttaatcttct ggccttcctg 12060tctcccttcc ctgctgtctc tctcccctac actgtaggtc tgccacaaga gatttagcag 12120caccagcaat ctcaagaccc acctgcgact ccattctgga gagaaaccat accaatgcaa 12180ggtgtgccct gccaagttca cccagtttgt gcacctgaaa ctgcacaagc gtctgcacac 12240ccgggagcgg ccccacaagt gctcccagtg ccacaagaac tacatccatc tctgtagcct 12300caaggttcac ctgaaaggga actgcgctgc 
ggccccggcg cctgggctgc ccttggaaga 12360tctgacccga atcaatgaag aaatcgagaa gtttgacatc agtgacaatg ctgaccggct 12420cgaggacgtg gaggatgaca tcagtgtgat ctctgtagtg gagaaggaaa ttctggccgt 12480ggtcagaaaa gagaaagaag aaactggcct gaaagtgtct ttgcaaagaa acatggggaa 12540tggactcctc tcctcagggt gcagccttta tgagtcatca gatctacccc tcatgaagtt 12600gcctcccagc aacccactac ctctggtacc tgtaaaggtc aaacaagaaa cagttgaacc 12660aatggatcct taagattttc agaaaacact tattttgttt cttaagttat gacttggtga 12720gtcagggtgc ctgtaggaag tggcttgtac ataatcccag ctctgcaaag ctctctcgac 12780agcaaatggt ttcccctcac ctctggaatt aaagaaggaa ctccaaagtt actgaaatct 12840cagggcatga acaaggcaaa ggccatatat atatatatat atatatctgt atacatatta 12900tatatactta tttacacctg tgtctatata tttgcccctg tgtattttga atatttgtgt 12960ggacatgttt gcatagcctt cccattacta agactattac ctagtcataa ttattttttc 13020aatgataatc cttcataatt tattatacaa tttatcattc agaaagcaat aattaaaaaa 13080gtttacaatg actggaaaga ttccttgtaa tttgagtata aatgtatttt tgtcttgtgg 13140ccattctttg tagataattt ctgcacatct gtataagtac ctaagattta gttaaacaaa 13200tatatgactt cagtcaacct ctctctctaa taatggtttg aaaatgaggt ttgggtaatt 13260gccaatgttg gacagttgat gtgttcattc ctgggatcct atcatttgaa cagcattgta 13320cataacttgg gggtatgtgt gcaggattac ccaagaataa cttaagtaga agaaacaaga 13380aagggaatct tgtatatttt tgttgatagt tcatgttttt cccccagcca caattttacc 13440ggaagggtga caggaaggct ttaccaacct gtctctccct ccaaaagagc agaatcctcc 13500caccgccctg ccctccccac cgagtcctgt ggccattcag agcggccaca tgacttttgc 13560atccattgta ttatcagaaa atgtgaagaa gaaaaaaatg ccatgtttta aaaccactgc 13620gaaaatttcc ccaaagcata ggtggctttg tgtgtgtgcg atttgggggc ttgagtctgg 13680gtggtgtttt gttgttggtt tttgttgctt tttttttttt ttttttttta atgtcaaaat 13740tgcacaaaca tggtgctcta ccaggaagga ttcgaggtag ataggctcag gccacacttt 13800aaaaacaaac acacaaacaa caaaaaacgg gtattctagt catcttgggg taaaagcggg 13860taatgaacat tcctatcccc aacacatcaa ttgtattttt tctgtaaaac tcagattttc 13920ctcagtattt gtgtttttac attttatggt taatttaatg gaagatgaaa gggcattgca 13980aagttgttca acaacagtta cctcattgag tgtgtccagt 
agtgcaggaa atgatgtctt 14040atctaatgat ttgcttctct agaggagaaa ccgagtaaat gtgctccagc aagatagact 14100ttgtgttatt ctatctttta ttctgctaag cccaaagatt acatgttggt gttcaaagtg 14160tagcaaaaaa tgatgtatat ttataaatct atttatacca ctatatcata tgtatatata 14220tttataacca cttaaattgt gagccaagcc atgtaaaaga tctacttttt ctaagggcaa 14280aaaaaaaaaa aaaaaaaaaa gaacactcct ttctgagact ttgcttaata cttggtgacc 14340tcacaatcac gtcggtatga ttgggcaccc ttgcctactg taagagaccc taaaaccttg 14400gtgcagtggt ggggaccaca aaacaaccag ggaggaagag atacatcatt ttttagtatt 14460aaggaccatc taagacagct ctattttttt tttgccactt tatgattatg tggtcacacc 14520caagtcacag aaataaaaaa ctgactttac cgctgcaatt tttctgtttt cctccttact 14580aaatactgat acattactcc aatctatttt ataattatat ttgacatttt gttcacatca 14640actaatgttc acctgtagaa gagaacaaat ttcgaataat ccagggaaac ccaagagcct 14700tactggtctt ctgtaacttc caagactgac agctttttat gtatcagtgt ttgataaaca 14760cagtccttaa ctgaaggtaa accaaagcat cacgttgaca ttagaccaaa tacttttgat 14820tcccaactac tcgtttgttc tttttctcct tttgtgcttt cccatagtga gaatttttat 14880aaagacttct tgcttctctc accatccatc cttctctttt ctgcctctta catgtgaatg 14940ttgagcccac aatcaacagt ggttttattt tttcctctac tcaaagttaa aactgaccaa 15000

66 46340 DNA Homo sapiens 66

tattttactt cagtaacaga aaatgaaaga aatgttttaa tgttgctgat tgtattacct 60tcaggatcaa tagcagaagg acaaacttct ttgaggagat ctcctagtgt gtgcaactgt 120ccatctgcag ccacaggacg aaacagcttc tgaatgaaag gtctttcagt cgttgtctat 180ttgaaaaagg aaaaaatgat tcaagcaatt aagtctttgt tgctgccaat tacaaattta 240tatatcataa actttatgtt ggcattaggt gccttttgat acggtgttag cataattaca 300caacatcaca gatgtggtat cactgtgaaa aatgtttaac atgataaatt caggtaaatc 360taattctgag gaaacagaca aatccaaagt tgggtgggac attctaaaga taattggctg

420ggacccttca aaaacttaaa gacattaaaa agcaaacaac acaaaaagat atcaacaaaa 480gcattttttc tcagtatctc ttaaagagac taacaaagca aatacaaaac ataaaccatg 540gctgaatact aaattgaaga aggacatttt ttagaaatcc aactatgaaa cacagttttg 600ggataaatgg ggaaatacag aatggacaac tgataatatt attgagttaa tgtcaaattt 660cttaggtaca ataaggacaa tccttatttt taagaaattc attgttcaag tgtttaggaa 720agaagtgcca tgatatccaa aacttaatct tctttctctt tttttggaga cagagtctcg 780ctctgccacc ccggctggag tgcagtggcg cgatctcagc tcactgcaac ctctactttc 840caggttcaag tgattctcat ggctcagcct cccaagtagc tgggactaca ggagtgcgcc 900accatgtcca gctaactttt tgtattttta ctagagatgg ggtttcacca tgttgcccag 960gctggtctca aactcctgag ctcaggcaat ctgccggctt cggcctccca gagtgttagg 1020gttacaggcg tgagccaacc gctcctggcc ccaaaactta accatctaat ggttgagaga 1080gagacagaga gagagagaaa gagagagaca gagaatgtgt gtgtgtgtga agacaaagca 1140aaaataaaaa aatattaact aatggtgatt ctaggtagag ggtgtatgat tttagtagtt 1200tcattatttc aacttttcga taggtttcac aatttccaaa acagcagatc cagccatttc 1260atctgacaaa aactgttagc agcactacat cgtaatttat tgctaataat ctcattgttt 1320tactcttaaa attgtttcat ttactaaatt tccttagtga tgatggaggc tttatcatga 1380cagagtacag aggctctgaa atgagccagt gtctatgaag agcaccactg tttgcaagat 1440ctatgatctt gtacccagtt tcctttatct gttaatttgg gacattccat atctcttgag 1500tttgttgtgg aaataaatga gcaactttgc caaccacaga gtaaataaat aaatgttaaa 1560gagaataaaa gcatttttac ctcctctctc cctcttaacg gttatttcac tttaagatgg 1620taaattttaa gctttctgag atgaaaaatc attaaaactt aacaagaaca gagaaatgcc 1680atacatacat attttttgtt tgcttgtttc ctgagacaag gtttcactct gtcacccagg 1740ttgaattgca gtggtgcaac ccccaagttg caatcctcca cctaagcctc cagagtagct 1800gggactacag gtgtgagcca ccatgctcag ctaatttttt tacttttttg tagaaggggg 1860tctcactatg ttgcccaggc tgcctcatat tttataagaa tatgacttca aacacttagg 1920cattagcgac aaggttttgt ttttgtcttt taatgacaga ggtatacctc aacatatttg 1980acacaactgt tagagatttg gtttaaaaag aaatagacat ggatgaagct ggaaactatc 2040attctcagca aactaacaca ggaacagaaa accaaacacc tcatgttctc actcacaact 2100gggagctgaa caacgagaac acatggacac aggcagggga 
acatcacaca ccaaggcctg 2160tcggggagta gggggctagg ggagggatag cattaggaga aatacctaac gtagatgagg 2220ggctgatggg tgcagcaaac caccatggca catgcatatc tatgtaacaa acctgcacat 2280tctgcacatg tattccagaa cttaaagtat aatacaaaat gaaaaaataa ataaaaataa 2340gtagaaaaaa taaacatgta agcatgtgag ctgcctttcc taattctatg tttatgtatt 2400cactgaatac atagtatttt aaaatagtaa tccaataata tatttgagtg tttgtgacaa 2460gtatgaaaat tgtaattttt aaaaaatctt gataatatgc attgaatatg atttaattca 2520cttcactatt tgaactcttt agggattatt tttaaaaata tgattgatat cctttgatat 2580gttttggctc tgtgtttcca tccaaatctc atctcaaatt gtaatcccca cccgtctagg 2640gagggactgt aatccccatg tgtcgaggga gggaggtgat tgggtcatag gggtggtttt 2700cctcatgttg ttctcgtgat actgagtgaa ttctcatgag atctgatggt tttaaaagtg 2760gcagtttttc ctgcactctc atctctcttt cctgctggct tgtgaaggtg cctgcttccc 2820tttctgccat gattttaagt ttcctgaggc ccccacaagc catacggaac tgtgagtcaa 2880ttaaaccttt tgcctttata aattatccag tctcagatat ttctttaaag cagagtgaaa 2940acagactaat acattcttca atttaaaaag ccatactttc tcatacaagt tgaaaccaag 3000aacaatatca tgcataatca agtgattaac tgtgtaaaga taataaggtt gaggagttca 3060gagaagaaaa gaaatgaata gggaactgta gtgataattt aaaatagcca tccctcactc 3120agggtttttg atcttcaggc catgaagaag cttttaatgc tttttagcaa aggaagtaat 3180gttggtgaaa ggctttttct gacgactaat ggaaagcagt gctatgtatg gtgacttggt 3240tatgaaccaa aaccagaatg actggtgaga ggctgactga atacagcaag cttatgtgaa 3300gacaactgga gctggtgcag tggaaaagga agacagcagg actgtaccca caactcaaag 3360aaaaaagtca gaaggtacct cccgcagtcc aacctgaaaa caacaaagtc aaaggaatct 3420tttcaagaat ttggagctct cattcatatc ctaattagtg tatgaaatgt gaggtggctt 3480tgctataatg aaattacctg gaatatttct aacacaaaga aataataaat gcttgaggtg 3540gtgaatatcc tcatttgatc attacacatt gcatgcttat agcaaaagat tacatgtacc 3600ccataaataa ttgcaactat tatgtatcca taataattaa aactaaaaga ttaaaaatta 3660cctgaaaaaa aatgctaaac aggaaaggcc aactagtctt ggttacatat taaaaaacag 3720aaattcttct ctaacctcac tattggagaa atatcctgtt atttttatat atcttttttt 3780tcaccctttc ccaaatctga gcaagtatta taaaggtata accttcaaca atcttttatg 3840atgaggtatt 
tgcttactgg ggacaaagcc ccagtgctat tacatagtgt agctaaacgc 3900tgtagaatgg taaaaacaag aaaatgctca gcaaagtgtt gtttctcatt taatgaaaat 3960cttattttaa aacacaaaaa ctcaatatac cccaaccaaa aatctgatga acattttctg 4020tttaatattt attatacagt acctttaaaa acgtaatatt cttattctta aaaatttagt 4080gtgctagcaa atagcaatta agtacctaag tcaatcagga cgacaaaaaa atactcaatt 4140tggggagtta gttacttcta tcatctgaat gcgtccctcc aaaattcatg ctgaaaccta 4200ttcctcatca tggcagtatt aagaggtgaa gcctttgaga ggtaattagg tcatgagggc 4260agagtcctca agaatgggat caatgctctt ataaaagagg ccccagggag cttgtaaggc 4320ttttgcccct tctgccatgt tgggggggtg ggggtggggg cgcagcaacc agtgctaact 4380ctgaagcaga gagcagccct caccagaaac cgaatctgtt gaagccttga tctctgactt 4440cccagcctcc agaactgtga gaaataattt tctgttgttt ataaattacc cagtctaggc 4500tgggcgtggt ggatcacctg aggtcaggag ttcaagacca gcctggccaa tatggtgaaa 4560ccccatctct actaaaaata cagaaaatta gctgggcata gttgtgggcg cctgtaatcc 4620cagctactca ggaggctgag gcaggagaat cacttgaacc cagaaggcag aggttgcagt 4680gaatcaagat catgccattg aactccagcc tgggcaacaa gagggaaact gtctcaaaaa 4740aaaaaaaaaa aagtacacac tctaacatat tttggtatag cagcccaaat ggaatggact 4800aagacaatta cccttaaaat aaaagctccc atagagagat catgcattca agtacagagg 4860ttcttaaggg caatgggaat ggaggacata ttcctgcaaa cttttcaaca gctctcatta 4920gcccgatgtt agagctctgc aaagaagact aaattatact gagaaatatt tttaaatctc 4980cacaaatagg aatgctgtaa acgttgattt agtatatata aaattagaca agactaacaa 5040tatccaatgc aatctaaatc ttaggttgac agacaagaaa gccactgcaa acaggaatat 5100accacaatac ctgatcttgc cacatatttg taaatatgca aagtatttca ataacttcca 5160agaaacagta ttactctcat gagaaataac atgatgtaag tcacctttga aactgtcctt 5220gttacttttt caaatgtatg ttagtcattt cttaacacca aatgaaatga aaaactgagg 5280tggtaatggc tggctgctcc catctctcct ctactcatgt gccttcacca atacagcaat 5340cattttttct tatatgggaa atttacagtg ttgatatagc tcagagatat attgaagaaa 5400agcagaaaaa cgaaacttat aaacatttta ggaaacctta tgtattttct taaatagttc 5460aagtgtaaaa cttagaattc ttataaataa tgtgtgttac agctatattg taaatggtgg 5520ctcatgcctg taatcccagc acttcaggag accgaggtgg 
gaggagagct tgagcccatg 5580agtttgagac tcacccgggc aacacagaga gacctcatct cttaaaaaaa aaagaaagaa 5640agaaagaaat gaaatgcaaa gaaaaagtct ctatttcaaa tgtagccagt agagccaata 5700ggttaaccaa tattaacatt aacgttgata aaacaagaaa tgatgattta ctataagctg 5760aaaatcagac aatgtatgga ctttaagagt aacaggcacg atcatcacaa acttaaatca 5820ggtttgagtc ctatgagtta tatacagtta catgatgcaa caaaagatgc cagccagttg 5880ttaaagagta ttagattcgg ctgggggtgg tggctcatgc ctgtaattcc agcactttgg 5940gaggccgagg agggaggatc acgaggtcgg gagtccgaga ccagcctggc caatatagtg 6000aaacctgatc tctactaaaa atacaaaaac tagtcaggca tggtggcacg tgcctgtaat 6060cccagctact cgggaggctg aggcaggaga attgcttgaa cccagggggc ggaggttgca 6120gtgagccgaa atcgcgccac tgcactctag cctgggcaac agagcaagac tctgtctcaa 6180aaaagagtat tagattcaag tcctgtttct gtcatttatt atggaaccat ggacacaact 6240acctatcttt cctgaacctc agttttttca actgcaaaac aggaatatat acatatgtgt 6300atatatacat ctgtgtaaac acatatgtgt atatatacat ctgtgtaaac acatatgtat 6360atgtataaat ggagataata cctacattat agtttctgag ataataaaat gcacaacaca 6420attctgacac ataacaattt gtaacttaaa acataccatc accagggcca ctagttttag 6480aacactgtaa tgcatagtct aatttaatac tatgcaaact gtgttcactc aaggttttat 6540ttccttttaa tttcattcat ttactcttca gttgtttgta agctaaaaag tccagaatca 6600tgaaattcag aagtttacgt tttaatgttt ttctatatgg caaggaaaaa aaaaagggca 6660aagtcatttt aacactactt tcaaaatcag cctagaactt aacactaaag gcatgaccca 6720taaaagggaa tactaataaa tagacttaat taaaattaaa caacaacaac aacagctaag 6780cttttgttct gcaaaagatc ctgtgaagag aatgaaaaca taagccgcag gctgggagaa 6840aatatttgca aaccatattt ccgagaaagg tcttgtgtct ataatatata agaactccca 6900aaattcaaca gtttttaaaa aaagcaaata atccaattag aaaatgggca aaagacatga 6960acagacattt taccaaagag aatatatagg tggcaaataa gcatatgaaa acatatctca 7020cacatcatta gccattaaag aaatgcaaat taaaaccaca atgtgatatc attacacacc 7080taccaaaata tccaaaataa aaattagtgg taacaccaaa tgctggtgcg catgtggaaa 7140aatagtcctt cacacactga tggtacaaat gcaaaacagt acagtccctc aggaaaggag 7200tatggcagtt tcttacaaaa ctaaacatgc acttaccata tgaccaagta attatactct 7260tgaatattcc 
cagaagtaaa aatgtcttct ccaaaaaact tatacatgaa cgttcatagc 7320tgttttattc gtgagagtca aaaacagaaa gcaatcccag ggctacccat taaaacaggt 7380gaatgcttat aaactgactg taataggtct gtcccacgga atactactca gcaataaaaa 7440ggaacaaact actggtatat gcaacaactt ggatagatct caagggagtt atgttatgtg 7500aaaaaagtca atctcaaaag gttacacact gcatgactcc actgatataa cattagtgaa 7560atgacaaaaa ttttagaaat ggaaaacaaa ttagtagttg tcagaggtta gggaagaaat 7620gcagtaaggt aggtggctgt ggctataaaa gggtagccta agagatcctt ctgttgaaac 7680gggtatattt tgaatatagg gtgaatttac atatgtgata aagattgcat agaactaaat 7740acacacacac agtatatgta aaactaagga aatctgagta aggtttgtgg attatattaa 7800tacaatttcc tggttgtgat actgtactgt aattatgcaa gatgttagaa ttgggggaaa 7860ctagatgaag ggtatgtaga tctttctgta ttatttctta caattgcatg tgaatctgta 7920attatctcaa aataaaaatt tttttcaaaa tttcaaaaca actagtctag agctttgtta 7980atcaaagttt tctctgagga cctgtagcat tttggttatc acctggatct tattaaaatg 8040tagattctca ggctgcatat tggaattcct gaattggaat ccgcatttta acaagatttc 8100caagtgattc atgtttaaag tttgagaagc actagtctac aacaatgact tttaaccttt 8160caacctactc taacacactt gaaggccata acaaaattca catcaataac agttgctcgg 8220ttggacagtg actctcaaca caaatgagtg aggaaaggtg gggactcaag actcaggtag 8280caggaaaagc cccttaggtg atcctgatga aatgttttct ccatcctggc tgaaaaaccc 8340agaacagtca attaaggctc aaaacaaaag taatgtttat aatactggag atctttaaaa 8400ggcagataat atatactata acagagcaaa ggtaattatt acaatgtata aatcttataa 8460gaaccaaaat cagaattaaa atcactaagc acataatgaa aatcctttaa aaagtataaa 8520aatgaatgta gtctaagtaa atactaataa tggcagttat agtgagaaaa gctctagagt 8580cttttactct tcatacttcc tagtcacaaa catctatttc caaaactgac ccttcgtatt 8640tcaaataatt tatggcctgg tacagtaata agagcatgat atttaaagcc agtcagaaga 8700cacatattct agctctggat ggcacttgat gacgatggat tcagcttatg gttccaatcc 8760cagctctgtc aattagtacc tatatgaccc tagtcaaata cttaaacctt cttgtgttac 8820ttgtgtgtca attgtatcat ctataaaatg aggatattaa cagtatatac ctcatagatt 8880tttttgtgaa ggttatacaa ttaattcata taaagtattt agaacaatgt ctagcacagt 8940gaattctcaa tgagtgttat aattgttctt tttaaatgtg 
acttgactct caacagaact 9000ctactgaatt ctaatatgta ttctgtattg agctgtcaaa aaaaataagg attataataa 9060catatactat tcttgtagtc aaccctgtta ctatgttatt actagtgtca gttttgttgt 9120tttggtcata catattgttt tacatacatt aagaattatt agaaatgttg gtttattaaa 9180aatgaccatt tatggctaga agggtatata tctggctcac tgactgtgga gtcaatgtcc 9240ataaagagga ggaagaatgc catcagagta aaaggagatt ctattcactg aaacaaagtg 9300ataaaaagct atgaaagaga aaaacataaa ataaccaaag gggtgaaact taacagatgc 9360ccagtagatg cacaatgcac tgggttgtaa aacttaaaat ggccttaatt aaaagccaag 9420cacggatgga ggtgctgggg gagtctccta cggacacagc aggcagaatg taacaatgac 9480aaggggctca agtttattta aaaagagatt ggacaggccg ggcgtggtgg ctcacgcctg 9540taatcccagc actttgggag gctgaggcgg gtggatcatg aggtcgggag ttcgaggcca 9600gcctggccaa catggcgaaa cctcatctct actaaaaata aaaaaaatta gccgggagtg 9660gtggcgtgca tctgtagtcc cagctactca ggaggctgag gcaggagaat cacttgaacc 9720tgggaggcaa aggttgcagt gagctgagat catgtcactg cactccagcc tgggcaacag 9780agtgagactg ctcaggatct cccaaagacc caaatccctg taaactgaat gcataatatc 9840atttgctcca gtgaggctta gatggacatt ctagtcttct tggttgagct gaagaaacaa 9900atattatatt gataatttat gtatgttgta tttttcaagg tatagcaaca agtttttatt 9960catcagctac tttgtgtgtg tgctttgttt ttaagtcttt tgaaacagga tggtgattta 10020ctacatttat aagtaaaatt tatttgattt acaagggttg cttaagtgta tcacaggatt 10080tcacttgtta tatttgcagg tgcttaaaaa atcagctata ctaaactata actggaatta 10140gcaaagttca tttattgatt aatcaagaat ataattagat ttgcctaact atataagtag 10200tactatgtgt tatttaagaa ttaaatctag aaaagggatg gactctggaa atatcaagaa 10260gtgaaaaaga ctgctctcat ttttgtacaa caattactaa atttctaagt agcattaatt 10320gaactgaaaa ggcattttag aaaaactaga ttttacaatt tataactcta ataaaacaca 10380actaactatg agtgtgcttg ttcatgccca aaagctacct tccaaaatta aaaaccctat 10440tggatggctg ggtgcagagg ctcatgcctg taattccagc actttgggag gccaaggcgg 10500gcggatcacc tgaggtcagg agttcgagat cagcctggcc aatatggtga acccgtctct 10560aacaaaaata caaaaattag ccgggcgctg tggcgggtgc ttgtaatccc agctactcgg 10620gaggctgagg caggagaatc acttgatcct gtgaggcgga ggttgcagtg agctgacacc 
10680gtcccactgc actccagcct gggcgagagc ccagagcgag actccgtata ttaaacaaaa 10740caaaacaaaa ctcaaaaaac cctattggca attactaggg ccatcaaatc agtatatttt 10800cacttgacac acaattttga gataatgaac cgaacttact atttttgaaa atattacata 10860ataaatatta gtgaagcttc attgctgaaa tggtgacaaa gatgaatagc aataaaactt 10920ttcttataga tctttagcaa aaacaaaaaa accccaagca tactatggta cattacttta 10980gagaatcaag tagctgctag ttgagtaata gtggtaatag gcactacaat gatataaaca 11040aattacaaca aagaatattg tttttatttc ctgtccatgt tttaaaaaag ctttggtttt 11100acctatgttt aacaaaagca taggtacaac aacgactact actactaaca tataagtagc 11160ctggatagaa ttatcttaat agtagtaccc aagtgcagga tctctaagta atgatcagaa 11220ggcaggaata aattttatca gaaatcttca ttcattacat atttactatg catttaccag 11280ggtatcacta tgctaatgga tacaaagata aataacatgc aaacaactgt aatacagtgt 11340tatgtgataa cagaaatatg tacaaagcac tatgaaaaaa attacaaagc ttgagcacaa 11400attttaactc tggacttact ggcatttaga gcaaaaccaa aacaatccta actggttaat 11460ttcattttct aagagttgga agctatatca gtaggtacaa agtaaaatat gctaattgtg 11520gtagaaagta aaatattaca acagtagaga atttcaaaag aagataaaaa taatggaggg 11580aatatagaag gtcttcaagc ttccagcttg aaatacatat ttttttttaa atagagaaag 11640agataaagtc atttgagtat tcagagggca gactgaatat aatggtactt ctgagaaatc 11700agtggataag gagagaaaag tggactaaag gccatagcat atagagcttg gaatgtcaaa 11760tgtagtggaa ataacaaagg tttggttgga atcccaactc ccaacaacgt actgtgtatc 11820tagagcaaat tacatcaacc tttgggagta ctgtttctga atctgaaaaa tgaggaaaac 11880ttatctttga acaattgatg tgataattaa atgagatata tgaaatatct aatgtaacaa 11940gtgcttaaca atgactagtt cttttcattc ctctcttgaa ccattgtgaa acgtagaacc 12000aagaaaggta acagtattta gttgttacag aacccattaa gagagaataa aaaataactg 12060gtattctaac ttcagtttcc tttgaagtct tgttaatgag aataaatatt atgtggcaca 12120aagaaaaaga aaacaggggt ttacacagga tatgctgcca gactttacca acaatgacac 12180atgatatctg cttcaactgt cccatgcata tttggcttaa gatatattca tgcatatcaa 12240attttacatc acatggtttt caaaagaaga ttcattaaaa ttagcttaag aatgtacaca 12300atatacaata cctcattaaa taaaaagaac agaccatttc caaatgaatg cttttagagc 
12360tttacagtaa acagtctttt ggtggtagaa agagggggaa cagagagggg agtgggtggg 12420agtctgtagc acttatcaga ctacttttat cctttatgta gagaaatagg agagttgaaa 12480ataagcactt tctgtactta tgttgagagt ctgaagccca cttttaatag tcttgacaac 12540actaaaaaat aataattaac atttgaaaag ctgtcattat tatagtcagg gacacttaat 12600ctccaaagga gaagtttctt aattgatact atgattaaat aaaagcatcc atcagaatta 12660tatccacaat ctggtttgga gtttatgttt tgtcttattt aaattgttat acttattata 12720attctgtcta gacagtgcca aatgtacttt gtcatacaaa cacttgaggc aaattttctt 12780caaataagcg caacactttg tttcctcttc gtatcctttg actgaataac gtgtggtaca 12840gagaagtaat acttcccttt cttgggatcg agatcaattt gatgcttgtt ataagcccat 12900ttacagaaca aatggtattg cttttaaatt tttatatgaa cttatcagta gactagccaa 12960aaaagaagct tcatataaaa gtgctaggat tgatattctt agtaataatt aggtaaattc 13020tctaaaattt tctcccaaaa gatctgaaaa atcataccaa gggaagtata gtttaaattt 13080cattatatat aatagcttta aaatatcttt gctaattcta cccaaagcca cactaaaaag 13140actaatacaa aaagaatgta attaataaac tattttcctc tgaagaatca aagggcactt 13200ctgcatatga acatgtttta tccttttggt gtacttacat aaaataatta agaaacactt 13260ttaattagta taaacaaaga aatcaaaata gcaagaagaa atgtctgagt aaaagcagct 13320gtgctgacct caaaagtgaa attctgttct cttgatgccc agttaagtgt ctaacccagg 13380gaaaagtgat tctaaacctg ggctaggagc tagtggagct cttcaaacag tctcacctac 13440cctcacccct caaggaatgg tctatgggtt ctgtggtgaa cgctaaagtt tataacatgg 13500gaatatttat tattttgttt ctaacacaaa taatttttaa aaatttattc tactaaagta 13560acatcaaagg gaaatttcat aaaaattctt ttgaaatttt tagaagtagc aaataaaggc 13620aagtgataaa tattttacag atttcaccac ttacgtaatc tgatcaacaa attttaaaaa 13680catagcactt gaatactatt aaaaatatat taaaaaggta acatagtaaa actataaaat 13740tctttaaaaa aaatataaga ggaaaccttc gtgaccttgg attaggaaat ggtttcttac 13800atacggcaac ctaaaaatac aagcaaccaa agaaaaaaac agacaaactg gacttcatca 13860aagttaaaaa cttttgttct tcaaatgaca tcatcaagaa aataaatccc acagaatggg 13920acaaaatatt tgcaaaccat atctgataag agaccactat tcagaatatg taaagaattt 13980gtaaaactta taaataaaaa gttaaagaag tcaattttaa aatgagcaaa ggatctgaag 
14040acaattctcc taagaaatac gaatggctag ttaaatgcat gaaaagatgt ttagcatcac 14100tggtcattag gaaagagcaa aaaccaaaat gatatactcc ttcataccca ctaagactgc 14160tgtaattaaa actatagaaa ataagcgttg gcaaggatgt ggacaaattg gaaccctcct 14220catacactga tggtagaaat gtaaaatggt gcagatgctt tggaaaacag tctgacaata 14280ccccaaaggt ttaaacgtgg aattaccatg caacccagca attctactcc taagtatcta 14340cccaagagaa atgaaaatat atgttcacca aaacatttgt acataaatat taactgcagc 14400ttttattcat aatagccaaa aagtggagac aatccacatg tctatcaatt ggtgaattga 14460taaacaaaat gtggtatctt catacaacta ttactgggcc ataaaaagaa tgatgtattg 14520atacatgcta caaaatgaat gaaccttaaa aacaatatgc aagcaaaaga aaccagacac 14580aaaaggccat atattacatg atgctaatta cataaaatgt ccagaaggga gaaataaatt 14640agtagttgcc aagggctgga gggaggggga atgatataag tgactgccaa tgggcatggg 14700gtttcttttt agggtgatga aaatgttctg aaattttatc acgggaatgg ttgcacaact 14760ctgtgtaact tagaattcag tgactcctaa aaccaatgaa tagcatgctt taaaaggtga 14820cctttgctga gcatagtggc tatagtccta gctacttggg aagctgaggc aagaggatca 14880cttgagccag gagttccagg ctgtactgca ctatgatcat acctgtaaat agccaccata 14940cacaccagcc tgggcaacac agaccatgtc tctaaataaa taaacaaata aataaataaa 15000agggtgacct ctgtagtatt gagattatac ttcaagtaag ctgttattaa aaaaaaaaaa 15060gttatcatat gggtggcagg ggaaatcatt ctgggatgat ggctaacttc atcagtattt 15120gatttatacc tatgcatcat accttatgtt tgttttatgc attttgtggg ttttttaaaa 15180aaattatatt tcataaaaac aaattttaaa aaaattaaag tcaagaaccc caaaacaaca 15240aagatcagag atacatttct accttatcaa ttcagaaaaa ttacaagttt ttttcttaaa 15300aattgtatag catcatggtg attttaagtt acctgtagga atttaaataa ctttgtctta 15360actgttcacc aaaactcatt taatattcat gttctgatac tgaaaatgaa gctgaaaagt 15420tttgaaatta caatatgcta gtttaaaaag gtttactaaa atacataatt tcattataag

15480gagtaatatg aaataaaagt atcaaatatg ggaccattaa aaatgtcctt actaacaaat 15540tgctacccac attgtggact cactgcgtcc actgtttgcg agcttttcca gaacgctcgc 15600caccagttag ggtagccaag aactcctcat cttcactttc ttcctcacta gcttggaacc 15660tctggattcc cacccacact gctgtgacct gaatggggaa gagaaacgcc atagtaaggg 15720aactcttcct tttatagatt tctgaattag aatctggcat tacaaaagaa caatgttata 15780aatccaggtc agagtttata gttctatttc actattactt atatggcttg tcctaggaac 15840ttaactatta tttacaatgt aagtacctat ttccacaaaa aaattcaaaa ttttggaata 15900caatatctga agagagaatg gtctattgaa tccaaagtag gctgatacat cccaacagta 15960tttcagattg agataataat aataccacca attcatcaag tcaaattata tgcttatttt 16020ccacaatgga agttttaaaa tagtataaac attttaatat atagcaggct taacttatga 16080ttattaaaca gggttctaag aaaatagtat acatcaaata ttaatgtgct tcttgtataa 16140tttaggtgac aatttatcca tctgagaaat gcaaaagaga ctttggtaag gggttgagta 16200aggagcattc tgtgtcaaag aattcactag caaaagaggg tatactgtag ttacaagcta 16260taatcactgt acttatttta aatccctctt cagaaccagg tcttaaaaga tgataaacat 16320ggcctcatga ataactatca accaaactat agaaaagagt gcaagagtgt ggtgttctaa 16380cttaaaatat ggtgttttat tcaaataatt ttatttaagg ctccaaaagc agcagcctca 16440ttccccagaa atcatagtta aatgaaatct tccttactaa aggaaaaatg aatcacaata 16500tttaacgtga acattttaaa aacactctaa agcaacaaaa ctattcaatt gtatgtgata 16560tggcttagaa aggcatgtag gtaaaaagga ctaaaaactc taataatggt tgggccaaaa 16620gtaaatttgt tagttctact ccattaagca ttcctcaagc agtgtaaaaa tcagagttca 16680agttacactt tgatgtgtag atcctttgaa agccactcta ccctgtttta tatgaagcat 16740ccgcagctaa aatgaacacc tagtgaagag tatgaatgct gcaatacata agcagacgtc 16800agaattgtcc caagctgatt ctaagttact ttaaacatgt atgcagagtc agaatatgac 16860ttacttctta gaagtaacag ataattacct ttggcataat gaaaaaaact ttaaatgtaa 16920gttaatacag gtattttccc tttagcaaag ctttgctttt aaaagaaaac ttcaaaactt 16980aaattaaaat aggaaatgct ctactatgta gtaaaaatac tttttagatt actgaagcaa 17040agaaaaggaa ggattctatg agggaggaaa agtgggagaa aaatgtaaag aaaaaaagga 17100agaaggaaag aaaagagaaa aggaggaaag aacacaagga cagaaaggcc tattgaaata 
17160tattatttct ttcaaatttt aaacgagcag aataaattct tttgttttat aactatgaaa 17220taatctatgt tcctcttatc tatgcttgga aaatttagac aaaatgttaa gagtaagtac 17280tacattggat ttccgggtct tcagctctga aaacaagctg tttcttaaca tacgtcaatt 17340ttctatattt catgtcattt ctatttgcaa atgttataaa gttcaatatg atgtaaaaca 17400tggttaaatg aagttcaaaa ataagtataa catacattag tttggctatt ccaaatttca 17460tgcacattaa ctcagccaca catctaacac agtcagccct ccctatccag gggttctgca 17520tctgcagatt caactaacca tgggtcgaaa atgtttttgt accaaacatg tacaggcttt 17580ttttcttgtt atcattccct aactacagta taacaactat tttcacagtg tgtacatgtg 17640tatgaaatat tataagtaat ctacagataa tttaaagtat acaagagggt atgcataggt 17700tatatgcaaa tactacacca ttttatatca gactctcaaa catcagtaga atttggtaac 17760ccagggaggt cctggaacta atcacccaga ggtatcgaca gatggctata tataaatcac 17820tcagtgaatt caggattcac attatttcac aactagtata attttatgtt gttcacataa 17880ttgtgtcaca acatatacat gcagacaggt gactttcatg aaaagattac acccaagata 17940gacatatggt ctactcaaat acggtttcca aatgtgtatc caatcttgtt taattataat 18000caaactcacc attccattga taagcgacct ctaccaacct gcttatcccc tccaagcaat 18060ataacagtgg ttctctgaac caatattgac cctcctttaa attgatagcc tttttttaaa 18120aagctaacca ttgagaagta catactgttg aagacagaac atattctgta aaatgctccc 18180aagatatcaa agtcagatga tacaactgaa tgtttatgct agattatatt tctaagctga 18240gaattacatt ttaatatacc ataagcaatc tgcaaaagaa gcaacttgcc taaagatttc 18300aggagtttca agtatgcata tgtcaatatc tgtatcaata tgtaatatca atataatcaa 18360tgcacacaac aatacgtaac tgtacttata tcatctcctt agcactaatt attacaaaca 18420atctgcatgc actgcaaagc aaaagtataa tataaaatcc caaaaaacct tgaaaattta 18480ataaaaccaa aaaacaggca tcacacacaa gaactgaggc gtatacttca ttaatgagta 18540tgatatcctg atatgaaatg tcaaacaaaa ttacccaggc tcaggttaga aataaagata 18600ggacattagt ctttgtattt ttaaattgat tttttcttct aatattcctt aatgataacc 18660ctatatatta cctacttaaa attattagca aatagttatt ttaaaagtat gagtaattag 18720accaaaagca actctcatat ttacccaaaa gaaggaacca ctaccaagaa tcaaagccta 18780gtaattctgt tcttaacaga caggtgttgt gtattctggc atgttacatg aaaatcactt 
18840atgagaagaa cagaaaaaaa aattagaagg tagttttcac tatggaaata ggtaagtgat 18900taagcagatt ttcttacacc atgaaattgt cagcagactc aataatcacc ctaaggggca 18960tcattctgga tgccgacatt ctctatgatg gaaagggact gaaagtaaaa tgcactaatg 19020acataaagaa accaatatcc aatagtaaag ttgaagaaat aaacattctt tggacaggaa 19080ctaagctgaa gtttgcaact accaagaatg tattatgcca gcagtaaatt aggaaactaa 19140agcccatgtc aaccaatgaa aaatgggagg actgaaatca atcattaaag cagcagcaag 19200gttctaacta ttctaaggta taggctacct ctggcgtata ttatcagagt tgacaattct 19260tccaagaaat tctaacatca actgtaatct gaggtccttt aaaaaataat ataaaccagg 19320cagtagactt acattttgta atattttctt ctaagagctg tacattaaga ttttatttgt 19380gatataaata ctatcaaata attagctata gaacagctct attttcaaca gttataacat 19440tttaagccat ctcacattta acctaaactt ttatcaaatg tcaaaactga ggccgggtac 19500ggtggctaac acctgtagtc ccagcacttt gggaggccaa gatgggcgga tcacttgagc 19560ccaggaattc gagaccaacc tgggcaacat ggtgaaaccc catctctata aaaaatacaa 19620aaattagctg cgcctggtgg tgtgcgcctg tagtcccagc tactagagag gctgagggag 19680gagaatcacc agggcctggg agatcaaagc tgcagtgagc tgagatcgtg ccactgcact 19740ccaccctggg tgacagagtg agaccctgtc tcaaaaaaaa aaaaaaaaag aaagaaagaa 19800aaaaaaatca aaactgatca cttgaggtcc aacttatgtt tactatatct acttatattc 19860ccaaagacat cttaaggaga gatgaaatca taaaaaggtg aggatgagaa agaaaatagt 19920aagtcagtaa ggtcaatttt tacatatatt aggctagcat aataaaaata tgagtgtctt 19980attattattt ttttttgaga cagagtcttg ctctgttgcc caggctggag tgcagtggtg 20040caatcatggc ttactgcaat gtctgccttc caggttcaag caatccttgt gcctcagcct 20100cctgagtagc tgggattaca ggtgtgcgtc accctgccca gctaattttt gtattttcag 20160tagagacagg gtttcaccac gttaaaccat gagtttggcc aggatggtct caaactccca 20220aagtgctagg attacatgcg tgagccactg cgtctggcct aaagtgtctt attataacca 20280agaatttatt tgtggagaga ggtaaagaaa actcattttt agtgaaataa ttaaaactgc 20340atcattcaca atctatcttt caaaatgagg tattaactat tttggcttct aaaattaccc 20400catatactac atgcatgagc atgggaattg aagttatttt attcctaagt ttgagacttc 20460atgttttaat gtgatcacta aaaatttcct aattgatgat taggaaaata actttctgta 
20520aaattccaga attttagctg tttcaatctc ttcatattaa ggggagaaca ttatgttttt 20580actttctgtg catgcacttt ctttattaga agaaaatgga ctgagggcag taagcaaccg 20640aaaaggaaga gtaataagaa gcctgatgtg tgtgaaaact ggagaacagt ctcaaatcat 20700aaaaagttat gacagaagag gcataaaaaa taaaagtaat gaacttaata tatgaaaggt 20760aataatgatt aagagcatag gctataaagc cagactggac tccctggatt caaatcctgg 20820ctcttctaat tactaggtag gtaaccctga gcaagtttca atgaccaatc tttttctcaa 20880ttacctcagg tatataaagg ggacagtaac agcatttaac ccagaggaca ataaggatta 20940aataaataca tgtaaaataa tttaaaacag tacctggtat tcaataaagc gcaataaatg 21000ttagctgcta ttattattca tctaaacttt actttcatta ccagcaatat tttttaatct 21060taaaaatatt gaataaaaca atgacctagc ttagtaaata aattcataat gagaaaatgt 21120tgatttcatt taataataac tttagtagtt tgggataaca ctttgcatat tttaatttcc 21180ccagctataa ataactcaaa taatttgcca tcagatgatc tgttattttg aagttaacaa 21240ataaagcatt tcctaaaaaa gttctaatac ataacttttg ctctcatctt atgttttaaa 21300aacaaaatgg caaatcatct gcatcaaata gttcctactc ttataacatg acaattgttt 21360taaaatatat ctgctggaaa aagcaactga agtcctagaa aatagaaatg taattttaaa 21420ctattccaat aaagctggag gaggaagggg aaaaacatat ctgccaaata agcttataat 21480taatagttgt tttcagtttt caaaaatcca cataggaagc aatttaagcc taaattgcct 21540aagtctcaat ctcagcgtag tagatagctt agggcaatca aaacttgctg tgttgggctg 21600ccccctacag gactcaattt acctatttct tttaaaaggt gtgtaagtag gaaatatgat 21660tcaagtttta cattaacaat attaatgcta aagcagatga ttatcattca cgcattcact 21720ataggaggaa acagtctctg agaaccatct atagagatac agagagaaat gaaacaatcc 21780ttgtccttga ggaattaata gtttactgct tacagagaaa ctacatacat ggtgaaatat 21840ttaaaaatag ctcatgatat cctctatgat attatgtttg ctatagaaaa agaacaaggc 21900tgaagatcta agatccaagt tctactgttg gctctgccat caaacaataa gctaaacaat 21960gtacaagtca gttttgggga agctgtctta ttcccaaaat gaggaggtta aattagttaa 22020ttcttccagc ctctatggct ctaatattcc acagttacat ttgtcaaaac aaaaggtaga 22080aggaaatgtt tcaaaaacag acttcgcaga aagaacatct atatgatatg aagggctggg 22140gcatatgtga agaaatcaag gaagacttct tgaggaaggt gacatctgaa gtaactttag 
22200aagcactctg ggagccaagg ctattcccag gagttaacag agtcagataa taaaagatca 22260aagatgttta ggggaatagc atgcagtgtt atttggttgc agtctagcta tattttagga 22320aacatcaaat taatatcagt ataaaactca acagaatgga gggagaaaaa gcaggtagaa 22380aaatctaaga accactaaaa tagttcatct agaagataaa ggacccatga gctaaatcag 22440tgcaaatggc aagaagggaa taaatgaaga cagttctggt ccattagaac tgcaactcaa 22500caaaagtgat caaaagagtt attccaaagt attgacctgg taacttgaag aaaagtaaag 22560aaagaggaaa ctggacactg aaacagaaga agtagattat gtatttggta gtgaatggaa 22620gtagattggt gggaccagtt agaacctcac agagaagaac tatgttaaga ccagaaatac 22680ggccaggtgc ggtggctcat gcctgtaatc ccagcacttt gggaggcctg ggtgggcgga 22740tcacctgagg tcaggagttc aagaccagcc tgacaaagat ggagaaaccc tgtctcccct 22800gtctgtacta atacaaaatt agccaggtgt ggtggtgcat gcctgtaatc ccagctactc 22860aggaggctga ggtaggagaa tcgcttgaac ccgggaggcg gaggttgcag tgagctgaga 22920tcgcaccatt gcactccagg ctgggcaaaa agagcgaaac tcttgtctca aaaaacaaac 22980aaacaaaaca aaacaaaaca cagaaataca tcaattaaaa aagtgagcta ttcaccagat 23040atgttccact ggtcataaaa caaaagaata caggaggcat gacaagccat catcattgct 23100gttaaaataa ctcacagcaa aattataatg atttaagtca ataacatcta ataattccag 23160ctatagtgtg caatttaatt tattatgtgc caggcacaat agtttattaa aggtattacc 23220tctaattttc acaataaccc tattttacag attataaaat ggaggcccag agatgtaagg 23280tgaacgagcc aaatcaccta gttacctgga atataaactc agaactgcct aaatcaaaag 23340ctctcaatct taaccacatg ctatactgat gcatgtcaaa gattcaattc attcagattt 23400ttcaaggtta tcggaaaacc tatgtagata aaaatttcca aaataatcaa ggatatgtaa 23460cttttacaga aagcaatcac tgatcatcta ttgcaatact catgttctta agcaatatac 23520tgagttgaaa tttttatatt ttataaataa ttagaaagaa tacatttttt aaaactttaa 23580aaaacacctc agtttttatt ctcttcccca aatttcaaca aaatccattt atccaaactt 23640gaggttgaat cattaaagtg gtgatatcat cagtaatagc agagtgagga ccctgaatat 23700actctcctcc ataaaagcaa caagaacaca aaaattctca aaatgaactt tttctgaaat 23760ctttcaaaag ccccactctc agaaaactgt cattatttga tctgccagtt ccctagaaaa 23820acctccctca taggacatta tttgacttga ctcagagctc actcagtgca aacaatttta 
23880tcaccaggag agtttgtgga aaatcagtgg caattgttaa acatcacatc tgccatgaga 23940tagcaataac agatgggaca aacaagctaa ccaaaaaatt aaaagaaaaa cctgggaaat 24000aagaaatcca aagggggtct gaaaagttct aacatatttc tgataatcca gaaagccata 24060cacatgtata gagctgtgta cacgctcaaa aaacatctac gaaggcccta aactctcacc 24120tatgggaaac cctgaggctc tgtacaagaa gaaagtaaaa tccagttata aattgcttgc 24180cgtatcattg aaggcaatgc cccaacattc acacataggc ccctggcaaa gattggaaga 24240tactctagtt ctaggcattc aagaaaatct cttctaatca tcagatgatc actaaactca 24300ccaagcagta actttagggg cctgtgtgat aaaaaataaa aacctgaaag aattagttca 24360ggaaagaaac taaacaagca acagcaacaa caaaaacaga ccttgggaaa ggggggaagc 24420atctggtttc cagagttatt ctgttatact atataaaata ttcaggtctc aacaacaaca 24480aaattacaaa gacatgcaaa gaaacaagta taagccacaa actgggggga aaaagcagca 24540gaaactggcc ctgaaaaaga ccagatgctg gacttactgg acaaagactt taagagagtt 24600attttaaata tgcgcaaaga actaaaaaaa agtttatcta aagaactaca ggaaagtatc 24660agaacaatat ttctgatcct tcagaagaac cactttttgt cactacagat tagttctgtc 24720tggtctagaa cttcttaaaa acagaatcat agagtatatt ctctttatat cagctctttt 24780tactcaacac aatgttgtgt gagatttatc catgttgttg catgtatcat tcccaaacag 24840aaatagaaat tatagagata aataggagtt acaaaaaagt accaaacaaa aattctggag 24900ttgaaaagca caaaaactga attaacttga ggggctcaac agctgatttg ggcagccaga 24960agaatgaatc agcaaatcta aagataggtc aattgcgaga aagagaggga agaaggaagg 25020aaggaaggaa aggaggctca gagacccaag agacaccatc aggcatacca atatacatat 25080aatgagaggc ccagaagaag atgcagaaaa agggtcagag tatctgaaaa aataatggcc 25140ctaaacttcc cgaacttgac cccaaaaatt aatctacaca tccaagaaga taaacaaact 25200aaaaagaata aaatcaaagc gatccacacc taggtacatc ataatcaaat gactgaaata 25260taaagagaga ctctcaaaac aggcaaggga cttatgtaca aaacatcttc agattaataa 25320caaatttctc atcagaaatg atgttgtcaa taggcaatca gatgacataa tcaaagcact 25380gaaagaagta gaatgtctgg gacctggaat gctggtggac acctgtaatc tcagtatttt 25440gggtggccaa ggtgggagga tcacttgagg caaggagttg aagaccagcc tgggcagcag 25500aaagaggctc tgtctctaca aagaataaaa agattggctg aatgtggtgg tgtggacctg 
25560tagtcccagc tactcaggcg gctaaggtgg aaagatcgct tgagcccagg agttggaggc 25620tgcagtgagc tatgactgtg ccactgcact cttgcagtgg agaccctgtc tctataaaga 25680aaaaatgtca accaaaaact acatgcagaa aaactgcact tcaagaaatg atcagtacct 25740tgaagctctg aaggtgctta agactgtaga tcaataccat agaaaataat ttagtattta 25800ggaatgtaag aaaattaaga cagccttgtt tgataactac acataatact gtaactgttc 25860ttgcactgtt ctggttattg tcaagctatg agcacaaact gatgactgaa atacagaata 25920cagaacagga tataaaatct tatcaggtaa agttaggcaa gcaattacta gttgtaattc 25980aacttgaagg agaaggaata aggaaccaac tcaaaccagg cagcaatgaa ttgtaaaaaa 26040gcttaaggta aaacaaacag ggaaataaaa caactcagaa cctaagcata tcgtaagaac 26100ctaatctaac aaggaggggc ttaaactgat tattttacag cttgggtgca attatcccac 26160aaaaaacttt caggagtttc accagtccat aaactatttg gttattagaa aatagcttta 26220ttgggctacc ctctttgggt cccctccctt tgtatgggag ctctgttttc actctattaa 26280atcttgcaac tgcactcttc tggtccgtgt ttgttacggc tcgagctgag ctttcactct 26340ccatccacca ctgctgtttg ccgccatcgc aggcctgcca ctgacttcca tccctctgga 26400tctagcaggg tgtccgttgt gctcctgatc cagtgagacg cccattgccg atcccgactg 26460ggctaaagac ttgccattgt tcctacgcgg ctaagtgccc gggttcatcc taattgagct 26520gaacactagt cactgggttc cacggttctc ttctgtgacc cgtggcttct aatagagcta 26580taacactcac cgcgtggccc aagattccat ttattggaat ccatgaggcc aagaacccca 26640ggtcagagaa cacgaggctt gccatcatct tagaagcagc ccgccaccat cttcggagtt 26700ctgggagcaa ggaccccctg gtaacaattt ggcgaccaca aagggacctg aacccgcaac 26760catgaaggga tctccaaagc ggtaatattg gaccactttt gcttgctact ctggcctatc 26820ccttagaatt ggaggaaaat actgggcacc tgtcggccgg ttaaaaacga ttagcatggc 26880cgccagactt tagactcagg tatgaggcta tctggggaag ggctttctaa caaccctcaa 26940cccttctggg ttgggaacct tggtctgcct ggagccagct tccactttca attttcctgg 27000ggaagccaag ggctgactag aggcagaaag ctgtcgtccc gaactcccgg cattagccgg 27060ttgagatcat gtcgcagcca gaagtctcta ctcaacagtc gcccatgcgt gcgctcctac 27120cttcccttct gtcccacacc tcctgggtcc caaccacgac tttcttgaaa gtgtagcccc 27180aaaattctcc ttacctctga atctacttcc tctgatccct gcctcctagg tactaatggt 
27240tgagactttc atttcctcta gcaagttgta tctccaaagg gatctaagga agctctatgc 27300tgcgccctta ggcatctagg ctataaaccc agggagtctt gtccctggtg tccctcctga 27360tttaggtata cagctctaga catgggcagt tatgtgggac ctgttcccca ccacccttgc 27420cagggcccca agtttgtaaa tggctaagag aggaaacaga gagagacaga gagaaagaga 27480cagtgagaga cagacagaga cagagagaga gagagacaga gaggagagag agagagacag 27540ggaggacagg gagagagaca gagaggagag ggagagagac aaagaggaga aagaggcaga 27600gagacaaaca gggagtcaga gaaagaaaga caaagataga aatagtaaaa aaaaacagtg 27660tgccctattc ctttaaaagc cagggtaaat gtaaaaccta taattgataa ttgaaggtct 27720tctccgcgac cctataacac tccaatacta ccttgttgtc agcgtaaaca agggcgtagc 27780ctgaaaacac taagaccact gacaacccat agccttccta tcaaaaatcc ttaacatcca 27840gtgacctgcg gatggcccaa atgcattcaa tctgtagcgg caactgcttt gctaacagaa 27900aaaagtagaa aagtaacttt tagaggaaac ctcattgtga gcacacctca ccggttcaga 27960attattctaa gtcaaaaaag caaaaaggta gcttattaac tcaaaaatat taaagtatgg 28020ggctattctg tcagaaaaag gtaatttaac actaaccact gataattccc ttaaccctgc 28080agatttcctt acaggggatt taaatcttaa ttaccataca aaggtccgac cagacctagg 28140aggaactccc ttcaggacag gatgatagat ggttcctccc aaatgactga ggaaaaaacc 28200acaatgggta ttcagtaatt gatagggaga ctcttgtgga agcagagtta gaaaaattgc 28260ctaataattg gtctcctcaa atgtcagagc tgtttgcact cagccaagcc ttaacgtact 28320taccgaatca aaaagactat ctcaatcctg actcaaaagc ttacttatac cctctctgaa 28380acgaatttgc ctaagaactg ttgtttatgg gaatgcatct tgatggagca gctgggttgt 28440tatgaaatac tcaggaactc agcctagctc taggactcac ccctgagcac aaaggcaatg 28500ttgggcacgc tggtaaagga ccactagaat ccagcagccc ggaccccttt ctttgtgatc 28560aagaaaggcg ggaaaagggg tgagggctgc tacatcagtg agcataacta atccgataag 28620cagaggtcca tgggtggtta cacaccccgg aaaggaataa gcattaggac catagaggac 28680gctctaggac taatgctcat cggaaaatga ctagtggtgc tggcatccct atgttctttt 28740ttcagatagg aaacgttccc ctcaaggcaa aaacacccct aagatgtatt ctggagaatt 28800gggaccaatt tgactctcag atgctaagaa aaaaaagaca tattcttctg cagtaccgcc 28860tggcaacgat atactcttta agggggagaa acctggcatc ctgagggaag cataaattat 
28920aacaccatct tacagctaga cctcttttgt agaaaagaag gcaaatggtg tgaagtgtca 28980tacgtacaaa ctttcttttc attaagagac aactcgcaat tatgtaaaaa gtgtgattta 29040tgccctacag gaagccctca gagtctacct ccctacccca gcatccccca gactccttcc 29100ccaaataata aggacccccc ttcaacccaa acggtccaaa aggagataga caaaggggta 29160aacaactaac caaagaatgc caatattccc cgattatgcc ccctccaagc ggtgggagga 29220gaattcggcc cagccagagt gcacgtacct ttttctctct cagactttaa attaaaatag 29280acctaggtaa attctcagat aaccctaatg gctatattga tgttttacaa ggtttaggac 29340aatcctttga tctgatatgg agagatataa tgttactgct aaatcagaca ctaaccccaa 29400atgacagaag tgtcgccgta actgcagcct gagagtttgg cgatctctgg tatctcagtc 29460aggtcaatga taggtcgaca acagaggaaa gagaacgatt ccccacaggc cagcaggcag 29520ttcccagtgt agaccctcac tgggacacag aatcagaaca tggagattgg tgccgcagac 29580atttgctaac ttgcgtgcta gaaggactaa ggaaaactag aaagaagcct gtgagttatt 29640caatgatgtc cactataaca cagggaaagg aagaaaatcc taccgccttt ctggagtgac 29700taacggaggc attgaggaag catacctctc tctgtcaact gactctactg aaggccaact 29760aatcttaaag gataagttta tcactcagtc agctacagac attaggaaaa aacttcaaaa 29820gtctgcctta ggcccggaac aaaacttaga aaccctattg aacttggcaa cctcagtttt 29880ttataataga gatcaggatg agcaggcaga atgggacaaa tgggataaaa aaaaggccac 29940cgctttagtc atggccctca ggcaagcgga ctttggaggc actggaaaag ggaaaagcta 30000ggcaaatcaa atgcctaata gggtttgctt ccagtgcggt ctacaaggac actttaaaaa 30060agattgtcca aatagaaata agccgccccc tcgtccatgc acctcgtgtc aagggaatca 30120ctgtaaggcc cactgcccca ggggacgtag gtcctctgag tcagaagcca ctaaccagat 30180gatccagcag caggactgag agtgcccggg gcaagcacca gcccatgcca tcaccctcac 30240agagccctgg gtatgcttga ccattgacgg ccaggaggct aactgtctcc tggacactgg 30300tgtggccttc tcagtcttat tttcctgtcc cagacaacgg tcctccagag ctgtcactat 30360ccaaggggtc ctaggacagc cagtcactag atacttctcc cagccactaa gttgtgactg 30420gggaacttca ctcttttcac atgcttttct aattatgcct gaaagcccaa ctcccttgtt 30480agggagagac attctagcaa aagcaggggc cattatacac ctgaacatag gagaacaccc

30540gtttgttgtc ccctgcttga ggaaggaatt aatcttgaag actgggcaac agaaggacaa 30600tatggacgag caaagaatgc ccgtcctgtt caagttaaac taaaggattc tgcctccttt 30660ccccaccaaa ggcagtaccc ccttagaccc gaggctcaac aaggactcca aaagattaag 30720gacctaaaag cccaaggcct agtaaaagca tgcaatagcc cctacaataa tccaacttta 30780ggagtacaga aacccagtgg acagtggagg ttagtgcaag atctcaggat tatcaatgag 30840gtcactgtcc ctctatacct agctgtacct aacccttata ttctgctttc ccaaatacca 30900gaggaagcag agtggtttac agacctggac cttaaggatg cctttttctg catccctgta 30960catcctgact ctcaattctt atttgccttt gaagatcctt caaacccaat gtctcaactc 31020acctggactg tttcacccca agggttcagg gatagccccc atctatttgg ccaggcatta 31080gcccaagact tgagccggtt ctcatacctg ggcactcttg tcctttggta tgtggatgat 31140ttttactttt agccgccagt tcagaaacct tgtgccatca agtcacccaa gtgctcttaa 31200attttctcgc tacctgtggc tacaaggttt ccaaaccaaa ggctcagctc tgctcacagc 31260aggttaaata cttagggcta aaattatcca aaggcaccag ggccctcagt gcctattctg 31320gcttatcctc atcccaaaac cctaaagcaa ctaagaggat tccttgacat aacaggtttc 31380tgccaaatat ggattcccag gtacggcgaa atagccagac cattatatac actaattaag 31440gaaactcaga aagccaatac ccatttagta agatggacac ctgaagcaga agcggctttc 31500caggccctaa agaaggccct aacccaagcc ccagtgttta gcttgccaac ggggcaagac 31560ttttctttac atgtcacaga aaaaaacaga aatagctcta ggagtcctta cacaggtcga 31620tgagcttgca acccatggca tacctgagta aggaaattga tgtagtggca aagggttggc 31680ctcattgttt atgggtagtg gcggcagtag cagtcttagt atctgaagca gttaaaataa 31740tacaaggaag agatctgtgt agacatctca taacgtgaac ggcatactca ctgctaaagg 31800agacttgtgg ctgtcagaca accgtgagga aagtaactaa aatcgtaaat ccccatggcc 31860ctcccttatc atatttttct ctttactgtt ctcttacccc ctttcactct cactgcaccc 31920cctccatgct gctgtacaac cagcagctcc ccttaccaag agtttctatg aagaatgcgg 31980cttcccagaa atattgatgc cccatcaaat aggagtttac ctaaaggaaa ctccaccttc 32040actgcccaca cccatatgcc ccacaactgc tataactctg ccactctttg catgcatgca 32100aatactcatt attggacagg gaaaatgatt aatcctagtt gtcctggaag acttggagcc 32160actgtctgtc ggacttactt cacccatact ggtatgtctg aggggggtgg agttcaagat 
32220caggcaagag aaaaacatgt aaaggaagta acctcccaac tgacccgggt acatagcacc 32280cctagcccct acaaaggact agatctctta aaactacatg aaaccctcca tacccatact 32340tgcctggtaa gcctatttaa taccaccctc actgggctcc atgaggtctc ggcccaaaac 32400cctactaact gttggatgtg cctccccctg tatttcaggc catgcatttc aatccctgta 32460cctgaacaat ggaacaacta cagcacagaa ataaacacca cttccgtttt agtaggacct 32520cttgtttcca atctggaaat aacccatacc tcaaacctca cctgtgtaaa atttagcaat 32580actgtagaca caaccaactc ccaatgcatc aggtgggtaa ctcctcccac acgaatagtc 32640tgcctaccct caggaatatt ttttgtctgt ggtaccttag cctatcgttg tttgaatggc 32700tcttcagaat ctatgtgctt cctctcattc ttagtgcccc catgaccatt tacactgaac 32760aagatttata caattatgtt gtacctaagc cccacaacaa aagagtactc attcttcctt 32820ttgttatcgg agcaggagtg ctaggtggac taggttctgg cattggcggt accacaacct 32880ctactcagtt ctactacaaa ctatctcaag aactcaatgg tgacatggaa tgggttgccg 32940actccctggt caccttgcaa gatcaactta acttcctagc atcagtagtc cttcaaaatt 33000gaagagcttt agacttgcta acctctgaaa gagggggaag ctgtttattt ttaggggaag 33060aatgttgtta ttatgttatt ttagcggaag aatgttgtta ttatgttaat caatcctgaa 33120ttgtcacaga gaaagttgaa gaaattcgag attgaataca acgtagaaca gaggagcttc 33180aaaaacacca gaccctgggg cctcctcagc caatggatgc cctggattct ccccttctta 33240ggatctctag cagctctaat attgatactc ctctttggac cctgtatctt taacctcctt 33300gttaagtttg tctcttccag aatcaaagtt gtaaagctac aaatcgttct tcaaatggaa 33360ccccagatga agtccatgac taagatctac cgtggacccc tggaccggcc tactagccca 33420tgctccaatt gtaatgatat cgaacgcacc cctcccgagg aaatctcaac tgcacaaccc 33480ctactatgcc ccaattccgc aggaagcagt tagactggtc gtcagccaac ctccccaaca 33540gcacttgggt tttcctgttg agtgggggga ctgagagaca ggattagctg gatttcctag 33600gccgactaag aatcccaaag cctagctggg aaggtgacca catccacctt taaacactgg 33660gcttgcaact tagctcacac ccgaccaatc aggtagtaaa gagagctcac taaaatgcta 33720attagacaaa aacaggaggt aaaaaaatag ccaatcatct atcgcctgag agcacagcgg 33780gaaggacaat gatcgggata taaacccagg cattcaagcc ggcaacggct accttctttg 33840ggtcccctcc ctttgtatgg gagctctctc tgtcttcact ctattaaata ttgcaactgc 
33900aaaaaaaaaa tagcttaatt gaagaataaa ttaatacaat aaaaggaata cattttaagt 33960atacagttca aactgtaaca gtgttacagt ttcaagagga ccccttcaac aagatattgg 34020gcatttccat catgccctaa aagttccttc ttgtccctta ctggttgggt ccatctctac 34080tacaccctcc tgacctggcc cagaccttgg cctcagaaga atcatttttt tgtcactaca 34140tattagtttt gtctgttcta gaacttctta aaaacagaat catagagtat gttctctttg 34200tattggttct ttttactcaa tgtaatgttc tgtgacattt atccatatta ttgcatgtat 34260tattcctttt aatcctgaat agtatgctgt tttaggaata taatgcaatt gtttattcat 34320ttacctgttg acagatatct gagctattat gatggatatt atgaataatt ctgctatgaa 34380cacttctgta caatgttttc tcggacatat attttcattt ttcttgagtg gagctgttag 34440aactgttgga tcagaaagta agcatatgtt gaattttgaa agaaactggt aaactcttgt 34500ctaaagtgat ttgtaccatt ttacactcct actaataatg tatgagagtt atatttgctc 34560cacagccttt ttactacttt gttaatcttt ttagtactgt caaccttttt aatttatcca 34620atctagggaa cgtgaagtag tatctcactg ttattttcat tttcctgatg agtaacaata 34680tcgtgtatct tttcatgtgc ttattagcca ttcctatatc ttttgtgaaa tagttaactt 34740aaatttgtaa ctaaaggtgc tttcctgagt ttcaggtagt aagcctattt ccctcaagtg 34800aataaactac agtcttggaa tgaaaaatta aacacagtgg agacattttt tgtataagtt 34860gttttactct gtgtatgtct ggtttgctta gtctattatt atatgcccca tgaaagcaaa 34920cacagtgctt atttcactaa tgagtatcac tagcacatag aactgtgctt gcccaaagca 34980tgaactcaat aaatatgtta atgtgtatgc atgcacatac atctacatgc atgtacatct 35040atacacacat ataaacatat attaattttt agacccacaa atctaagaaa actaattctt 35100gagcctctgg tttgaagaat tctcaaatta ttaacatatc tttatgttcc actccacatc 35160cactgtacct gaaatagccc tactgttcta ctttggtaaa tcaggcaaat ttaatttttt 35220aaataattaa gattccaact aattttaaaa tataatttga aagttaacaa tgaaatacat 35280tacataaaaa gaaaatttta aataaaagca aaactaaacc caataagagg aaagaaagtt 35340gggctgtatt tctttaatcc tttaaaattc aaatcacaca atgctccaat gaaatcttca 35400ttaactgaac caaactatgc ccatgaaaga tctcatatgc aactgctaaa acctcaataa 35460acatattcat cttcttgcaa aaaagatatt tctttataat atgcacatgc agtatatact 35520attttgaggc agatttgtac tttagtcctt gttccattgc ttaccggctg gctgtccttt 
35580gtctggtcat tgacctccaa cttaaaaaat aatacttgcc ttgtctaccc cacagaagtg 35640ttatgaaagt caaacaaggt agcataaagg tattttacaa gatataaagt gctataatac 35700agattttaaa aatcactcta catcccataa tactttgttg tacaatttta gagcaatagt 35760agaaaataac aattattgcc taattgaaaa tccagtcccg aattccataa aatgtatgat 35820atgaacatta tagtacatca tattacgagc cccaaataat cactgcttat atagttggtt 35880aggatttcct tagtttgttc atatagttta tatatttatg cagtccctat tttgtgagag 35940gcattgtgag gagcataaag acataagcac agtacagagc cttagcttct ctacatttac 36000taaagaagac ttcttcttgg gtatttaatc aatatttaaa gtattctggg aagaaatgaa 36060attaacttca tagactgacc ttagattact atcattacaa aaagatgcct gagtgatctg 36120tctttaacat accagtattt atcttataac tgttatattt acttgaatca gaagtgaagt 36180ccttttaagc actaagcatc cattctatac tttcttgtct ttacatatga gatacaaatc 36240atatttttaa aacttttatt tacttttatt ttttagagac ggagtcttgc tctgtagccc 36300aggctggagt acagtggcat gatcttggct caccacaatc tccacctcca cttcccaggg 36360ttcaagtgaa caaatcatac ttttaagcac agattctcaa catgtatcct agcatgctac 36420tgccataact agggtgtgaa ttaagtatta aagacagctt accccaaata ttactgtaac 36480atatatctct aaatgaaaaa gaacatatta acaactatac ttggatggga ttctgggagc 36540taacccatcc ctctctcccc tttcctccaa attccatctc ctattaacac accagctctc 36600ctgagctaag cagctcctgg ggttggggaa gggtgtacat ggagaaagct agaacctcta 36660cagtgttttc ctctctggga ggaactagca ggcatacgaa cagaaaaagc tgaataaaag 36720gctgaatcct ttctattcct gaggcagaca gagagaagac cagggaacaa agagacttcg 36780accaagagcc ctgccaggta ttgatacctt tgatactgag aaaatatctg ggatatgaaa 36840tacaaatgct aaataagtat ctttgaaata ggggtaaaag aataaagggt cttgatgagt 36900aaaatgggta gtatttttta ataacctgat aatgagcttt aggaaaaggg aaggtcaacg 36960ttatggaatg aaaacacaga ggtaccaaat ttaaaagcat aaaaaaaagt ggaggggggg 37020aacccaataa cttcatcaaa ctagcaaata acttagtatc atttctaatt agaaacgcta 37080gaaggaaatc acttagatct gataaagact aggctataat tctaactgat gaaacactta 37140aactgtatca attaatacca gaaaacaaac acagaaaagt ctactagaac catcattatt 37200cagcacagtc ttggtaatgc aatactataa tagcaatgca ataaagcaag aaaaaaaaaa 
37260gtttgtaaaa acacaatagg atgagatttt tgtttttcca atgccataaa taactagaaa 37320tggaaacaaa ataaagaaaa acaaaatcta caaaacacct ggaaataaaa agaaaaatgg 37380tctatttgaa gaaaacctta aaatctatgc agaacataaa acaaaatctg aataaaaaga 37440aatatcatgt tcttgtctgg gaagacttaa tatcataaga aagtgaatta tatcaaaatt 37500taaatcgaaa tttaatgtat ttccatctct aatcagacag gacactatgg ggaactgaat 37560aagtgatttt aaaagtcatg gaaaattaat aactgagaat aaccatgaaa agtatgaaaa 37620aaggagacaa atgaattgct ccaacagata tcagaacgct aaaattaaat aaaaatacta 37680ctaggataag aaaatacata tactgatgta atgaataaag aatccagaat tagattccag 37740taagtcaaac tactttacta taaaccaggg gtggcatatt catccagtgg gaaaaggaca 37800gtaagaagtg agtaaactat ggcccactgg ccaaattgtg gcctctgcct atttttgcaa 37860ataaagtttt actgggacaa agccaagcct atcatttgca aattgtctat aaatattttc 37920atgttacaga atcacacagt ttcaacagag accatcttgt ctacaaagct gaaaatatct 37980actatctggc ccttgaagaa agtttgccaa accttagttt atataataaa agatcagcta 38040tctcatagac acctatctca cacaacacat tgtgggaaag gaccttcttt tttttttgag 38100acggggtctt gctctgttga ccaggctgga ctgtagtggc atgatcatgg ctcactgcag 38160cctcaacctc ccaggttcaa gtaatgctcc caccacagaa tcccaaacag ctgggagaga 38220tgtgtgccac tacgcctggc taaggggcct ttttaacaga gaaagaaatc cacatactac 38280taagaaaaag aagggcatat ttgatatata tttatatttt ttatatagat atcataaaaa 38340tcaagatgaa ttatacagtt atattttgca atgtgtttga cggtaaaagt ttaatatcta 38400taaaaattat tttataaaat atctttaata tatttataga tattataata taaaatatct 38460ataaaattat tttataaaat aaaaagttaa gaagaaaaga taggcaaaac aaaatacagt 38520gcaatttaca gaaaaccaag tccaaatggt caacaaagat aaaacagatt tataaactca 38580ctaagtgtga gagaattatt agttaaagta aaaatatctc tctataccca caatactact 38640aaaaatcaga gttataatgc cctattgctg gtggagatgt aaggggagaa gcatgctctc 38700atatactgtt agtgaaaatt taaactaata catttttgaa aagtaagctg gcaatttttt 38760ttttaatctc taccttttga tgcaaaaact catttttggg tacctattcc ataccttaaa 38820aaaaatacat atgcttactg tagtactgtt tataatggta aaaactagaa aaaaagaaaa 38880cttgatagtg aatactgaac aaattacagt gcatctacag attaaacata atgcagccat 
38940taaaaaagaa taaattaggc tgggtgcggt ggctcatgcc cgtaatccca gcactttggg 39000aggccaaagc aggcggatca cttgaggcca ggagttcgag accagcctgg ccaacatggc 39060aaaaccctgg ctctacaaaa aatacaaaaa ttagtcgggc atggtggtgg gcacctgtag 39120tcccagctac tcaggaggct gaggcaggag aatcacttga gcctgggaga cagagattgc 39180agtgagccaa gatcatgcca cagcattcca gtccaggtga cagaacgaga ctctgtctca 39240acaaaaagaa caaattaaac cctacaactc atcaacaaaa atacccaaac ccaattcaaa 39300aatgggcaaa ggacttgaat agacatttct tcaaggatga taaacaagca catgaaaaga 39360tgcagagcac tattcattag tgattacatc ccacatgcat taggatggct agtatgaaga 39420acagaaaata ataaatattg gtgaagatct gaaaaacaga aacctttgtg cactgttggt 39480gggaatgtaa agtggtacag ctactacgga aaacagtatg gccattcctc aagaaaataa 39540aaataaaatt atcttatgat aggaatatgc atttctgggt aaatacccca aataactgaa 39600aacagggtgt acacccattt caacatttac atgtcaattc aactgggcca gaatacccag 39660atatttgttc aaatattctt ctggatgctt ctatatatat gttttttggc tgaggttaac 39720atttaaattg gtggattctg agtacagcag attaccatcc acaatgtagg tgggcctcat 39780ctactcagtt gaaggtctta cagaaaaaga ctgacctccc ttgagcaaga aagaattcag 39840gcaacagact gcctttggac tcaactgcaa ctcttccttg agtcaacagc ccatcccatc 39900accctggctt ggtgagtcca gggtctgatg aggtaggctg cagactcaag gaagagctgc 39960caaaaccagg aaagccaatt cattaaaata aatctctctc tacacaaaca cacacacaca 40020ctaccaccac caccatgatg gttctgtttc tctggagaat gctaatacac ccctgttcat 40080ggcagcatta ttcacaatag ccaaaaggtg gaagcaactc cagcagatga atggagaagc 40140aaaatgtggt atgtatatac aatggaatat tattaagcct ttaaaaagtg gaaattatat 40200ctatctatat ctatacacac atactcacac acacacacac acatttatag aagacagggt 40260ttcaccatgt tgtcaaggct ggtctcgaac tcctgggctc aagcaaaccg cctgcctcag 40320cttcccaaag tgctgagatt acatgtgtga gccaccacac ccagccaaaa aaaggacatt 40380ctgacacata atacaatata gataaacaat gaggacatca tgatatgcga aataagcctg 40440tcacaaaaag gcaattagtg tatgattcct cttgtatgag gtacctatgg atgtcaaatc 40500cataaagtag aatggggaaa cagagagttg tttaatgggt atagagtttg ttttgcaaga 40560agaaaagagt tttggagaat gaatgtacaa cagtgtgaac ataattaaca ctactgaaaa 
40620tggttaagat tataaatttt atgttacatt tattttacca tgattaaaaa ttaaaacaaa 40680ataatattaa ggaaaaatac tataaataac aacaacaaaa aaaacacctc aagcaactta 40740cattcacctg ggaaacagaa tacatcctat tctgctagag atatatctgc agttcaaaat 40800ttattacaaa tgatgttgtg tatctttttg aaatgactga aaaactaaat taaaagcaat 40860aatattcagt ttactaacca gtaagtcctt ctttcatggt tcctgacttt tctgtaagat 40920gttattgcaa gatatctact aaaatggaaa acaactgaaa aggcaaaatt ataatttctt 40980atcaacatcg ctaaaaccct ggaggggaag aatcctaaca aacatggcca taatttgcca 41040catatttcta ctgtcctcac ttttcaaaat ccagaaatca acatttctgg aaacaaaaca 41100gagtctaaaa tttggctcct tcttcagttt agaaggtgcc aagttaatcc ctgacatcct 41160agtttccatt ttcaaaaatg tactttttct ctccccaaac cggtatctag attcttaaat 41220atttttagca catagaagtt aaatagattt gcttaaccaa aatagccagt aaacctccca 41280aaagaattaa aatattaatg gcgctttaat gatacaaatg aacaacttta cattcaatcg 41340tcaatgggaa aggaagcaga attctgagga ttatgaaagt aaacaaaacg aagttcaaat 41400tctactttat tttacttttt tgtaactaat gaacaacttc ttccaaagac aagtaggaaa 41460tacaaaaatt agccaggcat ggcacatgcc tgtagtcctg gttacttgga aggctgaagt 41520gggtggatcg cttgagccgg gaaggcagag gctgtagtga gctgagatca catcactgca 41580ctcaagcctg ggtgacagag caagaccctc tctggggaaa aaaaaaaaaa aaataggctg 41640ggcgcagtgg ctcacacttg taattccagc actttgggag gctgaggcag gtggttcacc 41700tgaggtcagg agttctagac cagcctgacc aatatggtga aaccctgtct ctactaaaaa 41760tacaaaaatt agccaggcat ggtggtgggc aattgtaatc ctagctactc gggaggctga 41820ggcaggaaaa tcgcctgaac ccaagaggcg gaggtttcag tgagccgaga ttgcactagt 41880gcactccagc ctgggcgaca gagcaagact tcatctcaaa ataaataaat aagtaagtaa 41940ataaaattaa aaaatatata aaaataaaac aaagataagt aggaaccatc cttttttttt 42000tttttttttt ttttttttaa agatagggtc tgtttctgat gcccaggctt gagtgtagtg 42060gcatgatcat ggctcactgc aaccttgacc tctcaaatac aagtgactct cctacctcag 42120cctcccaagt agctgggact acaggtgctt accaccccat ccggctcatt taaaaaaatt 42180tttttgtaga ggtggggtct cactatgttg tatccaggct ggtctcattt taactttatt 42240agaaaacaag cattgtttta tcagcttctt gtttttttaa aactaaaaat aacactgcta 
42300ggttgtttct atgaagattc tctaaattta tttataacct taagaataac atgtagaaca 42360aagtagatga ctgaatgatc tttgttgaat aaatatgaat ggatattcaa ataattaaaa 42420atctcttaag atctcccatt ctttacagga tacagagaaa actcgttaat atggcctgac 42480ttttaccttt gcagccttat ccaaactctg tggtcaagac aaacaggttg tccttatact 42540tacaacgtcc ccctttgcct acaaagctct tctcatgact ctttgcctat cttaagttca 42600cctatctgtc aaatctctgg gaatgcaaca tttcctcaag gtagccttct ctcctcccaa 42660actagaacaa attcttcctg gggcattagg tttttattgc actgtatgtc tcttcttcac 42720agcaatcaca gttccaatgt tatatttgta ttcttagttg atttgtttct ttccaccttt 42780agactataac cttctaaggg gtcacacata atatcgatca tcagttgtat cccttgtgca 42840tagcacaggg catggcaggc aaatatgtgt gtaaataaac ttgttgaatg aatcaatgag 42900acacactttt cttacccaaa gtataatggc aggataacat ttatcaatct attgcttctt 42960gaaaaacaga tatgatgtgc ttaattttca ttttacatct caaataccaa tgcctaagga 43020attcacagtc attttacaaa tctttttgac aaatgccttc attaatcacc acctgtttac 43080aagtgctaaa taacattttg gttacattct gtaacatttc ctgcacttaa tgtcatctct 43140agaatactgg ctaatatgaa gcacctggac ttcaggaaca caaacctgaa actaacacac 43200caaactaaac tgttatgtaa atgacagaaa tgacacattt tggtctgcaa catctctaga 43260tggcttttgg accaattcaa cttttaccac taaaaatcgg tcacctgact atagtcattt 43320tgagctcatg ataaatgaat tacagatgaa aaataaatag tttgatgaca atctttacaa 43380aagtttatct tcaaagaata ccaccagtca caggtattct aggctcctat caacttattt 43440ggtcagggca gacttcactt ttcatgataa ttatgttctg aaaattctac aaacttaatg 43500attacaaaca aaagtcatag tttgctcata aatcaggcct aggtctggat tctagttctt 43560ccatttttca tttgttcact gaggcaagtg acttaaaatt ccctagcctc agtttcctca 43620catgtaaaat cagataatga ttcctattcc taagatggtt ttgaggcttc aacaagataa 43680gatgggcctc actcaagcat gctcagtact ctgtctctct ctctccggtt atgcagaaat 43740tctattagga ttctgcaaag taaaataaat atttcagtaa aaattatgcc ctttattaat 43800gaatctagat tttcagattt tccttaaatt tacttagtaa cttaagggct caaatattat 43860agagatttgt atctagtatt ttaaagaaat gaaaggtgtt aatcaaaatg ctgcacaaat 43920aaatgctaca tttaacaaac agaatatcac aaccatacaa actaatcaga tataaagaag 
43980tcagcaacag aaatctgatg ttgcctttag atcacacaat taggcaaaca aaaatagagt 44040tccatcctcc tttggtcaag gccatggttg aagactgaat accaaatagg gaaataggaa 44100aagccaggaa atggcaaatt agcaaaaact ggactcctta atttttatat tcattttcat 44160atctcacttc taaaacttta attaaattca aataaaaacc aaaatggaac tgagataaag 44220ccaaaaggaa agttatgtag gtcaaatgag aacctatatt gtccttaggc tctttgttgc 44280tttctgttta aggaaaaact gcccaagtgc cttgacacat taaagatcaa gcaggaggtt 44340ctgccgagag tccccatctg gcagccaggt tttgtcaagc aaattttgag aattctctac 44400cctcccactt tctatctaat tatagcactt tataaaaacc attctctctc tgtctctgtc 44460tctctctctc tctctctctc acacacacac acacacacac acacacacac acacacaccc 44520tttctctctc tctctctctg aaacttatct gtattataat aacacaacac taggtatgga 44580ttaatctgac aattttcccc taaaacagaa taaattcaaa aaggaaaacc tttcctctgt 44640acacatgcac tatattctga caataataat tcctaaatta agtataatac attttcccta 44700caggagttta aagaagttac agtaaagaat ctcttgtata aatatatatg ccagaacttg 44760acccaaataa gtgctgagag gtataaatct caaaacagtt tccggactct ttgtgaaatg 44820tcttcagagt ctgcgatata ttttcttcaa ctaaattata caagtaagat attttgctgg 44880gctgtgggaa tgccttacgg catgttactg tggagctcat ggtaaaatag aaagaatata 44940aataattaaa ataaaattga caaatgataa atgatttaat aaattagaaa ttcaaatgcc 45000gggcactttt ctagaacctg gacacaaagc atgaacctaa caataacccc gccttcatga 45060aaaatatgga ctatttgaaa attatacctg caacactaaa taaatattct tcattcttcc 45120agtatattga gatgtttact ttcaattaga caatttgctt tcctctctga acacatagtt 45180atgtgatggc tctataaaag attttaaaat aactatagaa ggaactattg gtaaagactg 45240tgggatacta aaaatggcta caaagaaagt tatgacaaaa cctctgagtt tgaatggaag 45300tcctactaga ttagagtcta agcctgtgac attatgcttc tggttcttgt tcttaaatgc 45360ttttctcatt aatagtatgt aacttacttc ctggaatgcc attcattaaa aaaatattta 45420atatttgcta aatgtcaata tttatgccag cacttttaaa gtacagaaac atggagtttc 45480tttacctcat gcaaatatgc tgtgagaaag acttaagagc ctattgccta ctttgtggta 45540caacactgaa gactcaccat ccaaaacaaa cagacttagt aaattcttgt gatttgcagt
45600agttctgttc tataaggtta ccacaaacac tgaaatcatc gctcctgggg gaatacaagg 45660ttatgtttcc gtgagccctc ggtcacaaca tgttcattaa ctgatcaata cataaccttg 45720ttctatgtgt gtttctgttt aaaaagagca cttcagtgct acatttggag tctgttttaa 45780acagcaaaat cactaataaa aagcacaaaa atgtaaaagc atggcactac atacactgtg 45840acaagaaggc ttgtttatag tatgacagct gagacaagaa ggtagagcct cgctttgatc 45900aacctctgct gggaaatgag catcaggtga atcaattttt caccactctg aatgaccgta 45960aaagtgctcc aagtactgac tttggggtta cacataaatt ttagtaagca tgtgaatctg 46020ccaatatgaa atctacaaat aatgagtacc aaatgcatat gagtcaaata tttcagtgcg 46080gtatctgact tgattgccac tgaaagacac agtttggaaa acccctaata aataccgttt 46140agttactatg cagacaaaga gttctacact agagtgcttc aattaagatg tctgaggctt 46200tcataaatgg atgtttttta aaatgttatt tcctacctga tatattctaa aggggatata 46260acgaaatcca ttttcttctg caggatattc catgagtttc cgattgatgg cccaaaactg 46320gtcaaatctg tctgtaatga 4634067773DNAHomo sapiens 67actgagagac aggactagct ggatttccta ggctgactaa gaatccctaa gcctagctgg 60gaaggtgacc acatccacct ttaaacacgg ggcttgcaac ttagctcaca cctgaccaag 120gaaggtgacc acaccctcct ttaaacacag agcttgtaac tcagctcaca cccgaccaat 180caggtagtaa agagagctca ctaaaatacc aattaggcta aaaacaggag gtaaagaaat 240aatcaaatca tctatcgcct gagagcacag ggggagggac aatgatcggg atataaaccc 300aggcatttga gccagatcag gtaaccctct ttgggtcccc tcacactgta tgggagctct 360gttttcactc tattaaatct tgcaactgca cactcttctg gtccatgttt gttccggctc 420aagctgagct tttgctcgcc gtccaccact gctgaatgcc gccattgcag acctgccctt 480gacttccacc cctccggatc cggcagagtg tccgctgcac tcctgatcca gcgaggcacc 540cattgccact cccgatcagg ctaaaggctt gccattgttc ctgcacagct aagtgcctgg 600gttcatccta atcaggctga acactggtcg ctgggttcca cggttctctt ccatgactca 660cagcttctaa tagagctata acactcacca catggcccaa ggttccattc gttggaatcc 720atgaggccaa gaaccccagg tcagagaata aaaggcccgc cccatcttgg gag 7736810PRTHomo sapiens 68Phe Leu Gly Glu Glu Cys Cys Tyr Tyr Val1 5 106910PRTHomo sapiens 69Leu Leu Phe Gly Pro Cys Ile Phe Asn Leu1 5 107010PRTHomo sapiens 70Cys Leu Pro Leu Asn Phe Arg Pro Tyr Val1 5 
107110PRTHomo sapiens 71Gly Leu Leu Ser Gln Trp Met Pro Trp Ile1 5 10729PRTHomo sapiens 72Cys Leu Pro Ser Gly Ile Phe Phe Val1 5739PRTHomo sapiens 73Trp Met Pro Trp Ile Leu Pro Phe Leu1 57410PRTHomo sapiens 74Ile Arg Trp Val Thr Pro Pro Thr Gln Ile1 5 107510PRTHomo sapiens 75Leu Arg Asn Thr Gly Pro Trp Gly Leu Leu1 5 107610PRTHomo sapiens 76Leu Arg Thr His Thr Arg Leu Val Ser Leu1 5 107710PRTHomo sapiens 77Lys Arg Val Pro Ile Leu Pro Phe Val Ile1 5 107810PRTHomo sapiens 78Cys Arg Cys Met Thr Ser Ser Ser Pro Tyr1 5 107910PRTHomo sapiens 79Thr Arg Val His Gly Thr Ser Ser Pro Tyr1 5 108010PRTHomo sapiens 80Ala Arg Glu Lys His Val Lys Glu Val Ile1 5 108110PRTHomo sapiens 81Ser Arg Ile Glu Ala Val Lys Leu Gln Met1 5 108210PRTHomo sapiens 82Ser Gln Trp Met Pro Trp Ile Leu Pro Phe1 5 10839PRTHomo sapiens 83Cys Tyr Tyr Val Asn Gln Ser Gly Ile1 5849PRTHomo sapiens 84Phe Tyr Tyr Lys Leu Ser Gln Glu Leu1 5859PRTHomo sapiens 85Thr Tyr Thr Thr Asn Ser Gln Cys Ile1 5869PRTHomo sapiens 86Ser Phe Leu Val Pro Pro Met Thr Ile1 5879PRTHomo sapiens 87Tyr Tyr Val Asn Gln Ser Gly Ile Val1 5889PRTHomo sapiens 88Leu Phe Asn Thr Thr Leu Thr Gly Leu1 5899PRTHomo sapiens 89Leu Phe Gly Pro Cys Ile Phe Asn Leu1 5909PRTHomo sapiens 90Arg Trp Val Thr Pro Pro Thr Gln Ile1 59110PRTHomo sapiens 91Leu Pro Phe Leu Gly Pro Leu Ala Ala Ile1 5 109210PRTHomo sapiens 92Leu Pro Tyr His Ile Phe Leu Phe Thr Val1 5 109310PRTHomo sapiens 93Gly Ala Leu Gly Thr Gly Ile Gly Gly Ile1 5 109410PRTHomo sapiens 94Leu Pro Phe Val Ile Gly Ala Gly Val Leu1 5 10959PRTHomo sapiens 95Arg Arg Pro Leu Asp Arg Pro Ala Ser1 5969PRTHomo sapiens 96Phe Arg Pro Tyr Val Ser Ile Pro Val1 5979PRTHomo sapiens 97Arg Arg Ala Leu Asp Leu Leu Thr Ala1 5989PRTHomo sapiens 98Trp Arg Met Gln Arg Pro Gly Asn Ile1 59910PRTHomo sapiens 99Asp Arg Ile Gln Arg Arg Ala Glu Glu Leu1 5 1010010PRTHomo sapiens 100Leu Arg Thr His Thr Arg Leu Val Ser Leu1 5 1010110PRTHomo sapiens 101Glu Arg Val Ala Asp Ser Leu Val Thr Leu1 5 1010210PRTHomo sapiens 102Leu 
Phe Gly Pro Cys Ile Phe Asn Leu Leu1 5 1010310PRTHomo sapiens 103Gln Phe Tyr Tyr Lys Leu Ser Gln Glu Leu1 5 1010410PRTHomo sapiens 104Gln Trp Met Pro Trp Ile Leu Pro Phe Leu1 5 1010510PRTHomo sapiens 105Cys Tyr Tyr Val Asn Gln Ser Gly Ile Val1 5 1010610PRTHomo sapiens 106Asn Phe Val Ser Ser Arg Ile Glu Ala Val1 5 101079PRTHomo sapiens 107Gly Pro Leu Val Ser Asn Leu Glu Ile1 51089PRTHomo sapiens 108Leu Pro Leu Asn Phe Arg Pro Tyr Val1 510910PRTHomo sapiens 109Leu Pro Phe Leu Gly Pro Leu Ala Ala Ile1 5 1011010PRTHomo sapiens 110Glu Pro Lys Met Gln Ser Lys Thr Lys Ile1 5 1011110PRTHomo sapiens 111Leu Pro Tyr His Ile Phe Leu Phe Thr Val1 5 101129PRTHomo sapiens 112Arg Glu Lys His Val Lys Glu Val Ile1 511310PRTHomo sapiens 113Lys Pro Arg Asn Lys Arg Val Pro Ile Leu1 5 101149PRTHomo sapiens 114Val Val Leu Gln Asn Arg Arg Ala Leu1 511510PRTHomo sapiens 115Ala Val Val Leu Gln Asn Arg Arg Ala Leu1 5 101169PRTHomo sapiens 116Leu Pro Phe Val Ile Gly Ala Gly Val1 51179PRTHomo sapiens 117Asp Leu Tyr Ser Tyr Val Ile Ser Lys1 511810PRTHomo sapiens 118Thr Glu Gln Asp Leu Tyr Ser Tyr Val Ile1 5 101192615DNAHomo sapiens 119gaattccggg aagccagacg gttaacacag acaaagtgct gccgtgacac tcggccctcc 60agtgttgcgg agaggcaaga gcagcgaccg cgcacctgtc cgcccggagc tgggacgcgc 120gcccgggcgg ccggacgaag cgaggaggga ccgccgaggc tgcccccaag tgtaactcca 180gcactgtgag gtttcaggga ttggcagagg ggaccaaggg gacatgaaaa tggacatgga 240ggatgcggat atgactctgt ggacagaggc tgagtttgaa gagaagtgta catacattgt 300gaacgaccac ccctgggatt ctggtgctga tggcggtact tcggttcagg cggaggcatc 360cttaccaagg aatctgcttt tcaagtatgc caccaacagt gaagaggtta ttggagtgat 420gagtaaagaa tacataccaa agggcacacg ttttggaccc ctaataggtg aaatctacac 480caatgacaca gttcctaaga acgccaacag gaaatatttt tggaggatct attccagagg 540ggagcttcac cacttcattg acggctttaa tgaagagaaa agcaactgga tgcgctatgt 600gaatccagca cactctcccc gggagcaaaa cctggctgcg tgtcagaacg ggatgaacat 660ctacttctac accattaagc ccatccctgc caaccaggaa cttcttgtgt ggtattgtcg 720ggactttgca gaaaggcttc actaccctta tcccggagag 
ctgacaatga tgaatctcac 780acaaacacag agcagtctaa agcaaccgag cactgagaaa aatgaactct gcccaaagaa 840tgtcccaaag agagagtaca gcgtgaaaga aatcctaaaa ttggactcca acccctccaa 900aggaaaggac ctctaccgtt ctaacatttc acccctcaca tcagaaaagg acctcgatga 960ctttagaaga cgtgggagcc ccgaaatgcc cttctaccct cgggtcgttt accccatccg 1020ggcccctctg ccagaagact ttttgaaagc ttccctggcc tacgggatcg agagacccac 1080gtacatcact cgctccccca ttccatcctc caccactcca agcccctctg caagaagcag 1140ccccgaccaa agcctcaaga gctccagccc tcacagcagc cctgggaata cggtgtcccc 1200tgtgggcccc ggctctcaag agcaccggga ctcctacgct tacttgaacg cgtcctacgg 1260cacggaaggt ttgggctcct accctggcta cgcacccctg ccccacctcc cgccagcttt 1320catcccctcg tacaacgctc actaccccaa gttcctcttg cccccctacg gcatgaattg 1380taatggcctg agcgctgtga gcagcatgaa tggcatcaac aactttggcc tcttcccgag 1440gctgtgccct gtctacagca atctcctcgg tgggggcagc ctgccccacc ccatgctcaa 1500ccccacttct ctcccgagct cgctgccctc agatggagcc cggaggttgc tccagccgga 1560gcatcccagg gaggtgcttg tcccggcgcc ccacagtgcc ttctccttta ccggggccgc 1620cgccagcatg aaggacaagg cctgtagccc cacaagcggg tctcccacgg cgggaacagc 1680cgccacggca gaacatgtgg tgcagcccaa agctacctca gcagcgatgg cagcccccag 1740cagcgacgaa gccatgaatc tcattaaaaa caaaagaaac atgaccggct acaagaccct 1800tccctacccg ctgaagaagc agaacggcaa gatcaagtac gaatgcaacg tttgcgccaa 1860gactttcggc cagctctcca atctgaaggt ccacctgaga gtgcacagtg gagaacggcc 1920tttcaaatgt cagacttgca acaagggctt tactcagctc gcccacctgc agaaacacta 1980cctggtacac acgggagaaa agccacatga atgccaggtc tgccacaaga gatttagcag 2040caccagcaat ctcaagaccc acctgcgact ccattctgga gagaaaccat accaatgcaa 2100ggtgtgccct gccaagttca cccagtttgt gcacctgaaa ctgcacaagc gtctgcacac 2160ccgggagcgg ccccacaagt gctcccagtg ccacaagaac tacatccatc tctgtagcct 2220caaggttcac ctgaaaggga actgcgctgc ggccccggcg cctgggctgc ccttggaaga 2280tctgacccga atcaatgaag aaatcgagaa gtttgacatc agtgacaatg ctgaccggct 2340cgaggacgtg gaggatgaca tcagtgtgat ctctgtagtg gagaaggaaa ttctggccgt 2400ggtcagaaaa gagaaagaag aaactggcct gaaagtgtct ttgcaaagaa acatggggaa 2460tggactcctc 
tcctcagggt gcagccttta tgagtcatca gatctacccc tcatgaagtt 2520gcctcccagc aacccactac ctctggtacc tgtaaaggtc aaacaagaaa cagttgaacc 2580aatggatcct taagattttc agaaaacact tattt 261512029PRTHomo sapiens 120Leu Gln Asn Arg Arg Ala Leu Asp Leu Leu Thr Ala Glu Arg Gly Gly1 5 10 15Thr Cys Leu Phe Leu Gly Glu Glu Cys Cys Tyr Tyr Val 20 2512121DNAHomo sapiens 121cttcaaacaa caaccaggag g 2112220DNAHomo sapiens 122ttggggaggt tggccgacga 2012332PRTHomo sapiens 123Asn Ser Leu Ala Ala Val Val Leu Gln Asn Arg Arg Ala Leu Asp Leu1 5 10 15Leu Thr Ala Glu Ser Gly Gly Thr Phe Leu Phe Leu Glu Glu Lys Cys 20 25 3012432PRTHomo sapiens 124Asn Ser Leu Ala Ala Val Val Leu Gln Asn Arg Arg Ala Leu Asp Leu1 5 10 15Leu Thr Ala Glu Arg Gly Gly Thr Cys Leu Phe Leu Gly Glu Glu Cys 20 25 3012532PRTHomo sapiens 125Asp Ser Leu Ala Ala Val Thr Leu Gln Asn His Gln Gly Leu Asp Leu1 5 10 15Leu Thr Ala Glu Lys Gly Gly Leu Cys Tyr Phe Leu Gly Glu Asp Cys 20 25 3012632PRTHomo sapiens 126Asp Ser Leu Ala Ala Val Thr Leu Gln Asn His Gln Gly Leu Asp Leu1 5 10 15Leu Ile Ala Glu Lys Gly Gly Leu Cys Thr Phe Leu Gly Glu Glu Cys 20 25 3012732PRTHomo sapiens 127Asp Ser Leu Ala Ala Val Thr Leu Gln Asn Cys Arg Gly Leu Asp Leu1 5 10 15Leu Thr Ala Glu Lys Gly Gly His Tyr Thr Phe Leu Gly Glu Glu Cys 20 25 3012818PRTHomo sapiens 128Leu Gln Asn Arg Arg Gly Leu Asp Leu Leu Phe Leu Lys Glu Gly Gly1 5 10 15Leu Cys12932PRTType D retrovirus 129Asp Ser Leu Ala Lys Val Val Leu Gln Asn Arg Arg Gly Leu Asp Leu1 5 10 15Leu Thr Ala Glu Gln Gly Gly Ile Cys Leu Ala Leu Gln Glu Lys Cys 20 25 30130571DNAArtificial SequenceSynthetic DNA 130ttcctgagtt cttgcactaa cctcaaatga gagaagtgcc gccataactg caacccaaga 60gtttggcgat ccctggtatc tcagtcaggt caatgacagg atgacaacag aggaaagata 120atgattcccc acaggccagc aggcagttcc cagtgtagac cctcattagg acacagaatc 180agaacatgga gattggtgcc gcagacattt gctaacttgc gtgctagaag gactaaggaa 240aactaggaag atatgaatta ttcaatgatg tccactataa cacaggggaa aggaagaaaa 300tcctactgcc tttctggaga gactaaggga ggcattgagg aagcatacca ggcaagtgga 
360cattggaggc tctggaaaag ggaaaagttg ggaaaagtat atgtctaata gggcttgctt 420ccagtgtggt ctacaaggac actttaaaaa agattgtcca atagaaataa gccaccacct 480cgtccatgcc ccttatgtca agggaatcac tggaaggccc actgccccag gggatgaagg 540tcctctgagt cagaagccac taaccagatg a 571131359DNAArtificial SequenceSynthetic DNA 131aacttgcgtg ctagaaggac taaggaaaac taggaagact atgaattatt caatgatgtc 60cactataaca caggggaaag gaagaaaatc ctactgcctt tctggagaga ctaagggagg 120cattgaggaa gcataccagg caagtggaca ttggaggctc tggaaaaggg aaaagttggg 180caaattgaat gcctaatagg gcttgcttcc agtgcagtct acaaggacgc tttagaaaag 240attgtccaag tagaaataag ccgcccctcg tccatgcccc ttatgtcaag ggaatcactg 300gaaggcctac tgccccaggg gacgaaggtc ctctgagtca gaagccacta acctgatga 359132694DNAArtificial SequenceSynthetic DNA 132cttccccaac taataaggac ccccctttca acccaaacag tccaaaagga catagacaaa 60ggagtaaaca atgaaccaaa gagtgccaat attccctggt tatgcaccct ccaagcggtg 120ggagaagaat tcggcccagc cagagtgcat gtaccttttt ctctctcaca cttgaagcaa 180attaaaaata gacntaggtn aattntcaga tagccctgat ggytatattg atgttttaca 240aggattagga caatcctttg atctgacatg gagagatata atattactgc taaatcagac 300gctaacctca aatgagagaa gtgctgccat aactggagcc cgagagtttg gcaatctctg 360gtatctcagt caggtcaatg ataggatgac aacggaggaa agagaacgat tccccacagg 420gcagcaggca gttcccagtg tagctcctca ttgggacaca gaatcagaac atggagattg 480gtgccgcaga catttactaa cttgcgtgct agaaggacta aggaaaacta ggaagactat 540gaattattca atgatgtcca ctataacaca ggggaaagga agaaaatcct actgcctttc 600tggagagact aagggaggca ttgaggaagc ataccaggca agtggacatt ggaggctctg 660gaaaagggaa aagttgggca aattgaatgc ctaa 694133494DNAArtificial SequenceSynthetic DNA 133cttccccaac taataaggac ccccctttca acccaaacag tccaaaagga catagacaaa 60ggagtaaaca atgaaccaaa gagtgccaat attccctggt tatgcaccct ccaagcggtg 120ggagaagaat tcggcccagc cagagtgcat gtaccttttt ctctctcaca cttgaagcaa 180attaaaatag acntaggtna attntcagat agccctgatg gntatattga tgttttacaa 240ggattaggac aatcctttga tctgacatgg agagatataa tattactgct aaatcagacg 300ctaacctcaa atgagagaag tgctgccata actggagccc gagagtttgg 
caatctctgg 360tatctcagtc aggtcaatga taggatgaca acggaggaaa gagaacgatt ccccacaggg 420cagcaggcag ttcccagtgt agctcctcat tgggacacag aatcagaaca tggagattgg 480tgccgcagac attt 494134184PRTArtificial SequenceSynthetic Peptide 134Arg Thr Pro Leu Ser Thr Gln Thr Val Gln Lys Asp Ile Asp Lys Gly1 5 10 15Val Asn Asn Glu Pro Lys Ser Ala Asn Ile Pro Trp Leu Cys Thr Leu 20 25 30Gln Ala Val Gly Glu Glu Phe Gly Pro Ala Arg Val His Val Pro Phe 35 40 45Ser Leu Ser His Leu Lys Gln Ile Lys Ile Asp Gly Ser Asp Ser Pro 50 55 60Asp Gly Tyr Ile Asp Val Leu Gln Gly Leu Gly Gln Ser Phe Asp Leu65 70 75 80Thr Trp Arg Asp Ile Ile Leu Leu Leu Asn Gln Thr Leu Thr Ser Asn 85 90 95Glu Arg Ser Ala Ala Ile Thr Gly Ala Arg Glu Phe Gly Asn Leu Trp 100 105 110Tyr Leu Ser Gln Val Asn Asp Arg Met Thr Thr Glu Glu Arg Glu Arg 115 120 125Phe Pro Thr Gly Gln Gln Ala Val Pro Ser Val Ala Pro His Trp Asp 130 135 140Thr Glu Ser Glu His Gly Asp Trp Cys Arg Arg His Leu Leu Thr Cys145 150 155 160Val Leu Glu Gly Leu Arg Lys Thr Arg Lys Thr Met Asn Tyr Ser Met 165 170 175Met Ser Thr Ile Thr Gln Gly Lys 180135891DNAArtificial SequenceSynthetic DNA 135gtctacctag ccaaggcata ttcttcttat gtggaacatc aacctatatc tgcctcccca 60ctaactggac aggcacctga accttagtct ttctaagtcc caacattaac attgccccag 120gaaatcagac cctattggta cctgtcaaag ctaaagtccc gtcagtgcag agccatacaa 180ctaatatccc tatttatagg gttaggaatg gctactgcta caggaactgg aatagccggt 240ttatctactt cattatccta ctaccataca ctctcaaaga atttctcaga cagtttgcaa 300gaaataatga aatctattct tactttacaa tcccaattag actctttggc agcaatgact 360ctccaaaacc gccgaggccc acacctcctc actgctgaga aaggaggact ctgcaccttc 420ttaggggaag agtgttgttt ttacactaac cagtcaggga tagtacgaga tgccacctgg 480catttacagg aaagggcttc tgatatcaga caatgccttt caaactctta taccaacctc 540tggagttggg caacatggct tcttccattt ctaggtccca tggcagccat cttgctgtta 600ctcacctttg ggccctgtat ttttaagctt cttgtcaaat ttgtttcctc taggatcgaa 660gccatcaagc tacagatggt cttacaaatg gaaccccaaa tgagttcaac taacaacttc 720taccaaggac ccctggaacg atccactggc acttccacta gcctagagat 
tcccctctgg 780aagacactac aactgcaggg ccccttcttt gcccctatcc agcaggaagt agctagagcg 840gtcatcggcc aaattcccaa cagcagttgg ggtgtcctgt ttagaggggg g 891136568DNAHomo sapiens 136actttacaat cccaaataag actctttggc agcagtgact ctccaaaacc gctgaggcct 60agatctcctc actgctgaaa aaggaggact ctgcaccttc ttaggggaag agtgttgttt 120ttacactaac cagtcaggga tagcatgaga tgccacccag cgtttacagg aaaaggcttc 180tgaaatcaga cgcctttcaa attcttatac caacctctgg agttgggcaa catggcttct 240cccctttcta ggtcccgtgg cagccatctt gctgttactc gcctttgggc cccgtatttt 300taaccttctt gtcaaatttg tttggtctag aatcgaggcc atcaagctac agatggtctt 360acaaatcgaa ccccaaatga gttcaactaa caacttctac cgaggacccc tggactgacc 420agctggcact tcccctggcc tagagagttc ccctctgaag gacactacaa ctgcaaagcc 480ccttcttcgc ccctatccag caggaagtag ctagagcagt catcggccaa attcccaaca 540gcagttgggg tgtcctgttg attgaggg 5681373273DNAHomo sapiens 137agttgcaatt ccttgcctcc actgtgagac aaaccccaga cacatctcca gcacacaaga 60acttcgaaat gcctcaacct caggtgccag gggttcctcc agaaccttct cccccaggag 120cttgctacaa gtgccagaaa
tctggccact gggccaagga atgcccacag accaggattc 180ctcctaagct gtatcccatc tctgtgggac cccactaaaa atcagactgt tcaactcacc 240tggcagccac ttccagagcc cctggaactc tagcccaagg ctctctgact gaccccttct 300gagatcttct tggcttagca gctgaagact gacactgcca gatcgcctcg gaagcctaca 360ggaccatcac agatgctcca ggtaactctc acagtagagg gtaagtctgt ccccttctta 420atcaatatgg aggctaccca ctgcacatta ccttcttttc aagggcctgt ttcctttgcc 480tccataactg ttgtgggtat tgacggccag gcttctaaac ctcttaaaac tccccaactc 540tagtaccaac ttagacaata ctcttttaag cactcctttt tagttatccc cacttgccca 600gttcccttat gaggccgaga cacttcaact aaattatctg cttccctgac tattcctgga 660ctacagctac atctcattgc tgcccttctt cccaatccaa agcctccttt gcatcttctt 720gtatccccca accttaaccc acaagtataa gatacctcta ttccctcctt ggtgaccaat 780catgcacccc ttaccatctc attaaaacct aatcactctt acccggctca atgccaagat 840cccatcccac agcatgcttt aaaaggatta aaacctgtta tcactcgcct gctagagcat 900ggccttttaa agcctataaa ctctccttac aattccccca ttttacctgt cctagaacca 960gacaagcctt acaggttcag gatctgtgtc ttatcaatga aattgttttc cctatccacc 1020ctgtggtgct gaacccatat actctcctat cctcaatacc tccctctaca acccattatt 1080ctgttctaga tctcaaacat gctttcttta ctatcccttt acacccttca acccagcctc 1140tcttcgtttt cacctggact gaccctgaca cccatcagtc ccagcagctt acctgggctg 1200taatgctgca aggtttcagg ggcagccctt attatttcag ccaagctctt tctcatgatt 1260tactttcttt ccacccctcc acttctcacc ttattcaata tattggtgat gttcttcttt 1320gtagcccctc ctttgaatct tctcaacaag acacacttct gctccttcag catttattct 1380ccaaaggata tccccctcca aagctcaaat gtcttctcca tccgttacct accttggcat 1440aattcttcat aaaaacacac gtgccctccc tgctgatagt gtctgactga tctctcaaac 1500cccaacccct tctacaaaac aacaactctt ttccatccta ggcatggttg gatactttcg 1560tgttaggata cctggttttg ccatcctaac aaaaccatta tataaactca caaaaggaaa 1620cctagttgac cccatagatc ctaaatcgtt tccccactcc tctttccatt ccttgaagac 1680agctttagag actgtctcca ctctagctct ccctgactca tcccaacact tttcattaca 1740cacagctgaa gtgcagggct gtgcagtcag aattcttaca caaggaccgg gatcgcatcc 1800tgtagccttt ttgtccaaac aacttgacct tactgtttta ggctggccat catgtctcca 
1860tgcagcgtct gctgccaccc taatactttt agaggccctc aaaatcacaa actatgctca 1920actcattctc tacagctctc ataatttcca aaatctattt tcttcctcac acctgacaca 1980tatactttct gctccccggc tccttcagat atactcactc catttattct cccacaatta 2040ccattattcc tggcctggac ttcaatccgg cctcccacat tattctggat accatacctg 2100accctcatga ctgcatctct ctgatccacc tgacgttcac cccatttccc cacatttcct 2160tctgccctgt ttctcaccct gatcacactt ggtttattga tggcagttcc accaggccta 2220atcgccactc accagcaaag gcaggatatg ctatgaacta gttgccttaa ttcaagccct 2280cactcttgca aaaggactac gtgtcaatat ctatactgat tctaaatatg cctttcatat 2340tctgcaccac catgcggtca tatgggctga aagaggtttc ctcactacac aagtgtcctc 2400catcattaat gcctctttaa gaaaactctg ctcaaggctg ctttacttcc aaaggaagct 2460ggggtcattc actgcaaggg gcatcaaaag acttcagatc ccattgctct aggcaatgct 2520tatgctgata aggtggctag acaagcagct agctctccaa cttttgtccc tcatggccag 2580tttttctcct tcacatccgt cactcccacc tactccacag ctgaaacttc cacctatcaa 2640gctcttcccc cgcaaggtaa atggttctta gaccaaggaa aatatctcct tccagcctca 2700caggcccatt ctattctgtc gtcatttcat aaccttttcc atgtaggtta caagccacta 2760gcctgtctct taggacctct catttccttt ccatcatgga aatctatcct caaggagatc 2820acttctcagt gttccatctg ctattctgct acccctcagg gattgttcag gcctcctccc 2880tttcctacac ataaagctcg gggatttgcc cctgcccagg actggcaaat tgactttact 2940cacatgcctc gggtcagaaa actaaaatat ctcttagtct gggtagacac tttcactggg 3000tgggtagagg cctttcccat agagtctgag aaggccaccg cggtcatttc ttcccttctg 3060tcagacataa ttccttggtt tggccttccc ttctctatac agtctgataa cggaccagcc 3120tttactagtt aaatcaccca agcagtttct caggctcttg gtattcagtg gaaccttcat 3180atcccttaac atcctcaatc ttcaggaaag gtaaaaccga ctaatggtct tttaaagaca 3240cacctcacca agctcagcct ccaacttaaa aag 3273138823PRTHomo sapiens 138Ser Ser Phe Arg Arg Thr Lys Asn Arg Pro Leu Pro Trp Arg Arg Leu1 5 10 15Ala Thr Asp Phe Thr His Lys Pro Lys Pro Gln Gly Phe Gln Tyr Leu 20 25 30Leu Val Trp Val Asp Thr Phe Thr Gly Trp Ala Glu Ala Phe Pro Cys 35 40 45Arg Thr Glu Lys Ala Gln Glu Val Ile Lys Ala Leu Val His Glu Ile 50 55 60Ile Pro Arg Phe Gly Leu Pro Arg 
Gly Leu Gln Ser Asp Asn Ser Pro65 70 75 80Ala Phe Gln Ala Thr Val Thr Gln Gly Val Ser Gln Ala Leu Gly Ile 85 90 95Arg Tyr His Leu His Cys Ala Arg Pro Gln Ser Ser Gly Lys Val Glu 100 105 110Lys Met Asn Glu Thr Leu Lys Gly His Leu Lys Lys Gln Thr Gln Glu 115 120 125Thr His Leu Thr Trp Pro Ala Leu Leu Pro Ile Ala Leu Lys Arg Ile 130 135 140Cys Asn Phe Pro Gln Lys Ala Gly Leu Ser Pro Tyr Glu Met Leu Tyr145 150 155 160Gly Arg Pro Phe Ile Thr Asn Asp Leu Val Leu Asp Pro Arg Gln Pro 165 170 175Thr Leu Gln Thr Ser Pro Pro Pro Asn Ile Asn Lys Phe Leu Lys His 180 185 190Tyr Lys Glu Pro Ile Pro Glu Lys Arg Glu Lys Asn Tyr Ser Thr Leu 195 200 205Val Thr Trp Tyr Ser Ser Pro Phe Pro Leu Ile Pro His Pro Ile His 210 215 220Pro Gly Lys Asp Pro Thr Gln Ser Phe Tyr Leu Pro Gln Leu Arg Leu225 230 235 240Lys Trp Leu Glu Trp Ser Leu Gly Tyr Ile Thr Leu Glu Ser Asn Pro 245 250 255Gly Tyr Cys Gln Arg Asn Leu Lys Ile Gln Glu Thr Thr Leu Ala Ile 260 265 270Pro Val Asn Leu Arg Ile Cys Ala Cys Ser Ser Asn Asn Asn Gln Glu 275 280 285Glu Ser Asn Asn His Lys Ser Pro Trp Pro Ser Leu Ile Ile Phe Phe 290 295 300Ser Leu Leu Phe Phe Tyr Pro Leu Ser Leu Ser Leu His Pro Leu His305 310 315 320Ala Ala Val Pro Val Ala Pro Leu Thr Lys Ser Phe Tyr Gly Glu Cys 325 330 335Ser Val Pro Glu Ile Leu Met Pro His Arg Ile Gly Val Phe Leu Arg 340 345 350Glu Pro Pro Pro Ser Leu Pro Thr Pro Ile Cys Pro Ala Thr Ala Ile 355 360 365Thr Leu Pro Leu Phe Ala Cys Met Gln Ile Leu Ile Ile Gly Gln Glu 370 375 380Lys Leu Ile Leu Val Val Leu Glu Asp Leu Glu Ser Leu Ser Val Gly385 390 395 400Leu Thr Ser Pro Lys Leu Val Cys Leu Met Gly Val Glu Phe Lys Ile 405 410 415Arg Gln Glu Lys Asn Met Lys Lys Ser Pro Asn Ser Pro Gly Tyr Met 420 425 430Ala Pro Leu Ala Pro Thr Lys Asp Ile Ser Gln Asn Tyr Met Lys Pro 435 440 445Ser Val Pro Ile Leu Ala Trp Ala Tyr Leu Ile Pro Pro Ser Leu Gly 450 455 460Ser Met Arg Ser Arg Pro Lys Thr Leu Leu Thr Val Gly Tyr Ala Ser465 470 475 480Pro Thr Ser Gly His Met Phe Gln Ser Leu Tyr Leu Asn Asn Gly Thr 485 
490 495Thr Ser Ala Gln Lys Thr Pro Leu Pro Phe Asp Leu Leu Phe Pro Ile 500 505 510Trp Lys Pro Ile Pro Gln Thr Ser Pro Val Asn Leu Ala Ile Leu His 515 520 525Thr Gln Pro Thr Pro Asn Ala Ser Gly Gly Leu Leu Pro His Lys Ser 530 535 540Ala Tyr Pro Gln Glu Tyr Phe Leu Ser Val Val Pro Gln Pro Ile Val545 550 555 560Val Met Ala Leu Gln Asn Leu Cys Ala Ser Ser His Ser Cys Pro Leu 565 570 575Pro Ser Thr Leu Asn Lys Ile Tyr Thr Val Met Ser Tyr Leu Ser Pro 580 585 590Ala Thr Lys Glu Tyr Pro Phe Phe Leu Leu Leu Glu Gln Glu Cys Val 595 600 605His Val Leu Ala Leu Ala Val Ser Gln Pro Leu Leu Ser Ser Thr Thr 610 615 620Asn Tyr Leu Lys Asn Met Gly Thr Trp Asn Gly Ser Pro Thr Pro Trp625 630 635 640Ser Pro Cys Lys Ile Asn Leu Thr Pro Gln Gln Ser Phe Lys Ile Glu 645 650 655Glu Leu Thr Cys Pro Leu Lys Glu Gly Glu Pro Val Tyr Phe Gly Lys 660 665 670Asn Ala Val Ile Met Leu Ile Asn Pro Glu Ser Ser Leu Arg Lys Leu 675 680 685Lys Lys Phe Glu Ile Glu Tyr Asn Val Glu Gln Arg Ser Phe Glu Thr 690 695 700Leu Asp Pro Gly Ala Ser Ser Ala Asn Gly Cys Pro Gly Phe Ser Pro705 710 715 720Ser Asp Leu Gln Leu Tyr Cys Tyr Ser Ser Leu Asp Pro Val Ser Leu 725 730 735Thr Ser Leu Leu Thr Leu Ser Leu Pro Glu Ser Lys Leu Asn Tyr Lys 740 745 750Trp Ser Pro Arg Cys Ser Pro Arg Leu Arg Ser Thr Ala Asp Pro Trp 755 760 765Thr Gly Leu Leu Ala His Asp Leu Met Leu Met Thr Ser Lys Ala Pro 770 775 780Leu Leu Arg Lys Ser Gln Leu His Asn Leu Tyr Tyr Ala Pro Ile Gln785 790 795 800Gln Glu Ala Val Arg Ala Val Val Gly Gln Pro Pro Gln Gln His Leu 805 810 815Gly Phe Pro Val Glu Met Gly 820139809PRTHomo sapiens 139Ala Pro Ser Gly Glu Gln Arg Thr Gly His Tyr Pro Gly Glu Asp Trp1 5 10 15Gln Leu Ile Leu Pro Thr Ser Pro Asn Leu Arg Asp Phe Ser Ile Tyr 20 25 30Ser Gly Ile Leu Ser Arg Val Gly Gln Arg Pro Ser Pro Val Gly Gln 35 40 45Lys Arg Pro Lys Arg Arg His Phe Met Lys Phe Pro Asp Ser Asp Phe 50 55 60Pro Glu Ala Tyr Arg Val Thr Ile Ala Leu Leu Ser Arg Pro Gln Pro65 70 75 80Arg Glu Tyr Pro Arg Arg Val Tyr Asp Ile Thr Tyr Thr Ala Pro 
Glu 85 90 95Gly His Ser Pro Gln Gly Arg Ser Arg Lys Met Lys His Ser Lys Asp 100 105 110Ile Lys Ser Lys Pro Arg Lys Pro Thr Ser His Gly Leu Leu Cys Cys 115 120 125Leu Pro Lys Glu Ser Ala Thr Phe Pro Lys Lys Gln Asp Leu Ala His 130 135 140Thr Lys Cys Cys Met Glu Gly Pro Ser Pro Met Thr Leu Cys Leu Thr145 150 155 160Gln Asp Ser Gln Leu Ser Cys Arg His His Leu Leu Ser Gln Ile Ser 165 170 175Thr Ser Ser Asn Ile Thr Arg Asn Leu Ser Leu Arg Arg Gly Lys Arg 180 185 190Thr Ile Pro Pro Leu His Gly Ile Ser Gln Val Pro Ser Leu Phe Pro 195 200 205Ile Pro Arg Tyr Ile Leu Gly Arg Thr Leu Pro Ser His Phe Ile Tyr 210 215 220Pro Asn Cys Gly Ser Gly Trp Ser Gly Val Leu Asp Thr Ser His Leu225 230 235 240Ser Gln Ile Leu Asp Thr Ala Lys Gly Thr Lys Ser Arg Arg Gln Arg 245 250 255Leu Phe Leu Thr Ser Arg Gly Phe Ala Pro Ala Leu Gln Thr Thr Thr 260 265 270Arg Arg Lys Val Thr Lys Ile Ile Asn Pro His Gly Pro Pro Leu Ser 275 280 285Tyr Phe Ser Leu Tyr Cys Ser Phe Thr Leu Phe His Ser His Cys Thr 290 295 300Pro Ser Met Pro Leu Tyr Asp Gln Leu Pro Leu Pro Arg Val Ser Met305 310 315 320Glu Asn Ala Ala Ser Arg Lys Tyr Cys Pro Ile Val Glu Ser Phe Gly 325 330 335Asn Pro His Leu His Cys Pro His Pro Tyr Ala Pro Gln Leu Leu Ser 340 345 350Leu Cys His Ser Leu His Ala Cys Lys Tyr Ser Leu Leu Asp Arg Lys 355 360 365Asn Asp Ser Leu Ser Trp Arg Thr Trp Ser His Cys Leu Leu Asp Leu 370 375 380Leu His Pro Asn Trp Tyr Val Trp Gly Trp Ser Ser Arg Ser Gly Lys385 390 395 400Arg Lys Thr Cys Lys Arg Ser Asn Leu Pro Thr His Pro Gly Thr Trp 405 410 415His Leu Pro Leu Gln Arg Thr Arg Ser Leu Lys Thr Thr Asn Pro Pro 420 425 430Tyr Pro Tyr Ser Pro Gly Lys Pro Ile Tyr His Pro His Trp Ala Pro 435 440 445Gly Leu Gly Pro Lys Pro Tyr Leu Leu Asp Met Pro Pro Pro Glu Leu 450 455 460Gln Ala Ile Cys Phe Asn Pro Cys Thr Thr Met Glu Gln Leu Gln His465 470 475 480Arg Asn Lys His His Phe Arg Phe Ser Arg Thr Ser Cys Phe Gln Ser 485 490 495Gly Asn Asn Pro Tyr Leu Lys Pro His Leu Cys Lys Ile Gln Tyr Tyr 500 505 510Ile His Asn Gln Leu 
Pro Met His Gln Val Gly Asn Ser Ser His Thr 515 520 525Asn Ser Leu Pro Thr Leu Arg Asn Ile Phe Cys Leu Trp Tyr Leu Ser 530 535 540Leu Ser Leu Phe Glu Trp Leu Phe Arg Ile Tyr Val Leu Pro Leu Ile545 550 555 560Leu Ser Ala Pro Tyr Asp His Leu His Thr Arg Phe Ile Gln Leu Cys 565 570 575His Ile Ala Pro Gln Gln Lys Ser Thr His Ser Ser Phe Cys Tyr Arg 580 585 590Ser Arg Ser Ala Arg Cys Thr Arg Tyr Trp His Trp Arg Tyr His Asn 595 600 605Leu Tyr Ser Val Leu Leu Gln Thr Ile Ser Arg Thr Lys Trp Gly His 610 615 620Gly Thr Gly Arg Arg Leu Pro Gly His Leu Ala Arg Ser Thr Leu Pro625 630 635 640Ser Ser Ser Ser Pro Ser Lys Ser Lys Ser Phe Arg Leu Ala Asn Arg 645 650 655Lys Arg Gly Asn Leu Phe Ile Phe Arg Gly Arg Met Leu Leu Leu Cys 660 665 670Ser Ile Arg Asn Arg His Glu Ser Arg Asn Ser Arg Ser Asn Thr Thr 675 680 685Ser Arg Gly Ala Ser Lys His Trp Thr Leu Gly Pro Pro Gln Pro Met 690 695 700Asp Ala Leu Asp Ser Pro Leu Leu Arg Thr Ser Ser Ser Tyr Asn Ile705 710 715 720Ala Thr Pro Leu Trp Thr Leu Tyr Leu Pro Pro Cys Leu Cys Leu Phe 725 730 735Gln Asn Arg Ser Cys Lys Thr Thr Asn Gly Ala Gln Asp Ala Val Gln 740 745 750Asp Asp Leu Pro Gln Thr Pro Gly Pro Ala Cys Pro Thr Ile Cys His 755 760 765Gln Arg His Pro Ser Gly Asn Leu Ser Cys Thr Thr Ser Thr Thr Pro 770 775 780Gln Phe Ser Arg Lys Gln Leu Glu Arg Ser Ser Ala Asn Leu Pro Asn785 790 795 800Ser Thr Val Phe Leu Leu Arg Trp Gly 805 1407140DNAHomo sapiens 140ttggtcttaa gaacacaaat gatatggctc caatgactgg aggaacacca gggtccttgg 60tctcacgctg atttagataa aacgactgtc aggcctctga gcccaagcta agccatcctc 120ccctgtgacc tgcacgtata catccagatg gcctgaagta accaaagaat cacaaaagca 180gtgaaaatgg cctgttcctg ccttaactga tgacattcca ccattgtgat ttgttcctgc 240cccatcttaa ctgagcgatt aaccttgtga aattccttct cctggctcaa aacctccccc 300actgagcacc ttgtgacccc cgcccctgcc cctaagagaa aacccccttt gattataatt 360ttccactacc cacccaaatc ctataaaatg gccccacccc tatctccctt cgctgactcc 420tttttcggac tcagcccgcc tgcacccagg tgaaataaac agccttgttg ctcacacaaa 480gcctgtttgg tggactctct tcacacggac 
gctcatgaca tttggtgcca aaacctggga 540taggaggact ccttcaggag accagtcccc tgtccttgcc ctcactctgt gaggacatcc 600acctacaacc ttgggtcctc agaccaacca gcccaaggaa cagctcacca atttcaaatc 660aggtaagcag tcttttcact ctcttctcca gcctctcttg ctacccttca aactccctct 720ctcactaccc ttcaatctcc ctgtccttcc aattccagtt ctttttcatc tctagtagag 780acaaaggaga cacattttat ccatggaccc aaaactccag caccagtcac ggacttggga 840agacagtctt cccttggtgt ttaatcactg cggggacgcc tgcctgatta ttcacccaca 900ctccattggt gtctgatcac ggtggggaca cctgccttgg tcactcaccc acattccctt 960ggtggtacgt caactgcaaa agcaggggac gcctgctttg gctgctcacc cacccccttc 1020tctgtgtctc tacctttctc tttaaactta cctccttcac tatgggcaaa cttctgccct 1080ccattccccc ttcttctccc ttagcctgtg ttcttaaaaa cctaaaacct cttcaactca 1140cacctgacct aaaacctaaa tgccttattt tcttctgcaa cactgcgtgg ctgcagtaca 1200aacttgataa tagctttaaa tggccagaat atggcacttt caatttctcc atcctacaag 1260atctagataa tttttgtgga aaaatggaaa aatggtctga gatgcctgac gtccaggcat 1320tcttttacac attggtccct ccctagtctc tgctcccaat gcgactcatc ccaaatcttt 1380cttctttctc tcctgtctgt tccttcagtc tccaccccaa gctctgagtc ctttgaatcc 1440tcctttgcta cagacccatc tgaactctcc cctcctcccc aggctgctcc tcaccaggcc 1500gagccaggtc ccaattcttc ctcagcctct gctcccccac cctataatcc ttttatcacc 1560tcctctcctc acactcagtc cggcttacag tttcgttctg tgactagccc tcccccatct 1620gcccaacaat ttcctcttaa agaggtggct ggagctaaag gcatagtcaa ggttaatgct 1680cctttttctt tatctgacct ctcccaaatc agttagcgtt tacgctcttt ttcatcaaat 1740ataaaaaccc agccagttca tggcccatct ggcaacaacc cttacaggct ttacagccct 1800agaccctgaa gggtcagaag gccgtcttat tctcaatatg cattttatta cccaatccgc 1860tcccaacatt aaataaagct ccaaaaatta aattctggcc ctcaaacccc acaacaggac
1920ttaattaacc tcacttcaag gtgtacaaga atagagtaga ggcagccaag tagcaacgta 1980tttgagttgc aattccttgc ctcaactctg agagaaaccc cagccacatc tccagcaaac 2040aagaacttca aaacacctga actgcagcag ccaggcgttc ctccaggacc acctccccca 2100ggatcttgct tcaagtgccg gaaatctgac cattgggcca aggaatgcct gcagcccagg 2160attcctccta agccacgtcc catttgtgca ggaccccact ggaaatcgga ctgtccaact 2220cacccggcag ccaatcccag agcccctgga actctggccc aaggctctct gactgactcc 2280ttcccagatc ttctcggctt agcagctgaa gactgacact gcccgatcac ttcagaagtc 2340ccctggacca tcacggatac tgagcttcag gtaactctca cagtggaggc taagtccatc 2400ccctgtttaa tcgatacagg ggctacccac tccacatcac cttcttttca agggcctgtt 2460tccctttccc ccataactgt tgtgggtatt gacggccaag cttcaaaacc ccttaaaact 2520cccccactct ggtgccaact tggacaacat tcttttatgc actctttttc agttatcctc 2580acctgcccag ttcccttatt aggccgagac attttaacca aattatctgc ttccccgact 2640attcctgggc tacagccaca tctccttgcc gcccttcttc ccaacccaaa gcctccttca 2700tatcttcctc tcatatcccc ccaccttaac ccacaagtat gggacacctc tactccctcc 2760ctggcaaccg atcacacgcc cattactatc ccattaaaac ctaatcaccc ttaccctgct 2820caatgccagt atcccatacc acaacaggct ttaaagggat tgaagcctgt tatcacttgc 2880ctgctacagc acgggcttct aaaacctata aactctccat acaattcccc cattttacct 2940gtctaaaaac cagataagtc ttacaggtta gttcagaatc tgcaccttat caaccaaatt 3000gttttgccta tccaccctgt agcacccaac tcgtacactc ttttgtcctc aatgccttcc 3060cccacaactc actattccgt tcttgatctt aaagatgctt ttttcactat tcccctgcac 3120ccctcatccc agcctctctt tgcttttacc tggactgacc ctgacaccca tcagtcccag 3180cagcttacct gggctgtact gccgcaaggc ttcagggaca gccctcatta cttcagccaa 3240gctctttctc atgatttact ttctttccac ctctctgctt ctcaccttat tcaatatatt 3300gatgaccttc tactttgtag cccctccttt aaatcttctc aacaagacac cctcctgctc 3360cttcaacatt tgttctccaa aggatatcgg gtatccccct ccaaagctca aatttcttct 3420ccatctgtta catacctcgg cataattctt catgaaaaca catgtgctct ccctgccaat 3480tgcgtctcca actgatctct caaatcccaa cctcttctac aaaacaacaa ctcctttccc 3540tcctaggcat ggttggatac ttttgccttt ggatacctgg ttttgccatc ctaacaaaat 3600cattatataa actcacaaaa ggaaacctag 
ctgaccccat agattctaaa tcctttcccc 3660actcctcttt ccattccttg aagacagctt tagagactgc tcccacacta gctctccctg 3720tctcatccca acccttttca ttacacacag ccgaagtgca gggctgtgca gtcggaattc 3780ttacacaagg accgggacca tgccctgtag cctttttgtc caaacaactt gaccttactg 3840ttttaggctc gccatcatgt ctccatgcgg tagcttccgc tgccctaata cttttagagg 3900ccctcaaaat cacaaactat gctcaactca ctctctacag ctctcacaac ttccaaaatc 3960tattttcttt ctcacacctg acgcatatac tttctgctcc ccggctcctt cagctgtatt 4020cactctttgt tgagtctccc acaattacca ttcttcctgg cccagacttc aatctggcct 4080cccacattat tctggatacc acacctgacc ctgatgattg tatgtctctg atctacctga 4140cattcacccc atttccccat atttccttct tttctgttcc tcatgttgat cacatttggt 4200ttactgacgg cagttccacc aggcctgatc gccactcacc agcaaaggca ggctatgcta 4260tagaatcttc cacatccatc attgaggcta ctgctctgcc cccctccact acctctcagc 4320aagccgaact gattgcctta actcgggcct tcactcttgc aaagggacta cacgtcaata 4380tttatactga ctctaaatat gccttccata tcttgcacca ccatgctgtt atatgggctg 4440aaagaggttt cctcactacg caagggtcct ccatcattaa tgcctcttta ataaaaactc 4500ttctcaaggc tgctttactt ccaaaggaag ctggagtcac acactgcaag ggccaccaaa 4560aggcgtcaga tcccattact ctaggaaatg cttatgctga taaggtagct aaagaagcac 4620ctagcgttcc aacttctgtc cctcatggcc agtttttctc cttcccatca gtcattccca 4680cctactcccc cattgaaact tccgcctatc aatctcttct cacacaaggc aaatggttct 4740tagaccaagg aaaatatctc cttccagcct cacaggccca ttctattctg tcatcatttc 4800ataacctctt ccatgtaggt tacaagccac tagtccacct cttagaacct ctcatttcct 4860tccatcgtgg aaacatatcc tcaaggaaat cacttctcag tgttccatct gctattctac 4920tacccctcag ggattgttca ggccccctcc cctccctaca catcaagctc ggggatttgc 4980ccctgcccag gactggcaaa ttgactttac tcacatgccc tgagtcagga aactaaaata 5040cctcttggtc tgggtagaca ctgtcactgg atgggtagag gcctttccca cagggtctga 5100gaaggccact gcagtcattt cttcccttct gtcagacata attccttggg ttggccttcc 5160cacctctata cagtccaata acggagcagc ctttattagt caaatcacct gagcagtttt 5220tcaggctctt ggtattcagt ggaaccttcg taccccttac tgtcctcaat cttcaggaaa 5280ggtagaatgg actaatggtc ttttaaaaac acaccccacc aaactcagcc tccaacttaa 
5340aaaggaggat agagcccaaa aactcgcaac caagctagta attatgctga acccccttgg 5400gcactctcta attggatgtc ttaggtcctc ccaaatctta gtcctttaat atctgttttt 5460ctccttctct tattcggacc ttgtgtcttc cgtttagttt ttcaattcat acaaaaccgc 5520atccaggcca tcaccaatcg ttctatacaa taaatgctcc ttctaacaac cccacaatat 5580cgccccttac cacaaaatct tccttcagct taatctctcc cactctaggt tcccatgccg 5640cccataatcc ctctcgaagc agccctgaga aacatagccc attatctctc cataccaccc 5700ccaaaatttt tgctgcccca acacttcaac actattttac attatttttc ttattaatat 5760aagaagacag caatgtcagg cctctgagcc caagccatca tatcccctgt gacctgcaca 5820tatacatcca gatggcctga agtaactgaa gaatcacaaa agaagtgaaa atggcctgtt 5880cctgccttaa ccgatgacat tccaccactg tgatttgttc ctgccccacc ttaactgagc 5940aattaacctt gggaaattcc ttctcctggc tcaaaacctc ccccactgag caccttgtga 6000cccctgcccc tccactaccc acccaaatcc tataaaatgg ccccacccca tctcccttag 6060ctgactcctt ttttggactc agcccgcctg cacccaggtg aaataaacag ccttgttgct 6120cacacaaagc ctgtttggtg gactctcttc acagggacgg gggtgacaac aacacggaca 6180cacatggagt ggttttaagg agcagagagt ttaatacgca aaaaagaagg aagaggctcc 6240cctgtacaga cacagaggga gggggctcca agccgagaga aggaaacccc atgtgcagtg 6300gaaaagtggt tgattatact gggaggctgg aggaggcggt gtctgatttg cacagggccc 6360aggggattgg gttgaccagg tgtatcattc atgtaccccg caaaaaacct ggccctccca 6420cctcagccct ttaatatgca aatgtgggtt gccatgatgt tctgaaaaca catgaattat 6480ctggaggggg ccatgacact tggtacatgt gctgacaaga agagggtggg aatcgccatg 6540gtggccatgt tgggtggacc tagtttttaa tagcctgcat ttgcatatca aagtttgctg 6600gcctggctct ttaagctgtc ttttctgtta gaaaaggaat ggtttggaat gggtgagggt 6660tgcttcttat tacaagaaaa tttccaaaaa cctttactct ttctagctgc caaaaaacta 6720tttcttaata acttatgtat taccataatt aggcagcacc aaagatccct gcaggtcaga 6780ccactgcaat taacatgctg gctttactgc tgattatggt agctgcatcc acctagcctc 6840tcatattgca actgcctgac ctctgccacc ccacgagcca cttatcccca cttataatca 6900gcccatttcg attgtaacat ctgccactta ttcccgacgt tgtggtatat cctatagatg 6960aattcattca acatccattc caacaccacc tctcttgcct tcctatactc tctggagagt 7020gaattactga gtcacatgat cttcactgca 
gtcatttgtg gctatgtgac atagttctgg 7080acagtgaaca tagacagaag tccctggggc gggcttcctt tctgggatga gggcaaaacg 714014115000DNAHomo sapiens 141gatctcttga tcccaggagg tcaaggctgc aatgagctaa gatcaagcca ctgcattcca 60gcctgagtga tagtgggaga ccttgtcttt aaaacacaca cacacacaca cacacacacg 120agggcctttg accactcttg agtagaagac tcgagaagaa caaagtagaa ggccagagaa 180gaacaaagtt acttgaaaga tctcttatta aagagaatgt acaagctatg aaaaaaaaaa 240aacacacaca cacacacaaa cctcatctgg aatgaaaaaa acataatgca tttggtttct 300ggttccttag gctgttatgg aacaaccaaa gaacattatt ttggtttctg aggtcagaac 360tattttattc ccctcaagca cactatgctt atggtttgag ggagaatgag aaataggaaa 420ctaggaacag gctgaaatgg tctaatcttg accatctaat tctgcagtgt cttattctca 480ttctaaaaga gaatggttat attcgctgtt ctagcataaa aagtaatgat aaaaataaaa 540gatcccgtat taccagacaa taatccccta gactgtttta atgcttggtt gagtatttgc 600ttatgatctc agactttaaa agatggtctc cccctatggt gaagcttgtt aattatgtag 660gcatcattaa tgtctgttta cttatcaaaa ttttatcatt gttagttgta ttactacttg 720acagtccaat ttatttaatt gaaaagattg gttaacattt tatagtcaaa gtaattgttt 780cctgtgtttt ttcctgttta ggttattgga gtgatgagta aagaatacat accaaagggc 840acacgttttg gacccctaat aggtgaaatc tacaccaatg acacagttcc taagaacgcc 900aacaggaaat atttttggag ggtaagtaag ggaaatttct tcagacccat taaatgttag 960gaaaaaatgg agctaaaaga gctgggtggc tcacctttct catcctgtgc tgagaaatgc 1020tggggctcac ccataagtat ccagcatccc catggacaca gggaattctg aacaaatgtg 1080atgaaaccga tgaaatgtct ggcctgtagg tggttagtga tggagatacg ggctatatgt 1140gaatcttgat ttttgcaatt cattagagct ttgtaatgaa aggaaacagt ttgttgcttg 1200ctttaaggat aggttcattt gcatttctcc gcaaggaagt agtaatgagt taccaagcct 1260tagatttcac ccctttttga tttcttgctg acttaacttt aattgaatgg aagagttatc 1320acaaatgaat tatctttttg gttttttttt ttttgagatg gagtctcact ctgtcaccag 1380gctggagtgc aatggcatga tctcggctca ctgcaacctc cgcctcccag gttcaagcaa 1440ttgtcctgcc tcagcctccc gagtagctgg gactaaggtg cgcgccacca tgcccagtta 1500atttttgtat ttttagtaga gacggggttc cactatgttg gccatgatgg tctcgatctc 1560tggacctcgt gatccgccca ccttggcctc ccaaagtgct ggaattacag 
gcaagagcca 1620ccgcgcccag ccaggaatga caaatgaatt accttataag taaatgccat taaggaagga 1680tagctggaag atgggttgag gggaatggag gaccacagaa ctagtcctat ttaaatacat 1740gtgcatggta aaatgattcc atttgacaat aggttaatta tctcatagca taaggaaaat 1800gcttaacagt catatgcaag atgataagct ttcctatagc atccaaccaa aagatctagc 1860cagtacaatt tcctttgcta tattagggtt agaaaggccc ccagaggtga accaattaga 1920tggaatcctt gaataaaaca ctggattagc agtgaacaga aaaaagtcag attgctttcc 1980ttcttcccat agatgtctca gggatattta gtttcctcag aagataaaga atttagtaag 2040cgtttttttg tgcatactta catgaaatgt acattatttg aattctttaa aaagaaacag 2100ctgcatgata acaaaaattg tgttatgctt gctttagctg gtatttttgc ctagaacgat 2160tatatcgttc ggacaagaag ctattcctaa gaaacaatat ttttaatcca ggaagttttt 2220catttttaga aatttatctt actatttccc aagcaaaaga gggtagttac agattcacta 2280agaatcatgt gctcacaatt tttatttaat aattattcct ccttaaaata tattaatcac 2340ctgacttaca atggtggaac catgagtgca tttttgcctt tattgtcaat aacgtcttct 2400cagaagtgag ccacaaaggt gcatagttct tggagttaaa ggtctgaatt aagacaatcc 2460agcataagtc tcattaatgt gtgattattt tgagaaaagg caagaagtac ctaagaatct 2520ccccctcact gtccagttcc ctgtttcatt taaagattca ctgtaagtaa ctgaaaggct 2580ttccttggga ggatttattt gaatcagtct ttcacatgca aaggatattg tagaacatct 2640cgtttttgct ggcaggaata tgaacatctg ttgtgaggaa agaaaaagtt tcatgcaaat 2700tacactgcca aagaagggat gttcaagttg agaaaccagt gacatttctt gtaactgtac 2760tatgaatcag cgcattttaa tcttctagat aatatatgga agtgcaggaa ggtggtagga 2820aacggtgttc attttacata tgcgttattt tattctgtgt gagtgacttc atggcaccga 2880cattgctgtt tttaaatgag gatacagtaa attgcagtcc gaggaaggct aactggaatc 2940aacatacccg tagctttaga aagcagtttc cgcaccagcg aagagtacaa gagcgatgga 3000accccatgtt cctggaagtt tgcacatcag agtaaacaaa cttgaaaacc cctcttgata 3060gcagaattca cccagccttg ttccattttc tcttaacaaa acacaccgca aaagctctca 3120caagctgctt tgatgaagcc acatgtattt cccccttcac aatttacagg aagttactct 3180taaaagaaag tgattctggt gtttaccgcc tgtgttaaag ggacagagtt cctttttatt 3240tctgataacg tttgagcgaa atacagaaac tatctgtaga ctagcatagt cggtacgtga 3300gtaaggaaaa gcaataacct 
gctgtccggt gagcacaaaa ttcctgctac gaacagtgcc 3360ttactgctgc ttggagactg caagtcgcag atcacactag gtattgactg attgtataag 3420gaaatttctt aaagtctaaa gtaaaggtgg tacctcctaa aaagagggga agagagaaaa 3480ctttgtgtgg aaggataagg agtgtgttta tagtttcagt aagagtgtac gttttaattt 3540ttcttcttcc tctgcctctt tgccaagtag cctgagtgca tctgttatcc agaagtagta 3600ttactctagg acaaacttca aattcttcat tctgcgttgc ctttaaggaa caacatactt 3660tcttcctgtt ctttttccaa aaacacacgc ctatggctct gtgtgtggtg ttttagccag 3720cctcctccca gataaggggt tcccttccct cctttgcatt gaaaggaaag tgcaagtctg 3780gacatgttta tcaagaggaa aagtgacttc tcagtaatag actgtcaaat tcgggctgct 3840gcccgagtgt tcgctttgtt atggcaggtg aagttcacct ttgccccacc cagtgtttcc 3900acaaaaaggc aaggttccaa gtattcatat gaacaagtgt tactttagga cttggagggt 3960tgggggtgga ggatgtttgc atagttgaag ccttgggcgg gggtgtagga aacggcgagt 4020acagaggcca tagaaaaagc taagactcag tttgacgtcg tcagccggct tggtcttcta 4080cccagtgact caaagcacta aaagtcagca taatcggaac tgaagtcagt agcatcgccc 4140atttgccatt cactgcagta gcaaaagtag tactctgtgg tgggttaatc ggtttgaggc 4200agctccttaa atgaacattt gtgtttcatt tttctgttat tttcccgaac atgaaaagac 4260gataaaactg aaatggaaaa ggtaactgac aaaagtgtgc cttacctgtt tccgccctga 4320tttctgctga ttcaagacta ttctggctaa actgattgga ttctttttct aactaggcag 4380taggggatca gaaatcacac acggtaccgg ctgtgtttat tctgagaggt gctggggagc 4440tttgggtctg acttcctttt acatgcctgt cttctctttt ggacagatct attccagagg 4500ggagcttcac cacttcattg acggctttaa tgaagagaaa agcaactgga tgcgctatgt 4560gaatccagca cactctcccc gggagcaaaa cctggctgcg tgtcagaacg ggatgaacat 4620ctacttctac accattaagc ccatccctgc caaccaggaa cttcttgtgt ggtattgtcg 4680ggactttgca gaaaggcttc actaccctta tcccggagag ctgacaatga tgaatctcag 4740taagtggatt acagaacaaa aaaataaaaa atgccagtaa tgtcggttct gcccctttga 4800actaataaca tgttgtttaa ttatacggct ttgtcatgtg ttggatgaag taggtggctt 4860aagctaggga ctaggaagag gaaaaacatt ttttgagtcc ctattaacta ttaggaaact 4920tgatcattta aaagtatata tatatatgag gagctacctt gagttttgaa ttcaggatgt 4980tacaggaaga aatatatgtc caattctaat ttatccaaaa gcagttggga 
gaattacagg 5040gattggtcca gacatgctgc gtatgcaagg tatagccctc atctgtggta ctttggcagg 5100gcttagactg catcaaaata tttatagatg tacatttgag tgtacagtta ggatctgatg 5160tggaacattg taagatcatt gctagaaaaa ctttgtcata atttttcaat attattctaa 5220gtgaataacc gtaaagattt tacatcttag cttccttcct tacagtaaaa aaactatctg 5280atctcttgat cagtattata gtagccacct atcactttat cttaacaaat tctcaattcc 5340ttaggtttat gtgcttttac ttcttttatt tgattaaaat tgctgtcatg acctctctct 5400gcagagggct gcatcatttt ggtcattctc aagtgatctc tttgagcaat ttaagaattg 5460ccataagatt ctaacctctg ctgtaactat ggttgtgtgt tcttggttag accactaaat 5520cttattagca gttttaaaaa ttattccttt tggtttagaa gttaagacta aatgctgaag 5580tttttgtaac ttttggtttt gatatcattt caaacttaag aaaacatttg aagaaaagga 5640caaagaattt ccacttaccc tttacccagg tttaccagtt attgataagt atatccattt 5700gctttaccag aaggctaact tgttttagtt ctcattttca cctttgagac atttggaata 5760aatatcaatg ttaacataaa ttggaatttt gactttgatt ttaggaccaa tgaacaagcc 5820aagtacttac cctagtcata tataatccaa ctgtatggtt atttggtatt cattccacac 5880ttcattttac ttgatctccc ttaagattgc aagattgtgt ttgcagtttt tctgaaaatc 5940tggggctata aaagcatcag gacctccccc gtaggggagg tcgtgtgttt ggggtcctta 6000cacaacaggt tacccttgag cttcaggaaa agaactggct ctcagttccc cagttccagc 6060ttaatgggtc taattaggtc ctgaccaaaa aggtggcagt tcttttccct catgtctctt 6120cagcgctccc cgagactctg gagactctgt catatcccta gggctgagcc tcccaggaac 6180cattcggctg ttgtggcatc tgtgtatgcc atgcccagtg ctgaggacct agtaacaaac 6240gacaaatgca caggcacagt ggcatttttg tggaactcgt attccagctg tgcgtctcag 6300aagaagcgca cagctccctc ctggctttct taacatagtg agccacttcc acttaagggt 6360ctccttacat tccttgagtt taatcattca tggattcaga ggaaagtctt ttgatttttg 6420cttttcttta aacagttcat ttgaggtgac ctaccccagt gactttgcac caaccaccaa 6480gaaacttttt tgcatgcttc ccgcaccctg tgccaatcaa gggaagggtt taaaggcctg 6540gcgtttttat tcctcaaaga aaggttttgc acagtatttt aaggttcaag tgcttctact 6600ttgtgttcag aagcaactgt catatatact gtgaaatgac accttttatt tatccctttt 6660tatttatgca gtatgtcccc ttttattttg gcagaatttt ttctaaatgg tggtttaaca 6720ttttcaagca catttcattg 
tccaatattc atagtaaaga atgagagtta acaataacca 6780gtcacattaa aacaagattc ctgctgccag ttgtgaaacc ggttgtctta ggcgtggcag 6840ctgatgattg agactgtgat caggaaaatt tccactattt catcaggcct aataggtaga 6900ttgtgtctcc aaatgaactg tgttgggttt ccatgcttaa agcacaatag aggtggtgca 6960agaatctcca tgagggctta aatggcagtg atggttcagg cggtagagtt tggagaagaa 7020gggatttgaa acaaaccaaa ggaaagaaaa gtaagtagcc agaaatcaca aaatggcatt 7080tttctaaaaa caaaggaaaa ggaataaaag aactaataag tttgaaaccc ctacccctcc 7140caaatttggc agggggggag gtattttttt tctatctatc taactaaccc atctagaaaa 7200cagttgacca aattatagac ttctaaatgt taatctgctt tctcagtttc agttgaaaag 7260agactttgtt ttgcctactg cagaacttct aggttctttc ttatagtctt ggggttctta 7320ttatagatcg aaaatgtgag tcggcataat taagccattc ggagtcttca gaagcagttc 7380actcttgaaa tgactccgtc cgcctacagc catttaagat ttcagaacaa aaacagatct 7440tgattttctt tttcatgtta actcaagctg ttgctgagtg ggagagtcag aaatgacacc 7500agctccactg attactcagc tgctgaagga tgatttttta aaatgcacct ttactgtata 7560tggacttcct aatttccacc tgtagagcat cttagggagg ctaacatgtc actctggatg 7620ttcttttaga ataagatgca aatctatttt tctgaaggca ttagagatag caaacattta 7680ttgtgagttt actatatact aggcactgtg ctaagtgttt tgcatagaaa gtttaaaatt 7740ctggcttttt tgttggccca atcataagtt tcatatcagt tcaacattca aattatatta 7800aggtacttaa gaagaatccc tggctaaatg tgaggggcag tgccacagat ggactgaaac 7860tttatgctta ttgcacattt atgctattat tatttgttga attatagaac caagggagtg 7920tggaagccac tggaaaaaat atgagactta gatacataat ttgagtaaaa atggctcaaa 7980gtcatgaggg taaagttttt tgtatttcca ttttattcga gcggcatcgt ttttaaaaat 8040cattatgaat ttgaccctat atagatgttt ccaaataatt ctttttcacc ttcataaaat 8100tccttcctgt ggctgtgaga tgccttgcct atcagttttc aagcttagtt gtctttctca 8160tcctttacca ttttagcttt aaaaaacaaa agtgacaatt agaacttcct gcctgctggg 8220cctcactgaa agaccgatat tggcctgata aggagatatt tattttgttt tagtggcttc 8280agaaatccct ctccctcagc aagctttcca tcacggcccc cccgtcagca tcttccctga 8340tagcgttctt ctctgtgttt attctggggc ttcaggctcg cccaggagga actgataacc 8400gctggcagga gataacattc tctaaggggc tctcaaattg gaatcgaatc 
cctcaagcca 8460gtcagcctag agaatacatt taaagggttc agttctggag tttcacagag ttcatttcta 8520gacctatcag atagcaagtg tggagttctt tctcaactaa attcaagcag agacattttt 8580tagacgatga aggatatttg cacaaaggct tcagcatgat cccccaaacc tgctgcctct 8640gaaggcatct ccacacattg acagccaatg ccttcagtgc gttcctaggg caggtgtcct 8700ggcttgagtg actgtcctcc aataatcaga gctcaaacta aacatcgtat gttttacttt 8760tggtttccag gcaaggctga gcagggaatt ttcagttttc cctgcccaga tgggtgtttt 8820ttcctgaagg catcatttat tgtgtagcga ggagacaggg ctggctgtgg cagggatagt 8880ctagaactgt cctcattgct gctgttccta aatagtatct ttaccaagta ataacgtgcc 8940gtctttggga ataagtgctt tcctcttagc ctgttctgtt ttcttgggtg cgctaagtaa 9000ttgaactggc tcaggaagta cctattgtgg tttggcagag gtgactgtca cgccttgtga 9060ctccaggggc cagcactgct gggatcctgg ctagaccaga cagagccttg gtgaagtgct 9120taggctgtct gcacatcgcg aggaaggtgg tattcacttc gctaagctcc ttggcatagg 9180cagtttgaac agggctttat caaattcgta ttcaacaaga gtagaagcga aaattgatga 9240ctgtgtatta cttgaaatga gtcttaatct ttcacattta gttctcaggg tatgctgatt 9300tcctttaggt aaaccatgaa catcagaaag acttttatta acctatgaca gggtccccac 9360cccagtattt ttccactcca ttaaaatgga agtttttttt ttttttttct tttttgagac 9420agagttttgc tcttgttgcc cagtctggag tgcaatggca caatctcggc tcaccacaac 9480ctccacctcc cagattcaag cgattcttct gcctcagcct cccaagtagc tgggattaca 9540ggtgtgcgcc accacgccca gctaattttg tatttttagt agagatgggg tttctccatg 9600ttggtcaggc tggtctcgaa cttccgacct caggtgatcc gcccacctcg gcctcccaaa 9660gtgctgggat tacaggcaag agccactgca tccagcttag gctatcttac tccagcctaa 9720acagcaattt tctatcataa ggtctgtact aatgaaaaca gaatcaccca aggctgctgt 9780ttgttctgtc tgtgctgcca ttgtccgcat
tttgctgagg aggaaacgga actgcacttt 9840tgagtgagtg gcccagagcc ttctagaatg agagtgcgtt ggaagccaga tatgtggcga 9900ttgtgtcgcc agctgttact caggttttct caagaaggag gagcaacttt ggcagttttg 9960cttcagttct ctctagccct ctgtgtaatc gccccttttt ctttatttca gcacaaacac 10020agagcagtct aaagcaaccg agcactgaga aaaatgaact ctgcccaaag aatgtcccaa 10080agagagagta cagcgtgaaa gaaatcctaa aattggactc caacccctcc aaaggaaagg 10140acctctaccg ttctaacatt tcacccctca catcagaaaa ggacctcgat gactttagaa 10200gacgtgggag ccccgaaatg cccttctacc ctcgggtcgt ttaccccatc cgggcccctc 10260tgccagaaga ctttttgaaa gcttccctgg cctacgggat cgagagaccc acgtacatca 10320ctcgctcccc cattccatcc tccaccactc caagcccctc tgcaagaagc agccccgacc 10380aaagcctcaa gagctccagc cctcacagca gccctgggaa tacggtgtcc cctgtgggcc 10440ccggctctca agagcaccgg gactcctacg cttacttgaa cgcgtcctac ggcacggaag 10500gtttgggctc ctaccctggc tacgcacccc tgccccacct cccgccagct ttcatcccct 10560cgtacaacgc tcactacccc aagttcctct tgccccccta cggcatgaat tgtaatggcc 10620tgagcgctgt gagcagcatg aatggcatca acaactttgg cctcttcccg aggctgtgcc 10680ctgtctacag caatctcctc ggtgggggca gcctgcccca ccccatgctc aaccccactt 10740ctctcccgag ctcgctgccc tcagatggag cccggaggtt gctccagccg gagcatccca 10800gggaggtgct tgtcccggcg ccccacagtg ccttctcctt taccggggcc gccgccagca 10860tgaaggacaa ggcctgtagc cccacaagcg ggtctcccac ggcgggaaca gccgccacgg 10920cagaacatgt ggtgcagccc aaagctacct cagcagcgat ggcagccccc agcagcgacg 10980aagccatgaa tctcattaaa aacaaaagaa acatgaccgg ctacaagacc cttccctacc 11040cgctgaagaa gcagaacggc aagatcaagt acgaatgcaa cgtttgcgcc aagactttcg 11100gccagctctc caatctgaag gtaggccttg agagagagca gtccaagggg ctgtgagtgc 11160atgcttgtgt ttgtatttag cttgctttcc atggggtatc gattgcattt gcagtagtat 11220gagcccccgg ttggggatag tgggtatgga ttccgcctgg cttttgccac ttctagctct 11280ttgactttgg acaagtgact tcccttctcc tgattttctt ctgaataata aaaaaattag 11340gggtttggac tagaagatta ggtgaaactc cctgctagcc tgtgattttt gtgcttttaa 11400gaaaaacacc attctgaaaa catgaagatt tcttcttttt aagactgtct tgatgctttt 11460cttaagatat ttgcatcaac acttgagtct tggagcagaa 
atgttaggtc tcagagccag 11520cttgagagca gagctaacac atgtggcttc ttcccaggtc cacctgagag tgcacagtgg 11580agaacggcct ttcaaatgtc agacttgcaa caagggcttt actcagctcg cccacctgca 11640gaaacactac ctggtacaca cgggagaaaa gccacatgaa tgccaggtgc gcagtatttt 11700ctgggtagac cttctgacct ttgtagaaaa tgtctgtgag tcaccctccc atgtcctata 11760tagcccgtag ttaaagccaa caccagattc tgcgttgtcc catcctggac tgatggcact 11820atggtccttc ccagtacttt gtatctgctg atgacttgag atggcacagc cagcttccag 11880tgggtgggaa aatggtaggg gaaataaaca gcccctcgtg tgctgtgtgc ccacatcccc 11940ccgtttgctt aataccacac tggaggtgcc acaaggaggc ttctcacctc ctaggttgct 12000gggcgttggc cggtaagcct gcccctcccg ttggcaactc ttaatcttct ggccttcctg 12060tctcccttcc ctgctgtctc tctcccctac actgtaggtc tgccacaaga gatttagcag 12120caccagcaat ctcaagaccc acctgcgact ccattctgga gagaaaccat accaatgcaa 12180ggtgtgccct gccaagttca cccagtttgt gcacctgaaa ctgcacaagc gtctgcacac 12240ccgggagcgg ccccacaagt gctcccagtg ccacaagaac tacatccatc tctgtagcct 12300caaggttcac ctgaaaggga actgcgctgc ggccccggcg cctgggctgc ccttggaaga 12360tctgacccga atcaatgaag aaatcgagaa gtttgacatc agtgacaatg ctgaccggct 12420cgaggacgtg gaggatgaca tcagtgtgat ctctgtagtg gagaaggaaa ttctggccgt 12480ggtcagaaaa gagaaagaag aaactggcct gaaagtgtct ttgcaaagaa acatggggaa 12540tggactcctc tcctcagggt gcagccttta tgagtcatca gatctacccc tcatgaagtt 12600gcctcccagc aacccactac ctctggtacc tgtaaaggtc aaacaagaaa cagttgaacc 12660aatggatcct taagattttc agaaaacact tattttgttt cttaagttat gacttggtga 12720gtcagggtgc ctgtaggaag tggcttgtac ataatcccag ctctgcaaag ctctctcgac 12780agcaaatggt ttcccctcac ctctggaatt aaagaaggaa ctccaaagtt actgaaatct 12840cagggcatga acaaggcaaa ggccatatat atatatatat atatatctgt atacatatta 12900tatatactta tttacacctg tgtctatata tttgcccctg tgtattttga atatttgtgt 12960ggacatgttt gcatagcctt cccattacta agactattac ctagtcataa ttattttttc 13020aatgataatc cttcataatt tattatacaa tttatcattc agaaagcaat aattaaaaaa 13080gtttacaatg actggaaaga ttccttgtaa tttgagtata aatgtatttt tgtcttgtgg 13140ccattctttg tagataattt ctgcacatct gtataagtac ctaagattta 
gttaaacaaa 13200tatatgactt cagtcaacct ctctctctaa taatggtttg aaaatgaggt ttgggtaatt 13260gccaatgttg gacagttgat gtgttcattc ctgggatcct atcatttgaa cagcattgta 13320cataacttgg gggtatgtgt gcaggattac ccaagaataa cttaagtaga agaaacaaga 13380aagggaatct tgtatatttt tgttgatagt tcatgttttt cccccagcca caattttacc 13440ggaagggtga caggaaggct ttaccaacct gtctctccct ccaaaagagc agaatcctcc 13500caccgccctg ccctccccac cgagtcctgt ggccattcag agcggccaca tgacttttgc 13560atccattgta ttatcagaaa atgtgaagaa gaaaaaaatg ccatgtttta aaaccactgc 13620gaaaatttcc ccaaagcata ggtggctttg tgtgtgtgcg atttgggggc ttgagtctgg 13680gtggtgtttt gttgttggtt tttgttgctt tttttttttt ttttttttta atgtcaaaat 13740tgcacaaaca tggtgctcta ccaggaagga ttcgaggtag ataggctcag gccacacttt 13800aaaaacaaac acacaaacaa caaaaaacgg gtattctagt catcttgggg taaaagcggg 13860taatgaacat tcctatcccc aacacatcaa ttgtattttt tctgtaaaac tcagattttc 13920ctcagtattt gtgtttttac attttatggt taatttaatg gaagatgaaa gggcattgca 13980aagttgttca acaacagtta cctcattgag tgtgtccagt agtgcaggaa atgatgtctt 14040atctaatgat ttgcttctct agaggagaaa ccgagtaaat gtgctccagc aagatagact 14100ttgtgttatt ctatctttta ttctgctaag cccaaagatt acatgttggt gttcaaagtg 14160tagcaaaaaa tgatgtatat ttataaatct atttatacca ctatatcata tgtatatata 14220tttataacca cttaaattgt gagccaagcc atgtaaaaga tctacttttt ctaagggcaa 14280aaaaaaaaaa aaaaaaaaaa gaacactcct ttctgagact ttgcttaata cttggtgacc 14340tcacaatcac gtcggtatga ttgggcaccc ttgcctactg taagagaccc taaaaccttg 14400gtgcagtggt ggggaccaca aaacaaccag ggaggaagag atacatcatt ttttagtatt 14460aaggaccatc taagacagct ctattttttt tttgccactt tatgattatg tggtcacacc 14520caagtcacag aaataaaaaa ctgactttac cgctgcaatt tttctgtttt cctccttact 14580aaatactgat acattactcc aatctatttt ataattatat ttgacatttt gttcacatca 14640actaatgttc acctgtagaa gagaacaaat ttcgaataat ccagggaaac ccaagagcct 14700tactggtctt ctgtaacttc caagactgac agctttttat gtatcagtgt ttgataaaca 14760cagtccttaa ctgaaggtaa accaaagcat cacgttgaca ttagaccaaa tacttttgat 14820tcccaactac tcgtttgttc tttttctcct tttgtgcttt cccatagtga gaatttttat 
14880aaagacttct tgcttctctc accatccatc cttctctttt ctgcctctta catgtgaatg 14940ttgagcccac aatcaacagt ggttttattt tttcctctac tcaaagttaa aactgaccaa 150001423242DNAHomo sapiens 142gtctggactt gtggtgcgct gccagggatc cgcagcgttg ccggttgtat tcgctggata 60ccagagggcg gaagtgcagc agggttcagc tccgacctcc gcgccggtgc tttttgcggc 120tgcgcgggct tcctggagtc ctgctaccgc gtccccgcag gacagtgtgt caggcgggca 180gcttgccccg ccgccccacc ggagcgcgga atctgggcgt ccccaccagt gcggggagcc 240ggaaggagga gccatagctt ggagtaggtt tggctttggt tgaaataaga atttagcctg 300tatgtactgc tttaactcct ggaagaatga cagatgacaa agatgtgctt cgagatgtgt 360ggtttggacg aattccaact tgtttcacgc tatatcagga tgagataact gaaagggaag 420cagaaccata ctatttgctt ttgccaagag taagttattt gacgttggta actgacaaag 480tgaaaaagca ctttcagaag gttatgagac aagaagacat tagtgagata tggtttgaat 540atgaaggcac accactgaaa tggcattatc caattggttt gctatttgat cttcttgcat 600caagttcagc tcttccttgg aacatcacag tacattttaa gagttttcca gaaaaagacc 660ttctgcactg tccatctaag gatgcaattg aagctcattt tatgtcatgt atgaaagaag 720ctgatgcttt aaaacataaa agtcaagtaa tcaatgaaat gcagaaaaaa gatcacaagc 780aactctggat gggattgcaa aatgacagat ttgaccagtt ttgggccatc aatcggaaac 840tcatggaata tcctgcagaa gaaaatggat ttcgttatat cccctttaga atatatcaga 900caacgactga aagacctttc attcagaagc tgtttcgtcc tgtggctgca gatggacagt 960tgcacacact aggagatctc ctcaaagaag tttgtccttc tgctattgat cctgaagatg 1020gggaaaaaaa gaatcaagtg atgattcatg gaattgagcc aatgttggaa acacctctgc 1080agtggctgag tgaacatctg agctacccgg ataattttct tcatattagt atcatcccac 1140agccaacaga ttgaaggatc aactatttgc ctgaacagaa tcatccttaa atgggattta 1200tcagagcatg tcaccctttt gcttcaatca ggtttggtgg aggcaacctg accagaaaca 1260cttcgctgct gcaagccaga caggaaaaag attccatgtc agataaggca actgggctgg 1320tcttactttg catcacctct gctttcctcc actgccatca ttaaacctca gctgtgacat 1380gaaagactta ccggaccact gaaggtcttc tgtaaaatat aatgaagctg aaacctttgg 1440cctaagaaga aaatggaagt atgtgccact cgatttgtat ttctgattaa caaataaaca 1500ggggtatttc ctaaggtgac catggttgaa ctttagctca tgaaagtgga aacattggtt 1560taattttcaa gagaattaag 
aaagtaaaag agaaattctg ttatcaataa cttgcaagta 1620attttttgta aaagattgaa ttacagtaaa cccatctttc cttaacgaaa atttcctatg 1680tttacagtct gtctattggt atgcaatctt gtaactttga taatgaacag tgagagattt 1740ttaaataaag cctctaaata tgttttgtca tttaataaca tacagttttg tcacttttca 1800agtactttct gactcacata cagtagatca ctttttactc tgtgttacca ttttgactgg 1860tcgtcattgg catggggtgg atatagggca taggattact tgtctcagaa gctgtcatag 1920aatttcttgc tgccaattaa aaaacctgtg ttctttacac actacacgta taaatattgt 1980aactgttcat ctttgttgtt ttatcactgt aagcctgtca aatcatagta tcctaagcat 2040ctgtaaatgc taattttgca tttttggaaa aacccattcc ttccaagcta gtgtttttca 2100ttggctccag gtctaatttt tcactgtggt ccctggcagc cagtcttttg aagtttaaag 2160attacctgtc tcttgactgc agtacctttt ctttaatttt taccaaaaat atccagaggt 2220tactggagtt cttattcaat ataaggaaag tttgctgcac tttattacca agcctctggg 2280attttaccag tcaaacatat ttgtgcatta catttcattt cttgtgagct agctggctgt 2340ccatattgaa tgttgaccca tttgagtacg ctaaaaggct tacagtatca gacacgatca 2400tggttttaga tcccataata aaaatgratg tttttcttat aaaaaattat acaaatgctg 2460aagtgagatt ctactattgt tcattgcttc cttttctttt tccttttgcg attttcactg 2520attaatagca catttcttca caaaattaga taaagttggt caaagaccag atattctgga 2580atggaaattg taaagcttaa tcaaaaagaa tagccagtac agcatacaat ctcagaaact 2640tagaagcaag tagaaaataa ttggttgatg taaacgaaag tgccatttta gtaaaggcag 2700gaaaaaaata gcaatatttg agttatgtaa ggataaaaaa tccactgact tgtatttttg 2760cacaagaggc tggtctgaat atgattgttc acattaagag tgtttattcg tcggttcatt 2820ttggggattt tcccccttga tgttttgaca gattgaagtg agctttagtg agcaaaagga 2880tcagaatgca gggaacacta agctgtgatg aagaaagtgt ggtaaaaagc cagagtagtt 2940ttatacagac aaaaccagtg tcaggccttt gcagtaggct tgagtgaact tctgatctag 3000atttgaaagt aaattttatg aagacattgc ccatttttac ttcctcattc attattgtac 3060cagcatcata gctttattac tctaatccca ggtaagtcaa gcctacaatg ccctagagga 3120agagtaaaac cagaaattca tgctggctta aataatctat ttttgtttct tttcatttga 3180atatttaaat tttatggttt attaaaaaat taaataaaaa agaaaaaaaa aaaaaaaaaa 3240aa 324214313695DNAHomo sapiens 143cgccgcctgt gcagccgctg 
ccgccgccgc cgccgccgcc gccgccgccg ccgccgccgc 60cgccgctgcc gccccggctg ccgcgccgcg ccgctgcctc tgccccggcc gcccccgccg 120ccgctgccgc cgccggcccg cagccagcca ggcgggcggc ccagcccgcc tgagcccgca 180gcggctgccg ccgcagcgtc gggtcgctgg gtgcgcgggc taccgcggac cgagcggacc 240cgagtgggcg accaggcgct tgcccgccca gtgccactgc cgccgcttcc tcgccggagc 300acaggaccag acacctccag cgcccgctgc tgctgccgat gcggcccgga cacttttagc 360tgggcgggag ggctggagag ccgggggccg ccgagaaccg ccagcgagct gtgccgagag 420ccgcgccgac ccgctgcgat cagggacagg cgcccgcccg ccgccgccgc ctggccgcta 480tggatctatt cgactttttc agagactggg acttggagca gcagtgtcac tatgaacaag 540accgtagtgc acttaaaaaa agggaatggg agcggaggaa tcaagaagtc cagcaagaag 600acgatctctt ttcttcaggc tttgatcttt ttggggagcc atacaaggta gctgaatata 660caaacaaagg tgatgcactt gccaaccgag tccagaacac gcttggaaac tatgatgaaa 720tgaagaattt gctaactaac cattctaatc agaatcacct agtgggaatt ccaaagaatt 780ctgtgcccca gaatcccaac aacaaaaatg aaccaagctt ttttccagaa caaaagaaca 840gaataattcc acctcaccag gataataccc atccttcagc accaatgcct ccaccttctg 900ttgtgatact gaattcaact ctaatacaca gcaacagaaa atcaaaacct gagtggtcac 960gtgatagtca taaccctagc actgtactgg caagccaggc cagtggtcag ccaaacaaga 1020tgcagacttt gacacaggac cagtctcaag ccaaactgga agacttcttt gtctacccag 1080ctgaacagcc ccagattgga gaagttgaag agtcaaaccc atctgcaaag gaagacagta 1140accctaattc tagtggagaa gatgctttca aagaaatctt tcaatccaat tcaccggaag 1200aatctgaatt cgccgtgcaa gcgcctgggt ctcccctagt ggcttcctct ttattagctc 1260ctagcagtgg cctttcagtt caaaacttcc caccagggct ttactgcaaa acaagcatgg 1320ggcagcaaaa gccaactgca tacgtcagac ccatggatgg ccaggaccag gcaccggaca 1380tctcaccaac actgaaacct tcaattgaat ttgagaacag ctttgggaat ctgtcatttg 1440gaacactctt ggatggaaaa cccagtgcag ccagttcaaa gactaaactg ccaaagttca 1500ccatcctcca aacaagtgaa gtaagccttc ccagtgatcc aagctgtgtt gaagaaatct 1560tgcgggagat gacccattcc tggcctactc ctctcacttc catgcatact gctggacact 1620ctgagcagag caccttttcc atcccaggac aggaatcgca gcatctgacc ccaggattca 1680ccttacaaaa gtggaatgac ccaaccacca gagcttctac aaagtcagtg tctttcaaat 
1740cgatgcttga ggatgacctg aagctgagca gtgatgaaga tgaccttgag cctgtgaaga 1800ccttgaccac tcagtgcact gccactgagc tctaccaggc tgttgaaaag gcaaaaccta 1860ggaataatcc tgtgaaccca cccttggcca ctccccagcc cccacctgca gtgcaagcca 1920gcgggggttc tggcagctcc agcgaatcgg agagcagctc tgagtcggat tcagacactg 1980aaagtagcac cactgacagc gaatctaatg aggcacctcg tgtggcaact ccagagcctg 2040agccaccctc aaccaacaag tggcaactgg ataaatggct taacaaagtg acatcccaga 2100acaagtcttt tatttgtggc ccaaatgaaa cacccatgga gactatttct ctgcctcctc 2160caatcatcca accaatggaa gtccagatga aagtgaagac gaatgccagt caggtcccag 2220ctgaacccaa agaaaggcct ctcctcagtc tcattaggga gaaagcccgt ccacggccca 2280ctcagaaaat tccagaaaca aaggctttga agcataagtt gtcaacaact agtgagacag 2340tgtctcaaag gacaattggg aaaaaacagc ccaaaaaagt tgagaagaac accagcactg 2400acgagtttac ctggcccaaa ccaaatatta ccagcagcac tcccaaagaa aaagaaagtg 2460tggagcttca tgacccacca agaggccgca acaaagccac tgcccacaaa ccagccccta 2520ggaaagaacc aagacctaac atccctttgg ctcccgagaa gaagaagtac agagggcctg 2580gcaagattgt gccaaagtct cgggaattca ttgaaacaga ttcatctaca tctgactcca 2640acacagatca ggaagagacc ctgcaaatca aagtcctgcc tccgtgcatt atttctggag 2700gtaatactgc caaatccaag gaaatctgtg gtgccagcct gaccctcagc accttaatga 2760gtagcagtgg cagcaacaac aacttatcca tcagtaatga agagccaaca ttttcaccta 2820ttcctgtcat gcaaactgaa atcctgtccc ctctgcgaga tcatgagaac ctgaaaaacc 2880tctgggtgaa gattgacctt gacttactct ctagagtacc tggccacagc tcactccatg 2940cagcacctgc caagccagac cacaaggaga ctgccacaaa acccaagcgt cagacagctg 3000tcacagctgt ggagaaacca gcccctaagg gcaaacgtaa gcacaagcca atagaagttg 3060cagagaagat ccctgagaag aagcagcgcc tggaggaggc cacaactatc tgcttgctcc 3120ctccttgcat ctcaccagcc ccaccccaca agcctcccaa cactagagaa aataattcat 3180ccaggagagc aaatagaaga aaggaagaaa aactatttcc tcctccactt tccccactgc 3240cagaggaccc tccacgccgc agaaatgtca gtggcaataa tggtcccttt ggtcaagaca 3300aaaacatcgc catgactgga caaatcacat ctaccaaacc taagagaact gaaggcaaat 3360tctgtgctac tttcaaaggg atatcggtaa atgagggaga cactccaaaa aaggcatcct 3420ctgccaccat cactgtcacc aatactgcta 
ttgccactgc tactgtcact gctactgcca 3480ttgtcaccac cactgtcaca gctactgcca ccgccacggc caccaccaca actactacca 3540ctaccatttc caccatcacc tctaccatca ctactggcct catggatagc agtcacctgg 3600agatgacgtc ctgggcggct ctgccccttc tatccagcag cagcactaat gtccggagac 3660ccaagctcac ttttgatgac tcggttcaca atgctgatta ttacatgcaa gaagctaaga 3720agctgaagca caaagctgat gcactgttcg agaaatttgg caaagctgtg aattatgctg 3780atgccgccct ctccttcact gaatgtggca atgccatgga acgcgaccct ctggaagcaa 3840agtccccata caccatgtac tctgagactg tggagctcct caggtatgca atgaggctga 3900agaactttgc aagtcccttg gcttcggatg gggacaaaaa gctagcagta ctatgctacc 3960gatgtttatc actcctctat ttgagaatgt ttaagctgaa gaaggaccat gctatgaagt 4020actccagatc actgatggaa tattttaagc aaaatgcttc aaaagtcgca cagataccct 4080ctccatgggt aagcaatgga aagaacactc catccccagt gtctctcaac aacgtctccc 4140ccatcaacgc aatggggaac tgtaacaatg gcccagtcac cattccccag cgcattcacc 4200acatggctgc cagccacgtc aacatcacta gcaatgtgtt acggggctat gaacactggg 4260atatggccga caaactgaca agagaaaaca aagaattctt tggtgatctg gacacgctga 4320tggggcctct gacccagcac agcagcatga ccaatcttgt ccgctacgtt cgccaaggac 4380tgtgttggct gcgcatcgat gcccacttgt tgtagtgggt gttctcagat ctctagcatc 4440acgacccatc actctacctc taccagcgca ctgatggtca ctggtggaac tccactcact 4500ggggaacgtt ctctttggtt atgtttgttt ttatgcttct tttgttatct gtaaaaaaca 4560gaagtcattg taagttgaca ctacaactta agggcagtgt acgttttatt acttagtcat 4620tttttttctt ttagcatttg atatgcattt ctcagattcc accatctttt tgtgctttat 4680ggaatgacag tccctacaat attgttttaa gcccacacta cccaaaacaa agaatgggaa 4740gcacttgtga taaagacagg ctcctgagaa atgcaacaag tggtcttaca tatacatgag 4800aacttagaca caagggacca tcccccaaac tctactctta tacccagaaa agaacatatt 4860tcagaatctg tcaaactttt gtgtatccca cagattcaat cttcaggtga gaattttcat 4920tgtcaaaacc cactggttag atgttgtagc aacatcataa aatcaagagt atcaagaaaa 4980taaatgagca tagcaatgct actcttaaaa agatgctatg ccacacaacc agaggacttt 5040cttgttagca tccctttcct gattccctat tttgttaatt ttaatgataa gaagaaaggg 5100tgacatttat tttgacaagt tttaggcatc agctggcatc agtgtttttc aactccatta 
5160tttgaagtgt aaatcctcac ctggggttct ctgtgtgcaa agctgtcctt ttgaagaaca 5220gtttggttga tgcatgcctt agtagccaaa atgctacact ctagacttac aagtgggagt 5280taagagaggt ctggaaagtg tccaacaagg aattcacacc tctgcctcct ttgcaacaac 5340aacatttaca cagttggtaa gtgggtccat aactggcagg atttttaaat tgtattttgc 5400tcaaatctat gggaacaaaa gtcaaggtat cactacctag aagtaatgat atacagtttt 5460cttcctagtg gcttgaaaat ctggacttcc tcaattatta ttcacatttt ctctcttata 5520ggttttctgt tttctacttt cttttttctc ttatctgtgt ttccctttcc tttgtttggc 5580tcattaactt ttgactgaat tacaattact ccttttatta aagtccatat tattgtgaat 5640catttccatg aaaatttcta agaaaactcc aaactctcta aatagtagct aacttttatt 5700tttttaaaat gagtcgtggg gtagtgcttc accttgagat gctttgaaag agccctaaac 5760attgggaacc attcacctaa tttggagaca tttctcactg gttgtgacta cccccttatg 5820atccttcaca ttcattttat gtccctaaac atcacaatgt aaatatcatt tttgatgttc 5880cagctcacca gaagattctt acacttgggg taaacactat ccatgcatta cttactggta 5940attacctgct ggtatataat tccatgtagc ctttaatatg ctgggttatc aaattctgtt 6000cactgagtta tgaccagata aataatagat atgcacatga aagatgcaaa cttgtgtgat 6060tattaaagcc agccatgcag gtccatgata gaaacagcag gtgatgactc tgcactctca 6120ttgtcaaggt tagctatatc cccagttgca aaacagccag acttgagctg tgctctggtc 6180atctttgagt ttaaggcctt ttgttgtata aggctgtgga agttgtactc caatggctga 6240agccatgttg ttaatatggc tgatgggagc atccctgcag ctgaacccag cactttttat 6300gctcccactg tggttgagct ttatgtttac agtctcagca acaacactta tgcatccaaa 6360cactcacaaa tgaaacctga aagaatcttt tctgagcctc ttaaaagagg aaaatgatga 6420taacattaaa gactctgaac acccaaggtt ggtgtcacat ataaaaatta agctgatgac 6480tttgcagtga ctcaagttgt ctctttatca
tggtttacca ggtagagtgc ctggctatta 6540ctatataatg aagcccactg gcttgacttg taagttcaac ctaaaccaca atcctagacc 6600atcatggatt taggagtaga ttcttcttga aatcccacat ccagaaacta gacattagaa 6660tgttgaggca gtttcccaga gaaacaagca tattgcctca tggatgaaag acttgtagtt 6720ctagtttcag tgacttgtta tatctactta catacaacag ggaggcaaga ggattctctg 6780tcatctctgg tgactgagtg taaaatatgt gccaagtctg cagcacagtg accaaatctg 6840acaatcgagc tctggatcac cacttgatta tgtagtagac tcatttataa agcagcttag 6900gaactaatta aacatggagg atgaattacc ttcctatccc ttgagataag acatctttca 6960gtttcatgat taaggattgt tgctgtttta tagttactct gttcatcaca gtgtaaatgg 7020tgatgcgtgt cgtaggtgtg cagctatttg agggactaag ggatggagat attctgtcaa 7080atgaatctct tcagtatacc agtttgtggg agggatatga gacatgtgga tggcagtgag 7140agatcgtgcc tctagatctt gatggaggct tggtgagaca cacttaaata agcacgtgga 7200ggttagaata gagggcagag taaaaggaag ctccatctga gcaagtacac caaatgatct 7260cagccctgca acttgaccca ggtagggcca ccactacgcc ttcacttgtc acccaagctc 7320caaccacaga gagtttgaca agtttgtgtt atgatgttgg cttggctttg tatttttaat 7380taactttgga tttttagtgg ttttgtcata taactgtctg agtttggtag gtaggattac 7440tttgaaaagg gtttactagt gtggtcctcc gggtagaatt tagctgtaac atgttgttag 7500ccagcctgta gactgttaat tacttaataa tctcattggg aaaatactag tagttttata 7560tttggatgac ataattggaa aaagcagatt agctgctact acttttaaaa gacttaaggt 7620cgggatgcct ttttttccat gtaaggaaat gaaaagaccc aaaatcttca ggcaaaaagc 7680aagttgcaaa attagaaacc attggctaaa aatgtgtttt gttgagtttc caaatggatg 7740aattttcatt tggacattac atcactaaat tcattagatt ttgtctgcat tggaaagata 7800ctcttctagc atatctttcc caaagatatc taatttggat tctgtttcat gcaaatttgc 7860atcccggagg ttgaagttgg agtttgaggt tggaaaatat ctttgaaggc agaatcagtt 7920gagttgtgag ggtgaagcct cacatacttc tcaacagaca tgataaaatt cacctgcatg 7980agttggcagg tgggagaacc aaactggatc actgggtaag actactcagt aaagcaatga 8040actgcttgct tagagaagca tcactatccc cattgagaaa aatgtgtggc aagatgatac 8100agctacacag tatcaaatga atgggtcaat tcagcacccc caaatttaat tctgtgggga 8160aaaattattg agccagttgt cagtgttctg ttacatgact ggcagactaa attcttcatc 
8220gttgttgtta ttgttgttgt tgtttctcat tttcactcgc acggccttat tctcataatt 8280aaaatctaat tcattttctc tttagtgtta gtagactcca acaacagaag tggcatctgt 8340gtattcataa tcagcattta ccctggcagg agactaatca gataggccgg tctcagacat 8400taatcctacc atctgatatt tttggtgaag gaaaaagtat taattctctt tccatcctcc 8460tcctcagaaa tatagaagcc ctctttacca aaatcatcac attttactct gtaatctacc 8520agctaaaaga aaattgcatt gaagccccac aaagccagat tgcagttctt gccccttttt 8580gcgtctgaca tgagatgtta aagaattatt cattgtgctc acattgggtt aggggacact 8640gaactgcttt ttagatccat gatcagtcat cattcttcta agagattgga gctttgctgt 8700ttcattaact gtgcagtgta gactaatggt gtttaataaa aatcattcaa aatttcaaac 8760tcttttgcca gtgacctcaa ttttgttggc tctgtgattt gtatcagact ttgaggaggg 8820aagggggaag tgaaggaagc ctacgtccag gcccctgaca ggatgctgca gtagcaagct 8880caagctcgcc tgcctgccag cagttgctgg tgagcagcag catgcagacc agctgtggga 8940agcctcctga agaatgcccc agctgatgct ttcagctggg aatagtttgt tcctattggg 9000gaactcattg ttctccagtc tctgcagcag gaagccagct gtcatattcg gagggaattt 9060cagatgcttt acctttttgg ttttgtcctg catcactcat gtggctacga aagtgtctct 9120gagaatagag cccaatgtgg tgacaatggg tagtcaaatg caccccagat gctcaagccc 9180tgttgtggtt ctgcagtgtt tatgaaattg ggaggaagga gaccctggac agtaagcaaa 9240attggagaca ctccaacgag gctaagttaa tgccgtgttg cccagaacaa gatctagctt 9300ctcatttggt cagcctagca tgcaaccagt ggtgtgctgg taaaatgttt aacaaccagc 9360tcgctgagaa tagaaagcac ctggtttgca ccatttgcca atttccatgg cataaatact 9420accactttag atgattttaa gctaccaact gtgatgtcac tgaacacatg gttggaaaga 9480gatgcacgca gttggctctt gcaagcctgg gcaaaaatgc ttcaacacgc cactggatgc 9540agccagtcag agggttcata tttaatatat gtgttcatgt ggacacacac agacacacac 9600acacaaactc acccttacac acacacttcg atgactaaaa caattacata gttttaagat 9660atgaatcaat gtgtgaatgt agaaagctta tgataaggcc ctagaggtat gggttgccct 9720ggaagcctag gttttaagca ggagaatagc tgagaagaat gaagccctcc tgagctgaaa 9780ggagagatgg atcaatggag atggttccat catctccttc catatctcac aggtaaaatg 9840ggcactcaga aaaccctcac gattgatttt ttaaaaagat aagtgagtgt tttttatttt 9900attattattg tcatcattat tttgatttac 
aaatgctatt tgtaactttt acatgtaact 9960aggataaagt atttacggga actctatgga gaatagcaca atccagaatt tactgtgttt 10020ttcttttatg tgacgtggaa actcagtaat tctcccacct tcacattgtt gttcataaga 10080attttacttt agttattagg gaatctaagt tttttgttaa catttgtttt tagttaaaag 10140tatctactta ctgttttagc tctgaactca aaccagaata tctctgtatc aattgcatga 10200ctattcagaa acaataatcc aaaccaaaat aattcttttt ccacccagta cgaagaaaac 10260taagctcagt aacaagaagg cataaactaa agtatataat gaggctttca ttaaatacac 10320acacacacac actcacacac acacacatac actttttaaa tttttaaatt aggcctccac 10380acataaatca ttttgaaagt agaatagaaa atctcaaaga attcattctc ctggtcctgt 10440gcatcttctg cagttaataa gaggtttgta tctggaaaga tggaagaact tgttctaaaa 10500tcttattttt caaaaaaaaa tttccatttt ctctctgggc ctgtatccat ggttgaatgt 10560tagccctgga ggagatccat gtcttactcg ctctttctgg cccttctgtc ttttgcctct 10620gcaattcttt ttgtagctgg cacgatagca gggactgggg gtctatcctt tcatggtatt 10680gctacaatat ttgtccttac tggaaaatgg taacatccgg gtctgattta attggcatta 10740cacttacaca gggactctga gcacccccgt caccacacca gacagtggac cagttttcac 10800agctacaaag agctagaaat gtgtttaaca tcatccagtg catcccctaa ttcaaaacca 10860tcctcactaa tcaatcatat tcacccataa atattacaaa tgagattgat tccatctcaa 10920gacaatttgt caaatactta attttcttcc tggatgattc tacttactgg atattttaga 10980aagagaaatg tctgagataa aatccctcac atttactcaa tataacaaat tactgtttct 11040actcctattc tgagtagtgc ttctgaagat tgtttgctgt agtgttgtct ttgataaaat 11100gaatgtcagt agtgagcctt ttagagatac catgctcaga catcctcttt gggatcagaa 11160gatacctaaa attctcccct tttgcccact tggttagatg agtgatatat tctttggatc 11220ctgcaaagaa gagattggtt tcttttcttt tctggtggtg gtagtggttg tatctgtggc 11280tgtgatggtt gttgttactt gtctctctct ctctctggct ctggcttttg ctttcctgct 11340agtgttcttt ctctttccaa acaaatagtt aaattaaacg tgagcttctg aattgtactt 11400gttcatactt tcaaaacata acagattaat aaaaatagat gtgtcctgat ttaaaacatg 11460ccccctggaa aggcatgctg tattatgaaa tcgtgataat ataactgcat tattacatgg 11520cagtataaat attagtctgt tgaattcatt tgtccaattg tataactttg tggagcagtg 11580ttttgacctt tgatacataa ttctggagca agtggagtgg 
ttgcaggcag atgagacagt 11640gttatatcag gatttttcaa tcaactttag ttggaggcct ggcaattaca aacatcttca 11700gatgtttctg taaccattat aaatatgaaa aaaacctctt caaaaaattt cccatagtac 11760ttcagtcaag actttttagg tttatctttt ttttttcatt tctccttttc cttttccatt 11820atttttcgat gggggggttg ttatcattga ctgaagaaat attttgattg caatggtctc 11880tctctctctc cccctctctc tctctctcct ctattctttc ctccttccct ctgtccatca 11940cccctcatta aaatattgaa atctggagtc tttgataaat ctgcattaga ccaggctata 12000tgctaggaat gaaatctggg caaatatcga tgggttttca aagaatgctc catgttcatt 12060gggccctttc acaccccaca gtgataaatg aaaaggatag aggtagtttt ttcaaaagag 12120cactttaata atatcctctg agacctaatg cagtttaaca aatgactcca cctatttttc 12180cagtaggtaa attgactgag acttgcaaaa tacccctgag agttgtcagg ggtgtcttct 12240gcctggtcta tagcgtgtgt gtttgctttg tatctaacag gcacattcac gtctcgtgta 12300ctcatatgaa gtatttccta acattcccat tagcctgtat ataagaatca gaaagataat 12360cccaacatgt tgtaaatgaa gatgtgactc tataaccttt ctcttcttcc tggaaaaaaa 12420aggacatttt catgcatatt ttaaacagaa attttgtata tttaagtgtc atagaaaata 12480tttattgagt aactgggaca caaatgggaa tttaattgtc atcatatgct ttgtgtgtgg 12540ggatgcttac caacaccatg tcgctggacc attgtggcaa gccataactg cacaaagagt 12600acacatcgtc agtgtgtgtg tgtgtgtgtg tgtgcgcgca cgcacgtgcg tgtgtgtgtc 12660cctgcatgtg caacatgtct agcttgctgt ccttcatggg attttagctt tcccttcttg 12720aaaaacatta ttttacagtt ccaggaggcc ctggttacat tactatatga aggcagtgat 12780ttgaaatgaa aattcctttc ctcttggaag ctttggtcat aatatcatgg ttcaattaaa 12840cggattccac cggactttgt gatgaaaaag gctctgttaa aatccaattg agtttccaag 12900aggaaattgt agtaggtcaa gatgcatgag agggaagatg gaggccacct cagctggaga 12960acatgagctg agttgagccc tcagtgttga agttgacttg ctccaagctg cagtctaaaa 13020ccctggggcc cgtgcctggc ctatgctccc tcccaagtaa gtagaggagc agaaccatca 13080ggaacagcct gcctggctcc tatgaagaaa acttcctgac gtcctgtccc caaaggaaga 13140ccctttcccc aagggcaccc caggtggcca ttaaattgtg atgatcattc agaaagtgcc 13200cccttggctt tatgagaatc caattagtct tctgaaccac cttttcttgg gtgcagattt 13260ccaacattca tgctcattgc agatccacca actgtcactg ttcttaacaa 
gcatgctcgt 13320cttgtcagaa tttcagtaag ttccaatttc ctgtacagac cagggtaaac tgttctaaaa 13380tcaatcaatt aatgaaatgt tatctggttt ttaaaagctg gtttcatgtg ctttatgtgt 13440ataaaactat atctgcctgt gtggctttgc atttcaaatg tgtggcgcac aagcgttttg 13500ttggtgcttt gttctcagta cagtaactct gtgtacaaac attttaatgt ggttttgttg 13560ttttccaaca agatgtctct gtaaaaatga tattggctga gctggtgcgt tggtttctct 13620catagaggca ttaactatac tgccaatgca ttgaattatt taaaaatgca aaataaaatt 13680tttatgaaaa tctca 13695



