Patent application title: METHOD FOR IDENTIFYING OR DETECTING GENOMIC REARRANGEMENTS IN A BIOLOGICAL SAMPLE
Inventors:
Jun Komatsu (Bagneux, FR)
Pierre Walrafen (Montrouge, FR)
Maurizio Ceppi (Issy - Les - Moulineaux, FR)
Emmanuel Conseiller (Paris, FR)
Assignees:
GENOMIC VISION
IPC8 Class: AC12Q168FI
USPC Class:
506 16
Class name: Library, per se (e.g., array, mixture, in silico, etc.) library containing only organic compounds nucleotides or polynucleotides, or derivatives thereof
Publication date: 2016-02-04
Patent application number: 20160032405
Abstract:
A method for detection, visualization and/or comparison of polynucleotide
sequences of interest using specially designed sets of long and short
probes that enhance resolution and simplify visualization and detection.
Probe compositions useful for practicing this method and procedures for
identifying useful probes and probe combinations. These methods are
useful for the detection of genomic rearrangements, especially those
associated with various diseases, disorders and conditions including
cancer or for assessment of genomic rearrangements associated with
therapy. The probe compositions may be used in kits for detection of
genetic rearrangements or in companion diagnostic products or kits, such
as kits for the diagnosis or assessment of predisposition to cancer such
as colorectal cancer.Claims:
1-48. (canceled)
49. A kit comprising a set of short probes or a set of short and a set of long probe(s); and optionally one or more components for binding said probes to a polynucleotide, for performing molecular combing, and/or for detecting whether hybridization has occurred; wherein said short probes 10 kb or less and said long probes are 12 kb or more; and (i) wherein the short probes comprise a set of probes that taken together bind to a continuous stretch of more than 12 kb of the genomic region of interest; or (ii) wherein the long probes bind to sequences outside the genomic region of interest and do not overlap the short probe sequences; and optionally, where the repetitive sequences have been removed from the long and/or short probes.
50. A kit according to claim 49 for the detection of genomic rearrangements associated with a condition selected from the group consisting of: colorectal cancer or genetic predisposition to colorectal cancer, breast cancer or genetic predisposition to breast cancer, ovarian cancer or genetic predisposition to ovarian cancer, and lung cancer or genetic predisposition to lung cancer.
51. A composition containing a set of short, or short and long probe(s), wherein at least two of said probes detect a genetic rearrangement by using Molecular Combing, said short probes binding to at least one region of interest without gaps longer than 15 kb between the portions of the target sequence bound by the short probes in each region of interest and said composition comprising either at least one short probe of less than 10 kb and at least one non-overlapping long probe of more than 14 kb that binds to a sequence near but outside of the region(s) of interest; or at least one group of at least two short probes, less than 10 kb each, which total length is longer than 14 kb and less than 150 kb, hybridizing contiguously on the genetic target.
52. The composition of claim 51, wherein the short probe(s) range from 0.5 kb to 9 kb.
53. The composition according to claim 51, wherein the long probe(s) range from 14 kb to 40 kb.
54. The composition according to claim 51, wherein the size of the short probes range from 0.5 to 9 kb and wherein at least 90% of the frequent repetitive sequences have been removed from the short probes.
55. The composition of according to claim 51, wherein the probe sequences hybridize specifically on the MSH2 gene or in the region of the MSH2 gene or on the MLH1 gene or in the region of the MLH1 gene.
56. The composition according to claim 51, wherein said short probe sequence(s) are selected from the group consisting of the group of short probes obtained by amplification using the primer pairs disclosed as SEQ ID NO: 21-60, SEQ ID NO:95-122; SEQ ID NO:163-172; SEQ ID NO:185-202 and SEQ ID NO:227-248 or the long probe sequence(s) are selected from the group consisting of the group of long probe obtained by amplification using the primer pairs disclosed as SEQ ID NO: 61-76 and SEQ ID NO:123-138.
57. A kit according to claim 49 wherein the short probes are at least 500 bp each.
58. A kit according to claim 57 wherein the long probes are 14 kb or more, and optionally wherein the long probes are shorter than 150 kb.
59. A kit according to claim 58 for the detection of genomic rearrangements associated with a condition selected from the group consisting of: colorectal cancer or genetic predisposition to colorectal cancer, breast cancer or genetic predisposition to breast cancer, ovarian cancer or genetic predisposition to ovarian cancer, and lung cancer or genetic predisposition to lung cancer.
60. A kit according to claim 59, wherein sequences of more than 200 bp, of which more than 10 copies with less than 20% mismatch are found within the regions of interest, have been removed from the short and/or long probes.
61. A kit according to claim 58, wherein sequences of more than 200 bp, of which more than 10 copies with less than 20% mismatch are found within the regions of interest, have been removed from the short and/or long probes
62. A kit according to claim 59, wherein sequences of more than 200 bp, of which more than 10 copies with less than 20% mismatch are found within the regions of interest, have been removed from the short and/or long probes
63. A composition according to claim 51, wherein sequences of more than 200 bp, of which more than 10 copies with less than 20% mismatch are found within the regions of interest, have been removed from the short and/or long probes.
64. A composition according to claim 52, wherein sequences of more than 200 bp, of which more than 10 copies with less than 20% mismatch are found within the regions of interest, have been removed from the short and/or long probes.
65. A composition according to claim 53, wherein sequences of more than 200 bp, of which more than 10 copies with less than 20% mismatch are found within the regions of interest, have been removed from the short and/or long probes.
66. A composition according to claim 52, wherein the sizes of long probes range from 14 kb to 150 kb, and wherein sequences of more than 200 bp, of which more than 10 copies with less than 20% mismatch are found within the regions of interest, have been removed from the short and/or long probes.
Description:
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] The present application is a continuation of U.S. Ser. No. 13/665,440, filed Oct. 31, 2012, which claims priority to U.S. Provisional Application No. 61/553,889, filed Oct. 31, 2011, the entire contents of which are incorporated herein by reference. On Oct. 30, 2012, International Application PCT/IB/2012/002423 was also filed with the same title, the entire contents of which are incorporated herein by reference.
BACKGROUND OF THE INVENTION
[0002] 1. Field of the Invention
[0003] The invention relates to high-resolution, precise method for detecting genomic rearrangements in vitro using specially designed combinations of polynucleotide probes. The invention concerns accurate methods of detection and diagnosis of conditions, disorders and diseases associated with rearrangement of genomic DNA.
[0004] 2. Description of the Related Art
[0005] The Multigenic Paradigm of Human Diseases
[0006] Advances in genetic analysis of human diseases have provided better insights into the molecular mechanisms contributing to disease initiation and progression. Previous associations were made between particular diseases and association and/or linkage disequilibrium to single base mutations in somatic genetic sequences or with particular single nucleotide polymorphisms ("SNPs") in genomic DNA. Newer technologies have provided evidence that larger genetic alterations and rearrangements are associated with, or can constitute major causes of diseases, disorders or conditions having a genetic origin or basis. Disease associations have now moved from a monogenic to a multigenic paradigm where a disease's origins and progression is mainly linked to more than one single genetic mutation or origin. While these new insights provide better avenues for disease detection and treatments, they also highlight the need for combinatorial genetic analysis that goes beyond detection of single mutational events or SNPs by assessing disease associations with larger genomic rearrangements. Such combinatorial genetic analysis would provide a better, more precise and accurate diagnosis of a particular condition, disorder, disease or pathology, but would also help establishing a more appropriate medical survey, more accurate therapeutic decisions and interventions, as well as help in assessing the efficacy of such therapies and interventions.
[0007] Multigenic Causes of Genetic Disease
[0008] Genetic disorders manifesting the same or similar clinical signs and consequences can arise from both single and exclusive, or combined, mutations in various genes. Such mutations can fall within either the single base alteration and/or the class of large genetic rearrangements. A few examples of such genetic disorders are Fragile X syndrome (mutations and expansions in the FMR1 gene), Ataxia Telangectasia (single base pair mutations in either intronic and exonic sequences as well as deletions and translocations of the ATM gene), Seckel syndrome (mutations as well as large rearrangements in SCKL1, SCKL2, SCKL3, PCTN and ATR), autism (mutations as well as large rearrangements in GLO1, MTF1 and SLC11A3), Spinal Muscular Atrophy (mutations, deletions, transconversions as well as cis-duplications involving the SMN1 and SMN2 genes) and myotonic dystrophy (trinucleotide/tetranucleotide expansions in DM1 and DM2).
[0009] Multigenic Causes of Cancer Predisposition
[0010] In the case of cancer predisposition, there are several examples of familial cancer predisposition syndromes for which one can nominate several causative genes for which both single base alterations and/or large rearrangements were identified.
[0011] Breast and Ovary Cancer. Causative genes: BRCA1, BRCA2, ATM . . . mutation type: higher proportion of point mutations identified so far.
[0012] Hereditary nonpolyposis colorectal cancer (Lynch syndroma). Causative genes: MSH2, MLH1, MSH6, EPCAM, . . . mutation type: equivalent proportion of point mutations has also been identified.
[0013] Multigenic Causes of Cancer Progression
[0014] Cancer progression is surely the human disease domain where the monogenic causative hypothesis was definitely ruled out since several years. First, the disease's initiation is strictly dependent of two molecular events (immortalizing and transforming) due to genetic alterations in at least two independent genes classified at either oncogene or tumor suppressor genes. Second, the disease's progression is linked to additional genetic alterations independent from the causative ones. Not only do these additional alterations play a role in cancer progression, they also were demonstrated to be the basis for appearance of resistance to therapy during treatments. Strikingly, in the list of cancer related genes, if extremely rare examples are only subject to discrete single base mutations (e.g., KRas or BRaf), the large majority is either subject to only large rearrangements (e.g., HER2, ALK . . . ) or to both single base mutations and large rearrangements (p53, c-myc, c-Met, EGFR . . . ).
[0015] The identification and characterization of multigenic conditions, disorders and diseases, including cancer, cardiovascular disease, diabetes and other heritable genetic conditions has been made difficult in part due to the imprecision of existing methods of molecular diagnosis. Molecular Combing is probably the sole approach allowing detecting all type of large genetic rearrangements (deletion, amplification, expansions, inversions, translocations . . . ) even in a complex and heterogeneous population (such as tumors).
[0016] High resolution barcodes allowing multiplex analysis of patients could help diagnostic at different level such as for patient stratification/classification and/or prognosis.
[0017] Multiplex High Resolution Barcodes for Identifying the Right Genetic Alterations as a Key Driver for Therapeutic Intervention
[0018] The Example of Myotonic Dystrophy
[0019] Myotonic Dystrophy (DM1) and Myotonic Dystrophy 2 (DM2) are two muscular dystrophies characterized by trinucleotide/tetranucleotide expansions in two different genes. If severe forms of DM1 can be clinically differentiated from DM2, milder DM1 forms are displayed extremely similar clinical signs than DM2. There is currently no cure for or treatment specific to myotonic dystrophy. However, DM1 patients exhibit Complications of the disease (heart problems, cataracts . . . ) not existing in DM2 that could can be treated but not cured. Differentiating DM1 and DM2 by the use of a multiplex assay of high resolution barcodes could thus help preventing and treating secondary effects
[0020] The Example of Hereditary Breast and Ovary Cancer
[0021] In certain countries (U.S.) detecting constitutional alterations in BRCA1/2 drives to therapeutic intervention (surgery/reconstitution). Thus, there is a clear need for an accurate diagnostic comprising all the potentially involved genes. Such a test could be made on the basis of a multiplex assay of high resolution barcodes comprising large chromosomal regions around genes known to be involved in this syndrome; BRCA1, BRCA2, ATM, ATR . . . .
[0022] DNA Damage and Response Inhibitors Example
[0023] Synthetic lethality became a strong reality for therapeutic decision to include Cancer patients in specific protocols/regimens. One of the first examples was given with the demonstration that Breast cancer patients with BRCA deficiency exhibit a higher sensitivity to PARP inhibitors, a new category of drug acting on DNA Damage and Response pathway. More recently, this was extended to other type of inhibitors in this category such as ATM inhibitors but also to more traditional anti-cancer drugs including all types of DNA polymerase and replication inhibitors.
[0024] Not only does this concept extended to other inhibitors, but it was also demonstrated that it could be extended to other types of cancers such as lung and metastatic melanoma.
[0025] Here, a multiplex high resolution barcode will allow detection of genetic alteration in genes involved in DNA damage and response that could help predicting sensitivity to this class of inhibitors. A list of such genes could include BRCA1, BRCA2, ATM, ATR, MSH2, MLH1, MSH6, EPCAM . . . .
[0026] The Lung Cancer Example
[0027] Numerous alterations involved in lung cancer could be multiplexed for a better patient classification such as:
[0028] LOH/Deletion (P53, STK11, LKB1, BRG1, KLF6);
[0029] Amplification (FGFR1, MET, EGFR, HER2 . . . );
[0030] Translocation: (ALK);
[0031] All these genetic alteration are associated to therapeutic treatments:
[0032] P53: Nutlin (low doses Actinomycin D produce similar effects)
[0033] FGFR1: Masitinib, PD173074, SU5402 TK1258 AZD4547 . . .
[0034] MET: GSK1363089, ARQ197, SGX523, XL184 . . .
[0035] EGFR: Tarceva, Erbitux, Vectibix . . .
[0036] HER2: Herceptin, Lapatinib . . .
[0037] ALK: Crizotinib
[0038] As at least 30% of NSCLCs were demonstrated to be dependent on at least one of these mutations, defining the genetic profile of the tumor could help driving therapeutic options. This could be made possible by designing multiplex assays combining high resolution barcodes covering this major genetic loci.
[0039] Localization of (Genetic) Sequences of Interest
[0040] Genetic sequence is the most fundamental information to synthesize functional protein. Alteration of genetic sequence sometimes results in loss of functional protein synthesis. In addition to alteration of genetic sequence, loss or gain of genetic sequence (copy number variation, CNV) also can be problematic for homeostasis of cellular activity. For example, loss of (functional) anti-tumor protein (p53) or gain of proto-oncogene (c-myc) results in cancer-prone cell. When such mutation happens (or exists) in germ cell, this mutation spreads whole cell in an individual who is either carrier or patient of genetic disease, or has a predisposition to cancer. The germline mutation can be heritable. These days CNV becomes more and more important to understand in the field of genetics (ref 1). However, copy number count alone is not always sufficient and it is often critical to establish the actual location of sequence elements. This is strikingly the case for e.g. balanced translocations. DNA sequencing and CNV detection methods such as array-based comparative genomic hybridization (aCGH) and quantitative PCR generally cannot detect these balanced mutations because these methods assess whether the sequence and the copy number are correct or not. FISH and its extended forms such as fiber-FISH or molecular combing can address these balanced mutations with different resolutions and precisions depending on methods.
[0041] Resolution and Precision
[0042] The use of BAC/PAC/cosmid probes on targeted regions was successfully conducted to detect large (a few kb to tens of kb) genomic rearrangements (ref 2). In these approaches, the minimum size of detectable events (e.g., the size of the deleted or amplified sequence), hereafter designated as the "resolution" of such an assay, is limited due to the large standard deviation involved in measuring probes or gaps of tens of kilobases. Indeed, in such assays the standard deviation of measurements increases with the length of the measured element. For example, a 40 kb-probe is measured with a standard deviation of ˜5 kb. Thus, if 16 measurements of a given probe are made on a slide, the precision on the size of the probe obtained as the mean value of measurements is in the order of magnitude of 2.5 kb (Considering the distribution is gaussian, and the precision is the half-width of the confidence interval, i.e. 2.sd/ n where sd=standard deviation and n=number of measurements). For a 10 kb-probe, where the standard deviation is ˜2 kb, the precision would be ˜1 kb. This illustrates the fact that shorter probes allow for better (lower) resolution.
[0043] Besides, the location of such an event (the position of the extremities of the event) may be defined with a precision (hereafter the location precision) limited by the size of the probe or gap within which it occurs: e.g. if a 40 kb probe is estimated to measure 39 kb in a sample, one can conclude that a 1 kb deletion occurred somewhere within the probe, with no further precision--thus, somewhere in a 40 kb genomic region. If the same 1 kb deletion had occurred within a 10 kb probe, the location of that deletion would be known with a better precision, as the range would be reduced to a 10 kb genomic region. Therefore, the smaller the probes and gaps, the better the location precision.
[0044] There are limits to small probes: (i) below a certain size, they become difficult to detect; (ii) they involve more complex color schemes (as there are relatively more probes); (iii) there are more distinct probes to cover a given region, and the experiments are therefore more expensive and time-consuming; (iv) most importantly, fast and reliable identification of probes, whether by a human operator or a piece of software, is easier with longer probes, as they are more readily distinguished from background. Indeed, background is mainly constituted of roughly circular fluorescent spots. When large enough, the shape of these spots allows to one to easily distinguish them from probes. However, when their size is small enough, they appear difficult to distinguish from small probes.
[0045] In operating conditions according to the invention, probes shorter than ˜3 kb are detected with a diminished efficiency. Within the 3-10 kb range, the standard deviation of measurements varies little, and there is therefore little benefit in resolution with the shorter probes within this range. Therefore, this range is usually considered to be a good compromise for probe size. However, in cases where probes are close enough (less than 10 kb gaps), smaller probes (within the 500-3000 bp range) are still useful, as they will be detected in at least a fraction of signals and the presence of the corresponding sequences may therefore be established with certainty. It was also found that detection of isolated probes longer than 12 kb (preferably longer than 14 kb) is more reliable, whether for a human operator or for automatic detection software.
[0046] Exclusion of Repeats
[0047] Eukaryotic genomic DNA contains various repetitive sequences, i.e., sequences that appear more than once (and more than statistically predicted based on their length and base content) in a normal haploid genome. Among these, some appear with very high frequency (tens of thousands to millions of copies). In human genomic DNA, the most abundant of these is the Alu family, which has ˜1,000,000 copies constituting ˜10% of the genome. In any hybridization procedure involving human genomic DNA, it is expected that probes carrying such repeats would hybridize on numerous targets, generating non-specific signal from regions throughout the genome. Other types of repetitive sequences exist, with lower frequency, and often more specific localization. The number of copies and repeat sequence length may vary widely, as well as the degree of homology. Beta-satellite sequences, for example, are present in multiple copies (hundreds to thousands), usually as tandem repeat arrays comprising hundreds of copies of the same 50-100 bp long sequence, specifically localized in a limited number of loci. Strategies to get rid of the non-specific signals depend on the type of procedure and probe. Schematically, when probes are very short sequences of DNA (oligonucleotides, typically less than 100 bp), as in aCGH procedures, the sequence of the oligonucleotides is chosen to be free of repetitive sequences, by comparison with repetitive sequences found in databases. This strategy is only practical for very short probes, as short sequences free of repetitive sequences are relatively abundant, but unpractical for longer probes, as long stretches completely devoid of repetitive elements are rare (although this has been adapted to longer FISH probes, in an approach that suffers multiple drawbacks, see below). Besides, even for short probes, it constrains the design of probes heavily and some genomic regions, rich in repetitive sequences, have lower density of coverage (and thus lower resolution of events) due to this constraint.
[0048] When probes are longer (typically PCR products or cloned DNA inserts--1 to 150 kb), in Southern Blot or in FISH procedures, non-labeled competitive DNA, enriched in repetitive elements such as Alu repeats (usually Cot-1 DNA), is added in large excess along with the labeled probe. Competition of unlabelled probes on the repetitive sequences minimizes the hybridization of labeled probes. This strategy is expensive and since the competitor DNA is not purely made of repetitive sequences, competition also occurs on the unique sequences for which the probes were designed, thus limiting the amount of competitor DNA that may be used. Therefore, the efficiency of this approach is limited.
[0049] An alternative approach for longer probes has been proposed by Knoll and collaborators (U.S. Pat. No. 7,014,997), resembling the strategy usually adopted for oligonucleotides: probes are chosen within sequence intervals devoid from repetitive elements. This strategy is based on bioinformatics analysis of the regions of interest and exclusion of known repetitive sequences by comparison with sequence databases. However, this approach has several limitations: prior knowledge of the repetitive sequences is required, which can be a problem e.g. in species where such knowledge is unavailable. More importantly, intervals longer than 2 kb devoid of repetitive sequences appear only once in 20-30 kb on average and are unevenly distributed (Considering the distribution is gaussian, and the precision is the half-width of the confidence interval, i.e. 2.sd/ n where sd=standard deviation and n=number o) so the design of probes would be highly constrained, impairing the possibility to design a high-resolution code. This would prove especially difficult in repeat-rich regions, and/or regions where pseudogenes are located next to homologous genes of interest--such low-copy repetitive sequences being also excluded with the strategy from Knoll and co (ref. 3). Since regions targeted in rearrangement tests, e.g., for diagnostics purposes, often display these features, this approach is not suitable for the design of high-resolution barcodes and especially not if such a code is to be used for diagnostics purposes. Distinctions between this approach and the invention are disclosed in more detail below.
BRIEF SUMMARY OF THE INVENTION
[0050] The present invention concerns the field of the in vitro diagnosis and detection of genetic rearrangements and is related to a method to identify or detect genetic rearrangements in a biological sample to be tested which are already known or which are new and provide markers for example of diseases as cancers or metabolic or foetal genetic diseases. The invention is characterized by using compositions containing purified or synthesized nucleic acid molecules (polynucleotides) having nucleotide sequences selected as short sequences with a length of less than 10 Kb and associated in the said method with other different nucleic acid molecules (polynucleotides) having nucleotide sequences non-overlapping with the former ones and having a size longer than 12 Kb. The selected nucleotide sequences (polynucleotides) used as probes are partly deleted of their natural frequently repeated sequences. The present invention concerns also improvements brought to the design of set of probe sequences for the detection of genetic rearrangements by hybridization as with fiber-FISH-like technologies such as Molecular Combing. The improvements described herein allow for high precision/high-resolution detection of rearrangements in time- and cost-efficient assays. This invention also relates to the use of probe sequences for diagnostics applications and companion diagnostics tests, to a method of detection of presence or absence of alterations in sequences and to a kit for the above uses. This is illustrated hereinafter with sets of nucleotide sequences corresponding to parts of at least two genes: MSH2 and MLH1 or to the regions of MSH2 and MLH1, whose mutations increase the risk of occurrence of human colorectal cancer.
[0051] The invention is related to the sets of polynucleotides or probes labeled or not which are specific of said genes. Presently, the detection of genetic rearrangements using current technologies is often insufficiently reliable for diagnostics use. Unlike most technologies used to detect genetic alterations, which suffer strong intrinsic limitations towards some types of rearrangements, direct technologies such as FISH or Fiber-FISH can intrinsically detect any type of rearrangements. Their use is mainly limited by their resolution. Molecular Combing, on the other hand, may reach sufficient resolution, but probe designs currently used fail to allow cost- and time-efficient high resolution analysis of rearrangements.
[0052] These improvements involve the combination within the same sets of probes of -typically shorter--probes designed to optimize the sensitive detection and precise measurement of rearrangements and--typically longer--probes to allow for fast and reliable detection of signals of interest when analyzing results. Alternative designs where the longer probes are replace with a combination of shorter probes having equivalent functions and effects are also disclosed.
[0053] Specific aspects of the invention based on the concept of combining small probes for resolution and long probes for ease of detection for the detection on one or more genomic region(s) of interest as disclosed in more detail below.
[0054] The invention thus concerns a method for detecting mutated or rearranged genomic polynucleotide (target) sequence comprising:
[0055] (a1) hybridizing a target genomic polynucleotide comprising one or more genomic region(s) of interest, where mutations or rearrangements are sought, to a set of short probes that bind to each region of interest without long gaps between the portions of the target sequence bound by the set of short probes, where on each genomic region a subset of short probes are selected so that when taken together they form a long contiguous stretch inside or outside the region of interest, and wherein the probes may optionally have frequent repetitive sequences removed and thus more generally are optionally devoid of such repetitive sequences; or
[0056] (a2) hybridizing a target genomic polynucleotide comprising one or more genomic region(s) of interest, where mutations or rearrangements are sought, to a set of short probes that bind to each region of interest without long gaps between the portions of the target sequence bound by the set of short probes and to one or more long (docking) probe(s) that bind to sequences near but outside of the region(s) of interest; wherein the sequence(s) of the long probe(s) does not overlap that of the short probes and wherein the short and/or long probes may optionally have frequent repetitive sequences removed and thus more generally are optionally devoid of such repetitive sequences;
[0057] (b) detecting the locations of hybridized probes on the genomic region(s) of interest; optionally,
[0058] (c) comparing the location of the hybridized probes on the target genomic polynucleotide sequence with one or more motifs based on the hybridization of said probes to a reference, control, normal, not mutated, or not rearranged genomic polynucleotide sequence; and optionally,
[0059] (d) correlating the presence of a mutated or rearranged genomic polynucleotide with a specific phenotype, disease, disorder, or condition.
[0060] The mutated or arranged genomic polynucleotide sequence can be obtained from a subject who has cancer or who is suspected to having cancer, for example, from a subject who has colorectal cancer or who is suspected of having colorectal cancer. In such a case, the short and long probes identify mutations or genomic rearrangements associated with colorectal cancer and a control or reference sample would not contain these mutations or rearrangements. The presence or risk of developing colorectal cancer is assessed by comparing a target genomic polynucleotide sequence with the reference and determining whether a mutation or rearrangement associated with colorectal cancer is present. This method can be practiced with specific probes corresponding to or derived from Probe sets 1, 2, 3 and 4. For colorectal cancer, a genomic region of interest can be selected from genes associated with this disease, such as MSH2, MLH1, MSH6, PMS2 or EPCAM.
[0061] Similarly, the method may be applied to samples obtained from subjects having or at risk of developing other kinds of cancer, such as breast cancer, ovary cancer, or lung cancer. The method may also be applied to samples obtained from subjects having or at risk of other kinds of diseases, disorders, or conditions, including cardiovascular disease, diabetes, neuromuscular disorders; such as myotonic dystrophy or spinal muscular atrophy or samples obtained from a subject who has, is suspected of having, or is suspected of being a carrier for a genetic or hereditary disease, disorder or condition, including known or unknown foetal genetic alterations. The sample can be obtained from a subject having a multigenic genetic or hereditary disease, disorder or condition or for a genetic or hereditary disease, disorder or condition associated with rearrangement of genomic DNA.
[0062] In some aspects of the invention, the sample will be obtained from a subject undergoing treatment for a disease, disorder or condition associated with a genomic or somatic genetic rearrangement and the results obtained are compared to results obtained at other time points before, during or after the termination of treatment. A companion test for evaluating the efficiency of a therapeutic drug on the mutated or rearranged nucleotide sequences of the gene or the region of the gene of interest can be performed using the short and long probes according to the invention.
[0063] Preferably, in the method described above, the hybridizing with the short and long probes in step a) will be performed simultaneously.
[0064] Preferably, the short probes range in length from 0.5 kb to 10 kb and the maximum size of the gaps between the short probes when they are bound to the target is 15 kb, preferably 12 kb and more preferably 10 kb.
[0065] The number of short probes employed in the method described above can range from 1, 2, 3 to 10, 15 or more.
[0066] The maximum size for the long probes is 150 kb and these probes preferably range from 12 kb to 40 kb in length. Preferably, in order to have "long probe(s) that bind to sequences near but outside of the region of interest", distance between the long probes and the region of interest is no longer than 150 kb, and more preferably no longer than 75 kb and even more preferably no longer than 25 kb from the region of interest. The minimum size for a genomic region to be tested or targeted is 50 kb. The minimum number of regions of interest is one for a singleplex test and two or more for a multiplex test. Examples of combinations of short and/or long probes include at least one short (less than 10 kb) sequence and at least one non-overlapping long sequence (more than 15 kb), or at least one group of at least two short sequences, less than 10 kb each, which total group length is longer than 14 kb and less than 150 kb, hybridizing contiguously on the mutated or rearranged polynucleotide sequence. The short probes can comprise a set of contiguous probes that span a stretch of the genomic polynucleotide sequences inside or outside the region of interest that is at least 15 kb.
[0067] The long probes may have repetitive DNA sequences excluded. These repetitive sequences to be excluded would ordinarily appear more than once and more often than statistically predicted based on their length and base content, for example, repetitive sequences between 50 and 400 bp can be excluded, though shorter or longer repetitive sequences that decrease sensitivity or specificity of the method can be identified and excluded. An example of such a sequence is the repetitive Alu family DNA sequences.
[0068] According to an embodiment of the invention, in order for the probes, either short probes or long probes, to have repetitive sequences excluded, these probes are designed to hybridize in regions of the genome which are free of such repetitive sequences, i.e. which have less than 10% preferably less than 2% of the selected type(s) of repetititve sequences to be excluded.
[0069] In the method described above, the short and long probes are preferably fluorescently tagged and different components of the probe sets may be tagged with different labels, such as labels with different colors. Tagging provides one means to identify motifs or submotifs characteristic of a mutated or rearranged sequence.
[0070] Compositions or kits comprising a set of short probes or a combination of short and long probes as described herein and optionally one or more components for binding said probes to a polynucleotide, for performing molecular combing, and/or for detecting whether hybridization has occurred are also contemplated. For example, a composition containing the short and long probe(s) described above, wherein at least two of said probe sequences detect a genetic rearrangement by using Molecular Combing, said composition comprising either at least one short (<12 kb) sequence and at least one non-overlapping long sequence (>14 kb), or at least one group of at least two short sequences, less than 10 kb each, which total length is longer than 14 kb and less than 150 kb, hybridizing contiguously on the genetic target. The short probe(s) in such a composition may preferably range from 0.5 kb to 12 kb and the long probe(s) range from 14 kb to 40 kb. Frequent repetitive sequences described above may be removed from the probes. Examples of probe sequences are those that hybridize specifically on the MSH2 gene or in the region of the MSH2 gene or on the MLH1 gene or in the region of the MLH1 gene. Specific kinds of short probe sequence(s) where repetitive sequences have been removed include those selected from the group consisting of or comprising the sequences obtained by PCR amplification on human genomic DNA using the primer pairs described in Table 1 in the lines:
[0071] MSH2-v1
[0072] P3 (primer pairs P3a_MSH2-v1 to P3c_MSH2-v1, SEQ ID NO:21-26)
[0073] P4 (primer pairs P4a_MSH2-v1 to P4b_MSH2-v1, SEQ ID NO:27-30)
[0074] P5 (primer pairs P5a_MSH2-v1 to P5c_MSH2-v1, SEQ ID NO:31-36)P6 (primer pairs P6a_MSH2-v1 to P6b_MSH2-v1, SEQ ID NO:37-40)
[0075] P7 (primer pairs P7a_MSH2-v1 to P7c_MSH2-v1, SEQ ID NO:41-46)
[0076] P8 (primer pairs P8a_MSH2-v1 to P8b_MSH2-v1, SEQ ID NO:47-50)
[0077] P9 (primer pairs P9a_MSH2-v1 to P9c_MSH2-v1, SEQ ID NO:51-56)
[0078] P10 (primer pairs P10a_MSH2-v1 to P10b_MSH2-v1, SEQ ID NO:57-60) MLH1-v1
[0079] P3 (primer pairs P3a_MLH1-v1 to P3d_MLH1-v1, SEQ ID NO:95-102)
[0080] P4 (primer pairs P4a_MLH1-v1 to P4b_MLH1-v1, SEQ ID NO:103-106)
[0081] P5 (primer pairs P5a_MLH1-v1 to P5b_MLH1-v1, SEQ ID NO:107-110)
[0082] P6 (primer pair P6a_MLH1-v1, SEQ ID NO:111-112)
[0083] P7 (primer pair P7a_MLH1-v1, SEQ ID NO:113-114
[0084] P8 (primer pairs P8a_MLH1-v1 to P8d_MLH1-v1, SEQ ID NO:115-122)
[0085] and the short probes may be used in combination with the long probe sequence(s) selected from the group consisting of or comprising the sequences obtained by PCR amplification on human genomic DNA using the primer pairs described in Table 1 in the lines
[0086] MSH2-v1
[0087] P11 (primer pairs P11a_MSH2-v1 to P11c_MSH2-v1, SEQ ID NO:61-66)
[0088] P12 (primer pairs P12a_MSH2-v1 to P12e_MSH2-v1, SEQ ID NO:67-76)
[0089] MLH1-v1
[0090] P9 (primer pairs P9a_MLH1-v1 to P9c_MLH1-v1, SEQ ID NO:123-128)
[0091] P10 (primer pairs P10a_MLH1-v1 to P10e_MLH1-v1, SEQ ID NO:129-138).
[0092] Specific kinds of contiguous short probe sequence(s) forming long stretches include those selected from the group consisting of or comprising the sequences obtained by PCR amplification on human genomic DNA using the primer pairs described in Table 1 in the lines:
[0093] MSH2-v2
[0094] PE1-2 (primer pairs PE1_MSH2-v2 to PE2_MSH2-v2, SEQ ID NO:163-166) and
[0095] PE3-6 (primer pairs PE3_MSH2-v2 to PE5-6_MSH2-v2, SEQ ID NO:167-172), together forming one stretch;
[0096] PE9 (primer pairs E9_MSH2-v2 and I9-10_MSH2-v2, SEQ ID NO:185-188),
[0097] PE10 (primer pair E10_MSH2-v2, SEQ ID NO:189-190),
[0098] PE11 (primer pairs E11_MSH2-v2 and I11-12_MSH2-v2, SEQ ID NO:191-194),
[0099] PE12-14 (primer pairs E12_MSH2-v2 and E13-14_MSH2-v2, SEQ ID NO:195-198) and
[0100] PE15-16 (primer pairs E15_MSH2-v2 and E16_MSH2-v2, SEQ ID NO:199-202), together forming one stretch;
[0101] MLH1-v2
[0102] PE1-2 (primer pairs E1_MLH1-v2 and E2_MLH1-v2, SEQ ID NO:227-230),
[0103] PE3-4 (primer pairs I23_MLH1-v2, E3_MLH1-v2 and E4_MLH1-v2, SEQ ID NO:231-236),
[0104] PE5-6 (primer pairs E5_MLH1-v2 and E6_MLH1-v2, SEQ ID NO:237-240),
[0105] PE7-9 (primer pairs E7-8_MLH1-v2 and E9_MLH1-v2, SEQ ID NO:241-244) and
[0106] PE10-11 (primer pairs E10_MLH1-v2 and E11_MLH1-v2, SEQ ID NO:245-248), together forming one stretch;
The primers designed for the purpose of preparing short probes of the invention may have a sequence of 20 to 40 nucleotides and comprise in their 3' end a sequence of at least 20 contiguous nucleotides that base pairs with the target. The primer sequence thus may also comprise additional nucleotides that do not base pair with the target in its 5' end. The nucleotides which do not base pair may be useful for the construction of the primers or for the cloning of the amplified sequence resulting from polymerization starting from the primers. In a particular embodiment the sequence of the primer that hybridizes to the target is longer than 20 nucleotides. Molecular Combing is a powerful FISH-based technique for direct visualization of single DNA molecules that are attached, uniformly and irreversibly, to specially treated glass surfaces (Herrick and Bensimon, 2009); (Schurra and Bensimon, 2009). This technology considerably improves the structural and functional analysis of DNA across the genome and is capable of visualizing the entire genome at high resolution (in the kb range) in a single analysis. Another embodiment of the invention is a method for designing a set of short probes or set of short and long probes as described above comprising:
[0107] identifying a polynucleotide containing a genomic region of interest,
[0108] selecting long probe sequences outside of the genomic region of interest but within 100 kb of the closest probe in the region of interest, and preferably within 30 kb of the closest probe in the region of interest and optionally removing frequently repeated sequences from said long probe sequences,
[0109] selecting a short probe sequences from within the genomic region of interest so that no gaps longer than 20 kb, and preferably no gaps longer than 12 kb appear between the short probes; or selecting a series of short probes that together form a long continuous stretch that covers the genomic region of interest;
[0110] hybridizing the probes to a genomic polynucleotide comprising the genomic region of interest,
[0111] detecting the hybridized probes, and
[0112] determining which sets of probes form motifs that specifically identify the genomic sequence of interest from a reference genomic sequence.
[0113] The comparison of the location of the hybridized probes on the target genomic polynucleotide sequence with one or more motifs based on the hybridization of said probes to a reference, control, normal, not mutated, or not rearranged genomic polynucleotide sequence, as disclosed in the databanks or experimentally obtained on samples.
[0114] The techniques disclosed herein may be applied to diagnosis of disease as well as for the identification of genetic rearrangements associated with a disease, disorder or condition. They may also be used as companion diagnostics to study the responses of a subject or group of subjects who have particular rearrangements to therapy, responses to environmental agents, or the effects of lifestyle choices. Specifically, the diagnostic products and methods of the invention are useful for diagnosis and assessments for subjects having or at risk of developing colorectal cancer. High resolution barcodes allow multiplex analysis of patients for extended or expanded diagnosis at the levels of patient stratification/classification and prognosis. Thus, the techniques disclosed herein can also be used to predict the course and probably outcome of a disease, disorder or condition as well as the likelihood of progression, stability, or recovery. Multiplex high resolution barcodes also permit the identification of key genetic alterations in a subject that would benefit from a particular kind of therapy as well as a way to assess the reaction of a subject to a particular kind of therapy or therapeutic intervention. Specific embodiments of the invention include the following, which embodiments are especially carried out in vitro.
[0115] A method for detecting mutated or rearranged genomic polynucleotide sequence comprising: (a1) hybridizing a target genomic polynucleotide comprising one or more genomic region(s) of interest, where mutations or rearrangements are sought, to a set of short probes that bind to each region of interest without long gaps between the portions of the target sequence bound by the set of short probes said set of short probes optionally including or being in combination with a (sub)set of short probes selected so that on each genomic region some of the short probes when taken together form a long contiguous stretch inside or outside the region of interest and where the short probes may optionally have frequent repetitive sequences removed; or (a2) hybridizing a target genomic polynucleotide comprising one or more genomic region(s) of interest, where mutations or rearrangements are sought, to a set of short probes that bind to each region of interest without long gaps between the portions of the target sequence bound by the set of short probes and to one or more long (docking) probe(s) that bind to sequences near but outside of the region(s) of interest; wherein the sequence(s) of the long probe(s) does not overlap that of the short probes and wherein the short and/or long probes may optionally have some or all of the frequently repeating sequences removed; (b) detecting the locations of hybridized probes on the genomic region(s) of interest; optionally, (c) comparing the location of the hybridized probes on the target genomic polynucleotide sequence with one or more motifs based on the hybridization of said probes to a reference, control, normal, not mutated, or not rearranged genomic polynucleotide)sequence; and optionally, and/or (d) correlating the presence of a mutated or rearranged genomic polynucleotide with a specific phenotype, disease, disorder, or condition.
[0116] The invention relates in particular to the method herein described wherein the mutated or rearranged genomic polynucleotide sequence is obtained from a subject who has cancer or who is suspected of having cancer or who is susceptible to have a genetic predisposition to cancer.
[0117] The invention also relates in a particular embodiment to a method wherein the mutated or rearranged genomic polynucleotide sequence is obtained from a subject who has colorectal cancer or who is suspected of having colorectal cancer or who is susceptible to have a genetic predisposition to colorectal cancer, wherein said short and long probes identify mutations or genomic rearrangements associated with colorectal cancer, wherein said control, not mutated or normal genomic sequence is obtained from a subject not at risk for colorectal cancer and wherein the detection of a genomic rearrangement; and assessing presence of or risk of developing colorectal cancer when said genomic rearrangement is detected. In this method the probes can hybridize specifically on the MSH2 gene, in the region of the MSH2 gene, on the MLH1 gene, or in the region of the MLH1 gene.
[0118] The invention also relates in a particular embodiment to a method wherein the mutated or rearranged genomic polynucleotide sequence is obtained from a subject who has breast cancer or who is suspected to having breast cancer or who is susceptible to have a genetic predisposition to breast cancer.
[0119] The invention also relates in a particular embodiment to a method wherein the mutated or rearranged genomic polynucleotide sequence is obtained from a subject who has ovarian cancer or who is suspected to having ovarian cancer or who is susceptible to have a genetic predisposition to ovarian cancer.
[0120] The invention also relates in a particular embodiment to a method wherein the mutated or rearranged genomic polynucleotide sequence is obtained from a subject who has lung cancer or who is suspected to having lung cancer or who is susceptible to have a genetic predisposition to lung cancer.
[0121] The invention also relates in a particular embodiment to a method wherein the mutated or rearranged genomic polynucleotide sequence is obtained from a subject who has a cardiovascular disease, disorder or condition or who is suspected of having cardiovascular disease, disorder or condition or who is susceptible to have a genetic predisposition to cardiovascular disease, disorder or condition.
[0122] The invention also relates in a particular embodiment to a method wherein the mutated or rearranged genomic polynucleotide sequence is obtained from a subject who has a diabetes or who is suspected of having diabetes or who is susceptible to have a genetic predisposition to diabetes.
[0123] The invention also relates in a particular embodiment to a method wherein the mutated or rearranged genomic polynucleotide sequence is obtained from a subject who has a neuromuscular disorder or who is suspected of having a neuromuscular disorder.
[0124] The invention also relates in a particular embodiment to a method wherein the mutated or rearranged genomic polynucleotide sequence is obtained from a subject who has, is suspected of having, or is susceptible of being a carrier for a genetic or hereditary disease, disorder or condition.
[0125] The invention also relates in a particular embodiment to a method wherein the short and long probe sequences are specific to human genes or to human genomic regions associated with cancer, colorectal cancer or a foetal genetic alteration known or unknown when said region or gene is mutated or genetically rearranged.
[0126] The invention also relates in a particular embodiment to a method wherein the mutated or rearranged genomic polynucleotide sequence is obtained from a subject who has, is suspected of having, or is suspected of being a carrier for a multigenic genetic or hereditary disease, disorder or condition or for a genetic or hereditary disease, disorder or condition associated with rearrangement of genomic DNA.
[0127] The invention also relates in a particular embodiment to a method wherein the mutated or rearranged genomic polynucleotide sequence is obtained from a subject undergoing treatment for a disease, disorder or condition associated with a genomic inherited or acquired rearrangement and the results obtained are compared to results obtained at other time points before, during or after the termination of treatment.
[0128] The invention relates to method of any of the embodiments described herein, characterized by the following features taken individually or in any combination: the hybridizing with the short and long probes in (a2) is performed simultaneously; the short probes are 10 kb or less; and/or the short probe(s) comprise at least one short (less than 10 kb) sequence and at least one non-overlapping long sequence (more than 12 kb), or at least one group of at least two short sequences, less than 5, 6, 7, 8, 9 or 10 kb each, total group length is longer than 12 kb and less than 150 kb, hybridizing contiguously on the mutated or rearranged polynucleotide sequence. In these methods the short probes may comprise a set of contiguous probes that span a stretch of the genomic polynucleotide sequences inside or outside the region of interest that is at least 14 kb; and/or the long probe(s) may comprise one or more docking probes of more than 14 kb and less than 40 kb. The long probe(s) may have a length of at least 14 kb and bind to a polynucleotide sequence outside the region of interest.
[0129] Both the long and short probes may be designed to exclude frequently occurring repetitive DNA sequences. These repetitive DNA sequences, which may be excluded from the long and short probes, will generally appear more than once and more often than statistically predicted based on their length and base content. For example, a repetitive DNA sequence between 50 and 400 contiguous nucleotides in length, which appear more than once and more often than statistically predicted based on their length and base content, can be excluded from the short and/or long probe(s). One example of a repetitive sequence that can be excluded from the short and long probes is or are members of the repetitive Alu family DNA sequences.
[0130] In some embodiments of the invention the probes in (b) of the first embodiment are fluorescently tagged so that they can be detected fluorometrically. In other embodiments in b) each probe is tagged with one of two or more fluorescent tags.
[0131] According to other embodiments of the methods above, motifs or easily identifiable subsets of the probes are detected and compared instead of every probe sequence.
[0132] The methods described above may employ at least 3, 4, 5, 6, 7, 8, 9, 10, 11, 12 or more short probes. These short probes may each have a length of least 500, 600, 700, 800, 900 or more base pairs (bp). In some embodiments of the methods above, the probes will be selected so that the gaps between short probes in the genomic region of interest are no more than 12 kb each. In further embodiments the short probes will bind to a single contiguous genomic region of interest or the short probes can be selected to bind to more than one non-contiguous genomic region of interest. The long probes used in the method above may be selected so as to be no more than 20, 30 or 40 kb. The or each of the genomic region(s) of interest in the methods described above can be selected to be longer than 50 kb.
[0133] Another embodiment of the invention is a kit comprising a set of short probes or a set of short and a set of long probe(s); and optionally one or more components for binding said probes to a polynucleotide, for performing molecular combing, and/or for detecting whether hybridization has occurred; (i) wherein the short probes comprise a set of probes that taken together bind to a long continuous stretch of the genomic region of interest; or (ii) wherein the long probes bind to sequences outside the genomic region of interest, do not overlap the short probe sequences; and optionally, where the repetitive sequences have been removed from the long and/or short probes. A kit of the invention is suitable and/or is specific for use in a method of the invention as disclosed herein. In a particular embodiment its short and/or long probes are characterized by the features described herein in relation with the methods. Such a kit may be employed for or contain instructions for the detection of genomic rearrangements associated with colorectal cancer or genetic predisposition to colorectal cancer; for the detection of genomic rearrangements associated with breast cancer or genetic predisposition to breast cancer; for the detection of genomic rearrangements associated with ovarian cancer or genetic predisposition to ovarian cancer; for the detection of genomic rearrangements associated with lung cancer or genetic predisposition to lung cancer.
[0134] Another embodiment of the invention is a composition containing the short, or short and long probe(s) described by the first embodiment above, wherein at least two of said probe sequences detect a genetic rearrangement by using Molecular Combing, said composition comprising either (a) at least one short (less than 10 kb) sequence and at least one non-overlapping long sequence (more than 14 kb), or (b) at least one group of at least two short sequences, less than 10 kb each, which total length is longer than 14 kb and less than 150 kb, hybridizing contiguously on the genetic target. In this composition the short probe(s) can range from 0.5 kb to 9 kb and the long probe(s) can range from 14 kb to 40 kb. The size of the short probes may range from 0.5 to 9 kb and at least 90% of the frequent repetitive sequences can be been removed from the short probe sequences. This composition may contain probes sequences that hybridize specifically on the MSH2 gene or in the region of the MSH2 gene or on the MLH1 gene or in the region of the MLH1 gene.
[0135] In yet another embodiment the invention involves a method for designing short and long probes described herein in relation to methods comprising (a) identifying a polynucleotide containing a genomic region of interest, (b) selecting long probe sequences outside of the genomic region of interest but within 100 kb of the closest probe within the region of interest and optionally removing frequently repeated sequences from the long probe sequences, (c) selecting a set of short probe sequences from within the genomic region of interest so that no gaps longer than 15 kb appear between the short probes; or selecting a series of short probes that together form a long continuous stretch that covers the genomic region of interest; (d) hybridizing the probes to a genomic polynucleotide comprising the genomic region of interest, (e) detecting the hybridized probes, and (f) determining which sets of probes form motifs that distinguish the genomic sequence of interest from a reference genomic sequence.
BRIEF DESCRIPTION OF THE DRAWINGS
[0136] FIG. 1, which includes sub-parts identified as FIG. 1A, FIG. 1B, and FIG. 1C. (A) FIG. 1A: Dot-plot of MSH2 gene sequence on RP11-1084A21 BAC clone. (B) FIG. 1B: probe code v1 (without repetitive element) on RP11-1084A21. (C) FIG. 1C: probe code-v2 on RP11-1084A21. Diagonal lines are perfectly matched region of DNA between two sequences. Dots are representatives of repetitive elements. Higher density of dots (or grey band) are higher density of repetitive element.
[0137] FIG. 2, which includes sub-parts identified as FIG. 2A, FIG. 2B, and FIG. 2C. Dot plot analysis of MLH1 region. (A) FIG. 2A: Dot-plot of MLH1 gene sequence on RP11-426N19 BAC clone. (B) FIG. 2B: probe code v1 (without repetitive element) on RP11-426N19. (C) FIG. 2C: probe code-v2 on RP11-426N19.
[0138] FIG. 3, which includes sub-parts identified as FIG. 3A and FIG. 2B. Designed probe set for MSH2 by exclusion of repetitive element. A) FIG. 3A: theoretical probe set (labeled in red and green in microscopy experiments represented here in grey and black, respectively), and position of exon (small numbered dots). (B) FIG. 3B: actual hybridization image corresponding to MSH2-v1 probe set. Original microscopy images consist of three channel images where each channel is the signal from a given fluorophore--these are acquired separately in the microscopy procedure. These channels are represented here as different shades on a grayscale: green probes are shown in black and red probes in gray, while the background (absence of signal) is white. The aspect ratio was not preserved, signals have been "widened" (i.e. stretched perpendicularly to the direction of the DNA fiber) in order to improve the visibility of the probes.
[0139] FIG. 4, which includes sub-parts identified as FIG. 4A and FIG. 4B. Designed probe set for MLH1 by exclusion of repetitive element. A) FIG. 4A: theoretical probe set (red and green), and position of exon (purple dot). (B) FIG. 4B: actual hybridization image corresponding to MLH1-v1 probe set. The same color conventions are used for diagrams and microscopy images as in panels A and B of FIG. 3.
[0140] FIG. 5, which includes sub-parts identified as FIG. 5A and FIG. 5B. Designed probe set for MSH2 with docking probes (v2). (A) FIG. 5A: theoretical probe set). B) FIG. 5B: actual hybridization image corresponding to MSH2-v2 probe set. The color conventions in this and the other 3-color microscopy images (and corresponding diagrams) is as follows: blue probes are represented in black, green probes in dark gray, red probes in light gray and the background is white.
[0141] FIG. 6, which includes sub-parts identified as FIG. 6A and FIG. 6B. Designed probe set for with docking probes (v2). (A) FIG. 6A: theoretical probe set). (B) FIG. 6B: actual hybridization image corresponding to MLH1-v1 probe set. The same color conventions are used for diagrams and microscopy images as in FIG. 5.
[0142] FIG. 7, which includes sub-parts identified as FIG. 7A, FIG. 7B, and FIG. 2C. Validation of genomic rearrangement in MSH2 in LoVo cell line with v2 probe set. Sketches of both theoretical probe set (top; FIG. 7A) and validated rearrangement (middle, FIG. 7B) by molecular combing. The photo (bottom, FIG. 7C) is the recurrent abnormal signal set which corresponding to deletion from exon 3 to exon 8 of MSH2 (as in middle). The same color conventions are used for diagrams and microscopy images as in FIG. 5
[0143] FIG. 8, which includes sub-parts identified as FIG. 8A, FIG. 8B, and FIG. 8C. Validation of genomic rearrangement in MLH1 in SK-OV-3 cell line with v2 probe set. Sketches of both theoretical probe set (top; FIG. 8A) and validated rearrangement (middle; FIG. 8B) by molecular combing. The photo (bottom; FIG. 8C) is the representative (but few cases) signal set corresponding to the upper stream of MLH1 probe set (left side of theoretical probe set). The difference of observation number between MSH2 probe signal (normal) and MLH1 (a part of left side) clearly demonstrates that deletion of exon 4 to 19 in MLH1 is homozygous, (consistent with reference 7). Molecular combing test also revealed that the breakpoint of deletion is larger than previously reported (downstream probes from exon 19 are all deleted). The same color conventions are used for diagrams and microscopy images as in FIG. 5
[0144] Table 1. describes primer sequences and coordinates on human genomic DNA used for hybridization fragment synthesis to design the probes of the invention. These primers or variant therefore obtained by adding nucleotides in the ends of the described sequences and having up to 40 nucleotides, are part of the invention.
[0145] Table 2. Analysis of sequence of probe sets and their covering region. These sequences and the sets of probes that are disclosed in particular, are part of the invention.
[0146] Sequence of each of probe sets or region was subjected to RepeatMasker test and some of representative values are shown in the table. Sum length: sum up of sequence of all probes in each set. For MLH1 and MSH2 regions, this is the total length of each region. Repeat length: sum of sequences recognized as sorts of repeat in human genome. This includes sequences other than SINE. Total repeat. % of repeat length in sum length. SINE: % of sequences categorized as SINE in sum length. ALUs: % of sequences categorized as Alu family sequences in sum length.
DETAILED DESCRIPTION OF THE INVENTION
[0147] The above described strategies, for the reasons mentioned, are unsuitable to design a high-resolution code for diagnostics applications using technologies such as molecular combing.
[0148] In the present invention, the probes are defined as follows: a short probe is a nucleic acid sequence complementary to a genomic sequence, which probe can be detected with a given marker (such as a fluorochrome) once hybridized on the genomic sequence. One probe may be either made of (i) one single fragment covering the whole sequence, or of (ii) several exactly contiguous fragments, and/or (iii) slightly overlapping fragments (with an overlap less than 250 bp) and/or (iv) fragments separated by a very short gap (less than 1000 bp). With such short overlaps or gaps, using Molecular combing in our current setup, the fragments appears almost contiguous. The distance may be adjusted depending on the specific technique and experimental conditions. For example, with less resolutive conditions, longer gaps (less than 2 kb) or overlaps may be tolerated, provided fragments separated by such a gap still appear contiguous. Under more resolutive conditions, gaps should be shorter (less than 200 bp) in order for the fragments to appear contiguous. Short probes range in size from 500 bp to 10 kb.
[0149] A long probe is a nucleic acid sequence complementary to a genomic sequence, which probe can be detected with a given marker (such as a fluorochrome) once hybridized on the genomic sequence. One probe may be either made of (i) one single fragment covering the whole sequence, or of (ii) several exactly contiguous fragments, and/or (iii) slightly overlapping fragments (with an overlap less than 250 bp) and/or (iv) fragments separated by a gap (less than 3.5 kb), provided that more than 70% of the target sequence stretch is covered by probes (i.e. provided the gaps represent less than 30% of the target sequence). With such overlaps or gaps, using Molecular combing in our current setup, the fragments are efficiently detected. The distance may be adjusted depending on the specific technique and experimental conditions. For example, with less resolutive conditions, longer gaps (less than 5 kb each, representing in total less than 50% of the sequence) or overlaps may be tolerated, provided fragments separated by such gaps are still detected efficiently. Also, under such conditions, longer probes should be used (more than 20 kb) to allow for efficient detection. Under more resolutive conditions, gaps should be shorter (less than 2 kb) in order for the fragments to be efficiently detected, and probes may still be efficiently detected with shorter size (more than 10 kb). Long probes range in size from 12 kb to 150 kb.
[0150] In the present invention, the size of probes reflects the length of the genomic sequence where the probe hybridizes, independently of the number of strands in the DNA molecules. Therefore, a probe may be described as 1 kb (1 kilobase=1000 bases) or, indifferently, as 1000 bp (base pairs): in both cases, the probe hybridizes over 1000 bases of one of the strands of the target DNA molecule (and, if the probe is double stranded, also on the 1000 complementary bases of the other strand of the target molecule).
[0151] In the present invention, a "barcode" designates a specific motif formed by a set of probes labeled with different markers, where the motif characteristics are the lengths of the probes in the set, the lengths of the gaps separating successive probes and the colors in which the probes are detected (or, more generally, the markers with which the probes are labeled).
[0152] If a high coverage barcode is to be designed for high resolution, probe and space lengths need to be roughly in the 0.5 kb to 10 kb range (see above). This makes it unpractical to design probes that completely exclude rearrangements, and yet are spaced closely enough for the code to allow high location precision. On the other hand, some non-specific hybridization (i.e. hybridization of [parts of] a probe on genomic regions that are not the designed target of that probe) of a probe is acceptable when using a code strategy for the reading of signals. Indeed, in applications such as Southern blot where the hybridization of a single probe is assessed or aCGH where hybridization of every probe is considered separately, the non-specific hybridization of probes on even a very limited number of regions may lead to completely unusable results. To a lesser extent, this is also the case with multiple-probe applications such as FISH, since the resolution of FISH is insufficient to distinguish genomic regions as far apart as several tens of megabases: a single non-specific hybridization would lead to unusable results if it were located close enough to the targeted region.
[0153] In molecular combing and other similar applications using a code strategy, the quantity of non-specifically hybridized probes is not in issue per se. If a probe (or fragments of a probe) hybridizes even multiple times outside the region of interest, it is unlikely it will recreate a motif sufficiently similar to the code to be confusing. Also, non-specific hybridization over short sequences (<<1 kb), even within the region of interest, would most likely not be detected, unless they are sufficiently clustered to generate a long (>1 kb) stretch of non-specific hybridization. For the above reasons, the inventors have developed an alternative approach for the design of probes when the main issue is the design of a (several) high resolution code(s) in a (several) given genomic region(s). The main step of this approach relies only on the knowledge of the sequence of the region(s) themselves. When designing such a code, the major issue is to avoid significant non-specific hybridization within the regions of interest(s). Non-specific hybridization becomes an issue only if several probes display non-specific hybridization on neighboring sequences outside the region of interest. In the latter case, there is a risk that the pattern of probes resembles the original code, or a rearranged version of it, and this would likely lead to false conclusions. Although the invention described herein does not allow excluding such occurrences, this is relatively easily done once the method described herein has been used to exclude other non-specific hybridizations (see below).
[0154] The basis for this approach is the detection and exclusion of sequences that are repetitive within the region(s) of interest. For this, only the corresponding sequence(s) (the target sequence(s)) have to be known. One easy way to detect such repeats is the search for local sequence alignments within the target sequence(s), which can be done with e.g. a dot-plot comparison of each target sequence with itself and the other target sequences. A dot-plot is a graph with the two (sets of) sequences that are being compared forming the two axis, while dots are printed at every point where the coordinates correspond to a local homology. For example, if nucleotide x from sequence A (horizontal axis) matches nucleotide y from sequence B (vertical axis), then a dot will appear at the point with (x; y) coordinates. Graphically, local alignments appear as diagonal lines. Some more elaborate tools inspired from dot-plots are available, that compare short sequences ("words", typically a few nucleotides/tens of nucleotides long) rather than single nucleotides, and display dots in various shades of gray depending on the extent of homology, thus allowing a direct visual reading of relaxed homologies (non-specific hybridization may well appear with incomplete homology). The comparison may also be done directly on both strands for one of the sequences, so homologies appear for both sense and reverse complement orientations. An example of such a tool is "Dotter" (ref. 4).
[0155] With these tools, very frequent repetitive sequences, such as Alu sequences in the Human genome, appear quite clearly, as they have local homologies with numerous other sequences within the target regions. Therefore, stretches with a high frequency of these sequences appear as a gray band (horizontal or vertical depending on whether the stretch is located on the vertical or horizontal axis). The exact appearance of these stretches with dot-plot display tools will depend on settings, and possibly word size. Settings were selected such that sequence stretches longer than 200 bp with more than 80% homology appear clearly and can be located with a roughly 10 bp precision.
[0156] A sequence of 200 bp or more that contains more than 10 significant homologous sequences (less than 1, 2, 3, 4, 5, 10, 15 or 20% nucleotide mismatch or insertion/deletion) within the regions of interest is a frequent repetitive sequence, prone to generate significant non-specific hybridization. It is generally possible to design probes in such a way that they are void of these frequent repetitive sequences, thus increasing the specificity and the high resolution of the present technology compared to the published previous methods.
[0157] "Docking" Probes
[0158] Although, as shown above, shorter probes make for more precise localization of breakpoints and measurement of deleted or amplified sequences, they are, generally speaking, more difficult to detect with fiber-fish techniques and molecular combing, as they appear as shorter stretches of signal, i.e., they are both smaller and less easy to distinguish from noise (fluorescent spots either unrelated to probes or to hybridization of probes). This is particularly true when considering automatic (computer-based) detection of signals.
[0159] It is therefore desirable to include longer probes in the code (for example, more than 12 kb and less than 150 kb, preferably more than 14 kb and less than 40 kb, in particular for the detection of genetic rearrangements in the regions of MSH2 or MLH1 genes). These probes would appear as actual lines (rather than spots), readily distinguishable from noise and easily detectable due to their size. Once the signals of interest are detected, the detection of other probes located on the same DNA fiber is easier.
[0160] This is especially true using technologies such as Molecular Combing where the linearity of the fibers implies the other probes, if any, are located in the alignment of the first probe. Therefore, the invention provides that the inclusion of longer (>12 kb, preferably >14 kb) probes in the set of probes is a step towards easier detection of signals of interest. Not all probes in the set need to be that long: in a fast and "rough" detection step, the long probes are sought, which allows the localization of signals of interest. These probes are called "docking probes" as they allow to "land" on the regions of interest efficiently. In a second step, the shorter probes are sought in the neighborhood of the docking probes (and more specifically in the case of Molecular Combing or related technologies, in the alignment of these probes). Although when performed by a human operator these steps can hardly be formally executed consecutively, if an operator may limit his search to longer probes, he can browse through images more rapidly, which would only allow him to detect these probes and spend more time on images where a docking probe is seen in order to look for other shorter probes. As the longer docking probes would locally diminish the location precision and the resolution of the code, it is preferable for them not to be located in the region where rearrangements are sought. This is possible if the probes are located near, but not in, the region of interest, e.g. at either end of this region.
[0161] If it is desirable to only consider complete signals in the analysis of a given region (i.e. signals covering the entire contiguous region), these longer probes may also be used to assess the integrity of the region: if there is a probe located at each end and both probes are present, no breakage of the fiber has occurred during the DNA preparation or stretching step. In cases where several non contiguous regions are analyzed in a single test, obviously each region has to have its "docking" probes in order to be correctly detected.
[0162] Continuous Stretch of Short Probes
[0163] An alternative to the "docking probes" approach above is to design the set of probes in such a way that at least some groups of shorter probes form a continuous stretch of signal. This is possible if probe sequences are adjacent. In that case, several probes, although short enough (less than 10 kb) to provide for sufficient resolution, may well combine to form a long enough (more than 14 kb) signal for fast and reliable detection. Indeed, if the operator may combine color channels to view images, this stretch would still appear as a long line rather than a spot, allowing its distinction from background noise. This is possible by using either common optical setups such as tri-color filters in fluorescence microscopy, or by using common image viewing software. In the case of automatic detection, it is also possible to use combined color information and therefore to make use of the very characteristic aspect of a multicolor line relatively to background spot-like noise.
[0164] Measurements
[0165] The probe designs described above likely lead to a large number of probes to be measured in a test. The usual approach for probe measurement is to measure all of the probes constituting a signal, as well as the gaps separating them. In a test with a large number of probes, the amount of work required for analyzing results is increased. In order to balance this, the invention relates to a more efficient designed approach for signal measurement. This approach consists in the measurement of subgroups of probes constituting easily recognizable motifs. The subgroups are two or several consecutive probes and the gaps between them, and possibly gaps at either end, chosen in order for their total length to remain within reasonably precise measurement range (10-30 kb).
[0166] There is likely to be a systematic bias in the measurement of digitalized images of fluorescent segments. Indeed, at the extremity of such a fragment, the intensity of the signal decreases gradually when moving away from the center, to reach the level of the background. Depending on where the operator/the software sets the threshold for the determination of the actual end there may be a systematic over- or under-estimation of the lengths. This bias is compensated for if the measured motifs have a probe at one end and a gap at the other. Therefore, it is preferable to design motifs in this way.
[0167] If a motif is found to have an abnormal length (different from the expected theoretical length) in a given sample, it remains possible to measure the probes and gaps within this motif in order to further precise the location of the rearrangement. With this approach, it is possible to measure in a fast and efficient way all of the signals for initial screening, while keeping the location precision allowed by small probes. The somewhat lower precision on measurements due to the larger size of the subgroups compared to the probes is essentially compensated for by the higher number of signals that can be measured within the same operator time.
[0168] Application to HNPCC--Rationale
[0169] Colorectal cancer is the 4th most frequent form of cancer in human and around 5% of the cancer is considered as a hereditary form. The most frequent form of hereditary colorectal cancer is known as Lynch syndrome, or HNPCC (hereditary non-polyposis colorectal cancer). HNPCC increases a lifetime risk of cancer development in up to 80% (lifetime risk is around 7% in normal population US). HNPCC also increases other cancers (endometrial, ovarian, stomach).
[0170] Genetic aspect of HNPCC is known as a result of mutation in some of Mismatch Repair (MMR) genes such as MSH2, MLH1, MSH6, PMS2, etc. MSH2 and MLH1 mutation accounts for more than 80% of all mutation of MMR genes in HNPCC. Both point mutation and large rearrangements are reported in mutation of those genes, and especially high % of large mutation in MSH2 is observed because of high level of small repetitive element in its genetic sequence. Today the molecular diagnosis is done after studies of familial cancer history, tumor characterization by microsatellite instability test.
[0171] Normally mutation one alleles of one of MMR genes is sufficient for molecular diagnosis of HNPCC. All HNPCC individuals have both wild and mutated genes. Point mutation of targeted MMR genes can be detected by sequencing of genes and current sequencing test investigates only the sequence of exons. In case of large rearrangements such as deletion and amplification (loss and gain of genetic elements, respectively), sequencing does not detect them because altered sequences do not exist, and frequently primer binding regions for sequencing are deleted. As a result, sequence information comes from only wild allele and gives false negative. Indeed, MSH2 and MLH1 genes are higher percentage of repetitive elements of SINE in their genetic sequence. To address this large rearrangement, the test should detect presence of deletion or amplification in the MMR genes. One approach is cartography of MMR genes with designed probes of hybridization. Causal large rearrangement has a wide range from sub-kb to loss of total gene (up to 100 kb). A given cartography has to be sensitive to this wide dynamic range of mutation. To cope with it specific probe design was done for MSH2 and MLH1 loci.
[0172] The present invention is also related to the detection of known or unknown genomic rearrangements. It is also related to kits containing probes according to the invention, for the detection of known or unknown genomic rearrangements and the associated pathologies, or associated predispositions to pathologies such as cancers or cardiovascular diseases for example.
EXAMPLES
Application to HNPCC--Materials and Method
[0173] Probe Design v1
[0174] Each probe (probe means continuous hybridization signal, can consist of multiple cloned DNA fragments, e.g., probe 1 of MSH2-v2 covers a 15 kb stretch and consists of five cloned DNA fragments of 3 kb. Since gap or overlap of each junction of these five fragments are smaller than resolution (<50 bp), they are considered and indeed look like continuous single probe of 15 kb) on a region of gene sequence itself has a length between 3-6 kb. In case of larger rearrangement than probe or gap size, obvious change of color pattern of designed probe will be observed. As well as large rearrangement in probe region, such rearrangement is also detectable in gap region, meaning any rearrangement larger than 1 kb at any position in the target genes are detectable. This is a uniqueness of cartography method with high resolution probe hybridization. Other techniques (MLPA, aCGH) can detect only such rearrangement involving probe sequence. For genes with high frequency of large rearrangement such as MSH2 and MLH1, presence of repetitive element in their genetic sequence limits a freedom of probe design for the other technology. Inclusion of repetitive element sequence in their probe design increases false detection a lot, their probe designing has to be free of repetitive element in principle.
[0175] Probe sequence was chosen by a dot plot analysis. BAC clone sequence of each gene (RP11-1084A21 (Ch2:47,574,044-47,785,729 for MSH2, RP11-426N19 (Ch3: 36,992,516-37,161,490) for MLH1 was self-plotted and all grey bands region were excluded from the target region of PCR primer design. PCR primer set was designed in the target regions by Primer3plus PCR primer design tool (ref 6). A list of the primers' sequence is shown in table 1A and B. Exclusion of Alu repeat was verified by both dot-plot analysis and RepeatMasker (http://www._repeatmasker.org). FIG. 1B and FIG. 2B show a lot less grey band on dot-plot of probe fragment sequence on BAC clone than dot-plot of gene (containing Alu repeat) on BAC clone. This indicates that sequence of designed probes does not include recurrent repetitive sequence in this target regions. RepeatMasker analysis (with default setting of web server) also clearly shows a dramatic reduction of % of Alu sequence in designed probe sequence (table 2).
[0176] Probe Design v2
[0177] To facilitate "recognition" of barcodes on hybridization images, an alternative design of probe set (called v2) was done as said in "Docking" probe section. Design process is same as v1 except no exclusion of repetitive elements based on dot-plot. For v2 probe design, each probe was designed to have more than 3 kb length, close to limit to be recognized as "line", and all exon sequences are covered by a probe stretch (no exons fall in gaps). Docking probes were designed on both extremities of each gene with 15-20 kb length. For MSH2-v2 code, specific probes covering EPCAM gene (see rationale part) was also included between two docking probes. DNA sequence of designed code v2 was subjected to dot-plot analysis to make sure that there is no segmental repeats inside of designed region (FIGS. 1C and 2C).
[0178] Cloning of Probe Fragments and Labeling for Hybridization Probe
[0179] Each fragment of probes was amplified by PCR, then the fragment was ligated into plasmid vector (pNEB193, pCR2.1-TOPO, pCRXL-TOPO). The ligation product was transformed into E. coli competent cells and end-sequences of cloned fragment were verified. Purified plasmid DNA set of each gene was separated into two (v1) or three (v2) gropes according to colors corresponding to theoretical barcodes (FIG. 3A and FIG. 4A for v1, FIG. 5 and FIG. 6 for v2 probe sets). Each group of plasmid DNA was labeled by random priming method. Either whole plasmids containing probe fragments' sequence or PCR amplified probe fragments were used as a template for random priming. There are three haptens to be used for three color detection, biotin (Biot), digoxigenin (Dig) and Alexa Fluor 488 (A488). Biot-labeling was done by BioPrime DNA labeling system (Invitrogen) with manufacture's instruction. For Dig and A488 labeling, dNTP mixture in the kit was replaced with home-blend dNTP mixtures (either 0.1 mM Digoxigenin-11-dUTP (Roche applied science) for Dig labeling or 0.1 mM ChromaTide® Alexa Fluor® 488-7-OBEA-dCTP (Invitrogen) for A488 labeling, 0.1 mM unmodified equivalent (dTTP or dCTP) and 0.2 mM each of other three deoxynucleotides in final labeling reaction solution.).
[0180] Sample DNA Preparation
[0181] 3 cell human cell lines were used for validation for large rearrangement detection in either MSH2 or MLH1. Cell line GM17939 was used as non-mutated sample. Cell line LoVo was used for MSH2 rearrangement validation, which is homozygous for deletion of exon 3-exon8 in MSH2. Another cell line SK-OV-3 was used for rearrangement validation of MLH1, which was reported as homozygous deletion of exon 4-exon 19 in MLH1. For each cell line, cell culture was prepared according to cell bank's instruction. Cultured cells were harvested (for LoVo and SK-OV-3 when 50-70% confluency) or collected by centrifuge (for GM17939 when between 300,000-400,000 cells/ml of medium. Cell pellet was resuspended in 1×PBS/Trypsin mixture to have 1,000,000 cells in 45 μl the cell suspension was mixed with an equal volume of 1.2%(w/v) NuSieve GTG agarose solution in 1×PBS (melted and equilibrated at 50° C. in advance). The cell/agarose mixture as poured into a well of gel plug mold, followed by gelification at 4° C. for 30 min. the gelified agarose plug was immersed in a mixture of 2 mg/ml of Proteinase K, 1%(w/v) of sarcosyl in 0.5M EDTA (pH8.0, 250 μl for each plug). The agarose plug was incubated at 50° C. overnight.
[0182] Next day the incubated plug was washed in 1×TE (10 mM Tris-HCl, 1 mM EDTA, pH8.0) 3 times for 1 hour each. The DNA plug can be stored in 0.5mEDTA at 4° C. The washed plug was stained in 100 μl of 33 μM YOYO-1 (Invitrogen) in TE40.2 (40 mM Tris-HCl, 2 mM EDTA pH8.0) for 1 hour in the dark. The stained plug was heated at 68° C. in 1 ml of combing buffer (0.5M MES pH5.5) for 20 min, then cooled at 42° C. 10 min prior to add 1.5 unit of beta agarase I (NEB). Beta agarase treatment was carried overnight at 42° C. in the dark.
[0183] The following day the treated DNA solution was poured into a combing reservoir and a level of the solution in the reservoir was adjusted with additional combing buffer.
[0184] Molecular Combing
[0185] The DNA solution was set on a Molecular Combing Machine (MCS, Genomic Vision). Molecular combing was performed on a silanized coverslips (Combicoverslips, Genomic Vision). The combed coverslips was fixed at 68° C. for 4 hours, then used for hybridization (or stored at -20° C. until use).
[0186] Hybridization and Detection of Probe
[0187] For one hybridization, 5 μl of each of labeled probe solutions (of both MSH2 and MLH1) was combined together and with 10 μg of sonicated herring or salmon sperm DNA and 10 μg of human Cot1-DNA (only for V2 probe sets), then purified by standard ethanol precipitation. The precipitate was resuspended with 20 μl of hybridization buffer (50% formamide, 2×SSC, 1% SDS and BlockAid blocking solution (Invitrogen)). The resuspended probe solution was set on a clean glass slide and covered with a DNA combed coverslip. The slide was heated at 90° C. for 5 min for co-denaturation of both probe and combed DNA then incubated at 37° C. overnight with an humidity for hybridization between labeled probes and combed DNA.
[0188] The hybridized coverslips was washed in 50% Formamid/2×SSC solution 3 times for 5 min each, followed by another 3 times washing with 2×SSC for 5 min each. The washed coveslips was then developed with two or three layers of fluorescently labeled antibodies or streptavidin. For each layer, antibodies for all haptens were diluted 25 times in BlockAid blocking solution (20 μl in final volume) and incubated for 20 min at 37° C. For Biot, Streptavidin Alexa Fluor 594 (Invitrogen) was used for the 1st and the 3rd layer, biotin conjugated-goat anti-streptavidin antibody was used for the 2nd layer. Fr Dig, mouse anti-Digoxin AMCA conjugated (Jackson immunoresearch) was for the 1st layer, rat anti-mouse AMCA conjugated (Jackson immunoresearch) conjugated was for the 2nd, the goat anti-rat Alexa Fluore 350 conjugated (Invitrogen) was used for the 3rd layer. For A488, rabbit anti-Alexa Fluor 488 (Invitrogen) was used for the 1st layer, goat anti-rabbit Alexa Fluor 488 conjugated was used for the 2nd layer (no third antibody for A488). After 20 min incubation of each layer of antibody, the coverslip was washed in 2×SSC/1% Tween 20 washing solution 3 times for 5 min each at room temperature. After the washing of 3rd layer, the coverslip was rinsed in 1×PBS, followed by successive bath of 70, 90 and 100% ethanol for 1 min each. The coverslip was dried at room temperature prior to microscopy.
[0189] Signal Acquisition and Measurement
[0190] Fluorescent signal of developed antibody on the coverslip was obtained by standard epi-fluorescent microscope system or automated fluorescent microscope system (Image Xpress Micro, Molecular Devices) with custom scanning configuration for molecular combing signal. Every set of linearly aligned fluorescent signals and gaps was measured by ImageJ. Each measured set of signals (with color information) was subjected to pattern matching to determine position (if the set is a part of one of probe set) and orientation by comparison with the theoretical probe sets. All unclassified sets (did not match with any positions and orientations of theoretical probe sets) were subjected to similarity check between them to find whether recurrent abnormal pattern appears or not.
[0191] Application to HNPCC--Results
[0192] FIGS. 3B and 4B are representative images of signal from hybridized DNA. Some of probes look like "dot" rather than "line" as expected from their length. There are some "random" spots on images of hybridization, but these spots do not interfere recognition of designed code. Although signals of some small probes (arrowed in FIG. 3B, for example) is not evident to measure "length" of probe signals for size evaluation, measurement of "distance" between probe signals is possible and equivalent to measurement of the length of probe and gaps in normal probe set hybridization
[0193] FIGS. 5B and 6B are the representative image of hybridization signal of barcodes-v2. Fluorescent signals are more continuous than the signals of barcodes-v1, and easier to find docking probes and measure the length of each probe and gap. These barcodes-v2 were used to visualize large genomic rearrangements of characterized cancer cell lines, LoVo and SK-OV-3 (ref. 5).
[0194] FIG. 7 is a result of hybridization of barcodes v2 on combed DNA from LoVo cell line; LoVo cell line is homozygous for deletion in MSH2 (from exon 3 to 8). Hybridization slide had many normal (identical to theoretical code) signal of MLH1 gene but none of normal MSH2 signals. Instead, there was a recurrent signal of truncated form of the normal MSH2 signal (FIG. 7B). By deduction from the truncated signals, this truncation results from loss of probes and gaps corresponding to ex3 to 8 of MSH2 gene.
[0195] FIG. 8 is a result of barcodes-v2 on SK-OV-3 cell line DNA, homozygous for deletion in MLH1 (from ex4 to 19). Among many normal MSH2 signals, only a few signals of part of MLH1 (from probe 1 to probe 3) were observed. This means a lack of following sequence of MLH1, which is consistent with reference. Moreover, a lack of the right (downstream of MLH1) docking probe indicates that this deletion affects beyond exon 19 of MLH1.
[0196] The sequences selected to detect predisposition to colorectal cancer linked to rearrangements in the MSH2 genomic region or the MLH1 genomic region are preferably chosen among the following nucleotide sequences and their corresponding complementary sequences and are described as:
[0197] The short probes covering the MSH2 gene region and constituting contiguous stretches (PE1-2 and PE3-6 (SEQ ID NO:354-358); PE9 to PE15-16 (SEQ ID NO:365-373) in table 1 under the header MSH2-v2) and the other short probes covering MSH2 gene region (PE7 and PE8, SEQ ID NO:359-364 in table 1 under the header MSH2-v2); the long probes neighboring the MSH2 gene (tPP1, EPCAM5', EPCAM3' (SEQ ID NO:342-353) and cPP1 (SEQ ID NO:374-378) in table 1 under the header MSH2-v2); the short probes covering the MLH1 gene region and constituting a contiguous stretch (PE1-2 to PE10-11, SEQ ID NO:386-396, in table 1 under the header MLH1-v2) and the other short probes covering MLH1 gene region (PE12-13, PE14-15 and PE16-19, SEQ ID NO:397-401, in table 1 under the header MLH1-v2); the long probes neighboring the MLH1 gene (tPP1 (SEQ ID NO:379-385) and cPP1 (SEQ ID NO:402-408) in table 1 under the header MLH1-v2). For example, these probes may be obtained by amplification of the fragments using the primers listed in Table 1 under the headers MSH2-v2 (SEQ ID NO:139-212) and MLH1-v2 (SEQ ID NO:213-272).
INCORPORATION BY REFERENCE
[0198] Each document, patent, patent application or patent publication cited by or referred to in this disclosure is incorporated by reference in its entirety, especially with respect to the specific subject matter surrounding the citation of the reference in the text. However, no admission is made that any such reference constitutes background art and the right to challenge the accuracy and pertinence of the cited documents is reserved.
TABLE-US-00001 TABLE 1 Name SEQ ID SEQ ID of Name of NO NO probe fragment (fragment) For/Rev (primer) Sequence (5'-3') start end MSH2-v1 P1 P1a_MSH2-v1 273 forward 1 TTCTTCCCAAGAGAGCCAAG 47595911 47595930 reverse 2 CTGTTTTGGAACCCCAAGTC 47597074 47597093 P1b_MSH2-v1 274 forward 3 GGCTTCAATCTGGGACTACG 47598716 47598735 reverse 4 GCTGTCACCGCCTCTTTTAC 47599478 47599497 P1c_MSH2-v1 275 forward 5 GCCAGGCACTTAGGCAGTAG 47600433 47600452 reverse 6 TTGGTCCTGACATCCTTTCC 47601671 47601690 P1d_MSH2-v1 276 forward 7 TTAGTTGAACAGGGCATGACAC 47602097 47602118 reverse 8 GGTAAAGGGGCCTGATGTC 47602743 47602761 P1e_MSH2-v1 277 forward 9 GAGCCTTGATGTTCCCTCTTAAC 47603695 47602743 reverse 10 ACCCAGATCCGAAACTGTTG 47604324 47603717 P1f_MSH2-v1 278 forward 11 CCGGCCTTACCTTTCATTTC 47605735 47604343 reverse 12 CCAGGATCCAGATCCAGTTG 47606965 47606984 P2 P2a_MSH2-v1 279 forward 13 GAGTTCCATGGCAGATCACC 47612521 47612540 reverse 14 GCAGCTTTCAATCACAAATCAG 47614067 47614088 P2b_MSH2-v1 280 forward 15 GAAGGGTTGGTCTTGCTGTC 47615115 47615134 reverse 16 ACCCTTTGCACCTCTCTGTG 47615632 47615651 P2c_MSH2-v1 281 forward 17 CCCGGTGTTGAATCATTTG 47616079 47616097 reverse 18 TTCAGCCCTGAAGGTAGAGG 47617513 47617532 P2d_MSH2-v1 282 forward 19 CTGGCCACTTTTTGGAAGAG 47618884 47618903 reverse 20 TGGGACGCAGAGTGATACAG 47619394 47619413 P3 P3a_MSH2-v1 283 forward 21 TTACTGGCGATCCTCAGAGC 47629651 47629670 reverse 22 AACGCCTCTTCCGTTGTATG 47631623 47631642 P3b_MSH2-v1 284 forward 23 GAAAGGACAGACCAAGTGCAG 47632605 47632625 reverse 24 AGCCTGTGCAGGGAAACTC 47633083 47633101 P3c_MSH2-v1 285 forward 25 AGTGGGATGCAGCTGAAAAG 47633591 47633610 reverse 26 CAACAGCATGGGAAAGATCC 47635238 47635257 P4 P4a_MSH2-v1 286 forward 27 TTGAAAGTTGGTCTTAGGAAGAGG 47643286 47643309 reverse 28 CCCAACAAACCTGGCTTTAG 47644179 47644198 P4b_MSH2-v1 287 forward 29 AGACGCCCAAAATCAACAAC 47645155 47645174 reverse 30 CCGCTTGCTGCTAAAAATTG 47646042 47646061 P5 P5a_MSH2-v1 288 forward 31 TGATTGCCAAGGAAGATTCAC 47657647 47657667 reverse 32 TGGAAGTAAATGCAGGTGCTC 47658763 47658783 P5b_MSH2-v1 289 forward 33 TCATTCTTGGGTGTTTCTCG 47659578 47659597 reverse 34 ATGGCGGTTTTGTGGAATAG 47660015 47660034 P5c_MSH2-v1 290 forward 35 GAGGGAGAGGGAACCTTTTG 47661699 47661718 reverse 36 GGGGACTATACCGCATTCAC 47662243 47662262 P6 P6a_MSH2-v1 291 forward 37 TGTTGATTCATGGGCATTTG 47669651 47669670 reverse 38 GCTGGGGAATCATGTATGAAG 47671879 47671899 P6b_MSH2-v1 292 forward 39 CATCAAGCACAGTTCCATTG 47672243 47672262 reverse 40 TTCTCTTTCCGTTTCCAGTG 47673113 47673132 P7 P7a_MSH2-v1 293 forward 41 GGAGCTTGGGAATTCAACTG 47678126 47678145 reverse 42 AGAAACGGGCATGTCATAGG 47679330 47679349 P7b_MSH2-v1 294 forward 43 CAGCCTACGTGCCCATTTC 47679649 47679667 reverse 44 TCAAAAGATGGCCAAAATGC 47681179 47681198 P7c_MSH2-v1 295 forward 45 GTGTTGCACCCATTAACTCG 47681915 47681934 reverse 46 AGCCTGGTGAGAGGTGACTG 47684723 47684742 P8 P8a_MSH2-v1 296 forward 47 CACGATGCCAGTCCAATTC 47689478 47689496 reverse 48 AAGGTGGACTTTAATGCAAAGG 47690835 47690856 P8b_MSH2-v1 297 forward 49 GGAGTGAGAGCGACACCTTG 47691634 47691653 reverse 50 CGACAGCTGACTGCTCTATGG 47694068 47694088 P9 P9a_MSH2-v1 298 forward 51 CACAATGGGAAAGGATGTAGC 47701939 47701959 reverse 52 CAGAGAAAAACACCCATGACC 47704112 47704132 P9b_MSH2-v1 299 forward 53 CACCGTGATCCTCCTTATTTC 47704395 47704415 reverse 54 GAACAAACAACGGATGAAAGG 47704945 47704965 P9c_MSH2-v1 300 forward 55 GTGGCATATCCTTCCCAATG 47705311 47705330 reverse 56 CCCCCAGACTGTGAATTAAGG 47705787 47705807 P10 P10a_MSH2-v1 301 forward 57 GATGCAGATCAGGGAAATGC 47711630 47711649 reverse 58 ATCTTGCTGGATGGACAAGG 47715272 47715291 P10b_MSH2-v1 302 forward 59 CTTAATCCTGAAAGGCAGGTG 47715788 47715808 reverse 60 TGTTTCTCAGGCAACCACAG 47717266 47717285 P11 P11a_MSH2-v1 303 forward 61 GAAACCACAGAATCGCCTTC 47731087 47731106 reverse 62 ACCTGGACAGTCCCACAGAC 47733482 47733501 P11b_MSH2-v1 304 forward 63 CAGTGCTTTTGCATCCTTCC 47734903 47734922 reverse 64 ATTTAATCCCCTGGCCAATC 47741649 47741668 P11c_MSH2-v1 305 forward 65 CACCTGTGCCCATCACATAG 47742239 47742258 reverse 66 GAGTCCCCTCTTGGAGAACC 47747829 47747848 P12 P12a_MSH2-v1 306 forward 67 AAAGCCATTTCCAGTGTCG 47753989 47754007 reverse 68 ATTGTGCAGCCAGAATTGAG 47758158 47758177 P12b_MSH2-v1 307 forward 69 TTCACAGCAAAGTGGCTCAG 47760593 47760612 reverse 70 GCTATTATGGGCTGCAAAGC 47764302 47764321 P12c_MSH2-v1 308 forward 71 TTCACTCCCAACAAGCACTG 47764863 47764882 reverse 72 TGCCCAGTCCTTTTTCACT 47765618 47765636 P12d_MSH2-v1 309 forward 73 AATCCCTCCTGCACACTTTC 47765925 47765944 reverse 74 AATGGATGCTTCCACTGTCC 47767687 47767706 P12e_MSH2-v1 310 forward 75 CCATCTGTGCAATTCCTTCC 47768105 47768124 reverse 76 GTTCAAAGGCAGAAGCCATC 47769886 47769905 MLH1-v1 P1 P1a_MLH1-v1 311 forward 77 GTCTGGATTCTTTCACAATGTAGC 37005551 37005576 reverse 78 TGCCAATCTTCTCCTCTGTTC 37006562 37006582 P1b_MLH1-v1 312 forward 79 AACCACCCAATGTGTTCACC 37006815 37006836 reverse 80 GTTCATTCCTGCGAGTAGGC 37007422 37007441 P1c_MLH1-v1 313 forward 81 GCCAAAGGTGGAAAATGTTG 37008987 37009008 reverse 82 GCCTTCTTCATGAAAGCACTG 37009873 37009893 P1d_MLH1-v1 314 forward 83 CCAGAAGGTGGAAGCTACAG 37011079 37011100 reverse 84 TGGGGTCAATGAAGCAAG 37011830 37011847 P1e_MLH1-v1 315 forward 85 ACATCGACCCAGAAAGTTCC 37012314 37012335 reverse 86 AATGTGCTTCGTACCACTGC 37012867 37012886 P1f_MLH1-v1 316 forward 87 AGCGTGCCATTGTACTCTCC 37013822 37013843 reverse 88 TTTCTGAGCCCATGATTTCC 37015267 37015286 P2 P2a_MLH1-v1 317 forward 89 GTGCCCAGCTAGTTCCATTC 37023623 37023644 reverse 90 TCAAGAGCGCTAATCCCATC 37025002 37025021 P2b_MLH1-v1 318 forward 91 TGCACATGCTCACTGAAAGAC 37026505 37026527 reverse 92 TTTTGCCTGCAAACTGACC 37027818 37027836 P2c_MLH1-v1 319 forward 93 CAGCAAGCACCAAATCACTG 37028305 37028326 reverse 94 AGTACCAGCCGTCCAAACTG 37032621 37032640 P3 P3a_MLH1-v1 320 forward 95 CCTGGCCAGAAAATTCATTG 37037607 37037628 reverse 96 ACCCTGCATTCCAAACTCAC 37039199 37039218 P3b_MLH1-v1 321 forward 97 GCAGTCCTTTGAGGATTTAGC 37042493 37042515 reverse 98 GAAAGATATCCAACAGGAAGTGAG 37043300 37043323 P3c_MLH1-v1 322 forward 99 TGGCCTTGTTTAAGGTCCTG 37043746 37043767 reverse 100 ATGGTCCTGCTGCTTCAGAG 37044723 37044742 P3d_MLH1-v1 323 forward 101 ACCCCGTCATAGCACAGTTC 37045295 37045316 reverse 102 CAAAGGCCATTCATCAGTTTC 37046439 37046459 P4 P4a_MLH1-v1 324 forward 103 GTGGCGTGATATCCTTGATTC 37053034 37053056 reverse 104 CTCTGGAATGACTGCTGCTG 37054289 37054308 P4b_MLH1-v1 325 forward 105 TGTGCTAGATGCCTCACTGG 37055182 37055203 reverse 106 TTGCCAAGAAGCACAACAAG 37058326 37058345 P5 P5a1_MLH1-v1 326 forward 107 CGGAGGCTCTACTGTTGGAC 37062345 37062366 reverse 108 TGCTGTCCACTCTGGAACTG 37064753 37064772 P5b_MLH1-v1 327 forward 109 ACATCAGAAGCCCTGGTTTG 37064571 37064592 reverse 110 GCTGGGAGTTCAAGCATCTC 37067377 37067396 P6 P6a_MLH1-v1 328 forward 111 TCGGTCTCAGTCACCATTTG 37072097 37072118 reverse 112 AACGCACCTGGCTGAAATAC 37075920 37075939 P7 P7a_MLH1-v1 329 forward 113 TGAACCTGCAATATCTCAGAGG 37079607 37079630 reverse 114 CTTACCGATAACCTGAGAACACC 37083805 37083827 P8 P8a_MLH1-v1 330 forward 115 CCCAGCCCATATATTTTAAAGC 37088387 37088410 reverse 116 CCAGCCACTCTCTGGACTATC 37089049 37089069 P8b_MLH1-v1 331 forward 117 GACATGGAGAGCCGAATCC 37089669 37089689 reverse 118 CCATTAAAATCGGGTCTGAAAG 37091446 37091467 P8c_MLH1-v1 332 forward 119 TCCAGACCCAGTGCACATC 37091887 37091907 reverse 120 CATGGTCAGTGCCATCAGAG 37092412 37092431 P8d_MLH1-v1 333 forward 121 AGCCTCCCAAAGTTAAGTGC 37092788 37092809 reverse 122 CCCAGCTAAAACCAACACAC 37093346 37093365 P9 P9a_MLH1-v1 334 forward 123 TGCCCTCAGCTACTCACTCC 37103285 37103306 reverse 124 AGGGCTCAGCCTTTAGGAAC 37105620 37105639 P9b_MLH1-v1 335 forward 125 GCCAGACTCTCGTTCCATTC 37106390 37106411 reverse 126 ACTCCCCATTCAGTCCCTTC 37111053 37111072 P9c_MLH1-v1 336 forward 127 AGGCACAACGTCAGGTTTTC 37114109 37114130 reverse 128 TTGGAATTTGTCCTGGTGTG 37117519 37117538 P10 P10a_MLH1-v1 337 forward 129 CACCATTGCCAACACTTCTG 37132898 37132919 reverse 130 GCCATTGGTTTGAAGGTGAC 37134201 37134220 P10b_MLH1-v1 338 forward 131 CTTAGTCACCGCCTGTCCTC 37134738 37134759 reverse 132 TAGCTGCATGTGGCTAATCG 37136986 37137005 P10c_MLH1-v1 339 forward 133 TGTGGCTCGCATTACATTTC 37137579 37137600 reverse 134 CGCTGTCATTACCTGCTTTG 37139742 37139761 P10d_MLH1-v1 340 forward 135 TGACCTCCAAAATCATCCAG 37140449 37140470 reverse 136 TTCTGAGCTAGGAGGTGCTG 37141321 37141340 P10e_MLH1-v1 341 forward 137 CCAGATTTGTAAATCCCTGTTC 37142008 37142031 reverse 138 TGTGTGGTTCTTAAGCATTCC 37142420 37142440 MSH2-v2 tPP1 tPP1a_MSH2-v2 342 forward 139 CTCAGTCCATCAGCCTCCTC 47574824 47577784 reverse 140 TGCTGTGCCCTGAGATTAAG 47574823 47577783 tPP1b_MSH2-v2 343 forward 141 AACTTAATCTCAGGGCACAGC 47577763 47580677 reverse 142 TGCAGCTTCAGCCTCTTG 47577762 47580676 tPP1c_MSH2-v2 344 forward 143 GCGTGGTGTTTCGTACCAG 47580604 47583785 reverse 144 GCTACTGGCCAGAAATCTTCC 47580603 47583784 tPP1d_MSH2-v2 345 forward 145 GCCCAGCCCTACTAAGGAAG 47583750 47586723 reverse 146 CTGTGCTCCCCTGCTAGAAC 47583749 47586722 tPP1e_MSH2-v2 346 forward 147 GTCGTCCTCTTCGACCTAGC 47586769 47589967 reverse 148 CAGCGCCTATTCTACAGCAG 47586768 47589966 EPCAM5' EPCa_MSH2-v2 347 forward 149 TTCTTCCCAAGAGAGCCAAG 47595912 47598965 reverse 150 CCACCTTTAATCTGCCCAAC 47595911 47598964 EPCb_MSH2-v2 348 forward 151 GTGTTGGGCAGATTAAAGGTG 47598944 47602122 reverse 152 GCAGTGTCATGCCCTGTTC 47598943 47602121 EPCc_MSH2-v2 349 forward 153 CTCTTTGTGCCCTTTCTTTTG 47601745 47604931 reverse 154 AGTTCCTTAAAGCAGAGAAGATGG 47601744 47604930 EPCAM3' EPCd_MSH2-v2 350 forward 155 AACCTGTCCCTGTGGATGAG 47604796 47607923 reverse 156 CCGAAGCATCCTTACATTCC 47604795 47607922 EPCe_MSH2-v2 351 forward 157 AATACCTGAACCCCCAAACC 47607722 47609876 reverse 158 CTCAGGCTATTTTCCAGATTCAC 47607721 47609875 EPCf_MSH2-v2 352 forward 159 GCATGCCTGTCATTCTGG 47609695 47612812 reverse 160 TCCAAGGGACTGAAACACAC 47609694 47612811 EPCg_MSH2-v2 353 forward 161 TTAGTGTGTTTCAGTCCCTTGG 47612790 47615135 reverse 162 GACAGCAAGACCAACCCTTC 47612789 47615134 PE1-2 E1_MSH2-v2 354 forward 163 GCACATTACGAGCTCAGTGC 47629942 47633045 reverse 164 CTACCAGGAGAACAGCACAGG 47629941 47633044 E2_MSH2-v2 355 forward 165 TGGGTTAGCATTGTGTTAGGTG 47632899 47636029 reverse 166 CCACAGGTGTGTGCCAATAG 47632898 47636028 PE3-6 E3_MSH2-v2 356 forward 167 AAGTTGCAGTTTGGCTGGTC 47635845 47638929 reverse 168 TTATCTCCAGCGGTGCTTATG 47635844 47638928 E4_MSH2-v2 357 forward 169 TACCATAAGCACCGCTGGAG 47638906 47642053 reverse 170 ACTCCACCAAGCCCAGTCTC 47638905 47642052 E5-6_MSH2-v2 358 forward 171 TTTAGAGACTGGGCTTGGTG 47642030 47644205 reverse 172 CTCTTCCCCAACAAACCTG 47642029 47644204 PE7 I6-7_MSH2-v2 359 forward 173 CCCAGTTTCAAGCGATTAAG 47651443 47654570 reverse 174 AGGAAAAGCATGTTATCTCCAG 47651442 47654569 E7_MSH2-v2 360 forward 175 TTCCGTAGCAGTAGGCATCC 47654026 47657170 reverse 176 TCACCACCACCAACTTTATGAG 47654025 47657169 I7-8_MSH2-v2 361 forward 177 TCCCAGATCTTAACCGACTTG 47656956 47660035 reverse 178 ATGGCGGTTTTGTGGAATAG 47656955 47660034 PE8 E8_MSH2-v2 362 forward 179 CCCAAACAACAGCATTAGCC 47670887 47673915 reverse 180 ACATCAGCCTCGGGACAAG 47670886 47673914 I8-9a_MSH2-v2 363 forward 181 TGAGCCCGTTGAATATAGTGG 47673830 47675514 reverse 182 AGTTTTCCTAAACGGGATGATG 47673829 47675513 I8-9b_MSH2-v2 364 forward 183 ATGGGTGTGCACGTGTGTAG 47675368 47678365 reverse 184 GCCATGTGCAATTGTGAGTC 47675367 47678364 PE9 E9_MSH2-v2 365 forward 185 CCTTGCATAGTTTGCTTCTGG 47688375 47690450 reverse 186 ATCATACAAGGGCCTGTTGG 47688374 47690449 I9-10_MSH2-v2 366 forward 187 AAACAGAAATCGCCCAACAG 47690418 47692377 reverse 188 TAGAGACCCACCCAGAAACG 47690417 47692376 PE10 E10_MSH2-v2 367 forward 189 CAGTCCGATTTCGTTTCTGG 47692347 47695506 reverse 190 CACACCTAGATTTGGCAATGG 47692346 47695505 PE11 E11_MSH2-v2 368 forward 191 TTCCATTGCCAAATCTAGGTG 47695484 47698468 reverse 192 GGCCCTAGTGTTTCCTTTCC 47695483 47698467 I11-12_MSH2-v2 369 forward 193 AAGGAAACACTAGGGCCTACAAC 47698452 47700589 reverse 194 CCTGGCCTCAGTACACTTTTG 47698451 47700588 PE12-14 E12_MSH2-v2 370 forward 195 AGGGATTCTCCCCACTTAGC 47700228 47702718 reverse 196 ATTGGAGGACTGGCTCAAAG 47700227 47702718 E13-14_MSH2-v2 371 forward 197 GCTTACCTTTGAGCCAGTCC 47702694 47705819 reverse 198 ACATGTTCCTACCCCCAGAC 47702693 47705818 PE15-16 E15_MSH2-v2 372 forward 199 TTTCTGCATCAGTTGGTTGC 47706613 47709532 reverse 200 GCCAAGTTATTGCTGCTTCAG 47706612 47709531 E16_MSH2-v2 373 forward 201 AGCCCTGTGAGGTTGGTAAC 47709413 47712504
reverse 202 TCAACAACAGCTGGAACTGC 47709412 47712503 cPP1 cPP1a_MSH2-v2 374 forward 203 CCTCTCAGGTCAGGCTTCTG 47730898 47733882 reverse 204 GCTCCCGCTAGAGAAACTCC 47730897 47733881 cPP1b_MSH2-v2 375 forward 205 GAGCGAAGCACCTAAAGCAC 47733879 47736946 reverse 206 AATTGGAGGGGGTGGAGTAG 47733878 47736945 cPP1c_MSH2-v2 376 forward 207 TGTCACCCAGTCAGGTCATC 47736760 47739876 reverse 208 TTGGAAGGAATCCAACAAGG 47736759 47739875 cPP1d_MSH2-v2 377 forward 209 TTCCCAGAACTCCTTGTTGG 47739846 47742962 reverse 210 TGCAAACCCCTTCTTTTCAG 47739845 47742961 cPP1e_MSH2-v2 378 forward 211 ACCCCATGCAGAAGCAATAG 47743027 47746218 reverse 212 AAATCCTGAAGGTGGGTTCC 47743026 47746217 MLH1v2 tPP1 tPP1b_MLH1-v2 379 forward 213 AGTTTCAGCCATGTTGCAG 37005587 37005605 reverse 214 TTGGCAAAATTGTGACTGAG 37007511 37007530 tPP1c_MLH1-v2 380 forward 215 CAGTCACAATTTTGCCAAGG 37007513 37007532 reverse 216 AGTTCGTGGCATCTAACTATCG 37009688 37009709 tPP1d_MLH1-v2 381 forward 217 GGTCCATGTGCTCCAAAAAG 37009460 37009479 reverse 218 TCCAAAACTGGGAACAAACC 37012624 37012643 tPP1e_MLH1-v2 382 forward 219 TGGTTTGTTCCCAGTTTTGG 37012623 37012642 reverse 220 TAGTGCACCACAGCCTCAAG 37015706 37015725 tPP1f_MLH1-v2 383 forward 221 GGATCACTTGAGGCTGTGGT 37015700 37015719 reverse 222 TCCAACAACTGCTGTGAAGG 37018677 37018696 tPP1g_MLH1-v2 384 forward 223 CACCACTGACCTTCCCTTCC 37018492 37018511 reverse 224 GCACAGAAAGACAAATATCACATGC 37020534 37020558 tPP1h_MLH1-v2 385 forward 225 CTCTTCCTCGTCTCCTCCTG 37020430 37020449 reverse 226 CCAATTCAATGCAAAACCTG 37022464 37022483 PE1-2 E1_MLH1-v2 386 forward 227 CGAGCAGCTCTCTCTTCAGG 37034273 37034292 reverse 228 AGCCTATAAGCACAGACCAACTG 37037250 37037272 E2_MLH1-v2 387 forward 229 TTCTCTAGCAGTTGGTCTGTGC 37037242 37037263 reverse 230 ACCCTGCATTCCAAACTCAC 37039199 37039218 PE3-4 I23_MLH1-v2 388 forward 231 GTTCATTTTGGGGCATGTTC 37039148 37039167 reverse 232 CTGCAACCTCCTTTGAGACAG 37042218 37042238 E3_MLH1-v2 389 forward 233 TGTCTCAAAGGAGGTTGCAG 37042219 37042238 reverse 234 CCAAAATGAAACTGCCTTCC 37044367 37044386 E4_MLH1-v2 390 forward 235 AGTTCCCTGGGTCATTTTCC 37044393 37044412 reverse 236 TTGTGGGAAGGCAAACTAGC 37046381 37046400 PE5-6 E5_MLH1-v2 391 forward 237 CCTGTGCTAGTTTGCCTTCC 37046376 37046395 reverse 238 GGTGGTCACCGTGGTAAAAG 37049553 37049572 E6_MLH1-v2 392 forward 239 GACCACCATGTGATTTCCAAG 37049566 37049586 reverse 240 TTGGTTGGCGGTTATTTCTC 37052510 37052529 PE7-9 E7-8_MLH1-v2 393 forward 241 TAACCGCCAACCAAGAAAAG 37052516 37052535 reverse 242 TGTCTGGAGACCTTCCCAAG 37055360 37055379 E9_MLH1-v2 394 forward 243 TGTGCTAGATGCCTCACTGG 37055182 37055201 reverse 244 ACTTGCCTACATTGCCCATC 37058175 37058194 PE10-11 E10_MLH1-v2 395 forward 245 ATGGGCAATGTAGGCAAGTC 37058176 37058195 reverse 246 TCTGCAGCCATGAATAAGTCC 37061070 37061090 E11_MLH1-v2 396 forward 247 CAGAGCTGAGGCGATAAATTG 37060960 37060980 reverse 248 TGCTCCTCTCCAATCCATTC 37063973 37063992 PE12-13 E12_MLH1-v2 397 forward 249 ATACTTTCCCAGCCCAAACC 37066434 37066453 reverse 250 TGATGGGGAAATGAGAGGAG 37069438 37069457 E13_MLH1-v2 398 forward 251 AGTGGCCTTTGTCCATTGAG 37069405 37069424 reverse 252 GACAGAGGTGAGAGCCTAGGAG 37071540 37071561 PE14-15 E14-15_MLH1-v2 399 forward 253 AATGTGTTGGGGAAGTGGTC 37081262 37081281 reverse 254 TTTGGACCACGGCTTTAGAC 37084405 37084424 PE16-19 E16-18_MLH1-v2 400 forward 255 AAGCTGAGGTCACGGATTTG 37087522 37087541 reverse 256 GATGGGCAAGTTTCATCTCC 37090568 37090587 E19_MLH1-v2 401 forward 257 TGGGACGAAGAAAAGGAATG 37090401 37090420 reverse 258 CACCGTGCCTCAGCCTATAC 37093446 37093465 cPP1 cPP1a_MLH1-v2 402 forward 259 GGACTAACCCACCTCCCTTC 37103239 37103258 reverse 260 GCTATAGGCAGCCCAGAGTG 37106372 37106391 cPP2a_MLH1-v2 403 forward 261 GCCAGACTCTCGTTCCATTC 37106390 37106409 reverse 262 AGGATTTGCCGTATGGACTC 37109450 37109469 cPP3a_MLH1-v2 404 forward 263 TCGCCCAAAGTCACAGTAAG 37109303 37109322 reverse 264 GATCTGTAGGCCCAGGATTTC 37112356 37112376 cPP4a_MLH1-v2 405 forward 265 AGGGGTTTCTATGGCTGGTC 37112314 37112333 reverse 266 CCTCCCTCAAACCTCCTCTC 37114423 37114442 cPP5a_MLH1-v2 406 forward 267 TTCTCCTGCAGAGGAAGAGG 37114369 37114388 reverse 268 TTGGAATTTGTCCTGGTGTG 37117519 37117538 cPP6a_MLH1-v2 407 forward 269 AAAGCCAGGGAGTGAATGG 37117566 37117584 reverse 270 ATGTGCATCTCCCTGGTGAC 37120703 37120722 cPP7a_MLH1-v2 408 forward 271 TGTGGGGAAATCAAAACCTG 37120784 37120803 reverse 272 GGGTAGACTGTGCGTGTGTG 37123930 37123949
TABLE-US-00002 TABLE 2 MLH1-v2 MLH1-v1 MLH1 MSH2-V2 MSH2-V1 MSH2 probe probe region probe probe region sum length 86366 55582 121536 106534 73609 171394 bp repeat 44684 18525 64712 53243 22133 94584 bp length total repeat 51.74 33.33 53.25 49.98 30.07 55.19 % SINE 24.93 2.58 23.85 34.68 5.03 35.95 % ALUs 22.38 0.09 21.85 32.85 0.76 34.15 %
REFERENCES
[0199] 1. "Gene copy number variation and common human disease", Fanciulli, et. al. Clinical Genetics, 2010 77, 201-213
[0200] 2. "Dynamic molecular combing: stretching the whole human genome for high-resolution studies" Michalet, et al., Science 1997 277, 1518-1523 and "Bar code screening on combed DNA for large rearrangemens of the BRCA1 and BRCA2 gene in French breast cancer families", Gad, et. al., J. Medical Genetics, 2002, 39, 817-821
[0201] 3. "Sequence-based design of single-copy genomic DNA probes for fluorescence in situ hybridization" Rogan, et. al., Genome Res. 2001 11, 1086-94.
[0202] 4. "A dot-matrix program with dynamic threshold control suited for genomic DNA and protein sequence analysis". Erik L. L. Sonnhammer and Richard Durbin. Gene 1995, 167:GC1-10
[0203] 5. "Microsatellite instability, mismatch repair deficiency and genetic defects in human cancer cel lines", Boyer J. C., et al. Cancer Research 1995 55, 6063-6070,
[0204] 6. "Primer3Plus, an enhanced web interface to Primer3", Untergasser A., et al. Nucleic Acids Research 2007 35, W71-W74
Sequence CWU
1
1
408120DNAHomo sapiens 1ttcttcccaa gagagccaag
20220DNAHomo sapiens 2ctgttttgga accccaagtc
20320DNAHomo sapiens 3ggcttcaatc
tgggactacg 20420DNAHomo
sapiens 4gctgtcaccg cctcttttac
20520DNAHomo sapiens 5gccaggcact taggcagtag
20620DNAHomo sapiens 6ttggtcctga catcctttcc
20722DNAHomo sapiens 7ttagttgaac
agggcatgac ac 22819DNAHomo
sapiens 8ggtaaagggg cctgatgtc
19923DNAHomo sapiens 9gagccttgat gttccctctt aac
231020DNAHomo sapiens 10acccagatcc gaaactgttg
201120DNAHomo sapiens
11ccggccttac ctttcatttc
201220DNAHomo sapiens 12ccaggatcca gatccagttg
201320DNAHomo sapiens 13gagttccatg gcagatcacc
201422DNAHomo sapiens
14gcagctttca atcacaaatc ag
221520DNAHomo sapiens 15gaagggttgg tcttgctgtc
201620DNAHomo sapiens 16accctttgca cctctctgtg
201719DNAHomo sapiens
17cccggtgttg aatcatttg
191820DNAHomo sapiens 18ttcagccctg aaggtagagg
201920DNAHomo sapiens 19ctggccactt tttggaagag
202020DNAHomo sapiens
20tgggacgcag agtgatacag
202120DNAHomo sapiens 21ttactggcga tcctcagagc
202220DNAHomo sapiens 22aacgcctctt ccgttgtatg
202321DNAHomo sapiens
23gaaaggacag accaagtgca g
212419DNAHomo sapiens 24agcctgtgca gggaaactc
192520DNAHomo sapiens 25agtgggatgc agctgaaaag
202620DNAHomo sapiens
26caacagcatg ggaaagatcc
202724DNAHomo sapiens 27ttgaaagttg gtcttaggaa gagg
242820DNAHomo sapiens 28cccaacaaac ctggctttag
202920DNAHomo sapiens
29agacgcccaa aatcaacaac
203020DNAHomo sapiens 30ccgcttgctg ctaaaaattg
203121DNAHomo sapiens 31tgattgccaa ggaagattca c
213221DNAHomo sapiens
32tggaagtaaa tgcaggtgct c
213320DNAHomo sapiens 33tcattcttgg gtgtttctcg
203420DNAHomo sapiens 34atggcggttt tgtggaatag
203520DNAHomo sapiens
35gagggagagg gaaccttttg
203620DNAHomo sapiens 36ggggactata ccgcattcac
203720DNAHomo sapiens 37tgttgattca tgggcatttg
203821DNAHomo sapiens
38gctggggaat catgtatgaa g
213920DNAHomo sapiens 39catcaagcac agttccattg
204020DNAHomo sapiens 40ttctctttcc gtttccagtg
204120DNAHomo sapiens
41ggagcttggg aattcaactg
204220DNAHomo sapiens 42agaaacgggc atgtcatagg
204319DNAHomo sapiens 43cagcctacgt gcccatttc
194420DNAHomo sapiens
44tcaaaagatg gccaaaatgc
204520DNAHomo sapiens 45gtgttgcacc cattaactcg
204620DNAHomo sapiens 46agcctggtga gaggtgactg
204719DNAHomo sapiens
47cacgatgcca gtccaattc
194822DNAHomo sapiens 48aaggtggact ttaatgcaaa gg
224920DNAHomo sapiens 49ggagtgagag cgacaccttg
205021DNAHomo sapiens
50cgacagctga ctgctctatg g
215121DNAHomo sapiens 51cacaatggga aaggatgtag c
215221DNAHomo sapiens 52cagagaaaaa cacccatgac c
215321DNAHomo sapiens
53caccgtgatc ctccttattt c
215421DNAHomo sapiens 54gaacaaacaa cggatgaaag g
215520DNAHomo sapiens 55gtggcatatc cttcccaatg
205621DNAHomo sapiens
56cccccagact gtgaattaag g
215720DNAHomo sapiens 57gatgcagatc agggaaatgc
205820DNAHomo sapiens 58atcttgctgg atggacaagg
205921DNAHomo sapiens
59cttaatcctg aaaggcaggt g
216020DNAHomo sapiens 60tgtttctcag gcaaccacag
206120DNAHomo sapiens 61gaaaccacag aatcgccttc
206220DNAHomo sapiens
62acctggacag tcccacagac
206320DNAHomo sapiens 63cagtgctttt gcatccttcc
206420DNAHomo sapiens 64atttaatccc ctggccaatc
206520DNAHomo sapiens
65cacctgtgcc catcacatag
206620DNAHomo sapiens 66gagtcccctc ttggagaacc
206719DNAHomo sapiens 67aaagccattt ccagtgtcg
196820DNAHomo sapiens
68attgtgcagc cagaattgag
206920DNAHomo sapiens 69ttcacagcaa agtggctcag
207020DNAHomo sapiens 70gctattatgg gctgcaaagc
207120DNAHomo sapiens
71ttcactccca acaagcactg
207219DNAHomo sapiens 72tgcccagtcc tttttcact
197320DNAHomo sapiens 73aatccctcct gcacactttc
207420DNAHomo sapiens
74aatggatgct tccactgtcc
207520DNAHomo sapiens 75ccatctgtgc aattccttcc
207620DNAHomo sapiens 76gttcaaaggc agaagccatc
207724DNAHomo sapiens
77gtctggattc tttcacaatg tagc
247821DNAHomo sapiens 78tgccaatctt ctcctctgtt c
217920DNAHomo sapiens 79aaccacccaa tgtgttcacc
208020DNAHomo sapiens
80gttcattcct gcgagtaggc
208120DNAHomo sapiens 81gccaaaggtg gaaaatgttg
208221DNAHomo sapiens 82gccttcttca tgaaagcact g
218320DNAHomo sapiens
83ccagaaggtg gaagctacag
208418DNAHomo sapiens 84tggggtcaat gaagcaag
188520DNAHomo sapiens 85acatcgaccc agaaagttcc
208620DNAHomo sapiens
86aatgtgcttc gtaccactgc
208720DNAHomo sapiens 87agcgtgccat tgtactctcc
208820DNAHomo sapiens 88tttctgagcc catgatttcc
208920DNAHomo sapiens
89gtgcccagct agttccattc
209020DNAHomo sapiens 90tcaagagcgc taatcccatc
209121DNAHomo sapiens 91tgcacatgct cactgaaaga c
219219DNAHomo sapiens
92ttttgcctgc aaactgacc
199320DNAHomo sapiens 93cagcaagcac caaatcactg
209420DNAHomo sapiens 94agtaccagcc gtccaaactg
209520DNAHomo sapiens
95cctggccaga aaattcattg
209620DNAHomo sapiens 96accctgcatt ccaaactcac
209721DNAHomo sapiens 97gcagtccttt gaggatttag c
219824DNAHomo sapiens
98gaaagatatc caacaggaag tgag
249920DNAHomo sapiens 99tggccttgtt taaggtcctg
2010020DNAHomo sapiens 100atggtcctgc tgcttcagag
2010120DNAHomo sapiens
101accccgtcat agcacagttc
2010221DNAHomo sapiens 102caaaggccat tcatcagttt c
2110321DNAHomo sapiens 103gtggcgtgat atccttgatt c
2110420DNAHomo sapiens
104ctctggaatg actgctgctg
2010520DNAHomo sapiens 105tgtgctagat gcctcactgg
2010620DNAHomo sapiens 106ttgccaagaa gcacaacaag
2010720DNAHomo sapiens
107cggaggctct actgttggac
2010820DNAHomo sapiens 108tgctgtccac tctggaactg
2010920DNAHomo sapiens 109acatcagaag ccctggtttg
2011020DNAHomo sapiens
110gctgggagtt caagcatctc
2011120DNAHomo sapiens 111tcggtctcag tcaccatttg
2011220DNAHomo sapiens 112aacgcacctg gctgaaatac
2011322DNAHomo sapiens
113tgaacctgca atatctcaga gg
2211423DNAHomo sapiens 114cttaccgata acctgagaac acc
2311522DNAHomo sapiens 115cccagcccat atattttaaa gc
2211621DNAHomo sapiens
116ccagccactc tctggactat c
2111719DNAHomo sapiens 117gacatggaga gccgaatcc
1911822DNAHomo sapiens 118ccattaaaat cgggtctgaa ag
2211919DNAHomo sapiens
119tccagaccca gtgcacatc
1912020DNAHomo sapiens 120catggtcagt gccatcagag
2012120DNAHomo sapiens 121agcctcccaa agttaagtgc
2012220DNAHomo sapiens
122cccagctaaa accaacacac
2012320DNAHomo sapiens 123tgccctcagc tactcactcc
2012420DNAHomo sapiens 124agggctcagc ctttaggaac
2012520DNAHomo sapiens
125gccagactct cgttccattc
2012620DNAHomo sapiens 126actccccatt cagtcccttc
2012720DNAHomo sapiens 127aggcacaacg tcaggttttc
2012820DNAHomo sapiens
128ttggaatttg tcctggtgtg
2012920DNAHomo sapiens 129caccattgcc aacacttctg
2013020DNAHomo sapiens 130gccattggtt tgaaggtgac
2013120DNAHomo sapiens
131cttagtcacc gcctgtcctc
2013220DNAHomo sapiens 132tagctgcatg tggctaatcg
2013320DNAHomo sapiens 133tgtggctcgc attacatttc
2013420DNAHomo sapiens
134cgctgtcatt acctgctttg
2013520DNAHomo sapiens 135tgacctccaa aatcatccag
2013620DNAHomo sapiens 136ttctgagcta ggaggtgctg
2013722DNAHomo sapiens
137ccagatttgt aaatccctgt tc
2213821DNAHomo sapiens 138tgtgtggttc ttaagcattc c
2113920DNAHomo sapiens 139ctcagtccat cagcctcctc
2014020DNAHomo sapiens
140tgctgtgccc tgagattaag
2014121DNAHomo sapiens 141aacttaatct cagggcacag c
2114218DNAHomo sapiens 142tgcagcttca gcctcttg
1814319DNAHomo sapiens
143gcgtggtgtt tcgtaccag
1914421DNAHomo sapiens 144gctactggcc agaaatcttc c
2114520DNAHomo sapiens 145gcccagccct actaaggaag
2014620DNAHomo sapiens
146ctgtgctccc ctgctagaac
2014720DNAHomo sapiens 147gtcgtcctct tcgacctagc
2014820DNAHomo sapiens 148cagcgcctat tctacagcag
2014920DNAHomo sapiens
149ttcttcccaa gagagccaag
2015020DNAHomo sapiens 150ccacctttaa tctgcccaac
2015121DNAHomo sapiens 151gtgttgggca gattaaaggt g
2115219DNAHomo sapiens
152gcagtgtcat gccctgttc
1915321DNAHomo sapiens 153ctctttgtgc cctttctttt g
2115424DNAHomo sapiens 154agttccttaa agcagagaag
atgg 2415520DNAHomo sapiens
155aacctgtccc tgtggatgag
2015620DNAHomo sapiens 156ccgaagcatc cttacattcc
2015720DNAHomo sapiens 157aatacctgaa cccccaaacc
2015823DNAHomo sapiens
158ctcaggctat tttccagatt cac
2315918DNAHomo sapiens 159gcatgcctgt cattctgg
1816020DNAHomo sapiens 160tccaagggac tgaaacacac
2016122DNAHomo sapiens
161ttagtgtgtt tcagtccctt gg
2216220DNAHomo sapiens 162gacagcaaga ccaacccttc
2016320DNAHomo sapiens 163gcacattacg agctcagtgc
2016421DNAHomo sapiens
164ctaccaggag aacagcacag g
2116522DNAHomo sapiens 165tgggttagca ttgtgttagg tg
2216620DNAHomo sapiens 166ccacaggtgt gtgccaatag
2016720DNAHomo sapiens
167aagttgcagt ttggctggtc
2016821DNAHomo sapiens 168ttatctccag cggtgcttat g
2116920DNAHomo sapiens 169taccataagc accgctggag
2017020DNAHomo sapiens
170actccaccaa gcccagtctc
2017120DNAHomo sapiens 171tttagagact gggcttggtg
2017219DNAHomo sapiens 172ctcttcccca acaaacctg
1917320DNAHomo sapiens
173cccagtttca agcgattaag
2017422DNAHomo sapiens 174aggaaaagca tgttatctcc ag
2217520DNAHomo sapiens 175ttccgtagca gtaggcatcc
2017622DNAHomo sapiens
176tcaccaccac caactttatg ag
2217721DNAHomo sapiens 177tcccagatct taaccgactt g
2117820DNAHomo sapiens 178atggcggttt tgtggaatag
2017920DNAHomo sapiens
179cccaaacaac agcattagcc
2018019DNAHomo sapiens 180acatcagcct cgggacaag
1918121DNAHomo sapiens 181tgagcccgtt gaatatagtg g
2118222DNAHomo sapiens
182agttttccta aacgggatga tg
2218320DNAHomo sapiens 183atgggtgtgc acgtgtgtag
2018420DNAHomo sapiens 184gccatgtgca attgtgagtc
2018521DNAHomo sapiens
185ccttgcatag tttgcttctg g
2118620DNAHomo sapiens 186atcatacaag ggcctgttgg
2018720DNAHomo sapiens 187aaacagaaat cgcccaacag
2018820DNAHomo sapiens
188tagagaccca cccagaaacg
2018920DNAHomo sapiens 189cagtccgatt tcgtttctgg
2019021DNAHomo sapiens 190cacacctaga tttggcaatg g
2119121DNAHomo sapiens
191ttccattgcc aaatctaggt g
2119220DNAHomo sapiens 192ggccctagtg tttcctttcc
2019323DNAHomo sapiens 193aaggaaacac tagggcctac aac
2319421DNAHomo sapiens
194cctggcctca gtacactttt g
2119520DNAHomo sapiens 195agggattctc cccacttagc
2019620DNAHomo sapiens 196attggaggac tggctcaaag
2019720DNAHomo sapiens
197gcttaccttt gagccagtcc
2019820DNAHomo sapiens 198acatgttcct acccccagac
2019920DNAHomo sapiens 199tttctgcatc agttggttgc
2020021DNAHomo sapiens
200gccaagttat tgctgcttca g
2120120DNAHomo sapiens 201agccctgtga ggttggtaac
2020220DNAHomo sapiens 202tcaacaacag ctggaactgc
2020320DNAHomo sapiens
203cctctcaggt caggcttctg
2020420DNAHomo sapiens 204gctcccgcta gagaaactcc
2020520DNAHomo sapiens 205gagcgaagca cctaaagcac
2020620DNAHomo sapiens
206aattggaggg ggtggagtag
2020720DNAHomo sapiens 207tgtcacccag tcaggtcatc
2020820DNAHomo sapiens 208ttggaaggaa tccaacaagg
2020920DNAHomo sapiens
209ttcccagaac tccttgttgg
2021020DNAHomo sapiens 210tgcaaacccc ttcttttcag
2021120DNAHomo sapiens 211accccatgca gaagcaatag
2021220DNAHomo sapiens
212aaatcctgaa ggtgggttcc
2021319DNAHomo sapiens 213agtttcagcc atgttgcag
1921420DNAHomo sapiens 214ttggcaaaat tgtgactgag
2021520DNAHomo sapiens
215cagtcacaat tttgccaagg
2021622DNAHomo sapiens 216agttcgtggc atctaactat cg
2221720DNAHomo sapiens 217ggtccatgtg ctccaaaaag
2021820DNAHomo sapiens
218tccaaaactg ggaacaaacc
2021920DNAHomo sapiens 219tggtttgttc ccagttttgg
2022020DNAHomo sapiens 220tagtgcacca cagcctcaag
2022120DNAHomo sapiens
221ggatcacttg aggctgtggt
2022220DNAHomo sapiens 222tccaacaact gctgtgaagg
2022320DNAHomo sapiens 223caccactgac cttcccttcc
2022425DNAHomo sapiens
224gcacagaaag acaaatatca catgc
2522520DNAHomo sapiens 225ctcttcctcg tctcctcctg
2022620DNAHomo sapiens 226ccaattcaat gcaaaacctg
2022720DNAHomo sapiens
227cgagcagctc tctcttcagg
2022823DNAHomo sapiens 228agcctataag cacagaccaa ctg
2322922DNAHomo sapiens 229ttctctagca gttggtctgt gc
2223020DNAHomo sapiens
230accctgcatt ccaaactcac
2023120DNAHomo sapiens 231gttcattttg gggcatgttc
2023221DNAHomo sapiens 232ctgcaacctc ctttgagaca g
2123320DNAHomo sapiens
233tgtctcaaag gaggttgcag
2023420DNAHomo sapiens 234ccaaaatgaa actgccttcc
2023520DNAHomo sapiens 235agttccctgg gtcattttcc
2023620DNAHomo sapiens
236ttgtgggaag gcaaactagc
2023720DNAHomo sapiens 237cctgtgctag tttgccttcc
2023820DNAHomo sapiens 238ggtggtcacc gtggtaaaag
2023921DNAHomo sapiens
239gaccaccatg tgatttccaa g
2124020DNAHomo sapiens 240ttggttggcg gttatttctc
2024120DNAHomo sapiens 241taaccgccaa ccaagaaaag
2024220DNAHomo sapiens
242tgtctggaga ccttcccaag
2024320DNAHomo sapiens 243tgtgctagat gcctcactgg
2024420DNAHomo sapiens 244acttgcctac attgcccatc
2024520DNAHomo sapiens
245atgggcaatg taggcaagtc
2024621DNAHomo sapiens 246tctgcagcca tgaataagtc c
2124721DNAHomo sapiens 247cagagctgag gcgataaatt g
2124820DNAHomo sapiens
248tgctcctctc caatccattc
2024920DNAHomo sapiens 249atactttccc agcccaaacc
2025020DNAHomo sapiens 250tgatggggaa atgagaggag
2025120DNAHomo sapiens
251agtggccttt gtccattgag
2025222DNAHomo sapiens 252gacagaggtg agagcctagg ag
2225320DNAHomo sapiens 253aatgtgttgg ggaagtggtc
2025420DNAHomo sapiens
254tttggaccac ggctttagac
2025520DNAHomo sapiens 255aagctgaggt cacggatttg
2025620DNAHomo sapiens 256gatgggcaag tttcatctcc
2025720DNAHomo sapiens
257tgggacgaag aaaaggaatg
2025820DNAHomo sapiens 258caccgtgcct cagcctatac
2025920DNAHomo sapiens 259ggactaaccc acctcccttc
2026020DNAHomo sapiens
260gctataggca gcccagagtg
2026120DNAHomo sapiens 261gccagactct cgttccattc
2026220DNAHomo sapiens 262aggatttgcc gtatggactc
2026320DNAHomo sapiens
263tcgcccaaag tcacagtaag
2026421DNAHomo sapiens 264gatctgtagg cccaggattt c
2126520DNAHomo sapiens 265aggggtttct atggctggtc
2026620DNAHomo sapiens
266cctccctcaa acctcctctc
2026720DNAHomo sapiens 267ttctcctgca gaggaagagg
2026820DNAHomo sapiens 268ttggaatttg tcctggtgtg
2026919DNAHomo sapiens
269aaagccaggg agtgaatgg
1927020DNAHomo sapiens 270atgtgcatct ccctggtgac
2027120DNAHomo sapiens 271tgtggggaaa tcaaaacctg
2027220DNAHomo sapiens
272gggtagactg tgcgtgtgtg
202731133DNAHomo sapiens 273ttcttcccaa gagagccaag atttcttctt tcctcttctt
tctttttttt ttctttctaa 60tttcaaagga gtataattaa attgccaggt aaaagctcaa
aggtcttttt tatagtgttc 120tggaaggttc tctgcctgtg tttgtatttc ctttagcctc
cacgttcctc tatccagttc 180ccgcaccctt ccccccaggc cccattcttc aaggcttcag
agcagcgctc ctccggttaa 240aaggaagtct cagcacagaa tcttcaaacc tcctcggagg
ccaccaaaga tccctaacgc 300cgccatggag acgaagcacc tggggcgggg cggagcgggg
cgcgcgggcc cacacctgtg 360gagagggccg cgccccaact gcagcgccgg ggctggggga
ggggagccta ctcactcccc 420caactcccgg gcggtgactc atcaacgagc accagcggcc
agaggtgagc agtcccggga 480aggggccgag aggcggggcc gccaggtcgg gcaggtgtgc
gctccgcccc gccgcgcgca 540cagagcgcta gtccttcggc gagcgagcac cttcgacgcg
gtccggggac cccctcgtcg 600ctgtcctccc gacgcggacc cgcgtgcccc aggcctcgcg
ctgcccggcc ggctcctcgt 660gtcccactcc cggcgcacgc cctcccgcga gtcccgggcc
cctcccgcgc ccctcttctc 720ggcgcgcgcg cagcatggcg cccccgcagg tcctcgcgtt
cgggcttctg cttgccgcgg 780cgacggcgac ttttgccgca gctcaggaag gtgaggcgcg
gattggagca gagttgtgga 840gctgggctgg gctggggggc agcggccccc ggccctcggc
ccccgaaacg ggcataatag 900ggaggggacc aagaggccgc gctttccagc gtggagaccg
gacggtgcgg ccgtgctccg 960gctcaggccc tccgcgcggt aggaaacggc gagggccgtc
ccggggagca gcctcacttc 1020gcagctttgc tcgccttggt agggaaatgg ccttgggcgg
aggcggggga caggcaggga 1080acggagtggc cacgtccagg tttcctgcgg ccaccgaacc
ggtgcctcgc gcc 1133274782DNAHomo sapiens 274ggcttcaatc
tgggactacg tacttaatgt taaattgctt taaagtggtc atagctgcta 60caggtttgtg
ctcagaaagt ctgcacctga ctggtctgat ttaaatttta cgccccttag 120gtatgaacag
tgtgttttaa acaagtacag gatggggctg cagaagattt aaacgcttga 180gaacaagtgc
tgtattttcc ccttttgtga ccccagtatt gagtttagtg ttgggcagat 240taaaggtggt
tcatatcgac tataacttga acagggaaaa attgaaatca acttagggta 300cttgggatac
gaaggatcaa tataaaaact ctggtttgtc atgctagctt tttctttttt 360ttcctcttca
gttgaactga ggagatagtt tttgttttta atgattgtgc tcttttaact 420agacaaaagg
aattagatag tcttgcctat tcgaagttaa atgaactttt gaggttgtta 480aggacaaaac
tattaaactg acatcaataa tacagaatgg gctgcttagt atcactttcc 540ttatcaggta
ctaggattta atttagttag gaaactcact taaagggagg actataactg 600cagttgaaag
tgtaattttt ccaagatata aaattgttta aagattgaat atattcctgt 660taagccccaa
aggaaacatc cctcatttaa gaaaatgggg tgggagagca agagaaggtg 720aggattcaca
gatcctagaa ttggaatagt tgattttttt ttgtaaaaga ggcggtgaca 780gc
7822751258DNAHomo sapiens 275gccaggcact taggcagtag tctatagctg aaaaataaaa
cattcagaac cactttttaa 60ggttttgtgt ccttgtaact ttaggcatta ttattacaat
ataacttagc tgggacatga 120gagttaatag atccacattt taaagtagat ttttttttta
attttctaga atgtgtctgt 180gaaaactaca agctggccgt aaactgcttt gtgaataata
atcgtcaatg ccagtgtact 240tcagttggtg cacaaaatac tgtcatttgc tcaaagcgtg
agtaaaatat cctaattacc 300tgtaagcttt attttgactt aatacttctt taattgatgt
gccttgagtt ggaaagagtt 360ttattggctt aaatctgaat catgttacaa agtaagtgtg
ggaacacata aatttcaaat 420aatctttgac cctggaactt tagagttaat tttttttttc
ccgtaatcat gaaatcagtt 480atttttcagt ttggcattaa ggtttctttt tcagtggctg
ccaaatgttt ggtgatgaag 540gcagaaatga atggctcaaa acttgggaga agagcaaaac
ctgaaggggc cctccagaac 600aatgatgggc tttatgatcc tgactgcgat gagagcgggc
tctttaaggc caagcagtgc 660aacggcacct ccatgtgctg gtgtgtgaac actgctgggg
tcagaagaac agacaaggac 720actgaaataa cctgctctga gcgagtgaga acctagtgag
tggggctgcc tatactactt 780gttttcatgc tgttcagatt catttaatta aatttatttt
tgattatgta atatgatttc 840atggtttaga attcagaaga tatgagtgtc cagtgaaaag
cttccttctc attccagtcc 900ccctcgctac ccattggacc tccacagaat tgatgttatt
gattattcta taaccttcca 960gagatagttg atgaatttgt tatatatctg ttttattatt
tttacataaa tgatagcata 1020ctaggtataa tttttctttt atatctttac ttaacattat
tcagtatttc attgttgcat 1080tagtagtaaa tgtatgtaat ttaacctatg tatttgctta
ttgattgtgt tttaaaagtg 1140agatatgctt gttttaggga ttgtttaatg aaaaggcaca
gaaacccact caagctagct 1200taagcaaaaa aagacttcat tggaagggac tagaaactgg
aaaggatgtc aggaccaa 1258276665DNAHomo sapiens 276ttagttgaac
agggcatgac actgccagct aaactttgac ttaatgtgac tttatgtatt 60gtgtccagag
aacagagggt caatattaga aaaggtgttc cctcctgggt gtgtccttta 120tgaaggatgt
gtaagggaag aaattatagg aatagctact gcataaattt tttttctctt 180agtccttata
attcgagaat tttaggatta gcttattagg aaaatagtat ggaagactga 240gttatagtca
actgacattg tctttttact ttatagctgg atcatcattg aactaaaaca 300caaagcaaga
gaaaaacctt atgatagtaa aagtttgcgg acgtaagtgc aattaaatgc 360atcatattct
tgcacagttg gtggctcaaa tcttccatcc tacaccatta gaaaaagcaa 420gtctaaatgc
ttttttatat ttctgaaaaa taaagttact tgaaatagag ttgcaagaat 480agcacagaga
ttctgggaat acacttcact cagattcacc aattaacatt ttggcacatt 540tgctttttat
atgtgtatgt gtggatgaat atgtgtgtgt gctttacatc agtgtatcta 600tgcatgtata
aatatttttc ccagaagcac atgagagcaa gttgtagaca tcaggcccct 660ttacc
665277649DNAHomo
sapiens 277gagccttgat gttccctctt aactaaaagc aggttatgca tttttgacag
gaaaactact 60taagcgatct tgtgtccttt ataatacttc acattaggag ttgcatgatg
tcagcttgtc 120cctttactag taaagtaaac tttggttaaa gtggtatcca ccaggttttt
ccactgtgaa 180gttaccattc tccctttgta atccataaat aatctatggg cagatacttg
gatactaagt 240aaatgttctt tttctaatta aactggtacc cagcagtttg aatatcaatg
gatgattcca 300gcctgaatca attattatta tgatagttgc aaaatggcag aaaaatttta
actttaatga 360cagttttaga ccctgagctg tctgcttaaa gagtagtgct tcttactgtt
gtgtggtaca 420aacatttttt tttaatacag attttaaatt ctttacagtg cacttcagaa
ggagatcaca 480acgcgttatc aactggatcc aaaatttatc acgagtattt tggtatgatt
ttttaataag 540tgagctttag cagacagttg gtgagacagt atgttttgag tataaggaca
gccagtgatt 600taagtggtgg ttaaatgcac ttactggagc aacagtttcg gatctgggt
6492781250DNAHomo sapiens 278ccggccttac ctttcatttc tttagtaatt
tagttttaaa gtagttctaa tccaaataaa 60atactttcat atcttattta aaaatctttt
caatataaga aaatcctctt aggaaaaatt 120gtacattgta attatgtttg gttgcatggc
tgtcttattt ccctttgata gatttagaga 180cctcccaaag atttcttgat tagtgataaa
cttagttatc cactaatgga aaggaacagt 240gatgcatgta gattatagaa aatcaaacac
tgaatattct gattctcaat taatgttatt 300ttcaaatgat tttgattata ttagtattaa
tttgtattat tcaatttttt tccccagtat 360gagaataatg ttatcactat tgatctggtt
caaaattctt ctcaaaaaac tcagaatgat 420gtggacatag ctgatgtggc ttattatttt
gaaaaagatg tgagtatcat cttctttatt 480cctgtgttca ggaatgtagt ctatcatgcc
tcaatgaatt aaatatattt catcaccttt 540ttatccactt acagatcaac caaatggttc
gctgctgccg ttaattttgt cctccctgtc 600actcacatgc atcttgcttg tttgtatatt
tatgcctctt atcaaattgt tctgcctaaa 660atatctcccc tctttcttat aattcttatt
tattatctac ttggtggtta cttagtttgt 720gcatatatgc tcccctatga tatttataat
ttacacaaat aaaagtctgt taaaaaagac 780tgtaactgat atgattaaaa tattttgttg
aaactttaat atattatagt gaggtatttt 840ctgctgaaat atgaggtttg cttcaaaata
atctgggcgg gggtgaaagg atgaaaggaa 900gaaaagatga agtaagagag gctatgtgtt
gttggccttg catctgggtg ataggtacat 960gggcatcatt gcactactct ttctactttc
gtgtatgttg aaaggttcct gtaataaaca 1020gttttttaaa gttccaataa attagattgt
tatcactaaa accataaaga ttcttggcag 1080cggttctttt ggcatacaat ttgtatgtaa
ttatatgtgg ccatggttgg tttccttaaa 1140tatttttaat tccttttctc cttttcaata
caggttaaag gtgaatcctt gtttcattct 1200aagaaaatgg acctgacagt aaatggggaa
caactggatc tggatcctgg 12502791568DNAHomo sapiens
279gagttccatg gcagatcacc atctgttttc tgcctcatag aagagtggaa tgggaagcct
60atggttttta ttctacaaag agtcaacatc taacagaatc ttctgaaggc atactccagt
120ggattcacct tggagaaact cattgtgact gatgatctga tttattatct ctatgccagt
180gaaataatca tttaatatga acttaatttg tcataatcta ttgtgtacta actagtctat
240actagtgtga catcaaagtg tcagattgtt agtgtgtttc agtcccttgg aattgaatat
300gaacacttat ccttgaaccc tatcaataac atttttcaca tatctcaatt tttgtgtgtc
360tttgtagttg tatgtgggcc acttactaat attttagcaa gtaataaaaa tagaaacgta
420aaggaatatt ggaaaaagtc taatggaacc agaaagttct agcatttttt tcccattctg
480tagtaggtca tctggtttat ttggtttggt gaccgcaagt ctagaagact aaccctgaat
540tgaatggtaa cagacaggca gaatgacaat gtagtgttgc agtgcagagc agtacagacc
600tgggtttggc tgggcaaaat tatataactt ctttaagcct ccatgtttcc tcatctgtaa
660aatgaggata atagatagta tggacctgtt gcaaggatta aacataatca gtgtaaagtg
720ttggtcccat gcttgccaca taagaaaata tttgtcaaca gagtggtagt tgtcattatc
780attgtctcag tttgcctgta actagttgtg tgatctgaga caaacactaa ttttgaactt
840gagtttcccc acatgtaaaa tgaaagattg ataatagaaa gtaaatcaat tttttctagc
900attaaaaata gtatgcattt aataaaaatc ttattcttaa tgatctagct tacctccaac
960ttgccctagt cactttggcg atcttgtctc taaatagaac cttgaaaaca cttaaatgtg
1020tgtttccttg caatataact ttttcttttt ttatttaaat aagtcttata aatgtgggaa
1080aaaattatct tgtgttcctt taatttcatt tttatttaat actattttca gaatgaacaa
1140aagattgaaa aattatttag aatttttttc tgtgcttttt cctgtttcag ataaaggaga
1200tgggtgagat gcatagggaa ctcaatgcat aactatataa tttgaagatt atagaagaag
1260ggaaatagca aatggacaca aattacaaat gtgtgtgcgt gggacgaaga catctttgaa
1320ggtcatgagt ttgttagttt aacatcatat atttgtaata gtgaaacctg tactcaaaat
1380ataagcagct tgaaactggc tttaccaatc ttgaaatttg accacaagtg tcttatatat
1440gcagatctaa tgtaaaatcc agaacttgga ctccatcgtt aaaattattt atgtgtaaca
1500ttcaaatgtg tgcattaaat atgcttccac agtaaaatct gaaaaactga tttgtgattg
1560aaagctgc
1568280537DNAHomo sapiens 280gaagggttgg tcttgctgtc tctgggtcgc agaggcagaa
gaggtggagg aggtagaagg 60gaggcaggtg cacactgggt gtaactttta ttgaaaaaaa
atgtgttcaa atatacccgc 120acaattcaaa cccatgttca gggtcaattg tagttgtgac
agacccaatg acccacagag 180tctaaaatgt ttattggaaa tgtttgctga cccctgctct
aggatgctgg gggaaagcta 240ttcctaggta ggtgtctcag cagacatgga aagcagccta
taatattgcc ccagcaggtg 300gggtatggaa cagatgctca gggaggctgc tggctgctgt
ccactgcagg cccagaagcg 360tcctggagaa gccaccccat gctgcaagag ccagatcatg
gagcagccct ggatgctgca 420ggagcctgtc aagccaggac accagagcta ggaaacaaaa
ccttcctttt cagtgcctct 480ctagcaccct ctgctgacag agcttcagat ccattttcac
agagaggtgc aaagggt 5372811454DNAHomo sapiens 281cccggtgttg
aatcatttgc tttctctgca gcctgtcacc aggatgactt ccccattctc 60taccccactg
tgttttcttt tttttccttt ggcttcactg gtatgaatct ctcctgattt 120tttccctctt
ctactgattg tgtaatactg atgttcccag gatttgtcct tagttctatt 180tttcctccct
ttttcttcct ggaggatttc atccactttc ttgcctttat taccttctgt 240gaggctcaat
gagaacagaa gcaccatctc catttctgtt cttttttcct gagatctata 300gtaaagtatg
tatatttcac ataactagtt tttaatgatt tgatcatctt catccacaaa 360catattttat
ggcttgtatt ctcagaatca attgatggca ttgccatgaa ccaagtccca 420aattccttct
cttttacttc ctccattgtt ttagtcacca gagcctgtga tcctccccga 480gaaacaaatg
ctcctctaat ctgccttctc ttacacactt tccctggtct ctttataaac 540aatctactcc
ctctccagcc caaacctcat attgctccca ggattatttg cctaaagttg 600atcacagcac
ttttttacaa cataagatgt acccctttcc cctgccagag gggcatatat 660aaagcctttc
agtgtggccc tgtagtgtga accttttctc cttttgtctg tctggcatcc 720catttcctct
gcaatttggg aagaattgtc tgcatgtttt gtaagagcca ggtctctacc 780tccctcttta
gaagtttaag ggcagagatc ctctcatggc tctcagcatc cagggcacag 840gcacccccac
atagtacttg gaatcttgtg atggaaagaa gtaagaacag ttgagaaaac 900acccatgggc
agtggtggca tgtgaccaat ggcagttata ggcatgttgc agtggtggca 960tcccagccct
tggtcatctc tgccctacat gaccctgtgg ccatccagct gtggttgctc 1020tgcttggcct
cccttggctg ctgattcctc tcccctccca ttgccaatac cctgctccag 1080agctttatca
catgcctgga ttaatttaat ttccgacttt cccaatacca aattctcccc 1140actctcaatc
ctttcccaac gtcattccca gaataactaa aatctagtca cagtctacct 1200tatttctcac
tgtacctcca atacatcaaa ttttcctgca tttaagattt cccttcccat 1260tcctccaatg
ttccttcttt ccttgcttcc ttccttcctt ccttccttcc tccctccctc 1320cctccctccc
tccctatctt cctttcttcc cttctttttt tctttttttc cctggacagg 1380tcttggtctc
ttgcccagtc tggagtgcag tggcacaaac acagctcact gcaacctcta 1440ccttcagggc
tgaa
1454282530DNAHomo sapiens 282ctggccactt tttggaagag gaaatgtgtg agcaagcagt
gtgggaacca gagtgagcga 60atgctggaac tggttggtcg gtcctctctg gcaggagcag
gctctgtaca gccctggcag 120cagcatccaa gcccctgccc tcttggcacc cgggttcttg
tctggcatcc aggaagaatc 180aggacaaatg aatggattga aggggtagtg tatgtggagg
attttactgg gtgatggaaa 240tggccctcag tgggatggca gctagaatgg ggatggtgca
ggaggtaccc tccgaagtta 300gctgcgtctg cagtctctga cactcagtag cttctctgct
cgctgcccag gcgcttatgt 360tgctctgcca gctgaagtca ttatgggcac aggatagggg
catggcaggc caaaaaggta 420acattgggca gaaaaatggg gtcagctgtt ttcacttagg
gccatggttc caggccttag 480gggtggggtt tggccagtag cccagccatt ctgtatcact
ctgcgtccca 5302831992DNAHomo sapiens 283ttactggcga
tcctcagagc caagaagagt ctgggacata gcaggccata taaatgtttt 60cgaatgagtg
aatcatcaac gagtggatga aacgataatg tggctaacag gcagcagtaa 120ggaggctgtg
tagaataaac ccgtaatccc gatgttggca gtttgcttag aaagaaaaag 180ggaggcagtc
ggagaggggc acacgtttta acaaaatact gggaggagga ggaaggctag 240tttttttttt
gttttcaagt ttccttctga tgttactccc atgcttccgg gcacattacg 300agctcagtgc
ctgccggaaa tctcccacct ggtggcaacc tacccttgca tacaccccac 360ccaggggctt
caagccttgc agctgagtaa acacagaaag gagctctact aaggatgcgc 420gtctgcgggt
ttccgcgcga cctaggcgca ggcatgcgca gtagctaaag tcaccagcgt 480gcgcgggaag
ctgggccgcg tctgcttatg attggttgcc gcggcagact cccacccacc 540gaaacgcagc
cctggaagct gattgggtgt ggtcgccgtg gccggacgcc gctcggggga 600cgtgggaggg
gaggcgggaa acagcttagt gggtgtgggg tcgcgcattt tcttcaacca 660ggaggtgagg
aggtttcgac atggcggtgc agccgaagga gacgctgcag ttggagagcg 720cggccgaggt
cggcttcgtg cgcttctttc agggcatgcc ggagaagccg accaccacag 780tgcgcctttt
cgaccggggc gacttctata cggcgcacgg cgaggacgcg ctgctggccg 840cccgggaggt
gttcaagacc cagggggtga tcaagtacat ggggccggca ggtgagggcc 900gggacggcgc
gtgctgggga gggacccggg gccttgtggc gcggctcctt tcccgcctca 960gagagtgggc
ggtgagcagc ctctccagtg cggaggcacg ggggcggaac gttggtgctt 1020gtgcggattc
cgccgtcccc aggttctgct tggctccgga gggacgcccc cctcagccct 1080gaaacccgtg
cctctccagc cgccccggat ctgaacttgt gatcacggag tgtttacgtc 1140gtgccaggca
ttttaatgca ttgttctagt tcattttcca gcagtcgcat tcctcgcctt 1200ggccctacat
gtagcgctca ttacaaacac ggccagaatc tcttattaac aaacagcagc 1260caggagtgag
atttaaaata gactgggggt ttaggagacc cttttatgac acgtaattct 1320gctcccacga
cgctcccatt tataccgccg gtccagctaa gggtctggta atggagcgcc 1380gttgaagagc
agtatgatga agtggtcagg accaacggac tctggagctg ggctgcttgg 1440gatcaagtcg
ctgcccctct gcttattaac gtgtgacctt gggccagtca tggacgctat 1500ctgcttcagc
tcagcattca gtgctctccg tcacccgacc ccatctatcc aggattatct 1560ctccctggaa
agctacaaac gtctcaccct atgtgggcca aatgttctgg ataggcctag 1620ttaacctctt
ctctccctgt tttctttgcg ctttcttgca gctatgtagt tatgctaatg 1680aaaagagcat
cctaggggga gcagagttgt ggattctagt cctgactaga ggactagtgc 1740aaatgcgata
ctcctgatga aaaatgtttc attcgttaga tataaatgtg ttaggcaggg 1800ttatggacac
tagatgaaaa aagaaatacc tctactttca tagagatcac tattggacag 1860caaggcagaa
ataattacaa ttcaagttgg aggcttatgg aggtgagctt gtaagaggtt 1920acaagaggcg
ccaaggcagg atcgccaaag acggaagact ttggaagagt ctcatacaac 1980ggaagaggcg
tt
1992284497DNAHomo sapiens 284gaaaggacag accaagtgca gtggttcgtt ccagcactta
gggatgccaa ggtgggagga 60ttgcttgatg ctaggagttg aagactagcc tgtgtaacat
agcgagaccc atctctacaa 120aaaaattaaa aagttacctt tagaacttac gatttttatg
tgtagactcc atataagcag 180agggtctatg cttattcact atttattacc ttccatagtc
cctgcacata taataggtgc 240ttcataaaca atttaatgaa tgaataaatt actgagaaaa
cactggaagt ttttgggtta 300gcattgtgtt aggtgcttga tatggtctgg ctgtgttccc
acccttatct catcttgaat 360tcccatgttt tgtgggaggt acctggtggg acataattga
atcatgtggg caggtttttc 420ctgtgctgtt ctcctggtag tgaataagcc tcacaagatc
tgatggtttt aaaaatggga 480gtttccctgc acaggct
4972851667DNAHomo sapiens 285agtgggatgc agctgaaaag
atacccgaaa atatggaagc aactttggag ctgggtaaca 60ggcagaggtc agagcagttt
agagggctca gaagaagacc agaaaatgtg ggaaagtttg 120gaacttccta gagacttgtt
caatggcttt gaccaaaatc ctgataatga tatggacaat 180gaaatccagg ctcatgtggt
ctcagatgga gatgaggaac ttgttgggaa ctggagcaaa 240ggtgacactt gttatgtttt
agtaaagaga ctggtggcat tttgccctgc cctagagatt 300tgtggagctt tgaacttgag
agaaatgatt ttgggtatct ggtgggagaa atttctaagc 360agcaaagcat tcaagaggtg
acttgggtgc tgttaaaggc attcagtttt aaaagggaaa 420cagcatgaaa gtttggaaaa
tttgcagcct gacaatgtga tagaaaagaa aatcccgttt 480tctgaggaga aattcaagct
agctacagaa atttgcataa gtaatgagga tcccaatgtt 540aatccccaag acaatgggaa
aaatgtttcc agggcatgtc agaggccttc atggcagccc 600ctctcatcac aagcctagag
gcctaggaga aaaaagtgat ttcatgggcc agcccggggt 660ccccatgctg tgtgcagcct
agtgacttgg tgccctgcat cccagctgcc ccagctgtgg 720ctgaaagggg ccaacctaga
gctcaggcca tggcttcaga gggtgcaagc ctgaaacctt 780gacagcttcc aggtggtgtt
gagcctgcag gtgcacagaa atcaataatt gaggtttgag 840aatctctgcc taggtttcaa
agatgtatgg aaacgcctgc atgtccaggc agaagtttgc 900tgcaggggtg gggtgctcat
tgagttcctc tgctagggca atgtagaagg gaaatgtagg 960gtcagagccc ccccacagag
tccctactgg ggcaccacct agtggagctg tgaaaagagg 1020gctaccattc tccagacctc
agaatggtag atccacagac agcttgcacc atgtgcctgg 1080aaaagctgta gacacttaac
gccatctcat gaaagcaacc aggcagtgtg ctgtaccctg 1140caaagccaca ggggcagagc
tgtccaaggc tgtggttgcc cagctcttgc atccgcatga 1200cctggacatg agacatagag
tcaaaggaga tcattttgga gctttaagat ttgactgcca 1260tgctggattt tggacttgca
tggggcctgt agcccctttg ttttggccaa tttctcccat 1320ttggaatggc tgtatttacc
caattcctat accccattgt atctgggaag taactaactt 1380gcttttgatt tgacaggctc
atatgcggaa aggacttacc ttgtcttgaa tgagactttg 1440gactggaatt ttgaattaat
gctgaaatga gttaaggctt tgggggactg ttgggaatgc 1500atgattggtt ttgaaatgtg
aggacatgag atttgggagg ggtcatggca gaatgatatg 1560gtttggctat gtccccacct
aaatcccatc ttgaattccc atgtattgtg ggagggacct 1620ggtgggagat agttgaatca
tggggatgga tctttcccat gctgttg 1667286913DNAHomo sapiens
286ttgaaagttg gtcttaggaa gaggaacttt ttgtggaaat ttcttaatat ttgaagaata
60ttatgttatt gttcctctgt ttttcatggc gtagtaaggt tttcactaat gagcttgcca
120ttctttctat tttatttttt gtttactagg gttctgttga agataccact ggctctcagt
180ctctggctgc cttgctgaat aagtgtaaaa cccctcaagg acaaagactt gttaaccagt
240ggattaagca gcctctcatg gataagaaca gaatagagga gaggtatgtt attagtttat
300actttcgtta gttttatgta acctgcagtt acccacatga ttataccact tattgtaata
360tgcagttttg gaagtatatg ttaccattta actgtacaga gtacatagta atagagtggt
420aattatttag attgattaaa gaactcattt ttttaaataa gttttttttt tttcactata
480aaagtttatt ttatttgaga tggtatggta tcgaacatgt tcatattgtg tgtaatcgtg
540ggtaaattac tcaaccttta tgtcatagtt tcttcacctt taaaatgaca ttaataaaag
600agctacttaa taggattata agcatgagat gatttaatat acataaaata cttacagtct
660gatatatagg aagcacttaa ctctttatcc tagaaaagat ttaaggtgac cttaacatat
720atgtcagaaa atctttaaaa ttgtggaaat aaaaggttgt ataattctgc tatcctaaaa
780ttactagtat ttcaatatat tttattttag tcttttcttt tagatacaag ttttaaaact
840tttaagtgaa gtgtaatata cgtaagtact gcttgatgaa tttaaggtga tttctaaagc
900caggtttgtt ggg
913287907DNAHomo sapiens 287agacgcccaa aatcaacaac aacaaaaacg atattggaat
gattggatcc ccaaagataa 60atgtttgagg tgatggatat ctcagttacc ctgagttaag
tattatacat tgtatacgtg 120tattaaaata ttacaaaccc ccaaatgtgt acaattatga
ggtatcaata aaagagattg 180gaaggactgg gtaatttgca agtaattaag gcaatttaca
atttttaatt tttatttgtg 240aataagtagt tatacgtgtc aaaattcaaa aaggacaggt
ggatatacag tgataagtca 300tccccccttc tctgtcagct ccataaagag cccctgtctt
gcatggctcc agggtcacat 360ttcctattgt attttgccac cacctgccct gggagcaaca
gtgttagttt cttgaacatc 420cttccaagca gagtctgggc ctacacaagc aaaacaagta
tgtctattct ctctcctctt 480taattttttt aaaggaagtg attgataatt taacactcaa
gctataggtc attggttata 540tttttaattt ccaatttatg ggaatagagg aagtgtcagt
gatccccttc tggtttaaga 600actggaggat gcatgtgttt agacccttta gaaacctgaa
atgtcaccta atataattat 660cagagtaaca ctttttagta agcaagctat ctatcaaaag
taggtttttg aagaagaggg 720taaggaaagg ttactttcat gggacatagc aataatttct
aaaatctaat ggttttacaa 780gacttgttca ttagaagtaa catctgtgag gatggcttta
tgagtcaaaa tattatctgc 840ttaatacccc acctgtaggg taagaagaaa tgtttttttc
ttggtgacaa tttttagcag 900caagcgg
9072881137DNAHomo sapiens 288tgattgccaa ggaagattca
cagggcctag aatggcagtg gttatgcatc tacagtttat 60tacaggagaa ggatacaatc
cagtagcagg attatggtaa ggatatgcat cacagtcaaa 120ggctgtcata gcaagtcatc
cagagagttc gggtgcaagt tccagttttc ctttgttgtg 180taaagtctgt ggtggggtgc
attttctctc tcagagcagg atgtgtgcac aggacacctt 240ggaacctagg agcccaaaat
agagtcttca ctggactttt taatattttt cttgtcaagc 300ggacatgttc ctgttctcta
actagcctct tcagtggagg tcagaggaag agcctcattg 360agaccaagtg caactcatca
atcacatgaa acaatgctga taaataaacc acctaaatat 420cccctgaccc acaaatacaa
aacaacacca ttcaatcagt atttttcatg ccttgatcag 480gggtcattgc catgcaggaa
ctttaacaaa acagtacagg ctaataatag aattgttgga 540attaactcac acagcacacc
tatgagagag agttaagata gagggtcttg gtggtctcta 600acagttgaat tcaaagtgaa
gttaccagag taaagtgagc aaagacacat attagtacaa 660tattggtaga taaaatcacg
ttgctctaat aagcatagtt ttaaacttta accatgtttc 720tccagtaatt ttagtaatta
tattgttgtt atgtctaata cataaagcat tttttacttt 780tttaaaaaat ttttaggcaa
tgtggggtcc aaagtaatta aaaaaaaatt tttttaacat 840aaagcatctt aaaattttac
ttaatcatga tcacttagaa ccattaaaac atacgttttg 900atattatggg gaagcttcgt
tgttcctttg tagacagact taaagaaata caactttatg 960atgacaagat ataagataat
tatagattta aattttatag aaaccttttc ccttatctag 1020tgcaagaggt agctaagtgc
ttattttctc aaagtactgt gttataaaaa gtattcctag 1080tgtagtcaaa gcttctcttt
agactgataa aacttagagc acctgcattt acttcca 1137289457DNAHomo sapiens
289tcattcttgg gtgtttctcg cagaggggga tttggcaggg tcacaggaca atagtggagg
60gaaggtcagc agataaacaa gtgaacaaag gtctctggtt ttcctaggca gaggaccctg
120cggccttccg cagtgtttgt gtccctgggt acttgagatt agggagtggt gatgactctt
180aatgagcgtg ctgccttcaa gcatctgttt aacaaagcac atcttgcacc acccttaatc
240cgttcaaccc tgagtggaca cagcacatgt ttcagagagc acagggttgg gggtaaggtc
300acagatcaac aggatcccaa ggcagaataa tttttcgtag tacagaacaa aatgaaaagt
360ctcccacgtc tacctctttc tacacagaca cggcaaccat ccgatttctc aatcttttcc
420ccacctttcc cccctttcta ttccacaaaa ccgccat
457290564DNAHomo sapiens 290gagggagagg gaaccttttg ttttattcca gtaggaccag
ctagaaacag aaggtgattg 60accagtatta gggatggaat cagggtacaa ttatggagac
aggctatcta aacaattcac 120tctcaccatt taaatcagct gtttgatcat tttttttcca
tatatcttta ccatcgcata 180gtaaataata tcctttttat tttcaagagg gagtattggc
cttaagttag gaactctctt 240aatttttttc ccccatcatc ccacccgcac ttcttactcc
ttacttccta cttgctttta 300ttctttactg gctctttacc actgcgtatt tttaggtgca
tacatctatt ttttaaaaaa 360gcacccttgt tcctgggtcc tcttccagta ccatctatta
atatatctct ctccctcttt 420ccactcccag ctgggtttct gaaagcgtgc acttcccatc
ttccattcat tcatctggtt 480tccagccctg accacagtac tgaaatggca tttgctaggt
gacctttatt tttttttaaa 540tccagtgaat gcggtatagt cccc
5642912249DNAHomo sapiens 291tgttgattca tgggcatttg
ggttggttcc acgtttttgc aattgtgaat tgtgctgcta 60taaacatccg tgtgcaagta
tcttttttgt ataatgatat cttttcccct gggtagatac 120ccagtagtgg gattgctgga
tcaactggta gttctacttt taaggaatct ccacactgtt 180ttccttagtg gttatactgg
tttacattcc caccagaagt gtagaagtgt tccctgttca 240ctgcatccac accaacatct
atttttgatt ttttgattat ggccattctt gcaggagtaa 300ggtggtatcg cattgtggtt
ttgatttaca tttccctgat cattagtgat gttgagcatt 360tttttatgtt tgtttgccat
ttgtatatct tcttgagaat tgtctattca tgtccttagc 420ccattttttg ataggattgt
ttgttttttt tcttgctagt ttgtttgagc ttgttgtaga 480ttctggttat tagtcctttg
tcagatttat agattgtgaa gatttttttc ccactctgtg 540ggttgtctgt ttttgtctgt
ttccttctgc tgactgttcc ttttgccatg caaaagctct 600ttttttttga gacagaatct
cgctctgtcg gccaggctgg taacaaagac acaggtactg 660gtaataactg ccatggctta
ttgcctacat taatgatgaa agcaaatgct aaatttcagc 720tagaggctag agaaaataag
cctggaattt tcttttatgt ttatatactg ctatgaatac 780caggagtcct tgggttaaga
ctgtagggct ttctaaagcc tgtgatcact agtggagaat 840gtagctttac aaagtctagt
tggaaattgg caactggggg ttagtacaag ttacaaggaa 900gggatggaat ttaagatgct
agtgaaagct tggaggataa gggagcaggt gaactcataa 960ggaagtttat gaactgagaa
gggctgcagc aaagtgggct catgtgcttg aggagccaga 1020ggacatgttg agggtgacat
aggttctgaa gttcgtacag atacttatgc agtatggatt 1080cttggaaaac cttctttagt
catgtgatag aaaaataaca gcttatggaa aaaacagggt 1140tgaggcagac ctgaaaatac
atgaaatttt aaaaaccgct tctaacagaa gcataacaga 1200ctgtaataaa aactgtggcc
ttcctggcat ttgcacccaa acaacagcat tagccaactc 1260tttgaagcct tagatctgtg
gctcttgttt tctcctttga ggtgtaggtc cttgagggca 1320tttgcttcta atagaggcta
gtttcatcag aattaaaaat ctgaaccatg gtatgaaatt 1380caattctttt ttttttttct
tttttgaaaa cactggcaaa tgttttgtat ccttgagctt 1440tcccacatat cttaacatag
tgagtggaaa gtacagtggc tgttaagcca actactctga 1500ggtcttcact gctaaggctt
actcttaatt gtgtgagagc ttaaccttga tccctttaaa 1560acattaatgg gctagaaaaa
aaaccattca taaaccagtg ccacctctga attttgctac 1620cacaattccc ttatttacca
atagtgcatg agctaatttg gaataaagaa ctaggcattg 1680tagcacaaca gacattatgt
gggcaaagtg ttgtttatat tctgtctaaa tagtgcttca 1740catgtatgta ctattttcta
aatatgtata gatgcttttg tgattaataa taaaacatga 1800attcttaaaa caattttgct
gacttcatag tagcttttca ccgttttttc agtagctgct 1860aaaatttctg gagaagtttg
ggaactattg ttttggagtg aaatgcagtg tgttagatat 1920cacttgcaga attcttctaa
gggtatttat tggcgattag aaaaaaaatc cttgtgttat 1980accagtagta atacaaagta
attgttcagc ttctgttaag tgtaaaggac tatacaagta 2040ttgtgtatag ttatctcatt
tattattttc tgggtagcta ttgttattat tacttcgtac 2100aaaaagggaa aaggaggctc
aaagtatcat gctccagata acagagccag taggtagcag 2160agctgggatt gctacccagg
tctctagtcc tgctttttca cactatatac tcattgcttc 2220acttactcct tcatacatga
ttccccagc 2249292890DNAHomo sapiens
292catcaagcac agttccattg tgtaaaaact tggcttgatt taacctgtta attggaacac
60tgtcattaat ggaaattagg aatatgaggt aagctagagg ttttatttta atgactttgg
120gttattaaat ctataagaaa tgaaattcat ttagtcataa ttaatgtcat gtttctgcat
180ctatattact tgttgggttt acagacgagg tagtgtatta ttagtgggaa gctttgagtg
240ctacatcatc tccctttcta taaaataaat tgagtacgaa acaatttgaa ttaaaacacc
300tgagtaaata gtaactttgg agacctgctg tactatttgt accttttgga tcaaatgatg
360cttgtttatc tcagtcaaaa ttttatgatt tgtattctgt aaaatgagat ctttttattt
420gtttgtttta ctactttctt ttaggaaaac accagaaatt attgttggca gtttttgtga
480ctcctcttac tgatcttcgt tctgacttct ccaagtttca ggaaatgata gaaacaactt
540tagatatgga tcaggtatgc aatatacttt ttaatttaag cagtagttat ttttaaaaag
600caaaggccac tttaagaaag tttgtagatt tttcttttta gtatctaatt gtagcacctt
660tgtggacagt ggatgtaata ttaagtgaca gatgggaaaa ggatttttaa aaaaatagca
720actgtttcag tggatgaaat aaagattatt agcagagaaa atgaatattg ggcataactg
780tcctggtgaa agacaatctc ataaatgaac aatttcataa tttcgtaaat gcaactgcat
840tttattttca aagagaagga aaattatagt cactggaaac ggaaagagaa
8902931224DNAHomo sapiens 293ggagcttggg aattcaactg acacacgaca gatttacagg
agaaaagttt tatttcaagt 60acacatgaga gcttcataga aaagaagtga agacctaaag
aaacagactg gagagttcat 120atgccatttt aataaaggat aatgtattag tctgttctca
tgctgctaat aaatacatac 180ccaagactgg gtaatttata aagaaaaaga ggtttaatcg
actcacaatt gcacatggct 240ggggaggcct tacaatcatg gcagaaggta aaggaggagc
aaaggcacat attacatggt 300gtcaggcaag agagtgtgtg caggggaact gccctttata
aaaccatcag atctcgagag 360acttattcac catcacaaga acggcatggg aaaaacctgc
ccccgtgatt caattacctc 420ccaccgggtc cctcccatga cacatgggga ttatgggagc
tacaactcaa gatgagattt 480gggtggggac acagccaaga catatcagat aataaattgt
ggagaggcag taagattgaa 540gaaaagaggt ttgagcttcg aggggtggta aattgtggga
aggtaattat ttggggcaaa 600ctaatggcac ataaggattg ttttagtaag gcttgttatg
catacccaaa acaagtgcca 660tctccagtaa tttaagagtc tatggtgatc aagagtagtt
ctcttcctgc tagaagaggg 720gtgggagaga acaccttcac aaagggaaat ttatattctg
ccttcatgca gaaagggggc 780gagcagagag ttcctacgta tactgtttct tcattatctt
cctctcaaaa gaatacttag 840gctaaagtgg catgatttgg ggtgacattc tgatcctctt
cagtgacaat ccttgatatt 900tttccttctt tctctccagg taaacagtgt taacatcctg
gtatgcttcc cccaattcca 960ttatactaac tctgtattgt gggttaaaga ttttttactt
tgatcagcag tatttgaaac 1020atacctgtta tactagatgt actctgactg taaaatagtg
gtcagtgtta cttctttaat 1080gatgctgtgg gattaaagga ttttattata aatgctggga
agagcctgga tttgaggaag 1140gtaagcagtg cagttaggtg gatgtagact agaagaggtc
atttgttctc atttcattgt 1200tgcccctatg acatgcccgt ttct
12242941550DNAHomo sapiens 294cagcctacgt gcccatttct
taaagtagaa aatttagtag ttgatgatgt cagggaagaa 60aagctttttc tctgccttac
gttaagtagt tgggggcaaa ttaaattaat aaaagacaga 120ttagtgagag aaaaggctgt
aagaatttgg actttatata ccatcataat agaggaagta 180aagggagatg aagggcactt
aagggaaaac agatgacttg taggaaagat aaatgaaccc 240ttaagagaat agatgagaaa
tatgaaggtt ttgtgacaat gtctgtttag gtggttactt 300ctcttcttgt tatgagagtc
agtcttctgg ttgctggaaa ctgctaggag atttataaca 360attgggctct ttcgagaggc
tcttctttta agcagataag ggagttcaca aaaaagcctg 420ttctcaaatg atttcagcac
acacacacac acacttacga cacagttaag tactgtgcca 480gtaagatgtg agttgtgcat
ttcttttttt tctctgagta gactgtttga ggttatttat 540atcaggactt gttatgcagg
taactgaaaa ctcaacataa tctaggttat gtagttaaaa 600gtatggagag aaggtaggct
ttatttagag ttgcttgatc ctgtagatct cttctacctt 660ttgtaatttt aatttcaacc
aaggatggtt ccctttttgg ccctaggacc agttagcagt 720tggggcacca ttccaagcaa
gagtcctgca gtttgtagtg atgggaccat ttaaacagtg 780attgtgacca gggacataga
atgggatgat tggcccgagg taaccatggt tgggtatgga 840gtcagcttcc ctgtaggaag
agatagacaa agtctgagca ctcctgggaa gggggaggaa 900gggaataact gttgtgtaaa
tcatcagcag tgtctactaa gataccatct gtaaccatag 960gcttctatgt tttataatat
aaggctgtct tttaaataaa tcagattccc tgttaaagat 1020ctgttctaga ttccctaggg
ggttgacctc atatagtatc ttctttttct ttggttacaa 1080acttttaaac ttgtctgagg
ttataaggtg aattcaactg tccactgtca atgtagatat 1140ttttaatgga tttagggatt
taaattacat gattcagaac cactttgagg aagtctaggg 1200aatatcagtt gtttctgtat
aatttctgaa agcttcactg ttttctaggt gtgcacttaa 1260ttcatgtgat gaagggaaca
gtatttacat gagtggtttg gttaattttt cccctcctaa 1320gcttagcttt gtgtatcgtg
cgtgcttcca gtgtttttgt ggctgcttta cataagtctt 1380ttagaagtat tttctatttt
tgaagtaaat gtggatcaaa accaccccaa gacaggattg 1440aaaaaaagac agtttttcgc
aagaaagtaa ataattttat ttagcttggg actttaaatg 1500atatgtctta aatgtaaaca
tttctatact gcattttggc catcttttga 15502952828DNAHomo sapiens
295gtgttgcacc cattaactcg tcatttacat taggtatatc tcctgatgct atccctcccc
60cctccctcca cccctcaaca ggtgtgtgat gttccccttc ctgtgtccaa gtgtcctcat
120tgttcaattc ccacctatga ctgagaacat gcggtgtttg gttttttgtc cttgcgatag
180tttgctgaga atgatagttt ccagcttcat ccatttccct acaaaggaca tgaactcatc
240attttttatg gctgcatagt attccatggt gtatatgtgc cacattttct taatctagtc
300tgtcattgtt ggacatttgg gttgattcca agtctttgat gttgtgaata gtgccgcaat
360aaacacacgt gtgcgtgagc ctttatagca gcatgattta taatcctttg ggtatatacc
420cagtaatggg atggctgggt ccaatggtat ttctacttct agatccctga ggaatcgcca
480cagtcttcca caatggttga actagtttac agtcccacca acagtgtaaa agtgttccta
540tttctccaca tcgtctccag cacctgtcgt ttcctgactt tttaatgatc gccattctaa
600ctggtgtgag atggtatctc attgtggttt tgatttgcat ttctctgatg gccagtgatg
660atgagctata gaaatccttt ttagaaacaa cagagccttg ttgtaaaaca ggtaaatgta
720cgtgaggact tcaaaaagtt tgtggaaaaa tggaattaaa agataaaatt taaaaacaca
780ttttaaattt atttcccaac ataagctcct caagttcaag acacttttat aaatgatgat
840ctcagctgtt tagttcatcc gtaaagaact gagggtacta gaaattttac catgtcaatg
900cagtctcttt acattactaa ctaaagaaaa ataggtgctc tttaaagatc ttttaagatt
960aggaacaaaa agaagtcaga agaagccaaa tcaaggtgga tgcttaacga ctttccatag
1020aaacttacaa aattggcctt gtttgatgag aagagcgtgc aggaacgttg tcatggtgga
1080aaaggacttt gatgatgctt tccctggcat ttttctgcaa aaacttggga taactttctc
1140aaaacactct aataataagc agagcttatg ttctttatcc ccccagaaca tcagcaagca
1200aaatgcctga acatcccaaa aaactgttgc catgaccttt gcccttgact ggtccacttt
1260tgcttcgact ggaccacttc catttttggt agccattgct ttgattgtgc tttgtcttca
1320ggatggcatt ggtaaagcca tgttttggct cctgttacag ttctttgaag aaatgcttca
1380ggatcttgat cccttgttta aatttctatg gaaagctctg ctcctgtctg cagttaatct
1440gggtgcaaca gttttgtcac ccatcaagtg aaaagtttgt tcagctttaa tttttcagtc
1500agaattgtgt aaactggacc aattgttgag atgcctgtag tgttggctat tgtttctgct
1560gttagtcatt agttctcttc aattagggaa tgaacaaaat taatttttcc tgaaaaattg
1620atgtggatgg tctgccgctg tgggcttcat cttcgacatg gtctcatccc ttgttagaac
1680aagttatccg tttgtaaact gctgatttcc taggagcatt gaccccataa aattttcata
1740aagcatcagt tatttcatta ttcttctatg cagacttcac tataaatttg ctgtttggtc
1800ttacttcaat tttagcagaa ctcatactgc tctgacatct aaactgatgt cttagccttc
1860atagtgtctc tgactagatc ctattcagac gtgttatagc aaattagtaa agtttatttt
1920ggtgccaaaa acttttgaat ccacgcatag ttttttcaca acacattttc catgaacttt
1980ttgaagaccc ttcatatatt ataagaagaa agttaaaaat atcccctgca tctactactc
2040agaaataacc actgttaaca ttaagtctgt tctcaactct aggcattatt gagggttttg
2100aggacaggtc ttgaaaattt ctatggctac cttttactgg gtggagacta gcatgtatag
2160ttgaccgcat aggttaatcc ctccactcaa aaagccacaa ttttaaagtg tagtattcac
2220tagcatttag tatattcaca gtgttgtgaa atgaccacca ccatctagtt tgaaaatatt
2280tcatcacaac caaaagaaaa cctcatatct attagcggtc tctcctgttt ccccagacac
2340cggcaaccac taatgtactt tttgtctctg tggacttgtc agttctggac attttatata
2400aatggaatca tgtgaccttt tatgattgac ctctttcact tagtataatg ttttagaggc
2460tcatccacat tgtagcatgt gtcagtactt catttccttg tgtattggtc cattcttgta
2520ctgctataaa gaaatagctg agactgagta atttataaag aaaagaggtt taattggctc
2580atggttctgc aggccgtaca ggaagcatga tgatggcacc tgctcagctt ctggggaggc
2640ctcaggaaat ttaaaatcat ggcagaaggg gaagtgcggc gtcttacatg gtggagcagg
2700agcaagagag agaaggggga ggtgctccac acttttaaac aactagatct caggacaagt
2760cagtcactat aatgagaaca gcacccaggg gaaaccgccc tcacgatcca gtcacctctc
2820accaggct
28282961379DNAHomo sapiens 296cacgatgcca gtccaattct tgtgtagttt tttaatcagc
tgaatttaac attcaaattc 60ttcttttaaa tcttccaata ggcagttatc tttataaaga
tcctatataa tcaagacttt 120gtttctgaat attttatgta tgtttttgct actgtaaatg
agatctattt ctcattgtgg 180tttcttgctg ttattactgg taagaattta gtgaaacaaa
gtacttaaga gtatgtcttt 240aaattgtgag attttgatga acttttaaga aataaaattc
tttagtttct tagagctttt 300tgagatttct aaggtagatc cttggtttgg gcaacatata
actattacaa gttttgcaca 360ttgaacgtta tttggtaatt tttagagagg acattttaaa
tgtttaggaa aaatataaat 420aaaatgtaga atactattgg gggcatatac atcatcagca
ctgtaactgt ttcatatgaa 480tcatttttgt acatatagaa ctctaaagtc ctaatgaaca
gaattttaca tttctataaa 540tagaaagtcc ttaatagttg tgactgaata acttatggat
agcaaattat ttaactgaaa 600acagtaaaat ttaagtggga ggaaatattt gctttataat
ttctgtcttt acccattatt 660tataggattt tgtcactttg ttctgtttgc aggtggaaaa
ccatgaattc cttgtaaaac 720cttcatttga tcctaatctc agtgaattaa gagaaataat
gaatgacttg gaaaagaaga 780tgcagtcaac attaataagt gcagccagag atcttggtaa
gaatgggtca ttggaggttg 840gaataattct tttgtctata cactgtatag acaaaatatt
gatgccagaa ttattttata 900agttccctgt ccccaagatg atgacttcac atctctgtca
aacagaaatc gcccaacagg 960cccttgtatg atgtcattta aacaagccct attttaaatg
tcacctccac tggtaacagg 1020atactcctag gaggatcacc aagcccaatt cttctaggag
tagtgcattg attaggcttt 1080ggggtttcca agcagttcat taatgtcact tttggaaaaa
gtctgtcttt cataccagct 1140tattaattcc ctatgggttc acacggtttt ttttcctgga
ttttcatcaa acatgtgtaa 1200ggtactcagt acaaagaagt ttagaaatcc agaacaaagc
agtgtattta agtagtagta 1260aacttccaga taatctgatg cccatatcta catatataaa
aaatttgcaa atagttctgt 1320agagagtcca aacatggagt agatccctaa ttaagagcct
ttgcattaaa gtccacctt 13792972455DNAHomo sapiens 297ggagtgagag
cgacaccttg tctttaaaaa aaaaaacaga ggaatgcatc atagtatata 60ttaaattatt
gcctattttt ttatctattt tattgagtgc taataagaaa attaatggca 120aaaacttgtt
ttttacagta taaattaagt ttaatttcat tttaaaatta agtaaatttg 180ttttattaaa
aagtatgttg aaagcaacat aaatagcact caaattgaga cagaaactgt 240aactgtagta
taagaagcat taggctggga attgggaaac acgagttcta gttgcagctt 300ggaaactttt
tctgaagctc tttacaaatt acttaatttc tctggttttc accacattgt 360tctatagcat
taacatgttg gattcattgc tttaattctt agacctacgt gtcatcagaa 420atgccattac
actttgagga tttgagcctt attttaaata aagttgtgat cctcatggca 480gcctaggttt
acatgtgtta aataaacagt attctgtaaa taccattgtc tttcatgttt 540agtgatgttg
ctgttgttaa cactgcagtg aaatgcatat ataagcaaac tacattacat 600actcatgaac
atggtccttt gttttgaaac tttgatcact gattgttcgc agtctttcat 660tgtggaacta
ctctttcact ttgaatgttt tgagaggttc ctttgttcag atcagtccga 720tttcgtttct
gggtgggtct ctactttccc ttttctcact ggtcaagcga ggtctgtcta 780attgtttgct
actactaaca tttgatggcc acgcttcagc aagtacattt gtagattctc 840tctctctgtc
tctcttaatt tgtggtctag agatcatatt ggttaatgaa attatgaaga 900gggaatgtat
ttataaaaac tcaaattctt gatgcagaag gtctagctga ttgtgaaccc 960aaaatatccg
agacaggtca caaccaattt agaaacttta ttttgccaag gttaaggatg 1020catccatgac
atagtctcac aaggttctaa tgacacatgc gcaaggtggt tagggtacag 1080cttggtttta
tacattttag ggagacatga gacatcagtc aacatgtgta agatgtacat 1140tgattctatc
cagaaaggca ggacaacttg aagcaagggg ctttcaggta ataagtagat 1200aagagacaaa
aggttgcata cttttgagtc cttgatcagc ctttcactga ataaacaagc 1260ttagtcttgt
tagtgaatct gcgtttttac ataaacagta ggtcagagga agcaatcaga 1320aatgcatttg
tgtcaggtga gccgagggat gactttctgt ccctcacctg tgaagataag 1380ctatcagttt
ccattgctag ggtgaaattc aacagaattg tttgagagtg aacatctgga 1440ggcccacaag
gactttcctt gtggagggga agtatgtagt gagggaagta tgtagttttt 1500aaatctttgt
cgctatctta tttagaaata agatggaagg caggtttgtc tgacatagtt 1560cccagcttga
cttttccctc ggcttagtga ttttgcggtt ccgagattta ttttcctttc 1620acatatcagt
cagatcattt ggtttgtgaa gtttcctatg cttaacagaa aatatgtgca 1680ctagttttcc
tagagtttca ttgtcagagt ctcaagtttt tgtttggaaa ttgtatttgg 1740tcacattaat
tatactctat gttagttcca aagaaatacc tttggttaag aaaagaattc 1800tcatgcataa
ctcctcgagg gtggggttac accttaatcc atcctcaggt gctcatggta 1860attggggcaa
atatgttgcc cagtgctggt gctctgcagc cttggatggg tttacccaga 1920aagcagcttt
caagtcagaa actaacattc ataagggagt taaggatttt ataaatagat 1980atccataatt
catgtagttt tcaagtaagt agtatttgaa tcttttctgg ttagataata 2040attgtgagta
tgttgtcata taataacagt atgtttttca ctatttaaat aattttagaa 2100ttacattgaa
aaatggtagt aggtatttat ggaatacttt ttcttttctt cttgattatc 2160aaggcttgga
ccctggcaaa cagattaaac tggattccag tgcacagttt ggatattact 2220ttcgtgtaac
ctgtaaggaa gaaaaagtcc ttcgtaacaa taaaaacttt agtactgtag 2280atatccagaa
gaatggtgtt aaatttacca acaggtttgc aagtcgttat tatattttta 2340accctttatt
aattccctaa atgctctaac atgatgtgaa tgttctatga taagttttac 2400taatgtagtc
atcaggtaag agtcaagctt tcttccatag agcagtcagc tgtcg
24552982194DNAHomo sapiens 298cacaatggga aaggatgtag caacacattt taaccctatg
ttgagtttta ggtgggttcc 60tttgaaattt tgttaaggct aacttttgtt aattttttta
aaaaagtgta aattaggaaa 120tgggttttga attcccaaat ggggggatta aatgtatttt
tacggcttat atctgtttat 180tattcagtat tcctgtgtac attttctgtt tttattttta
tacaggctat gtagaaccaa 240tgcagacact caatgatgtg ttagctcagc tagatgctgt
tgtcagcttt gctcacgtgt 300caaatggagc acctgttcca tatgtacgac cagccatttt
ggagaaagga caaggaagaa 360ttatattaaa agcatccagg catgcttgtg ttgaagttca
agatgaaatt gcatttattc 420ctaatgacgt atactttgaa aaagataaac agatgttcca
catcattact ggtaaaaaac 480ctggtttttg ggctttgtgg gggtaacgtt ttgttttttt
tttttttttt ttaatcttgg 540agtagaaata tatttaaaat tgatggagaa aattcccagt
tcttaacatt agaaagggaa 600tatattattc ttaccagtta gtaatctatt cacatttggt
ttagagggaa gatttagaag 660gtgagataaa agcttgtgag agaatagtgt attcatgtga
aacttcttcc atgggttcag 720agcatttaga aacaaacatc ccttcacact caaagcttac
ctttgagcca gtcctccaat 780agtgaggtct ttgaaggtca ggccaaattg gctgtgggag
gacctcaggt taggatagga 840attattttaa gacatggcac tatattcatg tgaaactcgc
aaaaactagc cttgcatata 900ggctcatgta tcatgtctca gctgagatgt ttgagagatc
ttaactagat tctagaaaac 960aaaaaaggaa gtagttttgg ggcaaatata tttgggaaac
agtttattgt atttcctttc 1020cccaaatgga ttttcaagtt cttcatataa tctaacccca
acaaataaat tgcctgtttt 1080tcaaaagaaa gatcatgtct tcaggttttt gtgtggggtt
taaatgattc gaaagatttg 1140accatactga tacattcact agtaacctta gttactaatg
agtaatggtt ttgagttaat 1200cagttaggcc tgaactactt ttctggaagt tagtaaatta
tctcacaggc agccctgtga 1260gccatgggaa aatgtgtata tggtctttct aggccacagt
caaattacag gtatatttgt 1320catggcttct cttgatgaaa ggcccagtat cggtttgtct
gaagatatat aatagcattg 1380cttttggggg taatatgggc agtaactctg tccacatctt
tgggcaggct gtggttctgc 1440ctttatatgc tatgtcagtg taaacctacg cgattaatca
tcagtgtaca gtttaggact 1500aacaatccat ttattagtag cagaaagaag tttaaaatct
tgctttctga tataatttgt 1560tttgtaggcc ccaatatggg aggtaaatca acatatattc
gacaaactgg ggtgatagta 1620ctcatggccc aaattgggtg ttttgtgcca tgtgagtcag
cagaagtgtc cattgtggac 1680tgcatcttag cccgagtagg ggctggtgac agtcaattga
aaggagtctc cacgttcatg 1740gctgaaatgt tggaaactgc ttctatcctc aggtaagtgc
atctcctagt cccttgaaga 1800tagaaatgta tgtctctgtc ctgtgagaag gaaaagtata
tttgcagatt ctcatgtaaa 1860aacatctgag aatgtttgtc ttagtttaat agttgttttc
ctgtggactt tatatacttt 1920gtattgtctt aaaagagtga ttgatggtag ctacggaaaa
ctttgatttt taaaattgtc 1980tctttaagta gacaatttat aagctactgg tacgagttca
ccttataaat ctccactacc 2040atgtttttgc ttggactgtt cacacttcct ggaatggtcc
ttcttgccgt ttatccaact 2100tctttctaat ttttaagtcc ctaatgatgg gaattctatt
tctgtagtga tttttctggt 2160catacgaccg taaggtcatg ggtgtttttc tctg
2194299571DNAHomo sapiens 299caccgtgatc ctccttattt
cttagtatct tctaaagaac attaaatata gtaggtgcct 60agtaaattat gtattgattt
aacttctttg aggttctgtt gtttgtgaag aattataaaa 120gcaatacaaa tgtttgtata
gtaattaagc aacaggttaa tattcatgac ttaaaagatt 180aaagaaataa gcaaaacatg
ttagctggca actcacagaa aaagaattaa attgccaatg 240agcacacgag cacatgaaaa
attagcaaaa gtttcacccc tttacatata tttggttaaa 300attgagaaaa gaatagtaat
agatggtatt ggtaggactg tggcaggcac acaatttaca 360tgaccaccaa aagtgtatgc
aggtatccat gtcaccacac cctggtctca tcttcattca 420gttttattta ttttttttaa
tctcggccta tttgattggc acgaaatgaa tgatagctgc 480cttatttgga attcctttga
ttactactag tgtgcttgat aatgtaaaac aatattcaaa 540atctgttttt cctttcatcc
gttgtttgtt c 571300497DNAHomo sapiens
300gtggcatatc cttcccaatg tattgtctta attttgtttt tgtatgtgta tgttaccaca
60ttttatgtga tgggaaattt catgtaatta tgtgcttcag gtctgcaacc aaagattcat
120taataatcat agatgaattg ggaagaggaa cttctaccta cgatggattt gggttagcat
180gggctatatc agaatacatt gcaacaaaga ttggtgcttt ttgcatgttt gcaacccatt
240ttcatgaact tactgccttg gccaatcaga taccaactgt taataatcta catgtcacag
300cactcaccac tgaagagacc ttaactatgc tttatcaggt gaagaaaggt atgtactatt
360ggagtactct aaattcagaa cttggtaatg ggaaacttac tacccttgaa atcatcagta
420attgccttat tctaagttag tataaattat tgatgttgtt atagaaccca tttacccctt
480aattcacagt ctggggg
4973013662DNAHomo sapiens 301gatgcagatc agggaaatgc aagtcaaaac cacaatgagc
tacaacttca cactgattac 60gatagttaaa atcaaaaagt cagatggtaa gtactggcaa
ggaagtggag aaattgaaac 120tgtcatgcgc tcttggtgcg aatgtaaaat ggtgcagctg
ctttggaaaa cagtctggca 180gttcctcaga caattccact ccaacgtata tccaagtgga
atcacaacat atgtccccac 240aaacttgtac ataaatgttt atagcaggat tattcataat
agccaaaagg tggaaacaac 300ccgaatgtcc atcagcagat gaatgcataa atgaaacgtg
gtctatccat acaatggagt 360atattattga gccattaaag gaatgaagta ctggtacatg
gtgcagctta gatgaacctt 420ggaaacattg tgctaaatga aagaagctgg ttacaagagt
caacacgtat gatttcattc 480atgtgaaagt tcagaataga gacagcagta gagacaaagt
agcagttcag ggttggtgcc 540agggaatagg gggtaggtgg ggtgaaagct aaaggatacg
gtgtttcttt gtgagatgga 600aattctaaaa taggtgatgt ttatacatgt ctgtgaatat
actaaaaacc attgaattgt 660acacattaaa tggatgaatt gtataggaat tatattttaa
taaagctatt taaaaaaatc 720cagacacttc acccaagagg aaatctaagt ggtccataaa
catgaaaagg tctttaatca 780ccagtcagaa aaatgaaaat gaaaaccatg ccaggccacc
tcccaccacc atagtgacaa 840gcatttcaag tgtggcagtt ccagctgttg ttgaggatgt
ggaataacac tggtaggggt 900gttaagatta tctggtgaaa ttgaaaagac gcatacgacc
cagcaattct gctcttaagt 960gcatactctg gagatgcttt tgcccattgt gctgcgagat
gtatacaaga atgttcctaa 1020tacctccaca ctggaaacaa ctcatcagtg aaaatgaact
acagctacac aaaatgacat 1080agatggaatc ttaaaacgtt tagtaaaaga aatgatacaa
aaggatacag tttttttttc 1140atttatgtga agtttaagaa taggtggtat tgtttaggga
tgcagtcttt gggatggcaa 1200ctgtaaagaa aaagtgattg tgttaatcag agtgattgtc
tttagggaaa tggagtgctg 1260atggggaggg ggcacattag ggcttctgga gggccacagt
tctggttttt aacctgagtg 1320gtggttttgc acgtgcttgc tttatagtta gctgcaattt
ttttttttaa tgcagttaaa 1380gtttggtatg agaacaaatg tatgaccgat gagtcctttc
agtttaccaa gttctttttc 1440gtcatcgtta atttagagtg ggttacatca gtttttcttt
tctggctgcc aaaggcttag 1500gaaaaaggca aactgacaga ggaagatttt aaatgtagaa
atatttattg gtttacaaat 1560ccttttaatc acttatacat gaaaagcttt catataattc
aaaaagcaaa ttttaaattc 1620caatgaaata gttcatcccg tggttgtgaa agagtgtttt
tagattgctg cacagaagca 1680tgtttaacgt ggaaatcagc tcatggtttt agttgttagg
gctacaagaa attgggggag 1740acttcattcc aagaaaacat gtagtctgtc aggctgtttt
cattcctcta aaagagacag 1800ttttctaaga tgtttttgaa aatgagaaaa tacgtaatag
atctgcttaa gaagtttcaa 1860acttaatctg tgcttattac atgaatatgc taatgtaaaa
ccaggccttc agttagtgtt 1920tccttccttt tagaatggtg tatgtaaagc aaaatataaa
ctaatttctg acctgtcaaa 1980ggttttttct taaaatttaa atttataatg tggtttggtt
tttctttccc actcaaacat 2040gaatttgggt aataccagaa taaagctgga tatataaatt
ttatccaaaa tttagaactc 2100tgttgttaag aaatctgttg accacataac catgtttctg
agaaaataca tgattttttg 2160catctttaaa aaaaattagc actaagaagc taagatgaag
ttgtttttgt aatttgattt 2220ttttttcctt aaaatactgt tttggagtta aaagttgtag
caaaactggt ataagaaaga 2280tgttttaaga tatatttaag tcttgtctca tactctattg
actaagctag cccggtgact 2340agggtagatg tatttaaaga ataacttttc ccccttaaaa
tctcaatatt ccacatcctg 2400ttagacttct tgagtattaa atacatcttc tatccttggt
ctttctgcat ttagcttttt 2460tgggaagtat gtttttaccc aagcatatgg tatgagctgc
tgattcagta ttgagtggct 2520ctttaagctt gttagttaca ttctgctgat taaaatggtg
tacagaatag tcaggaaaaa 2580ccagtccctg gtctgaaata aacaatgtta attagcttat
ggggaagaac aaatgagtaa 2640ggagaatttt catatacaaa ggaaatctct gattgtcttt
ctggactcag tgtgtttggg 2700ttaaggagat agggtgcggc tggagaaaat gatgaaaatg
ttcagaatgt tacatgtatt 2760tttacactga aactggaagt ggaagcccag tgtgatagtt
ttctgcccga tgttggcctg 2820tcttcacacc cacaccactt atcttgattg atagagctac
tacttcctct tatactgctt 2880cagaagttaa cctctgtggt gcagtgctag gatatcacag
aggaaataat cccttgtaga 2940cagtgtcttg ttgctgggag ttatcagtgc ctcctgttct
ctctaaggag ggcaatggga 3000agccctttcc ttgcatttgc tacagccgtt tccttgacct
ccctggaaga acagtatttc 3060atggtgtcag acaacattca gaacatgctc aaattaaatg
tatgtcagta tgcatttgct 3120ttggtgtgtg gtttgtccaa accaaagtgc ccatacatgt
ctctggtcca gccatgtggg 3180aaattcagca gtggggtgaa ccatatggaa atggcaggtg
ttgggcagcc ttgactggac 3240tgccctggtc ttacttctgg atttggtgag tagaagatca
cactgttgct tgctgccctg 3300ggttcacctc aaagagggaa agaagaatta gcaacttaaa
tggttaaatt tagaaacaag 3360aaaaagttct gtcagtgggc agtttcttac gtctaacaaa
aaaaacaaca gcagtgaatt 3420cttttgtgtt cagaattaac cagtaaacac aaccttagca
attaacctat cactatcacg 3480attgtttatt tcctggaatt ttgttgacag aattagtcca
ataaatgtta tgaataataa 3540ttttatgaat agagataggt taatgccaaa ttaagtataa
ttgaatctga gacattaatg 3600caactgtttt aaattcaacc actgacctgc aatttttata
tgccttgtcc atccagcaag 3660at
36623021498DNAHomo sapiens 302cttaatcctg
aaaggcaggt gcttttatta tcttttatct tattatctac attttccaga 60tgaggaaacg
taggtacaga ggtttagtaa cttgcccagg tcacatagcc agtaagtggc 120agagctggga
tttgaacccc agtaccctat ctccagcgaa tctgagatgt acatgtgata 180aatttaatct
ttctcaataa attattaagt gtcaaagcaa gtggtatggg caatgcacca 240ggattaagaa
aaacagtgtg tggtaaagat gtaaaatatt tctaattctg ttgtgggctg 300tggcactccc
gtggaaggct tgccacagac acagccagag gcatccacgt gggcccctgc 360tgcacacctg
gtttgctgct accaaggctg ctctcccgag gcttgttcac acaaaggaaa 420gtgagcagct
aggaagctgc atatttgaaa gttgactagt caccaaatgc tggcatccaa 480ccaagtgatt
gcattgtacc ctgtttggat gaaagattgt gtttaaatga aaagagagat 540gatgagccag
aagtgtggca aatgagttaa aataaattgt cagcagtgtt tgaagcaggt 600tgctgagggc
tggtgtcctg aaatccggtc acttggagga tgtatatgtt ccatcagggg 660ccggaaatgt
tttatccaag ctttagggaa taaccctgga gattctcttc gttactctac 720tgttaagtac
gtgcttacgg agtaaacttc gcatgactaa ggtttacagg cctgaatgtg 780caactgagtt
caagtaagca gcaatgtggt gtattaggaa gactgcttga cttgggttct 840aatccttgct
tcaccaccta gctgtgtgac tttaaacatc actgtttttc ctcctgcctt 900ccttctgtaa
cttaaggggg ttggattatt agagttctca aatgccatac cttcaaggcc 960aggtgcagga
tgcagagaat agtgggttaa agtgaacacc tcaatgtaaa atcattcaaa 1020aatttaaaaa
catcacggac caaacaaata tgtctttaaa tctgaatttg gttaaaggtc 1080acaagtttat
gccctttgga gtactctctg acattttcat gatgatatga aaggattttt 1140ccatacatac
tcaaaaggcg ctcacgcctc tgttgcagtc agtctggcca cttccaaata 1200gccaccccat
gttggtctcc acttcttccc tccctcttta agtgctatgt taataatcta 1260gcttataatt
ctctaatcag cagtagagca ctttgctact ttatttttta ttgttagggg 1320tgatcttagc
agcccaagta tgctaagtct tagaaatatt catcagtgat gtttttccct 1380gaagctcgtt
tggtgactgc taaactagaa ccagaattgg agaaaaacga ccctgtgaat 1440tccaagccaa
caaagccggg gaagaggcat tgagcaacct gtggttgcct gagaaaca
14983032415DNAHomo sapiens 303gaaaccacag aatcgccttc ctccccagtt atttatactt
caagtcatat tgtagagaga 60aaatttctgt cagcaaaaat ctcaggaatc ctcctcattt
ctatttgtat ggctttcaat 120cgttgacatg attttttcac atatgtcatc ttctggggat
ggattcgtat aaccctgctt 180cacttgcttc cctgtgggag gctcacttgc ttctcgacag
gctctggaag aactaggcag 240tctggtacat ggttgtgcaa gaacccttga gggggccttg
gagtgtgtgc ttgggccctg 300gaactcatgc ctaggatgga gggctgagat tgccccttcc
catccaccag ggagttgaca 360agggggagaa gaaacttctt gtgagcttgc gatgacttgt
ggcacttgca tcagaccttg 420gagttccctg gggagaggca ctcttgggta tgacactgta
tagtgccacc tgattgccat 480ttgacccagt ttggccctgg atccttgagc aagagggctg
gaaagaaaga caggcccact 540ttttgggaca ctattagggt ctgtagcatt ggtggggaga
gaattccccc aacccccaaa 600agagctgaaa atgagacacg cgtggagggg tgaaagtgga
gtgtggtcaa cagtgtggtt 660acagagatgt gtgtcggggc cactcccact caccagggag
actcatgaag cagaagggat 720ggggcacaat gtggcttcca taggcacacc aagccacctg
gagagcgcat cagccctttg 780ggtaccccca agcggaagga ggttgggtct ttgggtctgg
gaactttggt gcttgttctg 840gtgggaaggg cagggagtca agaccagctg tgtcttccac
tgctcttctt gtccactttg 900gttactggcc tctgttggca tgaactgggg aggcagaggc
tacctacaga cgaggaactg 960tgtggagtgc gagtgtatgc agtaaagggt tagcttagct
gacttgaggt actcacaccc 1020atattccgaa gaaaagactg gccctcagcc tgagcctccg
aaataatctc taagccctta 1080gaataccctg ctttgtattc aaagagtatc tttgaatgct
gaacttagaa ccactctaga 1140aaatgtatgc taacaatgcg atttatgatg aacacttgtc
tttgttcccc tggggccctg 1200ggccacattg tatcagtttg agccctagag ggacagagaa
tgagaaacta agatcagtca 1260tgcaggtgct ccaggcctat gtgaccaacc accaataaaa
accctgaaca tcaaggctca 1320agtgagcaat acagctggtc ccaacttaca gtggttcaac
ttgtgagttt tgcactctac 1380aatgggttta ttgggacata acccagtgga ggaggatctg
tacttcattc acatgtgttg 1440tcacatcatt actgggagaa ttaagcactg tccacgtgaa
tccactggga gaggataact 1500ggaagcttgc acctggcttc tcctggattc tgctctgtac
gcctttttcc cttgttaatt 1560ttaatctgta ttctttcact gtagtaatct acaactataa
gcagaatagc ttttctgagt 1620tctgtgagtc tttctagtga atcattgaat ccaaggtggt
cttggggacc tctaacaaaa 1680gatgtctgga cctgaacttc ctgttgtttc aaagatccta
tagcaggctg tcttaccaac 1740tttcagcatc aagaagctgg tggagagtgg gttagtttaa
aaatgaaact ggggagagag 1800atgaagccgg gggaagatgc cgtgaaatct caccttatag
gcagcctctg attcacctga 1860gggtttttcc ttgaatactt tctgggtaca agtatttgag
acaggtgatg tgctggtcac 1920tttattctca gctgcttgtg gcctagccct aacatgggca
ctggaaacaa tgggggtagg 1980ggttgatgat ggagaaatgg ggagtaaagg gatttaaaac
tttgaaaaac tgagctgttt 2040ccatgatttg tctcttttga ttctcacaaa acctttatga
aatatgtgct gacattttaa 2100gctctcactt atagtgagaa aagcaatctt cagcaaggtg
atgacttgtc caagggaaga 2160catggtcgcc cttgttcctt gggagatttt gtgctcccag
gggaaagcat aagccctcag 2220gagccatgat gagaacagct gtagaacagc aagtgaacag
gtgtgtatca gtcaggatag 2280gcaaggctaa gctgcagtaa taaataatcc ccggatctca
gtggcggaac attgaggagg 2340tttatttctt ctttatacaa atatgctgtg gatcaggatg
actctccagg caactgtctg 2400tgggactgtc caggt
24153046766DNAHomo sapiens 304cagtgctttt gcatccttcc
tgttaacttg tgtaggaata aaacattgtc acaataagat 60ttttttcctt tttattgttt
tgatttttta gccaatgaga aggaaaattc cttattaggg 120agggcgaggg tgaggatatg
tggggtgggg agaagcgaac gttccaagtt tcgaaaacag 180cgactctctc ttggactctc
tagccagtag aaacctccct cccactctct tgccccaaga 240tctggtgctt agaagagaat
caagggaagt tggaacccag aagacggaga cagattgagg 300gactgctgtg aaatgttggg
gtgtttggtg aataatatta gaagttgggc tggcagagac 360cctgtcacat aaacattaaa
tcaacactgg agactgagca tttgttagaa atgtaagcgg 420gaatggcaga aaacttgttt
ttaagggaaa gcatgttacg gcttatgttc agcctccatc 480ctctgaaggc aaaagttagc
aaagttgatg tatggcgttg ctttttctgg gaactttatc 540tcgtttggtg gggttcccat
ctctgtctcc caggagccaa gactttcccc tccctctgct 600ccagcagaag ccagtctcag
gcaaggctcc ctgtacctca tttacacttt ggtgtgaata 660tgttattgta acctctctcc
tggaggtgtc tgcattccaa gactgaactt ttctgtgaaa 720gttactgtca ctgtgaaagg
cagttcagcc cccagggatt gaaaaaggaa atcattttgg 780gtaaggggac agttagtcca
gattttttca gttgcaagta aacctaactc agccagtagg 840caaaggggga aattgctggt
ttgaactggt gggaagaaag ctgaggaaac tcctacactt 900gggggaagaa ctgcaggtgc
ctggctgcag ggaacgcagc gggggctcag gaccaggcag 960atgccctgcc tctgcttccc
ttggcacagt ggcctccttc tcccttcaag taggcagatg 1020ctgcctgtgg cagaggacag
cagctgattg gcagcccagc agggaggatg tggtagacag 1080gcactgagca tctcttctac
cctccttcta gagggctatc ctgtactgtt gaggctaaaa 1140gactgaaaac cacatttccc
agcctctctt gcagctacca atctggatga gagttagatt 1200ctacacatta gatgcacttt
agcaagattt tcaaaagcag attggagaag gagcccatgc 1260ttctgctggt tttttttgct
ggcaagtgag gggttctgtt tttcctggag tgactttatc 1320atggtggcat ctgaaaaagg
ctatttcttg atcagagaga cagcaaccct ctcagtgacc 1380tagttctgtg ggtgtgtctc
tcctgagagt taatcccaga gctcaaacta gagctcaacc 1440ctagagtctc ttcaggcttc
ccaggggtgg gggtgcattt aacagtccaa gttaaagaga 1500aaataaaggc cattaaagac
caaacattga gcactgagtg aaaaagtttt attgccaaac 1560aggaaacctg attcaggcca
gggtcttgga aggttgttca ggatgagatg ggggaggtga 1620aatggggtag gtctttgaaa
accaacagat tgcaaattct ctgtcccata gcaggaaacc 1680acagtctctg atgtcagctg
gctgccaaca cgtcagttgt atcagcatta gctggctgga 1740ggtggcctgc tgtgtgcaga
tggtacctgg tgcaggattg tggtgtccag gtgtctctcc 1800ttagcacata agaccctgtc
cgaggactgt ggcatgacgt gctggagtca cgattctgtc 1860acccagtcag gtcatcagtg
tcagagagct aggtggccag gttggagttg attgccaatg 1920ataggtcttt ttctgcttaa
atcagctgga ctggattcta ttgcattaac ttgaccctga 1980ctcatgccgc caggcctaat
ttataaacca agacaagaaa gggctactcc accccctcca 2040atttgtgtaa ggccagggga
cttccccccc actccccaac ctgaggcatg caccctccct 2100tagatcaatg gctgtttctc
tgagaatgcg gaaccgtgat taatccagcc ttgatgggga 2160ggcagcagga actgtaggca
ttctcacttc acacccatcc caatcccctc ccccttgctg 2220tcctcttgta cagaggactg
aaagcacaac actctctccc tccctccctt ataggtggtg 2280acgatcatgt gactctcttc
tggtcaatga gatgcagcag aaagtcctag ggaggtctag 2340gaaaagtcct gttgggagag
agcatttttt accttctccc tgctacttct tgctactagt 2400aacatggatg tgagccttgg
aggggtagct accatctggc acctggggtg gcaagccaac 2460atggaaagga tggcagagcg
ggaaggagga gccagcctta ccgatggcat cactgtcact 2520gcgctagccc cagaccacct
gctccagagt tctggttatg gtaatgaaat aaaccttgat 2580ttttattcct taaaactacc
cttcaatggg ttttctgttc attacagttg aatgctttca 2640taactgatac aggagggacc
ctgtgattgg cagttccact agactgcatg gagatgggtg 2700gagttatcta aaagaacaga
gatagtgtcc ctagaagaag gggacaggaa agcatcctgg 2760gtacacaaaa gtcaaggctc
caggatctgc cctgggggct atctcaacac ccctacactc 2820tcaccgcacg tatttggtca
gctatgaata tgaccaactc tcgtcgttta tctctattca 2880gtggaacaca gcagcactgt
gacctgccca cgagaagaag gatttttaga acttatctta 2940gggcaatttt aggtagagga
gcagacaaga tggtgtacag gagaaacagg tctattaacc 3000ctggtattaa tattaactgg
ctgcccagaa taaatgaaga atagcttatt ctttgccagg 3060ttgaagatag aaaaggaatg
aagggccgga gaagtacagc tgggtgaagc acagagcagc 3120ctagtgcttg gcatgggact
cagatctgaa gcagcctctc cgggacttct ctgagcctgc 3180ccctggtggt atgactgtga
tatccctgct tctatagttg gcaaccaaca tgtcctagct 3240cctagaccat agagggccag
attcatgtct cattgactgt gtaatctctg tgtggcccag 3300tacagagcat gcacaccgta
ggttctcaca tatgtttgtt gagtgaatga atacaatacc 3360aaacgaatgg acaggacaga
gctgtgggct agcaggaagg atatctggct tttgcttgaa 3420ttagctagtg aattgctgtg
tggcctcctt actgagcctc atttccctct gtctgcagag 3480tcaagcaaat cttccatttt
ttgttcccct gctgccagag catggcagag taaatgtgtg 3540agttgaaggg agcaacctca
tgaggttttg ctttgtgtct taattacagc catttgtgga 3600attaggcttt taatataaat
atttgtgtgc ctgcgcctgc atatatgtat ttggaccaat 3660gctctcatgt gtgcaaatac
atgtattcta aagaaatctg tccagaaccc cagcatctgt 3720ggtgtctgtg gtgggagggg
cttccatatt acagagagat gcccacagtg catgacgtta 3780cccgcacagg tgtgacatca
cagggtaacc aaatgctttt gccctggggg tgggagaggg 3840atgggtgcac ggtgaacagc
aggtgggggt ctttccatag gggatgagga agacaaggcc 3900acttggaggc agaggagacc
acagtggggc atgatggttg gggaaggcct tttacttctg 3960ccccttaagg atgccctgga
attcaggctt tcggatccca gagctctcat tagagcagcc 4020ctgcgttgta gacttttctg
cagtgacaga aatgttctat atctgtgcta tccaatatgg 4080tagccacaag ttacatgtgg
ctattgaaca cttgaaatgg ggttagtgca attgacgagc 4140tgaaaatgta gtttaaattc
acttacattt aaatagctgt gtgtggcttg tggctgccta 4200ttggactgtg cagttctgga
gaatggtact ttacttgtcc ttggggaagc agaaacaaat 4260gaaaacgagg atctggagct
catgaagttt ctcatggggt ggggtatgtg tgttgaagct 4320gcaccttcag caggaacctg
gccagtcctt agtggaggac atttctttcc atcctgcatc 4380cagatggctg gtcctgctcc
tcccagtcca tggagaaaaa agaattgaac aaactgtcta 4440agctgggtca ggtactctgc
agatgtttgc tgagtatcgt tcttgatgga aatccccgtg 4500gaactcctac attttctcct
ctcttctcct tcctttcaga acctcagagt gacagagcca 4560aaagaccagt gcctcatttt
gctgacatgg aaaaggaaac ttcgtggggg aaagagatct 4620gcttgcagtc ggccagagag
acagaaccag ggcagtggtg agctctcatg acctggtgtc 4680tgttgccttc tggttaagtt
tttcatttgt aattctacaa acatcccttc tgtaaacatt 4740tccctcaaaa tggagcagga
agctctcaaa aatggaccag aaaggggtca ggaatataac 4800tttctctgcc cagattccag
gacttacagt gagaaagcgc cttctgggaa cttcacaatg 4860gctaaagtgt gctaatggga
tgatgtgccc ttgtacaccc actgcctctg aactctgctc 4920tgcattgctg agcaaactac
atttcccaga actccttgtt ggattccttc caaacaggtt 4980taccactggg agagcctgtt
ggttggggag ggcaggaaga gggaggaaag aggaagggac 5040tcacttcctg tttccagctg
aagtctaaat caatccacta tcaacaggta gctatcatac 5100taccctcatt gtcacccctc
agaggtccca ctgcagctgc ataatgtccc ctcagtggcc 5160tgaacatgag atgaacaaca
ctcttcttgg gagtaccagc cttgcttggt tcatggccac 5220ttttcctgat tatcttgcag
ctatattagg tcatgtgaca aagttctggc cagtggcaag 5280ggaacacaag tgataggtac
agatagaagt gtctgatact acatagatta tgcttgcact 5340cactcttaag agagagacat
gaacttttac caacggaagc cagtattatt ttgaacctct 5400gttagagtgg cttgaatctg
tatcctaact tgtatcccta atgtgtgacc catgaaaatt 5460agccaggcag caccagttcc
aaagaagctc acactcccct gcggctgctt ctgccaaggt 5520cactgatatt tccctttgct
aaatcttgtg ggtgttttct tcagtccttg tcttaatcac 5580tcagtggcac ttggcactta
ttccttcttg aaacccttgt ttcccttggc tttgtggcat 5640cctgtgctct tggttttctc
ccatatctct gaccctcttt ccttagtctt ttttcttctt 5700cctcctgtcc cttaaatgct
ggttgtgatc ctctttttat ctcattctac acactcacag 5760cctgagtaat tcacaccatc
ttgatgctga gaacttccaa aatgttggtc tagcctgggt 5820cattgttatg agctctagac
tcacaaggcc aattgcttgg tgggaacccc tcccccatgg 5880ttatctcatg ggtccctgaa
gtccaacttc tccttcattg aactcatcac ctcttctgtt 5940cctcctcctg ggttcccagg
ctcagtggtg gcaccactgt ctacctggct gcttagcctg 6000agacctggct ccgtcccaat
tcctctctct cagtcttatc atccccatcc aggcaaatca 6060ttgattctgt ggacctactc
tttcgggtgt ccctcaaatc tctccacgtc tctgtgttct 6120cactagcact accttggtcc
accctgccat ctgctttcct cctccactcc tgcattctga 6180gtcattttcg gcagcacacg
catccttaaa acccctccac tggcttgcca gtgtcctcag 6240gattaggcga aaagtctttg
ctttgtttta caaggccctt cgctatctgg ccccctcatt 6300acctcccttg ctctgcatgc
tccagtcctg cagaactaca cacagttccc ccaacaaggc 6360cctgctctgt tcttcccaca
cactgctcct ctgcctgggc cactcttcct gctccttgtc 6420agcaggcttg ctgctctcag
gctcagcatg gacagctgct tctgagagcc ttctctgcct 6480acccaggctg ggtggctgcc
tctctttggt gtgcccatgg cagcccagaa tgcctggtgg 6540acagggagcc ctcagcaggc
cgtactgcag cgccctgccc ccgtcagcct ccaggagcct 6600ggagtccagg gacatcaagg
gcggtcctgt ctttctcacc cttgtctctc cagcccctaa 6660cacaggggat gcctgacccc
aaactagacg agttacttga cctctctgac ccaagacaaa 6720atgggaggaa agtgccaaat
ttccaagatt ggccagggga ttaaat 67663055610DNAHomo sapiens
305cacctgtgcc catcacatag ctggggcaca gctggagacc ccaacagaga ggagagctga
60tgggtgacga gaaatcaggc ctctccgcca cggcagccta gctaatgggt cttggctgga
120agctaacagg aaggcctctt tccagaaaca ctgtaagcca gtgtttctca gattgctggg
180tgtaattcat aggcagatca tgaaatcagt ttaatagctt tgaccagcat taacctattt
240atgcctagcg ttcccttatt ggaacactaa gtctgtgaga gttatttaca tcctactgct
300taaggtcatc gccaaaatct gattttttac acaaaaaatt tgcaacctcc agcataaatg
360ggttaaaaca agacaaaaca aaacaatacc agaatggaaa atagtgcatg atctgtacag
420tatagttgta gaaaacttct tgttttatca tttgatgtca tgaaagtccc tgctgtagat
480aaaagatgga gcttgtgctt ctgagtggtc atgctcaaca gggtggggag cccaggggag
540tggggagtga tcgtatagac agaggtgggt ggggccagtg tgagcctgat ggtcaattac
600ttctcatttc tagggaaaat tgaaggaaaa gaaggagggg gatgtggagg ggagagaagg
660cctcagtaga gtttgcacta ttattagggc aagtaagctg cttctgaaaa gaaggggttt
720gcaaagccaa cccaggcaaa agcaatctgc tggaagaact tcatccccag ctgacactgt
780gggaaggacc ccatgcagaa gcaatagggc agcctggtcc catatcctca tgaaatgcct
840cttataattg tgacatcttg caattgtgga ggactttaca cttttcggag ttcctagccc
900ctcacttatt tctcgtaaga ccgctgggag gtggggggat ggtatcatca tcccacttta
960gagatgagga aacaggatca gagtgagcta aatgactgcc agatccaaaa ctagaattca
1020gacctcctag tttctaagtg gacgctcttt ctacaccacc ataatgtgag tgttctgtgt
1080ttacagggtg tattcaagtc catgactgcc cattagaatc cccccaaaaa attccaggac
1140tggcctgagt tgctccttag accaatgaaa tcagactcct gggagtacgg cccgggcctc
1200gggatccttt aaagctccat ttggagagcc tcgggcacag ccaggttgga tccatctccc
1260agtcccccag ccttggctca gcctggccaa gctgcccagg aggtcccttg gtgccctggg
1320ctctgtttca ctgttgtttt gtagagcaac ttcccagtga tgctgccact gggccccatc
1380ctaacagtga agtcccccgg gccctcctga gaggaggtgt gaactggaag atggggaggc
1440aggcggctct gacagacaga aagcaaacag ctcagagggg tggcaggctg cattttattc
1500atcgttaatt taaacaccct tcaagtcctc tcttggaatg ctgctcagaa aaatagatgt
1560attgtttgag aaaccctgca ggcttgtccc gcatgctcta gccccctcct gagagaacag
1620atagcataaa aaatgatttg taaagcaagg gggagcttcc ttagggaaga aggggaaggg
1680gaagagggtt tggggccagg tccgagtgca gaaatcctca atgcatgaga ctagcgtgga
1740aggtgtagca attgtgctct ggggtgcctg aaagtgccag agctgcttca ggggcaagag
1800tccaggcccc aagtccatgc tgatgagccc accctggggg tcaggaatgg cctcagcagg
1860ccctccctcc ctccctctcc accctacaaa gtgaggagcc ttgagtcacc accagcacat
1920tatacaacaa tacaagaacc ctgcaacaga taaagcccca gcgcctcttc tggactcaga
1980tgccctaggc tggctgtctg gctgtgcttt ccagacagtg tgtatgtgga attgtgcttt
2040ttgtttttta agaatgtaaa aagttacagt aagatcgaac cacagggccc gtcgctccta
2100tggtctctgc ctgactgggc tgccgtctgc ctcagttccc cagaagcttc tcctttggcc
2160atgagggctc agtcatccct caccccagag tccacaggaa gagggggtct gctgggaggc
2220ctgtctgaag gacggaggat cctgggtcaa tttagcagct attttccagg gtttggcttg
2280ggtttggatg ctggcttctg tgtgaaacct gaatacatgc aaattgtaca taaaactccc
2340ccaaggcaga gagggatttt ccaggccctg gtacatctct agagagttaa aaatgggaaa
2400tctttcttct taaagtggcc cagactgaga cttttccttg gggaaaaggg ttagtagctc
2460tttgtaaggc tggtgtgtat gtgtgtgtgt atatatatat acatatatgc atgatgctgt
2520gcaaatgccc agggctgtct ggcattttcc acaaaatgag agcctgagat tgcctaagcc
2580ttctgatgcc ttctccaggc ctggaggcac tgcttcattc agaggacaca aaggcctgac
2640cacctggctt tagcaagcta ggacacccag ggtggcttct ttacctttct cctcagctct
2700gagaaggctg ctagccaaga ctctggattc tctgtggcca cagtcatatg gtgagggcct
2760cttggagttc attcaaactt taagggagcc ccacagcacc ggcatgatgg gtaagtccag
2820gcctaaggtt aggaagcaaa tcctggagca tgaggaaatt gtaggctaca gtgagctacc
2880agtggtgtgc aaactggaga cccccaagac agtgagagag gccacagcat ctgagggaat
2940ggagctcttt cttggcctga ggttcagaag aacctgcacc aaagaaaggc atccctatca
3000atgtcactgt tcctgaaatg atgggagaac cacatccctg cttcagggaa gcagtccctg
3060tcgtctgggg cgctgagccc tttggcctga gatgaaggat gatggtgtga tgtatcatgg
3120cagtgtgact gagactggat tgggggatgg ggacagggga acataggcaa aaatacacat
3180gtgccactgg atcctgagct gccattgtac cttggaggac tggcgtttct ctgggaagtt
3240gggaggtggg aagaggaagg gtctcatttt cctgcccctt gaaaccatgc ttaccattcc
3300tttagaagat tgctcaagct gcctccaatt gcctctttcc aaaaccaaag cataggaaaa
3360caagtaaaaa cagctgaggc tgcagcataa gcaacttagg atagagtcta ggaagcaccg
3420ccaacagaga agactgccaa gaaacatttt gagtttttct tctctggagg tgggtcctgg
3480ttcctcccat ggagaccacg attctgtgta gtcctgcacg ctgggcgggg gattgcctgg
3540aggtttcttt agacctgtct agctcacaca gtcttgatgc ctgggtttta ggctgctgta
3600ctgttgctgg ggctcacttc ctgtgggtag gctgttattt tgcccgcaga tcaagtcctc
3660actgtctaga tgcctctatc atggggatct cttcttccct ctctggatgg ctctgatccc
3720caagttattt cctgttgcct aggtaacacc tctaattgga tgccttttaa tcgttccctt
3780ttttaaaggg ataaatgtgg attttatttc caggtcctgt cagagggccc tgccctagag
3840aacacgtgcg cccctgcgtg ggcaatccct tcactgtgac cgcaaccatg ggttggatgg
3900ggggcactca ctgggctggc ctgacagtca cagtgaatcc tgaaagcatg gttttcacag
3960gaacccacct tcaggattta gcaagactga cgtctctcct ggccagcgct gcttcactgg
4020cttcacccca gattagggcc tgtgtttaaa aaccaatccc aactcaaatc agaaattacc
4080caaaatagct ggagagtcac tgagatctca ggtaagcttt cccttcctgc catgaactag
4140aaggggaaag aagagtttga cattcaagtt tgactctaat gctgggtgcg tgagcgcatg
4200cgtgcatgtt tgtgtgtgtg tgtgttccac gcacatttgc cagggagaga gatttcacag
4260catggctcca gctggaggcg gtgaggcggt gcttttctaa gacttcctat cagaagctgt
4320gcatactggt gggtcacgcc gtgcctgtat aaactctggc acctgtcctt gccctcatca
4380tatatgagaa aaatgggcag agagagtgtt cgtttacacc cccagaccac tatcctttca
4440atgaagcctg ggtatctggc cttcctccag gtcagggacc ccctatgctg cagaaggcaa
4500gtctgggaga atctgtccct cagcccgaga gcaaaactgt aatcctaaca ttacttccat
4560ccaccagttt caccagctac ctccctcctg ccttcctctg cctccaatag gctgtgcatg
4620gagaagacaa atcctcttga taaacaatat ttagaaaggg attctatctt tcctgacccc
4680aaacacatca tggcctctgg agccaaatac cctgacattt gcaagatggc ttcttttggg
4740ttcctggtgc tgcaggccct ggttcccaag gacgcagctg gcagaggtgc ctccttcaga
4800ggaggaggag gagaagctgg agggctgcgc ccggcaaccc catgatctct taaaggggga
4860aaagttgaac tgatcaacag tagttaagaa aaaaaaaatc cacaccaaca aataaatatc
4920ttgtctgaga agactcagat attcctggtt aatattgaaa agcactgctg tggatgagct
4980tgtgaaagaa aggacggttg ggggattcaa gatctgccga tccgagcctg gagatcagcc
5040agctaaaagc ccagcagggc tcctgcagcc tcaccgctcc cctcctcaca ggtgccctgg
5100accgcccacc attagaagta gctgccctgt gctctgtgct aaatggacta actctgagct
5160gagaaaggcc agctaagccc ctcaccactg caatttccaa atctggggga aatgccacag
5220tccgcaagtt ggtgctatgt ttcatctcat tgcataatac tacaccattc tctgtgtgta
5280gtggctgttc tatatatata catcgggagg caacatatgg ctgtcccaac ccccacctgt
5340caaaactgtg actatatcac ttctgacgac cagaaggaag ctgctaggct gggccaggat
5400tctaaatgct gaggaggtaa ttcagagcca cgaaaagttg caccatatgc tttggggttg
5460ccggctgctt ctgtgcatgg ggacggggtt tagtgccagt ctgcaaaacc ctcctcgctg
5520cggtatgccc tgggtgtggg cctggggggc cacgttttct ctccctgaag gagaatctgc
5580tgggggccac ggttctccaa gaggggactc
56103064189DNAHomo sapiens 306aaagccattt ccagtgtcgg cccatttaat aaactggttc
cacctggatt ttctcttcat 60tgtggtgaaa gccaccacct aacaatgctg gccctgcctg
catcatcgca tgtcatcatg 120acaacctatg ggggagaaca gctccattca acagatgcag
aaactgcagg ttaaagggtg 180aaaagcagtt gcgaagtccc cacagcttgg aaatggtgga
gccgagactg aaacccaggt 240gtgctagcat ctaaagttca tgctctttcc accacattag
actgtattct gagggcacca 300aggaagctcc atttttctta agaaaccaaa ttgcagtcct
ccaggaccac agccagggga 360gcatcttcgt gggagagtgg ctgctgctca gagttgtgac
tcccatcctt aagagtcctc 420tgtcctctct ggcctccttt ctactgatca ttgctgtccc
cttcacaggg gagaggggcc 480atggcctatc ccctaaagag tctgccaagg tagactcata
acctccccgt ggcacagctc 540agacaagctg ggctatttac ataagacttg acccagggct
tgaggacagc gcgaggaatg 600aggtgcagag gagactgctg cttctgggtg acagtctgcc
tggctgacca cagctggggt 660actcattggc ctcttgaggc cccccacagg cctgccctgc
ctgacctact cttgtgaggc 720caaggccatc tcctccactc tctgggggcc tcttctgact
cctccaaact cttccatgtc 780tggactcctg gcttctgccc aaggccatct attggagttt
gggtttccag ttgaggatct 840gcctttttcc tggatgacca aacctagaat gtgaccggcc
tcatgctccc tcctcacaag 900ggtgtggctt tatccgaggg cctcagcaag gcaaccaaca
ccagacatga agctggtaag 960accaacatcc acctattcat tccttcagca aacatttact
gtggacagca tcaggttggc 1020agtatcatgc aaggctcaga aatatggtgg aaaaccagac
agatgcagtt cctgtcctca 1080gtctctaggc tgccttccta gaggcccctc actgggtttc
ttagcagttt tgtacagatc 1140ctaccccctt tgctgccagc aggctcacct ctgggacagg
gcgcatatag tctgggccag 1200aacttgtcct ggggtcttct tacggccctg gttcagcttg
cattcagcat aacaacttag 1260ctagggagtg ctgcaggccc caaatgatgc taaatactag
actagctgtg taacaccatc 1320tgcccagaat gaagggacag gtgaggcaga agggtctccg
acagcgcaca gggcaaccag 1380tgaaagcgtc cttactgtcc tgttcctgag gtctctctgt
gcctgcttta ctgcccttcg 1440ctttcctaca gagcacactc agctcatcct gggagacaag
gtgggggtgg aggatggtcc 1500atccttcttc cgcatcaagg tcagtaggtt cagagctctg
ggggggtgct gagaccctgg 1560gacaggcttc ctgctgaggg cactggggcc tatgcttgtg
ccactgccta gccagttgcc 1620tcccagagta gagaagcagt ctcccaagct cttgcaattt
gtggggagcc aagctgctct 1680ggagaggggc ctcaaagctt cagccagaga aaaggcaaac
ccagccaccc tgagaatctc 1740ctcctccccc tcaatcacac cctgcagagg cgtgatctgt
ccctgggttc gcaccaagcc 1800tgctattttg tttatgccac aattgatctg ccatcccagt
ttgcaaagag cagacacttg 1860ggggctttat tatgccactt tgacaaaagc tgtgaagctc
gttcccacag cctgtctggt 1920gccgccttcg caaatggggc cctggtgatg gggccttcgg
agttcagctc agagagcatg 1980gaagtgagat ggagaggcca gcactgatct gtatcgtgca
gccctgggcg gcagcctcgg 2040ttgggccctt gacacactcc tcccatccag gcccccagcc
accctgtgag ggaggcacta 2100ttacgcccaa aaatgcaggc aaggaaatgg gctgagggag
gggaagcatt cactgaagtt 2160agtttgtacg tggctgagct ggccctgaag cccatgccct
ttccacttgc cagacagatg 2220ggaagtcttg actcattacc cgctggagac ttttcctgct
gggctctgca ctgtcaactg 2280tgagagaggg aaataaaacg tactgtacag tccaatctgg
gatacttctg agagtgaaag 2340tggctctact agtaattacc ccaggacagc atgtataaac
cagggctgtt ccaagcaact 2400gggacacatg attaaaatgc agattccatg gcaggtcccg
ctcagaggtt tatttagtgg 2460gtcagaaaat gggcccagga atttcatttt aacaaatgtc
tccagataaa tctgatgtaa 2520atggaatatt cctttaagaa tgccattcct ttaagaaata
atgttaataa ggtattccag 2580atgaccctat tggttggaat ctgtccaact acaaatattt
tgatttaaat ttccattgac 2640ctaaaatttt tgtggtgggc actgcagctc tctccttacc
aatcattctc ccagcctgta 2700ctatattgag tagcagccag gctacttgga gaacagactg
aactccagga atgggctcta 2760ttagtctagg ccaatcagga taatccagtt ccttaccatg
actggctcaa gaatgggtag 2820acctaagtca atcagtgcag agcattttca tgactacaga
gaaaccatgg gaaggctgag 2880gcgtggactc agtgggcagg gatggaaaaa gactcagaaa
tactgggcat ggcccatggc 2940tcattagggt tgccaggtaa aatacagaac acccagttaa
atatgacttg ggtaaacaag 3000aaatcatttt ttaagtataa gtatgtccca aatattgcat
aggacatact tatactaaaa 3060tatagttgat tatctgaaat tcaaatttca ctgggcatgc
tgtcttttta tttcctaaac 3120ctggcaaccc tactcatgac tccacttgtc agcttggtac
actcctctga gaagcttctc 3180ccaatattcc catctcattt gatccttcct gactggccag
tgttgggatt aagagcctca 3240ctttaatcaa ggaacccaag gctcacacaa ggctgagccc
tgcccagcca ggccagcggc 3300caggcctccc catgtccctc tctctcctaa cagggaggag
tgggagatgg gggagggctg 3360tgggatggag ggcggggctg ccaacagcct gctccgtggc
tggaactgcg acatcgcctc 3420tctgggagta ggagtgggct tctggccaga ctcagtgggg
gaggactgga cacttgagag 3480ggcactgggc caaagacttc cctgggacat gtgccccagc
cctggtacct cagggctgca 3540catgtcagcc atctccatgt cacacccccg gggaggacaa
ccgaccaccg tggacaagcc 3600ctgaaccttt tgggaaagct ggtgctaaaa gaatagctgg
agaagtcact actgaggact 3660gaaaatgcgg agggtataat aactgctgtt gtgtatcgag
cctcactatt tccacgcacc 3720gtgccaagtg ctgcacgcgt atcaattcat ttaatcctca
caacaactgc atgaggctcc 3780atgttttcac tgattccaag tcacagcttt tttcctctca
tgttaacatc tctaaaatcg 3840tgcatcttac agtcaatggc ctgatagttt atttggcagt
attttaaatt cctaatggta 3900cgtaccatga tggtgtgttc ataaccaata gtgtcttaga
tttgatgaac tatagtgggg 3960atgattattg tccccagttg tgagattagt aaactgagtt
gcatttaaca gataaagaca 4020ttgaagttca gtaacttgcc caagatcaca aagcaagcac
atggcagaga tttgaaacta 4080gaggaaggag cttgcaatgt gataaagcag caaatgtaca
agagtttgga agaaggagag 4140gtaggttgct tttggcaaga gcagcaattc tcaattctgg
ctgcacaat 41893073729DNAHomo sapiens 307ttcacagcaa
agtggctcag gtgaggcagg caaggaatgg gcaaatcacg acatgacata 60tggatttcca
tggcagggaa atgcccccgt aggcacagtc aagcctggct ctaccattgg 120ctcgccgttc
tcctacctcg ctgggcctcc atctccccac ctctggctca cttcctgctt 180tggcccctac
gctgggtagg aggcccggct agaggttagg caccatcttt tccagtcccc 240aaagtgagag
tgtgtgtgtg tgggagagat atttttaaat ggggctgttg tggaaaagct 300gagaccgtgg
gctgctctat ttgttggcgc ttgctggttt gtctgatttg cagagctgga 360tggactgctc
cctgaggaca gaagctcttg gttttcttcc ctccgaagcc aggcgtgggg 420tgggagcatc
cagtgcaccc ctcttgcatt gggtgcgcag tgatccggac agagaggctc 480cagtcagcca
ggcacagaga aaatggccct ctgcccctgt tctgcttgtt tttgtcttgt 540tctctggggg
cctttgaggt gactttcttc atttgatgac aacaagatgg gaggcgggga 600cagctgaggt
ggcaggagta ggggagctag ggacagagga tgaaccccac aggctcaggc 660cagtgacttc
taacattaga gaggttttgg ttaactggga gcaaatgcaa gtgacttctt 720tgaatcgact
ttgtacctcg gcacagcctt ccttgctagc agggctgact tcaaccaccc 780cccactctgt
gctttatctc tgggattaag gttttctctc ctcaccagaa atcattcagc 840aaaatgagtt
attaaaagcc ggttaaccac tcctgcctcc gggtagctcc cgtttaacaa 900cctctcctgg
ggagcagctg tcaagctcgg ccctgagctg gcgggaagat gactcattta 960catacagccc
gtctccaggc cccccccacc gccaccccaa gatctgtccc tgtctccctg 1020atgactaatc
ctttccaggg atgagatcac tgccccttct aacccccccc cccgccccca 1080cacacagaaa
gagcagagcc ctcatctcag cccagaattt tgggagaaga ctaaatccaa 1140gaccaaggga
ggcctttgat gggacaaaga cgtgactgat gaacccggag tgaggagcaa 1200tgagatgaag
aaagctctgc ccacctaccc cgtccctcac tcctccctcc cacctcaggg 1260cgctcatgtg
gggcttgtgt ggggaacagc tccagggtca taccacctct cagaagggag 1320acagaccagc
caggcgtgag gtgacagacc agcgggcagc tcagagcagc aagacaatgt 1380caattcaatc
actttacctc aattcctcta tcacacagga ggagatttta aaaggaagtc 1440tctggtggtt
tgtaaagcaa caaatcctgc tctcaagtgg atagttccaa gccctctcaa 1500tgaattcagt
tttatacacc tggagaagca cagcctcgtc ctttccatgg agctacaagc 1560cacatctggg
ggcgctcagt gcccaggctg agggggcacg cagagccctc ggggacgact 1620caatgcacag
aggccactcc ttaagggccc ggctccctca aactgaggtg tccccatgct 1680tggtcttccc
acagaagcca gcctggttgg ctgcttcaaa ggaggaataa agatgaggag 1740ccatgatgca
aacaaaccca cacctttcag ctgcagccag ggaggtgctc tagaggccca 1800cggagagctg
tgtgtctgct ctgctagccc gacctgcacc tgccctatgg gctggtaaag 1860gggctgccca
cagcacctca gcacatgggt ctctctctct tttcatccag cccaaaatgt 1920caaagcacaa
gggtctctgt cagggcctgg ctgtggtcac tggactgcgg ctgaggggta 1980aggtgcaccc
ctcctctaat gggggcgcac ccctcctcta atgggggtgg ggctggagct 2040aatggcacat
tccactctca gctgccacac acagatgggg aggttgatgg cccgcacaca 2100ggaagtgagg
gatggtgggg actgaattta tggagcccct atcccagacc aagcactctg 2160ctggtacttt
cacaggtgta atccccagag cagctcctgg gggaggtgtc attatgccgg 2220tgaggaaacc
aaggctcaga gaagtaaggc agcacaggtc ccagccccac tccacctctt 2280gaggcctgac
tcagccttag gttcagagaa tgcaactgtg atttttccct gagatgagca 2340ttcaatcata
ctgcgccagg gtacttgctg tggccaaaac gcctgccctg gatctgtggc 2400atgactttgt
gtcagaccct gtgcgataga aggagatgga agctgaggtt cagagacgtt 2460aatgaccttg
aaggtcaggg tcacagaaaa tggcacaacc gggattgcaa ctcagttctg 2520cccaatttca
aatctacctt cctgccacct ccttgccttg cctattgtcc ccctcccttc 2580atatgacctg
ggacccagac tccctggttt tctggaaata ttctctctcc ttttggctca 2640ttctgtgcac
tgtcccagtt ggtggtattg aggcacagct ctgcccagat cactctccag 2700ctcagctgcc
ctgagctccc cagccccacc tttcaaggtc aggaatgact tatttccttt 2760tctcttcgcc
tgtctgaatc ctcgccatca tctgacagca ctttcagctc accaggcatt 2820taactgtgtg
ctgtcttgtg acagcccctg tcctgaccat tgtcccagag atttaaccct 2880ttgtgttttt
ctatgttagc ttgccagtga ggacatggcc catgtcttgg acttcttgcc 2940accctccaga
atgcaggtcc tcagggagca gctgctaaat tcccaacaga actgacagtt 3000tgtccagtga
tcaccagcag acactgtacc agaaacattg tctccatctt agtttcagtt 3060aactcaacat
gtttttgaga cctgctgtgt gccagacatg agggcagggg aaggaagact 3120atggtgaata
agattgccca cagacctcca ggaacacacc tgtagtcaaa acacaattga 3180gcttactggc
tcactgcaat gaggaagctt ggccaccatg gattctgggg catctcagtc 3240agagggtatc
aaagagggct aattctagga gttgggcttg agttcttgag ttaggtgatt 3300tatggaggac
ttaaggaagc aggcttcgct ctggatggga tgctgccaaa gagcggaagt 3360acttctatga
ctgggcatct taattcttag ctggaagatg ggaacgacat agcgaggcca 3420agctgtgatt
ggccaagaag cagcagttgc tcatactaac cagatgaggg atgtttggtc 3480tttttagtgg
tttggacgat gttcttgttt tggtctgtgt ttggacatga ttatggggtg 3540aatggtcttg
tttttctctc actccatctt ggtcataagg cggccttgcc tgatcagggg 3600ttctgtgaaa
tgcttatgct ccatggcaga tcctcccagc cccactgtga gtgccaggcc 3660agctctcggc
gctcaggggc tgccttcatc tttctcaaaa tgctgctgtg ctttgcagcc 3720cataatagc
3729308774DNAHomo
sapiens 308ttcactccca acaagcactg ttaagtttat aaataaaaat gaataagata
tgatccctgt 60cctcatacat aatgaccaca atgccctatg aaaatgacag taagggagat
gtcaggggat 120tccgggaagc aagagaagcg gtttgggagg gtgccccaga gcaggtgcca
ttggaactga 180gtagtgaaga tgccttagcc aggagatgga ggcagttggg cacaaggtct
gcaaggtcca 240ggttggcatt caagggttca ggtgtcatgt ggataagact ttccagggag
cacgtgtggt 300atgagaatga aggcccctga ggaacttcag tatctaaagg gcagaagagg
agtctatgaa 360ggagaccaca ttcattcaac aaacatttat tgagggccta ctgtggccag
gaactgtgct 420aggcccttgg gattcaacag tgatctaaaa gacaaagtcc cctaccctca
agaagcttac 480atttcagtat gggagatgga ataataaact acaaacaatt acgtggcctg
tgagaaagtg 540gcaagtacta caaaaaataa aaaatagagc aggctaagag ggtgaggagt
gtgcgggcag 600ggaggaggca ggtcatggtt ttaagtgggg tgggtccagg gaggcctcgt
taaggaacgg 660atgtttgata aaggactagg gtaagagtgt ttcaggctag tcaacagcac
gagcaaaggc 720cctgaggcag aggggctgga aacagagaag aaaagagtga aaaaggactg
ggca 7743091782DNAHomo sapiens 309aatccctcct gcacactttc
agggcaaagt ttaggtgata taaatgtccc tgaaatgaga 60aaaaccatga ctttcatttg
attttaatgt gagggagaaa cataaactag tagttttaca 120aaaagaaaaa gaaatataat
attcaagtag atttcaagca acagcagata tgctgaattt 180atttgataac tgtcttcttt
ttctctgtca gcagagtctc atgcaatttt aaaaggaaat 240tcgatgaaac gaacacccat
caacatcttt aatagctgca agcaatgtgg agcaaatttt 300ttgtcttatt taatgtggtc
atcaccataa cccagtaaag acaatatcat cattgctccc 360attttgtaga cagggaaact
gaatccagga taaataatgt agcttgcatg acaccaatct 420tcctcaagtc tgagccagaa
tttatatctc ccatttctca acctcatctc tcaagcctat 480aatctttcag ttataagaag
ggaaacactt gagggtgtat cagtttgtgt tttgttcata 540gtgtttatat gctctcaatc
aaggactgtt tattaaaaaa ttttaggagg tggtagtcaa 600aaagtgtctc tggctgcagt
actggggaca gactgcaggg gtgagttcag cgagtctagt 660tcagaggctg tggatcaaac
aggtggggtg gcccagacca ggagagtagc caaaagggga 720ctaaggaagg gaggccaaag
ggaggccaca aacgcaggag aggatgaagg tgctgaagct 780agaggccatt caggaaggaa
ggaatgacgg ggcaaggggt cagaaatttc tagaagaatc 840tagtaagatg gaaacctaac
agtcctactg ggatttggca actgggagga agctggctct 900gtgcaggaaa gaagggggca
ccgctgtgcg gacgccagac tgcgaagggc tgcagaagga 960gccgaaaggg gaagaaacgg
acgcaggtag gggtggctgc tgttaaagcc gcttcccggg 1020gaggccaagg acatccacag
ctgaagtgct caggaccatc cacagctgaa gtgctcagac 1080actgcgtttt ctttatctca
gagaggctgt gtgacttgcc cacgtatgag tacagtggct 1140aaatcacaag ccctggagtc
aagggtttag gttgatccag cccccactac tcactggtgg 1200ctgtcagcaa gctactcgct
gtgcctcagt ttccccatct atcaagtaga cagcactgcc 1260ttacagatgg ttgtggggat
cagaggggag gggacagctg gcggatttag cagagtacgt 1320ggcacagagg aaacactaaa
tatgcttctt cagctcctta tcaaggttag gcctccacaa 1380agggtggagc agggaagaga
aggcctcacc gggcagacct atcttggaga agatacaagc 1440aatggtgctg aagtttcaca
acagtgtcaa ccccctccct catgtgtgta ctcacagcta 1500ctcactttcc tactctgtgc
cagccatgag gtgtagtcac tgtgccaggg ggctgagtgt 1560ccggcctggg acgtgagagg
gcatgggctc acctgctcag ggtttgaatg agaccccggt 1620aaccgcagca gtaaagaccc
ctcaaatgcc atctctaaat taaaatgggt gatcagaaaa 1680tagcaggtga acgatagtgc
cctcactgcc cacagaagtg ccttcagtca gatttagcgc 1740tccatcttct gcctttctga
agggacagtg gaagcatcca tt 17823101801DNAHomo sapiens
310ccatctgtgc aattccttcc tcctagaatt cagaatctga ggtgctggtt tcctgaggac
60acttgtgact tgctgccttt tattgaactc tgagtgccct attgcccagt ttgagtgttc
120caatgggaag tgcagagcca ccgtggccat tcattgctgt agagctgcgc cccagtacct
180gatacatccc tcaccctttt ccaattgatt tttagcttcc ttcatccctc cctctttccc
240ttgtcctctt cgtgtccaca ggaagcctgt tgggagcctg ctatggcaag tgctgtgcta
300ggacacggtc ctgcactctt agagtttgtg gttcagttat tccagtttca gcacttacat
360tcattcaaat gctttgtgga agcaagctgg cttttagtca ccagcaatag caatttctga
420aaatcaccaa gccacaccaa atatatgaaa tatctttctc taaggtggtc tttaaaattt
480gggctgactc tcctccctct aggaatgttc tgatgagttt cagtctgaag gcagggagat
540ggtctcggtg acctcctggg cccctgttct gcactgaact gtatgcccat acattcatag
600gttgagatcg taacactcca gtacctcaga atgttactgc attggtagaa aggcttttta
660aaaaagggaa tcaaggtaaa acgaggccat tagggtgagc cctaatccaa tatggctggt
720gtcctcacag gaagagtgta ttaagataca gacatacaca aggaaaacca cgtgaagata
780tggagaaggt ggttgtctgc aagccaagga gagagtcctc aggagaaacc aaccctgcca
840gccccttgat cttagacttc tggcctccag aattgtgaga aaatacattt ccattgttta
900agtcccccag tccgtggtac tttgttatgg cagccggaag gagactgggg ccgcctgttt
960gcttggctgc agaagcccca cgtggctgca ccctggctca ttctgttttc tgtagcagca
1020gcagcagcag cagcggcagc agggagccca ggatgcaaag cttggtttct gagccctgat
1080caggaggctg tgtttatatt tatcctgcta actgcagggg actgtttatt cccagagaaa
1140taacctcctg ggcaggatag gggcagccaa ggaaccagct gcttccatca ggcctgctgg
1200gctcctccag gttctcatca taccacttct gtcgaggctc tctctgacgc agctctcctc
1260actccacacc aggcttgggc ccaggggcac agcctggtct tcctgaggat gctcagacgc
1320agggaccgac tgctcctcac aagcaccctg gcacatgcac agcccaggga ctggagcctt
1380cgcaaacaag tcacagtcct agtctgagat tcagtgcaac actaggcgct tagtagatgc
1440tcagtaaaca gaacaacaag gattttcttt tttagtttta aaacattagt ctacccatgc
1500cttgataaac tgtaaaatgc ctctgccacc cattctccct tcttgctccc tttcatggga
1560gctctgaggg gaaggtctct ggggtgggtt ccagcaaccc tgggcctgtt ctggggtcct
1620gcagccaggt tgggctttca ggagcctata tttcatctgg gccccagtca cactacatag
1680atttttgttt tatcacagaa atcactgcca cactgtgacc cttaaggtcc tcagcaggga
1740tggcgcgagg tgagagtatc aaagccaggt gagagcactc agatggcttc tgcctttgaa
1800c
18013111032DNAHomo sapiens 311gtctggattc tttcacaatg tagcataatg ctctggagtt
tcagccatgt tgcagcatgc 60atcagtactt catttctttt tatagctgaa taatattcca
tagtatttat atatcaaaat 120ttgtttatcc attaacctgt ggagggacat ttaggctgtt
tccacctttt ggctattgtg 180aatggtgcta ctataaacat gtgtacacat gcctgtttaa
gtatatgttt tcagttcttt 240ggggtatata cctaggagtg gaattgtaga atcatgtggt
aattttgttt aactttttgg 300aaaaatatca agctgtaccc aaagtggttg caccattttg
catttccacc agcaaaatgt 360gagagttcca gtttctccat atccttgcca atacttattt
ttctttttaa aaaatagcta 420tcctagtaca tgggaagtga cattcattgt ggttttaatt
tgcatttccc taatgattag 480tgatgttgag catcttttca tgtgtttatt agtcatctgg
atatctttgg agaaatggct 540attcaagccc tttgtccatt tttaactggg ttgttcggtt
ttgttgttga gttgtaggag 600ttcattatgt attctggata ttaatcactt acctgataca
tgatttgcaa atattttctc 660ccattctgtg ggatgccttt tcattctctt catagtgtcc
tttgatacac aaaagttttt 720cattttgatg aagtccaatt cacctgtttt tttcttgacc
aaaaagtaga aacaactgaa 780atgtccacca actcatgaac agataaacaa aatgtgtata
taatgggata tattcagcca 840taaaatgaat gaagtacaaa cacatacaac atggatgaac
cttggaaact ttatgctaag 900tgaatacagt cagatacaaa aagggaacta ttgtataatt
ctatgcatgt gaggtacaca 960gaatagtcat tttcataagg acaggaaatg gaatagtggt
tagcaggggc tgaacagagg 1020agaagattgg ca
1032312627DNAHomo sapiens 312aaccacccaa tgtgttcacc
ttgcccgctg cctagacaga gccgatttat caagacagga 60taactgcaat ggagaaagag
taattcacac agagctggct gtgcaggaaa ccggagtttt 120attattactc aaatcagtct
ccccaagcat tcggggatca gggtttttaa agataatttg 180gcaggtagga gtttgggaag
tggggagtgc tgattggtca ggttagagat ggaatcatag 240gtggttgaag tgagtttttc
ttgctgtctt ctgttcttgg gtgtgatggc agaactggtt 300gagccagatt cctggtctga
gtggtgtcag ctgatccatt gagtgtaggg tctgcaaata 360tctcaagcac tgatcttagg
ttttacaata gtgatgttat ccccagaagc aattagggga 420agttcagact ctaggcgcca
gaggtggcat gatccctaaa ctgtaatttc taatcttgta 480gctaatttgt tagttcgcaa
aggcagactg gtccccaggc aagaaggggg tcttttcagg 540aaagggctgt tattaatttt
gtttcagagt caaaccatga actgaattcc ttcccaaggt 600tagtttggcc tactcgcagg
aatgaac 627313907DNAHomo sapiens
313gccaaaggtg gaaaatgttg atgtagactt ctaagatttt gacaaaattt tgttttatgg
60cctggtggtt atataaatat ttactgtata acaattcatt aagatacaca tttgtgtttt
120ttgtatatat gtgttctatt tcacaatctt aaatgttcct taattaatta atggagcaca
180ccttcagagt tgggtgggaa aataattctg cctagaaatc caaacttaga caagctagct
240atcaagactg aggacaaact aaagccattc ttacacctgt aaggattcag ggtttatcta
300ctatttatgc tatctgaagg agacaattga atatgttggc caggaaacca agtgtgagga
360gtatgtagaa aacagaagat gatagtacta accctgttaa tctaataaaa agaaacccca
420ggatgactgc ttgcagtggg gtttgaaaga aatctattca aattaaaaca ggaggtccat
480gtgctccaaa aagatattct ttttttttaa atatatatat atcttttatt atactttaag
540ttctagggta catgtacaca acgtgcaggt ttgttacata tgcatacatg tgccatgttg
600gtgtgctgca cccattaact cctcatttac attaggtatg tctcctaatg ctatccctcc
660cccctcccct accccataac aggccccagt gtgtgaaaaa acgatagtta gatgccacga
720actaggtggc aatgccttaa ccgtatgtgt gttgtcaggc ctgagggcct cttccatcct
780tgtcaagggg agtactaacc ttctcccctt tcatacaaca caaagatatt cttaagactt
840ctagaataga ccctgaacaa ttttagagta aggaactaat agatatcagt gctttcatga
900agaaggc
907314769DNAHomo sapiens 314ccagaaggtg gaagctacag tgagccaaca gagtgagacc
atctcaaaaa aaatttaaaa 60aaatgaagaa ggaaggaagg aagagaggga gggagggagc
gtgggcgggg gggggggggt 120ggaggaggag gagagaagga gtgggaggag tggagaagga
gggggaggag gagaaggata 180aaaggttaca agtggttgtt actaggaatg ggggagaaga
gaagtgggta atggcactga 240agctttttat tatgtctttc agcattctct gattgttctt
aaaccatcaa cagatctcag 300tatgtagact aaaagggaat atttggtgaa gagatcttct
ttcactattg tacacttgct 360atggacatgt ccatgcctgc tgcctggcag gcaccattca
ttaagtaggc ccctgttgcc 420aaggaaacca gctcttcact gataccaaag ataatgcaga
ggcctgccgc tcaccaagca 480accttcctca tgagctatgc ccccaccttc ctgaactgtc
tcttgctcct gtttgatact 540gtcatgctgc acgaagctta cacttgctat ctctcacttc
cctcttagtc atctgtgatg 600ctggctaagg gagctaggcc agtcagcagt gacctgttgc
ccttggttta ttataagcaa 660actgttcaca agaaatgaac ttctgttgtt ttataaatga
tatgcatcac agaacacaga 720ataatatcaa aaccacatta gttttttcat acttgcttca
ttgacccca 769315573DNAHomo sapiens 315acatcgaccc
agaaagttcc ttctgtcagt agcagttcac ccccccatgc ccccaaccct 60tggcctccct
gccttcccat ctccactccc aaccctcact gctctgattc tatcaccatt 120gttttgattc
ttctgctgtt gatcttcata aaaccagtat atttcctttt gtgtctggtt 180tattttcctc
agaataatgt ttttaacatt tatccatatt gttatgtgta tcagtcgttt 240cttccagatt
agtactctat tgtatggata gagcctattt tgtttaccca tttcctgttg 300acagacattt
ggtttgttcc cagttttgga ttataatgaa taaagctgct atgaacattc 360ttgaacgatg
aacatttttg tggacatatg ttttgatttt tttgtgtaaa tacctaggag 420tgaaattatt
gaggtatggt ataggtttat gcttaatttt atagagtact taaacttgat 480tcttttattt
aaaattgtga taaaatacac ataacataaa atgaaccgtc ttaactgttt 540ttaactgtac
agtgcagtgg tacgaagcac att
5733161465DNAHomo sapiens 316agcgtgccat tgtactctcc ctgggtgaca aagcaaggcc
ctatctaaaa caaacaaaca 60agcaaacaaa aaaccccaaa actggaactc tgtatctatt
aaacagtaat ctctcattga 120gtggtgttaa gagtaaaatt ttttttaaca aaagaaaaaa
gtaaaaagta aattttgaaa 180aaagaattaa aaacaaaaaa tctccattac cccctccccc
agcccctggc aaccaccatt 240ctactttctg tctttctgaa tttgactact gcacataacc
ttatataggt ggaatcaaac 300agtatttgtc tttttgtgac tgacttattt cacttaggat
agtgccctca gcttttaaaa 360ggaaagacat tttgatatat gctacaacat aatattccat
tgtatgtaca taccaaattt 420tattaacgat ttcatctgtc aatgaacatt tgggttgctt
ccaccttttg tctattgtga 480ataatgctgc cgcgaacatg tttaagtcct tgctttcact
tttttgtgta tacacccaga 540agttgaaatg ctggattata tgtaattcta tttttaatat
gagtgactgc catactgttt 600tctatagtgg ctgtaccgtt ttacgttccc actaagagaa
catgagtgtt ccagtttcac 660catatcctca ccaacactta ttttctgttt tgttggtggt
agccatccta ctggatgtaa 720actttattca tttttcgaac ctttttaata tggaattttc
aaacacacac aaaagatgag 780agatctccag gtacccacca caagctttaa taatgattaa
catttggtag caggtggaca 840aagatatacc ttctctatag cagctataag atcagggaca
aacaaagatc tatttggaac 900tccaactaag aatggtgttt tgtaggctgc ctgatgaata
aggttagata actaatggcc 960agtctttcag cctgtgctca agggatagga taacaataaa
gcatagttgg tgaaggagca 1020gcagataaag gtcacaatag ataggccata agagaaccct
cactatcact taccattcag 1080accattcgct tcatattcta acaagttatt ttcctttcat
aaaaggaagc tgaagctttt 1140atttgtgttt gtggtgcatg tgatccatga gaggggactc
aaccaggtgc tatgtgtgag 1200tagtacttaa tccgacagta ttagtgggct ggtgggcttt
cctggttaca tgggaaccct 1260agaaacccaa gccaagcaca aaagccaaga ctgaattctc
cagtaagtca cctggtagcc 1320ttgacatgct catgcttaaa aaagagccag tgacctatta
ataggaagct cctgaaatga 1380gtcctctgaa catctgcaag tatggtcagc tacacctgag
ctgagacttg cctgtttccc 1440tgccaggaaa tcatgggctc agaaa
14653171399DNAHomo sapiens 317gtgcccagct agttccattc
ttttcagata aattttttca aatcctctag aaacaggtaa 60tatttgtgct ttttaaacag
ttcagaatac acacaaaata taaaatgttt ctaatattta 120ttatatatct aacatattga
taatctaata gaattccaga ttcctaaagg atttctgtat 180aagcactgag agataacctg
tcttagccat tgctagtcag aaccaaagaa aataccatga 240agattctgag gtcttccacc
aaaaaaagtt tcttaaaaga aatggaggca tgaaggcagt 300caggtaatga cagcaatgac
agaatgagaa aagtactgca acagttcaaa aaactgcttt 360ttcttcctgg ttctgctaca
taattcaaga taactttaac cacctctctg gggcccaatt 420tctttacatt gcaaagaagt
tatggaccct ttaatactca gttccacaaa ttctgactca 480gagggttcag tgagaactcc
aataattggg aggcaataaa ctcactggat agctttgagt 540aagacgactt ttggtgtgcc
tgtcagttca tatcctccta taaagtctct aacctcaacc 600catcccaacc acaggcctgg
gggcctgtag ctatgtatta tggatccttt taggaaaaag 660tatcttgcta gtcacaacta
tgttctccct tgaagaaaaa tgagcaggtg aagctgctgt 720tcagacagaa tgaagcggat
gtgcaaaggg accacagaca accatcacgg taggaaatac 780cgcttgcttt actgctgaat
ctccagtgcc tagatcagtg cctggcccta gcaggttttc 840atcaaataat tattgaagga
ccactgaatt tcattccctc atgtggtttc catgagatac 900ttctgtattt ctctaatcat
tcaattattc ctccccctta agctagcaca agtttctttc 960ttacaaccag aaagcccttc
caaatacatt atgatattct ccccttcata gccaccactt 1020acttcactac aggtatatgt
cagacctcag gaaagacacc accgaagact ggatcacatg 1080tccccactca ggaatacaga
attggcacat gagattaggt cagttggtca gcagcactaa 1140aggtggtgat agacaccaat
gcagcgcata aaggctggcc ggcaggcgaa gtgataagaa 1200agcagacaca aacaggaaag
tagacaatgg tggttctgag acatccctat attttcctgc 1260tatggactga atgtttgtgt
ctcccacacc cccattcata tgttgaaatg ctaacatcca 1320acagtatttc gaggtggggc
ctttgagagg caactagttc ataaatgtgg agccctcatg 1380atgggattag cgctcttga
13993181332DNAHomo sapiens
318tgcacatgct cactgaaaga cagatgtgat cattttcata gtaactttat tcataatgac
60cacaagctgg aaacaaccta aatttctatc aacggtagaa tgggtaagtt gtgatgtagt
120cacacaatgg aatactacac agaagtaaaa catgaactgc tactacatat aacatggatt
180actctcacag atacaatgtt gaacaagaca cggaagagtg catataatta tttcacttac
240ataaagtttt aaaacaggca aaactaatcc gtggtgataa aaggggttgg tgggggcctg
300gggaacataa ggacacctac tgaagtgcta gaaatattct atattttgac ctgggtggtg
360cattcaaggg catatataaa aacccactga ggtgtacact taatatcggt acacttttaa
420attttacctc agtaacaagt tgaaaatata ttggaagaaa gcatataaat gaagatgtta
480atcagtggtg ttgtcttcca aatatttctg gtttttctcc caggtatatg caaggatcac
540actcctccta gaactttaat gtggctatgt gatttgcttt ggccaatgaa agtcctttaa
600gagtccttaa gtgatttgcc atactctttt ttcatttccc aaagttagca cccacactcc
660tgatggtatc tgctttgtca gcctgagacc cagaatgaag gcaacttaga gcacaaattg
720gtgaactttc tctgtaaagg gccacagaat aagtatttta ggttttgtga gatgtacaca
780ctctgtagca actactcagc tgtgccattg tagaaccaaa gtagccacag ataataaata
840aatggacaag gctatattcc agtaaaactt tatttagata aataaggagc tagacagatt
900ttgcccatgg accatagttt gatcatggcc aacctatgat ctaaaccaaa gtccctaggc
960aagttgcaat agaaaagttg tgtgagacag tgagattttg cctttgttat tcaatggcaa
1020gctagcccat gctgacacag aaattggtcc cttgtttttt aaaatgctaa aatactgaac
1080actggcttaa tggttagctg gcaggcattc aggaaattgc tatcagaaga tggaagaaat
1140aacttacaca agggtaattt atattatgga aaggtgaaac tgccccttga gataacctga
1200aaggcagatc atctaccctt tactaaagaa aaagacagaa aaacaaaata ttttgtagtt
1260gctgttcact gcacttgaca aaacgtacag aagagatgaa ctcagaaaag actggtcagt
1320ttgcaggcaa aa
13323194336DNAHomo sapiens 319cagcaagcac caaatcactg gatgactatg acacctactg
ttaagatctc tgaatgactt 60aatatgcctc agactaaaac tccaactaag attatactag
ttaatcccat aaattggtca 120aaatggctta aggaaacaga ctaagggtgt ggctttccca
cctaagcctg aaaagcctca 180gttacccaga aattaagttc agagaggcat gggaacaaga
aggcaaagga atacagcaaa 240ctttagaagt atacattaac tagatgtcac ttttgggtca
tctccctttg ggtgataagg 300acattctgca ggtgtaaaaa caagtgaact gaatatatag
tgaccagaag ggctgagtca 360gtcttctact ttgttaaatt tacttacttt taaatcccag
aagagccaaa aatcatctgt 420aaaggaaagt agcaagtaaa tagcatgcca ttttctccta
gtgttgatgc ctacaaaagg 480aaaaatgatt attaactctt aggcggcatt atctttttcc
aatactaaat gtaacattta 540gtaaaaacat attgttttag ttcactaagt agttgtctaa
tcttttccct ctatatgtag 600gttcctttgg aaatttaaaa aaaaaaaaaa aatatatata
tatatatgac actgtttaca 660aagggtaaaa aaaaataaga ttctatacat ggatatgcaa
acttaatatt acatagtgga 720tttggtgtgt atatttggtt ttgaatcctg agtttactac
ttactgtgtt taccagggaa 780aaccgcaatt tgttatcctt ctctcctttc atacaacaga
gaaaaccaca atttgtttaa 840cagtcacata atacatattt aagtcacatg ctaagcacta
cactaaattc tgagaataca 900acgaggtccc tagttacaaa gaacttgtct tcatttttca
attagtaata tgtggataaa 960agttacccaa tggacagtct aaggcagaat gactgtgaag
gtcaaataag actgtgaaag 1020agcttcaaaa attgtaaaac actacacaaa tattcgtttg
tccaaacatt tattgaaatg 1080ccaggcattg tgctaagcac tagagatata acagtgaaca
aggcttatat ggtccctgcc 1140cttacaaagc ttacagtcta gcagtgatca ataagcagta
acaataaagt gtgccaagtg 1200tatgtctggg aaagaacagg gtgtataggg aatggatagt
aagggcacct aatctagagg 1260gcatcaatga aggtttccta gatgaagtgg catactgaga
ccttaaagat gaagataaat 1320ttgtattgta ccctaagagc aatggtgaaa gcaatgcagt
gacatgatca gtaagtcttt 1380tggagcaatt tggttgtagt gtagaaagga ataaaaataa
aaaacaggga gactaataag 1440gaggctgttg ctataattta ggtaggttga tggcctgaat
taaaatggca gcattggaga 1500attggtaaaa aggacaaatg aatggttggt agtggtaatg
ccatttagtc aaagaggaaa 1560catgagagga ggagcaggat tgggggcaag atcaatgaca
tgtagtgcct gagacagcaa 1620agagcatttg ttagcaatta gatacatcaa ttagggaaga
tctagaaagg agatatgaat 1680ctgacagtca tttgcatata aatggtaagg aaaccatgga
aggaaatgag atcagctagc 1740gagctgacac agaacaaggc agtctaaaac aaattttttt
aaaaatacga agaacagata 1800ttgaagggaa gaggtgcctg caaagactaa gaaagcacac
ctggagatgg tatctcctca 1860aagctaaagt catcaagtgt tcaagtgttt caaggagggt
aagactatta acaaggactt 1920agcatagtag agcaatttga gtggcaatac gggacactgg
gaatacaaat ctgtcaagaa 1980aactagtagg aatgagctat aggacagtaa ctggtaagga
cctaataatt ttttttttaa 2040tgtacgtatt ttaactatat tcactgctac aacaggacca
gtaacaacta tatttattta 2100aaaaaaaaaa gactgccatg cagttacaga attacttaat
acagaaaaca gtaaaataca 2160cttttttctt tttctttttt tttttttttt tttacaaaca
agactagctt atagcaaatt 2220ctctatagct aagggtcaat ttaaaatcct tggcttatat
ctccccctca ctcaatgact 2280acatgatgca aactaatttt attaacacct taagcaaaac
atactggaat ttcacaaaat 2340gtacaagatt tcaatattta aggaactggg gttagaaagc
agaagtggct ttcaggtctt 2400ccagtctttc tctcaagtaa taaagctctg ctgtgaatat
tcaaagctat tgggaaatta 2460ccggtagatt tttctgtttt tttttttcgg ttttccacta
tgttgtttct ctagatatgt 2520aagcttactc tattaaccaa aatctcagct tgaccattct
tgataagtac ctaatcgaca 2580tgtaactttt tttctgcctt aaatatgtat aacaggacag
agcccttaaa tctgattcaa 2640ttattaattc ctgatttaca agtgctatgg tgagctaaca
gaacttatca atgcctttat 2700tgcactttac tagccaaatt tagaaggttg gaattagtct
ctcctatcta gtattctgtc 2760agtttgccca gcttgtactt ttaattttgc ttctaatggt
aatctgccct atcccttgaa 2820ataaaataat ctacattttg ggagggctaa ttcttcattg
tgccaggctg tcccatgcac 2880tgcaggggtg agtgtcttta ggcttaaatg ccaacagaag
cccctagtaa atatgacaac 2940caaaaaagtg cccctacaca tttctcagca tcctctggaa
tgacaggtta ctgcctctag 3000ttgaaagcca ctggcacaac tttggttttt aagctcttat
gccatttatt ttaattgccc 3060agacatcaat tccacctaaa ttcttagtca tagcctggtt
ccttgaattt gctggattag 3120taaccacaga ttaaggtgtt tcaatagtta agacaggact
ttggaacaag agtttttaaa 3180ttgtataata cttgagagga tctatgaata taaattgggt
cctgtttata attagtttta 3240cataatgaac tttaagattg ccttttcatg gtgaacagaa
gtttggaaat tactgttttg 3300gcacaaagca gattatctta gtagaaatac agaattactg
caatctgtga ataagactgc 3360ttttaaatat ttctacttgt gtgctatctt acatatagaa
tgtgtacgac agttccaaat 3420tttagaataa atccatttct agcatctaac aaaatctgat
actgtatcat tttaaaacaa 3480agtgtttact ttaggcagga ttttttaaaa taaagcagca
atacccacgc agataagaca 3540aaaaagctaa aatatctcac acctcctaat cctggagtgc
aatctttttt cctcatcgtt 3600tttgataggg ccaaacttgt gtctacagta aaaaaaaaaa
aaaaaagaat tactaactgg 3660caaccattaa gattctatac ttaccatagt cctttaatag
gcaagctgat aaaatagccc 3720ccagttatta aaaaaaaaat ccaaggaaaa cccccaataa
ttagtcttat ctccaaattg 3780catgaagtct cctatatctg aaacttaaaa atgattctaa
tgacttcctc tatcagtaat 3840gtgttatcac tgaggtgggt gatggggagg gaagagggaa
gaaatctgtc agtattacct 3900tcgaactcag aaatgtttaa aaaaaagtct caaacatttt
gatggttaga caaaacacct 3960ccactgttat gtatgggctt cctttttgga aacttatgaa
cttgctatgt gagcttctgc 4020aaattggttc aaaagcacat ttaaggagtt gataatttaa
gactatatga atcagaattt 4080taacactcca ttaaaataag agctgaaatt tttggcattt
atcttcagaa cacctaaaaa 4140acagactgca aattcaactc acattaatac taaatctctt
taaaattaac tatatcataa 4200aagacaatga ctttgtcact aaactaagtt ttaaaaaagg
tggcattctc atgtttcagt 4260cccatgctgc catttgagat gaaaaaaaag gcaactgtca
gaattttaat tgtgatcagt 4320ttggacggct ggtact
43363201612DNAHomo sapiens 320cctggccaga aaattcattg
acttcctaaa gatttattaa ctttctgcat tacttttttt 60tttcccctcc atcgtaaata
taaaagggaa tagtagagaa aatcattcag aattttattt 120tttagtgaca ttatttagtg
acattttatt agagtcactt aggaacctga ggctgaataa 180agttcaggta aaagtaaaat
tagttgagaa gagacatctg ccaaaagaaa tctattttta 240acttcacttg ctgtctttcc
tagaggaaca gaaatagtgc tgaatgtcct attagaaatg 300atggttgctc tgcccgtctc
ttccctctct ctcacacaat atgtaaactc atacagtgta 360tgagcctgta agacaaagga
aaaacacgtt aatgaggcac tattgtttgt atttggagtt 420tgttatcatt gcttggctca
tattaaaata tgtacattag agtagttgca gactgataaa 480ttattttctg tttgatttgc
cagtttagat gcaaaatcca caagtattca agtgattgtt 540aaagagggag gcctgaagtt
gattcagatc caagacaatg gcaccgggat cagggtaagt 600aaaacctcaa agtagcagga
tgtttgtgcg cttcatggaa gagtcaggac ctttctctgt 660tctggaaact aggcttttgc
agatgggatt ttttcactga aaaattcaac accaacaata 720aatatttatt gagtacctat
tatttgctgg gcactgttca ggggatgtgt cagtgaataa 780aatagattaa aatctattct
cttctgatgc ttacattata gtggtgggag acaaaatggg 840tataataaat attatattag
atagcattaa gtgctgtgga gaaaactaaa gcagggagga 900agataggagt gtgcaagcca
gaaaggttgc aattaaattg agtagttcag gaaggcttca 960atatggatgt gatatttgag
agaccggtgg aagtcaagga gcaagttgtg aggctattta 1020aaggtattct tggcttacag
aacaatatac gcaaagacta ttaaatggaa gcatacctga 1080catgttaaag gactatcaag
gaggccagtt tgtctagagg ctgaaaagga aagagtaata 1140ggagatgagg tctgagtgaa
aacacgtaaa tccttgtggg ccaaggtaaa atctttagct 1200ttttttctga atatggtggg
atactgttag agggttttaa gcagaggtta cgtggtgtgg 1260tgagtttttt ttttttaatc
ctttgtcttt ctgtgtggaa aatagcagga cagggcagaa 1320gcagtctgtc ctgcagactg
cttggtcgca gtagagatgt aagaagcagt gagattctgg 1380gttaattatg gaggcaaagt
tctcagaatt tgctgatata gggtatgaga gaaagaggaa 1440tcaggaatga tttcaaggtt
ttggtctgct aaatggaagg agttgccatt tactaagatg 1500ggaaagacta tgaaagaagc
agattttcag agagatcaga agttcatttt ggggcatgtt 1560caatttaaga tgcctgttag
ttggatgttt atgtgagttt ggaatgcagg gt 1612321831DNAHomo sapiens
321gcagtccttt gaggatttag ccagtatttc tacctatggc tttcgaggtg aggtaagcta
60aagattcaag aaatgtgtaa aatatcctcc tgtgatgaca ttgtctgtca tttgttagta
120tgtatttctc aacatagata aataaggttt ggtacctttt acttgttaaa tgtatgcaaa
180tctgagcaaa cttaatgaac tttaactttc aaagactgag aattgttcat aaataaacta
240ttttacctgc agagacctct gatatatgtt tcttgatgga agtacccagt accacctatg
300aagttttctt gtcaaaaaat caaatgtgaa tctgatcatt acttagatct aagtaccaat
360atatgaaaaa tataggagac aaggaagcat ggtaaatgat actgagattg ggagactaca
420tggaaaaaga cttgttccct tcaacagata gacagcaggg aaaaaagaat agagaaagga
480gtaaagaacc tgtagattaa aagacattta agggacatat gaaccaggtc cagtgtatag
540atcttaccta aatcctgatg gagcaaacta taaaaaaatt tttttgagac aaatgtttga
600atacaggttg actatttgat ggcattaagg agaaattatg aattatcttg gtataagaat
660attgtcatgg gttttttttt ttgagtcctt acctgttaag atacatacta aaatatttgt
720gggtaaaatt atatgacgta taggagtata tgatttagaa aacggattaa aatataaaag
780gataaaatag gatcttatat tttgtgactc acttcctgtt ggatatcttt c
831322997DNAHomo sapiens 322tggccttgtt taaggtcctg atgagtattc ttataggtac
actgtgtttc gtttaattat 60ttccttagga taaatttata gaaataacat tccttggtaa
aagaatacat attttaaaaa 120ctgtattagt ttcctgttgc tgtcaaaaaa tttccagaaa
cttagtggca ttaaacaata 180caaattaatt attctacagt tctggagatc agaagatacg
ggtcttacta ggcctcacta 240ggctaaaatc aaggttttgg cagggctgtg ttcctctatg
gaggttccaa gggaccagag 300aaactacttt acagtagtta ttttaaggga atgaaagtga
agatggggtt gggcagtcaa 360agaggctgtt acttttcatt tttggccttt cagtagtttg
aattttttta tcatatacat 420gtattacttt aatttttaaa aagtaaaaag cagctgtgat
tcagtctctg taatttagat 480caatttacat caaactaggg tggtctcatg tgttgtcttg
ctcacagtga ccactagatt 540attccaagaa gggacaattt ccaagacttg gtttacactg
agacggctcc tgattttaag 600gataccttag atcaaactct aggaaggcag tttcattttg
gccttgcagt tccctgggtc 660attttccaag cccatggcct cctggagtct tcgcctagct
gtaggttatc tttgtggcta 720ttatttcact gtaattatac aggaagattt attgagggat
ttctgtgtac cagccgtggt 780tctcagcact ttgtatactt tgtattaact ctgactcctg
acagtaactc tacagaggtt 840ctgctgttac ccagttttac atagaaacat ggccagcgga
cgcagttaga aaatggcaaa 900gtggggatta gaaactaggc agtttgactc cagagtctgt
gcccctgtcc acttggctcc 960actgctgggg aagaggcctc tgaagcagca ggaccat
9973231165DNAHomo sapiens 323accccgtcat agcacagttc
ctgagttaca tctttacata ctgtagtatc cttcttgtga 60aaaaagatac agattccaaa
ggtctgagaa accaatcttg gttataaagg ggaaaaatgg 120tcatgggttt ttaaaatttg
ttttgtctta attgcatttc aaatttacat ttctaaatga 180ataattgctt atataaagca
gttttgatta acaatataaa acactatcta tttggagtga 240ttcctttacc catttctgaa
ggcaagtttt aaaaattact agaagacact tcattgagaa 300tattattaaa catgcctata
gttctaccac ctcaacacaa ttgcttatta acacattaat 360gttttggtgt gttttggact
ttttaatatg tatttttcac ttgttctagt aattatgcta 420cagattgatc atttcttttt
caacatgtca tcaaagcaag tgagcaaagt gctcatcgtt 480gccacatatt aatacaaaat
ggaagcagca gttcagataa cctttccctt tggtgaggtg 540acagtgggtg acccagcagt
gagtttttct ttcagtctat tttcttttct tccttaggct 600ttggccagca taagccatgt
ggctcatgtt actattacaa cgaaaacagc tgatggaaag 660tgtgcataca ggtatagtgc
tgacttcttt tactcatata tattcattct gaaatgtatt 720ttttgcctag gtctcagagt
aatcctgtct caacaccagt gttatctttt ttggcagaga 780tcttgagtac gttttctttt
ctccttattg ataaattgat aatcctcaag gatgattatt 840aggtgatact cttacttcat
ggattcttaa aagatatgat ttaacatatt acaagtgcct 900agcaaggtgt ctgttacacg
taggtatttt aagtaaatgg tagctgctga tgtaatttct 960gcccctttgc ccttcagttg
gggtattgct ttggaccgat tagagggctg tggctgggat 1020gctaaaggtt catgtttcct
tagctggctc ctgagccacc agctcccacc acctgtgtat 1080acctgtgcta gtttgccttc
ccacaagtag ctgctggcta tctgttatgc tggtacagtt 1140ttcagaaact gatgaatggc
ctttg 11653241275DNAHomo sapiens
324gtggcgtgat atccttgatt ctatcagcaa cctataaaag tagagaggag tctgtgtttt
60gattcagtca cctttagcat ttttatttcc atgaagtttc tgctggttta tttttctgtg
120ggtaaaatat taataggctg tatggagata tttttcttta tatgtacctt tgtttagatt
180actcaactcc actaatttat ttaactaaaa gggggctctg acatctagtg tgtgtttttg
240gcaactcttt tcttactctt ttgtttttct tttccaggta ttcagtacac aatgcaggca
300ttagtttctc agttaaaaaa gtaagttctt ggtttatggg ggatggtttt gttttatgaa
360aagaaaaaag gggattttta atagtttgct ggtggagata aggttatgat gtttcagtct
420cagccatgag acaataaatc cttgtgtctt ctgctgtttg tttatcagca aggagagaca
480gtagctgatg ttaggacact acccaatgcc tcaaccgtgg acaatattcg ctccatcttt
540ggaaatgctg ttagtcggta tgtcgataac ctatataaaa aaatctttta catttattat
600cttggtttat cattccatca cattattttg gaacctttca agatattatg tgtgttaaga
660gtttgcttta gtcaaataca caggcttgtt ttatgcttca gatttgttaa tggagttctt
720atttcacgta atcaacactt tctaggtgta tgtaatctcc tagattctgt ggcgtgaatc
780atgtgttctt tcaaggtctt agtcttgaaa atatttatag tgtagtagaa ctattttatc
840ctccaatgct ccttcttttc cttgtatttc cattatcatc actttaggat ttcacttatt
900tatcattcaa catttattaa ttgcctctca tattccaggc tttgtgctag aagttaggga
960tataaagaca aataagatat ttcctgccct taaagactag attcgtgttg ctaagtcttc
1020attatcaaga aaagcataag tggggaaaag tgcttgcatt atggattcct catagttgct
1080cccctctgca tgtaaaaatc accatttcca tcatagattc ctagcggtct caggacttta
1140taaagcccaa agtgcctatg tcataatatg aggaaaaata ctgagaccct tccatatatg
1200ggaggtatat ggatgagaca gctcctgact tcacttttcc cagaaatctg aaaagcagca
1260gcagtcattc cagag
12753253164DNAHomo sapiens 325tgtgctagat gcctcactgg aaaaataaag gacatgatgg
aaaactctgt agggtcagag 60aaagggatca ttagagaagg ttctttgaag aaatattttt
tgaaatatga aggataaata 120ggaattaact aggtaccaat aggttaggag tagagctttc
cagacagagg gactagttct 180tgggaaggtc tccagacaga aataagtgtg gcttgtctga
ggacctctta ttcgcctatt 240aaccttccct ccccagtaaa cactcctggg aacaacacac
attgtagaac cacgttgtgg 300tgctgttcag tatagcaagt aattcagcag agataagttc
ttggaatctc atctttggga 360tttagttact aagatacatt caagtttgag caaaataagg
tctcagagct tggattcatt 420gttctgttcc agcaattaga gcagtacctg gcacatagca
caagtgcttg aaaacactga 480ctgagtaggg taggtgggtg agtgggtggg tgggtgggtg
ggtggatgga tggatgggag 540gatgggtggg tgaatgggtg aacagacaaa tggatggatg
aatggacagg cacaggagga 600cctcaaatgg accaagtctt cggggccctc atttcacaaa
gttagtttat gggaaggaac 660cttgtgtttt taaattctga ttcttttgta atgtttgagt
tttgagtatt ttcaaaagct 720tcagaatctc ttttctaata gagaactgat agaaattgga
tgtgaggata aaaccctagc 780cttcaaaatg aatggttaca tatccaatgc aaactactca
gtgaagaagt gcatcttctt 840actcttcatc aaccgtaagt taaaaagaac cacatgggaa
atccactcac aggaaacacc 900cacagggaat tttatgggac catggaaaaa tttctgatcc
ataggtttga ttaaacatgg 960agaaacctca tggcaaagtt tggttttatt gggaagcatg
tataattttt gtcctaagtc 1020tgtgctcagc cctcccacat gtgctcattg ctggttgact
gttggagtct ggttcttacc 1080tctaagagga agcccaggag agggcataaa gccagcacac
tgtcctcacc tgatggtgtc 1140agagtcctta cgagtaagcc ctagccagaa cattgctgga
agagatcaag ggccactgtt 1200tgaaattgca cagcaggata cggaaaaggg gtaccttagg
tataggcatt gtcattaaag 1260aaattgctaa gatacttgag attttcctgt ttaaggaatg
agctttatga tacaaagagc 1320agttctaaaa attagggagg gaattaacta aattaattag
gatatttctc aaattccttt 1380acagtttttg tctctctgct gatatagtgt ttacatgatt
gttatttact aaacaaatgc 1440tattttgtat tgtgctcctt ataacttaat tgtttattac
aaggttttga tggtgaccta 1500ccaacaacaa gtaatcccaa acacagtctg aattttttgt
tttccatcca gaaataagat 1560gaatctttcc atttccgtgt tttcagtttt catcattttt
atcctatagg ttacttatct 1620ttattttaaa gcatttcata ataattttat agtttttgtt
ttgtttgctt gtttgctgtt 1680ggaaatggaa tattccctcc ttccatttag actgctaacc
agctgtaaat gtttcaaaat 1740atgcatgttt tacagcagtt gttcaaagca atacaggaac
agtaaggaca gagccagtca 1800ttttacaacc acattctgtt aaactgatgt ctattagcag
ggtttttcct attttattag 1860gaaggactta cacctgatat ataacaaagc ttgttttaat
caaggctcag aaaatgtttt 1920tcattagttt ttttcctaac catgaagaat aactgctttg
taacacacat gctggctata 1980aagcagacaa aaaattcact gtaggtgctg cctgactggc
ctctgtccgt gtttctgttg 2040gggctgctta ccacagcctc tgcattatca ttagctagtg
tgttcacaat accaagttcc 2100cagtagcaaa gaaaggtcaa gctcttacgc atgccattca
tttatctaca ctgtgcaggc 2160gcactcaggt ggcagggaca aagaccactc ctttggcgca
tctcaagttc agaattctca 2220gtagaggggc tccagctgtc cttttgtcag gtgcccatgc
ctgctccagg cctgtgtggt 2280caggacacgt gttacagagt acagtgacat taatgatggg
gccatggata tggtcagcac 2340tcagaggatg ttagtctctt cattgataaa gtcacaacca
cttttcctgt tggaaataaa 2400aagatttgac gtatccttgt ctacagcaac acaggacaac
agataatcag caggtcatct 2460aaatctgttc agagagaaag gagagctgtt tcctgaaaat
acatcttccc ctgattttag 2520tcttattttt ttctgccttt attgctttct accctcttca
aaccagcctc atttcctaaa 2580ttaccttgaa tatgcattga cacttgtact gcctgaaatt
ctggaaaact cagtatggct 2640actccaccgt cagaacttcc tgagcaaagt tagttgctct
ctcggctcac tgttttgttt 2700tgttttgttt tcctgcctca ggtttatttg tacaaatagc
acaggaggac cagccccatg 2760cagatggtag cccaggggcg ggggtagggg gtcacaccag
tccttctgtc ctcatgttgg 2820cagagatatc tactctgaag cctttgtagg ggcctgggca
cctttgggag cctgagctgg 2880aactgaaggt ggagctgcag cctgggcctt ggtttgatcc
ttggccttgg cctttggccg 2940gcacagcctg agccccttgg caatacgggc acgagcacgc
ttcccaagct tgggatgggc 3000aatgtaggca agtcgatcga gcttgcggct gacacccttt
gggatcttgg gcttaacctc 3060cttgggcttt acgagggcct tgatagcctc ggcacgtgca
ctcatggcct tggcattgtt 3120ggcctgcatc ttctttaggc ccttcttgtt gtgcttcttg
gcaa 31643262468DNAHomo sapiens 326cggaggctct
actgttggac tgctgtccac tctggaactg cggaggctct actgttggac 60agacctgggt
taccagccgt gtgactagcc ttccctggcc tccatatccc cctcagtaat 120gaaggaatgt
gtcatcccca aatccaggga cagttacaag cagtcagtga acagaaagtg 180tctggtacag
gttctaagtg cttattattc taagtcactt cacttacctg agttctcagt 240tttcctatct
ataagataag caggttggat aaaatgttct ccaatatact cctggtcctg 300agatgatgtg
attgtgggca gccctttaat catggtgaag atgttcatca taagcacact 360gaaactacaa
aataggaata taaatatttt ctccattaaa ttatgctgga tcctagaagc 420aaaaactgga
actgtgaaac cctacttcac agaaaactta aaattcccaa gcagatgaat 480gcttctcgga
aggacactga cagttaccta cctggaaaga atctagatgg aggtggcatg 540ggcactaagc
ggtgagatta aacccagtta gggcagcccc accagccttg gaacccacac 600atctggagat
tgttgatgca gagagaaagg ttcctactgg tgagacctga aagggatatg 660tggcaggtgg
gaggaagaag ttctgtctgg aaaccaaccc ttgttcctcc gttattgatt 720gactcctggt
accaacatga gccctaggtc ttatagaggc cataagtccc tatgccttat 780agtgcccatg
gatgagatga ggccacacat gcccccagtg ggttaacatg tctagcgtgg 840gtaaggctct
tggagcacta tgatacacag gaaatgccca gtaactctta gttggtttga 900tatctgttcc
cattgctcac ttaagctcag tgccccttta ctgatccttt tattctgcct 960ccctctgcac
atgtgcattg agactcctat ctgagacaca cactgtgttg ggtgcccagg 1020gatgcagcat
agatgttgct gccttccaca gaagcgctca tggtctgcta gagaatatat 1080cccatgggag
agaaaaacag actcgggaga atatagcagg ggcccttgtc ctggactttg 1140gcagttagga
aagggaggga agagacatgg aggctgggac ccaaaggcta aataggaatt 1200tgctgggcca
aaggggaggg ggaatgaaaa gagtgtttct ggcagaggaa atggcaagga 1260taaaggcctg
gaggcgcaag agaatatgtg tttgaggatc tgaaagttga gtgcagtggg 1320tccagtgttc
tctaccctgg ctgccattag aattacctgg gaaactttta gaaaattcca 1380gtgtctgggc
cctccctaaa acaataaatc attcttgggt ggtggggtct gggcatcagg 1440attgtttaaa
accctcccca ggtactgtca tgtgcagctg gggttaagct gtgctggggt 1500ctgagtatgg
atctgttagg gcaagtggcg gtgatggagt tgaggctgca gaattcaggc 1560caaatagaga
ggttttcatc aggatattaa agagtttaga tttcaatttg gtgggaatgg 1620atgggatctt
atttgcattt tatgaagagc tccctggttg caatatcaga atggattgga 1680gaggagcaag
atggaagcct acagtgattt gggagaagtg gtgagggact tgagacacag 1740gaagtagccc
cattcactaa tagttgagta tgtagatttg ctaggacctg gaaatggttt 1800ggctggtggg
gagtgggaag aaaggcccaa agtgtgaaat gaagatggag agcacattgc 1860ctagcccaga
gtgattgcca tttgctctgt cccagttgag gtccaagggg ttggccagag 1920atcatggagt
ctgtggctcc atggggagaa gaacctctca gcatgcctcc ttgtcttatc 1980ctgggttagt
cagattcatt ttgttagatt acattttttt tccagtggaa ctctgcttaa 2040gtcctgacca
gtatgttttc agaaggatca gagggcctgc ccttgtccat tggtgcatga 2100caccagcttg
gtgggttcct tgctgctccc tgttttcata gggttatcag aataccttct 2160ctccctgcca
ccagcaggtc acactggctc ctgacttttt ggcccatgga accaccatct 2220ttctgcttct
tagattgtgc cttgtactcc actgatcatg gccagtacat cagaagccct 2280ggtttgcagt
gaatgcattt gatatggaaa tcaggaaccc tggggatacc actcatcata 2340tttggttgct
gtgtttttcc tccaatcttt caccataaca acaatcaact caaaagattt 2400ctataaccac
ttgtgtgggg gtttctcccc acacactaaa caagcagtca gttccagagt 2460ggacagca
24683272826DNAHomo
sapiens 327acatcagaag ccctggtttg cagtgaatgc atttgatatg gaaatcagga
accctgggga 60taccactcat catatttggt tgctgtgttt ttcctccaat ctttcaccat
aacaacaatc 120aactcaaaag atttctataa ccacttgtgt gggggtttct ccccacacac
taaacaagca 180gtcagttcca gagtggacag cagctggtct cctccaattt aattccaaca
ctgtctactt 240ggagatagca ttagatccca caggttgagg gtgcagtccc ctagactgcc
cccagtctcc 300tgcttcagac accagtcaca agtccaggac tctagaagtt ctgaccagtt
tcaagttggg 360gttcccacaa ccccccactt tatttttgat taatttgctg gagtggctca
tagaactcag 420ggaaacactt agttttctgg acttattaca aagatttaaa aagataccaa
taaatagcca 480aataaagaga tatacagggc tagatctgga agggtctgga gcgcaggagc
ttctgtcccc 540atctacttgg ctcccagcag atggatgagt tcttattcat tttcttgtca
gcttcgacat 600gttcagctct ctggaagccc gcaaactctt gtcttcttgg gccttttatg
gagacgtcgt 660taggcaggca tgattgaaac atggacaact gtgtcgaaat atgattggac
ataaaggggt 720ctaaactcag tgaggcctgt ttgttcagat tcttcttggc ctctctgtgg
ccattctttc 780ctccaggata tggggcagga cccctatgga atgagggtct tatgacccac
aatcaaatta 840gagtcctgcc ttgggcaagt gaaaggaaag caggagaagg taagagaaat
tctgttgcct 900aagaccttct gaggcctaaa gcaccccaac attataacag aagacgataa
caggactatg 960ggagttatga gctgggaacc ttggacaaaa atatatacat attaaataaa
tattaagtgt 1020atatatatac ttacgtatat taagtgtatg tgtgtgtgtg tatatatata
tttttttaat 1080ttactggttg gttttgggaa gcagaaatta ccataactac tcttaaaaat
cttttaagtc 1140tctttgaagt tagaaaagtc actgtacctt tttgtttcca ttggccctgt
acttcttatt 1200ataccccagc aggaggagca taatgtgttg ttatatcatt ctggtgataa
gattcataag 1260tgggttcagc tggtgacagc ctgattccct cattgtaaac ttatccatca
acatgtagct 1320taatcgtttc accttttgtg atgaccatta cctgaatcag ttatttcatt
agattgcaag 1380attatgcttt tctgatttta tcatttcttc tgtattgact gtaattcttt
ggtatagaag 1440aactttccct tgttaatagc tatttggttg tcctgaagta cagttcttac
tagaaagtaa 1500gaccaaatgc tgaattatat ccctctagct atcaattttc gaaggaatga
atggtgtcct 1560agtaatttcc agtggtgttt aattacgttt tcccttctct ttctccttct
cttattccct 1620ccctctccat ctcctccctc ctcactttca gttttttgct ctttcagtat
tttgtcatag 1680ctgttaacag agcaacatat tttaatcaat tgtagtcatt tttctttttg
gtgctcaaat 1740tatcccgtct tagtcccatg gaagcaagcc cttggagcta gggccctcta
ccttttgatg 1800gatttccatt tgtcttgata atttccttgt ttctgacaag acaagatgtt
gcaggcacat 1860tttatacttt cccagcccaa accctggaat aggccttttc tccgaggagc
tctagttcat 1920tttagtggga aatggtattt agagactata atctgggatc tgggagtcct
cattgctact 1980gagtagtcat tacttttagg cttttccagt ggtcagagct aggaaatatg
tatatttaaa 2040aatggacagt tgaatggttg ttgccaggag ctgggaggaa ggggaagtga
gaaattgttt 2100aatgggcaca gagtttcagt ttggggaaga tgaaaaagtt ctagagatag
ctggtggtga 2160tggttgcgca acaatgtaaa tgccactgag ctctcattta aaaatggtta
aaatggtaaa 2220ttttatatat attttaccac aataaaaaaa agtcttcttc tgggagcacc
cccccaagac 2280aaaaatatga aaattttaca ctgatacttc catttcaaga taattttaag
attataagga 2340ttttgcttaa ttcttgaatt ttatacctgt aaacctttta tacttcaaat
ttcgggcaga 2400attgcttcta taacaatgat aattatacct catactagct tctttcttag
tactgctcca 2460tttggggacc tgtatatcta tacttcttat tctgagtctc tccactatat
atatatatat 2520atatatatat tttttttttt tttttttttt aatacagact ttgctaccag
gacttgctgg 2580cccctctggg gagatggtta aatccacaac aagtctgacc tcgtcttcta
cttctggaag 2640tagtgataag gtctatgccc accagatggt tcgtacagat tcccgggaac
agaagcttga 2700tgcatttctg cagcctctga gcaaacccct gtccagtcag ccccaggcca
ttgtcacaga 2760ggataagaca gatatttcta gtggcagggc taggcagcaa gatgaggaga
tgcttgaact 2820cccagc
28263283843DNAHomo sapiens 328tcggtctcag tcaccatttg tctaagcaaa
ttcaggcagg cttcaccttg cctttctaca 60tttgttccct tttcttagca ttttgggcct
ttgtttacac gtgggaaaag acccacaggt 120cgtctctccc tttgggcagg atacaggctt
cctgtgactg aggttttgct agctgtagaa 180gtggctgcca attggcttct ggtttttatt
tccatgattt gctccagtgg ctcttccctt 240ccatcattgt tagctttcaa gctaggaact
tttaaaatgc ttttaaataa aagtgagctg 300ttacttgatg catttagcag tcttcctcac
agtggttttg atagacagac tccctcagtt 360tggaatttat gagttttctt taagggtttg
tctccctcat gtatagcagg ctgttgaaag 420ttacaatgtc aataactttc tgaatagtat
caaactgttt tcagtgcagt gtattaacaa 480aactaacctg cctcaagttt ggtcagcttt
ggagtcttac tgaggctaaa atgataaatc 540taaatgattt aaaattgtgt attcctacac
agtatctcac ttaattatgt aatagtcttg 600tgagtgaggc agagcagatg ccgttttctc
tattttaaag atgaggaaaa tggaatggaa 660aatggaaagg acagactaat tgcaacatcc
tcgcaatcaa aaacaggccc aggttcatgc 720cttgttggca gtgggttgct actggctgtg
gccttcatgc aggaaggcta gatgcataac 780caggtcaaca gcccgtgcag gacaagcacg
ccatgtaatt ctgattccat cgactgaggc 840tggtgttttc aaacgtgctg gtgtagggtc
ttacagacag agtcatctgt gctatgggga 900atggaatgtg ctcttgcttt ggagccagaa
ctcctctgaa gctcccacca cctacaccat 960tcagaggcca gacagaaatt tgttcaccat
tttgggcatg attttcgtgc ttttgtaaaa 1020tgtgcttcac tgcagccctt actgggctgt
ggtgatgaac acttaagata ctgtgtgtgt 1080gctttataat ctgtaaggca ctgttcaagg
ggagggacct ctgccatgag cccctaccca 1140ctggtatctg gttgacatcc aaagccccag
cctgggagaa gctgattctc tagttgaatg 1200ctgtataggg atttgactga ggctcagatt
tggtgaggaa gaccactaac cttaacagac 1260caacaggctg gctactccct gatgaagttc
cccaggccat gaaagaagta agagatacat 1320tccttgtaac agctttctta gttgcacctg
tatgattatt tgatcagtgt gttgtctgtg 1380cagggatcat gtctgtggag ctcaccacct
cgtcctcggt gctgagcaga gtgcctggca 1440tgtgtactca gtagatattt gctaagggag
cgagtcagtg attgagagga gcagcctggg 1500aggtaaagcc ctagaatctt tattttaaag
ggatatcaaa gttgaacatt cagttagaca 1560gttctcttga gtccagggat ttacccatcc
atggtggaca cactttcagt taaaaagtaa 1620ggttaatttt gacaggttgc agtatccagg
caagcattct atggaataag gctcatctca 1680gggattagta atgactgaat taacttactg
ctagtcccat aattttgacg ttaattaatg 1740gggttaagaa atgtcataag ctatttggta
ccatttaaag tgaaaatacc cttaacgttt 1800tttgcctcca gatatccaca cttaatttca
ttttcttgct ctttggtgaa cagtcctggg 1860tctgaatgta tatatccatg gtttgtcact
aggtgacagg tttttttgga acaagaaatc 1920agttcagtga acatttgtca agtatcttct
ctgtaaaaag tgtaatgtgc caagctcaga 1980agtaggaagt gaaatggata aactatgacc
cctgccttaa agaacaccat ggtgttgtat 2040gggaattgtt taggtagaat gaaagaaatc
ctctaataga gatatgaggc cagttcagca 2100gaaagccagg gtgagatctc ctgagaggga
tggaagggtg tcttgatcat ctctggtagc 2160agcaaaggca ctggcataca gtggccactg
gaagacaacc agcaggggat gggggcgttt 2220acccttgcaa gtgagcatta ggaactagag
gactgattgc cctttcttca gctttggttt 2280cccttgctgc agaaaaagat gctgagactc
atggcctcgg ttatgaactc agatatgtgg 2340tttggctttg aagcacagat ggattttgtc
cgattttggc agggaaatgc ctacagacag 2400cactatgggc atatttaggt tagggacgaa
atgcaagttg attaagtcct gataagaggc 2460tgtgaagagg tccaagaagc ctcacaatgc
ccaatgaaga aaagccctgt gcttggtgct 2520gccgcctccc ttccccgtcc tgctggcagg
gctgcgcttc agtagctctg gatgcgtcag 2580agcagtccat gaacattctg tgtggaaaat
ctctgactgt tttagtggat tacactgctc 2640tccctttcct ccagtgcctc gttattcagt
attatttgat gttctccagc ttttaaaata 2700atcattttcc gcctacgcag aacatcctgt
agagacgttg aggttccagt gggaacagag 2760aggaatactt attctaaaaa tgaagaaaat
aaaccttttt ttatggagtg ggtgatagta 2820ttgcagaact tctataatag tatgagaatt
cacttgtggt gccaaagctt aaaaaaaaag 2880tatagtaaaa acataatgta taggcttatt
gctgtgctat gacccatgcc ccgttttctc 2940caacctctct tgtcctcact cttccttttt
gctggtgata tttttactta tttcatgaaa 3000aaaaagataa catatacaca cacatagata
tatgcacaag tatatgtata tatgtgtgca 3060taacacacat aaacatatac attggtaaat
ttaaaaacat atttatgaaa tatatgtagc 3120atctacagaa aaacatgaac acttgtgaga
atagcatctg cctaaaaaat aggacatcac 3180catcaccttt gaggctctta tgtgctgctc
ccctgtgcca ttcccttccc ttcttcctta 3240gaggtgatta ctattctaaa ttttgggatt
attatttcct ttttttatta tagtgtttta 3300attacagttt tattacctgt atttgtattc
ctaaaaattt gtttactttt gcaagcttta 3360gattttataa aagtagaatt acactgtaag
tttaattttt ctgtaattta tatatagcta 3420cacatatatt cctaagattc atccatcttg
ttacatatag ctctggttta ccttttctgt 3480ataatataga ttctgcttcg tgaatttaca
gttcattcat tcttctgtta aaggacagtt 3540ggaggactca tatggcctca gtctctgtgt
ccccacatgc caccctgctt cccagcctca 3600tatgagttga ttggtggcct ggcatactgg
atgagaagct ctaggtcata tatttaagag 3660agttattgct gggtcataaa atgacagatt
gttttccaga ggggtcatat tgatttaaat 3720tatcaccaac aattatattg tcagattttt
accagtttgg tgattgtgaa acagtgtctg 3780atggtagttt ttatttgcat tttcttggtt
gaaataaagt tgtgtatttc agccaggtgc 3840gtt
38433294221DNAHomo sapiens
329tgaacctgca atatctcaga ggtatgcctg tatctacttg ttctgtgata cttgttattg
60tcagtttgtt tggatttacc acatattatt tgatcataat tctttcctgt agatgtttta
120tggtctgcct aaacctttag tggggccttt gatggcttag tcctttcagg cttaagacaa
180tagaagttta tttctcagag ttctaaaagc tgggaagtcc aagatcaagg caccgacaga
240tttagtgtct agtgaaggcc cgcttcctca tacatggcac cttctagctg tatccttaca
300tagtggaagg gaatagctag ctctctggag tttctttcat aagggctaat cccactaatc
360ccaattatga gggaagacct aatcacctcc caaaggcccc acctcctaat agtatcacct
420tgggggttag gatttaacat atgaattttg tggggacaca gacattcaaa caatagccat
480ggcaaacttt tttgctttgt ctaattcact cttattttga aaagtatttg tgttgggttt
540aaaactccag attggtaatt attttttctt agtgcattga aggtaatagt gtatcatttt
600ctgatttcta ctcttgctct tgaaaattca gctatcaatc ttaaaattta ttacctgttg
660aaaatccagc taccagtctt atattttatt tacttagtgg gtaatctctc ttctgagtac
720ctttaagatc tcctttcaga aataccatgt agtaaccctg tgtgtcacgt gtggattttg
780ttgggcttgc tagctgagac ttgacagttt tcatcacttc tgggatattc tcaggtattt
840tgtcttcaaa gtcttcagat attgtcctct tcctgccctc tctccgactc cttctggaac
900atgagttatg tatttattat ctcccatgtg cataagttat ctttacatat tttcaatttc
960tttatctttc tgtgctacat tctggataat tttgttgatc taccttccag ttaattagct
1020tgttaacttt gtcaaatctc tttttaagtc tatcttgatt tttcttttca attattgtat
1080ttttcatttt taaaaacttt atgtgctctt ttggaaatct tgatcccagg agatagtgga
1140tagtgtcctg ctgcttactc atggttttaa tagttcttga gcatgctgaa catacttatt
1200ttatgttatt tgctaatctt tccaattcct gaaaccttta cagatctcat tctgtggatt
1260cttctggatt ctaattcatg gggcattttt tttgtttttt gttaattcct catactttat
1320ctgtggggaa ttacttgaag cctgggttga caatgaaatt ctgcagagag aatttgcatt
1380tgattctact ggaggaacag tcagccccga tatcagttta aattaaaatc tctgcttaag
1440gttttcaggc aacctgctta gcatgaatcc tggctggaaa agcatgtgag gaccagttta
1500tgattacaca ttcacagggt gtcatgtttt cttccaacac caatgctaga ggtggcagtt
1560ttgcttactg cccttggagg gacaggggag tgggcatggg catagtagta tggttttcct
1620tttcactggg ggtgcagccc ttggagtctc agcttaatgt gttggggaag tggtctccta
1680ttagactctc catttcaaac cattccatga ttttgtcctc cttttgccac cttccgagcc
1740tgtaaaaact aatgtttgtg attcctgagg tttctctaat gtcttttaat aaagttgacc
1800tcagagatct cgttacctct ctgagttcct gctttgtctt agattttgat ccttgagtgt
1860tctttaatct tttagcaatt ccttgttgca tgttaaaaga ttagttatat tttattcctc
1920atttgtgttc gttttcacca ggaggctcaa ttcaggcttc tttgcttact tggtgtctct
1980agttctggtg cctggtgctt tggtcaatga agtggggttg gtaggattct attacttacc
2040tgttttttgg ttttattttt tgttttgcag ttctccggga gatgttgcat aaccactcct
2100tcgtgggctg tgtgaatcct cagtgggcct tggcacagca tcaaaccaag ttataccttc
2160tcaacaccac caagcttagg taaatcagct gagtgtgtga acaagcagag ctactacaac
2220aatggtccag ggagcacagg cacaaaagct aaggagagca gcatgaggta gttgggaggg
2280cacaggcttt ggagtcagac acatgtggtt tcaaatccaa gttcgaccat ttcccattta
2340tttgactgta gacaagttac attcctaaac tatgtctcag atttctcatc tgtaagttgt
2400ggtattacta gttaacatgc aggggttttg tttgtttgtt tgtttgtttg tttgtgaggg
2460taagaaataa cccaagaagc ctagtccttg gtagttgctc agtgccctat aaatgttgtg
2520aaccaggtgg tgagggtttg gtgctgctag agaattctgg tatctgctct gtgcaacaga
2580gtactgtagg tgatgcaaga gaaagaagac ctgatgcctt ctttcctccc agctttgaga
2640atggagcaaa ggcctacccc agccaccaag tgagccagtg ggcttgatca gcacaggaaa
2700ggtgaccccg gcagtttcat ttgactattg catggctggc aacatttcta ttgattgttt
2760ccagggacct tggcggatga gctcctgttg agtctagcat ctctgttaaa tctgttctca
2820aataggtaat gcatatggga ggatgctgcc accttgcatc tactagacat cacctatcta
2880ctgtgagact ctccctctaa gccctgctgt ggcctcagag tgcttattgg ccctgtgagt
2940ggggcagcca ctatacattg catggagttg gtacatgaga tagaaaccta ttcgccatcc
3000cttgaaactg ccccagtcca gaagcttcct gttagcacat gtacctcctt gtatgtattc
3060agaactcatt ccatttaggc ttggaaaccc gtttggtgca actctgttca agttccattg
3120tctgctttga gaatgcttgg gcttgtatag tgagctgtca ctttttaatt tgttaggaat
3180tctactcgcc ttgctttttc ttttccagca tgtttaaggg aatgacctcc aaggccccaa
3240atcacagttg tattcatgtt ctttcatttc acagatacaa tccaggccag tcccagattt
3300gcagctgtta ataaatgtga atggttttcc agtaaggggg tagaaaaaca tagggagaga
3360accgggttca gagttcaata tctggattca agtccttcct ttagcacttt actaactgat
3420gtagaataag tcagctactc aataggtgcc tcagtttccc caccaaaatg cagacataga
3480aggtgctttg tctgctttga tgagaagtct ttaagcaagt ctatggggtt caatgtgttt
3540taagaactat aaagtaccat ataaatgtgg cctttattcc cattgtgttc ttggaagtaa
3600ttcaatatag tgtgtacttc atagctgctt ttggactatt gccagccagt gtatcatcct
3660aaactacatg tcagcatagt ataatcctgc cttaggtcta cttttgatta tttaggaaga
3720ctccctgccc ttcctataca tttcacataa tttttaataa gttgtaaaaa agtgatttat
3780aggattcttt gtaagtgggg gaagttaagc agacaaaaag tttttaaatc ttactgcaga
3840gtgtcaggaa ccttttatag caccagacag gtagggacag aacatgagtg gcagcaagcc
3900agacttggtc ttagtgctct aacctgtctg ttagaggctg gccagtcaga cccctggttg
3960aagacgttgg gaatcccagc tctttggagg ggtaagagat tttgttagac tgttaaccag
4020attccacagc caggcagaac tatttctgtc tcatccatgt ttcagggatt acttctccca
4080ttttgtccca actggttgta tctcaagcat gaattcagct tttccttaaa gtcacttcat
4140ttttattttc agtgaagaac tgttctacca gatactcatt tatgattttg ccaattttgg
4200tgttctcagg ttatcggtaa g
4221330683DNAHomo sapiens 330cccagcccat atattttaaa gctctgttat tgggtacata
aacatttagg attgttatat 60ccttttgata atggactctt ctattatgaa aagataatat
actgtgggtt tataacatat 120gtaaaagtat gagtaacata ttatcagaag gggagaaatg
gaagataact taggcatctt 180atttttaagc atagttttcc ctttgtttct gcattagatg
atttacctga aatgtcattc 240aatttaactt actctccatc ctcacccgcc cagctttggt
tatgaggcag tagaaagaaa 300tgatctgcct gtggttttct agaaatacga aagttgagtc
cttaaggcta cacagaaaga 360aagtacctcc ccagggcttc acccttccca tcctttcagc
aggctttttg tctgtcgtat 420cttctctgtt gaaatggcca ttgacaagag gaggaaaggg
gttttgttgt ggattgttca 480ggcacttcct ttggggtata tgggggatga gtgttacatt
tatggtttct cacctgccat 540tctgatagtg gattcttggg aattcaggct tcatttggat
gctccgttaa agcttgctcc 600ttcatgttct tgcttcttcc taggagccag caccgctctt
tgaccttgcc atgcttgcct 660tagatagtcc agagagtggc tgg
6833311799DNAHomo sapiens 331gacatggaga gccgaatccc
tgcaggccat tataaatgag attatgccat ttgctcccat 60ttcttcttat tctttcattt
ttggggctct ccatcttgat gtgttctttg gatcgtgaac 120agatccaaag aaaaggttgt
tctgccgtgc tgtttgtcag gatgaaaaac tcttttttaa 180gtgtttaggt ctgcccccag
tgcccagccc aatcaagtaa cgtggtcacc cagagtggca 240gataggagca caaggcctgg
gaaagcactg gagaaatggg atttgtttaa actatgacag 300cattatttct tgttcccttg
tcctttttcc tgcaagcagg aagggaacct gattggatta 360ccccttctga ttgacaacta
tgtgccccct ttggagggac tgcctatctt cattcttcga 420ctagccactg aggtcagtga
tcaagcagat actaagcatt tcggtacatg catgtgtgct 480ggagggaaag ggcaaatgac
caccctttga tctggaatga taaagatgat aagggtggga 540tagctgaagg cctgctctca
tccccactaa tattcattcc cagcaatatt cagcagtccc 600atttacagtt ttaacgccta
aagtatcaca tttcgttttt tagctttaag tagtctgtga 660tctccgttta gaatgagaat
gtttaaattc gtacctattt tgaggtattg aatttctttg 720gaccaggtga attgggacga
agaaaaggaa tgttttgaaa gcctcagtaa agaatgcgct 780atgttctatt ccatccggaa
gcagtacata tctgaggagt cgaccctctc aggccagcag 840gtacagtggt gatgcacact
ggcaccccag gactaggaca ggacctcata caatctttag 900gagatgaaac ttgcccatct
ctaaaatttc gggatttctt tgtacccaac aaggttcaaa 960cacaacagtc agcttttatt
catgattttt acttccatct gctgatgtag aacatacctc 1020cagagtgacc tcagaaattg
tcaaatgtga aaacacaagc catcacagtg agaaatggga 1080ggttgagtta gattgtctaa
ggctggagag tccatatact cccactgtta gctctgaagt 1140gtgtagccag tcttcagatt
ctgggtcagt tgcctcagtc tctcttagct tttgccttac 1200tctttatccg accactgccc
tgccaggaaa acaaggctct ataactcctc ttacaggtca 1260gcttgacaca aaaagggtgc
ctggattcct aatgtttcat tgtcactttt cccagtcaga 1320tgataatgct tttcaaatca
acatatattt tgggggaggt tggaagggag agttgaaata 1380ttctaagaat caaagagtag
cccactttaa tcagagtatg acccctgatt gctcacagtc 1440atctcctgag cagtgtgagc
gagtttcaga tgaggaggct gaaggccagt caggcatgct 1500cgaggattcc aagtctgtag
gtgggagggc agagatttag tcctgttggc caaagcctct 1560agggaatttc tcactccagt
ggagaaggca acacacttac caaactgtgt ggaaactatc 1620tcatttgatt agaaatttta
cctcaagaag aggaaggaca gttgagaaag aacattttct 1680tacacatgag acagctaagg
cttacaagaa ggagaggaat aatgaggcaa aataatcctc 1740attaatattt tcattcctcc
cctggggatt agaactactt tcagacccga ttttaatgg 1799332545DNAHomo sapiens
332tccagaccca gtgcacatcc catcagccag gacaccagtg tatgttggga tgcaaacagg
60gaggcttatg acatctaatg tgttttccag agtgaagtgc ctggctccat tccaaactcc
120tggaagtgga ctgtggaaca cattgtctat aaagccttgc gctcacacat tctgcctcct
180aaacatttca cagaagatgg aaatatcctg cagcttgcta acctgcctga tctatacaaa
240gtctttgaga ggtgttaaat atggttattt atgcactgtg ggatgtgttc ttctttctct
300gtattccgat acaaagtgtt gtatcaaagt gtgatataca aagtgtacca acataagtgt
360tggtagcact taagacttat acttgccttc tgatagtatt cctttataca cagtggattg
420attataaata aatagatgtg tcttaacata atttcttatt taattttatt atgtatatat
480tgtgtcagtt cagatgccaa aaagaggtct tgaacatgtc acaggctctg atggcactga
540ccatg
545333578DNAHomo sapiens 333agcctcccaa agttaagtgc tgggattaca ggcatgagcc
actgcggccg gcattaagta 60tgagttttta agttagccca ctttgttaat gactatgagt
actaatagct taagataaag 120aagtttctag gtaatcttgt ttgaaggatg atgtaaaaat
ataaatttaa actgtgagtg 180acaaaataaa cttccttaat atttgcctac atttagagaa
atggagcatt cagctcagaa 240aggaagaatg tctgtggttt taaggtaaaa tccatattcc
aagactcagt gaagaaagtt 300cagtgataaa gaacagacta ctctcatctt atgaagaaat
ggagcaattt cacttggaaa 360gactaggaag acaaaatgtt acagacgtat ttgttgtgcc
acaaaatagg caaggtcagt 420tttgaacaat aagaactcca taaagtagac cagggcatct
cagaagtgag gttccatgag 480cccaggtggg gcacaggctg ggtgatcttg agtggagagg
aagaggggtt ttctgagctt 540caagagctgg gccacacagt gtgttggttt tagctggg
5783342355DNAHomo sapiens 334tgccctcagc tactcactcc
cacagtcata taccagatct catcattaga caattgtaat 60ccctacacaa tttagttcca
tgtatcctct ctctaaccac tattcctcat ctttccaggt 120cattctctct agacccgaat
tccaacaacc cttcaaccac actggtacca ctaatctaca 180gattacatct tctttctact
ataccttgat gtgttcctga atatctcccg aatcctcttc 240atccagttta atttcaaggt
ccatcattat aatcattttc ttacatactc cctcacctct 300cctgccccat taatactgtc
ctagtaaaat ctagctctct acccactcca tgcctgcccc 360tatgctgctg taagtagcca
gagaaacaca tataataaat gcattcacac aaaccttcta 420acatatcata taatattgtc
tgatgtcttc ctactagaat gcctctcagg caggaatttt 480ttttttctaa actaatttat
tcactgaaat atcccagtgc ctagaatagt gcatgttaaa 540tagtagaatc tcactcaaca
tttgttgaat gactgaatag gagttccaaa atagagaaca 600cagcatatgg gaggggaaaa
aaatcagtaa caaaatcatt caagaaattt tcccagaact 660aaaggatggg agctcctaga
attgacaggg gcccagcatc acacatgaaa acttcaaatc 720acatgactat cttcaaatta
caccagaatg ctagagagaa agagaatagg atacaagctt 780ccacaaagag gagaaaaata
gatcacaaat cagaaaagat cagaactcaa aatgttcatg 840aaaactcaac agccatgctc
gaagtcacag cacaatgaag aaatgtcctt ttaaaaaatc 900ttaaggagaa ccatggcaac
tcaggattct ctacccagcc aaactatttt aatcaagtga 960gagggtagaa tgaagacatc
ttcaggcctg caaggtcatg aaaaattaac aatccacaaa 1020ccctcttctc aggaagctac
tggaagatgt accaaaataa gagaataaat aaggagaaag 1080gcatgagaca ccggaaaaag
ggaacccaac ctaaatcaca tgcaaagaaa atctccagat 1140gccaatgaag ggtgaccaca
tctatgtacc gagagggcaa gtcactagtt tagaaaggga 1200caagtcagat gcaccaagat
tcaacaaact ggaactgaaa taacaccaga tgcatctgaa 1260aatactgagt gggattaatc
tactcttgga gattctgtgg ctaaattgat gatagaaaac 1320caagcaaata caaagaaaaa
ccataacatt aactttagag gaaactaata gttctgaggg 1380agatgatcct agaatgcaac
ctggctccac tgtgtgagta gtgtttagag ggtcctaatg 1440acacaagcag gctggaatta
cactgttcct ttattaggag gatataagag tggaaaataa 1500gtatgtgtgt ggcagggaca
aaggatgaaa aacagctaaa tcctcatctt ccataaaagg 1560atgtcaatat agaatgcctg
aagcagaaca atcaagatgc aacataagta tgttatacag 1620agatacaagg acagtacaca
agaatcagct aaaagtattt aacagaaatg gtcaggggcg 1680aggtcagagg agccagggca
ggggactgct gtgttcataa caagctttgt aaaaaactat 1740atgactcctt aaactatgtg
tccttaaaaa aatgttttaa gaacagaaaa taacaaagag 1800gtaaaatatg aattatctat
ccttcatatc tcacttgagt actgatgttt gaaagaagca 1860tattttttta atgaacattt
caattagcca gtattttacc atgtaacttt gttaaaatta 1920tattacactc caataagaat
gcctttacct gtgacagtag ttcttccttc tctccagcaa 1980gttttcgtag ccttacatct
aaaacaaatg aaaaagatca taaactaaat atgtgatgat 2040atagtacata aacaattaaa
aatttttcaa actcataaac agctaatatt atctgataaa 2100ttacattact tacagctctg
aatatctaaa gaaataaagg tgttaatagc attacagaaa 2160agttcttaac tatctaaaaa
gtatttccac acaactgata tttatcaggg caccaaatcc 2220aacatttgtt ccccacagca
gtgatttgcc acttaaagac aaacagaagt acaaaggagg 2280tcatttcctt gtttcaagct
ttcactagta gacagacaac tcaaatgtca agtgtgttcc 2340taaaggctga gccct
23553354683DNAHomo sapiens
335gccagactct cgttccattc tccagatctc tcttgctcac ccagcatcct gttttattca
60aagtgcccta caatcacatt tctggaatgc acattagaga atgtgcttac taactttcaa
120aatgtttttc agtttgcttc acacttgtat ctctcactcc tctaagaagc ttacacatat
180atgaaaacaa gatgaaaaac aaaaaaattg ttttttttta aaataaaagt gagctaatga
240tacagtatct atctgtgcca tttttcttcc tctagagtag atttctttgt ggggtcaatg
300gatgggtgac tttgatttct cagacagagg tgtcagcaac tttgtggttt cctggagaga
360ggtgtcagat tctcaaaggg ttaaatttaa gaggtttaga ctttaagagt ctgggaagcc
420ctgctctgga agtcatactt ctctgatatc tttttggtca tctgtttctt ggcttaagaa
480atgtggtgga aaagaggtac agaaccctgg ggtaagcagt ggaacataaa accagatgtt
540ccaaggatga gaaacttata acacacttga gaagtctcct gctagcctac tgctccccta
600gcacaggtat actagactat ctctttgcag aacagtttgt agttaagtaa aaaccgatgt
660gtataggccc atagtacttc catccacagg ccttacagtt acacttattg ccttacagtg
720acccagatgc tgatttccca aggtcaagga tgtctgaaga caatgtgcca atgtgcccag
780attcttctag ttaaggatct acttgagtct cagcccttat gctgtttttg ttttccaagc
840tgggatatga aaaagcagaa aacccaatag ggtaacatta atccaagtca acatagcaac
900cagtatctta cctaatggcc cttctcctgc tgactccaag acctgagcag cttcctgaga
960cacaacagtg atggctccag ccactggttc atgactgaca tcaccattgg gagtgccatc
1020ggggattata actaagccat gtttctgcag gggggaaaaa cccaccatca caaaaggccc
1080gtatggaagc tgtaagctct gtgaggtcac tctgcaacaa tacatgtttg ctacaggtaa
1140aacctggtta gaatcagtta catgaaatat agctctgtgt aagaaatagc ttcaacctac
1200caaatctgga ttagagaata aacactgtag tttgtattta ggctaggaaa gatggcagga
1260tgaaaggaag gaagatagag agtaaaacag tgagggacct gaattccagg ctaatgctaa
1320catacctctc ccgtcttcac tgtctcctgc aggtcagcca gctcctctct gagcatatct
1380cgctcattcc taaggcaggc aatgtattct ttctgtttct ctagggcctg gttttaggta
1440aggtagcaag ggaaacaatg gcacagaaaa agagcaggtg aaaggtagca gagaagtacc
1500taattcaaat aagcaaagat aaaggcataa aaagcaagaa agcagtcaaa agattggaaa
1560caaacagtca gatatgggag gaaatacaga gttacatgga tatacatctc cagaagagac
1620ttctcataga aactggttct catgcatcaa tttggcaaaa catgtttaat cacatcaagc
1680agggaaataa atcttttcca gtcaatgaaa aaaataaaac aggaaaagga agataaagag
1740agaagccaga gtaaaataaa gctttcctta ctgactgcct aagtgcattt ttatttggtg
1800aacaaaaaaa accccacatt tcatgtttaa ctaaactagt ttattcaaga atacagttga
1860ttttttaaaa aatagttctg gaataaaaat aactattata cataggtatt ttaatttaat
1920attggctgta gatttttctc caagtagtgt ggcaaaatac tcaaatacca cttaattcaa
1980aatagttaac ctccaaaagg attcaaagat caacttctga caacttaatt aaatataact
2040gagactcatt tggctttctg ttatactccc aaaatgtgaa aaacaaaaat aaacactgac
2100aaaataaata cagccaagct atgaagagtt acagaatatg gatttcagaa tcaggctttt
2160gggttctggc acatacttgt cctatgcctc agtttcctca ctggaaaaac agaagggata
2220atagcaccca tcccaagggc agaggcataa atcaaggtaa agcattgcct gtaatgccta
2280gatagcaggg acagttcagg agaatcaggt tggtgatttc atttgtaaat tccctgccat
2340ttccttaatc tcacaactgt cagctgagga caatgcagaa gcaggaacat actttggtca
2400tcaatgaaaa ataaaatcta ctatgaaaaa ataaaatcta ttgtaaaaga aaataaccca
2460gaattaaaaa tacacccaag gtaagtagtc tatgcaggaa tctgattact ggcctatttg
2520aaaaagcctt tccccaaata tttttgttca tatatttaat gtcttctgtt agcattccca
2580ttaatccaag aagttaaact atatcaggta actttcctct cagttcactg ggtttggaag
2640tgggacagcg aattgctgag aaattgatag ctgaatagct gggcaattca aaaaatcatt
2700ataatcctgt tttgcaacca aatagggagc aagtaaataa gggatgatag caactacgat
2760ttgtatagca caaattatat ggcaggcact attttatata atttctctct tatacattat
2820tttacatttg aaacctctac atatcctgtg aggtacttgt attatcccca tttaacagat
2880cagaaaattg aggctcacag tggttatatt ttttcgccca aagtcacagt aagtggcaaa
2940accagaaaat gaatctggtt gtttttgttt ccaaagccct taaatagttt tttaaatatc
3000acagctctat gaaggccaca ttatattccc ttattgttag cccagatgat gctaggaaag
3060gagtccatac ggcaaatcct actctttact tatccaaact gcaatgtcaa tatctgactt
3120cttttcaaca atttacattc acactatatg atgtgtctca agtctgcctg tgaattaaca
3180atgtgcattt ctagcaccat ctagctagtg ttaacactcc attatgttaa taattaataa
3240taactgaaac attgggaaaa caaagcacaa caatactttc ccatgtgttg agtgtcactt
3300tatggattag gtatttttgg ttactggtat ctgcatgcat agttatgtca tgtatcacca
3360catataagtg ggtaaatgat cactgtcaca acatgctcta cataaacaac aacactgaat
3420aaaaaagacc tctgaggaac aggccaattt gaaactagga attctagcaa atgatataca
3480tgacatttgc tcttcttcca catcgtattg cactgggttt tatttttacc ttcggacttt
3540ttaatttcct cttcccataa ttacagatga gaaaataaaa tacatcctgt aaattcaccc
3600acttcaccac aaagtttgaa gactactaaa ataccttata attggatcaa atgtattcaa
3660gctggatcta aaaccctctg tattacctga ccatataacc actacccttg tgtttgtgtg
3720caacaatagc tcctacagta gatttttttt agggtaaaaa gtacacgctt gtagagttca
3780aaataactct ttatccctga cctaacctca aatcctacca cccggaagcc aaaaggatgt
3840gtataatggg ctgaactttt gggcaagggg ttaattctcc acataattgt actggggaac
3900aaatatcttt ggtcagaatg gaagtgagtt tatgctgggc tatagagata cgcaagttct
3960tcatacgcac ctattctata catgggctcc tggtgtttag aaccgcagtg gagctagagg
4020caagaccact aatgaactga actttaacct gggaataatg gacatatttc ttcattaagt
4080tactaaatgt aaatcttaaa aatgaagcta gagacaagta gttactgacc atactgaaaa
4140tgtgtcttaa aagtcaaggg aggaccactg cccttgtatt ataatgataa caaatgttgg
4200caaggacatg gagaaattgg aacccttgtt cactagtggt gggaatgtaa aatggtacat
4260ctgctacaga acacagtata actgttactc aaaaaaatta aacacagaat taccatatga
4320tccagcaatt ccacttctgg gtacataccg aaaacaactg aaggcagagt cttgaagagt
4380tatttgaata cccatgttca cagcagcatt attcacaatg gccaaaaggt agatgtgttg
4440atatatcaac agaagaatgt ggtatataca tacaatggaa tatgattcag ccttaaaagg
4500gatggacatt ctgacatatg ctgcaaaatg aaccttgagg gcataatgcc aagtgaaata
4560aatcagatac tgtatgattc cacttacatg aagtacctag agcagtcaaa ttcacagaga
4620cagaaggtgg aatggtagtt gccattccac caggggtttg ggagaaggga ctgaatgggg
4680agt
46833363430DNAHomo sapiens 336aggcacaacg tcaggttttc ctatggaagt ctctgtctcc
tactgactca tttttcatac 60tgtgtaaatg ctcaagaaga atcaaaagga caggtttttt
caatctctag gttaaattct 120actgtagtcc tcatcaatga gcttctaacc aaagcccaat
ttcatttcat accccaattt 180ttttatcttt ccaaagaagt gtctcctgga ggtcaaacac
ctcttttgtc atggtgtcta 240ttttctgctg catgcgctgc ttctcctgca gaggaagagg
ggaagagaag taataaaaga 300gcagaaagaa aagggagagg aggtttgagg gaggaaacaa
aaataaagcc gataaagaaa 360cttaaccaaa agggaaagtc tgtgatgaac aggaaaagca
aaattggtct gccaaaagaa 420aagatgacat tcacagtctt ggccacaaga ttcttattgg
cttgccccta caaaagtaag 480caaaggaacc aggaataatt gttccaacca cagctacgtg
gcagcaagcc agctagaatt 540tctgtgtaca tacagctcca tatgtatatt ctttctttga
taactgcctt tttaccaaac 600aagaacttac attcctagag agggaaattt aggtttgctt
atgaacaaat gatctttcat 660cttagagaac aagcagtttt gaattttatt ttttaagcag
aactgatcat tttgaatttc 720tgttagcaaa atctatgaca gcaagaacac catgaatttt
gtattatttt aaaattatat 780tattttgaaa catttaaatt tagcatttaa caatccttaa
atgacctttc taattaggca 840atggtgctta acaggttttc ttcttatgca ttattggtaa
attattatgt cctcctttcc 900ctactcatac attaggtact ttaccatgga attttcaatt
ccaaagacca aaaaacatta 960tttgtaatat ttaaagtttt tcagcataac catagatact
aacatctaaa agatgttcat 1020tctagatgta aaaaacatct aaaactatag ttctcaaagt
ttgtatacct agcaccctaa 1080gcttttaaag aagccacagt gatgaactat agaaatcaag
cattatattc ttcttaaatg 1140caattacaat taattactag aacactttac cagtcctaac
ttaagctatt gaatttgaga 1200agcagccccc aaagcaggtt tattatttta tgtggttggc
attttggcac aaaaagataa 1260aagaacaaaa agggaaagaa tttcacatta ttttaaaata
ccagcaggat acagattctg 1320gaaaatatgc ttcctacctt atatggagaa aaaccaagaa
aattaacttc acatgtaatc 1380tgatagatcc aaaaggttat ctgtatctgc acttgaaatc
cacaaattct gagtatgttc 1440aattattctt aatgatgaca aaaattaaca cgtcttcaaa
tttaaagtca tttctttttc 1500tctattaaat ggtttttaaa aatcatttgt agagagacat
attaagaggt aggtccgagg 1560ggaaagagag aaagagggag agaaaaagaa aggctaaggt
ctgagtagcc aggaatgtgg 1620acaagtgtgg ttgtgagatc tctctcctgg gatcattaac
aatctatgct tcctgacatc 1680tctggcgtgt caacactaac ttaacattag atgcctttga
tagccacacc tagatagtgg 1740gcaggatccc ccttcaaact tatttccata tttatctaaa
aacatcgtct caggagggaa 1800aaccacattt aaagaaaaaa gatgcatgca atgtagcagg
cctgcaagga tgactaatgt 1860tttcaaagag ttcttggtag actatgcttc attccattcc
taagatgttg ccagcaatgt 1920ggcagagtcc cttcgcttgc agaaacctga accttcagac
taaccattct ttaccttttt 1980gtacagaacg tatcttgatg tttcttcttt tttcatttag
ccacctgaga aatgtattta 2040cctgagtgaa aatcaaactt attccccaag aatcatgtcc
caaaagatgg cattcactaa 2100ttccaaagaa taatgttatt ctataatttt tccttttgcc
catttcctaa gatatctgta 2160ggaaacagtg tgcttaggaa taaaagacac aaaaatttct
gctaccaaag tggggtaatg 2220tttataggat ttatagtatt aatttttaag cataatctgg
tttatgtttg aaaatttgta 2280gtgtacagtc aaatataaag agacaaactc tgatgcatct
taactctcct tccctcccaa 2340cacatcctca tcccattcaa ctcatttttt ttcaaaatta
agtattccca cagttcatgt 2400acatacctca ataagctcat ctctttgccg caggccttct
ttaagttctt ccatcttatg 2460ctgcagcaca ctacacatat gtttctgcct ttctaactcc
tgttattaaa caaataatat 2520catttacaca ggtcatggca cacaagaaat ttgaacatac
acaatacaac acagaggtta 2580agtatgacct ccagaaacat gcccaaactc ctgattcata
gtaacttaga aaaattgtgt 2640attctataga aaagttaaga aaattttaaa attccatctt
gtataattat caggaaaacc 2700tgaactaatc aatggcaaaa ttattaaaaa caaaagataa
tttagtaaag taacaggtta 2760taaaatgaac atatacaatt caatgacatt catatacaaa
taaaattcaa agaaggaata 2820ataaatgcaa tatcaaaata aaatcaatat taaataaaaa
acatacatgt aaacttacaa 2880aatatatcaa aaacctatat gaggaaaatt atataaagca
ttcccaaaag acagagaaat 2940aggattgaat aaatggaaag gcataccgtc ttcttggatt
aaaagtctca caacattata 3000aaaatgccag ttctccctaa attaatctat acatttaatg
tagtacaaat aaaaatacca 3060tcaggttttt cttttatcat catcagagca agttgatttg
aaagaaaaac acaagaaaaa 3120gtagccagaa aaatacatac tgaaaaagaa gaaagccggc
cttattaggt attaaaacat 3180attataaagc ttctataatt aaaacaatgt tgttatggca
catgaatata gaccaaggga 3240gcagaataga gaattcagga aaaacccact taaatataca
aatatattta aaaacaataa 3300aaataagagc atctcaaatc aatgagaagg aaagactttt
aaattagtaa tgttgggata 3360actggatatc catttggaaa aagataaaat tggaactata
cctcatacca cacaccagga 3420caaattccaa
34303371323DNAHomo sapiens 337caccattgcc aacacttctg
catacagagc atgcttgggc tgcagaatgg gccctgatac 60ctttagttct ttaagcccct
gcatgtatct cccttctaca tcctgtatct ggtccttaag 120gtcatagata tcctgcagga
cataggaatg aaccattgca taaaaccatg cacaaacgta 180tcttaaatcg caaaggattc
agatgaaatg tgactcactt ttgtatattc cagactaaaa 240gcagagaaat caaagtacaa
aaacataact cccactccca accactgaaa agggcaaata 300ggtcagggga tagtgggact
gggggaaggt ctagagtaat caatctaatg ttaaatattt 360tcttggcatt aaattctgtt
aataacagtg tagcaaatgg ggacagggct atatatggag 420gaaaaagcta tataaattat
aatatttaaa atcatacaac ttttaatatt ttataaatca 480cttaaaattt ttttagcaca
atgcttcacc tagaactagt aaaatataca gtaaattaat 540aagaggtggc aaattttaat
gattcatgca aaagtttttt aaaaatataa taaatgaata 600tgaacaaagt tttctttcaa
tgacttggtt actggccaca attaacttga gagaaaggga 660gtaaagggag ggaagttaaa
cattttgaac agaatgtcaa atgagatatt ctatcctgag 720gataccattt aaatgatgag
aaaagcactt gctccaaatg ttactataat ccttataaga 780aaagtgaaac aggtcaaatt
ttaaatggaa aatactgttt ccctgtgtct gaccttgtta 840tataaggtct attctgaatg
ctcgatttat gtccgaaata actgcacagg gccctaaata 900caattctgca attacaagca
ggatcaatta ttaaaggctg attatacaca tttttggtat 960tatattttcc ctgcctcctt
cattgcctct ccagtaggtt tgactgtctt ttacttatcg 1020attcattcaa tacacattta
tgaaggacct atttaggagg cagatggtag gatacaaaaa 1080taacacttcc ttcaagaagg
tcattctctc aaggaagaaa aaacaagcaa caagtagtat 1140gacaaaagaa agctaagtct
aaggctggaa ggccttgtcc tctcatatcc ttttgtccca 1200ttagaatgca gtagctaaag
gcaagagttt atcttattca actttccacc ctggcattta 1260tgtctggtac atagtaagag
ctggtaagag ctcaattaat actgtcacct tcaaaccaat 1320ggc
13233382268DNAHomo sapiens
338cttagtcacc gcctgtcctc tacctcccct ccagtgaaga ggggacctcc agtccaggca
60agctcatcca tctgagtcct ggactctcct ctcacccacg caggtatttt gatctgtcag
120ttacccctct ctccttcctc aaccttcctc ctctctctac tggttcaacc caaccaaaga
180aaataataat cactctcacc tttaaaagaa agctaaaaac cttttcttaa atttctctct
240ccctaccaat tgcccaacca attcccagtc tctcgtattc actccagagc aagtttctca
300tattttctcc ccttgcagtt ttgacttgct cacctcatcc tcactgttga accacagggt
360tactgaactt ctggcctgca atgcttcctc tgactgtgat caaagttatt ttttaagaat
420gcaaagcaga tagtattgct ctcctattta atatcccaca gttgtcaggg taaaactgaa
480ttcctttcag gctagcatac aagatgcttt gtgatttggc ccctcactac ttttccagcc
540ttctctctta cacgctacta ttcttcactt ctcatgccac cctctggcaa ctttgctata
600caggtagctt tctgggttgt ctgaatgcat ctctttgagg aagccttccc ccttcaatta
660ctttccttgg ttaattttta tttttccttc aaactttagc tcagagtttg tcttttctga
720aaagcctttt gttacctctt ctcctagtcc aatttaaatg atgctttctt gaattacctt
780aacagcccca ttcttgctac taccattgta taacaccacc tggttttatt ggtttgtatc
840agtttgtctt ccctatgaaa aagggaaatc ctgaaggcca gaacatattt tgcctcttaa
900atctccaagg ccaagtgctc aataaatctt gaatgaacaa atgtatgcaa tttctctcac
960tatcccctgc ctcacaaaca gcaggttcag caaataaaat atatattttt taaagttctc
1020tttttccaag taaaggggtt aataaggatg gagacttaat gagtcaaata agcttttcat
1080tatctcaggt atttctgtta cccaaatatt attacacttc tattactatg aaaaactgag
1140agataactat attcaagcag ctgaatacca tcaaggctca aaagactaat aattcccaca
1200atctagttct ttgtacactc acatattttc catttaaaac atggccagtt gtttctgaga
1260gaattacttt tacatacatt acaaagtggg ttttttttgg catttgtaaa ttagcagaaa
1320ttccaacacc attaaattta caggaaagag gacagaaaaa agaactatct aatctccttc
1380ctttcaagct ctgaaacact tgagtcacac ttttgtacag tacttatttt ttaaaaagaa
1440atcatattcc ttctttctac taagtcacat taaaggttaa aagttccagt atattaccaa
1500gtcaaaaaac tgcttttaaa aagaaaacca aaaacttcag tttacccgca attcacttaa
1560tgaagtgtct ggatctatta agctgctggt gtccccactt cctcgtctgg atgagtttcc
1620acttagaggg gttgttgctg aggcagaatt tcgagatgaa ggctagaaag agaaatatga
1680cccttctaag tctcatttct ttaaaaatat gtttgctgtg gaaactaaaa acagggcagg
1740gtggttgggg gaagatgtgg gtttcagatg aagaagttac catgtgtaat caatggtcaa
1800tttagattgg ctttgcaggt atccaggtat acagagaagg gcccaagcat actaagcaaa
1860atttcatgac acaaatacct tatcaaaatc tataattcgt aataaccaaa aagaaaatcg
1920tttttatgag ggttattttc taagtttatc tttgctatcc tgtaaacaat acttaagtga
1980gcacatcaga ttaacttaaa gcagttttag aattttataa ctgatatgct aatcagcact
2040tccccttttt aatttctgaa ctgcttatca atctcttctt tctctaatac tctgttgagt
2100tgataaaggt gaagatacta tttgcagcag agagtagcaa cactggatat cactatgaat
2160gtagactgtt cttcccaaaa ttcatttaaa cgtgagatta tgctattgat taataaactg
2220catgaatctg tgctgtccaa tatacttgcg attagccaca tgcagcta
22683392183DNAHomo sapiens 339tgtggctcgc attacatttc tactggacag agttaattta
tctttcttgg gtatagcata 60aaattaatct tgataaatca tctccctgac tttctaataa
tactttcaga agaaagagac 120tttaacacaa agcaaattct agaaaggagt tattttactt
caagaaacaa tttatatata 180taaaaggaaa aaatatctca gagtttgtca ccaccattcc
taaagaaact tcattttcag 240tgtttactgg cttcttctca ggatcaattt ctagaggaat
gaaaatcaaa agagcagcca 300aaagctaaag aggtgacatt caaggaaata aaatgcagtg
cctcttctct tcccagggcc 360tcacatgatt taactccgaa ggaattttgg agggtcaagg
acaggaaata gatgaaggga 420ggaagagcag ctacatcaga ccaattgttg taaaataaat
aatgaacaag ttacagatag 480caagtttaat tgaccttgag aatttccctt aggaatttac
atactcactc ttgtataatt 540ttcagcatac tgtttgtcag atttttcatc caactagaaa
agaacaaaga taaaaatcag 600taaagatgct tagcaagttg actttttttg ggaggaaaaa
agtgatcttg agttttccaa 660gttcaatgat gtttgaagcc taaaaataaa catattaaaa
atgaaattaa aattccagtt 720taacataaga taaaatgtca gcaactttaa tttttatcct
aagggagaca ctgagtaagc 780aatctatgtt tactcaaaag tttagaactt ttgggtaaat
tttctgttta aggtatttaa 840aacacattaa agaaggctac atataaagtt ttagcactgc
aaatgttaat acaaaaaaag 900cttgtacata tcaaaaagaa aagtctgata aaaccaaaag
cgaaaaactt ttatttctaa 960acattattag gacacaggac acacagataa agactagtac
actcagccaa aaaagttcag 1020attagattaa agcatgttaa atatgtcacc aaaatatttt
cccccacatc aaggattttt 1080tgttttctca gtaatacatg cacatgacca aaaaaaaaaa
aaaaatcaaa cagcacaaaa 1140gagtatgagt agggctctct ccctaccatg ccataccctg
accaccagtc tctaatagca 1200accaaccaca aagtctttaa aaaaattcta gtaagtgtcc
catatatgtt ctacatcttc 1260tttttcccct taccaagtct cagagatttg agaaattact
tcatctttct tcatagcatt 1320tatcactccc tgacactacg gtatttatcc atttatttat
tgtctgctct ctttctataa 1380tgtaagctct atatcagtat gttcttttct gtactcctag
tatcaagaat agtgtctggc 1440atagagaagg gctcaaaatt tgttggatga atgaataaat
cataccccat ctatactaat 1500agacttccct catttatttt aaacagcttc atagtagcct
gttatctcaa cttgccacaa 1560ttaaattaac ctatcttctg tagataggcc ttaaggttgt
ttcagacagt atttcaagaa 1620gggaggtacc ttaaaagtct agtccaaacc ctgagtttta
tagctaagga aaaagaaatt 1680cagaggtgac agattttgct gaaggttgca gagctgaatc
caacccttag ttttgtaaat 1740tttcaattca gtgttcatat tgtttacctg aatatatatg
ttattccaat tatcaccagg 1800agaaaaaagg cacataggga tgtaactaaa tgaaaataac
tccaactcct acacaaacct 1860tctagaccca atctagagaa gagatctgaa aaccttctca
gttaattatt aaaacatgaa 1920acatcatttc tggctctact cagaacactt caccctaact
tccacatatc tgaaagtttt 1980tagtgttctt tttattaccc acacttatgt ttataaatat
tctttattat gatgatagta 2040ttacatcagt aatataaaca ctatttttag acatttaact
ctttgttact tcttgctgta 2100aaaactctga aaaagaaact aattttagtt cattttaatt
tttaaaacaa catataaatg 2160tttcaaagca ggtaatgaca gcg
2183340892DNAHomo sapiens 340tgacctccaa aatcatccag
tttttaatag actgtccatg gaataatctt atatttcaga 60ggttgcatca ctgaactaat
ccatcacaat gttgactcta ttgctttcac catcttctga 120gagatcccaa ttgtcgcaac
ccaaactgat catgcaccaa acctcggaga ttgtcaagcc 180agcaaaaatt acccaaagca
accatgccat agttctctta ttttggctac actaacttgg 240caagttctca cttcttagac
ttgtactatc ttgtgtgcta tcttccattt agacaaaaac 300ctagctgaaa cttggtattc
taaaagccca ataggatcca acagccaaga gttctacatg 360gctgcaatgc aataacaagc
taatgttcat aatgatgatc cagagtctgc cagatccaga 420aaaaagcttt ttaaacagaa
taaaatttaa tcatatttaa tatattaagc tgtaaaactg 480tagtttaata aagtatttac
agtgtagttt cttcaatgca gtcattggat cttgatcttt 540tgttaatttc taatttatat
atatactaac ttgataatgc tactcaaaaa ttgtttgaaa 600aaatatctga tttccacaac
cacagaaggg aagacatcaa tcaattttaa caatttccta 660aagcaaaact ggctaaggat
tccatttaga aatgggttta attattaaca atcagcaaaa 720tttcagtttt gcttaaaaaa
catttacatt tgtttgcttt attgtagcct gacttacaaa 780acagaaataa agccaatgta
gaagaaaata agaggctaga acccccaaaa atattatata 840ggcttgcaga aacatgacct
ggcacccttt ctcagcacct cctagctcag aa 892341433DNAHomo sapiens
341ccagatttgt aaatccctgt tctagagaac tttttcctta tgttaaagag ttacagttaa
60ctagtagggg cagggggaga ttttttaaac tgtcagcact ttcacttata tgtaagtact
120aaagtaaatc atcacaaaat aaatataagg aagacataac tattaattca gatattataa
180attaagaggc tgacagaatt cagtttttaa gagaacagat gtaaacagta agtttaattg
240tccatactta tgggggaaaa acggggattg ctaggcaatt taatataaag ggcaatattt
300caggggaatg acttggaaac cagtcttcta actttgtatg cagagagagt tattgattag
360acccacacca agatgatact gttgcatagc cagatttgag aaccaatgat ctggaatgct
420taagaaccac aca
4333422961DNAHomo sapiens 342tcagtccatc agcctcctcc agcttccttg tacctcctct
ccaatctata cctcactgaa 60tatcacctta cccacttgcc tgccaggagc ttcggtgctg
ttaagaaaaa ttattggctg 120ggcacagcgg ctcatgcctg taatcccaac acgttgggag
gccgaggcag gcggatcacc 180tgaggtcagg cattcaagac cagcctggcc aacatgatga
aaccctgtct ctactaaaag 240tacgaaaaat tagccgggct tggtggcaga tgcctgtaat
cccagctact tgagaggctg 300aggcaggaga atcgcttgaa cccaggaggc ggaggttgca
gtgagccaag atcgtaccac 360tgcactgcag cctgggagac aagagaaaaa ctgtgttaaa
aaaaaaaaaa aaaagagaga 420gagtccgggc acagtggctc atgcctgtaa tcccagcact
ttgggaggcc aaggtgggtg 480gatcacctaa ggtcaggagc tcaagaccag cttgaccagc
atggagaaac cccgtctcta 540ctaaaaaaat acaaaattag ccaggtgtgc tggtggtcgc
ctgtagtccc agatgctcag 600gaggctgagg caggagaatt gcttgaacct gggaggtgga
ggttgtggtg agccgagatc 660gtgccattgc actccaggtt gggcaagaag agtgtgaaac
tctgtctcaa aaaaaaaaaa 720aaaaaaatgt atccaaaact tggaaacaag aaaagaggat
gactgcatgt cctttcaaag 780ttgtttaatg tggtttacag gctcactcaa ggattattta
attatttaga agaataaatc 840ctcttagaac cctgggtttt taggccaaag tctcacctat
ttattttctt tttttttctt 900ttcttttttt tttttttttt gaggtggaat ctcgctctgt
tgcccaggct ggagtgcagt 960ggcacaatct cagctcactg caacctctgc ctcccgggtt
caagcaattc tcctgcctca 1020gcctccggag tagctgggac tacaggcaca tgccaccatg
cttggctaat gtttctattt 1080ttagtagata cagggtttca ccatgttggc caggatggtc
tcaatctctt gacctcgtga 1140tccaaccatc tcgacctacc aaagtgctgg gattacaggc
gtgaggcact gcgcctggcc 1200agcctcacct atttcattat attcccttaa atgtacagtc
atgggtcact tcacaatggg 1260gatacattct gagaaatgtg ttgttaggtg attttgtggt
tgtgcaagca tcatacagtg 1320tgcttaccca aacttggatg atgtagcctg ctatacaact
aggctatatg gtgtagctta 1380ttgttcctag gctataacct gcataacatg ttactgtact
aagtactgta gacagttgta 1440gcacaatggt aagtatttgt gtatctaaac atacttcaac
agaaaaggta cagtaaaaat 1500atggtataac agattaaaaa tggtacatcc atatagggaa
gttaccatga atggaacttg 1560caggactgga agtgagtgag tgagtgggtg agtgaatgtg
aagtcctagg atatgatcat 1620acactactat agactttata aatgctgtac acttaggcta
catgaaatta aaaatatatc 1680tttttcttgg acgggcatgg tagctcatgt ctgtaatccc
agcactttgg gaggccgagg 1740tgggtgggtc acttgagatc agcaattcaa gaccagcctg
gccaacatgg tgaaaccctg 1800tctctaccaa aatatacaaa aattagccac atgtgatggc
gtgtgtatgt agtcccagct 1860actcgggagg ctgaggtagg agaatcgctt gaacctggga
ggcagaggtt gcagtgagca 1920gagatagcac cattgcaccc cagcctgggc aacagagtga
gaccctgtct caaaataaat 1980aaacaaataa aaataaaaag atgtccagtg cctaatctaa
tcaacattga ttggaaaggc 2040agatgattaa gcacaaataa agcctgttta gtcaagaaat
ttactattgg atacataaga 2100tgaatctata tataaaaatg aaaaggcaat gttagagcta
tgtaggaatg aatgcaaatt 2160aataaaatgt cattggagta agaacagaac ttactgacaa
gacaagattt ccattgaaat 2220cttagtgtgt gtggcttttt ttttttttag atggagtctt
gctccgttgc ccagtctgaa 2280gtgcaacggc tcgatctctg ctcactgcaa cctccacctc
ccgggcttaa gcaattctcc 2340tgcctcagcc tcctgagtag ctgggattac aggcacccat
cactacgtct ggctaatttt 2400tgtattttta gtagagacgg gttttcacca tgttgcccag
actggtctta aactcctgac 2460ttcaaatgat ccaccagcct cagcctccca aagtgctggg
attacaggca tgagccactg 2520tgcccagcca tcctagtgat ttttttaatc tatatatata
tatatatata tatatatttt 2580tttttttttg agacaggtct cgctctatca cccaggctga
agtacaatgg cacaatcatg 2640gctcattgca gcctcgatct cccaagctaa agcgatcctt
ccacctcagc gtcccaagta 2700gctgaggcta caggtgtgtg ccaccatacc catttagttt
ttttaaattt atttttcttt 2760tgtagagaca gggtctcacc atgttaccca ggttggtctt
gaactcctgg gctcaagcga 2820tcctccaagg cctcccacct cagcctccca aagttctagg
attataggta tgagccactg 2880tggcccctct agatttttaa atgatgttat tatttgcagt
aattcttcaa ttcataggaa 2940cttaatctca gggcacagca a
29613432915DNAHomo sapiens 343acttaatctc agggcacagc
aagagacaga acttcaatgc attttttttt tttttttttt 60tttgagacag agtctcaccc
tgtcgcccag gctggagtgc aatggtgcga tctcagctca 120ctgcaagctc cgcctcctgg
gttcacactg ttctcctgcc tcagcctccc gagtagctgg 180gactacaggc atctgccacc
acgaccagct aatttttttt ttgtattttt ggtagagaca 240gggtttcgcc ctgttagcaa
gaatggtctt gatctcctga cctccttcgt gatccaccca 300cctcggcctc ccaaagtgct
gggattacag gcatgagcca ccacgcctgg caatgcattt 360tttaaataac cattttttcc
cataccattg aatgaattat ccatctcttt ccaggggaat 420gaaattcccc tgtaaagatg
agcccttgac tcacacctct aatcccagca ctttgggagg 480ctaaggcggg cggatcactg
gaggccagga gtttgagacc atcctggcca acatggtgga 540aacctgtctc tgctaaaaat
gcaaaaagca gtcaggcata gtgcatgcac ctgtagtctc 600agctacccgg gagtctgagg
cactagaatc acttgaacct gggaggtaga ggttgcagtg 660agccaagaca gcactactgc
actccaatct aggtgtggag agacactctg tctcattaaa 720aaaaaaaaaa aaaaaagatg
agccctcaat tacaaacttc ttttgggatc aatatcaatc 780agaagttatt aagtgctata
gtttgtctga tgcagaagta aacatttaaa gttttgacat 840aaactttagg gttggcaaga
gcattaagtg agttaatgca tacatgttag ctattacatc 900acaaatcact gaaatgttgt
agtttaatgt caaattatta caagttgcta aaatagactt 960gcatgggaat ctaaagtaca
gtaaaaataa tgcttaatta ttagccaaag tgctctctca 1020gctaaaatgt ttactcattg
gtctgccatg aatgctttca aataacaatc attctttttg 1080gtgttcagga aaagatcagt
atccagactt aaatttggga gctctgaaag gaaagcaatg 1140aactttccct ccaacacttt
tggatgtttt tatgtactcc ctcaacccca tgggccccca 1200cggtaccctt atgatacttt
tgggaaccat gatgtctctt tcttctgaaa agctcaatgc 1260tgcactctgg tactattgcc
tgatattaat atgtagttat ttatttttta actttatgtg 1320tatgtctgat ccctcccaac
tgggatggaa gcttctcaag aacaggatct gacttggcac 1380agtggctcac gcctgtcatc
ccagcacttt ggaaggctta ggcaggagga tcacttgcac 1440tcagcagttt gagatctgcc
tggacaacat gacggaacca cgtctccaca aaaaaacaca 1500aaaattagct gggtgtggta
gcgcttgcct gtaatcccag ctactcagga ggctgaggtg 1560ggaagattgg ttaagcctgg
gaggttgagg ctgtagggag ccatgattgt gccattgcac 1620ttccagcctg ggcatgactc
tgactaaaaa aaaaaaaaaa aagtaaataa gttacaaatt 1680aataccatac gagcctgtgc
ttcattacag tgattagaaa ggtcctaaaa ggctctgatg 1740cctgaaatat gtctctcaaa
gacatgcttc tgtgccttag agcctccact attgcttact 1800tcttttattt ttatttacgt
atgtatgtat gtatgtatgt atgtatttat ttatttttga 1860gacagaatct tgctcttgtt
gcccaggctg gagtgcagtg gcgtgatctc agctcactgc 1920aacctctgcc tcccaggttc
aagcaattct cctgcttcag cctcccaagt agctgggatt 1980acaggtgtcc gccaccatgc
gtggctaatt tttttgtatt tttagtagag atgggttttc 2040accatgttgg ccaggctggt
cttaaactcc tgacctcagg tgattgccca cctcggcctc 2100ccaaagtgct gggattacag
gtgtgagcca ccgtgcccgg ccatgtattt atttttttga 2160gacagggtct cgctctcttg
cccaggctgg agtgcagtgg tgtgattatg gttcatggcg 2220gcctcagcct cctaggctca
agagatcctc ctacctcagc ctcctgagta gctgggacca 2280caggcaccac cacattagcc
accacgcctg gatgattttt tatttttatt ttttgagaca 2340gagttttgct cttgttgccc
aggctggagt gcaatggcgt gatcttggct cactgcaacc 2400tccacctcct gggttcaagc
gattctcctg cctcagccac cctagtagct gagattacag 2460gcatgtgcca ccatgcccag
ctaattttgt atttttaata gagatggggt ttctccatgt 2520tggtcaggct ggtcttaaac
tcccgacctc aggtgatttg cccacctcgg cctcccaaag 2580tgctgggatt ataggcgtga
gccactgctc ttggctgatt tttgtatttt ttgtagagat 2640ggggttccac cgtgttgccc
gggcttcgct tgcttttttt agataaaatg ttgtctccag 2700gccagatgtg gtggctcaca
cttgtaatcc cagcgttttg agaggtcgag gggggaggat 2760cacttgagcc taggagtttg
agaccagcct gggcagtata gtgagacccc tgtttctaca 2820aaaaataaaa aattagccag
gcgtggtgtt tcgtaccagc tacttgggag gctgaggcag 2880gaggattgct tgagcccaag
aggctgaagc tgcag 29153443182DNAHomo sapiens
344cgtggtgttt cgtaccagct acttgggagg ctgaggcagg aggattgctt gagcccaaga
60ggctgaagct gcagtgagct gtgagtgtgc cactggactc cggcctgggc aagagagtga
120gactctgtct caaaaacaaa caaacgtggc cgggtgcggt ggctcatacc tgtaatcccg
180atactttggg atgctgaggc gggcggatca cttgaggcca ggagttcaag accagcccgg
240ccaacatggc gaaactccat ctctactaaa aattcaaaaa tcagccagat gtggtggtgt
300gtgcctgtag tcccagctac tcgggaggct gaggcacgag aatcacttga acccaatagg
360tgggggttgc tgtgagccaa gatcatacga atgtattcca gcctgggtga taaggcaaga
420tcttgtctca aaacaaaaaa caataacaac aacaacgaaa acacaaagaa caaaacaaaa
480ccaaagaaac acaaactttg tctccagaag gcctctatta gaatctaaat acctaacctt
540cgaggtgtaa ctcactagca cgttgtctct ctaacagttt cctagcagac agttcaggtc
600taggattgta tccagggaca gagctagaga agccggagcc ccactgtggg gatgctgatg
660aggcagaccc ctcagtgagg ccagtgaaca gatgagtcca ctgggctggg cacctgtgag
720atggggcaga ggaacaccca gataggttaa agggcatctt gacacaacca gagtttatct
780gtagcatagt cttcacaaac caagccagaa cccaagccag agccgcatga gagtgaattc
840ccatctggct ttgggggaca aatgactcat ccaaggctac actcagcgct gagtggtgac
900tgggagccag tgcgctctgc tgactgctcc actttcagaa atacttgcag atctcaatta
960tctaattgca attgcaacga gaaccaaagc aggggagcag agacaaacaa tttctgaggt
1020aaccagatgg ctttattaac tcaagttctc acctaaaatt gccctcaaga atcctgtggg
1080aatgggttgc agtggtgtgg ccctggattc acaaccgaca gagcttctga attctgagtg
1140atctgtacac aaacacacct ctgcctgggt tacacggtaa gggcctcatg tacataatcg
1200cagcatgctt tcctagaaat cgcttggtag cgtgatgggt gggattcaga agtcagcagg
1260aacccaaagt gagtggagag gtcatggcca tgagtcagag gcctctatcc ttcagcagcc
1320tccaacagga agcagacagg gaaggttcct atagttacaa gggcttggct ggtttattac
1380tttcattcta atgggcgttt ttataagaca taagcaaagt acgaaatatt ttatagccat
1440tcggagagga agtccgccac acatttcaaa gaatgaatgc cctctgtaag gataagcagc
1500taactacaag cttcttttgg aattgactag aagttattaa gtaccaaagc ttatctgata
1560ctcaaataaa tatttctcta agttttgact cttgagaggt aaacttcaag gttgacagca
1620ccgaggctat gtggtatact aaaagggtgt gggatttaga gtcccacaga cctaggatca
1680agatttatgg ccaccattca ccaacaccaa caataggatc ttgggtttta cttaattttg
1740gttaactagt tttctcattt gtaaaataaa aataaatgat acctagctca cagtttctgt
1800gaagattaag tgagataata tgaaagaaat cacattgtac ttaccaaatg tttcgtgttc
1860ttttctcttc cttcatatgt gttaagctga attaacaaac tactaagtaa gatttctttt
1920tttttttttt ttcccgagac agggtcttgc tctgccgccc aggctggagt gcagtgacat
1980gatcatggct cactacagcc ttgaccttca cctcccaggt gcaagccatc ctctcgcctc
2040agcctctcat gtagctggga ccacaggcat aaaccaccat gcctggccaa tttttttaat
2100ttttagtaaa gacagggtct cactgtgttg cccaggctgg cctcaaattc ctgggctctc
2160caatcacatt tgggattagg taaaaaatta aaaacaaaaa aaaataaaaa aaaacttctg
2220ggctcaagtg ttcctcccac atcagcctcc caaagtgctg gaattacagg gatgagtcat
2280aatgcctggc ctaaaccact tactcttttt tttttttttt ttgagacgga gttttgctct
2340tgttgcccag actggagtgc aacggcacaa tcttggctca ctgcaacctc tgccttccta
2400gttcaagtga ttctcctgcc tcagcctcct gagtagctgg gattacaggt gtgtgccacc
2460acggccagct aattttgtat tttcagtaga gacggggttt ctccatgttg gtaaggctgg
2520tctcgactcc tgacctcagg taatccgcct gcctaggcct cccaaagtgc tgggattaca
2580ggcatgagcc accatgccca gcctttttgt tgttgttgtt gtttttgaga cagggtcttg
2640ctgtgttgcc caggctggag tgcagtggta cgaacttggc tcactgcaac ctcttcctcc
2700caggttcaag ccattctcct gcttcagcct tcccagtagc tgggactaca tgtgggcacc
2760accgcacctg actaattttt gtgtttttag tagagacagg gttttaccat gttggccagg
2820ctggtctcaa acttcttctt tctttctttc tttctttttt tttttttcct tgagacagag
2880tctcactctg tcacccaggc tggagtgcag tggcgtgatc tcggctcact gcaacctcca
2940cctcctggga tcaagcaatt cttctgcctc agcctcccga gtagctggga ctatgggcgc
3000acgccaccac atccagctaa tttttgtatt tttagtagag atggggtttc accatattgg
3060ctaggctggt cttgaacttc tgacctcaag tgatccaccc gcctcagcct cccaaagtgc
3120tgggattaca ggcatgagcc actgagccca gccctactaa ggaagatttc tggccagtag
3180cc
31823452974DNAHomo sapiens 345cccagcccta ctaaggaaga tttctggcca gtagccaaat
aggactttga aaatctttaa 60gaataggaag aatctgaaaa ataatcttca aaaaagaaag
cagcatgttt catgaaaatg 120tgtaattatg tataactggt agtggccagc caatgctaat
tctactaatt ctgtgtacta 180gagttatcta tgtggatatc tagaaactcc tcaagggaat
gtgtgaggat ggagaattca 240tcttgttgac catgaatctc tacagaacta agtacagagc
cttatagttt gataattcat 300tgaaacagaa tcattttata tccccttcgc actaactggt
tctgaaatac cattctcctt 360gggtaatttt gtttttttgt tttttgtttg tttgagacag
tctcactccg ttgcccaggc 420tggagtacag tggtgggcac tatatcgtct cactgcaacc
tccacctcct gggttcaagt 480gatgctcctg cctcagccaa ccgagtagct ggaattacag
gcatgtgcca gcacgcccgg 540ccaatttttg tttttttagt acagacgaag tttcaccatg
ctgggcaagc tggtcttgaa 600ctcctggcct caagagatct gattgctttg acctcccaaa
gtgcaaggat tacaggcatg 660agccactgtg cctggcctct tcaggtaatt ttggatcccc
taaaggctca ctcacaggcc 720ggctctcaca tttttgccca cactttatgt tcaaaacatg
tatcagtggt tacctatgct 780ttcggacaga atattcctac aagagtgagc cagcttgcac
cacagacaag ccaaactatg 840cctgtgtcct tatctatcgc tgcataatcc caatggttag
tgatctccat tccacggacc 900ccgtgctgtc tcatacaaag catttcggac ttaaggatag
aagcaaactg ccatgtcctc 960tatgccatga tgcttactaa tcctttacca ctctgagatt
ttcttggagc tatttatagc 1020tgattttcct gggctgactt tcgaccaaag aggagatgga
aactttgttc ttaacagtgc 1080tccaactgtg tgattcaact tggctgcatt ccagcaagtt
ctgtgagttg ttaatggagg 1140tgagaaagga gtggggtggg gagtcacagg gatgctaact
gtagatctgc tttttctctt 1200ttttttaaat gtttgttttt agagacaggg tcttgctctg
ttgctgagcc tggagtgtag 1260tggcataatc atggttcgtt gaagcctcaa actcctgggc
taaaacgatc ctcccacctc 1320agcctctcaa gtagctggaa ctacaggtat gcatcaccag
gcctggctaa ttaaaaaaaa 1380aaaatttata gagacagggg tcttgctatg ttcctcaggc
tggtctcaac tcctgtcctc 1440aagcaatcct ctgaccttag cctcccaaag tgctgcaatt
tcagttgtaa gccaccatgc 1500ccagccctgc agatttgctt tttttttttt ttttttttga
gacggagttt cgctgttgtt 1560gtctaggctg gaatgcaatg gtgcgatctc tgctcaccgc
aacctctgcc tctggggttc 1620aagtgattct cctgccttag cctcctgagc agctgggatt
acaggcatgc accaccacgc 1680tcggctaatt ttgtgttttt agtagagacg gggtttctcc
atgttggtca ggctggtctt 1740gaactctcaa cctcaggtga tctgcccacc tcagcctccc
aaagtgctgg gattgcaggc 1800gtgagtcaca gcgcccagcc tagatttgct ttctatagga
ctttatattg tcatcctcat 1860caccactatt ttaacaagct gctagtttac ctagtaaatc
ctacatgaaa tagaaatgtg 1920gtcattattg gctggtgcag tggctcacgc ctgtagtccc
agcactttag gaggccgaag 1980cgggtcgatc acaaggtcag gagttcgaga ccagcctggc
caacatggtg aaacctcgtc 2040tctactaaaa atacaaaaat tagccaggtg tggtggtgcg
cacctgcaat cccagctact 2100ggggaggctg aggcaggaga attgcttgaa cccaggaggc
agaggttgca gtgagctgag 2160atcgcgccac tgcactccag cctgggggac agagcaagac
tctgtctgcg tgggggggaa 2220aaggaagaag tttgagacca gcctggacaa catggtgaaa
tgctgtccct gctaaaaata 2280caaaaattag ccaggcgtag gccgggtgcg gtggctcaca
cctgtaatcc cagcactttg 2340ggaggccaag gcaggcggat cacaaggtca ggagattgag
accatcttgg ctaacactgt 2400gaaacgccgt ctctactaaa aatacaaaaa aattagccag
gtgtagtggc gggcgcctgt 2460agtcccagct gctggggagg ctgaggcagg agaatggcgt
gaacccagga ggcagagctt 2520gcagtgagcc aagatcatgc cactgcactc cagcctgggc
aacagagcga gactgtctca 2580aaaaaaaaaa aaaaaaaaaa aaaaattagc caggcgtggt
ggctggcggc tgcaatccca 2640atcccagcta cttgggaggc tgaggcagga gaatcacttg
aacccaggag gcagaggctg 2700cagtgagcca cgatcacacc actgcgctcc agcctgggtg
acagagcaag actccatctc 2760aaaaaaaaaa aatgtggtta ttactttatc tattcacaac
acttccctac agactcctgg 2820agttcacctt ctttccgtaa acagggaacc aaccaacaga
cacgacatat cctccctctc 2880ccactactct atccacattc ttggtttcct tttttctttc
acttccttct ggaacttgag 2940agcttgtttg gaggttctag caggggagca cagc
29743463199DNAHomo sapiens 346tcgtcctctt cgacctagca
tgcagctttg ggagggacgc acatggagcg gtgagagagg 60aaggagacac ctacctatcc
agccagatca gctgaatcaa ccctggcgat caatggggtg 120acagatgtcg taggaacctt
atcaatctgg gtattctgag tcagtttcgt gtacagtgat 180gatgatgatt atgtatagct
cagccagact atgacacttg acaactccct catcctgagt 240aggagtacaa ataaaattaa
gtttgtgaca tttagttcat tctttttttt ttttttgagg 300tggagtctgg ctctgtcacc
caggctggag tacagtggtg caatctcagc tcactgcaac 360ctctgcctgc tgggttcaaa
tgattctcct gcctagcctc ccaagtacct gggattacag 420gcacacacca ctatgcccgg
ctaatttttt ttgtattttt agtagagacg gggttttgcc 480atgttggcca ggctggtcaa
gtgatccaac tgtctaggtc tcccaaagtg ctgggattac 540aggcatgagc caccacgcca
ggcccgttta gttcattctt actacacacc ttgattttcc 600atgaacatct caggaatcgg
aacatacaga taattccaga aaggagagga atctgtgtat 660ttttcttctt ttgtttcctt
attatgcctt gtgagaggcc aatgcatgag tttttaacta 720ggtccatgag aacccacaga
gacagcctcg tttgacccag tctggttatc agaagaggga 780agttccttat aattgtgtat
gtatacctgg ttggttcaca gatgtcctta aacatgagaa 840cgactatgtc tgaaaaaaac
tctcaagttt caccggggct gttgcacacc ctataaatga 900cccatcataa agacctcacc
cctctctgat aggataaggc aaaggttaag gtccatcctg 960ttagccacac tctattttcc
ttctagctag gccagaacat aatatctgga accaactgtt 1020ctctctctca gctggctgta
agaatgctgt atgctttttt tttttttttt ttttgagaca 1080ggctcttgct ctgtcgccca
ggctggagtg cagtggtgtg atctcggctc actgcagcct 1140ctgcctcctg ggttcaaacg
attctcctgc cccagcctcc tgagtagctg ggattacagg 1200cacacgccac catgccaggc
taatttttat atttttagta gagacagggt ttcaccatgt 1260tggccaggct ggtctcgaac
tcccgacttc aggtgatccg cccccctcgg cctcccaaag 1320ggctggggtt acaggctgta
tgctttttat agtgttgggt ggttaagtct tacacaaagt 1380aaatgcccag taaatactta
ttactggtca tgactcaacc attcaggttg ttactaagct 1440aagaccagtc accccatagt
ccctgccata ccatatgctc ccagagagag cacttctggc 1500cctccctatg atggctgcca
ccaccactac tttgtgggga agaatagtca tcctgacggt 1560tagtcatccc taacctttgg
actaactatt cacaattcag tttaggctga tttctctttg 1620caccttatat tcctatgtgc
ctcagtcact agaagaataa gccttctaga tcatccaaca 1680tggatagatc atcaacagtg
gatactatcc cagtaccctg agtccactgc taatctgatc 1740aagcccctct ccctctcctt
cccaaattct tcaatgtgcc tttgcaactc cagatctgtc 1800gccatcaaat gtctttgtag
cctcgtcctc ttctttgaat gttcccttca ccacttggca 1860ataaatgaac ctggctgtcc
ctgagcagcc catctcctga gcagtcctct gaggtagaag 1920ctgctttact tttcccctga
catttcaggc tcctaagggc cagggggtat agtaggtttc 1980ctacttgcca tttccaaact
gttccttgcc tctcctcctt cagacacgca gcttctttga 2040agcctctctg atgacctcct
aaccttccag ctcacttact caagaactcc cactgtctca 2100gttcttcaac tgtatctgac
accatttctc tcttctctta tcttcacttc ccaacctcac 2160ttaagttcca gggcccagca
tttttattcc acatttgcaa atactgtcca caacaaactt 2220atccttctct cttctatcgt
attttatttc taaacagagt ctcgctctac aatagtcact 2280tttaaaatta tttattagcc
aggcgaggtg gttcatgcct gtaatcccag cattttggga 2340ggccaaagcg ggcagatcac
ctgaggtcag gagtttgaga ccagcctggc caacattgcg 2400aaaccccatc tctaccaaaa
atacaacaat tagccaggtg tggtggcacg tgcctataat 2460cccagctact ctggaggctg
aggcaggaga attgcttgaa cctgggaggc ggaggttaca 2520ctgagctgag atcacgccac
tgcactccag cctgggcaac agagcgggac tctgtctcaa 2580aaaaaaagac taccattcag
accattcagc aaactccagt gcccaggcgg cctggtcctc 2640catctcaccc ttcgctccat
cttgccctta gcctgagcaa ccctgcaccc tgctccttct 2700ccccctggtc atttgcggtc
acagtgcacc aagagaagag gacgccacct tcctggtctc 2760atccctactc aggtgtgcac
cctttgctag ggcccgtgcc tccacccagg tcagagcttg 2820gagattcacc ctcttgcttt
cacgtttaaa taagatgcaa gcaagggccg ggcgcagtgg 2880ctcactcctg taatcccagc
acattgggag gccgaggcgg gtggatcacg aggtcaggag 2940atcaagacca tcctggctaa
cacggtgaaa ctccgtctct actaaaaata caaaaaatta 3000gctgggcgtg gttgtgggcg
cctgtatatt cccagctact caggaggctg aggcaagaga 3060atggcgtgaa cccaggaggc
ggagcttgca gtgagccaag atcacgccac tgcactccag 3120cttgggcaac agagccagac
tccgtcccaa aaagaaaaaa aaaagatgca ggcaaaggct 3180gctgtagaat aggcgctgc
31993473054DNAHomo sapiens
347tcttcccaag agagccaaga tttcttcttt cctcttcttt cttttttttt tctttctaat
60ttcaaaggag tataattaaa ttgccaggta aaagctcaaa ggtctttttt atagtgttct
120ggaaggttct ctgcctgtgt ttgtatttcc tttagcctcc acgttcctct atccagttcc
180cgcacccttc cccccaggcc ccattcttca aggcttcaga gcagcgctcc tccggttaaa
240aggaagtctc agcacagaat cttcaaacct cctcggaggc caccaaagat ccctaacgcc
300gccatggaga cgaagcacct ggggcggggc ggagcggggc gcgcgggccc acacctgtgg
360agagggccgc gccccaactg cagcgccggg gctgggggag gggagcctac tcactccccc
420aactcccggg cggtgactca tcaacgagca ccagcggcca gaggtgagca gtcccgggaa
480ggggccgaga ggcggggccg ccaggtcggg caggtgtgcg ctccgccccg ccgcgcgcac
540agagcgctag tccttcggcg agcgagcacc ttcgacgcgg tccggggacc ccctcgtcgc
600tgtcctcccg acgcggaccc gcgtgcccca ggcctcgcgc tgcccggccg gctcctcgtg
660tcccactccc ggcgcacgcc ctcccgcgag tcccgggccc ctcccgcgcc cctcttctcg
720gcgcgcgcgc agcatggcgc ccccgcaggt cctcgcgttc gggcttctgc ttgccgcggc
780gacggcgact tttgccgcag ctcaggaagg tgaggcgcgg attggagcag agttgtggag
840ctgggctggg ctggggggca gcggcccccg gccctcggcc cccgaaacgg gcataatagg
900gaggggacca agaggccgcg ctttccagcg tggagaccgg acggtgcggc cgtgctccgg
960ctcaggccct ccgcgcggta ggaaacggcg agggccgtcc cggggagcag cctcacttcg
1020cagctttgct cgccttggta gggaaatggc cttgggcgga ggcgggggac aggcagggaa
1080cggagtggcc acgtccaggt ttcctgcggc caccgaaccg gtgcctcgcg ccctggcgca
1140cccacgtcct cggttcgggg tggacttggg gttccaaaac agccccagcc ggtggcggag
1200tctttacgac agggaccagc gggctcgccc ttgtccttgc agcgggcccc ggatgtgggc
1260ctcaggcggg gacaggcgcc cgcagggagg cctccagggc cgctatgcac ctgcgcgcgg
1320caggcggccc ggaccacaca gggcgtgtgg gtgttttccc ttttctaagg atcatatgag
1380taatgccagg cttattgtag ggaacgcaga aataataacc gtaaagagta aaaacatata
1440atcccagcat tttgagaatc ccataattag taattaggtg tatctttctt tctttttatt
1500tatttattta attttttgag actgagtctt gctctgtcgc ccaagctgga gtgcaatggc
1560gcgatctcgg ctcactgcaa ctttcgcctc ccgggttcaa gtgattctcc tgcctcagcc
1620tcctgagtag agtagctggg attacaggcg cgcgccacca ccccccgcta atttttgtat
1680tgttagtaga gacggggttt ctccatgttg gtcaggctgg tctctaactc ctgagctcgt
1740gatccgcccg cctcggcctc ccaaagtgct gtgattacag gcgtgagcca ccgtgcccgg
1800cctattttat ttttttattt gaaacagcct tgttctgtca cccaggctgg agtgcaatgg
1860caagatcttg actcattgta gactacgcct cccggcctca gaccatcctt ctgcgtcagc
1920ctttatgcct ggctaatttt tgtatttatt atttattatt attattatta tttttgagac
1980agagtttcgc tcttgttgcc caggctggag tacaacggcg cgatctcatc tcactgcaat
2040tcaggcgatt ctcctgcctc agcctcccga gtagctggga ctacaggcat gcaccaccac
2100ggtcagctaa tttgtatttt ttgtagagag gggtttcgcc atgttggcca ggctggtctc
2160gaactcctga cctcaggtga tccaccgacc ttggcctccc aaagtgctgg gattacagac
2220gtcagccaca gtgccagccg aatatttgta tttgtagaga cgacatctca ctatgttgcc
2280caggctggtc tcgaactcct gggctcaagt gatcactccg tctgggcctc ccagagtgct
2340gggattacag gcgtgcatca ccacacccgg ccttaaaaac aagatttaaa atggtgactg
2400gtatgttgca ccgttattca aatgttagac atgtagtttg atttcagttt ctcttaactg
2460tggaataaac aacttggctg ccgtctctct ctctctcttt tttttggaaa cagtgtctcc
2520gtctgtcgtt cagcctggag tgcagtggca catttacatg tcactgcgtc ctccatttcc
2580caggctcaag cgatgctctt acttggacct cccaaagtgc tgggattaca ggcatgagcc
2640accggtccgg catctcttgg tttatttgta agatggtgcc tagaagtgga gtggcgtttg
2700ccaaaggtct ctggaagggc ttttacactt tcaccaatgg agtggcctaa attcagtaat
2760tatactctca aagtaatgca gttttagtca actcatgttt ttctggcttc aatctgggac
2820tacgtactta atgttaaatt gctttaaagt ggtcatagct gctacaggtt tgtgctcaga
2880aagtctgcac ctgactggtc tgatttaaat tttacgcccc ttaggtatga acagtgtgtt
2940ttaaacaagt acaggatggg gctgcagaag atttaaacgc ttgagaacaa gtgctgtatt
3000ttcccctttt gtgaccccag tattgagttt agtgttgggc agattaaagg tggt
30543483179DNAHomo sapiens 348tgttgggcag attaaaggtg gttcatatcg actataactt
gaacagggaa aaattgaaat 60caacttaggg tacttgggat acgaaggatc aatataaaaa
ctctggtttg tcatgctagc 120tttttctttt ttttcctctt cagttgaact gaggagatag
tttttgtttt taatgattgt 180gctcttttaa ctagacaaaa ggaattagat agtcttgcct
attcgaagtt aaatgaactt 240ttgaggttgt taaggacaaa actattaaac tgacatcaat
aatacagaat gggctgctta 300gtatcacttt ccttatcagg tactaggatt taatttagtt
aggaaactca cttaaaggga 360ggactataac tgcagttgaa agtgtaattt ttccaagata
taaaattgtt taaagattga 420atatattcct gttaagcccc aaaggaaaca tccctcattt
aagaaaatgg ggtgggagag 480caagagaagg tgaggattca cagatcctag aattggaata
gttgattttt ttttgtaaaa 540gaggcggtga cagccgggca tggtggctca cgtctgtaat
cccagcactt taggaggccg 600aggtgggtgg gttacctgag gtcaggagtc ctagaccagc
ctgaccaaca tggtgaaaac 660ccgtctctac taaaaataga aaaaaaaagc cgggcgcggt
ggctgacacc tgtaatccca 720gcactttggt aggccgaggc ggacggatca tgaggtcagg
agtttgagat cagcctggcc 780attatgctga aaccccgtct ctactaaaaa tacaaaaatt
agccaggtgt ggtggcatac 840ccctgtagtc ccagctactt gggaggctga ggcaggagaa
tcgcttgatc ctgggagatg 900gatgttgcag tgagctgcga ttgtaccact gcaatccagc
ctgcacgaca gagtgagact 960ctgtctcaag aagaaaaaca aaaaaaggca gtgactaaca
gggatgttac ttagcaggac 1020aggactgtgg aaggagctaa gactgggagt ttcacaaaga
caaagctaga aatgatactt 1080ggagagctgt gttcttgttt taaaaaaatt gtaacaggag
gccaggcaca gtggctcatg 1140cctgtaatcc cagcactttg ggaggctgag gcaggaggat
tgcttgaggc caggagttca 1200aaaccagcct gggcaacatg gcgaaacccc gtatctacaa
aaagttaaaa attagccagg 1260catggtggtg catggctgta gtcccagcta cttgggaggc
tgagacagga ggatcacttg 1320agccctgtag gtccatgctg cagtaaacca agattgtgcc
actgcattcc agcctgggcc 1380acagagtgag accctatctt taaaaaaaaa aaaaaaaaaa
aaaaaaaaaa acaggaatgc 1440atgcagatta aactatgtgt ctgtatacag tatgcaaact
ttagcaagtg ccaggcactt 1500aggcagtagt ctatagctga aaaataaaac attcagaacc
actttttaag gttttgtgtc 1560cttgtaactt taggcattat tattacaata taacttagct
gggacatgag agttaataga 1620tccacatttt aaagtagatt ttttttttaa ttttctagaa
tgtgtctgtg aaaactacaa 1680gctggccgta aactgctttg tgaataataa tcgtcaatgc
cagtgtactt cagttggtgc 1740acaaaatact gtcatttgct caaagcgtga gtaaaatatc
ctaattacct gtaagcttta 1800ttttgactta atacttcttt aattgatgtg ccttgagttg
gaaagagttt tattggctta 1860aatctgaatc atgttacaaa gtaagtgtgg gaacacataa
atttcaaata atctttgacc 1920ctggaacttt agagttaatt ttttttttcc cgtaatcatg
aaatcagtta tttttcagtt 1980tggcattaag gtttcttttt cagtggctgc caaatgtttg
gtgatgaagg cagaaatgaa 2040tggctcaaaa cttgggagaa gagcaaaacc tgaaggggcc
ctccagaaca atgatgggct 2100ttatgatcct gactgcgatg agagcgggct ctttaaggcc
aagcagtgca acggcacctc 2160catgtgctgg tgtgtgaaca ctgctggggt cagaagaaca
gacaaggaca ctgaaataac 2220ctgctctgag cgagtgagaa cctagtgagt ggggctgcct
atactacttg ttttcatgct 2280gttcagattc atttaattaa atttattttt gattatgtaa
tatgatttca tggtttagaa 2340ttcagaagat atgagtgtcc agtgaaaagc ttccttctca
ttccagtccc cctcgctacc 2400cattggacct ccacagaatt gatgttattg attattctat
aaccttccag agatagttga 2460tgaatttgtt atatatctgt tttattattt ttacataaat
gatagcatac taggtataat 2520ttttctttta tatctttact taacattatt cagtatttca
ttgttgcatt agtagtaaat 2580gtatgtaatt taacctatgt atttgcttat tgattgtgtt
ttaaaagtga gatatgcttg 2640ttttagggat tgtttaatga aaaggcacag aaacccactc
aagctagctt aagcaaaaaa 2700agacttcatt ggaagggact agaaactgga aaggatgtca
ggaccaaagt gggcactttg 2760tttttctgtt ctggtcttct ggagcctcgt tgtcagtttt
ctctttgtgc cctttctttt 2820gttttttctt ttttcttttc ttttcttttt ttttcgagat
ggaatttcca ctcttgttgc 2880ccaggttgga gtgcagtggc acaatctcag ctcactgcaa
cctctgcctc ccgggttcaa 2940gcaactctcc tgccttagcc tcctgagtag ctgggactac
agctatacca cacctgacta 3000atttttgtat tttagtagag atggggtttc accatgttgg
ccaggctggt ctccaactcc 3060tgacctcagg caatccaccc acctccacct cccaaagtgt
tggattacag ttgtgagcca 3120ccatgcccgg gcctttcatg ccttttcatc tttttagttg
aacagggcat gacactgcc 31793493187DNAHomo sapiens 349tctttgtgcc
ctttcttttg ttttttcttt tttcttttct tttctttttt tttcgagatg 60gaatttccac
tcttgttgcc caggttggag tgcagtggca caatctcagc tcactgcaac 120ctctgcctcc
cgggttcaag caactctcct gccttagcct cctgagtagc tgggactaca 180gctataccac
acctgactaa tttttgtatt ttagtagaga tggggtttca ccatgttggc 240caggctggtc
tccaactcct gacctcaggc aatccaccca cctccacctc ccaaagtgtt 300ggattacagt
tgtgagccac catgcccggg cctttcatgc cttttcatct ttttagttga 360acagggcatg
acactgccag ctaaactttg acttaatgtg actttatgta ttgtgtccag 420agaacagagg
gtcaatatta gaaaaggtgt tccctcctgg gtgtgtcctt tatgaaggat 480gtgtaaggga
agaaattata ggaatagcta ctgcataaat tttttttctc ttagtcctta 540taattcgaga
attttaggat tagcttatta ggaaaatagt atggaagact gagttatagt 600caactgacat
tgtcttttta ctttatagct ggatcatcat tgaactaaaa cacaaagcaa 660gagaaaaacc
ttatgatagt aaaagtttgc ggacgtaagt gcaattaaat gcatcatatt 720cttgcacagt
tggtggctca aatcttccat cctacaccat tagaaaaagc aagtctaaat 780gcttttttat
atttctgaaa aataaagtta cttgaaatag agttgcaaga atagcacaga 840gattctggga
atacacttca ctcagattca ccaattaaca ttttggcaca tttgcttttt 900atatgtgtat
gtgtggatga atatgtgtgt gtgctttaca tcagtgtatc tatgcatgta 960taaatatttt
tcccagaagc acatgagagc aagttgtaga catcaggccc ctttacccct 1020aagtacttca
gtgtatgttt tcctaagaac aaaaggcatt cttttatata aaccactata 1080caacgatcaa
atttaggaaa aatttttttt ttttttttta gacggagtct cgctctgtca 1140cccaggctgg
cgtgcagtgg cgtgatctca gctcactgca acctgcgcct gccggtttca 1200agcgattctc
ctgcctcagc cttccaagta gctgggacta caggtgcctg ccactacgcc 1260ctgctaattt
ttgtagtttt agtagaaaca gggtttcacc atattggcca ggctggtctc 1320gaactcctga
acttgtgatc ctcccgcctc tgcctcccaa agtgctgcaa ttacaggtgt 1380gagcttccgc
gcccggccag gaaatttaac gttatatcac gttgtgccca ttttcccaat 1440attgtccttt
gtagtaattt ttcccctctg attcaggacc cagtccaaga tccatgtatc 1500acatttagtt
gtcatgactc tttagtctct taatatcgaa cagtttcttg gcctttcttt 1560gtcttccatg
aacttgctat ttttaaagag catgggcaag tcattatata taatgtccct 1620caaattttga
tttgtctgat atttcctcct tttttttttt tttttttgag ttggagtttt 1680cccttttgtt
gcccaggctg gagtgcaatg gtgcaatcac ggctcaccgc aacctctgct 1740tcccggattc
aagcgattct cctgcctcag cctcctgagt agctgggatt acaggcgtgc 1800gccaccatgc
ctggctaatt ttttttgtat ttttagtaga cacggggttt ctccacgttg 1860gtcaggctgg
tctcgaactc ccaacctcag gtgatctgcc cacctcagca tcccaaagtg 1920ctgggattac
aggcatgagc cacctcaccc gagccttgat gttccctctt aactaaaagc 1980aggttatgca
tttttgacag gaaaactact taagcgatct tgtgtccttt ataatacttc 2040acattaggag
ttgcatgatg tcagcttgtc cctttactag taaagtaaac tttggttaaa 2100gtggtatcca
ccaggttttt ccactgtgaa gttaccattc tccctttgta atccataaat 2160aatctatggg
cagatacttg gatactaagt aaatgttctt tttctaatta aactggtacc 2220cagcagtttg
aatatcaatg gatgattcca gcctgaatca attattatta tgatagttgc 2280aaaatggcag
aaaaatttta actttaatga cagttttaga ccctgagctg tctgcttaaa 2340gagtagtgct
tcttactgtt gtgtggtaca aacatttttt tttaatacag attttaaatt 2400ctttacagtg
cacttcagaa ggagatcaca acgcgttatc aactggatcc aaaatttatc 2460acgagtattt
tggtatgatt ttttaataag tgagctttag cagacagttg gtgagacagt 2520atgttttgag
tataaggaca gccagtgatt taagtggtgg ttaaatgcac ttactggagc 2580aacagtttcg
gatctgggta cttaatgtga atttcctgtt actgtttttt tttgtttgtt 2640tgtttcttta
agacagacta ttgctctctt ccccaggctg gagtgtcatg gcaagatctc 2700ggctcaatgt
aacctctgct tccaaggttc aagcaattct catgcctcag cctcccgagg 2760agctgggact
acaggcacat gtcaccatgc ccagctaatt tttgtatttt tagtgtcggc 2820ggggttttgc
tatgttggcc aggctggtct cgaactcctg gcctcaagtg atctgtctgc 2880ctcagcctct
caaagtgttg ggattacagg tgtgagccac cacgcccggc ccattgtttt 2940tggttatcgt
tgttttcctt ccatagcctt tgaaaagcct agttttactc ctaaagaaaa 3000cgtagtatct
cttagtatcc ctaaaacatt tgagttttct tatcctggag aacctgtccc 3060tgtggatgag
ctccagtaac atcttaaagt aaatatgcac caaaattact tttggtaaat 3120acagttttgg
tgcatattta ctttaggatg ttactggagc tcccatcttc tctgctttaa 3180ggaacta
31873503128DNAHomo
sapiens 350acctgtccct gtggatgagc tccagtaaca tcttaaagta aatatgcacc
aaaattactt 60ttggtaaata cagttttggt gcatatttac tttaggatgt tactggagct
cccatcttct 120ctgctttaag gaactagtcc ttaactagtt agcccttact taactcttta
aactctggtt 180taaaaaataa aaagaagctt gaatagtgtg acggaactct ttaaaggtag
tatgaattta 240ttcaagagtc tttagaaaga atgtactttt tttactcttt aaaaacaaaa
tgatggccgg 300gcacggtggc tcacgcctgt aatcccagca ctttgggagg ccgaggcagg
tggatcacaa 360gatcaggaga tcgagaccat cgtggctaac acagtgaaac cctgtctcta
ctaaaaacat 420acaaaatagc cgaatgtggt ggtgggcacc tgtagtccca gctactcggg
aggcttgagg 480caggagtatg gcgtgaacct gagaggcgga gcttgcagtc agctgagatt
gtgccactgc 540actccagcct gggcgacaca gcaagactcc gtctcaaaaa caaaacaaaa
aaacaacatg 600gaaaatgcat gctgcgtttt accttgcatt tctttttctt ttcttttttt
tttttttttt 660ttgagacgga gtttcgctct tgttgcccag gctggagtgc aatggcgcca
tctcggctca 720ccacgacttt tgcctcccag gttcaagcga ttctcctgcc tcagcttccc
tggtagctgg 780gattacaggc aatgtgtcac cacgcctggc taattttgta tttttagtag
agatggggtt 840tctccatgtt ggtcaggctg gtcttgaact ccggacctca ggtgatccgc
ccacctcagc 900ctcccaaagt gctgggatta caggcatgag ccactgcacc cggccttacc
tttcatttct 960ttagtaattt agttttaaag tagttctaat ccaaataaaa tactttcata
tcttatttaa 1020aaatcttttc aatataagaa aatcctctta ggaaaaattg tacattgtaa
ttatgtttgg 1080ttgcatggct gtcttatttc cctttgatag atttagagac ctcccaaaga
tttcttgatt 1140agtgataaac ttagttatcc actaatggaa aggaacagtg atgcatgtag
attatagaaa 1200atcaaacact gaatattctg attctcaatt aatgttattt tcaaatgatt
ttgattatat 1260tagtattaat ttgtattatt caattttttt ccccagtatg agaataatgt
tatcactatt 1320gatctggttc aaaattcttc tcaaaaaact cagaatgatg tggacatagc
tgatgtggct 1380tattattttg aaaaagatgt gagtatcatc ttctttattc ctgtgttcag
gaatgtagtc 1440tatcatgcct caatgaatta aatatatttc atcacctttt tatccactta
cagatcaacc 1500aaatggttcg ctgctgccgt taattttgtc ctccctgtca ctcacatgca
tcttgcttgt 1560ttgtatattt atgcctctta tcaaattgtt ctgcctaaaa tatctcccct
ctttcttata 1620attcttattt attatctact tggtggttac ttagtttgtg catatatgct
cccctatgat 1680atttataatt tacacaaata aaagtctgtt aaaaaagact gtaactgata
tgattaaaat 1740attttgttga aactttaata tattatagtg aggtattttc tgctgaaata
tgaggtttgc 1800ttcaaaataa tctgggcggg ggtgaaagga tgaaaggaag aaaagatgaa
gtaagagagg 1860ctatgtgttg ttggccttgc atctgggtga taggtacatg ggcatcattg
cactactctt 1920tctactttcg tgtatgttga aaggttcctg taataaacag ttttttaaag
ttccaataaa 1980ttagattgtt atcactaaaa ccataaagat tcttggcagc ggttcttttg
gcatacaatt 2040tgtatgtaat tatatgtggc catggttggt ttccttaaat atttttaatt
ccttttctcc 2100ttttcaatac aggttaaagg tgaatccttg tttcattcta agaaaatgga
cctgacagta 2160aatggggaac aactggatct ggatcctggt caaactttaa tttattatgt
tgatgaaaaa 2220gcacctgaat tctcaatgca gggtctaaaa gctggtgtta ttgctgttat
tgtggttgtg 2280gtgatagcag ttgttgctgg aattgttgtg ctggtgagta cagaacaagt
aaaatttcat 2340ttaagggtat attttttcaa gaaaaagtaa tagtggctgg gcgcggtggc
tcaccacacc 2400tgttatccct acactttggg aggctgagac aggtggatca cttgagccca
ggagtttgag 2460accacactgg gcaacatggt gaaaccttat ctgtagtaaa aatacaaaaa
ttagtcagat 2520gtgatggctt gcacctgtgg tcccatctac ttaggaggct gatgtgggag
tggtcagttg 2580agtccaggag gtcaaagctg cagtgagcca tgatcacacc actgcactcc
agcctgggca 2640acacagcagg accctgtctc aaaaagaaga aaaaaggaaa tatgaaaaag
taacatccat 2700attccaaaac attcagggaa aaaaatcttc atttttaaat aattttttta
tggtgaatga 2760atctattgta tctctggtct ctttttacaa aagtcatttt atgaagcaag
aaaggatgct 2820aatattaaaa agcttgtggc tgtgcacctc acaggccagt taaattgcca
tctagcagca 2880agcgtctttc agttgtcact gcaaacaatt caacacctag tgcaaaatac
ctgaaccccc 2940aaaccactca ataagatgga acaacagaac acaaagttaa cgttagccat
acaaaagagt 3000taaaagtgat atgtgaatca atacttccaa gtaaagatga gcaaattgaa
tttaacagtg 3060cttcagcaaa agaatgtatt gcttgaagaa gtgaaaggtt tattttagga
atgtaaggat 3120gcttcggt
31283512155DNAHomo sapiens 351atacctgaac ccccaaacca ctcaataaga
tggaacaaca gaacacaaag ttaacgttag 60ccatacaaaa gagttaaaag tgatatgtga
atcaatactt ccaagtaaag atgagcaaat 120tgaatttaac agtgcttcag caaaagaatg
tattgcttga agaagtgaaa ggtttatttt 180aggaatgtaa ggatgcttcg gtatcaagaa
atcttactaa cactggccag gtgtgatggc 240tcaggcctgt aatcgcagca ctttggaagg
ctgaggcggg tagatcactt gagatcagaa 300gttcgagacc agcctggcca acatggtgaa
accctgtctc tactgaacat acaaaaaaat 360tagctgggcg tggtggcaca tgcctgtaat
ctattcggga ggctgaggca ggagaatagc 420ttgaacctgg gaagcagagg ttgtagtgcg
ccaagatcat gccactgcac tctaatctgg 480gtgacagagc aagactctgt ctcaaaaaaa
aaaaaaaaaa aaaaaaaaaa aaggccaggc 540acagtggctc atgcctgtaa tcccagcact
ttgggaggct gaggcgggtg gatcacccga 600ggtcgggagt tcgagaccag cctgaccaac
gtggagaaac cccatctcta ctaaaaatac 660aaaattatcc gggcatggtg tctcatgcct
gtaatcccag ctactcagga ggctgaggca 720ggagaatcac ttgaacccag gaggtggagg
ttgcagctga gatcatgcca ttgcactcca 780gcctgagcaa caagagtgaa actccgtctc
aaaaaaaaaa aaaaaaaaaa gaaatcttac 840taacacaaca gaattcagaa agaggtttga
gggtatttag gaacttagat ttccagttca 900atcaaccatg tttggctatc catctggaac
aaaatgaaag ttgaattcct atttcactcc 960accaggctgg ccatattgcc cagctgtgtg
agggtggcat gtccagagca cagtagtagg 1020aaaggcgttg ggcagtgtat ccattttcaa
agacatttac atatttaaaa atacaaaaaa 1080gtaaactccc aagaaaatta attgagggaa
tgtttgtaca accttgtggt aggggaaatt 1140atgtaaggca agaaatctgg aatccatgaa
agaaaagata catatatgtg tatgtatatt 1200ttgagagagg gtcttgctgt gtcacccagg
ctggagtgca gtagcatgat cataactcac 1260tgcaacctcc aattcctgga cttaagtaat
cctcctgacc tatcctccca agtagcaagg 1320actacaggta tgtgccacta tacctggcta
attttttaat ttttagtaga gacgaattct 1380tgctatggct gccgaggctg ggcttgaact
cctaggctca agcagttctt ttggcttagc 1440ctcccaaact gctgggatta caggcatgag
ccattgcacc tagtcctata tatatatatt 1500ttggcttcat taaaattaag cattttatat
ggcaaagaaa ctgtaaagta aaaaataacg 1560atgggcatga aaaaaatatg gcgcataaag
caaaaatgga tattatacat aatatacaaa 1620gagttcttac aaattgatga ggaaacctaa
agaaagaatg acaacaggta gggatagaca 1680gttaatagaa atttcagatg gcaaatgaac
acaagtggtt aatgctggaa gtctaattgt 1740tctgtagaaa taaatgaaaa cacaagtgca
ataaggaagc acattgttat tgtatcatag 1800cattgcttgt aaaggtgaat ctggccaggc
gtggtggctt acgcctataa tcccagcact 1860ttgggaggct gaggtgggca gatcacctga
ggctgggagt ccgagaccag cctgaccaac 1920acggagaaac cccgtctcta ctaaaaatac
aaaatgagcc aggcatggtg gtgcatgcct 1980gtcattctgg ctactcagga ggctgaggca
ggagagtcac ttgaacccag gaggcagagg 2040ttgtagtgag ccgagatcat gtcattgcac
tccagcctgg gcaacgagag caaaactctg 2100tctcaaaaaa tgaataaaaa caacaacaaa
agtgaatctg gaaaatagcc tgagt 21553523118DNAHomo sapiens
352catgcctgtc attctggcta ctcaggaggc tgaggcagga gagtcacttg aacccaggag
60gcagaggttg tagtgagccg agatcatgtc attgcactcc agcctgggca acgagagcaa
120aactctgtct caaaaaatga ataaaaacaa caacaaaagt gaatctggaa aatagcctga
180gtgtgtatca gtaagagagt aaattatgtt tattgtatct acgataggga ataatgtgaa
240tggtgaatga gttcgatctt tatctttgga tctggaatgg ttgctatgat gttgatacaa
300gctgtgcaca ggtggtgatg atactgcatg gtcccatttt tagaccccaa aacttagatg
360catgtgttta tatatgatat ttgtattagt gtggaaaagg aggatgtgga agaatgcaca
420ccaaactgtt aaatttcttt cttttttttt tttggaatgg agtctcgctg gccggacgtg
480gtggctcact gctgtaatcc cagcactttg ggaggccaag gcagctgggt cacgaggtca
540ggagatcgag gccatcctgg ctaacacggt gaaaccctgt ctctactaaa aatacaaaaa
600attagccagg tgtggtggcg ggcacctgta atcccagcta cttgggaggc tgaggcagga
660gaatggcgtg aacctgggag gcggagcttg cagtgagccg agattgcacc actgcactcc
720agcctgggtg acagagcgag actccatctg aaaaaaaaaa aaaaaagaaa aggagtctct
780ctgtgttgcc ctggctggag tgcagtgtca tgatctcggc tcactgcagc ctccacctgc
840cgggttcaat tgattctcct gcctcaccct cccgagtagc cgggactaca tgcagaagcc
900accatgtcca gctaattttt gtattttttg gtagagacag ggtttcacca tattggccag
960gctggtctcg aactcatcac ctcgtgatcc gcctgcctcg gcctctcaaa gtgctaggat
1020tacaggcatg agccactgtg cccggcttct tctttttttt tttttttttt tttttttttt
1080tttttttttt tttttttgag atggagtctt gctctgttgc ccaggctgga gtgcagtggc
1140acgatctcgg ctcactgcaa cctccatctc ccaggttcaa gccattttct tgcctcagct
1200tcccaagtag ctgggactac aggcgtgcac caccatacct ggctaatttt tttgtatttc
1260tagtagagat agggtttcac catgttggcc aggctgatct cgaaatcctg atgtcaggtg
1320atctgctcac ttcggcctcc caaagtgctg tgattatagg cgtgaaccac catgcctggc
1380ctaaactgtt aaatttcttt aaagattatt cattgtttcc tttttttctt tctctttctt
1440ttctgttgtc ccattggatc cagcattgtt tttgattttg atttttgttt gtttgtttca
1500cttgtcgtgg tagacttttt tttgtttagt agtgaaagtt tttattttat tttatttatt
1560tatggagaca gagtctcctt ctgttgccca ggctggagtg caatggtgca tgatcttggc
1620tcactgcaac ctctgccccc caggttcaag ctattctcct gcctcagcct cccgagtaga
1680tgggattaca ggcgcctgcc accacgcctg gctaattttt gtatctttag tagagatgag
1740gtttcacaat attggccagg ctggtcttga actcctgacc taaagtgatc cacccacctc
1800agcctctgaa agtggtaaga ttacaggcat gagccatcat gcctgaccta ttttatttta
1860ttttaatttt tttttagaga tggagtccca ctctgtcgcc caggctggag tgcaatggcg
1920ccatctcggc tcactgcaac ctctgcctct cgggttcaag tgattttcct gcttcagtct
1980cccaagtagc tgggattaca ggcgaccacc accgcgcctg gctaattttt ttgttttttt
2040agtagagtcg gggggtttca tcatgttggc caggctggtc ttgacctcct gacctcaagt
2100gatccgccca cctcggcccc acaaagtacc ggtgagccac cacgcccagc ccaccttatt
2160tatttttaag agacagggtc ttactctgta gcccaggctg gagagcagtg atgccatctc
2220cactcactgc aacctctgcc tcctgggttc aagcaattct ggtgccttag cctcctgagt
2280agctgggact acaggtgcgt gccatgacac ctggctaatt tttgtatttg tagtagagat
2340ggggtttcac cgtgttggct gggctggtct gaaactcctg acctcagatg atcttcccgc
2400ttcggcctcc caaagtgccg ggattacagg catgagccac tccactggtg tgaaattttt
2460aatttaagaa gcaataaatg tttatggata gatgttaaaa ttagtttttt ttcagatcaa
2520aattatgtcc attaaaagca tatatgtctg tttagataat ctttttttga atagcagtcc
2580taaaacaata gttgtctttc ttccactcag gttatttcca gaaagaagag aatggcaaag
2640tatgagaagg ctgaggtaaa tggattactt acctaaatag aaaggccctg ttgaatctct
2700tactcctaat cactctacct tcctacacac tgatgcattt cagttatact ggagtccctt
2760tatactgttg tctttagggt cttagggaca gtcttagaat gtactcttac ctaaatattc
2820ttgcgtgagt tccatggcag atcaccatct gttttctgcc tcatagaaga gtggaatggg
2880aagcctatgg tttttattct acaaagagtc aacatctaac agaatcttct gaaggcatac
2940tccagtggat tcaccttgga gaaactcatt gtgactgatg atctgattta ttatctctat
3000gccagtgaaa taatcattta atatgaactt aatttgtcat aatctattgt gtactaacta
3060gtctatacta gtgtgacatc aaagtgtcag attgttagtg tgtttcagtc ccttggaa
31183532346DNAHomo sapiens 353tagtgtgttt cagtcccttg gaattgaata tgaacactta
tccttgaacc ctatcaataa 60catttttcac atatctcaat ttttgtgtgt ctttgtagtt
gtatgtgggc cacttactaa 120tattttagca agtaataaaa atagaaacgt aaaggaatat
tggaaaaagt ctaatggaac 180cagaaagttc tagcattttt ttcccattct gtagtaggtc
atctggttta tttggtttgg 240tgaccgcaag tctagaagac taaccctgaa ttgaatggta
acagacaggc agaatgacaa 300tgtagtgttg cagtgcagag cagtacagac ctgggtttgg
ctgggcaaaa ttatataact 360tctttaagcc tccatgtttc ctcatctgta aaatgaggat
aatagatagt atggacctgt 420tgcaaggatt aaacataatc agtgtaaagt gttggtccca
tgcttgccac ataagaaaat 480atttgtcaac agagtggtag ttgtcattat cattgtctca
gtttgcctgt aactagttgt 540gtgatctgag acaaacacta attttgaact tgagtttccc
cacatgtaaa atgaaagatt 600gataatagaa agtaaatcaa ttttttctag cattaaaaat
agtatgcatt taataaaaat 660cttattctta atgatctagc ttacctccaa cttgccctag
tcactttggc gatcttgtct 720ctaaatagaa ccttgaaaac acttaaatgt gtgtttcctt
gcaatataac tttttctttt 780tttatttaaa taagtcttat aaatgtggga aaaaattatc
ttgtgttcct ttaatttcat 840ttttatttaa tactattttc agaatgaaca aaagattgaa
aaattattta gaattttttt 900ctgtgctttt tcctgtttca gataaaggag atgggtgaga
tgcataggga actcaatgca 960taactatata atttgaagat tatagaagaa gggaaatagc
aaatggacac aaattacaaa 1020tgtgtgtgcg tgggacgaag acatctttga aggtcatgag
tttgttagtt taacatcata 1080tatttgtaat agtgaaacct gtactcaaaa tataagcagc
ttgaaactgg ctttaccaat 1140cttgaaattt gaccacaagt gtcttatata tgcagatcta
atgtaaaatc cagaacttgg 1200actccatcgt taaaattatt tatgtgtaac attcaaatgt
gtgcattaaa tatgcttcca 1260cagtaaaatc tgaaaaactg atttgtgatt gaaagctgcc
tttctattta cttgagtctt 1320gtacatacat acttttttat gagctatgaa ataaaacatt
ttaaactgaa tttcttaact 1380ttgacatttc aaatttcttc ttctttttct tttctttttt
tttttttttt gagatggagt 1440cccactctgt tgccaggctg gagtgcagtg gcacaatctc
ggctcactgc aacttctgcc 1500tcctaggttc aagcgattct tctgcctcag cctcccgagt
agctgactac aggcgcccac 1560caccattcct ggctaatttt tgtattttta gtagagacaa
agtttcacca tattggccac 1620gctagtctcg aactcctgac ctcacgatcc acccacctct
acctcccata gtgctggggt 1680tacaggcgtg agccaccgcg cccggcctct ttttttcttt
ttgttttgtt ttttcttttt 1740ttttttgaga caggatcttg ctctgtggcc taggctggag
tgtagtggtg cgatctcagc 1800tcactgcagg attcaagcga ttctcctgcc tcagcctacc
aagtagctag gattacaggc 1860tcccactacc atgcccggct aatttttgta tttttagtag
agaaaaggtt tctttttctt 1920ttttcttttc tttttttctt tttttttttt tgggggggtg
agacagagcc taactctgtt 1980gcccaggctg gagtgcagtg gcacaatctc agctcactgc
aacttctgcc tcctgggttc 2040aagcaattct cctgcctcag cttcccaagt agctgggact
acagatgtgc accaccatgc 2100ccggattatt tttgtatttg tagtagagac agagtttcgc
catgttggcc aggctgatct 2160cgaactcctg acctcaagtg atccacccac cttggtctcc
caaagtgctg ggattacatg 2220tgtgagccac catgcctggt cctatttact ctttgttaag
tggaagtgga tcatcataaa 2280ggtcttgatc ctcatagttt tcactttgag taggctgagg
aagaggaagg gttggtcttg 2340ctgtct
23463543104DNAHomo sapiens 354cacattacga gctcagtgcc
tgccggaaat ctcccacctg gtggcaacct acccttgcat 60acaccccacc caggggcttc
aagccttgca gctgagtaaa cacagaaagg agctctacta 120aggatgcgcg tctgcgggtt
tccgcgcgac ctaggcgcag gcatgcgcag tagctaaagt 180caccagcgtg cgcgggaagc
tgggccgcgt ctgcttatga ttggttgccg cggcagactc 240ccacccaccg aaacgcagcc
ctggaagctg attgggtgtg gtcgccgtgg ccggacgccg 300ctcgggggac gtgggagggg
aggcgggaaa cagcttagtg ggtgtggggt cgcgcatttt 360cttcaaccag gaggtgagga
ggtttcgaca tggcggtgca gccgaaggag acgctgcagt 420tggagagcgc ggccgaggtc
ggcttcgtgc gcttctttca gggcatgccg gagaagccga 480ccaccacagt gcgccttttc
gaccggggcg acttctatac ggcgcacggc gaggacgcgc 540tgctggccgc ccgggaggtg
ttcaagaccc agggggtgat caagtacatg gggccggcag 600gtgagggccg ggacggcgcg
tgctggggag ggacccgggg ccttgtggcg cggctccttt 660cccgcctcag agagtgggcg
gtgagcagcc tctccagtgc ggaggcacgg gggcggaacg 720ttggtgcttg tgcggattcc
gccgtcccca ggttctgctt ggctccggag ggacgccccc 780ctcagccctg aaacccgtgc
ctctccagcc gccccggatc tgaacttgtg atcacggagt 840gtttacgtcg tgccaggcat
tttaatgcat tgttctagtt cattttccag cagtcgcatt 900cctcgccttg gccctacatg
tagcgctcat tacaaacacg gccagaatct cttattaaca 960aacagcagcc aggagtgaga
tttaaaatag actgggggtt taggagaccc ttttatgaca 1020cgtaattctg ctcccacgac
gctcccattt ataccgccgg tccagctaag ggtctggtaa 1080tggagcgccg ttgaagagca
gtatgatgaa gtggtcagga ccaacggact ctggagctgg 1140gctgcttggg atcaagtcgc
tgcccctctg cttattaacg tgtgaccttg ggccagtcat 1200ggacgctatc tgcttcagct
cagcattcag tgctctccgt cacccgaccc catctatcca 1260ggattatctc tccctggaaa
gctacaaacg tctcacccta tgtgggccaa atgttctgga 1320taggcctagt taacctcttc
tctccctgtt ttctttgcgc tttcttgcag ctatgtagtt 1380atgctaatga aaagagcatc
ctagggggag cagagttgtg gattctagtc ctgactagag 1440gactagtgca aatgcgatac
tcctgatgaa aaatgtttca ttcgttagat ataaatgtgt 1500taggcagggt tatggacact
agatgaaaaa agaaatacct ctactttcat agagatcact 1560attggacagc aaggcagaaa
taattacaat tcaagttgga ggcttatgga ggtgagcttg 1620taagaggtta caagaggcgc
caaggcagga tcgccaaaga cggaagactt tggaagagtc 1680tcatacaacg gaagaggcgt
tatatgagac accaaagtcc acgttgagtc ttggtggact 1740agaagtttgc tagggagagg
gcttgaaacg aggtagattg gcgttgctgg tgtagaaaag 1800gaaggagact ggcccaggtg
ggtggggtta gatgaccaaa ggcttttagt gtggtgttga 1860gctgttgaaa ttttatgctg
tagccaatga aaagtctgaa atgttttttt tttttttttt 1920tctgagacgg agtcttactg
tgtcgcccag gttggagttc agtggtgtaa tcctggctca 1980ctgcaacctc cacctcctgg
gttcaagcga ttctcctgcc tcagccaccg gagtagctgg 2040gattacaggc acgtgccacc
acgcctagct aatttttgta tttttagtag agatggggtt 2100tcaccatgtt ggccaggctg
gtctcaaact cctgacctca agtgatccac ccaccttggc 2160ctcccgatgt gctgggatta
caggtattag ccactgcacc tgacctacat agattttaca 2220taagacttta aaacagggcg
ggcgcagtga ctcacgcttg taatcccagc actttgggag 2280gctgaggtgg gcggatcaca
aggtcaggag atcaagacca tcctggctaa catggtgaaa 2340ccctgtctac actaaaaata
caaaaatccc agcactttgg gaggctgagg tgggcggatc 2400acgaggtcag gagatccaga
ccatcctggt taacactgtg aaaccctgtc tctactaaaa 2460atacaaaaaa ttagctgggt
gcggtggcag gtgtctgtag tcccagctac ttgggaggct 2520gaggcaggag aatggtgtga
acccggaagg cagagcttgc agtgagccga gattgtgcca 2580ctgcactcca gcctgggcaa
cagagcgaga ctccatctca aaaaaaaaaa aaaaaaaaaa 2640aaagacttta aaaaaaatta
taagaaagga cagaccaagt gcagtggttc gttccagcac 2700ttagggatgc caaggtggga
ggattgcttg atgctaggag ttgaagacta gcctgtgtaa 2760catagcgaga cccatctcta
caaaaaaatt aaaaagttac ctttagaact tacgattttt 2820atgtgtagac tccatataag
cagagggtct atgcttattc actatttatt accttccata 2880gtccctgcac atataatagg
tgcttcataa acaatttaat gaatgaataa attactgaga 2940aaacactgga agtttttggg
ttagcattgt gttaggtgct tgatatggtc tggctgtgtt 3000cccaccctta tctcatcttg
aattcccatg ttttgtggga ggtacctggt gggacataat 3060tgaatcatgt gggcaggttt
ttcctgtgct gttctcctgg tagt 31043553131DNAHomo sapiens
355gggttagcat tgtgttaggt gcttgatatg gtctggctgt gttcccaccc ttatctcatc
60ttgaattccc atgttttgtg ggaggtacct ggtgggacat aattgaatca tgtgggcagg
120tttttcctgt gctgttctcc tggtagtgaa taagcctcac aagatctgat ggttttaaaa
180atgggagttt ccctgcacag gctctctctc tttgcctgcc gccatccatg taagatgtaa
240cttgctcctc cttgccttcc tcaatgattg tgaggcctcc tcagccatgt ggaactggct
300gcagagtcat taaacttcgt tcttttgtaa attgcccagt ctcaggtatg tcttttttta
360tttttttttg agacagagtc tggctctgtg gctaggctgg agtgcagtgg tgcgatctcg
420actcactgca gcctccgcct cccgggttca agcgattctc ctgcctcagc ctcccaagta
480gctgggacta taggtgcacg ccaccatgcc cagctaattt ttgtattttt aatagagacg
540gagtttcact gtgttggcca ggatggtctt gatctcttga cctcgtgatc ttcccgcctc
600cgccttccaa agagctggga ttacctaccc agctgggtat gtctttatta gcagcgtgaa
660aacagactaa aacagtaaac tgataccaat agagtgggat gcagctgaaa agatacccga
720aaatatggaa gcaactttgg agctgggtaa caggcagagg tcagagcagt ttagagggct
780cagaagaaga ccagaaaatg tgggaaagtt tggaacttcc tagagacttg ttcaatggct
840ttgaccaaaa tcctgataat gatatggaca atgaaatcca ggctcatgtg gtctcagatg
900gagatgagga acttgttggg aactggagca aaggtgacac ttgttatgtt ttagtaaaga
960gactggtggc attttgccct gccctagaga tttgtggagc tttgaacttg agagaaatga
1020ttttgggtat ctggtgggag aaatttctaa gcagcaaagc attcaagagg tgacttgggt
1080gctgttaaag gcattcagtt ttaaaaggga aacagcatga aagtttggaa aatttgcagc
1140ctgacaatgt gatagaaaag aaaatcccgt tttctgagga gaaattcaag ctagctacag
1200aaatttgcat aagtaatgag gatcccaatg ttaatcccca agacaatggg aaaaatgttt
1260ccagggcatg tcagaggcct tcatggcagc ccctctcatc acaagcctag aggcctagga
1320gaaaaaagtg atttcatggg ccagcccggg gtccccatgc tgtgtgcagc ctagtgactt
1380ggtgccctgc atcccagctg ccccagctgt ggctgaaagg ggccaaccta gagctcaggc
1440catggcttca gagggtgcaa gcctgaaacc ttgacagctt ccaggtggtg ttgagcctgc
1500aggtgcacag aaatcaataa ttgaggtttg agaatctctg cctaggtttc aaagatgtat
1560ggaaacgcct gcatgtccag gcagaagttt gctgcagggg tggggtgctc attgagttcc
1620tctgctaggg caatgtagaa gggaaatgta gggtcagagc ccccccacag agtccctact
1680ggggcaccac ctagtggagc tgtgaaaaga gggctaccat tctccagacc tcagaatggt
1740agatccacag acagcttgca ccatgtgcct ggaaaagctg tagacactta acgccatctc
1800atgaaagcaa ccaggcagtg tgctgtaccc tgcaaagcca caggggcaga gctgtccaag
1860gctgtggttg cccagctctt gcatccgcat gacctggaca tgagacatag agtcaaagga
1920gatcattttg gagctttaag atttgactgc catgctggat tttggacttg catggggcct
1980gtagcccctt tgttttggcc aatttctccc atttggaatg gctgtattta cccaattcct
2040ataccccatt gtatctggga agtaactaac ttgcttttga tttgacaggc tcatatgcgg
2100aaaggactta ccttgtcttg aatgagactt tggactggaa ttttgaatta atgctgaaat
2160gagttaaggc tttgggggac tgttgggaat gcatgattgg ttttgaaatg tgaggacatg
2220agatttggga ggggtcatgg cagaatgata tggtttggct atgtccccac ctaaatccca
2280tcttgaattc ccatgtattg tgggagggac ctggtgggag atagttgaat catggggatg
2340gatctttccc atgctgttgt gatagtgaat aagcctcatg agatctgatg gttttaaaaa
2400cggaagtcta cctgcacaag ctctttcttt gcctgctgcc atccatgtaa gacatgactt
2460gttcctcctt gccttctgcc atgattgtga gacctcccca gccatgtgga actataagtc
2520cagtaagcct ctttttcttc ccagtctcgg gtatgtcttt atcagcagca tgaagtccag
2580ctaatacagt gcttgaacat gtaatatctc aaatctgtaa tgtacttttt ttttttttaa
2640ggagcaaaga atctgcagag tgttgtgctt agtaaaatga attttgaatc ttttgtaaaa
2700gatcttcttc tggttcgtca gtatagagtt gaagtttata agaatagagc tggaaataag
2760gcatccaagg agaatgattg gtatttggca tataaggtaa ttatcttcct ttttaattta
2820cttatttttt taagagtaga aaaataaaaa tgtgaagaat ttaattgtgt tttagtattt
2880taagtagatt gtgatagtag aatggtttga gacactttaa tagcaattag catgtggttt
2940ttaaaaagtt gcagtttggc tggtcgcagt ggctcatgct tgtaatccca gtattttggg
3000aggctgaggc aggtaggttg cctgagccca ggagttcaag accagcctgc ccaacgtggt
3060aaagccccat ctctactgaa gataaaaaaa tttaaaaaaa ttagctgggg ctattggcac
3120acacctgtgg t
31313563085DNAHomo sapiens 356agttgcagtt tggctggtcg cagtggctca tgcttgtaat
cccagtattt tgggaggctg 60aggcaggtag gttgcctgag cccaggagtt caagaccagc
ctgcccaacg tggtaaagcc 120ccatctctac tgaagataaa aaaatttaaa aaaattagct
ggggctattg gcacacacct 180gtggtcccag ctaatcaaga ggatgaggtt agaggatcac
ttgagcccag gaggttgagg 240ttacagttta actttcagag gccaaggcag gaggattgct
tgagtccagg agtttgagac 300caccctgggg aatgtaggga gatcccatct ctatagaggg
atagattaga tagataattt 360ctgaggggag gggaggggga gggccaggga aggggaggga
aaggggaggg gagggcaggg 420ccagcagtaa ggtcataata gagacatgta tctgtaagat
ccttataata ggtgaggatg 480gccacaaatt agcgccacag atttgtattt ttagtagaga
caaggtttta ccatgttggc 540caggctggtc ttgaactcct gacctcaagt gatccgcctg
ccttggcctc ccaaagtgct 600gagattacag atgtgagcca ccatgcccaa ccacaagcat
ttatttattt atttatttat 660ttatttattt atttatttag agacagtctt gctctgtcgc
caggctggag tgcagtggcg 720ccatctgggc tcactgcaaa ctctgactcc ctggttcaag
cttttctccc gcctcagcct 780cccgagtagc tgggattaca ggtgcatgct gcaacacccg
gctaattttt gtatttttag 840tagagatggg gtttcaccat gttggccagg acggtctcga
tctcctgacc tcgtgatccg 900cctgccttgg cctcccaaag tgttgggatt acaggcgtga
gccacagcac tcagccagtt 960atttttttat aagaaaacat tttactggcc aggcctggtg
gctcacacct gtaatcccag 1020cactttggga ggccgaggca ggcggatcac gaggtcagga
gttcgagacc agcctggcca 1080acatggtgaa accccatctc tactaaaaat acaaaaatta
gccaggcgtg gtggtgtgcg 1140cctgtattcc cagctactgg ggaggctgaa gcaggagaat
cgattgaacc cttgaggcag 1200aggttgcagt gagttgagat cgcaccattg cactctagcc
tgggtgacag agcaagactt 1260catctcaaaa aaaagagaaa acattttatt aataaggttc
atagagtttg gatttttcct 1320ttttgcttat aaaattttaa agtatgttca agagtttgtt
aaatttttaa aattttattt 1380ttacttaggc ttctcctggc aatctctctc agtttgaaga
cattctcttt ggtaacaatg 1440atatgtcagc ttccattggt gttgtgggtg ttaaaatgtc
cgcagttgat ggccagagac 1500aggttggagt tgggtatgtg gattccatac agaggaaact
aggactgtgt gaattccctg 1560ataatgatca gttctccaat cttgaggctc tcctcatcca
gattggacca aaggaatgtg 1620ttttacccgg aggagagact gctggagaca tggggaaact
gagacaggta agcaaattga 1680gtctagtgat agaggagatt ccaggcctag gaaaggctct
ttaattgaca tgatactgtt 1740tcatttaagg aaaaataata aaaaaactct tttttttgta
tctaattaaa ataatgttct 1800gatgtttaca gaaactttgt atatttaatt ggacattaga
acaagctgtt tgttgtgtaa 1860gatttatttt acctcagatc ttttctcccc cctttccttt
ctgtcttgtg ttccaaaaga 1920gtaattatta cggtaaatat tactgtaatt atggatttat
caaataagat gcagttcttt 1980agcatttttt gataaatcga gtggaacttt agcctgttat
tttactattt gttttatttt 2040aactaaattc tgattgtgtc attttttttt tttttttttg
ggaccgagtc tcgctctgtc 2100gcccaggctg gagtgcagtg gtgcgatctc ggctcactgc
aacctctgcc tcccaggttc 2160aagcaattct tctgcctcag cctcctgagt agctgggatt
acaggtgtgt accaccacac 2220ccagctaatt tttgtatttt tagtagaggt gaggtttcac
catcttggcc aggctggtct 2280tgaactcctc acctcgtgat ccacccacct gggcctccca
aagtgctggg attacagcca 2340tgagccacca tgctcggctt tgattgtgtc atttgtatag
gcatgtggtt tattatttag 2400ttattttttt ttttttcttt gaggtggagt atcactcttg
gtgcccaggc tggagtgtaa 2460tggcgtgatc tcagctcact gcaacctcta cctcctgggt
tcaagcaatt ctcctgcccc 2520agcaggagta gcttgggatt acaggcatgc cccaccacac
ctggccaatt ttgtgttttt 2580agtagagaca gggttccacc atgttggtca ggctggtctt
gaactcctga cctcaggtga 2640tctgcccacc tcagcctccc agagtgctgg gattataggc
atgagccacg gtgcccagca 2700tatttagatt tttttttttt tgagactgag tctgactctg
tcacccaggc tagagtgcag 2760tggcacgatc cacgatcttg gctcactgca gcctccacct
tatgggttca agcgattctt 2820ctgcctcagc ctcccaagta gctgggactg caggcacatg
ccaacacgcc cggcttattt 2880ttgtattttt atagagacgg ggtttcatca tattggtcag
gctggtctct aactcctgac 2940cttgtgatcc acccgccttg gcctcccata gttctgggat
tacaggcatg agccacagcg 3000ccaggcctag atgtttctta aggtatgtat ctcccaaaga
ttctttttgt ggtcctcaag 3060taccataagc accgctggag ataac
30853573148DNAHomo sapiens 357accataagca ccgctggaga
taacacatgt gatgggcatt tttagcatag attgtatcta 60agcaactttc cacaagtaat
agttctgtta agggttgtta ttgtggccgg gcgcggtggc 120tcacacctgt aatcctggca
ctttgggaag ctgaggcggc cggatcacct gaggtcaggg 180attcgagacc agcctgtcca
atgtgctgaa accctgtctc tactaaaaat gcaaagaaaa 240aaaaaatcta gccaagcatg
gtggcttgct cctgtaatcc tagctacttg ggaggctgag 300gcaggagaat tgcttgaacc
tgggaggcag aggtagcagt gagccaagat cgtgtcaccg 360cattccatcc tgggcgacag
tgagactctg tctcaaaaca aaaaaagagt tgttaccgtt 420gggactattt tttgaaagct
ttatgtgaac gtaattttat attttgatga aaatttagtt 480tattgatgta aaaagtgtat
cagtacatca tatcagtgtc ttgcacattg tataaacatt 540taatgtaggt gaatctgtta
tcactatagt tatcaatgtt ataattttca tttttgcttt 600tcttattcct tttctcatag
tagtttaaac tatttctttc aaaatagata attcaaagag 660gaggaattct gatcacagaa
agaaaaaaag ctgacttttc cacaaaagac atttatcagg 720acctcaaccg gttgttgaaa
ggcaaaaagg gagagcagat gaatagtgct gtattgccag 780aaatggagaa tcaggtacat
ggattataaa tgtgaattac aatatatata atgtaaatat 840gtaatatata ataaataata
tgtaaactat agtgactttt tagaaggata tttctgtcat 900atttatctca aaacctaaac
tgtgtatcaa tgatattaag cttttttttt tttttgagac 960agagtttcac ttttgttgcc
caggctggag tacaatggcg cgatcttggc tcaccacatc 1020ctctgcctcc caggttcaag
tgatcctcct gccttggcct cctgagtagc tgggattaca 1080ggcatgtgcc accacgcctg
gctcatcttt tttgtatttt tagtagagat ggggtttctc 1140tatgttggtc aggctggtct
caaactcctg aacctcaggt gatccgcccg cctcgggctt 1200ccaaagcgct gagattgcag
gcatgagcca ctgtgtctgg cctattttta tagtttatgt 1260acttggaatt atataatata
ttctgcctag cttctttcat tcaatatttg taagatttat 1320ccatattatt gagtgtagtt
gtggattttt gcatttatat ttcatagcac gagcatgtca 1380gaatttatcc attttacttc
ccttctgccc gccactgcta ctctccccat tttacctttt 1440tttttgtttt tttgagatgg
agtctcagaa tttcgctctg tcgcccaggc tggagtgctg 1500tggcacggtc tcagctcact
gcaacttctg cctctgggtt cagctgcacg ccaccatgcc 1560tggctaattt ttgtattttc
agtagagggg attttgctat gttggccagg ctggtcttga 1620actcctgacc tcaggtgatc
cacccacctt ggcctgccag agtgctgtga ttacaggcgt 1680gaaccaccgt gcccgacccc
cattctaatt ttgatggaca tttgggtaat tttcattttt 1740ggctgttata aatactgctg
caattacagt taattttcac agtttttttt tttttttttt 1800tttttttttt tttttgaggt
gagtttcgct cttgttgctc aggctggagt gcagtggtgc 1860gatctcagcc cactgcaacc
ttcaccttct ggattcaagc aattctcctt tctcatctcc 1920taagtagctg gggtttacag
gcatgtgcca ccatgcccag ctaatttttg tattttaatt 1980tcacagttct ggaggctggg
aagttcagaa ttaaggcact ggctgatctg ttgtctggtg 2040agggcccact tgttcataga
taaccatttt ctcactctaa cctcacaagg ttgaaagggc 2100ctaatttttg tgtttttagt
agagacgggg tttcactatg ttggctaggc tggtctcaaa 2160ctcctagcct cgagtcatcc
acccgcctcg tcctcccgga gtgcttggat tacagcatga 2220gccactgcgc ccggccccca
ttttagtttt gatggacatt tgggtaattt tcttttttgg 2280ctattctaaa taatgctgca
attactgtta attttcacct tgtaaaaacc attttcaaat 2340ctcaagagat taacctttag
ttttcttggt ttggattggg aaggaacacc aaggaaaatg 2400agggacttca gaatttattt
tcattttgca tttgtttttt aaaatcttta gaactggatc 2460cagtggtata gaaatcttcg
atttttaaat tcttaatttt aggttgcagt ttcatcactg 2520tctgcggtaa tcaagttttt
agaactctta tcagatgatt ccaactttgg acagtttgaa 2580ctgactactt ttgacttcag
ccagtatatg aaattggata ttgcagcagt cagagccctt 2640aacctttttc aggtaaaaaa
aaaaaaaaaa aaaaaaaaaa agggttaaaa atgttgaatg 2700gttaaaaaat gttttcattg
acatatactg aagaagctta taaaggagct aaaatatttt 2760gaaatattat tatacttgga
ttagataact agctttaaat ggctgtattt ttctctcccc 2820tcctccactc cactttttaa
cttttttttt tttaagtcag agtctcactt gttccctagg 2880ccagagtgca gtggcacaat
ctcagcccac tctaacctcc acctcccaag tagttgggat 2940tacagttgcc tgccaccatg
cctggttaat ttttatattt ttagtagggt tgcggggaca 3000gggtttcacc atgttggcca
ggttggtctc aaacttctga ccttaggtga tcctcccacc 3060tcggcttccc aaagtgctgg
gattacaggc ttgagccatc gtgcccagcc tactttttac 3120ttttttagag actgggcttg
gtggagtg 31483582176DNAHomo sapiens
358ttagagactg ggcttggtgg agtgaagtgg caagatcata gctcactgca gtattgaact
60cctgggctca agcgatcttc ctgcttcaac ctcatgagta gctgggtcta caggcacaag
120ccaccatgct tgcctaattt taaaattttt gcagagttgg agtttcacag tgttgcccag
180gatgttcgct cactcctgac ttcaagtgat tcttctgcct tagcctctag agtggtagct
240gggattacag gcatgaacca ccatgctctg ctattttttt tcaaggtttt tttttttttt
300tttttttttg agagactggt atgactatgt atgctcccta ggctggagtg cagtggctat
360tcacaggaag tgccatcaga gtgtactaca gcttcaaact cctgggctca agcacttcta
420tcatagtctc caaagtagct gggactacga gtgtgtctca ttgtgccttg ctctcgaatt
480gctttttttt tttttttctg gtttcaagct atctatgtgg tattagtcct cactttatga
540ataattttgt atactactaa tagcaatttt tttttttttt ttttttttga gacggagtct
600cattcttgtc gcccaggctg gagtgcagtg gtgtgatctt agctcactgc aacctctgcc
660tctccggttt gggcaattag ctgggattag aggcgcctgc caccatgccc agctaatttt
720tgtattttta gtagacatgg ggtttcatct tgttggctag gctggactct aactccaggt
780gatctgcctg cctcggcctc ccaaattgat gggattacag gtgtaaacca ctgggcctgg
840cctagcaatt taaaatgaca ttctaagaag ttttatgtct aaatctgcag taagtggctg
900ggtgacgtgg ctcatgcctg taatcccaac gctttgggag tccagggtgg gaggatgact
960tgaggccagg agttgagacc agcctgggca acatagtgag actctgtctc tacaaaagaa
1020aaaattagcg gggcttagtg gcgtgcgcct gtagtctcag ctactcgaaa ggctgaagtg
1080ggaggattct ttgagcccca agggttctgg cttgccgtga gccaggatgg caccactgca
1140ctccagtctg ggcaatagag tcagaccctg tctcaacaaa taaaataaaa ctgtagtaat
1200tataaagtgg ttttggctgg gggagaaatg tacagttgaa catacggatt aagaggttga
1260aagttggtct taggaagagg aactttttgt ggaaatttct taatatttga agaatattat
1320gttattgttc ctctgttttt catggcgtag taaggttttc actaatgagc ttgccattct
1380ttctatttta ttttttgttt actagggttc tgttgaagat accactggct ctcagtctct
1440ggctgccttg ctgaataagt gtaaaacccc tcaaggacaa agacttgtta accagtggat
1500taagcagcct ctcatggata agaacagaat agaggagagg tatgttatta gtttatactt
1560tcgttagttt tatgtaacct gcagttaccc acatgattat accacttatt gtaatatgca
1620gttttggaag tatatgttac catttaactg tacagagtac atagtaatag agtggtaatt
1680atttagattg attaaagaac tcattttttt aaataagttt tttttttttc actataaaag
1740tttattttat ttgagatggt atggtatcga acatgttcat attgtgtgta atcgtgggta
1800aattactcaa cctttatgtc atagtttctt cacctttaaa atgacattaa taaaagagct
1860acttaatagg attataagca tgagatgatt taatatacat aaaatactta cagtctgata
1920tataggaagc acttaactct ttatcctaga aaagatttaa ggtgacctta acatatatgt
1980cagaaaatct ttaaaattgt ggaaataaaa ggttgtataa ttctgctatc ctaaaattac
2040tagtatttca atatatttta ttttagtctt ttcttttaga tacaagtttt aaaactttta
2100agtgaagtgt aatatacgta agtactgctt gatgaattta aggtgatttc taaagccagg
2160tttgttgggg aagagg
21763593128DNAHomo sapiens 359ccagtttcaa gcgattaagt gattctcctt cctcagcctc
ccgagtagct gggattacag 60gcgtgtgcca ccacgtcgcc accacgtctg gctaattttt
gtatttttag tagagacagg 120gtttcgccat gttggccagg ctggtcatga actcctgacc
tcaggtgatc cacctgcttg 180gcctcccaaa gtgctaggat tacaggtgtg agccactgtg
cctggcttaa gttttgtatt 240tttagtagag acggtgttcc atcatgttgg tcaggctggt
gtcaaactcc tgaccatgcg 300atccgcctgc ctcggcctcc caaagtgctg agattacagg
cgagagccac cgtgcccggc 360ctgtttgagt atcttttaaa accagtaagg acaaactaga
ggtgtcagct ctcttcatgg 420gctttggaga aacaagacaa aaaggaaaga gatgtttcgc
cgggcgcggt ggctcactcc 480tgtaatccca gcactttggg aggctgagga cagcggatca
cccgaggtca ggagtcaaga 540ccagagccat tgcactccag cctgggcaac aagagcacaa
ctttatctca aaaaaaaaaa 600aaccaaaaaa gaaacaggaa agagatgttt tgatttttta
agtctagagt gttctgttct 660tactctacag cacttagcag tagtccatct atcctccttg
tttgttcttt acaacaaaac 720cccattggtt ctctcttacc aagtttgctt tattcttggt
ttatcctttg taagatgtga 780aagggatatg aagagcaaat aggaagtgtt actcttgctg
cttgagagaa agctgtttta 840caatttgttg gcaaacaatt tgtaaaagta caacaaaagt
gtgcattttt ggcttcttat 900ttatgtttta tcattgctat atctcataat ttgtgatttt
taaaataact ttttatttga 960aaagcactac agggtcacgt catgttttta aaaaataaat
taagaaggta aacacccgta 1020cttctacttt acctctagtc ctagtctatg gtggtaatca
gtgttaacag tttagtttgt 1080gttcttaccc ttccaggggt tttttttctc tatgtataca
gatatatgca tttttaaaaa 1140catagttaac acttaaaaac aatatgggat cgtattagga
atacaatctg tattccttcc 1200caacagtata tacagttttt ttccatttca ctatgtatct
atttataaat tttttatttc 1260taataatttc tcttgaatag gtgagacatc atatagtata
aaattcagta gaaaatcagt 1320ttttcagagg tacaaaattg gctgactttg cacagactcc
tttcatttca caggtaggga 1380tgcacagcca cctcttccac cgacgagagg aaaggatatg
tgtgcctgtg ggctcttcaa 1440ctctgttgat tagttatgat ttattttctg gtcagtttga
gaggaaacag tgataaaata 1500ctgggaacag ggaagaagca taagattatt attgtttttt
tttttttttt tgagacagag 1560tcttgctcag ttgcccaggc tggagtgcag tggtgcgatc
tttgctcact gcaagctccg 1620cctcccgggt ccatgccatt ctcctgcctc agcctcccga
gtagctggga ctacaggcgc 1680ccgccaccac gccctgctaa tttttttttt gtatttttag
tagagacagg gtttcaccat 1740gttagccagg atggtctcga tttcctgacc tcgtgatcca
cccgcctcgg cctcccaaag 1800tgctgggatt ataggcgtga accaccgcgc ccagctttaa
tttttttttt tttttttttt 1860ttgagacaga gtcttgctct gtcgcccagg ctgaagtgca
gtggcgcgat ctcggcttgc 1920tgcaagctcc gcctcccagg ttcacgccat tctcctgcct
cagcctcctg agtagctggg 1980attacaggca cccgtcacca tgcccagcta attacgggac
ctcgctctgt cgcccgggct 2040ggagtgcagt ggcacagtct cgctcactgc aatctggcaa
gtgattctct tgcctcagcc 2100tccagagtag ctgggactac aggtgtgcgc cgctacgccc
agctaatttt tgtattttta 2160gtagagatag ggtttcgcca tgttggttgg ccaggatggt
ctcgatctct tgacctcgtg 2220atccgccctt ctcggcctcc caaagtgctg ggattaccgg
tgtgagccat cgcacctggc 2280cttcctactt tattaagata cctaagggat ttctgtgatt
gttaggattc aaatttctgt 2340gagcataaga atcaagctgt gtgcataata attgcatggg
atttcacagc tgggccccat 2400tcccagggat tttgtattat ctacctccaa gtgattttga
tgctggtgat ccttggacca 2460gacttggtga agctcaatgc ttagctagga aagccccaaa
aatttgcttt attggattgt 2520gtaatttgac tacatccatt gtttcttttt tcaaatgtag
agttatatgc cacaaaaata 2580ttttccgtag cagtaggcat cctaattaat ctcgatgttt
gtttatagcc ccattgatgg 2640ggctataaac ttggcagcaa attgttttcc cactaatttg
gcattttcca taaatgtttg 2700tttatagccc cattgatggg gctataaact tggcagcaaa
ttgttttccc actaatttgg 2760cattttccat aaaaaacacg tatctgttgt tagctgccta
gacgttagct ggacatggtt 2820taggttactt ttctcttaaa aagtaaattt taattcaagt
tcctttaagc cagcagtctc 2880aacctggggc agtttttccc tccaggggac attcagcagt
gtctagagac atttttggtt 2940gtcatgctga ggaagagagt gtatagtggg tagaatccag
ggatgctgtt aagcatggaa 3000cagcccctta caacaaaaaa ttatgtagcc taaaatggca
gtgttgccaa gattgagaaa 3060ttatgcttta aatgtgtttt tatatatggc cattttgtgt
ttactctgga gataacatgc 3120ttttcctc
31283603145DNAHomo sapiens 360tccgtagcag taggcatcct
aattaatctc gatgtttgtt tatagcccca ttgatggggc 60tataaacttg gcagcaaatt
gttttcccac taatttggca ttttccataa atgtttgttt 120atagccccat tgatggggct
ataaacttgg cagcaaattg ttttcccact aatttggcat 180tttccataaa aaacacgtat
ctgttgttag ctgcctagac gttagctgga catggtttag 240gttacttttc tcttaaaaag
taaattttaa ttcaagttcc tttaagccag cagtctcaac 300ctggggcagt ttttccctcc
aggggacatt cagcagtgtc tagagacatt tttggttgtc 360atgctgagga agagagtgta
tagtgggtag aatccaggga tgctgttaag catggaacag 420ccccttacaa caaaaaatta
tgtagcctaa aatggcagtg ttgccaagat tgagaaatta 480tgctttaaat gtgtttttat
atatggccat tttgtgttta ctctggagat aacatgcttt 540tcctcatata acatgcttga
taaacatttt ggtaacacag gaattgtaaa tgctggtgat 600gtcagtaaat agttaagaaa
tttagggctg tgcgcggtgg ctcacgcctg taatcccagc 660actttgggag gccgaggcgg
gtggatcccg aggtcaggag atcgagacca tcctggctaa 720catggtgaaa ccccgtctct
actaaaaata caaaaaaatg agccgggtgt ggtggcaggc 780acctgtagtc ccagctactc
aatttagaaa gcagatttgt ttcctttcta tacctgtgta 840atttgaggtt tagtttactg
tcacatcgtt tataaacata aggaagatcg ttgctcatct 900gatagcattc cgaaccttga
gtcatctgta atgcctatgg cctccagaaa agcttctcta 960atactgtact tagagatgtg
taaaatatgt aggaacattt tcccaccttc gattgttagt 1020ttacctttca gcttcagtaa
tttacctttc agctattact ttagtaacat cttcaacatt 1080gtttttcaaa ctgcaaggtg
tgacccagta gtgggtcgtt aaattagtag gtgacagagc 1140atttttgaag aattaaatac
aatagaacat agcagagtgg gctcacgcct gtaatcccag 1200cactttggga ggcgaggctg
gcaggtcaca aggtcggcag gtcacaaggt cagaagatcg 1260agaccttcct ggctctaaca
tggtgaaacc ccgtctctac taatagtaca aaaaattagc 1320ggggtgtggt ggcatgcgtc
tctagtccca gctactcagg aggctgaggc acgagaatca 1380cttgaatccg ggagctggag
gttgcagtga gccgagattg caccactgca ctccagcctc 1440agcaacagag caagactatt
tcaaaaaaaa aaaaaaaaaa aaagaaagaa agaaaaaaag 1500aaaatagagt gtatcacata
attagagtag caagtattga tttgtgaaac ctattttaat 1560catagatcta tgtatgtatg
tgctggattg tgatgtaaag acatttcttg ctgtggttac 1620actgaaaaaa atgaaaagtc
actgatttcc aataacttac agaagcagta tgaactacat 1680attctgtcgt tcttgaaaca
agctgagatt ttattgactt tgggaagcag tagaattatt 1740ttagtttttt aattaacagt
ttttggcttt gtactgtcaa gaggtaattt tagaaagcat 1800tctaaaaatg taagtactgg
atttggcaac attcttgaac tgtaattctg tttcgttaaa 1860catcactatt tacatgtgca
acagcgtgtc tgtaacaatg tcccagtaat gaaattcttt 1920cttctattta aggcatgtct
gtttgataaa agtcaaacaa aattgggtat atgtcagtgt 1980cttatgatac tgcttaatta
aacattaatt tgactcttag ctaatcagga aatgtttgcc 2040tcacagtctt acagagcttt
ccaccttcta aaaaagctaa cgtttcagaa tagattcagg 2100attcaacctt ctttctgtct
tttttttttt ttgtttgaga cagagtcttg ctctgttgcc 2160caggctggag tacagtggcg
ctatctcggc tcactgcaac ctccgcctcc tgggttcaag 2220caattctcct gcctcagcct
cccgagtagc cggggttaca ggcgtgcgcc accatgccca 2280gctaattttt ttgtattttt
agtagagaca gggtttcacc atgctgggtg gccaggcggg 2340tctcaaactt ctgaccttga
gatctgccca ccgtggcttc ccaaaatgct gggattatag 2400gcgtgagcca ccgcacctag
cctagattca ggctgcttct tttttttttt ttttttgaga 2460cagagtcttg ctcttgttgc
ccaggctgga gtgccatggc atgatctcag tgcaccacaa 2520tctctgcttc ccaggtttaa
gcgattctcc tgcctcagcc tcccaagtag atgggatcac 2580aggcatgagc caccatgcct
ggctaatttt gtattttttg tacagacggg gtttctccat 2640gttggtcagg ccagtctcga
actccctacc tcaggtgatc tgcctgcctc ggcctctcaa 2700agtgctggga ttacaggtgt
gagccactgc gcccagcaga ttcaagcttt ttaaatggaa 2760ttttgagctg atttagttga
gacttacgtg cttagttgat aaattttaat tttatactaa 2820aatattttac attaattcaa
gttaatttat ttcagattga atttagtgga agcttttgta 2880gaagatgcag aattgaggca
gactttacaa gaagatttac ttcgtcgatt cccagatctt 2940aaccgacttg ccaagaagtt
tcaaagacaa gcagcaaact tacaagattg ttaccgactc 3000tatcagggta taaatcaact
acctaatgtt atacaggctc tggaaaaaca tgaaggtaac 3060aagtgatttt gtttttttgt
tttccttcaa ctcatacaat atatacttgg caatgtgctg 3120tcctcataaa gttggtggtg
gtgac 31453613080DNAHomo sapiens
361cccagatctt aaccgacttg ccaagaagtt tcaaagacaa gcagcaaact tacaagattg
60ttaccgactc tatcagggta taaatcaact acctaatgtt atacaggctc tggaaaaaca
120tgaaggtaac aagtgatttt gtttttttgt tttccttcaa ctcatacaat atatacttgg
180caatgtgctg tcctcataaa gttggtggtg gtgactcact cttaggacac attcagattt
240cttttttttt tttttttgag aaggagtctt gctccgttgc caaggctaga gtgcagtggc
300acaatctcag ctcactgcaa cctctgcctc ctgggttcaa gcgattctcc tgcctcagct
360tcctgagtgg ctgggattac aggcatgtgc caccatgccc ggctaatttt tgtactttta
420gttttaccat gttggccagg ttcgtctgga actcccaatc tcaggtgacc cacctgcctc
480ggcctcccaa agtgctggga gtacaggcgt gagccacaga gcctggccat gttcagactt
540ctaataacag gtttgtattg actcttagcc tcatggcaga agccaagaga catgagacag
600cttagaaatt tttgcttttt ggaaatgaat gttagagtta ctggtttgtg attaaggcct
660attgcactga cagaggcagt gaaaaagggt ttgattgcca aggaagattc acagggccta
720gaatggcagt ggttatgcat ctacagttta ttacaggaga aggatacaat ccagtagcag
780gattatggta aggatatgca tcacagtcaa aggctgtcat agcaagtcat ccagagagtt
840cgggtgcaag ttccagtttt cctttgttgt gtaaagtctg tggtggggtg cattttctct
900ctcagagcag gatgtgtgca caggacacct tggaacctag gagcccaaaa tagagtcttc
960actggacttt ttaatatttt tcttgtcaag cggacatgtt cctgttctct aactagcctc
1020ttcagtggag gtcagaggaa gagcctcatt gagaccaagt gcaactcatc aatcacatga
1080aacaatgctg ataaataaac cacctaaata tcccctgacc cacaaataca aaacaacacc
1140attcaatcag tatttttcat gccttgatca ggggtcattg ccatgcagga actttaacaa
1200aacagtacag gctaataata gaattgttgg aattaactca cacagcacac ctatgagaga
1260gagttaagat agagggtctt ggtggtctct aacagttgaa ttcaaagtga agttaccaga
1320gtaaagtgag caaagacaca tattagtaca atattggtag ataaaatcac gttgctctaa
1380taagcatagt tttaaacttt aaccatgttt ctccagtaat tttagtaatt atattgttgt
1440tatgtctaat acataaagca ttttttactt ttttaaaaaa tttttaggca atgtggggtc
1500caaagtaatt aaaaaaaaat ttttttaaca taaagcatct taaaatttta cttaatcatg
1560atcacttaga accattaaaa catacgtttt gatattatgg ggaagcttcg ttgttccttt
1620gtagacagac ttaaagaaat acaactttat gatgacaaga tataagataa ttatagattt
1680aaattttata gaaacctttt cccttatcta gtgcaagagg tagctaagtg cttattttct
1740caaagtactg tgttataaaa agtattccta gtgtagtcaa agcttctctt tagactgata
1800aaacttagag cacctgcatt tacttccaac aaagcagaat taaagaaaat gagacttggc
1860cgggtacgtt tgtaatccca gcactttggg aggccgaggc aggtggatca tgaggttagg
1920agatcaagac cattctggct aacatggtga aaccctgtct ctaccaaaaa tacaaaaaat
1980tagctgacat ggtggtgcgc acctgtagtc ccagcttctc aggtggctga ggcaggagaa
2040tcgcttgaac ccaggaggtg gaggttgcag tgagctgaga tcacaccact gcgctccagc
2100ttgggcaaca aaaaaaaaaa aaaaaaaaag aaaaagaaaa tgagtcttta ctggctgggc
2160acagtggctc acacctgtaa tcccagcact ttgggagacc gagacgggca gatcacctga
2220ggtcgggcat tcgagaccag cctgaccaat atggagaaac cccatttgta ctaaaaatac
2280aaaattagcg gggcgtggtg gcgcatgcct gtaatcccag ctattcggga ggctgaggca
2340ggagaattgc ctgaacccgg gaggcggagg ttgcggtgag cagagatcgt gccgttgcac
2400tccattctgg gcaacaagag cgaaactctc catctcaaaa aaaagaaaat gagtctatac
2460tttgctgttt tcatactctc ttagtgtggt gtaggcagcc atgtatcccc cttgtgcctc
2520tatttctcca ttctgtgaat gagtgtcttc cactgctgtg cttttctgat tccgtaacct
2580ttgtttgttt gtttgtttgt ttgtttgttt gttttttatt gatcattctt gggtgtttct
2640cgcagagggg gatttggcag ggtcacagga caatagtgga gggaaggtca gcagataaac
2700aagtgaacaa aggtctctgg ttttcctagg cagaggaccc tgcggccttc cgcagtgttt
2760gtgtccctgg gtacttgaga ttagggagtg gtgatgactc ttaatgagcg tgctgccttc
2820aagcatctgt ttaacaaagc acatcttgca ccacccttaa tccgttcaac cctgagtgga
2880cacagcacat gtttcagaga gcacagggtt gggggtaagg tcacagatca acaggatccc
2940aaggcagaat aatttttcgt agtacagaac aaaatgaaaa gtctcccacg tctacctctt
3000tctacacaga cacggcaacc atccgatttc tcaatctttt ccccaccttt cccccctttc
3060tattccacaa aaccgccatt
30803623029DNAHomo sapiens 362ccaaacaaca gcattagcca actctttgaa gccttagatc
tgtggctctt gttttctcct 60ttgaggtgta ggtccttgag ggcatttgct tctaatagag
gctagtttca tcagaattaa 120aaatctgaac catggtatga aattcaattc tttttttttt
ttcttttttg aaaacactgg 180caaatgtttt gtatccttga gctttcccac atatcttaac
atagtgagtg gaaagtacag 240tggctgttaa gccaactact ctgaggtctt cactgctaag
gcttactctt aattgtgtga 300gagcttaacc ttgatccctt taaaacatta atgggctaga
aaaaaaacca ttcataaacc 360agtgccacct ctgaattttg ctaccacaat tcccttattt
accaatagtg catgagctaa 420tttggaataa agaactaggc attgtagcac aacagacatt
atgtgggcaa agtgttgttt 480atattctgtc taaatagtgc ttcacatgta tgtactattt
tctaaatatg tatagatgct 540tttgtgatta ataataaaac atgaattctt aaaacaattt
tgctgacttc atagtagctt 600ttcaccgttt tttcagtagc tgctaaaatt tctggagaag
tttgggaact attgttttgg 660agtgaaatgc agtgtgttag atatcacttg cagaattctt
ctaagggtat ttattggcga 720ttagaaaaaa aatccttgtg ttataccagt agtaatacaa
agtaattgtt cagcttctgt 780taagtgtaaa ggactataca agtattgtgt atagttatct
catttattat tttctgggta 840gctattgtta ttattacttc gtacaaaaag ggaaaaggag
gctcaaagta tcatgctcca 900gataacagag ccagtaggta gcagagctgg gattgctacc
caggtctcta gtcctgcttt 960ttcacactat atactcattg cttcacttac tccttcatac
atgattcccc agcatgtact 1020cttttttttt tttttttttt tttgtttgag atagaatctc
gctctctgtt gcccaggctg 1080gcaggcagta gtgtgatctt gggctaactg caacctccat
ctcctgcatt caagcagttc 1140tcctgcttca acctcctgag tagctgagat tataagccta
tgctaccacg cctggctaat 1200ttttgtattt ttagcagaga tgaggtttcg ccttgttggc
caggctggtc tcaaactcct 1260gaactcaagt gatctgccca cctcagcctc cgaaagtgct
gggattatag gcatgagcca 1320tcatgtccgg cctccccatc atgtaccctt aaataccatc
aagcacagtt ccattgtgta 1380aaaacttggc ttgatttaac ctgttaattg gaacactgtc
attaatggaa attaggaata 1440tgaggtaagc tagaggtttt attttaatga ctttgggtta
ttaaatctat aagaaatgaa 1500attcatttag tcataattaa tgtcatgttt ctgcatctat
attacttgtt gggtttacag 1560acgaggtagt gtattattag tgggaagctt tgagtgctac
atcatctccc tttctataaa 1620ataaattgag tacgaaacaa tttgaattaa aacacctgag
taaatagtaa ctttggagac 1680ctgctgtact atttgtacct tttggatcaa atgatgcttg
tttatctcag tcaaaatttt 1740atgatttgta ttctgtaaaa tgagatcttt ttatttgttt
gttttactac tttcttttag 1800gaaaacacca gaaattattg ttggcagttt ttgtgactcc
tcttactgat cttcgttctg 1860acttctccaa gtttcaggaa atgatagaaa caactttaga
tatggatcag gtatgcaata 1920tactttttaa tttaagcagt agttattttt aaaaagcaaa
ggccacttta agaaagtttg 1980tagatttttc tttttagtat ctaattgtag cacctttgtg
gacagtggat gtaatattaa 2040gtgacagatg ggaaaaggat ttttaaaaaa atagcaactg
tttcagtgga tgaaataaag 2100attattagca gagaaaatga atattgggca taactgtcct
ggtgaaagac aatctcataa 2160atgaacaatt tcataatttc gtaaatgcaa ctgcatttta
ttttcaaaga gaaggaaaat 2220tatagtcact ggaaacggaa agagaagtta gaggtaaaca
taggacacac aagaaaactt 2280tcattttgtt tattttcttg tttttctttt gagacagggt
ttccctctgt tacccaggct 2340taagtgcagt gacactatca tagttcacta acccctcaaa
ttcctgggtt caagtaatcc 2400tcctgcctta gccttagtag gtgtaaatac aggtgtgtac
caccatgcct ggcgaatttt 2460aaaaaaactt ttttatagag atgagctctc gccgtgttgc
ccaagctggt cctaaaacgc 2520tggcctcaag ctatcctccg gcctcagtct tagcctccca
aaatgctggg gtttcagtag 2580aagccaccat gccgggccac ttctgtttct tttccatgta
gagttctttg caggaggagg 2640ttagaatagg tgtgcatctc ctaaatagtt gtcgaatata
actaaaaagt taaccaggac 2700tctaaatact atttacttct aaaatttgtt aattgggaac
atttagggtt taactgatct 2760atatcttatg tctttaacaa ttttgaatga taattatatg
taaagtaaga acagtttgtg 2820aaatagttga aaatatcctt acatgaaagt gaattttaaa
gcacagttta tgtaatgtta 2880atgttttgtt ttgtatctgt taaaaatttg tttatatgaa
caagtttaca ggtttactgt 2940ggtgagcccg ttgaatatag tgggtttttt ttgtttgttt
tgtttttgtt tttgagatga 3000agtctcactc ttgtcccgag gctgatgtg
30293631685DNAHomo sapiens 363gagcccgttg aatatagtgg
gttttttttg tttgttttgt ttttgttttt gagatgaagt 60ctcactcttg tcccgaggct
gatgtgcaat ggcgcgatct tggctcactg caacctctgc 120ctcctgggtt caagcgattc
tcctgcctta gcctcccgag tagctgggat tataggcacc 180tgtcaccaaa cccggctaag
ttttgtattt ttggtagaga tgggatctca gcatgttggc 240caggctggac tcaggtgatc
cgtctgcctc ggcctcccaa gtgctgggat tacaggtgtg 300agccaccatg ccgagcctga
atatagtgtt tttaagttgc aggactttaa aaataatatt 360ttgaaatttt tctaagttaa
attccctgtt aaaatggtca tgcaggaata tacgcttgca 420ttattcatat tagggtaact
gtttggtttg ctagttgtta gattctttgc attccttttt 480tttttttttt tttttttttt
ttttgagacg gagtttcact ctttttgaca aggctggagt 540gcaatggcgc tatctcggct
cacctcaacc tccgcctcct gggttcaagc gattctcctg 600cctcagcctc ccaagtagct
ggaattacag gaatacgcca ccaagcccgg ctaattttgt 660atttttagta gagatggggt
ttctccatgt tggtcaggct ggtctcaaac tcccagtctc 720aggtgatcag cccacctcgg
cctcccaaag tgctgggatt acaggagtaa tcccccaccc 780ttttaaaaaa atgagacaga
gttttattct gtcacccagg gtggagtgca gtggtgcgat 840catggttcac cgcagccttg
aatctgggct caagtgatcc tcccacttca gcctcccaag 900tagttggaac catagatgtg
catcaccaca cctggctgat ttttaaatta tttgtagaga 960tgaggtcttg cttgttgtct
aggctggtct taaacttctg ggcttcagca gtcctcctgc 1020ctcagcctcc cagagtgctg
agatgataga catgggccac tgcccctggc cgcatttttc 1080ttttcttttc ctttcttttt
tttttttttt ttttgaaacg gagttttgcc attgtcgccc 1140aggctggagt gcagtggcac
gatctctgct cactgcaacc tctgcctccc gagttcaagc 1200cattcttctg cctcagcctt
ccagttatct gggattacag tcatgtgcca ccacgcccag 1260ctaatttttg tatttttagt
agaaacaggg tttctctatg ttggtcaggc ttgtcccaaa 1320ctcctgacct cagatgatcc
acctgcgtct gcctcccaaa gtgctgggat tataggcgtg 1380agccaccatg cccggcccta
actgcatttt tcttagtatt tgtggtttga gttaatactt 1440gccctatgtg atgttgattt
attattactg gatcattaag tgaggtttaa agaagctaaa 1500tgccatttgc tctatgccct
ctggatttta aaagtgcatg ggtgtgcacg tgtgtaggta 1560taaatgtttc catattctag
tatattctgt gtcagtgata gagcagtctt agagctgtct 1620tttccattta cttgtaggtt
aagaagccaa aaaaagttgt gtcatcatcc cgtttaggaa 1680aactt
16853642998DNAHomo sapiens
364tgggtgtgca cgtgtgtagg tataaatgtt tccatattct agtatattct gtgtcagtga
60tagagcagtc ttagagctgt cttttccatt tacttgtagg ttaagaagcc aaaaaaagtt
120gtgtcatcat cccgtttagg aaaacttaca ttttggctat tgtttcctct agtgctgcta
180ttagtggaat gattttaggt gttcaacttt cagatcaatg ggagacagaa atattgttct
240gagacatctg gaagccgaat gtgttttatt cctgcctgtc tgaggatgtg gtcttgcctt
300tgatagggca aagttatttg taaacattgc tttaaataaa aacatgtaaa ggtgtttttg
360atggttaaca aaaactatga gtataataga gcctagtccc tattacggac tggtattgat
420ctggtgtggg aagagtattg agcttttcag tgtcacctac ctgtattccc ttgaagggac
480ccagagccca ggcaaagctc tgctgaggtc gggcgtggtg gttcacgcct gtaatcctag
540cattttagga gaccaaggcg ggtggatcac ctgaggtcag gagttcaaga ccagcctagc
600caacatggtg aaaccctgtc tctactaaaa atacaaaaat tagctgggtg tggtggtgca
660tgcctgtaat cccagctatc tgggaggctg aggcaagaga attgcttgaa cccaggagac
720ggaggttgca atgagccgag atcatgccac tgcactctag gtgggtccct gagtgagact
780ccatctcaaa aaaaaaaaca aacaaaaaaa aaaaaaaaaa aaaacctctg ctgaaatgct
840acagttaatt ttgccatttg tggtcagcat tcttcttcta aattgctata atcttgcctt
900catattatgt gtctcaaatt taagcaggta tcagaatgtc cacgggaaca aattgccatg
960gctctaagcc cagaatcaga ttcttcagat ctggagtagg gctggggaat ttgcatttct
1020aacacacaag tttgttgatg ctgtttgtct ggggtccacg cttgcctaac ttctgatgtg
1080atttatttct gccagtttct ttttttgttg ttgttttatt tttttgagat ggagtctcgc
1140tctgtcactc aggctagggt gcagtggcat gatcttggct cactgcaacc cctgcctcct
1200gggttcaagc gattctcctg cctcagcctc ctgagtagct ggggttatag gcacactgca
1260ccacacccag ctaatttttg tatttttcgt agagacaggg tttcaccatg ttggccaggc
1320tggtcttgaa ctcctgacct caggtgatcc attggcctcg gcctcccaaa gtgctcggat
1380tacaggtgtg agccacccac catgcctggc ccttcctacc aatttctatc ctccctgaaa
1440tgctgcacac ttaggcagtc actggacaat atctgcccca aaattggttt gtataattga
1500gaatatttaa gaggttgtta aaatttgaac cactttctat tcttctatta agtgtacaca
1560tctattaaag atccccttgt agctcttttt atctgggcca tcacatttct gcccagcaga
1620tgcagaggcc ctgtcctctc ttccacctcc ccactacctc tccttcccta cttttggact
1680gtaaaagctg tctttctgca gttaattgtt ttattctttg taggttctac tcgttgataa
1740tgttatctac tgctataata attacagacg gcaacaggat gatcaaatct tggatatttt
1800aaatttacat tatgcctttt ttattttatt tttttaaagt ctctgcttga cagcaaataa
1860gcctaacgtt ccctaacaaa tgatgatgtc ccattaatga tttgatgact tcctgtttgt
1920agtttttatt tagagtgctt gtgggtagtt tttcataacg acatttaaaa atcaggatat
1980aaataatttt ttaagttttt tttttaggcg gggcacagtg gctcacacct gtaattccag
2040cattttggga ggctgaggtg ggcagatctt gtgaggtcag gagttcaaca ccagcctggc
2100caacagggcg acaccccatt tctactaaaa atacaaaaat taggccgggt gcggtggctc
2160acacctgtaa tcccagcact ttgggaggcc gaggcaggca gatcacaagg tcaggagatc
2220gagaccatcc tggctaacac ggtgaaaccc catctctact aaaaatgcaa aaaattagcc
2280gggcatggtg gcaggcgcct atagtcccag ctactcggaa ggctgaggca ggagaatggc
2340ttgaacccag gaggtggagc ttgcagtgag ccgagatggc gctgctgcac tccaacctgg
2400gcgagagtgc gagactctgt ctcaaaaaaa taaacaaata aaaaataaaa aaattaacca
2460ggcatggtgg cgcatacctg tagtcccagc tacttgggag gctgggacag tagaatcgct
2520tgaactcggg aggtggaggt tgcagtgagc tgagatcacc cactgaactc cagcctgggc
2580aacagagcaa gactctgtct ccaaaaaaaa aaaatgtatt tttctttgaa gcttttctac
2640ttttaaatgt aatgtatagt attataacaa gtgaacaaaa tgatacaaag aagtatggcg
2700ggaaaggtgt ggtagagatg ggaaaacata tttcctccag cctcttaggt tcattggagg
2760agcttgggaa ttcaactgac acacgacaga tttacaggag aaaagtttta tttcaagtac
2820acatgagagc ttcatagaaa agaagtgaag acctaaagaa acagactgga gagttcatat
2880gccattttaa taaaggataa tgtattagtc tgttctcatg ctgctaataa atacataccc
2940aagactgggt aatttataaa gaaaaagagg tttaatcgac tcacaattgc acatggct
29983652076DNAHomo sapiens 365cttgcatagt ttgcttctgg tatgttaaag tgtgctctct
ctaagtgggt agtaattagg 60aacaatttat ctcaacctca tttattgaat gttttaaatc
aagagaacgg actctgttat 120attaagcttc tatatataat tgtctgtttc actgtaatgc
ctagtaagga tacacttcat 180tcttttttta gatgttcttt cacaatttca tgtaaatttt
agttgttttg tttcaaaaaa 240caattcctat tgaacaatct ctaggaatag atagcttaat
aataatatta gatctagttt 300tctcttttca tagtttacct cttctttctt ttctttcttt
tttttttttt tttgagacgg 360agtctcgctg tgtcgcccag gctggagtgc agtggtgcga
tctctgctca cagcaagctc 420cgcctcccag gttcgcgcca ttctcctgcc tcagcctccc
aactagctgg gactacaggt 480gccccccacc actcctggct aatttttttt tttttttttt
tttttttttt tttgtatttt 540tagtagagac agggtttcat tgtgttagcc aggatggtct
caatctcctg acctcgtgat 600ccgcccacct cggcctccca aagtggatta caggcgtgag
ccaccgcgcc cagcctctgt 660ctctcttttc tttttctttt tcttttcttt tcttttcttt
tcttttcttt tcttttcttt 720tcttttcttt tctttccttt cctttccttt cctttccttt
cctttccttt ccttttcttt 780tctttctttt cttttcttct ctcttctctt ctcttctctt
ctcttctctg tctttttttg 840acgagtctca gtatgtcacc taggctggag tacagttgca
caatgttggc tcattgcaac 900ctctgcctcc cttgttcaag tgattgtcct gcctcagcct
gccaaatagc tgggactaca 960ggtgcgcact gctacgcccg gctaattttg tatttttagt
agagatgggg tttcaccatg 1020ttggccaagc cggtctcaaa ctcctgacct caagagatcc
acctgcctcg gcctcccaaa 1080gtgctgggat tacaagtatg agccacgatg ccagtccaat
tcttgtgtag ttttttaatc 1140agctgaattt aacattcaaa ttcttctttt aaatcttcca
ataggcagtt atctttataa 1200agatcctata taatcaagac tttgtttctg aatattttat
gtatgttttt gctactgtaa 1260atgagatcta tttctcattg tggtttcttg ctgttattac
tggtaagaat ttagtgaaac 1320aaagtactta agagtatgtc tttaaattgt gagattttga
tgaactttta agaaataaaa 1380ttctttagtt tcttagagct ttttgagatt tctaaggtag
atccttggtt tgggcaacat 1440ataactatta caagttttgc acattgaacg ttatttggta
atttttagag aggacatttt 1500aaatgtttag gaaaaatata aataaaatgt agaatactat
tgggggcata tacatcatca 1560gcactgtaac tgtttcatat gaatcatttt tgtacatata
gaactctaaa gtcctaatga 1620acagaatttt acatttctat aaatagaaag tccttaatag
ttgtgactga ataacttatg 1680gatagcaaat tatttaactg aaaacagtaa aatttaagtg
ggaggaaata tttgctttat 1740aatttctgtc tttacccatt atttatagga ttttgtcact
ttgttctgtt tgcaggtgga 1800aaaccatgaa ttccttgtaa aaccttcatt tgatcctaat
ctcagtgaat taagagaaat 1860aatgaatgac ttggaaaaga agatgcagtc aacattaata
agtgcagcca gagatcttgg 1920taagaatggg tcattggagg ttggaataat tcttttgtct
atacactgta tagacaaaat 1980attgatgcca gaattatttt ataagttccc tgtccccaag
atgatgactt cacatctctg 2040tcaaacagaa atcgcccaac aggcccttgt atgatg
20763661960DNAHomo sapiens 366aacagaaatc gcccaacagg
cccttgtatg atgtcattta aacaagccct attttaaatg 60tcacctccac tggtaacagg
atactcctag gaggatcacc aagcccaatt cttctaggag 120tagtgcattg attaggcttt
ggggtttcca agcagttcat taatgtcact tttggaaaaa 180gtctgtcttt cataccagct
tattaattcc ctatgggttc acacggtttt ttttcctgga 240ttttcatcaa acatgtgtaa
ggtactcagt acaaagaagt ttagaaatcc agaacaaagc 300agtgtattta agtagtagta
aacttccaga taatctgatg cccatatcta catatataaa 360aaatttgcaa atagttctgt
agagagtcca aacatggagt agatccctaa ttaagagcct 420ttgcattaaa gtccaccttc
ctcatttcat agctaaggat attgaggctc agagagttta 480tgtgtctgga gttaaagtta
ttttgtgttt ccttaatttt tgacttacta gaaagttaaa 540gtacctacag atttctgtgt
ttcactatat gttaacttgc ttggctggaa gtttttctgc 600tgataattgg ttttatgaag
gaagaatcct gttaagaatg catcattgga ctgggtgtgg 660tggctcacgc ctgtagtgat
cctagcagtt tgagagaccg aggtgggcag attgcttgag 720tccaggagtt tgacactaac
ctgggcaaca tgatgaaacc ctgtctctac aacaaataca 780aaaattggcc atacatggtg
gcacgcacct gtggtcccag ctactcagga ggctgaggtg 840agaggatcac ttgagccagg
gaggttgagg ctataatgag ccataattgc actactgcac 900tccagcctgg gtgacagggt
gagatcctgt ctcaaaataa gaaaagagaa tgcatcattg 960gccaggcaca gtgactcatg
cctataatcc caatacttta ggaggatcac ttcagcccag 1020gagttcaaga ctagcctgtg
ccacatagac cacatttcta ccaaaaatca aaaggaaaaa 1080acttgctggg tgtggtgatg
cacacctgtg gtcccagcta ctcgggaggc tgaggtgaga 1140ggattgcttt agcttaggtg
gttgaggctg cagtgagcca tgatagcacc actgcattcc 1200atccagcctg agggacggag
tgagagcgac accttgtctt taaaaaaaaa aacagaggaa 1260tgcatcatag tatatattaa
attattgcct atttttttat ctattttatt gagtgctaat 1320aagaaaatta atggcaaaaa
cttgtttttt acagtataaa ttaagtttaa tttcatttta 1380aaattaagta aatttgtttt
attaaaaagt atgttgaaag caacataaat agcactcaaa 1440ttgagacaga aactgtaact
gtagtataag aagcattagg ctgggaattg ggaaacacga 1500gttctagttg cagcttggaa
actttttctg aagctcttta caaattactt aatttctctg 1560gttttcacca cattgttcta
tagcattaac atgttggatt cattgcttta attcttagac 1620ctacgtgtca tcagaaatgc
cattacactt tgaggatttg agccttattt taaataaagt 1680tgtgatcctc atggcagcct
aggtttacat gtgttaaata aacagtattc tgtaaatacc 1740attgtctttc atgtttagtg
atgttgctgt tgttaacact gcagtgaaat gcatatataa 1800gcaaactaca ttacatactc
atgaacatgg tcctttgttt tgaaactttg atcactgatt 1860gttcgcagtc tttcattgtg
gaactactct ttcactttga atgttttgag aggttccttt 1920gttcagatca gtccgatttc
gtttctgggt gggtctctac 19603673160DNAHomo sapiens
367agtccgattt cgtttctggg tgggtctcta ctttcccttt tctcactggt caagcgaggt
60ctgtctaatt gtttgctact actaacattt gatggccacg cttcagcaag tacatttgta
120gattctctct ctctgtctct cttaatttgt ggtctagaga tcatattggt taatgaaatt
180atgaagaggg aatgtattta taaaaactca aattcttgat gcagaaggtc tagctgattg
240tgaacccaaa atatccgaga caggtcacaa ccaatttaga aactttattt tgccaaggtt
300aaggatgcat ccatgacata gtctcacaag gttctaatga cacatgcgca aggtggttag
360ggtacagctt ggttttatac attttaggga gacatgagac atcagtcaac atgtgtaaga
420tgtacattga ttctatccag aaaggcagga caacttgaag caaggggctt tcaggtaata
480agtagataag agacaaaagg ttgcatactt ttgagtcctt gatcagcctt tcactgaata
540aacaagctta gtcttgttag tgaatctgcg tttttacata aacagtaggt cagaggaagc
600aatcagaaat gcatttgtgt caggtgagcc gagggatgac tttctgtccc tcacctgtga
660agataagcta tcagtttcca ttgctagggt gaaattcaac agaattgttt gagagtgaac
720atctggaggc ccacaaggac tttccttgtg gaggggaagt atgtagtgag ggaagtatgt
780agtttttaaa tctttgtcgc tatcttattt agaaataaga tggaaggcag gtttgtctga
840catagttccc agcttgactt ttccctcggc ttagtgattt tgcggttccg agatttattt
900tcctttcaca tatcagtcag atcatttggt ttgtgaagtt tcctatgctt aacagaaaat
960atgtgcacta gttttcctag agtttcattg tcagagtctc aagtttttgt ttggaaattg
1020tatttggtca cattaattat actctatgtt agttccaaag aaataccttt ggttaagaaa
1080agaattctca tgcataactc ctcgagggtg gggttacacc ttaatccatc ctcaggtgct
1140catggtaatt ggggcaaata tgttgcccag tgctggtgct ctgcagcctt ggatgggttt
1200acccagaaag cagctttcaa gtcagaaact aacattcata agggagttaa ggattttata
1260aatagatatc cataattcat gtagttttca agtaagtagt atttgaatct tttctggtta
1320gataataatt gtgagtatgt tgtcatataa taacagtatg tttttcacta tttaaataat
1380tttagaatta cattgaaaaa tggtagtagg tatttatgga atactttttc ttttcttctt
1440gattatcaag gcttggaccc tggcaaacag attaaactgg attccagtgc acagtttgga
1500tattactttc gtgtaacctg taaggaagaa aaagtccttc gtaacaataa aaactttagt
1560actgtagata tccagaagaa tggtgttaaa tttaccaaca ggtttgcaag tcgttattat
1620atttttaacc ctttattaat tccctaaatg ctctaacatg atgtgaatgt tctatgataa
1680gttttactaa tgtagtcatc aggtaagagt caagctttct tccatagagc agtcagctgt
1740cgcaacacca tttgttaaat agtccgtctg ttctccattg actgaagtgg tactttgggt
1800ctattttaaa gactctactt ttacctcgtc tcaccattct tttgtctaca caaaatatat
1860tttatcgctt attctgtgtt accatatcta ttagagctag ttccccctca tatctctgct
1920ttagttattt tcacatgttt cttttatctt tttttttttt ggagatggag tctcgctctg
1980ttgcccaggc tggagtgcag tggcatgatc tcggctcact gcaagctccg ccttccgggt
2040tcacgccatt ctcctgcctc agcctcccga gtagctggga ctacaggcgc ccgccactgc
2100gcccagctaa ttttttgtat ttttagtaga gacggggttt caccgtggtc tcgatctcct
2160gacctcgtga tccgcctgcc tctgcctccc aaagtactgg gattacaggt gtgagccacc
2220gcgcccagcc ttatcttttt tttttttccc cctgagacag agtcttgctg tgtcgcccag
2280gctggagtgc agtgacgcgc agtcttgact cactgcagcc tccacctccc ggattcaagc
2340gattctcatg cttcagcttc ctgagtagct aggattatag gcatgcacca ccacgcctag
2400ttcatttttg tatttttagt agagatgggt tttcaccatg ttggacaggc tggtctcgga
2460ctcctggcct caagtgatcc acctgcctca gcttcccaaa gtgctgagat tacaggtgtg
2520agccaccgtg cctgacccac atgtttattt tttctaagaa aactttacta tcatttatca
2580agttaagaaa attattctga tatttcaatt gggtgtttaa attagttgag ggaaatatga
2640ggccattcac tagatgatag gttttttttg ttttaatcat gtttcatgtt gaaacaaaaa
2700agttttttcc tgccagtttt ctggctaatc tcaggaagtc cctgaaacaa attattgata
2760agtaaaaaaa attatttaaa aaattttaaa ttatatttaa aatcttctgt gacttatggt
2820ggggggaggc taaagccttt ctccttctgt actgttctgg aaactatggc ctgttctact
2880ccctcccctc ctgaattttc ccagaacttt acaggtagct tttatatata tgatcccctg
2940tcgtctgttt aacaagtact ttgagtgtct attatatgca gacattctag gtgttcagac
3000accctagtaa ttagtttgtt cctcataatt ctcagtaaag aagacatgta tatttctcat
3060tttataggtg aagaagctaa gactttactt ttcctcagtt agacagctag tgctggtggg
3120tgcctaaact tagatcttcc attgccaaat ctaggtgtgt
31603682985DNAHomo sapiens 368tccattgcca aatctaggtg tgttgttttt ccagcacact
agaatcctcc tggttcaaga 60aatgtatata ttttagcttg gataagatac aacttttgga
gtgttctaat catcttcaag 120tttttcgtgg attagttata acatatgaaa aaagataggg
ctgaatgggc cacatgatgc 180caaaagtgaa aaagtcactc actagattat gacctgcaga
atctggtcct tgcctgcctc 240tgcttttata ttttgcagct tgtcccttca cacagtggtc
tcacttttat aatgtctttc 300cctcatgcat ttctttaatt ctttttattt gcctgttcca
tagtagtctg tttgtgctgc 360tacttgcccc tgtactgttc ttgagctata catatacatg
tctgctgtgc cattgagtga 420ttccatcaag gccacaatta tcatcttgat gaactgattt
tctcccactg ctgataatta 480cttctctctc ctttctttct cctttacatc acctcttttt
gttcttaatt tcattccctc 540cttgatgcca gtgagtattt ttttcttatt ttattctcat
cttccttgag tattgtttat 600ttcaacctct tttttttttt ttttttttgg agaagggttt
ggctttgtcg ctcaggctgg 660agtgcagtgg cacaattttg gcccactgca acctccacct
cctgggctca agccatccca 720cctcagccac ccaagtagct gggactacag gtgttgccca
ctgctttgta tttttaatag 780acacaggatt tccccatgtt gctcaggctg gtctcgaact
cctgggctca agcagtccac 840ctgccttgcc ctcccaaagt tctgggatta caggattaca
gatgctgtgc ccggcccaac 900ctctaatttt aattttctct tcaaattgtt caataagatt
tagtttcaag acattttcct 960ggccgggcat ggtggcttac gcctataatt tcaacacttt
gggaggccga ggcaggtgga 1020tcacttgagg tcaagagttc aagaccagcc tggccagcgt
ggtgaaaccc catctctact 1080aaaaaataca aaaattagcc gggtgtggtg gtacatgcct
gtaatcgtag ctattgtgga 1140ggccgaggca tgagaatcgc ttgagcccgg gaagcagagg
ttgcagtgag ttgagatgac 1200accactgaaa tccagcccgg gcaacagagt cagactacgt
ctcaaaaaaa acaaaacaag 1260ctgggcgccg tggctcacgc ctgtaatccc agcactttgg
gaggccgagg ccggtggatc 1320acgaggtcag gagatcgaga ccatcctggc taacacggtg
gtgaaaccct acctctagta 1380aaaatataaa acattagccg ggcgtagtgg ttggtgcctg
tagtcccagc tactcaggag 1440gctgaggcag gagaatggtg tgaagccggg aggcagaggt
tgcagtgagc ctagatcgcg 1500ccactgcact ttagcctggg tgacagaaca agactccgtc
tcaaaaaaaa aaccattttt 1560cttattttga aaacttttgg tattgaaaga tatttatact
acagtaatga gaaatactgt 1620gtgtgtgtat atatgtttgt gttttttttt ttgttttttt
ctttctctct ctctcttttt 1680tttttttttg acagagtttt gctcctgttg tccaggctgg
agtgcagtgg tgctatctcg 1740actcaccaca acctctgcct cccgggttca agtgattctc
ctccctcagc ctcccgaata 1800gctgggatta caggaatgtg ccaccacacc taactttgta
tttttagtag agacgggttt 1860tccccatgtt ggtcaggctg gtcttgaact cctgacctca
ggtgatccac ctgcctcggc 1920ctcccaaagt gctgggatta caggcaccct gcctgtgttt
gtgttttaaa aggggtaata 1980gcttcagtct tttttttctt tctctgagac ggagttttag
ttttgttgcc caggctggaa 2040tgcaatggtg tgttcttggc tcaccacaac ctccatttcc
tgggttcaag cgattctcct 2100gcctcagcct cctgagaagc tgggattaca agcacgcgcc
accatgctgg gctaattttt 2160gtatttttag tagagacggg gtttctccat gttggtcagg
ctggtctcga actcctgacc 2220tcaggcaatc caccgacctc aggtgatcca cccgcctcag
cctcccaaag ttctggggtt 2280acaggcgtga gccaccacgc ccggctgtct tcaatcttaa
ataaggattc catttaaata 2340ttttgtaaaa ggacacagat cacagtttta ctcaggggaa
tataattgtt atagcaggaa 2400ttgtgccatt gcgctattcc aaacagtgta aaagaacatt
aataaattga attctaacta 2460catttgtccc taaggagttg ttcgttttcc acttgtattt
ccattttaat tatcattatt 2520tggatgtttc ataggatact ttggatatgt ttcacgtagt
acacattgct tctagtacac 2580attttaatat ttttaataaa actgttattt cgatttgcag
caaattgact tctttaaatg 2640aagagtatac caaaaataaa acagaatatg aagaagccca
ggatgccatt gttaaagaaa 2700ttgtcaatat ttcttcaggt aaacttaata gaactaataa
tgttctgaat gtcacctggc 2760ttttggtaac agaagaaaaa tcatgatatt tgaagtgtgt
tttgttattt tcgcaagcca 2820ttacattctg actatttaat atgttaggtt tcctatataa
aataaggcat ggtatgttac 2880agtaggacac ataactggaa gttactcttg cacatagaaa
caaaaaatgg cagaaaagca 2940caaaacttac tatagttgta acagggaaag gaaacactag
ggcct 29853692138DNAHomo sapiens 369aggaaacact
agggcctaca acgtactaat gtcttgggtc atctatgggc tcatgaggct 60ctaggttatg
gaagtaaata ccactgaaaa gcaaatatta attacacatg aggcaagcct 120ttttgagttc
tgtatgtcat tttgtagatt ttgagttcat tctagtggca ccatttgaga 180tcattttcat
gtaattaaag gaacacagca acctggcact gtgttattgc ccttagaatg 240gaatgaatat
atgtttagca caaggtagga agtgatgcgt taagttggaa ggctttgccg 300atcatggtgt
gtatgttgac taacctttat tgtgccttta aaaaatatac tcaagaacta 360ccttaaccaa
gtaattaaag tcaagattac cagttgtggg acaaatgaca tgtacttcct 420ggtgtgatat
agaaggaagg acacagtatc acctatatag tattcttgac cagaatattt 480aacctgattt
taaacaagaa gtaaaaattc aaataaattt agattgtggt gcattcaagg 540cctgaacttt
aataaatgtc catgtcacgg cagcaaaaaa gaaatcaaca ggtcttaaag 600agacagggca
accaaacgca gtaggcagta gttgattaga tcccaattta gaggttggag 660ttggggaata
gctatagagg acactattgg ggcgaattga gaaagtttaa tatgagacaa 720tatggtgtta
gtgtcagatt tcttgtgtga aatggtagtg ttatgattag gagaatgtcc 780ttgttctcag
gatatgcatg ctaaattatt taaggacaaa tattttttta aaaggttatg 840tgcatgagta
attctataaa ttgtgttgct attatgaatt gtcatggtaa atcaaaagga 900aacataaaac
tcaaaaggtt ttattttaat acactttatg tattgaaatg aatggaattg 960atttgtaaag
attacatttt tgcttgttgg tgtcagataa ctgtgacgta ataatctttt 1020gctgaattat
gtttcttagg ctagatttca ttttaaagaa ccctgtaaat accatttatt 1080tgaactgtgg
atcttcctta aaaaataata tttattaagc acctagcagg gtaaagtttt 1140tagattttaa
catttaaatt gaaggtttta tattagaagt caacctgaat ttaaatgaaa 1200cttcttcttg
gtctgatatt acatattatg agctattttt atttaaaaat gtaatggcgg 1260ccagacatgg
tgattcacac ctgtaatccc agcactttgg gaggctgagc tgggaggatt 1320gcttaagccc
agaagtttga gaccagccta gccaacatag ggggacccca actctacaaa 1380aaaatccaaa
aaatattagc cggctgtggt ggtacatgcc tgtagtccca gctactcagg 1440aggctgaggc
aggagaatca cttgaaccca ggaggtcgag ggtgtggtga gccataatta 1500tgctactgta
cttcagcctg ggcgacagag caagactccc atctcaaaaa gtgtaatgga 1560tcactttaat
aattttctat catacaatta agtcataaaa ggtcatgcta ttaagagcca 1620gttatgtgac
atgccaagta tagactctta attaagatgc tttggtttgc tttttattta 1680tttatttatt
tttcagatgg ggtcttacca tgttgcccag gctttagtgc agtgatgcga 1740tcatgactca
ctgcagcctc aacctcctag gttcaaggga ttctccccac ttagcctccc 1800aagtagcttg
ggactactac atgtagtagt gccaccacac ctggttaatt tttttttaat 1860tatcttttgt
ggagatgaag tctcactctg ttgcccaggc cagactcaag cagtcttcct 1920gccttggcct
ccgaaagtgt tgggattaca ggcgtgagcc accctgccca gcctagtttt 1980ctttttttta
ctataaactt attcttgtca gtatgctagc aattttacaa gttttaaagt 2040agttatagca
agtacttcac tcatgtttaa ttcttaaagg cttctattgc tatataatag 2100ggtagtctga
attcttcaaa agtgtactga ggccaggt
21383702491DNAHomo sapiens 370gggattctcc ccacttagcc tcccaagtag cttgggacta
ctacatgtag tagtgccacc 60acacctggtt aatttttttt taattatctt ttgtggagat
gaagtctcac tctgttgccc 120aggccagact caagcagtct tcctgccttg gcctccgaaa
gtgttgggat tacaggcgtg 180agccaccctg cccagcctag ttttcttttt tttactataa
acttattctt gtcagtatgc 240tagcaatttt acaagtttta aagtagttat agcaagtact
tcactcatgt ttaattctta 300aaggcttcta ttgctatata atagggtagt ctgaattctt
caaaagtgta ctgaggccag 360gtgcagtagc tcacacctat aatcccagca ttttgggagg
ccgaggcggg tggatcacct 420gaggtcagga gttcgaaact ggcctaacca acatgttgaa
accctgtctt tactaaaagt 480acaaaaatta gctgggtatg gtggcaggtg cctgtaatcc
cagctactca ggaggctgag 540gcaggagaat cgcttgaacc caggaggcgg aggttgcagt
gagccaagat cacaccattg 600cactccagcc tgggcgacag agcaagactc tgtctccaaa
aaaaaaaaaa aaaaaaaaaa 660aaaaaagtat actgaaacag aggaagataa ttaggtctgc
ttggccattg ttaagttgat 720ttttattttc aaaacatttg atcactgttg tggggaacaa
gggaataaaa aataagttaa 780atttccagcc cctagattaa actaataatt tttggttttc
ctagaattaa atgcttttat 840cttgaatgtt ctgtgaagct tttgacatga ttgatagctg
tatgatagtc tgaatgacat 900gtgggtcatg caccagcccc tccaacctgt taacatttag
aatctattca gaaaaattta 960agcattgtta atttcctttg ttttttgtct agcatgtgtc
agattttttt aaatgtattt 1020attaatagct tttaatgtta atactctaga acagtagaat
cttgaaaatg ttttaagtga 1080caattagaga tttaaattta tgctgacatc ctctgcatgt
gatactgatg aggaaagaaa 1140gccaaactgt cttacggtca gttcgtacaa tataccaggc
cttgatggtc acatttcaac 1200ttgctacctt tttgcttaca tttttcttat ggtgattttg
aggtgtcatt ctggtttctc 1260agatacttaa aatataggaa aaggtgtgtc ttaaaattga
gagaatgtct tggataagca 1320gctgtgtagt tttatatttt gctgataagg gaaggtactc
tatttttgtt ttttgtgtgt 1380ttttgtttgt ttgtttttga gacagaattg cccaggctgg
agtgctgtgg cgcaatctca 1440gcttactgca acttccacct tctgggttca tgcaattctg
gtgcctcagc ctcccaagta 1500tctgggttta cagacatgca ccaccatacc tggctaattt
ttgtattttt ggtagagatg 1560gggtttcgcc gtgttaccag gctggtcttg aattcctggc
cccatgtgat cccccggcct 1620catgcgatct gcccgcctca gcctccctaa gtgctgggat
tataggcgtg agccacccaa 1680cccagccagt actctgtttt tgatagctat tcacaatggg
aaaggatgta gcaacacatt 1740ttaaccctat gttgagtttt aggtgggttc ctttgaaatt
ttgttaaggc taacttttgt 1800taattttttt aaaaaagtgt aaattaggaa atgggttttg
aattcccaaa tggggggatt 1860aaatgtattt ttacggctta tatctgttta ttattcagta
ttcctgtgta cattttctgt 1920ttttattttt atacaggcta tgtagaacca atgcagacac
tcaatgatgt gttagctcag 1980ctagatgctg ttgtcagctt tgctcacgtg tcaaatggag
cacctgttcc atatgtacga 2040ccagccattt tggagaaagg acaaggaaga attatattaa
aagcatccag gcatgcttgt 2100gttgaagttc aagatgaaat tgcatttatt cctaatgacg
tatactttga aaaagataaa 2160cagatgttcc acatcattac tggtaaaaaa cctggttttt
gggctttgtg ggggtaacgt 2220tttgtttttt tttttttttt tttaatcttg gagtagaaat
atatttaaaa ttgatggaga 2280aaattcccag ttcttaacat tagaaaggga atatattatt
cttaccagtt agtaatctat 2340tcacatttgg tttagaggga agatttagaa ggtgagataa
aagcttgtga gagaatagtg 2400tattcatgtg aaacttcttc catgggttca gagcatttag
aaacaaacat cccttcacac 2460tcaaagctta cctttgagcc agtcctccaa t
24913713126DNAHomo sapiens 371cttacctttg agccagtcct
ccaatagtga ggtctttgaa ggtcaggcca aattggctgt 60gggaggacct caggttagga
taggaattat tttaagacat ggcactatat tcatgtgaaa 120ctcgcaaaaa ctagccttgc
atataggctc atgtatcatg tctcagctga gatgtttgag 180agatcttaac tagattctag
aaaacaaaaa aggaagtagt tttggggcaa atatatttgg 240gaaacagttt attgtatttc
ctttccccaa atggattttc aagttcttca tataatctaa 300ccccaacaaa taaattgcct
gtttttcaaa agaaagatca tgtcttcagg tttttgtgtg 360gggtttaaat gattcgaaag
atttgaccat actgatacat tcactagtaa ccttagttac 420taatgagtaa tggttttgag
ttaatcagtt aggcctgaac tacttttctg gaagttagta 480aattatctca caggcagccc
tgtgagccat gggaaaatgt gtatatggtc tttctaggcc 540acagtcaaat tacaggtata
tttgtcatgg cttctcttga tgaaaggccc agtatcggtt 600tgtctgaaga tatataatag
cattgctttt gggggtaata tgggcagtaa ctctgtccac 660atctttgggc aggctgtggt
tctgccttta tatgctatgt cagtgtaaac ctacgcgatt 720aatcatcagt gtacagttta
ggactaacaa tccatttatt agtagcagaa agaagtttaa 780aatcttgctt tctgatataa
tttgttttgt aggccccaat atgggaggta aatcaacata 840tattcgacaa actggggtga
tagtactcat ggcccaaatt gggtgttttg tgccatgtga 900gtcagcagaa gtgtccattg
tggactgcat cttagcccga gtaggggctg gtgacagtca 960attgaaagga gtctccacgt
tcatggctga aatgttggaa actgcttcta tcctcaggta 1020agtgcatctc ctagtccctt
gaagatagaa atgtatgtct ctgtcctgtg agaaggaaaa 1080gtatatttgc agattctcat
gtaaaaacat ctgagaatgt ttgtcttagt ttaatagttg 1140ttttcctgtg gactttatat
actttgtatt gtcttaaaag agtgattgat ggtagctacg 1200gaaaactttg atttttaaaa
ttgtctcttt aagtagacaa tttataagct actggtacga 1260gttcacctta taaatctcca
ctaccatgtt tttgcttgga ctgttcacac ttcctggaat 1320ggtccttctt gccgtttatc
caacttcttt ctaattttta agtccctaat gatgggaatt 1380ctatttctgt agtgattttt
ctggtcatac gaccgtaagg tcatgggtgt ttttctctga 1440attcctcttg agatgcctgt
aacttgaacc acgtttttat tctagacatt actgaaatgt 1500tttgtcttta tttcactttt
taggagcttc cttgaaggta gggactatac cttctatttc 1560ttggtatctt tttctttctt
tttttaaaag ttttttagag agacagggtc tcactctttt 1620gcccagactg gtctcgaact
cctgggctca ggtgatcttc ctgccttggc ttcccagagt 1680gctgggatta caggcatgaa
ccaccgtgat cctccttatt tcttagtatc ttctaaagaa 1740cattaaatat agtaggtgcc
tagtaaatta tgtattgatt taacttcttt gaggttctgt 1800tgtttgtgaa gaattataaa
agcaatacaa atgtttgtat agtaattaag caacaggtta 1860atattcatga cttaaaagat
taaagaaata agcaaaacat gttagctggc aactcacaga 1920aaaagaatta aattgccaat
gagcacacga gcacatgaaa aattagcaaa agtttcaccc 1980ctttacatat atttggttaa
aattgagaaa agaatagtaa tagatggtat tggtaggact 2040gtggcaggca cacaatttac
atgaccacca aaagtgtatg caggtatcca tgtcaccaca 2100ccctggtctc atcttcattc
agttttattt atttttttta atctcggcct atttgattgg 2160cacgaaatga atgatagctg
ccttatttgg aattcctttg attactacta gtgtgcttga 2220taatgtaaaa caatattcaa
aatctgtttt tcctttcatc cgttgtttgt tcatgttcat 2280gacctttttt tttttttcct
attctcctcc ctccctccct ccctccctcc cttccttcct 2340tccctccttc cctccttccc
tccctccctc ccacacaaag gtgtgtgcta ccatacctgg 2400ctagttttta attttttttt
tttttttttt ttttagaggc aaggtctcac tatgttgctc 2460aggctggtct gggctcaagt
gatcctccca cctccgcctt ccaaagtgct gggattacag 2520acgtgagcca tcatgcctgg
cccttgccca tttttctatt gaagttttag tgctttttat 2580tgactttgtt tatatattaa
gataatccat tatgtttgtg gcatatcctt cccaatgtat 2640tgtcttaatt ttgtttttgt
atgtgtatgt taccacattt tatgtgatgg gaaatttcat 2700gtaattatgt gcttcaggtc
tgcaaccaaa gattcattaa taatcataga tgaattggga 2760agaggaactt ctacctacga
tggatttggg ttagcatggg ctatatcaga atacattgca 2820acaaagattg gtgctttttg
catgtttgca acccattttc atgaacttac tgccttggcc 2880aatcagatac caactgttaa
taatctacat gtcacagcac tcaccactga agagacctta 2940actatgcttt atcaggtgaa
gaaaggtatg tactattgga gtactctaaa ttcagaactt 3000ggtaatggga aacttactac
ccttgaaatc atcagtaatt gccttattct aagttagtat 3060aaattattga tgttgttata
gaacccattt accccttaat tcacagtctg ggggtaggaa 3120catgta
31263722920DNAHomo sapiens
372ttctgcatca gttggttgca catgagtgag ataatcttgg ttctttatcc tttgttattt
60gtacttcatt gggaatcctt ttgagttagt atatttgagt cattattatt attgctgtag
120aattcaggaa cttttagtag atctggcagc ataaaatttt gcttttaaat cattgtttgt
180gttttgtatg ctatagaaat gggttcagaa tattttttaa aaggccagat gaagtgtgaa
240gatagaaaaa cttcatcctt cactgtgaat gtttaacaaa catttgcttc tactttattt
300ttgtttgctt cctttagttg tgcaaagtat tcagttctag aatgcatgag atatatgaca
360aagccaaaaa attctttata gttgataaat aattgtggca aaaacagctg tatagtaact
420ttgcaagcat catttgatta aatgcttaaa aagtcttgac tcagttttaa ctatttcctg
480caaataatca atatttaatt aaagctactc caaattagtg acactttacg tgtctgtctt
540tctccctccc cttctccctt ctcccttccc ccttctccca ttctcccatt ctcccttctc
600tcttcttcct ttcctcttcc cttcccttcc cctttccctt cccccttccc tcttctcttc
660ccctccccct tcccatcccc catcccttcc cttcccccat ccctttcctt tccccttccc
720ttccctcctc ttcctccttc ccttccccct tcctccttcc cttccccctt cctccttccc
780tttcctcttc cctttcccct tcccttcccc cttcccttcc ctcttccctt ccccttcccc
840tttcccctcc ccctctcctc ccctccctta ccttcccatg aaatgagaaa gcctcagaga
900tagtggcttg attaattttt ctttagatta agatatttgt ctaagccttt aaggtttatc
960tattgagctt ttttgtctcc tatttttatt tttcctacta tgtttgtcga ggataaaata
1020cagcactgtg tgccaagtca taatcacttt tcatttgaga cttaattaaa atgcctttat
1080tttaatgata tatttggcta atgtatttga agtaatccga aattaagttt tctaatgaca
1140aggtgagaag gataaattcc atttacataa attgctgtct cttctcatgc tgtcccctca
1200cgcttcccca aatttcttat aggtgtctgt gatcaaagtt ttgggattca tgttgcagag
1260cttgctaatt tccctaagca tgtaatagag tgtgctaaac agaaagccct ggaacttgag
1320gagtttcagt atattggaga atcgcaagga tatgatatca tggaaccagc agcaaagaag
1380tgctatctgg aaagagaggt ttgtcagttt gttttcatag tttaacttag cttctctatt
1440attacataaa caggacacta agatgaaggt tttttgttgt tgtttgtttt cctctgtgtt
1500tctagtgctt attttttaat cagttttttt gatggcaaag aatctatctc tgtgttattt
1560tgatttctgc agtatataca tctgcatgat caatattcga tttcaagtac caaagtagga
1620gtaaaggaat attaacctag gtttaaaatt agtcatttca ctaaaattag ttattatgga
1680cgatagatgt ctaggtatat ctttgttcat aaacgaatat atcaagttca gttattaaat
1740tacacattag gtaagaaaag gacaaagaaa taaaaaagca tgattcataa ttcctgccct
1800ctatttgtct agaatttagt tgggaagata agaataacga acgtgacaca gagaataaag
1860tggcatatga caaatattta ttcaagaaag ctatatgtgg acgggatgtt tcagttctca
1920tgggagaagt ggattttatg gtgcctttga gtaatgggtc atatttgggc gttcacacag
1980aaagacccaa gcatatgcct aattttttat tattattatt ttttatttat ttatttattt
2040tttagacgga gtctcgctct gtcgcccagg ctggagagca ggggcgcgcg atctcggctc
2100actgcaaact ctgcctcctg ggttcacacc attctcctgc ctcaggctcc cgagcagctg
2160ggactacagg cgcctgccac cacgcccggc taaatttttt gtatttttta gtagagatgg
2220ggtttcaccg tgttagccag gatggtctcg atctcctgac ctcatgatct gcctgccttg
2280gcctcccaaa gtgccgggat tacaggagtg ggccactgtg cccggccctt tttttttttt
2340ttttttttaa attagaggat tactagttct cttcaattat aaaaataaaa gaatcttatt
2400tcactgcctg gtcctggaaa catgtactgc aatatacatt gtgacaactt tttacctgtc
2460atgtttttag cttttacctg tgaatgtctt atcattgttc ttatctgaag gatagatagt
2520tgctacaata ataatagatg gtgtgtatgg tttttgagcc taaaaagtgt agttttatct
2580gttgtaccta tacaagcagg agaaatataa cttgttaata attttaggta tggcaggctg
2640ccatcctaaa tatgaagtgg tctttgtatt tgcactttaa tgtgttgaaa tcatagcttt
2700cagtgatcca ggattaggca gactctttta tgcaatctct tgtttccagt tagaatagaa
2760gtcgtgtact tttgataaca ttaattataa tatattttga gccctgtgag gttggtaaca
2820ttattcccat tttatgaatg aggaatgtgt gttaaggagt ttgcccaaga gtcacatagc
2880aagtcatagt catgctctct gaagcagcaa taacttggca
29203733092DNAHomo sapiens 373gccctgtgag gttggtaaca ttattcccat tttatgaatg
aggaatgtgt gttaaggagt 60ttgcccaaga gtcacatagc aagtcatagt catgctctct
gaagcagcaa taacttggca 120ataaaataaa aatgaagcat cttctgtatg tgttaacttt
tcagtgactg tttatgcctt 180ccagtattct ttgtaaacct tgaattcttt ttttcacaga
tgattaaagt ttatcaattg 240taaaggtgga ggaatttggg aactagacag tgcacacata
aataataaat atgttcttca 300aatattgggt gggctaatgt gggaggagtt tgagaccagc
ctgggcaaca tagtgagacc 360ctcgtctcta aaaatatgaa aaataaaaaa aaaatttttt
aaatgtgtga tatgtttaga 420tggaaatgaa acaatttgtc actgtctaac atgactttta
gaaaagatat tttaattact 480aatgggacat tcacatgtgt ttcagcaagg tgaaaaaatt
attcaggagt tcctgtccaa 540ggtgaaacaa atgcccttta ctgaaatgtc agaagaaaac
atcacaataa agttaaaaca 600gctaaaagct gaagtaatag caaagaataa tagctttgta
aatgaaatca tttcacgaat 660aaaagttact acgtgaaaaa tcccagtaat ggaatgaagg
taatattgat aagctattgt 720ctgtaatagt tttatattgt tttatattaa ccctttttcc
atagtgttaa ctgtcagtgc 780ccatgggcta tcaacttaat aagatattta gtaatatttt
actttgagga cattttcaaa 840gatttttatt ttgaaaaatg agagctgtaa ctgaggactg
tttgcaattg acataggcaa 900taataagtga tgtgctgaat tttataaata aaatcatgta
gtttgtggaa tttgagatgc 960attgtagttc ttcgcagtgt gacttcaaat attttggaag
aaacaaatag ctcagagacc 1020tcgtaaaata tcttaaactg gagggctcca tggagatcat
tgcgagtgac tcccccagaa 1080tgtccatctg ttgacaggag ccaggctggc tgcatacgaa
ttagctaagg agcttattat 1140atatccagag tcctaccgtg agcctccatc ccgtctgcca
ttctcccatc cctggtctat 1200gataagactt agaaatctgg attttaacaa aacgtttcag
attgagaacc ttgatttagt 1260ctacttctcc tattttacaa taaagagatg aagcggttaa
gaattagcta atcctacgca 1320aagtgaggga aaaaggacag tctttttaat aaatgcggcg
ggctggtggg gtatccatat 1380aggaagaaat gacattggac ccctactcca tgtcatatat
aaaaacctcc actttgggag 1440gcgaagcagg caatcacttg aactcaggag atcaagacca
gcctggacaa catgacgaaa 1500ccccatctct acaaaaataa atgcaaaaat tagccgggca
tagtggtgct tgcctgtagt 1560cccagctact caggaggctg aggtgggagg atcacgtgat
ctgggagagg ttgaggttac 1620agtgagctgc actccatcct gggtaataca gtgataactg
tgtctcaaac aaaacaaaac 1680aaatcacctt cagtgatttt tagaccaaat gtacaaggta
atactctcaa ggttttaatg 1740ttttatagtt ctgcagaaga taacatagga aaatattttt
atgtccttgg ctttgggaag 1800aatttaagtc acagaaaaac accatccata aagtttgact
tatttagcta tttgaaatta 1860acaacttcta ttaaaaggca ccacaagtga aaagacatga
atcgtaatgg aagaacatac 1920tggtacgtta taaaatatca aagagttggg catggtgtcc
catgcttgta gtcccagcta 1980ctcaggaggc tgaggcagga ggatcacttg agcccagcag
ttcaagtctc agcagttcaa 2040gtccagcctg ggcaatatag caagactgca tttcttttct
tcttcttttt taatacctgg 2100aataaagaac tcctataaaa tcactaagaa aaggggtcac
ttaagaatct cattaacaaa 2160aagaatttga atatttttcc aaggaagata tgcaaatgga
ctgtaagcac atgaaaagat 2220gcagatcagg gaaatgcaag tcaaaaccac aatgagctac
aacttcacac tgattacgat 2280agttaaaatc aaaaagtcag atggtaagta ctggcaagga
agtggagaaa ttgaaactgt 2340catgcgctct tggtgcgaat gtaaaatggt gcagctgctt
tggaaaacag tctggcagtt 2400cctcagacaa ttccactcca acgtatatcc aagtggaatc
acaacatatg tccccacaaa 2460cttgtacata aatgtttata gcaggattat tcataatagc
caaaaggtgg aaacaacccg 2520aatgtccatc agcagatgaa tgcataaatg aaacgtggtc
tatccataca atggagtata 2580ttattgagcc attaaaggaa tgaagtactg gtacatggtg
cagcttagat gaaccttgga 2640aacattgtgc taaatgaaag aagctggtta caagagtcaa
cacgtatgat ttcattcatg 2700tgaaagttca gaatagagac agcagtagag acaaagtagc
agttcagggt tggtgccagg 2760gaataggggg taggtggggt gaaagctaaa ggatacggtg
tttctttgtg agatggaaat 2820tctaaaatag gtgatgttta tacatgtctg tgaatatact
aaaaaccatt gaattgtaca 2880cattaaatgg atgaattgta taggaattat attttaataa
agctatttaa aaaaatccag 2940acacttcacc caagaggaaa tctaagtggt ccataaacat
gaaaaggtct ttaatcacca 3000gtcagaaaaa tgaaaatgaa aaccatgcca ggccacctcc
caccaccata gtgacaagca 3060tttcaagtgt ggcagttcca gctgttgttg ag
30923742985DNAHomo sapiens 374ctctcaggtc aggcttctga
caagccacaa tgtgggtgag cctttgtgca ctgcctgccc 60acctctcacc aggagccctc
tctccccatg gcctcaaggt accagtgagg cttttttctg 120tctcagcctg gccataagca
gccctctgca agagttccgt taccagtcat ttgcattgta 180gtataagtgg aaaccacaga
atcgccttcc tccccagtta tttatacttc aagtcatatt 240gtagagagaa aatttctgtc
agcaaaaatc tcaggaatcc tcctcatttc tatttgtatg 300gctttcaatc gttgacatga
ttttttcaca tatgtcatct tctggggatg gattcgtata 360accctgcttc acttgcttcc
ctgtgggagg ctcacttgct tctcgacagg ctctggaaga 420actaggcagt ctggtacatg
gttgtgcaag aacccttgag ggggccttgg agtgtgtgct 480tgggccctgg aactcatgcc
taggatggag ggctgagatt gccccttccc atccaccagg 540gagttgacaa gggggagaag
aaacttcttg tgagcttgcg atgacttgtg gcacttgcat 600cagaccttgg agttccctgg
ggagaggcac tcttgggtat gacactgtat agtgccacct 660gattgccatt tgacccagtt
tggccctgga tccttgagca agagggctgg aaagaaagac 720aggcccactt tttgggacac
tattagggtc tgtagcattg gtggggagag aattccccca 780acccccaaaa gagctgaaaa
tgagacacgc gtggaggggt gaaagtggag tgtggtcaac 840agtgtggtta cagagatgtg
tgtcggggcc actcccactc accagggaga ctcatgaagc 900agaagggatg gggcacaatg
tggcttccat aggcacacca agccacctgg agagcgcatc 960agccctttgg gtacccccaa
gcggaaggag gttgggtctt tgggtctggg aactttggtg 1020cttgttctgg tgggaagggc
agggagtcaa gaccagctgt gtcttccact gctcttcttg 1080tccactttgg ttactggcct
ctgttggcat gaactgggga ggcagaggct acctacagac 1140gaggaactgt gtggagtgcg
agtgtatgca gtaaagggtt agcttagctg acttgaggta 1200ctcacaccca tattccgaag
aaaagactgg ccctcagcct gagcctccga aataatctct 1260aagcccttag aataccctgc
tttgtattca aagagtatct ttgaatgctg aacttagaac 1320cactctagaa aatgtatgct
aacaatgcga tttatgatga acacttgtct ttgttcccct 1380ggggccctgg gccacattgt
atcagtttga gccctagagg gacagagaat gagaaactaa 1440gatcagtcat gcaggtgctc
caggcctatg tgaccaacca ccaataaaaa ccctgaacat 1500caaggctcaa gtgagcaata
cagctggtcc caacttacag tggttcaact tgtgagtttt 1560gcactctaca atgggtttat
tgggacataa cccagtggag gaggatctgt acttcattca 1620catgtgttgt cacatcatta
ctgggagaat taagcactgt ccacgtgaat ccactgggag 1680aggataactg gaagcttgca
cctggcttct cctggattct gctctgtacg cctttttccc 1740ttgttaattt taatctgtat
tctttcactg tagtaatcta caactataag cagaatagct 1800tttctgagtt ctgtgagtct
ttctagtgaa tcattgaatc caaggtggtc ttggggacct 1860ctaacaaaag atgtctggac
ctgaacttcc tgttgtttca aagatcctat agcaggctgt 1920cttaccaact ttcagcatca
agaagctggt ggagagtggg ttagtttaaa aatgaaactg 1980gggagagaga tgaagccggg
ggaagatgcc gtgaaatctc accttatagg cagcctctga 2040ttcacctgag ggtttttcct
tgaatacttt ctgggtacaa gtatttgaga caggtgatgt 2100gctggtcact ttattctcag
ctgcttgtgg cctagcccta acatgggcac tggaaacaat 2160gggggtaggg gttgatgatg
gagaaatggg gagtaaaggg atttaaaact ttgaaaaact 2220gagctgtttc catgatttgt
ctcttttgat tctcacaaaa cctttatgaa atatgtgctg 2280acattttaag ctctcactta
tagtgagaaa agcaatcttc agcaaggtga tgacttgtcc 2340aagggaagac atggtcgccc
ttgttccttg ggagattttg tgctcccagg ggaaagcata 2400agccctcagg agccatgatg
agaacagctg tagaacagca agtgaacagg tgtgtatcag 2460tcaggatagg caaggctaag
ctgcagtaat aaataatccc cggatctcag tggcggaaca 2520ttgaggaggt ttatttcttc
tttatacaaa tatgctgtgg atcaggatga ctctccaggc 2580aactgtctgt gggactgtcc
aggtgggctt ggatcacctg gtgttgggcc ttgaagtcgg 2640taatggagag gacatgttag
aagagaagga acttacaagc agtgggagtg cagcgccctt 2700ttgtggatag gggtcaaggc
aatgctttcc aaggctatga cttggtgtgg tcgaaaaagt 2760caagcagtct tcactttttg
ctgtggtccc agcaaatctg cttccaatcc aggcttctcc 2820catataaaaa gcctcctttg
tgtacagtga gtgaactaga acagggagga gatgccagtg 2880gagcttggct tgctccttct
gtggccagct ggcttgtttt accactgcct ttggggtaca 2940gtggcagctg tggcaaatct
ctctggagtt tctctagcgg gagcg 29853753068DNAHomo sapiens
375agcgaagcac ctaaagcaca tgggtgcagg agcagccagg cctgcaccca tagacatggt
60acagagagga gcagggaagc ccgctgcctg cagacttcag gaggagagag gtaggggtgg
120tgcaggggag agggccttaa tgccttcagg gaaaggagtc aaagaggaat acccaggaga
180caactagact ttagaattct tggggccaga aacttgattc cacctctagt gctttctttt
240agatttcttt ctctctttac tttctttctt tctttctctt tctttctctc tctctctctc
300tctccctccc tccctccctc cctccctctc tctctctctc tctctctctc tctctccctc
360cctctctctc cctctctctc tctctctcct ttctctctct ctcttcttaa gactgggtct
420cgcagttggg cacagtggct catacctgta atcccagcac tttaggaggc tgaggtgggt
480acatcacatg aggccaggag ttcaagacca ggctgggcaa cactgtgaaa cccatctcta
540ctaaaaacac aaaaatttgc caggcatggt ggcagatgcc tgtaatccca gctactcagg
600aggctgaggc aggagaatcg cttgaacctg gcaggtggaa gttgcagtga gccgagattg
660cactactgca ctctagcctg ggtaacagaa caagactcta tctcaaaaaa aaaataaata
720aataaaataa aagggatacc gggtcttgct ctgtgtccta ggctggagta ccatggtgtg
780atcatggctc actgcagcct ccacctcccg ggttcaagca attctcctgt ctcagcctcc
840caagtgagta cctgggacca caggcatgtg ccaccatgcc tggctaattt ttaaattttt
900tgtagagatg aggtcttgat acgttgtcca ggctggtctt gaactcctgg gcccaagcag
960tcctcccact ttggcctcct gaagtgctgg gggtacaggc gtgagcctcc acctggccag
1020cctccagtgc ttttgcatcc ttcctgttaa cttgtgtagg aataaaacat tgtcacaata
1080agattttttt cctttttatt gttttgattt tttagccaat gagaaggaaa attccttatt
1140agggagggcg agggtgagga tatgtggggt ggggagaagc gaacgttcca agtttcgaaa
1200acagcgactc tctcttggac tctctagcca gtagaaacct ccctcccact ctcttgcccc
1260aagatctggt gcttagaaga gaatcaaggg aagttggaac ccagaagacg gagacagatt
1320gagggactgc tgtgaaatgt tggggtgttt ggtgaataat attagaagtt gggctggcag
1380agaccctgtc acataaacat taaatcaaca ctggagactg agcatttgtt agaaatgtaa
1440gcgggaatgg cagaaaactt gtttttaagg gaaagcatgt tacggcttat gttcagcctc
1500catcctctga aggcaaaagt tagcaaagtt gatgtatggc gttgcttttt ctgggaactt
1560tatctcgttt ggtggggttc ccatctctgt ctcccaggag ccaagacttt cccctccctc
1620tgctccagca gaagccagtc tcaggcaagg ctccctgtac ctcatttaca ctttggtgtg
1680aatatgttat tgtaacctct ctcctggagg tgtctgcatt ccaagactga acttttctgt
1740gaaagttact gtcactgtga aaggcagttc agcccccagg gattgaaaaa ggaaatcatt
1800ttgggtaagg ggacagttag tccagatttt ttcagttgca agtaaaccta actcagccag
1860taggcaaagg gggaaattgc tggtttgaac tggtgggaag aaagctgagg aaactcctac
1920acttggggga agaactgcag gtgcctggct gcagggaacg cagcgggggc tcaggaccag
1980gcagatgccc tgcctctgct tcccttggca cagtggcctc cttctccctt caagtaggca
2040gatgctgcct gtggcagagg acagcagctg attggcagcc cagcagggag gatgtggtag
2100acaggcactg agcatctctt ctaccctcct tctagagggc tatcctgtac tgttgaggct
2160aaaagactga aaaccacatt tcccagcctc tcttgcagct accaatctgg atgagagtta
2220gattctacac attagatgca ctttagcaag attttcaaaa gcagattgga gaaggagccc
2280atgcttctgc tggttttttt tgctggcaag tgaggggttc tgtttttcct ggagtgactt
2340tatcatggtg gcatctgaaa aaggctattt cttgatcaga gagacagcaa ccctctcagt
2400gacctagttc tgtgggtgtg tctctcctga gagttaatcc cagagctcaa actagagctc
2460aaccctagag tctcttcagg cttcccaggg gtgggggtgc atttaacagt ccaagttaaa
2520gagaaaataa aggccattaa agaccaaaca ttgagcactg agtgaaaaag ttttattgcc
2580aaacaggaaa cctgattcag gccagggtct tggaaggttg ttcaggatga gatgggggag
2640gtgaaatggg gtaggtcttt gaaaaccaac agattgcaaa ttctctgtcc catagcagga
2700aaccacagtc tctgatgtca gctggctgcc aacacgtcag ttgtatcagc attagctggc
2760tggaggtggc ctgctgtgtg cagatggtac ctggtgcagg attgtggtgt ccaggtgtct
2820ctccttagca cataagaccc tgtccgagga ctgtggcatg acgtgctgga gtcacgattc
2880tgtcacccag tcaggtcatc agtgtcagag agctaggtgg ccaggttgga gttgattgcc
2940aatgataggt ctttttctgc ttaaatcagc tggactggat tctattgcat taacttgacc
3000ctgactcatg ccgccaggcc taatttataa accaagacaa gaaagggcta ctccaccccc
3060tccaattt
30683763117DNAHomo sapiens 376gtcacccagt caggtcatca gtgtcagaga gctaggtggc
caggttggag ttgattgcca 60atgataggtc tttttctgct taaatcagct ggactggatt
ctattgcatt aacttgaccc 120tgactcatgc cgccaggcct aatttataaa ccaagacaag
aaagggctac tccaccccct 180ccaatttgtg taaggccagg ggacttcccc cccactcccc
aacctgaggc atgcaccctc 240ccttagatca atggctgttt ctctgagaat gcggaaccgt
gattaatcca gccttgatgg 300ggaggcagca ggaactgtag gcattctcac ttcacaccca
tcccaatccc ctcccccttg 360ctgtcctctt gtacagagga ctgaaagcac aacactctct
ccctccctcc cttataggtg 420gtgacgatca tgtgactctc ttctggtcaa tgagatgcag
cagaaagtcc tagggaggtc 480taggaaaagt cctgttggga gagagcattt tttaccttct
ccctgctact tcttgctact 540agtaacatgg atgtgagcct tggaggggta gctaccatct
ggcacctggg gtggcaagcc 600aacatggaaa ggatggcaga gcgggaagga ggagccagcc
ttaccgatgg catcactgtc 660actgcgctag ccccagacca cctgctccag agttctggtt
atggtaatga aataaacctt 720gatttttatt ccttaaaact acccttcaat gggttttctg
ttcattacag ttgaatgctt 780tcataactga tacaggaggg accctgtgat tggcagttcc
actagactgc atggagatgg 840gtggagttat ctaaaagaac agagatagtg tccctagaag
aaggggacag gaaagcatcc 900tgggtacaca aaagtcaagg ctccaggatc tgccctgggg
gctatctcaa cacccctaca 960ctctcaccgc acgtatttgg tcagctatga atatgaccaa
ctctcgtcgt ttatctctat 1020tcagtggaac acagcagcac tgtgacctgc ccacgagaag
aaggattttt agaacttatc 1080ttagggcaat tttaggtaga ggagcagaca agatggtgta
caggagaaac aggtctatta 1140accctggtat taatattaac tggctgccca gaataaatga
agaatagctt attctttgcc 1200aggttgaaga tagaaaagga atgaagggcc ggagaagtac
agctgggtga agcacagagc 1260agcctagtgc ttggcatggg actcagatct gaagcagcct
ctccgggact tctctgagcc 1320tgcccctggt ggtatgactg tgatatccct gcttctatag
ttggcaacca acatgtccta 1380gctcctagac catagagggc cagattcatg tctcattgac
tgtgtaatct ctgtgtggcc 1440cagtacagag catgcacacc gtaggttctc acatatgttt
gttgagtgaa tgaatacaat 1500accaaacgaa tggacaggac agagctgtgg gctagcagga
aggatatctg gcttttgctt 1560gaattagcta gtgaattgct gtgtggcctc cttactgagc
ctcatttccc tctgtctgca 1620gagtcaagca aatcttccat tttttgttcc cctgctgcca
gagcatggca gagtaaatgt 1680gtgagttgaa gggagcaacc tcatgaggtt ttgctttgtg
tcttaattac agccatttgt 1740ggaattaggc ttttaatata aatatttgtg tgcctgcgcc
tgcatatatg tatttggacc 1800aatgctctca tgtgtgcaaa tacatgtatt ctaaagaaat
ctgtccagaa ccccagcatc 1860tgtggtgtct gtggtgggag gggcttccat attacagaga
gatgcccaca gtgcatgacg 1920ttacccgcac aggtgtgaca tcacagggta accaaatgct
tttgccctgg gggtgggaga 1980gggatgggtg cacggtgaac agcaggtggg ggtctttcca
taggggatga ggaagacaag 2040gccacttgga ggcagaggag accacagtgg ggcatgatgg
ttggggaagg ccttttactt 2100ctgcccctta aggatgccct ggaattcagg ctttcggatc
ccagagctct cattagagca 2160gccctgcgtt gtagactttt ctgcagtgac agaaatgttc
tatatctgtg ctatccaata 2220tggtagccac aagttacatg tggctattga acacttgaaa
tggggttagt gcaattgacg 2280agctgaaaat gtagtttaaa ttcacttaca tttaaatagc
tgtgtgtggc ttgtggctgc 2340ctattggact gtgcagttct ggagaatggt actttacttg
tccttgggga agcagaaaca 2400aatgaaaacg aggatctgga gctcatgaag tttctcatgg
ggtggggtat gtgtgttgaa 2460gctgcacctt cagcaggaac ctggccagtc cttagtggag
gacatttctt tccatcctgc 2520atccagatgg ctggtcctgc tcctcccagt ccatggagaa
aaaagaattg aacaaactgt 2580ctaagctggg tcaggtactc tgcagatgtt tgctgagtat
cgttcttgat ggaaatcccc 2640gtggaactcc tacattttct cctctcttct ccttcctttc
agaacctcag agtgacagag 2700ccaaaagacc agtgcctcat tttgctgaca tggaaaagga
aacttcgtgg gggaaagaga 2760tctgcttgca gtcggccaga gagacagaac cagggcagtg
gtgagctctc atgacctggt 2820gtctgttgcc ttctggttaa gtttttcatt tgtaattcta
caaacatccc ttctgtaaac 2880atttccctca aaatggagca ggaagctctc aaaaatggac
cagaaagggg tcaggaatat 2940aactttctct gcccagattc caggacttac agtgagaaag
cgccttctgg gaacttcaca 3000atggctaaag tgtgctaatg ggatgatgtg cccttgtaca
cccactgcct ctgaactctg 3060ctctgcattg ctgagcaaac tacatttccc agaactcctt
gttggattcc ttccaaa 31173773117DNAHomo sapiens 377tcccagaact
ccttgttgga ttccttccaa acaggtttac cactgggaga gcctgttggt 60tggggagggc
aggaagaggg aggaaagagg aagggactca cttcctgttt ccagctgaag 120tctaaatcaa
tccactatca acaggtagct atcatactac cctcattgtc acccctcaga 180ggtcccactg
cagctgcata atgtcccctc agtggcctga acatgagatg aacaacactc 240ttcttgggag
taccagcctt gcttggttca tggccacttt tcctgattat cttgcagcta 300tattaggtca
tgtgacaaag ttctggccag tggcaaggga acacaagtga taggtacaga 360tagaagtgtc
tgatactaca tagattatgc ttgcactcac tcttaagaga gagacatgaa 420cttttaccaa
cggaagccag tattattttg aacctctgtt agagtggctt gaatctgtat 480cctaacttgt
atccctaatg tgtgacccat gaaaattagc caggcagcac cagttccaaa 540gaagctcaca
ctcccctgcg gctgcttctg ccaaggtcac tgatatttcc ctttgctaaa 600tcttgtgggt
gttttcttca gtccttgtct taatcactca gtggcacttg gcacttattc 660cttcttgaaa
cccttgtttc ccttggcttt gtggcatcct gtgctcttgg ttttctccca 720tatctctgac
cctctttcct tagtcttttt tcttcttcct cctgtccctt aaatgctggt 780tgtgatcctc
tttttatctc attctacaca ctcacagcct gagtaattca caccatcttg 840atgctgagaa
cttccaaaat gttggtctag cctgggtcat tgttatgagc tctagactca 900caaggccaat
tgcttggtgg gaacccctcc cccatggtta tctcatgggt ccctgaagtc 960caacttctcc
ttcattgaac tcatcacctc ttctgttcct cctcctgggt tcccaggctc 1020agtggtggca
ccactgtcta cctggctgct tagcctgaga cctggctccg tcccaattcc 1080tctctctcag
tcttatcatc cccatccagg caaatcattg attctgtgga cctactcttt 1140cgggtgtccc
tcaaatctct ccacgtctct gtgttctcac tagcactacc ttggtccacc 1200ctgccatctg
ctttcctcct ccactcctgc attctgagtc attttcggca gcacacgcat 1260ccttaaaacc
cctccactgg cttgccagtg tcctcaggat taggcgaaaa gtctttgctt 1320tgttttacaa
ggcccttcgc tatctggccc cctcattacc tcccttgctc tgcatgctcc 1380agtcctgcag
aactacacac agttccccca acaaggccct gctctgttct tcccacacac 1440tgctcctctg
cctgggccac tcttcctgct ccttgtcagc aggcttgctg ctctcaggct 1500cagcatggac
agctgcttct gagagccttc tctgcctacc caggctgggt ggctgcctct 1560ctttggtgtg
cccatggcag cccagaatgc ctggtggaca gggagccctc agcaggccgt 1620actgcagcgc
cctgcccccg tcagcctcca ggagcctgga gtccagggac atcaagggcg 1680gtcctgtctt
tctcaccctt gtctctccag cccctaacac aggggatgcc tgaccccaaa 1740ctagacgagt
tacttgacct ctctgaccca agacaaaatg ggaggaaagt gccaaatttc 1800caagattggc
caggggatta aataagataa atatgcaagt ctcttatctg ggggtctggc 1860ttggtaaata
taaagttctt ttttcttttc ttcctttttc tttttttttt ttctttcttt 1920ttgagacagg
gtcttactct gtcaccaagg ctggagtgca gtggcatgat catggctcaa 1980tgaaacatcg
acttcctggg ctcaggcgat cctcccacct cagccccctg agtctcttgg 2040actccaggcg
tgcaccacca tgactggcta attttttgta tttttagtaa agacagggtt 2100tcgacatgtt
gcctaggctg gtctcgaact cctaggctaa agtgatccac ttgtctcagc 2160ctcccaaagt
gctgggatta tagacatgag ccaccatgcc cagctaaaag ttccttttta 2220aaatctgctt
gttagataca ctcatagaaa ggtaactggc cacagaaggg agaggaatgg 2280cagtccatcc
agggatcact ggagtgtcat atgaaatgtt ataggaatca caggccttag 2340aacttgaaag
gaacccaagg atcatctagg ctactttatg caggtaaaac agccacctgt 2400gcccatcaca
tagctggggc acagctggag accccaacag agaggagagc tgatgggtga 2460cgagaaatca
ggcctctccg ccacggcagc ctagctaatg ggtcttggct ggaagctaac 2520aggaaggcct
ctttccagaa acactgtaag ccagtgtttc tcagattgct gggtgtaatt 2580cataggcaga
tcatgaaatc agtttaatag ctttgaccag cattaaccta tttatgccta 2640gcgttccctt
attggaacac taagtctgtg agagttattt acatcctact gcttaaggtc 2700atcgccaaaa
tctgattttt tacacaaaaa atttgcaacc tccagcataa atgggttaaa 2760acaagacaaa
acaaaacaat accagaatgg aaaatagtgc atgatctgta cagtatagtt 2820gtagaaaact
tcttgtttta tcatttgatg tcatgaaagt ccctgctgta gataaaagat 2880ggagcttgtg
cttctgagtg gtcatgctca acagggtggg gagcccaggg gagtggggag 2940tgatcgtata
gacagaggtg ggtggggcca gtgtgagcct gatggtcaat tacttctcat 3000ttctagggaa
aattgaagga aaagaaggag ggggatgtgg aggggagaga aggcctcagt 3060agagtttgca
ctattattag ggcaagtaag ctgcttctga aaagaagggg tttgcaa
31173783192DNAHomo sapiens 378ccccatgcag aagcaatagg gcagcctggt cccatatcct
catgaaatgc ctcttataat 60tgtgacatct tgcaattgtg gaggacttta cacttttcgg
agttcctagc ccctcactta 120tttctcgtaa gaccgctggg aggtgggggg atggtatcat
catcccactt tagagatgag 180gaaacaggat cagagtgagc taaatgactg ccagatccaa
aactagaatt cagacctcct 240agtttctaag tggacgctct ttctacacca ccataatgtg
agtgttctgt gtttacaggg 300tgtattcaag tccatgactg cccattagaa tccccccaaa
aaattccagg actggcctga 360gttgctcctt agaccaatga aatcagactc ctgggagtac
ggcccgggcc tcgggatcct 420ttaaagctcc atttggagag cctcgggcac agccaggttg
gatccatctc ccagtccccc 480agccttggct cagcctggcc aagctgccca ggaggtccct
tggtgccctg ggctctgttt 540cactgttgtt ttgtagagca acttcccagt gatgctgcca
ctgggcccca tcctaacagt 600gaagtccccc gggccctcct gagaggaggt gtgaactgga
agatggggag gcaggcggct 660ctgacagaca gaaagcaaac agctcagagg ggtggcaggc
tgcattttat tcatcgttaa 720tttaaacacc cttcaagtcc tctcttggaa tgctgctcag
aaaaatagat gtattgtttg 780agaaaccctg caggcttgtc ccgcatgctc tagccccctc
ctgagagaac agatagcata 840aaaaatgatt tgtaaagcaa gggggagctt ccttagggaa
gaaggggaag gggaagaggg 900tttggggcca ggtccgagtg cagaaatcct caatgcatga
gactagcgtg gaaggtgtag 960caattgtgct ctggggtgcc tgaaagtgcc agagctgctt
caggggcaag agtccaggcc 1020ccaagtccat gctgatgagc ccaccctggg ggtcaggaat
ggcctcagca ggccctccct 1080ccctccctct ccaccctaca aagtgaggag ccttgagtca
ccaccagcac attatacaac 1140aatacaagaa ccctgcaaca gataaagccc cagcgcctct
tctggactca gatgccctag 1200gctggctgtc tggctgtgct ttccagacag tgtgtatgtg
gaattgtgct ttttgttttt 1260taagaatgta aaaagttaca gtaagatcga accacagggc
ccgtcgctcc tatggtctct 1320gcctgactgg gctgccgtct gcctcagttc cccagaagct
tctcctttgg ccatgagggc 1380tcagtcatcc ctcaccccag agtccacagg aagagggggt
ctgctgggag gcctgtctga 1440aggacggagg atcctgggtc aatttagcag ctattttcca
gggtttggct tgggtttgga 1500tgctggcttc tgtgtgaaac ctgaatacat gcaaattgta
cataaaactc ccccaaggca 1560gagagggatt ttccaggccc tggtacatct ctagagagtt
aaaaatggga aatctttctt 1620cttaaagtgg cccagactga gacttttcct tggggaaaag
ggttagtagc tctttgtaag 1680gctggtgtgt atgtgtgtgt gtatatatat atacatatat
gcatgatgct gtgcaaatgc 1740ccagggctgt ctggcatttt ccacaaaatg agagcctgag
attgcctaag ccttctgatg 1800ccttctccag gcctggaggc actgcttcat tcagaggaca
caaaggcctg accacctggc 1860tttagcaagc taggacaccc agggtggctt ctttaccttt
ctcctcagct ctgagaaggc 1920tgctagccaa gactctggat tctctgtggc cacagtcata
tggtgagggc ctcttggagt 1980tcattcaaac tttaagggag ccccacagca ccggcatgat
gggtaagtcc aggcctaagg 2040ttaggaagca aatcctggag catgaggaaa ttgtaggcta
cagtgagcta ccagtggtgt 2100gcaaactgga gacccccaag acagtgagag aggccacagc
atctgaggga atggagctct 2160ttcttggcct gaggttcaga agaacctgca ccaaagaaag
gcatccctat caatgtcact 2220gttcctgaaa tgatgggaga accacatccc tgcttcaggg
aagcagtccc tgtcgtctgg 2280ggcgctgagc cctttggcct gagatgaagg atgatggtgt
gatgtatcat ggcagtgtga 2340ctgagactgg attgggggat ggggacaggg gaacataggc
aaaaatacac atgtgccact 2400ggatcctgag ctgccattgt accttggagg actggcgttt
ctctgggaag ttgggaggtg 2460ggaagaggaa gggtctcatt ttcctgcccc ttgaaaccat
gcttaccatt cctttagaag 2520attgctcaag ctgcctccaa ttgcctcttt ccaaaaccaa
agcataggaa aacaagtaaa 2580aacagctgag gctgcagcat aagcaactta ggatagagtc
taggaagcac cgccaacaga 2640gaagactgcc aagaaacatt ttgagttttt cttctctgga
ggtgggtcct ggttcctccc 2700atggagacca cgattctgtg tagtcctgca cgctgggcgg
gggattgcct ggaggtttct 2760ttagacctgt ctagctcaca cagtcttgat gcctgggttt
taggctgctg tactgttgct 2820ggggctcact tcctgtgggt aggctgttat tttgcccgca
gatcaagtcc tcactgtcta 2880gatgcctcta tcatggggat ctcttcttcc ctctctggat
ggctctgatc cccaagttat 2940ttcctgttgc ctaggtaaca cctctaattg gatgcctttt
aatcgttccc ttttttaaag 3000ggataaatgt ggattttatt tccaggtcct gtcagagggc
cctgccctag agaacacgtg 3060cgcccctgcg tgggcaatcc cttcactgtg accgcaacca
tgggttggat ggggggcact 3120cactgggctg gcctgacagt cacagtgaat cctgaaagca
tggttttcac aggaacccac 3180cttcaggatt ta
31923791944DNAHomo sapiens 379agtttcagcc atgttgcagc
atgcatcagt acttcatttc tttttatagc tgaataatat 60tccatagtat ttatatatca
aaatttgttt atccattaac ctgtggaggg acatttaggc 120tgtttccacc ttttggctat
tgtgaatggt gctactataa acatgtgtac acatgcctgt 180ttaagtatat gttttcagtt
ctttggggta tatacctagg agtggaattg tagaatcatg 240tggtaatttt gtttaacttt
ttggaaaaat atcaagctgt acccaaagtg gttgcaccat 300tttgcatttc caccagcaaa
atgtgagagt tccagtttct ccatatcctt gccaatactt 360atttttcttt ttaaaaaata
gctatcctag tacatgggaa gtgacattca ttgtggtttt 420aatttgcatt tccctaatga
ttagtgatgt tgagcatctt ttcatgtgtt tattagtcat 480ctggatatct ttggagaaat
ggctattcaa gccctttgtc catttttaac tgggttgttc 540ggttttgttg ttgagttgta
ggagttcatt atgtattctg gatattaatc acttacctga 600tacatgattt gcaaatattt
tctcccattc tgtgggatgc cttttcattc tcttcatagt 660gtcctttgat acacaaaagt
ttttcatttt gatgaagtcc aattcacctg tttttttctt 720gaccaaaaag tagaaacaac
tgaaatgtcc accaactcat gaacagataa acaaaatgtg 780tatataatgg gatatattca
gccataaaat gaatgaagta caaacacata caacatggat 840gaaccttgga aactttatgc
taagtgaata cagtcagata caaaaaggga actattgtat 900aattctatgc atgtgaggta
cacagaatag tcattttcat aaggacagga aatggaatag 960tggttagcag gggctgaaca
gaggagaaga ttggcagtta ttatttaatg gacatagagt 1020gtttttcttt gaatgattaa
taagttatgg aactagatag tgataatcat gaatgtactt 1080aataccactg aattgtacat
tttaaaattg ttaaaatggg gctgggaaca gtggctcatg 1140cctgtaatct aatcctagca
ctttgggagg acaaggaggg aggatggcat gagccttgga 1200gttcgaagtt acagtgaact
ctgattgtaa ccacccaatg tgttcacctt gcccgctgcc 1260tagacagagc cgatttatca
agacaggata actgcaatgg agaaagagta attcacacag 1320agctggctgt gcaggaaacc
ggagttttat tattactcaa atcagtctcc ccaagcattc 1380ggggatcagg gtttttaaag
ataatttggc aggtaggagt ttgggaagtg gggagtgctg 1440attggtcagg ttagagatgg
aatcataggt ggttgaagtg agtttttctt gctgtcttct 1500gttcttgggt gtgatggcag
aactggttga gccagattcc tggtctgagt ggtgtcagct 1560gatccattga gtgtagggtc
tgcaaatatc tcaagcactg atcttaggtt ttacaatagt 1620gatgttatcc ccagaagcaa
ttaggggaag ttcagactct aggcgccaga ggtggcatga 1680tccctaaact gtaatttcta
atcttgtagc taatttgtta gttcgcaaag gcagactggt 1740ccccaggcaa gaagggggtc
ttttcaggaa agggctgtta ttaattttgt ttcagagtca 1800aaccatgaac tgaattcctt
cccaaggtta gtttggccta ctcgcaggaa tgaacaaaga 1860cagcttaaag gttagaagca
agatggagtt atttaggtct gattgctttc attgtcataa 1920tttcctcagt cacaattttg
ccaa 19443802197DNAHomo sapiens
380cagtcacaat tttgccaagg cggtttcatg atcatgcaac tgcactccag cctgggcaac
60agagcaagac cttgcctcta aaaaaagtaa ataaaatggt taaaatggaa atttttatat
120tatgtgtatt ttaccatgat aaaaaaaatg aaagaaaact ggtctagctt tattaatatg
180agacaaaaca gaatttagga caaaaaaatt agagaggacc acttaattat gataaaagct
240tcaagtcatc aggaataatt aacattggta caaaatatgt atgtaccaaa tattattgcc
300ttgacatgta taaagcaaaa gctgtcagaa tcacagagaa actcacaatc cttgcgggag
360atttgaacaa aattatctca gtaactgata gaacaagcag tcaaaaattt tctttcggcc
420gggcgcggtg gctcacgcct gtaatcccag cactttggga ggccgaggcg ggcagttcac
480gaagtcagga gttcgagacc agcctggcca acacagcgaa accctgtctc tactaaaaaa
540tacaaaaaat tagctggtca tggtggcggg cacctgtaat cccagctact cgggaggctg
600aggcaagaga attgcttgaa cccgggaggc agaggttgca aggagcctag atcacgccat
660tacgctacag cccaggcgac agtgcgagac tctgtctcaa aaaaaaaaaa aattttcttt
720cacatcaggg tgagaaaact catacaaaga tcttcctagc agcattattc atgacagcct
780caaactggaa ccgacctatt aataaatatc tatcactagt agaagagata aacacattgt
840attagattaa tccaatgtaa tactgaacag caatggaaat gaaatgaact gtaggtacat
900ccaacaacat ggatgaattt caaaacataa tgctaagcaa ataaagccag actcaaaata
960atatatgctg tattattcca tttacgtgaa gctcaaaaat aagcaaacta aattatatgt
1020gtagagaagc atatttattt gataacatta tttttataaa gcaagaaagt tatttccata
1080aaattcagaa ttgtagattt tttttttttt tttgaaacag agtctcattc tgtcgccagg
1140ctggagtgca gtggcatgat ctcagctcac tgcaatctcc gcctccaggt tcgagtgatt
1200accctgcctc agcctcccta gtagctggga ttacaggtgt gcaccaccac gcccagctaa
1260ttttttgtat tttagtagag acagggtttc accatgttgg ccaggatggt ctcgatctcc
1320tgacctcgtg atccgcccac ctcggcctcc caaagtgctg ggattacagg catgagccac
1380tgtggctggc ctcttttttt ttgagacaga gtctcgtaat tgtggatatt tctaagagga
1440aagaggaaca ttggaattgg aaagagacca gtgggccaaa ggtggaaaat gttgatgtag
1500acttctaaga ttttgacaaa attttgtttt atggcctggt ggttatataa atatttactg
1560tataacaatt cattaagata cacatttgtg ttttttgtat atatgtgttc tatttcacaa
1620tcttaaatgt tccttaatta attaatggag cacaccttca gagttgggtg ggaaaataat
1680tctgcctaga aatccaaact tagacaagct agctatcaag actgaggaca aactaaagcc
1740attcttacac ctgtaaggat tcagggttta tctactattt atgctatctg aaggagacaa
1800ttgaatatgt tggccaggaa accaagtgtg aggagtatgt agaaaacaga agatgatagt
1860actaaccctg ttaatctaat aaaaagaaac cccaggatga ctgcttgcag tggggtttga
1920aagaaatcta ttcaaattaa aacaggaggt ccatgtgctc caaaaagata ttcttttttt
1980ttaaatatat atatatcttt tattatactt taagttctag ggtacatgta cacaacgtgc
2040aggtttgtta catatgcata catgtgccat gttggtgtgc tgcacccatt aactcctcat
2100ttacattagg tatgtctcct aatgctatcc ctcccccctc ccctacccca taacaggccc
2160cagtgtgtga aaaaacgata gttagatgcc acgaact
21973813184DNAHomo sapiens 381ggtccatgtg ctccaaaaag atattctttt tttttaaata
tatatatatc ttttattata 60ctttaagttc tagggtacat gtacacaacg tgcaggtttg
ttacatatgc atacatgtgc 120catgttggtg tgctgcaccc attaactcct catttacatt
aggtatgtct cctaatgcta 180tccctccccc ctcccctacc ccataacagg ccccagtgtg
tgaaaaaacg atagttagat 240gccacgaact aggtggcaat gccttaaccg tatgtgtgtt
gtcaggcctg agggcctctt 300ccatccttgt caaggggagt actaaccttc tcccctttca
tacaacacaa agatattctt 360aagacttcta gaatagaccc tgaacaattt tagagtaagg
aactaataga tatcagtgct 420ttcatgaaga aggcttttgc ttctcctgat gagggaaaaa
ttataaaaat tctaagacag 480gaaagttatg atccaaactt gaaataaaca aatgtggtat
gaatttgggc aactgtggtt 540ctttaagaaa agagaatcca ccatgcaatt ttctttttct
tttttttttt tttttttttt 600tgagacaggg tctcactctg tcacccaggc tggagtgcag
tggtgatctc agttcactgc 660aacctctgct tcccggggtc aagtgattct catgcctcag
cgtcccaagc agctgggatt 720acaggtgccc gccacaacac ctggctaatt tttgtatttt
ttagtagaga caggatttca 780ccctgtttgt caggctggtc ttgaactccc gaactcaagt
gatctgccaa cctcagcctc 840ccaaagtgct gggattatag gcgtgagcca ccgcgcctgg
cccatcatag aattttctag 900gaatattgtc ctttgagagg tctagggtga tgacataatt
atacaagaaa acataatgtc 960ataacaattt aatattttta gtaattttaa atttgtgtca
tcaacctaca gacaaaggat 1020gggggttcag gtttctgaac agaatgtaaa ttttcaacct
caacaatgta aatatcaaag 1080tgaagctcac agaaaccaga aggtagaagt aggaaaagag
atggaggcaa gggtagggga 1140aaaaagtcaa gagacgttag tgaaaattga cagaattaaa
aacaaatagt ttaagaacag 1200aatctaaatg tataaaagta accaatggaa agaaaaccaa
tgatacaaca aaagtcatgg 1260taaaaagaag aaaaggagaa atggggtggt attaattagt
taaatcctta ttaatcataa 1320gcaataagta gacaatgcct acagttgata aattaagaat
tagcaatata cagaaatata 1380tatgaaacca aaataactaa tgaaagaaaa ggaggctggg
cacggtggct caggcctgta 1440atcccagcac tttgggaggc agaggcaggc agatcatttg
agtcccggag tttgagacca 1500gcctaagcaa cgtagtgaga cctcatcgct acaaaaaaac
agaaaaatta gctgggtgtg 1560gtggtgtatg cctgtattcc cagctacttc agaggctgag
gcaggagaat cagttgagcc 1620cagaaggtgg aagctacagt gagccaacag agtgagacca
tctcaaaaaa aatttaaaaa 1680aatgaagaag gaaggaagga agagagggag ggagggagcg
tgggcggggg ggggggggtg 1740gaggaggagg agagaaggag tgggaggagt ggagaaggag
ggggaggagg agaaggataa 1800aaggttacaa gtggttgtta ctaggaatgg gggagaagag
aagtgggtaa tggcactgaa 1860gctttttatt atgtctttca gcattctctg attgttctta
aaccatcaac agatctcagt 1920atgtagacta aaagggaata tttggtgaag agatcttctt
tcactattgt acacttgcta 1980tggacatgtc catgcctgct gcctggcagg caccattcat
taagtaggcc cctgttgcca 2040aggaaaccag ctcttcactg ataccaaaga taatgcagag
gcctgccgct caccaagcaa 2100ccttcctcat gagctatgcc cccaccttcc tgaactgtct
cttgctcctg tttgatactg 2160tcatgctgca cgaagcttac acttgctatc tctcacttcc
ctcttagtca tctgtgatgc 2220tggctaaggg agctaggcca gtcagcagtg acctgttgcc
cttggtttat tataagcaaa 2280ctgttcacaa gaaatgaact tctgttgttt tataaatgat
atgcatcaca gaacacagaa 2340taatatcaaa accacattag ttttttcata cttgcttcat
tgaccccagg ggaagagggg 2400agagcaggga gaggactttc tcttttttta aatactaatt
atattgaggt ataaagaaca 2460tatagtaagt tcacagacct taagtataca gtttgatgag
ttttggcaaa tatgtatacc 2520tgtggaacca acacctcagt caagatataa atacttacat
cagccgggcg cagtggctca 2580tgcccgtaat cctagcactt tgggaggcca aggcaggtgg
atcatgaggt caggagatcg 2640agaccatcct ggctgtccac taaaaataca aaaaattagc
caggcatggt ggcacatgcc 2700tgttgtccca gctactcagg aggttgaggc aggaaaatcg
cttgaacccg ggaggcagag 2760gttgcattga gccgagatag caccactgca ctccagcctg
ggcaacagag agagactccg 2820tctcaaaaaa acaaaaaaca acaaaaaaaa ccatacatcg
acccagaaag ttccttctgt 2880cagtagcagt tcaccccccc atgcccccaa cccttggcct
ccctgccttc ccatctccac 2940tcccaaccct cactgctctg attctatcac cattgttttg
attcttctgc tgttgatctt 3000cataaaacca gtatatttcc ttttgtgtct ggtttatttt
cctcagaata atgtttttaa 3060catttatcca tattgttatg tgtatcagtc gtttcttcca
gattagtact ctattgtatg 3120gatagagcct attttgttta cccatttcct gttgacagac
atttggtttg ttcccagttt 3180tgga
31843823103DNAHomo sapiens 382tggtttgttc ccagttttgg
attataatga ataaagctgc tatgaacatt cttgaacgat 60gaacattttt gtggacatat
gttttgattt ttttgtgtaa atacctagga gtgaaattat 120tgaggtatgg tataggttta
tgcttaattt tatagagtac ttaaacttga ttcttttatt 180taaaattgtg ataaaataca
cataacataa aatgaaccgt cttaactgtt tttaactgta 240cagtgcagtg gtacgaagca
cattcacatt gttgcacaac catcaccacc atccatctgc 300agaattattt ttatcttgca
aaactggaac tctgaaccag gtgcggtggc tcacgactgt 360aatctcagca ctttgagagg
ccgaagcagg aggatcgctt cagcccagga gtttgagacc 420agctggggca atatagtgag
acaccgtctc tataaaaaca aaataaaaat agaccaggcg 480cgatggctca tgcctgtaat
cccagcactt tgggaggcca tggtgggcag attgcctgag 540ctcaggagtt caagaccagc
ctgcccaaca tggtgaaacc ccatctgtac taaaaataca 600aaaaattacc tgggcatggt
ggcgcgcacc tgtagtcccg gttactctgg aggctgcagc 660aggagaatcg tttgaacctg
ggaggcggag gttgcagtga gccaagatcg tgccactgca 720ctacagcctg ggcaacagag
tgagactcta tctcagaaaa ataaaatagc tgggcgcggt 780ggctcatgcc tgtaatccca
gcactttggg aggctgaggc gggcggatca cgaggtcagg 840agattgagac catcctggct
aacacgtgaa accccgtctc tactaaaaat acaaaaaaat 900tagctgggag tagtggcggg
cgcctgtagt cccagctact caggaggctg aggcaggtga 960atggcatgaa cccgtgaggt
ggagcttgca gtgagccgag atcatgccac tgcactccag 1020cctgggcgac agagcgagac
tccatctcaa aataaataaa taaataaatg aaatgaaata 1080aaataaaata aaataaaaat
agccaagtat agtgatacac atctgtagtc ccagctactc 1140aggaagctga ggtgggaggg
tcacttgagc ccaggagttc aaggctgcag tgagctttga 1200gcgtgccatt gtactctccc
tgggtgacaa agcaaggccc tatctaaaac aaacaaacaa 1260gcaaacaaaa aaccccaaaa
ctggaactct gtatctatta aacagtaatc tctcattgag 1320tggtgttaag agtaaaattt
tttttaacaa aagaaaaaag taaaaagtaa attttgaaaa 1380aagaattaaa aacaaaaaat
ctccattacc ccctccccca gcccctggca accaccattc 1440tactttctgt ctttctgaat
ttgactactg cacataacct tatataggtg gaatcaaaca 1500gtatttgtct ttttgtgact
gacttatttc acttaggata gtgccctcag cttttaaaag 1560gaaagacatt ttgatatatg
ctacaacata atattccatt gtatgtacat accaaatttt 1620attaacgatt tcatctgtca
atgaacattt gggttgcttc caccttttgt ctattgtgaa 1680taatgctgcc gcgaacatgt
ttaagtcctt gctttcactt ttttgtgtat acacccagaa 1740gttgaaatgc tggattatat
gtaattctat ttttaatatg agtgactgcc atactgtttt 1800ctatagtggc tgtaccgttt
tacgttccca ctaagagaac atgagtgttc cagtttcacc 1860atatcctcac caacacttat
tttctgtttt gttggtggta gccatcctac tggatgtaaa 1920ctttattcat ttttcgaacc
tttttaatat ggaattttca aacacacaca aaagatgaga 1980gatctccagg tacccaccac
aagctttaat aatgattaac atttggtagc aggtggacaa 2040agatatacct tctctatagc
agctataaga tcagggacaa acaaagatct atttggaact 2100ccaactaaga atggtgtttt
gtaggctgcc tgatgaataa ggttagataa ctaatggcca 2160gtctttcagc ctgtgctcaa
gggataggat aacaataaag catagttggt gaaggagcag 2220cagataaagg tcacaataga
taggccataa gagaaccctc actatcactt accattcaga 2280ccattcgctt catattctaa
caagttattt tcctttcata aaaggaagct gaagctttta 2340tttgtgtttg tggtgcatgt
gatccatgag aggggactca accaggtgct atgtgtgagt 2400agtacttaat ccgacagtat
tagtgggctg gtgggctttc ctggttacat gggaacccta 2460gaaacccaag ccaagcacaa
aagccaagac tgaattctcc agtaagtcac ctggtagcct 2520tgacatgctc atgcttaaaa
aagagccagt gacctattaa taggaagctc ctgaaatgag 2580tcctctgaac atctgcaagt
atggtcagct acacctgagc tgagacttgc ctgtttccct 2640gccaggaaat catgggctca
gaaatggcag gtaccatgtg tattaactat atttccttac 2700tttctgtctt cttgatgttc
tagcatcagg tgcctctttg acctaagaga cttcccctcc 2760taggactagc taattcctag
aaatatcaaa ccactcccct gtaagcatgc cattcctatg 2820caaaccaacc aatccagagc
ccatactcga aaccacttcc tttacctggc tcttccacac 2880cagagggcaa tgttcctctg
tcctaatcat tctcagggct agatatcaga taactacaaa 2940tgctccttga cttatggtgg
ggttacatcc taataaaccc atcataagtt gaaaatatca 3000taagtcaaaa gtacatttaa
atcaggtgtg gtggcacatg cctgtagtcc cagctatctt 3060gggaggctga gacaggagga
tcacttgagg ctgtggtgca cta 31033832997DNAHomo sapiens
383ggatcacttg aggctgtggt gcactatgat catgcctgtg aatagccact gcaccccagc
60ctgggcaaca gagtgagacc tcatacctta aaaaaaaatt aaaaaacaag ctctcaccag
120tcgaagatga ggcaacataa gcaataagca ctaaaaagaa taatgactgc aaaccaaaat
180aaataagaag ataaaagtcc acaagtttat aaataaaagg ttttatttaa aagcccacaa
240gtaaaaaaga ggagagaaac gctaactcct aactctgaaa attagtaatt aaagggaaag
300aagccttcaa caattttttt ttttctgaga cagagtcttg ctctgtcact caggctgaag
360tacagtggca taatcacagc tcactgcacc ctcaaactcc tgggcttaag caattctcct
420gcctcagcct cccaagtggc ttggactaca ggtgtgtgcc tccacactcg gctgatttta
480aatgttttgt agagatgggg tctcactata ttgcccaggc tggtctccaa ctcctgagct
540caagcaaaac tcccacctca gcctcccaag tagccagggc cataggtgtg caccaccatg
600cccagctaat tttccttttt tccattttgt agagatgggc tctcgccatg ttgcccaagc
660tgttctccaa ctcctgagct cagacaatcc tcccgcctcg gcctcccagg tgtgagccac
720tatgcctggc caaaaatttt ttaatgaagt ccccctgggt ccaggcactg gtttgggcac
780tgaagattca gcagtaaagt aaaataaatt tcctcattga tcttgtcagt gactcctttg
840catgcttgct tcacactata tttcaaggta accaaatagt tgtagttaga aaaagttcca
900tcttacagaa aactttcagc taataaatgt agagggaatg ataaagttag aaaaataact
960atattttaag tcctaatgaa acaacagacc cacacaacaa tgaccaacgg atgaaaaata
1020tcaggtgaaa cacttatacg gaacctgtca gtggcaagat tgggctgtaa ccacctgaaa
1080ccactgacca atctcggcat tactaaaaca gggctgacca gatagtctgt gattctgatg
1140taaagcaata aggagtacat agcaccacct tttcagtagc caaaacagtt aaacctgaat
1200ctaatcaaga ctttagaatt acctttacga ttggatgaaa tatggagagc agaagaacaa
1260attcaacagc acaaaaagga agaaaacaga taaatctaga gtgggccacg ttctacaaaa
1320ctgagctggt ttcttggtca agacaatagc atggaaaaaa atgggaagta ggcaaagaga
1380ctcctctgga ttgaatgaaa tttaagagac acaatagcca agtatgatgt gtggaccttg
1440tttggatcta gatttgaaga aatcgattgt aaaaagtaat ttttgaaaac aaatagggaa
1500atctgaatat gggctaggca ttagttttta ccaaagaatt attattaggt gtgataatgg
1560tactgtaatt acgtaagatt gtaattattt atattgtttt tagagataaa aataagacct
1620tcagtgctga agaagggcac agtggcacat ggcacagcac agcatctaca tcatcagtca
1680aataagaatt tttttttttt tttttgagac agagtctcgc tctgtcggcc aggttggagt
1740gcagtggcaa gatctcgact cactgcaacc cctgcctccc gggctcaagc aattctcctg
1800cctcagcctc ccgagtagct gggattacag gggtgtgcca ccatgcccgg ctaatttttt
1860ttgtgttttt agtagagacg gggtttcacc atgttggcca ggctggtctt gaactcctga
1920cctcaggtaa tccacccgcc tcggcctccc aaagtgctgg gattataggc gtgagccacc
1980gtgcccggcc cagtcaaata attcaaggca gctgtcaggc taaagttcgg cgagcgacac
2040gcggctgggc ggcgggagga aacgcggggc cgggccgggc gctggagatg gtccccggcg
2100ccgcgggctg gtgttgtctc gtgctctggc ttcccacggc ttccgtatcc atgattattt
2160gtactttcaa gtgctgagtc ctggggacat tcgatacatc ttcacagcca cacctgccaa
2220ggacttcggt ggtatctttc acacaaggta tgagcagatt caccttgtcc ctgctgaacc
2280tccagaggcc tgcggggaac tcagcaggtt tcttcatcca ggaccagatc gctctggtgg
2340agagtggggg ctgctccctc ctctccaaga ctcgggtggt ccaagagcat ggcgggcggg
2400ccgtgatcat ctctgacaat gcggttgaca atgacagctt ctatgtggcg atgatccagg
2460acagtaccca gcgcacagct gacatctccg ccctctttct tctcagccga gaggctacat
2520gatccgccgc tccctggaac agcctgggct gccatgggcc atcatttcca tcccagtcaa
2580tgtcaccagt atccccacct ttgagctgca gcaaccgtcc tggtccttct ggtagaagag
2640tttgtcccac attccagcca taagtgactc tgagatggta aggggaaacc caggaatttt
2700gctatttaga atttgggaat agcatttggg gacaagtgga gccaggtaga ggaaaaggat
2760ttgggcgttg ctaggctgaa agagggaaac cacaccactg accttccctt ccccagggcc
2820cccaagggtg tcccagaaga ggtaagagac aggccccagg gcttctggat agaacctgaa
2880acaaaaggtg ctgaaggtag gtggcctgag agccatctgt gacctgtcac atctcacctg
2940gctccagcct cccctaccca gggtctctgc acagtgacct tcacagcagt tgttgga
29973842067DNAHomo sapiens 384caccactgac cttcccttcc ccagggcccc caagggtgtc
ccagaagagg taagagacag 60gccccagggc ttctggatag aacctgaaac aaaaggtgct
gaaggtaggt ggcctgagag 120ccatctgtga cctgtcacat ctcacctggc tccagcctcc
cctacccagg gtctctgcac 180agtgaccttc acagcagttg ttggagtggt ttaaagagcc
ggtgtttggg gactcaataa 240accctcattg cctttttagc aattaaaaaa aaaaggcaat
aaaaggcata atataggttt 300tagaaattta tatttataat gggtttgatg tacaataaag
atacattagt tattaaacaa 360ggtataaaaa tactcaattc aaggatatgg aaaaataatg
aaaaaaataa gaaaatagga 420agaattaatt ttaaaaagca gaagtcaatg aaatagaaaa
taataatact gatatatagg 480ctgggtgtgg tggctcatgc ctgtaatccc agcaatttag
gaggccaagg caggaggatt 540gcttgagcct aggagttgga gaacagcctg ggcaatatag
gaagacccca tctctacaaa 600aaatttaaaa tcagccagac atggaggtgt gcgcctgtag
acccagctgc aggggaggat 660cacttgagcc caggatcctg aagctgcagt gtgccatgtt
tgcaccactg cactccagcc 720tgggtgacag agggagaccc tgtcaggaag gaaagaagag
aggaaggaag gaaataataa 780taataatata taaatgcagg aataaattct tttaaaaaga
caaaaataat ctgtggtgag 840cctaattaag aaaaagagaa agcccatgag agagggagca
taacctgaga tacagagaaa 900acaaaaatgc taaaaataac tcaataaatt tgaaaacctt
aatgaaaaac tccctaggaa 960aatttgttaa aattgaaatt aattcaatat gtgtaagata
gaagaaatgg aaaagttgtc 1020agagaactac ctaaagtgaa gctgggtgcg gtggctgaca
cctgtaatcc cagcactttg 1080ggaggttggg ggcgagagga tcatttgagc tcaggagttc
aagaccagcc tgggcaacag 1140ggcaaaaacc ccatctccac caaaaaaaaa cattaaaatt
agccgggtgt ggtagagtgt 1200gcctgtagtt ccagctacta aggaggctgt ggtgggagga
tcacttgaac ctggaggtca 1260aggctgcagt gagttgtgat tatcccaccg cacagcctgg
gtgacagagt gagaccctgt 1320ctcaaaaaaa ccaaaccaaa ataaaccgaa aaaaaaaaaa
aacctaaagt gacaccatcc 1380tcattctttc ttaaaaaatg aattattggc cgggtgcggt
ggctcacgcc tgtaatccca 1440gcactttggg aggccaagtc gggtggatca cgaggtcagg
agatcgagac catcctggct 1500aacacggtga aaccccatct ctactaaaaa tacaaaaaat
tagctgggcg tggtggtgga 1560cacctgtagt cccagatact cgggaggctg agacaggaga
atggcgtgaa cccgggaggc 1620ggagcttgca gtgagccaag atcatgccac tgcactccag
cctgggcgac agagcaagac 1680tccgtctcaa aaaaaaaaaa aaattatttt actgatgtat
aataggtaca catagatttg 1740gagtacatgg gattaataaa gttcaaattg gtgtacttgg
gacatccatc accttaaata 1800tttgtctttt ctttacactg gaaacatcca agctattctc
ttctagctac tttgaaatgt 1860acaagattac tgtaaactat caaacactag gtcatatttc
ttctataaaa ccatatattt 1920gtatcagttg atcaacttct cttcctcgtc tcctcctgat
acctttcctg gcctctggta 1980accataaatc tactctctat cttcatgaga tccaattttt
tagtttccac atatgagtaa 2040gagcatgtga tatttgtctt tctgtgc
20673852054DNAHomo sapiens 385ctcttcctcg tctcctcctg
atacctttcc tggcctctgg taaccataaa tctactctct 60atcttcatga gatccaattt
tttagtttcc acatatgagt aagagcatgt gatatttgtc 120tttctgtgct tgacttattt
catttagcat gatgaccttt aattccatgt tgctacaaat 180gacaggattt catttttatg
gctgaataat attctatttt gtatatgtac cacatacaca 240ttttcttttt ccttttcttt
tttttttttt ttttttttga gatggagtct cgctctgttg 300cccaggctgg agtgcagtgg
ttccatctcg gctcactgca agctctgcct cctgggttca 360tgccattctc ctgccttagc
ctcccgagta gctgggacta caggcgcccg ccacaacgcc 420cggctagttt tttttgtttt
gtttttgttt tctgtatttt tagtagagat ggggtttcac 480cgtgttagcc aggatggtct
cgatctcctg accttgtgat ctgcccacct tggcctccca 540aagtgctggg attacaggcg
tgagccaccg tgcctggccc acatccacat tttctttacc 600tattcatccg ttgatgagca
ctttgattcc atatttgagc tattgtgagt agtgctgcaa 660caaacatgag agtgcagata
cctctttcgt atactgattt tctttctttt ggatatacac 720tcagtagtgg aattgctgga
tcatatggta gttctagatt tatgaagaaa cgccatactg 780ttctccatag tgactgtact
aatttacatt cccaccaaca gtgtacaagg gttccccttt 840ctccacatcc tcaccagcat
ccgttattgc ctgttgtttt gataaaagcc attttaactg 900gggtaagctg acatctcatt
gtagttttga tttgcattta tctaatgatt agtgatgttg 960agcacttctt catgtacctg
ttggccattt gtgtgtcttc ttttgagaac tgtctattca 1020gatcttttgt ccatttttaa
atcggatttt ttttctattt gtttgagctc cttgtatatt 1080ctggtcacta actccttgtt
agatgggtag tttgcaaata ttttctccta ttctgtgggt 1140tgtctcttta gtctgctgat
tgtttccttt actgtgccgc ttcttagctt gatgtaagct 1200cacttgtcta ctttcgcttt
ggttgcctgt gccgttgagg tcttacacaa aaaatttgcc 1260cagatcactg tcctgaagaa
gaaactgtct ccagtttctt ctaacagttt cacattagag 1320ttaagtcttt tttttttttc
tttaagacag aatctcgccc tgttgcccag gctggagtcc 1380aatggtgcga tctcggctca
ctgcaaccac agcctgtggg ttcacgccat tctcctgcct 1440cagcctcccg agtagctggg
actacaggtg tacgccatca tgcctggata attttttgta 1500ttttcagtag agatgggttt
tcaccatgct ggccaggctg gtctcgaact cctgacatcg 1560tgatctgccc gcctccgcgt
cccaaagtgc tgggattaca ggtgtgagcc accgcgccta 1620gcccagactt aggtctttaa
tcaattttga tgtgattttt ttttttgtat ggtgagagat 1680agtttagttt atttcttctg
catatagtta tccagttttc ccagtaacac ttactgaaga 1740gactgtcttt ttcccattgt
atattcttgg tacctttgtc aaagatgagt tggctgggtg 1800gatttacatg agttctctat
tctgttccat tggtctatgt ctctattttt atgccagtac 1860catgctaatt tggttactac
agctttgcag taaattttga agtcaggtag tgaaatgcct 1920tcagctttat tctttttgct
caggattgtt ttgtctatta ggggtctttt ctagttccac 1980ataaatttaa ggattttttt
tctatttctg tgaagaatgt cgttggtatt ttcacaggtt 2040ttgcattgaa ttgg
20543863000DNAHomo sapiens
386cgagcagctc tctcttcagg agtgaaggag gccacgggca agtcgccctg acgcagacgc
60tccaccaggg ccgcgcgctc gccgtccgcc acataccgct cgtagtattc gtgctcagcc
120tcgtagtggc gcctgacgtc gcgttcgcgg gtagctacga tgaggcggcg acagaccagg
180cacagggccc catcgccctc cggaggctcc accaccaaat aacgctgggt ccactcgggc
240cggaaaacta gagcctcgtc gacttccatc ttgcttcttt tgggcgtcat ccacattctg
300cgggaggcca caagagcagg gccaacgtta gaaaggccgc aaggggagag gaggagcctg
360agaagcgcca agcacctcct ccgctctgcg ccagatcacc tcagcagagg cacacaagcc
420cggttccggc atctctgctc ctattggctg gatatttcgt attccccgag ctcctaaaaa
480cgaaccaata ggaagagcgg acagcgatct ctaacgcgca agcgcatatc cttctaggta
540gcgggcagta gccgcttcag ggagggacga agagacccag caacccacag agttgagaaa
600tttgactggc attcaagctg tccaatcaat agctgccgct gaagggtggg gctggatggc
660gtaagctaca gctgaaggaa gaacgtgagc acgaggcact gaggtgattg gctgaaggca
720cttccgttga gcatctagac gtttccttgg ctcttctggc gccaaaatgt cgttcgtggc
780aggggttatt cggcggctgg acgagacagt ggtgaaccgc atcgcggcgg gggaagttat
840ccagcggcca gctaatgcta tcaaagagat gattgagaac tggtacggag ggagtcgagc
900cgggctcact taagggctac gacttaacgg gccgcgtcac tcaatggcgc ggacacgcct
960ctttgcccgg gcagaggcat gtacagcgca tgcccacaac ggcggaggcc gccgggttcc
1020ctgacgtgcc agtcaggcct tctccttttc cgcagaccgt gtgtttcttt accgctctcc
1080cccgagacct tttaagggtt gtttggagtg taagtggagg aatatacgta gtgttgtctt
1140aatggtaccg ttaactaagt aaggaagcca cttaatttaa aattatgtat gcagaacatg
1200cgaagttaaa agatgtataa aagcttaaga tggggagaaa aacctttttt cagagggtac
1260tgtgttactg ttttcttgct tttcattcat tccagaaatc atctgttcac atccaaaggc
1320acaattcatt ttgagtttct ttcaaaacaa atcgtttgta gttttaggac aggctgatgc
1380actttgggct tgacttctga ttaccctatt gttaaattag tgacccctct tagtgttttc
1440ctgtccttta tttcggagga cgcacttcga agataccaga ttttatgggt catccttgga
1500ttttgaagct tataactgtg acaaaaaatg tgaagggaag agatttgaaa catgtggaag
1560gaaaagtgag tgcagactat aaacttccaa aaagacaagc ccaaaataca cctaaacgtt
1620atgtcagatt attttgttaa aatcagttgt tagtgacgtc cgtacgttaa tagaaaaaag
1680aatgcttcag tttggagtgg taggtttcta gagggattta ttgtgaaagt ataaactatt
1740cagggcaatg ggactgagag aacagtgggt agaaaggacc actgaaggaa aggaagagaa
1800ttggaaggta gatgaaagaa ggagcaagaa cctggggatg ttttttcctt ttcacttgta
1860atagtagtaa cagaagcaat ggcagactgg cttttgtttc tactgtgtta gaatgaattg
1920acaggacaac tgggcctatt attgtactgt gccagaatac tgtaaaacaa aactaaacat
1980actagcttgg tggcttgtaa ttaattactt aagtggagat ttttattttt tttttatttt
2040ttttttagac ggagtctcac tttgtcaccc aggctggagt gcagtggcgc gatctcagct
2100gactgcaacc tcctcctcac aggttcaagg gagattctcc tgcctcagcc tcccgagtag
2160ctaggactat aggcatgtgc caccacacct ggctaatttt gtatttttag tagagatggg
2220atttctccat gttggtcagg ctggtgtcaa aactctcgat ctcaggtgaa ccgcctgcct
2280cagccttcca aagtgctggg attacaggcg tgagccaccg cgccctgcag ttttttgtat
2340ttttaataga gacagggttt caccatgtta gccaggatgg tctcgatttc ctgacctcag
2400gtgatctgcc cgctttggcc tcccaaagtg ctgggattac aagcatgagc caccgcgccc
2460ggctcaagtg gagattttta tatggagtcc agttatactc tttttaatat ataagttgag
2520atgactaata caacttcaat acaggggctc atgagaaatg tctgtaatat ttaagtaact
2580tattgtcttc tttctttttt ttttaagatg aagtcttact ctgttgccca ggcggaagtg
2640cagtggcgtg atcttggctc agggcaacct ctgcctcctg gtttcaagcg atcttcctgc
2700ctcagcctcc cgagtagctg ggagtacagg cgtgcatgac cacacccggc taattttttt
2760atttttagta gagacggggt ttctccatgt tggccgggct ggtcttgaac tcctgacctc
2820aggtgatccg cccacctcag cctccccaag tgttgggatt acaggtgtga gcccccgtgc
2880ccagcctatt atcttatttc tgaataaaga attgtctgtg tggggaatag ataactcttt
2940ctcatgcagc ccctgctaga aaatttgttt tctctagcag ttggtctgtg cttataggct
30003871977DNAHomo sapiens 387ttctctagca gttggtctgt gcttataggc tactctttga
aagcacaaaa aatttattga 60cttctttttt ttgggttttt tttttttttt gagacagagt
tttgcccttg ttgcccaggt 120tggagtgcaa tggcgcgatc tcagctcacc gcaacctcca
cctcctgggt tcaagtgatt 180ctcctgcctt agcctcctga gtagctggga ttacaggcat
gcgtcaccat gcctggctaa 240ttttgtattt ttagtacaaa tggggtttct ccatgttggt
caggctggtc tcaaactcct 300gacctcaggt gatccacccg ccttggcctc ccaaagtgct
gggattatgg gtgtgagcca 360ttgcgcctgg ccagaaaatt cattgacttc ctaaagattt
attaactttc tgcattactt 420ttttttttcc cctccatcgt aaatataaaa gggaatagta
gagaaaatca ttcagaattt 480tattttttag tgacattatt tagtgacatt ttattagagt
cacttaggaa cctgaggctg 540aataaagttc aggtaaaagt aaaattagtt gagaagagac
atctgccaaa agaaatctat 600ttttaacttc acttgctgtc tttcctagag gaacagaaat
agtgctgaat gtcctattag 660aaatgatggt tgctctgccc gtctcttccc tctctctcac
acaatatgta aactcataca 720gtgtatgagc ctgtaagaca aaggaaaaac acgttaatga
ggcactattg tttgtatttg 780gagtttgtta tcattgcttg gctcatatta aaatatgtac
attagagtag ttgcagactg 840ataaattatt ttctgtttga tttgccagtt tagatgcaaa
atccacaagt attcaagtga 900ttgttaaaga gggaggcctg aagttgattc agatccaaga
caatggcacc gggatcaggg 960taagtaaaac ctcaaagtag caggatgttt gtgcgcttca
tggaagagtc aggacctttc 1020tctgttctgg aaactaggct tttgcagatg ggattttttc
actgaaaaat tcaacaccaa 1080caataaatat ttattgagta cctattattt gctgggcact
gttcagggga tgtgtcagtg 1140aataaaatag attaaaatct attctcttct gatgcttaca
ttatagtggt gggagacaaa 1200atgggtataa taaatattat attagatagc attaagtgct
gtggagaaaa ctaaagcagg 1260gaggaagata ggagtgtgca agccagaaag gttgcaatta
aattgagtag ttcaggaagg 1320cttcaatatg gatgtgatat ttgagagacc ggtggaagtc
aaggagcaag ttgtgaggct 1380atttaaaggt attcttggct tacagaacaa tatacgcaaa
gactattaaa tggaagcata 1440cctgacatgt taaaggacta tcaaggaggc cagtttgtct
agaggctgaa aaggaaagag 1500taataggaga tgaggtctga gtgaaaacac gtaaatcctt
gtgggccaag gtaaaatctt 1560tagctttttt tctgaatatg gtgggatact gttagagggt
tttaagcaga ggttacgtgg 1620tgtggtgagt tttttttttt taatcctttg tctttctgtg
tggaaaatag caggacaggg 1680cagaagcagt ctgtcctgca gactgcttgg tcgcagtaga
gatgtaagaa gcagtgagat 1740tctgggttaa ttatggaggc aaagttctca gaatttgctg
atatagggta tgagagaaag 1800aggaatcagg aatgatttca aggttttggt ctgctaaatg
gaaggagttg ccatttacta 1860agatgggaaa gactatgaaa gaagcagatt ttcagagaga
tcagaagttc attttggggc 1920atgttcaatt taagatgcct gttagttgga tgtttatgtg
agtttggaat gcagggt 19773883091DNAHomo sapiens 388gttcattttg
gggcatgttc aatttaagat gcctgttagt tggatgttta tgtgagtttg 60gaatgcaggg
tagagattta gggatgaata tttggtagtt gtctgcattt taatggtatt 120aaaagccacg
agaaggatgg gcatggtggc tcacacctgt aatcccagca ctttgggagg 180ccaaggcggg
cagatcacct gaggtcggga gttcgagacc agcctgacca acatggagaa 240accccatctc
tactaaaaat atataattag ccgggcgtgg tggcacatgc ctgtaatccc 300agctactcgg
gaggctgagg caggagaatc gcttgaacct gggaggtgga ggttgcgatg 360agccgagatc
gcaccgttgc actccagctt gggcaacaag agcaaaactc catcaaaaaa 420aaaaaaaaaa
aaaaaaaaaa gccttgagac tcacctgaaa agatgctcaa cattattggt 480cattaggaaa
atgaatgaaa accacaatga gataccactt cacacctatt aggatggcta 540ttatcaaaaa
caaaaacaag tgtttgcaag gatgtagaga ttggaattct tgtgtattgc 600tagagggaat
gtaaaatagt gcagggtgct gtggaaaatg ctgtggtgat tcctcaaaaa 660attaaacata
attatataat ccagtaattc cacttctgag ttattcccaa aagaagggat 720gcaagcagat
atttgtacac tcatattcat ggcagcatta tttacagtag ccaaaaggtg 780aaagcaacct
aagtgtccgt cagtggatga atggataaac aaaatggaat aatttcagcc 840ttaaatagaa
ataaaatgtt gacacatgtt gcaacatata cgaaccttga agacatcatg 900ttaagttaaa
taagttggtc actaaaggac aaatattgta tgattcccct tatgaggttc 960ctagagtagt
cacattcata gagacagtag agtggtggtt gcccagggcc ggggggagcg 1020aggagaatgg
aaattattgt ttattgggta cagagtttct gtttggggaa gatgaaaaaa 1080ttctggagat
ggatcatgat gatagttaac acagcagtgt gaatatagtt aatggcacag 1140aactgtacat
ttaaaaatgg ttaagatgga aaattttctg ttacatatat tttactgcaa 1200tttttttaaa
ttttattatt atactttaag ttttagggta catgtgcaca acatgcaggt 1260ttgttacata
tgtatacatg tgccatgttg gtgtgctgca cccattaagt catcatttag 1320cattaggtat
atctcctaat gctatccctc ccccctcccc caccccacaa cagtccccag 1380tgtgtgatgt
tcccctttct gtgtccatgt gttctcattg ttcaattccc acctatgagt 1440gagcacatgc
agtgtttggt tttttgtcct tgtgatagtt tgctgagaat gatggtttcc 1500agcttcatcc
atgtccctgc aaaggacatg aactcatcat tttttgtggc tgcatagtat 1560tccatggtgt
atatgtgcca ccttttctta atccagtcta tcattgttgg acatttgggt 1620tggttccaag
tctttgctgt tgcgaatagt gctgcagtaa acatacgtgt gcatgtgtct 1680ttatagcagc
atgatttata atcctttggg tatataccca gtaatgggat ggctgggtca 1740aatggtattt
ctagttctag atccctgagg aattgccaca ctgacttcca caatggttga 1800actagtttac
agtcccacca acagtgtaaa agtgttccta tttctccaca tcctctccag 1860cacctgttgt
ttcctgactt tttaagatcg ccattctaac tggtgtgaga tggtatctca 1920ttgtggtttt
gatttgcatt tctctgatgg ccagtgatga tgagcatttc ttcatgtgtt 1980ttttggctgc
ataaatgtct tctttcgaga agtgtctgtt catatccttc actcactttt 2040tgatggggtt
gtttgttttt ttcttgtaaa tttgagttca ttgaaaaatt agaatttttt 2100tttttttccc
ttttttagag gcaaggtctc actctgtcgc ccacactgga gtgcagtagt 2160gtaagcatag
ctcactgtaa ccttgaactc ctgggctcaa gcaattctgt catctcagcc 2220agctgaagta
gtaactgtag gttcacacca ccatgcctat ttttgttttt gtagaaatag 2280ggccttgctt
tgttgccaag gctggtcttg aactcctgac ctcaagcagt cctcctgtct 2340cagcctccca
aagtgctggg attataggtg tgagccactg cacccagcct tggagatttt 2400taataaagaa
gcttgtcaat taaacaaaca acaaaaagcc ctgagactga atgagataat 2460caagagagta
tgtgtagata gagaagaggt ccaaggaagg agtcttgggt gactctgatg 2520tcaagtgagg
acatgaggca gaaacagcag tgactgagaa ggagccacct agtaagaaag 2580gaggaacacc
aggacagtgt ggtattctgg attccaaaca aggaagttac tgctaatttt 2640aaagctcttc
tcaggctggg catggtggct cacacctgta gtcccagcac ttcgggaggc 2700tgaggtaggt
aaatcacttg agctcatgtg tttgagacca gcttgggcaa catggtgaaa 2760cctcatctct
actaaaaata taagaaatta aggccaggtg tggtagttca tgcctgtaat 2820cccagtgctt
tgggaggtca aggcagccag atcatttgag atcaggagtt cgagaccagc 2880atggccagca
tagtgaagcc ccatctctac taaaaataca agaaaaaatt aaccaagcat 2940ggtggcgcat
acctgtaatc ccagccactc tggaggctga gacatgaaaa ttgcttgaac 3000ccgggaggcg
gaggttgcag tgagctgaga tctcgccact gcacttcagc ctgggtgaca 3060gagcaagact
ctgtctcaaa ggaggttgca g
30913892168DNAHomo sapiens 389tgtctcaaag gaggttgcag tgagctgaga tctcgccact
gcacttcagc ctgggtgaca 60gagcaagact ctgtctcaaa aaaaaaaaaa acaaaaacca
agaaaagaaa aaaaaactct 120tctaagagga tttttttttc ctggattaaa tcaagaaaat
gggaattcaa agagatttgg 180aaaaatgagt aacatgatta tttactcatc tttttggtat
ctaacagaaa gaagatctgg 240atattgtatg tgaaaggttc actactagta aactgcagtc
ctttgaggat ttagccagta 300tttctaccta tggctttcga ggtgaggtaa gctaaagatt
caagaaatgt gtaaaatatc 360ctcctgtgat gacattgtct gtcatttgtt agtatgtatt
tctcaacata gataaataag 420gtttggtacc ttttacttgt taaatgtatg caaatctgag
caaacttaat gaactttaac 480tttcaaagac tgagaattgt tcataaataa actattttac
ctgcagagac ctctgatata 540tgtttcttga tggaagtacc cagtaccacc tatgaagttt
tcttgtcaaa aaatcaaatg 600tgaatctgat cattacttag atctaagtac caatatatga
aaaatatagg agacaaggaa 660gcatggtaaa tgatactgag attgggagac tacatggaaa
aagacttgtt cccttcaaca 720gatagacagc agggaaaaaa gaatagagaa aggagtaaag
aacctgtaga ttaaaagaca 780tttaagggac atatgaacca ggtccagtgt atagatctta
cctaaatcct gatggagcaa 840actataaaaa aatttttttg agacaaatgt ttgaatacag
gttgactatt tgatggcatt 900aaggagaaat tatgaattat cttggtataa gaatattgtc
atgggttttt ttttttgagt 960ccttacctgt taagatacat actaaaatat ttgtgggtaa
aattatatga cgtataggag 1020tatatgattt agaaaacgga ttaaaatata aaaggataaa
ataggatctt atattttgtg 1080actcacttcc tgttggatat ctttctaccc agtaaatata
gtcctatcta ggttttaatg 1140gctacatgta tgtactgtag tttgtttaaa tggtttccta
ttgaacattt atgctctttg 1200ccattttttc ctgtttaacg ttctgttttt ttttttgttt
tttttttttt ttgagacagt 1260cttgctctgt tatccagact ggagtgcagt gacatgatct
cagctcactg caacctctgc 1320cttctgggtt caagctattc tcctgcctca gcctcctgaa
tagctgtgat tacaggcgtg 1380caccactatg cccagctaat ttttgtattt tgggtagaga
cagggtttgg ccatgttggc 1440caggctggtc ttgaactcct gaccttgaat gatctgcccg
ccttggcctt gcaaagtgct 1500ggggttacag gcatgagcca ccacgtctgg ccttgtttaa
ggtcctgatg agtattctta 1560taggtacact gtgtttcgtt taattatttc cttaggataa
atttatagaa ataacattcc 1620ttggtaaaag aatacatatt ttaaaaactg tattagtttc
ctgttgctgt caaaaaattt 1680ccagaaactt agtggcatta aacaatacaa attaattatt
ctacagttct ggagatcaga 1740agatacgggt cttactaggc ctcactaggc taaaatcaag
gttttggcag ggctgtgttc 1800ctctatggag gttccaaggg accagagaaa ctactttaca
gtagttattt taagggaatg 1860aaagtgaaga tggggttggg cagtcaaaga ggctgttact
tttcattttt ggcctttcag 1920tagtttgaat ttttttatca tatacatgta ttactttaat
ttttaaaaag taaaaagcag 1980ctgtgattca gtctctgtaa tttagatcaa tttacatcaa
actagggtgg tctcatgtgt 2040tgtcttgctc acagtgacca ctagattatt ccaagaaggg
acaatttcca agacttggtt 2100tacactgaga cggctcctga ttttaaggat accttagatc
aaactctagg aaggcagttt 2160cattttgg
21683902008DNAHomo sapiens 390agttccctgg gtcattttcc
aagcccatgg cctcctggag tcttcgccta gctgtaggtt 60atctttgtgg ctattatttc
actgtaatta tacaggaaga tttattgagg gatttctgtg 120taccagccgt ggttctcagc
actttgtata ctttgtatta actctgactc ctgacagtaa 180ctctacagag gttctgctgt
tacccagttt tacatagaaa catggccagc ggacgcagtt 240agaaaatggc aaagtgggga
ttagaaacta ggcagtttga ctccagagtc tgtgcccctg 300tccacttggc tccactgctg
gggaagaggc ctctgaagca gcaggaccat ctgctgtgcc 360gtgtgtagtg gtactctatc
ttcctggtgt gatgttgtgt tctactttgc attttcatgt 420ctttccttat acaggtctca
aaatcattta cttttttttt tttttttttg agacggagtc 480tcactctgtt gcccaggcta
gagtgtagtg gcatagtctc actcactgca acctccgcct 540ccgaggttca agtaattctc
ctgcctcagc ctcccaagta gctcggatta caggcacatg 600ccaccacagc tagcaaattt
ttgtattttt agtagagatt ggtgtttcac catgttggcc 660aggctgttct tgaactcctg
acctcaggtg atccacccac ctaggcctcc caaagtgctg 720ggattacagg cgtgagccac
cccacccagc cttatatttt ttaatgatgc acattagctc 780aattacataa accagggaaa
tccagctagg acctggtgat ttctgagcct gacccatgtg 840actttcaatg aactgaactt
gccacagctg tatttactgt ctactgagat gctgtcacac 900agaccccgtc atagcacagt
tcctgagtta catctttaca tactgtagta tccttcttgt 960gaaaaaagat acagattcca
aaggtctgag aaaccaatct tggttataaa ggggaaaaat 1020ggtcatgggt ttttaaaatt
tgttttgtct taattgcatt tcaaatttac atttctaaat 1080gaataattgc ttatataaag
cagttttgat taacaatata aaacactatc tatttggagt 1140gattccttta cccatttctg
aaggcaagtt ttaaaaatta ctagaagaca cttcattgag 1200aatattatta aacatgccta
tagttctacc acctcaacac aattgcttat taacacatta 1260atgttttggt gtgttttgga
ctttttaata tgtatttttc acttgttcta gtaattatgc 1320tacagattga tcatttcttt
ttcaacatgt catcaaagca agtgagcaaa gtgctcatcg 1380ttgccacata ttaatacaaa
atggaagcag cagttcagat aacctttccc tttggtgagg 1440tgacagtggg tgacccagca
gtgagttttt ctttcagtct attttctttt cttccttagg 1500ctttggccag cataagccat
gtggctcatg ttactattac aacgaaaaca gctgatggaa 1560agtgtgcata caggtatagt
gctgacttct tttactcata tatattcatt ctgaaatgta 1620ttttttgcct aggtctcaga
gtaatcctgt ctcaacacca gtgttatctt ttttggcaga 1680gatcttgagt acgttttctt
ttctccttat tgataaattg ataatcctca aggatgatta 1740ttaggtgata ctcttacttc
atggattctt aaaagatatg atttaacata ttacaagtgc 1800ctagcaaggt gtctgttaca
cgtaggtatt ttaagtaaat ggtagctgct gatgtaattt 1860ctgccccttt gcccttcagt
tggggtattg ctttggaccg attagagggc tgtggctggg 1920atgctaaagg ttcatgtttc
cttagctggc tcctgagcca ccagctccca ccacctgtgt 1980atacctgtgc tagtttgcct
tcccacaa 20083913197DNAHomo sapiens
391cctgtgctag tttgccttcc cacaagtagc tgctggctat ctgttatgct ggtacagttt
60tcagaaactg atgaatggcc tttgaacaga acaaaaatga gattcagaat aacaaaattg
120cacctttgtt tttataagca ctggccattc actagttgaa gactggtagg aatacctaat
180tcatgccaaa agaaagataa tttttaaaaa tcacacaggt tgtttgtaga ttaaaaggga
240aaataggcta ggtatagtgg ctttgcctgt gagtttggga ggctgaagtg ggaggattgc
300ttgaagtcag gagtttgaga ccagcctggg aaacagagca agaccccgtc tctacagaaa
360atttttaaaa aattagctgg gcatggtgat gcatatctgt agtcttagct actccggagg
420tgggaagatt gcttgagccc agcagtttga ggctgcagtg agctgtgatt acaccactgt
480actccaacct taaaataaat aaataaataa gggaaaatat cttcaacaaa ggatagttct
540gtctgtttct cagtcttcct caacagataa atgtgtgaag taatggaagg tggagatttc
600agattacaca acattaatgc taagggcgtt tgactctgtg tgaattctaa ttgccctaga
660tctagacggg ctgatactat tagaatcccc tgtcactaac tgaagacaga gttgtaagtt
720aatgccttcc tagatagcct agattgtggt atgctgctgc atgctaaaat ggctcccctt
780ccatagcagg atgaaataga gtcattatct tggcaaccag cccctgccaa tgtgctctca
840gtctgccttt ccagcccctt ctctctacct attcccagct gccatgtatt ctaaagcctc
900tatgctttca tttttgtttt tgccttcctg gatggtcttt cctgctgtct ccacctgaaa
960ctattcctct ctaaagaaca gatgaattgc catctctctg ggatgctttt acccaccctc
1020actcccacct caggctgaat ggacccttct ctagatcgct tagcatattg ttctacagtt
1080aggtaaaaag tctacctatc actagatcaa gagctttgtt tttttttatt aatttaattt
1140tctttttttt ttttcttttt tttttgagac agagtctcgc tctgtcgccc aggctggagt
1200gcagtgcaca atcttggctc actgcaagct ccgcctccca ggttcacacc attctcctgc
1260ctcagcctcc cgagtagccg ggactacagg cgcccaccac cacgcccagc taattttttg
1320tatttttagt agagacgggg tttcaccatg ttagttagcc aggatggtct cgatctcctg
1380acctcgtgat ccacccacct cggcctccca aagcactggg attacaggca tgagccaccg
1440cgccgagccc caagaccttt ctttattacc agggcttcca cagacctgac acatggtagt
1500tcctcaataa ataattgcag aattactgaa aaattttact gttaacttag gcagtggtaa
1560aaccattgtt tggtagctca gaactcagca agtaaatagc aacatttgct ggaagaacag
1620atagtttttc aaatccaatt caaggactgg gtatggtggc tcatgcctgt aatcccagca
1680ctttgggagg ccgaggcagg cgtatccagg agttcgagac tagcctgacc aacatggtga
1740aactccgtct ctactaaaaa tacaaaatta gccaggtgtg gtggtgggca cctgtaatct
1800cagctacttg ggaggctgag gcaggagaat cgcttgaacc tggtaggcgg aggttgtagt
1860gagctgagat tgtgccattg ctctccagcc tgggaaacaa gagcaaaact ccgtctcaaa
1920aaaaaaaaaa atccaattca aatgattatg gaagtagtgg agaaataaac aggaaaatga
1980taaataatta agataatata taatatggct atattttaat ctattgttga tatgattttc
2040tcttttcccc ttgggattag tatctatctc tctactggat attaatttgt tatattttct
2100cattagagca agttactcag atggaaaact gaaagcccct cctaaaccat gtgctggcaa
2160tcaagggacc cagatcacgg taagaatggt acatgggaga gtaaattgtt gaagctttgt
2220ttgtataaat attggaataa aaaataaaat tgcttctaag ttttcagggt aataataaaa
2280tgaatttgca ctagttaatg gaggtcccaa gatatcctct aagcaagata aatgactatt
2340ggcttttgtg gcatggcagc ctgccacgtc cttgtctttt ttaagggcta ggagattctt
2400tattgggatg gcaaaagtca atggcagggt agttgtcatt gaaagaagat taagcttgac
2460cccagaaggc atgggttaga gcccagcctt gtcactcaat ggttgtatgt ccagaggcaa
2520gtcacttaac atcccttaac cccagttttc tcatctgtca aatgaagcaa agaatacttg
2580ccctcttgac ttaaagggtg tctgatgaga catatgactg tatcattagc tgggagaaag
2640tccatcgtgc tgcctatgta tagtgcctca agttggtctc tttcccttct atgattacac
2700aaagcactcc gctgtcatgt tatccatccc gcccctccat tccaagtccc atctagagca
2760catcttcttg aagtccactg taacctgcct aatcctggat gtgacgagcc aggcaggagg
2820cagaaaagaa tgtgtgtttt gcaatacatg ttaagagaca tcttgggctg ggcacggtgg
2880ctcacacctg taatctcagc actttgggag gctgaggagg gcggatcatc tgaggttggg
2940agttcgagac cagcctgacc aacatggaga aaccccatct ctactaaaaa tacaaaatta
3000gccaggcgtg atggcgcatg cctgtaatcc cagctactca ggaaggctga ggcaggagaa
3060ttgcttgaac ccgggaggca gaggttgtgg tgagttgaga tcatgccact gcactccagc
3120ctgggcaaca agagtgaaac agggtctcaa aaacaaaaac aaacaaacaa aaaaaatctt
3180ttaccacggt gaccacc
31973922964DNAHomo sapiens 392gaccaccatg tgatttccaa gaacttcaaa tgatctaaga
aattttgtga ttattactag 60tttgaaaaat actttttttt tttttgagac aaagtctcac
tctgttgccc aggctgaagt 120gcagtggtgt gatctcagct cactgcaatc actacctctt
gagttcaagc agttgtcctg 180cctcagcctc ttgagtacct gggattacag gcatgcgtca
ccatgcccgg ctaatttttg 240tatttttagt agagacaggg tttcaccatg ttggccaggc
tggtctcgaa ctcctgacct 300caggtgaccc acccaccttg gcctcccaaa gttctgggat
tacagacgtg agccactgca 360cccagcctga aaaatatctt tgaatgccat gtgatactat
acttgtcagt ttacatgtgt 420gtcccactaa atcatgtact ctcctgagca ggatcatgct
ttgtcttcat attttctgta 480caaagcaaag actctgacac aaagctagcc cccagtgcat
agttgagaaa tcagtgaatg 540aatgtgggag gcaggaaaaa tgtcctttaa ttcttctgtt
aatgctgtct tatccctggc 600cccagtcagt gcttagaact gtgctgttgg taaatataat
tggattcact atcttaagac 660ctcgcttttg ccaggacatc ttgggtttta ttttcaagta
cttctatgaa tttacaagaa 720aaatcaatct tctgttcagg tggaggacct tttttacaac
atagccacga ggagaaaagc 780tttaaaaaat ccaagtgaag aatatgggaa aattttggaa
gttgttggca ggtacagtcc 840aaaatctggg agtgggtctc tgagatttgt catcaaagta
atgtgttcta gtgctcatac 900attgaacagt tgctgagcta gatggtgaaa agtaaaacta
gcttacagat agtttctggt 960caaggtttag ccaccaattt tgcagtttct ctcatctccc
caggaaagag cagttggtct 1020ttagatcaat gagagctctt ttatggcaga caaaacaaag
tgactctagc caacttgagc 1080taaaaagaaa tttagtggaa ggctaggagt taccacatga
agtgtgtgca gctgcccctt 1140ggagagaata agaaccaggg tgcctctggg acttaacatc
attactgtac tccagttgtt 1200ttcattcttt tcctgacttt gctctagagt cagtttccta
acagagtaca ttcgatgatc 1260atgtgcccat atctgtgggg agaagatttc ttgattggca
gtcttactaa gggtgcatat 1320caagtagaat ggaatagagg tagtttccta aaggaagatg
agaggctgtt accaggagga 1380ggagaaggga ttcaggacag atgaaaacaa cgttatatcc
atgatagact tacgctgctg 1440gtacagatgg tacaggtggc ttcagtatag gctctccgaa
cccacatatc attgattatg 1500atagggatat gttaactatt tttcagtgta tatatgtata
tgtgtgtgtg tatatatatg 1560tatatgtata tatatatgta tgtgtatata tgtatatgta
tatatttata tatgtatatg 1620tatatattta tatatgtata tgtatatatt tatatatgta
tatgtatata tatttatata 1680tgtatatgtg tgtatatata tatttatata tatgtatatg
tgtgtatata tatatatttt 1740tttttgaaac ggaatttcgc tcttgttgcc caggctggag
tgcaatggtg cgatctcagc 1800tcactgcaac ctctgcctcc tgggttcaag cgattctcct
gtctcagcct cccgagtagc 1860tgggattaca ggcacttgcc accatgcccg gcaatttttt
ttttgttttt ttttagtaga 1920gagggggttt aatcattttg gccaggctgg tcttgaactc
ctgacctcag gtgatctgcc 1980tgccttggcc tcctaaagtg ctgggattac aggcgtgagc
caccatgcct ggccattttt 2040cagtatttct tttttttttt tttttttttt tttttttgag
acagagtttc actcttgttg 2100cacaggctgg agtacaatgg tgtgatctcg gctcaccgca
acctctactt cccaggttca 2160agcaattcgc ctgcctcagc cttctcaagt agctgggatt
acaggcatat gccaccatgc 2220ccggctaatt ttgtgttttt agtagagatg gggtttctcc
atgttggtca ggctagtctc 2280aaactcccga cctcagatga tcctcccgcc ttggcctccc
agagtgctgg gattactggc 2340atgagccagc gctcctggcc catttttcag tatttctaaa
aaaaatctaa agtgggtcaa 2400acatttcacc ttaatagaat gacaggtttg tacatcaagt
ttctttgctt tttcttggaa 2460ttttatactt tttttttttt tttggagaca gagtcttgct
gtgttaccca ggctggagtg 2520cagtggtgcg atctcagctc accacaacct ccacctccag
gttgaagcaa ttctcctacc 2580tcagcctcct gagtagctgg gattacaggc acatgccacc
acacccggct aatttttttt 2640ttttttttgt atttttagta gagacagggt ttcaccatgt
tgtccaggct ggtctcgaac 2700tcctgacctc aggtgatccg cccatctcgg cccaccaaag
tgctgggatt acaggcgtga 2760gccactgcac ccggcctttt tcttggaatt ttatcaatca
gtgtcagaat attcattacc 2820tcctaaaaat aaaggagttc tagttggctg ttttgattct
aggtgtggta aagtgaaata 2880ttgttactta ataaatgcat tttgctagac acaatccttc
ggttcacgag ctctgtagag 2940aaaagagaaa taaccgccaa ccaa
29643932864DNAHomo sapiens 393taaccgccaa ccaagaaaag
attgggagat actagaataa gacccagggg caggaagaag 60ccagtgagaa ggagggcatg
ttgagagctc tgagagagaa taaaagcagg ggttgttgga 120gctagcttct caagatgtcc
ttgaggcaaa ccagaccttt gggacactct gaaaataaaa 180ctgaaagtga agagattgtg
ggccgaatgt ggtggctcac gcctgtaatc ccagcacttt 240gggaggtcga ggcgggtgga
tcacctgaga tcaggagttc gataccagcc tggccaacat 300ggcgaaacgc catctctact
aaaaatacaa aaaaaattag ctgggcctgg tggcaggcgc 360ctataatccc agctactcgg
gaggctgagg cgggagaatc gcttgagtcc aggaggcgga 420ggttgcagtg agctgagatc
gtgccattgc actccagcct gggcaacaag agcaaaactc 480tgtctcaaaa ataaataaaa
ataaataaaa aagagatagt ggcgtgatat ccttgattct 540atcagcaacc tataaaagta
gagaggagtc tgtgttttga ttcagtcacc tttagcattt 600ttatttccat gaagtttctg
ctggtttatt tttctgtggg taaaatatta ataggctgta 660tggagatatt tttctttata
tgtacctttg tttagattac tcaactccac taatttattt 720aactaaaagg gggctctgac
atctagtgtg tgtttttggc aactcttttc ttactctttt 780gtttttcttt tccaggtatt
cagtacacaa tgcaggcatt agtttctcag ttaaaaaagt 840aagttcttgg tttatggggg
atggttttgt tttatgaaaa gaaaaaaggg gatttttaat 900agtttgctgg tggagataag
gttatgatgt ttcagtctca gccatgagac aataaatcct 960tgtgtcttct gctgtttgtt
tatcagcaag gagagacagt agctgatgtt aggacactac 1020ccaatgcctc aaccgtggac
aatattcgct ccatctttgg aaatgctgtt agtcggtatg 1080tcgataacct atataaaaaa
atcttttaca tttattatct tggtttatca ttccatcaca 1140ttattttgga acctttcaag
atattatgtg tgttaagagt ttgctttagt caaatacaca 1200ggcttgtttt atgcttcaga
tttgttaatg gagttcttat ttcacgtaat caacactttc 1260taggtgtatg taatctccta
gattctgtgg cgtgaatcat gtgttctttc aaggtcttag 1320tcttgaaaat atttatagtg
tagtagaact attttatcct ccaatgctcc ttcttttcct 1380tgtatttcca ttatcatcac
tttaggattt cacttattta tcattcaaca tttattaatt 1440gcctctcata ttccaggctt
tgtgctagaa gttagggata taaagacaaa taagatattt 1500cctgccctta aagactagat
tcgtgttgct aagtcttcat tatcaagaaa agcataagtg 1560gggaaaagtg cttgcattat
ggattcctca tagttgctcc cctctgcatg taaaaatcac 1620catttccatc atagattcct
agcggtctca ggactttata aagcccaaag tgcctatgtc 1680ataatatgag gaaaaatact
gagacccttc catatatggg aggtatatgg atgagacagc 1740tcctgacttc acttttccca
gaaatctgaa aagcagcagc agtcattcca gagcccagtt 1800tctactttga agggcagatt
atttattctt tgagctaacc tgactgagga acaattagtt 1860tgcttttaat ttactatttt
ctttttcttt tcttttcttt tttgagacag agtctcactc 1920tgttgcctag gctggagtgc
agtggctcaa acttggctca ctgcaagctc cgcctcccgg 1980gttcacgcca ttctcctgcc
tcagcctccc gagtagctgg gactacaggc gcctgtcacc 2040acacccagct aattttttgt
attttttagt agagacgggg tttcatcgtg ttagccagga 2100tgatctcgat ctccagacct
cgtgatccac ccacctcggc ctcccaaagt gctgggatta 2160caggcgtgag ccaccgtgcc
cagccactat tttctttcta attgttaatg aattaatttt 2220ttaaaactgt gctcctagag
cgaagggaga gctctgttta cagtgtaact tttcagagct 2280tctttaacta gattttaaga
tcagaattag ttgttgtgaa atcttaggga ctgtacaaga 2340ttagaaatcc tctatagcag
catttcccaa agcaggcttc cagaacacta gcctcatgag 2400gcattttggg aaaaaagagt
ttgctggttc agtgtgtatg ggcagtgcca caagccgtac 2460cctccgttga agacactcat
tccacacatt actgcataaa aagcttccac cagccattcg 2520gcaaacttat tgagtgtctg
ctatttcctg ggtattgtgc tatatggtag ggttatagta 2580gtgaacaaag aagaaatgat
gcctgctctc agctgacttt gcagttggaa agacacatga 2640aataattacg ccattcatta
gcagattgtg ctagatgcct cactggaaaa ataaaggaca 2700tgatggaaaa ctctgtaggg
tcagagaaag ggatcattag agaaggttct ttgaagaaat 2760attttttgaa atatgaagga
taaataggaa ttaactaggt accaataggt taggagtaga 2820gctttccaga cagagggact
agttcttggg aaggtctcca gaca 28643943013DNAHomo sapiens
394tgtgctagat gcctcactgg aaaaataaag gacatgatgg aaaactctgt agggtcagag
60aaagggatca ttagagaagg ttctttgaag aaatattttt tgaaatatga aggataaata
120ggaattaact aggtaccaat aggttaggag tagagctttc cagacagagg gactagttct
180tgggaaggtc tccagacaga aataagtgtg gcttgtctga ggacctctta ttcgcctatt
240aaccttccct ccccagtaaa cactcctggg aacaacacac attgtagaac cacgttgtgg
300tgctgttcag tatagcaagt aattcagcag agataagttc ttggaatctc atctttggga
360tttagttact aagatacatt caagtttgag caaaataagg tctcagagct tggattcatt
420gttctgttcc agcaattaga gcagtacctg gcacatagca caagtgcttg aaaacactga
480ctgagtaggg taggtgggtg agtgggtggg tgggtgggtg ggtggatgga tggatgggag
540gatgggtggg tgaatgggtg aacagacaaa tggatggatg aatggacagg cacaggagga
600cctcaaatgg accaagtctt cggggccctc atttcacaaa gttagtttat gggaaggaac
660cttgtgtttt taaattctga ttcttttgta atgtttgagt tttgagtatt ttcaaaagct
720tcagaatctc ttttctaata gagaactgat agaaattgga tgtgaggata aaaccctagc
780cttcaaaatg aatggttaca tatccaatgc aaactactca gtgaagaagt gcatcttctt
840actcttcatc aaccgtaagt taaaaagaac cacatgggaa atccactcac aggaaacacc
900cacagggaat tttatgggac catggaaaaa tttctgatcc ataggtttga ttaaacatgg
960agaaacctca tggcaaagtt tggttttatt gggaagcatg tataattttt gtcctaagtc
1020tgtgctcagc cctcccacat gtgctcattg ctggttgact gttggagtct ggttcttacc
1080tctaagagga agcccaggag agggcataaa gccagcacac tgtcctcacc tgatggtgtc
1140agagtcctta cgagtaagcc ctagccagaa cattgctgga agagatcaag ggccactgtt
1200tgaaattgca cagcaggata cggaaaaggg gtaccttagg tataggcatt gtcattaaag
1260aaattgctaa gatacttgag attttcctgt ttaaggaatg agctttatga tacaaagagc
1320agttctaaaa attagggagg gaattaacta aattaattag gatatttctc aaattccttt
1380acagtttttg tctctctgct gatatagtgt ttacatgatt gttatttact aaacaaatgc
1440tattttgtat tgtgctcctt ataacttaat tgtttattac aaggttttga tggtgaccta
1500ccaacaacaa gtaatcccaa acacagtctg aattttttgt tttccatcca gaaataagat
1560gaatctttcc atttccgtgt tttcagtttt catcattttt atcctatagg ttacttatct
1620ttattttaaa gcatttcata ataattttat agtttttgtt ttgtttgctt gtttgctgtt
1680ggaaatggaa tattccctcc ttccatttag actgctaacc agctgtaaat gtttcaaaat
1740atgcatgttt tacagcagtt gttcaaagca atacaggaac agtaaggaca gagccagtca
1800ttttacaacc acattctgtt aaactgatgt ctattagcag ggtttttcct attttattag
1860gaaggactta cacctgatat ataacaaagc ttgttttaat caaggctcag aaaatgtttt
1920tcattagttt ttttcctaac catgaagaat aactgctttg taacacacat gctggctata
1980aagcagacaa aaaattcact gtaggtgctg cctgactggc ctctgtccgt gtttctgttg
2040gggctgctta ccacagcctc tgcattatca ttagctagtg tgttcacaat accaagttcc
2100cagtagcaaa gaaaggtcaa gctcttacgc atgccattca tttatctaca ctgtgcaggc
2160gcactcaggt ggcagggaca aagaccactc ctttggcgca tctcaagttc agaattctca
2220gtagaggggc tccagctgtc cttttgtcag gtgcccatgc ctgctccagg cctgtgtggt
2280caggacacgt gttacagagt acagtgacat taatgatggg gccatggata tggtcagcac
2340tcagaggatg ttagtctctt cattgataaa gtcacaacca cttttcctgt tggaaataaa
2400aagatttgac gtatccttgt ctacagcaac acaggacaac agataatcag caggtcatct
2460aaatctgttc agagagaaag gagagctgtt tcctgaaaat acatcttccc ctgattttag
2520tcttattttt ttctgccttt attgctttct accctcttca aaccagcctc atttcctaaa
2580ttaccttgaa tatgcattga cacttgtact gcctgaaatt ctggaaaact cagtatggct
2640actccaccgt cagaacttcc tgagcaaagt tagttgctct ctcggctcac tgttttgttt
2700tgttttgttt tcctgcctca ggtttatttg tacaaatagc acaggaggac cagccccatg
2760cagatggtag cccaggggcg ggggtagggg gtcacaccag tccttctgtc ctcatgttgg
2820cagagatatc tactctgaag cctttgtagg ggcctgggca cctttgggag cctgagctgg
2880aactgaaggt ggagctgcag cctgggcctt ggtttgatcc ttggccttgg cctttggccg
2940gcacagcctg agccccttgg caatacgggc acgagcacgc ttcccaagct tgggatgggc
3000aatgtaggca agt
30133952915DNAHomo sapiens 395atgggcaatg taggcaagtc gatcgagctt gcggctgaca
ccctttggga tcttgggctt 60aacctccttg ggctttacga gggccttgat agcctcggca
cgtgcactca tggccttggc 120attgttggcc tgcatcttct ttaggccctt cttgttgtgc
ttcttggcaa agtgcatgtt 180cctcaggaac ttggggtcca cccccttaag agattcgtat
ctttgtgatc ggggtttctt 240gataccattt ctgtgccatt ttcgggactg gttgtgtgtg
gtgtggttct tggacttcgc 300catgtctaca ccttaagccg cggctcccga agcacctaga
accggaagag ttggctcact 360atttagcaca cacacacgtc tataatagtg ctggccactt
ggggttggaa ttagtttatt 420tatcagcatg ttgtctccca gcacttggtg tgtgtgatat
gcagtatgta tttgcagaat 480gaaaagtctg agggctgaca tcatatttcc cactgtgccc
agaaagagca cagttagtcc 540acatgagcta atgggggcaa agggaagtga ggagggagaa
tgtactgcct tatcatgttt 600tctattactt ggctgaagta aaacagtccc aagccgatag
taagatagtg ggctggaaag 660tggcgacagg taaaggtgca cctttcttcc tggggatgtg
atgtgcatat cactacagaa 720atgtctttcc tgaggtgatt tcatgacttt gtgtgaatgt
acacctgtga cctcacccct 780caggacagtt ttgaactggt tgctttcttt ttattgttta
gatcgtctgg tagaatcaac 840ttccttgaga aaagccatag aaacagtgta tgcagcctat
ttgcccaaaa acacacaccc 900attcctgtac ctcaggtaat gtagcaccaa actcctcaac
caagactcac aaggaacaga 960tgttctatca ggctctcctc tttgaaagag atgagcatgc
taatagtaca atcagagtga 1020atcccataca ccactggcaa aaggatgttc tgtcccttct
tacaggtaca aggcacagtt 1080ttccttcatt tattcactaa tttagcagaa cctcactaag
agcctcctat atgccaggct 1140ctgcgttagc aataaaagga atgccatgcc tcaccccatc
aggaggtgct gatagcttgt 1200aggcggagtg gaaacagatg tgctctagag gctctaaata
ttacttctgc tggggtcagt 1260tgggaagcca caacagctac tgttcatctt ccataaaaga
caatcagccg ggcacagtgg 1320ctcacacctg taaatcccag cactttggga ggctgaggtg
ggtggatcac aaggtcaggt 1380gtttgagacc agcctggcca acgtggcgaa accctgtctc
tactaaaaat acaaaaatta 1440gccaggcatg gtggcgggcg cctgtagtcc cagctactcg
ggaggctgag gcaggagaat 1500cgcttgaacc taggaggtgg aggttgcagt gagctgagac
tgtaccactg cactccagcc 1560tgggcgacag agcgagactc catctcaaaa aaaaaaaaaa
aaagactggg ttctgttctg 1620tggaggttct tgtcttaaca tatccactgt tgattgccca
gatgttgatg taattaattt 1680agcagtcgta aatagtttag cacttgcatt aaatagacca
aaccccatag taggtatttg 1740aaatacagaa taaatgtgag gtacccctgc tctaaaggag
tttatagtcc agagctgact 1800tatggaggat ttctttctat tatttctggg tctgctacta
atttgtctat ttcatatcct 1860aattatcctt gttttcattt tgattgaaag ggggagagca
tagaaattgt ggtaaaaggt 1920agttttattt tttatttgag atggagtctt gctctgtcac
ccaggctgga gtgcagtggc 1980acaatctcat ctcattgcaa cctccacctc ccgcgttcaa
gcaattctcc tgcctcagcc 2040tcccgagtag ctgggattac aggtgtgcac caccacgccc
agctaatttt tgtattttta 2100gtagagatgg attttaccat gttggccagt ctggtcttga
actcccgacc tcaggtgatc 2160ctctcacttt ggcctcccaa agtgttagga ttacaggcct
cagccactgc acccagccta 2220aagttagttt tagattaagt gttttcatgt tttcccttgc
aaagtaataa actggtcaag 2280ttatcacctt gttccatctc catattaatc agggtccaaa
caggagatag aaaccatgca 2340acaatttgag tagttgaata aagaattata aacaggagat
tagagtaata ggggattaga 2400tagtaagagg tgaagagata ggaacagcag atataaagaa
caaccatttc ctcctatggc 2460tgagatacca tcccctcacc acactccccc acctactcac
tgagatgcag accttattga 2520agagaatgta actggcttgc tgcgaggtaa agtcaatgag
gcgctcccca gtaccactct 2580gaggggatgc tggggaaaac tgcccatgag aagagggcac
atgctgctgg ccacttgtgc 2640taaagaactt gaagtctgat aggagtgcac cctaacctgg
catagaaacc ctttcttcct 2700gctgagtccc tctagcacct tatactggca aagctttaca
ttgcaaacct ccattatcac 2760agagcaagca atgaaagatg gactcagagc tgaggcgata
aattgatagc tagcatagcc 2820tctaaactga cttttatgac tacattttat ggatagaaag
tgttcttata tatattgttt 2880ctttacataa taggggactt attcatggct gcaga
29153963033DNAHomo sapiens 396cagagctgag gcgataaatt
gatagctagc atagcctcta aactgacttt tatgactaca 60ttttatggat agaaagtgtt
cttatatata ttgtttcttt acataatagg ggacttattc 120atggctgcag atgagaaaac
agatcctaag aagttaagtg acttgcccaa ggtcacacaa 180agaattccac tagttctaaa
atgacagtaa ttacagttaa catacattgt atgtggcaga 240tacatataaa gcacatggca
ttaatttttt tttttgagat ggagtcttgc tctgtcgcca 300agctggagtg cagtggcacg
atctcggctt actgcaacct ctgactccct ggttgaaggg 360attctcctcc ctcagcctcc
cgagtacctg ggattacagg catgcgccac cacgcccagc 420taatttttgt atttttagta
gagacgtggt ttcatcatgt tggccaggat ggtctcgatc 480tcctgacctt gtgatccacc
cgcctcggcc tccccaaatg ctgggattac aggcgtgagc 540caccacgccc ggccacttgg
catgaattta attcccgcca taaacctgtg agataggtaa 600ttctgttata tccactttac
aaatgaagag actgaggcaa agaaagatga tgtaacttac 660gcaaagctac acagctctta
agtagcagtg ccaatatttg aacacactca gactcgatcc 720tgaggttttg accactgtgt
catctggcct caaatcttct ggccaccaca tacaccatat 780gtgggctttt tctccccctc
ccactatcta aggtaattgt tctctcttat tttcctgaca 840gtttagaaat cagtccccag
aatgtggatg ttaatgtgca ccccacaaag catgaagttc 900acttcctgca cgaggagagc
atcctggagc gggtgcagca gcacatcgag agcaagctcc 960tgggctccaa ttcctccagg
atgtacttca cccaggtcag ggcgcttctc atccagctac 1020ttctctgggg cctttgaaat
gtgcccggcc agacgtgaga gcccagattt ttgcctgtta 1080tttaggaact ttctttgcaa
gtattacctg gatagtttta acattttctt ctttgaacct 1140agttataaag gtattgtgct
gttgttccta ggcttagagt cataaggcct gagctcactt 1200cctcactttg cctccatctg
gaaccttaga ccaacttcct aggaaaacga gctgtctgaa 1260aacagaatag ggtgcctctt
caatgtgctc ttcactggag atgttcagga ggaggctact 1320cccacctaca cagggtgcag
tggagggtct gggccccagg gaggcagcag gaagagtgga 1380aagagcggag gctctactgt
tggacagacc tgggttacca gccgtgtgac tagccttccc 1440tggcctccat atccccctca
gtaatgaagg aatgtgtcat ccccaaatcc agggacagtt 1500acaagcagtc agtgaacaga
aagtgtctgg tacaggttct aagtgcttat tattctaagt 1560cacttcactt acctgagttc
tcagttttcc tatctataag ataagcaggt tggataaaat 1620gttctccaat atactcctgg
tcctgagatg atgtgattgt gggcagccct ttaatcatgg 1680tgaagatgtt catcataagc
acactgaaac tacaaaatag gaatataaat attttctcca 1740ttaaattatg ctggatccta
gaagcaaaaa ctggaactgt gaaaccctac ttcacagaaa 1800acttaaaatt cccaagcaga
tgaatgcttc tcggaaggac actgacagtt acctacctgg 1860aaagaatcta gatggaggtg
gcatgggcac taagcggtga gattaaaccc agttagggca 1920gccccaccag ccttggaacc
cacacatctg gagattgttg atgcagagag aaaggttcct 1980actggtgaga cctgaaaggg
atatgtggca ggtgggagga agaagttctg tctggaaacc 2040aacccttgtt cctccgttat
tgattgactc ctggtaccaa catgagccct aggtcttata 2100gaggccataa gtccctatgc
cttatagtgc ccatggatga gatgaggcca cacatgcccc 2160cagtgggtta acatgtctag
cgtgggtaag gctcttggag cactatgata cacaggaaat 2220gcccagtaac tcttagttgg
tttgatatct gttcccattg ctcacttaag ctcagtgccc 2280ctttactgat ccttttattc
tgcctccctc tgcacatgtg cattgagact cctatctgag 2340acacacactg tgttgggtgc
ccagggatgc agcatagatg ttgctgcctt ccacagaagc 2400gctcatggtc tgctagagaa
tatatcccat gggagagaaa aacagactcg ggagaatata 2460gcaggggccc ttgtcctgga
ctttggcagt taggaaaggg agggaagaga catggaggct 2520gggacccaaa ggctaaatag
gaatttgctg ggccaaaggg gagggggaat gaaaagagtg 2580tttctggcag aggaaatggc
aaggataaag gcctggaggc gcaagagaat atgtgtttga 2640ggatctgaaa gttgagtgca
gtgggtccag tgttctctac cctggctgcc attagaatta 2700cctgggaaac ttttagaaaa
ttccagtgtc tgggccctcc ctaaaacaat aaatcattct 2760tgggtggtgg ggtctgggca
tcaggattgt ttaaaaccct ccccaggtac tgtcatgtgc 2820agctggggtt aagctgtgct
ggggtctgag tatggatctg ttagggcaag tggcggtgat 2880ggagttgagg ctgcagaatt
caggccaaat agagaggttt tcatcaggat attaaagagt 2940ttagatttca atttggtggg
aatggatggg atcttatttg cattttatga agagctccct 3000ggttgcaata tcagaatgga
ttggagagga gca 30333973024DNAHomo sapiens
397atactttccc agcccaaacc ctggaatagg ccttttctcc gaggagctct agttcatttt
60agtgggaaat ggtatttaga gactataatc tgggatctgg gagtcctcat tgctactgag
120tagtcattac ttttaggctt ttccagtggt cagagctagg aaatatgtat atttaaaaat
180ggacagttga atggttgttg ccaggagctg ggaggaaggg gaagtgagaa attgtttaat
240gggcacagag tttcagtttg gggaagatga aaaagttcta gagatagctg gtggtgatgg
300ttgcgcaaca atgtaaatgc cactgagctc tcatttaaaa atggttaaaa tggtaaattt
360tatatatatt ttaccacaat aaaaaaaagt cttcttctgg gagcaccccc ccaagacaaa
420aatatgaaaa ttttacactg atacttccat ttcaagataa ttttaagatt ataaggattt
480tgcttaattc ttgaatttta tacctgtaaa ccttttatac ttcaaatttc gggcagaatt
540gcttctataa caatgataat tatacctcat actagcttct ttcttagtac tgctccattt
600ggggacctgt atatctatac ttcttattct gagtctctcc actatatata tatatatata
660tatatatttt tttttttttt tttttttaat acagactttg ctaccaggac ttgctggccc
720ctctggggag atggttaaat ccacaacaag tctgacctcg tcttctactt ctggaagtag
780tgataaggtc tatgcccacc agatggttcg tacagattcc cgggaacaga agcttgatgc
840atttctgcag cctctgagca aacccctgtc cagtcagccc caggccattg tcacagagga
900taagacagat atttctagtg gcagggctag gcagcaagat gaggagatgc ttgaactccc
960agcccctgct gaagtggctg ccaaaaatca gagcttggag ggggatacaa caaaggggac
1020ttcagaaatg tcagagaaga gaggacctac ttccagcaac cccaggtatg gccttttggg
1080aaaagtacag cctacctcct ttattctgta ataaaactgc cttctaactt tggcttttca
1140tgaatcactt gcatcttctc tctgcctgac ttgccctctg gaatggtgct ggaatggtcc
1200tgtggccttg tccactgtct gcctttgacc ataacttgaa agtcacccac catagtgtcc
1260tttgaaataa cttaaatgtc cacagttcca agcatgagtt aaaaacactt cagaatgtag
1320agtagttgtt caattgaata aacacacaca ccagaaaaaa aagcaagttt atcttttatt
1380tttagtaaag aattttgata gagcctcaac accagaaatg gctagagaga gaagcctaac
1440atatctggag gattattttt catcctactt aaagctgctt tcactttttt caggaaaaaa
1500cacacgttct gaatctaatt tataaaactc cctggccggg tgctgtggct cacacctata
1560atcccagcac tttgggaggc tgaggcaggt ggatcacctg aaatcaagag ttcaagacca
1620gcctgaccaa catggtgaaa ccccatctct actaaaaata caaaattagc cagacgtggt
1680ggcgcatgcc tgtaatcccc gctactcggg aggctgagac aggagaatga cttgaacccg
1740ggaggcggag gttgcagtga gccgagatcg cgccattgca ctccagcctg ggcaacaaga
1800gcgaaactcc gtctcaaaac aaacgaacaa acaaaaaccc caaaaatccc tgaagtacgt
1860gagctagtgg tgaaagaaag ctggagaaaa ggagcaggaa taataataat aataataata
1920ataataaaga ttgtcattta attttgagta cttccagtgt acactttgca ggtactctaa
1980gacattacct cactgaaatc tctaaggtag atattcttta tttaaagtgt acttgtatga
2040aacctggagc tcaaggtgaa ggaatttgcc caaggctgca cttgcactat cgtggcacta
2100attagccgtg tgaactggga cacgttactt cagtttgctc atttctgagt cagcctagca
2160agatgacttc taagaatttt ttccagccgg gtacattggc ctgtaatccc agcacttcga
2220gaggccaagg tggaagggtc acttgagtct aggagttaca cacaacacac acacacacac
2280acacacacac actagccagg catggtggca aatgcctgta gtctcagcta ctccggaggc
2340tcaggtggaa ggatcacttg agcccaggag gttggggctg cagtgagcca tgatcacgcc
2400actgcactcc agcctggctg acagagtgag atcctctgtc tcaaaaaaag aaaaaaaaaa
2460agattttttt ccagggaata ataaaggaag ctaatattta tggagcatct acggtgtgcc
2520aaatactttg catacgttat ctcatttaat gctcttatcc ctgcagggaa agtattaaca
2580tttgtttatc acttgcagaa ctaagtgata tttaccacag agtagacaaa tattttcaag
2640cccaaaatca agtggtatca cttttctgct gagaatgttt cagtggtttc ctttgctctt
2700gggataaaac ttaaatccct caccctaccc ttgctccaac cctccacttt ccttctccca
2760tgtggtgatt tggccataca gctcttgtgg ctgatctgaa ctgactgagc tttttaccct
2820tttgctcttg ctgttcttac agcctgggaa ccccctggtt acctcttggc ttggtgtggt
2880ggcttacatc tgtaatccca gcactctggg aggccaaggc ggacggatca cctgaggttg
2940ggagtatgag accagcaagt cacctcttgc cagtggcctt tgtccattga gtctgaagtt
3000ctttctcctc tcatttcccc atca
30243982157DNAHomo sapiens 398agtggccttt gtccattgag tctgaagttc tttctcctct
catttcccca tcattctatt 60atgctacctt gttttatttt cttcattgtg tttattgata
cttaaaatga tctcttttct 120gttgctgttt gactctccca ctagaaagta agcattgtag
atcgggcact gtggctcaca 180cctgtaatcc cagcactttg tggggcagag gcgggtggat
cacctgaggt caggagttcg 240agaccagcct ggccaacacg gtgaaacccc atctctacta
aaaatacaaa aaatagctgg 300gtatggtggc tcgtacctgt aatcccagct actcaggagg
ctgagacatg agactcactt 360gaacctggga ggcagaggct gcagtgagct gagatcacac
cacagcactc cagcctggaa 420gacatagtga gactctctct caaaaaaaaa aaaaaaaaaa
aaggaagtaa gcattgtgag 480ggcaggtacc ttctctgttt tgttcattgc tggatgtagt
tagtatacag cagtatctga 540tggatggata gatggaggaa tgaatgaatg agacttcaca
aattcagctc acttgctcaa 600ggccctgcag ctctacggga tgaagctata ctccagagtc
ctgctacatt ggctgtgtgg 660ccagctgctg ggatctgagg gttgtcagat aagcagtcta
ccagagaaca gactgatctt 720gttggccttc tgccagcaca ggggttcatt cacagctctg
tagaaccagc acagagaagt 780tgcttgctcc tccaaaatgc aacccacaaa atttggctaa
gtttaaaaac aagaataata 840atgatctgca cttccttttc ttcattgcag aaagagacat
cgggaagatt ctgatgtgga 900aatggtggaa gatgattccc gaaaggaaat gactgcagct
tgtacccccc ggagaaggat 960cattaacctc actagtgttt tgagtctcca ggaagaaatt
aatgagcagg gacatgaggg 1020tacgtaaacg ctgtggcctg cctgggatgc atagggcctc
aactgccaag gttttggaaa 1080tggagaaagc agtcatgttg tcagagtggc cactacagtt
ttgctgggca agctcctctt 1140cctttactaa cccacaatag catcagctta aagacaattt
ttgattggga gaaaagggag 1200aaaaataatc tctgtttatt ttaattagca ttaattggta
ttcttgttaa accataggag 1260tcagagtaaa tcagccattt caccaatttt cagtttgttt
ctgtcttagc taacagcagt 1320gtaatggtca gcaaaattct tatcttgtgt actgaatggc
atgtcctgtt gctgaaagtg 1380cacaggcttg ggaggtagcc atgagctcaa atcctggcac
taccacctct cttgtgtgac 1440cttagactcc tgacctttct atgcctcagt tctttcttac
ctataaaatg aaattaattt 1500tacccttaaa gatcatcgtg ctgattagag ataaaatata
aataataaca cttgttacag 1560agcaaggagt tgacactttt atattctgaa gacaaagtgg
taaatcatta tcatctatgt 1620cagaaatagc ttttgagaat acctgagtat agaactatct
tgatccctgt tacttcaaaa 1680ctaaaataat ggttttagga attaaaaggt gaggctagtc
acctccaagg gatgaactga 1740ctcagggatt gaggtatata acagtgaact ggtccaaaca
acagtcctga ccccacttta 1800tgagtgagac tatgagtaat ggtctaagtg tagacatcat
tgtccagggc tccagtaggc 1860agctctgtac ttgagaattt agcagtgacc cttctatttt
tcatctatta tacctttttt 1920tttttttttt tttgacacag ggtctcactt tgtcacccag
ctggagtgtg gtggtgcaat 1980catggcccac tgcagcctca acctccctgg gcttaggtga
tcctcccacc tcagcttcct 2040gagtagctgt aattacaggc atgtgccatc atgcccagct
aatttttctt ttcttagagg 2100tggggttttg ccatgtttcc caggctggtc ttgaactcct
aggctctcac ctctgtc 21573993163DNAHomo sapiens 399aatgtgttgg
ggaagtggtc tcctattaga ctctccattt caaaccattc catgattttg 60tcctcctttt
gccaccttcc gagcctgtaa aaactaatgt ttgtgattcc tgaggtttct 120ctaatgtctt
ttaataaagt tgacctcaga gatctcgtta cctctctgag ttcctgcttt 180gtcttagatt
ttgatccttg agtgttcttt aatcttttag caattccttg ttgcatgtta 240aaagattagt
tatattttat tcctcatttg tgttcgtttt caccaggagg ctcaattcag 300gcttctttgc
ttacttggtg tctctagttc tggtgcctgg tgctttggtc aatgaagtgg 360ggttggtagg
attctattac ttacctgttt tttggtttta ttttttgttt tgcagttctc 420cgggagatgt
tgcataacca ctccttcgtg ggctgtgtga atcctcagtg ggccttggca 480cagcatcaaa
ccaagttata ccttctcaac accaccaagc ttaggtaaat cagctgagtg 540tgtgaacaag
cagagctact acaacaatgg tccagggagc acaggcacaa aagctaagga 600gagcagcatg
aggtagttgg gagggcacag gctttggagt cagacacatg tggtttcaaa 660tccaagttcg
accatttccc atttatttga ctgtagacaa gttacattcc taaactatgt 720ctcagatttc
tcatctgtaa gttgtggtat tactagttaa catgcagggg ttttgtttgt 780ttgtttgttt
gtttgtttgt gagggtaaga aataacccaa gaagcctagt ccttggtagt 840tgctcagtgc
cctataaatg ttgtgaacca ggtggtgagg gtttggtgct gctagagaat 900tctggtatct
gctctgtgca acagagtact gtaggtgatg caagagaaag aagacctgat 960gccttctttc
ctcccagctt tgagaatgga gcaaaggcct accccagcca ccaagtgagc 1020cagtgggctt
gatcagcaca ggaaaggtga ccccggcagt ttcatttgac tattgcatgg 1080ctggcaacat
ttctattgat tgtttccagg gaccttggcg gatgagctcc tgttgagtct 1140agcatctctg
ttaaatctgt tctcaaatag gtaatgcata tgggaggatg ctgccacctt 1200gcatctacta
gacatcacct atctactgtg agactctccc tctaagccct gctgtggcct 1260cagagtgctt
attggccctg tgagtggggc agccactata cattgcatgg agttggtaca 1320tgagatagaa
acctattcgc catcccttga aactgcccca gtccagaagc ttcctgttag 1380cacatgtacc
tccttgtatg tattcagaac tcattccatt taggcttgga aacccgtttg 1440gtgcaactct
gttcaagttc cattgtctgc tttgagaatg cttgggcttg tatagtgagc 1500tgtcactttt
taatttgtta ggaattctac tcgccttgct ttttcttttc cagcatgttt 1560aagggaatga
cctccaaggc cccaaatcac agttgtattc atgttctttc atttcacaga 1620tacaatccag
gccagtccca gatttgcagc tgttaataaa tgtgaatggt tttccagtaa 1680gggggtagaa
aaacataggg agagaaccgg gttcagagtt caatatctgg attcaagtcc 1740ttcctttagc
actttactaa ctgatgtaga ataagtcagc tactcaatag gtgcctcagt 1800ttccccacca
aaatgcagac atagaaggtg ctttgtctgc tttgatgaga agtctttaag 1860caagtctatg
gggttcaatg tgttttaaga actataaagt accatataaa tgtggccttt 1920attcccattg
tgttcttgga agtaattcaa tatagtgtgt acttcatagc tgcttttgga 1980ctattgccag
ccagtgtatc atcctaaact acatgtcagc atagtataat cctgccttag 2040gtctactttt
gattatttag gaagactccc tgcccttcct atacatttca cataattttt 2100aataagttgt
aaaaaagtga tttataggat tctttgtaag tgggggaagt taagcagaca 2160aaaagttttt
aaatcttact gcagagtgtc aggaaccttt tatagcacca gacaggtagg 2220gacagaacat
gagtggcagc aagccagact tggtcttagt gctctaacct gtctgttaga 2280ggctggccag
tcagacccct ggttgaagac gttgggaatc ccagctcttt ggaggggtaa 2340gagattttgt
tagactgtta accagattcc acagccaggc agaactattt ctgtctcatc 2400catgtttcag
ggattacttc tcccattttg tcccaactgg ttgtatctca agcatgaatt 2460cagcttttcc
ttaaagtcac ttcattttta ttttcagtga agaactgttc taccagatac 2520tcatttatga
ttttgccaat tttggtgttc tcaggttatc ggtaagttta gatccttttc 2580acttctgaaa
tttcaactga tcgtttctga aaatagtagc tctccactaa tatcttattt 2640gtagtatgtt
aaatttttct aaaacttcta aggatagttg ctgtattgta tgatttgcat 2700atggaggtat
ctataagaag ttttatactt tttagcaaaa tagtcatttg gtagccaact 2760taaacaaatg
tttattaata tagaagttaa taatatctac tgatactcgg ccgggtgcgg 2820tggctcatgc
ctgtaatccc accactttgg gaggctgagg cgggcagatc atttgaggtc 2880aggagttcaa
gaccagcctg accaatatga tgaaaccctg tctctactaa attacaaata 2940ttagcagggt
atggtggtgg gcgcctgtaa tcccagctac tcaggaggct aaggcaggag 3000aatcatttga
acccaggagg cagaggttgc aatgagctga gatcacgcca ctgcactcca 3060gcctgggcaa
cagagcaaga ttccctcaaa aaaataaata tctactgaca cttaatactt 3120ggaaagggat
aaaaataaac attgtctaaa gccgtggtcc aaa
31634003066DNAHomo sapiens 400aagctgaggt cacggatttg agacctttct tcttttctaa
tacaggtgtt aagtgctaca 60aatatccctt aagcactgct tcaacagcat cccacaaatt
ttgatagttt gttttcattt 120tcattcagtt caaaatacct tctaatttcc cttttgattt
cgtctttgac ctacaggttt 180tttagaactg tgttatttag tttccaatct cttgaggatt
tttaaaacaa tatgttattg 240atttctaatt tatttccatc tcagtcaaag aacatacttg
ccttttttta tacatttatt 300gaaacttttt ttatggccca gaatatggtc tgtgttggta
aatgttccat gtgtacttga 360aaataatttg tattctgatc tcattgagtt gaatgttcta
ggtatatcaa gttgatagtg 420atgcccaagt ctcctgtatc tttactgatt ttctgcctgt
tctgttattg agaaaggggt 480attgaaactt ccaactataa ttatgatttg tctgttctct
ttgcagttct cttagttttt 540gccttcatat atatatacat atatatgtat atatatatat
attttttttt ttttgagatg 600gagtcttgct ctgttgccca ggctggagtg cagtggtgtg
atcttggctc actgcaagct 660ccgcctccca ggttcacgcc attctcctgc ctcagcctcc
cgaatagctg ggactacagg 720cgcccaccac cacgcccagc taattttttg tatttttagt
agagacaggg tttcaccatg 780ttagcaagga tggtctcgat ctgacctcgt gatccgccca
gcttagcctc ccaaagtgct 840gggattacag gcatgagcca ctgcacccag cccatatatt
ttaaagctct gttattgggt 900acataaacat ttaggattgt tatatccttt tgataatgga
ctcttctatt atgaaaagat 960aatatactgt gggtttataa catatgtaaa agtatgagta
acatattatc agaaggggag 1020aaatggaaga taacttaggc atcttatttt taagcatagt
tttccctttg tttctgcatt 1080agatgattta cctgaaatgt cattcaattt aacttactct
ccatcctcac ccgcccagct 1140ttggttatga ggcagtagaa agaaatgatc tgcctgtggt
tttctagaaa tacgaaagtt 1200gagtccttaa ggctacacag aaagaaagta cctccccagg
gcttcaccct tcccatcctt 1260tcagcaggct ttttgtctgt cgtatcttct ctgttgaaat
ggccattgac aagaggagga 1320aaggggtttt gttgtggatt gttcaggcac ttcctttggg
gtatatgggg gatgagtgtt 1380acatttatgg tttctcacct gccattctga tagtggattc
ttgggaattc aggcttcatt 1440tggatgctcc gttaaagctt gctccttcat gttcttgctt
cttcctagga gccagcaccg 1500ctctttgacc ttgccatgct tgccttagat agtccagaga
gtggctggac agaggaagat 1560ggtcccaaag aaggacttgc tgaatacatt gttgagtttc
tgaagaagaa ggctgagatg 1620cttgcagact atttctcttt ggaaattgat gaggtgtgac
agccattctt atacttctgt 1680tgtattcttc aaataaaatt tccagccggg tgcggtggct
catggctgta atcccagcac 1740tttgggaggc tgaggtgggc agataacttg gggtcaggag
ttcaaaacca gctggccaac 1800atgatgaaac cccgtctcta ctaaaaaaat agaaaaatta
gccaggcgtg gtggcgggta 1860cctgtaatcc aagctgctca ggaggctgag gcagaagaat
cacttaaacc caagaggtag 1920aagttgcagt gagccgagat tgcaccactg cactctagcc
taggcgacag cgagactgcg 1980tctcaaaaaa aaaaaaaaag aacgttccaa ggtcaggact
aggcctcccc tcagaagcag 2040caagtgacat atgtgacatc ctctccactc cctatttgca
tttctaggtt atataactgt 2100actactatcc atgcatgcct actcttgttc ccagggtgaa
ggacccagac atggagagcc 2160gaatccctgc aggccattat aaatgagatt atgccatttg
ctcccatttc ttcttattct 2220ttcatttttg gggctctcca tcttgatgtg ttctttggat
cgtgaacaga tccaaagaaa 2280aggttgttct gccgtgctgt ttgtcaggat gaaaaactct
tttttaagtg tttaggtctg 2340cccccagtgc ccagcccaat caagtaacgt ggtcacccag
agtggcagat aggagcacaa 2400ggcctgggaa agcactggag aaatgggatt tgtttaaact
atgacagcat tatttcttgt 2460tcccttgtcc tttttcctgc aagcaggaag ggaacctgat
tggattaccc cttctgattg 2520acaactatgt gccccctttg gagggactgc ctatcttcat
tcttcgacta gccactgagg 2580tcagtgatca agcagatact aagcatttcg gtacatgcat
gtgtgctgga gggaaagggc 2640aaatgaccac cctttgatct ggaatgataa agatgataag
ggtgggatag ctgaaggcct 2700gctctcatcc ccactaatat tcattcccag caatattcag
cagtcccatt tacagtttta 2760acgcctaaag tatcacattt cgttttttag ctttaagtag
tctgtgatct ccgtttagaa 2820tgagaatgtt taaattcgta cctattttga ggtattgaat
ttctttggac caggtgaatt 2880gggacgaaga aaaggaatgt tttgaaagcc tcagtaaaga
atgcgctatg ttctattcca 2940tccggaagca gtacatatct gaggagtcga ccctctcagg
ccagcaggta cagtggtgat 3000gcacactggc accccaggac taggacagga cctcatacaa
tctttaggag atgaaacttg 3060cccatc
30664013065DNAHomo sapiens 401tgggacgaag aaaaggaatg
ttttgaaagc ctcagtaaag aatgcgctat gttctattcc 60atccggaagc agtacatatc
tgaggagtcg accctctcag gccagcaggt acagtggtga 120tgcacactgg caccccagga
ctaggacagg acctcataca atctttagga gatgaaactt 180gcccatctct aaaatttcgg
gatttctttg tacccaacaa ggttcaaaca caacagtcag 240cttttattca tgatttttac
ttccatctgc tgatgtagaa catacctcca gagtgacctc 300agaaattgtc aaatgtgaaa
acacaagcca tcacagtgag aaatgggagg ttgagttaga 360ttgtctaagg ctggagagtc
catatactcc cactgttagc tctgaagtgt gtagccagtc 420ttcagattct gggtcagttg
cctcagtctc tcttagcttt tgccttactc tttatccgac 480cactgccctg ccaggaaaac
aaggctctat aactcctctt acaggtcagc ttgacacaaa 540aagggtgcct ggattcctaa
tgtttcattg tcacttttcc cagtcagatg ataatgcttt 600tcaaatcaac atatattttg
ggggaggttg gaagggagag ttgaaatatt ctaagaatca 660aagagtagcc cactttaatc
agagtatgac ccctgattgc tcacagtcat ctcctgagca 720gtgtgagcga gtttcagatg
aggaggctga aggccagtca ggcatgctcg aggattccaa 780gtctgtaggt gggagggcag
agatttagtc ctgttggcca aagcctctag ggaatttctc 840actccagtgg agaaggcaac
acacttacca aactgtgtgg aaactatctc atttgattag 900aaattttacc tcaagaagag
gaaggacagt tgagaaagaa cattttctta cacatgagac 960agctaaggct tacaagaagg
agaggaataa tgaggcaaaa taatcctcat taatattttc 1020attcctcccc tggggattag
aactactttc agacccgatt ttaatggtaa gttaggtact 1080tcctacagtt gccatccaaa
tatcagtcag gatcagacat gatgttagct cctgctacaa 1140taaaaccatt ttctccctga
atgaaaacaa aggttccaca ggagacagtc ccacagagca 1200gtggcttctt ttcctccctt
taaaacctca tgttggctgg acacagtggc tcacacctgt 1260aatcccagca ttttaggagg
ctgaggtggg aagatggctt aagcccagga gtttgaggct 1320gtagagctat gatcacacca
ctgcccttca gcctgggtga cagagcaaga ccttgtctct 1380aaataaacaa acaaacaaaa
aatcctcttg tgttcaggcc tgtgggatcc cctgagaggc 1440tagcccacaa gatccacttc
aaaagcccta gataacacca agtctttcca gacccagtgc 1500acatcccatc agccaggaca
ccagtgtatg ttgggatgca aacagggagg cttatgacat 1560ctaatgtgtt ttccagagtg
aagtgcctgg ctccattcca aactcctgga agtggactgt 1620ggaacacatt gtctataaag
ccttgcgctc acacattctg cctcctaaac atttcacaga 1680agatggaaat atcctgcagc
ttgctaacct gcctgatcta tacaaagtct ttgagaggtg 1740ttaaatatgg ttatttatgc
actgtgggat gtgttcttct ttctctgtat tccgatacaa 1800agtgttgtat caaagtgtga
tatacaaagt gtaccaacat aagtgttggt agcacttaag 1860acttatactt gccttctgat
agtattcctt tatacacagt ggattgatta taaataaata 1920gatgtgtctt aacataattt
cttatttaat tttattatgt atatattgtg tcagttcaga 1980tgccaaaaag aggtcttgaa
catgtcacag gctctgatgg cactgaccat ggagaaagct 2040tgatttgatc atctggtgtc
tacaataacc aaagctaatt attaaggaaa aaaacttgaa 2100gaaagaaaat agtccttact
tcatctataa tgaggttttt gtttttttgt tttgagacgg 2160agtcttgctt tgttgcccag
gccggagtgc agtggcgcga tattggctca ctgcaacctc 2220cgcttaccgg gttcaagcaa
ttctcctgcc tcagcttcct gagtagctgg gattacaggc 2280acctgccacc acgcccggct
aatttttgta tttttagtag agatgtggtt tcacgatgct 2340ggccaagctg gtctcaaact
cctgacctca ggtgatcctc ccacctcagc ctcccaaagt 2400taagtgctgg gattacaggc
atgagccact gcggccggca ttaagtatga gtttttaagt 2460tagcccactt tgttaatgac
tatgagtact aatagcttaa gataaagaag tttctaggta 2520atcttgtttg aaggatgatg
taaaaatata aatttaaact gtgagtgaca aaataaactt 2580ccttaatatt tgcctacatt
tagagaaatg gagcattcag ctcagaaagg aagaatgtct 2640gtggttttaa ggtaaaatcc
atattccaag actcagtgaa gaaagttcag tgataaagaa 2700cagactactc tcatcttatg
aagaaatgga gcaatttcac ttggaaagac taggaagaca 2760aaatgttaca gacgtatttg
ttgtgccaca aaataggcaa ggtcagtttt gaacaataag 2820aactccataa agtagaccag
ggcatctcag aagtgaggtt ccatgagccc aggtggggca 2880caggctgggt gatcttgagt
ggagaggaag aggggttttc tgagcttcaa gagctgggcc 2940acacagtgtg ttggttttag
ctgggatgga gttctagaac aaacctgcac tttagaacac 3000ctttctaccc acccccaacc
acacaacttg ctactattag taaatgtata ggctgaggca 3060cggtg
30654023153DNAHomo sapiens
402ggactaaccc acctcccttc caaggatctt gctatctatc ccttcttgcc ctcagctact
60cactcccaca gtcatatacc agatctcatc attagacaat tgtaatccct acacaattta
120gttccatgta tcctctctct aaccactatt cctcatcttt ccaggtcatt ctctctagac
180ccgaattcca acaacccttc aaccacactg gtaccactaa tctacagatt acatcttctt
240tctactatac cttgatgtgt tcctgaatat ctcccgaatc ctcttcatcc agtttaattt
300caaggtccat cattataatc attttcttac atactccctc acctctcctg ccccattaat
360actgtcctag taaaatctag ctctctaccc actccatgcc tgcccctatg ctgctgtaag
420tagccagaga aacacatata ataaatgcat tcacacaaac cttctaacat atcatataat
480attgtctgat gtcttcctac tagaatgcct ctcaggcagg aatttttttt ttctaaacta
540atttattcac tgaaatatcc cagtgcctag aatagtgcat gttaaatagt agaatctcac
600tcaacatttg ttgaatgact gaataggagt tccaaaatag agaacacagc atatgggagg
660ggaaaaaaat cagtaacaaa atcattcaag aaattttccc agaactaaag gatgggagct
720cctagaattg acaggggccc agcatcacac atgaaaactt caaatcacat gactatcttc
780aaattacacc agaatgctag agagaaagag aataggatac aagcttccac aaagaggaga
840aaaatagatc acaaatcaga aaagatcaga actcaaaatg ttcatgaaaa ctcaacagcc
900atgctcgaag tcacagcaca atgaagaaat gtccttttaa aaaatcttaa ggagaaccat
960ggcaactcag gattctctac ccagccaaac tattttaatc aagtgagagg gtagaatgaa
1020gacatcttca ggcctgcaag gtcatgaaaa attaacaatc cacaaaccct cttctcagga
1080agctactgga agatgtacca aaataagaga ataaataagg agaaaggcat gagacaccgg
1140aaaaagggaa cccaacctaa atcacatgca aagaaaatct ccagatgcca atgaagggtg
1200accacatcta tgtaccgaga gggcaagtca ctagtttaga aagggacaag tcagatgcac
1260caagattcaa caaactggaa ctgaaataac accagatgca tctgaaaata ctgagtggga
1320ttaatctact cttggagatt ctgtggctaa attgatgata gaaaaccaag caaatacaaa
1380gaaaaaccat aacattaact ttagaggaaa ctaatagttc tgagggagat gatcctagaa
1440tgcaacctgg ctccactgtg tgagtagtgt ttagagggtc ctaatgacac aagcaggctg
1500gaattacact gttcctttat taggaggata taagagtgga aaataagtat gtgtgtggca
1560gggacaaagg atgaaaaaca gctaaatcct catcttccat aaaaggatgt caatatagaa
1620tgcctgaagc agaacaatca agatgcaaca taagtatgtt atacagagat acaaggacag
1680tacacaagaa tcagctaaaa gtatttaaca gaaatggtca ggggcgaggt cagaggagcc
1740agggcagggg actgctgtgt tcataacaag ctttgtaaaa aactatatga ctccttaaac
1800tatgtgtcct taaaaaaatg ttttaagaac agaaaataac aaagaggtaa aatatgaatt
1860atctatcctt catatctcac ttgagtactg atgtttgaaa gaagcatatt tttttaatga
1920acatttcaat tagccagtat tttaccatgt aactttgtta aaattatatt acactccaat
1980aagaatgcct ttacctgtga cagtagttct tccttctctc cagcaagttt tcgtagcctt
2040acatctaaaa caaatgaaaa agatcataaa ctaaatatgt gatgatatag tacataaaca
2100attaaaaatt tttcaaactc ataaacagct aatattatct gataaattac attacttaca
2160gctctgaata tctaaagaaa taaaggtgtt aatagcatta cagaaaagtt cttaactatc
2220taaaaagtat ttccacacaa ctgatattta tcagggcacc aaatccaaca tttgttcccc
2280acagcagtga tttgccactt aaagacaaac agaagtacaa aggaggtcat ttccttgttt
2340caagctttca ctagtagaca gacaactcaa atgtcaagtg tgttcctaaa ggctgagccc
2400ttagcgggag agatccaaat atgtgaaaga agatggggta agagcaggac tgggcaaagg
2460aagctagaga agagaaaaga gaggagcata atgctggaag aagcaaagtc cccaaaagct
2520agtagggagg gaaggggacc cactccaaga tgtgggcagc caggccaggt gtggtggctc
2580acgcctgtaa tcccagcact ttgggaggcc taggtgggtg gatcacttga ggtcaggagt
2640tcaagaccag cctggccaaa atggtgaaac cacgtctcta ctaaacaaat acaaaaatta
2700accaagcgtg gtggcaggcg cctgtaatcc cagctactcg ggaagccgtc tgaacctggg
2760agatagaggt tgtatgttgc agtgaactga gatcgcgcca ctacaatcct gcctgggcga
2820ccgagtgaga cttcgtctca aaaaaaaaaa aaaaaaagat gtgggcatcc atgggtagat
2880ctgcggcttg gtggcagcac cattagggct cactcctagc ctgtggaggt ttgactcttc
2940ccaagcctgc attaaaatag gccccttcag ctgaggagat accatggttc acttaaaaaa
3000gcagctgata cctcgccaac cactaccctc tgtaaacagg gtcatagcca aataaagatt
3060ttggtttctt ctgcaccttc caagcagatc tgcctgcttg gacctgcaga cgagggaaaa
3120aacagcaggg aaacactctg ggctgcctat agc
31534033080DNAHomo sapiens 403gccagactct cgttccattc tccagatctc tcttgctcac
ccagcatcct gttttattca 60aagtgcccta caatcacatt tctggaatgc acattagaga
atgtgcttac taactttcaa 120aatgtttttc agtttgcttc acacttgtat ctctcactcc
tctaagaagc ttacacatat 180atgaaaacaa gatgaaaaac aaaaaaattg ttttttttta
aaataaaagt gagctaatga 240tacagtatct atctgtgcca tttttcttcc tctagagtag
atttctttgt ggggtcaatg 300gatgggtgac tttgatttct cagacagagg tgtcagcaac
tttgtggttt cctggagaga 360ggtgtcagat tctcaaaggg ttaaatttaa gaggtttaga
ctttaagagt ctgggaagcc 420ctgctctgga agtcatactt ctctgatatc tttttggtca
tctgtttctt ggcttaagaa 480atgtggtgga aaagaggtac agaaccctgg ggtaagcagt
ggaacataaa accagatgtt 540ccaaggatga gaaacttata acacacttga gaagtctcct
gctagcctac tgctccccta 600gcacaggtat actagactat ctctttgcag aacagtttgt
agttaagtaa aaaccgatgt 660gtataggccc atagtacttc catccacagg ccttacagtt
acacttattg ccttacagtg 720acccagatgc tgatttccca aggtcaagga tgtctgaaga
caatgtgcca atgtgcccag 780attcttctag ttaaggatct acttgagtct cagcccttat
gctgtttttg ttttccaagc 840tgggatatga aaaagcagaa aacccaatag ggtaacatta
atccaagtca acatagcaac 900cagtatctta cctaatggcc cttctcctgc tgactccaag
acctgagcag cttcctgaga 960cacaacagtg atggctccag ccactggttc atgactgaca
tcaccattgg gagtgccatc 1020ggggattata actaagccat gtttctgcag gggggaaaaa
cccaccatca caaaaggccc 1080gtatggaagc tgtaagctct gtgaggtcac tctgcaacaa
tacatgtttg ctacaggtaa 1140aacctggtta gaatcagtta catgaaatat agctctgtgt
aagaaatagc ttcaacctac 1200caaatctgga ttagagaata aacactgtag tttgtattta
ggctaggaaa gatggcagga 1260tgaaaggaag gaagatagag agtaaaacag tgagggacct
gaattccagg ctaatgctaa 1320catacctctc ccgtcttcac tgtctcctgc aggtcagcca
gctcctctct gagcatatct 1380cgctcattcc taaggcaggc aatgtattct ttctgtttct
ctagggcctg gttttaggta 1440aggtagcaag ggaaacaatg gcacagaaaa agagcaggtg
aaaggtagca gagaagtacc 1500taattcaaat aagcaaagat aaaggcataa aaagcaagaa
agcagtcaaa agattggaaa 1560caaacagtca gatatgggag gaaatacaga gttacatgga
tatacatctc cagaagagac 1620ttctcataga aactggttct catgcatcaa tttggcaaaa
catgtttaat cacatcaagc 1680agggaaataa atcttttcca gtcaatgaaa aaaataaaac
aggaaaagga agataaagag 1740agaagccaga gtaaaataaa gctttcctta ctgactgcct
aagtgcattt ttatttggtg 1800aacaaaaaaa accccacatt tcatgtttaa ctaaactagt
ttattcaaga atacagttga 1860ttttttaaaa aatagttctg gaataaaaat aactattata
cataggtatt ttaatttaat 1920attggctgta gatttttctc caagtagtgt ggcaaaatac
tcaaatacca cttaattcaa 1980aatagttaac ctccaaaagg attcaaagat caacttctga
caacttaatt aaatataact 2040gagactcatt tggctttctg ttatactccc aaaatgtgaa
aaacaaaaat aaacactgac 2100aaaataaata cagccaagct atgaagagtt acagaatatg
gatttcagaa tcaggctttt 2160gggttctggc acatacttgt cctatgcctc agtttcctca
ctggaaaaac agaagggata 2220atagcaccca tcccaagggc agaggcataa atcaaggtaa
agcattgcct gtaatgccta 2280gatagcaggg acagttcagg agaatcaggt tggtgatttc
atttgtaaat tccctgccat 2340ttccttaatc tcacaactgt cagctgagga caatgcagaa
gcaggaacat actttggtca 2400tcaatgaaaa ataaaatcta ctatgaaaaa ataaaatcta
ttgtaaaaga aaataaccca 2460gaattaaaaa tacacccaag gtaagtagtc tatgcaggaa
tctgattact ggcctatttg 2520aaaaagcctt tccccaaata tttttgttca tatatttaat
gtcttctgtt agcattccca 2580ttaatccaag aagttaaact atatcaggta actttcctct
cagttcactg ggtttggaag 2640tgggacagcg aattgctgag aaattgatag ctgaatagct
gggcaattca aaaaatcatt 2700ataatcctgt tttgcaacca aatagggagc aagtaaataa
gggatgatag caactacgat 2760ttgtatagca caaattatat ggcaggcact attttatata
atttctctct tatacattat 2820tttacatttg aaacctctac atatcctgtg aggtacttgt
attatcccca tttaacagat 2880cagaaaattg aggctcacag tggttatatt ttttcgccca
aagtcacagt aagtggcaaa 2940accagaaaat gaatctggtt gtttttgttt ccaaagccct
taaatagttt tttaaatatc 3000acagctctat gaaggccaca ttatattccc ttattgttag
cccagatgat gctaggaaag 3060gagtccatac ggcaaatcct
30804043074DNAHomo sapiens 404tcgcccaaag tcacagtaag
tggcaaaacc agaaaatgaa tctggttgtt tttgtttcca 60aagcccttaa atagtttttt
aaatatcaca gctctatgaa ggccacatta tattccctta 120ttgttagccc agatgatgct
aggaaaggag tccatacggc aaatcctact ctttacttat 180ccaaactgca atgtcaatat
ctgacttctt ttcaacaatt tacattcaca ctatatgatg 240tgtctcaagt ctgcctgtga
attaacaatg tgcatttcta gcaccatcta gctagtgtta 300acactccatt atgttaataa
ttaataataa ctgaaacatt gggaaaacaa agcacaacaa 360tactttccca tgtgttgagt
gtcactttat ggattaggta tttttggtta ctggtatctg 420catgcatagt tatgtcatgt
atcaccacat ataagtgggt aaatgatcac tgtcacaaca 480tgctctacat aaacaacaac
actgaataaa aaagacctct gaggaacagg ccaatttgaa 540actaggaatt ctagcaaatg
atatacatga catttgctct tcttccacat cgtattgcac 600tgggttttat ttttaccttc
ggacttttta atttcctctt cccataatta cagatgagaa 660aataaaatac atcctgtaaa
ttcacccact tcaccacaaa gtttgaagac tactaaaata 720ccttataatt ggatcaaatg
tattcaagct ggatctaaaa ccctctgtat tacctgacca 780tataaccact acccttgtgt
ttgtgtgcaa caatagctcc tacagtagat tttttttagg 840gtaaaaagta cacgcttgta
gagttcaaaa taactcttta tccctgacct aacctcaaat 900cctaccaccc ggaagccaaa
aggatgtgta taatgggctg aacttttggg caaggggtta 960attctccaca taattgtact
ggggaacaaa tatctttggt cagaatggaa gtgagtttat 1020gctgggctat agagatacgc
aagttcttca tacgcaccta ttctatacat gggctcctgg 1080tgtttagaac cgcagtggag
ctagaggcaa gaccactaat gaactgaact ttaacctggg 1140aataatggac atatttcttc
attaagttac taaatgtaaa tcttaaaaat gaagctagag 1200acaagtagtt actgaccata
ctgaaaatgt gtcttaaaag tcaagggagg accactgccc 1260ttgtattata atgataacaa
atgttggcaa ggacatggag aaattggaac ccttgttcac 1320tagtggtggg aatgtaaaat
ggtacatctg ctacagaaca cagtataact gttactcaaa 1380aaaattaaac acagaattac
catatgatcc agcaattcca cttctgggta cataccgaaa 1440acaactgaag gcagagtctt
gaagagttat ttgaataccc atgttcacag cagcattatt 1500cacaatggcc aaaaggtaga
tgtgttgata tatcaacaga agaatgtggt atatacatac 1560aatggaatat gattcagcct
taaaagggat ggacattctg acatatgctg caaaatgaac 1620cttgagggca taatgccaag
tgaaataaat cagatactgt atgattccac ttacatgaag 1680tacctagagc agtcaaattc
acagagacag aaggtggaat ggtagttgcc attccaccag 1740gggtttggga gaagggactg
aatggggagt tgtttaatgg gtacagattt cagctgggga 1800agactaaaaa gttctatggt
ggtgacagta gcacaacaac atgaatgtac tcaatgccac 1860tgaactgtac acttaaaaat
agttaaaatg gtaaatttta tgttatttgt actttagcac 1920aatttttcaa attaaaaaag
agtcaactcg tgattcaata acttggaaga atcttgaggg 1980acttatacag agtgaaaagg
gataattcca aaaggttaac atatactata taattccatt 2040tttataacat tcttaaaaga
gcaaaactac acaaatgaag aaaagattag tggttcttag 2100ggcttgggag gggaagggga
gattaaggct atgactataa aaaggcaaga ggaggagaaa 2160tcccttatat tgatggaaat
gttctgtatc ttcaccatat caagggcaat atcctggttg 2220tgatattgta ctatagtttt
gtaagatgtt acatttgggg aagattgagc aaagaatata 2280taggatctct gttaaatttc
ctcttttttt tttttttttt gaaacagggt tttggtctgt 2340tgcccaggct ggactgcagt
gacatgatct cagctcactg caaccttggc ctcccggatt 2400caagtgattc tcatgcctca
gcctcccaag tagctgggat tacaggtgtg caccaccatg 2460cctggctaat ttttgtattt
ttagtagaga cagcgtttta ccatgttggt caggctggtc 2520tcgaactctt gacctcaagg
gatccaccct ctttggcctc ccaaagtgct gggattaaag 2580gcataagcca ccatgcccag
ccctgtttaa tttcttacaa ctgcatgtaa atctaaaatt 2640ctggccgggc acagtggctc
atgcctgtag taccaccact ttggaaggcc gaggtgggtg 2700gatcacttga ggtcaggagt
tcgagaccag cctggccaac atggtgaaac cccatctcta 2760ctaaaaatac aaaaattagc
cggacgtggt ggtgcacacc tgtagtccca gctactcagg 2820aggctgaggc aggagaattg
cttgaaccca ggaggttgca gtgagctgag atcgtgccac 2880tatactccag ccttggggag
agagagagat tccatctcaa aaaaagaata ctaaaataaa 2940atattttcca attaaaagcc
aaataattta tattttaaac tgagacatct gaggggtttc 3000tatggctggt ccaagattat
cagtttaaaa tattaaggca ctcatacaag ctagaaatcc 3060tgggcctaca gatc
30744052129DNAHomo sapiens
405caagattatc agtttaaaat attaaggcac tcatacaagc tagaaatcct gggcctacag
60atctgtgtta aagaaaatta tgtgaagtcc taaaagaagc ccattctaga cagtgaccag
120attttaacta aaaattttaa attacctact ttgccccacg ttttttcacc tcttatattt
180cccaagcaaa aatttaaatc aaatcaatgg ttcaacaatc aaatttcact tttcaatttc
240aaatttcaat tttaaaatca tagtttcaaa acacctaaca gaattattga tttattcccc
300agaaggcttt tctaggttta gatggaattt ttaatactca gcaatttgaa agtcagagaa
360ttatctataa agtagctttt gttctttaaa tttttggtct acaaactttt ttaaagaaag
420ggtatcactc tattgcttag gctggagtgc catggcacga tcatagctca ctgcagcctc
480catcgtgtgg gctccagtga tcctcccacc tcagcctcct aagtagctgg ggcaggtgca
540tgctctgcaa attttaaaat tcttttgcag agacagggtg ggtctcacta tgttgcccaa
600gctggtctca agctcctgac ctcaagcaat cctctggcct caagcgatcc tccgtgctag
660gattccaggc atgagcctac aaactcttaa gaggtaatgt aatcttccca tgtgtatatt
720aatgagagag gtccttgaag tgatgaaaaa gactggatcc tgctgactac tggttgggct
780tcagaatgtt cctaacaaca ttctgagggt ataatccaca ggatttcata tccaggcctg
840tctctcaaga gatgttctca gggatcttta acaactattc cctactcccc ctaaccttaa
900gcagaacaag acttttctta catgttctat ttcctctgcc cttccctgac aaggtaagcc
960tctggcaact atggctaagt ggttccccta ctgtagaaca gagagctcag ccaggtgatg
1020ggactgccaa tcaaaggcca catgagatga actggaggga attttttcca gcttttggtg
1080tacatggaat ctacctgcaa ggcttagcaa aacagcaatg aagacatttc gtttatctgg
1140gcccttactt gggggagttc tgtggttata attacagaca gccaccctag aaagtcttac
1200attcctatcc atttctgtaa ttgaattgat tttaatctct tcctatttta tacaccaagg
1260atttatagga tgctaataac tttctcccca ccactaccct cttcttatcc aaattcctgt
1320aacgtaagga tatcaagtta accacagagt ttgaattgaa tgcctgtggc tgtttctgga
1380taagaatctg aagggaggcc aggcatggtg gctcacgcct ataatcctag cactttggga
1440ggccaaggtg ggtggattac ctgaggtcag gagttggaga ccagcctggc taacatggtg
1500aaaccccgtc tctaataaaa agacaaaaaa ttagctgggc atggttgtgt gtgcctgtaa
1560tcccagctac tcgggaggct gaggcaggag aatcacttga acccaggagg cagaggttgt
1620agtgagccga gatcatgcca ctgcactcca gcctgggcaa cagagtgaga ctccgtctca
1680aaaaaaaaaa aaaaaaaaaa aaaagaatcc taagggaata cagagaaact tcctttaaaa
1740gttcctactt atacatttta caagcctagt gtttgctgaa aagaagagtt cttccaggca
1800caacgtcagg ttttcctatg gaagtctctg tctcctactg actcattttt catactgtgt
1860aaatgctcaa gaagaatcaa aaggacaggt tttttcaatc tctaggttaa attctactgt
1920agtcctcatc aatgagcttc taaccaaagc ccaatttcat ttcatacccc aattttttta
1980tctttccaaa gaagtgtctc ctggaggtca aacacctctt ttgtcatggt gtctattttc
2040tgctgcatgc gctgcttctc ctgcagagga agaggggaag agaagtaata aaagagcaga
2100aagaaaaggg agaggaggtt tgagggagg
21294063170DNAHomo sapiens 406ttctcctgca gaggaagagg ggaagagaag taataaaaga
gcagaaagaa aagggagagg 60aggtttgagg gaggaaacaa aaataaagcc gataaagaaa
cttaaccaaa agggaaagtc 120tgtgatgaac aggaaaagca aaattggtct gccaaaagaa
aagatgacat tcacagtctt 180ggccacaaga ttcttattgg cttgccccta caaaagtaag
caaaggaacc aggaataatt 240gttccaacca cagctacgtg gcagcaagcc agctagaatt
tctgtgtaca tacagctcca 300tatgtatatt ctttctttga taactgcctt tttaccaaac
aagaacttac attcctagag 360agggaaattt aggtttgctt atgaacaaat gatctttcat
cttagagaac aagcagtttt 420gaattttatt ttttaagcag aactgatcat tttgaatttc
tgttagcaaa atctatgaca 480gcaagaacac catgaatttt gtattatttt aaaattatat
tattttgaaa catttaaatt 540tagcatttaa caatccttaa atgacctttc taattaggca
atggtgctta acaggttttc 600ttcttatgca ttattggtaa attattatgt cctcctttcc
ctactcatac attaggtact 660ttaccatgga attttcaatt ccaaagacca aaaaacatta
tttgtaatat ttaaagtttt 720tcagcataac catagatact aacatctaaa agatgttcat
tctagatgta aaaaacatct 780aaaactatag ttctcaaagt ttgtatacct agcaccctaa
gcttttaaag aagccacagt 840gatgaactat agaaatcaag cattatattc ttcttaaatg
caattacaat taattactag 900aacactttac cagtcctaac ttaagctatt gaatttgaga
agcagccccc aaagcaggtt 960tattatttta tgtggttggc attttggcac aaaaagataa
aagaacaaaa agggaaagaa 1020tttcacatta ttttaaaata ccagcaggat acagattctg
gaaaatatgc ttcctacctt 1080atatggagaa aaaccaagaa aattaacttc acatgtaatc
tgatagatcc aaaaggttat 1140ctgtatctgc acttgaaatc cacaaattct gagtatgttc
aattattctt aatgatgaca 1200aaaattaaca cgtcttcaaa tttaaagtca tttctttttc
tctattaaat ggtttttaaa 1260aatcatttgt agagagacat attaagaggt aggtccgagg
ggaaagagag aaagagggag 1320agaaaaagaa aggctaaggt ctgagtagcc aggaatgtgg
acaagtgtgg ttgtgagatc 1380tctctcctgg gatcattaac aatctatgct tcctgacatc
tctggcgtgt caacactaac 1440ttaacattag atgcctttga tagccacacc tagatagtgg
gcaggatccc ccttcaaact 1500tatttccata tttatctaaa aacatcgtct caggagggaa
aaccacattt aaagaaaaaa 1560gatgcatgca atgtagcagg cctgcaagga tgactaatgt
tttcaaagag ttcttggtag 1620actatgcttc attccattcc taagatgttg ccagcaatgt
ggcagagtcc cttcgcttgc 1680agaaacctga accttcagac taaccattct ttaccttttt
gtacagaacg tatcttgatg 1740tttcttcttt tttcatttag ccacctgaga aatgtattta
cctgagtgaa aatcaaactt 1800attccccaag aatcatgtcc caaaagatgg cattcactaa
ttccaaagaa taatgttatt 1860ctataatttt tccttttgcc catttcctaa gatatctgta
ggaaacagtg tgcttaggaa 1920taaaagacac aaaaatttct gctaccaaag tggggtaatg
tttataggat ttatagtatt 1980aatttttaag cataatctgg tttatgtttg aaaatttgta
gtgtacagtc aaatataaag 2040agacaaactc tgatgcatct taactctcct tccctcccaa
cacatcctca tcccattcaa 2100ctcatttttt ttcaaaatta agtattccca cagttcatgt
acatacctca ataagctcat 2160ctctttgccg caggccttct ttaagttctt ccatcttatg
ctgcagcaca ctacacatat 2220gtttctgcct ttctaactcc tgttattaaa caaataatat
catttacaca ggtcatggca 2280cacaagaaat ttgaacatac acaatacaac acagaggtta
agtatgacct ccagaaacat 2340gcccaaactc ctgattcata gtaacttaga aaaattgtgt
attctataga aaagttaaga 2400aaattttaaa attccatctt gtataattat caggaaaacc
tgaactaatc aatggcaaaa 2460ttattaaaaa caaaagataa tttagtaaag taacaggtta
taaaatgaac atatacaatt 2520caatgacatt catatacaaa taaaattcaa agaaggaata
ataaatgcaa tatcaaaata 2580aaatcaatat taaataaaaa acatacatgt aaacttacaa
aatatatcaa aaacctatat 2640gaggaaaatt atataaagca ttcccaaaag acagagaaat
aggattgaat aaatggaaag 2700gcataccgtc ttcttggatt aaaagtctca caacattata
aaaatgccag ttctccctaa 2760attaatctat acatttaatg tagtacaaat aaaaatacca
tcaggttttt cttttatcat 2820catcagagca agttgatttg aaagaaaaac acaagaaaaa
gtagccagaa aaatacatac 2880tgaaaaagaa gaaagccggc cttattaggt attaaaacat
attataaagc ttctataatt 2940aaaacaatgt tgttatggca catgaatata gaccaaggga
gcagaataga gaattcagga 3000aaaacccact taaatataca aatatattta aaaacaataa
aaataagagc atctcaaatc 3060aatgagaagg aaagactttt aaattagtaa tgttgggata
actggatatc catttggaaa 3120aagataaaat tggaactata cctcatacca cacaccagga
caaattccaa 31704073157DNAHomo sapiens 407aaagccaggg
agtgaatggg ggaagaggga agggaaggga gaacaaactg tacaggaata 60aaagtaacca
aggagtgggg caatctttac tgaaaaaatg actcaaaaat ccacaagcaa 120tgaggtaatg
gtacaataaa ttagactaca taaaaataaa aattttgggc tgggcatggt 180agctcatgcc
tgtaatccca gcactctggg aggctgaagc aggcagatcc cttgagccga 240agaattcaag
accctgcctg ggcaacatgg caaaacccca tctctataaa aaaaattcaa 300aaattagcca
gggttggtgg cgtgtgcctg tagtcccagg tactctggag gctgaggtgg 360gaggaccacc
tgagcctggg gaggtcaagg ctatagtgag ccatgatggt gccattgcac 420tccagcctga
cgacggagtg agactctgtc tccaaaaaat aaatacataa ataaataaaa 480cttttgaatg
gcagagaatc tctaaaacta ggccaggcac ggtggctcac gcctgtaatc 540ccagcacttt
gggaggccga ggtgggcgga tcacctgagg tcgggagttc gagaccagcc 600tgaccaacat
agagaaaccc tgtctctact aaaaatacaa aattagtcag gcgtggtggt 660gcatgcctgt
aatcccagct actcgggagg ctgaggcagg agaattgctt gaacctggga 720ggcggagatt
gtggtgagcc gagattgcgc cattgcactc cagcctgggc aacaggagcg 780aaactctgtc
ttaaaaaaca aacaaacaaa aactagaaag aaagaaaaca ctaactgcat 840agaataataa
gctacggaaa cggacagttt acagaaaaag aaatagaaat agctctgaat 900atgaaaagat
actcatacta agagaaacgg aaacaaacaa aatactagca aaagttcaaa 960aacttgacaa
catattccag aacagaacta tggggaaaga aataggccct catacatttt 1020ggtgagaatg
caaatggtat aatgcttaca aaggagacta cagcagtatc tgcaaaacta 1080catacctttc
gacccagcaa tctcactctt catcatagat acattggcaa aaatacaaaa 1140agacctatgc
agtatgttat ttctacagga ctatttttaa cagcaaaaca tgacaaactt 1200gaatgtctat
taatataggg aactggtaga ataaagtgtg gtacatccat actgtggaat 1260aattatgcag
tggtgaaaaa gaatgagcaa gatatctcta tacaacattc ataaggtgat 1320aaaatctaca
tgcacgacag catttatatt aacaatatgc tactattttc taagaagagt 1380aagaaataca
tatatttgta catatatttt gaatgattat atatacatat atatcttttt 1440agattaaaaa
tggctaccta atttatcttc ttggatttaa aacatggaaa gataaaccat 1500taaaatttaa
aattccctaa aggaaggaga aaatagagac agagacaggg atagaagatt 1560aacttcttca
gatatattct ttgttaacgt gactccagat ctatgtaact attctagata 1620gttacaaaat
tgtaaaacaa aattaaattt aaaaagcaat tcctagtgga aacaattcaa 1680gtggccattg
atagatgaat gcataagcaa aatgtggtat atatacagtg gaatattatt 1740cagctttaaa
aaagaaactc ttgtcacacg ctacaacatg gatgagcctt aggacattat 1800gctaaatgaa
gtaaaccagt caagaaaaga tatattatta tatgattcta cttacttatg 1860agtttggtac
acagagtagc caaactcaca gagacagaaa gtaggatggt agttgccaag 1920ggctggtggt
agggagaaat gggtaattgt ttaatgggta cagagttcta gttttgaaag 1980ataaaaaatt
tctggagatc tgatgtacac taatgtgaat atgcttcaca ctactgaact 2040ctacacttta
aaatggttaa gataataaat ttatcatgtt ttttaatgat aattaaattt 2100tttaaaataa
aaataatttt aaaagtaatt ccaaatattg aaaataaaat gcagtgaacc 2160taactataca
tccagttaga agcgcagaaa gaaactattt caagtaactt ctaaaaacaa 2220tagtttggcc
acacacagac tagtggcaaa aataacagcc aagcaaaaca aacaaaataa 2280aaatctttta
actattttca gtaattaaat tgttggtgtt aatgttggta ttgctattct 2340aggcaacttc
ggataaagca aagagaacag aacgtaacat aattactatc atccctagaa 2400actttgagaa
ctaggaattg tagtgtaggg gaaacaaaca aagatacaga tgtaagacag 2460aagaggttaa
ataaccctag agtcctgcat gtgaactgga actatcagca agaactcata 2520atgtattttg
ttgttaaaaa caaaaaaaaa cttcacacac atatttccca aatttatcca 2580ctgaaaagac
ccataaacac tgacctactt ggtggcaatg agcatctcta gcactcatac 2640taaaacagaa
ctagggctcc ttggacaaat ggctgattcc aggtctgggg cagaaaatgt 2700acaagatgag
actgggacat cttctgccag aaagcaaaga agctatcaaa aacaattaga 2760ctcaccagaa
gtacttgaga atcaacctca agaggttcac attagccaaa gatgggacaa 2820ttttatcatc
aaaaagaata aaagctgcaa tgcaactgaa gatatcaaat gcttgaattt 2880atgacttcat
attgatattt taagagaaaa gtaattggtc acaaccaatt cctttttttg 2940aaaactcata
aagggagaaa atatttgcga ataatatatc tgataaaatt cttatatcta 3000gaatatataa
accttacaag tcaataataa taaggcaaaa tccaatttta aaatggacaa 3060aggatctgaa
tagacatttt gccaaggaag atatgcaaat agccaataag cccatgaaaa 3120tatgttcaaa
atcattagtc accagggaga tgcacat
31574083166DNAHomo sapiens 408tgtggggaaa tcaaaacctg catacattgc tggtgtgaga
atataaaatg gtgcagccac 60ttcggaaaac agtctggcac tgttcaaatg gttaaacaca
gatttatcgt atgaatcagc 120aattccactc ccaagtatat acttaagaaa aaggaaagca
tatatccaga ccatgagcag 180tggctcatgc ctgtaattcc aacactttgg gaggccaagg
caggaagatt gcttgaggcc 240aggagttacc agcctgggca acatagcaag gccccatctc
ttagaataaa aaagaaaaag 300aaaacttacg tccaaaaaac aacctgtata caaatgttta
tggaagcatt attcttaaca 360ggaaaaagta taaacaaccc aaatgtcaat caatgcacaa
atgggtaaac aaaatgtagc 420atacccaaaa aaatggaata ttaataggct ataaaaagga
attaagtatt gatacatggt 480ataacatgga tgaaccttga aaacatcacg ctaagcgaaa
gaagccagtc acaaaagacc 540atgtattata tgactccttt catatgagtc tagaataggg
aactctatag atagaaagta 600gatcagtggt tacttaagac tgaggggttt gggggaaagg
aagatgatac taaagggtat 660atggtttctt tctgaggtaa tgaaaacata ctaaaagtaa
ctgtgatgaa ggttacacat 720atatgtgaat atactaaaaa ccactgaatt gtacacttta
aatggatgat ttgtatgtta 780tttgaattat atctcaataa agctgcttaa aaataacatt
aaataggcca ggcgcagtgg 840ctcacacctg taatcccagc actttgggag gccgaggtgg
gtggatcacc tgaggtcagt 900tcgggaccag cctgaccaac aaagtgaaac cccatctcta
ctaaaaatac aaaattagcc 960aggcgtggtg gtccatgcct gtaatcccag ctactcagaa
ggctgaggca ggagaatcac 1020ttgaacccgg gaggcagagg ttgcagtgag ccgagattgc
gccattgcac tccaccctgg 1080gcaacaagag caaaactcgt ctaaaaaaat aaaataaata
aaatttaaaa aaataaataa 1140aataacataa aataaaataa attggtaatg ataaaatcag
aacatcccat tttgcagccc 1200ctagtcaatt aaaggatcta agcacacaca tacagcctaa
cagtcagcca cacatctggg 1260cctcctgaag gaagacaact gcatcatcta tgaagcagac
tttcaaaaaa aactgaagtt 1320atatttgatt aagcctctgt gcctaactac ctatttacag
agaatacaga ggaaagggaa 1380acatggtaaa gatactatgg ggacgaaaac ggaaaaactt
gtaagactgg gaaatattaa 1440gcaacccagt ttcttcaaca tatatattat aaggagaaaa
aacatgaaag aggacctata 1500catgaaaaga gacttaaaag atttatcaac tcattctaag
gtgtgaaact tacctggatc 1560ccgatttttt taagtgtaaa aaagaaaaat catttatgac
attttgaaac tactgaaatt 1620ttaacattga ttagatatat aatatgaatt attgttaatt
ttacaggtgt gaaatggtct 1680tttgattatg ctttaaaaga gaaacaatgg gctgggcaaa
gtggctcatg cctgtaatcc 1740caacactttg ggaggtcaaa gaaggaggat tgggcaatac
agcaaggccc catctccaca 1800aaaagatttt taaaaaacag ccaggcatgg tagcatgtgc
ctacagttct agctactcca 1860gattacttaa gcgtaggaga tcaaagttac agtgggctat
gatcgtgcca ctgcactcca 1920gcctggggag cagagcaaga ccctgtctgt aaaaacaata
aaattaaatt aaattaaaat 1980ataaaacaaa atagttattg taattccagg aggaggcacc
aaagacattt tgaaaatttt 2040tattaatact gtataattat tatggaatgg tccttctcta
ctctcactac ctgcttctgt 2100aatgaaatac accagaatgc ctgacctctt tgtgtatctc
tgaatataaa cattctactt 2160tataaagcaa tagttgttta gaacaagaga gctcggagta
aaggtatgtt agaaagagat 2220actgtcacaa agtgttctgc atcacagtac gtgaagcaaa
tttccagggt attaaaaaaa 2280acaatacatt ttaggtttga aaagtacaag acctgccacg
gtggctcaca actgtaatct 2340cagcactgtg ggaagctgag gtgggtggat tgcttgagcc
caggagttcg agaccagcct 2400gggcaacatg gtgaaaccgt ttctataaaa aaaaatttgt
ctttaattag ccaggcgtgg 2460tggtacgtgc ctgtagtccc agctactcag gaggctgagg
tgggaggatc acttgagcct 2520ggagacagag gctgcagtga gctatacttg cgctactcca
ctccagcctg ggctacagag 2580tgagaccttg tctccaaaag aaaaaaaaaa aaaaaaaaag
gacaagatta agaaaacaac 2640tctatgcata ctaaaataac tctattacct tctaaaatga
tggagggaaa aactaagttg 2700gttaaaatag ctaattataa gggttttctt tatttttaaa
aatgctgaag gatatttgag 2760acgatggaaa gataattcat tttagtaatg acccttaaat
acttagcatg agatttttac 2820tctagtctgc tagaagcaaa gagccaaaga cagataagaa
aaaatgtgtg ggacagcaaa 2880aaaaggagtc cagaggctct aggaaaagca tgatggctca
gaatctccaa aaatggccta 2940tgggatatat aagaagaatc aaagatatac atatctagca
tatataatgc tcttcaatgg 3000gccactgtca gtccaatatg aaaacaacta gaagcctctt
tttcaccatc taaataccaa 3060tcctaaataa acaattttag agcagtttaa gattcacagt
aaaattatgt ttaaagtaca 3120gagagttttc atatactccc tgtccccaca cacgcacagt
ctaccc 3166
User Contributions:
Comment about this patent or add new information about this topic: