Patent application title: METHODS AND MATERIALS FOR IDENTIFYING POLYMORPHIC VARIANTS, DIAGNOSING SUSCEPTIBILITIES, AND TREATING DISEASE
Inventors:
Lawrence C. Brody (Baltimore, MD, US)
Anne Parle-Mcdermott (Dublin, IE)
John Scott (Dublin, IE)
Peadar Kirke (Dublin, IE)
James Mills (Rockville, MD, US)
Faith Pangilinan (Rockville, MD, US)
Anne Molloy (Dublin, IE)
Assignees:
The United States of America, as represented by Secretary, Department of Health and Human Services
The Provost Fellows and Scholars of the College of the Holy and Undivided Trinity of Queen Elizabeth
The Health Research Board
IPC8 Class: AC40B3000FI
USPC Class:
506 7
Class name: Combinatorial chemistry technology: method, library, apparatus method of screening a library
Publication date: 2011-07-21
Patent application number: 20110177958
Abstract:
The invention is directed to materials and methods associated with
polymorphic variants in two enzymes involved in folate-dependent and
one-carbon metabolic pathways: MTHFD1 (5,10-methylenetetrahydrofolate
dehydrogenase, 5,10-methenyltetrahydrofolate cyclohydrolase,
10-formyltetrahydrofolate synthetase) and methylenetetrahydrofolate
dehydrogenase (NADP+ dependent) 1-like (MTHFD1L). Diagnostic and
therapeutic methods are provided involving the correlation of polymorphic
variants in MTHFD1, MTHFD1, and other genes with relative susceptibility
for various pregnancy-related and other complications.Claims:
1.-2. (canceled)
3. A method of testing for an increased susceptibility for a complication related to a defect in a one-carbon metabolic pathway, the method comprising: (a) screening a sample from a subject to detect the presence or absence of a polymorphic variant of a polymorphism in at least one chromosomal copy of the MTHFD1L gene, wherein the polymorphic variant is associated with an increased susceptibility for a complication related to a defect in a one-carbon metabolic pathway; and (b) diagnosing the susceptibility of the subject for a complication related to a defect in a one-carbon metabolic pathway based on the presence or absence of the polymorphic variant of at least one chromosomal copy of the MTHFD1L gene.
4. The method of claim 3, wherein the complication is related to administration of a drug.
5. The method of claim 4, wherein the drug is selected from the group consisting of a chemotherapeutic drug, a cardiovascular drug, and a central nervous system (CNS) drug.
6. The method of claim 3, wherein the complication is selected from the group consisting of a pregnancy-related complication, miscarriage, second trimester miscarriage, placental abruption, severe placental abruption, neural tube defect, and cardiovascular disease.
7. The method of claim 6, wherein the complication is a neural tube defect, and wherein the neural tube defect is in a child of the subject screened.
8. The method of claim 7, wherein the neural tube defect is selected from the group consisting of anencephaly, encephalocele, iniencephaly, and spina bifida.
9. The method of claim 3, wherein the subject is selected from the group consisting of a female, a female of child-bearing age, a pregnant female, a female that has had complications during a previous pregnancy, a female that has had complication becoming pregnant, a male, and a gestating child.
10. The method of claim 3, wherein the sample comprises an egg, a sperm, a somatic cell, or blood.
11. The method of claim 3, wherein the polymorphism variant is selected from the group consisting of a single nucleotide polymorphism (SNP) and a short tandem repeat polymorphism (STRP).
12. The method of claim 3, wherein the polymorphic variant is present in a single chromosomal copy of the gene, and wherein heterozygosity is associated with an increased risk for the complication.
13. The method of claim 3, wherein the sample comprises two chromosomal copies of the gene, wherein the polymorphic variant is present in both chromosomal copies of the gene, wherein homozygosity of the polymorphic variant is associated with an increased risk for the complication, and wherein the complication is diagnosed if homozygosity of the polymorphic variant is detected.
14. The method of claim 3, wherein the sample comprises a nucleic acid selected from the group consisting of (a) a nucleic acid encoding MTHFD1L, (b) a fragment of (a) comprising at least 30 contiguous nucleotides of SEQ ID NO: 12 wherein the 30 contiguous nucleotides comprise the polymorphism, (c) a complement of (a) or (b), and (d) a combination of two or more of (a), (b), and (c).
15. The method of claim 3, wherein the screening comprises assaying a sample comprising a nucleic acid selected from the group consisting of (a) a nucleic acid encoding MTHFD1L, (b) a fragment of (a) comprising at least 30 contiguous nucleotides of SEQ ID NO: 12 wherein the 30 contiguous nucleotides comprise the tandem repeated "ATT" sequence of the rs3832406 polymorphism of MTHFD1L, (c) a complement of (a) or (b), and (d) a combination of two or more of (a), (b), and (c).
16. The method of claim 15, wherein the polymorphic variant is an "ATT" tandem repeat of rs3832406, and the diagnosis is for an increased susceptibility for the condition if the tandem repeat comprises seven or fewer "ATT" repeats.
17. The method of claim 15, wherein the polymorphic variant causes an alternative splicing of a mRNA derived from the MTHFD1 L gene or a decrease in the synthetase activity of the MTHFD1 L gene product.
18. The method of claim 3, wherein the sample is further screened for a polymorphic variant of a polymorphism in a gene selected from the group consisting of the MTHFR gene, the coagulation factor II gene, the coagulation factor V gene, and the transcobabalamin II gene, and wherein the polymorphic variant is associated with the complication.
19. The method of claim 3, wherein the sample is screened using a method selected from the group consisting of a nucleic acid array, allele-specific-oligonucleotide (ASO) hybridization, PCR-RFLP analysis, PCR, single-strand conformation polymorphic variant (SSCP) technique, an amplification refractory mutation system (ARMS) technique, nucleotide sequencing, an antibody specific to the protein encoded by the polymorphic variant containing gene, and mass spectrometry.
20.-23. (canceled)
Description:
CROSS REFERENCE TO RELATED APPLICATIONS
[0001] This patent application is a divisional of copending U.S. patent application Ser. No. 11/958,126, filed Dec. 17, 2007, which is a continuation-in-part of International Patent Application No. PCT/US2005/021288, filed Jun. 16, 2005, which are incorporated by reference herein.
INCORPORATION-BY-REFERENCE OF MATERIAL ELECTRONICALLY FILED
[0002] Incorporated by reference in its entirety herein is a computer-readable nucleotide/amino acid sequence listing submitted concurrently herewith and identified as follows: One 493,113 Byte ASCII (Text) file named "707465ST25.TXT," created on Jan. 7, 2011.
BACKGROUND OF THE INVENTION
[0003] An important enzyme involved in one carbon metabolism is the NADP- dependent trifunctional enzyme MTHFD1 (5,10-methylenetetrahydrofolate dehydrogenase, 5,10-methenyltetrahydrofolate cyclohydrolase, 10-formyltetrahydrofolate synthetase) [Hum et al., J. Biol. Chem., 263:15946-15950 (1988)]. MTHFD1 is often referred to as the "C1-THF synthase" and catalyses the interconversion of tetrahydrofolate to 10-formyl, 5,10-methenyl, and 5,10-methylene derivatives. These derivatives form an important part of de novo DNA synthesis. Promotion of DNA synthesis is desirable in placental and fetal development. In other contexts, such as cancer treatment, blocking of DNA synthesis is desirable. Maternal folate status and/or homocysteine levels have been implicated in a range of pregnancy-related complications, most notably in pregnancies affected by a neural tube defect (NTD).
[0004] A polymorphic variant at position 1958 of MTHFD1 at which guanine is replaced with an adenosine results in the substitution of a conserved arginine amino acid with a glutamine at position 653. One study has disclosed that this polymorphic variant is a maternal risk factor for neural tube defects (NTDs) [Brody et al., Am. J. Hum. Genet., 71:1207-1215 (2002)]. Neural tube defects (NTDs) are common congenital malformations that can be presented as anencephaly, encephalocele, and spina bifida. NTDs' etiology likely includes both genetic and environmental factors. Intervention trials have shown that maternal supplementation with folic acid in the period before pregnancy can prevent the majority of NTD-affected pregnancies.
[0005] Abruptio placentae or placental abruption is thought to arise from a sudden rupture of the spiral arteries, resulting in the premature separation of a normally implanted placenta [Anath et al., Obstet. Gynecol., 88:309-318 (1996); Eskes, Eur. J. Obstet. Gyn. R. B., 95:206-212 (2001)]. This event leads to increased risk of adverse outcomes to both mother and baby. The underlying cause of abruptio placentae is unknown, but several factors have been suggested to increase risk including folate deficiency, hyperhomocysteinemia, preeclampsia and history of a prior pregnancy abruption [Kramer et al., Obstet. Gynecol., 89:221-226 (1997); Misra et al., J. Clin. Epidemiol., 52:453-461 (1999); Ray et al., Placenta, 20:519-529 (1999); Eskes, Eur. J. Obstet. Gyn. R. B., 95:206-212 (2001).] Non-genetic risk factors have been described including cigarette smoking, preeclampsia and increased maternal age [Misra et al., J. Clin. Epidemiol., 52:453-461 (1999); Eskes, Eur. J. Obstet. Gyn. R. B., 95:206-212 (2001)]. Additional risk factors include elevated homocysteine [Goddijn-Wessel et al., Br. Med. J., 2:1431-1436 (1996); van der Molen, et al., Am. J. Obstet. Gynecol., 182:1258-1263 (2000)] and low folate levels [Hibbard et al., Br. Med. J., 2:1431-1436 (1963); Streiff et al., N. Engl. J. Med., 276:776-779 (1967); Whalley et al., Am. J. Obstet. Gynecol., 105:670-678 (1969); Hibbard, S. Afr. Med. J., 49:1223-1226 (1975); Goddijn-Wessel et al., Br. Med. J., 2:1431-1436 (1996)].
[0006] A substantial proportion (15-50%) of second trimester pregnancy losses remain unexplained [Gaillard et al., Arch. Pathol. Lab. Med, 117:1022-1026 (1993); Faye-Petersen et al., Obstet. Gynecol., 94:915-920 (1999); Incerpi et al., Am. J. Obstet. Gynecol., 178:1121-1125 1998); Drakeley et al., Hum. Reprod., 13:1975-1980 (1998)]. Although placental insufficiency is a common finding in these cases [Faye-Petersen et al., Obstet. Gynecol., 94:915-920 (1999)], its etiology is often unknown. Sub-optimal folate or B12 metabolism due to either a deficient diet or a genetic predisposition appears to increase the risk of a number of pregnancy complications including spontaneous abortion.
[0007] Polymorphisms have been studied in the context of a variety of cancers and other diseases. [See, e.g., Chen, et al., Int. J. Cancer, 110:617-620 (2004), Krajinovic et al., Pharmacogenomics J., 4:66-72 (2004); U.S. Pat. Nos. 5,449,605; 5,688,647; 5,719,026; 5,942,390; 6,294,399; 6,312,898; 6,537,759; 6,548,245; 6,627,401; 6,664,062; 6,759,200; 6,818,758; 6,833,243, and 6,872,533; and U.S. Patent Application Publication Nos: 2005/0084849; 2005/0089905; 2005/0095593; and 2005/0112680.
[0008] Methylenetetrahydrofolate dehydrogenase (NADP+ dependent) 1-like (MTHFD1L) is a trifunctional enzyme localized to mitochondria which has been reported to have one or more enzymatic activities in common with MTHFD1. MTHFD1L has been shown to be transcribed into two mRNA transcripts: 1.1 kb and 3.6 kb in size. The shorter transcript is produced by splicing Exon7 with alternative Exon8A and therefore it lacks the 10-formyltetrahydrofolate synthase (synthetase) sequence [Prasannan et al., J. Biol. Chem., 278(44):43178-43187 (2003)].
[0009] Methylenetetrahydrofolate reductase (MTHFR) is involved in the remethylation of homocysteine to methionine by generating the necessary methyl group donor 5-methyltetrahydrofolate from 5,10-methylenetetrahydrofolate. Polymorphisms within MTHFR have been extensively studied in relation to a wide variety of diseases including NTDs, cancer and pre-eclampsia. The MTHFR polymorphic variant at 677, where a C is replaced with a T, has a functional effect on the MTHFR enzyme and is associated with elevated plasma homocysteine levels when folate status is low. A relatively small number of studies have examined the MTHFR polymorphisms in relation to abruptio placentae and results have not yielded a clear indication of increased risk for the disorder.
[0010] Coagulation factor II is proteolytically cleaved to form thrombin in the first step of the coagulation cascade, which ultimately results in the stemming of blood loss. F2 also plays a role in maintaining vascular integrity during development and postnatal life. Mutations in F2 can lead to various forms of thrombosis and dysprothrombinemia.
[0011] Coagulation factor V (F5) is an essential factor of the blood coagulation cascade, and circulates in plasma, and is converted to the active form by the release of the activation peptide by thrombin during coagulation. Once activated, factor V is a cofactor that participates with activated coagulation factor X to activate prothrombin to thrombin. Defects in this gene result in either an autosomal recessive hemorrhagic diathesis or an autosomal dominant form of thrombophilia, which is known as activated protein C (APC) resistance. A variant of factor V with a particular single point mutation associated with APC resistance is known as factor V Leiden [Bertina et al., Nature, 369:64-67 (1994)].
[0012] Transcobalamin II (TCNII) is a member of the vitamin B12-binding protein family. TCNII binds cobalamin and mediates the transport of cobalamin into cells. TCNII polymorphic variant 776C>G (P259R) has been reported to confer an increased fetal genetic risk of early spontaneous abortion [Zetterberg et al., Hum. Reprod., 17:3033-3036 (2002)] and influence levels of circulating vitamin B12 bound to TCNII [Afman et al., Eur. J. Hum. Genet., 10:433-438 (2002), Miller et al., Blood, 100:718-720 (2002)]. TCNII 776C>G polymorphic variant may interact with the MTHFR 677TT genotype to confer an even higher fetal genetic risk of spontaneous abortion than either polymorphism separately [Zetterberg et al., Hum. Reprod., 18:1948-1950 (2003)].
[0013] Chemotherapy of cancer has involved use of highly toxic drugs with narrow therapeutic indices, and, most adult solid cancers remain highly resistant to treatment. Chemotherapy often results in a significant fraction of treated patients suffering unpleasant or life-threatening side effects while receiving little or no clinical benefit; other patients may suffer few side effects and/or have complete remission or even cure. Chemotherapy is also expensive, based not only on the cost of drugs, but the medical care involved with their administration. Tests are needed that better predict chemotherapy efficacy; such tests would allow for more selective use of toxic drugs. In those cases where toxicity of chemotherapy or other drug regemin is at least partially a result of genetic differences, the identification of relevant polymorphic variants will allow for more effective and safer drug use.
[0014] Accordingly, to better diagnose and treat pregnancy complications and neoplastic disorders, cardiovascular disorders, Alzheimer's disease and other conditions associated with one carbon metabolic pathways, there is a need to identify polymorphic variants that indicate a relative susceptibility to such diseases and/or relative response to treatments for such diseases.
BRIEF SUMMARY OF THE INVENTION
[0015] The invention provides a method of screening for an increased susceptibility for at least one pregnancy-related complication selected from the group consisting of second trimester miscarriage and placental abruption. A sample from a subject is screened to detect the presence or absence of a polymorphic variant of a polymorphism in at least one chromosomal copy of the MTHFD1 gene, wherein the polymorphic variant is associated with an increased susceptibility for at least one pregnancy-related complication selected from the group consisting of second trimester miscarriage and placental abruption. The susceptibility of the subject for at least one pregnancy-related complication selected from the group consisting of second trimester miscarriage and placental abruption is diagnosed based on the presence or absence of the polymorphic variant of at least one chromosomal copy of the MTHFD1 gene.
[0016] The invention provides a method of testing for an increased susceptibility for a complication related to a defect in a one-carbon metabolic pathway. A sample from a subject is screened to detect the presence or absence of a polymorphic variant of a polymorphism in at least one chromosomal copy of the MTHFD1L gene, wherein the polymorphic variant is associated with an increased susceptibility for a complication related to a defect in a one-carbon metabolic pathway. The susceptibility of the subject for a complication related to a defect in a one-carbon metabolic pathway is diagnosed based on the presence or absence of the polymorphic variant of at least one chromosomal copy of the MTHFD1L gene.
[0017] The invention provides a kit. The kit includes a nucleic acid comprising at least 30 nucleotides of SEQ ID NO: 2 or a complement thereof, the nucleic acid further comprising the sequence of a polymorphic variant associated with an increased susceptibility for at least one pregnancy-related complication selected from the group consisting of second trimester miscarriage, placental abruption, and severe placental abruption. The kit includes instructions for screening a sample from a subject using the nucleic acid. The kit further includes instructions for diagnosing an increased susceptibility for one of said pregnancy-related complications if the polymorphic variant of at least one chromosomal copy of the MTHFD1 gene is detected in the sample.
[0018] The invention provides a kit. The kit includes a nucleic acid comprising at least 30 nucleotides of SEQ ID NO: 12 or a complement thereof, the nucleic acid further comprising the sequence of a polymorphic variant associated with an increased susceptibility for at least one complication related to a defect in a one-carbon metabolic pathway. The kit includes instructions for screening a sample from a subject using the nucleic acid. The kit further includes instructions for diagnosing an increased susceptibility for a complication related to a defect in a one-carbon metabolic pathway if the polymorphic variant of at least one chromosomal copy of the MTHFD1L gene is detected in the sample.
[0019] In addition to the foregoing, the invention includes, as an additional aspect, all embodiments of the invention narrower in scope in any way than the variations specifically mentioned above. Although the applicant(s) invented the full scope of the claims appended hereto, the claims appended hereto are not intended to encompass within their scope the prior art work of others. Therefore, in the event that statutory prior art within the scope of a claim is brought to the attention of the applicants by a Patent Office or other entity or individual, the applicant(s) reserve the right to exercise amendment rights under applicable patent laws to redefine the subject matter of such a claim to specifically exclude such statutory prior art or obvious variations of statutory prior art from the scope of such a claim. Variations of the invention defined by such amended claims also are intended as aspects of the invention.
DETAILED DESCRIPTION OF THE INVENTION
Determination of Polymorphisms
[0020] This invention involves one or more polymorphic variants useful in the field of diagnostics and therapeutics for optimizing efficacy and safety of drug therapy for specific diseases or conditions and for establishing diagnostic tests for pregnancy-related and other complications affected by one carbon metabolic pathways. Methods are presented for identifying polymorphic variants and determining their utility in diagnostic and therapeutic methods, along with probes, kits, and related materials that are useful, for example, in identifying the presence and genotype of a particular polymorphic variant in an individual.
[0021] In identifying new correlations between polymorphic variants and disease susceptibilities and treatment approaches, different population groups based on racial, ethnic, gender, and/or geographic origin can be studied. Individuals with a particular disease or condition of interest or altered relative susceptibility thereto can have a higher frequency of certain polymorphic variants than the general population. The polymorphic variants can be predictive of differential, increased or decreased, susceptibility to various disease states, conditions, and complications, independent of ethnicity, race, or geographic origin, even if the polymorphic variant and disease association was originally identified in a particular population, for example, European, Celtic, and Irish populations. Distributions for some of the polymorphic variants are discussed herein.
[0022] "Differential" or "differentially" generally refers to a statistically significant different level in the specified property or effect. Preferably, the difference is also functionally significant. "Differential binding or hybridization" is a sufficient difference in binding or hybridization to allow discrimination using an appropriate detection technique. "Differential effect" or "differentially active" in connection with a therapeutic treatment or drug refers to a difference in the level of the effect or activity which is distinguishable using relevant parameters and techniques for the effect or activity being considered. In some embodiments, the difference in effect or activity is also sufficient to be clinically significant, such that a corresponding difference in the course of treatment or treatment outcome would be expected, at least on a probabilistic basis.
[0023] "Population" refers to a geographically, ethnically, racially, gender, and/or culturally defined group of individuals or a group of individuals with a particular disease or condition or individuals that may be treated with a specific drug. In most cases a population will preferably encompass at least one hundred, one thousand, ten thousand, one hundred thousand, one million, ten million, or more individuals, with the larger numbers being more preferable. In some embodiments, the population refers to individuals with relative susceptibility to a specific disease or condition and/or amenability to a particular drug regimen. The frequency of one or more polymorphic variants that is predictive of a differential susceptibility to a disease response and/or a response to a particular treatment is determined in one or more populations using a diagnostic test.
[0024] Nucleic acid samples, for use in polymorphic variant identification, can be obtained from a variety of sources as known to those skilled in the art, or can be obtained from genomic or cDNA sources by known methods. For example, the Coriell Cell Repository (Camden, N.J.) maintains over 6,000 human cell cultures, mostly fibroblast and lymphoblast cell lines comprising the NIGMS Human Genetic Mutant Cell Repository. A catalog (http://locus.umdnj.edu/nigms) provides racial or ethnic identifiers for many of the cell lines. Cell lines may also be obtained from the Beijing Cancer Institute.
[0025] "Allele frequency" is the fraction of genes in a population that have one specific polymorphic variant or set of polymorphic variants. The allele frequencies for any gene should sum to 1. In some embodiments, a polymorphic variant has an allele frequency of at least 0.001, 0.01, 0.05, or 0.10. Another measure of frequency known in the art is the "heterozygote frequency" namely, the fraction of individuals in a population who carry two alleles, or two forms of a particular polymorphic variant or variant form of a gene, one inherited from each parent. Alternatively, the number of individuals who are homozygous for a particular form of a gene may be a useful measure. The relationship between allele frequency, heterozygote frequency, and homozygote frequency is described for many genes by the Hardy-Weinberg equation. Most human polymorphic variants are substantially in Hardy-Weinberg equilibrium. The allele frequency, heterozygote frequency, or homozygote frequency can be determined experimentally.
[0026] To establish the association between a specific condition and one or more polymorphic variants, a study is commonly performed in controlled clinical trials using a limited number of patients that are considered to be representative of the population with the disease or relative susceptibility for the same. The populations should preferably be large enough to have a reasonable chance to find correlations between a particular genetic variant and susceptibility to the disease of interest. In addition, the allele frequency of the genetic variant in a population or subpopulation with the disease or pathology should vary from its allele frequency in the population without the disease pathology (control population) by at least 1%, by at least 2%, by at least 4%, or by at least 8%.
[0027] The association between case-control status and genotype can be examined using a number of standard odds ratios. In order to have a common approach for all analyses, a log linear model can be employed. The statistical software (SAS PROC NLMIXED) allows estimation of nonlinear functions of the parameters of the model, and provides standard errors calculated using the delta method [Agresti, Categorical Data Analysis (1990)]. The parameterization of the model can easily be modified for the computation of different odds ratios. This approach enables the researcher to estimate log odds ratios and their standard errors for the computation of confidence intervals, as well as to check the goodness of fit of different models. Potential gene-gene interaction effects can also be examined. Tests of interactive dominant or recessive effects of specific combined genotypes can be performed using a series of non-hierarchical logistic regression models [Piegorsch et al., Stat. Med., 13:153-162 (1994)]. Statistical significance can be assessed using likelihood ratio chi-square tests.
[0028] The polymorphism variant(s) showing the strongest correlation with an altered relative susceptibility for a disease state within a given gene are likely either to have a causative role in the manifestation of the phenotype or to be in linkage disequilibrium with the causative variants. Such a role can be confirmed by in vitro gene expression of the variant gene or by producing a transgenic animal expressing a human gene bearing such a polymorphic variant and determining whether the animal develops a relevant disease. Polymorphic variants in coding regions that result in amino acid changes can change relative susceptibility for a disease state by decreasing, increasing, or otherwise altering the activity of the protein encoded by the gene in which the polymorphism occurs. Polymorphic variants in coding regions that introduce stop codons can change relative susceptibility for a disease state by reducing (heterozygote) or eliminating (homozygote) functional protein produced by the gene. In some embodiments, stop codons result in production of a truncated peptide with aberrant activities relative to the full-length protein. Polymorphisms in regulatory regions can change relative susceptibility for a disease state by causing increased or decreased expression of the protein encoded by the gene in which the polymorphism occurs. Polymorphic variants in intronic or untranslated sequences can change relative susceptibility for a disease state either through the same mechanism as polymorphic variants in regulatory sequences or by causing altered splicing patterns resulting in an altered protein.
Types of Polymorphisms
[0029] As used herein, a "gene" is a sequence of DNA present in a cell that directs the expression of a "gene product," most commonly by transcription to produce RNA and translation to produce protein. An "allele" is a particular form of a gene. The term allele is relevant when there are two or more forms of a particular gene. Genes and alleles are not limited to the open reading frame of the genomic sequence or the cDNA sequence corresponding to processed RNA. A gene and allele can also include sequence upstream and downstream of the genomic sequence such as promoters and enhancers. The terms "gene product," or "polymorphic variant allele product" refer to a product resulting from transcription of a gene. Gene and polymorphic variant allele products include partial, precursor, and mature transcription products such as pre-mRNA and mRNA, and, translation products with or without further processing including, without limitation, lipidation, phosphorylation, glycosylation, other modifications known in the art, and combinations of such processing. RNA may be modified without limitation by complexing with proteins, polyadenylation, splicing, capping or export from the nucleus.
[0030] A "polymorphism" is a site in the genome that varies between two or more individuals or within an individual in the case of a heterozygote. The frequency of the variation can be defined above a specific value for inclusion of variations generally observed in a population as opposed to random mutations. Polymorphisms that can be screened according to the invention include variation both inside and outside the open reading frame. When outside the reading frame the polymorphism can occur within 200, 500, 1000, 2000, 3000, 5000, or more nucleotides of either the 5' or 3' end of the open reading frame. When inside the reading frame, the polymorphism may occur within an exon or intron, or overlapping an exon/intron boundary. A polymorphism could also overlap the open reading frame and sequence outside of that frame. Many polymorphisms have been given a "rs" designation in the SNP database of NCBI's Entrez, some of these designations have been provided herein for the polymorphisms that can be screened according to the invention.
[0031] A "polymorphic variant" is a particular form or embodiment of a polymorphism. For example if the polymorphism is a single nucleotide polymorphism, a particular variant could potentially be an "A" (adenosine), "G" (guanine), "T" (thymine), and "C" (cytosine). When the variant is a "T", it is understood that a "U" can occur in those instances wherein the relevant nucleic acid molecule is RNA, and vice versa in respect to DNA. The convention "PositionNUC1>NUC2" is used to indicate a polymorphism contrasting one variant from another. For example, 242A>C would refer to a cytosine instead of an adenosine occurring at position 242 of a particular nucleic acid sequence. When 242A>C is used in respect to a mRNA/cDNA, it can also be used to represent the polymorphism as it occurs in the genomic DNA with the understanding that the position number will likely be different in the genome. Sequence and polymorphic location information for both coding domain sequence and genomic sequence is described herein for the genes relevant to the invention. "Polymorphic variant allele" refers to an allele comprising a particular polymeric variant or a particular set of polymorphic variants corresponding to a particular set of polymorphisms. Two alleles can both be considered the same polymorphic variant allele if they share the same variant or set of variants defined by the polymorphic variant allele even though they may differ in respect to other polymorphisms or variation outside the definition. For a mutation at the amino acid level, the convention "AA1PositionAA2" is used. For example, in the context of amino acid sequence, M726L, would indicate that the underlying, nucleotide level polymorphism(s) has resulted in a change from a methionine to a leucine at position 726 in the amino acid sequence.
[0032] A "genotype" can refer to a characterization of an individual's genome in respect to one or both alleles and/or one or more polymorphic variants within that allele. A subject can be characterized at the level that the subject contains a particular allele, or at the level of identifying both members of an allelic pair, the corresponding alleles on the set of two chromosomes. One can also be characterized at the level of having one or more polymorphic variants. The term "haplotype" refers to a cis arrangement of two or more polymorphic variants, on a particular chromosome such as in a particular gene. The haplotype preserves the information of the phase of the polymorphic nucleotides--that is, which set of polymorphic variants were inherited from one parent, and which from the other. Wherein methods, materials, and experiments are described for the invention in respect to polymorphic variants, one will understand that can also be adapted for use with an analogous haplotype.
[0033] A single nucleotide polymorphism (SNPs) refers to a variation at a single nucleotide location. In some cases the variations at the position could be any one of the four nucleotide bases, in others the variation is some subset of the four bases. For example, the variation could be between either purine base or either pyrimidine base. Simple-sequence length polymophisms (SSLPs) or short tandem repeat polymorphisms (STRPs) involve the repeat of a particular sequence of one or more nucleotides. A restriction fragment length polymorphism (RFLP) is a variation in the genetic sequence that results in the appearance or disappearance of an enzymatic cleavage site depending on which base(s) are present in a particular allele.
[0034] A diagnosis for a given susceptibility in accordance with this invention includes detection of homozygosity and/or heterozygosity for a given polymorphism(s). Heterozygosity and homozygosity are relevant wherein the cell tested has two chromosomal copies. In other contexts, such as in a sperm or egg, only a single chromosome is present so that the issue of homozygosity or heterozygosity does not directly present itself. In the some embodiments, such as those involving cancer, homozygosity or heterozygosity can be lost or at least obscured because of deletion or inactivation of one of the two gene copies.
[0035] In those embodiments where a sample is screened to detect the presence or absence of more than one polymorphic variant associated with a given condition, the combination of the polymorphic variants can be additive, synergistic, or even antagonists in regards to correlative strength--although not overly antagonist if the susceptibility or drug effect probability is lost. When screening for multiple polymorphisms all can be heterozygous, all can be homozygous, or a combination with one or more polymorphism homozygous, and one or more polymorphism heterozygous, depending on the particular susceptibility relationship for a given set of polymorphic variants and a condition or drug response.
[0036] The polymorphic variants described herein can be associated with an altered susceptibility to one or more complications and/or therapeutic treatments. How a polymorphism is associated with this susceptibility need not be known for the usefulness and operability of the invention. The polymorphism need not actually cause or contribute to etiology or severity of the condition. In some embodiments, the polymorphism can cause or contribute to the condition. In some embodiments, the polymorphism serves as a marker for another polymorphism(s) responsible for causing or contributing to the condition. In such a situation, the polymorphism(s) screened for can be in linkage disequilibrium with the responsible polymorphism(s).
[0037] Linkage is the tendency of genes or DNA sequences, for example, polymorphisms, to be inherited together as a consequence of their physical proximity on a single chromosome. The closer together the markers are, the lower the probability that they will be separated during DNA crossing over, and hence the greater the probability that they will be inherited together. If a mutational event introduces a "new" allele in the close proximity of a gene or an allele, the new allele will tend to be inherited together with the alleles present on the "ancestral," chromosome or haplotype. However, the resulting association, called linkage disequilibrium, will decline over time due to recombination. Linkage disequilibrium has been used to map disease genes. In general, both allele and haplotype frequencies differ among populations. Linkage disequilibrium is varied among the populations, being absent in some and highly significant in others.
[0038] Linkage disequilibrium (LD) or allelic association means the preferential association of a particular allele or genetic marker with a specific allele, or genetic marker at a nearby chromosomal location more frequently than expected by chance for any particular allele frequency in the population. For example, if locus P has alleles x and y, which occur with equal frequency, and linked locus Q has alleles w and z, which occur with equal frequency, one would expect the haplotype ac to occur with a frequency of 0.25 in a population of individuals. If xw occurs more frequently, then alleles x and w are considered in linkage disequilibrium. Linkage disequilibrium may result from natural selection of a certain combination of alleles or because an allele has been introduced into a population too recently to have reached equilibrium between linked alleles.
[0039] A marker in linkage disequilibrium with disease predisposing variants can be particularly useful in detecting susceptibility to disease or association with sub-clinical phenotypes notwithstanding that the marker does not cause the disease. For example, a marker P that is not itself a causative element of a disease, but which is in linkage disequilibrium with a gene Q that is a causative element of a phenotype, can be used to indicate susceptibility to the disease in circumstances in which the gene Q may not have been identified or may not be readily detectable. Relatively young evolutionarily alleles are expected to have a larger genomic segment in linkage disequilibrium. The age of an allele can be determined from whether the allele is shared among different human ethnic groups and/or between humans and related species.
[0040] The polymorphisms described herein can also be used to establish physical linkage between a genetic locus associated with a trait of interest and polymorphic markers that are not associated with the trait, but are in physical proximity with the genetic locus responsible for the trait and co-segregate with the responsible variation. Such analysis is useful for mapping a genetic locus associated with a phenotypic trait to a chromosomal position and thereby cloning gene(s) responsible for the trait [Landau et al., Proc. Natl. Acad. Sci. (USA), 83:7353-7357 (1986); Landau et al., Proc. Natl. Acad. Sci. (USA), 84:2363-2367 (1987); Donis-Keller et al., Cell, 51:319-337 (1987); Landau et al., Genetics, 121:185-199 (1989))]. Genes localized by linkage can be cloned by a process known as directional cloning. [Wainwright, Med. J. Australia, 159:170-174 (1993); Collins, Nature Genetics, 1:3-6 (1992)]. Linkage studies can be performed on members of a family. Available members of the family are characterized for the presence or absence of a phenotypic trait and for a set of polymorphic markers. The distribution of polymorphic markers in an informative meiosis is then analyzed to determine which polymorphic markers co-segregate with a phenotypic trait. [See, e.g., Kerem et al., Science, 245:1073-1080 (1989); Monaco et al., Nature, 316:842 (1985); Yamoka et al., Neurology, 40:222-226 (1990); Rossiter et al., FASEB J., 5:21-27 (1991).]
[0041] Linkage is analyzed by calculation of lod (log of the odds) values. A lod value is the relative likelihood of obtaining observed segregation data for a marker and a genetic locus when the two are located at a recombination fraction 0, versus the situation in which the two are not linked, and thus segregating independently [Thompson & Thompson, Genetics in Medicine (5th ed, W. B. Saunders Company, Philadelphia, 1991); Strachan, "Mapping the Human Genome" in The Human Genome (BIOS Scientific Publishers Ltd, Oxford), Chapter 4]. A series of likelihood ratios are calculated at various recombination fractions (O), ranging from θ=0.0 (coincident loci) to θ=0.50 (unlinked). The computed likelihoods are usually expressed as the log10 of this ratio, known as a "lod" score. For example, a lod score of 3 indicates 1000:1 odds against an apparent observed linkage being a coincidence. The use of logarithms allows data collected from different families to be combined by simple addition. Computer programs are available for the calculation of lod scores for differing values of 0, for example, LIPED, MLINK [Lathrop, Proc. Nat. Acad. Sci. (USA), 81:3443-3446 (1984)]. For any particular lod score, a recombination fraction may be determined from mathematical tables. [See Smith et al., Mathematical tables for research workers in human genetics (Churchill, London, 1961); Smith, Ann. Hum. Genet., 32:127-150 (1968).] The value of 0 at which the lod score is the highest is considered to be the best estimate of the recombination fraction. Positive lod score values suggest that the two loci are linked, whereas negative values suggest that linkage is less likely (at that value of θ) than the possibility that the two loci are unlinked. By convention, a combined lod score of +3 or greater (equivalent to greater than 1000:1 odds in favor of linkage) is considered definitive evidence that two loci are linked. Similarly, by convention, a negative lod score of -2 or less is taken as definitive evidence against linkage of the two loci being compared. Negative linkage data are useful in excluding a chromosome or a segment thereof from consideration. The search focuses on the remaining non-excluded chromosomal locations.
[0042] In those embodiments where the screened for polymorphic variant(s) is responsible in part or whole for the condition(s), the polymorphic variant(s) can result in a change in the steady state level of mRNA, for example, through a decrease in transcription and/or mRNA stability. Some polymorphic variants can alter the exon/intron boundary and/or effect how splicing occurs. When the polymorphic variant occurs within or overlaps with the protein-encoding sequence of the gene, the polymorphic variant may be silent resulting in no change at the amino acid level, result in a change of one or more amino acid residues, a deletion of one or more amino acids, addition of one or more amino acids, or some combination of such changes. For some polymorphic variants, the result is premature termination of translation. The effect may be neutral, beneficial, or detrimental, or both beneficial and detrimental, depending on the circumstances. Polymorphic variants occurring in noncoding regions can exert phenotypic effects indirectly via influence on replication, transcription, and translation. Polymorphic variants in DNA can affect the basal transcription or regulated transcription of a gene locus. Such polymorphic variants may be located in any part of the gene but are most likely to be located in the promoter region, the first intron, or in 5' or 3' flanking DNA, where enhancer or silencer elements may be located. A single polymorphism can affect more than one phenotypic trait. A single phenotypic trait may be affected by polymorphisms in different genes. Some polymorphisms predispose an individual to a distinct mutation that is causally related to a certain phenotype.
[0043] Determining what effect, if any, a polymorphic variant has on the disease state, condition, or complication with which it is correlated, can be useful in the context of certain aspects of the invention, for example, choosing a proper therapy. Methods for analyzing transcription are well known to those skilled in the art. Transcriptional run off assay is one useful method. Detailed protocols for useful methods can be found in texts such as: Current Protocols in Molecular Biology edited by: F. M. Ausubel, R. Brent, R. E. Kingston, D. D. Moore, J. G. Seidman, K. Struhl, John Wiley & Sons, Inc. (1999), or Molecular Cloning: A Laboratory Manual by J. Sambrook, E. F. Fritsch and T. Maniatis, Cold Spring Harbor Laboratory Press, 2nd edition (1989).
[0044] RNA polymorphic variants can affect a wide range of processes including RNA splicing, polyadenylation, capping, export from the nucleus, interaction with translation initiation, elongation or termination factors, or the ribosome, or interaction with cellular factors including regulatory proteins, or factors that may affect mRNA half life. An effect of polymorphic variants on RNA function can ultimately be measurable as an effect on RNA levels--either basal levels or regulated levels or levels in some abnormal cell state. One method for assessing the effect of RNA polymorphic variants on RNA function is to measure the levels of RNA produced by different alleles in one or more conditions of cell or tissue growth. Such measuring can be done by conventional methods such as Northern blots or RNAase protection assays, which can employ kits available from Ambion, Inc., or by methods such as the Taqman assay, or by using arrays of oligonucleotides or arrays of cDNAs or other nucleic acids attached to solid surfaces, such as a multiplex chip. Systems for arraying cDNAs are available commercially from companies such as Nanogen and General Scanning. Complete systems for gene expression analysis are available from companies such as Molecular Dynamics. See also supplement to volume 21 of Nature Genetics entitled "The Chipping Forecast." Additional methods for analyzing the effect of polymorphic variants on RNA include secondary structure probing, and direct measurement of half life or turnover. Secondary structure can be determined by techniques such as enzymatic probing with use of enzymes such as T1, T2, and S1 nuclease, chemical probing or RNAase H probing using oligonucleotides. Some RNA structural assays can be performed in vitro or on cell extracts.
[0045] To determine if one or more polymorphic variants have an effect on protein levels and/or activity, a variety of techniques may be employed. The in vitro protein activity can be determined by transcription or translation in bacteria, yeast, baculovirus, COS cells (transient), CHO, or study directly in human cells. Further, one can perform pulse chase experiments for the determination of changes in protein stability such as half life measurements. One can manipulate the cell assay to address grouping the cells by genotypes or phenotypes. For example, identification of cells with different genotypes and phenotype can be performed using standardized laboratory molecular biological protocols. After identification and grouping, one skilled in the art could determine whether there exists a correlation between cellular genotype and cellular phenotype.
[0046] Correlation between one or more polymorphic variants can be performed for a population of individuals who have been tested for the presence or absence of a pregnancy complication or a disease state such as cancer or an intermediate phenotype. Correlation can be performed by standard statistical methods including, but not limited to, chi-squared test, Analyses of polymorphic variant, parametric linkage analysis, non-parametric linkage analysis, etc. and statistically significant correlations between polymorphic form(s) and phenotypic characteristics also can be used.
Genes and Polymorphic Variants
[0047] MTHFD1
[0048] Methylenetetrahydrofolate dehydrogenase (NADP+ dependent) 1, methenyltetrahydrofolate cyclohydrolase, formyltetrahydrofo late synthetase (MTHFD1) is a trifunctional enzyme localized to the cytoplasm. MTHFD1 has the further aliases HGNC:7432, MTHFC, and MTHFD. MTHFD1 has the further designations 5,10-methylenetetrahydro fo late dehydrogenase, 5,10-methylenetetrahydrofolate cyclohydrolase, 10-formyltetrahydrofolate synthetase; C1-THF synthase; MTHFC; MTHFD; NADP-dependent cyclohydrolase/formyltetrahydrofolate synthetase; cytoplasmic C-1-tetrahydro folate synthase; methylenetetrahydrofolate dehydrogenase (NADP+ dependent), methenyltetrahydrofolate cyclohydrolase, formyltetrahydrofolate synthetase; methylenetetrahydrofolate dehydrogenase 1. MTHFD1 has been assigned Gene ID 4522, and is positioned on chromosome 14 at locus 14q24. Further information for MTHFD1 is found on the NCBI website in the Entrez Gene database and Online Mendelian Inheritance in Man (OMIM) website under entry +172460.
[0049] MTHFD1 nucleic acid and amino acid sequences relevant to the invention include genomic, cDNA, and fragments thereof. The particular sequences identified herein by sequence identification number and/or accession number are representative of MTHFD1 sequences. One of skill in the art can appreciate that there can be variability in the gene or gene fragment distinct from the polymorphism(s) of interest and that such allelic variants still fall within the scope of the invention. As the polymorphism will be reflected in both strands of the DNA, the screening in the context of the invention can involve one or both of the strand sequences. Accordingly, where the sequence for a given strand is provided, the invention also includes the use of its complement.
[0050] The following are representative sequences for MTHFD1. NM005956 includes coding nucleic acid sequence of MTHFD1 (SEQ ID NOS: 1 and 2, with SEQ ID NO: 2 providing the nucleic acid sequence of the coding region) and also provides the amino acid sequence of MTHFD1, which is the translation of the coding region (SEQ ID NO: 3). Other relevant sequence information includes J04031; NP005947; BC001014, AAH01014; BC009806; AAH09806; BC050420; AAH50420; J04031; AAA59574; P11586. Screening with a fragment of at least 30 nucleic acids is within the scope of the invention, however, smaller fragments are also possible provided that they comprise the relevant polymorphism(s) and provide a sequence unique in the human genome. An example of such a fragment is provided in SEQ ID NO: 4. The genomic sequence is provided in SEQ ID NO: 5 and corresponds to positions 63924886 and 63996474 inclusive in NC--000014. Screening with a genomic fragment of at least 30 nucleic acids is within the scope of the invention, however, smaller fragments are also possible provided that they comprise the relevant polymorphism(s) and provide a sequence unique in the human genome. An example of such a fragment is also provided in SEQ ID NO: 4. SEQ ID NOS: 1-5 indicate the variability corresponding to the MTHFD1 1958G>A polymorphism (at the nucleotide level: position 2011 in SEQ ID NO: 1; position 1958 in SEQ ID NO: 2; position 15 in SEQ ID NO: 4; 63978638 in the NC--000014 genomic sequence corresponding to position 53753 in SEQ ID NO: 5) and the Arg653Gln polymorphism in the amino acid sequence (SEQ ID NOS: 1 and 3). This polymorphism is given the designation rs2236225 in the SNP database of NCBI's Entrez. Allele frequencies for the MTHFD1 1958G>A polymorphic variant are as follows:
TABLE-US-00001 Geographical/Ethnic Populations A Allele Frequency Ireland 0.45 The Netherlands 0.45 Germany 0.40 Italy 0.45 Turkey 0.45 Africa 0.16 Israel 0.47 Pakistan 0.50 Northern China 0.24 Mexico 0.61 Brazil 0.79 [Brody et al., Am. J. Hum. Genet., 71: 1207-1215 (2002); Hol et al., Clin. Genet. 53: 119-125 (1998); Akar & Akar, Acta Haematol., 102: 199-200 (1999); Konrad et al., J. Neurol., 251: 1242-1248 (2004); Cheng et al., Biomed. Environ. Sci., 18: 58-64 (2005); Shi et al., Birth Defects Res. Part A, 67: 545-549 (2003); DeMarco et al., 48th Annual Meeting of the Society for Research into Hydrocephalus and Spina Bifida, Dublin 23-26 Jun. 2004.]
[0051] MTHFD1L
[0052] Methylenetetrahydrofolate dehydrogenase (NADP+ dependent) 1-like (MTHFD1L) is a trifunctional enzyme localized to mitochondria with enzymtatic activity similar to MTHFD1, sharing at least one enzymatic activity with that enzyme. MTHFD1L has the further aliases HGNC:21055, DKFZp586G1517, FLJ21145, FTHFSDC1, dJ292B18.2, and further designations RP1-292B18.2; formyltetrahydrofolate synthetase domain containing 1; mitochondrial C1-tetrahydrofolate synthase; mitochondrial C1-tetrahydrofolate synthetase. MTHFD1L has been assigned Gene ID 25902, and is positioned on chromosome 6 at locus 6q25.1. Further information for MTHFD1L is found on the NCBI website in the Entrez Gene database.
[0053] MTHFD1L nucleic acid and amino acid sequences relevant to the invention include genomic, cDNA, and fragments thereof. The particular sequences identified herein by sequence identification number and/or accession number are representative of MTHFD1L sequences. One of skill in the art can appreciate that there can be variability in the gene or gene fragment distinct from the polymorphism(s) of interest and that such allelic variants still fall within the scope of the invention. As the polymorphism will be reflected in both strands of the DNA, the screening in the context of the invention can involve one or both of the strand sequences. Accordingly, where the sequence for a given strand is provided, the invention also includes the use of its complement.
[0054] The following are representative sequences for MTHFD1 L. NM0015440 includes coding nucleic acid sequence of MTHFD1L (SEQ ID NOS: 6 and 7, with SEQ ID NO: 7 providing the nucleic acid sequence of the coding region) and also provides the translation of the coding region (SEQ ID NO: 8). These sequences correspond to a 3.6 kb transcript. AY374131 includes coding nucleic acid sequence of MTHFD1L (SEQ ID NOS: 9 and 10, with SEQ ID NO: 10 providing the nucleic acid sequence of the coding region) and amino acid sequence (SEQ ID NO: 11) for a 1.1 kb transcript of MTHFD1L. Other relevant sequence information includes NP056255; AA478842; AL117452; AV704883; BE735249; BQ062382; AL035086; CAI42788; CAI42793; CAI42794; CAI42795; AL133260; CAC03667; AA478842; AB127387; BAD93193; AK024798; BAB15009; AK127089; AL117452; CAB55934; AV704883; AY374130; AAQ82696; AAQ82697; BC008629; AAH08629; BC017477; AAH17477; BE735249; BQ062382. Screening with a fragment of at least 30 nucleic acids is within the scope of the invention, however, smaller fragments are also possible provided that they comprise the relevant polymorphism(s) and provide a sequence unique in the human genome. The genomic sequence is provided in SEQ ID NO: 12 and corresponds to positions 151278805 and 151515137 inclusive in NC--000006. Screening with a genomic fragment of at least 30 nucleic acids is within the scope of the invention, however, smaller fragments are also possible provided that they comprise the relevant polymorphism(s) and provide a sequence unique in the human genome. An example of such a fragment is provided in SEQ ID NO: 13. SEQ ID NOS: 12 and 13 indicate the variability corresponding to the "ATT" short tandem repeat polymorphism at starting at position 151312078 in the source genomic sequence (position 33274 of SEQ ID NO: 12 and position 5 of SEQ ID NO: 13). This polymorphism is given the designation rs3832406 in the SNP database of NCBI's Entrez, and also corresponds to position 55374834 in NT--025741. As this polymorphism is located in an intron, the polymorphism is not designated in SEQ ID NOS: 6-11. In some embodiments, the relevant polymorphism has an effect on splicing, and accordingly an effect on the transcription and amino acid sequence encoded by the same.
[0055] Allele frequencies for the MTHFD1L rs3832406 "ATT" Intron 7 tandem repeat are as follows with Allele 1 comprising "ATT" repeated seven times, Allele 2 comprising "ATT" repeated eight times, and Allele 3 comprising "ATT" repeated nine times.
TABLE-US-00002 Geographical/Ethnic Population Allele 1 Allele 2 Allele 3 Ireland 0.64 0.21 0.15
[0056] The MTHFD1L gene produces two mRNA transcripts. The shorter one originates from the use of an alternative exon 8A that may be derived from an Alu element. Although not wishing to be bound by any particular theory, it appears that these alleles affect how efficiently alternative exon 8A is used. Any putative effect is relevant to folate metabolism since alternative exon 8A produces a premature stop codon that translates into a protein product that lacks a synthetase domain.
Other Diagnostic Genes and Polymorphic Variants
[0057] Polymorphic variants to be screened for are principally located in or in close proximity to the MTHFD1 and/or MTHFD1L genes. Representative, polymorphic variants that can be tested for in addition to MTHFD1 and/or MTHFD1 variant(s), include those associated with following described genes without limitation to polymorphism or gene. In some embodiments, the screened for polymorphic variants are correlated with the same disease. In some embodiments, the screened for polymorphic variants are correlated with different diseases.
[0058] MTHFR
[0059] 5,10-methylenetetrahydrofolate reductase (NADPH) (MTHFR) is an enzyme involved in one-carbon metabolic pathways such as folate-dependent one-carbon pathways. MTHFD1 has the further alias HGNC:7436. MTHFR has the further designations methylenetetrahydrofolate reductase; methylenetetrahydrofolate reductase intermediate form. MTHFR has been assigned Gene ID 4524, and is positioned on chromosome 1 at locus 106.3. Further information for MTHFR is found on the NCBI website in the Entrez Gene database and Online Mendelian Inheritance in Man (OMIM) website under entry *607093. Polymorphic variants that can be screened for in addition to one or more of the MTHFD1 and MTHFD1L polymorphic variants relevant to the invention include the polymorphic variant described in the OMIM MTHFR entry *607093 as allelic variant 0.0003 MTHFR 677C>T, Ala222Val. Frosst et al., Mammalian Genome, 7:864-869 (1995), reported the 677C>T mutation in the MTHFR gene, resulting in an Ala222Val substitution. Polymorphic variants that can be screened for in addition to one or more of the MTHFD1 and MTHFD1L polymorphic variants relevant to the invention include the polymorphic variant described in the OMIM MTHFR entry *607093 as allelic variant 0.0004 MTHFR 1298A>C, Glu429. Van der Put et al., Am. J. Hum. Genet., 62:1044-1051 (1998), identified another polymorphism of the MTHFR gene: a 1298A>C mutation resulting in a Glu429Ala substitution.
[0060] MTHFR nucleic acid and amino acid sequences relevant to the invention include both genomic, cDNA, and fragments thereof. The particular sequences identified herein by sequence identification number and/or accession number are representative of MTHFR sequences. One of skill in the art can appreciate that there can be variability in the gene or gene fragment distinct from the polymorphism(s) of interest and that such allelic variants still fall within the scope of the invention. As the polymorphism will be reflected in both strands of the DNA, the screening in the context of the invention can involve one or both of the strand sequences. Accordingly, where the sequence for a given strand is provided, the invention also includes the use of its complement.
[0061] The following are representative sequences for MTHFR. NM005957 includes coding nucleic acid sequence of MTHFR and also provides the translation of the coding region. Other relevant sequence information includes AF105977; AAD17965; AF105978; AAD17965; AF105979; AAD17965; AF105980; AAD17965; AF105981; AAD17965; AF105982; AAD17965; AF105983; AAD17965; AF105984; AAD17965; AF105985; AAD17965; AF105986; AAD17965; AF105987; AAD17965; AF398930; AAN40863; AAN40864; AAN40865; AJ249275; CAB81551; CAB81552; AL953897; CAI15885; CAI15886; CAI15887; CAI15888; CAI15889; AY338232; AAP88033; AB209113; BAD92350; AJ237672; CAB41971; AY046560; AAL17646; AY046561; AAL17647; AY046562; AAL17648; AY046563; AAL17649; AY046564; AAL17650; AY046565; AAL17651; BC011614; BC018766; AAH18766; BC053509; AAH53509; P42898. Screening with a fragment of at least 30 nucleic acids is within the scope of the invention, however, smaller fragments are also possible provided that they comprise the relevant polymorphism(s) and provide a sequence unique in the human genome. The genomic sequence corresponds to positions 11780945 and 11800248 inclusive in NC--000001. Screening with a genomic fragments of at least 30 nucleic acids are within the scope of the invention, however, smaller fragments are also possible provided that they comprise the relevant polymorphism(s) and provide a sequence unique in the human genome. The variability corresponding to the 677C>T polymorphism occurs at position 11790510 in the genomic sequence. The variability corresponding to the 1298A>C polymorphism occurs at position 11792412 in the genomic sequence.
[0062] Allelic frequencies for MTHFR 677C>T are as follows:
TABLE-US-00003 Geographical/Ethnic T Allele Geographical/Ethnic T Allele Populations Frequency Populations Frequency Ireland 0.29 Ashkenazi Jewish 0.48 Spain 0.34 Northern Han Chinese 0.44 France 0.36 Southern Han Chinese 0.34 Germany 0.29 Australian white 0.29 The Netherlands 0.27 Mexico 0.57 Russia 0.27 African Americans 0.12 Italy 0.41 US Caucasians 0.32 Southern Italy 0.46 US Hispanics 0.45 Israel 0.26 US Asian 0.21 Canadian White 0.25 [Wilcken et al., J. Med. Genet., 40: 619-625 (2003); Rady et al., Am. J. Med. Genet., 107: 162-168 (2002); Kirke et al., BMJ, 328: 1535-1536 (2004); Konrad et al., J Neurol., 251: 1242-1248 (2004).]
[0063] Factor II
[0064] Coagulation factor II (F2) is a factor that is cleaved from prothrombin to thrombin in the blood clotting cascade. F2 has the further aliases HGNC:3535 and PT. F2 has the further designations prothrombin; prothrombin B-chain; serine protease. F2 has been assigned Gene ID 2147, and is positioned on chromosome 11 at locus 11p11-q12. Further information for F2 is found on the NCBI website in the Entrez Gene database and Online Mendelian Inheritance in Man (OMIM) website under entry +176930. Polymorphic variants that can be screened for in addition to one or more of the MTHFD1 and MTHFD1L polymorphic variants relevant to the invention include the polymorphic variant described in the OMIM Factor V Deficiency +176930 entry as allelic variant 0.0009; 20210G>A. Poort et al., Blood, 88:3698-3703 (1996), described this common genetic variation in the 3-prime untranslated region of the gene that is associated with elevated plasma prothrombin levels and an increased risk of venous thrombosis: a G-to-A transition at position 20210, see Degen and Davie, Biochemistry, 26:6165-6177 (1987).
[0065] F2 nucleic acid and amino acid sequences relevant to the invention include both genomic, cDNA, and fragments thereof. The particular sequences identified herein by sequence identification number and/or accession number are representative of F2 sequences. One of skill in the art can appreciate that there can be variability in the gene or gene fragment distinct from the polymorphism(s) of interest and that such allelic variants still fall within the scope of the invention. As the polymorphism will be reflected in both strands of the DNA, the screening in the context of the invention can involve one or both of the strand sequences. Accordingly, where the sequence for a given strand is provided, the invention also includes the use of its complement.
[0066] The following are representative sequences for F2. NM000506 includes coding nucleic acid sequence of F2 and also provides the translation of the coding region. Other relevant sequence information includes M17262,V00595, AF478696; AAL77436; AF493953; AAM11680; AJ544114; CAD80258; M17262; AAC63054; 550162; AAB24476; AY344793; AAR08142; AY344794; AAR08143; BCO51332; AAH51332; M33031; AAA60220; V00595; CAA23842; P00734. Screening with a fragment of at least 30 nucleic acids is within the scope of the invention, however, smaller fragments are also possible provided that they comprise the relevant polymorphism(s) and provide a sequence unique in the human genome. The genomic sequence corresponds to positions 46697331 and 46717631 inclusive in NC--000011. Screening with a genomic fragment of at least 30 nucleic acids is within the scope of the invention, however, smaller fragments are also possible provided that they comprise the relevant polymorphism(s) and provide a sequence unique in the human genome. This polymorphism is provided in the SNP database of NCBI's Entrez.
[0067] Factor V
[0068] Coagulation factor V (proaccelerin, labile factor) (F5) is a factor in the blood clotting cascade. F5 has the further aliases HGNC:3542, FVL, PCCF, factor V. F5 has the further designations activated protein c cofactor; coagulation factor V; coagulation factor V jinjiang A2 domain; factor V Leiden; labile factor. F5 has been assigned Gene ID 2153, and is positioned on chromosome 1 at locus 1q23. Further information for F5 is found on the NCBI website in the Entrez Gene database and Online Mendelian Inheritance in Man (OMIM) website under entry +227400. Polymorphic variants that can be screened for in addition to one or more of the MTHFD1 and MTHFD1 L polymorphic variants relevant to the invention include the polymorphic variant described in the OMIM Factor V Deficiency +227400 entry as allelic variant 0.0001, Arg506Gln, 1691G>A, "Factor V Leiden." The Factor V Leiden polymorphic variant was reported by Bertina et al., Nature, 369:64-67 (1994).
[0069] F5 nucleic acid and amino acid sequences relevant to the invention include both genomic, cDNA, and fragments thereof. The particular sequences identified herein by sequence identification number and/or accession number are representative of F5 sequences. One of skill in the art can appreciate that there can be variability in the gene or gene fragment distinct from the polymorphism(s) of interest and that such allelic variants still fall within the scope of the invention. As the polymorphism will be reflected in both strands of the DNA, the screening in the context of the invention can involve one or both of the strand sequences. Accordingly, where the sequence for a given strand is provided, the invention also includes the use of its complement.
[0070] The following are representative sequences for F5. NM000130 includes coding nucleic acid sequence for F5 and also provides the translation of the coding region. Other relevant sequence information includes AH005274, M14335, AF119360; AAF32515; AF285083; AAG30113; AY046060; AAL09164; AY136818; AAN12307; AY364535; AAQ55063; L32755; AAB59401; L32779; AAB59401; Z99572; CAB16748; CAI23065; AJ297254; CAC82572; AJ297255; CAC82573; M14335; AAB59532; M16967; AAA52424; M94010; AAA52416; P12259. Screening with a fragment of at least 30 nucleic acids is within the scope of the invention, however, smaller fragments are also possible provided that they comprise the relevant polymorphism(s) and provide a sequence unique in the human genome. The genomic sequence corresponds to positions 166287379 and 166215067 inclusive in NC--000001. Screening with a genomic fragment of at least 30 nucleic acids is within the scope of the invention, however, smaller fragments are also possible provided that they comprise the relevant polymorphism(s) and provide a sequence unique in the human genome.
[0071] TCNII
[0072] Transcobalamin II (TCNII) is a Vitamin B12 binding protein. TCNII has the further aliases HGNC:11653, D22S676, D22S750, and TC2. TCNII has been assigned Gene ID 6948, and is positioned on chromosome 22 at locus 22q12.2. Further information for TCNII is found on the NCBI website in the Entrez Gene database and Online Mendelian Inheritance in Man (OMIM) website under entry +275350.
[0073] TCNII nucleic acid and amino acid sequences relevant to the invention include both genomic, cDNA, fragments, and products thereof. The particular sequences identified herein by sequence identification number and/or accession number are representative of TCNII sequences. One of skill in the art can appreciate that there can be variability in the gene or gene fragment distinct from the polymorphism(s) of interest and that such allelic variants still fall within the scope of the invention. As the polymorphism will be reflected in both strands of the DNA, the screening in the context of the invention can involve one or both of the strand sequences. Accordingly, where the sequence for a given strand is provided, the invention also includes the use of its complement.
[0074] The following are representative sequences for TCNII. NM000355 includes coding nucleic acid sequence for TCNII (SEQ ID NOS: 14 and 15, with SEQ ID NO: 15 providing the nucleic acid sequence of the coding region) and also provides the translation of the coding region (SEQ ID NO: 16). Other relevant sequence information includes AF047576; AAC05491; AF076647; AAG24506; BC001176; AAH01176; BC011239; AAH11239; CR456591; CAG30477; L02647; AAA61056; L02648; AAA61057; M60396; AAA61054; P20062; AAB25526. Screening with a fragment of at least 30 nucleic acids is within the scope of the invention, however, smaller fragments are also possible provided that they comprise the relevant polymorphism(s) and provide a sequence unique in the human genome. An example of such a fragment is provided in SEQ ID NO: 17. The genomic sequence is provided in SEQ ID NO: 18 and corresponds to positions 29327715 and 29347601 inclusive in NC--000022. Screening with a genomic fragment of at least 30 nucleic acids is within the scope of the invention, however, smaller fragments are also possible provided that they comprise the relevant polymorphism(s) and provide a sequence unique in the human genome. An example of such a fragment is also provided in SEQ ID NO: 17. SEQ ID NOS: 14-18 indicate the variability corresponding to the 776C>G polymorphism (at the nucleotide level: position 934 in SEQ ID NO: 14; position 776 in SEQ ID NO: 15; position 16 in SEQ ID NO: 17; position 8450 in SEQ ID NO: 18--position 29336164 in the source genomic sequence) and the Pro259Arg polymorphism in the amino acid sequence (SEQ ID NOS: 14 and 16). This polymorphism is given the designation rs1801198 in the SNP database of NCBI's Entrez.
[0075] The invention also includes use of other polymorphic variants of the genes and proteins described herein. Use of both the nucleic acids described herein and their complements are within the scope of the invention. In connection with the provision and description of nucleic acid sequences, the references herein to gene names and to GenBank and OMIM reference numbers provide the relevant sequences, recognizing that the described sequences will, in most cases, also have other corresponding allelic variants. Although the referenced sequences may contain sequencing error, such error does not interfere with identification of a relevant gene or portion of a gene, and can be readily corrected by redundant sequencing of the relevant sequence (preferably using both strands of DNA). Nucleic acid molecules or sequences can be readily obtained or determined utilizing the reference sequences. Molecules such as nucleic acid hybridization probes and amplification primers can be provided and are described by the selected portion of the reference sequence with correction if appropriate. In some embodiments, probes comprise 5, 6, 10, 12, 13, 14, 15, 16, 17, 18, 19, 20, 23, 25, 27, 30, 35, 40, 45, 50, or more nucleotides.
Diagnosis
[0076] The terms "disease" or "condition" are commonly recognized in the art and designate the presence of signs and/or symptoms in an individual or patient that are generally recognized as abnormal. Unless indicated as otherwise, the terms "disease," "disease state," condition," "disorder," and "complication" can be used interchangeably. Diseases or conditions can be diagnosed and categorized based on pathological changes. Signs can include any objective evidence of a disease such as changes that are evident by physical examination of a patient or the results of diagnostic tests which may include, among others, laboratory tests to determine the presence of polymorphic variants or variant forms of certain genes in a patient. Symptoms can include a patient's perception of an abnormal condition that differs from normal function, sensation, or appearance, which may include, for example, physical disabilities, morbidity, pain, and other changes from the normal condition experienced by an individual. Various diseases or conditions include, but are not limited to, those categorized in medical texts.
[0077] Unless otherwise indicated, the term "suffering from a disease or condition" can refer to a person that currently has signs and symptoms, or is more likely to develop such signs and symptoms than a normal person in the population. For example, a person suffering from a condition can include a developing fetus, a person subject to a treatment or environmental condition that enhances the likelihood of developing the signs or symptoms of a condition, or a person who is being given or will be given a treatment that increases the likelihood of the person developing a particular condition. Methods of the invention relating to treatments of patients can include primary treatments directed to a presently active disease or condition, secondary treatments that are intended to cause a biological effect relevant to a primary treatment, and prophylactic treatments intended to delay, reduce, or prevent the development of a disease or condition, as well as treatments intended to cause the development of a condition different from that which would have been likely to develop in the absence of the treatment.
[0078] Combined detection of several such polymorphic variants typically increases the probability of an accurate diagnosis. Analysis of the polymorphisms of the invention can be combined with that of other polymorphisms or other risk factors such as family history. Polymorphisms can be used to diagnose a disease at the pre-symptomatic stage, as a method of post-symptomatic diagnosis, as a method of confirmation of diagnosis or as a post-mortem diagnosis. Ethical issues to be considered in screening and diagnosis are discussed generally in Reich, et al., Genet. Med., 5:133-143 (2003).
[0079] Pregnancy-Related Complications
[0080] Pregnancy-related complications include not just complications that occur during the course of pregnancy, but also include infertility complications. That is, a pregnancy-related complication can also, or in the alternative, involve a complication that prevents pregnancy from occurring or diminishes the probability that pregnancy will occur. Accordingly, the polymorphic variants relevant to the invention can be correlated with infertility. Particular pregnancy-related complications are described as follows without limitation to other relevant pregnancy-related complications correlating with one or more polymorphic variants relevant to the invention. Screening for polymorphic variants in the context of pregnancy-related complications can include screening of the mother as well as the father and the unborn child(ren). Both males and females can be screened using the methods of the invention. A female screened may be of any age, born or unborn, and need not be pregnant when screened. In some embodiments, the subject screened is female and has had complications becoming pregnant, which can include any number of different infertility factors. In some embodiments, the woman screened has been pregnancy previously but has suffered complications during pregnancy. In some embodiments, the woman screened is pregnant, but is not carrying an embryo or fetus with a neural tube defect. The sample screened from any subject may be derived from any number of different sources such as cells, tissues, and organs. In some embodiments, the sample comprises blood. In some embodiments, the sample comprises an egg and/or sperm. In some embodiments, the sample screened comprises a somatic cell.
[0081] Placental Abruption
[0082] The diagnosis of abruptio placentae or placental abruption can be based on hemorrhage and accumulation of blood between the placenta and the wall of the uterus. In some embodiments, diagnosis is based on a sudden rupture of the spiral arteries, resulting in the premature separation of a normally implanted placenta. Severe placental abruption is generally characterized by more extensive manifestations of placental abruption and can also comprise worse clinical outcomes such as death of the mother and or children. In some embodiments, severe placental abruption is diagnosis based on a retroplacental clot and/or accidental haemorrhage with associated clinical signs of abruption and/or a statement in the case records that the patient was a definite case of abruptio placentae. Data on gestational age at delivery, maternal hypertension, maternal blood transfusion, and pregnancy outcome can be collected. Control pregnancies can be selected from women with no history of abruptio placentae, and can be matched for the same date and clinic as the cases where the genetically tested blood sample was provided.
[0083] Diagnosis for an increased susceptibility for severe placental abruption is rendered when a particular polymorphic variant that has been correlated with severe placental abruption is identified. In some embodiments, the polymorphic variant is an adenosine at position 1958 of MTHFD1. In some embodiments, the subject tested is homozygous for the 1958A variant, in some embodiments, the subject is heterozygous for the 1958A variant.
[0084] Miscarriage
[0085] Miscarriage is the loss of one or more children before birth. In some embodiments, the miscarriage occurs in the second trimester. In other embodiments, the miscarriage occurs in the first or third trimester. In some embodiments, the miscarriage has no clinical explanation. A miscarriage can comprise a spontaneous abortion and/or fetal death.
[0086] Diagnosis for an increased susceptibility for miscarriage is rendered when a particular polymorphic variant that has been correlated with miscarriage is identified. This correlation can be with first, second, and/or third trimester miscarriage. In some embodiments, the correlation is with unexplained second trimester miscarriage. In some embodiments, the polymorphic variant is an adenosine at position 1958 of MTHFD1. In some embodiments, the subject tested is homozygous for the 1958A variant, in some embodiments, the subject is heterozygous for the 1958A variant.
[0087] Neural Tube Defects
[0088] Neural tube defects include, for example, anencephaly, encephalocele, iniencephaly, and spina bifida, and are diagnosed by symptoms commonly accepted in medical field. Diagnosis for an increased susceptibility for a neural tube defect is rendered when a particular polymorphic variant that has been correlated with a neural tube defect is identified. In some embodiments, the increased susceptibility for a neural tube defect is rendered when a 7-repeat variant of the MTHFD1L ATT polymorphism rs3832406 (position 55374834 in NT 025741) is identified. In some embodiments, the subject tested is homozygous for the 7-repeat ATT polymorphism, in some embodiments the subject is heterozygous for the 7-repeat ATT polymorphism. In some embodiments, one or two copies of a 8-repeat repeat variant of the MTHFD1L polymorphism rs3832406, wherein the 8-repeat variant is correlated with a protective effect, that is a decreased susceptibility for a NTD. In some embodiments, diagnosis is based not only on a polymorphic variant in the MTHFD1L gene, but also with the MTHFD1 1958A variant. In some embodiments, the subject tested is homozygous for the 1958A variant, in some embodiments, the subject is heterozygous for the 1958A variant.
[0089] Neoplastic Diseases
[0090] Diagnosis for an increased susceptibility for a drug dosage complication can be rendered based on a polymorphic variant in MTHFD1L that has been correlated with such a complication. In some embodiments, the increased susceptibility for a neural tube defect is rendered when a 7-repeat variant of the MTHFD1L ATT polymorphism rs3832406 (position 55374834 in NT--025741) is identified. In some embodiments, the subject tested is homozygous for the 7-repeat ATT polymorphism, in some embodiments the subject is heterozygous for the 7-repeat ATT polymorphism.
[0091] As used herein, the term "cancer" is meant any malignant growth or tumor caused by abnormal and uncontrolled cell division that may spread to other parts of the body through the lymphatic system or the blood stream. The cancer can be, for example, breast cancer, prostate cancer, lung cancer, colon cancer, rectal cancer, urinary bladder cancer, non-Hodgkin lymphoma, melanoma, renal cancer, pancreatic cancer, cancer of the oral cavity, pharynx cancer, ovarian cancer, thyroid cancer, stomach cancer, brain cancer, multiple myeloma, esophageal cancer, liver cancer, cervical cancer, larynx cancer, cancer of the intrahepatic bile duct, acute myeloid leukemia, soft tissue cancer, small intestine cancer, testicular cancer, chronic lymphocytic leukemia, Hodgkin lymphoma, chronic myeloid cancer, acute lymphocytic cancer, cancer of the anus, anal canal, or anorectum, cancer of the vulva or cancer of the neck, gallbladder, pleura, malignant mesothelioma, bone cancer, cancer of the joints, hypopharynx cancer, cancer of the eye, cancer of the nose, nasal cavity, neck, or middle ear, nasopharynx cancer, ureter cancer, peritoneum, omentum, or mesentery cancer, or gastrointestinal carcinoid tumor.
[0092] Those skilled in the art will understand whether the polymorphic variants or gene forms in normal or disease cells are most indicative of the expected treatment response, and will generally utilize a diagnostic test with respect to the appropriate cells. Such a cell type indication or suggestion can be contained in a regulatory statement, for example, on a label or in a product insert.
[0093] Alzheimer's Disease
[0094] Intermediates and defects in one carbon metabolic pathways have been shown to play a role in Alzheimer's disease. Accordingly, the invention includes methods of and materials for predicting altered susceptibility to Alzheimer's disease and response to Alzheimer's disease therapeutic agents based on correlation with the polymorphic variants discussed herein. The invention is also relevant to other central nervous system (CNS) diseases and therapeutic agents.
[0095] Cardiovascular Disease
[0096] Intermediates and defects in one carbon metabolic pathways have been shown to play a role in some cardiovascular diseases. Accordingly, the invention includes methods of and materials for predicting altered susceptibility to cardiovascular disease and response to cardiovascular disease therapeutic agents based on correlation with the polymorphic variants discussed herein.
Detection Probes
[0097] The detection of the presence or absence of a polymorphic variant can involve contacting a nucleic acid sequence corresponding to one of the genes identified above or a product of such a gene with a probe. The probe is able to distinguish a particular form of the gene, gene product, polymorphic variant allele product, or allele product, or the presence or a particular polymorphic variant or polymorphic variants, for example, by differential binding or hybridization. The term "probe" refers to a molecule that can detectably distinguish between target molecules differing in structure. Detection can be accomplished in a variety of different ways depending on the type of probe used and the type of target molecule. Thus, for example, detection may be based on discrimination of activity levels of the target molecule, but preferably is based on detection of specific binding. Examples of such specific binding include antibody binding and nucleic acid probe hybridization. Probes can comprise one or more of the following, a protein, carbohydrate, polymer, or small molecule, that is capable of binding to one polymorphic variant or variant form of the gene or gene product to a greater extent than to a form of the gene having a different base at one or more polymorphic variant sites, such that the presence of the polymorphic variant or variant form of the gene can be determined. A probe can incorporate one or more markers including, but not limited to, radioactive labels, such as radionuclides, fluorophores or fluorochromes, peptides, enzymes, antigens, antibodies, vitamins or steroids. A probe can distinguish at least one of the polymeric variant described herein. The probe can also have specificity for the particular gene or gene product, at least to an extent such that binding to other genes or gene products does not prevent use of the assay to identify the presence or absence of the particular polymorphic variant or polymorphic variants of interest.
[0098] Nucleic Acids
[0099] The nucleic acid molecules relevant to the invention can readily be obtained in a variety of ways, including, without limitation, chemical synthesis, cDNA or genomic library screening, expression library screening, and/or PCR amplification of cDNA. These methods and others useful for isolating such DNA are set forth, for example, by Sambrook, et al., "Molecular Cloning: A Laboratory Manual," Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (1989); by Ausubel, et al., eds., "Current Protocols In Molecular Biology," Current Protocols Press (1994), and by Berger and Kimmel, "Methods In Enzymology: Guide To Molecular Cloning Techniques," vol. 152, Academic Press, Inc., San Diego, Calif. (1987). Nucleic acid sequences are mammalian sequences. In some embodiments, the nucleic acid sequences are human, rat, and mouse.
[0100] Chemical synthesis of a nucleic acid molecule can be accomplished using methods well known in the art, such as those set forth by Engels et al., Angew. Chem. Intl. Ed., 28:716-734 (1989). These methods include, inter alia, the phosphotriester, phosphoramidite and H-phosphonate methods of nucleic acid synthesis. Nucleic acids larger than about 100 nucleotides in length can be synthesized as several fragments, each fragment being up to about 100 nucleotides in length. The fragments can then be ligated together to form a full length nucleic acid encoding the polypeptide. A preferred method is polymer-supported synthesis using standard phosphoramidite chemistry.
[0101] Alternatively, the nucleic acid may be obtained by screening an appropriate cDNA library prepared from one or more tissue source(s) that express the polypeptide, or a genomic library from any subspecies. The source of the genomic library may be any tissue or tissues from any mammalian or other species believed to harbor a gene encoding a protein relevant to the invention. The library can be screened for the presence of a cDNA/gene using one or more nucleic acid probes (oligonucleotides, cDNA or genomic DNA fragments that possess an acceptable level of homology to the gene or gene homologue cDNA or gene to be cloned) that will hybridize selectively with the gene or gene homologue cDNA(s) or gene(s) that is(are) present in the library. The probes preferably are complementary to or encode a small region of the DNA sequence from the same or a similar species as the species from which the library was prepared. Alternatively, the probes may be degenerate, as discussed below. After hybridization, the blot containing the library is washed at a suitable stringency, depending on several factors such as probe size, expected homology of probe to clone, type of library being screened, number of clones being screened, and the like. Stringent washing solutions are usually low in ionic strength and are used at relatively high temperatures.
[0102] Another suitable method for obtaining a nucleic acid in accordance with the invention is the polymerase chain reaction (PCR). In this method, poly(A)+RNA or total RNA is extracted from a tissue that expresses the gene product. cDNA is then prepared from the RNA using the enzyme reverse transcriptase. Two primers typically complementary to two separate regions of the cDNA (oligonucleotides) are then added to the cDNA along with a polymerase such as Taq polymerase, and the polymerase amplifies the cDNA region between the two primers.
[0103] The invention provides for the use of isolated, purified or enriched nucleic acid sequences of 15 to 500 nucleotides in length, 15 to 100 nucleotides in length, 15 to 50 nucleotides in length, and 15 to 30 nucleotides in length, which have sequence that corresponds to a portion of one of the genes identified for aspects above. In some embodiments the nucleic acid is at least 17, 20, 22, or 25 nucleotides in length. In some embodiments, the nucleic acid sequence is 30 to 300 nucleotides in length, or 45 to 200 nucleotides in length, or 45 to 100 nucleotides in length. In some embodiments, the probe is a nucleic acid probe at least 15, 17, 20, 22, 25, 30, 35, 40, or more nucleotides in length, or 500, 250, 200, 100, 50, 40, 30 or fewer nucleotides in length. In preferred embodiments, the probe has a length in a range from any one of the above lengths to any other of the above lengths including endpoints. The nucleic acid sequence includes at least one polymorphic variant site. Such sequences can, for example, be amplification products of a sequence that spans or includes a polymorphic variant site in a gene identified herein. A nucleic acid with such a sequence can be utilized as a primer or amplification oligonucleotide that is able to bind to or extend through a polymorphic variant site in such a gene. Another example is a nucleic acid hybridization probe comprised of such a sequence. In such probes, primers, and amplification products, the nucleotide sequence can contain a sequence or site corresponding to a polymorphic variant site or sites, for example, a polymorphic variant site identified herein. The design and use of allele-specific probes for analyzing polymorphisms is known generally in the art, see, for example, Saiki et al., Nature, 324:163-166 (1986); Dattagupta, EP 235,726, Saiki, WO 89/11548. Allele-specific probes can be designed that hybridize to a segment of target DNA from one individual but do not hybridize to the corresponding segment from another individual due to the presence of different polymorphic forms in the respective segments from the two individuals. A nucleic acid hybridization probe may span two or more polymorphic variant sites. Unless otherwise specified, a nucleic acid probe can include one or more nucleic acid analogs, labels or other substituents or moieties so long as the base-pairing function is retained. The nucleic acid sequence includes at least one polymorphic variant site. The probe may also comprise a detectable label, such as a radioactive or fluorescent label. A variety of other detectable labels are known to those skilled in the art. Nucleic acid probe can also include one or more nucleic acid analogs.
[0104] In connection with nucleic acid probe hybridization, the term "specifically hybridizes" indicates that the probe hybridizes to a sufficiently greater degree to the target sequence than to a sequence having a mismatched base at least one polymorphic variant site to allow distinguishing of such hybridization. The term "specifically hybridizes" means that the probe hybridizes to the target sequence, and not to non-target sequences, at a level which allows ready identification of probe/target sequence hybridization under selective hybridization conditions. "Selective hybridization conditions" refer to conditions that allow such differential binding. Similarly, the terms "specifically binds" and "selective binding conditions" refer to such differential binding of any type of probe, and to the conditions that allow such differential binding. Hybridization reactions to determine the status of variant sites in patient samples can be carried out with two different probes, one specific for each of the possible variant nucleotides. The complementary information derived from the two separate hybridization reactions is useful in corroborating the results.
[0105] A variety of variables can be adjusted to optimize the discrimination between two variant forms of a gene, including changes in salt concentration, temperature, pH and addition of various compounds that affect the differential affinity of GC vs. AT base pairs, such as tetramethyl ammonium chloride. [See, Current Protocols in Molecular Biology, Ausubel et al. (Editors), John Wiley & Sons.] Hybridization conditions should be sufficiently stringent such that there is a significant difference in hybridization intensity between alleles, and preferably an essentially binary response, whereby a probe hybridizes to only one of the alleles. Hybridizations are usually performed under stringent conditions that allow for specific binding between an oligonucleotide and a target nucleic acid containing one of the polymorphic sites described herein or identified using the techniques described herein. Stringent conditions are defined as any suitable buffer concentrations and temperatures that allow specific hybridization of the oligonucleotide to highly homologous sequences spanning at least one polymorphic site and any washing conditions that remove non-specific binding of the oligonucleotide. For example, conditions of 5×SSPE (750 mM NaCl, 50 mM Na Phosphate, 5 mM EDTA, pH 7.4) and a temperature of 25-30° C. are suitable for allele-specific probe hybridizations. The washing conditions usually range from room temperature to 60° C. Some probes are designed to hybridize to a segment of target DNA such that the polymorphic site aligns with a central position of the probe. This probe design achieves good discrimination in hybridization between different allelic forms.
[0106] Allele-specific probes are can be used in pairs, one member of a pair showing a perfect match to a reference form of a target sequence and the other member showing a perfect match to a variant form. Several pairs of probes can then be immobilized on the same support for simultaneous analysis of multiple polymorphisms within the same target sequence. The polymorphisms can also be identified by hybridization to nucleic acid arrays, some examples of which are described by WO 95/11995. Arrays may be provided in the form of a multiplex chip.
[0107] One use of probe(s) is as a primer(s) that hybridizes to a nucleic acid sequence containing at least one sequence polymorphic variant. Preferably such primers hybridize to a sequence not more than 300 nucleotides, more preferably not more than 200 nucleotides, still more preferably not more than 100 nucleotides, and most preferably not more than 50 nucleotides away from a polymorphic variant site which is to be analyzed. Preferably, a primer is 100 nucleotides or fewer in length, more preferably 50 nucleotides or fewer, still more preferable 30 nucleotides or fewer, and most preferably 20 or fewer nucleotides in length. In some embodiments, the set includes primers or amplification oligonucleotides adapted to bind to or extend through a plurality of sequence polymorphic variants in a gene(s) identified herein. In some embodiments, the plurality of polymorphic variants comprises a haplotype. In certain embodiments, the oligonucleotides are designed and selected to provide polymorphic variant-specific amplification.
[0108] Proteins and Expression of Nucleic Acids
[0109] Polymorphic variant alleles or fragments thereof can be expressed in an expression vector in which a variant gene is operably linked to a native or other promoter. Usually, the promoter is a eukaryotic promoter for expression in a mammalian cell. The transcription regulation sequences typically include a heterologous promoter and optionally an enhancer that is recognized by the host. The selection of an appropriate promoter, for example trp, lac, phage promoters, glycolytic enzyme promoters and tRNA promoters, depends on the host selected. Commercially available expression vectors can be used. Vectors can include host-recognized replication systems, amplifiable genes, selectable markers, host sequences useful for insertion into the host genome, and the like.
[0110] The means of introducing the expression construct into a host cell varies depending upon the particular construction and the target host. Suitable means include fusion, conjugation, transfection, transduction, electroporation or injection, as described in Sambrook, supra. A wide variety of host cells can be employed for expression of the variant gene, both prokaryotic and eukaryotic. Suitable host cells include bacteria such as E. coli, yeast, filamentous fungi, insect cells, mammalian cells, typically immortalized, e.g., mouse, CHO, human and monkey cell lines and derivatives thereof. Host cells can be selected to process the variant gene product to produce an appropriate mature polypeptide. Processing includes glycosylation, ubiquitination, disulfide bond formation, and general post-translational modification.
[0111] The protein can be isolated by conventional means of protein biochemistry and purification to obtain a substantially pure product, i.e., 80, 95 or 99% free of cell component contaminants, as described in Jacoby, Methods in Enzymology Volume 104, Academic Press, New York (1984); Scopes, Protein Purification, Principles and Practice, 2nd Edition, Springer-Verlag, New York (1987); and Deutscher (ed), Guide to Protein Purification, Methods in Enzymology, Vol. 182 (1990). If the protein is secreted, it can be isolated from the supernatant in which the host cell is grown. If not secreted, the protein can be isolated from a lysate of the host cells.
[0112] In addition to substantially full-length polypeptides expressed by variant genes, the invention includes use of biologically active fragments of the polypeptides, or analogs thereof, including organic molecules that simulate the interactions of the peptides. Biologically active fragments include any portion of the full-length polypeptide that confers a biological function on the variant gene product, including ligand binding and antibody binding. Ligand binding includes binding by nucleic acids, proteins or polypeptides, small biologically active molecules or large cellular structures.
[0113] Antibodies
[0114] Another type of probe is a peptide or protein, for example, an antibody or antibody fragment that specifically or preferentially binds to a polypeptide expressed by a particular form of a gene as characterized by the presence or absence of at least one polymorphic variant. Such antibodies may be polyclonal or monoclonal antibodies, and can be prepared by methods well-known in the art.
[0115] Antibodies can be used to probe for presence of a given polymorphism variant for those polymorphism variants that have an effect on the polypeptide encoded by the gene. For example, an antibody can recognize a change in one or more amino acid residues in the resulting protein. In some embodiments, the antibody is used to recognize polypeptides encoded by differential splice variants. If the polymorphism introduces or eliminates a surface feature of the protein such as a glycosylation site, lipid modification, etc., an antibody can also be used to identify a particular variant.
[0116] Polyclonal and/or monoclonal antibodies and antibody fragments capable of binding to a portion of the gene product relevant for identifying a given polymorphism variant are provided. Antibodies can be made by injecting mice or other animals with the variant gene product or synthetic peptide fragments thereof. Monoclonal antibodies are screened as are described, for example, in Harlow & Lane, Antibodies, A Laboratory Manual, Cold Spring Harbor Press, New York (1988); Goding, Monoclonal antibodies, Principles and Practice (2d ed.) Academic Press, New York (1986). Monoclonal antibodies are tested for specific immunoreactivity with a variant gene product and lack of immunoreactivity to the corresponding prototypical gene product. These antibodies are useful in diagnostic assays for detection of the variant form, or as an active ingredient in a pharmaceutical composition.
[0117] Polyclonal or monoclonal therapeutic antibodies useful in practicing this invention can be prepared in laboratory animals or by recombinant DNA techniques using the following methods. Polyclonal antibodies are raised in animals by multiple subcutaneous (sc) or intraperitoneal (ip) injections of the gene product molecule or fragment thereof in combination with an adjuvant such as Freund's adjuvant (complete or incomplete). To enhance immunogenicity, it may be useful to first conjugate the gene product molecule or a fragment containing the target amino acid sequence to a protein that is immunogenic in the species to be immunized, e.g., keyhole limpet hemocyanin, serum albumin, bovine thyroglobulin, or soybean trypsin inhibitor using a bifunctional or derivatizing agent, for example, maleimidobenzoyl sulfosuccinimide ester (conjugation through cysteine residues), N-hydroxysuccinimide (through lysine residues), glutaraldehyde, succinic anhydride, SOCl, or R1N═C═NR, where R and R1 are different alkyl groups. Alternatively, immunogenic conjugates can be produced recombinantly as fusion proteins.
[0118] Animals are immunized against the immunogenic conjugates or derivatives (such as a fragment containing the target amino acid sequence) by combining about 1 mg or about 1 microgram of conjugate (for rabbits or mice, respectively) with about 3 volumes of Freund's complete adjuvant and injecting the solution intradermally at multiple sites. Approximately 7 to 14 days later, animals are bled and the serum is assayed for antibody titer. Animals are boosted with antigen repeatedly until the titer plateaus. The animal can be boosted with the same molecule or fragment thereof as was used for the initial immunization, but conjugated to a different protein and/or through a different cross-linking agent. In addition, aggregating agents such as alum are used in the injections to enhance the immune response.
[0119] Monoclonal antibodies can be prepared by recovering spleen cells from immunized animals and immortalizing the cells in conventional fashion, e.g. by fusion with myeloma cells. The clones are then screened for those expressing the desired antibody. The monoclonal antibody preferably does not cross-react with other gene products.
[0120] Preparation of antibodies using recombinant DNA methods such as the phagemid display method, may be accomplished using commercially available kits, as for example, the Recombinant Phagemid Antibody System available from Pharmacia (Uppsala, Sweden), or the SurfZAP® phage display system (Stratagene Inc., La Jolla, Calif.).
[0121] Bispecific antibodies that specifically bind to one protein and that specifically bind to other antigens relevant to pathology and/or treatment are produced, isolated, and tested using standard procedures that have been described in the literature. [See, e.g., Pluckthun & Pack, Immunotechnology, 3:83-105 (1997); Carter, et al., J. Hematotherapy, 4:463-470 (1995); Renner & Pfreundschuh, Immunological Reviews, 1995, No. 145, pp. 179-209; Pfreundschuh U.S. Pat. No. 5,643,759; Segal, et al., J. Hematotherapy, 4:377-382 (1995); Segal, et al., Immunobiology, 185:390-402 (1992); and Bolhuis, et al., Cancer Immunol. Immunother., 34: 1-8 (1991).]
Transgenic Animals
[0122] The invention further provides the making and use of transgenic nonhuman animals capable of expressing an exogenous variant gene and/or having one or both alleles of an endogenous variant gene inactivated. Expression of an exogenous variant gene is usually achieved by operably linking the gene to a promoter and optionally an enhancer, and microinjecting the construct into a zygote. [See, Hogan et al., "Manipulating the Mouse Embryo, A Laboratory Manual," Cold Spring Harbor Laboratory.] Inactivation of endogenous variant genes can be achieved by forming a transgene in which a cloned variant gene is inactivated by insertion of a positive selection marker. [See, Capecchi, Science, 244:1288-1292 (1989).] The transgene is then introduced into an embryonic stem cell, where it undergoes homologous recombination with an endogenous variant gene. Mice and other rodents are preferred animals. Such animals provide useful drug screening systems.
[0123] The nucleic acids relevant to the invention can be used to generate genetically modified non-human animals or site specific gene modifications in cell lines. The term "transgenic" is intended to encompass genetically modified animals having a deletion or other knock-out of an endogenous gene, having an exogenous allele that is stably transmitted in the host cells, and/or having an exogenous allele promoter operably linked to a reporter gene. Transgenic animals may be made through homologous recombination, where the allele locus is altered. Alternatively, a nucleic acid construct is randomly integrated into the genome. Vectors for stable integration include plasmids, retroviruses and other animal viruses, YACs, and the like. Transgenic mammals or relevance include cows, pigs, goats, horses, etc., and particularly rodents, e.g. rats, mice, etc.
[0124] Transgenic animals can be made having exogenous genes comprising the polymorphic variants relevant to the invention so as to "humanize" the animal in respect that gene(s), such a process involves deletion of the analogous endogenous gene when appropriate. The exogenous gene is usually either from a different species than the animal host, or is otherwise altered in its coding or non-coding sequence. The introduced gene can be a wild-type gene, naturally occurring polymorphism, or a genetically manipulated sequence, for example those previously described with deletions, substitutions or insertions in the coding or non-coding regions. Where the introduced gene is a coding sequence, it usually operably linked to a promoter, which may be constitutive or inducible, and other regulatory sequences required for expression in the host animal. A detectable marker, such as lac Z can be introduced together with the exogenous gene to demonstrate incorporation of the exogenous gene.
[0125] The modified cells or animals are useful in the study of the physiological effect, if any, of the polymorphic variant. Animals can be used in functional studies, drug screening, etc., for example, to determine the effect of a candidate drug. By providing expression of a polymorphic variant in cells in which it is otherwise not normally produced, one can induce changes in cell behavior. Transgenic animals are also useful as part of a preclinical program.
[0126] DNA constructs for homologous recombination can comprise at least a portion of the polymorphic variant with the desired genetic modification, and can include regions of homology to the target locus. DNA constructs for random integration need not include regions of homology to mediate recombination. Conveniently, markers for positive and negative selection can be included. Methods for generating cells having targeted gene modifications through homologous recombination are known in the art. For various techniques for transfecting mammalian cells, see Keown et al., Methods in Enzymology 185:527-537 (1990).
Screening Techniques for Identifying Polymorphic Variants
[0127] The molecules and probes relevant to the invention can be used in screening techniques. A variety of screening techniques are known in the art for detecting the presence of one or more copies of one or more polymorphic variants in a sample or from a subject. Many of these assays have been reviewed by Landegren et al., Genome Res., 8:769-776, 1998. Determination of polymorphic variants within a particular nucleotide sequence among a population can be determined by any method known in the art, for example and without limitation, direct sequencing, restriction length fragment polymorphism (RFLP), single-strand conformational analysis (SSCA), denaturing gradient gel electrophoresis (DGGE) [see, e.g., Van Orsouw et al., Genet Anal., 14(5-6):205-13 (1999)], heteroduplex analysis (HET) [see, e.g., Ganguly A, et al., Proc. Natl. Acad. Sci. (USA), 90(21):10325-9 (1993)], chemical cleavage analysis (CCM) [see, e.g., Ellis T P, et al., Human Mutation, 11(5):345-53 (1998)] (either enzymatic as with T4 Endonuclease 7, or chemical as with osmium tetroxide and hydroxylamine) and ribonuclease cleavage. Screening for polymorphic variants can be performed when a polymorphic variant is already known to be associated with a particular disease or condition. In some embodiments, the screening is performed in pursuit of identifying one or more polymorphic variants and determining whether they are associated with a particular disease or condition.
[0128] In respect to DNA, polymorphic variant screening can include genomic DNA screening and/or cDNA screening. Genomic polymorphic variant detection can include screening the entire genomic segment spanning the gene from the transcription start site to the polyadenylation site. In some embodiments, genomic polymorphic variant detection can include the exons and some region around them containing the splicing signals, for example, but not all of the intronic sequences. In addition to screening introns and exons for polymorphic variants, regulatory DNA sequences can be screened for polymorphic variants. Promoter, enhancer, silencer and other regulatory elements have been described in human genes. The promoter is generally proximal to the transcription start site, although there may be several promoters and several transcription start sites. Enhancer, silencer and other regulatory elements can be intragenic or can lie outside the introns and exons, possibly at a considerable distance, such as 100 kb away. Polymorphic variants in such sequences can affect basal gene expression or regulation of gene expression.
[0129] The presence or absence of the at least one polymorphic variant can be determined by nucleotide sequencing. Sequencing can be carried out by any suitable method, for example, dideoxy sequencing [Sanger et al., Proc. Natl. Acad. Sci. (USA), 74:5463-5467 (1977)], chemical sequencing [Maxam and Gilbert, Proc. Natl. Acad. Sci. (USA), 74:560-564, (1977)] or variations thereof. Methods for sequencing can also be found in Ausubel et al., eds., Short Protocols in Molecular Biology, 0.3rd ed., Wiley, 1995 and Sambrook et al., Molecular Cloning, 2nd ed., Ch. 13, Cold Spring Harbor Laboratory Press, 1989. The sequencing can involve sequencing of a portion or portions of a gene and/or portions of a plurality of genes that includes at least one polymorphic variant site, and can include a plurality of such sites. The portion can be of sufficient length to discern whether the polymorphic variant(s) of interest is present. In some embodiments the portion is 500, 250, 100, 75, 65, 50, 45, 35, 25 nucleotides or less in length. Sequencing can also include the use of dye-labeled dideoxy nucleotides, and the use of mass spectrometric methods. Mass spectrometric methods can also be used to determine the nucleotide present at a polymorphic variant site.
[0130] RFLP analysis is useful for detecting the presence of genetic variants at a locus in a population when the variants differ in the size of a probed restriction fragment within the locus, such that the difference between the variants can be visualized by electrophoresis [see, e.g. U.S. Pat. Nos. 5,324,631 and 5,645,995]. Such differences will occur when a variant creates or eliminates a restriction site within the probed fragment. RFLP analysis is also useful for detecting a large insertion or deletion within the probed fragment. RFLP analysis is useful for detecting, for example, an Alu or other sequence insertion or deletion.
[0131] Single-strand conformational polymorphisms (SSCPs) can be detected in <220 by PCR amplicons with high sensitivity [Orita et al, Proc. Natl. Acad. Sci. (USA), 86:2766-2770, 1989; Warren et al., In: Current Protocols in Human Genetics, Dracopoli et al., eds, Wiley, 1994, 7.4.1-7.4.6.]. Double strands are first heat-denatured. The single strands are then subjected to polyacrylamide gel electrophoresis under non-denaturing conditions at constant temperature with low voltage and long run times at two different temperatures, typically 4-10° C. and 23° C., or appropriate ambient temperature. At low temperatures such as 4-10° C., the secondary structure of short single strands, the degree of intrachain hairpin formation, is sensitive to even single nucleotide changes, and can be detected as a large change in electrophoretic mobility. Polymorphisms appear as new banding patterns when the gel is stained.
[0132] SSCP is usually paired with a DNA sequencing method, because the SSCP method does not provide the nucleotide identity of polymorphic variants. One useful sequencing method, for example, is DNA cycle sequencing of radiolabeled PCR products using the Femtomole DNA cycle sequencing kit from Promega (WI) and the instructions provided with the kit. Fragments are selected for DNA sequencing based on their behavior in the SSCP assay. Single strand conformation polymorphism screening is a widely used technique for identifying and discriminating DNA fragments that differ from each other by as little as a single nucleotide. The SSCP technique can be used on genomic DNA [Orita et al. Proc. Natl. Acad. Sci. (USA), 86(8):2766-70, 1989] as well as PCR amplified DNA as well.
[0133] The basic steps of the SSCP procedure can be as follows. SSCP can be used to analyze cDNAs or genomic DNAs. If cDNA is used any suitable reverse transcriptase procedure and/or kit may be utilized such as a Superscript II kit from Life Technologies. Material for SSCP analysis can be prepared by PCR amplification of the cDNA in the presence of radiolabeled dNTP, such as dCTP. Usually the concentration of nonradioactive dCTP is dropped from 200 uM (the standard concentration for each of the four dNTPs) to about 100 uM, and .α32PdCTP is added to a concentration of about 0.1-0.3 μM. This process involves adding a 0.3-1 μl (3-10 μCi) of 32P cCTP to a 10 μl PCR reaction. In some embodiments, about 200 base pair PCR products for SSCP. In some embodiments, about 0.8-1.4 kb fragments are amplified and then several cocktails of restriction endonucleases are used to digest those into smaller fragments of about 0.1-0.3 kb, aiming to have as many fragments possible between 0.15 and 0.3 kb. In some embodiments, several different restriction enzyme digests can be performed on each set of samples, and then each of the digests can be run separately on SSCP gels. After digestion, the radiolabelled PCR products are diluted 1:5 by adding formamide load buffer (80% formamide, 1×SSCP gel buffer) and then denatured by heating to 90° C. for 10 minutes, and then allowed to renature by quickly chilling on ice. The secondary structure of the single strands influences their mobility on nondenaturing gels. Even single base differences consistently produce changes in intrastrand folding sufficient to register as mobility differences on SSCP. The resulting single strands are resolved on one or more gels, one a 5.5% acrylamide, 0.5×TBE gel, the other an 8% acrylamide, 10% glycerol, 1×TTE gel, or other appropriate gel recipe known in the art. The use of two gels provides a greater opportunity to recognize mobility differences. Both glycerol and acrylamide concentration have been shown to influence SSCP performance.
[0134] Another method for detecting polymorphic variants is the T4 endonuclease VII (T4E7) mismatch cleavage method: T4E7 specifically cleaves heteroduplex DNA containing single base mismatches, deletions or insertions. The site of cleavage is 1 to 6 nucleotides 3' of the mismatch. The enzyme pinpoints the site of sequence variation, so that sequencing can be confined to a 25-30 nucleotide segment. The major steps in identifying sequence variations in candidate genes using T4E7 are as follows. First, 400-600 by segments are PCR amplified from a panel of DNA samples. Second, a fluorescently-labeled probe DNA is mixed with the sample DNA. Third, the samples are heated and cooled to allow the formation of heteroduplexes. Fourth, the T4E7 enzyme is added to the samples with incubation for 30 minutes at 37° C., during which cleavage occurs at sequence polymorphic variant mismatches. Fifth, the samples are run on an ABI 377 sequencing or other suitable apparatus to identify cleavage bands, which indicate the presence and location of polymorphic variants in the sequence. Sixth, a subset of PCR fragments showing cleavage is sequenced to identify the exact location and identity of each polymorphic variant. A subset of the samples containing each unique T4E7 cleavage site is selected for sequencing. DNA sequencing can, for example, be performed on ABI 377 automated DNA sequencers using BigDye chemistry and cycle sequencing. Analysis of the sequencing runs can be limited to the 30-40 bases marked by the T4E7 procedure as having the polymorphic variant.
[0135] Denaturing gradient gel electrophoresis (DGGE) can detect single base mutations based on differences in migration between homoduplexes and heteroduplexes [Myers et al., Nature, 313:495-498 (1985)]. The DNA sample to be tested is hybridized to a labeled wild type probe. The duplexes formed are then subjected to electrophoresis through a polyacrylamide gel that contains a gradient of DNA denaturant parallel to the direction of electrophoresis. Heteroduplexes formed due to single base variations are detected on the basis of differences in migration between the heteroduplexes and the homoduplexes formed.
[0136] In heteroduplex analysis (HET) [Keen et al., Trends Genet., 7:5 (1991)], genomic DNA is amplified by the polymerase chain reaction followed by an additional denaturing step that increases the chance of heteroduplex formation in heterozygous individuals. The PCR products are then separated on Hydrolink gels where the presence of the heteroduplex is observed as an additional band.
[0137] Chemical cleavage analysis (CCM) is based on the chemical reactivity of thymine (T) when mismatched with cytosine, guanine or thymine and the chemical reactivity of cytosine (C) when mismatched with thymine, adenine or cytosine [Cotton et al., Proc. Natl. Acad. Sci. (USA), 85:4397-4401 (1988)]. Duplex DNA formed by hybridization of a wild type probe with the DNA to be examined, is treated with osmium tetroxide for T and C mismatches and hydroxylamine for C mismatches. T and C mismatched bases that have reacted with the hydroxylamine or osmium tetroxide are then cleaved with piperidine. The cleavage products are analyzed by gel electrophoresis.
[0138] Ribonuclease cleavage involves enzymatic cleavage of RNA at a single base mismatch in an RNA:DNA hybrid [Myers et al., Science, 230:1242-1246 (1985)]. 32P labeled RNA probe complementary to the wild type DNA is annealed to the test DNA and then treated with ribonuclease A. If a mismatch occurs, ribonuclease A will cleave the RNA probe and the location of the mismatch can then be determined by size analysis of the cleavage products following gel electrophoresis.
[0139] In addition to the physical methods described herein and others known to those skilled in the art, see, for example, Housman, U.S. Pat. No. 5,702,890; Housman et al., U.S. patent application Ser. No. 09/045,053, polymorphisms can be detected using computational methods, involving computer comparison of sequences from two or more different biological sources, which can be obtained in various ways, for example from public sequence databases. The term "polymorphic variant scanning" refers to a process of identifying sequence polymorphic variants using computer-based comparison and analysis of multiple representations of at least a portion of one or more genes. Computational polymorphic variant detection involves a process to distinguish true polymorphic variants from sequencing errors or other artifacts, and thus does not require perfectly accurate sequences. Such scanning can be performed in a variety of ways as known to those skilled in the art, preferably, for example, as described in U.S. patent application Ser. No. 09/300,747. The "gene" and "SNP" databases of Pubmed Entrez can also be utilized for identifying polymorphisms.
[0140] Genomic and cDNA sequences can both or in the alternative be used in identifying polymorphisms. Genomic sequences are useful where the detection of polymorphism in or near splice sites is sought, such polymorphism can be in introns, exons, or overlapping intron/exon boundaries. Nucleic acid sequences analyzed may represent full or partial genomic DNA sequences for a gene or genes. Partial cDNA sequences can also be utilized although this is less preferred. As described herein, the polymorphic variant scanning analysis can utilize sequence overlap regions, even from partial sequences. While the present description is provided by reference to DNA, for example, cDNA, some sequences can be provided as RNA sequences, for example, mRNA sequences.
[0141] Interpreting the location of the polymorphic variant in the gene depends on the correct assignment of the initial ATG of the encoded protein (the translation start site). The correct ATG can be incorrect in GenBank, but that one skilled in the art will know how to carry out experiments to definitively identify the correct translation initiation codon (which is not always an ATG). In the event of any potential question concerning the proper identification of a gene or part of a gene, due for example, to an error in recording an identifier or the absence of one or more of the identifiers, the priority for use to resolve the ambiguity is GenBank accession number, OMIM identification number, HUGO identifier, common name identifier.
[0142] Allele and genotype frequencies can be compared between cases and controls using statistical software (for example, SAS PROC NLMIXED). The odds ratios can be calculated using a log linear model by the delta method [Agresti, N.Y.: John Wiley & Sons (1990)] and statistical significance was assessed via the chi-square test. Likelihood ratios (G2) were used to assess goodness of fit of different models i.e., G2 provides a measure of the reliability of the odds ratio; small G2 P-values indicate a poor fit to the model being tested. Combined genotypes can be analysed by estimating, maximum likelihood estimation, the gamete frequencies in cases and controls using a model of the four combinations of alleles as described by Weir, Sunderland, Mass.: Sinauer (1996). Gene-gene interactive effects can be tested using a series of non-hierarchical logistic models [Piegorsch et al., Stat. Med., 13:153-162 (1994)] to estimate interactive dominant and recessive effects. A sample size as large as possible from a relatively homogenous population to minimize variables outside the focus of the study.
[0143] Genomic DNA can be extracted from cases and controls using the QIAamp DNA Blood Mini Kit from Qiagen, UK. Genotyping of polymorphisms was performed using PCR-RFLP (Restriction Fragment Length Polymorphism) using appropriate restriction sites for the gene(s) being studied [Frosst et al., Nature Genet., 10:111-113 (1995); Hol et al., Clin. Genet., 53:119-125 (1998); Brody et al., Am. J. Hum. Genet., 71:1207-1215 (2002)]. A polymorphism may be genotyped using an allele-specific primer extension assay and scored by matrix-assisted laser desorption/ionization-time of flight (MALDI-TOF) mass spectrometry (Sequenom, San Diego). Appropriate controls should be included in all assays. genotyping consistency can be tested by analyzing between 10-15% of samples in duplicate.
[0144] One type of assay has been termed an array hybridization assay, an example of which is the multiplexed allele-specific diagnostic assay (MASDA) [U.S. Pat. No. 5,834,181; Shuber et al., Hum. Molec. Genet., 6:337-347 (1997)]. In MASDA, samples from multiplex PCR are immobilized on a solid support. A single hybridization is conducted with a pool of labeled allele specific oligonucleotides (ASO). The support is then washed to remove unhybridized ASOs remaining in the pool. Labeled ASO remaining on the support are detected and eluted from the support. The eluted ASOs are then sequenced to determine the mutation present.
[0145] Two assays depend on hybridization-based allele-discrimination during PCR. The TaqMan assay [U.S. Pat. No. 5,962,233; Livak et al., Nature Genet., 9:341-342, (1995)] uses allele specific (ASO) probes with a donor dye on one end and an acceptor dye on the other end such that the dye pair interact via fluorescence resonance energy transfer (FRET). A target sequence is amplified by PCR modified to include the addition of the labeled ASO probe. The PCR conditions are adjusted so that a single nucleotide difference will effect binding of the probe. Due to the 5' nuclease activity of the Taq polymerase enzyme, a perfectly complementary probe is cleaved during the PCR while a probe with a single mismatched base is not cleaved. Cleavage of the probe dissociates the donor dye from the quenching acceptor dye, greatly increasing the donor fluorescence.
[0146] An alternative to the TaqMan assay is the molecular beacons assay [U.S. Pat. No. 5,925,517; Tyagi et al., Nature Biotech., 16:49-53 (1998)]. In the molecular beacons assay, the ASO probes contain complementary sequences flanking the-target specific species so that a hairpin structure is formed. The loop of the hairpin is complimentary to the target sequence while each arm of the hairpin contains either donor or acceptor dyes. When not hybridized to a donor sequence, the hairpin structure brings the donor and acceptor dye close together thereby extinguishing the donor fluorescence. When hybridized to the specific target sequence, however, the donor and acceptor dyes are separated with an increase in fluorescence of up to 900 fold. Molecular beacons can be used in conjunction with amplification of the target sequence by PCR and provide a method for real time detection of the presence of target sequences or can be used after amplification.
[0147] High throughput screening for SNPs that affect restriction sites can be achieved by Microtiter Array Diagonal Gel Electrophoresis (MADGE) [Day and Humphries, Anal. Biochem., 222:389-395, (1994)]. In this assay restriction fragment digested PCR products are loaded onto stackable horizontal gels with the wells arrayed in a microtiter format. During electrophoresis, the electric field is applied at an angle relative to the columns and rows of the wells allowing products from a large number of reactions to be resolved.
[0148] Additional assays depend on mismatch distinction by polymerases and ligases. The polymerization step in PCR places high stringency requirements on correct base pairing of the 3' end of the hybridizing primers. This has allowed the use of PCR for the rapid detection of single base changes in DNA by using specifically designed oligonucleotides in a method variously called PCR amplification of specific alleles (PASA) [Sommer et al., Mayo Clin. Proc., 64:1361-1372 (1989); Sarkar et al., Anal. Biochem., 186:64-68 (1990), allele-specific amplification (ASA), allele-specific PCR, and amplification refractory mutation system (ARMS) [Newton et al., Nuc. Acids Res., 17:2503-16 (1989); Nichols et al., Genomics, 5:535-40 (1989); Wu et al., Proc. Natl. Acad. Sci. (USA), 86:2757-60 (1989)]. In these methods, an oligonucleotide primer is designed that perfectly matches one allele but mismatches the other allele at or near the 3' end. This results in the preferential amplification of one allele over the other. By using three primers that produce two differently sized products, it can be determine whether an individual is homozygous or heterozygous for the imitation [Dutton and Sommer, Bio Techniques, 11:700-702 (1991)]. In another method, termed bi-PASA, four primers are used; two outer primers that bind at different distances from the site of the SNP and two allele specific inner primers [Liu et al., Genome Res., 7:389-398 (1997)]. Each of the inner primers have a non-complementary 5' end and form a mismatch near the 3' end if the proper allele is not present. Using this system, zygosity is determined based on the size and number of PCR products produced.
[0149] The joining by DNA ligases of two oligonucleotides hybridized to a target DNA sequence is quite sensitive to mismatches close to the ligation site, especially at the 3' end. This sensitivity has been utilized in the oligonucleotide ligation assay [Landegren et al., Science, 241:1077-1080 (1988)] and the ligase chain reaction [LCR; Barany, Proc. Natl. Acad. Sci. (USA), 88:189-193 (1991)]. In OLA, the sequence surrounding the SNP is first amplified by PCR, whereas in LCR, genomic DNA can by used as a template.
[0150] In one method for mass screening based on the OLA, amplified DNA templates are analyzed for their ability to serve as templates for ligation reactions between labeled oligonucleotide probes [Samotiaki et al., Genomics, 20:238-242, (1994)]. In this assay, two allele-specific probes labeled with either of two lanthanide labels (europium or terbium) compete for ligation to a third biotin labeled phosphorylated oligonucleotide and the signals from the allele specific oligonucleotides are compared by time-resolved fluorescence. After ligation, the oligonucleotides are collected on an avidin-coated 96-pin capture manifold. The collected oligonucleotides are then transferred to microtiter wells in which the europium and terbium ions are released. The fluorescence from the europium ions is determined for each well, followed by measurement of the terbium fluorescence.
[0151] In alternative gel-based OLA assays, polymorphic variants can be detected simultaneously using multiplex PCR and multiplex ligation [U.S. Pat. No. 5,830,711; Day et al., Genomics, 29:152-162 (1995); Grossman et al., Nuc. Acids Res., 22:4527-4534, (1994)]. In these assays, allele specific oligonucleotides with different markers, for example, fluorescent dyes, are used. The ligation products are then analyzed together by electrophoresis on an automatic DNA sequencer distinguishing markers by size and alleles by fluorescence. In the assay by Grossman et al., 1994, mobility is further modified by the presence of a non-nucleotide mobility modifier on one of the oligonucleotides.
[0152] A further modification of the ligation assay has been termed the dye-labeled oligonucleotide ligation (DOL) assay [U.S. Pat. No. 5,945,283; Chen et al., Genome Res., 8:549-556 (1998)]. DOL combines PCR and the oligonucleotide ligation reaction in a two-stage thermal cycling sequence with fluorescence resonance energy transfer (FRET) detection. In the assay, labeled ligation oligonucleotides are designed to have annealing temperatures lower than those of the amplification primers. After amplification, the temperature is lowered to a temperature where the ligation oligonucleotides can anneal and be ligated together. This assay uses a thermostable ligase and a thermostable DNA polymerase without 5' nuclease activity. Because FRET occurs only when the donor and acceptor dyes are in close proximity, ligation is inferred by the change in fluorescence.
[0153] In another method for the detection of polymorphic variants termed minisequencing, the target-dependent addition by a polymerase of a specific nucleotide immediately downstream (3') to a single primer is used to determine which allele is present (U.S. Pat. No. 5,846,710). Using this method, several variants can be analyzed in parallel by separating locus specific primers on the basis of size via electrophoresis and determining allele specific incorporation using labeled nucleotides.
[0154] Determination of individual variants using solid phase minisequencing has been described by [Syvanen et al., Am. J. Hum. Genet., 52:46-59 (1993)]. In this method the sequence including the polymorphic site is amplified by PCR using one amplification primer which is biotinylated on its 5' end. The biotinylated PCR products are captured in streptavidin-coated microtitration wells, the wells washed, and the captured PCR products denatured. A sequencing primer is then added whose 3' end binds immediately prior to the polymorphic site, and the primer is elongated by a DNA polymerase with one single labeled dNTP complementary to the nucleotide at the polymorphic site. After the elongation reaction, the sequencing primer is released and the presence of the labeled nucleotide detected. Alternatively, dye labeled dideoxynucleoside triphosphates (ddNTPs) can be used in the elongation reaction [U.S. Pat. No. 5,888,819; Shumaker et al., Human Mut., 7:346-354, (1996)]. In this method, incorporation of the ddNTP is determined using an automatic gel sequencer.
[0155] Minisequencing has also been adapted for use with microarrays [Shumaker et al., Human Mut., 7:346-354 (1996)]. In this case, elongation (extension) primers are attached to a solid support such as a glass slide. Methods for construction of oligonucleotide arrays are well known to those of ordinary skill in the art and can be found, for example, in Nature Genetics, Suppl., Jan. 21, 1999. PCR products are spotted on the array and allowed to anneal. The extension (elongation) reaction is carried out using a polymerase, a labeled dNTP and noncompeting ddNTPs. Incorporation of the labeled DNTP is then detected by the appropriate means. In a variation of this method suitable for use with multiplex PCR, extension is accomplished with the use of the appropriate labeled ddNTP and unlabeled ddNTPs [Pastinen et al., Genome Res., 7:606-614 (1997)].
[0156] Solid phase minisequencing has also been used to detect multiple polymorphic nucleotides from different templates in an undivided sample [Pastinen et al., Clin. Chem., 42:1391-1397 (1996)]. In this method, biotinylated PCR products are captured on the avidin-coated manifold support and rendered single stranded by alkaline treatment. The manifold is then placed serially in four reaction mixtures containing extension primers of varying lengths, a DNA polymerase and a labeled ddNTP, and the extension reaction allowed to proceed. The manifolds are inserted into the slots of a gel containing formamide which releases the extended primers from the template. The extended primers are then identified by size and fluorescence on a sequencing instrument.
[0157] Fluorescence resonance energy transfer (FRET) has been used in combination with minisequencing to detect polymorphic variants [U.S. Pat. No. 5,945,283; Chen et al., Proc. Natl. Acad. Sci. USA, 94:10756-10761 (1997)]. In this method, the extension primers are labeled with a fluorescent dye, for example fluorescein. The ddNTPs used in primer extension are labeled with an appropriate FRET dye. Incorporation of the ddNTPs is determined by changes in fluorescence intensities.
[0158] The above discussion of methods for the detection of SNPs is exemplary only and is not intended to be exhaustive. Those of ordinary skill in the art will be able to envision other methods for detection of polymorphic variants that are within the scope and spirit of the invention.
[0159] Polymorphisms are detected in a target nucleic acid from an individual being analyzed. For assay of genomic DNA, virtually any biological sample other than pure red blood cells is suitable. "Tissue" means any sample taken from any subject, preferably a human. For example, convenient tissue samples include whole blood, semen, saliva, tears, urine, fecal material, sweat, buccal epithelium, skin and hair. For assay of cDNA or mRNA, the tissue sample should be obtained from an organ in which the target nucleic acid is expressed.
[0160] Many of the methods described involve amplification of DNA from target samples. This can be accomplished by e.g., PCR. Other suitable amplification methods include the ligase chain reaction (LCR) [see Wu and Wallace, Genomics, 4:560 (1989), Landegren et al., Science, 241:1077 (1988)], transcription amplification [Kwoh et al., Proc. Natl. Acad. Sci. (USA), 86:1173 (1989)], self-sustained sequence replication [Guatelli et al., Proc. Nat. Acad. Sci. (USA), 87:1874 (1990)] and nucleic acid based sequence amplification (NASBA). The latter two amplification methods involve isothermal reactions based on isothermal transcription, which produce both single stranded RNA (ssRNA) and double stranded DNA (dsDNA) as the amplification products in a ratio of about 30 or 100 to 1, respectively.
[0161] Single base extension methods are described by e.g., U.S. Pat. No. 5,846,710, U.S. Pat. No. 6,004,744, U.S. Pat. No. 5,888,819 and U.S. Pat. No. 5,856,092. Generally, the methods work by hybridizing a primer that is complementary to a target sequence such that the 3' end of the primer is immediately adjacent to, but does not span a site of, potential variation in the target sequence. That is, the primer comprises a subsequence from the complement of a target polynucleotide terminating at the base that is immediately adjacent and 5' to the polymorphic site. The term primer refers to a single-stranded oligonucleotide capable of acting as a point of initiation of template-directed DNA synthesis under appropriate conditions (i.e., in the presence of four different nucleoside triphosphates and an agent for polymerization, such as DNA or RNA polymerase or reverse transcriptase) in an appropriate buffer and at a suitable temperature. The appropriate length of a primer depends on the intended use of the primer but typically ranges from 15 to 40 nucleotides. Short primer molecules generally require cooler temperatures to form sufficiently stable hybrid complexes with the template. A primer need not reflect the exact sequence of the template but should be sufficiently complementary to hybridize with a template. The term primer site refers to the area of the target DNA to which a primer hybridizes. The term primer pair means a set of primers including a 5' upstream primer that hybridizes with the 5' end of the DNA sequence to be amplified and a 3', downstream primer that hybridizes with the complement of the 3' end of the sequence to be amplified. Hybridization probes are capable of binding in a base-specific manner to a complementary strand of nucleic acid. Such probes include nucleic acids and peptide nucleic acids as described in [Nielsen et al., Science, 254:1497-1500 (1991)]. A probe primer can be labeled, if desired, by incorporating a label detectable by spectroscopic, photochemical, biochemical, immunochemical, or chemical means. For example, useful labels include 32P, fluorescent dyes, electron dense reagents, enzymes (as commonly used in an ELISA), biotin, or haptens and proteins for which antisera or monoclonal antibodies are available. A label can also be used to "capture" the primer, so as to facilitate the immobilization of either the primer or a primer extension product, such as amplified DNA, on a solid support. The hybridization is performed in the presence of one or more labeled nucleotides complementary to base(s) that may occupy the site of potential variation. For example, for biallelic polymorphisms, two differentially labeled nucleotides can be used. For tetraallelic polymorphisms, four differentially-labeled nucleotides can be used. In some methods, particularly methods employing multiple differentially labeled nucleotides, the nucleotides are dideoxynucleotides. Hybridization is performed under conditions permitting primer extension if a nucleotide complementary to a base occupying the site of variation if the target sequence is present. Extension incorporates a labeled nucleotide thereby generating a labeled extended primer. If multiple differentially-labeled nucleotides are used and the target is heterozygous then multiple differentially-labeled extended primers can be obtained. Extended primers are detected providing an indication of which base(s) occupy the site of variation in the target polynucleotide.
[0162] An allele-specific primer hybridizes to a site on target DNA overlapping a polymorphism and only primes amplification of an allelic form to which the primer exhibits perfect complementarily. [See Gibbs, Nucleic Acid Res., 17:2427-2448 (1989).] This primer is used in conjunction with a second primer that hybridizes at a distal site. Amplification proceeds from the two primers leading to a detectable product signifying that the particular allelic form is present. A control is usually performed with a second pair of primers, one of which shows a single base mismatch at the polymorphic site and the other of which exhibits perfect complementarily to a distal site. The single-base mismatch prevents amplification and no detectable product is formed. In some methods, the mismatch is included in the 3'-most position of the oligonucleotide aligned with the polymorphism because this position is most destabilizing to elongation from the primer. [See, e.g., WO 93/22456.] In other methods, a double-base mismatch is used in which the first mismatch is included in the 3'-most position of the oligonucleotide aligned with the polymorphism and a second mismatch is positioned at the immediately adjacent base (the pen-ultimate 3' position). This double mismatch further prevents amplification in instances in which there is no match between the 3' position of the primer and the polymorphism.
[0163] Amplification products generated using the polymerase chain reaction can be analyzed by the use of denaturing gradient gel electrophoresis. Different alleles can be identified based on the different sequence-dependent melting properties and electrophoretic migration of DNA in solution. [Erlich, ed., PCR Technology, Principles and Applications for DNA Amplification, (W. H. Freeman and Co, New York, (1992)), Chapter 7.]
[0164] Arrays provide a high throughput technique that can assay a large number of polynucleotides in a sample. In one aspect of the invention, an array is constructed comprising one or more of the genes, proteins or antibodies relevant to the invention, comprising one or more of these sequences. This technology can be used as a tool to test for differential expression, or for genotyping. Arrays can be created by spotting polynucleotide probes onto a substrate (e.g., glass, nitrocellulose, etc.) in a two-dimensional matrix or array having bound probes. The probes can be bound to the substrate by either covalent bonds or by non-specific interactions, such as hydrophobic interactions. Techniques for constructing arrays and methods of using these arrays are described in, for example, Schena et al., Proc. Natl. Acad. Sci. (USA), 93(20):10614-9 (1996); Schena et al., Science, 270(5235):467-70 (1995); Shalon et al., Genome Res., 6(7):639-45 (1996), U.S. Pat. No. 5,807,522, EP 799 897; WO 97/29212; WO 97/27317; EP 785 280; WO 97/02357; U.S. Pat. No. 5,593,839; U.S. Pat. No. 5,578,832; EP 728 520; U.S. Pat. No. 5,599,695; EP 721 016; U.S. Pat. No. 5,556,752; WO 95/22058; and U.S. Pat. No. 5,631,734.
[0165] In some embodiments, an array comprises probes specific for one or more allelic variants for a given gene. Probes that specifically bind to the allele of interest can be used, and reaction conditions for hybridization to the array can be adjusted accordingly. The probes utilized in the arrays can be of varying types and can include, for example, synthesized probes of relatively short length (e.g., a 20-mer or a 25-mer), cDNA (full length or fragments of gene), amplified DNA, fragments of DNA (generated by restriction enzymes, for example) and reverse transcribed RNA. Both custom and generic arrays can be utilized in detecting differential expression levels. Custom arrays can be prepared using probes that hybridize to particular preselected subsequences of mRNA gene sequences or amplification products prepared from them. Many variations on methods of detection using arrays are within the skill in the art and within the scope of the invention. For example, rather than immobilizing the probe to a solid support, the test sample can be immobilized on a solid support that is then contacted with the probe.
[0166] Screening may also be based on the functional or antigenic characteristics of the protein. Immunoassays designed to detect predisposing polymorphisms in proteins relevant to the invention can be used in screening. Antibodies specific for a polymorphism variant or gene products may be used in screening immunoassays. A sample is taken from a subject. Samples, as used herein, include biological fluids such as tracheal lavage, blood, cerebrospinal fluid, tears, saliva, lymph, dialysis fluid and the like; organ or tissue culture derived fluids; and fluids extracted from physiological tissues. Samples can also include derivatives and fractions of such fluids. In some embodiments, the sample is derived from a biopsy. The number of cells in a sample will generally be at least about 103, usually at least 104 more usually at least about 105. The cells can be dissociated, in the case of solid tissues, or tissue sections may be analyzed. Alternatively a lysate of the cells can be prepared.
[0167] In some embodiments, detection utilizes staining of cells or histological sections, performed in accordance with conventional methods. The antibodies of interest are added to the cell sample, and incubated for a period of time sufficient to allow binding to the epitope, usually at least about 10 minutes. The antibody may be labeled with radioisotopes; enzymes, fluorescers, chemiluminescers, or other labels for direct detection. Alternatively or in addition, a second stage antibody or reagent can be used to amplify the signal. For example, the primary antibody can be conjugated to biotin, with horseradish peroxidase-conjugated avidin added as a second stage reagent. Final detection uses a substrate that undergoes a color change in the presence of the peroxidase. The absence or presence of antibody binding may be determined by various methods, including flow cytometry of dissociated cells, microscopy, radiography, scintillation counting, etc.
[0168] An alternative method for diagnosis depends on the in vitro detection of binding between antibodies and protein encoded by the polymorphic variant in a lysate. Measuring the concentration of protein binding in a sample or fraction thereof can be accomplished by a variety of specific assays. A conventional sandwich type assay may be used. For example, a sandwich assay can first attach polymorphic variant protein specific antibodies to an insoluble surface or support. The particular manner of binding is not crucial so long as it is compatible with the reagents and overall methods of the invention. Binding may be covalent or non-covalent.
[0169] Other immunoassays are known in the art and may find use as diagnostics. Ouchterlony plates provide a simple determination of antibody binding. Western blots can be performed on protein gels or protein spots on filters, using a detection system specific for polymorphic variant protein as desired, conveniently using a labeling method as described for the sandwich assay.
[0170] The invention provides a method for determining a genotype of an individual in relation to one or more polymorphic variants in one or more of the genes identified in above aspects by using mass spectrometric determination of a nucleic acid sequence that is a portion of a gene identified for other aspects of this invention or a complementary sequence. Such mass spectrometric methods are known to those skilled in the art. In preferred embodiments, the method involves determining the presence or absence of a polymorphic variant in a gene; determining the nucleotide sequence of the nucleic acid sequence; the nucleotide sequence is 100 nucleotides or less in length, preferably 50 or less, more preferably 30 or less, and still more preferably 20 nucleotides or less. In general, such a nucleotide sequence includes at least one polymorphic variant site, preferably a polymorphic variant site which is informative with respect to the expected response of a patient to a treatment as described for above aspects.
Therapies
[0171] The invention provides methods for choosing a relevant therapeutic strategy based on the detection of one or more polymorphic variants. In some embodiments, the polymorphic variant indicates an altered susceptibility to a particular disease state. In embodiments, where the variant is associated with an increased susceptibility for that disease state. In some embodiments, for the MTHFD1 1958A variant, a resulting diagnosis for an increased susceptibility for a pregnancy complication such as severe placental abruption or a second trimester miscarriage would indicate a therapy that helps minimize or eliminate such complications. In some embodiments, for the MTHFD1L "ATT" seven repeat intron variant of rs3832406, a resulting diagnosis for an increased susceptibility for a pregnancy complication such as a neural tube defect indicates a therapy that helps minimize or eliminate such complications. In some embodiments, for the MTHFD1L "ATT" seven repeat intron variant of rs3832406, a resulting diagnosis for an increased susceptibility for a cancer drug complication would indicate that helps minimize such a complication. Accordingly, the invention provides a method for determining whether a compound has a differential effect due to the presence or absence of at least one polymorphic variant in a gene or a variant form of a gene. In some embodiments, the method comprises identifying a subset of patients with enhanced or diminished response or tolerance to a treatment method or a method of administration of a treatment where the treatment is for a disease or condition in the patient. General methods of testing effects of a polymorphic variant for an effect on drug efficacy are known to those of skill in the art and are provided in various sources such as U.S. Pat. Nos. 6,537,759; 6,664,062; and 6,759,200.
[0172] One or more polymorphic variants in one or more genes in a plurality of patients can be correlated with response to a particular treatment such as a drug or more specifically a drug regimen including dosage, administration, and other relevant parameters. The correlation can be performed by determining the one or more polymorphic variants in the one or more genes in the plurality of patients and correlating the presence or absence of each of the polymorphic variants (alone or in various combinations) with the patient's response to a particular treatment. The polymorphic variants can be previously known to exist or can also be determined de novo, or combinations of prior information and newly determined information may be used. The enhanced or diminished response should be statistically significant, preferably such that p=0.10 or less, more preferably 0.05 or less, and most preferably 0.02 or less. A positive correlation between the presence of one or more polymorphic variants and an enhanced response to treatment is indicative that the treatment is particularly effective in the group of patients having those polymorphic variants. A positive correlation of the presence of the one or more polymorphic variants with a diminished response to the treatment is indicative that the treatment will be less effective in the group of patients having those polymorphic variants. Such information is useful, for example, for selecting or de-selecting patients for a particular treatment or method of administration of a treatment, or for demonstrating that a group of patients exists for which the treatment or method of treatment would be particularly beneficial or contra-indicated. Such demonstration can be beneficial, for example, for obtaining government regulatory approval for a new drug or a new use of a drug.
[0173] In some embodiments, a first patient or set of patients suffering from a disease or condition are identified whose response to a treatment differs from the response (to the same treatment) of a second patient or set of patients suffering from the same disease or condition, and then determining whether the frequency of at least one polymorphic variant in at least one gene differs in frequency between the first patient or set of patients and the second patient or set of patients. A correlation between the presence or absence of the polymorphic variant or polymorphic variants and the response of the patient or patients to the treatment indicates that the polymorphic variant provides information about variable patient response. The method can involve identifying at least one polymorphic variant in at least one gene. In some embodiments, a first patient or set of patients suffering from a disease or condition and having a particular genotype, haplotype or combination of genotypes or haplotypes is identified, and a second patient or set of patients suffering from the same disease or condition that have a genotype or haplotype or sets of genotypes or haplotypes that differ in a specific way from those of the first set of patients is identified. The extent and magnitude of clinical response can be compared between the first patient or set of patients and the second patient or set of patients. A correlation between the presence or absence of a polymorphic variant or polymorphic variants or haplotypes and the response of the patient or patients to the treatment indicates that the polymorphic variant provides information about variable patient response and is useful for the invention.
[0174] Polymorphic variants of relevance include those that can affect one or more of: the susceptibility of individuals to a disease; the course or natural history of a disease; and the response of a patient with a disease to a medical intervention, such as, for example, a drug, a biologic substance, physical energy such as radiation therapy, or a specific dietary regimen. Variation in any of these three parameters can constitute the basis for initiating a pharmacogenetic study directed to the identification of the genetic sources of interpatient variation. The effect of a DNA sequence polymorphic variant or polymorphic variants on disease susceptibility or natural history are of particular interest as the polymorphic variants can be used to define patient subsets that behave differently in response to medical interventions. Useful gene sequence polymorphic variants for this invention can be described as polymorphic variants that partition patients into two or more groups that respond differently to a therapy, regardless of the reason for the difference, and regardless of whether the reason for the difference is known.
[0175] Once the presence or absence of a polymorphic variant or polymorphic variants in a gene or genes is shown to correlate with the efficacy or safety of a treatment method, that information can be used to select an appropriate treatment method for a particular patient. In the case of a treatment which is more likely to be effective when administered to a patient who has at least one copy of a gene with a particular polymorphic variant or polymorphic variants (in some cases the correlation with effective treatment is for patients who are homozygous for polymorphic variant or set of polymorphic variants in a gene) than in patients with a different polymorphic variant or set of polymorphic variants, a method of treatment is selected (and/or a method of administration) which correlates positively with the particular polymorphic variant presence or absence which provides the indication of effectiveness. Such selection can involve a variety of different choices, and the correlation can involve a variety of different types of treatments, or choices of methods of treatment. In some cases, the selection can include choices between treatments or methods of administration where more than one method is likely to be effective, or where there is a range of expected effectiveness or different expected levels of contra-indication or deleterious effects. In such cases, the selection can be performed to select a treatment that will be as effective or more effective than other methods, while having a comparatively low level of deleterious effects. Similarly, where the selection is between methods with differing levels of deleterious effects, preferably a method is selected that has low such effects but that is expected to be effective in the patient. Alternatively, in cases where the presence or absence of the particular polymorphic variant or polymorphic variants is indicative that a treatment or method of administration is more likely to be ineffective or contra-indicated in a patient with that polymorphic variant or polymorphic variants, then such treatment or method of administration is generally eliminated for use in that patient.
[0176] The term "therapy" refers to a process that is intended to produce a beneficial change in the condition of a mammal, for example, a human, often referred to as a patient. A beneficial change can include one or more of: restoration of function; reduction of symptoms; limitation or retardation of progression of a disease; disorder; or condition or prevention, limitation or retardation of deterioration of a patient's condition; disease or disorder. Such therapy can involve nutritional modifications, administration of radiation, administration of a drug, behavioral modifications and combinations of these, among others.
[0177] The terms "inhibit," "prevent," and "treat," as well as words stemming therefrom, as used herein, do not necessarily imply 100% or complete inhibition, prevention, or treatment. Rather, there are varying degrees of inhibition, prevention, or treatment of which one of ordinary skill in the art recognizes as having a potential benefit or therapeutic effect. In this respect, the present inventive methods can provide any amount of inhibition of metastasis of a cancer cell, any level of prevention of metastasis of a cancer cell of cancer, or any degree of treatments of a cancer in a subject. The term "patient" refers to both human and veterinary subjects. The term "subject" or "individual" typically refers to humans, but also to mammals and other animals, multicellular organisms such as plants, and single-celled organisms or viruses.
[0178] If a given polymorphism variant correlates with an increased the expression level or activity of the protein encoded by the variant, the complications associated with the variant can be treated by administering an antagonist of the protein. If a given polymorphism variant correlates with a complication involving decrease in the expression level or activity of the protein encoded by the variant, the complications can be treated by administering the protein itself, a nucleic acid encoding the protein that can be expressed in a patient, or an analog or agonist of the protein. In the case of pregnancy complications and polymorphism variants of genes encoding enzymes involved in a one carbon metabolic pathway such as MTHFD1, MTHFD1L, and MTHFR, folate, Vitamin B12, and/or other B vitamins are administered to the woman subject who is pregnant or planning a pregnancy. Other treatments can include, but are not limited to, surgery, the administration of pharmaceutical compounds or nutritional supplements, and behavioral changes such as improved diet, increased exercise, reduced alcohol intake, smoking cessation, etc.
[0179] The invention comprises a method for determining a method of treatment effective to treat a disease or condition by altering the level of activity of a product of an allele of a gene selected from the genes described herein, and determining whether that alteration provides a differential effect related to reducing or alleviating a disease or condition as compared to at least one alternative allele or an alteration in toxicity or tolerance of the treatment by a patient or patients. The presence of such a differential effect indicates that altering that level of activity provides at least part of an effective treatment for the disease or condition.
[0180] Information gained from analyzing genetic material for the presence of polymorphisms can be used to design treatment regimes involving gene therapy. For example, detection of a polymorphism that either affects the expression of a gene or results in the production of a mutant protein can be used to design an artificial gene to aid in the production of normal, wild type protein or help restore normal gene expression. Once designed, the gene can be placed in the individual by any suitable means known in the art. [Gene Therapy Technologies, Applications and Regulations, Meager, ed., Wiley (1999); Gene Therapy Principles and Applications, Blankenstein, ed., Birkhauser Verlag (1999); Jain, Textbook of Gene Therapy, Hogrefe and Huber (1998)].
[0181] There are several methods that can be used for assessing the medical and pharmaceutical implications of a polymorphic variant include computational methods, in vitro and/or in vivo experimental methods, prospective human clinical trials, and other laboratory and clinical measures. Informatics-based approaches include DNA and protein sequence analysis, such as phylogenetic approaches and motif searching, and protein modeling. Tools are available for modeling the structure of proteins with unsolved structure, particularly if there is a related protein with known structure. [Rost et al., J. Mol. Biol., 270:471-480 (1997); Firestine et al., Chem. Biol., 3:779-783 (1996)]. Methods are also available for identifying conserved domains and vital amino acid residues of proteins of unknown structure by analysis of phylogenetic relationships. [Deleage et al., Biochimie, 79:681-686 (1997); Taylor et al., Protein Sci., 3:1858-1870 (1994).] These methods can permit the prediction of functionally important polymorphic variants, either on the basis of structure or evolutionary conservation. Phylogenetic approaches to understanding sequence variation can also be used. If a sequence polymorphic variant occurs at a nucleotide or encoded amino acid residue where there is usually little or no variation in homologs of the protein of interest from non-human species, particularly evolutionarily remote species, then the polymorphic variant can be more likely to affect function of the RNA or protein.
[0182] Clinical Trial
[0183] A clinical trial can be used to evaluate differential efficacy of or tolerance to a treatment in a subset of patients who have a particular polymorphic variant or polymorphic variants in at least one gene. A "clinical trial" is the testing of a therapeutic intervention in a volunteer human population for the purpose of determining whether a therapeutic intervention is safe and/or efficacious in the human volunteer or patient population for a given disease, disorder, or condition. Clinical trials can comprise Phase I, II, III, or IV trials. In general, the polymorphisms relevant to the invention are useful for conducting clinical trials of drug candidates for the disease state, conditions and complications of the invention. Such trials can be performed on treated or control populations having similar or identical polymorphic profiles at a defined collection of polymorphic sites. Use of genetically matched populations eliminates or reduces variation in treatment outcome due to genetic factors, leading to a more accurate assessment of the efficacy of a potential drug. In some embodiments, the set of polymorphisms may be used to stratify the enrolled patients into disease sub-types or classes. In some embodiments, the polymorphisms are used to identify subsets of patients with similar polymorphic profiles who have an unusually high or low response to treatment or who do not respond at all. Information about the underlying genetic factors influencing response to treatment can be used in many aspects of the development of treatments, such as identification of new targets, through the design of new trials, product labeling, and patient targeting. Additionally, the polymorphisms can be used to identify the genetic factors involved in adverse response to treatment.
[0184] Diagnostic tests for a specific polymorphic variant or variant form of a gene can be incorporated in the clinical trial protocol as inclusion or exclusion criteria for enrollment in the trial, to allocate certain patients to treatment or control groups within the clinical trial or to assign patients to different treatment cohorts. In some embodiments, diagnostic tests for specific polymorphic variants are performed on all patients within a clinical trial, and statistical analysis performed comparing and contrasting the efficacy or safety of a drug between individuals with different polymorphic variants or variant forms of the gene or genes. Diagnostic tests for polymorphic variants can be performed on groups of patients known to have efficacious responses to the drug to identify differences in the frequency of polymorphic variants between responders and non-responders. In some embodiments, diagnostic tests for polymorphic variants are performed on groups of patients known to have toxic responses to the drug to identify differences in the frequency of the polymorphic variant between those having adverse events and those not having adverse events. Such outlier analyses are useful if a limited number of patient samples are available for analysis. Embodiments involving clinical trials include the genetic stratification strategies, phases, statistical analyses, sizes, and other relevant parameters.
[0185] Prior to establishment of a diagnostic test for use in the selection of a treatment method or elimination of a treatment method, the presence or absence of one or more specific polymorphic variants in a gene or in multiple genes is correlated with a differential treatment response. Such a differential response can be determined using prospective and/or retrospective data. The determination can be performed by analyzing the presence or absence of particular polymorphic variants in patients who have previously been treated with a particular treatment method, and correlating the polymorphic variant presence or absence with the observed course, outcome, and/or development of adverse events in those patients. Alternatively, the analysis can be performed prospectively, where the presence or absence of the polymorphic variant or polymorphic variants in an individual is determined and the course, outcome, and/or development of adverse events in those patients is subsequently or concurrently observed and then correlated with the polymorphic variant determination.
[0186] General methods for performing clinical trials are well known in the art. [Guide to Clinical Trials by Bert Spilker, Raven Press, 1991; The Randomized Clinical Trial and Therapeutic Decisions by Niels Tygstrup (Editor), Marcel Dekker; Recent Advances in Clinical Trial Design and Analysis (Cancer Treatment and Research, Ctar 75) by Peter F. Thall (Editor) Kluwer Academic Pub, 1995.] Additional design considerations include defining what the genetic hypothesis is, how it is to be tested, how many patients will need to be enrolled to have adequate statistical power to measure an effect of a specified magnitude, definition of primary and secondary endpoints, and methods of statistical analysis. The design of the trial can incorporate the preclinical data sets to determine the primary and secondary endpoints. Endpoints can include whether the therapeutic intervention is efficacious, efficacious with undesirable side effects, ineffective, ineffective with undesirable side effects, or ineffective with deleterious effects. Pharmacoeconomic analyses can be incorporated in order to support the efficacious intervention, efficacious with undesirable side effects cases, whereby the clinical outcome is positive, and economic analyses are carried out for the support of overall benefit to the patient and to society. The strategies for designing a clinical trial to test the effect of a genotypic polymorphic variant or polymorphic variants can be modified based upon the data and information from the preclinical studies and the patient symptomatic parameters unique to the target indication.
[0187] A clinical trial in which pharmacogenetic related efficacy or toxicity endpoints are included in the primary or secondary endpoints can be part of a retrospective or prospective clinical trial. In the design of these trials, the allelic differences is identified and stratification based upon these genotypic differences among patient or subject groups are used to ascertain the significance of the impact a genotype has on the candidate therapeutic intervention. Retrospective pharmacogenetic trials can be conducted at each of the phases of clinical development, with the assumption that sufficient data is available for the correlation of the physiologic effect of the candidate therapeutic intervention and the allelic polymorphic variant or polymorphic variants within the treatment population. In the case of a retrospective trial, the data collected from the trial can be re-analyzed by imposing the additional stratification on groups of patients by specific allelic polymorphic variants that may exist in the treatment groups. Retrospective trials can be useful to ascertain whether a hypothesis that a specific polymorphic variant has a significant effect on the efficacy or toxicity profile for a candidate therapeutic intervention. Retrospective or prospective human clinical trials are performed to test whether the identified allelic polymorphic variant, polymorphic variants, or haplotypes or combination thereof influence the efficacy or toxicity profiles for a given drug or other therapeutic intervention.
[0188] In designing a pharmacogenetic trial, retrospective analysis of Phase II or Phase III clinical data can indicate trial variables for which further analysis should be obtained. A placebo controlled pharmacogenetics clinical trial design can be one in which target allelic polymorphic variant or polymorphic variants is identified and a diagnostic test is performed to stratify the patients based upon presence, absence, or combination thereof of these polymorphic variants. In the Phase II or phase III stage of clinical development, determination of a specific sample size of a prospective trial is described to include factors such as expected differences between a placebo and treatment on the primary or secondary endpoints and a consideration of the allelic frequencies.
[0189] A prospective clinical trial has the advantage that the trial can be designed to ensure the trial objectives can be met with statistical certainty. In these cases, power analysis, which includes the parameters of allelic polymorphic variant frequency, number of treatment groups, and ability to detect positive outcomes can ensure that the trial objectives are met.
[0190] The design of a pharmacogenetics clinical trial can include a description of the allelic polymorphic variant impact on the observed efficacy between the treatment groups. Using this type of design, the type of genetic and phenotypic relationship display of the efficacy response to a candidate therapeutic intervention is analyzed. For example, a genotypically dominant allelic polymorphic variant or polymorphic variants are those in which both heterozygotes and homozygotes demonstrate a specific phenotypic efficacy response different from the homozygous recessive genotypic group. A pharmacogenetic approach is useful for clinicians and public health professionals to include or eliminate small groups of responders or non-responders from treatment in order to avoid unjustified side-effects. Further, adjustment of dosages when clear clinical difference between heterozygous and homozygous individuals may be beneficial for therapy with the candidate therapeutic intervention.
[0191] In some embodiments, a recessive allelic polymorphic variant or polymorphic variants are those in which only the homozygote recessive for that or those polymorphic variants will demonstrate a specific phenotypic efficacy response different from the heterozygotes or homozygous wildtype. In some embodiments, allelic polymorphic variant or polymorphic variants organized by haplotypes from additional gene or genes are included to help explain clinical phenotypic outcome differences among the treatment groups. These types of clinical studies can identify an allelic polymorphic variant and its role in the efficacy or toxicology pattern within the treatment population.
[0192] Statistical Analysis of Data
[0193] A variety of informative comparisons can be used to identify correlations in the clinical data. In some embodiments, a plurality of pairwise comparisons of treatment response and the presence or absence of at least one polymorphic variant can be performed for a plurality of patients. The response of at least one patient homozygous for at least one polymorphic variant can be compared with at least one patient homozygous for the alternative form of that polymorphic variant or polymorphic variants. The response of at least one patient heterozygous for at least one polymorphic variant can be compared with the response of at least one patient homozygous for the at least one polymorphic variant. The heterozygous patient response can be compared to both alternative homozygous forms, or the response of heterozygous patients is grouped with the response of one class of homozygous patients and said group is compared to the response of the alternative homozygous group.
[0194] One approach to analyzing the clinical data is as follows. First, variability between patients in the response to a particular treatment is observed. Second, at least a portion of the variable response is correlated with the presence or absence of at least one polymorphic variant in at least one gene. Third, an analytical or diagnostic test is provided to determine the presence or absence of the at least one polymorphic variant in individual patients. Fourth, the presence or absence of the polymorphic variant or polymorphic variants is used to select a patient for a treatment or to select a treatment for a patient, or the polymorphic variant information is used in other methods described herein.
[0195] Polymorphic variants in a gene can be correlated empirically with treatment response, which can be used to identify polymorphic variants in a gene that exist in a population. The presence of the different polymorphic variants or haplotypes in individuals of a study group, which can be representative of a population or populations, is determined. This polymorphic variant information is then correlated with treatment response of the various individuals as an indication that genetic variability in the gene is at least partially responsible for differential treatment response. Statistical measures known to those skilled in the art can be used to measure the fraction of interpatient variation attributable to any one polymorphic variant. Useful methods for identifying genes relevant to the physiologic action of a drug or other treatment are known to those skilled in the art, and include large scale analysis of gene expression in cells treated with the drug compared to control cells, or large scale analysis of the protein expression pattern in treated vs. untreated cells, or the use of techniques for identification of interacting proteins or ligand-protein interactions.
[0196] The gene comprising the polymorphic variant can be involved in drug action, and the variant forms of the gene are associated with variability in the action of the drug. In some embodiments, one variant form of the gene is associated with the action of the drug such that the drug will be effective in an individual who is heterozygous or homozygous for the variant. In some embodiments, a variant form of the gene is associated with the action of the drug such that the drug will be toxic or otherwise contra-indicated in a homozygous or heterozygous individual.
[0197] In one embodiment, patients are stratified by genotype by one candidate polymorphic variant in the candidate gene locus. Genetic stratification of patients can be accomplished in several ways, including the following, where "X" is the more frequent form of the polymorphic variant being assessed and "x" is the less frequent form): (a) XX vs. xx; (b) XX vs. Xx vs. xx; (c) XX vs. (Xx+xx); (d) (XX+Xx) vs. xx. The effect of genotype on drug response phenotype can be affected by a variety of nongenetic factors, and it can be beneficial to measure the effect of genetic stratification in a subgroup of the overall clinical trial population. Subgroups can be defined in a number of ways including, for example, biological, clinical, pathological or environmental criteria. Biological criteria include sex (gender), age, hormonal status and reproductive history, ethnic, racial, or geographic origin, or surrogate markers of ethnic, racial or geographic origin. Clinical criteria include disease status and disease manifestations. Pathological criteria include histopathologic features of disease tissue, or pathological diagnosis; pathological stage; loss of heterozygosity (LOH), pathology studies, and laboratory studies. Frequency of responders is measured in each genetic subgroup. Subgroups can be defined in several ways: more than two age groups, and age related status such as pre or post-menopausal. One can also stratify by haplotype at one candidate locus where the haplotype is made up of two polymorphic variants, three polymorphic variants or greater than three polymorphic variants. A variety of statistical methods exist for measuring the difference between two or more groups in a clinical trial. One skilled in the art will recognize that different methods are suited to different data sets. In general, there is a family of methods customarily used in clinical trials, and another family of methods customarily used in genetic epidemiological studies. Methods from either family can be suitable for performing statistical analysis of pharmacogenetic clinical trial data.
[0198] Conventional clinical trial statistics include hypothesis testing and descriptive methods. Guidance in the selection of appropriate statistical tests for a particular data set can be obtained from texts such as: [Biostatistics: A Foundation for Analysis in the Health Sciences, 7th edition (Wiley Series in Probability and Mathematical Statistics, Applied Probability and statistics) by Wayne W. Daniel, John Wiley & Sons, 1998; Bayesian Methods and Ethics in a Clinical Trial Design (Wiley Series in Probability and Mathematical Statistics. Applied Probability Section) by J. B. Kadane (Editor), John Wiley & Sons, 1996].
[0199] Hypothesis testing statistical procedures include the following examples: one-sample procedures (binomial confidence interval, Wilcoxon signed rank test, permutation test with general scores, generation of exact permutational distributions); two-sample procedures (t-test, Wilcoxon-Mann-Whitney test, Normal score test, Median test, Van der Waerden test, Savage test, Logrank test for censored survival data, Wilcoxon-Gehan test for censored survival data, Cochran-Armitage trend test, permutation test with general scores, generation of exact permutational distributions); RxC contingency tables (Fisher's exact test, Pearson's chi-squared test, Likelihood ratio test, Kruskal-Wallis test, Jonckheere-Terpstra test, Linear-by linear association test, McNemar's test, marginal homogeneity test for matched pairs); Stratified 2×2 contingency tables (test of homogeneity for odds ratio, test of unity for the common odds ratio, confidence interval for the common odds ratio); Stratified 2xC contingency tables (all two-sample procedures listed above with stratification, confidence intervals for the odds ratios and trend, generation of exact permutational distributions); General linear models (simple regression, multiple regression, analysis of polymorphic variant--ANOVA--, analysis of copolymorphic variant, response-surface models, weighted regression, polynomial regression, partial correlation, multiple analysis of polymorphic variant--MANOVA--, repeated measures analysis of polymorphic variant); analysis of polymorphic variant and copolymorphic variant with a nested (hierarchical) structure designs and randomized plans for nested and crossed experiments (completely randomized design for two treatment, split-splot design, hierarchical design, incomplete block design, latin square design); nonlinear regression models; logistic regression for unstratified or stratified data, for binary or ordinal response data, using the logit link function, the normit function or the complementary log-log function; probit, logit, ordinal logistic and gompit regression models, fitting parametric models to failure time data that may be right-, left-, or interval-censored; tested distributions can include extreme value, normal and logistic distributions, and, by using a log transformation, exponential, Weibull, lognormal, loglogistic and gamma distributions; compute non-parametric estimates of survival distribution with right-censored data and compute rank tests for association of the response variable with other variables.
[0200] Descriptive statistical methods include factor analysis with rotations, canonical correlation, principal component analysis for quantitative variables, principal component analysis for qualitative data, hierarchical and dynamic clustering methods to create tree structure, dendrogram or phenogram, simple and multiple correspondence analysis using a contingency table as input or raw categorical data. Specific instructions and computer programs for performing the above calculations can be obtained from companies such as: SAS/STAT Software, SAS Institute Inc., Cary, N.C., USA; BMDP Statistical Software, BMDP Statistical Software Inc., Los Angeles, Calif., USA; SYSTAT software, SPSS Inc., Chicago, Ill., USA; StatXact & LogXact, CYTEL Software Corporation, Cambridge, Mass., USA.
[0201] Genetic epidemiological methods can also be useful in carrying out statistical tests for the invention. Guidance in the selection of appropriate genetic statistical tests for analysis of a particular data set can be obtained from texts such as: [Fundamentals of Genetic Epidemiology (Monographs in Epidemiology and Biostatistics, Vol. 22) by M. J. Khoury, B. H. Cohen & T. H. Beaty, Oxford Univ. Press, 1993; Methods in Genetic Epidemiology by Newton E. Morton, S. Karger Publishing, 1983; Methods in Observational Epidemiology, 2nd edition (Monographs in Epidemiology and Biostatistics, V. 26) by J. L. Kelsey (Editor), A. S. Whittemore & A. S. Evans, 1996; Clinical Trials: Design, Conduct, and Analysis (Monographs in Epidemiology and Biostatistics, Vol 8) by C. L. Meinert & S. Tonascia, 1986)].
[0202] Parsimony methods can be used to classify DNA sequences, haplotypes or phenotypic characters. Parsimony principle maintains that the best explanation for the observed differences among sequences, phenotypes (individuals, species) etc., is provided by the smallest number of evolutionary changes. Alternatively, simpler hypotheses are used to explain a set of data or patterns, than more complicated ones [Molecular Systematics, Hillis et al. (1996)]. These methods for inferring relationship among sequences operate by minimizing the number of evolutionary steps or mutations, changes from one sequence/character, required to explain a given set of data. To obtain relationships among a set of sequences and construct a structure, such as a tree or topology, the minimum number of mutations that are required for explaining the observed evolutionary changes among a set of sequences are first counted. A structure is constructed based on this number. Additional structures are tried and the structure that requires the smallest number of mutational steps is chosen as the likely structure/evolutionary tree for the sequences studied.
[0203] If the computed frequency of the polymorphic variants and/or haplotypes is equal to the number of individuals in the population, then there will be a consideration of utilizing additional methods. For these cases and if there is a small population, then the number of haplotypes will be considered relative to the number of entrants. Homozygotes can be assigned one unambiguous haplotype. If there is a single site polymorphic variant (mutation) at one of the chromosomes then it will have two haplotypes. As the number of polymorphic variants increase in the diploid chromosomes, each of these polymorphic variants are compared with the haplotypes of the original population. Then a frequency is assigned to the new polymorphic variant based upon the Hardy-Weinberg expected frequencies. [See generally, Clark, Mol. Biol. and Evol. (1990).]
[0204] The statistical significance of the differences between polymorphic variant frequencies can be assessed by a Pearson chi-squared test of homogeneity of proportions with n-1 degrees of freedom. Then, in order to determine which polymorphic variant(s) is responsible for an eventual significance, one can consider each polymorphic variant individually against the rest, up to n comparisons, each based on a 2×2 table. This approach should result in chi-sequared tests that are individually valid; taking the most significant of these tests is a form of multiple testing. A Bonferroni's adjustment for multiple testing can be made to the P-values, such as p*=1-(1-p)n. The statistical significance of the difference between genotype frequencies associated to every polymorphic variant can be assessed by a Pearson chi-squared test of homogeneity of proportions with 2 degrees of freedom, using the same Bonferroni's adjustment as above.
[0205] Testing for unequal haplotype frequencies between cases and controls can be considered in the same framework as testing for unequal polymorphic variant frequencies, because a single polymorphic variant can be considered as a haplotype of a single locus. The relevant likelihood ratio test compares a model where two separate sets of haplotype frequencies apply to the cases and controls, to one where the entire sample is characterized by a single common set of haplotype frequencies. This comparison can be performed by repeated use of a computer program [Terwilliger and Ott, 1994, Handbook of Human Linkage Analysis, Baltimore, John Hopkins University Press] to successively obtain the log-likelihood corresponding to the set of haplotpe frequency estimates on the cases (lnL case), on the controls (lnLcontrol), and on the overall (lnLcombined). The test statistic 2((lnLcase)+(lnLcontrol)-(lnLcombined)) is then chi-squared with degrees of freedom, where r is the number of haplotypes. To test for potentially confounding effects or effect-modifiers, such as sex, age, etc., logistic regression can be used with case-control status as the outcome variable, and genotypes and covariates, plus possible interactions, as predictor variables.
[0206] Drug Screening
[0207] Drug screening assays can be performed on cells that have been transfected with a nucleic acid encoding all or part of one of the polymorphic variants relevant to the invention. In some embodiments, no endogenous equivalents of transfected nucleic acids are present in the cells. The cells can be transfected with RNA in which case expression of the polymorphic variant protein is transient. Alternatively, the nucleic acid can be stably introduced into the cell line. In those embodiments wherein the nucleic acid encodes a one carbon metabolic pathway enzyme, cells expressing protein are monitored for relative levels of pathway molecules. The control can be vehicle without an agent or can be an agent known not to have any effect on a one carbon metabolic pathway. The control can be a known agonist and/or antagonist of a one carbon metabolic pathway. Transfected cells are also useful for identifying genes whose expression pattern is altered in the presence of one or more of the polymorphic variants relevant to the invention relative to wildtype form. Such genes themselves are potential therapeutic or diagnostic targets.
[0208] In some embodiments, drug screening assays are performed on transgenic animals. Some transgenic animals have an exogenous human transgene bearing a polymorphic variant relevant to the invention. In some such animals, the endogenous equivalent(s) of transfected gene(s) transgene is/are knocked out. In other transgenic animals, the endogenous gene is mutated to contain one of the variant forms relevant to the invention. Potential agents are administered to the transgenic animal, and relevant parameters are measured. The performance can be compared with that of a transgenic animal administered a control substance or with a nontransgenic animal administered the agent or a control substance.
[0209] The invention provides a pharmaceutical composition that includes a compound that has a differential effect in patients having at least one copy, or alternatively, two copies of a form of a gene as identified for aspects above and a pharmaceutically acceptable carrier, excipient, or diluent. The composition is adapted to be preferentially effective to treat a patient with cells containing one, two, or more copies of the form of the gene.
[0210] The methods and materials of the invention can utilize conventional pharmaceutical compositions more effectively by identifying patients who are likely to benefit from a particular treatment, patients for whom a particular treatment is less likely to be effective, or for whom a particular treatment is likely to produce undesirable or intolerable effects. In some embodiments, compositions are adapted to be preferentially effective in patients who possess particular genetic characteristics, i.e., in whom a particular polymorphic variant or polymorphic variants in one or more genes is present or absent--depending on whether the presence or the absence of the polymorphic variant or polymorphic variants in a patient is correlated with an increased expectation of beneficial response. In some embodiments, one or more polymorphic variants indicates that a patient can beneficially receive a significantly higher dosage of a drug than a patient having a different polymorphic variant or polymorphic variants. An indication or suggestion can specify that a patient be heterozygous, or alternatively, homozygous for a particular polymorphic variant or polymorphic variants or variant form of a gene. In some embodiments, an indication or suggestion specifies that a patient have no more than one copy, or zero copies, of a particular polymorphic variant, polymorphic variants, or variant form of a gene.
[0211] In some embodiments involving pharmaceutical compositions, active compounds, or drugs, the material is subject to a regulatory limitation or restriction on approved uses or indications, e.g., by the U.S. Food and Drug Administration (FDA), limiting approved use of the composition to patients having at least one copy of the particular form of the gene that contains at least one polymorphic variant. In some embodiments, the composition is subject to a regulatory limitation or restriction on approved uses indicating that the composition is not approved for use or should not be used in patients having at least one copy of a form of the gene including at least one polymorphic variant. In some embodiments, the composition is packaged, and the packaging includes a label or insert indicating or suggesting beneficial therapeutic approved use of the composition in patients having one or two copies of a form of the gene including at least one polymorphic variant. Alternatively, the label or insert limits approved use of the composition to patients having zero or one or two copies of a form of the gene including at least one polymorphic variant. The latter embodiment would be likely where the presence of the at least one polymorphic variant in one or two copies in cells of a patient means that the composition would be ineffective or deleterious to the patient. In some embodiments, the composition is indicated for use in treatment of a disease or condition which is one of those identified for aspects above. In some embodiments, the at least one polymorphic variant includes at least one polymorphic variant from those identified herein.
[0212] The term "packaged" means that the drug, compound, or composition is prepared in a manner suitable for distribution or shipping with a box, vial, pouch, bubble pack, or other protective container, which may also be used in combination. The packaging can have printing on it and/or printed material may be included in the packaging. In some embodiments, the drug is subject to a regulatory limitation or suggestion or warning as described above that limits or suggests limiting approved use to patients having specific polymorphic variants or variant forms of a gene in order to achieve maximal benefit and avoid toxicity or other deleterious effect.
[0213] A pharmaceutical composition can be adapted to be preferentially effective in a variety of ways. In some embodiments, an active compound is selected that was not previously known to be differentially active, or which was not previously recognized as a potential therapeutic compound. In some embodiments, the concentration of an active compound that has differential activity can be adjusted such that the composition is appropriate for administration to a patient with the specified polymorphic variants. In some embodiments, the presence of a specified polymorphic variant may allow or require the administration of a much larger dose, which would not be practical with a previously utilized composition. In some embodiments, a patient requires a much lower dose, such that administration of such a dose with a prior composition would be impractical or inaccurate. The composition can be prepared in a higher or lower unit dose form, or prepared in a higher or lower concentration of the active compound or compounds. In yet other cases, the composition can include additional compounds needed to enable administration of a particular active compound in a patient with the specified polymorphic variants, which was not in previous compositions, for example, because the majority of patients did not require or benefit from the added component.
[0214] In some embodiments, a drug is explicitly indicated for, and/or for which approved use is restricted to individuals in the population with specific polymorphic variants or combinations of polymorphic variants, as determined by diagnostic tests for polymorphic variants or variant forms of certain genes involved in the disease or condition or involved in the action of the drug. Such drugs can provide more effective treatment for a disease or condition in a population identified or characterized with the use of a diagnostic test for a specific polymorphic variant or variant form of the gene if the gene is involved in the action of the drug or in determining a characteristic of the disease or condition. Such drugs can be developed using the diagnostic tests for specific polymorphic variants or variant forms of a gene to determine the inclusion of patients in a clinical trial.
[0215] The invention also comprises a method for producing a pharmaceutical composition by identifying a compound that has differential activity against a disease or condition in patients having at least one polymorphic variant in a gene, compounding the pharmaceutical composition by combining the compound with a pharmaceutically acceptable carrier, excipient, or diluent such that the composition is preferentially effective in patients who have at least one copy of the polymorphic variant or polymorphic variants. In some embodiments, the patient has two copies of the polymorphic variant or polymorphic variants. In some embodiments, the disease or condition, gene or genes, polymorphic variants, methods of administration, or method of determining the presence or absence of polymorphic variants is as described for other aspects of this invention.
[0216] The invention also comprises a method for producing a pharmaceutical agent by identifying a compound which has differential activity against a disease or condition in patients having at least one copy of a form of a gene having at least one polymorphic variant and synthesizing the compound in an amount sufficient to provide a pharmaceutical effect in a patient suffering from the disease or condition. The compound can be identified by conventional screening methods and its activity confirmed. Compound libraries can be screened to identify compounds which differentially bind to products of variant forms of a particular gene product, or which differentially affect expression of variant forms of the particular gene, or which differentially affect the activity of a product expressed from such gene.
[0217] The invention also includes methods of manufacturing a medicament comprising one or more of the materials of the invention in the treatment of one or more of the diseases of the invention. Therapeutic agents and regimens further include homocysteine monitoring, B vitamin supplementation, for example, folate, FOLTX®, B12, and chemotherapeutic agents. Each FOLTX® tablet contains 2.5 mg of folacin (folic acid), 25 mg of pyridoxine (Vitamin B6), and 2 mg of cyanocobalamin (Vitamin B12).
[0218] Formulation
[0219] A therapeutic agent, which can be a compound and/or a composition, relevant to the invention can comprise a small molecule, a nucleic acid, a protein, an antibody, or any other agent with one or more therapeutic property. The therapeutic agent can be formulated in any pharmaceutically acceptable manner. In some embodiments, the therapeutic agent is prepared in a depot form to allow for release into the body to which it is administered is controlled with respect to time and location within the body (see, for example, U.S. Pat. No. 4,450,150). Depot forms of therapeutic agents can be, for example, an implantable composition comprising the therapeutic agent and a porous or non-porous material, such as a polymer, wherein the therapeutic agent is encapsulated by or diffused throughout the material and/or degradation of the non-porous material. The depot is then implanted into the desired location within the body and the therapeutic agent is released from the implant at a predetermined rate.
[0220] The therapeutic agent that is used in the invention can be formed as a composition, such as a pharmaceutical composition comprising a carrier and a therapeutic compound. Pharmaceutical compositions containing the therapeutic agent can comprise more than one therapeutic agent. The pharmaceutical composition can alternatively comprise a therapeutic agent in combination with other pharmaceutically active agents or drugs, such as chemotherapeutic agents, for example, a cancer drug.
[0221] The carrier can be any suitable carrier. Preferably, the carrier is a pharmaceutically acceptable carrier. With respect to pharmaceutical compositions, the carrier can be any of those conventionally used and is limited only by chemico physical considerations, such as solubility and lack of reactivity with the active compound(s), and by the route of administration. In addition to the following described pharmaceutical composition, the therapeutic compounds of the present inventive methods can be formulated as inclusion complexes, such as cyclodextrin inclusion complexes, or liposomes.
[0222] The pharmaceutically acceptable carriers described herein, for example, vehicles, adjuvants, excipients, and diluents, are well-known to those skilled in the art and are readily available to the public. The pharmaceutically acceptable carrier can be chemically inert to the active agent(s) and one which has no detrimental side effects or toxicity under the conditions of use. The choice of carrier can be determined in part by the particular therapeutic agent, as well as by the particular method used to administer the therapeutic compound. There are a variety of suitable formulations of the pharmaceutical composition of the invention. The following formulations for oral, aerosol, parenteral, subcutaneous, transdermal, transmucosal, intestinal, parenteral, intramedullary injections, direct intraventricular, intravenous, intranasal, intraocular, intramuscular, intraarterial, intrathecal, interperitoneal, rectal, and vaginal administration are exemplary and are in no way limiting. More than one route can be used to administer the therapeutic agent, and in some instances, a particular route can provide a more immediate and more effective response than another route. Depending on the specific conditions being treated, such agents can be formulated and administered systemically or locally. Techniques for formulation and administration may be found in [Remington's Pharmaceutical Sciences, 18th ed., Mack Publishing Co., Easton, Pa. (1990)].
[0223] Formulations suitable for oral administration can include (a) liquid solutions, such as an effective amount of the inhibitor dissolved in diluents, such as water, saline, or orange juice; (b) capsules, sachets, tablets, lozenges, and troches, each containing a predetermined amount of the active ingredient, as solids or granules; (c) powders; (d) suspensions in an appropriate liquid; and (e) suitable emulsions. Liquid formulations may include diluents, such as water and alcohols, for example, ethanol, benzyl alcohol, and the polyethylene alcohols, either with or without the addition of a pharmaceutically acceptable surfactant. Capsule forms can be of the ordinary hard or soft shelled gelatin type containing, for example, surfactants, lubricants, and inert fillers, such as lactose, sucrose, calcium phosphate, and corn starch. Tablet forms can include one or more of lactose, sucrose, mannitol, corn starch, potato starch, alginic acid, microcrystalline cellulose, acacia, gelatin, guar gum, colloidal silicon dioxide, croscarmellose sodium, talc, magnesium stearate, calcium stearate, zinc stearate, stearic acid, and other excipients, colorants, diluents, buffering agents, disintegrating agents, moistening agents, preservatives, flavoring agents, and other pharmacologically compatible excipients. Lozenge forms can comprise the inhibitor in a flavor, usually sucrose and acacia or tragacanth, as well as pastilles comprising the inhibitor in an inert base, such as gelatin and glycerin, or sucrose and acacia, emulsions, gels, and the like containing, in addition to, such excipients as are known in the art.
[0224] Pharmaceutical preparations that can be used orally include push-fit capsules made of gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol or sorbitol. The push-fit capsules can contain the active ingredients in admixture with filler such as lactose, binders such as starches, and/or lubricants such as talc or magnesium stearate and, optionally, stabilizers. In soft capsules, the active compounds may be dissolved or suspended in suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In addition, stabilizers may be added.
[0225] The therapeutic agent, alone or in combination with other suitable components, can be made into aerosol formulations to be administered via inhalation. These aerosol formulations can be placed into pressurized acceptable propellants, such as dichlorodifluoromethane, propane, nitrogen, and the like. They also can be formulated as pharmaceuticals for non pressured preparations, such as in a nebulizer or an atomizer. Such spray formulations also may be used to spray mucosa. Topical formulations are well known to those of skill in the art. Such formulations are particularly suitable in the context of the invention for application to the skin.
[0226] Injectable formulations are in accordance with the invention. The parameters for effective pharmaceutical carriers for injectable compositions are well-known to those of ordinary skill in the art [see, e.g., Pharmaceutics and Pharmacy Practice, J.B. Lippincott Company, Philadelphia, Pa., Banker and Chalmers, eds., pages 238-250 (1982), and ASHP Handbook on Injectable Drugs, Toissel, 4th ed., pages 622-630 (1986)]. For injection, the agents of the invention can be formulated in aqueous solutions, preferably in physiologically compatible buffers such as Hanks's solution, Ringer's solution, or physiological saline buffer. For such transmucosal administration, penetrants appropriate to the barrier to be permeated are used in the formulation. Such penetrants are generally known in the art.
[0227] Formulations suitable for parenteral administration include aqueous and non aqueous, isotonic sterile injection solutions, which can contain anti oxidants, buffers, bacteriostats, and solutes that render the formulation isotonic with the blood of the intended recipient, and aqueous and non aqueous sterile suspensions that can include suspending agents, solubilizers, thickening agents, stabilizers, and preservatives. The therapeutic agent can be administered in a physiologically acceptable diluent in a pharmaceutical carrier, such as a sterile liquid or mixture of liquids, including water, saline, aqueous dextrose and related sugar solutions, an alcohol, such as ethanol or hexadecyl alcohol, a glycol, such as propylene glycol or polyethylene glycol, dimethylsulfoxide, glycerol, ketals such as 2,2-dimethyl-1,3-dioxolane-4-methanol, ethers, poly(ethyleneglycol) 400, oils, fatty acids, fatty acid esters or glycerides, or acetylated fatty acid glycerides with or without the addition of a pharmaceutically acceptable surfactant, such as a soap or a detergent, suspending agent, such as pectin, carbomers, methylcellulose, hydroxypropylmethylcellulose, or carboxymethylcellulose, or emulsifying agents and other pharmaceutical adjuvants.
[0228] Oils, which can be used in parenteral formulations include petroleum, animal, vegetable, or synthetic oils. Specific examples of oils include peanut, soybean, sesame, cottonseed, corn, olive, petrolatum, and mineral. Suitable fatty acids for use in parenteral formulations include oleic acid, stearic acid, and isostearic acid. Ethyl oleate and isopropyl myristate are examples of suitable fatty acid esters.
[0229] Suitable soaps for use in parenteral formulations include fatty alkali metal, ammonium, and triethanolamine salts, and suitable detergents include (a) cationic detergents such as, for example, dimethyl dialkyl ammonium halides, and alkyl pyridinium halides, (b) anionic detergents such as, for example, alkyl, aryl, and olefin sulfonates, alkyl, olefin, ether, and monoglyceride sulfates, and sulfosuccinates, (c) nonionic detergents such as, for example, fatty amine oxides, fatty acid alkanolamides, and polyoxyethylenepolypropylene copolymers, (d) amphoteric detergents such as, for example, alkyl-β-aminopropionates, and 2-alkyl-imidazoline quaternary ammonium salts, and (e) mixtures thereof.
[0230] The parenteral formulations will typically contain from about 0.5% to about 25% by weight of the inhibitor in solution. Preservatives and buffers may be used. In order to minimize or eliminate irritation at the site of injection, such compositions may contain one or more nonionic surfactants having a hydrophile-lipophile balance (HLB) of from about 12 to about 17. The quantity of surfactant in such formulations will typically range from about 5% to about 15% by weight. Suitable surfactants include polyethylene glycol sorbitan fatty acid esters, such as sorbitan monooleate and the high molecular weight adducts of ethylene oxide with a hydrophobic base, formed by the condensation of propylene oxide with propylene glycol. The parenteral formulations can be presented in unit-dose or multi-dose sealed containers, such as ampoules and vials, and can be stored in a freeze-dried (lyophilized) condition requiring only the addition of the sterile liquid excipient, for example, water, for injections, immediately prior to use. Extemporaneous injection solutions and suspensions can be prepared from sterile powders, granules, and tablets of the kind previously described.
[0231] The therapeutic agent can be made into suppositories by mixing with a variety of bases, such as emulsifying bases or water-soluble bases. Formulations suitable for vaginal administration can be presented as pessaries, tampons, creams, gels, pastes, foams, or spray formulas containing, in addition to the active ingredient, such carriers as are known in the art to be appropriate.
[0232] The exact formulation, route of administration and dosage can be chosen by the individual physician in view of the patient's condition. [See, e.g., Fingl et. al., in The Pharmacological Basis of Therapeutics, 1975, Ch. 1, p. 1]. The attending physician can determine when to terminate, interrupt, or adjust administration due to toxicity, or to organ dysfunctions. Conversely, the attending physician can also adjust treatment to higher levels if the clinical response were not adequate, precluding toxicity. The magnitude of an administrated dose in the management of disorder of interest will vary with the severity of the condition to be treated and the route of administration. The severity of the condition may, for example, be evaluated, in part, by standard prognostic evaluation methods. The dose and perhaps dose frequency, can vary according to the age, body weight, and response of the individual patient. A program comparable to that discussed above can be used in veterinary medicine.
[0233] Use of pharmaceutically acceptable carriers to formulate the compounds herein disclosed for the practice of the invention into dosages suitable for systemic administration is within the scope of the invention. With proper choice of carrier and suitable manufacturing practice, the compositions relevant to the invention, in particular, those formulated as solutions, can be administered parenterally, such as by intravenous injection. The compounds can be formulated readily using pharmaceutically acceptable carriers well known in the art into dosages suitable for oral administration. Such carriers enable the compounds relevant to the invention to be formulated as tablets, pills, capsules, liquids, gels, syrups, slurries, tablets, dragees, solutions, suspensions and the like, for oral ingestion by a patient to be treated.
[0234] Agents intended to be administered intracellularly may be administered using techniques well known to those of ordinary skill in the art. For example, such agents may be encapsulated into liposomes, then administered as described above. Liposomes are spherical lipid bilayers with aqueous interiors. All molecules present in an aqueous solution at the time of liposome formation are incorporated into the aqueous interior. The liposomal contents are both protected from the external microenvironment and, because liposomes fuse with cell membranes, are efficiently delivered into the cell cytoplasm. Additionally, due to their hydrophobicity, small organic molecules may be directly administered intracellularly.
[0235] Pharmaceutical compositions suitable for use in the invention include compositions wherein the active ingredients are contained in an effective amount to achieve its intended purpose. In addition to the active ingredients, these pharmaceutical compositions can contain suitable pharmaceutically acceptable carriers comprising excipients and auxiliaries which facilitate processing of the active compounds into preparations which can be used pharmaceutically. The pharmaceutical compositions relevant to the invention can be manufactured in a manner that is itself known, for example, mixing, dissolving, granulating, dragee-making, levitating, emulsifying, encapsulating, entrapping or lyophilizing processes.
[0236] Pharmaceutical formulations for parenteral administration include aqueous solutions of the active compounds in water-soluble form. Additionally, suspensions of the active compounds can be prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or vehicles include fatty oils such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate or triglycerides, or liposomes. Aqueous injection suspensions can contain substances which increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or dextran. Optionally, the suspension may also contain suitable stabilizers or agents which increase the solubility of the compounds to allow for the preparation of highly concentrated solutions.
[0237] Pharmaceutical preparations for oral use can be obtained by combining the active compounds with solid excipient, optionally grinding a resulting mixture, and processing the mixture of granules, after adding suitable auxiliaries, if desired, to obtain tablets or dragee cores. Suitable excipients are, in particular, fillers such as sugars, including lactose, sucrose, mannitol, or sorbitol; cellulose preparations such as, for example, maize starch, wheat starch, rice starch, potato starch, gelatin, gum tragacanth, methyl cellulose, hydroxypropylmethyl-cellulose, sodium carboxymethylcellulose, and/or polyvinylpyrrolidone (PVP). If desired, disintegrating agents may be added, such as the cross-linked polyvinyl pyrrolidone, agar, or alginic acid or a salt thereof such as sodium alginate. Dragee cores are provided with suitable coatings. For this purpose, concentrated sugar solutions can be used, which may optionally contain gum arabic, talc, polyvinyl pyrrolidone, carbopol gel, polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or solvent mixtures. Dyestuffs or pigments may be added to the tablets or dragee coatings for identification or to characterize different combinations of active compound doses.
[0238] Administration
[0239] The invention also provides selecting a method of administration of an agent to a patient suffering from a disease or condition, by determining the presence or absence of at least one polymorphic variant in cells of the patient, where such presence or absence is indicative of an appropriate method of administration of the agent. The selection of a treatment regimen can involve selecting a dosage level or frequency of administration or route of administration of the agent(s) or combinations of those parameters. In some embodiments, two or more agents are administered, and the selecting involves selecting a method of administration for one, two, or more than two of the agents, jointly, concurrently, or separately. As understood by those skilled in the art, such plurality of agents is often used in combination therapy, and thus may be formulated in a single drug, or may be separate drugs administered concurrently, serially, or separately. Other embodiments are as indicated above for selection of second treatment methods, methods of identifying polymorphic variants, and methods of treatment as described for aspects above. The frequency of administration is generally selected to achieve a pharmacologically effective average or peak serum level without excessive deleterious effects. In some embodiments, the serum level of the drug is maintained within a therapeutic window of concentrations for the greatest percentage of time possible without such deleterious effects as would cause a prudent physician to reduce the frequency of administration for a particular dosage level. Administration of a particular treatment, for example, administration of a therapeutic compound or combination of compounds, is chosen depending on the disease or condition which is to be treated. In some embodiments, the disease or condition is one for which administration of a treatment is expected to provide a therapeutic benefit. In embodiments involving selection of a patient for a treatment, selection of a method or mode of administration of a treatment, and selection of a patient for a treatment or a method of treatment, the selection can be positive selection or negative selection. The methods can include modifying or eliminating a treatment for a patient, modifying or eliminating a method or mode of administration of a treatment to a patient, or modification or elimination of a patient for a treatment or method of treatment. A patient can be selected for a method of administration of a treatment, by detecting the presence or absence of at least one polymorphic variant in a gene as identified herein, where the presence or absence of the at least one polymorphic variant is indicative that the treatment or method of administration will be effective in the patient. If the at least one polymorphic variant is present in the patient's cells, then the patient is selected for administration of the treatment.
[0240] Dosage
[0241] The term "drug" or "therapeutic agent" as used herein refers to a chemical entity or biological product, or combination of chemical entities or biological products, administered to a person to treat, or prevent or control a disease or condition. In some embodiments, the chemical entity or biological product is a low molecular weight compound. A "low molecular weight compound" has a molecular weight<5,000 Da, <2500 Da, <1000 Da, or <700 Da. In some embodiments, the chemical entity is a larger compound, for example, an oligomer of nucleic acids, amino acids, or carbohydrates including without limitation proteins, oligonucleotides, ribozymes, DNAzymes, glycoproteins, lipoproteins, and modifications and combinations thereof. In some embodiments, the biological product is a monoclonal or polyclonal antibody or fragment thereof such as a variable chain fragment cells; or an agent or product arising from recombinant technology, such as, without limitation, a recombinant protein, recombinant vaccine, or DNA construct developed for therapeutic use. The term "drug" or "therapeutic agent" can include, without limitation, compounds that are approved for sale as pharmaceutical products by government regulatory agencies such as the U.S. Food and Drug Administration (USFDA or FDA), the European Medicines Evaluation Agency (EMEA), and a world regulatory body governing the Internation Conference of Harmonization (ICH) rules and guidelines, compounds that do not require approval by government regulatory agencies, food additives or supplements including compounds commonly characterized as vitamins, natural products, and completely or incompletely characterized mixtures of chemical entities including natural compounds or purified or partially purified natural products. In some embodiments, the drug is approved by a government agency for treatment of a specific disease or condition. The term "drug" as used herein is synonymous with the terms "agent," "therapeutic agent," "compound," "therapeutic compound," "composition," "therapeutic composition," "medicine," "pharmaceutical product," or "product."
[0242] In treating a patient exhibiting a disorder of interest, a therapeutically effective amount of a agent or agents is administered. A therapeutically effective dose refers to that amount of the compound that results in amelioration of one or more symptoms or a prolongation of survival in a patient. The amount or dose of the therapeutic compound administered should be sufficient to affect a therapeutic response in the subject or animal over a reasonable time frame. For example, in the case of cancer, the dose of the therapeutic compound should be sufficient to inhibit metastasis, prevent metastasis, treat or prevent cancer in a period of from about 2 hours or longer, e.g., 12 to 24 or more hours, from the time of administration. In certain embodiments, the time period could be even longer. The dose can be determined by the efficacy of the particular therapeutic agent and the condition of the subject, as well as the body weight of the subject to be treated. Many assays for determining an administered dose are known in the art.
[0243] The dose of the therapeutic compound can also be determined by the existence, nature and extent of any adverse side effects that might accompany the administration of a particular therapeutic compound. The attending physician can decide the dosage of the inhibitor relevant to the invention with which to treat each individual patient using the correlation between polymorphic variant and disease and/or drug efficacies provided by the invention and taking into consideration a variety of factors, such as age, body weight, general health, diet, sex, inhibitor to be administered, route of administration, and the severity of the condition being treated. In some embodiments, the dose of the therapeutic compound is about 0.001 to about 1000 mg/kg body weight of the subject being treated/day, from about 0.01 to about 10 mg/kg body weight/day, about 0.01 mg to about 1 mg/kg body weight/day.
[0244] Toxicity and therapeutic efficacy of therapeutic agents can be determined by standard pharmaceutical procedures in cell cultures or experimental animals, for example, for determining the LD50 and the ED50. The dose ratio between toxic and therapeutic effects is the therapeutic index and it can be expressed as the ratio LD50/ED50. In some embodiments, compounds that exhibit large therapeutic indices are used. The data obtained from these cell culture assays and animal studies can be used in formulating a range of dosage for use in humans. The dosage of such compounds can lie within a range of circulating concentrations that can include the ED50 with little or no toxicity. The dosage can vary within this range depending upon the dosage form and route of administration utilized. The therapeutically effective dose can be estimated initially from cell culture assays. For example, a dose can be formulated in animal models to achieve a circulating plasma concentration range that includes the IC50 as determined in cell culture. Such information can be used to more accurately determine useful doses in humans. Levels in plasma may be measured, for example, by HPLC.
[0245] In connection with the administration of a drug, a drug which is "effective against" a disease or condition indicates that administration in a clinically appropriate manner results in a beneficial effect for at least a statistically significant fraction of patients, such as a improvement of symptoms, a cure, a reduction in disease load, reduction in tumor mass or cell numbers, extension of life, improvement in quality of life, or other effect generally recognized as positive by those of skill in the art.
[0246] In some embodiments, dosage is in respect to B vitamins administered as part of a therapy for a pregnancy-related complication. The following are dietary reference intakes (DRIs, per diem) for exemplary B vitamins. While in some embodiments, a subject is administered a dose about equal to that of DRI, generally the subject is administered one or more vitamin in doses greater than that of the DRI. Vitamin B2 (riboflavin), DRI of 1.1 milligrams; Vitamin B6 (pyridoxine), DRI of 1.3 milligrams; Vitamin B9 (folic acid, folate, pteroylglutamic acid), DRI of 400 micrograms; and Vitamin B12 (cyano-cobalamin) DRI of 2.4 micrograms. Analogous, pro-drug, salts, and bioactive equivalents of these vitamins can also be employed. For example other folate-related compounds include folinic acid (5-formyl-tetrahydropteroylglutamate), and other B12-related compounds include methylcobalamin, hydroxycobalamin, and adenosylcobalamin (5'-deoxyadenosylcobalamin, dibencozide).
[0247] Kits
[0248] The invention includes kits for the detection of polymorphic variants associated with disease states, conditions or complications. The kits can comprise a polynucleotide of at least 30 contiguous nucleotides of one of the variants described herein. In one embodiment, the polynucleotide contains at least one polymorphism of the invention. Alternatively, the 3' end of the polynucleotide is immediately 5' to a polymorphic site, preferably a polymorphic site of the invention. In one embodiment, the polymorphic site contains a genetic variant. In still another embodiment, the genetic variant is located at the 3' end of the polynucleotide. In yet another embodiment, the polynucleotide of the kit contains a detectable label. Suitable labels include, but are not limited to, radioactive labels, such as radionuclides, fluorophores or fluorochromes, peptides, enzymes, antigens, antibodies, vitamins or steroids. The kit may also contain additional materials for detection of the polymorphisms. A kit can contain one or more of the following: buffer solutions, enzymes, nucleotide triphosphates, and other reagents and materials useful for the detection of genetic polymorphisms. Kits can contain instructions for conducting analyses of samples for the presence of polymorphisms and for interpreting the results obtained.
[0249] In some embodiments, the kit contains one or more pairs of allele-specific oligonucleotides hybridizing to different forms of a polymorphism. In some embodiments, the kit contains at least one probe or at least one primer or both corresponding to a gene or genes relevant to the invention. The kit can be adapted and configured to be suitable for identification of the presence or absence of one or more polymorphic variants. The kit can contain a plurality of either or both of such probes and/or primers, for example, 2, 3, 4, 5, 6, or more of such probes and/or primers. The plurality of probes and/or primers are adapted to provide detection of a plurality of different sequence polymorphic variants in a gene or plurality of genes, for example, in 2, 3, 4, 5, or more genes or to sequence a nucleic acid sequence including at least one polymorphic variant site in a gene or genes. In some embodiments, the kit contains components for detection of a plurality of polymorphic variants indicative of the effectiveness of a treatment or treatment against a plurality of diseases. Additional kit components can include one or more of the following: a buffer or buffers, such as amplification buffers and hybridization buffers, which may be in liquid or dry form, a DNA polymerase, such as a polymerase suitable for carrying out PCR, and deoxy nucleotide triphosphases (dNTPs). Preferably a probe includes a detectable label, for example, a fluorescent label, enzyme label, light scattering label, or other label. Additional components of the kit can also include restriction enzymes, reverse-transcriptase or polymerase, the substrate nucleoside triphosphates, means used to label, for example, an avidin-enzyme conjugate and enzyme substrate and chromogen if the label is biotin, and the appropriate buffers for reverse transcription, PCR, or hybridization reactions.
[0250] In some kits, the allele-specific oligonucleotides are provided immobilized to a substrate. For example, the same substrate can comprise allele-specific oligonucleotide probes for detecting any or all of the polymorphism variants described herein. Accordingly, the kit may comprise an array including a nucleic acid array and/or a polypeptide array. The array can include a plurality of different antibodies, a plurality of different nucleic acid sequences. Sites in the array can allow capture and/or detection of nucleic acid sequences or gene products corresponding to different polymorphic variants in one or more different genes. The array can be arranged to provide polymorphic variant detection for a plurality of polymorphic variants in one or more genes which correlate with the effectiveness of one or more treatments of one or more diseases.
[0251] The kit also can contain instructions for carrying out the methods. In some embodiments, the instructions include a listing of the polymorphic variants correlating with a particular treatment or treatments for a disease of diseases. The kit components can be selected to allow detection of a polymorphic variant described herein, and/or detection of a polymorphic variant indicative of a treatment, for example, administration of a drug.
[0252] The following examples further illustrate the invention but, of course, should not be construed as in any way limiting its scope.
Example 1
[0253] Candidate polymorphic variant sites in folate/homocysteine-related genes were investigated for a potential maternal association with increased risk of developing clinically severe abruptio placentae during pregnancy. [Parle-McDermott et al., Am. J. Med. Genetics, 132A:365-368 (2005).] The polymorphic variants tested included MTHFD1 1958G>A (R653Q), which had not been tested previously in relation to abruptio placentae, and the MTHFR polymorphisms 677C>T (A222V) and 1298A>C (E429A).
[0254] Blood samples were obtained from 56,049 pregnant women attending the three main maternity hospitals in Dublin between 1986 and 1990. Samples were taken on the first visit to the clinic and the gestational ages ranged between 15 and 17 weeks. Approximately 90% of births in the Dublin area are delivered in these hospitals as previously described [Kirke et al., Obstget. Gynecol., 89:221-226 (1993)]. In a global context, the Irish can be described as a Caucasian Northern European population [Cavalli-Sforza, Princeton, N.J.: Princeton University Press (1993)]. Morever, the low level of immigration into Ireland during the last century, means that population stratification is unlikely to confound genetic analyses. Pregnancies affected by severe abruptio placentae (n=62) and control pregnancies (n=184) were identified from hospital records in two of the hospitals. The diagnosis of severe abruptio placentae was based on having a retroplacental clot and/or accidental hemorrhage with associated clinical signs of abruption and/or a statement in the case records that the patient was a definite case of abruptio placentae. Data on gestational age at delivery, maternal hypertension, maternal blood transfusion, and pregnancy outcome were collected on all cases. Control pregnancies were selected from women with no history of abruptio placentae, and were matched for the same date and clinic as the cases where the blood sample was provided. Ethical approval was obtained for all samples collected and samples were anonymised prior to genotyping.
[0255] Genomic DNA was extracted using QIAamp DNA Blood Mini Kit (Qiagen, UK). Analysis of the MTHFD1 1958G>A (R653Q) polymorphism was performed by PCR-RFLP (restriction fragment length polymorphism) as detailed previously [Brody et al., Am. J. Hum. Genet., 71:1207-1215 (2002)]. Analysis of the MTHFR 677C>T (A222V) polymorphism was performed by PCR-RFLP using Hinf I as previously described [Frosst et al., Nature Genet., 10:111:113 (1995)]. The MTHFR 1298A>C (E429A) polymorphism was PCR amplified as described in van der Put et al., Am. J. Hum. Genet., 62:1044-1051 (1998) and genotyping was carried out via ASO (allele specific oligonucleotide) analysis as described previously [Parle-McDermott et al., J. Hum. Genet., 48:190-193 (2003)].
[0256] Allele and genotype frequencies were compared between cases and controls using statistical software (SAS PROC NLMIXED). The odds ratios were calculated using a log linear model by the delta method [Agresti, N.Y.: John Wiley & Sons (1990)] and statistical significance was assessed via the chi-square test. Likelihood ratios (G2) were used to assess goodness of fit of different models i.e., G2 provides a measure of the reliability of the odds ratio (small G2 P-values indicate a poor fit to the model being tested).
[0257] Combined MTHFR genotypes were analyzed by estimating (maximum likelihood estimation) the gamete frequencies in cases and controls using a model of the four combinations of alleles as described by Weir, Genetic Data Analysis II, Sunderland, Mass.: Sinauer (1996). A gene-gene interactive effect of MTHFR 677C>T (A222V) with MTHFD1 1958G>A (R653Q) or MTHFR 1298A>C (E429A) was tested using a series of non-hierarchical logistic models [Piegorsch et al., Stat. Med., 13:153-162 (1994)] to estimate interactive dominant and recessive effects.
[0258] The case group (n=62) consisted of mothers whose pregnancies were affected by clinically severe abruptio placentae. As expected in severe cases of abruption placentae, there was considerable co-morbidity. Intrauterine fetal death occurred in 31 of 62 (50%) cases. Blood transfusion was required in 26 of 62 (42%) cases. Maternal antepartum hypertension pre-abruptio was present in 17 of 62 (27%). Preterm delivery (<37 weeks gestational age) occurred in 29 of 62 (47%). Genotyping of the abruptio placentae cases and controls was successful in 100% (246/246) of subjects for MTHFD1 1958G>A (R653Q), 99.2% (244/246) for MTHFR 677C>T (A222V) and 100% (246/246) for MTHFR 1298A>C (E429A). The allele and genotype frequencies and comparisons for each polymorphism are shown in Table I. Although several models were tested for each polymorphism, only the best fitting model (largest goodness of fit (G2) P-value) is shown in Table I.
[0259] The `Q` allele of the MTHFD1 1958G>A (R653Q) polymorphism was more common in severe abruptio placentae cases than in controls due to an increase in `QQ` homozygotes among cases (Table I). Thus, pregnant women who are homozygous for the `Q` allele (`QQ`) have a greater risk of developing severe abruptio placentae during their pregnancy (odds ratio 2.85 (1.47-5.53), P=0.002) compared to women who are heterozygous (`RQ`) or homozygous wildtype (`RR`). Among women with severe abruptio placentae, those who were `QQ` homozygous were not significantly more likely than women who were `RR` homozygous wildtype or heterozygous to have hypertension, pre-term deliveries, or to require transfusions. However, an effect may have been missed due to the small number of individuals within each subgroup. The allele frequencies in the controls are similar to those previously reported in the Dutch [Hol et al., Clin. Genet., 53:119-125 (1998)] and Turkish [Akar et al., Acta. Haematol., 102:199-200 (1999)] populations and in previously published Irish control population [Brody et al., Am. J. Hum. Genet., 71:1207-1215 (2002)].
TABLE-US-00004 TABLE I Comparison of MTHFD1 1958G > A (R653Q), MTHFR 677C > T and MTHFR 1298A > C Polymorphisms in placental abruption. Genotypes Alleles MTHFD1 R653Q `RR` `RQ` `QQ` `R` `Q` Abruptio Placentae .sup. 18 (.29)1 23 (.37) 21 (.34) 59 (.48) 65 (.52) Controls 60 (.33) 96 (.52) 28 (.15) 216 (.59) 152 (.41) `Q` vs. `R` Odds Ratio 1.57 (1.012-2.443), P = 0.0474 `QQ` vs. `RQ`/`RR` Odds Ratio 2.85 (1.47-5.53), P = 0.0025 MTHFR 677C > T CC CT TT C T Abruptio Placentae 26 (.42) 31 (.50) 5 (.08) 83 (.67) 41 (.33) Controls 80 (.44) 80 (.44) 22 (.12) 240 (.66) 124 (.34) T vs. C Odds Ratio 0.96 (0.63-1.44), P = 0.83 TT vs. CT/CC Odds Ratio 0.64 (0.23-1.76), P = 0.396 MTHFR 1298A > C AA AC CC A C Abruptio Placentae 25 (.40) 31 (.50) 6 (.10) 81 (.65) 43 (.35) Controls 91 (.49) 75 (.41) 18 (.10) 257 (.70) 111 (.30) C vs. A Odds Ratio 1.23 (0.81-1.86), P = 0.33 CC/AC vs. AA Odds Ratio 1.45 (0.81-2.60), P = 0.217 1Data in parentheses are allele or genotype frequencies; 2Lower limit of 95% Confidence Interval; 3Upper limit of 95% Confidence Interval; 4Assessed with use of chi-squared analysis; 5Goodness of fit statistic G2 P = 0.53; 6Goodness of fit statistic G2 P = 0.57; 7Goodness of fit statistic G2 P = 0.67.
[0260] Analysis identified the MTHFD1 G1958GA (R653Q) polymorphism as a genetic risk factor for having a pregnancy affected by severe abruptio placentae. Pregnant mothers who are `QQ` homozygous have almost a tripled risk of having this pregnancy complication.
[0261] Case-control comparisons of the MTHFR polymorphisms 677C>T (A222V) and 1298A>C (E429A) did not reveal significant differences between cases and controls (Table I). The association between MTHFR 677C>T and 1298A>C was also examined and in agreement with previous MTHFR data [Parle-McDermott et al., J. Hum. Genet., 48:190-193 (2003)], there was clear evidence of linkage disequilibrium between the two polymorphisms in both cases and controls. However, analysis of combined MTHFR genotypes showed similar frequencies in cases and controls, indicating that there is no interactive effect of these MTHFR polymorphisms on risk of abruptio placentae; this finding was confirmed by the non-hierarchical logistic model analysis. Therefore, the MTHFR 677C>T (A222V) and 1298A>C (E429A) polymorphisms are in linkage disequilibrium but do not show an association with severe abruptio placentae risk in this cohort when analyzed either independently or in combination.
[0262] Combined analysis of MTHFR 677C>T (A222V) with MTHFD1 1958G>A (R653Q) genotypes by the non-hierarchical logistic model analysis also did not show any significant effects and therefore, there does not appear to be an interactive effect of these two polymorphisms and risk of severe abruptio placentae. Analysis of the MTHFR polymorphisms 677C>T (A222V) and 1298A>C (E429A) in the largest group of clinically defined severe abruptio placentae patients to date (n=62) and controls (n=182) does not support their role as genetic risk factors.
[0263] Pregnant women who are homozygous for the MTHFD1 R653Q polymorphism i.e., `QQ`, are almost three times more likely to develop severe abruptio placentae than pregnant women who are either heterozygous (`RQ`) or homozygous wildtype (`RR`) (odds ratio 2.85 (1.47-5.53), P=0.002). The possibility of fetal DNA differentially affecting the results due to fetal-maternal transfusion can be ruled out as all the blood samples were taken between 15-17 weeks gestation, prior to the diagnosis of placental abruption. Moreover, the genetic consequences of this possibility would be to increase the apparent number of heterozygotes in affected mothers.
[0264] Without being held to any particular theory of mechanism, the following theories have been contemplated. The effect of the MTHFD1 R653Q polymorphism appears to act through the `QQ` homozygous genotype. Even if the MTHFD1 R653Q polymorphism does not have a direct effect on folate and homocysteine levels, this polymorphism may alter nucleotide pools available for DNA synthesis and thus affect cell division. The MTHFD1 653 `Q` allele, which resides in the synthetase domain of the trifunctional enzyme, may be less efficient at DNA synthesis particularly when folate status is low. This lower efficiency may produce effects at the cellular level without causing major perturbations in plasma metabolites. Alternatively, this polymorphism may be in linkage disequilibrium with an unknown variant that alters enzyme activity.
Example 2
[0265] This study investigated whether the MTHFD1 1958G>A, MTHFR 677C>T, or TCNII C667G polymorphism influences the maternal genetic risk of second trimester pregnancy loss. Cases and controls were derived from a bank of blood samples from 56,049 pregnant women drawn during their first clinical visit at the three main Dublin maternity hospitals between 1986 and 1990. These hospitals deliver approximately 90% of births within the Dublin area as previously described [Kirke, et al., Q. J. Med., 86:703-708 (1993)]. This bank of samples is representative of a homogeneous population and due to the low level of immigration into Ireland during the collection period population stratification is unlikely to confound the performed genetic analyses. Women with a history of at least one unexplained second trimester pregnancy loss (n=125), during a previous pregnancy were identified retrospectively from the computerised records of the Coombe Women's Hospital. Individual chart reviews were then performed to confirm the details of each case. Cases were women with a previous history of spontaneous abortion or in utero fetal demise occurring spontaneously between 13 and 26 weeks gestation. Women in whom a clinical explanation for the spontaneous abortion or fetal death was apparent were excluded. Thus, women with incompetent cervix, preterm premature rupture of membranes, preterm labor, placental abruption, maternal medical disease, or fetal malformations were not included. The control group (n=625) consisted of a systematic random sample of women from the same bank. Data on parity and maternal age when the blood sample was collected was available for all cases except one and for 118/625 of the controls. Personal identifiers were removed from all samples prior to genetic testing. Appropriate ethical approval was obtained for all samples collected.
[0266] Genomic DNA was extracted from cases and controls using the QIAamp DNA Blood Mini Kit from Qiagen, UK. Genotyping of the MTHFR 677C>T and MTHFD1 1958G>A polymorphisms was performed using PCR-RFLP (Restriction Fragment Length Polymorphism) using Hinf I and Msp I respectively as previously described [Frosst et al., Nature Genet., 10:111-113 (1995); Hol et al., Clin. Genet., 53:119-125 (1998); Brody et al., Am. J. Hum. Genet., 71:1207-1215 (2002)]. The TCNII 776C>G polymorphism was genotyped using an allele-specific primer extension assay and scored by matrix-assisted laser desorption/ionization-time of flight (MALDI-TOF) mass spectrometry (Sequenom, San Diego). Appropriate controls were included in all assays and genotyping consistency was tested by analyzing between 10-15% of samples in duplicate, resulting in 100% agreement.
[0267] The PCR conditions used for the experiment are set out in part in Table II. The reactions were set up on ice, and ran with thermocycle program "MTHFDRQ" using three GeneAmp PCR 9700 machines. When the temperature was approx. 90° C. the machine was paused and the tray was placed inside machine and the program was allowed to run. The program "MTHFDRQ" comprises the following parameters: 95° C. 3 mins, (94° C. 30 secs, 58° C. 1 min, 72° C. 1 min)×35 cycles, 72° C. 10 mins, Hold at 15° C. The primers used were R653Q Forward Primer 5' cactccagtgtttgtccatg 3' (SEQ ID NO: 19) and R653Q Reverse Primer 5' gcatcttgagagccctgac 3' (SEQ ID NO: 20). The primer stocks were diluted as follows: 1/25 60 μl+1,440 μl water for the forward primer and 1/23 65 μl+1,435 μl water for the reverse primer.
TABLE-US-00005 TABLE II PCR Reactants Reagent 100 Reactions Per Reaction 10 x PCR BUFFER 500 μl 5 25 mM MgCl2 300 μl 3 2.5 mM dNTPs 400 μl 4 F primer 1/25 (10 pmol/μl) 250 μl 2.5 R primer 1/23 (10 pmol/μl) 250 μl 2.5 Taq (5 U/μl, Sigma) 20 μl 0.2 H2O* 3180 μl 31.8 DNA* 1 μl + 49 μl Mix 1 μl *Adjust H2O volume depending on how much DNA is added.
[0268] The PCR products were digested with restriction enzyme MspI as indicated in Table III. Digestions took place in 37° C. waterbath for at least 3 hours, and can also be left overnight.
TABLE-US-00006 TABLE III PCR Product Digest Parameters Reagent 100 Digests Per Digest Msp I (20 U/μl) 100 μl 1 μl NEB2 Buffer 300 μl 3 μl H2O 1,100 μl 11 μl PCR product 15 μl + 15 μl Mix 15 μl
[0269] The products of the digest were with mixed with Orange G loading dye and loaded all on 1.5% agarose gel (use centipede with large combs: half a tray per gel) and allowed to run until orange G is just at the bottom of the gel. The bottom half of the gel can be stained in an ethidium bromide bath. The uncut product should be approximately 330 bp. For an AA genotype, the digest products should be approximately 267 bp and 71 base pairs. For a GG genotype, the digest product should be approximately 196 bp, 71 bp, and 55 bp. For an AG genotype, the digest product should be approximately 267 bp, 196 bp, 71 bp, and 55 bp.
[0270] The association between case-control status and genotype was examined using a number of standard odds ratios. In order to have a common approach for all analyses, a log linear model was employed. The statistical software (SAS PROC NLMIXED) allows estimation of nonlinear functions of the parameters of the model, and provides standard errors calculated using the delta method [Agresti, Categorical Data Analysis (1990)]. The parameterization of the model can easily be modified for the computation of different odds ratios. This approach enabled estimation of log odds ratios and their standard errors for the computation of confidence intervals, as well as checking the goodness of fit of different models. Potential gene-gene interaction effects were also examined. Tests of interactive dominant or recessive effects of specific combined genotypes were performed using a series of non-hierarchical logistic regression models [Piegorsch et al., Stat. Med. 13:153-162 (1994)]. Statistical significance was assessed using likelihood ratio chi-square tests.
[0271] The majority of cases (116/125) had experienced just one second trimester pregnancy loss. The remaining cases experienced two (n=7) or three (n=2) second trimester pregnancy losses. The average age of the study cases was 30+/-5.23 and controls were 26.3+/-5.09 (data on just 118/625 controls). Among the case group 12% of women had a parity of 0 and 88% had a parity of 1. Among the control group where data was available, 43% had a parity of 0 and 57% had a parity of 1.
[0272] Three polymorphisms were genotyped in the second trimester pregnancy loss case (n=125) and control (n=625) groups with 98.9% of all subjects successfully genotyped for MTHFD1 1958G>A, 98.4% for MTHFR 677C>T and 97.8% for TCNII 776C>G. Comparison of allele and genotype frequencies between cases and controls is shown in Table IV.
TABLE-US-00007 TABLE IV COMPARISON OF MTHFD1 1958G > A, MTHFR 677C > T AND TCNII 776C > G POLYMORPHISMS IN MOTHERS WITH A HISTORY OF SECOND TRIMESTER PREGNANCY LOSS AND CONTROL MOTHERS. Genotypes Alleles MTHFD1 1958G > A GG AG AA G A Case Mothers .sup. 32 (.26)1 58 (.47) 33 (.27) 122 (.50) 124 (.50) Control Mothers 173 (.28) 333 (.54) 113 (.18) 679 (.55) 559 (.45) A vs. G Odds Ratio 1.23 (95% CI 0.93-1.63), P = 0.142 AA vs. AG/GG Odds Ratio 1.64 (95% CI 1.05-2.57), P = 0.033 MTHFR 677C > T CC CT TT C T Case Mothers 55 (.44) 55 (.44) 14 (.11) 165 (.67) 83 (.33) Control Mothers 271 (.44) 270 (.44) 73 (.12) 812 (.66) 416 (.34) T vs. C Odds Ratio 0.98 (95% CI 0.73-1.31), P = 0.90 TT vs. CT/CC Odds Ratio 0.94 (95% CI 0.51-1.73), P = 0.854 TCNII 776C > G CC CG GG C G Case Mothers 33 (.27) 61 (.50) 29 (.24) 127 (.52) 119 (.48) Control Mothers 184 (.30) 306 (.50) 121 (.20) 674 (.55) 548 (.45) C vs. G Odds Ratio 1.15 (95% CI 0.88-1.52), P = 0.31 GG vs. CC/CG Odds Ratio 1.25 (95% CI 0.79-1.98), P = 0.345 1Data in parentheses are allele or genotype frequencies; 2Chi-squared analysis; 3Goodness of fit statistic G2 P = 0.80; 4Goodness of fit statistic G2 P = 0.99; 5Goodness of fit statistic G2 P = 0.65
[0273] The MTHFD1 1958AA genotype is clearly enriched in the second trimester pregnancy loss case group compared to controls. MTHFD1 1958AA women have a significantly increased risk of having an unexplained second trimester pregnancy loss than women who are MTHFD1 1958AG or 1958GG (odds ratio 1.64 (1.05-2.57) P=0.03). The control group shows deviation from Hardy-Weinberg equilibrium with slightly more MTHFD1 1958AG heterozygotes than expected (P=0.03). Published frequencies from other populations including Dutch (Hol et al., Clin Genet. 53, 119-125 (1998)), Turkish (Akar et al., Thromb. Res., 102:115-120 (2001)), Italian (De Marco et al., Annual Meeting of the Society for Research into Hydrocephalus and Spina Bifida, Dublin, 23-26 (2004)) and Mexican (Shi et al., Birth Defects Res. Part A Clin. Mol. Teratol., 67:545-549 (2003)) are also skewed toward heterozygote excess although these deviations from Hardy-Weinberg equilibrium are not statistically significant.
[0274] Increased frequencies of the TCNII 776G allele (48% vs 45%) and the 776GG genotype (24% vs 20%) were observed in cases compared to controls (Table IV). Although this difference was not statistically significant, the TCNII 776C>G polymorphism cannot be completely ruled out as a risk factor for second trimester loss. Comparison of the allele and genotype frequencies of the MTHFR 677C>T polymorphism showed no difference between cases and controls. Thus, the MTHFR 677C>T polymorphism does not appear to be a significant risk factor for unexplained second trimester pregnancy loss in the Irish population.
[0275] Data was also examined for the possibility of combined genetic factors having an additive effect on risk of second trimester loss. The following genotype combinations for the possibility of an interactive effect: MTHFD1 1958AA and MTHFR 677TT (OR 1.25, P=0.75), MTHFD1 1958AA and TCNII 776GG (OR 1.20, P=0.75) or 776CG/GG (OR 1.16, P=0.77), MTHFR 677TT and TCNII 776GG (OR 0.81, P=0.78) or 776CG/GG (OR 0.70, P=0.59). The results of these analyses show no significant genotype interactive effects on the risk of second trimester pregnancy loss.
[0276] While there has been some evidence to support a role of the MTHFR 677C>T polymorphism as both a maternal and fetal genetic risk factor for early pregnancy loss [Reviewed in Zetterberg et al., Reprod. Biol. Endocrinol., 2:7 (2004)], analysis of the MTHFR 677C>T polymorphism in the second trimester cohort showed no evidence of an association.
[0277] The results indicate that the maternal TCNII 776C>G genotype does not independently contribute to risk of second trimester pregnancy loss. Although the 776GG genotype showed an increased frequency in the second trimester case group compared to controls (24% vs 20%), this result was not statistically significant.
[0278] Although the variants in MTHFR and TCNII were not found to be independent maternal risk factors, each may contribute to second trimester loss in combination with some other factor. For example, an interactive effect between TCNII 776CG or 776GG and MTHFR 677TT on early fetal loss has been reported [Zetterberg et al., Hum. Reprod., 18:1948-1950 (2003)]. While an odds ratio comparison showed that these genotype combinations were significantly higher in spontaneously aborted fetuses, a statistical method that differentiates between independent and interactive effects would have tested more effectively whether these polymorphisms act synergistically. Logistic regression analysis was applied to this data [reconstructed from Zetterberg et al., Eur. J. Hum. Genet., 10:113-118 (2002), Zetterberg et al., Hum. Reprod., 17:3033-3036 (2002), and Zetterberg et al., Hum. Reprod., 18:1948-1950 (2003)]. This method confirmed that each polymorphism acts as an independent risk factor (TCNII 776CG or 776GG, P=0.0006; MTHFR 677TT P=0.033) but the interaction between MTHFR 677TT and TCNII 776CG/GG was not significant (P=0.77). Similarly, no evidence was found in the analysis of second trimester pregnancy loss cases and controls for interactive effects between the MTHFR 677TT and TCNII 776GG (P=0.78) or TCNII 776CG/GG (P=0.59).
[0279] The study did consider maternal age and the mean age among cases was 30 years, well under the threshold (35+ years) at which substantially increased complications related to maternal age are expected [Cunningham and Leveno, N. Engl. J. Med., 333:1002-1004 (1995)]. All losses with fetal malformations were excluded. Even if miscarriages due to unrecognized NTDs were present in the study, such miscarriages were unlikely to have had a significant impact on the analyses as the rate of NTD associated pregnancy losses is 1/50.
[0280] This experimental study has identified the MTHFD1 1958AA genotype as an independent maternal risk factor for unexplained pregnancy loss during the second trimester of pregnancy. Analyses of the MTHFR 677C>T and TCNII 776C>G polymorphisms did not indicate that these variants either independently or in combination had any significant affect on risk of pregnancy loss.
Example 3
[0281] The second trimester study of Example 2 is repeated, but also gathering data for maternal risk factors such as tobacco or alcohol use, which contribute to fetal loss. In addition, prenatal diagnosis and routine ultrasound can be performed. Genetic testing as described above for MTHFD1, MTHFD1L, etc. is carried out. These genetic test results can be combined with the risk associated with alcohol or tabacco use. The resulting risk estimate provides greater accuracy than those based on genetic testing or environmental exposure measurements alone.
Example 4
[0282] The second trimester study of Example 2 is repeated, but with further testing for inherited or acquired thrombophilia. This testing involves testing for the polymorphic variants described herein in respect to F2 and F5. By testing for multiple risk factors, one can achieve greater predictive value.
Example 5
[0283] The results in Example 2 showed significantly more 1958AG heterozygotes in the general population than expected and the apparent selection against transmission of the 1958A allele in the earlier MTHFD1 NTD study suggested that the 1958G>A polymorphism in the fetus may also have a role in fetal loss. The second trimester study is repeated with the spontaneously aborted embryos/fetuses tested for the MTHFD1 1958G>A polymorphism with the tentative prediction that more than expected would carry the 1958AA genotype.
Example 6
[0284] The following study revealed a correlation between neural tube defects and a particular variant of the rs3832406 polymorphism of MTHFD1L is predictative of an increased susceptibility for a having a child with a neural tube defect.
[0285] The study group consisted of NTD-affected children plus their parents (triads) who were recruited throughout Ireland from 1993 to date with the assistance of various branches of the Irish Association for Spina Bifida and Hydrocephalus. The NTD population comprised 387 NTD cases, 349 fathers of NTD cases and 386 mothers of NTD cases. The control population (n=280) was obtained from between 1986 and 1990 from 56,049 pregnant women attending the three main maternity hospitals in the Dublin area. Details of this collection have been described previously [Kirke, et al., Q. J. Med., 86:703-708 (1993)]. Informed consent and ethical approval were obtained for all samples collected.
[0286] Extraction of genomic DNA was carried out using the QIAamp DNA Blood Mini Kit, Qiagen, UK. Genotyping of the MTHFD1L intron 7 deletion insertion polymorphism, rs3832406, was carried out under the conditions outlined below. The sequences for the PCR primers were as follows: MTHFD1L.F 5'* TTCTCTTTCTTAGCCCCACG 3' (SEQ ID NO: 21) and MTHFD1L.R 5' AGAGCTTGCAGTGAGCCTAGA 3' (SEQ ID NO: 22)*6-FAM (BLUE) LABEL. An ABI GeneAmp PCR system 9700 was used for the thermocycling using the following program conditions: 94° C. 3 mins, (94° C. 30 secs, 60° C. 30 secs, 72° C. 30 secs)×35 cycles, 72° C. 5 mins. PCR reactant parameters are provided in Table V.
TABLE-US-00008 TABLE V MTHFD1L PCR REACTANTS Reagent 100 Reactions Per Reaction 10 x PCR BUFFER 250 μl 2.5 25 mM MgCl2 150 μl 1.5 2.5 mM dNTPs 200 μl 2 F primer 1/90 (10 pmol/μl) 200 μl 2 R primer 1/20 (10 pmol/μl) 50 μl 0.5 Taq (5 U/μl, Sigma) 10 μl 0.1 H2O* 1390 μl 13.9 DNA* 2.5 μl + 22.5 μl Mix
[0287] PCR products were resolved on a 6% denaturing polyacrylamide gel on an ABI 377 DNA sequencer and sized using the Genescan software. Genotypes were analysed using the Genotyper software. Analysis of the transmission of alleles from parents to affected NTD case was performed using an extended transmission disequilibrium test as described by Sham and Curtis, Ann. Hum. Genet., 59(Pt 3):323-36 (1995), using the ETDT software. Allele and genotype frequencies were compared between NTD groups and controls and statistical significance was assessed by chi-squared analysis. The allele and genotype frequencies for MTHFD1L intron 7 deletion insertion polymorphism, rs3832406, are shown in Table VI. The alleles are represented by the following numbers: Allele 1=7 x ATT; Allele 2=8 x ATT; Allele 3=9 x ATT.
TABLE-US-00009 TABLE VI Allele and Genotype Frequencies in NTD Groups and Controls Cases Fathers Mothers Controls Genotypes 1-1 184 (.49) 134 (.39) 164 (.44) 107 (.39) 1-2 74 (.20) 91 (.27) 84 (.23) 75 (.28) 1-3 69 (.19) 66 (.19) 75 (.20) 58 (.21) 2-2 10 (.03) 14 (.04) 10 (.03) 14 (.05) 2-3 23 (.06) 16 (.05) 22 (.06) 10 (.04) 3-3 12 (.03) 19 (.06) 18 (.05) 8 (.03) Total 387 (96.1%) 349 (97.4%) 386 (96.6%) 280 (97.1%) H-W (2df) P = 0.044 P = 0.004 P = 0.080 P = 0.102 Alleles 1 511 (.69) 425 (.63) 487 (.65) 347 (.64) 2 117 (.16) 135 (.20) 126 (.17) 113 (.21) 3 116 (.16) 120 (.18) 133 (.18) 84 (.15)
[0288] Comparison of cases to controls showed that the "1-1" genotype appears to be associated with increased risk of an NTD. In contrast, the "2" allele appears to be protective. The ETDT test confirmed the case versus control comparisons and showed over transmission of the "1" allele from parents to affected offspring, while the "2" allele showed under transmission. A summary of this analysis is shown in Table VII.
TABLE-US-00010 TABLE VII Case Vs Control Comparisons Allele 1 Vs 2/3; OR 0.80 (0.64-1.01) P = 0.066 2 Vs 1/3; OR 1.41 (1.06-1.87) P = 0.020 3 Vs 1/2; OR 0.99 (0.73-1.34) P = 0.941 Genotypes 1-1 Vs the Rest; OR 1.51 (1.10-2.07) P = 0.011 2-2/1-2/2-3 Vs the Rest; OR 0.71 (0.51-0.99) P = 0.041 3-3 Vs the Rest; OR 1.10 (0.44-2.73) P = 0.837 If ignore Allele 3: 1-1 Vs 1-2/2-2 OR 1.83 (1.25-2.68) P = 0.002 1-1 genotype = Risk 1-2 or 2-2 = Protective Other genotypes = no effect
[0289] Logistic Regression TDT was performed using extended transmission disequilibrium test-Sham and Curtis 1995 Software ETDT, supra, results were as follows: Chi-squared for allele-wise TDT=2*(L1-L0)=9.496, 2df, P=0.0087. Chi-squared for genotype-wise TDT 2*(L2-L0)=10.887, 3df, P=0.0124. Chi-squared for goodness of fit of allele-wise model=2*(L2-L1)=1.391, 1df, P=0.238. L0=Log likelihood that there is a probability of equal transmission, i.e., null hypothesis. L1=Alternative hypothesis that transmission probabilities are determined in an allele specific way. L2=Transmission probabilities may be independent for each genotype, that is, alleles are transmitted in a genotype specific fashion.
[0290] A summary of transmissions from all heterozygous parents is provided in Table VIII; maternal and paternal results are displayed in Tables IX and X respectively.
TABLE-US-00011 TABLE VIII Summary of Transmissions from All Heterozygous Parents Allele 1 Allele 2 Allele 3 Passed 137 (58%) 63 (38%) 66 (51%) Not Passed: 100 (42%) 102 (62%) 64 (49%) Chi-Squared (1df): 5.776 9.218 0.031 P-values$: 0.0163 0.0024 0.8608 $these values can be corrected for multiple testing.
TABLE-US-00012 TABLE IX Maternal Transmissions only Allele 1 Allele 2 Allele 3 Passed 65 (61%) 28 (41%) 27 (42%) Not Passed: 41 (39%) 41 (59%) 38 (58%) Chi-Squared (1df): 5.434 2.449 1.862 P-values$: 0.0198 0.1176 0.1725 Chi-squared for allele-wise TDT = 2* (L1 - L0) = 5.489, 2 df, P = 0.064 Chi-squared for genotype-wise TDT 2* (L2 - L0) = 7.115, 3df, P = 0.068 Chi-squared for goodness of fit of allele-wise model = 2* (L2 - L1) = 1.627, 1df, P = 0.202 $these values can be corrected for multiple testing.
TABLE-US-00013 TABLE X Paternal Transmissions only Allele 1 Allele 2 Allele 3 Passed 63 (56%) 30 (35%) 35 (61%) Not Passed: 50 (44%) 56 (65%) 22 (39%) Chi-Squared (1df): 1.496 7.860 2.965 P-values$: 0.2214 0.0051 0.0852 Chi-squared for allele-wise TDT = 2* (L1 - L0) = 9.341, 2 df, P = 0.009 Chi-squared for genotype-wise TDT 2* (L2 - L0) = 9.404, 3df, P = 0.024 Chi-squared for goodness of fit of allele-wise model = 2* (L2 - L1) = 0.062, 1df, P = 0.802 $these values can be corrected for multiple testing.
[0291] The 1-1 genotype appears to be a risk for NTD cases. Preferential transmission of allele 1 is observed in the TDT. Having at least one copy of allele 2 appears to protect against NTDs i.e., 1-2, 2-2 or 2-3 genotypes. The TDT shows that allele 2 is transmitted significantly less than expected. Allele 3 appears to have no effect on risk of NTDs. The fathers and NTD cases are significantly out of Hardy-Weinberg equilibrium, presumably this situation is driven by the case genotypes.
Example 7
[0292] The hypothesis being tested in the following series of experiments is that polymorphism rs3832406 within the MTHFD1L gene affects the splicing efficiency of the alternative transcript and could ultimately impact on the level of mitochondrial 10-formyltetrahydrofolate synthase.
Confirmation of the Alternatively Spliced Transcript
[0293] Total RNA was extracted from transformed lymphoblast cell lines using Ultraspec® II (Biotex, Houston, USA). These cell lines were obtained from the Coriell Cell Repository, having been transformed by culturing primary lymphocytes with Epstein-Barr Virus (EBV). RNA from five cell lines was pooled, although pooling need not be carried out for this experiment. These five cell lines and their genotypes were 15083 (7ATT/7ATT), 17102 (7ATT/7ATT), 17133 (7ATT/8ATT), 17219 (7ATT/7ATT), and 17259 (7ATT/8ATT). Dnasel (Invitrogen) treated RNA (1 μg) was reverse transcribed using Superscript II (Invitrogen) as described by the manufacturer. PCR primers were designed to amplify both transcripts (Table XI), the 1.1 kb transcript only or the 3.6 kb transcript only (Table XI). The results of this experiment confirm the presence of both transcripts that are specific to the MTHFD1L gene.
TABLE-US-00014 TABLE XI Primer Sequence Details for RT-PCR Assays Primer Sequences mRNA PCR Temp. Forward (SEQ ID NO: 23): 1.1 kb and 56° C. CCATCGTCAGAGAAGTCATTCA 3.6 kb Reverse (SEQ ID NO: 24): CTGGTTGATTTCCTGCATCA Forward (SEQ ID NO: 25) 1.1 kb only 58° C. GGTCTTTGGAAGCTGCTCTACA Reverse (SEQ ID NO: 26): TTGCAGTGAGCCTAGATCACG Forward (SEQ ID NO: 27): 3.6 kb only 58° C. GATCACACCCACCCCTCTTG Reverse (SEQ ID NO: 28): CCTCCTTTCACTCCAAACGTC
Determination of mRNA Levels
[0294] Taqman assays are performed to examine the levels of MTHFD1L mRNA in relation to the rs3832406 polymorphism. Lymphoblast Coriell cell lines that are representative of rs3832406 genotypes have been identified. Total RNA is extracted and DnaseI treated as described above. Taqman® assays have been designed to distinguish the expression level of the long and short transcripts of MTHFD1L. A control assay that detects both transcripts is localized between Exons 1 and 2 of the MTHFD1L mRNA transcript (Applied Biosystems (ABI) assay ID Hs--00920574). A second assay detects the longer transcript only and is localized to Exons 19/20 (ABI assay ID Hs--003836161). A third assay has been custom designed by ABI and is localized to Exons 7/8A. These assays will be used to examine the relative expression levels of both transcripts to determine if there are differences that are correlated with rs3832406 genotype.
[0295] Folate/Homocysteine Levels
[0296] A correlation between the rs3832406 polymorphism and folate/homocysteine levels is determined. A collection of DNA samples where folate and homocysteine levels have already been assayed are genotyped for the rs3832406 polymorphism using the procedures described herein. A correlation may be found between genotype and folate/homocysteine levels. As folate and homocysteine levels may predict vascular disease and cancer risk, genotypes at rs3832406 may prove useful in estimating the risk for these diseases.
Example 8
[0297] The objective of these experiments is to determine if a polymorphism in MTHFD1L, for example, rs3832406, has an effect on the efficacy or proper dosage for a chemotherapeutic drug such as 5-fluorouracil (5-FU), and more generally whether a particular variant has an effect on the metabolic pathways that affect 5-FU/folinic acid (FA) action. Variable response of patients to administration of 5-FU or other drugs relevant to folate metabolism, or administration of the specific drugs can be used in identifying polymorphic variants responsible for such variable response. As described above, those polymorphic variants can then be used in diagnostic tests and methods of treatment.
[0298] 5-fluorouracil (5-FU) is a widely used chemotherapy drug. The effectiveness of 5-FU is potentiated by folinic acid (FA; generic name: leukovorin). The combination of 5-FU and FA is standard therapy for stage III/IV colon cancer. 5-FU is used in the standard treatment of gastrointestinal such as colorectal, breast and head and neck cancers. Clinical trials have also shown responses in cancer of the bladder, ovary, cervix, prostate and pancreas. Patient responses to 5-FU and 5-FU/FA vary widely, ranging from complete remission of cancer to severe toxicity.
[0299] This study compares the variance frequency distribution in the MTHFD1L rs3832406 polymorphism between groups of patients with solid tumors, treated by weekly or monthly regimen of 5-FU+FA and defined by level of toxicity (graded according to the NCl common toxicity criteria) as: Group 1: patients with high toxicity (grade III/IV on NCl criteria) Group 2: patients with minimal toxicity (grade 0/I/II on NCl criteria). This study helps determine whether the seven, eight, nine, or other multiple "ATT" repeat polymorphic variant affects the efficacy of the 5-FU+FA regemin, and can be readily adapted to test other drug regemins as well. The groups differ in the degree of toxicity experienced with treatment, if any: patients with high toxicity (grade III/IV on NCl criteria), and patients with minimal toxicity (grade 0/I/II on NCl criteria). Analyses are performed globally, then by regimen (monthly vs. weekly) and by type of toxicity (gastrointestinal vs. bone marrow). The statistical significance of the differences between polymorphic variant frequencies can be assessed by a Pearson chi-squared test of homogeneity of proportions with n-1 degrees of freedom.
[0300] In one embodiment, the number of subjects in the study is as follows: about 50-100 patients to each group. However, prior to testing to identify the presence of sequence polymorphic variants in a particular gene or genes, it is useful to understand how many individuals should be screened to provide confidence that most or nearly all pharmacogenetically relevant polymorphic variants will be found. The answer depends on the frequencies of the phenotypes of interest and what assumptions were made about heterogeneity and magnitude of genetic effects. At the beginning, only known phenotype frequencies, for example, responders vs. no responders, frequency of various side effects, etc., are known. The occurrence of serious 5-FU/FA toxicity, for example, toxicity requiring hospitalization is often>10%. The occurrence of life threatening toxicity is in the 1-3% range [Buroker et al., J. Clinical Oncology, 12:14-20 (1994)]. The occurrence of complete remissions is on the order of 2-8%. The lowest frequency phenotypes are about 2%.
[0301] In one embodiment, if homogeneous genetic effects are responsible for half the phenotypes of interest and for the most part the extreme phenotypes represent recessive genotypes, then one should detect alleles that will be present at about 10% frequency (0.1.x0.1=0.01, or 1% frequency of homozygotes) if the population is at Hardy-Weinberg equilibrium. To have an about 99% chance of identifying such alleles would involve searching a population of 22 individuals. If the major phenotypes are associated with heterozygous genotypes then alleles present at about 0.5% frequency (2×0.005×0.995=0.00995, or about 1% frequency of heterozygotes) should be detected. A 99% chance of detecting such alleles would involve about 40 individuals. Given the heterogeneity of the North American and other populations, one should not necessarily assume that all genotypes are present in Hardy-Weinberg proportions; a substantial oversampling is performed to increase the chances of detecting relevant polymorphic variants: For initial screening, 50-100 individuals of known race/ethnicity can be screened for polymorphic variant. Polymorphic variant detection studies can be extended to outliers for the phenotypes of interest to cover the possibility that important polymorphic variants were missed in the normal population screening.
[0302] Two major dosing regimens can be used: 5-FU plus low dose FA given for five consecutive days followed by a 23 day interval, or once weekly bolus intravenous 5-FU plus high dose FA. The higher FA dose results in plasma FA concentrations of 1 to 10 μM, comparable to those used for optimal 5-FU/FA synergy in tissue culture, however low dose FA (20 mg/m2 vs. 500 mg/m2) has produced comparable clinical benefit.
[0303] Leukovorin (folinic acid) is the most widely used 5-FU modulator, however a variety of other molecules have been used with 5-FU, including, for example, interferon-alpha, hydroxyurea, N-phosphonacetyl-L-aspartate, dipyridamole, levamisole, methotrexate, trimetrexate glucuronate, cisplatin and radiotherapy. S-1 is a novel oral anticancer drug, composed of the 5-FU prodrug tegafur plus gimestat (CDHP) and otastat potassium (Oxo) in a molar ratio of 1:0.4:1, with CDHP inhibiting dihydropyrimidine dehydrogenase in order to prolong 5-FU concentrations in blood and tumour and Oxo present as a gastrointestinal protectant. The experimental study can be carried out with one of these modulator in addition to 5-FU.
[0304] 5-FU toxicity has been well documented in randomized clinical trials. Accordingly, during the course of the experimental study, participants are monitored for such toxicities. Patients receiving 5-FU/FA are at even greater risk of toxic reactions and should be monitored carefully during therapy. A variety of side effects have been observed, affecting the gastrointestinal tract, bone marrow, heart and CNS. The most common toxic reactions are nausea and anorexia, which can be followed by life threatening mucositis, enteritis and diarrhea. Leukopenia and stomatitis is also a problem in some patients, particularly with the weekly dosage regimen. Toxicity is a major cost of 5-FU/FA therapy, measured both in patient suffering and in financial terms (the cost of care for drug induced illness).
[0305] Many non-genetic factors can influence the response of cancers to drugs, including tumor location, vasculature, cell growth fraction and various drug resistance mechanisms. Accordingly, in performing the drug trial, these non-polymorphic variables are controlled for by selecting participants with common attributes.
[0306] There are many potential candidate therapeutic interventions or drugs that can affect the folate and pyrimidine pathways. Categories of these are 5-FU prodrugs, drugs that affect DNA methylation pathways, and other drugs that have been developed for similar indications as 5-FU. The study can be performed using one of these drug in the alternative or in addition to 5-FU. 5-FU prodrugs are generally modified fluoropyrimidines that require one or more enzymatic activation steps for conversion into 5-FU. The activation steps may result in prolonged drug half-life and/or selective drug activation (i.e. conversion to 5-FU) in tumor cells. Examples of such drugs include capecitabine (Xeloda, Roche), a drug that is converted to 5-FU by a three-step pathway involving carboxylesterase 1, cytidine deaminase and thymidine phosphorylase. Another 5-FU prodrug is 5' deoxy 5-FU (Furtulon, Roche), which is converted to 5-FU by thymidine phosphorylase and/or uridine phosphorylase. Another 5-FU prodrug is 1-(tetrahydro-2-furanyl)-5-fluorouracil (FT, ftorafur, Tegafur, Taiho--Bristol Myers Squibb), a prodrug that is converted to 5-FU by cytochrome P450 enzyme, CYP3A4. In some embodiments, drugs acting on DNA methyation pathways are substituted or used in combination with 5-FU.
[0307] A variety of drugs are being developed for similar indications as 5-FU, and/or are being tested in combinations with 5-FU/leukovorin. These drugs can be substituted or used in combination with 5-FU in this study. Identification of patients likely to respond to 5-FU with or without leukovorin, may be useful in selecting optimal responders to other drugs. Alternatively, identification of patients likely to suffer toxic response to 5-FU containing regimens can allow identification of patients best treated with other drugs. Other drugs with activity against cancers usually treated with regimens containing 5-FU or in the alternative include the platinum compound oxaliplatin (L-OHP), the topoisomerase I inhibitors irinotecan (CPTI 1, Pharmacia-UpJohn) and topotecan, Suramin, a bis-hexasulfonated napthylurea; 6-hydroxymethylacylfulvene (HMAF; MGI 114); LY295501; bizelesin (U-7779; NSC615291), ONYX-015, monoclonal antibodies, for example, 17-IA and MN-14, protein synthesis inhibitors such as RA 700, angiogenesis inhibitors such as PF 4, and cyclooxygenase inhibitors. Additional drugs that can be substituted for or used in combination with 5-FU in accordance with this study include the following: quinazoline derivatives such as ZD1694 (Tomudex, AstraZeneca); ZD9331 (AstraZeneca); LY231514 (Eli Lilly); GW1843 (1843U89, GlaxoWellcome); AG337; and AG331; trimetrexate (US Bioscience); edatrexate, piritrexim; and lometrexol. More generally, 5,8-dideazaisofolic acid (LAHQ), 5,10-dideazatetrahydrofolic acid (DDATHF), and 5-deazafolic acid are structures into which a variety of modifications have been introduced in the pteridine/quinazoline ring, the C9-N10 bridge, the benzoyl ring, and the glutamate side chain (see article below). Other drugs include 2,4-diaminopyrido[2,3-d]pyrimidine based antifolates.
Example 9
[0308] The experimental study described in Example 8 is repeated using a relevant cardiovascular drug. This study and similar studies are helpful in improving therapies for atherosclerosis, thromboembolic diseases and other forms of vascular and heart disease. Homocysteine is a proven risk factor for cardiovascular disease. One important role of the folate cofactor 5-methyltetrahydrofolate is the provision of a methyl group for the remethylation of homocysteine to methionine by the enzyme methionine synthase. Variation in the enzymes of folate metabolism, for example methionine synthase or methylenetetrahydrofolate reductase (MTHFR), may affect the levels of 5-methyltetrahydrofolate or other folates that in turn influence homocysteine levels. The contribution of elevated homocysteine to atherosclerosis, thromboembolic disease and other forms of vascular and heart disease may vary from one patient to another. Such variation may be attributable, at least in part, to genetically determined variation in the levels or function of the enzymes of folate and one carbon metabolism described in this application. Understanding which patients are most likely to benefit from particular drugs assists in the clinical development or use of drugs to treat cardiovascular diseases. Such drugs include those aimed at the modulation of folate levels, for example, supplemental folate, and at other known causes of cardiovascular disease, for example, lipid lowering drugs such as statins, or antithrombotic drugs such as salicylates, heparin or GPIIIa/IIb inhibitors. In some embodiments, patients are included whose disease is significantly attributable to elevated homocysteine from treatment with agents aimed at the amelioration of other etiological causes, such as elevated cholesterol.
Example 10
[0309] The experimental study described in Example 8 is repeated using a relevant central nervous system (CNS) drug. Phencyclidine, an NMDA receptor antagonist, has been shown to induce a psychotic state closely resembling schizophrenia in normal individuals has led to attempts to modulate NMDA receptor function in schizophrenic patients. The amino acid glycine is an obligatory coagonist, with glutamate, at NMDA receptors via its action at a strychnine-insensitive binding site on the NMDA receptor complex, and consequently glycine or glycinergic agents, for example, glycine, the glycine receptor partial agonist, D-cycloserine, or the glycine prodrug milacemide, have been tried as an adjunct to conventional antipsychotics for the treatment of schizophrenia. Several trials have demonstrated a moderate improvement in negative symptoms of schizophrenia. Because the folate pathway modulates levels of serine and glycine, the endogenous levels of glycine in neurons may affect the response to glycine or glycinergic drugs. CNS drugs can also include drugs for treatment or prevention of Alzheimer's disease or other dementia.
[0310] All references, including publications, patent applications, and patents, cited herein are hereby incorporated by reference to the same extent as if each reference were individually and specifically indicated to be incorporated by reference and were set forth in its entirety herein.
[0311] The use of the terms "a" and "an" and "the" and similar referents in the context of describing the invention (especially in the context of the following claims) are to be construed to cover both the singular and the plural, unless otherwise indicated herein or clearly contradicted by context. The terms "comprising," "having," "including," and "containing" are to be construed as open-ended terms (i.e., meaning "including, but not limited to,") unless otherwise noted. Recitation of ranges of values herein are merely intended to serve as a shorthand method of referring individually to each separate value falling within the range, unless otherwise indicated herein, and each separate value is incorporated into the specification as if it were individually recited herein. All methods described herein can be performed in any suitable order unless otherwise indicated herein or otherwise clearly contradicted by context. The use of any and all examples, or exemplary language (e.g., "such as") provided herein, is intended merely to better illuminate the invention and does not pose a limitation on the scope of the invention unless otherwise claimed. No language in the specification should be construed as indicating any non-claimed element as essential to the practice of the invention.
[0312] Preferred embodiments of this invention are described herein, including the best mode known to the inventors for carrying out the invention. Variations of those preferred embodiments may become apparent to those of ordinary skill in the art upon reading the foregoing description. The inventors expect skilled artisans to employ such variations as appropriate, and the inventors intend for the invention to be practiced otherwise than as specifically described herein. Accordingly, this invention includes all modifications and equivalents of the subject matter recited in the claims appended hereto as permitted by applicable law. Moreover, any combination of the above-described elements in all possible variations thereof is encompassed by the invention unless otherwise indicated herein or otherwise clearly contradicted by context.
Sequence CWU
1
2813110DNAHomo sapiensCDS(54)..(2861)misc_feature(653)..(653)Xaa at
position 653 of the amino acid sequence = Arg or Gln 1gtggaacctc
gatattggtg gtgtccatcg tgggcagcgg actaataaag gcc atg 56
Met
1gcg cca gca gaa atc ctg aac ggg aag
gag atc tcc gcg caa ata agg 104Ala Pro Ala Glu Ile Leu Asn Gly Lys
Glu Ile Ser Ala Gln Ile Arg 5 10
15gcg aga ctg aaa aat caa gtc act cag ttg aag gag caa gta cct ggt
152Ala Arg Leu Lys Asn Gln Val Thr Gln Leu Lys Glu Gln Val Pro Gly
20 25 30ttc aca cca cgc ctg gca ata
tta cag gtt ggc aac aga gat gat tcc 200Phe Thr Pro Arg Leu Ala Ile
Leu Gln Val Gly Asn Arg Asp Asp Ser 35 40
45aat ctt tat ata aat gtg aag ctg aag gct gct gaa gag att ggg atc
248Asn Leu Tyr Ile Asn Val Lys Leu Lys Ala Ala Glu Glu Ile Gly Ile50
55 60 65aaa gcc act cac
att aag tta cca aga aca acc aca gaa tct gag gtg 296Lys Ala Thr His
Ile Lys Leu Pro Arg Thr Thr Thr Glu Ser Glu Val 70
75 80atg aag tac att aca tct ttg aat gaa gac
tct act gta cat ggg ttc 344Met Lys Tyr Ile Thr Ser Leu Asn Glu Asp
Ser Thr Val His Gly Phe 85 90
95tta gtg cag cta cct tta gat tca gag aat tcc att aac act gaa gaa
392Leu Val Gln Leu Pro Leu Asp Ser Glu Asn Ser Ile Asn Thr Glu Glu
100 105 110gtg atc aat gct att gca ccc
gag aag gat gtg gat gga ttg act agc 440Val Ile Asn Ala Ile Ala Pro
Glu Lys Asp Val Asp Gly Leu Thr Ser 115 120
125atc aat gct ggg aga ctt gct aga ggt gac ctc aat gac tgt ttc att
488Ile Asn Ala Gly Arg Leu Ala Arg Gly Asp Leu Asn Asp Cys Phe Ile130
135 140 145cct tgt acg cct
aag gga tgc ttg gaa ctc atc aaa gag aca ggg gtg 536Pro Cys Thr Pro
Lys Gly Cys Leu Glu Leu Ile Lys Glu Thr Gly Val 150
155 160ccg att gcc gga agg cat gct gtg gtg gtt
ggg cgc agt aaa ata gtt 584Pro Ile Ala Gly Arg His Ala Val Val Val
Gly Arg Ser Lys Ile Val 165 170
175ggg gcc ccg atg cat gac ttg ctt ctg tgg aac aat gcc aca gtg acc
632Gly Ala Pro Met His Asp Leu Leu Leu Trp Asn Asn Ala Thr Val Thr
180 185 190acc tgc cac tcc aag act gcc
cat ctg gat gag gag gta aat aaa ggt 680Thr Cys His Ser Lys Thr Ala
His Leu Asp Glu Glu Val Asn Lys Gly 195 200
205gac atc ctg gtg gtt gca act ggt cag cct gaa atg gtt aaa ggg gag
728Asp Ile Leu Val Val Ala Thr Gly Gln Pro Glu Met Val Lys Gly Glu210
215 220 225tgg atc aaa cct
ggg gca ata gtc atc gac tgt gga atc aat tat gtc 776Trp Ile Lys Pro
Gly Ala Ile Val Ile Asp Cys Gly Ile Asn Tyr Val 230
235 240cca gat gat aaa aaa cca aat ggg aga aaa
gtt gtg ggt gat gtg gca 824Pro Asp Asp Lys Lys Pro Asn Gly Arg Lys
Val Val Gly Asp Val Ala 245 250
255tac gac gag gcc aaa gag agg gcg agc ttc atc act cct gtt cct ggc
872Tyr Asp Glu Ala Lys Glu Arg Ala Ser Phe Ile Thr Pro Val Pro Gly
260 265 270ggc gta ggg ccc atg aca gtt
gca atg ctc atg cag agc aca gta gag 920Gly Val Gly Pro Met Thr Val
Ala Met Leu Met Gln Ser Thr Val Glu 275 280
285agt gcc aag cgt ttc ctg gag aaa ttt aag cca gga aag tgg atg att
968Ser Ala Lys Arg Phe Leu Glu Lys Phe Lys Pro Gly Lys Trp Met Ile290
295 300 305cag tat aac aac
ctt aac ctc aag aca cct gtt cca agt gac att gat 1016Gln Tyr Asn Asn
Leu Asn Leu Lys Thr Pro Val Pro Ser Asp Ile Asp 310
315 320ata tca cga tct tgt aaa ccg aag ccc att
ggt aag ctg gct cga gaa 1064Ile Ser Arg Ser Cys Lys Pro Lys Pro Ile
Gly Lys Leu Ala Arg Glu 325 330
335att ggt ctg ctg tct gaa gag gta gaa tta tat ggt gaa aca aag gcc
1112Ile Gly Leu Leu Ser Glu Glu Val Glu Leu Tyr Gly Glu Thr Lys Ala
340 345 350aaa gtt ctg ctg tca gca cta
gaa cgc ctg aag cac cgg cct gat ggg 1160Lys Val Leu Leu Ser Ala Leu
Glu Arg Leu Lys His Arg Pro Asp Gly 355 360
365aaa tac gtg gtg gtg act gga ata act cca aca ccc ctg gga gaa ggg
1208Lys Tyr Val Val Val Thr Gly Ile Thr Pro Thr Pro Leu Gly Glu Gly370
375 380 385aaa agc aca act
aca atc ggg cta gtg caa gcc ctt ggt gcc cat ctc 1256Lys Ser Thr Thr
Thr Ile Gly Leu Val Gln Ala Leu Gly Ala His Leu 390
395 400tac cag aat gtc ttt gcg tgt gtg cga cag
cct tct cag ggc ccc acc 1304Tyr Gln Asn Val Phe Ala Cys Val Arg Gln
Pro Ser Gln Gly Pro Thr 405 410
415ttt gga ata aaa ggt ggc gct gca gga ggc ggc tac tcc cag gtc att
1352Phe Gly Ile Lys Gly Gly Ala Ala Gly Gly Gly Tyr Ser Gln Val Ile
420 425 430cct atg gaa gag ttt aat ctc
cac ctc aca ggt gac atc cat gcc atc 1400Pro Met Glu Glu Phe Asn Leu
His Leu Thr Gly Asp Ile His Ala Ile 435 440
445act gca gct aat aac ctc gtt gct gcg gcc att gat gct cgg ata ttt
1448Thr Ala Ala Asn Asn Leu Val Ala Ala Ala Ile Asp Ala Arg Ile Phe450
455 460 465cat gaa ctg acc
cag aca gac aag gct ctc ttt aat cgt ttg gtg cca 1496His Glu Leu Thr
Gln Thr Asp Lys Ala Leu Phe Asn Arg Leu Val Pro 470
475 480tca gta aat gga gtg aga agg ttc tct gac
atc caa atc cga agg tta 1544Ser Val Asn Gly Val Arg Arg Phe Ser Asp
Ile Gln Ile Arg Arg Leu 485 490
495aag aga cta ggc att gaa aag act gac cct acc aca ctg aca gat gaa
1592Lys Arg Leu Gly Ile Glu Lys Thr Asp Pro Thr Thr Leu Thr Asp Glu
500 505 510gag ata aac aga ttt gca aga
ttg gac att gat cca gaa acc ata act 1640Glu Ile Asn Arg Phe Ala Arg
Leu Asp Ile Asp Pro Glu Thr Ile Thr 515 520
525tgg caa aga gtg ttg gat acc aat gat aga ttc ctg agg aag atc acg
1688Trp Gln Arg Val Leu Asp Thr Asn Asp Arg Phe Leu Arg Lys Ile Thr530
535 540 545att gga cag gct
cca acg gag aag ggt cac aca cgg acg gcc cag ttt 1736Ile Gly Gln Ala
Pro Thr Glu Lys Gly His Thr Arg Thr Ala Gln Phe 550
555 560gat atc tct gtg gcc agt gaa att atg gct
gtc ctg gct ctc acc act 1784Asp Ile Ser Val Ala Ser Glu Ile Met Ala
Val Leu Ala Leu Thr Thr 565 570
575tct cta gaa gac atg aga gag aga ctg ggc aaa atg gtg gtg gca tcc
1832Ser Leu Glu Asp Met Arg Glu Arg Leu Gly Lys Met Val Val Ala Ser
580 585 590agt aag aaa gga gag ccc gtc
agt gcc gaa gat ctg ggg gtg agt ggt 1880Ser Lys Lys Gly Glu Pro Val
Ser Ala Glu Asp Leu Gly Val Ser Gly 595 600
605gca ctg aca gtg ctt atg aag gac gca atc aag ccc aat ctc atg cag
1928Ala Leu Thr Val Leu Met Lys Asp Ala Ile Lys Pro Asn Leu Met Gln610
615 620 625aca ctg gag ggc
act cca gtg ttt gtc cat gct ggc ccg ttt gcc aac 1976Thr Leu Glu Gly
Thr Pro Val Phe Val His Ala Gly Pro Phe Ala Asn 630
635 640atc gca cat ggc aat tcc tcc atc att gca
gac cng atc gca ctc aag 2024Ile Ala His Gly Asn Ser Ser Ile Ile Ala
Asp Xaa Ile Ala Leu Lys 645 650
655ctt gtt ggc cca gaa ggg ttt gta gtg acg gaa gca gga ttt gga gca
2072Leu Val Gly Pro Glu Gly Phe Val Val Thr Glu Ala Gly Phe Gly Ala
660 665 670gac att gga atg gaa aag ttt
ttt aac atc aaa tgc cgg tat tcc ggc 2120Asp Ile Gly Met Glu Lys Phe
Phe Asn Ile Lys Cys Arg Tyr Ser Gly 675 680
685ctc tgc ccc cac gtg gtg gtg ctt gtt gcc act gtc agg gct ctc aag
2168Leu Cys Pro His Val Val Val Leu Val Ala Thr Val Arg Ala Leu Lys690
695 700 705atg cac ggg ggc
ggc ccc acg gtc act gct gga ctg cct ctt ccc aag 2216Met His Gly Gly
Gly Pro Thr Val Thr Ala Gly Leu Pro Leu Pro Lys 710
715 720gct tac ata cag gag aac ctg gag ctg gtt
gaa aaa ggc ttc agt aac 2264Ala Tyr Ile Gln Glu Asn Leu Glu Leu Val
Glu Lys Gly Phe Ser Asn 725 730
735ttg aag aaa caa att gaa aat gcc aga atg ttt gga att cca gta gta
2312Leu Lys Lys Gln Ile Glu Asn Ala Arg Met Phe Gly Ile Pro Val Val
740 745 750gtg gcc gtg aat gca ttc aag
acg gat aca gag tct gag ctg gac ctc 2360Val Ala Val Asn Ala Phe Lys
Thr Asp Thr Glu Ser Glu Leu Asp Leu 755 760
765atc agc cgc ctt tcc aga gaa cat ggg gct ttt gat gcc gtg aag tgc
2408Ile Ser Arg Leu Ser Arg Glu His Gly Ala Phe Asp Ala Val Lys Cys770
775 780 785act cac tgg gca
gaa ggg ggc aag ggt gcc tta gcc ctg gct cag gcc 2456Thr His Trp Ala
Glu Gly Gly Lys Gly Ala Leu Ala Leu Ala Gln Ala 790
795 800gtc cag aga gca gca caa gca ccc agc agc
ttc cag ctc ctt tat gac 2504Val Gln Arg Ala Ala Gln Ala Pro Ser Ser
Phe Gln Leu Leu Tyr Asp 805 810
815ctc aag ctc cca gtt gag gat aaa atc agg atc att gca cag aag atc
2552Leu Lys Leu Pro Val Glu Asp Lys Ile Arg Ile Ile Ala Gln Lys Ile
820 825 830tat gga gca gat gac att gaa
tta ctt ccc gaa gct caa cac aaa gct 2600Tyr Gly Ala Asp Asp Ile Glu
Leu Leu Pro Glu Ala Gln His Lys Ala 835 840
845gaa gtc tac acg aag cag ggc ttt ggg aat ctc ccc atc tgc atg gct
2648Glu Val Tyr Thr Lys Gln Gly Phe Gly Asn Leu Pro Ile Cys Met Ala850
855 860 865aaa aca cac ttg
tct ttg tct cac aac cca gag caa aaa ggt gtc cct 2696Lys Thr His Leu
Ser Leu Ser His Asn Pro Glu Gln Lys Gly Val Pro 870
875 880aca ggc ttc att ctg ccc att cgc gac atc
cgc gcc agc gtt ggg gct 2744Thr Gly Phe Ile Leu Pro Ile Arg Asp Ile
Arg Ala Ser Val Gly Ala 885 890
895ggt ttt ctg tac ccc tta gta gga acg atg agc aca atg cct gga ctc
2792Gly Phe Leu Tyr Pro Leu Val Gly Thr Met Ser Thr Met Pro Gly Leu
900 905 910ccc acc cgg ccc tgt ttt tat
gat att gat ttg gac cct gaa aca gaa 2840Pro Thr Arg Pro Cys Phe Tyr
Asp Ile Asp Leu Asp Pro Glu Thr Glu 915 920
925cag gtg aat gga tta ttc taa acagatcacc atccatcttc aagaagctac
2891Gln Val Asn Gly Leu Phe930 935tttgaaagtc tggccagtgt
ctattcaggc ccactgggag ttaggaagta taagtaagcc 2951aagagaagtc agcccctgcc
cagaagatct gaaactaata gtaggagttt ccccagaagt 3011cattttcagc cttaattctc
atcatgtata aattaacata aatcatgcat gtctgtttac 3071tttagtgacg ttccacagaa
taaaaggaaa caagtttgc 311022808DNAHomo
sapiensmisc_feature(1958)..(1958)n at position 1958 of the nucleic
sequence = g or a 2atggcgccag cagaaatcct gaacgggaag gagatctccg
cgcaaataag ggcgagactg 60aaaaatcaag tcactcagtt gaaggagcaa gtacctggtt
tcacaccacg cctggcaata 120ttacaggttg gcaacagaga tgattccaat ctttatataa
atgtgaagct gaaggctgct 180gaagagattg ggatcaaagc cactcacatt aagttaccaa
gaacaaccac agaatctgag 240gtgatgaagt acattacatc tttgaatgaa gactctactg
tacatgggtt cttagtgcag 300ctacctttag attcagagaa ttccattaac actgaagaag
tgatcaatgc tattgcaccc 360gagaaggatg tggatggatt gactagcatc aatgctggga
gacttgctag aggtgacctc 420aatgactgtt tcattccttg tacgcctaag ggatgcttgg
aactcatcaa agagacaggg 480gtgccgattg ccggaaggca tgctgtggtg gttgggcgca
gtaaaatagt tggggccccg 540atgcatgact tgcttctgtg gaacaatgcc acagtgacca
cctgccactc caagactgcc 600catctggatg aggaggtaaa taaaggtgac atcctggtgg
ttgcaactgg tcagcctgaa 660atggttaaag gggagtggat caaacctggg gcaatagtca
tcgactgtgg aatcaattat 720gtcccagatg ataaaaaacc aaatgggaga aaagttgtgg
gtgatgtggc atacgacgag 780gccaaagaga gggcgagctt catcactcct gttcctggcg
gcgtagggcc catgacagtt 840gcaatgctca tgcagagcac agtagagagt gccaagcgtt
tcctggagaa atttaagcca 900ggaaagtgga tgattcagta taacaacctt aacctcaaga
cacctgttcc aagtgacatt 960gatatatcac gatcttgtaa accgaagccc attggtaagc
tggctcgaga aattggtctg 1020ctgtctgaag aggtagaatt atatggtgaa acaaaggcca
aagttctgct gtcagcacta 1080gaacgcctga agcaccggcc tgatgggaaa tacgtggtgg
tgactggaat aactccaaca 1140cccctgggag aagggaaaag cacaactaca atcgggctag
tgcaagccct tggtgcccat 1200ctctaccaga atgtctttgc gtgtgtgcga cagccttctc
agggccccac ctttggaata 1260aaaggtggcg ctgcaggagg cggctactcc caggtcattc
ctatggaaga gtttaatctc 1320cacctcacag gtgacatcca tgccatcact gcagctaata
acctcgttgc tgcggccatt 1380gatgctcgga tatttcatga actgacccag acagacaagg
ctctctttaa tcgtttggtg 1440ccatcagtaa atggagtgag aaggttctct gacatccaaa
tccgaaggtt aaagagacta 1500ggcattgaaa agactgaccc taccacactg acagatgaag
agataaacag atttgcaaga 1560ttggacattg atccagaaac cataacttgg caaagagtgt
tggataccaa tgatagattc 1620ctgaggaaga tcacgattgg acaggctcca acggagaagg
gtcacacacg gacggcccag 1680tttgatatct ctgtggccag tgaaattatg gctgtcctgg
ctctcaccac ttctctagaa 1740gacatgagag agagactggg caaaatggtg gtggcatcca
gtaagaaagg agagcccgtc 1800agtgccgaag atctgggggt gagtggtgca ctgacagtgc
ttatgaagga cgcaatcaag 1860cccaatctca tgcagacact ggagggcact ccagtgtttg
tccatgctgg cccgtttgcc 1920aacatcgcac atggcaattc ctccatcatt gcagaccnga
tcgcactcaa gcttgttggc 1980ccagaagggt ttgtagtgac ggaagcagga tttggagcag
acattggaat ggaaaagttt 2040tttaacatca aatgccggta ttccggcctc tgcccccacg
tggtggtgct tgttgccact 2100gtcagggctc tcaagatgca cgggggcggc cccacggtca
ctgctggact gcctcttccc 2160aaggcttaca tacaggagaa cctggagctg gttgaaaaag
gcttcagtaa cttgaagaaa 2220caaattgaaa atgccagaat gtttggaatt ccagtagtag
tggccgtgaa tgcattcaag 2280acggatacag agtctgagct ggacctcatc agccgccttt
ccagagaaca tggggctttt 2340gatgccgtga agtgcactca ctgggcagaa gggggcaagg
gtgccttagc cctggctcag 2400gccgtccaga gagcagcaca agcacccagc agcttccagc
tcctttatga cctcaagctc 2460ccagttgagg ataaaatcag gatcattgca cagaagatct
atggagcaga tgacattgaa 2520ttacttcccg aagctcaaca caaagctgaa gtctacacga
agcagggctt tgggaatctc 2580cccatctgca tggctaaaac acacttgtct ttgtctcaca
acccagagca aaaaggtgtc 2640cctacaggct tcattctgcc cattcgcgac atccgcgcca
gcgttggggc tggttttctg 2700taccccttag taggaacgat gagcacaatg cctggactcc
ccacccggcc ctgtttttat 2760gatattgatt tggaccctga aacagaacag gtgaatggat
tattctaa 28083935PRTHomo
sapiensmisc_feature(653)..(653)Xaa at position 653 of the amino acid
sequence = Arg or Gln 3Met Ala Pro Ala Glu Ile Leu Asn Gly Lys Glu
Ile Ser Ala Gln Ile1 5 10
15Arg Ala Arg Leu Lys Asn Gln Val Thr Gln Leu Lys Glu Gln Val Pro
20 25 30Gly Phe Thr Pro Arg Leu Ala
Ile Leu Gln Val Gly Asn Arg Asp Asp 35 40
45Ser Asn Leu Tyr Ile Asn Val Lys Leu Lys Ala Ala Glu Glu Ile
Gly 50 55 60Ile Lys Ala Thr His Ile
Lys Leu Pro Arg Thr Thr Thr Glu Ser Glu65 70
75 80Val Met Lys Tyr Ile Thr Ser Leu Asn Glu Asp
Ser Thr Val His Gly 85 90
95Phe Leu Val Gln Leu Pro Leu Asp Ser Glu Asn Ser Ile Asn Thr Glu
100 105 110Glu Val Ile Asn Ala Ile
Ala Pro Glu Lys Asp Val Asp Gly Leu Thr 115 120
125Ser Ile Asn Ala Gly Arg Leu Ala Arg Gly Asp Leu Asn Asp
Cys Phe 130 135 140Ile Pro Cys Thr Pro
Lys Gly Cys Leu Glu Leu Ile Lys Glu Thr Gly145 150
155 160Val Pro Ile Ala Gly Arg His Ala Val Val
Val Gly Arg Ser Lys Ile 165 170
175Val Gly Ala Pro Met His Asp Leu Leu Leu Trp Asn Asn Ala Thr Val
180 185 190Thr Thr Cys His Ser
Lys Thr Ala His Leu Asp Glu Glu Val Asn Lys 195
200 205Gly Asp Ile Leu Val Val Ala Thr Gly Gln Pro Glu
Met Val Lys Gly 210 215 220Glu Trp Ile
Lys Pro Gly Ala Ile Val Ile Asp Cys Gly Ile Asn Tyr225
230 235 240Val Pro Asp Asp Lys Lys Pro
Asn Gly Arg Lys Val Val Gly Asp Val 245
250 255Ala Tyr Asp Glu Ala Lys Glu Arg Ala Ser Phe Ile
Thr Pro Val Pro 260 265 270Gly
Gly Val Gly Pro Met Thr Val Ala Met Leu Met Gln Ser Thr Val 275
280 285Glu Ser Ala Lys Arg Phe Leu Glu Lys
Phe Lys Pro Gly Lys Trp Met 290 295
300Ile Gln Tyr Asn Asn Leu Asn Leu Lys Thr Pro Val Pro Ser Asp Ile305
310 315 320Asp Ile Ser Arg
Ser Cys Lys Pro Lys Pro Ile Gly Lys Leu Ala Arg 325
330 335Glu Ile Gly Leu Leu Ser Glu Glu Val Glu
Leu Tyr Gly Glu Thr Lys 340 345
350Ala Lys Val Leu Leu Ser Ala Leu Glu Arg Leu Lys His Arg Pro Asp
355 360 365Gly Lys Tyr Val Val Val Thr
Gly Ile Thr Pro Thr Pro Leu Gly Glu 370 375
380Gly Lys Ser Thr Thr Thr Ile Gly Leu Val Gln Ala Leu Gly Ala
His385 390 395 400Leu Tyr
Gln Asn Val Phe Ala Cys Val Arg Gln Pro Ser Gln Gly Pro
405 410 415Thr Phe Gly Ile Lys Gly Gly
Ala Ala Gly Gly Gly Tyr Ser Gln Val 420 425
430Ile Pro Met Glu Glu Phe Asn Leu His Leu Thr Gly Asp Ile
His Ala 435 440 445Ile Thr Ala Ala
Asn Asn Leu Val Ala Ala Ala Ile Asp Ala Arg Ile 450
455 460Phe His Glu Leu Thr Gln Thr Asp Lys Ala Leu Phe
Asn Arg Leu Val465 470 475
480Pro Ser Val Asn Gly Val Arg Arg Phe Ser Asp Ile Gln Ile Arg Arg
485 490 495Leu Lys Arg Leu Gly
Ile Glu Lys Thr Asp Pro Thr Thr Leu Thr Asp 500
505 510Glu Glu Ile Asn Arg Phe Ala Arg Leu Asp Ile Asp
Pro Glu Thr Ile 515 520 525Thr Trp
Gln Arg Val Leu Asp Thr Asn Asp Arg Phe Leu Arg Lys Ile 530
535 540Thr Ile Gly Gln Ala Pro Thr Glu Lys Gly His
Thr Arg Thr Ala Gln545 550 555
560Phe Asp Ile Ser Val Ala Ser Glu Ile Met Ala Val Leu Ala Leu Thr
565 570 575Thr Ser Leu Glu
Asp Met Arg Glu Arg Leu Gly Lys Met Val Val Ala 580
585 590Ser Ser Lys Lys Gly Glu Pro Val Ser Ala Glu
Asp Leu Gly Val Ser 595 600 605Gly
Ala Leu Thr Val Leu Met Lys Asp Ala Ile Lys Pro Asn Leu Met 610
615 620Gln Thr Leu Glu Gly Thr Pro Val Phe Val
His Ala Gly Pro Phe Ala625 630 635
640Asn Ile Ala His Gly Asn Ser Ser Ile Ile Ala Asp Xaa Ile Ala
Leu 645 650 655Lys Leu Val
Gly Pro Glu Gly Phe Val Val Thr Glu Ala Gly Phe Gly 660
665 670Ala Asp Ile Gly Met Glu Lys Phe Phe Asn
Ile Lys Cys Arg Tyr Ser 675 680
685Gly Leu Cys Pro His Val Val Val Leu Val Ala Thr Val Arg Ala Leu 690
695 700Lys Met His Gly Gly Gly Pro Thr
Val Thr Ala Gly Leu Pro Leu Pro705 710
715 720Lys Ala Tyr Ile Gln Glu Asn Leu Glu Leu Val Glu
Lys Gly Phe Ser 725 730
735Asn Leu Lys Lys Gln Ile Glu Asn Ala Arg Met Phe Gly Ile Pro Val
740 745 750Val Val Ala Val Asn Ala
Phe Lys Thr Asp Thr Glu Ser Glu Leu Asp 755 760
765Leu Ile Ser Arg Leu Ser Arg Glu His Gly Ala Phe Asp Ala
Val Lys 770 775 780Cys Thr His Trp Ala
Glu Gly Gly Lys Gly Ala Leu Ala Leu Ala Gln785 790
795 800Ala Val Gln Arg Ala Ala Gln Ala Pro Ser
Ser Phe Gln Leu Leu Tyr 805 810
815Asp Leu Lys Leu Pro Val Glu Asp Lys Ile Arg Ile Ile Ala Gln Lys
820 825 830Ile Tyr Gly Ala Asp
Asp Ile Glu Leu Leu Pro Glu Ala Gln His Lys 835
840 845Ala Glu Val Tyr Thr Lys Gln Gly Phe Gly Asn Leu
Pro Ile Cys Met 850 855 860Ala Lys Thr
His Leu Ser Leu Ser His Asn Pro Glu Gln Lys Gly Val865
870 875 880Pro Thr Gly Phe Ile Leu Pro
Ile Arg Asp Ile Arg Ala Ser Val Gly 885
890 895Ala Gly Phe Leu Tyr Pro Leu Val Gly Thr Met Ser
Thr Met Pro Gly 900 905 910Leu
Pro Thr Arg Pro Cys Phe Tyr Asp Ile Asp Leu Asp Pro Glu Thr 915
920 925Glu Gln Val Asn Gly Leu Phe 930
935430DNAHomo sapiensmisc_feature(15)..(15)n = g or a
4catcattgca gaccngatcg cactcaagct
30571629DNAHomo sapiensmisc_feature(53753)..(53753)n at position 53753 of
the nucleic sequence = g or a 5gtggaacctc gatattggtg gtgtccatcg
tgggcagcgg actaataaag gccatggcgc 60cagcagaaat cctgaacggg aaggagatct
ccgcgtaagc acctgacatt gttgttgagt 120gtggggctgt ggacgagggc tctgagggtg
tgcaggtccc ccggacccat tttttgcggg 180agggacactt gtagcggaag agttaggccc
ataacacctg aagacctgca gccaaagaaa 240gagaacccgg gcacaacctg cgtttgcaag
tggactgcct agtggctgta gccgctggct 300cctgtgcttc ggggaaaagt cctgtgccga
ctatgctccc aaacgcgcag ctgctggtga 360ctttctgccg ggagagtggg tagttgcggc
ttcctggagc ccccctagtc agaactcaga 420aatccaatct acttaattcc cgttaatttg
gaggtgagta gatggagact agtgaggtga 480caaggcttga caagactaca aggctgataa
tgacagaggg tccttccagc tcatagcttt 540tcccactata ccatagtctt atcatacgct
aagtgaaatc taatacattt ttcctttctc 600ttcaatctcg tgcttctctc taacagccat
ctctctgccc tggtcctaga gttacttgaa 660acgtaatttg gctcctgagt cctgagaata
ttctgcattc cacttgacct ctgcctgttc 720tacccttgcg ggacatctga atgcttaatt
agtgggtatc gttgccagct ttactcccag 780gaagtaacac tggctgcttt ttgtctggga
ggtgactggt gaggaaaaat gagagtagaa 840ggaagaggga aaaatatatt ccctctagta
gatgaaggag ggaaagatat attctgtgtg 900accttactga gtttgctctt gggacctttc
acatcacccg ttatgtaagt aactgttgcc 960agccacagca gttactaaat tgtaattgtc
ctgtaactcc taactttgta ataggttttt 1020gagtgaatta caaagtcatc ggtggtgaaa
agtccgggcc cagtggctca tgcctgtaat 1080cccagcactt tgggaggccg aggtgggtgg
atcacctgag gtctggagtt caagaccagc 1140ctatccaaaa tggtgaaacc ccatctctac
taaaaataca aaaaattagc cgagtgtggt 1200ggcaggcatc tgtaatccca gctacttggg
aggctgaggc aggagaatca cttgaacctg 1260gggggcagag gttgcagtga gctgagatgc
gccattgcac tccagcctgg gcaacaagag 1320cgaaactccg tctaagaaaa aaaaaaaaag
aaatcacaat acttactgct tagtgctttt 1380caacatgtgg cttgctaaac tgtatttctg
ctttactttc tgtttttgtt tgcttgtatt 1440caaacttact tccttttcca gtggcttttt
gttttaatat aaaaatcttc cttttagaaa 1500gactgattgt gaagagataa tttagtgcat
aataatagtg cagataattt agtacaataa 1560taatgcatct gggccagtac tgagaatagg
tgctcactga atatactttc gttaattgga 1620gagcaaatta aatatgttta tcaactccat
aagggcaaat acatgtattg tcttattttt 1680gtctatatcc taaacacctg gaaaccctgt
ttcctgcctt actacttgtg tggttttaag 1740caaattaatt aacctttctg tgccaaagtt
tcctcaaatg ttaagtaaaa gtaatgtcta 1800ggtcattggg atgttagtag aattaaatgg
gttcctaaca gatgatcatt tataattctt 1860cctggcacat aataagtatt cgatatatgt
tgtattaata atcagtaatt agttaatgta 1920tatatgaatg attcattatc tttttttttt
tttacatggt agctgctgcc ttgagatcta 1980tttttttttt tgagatgaag tcttgctgtg
tccaatcttg gctcactgca acctctgcct 2040cctgcataca agcaattctc tcatatcagc
ctcctgagta cctgggatta caggtgcccg 2100ccatcacgcc tagctaattt ttttttttga
gatggagttt cactcgtcgc ccagactgga 2160gtacaatggc gtgatctcgg ctcactgcaa
cctctgcctc ctgggttcaa gcgattctcc 2220tgcctcagcc tcccaagtag ctgggattac
aggcgtccgc caccatgtcc agctagtttg 2280tttgtttgtt tgagacagag tttcactctt
gttgcccagg ctggagtgca atggcatgat 2340cttggctcac tgcaacctcc acctcccagg
ttcaagcgat tctcctgcct cagcctcccg 2400agtagctgag actacaggtg tgtgccaccc
cgtccggcta atttttgtat ttttagtaga 2460gacgggcttt caccatgttg gtcaggctgg
tctgtaactc ctgaccccat gtgatccacc 2520cacctcggcc tcccaaagtg ctgggattac
aggcatgagc cactgcaccg gtcctagaat 2580ttcttaaaaa tccagcccac ttcccacatt
ccatcctgcg gattaggaaa tcatttgagg 2640ccagaaaagt gatgtgattt gtccaagggt
gaatggcaat ggctcaacta aaactaaggt 2700cttcacccag tatatgatgt ttttcattct
aaccagcact tactacttgc taaaaggaat 2760ttcgtaagct tttctttttc ttttctttct
ttctttcttt ttatttttca gacaaggtct 2820tgctctgttg ctcaggctgg agtgcaatgg
cacgatgttg tctcactgta acctccactt 2880cctgggttca aacgattctc ctgcctcagc
ctcccaggta gctgggacta cgggcatgca 2940ctactatgcc cggctaattt ttgaattttt
agtagagatg gggtttcgcc atgttggtca 3000agctggtctt gaactcctga cctcaagtga
tctgtccacc tccacctcct aaagtgctag 3060gattacaggt gtgagccact gcgcccccgc
tggcgctttg ctttttcttt acagtcttcc 3120tctcattagc atgtttccca ccctgctgag
aatgctatcc ctgactcgtc ctcttctatt 3180attattctcc ttttatgcct tcctgggagc
tcggacatca acaaagttca gcattccaga 3240ttccttctct acctgctcta ctctgtggga
aactgggtac ttacatgatg gatttagttg 3300ccaaattatc tttctttgga agtaacagcc
ccttttctcc ctcacgggga attctaagtt 3360tgcagcctga atttgacaac cgggggaagg
tctgagagtt agagacaagt atctgttctt 3420tgtgggtatt tattcactta ctcaacaagt
agctgtatgt gtcccctaag ccaggcaatc 3480ttcaaagctc tgagagtcta atagtgaaca
aaacagccca aatccctacc ctcatgaaac 3540ttgcatactg atgggtagag acagacattt
aaataggtga aacacattgt gagctggtgg 3600gaagcacaaa gagaacacca gagcagaaaa
gagggacagg agtttctgtg gtatggtggc 3660tgcaatttta aatatgggag tcatagtctt
tgtaaaagtg acttacgggc aaagacttga 3720aggaggtgga gccttaagcc tgccaatgtt
gtgagggaaa tagctcctga ccaaaaacac 3780agcctgtgcc atagaccttt gtcctcacca
gttactaact tactgctttg gggggtcaat 3840ggcaatggcc ctgtgagaag ctgaccgagt
ggaggctcct gagtagagga acctgaccct 3900gccctgggca gaagggtggt gtcttttaaa
agattatgca aagtgggaga aggatgtgca 3960ttagtgttgg aagacccacc ttgctttgta
ggcaagaaag catagttgtt ctataagtgg 4020gcctttcaaa ggggggctta ctacatttat
atgatttgtt tggtttttcc ttcccttagt 4080ttgctataat taagcctgct ctctccattg
gtggcatttt tcttgattaa gtctcctttc 4140taaagagact ggttgtgaag aggcagatgg
tttgagtagg tgctgcgagt gtttgctggc 4200caagtgcagg gcaagttggg tatatttgta
agctcgctgt ggatggtagc tccagagcct 4260aagcacggtg ctgcctcctc tgtgcatcac
tgggtcatgc aatttacctc tgtgccttag 4320cttctgtata ttggggtagt aatgtcacct
aactcataag gttgttagaa ttaaatgagt 4380ttaataccta taaagcattt agagaagttg
ttggtacatg gtaagtttta ataaataggt 4440acttagaaag tactcattga ataaaaaagt
gatgtttgtg tttatccagg ctgtaccttt 4500gggagctggc ttataatctt gagacttaaa
attagaattt atcctagtga tcaaagataa 4560aagtttggaa ccttggctgt acatttagct
acagggctag aagaactgtt tttgtgaaaa 4620taatttgctt accatagtga agggcaggtc
ttcttggtta ctaagttaac atggataaga 4680cttttacttt gtcctgtaga ttaaatacat
gtaggccctt ttgtgcttaa aaagatttgt 4740ctgggcaggg cgtggtggct catgcctgta
atcctaggac tttgggaggc ccaggcagga 4800ggatcacctg agatcaggag ttggagacca
gcctgactaa catggtgaaa ccccatctct 4860actaaaaata caaaattagc tgggcatggt
ggcccatgcc tgtaatccca gctactccag 4920aggctgagac aggagaatgg cttgaacctg
ggaggaggag gttgcagtga gccagtgagc 4980caagatggcg ccattgcact ccagcctggg
caacaagagc aaaacttcat ctcagaaaga 5040aaaaaaaaaa aaatttgcct gtaatgaact
ccaaagtgca ggttgctcaa aagttacatg 5100gaaacttgac tgtcatagct ggattgattc
tggaagcctg tgcaggagag gtcggagttc 5160cagagaggct gggaccttca gctaggaccc
cagccaggct gttgggcagg ttgcagcgat 5220agtgcagcag agcccagagc tggcagccaa
ctactatggt catggcacct gattggggat 5280gtggtttgca aataagtgtc tgtccttgtc
catcagggga gttgggatca ggcagtttgg 5340gatttgggta gtggggtttt gagcagaggt
ctgtggttcc ctgttacatc ccctatttgc 5400atgaagaatc ttctagaagg aaagctggcc
tagtacagag gagggtcctt ggagtggaag 5460caaatggctc ccttctcctc taattgactc
agtagaggtt ggtagaatag aggttaaggg 5520gcagggatgc ttgtgtggcc ttggaaaagt
tagcttcact ttcctttctg tgcagtaggg 5580atgatgagaa ctgtctactt cctgacgttg
ccccaaggat taaaagaatg taagactctc 5640aaaacaagct tctttgacgt cccatccaca
gtgcagaagg cccccattac ccctgcactg 5700tgcggttggc tctcctttgc accccttaat
tgaccaaaaa gcagcttctg tcaccctgta 5760aatccttgac cttgaagaag tgggtggcta
aaggtgccac agtcaagatt cctcagccac 5820tcggctacta cccttaccct caattttgca
gtttatgaca ctttagccag aagctaagtg 5880acttgcccag gtgttccaag gtaggtggca
gaggctcgat ggtgcatttc tttccctacc 5940atgggcttgt cattggcttg ctagaaggaa
ttgcaaactt ctaccctcca gcaggagaca 6000ccctgggttt ggttctccct gaggcctttc
cattgtgagg aaatcattcc acccatcagt 6060ctggttggga agcacccaaa cagattctct
tcaaaaagtc atttttgaag ttatagtcca 6120attttgactg gttagagaca ggtcagacct
acttactttg ctggctaagc tttttttttt 6180tctttgcccc tcaccacaag cctggacggg
agacgtactg cttggtatct ggtgcttgag 6240aggtcccagg agataaggga gcatttctgt
gaaaggtttt gccagagcct gtcaggaact 6300taatcttgac tgcaatgaat gacttaacat
tataagtgat ctttgggttt aggtgcccaa 6360ataccagtcc ctccttagta agcagccttt
tagacctaac cttttctacc taaggttttg 6420ttcagtcttt ttgaaaagca ggaaggccag
gtccttgtga ggcttttctc attacaagct 6480cttggcaggt tcttccctcc aagtggagaa
cccttacctt ccccattctt tctaccttcc 6540cctctgatga tccagagggc tgcaagggac
ctagagctgt tcattggagc cctagtcagc 6600tctagcggcc agatctatgg ttaggtaatc
ctccaatagt gagactaatt ctcagtgtta 6660gaattagggg tctgcctgtt ctcaccagca
gacccactgg catcaactct cctttgttga 6720catatagtta agcaccgctt cttacagatc
ctcatactca gagccaacct tgtcttgtct 6780cataactctt gctactggca gaaggtgata
aaattgtggc ctagaggcca aatgtttttt 6840gtttggctta cagaattaga ttacttctca
acacatttat tactttacaa cattagatta 6900cttcttaaca cacttatgcc tgtagttaca
cataaaaatc tggatgcttg actttttgaa 6960agatgtgaac agtttagcaa tactgtactg
tctatcttca tgccatcaat ttgttggatc 7020tgagtagatg ctgcctaccc cctcatcctt
gagtgggtgg gggtggggga ggtgggacct 7080cattgttcag gaatcctacc acacttggtt
gcctgtacca ggactccgaa ggttctttcc 7140tctttgagac ctttttctga atgaacttat
ggtccacttt taaaatccta aattttataa 7200agccttaacc agtccaacct agtccgttta
ttgcattaga tcagaggttg tcaaatgggg 7260cagttttttc ctccctctac ctcagggcat
tgggcatgcc tggagtcttt tgggttgtca 7320tatggggaag gggtatgcta ttggcatcta
gcagtagagg ccaggactgg tgccgagcat 7380cctgcagtgc acaggatagt ccccataaca
aaggatgatc tggtccaaat gtcagtggtg 7440ctgagatcga aaaaccctga tcaagaagct
taatgtgagt tattgtgtca ttgtgtgtgt 7500ggattaatga tggactttac cctcactaag
aattatagga ttcattaaaa ttttacacta 7560ttctgactta ttttcatgtc ataaatatca
agcagtttac ctttgctttc ttagggcttt 7620gagttagaga caattggatg gaagaacttt
acattataaa ctttcaatga agcataaaat 7680agcttcctca cattaagctt tcaaatacgt
atttgaggta actgtcaaat ttggtaaatg 7740tctgggttaa attataataa tttttttagt
aaatattctt atgtaataat agtactattg 7800tcagttgact gtcttgtata ctgatatagc
ttttgctata ttagtgatag gtgaaaatca 7860gttttctttc aacaagaagg taatttagtg
ttttaatttg acttaaatga ctagcaattt 7920gaaatatatg tatttcttgg aacactaggt
ggccttacaa tcacaattat ttttacaagg 7980aactacatat taagttttta aaaatttttt
gaaggttttt tttttttttt ttttgagatg 8040gagtttcact cttgttgcct aggctggagt
gcaatggcgc gatctcggct caccgcaacc 8100tctgcctccc gggttcaagc gattcccgtg
cctcagcctc ccgagtagct gggattacag 8160gcatgcgcca ccgctcccag ttaatttttt
tttttttttt ttgtattttt agtagagacg 8220gggtttcacc gctcaggctg tgtcaaaccc
cccagctcag gtgatccacc cacctcagcc 8280tcccaaagtg ctgggattat aggcgtgagc
caccgtgccc agccgaagta ttttttagta 8340gtataaaaat tgttaatgaa caaatgaaat
aaatcaacaa atgaacaaaa tgggaataaa 8400acatattaaa taaaacattt gtgttctgca
ttttaaaaat cttataatag ctgggcgccg 8460tggctcactc ctgtaatccc agcactttgg
gaggccgagg agggcagatc acgaggtcag 8520gagatcgaga ccatcttggc taacacggtg
aaaccccgtc tctactaaaa aatacaaaaa 8580attagctggg cgtggtggcg ggcgcctgta
gtcccagcta cttgggaggc tgaggcagga 8640gaatggcttg aacccgggag gcggagcttg
cagtgagccg agatcgcgcc actgcactcc 8700agcctgggag acggagggag actgcgtctc
aaaaaaaaaa aaaaaaaaaa aaaaaaaaaa 8760aaaaatatat atatatatat atatatatat
atatatatat atatatatat atatataaaa 8820aacagtaatc tattggggca ggggataaat
tgtgtccttg tttagaagaa gacttttttt 8880tttttagaag aagccctgtt gtttttgttg
ttgttatttg tttgtttgtt tgttttgaga 8940cggagtctca gtgggtcgcc caggctggag
tgcagtggcg cgatctctgc tcactgcaac 9000ctctgcctcc cgggttcaag cgattctcct
gcctcagcct cctgagtagc tgagactaca 9060ggcgcgtgcc accacgcctg gctgattttt
cgtatttttt agtggagacg gggtttcacc 9120atattggcca ggctggtctc gaactcctga
cctcgtgatc tgcctgcctt ggcctcccaa 9180agtgctggga ttgcaggcgt gagccatcat
gcccggcccg aagccctgta ttttgaaggt 9240gaatgttttt gcttcttctc tgatcagtta
ctaaaagtgt tttccctgtg ccatttgaga 9300attatttgag aactactcta agcttatggc
acatttcagc atttttgttg tctaaaggca 9360gctgtatttt aatcatgtaa atttcatgct
gcctcctttg cctaccatat agcttctgtt 9420gaaatgtcca tgttatattt tattttattt
actcattttt ctatatccat aataagagtt 9480aaaggttata ttttagcaat gtgttttttt
ctctgtcata acatgttaaa tattttaaat 9540tcagccatga tgttaaacag agtgaattgg
aatttctaac ggatttcggc aggaatgatt 9600atataagact tagggccaaa gagagcctgg
gtgactgtgt aggggtatta ggctatgccc 9660agcaccacat ctggttgtca ttcaggggag
cagctagggc tggaactgct ggggagtgca 9720aactgcagta ttccctgttc ccttgtctca
gagcagaatt gtggcctcta tccaggtcca 9780atcccagtta aaaaaaaaaa ttccaatggt
ctattcttct tttgagctat attattgtag 9840agaatactga cttgaaaaaa caaagtcact
aactagaaat aaaatttgaa ctgggtgcag 9900tggcttataa ctgtaatccc agcacttttg
ggaggccaag gtgggaggat cacttgagct 9960caggagttca agatcagcct gggcaacatg
gcaaaatccc atctctacaa aaaatacaaa 10020aattagctga agtgtggtgg tgtgcacctg
tagttccagc tacttgggag gctgaagtgg 10080gaggatggct tgagcccagg aggtggaggt
tgcagtgagc tgtgattgca tcactgcact 10140ccagcctggg tgatagagcc agaccctgtc
tcaaaaaata aaataaataa ataaaaatca 10200aaataaataa aatttgaaaa tcttgggtat
gacttctttc ttgtaccatt caacattctt 10260gatgtttact aggccagtcc aaataatcta
tttcttgtgt gcctgtttat gtcagttttt 10320tgtgacgtgt ggagaagtgg aactgggagt
aaccaaatca gaatgaagga agagctgttt 10380gtgaatgtcc atatgacaaa tgacaaaact
ggttttccaa gagggacaga gcatgttcaa 10440gttgaacatt gtatcatata tcataactgg
gcaaggttga gctattcaga tatgtattta 10500tggatgtaat ttcatttact tgacatccta
gtgtattgtt gagaagttgt atgttggtaa 10560attgatcatc ttagttttgc aagaggttat
actttctttc actctaataa ggagactgca 10620ggaggttata cttttagcct tgtgttaaat
tctttgtaaa gaaaattctt ttgtgaatta 10680gaatataatt tgtaagttta tactgaattt
ggagtgggat tcattgattt atcattcaac 10740cttatgtgga tacgctgtgg tcatcttact
agtgcaggtt gagcaaaagg tgtgtgcttg 10800agtttgttcc tgtaaaattt ttcatgacag
cacctggaga gtggaaatgt gaaaaaagat 10860gagcatcagc actttctaga agtgttggtc
taagttttca tttatttact atatgatttt 10920gaatgccatc aactaataca ttttcaatta
tccctgaacc ctgtggctca gttagtgtca 10980gtgggagtaa gaaaagatcc acgtgtccgg
gaacagtggc tcacgcctgt aatcccagca 11040ctttgggagg ttgagatggg tggatcacga
agtcaggagt tcgagaccag cctgaccaac 11100atggtgaaac cccatctcta ctaaaaatac
aaaaattagc cgggcgtggt ggcacgtgcc 11160tgtaatctca gctactcagg agactgaggc
agcagaatcg cttgaaccca ggaggtggag 11220gttgcagtga gcagatatcg caccattgca
ctccagcctg ggcaaaagcg agtccccatc 11280tcaaaaaaaa aaaaaaaaaa aagaaaaaag
aaaaagattc acatctactt gaggcctgga 11340aaattgttgg gaagttcctg tacatattaa
accagataat gtccattcag tggtgttttt 11400attttatttt attttatttt ggagacagtc
ttgctctatc gtccaggctg gagtgcaatg 11460gcaccatcat agctcactgc agcctcaacc
tcctgggctc cagccatcct cctgccgcag 11520cctcctgagt agctgggact acaggggcct
gccaccactc ccaactaatt ttttattctt 11580tgtagagatg gggtctcact atgttgccca
ggatagtctt aaactcctgt tctaaagcaa 11640tcctcttgcc ttaggttccc aaagtactga
gattacaggc atgaaccatt gcctgctcct 11700ggccagtgtt tttgatttaa aagagaaata
caggccgggt gcagtggctc acgcctgtaa 11760tcccagcact ttgggaagct gaggtggggg
gatcacctga ggtcgggagt tcgagaccag 11820cctgaccaac atagagaatc tccacgttta
ctagaaatac aaaattagcc aggcttggtg 11880gcgcatgcct gtaatcctag ctactcggga
ggctgaggca ggagaatcga ttgaacctgg 11940gaggtggagg ttgtggtaag ctgagatcgc
accattgcac tccagcctgg gcaatgagag 12000caaaacttgg tctcaaaaaa aaagagaaat
acaggccggg tgcagtggct cacgcctgta 12060atcccagcac tttgggaggc cgaggtccta
gatcgcttga gcccaggaat tcaagaccag 12120cttgggcaat ggtgaaaccc atctctacca
aaagtacaaa aattagccag gtgtggtggc 12180acagacctgt aaagtcccag ctactcagga
ggctggggcg aaaggatcgc ttgagcccag 12240gaggtcgagg ctgcagtgag ctttgattgt
gccactgtac tcccagcctg ggtgacagag 12300cgagaccctg tgttaataaa caaacaaaca
aataaaagag aaatacatct aataatctgc 12360atcacttatt tgtgcttgaa acattcagtg
ttaaccccac cctttccttt ttctctaggc 12420aaataagggc gagactgaaa aatcaagtca
ctcagttgaa ggagcaagta cctggtttca 12480caccacgcct ggcaatatta caggtattat
gatagtgtat tttcattaat gttgtctttg 12540cttgatttct cagtatcatt tcagacctct
ccctctactt tcctcctttt tttttggctg 12600ttgtgtaatc tcatgccttt tcagtctctc
ttggcaggct cccccctttg gctcagcttc 12660taaatgttgg catatcctgg ccctctgctc
ttctctgtgg ataccctttc tctaggccag 12720tgtctcttaa cctttggtct tagtagggcc
cctttacaat cttttttttc tttaatttaa 12780tgtatttatt tatttattta cttttgagag
acaaggtctt actctgttgc caggctggaa 12840tgtagtagtg caatcatagc tcactgtagc
gttgaactcc tggcttcaag cagtcctcct 12900gcctctgcct gccaaagcat gggtattaca
ggcgtgagcc actgtacctg gccccgttta 12960cactcttaaa gatgtaagac ctcaagaagc
tattgtttat acagatttat ccattaatgt 13020ttactatatt agaaattagg aagttttggc
cggacacggt ggctcctgcc tgtaatccag 13080cactttggga ggccgaggtg ggcagatcac
ctgaggtcag gagtttgaga ccagcctggc 13140caacatggtg aaaccccatc tttattaaaa
acacaaaaat tagctgggcg tggtggtggg 13200cgcctgtaat cccagctact ctggaggctg
aggcaggaga atcacttgaa cccgggaggt 13260ggaggttgca gtgagccaag attgtgccac
tgcactccag cctgggcgac agagagattc 13320cgtctcaaaa aaaaaaaaaa gagaaattag
gaagttttaa aaatgtttat tcatttaaaa 13380atgccaatag tgagctcatt tattatatgt
taacataaat aacattttat ggaacaaaaa 13440cttttttaaa aagaaaattt agtgaaaaga
ggggcattat tttatatttg tgcacacctc 13500tttcctgtat ggcttagtag aagagagctg
gagtcctatc tgcctctgca ttcagtcttt 13560tgctgtgtgc tggtttgatt gaaatatatg
gagaaaattt ggcttcatac agatgtgtag 13620ctataaaagg gaagaatttt tttaacagtc
cttccaggta attgtgggta tttttctttg 13680atgttgtacc aggactcagt aagtaaattt
taaaagatta gttgcaatgt agaatctgaa 13740actctatcag tgaactttga tactttgtta
catcttctca gacttgcaca ctttgttaca 13800tcaaaatgtt acatttggaa tgaatattta
ccccatgcat ggttttgtaa catcctttgt 13860tcctcatttg gaaaataata gttcccggaa
ttatgcagat cttccaaatg ttaacatttc 13920attatgcaat atgaaagaat tggaatcatt
aacatcacca acaagctcat cagagaagtt 13980ttcaagtatt gagaagctgt caagctcaca
ggggtggata aaaaatttta attttcaccc 14040aaaaactcac attttatcat tggcaacaaa
tacaattaat tgtttccctt gaaataccag 14100tttcacttta ttcatttttg agaaaatatt
tgtcaaatac tcaagcctga ataatcattg 14160cttgtcagtt gttatttcaa gtaggtgttt
catataaaaa tgcatctgtt tcagctaaca 14220aggcaaaaaa gcacaagtgc agcctggaca
acaaagtgag atcccatctc cacaaaaaaa 14280ttaaaaaatg agccaggagt ggtcgtatag
aggtgtggtc ccaactacat gagagggaga 14340caggagatca cttgagccca ggaagtcgag
gctgcaggga gttgtgttca tgccactgta 14400ctccagctta ggtgacagtg agatcctgtc
tcaaaaaaaa ataaaaataa aaataaaaaa 14460gcactaatgc tttagtaaca accacaacca
ctgtacttct gtatatagca gaagtgcatt 14520atgagtgctt cccaattcct tacatagact
attaaaaaca catgcactca agagttgagg 14580tttatttata tatatatatt ttttactttt
ttgggacaga gttttactgt tactcagtgc 14640agtggcacaa tcttggttca ctgcaacctt
catctcccag gctcaagtga tcctcccact 14700cagcctccag agtagctggg attacaggca
tgcgccacca tgcctaattt ttttttttga 14760gacggagtct tgctctgtcg cccaggctgg
agtgcagtgg tgcaatcttg gcttactgca 14820gcctccgctc ccagattcat gcaattctcc
tgccttggcc tcctgagtag ctgggattac 14880aggcacctgc cattgcaccc tgctaatttt
ttttttttga aacagaatct cgctctatcc 14940cccaggctgg aatgtaatga cataatctca
gctcactgca acctttgcct cctgggttca 15000agtgattctc atgcctcagc ctctggagta
gctgggatta caggtgccac gcccagctaa 15060tttttgtaat tttagtagag atagggtttc
accattttgg ccaggctggt ctcgaactct 15120tgacctcagg tgatccacct gcctcagctt
cccaaagtgc tgagattaca ggtgtaagcc 15180actgcgccca gccttaattt tttgtatttt
tttgtagagt ttttttttgc cttgttgccc 15240tagctggtct tgaactctgg gacccaaagt
gatcctccca ttttagcctc ccaaagtgct 15300gggattacag gcatgagcta ctgcgcttgg
ccttaatttt ttgtattttt ttgtagagac 15360ggttttttgc cttgttgccc aggctagtct
ccaactcctg ggctcaaagt gatcctcccg 15420ccttggcctc ccaaagtcct gagattacag
gcatgagcca ctgtgctcgg ccaagttgaa 15480gtttaagaaa attaataatg tatatttctt
catcaagggt ggtcctagtg aaactggcat 15540ttagttttac tgggagtgtg tggtagtgaa
ggataccttg actactagtc caaagccatt 15600gtcttgattt gtgctggttc actagtagtc
ttacacacca ttgcttttgc accatcggta 15660cagacgttaa cacaatgaaa aaggcaaaca
acattgtagt ggtattatta tgaagctagt 15720ctggccctca taaatctttg aaaaggtctt
ggggtccctc aagagtgtgt gggccacatt 15780atcagaacca ttattttagg cagtttcatc
tatcaggcca gtgacttcta aatctagatt 15840tcttctgagc tccaggctct tggtctggga
caccactggt ttccttccca ccttcaagtc 15900tcctttgctt gttgttgctc atcaccctaa
tacctaaacc atttgaatgc cttgggctca 15960cccttgagat ctcttcccca tcttattctt
tccctaagta acctcaccca gtcccatagc 16020ttcatgaggg aatgcttcca aatttaggtc
tctatccagt atttcttccc tgagctccag 16080attcacccag tagggagata ttcaatacat
atttttacgg acaaatgtat gaaggaatgt 16140ttgcaatctt gatttggatg acaggaaact
cacggctttc atttgaaact cccatcagag 16200tttcacccaa gttggacata gtctccttct
caatctcttt atttgtatgt ctccaataga 16260tcatttaatc ccttgctaaa tttctatccc
ttgttcatca agaatgtcaa ggtgggcatg 16320gtgcggtggc tcatgcctat aatcccagca
ctttgggagg ctaaggcggg tagatcactt 16380aaggtcagga gtttgagact agcctggcca
acatggtgag aactcatctc tactcaaaat 16440acaaaaatta gccaggtgtg gtggtgcaca
cctgtgatca cagctactca ggaggctgag 16500gcaggagaat cgcttgaacc tgggggacag
aggttgcagt gagctgagat tgtgccactg 16560cactccagcc taagactccg tctcaattta
aaaaaaaaaa agtcaaagtg aatagtactt 16620aaatattccc ctctcctcct ctgctgtttg
tctaagaggg cttagctgag aagaaaagca 16680cagaaggcct gtcactctct gcctggcatg
ttctctgctg tggtggattt tccctttttt 16740gcaaattact aagaatcaaa agtatatatt
gtgggtgtat actggtacat ttggtggctg 16800gttatttctt ggcttatttt tgcaaattgt
ttgttgtaga aagagtaaat gtcataggca 16860agaagtatac cctttttaaa gcaacattgg
gggtgttttt tttcctccat tttaaaaact 16920gacttccaac cttgtttttt aaatttattg
attacaaatg tttttattac ataaacactg 16980ttctacagct cgtctcctac gtccttgctg
ataaaacagt ggtagaatct tttaattttg 17040tactttagca ccagagtcaa cagctttctg
tgatacaagg tttatataat atgttgtaat 17100cacatctccc tgcctcagac ctctagtctt
tttcgctgtg gtgctggcac caccatcttc 17160ccacattccc tgagctagaa gcctgaggcc
gtctccttag gagctctctt cctcactccc 17220tgcatccagt taggggctat tgtcctattg
attgtgcctc tcatcaactc ttggaatctg 17280ttgcctcccc ttgattccta tttctcccaa
ccctggggcc actatagtta cctgcttaat 17340gatttggggt tctatctata tacctcctgt
ctgtcctcct gactgcaatt gcaatccttg 17400ttaaaatgca agtctgatca caatctatcc
agccacactc actctccatc ctttaccaga 17460atgatagtac aagttgagta tcccttatcc
gaaatgcttg ggacaagagt gttttagatg 17520ttggaatctg tatatacata gggatgagat
ccaattctaa agcaaaattt atttgtgttt 17580cattatatac cttatacttg tagactgaag
gtaattttat atatatgtgt gtgtgtgtat 17640atatatatat aataataatt attattatta
ttattttttt tgagactgag ttttgctctt 17700gtcacccagg ctggagtgca atggcacgat
ctcggcgcac tgcaacctcc acctcctggg 17760ttcaagtgat tctcctgcct cagcctcctg
agtagctggg attacaggtg tgttccacca 17820cgcctggcta atttttgtat ttttagttga
gatgagattt caccatgttg gtcaggctgg 17880tcttgaactc ctgacctcag gtgatccgcc
tgcctcggcc tctcaaagtg ctaggattac 17940ggacatgagc cactgctccc ggccaatttt
atatattttt aataatattt tgcatgaaat 18000gcacttttaa ctgtgttttg actgtaagcc
atcacatgag gccaggtgtg gaattttcta 18060cttttggcgt tgtcagtgct caaagtttct
gattttggag cattttttat ttgtggtttt 18120tttttttttt ttttttgaga cagggtctca
ctctgttgtc caggctggag tgcaggggtg 18180tgatcacagt tcattgcagc ctggatctcc
caggctcaag ctatcctccc atctcagcct 18240cctgagtagc tgggactaca ggcatccagc
taattttagt atttgagaca gaatttctcc 18300atattgccca ggctagtctc agactgagct
caagtgatcc acctgccttg gcctcccaaa 18360gtgccgagtc accgcacctg actgattttg
gatttttgag ttagggatct tcaacctgta 18420ctacttatat agagtgtcta caataattca
aattgtttgt ttatatttct aatcgttcta 18480tgttttatac taccctacga tgtaggagat
gtccccttat agccatagta actcatgctc 18540aaagatcttg ggtagcttgc ccaaagccag
tcaggaaggg actgagtctg ggaaatgaac 18600ccagatctgt ctctaaaacc catttgttct
tcccctttgt tttcctaatt ccttagcaat 18660gcattcaaga acattcatga tctggcccaa
gtatatattt aaaagttgag ttttcatttt 18720tttttaataa agtggatgag taacttccca
aataactgaa tatttatagg aaattaagac 18780ttgacttttc attataagag atactaatgg
gctaggcatg gtggctcacg cctgtaatcc 18840caccacttta ggaggccaag gcgagtggat
tgcttgagcc caggagttca agaccagcct 18900gggaaatata gagagacttt gtctccaaaa
aaatacaaaa attagctggt tgtggtggtg 18960cttgcctgta gtccccacta ctgaggaggc
tgagctggga ggattgctca cgcctgggag 19020gttgaggctg cagtgagctg taatcgtgtc
attgcactcc agcctgggtg acagagcaag 19080actttgtctc aaaaaaaaaa aaaagtttct
ggtaaggtga tagaaataca gcctcagccc 19140catagcctca ttagtatctc tcactctctc
tctttttttt ttttttttga gtcagaatct 19200tgttctgtca tgcaggctgg agtgcagtga
tgtgatctct gctcactgaa acctctgccc 19260tcccctgtcc cccaggggat cctcccacct
tagcctccca aatgagctgg gactacaggc 19320acaccaccat gcccagataa tttttgtatt
ttttgtagag atggggtttc gatatgttgc 19380ccaggctggt ctttaactcc tgggctcaag
caatccacac accttgacct cccaaagtgc 19440tgggagtata ggcatgagcc actgcacctg
gccaaaaaac tgatttttaa cagtattcgg 19500gaactattca gagtgcttta tgtatagtaa
ctcattcagt cctcacaaaa tacatggaca 19560gaatacagtt aataaactca actagcctgt
atcttcataa ggttttctga aacagaaaga 19620aaaatgaacc agtagaccaa tacccaagat
atcttacggt gtagaacaca gattagtaaa 19680ctgtggcccg tatccaaatg ctgcccagtt
tataaagagt taaaaaagca atagcagcag 19740ctgctgtgac acggtatgtg tcccacaaag
cctaaaatat acaccgtctg cccctctaca 19800gaaagtttgc agactcctgg tataaaaaaa
aagtgtccct aacctgttct ggaataggat 19860cttttttgga tgatgtagta ctcagccacc
ttcaaggaga aatcagcatt cttttttttt 19920tttttttttt tgagatggag tttcactctt
gtcgcccagg ctggagtgca atggcacgat 19980cttggctcac tgcaacttct gcccccgggg
ttcaagggat tctcctgcct cagcttccca 20040agtagctggg attacaggtg cacaccatca
tgactggcta atatttgtat ttttagtata 20100gatggggttt caccatcttg gccaggctag
tctcgaactc ctgaccttag gtgatctgcc 20160cgccttagcc tcccaaagtg ctgggattac
aggcgtgagc caccatgccc ggccagaata 20220cacgttcttt aaacattctt tttgcatcta
gtctatcaat ggttctataa ccactttccc 20280ctataaaatg cctatactgg ccaagtgtgg
tggctcacac ctacctagca ctttggcagg 20340ctgaggcagg aggattgctt aagcccaggt
gtttgagacc agcctgggca atagagaaac 20400tccatttcta caaaaaatac aaaaattggc
tgtttgtggc agcgtgtact ggtagtccta 20460gcttcttggg aagctgaggt gggaggattg
cttgagccta ggagttgcag gctgcagtga 20520gccatgatca tgctactgta ctctaggctg
ggtggcagag taagaccctg tctcaaaaat 20580aataaataaa taaataaaac ctatactaag
agtgactctt tgaattggct tttagatgtc 20640ttttccaata aaactttgac atacgctcat
aactataact attaagtttt ctttctggaa 20700ctggttgtag ttttagtgtt ttagacaaac
tccttcagaa atattcaggc cgaaaatagt 20760tacatgtaaa ggaacaacaa cagcgagaga
cttcagactt ctcctgtgtg acattagatg 20820ccaaagacaa catctccgaa ggaaaatgct
gtcactccaa atcaggcaag gccagtcaag 20880tgcaagggac agcagaaaaa tattttcagt
tatacagact tagaaacatg tccacctacc 20940tttcctggaa aactgactta taagttacat
ttcacttact tgaaacatga atgaagataa 21000attctagaac agagaagttg tggcttcctt
actggcagca agtgataaaa caatttaatc 21060tcagagttaa gactaaataa ctagtggtag
gtataaatat gaatgaaatt gtaaaaaaat 21120aatacttttg tatttagaac cacctccttt
cctactttga tctaaaaatt ccggaatgta 21180tcaaatctgg taactgaacc agtgagggtt
gggtggtggg gcaaagaaca tacatagaaa 21240gtgaccaaag aggaaacatg agaggcacga
aaagtctagg gaagccccta cgagaagcat 21300ctaatgtact tagatacaaa taacccatta
aaaaaaaagg tagaattccc ctcaaaggac 21360tcttctggct cttgcaattc aaattagcac
cacaaaactg agaactgtct gtataaggga 21420caactgaagc taagattcag gccatgtcct
gagatctgag gagagcgtcc agtaatagca 21480ccaatatgac aagctcagaa aatcctgtgt
tacaaaagcc ggcttctttc tctgccatgg 21540tcctgttaaa aggagctgtg cttcagttgt
ctttgatatg aagggaccac cacctttccc 21600tctgtgatac cctctgcaat tctgggagaa
acacaactta tggtagcatg tatcaacctt 21660gagaaggaga aataccaacc actggagcat
gaaacaatat tagtacaata aaattttaca 21720aataaatggt tataatgata aactataaat
atgttcatgg tcaaagaaac agagggaaca 21780tagtgagaca ctgcatggtg gttcaaagga
tttagcctct gcactacaag tttctccctg 21840tctctctctt ttttttttaa gagacaaggt
ctctctctgt tgcccaggct gcccaggcat 21900agtaccatga ctcactgtgc actggtgcaa
tcatggaaaa actcttgacc tcaagtaatc 21960ctcccacctt ggcctcctga gtgctgggac
tacaggcagg caccactatg cctggctaat 22020tttttgtaaa gatggggggg ggggtctcac
tatgttgccc aggctggtct cgaactcctg 22080ggctcaaacg atcctcctcc cttggcctcc
caaagtgcag ggattataga cctgagttgc 22140tgcacccggc ctcctctctg tgttggggcc
agcagattgc cctttgtctt gcctggctgt 22200ttattctgtg taaactagag ggaggtcctt
ccaccctcct ttctattgtc tgtgtggcgg 22260ggccagggag ctttccattg cccctgcggg
tgggggagag ggtgtggaat gtgccctctg 22320ggtgctcctg cctctcaggc atttttcttg
cctgatgaca gtttctcagc cctccatact 22380ctgtttttct ggtatggcag cagaaatccc
acatgtttct agtaggcctg cagcatcctt 22440gcttagtttc cagtgcagat gcagatatcc
tctactttct gtttggggtt tggcagtggg 22500aaccaggtgg gccttgttta ggggaggggc
tgtaacacca tctgtgttca cataatccct 22560cctgccccca cagcctccct atactcctgt
aaaagttctc tgcaacaaga gtcgcatgat 22620taaacatttc ttttattaaa actaaaattg
tagaagcttt tctgtgcact ttgcctttcc 22680ctaatcatct gatttgcatg catttattat
tctaggttgg caacagagat gattccaatc 22740tttatataaa tgtgaagctg aaggctgctg
aagaggtaac gccagaagag ctgtgccatc 22800accctcccca acctccacct gggtcctatc
gattacccct tgcccttttt ggtcctccct 22860gtgaaaactt ttatcaccca caggagacat
tagtttatct attttgtaag tcagaaatgg 22920gtgggttaga tctgtggcct ctggtttgcc
cgtgagtttt tgttttctca ttaatgattt 22980tcttcccaaa gagcaggatg tcaattgagg
aaagtgacaa gtctcgatca ttttaggaga 23040tttatttgcc aaaattaagg acacacctgg
gagacaggtc tatacctttc tccgaagatg 23100atttcgagtt ctccaaattt aaagaggaaa
gggtgggtta ttgaaaagta cacgttttca 23160tgtaagaggt gggtagggaa aaatagtcat
tcatgacctt ctctggctca gtgaatctgc 23220attttttaca taagatgaca gatggccgga
cgcggtggct cacgccagta atcccagcac 23280tttgggaggc cgaggcgggc ggatcacctg
aggtcgggag ttttagacca gcctaagaaa 23340catggagaaa ccccatctct actaaaaata
caaaaaatta gctgggtgtg gtggcgcatg 23400cctgtaatcc cagctactcg ggaggctgag
gcaggagaat cgcttgaacc tgggaggcgg 23460aggttgtggt gagccaagat agtgccattg
cactccagcc tgggcaataa gagtgaagct 23520ccaaccagaa aaaaaaaaga cgatatagac
aagtggggca gaggaaaaat gcaggcaatc 23580tgcattttac ataagataac atagacagaa
ttgggcaggg gaaaaatcag atatacattt 23640gtgtctggtg ggctgggggg atttacattg
ccatggtgaa attttaacag aaacatcata 23700aagatgttgc agctcactac gaatttcttt
gtgggcaaaa tatgggggag gcatgtagct 23760tttcatcttg taacatctta tttaggaacc
aaaaaggggg aggcagattt gcatgaccca 23820gttcccagct tggcttttcc ctttgactta
atgaatttgg ggtcccaaga tttaatttcc 23880tttctcaaga aatagcaagc atgacattgc
tgaagtgcag tggtggcaat taacaagtga 23940atgatgaatg ggttgaagtg agggactcat
tcttcagaac atataagggt gaaatgatga 24000ccaacacctc cttgcttgtt agtcatgtct
gtaaaagaaa tggttcaata tctcaagttg 24060tcttgccatt tacctgcttt taatgtctgt
tttgcagatt gggatcaaag ccactcacat 24120taagttacca agaacaacca cagaatctga
ggtgagcttt tatgagttga ttgtgaagag 24180ggaaggtgaa gtggtttcct ctctggtttt
ggtttacggg ggctttgggg actgacacat 24240ttagccaaga cctccaggta tttttttttt
tttttttttt gagacacagt tttgctcatg 24300tcacccaggc tggagagcaa tgatgcgatc
tcgactctct gcaacctccg cctcccaggt 24360tcaagcaatt ctcctgcctc agcctcccga
gtagctggga ttacaggcgc ccaccgcctc 24420acctggctaa tttttatatt tttaatagag
acgcggtttc accacgttgg ccaggctggt 24480cttgaactcc tgacctgagg tgatttaccc
gccttggcct cccaaagtgc tgggattaca 24540ggcgtgagcc accacgcctg gcacctccag
gtattctttc ttcacagact cccacagagg 24600agatggtgca gggatgaaca tttggtactc
aaagtgtgac tgcagtactg tttgcttcca 24660aatcatctgt tgtgtttgtt aaaactgatc
cttggagcca ggcacggtgg ctcatgcctg 24720taatcccagc attttgggag gccgaggtgg
gcagatcacc tgaggtcggg agttcgagac 24780cagcctggcc aacatggaga aaccctgtct
ctactaaaaa atacaaaatt tgccaggcgt 24840ggtggcgcat gcctgtaatc ccagctactt
ggaaggttga ggcaggagaa tcacttgaac 24900ccgggaggcg gaggttgcag tgagccaaga
tcgtaccatt gcactccaac ctgggaaaaa 24960agagcaaaac tccgtctcaa aaaaataggt
aaaaccgatc cttggcctgt aattatctgc 25020attctaaaaa ccaaatatgt attcctttta
tatgaggttc aggactaggc agaacaaatc 25080agcggtgatt aaagtcagaa ctgtggttgc
ctgtaggtgg gagattaatg ctaaactgta 25140ggaggataaa gcaactttct tgaggttatg
gcaatgttcc atatatgcgc tgtccagtat 25200gggagccact agccccatga ggctatttga
atttagttaa aatgaaataa aattaaaaat 25260tcacttcctt ggttgtacta gccactttgc
aaggattaag tagctgctat tgtattgtaa 25320ggtgcaaatt atagaaaatt tctgccatcg
cagaaaagat gattggacat tgctgctcta 25380catcttgatt tggatgattg taacatgggt
attattgctg taataagtta ttagagttct 25440tagctctttt tattttattg tattgtattt
ttagagacag ggtctcactc ttttgcctca 25500gctgcagcac agtggtgtga tcatagctca
ttgtagcctc aacctcctaa gctcaagtga 25560tcctcccacc ttagcctctt gagcagctgg
gactataggc atgagccacc atgcccagct 25620aattttattt tatttttgag gcagagtttt
actcttgtcg cccaggctag agtgcagtgg 25680tgcaatattg gctcactgca acctccacct
cccaggttca agcgattctc atgcctcagc 25740ctcctgagta gctgtgacta caggcatgcg
ccaccacgcc cagctaattt ttgtattttt 25800aattgcccat attgcccagg ctggtctcaa
actactgacc tcaagtgatc tgcccactca 25860agcctcccaa agtgctggga ttacaggcgt
gagccaccgc acctgtcccc gctttttttt 25920tttttttttt ttttttgaga cagagtcttg
ctctgttgcc caggctggat gcagtggcat 25980gatctcagct cactgcagcc tccgcttcct
gggttcaagc gattctcatg cctcagcctc 26040ctgagaggct gagattacag gcatgcgcca
ctatgcccgg cttatttttt atatttttgt 26100agagacaggg tttcaccatg ttggccaggc
tggtttcaaa ctcctggctt caagtgatcc 26160agtagctgct attgtattgt aaggtgcaaa
ttatagaaaa taattttcta tattttggga 26220ggcctgcctt ggcttcccaa agtgctggga
tgacaggtgt gagccaccac acccagccct 26280aatttttttt tttttttttt gaggtggagt
ttcactattg ttgcccaggc tggagtgcaa 26340tggcacaatc tcaactcact gctggagtgc
agtggcacta tctcagcgga ctgcaacctc 26400cacctcccag gttcaagcaa ttctcctgcc
tcagcctccc gagtagctag gattacaggc 26460atgtgccacc acacctggct aattttgtat
ttttagtaga gacagggttt cactatgttg 26520gcccagctgg tctcaaactc ccaacctcag
gtgatccacc cacctcagcc tcccaaagtg 26580ctggaattac agtcgtgaac caccgcaccc
agcccctggc cctaattttt aaaaagtttt 26640tgtagatatc acatttcact gtgttgtcca
ggctggtctt gaactcctgg gttcaagtaa 26700tcctcctgac ttggcctccc aaagtgctgg
gattacaggc atgagacacc tcgcccagcc 26760tcttagctct tttatttgat ggacttctta
ccagacacct aggggatgca ttctatgtta 26820aagtacaggt gctctcagga ttcaagggga
aggtacattc tgaaaggact ataattaaag 26880tgttgggggg aaatagggca acctaatttt
tgccttagaa agtgtttctt cctacctttt 26940gatttctgtg tgatatacaa attatatatg
gtattacttt taggtgatga agtacattac 27000atctttgaat gaagactcta ctgtacatgg
gttcttagtg cagctacctt tagattcaga 27060gaattccatt aacactgaag aagtgatcaa
tgctattgca cccgagaagg atgtggatgg 27120gtaagtgtgg cttggcttcc tatgtctcat
tgcatgcttt catttataat atgtttctat 27180ttggggaata cagcataaat gtttctaaag
agtaaagaca tctaattgcc tttatttcct 27240tcttatttcc atcacttttt taagattgac
tagcatcaat gctgggaaac ttgctagagg 27300tgacctcaat gactgtttca ttccttgtac
gcctaaggga tgcttggaac tcatcaaaga 27360gacaggtaaa aacaacaaac caaacaacaa
gaaagcacca tttcctgaat cctggtcttg 27420acatgtggtg gcatagagga ggtacattgt
gggccataaa taaatggact tgattggcag 27480aagggcctgt ctactaagga tgttacctaa
attttctctc aattttctca gcaactctgt 27540gaagtaggta ctgttattgt tcaactgcta
agaaaatagg ctcaggccag gcacagcggc 27600tcatgcctgt tatcccagca ctttgggagg
ctgaagcggg ctgagatcac ctgaggtcag 27660gaattcgaga ccagcctggt caacatggtg
aaaccctgtc tctactaaaa atacaaaaaa 27720attatccagg cagggtggca tgcacctgta
ttcccagcta ttcgggaggc agaggtggga 27780ggattgattg aacctgggag gcggaggttg
cagtgagccg agatcacctg actgcactcc 27840agcctgggca acagagcgag actctgtctc
taaaataaac aaacaaacaa acaaacaaac 27900aaaaccagct cagagagagc aagtaacttg
tctgggagac aacggaacca agatttgaat 27960tcagagttat ctgagttcaa agcaatttat
tttaacctat acagtaataa aaatttggat 28020agtttaaaag gatatgaagt gaaaagcccc
acctcccccc acccccaaaa tgggcactga 28080atctccacag tagttccagg actcgacaga
tggctggaaa tgggtctcct gcacagttgg 28140ccatattgtc ccgttactag ttagcacctt
gctattttaa tttcattatg aatactttta 28200gcagaggtta ttttttacaa ggaaatgggt
cctggtaaaa agaagtaggt cagaagtttc 28260aagtggaaag ttttggccag cataaatgaa
aattttctgt ggttggcaag aaaaggtcaa 28320aggtcaagct ttgacagttc atttctacag
ttaaatgtat attcttcagc atatttagca 28380aacattgtag tgccttgctt aaaagaaaaa
aatcccacga ggacagcacc tgaaggcctc 28440ccactgcctc ttattgagtg atgctaccat
ttgccctgat gcagctttaa ctcttctgct 28500gaatcctaaa aagtatagaa agttaccttt
taaaactttg aggtctttac ttctagatgt 28560tcatgatgtt cacattatta taagagttta
tctttttctg aaaaaaaatg attttatgcg 28620agaagcacaa tttaaaccct tgctgctttg
agcaagggtc cactgccctc agccttgact 28680gccctgagga ctctgatgaa gtcgtcaact
tctctctgct ttcttctttg tggacagctc 28740attttcgggc tccactttgt ggccacaaac
acctacagcc tggttccatc ctcatgccta 28800acagtcatgg gattggatct tatcagctga
gggaagaaat atggtcccct gtccactact 28860gggtactcac tcaggctgtc tttagtagag
tatgagtcag ccctgaaaag gacattggta 28920tattgccaga aaaatggatg ctgtgatagc
aaaaagaaag ctttgctgcc acagttcaat 28980ttccccttta tcacttccct gcaagtaacc
ctatggctca ggccctaagc caaaagagaa 29040atttgggtga ggaactgtat aaacaaactt
accaaagtta gaaggaatta ccgatactaa 29100tctaatttct tacacatctc tccagcctgt
tacatacacc ctatttgttt taactttttt 29160tccccatgta atttaatctg atgtcttctt
ccgtatggct gattagctgt atgtcagggt 29220ttcttttttc tttgagtgta ggttttcttc
ccactcccct atgactcaat gaatgatctt 29280ggcttaagct gctcccaaaa acccacggcc
agaaggacac aagtacaagt cccaatttat 29340agctgaaacc ctagagttga gaatattatt
gccaaagaag gaatgctttt aggaaggttt 29400ctgtttgatg ttaagaacct gtttttgtgc
acttattcta atttctccac gtggcatgcg 29460aaggagggca gcttctatcc tccaactctg
atgcaggctg gcttttcttt caggggtgcc 29520gattgccgga aggcatgctg tggtggttgg
gcgcagtaaa atagttgggg ccccgatgca 29580tgacttgctt ctgtggaaca atgccacagt
gaccacctgc cactccaaga ctgcccatct 29640ggatgaggag gtagggtgtc cagaggagag
gtaaaggtgt tacggtgggg agggtgggtg 29700tgccagaggc tgccatgtcc tttacactca
tgacctcatt taaccccatc atctcatttt 29760tacaagatga aaaaacaaat tcaaattaaa
ggctgagtgg ggtggctgac acctgtagtc 29820ccaacacttt gggagtctga ggcaggagga
ttgtttgagc ccaagagttt ttgagaccag 29880cctgggcaac atagtgagac gctttctctg
caaaaaaaat taaaaattag ccagatgggg 29940tgggcacatg cctgtagtcc cagctactcg
ggaggttgag gcaagaggat tgcttgagcc 30000caggaggtcg aggctgcagt aagctacgat
cacaccactg cactccagcc tgggctacag 30060agcaagaccc tgtctcaaaa aaaaaaaaaa
attaattaat taaaaaaaac ataaaattaa 30120aagcatgctg aagtttatac agagtttgtt
ctgtgtactt tctctcttcc tttttctttt 30180ttctttattt attttttttg agatggagtt
tcactcttgt tgcccaggct ggagtgcaat 30240ggcgcaatct cagctcacca caacctctgc
ctcctgggtt caagtgattc tcctgcctca 30300gcctcctgag tagctgggat tacaggcatg
tgccaccacg cctggctaat tttgtatttt 30360tagtagagac aaggtttctc catgttggtc
aggccggtct tgaactcctg acctcaggtg 30420atccacctgc cttggcctcc caaagtactg
ggattacagg catgagccac catgcccgac 30480ctttgtcttt tttcttttga gacagggtct
tgctctgtta tttaggctag agtgcagtgg 30540cagtccatga gagcccactg cagctttgaa
ctcctgggct cgagtgatcc tcctgcatca 30600accttctgag tagctgggac tacaggcatg
tgccaccatg cctggctaat ttttaagctt 30660tttgtagcga caggggtctc actgtgttgc
ccaggctgga ctcaaactcc tggactcaag 30720cgatcctcct ggttcacctt cctaaagtgc
tgagattaca ggcgtgaccc aacatgcctg 30780gcctctcacc aagtactttc taccctaatc
tgcctcgtgg tggacaggga ttagcagtac 30840aggcactcag attgacctct ggcagccaga
tcccatgtgt gtaaaagtgg tggcctcaga 30900gtgaaggtgc tccagaggca gtcgtgggca
gtagtctgct aatagctatg gcagcttgtt 30960ctagatccct gcatcagatt cctgagccca
agaatgcgca cacacacaca cgtgcctgtt 31020gacagtgcag tctcttaaat gtggggagag
agcaggagag aggcctagta ctttctcttt 31080taatcctccc aaagtagtgc tctacattct
gtaggagaag gagcagtgtc tggacataac 31140acgcagggcc ccaagcctgt ttgcccctct
gccttgcagg ttgacactga tttccctatt 31200gggtaaaatg gctggtgaaa gggagctctt
tggcatgtgt agaaatgagc acggcatgtg 31260aagggctgtc agaaacagtg tgtggcttac
atttgaggcc ttaacccaat atcttatgaa 31320atagcttatg acacttaaga atttaataca
aagctcaggg atatcctttt catttctggg 31380tttttttttg gtgtgtattt cattatttca
tttctgatgt ccaaatcccc tacccctagg 31440taaataaagg tgacatcctg gtggttgcaa
ctggtcagcc tgaaatggtt aaaggggagt 31500ggatcaaacc tggggcaata gtcatcgact
gtggaatcaa ttatgtccca ggtgagtgtt 31560gttggaggag taaggtggct gctggtattg
aggatggtat ctaggtcatc aaaagaagcc 31620tgagagagaa gcttgtctca ttttatgctt
gtagtactaa ggtttaggag actgactagc 31680ccaagggtgc acagagttag tagatagaag
agttgccact tgcaggcagg tctgcctggt 31740gccagagcct ttgattttca tcatgtgctg
tacagcttcc aagatgactt tgtccaggga 31800gaaagtgaga gggagggcag gagggatata
tcactttatc ttcatgtaga gatcaccctt 31860tattttagca aagttgatgg atggatttaa
aatacagtat aattattgca aagtgctatg 31920agagtccaga agagggactc attctgccaa
gaggagggct ttacaaagaa gatatttgac 31980ctgggttttt ttctaggctg agaggataaa
tctagccttt taatcatgtc caaaactgta 32040ctcggcatcc tcatccccaa gttacccctt
actcccgtct tttggttagt atcaccattc 32100ttccttcccg caagtctgaa gccttttagt
ttccctccca agcattccct atattggacc 32160agtctctaaa tattaggaat tgttttctct
ctcctctgtt ctgagtgtca ccactgagtg 32220ttaccttctc agctgatggg ctgtgataac
ttccttgcag gtgtccctcc ctccagctgc 32280ttcccactca gatctatccc cgtgctctcg
ccttgccacg ttcaccttct ggcatggaac 32340tctgagcctc ttccttctta cttgccagac
tccttagcct cacactaatt tcagtgacct 32400ctgcagcctg tgatctcact tgcgcatggt
gaatacctgt cttcaacttg taaagtggac 32460cactttccct tccctgcgca tgcctcgttt
gccctttgta gttccctcac ctgcattctt 32520agtggagtgg gctactcttt caaacagaaa
acttcacaga acaatagggc aaaacttgcg 32580tacgtcctca tagatccaag aaaggaacaa
tctttaacgg agcaactact gtgtatccag 32640cacaatgcca ggccttgggc tacacaactg
aataaggttc aacttcacct tcaccttcag 32700ctgcttcctc cccatctcct ctctgtcatg
tctgttattc cttccaaacc ttgtttgatt 32760tccccttatc aagggatgat ctttccttct
ctccctccct cctctcccat agaacatgtg 32820tattgtatct tttgttgcat gtgtctcatg
tacggcgtcc tgtgctctcc tgcctagata 32880gctcctcaag ggcagtggtc actgtggcgc
cacatccagc ttcctgccat ctgaccactg 32940gcactcatat ctattgaatt gtcgaaggga
gttatagctg gaagtctttt ttctcccttc 33000ccatctcaag tttcagatgt agacgctgag
ctgccaggaa cattagtcac tgtcaagcct 33060tcaccgtttt taacctccct cccactctcc
ctaccgtgct tttaagacag tagctcataa 33120ttgatcactc attttcttgt agtgtaatta
tctaacaggc gataaagaaa ttgctctcct 33180acaactcctc actttgaata cttgcgatat
gacattcaag ctcagtcttt ttatttattt 33240atttttttaa tttttttttt ttttcttgag
acggagtctc gctctgttgc ccaggctgga 33300gtgctgtggc gcgatctcgg ctcactgcaa
gctccgcctt gcgggttcat gccattctcc 33360cgcctccgcc tcccgagtag ctgggactac
aggcgcccgc caccacgccc ggctaatttt 33420ttttgtattt ttagtagaga cggggtttca
ccatgttagc caggatggtc tcgatctcct 33480gacctcgtga tccgcccacc tttgcctccc
aaagtgctgg gattacaggc gtgagccacc 33540gcacccagca agctcagtct ttttaaaacc
tcgcatcttc tgtatcatat ttctgctttt 33600ggcatttttt ttttggatta catgtgagaa
atggagaaga aaaacctgag tggggatctt 33660gagtgttgca gataggtagg aaatgttgac
ttaaacatga aacttgccat ccttttcaca 33720ttgattcctt atgtagacaa taaaatatga
ataaaatcaa agcagtatct tcccttctgg 33780tcaacagtgt ttttactgat cttgggcata
taggcaaatt aatgctgata cagctggatg 33840ttattgtgtt ccccaaagga tggcagaatc
actcagttac agaccagcaa atgggctgca 33900ggacctaaag actgatattt gcaaatgttc
actttagaag caaacttttc taccaccgag 33960ctacttcttg gttcatattt tgattattta
aatagtttgc atggtccact cagggataac 34020attgtaagtc acttgccaaa ttaagaatga
atcaaggtgt gtataccttt tacaaagaag 34080taatattttt aaagattcaa gtgcagggca
gaaagtattg gccagagaaa cacataaaag 34140tgctcttcct gcttggctta ctaaagataa
attgagaagg gccagatcaa tagtgtttta 34200tttggaaggg ttaaaaagtg ggaacagatt
ccccacccta atgcaatcca gtctcaacct 34260atccagtgta gctgcttctg gactggcctc
tattattagg gtccccaaat taaatgtaga 34320agagtcaggg ctcagctgga tctttagttg
atttttgtac atgtgatttc ccaaagaaag 34380agaaattcta gggctggggg tagataacca
aaaggtttgc tcattactga ggtcatatgc 34440ttaggctgag tagtggtcac tttttccact
gaccacatca tcctatccac gaagcactgc 34500atgtttagca gtgtctcgtc aaccctgaca
gtatttggtt taggagcaaa actgaactca 34560ccagtgctgc gtttagtcat gtagtcaaag
atatgtgtct cttacctgtg ggaaaacgaa 34620aatgactaac attaacacgt agaaatagat
gcagcgggta tttcaagcta actacagtgt 34680gaagacgatt gaaacttgta cagagggcaa
acctctatct atatttcctt ggcattttga 34740aatgactcta ggggctgtaa gaggaaatgt
tctgccagtt tgctgcttta taatcttcta 34800gaaacttgaa accatcccaa tgagttgtgt
tgaccagctc cttttactcc atataaaaaa 34860tatcttacgt atctgtggtg gaaagagaaa
ctcttgcgtt tggaggctgc tttttcaatt 34920tcagtaatgt acacattttc tcattttgcc
ctcagcaaat ttgtgatagg tagtgttaag 34980ccttctttgc attttggact tcagattaat
atggagccca caggagctgc agaactaact 35040tttttttttg agacggagtt tcgctcttgt
tgcccaggct ggagtggaat gctcactgca 35100gcctccgcct cccgggttca agtgagtctc
ctgcctcagc ctctgagcag ctgggattac 35160aggtgcccgc taccaagccc agctgatttt
ttaatatttt tagtagagac ggggtttcac 35220catgctggcc aggctggtct cggactcctg
acctcaggtg atccacccgc cttggcctcc 35280caaagtgctg gcattacagg tgtgagccac
cacacctggc cctaattttt aataaaagag 35340ctgatgctac agctaatctc ttttttttct
gagacagagt ctcgctctgt tgcccaggct 35400ggagtgcagt gacgcaatct ccgctcactg
caagctccgc cccccgggtt catgccattc 35460tcctgcctca gcctcccaag tagctgggac
tacaggtgcc cgccaccatg cccagctaat 35520ttttttgtat ttttagcaga gacggggttt
cactgtgtta gccaggatgg tcttgatctc 35580ctgatctggt gatccgcccg cctcggcctc
ccaaagtgcc tgctaatctc ttaattctgg 35640cttcttgcct ccaccttgga gcttcttctc
agttgtggcg atctgatcag agagaagtgg 35700tgatgtcgct aaaactacca gatgcagcgc
gggagcctta ttgcttcgag gtctaggccc 35760gcccccagcc cctcctggaa gtgcactgaa
gccaagtgta ggatgttctt tgtgtgagtg 35820agcaatgatg catcattggg ttttgttctc
ttgaatagct attctttgct ggtaagattt 35880gtagttgttc ttgtgtcctt ttaagggaag
tgtaattata ttctttgcct ttagcataca 35940tttaagtagc tgtgtctttg gaattggatg
ttgaattgtt cctccagctc acacatggag 36000ctcacaaaat gctgacagtt gctgtagaaa
cgatatctgg cagctgtgac agagttggat 36060ctttgtgcac ccaaaaaatt tcctttgtag
gctttagcaa attaacattt taaaatgggt 36120gctcattata tactgggtag ttggttttaa
gaataaaaaa cagttggtat tttatttgcc 36180acctgagaga ctacaaaagg attctccagg
ttctgtaaaa ccacttatgc atcttgtgaa 36240tttttgcagg taaaataatt ttctccctaa
gaaatagcaa acatctgtca ttccatagct 36300tttaagttaa gtaggcactg gagcctgatg
ggactaacac accctacaat tctgctgagt 36360ttttgtcgta actgatcacc aggactgtga
ttctgagact tagttttgat ttctccccca 36420cttgaccaga tgataaaaaa ccaaatggga
gaaaagttgt gggtgatgtg gcatacgacg 36480aggccaaaga gagggcgagc ttcatcactc
ctgttcctgg cggcgtaggg cccatgacag 36540ttgcaatgct catgcaggta attgtgaata
aaagtttcta taagagttct gaaaagctga 36600tcgagttttg gctgcttttc tccccaagta
gaaatgtcaa aaacctactc agaagtaaag 36660atgtgaattt attgagaaaa agggaaggga
agaatagaga aataagatca ggcacattaa 36720tagaggaaag tagggatggc ataattccag
tctgttttgg aatataataa ctttgaaaga 36780agaaaagtgc gatcacactg gcttggccag
gtctgttttg gtttgccttc cctttctttt 36840tttttttttt tttttgagac ggagttttgc
tcttgctgcc caggctggag tgcaatggtg 36900tgatcttggc ttaccgcaac ctccatctcc
ctgattcaag tgattctcct gcctcagcct 36960cctgagtagc tgggattaca ggtgtgcacc
accacgcccg gcgaattttg tatttttagt 37020agagacaggt ttctccacgt tggttaggct
ggtctggaac tcccgacctc aggtgatccg 37080cccaccttgg ccttccaaag tgctgggatt
agaggtgtga gccaccgtgc ccggcctgtt 37140ttccctttct aacatgcacc agctgacagg
ggccaggcga gcagccccaa agcatgttag 37200gtgcctcttg actcttagct aggattgaag
catggtaata agtgggtctg aagcagttca 37260ttttgtgatt gaactggagt gacctggagc
ttgaaactga ggtgaggatt aaaataacca 37320cctcttctaa gtttcattta tttcattttg
cttagagcac agtagagagt gccaagcgtt 37380tcctggagaa atttaagcca ggaaagtgga
tgattcagta taacaacctt aacctcaaga 37440cacctgttcc aaggtaaaaa taaagtttta
ctgatttaaa actttgtgaa ttgttggttt 37500ttagttgaca gatactgtgg gttcacatat
gctccctctg aaggtgcctt cagtggttgg 37560ggttgggttg ggggcttgtt actactgtgg
gattaattaa actcagtgga taaacattaa 37620tacctatttt ctatgtttcc aaagtgacat
tgatatatca cgatcttgta aaccgaagcc 37680cattggtaag ctggctcgag aaattggtct
gctgtctgaa gaggtagaat tatatggtga 37740aacaaaggcc aaagttctgc tgtcagcact
agaacgcctg aagcaccggc ctgatgggaa 37800atacgtggtg gtgactgggt atgcttttta
ttcatgttgc catccaaatc ttagtatcag 37860tcctgatact aaggcgttgc atttgcactt
ggcacatgta tgtagaggtg cctttaattt 37920gattttagca ttttcacccg tatttgatct
catttgatcc ttacagcaat cctgagaggt 37980aggtaagaat agattataat catagatgag
aaaattggtc aaggttatcc atctagtaag 38040tggaagagtt tggtcttaaa acccggtctt
aagatagaat caccttttgc tgttgttgtt 38100gttgttgttg ttttgagaca gagtctcgct
ctgttgccca ggctggagtg cagtggcacg 38160atctcggctc accacaacct ctgtctccca
agttcaagcg gttctcctgc ctcagcctcc 38220cgagtagctg ggactacagg tgcaaggcac
cacacccagc taatttttgt atttttagta 38280gagagtgtgt ttcactatgt tggccaggct
ggtctcgaac tcctgacctt gtgatccacc 38340cacctcagcc tcccaaagtg ctgggattac
aggcgtgagc caccacaccc ggccagaatc 38400acctttttga acccaaaatt cattaattcg
atggatttac taagcattca tttttattgc 38460tgggctgagg ttatattaaa aagctagttt
ttgagaatat aaaatgagaa attaaaatta 38520tagccgtgtc tctacaagat aaagtatatt
tgcttcttac agcaagaaga gctaaaatac 38580aggaatcatt ttttttctta aagttttttg
gaaattacaa ctaagacttt tgaaaaagat 38640atctttcagg gcttttgtct ttctttttct
ttgtcttttc tttctttttt tttctttaag 38700agatgggggt ttgctctgtt gcccaggctg
gagtacagtg gtgcagtcat agcttactgt 38760agcctggaac tcctgggctc aagtgatcct
cttgcctcag cctcctatat agctgggact 38820acaggcatac actgctatgc ctgactttgt
ctttcttaat ctttagatta cgagtcattc 38880tctaggatct gctaggtatg acttagtgat
tcagatacac ttaattcttt aaaccttttt 38940ctcttttgct ttcgtcttga agaataactc
caacacccct gggagaaggg aaaagcacaa 39000ctacaatcgg gctagtgcaa gcccttggtg
cccatctcta ccagaatgtc tttgcgtgtg 39060tgcgacagcc ttctcagggc cccacctttg
gaataaaagg tactagtgag actggaccat 39120gggtggtgac aggggacctg cttctccttc
agtcctccca ggcccacgca acctatgata 39180cttatggggt cttcaactca tttcaacacc
aggaatgtca ttctcacaaa cctctgtagt 39240catgcttttg atgaaggctg tcattggatc
tccccagctc ctgctgtcac tgttggccac 39300tgcacaggga ttctctggga tgggtgatat
gtagctggag gtgcttttat ttgaccctca 39360tagtccactg ctacagtaac cataaaggta
atcatagcta ctgatcatct tacagacact 39420ctagagacaa tgacaaaccc cacagatgtt
tcaccttgtt gggaataaga aaagcctacc 39480tttattgaat atatgctttg tgccagatat
tgtgctgagc agtttttgca tattattcta 39540tcatccagta cgagatagta ttgtcaccat
tttaaaatag cagcaactgg gacttcagag 39600gtggcttggt caaaggcacc tgtattaagt
accagagtga ttcttccatg tctatttcaa 39660gctctaaatt actaccatac tcgcacttta
ccttcagatt tttattgctg cggtactaag 39720aacatgtgtt tttttttttt tttttttttt
tttgagacag ggtctcactc actctgtcgc 39780ccaggctgga atgcagtggt gtgatctcgg
ctcattgcaa cctctccacc ttctgggttc 39840aagagactct cctgccacag cctcccgagt
agctgggatt acaggcacgc atgaccatgc 39900ctggctaatt tttgtatttt tggtagagac
ggggtttcac cattttggcc aggctggtct 39960tgaacttctg acctcaagtg atctgcctgt
ctcagccttc caaactggta ggactatagg 40020catgagccac cgtgccaagc tgctgtctta
attaaaataa tgtttattaa ttatgatagg 40080ttcttagagt tagctttctc tgcttggtga
gggtatcaca ttttgcttgt tctagcaatt 40140actacaaatg ttgacaccgg gaacaccttc
acccttccac cttttttttt ttttttggag 40200acacagtctc gctctgttgc ctgggctgga
gtacagtggc acgatcttgg ctcactgcaa 40260cctccgcctc ctggtttcaa gcaattctcc
tgcctcagcc tcccaagtag ctgggactac 40320aggtgcgtgc caccatgcct ggctaatttt
tgtgttttta gtagagatgg ggtttcacta 40380tattggccag gctggtcttg aactcctgac
cttgtgatct gcccaccttg gcctcccaat 40440gtgctgggat aataggcatg agccaccgtg
cccagcccac ccttccacct attttataaa 40500tgtaatcact ttgttctgct cttgagcttg
tcttaacccc ctttcaactt cagcacgagc 40560tattccttgt tctgcttatt gaacccatta
ttttcctttt actcttactg caaattgcct 40620cactgcacgg tctctgcctc acaaaaacaa
aatacatatt aatatatctc tgggctgggt 40680gcagtggctc atgcctaaaa tcccagcact
ttgggagggt gaggtggtag attacttgag 40740gccaggagtt cgagaccagc caggccaaca
tggcaaaccc cgtctctatt aaaaacacaa 40800aaattagctg ggcatggtgg tatgtgcctg
taatcccaga ccattaccct ccagcctgtg 40860agacagtatg agactctgtc taaaaaaata
tatatctgat ttacatttat tgcactgata 40920atgtggtaac ctggactttt tttgtttgtt
tgtttgtttt ttttgagaca gagtcttgct 40980ctgtctggag tcccagctca ggctggagtg
cagtggcatg atcataactc actgcagcct 41040tgatctccca ggctaggact ataggcatga
gcagccatgc ttggtgtcct gaacattttc 41100aactggcttt tggctaatac agattgagca
tcagaaatct agaaacctga aatctgaaag 41160gctccaaaat tctaaacttt ttgagtgctg
acacgatgct caaaggacat gctcattgga 41220tcatttcaga ttttaaatcc tggatttggg
atgctggatc agtatgatga aaatattcca 41280aaatctggaa aacaaaaacc caaatccaaa
acccaaatcc aagcatttta aataagggat 41340actcaacctg taatccaagt agcccaaaaa
aatctagcaa ttaggctgct ttctaacagc 41400ctaatctatt tatctctatt taggaaaaat
tatagtcttt ttgttaggtg tatgtatttt 41460aaatatttct gttttaaaag ccaaagtagc
tttatgagag taggaaattg aacaggttgc 41520ctttatgtaa ctgtcagctg ggtgttttaa
caaatgccag taattacaag gaacatattt 41580ctaaagataa tgagtaggta tatatttgtt
catggttttg tcatttaaac tcagatttca 41640attgagagac tagtattaga aaacacttga
tcaaaactgt ccaaatgatt ctaaaaaata 41700aataaataaa tgtgcttagt agttacaata
atgcatttgc atttatttga tttctaagtc 41760attggagaag catgcttaac tgagcttcca
cccttgacct gtcccctagg tggcgctgca 41820ggaggcggct actcccaggt cattcctatg
gaagaggtaa agtattctgg gatttggctg 41880aattagatcc cccttttttt gtcggggggg
agggtgctga attagctccc tttttttccc 41940catgacggag tcttgctctg ttgcccagag
ctggagtgca atggcgcaat ctcggctcac 42000tgcaacctct gcctcccagg ttcaagcaac
tctcctgcgt cagcctcccg agtagctggg 42060attacaggca tgcaccacca cgctcagcta
attttgtatt tttagtagag acggggtttc 42120accgtgttgg ccaagctggt cttgaactcc
tgacctcgtg atccgcccac ctcggcctcc 42180caaagtgctg gtattatagg cgtgagccac
cgcacccagc ctagattctt tattatagct 42240ttttctctct ggagattagc cttttttgta
gagctcttag cggacaaatg acttgtttct 42300cagcaaaagc aacacctagg gggaaaacaa
cttaatttgc agacttgtga tacagcttgg 42360tttcaaactt tggctttcag tgggaatgat
tattataagc aaatgaccaa atgaagcatg 42420gtatcgggaa gcaccaacac acagcgattt
cagaattcat ttatgtaact gcagaaattg 42480gaacattaca tagacataga gtcggtgctg
ctcccgtgca ctcctcctcg ctgttttgag 42540gtgtgtcctc agtaacccag tgtggctgtt
gatcccagtg gtgtggtatt tgtggtctct 42600ggacatctca ttttggaaaa actgcaagtt
ctgtatttgg gaggagattc ttctgtggac 42660agataattag tcttttatgt tttctttttc
tttttctttt tctttttttt ttttttttaa 42720gacagagtct cactctgtca cccaggctgg
agtgtagtgg tgtgatcttg ggtcactgca 42780acctccacct cccaagttct cctgcctcag
ccttccaagt agctgggatt acaggtgcgc 42840actaccacac ttggctactt tttatgtttt
tagtagagat ggggtttcgc catgttgccc 42900agggtggtct tgaactcctg gcctgaagtg
atccgcccgc ctcggcctcc caaagtgctg 42960gtattatagt tgtgagctac cacacccagc
cagtcattta tgttttcaag aactccctat 43020attgggtgtt tgtccactaa ctgggctgcc
ttcagtattc cattggcttt aaaataggga 43080ttgagctttg caatagaatg aaagttggaa
agatacagag cagctgggag actaatgtgg 43140cttctgttct tttgtagttt aatctccacc
tcacaggtga catccatgcc atcactgcag 43200ctaataacct cgttgctgcg gccattgatg
ctcggatatt tcatgaactg acccagacag 43260acaaggtagg atgccaaagc cccatgaacc
ccattgaaca gttttgaaag tattgaacca 43320tctgaattag tgttggtgtc ttgggtgctg
aggatgcagg tagcaaagca gtggggctcc 43380tctcatttta aagccccttt cttttcttta
aggctctctt taatcgtttg gtgccatcag 43440taaatggagt gagaaggttc tctgacatcc
aaatccgaag gttaaaggta agcttttttt 43500cttccacatt ttttatattg tatggaatct
ggaatctgat cattgattag gacataaaag 43560tcttcttgga gtagcctatt ttagatgaag
ttacatcagt aatcatagtc ttaaagtcat 43620gatatacata gcaaagatgg atggtagagg
ttatttctca tttatctata gattagaagt 43680ggctagtgtg gaataaatag aacactgaca
acctgccctc agctaagaaa gaaacgacac 43740ttcctttata acatgaacaa gagtcaggat
gtcactgtgg cggcatagca gcctttggag 43800tttgcatcat tgggacctta agcaaatttg
ttaaatgact tgtttctccg cttacttgtc 43860tgtaaaataa tacttgcctt gccctgaata
gataagggtc tgtgttaaac agttaaaacc 43920atagttatat ctatatttta tttgaatttc
tgtaacaaat acatagttag taagcataca 43980ccataccacc tgacccagaa atctcaccca
aaataaatga aaactctgtt cacataaaaa 44040tctgtacatt aatgttttta gcagctctat
taataatttc caaaaactgg gaactaatct 44100aaatgtcttt cattggatga ataaataaac
aaactggtac atccatccaa taaaaataaa 44160tgagctattg atacaagcaa caacttggat
gaatttcagt ggcattatgc tgggtgaaag 44220aagccagtct caaaagcttg catattgcat
gcctccattt atatgatatt ctcaataaga 44280gaaaactata gtgctaggga acagatcaat
agatgttaag ggtatggatg aagggagggt 44340atgaccataa aggaacagca aaggagaggt
gttttgtttt gttttttgtt ttcttttttg 44400agacagcgtc tctctctgta acccggctgg
agtgcagagg cacaatcaca gctcactata 44460gcttccacct tccaggttca agcaatcctc
tcatcttagc ctcctgagca cctgggacta 44520taggtacaca caccaccaca cctggctaat
tttttatttt atttttttgt agagaccgtg 44580tctcaccata ttgtccaggc tggtctcaaa
ctcctggtct aaagcgatgg tcctgcctca 44640gcctcccaaa gtgttgggat tactggcgtg
agccaccacg tctggccaca agggattttt 44700tgtttttgtt tttctgtttt ttaatttttc
aaaaatttat tggttggttg tttgtttgtt 44760tgtttaagac agagtctcac tctatctccc
aggctggagt gcaatggtgt gatctcagct 44820cactgcaacc tccacctctc aggttcaagc
agttctcctg cctcagcctg aatagctggg 44880gttacaggtg cctgccacca tgcccagcta
attttggtat ttttagtaga gagaaggttt 44940catcatgttg gccaggctgg tctcgaactc
ctgacctcag gtgacccacc cgccttggcc 45000tctcaaagtg ctgggattac aggcgtgagc
cactgtaccc ggcctgtttg tttatgtttt 45060tgagaaaagc ttttactctg tcacccaggc
tggagtgccg tggtgcaatc acatcttact 45120acagcctcaa cctccaggac tcaagtgatt
ctcccacctc acccaactag gtagctgaga 45180ctacagccag gtaccaccat gcccagattt
ttatatattg tgtagagacc gggttttgcc 45240gtgttgccca ggctgatctc aagcttctgg
gctcaagcaa tctgcctacc ttggcatacc 45300aaagtgctgg gattacttgc atgaaccact
gagctcagca tttttttttt tttttttttt 45360ttaagagatg aggaggccag gcgtggtggc
tcatacctgt aatcccagaa cgttgggaag 45420ctgaggcagg tggatcacct gaggttggga
gtttgagacc agcctgacca acatggagaa 45480accccatctc tactaaaaat acaaaattag
ctgggcatgg tggtgcacgc ctgtagcccc 45540agctactcga gaggctgaca caagagaatc
gcttgaacct gggaggcgga ggttgcagtg 45600agctgaggat gccccactgc actccagcct
gggtgacaga cagagactgt ctcaaaaaaa 45660attgggtttg tgttttacaa cacttcttgg
tcactatgat actatgatta tttaggtcta 45720ggcttaggga aatcttctgg tgctcagtta
tagggacaaa catgttaata atacctttgt 45780acatgtgtga gtaatgtgaa tttgcagata
tcgggataaa atcacttttg tcagacccag 45840aaaaaacagg gctgggaaaa catgaagaag
aggaggctta cacttacgtg tctgtgataa 45900gaactgtttg caaggcctgt aatcccagca
ctttgggagg ccaaggcagg aggatgactt 45960gagcccagca gttcgagacc agcctgagca
atgaagcaaa acctcgtctc tacaaaaaaa 46020tttaaaaacg agccaggtgt ggtggcatgc
acctgtagtt atacccagct actcgggagg 46080ctaaggcagg agcatccctt gaacccagaa
gttcaaggtt gcagtgagct atcattgggc 46140cactataatc tagcctaagt gacagaatga
gacctcatct caaaaaaaca aaagcaaaaa 46200ttgtttccaa caactttata aaaccccaaa
agaaatccct tcacgtcctt cacacatctc 46260ctgctttgca cagcttgcac atttcaaatg
tatacttatg tttccatgac aagacttatc 46320actagacatt cttcaggatg ggagtaattc
agataagatg ctcttgaaag tacacttatc 46380cagcaacggc aactgcacca atgaattgac
aactctggct ttgagcttct gggagcgatg 46440aattctgttt tttttttttt tttgagacag
ggtcttgctc tgtcacccag gctgtagtgc 46500agtggcacaa tcatggctca ctgcagcctt
gacctgaact ctctaagcag actacgtgaa 46560gctctccctc ttttcttttt tttttttttt
tttttttttt tgagatggag tctcacccag 46620gctggagtgc agtggcgtga tcttggctta
ctgcaacctc tacctcctgg gttcaagcag 46680ttctgctgtc tcagcctccc aagtagctgg
gattatagtc acccaccacc atgcccagct 46740aatttttgta ttttttagta gagacagggt
ttcaccgtgt tggccaggct ggtctcaaac 46800tcctgacctc aggtgatctg cccatctcag
cttcccatag tgctgggatt acaggctcag 46860ccacaggctg agcttcacgc ctagctgaag
ctctccttct ttgctagtaa aatcttccct 46920tccctcacta ggcatatttg tggcatccca
ggttataatc ctcgttgctc attcctgaat 46980aaactcaacc tacttggaga taattttttg
taatgtcttt ctttaggctg acagtaggaa 47040ataaccccca gctttcagtt tttccttctg
aatttaagat ggtttgcatt tcctagaggg 47100gatcccattt agaggtaggg gtcaaaattt
ttgtcttacg tttattgtgc ttctgtttgt 47160cttattttat gttttcattt tcctacactt
tcagagacta ggcattgaaa agactgaccc 47220taccacactg acagatgaag agataaacag
atttgcaaga ttggacattg atccagaaac 47280cataacttgg caaagaggta ccagagcagt
atacaagccc cgtttgtttt ggctatattg 47340cacaccctac accctccaga agagttgtta
aaatgtgagg ctcagaagct tccatgttgg 47400tattttacca gctaaggcat gaccaagatg
gtgacattat ttctttttct tgccacctct 47460ctcagtatct ctccttatta gctatcatca
aatctctttg ttggattggt cgaacatttt 47520gctggctatt cagtgttgac tgaaattagg
gtagttaaac caaaaatcac attttacaaa 47580tttaatttta tgcaaaaagt tttagtgtca
tgaagtagag aaattatgtt taatcacttc 47640ttgaatcatg tgtgaccttt attcttagaa
ctcttgtgtg gcataaatga cttgattttt 47700ttctgtagtt ctctagctta cctaatcttt
ttttttttga gatggagtct cgctctgtcg 47760cccaggctag agtgcagtgg cacagtatcg
gctcactgca agctctgcct cccaggttca 47820cgccattctc ctgcctcagc ctcccgagta
actgggacta caggcacccg ccatcatgcc 47880cggctaattt tttgtatttt tagtagagac
agggtttcac catgttagct aggatggtct 47940caatctcctg acctcatgat ccgcccacct
caggctccca aagtgctggg attacaggca 48000tgagccaccg cacccggcct acctaatcat
ttttttacct cattttatta aattctctta 48060tagcatatta ggtagccttc aaggtttaaa
tacttaccag ttcagtaagt aggtttcctg 48120gcttcactgc catatgctac ttttctcaat
gcacaactag tgtagcttag atttcacttt 48180gcatcaacaa tgcaaccttc tattctgtat
cactattgag aaagaatctt catccactca 48240tgcagaatag agctgaaggc cagtttattg
actgtttttg tgattttcct tgactcaata 48300accgttgcta gctaatggcc ttatttagat
gacactctca atgtctaaaa ccacattgga 48360atttctcttc ctttgcttca atagctttag
acaaatggta taatatttta taaggaataa 48420gctaaaagtc cattttgaag atttagtatc
cgatttactt ttcttttgct tcgtgtctca 48480ttcatgtttt ctaagcaact agaggcagtc
atacctattg actagctatt ttttacagaa 48540atattagtta catgaaacct gatacttttt
tcttttgaaa aataagggtt ggctttaaca 48600acacattaca gcaagagtgg ggggtgtgtg
tgtctgattg ctatataaga cactgctttt 48660ggtatgaaat tatgacagtc tttatggaaa
ataatttggc atattatcaa gtgccttaaa 48720aatgcttctc ctccttgact catttagagt
atgaactggg tttttttgtt cattttgtgt 48780gtgtgtgtgt ttttactttg aaattatttt
aattttgaaa tagaaaacat cttgaaaaaa 48840aaacccacaa aatatatagg caaatttgta
ctctgtgagc ttctcacatc cttacactgc 48900ttcaccaatc tctaagatga aaaactctgt
tctaaaacat attcagccat cagcaatgat 48960gttattagta ggaatatgga tcaatttaat
tttgtgtttc tttgatattg tttggcattt 49020gatactgaaa tggcaactcc tgtttcctct
ctatactcat agtactctca gcacctcact 49080tctgacacca gatgggtagt ttttctcttc
cacaacaatt cagctttcac ttctccagtg 49140gatattgact gagtatccta tgatttaact
caattctgac actaactgtc tggagcagat 49200cctacaggta aagggctcag tcctaaaaga
ctgccccact tcagacacga atcgcaagtc 49260caggttgtca cctgtgcttc tgaccaactg
gttataaact ggagggtccc cacaaccccc 49320tgctcaggtt catttgctag aatggctcac
agaactcaga gaagcatttc acttgtgttt 49380atggtttatt ataaaggata cagctcagaa
acaaccagat ggaagagaag cgtagggggc 49440atcactctcc cagcacctgg atgtgttcac
cagcccagag gctcatcaaa tcatgttgtt 49500caagagtttt ttttttgaga cggagttttg
ttcttgttgc ctaggctaga gtgcaatgga 49560gtgatctcag ctcactgcaa cctctgcttc
ccgggttcaa gcgattctcc tgcctcagcc 49620tcctgagaag ctgggattac aggcatgcac
caccacaccc ggctaatttt gtatttttag 49680tagcgacagg gtttttccat gttggtcagg
ctggcctcga acttttgacc tcaggtgatc 49740tgcccacctt ggccccccaa agtgctggga
tttacaggcg tgagccactg tgcccggcct 49800gttcaagagt tttatacacc cctaacacct
tacccagagg ttggtgggtg ggactgaaag 49860ttccagccct ctaatcattt ggtctttttg
gtgaccattc ctttcctgag gctgtctagg 49920agacctacca taagtaactt attaccataa
actcaggtgt gatcaaaagg agtttgtaat 49980gaataacaag acactcctat cacccttgga
attccaagag ctctgtgagc cttttgtcag 50040gaacccatga caaaggtcaa atatatttcc
tattacacta cacataccaa aagccagaac 50100tgagggtaca ctaaaatgtt aacagtagtt
agtcctggga ggtcgttagt tggttagttg 50160cttggtcttt gccatgtttt ttgtggttat
ataattggat agatattctg tcatcagaat 50220aaaaattctc tctgctaagt gtgctatgtt
tcctaaattt ccagaccagg cctggcaccc 50280agcgtgtttc aaaagatatt tgcacttgcc
ctgatttgat ggctcctgca tttcttacgg 50340tggtggaaag tcaatgagtc atcatcccac
agaacaacaa cacaaacctc agtcttttga 50400aaaaatgtag actgctctgt ttttcttata
tgtggaatat gctttaaaat agattccaga 50460tttctagtta actgtttgaa ttctctagat
gggataaaac caagccattc cacattcctt 50520tgctgtaaaa ataactcctg tgtatttatg
gtatatgaac atcatttctt aactttaaca 50580catattctag ctttgccatc cattttgaaa
tattgataga caatcatgaa attagactta 50640ttgctttaaa actctgctgc ttcctttaac
tttgtggtcc ttttactcaa aatcgttcta 50700tatcttgcct gtctgctgta gtgttggata
ccaatgatag attcctgagg aagatcacga 50760ttggacaggc tccaacggag aagggtcaca
cacggacggt aacaatttgt ccctttccaa 50820ggaaattagt tcagaggcac tagatcttgc
tgcttctctc ctcctccatc ctctctcata 50880cgactgatac tgccaggtgt tgttctgcct
tctccctttt aaaatggaag tgagtaggtg 50940tgtggcattg agcactcctg acagcttttg
caataaatcc catgtgtcac ctgtgtgttt 51000ttcccccggc attttttcct ggaggtggag
agccccaagt gaatatcctt gcatagaaat 51060tgattaaaat tgcttctttt ttgtggccat
ttcactgtat atctccttgg tttaagaggt 51120accatattat tcccttgtat atgaaagaaa
caataagcgc atgattataa gcggccttca 51180tagggataaa atcatagctt tttggtccta
ctataacatt tattttaaaa aactgacaac 51240tcagggctgg gcgtggtggc tcacgcctgt
aatcccagca ctttgggagg ccgaggcggg 51300cggatcactt gaggtcagga gttcgagatc
agcccggcca aaccccatct ctactaaaaa 51360aatacaaaaa ctagctgggc atggtggcac
gcgcctgtaa tcccagctac ttgggaggct 51420gaggcgcgaa aattgcctga acccaggagg
tggatgttgc attgagccaa gattgtgcca 51480ctgcactcca gcctgggcaa cagaccgaga
ctctgtctcc aaaaaaaaaa aaaaaaaaaa 51540aaaatgacaa ctcagggtct tggaataatt
aattaatata tttttttgct attagaggat 51600tattttcatt aagtattttg ggtagtattc
tgttattcta tccttttaag ctatttgttt 51660aagcctcaga tgtaaaagag gatcactttt
gcaattcaca cttctggttt aacacaaagt 51720ttttgctagt acctcttttc cctgcccaca
ggcccagttt gatatctctg tggccagtga 51780aattatggct gtcctggctc tcaccacttc
tctagaagac atgagagaga gactgggcaa 51840aatggtggtg gcatccagta agaaaggaga
gcccgtcagt gccgaagatc tggtgggtac 51900ccagacacgc caggcttggc gacatatctg
tgtctgttgt cctagggtct ttcagcagtt 51960attaataaca aaatgtaatg ctgtacttaa
gacattgcaa ttaattcatc aatttaatcc 52020tattttgcta attacaagat aattatgaaa
tgtttaaaaa gtacatcaga gtgaataata 52080gctggtatgc tttagtaata gatcacaagt
tccagatgat ttgagtaaca tcttcatcct 52140gtgggcatct ctaaaagtgg gtgtatgact
tcaatttgca gagaacatca agagtaccat 52200tgctttatct ttgttgctag gtagttcagc
tacactttga tggcttaata gagtgacctg 52260atgttaaaag ttggtattga actcttgcgt
aatattgcag tttcagcttt ggaagacaca 52320tattgagttt ggtatttata gaaatttctg
cccagatacc agagttacaa ataactgagt 52380actctgggaa cgagcattgt cgtgctctaa
aatagtctag aatggatttc tcacatcagt 52440cagcagctga gcttgattaa gatgcttgtt
tctagagtta catgtttttc cagatattac 52500atgtgaatgt catttgtatt ctatctggca
actttggcca ggcgtggtgg ctcatgcctg 52560taatcccagc actttgggag gctgaggcgg
gtggatcaca aggtcaggag attgagacca 52620tcctggctaa cacaatgaaa ccctgtctct
actaaaaata caaaaaaatt agccgggcat 52680agtggcgggt gcctgtagtc ccagctactc
gggaggctga ggcaggagaa tggcgtgaac 52740ctgggaggtg gagcttgcag tgagccgaga
ttgcaccact gcactccagc ctgggcaaca 52800cagccagact ccgtctcaaa aaataaataa
tttaataatt ggcaacttca tagcatatcc 52860agaaaaaaac catgaatggt ccaattcagg
tgaaagtatc aattggtcca agatggctaa 52920catgggtaag cctagaatgt caaatattgg
tttcagaagg ttgggttttt ttgctggtgg 52980gagttgatgc tgcacacatt tgttttgtag
ggggtgagtg gtgcactgac agtgcttatg 53040aaggacgcaa tcaagcccaa tctcatgcag
acactggagg tgagcagagt gactcctgcc 53100ttcttgaatt ggttttggac agtcagagca
gagtggttat aaagcacact tgcaaggcgc 53160ggtggctcac acctgtaatc ccagcacttt
gggaggccaa ggtgggcaga tcacaaggtc 53220aggagatcga gaccatcctg gctaacatgg
tgaaaccccg tctctactaa aaatacaaaa 53280aattagccgg gcatggtggc gggcgcctgt
agtcccagct gctcggaagg ttgaggcaga 53340atggcgtgaa cctgggaggc agagcttgca
gtgagccgag atcacgccac tgcactccag 53400actgggcgac agagcgagac tccgtgtcaa
aaaaaaaaaa aaaaaattaa gcacacttaa 53460cctgggtagt caagaccttc tccacttgct
catctctttc ttctcattct tcctcacacc 53520tgtgactggg acgttactga aataaaagag
atgactttat attttccctc tgggaagtat 53580tcttccttcc gattccaaat caattccata
ccgttgaatg tgtgatccca ctttgaagca 53640ggattggcag ctcagctcac ggtgtcctgg
tttccacagg gcactccagt gtttgtccat 53700gctggcccgt ttgccaacat cgcacatggc
aattcctcca tcattgcaga ccngatcgca 53760ctcaagcttg ttggcccaga agggtttgta
ggttagtgtt ttttgcaaaa ccagtgaata 53820gactgtatgt ttcttttaac atcaggggaa
ttgggatggc atttttactg ttgctttcct 53880ctttacagtg acggaagcag gatttggagc
agacattgga atggaaaagt tttttaacat 53940caaatgccgg tattccggcc tctgccccca
cgtggtggtg cttgttgcca ctgtcagggc 54000tctcaagatg cacgggggcg gccccacggt
gagtggtggg ttgaagtatc tgattatcgg 54060cagtgtgctg acggccaaaa ggaagttgga
tgacttctgc ctgtttcttc attgagttgc 54120tcttatcctc gtgattaaca ggcagcaaaa
gcaagggacg ggcagacagc ccttgtgttg 54180gctgccttat cactcacagg caccgtcagc
gctcagcatt ttaagagggc caaaatcggc 54240tgcctggcct acctttcgga ggacatctga
caatatctaa acccctccaa ggacgtgtac 54300tcggctagcc ccacaaaaaa ctgttttacg
ttcagactgt tctttttaaa tctgcatccc 54360tctgtatagt gacactggca accagccaac
caaaccagtt agctgagaca gaggtcagcc 54420ccttcttgag tatactgttc atgctggaca
gggcctccac accctaggga gggagtgggg 54480gagaggagaa tggcagatct gcccagggct
gccctgggta acagagaaac aggtggagac 54540tggctcatcc tcacctccag cctgaatttt
gtatttctct gaattcctac tgcttagaaa 54600ttcctactac tttttatcct agtattttaa
caccttttct ttagggccta catgaaaaga 54660catggtgcta ggttttctga aatcatttga
aaaactatta aattcctgtc caccaggagc 54720ttgtattata acttaggaaa taaggactcc
ctggcctaca ggataaaaat caaatttgtt 54780ttggtcaggg tttgtcctaa tgtagcagtt
ctctgttttg gcctcagaac ctctttacac 54840tcttgacagt tattgagagc ctaaaagagt
ttttgttcat atgggttata gctattgata 54900tttaccatat tagaaattaa aactgagtta
catgtaaaca taaagaactt tgtaaacatg 54960aaaatgcatt atcttttccc agcaaaattt
aggcattgta catttttcag atcttctctg 55020ggtctggctt aattgaagac agcacttgat
ctgttgtgac atcacatgtc tcataacctt 55080tggaaaagtc cattatatgc tcaatgagag
aatgataatg agagtaaaag ggccataaga 55140tgttacagaa tcttatggaa atagttttga
cttcatggac cccttcaaag gattgggtcc 55200ccggagcaca ctgggaatgc tgccataaca
accccagcac ccctatccag tcttagttat 55260ccctgagtgc caaaggcact cccaacccag
cacactcctt tctgctagta gcacaggata 55320gactcccttt tttttcccag tcacaactaa
cattcctctc tcttacaccc ctcactcacc 55380ctcacaccat cacacctcct ttccttgctt
cctcctgacc tcacagtcag gataatgtcc 55440cataatgctg tgcatggagg ggaacatttg
cacatatggc tctgtcaaga gagtaggcct 55500cctgctctca ttagcttgct gacgggaagc
tacctgtaag gtccttgctt cttgctctgc 55560ctccacacag ccccttctgg agttggagca
catagtctca ctgcaatgga agagggcaga 55620gcttcttaat cactatccac tgcccatggt
ggggcccatg aaatggacac cttcacccat 55680ttgctcattt ctgtggatgc agcagggctg
agccacggca tagatcaaga ctgggggctg 55740gggggcgggg cggggaggca ccagacacct
ccatccttta gatatactct acaaaagaga 55800aagaaaatcc ttctgtattg ttttccctct
tgtttgtatt gaccctgcac cacggttcag 55860ctccaccaat gctaggttgg cactgtggac
cagttcatga aaccatggca gggctcatca 55920cgggctctat cgacagacat cccagggtga
cttccgcagg cccgaccgct tcctctttcc 55980ccacactccc cttgacttcc aaccccactg
ctatggcctt cactcgagtc actgcacctg 56040cttggtacac ccctttccac ctctcctgcc
atcgatccgg tccttctcac ctccttaatt 56100caggtcatag aggtcttcca cgctgcccaa
agcagtcgat tttgttctgt gctgtgtgat 56160tggtcacata ccctaagtcc ttagggcagg
aaataatctt atactactgt acagcctgca 56220agcattgcag cgtcttgcct ttaacacatg
aaaagcacca ttttaagcta ggagatttaa 56280ataggaaatg cctctgactc tgtttctttt
ccttccaggt cactgctgga ctgcctcttc 56340ccaaggctta catacaggag gtaacctgag
ttatttctca tcacgtgtcc tagaaagcac 56400cacaccttga atcagcattt ctccacctgg
cccatgtgga aatgaatggg ttattgctgt 56460gacccagggt gcagggtgcc aaaaggcctg
cagtgtgctg aggagttata ttaaatgagt 56520agatagggaa gatgtacctc acaaaataga
attgtcccac atagggtgtc aataagccac 56580tgccctagac cagtggttga ttctcaacag
ctgattttga ccccggggac atttggcaat 56640gtctagagac tttttttatt attacaactg
gaggtgggga gtgctactgg aatctagtgg 56700gtagaaactg gggatgctgg ctaaacatcc
tttaatgcac aggacagccc ctacaacatt 56760atccagctca aatgtcggag tgtgaggctg
agaaaccctg tcctagacta atagcagttc 56820ccttccattg tcttccctgc attggctggg
taagttccaa atgccattca gatcagaagc 56880aagtagagac cttttttgaa atggcttctc
ctctgccgat ttaaatccat attaacgggc 56940caagatatat ggtcaggatt ctggttgttt
ttctagacaa agcttcctgg atatttatag 57000catcaaacag gaggctagag aacaactgtg
agctcaaatc ttttcatatt tatgtcattt 57060accatcaaag aaacatctct aaagccagtt
tggggtggtt acaggaaaga caggccatca 57120gataaggtgg caaggtcacc tgctgtcact
gctaattcaa tttttctgcc tttttggaag 57180gcttttaaat tatcaagcaa tacagggtca
ctgttcaggc cccaaatgaa gtatttcaaa 57240actggctaga aaaaattgct ggtaccttca
gtctgcctgt cttctgagtt atggtttgcc 57300ttttatggtc atttgtctga ttcacgcgta
ggaggggaat tcacagttct ggccctcctg 57360ccaccccaca gcctttcctg tgtctaatta
cagtgactgc ccctagctgg cccagctggt 57420gtcttcattt aagaaattct acttatttaa
aactgttctt gatcagcgtt acatattaat 57480tggttttcat gtgactagag gtaaagaatt
agcagtgttc tttagaagct caaataaatc 57540ataatctcag ttttcaatta ttaaaggaaa
aaagcataaa tcacatttat aatgaaccta 57600gttgttttcc accttttatc ctacctattt
ctgctttttt gggatcaaca aacattagtg 57660ctgggttttg ctcgatgatt cagtatatgt
atttttatag aagactcttt tatttttaaa 57720atatttaatg tattgttaag cctacaatag
taatataccc tttgtaaaaa ataataatta 57780aacattataa gaatgtaata tatgtgagtt
gagtcccctt aacagttcgg tgcatatttc 57840tcgacattta aaaatttttc atgaattata
tacatgtatt tttgtgtttg tcctaaaatg 57900gaataattaa gctctccatg gttttgcacc
tttagtacat cttggacagg ttccatgtta 57960acacagatgg gtctcacttt cataacatct
gcatgataaa tatttattga gtgtatatgt 58020cataattcat tggcccagtc ccccttggag
gaggtttatg tttataattt tccaatgcta 58080caagcagggc tgcagtaaac atacttaaag
atacatttcc ttaattgctt ccctttagtt 58140tgtagtactt ggtttttggt gggttttttt
gttttgtttt gttttgagat ggagtctggc 58200tctattaccc aggctggagt gcagtggtgt
gatctcggct cactgcaagc tccgcctcct 58260gggttcacac cattctcctg cctcagcctc
ccaagaagct gcgactacaa gcacccacca 58320tcatgtccgg ctaatttttt tgtattttta
gttgagacgg ggtttcaccg tgttagccag 58380gatggtctcg atctcctgac ctcgtgttcc
acccgcctcg gcctcccaaa gtgctgggat 58440tacaggcatg agccaccacg cccagccggt
ttttggggtt ttttttacag acagggtctc 58500actctgtcac ctaggctgga ctgcagaggc
gtgatcatag ctgcagcctc taactcttgg 58560gcttaagcag tctccccgct tcaacctccc
aactagctgg aagaacagga acacgccacc 58620acaccaggct aattttttaa ttttttgtag
agacggggtc tccctatgtt gcccaggcag 58680gctgctgtcg aactcctagg tgcaagtgat
cctccgatct cagcctcgca aagtactggg 58740attacaggtg tgagctacca catccagctc
agttcttttt tttttttttt tttttttttt 58800gagacagagt cttgctctgt caccaggctg
gagtgcagtg gtgcaatttc aactcactgc 58860aacctccgcc tctggggttc aagtgattct
cctccctcag cctcccaagc agctgggact 58920acaggttcct gccaccacac ctagttaatt
tttatatttt taatagatac ggggtttcac 58980catgttggcc agtatagtct taatctcttg
accttgtgat ccacctgcct cggcccccca 59040aagtgctggg atgacaggca taagccacca
ggtccggccc tttttttttt ttttttttta 59100attaacttct cagttgtaga gatggggtct
tgttatgttt cctaggcttg tctcaaactc 59160ctgggctcaa gtggattctc ctgcctcagc
ctcccaaagt cctgggatta caagtgtgag 59220ccactgggcc cagccactgg cctgattcct
aaagatagta tttatcttat ttcttgccct 59280cgttcacggt cgagttcatt aaaacctttg
aaaacatcac tgcttacttc atatttcctg 59340gcagtattta cggcagccat ataacgaata
cctcttctgt gccagccact atgttgttag 59400gctctgtctt ctgatggaag ggtctgtgct
gtgcagagta gagttaattt gtattatcat 59460aaccctatga atacataaaa tctataaaca
caattggtaa atataatttg ggtaagggta 59520actaattcct ttcaaacagc aaataagcta
taatgatgtg gcataatgac aggaaacata 59580ccagctcttt ttttcttgaa ggctgggaaa
caagcagagc tctactctag tgtcataggc 59640tggagtgaaa gctgcagttc tccatagcat
ccactcaccg cactggaaaa aatgcaccaa 59700gggaccttct ctcttctttc ttggcctcct
tgtcctgata gtgagtggct gctggctcaa 59760ggaggttgtt tgcctttgaa gaagaacctg
cagtggtttt caactctctg atctcatagg 59820cctttcactg ttgtttttac agaacctgga
gctggttgaa aaaggcttca gtaacttgaa 59880gaaacaaatt gaaaatgcca gaatgtttgg
aattccagta gtagtggccg tgaatgcatt 59940caagtaagtg tagagtgtaa gcgaaaagga
tgaatgtgga aaatctcctg gagctgattg 60000tacagcactg cttttagctt tattttgttt
ttcagttact gctattggtt atagcctgtg 60060gttttagcct gcactgtgac aggcattgca
tacgtcactg cgggctcacc acctcaccca 60120ctgagggtcc caaagtcaga ttcagctcct
atgggccctt ttcctttcca ctgggggaag 60180taacacacaa gcctgaagtt caatgttcct
ttttcatctt tagaggcctt ttatgctaac 60240agaaggtcat aggcctatag tgttccatta
ttttactaga tctggtggga tgttgtcttg 60300acaaagaatt gggcatatta ctctggaagg
ggtcatgcaa gaacttgtgt gggttgttta 60360ggtattgcat tggtggtcag ggttttgtta
taatggcctc accgtagtag aatgggaagc 60420aaaaaatctt tttttttttt tgggggacga
agtctgtctc tgttgcccag gctggagtgc 60480agtggcacga tctccactcg ctgcaagctc
tgcctcccgg gttcatgcca ttctcctgcc 60540tcagcctccc tagtagctgg gactacaggt
gcctgccacc atgcccagct aattttttgt 60600atttttagta gagacggggt ttcaccgtgt
tagccaggat ggtctcgatc tcctgacctc 60660gtgattctcc cacctcggcc tcccaaagtg
ttgggattac aagtgtgagc caccgcgccc 60720agccagcaaa attcttaaat gctagtattt
actgattcct tctggagtgc tgggaatttt 60780ccaagtattt tacatggatt taccatctga
gtctcataac gtcccaatga aataagtgct 60840gtgattattc gcgtattacc agggtggaaa
ttgaggcaca taaagattag gtaactggcc 60900aaaagtcacc agctagtaag tttcggagct
gagattcaaa cccaggccaa atttcttggt 60960tagtgttcgt ttatcagtgg gtaattcaca
tgaggtttaa tggcaaaaag aagacaattc 61020tgtctctcca gccattttcc atgctttgat
atgaaatgct tcctttatag gacggataca 61080gagtctgagc tggacctcat cagccgcctt
tccagagaac atggggcttt tgatgccgtg 61140aagtgcactc actgggcaga agggggcaag
ggtgccttag ccctggctca ggccgtccag 61200agagcagcac aagcacccag cagcttccag
ctcctttatg acctcaaggt gggtgatttg 61260ctgtctgcaa aaaaagaaaa aagacgaaaa
gggcacagtg aagtttctgt gtggctactt 61320ctattagagc cccatgcttc gcagcttcag
ccttgcggtt tgattcccac gtaggtgact 61380tacacaacca cagcctggac tcccagctgt
agacagcctt ctttttattt ttgagacgga 61440gtcttgctct tttgcccagg ctcgagtgca
gaggcatgat ctcagctcac tgcaacctct 61500gcctcctggg ttcaagtgat tctcctgcct
cagcttccca agtagctggg attacaggcg 61560tctgccaccg tgcccagcta atttttctta
tttttttagt agagatgggg tttcaccatc 61620ttggccaggc tggtctcgaa ctcctgacct
cgtgatccac ctgccttggc ctcccaaagt 61680gctggaatta caggcgtgag ccactgcgtc
cgcccagaca gccttcttat aaacatatgc 61740cactgcttac ttacagaggg gatttaagca
ggaatacccc agggtggagg gggaaccgtt 61800gctcatatcc taacttcgaa aagggtcaaa
gtacaggtgg cagaagagaa ggttacatat 61860agaccaaata tgtaattcac tgtagccttt
aaaacaaggc accaaccatg tagcagattg 61920gtaaggggct gcatcgagct tacaaaattt
ttctgatggc ttttcagtaa taggggaaag 61980aagtgtgccc ccacgttcac ttcctggtaa
ggtgctgcct caagttgagg gccctcacat 62040gcaaaaccaa agtactccag gaagcttcat
cattttttcc atcactaagc aaattaaatt 62100acacctggtc taagtagagc cctgcttact
ctatgataca ataaagttag ttcaggaaaa 62160ggttcttatt atagggttgt gttttgtttt
catttatctg tccagtgtat ttcaggatgt 62220gatttgacat taagtagtat ggaaacaaca
actggaaact ttgttaggaa tatgagatat 62280atgctgtagt taatttttcc tattctctag
gctagtttct ggcttgattt ttttgttttg 62340ttttttgttt taagacaggg tcttgctctg
ttgctcaggc tggagtgcag tggtacaatc 62400atggcacact gcagcctcaa cctcctggac
ccaagtggtc ctcccacctc agccccctga 62460gtaggtggga ccacagacat gtactaccaa
gcctggctaa tttgttttat tggccaggca 62520cagtggctca cccctgtaac cccagcactt
tgggaggccg aggtgggtgg atcacctaag 62580gtcaagagtt ggagaccagc ctggccaaca
tggtgaaacc ccatctctac taaaaataca 62640aaaaattagc tgggtgtggt ggcaggcacc
tgtaatccca gctacttgga aggctgaggc 62700aggagaatcg cttgaacccg ggaggcggac
gttgcaatga gctgagatca ctgcactcta 62760gcctgggcaa caagtgcgaa acttcgtctc
aaaaaaaaaa ttttgtttta ttttatttta 62820tttttcttgt agtgactggg tctccctata
ttgctcaggc tggtctcgaa ctcctgggtt 62880caagtgatcc tcctgcctca gcctcccaaa
gtgctgggat tataggcatg agccaatgtg 62940cctggcctgg cttggttttt aatagataat
catagacttt caaaaacagc cataaaaaag 63000cagaaggaat aaacttatta ctcttgagtt
ttagtgttct cttatttctc tttgcccctt 63060gcagatataa tacacttaca gttaattaac
attgacattg acattctcct gttgagcttt 63120gattttatag gcaagtttta aaatctgttc
agcaatcatc cttttagaaa aatgccataa 63180acacaaattt gcacacaaat gcaaagcatt
tataggcacc ctctccaggc tccaatgcat 63240agcacccaag gctggaagcc cagtccttga
aaaccagcag gagacctgaa agagcaggga 63300catatggact ttagaaccca tttctacttg
caacttttat ccattctggc aaacaggagg 63360ctgccacttt ttgtcttact cagcgtctgc
tagtcttaaa tcacccttca accaaagtgc 63420ttttgcttca gaagttctaa atgtgccttg
gcatttatag agctccagtt atgagatgca 63480tatgaaaata atctgtgttt catctttata
aagtccttat tgaatgttca ctttattcat 63540gcccattata taaagcccct tcataaatta
tcattgttta gttcaaaagg gaatttttgt 63600ttttatagat agattattgc tccttttaaa
aacaaaactt ggctgggcgc tgtggtcaca 63660cctgtaatcc cagcactttg ggaagctgag
gtgggtggat cacctaaggt caggagttcg 63720agaccagcct gaccaagatg gtgaaactcc
gtctctacta aaaatataaa attagccaag 63780catggtggcc catgcctgtg atcccagcta
cttgggaggc tgaggcagga gaatctcttg 63840aacccgggag gcggaggttg cagtgagctg
agatgacgcc attgcactcc agcctgggca 63900acaagtgcaa aactctgtct caaataaata
agaacctgcc taatatgttc ttttagcttt 63960agttgctata aagacagata tattgcctaa
tttttgtaat ttgctcatcc aaattaccca 64020tgagtagctt tgcttctgat ttcctttagg
caacctctcc gtatgctgtt ctctttgaat 64080cttaatgccc tgaactagtc tacacagcat
ttaagaagcc agctccactt ctctgaaggc 64140tttgccccgg aggaaacagc ccatgggact
gaggccctca ccagttagac cactggtctc 64200tgtccactgc actgtttgct tcaggtgcca
cagaaagtgg acatgcccaa tgcgtgttga 64260tcatgtaaat cactgggttg ttttttctaa
tacctgggtg attttgattt tatgactatt 64320tcctaattcc tggtgcttgc acttaatcaa
ttgtgttaag ggtggcatct gcctctcata 64380tacagactgg gagttatgta gatggacatt
ggttgtctta tttcataaac agtgtcccca 64440ggtctggttt gacaggactg atctccagta
ttagaataga acatctactt tttaattttt 64500attattattt taaaattttt attttttaaa
gagacagagt gttgctatgt tgcccaggct 64560ggtttcaaac tcctgggctc aagtgatcct
cctgccttgg cctcctccca aagtgctggg 64620attacaggga tgagccacca tacccagcct
gaacatctac tttttattta tgaagaaata 64680attattcttg ggtagtcagc aacactgttt
atgatgtact tagcgagaga ttttaaatat 64740gtacttatgt aagtatatat ttaaaacctt
aatatatatt aatatgtgta tctatatcta 64800tatatatatg tatgtgtgta tatgtatatc
tgcatatata gatataaaaa ctttacctct 64860gtgggacatt ttcacttata aaatctgagg
cttgatatta aaagaaagtg ttcctggctg 64920ggtgcagtgg ctcacacctg taatcccagc
actttgggag gccaaggctg gcagatcacc 64980tgaggttggg agttcaagac cagcctgacc
aacatggaga aactccgtct ctactaaaaa 65040tacagaatta gccgggcttg gttgcgcatg
cctataatcc cagctactcg ggaggctgag 65100gcaggagaat cgcttgaacc tgggaggcag
gggttacagt gaactgagat cgcgccattg 65160cactccagcc tggcaacaga gcgaaactcc
gtctcaaaaa aacaaaaaga aagtgttctt 65220aacccaatca tcaacctcaa actatttgct
tatgtatatg tatataaaaa ccatcttctg 65280ccccttttag ggactctgac ctagaatgtg
gctatgtcct ggttaaatgt cctcacatgt 65340gtccagtcat ggtgtcccca tctctttctt
gtgcattagc tcccagttga ggataaaatc 65400aggatcattg cacagaagat ctatggagca
gatgacattg aattacttcc cgaagctcaa 65460cacaaagctg aagtctacac gaagcaggta
gatgtttggt tagtttgtcc tttcaactct 65520ttgcaaagca ggacttggag tcacaatctc
tgggtcctcc cagccctgcc aagtacccgt 65580tgcttcactt ctctgggtgg cagccttctc
tcctctgaaa tactgagttt ggattaggtg 65640gcacagtccc tggttcttta tttacccata
ggcgttatat gaatgtgtag aaatccatga 65700ggaaatacag aatgtcttac caaaatattg
agaacgtctc cagtttttgt tattttgaga 65760tagggtctgg ctccattgcc caggctagag
tgcagtggca caatttcagc tcactgcagc 65820ctctgactcc taggctcaag cgatcctccc
acctcagcct cctgagtagc tgcgcctaca 65880ggcacgcacc accacacctg gctaattttt
gtgttttttg ttgagatggg gttttgccct 65940gttgcccagg ctggtcttga actcctgggc
tcaagcagtc cacccatctc agtctcccaa 66000agtgctggaa tttataggca tgaaccactg
cacctgaccc tctccaattt tattctcata 66060tctactccaa actattattt taaattagtt
ttatgttatt tccatagatg gaaataaaat 66120attcacttgt ggactacagt tgtgagtacc
taactccctc acccaacagt tctttttttt 66180tttttttggc ttaataaata agctggagga
cagcaagata gagatgtata taaagggagt 66240tggatgtttc gaataaattg aagaatccat
gtaatcacag ggcccagatg gtcattgctg 66300ggttgtcatc ttttcatttg tcctccctct
cttcccttct ttccccaggg ctttgggaat 66360ctccccatct gcatggctaa aacacacttg
tctttgtctc acaacccaga gcaaaaaggt 66420gtccctacag gcttcattct gcccattcgc
gacatccgcg ccagcgttgg ggctggtttt 66480ctgtacccct tagtaggaac ggtaagtgca
tgctgcaagg gagtagtggg cgcatctgca 66540cttctcgtct gaagtgtgtt gccgaaacca
tcaagcaaat gccaagtgag cagagttcac 66600tgcccagaag aaaattggaa tcgggcctat
tatgattgct gtggatcata agctataaag 66660caggagctat atagctcttg ctgtggacca
tcttggtgtc aaatcaagaa tgatgccagg 66720actactcata ctgaataaaa aattctgttt
cccagggaca gtatgctctt cactgaatct 66780aggatgagcc cagttgatag gctggggtag
tgctaacata atcacagtta atgtttattg 66840ggagccttac atagattaac tcagtccccc
cacagcccta ggaggtaggg acagaggttt 66900tactgacagg cctagaactt acgtctgaaa
ttcagggggt tcattatttt taagtaagga 66960atttccccat taaaaaatgc aaatggaccc
ttgagatatg aaaggtgagg ttgcttattt 67020acgcttcaag gtgaacaagg tgtccattgc
tttcttatct tacagatttt atgcatattc 67080aaagaagaat attgccagtt gggaagcaga
ggggagggat gcttaaggac ataatcatag 67140tttgggctga cattgttcca aggttaactt
ggatcctgaa tctaggttat cccctaaatc 67200tgccctgtat cctgtatcta cagattgtac
tgccatgcaa tacaggactg aaatattaaa 67260ctaaatcaac acacatggaa aagaaaccaa
catacatcct tttggattta tgtcattctt 67320tcagtatagt aagaaagtgg taggaggtgg
cattatagag ttgaaaacgt aggacattag 67380agctgaattc attttcacct cttcagatct
tggccttggg caaattatga aaggtgagag 67440ccctgctgcc ctcagctcac actgtaggca
acatcatcac atcacagtgg tgcttagcac 67500aaatgcctgc atctcactgc ctccttagct
ctctatagag agcagaaggc ttagacttgc 67560tatgatgtac atcagtctcc aggatttgta
gaaatccctt gtggtaggaa ttttccacca 67620tttttaccca gcaccttgaa tagtgcagaa
atatgtgttg ttctttttgg agaagttgtc 67680agcagctacc agctgaaaga cgggtagctg
tactctttct tgtcactttt gatcttcaca 67740tgaagaaaag ggtaacagta gaggtgaaag
caaatctgga agcagtcagg ccagtgagag 67800tcaggctgta aggtggagag agacacagcc
tgccgaactc aagactttct catttagaac 67860aaagtatgtg tcagaggctt ttaactttaa
gttggaccaa ggcgtttcca cacatgatca 67920aatggtttaa tcactattct aacaccttcc
agttgccgtt ggggccttaa cgagactcag 67980attatttgaa tattttctat attaattcaa
agtgtgatgg cttattttca gaactttata 68040tctaggttca tacatgtgcg tacacataat
attgtctttt tttgtatttt ttctgattat 68100cagtgaccag tgtttaggag ggaaaatcct
cttggggaat gagtgaaaat atcaaaaatc 68160ctactcatca ctgtttaggg aaccatcttt
agggttttct gctgacatgc acatgcccgt 68220ctacaacaat gggcatgtgt cttttatctt
tataatgtca tcttccggtt tttctgctgt 68280gtctaaggac ccatcctttc ctctctccta
cttgtctcac acctagcacc tccttacctc 68340ttacacaata taagaatata aaccagttgt
cagatcccta taatgtgttt tggggaatat 68400tgattggtac tcagaagtca gtcagaatga
ggttttctaa agtattgatg atgttttgtg 68460ttacagaaga tgactctttt taaataacct
ttttatagtt aagttttaga agtaacctaa 68520gctgttttcc tgtctagatt tgatttggaa
cttgtgtgta ttcattatat aggacattgt 68580tacagataaa aatatacgga gttttcactt
gcaagccgta catatttgca tctaagtaga 68640aatttgtcaa agccgagtct ttagaaaagt
agtgagtaca aatataattt gtaagcatta 68700caagtttagt tttagaaggc cattttgggc
cagtaagaat ggtgattata aaaccatcta 68760gattatgtac aaaagtgtct tggcccaagt
taaaggatac taactcctac catcgtgttg 68820ctgaagcaaa aatcacatgg gcagctcagc
tagcaagact gaacgaaaca aaatttaaaa 68880gcaggcaagc cctgtacctc cttttggtct
tgcaaagcgt gagctgagga ggcgagtgag 68940caactcctct actccccagc actcattctt
gtaaggtgaa ggtggaaaca cctcctgcta 69000ctgcttgtga ccacatggaa gatgatagtg
aaggctggca tttggtttgc agaaggacat 69060gaggaatcag tttcttttct tttccttttt
tttttttttg agatggagtc ttgctctgtc 69120acccaggctg gagtgcagtg gcaagatctc
ggctcactgc aacctccgcc tcccaggttc 69180aagcgattct cctgcctcag cctcccaagt
agctgggatt acaggcacac gccaccatgc 69240ccagctaatt tttgtatttt tagtagagat
ggggtttcac catgttggtc aggctggtct 69300caaattcctg acctcaggtg atccatctgc
cttggcctcc cagagtgcta ggattatagg 69360catgagccac cacgcccggc agggatcagt
ttctgagttt tttaacagga tcagtgactt 69420gatttttaat atgtaacaag agtacccact
ttattgacaa ctataaaatg atgttctatc 69480tttatgcctt gggctttctt tctgaaggaa
aagtgaaagt aaagatgact ttacgtttat 69540tttagacatg gccttgctct gtcgcccagg
caatcggatg atcatggctc actgtggcct 69600caacctccta gcctcaagca atcctcctgc
cttagcatcc caagtagctg gaactacagg 69660cacatgccat catatccagc taatttttta
tttcctgtag agatgggggt ctcactatgt 69720tgcccagggt ggtttagaac tcctggcctc
aagtgatcct ctccacctca gcctgggatt 69780ataggcgtga gccactgtgc ccagcattag
tttatttgta atgctttttt acatcctaga 69840tgagcacaat gcctggactc cccacccggc
cctgttttta tgatattgat ttggaccctg 69900aaacagaaca ggtgaatgga ttattctaaa
caggtaagtt gttactgggt aataatttgg 69960cttttttcct catgtagctt atttatgaat
tatagggctc aaactgacgc tataaaaatt 70020cacattctaa tgctttcaaa acatttcttt
tcagacttcc ctgaggggag aggggatagt 70080aatgagagtt ggcaataacc aatgaattga
aatatttata tttcaacaca tttttggaag 70140aagcgcagca tggcttgtca caggttgaca
gtgtagggga gaaaactgct gtccaataaa 70200ctacagagga aactacagga gaggttagac
tccctctgct gaagccccag gaggcttggg 70260gttcagagtt gatgacacat tgcaggtcag
gaacagctga cggtctgtat actctgaata 70320ggtagttaga aggcaagaat tagcaaatgt
cttaggaagt ataatagaaa agagaaggaa 70380acaggaccaa tctggcagtt cttagggcct
taggtgaaag ggccctgagt accgactgca 70440atgagtgaca ttgcattgtt tgatattttt
gaaatttcaa caggacgtgt ctctccttag 70500gcagaccctt ggggctgaag cagcaagttg
cagtgtgcac ttatatccct agacacatcc 70560tgttgaaact gcaaaaatat caaacatgaa
aatcctaaca tctttctcca gagacgaaaa 70620aagtggcttg cctatgaaaa gaatgagaac
tagacttaac gcccgtctta ctggcaataa 70680gaagctttta gacaatggag gagtgtcctg
taaggtcaca gggaagatga ctttcaacat 70740gtgctcacac ctagccgaat ctatctggat
attttcagac gtgcaagggt ttgccacccc 70800acatctcttc ctaagaagtt atttagcaat
gtacttagca aaatgaggaa ttaaccaaga 70860aaaaaggaag acgtgagttt tgggaaactg
aaccctcatt tggacattag cgaaggggca 70920ttctgataac catgcaaatg acctgaagtt
acaaatcctg tgtggaacag gttgggggag 70980cacctgagaa aaactagaaa aagctagaaa
agacaaggat atcaaagggt catagtataa 71040aacaaaagct cttgcattat ttgagaaagt
taaggtccaa gtagaagcaa actgtagttt 71100gatgctagaa cactattctt tgagagggta
tgtgtgatac tcgaaatgca gtcatggctc 71160acggcttggt ctggtcttga acagtattca
cacagtacta tggaaatacc atgcatcatt 71220ttcaagtcca tttatggaca cgccctagaa
gagctggtgg caggaaagga tgtaaatgtt 71280aacagctttg gcaatgtgag tgggtaatgg
tgcagtatgg aaggaacagg aaacatttca 71340gtgcttgctt agagaaaaca tttatttctt
gtttttcctt ccagatcacc atccatcttc 71400aagaagctac tttgaaagtc tggccagtgt
ctattcaggc ccactgggag ttaggaagta 71460taagtaagcc aagagaagtc agcccctgcc
cagaagatct gaaactaata gtaggagttt 71520ccccagaagt cattttcagc cttaattctc
atcatgtata aattaacata aatcatgcat 71580gtctgtttac tttagtgacg ttccacagaa
taaaaggaaa caagtttgc 7162963613DNAHomo
sapiensCDS(269)..(3205) 6ccactccgca ccccaccctc tgtctggtac agcttaccaa
accaaagtgc ccaaagccgt 60gacatcccgg ccggcggctc gcaggccccc gccctccgca
cgtcacggcc gccgggtgca 120gtgcccccta ggggcccctg ggacgaggag gaagcgccag
gtccttcccg ccgccgccgc 180cgccgccgcc gcctgctccc ctggcacgcg ccccgccgcc
ctcggcagcc gcagctccgt 240gtcccctgag aaccagccgt cccgcgcc atg ggc acg
cgt ctg ccg ctc gtc 292 Met Gly Thr
Arg Leu Pro Leu Val 1 5ctg
cgc cag ctc cgc cgc ccg ccc cag ccc ccg ggc cct ccg cgc cgc 340Leu
Arg Gln Leu Arg Arg Pro Pro Gln Pro Pro Gly Pro Pro Arg Arg 10
15 20ctc cgt gtg ccc tgt cgc gct agc agc ggc
ggc ggc gga ggc ggc ggc 388Leu Arg Val Pro Cys Arg Ala Ser Ser Gly
Gly Gly Gly Gly Gly Gly25 30 35
40ggt ggc cgg gag ggc ctg ctt gga cag cgg cgg ccg cag gat ggc
cag 436Gly Gly Arg Glu Gly Leu Leu Gly Gln Arg Arg Pro Gln Asp Gly
Gln 45 50 55gcc cgg agc
agc tgc agc ccc ggc ggc cga acg ccc gcg gcg cgg gac 484Ala Arg Ser
Ser Cys Ser Pro Gly Gly Arg Thr Pro Ala Ala Arg Asp 60
65 70tcc atc gtc aga gaa gtc att cag aat tca
aaa gaa gtt cta agt tta 532Ser Ile Val Arg Glu Val Ile Gln Asn Ser
Lys Glu Val Leu Ser Leu 75 80
85ttg caa gaa aaa aac cct gcc ttc aag ccg gtt ctt gca att atc cag
580Leu Gln Glu Lys Asn Pro Ala Phe Lys Pro Val Leu Ala Ile Ile Gln 90
95 100gca ggt gac gac aac ttg atg cag
gaa atc aac cag aat ttg gct gag 628Ala Gly Asp Asp Asn Leu Met Gln
Glu Ile Asn Gln Asn Leu Ala Glu105 110
115 120gag gct ggt ctg aac atc act cac att tgc ctc cct
cca gat agc agt 676Glu Ala Gly Leu Asn Ile Thr His Ile Cys Leu Pro
Pro Asp Ser Ser 125 130
135gaa gcc gag att ata gat gaa atc tta aag atc aat gaa gat acc aga
724Glu Ala Glu Ile Ile Asp Glu Ile Leu Lys Ile Asn Glu Asp Thr Arg
140 145 150gta cat ggc ctt gcc ctt
cag atc tct gag aac ttg ttt agc aac aaa 772Val His Gly Leu Ala Leu
Gln Ile Ser Glu Asn Leu Phe Ser Asn Lys 155 160
165gtc ctc aat gcc ttg aaa cca gaa aaa gat gtg gat gga gta
aca gac 820Val Leu Asn Ala Leu Lys Pro Glu Lys Asp Val Asp Gly Val
Thr Asp 170 175 180ata aac ctg ggg aag
ctg gtg cga ggg gat gcc cat gaa tgt ttt gtt 868Ile Asn Leu Gly Lys
Leu Val Arg Gly Asp Ala His Glu Cys Phe Val185 190
195 200tca cct gtt gcc aaa gct gta att gaa ctt
ctt gaa aaa tca ggt gtc 916Ser Pro Val Ala Lys Ala Val Ile Glu Leu
Leu Glu Lys Ser Gly Val 205 210
215aac cta gat gga aag aag att ttg gta gtg ggg gcc cat ggg tct ttg
964Asn Leu Asp Gly Lys Lys Ile Leu Val Val Gly Ala His Gly Ser Leu
220 225 230gaa gct gct cta caa tgc
ctg ttc cag aga aaa ggg tcc atg aca atg 1012Glu Ala Ala Leu Gln Cys
Leu Phe Gln Arg Lys Gly Ser Met Thr Met 235 240
245agc atc cag tgg aaa aca cgc cag ctt caa agc aag ctt cac
gag gct 1060Ser Ile Gln Trp Lys Thr Arg Gln Leu Gln Ser Lys Leu His
Glu Ala 250 255 260gac att gtg gtc cta
ggc tca cct aag cca gaa gag att ccc ctt act 1108Asp Ile Val Val Leu
Gly Ser Pro Lys Pro Glu Glu Ile Pro Leu Thr265 270
275 280tgg ata caa cca gga act act gtt ctc aac
tgc tcc cat gac ttc ctg 1156Trp Ile Gln Pro Gly Thr Thr Val Leu Asn
Cys Ser His Asp Phe Leu 285 290
295tca ggg aag gtt ggg tgt ggc tct cca aga ata cat ttt ggt gga ctc
1204Ser Gly Lys Val Gly Cys Gly Ser Pro Arg Ile His Phe Gly Gly Leu
300 305 310att gag gaa gat gat gtg
att ctc ctt gct gca gct ctg cga att cag 1252Ile Glu Glu Asp Asp Val
Ile Leu Leu Ala Ala Ala Leu Arg Ile Gln 315 320
325aac atg gtc agt agt gga agg aga tgg ctt cgt gaa cag cag
cac agg 1300Asn Met Val Ser Ser Gly Arg Arg Trp Leu Arg Glu Gln Gln
His Arg 330 335 340cgg tgg aga ctt cac
tgc ttg aaa ctt cag cct ctc tcc cct gtg cca 1348Arg Trp Arg Leu His
Cys Leu Lys Leu Gln Pro Leu Ser Pro Val Pro345 350
355 360agt gac att gag att tca aga gga caa act
cca aaa gct gtg gat gtc 1396Ser Asp Ile Glu Ile Ser Arg Gly Gln Thr
Pro Lys Ala Val Asp Val 365 370
375ctt gcc aag gag att gga ttg ctt gca gat gaa att gaa atc tat ggc
1444Leu Ala Lys Glu Ile Gly Leu Leu Ala Asp Glu Ile Glu Ile Tyr Gly
380 385 390aaa agc aaa gcc aaa gta
cgt ttg tcc gtg cta gaa agg tta aag gat 1492Lys Ser Lys Ala Lys Val
Arg Leu Ser Val Leu Glu Arg Leu Lys Asp 395 400
405caa gca gat gga aaa tac gtc tta gtt gct ggg atc aca ccc
acc cct 1540Gln Ala Asp Gly Lys Tyr Val Leu Val Ala Gly Ile Thr Pro
Thr Pro 410 415 420ctt gga gaa ggg aag
agc aca gtc acc atc ggg ctt gtg cag gct ctg 1588Leu Gly Glu Gly Lys
Ser Thr Val Thr Ile Gly Leu Val Gln Ala Leu425 430
435 440acc gca cac ctg aat gtc aac tcc ttt gcc
tgc ttg agg cag cct tcc 1636Thr Ala His Leu Asn Val Asn Ser Phe Ala
Cys Leu Arg Gln Pro Ser 445 450
455caa gga ccg acg ttt gga gtg aaa gga gga gcc gcg ggt ggt gga tat
1684Gln Gly Pro Thr Phe Gly Val Lys Gly Gly Ala Ala Gly Gly Gly Tyr
460 465 470gcc cag gtc atc ccc atg
gag gag ttc aac ctt cac ttg act gga gac 1732Ala Gln Val Ile Pro Met
Glu Glu Phe Asn Leu His Leu Thr Gly Asp 475 480
485atc cac gcc atc acc gct gcc aat aac ttg ctg gct gcc gcc
atc gac 1780Ile His Ala Ile Thr Ala Ala Asn Asn Leu Leu Ala Ala Ala
Ile Asp 490 495 500acg agg att ctt cat
gaa aac acg caa aca gat aag gct ctg tat aat 1828Thr Arg Ile Leu His
Glu Asn Thr Gln Thr Asp Lys Ala Leu Tyr Asn505 510
515 520cgg ctg gtt cct tta gtg aat ggt gtc aga
gaa ttt tca gaa att cag 1876Arg Leu Val Pro Leu Val Asn Gly Val Arg
Glu Phe Ser Glu Ile Gln 525 530
535ctt gct cgg cta aaa aaa ctg gga ata aat aag act gat ccg agc aca
1924Leu Ala Arg Leu Lys Lys Leu Gly Ile Asn Lys Thr Asp Pro Ser Thr
540 545 550ctg aca gaa gag gaa gtg
agt aaa ttt gcc cgt ctc gac atc gac cca 1972Leu Thr Glu Glu Glu Val
Ser Lys Phe Ala Arg Leu Asp Ile Asp Pro 555 560
565tct acc atc acg tgg cag aga gta ttg gat aca aat gac cga
ttt cta 2020Ser Thr Ile Thr Trp Gln Arg Val Leu Asp Thr Asn Asp Arg
Phe Leu 570 575 580cga aaa ata acc atc
ggg cag gga aac aca gag aag ggc cat tac cgg 2068Arg Lys Ile Thr Ile
Gly Gln Gly Asn Thr Glu Lys Gly His Tyr Arg585 590
595 600cag gcg cag ttt gac atc gca gtg gcc agc
gag atc atg gcg gtg ctg 2116Gln Ala Gln Phe Asp Ile Ala Val Ala Ser
Glu Ile Met Ala Val Leu 605 610
615gcc ctg acg gac agc ctc gca gac atg aag gca cgg ctg gga agg atg
2164Ala Leu Thr Asp Ser Leu Ala Asp Met Lys Ala Arg Leu Gly Arg Met
620 625 630gtg gtg gcc agt gac aaa
agc ggg cag cct gtg aca gca gat gat ttg 2212Val Val Ala Ser Asp Lys
Ser Gly Gln Pro Val Thr Ala Asp Asp Leu 635 640
645ggg gtg aca ggt gct ttg aca gtt ttg atg aaa gat gca ata
aaa cca 2260Gly Val Thr Gly Ala Leu Thr Val Leu Met Lys Asp Ala Ile
Lys Pro 650 655 660aac ctg atg cag acc
ctg gaa ggg aca cct gtg ttc gtg cat gcg ggc 2308Asn Leu Met Gln Thr
Leu Glu Gly Thr Pro Val Phe Val His Ala Gly665 670
675 680cct ttt gct aac att gct cac ggc aac tct
tca gtg ttg gct gat aaa 2356Pro Phe Ala Asn Ile Ala His Gly Asn Ser
Ser Val Leu Ala Asp Lys 685 690
695att gcc ctg aaa ctg gtt ggt gaa gaa gga ttt gta gtg acc gaa gct
2404Ile Ala Leu Lys Leu Val Gly Glu Glu Gly Phe Val Val Thr Glu Ala
700 705 710ggc ttt ggt gct gac atc
gga atg gag aaa ttc ttc aac atc aag tgc 2452Gly Phe Gly Ala Asp Ile
Gly Met Glu Lys Phe Phe Asn Ile Lys Cys 715 720
725cga gct tcc ggc ttg gtg ccc aac gtg gtt gtg tta gtg gca
acg gtg 2500Arg Ala Ser Gly Leu Val Pro Asn Val Val Val Leu Val Ala
Thr Val 730 735 740cga gct ctg aag atg
cat gga ggc ggg cca agt gta acg gct ggt gtt 2548Arg Ala Leu Lys Met
His Gly Gly Gly Pro Ser Val Thr Ala Gly Val745 750
755 760cct ctt aag aaa gaa tat aca gag gag aac
atc cag ctg gtg gca gac 2596Pro Leu Lys Lys Glu Tyr Thr Glu Glu Asn
Ile Gln Leu Val Ala Asp 765 770
775ggc tgc tgt aac ctc cag aag caa att cag atc act cag ctc ttt ggg
2644Gly Cys Cys Asn Leu Gln Lys Gln Ile Gln Ile Thr Gln Leu Phe Gly
780 785 790gtt ccc gtt gtg gtg gct
ctg aat gtc ttc aag acc gac acc cgc gct 2692Val Pro Val Val Val Ala
Leu Asn Val Phe Lys Thr Asp Thr Arg Ala 795 800
805gag att gac ttg gtg tgt gag ctt gca aag cgg gct ggt gcc
ttt gat 2740Glu Ile Asp Leu Val Cys Glu Leu Ala Lys Arg Ala Gly Ala
Phe Asp 810 815 820gca gtc ccc tgc tat
cac tgg tcc gtt ggt gga aaa gga tcg gtg gac 2788Ala Val Pro Cys Tyr
His Trp Ser Val Gly Gly Lys Gly Ser Val Asp825 830
835 840ttg gct cgg gct gtg aga gag gct gcg agt
aaa aga agc cga ttc cag 2836Leu Ala Arg Ala Val Arg Glu Ala Ala Ser
Lys Arg Ser Arg Phe Gln 845 850
855ttc ctg tat gat gtt cag gtt cca att gtg gac aag ata agg acc att
2884Phe Leu Tyr Asp Val Gln Val Pro Ile Val Asp Lys Ile Arg Thr Ile
860 865 870gct cag gct gtc tat gga
gcc aaa gat att gaa ctc tct cct gag gca 2932Ala Gln Ala Val Tyr Gly
Ala Lys Asp Ile Glu Leu Ser Pro Glu Ala 875 880
885caa gcc aaa ata gat cgt tac act caa cag ggt ttt gga aat
ttg ccc 2980Gln Ala Lys Ile Asp Arg Tyr Thr Gln Gln Gly Phe Gly Asn
Leu Pro 890 895 900atc tgc atg gca aag
acc cac ctt tct cta tct cac caa cct gac aaa 3028Ile Cys Met Ala Lys
Thr His Leu Ser Leu Ser His Gln Pro Asp Lys905 910
915 920aaa ggt gtg cca agg gac ttc atc tta cct
atc agt gac gtc cgg gcc 3076Lys Gly Val Pro Arg Asp Phe Ile Leu Pro
Ile Ser Asp Val Arg Ala 925 930
935agc ata ggc gct ggg ttc att tac cct ttg gtc gga acg atg agc acc
3124Ser Ile Gly Ala Gly Phe Ile Tyr Pro Leu Val Gly Thr Met Ser Thr
940 945 950atg cca gga ctg ccc acc
cgg ccc tgc ttt tat gac ata gat ctt gat 3172Met Pro Gly Leu Pro Thr
Arg Pro Cys Phe Tyr Asp Ile Asp Leu Asp 955 960
965acc gaa aca gaa caa gtt aaa ggc ttg ttc taa gtggacaagg
ctctcacagg 3225Thr Glu Thr Glu Gln Val Lys Gly Leu Phe 970
975acccgatgca gactcctgaa acagactact ctttgccttt ttgctgcagt
tggagaagaa 3285actgaatttg aaaaatgtct gttatgcaat gctggagacg tggtgaaata
ggccaaagat 3345ttcttcttcg ttcaagatga attctgttca cagtggagta tggtgttcgg
caaaaggacc 3405tccaccaaga ctgaaagaaa ctaatttatt tctgtttctg tggagtttcc
attatttcta 3465ctgcttacac tttagaatgt ttattttatg gggactaagg gattaggagt
gtgaactaaa 3525aggtaacatt ttccactctc aagttttcta ctttgtcttt gaactgaaaa
taaacatgga 3585tctagaaaac caaaaaaaaa aaaaaaaa
361372937DNAHomo sapiens 7atgggcacgc gtctgccgct cgtcctgcgc
cagctccgcc gcccgcccca gcccccgggc 60cctccgcgcc gcctccgtgt gccctgtcgc
gctagcagcg gcggcggcgg aggcggcggc 120ggtggccggg agggcctgct tggacagcgg
cggccgcagg atggccaggc ccggagcagc 180tgcagccccg gcggccgaac gcccgcggcg
cgggactcca tcgtcagaga agtcattcag 240aattcaaaag aagttctaag tttattgcaa
gaaaaaaacc ctgccttcaa gccggttctt 300gcaattatcc aggcaggtga cgacaacttg
atgcaggaaa tcaaccagaa tttggctgag 360gaggctggtc tgaacatcac tcacatttgc
ctccctccag atagcagtga agccgagatt 420atagatgaaa tcttaaagat caatgaagat
accagagtac atggccttgc ccttcagatc 480tctgagaact tgtttagcaa caaagtcctc
aatgccttga aaccagaaaa agatgtggat 540ggagtaacag acataaacct ggggaagctg
gtgcgagggg atgcccatga atgttttgtt 600tcacctgttg ccaaagctgt aattgaactt
cttgaaaaat caggtgtcaa cctagatgga 660aagaagattt tggtagtggg ggcccatggg
tctttggaag ctgctctaca atgcctgttc 720cagagaaaag ggtccatgac aatgagcatc
cagtggaaaa cacgccagct tcaaagcaag 780cttcacgagg ctgacattgt ggtcctaggc
tcacctaagc cagaagagat tccccttact 840tggatacaac caggaactac tgttctcaac
tgctcccatg acttcctgtc agggaaggtt 900gggtgtggct ctccaagaat acattttggt
ggactcattg aggaagatga tgtgattctc 960cttgctgcag ctctgcgaat tcagaacatg
gtcagtagtg gaaggagatg gcttcgtgaa 1020cagcagcaca ggcggtggag acttcactgc
ttgaaacttc agcctctctc ccctgtgcca 1080agtgacattg agatttcaag aggacaaact
ccaaaagctg tggatgtcct tgccaaggag 1140attggattgc ttgcagatga aattgaaatc
tatggcaaaa gcaaagccaa agtacgtttg 1200tccgtgctag aaaggttaaa ggatcaagca
gatggaaaat acgtcttagt tgctgggatc 1260acacccaccc ctcttggaga agggaagagc
acagtcacca tcgggcttgt gcaggctctg 1320accgcacacc tgaatgtcaa ctcctttgcc
tgcttgaggc agccttccca aggaccgacg 1380tttggagtga aaggaggagc cgcgggtggt
ggatatgccc aggtcatccc catggaggag 1440ttcaaccttc acttgactgg agacatccac
gccatcaccg ctgccaataa cttgctggct 1500gccgccatcg acacgaggat tcttcatgaa
aacacgcaaa cagataaggc tctgtataat 1560cggctggttc ctttagtgaa tggtgtcaga
gaattttcag aaattcagct tgctcggcta 1620aaaaaactgg gaataaataa gactgatccg
agcacactga cagaagagga agtgagtaaa 1680tttgcccgtc tcgacatcga cccatctacc
atcacgtggc agagagtatt ggatacaaat 1740gaccgatttc tacgaaaaat aaccatcggg
cagggaaaca cagagaaggg ccattaccgg 1800caggcgcagt ttgacatcgc agtggccagc
gagatcatgg cggtgctggc cctgacggac 1860agcctcgcag acatgaaggc acggctggga
aggatggtgg tggccagtga caaaagcggg 1920cagcctgtga cagcagatga tttgggggtg
acaggtgctt tgacagtttt gatgaaagat 1980gcaataaaac caaacctgat gcagaccctg
gaagggacac ctgtgttcgt gcatgcgggc 2040ccttttgcta acattgctca cggcaactct
tcagtgttgg ctgataaaat tgccctgaaa 2100ctggttggtg aagaaggatt tgtagtgacc
gaagctggct ttggtgctga catcggaatg 2160gagaaattct tcaacatcaa gtgccgagct
tccggcttgg tgcccaacgt ggttgtgtta 2220gtggcaacgg tgcgagctct gaagatgcat
ggaggcgggc caagtgtaac ggctggtgtt 2280cctcttaaga aagaatatac agaggagaac
atccagctgg tggcagacgg ctgctgtaac 2340ctccagaagc aaattcagat cactcagctc
tttggggttc ccgttgtggt ggctctgaat 2400gtcttcaaga ccgacacccg cgctgagatt
gacttggtgt gtgagcttgc aaagcgggct 2460ggtgcctttg atgcagtccc ctgctatcac
tggtccgttg gtggaaaagg atcggtggac 2520ttggctcggg ctgtgagaga ggctgcgagt
aaaagaagcc gattccagtt cctgtatgat 2580gttcaggttc caattgtgga caagataagg
accattgctc aggctgtcta tggagccaaa 2640gatattgaac tctctcctga ggcacaagcc
aaaatagatc gttacactca acagggtttt 2700ggaaatttgc ccatctgcat ggcaaagacc
cacctttctc tatctcacca acctgacaaa 2760aaaggtgtgc caagggactt catcttacct
atcagtgacg tccgggccag cataggcgct 2820gggttcattt accctttggt cggaacgatg
agcaccatgc caggactgcc cacccggccc 2880tgcttttatg acatagatct tgataccgaa
acagaacaag ttaaaggctt gttctaa 29378978PRTHomo sapiens 8Met Gly Thr
Arg Leu Pro Leu Val Leu Arg Gln Leu Arg Arg Pro Pro1 5
10 15Gln Pro Pro Gly Pro Pro Arg Arg Leu
Arg Val Pro Cys Arg Ala Ser 20 25
30Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly Arg Glu Gly Leu Leu Gly
35 40 45Gln Arg Arg Pro Gln Asp Gly
Gln Ala Arg Ser Ser Cys Ser Pro Gly 50 55
60Gly Arg Thr Pro Ala Ala Arg Asp Ser Ile Val Arg Glu Val Ile Gln65
70 75 80Asn Ser Lys Glu
Val Leu Ser Leu Leu Gln Glu Lys Asn Pro Ala Phe 85
90 95Lys Pro Val Leu Ala Ile Ile Gln Ala Gly
Asp Asp Asn Leu Met Gln 100 105
110Glu Ile Asn Gln Asn Leu Ala Glu Glu Ala Gly Leu Asn Ile Thr His
115 120 125Ile Cys Leu Pro Pro Asp Ser
Ser Glu Ala Glu Ile Ile Asp Glu Ile 130 135
140Leu Lys Ile Asn Glu Asp Thr Arg Val His Gly Leu Ala Leu Gln
Ile145 150 155 160Ser Glu
Asn Leu Phe Ser Asn Lys Val Leu Asn Ala Leu Lys Pro Glu
165 170 175Lys Asp Val Asp Gly Val Thr
Asp Ile Asn Leu Gly Lys Leu Val Arg 180 185
190Gly Asp Ala His Glu Cys Phe Val Ser Pro Val Ala Lys Ala
Val Ile 195 200 205Glu Leu Leu Glu
Lys Ser Gly Val Asn Leu Asp Gly Lys Lys Ile Leu 210
215 220Val Val Gly Ala His Gly Ser Leu Glu Ala Ala Leu
Gln Cys Leu Phe225 230 235
240Gln Arg Lys Gly Ser Met Thr Met Ser Ile Gln Trp Lys Thr Arg Gln
245 250 255Leu Gln Ser Lys Leu
His Glu Ala Asp Ile Val Val Leu Gly Ser Pro 260
265 270Lys Pro Glu Glu Ile Pro Leu Thr Trp Ile Gln Pro
Gly Thr Thr Val 275 280 285Leu Asn
Cys Ser His Asp Phe Leu Ser Gly Lys Val Gly Cys Gly Ser 290
295 300Pro Arg Ile His Phe Gly Gly Leu Ile Glu Glu
Asp Asp Val Ile Leu305 310 315
320Leu Ala Ala Ala Leu Arg Ile Gln Asn Met Val Ser Ser Gly Arg Arg
325 330 335Trp Leu Arg Glu
Gln Gln His Arg Arg Trp Arg Leu His Cys Leu Lys 340
345 350Leu Gln Pro Leu Ser Pro Val Pro Ser Asp Ile
Glu Ile Ser Arg Gly 355 360 365Gln
Thr Pro Lys Ala Val Asp Val Leu Ala Lys Glu Ile Gly Leu Leu 370
375 380Ala Asp Glu Ile Glu Ile Tyr Gly Lys Ser
Lys Ala Lys Val Arg Leu385 390 395
400Ser Val Leu Glu Arg Leu Lys Asp Gln Ala Asp Gly Lys Tyr Val
Leu 405 410 415Val Ala Gly
Ile Thr Pro Thr Pro Leu Gly Glu Gly Lys Ser Thr Val 420
425 430Thr Ile Gly Leu Val Gln Ala Leu Thr Ala
His Leu Asn Val Asn Ser 435 440
445Phe Ala Cys Leu Arg Gln Pro Ser Gln Gly Pro Thr Phe Gly Val Lys 450
455 460Gly Gly Ala Ala Gly Gly Gly Tyr
Ala Gln Val Ile Pro Met Glu Glu465 470
475 480Phe Asn Leu His Leu Thr Gly Asp Ile His Ala Ile
Thr Ala Ala Asn 485 490
495Asn Leu Leu Ala Ala Ala Ile Asp Thr Arg Ile Leu His Glu Asn Thr
500 505 510Gln Thr Asp Lys Ala Leu
Tyr Asn Arg Leu Val Pro Leu Val Asn Gly 515 520
525Val Arg Glu Phe Ser Glu Ile Gln Leu Ala Arg Leu Lys Lys
Leu Gly 530 535 540Ile Asn Lys Thr Asp
Pro Ser Thr Leu Thr Glu Glu Glu Val Ser Lys545 550
555 560Phe Ala Arg Leu Asp Ile Asp Pro Ser Thr
Ile Thr Trp Gln Arg Val 565 570
575Leu Asp Thr Asn Asp Arg Phe Leu Arg Lys Ile Thr Ile Gly Gln Gly
580 585 590Asn Thr Glu Lys Gly
His Tyr Arg Gln Ala Gln Phe Asp Ile Ala Val 595
600 605Ala Ser Glu Ile Met Ala Val Leu Ala Leu Thr Asp
Ser Leu Ala Asp 610 615 620Met Lys Ala
Arg Leu Gly Arg Met Val Val Ala Ser Asp Lys Ser Gly625
630 635 640Gln Pro Val Thr Ala Asp Asp
Leu Gly Val Thr Gly Ala Leu Thr Val 645
650 655Leu Met Lys Asp Ala Ile Lys Pro Asn Leu Met Gln
Thr Leu Glu Gly 660 665 670Thr
Pro Val Phe Val His Ala Gly Pro Phe Ala Asn Ile Ala His Gly 675
680 685Asn Ser Ser Val Leu Ala Asp Lys Ile
Ala Leu Lys Leu Val Gly Glu 690 695
700Glu Gly Phe Val Val Thr Glu Ala Gly Phe Gly Ala Asp Ile Gly Met705
710 715 720Glu Lys Phe Phe
Asn Ile Lys Cys Arg Ala Ser Gly Leu Val Pro Asn 725
730 735Val Val Val Leu Val Ala Thr Val Arg Ala
Leu Lys Met His Gly Gly 740 745
750Gly Pro Ser Val Thr Ala Gly Val Pro Leu Lys Lys Glu Tyr Thr Glu
755 760 765Glu Asn Ile Gln Leu Val Ala
Asp Gly Cys Cys Asn Leu Gln Lys Gln 770 775
780Ile Gln Ile Thr Gln Leu Phe Gly Val Pro Val Val Val Ala Leu
Asn785 790 795 800Val Phe
Lys Thr Asp Thr Arg Ala Glu Ile Asp Leu Val Cys Glu Leu
805 810 815Ala Lys Arg Ala Gly Ala Phe
Asp Ala Val Pro Cys Tyr His Trp Ser 820 825
830Val Gly Gly Lys Gly Ser Val Asp Leu Ala Arg Ala Val Arg
Glu Ala 835 840 845Ala Ser Lys Arg
Ser Arg Phe Gln Phe Leu Tyr Asp Val Gln Val Pro 850
855 860Ile Val Asp Lys Ile Arg Thr Ile Ala Gln Ala Val
Tyr Gly Ala Lys865 870 875
880Asp Ile Glu Leu Ser Pro Glu Ala Gln Ala Lys Ile Asp Arg Tyr Thr
885 890 895Gln Gln Gly Phe Gly
Asn Leu Pro Ile Cys Met Ala Lys Thr His Leu 900
905 910Ser Leu Ser His Gln Pro Asp Lys Lys Gly Val Pro
Arg Asp Phe Ile 915 920 925Leu Pro
Ile Ser Asp Val Arg Ala Ser Ile Gly Ala Gly Phe Ile Tyr 930
935 940Pro Leu Val Gly Thr Met Ser Thr Met Pro Gly
Leu Pro Thr Arg Pro945 950 955
960Cys Phe Tyr Asp Ile Asp Leu Asp Thr Glu Thr Glu Gln Val Lys Gly
965 970 975Leu
Phe91049DNAHomo sapiensCDS(108)..(935) 9tccttcccgc cgccgccgcc gccgccgccg
cctgctcccc tggcacgcgc cccgccgccc 60tcggcagccg cagctccgtg tcccctgaga
accagccgtc ccgcgcc atg ggc acg 116
Met Gly Thr
1cgt ctg ccg ctc gtc ctg cgc cag ctc cgc cgc ccg ccc cag ccc ccg
164Arg Leu Pro Leu Val Leu Arg Gln Leu Arg Arg Pro Pro Gln Pro Pro 5
10 15ggc cct ccg cgc cgc ctc cgt gtg
ccc tgt cgc gct agc agc ggc ggc 212Gly Pro Pro Arg Arg Leu Arg Val
Pro Cys Arg Ala Ser Ser Gly Gly20 25 30
35ggc gga ggc ggc ggc ggt ggc cgg gag ggc ctg ctt gga
cag cgg cgg 260Gly Gly Gly Gly Gly Gly Gly Arg Glu Gly Leu Leu Gly
Gln Arg Arg 40 45 50ccg
cag gat ggc cag gcc cgg agc agc tgc agc ccc ggc ggc cga acg 308Pro
Gln Asp Gly Gln Ala Arg Ser Ser Cys Ser Pro Gly Gly Arg Thr 55
60 65ccc gcg gcg cgg gac tcc atc gtc
aga gaa gtc att cag aat tca aaa 356Pro Ala Ala Arg Asp Ser Ile Val
Arg Glu Val Ile Gln Asn Ser Lys 70 75
80gaa gtt cta agt tta ttg caa gaa aaa aac cct gcc ttc aag ccg gtt
404Glu Val Leu Ser Leu Leu Gln Glu Lys Asn Pro Ala Phe Lys Pro Val
85 90 95ctt gca att atc cag gca ggt gac
gac aac ttg atg cag gaa atc aac 452Leu Ala Ile Ile Gln Ala Gly Asp
Asp Asn Leu Met Gln Glu Ile Asn100 105
110 115cag aat ttg gct gag gag gct ggt ctg aac atc act
cac att tgc ctc 500Gln Asn Leu Ala Glu Glu Ala Gly Leu Asn Ile Thr
His Ile Cys Leu 120 125
130cct cca gat agc agt gaa gcc gag att ata gat gaa atc tta aag atc
548Pro Pro Asp Ser Ser Glu Ala Glu Ile Ile Asp Glu Ile Leu Lys Ile
135 140 145aat gaa gat acc aga gta
cat ggc ctt gcc ctt cag atc tct gag aac 596Asn Glu Asp Thr Arg Val
His Gly Leu Ala Leu Gln Ile Ser Glu Asn 150 155
160ttg ttt agc aac aaa gtc ctc aat gcc ttg aaa cca gaa aaa
gat gtg 644Leu Phe Ser Asn Lys Val Leu Asn Ala Leu Lys Pro Glu Lys
Asp Val 165 170 175gat gga gta aca gac
ata aac ctg ggg aag ctg gtg cga ggg gat gcc 692Asp Gly Val Thr Asp
Ile Asn Leu Gly Lys Leu Val Arg Gly Asp Ala180 185
190 195cat gaa tgt ttt gtt tca cct gtt gcc aaa
gct gta att gaa ctt ctt 740His Glu Cys Phe Val Ser Pro Val Ala Lys
Ala Val Ile Glu Leu Leu 200 205
210gaa aaa tca ggt gtc aac cta gat gga aag aag att ttg gta gtg ggg
788Glu Lys Ser Gly Val Asn Leu Asp Gly Lys Lys Ile Leu Val Val Gly
215 220 225gcc cat ggg tct ttg gaa
gct gct cta caa tgc ctg ttc cag aga aaa 836Ala His Gly Ser Leu Glu
Ala Ala Leu Gln Cys Leu Phe Gln Arg Lys 230 235
240ggg tcc atg aca atg agc atc cag tgg aaa aca cgc cag ctt
caa agc 884Gly Ser Met Thr Met Ser Ile Gln Trp Lys Thr Arg Gln Leu
Gln Ser 245 250 255aag acg gag tct cgt
tct gtc acc agg ctg gag tgc agg cgc gtg atc 932Lys Thr Glu Ser Arg
Ser Val Thr Arg Leu Glu Cys Arg Arg Val Ile260 265
270 275tag gctcactgca agctctgcct cccaggttga
agtgattctc ctgtgaaagg 985gaattatttt tgatgagtca ttaaagtata
tccattccca gaatggtgct gcatttttcc 1045tttt
104910828DNAHomo sapiens 10atgggcacgc
gtctgccgct cgtcctgcgc cagctccgcc gcccgcccca gcccccgggc 60cctccgcgcc
gcctccgtgt gccctgtcgc gctagcagcg gcggcggcgg aggcggcggc 120ggtggccggg
agggcctgct tggacagcgg cggccgcagg atggccaggc ccggagcagc 180tgcagccccg
gcggccgaac gcccgcggcg cgggactcca tcgtcagaga agtcattcag 240aattcaaaag
aagttctaag tttattgcaa gaaaaaaacc ctgccttcaa gccggttctt 300gcaattatcc
aggcaggtga cgacaacttg atgcaggaaa tcaaccagaa tttggctgag 360gaggctggtc
tgaacatcac tcacatttgc ctccctccag atagcagtga agccgagatt 420atagatgaaa
tcttaaagat caatgaagat accagagtac atggccttgc ccttcagatc 480tctgagaact
tgtttagcaa caaagtcctc aatgccttga aaccagaaaa agatgtggat 540ggagtaacag
acataaacct ggggaagctg gtgcgagggg atgcccatga atgttttgtt 600tcacctgttg
ccaaagctgt aattgaactt cttgaaaaat caggtgtcaa cctagatgga 660aagaagattt
tggtagtggg ggcccatggg tctttggaag ctgctctaca atgcctgttc 720cagagaaaag
ggtccatgac aatgagcatc cagtggaaaa cacgccagct tcaaagcaag 780acggagtctc
gttctgtcac caggctggag tgcaggcgcg tgatctag 82811275PRTHomo
sapiens 11Met Gly Thr Arg Leu Pro Leu Val Leu Arg Gln Leu Arg Arg Pro
Pro1 5 10 15Gln Pro Pro
Gly Pro Pro Arg Arg Leu Arg Val Pro Cys Arg Ala Ser 20
25 30Ser Gly Gly Gly Gly Gly Gly Gly Gly Gly
Arg Glu Gly Leu Leu Gly 35 40
45Gln Arg Arg Pro Gln Asp Gly Gln Ala Arg Ser Ser Cys Ser Pro Gly 50
55 60Gly Arg Thr Pro Ala Ala Arg Asp Ser
Ile Val Arg Glu Val Ile Gln65 70 75
80Asn Ser Lys Glu Val Leu Ser Leu Leu Gln Glu Lys Asn Pro
Ala Phe 85 90 95Lys Pro
Val Leu Ala Ile Ile Gln Ala Gly Asp Asp Asn Leu Met Gln 100
105 110Glu Ile Asn Gln Asn Leu Ala Glu Glu
Ala Gly Leu Asn Ile Thr His 115 120
125Ile Cys Leu Pro Pro Asp Ser Ser Glu Ala Glu Ile Ile Asp Glu Ile
130 135 140Leu Lys Ile Asn Glu Asp Thr
Arg Val His Gly Leu Ala Leu Gln Ile145 150
155 160Ser Glu Asn Leu Phe Ser Asn Lys Val Leu Asn Ala
Leu Lys Pro Glu 165 170
175Lys Asp Val Asp Gly Val Thr Asp Ile Asn Leu Gly Lys Leu Val Arg
180 185 190Gly Asp Ala His Glu Cys
Phe Val Ser Pro Val Ala Lys Ala Val Ile 195 200
205Glu Leu Leu Glu Lys Ser Gly Val Asn Leu Asp Gly Lys Lys
Ile Leu 210 215 220Val Val Gly Ala His
Gly Ser Leu Glu Ala Ala Leu Gln Cys Leu Phe225 230
235 240Gln Arg Lys Gly Ser Met Thr Met Ser Ile
Gln Trp Lys Thr Arg Gln 245 250
255Leu Gln Ser Lys Thr Glu Ser Arg Ser Val Thr Arg Leu Glu Cys Arg
260 265 270Arg Val Ile
27512236333DNAHomo sapiensmisc_feature(33274)..(33274)The number of "att"
repeats staring at position 33274 is seven, but there can be a
greater number of such "att" repeats than seven. 12ccactccgca
ccccaccctc tgtctggtac agcttaccaa accaaagtgc ccaaagccgt 60gacatcccgg
ccggcggctc gcaggccccc gccctccgca cgtcacggcc gccgggtgca 120gtgcccccta
ggggcccctg ggacgaggag gaagcgccag gtccttcccg ccgccgccgc 180cgccgccgcc
gcctgctccc ctggcacgcg ccccgccgcc ctcggcagcc gcagctccgt 240gtcccctgag
aaccagccgt cccgcgccat gggcacgcgt ctgccgctcg tcctgcgcca 300gctccgccgc
ccgccccagc ccccgggccc tccgcgccgc ctccgtgtgc cctgtcgcgc 360tagcagcggc
ggcggcggag gcggcggcgg tggccgggag ggcctgcttg gacagcggcg 420gccgcaggat
ggccaggccc ggagcagctg cagccccggc ggccgaacgc ccgcggcgcg 480ggactccatc
gtcaggtgag tgtcgggtct ggccctggcc caggtctcca gcggctgtgg 540ccccgaccca
ggggcggagc gcccgggacc gccgtgggga gcggggcgcg gggctgcgag 600tgtttggtgc
caactcatta ggggggccgg gcggtgtgtc gggaaacgcg ggcttgggca 660ctgcggccgg
gagcgctggc ggagaacggg ggatccagta cccgaccggg cccgcagcgc 720aggtgggcgt
gggcatctcc aatgggcgcc tagcggcatg gaccgcacgc ccggctccac 780gtgcgcagcc
ggccgccggc tctgatgcaa tcgcgccggg cgcgacccag acggtaaagg 840ggcggtgcgg
ctggggcgga aaccagggga gggggcgagg agcaggagga gggcggggct 900gcggctcggc
gcgccggctg gcccggggtt cgggaagggc aagcggagct cgggagaggc 960gggctcgggc
ccagcgccgc ccgcgcgaag ctccctggtg ttgtgcgccc ttccccgcgc 1020gcggcccctc
ctgctgggct ggggtctgtg gctgacgtcc gcttttcccg gaacggcaaa 1080ggtggagccg
ggcccccggg acagccgccg gggggaatcc gagaggtctc agcgctggtt 1140tccagcttcg
ccgcgcagcg cccacggagt ccccgcgcag ctgggtttgc agggttttct 1200gcgggcccag
agcgaactgg gagccccggg cccggccttt cccgtcccgg tccccgcccc 1260tgggctccag
ccttccccgc cctcgcccag aggggcgcac gggtctcggg ccttgcaggg 1320cgagcccttc
gggacaactt gggggtggca agaaagcccc ttccgcctcc ctcggaactg 1380ggattaggaa
cgagttccag gatgcggttt ccccggccag cggaccacct cctccattca 1440catcctcttc
cctttccttc tagccttctt tcttagtcat tgcctgcttg cctttctctc 1500tgcctttcct
tcctccctct gctctcccat tttcctcccc actccccttc tcccatctca 1560gtaaaatacg
tttcctaggc tctttattat gtggcggatg attttaaggg gccatttatt 1620tagtttttct
tgttctctgt cttctttttc ttttagaaat ggagtctcgc tttgttgctc 1680aggctggagt
tcagtggtgc catctcggct caatgcagcc tccgcctcct gggctcaagc 1740gattctcccg
ccttagcctc cagtgtaact gggaccacag gtgtgcgcta ccatgcccgg 1800attattttat
tttattttca tagagacaga gcaccactct gttaccaggc tggtctggaa 1860ctcctgggcg
caagccatca tccctgcccc caaaagcgct gggattaatt acacgtcagg 1920tcattaggtt
ttaacctgca cgttgctgag actcatctta ccggttagag tctgaagcca 1980ggataggcgg
caaaaaaact tctcccgtca tgctgtttta ttttctaggt aaactgatta 2040ttacctgaaa
cttccttctt tgctcatttc ttgtctgtct cacccacctg gaatatgagc 2100ttcatgagac
agccgtctct tcggtgctat atcctaggtc ctggagagag tgcccagcat 2160gtaataagca
cttgtgaaat ggttacagaa ggaatgcttc aagatttggg gtcatctgtg 2220ggttttattc
atacatattc tttttttttt tttttttttt tgagacggag tcttgctctg 2280tcactcaggc
tggagtgcaa tggcacgatc ttggctccct gcaacctctg cctcccaggt 2340tcaggcgatt
cccctgcctc agcctcccga gtagctggga ttacaggagc acgccaccac 2400gcccggctaa
tttttgtatt ttgagtaaag acaggatgtc tcaccatgtt ggccaggctg 2460gtcttgaact
cctgacctca ggtgatccga ctgccttggc ctcccaaagt gttgggatta 2520caggcgtgac
ccaccgcgcc tgggctattc atgcatactt ttaggtttaa agccatgact 2580gccttagaga
tgagtaaaga ggttagtagg gggctttagg gcctgcactg ggttctccaa 2640actcaattcc
ttcaggaaaa tgggaggagt aacagtactt acctcaaggg gttatggaaa 2700caattatgtg
agaggacaca tgcaaaatac atagcccagg actgtcacat acaagacaag 2760tccctcaaca
aatggtggct attaattttt tttttttttt tttgagacgg aatctcactc 2820tgtcgcacag
gttggagtgc agtggcggga tctcggctca ctgcaacctc cgcctcccgg 2880gttcaagcga
ttcccctgcc tcagcctcct gagtagctgg gattacaggc tcgcactacc 2940acgcctggct
tattttgtgt ttttagtaga gacggggttt caccatgttg gccaggctgg 3000tctcaaactc
ctgacctcgt gatctgcctg cctcagcctc ccaaagtgct gggattacag 3060gtgtgagcta
ccatgcccag caatattttt acaattcact aagccagggg tccacagacg 3120ttttatatat
aatgggccag atagtaaata ttttaggttt agtgatccta gaggcaaatc 3180aaaattatta
gtaggtacat ttataacaag agaaaagaca aatttaggcc aggcaccatg 3240gttcatgcct
gtaatcccag cactttgaga ggccaaagtg ggatgatcac attcccagga 3300gttcaagacc
agcctgggca acatagtgag acacccatct ctgtaaaaaa ttttaaaaaa 3360ttaactgggc
gtggcgcgtg cctgtagttc cagctactca gaaggctgag gcaggaggat 3420cacttgagcc
caggaggtca aggctacagt gagcggagat tgcgctactg cactgcagcc 3480tgggcaatgg
agtgagacct tattttttat ttttgcaatt aagtattttt aattaaggta 3540tgtacatttt
ttagacaatg cttttgcaca cttaatagat ggcagtatag ggtaaccata 3600acttttattt
ttattttcta ttttacttta agttctggga aacgtgccga acgtgcaggt 3660ttgttacata
ggtatacatg cgccatggtg gtttgctgca cccatcaacc cgtcatctag 3720gttttaagcc
ccacatgcat taggtatttg tcctaatgct ctccctcctc ttgccctccc 3780accccctgac
agaccctggt gtgtgatgtt cccctccctg ggtccacgtg ttcttattgt 3840tcaactccca
cttatgagtg agaacatggg gtgtttggtt tctgttcctg tgttggtttg 3900ctgaggatga
tggtttccag cttcttccat gtccctgcag aggacatgaa ctcatgcttt 3960cttatggctg
caattttatt tttgtagaga tgaagtctca ctatgttggc caggctggtc 4020tcaaactcct
gggttcaagt gatcctgctg cctcagcctc ccaaagtgct gagattgcag 4080gcatgagcca
ccacgcctgg ctatattgta cctttcaatt gagttatgtg catggtttac 4140tgaaactcta
gccctgaatt gattctccga ggattggagc aaagtgggtg ggggtgtgtg 4200tcagctagaa
gtcacctgga atagacatcc aggagcggaa gtgtattagt cagaaaaatt 4260aaagaatgag
ctccctcaaa agcatagtga cctgttgttc ctggtgatgg gactagatga 4320ccataatccc
tcagcctccc aagtagctgg gattacaggc aggcaccacc acacccagct 4380aatttttgta
tttttagtag agacggggtt tcaccatgtt ggccaggcta gtcttgaact 4440cctgacctca
aatgatccac ccgccttggc ctcccaaagt gttgggatta caggcatgag 4500ccactgcccc
tggccagtaa taatacattc tgtataatag tcacagtaat ggtgaaaata 4560acaaagctgc
tgctgatggc ttagttctga gttttagcta cacatggcaa cctagctaga 4620gagctcacat
tccttcccac tgtcctcaac caaagtggaa gctcacaagg ccatttttct 4680gccccaagga
catgaaaatc tctgattaga aacatttatg ctactttttg gatttattga 4740gcctaatgac
tcctttttct ttcttgcttt gtcactgtag gaatgagtaa ccccaaaaaa 4800tgaaacatta
cacatataac aagttgtagg gagcatgaac ctattttgtt ttattagctt 4860gggtggtagt
agcgttgaac tcctttacat acctgtacag gcacccctaa gatattgtgg 4920gttcagttgc
agaccactgc aataagggga atatcgcagt aaaacgagtc acacaaattt 4980tgtagtttcc
cagtgcatat aaaagttatg tttagacaat actgtagcct attaagtatg 5040cactagcgtt
ttgtctaaaa aaaaacccat atataaggcc gggcgtggta gctcatgact 5100gtaatcccag
cactttggga ggctgaagtg ggtggatcac gaggtcagga gttcgagacc 5160agcctggcca
acatagtaaa accctgtctc tactaaaaat acaaaaatta gccaggcatg 5220gtggtgtgcg
cctgtagtcc tggccacttg ggaggctgag gcaggagaat cgctttaacc 5280cgggaggtgg
aggttgtggg gaggcggagg ttgtggtaag ccactacact ccagcctggg 5340aaacagagcg
agactccgtc tcaaaacaaa acaaaacaaa aaaacccaaa aaacatatat 5400atatatatta
tatatataaa atatatataa tataatatac cttaattata atataatata 5460attatatata
ccttaattta tatatatacc ttataatata tataccttaa tatatatacc 5520ttaattatat
atatacacac cttaatatat atataatata tattaatata tatcttaata 5580tatatatata
aggtaatata tatatacacc ttaataataa tatatatacc ttaattttaa 5640aatactttat
taccaagaaa tgccagtgat ctggtggagg atttgcctag atgttgatgg 5700ttattgactg
atcagggtga ttgctaaaga ttagggtggc tgtggcagtt ttttaaagta 5760agacaatgag
gtttgctgta ttgatggaac tcttcctttc ctggaagatt tctctgtagc 5820agtgctacag
agaaaatggt aacattttac ccacagtagt aggacttctt tcaaaatggg 5880cgtcaatcct
ctcaaaccct gcaactgctt taccaagtac ttgatatcaa ctactgtaat 5940cttttttttt
tttttttttt tttttgagat ggaatcttgc tctgtcgccc aggctggact 6000gcagtgctgc
gatctcggct cgctggaacc tccgcctctc aggttcaagt gattctcctg 6060cctcagcctc
ctgagtagct gagactacag gcgcctgcca ccacgcctgg ctaatttttt 6120tgtattttta
gtagaaacgg ggtttcacca tattggccag gctggtctcg aactcctgac 6180cttgtgatcc
gcccacttcg gcctcccaaa ctgctgggat tacaggtgtg agccactgtg 6240cctgcctaac
tactgtaatc ttataaatcc tttgttgtca ttttatttta ttttaatttt 6300attttatttt
attttaattt aattttattt tattttattt tttgagacag tctcactctt 6360tcacccaggc
tggagtgcag tggcgcgatc tctgctcact gcaacctctg cctcccgggt 6420tcaataaatt
ctcctgcttc agcctcccga gtagctggga ttataggcgt gcaccaccat 6480gccctgctaa
tttttttatt tttagtagag acggggtttc actatgttgg ccaggctggt 6540ctccaactcc
tgacctcgtg atccgcccgc ctcggcctcc caaagtgctg ggattacagg 6600tgtgagccac
tgcgcccggc ctaataaggc tgttttctta tcatttgtgt atccacggga 6660gtagcacttt
taaatttctt caatagcttt tcctttgcat tcacagcttg gctaactgtt 6720tggctcaaga
ggcctagctt ttgacctatc ttggctttca atacaccctc ctcactaagt 6780ttaattattt
ctcggttttt atttaaagtg agagacctgc aactcttcac ttgaacactt 6840agaggccact
gtagttacta attgtcctaa tttcaatatt gttgtttctc agggaatagg 6900gaagccagag
gagagggaga gagaaggggg aacagccagt ccagtgaagc agtgagaaca 6960cacaccatca
tgaaccatca tggccacggt ttggccaaca caattacaac agtaacatca 7020aagatcgtca
atcacacatc accagaacaa acatactaaa aatgaaaaag tttgaaatat 7080tgtaagaatt
atcaaaatgt gacacaaatt gagcagatgc cattggaaaa atgactccca 7140tagacttaaa
acacaggttg ccacaaactt tcaatttgta gaaaaacaca gtatctgtaa 7200attgcagtaa
agcaaagcac aaaaaatgat gtatgcctgt attttttttt ctatccacat 7260ccacccccca
actttttttt tttttttggt agagaccagg tcttgctatc ttgcccaggc 7320tggccttgaa
ctcctgagct caagtgatcc tctcacctca gcctcccaaa gtgctgggat 7380tacaggcatg
agccactgcg cccagccccc atccccaact attttaaaag tctaattact 7440ttttgacgat
tgccgactta cctatagaag tgttcctttc agcccggcga ggtggctcac 7500ccctgtaatt
ccagcacttt gggaggccta ggtgggcgga tcacgaggtc aagagatcga 7560gaccaccctg
gccaacttgg tgaaatcctg tctctactaa aaatacaaaa attagtggga 7620cgtggtggca
tgcaccagta gtcccagcta ctcaggagac tgaggcagga gaatcgcttg 7680aacctgggag
atggaggttg cagtgagccg agatcgtgcc actgccctcc agcctggaaa 7740caaagtgaga
ctccgtctca aaaaaataaa aagaagtgtt cctttcatca actatgaatg 7800gatcaagact
atgaatgttt ctgagaaaat tattgctgtt agtaagaatg tttttagtgt 7860ttaagggtag
ctttggggca cagaaaaaaa caaacgtaca tttataagaa aaaggggaaa 7920gagccactgt
agtatacctg ttttacagat gaggaatctg agactcttag agacttgtcc 7980acgctagtaa
atgaaatgcc cagtaccaat cctgggccgt atttccactt agtgtctcag 8040aattacctgg
actctgttaa aaatgcacat ttctgggctc cactgagaca cgttgaatca 8100gactttctag
ggctggaact gttatttttt ttgagacaga gtctcgctct gtcacccagc 8160tggagtgcag
tggcatgatt ttggctcact gcagcctctg cctcctgggt tcaagtgatt 8220ctcctgcctc
agcctcctga gtagctctgg gattacaggt gtgcaccacc atgcctggct 8280aatttttgta
tttttactag agatggggtt tcaccatgtt ggccaggatg gtctcaaact 8340cctgacctca
agtgatccat ttcaccttgg cctcccgaag tactgggatt ataggtgtga 8400gccactgcac
ctggcctgga acctttactt ttaccaaact ctccagtgat ccttatacca 8460tgaaccgctt
gggtcccaca ctccggtgct ttcctggacc cagacaaaaa taagaaccca 8520gcatagccaa
attaacagtc acgacaggct ggaaagagtg cggtatatgg gcctcagtca 8580cccattattg
gatggatgga ttttagtgct tatttataag gtgatttgta aactttccca 8640tatggtcaca
tttattcagc acaataatac aattaattcc ctgggaatga agtagcttta 8700tcccaagtgt
ttccaaacaa aactaactga aatgttccca ggttatagac agttgcctgc 8760acccagtcca
tctgttccgg cccagctggc agtagaaaga ctggtcctgg ttgtgaatct 8820gagcagctct
ctcccctaaa gccccgcctg ggaactgcag tgcctctggt tccatttgtc 8880agctgtgtct
ctctgaaaac tggggaaagc ttcccgatgc ctttctgctc ttggcagtgt 8940accgttgatg
gcattggctg gaacttcttc accccgcatt tttgttaggt gcagtgcaag 9000accctgtcgg
agactgaaaa caaaatgtaa aggtggtctt agcactgaag cctgggggct 9060cctgtggctg
cgcctttggt ctattgtgag agggggaaaa gccagctccc agtgctcatg 9120tgctgctgag
aacatatcat agggtggtgg cgaaggtagc ctctgcagtc actcatccca 9180gaaggttcca
taaagagctg agctcctaac actttttttt ttttttgaga cggagtcttg 9240ctgtgttgcc
caggctggag tgcggtggca caatctcggc tcactgcaac ctctgcctcc 9300caggttcaag
agattctccc gagtagctca gattataggc gcctgccacc acacccagct 9360aatttttgta
tttctggtag agacggggtt tcaccatatt ggccacggtg gtcttgaact 9420cctgaccttg
tgatccaccc gcctcagctt cccaaagtgc tgggattaca ggcgtgagcc 9480accgctcctg
gacttttttt ttttttcttt ttttaaaaag aaattgagtc ttgctctgcc 9540acccaggctg
gagtgcagtg gtgcagtcat agcttactac agcctcaaac tcatgggctc 9600aagcggtcct
ccagcttcag cctcccaagt atctgggact acagtcacgt gccatcacgc 9660ccagctaaat
tgtttataga gatgggatct cgttatgtta cccaggctgg tctcaaactc 9720ctggtctcaa
gcagtcctac cgccttggcc tctcaaagtg ctgggactac aggtgcatgc 9780caccatgcca
ggccttctac cacttttacc taggcctgct caccccaggc tggcacctcc 9840ctcagaggtg
ggaggtgaag gtggctgctg ctgggcattt aggaggtaca ggtgcagact 9900atgttgtgga
ttctcagagg actgagaagc tctctgtctc tctctctagg aggctttgat 9960agtgttatca
aagtgggtct ccagtgtttc tggaactttc ctccccttcc ttccttacct 10020aattccttct
ttaaattcca ggacctggaa tttatcctgt ctcgacactt cttgtccaac 10080agtgagcccc
ctgtcttcac ccccctaatc ccctgtcttc acatccctct tcccacttta 10140tttgctttcc
ccatggaccc atactcaggg ttttgggagg atggcagcgg agtatggtgg 10200aggagagcga
ttatttaaag tagaaaccca aagcttttat tatagtaaac ttggaaaata 10260caagagaaaa
attcacccct agaaaattat tgtaaacatc tagtttgttt ccctctgcat 10320attttaaaaa
ctggattgcc gcattttacc ttgtgttagt gtcttgattt cattgttacg 10380ccgcctaccc
cctccacagt gtacacttgg agtgcgtcct tctgtggtct agttattttc 10440acttggagga
aggaactttc acagaaaatg tgttgtgtca gtcctttcta ctcattaacc 10500aagaaatcac
aatgttgttt tctttctcaa tgtagagaag tcattcagaa ttcaaaagaa 10560gttctaagtt
tattgcaaga aaaaaaccct gccttcaagc cggttcttgc aattatccag 10620gtaagccgag
aacaaggttc agtcctacta ttttaggatg cctttcaatt tagaaaatga 10680ttgtatacaa
aagagcagct gcatttcttg ccaaagagga gagggctcag ttggcaatgc 10740tgttttcttt
ggtcttttgc ttctttgggg tcacgggtga agtactgccc tgcttgagtg 10800tgtcttcttt
tgtgaatttt cctctctgag gtatgggtag ctttcagagg tctaggaggt 10860aagcacttac
atgcattgca ttacacagga tatatcgtgg tccatgtgtc ctcacaaatg 10920cttaccaaat
gtgatttgaa ggtgctgggg gaaaaagtgg cactttgttt caatggacac 10980agatgctaga
tgaaacatgg atatttggta atctttttac tctatctttt aaaacctgcc 11040ataaacttaa
tgggacaaaa gtcattaccc tgtcaaaagg gctttgcatt ccacatactt 11100cttttggctg
caggctctgg ttctgagcca cagttggccc cagttgccat cctgcttggc 11160acacttgtgt
gatggcaggt gctgagaaat aaggaggtga gccaagcctg gaagcaaaca 11220cagcctgaca
gttgggctgc taccgcaggg cttggcatgc ccttagcgta tcaaaatagc 11280agcctctggt
tccaccgtgc tgcgtatttc aggattacag agctgtgctc aagttactta 11340cattctctgg
gatctgtagc cacatcttgt tacaaagaaa gacaagttag aacaatagaa 11400agaaaattag
attagggctc gggcctcctg ggaatctgct tactccctgg ctttccatct 11460aaaggcaaac
catttcaact cctacctttg gtatccccat tttttggctt ctttttgtga 11520tggggtcttg
ctctgttgcc tgagctggag tgcagtggca tgatcttggc tcaccgtagc 11580tttgaccttc
tgggctcaag tgatcctccc acctcagcct cctgagtagc taggactgca 11640ggcatgcacc
atcacgccgg gctaatgttt gtattttttg tagagatggg gtttcaccat 11700gttgcccagg
ctggtctcga actcctgggc tcaagcaatc caccgcctca gccttctgca 11760gtgctgggat
tacaggcatg agccactgtg cccagtcggt gtcctcattt ttagagagga 11820aagctggcta
ggttactatg tcttctaata ggagggatat tattttcctt aaaaaagtga 11880cataatttga
aggcctgagg aattactctg ttaaatactg gccccaagtc cctttttata 11940agcagcaaga
ttgtttttct gcaccttcca tattatcttt gtaattagat catctgtaat 12000cacaaaattc
atctctggat tatcttttgt gccttttcaa gctattttta tgttgttatg 12060ttgtttttac
cttcctaagg caggtgacga caacttgatg caggaaatca accagaattt 12120ggctgaggag
gtgaggactg ctgcttttaa aaaattcact ataactttta acaatacatt 12180ctcttatagt
gacgaataac cttgtgttcc tttttcaggc tggtctgaac atcactcaca 12240tttgcctccc
tccagatagc agtgaagccg aggtaataat ggcagagctc taaactcttg 12300cttcttcttt
cctcttgaga cttaataggc ccattggcgt ctcttagtgg tggctgggac 12360agatctaaga
tttgcttgtg actgaatgtt agagggaaga cttcacttct tttctctagg 12420gtaaatgctt
catgatatcc ttgtttcttg tcatttgtat gtttctactg aagacttgag 12480ttgcgacttg
ttgagttcgg ttgtgtttct tcagctgcgg tggcagcgtc atctagtgga 12540aattgtcagg
ccgaggcgag gaaactggac cttggaccac agatccctgt gccagggccc 12600ggcccttctg
tggttagctt ttcttcctac tggcttcatg tgacagaaaa cctcaaataa 12660tggaatctta
aataatgtac aagtttattt ctctctctct ctcttttttt ttttgagatg 12720gaattttact
tttgttgccc aggctggagt gcaatggtgc catctcggct cactgcaacc 12780tccgcctccc
aggttcaagc aattctcctg cctcagcctc ctgagtagct gagattacag 12840gtgcgtgcca
ccacgcctgg ctaatttttg tatctttagt agagatgggg tttcaccatt 12900ttggccaggc
tggtcttgaa ctcctgacct caagtgatcc accctccttg gcctcccaaa 12960gtgctgggat
tacaagtgtg agctactgtg cccggcctag aagtgtattt ttctgggaca 13020tgaataaagt
ttggaagtag gcagtccctg gttggtgata tcaccaagtc gcaggagctc 13080aggctcctgt
ccctctgctc tgccgccttt gtggtttggt taacaagcac tttgttctga 13140ttgatcagtg
aggcccagca ggtctgctag tcatttgtga gacaaagaat gggaatttgt 13200agagtttgta
tctggccttg tgctatgtaa acaagggggc attggtgagt cttatttgag 13260tcctatggag
aagagcagtt ctttgcagta cacagaggct gcagggattt gttaaccttc 13320cctcttttcc
aggatcacag ggctcaggta aagttcgttg tcaccaagaa gaaaaaggaa 13380ggacaaagac
aggcccctgc tgtctttaaa gacatttcct gaaagtcaca gttctacatc 13440catcccgttg
tcccctactt agttgcctgg ccaactgcaa aggaggctgg gaagtgtagt 13500ttttatacca
ggaagcctta gatcatttat tagcaagcta aaaaccaaga gtttattact 13560aagaagagaa
aggaaactat atactgggcg ataagtgttt gccacacttg ccattgtagc 13620gtggtacaaa
tatgaagtgt taaatatgta acaaatacct atgataccag cagtgacagc 13680atgcaaagga
agttgttgtt catttgcacg atacccactg tctgcagcag aatttgcttt 13740tctttctttt
tttttgagat ggagtctcgc tctgtcaggc tggagtgcgg tggtgcaatc 13800ttggctcact
gcaacctcta cctcccgggt tcaagtcatt ctcctgcctc agcctcccaa 13860gtagctggga
ttacaggcgc ccgccaccat gcctggctaa ttttttgtgt ttttagtaga 13920gacagggttt
catcatgttg accaggctgg tcttgaactc ctgacctcag gtgatggcct 13980gcctctgcct
cccaaagtgc taggattaca ggtgtgagcc actgcgccca gctggaattt 14040gcttttcttt
aggttgtttt gccactggtg tttttttttt tttttttttt tctgagacgg 14100agtctcactc
tgtcacccag gctagagtcc aatggcgcaa tctcctctca ctgcaacttc 14160cacctcccat
gttcaagtga ttctccctgc tcagcctccc aagtaactgg gtttacaggc 14220acccgccatt
atgcccagct aatttttgca tttttgtaga gacggggttt caccatgtta 14280gccaggccgt
tctcgaactc ctgacctcag gtgatctgcc ctccctggcc tcccaaagcg 14340ctgggattac
aggtatgagc cacctcacct ggccttgaca ctggtattaa tagcccttga 14400ttttcgtcgt
tgttgtttat aattgacaca taataattgt gcatgtgtgt ggggtacagt 14460gtggtgtttc
ggtacttgca tatgttgtgt aatgatcaaa tcaggttagt tacattcatc 14520acctagaacg
tttatcattt ctttgtggtg atagctttca agatcttttc tagctgtctg 14580gaaatataca
atacattgtc attagcttta gtcactctac tatgtaagag aacactggaa 14640tttattcctc
ctatctaagt gtaactttat acctactgat ggaccaacct ctccccattt 14700cccctccccc
atccctctcc agcctctggt aaccaccatt ctattctcta tttctatgag 14760atcagctttc
ctagattgca catgtaagtg acatcctatg atatttgcct ttctgcacct 14820ggcttatttc
acttcatgtt atgtcctcca ggttcatcca tgttgccaca aatgacagga 14880tttcattcct
ttttatggtt gaatgatact ccattgtgta tacacaccac actttcttta 14940tcctccatta
atgggcattt ggattggttc catctcttgg ctactgtgaa tagtgctgta 15000acaaacatgg
gaatgcagac ggcttctcag catagcggtt tcagttccca tgcgtatata 15060cctagtggtg
ggaatggctg ggtcatatgg tagttgtttt tttaattttg tgagaaatct 15120tcatactgtt
ttctctaatg gctatattaa tctgcattcc caccaacagt gtacaagagc 15180tcccctttcc
acatatcctc gccagcattt gttattgtca tttttgataa tagccattct 15240agcttgatat
ctcattgtgg ttttgatttg catttccctg ataattaatg atgttgagtg 15300ttttgtcata
tacctgcagg ccatttgtat gtcctttggg agatgtctgt tcacgtcttt 15360tgcacatttt
ttatttgaat tatttgtttt tttgttatta agttatttga gttccttata 15420tattctggat
gttaacccct tgtcggacgc atactttgca aatattttct cccattctgt 15480aggttgtctc
tgctttgttg attgtttgct ttgctatgaa gaatgttttt agtttgatat 15540aatcccactg
atgtattttt gctttcgttg cctattttgt tgaggtccta tccaaaaaaa 15600ccttgtccag
accaatgtca tgaagcattt cccctgtgtt ttcttctagt agttttatgg 15660tttgagacct
tatatttaag tctttaattc cttttgagtt gatttttata gaagtgagcg 15720atgagggcct
actttcattc ttctgcatgt ggatatccag ttttcccagt gctatttatt 15780gaagaggctc
tctttccaca atgtgtgttc ttagtacctt tgccttgact tattttttga 15840caaggaatca
tttattccgt atttttcttt gggaaaattc tcatagagtg aatttgtatg 15900gtgttatatt
gaaaaagagt aggtacctta atttattagt taagatttgt gttggcattt 15960gttgcttcta
ccaaatgctt ggtacaggtg gcaatagttt cctcgcttca ttctggtctc 16020tgcttggaca
tcgtggcctc tgaccaccct ccacccatca tcactctgtt agccttttac 16080cccactgaac
ctttattttt ctgcatgtca tttattcttt ccagtgattt tattattatt 16140attatttgga
agggaggaac ttccctcctc cactagagta taagatccac gaggacagtt 16200ttgtttatag
ctgtatttcc tttacttaga actgatacag agtaagtgct caaaccattt 16260gttgaatgaa
tgaaagaatg ccaaatgaat gaaacaaagc tttctttctg attttgtacc 16320tcaaatggtt
tcccctgcca ctgcacaggt agtgattttt gctaggcaat gaaacaaaaa 16380tggtaaacca
acccttcccc agagcctctg agtgagatct atatgcaaag ataagtatct 16440tctcacttgt
gagcaaatat ttttgttaag tgagatgaat caggatctta tctcctggca 16500taaaatagac
agagaggaca aagcattgct tatttagaga cacacccaga catttattag 16560cttcccaaaa
tgtagcttgg tctttttcct aaatagtgaa ttattttagt gtttcccata 16620aaaggagctg
tgcatgctaa gaatttttct agcataagcc tttggaaagg cttcttgtgt 16680gtcattgagg
tccagataac ccgagggcct ctgtgactgc ttggacgtca tgattagtga 16740gctgtgttga
acacattgtg tttgaaacgc acaggcttag ctgggccaga ttggaatgct 16800tcagattcat
gcttcctgtc cttcctgaaa actggcagtg tgtttggctt aaagtatgct 16860tgttgacaga
gaagtgtcct ttgtctatct aaaccttaat agctgttgag ctttggtttt 16920gttctgtttt
gtctttttct ggcacagagt agagccatag aaaggcagag aagtcactaa 16980aaaattcagc
caagcctttt aatttttttt ttgagccttg tacacatgat gaaaagaaaa 17040aaataggctt
tttattatgc tttacaccaa gtttccaaca aggagcaaaa caacaacaaa 17100ctataacaga
tgcattttca tgaatccact ttttatataa agcttccttt gttgggtctc 17160attttcacaa
aactaatttt tattttgttt ttttgttttc tctttagatt atagatgaaa 17220tcttaaagat
caatgaagat accagagtac atggccttgc ccttcagatc tctgagaact 17280tgtttagcaa
caaagtcctc aatgccttga aaccagaaaa agatgtggat gggtaagaaa 17340ataaaatcaa
ataatcaacc tttatggcta aatcactttt tctattgtgc ctgcttttat 17400ttttctaaat
ttcttctatt tatttcaact gtgtcagtaa ttggataaaa aaatcatttt 17460aaatatttca
ggagtctata ggattttttt ttttttgaga tggagtctcg ctctgttgcc 17520caggctagag
tgcagtggcg tgatcttggc tcaccacaac ctccatctcc tgggttcaag 17580cgattttcct
gcctcagcct cctgagtagc tgggattaca ggcatgcgct accatgcctg 17640gctaattttt
gtatttttag tagagacggg gttttaccat gttggccagg atggtgtcta 17700tctcctgacc
ttgtgatcca ctcacctcag cctcccaaaa tgttgggatt acaggcgtga 17760gccaccgtgt
ctggccatct gtagcctttt aaaattctgc atgcaaaagt tacatatttt 17820ggaaaataca
ttagcccttt tatagtgctt caatgccata tgatcttaca ccatcagatt 17880catttaataa
attccggtct gacttataca atctttagcc agctctattc ttctaatagt 17940tttttccttg
gtaattctac cagaccagag aagaaaactt agaaacctga ttccccagca 18000gtaccatatt
ctgtcttagt cccaggccag ttgagaaaag caaaataaat acttttctta 18060atgacacaag
gttttatttg aaatttggag gggaacatca taactattgg gtgctgtaat 18120gttggcctgt
gattcatgtc atggagtgac ataagcttca tagaacaaag gataccggcc 18180gtctccagtg
ggggataagc atgcttcctg gggattaaaa ccccattttc ctccatggaa 18240atataggaag
tgttttgtac tgtggccaaa aagaacacaa tgtgttaatg gttataggtt 18300gtaagcagat
ttataaagtt agattccaga tccggtagac cctgcagggg ctggatgggg 18360aggctaccac
cttcctgaga cacccagcaa tcttccattc tagcctttta ttgtgacttg 18420agtagaccca
aacacaggct gaaaaggtgt catgaacacc tccagtcctg gctgggcaca 18480gtggctcaca
cctgtaatcc tagcactttg ggaggccaag gcaggaggat ctcttgatct 18540caggaggtca
agaccagcct ggacaacata gtgagacctc atctctattt tttttttttt 18600tttttgagac
agagtctcac tctgttgccc aggctggagt gcagtgacac gatctcggct 18660cactgcaagc
tctgcctcct gggttcacac cattctcctg cctcagcctc ccgagtagct 18720gggactacag
gtgcccacca ccacacctgg ctaatttttg tacttttagt agagacgagg 18780tttcaccgtg
ttagccagga tggtctcgat ctcctgacct cgtgacccgc ctgcctcggc 18840ctcccaaagt
gctgggatta caggagtgag ccaccgcgcc tggccccttg tctctatttt 18900taagtaaata
attaaacatt tttttaaaaa acgacaatct ccagtcctgt cgtagactaa 18960ctgtaaaagg
agaagaagca ctaatccatg agaaaacttt cagaacttga aggtcctagg 19020gcatagtttc
aagtctttca taaaaggagg ggaattcatg ctgaggccca gccaggaaaa 19080accaaccaaa
caaaaacaac agagagtaaa aggagaaatt acgaaaatgg ttctaaatgt 19140ctccattctg
ttactgagat tgacgtgctc agtttaggct agatagctgt ggaatgtttt 19200gtcctagatt
tgaacaggga aaggcacaga acccttcaga agtgtgtggt cactcatttt 19260ttccactgcc
ccccaccccg acttctcaag gagataaata tcgaggtcaa gttgaaatca 19320gacctgtgac
ctgtgcctca atcccaggtc ttactgggtc actccagatt tataggattg 19380gttctgggtt
ttgagtggac tgcaggacca tgcttctgtg taactctagg gatgctagtc 19440cattggggtc
cattgaaagg agctccctgg agttgcgaag tgaagccctg tatggcttta 19500ggctgtgtcc
tggtatattg gcagaaccgg aacacccaat gtcagggcca gggtgaagtt 19560tcctaaggta
ctgccttaga attttctaag acctttattg aatggccttt attgcctgct 19620ttctctttaa
ttcatttttt atgggttttt tttttttttt gagatggagt ctcgctctgt 19680tgcccaggct
ggagtgcagt ggcatgatct caactcactg caacctctgc ctcccaggtt 19740caagcgatcc
tcctgcctca gccccctagt agctgggatt acaggcacgt gccatcacgc 19800ccagctacct
tttgtatttt tagtggagtt gaggtttcgc catgttggcc aggctagtct 19860cgagctcctg
acctcatgtg atccacccgc ctcagcctcc caaaatgctg ggattacagg 19920cgtgagccac
catgcctggt cctctttaat tcatttttat gtattctgtc tatcaaacat 19980tgacttggat
gttatttggg gttaatatat gtacattaca cgtgagtgat ttttttggct 20040gggctttgac
ttaacctact tctttatttt ctgattcaga gtaacagaca taaacctggg 20100gaagctggtg
cgaggggatg cccatgaatg ttttgtttca cctgttgcca aagctgtaat 20160tgaacttctt
gaaaaatcag gtaggatgct ccttgagaaa tgccgctgat ttctatttat 20220tggtatttac
attattttta ataaggattt gtttggggaa gaattaaaaa tttttttaaa 20280ttattattct
gatatccttt taggatgttt ctagaagatt ctaattgaac atagaagtga 20340tatttcacgt
tgactttttt tatttttctc atcttttggg agatacagat gaatacctag 20400aaactttttc
tagtaagacc atcttataaa ttaaaagaga aattagtaga gataaacgat 20460tagccctgtt
ttaaacatgc tcaatttctc tcatgaaaca gtcaaaataa ccacagtaaa 20520atatgcttcc
agtttcagct tactaaatct atcaatatgt agccttagat aagacttacg 20580ttgaagactt
aaatgaggcc aggcacagcg gctcatgcct ataatctcag tgctaggcga 20640ggccaaggcc
agaggatcac ttcaggccag gagttcgaga ccagcctggg caaacatact 20700gagaccttgt
ctctacaaaa aaaatttaaa aattagctgg gtatggtggt gtgcacgtgt 20760agtcccagct
atgctggagg ctcaggcggg aagatcgctt gagcccagga gtttgaggtt 20820acagtgagct
gtgattgcac cactgcactc tagcctgggc aatatagcaa gaccctgcct 20880caaaaaaaaa
aaaaaaaaaa aaaagagaag aagaagaagc cggtagatgt agtttagggc 20940acaggagcct
aggttgggat ttattggtga gcagcttgct tttcagtagt caggacttaa 21000tttccattgt
tgtccccttg acttggcgtt tttatgtcag tgaaatagaa atgaatattt 21060tctctttctt
cattattaag tttaatgagt aaattactac caagtattta tatatttaag 21120ggctttctga
gaacataaag attcaactaa ttgggccggc acagtagctc acacctgtaa 21180tcccagcact
ttgtggagcc aagatgggag gattgcttga gcccaggagt ttgagaccag 21240cttggggaac
atagggagac tccatctcta ccaaaaaaaa tttttttttt aattagccag 21300gtgtggtgac
ttgtgcctgt agtcccggct acttgggagg ctgaggtagg agaatggctg 21360gagcctggga
ggtcaaggct tcagtgagct gtgatcacac cgcagcattc catcctgggc 21420aacaagcaag
atcctatctg aaaaaaaaaa aaaaagaaaa gaaaaaattc aacgaactaa 21480atctttttct
ctactccagt ttgaatagaa atgaagcaaa tgcccctccc tacccctgaa 21540catccaacaa
tcaaagaaaa atttcaaaaa tatatggcaa agttttatta aaaggttaac 21600attcaggata
tatatgcttt ttggacagta atattaagct caaatctatg aaataatcca 21660tatatacagt
gctttttcat tacgtggatt atcaagatga agattctgag ccacttcatt 21720agaaagtact
caaagaaaag agtaaattag gctggtgagt tatagttaca tattgtggaa 21780agaaattcag
caacaggaat tggaaccaaa agggtttgaa tagagaattt ttgttgatta 21840caacacccca
ctcctagttc ctagtcattg taggttggtg ctgagaaagt tcaagtttta 21900aaagggtttt
ttgttgttgt tgagacattc ttgctctgtt gcccaggctg gagtgcagtg 21960gcgtgatctt
ggctcactgc aacctctgcc tcccaggttc aagtgatttt cctgcctcag 22020ccttctaagt
agctgggatt ataggcatgt gccatgacgc ctggctaatt ttctatattt 22080ttagtggagg
cggggtttta ccatgttggc caggctgatc ttgaactcgt gacctgaagt 22140gatccgcctg
cctcagcctc ccaaagtgct ggaattatag gcatgagcca ccacacccag 22200cctaaaaggg
tctttttttc ttaagataaa gttcttaaag gaaggatctg aagttttgag 22260tataatattg
aattttcatt tggttagtag gtgtcaacct agatggaaag aagattttgg 22320tagtgggggc
ccatgggtct ttggaagctg ctctacaatg cctgttccag agaaaagggt 22380ccatgacaat
gagcatccag tggaaaacac gccagcttca aagcaaggta aatttcatgt 22440tagaaacata
catcttgaag gcttctgtta gaacagaaga aagtctagaa atgtataagc 22500taagtgctta
attcaagaaa ttagaggaag aaaattgaat agacccatat ccattaaaga 22560cattgaatga
gtctactcac cagacaaaca aacaagaaaa ggcaacaggt ggttttacaa 22620atgagctcta
tcaaacaaag aacagatcat tctaatacaa tccttttctc cagcaaatgg 22680aaaaagaagg
aacgtggcct aacagacttt atgagtgcca aaaccaaaga agggtaacat 22740gctaaaggaa
aattaacagg ccagtttcag ctatgaatat aaatgaaata atcgtgagcg 22800aaatctgaga
acaccaaata cagcaggaag tagggtttat tttaggattt ccaggtgatt 22860gaacatgaga
aagtctgtca ctgtaattca ccacattgtg attaaaaact aaattctcac 22920catagtaagt
acagaacagt catttggtaa atctaatatg cattatgacc ttttgaaagt 22980aattataatt
ttgatgaaaa ttattaagaa gacccatata aaagcagagg tatcatcttc 23040atggagaaga
tgcaatatct taaagatact aattgtctgc aaattttgta gaatttgaca 23100gtctgattct
gaaatccaaa tggaagcata aaggacaaag agagtcaaga caattttgtg 23160gaaaaactag
gtgggagaca ttcccagcca cacagtgtga attattatgc acttacagta 23220attaggataa
tatggtagac aaacagacga tcggaacaga agaaagtgga aatttatgca 23280tatatggaag
cttgagagca ggtaccacag atcagtgggg aaaagattaa ctattaaagg 23340ccgggcgtgg
tggctcatgc ctgtaatccc agcactctgg gaggctgagg cgggcaaatc 23400acttgagctc
aggagtttca gaccagcctg gccaatatgg tgaaacaccg tctctactaa 23460aaatacaaaa
aaattagccg ggcgtgtcgg cgcaagcctg taatcccagc tactcgggaa 23520gctgaggcag
gagaatcgct tgaacccggg aggcaggggt tgcagtgagc cgcgatcgtg 23580ccactgcact
ccagcctgcg cgacagagtg agactccgtc tcaaaaaaaa aagctaaaac 23640attaactatt
aaaaatggtg ctgacttaac tagttatgta tatgagtcta gcataggatt 23700actaacctgc
cactacatat aaaaatcact tctatattaa aatacctaaa tgtggaaagc 23760aaaactgtaa
aacttataga aaaaaatata ggcgtatgca tttaagaggt tggcgttgga 23820aggatagctt
aagagacccc aaaagcacag attgctaaag aaaaggtata taaatctgac 23880tgcatccaaa
tttttaaaaa ccccaaagtt tttctttctg aaaagaagtt cttaaaaaag 23940tgaaaagaca
gctatagtgt agaagatatt tacaacaaac atttagccaa taattagtgt 24000cttataaaag
attcctacaa atcagtgaga caaagacaat ctgttagaaa aatggacaca 24060tgacagacca
ctaacaggtg aggaacccga ggtttaaaag acacgtgaaa agctgtctgg 24120tttcctgctg
tttttgctca ccagtcatat tggcagtagg taaaaagtgc cagttagcca 24180tattgacaaa
tgtctggagc tatgggaatt catatattgc tggcgggagt ttaaatttga 24240aataggcatt
tcggaaagca gtttaatgga gtttgagcta ctctttgacc caggaatttc 24300actctaaata
tatatgatat ggccaggcat gatagctcac ccctgtaatt ccagcacttt 24360gggaggctga
ggcaggcaga ccacctgagg ttggggtttc aagaccagcc cgaccaacat 24420ggagaaaccc
catctctact aaaaatataa aattagctgg atgtggtggc acatgcctgt 24480aatcccagct
attcaggagg ctgaggcagg agaattgctt gaacccggga ggcagaggtt 24540gcggtgagcc
gagatcacac cattgtactc cagcctgggc aacaagaacg aaactccgtc 24600tcaaaaaaat
aataataaaa agtgtatata tatgtgtgtg tgtatatata tatataatag 24660acaaaacttg
cccatatata tacaaaggtg tgtttacaaa tggtcagagc agcatgttta 24720tagtgcaaaa
agttttgttt gtttgtttaa gacagagggg cagtggtggt tctgtgggtc 24780ttggcccctt
tctgctgtca gggctgagtt ggctccatga gttggtgaaa gcggtatcta 24840gccaggttca
agggagacag gggcaggatt tcgccttcac tgatgggcag gccagcggtc 24900ccctgtggaa
cggtgactat tgaatattga gaggacagta tccctccctg agcatcttct 24960ggagcttgat
gaccactagg gaatgtatct gagtcacgcg caccaaatgt gttaccggtg 25020gcagctattc
gagttatccc gagttcgtgc tggtgtatcc gtacagttct gcagcaactt 25080cagctcttgc
ctcctcggaa gaaagaattg gactgagggc cataaggcag aaggagagac 25140tgaggcaagt
ttcagagcag gacggaaagt ctagcaaaaa gctctagaac agtaaggaaa 25200ggaaagaagg
aaagtataac ttggaagaga gccaagcagg cgacttgaaa aaccaagtgc 25260actgcaacaa
gtatttaaac agtgatctaa atgcccatca gggtgaaagg ataaatcagt 25320atagttgcat
gtagagtatg catagtcata gttcataaaa caatgatata caagcagtga 25380aaattaataa
actagagctg tagatagcaa catgttttct tgttttgttt tttgtttgtt 25440tgtttttttg
agacgaagtc tccctctatc ccccaggctg cagtgcagtg gcgcgaactc 25500ggctcactgc
aacctctgtc tcctgggttc aagtgattct tctgcctcag cctcctgagt 25560agctgggact
acaggtgtgc gccaccatgc ccagctaatt tttgtatttt tagtagagac 25620agggtttcac
catgttggcc aggctggtct ggaactcctg acctcaggtg atccgcccac 25680ctcgacctcc
caaaatgctg agattacagg catgagctac cgcgcccggg ctggaagaat 25740tttttaaaac
actgttcagt gaacaaaaac ccaagttgca gaaggatgta tttcatacaa 25800caccatttat
atgtgtctga aagcaacaca acatatatat atatatgtaa tctaatacat 25860acatacatat
aatatgtatt acatataatc cttatatata tatagaatct taatatatat 25920atttaggtca
ggacatttag atcactgttt aatatatata ttaggattat atataatata 25980tattatatta
ttatgtatta attatataat ccttatatat atattaggat tccatgcaaa 26040tgaagtaaaa
atataaagaa ataatgcaaa ctacgaacca aatccgaaaa gcatttattg 26100aggcaaggat
gtggatatag atggaaagga aacctagaca ccttcatgcc ttttttctta 26160aactgggcaa
taggtgttgt ctaggtatgt taggttattg ttttccattt ctatgtctga 26220aatgattgtt
tttcatttta aaaaatgatg gtacagctca cagatctaaa gcaaagtgaa 26280gttttaaatg
aatgcattaa gtacaatcag taactacaga atgtgatcta aaaatacatg 26340ttcctcatgt
tgcagggtga gaatggaaat gaggagtaaa ttgtaatcgt gccagggaag 26400ccaggtgtct
cagcagattt ggaatctttg catggcaggt ggatccaatt tatgtaatgc 26460ttgggttaag
acagctcagg aaggcagcct tgcaatgaaa cgtgaacctg tgtgcgtttt 26520tagcagtcac
cacatggtgg cagcctatcc aagaaaattc aagggcaata gtttgcactt 26580gagagcccag
aaattttgat ttggaagctc agtggaagag gaaaataaac cgtgccctgc 26640actgaacagc
tgtggtgaag agagaaaata ccttccttgg caattgaatg ttttcgcagc 26700tgtcacagat
acttttgcta gttgtatcta tatttttgaa aaacaaagac atttttcaaa 26760gtactttttt
attacgggaa taaatcacct tgtctctgta ctttgggatg aatctcatct 26820gaactttgag
aacgattaag tggcttcact ttctggagaa tctggtcatt gggtaacatt 26880cttgttttca
tatggaccag aaggttctct ttcatagttg atgttgcaca cacgtttttt 26940tccctaagtc
cttggtattt atagtggcac cattgcgaat gttagaggct atcaactttc 27000attcttatta
ttgcttcatc tttgtagcca aaaatatggg aagaaatgtt catagccccg 27060ccccatactt
gtccccctct acctggccta cctacctacg tcaggctcct gtcccaccac 27120aaatgacagc
ctgtgtcttt catgttagca gctagtcttt cactagccac atacttcccc 27180gtaacccata
tgcctccctg tgtctgcctg acgtttgaat tgtcactatt ctgttgttgt 27240atttttacaa
gaagaatgta tggggcagaa aacccttgcc attcatttcc tgagtaggaa 27300ttctagcctt
ccttggagtc gtggtaatat ggagacatgg taatatgtgg attgtaccac 27360tcaatggctg
tctggaggcc acagtggtgc aaaaagccac ctaggcatgg gttagtgtcc 27420tggggcaccc
aatcaggaca ggacacctgg cacacccttg gaaaagcttc actgtgaaaa 27480acactggaag
taaaaatttt aacaacagat agggattttt ttccccttga gacagaggct 27540cactctgtta
cccaggctgg agtgcagtgg cgcgatctcg gctcactaaa gcctctgcct 27600catgggttca
agagattctc atgcctcagc cacctgagta gctgggatta caggcgagca 27660ccaccatgcc
tggctaattt ttatattttt agtagagatg gagtttcgcc atgttggcca 27720ggctggtctc
gaactcctga cctcaagtga tccacccaca tggccctctc aaagtgctgg 27780aattagaggc
ctgagtcact gtgcctggcc cacatgggga tttttgttgc ccagcttaat 27840caaccaagaa
taaaagtgca ttttaagatt ataatctatt taacttctcc ttattggcaa 27900tggtaaaaga
gttaaggaaa aaagagaata agcttaggat tatggctact aagtagaagg 27960acgttatctt
tgacgggcat ttctgttgtg cttatttgta cctgacatgg aacaagaccc 28020ccccatcccc
tcaacacata cctgctgtgg gacatctggc tagagatttc caaagctgag 28080ggtgaaaggt
ttgggcatcg gtgtgactgt gtaacactag agttctatct accatctctt 28140aaactacagg
tagtcttacc ggtgtgggct gaccttccgg ttatcaggca gcctgctgca 28200accactagaa
actgcctttg aaggtttcca gtccggcaca acatttgtgt tctcattaaa 28260cagtagagca
taggctttgg ggacacacag ctctgggctc aagccttagc tgttgctgct 28320ctgagcctgt
tccctcgcgt agaaaggcaa tcataatact ccctacttta tggggcagtt 28380gtgggaatgg
aattaaatgc taatgtgggc cgggtgcagt ggttcacgcc tgtaatccca 28440gcactttggg
aggctgagga gggcagattg cttgagccca ggagtttgag accagcctgg 28500caacatggcg
aaactccatc tctaaaaaag tacaaaaatt agccaggcgt ggtggctgta 28560gtcccaggta
ctttggaggc tgaggtggga tgatcacctg aatctgggga ggtcgaggtt 28620gcagtgagct
gagattgcac caccgcactg caacctggag tgcaggctgg aggaagagaa 28680atatatgggt
caattaggca aatcactcag atcgtaagta gatatactag taaaatagcg 28740tgacttaggg
tccttcactg gaagcctggt attaatgctt ccattgtgtg ggtgggggaa 28800ggagagagtg
tctccagctc tctgtctgtg aggtctccca gccgaatatt ggccaaggga 28860gtgaaaccag
aagcaagact accatgagac aacgttccag agaaaggagg gactaaccag 28920agcaggtgcc
cagctttggc cagtcaggct catcttccag tcctcagtag attattatca 28980actttaagtg
atgtcacaga ctcccttcct ccaaagctgt ggacctcttc cttcccatgc 29040ctcttgttac
ttggaagtca ctctacttcc aaacatgtcg gctggagatc cctgatgggc 29100cagttactac
tttcaggctc tgtaggttta aagaatggtt atatccagtt agttacaaat 29160agaaatcgat
cctttctctg tccgtctgtc taaacggaag gatattctgt ggacgctgat 29220taggtataaa
atctccacgg agcctgagca tgaagaattg accttcttag ggttccacag 29280aaaccagaat
agacttctgt gtttggtagc tttgtaactt acgcccgttg aggtgtacgg 29340ttgaaaattg
agaataaatt tcctgttctg tttctgagca tcttttccag gcttggagat 29400gcgttgagca
tttccacgga gaccctagag gtggggaggg gaagaacgcc tgggcatttg 29460tccaaattag
ccatatgctg tggagtgtca catgtcagga agccaaggca ctggagctag 29520tgtttttggg
cagggctagg gaaccccccg cctttcctcc acccaaacgc aggcccccac 29580acccagcgtc
ctgggccctg caggggaatg acactgttgg tttcactctc aggaatgagc 29640ctagatttgg
aggtaacctc ttcctgaccc cgatgatttc actctgatgc caaaaccagt 29700ctgaagtggg
tgtggttttg catttggtgt ttacaaggtg gagtggtgcc ctctgcacgg 29760aagggagagg
tgggcaggta cggcagggcc ctgctccctc gggagtctcc tctgaacctg 29820tggttagggc
acagctggac aatgctgaaa tggtctccag ctgggagatg aggtaaacct 29880cagctgcatg
ccgaagaaat aacctagaaa tcagagaacc gttttatttt gtttttctca 29940gaggatctca
tttctccctc tcctcctgag cacagagacc ctcaaaactg gtattcacag 30000aaggaaaaaa
tataggtctt tcacctttga aggatcaaag tgtctagggt tttttgtggt 30060tttttttttt
tttttttgaa caaagctgaa ctattacaaa gtccttttca gtttaaattt 30120agtgcatggc
atggctcatg aaagactttt aaaagtgaat tggaaaattc taaatgtact 30180tgtattatac
atgtgaaagg atgacttctt aaaaaaagaa taatgtctgt cccaaacggg 30240agaaggaagg
aactctagct tttaaactgc actacattag tgagttacat gacaattctc 30300cttttaacag
gatggaatgc acttaaagca gtaatgtgct atggtgacag taatgacaga 30360caatgttaat
tgacagaatg gctactctga accaggcact gtcttgtact cctgttatct 30420cagtggtctt
cacacttatt ctgttaggca ggactgtcat cctccccaca ttacagatga 30480ggaaactgac
tgaggcttag agaggagact ttgcaaagat catgtgatta aaaaatggta 30540gaggtggcaa
aacagcctgt gtttttaaca ctgtcccatg tcagctctta ttccagatca 30600tcacataaag
tctaatctct aatcttgaaa accgaaaagc acctgccacc tcagataaga 30660gagaccaaga
ccctagacat tcgacttgtg ttgaattcac tcttattttg ttttgatgtt 30720tgtaatccca
gagccatact tgttaatgag ctgaagtaga gtgaatgctg tcccctccct 30780gaggcctggg
gctcatcatc agctctccag gctccccttt cagaggggat ggtgagcacc 30840ttcggagagg
gcagggatgg acacacaaga cagcctcagt tatgtgagtt catcccctcc 30900cctgcctgct
gtgggtgaga tggagaaaaa agcactggac cctcctgggc atcacagcct 30960tgtacttagc
gagtggctga ggtcatcagt agcccaggtc atcagtagtg tggaattaag 31020tgcactgaat
agtgttggcc cctgagcttt gtttgacaga tttagggggt cagctcacca 31080ctttcccctc
ttgcttctgt ctgatcattg ctctgtttgt taataccaag gaagaaagtg 31140aatagaactt
tttagagagc caaaaaaact ttattagact ttatgccatt cattatactt 31200agcagtgttg
agggaaggat ttgtgagacg atatattcaa actaatgggc atgagaccaa 31260gaaggggctg
tgtggcgggc gcggtggctc acacctgtaa ccccagcact ttgggaggcc 31320gaggcaggtg
gatcatgagg tcaggaatcg agaccatcct ggctaacacg gtgaaaccct 31380gtctctacta
aaaaaaaaaa tacaaaaaca aaattagcag ggtgtggtgg caggcacctg 31440aagtctctgc
tactcgggag gctgaggcag aagaatagca tgaacccggg aggtggagct 31500tgcagtgagc
cgagatcgcg ccactgcact ccagcctggg caatagagtg agactccatc 31560taaaaaaaag
ccggttgggg tgggggcatg taattaacag actgattggt cttaggtggc 31620tcagaattta
agctgagaaa ttaagattga gaagaaatga tgtattgtcg ttatatcaaa 31680tatgtgagta
aaaacataat taaaacacca tctatctcca gttttcccaa cattacctcc 31740cttttatgat
gcccttcctg taccttatta ggcaaaatat tctaatttca gtctcattgg 31800attatgttct
aaatgtttga aggttggtct gaacatggaa tgtattttct catagatata 31860atagtataaa
tggtgttagg gtcatgggat tgcgaagcca gttccactgc agtagaatca 31920tattgacttg
ctttccgatt tttctgtggg aaacttcatc tggagcttta atgcacttct 31980tagctgtggg
gtaagtttca gcctccatga cgcttgtgtt actttgtagt cacctgtggc 32040caaccacaaa
taaatctttg tggtcatcag gggccagtga agtgtatact gcacaaaaga 32100gagcctaccc
caattctgga agaagggctg caggcctgtg tcccacaaag aacaggcagg 32160aagcatcacc
ttcagcccac tggcagcagg tcatctccca tttgcttttc tgcactatca 32220gcagagtctg
gcttcctttt ccaccccctc tccaatgtga atgcttttta aggtcgtgga 32280attagaattt
aaaaaaaaaa tttttttgag acagagtttt gctcttctcg cccaggctgg 32340agtgcaatgg
tgcaatctca gctcaccgca gtctccgcct cccaggttca aatgattctc 32400ctgcctcagc
ctcctgagta gctgggatta caagcatgcg ccaccacgcc cagctaattt 32460tgtattttta
gtagggacag agtttctcca tgttggtcag gctggtctcg aactcctgac 32520ctcaggtgat
ccacccacct cggcctccca aagtgcaggg attacaggca tgagccaccg 32580cgcctggcca
gtattagaat cttttaatgg gaacgtctta aacataacct ataacaatga 32640agcaccttat
aagtgtgtgg agatttcctt ttcatggctt ttcatatatg cctcgcactc 32700acccagcaac
cctgcaaact ggatgttata ctcaccagag agccttgacc ctgattgtca 32760ctggcaagga
gcagtgtctg gacttgaacg caggtcgcag gactccactc atggtgctct 32820ccccactgca
cctcaattcc tcacttttca aaggaggcag cggaagcctg gagagtttct 32880caaaggaaac
ggaatgcaga aatctgtctg tgtgcaatgt gaggctcaga actgaaaacc 32940ataaatgtca
ggaaatgcaa agcagaattt tcccctgcaa cggcagtttg catgaatatg 33000aagacgctgt
atggatgaaa gtttgtggct gcagcatgag cataaatcct cggggctgct 33060gttcgtctag
gagcctgact tgcctggtgt tgccctgaga actgcaactt gagcatttcc 33120ttaattttgt
actaaaggaa ccaaaatgag gtattttggt tctaagtctt tgtcatgtga 33180aggaagcttc
ctgttaccac cccccatcta catacacatt ctctttctta gccccacgtt 33240tgaattttat
gtttttccta aagtataggg aagattatta ttattattat tattttcttt 33300ttcagacgga
gtctcgttct gtcaccaggc tggagtgcag gcgcgtgatc taggctcact 33360gcaagctctg
cctcccaggt tgaagtgatt ctcctgtgaa agggaattat ttttgatgag 33420tcattaaagt
atatccattc ccagaatggt gctgcatttt tccttttatc atctgtcatg 33480acttttttaa
gtgtgaagct tacttccaga ggaattgtct tttctagcat tttggacaca 33540actaaatatc
tttgatattt tcttctcata tgttgagcag taaattaagt tgaatggtga 33600agtcagaggt
gaacttagga acatttagaa aaaggaaggt catttattat ttttcagggg 33660aaaagtttgg
tagaaaaatg ctgtaatcct aattcccaga tgtccatttg gcttgtagtc 33720attttgattt
atgaaataat cataagaaat gtcttcaaat tttaaaagcc aataatttgt 33780tctactaatc
acagatgaga gtagaaataa aagaaacaca cccaattcct gaaaggaaaa 33840aaattctccc
aggatgagat cactgggtgg acgtgaagtg ggtaaaatac cttttgctgc 33900aattgtcatc
cccttatctt gccaaagtac gacaacatct gaatgaagat ggtttgatgt 33960tcttagttta
gagtagttat actattattt taacattcct atttaatgat atgttttaat 34020actgcagggg
aaaggtagat acagattcat agctacaaag caatactctt atgatcaatg 34080ttcatttaat
ttcaaataca aagttagaca ttcagagtta aaaaaaatta atgcataggc 34140tgggtgcagt
ggctcacgcc tgtaatccca gcactttggg aggccgaggc aggcggatca 34200tgaggtcagg
agtgtgagac cagcctgacc aacatgggga aactccatct ctactaaaaa 34260tacaaaaatt
agctgggtgt ggtggcatgc gcctgtaatc ctagctactc aggaggctga 34320gacaggagaa
tcgttgaacc caggaggcag aggttgcagc gagctgagat tgcaccactg 34380cactccagcc
tgggccacag agtgagattc catctcaaaa aaaaatagat atatatatat 34440tttttatatt
atattatata ttatatattt atatttttat atatataaat gaataaacaa 34500ggcatctgct
tatgaaacag tctgcctgaa tgaaatctcc atttgcaggc aacacagtct 34560acagtaatgg
caggactgta ttaggcatgg ttcctaatgc ccatttctgt agcaaagtca 34620ccagtagttc
ctaattgcca aaagtgagtc accggtagtt cctaattgcc aaaagtgtat 34680tttttgcttt
tatcttgctt gaccacagct gtcatataaa cctgttggcc acaccctttt 34740cccttagcct
gaatgaaaat gccccttcct gcccttcgta gctcctgagc caccaggaca 34800tggtggtctc
catcatctgt tcgttatctc taaccattta agattagtgc tccccaagga 34860tgtctgttaa
cctgctttcc ttcccattct gccagtttcc cctggggaat ccacacttct 34920ggttttaacc
atcacctaaa tgctgtgtgt cccgacccgc atccgcatgc tcccaggcca 34980ttctcagttc
cagacccggg cccccagctt cctgcagacg cctcctccag gagatctcag 35040tatcttgctt
ttcctcctac cccttctaag gcattctcaa tccagcttcc agaggaatct 35100tcctaaaaca
catctgactg tgtgagccac atactgtagc ccaacctgac agcttctttt 35160ctatattaag
aaaaagtgcg gccgggcgcg gtggctcacg cctgtaatcc cagctctttg 35220ggaggctgag
gcgggcagat cacgaggtca ggagatcgag accatcctgg ataacatggt 35280gaaaccctgt
ctctactgaa aatacaaaaa attagccggg catggtggca ggtgcctgta 35340gtcccagcta
ctcgggaggc tgaggcagga gaatggcatg aacccaggag gcggagcttg 35400cagtgagccg
ggatcgcgcc actgcactcc agcctggacg acagagtgag actcgtctca 35460aaaaaaaaaa
agaaaagaaa agaaaaagtg cacacatttt gacatctgag aaaagtgatg 35520tttttcttag
aaaagtgaca tctaactctt tatagaaact ttgccacaag gtagtctagg 35580agtcccctta
ctacctaaga ttttcatgga catgccctgt ctgatcataa agcatgtggc 35640acagtggctc
acgcctgtaa tcccaacact ttggaaggcc caggcgggca gatcacttga 35700gcccaggaat
tcaagaccag cctgggcaac atggtgaaac cctgtctcta caaaaaatat 35760ttaaaaaaca
aaatgaactg agcatggtga agcacatcta tagtcccagc tacttagagg 35820ctgaggtagg
aggatcgctt aagcccagaa gtcagggctg cagtgagctg agactgcagt 35880gccactgcat
tccagcctag aggacagagt gagaccccgt ctccaaaaat aaataaatta 35940attaaataaa
gccttgtgaa gaatctgccg tggagcataa tgtaacccat aaagccactt 36000aactgggcgc
acataaacta cataaactac agtgtgtccc agttcattga gagtgaaaat 36060caagcaaggg
accacctcag tgtggtagag ccccctccct gccctccaga ccctgctctg 36120ttctctgtgt
ctgaccattg actggccatc actcaggcac tagttgagaa tgccaatgtc 36180tttagttatt
gtaatttcag aagcacatcc ttgctggaca aagaacctac ctgttaattt 36240tttaaagtaa
tgttcatcat tttgaacatt gttttctttc ttttttattc agctgaccct 36300catgaggaat
ttaacttggt tttctgaact gtgttggtaa catttgaagg acatagaaac 36360agttttcata
ggtctcaagt gatataaacc acaattcttt agaaaatatc tttttttttt 36420tacccattct
ggattttgaa atatatacag atctcatacc tcagattctc atatgatatg 36480aactgtttca
aaacacattg atagcagaag atacatactg tcttctagtc aagagggtag 36540cgctgtattt
agtttaaaac attttaatgc attttatttg ctaaatacaa aattataata 36600ttctgtataa
ctttccaaat ctcatctgtc ctccctttcc cccatctctt tatattcaga 36660tgcttgtccc
tccagcttaa aaattcatga ctttgagaaa attcgtcttc atgttatcta 36720aactactgca
cccccacacg ctccctgtta agtgtctgca tatctgtgtc ttactgttgt 36780ttaaaattca
cttgtctgcc tggggagtgg ggtggtttgc tttaagctgc tccactcaag 36840tctcagcaga
gaaagcaata atttgttatg agatagagtt ttaagccaag ctttggccag 36900ggtctctccc
atttgggagg aggaaaatag gaacaagtga gatgataatt ttccttgaat 36960tttagctcat
taaaatcttc ccttgtcctg ggcaggctat ttactgattt aagcatctag 37020cacttttcct
ttccagaaag aattaacatc tatgatgcaa tcattgctaa gggaggaggt 37080ttttgtgttt
tctctgtgga attagctgtt ctccagttgc agatgagcac agcccacttt 37140agagttgtgt
gtaggtatgg agttgagggg tcggaatgag atttccaaga tttataggta 37200atgaaaagaa
caggttgtat caaagacacc acgtgtagga aactaactgt tcttagggga 37260cttgtgtgaa
cctgtgatcc ttggctttct ccaaaatgaa catttcagcc ttgttcatat 37320taacctcctt
tttacagttg ttttaccgtg ttctggtttt gctgctattc tgcctgaaaa 37380tctagcaaaa
tggcgtttct tatctgcaga actgcgacaa agcttaaaca tttgtatttc 37440tcttgaggat
tactgtggtt gaatttttga cagtactata tataaatgaa gaaataaaat 37500tttttataca
gatgtttaaa atcctctttc cactagaaca tttctttctc cctaaatata 37560ctattacctg
gaactctagc tgtttttcag atctgtagag atcataggtg gcttaaggga 37620ctagattgct
ggtgatattg gctgcaaaat tttttttttt tttttttttt tttttttttt 37680tgagacagag
tcttgctcca tcgcccaggc tggagtgcag tggtgcaatc ttggcttcac 37740tgcaacctct
gcctcccagg ttcaagtgat tctcatttgg ctgcaaagtt cttagtgagg 37800tttagtggta
ttttggccct gtggcacaga agaatggtta tttagcaaat gtgttttgct 37860tttgattttg
ttttgttttt gagacagggt cttgccctgt tgcccaggct ggagtgaagt 37920ggctcgatca
cagctcactg cagccatgac ctcctgggct caagcgatcc ttccacctca 37980acctcccaag
tagctaggac cacaggcatg cctcaccaca cctggctaat ttttatttta 38040ttttattttt
gtagagacag ggccttacta tgttgcccaa gctgttctca aacttttggg 38100ctcaagcagt
cttcccaccc tggcctccca aagtgctggg attacccagc atggcagcat 38160gagccaacag
gtctggcctc ggcaaaggtt ttagtagggc tttctctttt cttttgaaga 38220tgattaatac
agtagattca tacagccaga agctgaaaag cagggaatac tgtttggctt 38280attaatggat
aatcaatcag tgttaataca cttttcccta tttacatggt caaattgatc 38340aaatatttcc
aaggtattag cttttaaaag ttgactttgt cagctctatg aaaagacatg 38400catttacatt
tttatcgtgc tcctcctctg tcaggggact tggatctgtc cttgatcttg 38460gcttttctgt
tcataaacca ggtaggactg tcatgggttt gctcccatta ccatctcttg 38520tcccaagatt
tctaaaaggc atcacctgat gcaccctttt gcccaaatag tgttaaaagt 38580cctcaggctg
aggagctgtc ctggttcttc tacaggatct ttaatgatca gcctgtctaa 38640gctgagaatg
gataagggtg tccccaacac gtttatttgg tggtttggtg ggggtagtga 38700cactggggat
gcttttgtgt gttgtaatct gcccctctgt ctgtttgacc ccagagactc 38760tgtggttagt
tgcaggacat caaaatacgc aggaaagaca gtgggggaag agaccatcag 38820tcgccctctt
gctgcctagg ttcactagaa tctctaagac tgaattcact aggagacaca 38880ggacataccg
tagagctgtt tatgtcttga gagccccagt ttcctagtct gttaaatgag 38940catagtaaca
ccaatctcac tcctttggaa tattcaatgg tgtaatactt acaacatcta 39000gctgacctgt
acaattatag ctattctctt ctgcaagtct aatattccta ttcacccctc 39060cctctgcata
gggatccagt ttgaaatcat cagttgctca gtagggcctg gggagctagc 39120tggtcattct
ttggttaaag tcaaagccaa cctctttgcc tacctttttg tcttaatgtc 39180aacttcctac
cagacatgtt cctcccgcac tggatgatca aaatgcaaat aggatggaca 39240cgatcaatgc
attttgttgg agtggttgca ttctttctac tttcttaggg caaaaaaatg 39300aatggttgga
caagagacta accaaggcct caaagaaagt gaaattcttt ctcaagctcc 39360tgaaagtcct
ttcatgggta tagtaacagt tctatagggg tctcattgtg atggtgccat 39420tatacaggcc
agcaagcccc ctgtgtgagc cagacctcat ggccttagcc cttgttttcc 39480tctttttttt
tttttttttt tttgagacag ggtctcgctc tgttgttgga gtgcagtggt 39540gtgatctcgg
ctcactgcaa cctctgcctc ctgggttcaa gtgattctcg tgctgtagcc 39600tcctgagtag
ctgagattac aggcccccac cagcacgcct ggctaatttt tgtattttta 39660gtagagacag
ggttttacca tgttggccag tctggtctcg aactcctgac ctcgtgatcc 39720acctgcctca
gcctccctaa gtgctgggat tacaggtgtg agccactatg cccggctgcc 39780ccatcctttt
tcttgatatt atagttagtg gtgaaaacat gtaaatgata atactccatg 39840cagtatgtat
gctgaagtga ctaccctgaa gacatttttt aagtcctatt ttcttttctt 39900tctaggatat
gataatcatt ttctgtttgt attaatacat gcttgaaagg aattagtaaa 39960gcgattccat
ccttgtcagc agcttggaac aaagctgtgg atctgggatt gtgtagaact 40020catttgacat
cagtagttac ttgtgtcttt atgcctcaga tcaagatgtg cgtgtttttt 40080tctttctcct
tttagcttca cgaggctgac attgtggtcc taggctcacc taagccagaa 40140gagattcccc
ttacttggat acaaccagga actactgttc tcaactgctc ccatgacttc 40200ctgtcaggta
aatgtcttca cattggtgtt gagccactga ttaggtcaga tagttcaaag 40260aaatgttaac
cttttcctaa gaggttatgt ttctgaaatg ttagcagtcg tcaacattaa 40320tatatagggt
aggaaactta gactgtgggc tacctgatag aaccatgctg ttgagcaaag 40380agagaccaag
atggcggttg aggtgtctgc tcccacagtg aagtgccccc tactggcctc 40440tagcagtcag
gtcagttgtg acctcaagta gatgatttgt atgtgtgtga ggtaaactag 40500aaatagcggt
tttcacagtt tcttaaaaat tagtaaaacc ctttcttcca aagaaatctt 40560cgtggaatct
cactatgtaa aagcagatga gcgccggatt gttgggaggg atgggaatag 40620gatcagcctg
tgtgtctccc tgatttcctc ttgcattgac ccatgaagca catccttgga 40680gccccagagt
catctcagaa cagtatgaaa atcaccacta gatgcagaaa ctgttgtagg 40740attggtcctg
ctatttgatg gcattttgcc aaggtatcca ggaatactca agaccacacc 40800aggcccactt
tgggggcagc ccaagcagtg cccgcaacag ggcagccagc agccccgggt 40860ccagcaggag
ccggcccaga gtgggttcct ttttccagtt cgtagaagga ctgaggctaa 40920gaaagccttg
gagcacaccc aggccctcag caagtccctg agtagcacag gtgagccaga 40980gacgatggca
cagggaggca gaggagagcc cctcttcctg attccttctg gctttggcct 41040ccaaatgttt
gtgtggaaga gaggactgtg ctgagctggg ctggaggaca ggggcagctt 41100ttatccctaa
agctgggcag aacatgcacc aaggttggca gcctctgctc tccttctcct 41160ccaccaggac
catacttgag gttgaaatct tgaaaccttg taggcagctc cataaaagca 41220tgagccctag
ccagaggttg ttactgaccc tcctctctag gaaatgtagc tcctgccaga 41280attgaccgat
cagattaaaa accccttggg ggctggagag ggtggaatgg aacagagtcc 41340aaggaaagag
gcattaagag ctgtggccct aatggctctt tttaacaaac agaaagcatg 41400tactaaataa
cattttaaaa atatgtgtaa tttattcata taactttttt tcccctgaaa 41460aaaaaaaaaa
acaaagccta acctattaat agtggacagg gcttcctctg ttcagcatcc 41520cagtcagatt
aggaaaatca gttccaacat ttgtttcttg taaatgaatg ctgctttgct 41580gtaatattta
cttgctgtgt tttattaaag ttttcatgta ggacagtttt ggcagtcatg 41640gctggagata
ggagtctact ttgatgttct ggaatctctt tgagctctgt ggttatatga 41700cagtgacata
tattttgtta gtgaaagtgc tttattttta ataaggtttc cttttaggac 41760ccttattttt
gaaaaaaagt attccctgaa atattagata tcaatgtaat gattctctgg 41820gggaaaaaaa
gaccaaggcg ttacagatgt tcctcgactg accatggggt tatgcacaga 41880taaacccatc
agaagttgaa aatatcatta gtaaaaaatg ggcatttgta gccaggatgg 41940gatgtgaaaa
cagaaaacaa cgtccaaaaa tgctggcagc acagtacgct gtagagtatt 42000ggttgtttgg
cctcgtgatt gcatggctga ccggaaacca cgatttgctg tcactgctgc 42060ccagcgcctc
aagacagcct tgttctttgt atagctggcc tgggaaaaga tcaaaattta 42120aagcacagtt
tctactgaat gcatgtccct ttcccaccat cataaagtgg aaaaatagta 42180agtgaaaagg
ttgtaaatca gggatggtct gtgtatttca aaatgattct ataagtagaa 42240tttatttgaa
atttatctgg cccaactgta atagttttct tgttggtttt tgaatgccca 42300gtgtccagaa
tagtacctga cgcttagtaa ccgctctgtg aaattttttt tttttttttg 42360gagacagagt
cttgctctgt caccctggct ttagtgcagt gggtgtgatc ttggctcact 42420gcaacctcca
ccttctgggt tcaggtgatt ctcctacctc agcctcccaa gtagctagga 42480ttacaggtcc
ctgtcaccac gcctggccaa tttttttttt tttttgtatt tttagtagaa 42540gtgtggatgc
accatgtttg ccaggttggt ttcgaactcc tgacctcaag tgatccaccc 42600acctcggcct
cccagagtgc taggattacg ggcatgagcc actgggcctg gctaactctg 42660tgaatattga
gtgtttgaca agttaataag catgaatcaa aattctactc tgcttgtagt 42720tttaagttag
tggcatgttt agaaacattt ccttgaattt gtctcccact taaagatggg 42780gctgatggcc
aggcacagtg gctcacgcct gtaatcctaa cactttggga ggccgaggca 42840ggtggatcac
ctgaggtcgg gagtttgaga cctagcctgg ccaacatggt gaaactcttg 42900tctctattaa
aaatacaaaa aaattagctg ggcttggtgg caggtgcctg taataccagc 42960tactcgggaa
gctgaggcag gagaatggct tgaacccagg aggtggaggt tgcagtgagc 43020cgagattgca
ccattgcact ccagcctgga tgacaagagc tttttttttt ttctcaaaaa 43080aaaagaaaag
aaaaaaaaat gggctcaact aggcagtcat tgagataaag atgaggaatg 43140ctatgttgat
gaaacagatg aacagtttcg gggtggtgct acccataata gcacatttct 43200aaatagtcta
caagttttaa ttttagcaac tttgtttagt atgaaaattt gatctaggaa 43260taatggagaa
aggttttttt tcccctcttt ttttcataat cccactactg taaagaaatg 43320aataaacttt
ttgaaatatt aaaaagaaaa gagctcttat ttaaacttaa gatgagaaga 43380acttttaata
aacttgtggc tgggtgaaga cagagtttta taatgaaatt tttattctta 43440ccttcatggt
acagagtccc tcctccttac ccccgtaaaa atagatggga tgttgtgatt 43500tatgatgagg
caattctgta gagacatttt atggaatacg taatcctagg agagattttc 43560ctgatggcta
atatcccaga cacagagctc agaaaacaat tatacccagc tagttggttc 43620tgttcactct
actattgcat gagtccaatg tttagatttt aaaacaagga aatttcaaca 43680ttctgttttc
tagctgtaag agtaactact ttaaaaaaaa aaaaaaagac aaccaaaaaa 43740acctatttct
tagaaaaagt cttgctctgt tgcccaggct agagtacagt ggcacgatca 43800cagcctcaaa
ctcctgggct ccagcaatcc tccctcctca gctaattaaa acaatttttt 43860ttgttgttgt
tgaggcaggg ttctcactgt gttgcccagg ctggtcttga actcccggcc 43920tcaaatagtc
ctcccgcctc cacctccaaa aagtgctgag attactggtg taaaccacca 43980tgtctggcct
aaaaaaatta tttcaaatga aaaaactacg cactgctctt gtagtgccct 44040gcttgctaaa
cagcagattt catcataagc aaaatcattt tgtaccctaa atattaggac 44100ttatttactg
atgtaacaaa tgggaccaag gagaagctgg tcttccgtct taggctcaga 44160aaacagtatc
aaaacagcaa ctctgtggct cagagtaatt tatagcatca ttttaagcta 44220atgtaattct
atttagtgtg gaagagggga agagccctgg gatgctgatg tgctgggggt 44280cccgagggac
cttgagactt tctctcgtat ggctctgaag ggagacatta gaacttactg 44340gaggctgggc
gtggtgtggc tcatgcctgt aatcccagca ctttgggagg ccgaggccag 44400tggatcactt
gaagtcagga gttcgagacc agcctgagca acatagtgaa acctcatctc 44460tattaaaaat
acaaaagtta gccgggcatg gtggcaggca cctgtaatcc cagctacttg 44520ggaggctggg
gcaggagaat cgcttgaatc caagaggcag aggttgcagt gagccgagat 44580cacgccattg
cactccagcc tgggcagcaa gagcaaagct ccatctcaaa aaattaaaaa 44640aaaaatgtag
tggaaagaac atggactttg gagtcaggat taacctaact caaggacaag 44700ggagctgttt
ttgtaaaaaa aattcttggc cggatgcggt ggctcacgcc tgtaatccta 44760gcactttggg
aggctgaggc aggcggatca cctgcggtgg ggagttcgag accagcctga 44820ccaacgtgga
gaaaccccat ctctactaaa aatacaaaat tagccgggca tggtggcgca 44880tgcctgtaat
cccagctact caggaggctg aggcaggaga atcgcttgaa cccaggaggt 44940ggaggttacg
gtgagctgag atcgcgctcc agcctgggca ccaagagcga aactccatct 45000caaaaaaaaa
aaaaaaagaa attcttaatt gcattgcact tacaggctgt gtgactttga 45060ggaaataagc
tctctgatta attaataaga ttcccaattt ctttgtgggt aaattgggag 45120caaggctgtt
gggaggaatg actgagatga tacctcgggg cagtgcctct ttgatggtag 45180ctgccaataa
atgctggtcc acttcctgca ttcctccctc cacttccttg ttgccatgtc 45240acctgcacac
aaaacagctt tcagactcca tcttggcaca atatacattt atatttgaaa 45300aatcaaaccc
aagaaaggaa gaagtccaat gaaattctga acatgtaaga aaagtcagct 45360ttagagaaat
tggacatccg tgtaattttc attatgccaa agcaattgat ataaaattag 45420gatctttgct
tgtgactctt gaactttagt aattccaaaa tccattgtac ctatcactgt 45480tttcttttat
ttattttttc ctgtaaattc tggatattct agatctatcc ctgaagtata 45540ttgccagttt
tccatttcaa attttaatta gttctcaata caatgtattt aatattttgc 45600tttttttaaa
aaaagtaaag gttcatgtca tttagaatat gcagcactgt ataaatttcc 45660ttgttaattt
ggtttcagtg gtccaaatta agttattact aatagcttat tagtatcagt 45720ttactaatat
tttaattttt ttaatcatgt cttacattgc attcttttaa aaactagtga 45780tatgctgtca
tttgacagta agaattcaaa taatttaaac taggattgca actctcaaga 45840acactgtcac
aaaaatttta ccaaatgata tgtattagtc cattctcaca ctgctataaa 45900gaaatacctg
agactgggta atttataaag aaaagaagtt taattgtctc acacttccac 45960ggactgtaca
ggtagtatga tgctggcatt tgcttggaag gccttctgga aggcctcagg 46020aaacttacaa
ttatagtaga aggggaaggg gaatcacata tatcttacat ggccagagca 46080ggagcaagag
agagggaggg cgtattagtc cattttcgca ctgctgataa agacatacac 46140gagacgggga
agaaaaagag gtttaattgg acttacagtt ccacatggct ggggaggcct 46200caaaatcatg
gcgggaggtg aaaagcactt cttacatggc agcagcaaga gaaaatcagg 46260aagatgcaaa
agcggaaacc cctgataaaa ccatctgatc tcatgagacg tatttactac 46320catgagaaga
gtatggggaa aaccaccccc atgattcaaa ttatctccca ctgggtccct 46380cctacaacac
gtgggaatta tgggagtaca attcaagatg agatttgggt ggggacacag 46440ccaaaccata
tcagaggagg aggtgcatac tacatacttt taaaaaccag atctcaggat 46500aactcactca
ctatcatgag aacagcacca atgggatggc gctaagcctt tcatgaaggg 46560ccacccccat
gatcctgtca cctcccacca ggccccaccc ccaacactgg ggattacaat 46620tgaacatgag
atttggtgga gacacagaac caaaccatat catgatactc attttattta 46680ttgcatttta
ctttatatcc tacaaagtca atatatttaa ctgaaaaact ggctaccagt 46740gtttttgttt
ttttttttca agttctacct gtactgattt ctaattttta gccattcctg 46800attttaaaac
ctcgcataca ttttggggat aatttgttct cctagatggc gagaaaatat 46860ttaagtctgt
gggcttggag aatgcttttt tgataagaag gatgcccttg gccagactgc 46920atagaagctg
agatgtttcc gcagagccga gatggcatga aggctgaaga attttcctgc 46980atccctccca
ctgcagtgat tgttctggca aagttcccag ggtggtctct gcactgctca 47040tattagtgaa
agggtggttc tgatttctga ggaaggcatg ataacacttt ggccttaaat 47100tttctttttg
cagaaagttt gattatcatt agaagaatgg ccatctttta aagtgattca 47160ttgacttgtc
cctggagaaa tcatgagtta gatgagagtg tttagtggga agtgaaaaga 47220cccgtccacg
gtttcggatg agatcctcgt tggactgtga tcccaggata ttacccaggc 47280ccagatcgta
gacaacctcc caatggagtc ctgcccctac acgggcaggg ggtgaaataa 47340tgattccctt
tcttaatttt ctttagccat gaaatttatt aaaaatgtat gtctgattct 47400gagttgccag
tcttaattgc ttgagattct ctggattcag aacttatttt tatttttttt 47460atttttattt
tttgagacag agtctcgctc tgtcatccag gcaggaatgc agtggcgcaa 47520ctgcaggccc
actgcatcct gagctcaagc gatcctcctg ccccagcccc caggtagctg 47580ggactacaga
catatgccac catgcccaac taattttttt ttttgaggca gagtctcgct 47640ccctccccca
ggctggagtg cagtggcatg agctcggctc actgcaagct ccgcctcccg 47700ggttcacgcc
attctcctgc ttcagcctcc cgagtatctg ggactaaagg cgcccgccac 47760cacgcccggc
tagttttttt ttttgtattt ttagtagaga cggggtttca ccgtgttagc 47820caggatggtc
tcgatctcct gacctggtca tccgcccgac tcggcctccc aaagtgctgg 47880gattacaggc
gtgagccacc gcgcccagtc ccaactgatt tttgtacctt ttctagagac 47940gaggtttcac
caggttgctc aggctgctct cgaactcctg gacttaagca atcctcctgc 48000cttggcctct
gaaagtgctg ggattacagg tgtgagctac tgcgcccagc ctgaatgcag 48060tcttaaaact
gaattagaaa atctgagtgc aggctagagt tatatataat taagagagtg 48120tgggttttaa
gttgtgtctt aaaattgtgt tgtttttggt cgagtctggt ggctcacact 48180tgtaatccca
gcactttggg gggcctaagt gggctgattg cttgagctca cgagttcgag 48240accagcctgg
gcaacatagt gaaactctgc ctctacaaaa aatacaaaaa ttaagcccag 48300gtgctgtggc
tcatgcctgt aatcctagca ctttgggagg ccaaggctgg tggatcatct 48360gaggtcagga
gtttgagacc agcctggccg tggtgaaacc ctgcctctac taaaaaaata 48420caaaaactaa
ccaggcgtgg tggcacatgc ctgtagtccc agctacatgg gaggcagagg 48480cagaagagtc
ccttgaaccc aggaggcgga ggctgcagtg agccaagatc gcaccactgc 48540actccagcct
gggcgacaga gcaacagtcc gtctcaaaaa caaaaaacaa aaattagcca 48600ggcattgtgg
catgcgcctg tagtcccagc tacttgggag gctgaggtgg gaagatggct 48660tgagcctggg
aggcgggggt tacagggagc tgagatggtg ccactgtact ccagcctggg 48720tgatcgggcc
agaccttatc tcaaaaaaaa gtgtttttaa ttacttgtaa ttttaattgc 48780ttgtaattaa
aaaacagtaa ttaaaaggac caaaagttaa ttttgcaggt agtaaacctt 48840agcctactgt
acgtaaaaat cagaaggcca aggcaggagg attgcctgag tccaggagct 48900caagaccagc
ttgggcaaca tagtgagacc ccgtctctat aaaacaactt aacaatgtgc 48960caggtgtggt
ggcacaccct tgcaatccca actactcagg aggctgaggt ggacggattg 49020cttgagcctg
ggaggtcaag gctgcaatga tcgtgatcac atcactgcac tccagcctgg 49080gcaacagagc
aagaccctct gtaaataata acagtaataa tcagaaagca acactgtatg 49140tgaataaagt
caccttacct agctgccggt ggacagcctt tggtgtttca aggacttagt 49200ttttatgcgt
tcctgctttc gatatttgtt gactgtgagc tttgggctta tccagggtcc 49260atgaagaaga
aatggctctt ccttatctca ggcttttgtt attttggggt gttttaaatt 49320cctctcctgt
ggttactcaa tccgacccta cttgcctctc tcttccacaa atttctcaaa 49380tgtcattcat
ggtacccttt taactcttac ggacttactt gtttttaaca tatcttttat 49440tcactttgaa
tggagtttcg actggaaata gagggtagtg ttgtgtgtac cacctaccat 49500cttcagctag
acactttcaa agagtttgtt ttgttaatta tatcttctta catggctcat 49560tatgcttgtt
gaaatttgca agatgcttta aaccaatcct gtccaaacca cagcctgtgg 49620gccgcatgta
gcccaggata gctttgaatg cagcccaaca caaattcata aactttctta 49680aaacacgaaa
ttttttttgc aatttttttt ttccttagct catcagctat catagtgtta 49740gtgtatttta
tgtgtggcta tggcccaaga caattcttct tccagtgtgg cccagggaag 49800ccaaaagatt
acacattgct gaaaacttta gatgtatcag gaatcatata gcttttcaaa 49860ataaaatatc
cacacttttg ctaagttcta tgctatgcaa gtgactgaac tataccagga 49920cttatcttgt
aaacttggaa gtgaaatatt cttcactcgc aagggtattt tgtgattaaa 49980ttctggaaat
atgcatttat aagttgatac cagctcctgg aatgtaactt tttctttttt 50040ctttttttct
tttttagatt ttttgagatg aagtctcact ctgttgccca gggtggagtg 50100cagtggcaca
atcttggctc actgcagcct ctgggtcccg aggtcaagca attcgctggc 50160ctcagcctct
gagtaggtgg gactacaggc atctgccacc acagctggct aacttttttt 50220tgtattttta
gtagagacag ggtttcacct tgttggccag gctggtcttg aactcctgac 50280ctcaagtgat
ccacccacct cggcctccca aagtgctggg attacaggcg tgagccaccg 50340cacccggatg
ctttttctca tttctccatc gcttaaagat agaggattga gagaggcccg 50400tttgtggaca
cgatcttaag tacagatctc tagtggttaa aacgaaacac aacaattctc 50460tctcatcctc
tagactcttg gcttttgcct aaatatgatt cataaagctg aagtgttgct 50520aatctttaaa
taacacccaa gggatatgat tatatgatta cccaacaccc taatacagtt 50580taagaataga
gtgatccaaa gaaggagaaa atagtgtctt caaaaatgtt gctattagaa 50640atatttcttt
tctctaaggg aggatgcagg ctaaagtatt tagcatgtgt gtttactttt 50700ataaattttc
taagtgatga aggaaggaag gtagaatctt tttttttttt tttttttttt 50760tttttcagat
agagtttcac tcttgttgcc caggctggag tgagtggtgt gatctcggct 50820caccgcaatc
tccgcctccc aggttcaagc gattctctgc ttcagcctcc ctagtagctg 50880ggatcacagg
tgtgcaccac cacgcccagc taattttttt tttttttttt tttttttttt 50940tttttggtat
ttttagtaga gatggggttg caccatgttg gccaggctgg tctcaaactc 51000ctgacctcag
gtgatccacc ctactcggcc tcctagagtc ctgggattac aggcgtgagc 51060cactatgccc
tgccagaagt agaattctta tcaacatctt tctaggttga taatcaggta 51120atccctaaag
ttaataccct aaaatgcaat tttttagttt tgatggaaaa ttgattctat 51180cccttttttt
tttttttttt tttttttttt tttttttttt tttttgaggt ggagttttgc 51240tcttgttgcc
caggctggag agcaatggca cgatctcagc tcactgcaac ctcggcctcc 51300cggttcaagc
gattctcctg cctcagcctc ctgagtagct gggattacag gcatgtgcca 51360ccacgcctgg
ctaattttgt atttttagta gagatggggt ttctccatgt tggtcaggct 51420gttctcaaac
ccccgacctc agatgatcca cccggcttgg cctcccaaat tgctgggatt 51480acaggcgtga
gccaccatgc ctggccggtt ctataacttc tatgatagag aatttttaaa 51540ttatcaaaaa
agcctgatct ttcatgtaac actttgacca gtcttaccct gtgagtgttt 51600tcatctgcct
tctcacgctt tgcatccaca agtgaaagtg cttctatttt tctatgtatt 51660gtacagtaaa
gattttagag ctttctaggc caggcatggt ggctcgcacc tataacccca 51720gcactttggg
aggccgaggt gggcagatca ctgaggccag gagtttgaga ccagcctggc 51780caacatggtg
aaaccctgtg tctactaaaa atacaaaaat tagccgggag tgttggcagg 51840tgccgtaatc
ccagctgcta ataaggtgga ggttgcggtg agctgagatc gtgccattgc 51900actccagcct
gggtgacaga gagagactct atctcaaaaa taaaaaaaaa aaattttttt 51960agatccttct
aataagatga agcccactgt gtcaaaaaaa aatctaaagg ttaggatatg 52020aagtcattct
gacagattat gctgtggaag aatatatatc cctgttcctt tcttctgagc 52080cttactttca
tgttaatagt ttaaactaca tactatatgc caagaacttt atatacctta 52140tcatatttca
ttttcacaaa attttaggag gttgatatta ctcacattaa ttaacaggcc 52200caacatcaca
atttctagca agaggaagag gtatgctttt aatgcaagtc tgactctgaa 52260ttctgtggcc
taatctgcag tctatgttgc atccacatgt tttaaatgtt cgctatgaat 52320tctcttgagc
tattatttct atctctaatt atgccataat ttaaaaatca gaatacttgg 52380gctcgtgtga
agagatggtg tgcctagatg tttcttcctg gctcagtgtg agatatgatt 52440ttgaaagcac
atgatgtgct tcattacatt ttttaaatcc atttatccca ggttaatagc 52500ttagatggcc
tttttttttt tttttttttg agatggaatc ttgctatgtc gcccaggttg 52560cagtgcagtg
gcgtgatctt ggctcactgc aacctccacc ttccagcttc aagcgattct 52620cctgcctcag
cctcccaagt agctgggact acgggtacac gccaccatgt ccagctgatt 52680tttatatttt
tagtagagat ggggttttgc catgttggcc aggctggtct tgaactgacc 52740tcaggtgatc
cacctgcctc ggcctcccaa agtgctggga ttacaggtgt gagccactgc 52800acccggccag
atggccattt ctctaatgta aaatgtgaag caggcccaat ggagggcccc 52860aagcctgtgc
cagaggtgac cagtctgcag ccgtgccccg acatcctaac ttccttggtg 52920aaaaatgaga
ccactattca gtgtgctagc tgtttacaag gagcaattgg ttagaaatga 52980cagtgtactg
aaagtctttt ttttccctcc aattttttac agggaaggtt gggtgtggct 53040ctccaagaat
acattttggt ggactcattg aggaagatga tgtgattctc cttgctgcag 53100ctctgcgaat
tcaggtttgt tcaacatagc tgtctgagaa tcttgagttt ggaagtttga 53160ttaatagttg
ctaacatttt gcaagtggtt tctgtttacc aggtacagtg ttaaacactt 53220cacacacagt
gttaggccca tttgataagt gaagaaatgg tgattttgaa aaatgaagtg 53280agttgcctat
gatcacatgg cagtgtgggg cagggaccca tcctggggac tgtgtatgtt 53340atcttgtgct
gtgtaataaa ttatcccaaa atttagcaac tgaagcaaca aggcatttat 53400tatctcacag
tttcttcaag tcagagattt ggtagtagat tagcagatta ggtggtatgt 53460tagtccattc
ttgcattgct ataaagaaat accagctact caggaggcta aagcaggagg 53520atcgcttgaa
cccaggagac agaggttgtg gtgagctgag attgcaccac tgaactccaa 53580cctgggtgac
agagggagac tgaaaaaaaa aaaaaagaaa tacctgagac tggttaactt 53640agaaagaaaa
cagatttaat tggctcacgg ttcttttttt tccccttggg acagagtctc 53700actttgtcac
ccaggctgga gtgcagtggt gcgatcttgg ctcactgcaa cctccacctc 53760ctgggtttaa
gagattctca tgcctcagcc tcctgagtag ccgggactgc aggcacacac 53820caccacgcct
ggctaatttt ttgtattttt agtagagaca gggtttcacc atgtcggcca 53880ggctggtctc
aaactcctgg cctcatgtga tccacccgcc ttggcctccc aaagtgctgg 53940gattacaggc
atgagccacc gcgcctgacc tggctcacag ttctgcaggc cttacaggaa 54000gcataacaca
ggcatctgct tctggagagg cctcaccaag cttccaagaa tggcagaaag 54060tgaagggaaa
gcaggcatct cacaaggcgg gaacaggagc agagagtgag ggatggtgct 54120atatactttt
aaatgaccag atctcatgag aactcattca ctatcacaag gacgctaccg 54180agggagatga
tgctaaacca tcatgagaaa ctagcccctg tgatcccatc accctccacc 54240aggccccacc
tctaacactg gggattacat ttcaagatgg gatttgggca gggacacaca 54300tccaaactat
atcaaattgt ttaggctcag gatctctcat gcatttgaag tgaagatgtc 54360agctggggct
gcagtcatct gaggcattaa ctggggctgg aggatctgtt tccaagatgg 54420catactcaca
tggctgtggg catgtgtatt tcattcctca ctggccatca gcaggatgcc 54480ttggttcctc
atcatgtcca cctctctgta gggctgctca cagtgtggca gctaacttcc 54540cccagtgtga
gcaatccaag ggaggcaacc aggaggagtc tgcagtgtgg atgacctggg 54600ctctaacctc
atgtgccgcc acttcctttt atcctgtttg ttagaagtag atcactaagt 54660ccagttcaca
ttcaagagaa ggagaattgt cctcacttct tcaagacagg agtattaaag 54720tatttggagg
cattttttca aaccaccacc acctgtgtga cctaaaggcg aaggtcatca 54780gtgttccacc
atataagctg agtgttctca tggtgctgga gcctgcagca gtggtggcct 54840ctggcagtga
catttagttc atttctttct tatgtacagg gtaatacaca attgaaactc 54900tttacctcaa
aatgggtcat tagacatttt tgtaaggatt ataagttaca tttccatcaa 54960agccaaattt
tatttttatt tatttttatt tcatttactt ataatttata tgcatacata 55020acactactca
gtagttcaat ttgcttgctc ttttatacca actatggaag gatggtattt 55080ggaagctggg
catccaagct gatcattcac gcagtcctat attttcttca tttttctttt 55140ctaaattgga
tgacgaacct attcagtata agtttatcca tgtacatgat gtcttttgtg 55200aacacttaat
ctatttattt atttatttat ttatttgaga cggggtctca ctgtgtcacc 55260caggctggaa
tgcagtggca cgatctcagc tcactgcagc ctccacctcc caggttcaag 55320tgattctcct
gcctcagcct cctgggtagc tgggattaca ggcacatgcc accacaccca 55380gataattttt
tttttttttt tttttttttt ttttgagaca gagtttagct cttgttgccc 55440aggctagagt
gtaatgtcgc aaactcggct cactgcaacc tccatctccc gggctcaagt 55500gattctcccg
cctcagcctc ctgagtagtt gggattacag gcacatgcca ccaccacgcc 55560cggctaattt
tttgtatttt ttgagatgga gttttgctct tgttgcccag gctggggtgc 55620aatggcacga
tctcagctca ctgcaacctc cacctcccgg gttcaagtga ttctcctgcc 55680tcagcctccc
gagtagctgg gattacaggc gtgtgccacc acacccagct aattttgtat 55740ttttagtaga
gacggggttt ctccatgttg gtcaggctgg tctcgaactc ccgacctcag 55800ttgatcccgc
ctacctcagc ctcccaaaga ggattacagg cgtgagccac tgcacctggc 55860ctaatttttg
tatttttagt agagacaggg tttcaccatg ttgcccaggc tggtcttgaa 55920ttcctgacct
caagtgatct gcccacctcg gcctcccaaa gtgctgggat tacaggcatg 55980agccacctca
cctggcctat ttattcattt atttttaaag agacagaatc tcagctagga 56040tgaagtacag
tggtgtgatc ttgactcact gcagcctcaa acttctgggc tcaaacaatc 56100ctcccacctc
agcctcccga gtagctggga ttataggtgc gcatcaccac acctggattt 56160tttaatagtg
tattttctca cattttcctg tttttctttt taaaatttaa caaaaacaat 56220gaacatagtc
agaagttctg gcctgtgact tgaaataacc ctggatgtac atttttaaaa 56280acattctcag
attcattgct gtaagtatga agagctaata gtctgcaagg gaactataga 56340attaagctgg
atcatcaagt tcatgcatgc atatttaaga aacagagcag cttcttttaa 56400tactatcaaa
ctcttcagtg cgatattgaa tagttgtata tctaactagt ggttttcacg 56460ttttcctttt
taagaaaaac ctacaaacct agttacacgt gaataaaaat attcctaata 56520tcctcctggc
ttattgtgcg actcgataat cttgttttgt aacatccaca aacaatattt 56580ccatggttta
agtgtgtgcc ctgtcagctt ttaccattct aacatgtttt cccctctatc 56640cctgctgaag
aacatggtca gtagtggaag gagatggctt cgtgaacagc agcacaggcg 56700gtggagactt
cactgcttga aacttcagcc tctctcccct gtgccaaggt aacactggtg 56760ttttatttac
actgatgtca gctcagcaca gtgccttagc cctagttagt catgggggag 56820aagcacaact
catggtactg tggttttgat ggtagataag tttctctatt cattaaattg 56880ccaccctatt
ttctctattg taaattctcc aaaattaaat cataatgaat gtattactgt 56940gttctgtcaa
ttctaagaag gcatattttt ttatttcaat gatgttttct atattaaatg 57000gcagtttttt
aatgatgttt taaaattaat ttatattaat ttaatttcaa tgatgtttcc 57060tatatttaat
ggcagtgttt tcccccccca cccttttttt tttttttgag atggagtctt 57120gctctgttgc
ccaggctaga gtgcaatggt gcgatctcgg ctcactgcaa cctctacctc 57180ccaggttcaa
gcgattctcc tgcctcagcc tcccgggtag ctgagattac aggtgcacgc 57240caccacaccc
agctaatttt tgtattttta gtagagatgg gggtttcacc attttggcca 57300gccggtctca
aactcctgac ctcaggtgat ctgcccacct tggcctccca aagtgctggg 57360attacaggcg
tgagcgacca cgcctggcct ggcagtggtt tttcttctta gtggtccatg 57420aagtaatggt
tttaaatttg acaaaatatg gtaatgtaat atttggaata tgcttttcaa 57480tgtcagaaat
gcccaacacg tatatatccg cctggtgtta agggccatat aggtttctac 57540tctggggata
aaggaccgtc caaaccaaac attattccac gtgggaactg agctcagctt 57600tcttcagaac
gatctgatag aatagtctgt gctcgtcaag gtggatattt tcctcagagt 57660gctctctgat
gaggtctgag ctacatattc tatctataaa tttggaaata taaactgtag 57720gtcatagtct
cagcttgtgg tttagaatga ttgggtaggg atgatgtctg aatcagacag 57780ctgcattttc
cagcctgatg atgagtcatc tgtgtctgtc ccactccaag gaattggccc 57840cacccccatt
atccaggtgt gcataatact aggtgtgcat ccccaggtaa tctcatccag 57900caacgttgga
atgcacacag tgccgaggag ataaaatccc tctagtaact gacactttat 57960ttatttattt
atttatttat ttattttttc tttttttttt tttttttttt gagacagagt 58020cttgctctgg
tcttgaactc ttgacctcag gtgttccacc tgcctcggcc taccaaagtg 58080ctgggattac
aagcatgagc caccgcaccg gggctgactc attttttacc tgataaccag 58140ttgctttata
gaattgaact ccatttcctt ggacagtcag atgaacaccc tggttcagtt 58200ctacccacta
ggagggtgtg accctcctgg acatgttttc atccctgcaa tagacgaaac 58260atctcagtta
tgctctctgt tcattaatta tataagttgc aatgaggaat gttattcttt 58320gtaatatcat
ccctgggctt gtgcaatctg tttcataaaa tatcttattt tagagttatt 58380tttgttgtta
taagacctta gattttaaaa gtaaatcaat aatttctact tacaaagtgc 58440taaaaaagat
taagatagat tccaccaaat tgtgtaccaa agtttcaaag taaattcttc 58500ttgttaaata
attaaatcct tttctccccc aagaagaatt tactttgaaa tcattgattt 58560ctagacccac
tgggcaaaga gaagcaggct gagtagttta taaatcaccc caaaaatcgg 58620atgatgtctc
tgtgactagc tagtaaagag aaaaccttac aggtgtttgt tttgttttgt 58680ttttgagaca
gagccttgct ctgttgccca ggctggagtg cagtggcttc atctctgctc 58740attgtaacct
tcacttcctg agttcaagcg aggacatctg gctaattttt gtatttttag 58800tagagacagg
gtttcaccat gtttgccagg ctggtcttga actcctgacc tcaagtgatc 58860tgccctcctg
tgctcccaaa gtgctgggct tacaggcatg agccaccatg ccaggcctgt 58920tttgtttgct
tgtttgtttt ttgagacagt cttattctgt tgcccaggct ggagtgcagt 58980ggtgcaatct
cagctcacct caacctccgc ctcccaggtt caagtgattg tcctgcctcg 59040gcctcccgag
tagctgggac tacaggtgcc accaccacac ctggctaatt tgtttgtatt 59100tttagtagag
acgggatttt gctatgttgg ccaggctggt ctcaaactca tggactcaag 59160tgatcctccc
gccttggcct cccaaagtgc tgggattaca ggcgtgagcc tctgcagcct 59220gccaaaagct
tacacattgt tcaagattaa gctcttcaat ttaggagatt gtgggaagag 59280agagacagaa
aaaaactatg gccaggaagg caaaacaaag taggaggcta gaaaggaaag 59340gtcggatagg
tggaggcctg gaaaggaata ctcagcggta cttgaggggg gtaacctgaa 59400ctcctagttt
ctctggtgcc ctttggagaa gtctgggtgg aggtgtgttt gctgttgcct 59460cttacccccg
ggtcaggccc ctacaggatc ctggaggaga gtctagggtg gtctgaacac 59520tgagggtaga
ttacctgcag tagctacaga aacagtcagg gccaaggggc tgactaaagt 59580ccactgaggg
ggttgatctg cccaccgcaa ggaggcagac cacaacaaaa ataagtgttc 59640tggggtgagt
gaaggaatag ctgtcctctc tgatgcaacc tttaagggta gtcccccgcc 59700gcgtggtctc
ccggggttag tcttgctgct gggggaacag gtgtgccttc catctctcac 59760tctcagacac
atgaagggcc tgactgccca gcaaagaaat gactgagcac atgagcccca 59820gaggaaagag
tccagcatta cacatgctgg tatccagacc actcctgtaa gacgagcagt 59880ggtggccagt
aaaatagaag agttcagtta cgtgcccaat gactctctgg agttcctaca 59940atttcctgtc
tgaacttggg ctcagggcac taggtccttt taccgacagt tatgtaatta 60000ataaaacatt
ttgaaatatg attttagact gtaaatagag agggggaagt gtggacacag 60060ataaaacgaa
atagcatttc cacatttttt cttttttttt gtacttaatc tctaatgtga 60120tagtaatgtt
cacatttttg aatagccaag agaaacatgt ggccagcccg atcggagatt 60180attatgcaat
gaccctctat tcatcgctaa tgaataccat gatggcccat acagagcatt 60240ttgctctgta
gtcgaatcac acttgcagga gggttttgga cacattataa tatgctgttc 60300ttgtctctat
gccataggtg ttatacttcc acctggacac aattctagga tctttgtgag 60360ttttttggag
ttgcatttct aaaatgatac accattttct catacacgtt aattattagc 60420agttgtgctt
attactgcct gggagaaagg accccaaagt aattgcattg atttcatcgt 60480tggcgtgatg
tgtggctgtt ttcactccag ttgtgaccac ctaagctgag aagccttttc 60540tctctttcgt
attgtctttc ccctcagtga cattgagatt tcaagaggac aaactccaaa 60600agctgtggat
gtccttgcca aggagattgg attgcttgca gatgaaattg aaatctatgg 60660caaaagcaaa
gccaaagtac gtttgtccgt gctagaaagg ttaaaggatc aagcagatgg 60720aaaatacgtc
ttagttgctg ggtaagacac catctaacat ctacttgatc aagcagacat 60780atttacaaaa
ctcttcccta tttatctctc tcctcgtacc cctcaatcca tcctattctc 60840acatttgaca
tttcgtccat ttcatttggc atttctttat ttggtgtgag ataagttttt 60900attttcagtg
cagagcaaac tagtcgactc tacacaagat aaagtcaaac aagttgcagt 60960gctttatctt
ttgtttctca tattatgttg ttattacccc tgctcctcct tgactttttg 61020aatttcattt
ttcattataa aaatcaaaat aggaattgaa agtaaactat tttgaactga 61080gcagtttgtt
ttgtgcgttc tacttgaaat attttcgtgg aaaaggtaca gataatcata 61140tgaaacatgt
tttcctgtta atatccttga tcttacctac atattcacat ttcaaaagct 61200cattatttcc
ctttttctgc atttcaaaac taatgttggg ccattttcat ttgtgttggt 61260tattttataa
aagacaaatt catagttttt gtgaaaagga gtgtttgcct ttttcccctt 61320ctttttaaaa
aaagaagtac attcactagc tgaaaagatg atagaaagca agggatttca 61380aatatttatt
cagtgtgacc tgagggaaca tcatgagacc atataaattc ctttgctgcc 61440ttgaaatttc
atttcagtga ctattgctta tgatgtatac aaagcttcag gcacaaaacc 61500tatttgtcag
gcagagacat ttcttgtaat tggaattgac tcctaggtaa ctttgttttt 61560acgtatttaa
ttattaatgt aagtgctttt ttctttattt ctttttctgg ttataaaaag 61620aatgtagttt
cattgtagaa atattctaaa aattcagaaa acagaagtga aaaagtaaaa 61680catcccaaac
ccaagcactc agaggaaaca actgttgacc tgcgaggact gttttgacgc 61740tgatagagtc
tctcagccat gaggtcacac tcacatgcta ttgtgtaagc aacctgatta 61800ttcaagcata
tgtggcatac tgagctccat gtgagtgaat attgagctat gtcaccattt 61860ttcatgacaa
cataacattc catcactcac atcccagatt cttttttttt tttttttttt 61920gagacagagt
cttgctctgt tgccaagact ggggtgcagt ggtgtgatct cagctcactg 61980caaactctgc
ctcccgggtt caagcgattc tcctgcctca gcctcccgag tagctgggat 62040tacaggcatg
cgccaccatg cccagctaat ttttgtattt ttagtcgaga tggggtttca 62100ccatgttggc
caggcttttc tcgaactcct gacctcaagt gatccaccca ccttggcatc 62160ccaaagtgct
gggattacag gtgtgagcca ccgcacctgg ccacatccca gattcttaaa 62220catgttgggg
aagcaggggc tgggggagta gcaaaatgcc caggatgagg gtggagatcc 62280catcttacac
atcttctggc acctctagtt ccatcaataa gtgctaccct ggtagttctg 62340agtgtttcta
attgtccaag tgcactgatg cagaaaaata atttgaacat agaataggga 62400tgttccacaa
cttacttgac caattttcta ttgatgggca atcaaagtgt tcccaatttt 62460tgccatttta
aactgctaga attaacaacc ttgggcatgt ggatttgtgc atctgtttaa 62520ttatctcctt
aggaaaaatc cccggaagtg agttcaatta tttataacca caaataaagc 62580aaaaatatta
tccaaaacag gccaagcgcg gtggctcaca cctgtaatcc cagctctttg 62640ggaggccgag
gcaggtggat cacaaggtca ggagatcgag accatcctgg ctaacatggt 62700gaaaccccgt
ctctactaaa aataccaaaa aattagccag gcgtggtggc aggcacctgt 62760agtcccagct
actcaggagg ctgaggcagg agaatggcgt gaacctggga ggcggagctt 62820gcggtgagcc
aagatcacgc cactgcactc cagcctaggt gacagagcaa gactgtctca 62880aaaaaaaaaa
aaaaaaaaaa aaatcacctc tttaatgata tagtgatttc ttcaaatcct 62940ttttccctcc
tcctgtccca ctttcctgtc ctcctctttc tccttctctt tgtgtagctt 63000ttcttttttt
tttttgagat gaagtcaccc aggctggagt gcactggtgc gattttggct 63060cgctgcaacc
tctgccccct gggttcaagc aattcttgtg cctcagcttc ccaagtagct 63120gggattccag
gcatgcaccc ccacgcctgg ctaactgttt ttgtattttt atagagacgg 63180ggtttcgcca
cattagccag gctggccttg aactcctggc ctcaagtggt ccgcccgcct 63240cctccttcca
aagtgctagg attacaggtg tgagccatca tgcccagctt ctgtagcttt 63300tttgtaagag
ggacagacag ccagggcctc tctgaatccc tcagcagttg gcattgtaag 63360gaattcatcc
tttgctccac accttggttg cctgtggctt cacccttctc ccgcaggctt 63420ccaggctcac
tgcagttaga ttcctgagcc cctttccaac ctagaacccc aaacttaatg 63480tccccataat
cgagtgtggg aaagttcctg gctgagggga ggggttggct ccacctgcag 63540gcaccctctg
ggcccacacc ctgcaaggag ctgcctctgg ttagattggc tccggggtct 63600ctgagtttag
ggcacagttt agtcagggtt gttttatggc agtatgcttg atgcatattt 63660gtctgcaatt
aataaacaaa ctcttggcca ttaagacctt gaaaaagcat ttgatttgtg 63720aaattggttc
ctttcttcca ggcttcagat gatcttggta ggtgggtttg caggctttct 63780caccctgttc
agataggaac acttaaggaa aacttggtag gatgtgttgt ctgtttatga 63840aggaagaatc
ctgcagcaga ttttcttatc cagtccttgt gctttactcc attttgggct 63900tgtcaataaa
atcagaaatc cagatatgga tggctctgac cttagcttgg aacacaaaac 63960tgatacttct
aaaacaaagc tgtgatgata ctgttttcag gttccttact tgtaaacaaa 64020gctgggcctc
cagcctcatg ctggagaatg ctagctagca cgactgcctg gggttacagc 64080caatgcagtc
catggccagt agaacatttc tccacgctgg ctttttgcta gcattaatat 64140gtataatcaa
gctgaaattg gcctgccatg ttgtaaatag aaaagaagac taaagatgaa 64200ttttcaagaa
cagattaaac accaagggat caacaattct ttttctgaaa ttctaagttt 64260ctttacctaa
caaagacctt tgaaggaaca aaaattaaat ggcctgtctc ttttttaact 64320gtagtggaaa
aaaataaccg tgctggtaag attatgatat atgtatgcat ttgtggcaca 64380gcttgaagat
ttttaggaaa tagccacact ggattctata agggaagaat gaacagcccc 64440ttctccccaa
cacaacaaat gatatatttt ttagggctgt aaactgtcat cagtatctaa 64500tcccttaata
gctgaatatt gatatcagtt tagatcacag gcaactattt tatcgatagc 64560ctaaagtggg
cctggcatga cgggttgtgc ctataaccca attactctgg agactgtggg 64620tggatcactt
gaggccagga gttcaagacc agcctgagct gcatagcaag accccatctt 64680taaaaaatac
aaacaaacag caaaaagcag cactggttta aagcaacatt tgtaatctaa 64740agtgaaaagt
caatttgaaa taaaaatcat cttcatattt ttacatttga ccaggtcaag 64800cgttaggggg
aaatgactgt tatcaaacca ttatatgcac tgttgcaact tctttaaaaa 64860tataaatgac
ccctccatta aatggagaat tatttccttt ttctaaaagt cttttctaat 64920tctgaaaata
atattagctt attgtaaaaa atttaacata aagtaaaaag gagataacta 64980ggtctctcat
aactcaatat tttatttata tattcatctt tttacacatt gagatcatgc 65040tataaatatc
atttttgtcc tgcctttttc tctgaacact gtaacataaa tctttttcca 65100tgctttagaa
gaattcaaat acccaagtaa tttttaaata atactaaata acaatatatt 65160tgttattaca
tattttaatt tatatagaaa tgtgatttaa atatatcagc aatgggatta 65220aaatataaaa
caatgagatt tttttaaatc ttctattatt aataactcaa atgcagaaca 65280gattaattct
tttaaaaagc cattcttcaa actttaggat ttcttattca gtcagaatca 65340tctgcatttc
atcaactaca tatcgcatgg tatttacacc tacacatcac tttatttgct 65400ttgcctgaca
aaatgaaaag tattgggaat tgttcaggga aatatgccaa aagaatctta 65460gggactcatg
agttttacat attttcccag ttacactaat gaaaaggaat tttgtttgag 65520aacttgactt
ggagtgcagc tgcagggcat ttgaatctca gtctccatca ggcctgcacc 65580catttctgtg
cagaagtttc tagaatttcc ctgaattggt tggacagagc agctaaaaaa 65640gcagcccaaa
ctgatctggg aatagactga atttcacatt aaagaaaaat ttgaatcatt 65700cacatcattt
catctttttt tttttttttt tttttttctg cagagggcca aatacccgat 65760tggatttttt
aaaaatgttt caacccttta tttcggtaca atgttaaaat aatctgactt 65820ttctatgatt
tggcttttct gccttgagta actatttaaa tatctgtgtg attttctttt 65880atttgtgcta
cttctagaac aaaacagagg tatttagaag aaaccacttc ccacagggcc 65940tttgaaccgt
ttacctaagt caagtgtaat gagaaacata accaaatgca ccatggggtt 66000tattgttaga
taataaaagg cttaaaaagc ccctagaccc taaagatgcc tgggatggat 66060gatttatgtt
catatgctac ttgagcatgt agttttggag tgcatggtga ggcccatggc 66120taatggcagt
ggtgctattc aacaagattc tgggtcttta gaatagttca aagttttacc 66180acttctctct
ggataagcca tctttttgac ctttgagtaa attataaact tcttttcaga 66240ttgtgtccaa
atggtcacag gtactttgac agtttttact gtaactgcca ataagtaata 66300ctcatcttta
aaaagacatc attctggcag ggcatggtgg ctcacgcctg taatgccagc 66360cctttgggag
gccgaggcgg gcggatcatt tgaggtcagg agttcgagac cagcctggcc 66420aacatggtga
aaccctgttt ctattaaaaa tacaaaaatt agctgggcat ggtgatgggt 66480gcctgtaatc
ccagctactc aagagggtga ggcaagagaa ttgcttgagc ccgggaggtg 66540gaggttgcaa
tgagcagaga tcacaccatt gcagtccagc ctgggcaaca gagtgagact 66600ccatctcaaa
aaaaaaaaaa aaaaaaaaaa agcatcattc cattgtcaac ccttttttgc 66660tgttctgtgc
cctgaacagg tgatgtgata gcggtgcaaa gagggtgagc attgtccttt 66720acctcaacaa
gtgtgtggcc tggtaggcga gacaggcatg tcagaatgct tatcatgaag 66780ccatctgaaa
aattctagaa taatcatata atcaatgctc tagatatcag agagggggta 66840cagaaatctg
ccgggagaag tggaggaatg taatgaacaa aaatcacagt tccatctcta 66900tgaactctaa
gctcccatag aaaaagaaat tagagggctg ggcgtggtgg ctcacgcctg 66960taatcccagc
actttgggag gctgaggtgg gcggatcaca aggtcaggag tttgagacca 67020gcctggccaa
caaggtgaaa ccccgtctct actaaaaata caaaaattag ccaggtgtga 67080tgacgtgcct
gtagtcccag ctactcagga ggccgaggca ggagaattgc ttgaacctgg 67140gaggcagaga
ttgcagtgag ctgagattgt gccactgcac tccagcctgg gcaacaagag 67200cgaaactcca
tgtgtctcca aaaaaaaaag agaaaggaag aaagggagga agagagggag 67260ggaggaagga
aggaaggaag gaaggaaggg agagagagta attagattac cttaaggaaa 67320gaaggaagga
aggaaggaag gaaaagagag aaagaaagag aaattagatt accttggtgc 67380tcattctttc
gtatatcccc gttgatgatt tggaatctga gctaaagata ctggggggaa 67440aactaaacca
aattttgata ccaaatacag gaaggaggtg aaataagctt ttttattgta 67500gctctccttt
tattgctcac aataaatact tcctactttg tgattttccc gaactgtatt 67560agttggcaga
aagctgagca gtcctgtgtt tctaattaag aattggatgg ttagctggag 67620ttttgaggag
gtcatctctg ggcgtgtctg cttttggctt tcttgggcag ctatgctagc 67680atatccttgg
gaatgaaaat gtaatccttc atgtgacaca gtttaggtaa accacttagt 67740cctttgcaaa
caccaaatgc tcagtgcggt acttaaatga caccatattt gatgccacag 67800tttctttaca
ggctcttcta agattgtaat gcctgtgggc ttttaactgc ccttattgat 67860gttgtatcca
agaaaacaga aagttgactc caagcagaag tattaatatc tttgtataac 67920tttcagtctt
cacgttgacc ccactgcaga tctgctgttt agacagatgc agaggtgact 67980gataagccgg
ggtcatgtgg cttcaggagt cttaggtttc tccatgcagc ggtgtcagga 68040taggctcacc
agctcctggc cacagtgggt accagcgacc tccttgttgc tgaatccacc 68100gaccactctc
ggtcctcatc ctgcttcctg cctcagtggc gtttgacttt cctgttctgg 68160aaacagtctc
atccttggcc tccttctgag acttcctgtc atgttctgcc ttcttctcca 68220gggtcctctt
attttttgtg tttttagaga cagggtctca ctatattgcc caggctggtc 68280tcaaactccc
tagctcaagc agtcctttta ccttggcctc ccaaagtgct aggattacaa 68340gcatgaacca
ctgcacacag ccgggttctc ttattttata ctacgccttc ttagtttcct 68400tttggactct
cttcttctga caactcatta caagcaaatg tgcttaactc ttctgtcctt 68460atcccttttc
tcactcggtg cactgtcctt gggtgatctt acccattcca tgctttaaat 68520ggctatagtt
atgatttcca tccatggaga acctggttaa gtcagatggc tggctttgcc 68580tcccagctct
actgcttact agctgtgtga ctgtcggcaa gttgcttaca ctctgtgcct 68640cagtttcctc
atctatcaag tgagcgtaca aataattaac cccataaata tgaagcatag 68700cacctctcct
attaactgcc ccaatccact ctcagtttta tctagttaac atttattaat 68760ttgccgggtc
tcacctaaga tgttacttcc tacaggaaac attcccagat gttcaaatct 68820agttagatcc
tttttccatg taccctcata attcagtctg actgccctat ttccctgtct 68880tagtgtttgt
cacatttatc gtaattgctg atatattcag atcagctgga agctcgtgag 68940gatagagatg
aagtctgact ggatcatcat tttgtcccca gcgtctagct cagtgcctgg 69000tacatagcag
tacttgacaa atatttgctg aatgaatgac tcatggaatt gtagggtatg 69060atgacgcctc
taaagacatt ctttctttgt ggttacatcc ttaggttttg tagatcagaa 69120aactctatta
gattggttgt ggttgagtat ccttgacaac atggctgaca gtaactcatg 69180gaagaccaaa
ctgtaaaccc taatgacagc ctttaaattt tttaacgatg cttgagagct 69240aagctggaag
gacgtggggt gcttatccct cgtagaaatg cctgccaatg ctttctcatt 69300tggacccaaa
ctccagatcg ggagcagtct tatagctgga tcagctacca agagaaattc 69360taaagcaaga
agagaaaagc atttcaattt gggacattta tttgcacctg gaaatgggga 69420atgggctatc
agaccagact tctatcctgt ccaacctgcc ttcattcagt ccttccacat 69480tgttattctg
ggtttggact gtgctggaaa gacaactgtc ttatacaggc tgcagttcaa 69540tgaatttgta
aataccatac ctaccaaagg atttaacact gagaaaatta aggtaagctt 69600gggaaattct
aaaacagtca cttttcactt ctgggatgta ggtggtcagg agaaattaag 69660gccactgtgg
aagtcatata ccagatgcac agatggcatt gtatttgttg tggactctgt 69720tgatgtcgaa
aggatggaag aagccaaaac tgaacttcac aaaataacta ggatatcaga 69780aaatcaagga
gtccctgtac ttatagttgc taacaaacaa gacttgagga actcattgtc 69840tctttcagaa
attgagaaat tgttagcaat gggtgaactg agctcatcag ctccttggca 69900tttgtagcct
acctgtgcaa tcataggaga tggcctaaag gaaggacttg agaaactaca 69960tgatatgatc
attaaaagaa aaattttgcg gcaacagaaa aagaaaagat gaatatcaat 70020atctattata
tctgtgtgga gtaggttttc tctggtctga ttttgacaaa tagaagagtg 70080tctacagcgt
ggtttgcctg tctgccctcc tggatgctat taaagctttg ttttgttgaa 70140caatcagatg
cccaactctg ttgccttgtg gaagatgagt aaatgcagtg cttcttaaag 70200tggtttcttc
tccctacccc acacatcttt tggtactacc atttggggaa gccaagcaag 70260gatagtaaat
tgatcagaac acagttgtgg gaatttggcc tgaagttagt gaaataaaac 70320tttaaagagt
ggaaaaaaaa aatttttaac aatgcttgtg atgtactaaa gaacaatttt 70380atgaggcctt
acttggtaat gtaataggct ctcttcaata cctaatactg tgatgttatg 70440aacatgggaa
catttggttt tgcaaagtgt tgtcatcaac aataagattc tcaagagcag 70500aagtttccgc
catagtatgc acagggcctg tggacttttt gaatatttga aggactgtca 70560tatggaaagg
gatcaggctt agtgtgtagg actccaagag ctaggacaca acatgaatgc 70620agatgccttc
tctctagaac aaagaacttt tgaactgtaa cagctgtctg taaatgagtg 70680ggccacccca
gaggtcatga ccttatcacc atctgtgttc aagcatggca gggagaagcc 70740ctgggcaggg
atatctggaa ggagaaggac aagtggggac tttagggtcc cttctaatcc 70800tgagagtctg
gcagtgccct actcacccaa ggaccgttga gctaggccct gtccccaaac 70860cataggccct
gtgcccacac cctgccccaa taaatggctc aggaagaaca actgctatat 70920actcatcagc
aagaaatgcc ccaccatcag ctgaattatt taaaccattt tgatgagtgg 70980ttttatggcc
tttcttgaaa ctgatgttgt tattagcact gcctaatacc ttttcctttt 71040ttggtttctt
ggtaaaggtg tgtctgtatc ttgactttag cttatgcctg tcctgtgtga 71100tgtgagtaga
ccataatagc ccgagaaaat taaattatgg agtgcatttt gaaatgagca 71160ccctcagatt
tgattctggt gtcttcggct ggttttgcat aggttgtctg ggaaatatta 71220taaaattcac
tttctccccc cgaactcagg atcacaccca cccctcttgg agaagggaag 71280agcacagtca
ccatcgggct tgtgcaggct ctgaccgcac acctgaatgt caactccttt 71340gcctgcttga
ggcagccttc ccaaggaccg acgtttggag tgaaaggtac tgtctttaac 71400aactagtgta
cttttcaggg caaagttggg gggagagaat tgtaaaaaga acattgtcaa 71460cagataaaga
gttaagactt ttcaggggac ttggtacatt tcttcactgc acactcagta 71520catagtggtc
gccccatgtt ggttcccccg ggcatgaaaa agagcagtgg gagccgccag 71580ggcttcactg
gatgcctcag gagcctgttc ctggggctgc gttgctcaag gctcccgcct 71640cgagtgcggt
ctcccatttc atcagggcct tctttgccaa gtgatcttga aacacggcca 71700tccatatcgg
tgtttgatca tcccagtccc ggggaaaggt ctttccctct gctttagtct 71760cggctcattg
gagttgctgt gtggggcccg cagagctctc tggagacgga ctggctgtgg 71820acagcgagtc
gtgatgaaat tcctccttgg ccgtggtcgg gctgatatct ggcacttccc 71880cattcattac
tctttcttcc ctgatttatc atgaaatcat tccttagctc actttacagt 71940ggtggttatt
gctgcaatta cagtttgtcc ttggcacttc gtgatttttt aagaatattg 72000catttgtatt
actgttaaac atattaataa cgtatctcat aaagtactga taacgcacag 72060agtgtcatgc
atcgtgtagt ggaatgaaca ctagggttgg agcgagaaag ccaaggtttt 72120ggtcacagct
gtgtttccta ccgccatgtg acccttggcc acttgagcca gtcacaacct 72180ctcggagccg
catctctaaa agtccaaaat gctgagagcg gcaccaggct cacccagttg 72240ttgtgaggat
caaatgagaa agcacgtaaa gaacgagggc tactgtcaaa tgatacacac 72300atgggcggtg
ttgttctgct ttaggaaaga gcagtccttt tactctaaca accaacattt 72360tgagggatac
gcagcatgac aaatctcaca tactattatc tgaaataaat tgaagtttat 72420aatttactca
tttgaggtgg ttatgcggtg gtttttttgt ttttgttttt gtttttgttt 72480tgagacagag
tctcacttgg ttgcccaggc tggaatgcag tggtgcaatc tcggctcact 72540ccaatctccg
cctcccagtt tcaagtgatt ctcctgcctc agcctcccaa gtagctggga 72600ttacaggcac
ccgccaccat gcctagctaa tttttatatt tttaatagag atagggtttc 72660atcatgttgg
ccaggccggt ctcgaactcc tgacctcaag tgatccgcct gcctcagcct 72720cccaaagtgc
tgggtttaca ggcatgagcc acctcgccca gccacagtgt atttttttta 72780aacaaagttt
gctcagatat gaaatcatat acaattgatt tagataaagt atagtcacag 72840agatctaata
ataaattatc aaaatattct catatcctgg tatgttttac agtaatgaca 72900tctgctctat
ttttaccttt tttttttaag caaagaaaca gaggtatagt tagacaaagt 72960gtcaactttt
tttgttgttg ttctgcttgg atataaccat tagtgtctct cactgccttt 73020tgttacttca
tcttgggtgt gacgcctctc tgttgggttt ggggagctgt gtcccttggc 73080ctgtgtccat
cgtggcagct gtacttggtg acccgtttga aatgcctctt gctctgttgt 73140ttaggaggag
ccgcgggtgg tggatatgcc caggtcatcc ccatggagga ggtaagacct 73200tgaagagatg
cgggtcagct atcactgtgt ttccttcctg acatcttgtc tctgctccca 73260tccacacagg
cccacagacc ctcatccata atcccaaagt cattaaggct ctgaaacccc 73320aaaatgtttg
cataactcat gcagtgacaa accctagtat ttactgtgat gagctataat 73380tatatggatt
ttgtacgtgc tacttggcgt gaatattcat atacattgct gtggaaaaat 73440tatttttgat
taacaggtgc tgacctagcc cctgcttggg gtgttttaca aataaactac 73500atgtaaatca
cactgccttc ctaaacttca aaaaattcag aatcttgaaa ccctgctggc 73560cttgcgggtc
tggatgagag aatgtgggtc tgtgcctccc ttgatcatct gctgtactgc 73620caagttcacg
aagacttgac gcttcccctc tgtcaattca ggctctagtg cgtgcctcgg 73680cagcatagat
actaaaactg gaatgacaca gagaggatta gcatggtccc tgcacgagga 73740tgacacacaa
atttgtgaag tgtcccatta aaaataaaag accccaaaat aaaagagttc 73800cagaggaaaa
tgggcaaaaa gaactgacgt tgttaattct cacttctgta agaactccac 73860ctttgttcca
cataacctct ttgttgcctg taactaacca gtgaccagct cttttcagtt 73920tcactctctg
catcagcctt atcattcagg gcctggacaa tcttccaaaa cccaaatctg 73980agcaggtcac
tttcctgttc tgtcatttcc gccgttgcct gtcaccctcc agcacagcac 74040aggaggtcct
cgtgggtggc tgctcctgct tccctgtcca caagtcccca cctgccttct 74100ctctactgcg
tcaagctcct tgcagactct tttttttttt ttttttttga gacagagttt 74160tgctttgttg
ctcaggctgg agtgcagtgg cacagtctcg gctcactgca acctctgcct 74220cccaggttca
agcaattctc atgcctcagc ctcctgagta gctgggatta caggcaactg 74280ccattgtgcc
tgactgattt tttgtatttt agtagagacg gggtttcacc atgttggcca 74340ggcttgtctg
gaactcgtga gctcagacaa tccactggtc tcagcctccc aaagtgttag 74400gattacaggc
gtgagccact gtgcccagcc cttgcagact cttttcctgt tctctcttgg 74460ttccaggatt
ttcccatgcc tggaacattc ttccttttct gtcctcttta cccagtttag 74520tcaaggtttt
ttgtcaaggt tcttctctac cccacagact ccctctgtgt ttacggctat 74580ctggttcagt
tggctgtggc gagttgcggg cctttcttga ggtctcaggt cccagcattg 74640agcagagtgc
ccagcacata gtaggaactc actagaaagt acttgttgac tttgttaaag 74700gaatagaagt
gtttacccat ggaggtctga gtgtcaggga agctaaagag aaaatctgta 74760ggaacttcaa
gtactgggag tgggggtgag aaagagtagt ccaaggcact gctgaaaaaa 74820atcgtcccag
gcctcaggtc agaatgggga tgggtaccat tcatgagcca gcaagaccca 74880gaaccaggag
gccaaattag aggtggtgta caaggggcat acacagggga atggctcggg 74940tttagccagc
aaattggaag gacattggtg tcccgctggt ttagggcatc agctgcgtgg 75000cttggaggag
agagctactt ttattttatt ttattttatt ttttgagatg gagtctcgct 75060ctgtcaccca
ggctggagtg caatggtgca atctcggctc gctgcaacct ctgcctccca 75120ggttcaagtg
attctcctgc ctcagcctcc cgagaagctg ggattacagg cacctgccac 75180catgcccagc
taatttttat attttaatac agatggggtt tcaccatgtt ggtcaggccg 75240gtcttgaact
cctgacctca ggtgacccac ccacctcggt cttcaaaagt gctaggatta 75300caggcgtgag
ccaccgcgcc cagccaagaa gactacctta tgagggtaga gtgggccaga 75360cacactcagg
caatttgttc cttgaagttg agtgcctcca gtcttttctt catctgtatc 75420ctcccagctc
tgagcaccgt gcctgacact tggtggatgt ttgttgactg tgtgccaggc 75480acagacaata
tgtactggac ggaacataat aaatcaccat aggggaaaat gttagaataa 75540attccagatg
ggctgaggaa ttgaaagtga aaacactatg actggaagga tgggaaggag 75600ccggccttgt
gaagaatgag ggagagcatt cagccagtgg gaactgctgg tgggaaagtc 75660ctgaggcagg
aaagggtgtg gcctgttcag ggaactggag gacaccaggg tggctggagc 75720cccgtgggcc
agcgggagtt gggggggttg aggtagagaa gtgtgcaggc gccagctctg 75780tcgagcgatt
gcaatttgta gccaagaacc tggattttat ttgaagtgcc aaggtcagcc 75840attgaaggga
tcttgacaga taacacaatg cagattgagt tttattaaaa tcaccctgtc 75900ttctgtgtga
tgaaaggacc aggggcctct cagtagacac gatgaatcca ggaggaagac 75960tatggaagta
gcctagggaa agatgtcagt gacttggccc aggatggggg cagtgaggat 76020ggaaaagagg
aaaaaacagt ttaagatttg tttgctgatg tttgcaggat tagatgttgg 76080gggttagaaa
agaggaggat tcaaggacga catccagggt actggcttga gcaactcagt 76140agactcattt
gttgaggcag agaacagggg agtgatggta tgaggtttga actgtgtggt 76200gtggggtgtg
gtggagggtg gtagtcagaa ggagggagtc agtttagagg cctatgaaac 76260atgcaggtgg
acatcaagta gacggctgga aagaaaaact tacattccag ttgtgaaatg 76320gccaaagagt
tcaatagaaa catcgaaaaa gaaatattaa ctaaatgcat attcatcttc 76380accaataatc
aagtaaataa atctacatta aaaccactat caggccgggc acggtggctc 76440atgcctgtaa
tcccagcact ttgggaggct gaggcgagtg gatcacctga ggtcaggagt 76500ttgagaccag
cctggccaac atggggaaac cctgtctcta ctaaaaatac aaaaagtagc 76560tgggcgtggt
ggcaggcgcc tgtaatccca gctccttggg aggcggaggc aggaggatca 76620ctcaaacctg
ggaggcagag gttgcagtga gccgagatca tgccactgca ctccagccca 76680ggcaacaaga
gcgagactcc gtctccaaat aaataaataa ataaaaataa aaccactatc 76740agttaccatt
ttacttgtca aattgacaaa attggatttt tcaaaatagt tatgatacca 76800aattttggca
aagaaattga aatggactcc cacacactgc tggaaagtgt atgttgataa 76860aaacacttca
ggaagaagag tagaaataca tattaagtgt tttaccgtgc atatcctttg 76920gcccaaataa
attcattttt agtgtaccct aaggaaaaaa tgaatgtatg ggaatataga 76980tcattttaaa
ttaaatatac cctcacatag tttataacat taaaaaatgg ggggataata 77040ataggacatc
aggttaactg tattttgaag aatctttaat gatgtaaaaa tgcttgctat 77100gttgagagtg
taagagctaa agtataaact gttacgatgt tatgattcca atatttaaaa 77160gtacaaatat
atatttataa aatatatata catatgtaat ttttcatgtt ttctccagtt 77220aacattattt
tatgttataa agtaaaattt taacaagaaa atatttaacc ttccaagaaa 77280ttaaggaaat
gcagctttaa aaagaataac actttctcaa ctattaaatt atttcttagt 77340ctaaaagtta
ctttatttct gaagtaactg atctactact agtgttttgc agatgacact 77400tcatctcttt
caaaaagcaa tttggtgata ttatcaagat ctcaaaggcc ggatgtggtg 77460gctcatgcct
gtaatcccag cactttggga gtctgaggtg ggcagatcac aaggtcagga 77520gtttgagacc
agcctggcca acatggtgaa accacatctc tactaaaaat acaaaaatta 77580accaggtgtg
gtggcaggcg cctatagtcc cagctacttg ggaggcagga gaatcccttg 77640aacctgggag
gcggaggttg cattgagctg agatcatgcc actgcacacc agcctaggtg 77700aaagagcgaa
actccgtctc aaaaaaaaat aaataaaata aaaatacttc atccctttca 77760aaaagcaatt
tggtgatatt atcaagatcc caagggctgg gcgtggtggc tcatgcctgg 77820aatcctagca
ctttgcgggg ctgaagcgag aggatcactt gagtccagga atctgagacc 77880aacctgtgca
acatagtgag actgtgtctc tacagaaaaa aaaagaaaaa aaagaaaaga 77940aagagagctc
atttaaccta gtaactacct cctgaaaata tattactcat tagcaattaa 78000tttttttaat
aaaaggaaaa gctatttgtg tatattcttt ataactgtgg cctttcttgt 78060aaaagtgaac
atttggtagt agcgtatata tccagcctga aacttttggt acatctaccg 78120tatacagctt
ctataaacta tacaaaagat tattatagaa aaagttttgt gaaaaacgaa 78180acttggaagt
ttgtaggaag atgattctac aaatgataat cgagtgtgct catgagactg 78240gaaggaaaat
aacagaatga gtagagtcat gagggtaaaa tactctattt ctccaaagtt 78300ctgtaatatt
tcatattatt ttattgttac tgtttatttt atgccctgcc tcactccaaa 78360gtatatttga
tatgatactg ttttctagat aataaaacaa tgaataaacg tagtgagtga 78420ctataaggag
attcattatt aaatgcatca gaatgttttg ccttaacctg aggatggaag 78480aatttatgat
ctgagtaatt gctcaaaggt tctttgttgt cacgtggaag gctttgacct 78540tgattttctg
atttaggcat tcctatagac tcagctcatt ataaggatat gtaatttgca 78600tgctatttta
tttattcagt ccttaaattc catagttgtt taaaacaaat aggaagatgt 78660cttagccgag
cacagtggcc caagagctca taaccagcct aggcaacata gcaagacccc 78720catctccaca
gaaaattcaa aaattagcca ggcacaatgg tgtgcaccta tagtcccagc 78780ctactcagta
ggcttaggtg agaggattgc ttgagcccaa gagtttgaga atacagtgag 78840ctatgatcat
gccactgcac tccagcctgg gcaacaaagc gagaccctat ttctgaaaat 78900aaataaatag
aaagatatct tatttctcta gttcaacctt cacttgactg gagacatcca 78960cgccatcacc
gctgccaata acttgctggc tgccgccatc gacacgagga ttcttcatga 79020aaacacgcaa
acagataagg tgagaaggat gccttgctag ccattttggg tattgtatcc 79080tggaattcct
ggattcttaa ggagtttgtg gaaagaattc tggtgtctgc aaactctgaa 79140atgttattca
attcaatttt gtgattttta catatgtccc tgtttcccaa ggagaaggag 79200ccatcctttg
atcagattct caaatgggtt tatgagcact gccaagttat tgggagaatt 79260gtagtatgtt
gtaaaagcca aatattctct aagatgagtc tttagtttta aagatggtag 79320gtagaccagt
tctcttcatt ctggtcatag gtccaatctc tgatctgttc taaggggatt 79380ggacagatgt
cgttttaaga ctaatgttat gtggagctat tttagatgtg ctgaaaagat 79440gtcagagacc
cttattgggc ttcatgtgct tgtaggttaa gagttaaacc acacccccaa 79500aggggacagc
ccaccagaag ctaattcctc acagatatag gacatagata aaaggggtac 79560cttctggcac
attttacaac aagggtcctc actttcttta attaaagtag ttccttagtg 79620agtctttgct
acataatcca gatacatctt catatggtag ttgtagcgat atagatcaag 79680aaatatattt
atgcaaactc tttgcaatgt gtattatttc agatcttgtc agttttatga 79740aaatgacaga
atttaagtca acattgttta tattagccca caaattttaa tagggtcttt 79800tatctaaata
gttcggttat aaatgcaatg aatttttgtc atatcctgtg aattgtccaa 79860gttcatggac
aactaacttt tgtgtggtct catttttgtt ttgtgttttt aggctctgta 79920taatcggctg
gttcctttag tgaatggtgt cagagaattt tcagaaattc agcttgctcg 79980gctaaaagta
agtttccagt tagcaattct ttaaaaagaa aatatcgtag aaacgcatca 80040gaaagattgt
caaagcctga aattttcaac ttggtttgat tttggttttc agatatcctt 80100ttgggtatta
aactctatag agaccgaaca ctggagcaga gagagcacag tgttggaagc 80160ctgaaccctg
gttctacacg aaagagccag gtggacaaat taattaacct tttttggtct 80220cagtttcttc
gtctatgaaa tgagaggact gttttgctaa tttatgagta ctctagtcaa 80280ctgacaagtt
agctgggcac aggagcacat acatacctgt agtccgagct actcatgatg 80340ctgatgtgag
aggtttgctt gagccaagga gttcaaggtt tatagtgcac catgatggcc 80400cctgtgaata
gccactgcat gccagcctga gcaacacagc aagagcccat ctcttaaaaa 80460aaaataacaa
attagaattt ttggtgagtc cttctctttc taaagtgaaa tatttttatg 80520aaatttgtta
aatttctagg aaattatagg ataacctttg aaaactcatc tatctcattt 80580gtggttttct
gttttgtttt gttttgtttt ttgagacaga gttttactct tgttgcccag 80640gctggagtgc
agtggcatga tcttggctca ctgcaacctc tgccttcctg gttcaagtga 80700ttctcctgcc
tcagcctccc aagtagctgt gattacagac acgcaccacc gcacctggct 80760aatttttgta
ttttcagtac agatgggatt tcaccatgtt ggccaggctg gtctcaaacg 80820cctgacctca
agtgatccat ccaccttggc ctcccaaagt gctgggatta caggtgtgag 80880ctactgcgcc
cagcctcatc tacgtttttg ttgcagcaaa cactacaatt atacttgcgt 80940atgccatctt
aattcttaaa acctgtctta tcttttttct gatgccattt gctgcctctg 81000gtcttctaat
catattgctt gctactagcc ctgtttcaga gactaaaaaa taaccctaaa 81060tggtataact
taacaacctt actggttggt attaagtggt tttaaaaatt cacctcctgc 81120ctcttactga
accaagttca gtgtgttgaa ttagtgagta attccatcct cggcagctat 81180aacaacatgc
agtttctgtt tcccagatct tttttatgct tcattaaaaa tcactatttt 81240gccaggcaca
gtggctcacg cctgtaatcc cagcactttg ggaggccgag gcaggcagat 81300cctgaggtca
ggagattgag accatcctgg ctaacacggt gaaaccccat ctctactaaa 81360aatacaaaaa
attagccggg cgtggtggca ggcgcctgta gtcccagctt ctcgggaggc 81420tggggcagga
gaatggcgtg aacccaggag gcggagcttg cagtgagccg agatcgtgcc 81480attgcactcc
agcctgggca acagagcaag actccatctc aaaaaagaaa aaaaaaaaag 81540aaaaatcact
attttgtgtg tgtgtgacag tcttgctctg ttgctcaggc tggagtgcaa 81600tggcacgatc
tcggctcgct gcaacctccg cctcctaggt tcaagcagtt ctgcctcggc 81660cttctgagta
gctgggatta caggtgccca ccaccatgcc agcgaatttt ttgtattttt 81720agtagagaag
gggcttcacc atgttggcca ggctggtctc gaactcctga cctcaggtga 81780tccacctgct
tcagcctccc aaagtgctgg gattacaggc atgagccacc acacccagcc 81840aaaaatcact
ttcttaagat gttttattta cacaaaataa ataggaagat ttcttggctt 81900ggtgtggtgg
ctcacccctg taatctcagc actttgggag gccaagacaa gaggattgct 81960tgagcccagg
agttcatgac cagcccaggc aacatagcaa gacccccgtc tctacaaaaa 82020atttaaaaat
tggctgggca ttaaaagttt taattttttt ttttctcctc cataaattag 82080gaataactta
ctttgaaaca taggaataaa cttaacttga attactggta tttgtatttg 82140aaaaactaaa
gaactgcggc atcaattata tggacagaca gcagtttatg tctcttctga 82200gcagcgtaca
tggtggccac atagagcagc ccaaggagcc cctggctaac gcagccagca 82260gcaggcacaa
gtggtagtta gctgtatgct gtccactgca ggacatcttc agatcaatcc 82320ttttcagagt
ctattacgaa ctcacagtat cttactcgta gaagaaatct ggccataggt 82380atttgtctga
cttggcctaa cctataattc actcaaagtt cagtttagtt tagttttttt 82440ttgtttgttt
tttgtttttt gttttttttt gagacggagt ttcactcttg ttgcccaggc 82500tggagtgcaa
tggcgcaatc ttggctcact gcaacttctg cctcccaggt taaagcgatt 82560ctcctgcctc
agcctcccaa gtagctggga ttacaggcat gtgccaccac gccaggctaa 82620ttttgtattt
ttagtagaga cagagtttct ccatgttggt caggctggtc tcgaactcct 82680gacctcaggt
gatctgccca cctccgcctc ccaaagcgct gggattacag gcgtgagcca 82740ccacgcccag
cttagttttg atttttgttt tttggttttt ttggagacag agtcttgctc 82800tgtcactcag
gctggagtgc agtggcagga tctcagctca ttgcaacctc tgcctcccag 82860gttcaagtga
ttcttctgcc tcagcctccc aagtagctgg gattacaggt gctctccacc 82920atacctggct
aattttttta tgtattttta gtagagacgg ggattcacca tgttggccag 82980tctggtctcg
aactcctgac cttaggtgat ccacctgcct ccaccttcca aagtgctggg 83040attacgggca
tgagccaccg cacctggccc agtttagtgt tttttttgtt ttgttttgtt 83100tttgagatgg
agtttcgctc ttgttgccca ggctggagtg caatggcacg attttggctc 83160accgcaacct
ccgcctccca ggttacagcg attctcctgc ctcagtttcc caagtagctg 83220ggattacagg
catgcgccac catgcccagc taatttttgc atttttagta gagatggggt 83280ttctccatgt
tggtcaggct ggtctcgaac tcctgacctc aggtgatctg cccaccttgg 83340cttcccaaag
tgccagttta gttttttaaa aggctttacc aatcatggag aaaataaaat 83400tagagggata
attttggtga ccctttgctt tctgtttctt ctcaaacttc tcaccttttt 83460tcttcttaat
cattagaaac tgggaataaa taagactgat ccgagcacac tgacagaaga 83520ggaagtgagt
aaatttgccc gtctcgacat cgacccatct accatcacgt ggcagagagg 83580tgggtgctgg
ggagatgcca gcaggctgat ggccaggtgg ggaggcgtgt tcgagccgaa 83640gcgcttaaat
tcagaaactt acagtttgat cctgtccagt gatcttccca gcctgtgctt 83700catattccca
ccatttattt cagttactat tatcgtacca ccctccctta tccttgcctc 83760caattaaaac
aaaaattctg tccagggtca atcccatcta ctcctgtgtc ttctaaattt 83820ccaaggaaaa
ttacctaatt ttctcttctg tttctctctc ctttggcctc ttgtatctat 83880attaagaaaa
aaaaaaagaa acttttttta aagaccacgc ctccctggct ttgtcctgct 83940cctcctccac
ccaccatccc accttcttct gttcacttcc ttaccactgg ctctcggcag 84000ctctctgtgt
gtttccagtg ctggattttc ctccaagtct gatatgtgct ttcacccact 84060taacgaggca
tcgctccact gcttcacatg catccaggtg gagtccccgt gagctcccca 84120ccacctcgac
ttccctgctt ccctctcagt tgtgaagtcc ccttccttct cccacactgc 84180cctcagtctc
atcgctcact atgggatgag acagagggaa atgcactaaa caggaagtta 84240ggagccctga
gtgtgtgtct cccctgccac tgactcactg catgatctcg agcaggtcgc 84300atgcatgttc
ttggcgttgg agatggagat ggtcacatac gctgcacaga aacacggcaa 84360ccacatttta
tgaaccctgt gcaactgcag aaatgttcaa tggcatggat ccctgattct 84420tcctggatta
ttcttctcat acttgtgccc tcatttacct ctttatgctc acatttggaa 84480catatttatc
agatgcgcat tttatagcag ttcccgacta atagattatc ttgcttccgt 84540ttccatcccc
cagttcagag ggttcacatt aacagagata tcccgttgtg atcagaggcc 84600tccagtggct
ccccacatcc tgggtgatac agttgccatc gcagagcttg gtcttcagga 84660cacccgcagt
ggcttccctc actctgctcc ctgcctgctt tctagtcagc agtgccgacc 84720tccatcctcc
tgccagccct agctgggcca tctccatcct gtctcctgcc taggccagtg 84780ttgccctcta
agatgcaccc aggtctcgtc ccttcaggga gcttcccttc tgggaatcca 84840cccagccagt
ccattaagaa tttagtgtag tcccacaaac catgaggtag agagacaaag 84900accagagact
gagcgttggc tctgcggttt atgggctgca caactttgga cagactgctc 84960actcagtttc
ctcatcttta aaatggaatc ataattccag ctgccttgct gggaggcttc 85020actgaaacat
aagcatcctt tccagaccca aaagctctgc ccacacgatc atcagaagtc 85080actgtggtga
tagctgtatt gagtagagtt ttttgtttgt tgttattttg agacagagtc 85140tcacttgccc
tgtcacctag gctgaagtgc agtggtgcga tctaggctca ctgcaacctc 85200tgcatcactg
gctcagtcga ttctcttgcc tcatccttac gagtagctgg gattacaggc 85260acacaccacc
acacgtggct aatttttgta ttttactaga gatggggttt caccatgttg 85320gccaggctgg
tctggaactc ctgacctcaa acgatccgcc cacctcagcc tcccaaagtg 85380ctgggattac
atgcgtgagc cactgtgcct ggcttgggta gtttttttga gtgtctttgt 85440ctcccaaact
aaactaaaat ctccttggaa actttatggt tttttttttg tttttttttt 85500gagacagggc
ctcattctgt cacccaggtt ggagtgcagt agtgcaatct ccactcactg 85560caagctctgc
ctcccaggct caagcaattc tcctgcctca gcctcctgag tagctggagg 85620ctgtgcgcca
caacactcag ctaattttcg tatttttagt agagatgggg tttcatcatg 85680ctgcccaggc
tggtctcaaa ctcctgagct caggtgatct tcctgcctcg gcctcccaaa 85740gtgctgggat
tacagacatg aaccactgca cccagcctag aaactttcaa ggatacattt 85800ttcattttga
aataagtaat aacattatag agattttgga aatgtctcaa aagaatttta 85860aaaaccttgc
tatactaata aaactactat tcatagtttt ttgtgtgcat ctagtttttt 85920ggtataactc
tagttataac tttagtgtat agattttata tcctacgagt ttcattcatt 85980tatatcataa
gcattttgta ttttgttagc tctttgttgt aatggccatt ttaaatactc 86040tagaatattc
ctttgtggat gaaccacaac ttacttaagc aaggctcagt tcttgaaaac 86100accaacagca
ttttaaataa tggacatgaa tactttaagc attttattga gaagcctaca 86160tattttcttt
gaaaattttt tttataacta ggaagattta ttattaaaat gtcatcatgt 86220ccaaaatgta
ttgtcatctt tttttttttc aaagcatttg tgattttgtg ttactcttca 86280aaactttttt
ggaggtttac tgatgatagg aaaatcagga aggtggaaat tagtgtattt 86340taaaccaagc
atatggatgt gctaaaagag aaagttttat gccaattggt attctttggt 86400gtctgctttg
gggttccagc ctgagcagcc atcagaaaca gggcaatggc ttagacatgg 86460atcttggaag
agaaaggcta catcatcttt tatacctggg tctggtctgc aattggtttc 86520tctctggagc
caggaaggac tgaagtgttt tgtggttgtg aatcactttt aatggaagtg 86580acaactttac
aaaattcttc cggaaaaaaa tttgaattcc attattgttc agggcactga 86640agtaaagaaa
actttattgc tgtggacaag ctacctgggt cagaggaatt tggaatgaca 86700tttgataagt
agtttggctc ttatgaccat gaattaataa cagaaatagg actcttcatc 86760aaaagaattg
agaacatttg ctagtagggt tttttggctt aaacctagaa ctaatgtaac 86820ccttgtgatg
aaattggatc cttatttgat ttgacaacta gagcaggaca gtcagggtgc 86880acggtggtgg
gagtgagacc ctcttctcag gccatatgga tgttgcgcca tttaatgacc 86940accccttcct
ttcagcctgt gtacaaaatc tgatgattgt ttcatctttt tttttgagac 87000agagtctcac
tctcttgcca ggctggagtg cagtgtgcga tctcggctca ccgcaacctc 87060caactccctg
gatcaagcaa ttatcctgcc tcagcctccc gagtagctgg gactacaggc 87120atgcgccacc
atgcccagct aatttttgta tttttagtag agacggggtt tcatcgtgtt 87180ggccaggatg
gtctcgattt cctgacctcg tgatccgcct gccttggcct cccgaagtgc 87240tgcgattaca
ggcgtgagcc accgcgtcca gcagattgtt tcatcttatg cctcataccg 87300aaagtttaaa
ggcctcatgt tcctggaccc agagtggaac aaggaaccca cagacctttc 87360cagaaatact
taacataagc agaagaaaat atagacatac agccattttg ttacagagca 87420taattttgca
cagatgagcc tctgcctgtc cgagccccgg ggcctcacag gcttgccgtc 87480tctccatctc
cctcgcatgc ccaactcctg gccatggacc aagggcccac aagcctccct 87540cagcctcagg
gaatcaacca tcaagtggct ggttccttgc cccaacaaca ctctagggaa 87600tcagtaccgt
gtcttccact ggggccgaaa cagtggacct gcccctgatt gaacaccttg 87660tactgccagg
ctcttggcca atattggctg tttcatcagc cattaggcat actgtggatg 87720ttccttagga
catagtcagc actgtagtat agtcagcact gtagtactaa aagggatggt 87780ccagagcaaa
aaataagcaa acagatctcc cccggctgcc tccaccccca cacccactgg 87840cattgtccta
gcatgcctta cataagaggc tggcactgtc ttcactatat ctctgtacag 87900ctggctcggg
actaggcaca agtagcccct caatacatac tccaggagtg aataagtgtt 87960tcttattcca
cctttagttc tatcattcag agaccaggaa ttccattttg gatcactaac 88020ttagggttac
tttctagtat ctttcttgtg ggctggaatt taaccttcac atttcttatg 88080tatcactgat
gcatcactta ccgggccatg ttaaaggcag aggtgggctt tggaggccac 88140tgattctagt
tgatcttgcc agacattttt ttttcttgtt ttctctttgc tagataaaaa 88200acagctgaat
ttctcttgtt gattttaaag aaatttgata aaaagattcc ctcaatctgg 88260tgccaagatc
gtgtatacta cagagcagga aagaggtttt tagttgattg actttgcacc 88320ccatgtacat
cttgggaaga acagtggtgg ctgaggatac tattgaaaag caggcctttc 88380attttgtata
tgctctgctt atagctgaaa aactcataaa gaaggtttct gtgtgtttat 88440atgagtgtca
gttggtgttt gtgtgagggt gtgagtgtgt ctgagagaga gggagcttgt 88500tataaagact
aacaacagcc gagtcaggga acagccatag gccaagaggg caagctggcc 88560tgctggtgct
ttgcgtatga aagaaaaggt taactttctt catcctacgg atgcctatag 88620taccaaatac
gtaatttcca tcaccatttt cataagtgct gtatcgtata cattgagtaa 88680agtggatcta
aattgtcaat gtggacagct tgttatttga agctgcataa attaacaaag 88740tcagtgtact
ggctttgttt tgttgcattg gtattcacta aaccaaacct aattttcaca 88800aaatgtaaat
aactgcaatt cttgttttgt tttgttttga gacagggtct tggctgggtg 88860cagtggctca
cgccagttat cccagcactt tgggaggccg aggcaggtgg atcacatgag 88920gtcaggagtt
caagaccagc ctggccaaaa tggcaaaacc ccatctcttc taaaaataca 88980aaaaattagc
caggcatggt ggcaggcgcc tgtaatccca gctacttggg aggctgagac 89040aggagaatcg
cttgatccca ggaggcgaag gttgtagtga gccaagatca cgccactgca 89100ctccagcctg
catgacaaaa aaaaaaagag agacaaggtc tcacttccat ctcccaaatg 89160ggagtgcaga
gacacaatca tggtcattgc agcctcaacc cccaagactc aagcaatcct 89220cccacctcat
ttttgtattc tttgtagaga caaggtctca ctgtgttgcc caggctgttc 89280tcaaattcct
aggtgaccct cccgcctcag cctcccaaag tgctgggagg tgtgtgccac 89340cacacccagc
caataactga aattctaata agccagtgaa gaaaataaag agaactggta 89400cttgctctat
tcaccattca cccccgccac cccccacaat tctactttat ttacttttgc 89460tttttaaaat
tgtggtaaaa tatacataac gaaatatttt tccccatttg taagcacaca 89520attcatggca
ttaaatacat tcacaatgat tgcactatca tcactatcta tacccagaac 89580cttttcatta
tccccaacaa aaactcctta tccattaaat aattactccc ctcttctgcc 89640cctgcctatt
ctaggtgcct gatataagtg gactcatata gtacttgcct ttctgtgtct 89700ggcatatttc
actatgcacc atgttttcaa catccattca tgtatcaaca tgaccttctt 89760ttatagccgt
ctaatattcc attgcctgtc tgtgccacat tttgtttatc cattcacctg 89820ttggtcgctt
cctccttttg actattttga atagtgctgc tatgaacatg ggtgtacaga 89880tatctgagtg
tctccttttg gttctttcgg gtaatgccaa gaagggaagt tgctggttgg 89940gttttttttt
tttttttttt ttttagaggg ggtctcactc tgtcgcccag gctggagtgc 90000agtggtgcaa
tctcagctca ctgcaacctc cgcctcccag gttcaagtaa ttctcctgcc 90060tcagcctcct
gagtagctgg gactacaggc acacacaacc atgcctggca aatttttata 90120tttttagtag
agatggggtt tcaccatgtt ggccaggctg gtctggaact cctgacctca 90180aacgatccac
ccacctcagc ctcccaaagt gctgggatta caggcgtgag ccactgtgct 90240tggcctctag
tggatttttt tgtttgtttg ttgtttattt gattctttct ctatgtctac 90300ccactagatg
gtcagcttgg gggagcctga ccttttctct ttcaccctct gcaacttcct 90360gagcatcaga
acaatgccag acacacacca ggtggctggt ccatatttgt cgactgaatg 90420actaatctgt
tctctttcag tattggatac aaatgaccga tttctacgaa aaataaccat 90480cgggcaggga
aacacagaga agggccatta ccggcaggta ggtggtgctt tcttcccggg 90540tgtggctctg
ttcttacgtt catttctact ggcctgggag ctcactcttt gcctttcttg 90600tctcatcccc
aacctgttgt ggccagtggt tctctcactg ttttgggtgc tcagcagggc 90660tgggagagga
tcagaagaca tagaggctcc agtgtctttt taccatcttt tctatgtatg 90720tttatttcat
tttctttttt tcatgtatgt ttccaattgt gtttccagga agagccactc 90780agggtattct
ttgaatataa attctcactt attttattgg gaaccatcca ggtaatacta 90840catctgcata
actgataaat gaaaatctca gacacattac tcaaatttca tatatcaaag 90900aacaaacacc
tagttattag acttaatagc atgaggattt tttaaaggat aatggcttat 90960taatacaaaa
atggcttatt aataaaatac agagaatgca gaagaaaaat tatgtctgac 91020ttcattgtaa
tagatattgg atgtgcgtca atggcactta gggaatacaa attaattcct 91080atgatttttc
atccaggcac agtgggcttc taaattgtgt tgtgtttttt tttcttttga 91140attgattata
aactgtacat ttaagaacag ttttagattt ctgtccttta tgttaagttt 91200ttattagtga
aaaaagagtc accttactat aataaccaat ccttacgcca aatagacttg 91260tgaccttttg
aagtgtcttg aattaaaaag tgaatgtagg tattggatta agaagagaaa 91320cttaagttcg
tggacctcgt ctcctcccag tcctgtaact tacacacaca cacacaaaca 91380gatttcaaat
atccatgcta gcaaactgga atcattgcca tgaagtttta gatagttctg 91440aaaaatagga
agatggaatt agattgatga ttaaaaggac agcccaaaat atggtggact 91500tccatcctta
acaagcattc ttgcttttca tgttcttagc ctctcctgct aaagtgggag 91560ctttataaaa
gcagaggtta ttcccctttt tatttgctgc tgtgttccct atgttcagaa 91620cagaactttg
ttacatatag tcgtaagtgt tcagtaaata tctgttgaac aaatgagtag 91680aataagtaaa
agacaattta tggatacaga actaaaatgc caacataaaa agatgttcat 91740gaataatcat
aaaactgcaa attgaaacaa tggagtatca tttacctagg agcttgtcaa 91800gtgaaaatag
taggtactaa atgctagtga ggatgcaatg aaatgggggc atttatgcag 91860cgttagtggg
agtagaaatc ctttttggat gagtttgtta ttactaaaat ttgaaatgac 91920ccaacaatta
caactttgga actaaaaata ttttaatgaa aagaacaaga aaatgtgtaa 91980cgtacaggaa
aaaattctgt gtattaggat atttgttgtg ccattatttg cagcggtgaa 92040aattggaaac
aatccaaatg tccaacagta gagatggttg catatatttg gtccgccatt 92100ttgtcccatg
ggagagttga aattaataaa tgtgcgtatg ggtgtcttga catgagtaga 92160gagctctcat
acaccattgg aaaggttgta aaaaccattt taaaactttg aagtaaaata 92220ctgatattgt
aaacctgttt atgttaagaa aaacccagcg tgtttactta cattataggc 92280acttacaact
tatttgtgta tataagtgcc tagaatgaaa agtagaaaga tgtatgccaa 92340ctgttggcat
tagttgcctg tgtaataaaa gcgagtattg aaaaggccct ttctctgcat 92400agttttttac
tgtttaggta ctggagagat ggaagattca cagtggcctt gatttcagtg 92460cctactctgt
gcattgtatt gatagttcca ttacatatat tagcacagtt catttgtaaa 92520gctgatttgc
aaatgaggaa attgggatgt tatgttaaaa aaacttgccc aaggttatat 92580cttgtgttgc
ttcgtgcata gatattttta agtaaaaagg ggttatagtg tatttaagag 92640ctttaactgc
tttttattta tcataagact gtagtgaaga catcttttta tgtcgataaa 92700ggtagacctg
tatcttcatt tttaatgacc acatcatatc tcatagaggt accatagtag 92760attcagccat
ttccctgtta atgaacatgt agagtgtttg cagtttcagg ctatcacaag 92820taagctgtag
taagtataca catatatttg tgtgtatact tgatactatc cttaggatat 92880tattccagat
gtgtgattgc agagtccaag aggatacaaa tttccaatgt tgacatatgt 92940tttgacatta
acttccagga aattcatacc aattcacatc cttgccaata tcccatattg 93000gcctttgttt
aaatgtgtgc taaatccctc atttgcccta tcgtgattat tacacattgc 93060atacctgtat
cagaatagtt catgcaatcc acaaatatat acacctacta tatacctaaa 93120aaaatgaaaa
ggtaaatact aaatcaataa atgtgtgata agctgatgaa caataacgtt 93180ttcattggca
tctttatatg attttgaaat attttatttg caaaacagaa atgagcagcc 93240ttccgctatg
cacgtcattt agtcattaaa gtgaatgaca gtgatacttt aattgaccca 93300gtgtgtttca
ttttttttat gctcaagatg tgaaattttg taattgtaac aattacaaat 93360gaagagcttg
aatatcgatt atatattatt cttatttaca cagctaatat ttcaagaatg 93420tttctagtct
catgttttgt acagaaaatc tccatatttt ttagtcattt tctcatatgc 93480actgtttata
atagaatggt agtatagaga accctaaatt tcatacaatg aaagaccact 93540tatgtttgga
atgagttata ttgagaatgt aataattgaa tatagttgta tttgactttc 93600aggtaatatt
ttttcactaa atcagtgact aatgcaatct caaccaggaa gtagccagaa 93660agctccttat
ttccttcctt tccttacttg tcagtgatgg aaatggataa taaaatgtct 93720cataaggtga
tcccaccagg gcacagccat tctctctctg cgtaagcttg aacttgtctg 93780gctcacactc
cccttgggaa gaagagaatt aagctggggc gtaatgaagt ggttctgtcc 93840agagaaacat
tcctagtggc tgaattatga ggaggaggaa gtacacattc ctgtgtccct 93900tttattccat
gttggacttc gtacaacgtg gagccacagc caagagatgg gaataggaag 93960tcctagaact
ttttgcaaaa agtctgtgct ctctagcttt cactactgaa agcttcaaga 94020tctgatgtgg
ccatatcctc ctgggcagct gaaaattgag tctgtgccac cagttgactg 94080ggttgccctc
cctgctgtgt tgagatccag ctctgggaaa gtagcctcac atgcaggaag 94140ccagcatctt
tcactgcccc ttatctctgt gccagacaga caatgcagcc ttcttgacag 94200acctattact
aacaagggag gtggcccgta ggctagtcat taattcatct gatgcatatt 94260gtatgcctac
tgggtctcat gctccgtgct ggggattcag aagtgaacct gcctaatata 94320gtccctgccg
tcatggcact tacggtccag tggtagactg ataatgacca cactagactg 94380ataatgacat
tagtaatcaa ttaagtctga gacatgctac aacagcaaac catgtggtag 94440tgctgtgagc
gggtgtaaca ggggtcctaa ctcagcccac cctttgctga gcagagcatg 94500gcaaagagca
cattccgcct gggaggccac cctagggagc accaggcaca aaccctgcat 94560ggtccagaca
cgcagaaaac tgatgggatt gcatgcatgg ccttttgagg gtagactctc 94620tgggcctgtg
gttgcttcat ggctgaagtc cagctccttg tgggaatggc catcccactg 94680gcactgtcgc
tgaccactac ctgtgtttgg gctggttcag gcgcagtttg acatcgcagt 94740ggccagcgag
atcatggcgg tgctggccct gacggacagc ctcgcagaca tgaaggcacg 94800gctgggaagg
atggtggtgg ccagtgacaa aagcgggcag cctgtgacag cagatgattt 94860ggtgagtgtt
tccaactcgg aagcttcagg gagtggacgg tcctcgttct tcgttaccag 94920aaaaagaaag
gttttcttcc ttttatcagt gagtgtaatg aggatacaga aaaagtttcc 94980acacacacat
ttcttccccg acaagcatgt ttgcctaact gcttcctaat gttctcatag 95040tcatatatga
cgtgcttatg cctggctgca acaaaaacat gtgaaattta tttcaggagc 95100tttccaccat
ccttgccaat cctgatctct gcttctgcct gggctgggga ggagtcaccc 95160ccctgcagtc
acttggtgac ttttaaaaaa tgtaccttct atgcctgcct gaaattgagg 95220gcaaatcacc
tttccaagga acagaaggaa gtggctctgc ttccatgaac aggaagataa 95280agttaaaatt
ggcacatttc tggcaccagc ctggcttatt atttaatttg gggccttaga 95340tttgggtttt
gtattataaa agcaaatctc tgcctcaatc agtcttgctt ttctagttcc 95400cctttgagta
gttgtggtgc tttctgtaaa gcttccatgg aacaccattt ccttttttag 95460tgctttttct
aattggcatg ctagtcttct tttcttcttc gtctttctgg tataataaca 95520acctttcctg
ttaacttttt tttttttttt tttttttctg agacagtctg gctctgttgc 95580ccaggctgga
gtgcaatggc acgatctcag ctcactgcag cccctgcctc ccggattcaa 95640gcgattctcc
tgcctccgcc tcctgagtag ctgagattac aggcgcacgc caccacgccc 95700agttaatttt
tgtattttta gtagagatgg ggtttcacca ggttgtccag gctggtctcg 95760aactcctgac
ctcaggtgat ccacccaact tggcctccca aaatgctggg attacaggca 95820tgagccacca
cacctggcct tttccagtta acttaaactc cacatattcc ataggtgttt 95880tgtttgcttg
catttttgta tataagtgca tgggtgtgga cacaggcctc taagatagcc 95940attactccaa
atttaagttt actttcccat atactgaatc tcctacaata atggaactta 96000aaatgcacac
tggtgattgg taagcagttg tgaacagaga gaggcaaagt catattttat 96060taatttcaag
ggcttagggt ataatttgaa ttctttttca aagtgtccgg attgtttttc 96120agctgctttt
gaagaaatga atctaactca tctcagaagt cttgtcaagt aaattcaata 96180gatcaacgta
tgtcgcagcc tatggaatac ttagtataaa aattgaagag aaaaccctca 96240ggttgctata
aaagtgctgc tttattaaaa ttaccaaaat ctcactattt ctatctaaac 96300tgacatacta
cagtttctgt ctaaatacat tgctctgtat tttattaatt tgtatggcaa 96360taaatatgtg
tcaggaacag ttttaaaact taatcctcaa atactgcttc cttaaatcta 96420tttatttatt
tttatttatt tattttgttt ttgagatgga gtttcgctct tgttgcccag 96480gctggagtac
aatgccgtga tcttggctca ctgcaacctc tgcctcccag attcaagtga 96540ttctcctgcc
tcagcctccc aagtagctgg gattatgggt gcccactgcc acacccggct 96600aatttttgta
tttttagtag agatggggtt tcaccatttt ggccaggctg gtctcgaact 96660cctcaggtga
tctgcccacc tcgacctccc aaagtgctgg gattgcaggc atgagccacc 96720acacccagcc
cttaaatgta tttttgtgtg agtgtatgat aaaaatcttg gatctggaac 96780aaacccaaag
attactcttt ttagttacct ttttaaataa actctttatt tatcttttta 96840ataaactctt
tttagttatc ttttgtgttt cttgtttgca aaaagtaact aatttattaa 96900attgtgcttc
atccagtggg agctattgag agttggccac ccatagaaaa ttgagggagg 96960attttaggaa
ctgctgaagg tttctgccac atgcatgatt ctgatatacc ctggaagaat 97020ccatctgagc
cttccccaca tggacacaga gtgtggaatg ctagcagcac acagagggtg 97080gcagagcttg
ccctgcacgt gacagcacag gcgctgagat gtgtcccaga aaaatgagca 97140ggtagaagcc
tacctgagac atccaacaga aacatccctc ttaacttcct tcctccttta 97200gtgttttgaa
tcagtttatt tggttgggaa gtgtaagcaa ggatagtgta cagagaagta 97260ggaaaactac
tcctcttctt ccacctgctc tgcattagca caacaggaca actctcactg 97320agccttcctg
tcctgaagat ttcttttttc ttttcttttc tttttttttt tttttttttg 97380agatggagtc
ttgctctatt gcccaggctg gaatgcaatg gtgcaatcta agctcactgc 97440aacctccaac
tcccaggttc aagtgattct cctgcctcag cctctcgagt tgctgggatg 97500acaggcacct
gccaccatgc ctggctaatt tttgtatttt tagtagagat ggggttttgc 97560tatgttggcc
aggctggtct cgaactcctg accccaggtg atccacccgc cttggcctcc 97620caacgtgttg
ggattacagg tgtgtgccac catgcccagc ctgtcctgca gctttcaaag 97680tgaaccatct
tcgtgtatgt ataatgagtg ggttgcattc atatcatgtg tgttttatac 97740actgaggtag
attccagtta cgtaggaata tagttagtaa tagtgtattc ctgaaaattg 97800ctaagagtgg
ttataaagta ttctcaccac caaaatgata actgtgaggt aatgcatatg 97860ttaattaact
aggtttaatc attccacagc atatacttca aaacatcatg ttatacacaa 97920taaatacata
cagttttatc tgtcaatttg aagttaagga agaaaataga ttggattttc 97980ttacaccttt
taaatgatta aatattataa aaatagaaac caatcacatg agagaaacat 98040ttactactgt
tgttgataaa agtacgggga gaatgtaacg cactttagag tgaggaagat 98100tctctggatt
ctaccactgc atctgttagc gccatggcat actgtggtaa aatggcactg 98160ttttccgatg
ctatgtacag caagggatag cgccctgttt tcccccccta gagctaccgt 98220tacaagcatt
tgcttatgtt atagccttcc aataactaat atgttgaaaa caccaagaca 98280ccacttaagg
tctttacaaa gtgagaagga attgggaaaa aaagaaacaa aggccgggca 98340cggtggctca
cgcctgtaat gctagcactt tgagaggccg aggcaggcag atcatgaagt 98400caggagttcg
cgaccagcct ggccaatatg gtgaaacccc atctctgcta aaaatacaaa 98460aaaaattagc
cgtgtgtggt ggtgcatgcc tgtagtccca gctactgggg aggctgaggc 98520agaagaatcg
cttgaaccct ggaggcagag gttgcagtga gcctccagcc tgggcaacag 98580agagagattc
cgttctcaaa aaaagaaaga aaagaatgtg aatattgaca tcacattaaa 98640aaataataac
aaactttagc cttgtggcct ttttttggca catgttttcc gctagttctc 98700atgcatcatc
cctattctct ctcttctctg tcttttcttc atccaggaac agagaaagag 98760gacttagaag
gaactcaacc cagtacttcc taggattgga aggaagctgc cttatttcat 98820gtataagtgg
caccagcttg actgtaggaa gagagatagc gtgaagcctt tcacatctaa 98880actagaaagc
ttttaaacgt ggtagtttgt tgagttattc tctctctgtg gaatgaatga 98940tgtagagatg
gtgagaaatc agagtgggtg attttagaca tttcccttta ccctacttca 99000ttcctgctct
gtacagagcc ttgtttctgt tcttcacctt atttctttga gcaagagtaa 99060agcctctgat
tatgaaaata cactgtgcat aaatagccag aaaacacatt ttcttgcaat 99120gaaagatttg
ttttatggaa aatctcataa acaccatcaa ggaaggtttt ggaagaaggt 99180caaatgtcct
tcaaaatgct attaccccat caccattcac atcttagtgt gactgacatc 99240cgttctcagc
tctggtgggg agaggtggac agagacagca gcactgtccc ctgtggccca 99300gggttgcctg
tggggacgcc cacccaagaa gcaggctgga gagaagtgcc aaggggttaa 99360ccattgcctg
gatgatattt tgtcaggaat ctaatgtttt tctctctcct gtagggggtg 99420acaggtgctt
tgacagtttt gatgaaagat gcaataaaac caaacctgat gcagaccctg 99480gaagtaagtg
gttttcttct tatagtaatt gtttgcatta actggattgc cacacaatgt 99540gaacattttt
agccaatatg aggcaggcat agagcatgcc tggggcagca gtgtgctttg 99600caatctcagg
cagaaagagt gaggatctgg ttaggaacct gtattagtcc attttgacat 99660tgctacaagg
aactacctga gactgggtaa tttatgaaga agcatcagaa taacttcttg 99720gccaaccaaa
aaaaaaaatt tttaagtgta ttcatagctt ctttaatata agtcatgtct 99780tggcttaatt
gacccgagta gaaacttaga aagtcaatgt ctgaggccag gcgcggcggc 99840tcatgcctgt
aatcccagca ctttgtgagg ccgaggtggg cggattatga ggtcaggagt 99900tcaagaccgg
cctgaccaat atggtgaaac cccatctcta ctaaaaatac aaaaaaatta 99960gccgggcatt
gtggcacaca cctgtagtcc cagctatttg ggaggctgag gcaggagaat 100020cgcttgaacc
cgggaggcag aggttgcagt gagctgagat cgcaccaccg cactccagcc 100080tgggcgacag
agcgagactc cgtctgaaaa aaaaaaaaag agaaagtcaa tgtctaacat 100140ttcaataggt
tgtatggcca tgtttacaat ttattttctg tgttccctta atctctcata 100200atcattgagt
tgtggatttt ttggtttgct tttttattgt tgtttgtttg tttccaagca 100260aaatacaata
aatgccaaga ctttccagga aagcatttgc caaaggcaag tcaataaaca 100320cacagcatgt
tagttggtgg taaggtctgg agaagaaaaa tcagaggtca tgaggggcag 100380agtgatgtgg
ggtatgtgtg tctgtgcctg cccgcctgtt aaagcagtac cagcacagag 100440ctgagtaaac
caagggggtg agccatgcac gtgtcttgaa gaacatttgg acagagggtt 100500acctttagtg
caggggcctt gggcagcaag gtgttttgca atttcagaca gcaagagcga 100560ggatctggtt
aggaacctgt attagtttgt tctcgcattg ccgtaaggaa ctacctgaga 100620ctgggtaatt
taagaagaaa attggtttaa ttggctcaca gttctgcagg ctgtacagga 100680agcatggctg
gggaggcctc aggaaactta tcatggtgga aggcgaaggg gaagcagaca 100740agtcttcaca
tggcagaaca ggagagagag ggcgaaggag gaagtgctac acacttttaa 100800acaactagat
ctcatgagga ctcactcaca atcatgagaa cagcaagggg gaaattcact 100860cccagaatcc
agtcacctct ccccaggccc ctcctccaac attgaaggtt ataattcgac 100920atgagatgtg
ggtggggaca cagagccaaa ccatatcaga accaatatga cagaaatggg 100980gggattggcc
aggcacggtg gctcaagcct gtaatcccag cactttggga ggccaaggcg 101040ggcagatccc
ttgaggtcag gaacctcgag accagcctgg ccaacatagt gaatcccatg 101100tctactaaaa
atacaaaata aattagctgg gcatggtggc acgtgcctga aatcccagct 101160ccttgggagg
ctgaggcagg agaatcactt gaacaggagg cgaagattac agtgaactga 101220gattgcacca
ctgcactcca gcctgggcga cagagtgagc ctcttatctt ttaaaaaaag 101280ggaatagaaa
aaaatggggg gaccatttct gaaggtgccc agtgaggatg cagactgcag 101340gtgtcccctg
agctgaggaa gggaggactg aacgcctttg aacaggtgtc tgcagccccg 101400ggggaggggt
gcaggatgca caggtgccaa ggccacaccc ttctgcggca ggttgtcctt 101460cccgtctttc
ctctcccctg atcattctgc cagcatgcac gtatcctcag aggtaggcca 101520tcttaaaaag
caaacaaaaa catttcaaaa acctcctgcc cccacctgca tcctcatcta 101580gcttctgccc
agctcttgtc tgtcactggg gtctccatgt taactgacct cgggttccac 101640ctctcctgac
tgcccttgtc caggtcactt ttcccttgga gacaccaagt ctttgttgct 101700gtgccctcct
cagagaagct cagcagagtt tgacacagct gcccttgtct tccttcagga 101760gcacctggtt
cctctggcgt ctggggtccc cccctcccca ggcctgctcg ctgctcctcc 101820tccgctttgc
tggcttctct gatgacccct cacctttttt gccccgttct cctccagaat 101880tagccaggtg
atgactccca aactcttgtt tggaatatca tgcacctgga atttcagact 101940tgcaggtata
gttaccagga gccacgtggg tggggaaaag ccatttcaaa ctgatacata 102000aaccaggacc
cacctcctac ctacttcctt ccagctcgct gcttccctga aaccttccca 102060gtcattcaag
gcagaaaccc gaggatgatc ctagaggcct gttttccctt cccctcttcc 102120tgtgtccatc
cctgcagagt tgtccttggt tacatgactc tgccctgtcc acgatgctca 102180atgcctccct
ccttcctgct ttagctgcca gcacgcccca cctgcacgag tgcagcagcc 102240tcccaaccag
tctctctgct gctactcttt tcttccaaaa tactctctcc tcagcagcca 102300tagagactga
aacctaatga agccctgttg ccttcctact tagagttctt ccatgattta 102360cccatgtttc
caccccggtc tgaaaagccc tagggatctg gctctcaccc cttcccgctc 102420cttgtccctt
atcctcatct ctcatcctct gggggacctt tgcacggcta acaccactac 102480ctgggccact
tcctgatatt ctcatgtcca gctccttcgc ctccctctgt tctcatgtca 102540tgtaagcatc
agctgctcag agaagccaca ggaatgtgca gcccccgtcc ctctgctgtc 102600attgcagtac
atggcacaga atgctgagag cttcctattt ccttatctgt tttctgtatt 102660tctcctccca
ccgagatgac agcaagacct tttccatcct gttcaccacc gtatctccag 102720cttctagaaa
gtgcccgaca cctattagag gttcagtaaa gacttgttga ataattaagt 102780gaatgtttat
tatttttact acattctcct aaaaatacac atttattaat aaataactga 102840ataagagaaa
aaatacgaag tgatgtgata ttccttcata acacttctcc ctcaaagatg 102900aatacttatt
attagggata atcattttct ttattgttgt tgtttgtttt gttttgtttt 102960gttttgtttt
tttgagacgg agtctctccc tgtcgcccag cctggagtgc aatggtggga 103020tctcggctca
ctgcaacctc tgcctcccag gttcaagcga ttctcttgcc tcagcctccc 103080aggtagctgg
gattataggc gcctgccacc acacccggct aatttttgta tttttttttt 103140aagtagagat
ggggtttcac cacgttggcc aggctggtct caaactcctg acctcaggtg 103200atctgcccgc
ctgagccccc aaagtgctgg aattacaggc gtgagccacc atgccccagc 103260caggataatc
attttcttgt tttaaataat tctttttttt tttttttttt tgaaacggag 103320ttttgctctt
gttgcccagg ctggagtgca atggtgtgat cttggctcac tgcaacctct 103380gccttccggt
ttcaagcgat tctcctgcct cagcctcctg agcatctggg attacaggcg 103440cccgctacca
tgcctggcta atttttgtat ttttagtaga gacagggttt taccatgttg 103500gccaggctgg
tctcgaactc ctgacctcat gatccaccca cctcggcctc ccaaagtgct 103560gggattacag
gcgtgggcca ccacgcccag caaatttttg tatttttagt agaggcaggg 103620tttcaccatg
ttggccaggc cagtctcaaa ctcctgacct caggtgatcc gtctgcctct 103680gcctctcaaa
gtactgggat tataggtgtg agccaccgtg cccggcctga gcacttttta 103740aaacataatc
taaacctgtt atttctttgc tgcctaataa atcagagatt cagaaccctt 103800aaaaagcaaa
taaaagtaag ggcaataaga cctgctgagg ctgaggcagg agaatagctt 103860gaaccgggga
ggtggaggtt gcagtgagca gagatcatgc cactgcactc cagcctgggc 103920gacagattga
gacataaaat agactgagac cctgcctcaa aaaaaaaaaa aaagagtcct 103980caaactgttc
atatacagga ctgggcttgc tagggaaaga agggggaatt cttcagacta 104040ctttacaagg
acacaagaac tcctcaacct aaacaaagta tgaaggtaca tagacaagtt 104100tcttgtaagt
cggaaaacag agaaatccac agtaactata atacatccct taaaggataa 104160gcatgcattt
gtaggaagca aacaaagctt tccatagaga aaccacttcc accagatgat 104220tatatggact
tgggttttgt ttctgcatct gtcacttggc aaatgaagat accatcttca 104280ggatgttccg
tgtcatcgat ttcatttaag gcataggctg ttttaagatt actttaaaag 104340atatggacct
ttttacaaaa taaaacactg cattcactta aaatatttgc agacgtgaac 104400tcttgtaccc
tgatcctatg ttgtcactct gggttttatg tatctaatca ttgactagga 104460atctatctca
gcagataatc caaaaatcta gtttgactag ttggtagatt atgctttccc 104520gctcacatct
gaacacacct ggttgttagt ctggtaggag gtcatcaagg gccacatcct 104580tacaggttgc
caggagacca gtagttctta atcctgactg cacttgagaa gctcctggga 104640aggtttaaga
atattgattg attgatgccg ggctccccat cagacccagt gaacgtaatg 104700tctgaggata
aggcttgggt atcatgtatt gccagactga ggaagactta agacaagaaa 104760acagtgagag
ctgttaacat tttcccacag gttagccgtg tgtggtgttg atggaaactt 104820cacatgcgag
agcacactgt ccaatatggc ggccactgga catgtggcca tttcaatcta 104880aattacctaa
acttcaataa aatttggaat tcagttcttc agtcatttta gccacatttc 104940catttatgca
tttatttgtt tatattttac tgttcatatt taagtgctta gaactctggt 105000aagctaacaa
ttacttgaaa ttacttgaaa tattaatctt atgaaatgtt tgaaagggcc 105060attatggaga
atattcatag caagatagtt cctgccttat tttcctgagt gttctgcatg 105120taaaataaaa
cccagaactc aagataaatg tgtgtatgcc tttgagttag tcatagaatt 105180ttttaagcaa
catgtttaat ttgatgtact gtatcttata gctatatcca tgtttgagtg 105240tgcaagtgtg
tgtgtacctg tgtttggaag tctaacaatg acattccaat atttttttct 105300caccaactcc
atactgggag ttttaaaacc agagccctaa tcttgactta attattaatt 105360tcctccgtgg
ccctgaagaa ttcccttagc ctctctgggc cttaaatttc tggataaaga 105420agttggctgc
ttcgtgctct caacattatc tgattctgtg atttctagct aaagctacat 105480ttatacaaaa
atataaagtg cttttgcaaa gtaaagagta gaatagtggt tatcagaggc 105540agtgggggat
ggagaagttg accaaataat acaaaattcc agttagacac cacctgtgta 105600ttcataaaca
tgtgtcccta ggttttattt atttattttt atttattatt tttttcattt 105660ttgagtcaga
gtcggtctgt cacccacact ggagtgcagt ggtgcgatct cagctcactg 105720caacctccac
ctcccaggtt caagctgttc ttgtgcttca gcctcccgag tagctgagct 105780gggtttacag
gagcacacca ccacaccctg ctaaattttg tatttttagt agagacgggt 105840ttcactgttg
gccaggccag tcttgaactc ctgacctcag gcgatccgcc catcttggcc 105900tcccaaagta
cggggattac aggcgtgagt cactgtgccc agccaattgt aatgattttt 105960atggttacct
tttcttgaat ggaatcattt cttaattaaa aaaataaata taccttttat 106020accatcaggg
tttaatttca ttcttttaaa aaaatatagt gcatggcaac ataaaataag 106080gtaaataatt
tatttcatgg tcattgaggc aactaacaaa atggtttaac aatattttgg 106140aaggaagtga
ctgtattctc acagttttgt ctctcaggtt cagcattaaa tgtagaataa 106200catttttcta
gaatccggag ggaatttggt cctttgattg agtcagggtg ttttgactgt 106260ctttgaaata
ttttatgcat taaatatgca gtattcataa aagctttcct accccatttg 106320ccccaaattg
atcttggttt ccatgcttct gtttgtcttt tgggattttg atttgctttt 106380tcacttctct
aggggacacc tgtgttcgtg catgcgggcc cttttgctaa cattgctcac 106440ggcaactctt
cagtgttggc tgataaaatt gccctgaaac tggttggtga agaaggattt 106500gtaggtaagt
tatttctttt aaatttagtt gttgtagaat attgaagaaa tctaaaagct 106560gatgactgga
atggagtgat atccatcctg acatgaaaca tattctttca ggcaccctct 106620ttcatagagg
gtctcaataa tggaaaagcc ctaaggtctg gtctatactc gtgaaattat 106680gcttgtgctt
gaatcataga gtctggtatc aagtacatat gggtttgagt ataaactctg 106740actcttgcaa
actgagtgac attggacaca ttttaattac tactgagctt cggttttctt 106800gtgtgtaaaa
tagtgataac aatactgtat caaacctttt agggttgtta tgaaaactga 106860atgagatatt
ttagtcattc agcaaatgtt ttttgagaac ctaccactta gaaggcatgt 106920tagctaggga
tccagtggta agcaaaacag ataggcgcag tggcacatgc ctcgctactc 106980aggaagccga
ggcaggagga ttgctcgagc ccaggagttt gtgtccagcc tgggcagcat 107040agaagaccct
gcctctgtga aaaaattaaa attaaaacaa ccagccttag cccctgctcc 107100tgtggagcgt
actgtctaat ggtaattgaa taagagaaag tgctcattga ctgtcagcca 107160ttactaccct
tactatgtag tactgataac ttgaagtcgt gggtttattt ccatgtacag 107220tttctccttg
ttttttgtga gctaatatag gagagattcc caaccagtag acttagaaat 107280gttaattggt
agtactgcac accttttgct gctgtcaagt tctttgcaga acctgttcct 107340ctttggcttt
tcttgcatat tccagctaga aagaaggggc aactctcata gagtttacta 107400agtcagtgcc
agccccagtc tcacccacaa tactaaggca gcaaggaaac aaaagacaaa 107460accaaaacac
aggcattcat ggaggagtcc cttaatctga catatgaggg actgcattgg 107520cagatttccc
atgctaacag gtaattccca ttaaccacac gtatacctat aatcggagac 107580agcaatctct
tctttcactc ttatcatcag ctttggatca aactcaaaag agttatttta 107640ttagaagaga
gtgtgctgga caaaagaact tggcactgca ttttatgctc agacatcaca 107700aagttcattc
agtgttattg agactctttt aagaattcct tgtggactgg cagtcaggat 107760cagatgaggc
ttggtggagc tccactcctc catgatctgt tttctgcagc tctcatgtac 107820tttgctcctt
cggtagcaaa tccactgcaa gagtggctag acatactgtc taagtggatt 107880tgctcaattg
atcgctgcct aattgtagag ctaatggacg aggaggctgg gtcagggcgg 107940gcactggaaa
gttgggtgct taagtgaagt ggtggttctg atgcatcagc tctgcaaata 108000aagtagcttc
tagaaggaag cactctattt tatgctgaca gagatctctg cccataccag 108060ttcttacttt
gtcaaacaca tatttataaa atacatgtta tcatggcaga atactttcaa 108120agatataaac
tgatgatgta tgcaaagcaa acgagccctc tgtgtgacct tgaacaagcc 108180attagcctct
ctgggtgtgt cccctgtaga cattctcatc tgaaaaatga gaagttggac 108240cagatgagga
gttggatgtg gatgtgtagc cgtggactca gagttagaca ctgtggaatt 108300ggagccctga
ctggtagtgt gttttcattt ctctagggct caatttttta tttaataaaa 108360agaatacttc
tgtattttaa tgaataggat caagtgagaa actgctttgt aaaatatgaa 108420ctatgttcag
gtcagcaatt atactaattc tttctatgat ctcatctagt ctaagatgcc 108480gtaagtgggg
gggtttttca tgaggatgaa atctgtccgt tgccagatgg ttcaaagggc 108540aagccctaag
tcagaggttt tacctcaact gctttcaagt cacagataac cttgggcaca 108600tgaacccttc
tcttccttgg tttccctacc tgtaaaatag ggctttggat agggcttccc 108660tctcgaaagg
acttggtagg tttgaaatga ggcatcaatg agatcatggg gttgagtcct 108720ggcacctgac
acatggtaag actcaatcct tcttggccac tcctggctgt cattgttacc 108780cccactattg
atacactact gatgaaggtt acattttctt ttttctttgg atctttttat 108840tttttggatg
attttattta tagttgttat aaaatacacg taacatgaaa gttaccatct 108900tcaccatttt
tgagtgtacg ttcattagtg aagtacacct tcattgttgt gcaaccaagc 108960tgcagaaccg
aagtcccatg cccattaaat attcctcatt ctccagcccc tggcagtgcc 109020cttctacttt
ctgtctcaat gaagttgact cctggatacc ccatgtgagt ggaatcatgt 109080agtatttgtc
cttctgtgac tgacttattt cacttagcat aatgtcctca aggttcatcc 109140atgttgcagc
aggcgtcaga attctcttcc tttttttttt tttttttttt attttttgtg 109200agatggagtt
tcgctcttgt tgcccaggct ggagcgcaat ggcacaatct cagctcactg 109260caacctccgc
ctcccaggtt caagcgatcc tcctacctca gcctcccgag tagctgggat 109320tacaaacatg
caccaccacg cccagctaat ttttgtattt tttagtagag atggggtttc 109380tccatgttgg
tcagtctggt ctcaaactcc tgacctcagg tgatccaccc atatcagcct 109440cccaaagtgc
tgggattaca ggtgtgagcc accacacccg gctagaattc tcttcctttt 109500taaggcggaa
taatattcca tcgtatggat agaccatact ttgtttatcc actcatccta 109560cattggacat
ttgacttgtt tccacctttg gctcttatga ataatgctgc tatgaacata 109620gatgtacaaa
tatattttaa tgtctaaaaa tcgtgctcta tgatgtctaa atatgtgatg 109680gcaatgactt
atattcacaa atacgtacca cggaactcta gccctgtaat gttgaactct 109740cccgaaggtg
gtgtgtccat gctgtggctc tgttggaatg ctggtggcaa agccagtgta 109800ttgcacagtg
ggaacggggt gtggtcagga gacagtgttc cttgcccatt gcttggtcaa 109860gcctgcactg
tggggctcgt tcattccttg cccttggatg aagtttataa tagttccctc 109920tcattctcag
ttaatatcta gtccctctaa cacatatatt ttttaattcc tctggtcact 109980ttcaatcata
agaatagccc ctaaaattaa aaatgcatat ccagttgtat atcaaaacca 110040aaaactccaa
atgtcactca gacttgagct cctctcctcc cctgtctggg cctcctctgg 110100ctgacagtgg
gaaggggaag gtggctgctt gcttctctgc cacccttgtc ttgcacttgg 110160gctcgccctg
gcccctccta ggttgaagct ttggcatgaa atggtgccca tgtagcctct 110220accagcagac
gtcctcacac ttttggcaga agccccctga gcagtacctc atttagactt 110280tcctacctct
gattccacac ctggtctcgg gtaccacaga cctctgacct ctggctgggg 110340acacgccctg
agccttgcca actggtcttc tggtcctcac ttgtggctta agggggaagc 110400ggggacgcca
aggcgggtgg atcacctgag gtcaggtgtt caaaaccagc ctggccaaca 110460tggtgaaacc
ccgtctctac taaaaataca aaaaaattta gcggggcttg gtggcgcatg 110520atgtaatatc
agctactcag gaggctgagg caggagaatc gcttgaaccc caggaggcag 110580aggttgcagt
gagctgagat cacaccattg cactccagcc taggcaacag agacagactc 110640tgtctcaaaa
aataaaaata agggggaagc agcttcgcat gctcttcact gggagctggg 110700aggggagctg
gatgtggcac actgataact ttctttgaag aaattctttc ccaggttcta 110760acccgtcaac
ccttttgaag cctcaatggg ctaaggcata caaaatctat ttcaaactgc 110820tttgcaaacc
ctcccaggtc agccagcacc atcctttggc atgtgagcaa attgacacca 110880atttgaggcc
cgaaacaaaa gtgagacaaa aagacgttca tttcaacatc cttctacaga 110940agttaaatag
agtaggcaca tctgaggtgt atctgaagtc tgagcaggaa ttcatgttct 111000tcaaaaagtt
gtcgtttata gtagactatc tttatgacct cataataggg aaggatctct 111060taaaatatga
aaagctcaaa ccatagagaa aaatgtcgat aatttggcct gcattaagaa 111120gtaaactgta
tgtgataaaa gatacaagga taaaaaacaa gtgacagatt ggaaagaaac 111180gtttgcaaag
atagtagaca atgattagca ttgagaatat gtaaaacact aaagactaaa 111240gactgataag
agaaagacaa ataatccaat acaaaaatag agaatttgaa aaggtgattc 111300gcagcaaatg
agcaatttgg tggaatttct aggtagaact gaaaatgaac aaatgattta 111360gcaatttcac
ttctatgtat actctggaaa agaagtagtc aaaaatatat ttaagtcctt 111420tgaaataatt
gacttcagta cccctttaat gagcatttat taagcttctg ctatatgtca 111480agtactatct
gctaggagct gggaatcaaa tatgaataag acaggtttct tgccctccag 111540taattgatga
tgtgctgtga gagacaagaa cttcaatggt aagtacaatg cacttgatta 111600agtactatat
tagttaaggt tcttaactaa gaatttcaag ggatgatggg catctcttag 111660atagaactgt
ctccatatta ctggttttgc tgatttagtc taatttttgt tctaacaaaa 111720agcatacaca
aggaaggaat tattttgtct attctgaata attagtctaa gataatagaa 111780tcagtagtcc
atttagactt aaaacagagt gacatttatt ttaaaaaagc atgaatttgc 111840aaaacacttt
atgagaattt ttcaaaaact gctgtgatca ctgttgtggc tgtttcttcc 111900gttagtcagc
agtcctcatg ggagttcacc taacttccat ttacatgctg taaaggcaaa 111960cagatatttg
gcatcgaatt ctaatcatgt attacattta aaaaataaag caaaaggaaa 112020gaaagaaagg
aaaaagcagt attgagttcc tggattttaa aaacattatt atctcatctt 112080taaactattt
gtaggaattt caaacttacc cattaactta ggatttttta atttgttttc 112140tgctctctca
gttggttttg attccagttt tcagataaag atcttgtctt tgcagaggag 112200tgcagaatgc
aattgtgtat ccttataggc aggtgtcatt aactgagggt caccagccac 112260tggtggactc
ctggatagac ttcgatctac aactctgtcc cataattaaa tataggttcg 112320tgcatatgga
gacatatata tatacacctt tttttttttt ttttgagatg gagtcccact 112380ctgtcttcca
ggctggagtg cagtggcgtg acctcagctc acggtgacct ccacctccca 112440ggttcaagcg
attctcctac ctcagcctct tgagtagctt ggattacagg cgcatgccac 112500cacactcggc
taatttttgt attttcagta gagacctggt ttcaccatgt tagccaggct 112560ggtctcgaac
tgacctcagg tgttccaccc gcctcggcct cccaaagtgc taggattaca 112620ggtgtgagct
gccgcacctg gccacctttt ttttttaacc tggagaaaat tacacaattc 112680ttggattctc
acaggggacc atgacataag agagagaaag aactgttggt gcatatctca 112740aagcctagct
ctacttttga cggtagcttt cgatgtccca tacacatttg gcatttatta 112800tccttatcca
gaggtgcttg tcaagaagct acctaattga ttgcaactct gctgcttgct 112860ctctgtagag
aatcacacag acctggctgt gttccccagg tgcttcctgg gtacatgagc 112920ataaacactc
atgcagcaga ggagaaatat gagacaagtg cccaattaaa acacagggtg 112980atctttgcct
cagacatttc aggattagga tttagaaatc agaatgaatc gacttagcag 113040ttccaagaca
aattgaatgg gattgggaaa caaagccttg ttaatatttt cattaaaaac 113100gcccccgttt
ctgagccatt cctgagcatt agaccggtgc tgagactatc atataccttc 113160tcactcaatc
cttaaaaccg tctgtgggat cagtgttact atcctcatct tacagatgat 113220gtaacagaag
cttagcaaga tgatgataca tgtccacgat caatcagaga gtaagtgaca 113280gaggacgtga
cctgagggct cctcctcgcc aaagcttgtg ggtgatcagg cggctattgt 113340gctcagtgtt
tttctcaatg ctctttacct ggttgccacc ctctcccatt aaaaaataat 113400tttagctggg
cgcagtggct cactcctgta atcctagcac tttgggaggc cgaggtgggt 113460ggatcacttg
aggtcaggag ttcgagacta gcctgaccaa catggtgaaa ccctgtctct 113520gctgaaaata
caaaattagc cgagcgtggt tgtgggccct gtaatcccag ctacttggga 113580gggtgaggca
ggagaatcgc ttgaacctgg gaggcagagg ctgcagtgag ctgagattgt 113640gccatcgcac
tccagcctgg gcaacaagag caaaactctg tctcaaaaaa ataagaagaa 113700ttttgggtga
gctgtctaca gcagatggtc acagttacag gaagtaccaa tataactata 113760ttgtaccaat
atatcgtaat gtgcgccaac agtggtggtc ctttttggtc ctttttttct 113820tttctgtgtt
ctactttaac atttcttcct cccaagtagg ttactgattg aagatgacct 113880tatcgagtcc
cactgtttcc ttgctctacc tctcaacctg tatttttggg ggacagtttt 113940ttaatttttt
tattttcctt ttttgagacg gagtctcact ctgtctccta ggcttgagtt 114000cagtggcacg
atcttggctc actgcaacct ccgcctcccg ggttcaagcg attctactgc 114060cttagcctcc
caagtagctg ggattacagg cacgcaccac catgcccggc tagttttgta 114120tttttagtag
agatggagtt tctccatgtt agccaggcta gtctcaaatt cctgacctca 114180ggcgatccac
tcacccctgc ctcccaaagt gtagagatta tagctgtgca ccactgcacc 114240cagcctaggg
gacagtttta taatcagtct tcttggcctt cagaggactc tgctggtggc 114300tatacccagc
tagtggcttg ttctctcttt ccacttttat tgccctgaaa ttctaaatat 114360tccatcattt
tggtagaaga aaaccagacc aaactaaaca ttctcatgat ttcatccacc 114420tttaaaaaaa
aaaaaaaatc tgacgtcaag tgatcctgca gctctaagct gtaattctcc 114480cagaccttgc
cctgtttaca ttatttaaaa acactgatct cagcctgggc tccacttttg 114540ttattatgat
gcttccaggt tttttgtctt tagtaaattt attcataatt aaatcaaggt 114600tgagaagtgc
tggcttatta gagactctgg ctgaaggtca tgtgcgaact caggaaactg 114660ctaatcacag
gtcgtgtttt tcagcccctg tcttcaggaa ggcttaactc taagggaggg 114720ttgttttgtg
tcatctccag agctctcatt tctcctgtgt ggcttggtgc cgaagctcat 114780tcgtcccctc
gctgtctgtt cggcccttgt cctacctccc ctttctcttt ccacgctttt 114840gtgtaaagta
gccctctttc aagctgctcc tctgtccttt gaaaacaaat gaaaaagcaa 114900aatgttcttc
atatacacta aacaggaaag tatgcaaagt aatgttacaa acacccatgt 114960tcccaccact
ctgtttcatc aaaatcttaa catgtcgctg tgtgtccttt agtttttttg 115020tttttttaaa
gagatacgtt tagctgggca tggtggcgca tgcctatagt cctagctact 115080tgggagtctg
aggcaggagg atcaattatg cccagcagtt taattctcct gggcaacaga 115140gtgagaccct
gtctctaaaa aaaaaaaaaa aagaaagaaa gaaaagaaaa gagagaagag 115200aagaaatcag
atattagaga cacagttgaa gttcccgtgg accctcctga tcccgttctt 115260ttttctgctt
ctctcctgat ttgagtgatt atcgtttccg ggtgtgtgtt tctactttga 115320ccacacacat
atatgtacac tagagatctg tagcggaggt tctcagactt tctctgctcg 115380tgaccattaa
tggcccagca atttttttca tggtaccgcc cgggcaagag aaatacctaa 115440catgtccatt
tattaaatag gtccaaacaa cttaatgagt atttatgtcc ttacaactaa 115500gtggccacct
gaaaaaaaaa tacatatgaa ttcgaaggaa aaaataacat catttcattt 115560ttaaattacc
accatatgtt atgaatgaga tgtgtgtcct tgtcaggcac cagacaactt 115620ctgacccttt
gtaaccagtt gacacccgca cctcatttcc tgttccacgt tgattttcac 115680acgatttttg
cattttgctg cggtacctgc caaaaacttt gctttgcaaa tatatgatgt 115740catggaaagg
gatgtagagc actattgttg aagcagtgaa aaacctcaca tgagtagtat 115800ttgcagtgtc
caacggatgt cctgtattac cgcgtttccg tcaaaaattt aaaatacccc 115860acggtgcctc
tgagtttgct gcaatgcctc acggcacctc attgcacatt agaatcacag 115920atgtgcggct
tttcttggta ttttttatgt gttttcaaat tttatatatt aatagatatt 115980aactgactgc
attgttctct aatgaggtca ctgcactttt tctgtaaatg gcaagatcat 116040aaatagtttg
ggtcacagag tctctgtccc aagtactcaa tttagaaagc agccataggt 116100aaagcatgaa
ttcataaact agtccatgtt taggaatgag tctggctgtg ttccaacaga 116160tgttattatt
ggttcctaac atttgaattt catataattt tcacgtgtaa taatactctt 116220ctgatattgt
tggcaacatt ttaaaaggtg gccaggcaca gtggctcatg cctgtaattc 116280cactgctttt
ggaggccaag gtgggaggat tgcttgagcc caggggttca agaccagact 116340gggcaacata
gcaagatccc atctctataa aaaaaatttt taattagcca ggtttggtgg 116400cacatgcctg
tagttccagc tactcgggag gctgaggtgg gaagatcact tgagtccaaa 116460agtttaaggc
tgcagcaagc tatgatcaca cctagctact tggtaagcta ctatggacaa 116520catggtgaaa
ccctgtcttg aaagaaagag agagagagag ggagaggggg agagagagag 116580ataatacttt
ttaaaagaga aagaaagaaa agaaagaaag agaacacttt ttaaaaggat 116640aaataatttt
taaaaactta ataatttaac atgtaaaaaa aaaaagatgt agaccatata 116700caaaacccag
aggcacctgg atttgtcctg ctagccatac tggccaaccc ctgctctata 116760acttgatatt
tttgtgcaac ataacagttt tgcatttctc tatgtcactg tgatccaaaa 116820agcctcattt
tgcctgcagt ttagccttct gtgcgtgcat gtagcacgtg catttgtgca 116880ttatttgtca
aggggctttc agattgcttc cagtgttctg ctgttacaaa taatgctacg 116940ggggacattc
ttgtccatgt ctctgagatt tagtctaaaa ggtcttctag ggactatata 117000caggagtgag
atttgggggg ttaaaagtac gcttagcaac aacttaatag ataaagccaa 117060attgttctgc
aacgtaatac caatttatat cactaccaat tgtgtgtgtg agatttctga 117120cctctccacg
tttgtcctga cacttctttt tttttttttt cttgagacag agtctctctc 117180tgttgcccag
gctggcgtgc agtggcatga tctcagctca ttgcaacctc cacctcccag 117240gttcaagcaa
ttctcctgcc tcagcctccc aagtagctgg gcttacaggt gcacaccatc 117300acacgtggct
aagttttgta tttttagtag agatggggtt tcgccaagtt gaccaggttg 117360gtttcaaact
cctgacctca ggtgatccac ccaccttggc ctcccaaagt actgggatta 117420caggcatgag
ccactatgac cggctgaacg cttctattgt tagatttttt tagctcttgc 117480taatctgatg
gtagttcatt atagttgtga gatttagcat ctttatggct gtttctgggc 117540atttgtgaat
tgtctatttt gtattgtgct atttaccttt ttcttactaa ttaatggaaa 117600tcctttatag
attccgaata cgaatacttt gccattatga gttacaaata tcttctcccc 117660tgtatggctt
atcttttaaa tatgcttgtt gtaccttttc ttgtccaccc ttttctttat 117720gatttctgct
tttggaggtt ataaagataa tcatttgtat tgccttataa ctgttgtctc 117780ttgcattgag
gcctttaatc atcttagaat ttgtttgtgt aatggtgtct tttttcctta 117840actcttacag
tttgtgagac ttttctgccc tggtgatatg tgcatcagtg gttgttagcc 117900cagccctatg
aaaagggcca gctggcctcc atagatctaa ggcagccgag agtgctgtcc 117960cctttgtccc
aggctctgca gcatccaagg cgagtgcaga tggggccagc tgttcttcca 118020aaaccatcat
ctcttcttta cctgctcctt ctgagcttca cctgacatgg ctgtgcagtc 118080cccaggaaaa
agaaagtctc tcttgtcttc ctacttccta agtcaatcct aggaactgtt 118140acgttcaatt
ccaagcagtt gtcataaaag aagtttcctt ggaactttga aaacaaaaga 118200aagggtttaa
ttttcttctt ttcgttttgt tttgttttgt ttttaatttt ttgagacaga 118260gtctcgttct
atcccccagg ctaacagctc actgcaacct ctacctcctg ggctcgagcg 118320atcctcccac
ctctgcctcc tgagtagctg agactgcagg cacaatccac tacacccagc 118380caattgttca
atttttttct ggagatgagg tctcactacg ttgcccaggc tggtcttaaa 118440ctcctgagct
caagtgatcg cctgcctcag cctcccaaaa tgctgggatt acaggcatga 118500accactgcgc
ccatccaaaa ggatttaatt ttatggtcaa attctacatt gtctattgac 118560tttaatattt
ttataaaatt ctttaaataa aatgtcttta cctggaatac cttcaatgta 118620cttactggat
tagaggcagg taggtcctgt ttgtatggct agtatgttca ttaaatatga 118680aagtaatgga
aagatgttta tgacataagg agagaggaac cttcgcctgt caggataata 118740gaaattgaaa
gatttttatg acctggctca gcagaaaaag gagtatatga tgttagcaat 118800tttttagaat
tgatttttca gaagaaacat gcaaaacaaa tgagaattaa gtagctagaa 118860agcctttttc
ccccctttat tcagataaag taactgaaga cttttaatga gaagcctggt 118920ttctgtcttt
ctacacggtg cactgggaat agtaagctgt tagaagtcag tgcttagaag 118980tacctaccat
cctaaacaca ggaaccaaaa acaagggaaa aaaaaaaaaa aaggaagata 119040aataaagtaa
aagtaaaaaa gtacctaccg tcccaaattt tgtatatgat gatgttcacg 119100tttttgaact
tacacatatt cttctgtata agtaaaatat ttcataaaat taaaaatgtt 119160ttataaaatt
aggaacagaa tttcttaaag atcaacatgg tactaataca tcttacaatc 119220tgtctttcaa
agtttcaata gacttaactt agagtcattt tttaaattgc attttgatgc 119280caatataatt
catgtgtcat tttccgctat aatacaatga aaatccagat aatggagatg 119340gattttcttc
tgagtagcca cgcatattag gtttcaggct ccaattgcct ggttttgaat 119400tgctctttgc
catgtgctgt ctgtgacctt gggtgggtca cttcacctcc cccaaatcta 119460ggtttctcca
atacctacct ccaggcttgt tctaggcatg aagtgcagtg atgatggaga 119520aggcatggac
tgtggcacaa atagaagagt caggggttgg cagctcctat tcccactgta 119580cgatgccaag
gatctttacc tataggataa gccttcagaa acataatttg gtctttcaag 119640tgctgaatac
ctctgtgagt atactttttt ccatcttttt tcttttggta aatgaaactc 119700actgagaaat
ttgacttctg ataatgggaa acagtgctgc ttatgtgttc tccttgtaca 119760aaggtgtctg
ggaccattat acttagaggt gatctgagaa atcggagggg ctctgattgt 119820tttgaaaact
ctcctttgag gtcgggcaca gtggctcatg cctgtaatcc cagcactttg 119880ggaggccgag
gcgggtggat cacttgagac cagcttggcc aacatgatga aacctgtctc 119940tactaaaaat
acaaaaatta gccgggcttg gtagcatgca tttgtaatcc cagctacttg 120000gggggctgag
gaaagagaat tgcttgaacc tggaaggtgg aggttgcggt gagccaagat 120060cacaccattg
cactccagcc tgggtgacag agtgagactc tgtctcaaaa aaaaaaaaaa 120120aaaaaaagaa
agaaagaaaa agaaaactct cctttgaatt aaggtaatcc ttcttgaaag 120180gcttgtttaa
aatgtttcct tgttacctgg gagagagcag gaagcactgg gaaatagcac 120240gtcactggtg
cctcgtgcca tttccgtgcc tttccctgtc ttgcctcagc cttgcactgt 120300ggaatgcaaa
agagtagaag aaactccaac gtgaaaattg tcgacaaaca atgcagagaa 120360tcttcggtgg
cattaaacat taaaattaaa gacacatggt cttaaattca gctgatatac 120420ttgacacctc
agatatactc ttggtgccaa aagggatgtt aagcaaaatt tgagggattt 120480gactccttat
ttgggaaaga ccatgactac caaggggaga gaaaccagag gcaaacctta 120540ttgaaaaatg
gagtagctaa gtcattttgt atatgttact tttacccagc caagtacagt 120600ctgtagtttt
acacagtgaa tttgatcttt actctaaatt cttttccttt aatgaaagtg 120660ctagtcagta
atgcagaact tgggcaagtt ttataaatat gtttgcttta caaccagtct 120720tattaagtca
gggtatcatc taagggccca atcaaggact tttcatttaa aaaagaaaaa 120780ctctaaattg
cattgtgata taacccttaa attccaactg tcatggggaa tcccaagacc 120840tacatctgta
tgtatgtgct gtgttctaaa cactgctccc caaccttttt ggcaccaggg 120900accggtgtct
tggaagacaa catttccgtg gttttgggat taaacggttc cacctcagat 120960cataaggcgt
gggtcctaga tcccttacat gcgcagttca cagtagcatt cacactccta 121020tgagaatcta
atgccactgc tgatctgaca ggaggtggag ctcaggcagt aatgctcact 121080tgcccgctgc
tcacctcctg ctgtgcgacc cagttcctta caggggctgc agaccagggg 121140gttaggggtc
tctgtttttg tttttttgga gatcaagtct cactctgtcg cccaggctga 121200agtgcagtgg
cacaatatcg gctcaccaca ctcttcacct cccaggttca agtgattctc 121260ctgtttcagc
ctcccaagta gctgggacta caggcatcta ccatgcccag ttaatttttg 121320tcattttagt
agagactgag tttcaccatg ttggccaggc tggtctcgaa ctcctgacct 121380caagtgatct
gcccccctca gcctctcaaa gtgctgggat tacagacgtg agccaccgcg 121440cctggctggg
gacctctgtt ctaatatgct taccatgcca aatgcattat gaagaaaaac 121500agaaacaagg
cttttaatgt taaagtctta ttttctcgct actgaaatag gcaaaatgtt 121560tcaattttgc
aaaaataata tagtattgtg tataggatct tccctaacct gctttttaca 121620tttaattagg
cttctacgag tcattcgggt agttgcatct agcaggctat ttatccattc 121680tttatagacc
ctttagaggt ttgagacccc tttatagagg tttgagagcc ctttcttgcc 121740ccagttctct
tgtctgtaaa gctgtaggaa ataatatgac ctaacataca aggcagtgtg 121800aggacagcag
aagtgacttc tgtaatgcat atggtgctgt tccaggcatg gataaatctt 121860tgctaacagt
cgtaagagtt tacctcctgt gaggacctgg ctacgtgctg agcaccctgc 121920ttggtgcttt
gtacattgtg actcattcag ttgctcaaac aacaccatga gtagtttacc 121980agaattaccc
ccatcttgca ggtgagaaaa ctgagcatcc gagtgagtta gtagtttgct 122040caaaggcagg
cagctagcgc aagggggagc aggaatttga acctgagtgt cggccccaca 122100gcctttgctt
ttaccaccgt gctttgccac ctctgaatgt tcactttcat tccatagcag 122160catcatacag
ccctgtgcat aacaggaagt gcacgtttgt taaatgagaa actagtaaca 122220cttctggtag
ctgaatgact taactactgg taaacctgtc tgcatggcca ctaggctttt 122280tatggtgaac
acaggcatat tttccatgtg ctgtgttgaa gcagcaatta aataaatcag 122340gtaaaagagt
aattaaattt aaaaggtaaa acgttttgga aggagttttg aggtttaggg 122400ctgtctaacc
tgaccttgac tgtttctcta tgtgctgtaa gatgcagatt gattttcaat 122460gagatattca
gcccatccct gcttatgatt tatatatatt ctaagtaata tgcttagaat 122520gtagttctgc
ttaccatcat aaaatgtgga gttaaaataa gatccttgtc aaaacattgt 122580catgcagaga
ttagtgctat gaaagttaac attaaatgat tttaaaatgt gatccagagg 122640ttaaactgaa
gttttgaaat tcataaaatg cataagcttt ttgagaattt ttaaaaatat 122700ccgttacata
ataactttaa aattacgtta tatgattgct atcttacccc atcttagcca 122760agagcattca
gcctgaggat actcaaaaag agtcaaatta ccccttcctt gtataagcta 122820tagtttagga
tattgcaatt atatgaggtc gttttaaagt ttttagcaac gtttggtata 122880attacaatat
tgtgctgttc ttacacaaag aaaacataac agaattttga gctaggaata 122940ttgggaaacc
agccattgct aaaggaagat gcatttagtg tatctttcta tgtacctctt 123000aacatctttt
gtggggtgta gaaaaacaga ttattagggc tttctttttt tgtggggggg 123060gcgggagcgg
ggatggagtt tcactcttgt tgcccaggct ggagtgcaat gatgtaatct 123120cggctcacca
caacctcctc ctcccaggtt taagtgattc ttctgcctca gcctcccaag 123180tagctgggat
tacaggcatg tgccaccaca cccggctaat tttgtatttt tagtagaggt 123240ggggtttctc
catgttggtc aggctggtct cgaactcctg acctcaggtg atctgcccac 123300cttggcctcc
caaagcgctg ggattacagg catgtgccac catacccagc taattttgta 123360tttttagtag
agatggggtt tctccatgtt ggtcaggctg gtgtcgaact cctgacctca 123420ggtgatctgc
ccacctcagc ttcccaaagt gctgggatta caggcgtgag ccaccacgcc 123480tggcctatta
gggatctcta aaagccgcaa gcctcttgct agccatagct gtgaatgttg 123540tcttaaccat
taatgcttca gtataaataa tatcaattta aaagctgcct aagatttaaa 123600agtactatgc
aaattacaaa ctgcttgggt atctagatca aaataaagca gagcaaagaa 123660gtaactttag
gaatgaaagg gtatctaagt ggaatgataa aggaagactt catgcaacag 123720aagaatttaa
gtaaaactga aaaactgaaa ctaatgtgat catggcatta aatagtgtaa 123780ccctgaagac
gcatccagta tataaggaac accacagcag tgcctatact caccgttatt 123840attcatcatt
attctgaagc atgaataaga acaatttcaa aaattataaa tattggaaag 123900gaagagacac
attttattat ttatgtacag aataattata taactagaaa attcaagaga 123960ggcaactgac
aacttatttt aagtaataaa agttttagtt tgctagtgga tgcaatgtaa 124020atatgtaaat
gtcaatagtt ttcttttttt gttagccata aatacatact gtaatagaga 124080aaatgtccta
cttatacatg tcccccaaag gatagaatac ttaggaaata agtttagcaa 124140gaaatgagca
catagaataa aactataaaa ctttattgaa agacaaacaa caagacctga 124200ataaatggta
agacacatga catactctta gatgggaata actggtatta atcatccttc 124260ctctgtcaaa
ctaacaccta aattactata aaattactat actatatcat gccacactat 124320actatactgt
atttactgta aaaactgcag aggaaaatga acttcgtggt taaattaatc 124380acagcatttg
tctagattct gcaagaatga attcatgagg atggatttaa aagctttttc 124440aaggccgggc
gcagtggctc acgcctgtaa tctcagcact ttgggaggcc aaggtgggca 124500gatcacttga
ggtcaggagt tcgagaccag cttggccaac atggtgaaac cctgtctcta 124560ctaaaaatac
aaaaattagc caggtgaggt ggtgcgtgcc tgtagtccca gctacttggg 124620aggctgaggc
aggagaatcg cttgaaccag ggaggcagag gttgcagtga gcctagattg 124680taccactgca
ctccagcccg gcgacggagt gagattctgt ctcaaaaaaa aaaaaggctt 124740tttcaagaac
catgatggat gattaaacta attgtaagta tcagaacaag ataaccctat 124800agcaaaacag
ttaatagcag aggaataccc tggagtgtcc tgagatagat ttcttccagt 124860agtaatttga
taagatgaca ttttattttt atttttattt ttttcttttc ttttattttt 124920ttgagacaga
gtttcactct tcttgcccag gctggagtgc aatggcgtga tcttagctca 124980ctgcaacctc
cacctcccag gttcaagtga ttctcctgcc tcagcctccc gagtagccgg 125040gattacaggc
atgcgccacc atgcccagct atttttttgt attttttagt agagacaggg 125100cttcaccagg
ttggccagcc tggtcttgaa ctcccaacct caggtgatcc atctgccttg 125160gccttccaac
gtgctgggat tacaggcatg agccactgca cctggccaga tgacatttta 125220aattagtggt
gaatggatgg gatactgatt aaatgatgtt ggaacatttg actaataatt 125280ttaaaacata
aaccatagag tttgttgttg ttgttgtttg aaatgtagtc tcgctctgtc 125340acccaggctg
gagcgcagtg gtgcgatctc agctcactgc aacctccgcc tcctgcgttc 125400aagtgatttt
cctgcctcag cctccctgag tagctgggat tacaggcaca tgccaccatg 125460cctggctaat
ttttgtattt ttagtagaga cggagtttcg ccatgttgtc caggctgctc 125520ttgaactcgt
gacctcaggt gatccatctg cctctgcttc ccaaagtgct gggattacag 125580gcatgagctg
ccatgccctg ccaaaatata aaccatagag ctttgtatca caatatattc 125640aaaataaaga
tagagtgact ttttactttt tagtgtgcaa attaaacata aagcttccag 125700gaaaaagcaa
aaaaaaaaaa ttattgttgg tatcctatca aggctagtaa aaagaataga 125760taaatgtata
catggaacat gggaaaacat tcacaaaata taaagttaaa aaaatcaggt 125820tgtaaaataa
tatatctcca atttgtatga gatttatatc tatatatgga atgaagagat 125880atagaagtat
atcaaatgat tatttttgtt tgcctggagg taatggaatc atggtgattt 125940tcacttcaca
ctttgtgatt tggcattctc aaatttttac aattactgtg tattactttt 126000gtagtagttt
tttttataaa aggaaaaaat gacttgaaaa atcaagtaat ttgacatgta 126060gacacaggga
aattacattc gaggcagaga gcaaacgaca ggcatgtgac tgtagctgac 126120aagtgtcagc
agattgtggg catagtttag agtctaagac atctggggga gatgagtttc 126180atcattagat
gaagtcatat tctggagggg aataaatgct aatctgaaat actatgcttt 126240caattgccag
tgaggagaaa ttgacagttt tgaaatagtg gtttcaggaa ggctctcctg 126300agaagatggc
actgaggcga gccttgaaga aggtgagagg ggagaaagat tggcctgtgt 126360tcaaggaatc
aaaatagttc gctagttcag agacagggta aagcaaatag actggagtct 126420cattcaatgt
ctttaaaagt catggagaag agttagtata gtttctaatc catgaggact 126480tcctgaatga
ttttgccagg actgtacttt aagatactgt ttctttccaa ccccccatca 126540gataagtcat
accacttttc tgaagtcgaa taagtgaagg gaaagagtgg acaaattgtt 126600atctaacagg
gctccctccc caggaaagga ctccttcgtc agtaagagat gtgcacgtgg 126660tataaccaac
aatgctactt ctgggtatgt ccacactgct ggagctcaga aatgcacaat 126720gaaaaaacaa
aacgggttgt ggaagtgtgt atacaatctg ttacattttt gaaaaacaca 126780taacacaatg
ctatattctg tttatgcatg tgtgcttatg tagaaaaagt ctgaaaacct 126840gatctgcata
agctctgagt tcataggagt tatgaccatg ggaagaagta gaggggtggg 126900atgagcgcat
gggatactga actcagctat gctgctttac ttctctgaaa gaggaacaat 126960cttaagtaat
catggtaaaa tgttaatgtt tgtccatttc agggtgggga aggaaattta 127020ggtctgtaat
ttaggagaga ggttgaggcc agagccaaga ttggcagttg tgatgagtat 127080gtggttacat
cttcactttt tttagaacta aatctggatg tcaatggcca tgtttagtgg 127140gccaggggcc
ttacattact ttcttgcagc actgatggct tttgtttgag gctgcacaaa 127200ttcctgcatt
tcccttgggt tgaatggtag ggatgcgggc agttggtgac tgggtgaacc 127260acctgacttg
agcagggcta cgactctctc tgcaaacgaa acccagagac atgaacagtg 127320ctgagatttc
tcagtggttt cccatgtagg ctgctttcca agggcagcaa gcatggcttc 127380atcactcacc
cagtgcttct gattcagcac tgtgatgctc ggttaagttt taatgaggtt 127440ttaaattttt
tcgtatgttt gagtgtttat gcaaacaaag atgctgaatt gtaaacacca 127500gcaatctgag
taccttcttt tgatttctct ctccatattg tttagtccct ttttctttgt 127560ttgttttatg
aaattgcata gatgttttgg ttgagagtag attgttttta cttttcaggg 127620acaggcaaca
atttcatgat ataacttcta aaaatatttt aaacaagtat gaaaatagtt 127680ttttcttcca
aaaaaccatt tcttcatttg gcacatacac aaacaaatga gcagaattta 127740gagtaattta
gagacagaat ttagataatt tgggtgggtt tttttttgga cagtcttgcc 127800ctgttgtcca
ggctggagtg cagtgatgtg atctcggcgc accacagcct cctcctccca 127860ggctcaagcg
attcttgtgc cttagcctct caagctacag gcacacacca ccacgtccag 127920ctaatttttg
tatttttatt agagacgagt ttttgccatg ttgcccaggc tggtcttgaa 127980ctcctggcct
taagtgatcc tcctgcctcg gcctccttaa gggctgggat tacaggcatg 128040agccaccaca
gccggcctag tgtgaaagca gtttttaagg acagttgaac aggaaagctc 128100ctggattcct
gcacatagat cttgtgcccc tccaaagtgc cgtaagtacc gatttctgct 128160gtcttgattt
aactcagtgt ctggagctta cgatccttcc tctctgtttc cttcagtggc 128220gtagcttggg
ggtgtcaatt aaagagttat tgcaagatct cagcctctga aagtttagct 128280tattcatcag
caaaacttta agcaagtctc acctcttcta aattagaatt gtaaccttga 128340ctttagtctc
agagttaaca ttgtgcagaa aaagaaaaca ctgcgcagtg ttgaagcagt 128400ttgcaaactg
aaatgataca aatgaaaaat gctatgatgt ttctcagtag tctctgtatt 128460tgcaaatgag
tcagttagga ttagttttac ccacaaataa caacaacagt aaaaaaaaaa 128520aaaaaagaaa
gaaagaaaga aagtgaccca gcactataat acaagtttat ttctctctca 128580catcaaggaa
gttcagggta gccagtctgg gctggtttag caactccttg atcatcatgg 128640gttctgtgct
acctggtagt ccaggatggt tgcacaagct ccagccattg catctgtatc 128700ccaatcagca
agaaaaagga agaaaggcat accccctttt caggaggctt cctggaaatc 128760acatacaaca
tacctggtta catctcatca gtgaccttag tcacacagcc atacttagct 128820gcaagacaag
cgaagacatt cagattctac ctcttcttct tgaaatagaa aggtgagttg 128880ccaatccact
gtcagcgtga cgacagttca agaggttttc ctagacacag agttgagtta 128940atcccgtcat
ctgtgtcatc gtctctattt tgacagcatc tgtcaggtgg tcaaccagag 129000tttaggcaag
aagacctatg acttctctac acagacatga ttaggattat cttctaaatc 129060aaccaaactg
accagcccac cctttcattc ttctgtgcat cacagatcat ttttgaaaaa 129120gagtttagga
cacatccaat agtaagagtg tagttttata aatcaggact cagtcttaca 129180gtggaagatt
atatggtagt tagtgtcatg gagaaaagct tcatattaga ttagttgaat 129240cgattgggat
acagaaccgt ctttagagca tggcatcgtt acattgcaga taaagcatgt 129300aggtagagaa
tgatatcttt gttggggaga gccccaaact gctcttcagg aacctttctc 129360cgagatttgc
aagccaatta ggcataatga aaccttgttt ttaatgcata tcttttgcca 129420gtataataaa
cataataatt aaatgtttat atatctttcg tattttcctg tataatacaa 129480ctaatcatat
ttttgtcagg ctgaggacat ctcaagagga gggaaaggtg gcaaattgag 129540ggtctgttct
tcttttatac cagatggcaa ttataacaaa ttacttctgg acttagggat 129600gggggagggg
tggggagatg ggaacatgag aattattcct gatttttgtt actttttgtc 129660ctgtaaaata
tgcttaccag gccgggcacg gtggctcacg cctgtatccc agcactttgg 129720gaggccgagg
caggcagatc gcctgaggtc gggagttcga gaccagcctg accaacatgg 129780agaaacactg
tctctactaa aaatacaaaa ttagccaggc gtggtggcac atgcctgtaa 129840tcccagctac
taggaagggt gaggcaggag aatcacttga acctgggagg cggaggtggc 129900ggtgagctga
gatcacgcca ttgcattcca gcctgggcaa caagagcgaa actccgtctc 129960aaaaaaaaaa
aaaaaaatgc ttaccgggac atctcaagga catcattcct ttgtcaacgg 130020tacaaagata
acctgagaca taaagtgaaa atgatttatg gttttttgtt ttcttcctta 130080cattaaattt
tccaataaca tatcgctaat aacatatttc ttttgttttg agacggagtt 130140ttgctcttgt
tgcccaggct ggagtgcagt ggtgcaatct cgactcaccg caacctccgc 130200ctcccaggtt
caagtgattt ctcctgcctc agcctcccaa gtagctggga ttacctgcca 130260ccacatcagg
ctaatttttt gtatttttag tagagacggg gtttcaccat gttggccagg 130320ctggttttga
actcctgact ttgtgatcca cctgccttgg cctcccaaag tgctgggatt 130380acaggcgtga
gccacggtgc ctggccaaca tatttctaat aatatattaa tttgataaac 130440agaagttatt
ttggaaataa gccaaaacaa caaaaaatgc tcatccctgc ttttcaaggc 130500tattatctgt
agtttgaggt tttatcaaat gtagatatga actgcgggcc tcttcaggat 130560gtcctcattg
cttctgttaa tgcttttacc atccagtacc actgaacagg cagcagggtg 130620ccttcttggg
cacaccggaa ggacccagga aaaggaagat agtgtggcct gttcttgtct 130680ccatgcccca
aaatggacca ctacaccagt ggggtaggaa atagcaggtg tgttatgggt 130740tgagtttgga
gaccttgact tgatcaacca tctgctcacc caggtcatct ttcctttttt 130800tgcagaggtg
aggtctcact atcttgccca ggctggtctc ctgggctcaa gtgatcttac 130860cgcctttacc
tcccaaagtg ctggtattac aggtgtgagc caccatgcct ggttcttgtc 130920accattctta
acatcagcag cagcagaaaa accccctgtt tagccttcaa tgttaatttc 130980ctccttgatc
tatgtgttct acacaaacca ctcccttgtt gtcccccata tattacattt 131040gtggactctt
ctcaaactct gcagctcagc cctggttcct gctgagccta cagccccagg 131100cctgagcagc
cggctgggcc aagttaccct cagtctcaag atgacttcct cacaaggcaa 131160aacatttctc
cagctgggtt ggagacctaa gctggggaac tgggctctca cattccaaga 131220tagaggtttg
cctatggacc cctgaaagta ggcgtgtaga agcaggttcc tctgatggca 131280gaggccctct
gtttatctct gaaagatcct gaatgctatg cttctttcat cccctctgcc 131340ttgaattcgc
ctgctttcta ggtgcggtgg agaaagctgg gaaaagagcc acctctgctc 131400ccaggggcca
ctgagagggt agaacactga ccgaatccca gactcactac ctcaggggct 131460gagatgggtc
caaaggggcc cttaacctct cacttgatca cctgactctt accgctgtca 131520ttgatctgac
atgttctcct gggcgctgtc ccagctctgt cccagcttct cagcactgtg 131580tcctggtcag
ccctgagggc tcatttcctt tatccatact gaggaccaag acctgtcttt 131640cctgtagggc
agtgagggcc acgggctagt gcctgagccg atgagtgcac tgctggtact 131700gtgctgagcc
ctggacaagg gtgtaggttg ctctaaaaaa caacttccca atattctaca 131760aaataacgtt
caaactgaac agagcttgac tgaagattct tcttacttca tttgttttca 131820tgctacattt
ttgctgtccg gctccaagtc tgtgacaatg ggtcacggga tcagtgcgag 131880taacagtgaa
gatgtttttc ttcatctttg ggctatgatt gtgctttgtg atcctaagta 131940taaatacatt
gaggctgggc acggtgtctc acacctgtaa tcccagcact ttgggaggcc 132000aaggcaggtg
gatgcctcga gcccaggagt tcgagaccag cctgggcaac atgtcaaaac 132060cccatctcta
caaaaaaaat acaaaaatta gccaggtgtg gtggtgtgca cctgtggtcc 132120cagctgctca
ggaggctgag gtgggaggat cacctgagcc ctggagatca aggctgaagt 132180gagttatgat
cgtaccactg cactccagcc tgtctcaaaa aacaaaacaa aaaaaataca 132240tacactgact
aataacacca gcaatattgt ttcatgggag ctgcctctgc tgtttgtacc 132300aaaactgagc
agattcttta ggccaggccc gagtaatcat taaatcgatc tgcttgtgag 132360aaaccggtag
gtctaggcat gagcacaaag tggcaggtcc gtctcttccc ttcttaaata 132420aggaatcacc
agaggtagac tctgtgcctg tgactcttga atgtttaact gaaatccaga 132480gcagcaattg
cattttatat gacaccatgt gtgtgcaaaa actgaatcaa aatcttgtga 132540aacagtgcta
ctttcactta ctatatatga tgtagtctaa taattttcta tcctatttta 132600tttccttttt
ttaaatgctg gttgaaattc tgtaaattta tataatgaca aactactggg 132660actctatctt
ggttcaaaag tccatgatca gcctgcttgt attgagttac tacattgtac 132720gatcttataa
ttgtaaagtt taagatattt tccttctgtc tgtcttaatt tattctgtaa 132780tctaagttgt
cctgccagga atgctggcat atttactttg tttttgaatt caataatcat 132840agtttctctt
gctaagttac attaatggta ccaaaaaaaa gttactgcat ggtttttgaa 132900actgttcttt
cctaggaagt tttaaagtaa aatgttttta ccttttattg tttacacagg 132960ttaagcatcc
ctaattcaaa aatccaaaat ccaaagaagt gctcaaaaat ccaaagtttt 133020ttgagtgaca
acatgatgcc accagtggaa aattccacac ctgatctcgt gtggcgggtt 133080gcagtcagca
tgcagttaaa tctttcatat atgcacaaaa ttattaaaac tattgtccag 133140ggccaggcac
agtggctcac acctgtaatc ccagcatttt gggaggccag ggcaggcaga 133200tcacttgagc
tcaggagttc gagaccagcc taggcaaggt gatgaaaccc tgtctcctct 133260aaaaatacaa
aaaaaaaaaa aaaacattag ctgggcacgg tggcacacac ctgtagtccc 133320agctatttgg
gaggctgagg tgggagaatc acttgaactg ggtggcagag gttgcagtga 133380gccaagatca
tgtcactgca ctccagcctg ggcaaacaga gggagaccct gtcttaaaaa 133440aatatatata
catatatata tatagtttag gctatgtcta taaggagtat atgaaatata 133500aatgaatttc
atgtttagac aggggtccca accccaagat atctcgttat gtatatgcaa 133560atattacaaa
atctgaaaaa atttaaaagc caaaaatcac ttcagtctgg tgccaaccat 133620ttttgcataa
ttgcattttc catgttgcat tttgcaacca ttttgcataa ttctgtactt 133680ctttcccttt
tgggtttgtt ttcataactg ctcagtaaaa agtgttctgt gtcatctttc 133740cattattttg
ggctaaatca tgttaatggc caagcctgtc agaattcatt tgttttgagt 133800taacttcctg
gtgtccagtt ctcaggagtt gcaagaatag agctcttccc tcatgcaatc 133860atccaaaatc
ccaggcctct aaccaggatt ggggaacacc actgttggcc aagagcagaa 133920ggcttaaagc
tctggtcata gtcatggcac tgtacaaact gttcttgttc cctacaagaa 133980catgggaact
atttggataa tggatttatg aatttaattc tgatggattt ttataaagga 134040gctaaaaata
agcctatata atttattaca tgtataatca tataatttta aatgtgtata 134100atttataata
ttcatcgaaa catttctttt tggaaattaa atttcacttt ggtcaaaatc 134160atcttacaaa
acaagatgac tcatgatggt tctaaataat gtaaaccggt gtctgtaact 134220aattaatgct
ctggaggaag gaaattctga gttctgacac aaaagcacgt ctaagtggta 134280ttggtatttt
caaagctttt actacaacca aaaatatgtt cacgtgtacc tgtctcctct 134340ttacactgct
tactttctct atcgcttttc ccaaatacgt ccccagtaac ttcctcatgt 134400tctgtgcacc
acctccaatc ttcccctaat gtgtatgaac ttggtcatca ccaaagaggt 134460ggaaaggaag
aagggcatgt gctaatatta atattgaggt ttgggctggg tgtggtggct 134520catgcctgta
atcccagaat tttgggaagc cgaggcaggt ggatcacgag gtcaagacat 134580tgagaccatc
ctggccacat ggtgaaaccc cgtctctact aaaaatacaa aaattagctg 134640agtgtgatgg
tgcacaactg tagtcccagc tactcgggag gctgaggcag gagaattgct 134700tgaaccaggg
aggtggaggt tgcagtgagc cgagatcgtg ccactgcact ccagcctggc 134760aacagggaga
gactctgtct caaaaaaaaa aaaaaaaaga aaaaaattga ggtttgtgcc 134820agtttctttt
tcctttaggt ctcctgatat gtgtctagtt agttctatat acttcctgga 134880aattcgactc
caagatattt aataaaacta aacagggtta tatgttgaat agacaaatcc 134940taaagagcag
aaaggattta gtggttagtg aaatttcatt gacttaagat gagttggatc 135000ctcaaaatta
atatattcta aacaaagttc tatttttaag ctgcagtaat aatgccgaat 135060ttctcttata
gttaattgaa tgatttatcc tggtgttgtc tcctggtagt cactgaaaaa 135120gtcattacaa
tcctcacttt ctccccttta attcaagttc agaattcatg tgttgctgtg 135180actttttttg
agtttatctg cagacagctg ccattgaata aacatgttta tatagggcag 135240aaaaaggcag
cctgaagatt ctactttcta tttcttttag cactgtttgg cagttaacac 135300caacagcgct
tcagaatgag gtactgtcca gcatacacct tgcttgttcc tctacaaagc 135360ttctattttt
gtttttgatt tgttgtgagg aagcaaagtg tagcctttac ttgaatgaaa 135420gctacagttt
tatttttctt tcttatttac aagaaattct tttgatgatt aagttagcca 135480gatacgcccc
gtgcgctcat atttcttaac ctattttgag ctgagctagt tgcaaggact 135540agttcttctg
tgaaacctct ggcagtccat aaattcaagt tcgtgtccag acgtaatggt 135600gaactcaata
tgttgtatat ttgatatatg tttgttgcac acatggagag aaaagttgct 135660accattgtga
agtagccatg aaatccgaca gttctaattt ctttatcttc tgagtaccca 135720tggaataaag
gacctcttag tgccatagca aggcagaagt ttaatttgca attctggtgg 135780caacatccag
gtatttcaag tagaaattaa gaaaacttta cggaaaaact atctcagcac 135840ttcagtttaa
ctatttttca aagtccaagg agggttttcc ccagtgtaaa agcagaaatg 135900agaatttcag
atctaaagcg gttcttgcct tttcttttcc ctccatgcac atgtgctgac 135960aacagaaagt
aaaatttgaa aagaaaaccc aggctagagc attggggtca ctcatttccc 136020atccagcgaa
gggagagctg gtgcctggct gggggaggtt aggaaggcaa gtcttgggat 136080ctgatcactc
ggctgggagt gggtaaaata cctggacctc aggaggccgg attgcagcgc 136140cagtgttctc
tacctgtgtg gcctgttcct gcctggggtt gggtaccaag tccaggtgga 136200agtggtccct
ggagcatcct ctctctgaag tgtttctttg tcaaacagaa tactccctgc 136260tgctgcccca
gaactcatca ctgttgacac atggaccaag aactagcaga ggaggtgatt 136320tataatctct
aaatgaattg ctctaaaaga gacaacaagc actctgactt aagtaaagga 136380agggagggtg
ggtggaaagg aaggaggatc ataaagaaca ttctagattg catactttgg 136440taatcacagc
agggccttat tgcagttaga acaaaagcct ggacacactt acctctcagt 136500tcaggaaaat
aatttagagc tacttgttaa atgctggagt attattgaag taccaagtgt 136560taagattttc
actgaaagca aaactgacca agcagctcag ctagggaaat gctatgtgat 136620cagttgattg
tttgatgagg agtatggagc agggactttg gcagaggttg ctagcagaca 136680ggccttggtt
ttagtccaag ctctatcgct taatagctgt gtgaccttgg ggaaaattat 136740aaaattttcc
agtccttagt atcataaact gtaaatggag gagaaagatg cctcacagat 136800gttgaaggtg
aagtgagaca ggacacctag tacattgcct ggctacatag aaaatgtttc 136860acaaatgtta
acttgaggcc ctctccccat gcctcggtct cttagaatgg tttttcccca 136920tcgtgtgcac
attgagcggt aaggcggaca gctgccgccg cccatgctgc cccatgttga 136980gagcatttct
gttgaccttc ccttcttccc tgtgaagctg gttcagcagt cacctcatca 137040ctgtttttct
tctccagaga gtgctgggtt gtagcgacca tactgttgac gcaacacatc 137100ccaaccacgc
tggacggaat ttttactgcg ataattcata ttccaggatc ttacagtgca 137160cagggttaca
acgattagca taatgatctt gcacaatctt tctgcctagc atgtagtgca 137220ttcgtaatat
tctcagagga ggcgtctaga aagcgccagg ctccttcctc ctgcgtgagc 137280cactgcaagc
tgtcggcgcc tcagtgctat ggagaccttg ggtggattat tccttgttgg 137340cggggggagg
gtcccagctg tcctgtgcat tataggatat ttagcagcat ccttgctaga 137400tgccatttac
aacaatcaaa aatgtctcca ggcattgcta agtgtccagg tggggagcgg 137460aataggcaga
accgcctcca gtcgagaagg gctgaactgt acttaggatt tgtgattctg 137520cagaaatggg
ttctgtgaat gcactaaagc atgagaacat ccaaaaatca gtgtggcaca 137580gaactgaatt
caaatttacc tggatccagg taaataagca caattttgaa agtaccaaac 137640agaattcaag
aaaaacaaca acaacaacaa aaaacacatt aaaaacagaa aagttggccg 137700ggcgccgtgg
ctcacgcctg taatcccaac actttgggag gccgaggtgg gcggatcacc 137760tgaggtcaga
agttcgagac cagcccgacc aacatggtga aaccctgact ctacaaaaaa 137820tacaaaaatt
agccgagcgt ggtggtgggt gcctgtagtc ccacctactc gggaggctga 137880ggcaggagaa
tcgctcgaac ctggagggcg gaggttgcag taagccgaga tcgcaccatt 137940gcattccagc
ctgggcaaaa agagtgaaac tccatatcca aaaaaaaagg ggggaaaggt 138000cagaaagacg
ttctttttca ctaggtgatg aattaatctt tcccatcttt gtcatgttaa 138060aaatgtgcct
gtggctgttt ctccactaaa tgaagacact ggcacacaga tgattgtgtg 138120taaacactag
gcaagacact aggcacaatc gtaggggttc agtaattatt gaatgagaag 138180aaagaaaggt
tgttcttgaa ttctgtgatg tgagtgttta atgatgtttt tgaggagaat 138240gccatatgta
gcagacagaa ccctgggctg ggcatcagag ggaccggggg caccacactg 138300ggctttgtgt
cattatctcc agaaagtaac tggcctggtg cacaggagaa agttagagta 138360cattgtccat
acagttccca catagctgta gggttttatg aacccatatt ttaaagggcc 138420ttctaatgaa
gtgaggatgt taatgctttt ttttaaaaaa aaaaaaaaaa aaagagagag 138480agagattggt
gattacttta aaatctgagt cctggccagg actcacgcct gtaatcccag 138540cactttggga
ggctgagacg ggcagatcat ttgaggtcag gagttcgaga ccagcctggc 138600caacatggtg
aaatcctgtc tctactaaat acaaaaatca cgctcctgta atcccagtta 138660cttgggaggc
tgaggcagaa gaatcacttg aacgtgggag gcggaggttg cagtgagcca 138720agatcacacc
actgcactcc agcctgggca acagaatgag actccatctc aaaacaaaca 138780aaaaaatcca
agtcctaatt atcgctgcag ctttctttaa tgtgtgcagt aaggtgggga 138840catacagaat
aacttaagat gtagaaatcg tctctccctc cctgaagaat atcatggata 138900ctacatttgc
ctgccaggaa tgttgtgttg ttctggtgtt ctcgtttctg ccgttatcat 138960tgtttctgct
taatttttac aacacaactc tggccgcacc cgtggatgat catggaaagg 139020cagctgcctg
gctttccaga attgttttga ctctccatct gaacccagag ctctgtttcc 139080tgtacaggtt
gctagcagta tttacccaaa ttgtcccttc ccaactcaga cccactgacc 139140cttcctttct
cagctgatta cgggtttaag tttgaatcaa accatgctat agaatcacta 139200tctagcagct
tctgtgcttg cacaataaga gtattagtca gcgtttcctg ttttcattca 139260tattgtaagt
tgaatagctg taacaaaagg gaaagagatt gccaagacaa ataattctta 139320tggcttgtgc
aaacaattta ttacaagcct tattaaatat gcagctttca gaagtctacg 139380tgatatgcct
ttcagcaaag ctaaatccta ctatgtggac ttgcttcatg acttaagaac 139440aagtctatat
tgaatttaac attaaataat tatttggcct gtatctcttt taagctttgt 139500taagaactgc
aaatatgata agggaataaa ttgctcattt caagtcacta agaaaaccta 139560acctataggt
tatatatgtt ggcagttccc aagaagtacc taggaaatgg aaccttccta 139620taagagtggt
ttgtcccctt ctcacgagat gggtcttcat atcaaggccc cctaaggcac 139680agatacacag
cactccacag tgtccaagtt cgtagtaatt ggtggcacct tattaaagta 139740gtttttactt
tcataattac tgattacaca tcagcatacc ttttccatgt tctgtttcct 139800tggcagcctg
atatttgaat acaaacgaca taaaagtgat tacatgcagc atttcaggac 139860agttttcaaa
ctgagagaat ttggcacttt cttttatttt cttcccattc ttcgtctatt 139920tcctttgcat
ttggaagcca aactcctcga acctttactt ctgcatgttg ttttcgtccc 139980ctagctagag
ttctctttat tactactttt ttgtttccaa ttggtctctt tttaaaaata 140040aaacatggtg
aaaccccatc aatacaaaaa ttagccgggc atgatggcgg gtgcctgtga 140100tcccagctac
tcgggagact gaggtgggag aatcaattga acccaggaga cggaggttgc 140160agtgagctgt
gatcgtgcca ttgcactcca gcctgggcga caagagtgaa actccgtttc 140220aaaaataaat
gaataagtaa attaaaatat gaaattaagc ttagctagcc tgtgtagatg 140280ttttagatgg
tcatatcttt atcggatatg acatttatgc ctttcagtaa aaccattgta 140340gtattgtaag
aaaaaaaggg aagtatgata gggaggacta agaacatatt tgtacttcta 140400tgtaatgcta
ttttatgtgt ctatttccat tgaagtgtgg ccttgacatc actggctccc 140460aagtcccatc
ctgttttttc attgacctgc ttgtgccagg tactgcgtta ggaagtggag 140520ctaccagatg
aataaggcgt ggtcccgccc tccatgtggg agacggcacg tcagtgtact 140580taggatgctg
catgtggtga cagatgaggg ccctgcaagg gagcacacca cagggccggg 140640agggtgggat
gggcgggagg agtagccaac tttgcccgaa agcatcagga caagcttcca 140700ggaaatggca
catccaagct gagtcttgga cagcgggtta aagaggcaga gaggggagag 140760aggaatgttc
cacatgggaa aagaggtctg gaggtgacaa caagtaacat agagtatgtg 140820ggctgtgccg
gaagcttgat attacagagg cacgaagtct gaggcaggga gtgtgaagga 140880agatgaaggt
aggggaagga agtttacaca agggtggcct tgacttttat attaaggcac 140940tttagggaag
caggaaccac tgaagacagg agaacatggt gactacacgg gcgcattgaa 141000ggggggagga
gctaggtttt tgaagggaat ttgagcttgc aatttttcaa gggcattcaa 141060tggttataaa
tgaaactttt cattaatata tttgtgtcca agtgttttta gatgtgcttt 141120tctgctactc
tctttttatt tggtttggag atagatggtg actttccttc agtgccttct 141180atgtgtcttg
acatgttaat gtctcaaaca catggtccgt atcatcatgg agcttgcaat 141240ctaacagggt
agtttaacac ataaatatgc atccagttgg aaagaactta gagatcactt 141300gatcagcccc
cgttatgtaa aatgtatgca acaatctacc tccaactata gggcgacatt 141360aaacagtttt
tcaaaatacc atggcattga aaaaaatgcg taacatttgg gttacttcag 141420tcccactccc
cggtactaat caggccagtg aaccagtctt gctggtgaaa taatgacttc 141480tccagtcatg
atggatgatt tcccttcatc atgaccattt cttccctggc tctgccttta 141540cctatgaatc
tcttctttcc tccctctcct tcctcctctt ggtctctgcc tgccataccc 141600aactccacat
ttctctgctt ggaaaaatcc tacttgattc tcagagcctg actccatcac 141660cccctctctc
caaccctccc aggcaaactc cgtccctccc agtgtgccgc cctccagctc 141720tctctagttg
aatagcagat gttctctctc ccacttccct gcagggcttc agggtagtga 141780ccctaagagt
gagcttctac cagtgagaaa aacgcctgga tggagcagga attcccacaa 141840atgcttagga
attcaatatg atttttagca agtgggagaa ttatcctttt ggttacaaag 141900atgtgaacta
tttttgtgag ttctcagtat gcaccaagaa aaaccccagt gactcagtaa 141960tgtacctgtg
ccgtctcact taacccttcc aacaaggaac ttgccaaggt gcacagctgg 142020gagggatccg
gggtcccgtc cgtgtgctaa cctgcaggac agcgccgcat aatccagtac 142080tggggcctta
ttgtccagcc ttccacctcc aggagcccac tcagtgatga cttttgacaa 142140aagtcacccg
gattatccct cttttggggt ccatgatgcc accaacggct aaagttaaca 142200aatgcattac
caggaggtac cactgaggta aggattgtta ctgtaagtga ttttctctga 142260ttttgaccac
tacaagtaaa atcacaagat ttgtcccaaa tactatataa agaggaatgt 142320cttttatttc
tcgaaattgt tctgccactt tgtttgttca aataacacat aataaaggat 142380aaatggcttg
tagctatatt tatccaaatt gaacattttt gatgtaattt ttttaatttg 142440tcaaaagcat
atcaatggtc ataaaaattt tttatgtcct ctggccctat taccttaccc 142500ttgggaattt
attccaagaa cgtaatttga aagagattaa agcacatagg cacaaatatt 142560ttcagtgcag
tattgtttat attggaaaca acccaaatgt tcagtaacag ggggaacaat 142620tttgtaaatc
ttgctacagc aacttaacaa aattataagc tgccacttat aattgtaaat 142680tttgaaagtt
tgtagcacag caaggaaact gatcatgtta aatgtttctg atatcttagt 142740taatttttct
cccgccagtt ccgatcggtg tcctggtcat accgtcagca tataagtatg 142800tatttcataa
acatttactt tgtattaact cagtagtatt cccttcctag agtggtttgc 142860ttcccagatt
gtaatggaca atcatctcac ctggggatct tgtaacaatg tgattctgat 142920tctgtaaact
ttgtggggag cccaagattc cacatgtctt agaggctctc agatactgct 142980gatgctgttg
atcacgaatc acgcattgag cagcaaggtc ctagagctct ttgtgtatga 143040agaagatact
actgacccag tgcctcgcta tcttgaaagg cagctgtgac ctttctagca 143100actgaggggg
tggggactag aaacgagtgt tttctggtgt ttctctaata aagctgctgt 143160cttgtatcca
gattgaccat cagtatcctt atgcccttca actaacttga aatttgtcag 143220ttaacagaaa
ttactgggtg gcactaaaat aaattattct taatataaag ggtaagaaag 143280ggtcgggtgt
ggtggctcat gcctgtaatc ccagcacttt gggaggccaa ggtgggcaga 143340tcacctgaga
tcaggacttt gagaccagcc tggccaacat ggcaaaaacc cgtctctact 143400aacaataaaa
aaaaaaaatt agccgggtgt ggtggtgggt tcctgtaatc ccagctactt 143460gggaagctga
ggcaggagaa tcgcttgaac ccgggaggcg gaggttgcag tgagctgaga 143520ttgtgccatt
gcactccagc ctgggggaca agagcgaaac tccatctcaa aaaaaaaaaa 143580aaaaaaaaga
gtaagaaagg atcttccatt gtaatcatac tatcattgcc ttttgaaaca 143640atttggaacc
tgattacaaa gttgaaaatc cccatctgta tgaagaaaag cttcactggc 143700tggtcacctt
ggctcaacgc ctgtaatccc agcactttgt gaggctgagg cgggtggatc 143760acaaggtcag
gagtttgaga ccagtctggc caacatggtg aaaccccatc tctactaaaa 143820atacaaaaat
aaaaaattag ccaggtgtgg tggcaaacac ctgtagctac tcaggaggct 143880gagtcaggag
aatcgcttgt gcccgggagg cagaggttgc agtgagccga gatcatgccg 143940ctgtactcca
gcctgtgcga cagggcaaga caccatctcc aaaaaaaaaa gcttcactgt 144000gagttatgat
atgggaagaa aagtcttggc ctggccacca caaggaatcc ctcaggcctg 144060gcggtgggaa
gcagaggagg gcgcagtctc tttagggcag aaagtgaaga acatttctat 144120gacaatgggg
aaatcagttc acctttgtgt tggtaaacaa aaggaatgta agctgtcctt 144180tgagctcgaa
tcatagtggt gattgccaaa acctaccttt gtttttagaa gataatagca 144240catattccac
ttgcttcccc acagtgaccg aagctggctt tggtgctgac atcggaatgg 144300agaaattctt
caacatcaag tgccgagctt ccggcttggt gcccaacgtg gttgtgttag 144360tggcaacggt
gcgagctctg aagatgcatg gaggcgggcc aagtgtaagt gcccacaccg 144420ccttcctaat
accaaagaaa tttcattatg ttaaaacaga attgagcatt tgaaaaattc 144480ctgcccattt
aatgactata gttttctaga gtatttttct taaaactttt tttttcctgt 144540agacctccct
cgtggacatc taaaacgttt tttgttttgt gttgtttttt gttttttccc 144600aacaccacca
tagggcactt tgagtttagt aagttcgagt aaacagtaac acccacagtt 144660ctttctaaac
aatattttgg aaaggagcat aatgcaacca ctttttagtc tcaagcttca 144720catctgatac
tcaggccaag attgtataac catatggtaa tgggaggtta gaggctcttg 144780tccctgaacg
aagaacaaga ggattagagt gagttctttg atgacagaga tttttcattg 144840tttgtacaga
tagtttttcg gttcactgat gctgcaactc atcagatttc atctaaaact 144900attttccttc
tctttctgtt tattatgatt tctcctcaca actctgttat aatactggat 144960tgtttctaag
agctttgctt ggaaaacaca aattcgattt taacttcttt aaagactttt 145020tatatttacc
tatcctaata tatatagaaa ttacagacat tataaaggaa atatattcct 145080ttccacagaa
aacctgtgca tagatttttg ggggtttttt tgttgttgtt tgcttgtttg 145140ttgttttttt
gtttcatttt taggaatttc caagtggaga ataagacatt ttatactgat 145200taatcatttt
gaaataaaat aaaatttctc tatcataata ttcacatatg agacgaactg 145260ccaaatctga
tttcttaatt ggtatactgt actgtgtatg tgacaaaagt catatattca 145320atgattattg
catagtcaaa acaagtagca ggtttgccat catgaaacgc tgtaaagcaa 145380ttctttctcc
tgggagaggt ggacatgatt tttataccat tttacagata aggaagctga 145440gttttgatag
gcttagggaa tatggacaat gtcagcaggc cagcagagag ccgagaactt 145500cagatctaaa
tctcctacta tgtccaccat gcagagctgg atgttaatga agggcagaag 145560ttttctaaat
ccaaatccac tcctaatacc cgacacttaa gcacttcatt tttgcctaaa 145620gcttcatcat
catttcaaat gcaagaatga tataatacag attaaaagta gtaaatcata 145680aagaattctg
acttcactca aaacacccct tgttgctaca gtccttcctg gattttagac 145740attgaatctg
tataacaact tctattttta aaaactactt ccgcccatgt aggatgcatt 145800tggaattctc
cctccctgcc ctgccaccct gaaaattata gaaagggctt agagattgtc 145860tgatctcatt
cagtctttat aaagagaaaa tctctgaggc acagggagac gcagtgactt 145920gagtggtgtc
acacatctat ttcagggact gatcagatct acagtcctgg tctcttgaca 145980tcagtcccat
agttttccag actctggtct ctgtcctgta aaaattctct acatggttta 146040caatcctatt
tcattcatct taacatatgt gctctgttat tcagtggttg cactttcgtg 146100ccacattaat
cctttgtgtc tcttccactg ctgcttaatg tcatggttct cagccagtga 146160tcatggtaac
atggaacagg gttagtaggc acagatatgg gtggaggata agaaaaggtg 146220gggaccagcc
aggcgtggtg gctcatgcct gtaatcccag cactttggag gctgaggcag 146280gtggatcacc
tgaggagttc aagaccagtg gtggcgggcg cctgtaattc ccactactca 146340ggaggctgag
gcaagagaat cgcttgaact tgggaggcag aggttgcagt gatctgagat 146400tgcaccattg
cactccagcc tgggtgacaa gaacaaaagt ccatctcaac aacaacaaca 146460acaacaaaaa
aaaaaaaaaa aaaaaaaaaa aaaaaaacca gcagggacca ggaagccctg 146520tgaagctagg
catccctctc ctgggctcca gtagtacctt ctgcctcaca ttgtttcatg 146580aatgccttgt
tctgtgtctc cctatgtaat tctcaattcc ttgagtttag ggactgtgtc 146640ttattaattg
gtatggtcca agtgtctggc atggtctata tgcagggagt gttttcaata 146700atggcaagat
aaatgaactt tcatcctctt aatcctctat tccatagtgc tcatctcggt 146760gtccttcctt
catattgaat agctctgtat ttttgaacca tgattattca tttctacttc 146820tgttttcttt
tttctttttt gctacaaata ctgcgctggg atcgacttcc ttttgatcat 146880tacacaaata
accatggctg cctcttatgt ttattcagct tgtttttttt ccataatttg 146940tacagtggaa
aatcttgtgg cccttgaatt tgcccatttt aaatttgccc tcagtcattt 147000taaactcaca
tcctgtcccc cagagtgctc ggccattccc agtgcaatcc aataactaat 147060attgtgctat
gctttgggat ccacagattc tttcacaatt agatctttaa actaaccatt 147120gtatgaatca
gcattcttca cacatagaac caataaggat gtgtgtatat atagagaaag 147180atatttattg
taaggaattg ggtcacatga ttacagaagc tggcaagtcc aaaatctgca 147240ctgtgggctg
gcaagctgaa gacccaggag aggcagtggt gctaatgcag tctgaaggca 147300gcagtctgct
ggagaattct cctgctctca ggggaggcca gtctttttgt tgtactcaaa 147360aaggcctttg
actggtcagt tgagacccac ccacattatg taggacaact tgcctactca 147420gaggctgctg
atttccatat caatttcatc atagacaccc tcacagaaac acgcaaaata 147480atgtttgaac
aaatactgga caccctatgg ccccagcaaa gttgacacat aagattaacc 147540acaaggccag
gcacagtgtt tcaccctgta atgccagcac tttgagaggc cgaggtggga 147600ggactgcttg
agcccaagag ttcgagacca gcctggggaa gatagcaaga cctcgtctct 147660acaaaataat
aaaaaattat ctgggtgtgg tggtacacac ctgtattcct agctactggg 147720gaggctgagg
caggaggatc acttgagccc aggagtttga gaccgcagtg agctgtgatt 147780gtgccactgc
cctccagcct ggacaacaga actttatggc aattaccgta aaatgatttc 147840gaatggtgat
gtactctgta taaatcaagt attaaacata ttattagttt tatgttctaa 147900tttgaaccaa
tgtttgagaa gtttcatact gttccattgg ctcaaattag aatcatccag 147960ggagatttta
aacattactg atcccaggtt cccctcccag aacttctgct ttcattggtc 148020tgggtggagc
ccagggatca gtgttttcta aaagcttccc gcaggagttt aggtttagcc 148080agtgctgcca
gctactgttt taaaagcaca aaacatagag gtacatttct aaaaattgaa 148140tgtgattatg
ttgttctttc tttcagatcg gttgtgtaag catttttata ggattattct 148200tcatgttttc
tctccttatt tcaggtaacg gctggtgttc ctcttaagaa agaatataca 148260gaggaggtaa
gaggagctgt ttagatgctt atgtgaagaa tctcttaaat ccagtgactc 148320gttggctttc
agtgaatgag ccctccttca gtggcctctt agaaaagtgc tgttctgtcc 148380gggcgcagtg
actcacacct gtaatttcag cactttggga ggccaaggcg accggatcac 148440atgaggccag
gagttcaaga ccagccaggc aaaacccatg gccaacatgg caaaacctca 148500tctctactaa
aaatacaaaa attagctggg catggtggct cacgcctgta gtcccagcat 148560gagaatcgct
tgaacctggg aggcagaggt tgcagtgagc tgcgatcgca ccactgtact 148620ccagcctgga
cgacagagtg agactctgtc tcaaaaaaaa gaaaagaaaa gtgccgttct 148680aagaatccga
gaggaaatgg atgggatgaa aggataaaat tcattcagtt aactcttaaa 148740tatcaagaaa
agatggatat aggctttgtc tactattgtg tactcccagg gtagccatca 148800gagcaggaac
tcaaggtctc caggtcagaa acaaatatga agcttacact tctgtattct 148860tttgtcatgg
aaggtctata tacagttggc tttatgattt acaggagtgg cttactcttt 148920tctgtaaata
gactttgatt tgctggcagc agtacctctt agctcctaca ggtggcgcca 148980gggatctgag
ccaggaatac ctggtcagag ctcctcaccc acctggggcc ttgtgcctgt 149040caggcctcaa
cctgcagagc atgggcgcat tgcatagacc ctgcagcctg tcctcagcag 149100ggtgagctgc
caggcttcag acccacccat atgccattgg atccagaatc attttagaaa 149160ttattttgaa
gacgaacctc tttattcttg cagccatctt tcttggagga aaaagtgaaa 149220aaagatgtag
caaatcctgc cagagaggtg ctggcagttt agcttggcag gtttggtggg 149280agacaataca
caggggtctg ggttttgctt ttttcttttt tacagaacat ccagctggtg 149340gcagacggct
gctgtaacct ccagaagcaa attcagatca ctcagctctt tggggttccc 149400gttgtggtgg
ctctgaatgt cttcaagtaa gtccagcctc ctcctttaaa tgtgggcatt 149460atcactaggc
caccctgtga acgatatttg tgtcttgggg tatttggtct ttctgtgctt 149520catctaaaag
tatacagaaa tacccagttc tttctgtttg tttatttggt ttggtttcat 149580tttagcaaat
gtgttctcac aatgttatta aggaagtgaa atttatatat agtgagttac 149640tatctccttg
gctttttttt tttttttttt ttttaattct acccagcatt tttgtctgaa 149700acaaactgtg
aatgatcaaa gggatatgtt atggaatttt cttgataaca cccttaacga 149760cttgaaatat
ttgagtcact ttagagagga cttttttttt ccatgcttta agtctgattt 149820atttaaacac
gatgtgacca cttttgatgt ttaaaatgta ttcagtgtta gggagcgaag 149880ttctgtataa
atggtgagtc tttctttgaa aacatagcta aatggataca gatgcaaaac 149940cacaaacatt
ttcttcacta ggaccgacac ccgcgctgag attgacttgg tgtgtgagct 150000tgcaaagcgg
gctggtgcct ttgatgcagt cccctgctat cactggtccg ttggtggaaa 150060aggatcggtg
gacttggctc gggctgtgag agaggctgcg agtaaaagaa gccgattcca 150120gttcctgtat
gatgttcagg taagatctag taaaaacaat ggctcacatt tcttacacct 150180tagcatgggt
tgtcccattc tgttgtcacc acaaccctac agataagaaa actaaagata 150240gaagagttta
ggcccggtgc agtggctcac gcctgtaatc ccaacacttt gggaggctag 150300ggcgggcaga
tcacttgagg tcaggagttc gagaccagcc tggccaacat agagaaaccc 150360catctctact
aaaaatacaa aaattaggtg ggcatggtgg cgtgcaccta tagtcccagc 150420tactcgggag
gctgaggtgg gagaattgct tgaacccagg aggtggaggc tgcagtgagc 150480caagatcgag
atcacaccac tgcactccag cctgggcgac agagtgagac cctgtctcaa 150540acaaacaaac
aaaaaggtag aagagtttaa gccatatgac aaagtcacat ttccaagtac 150600agaccctagc
tccttgtccc aaagcctgtg cttttgaaga ctcccaaatc gcagtcctaa 150660gtgaggaggt
actttagcga acaggatttg acagctgacg ctttcaggag ttgcacagga 150720acaagagcaa
tttcgtgagg attgggaaac catgggaggt attttccttg ttcatttctc 150780tttttttttt
ttgagacagt ctctgttgct caggctggag tacagtggtg caatctcggc 150840ttactgcaac
ctctgcctcc caggttcaag cacttctctg cctcagcctc ctgactagct 150900gggattacag
gcacctgcca acacgtctgg ctaagttttt gtatttttag tagagacggg 150960gtttcaccat
cttgggcagg ctggtctcaa actcctgatc tcatggttca cccacctcgg 151020cctcccaaag
tgctgggatt ataggcatga gccaccacac ctggcctctg cttgttcatt 151080tctgttatgg
ttcttttctc ttgtggtttt attgaactat gtctagtgta tgtgtgtgta 151140tatatacata
cataatatat ttatttattt tcattgtggt aaatatatat ataacaaaat 151200ttactatctt
agcctttttt tttttttttt ttttttgaga cagagccttg ctctgttgcc 151260caggctggag
tgcaatggcg tgatctcggc tcactgcaac ctccacctcc cgggttcaag 151320caattcttct
gcctcagcct cccaagtagc tgggactaca ggcgcaagca agcgtgcctg 151380gctaatttct
tttttatttt agtagagtcg gggtttcacc gtgttgcccg ggctggtctc 151440gaactcctga
gctccggcag tccacccgcc tcggcctccc agagtgctag ggttgcaggc 151500gtgagccacc
acacctggcc catcttaacc atttttaagt gcacgtcagt agtgttaagt 151560acattcacat
tgtcatgcag ccaatctcca gaacgctttt catcttgcaa aactgaaact 151620gtacccactg
aacaactccc catgcccctc cctacagtcc ctgccagcta ccattctcct 151680tcatttccat
gaatgtaaca actccaggga cctcatgtaa gtagcatcat actgtggctg 151740gcttatttca
cttagcataa tgtgcgtgca tgttgtagag tgtcagaatt tcccttcttt 151800tttttttttg
agatggagtt tcgcttttgt tgcccgggct ggagtgcaga ggcaccatct 151860cagctcaccg
caaccttctg ctctcgggtt caagcaattc tcctgcctca ggctcccgag 151920tagctgggat
tacaggcgtg caccaccacg cccagctaat ttttgtattt ttagcagaga 151980cagggtttca
ccatgttcgt caggctggtc tcaaactcct gaccttgtga tccacccacc 152040tcggcttccc
aaagtgctgg gattacaggt gtgagccacc gcgcccggcc agaatttcct 152100ttatttttcg
ggctgaatac gctcccattc tttggatatc ccacattttg tttctttgtc 152160catctgtgga
tggacacttg agctccccct cacagctatt gtaagtaatg ctgtatgagc 152220atgcgtgttt
atggcatatg gcatatgtac aagtatccct tcaagaccgt gttttctttt 152280tttttttttt
ttttttttga gacggagtct tgctctgtcg cccaggctgg agtgcagtgg 152340cgcaatctcg
gctcactgca agctccgcct cccgggttca tgccatgttt tcaattcttt 152400tggaaatgta
ttcccagaag tgggattgct gggtcctcta ggatattttt aagcacatgt 152460cgatggttca
gtcagaagac ctgagcttcc cagggctcac tctcctcctc agatcgtctg 152520tctttgcacc
cgtctgacat ctccattctg aattttgtgg ttccgattct ggctgatgga 152580tttgtcttca
ggggtagtgc atgattgtgg agtgcccagg cttcaagctg attgctgctt 152640ctcctgttcc
ttcatccctc acatattcta ttgttgagtc actcagatgt cacagacact 152700cttctttcct
gttcagtggc ataaatcaat agcataagtg aagaagaaat ccgtgaagca 152760agagcttagc
agcgttgttg ataagcctgg agtcacagtg gtgagagaac ggggctgatg 152820tattaatatt
taatattacc acagctctcg cttttaagga gttgtttgtg agctgggtat 152880atgggcgctt
tgcacatgtt cacctctttg gtcctaacat ccctcttagg tggaaactgg 152940tgatatcttc
ccattttaca gacaagaaaa ctaaagcacg aagagactaa gtcaactgct 153000cagtgtccca
tagctaggaa gaggcagagc tgggagaaag aatatccaga tggttgaggt 153060catcaccctg
ctctgttctt cagagctaag tactgtgctc aaaagtgtgt gtgtattgga 153120ggtctggggg
gagacatagt taaaggggac agtgactgag tgacccccct gaagagaaag 153180ggtgacagtg
atgggcatag gcacagattc catgccacac aggggacatc atcgctggca 153240cgcagtcagt
gtttacggaa ggaaggacac agggaattaa ttggaggcta gctcataaaa 153300ggtttttcat
ttttatgagg ccattttgtg aatgaggaac tagaatattc tggactacag 153360aggcggtaga
gatcaattga gggagaagat tttggatgag tgaaaagaaa aatgttccag 153420ttggaagctt
tgagttctca gacctagagg tgttcagcgc aaaatagatg tagccctgct 153480gtgggcaagt
cagaggcatt gaagccaact cagatggcag gtggccaagg agaccctggg 153540ttcctgttct
ttggtcctgg gtaaggcttg ccagggtacc tacaaccctt ggcacatgcc 153600agagcctttc
tacataaacc actgtaggga gagtcacccc cagatcctgc tgtgagtgtg 153660cctctgatca
tgccatctga tcatgccatc ttgccgtctg atcatgaacc accccactct 153720gcatagcaaa
ggcagttatt ttcatgaatg atttcacaaa gtgcggctct tttcaaggaa 153780aacatcccag
gtgaaaaaga ggtcagtaat cagaagaaat gcttctgagt tatgctgtct 153840cagacatgca
ctaaccctaa catgaaccct tttcctttcc actgatattt caattttttt 153900ttctgaattc
ccattcttca tgcagataat gctgctaaaa tgtcagctct cttctgaaag 153960gaggaaaatg
tttaataata atgtggtcta aagtcctcta ttgataggcg acctcaaggc 154020caggatccct
gcctcctttg ccaacacact tctcgctttt cttgtaacag aaattcgatt 154080tgagctgcat
aattgtttgt tagataatca gcctcccagc ctactgcagc atgacagttc 154140atgaaaagag
ctgctgctgc tgctggctct ctttgtactg tctgcctgtt agctgtgttc 154200ctcacaccag
caagatttag atcatctaag accatttaca gtcttgtctt gtcaccatgt 154260aaaaatctga
attcattggt gcagtcagta atagagctca agccttcttt gccccggatt 154320tcttgcattg
attggtgctg gtaaaacttt agtaacttaa gtaatggttt tctggaagcc 154380ctttttacct
cccctacaac cccacaccct cacaaccaac ccttgggtgc tgggcagtga 154440gaaagatatg
gcattgctct ataggagaaa gaagttataa atgctttgca tatgacactc 154500tcccaccttt
gcttctcaaa gcaaactggg tttctgtgtg ccccttggtg tctccccaga 154560ctttatatcc
ccaagaatgt attcagtaca agactaatgg catcttgtgt gctattcttt 154620ctccacccca
gtcttgtgga aggaaaacac cggagatgat ttagggtaat gggaaatagt 154680gctagagccc
agggcatgac tgcatttgaa tgtgaagtag aaattgtctc ggaaccccac 154740tggagagtga
ttcagctgga gagagtgggg cacctggctc tcagaggcac tgtgcggcca 154800gacagcagtg
tgctggactt cctaatggcg agctcacagg ttaagtgaga aacaggctgc 154860acttctgact
tgggagtctc tgctgaactt tatagaaaac tgacttcagc agcaactgct 154920gtctagagaa
caatttggca tctctttttg ccagagtaac ttctaactca agagtaggca 154980acttttaatt
atggtaccgg atggaggcag caatgaataa tttagaataa agattgaaaa 155040atatggaata
aggtgtagta gactaattgc tcagaagaag ttttattgtg tgaaaatgaa 155100tgtgtttaaa
cttgaaatac agaggatcat ttcctctcta cccttgaggg tgattgcagt 155160cccttttaag
aagctgttaa aagtccccct caagatcatc ttctctttga attttccttt 155220ctaatgtgac
ttttgacaat tatatcttta tcgttcacta agagaatatc tccatgtctg 155280cagggatgca
gggaaatatg atggctgaaa gaaacgctaa tctacatgcg cactttatta 155340gctaagaaat
ccctcctcca tacgttcttc caggcatgcc tccgcacatc ccatctgcta 155400tgtgggctca
actctgcatt atagatggga aggtgatgga cttgtccata agaaatggac 155460gcttgatagg
agaaggaaca tattacatag agctaacagg ctaaagcttt gggatcatgc 155520aaattaagtt
taaaccttga gtaggcctcc tactgtggac acaatttggg gaagctgact 155580aatctctcga
gcctcagcat cctcatttgt acaatggtgt ctgacagtga caccaacttc 155640accaagttgc
tgtgagaccc aaatgaaatg ccccagagta aaatgctgga acatagaaag 155700caggcaatga
aggccaggag tggtggctct cgcctgtaat cccagcacct tgggaggctg 155760aggcaggtgg
atcacctgag gccaggagtt cgagaccagc ctgggaaaca tggtgaaacc 155820ccatctctac
taaaaataca aaaactagac aggtgtggtg tcacgcacct gtagtcccag 155880ctactcggga
ggctgaggca ggagaatcgc ttgaacctgg gaggtagagg ttgcagtgag 155940ctgagatcac
accactgcac tccagcctgg gcaacagagt tagactctgt ctcaaaaaga 156000aaaaaatgga
ggaaagtagt caatcaatgc tggttcttcc tcctgtcatg ttcacttgga 156060aagagagtgg
gcctcctgaa ttcaccttca agaattctcc tgtaactcaa ggtaactttg 156120tcaaactcca
agattggttt attttgctct gggttaaaac caggaattag aagtcaagtc 156180tcctaaagca
ctactgaact aaaccaagac actgacgttt ttccttctcc gtgagaattg 156240ggagccctta
gtgatatcat aaaacatcag tcatttttgc aaagaaggag gtatgacccg 156300gagaagtcag
cttgacattc agtcggaata ttgtttgtta aatgaggagt taaggttgaa 156360aatcatctct
gtatccttcc tctagccttg caaacatttt aaggtgtagc cttgacagca 156420agacaggata
gtgtttgttt ttattaaaag gtctttcctg ttccttgcgt gaagccattc 156480tcttgatccc
agtgggggct ggatgaagga agagcaaaac aaaccacgaa gacagcctgg 156540agccgcagcc
agctctgctg acttctgaac ttcacgctct cactgcccag atggccaggg 156600gtgtttcctc
ctctgctggt cttttttaag gcaggatgtt tgttgttatt gttgttgttt 156660ttctggttgg
ttcttaagaa attattttct acatcttgaa aatgccagct ggaacctaca 156720ccagttgttg
gtggggtgtg agaactatcc tttacatgga cttggagcat ttgcaactag 156780agcctccaca
ttgaaataaa catttaaaaa tgcatatcta tggatataca gtgggggatc 156840ttccattgtg
gtaaaaatat gtataacata caatttcccc tttaatcgct tttacatgta 156900tagtctgtgg
cattaagtac attcacattg tacatacagg gtttaattaa taggatctta 156960aaacttaaaa
gggtattgat gcacatctct gttgcctttt acatttcaca gtaattagaa 157020gggaaaaaac
acactctagg ttgaaaggct gtcctttctt ttggttgtcc ttgtccctgg 157080aaaagctgtc
ttccccctca gggcctttga cctcccccgg ctctccctcc catcttcaga 157140cacgcccagg
gccagcctga gtcagtagcc ctttcttcag cgctccctct tctcctgttg 157200tgcattagaa
tgtttggagt gttctgtgga aagctggtcc ttacattttc cagtcagtta 157260tgcatctcac
tcataaacct gtttcttccc ccatgctctt taatattctg ctttcttaat 157320tatggttgag
ctgcctttaa acagtcttag attgttcaat gtgagtggct tcaacaagcc 157380tcccataagc
ctcttatccc ctggccccct ttccatgatc caaccccaaa tctccagata 157440ccccctgctt
tttttatgtt tttttgtttg ttggttggtt ggttggtttt tgggtttttt 157500tttttttttt
gaggcagagt ctcgctctgt cacccaggct ggagtgcatt ggcacgatct 157560cggctcactg
caacctccac ctcccggttt caagcaattc ttctgcctca gcctcctgag 157620tagctgggat
tacaggtgtg ggccactatg cccagcaaat ttttgtattt ttagtaaagg 157680cagagtttca
ccatgttggc tgggctggtc tcgaactcct gacctcaagt gatctgtcct 157740cctcagcctc
ccaaagtgcc gggattacag gcatgagcca ctgcaccaag ccccggatac 157800ctcctgcttg
accagacctt ctgcaaacac accttagatc cctctcaaat ctagatgctt 157860ctccaacact
ggcttctagg ttctttatga aacctctgct ggccggcagt atataaccca 157920ccacccagcc
tgccaggcac gtcattcagc tgtgttagac agtttcgagg gatctgcacc 157980tgtcatgagc
actggcccaa ccttctcaaa gcatcccact catccctgtg attctggcaa 158040agcttgcccc
agctttcttt cctcgatggt tgaaatagca cccaatgaga aaaaaacagt 158100ggcctgtatt
tattatactt acacatgtca caagaatccc tgtagtgtca ttgaaagctt 158160tcttctagca
tgcaaaatga ttgtcactat gatgtaagtt tctagaaatg ccttatttca 158220tctaaaatta
tgagagaagg taaaatgatg aactatcaaa gtcacactac atgttggccg 158280gctgccttct
accactcact ccatccctcc ccaggctttc tgacttaaat ttagatcaca 158340tgagaccatt
gcagtggaga gagtgtccac tttctcaatc taaaagcaag ctcgtgaaga 158400ccattaaata
tgatcacagg aacaaacatt aaaggtacca cacgtaatgg accacttaag 158460acagaaaact
acgatcttag ggtttagggg aaaaaaacac tcaaacagca caggcttcca 158520gtctccccag
tctctgagag gttgggattt ttttttttaa taaagtacaa gggctccttg 158580aacatgaaca
cggaacgagc gttgcacctg tattcattca tttcgctctt gtaacacatt 158640gatgttccta
agtcagattt gtagaagagg aacgagcaga tgcatcatga gaacactgtg 158700cttggccagg
cgcggcggct catgcctgta atcctagcac ttagggaggc caaggtagga 158760ggatcgctag
aatcgaggag ttcaagacta acctgggtga catggcagat cctgtctcta 158820ctaaaaatat
tttaaaaatt agccaggcat agtggcacat gcactgtagt ctcagctact 158880caggaggctg
aggtgggagg attgcttgag cttgggaggc agaggttgca gtgagccaag 158940atcaccccac
tgcattccag cctaagcaac agagccaagc tcaaaaaaaa aaggccaggc 159000acgatggctc
cctcctataa tctcaaccct ttgggagact gaggcgggca gatcacttga 159060agtcaggagt
tggagaccag cctggccaac atggcaaaac cccatctgta ctaaaaatac 159120aaaaatgagc
cgggcatcgt ggtgcgtgcc tgtaattcca gctactctgg aggctaaggc 159180ccaagaatca
cttgaacctg ggaagtggag gttgcggtga gccaagaaca tgccactaca 159240caccatcatg
cccggcctgt tttttttgtg tgtgtttgtt ttgggttttt tttgagcctg 159300ggcaacagaa
caagactctg tctcaaagaa aacaaaacag aacactgtac atgagtgttc 159360ctgggcccag
cagagtgctc gaaccagtca gcactttgcc ctcttttggg tgagatgaga 159420tctttaaaat
cacttattcc catatattcc gttttttggc attgctacac tattctattt 159480gagagtaata
aaagattacc catgtggatt tcttactatt tcatgtggag aaatgcagct 159540tttctagcgg
cgagcctttc ttccatttcc cagagtccgg ttaccctcac cgcaccccag 159600gtttgccgta
gaaggatgag attccctaca tgtaggcgtc ttcagagtga acctgttgct 159660cagggcccgc
gagcccacag ggctccatgc ccactgagca ggctcgtgga cgtcggcacg 159720tagctcagat
gtgctgattt ccagccaagg acatgctttg tgtcaacgat acaggggcct 159780gttgttctca
gtaacaaggg cagagccgct cccgggactc cagttggtat tggtaatgca 159840agcccttgca
cctagtgtgg tgtggtcagt ataaattaat atcgacagct tcatgttgta 159900ctttaatttg
aagtacagga aaggctgctt gccaaactgt atttgtagat catttaaaat 159960ttatgcttca
tttaaaatgc aaaatacgaa tggaaataat tactgtgtat ccggaaatgt 160020ctattcaact
tacaagttta ttaattgatc cttctcttta catgttttac gggacttgga 160080atacactttg
ctaatatcac agcataggaa gatttaaccc atatgaattc aactctgtag 160140ccttggtgaa
agaaaagaac catgtatttg tcattcactg tggcccccga tgtatgccaa 160200cgacttcatt
tggtgcaggg ctgaagaaat ggcagtgtgt tccaatactt cacaacagtg 160260gggcctcatt
aagagatgct agggtattga gagtggagtg ttacatagtg tttttcatag 160320gtcttctaaa
agatttttca gagttgctga cttaagaatt tggcctctga gtaagatgag 160380acgcccctac
tacgttataa agccttcatg gatctttatt tctaacagtt catttttaaa 160440cttttgttcc
atttcatttt attttgtcca ttttttgtca tcaaaaaatt attttaaaat 160500gtaacagctc
ttctagaatg tacagatcaa ttttattcca ctaaaaaaca ggacttcaat 160560aatttaattg
aagaattcta gtgagcattt ccagagcctt tccaagaaag ctatactgtt 160620taactttgag
tgactcagct tctaggaccc tggtccacca cacctacaaa atgaatttaa 160680taaagcatga
aggtcaccaa atcctctact atttcttcgt ggagtcattc agctagtgtt 160740tatgttgtgc
ctgctctgtg cgagagactg tgccaccaat ataaccactg gtcacagtta 160800ttttctagag
ccatactgcc tgatatccca tgtggctaag attgacagtt cagtttctgg 160860attgttccag
tcccatttcc agtgctttta atagccacat gaggccagtg aacacacgat 160920acatcttcca
ttgtgataga aagtggcagt ggacagcttt gttctagagt atttccgttt 160980ctgaaataca
tgtttaagaa gtttggaaag gaattagtgt agagtatgag aagggttaca 161040tttttggtga
atatgaagat ttaacataat gttagttcaa aaatgaccaa agacacaagt 161100gtagacacca
ggcttttgaa gtagctaata tgttaagcag ctgtgcatta tatagacaga 161160atattcacta
tgtcccactc agccctgtgt ttgacaccat ttcaccttgg cttctgcctt 161220tctttccctt
ctttgcctat tgagattttt cctgtccttt ctatcctttt tttttttttt 161280tttttttttt
tttgaggcag agtcttgctg tgtcaccagg ctggagtcca gtggcacgat 161340ctcagctcac
tgcagtctcc acctcccggg ttcaagtgat tctcctgcct caccctccca 161400agtagctttg
actacaggaa tgcgccacca ctcccagcta atttttgtat tatttaaata 161460gagatggggt
ttcaccatgt tggccaggat ggtctcgatc tcttaacctc atgatccgcc 161520caccttggcc
tcccaaagtg ctgggattac aggtgtgagc caccgcacca ggcctcctgt 161580cctttcttta
atgcactatt ccagcttttt ttttaatatc ttgccagctt ttgctgattt 161640tcatgcctcc
ctctgcaacc tgtgatctct tgactttaat agtaatgttt atacagagtc 161700ttgactgcac
actctcttct gtcatcgttg tttgttgtgt ctcattgtcc ccaatccatc 161760agaagttctt
ggtacacaaa gtccctgaag ctgtatgaac aatgccctga cgtcttgaaa 161820ggacttgcca
taggtggtta tttgatcact tagcagatgc ctttgggatc ctactttgtg 161880caggccactc
tccaaggcac aatgtggcaa gtaaatatat tagtcaatat ttacagaacc 161940aagagaaagc
cagcaatgat taatgcataa acagtcataa attggccatt ttatttgttt 162000ataaaagcta
tcatttacta cagagaagta caagtgaata tcaagctcag atcaagacct 162060ttggactgag
cttggctttt tttttttttt tctttaaaga ttacacttct cccccaaatc 162120aagcagatta
accataggaa gttccttgcc tgctaacaga atataaatag attgcatgca 162180aggggaagga
ccttatcagg gcagatttta gtatttacac taaacccaaa agattatttc 162240cttaattgta
cttgtgagtg aaaaacaaaa attgtagcag gagtcccaag agcactctgg 162300ccatgggctt
ttgagctgtg agttgaagca ctttattcag catgtggctg cagggttgta 162360gctcaggttg
tagacagtgc attctcttca ggtcccagga aacagtgtca tacctgtgga 162420agccaccaga
atgcaccttc atcttctcca gtggtggcga ggttgaggcc aagcatcaga 162480agtgctgagt
gctgggaaac tgtggggctg gatagatggc agtgcctgct gggaagcaca 162540cagccctggg
ttgaggcagg gaagcagggc cgcccagttg ggattcattc gtactactgg 162600cccctcacag
taccgtcatg tgtgggtggt cgcgcttgtc ctacaggaag taaaccacaa 162660ctactcatct
cacacagtca gcccggcacc tccttggaat tcccaaaaca ctttccaata 162720aacattggaa
aggaagaaag tttagttggt aacaaaaaaa tgtatgttgc cttacgccca 162780aaagcattga
aagcagggac tcaagcaggt actcatgcag ccctgttgat agcagcattg 162840ttcccagtag
ctgaaaggtg ggaaccccct tgtctgtcag cggggatcag ctaaatatgg 162900cacacacata
ctatggaaca ttattcagcc ttgggaagga cacctcctat cacacggatg 162960aaccttgaga
acattatgct aaatgaaata caccagacac ggaaggacag atactgcttg 163020attccacttg
taggggcctg tagagccgtc acattcatag acaggaagta gaatggtggt 163080tgtcaggggc
caaggggagg ggcatggaaa gttgctgctt aatgggtaca gtttcagttt 163140tgctaaatac
agagagttct ggaggttggt tgcccgacac cataaatgga cttaacgcta 163200ctgaactgta
cacttagaaa tggttaaaat ggtaaattag gccaggcatg gtagcttgcg 163260gctgtaatcc
cagcactttg gaagtctgag gcaggcagat cacttgagcc caggagttca 163320agaccagcct
ggacaacata gcaagacccc catctctaca acaattttag aaattagctg 163380gtggcttaca
cctgtagtca tagctacaca gtatgctgag gtaagaggat cacttgagcc 163440caggaagttg
aggctgcagt gagccgtgat cacatcaccg aactccagaa tgagaccctg 163500ggtcacaaaa
aaagaaaaga agatggtaaa ttatatgtta tgtgtacgtt accaaaatta 163560aagttcttaa
aaatatatat tttaggccag acacagtggc tcacacctga aatcccagca 163620ctttgggagg
ccgaggcagg cagatcacct gaggtcagga gttcaagacc agcctggcca 163680acatagtgaa
accccatctc tactaaaaat acaaaaatta gccaggcgtg gtggcgggag 163740cctgtaatcc
cagctgttcg ggaggctgag gcagaaagaa ttgcttgacc cagaaggcag 163800aggttaccgt
gagccaagat cgcgccacta cactaccagc ctgggtaaca gagtgagact 163860ctgtctcaaa
taaataaata aataatatat atatacatat tatatagata gatagataga 163920tttcttacaa
aacattttcg tgtttaaaaa taacaaatca ggccaggcac agtggctcac 163980gcctgtaatc
ccagcacttt gggaggccga ggcaagcgga tcacctgagg tcaggagttt 164040gagaccagcc
tgaccaacat ggtgaaaccc tgtctctact aaaaatacaa aattagccgg 164100gcgtggtggc
acatgcctgt aatcccagct acttgggagg ctgaggcagg agaatcactt 164160gaacccagga
ggcagaggtt gcagtcagcc aagatctcgc cattccactc caccctgggt 164220gacagagtga
gtctctgtat aaaaaaaatt tttttaaata acaactcagc catccactcg 164280catcaattta
ttcactagtc attcgattgc tattactttg ctcccatgtg agttttttaa 164340attattaaca
gaaaaatcac actctttatt tttgaattat gatcttgagt tcagtaaaga 164400atgcattctg
ttgattaaat gtgttatgag taacagtatc ttgttttgct gtacttccaa 164460gcaaaatgta
attctttcag tcctccctac cgttctcact ccataataat atgacttaag 164520gtttgtatag
catgacgttt acaacgtgct tgcgttttac actgactcaa cgtgtgcagt 164580acagtgatgt
atagtgaagt attcacatcc tcatcttaga aatgaggaaa taaattcagg 164640gaggttacat
gatttaacca cacataagga ggggctgcat tctctctgag cccttctacc 164700tcacacagcc
accaagctcc tttgtgcaga gcccatggag ggttggagtg gccacactgg 164760ggtaggtgca
ggccagagac tggatgcaac aggaacattg cctgcttccc aaaaccactg 164820ttctactgcc
cggagaacac acccttctgg gagctgactt ctaacaaagc aaggtccttg 164880gacttgccct
gcggcttctt tcctgcaaga agtatcttcc agcatcctac caacttttat 164940ttcttaaaaa
aggttgttcc ttgccaggca ttggttgatc agcagaggag ctcgcattag 165000tttcagtgca
gtgttgggag caagcacttg agtgcccagc aacataggct gctcatgatc 165060taagattcat
ggaaaaagtc ttctgcgctg ttgcctgcag cacatcaaca cctgaatttt 165120cttatcatct
tgtgtggcta ttgaaccaca gggaacccaa gcttagtcaa taggaaatat 165180agaaatattc
attggttcaa ctgtataggg agggccttgt atatattcca ggggctgtat 165240cctatcttgc
ctctaggtaa ataagagatg gtcccgaggc tcaaagagtt tatcaaccat 165300tctttattct
tcagctaata tgaacattga atcttttaca ttagaagttg cccatgcagg 165360gcacgcctgt
aatcccagca acctgggagg ccaatgcagg aaaatcactt gaggccggga 165420gtttgagacc
agcctgggca acatggtgag acctcatctc taccaaaaat aaattttaaa 165480aaaagttacc
cttaaatttc tttctctaac ccagcctcaa ctttgtactc aggcagtaac 165540tctttgtaca
acattcctgg caccatattc agcttataca gaaccctcca ggcaaggaca 165600ggtcactgct
aagtgagaga gctctggtta ttatagttct tccttaaaca ctgcactcaa 165660aacgacttcc
ttctaacgtt tgtcccttgg ttcacattct gccctccttt cgattttttg 165720tatggcatgg
ccctccaaga gatttcagtg tcagtgtggt gctgccaagt ctgtgtgact 165780gttcacacac
tcgggctcca ttcacacacc ccaagattgc atgagctttt gctgcagcca 165840ctcacagtca
ggccccagtg aatccctgtc ctctgaagct gcaggtagga gcgccccaag 165900ggcatctaga
tgtcaaagaa gaggatgctc acatgtgcct aaggttcatc catgaaggct 165960tcgtggagga
ggtggctttt gggatagatc ttagaaagct cagagagttt taacaaagat 166020tggtccttga
agattactaa tcttttgact ttataaatac ttgcatatat cttcttccag 166080tctgtctctt
attttttgac cttatttata atgtcttctt aaaacaagtt ttaaattttg 166140ctgtagtgaa
atctatatat catttccttt atgggttatt ctattgttac ctcatttaag 166200aaatccttct
ctacttcaga ataatagagt tcccactgtt ttcttccaat atgtagagtt 166260ataattttca
agtttatgcc tttgttccat attgggttta tttggaggcc tggtatgtgg 166320taggaatata
attttatttc tctccagcca aaagtcagat ggtagtcagc ctttctctgt 166380tgatttgtgt
cacctctgtt atatactatc ctgcacatgc tgagtgtctt tgtgggttct 166440ttcttctttt
ccgttgatta atttgtctat acttgtgcca gtacccatgg aatttattag 166500catagttttg
tggtatgatt gatagctagt agaaaagtcc tctttctttg ttcttttggt 166560tttgttttat
ttttaaagtt atctgactat ttatgccatt accctgtgtt agttcactct 166620cgagttgcta
taaaggcatc cctgagactg gttaatggat aaagaaaagt agtttatttt 166680ggctcatggt
tctgaaggct gtgcaagcat ggcaccaaca tctgctgggc ttctggtgag 166740ggcctcagga
agcttccact cagggcagaa ggtgaaggcg gggggagcca gtgcatcaca 166800tggcaagagc
gggagcaaga gatggggtgg gaggtgccaa caagcagacc tcctgagaac 166860tcagagtgag
aactcactca ttatcgtgag gacagtacca agccattcat gagggatccg 166920cccccgtgat
ccaaacacct cccacctggc cccacctcca acattgagga ttacatttca 166980ccatgagatt
tggaggggac caccatccaa accacatcaa ccctttattg taagtttcag 167040aactaattgg
ccaagtaacc aaaaatgagc ttgggatttt tattggactt gcattgaact 167100gagtaagttt
cagagaagtg acgtctgtac agtgttaagt ctcctcatcc atgagcaaac 167160tattttcctc
cattgactca gactcttttt atgtccctca gtgaaaggtt ataaggcttg 167220caccttcatt
tctttttatt tgttagaata ttcaatattt atgttccaga ccacagaaca 167280cctctacagt
aataaaaact gtagtaatga aaaacacgtt ttaaagcagt tttgggttct 167340atcagggttg
ggaggggaag agtgctagct tttatttatt ttctcttggg taactcttgt 167400atcaatagta
tttcatacaa atgtgacact agctgttgcc aagcaaagtc cttgaggcac 167460atcacgggag
acttgtgccc cttggttgac atctgttcca tcagcatttg gttcagcagt 167520tgttcaccaa
acagtaaatc cactaattgc atcatgcata tgctttcttt tttttttttt 167580ttgagacaga
gtcttgctct gtcacccagg cttgaagtgc agtggcgcga tcttggctca 167640ctgcaacctc
tgcctcccag ggtcaagcga ttctcctgcc tcagcctcct gagtagctgg 167700gattacagaa
gtgcgctgcc acacccagct gatttttgta cttttagcag agacggggtt 167760tcaccatgtt
ggccacgctg atcttgaaca cctgacctca agtgatccac ccacctcagc 167820ctcccaaagt
gctgggatta caggtgtgag ccactgcacc tggccacata tgcatcttta 167880cccacaggag
gattgtagat aattttgcca aatgtgtttc ttaaatttag aattacttta 167940tctgtggaat
attattttcc taatgaatca ttcttgtttc tgaagatccc ttgccgtatg 168000ctatcaatag
tctgtgtcct ggttttcaat ctaacattat ggaaatcttc tcctccaaca 168060cacttatttg
cctgtatcta gtcttctcac atttcttcca tgccacatga cttctcataa 168120tttgcttaag
tgacagtagg ttcaacctct gattttccaa tatagtgttt gatcacatca 168180gcatttatgt
taaaatctca atccattact gctcatgaaa aaatacttgt ttttaatact 168240atgaacccgt
attcagaccc acttaaaaac cagcactcac aatttcgttg ggagactcct 168300gtcctgacat
atatgattat ttgaaaatta tagtaaacag gccaggcgtg atggctcatg 168360cctgtaatcc
cagtactttg ggaggccgag gtggacagtt tgcttgagcc caggagttca 168420agaccaacct
gggcaacgtg gtgaaaccca gtctctacaa aaaatgcaga aaaaaaatta 168480gcttggagtc
atggtgcacc cctgtagtcg cagctactca ggaggctgag atgggaggat 168540cacttgagcc
tgggaggttg aggctgcagt gggctgagat tgtgccactg cacgacagag 168600tgatactgtc
ttaaaaaaaa aaaagaaatt atagtaaact atattttttt aatgctacaa 168660tggataaaaa
tggttacaac tagataagaa gcctctaaat tgcctgaaca cctaagactt 168720atttgctagt
tgacatctga tctatttctg acactgaaaa tttcagtgct ttagggtgct 168780aaaaggaaaa
aagagggttt gctaggaatc ctgcagttgc tgagggggta ctctgataag 168840ggaattttaa
gcaattacag aacatctcac taaagttttt aaaaaatctt taaccagatt 168900aaacttatac
aattttgtta taatttgtgt ttctccaggt tccaattgtg gacaagataa 168960ggaccattgc
tcaggctgtc tatggagcca aagatattga actctctcct gaggcacaag 169020ccaaaataga
tcgttacact caacaggtaa aagttctact tttaggggaa aagaaaaaat 169080tcaccttagg
ctctcagaat actcagcttg acttgaggat ttgtacatat cgcaccagct 169140aacctttgct
taatctattg tatggttaac aaagatgaag gtaatatcct cgggtagagt 169200gtagactata
tttagaactt tatggtgagg catatatcct ctgatccatg catcatttac 169260ttctgtgact
ataaatgtgt ctgatatggg tggtatccct gtttttaggt gatgtgtgat 169320ctttcatccc
tcccactcag ccctgaatgt tgaagctatc ctttggagta gagctgcgga 169380gtcagattta
gatgaattgc aatgcttcct cttccttaca ggcctcttac ttactacctt 169440aaaaatgcta
gcgtggtgcc agggtaggca gataggagtg cagcctataa agatgggaat 169500gtttgctgtt
cttcttatgc aagcggttca ctggcttttt actgagcttg tccgctgaga 169560ttgaaggctc
catcatcttc tacctctagc cactgacaaa ggcaagtagg caaacagctg 169620ggaaggtggc
tgtgatctga cagcatgcat caccatctag tccagtgaca gtcatgatca 169680atcaactcca
ctacaggagg ctcagccacc ttcaccaaaa ggaccgtatg cctcgagtgt 169740ccctcctact
ttggtgtagt cggatagata gcagtcagaa cactttaaga gcctgctggc 169800taaagaactt
catgatgggt agcgttgacc ttttctttaa aaaaaatctc caactcctta 169860ttcttttttg
taaacccatt tgaaattttg ttaaatgctg ccgttgccag ccagggtatt 169920tgttctccta
ggtttggtaa aatagtcagc aactcaggaa gttctaaatt cttagaattt 169980tccctgtatg
ttcttggctt atcctttaca aaaaggattc ccaagcactg ctctgtgcac 170040aggcattgtg
gtgaaatttt cattcatgtg caatgaacca agtaaaataa ggtcattctc 170100ataaggttgg
tttgtttttt tttccgtgag tttttaataa tgatatttaa gactatcatt 170160tattctaata
ttaagtcttt tctcctttca gtgttaaaat ggtatttctt ttatgaagta 170220atctggtgag
agtatatggc atattggttg gtttttttct ttttaatatg cctactgaca 170280aaatttttac
attagcaaac ctgtattagt tcatagaaat tacttcaata tttaaccatg 170340tctgaaattt
aaaattctgg caccatttct gaaagacagt tgtgattgtc actcagcaaa 170400attcttatcg
gtatagaaga gtggagattt caaaaagaaa atggccccaa gtccaagtcc 170460tccctgagtt
gataagactc taagatcttc caaatagttt taggctaagt cttatgaaaa 170520tttaactaaa
gatttcattc cctttatctc tatcaaggaa atattcctta aaagtgatgt 170580caaccgggca
cggtgctcac acctgtaatc ccagcacttt gtggggtcaa ggcagaggat 170640cagttgaagc
caggagtttg agaccactct gggcaacata gcaaaacctc catctctatt 170700tttatttttt
ttttaaagtt agtgtcaagt atgtggttgc aatgaattag caaagcctct 170760aaaggaacac
tgtgtgccat ttaatcagaa ccaggctttg aggacactgt gctttttgtg 170820agacaccaca
tgtgacttcc tgctctttca cctttggaac tctgggtcag acatagactt 170880tggtccttaa
aaacaagaag cttaaatagt gacctcatct tagttcttat aatgtaattg 170940cagatcaaat
agccatgatt cataaaccaa cacaaaggaa ccctttgctt cctgactgac 171000tccagaatga
gtgaacaaca tgagtaagag cctcgtcgca gagcctttct ccagactacc 171060ttgctgggtc
ccggatatta atagtcttgg aaagtggttt ctacagctgc tgctttgcct 171120gaaaatagcc
atgctttccc caaacatgca cataacactg ccttctctta aagagagcct 171180ccaaaataaa
caatagtcac ccttgacctg ctaaacagtc atcacatgga gatgagtttt 171240tatcctcacc
ttcttgcttc ttcagtgaac aagtagtaac aagcagctcc tgcgtgtgcc 171300tcgatctgag
ccggcatacc attgctgccg agactgtata aagtgcaaat tgaaaagcac 171360aggagactaa
catctgtatc agtgaacaca ttttctttcc tttctcacag ggttttggaa 171420atttgcccat
ctgcatggca aagacccacc tttctctatc tcaccaacct gacaaaaaag 171480gtgtgccaag
ggacttcatc ttacctatca gtgacgtccg ggccagcata ggcgctgggt 171540tcatttaccc
tttggtcgga acggtgagtg agtcacattt tccaaaaacc ctccccattc 171600tgcattgctg
tggtgcctca aatcgttatg ctccgcccgc tctttaaaaa tcatggatta 171660gggaaaattg
ggagtaatta atagtacagt attcgctatt tttctaaaca ctgtgtctca 171720cctacttttg
ccattgtggt ttggattttt ctttttttag atcatgtgtt cagggtcctt 171780agagtcataa
tactaatttc cttcctaagg tgagtaagaa cataacttgg gagaattcag 171840tctgttattt
aaatggtaga atggcttaag acagtggttg tcaaccttta tcacacatag 171900cacccaaaat
gatgtcatca cggtggggtt gactgctcac accagacagg atctgcccag 171960tcaccccaag
ggctggggaa agcagtcctc accctctctc acttccccag cacgtcaggt 172020gggaagctct
ggcttaggaa attactgaat cataggagtt ctcatttttg atgttgttgt 172080gatgggatca
tgttgctatt ggatgattat tctgttattt tgtgttatct atgggggtaa 172140aggagatggg
cagataaata aaagcttatc aaagtgctta aaatccatga agatgggctt 172200catttggtac
agctacacag tgatagcctt ccgttggtac cgtcagtggc ctaaatttga 172260cttcttgtct
ttgggagtgc tcctaaattt ggtttagaac aaaaaagaat tcttacaacc 172320tagcagatcc
catggcaagc aaagaaaaaa atattctaac ccatcattat cgtcatcatt 172380agtgcaaagt
gatctgagct taaacagcat acattatcat aattttttat aataaaccat 172440gttgctttat
tgtgcaaaca agcaatagat agataaaaag tgcagattct tgccttgata 172500tttacatact
tcatctaata ttggacacaa tgttttttca atcagcattt tttgaattct 172560tcctgcatgt
gaggcacaga gattttagga tgagcaggtg tgagacgcat gggtttgaat 172620agcacagagg
tatctggaga gcactgtgtt ggtattctga atgccctcag agcccagggg 172680gataacagct
cattatgcct ggggtagaac ttacccagga tgtggaagaa gaggggattc 172740ttcaaaaaga
gctcagcctc tgaactggca aaacagtgag ttttgactca aaggcaccaa 172800agtactaata
catgacatcc tctcctgggg tgatccgatg tggctggagt tggatgaggg 172860ggggtggctg
gggatgggaa tgggaatggg attgggatag caggacaggg tcacatggcc 172920aaggatcttg
aagaccatct catggaactt gaatctgatc tctacacact ggggagccac 172980ggtagggtgg
gttttttgtt tgtttgtttc ttggtttttt gtttttttgg ttttaaataa 173040ggaggagaag
aatgtggtct gatgtgtatt ttagaaaaat agctctggca gcaatatgga 173100gaatgtttag
tgaaaagaga accaggccca gagaggtggg ttaattcagc tcgggccaga 173160ggcaatcaag
gtccgcactt tggccacaga agtagataca gagagtgggg gtgaattgca 173220gacatactct
aaaagtatcc tcctaccgtg ggggccgggg gaagggaccg ttctagcttg 173280gtgactctgg
ctagtgatgc catcggtcat ggaaaggaca cgggagaagg cagattggca 173340gagccagaaa
tgagcctggc ttgaacagag agagctcaga gggttaatag gatctaagga 173400atagtctgtg
tctccactgg aaatacagac acagtgcttc aaagaaagag gcaggaaggt 173460caccctggtg
ggtttaagga tatgggtgaa ggcagtcgtt aaacaagaga cacagggata 173520gctggaaatc
agtgagaaga cagtgaagga agacacccct gaaacaccca cgttaatgca 173580gccgactcct
agcaggggtg ccttccagga ggcgggaagg aaggagcagc ggcatggcag 173640agaacgtcca
gaaaaatccc acagacaccg ctcgtagcag tgttggcaca gggagaaggg 173700cctggatgaa
atgggttctg aatgacattt tcaggtctac agttcagatc agtggttgcc 173760aggagttggg
ggaggggaag cgtttgactg cagaggggca ggagggaact ttttgggata 173820atgcaaatac
cctatgtttt gacgtggtag tgattccatg gctatataca ttcttccaga 173880ctcatagacc
tattcacgtg tgttgcccag gctggactca aactcctggg ctcaagcaat 173940cctcccacct
cagcttccca agtagctgtg actacaggct catgccactg tgccaggcct 174000gtgtataatt
taattatacc tcagttaaca aaaatgggtg ctgaagagct tgctttatat 174060tcgttagaat
ggcctcagtc agccgggtgc ggtggctcac gcctgtaatc ccagcacttt 174120gggaggccga
ggcgggcgga tcacaaggtc aggagttcga gacctgcctg gccaaaatgg 174180tgaaacccca
tcactactaa aaatacaaaa atttgccggg cggggtggtg ggcacctata 174240atcccagctg
ctcaggaggc tgaggcagga gaatcgcttg aacccgggag gcagaagttg 174300cagtaagcca
agattgcgcc attgcacttc agcctgggtg acagagcaag acttcatctc 174360taaatacata
catacataca tacatacata catgcataca tgcatacatg catacataga 174420atggcctcag
tgatggccac tttactcctg agctttagct ggtgaaagcc ttggcttagg 174480gataagaggg
cggctggaca gtgcaggccc agagagactg gaccatcagg ggaggtgagg 174540ggtgacggag
gtgcccaaac agagcatgaa atggtagagt agctgagaag gttagcctca 174600gtcacgatct
gtcctgtcct accctctgag gactctaaac cccgcagtag ttcacccaac 174660aaattcattc
actcggccaa tgctgcataa ggcactcagc tcggggctac agggagcccc 174720aactttaaga
agctttctct gcttcaggta gtttacaggt ggtagacgca gatagaagtg 174780aattagtaaa
gagaatagtt tggtgtaaaa attgtgcatt taactgaaac cttaattcac 174840tactgagcat
ttaatagcca ttaaataaaa accagggccc gcacagtggc tcacacttac 174900aatccgagca
cttttggaag ccaaggtaga aggattgctt caggccagga gtttgagacg 174960cctgggcaac
atagccagac tttgtctcta cttaaacaaa aattagccag gagtcatggc 175020acgcacctgg
agtcccagct actcaggagg ctgaaatggg aggattactt gagcccagga 175080gtttgaggct
acagtgagct atgatcatat cattgcactc cagcctggga gatagagtga 175140gatctctgtc
tctaaaaaaa tgaaataaaa taataaaaaa cacaatataa tttaaaatct 175200ttggagtcac
taaacaaata tacaatgtga atctcctctc actccagcta acgccaccat 175260atccaacact
aggtaaagac caaagccgtt ctggaaaatc agaatctgtt gccatggctc 175320atgcctggac
accaggctgt ccgctcctga cgtcactctt tcacttacga cttgttagaa 175380aatgaatgtg
tggaatggcc ctggatagct gctgcagctc cctggtttcc tgaagtgagc 175440cctctgtgtc
attatctggt cttctcagcc ctcagccgag ttcctcctgt ggccacacac 175500ggggtcgcag
tgagcgtcag tgcacagtga tgaaatctcc tggaacagtc gggtgtgggg 175560catgaggaag
ggagtggtgc ctgactacag aagttcctgg aggtcaggaa actctcactg 175620gaggcggtgg
ccctagaggg ctgctttcta tgacagagca ggaagctgca tgtgtagtgg 175680gacacacgca
ccagaggggc caggatctcc taataaggac tggtatttat ttttacttga 175740ccccattctg
ggccttgcca cagtctgctg aaatgccttg aacattcatc acccgtgtgg 175800gagacaggat
agtaatttcc tgctgagatg actgagggga cggggaatgg ggacaccagg 175860gggactggct
cctgcaggtg ggaaaattct agcaaaatga atcactcttc tgcttagtca 175920cccaatgtgc
acttattagg gtgctgctga aatacatatt catctatcca ctccagaaag 175980atagtccaaa
ctgagacctc aggatcagaa agcctcccat tctcctgcag ggttcattca 176040tttactcagt
gaattattaa atgcctacag cacgcagggc catcagcccc acagctatga 176100gaaagattca
gtccttccct cacaggcctt atgatagact ccagtaaata aaaccataca 176160ggcacagaat
gcggcgggct ttataacaaa ggcagaaagc ccctcttgtt gtgtcctcaa 176220gggaggcaga
gaagctcccc tctggctcta gttggagaat cccaacccaa gcaagtccat 176280ccttaaacaa
taaacatctc cagcattgct ggctttggaa gaccccatgg tgagatgctg 176340gagctttttt
ccgctccaga tttcgcttct aaatatttac ttttactttc catgttcact 176400tataggccag
ggtttttttt tcctaccatt gaattatttc gtcatgttct atttaattat 176460tgctttaatt
ggcattggtg accacaaata ataataaata ctattaactg gtgtaacagc 176520ttcactagtc
tatttcactg ccccatgctg ttgtaactga agcccccagc acaagcattc 176580ccagctcgtg
gtcatctaag acctttgagg tcattcttac atgcattgct agtgttttcc 176640atattccttg
atggtagcca atttttcctt cttaaaataa ctttcttcat tcctagtaag 176700cttcatatgt
attttatgta ttcttccctc ttttttatga catgatacat attattgaaa 176760gctaggaaat
aatgcaagat tcttgaacga cataatacca cttagtacat ctttttgtaa 176820gagatagttt
ttctaataca aatctttgat atgggaagaa acaagcaagt tgttttttaa 176880ggttttccaa
ccctatattc tactacaaat taccctgtta tggtatgctt ccaaaattct 176940ttcaaggatt
atggtttata gagttcaata aacctaatag tttataaaac ttctcaaaaa 177000ataatctcat
gccaaaaata atttagtagt aggtggtctt tgtcttgtgg tcacatggtt 177060ttgatattgt
ctttaagttt ctataattta caagggctgt ttaatcagtt ggcataaatt 177120tcatgacaaa
aatactaata tgtaagcaaa gtatatgaaa caatattatt gagaagtaaa 177180ttacattcca
tgtagaaaaa gactagaaag acaacaccaa aaccttaaat tagacagtgt 177240tattatgagt
agcttttttt cttctctttt tccatataga tcaaattact acattgggcg 177300tatatcactt
ttataacatg aaaatacagt aaatagggcg atgataaact gcaagtgctt 177360ggggagaatg
cttaactcag gccagttgca agagagtgag aacacacacg cagcctgtgg 177420gagcgggctc
catgccatca cctggccggt cctggctgtg ttgctgtgtt ttcgcacctc 177480aaaagttggg
acagcaaaga aaggccataa gagctagaac gtttccataa gaagtgtatt 177540cagtatgatt
tgtctagctc tgactaatgt gtgcaaaccc cacattccac taagcaaatg 177600caagattgtt
tttccctgtt catttgttcc tgggcccttc ttgcttactg tgttgggagt 177660tggagagggg
ataattgact tctttcttct cacagtgttt tctctttttt tttttttttt 177720ttttttttga
gacagagtat tgctctgtcg cccaggctgg agtgcagtag catgatctcg 177780gctcactgca
acctccgcct tccaggttca agtgatactc ctgcctcagc ctcccaagta 177840gctgggacta
caggtatatg ccaccacatc tggctcattt tttgtatttt tcaaagagac 177900tggggtctca
ccatgttggc caggctggtc tcaaactcct gacctcaagt aagccactgc 177960gcccggccat
ctctcacagt gttttttaaa acaacaatgc attttcataa cagttggcct 178020ggagaggtgt
ccgaccaagt tggaattcat agtagaatcc attggaagtt aatttttgat 178080gtatttgact
gttttccatt gacttcgcat tgaggcagtg agagagactt gaagaactta 178140gaatacgcac
ccttttgtca ctggcaaaca tgggacatag aggcgtgtct tgacagcaat 178200ttgtggcaac
actctgggct ggaatctagg ggttagacag cctggccggt tttgtaaatt 178260agatagccag
gccaccctca gagtcagttt gagggttgag tgggaagcct gaataaaaag 178320agaagttaaa
cttttaagaa aagggtcaag ctttaaaact ggaattgggt tttcaaattt 178380aatgaggaaa
aagaaacccc aaaactggaa ttgtaattgt ggagaaatac aactgataat 178440atttggaata
tcatctacaa ggaatgaaac tatccagaag catttagatg gtagtcagtg 178500ccatgtaggg
cacctctggg ggcagggtgg ctcctggcac tgtggagaaa gacgactggt 178560gcttccttct
cccaatcgga ggaaagggat acactgtgga ggtcccagca gcttagggcc 178620tgccccccat
gtggttatct ctgtgccgtt tccttccctc tgtcttcctg taactccttc 178680actttggccc
ctctgccctt tctgggcaga gggtgacagg aggcaccgca tgggtacatt 178740cttttttttt
ttttttcttt tttttgagac agagtctcac tctgtcacct gggctggagt 178800gcagtggcgc
aatctcgact cactgcaacc tccatctccg ggtttcaagc gattctcctg 178860cctcagcctc
ccgagtagct gggattacag gtggccgcca ctacgccccg ctagtttttt 178920gtatttttaa
tagagatggc atttcactat gttgttcagg ctggtctcaa actcctgacc 178980tcgtgattca
cctgcctcag cctcccaaag tgctgggatt acaggagtga gccactgcgc 179040ccggccaact
acatgctttt ttcctcctgt cgggattggc ctgggctgta cctatctcat 179100ggtggaaacc
tgcccaggga ccaggcccta gaagaccagc agtttattta actgggttgg 179160cctttgtccc
cactccctgt gacccggccc tgccacatct ttacccaggt gcaccactga 179220catcatcgct
tggctggact ctggaaagca ggaaactgct ggagcgtgtt tggggcatca 179280gtgatgtcac
ccgctacaaa tgctgggtgg ggttactgga ctgagggcca tgaaaaaaga 179340aagctgcggc
cccggcccgt ggactcactg ccattctgct ctttctcacg tttcagttca 179400gctcggtttg
ttttccgtcc ccctccctct actgtaaaag tatgtactca gtgtttcgtt 179460tcctgctgga
tctgattcag gtcagtgatg aagttatttg tgaactcagt ggggaggtgt 179520ttcctcccct
ttgtaccggc cctacttagg gatgcacggt tggttatacg cttgcttcac 179580tcggttaggc
atggaggaga attcattcag ccctcatagg tttaaatcaa atgcatgacc 179640cttcacattt
tccagagatt tacggccagt tacaatgaga caattaaaca ttcacccccc 179700cgccaagctg
cacttggaga tgtgtaagca gtaatgtagt catgcgctcc ccatgatgtg 179760agagcgtgca
gaacgccact ccccccagca ggtccgccag gacctccgca gggagcatct 179820tgcctgcaaa
atcacacatc cctttttccc cctctctcca ctttccctac tacttttcca 179880ctcccacacc
cacaaataaa caaagaaacc ttgagcttat tttacaaaag tctttgaaat 179940ggccctcgtc
tctccaatgc tcctgggtcc tggtttggcc ccatctgtgc aatctctctg 180000atgagtgtga
aataagccag gtttggctca tgctctgccg gtgagcttgc cctccccatg 180060agcctgtcct
ctccatctgg cctcactttg tttgtgtccc tgtgtccttt gtctcccatc 180120ctgggatcct
ggtggtttcc tcctcccctc ccagcatgag tccttttccc cagggttctt 180180ccctatctct
acatcagaat ttcctgcttt ctcccaaata tgcatttccc tggcccaggc 180240atccactgct
tcctctcatc gtgaggtccc tgccgcaggc tgctgatggt attgtggccg 180300tgccttcctc
cacaacccag ggaaagctac gtgtgtgtct gtcccatagg gaaggagtca 180360tccctgtctc
ctcagtgtgg tgtgttcagg aggaaggaaa gtaaccagcg tcgttctcaa 180420tttccagaaa
ctgtttatca tattgacaag aagaagaaag ctttgtgact catgagaatg 180480atgttttcct
ctctggtaca taaggttatc taggtccaat ccatttgtgt agaggatctt 180540ctctccccta
gtgtgatatg cacttatttt tagcacaagt ataaaactac tttaaatgaa 180600atcagcctca
gccagggaaa tatgctgagt aataatgttg ccaggtacta taccactgag 180660ttgagtttgt
aattcactgc tattaatccc tgtgtattag ttctgatttt ttactctttg 180720catacataga
aaaatggtgt ttctcttcag agtcaaggag ggaaaaaaag aaaagataaa 180780agatgcttat
aacacttttg tgtccacccc taaaatcagc atattgatct attatttttt 180840ctaggtattg
atgagatgtt gactcatata ctttcataca actggtaata tatatatata 180900tataatatgt
gtgtgtgtat atatatatat atatatatat atatatatat atatatatat 180960agactcattt
ttcttccact ttcatctaca gcctacttgt tttcttcccc ccatttttcg 181020tatggattca
tagcataaat tgacaacaaa gatacatctc aatttagagc ttccaaaagg 181080cacccataaa
agtatctggt gctcatggaa ttgctctttc ttctatgtct cttagttaca 181140cttttctctt
attacacttt tctatcccag ccatggctaa gaggcgtata atcagaagca 181200gctaagcaat
atatgcaagg gaaataaaat gaatgcaaac caagataccg gttttgtaaa 181260ttagatagcc
aggccaccct cagagtcagt ttgagggtat agtgggaagc ctgaataaaa 181320agagaagtta
aacttttaag aaaagggtca agctttaaaa ctggaattgg gttttcaaat 181380ctaatgagta
aaaagaaacc ccaaaactgg aattgtaatt gtggagaaat ataacggata 181440atatctggaa
ctcgcggcag tttacaggct taactatgta tgtgttcctt aaattatcca 181500cccgcagata
tcagattaca ataagtcctc acttaatgtc attgataggt tcttggaaac 181560taccacttta
agcaaaaaga catgatgcat atgaaaccag ttttcccata gtctaattga 181620taggaaaaag
agttaagtta tgcagccaca cagtacctcg tttggcttaa agtcactgtt 181680tccaagaacc
tatccacaat gttaagtaag gacttaccgt atatgaaaat atttgcagta 181740aattgctttt
gagagaattg gttatcacac tcttctctgg gtatcattga aaattcagaa 181800tgtttaatta
catcaggcta tactcactgg tataaaatca gggagcaagt ataaagtagg 181860tgtagggaaa
taatttatgt tggattaaat gatccaatga agggatttta ttaagcccct 181920agaagtccca
gaatgaaaat gtataactgc atgactaacc cctaagaatt tgctggggtt 181980tctgcattaa
gtacgcttcc caggccattt gtaatgaagg taccgtgatc tgttaaggaa 182040gagacctaca
gctgggaaca aaaagagcca attccatcct agacatcagt ctcctgtttg 182100actcattcca
accccaagaa tatgttgtag agtgtaggac ccatgaaagt tacttactgt 182160gcttccctgt
accttccgtc atccagttcc cagacatatc tgtgtttctt ctctgaatgc 182220ctagttactt
gggatcccat gctgtattgg tctgttctca cactgctgta aagaactacc 182280tgagactgga
tagtttataa agaaaagagg tttaattgac tcagttttac aggctatacg 182340ggaagcatgg
ctggggaggc ctcaggaaac ttacaatcat ggtggaaggc aaaggggaag 182400caagcacatc
ttcacatggc ggcaggaaag agagagagtg aaggggcagt gccacacact 182460tttaaacaac
cagatctcat gagaactcac tatcacgaga acaacaaggg ggaaattcac 182520ccccatgatc
cagtcacctc ccaccaggcc cctcctccaa cactgaagat cataattcat 182580catgagattt
ggctggggat acagcgccaa accatagcac atgcctttcc caggactggt 182640gatggaggaa
cggtgccctt gtcacaccta ccagaatata ctcacccacc tccagacact 182700ccatttctct
ttcaagcaca cctcttcact tgtgccgcca gctaaggttt actgcagtta 182760cttcctttgt
gtgtcctctt accttaccaa gtcgcctcac ccaccatcta ctgttaatta 182820aattaccacc
tcccccactt cccacctcca gagtagagac tgtgtcttca gttctcttgg 182880ccttcccaca
atgggaagtc cagtgtagaa cataccctag gagccaacta aatgtgctag 182940tggcctgcca
accctgcaga gggctaggga atcttcaagg agtctgctgc cagtgaaact 183000caccagtact
agaatcccag gctccaaaga agtctagaaa agtacatctt ttaaggaggt 183060aataagaggc
tcaagtgccc acaaatccta tctgctcttg gcctagttct tttgggatca 183120atgtgctagt
ctctcttgtc gaaccagtta ctgaccagca agactaaaaa accaggccca 183180aacacattct
attcatgtgc ttagatatgg acctacggat caaatatccc actgtaaagc 183240aaaacagcaa
actgtttttc ccttgtactt tcacactcaa caatagcacg cttctgtggc 183300tggctgtgtg
ggggcttttc cttacacacc aaatatttct cacaaacacc aaccaggtgt 183360cctcttattc
aattcaatcc tgacactgtc tacctggaga tggtgtcgga tcccacagat 183420tgacggctca
gtcccatgag accagcccca cttcaggcac taattgcaaa tccaggccat 183480gcatacttct
ctgactgcct ggctaaaaac cgcattcaca gccgggcacg gtggctcatg 183540cctgtaatgc
caacactttg ggagaccaag gtaggtggat cacttgaggt cggaggttcg 183600agaccagcct
ggccaacatg gtgaaacccc atctctatta aaaatacaaa gattaggccg 183660ggcacagtgg
ctgacgccag taatcccggc actttgggag gccgaggcag gcagatcacg 183720aggtcaggag
attgagacca tcctggctaa cacagtgaaa ccctgtctct actaaaaata 183780caaaaaaatt
agccaggtgt ggtggcagac gcctgtagtc ccagctactc aggaggctga 183840ggcaggagaa
tggcgtgaac ccgggaggcg gagcttgcag tgagcggaga tcgcgccact 183900gcactccagc
ctgggcaaga cagcgagact ccgtctcaaa aaaaaaaaaa aaaaaaaatt 183960agctgggtgt
ggtggcgggc tcctgtaagc caagctacta gggaggctga ggcaggagaa 184020tcgcttgaac
ccaggaggcg gaagttgcag tgagttgaga tcctgccact gcactccagc 184080ctgggagaca
gagtgagact ccatctcaaa aaaaaaaaaa aaaaatgggt tcgcatgagc 184140cccctccttg
ggttcaatta agttcctagg atggctcaca gaactgagag aaacacttat 184200ttacagaggt
gttatgatat atattgattt tcgtccatgg ttcctggttc ctgaatccca 184260taaaccttct
tacagccttt tgttataatg ttgggggtgt cacgcctcag gggcaggcct 184320ctgaccttcc
ctgccctcct tgcactgtaa tgtttccccc atttctgatt gtgagtctta 184380agaccctccc
atgagagggt cccaccctac accttgtggg aaggaatgtg gatgtcatga 184440agtgtccata
aaaacccaag aggacagggt tcaaggggat tccagatagc cagacatgtg 184500gagcttcctg
gaaggcaggg cacccaggtc gggcatggaa gttctgcacc tcttccccca 184560tacctcgccc
tacacacctc ttcatatgtg tcctttgcaa tatctttttg ttgttgttgt 184620ttgtttgaga
cagagtctcg ctctgctgcc caggctggaa tacaatggca tgatcttggc 184680tccctgctac
ctccgcctcc tgggttcaag cgactctccc acctcagcct cctgagtagc 184740tgggattaca
ggcatccacc atcatgcctg gctagttttt gtatttttgt agagacgggg 184800tttcaccatg
ttggccaggc tggtcttgaa ctcctgacct taggtgaccc gcccgccttg 184860gcctcccaaa
gtgctgggat tacaggcgtg agccaccgtg cctggcctgt agtatccttt 184920ataataaact
ggaaatgtta agcatgtgtt tcctttagtt ctgttagctg ctccagcaaa 184980ttaatcaaac
ccaaagaggg gagtcctagg aactccaatt tgaaaccagt tggtcagaag 185040ttccagaggc
ccagacttgc gactggtatg gtggggggca ctcggccctg tgggatctga 185100cactatctct
gggtagacag ggtcggaact gaactggaag ccaccccacc tggtgtctgc 185160tgcttggtgt
atggggaaac cccatacaca tttggtcaca gaagtcttct atgttgatga 185220ctgttgtggt
gcgacagcag aggaaaacac ggtttgagga gagcttttcc caacacgctt 185280atgtgtgcca
gtttattata gaagacacag atgaacacca gatgaagaga tgcatagggc 185340atggtctgtg
gagttggggt tcacctccct cctggcacaa ggatgccttc accaacccag 185400aaactcatca
agtcttgttc aaggattttt gtagagctta atctccactc cccgccctcc 185460acccttcctg
gaggttgatg ggtggggctg aaagttccag ccctccaatc ttctacttac 185520ttggtctttc
catcctgagg ctagctggag gcccaccccg tcactccatt accttaagct 185580caggtgttat
tgaaggagct ccttatgaat agagcaaaat cacccctatc agaaatttca 185640agagttttag
gaaacctgtg actggaacca aggacaaaga tgaaatatat tttgtatgat 185700agcacaccca
ccagattgga tcgtggggcc atatggcctt tagtgtggaa gctgcttgac 185760cacattttcc
gcccatgcac tgccattctc atttcaggct gcgtcatcat catcatcacc 185820atcacctgtt
cttagcaaag gtcgcacacc ttccaaatgc tttgtataga ttagctcatc 185880actccccata
ctcaccctgg gagcatagtc agggataaga ggtagagttc tagcgctctg 185940tggctactat
ggtttatggg atctttgcgc aggaataact ggactttaaa catcttggct 186000tttcttttca
gtaggtcagt cggcatgtac tcacagggcc gtgcatgctg agcattgaga 186060cacatcacaa
gggagtgaaa acctgctgcc ataccccccc agaggcagca ggatgtttgg 186120ataaataagg
cataaacacc tggagaatta aagcaagtac aattagtgca gcctttgcca 186180agagaaaggt
gtaagtgcta aatgcagtag atgttcaaat acggcagaga ccacagtggg 186240cagggtaggc
caagaatgct gcctaaggag tggggcttgg ggtgaaactg gggcatttga 186300gacactgcag
gcccatcacc ctggctgcgc cctcctgagg gaggcaggga ctgaggcggg 186360aggcctggag
aggtctgtgc gtgagcctgg gggctcattg ataaggagct tccagtgggg 186420gcaataacat
tggaagtgga gaggatggca tgtgtgccgg gagacgcagt ggagtggaag 186480tcgccggaac
tgacaaggga tgggatgtga gcaaggagtc caggtgagtc ctggggacct 186540gaaaggatgg
acccagagag ataaacgact tcaaaccggc acggcagtga agtgactgac 186600ccgcgagtgc
cctgcaccga ggctgacatt tgacttcctg ccaagttggc catcagacag 186660ttagaaatca
attttggagt tcaggagaga ggccagagct gaaaattggg gagttgttta 186720cattgggagt
gatatacagt ctctacattt ccatccagtt caaaacattc ttgcgtgtct 186780gctatgcact
gtgcactgag gttaccaaga ctggaaagac ccaatccctg tcctggaagc 186840ttagtgttgg
caggaggcag acaagctgat acatgcaaac aaagggctca caggtgccag 186900gatgccaagg
gatacagaag aaggatggtt gcttctctac ccaggagtgc cctgaggtgt 186960cttaaaaaga
ggagcattta tgactcaggc agggtggcac attccagacg gcagtgggca 187020ggagcctgac
tccagagcgc cttagaggct gccgctggga gttgagctgg accctgaagg 187080cagtagggta
agccataaat tagggttaag gaaccaaggg aataagggga tcagaccatt 187140gcatagctcc
agcagcacca ttactagcag cctgtgtgaa tgatggtccc ctcttacatg 187200cccaccatgc
aagggctcta ggccattctt cactctttaa tgcagaaaag caaaacttcc 187260agctgttttc
ccctacccaa aatgtgtgag cctgagccca gccccaggat ggatcttgtc 187320ctaggtaacc
aagaaggcct taagtgtcag aagaggtagc tccaggctga ataaagggac 187380aaatgaagca
ttactgcact gagatcgccc gtcgtggggt gggtctatac ccatctagtc 187440attaagcctc
aataatttgg tttctaaatg gactattatt tgctacagat caaaagttgt 187500tggaaactat
attacacaga taaatagaat tgtcctgcaa ataaaggaag taatctcttc 187560tctgtttgag
ataccatgca gtatttgtcc aacttgtaca aatagttctg atccatgaga 187620atcctccact
ttcaacttac ttgctgctat atcccaacta ttataaggcc atggagacca 187680gtagcctggt
ttccattatc tttgctctgt ttttttttat ggcatggttg acactacctc 187740cagtagcaaa
gagttaactt gtcatcagag ctataaaatc acaatgataa gtgaaaagtt 187800ctcttttgta
ctttagcatc caagagaagt gtaattccca actacaatgc tcaaaataat 187860agagagtttt
tactttgggc ctgaaaatac ttataggtca ggtctcctag ttgggtccat 187920gctgttgttg
gtgttccctc ccaagtaaat acatactgaa tttagcaaat tattcatttg 187980tgatcctctg
tgccatctaa aaatatatat agtttaaaac gctggccagg cgcagtggct 188040catgcctata
atctcagcac tttgggaggc tgaggcaggt ggatcacctg aggtcgggaa 188100ttgaagacca
aactggccaa catggtgaaa ccccctctct actaaaaata caaaaattag 188160ctgggcatgg
tgttggactc ctataatccc agctactagg gaggctgagg caggagaatc 188220tcttgaaccc
gggaggtgga ggttgcagtg agccgagatt gtggcactgt actccagcct 188280gggtgacaga
gtgagactcc atctcaaaaa aaaaaaaaaa aatgcttatc attgcacaga 188340tagctgtaca
gttttttatt atttatgata acggtaactg actttttgtt aaagctggtt 188400tggaatctgg
aggttatttt aattttgata actttatgat tttaactttt aattttaact 188460ttatgatgtt
tgtcataaaa gagtggaatg tgctcctatt ttgagatgga aacatttcgg 188520cagggagaga
gaagagagac accaaaccac ctaacagcca ggttccaagg ggagattcat 188580tcagcgtctt
tgaagcattg ttagttttca tttgaggcat aaattggaaa cttatatctt 188640tctttggagt
aggaagtatt tcagacagaa aaaaaatgat tgctacaaca acatcacgaa 188700aaagaacaga
aggtttcatg aagagtaata aagttaagtt tatgaattag agcttcagat 188760gggaaatctt
atttggacag taaatggtta atggtacagg gacaccatga tgggggatct 188820ttgcagcatt
gaaaaaacac gcgtccataa atccttctct agcacttctt tgtgatcaca 188880gagtcaagct
gaggacaatg atggagtgct ccagttcatc cgaagtgcac gagtcccctt 188940tgcagcagac
tctgacattc ccggcccctc tccggagtgt tatcaacatg cagtcagtgc 189000tctctacgtc
attactttaa aaacaaaaac tcggccagca gcagttcagt acatcttcaa 189060agatcttgac
ttgtaatttt ttcccctgca tcttctctat tgagcacatg aactgacatc 189120tttctacccc
caaataatca gttctttccc cctaaataaa atgggtaatc caaaaaagac 189180ccatcaaaag
gaagctaaag atagtaatgc ctaggtagtt tcgggaccag aatgattgtc 189240tctaaatatg
tacatgtaca gtgattacgt gtttaaaatg tttaatgact tggagaaata 189300gtgtgttgag
tggagaaaaa tgagatttag tagagcataa cgtcaacagt gtaaaaatac 189360atagaaaagc
aagcaggaat aaaggaatgt ttaaataaat tacaaagata aaaattaaaa 189420ccgtaattca
gccgagcctg gtgggtcatg cctgtaatcc cagcactttg agaggctgag 189480gcaggcagat
cacttgaggt caggattttg agaccagcct ggccaacgtg gtgaaacctc 189540atttctacta
aaaatacaaa aattagccgg gcatggtgtt gggtgcctgt aatcccagct 189600actcagaagg
ctgaggcaga agaatcgctt aaacccagga gatggagttt gcagtgagcc 189660tagttcgcgc
cactggacta gactaggtga cagagggaga ctccgtctca aggaaaaaaa 189720aaaaacttaa
ctagaacatt tattgagcac ttactaggtg ccagctagtg ctctgagtgc 189780tttgtgtgta
ttaactcatt aatgccccca gcacctcttg aggtagatta aaatctccat 189840ctcggtttta
catgcaagga aaccagagca ggaagagctt caggaacttg ccctcctagt 189900aaagatagaa
agtagcaaac ccaggattca accccagaaa tattatatac tatgtttgtg 189960ctgaatcttt
tttttttttt tttgtttttg gagatggagt tttgctcttg ttgcccaggc 190020tggagtgcag
tggcgcgatc tcggctcacc gcaacctccg cctcccaggt tcaagtgatt 190080ctcctgcctc
agcctcccaa gtagctggga ttgcaggtgc tcactaccac gccggctaat 190140tttgtatttt
tagtagagac agagtttctc catgttggtc aggctgttct caagctcccg 190200acctcgggtg
atctgcctgc ctcggcctcc caaagtgttg ggattacagg cgagagccac 190260cgcacctggc
ctttgtgctg attctatagt cttattttca catcctatca gttactgtaa 190320gtaactcttg
attccaagaa ctgtataata ggaaagggta gcttccaatt atctactctt 190380gcttgataat
gcattaattc cacatagcct ttaaaagttt agtaaatttt aggataaatt 190440ttcgaactgt
tatttcctat tggctgttag aaaattgtgt tgctcaaagg ggaacatgtc 190500tattgtgcct
tgtggagatt ttcctacagc ttttctggct gacccgtcac ttgcactctg 190560gaactctcct
aaaccagata cggcccacca cgtggcatgg tgtggatcct ggactttctg 190620tcatccgtta
ggttcatatt cttgaagcaa gtctagaact attttctgag agattccctg 190680gctgttgttc
tgtgtttggg tgggttagtc acctttttag gtgagagact tgatttgggt 190740acacagtttc
acaaaagtat tgtcaaccct gctggccaag tgactgacgt gctgggcatg 190800ggagcacact
ccatcggcat ggcagagtgg attcactcca tgccattgcc agagaatagg 190860actttgaacc
acagccaccc ggctttttct gaaaggcctg cttgggcaat gatgcctgga 190920gaattatctg
aagaatgcag tctgtccatt cctctggaga tggtctagtt tttatcccca 190980gaccaagctg
aacctcttag agctgtgcat ttatattgtg tggaattgcc ttcagcgacg 191040tgttctataa
cgtcgtgttc ttgtcacctt ttgagggaca ctggtgctgt ggggaagatc 191100ctcaccaata
cgcaacagca gggccctgtt ctctcacctg ccatcccctc tggaaagctc 191160acagagaggg
tttcctgggt tgctcctttt ccaagcagga cttctactgt agacttacag 191220ctccagttac
aagcctgtgt gcttgtctgt ttccccacaa ggtgttctga cctgggaggg 191280aagggacgga
atcctattca tctctgaatc ctaaatgcca ggcctagcac ttggcacaca 191340gtgggttcca
gttgtggaat gaattttttt cagtcctgac aataaatgtt ggggttcttt 191400agtttgtttg
ttgttattgt ctgttttatt tgtattttgg ggttttttgc tttttccccc 191460agggaatgaa
aatgccttct ccatatgaac cagacatatc ctggggctgg ggctgagtgt 191520cctttgttgt
ttgctctaat tataactaat gaacagttca gctctggtaa ttgtaggctt 191580gctggaccca
gtgcctgcct tctaactgaa tcttgactct gctccattat gtattttcca 191640tgatcctgat
ttctgtcaat ctaattttta aatttcattc taaggcaaac ctgctcatgt 191700tcttcgctga
aatcaacctc ccctttctgg aatggacata tggaaaattg gaggcaaaat 191760ataataatgg
tggtgactca gactgcttta aggcctgaga gaccggccag gtactgtggc 191820tcacgtctgt
aatcccaaca ttttgggagg ctgaggtggg tggattgctt gagcccagga 191880gtttgagacc
agcctgggca acataatgaa aactcatctc tacaaaaagt tactgggctc 191940agtggtgtgc
acctgtagtc ccagctactc aggaggctgg ggtgggacca tcgcttgggc 192000ccaggaggta
aaggctgaag tgagctgtga ttgcgccact gcactccagc ctgggcaaca 192060gagtgagacc
ctgtcttaaa aaaacaaaaa gacccatgag acctagactc cccctgtcaa 192120aaatactcac
acacacacat atacgataag atagtatttc taccttttct ttctcaaaac 192180attccacaga
acagtcttac cttcgaggag agccgtatgc atgacctttt gtttgtttgt 192240ttgagacgga
gtctggctct gtggcccagg ctggagtgca gtggtgccat ctcggctcac 192300tgcaacctcc
gcctcccagg ctcaagtgat tttcctgcct cagcctccca agtagctggg 192360attacaggcg
cgtgccacca tgcccagcta atttttgtat ttttagaaga gacagtttca 192420ccatattggc
cagactggtc tcaaactcct gaccgcacat tatctgcctg ccaccacctc 192480ccaaagtgct
aggatttcag gcataagcca ccatgcccag ctggaggcct tatttttact 192540gctattcaga
gaacctgtgg cttatccatg gagggcttaa gctttgcctt ttctcttgct 192600catttccata
cctagcaaga gcttaagctg gtttattgca ctgacattct aatgaaatat 192660aaatagcttc
tcaacagaga gataagaaag aggagtatct agtacagcaa gaccctctgg 192720gtgggtcagg
gccagctggg gccgtgccgg ggctctgagt ggaggaggtg gcgtgggagc 192780actgatgctc
gggctctgtc acatccgttt ttggcatttg tcctggcaaa ggctgcattt 192840ctgatctttc
agaaagctgt gggtacactg tacagaaatg tgttttgctt gctttgaagg 192900atgacctact
ggctgacagc cagccagaat taattccata tacatttgca gattttcagt 192960atgagagacg
gcaattagcc tacagaaacc catttaaccc ccgaggtgga gctgtatggt 193020gaagactgtt
atttatggga gtctgtgtaa tttgcctctg tgacaaggga cagccatggc 193080tcaggagggc
ccacgccacc ctccttttca aggactcctg cccctgcctg agtggcttgg 193140cctggcctgg
ccatgtgagt cttgtggaca cgtgcacatc agtcagggac ccctgtggta 193200tggaccaaag
ggttgttttt tactgtttta acttatttaa tagcatctca tcagtattac 193260aggcatgtgc
cgccacgcca ggctaatttt ttttgtattt ttaatagaga cggggtttca 193320ccatattggc
caggctggtc ttgaactcct gacctcaggt gatcaccccg tgtcagcttc 193380caaagtgctg
ggagtacagg cgtgagccac cgcacctggc ccacagagct tctaaaggaa 193440atgtgttggt
tttttaatca aagttgaggc agtgcctcag tgaggggcag gctggagcag 193500tggtaagcgc
gctgaaggag tcgggctctc tgggcccaaa gtcctgccct ggccacagag 193560tggctccctt
ccccctgagg tgggatgatg ttctccaggg ccttttagct ccaggctgct 193620acagttctaa
actggtggtg gtggtggtgg tggtggtggt tgtggtggtt gttttagatg 193680cagtctcact
cttttcccca ggctgggatg cagtggcatg atctcggctc actgcaacct 193740tcgcctcctg
ggttcaagcg attttcctgc ctgcgccacc acgcacagct aatttttgta 193800ttagtagagt
tggggttttg ccatgttggc caggctgctc tcgaactcct gacctcaggt 193860gatccgctca
cctcagcctc ccaaagtgct gggattacag atacgagcca ccgcacctgg 193920ccatgattgt
tgtttttata aaatcagaat ataaaatttg aaaatgttct tttttggtga 193980gaaatgagaa
ctaccaaagg agtaaaatgc tctaaaagcc atcaggtcct tcccacagta 194040ttgaggctct
gctcttacca ttttgttaaa taagttttgt tcctaacact ctttgtgata 194100tgaagtattc
ttgctaatca agctgcccgg agactggcgt ttcccagagc agtttcgaaa 194160gtggaaatga
tgctgcattt cctacccaca gtctggaggg tggctgtgct ctctgcacgg 194220tgggtctgtg
ctgctcagtt acgggatggt aagggaaggc tctaactcgg tgtcttacac 194280catgcactag
ggagtcgccc gctttattta cgagcgagtt gatgctttca gtgggttttg 194340gctttgcctg
gttccagact caagcccttc ctctaatgta gggcgggtac aagagagact 194400tgagcaagga
tattgggatt taagggaagg gcagtgcagg ccaaggtcag caagaggaca 194460tctctaagat
gaacagcact gcatttgtgg gaagcaaccc tgccataaac aactcagtta 194520acgtttacat
agtaaataat tgcattgcct tgggctggaa tagggacgcc agaaggtata 194580ggaccatgtg
gaattcagaa aagggagccc tgaagaatga gattttattg ttcttaccag 194640gagagtccag
gtgctagagt tctctccttg taagtgacta accttctccc ttttggtaca 194700caatcctcag
atgaacggag atgattcatt aggccattct agcttaatgg atgcatcact 194760gtgcaaccac
gcaaatgcct catttctgat tcactcattt aatatttatt gaggagtgct 194820gggtactagg
ccattagaag aacaaacaac aacaacatag gcctggacct tgcacttgca 194880gagttcacag
gctagagagg cacacaagtt aaatatagag acagctgaaa gagctgacgt 194940ctgggacaga
gggagccccc tgctccagga gccagggagc agatgccatg gggggctggc 195000agggataggg
aagtcaggcc aggaatgggg ttacctaagc cggatcatga aggacgagaa 195060cagactcacc
aggggtcgag ggacaaggag gcagcaagtc cgcagcatga aggcaggaga 195120gaacattgct
gtaagccttt accatggcca ctgcatgggg aggaaacagg agatttggaa 195180tgagctgtga
cactgggcag gaaagcaaga cccaggtctt gtgaggcctt gagcgtcaag 195240cttaggagtc
aggacctggg gattttaagt acagcaatat cataattaca ttcatgtttc 195300agagagatca
ctcaggcagc agtctggaag gtggagtgga aaacggagaa attagtcatg 195360aagacctatt
tagaaagtag taggcaagaa atacccaggg cctaacctga ggtgtaggca 195420ggggatggga
atgagagcaa ttaccgaact agaatgaact aaactcaatg aacaggagcg 195480gggaagaggg
aatgcagaaa tggctctcag atttctgact tgttcgctta atgactgggg 195540ttgtcatttg
tttagataag caataaagaa cctagaaaca attttaggcg gaagatagtt 195600gtgttgggag
cccctaagac catgatttat tacagcacaa ggatacaaag caaaattagc 195660aaaaggaaac
agcacgtggg gtcaagtcca ggggagctca ggcgcaagct tccagagtac 195720tctccccata
gagtcacaca agatacactt agttcctcca gcattgaact atgacgacac 195780ctgaaatatt
gtctgcagaa gaggctcatt agagactcag tccccaaggt ttttattggg 195840taatggtcac
gtaccaaatt tcagtctctc agaacaaaag caaatgttca gcatagacca 195900ttttgtttgt
gaaatttagg cacagtgagc ccctcttgtc agttagagga tagttgaaac 195960cctcccacaa
tccaagttct cagactcagg cctgctacat tatttctttc ctacacagta 196020gtggatttgg
ttactaacat gttggatctg aagcatttgt tgtaatttac agtgaagatg 196080tccactgcac
agttccgtat tgagtctgag gttcactgga gacatctgtg ctatcaatga 196140aagtttctgg
agtcattagc ccctcaaagg gtcattaaaa ccaggtcttc tagagagaaa 196200gtctaaccct
gtattccttt aaaccaccat agagaccatc tagttcttca cattataaaa 196260atgagaaaac
tgagtcccca gcgatgtggc ttggttgcac atttttctat ttcaagtctg 196320ttactccttc
catcatacca gactgcctca caaaatgaca aagtcaggca ttgttcataa 196380aagaacaaaa
tggcatttag gttataaatc aggcccttgt aagtttggtt aatacaactt 196440atttcccaaa
gatcctggtc ctcagaccct caaaaggaga gctgatcaca tcaaggcctc 196500tacctggaag
gaatttctaa gcagcctcta agtcaactgt tcatgttcag tccaccttga 196560tttacaaaga
acagtgaatc attaagacct ttctctccaa ggccgggtgc ggtggctcac 196620gcctgtaatc
ccagcacttt ggaaggccga ggcgagcgga tcacgaggtc aggagatcga 196680gaccattcta
gctaatacga tgaaaccccg tctctactaa aaatacaaaa aaaattagcc 196740aggcttggtg
gcgtgcacct gtagtcccag ctactcggga ggctgaggta ggagaatggc 196800gtgaatccgg
ggggcagagc ttgcagtgag ccaagatcgc gccactgcac tccagcctgg 196860agaacagaga
ctccatctca aaaaaaaaga ccttttgctc ctctttgctc ttgtgtctgt 196920ttaacagatt
tttatgttaa caggcactct gtttgccact aggaataaaa agataaatgg 196980gacaaagtct
ttgccctcaa gttacttata ttctagagcc aagaaagaga tagatgaaga 197040acacatctta
tatattagga ctgtgttggg ggcaggagta ggcagagtaa ccctttggat 197100atgggaaatg
aaaataaatc aaactcaaaa caagtttatt atttgcttga agctctggta 197160aaagataaag
tgaaacctgc aaactgtaaa atgaagttgc tgctacacat ttctccagtt 197220ttggatttca
ttcttggatg gcctttaact tggatagaag gatacaagac agccatatat 197280atatatatat
acagctgttg ctctttaaga tatccacttc accctgttgt ggggtgggag 197340gaggggggag
ggatagcatt aggagatata cctaatgtaa atgtcgagtt aatgggtgca 197400gcacaccaac
atggcacatg tatacatatg taacaaacct gcaccttgtg cgcatgtacc 197460ctagaactta
aagtacaata aaaaaaaaaa ttaaagaaaa gagatccact tcgctcagca 197520catgtgattt
ggctttggtc tttggtgtta atttggatgc ctaaagaact gactttcctt 197580gatgctcact
ccagcctttt taggactaga ttggatacat ttacatgttt gctggctgtt 197640ttttctacag
tcacacatcc tacccataat aatattgcaa tattctaagt tgtcttggag 197700gatatcatca
agtaggtgtc aagtaaaact taatggtaga ctgtaaattt taaaacaagt 197760tacaggtcat
ttgtgtagtc tctctctcta tatatattaa aagaaacaaa acttttccct 197820cttgaatctt
tattcctatc tcagcttcat ttattgcctc gtttgtcatc taaatggtct 197880ttgttctttc
agctttcaaa catgattaat tgacattgtg cagcatagca cagggaaaag 197940gctgaatatt
ttgtatgatg agtaatctag ttaaacgccc atctttcttt tcaactttac 198000caacaaatgt
gtcttatgca gatgtaaggg cggtttgctt atttccctgt tcaaaatttt 198060atgctattag
cctagcatag catgtttctt ttccattgag ctagccaaaa ataatattga 198120caaattcatg
aagccttcca tctgatactg ttgcaaactt atgcaaatgt attttagact 198180gttaagggaa
tgtaagaatt aaataattaa acatccaaat gtcaagaatg ggataggaag 198240aagtaactgg
cctcatgttg gaagttaatg aaatatttga aaatatacct ttaaaatcaa 198300agacatgaat
ttgtgttttg ggtttttttt tttcagattt gaataaagca gagtataata 198360ttatagttgc
tgcaggaagt tatgagcatt tctggaagtg cttattttca gcacatgcta 198420cttagaattt
taaaggacat tcagtgtatc tttgtcatga atctttctac tgggaccagg 198480ccactcctgg
atgaactcac agattgagag cttctcaaaa tgtaaaatca aattgtcttt 198540taaaatatgc
tattttaaag ttctggagaa tcaatgaact ttaaatgaaa atcggccaat 198600acttggcaaa
aagtcaattt gaagagttgt ctctttataa tctctgattt cacttataga 198660gcttgcttcc
aaaaaaagta ctacgtatgt cataaatctg aaatgttttg cttttatgcc 198720atttttaagt
cctgataaat attcttgaag tgttaaatta ttattattat tattatttga 198780gatggagtca
ctctgtcgcc caggctggag tacagtggtg tgatcttggc tcattgcaac 198840ctccgccccc
ccggattcga gtgattctcc tgcctcagcc tcccgagtat ctgggattac 198900aggtgcctgc
cactgcatcc ggctaatgtt ttgtattttt agtagagaca gggttttacc 198960atcttggcca
ggctgttctt gaactcctga ccttgtgatc cgcctgcctc ggcctcccaa 199020agtgctggaa
ttataggcat gagccgccgt gcccagccag tgttaaatta tttttaactt 199080gttagaaata
attgtcccac aacttactga aggttggttt aatattcact attgttccca 199140aaaaactctt
aaaatattaa accaactcat agctcataac ttttcaaggt agcaagtgtt 199200tgctaatctt
aaaatgtgtt actattcttt ttttgttttt tttttgtttt tttgtttttt 199260tgttttgaga
tggagtctcc ctctgtcacc caggctggag tgcagtggtg tgatcttgga 199320tcactgcaac
ctccacctcc tgggttcaag tgattctcct gcctcagcct ccccaagtag 199380ctgggattac
aggcgtgcca ctacgcccag ctaatttgct tgtattttta gtagagatga 199440ggtttcacca
tgttggccag actggtctcg aacccttgac ctcaggtgat ccacccgcct 199500tggactccca
aagtgctggg attacaggtg tgagccacca tgcccggcca ctattcttaa 199560gtgaccttct
gcacatctct gtagttttat gcctggtgat ggcgtttaat ttaggagtat 199620ctggcatgcc
actccagtct gacagctagg ggcatgttag caaagggggc atttttcagg 199680tgagtgggtg
agcatagccg tgctgaccat ctttcctatc cagagggaag gcctagcagg 199740gactcacgcc
atcttgactt gcctgcccca ggctccctct cactgctccc ctggcctctt 199800cctcttctcc
agacaatggg gccttatgct cgctgtgttt gtcagaactc tcttgtggtt 199860gtggctgcaa
gtaagagaaa ctcaactcga gcttattcag aaagagggga tggattgacc 199920cagataacag
aaaagtgcta ggctagatct ggccttagcc acagctggat ccagccttct 199980ggcatttgtg
ctccgttcct cagccctgcc ccactgggag tgatgccttc tgttcacagt 200040gtgatctgct
tcatactcaa aactccctat ccagaggaag aggaatcttt tctctcccac 200100catctgcaca
gcagtattca ggaatcccct gccaggctct gttggcctca tgtgccctgt 200160cctaatccac
cactgtgccc agccagggtg tggccataca tgcccatcct gtgcataggg 200220caggagcaga
cccacctgca ccacatgcgg tggccgccat gaccacaaga ccaggatagg 200280agtgctgatc
aggccagacc ccaagggctg ttgctcagac catacctggc ctgtagagta 200340gaagaactaa
atgtctcaca caggcaaaga ctactcttca attcctttaa ttgcaggtgt 200400atgttggagt
ccatgaaata caaagttatt gagaagcata actaaattta tagtagctcc 200460tcattatata
ttttcatgtt gataggattc tgcccccatg ggaattaatt ttcttcatca 200520aaagtgatct
ccatactcac aaacactgaa tagctttggg cttttaaaaa caatgttttg 200580gctttaaaag
aaccacagga ggctgggcgc ggtggcttac gcctgtaatc ccagcacttt 200640gggaggccga
ggcgggcgga tcatgaggtc agaagattga gaccatccta gctaacatgg 200700tgaaaccctg
tctctactaa aaatacaaaa aaatcagcct ggcgtggtgg caggcacctg 200760tagtcccagc
tattcaggag gctgaggcag gggaatggtg tgaacccagg aggcagagct 200820tgcaagtgag
ctgagatcat gccactgcac tccagcctgg gcgacagagc aagactccat 200880ctcaaaaaaa
aaaaaaaaaa aaaaaagaac catggcaaaa gtgtccttta gccgggtgcg 200940gtggctcaca
cctgtaatcc cagcactttg ggaggccgag gcgggtggat cacgaggtca 201000ggagattgag
accatcctgg ctgatacagt gaaaccccgt ctctactaaa catacaaaaa 201060attagccagg
cgtggtggca ggcgcctgtt gtcccagcta ctcaggaggc tgaggcagga 201120gaatggcgtg
aacccgggag gcggagcttg cagtgagcca agattgcgcc attgcattcc 201180agcctgggca
atagagcaaa actccgtctc aaaaaaaaaa aaaaaagtgt cctttagtct 201240actgttggtt
cttccaaaga gccatccttt cgctgatgct ttaggaactc ccaacctggg 201300tcctgtcaag
gataagctct catcttattc ccacaaaatt cctagagtta ctcatcattt 201360tcctttcata
ttcagctcat taaagcagtt tgtaggtctc tgtagtccat attagtaagt 201420gttaaaaatg
tatcaatttg attttttttt tttttttgag acagagtctt gctctgtagc 201480ccaggctgga
gtgcagtgtt gcaatcatgg ctcactgcaa cctctgcctc ctgggtttaa 201540gggattctcc
tgcctcagcc tcctgagtag ctgggattat aggcgtgcac caccacgcct 201600ggctaatttt
tgcattttta gtagaaacgg ggtttcacca cgttggtcag gctggtcttg 201660aactcctgac
ctcatgagcc accgctcccg gccatcgatt tgattattat aaaatcaggt 201720tatattggtt
tgaccagtta aaaacatcac aggttattta atctttttat ttatttattt 201780atttattgag
acagagtctg gctctgtcac ccaggctggt gtacagtggt gcaatctcag 201840ctcactgcag
cttcgacctc ctacgtccaa gcaattatcg tgcctcagcc tcctgtgtag 201900ctgggattcc
aggtgtgcgc caccacaccc agctaatttt tgtattttta gtagagacca 201960ggcttcacca
cgttggccag gctggtctcg aactcctggc ctcaagtgat ccacccgcct 202020tggcctccca
aagtgctggg attacaggtg tgagccactg tggccagcta atcacttttt 202080taaatcacaa
tttgtgttac gttcatgagt aatgggtgcc agaaataatg ttgtgtgtta 202140aacataaatg
tagatgggta accatgtata caaatgcata tctcctcaac ccagagagtg 202200aaatttcccc
accccagtga ccctgcctcg ctgtatggtt cattcttctc caggaattga 202260aacatggtgg
cagcctgcca gcttggcgtt accattaatt cttattctct tcccctgccg 202320atgtcatggg
gacaccagcc aaatgaactc tgatgtgtct cacgtaatgc atgtggactt 202380ctggtgcatg
taaacagagg gtggttattc ttgattgcac atttgtgatt tttagtgagg 202440ctctaaggga
gaagcaagag gacttgaata acagatttat tctcgggcac tggtacatga 202500aagagatgag
taatgccaaa caaataacac aattctgatt cacgccattg gtaatggagg 202560agacttctgg
gtaaacaaac caaagtaaca gctagaatgc agatctgatc acagcagtgt 202620gttttgattc
agtctaaaga tgacaaagaa aagtttctgc tggtcctatc tgtagcaaca 202680agggcctgcc
cgcctgacca cactgcagtg gaacagccac attgtgattt tgcgggcctg 202740aagcctttcg
tctggtgacc ctttaagagg aacttgagcc aagatcttat cagtgatgga 202800gatggagaca
cctcaggatt ttccagtatc gtttttcact ttttgttctt ggacatagtt 202860aataacgctg
ccaggtcttt caaatgctga ggcatttcct catagctcca agtagctgct 202920gcgggtcaga
ttctgcttga accgagggtt caattacaac atccgtatgc tttccagccc 202980actctctgac
ctttgttctt tttttcagta ctttttcagg ctcttaatgt ttccatccac 203040taatctcagg
agactctgta aatattctga gtatacatag cccaattact tttgaaaagc 203100taatttaata
gaaacttctt gatatgaatt gaataaattt tttaacttaa aaatcattta 203160cttacaactt
aagacagtta aacttgagag tatttgaaag ctagtaaact tcagagcttt 203220cttcagtagg
cacgaacttt accatttttc tgacactcag cacggtcagc aatgacacat 203280tcattggagg
actcatcaat atccctaaaa atagtagcag tactcgtgca cagcctgatt 203340tcttttgaac
ttggctcaat ttcttggaag cattgatcat gggatcccgc tttgattcct 203400tttctgtaga
ctagtgccag tttgcccagt ttcattacat ttttggaaag agctaaaact 203460atggtatcgt
tttattgcag tcctcacatc aatcataatt tggagcagtc ttggtgaaag 203520atgaggaatt
cagactggga tcctgggcaa ggcaggtcca gggggcaaga gggatggaat 203580gcagtgagta
cttattttat atcataagta cagccacctc agcctcatga ctttttagcc 203640acggcttcat
tccaaagtga ccacagtcat aaggcccaaa actatcttct ctctctttct 203700ctctctctct
ctctctctct ctctctccct ccctctctct gtcataggga cctccctaat 203760cctccaggtt
gtacagccag gaagggtaag gtattgtcaa aaacaagtga ggtaccccct 203820tgatagctca
gtgggccaga gggcctctgg atgctggctg atcatggcaa aaactaaata 203880cagaatccat
acagtggatt ctctgtgctt tttctcttca cccctcctta cccttgacca 203940aatttctgac
tgtgttattt tcttattgca gtcctccctg ctagatatac aactttgact 204000ttcatcagaa
gaaagtgctc ccattgcatt attagctaaa ccatgttgca gtggcaacgc 204060tccttcttcc
agcttgcctc ggtgtttctg tagttagcat caggtgtcct tggttctgtt 204120taccagcctg
tccagggatc acttccctgg tgggatctct ctggtgaccc tcatctcaga 204180gctcagcggt
gcctttgggg ctacttgcca gacaactcca tgtgacagag ggccaggtcc 204240aagtgctcga
tcaccccagc caccatttca gagttctcgg attatagatc cctcaacgag 204300tcccagtgag
acaccgaaca tgccatagaa gctatgtttt tctatttcag cacacagtaa 204360aacactacct
cttagatgtc gatgagtaca tgctgctttt atgttgacgc agaggacctg 204420tgttgactct
tcatcggcaa ggtgtctttt ttttaagttt caattgacac ataccacctc 204480tggcccaagt
ggaccacaca gttatgcatg ccctttggtg caagatacat cacgtctcca 204540agaactcggg
tgtttttctg atcatggaga attagattcc acatatagct tgtatttagg 204600atctgcagat
taggttttat aaattatgtt tggcaaatga taatccacaa agtagattct 204660aagttaaaat
actataaagt actataattt atgtttaaat gtctttcaac ctgagagtta 204720ctctccctta
gaagtcccca aaaatgctga aaccacacca ggttgatatc tgacctttca 204780cttttggaaa
atgtgtatat tgttgaacct ctaatctaga ttcggacacc atcaccaacc 204840agcttctctc
tttcagttgc catctgcact ttactgttta gtttttgttg tctttaggga 204900aagttgtaaa
atacccacat tcttctcaag ttcttcaagt cttttggttt ttagtaaacc 204960cttttgtgac
attgtctttg agaacagagg tcaacaaaca tttccttaat ggactagata 205020gtagatatat
caggacttgc agaccacatg gtctctgtcc taactactca actctgccgt 205080tatagcacaa
aaccagccgt actcaatata taaatgaatg accctgtggc tgtgttccac 205140taaaacttca
taaaaataag cagctggcca gatttgctgt gtgggtctta gtttgccaac 205200ccctgatcta
gaagaaaaga gttttgtgta atgtttgggt aataaaataa acataccaaa 205260ccatctcagg
aatccctaac cttttcctca ttgggaacta ttttcccagt atccccaagt 205320actggtggga
atgagataaa aagggccagg catgccattc cctaaagcag atagccaacc 205380ctctttcttt
ccctgtggaa atcccctctg cctacaggga ggagaagcaa ctggtttgga 205440agctgccagc
cagtccgttt atttagaaaa ttctaagccc agttccagat cactttgcgt 205500caactctaaa
atacagccac aaatgaaata aatgttgttt ataatcctac caagtctagt 205560aggtgtgtat
gtgcttgagg ttaaactttt gtgtaagctc cagacttact ttaggtttat 205620agaaaattac
atcgcaattg atacaatggt tgtgttgggt gatggagata gaaagaggga 205680gaagaagtga
aagtgaaatt ttggtcaaaa ttggatacca ccctaataag ccgggagata 205740aggggtgctg
cagaactgtt ttctgtgata cagtgcatct acattgataa tttgtgtttt 205800atgtactttt
aaaatgaggc tttattggct tgatttctgt gagaaaaatc actaatgggc 205860tgatcgtact
gctctacttc aacatgcaag agaattgctc agacgcttgt cagatgcctg 205920aaccacatac
ttatttgaga atgtgccatc ataggaagct ttcattcacc aaggtcaccg 205980agagggcagc
ccagggaagc acgttgaact tcgcatggtt ctggccactc gtgtcagtag 206040caaagcaagg
aaataaggag ctgacttgcc caaaaattac aagttcagag aaaacggact 206100tgttttgttt
ggaaaaggca gataacggag cagtaactaa agaatcctct ttttttatta 206160ttattgttgt
tgctcaattt ataaacctaa acagaaaatg tgtgttgcct tatttcctct 206220tcttttctac
agataacttt cttttaaaac attctaaata ttaagcaaca ttgtaagtac 206280agattttttt
tttttttttt tttttgagtt ggagtctcac tccgatcatc cagactggag 206340tgcaatggcg
cgatcttggc tcactgcaac ctctgcctcc tgggttcaag cagttctcct 206400gcctcggcct
cctgagtagc tgggattcca ggcacccacc accacatttg gttaattttt 206460gtatttttag
tagagatggg gtttccccat gttggccagg ctagtctcga actcctgact 206520cacctcagcc
tcccaaagtg ctgagagtac aggcgtgagc caccacgccc ggctagtaca 206580gaatttttta
tggccccttt taaagactgc aaaccttttt ataccaaaaa aaaattattt 206640ttcctgtatt
tattttacac attagtaaat gaatgaaatt ttggctctag gtttagaaag 206700caatttcatt
ttaagctcat tttactgtgt cattggactt tttattaatt ttattcttga 206760actacttagc
tgaaaatcga attctgtcat tatatgtaat catataatgc ttctcataat 206820gttacacttt
aattagatgc agttgaataa taacctaaaa aatcttcata gtctataaaa 206880ttcacactta
tatttataag acaatattat ctcacccatt gatacataat gttatgttaa 206940aagtacacaa
tatggccagg tgtggtggct cacacctgga atcccagtac tttgggaggc 207000tgaggcggca
gatcacttga gcccaggagt tcgacatcag catgagcaac atgttgaaac 207060cccgtctcta
ccaaaaatac aaaaattagc caggcatggt ggcgcacatc tgtaatccca 207120gcaacttggg
aggctgaggc aggaggatca cttgaaccca ggggacagag gttgcagtga 207180acagaggttg
cagtgagcgg agattgtgcc tcctctgcac tccagcctgg atgacagagt 207240gatactctgt
ctcaaaaaat aaaaagagta cacaatatta tctcacccat tgatacataa 207300tgttatgtta
aaagcacact gcattctaga gaaggtagag taacagggtc ttactttacc 207360ctcccacttg
aaacaagtaa aaaactggac aaagtatatg aaaaatacat cactcagggc 207420attggacacc
aggcagcaaa aaacagtggc ccctgagaaa ttggaaacaa agtgagtcct 207480ataattgtcc
ttatctagaa aaagtttcct ggctactgtg tagagagaag gaacccagga 207540gaaagagctc
aactgtctcc ctgagttgaa gacagagctg gggatacatc aagggaaacc 207600aaggcagcta
gtttgcagag gtggaggacc ggagaggaga gaactgcata gagagagctt 207660tgcggatgta
cagagtcctc cttgagcatg caagtgcgta ctgatcaaag tgtggttgtg 207720agaaaactac
ccatcaccag agaaaggatc acccgaaaga atgagggaac agtgcctgcc 207780tcccacacag
tagccaagaa gaggtgcctg ttgccaacag ctagagtgaa aacctctgaa 207840tgcaggggca
ctggttagaa tactaagaag ggtcttgcca gacatccaat ccaaaattag 207900caggcatgca
aagatctagg aaaataccac ctaaaatgag aaaagtcaat caaaactaac 207960ccattataca
gatgatagaa ttcatagaca cggacattaa agtagttgtt gtgactgtag 208020tttgcatgtt
taagaaacta gaagaaggat ttaacatgtt aaatagagac aaagaagata 208080taacaaagat
ataaatcaaa ctgaagagat gaaaactgca gtgcctgaaa tgataaaaaa 208140aaaaaactct
taattaatag cagattagac attgcaaaag aaaagcctag ttaacttaaa 208200gacatgcagt
agaaactatc tgaaatgaaa cagaaaaagg ctgaaaagaa gcaagagagc 208260aacattgagc
tttgggatca ttttaagtag ccaaaaatat gtaaaactag agtcccacag 208320gagtggaaaa
aaggaactga aaaatatttg aagaaataat gggcaaaacg ttttcaaatt 208380tgatagaaac
tataaactca tacattcaaa aaaatgagaa cattttaaaa agagaggata 208440aatgacacat
tatgtagaga ggaacaaaga taaattggct gatgatttct tgttggaaac 208500aacacaaact
agaaggtggt aggtcaatat ttttaaagca gtgaaagaaa attcacagcc 208560tagagttttt
tacctggtta aaatatcttt ccaaaacaaa gctgaaataa agactatttt 208620agacacgtaa
aagctgaaaa aattcatcac tagctaacct gaagtacaaa aagtgttaca 208680gaaattcctg
ctggcagaaa gaaaatgata ccagttggaa acctgaatct acccaaagga 208740atgaagaaca
ccaaaaatag taaccgcatg gatagaaaga cttgttctta ctgtttatct 208800taaaagctat
tactgtatac agtaattact gcataaagca aaaataataa aaatgtagta 208860tagcgttaaa
aacatttaga agctaaatat atggcaacaa taacacaaag tctaagagga 208920gagaaataga
agcatgcttt tgtaaaattc ttaggctata tgtgaagtag tataactata 208980tgtggtgtgg
tagtataatt atatgtgaag gtagactgtg caaagtgaaa tgtgtatgta 209040gtcatgtgct
gtataataac gttttggtca aaaacaaacc gtgtatacaa cagcagttcc 209100atcagattat
aatactgtat tttttactgt accctttcta ttgtatgttt gcatgtataa 209160atacttacca
ttctgttaca gccacctact gtattcagaa caataacatg ctatgcaggc 209220ttgtagccct
ggtgcagtag gctgtaccat acagccaagg tgtgtaggag gctataccat 209280ctaggtttgc
aagtacactg tatgatgttc acacagtgac aaaatcaccc agtgatgtgt 209340ttctcagaat
gtacccgtct ttaagcaaca caggactgtg ctgtaaatgc aaggtaacca 209400tttcagtaat
tcaacaacgt tatagttaaa gagttgtaac cttgttatag ttaacaaagt 209460taatatatag
ttatctacaa agtagataaa atgaaatcat taaaaataat ccagaggaag 209520cagaaaaggg
aaaaaggaac aaagaacaga taggacaata agaaaaagaa tagcaaaata 209580gtagatttaa
actcaactat atcgataatc acactagata taaatgtcct aaatacacta 209640attaaaatgc
aaagaatgtc agattggcta aaaatgcaag agccgactat tactgtcaac 209700aagaaaccct
ttttaaatct aaagacataa gtaggttaaa ggtaaaagga tggaaaagcg 209760ctaatactaa
tcaaaagaaa attgaattgg tatatcaata tcaaagtaga actagagcaa 209820agagtagtac
caaggatata aaagaggctt gtttcattat gacaaagggg ccgtttcacg 209880aacaggacat
agcagttcta agtgtttaag catataataa cagagcatca acataaatga 209940agaaaaaaaa
agataaaatt atagggaaaa atagaaaatc cacaaatata gtcagaattt 210000tttaaacccc
tctcagaaca agtagagtaa aaatcaataa ggatttagaa gacttaaaca 210060atgctacaat
tcagcttgac tccacccaac aacagcagat tacgcattct tttcaaacgt 210120acacagaaca
tttgccaaaa tagaccatat tctgagtcac aaaatgtctt aataaatttg 210180taagggctca
aatcatacaa aatatgttct ctaaccacag tgggattaaa ttagaaatca 210240acactgaaag
acatctggaa acactccaaa tatttggaaa taagctacag agtgggagaa 210300aatatttgca
agttatctat ttgatataag gatttgtata tggaatacta aaactcaaaa 210360ctcaatgaga
aaataacgct tttaaaagat gactgaaaga tttgaacaca tatttcaccc 210420aagaaaatac
atgaaaatat gcttaccatc acaggcatta gggaaatgga aatttaaacc 210480acagtgaata
ccgctgcata gctattagaa tgtctacaat taaaaagatt gaccagctaa 210540gcacagtagc
tcatgcctgt aatcccaaca ctttgacagg cagaggcagg aggatggctt 210600gagcccagcc
tggacaacat agggaggaga ctctgtgtct acaataaaag aaaaagaaaa 210660ttagccagtt
gtggtggcct agctgctcac ttgggaggat gaggtgggag gattgcttga 210720gcccaggttg
aggctgtagt aagctgtgat tgcaccactg cactccagcc tgcaagacag 210780agcaagaccc
tgtcccaaag aaaaaaaaga ttgggctggg cacggtggct cacagctgta 210840attgcatcac
tttgggaggc tgaggcaggt ggatcacttg aggtcaggag ttcaagacca 210900gcctgggcaa
catggggaaa ccccaactct actaaaaata caaaaaatta attgggcatg 210960gtggcacacg
cctgtaatcc cagctactcg ggaggctgag gcccgagaat tgcttgaacc 211020caggaaacag
aggttgcagt gagccaagat tgtaccactg cactccagcc tgggtgacag 211080agtgatactc
tgtctaaaaa aaaaaaaaaa aaagactgac cataccaaat gttggcaagg 211140atgtggaaca
actggaaccc tcatacactg ctggtaggaa tatataacag taccaccccc 211200ttggaaatca
atttggggtc gtttcttaaa aagttaaagt gcacctgcca tatagtcaag 211260acattctctt
tctaaatatt tacccaagag aaatgcagta tatgtccatg taaagatccg 211320tgcatgaatg
ttcgtagcag cttttttgta ataatataca gtaagcttta cactttgaca 211380tgtgcagttt
attgtatgta atttgcctca ataaaagttt taaataatac aattttatcc 211440aaaataaagt
tgtgaaaagt agaaaaatga ggaaaaagtt tcaaaggtta gaggggtcag 211500tccaggagac
ctaacatcta actaatggga attccaataa gacagaagaa accagaaggg 211560aagaaattat
cagagaaata atcctcagcc ttccacccaa aattttctct cctagaactg 211620aaggactccc
aaatcaaaag ggtcagcaaa ataaatgtat agactcacac caacacactt 211680catcataata
gtaaaaagaa cagtagagtc aaaaagagga tactaaaagc tttcagaggg 211740ggaaaaaaca
aaacatgcaa ataatctgga acccgaatgg catcagactt ttcaacggta 211800atactgggct
gggtgcagtg gctcacgcct gtaatcccag cactttggga ggccaaggca 211860ggtgaatcac
ctgaggtcag gaattcaaga ccagcctggt caacctgatg aaaccccatc 211920tctattaata
gtacaaaaat tagcttattt cttaaaagcc agcaaggggg atggtttttt 211980ccttgggctg
gtggcacgca tctgtaatcc tagctactcg ggaagctgag gcatgagaat 212040tgcttgaact
tgggaggtag aggttgcagt gagccaagat cgtgccacta tactctagcc 212100tgggcggcag
agtgaaactg tgtcttaaaa gaaaaaaata aataaaagtt atcaaaaaac 212160aaaaacaaaa
ccataatact ggaggctaga ttacaataga gcaatatctt caaaattcaa 212220ggaaaatggc
caggcgcggt ggctcacgct tgtaatccca gcactttgga aggccgaggc 212280gggcggagca
cgaggtcagg agatccagac cacggtgaaa ccctgtctct actaaaaata 212340caaaaaatta
gccgggcgtg gtggcaggcg cctgtagccc cagctactcc ggagaggctg 212400aggcaggaga
atggcatgaa cccaggaggt ggagcttgca gtgagccaag atcgcgccac 212460tgcactccag
cctgggcgac agagcaagac tctgtctcaa aaaaaaaaaa aaaaaaaaaa 212520tcaaggaaaa
taatataccc agctaaacca tcatgaagtg tgataagaat aatgacgaag 212580ccagacattt
tagaaaaatc tacccctagc gccctttttc tcaagaagcc actagaggag 212640tatttcactc
aaatgagcaa gcagaggaag gaggattgct tgagcccagc ctgggcaatg 212700tacagagacc
ccatgtctac aaagaaataa aagaagaaaa ttagccaggt gtggtggcct 212760gtagtcctag
ctactcactc aggaggatga ggtaggagaa ttgcaagtaa atgagcaagt 212820aaaccacaga
agaatattaa ttttctcttt tttctgctgc ataacaaatg agcacacatt 212880tagtggctta
aaacagctca catgtattat cttccatttc tctgggtcag cttagctggg 212940ttctctgctt
cagactgcga aaattgtctc atctgaggcc tgggttcctc atcaaagctc 213000atttggcaga
attccattcc ttgaagttgt gggactgagg gcccagcttt ttgctggctg 213060tcagctggag
gccacaccca ggtcccagag gccatgtggg tttttgccat gagaccttct 213120ccttcgtcca
ttcacagcat ggcagcttat ttcttgaaag ccagcagggg ggatgggttt 213180tctctggaat
ctgttagatg aagtgacctc ccttcactgt cactcgtgcc atattctgct 213240ggtgagaagc
aagtcacagg cctcacctgc actcagggag ggactcactg ggagtcgccg 213300ccacagtgtg
gctccaggaa tggaaactac cacactgcag aggtgcaggg aatccccagg 213360attatggtca
agaaagagcc cactaagtgt tttgttcagt aggcctgcag gatgaagggt 213420gccaggagga
tggctccatg aaagactgag cctatacgtg aacgtactga aagctgactt 213480acagttcagc
cagtgtggtg gtatgtgatt gacagcgccc tagcaagatg agcaacaaag 213540atacaagact
ccagcagaaa caagctgcag aagaacagga gtgtaatcaa taaatcagat 213600ggccttgctg
tgaacggtta catagctctg ataaatcctg atctaaccaa aagtcatgac 213660agctcgatat
ttgggagctg gagggggtgg ggagtagggg tagtaggata tgaaggagct 213720aagtccccat
tgttgatatt agaaatccag tgggtaacat ctgaacggca agaccaggac 213780agcagcgaaa
ggcaaaaaga caccactgag tgttgacagt ggctgccttt caggagtggg 213840aatgccactg
ggggcagggc agtggggggc agggaacagg tttttctgtt tgttttttgt 213900ttgtttgttt
gttttgtttt gagatggagt ttcactcttg ttgcccaggc tggagtgcga 213960tggcatgatc
tcggctcaca gctggagtgc gatggcgtga tctcggctca ccgcaacctc 214020cgccccccgg
attcaagcaa ttctcctgct tcagcctctc aagtagctag gattacaggc 214080atgcaccacc
acgcccagct aattttgtat ttttagtaga gacggagttt caccatgttg 214140gtcacgctgg
tctcaaactc ccgacctcat gtgatctgcc cacctcggcc tcccaaagtg 214200ctgggattac
aggtgtgagc cacttcgcct ggccatggga acagtcatgt tgcatagtaa 214260gccctaaaga
gcagtgtgat ctcttaagcc agcacgtgat ttactttggt aaaaattaaa 214320ttaaaaaaaa
agaaattagg ccaggcgtgg tggctcacac ctgtaatccc aggactttgg 214380agagccgagg
taggcggttc acctaaggtc aggagttcaa gaccagcctg gccaacgtgg 214440tgaaaccccg
tctctactta aaatacaaaa aaaatagctg ggcgtggtag tgcacatcca 214500taaatcccag
ccactcggga ggctgaggca ggagaattgc ttgaacctgg gaggtggaga 214560tttcagtgag
ccgagattgc gccattgcac tccagcctgg atgacggagt gagactctgg 214620cccaaaaaag
aaaaaaataa tcctattggt ctcatagtgt ttctctgtat ttttaatata 214680ctgtgtatgt
gcttatgaga aaccgaaaga gaagtgattt gtgaacgatt tgaggacttt 214740tggtccacgt
ttttttctcc cacttatact tgtggcataa tccagaggaa gaagggcagc 214800tgcggagtcc
cagcagtgtt gcgggtatca gggcagccag tgagaggatg gaggagaaac 214860ccaggagagg
gcagaggagg agggacagct gtgcccacct ggtcctttca ttgttgtctg 214920cctctgccct
tgcagttaga aaaagagtgt tctgtgtata gaagcctttg caagtagcat 214980agttaggagt
ctttcggtcc aggaattgaa ggtgctaaag caggaggaaa acatacatag 215040aagaaaggaa
atgtccacat aagtaagtgt attagtctgc tagggctgcc gtaacaaaag 215100accatagact
gggtggcttc aacaacagaa gtttatttta ttttctcaca gttctggagg 215160cgggaagtcc
aagatcaggg tgctggcagg gctggtttct gctgaggcct ctcttcttgg 215220cttgcagaca
gctgtcttcc ctgtatgtcg tcttctctgt acacaaacat ccctgtgacc 215280tcttcctctt
cttttaagga cactcagtcc tattggatta gggccccatg cttatgacct 215340ttttgggttg
ttttttgtgt ttgttttgag atggggtctt gctctgtcac ccaggatgga 215400gtgcagtggc
acgatcatgg ctccctacag cctcgacttc ctgtctccaa atgtagtcgt 215460attgggggat
taggatgtca atataggggt tttgagggga tgcagttcag tccattatat 215520acagtaagcc
acttctagac ccagtctgaa gtcctgagca ttcagctggt atctggagtg 215580gatcacctaa
gaaacattac agaacaatat aggcctctgg tatttaaaat agtttaatta 215640cgtcacaggg
tagtcctgaa ttctttccta cttcagccag tcttggccag ccgagtttca 215700gggtagtgat
gagcactgat gaggtgactc ccccttctcc aaactcttcc tttgccactg 215760tcccaagaaa
accataaagg ggaagtttaa atgatggatg taattttcct gacttgggct 215820gggcgcagtg
gctcacacct gtaattccag cactttggga ggctgcggtg cgcagatcac 215880ctgaggtcag
gagttccaga ccaggctggc caacatggtg aaacgctgcc tctactaaaa 215940atgcaaaaaa
aaaaaaaaaa aaaaaaaaat agccaggcat ggtggcacac gccagtaatc 216000ccagctactt
gggaggctga ggcaggagaa ttacttgaac ctgggaggca gaggttgcag 216060tgagctgaga
tcttgccact gcaccccagc ctgggtgaca gagactgtct caaaaataat 216120aatgatttcc
ctgtcttatg tttttctact acacagctta ttatctctgg gcaggagctt 216180tcccagggtc
ggaggggcaa gggtctagtt gtgagggaga gaggtgaaag caaaagggaa 216240atgctaaatg
acatggcaca gcaagcaagg tctaggcagc tgcgcctttg ataacaagct 216300gcttgaagag
aaagaccaga tcggctgagt gaaagcactg tccatcctcc ccaccccttc 216360cttcctgctt
cctcctcttc ccatctcctc cagcttctcc ttaaagggac taccgaatga 216420tgatttcctt
gccagtagga ccttgacata aaacccagcc cacaacatca cagttgattt 216480tcaagggatg
agcttctcag aaaccagaat ttaattactt aattttgagt ttccaaacaa 216540taaaatatat
tattaaattg atttgttttc cacacgtagt ggagattttc atggtgtgct 216600acctgtcggt
caaacggctg atgaatgagt atgtcgcaaa aactctaggt gggaatatga 216660cacatgagga
atggaatttg tgcttcagct cttcccagca tcttcttggg ctgcacaaga 216720gttggtagtc
tttcatctcc taaggactga agggccattt agaatcacac ctgttttgag 216780tttatagtca
cgcttctgaa actttctctc ttacagcctt atctgacctc aatcaagcta 216840cccctgttca
tttaaatact ttaggtcttg gctattggtt tagcttaaca tttgattgcc 216900ataaataaga
actacagaaa gcgtgacttg taaaattgta gacccagaaa aattagggaa 216960caatctcata
atactaatga aacctttttt cccataaaat aaatacataa ttagatatag 217020aagatttttt
ttttggttta agctatcttt tcctatttct atcttaaata atttacaact 217080ctcctttaat
ccaggcaatt atcaattcta taataaaata acaacaatgg gtactgtgta 217140ttaaactcta
aactagggtc cttcatctca tttgaatttt gaaatgactc cataatatat 217200tgttatactc
atgctgtagt tatagataaa gaacgagggc tgacctttga caatcacaat 217260caaaaaaaac
aaaacaaaag aagtgaaggc tgagcgaagt aactcaccca gcaagatgca 217320attcgtatag
ggtagagtcc cctgtctccc gataatgctt tgccccacac ctgaaagtct 217380tgatctggtt
cccattaccc ctcagacttt gggtccccag tagaaccttt gcacttgcag 217440ttctccttcc
taatgggctc agggaaactc acatgactgc ctggaggctg tttctgcacc 217500tgtagatggg
aatgggcaag tctctcccga ctcacttgtc tcatagtgtt gttgaaaggc 217560tcaggtgtga
tcattacagt gaaataagga aaggcaggga agaatctcca tcgttctcca 217620aatacgaaga
atgaaatgca attttttttt ttttcttttt gagacgaagt ctcactcttg 217680tcacccaggc
tggagtgcag tggtgccatc tcggctcact gcaacctctg cctcccaggt 217740tcaagcgatt
ctcctgcctc aacctcccga gtagctgaga ttacaggcac ctgccaccat 217800gcccagctaa
tttgtgtatt tttagtagag atggagtttc accatgttgg ccaggatggt 217860ctcgatttct
tgacctcgtg atctgcccac ctcggcctct caaagtgctg ggattacagg 217920cgtgagccac
agcgcccggc cacaaataag tttataccct tgtagtcatg gcctttcttt 217980tatctttctg
gtaatgaata aaaatgaagc atataagtga cagagaaacc agctacttga 218040tcaatagtgg
cattgtttca caattgtctg tattggattt tgccaaccca ctagagagtt 218100accaaagcac
atttctgaca taactatgat tgattttttt ttaatttttg actaaatatg 218160ttactttgac
taaattggtt acttcattgc ataatggtga ccatttacac ttttcagata 218220ggtacaaaaa
atctgtgtat tctagctggc catgaaagag tagcaagcct gttattttta 218280agaaaaaagc
tcgagtagtt actgtatttg tatcattttg gaaatctctc atttttctaa 218340tcagcaaaat
ggggataatg taaataacaa taaccctgac ctcacagggt tcttgggatt 218400aaatgagata
tggtgcatac aacgtgctta gcatggcgcc tggagtatgg tcagaactta 218460acaaacacaa
gttataaatg aggtcccgaa atacaagaaa ttcacttcct cagaaacctt 218520ccttgagcct
ggctacatct atagctacat tttctacatg ttaaatgcct ccctcagaag 218580tgtcactgtt
ggagttgcca gggcaccaat ttattacttt gaaaatagat aaataaaaga 218640atcaagacaa
aagaaaaagg tgctaccaaa gaagacattc tgtcttacac tggatcacac 218700gagatgtgga
tatttggccc ggaactgctc acatgggtca gcaagcctga gaatgaccgt 218760ggagagaaat
ggctggggct gcaggactca gccaagcctg gcacgcgtgt cttccgggca 218820ccggagcagg
acgttgtctg aaaacatgaa gaatgaaatg gaagtaatgt aatacctggg 218880gtcggggctg
ggggaagctt tctaagtaca ggcagggtct ttctatgttg cccagggtgg 218940tcttgaactc
ctggccccaa gcgatcctcc cacctcggct tcccaaagtg tggggattgc 219000gagtgtgggc
cactgtgcct ggcctaatta ctgattttaa atattaatct ttctaaaagt 219060caaactccag
aactgtgcat gtagaactat tcaatttctg tatataacta tagaaatata 219120tattagacgg
gccgggcgtg gtggctcatg cctgtaatcc cagcactttg ggaggccgag 219180gcgggcagat
cacgaggtca ggagatcgag accatcctgg ctaacacacg gtgaaacccc 219240gtctctacta
aaaatacaaa aaattagcca ggcgtggtgg cgggtgcctg tagtcccagc 219300tactcgggag
gctgaggcag gagaatcgct tgaacccggg aggtggagct tgcagtgagc 219360cgagatcgcg
ccactgcact ccagcctggg tgacagagtg agactccatc tcaaagaaaa 219420aaaaaaaaaa
atatatatat atatatatat gtatatattt atatatgtat atacacacac 219480acacacatta
gataagcaca ttgtttatat aggaatcaaa aattacctgt gagtaatttt 219540tatttatgta
atttaataat tacataaaaa ttatttgcag aggttacctc tggaagcagt 219600attagaagtg
gaggactttt ttcttctatg ttgaaaatat tttaagtaaa tttttaaaaa 219660cgaatagaaa
tttaatcagg aacaggcttt ctaccattaa aaaactttat agtcttcttt 219720acaaaagaag
agaaataatt gtacttgttc agtcattcaa ggccactgaa ataattggca 219780tagcttgagc
tctgtagtac tgtcacaata tgcaaacaca gtgtggtaat ttttatagta 219840attgaacatg
acttacgtct tagaaagtca taagttagga atttattctg ttcgttcttt 219900tgaagtgaaa
agtaaagaaa gcaaaacatt ttttggccgg gcgcgttgtc tcacgcctgt 219960aatcccagca
ctttgggaag ccgaggcggg tggatcacct gaggtcagga gtttgagacc 220020agcctggcca
acatggcaaa accccatctc tactaaaaat tcaaaaatta cctggccatg 220080gtggtgggtg
cctgtaatcc cagctactcg ggaggctgaa gcaggagaat cacttgaggc 220140caggaggcag
aggttgcagt gagcagagat tgtgctacta cacttcagcc tgggcgacag 220200agcgagactc
tgtctcaaaa aaaaaaacaa tgactttgac ctgaatgtcc caagggtctg 220260ctttgtgtag
tctgcccctc tacaagcagg gaccaggtct tggtcacctg gtgtccttgt 220320agtaagctct
tagcagggac tgggtcctgg tcacctgggt gtcctggtag taagctctta 220380gcagggactg
ggtcctggtc acctgggtgt ccttgtagta agctcttagc aaaggttgct 220440gagtcgaatg
aacttcaccg tttcggtcct ttggtacagt aacaggcact caagagagtc 220500cacgctgtag
ggttgatcag ccaccatgag agagaagtat actctaagag cacacaccat 220560caaccccaga
gcatccctgc cgaccaggaa tgtagttaca gcggtgcagg ttcccagcta 220620ccccaaagaa
atcccaggag agtccatctc tactaagtca agcaaataga aattgcaact 220680gtagcagcac
aatctatttt cattatatag tgtgtgtctc tctctcccct catctctctc 220740tctctctcct
ttcccctcgg tcacttgaca ttgatactgg gccatttatc ctcgtgcatc 220800atcttagagt
tacatttgta taattatcaa catggccatt tgatataaac aattccgtcc 220860ctcacatcca
aagaaaaggg aagagacaga aagaaaaggc ctaatttctg agcaattttt 220920gttgtttgtt
tcccattttg taattgtcct ccatctttac tcatctgttt ttttaaatat 220980taattatttt
tttgagacag ggtctggctc tgttgctcag gctggagtac agtggtgcac 221040atggcttact
gtagcctcga cctcctgggc tcaagagatc ctcccacctc agccccctaa 221100gtaactggga
ccacaattgt gcaccaccac acctggctaa tttttgtttg ttttttgtag 221160agatggggct
ttgccatatt gcccaggctg gtcttgaact cctgggctca atccatcctc 221220ctacctgagc
ctctcaaagt gtcaggatta caggcgtgaa ccaccatgcc tgaccaaaaa 221280tactttctga
gggtctaata ttgccaggca ccgtgttaac aattcagaaa atactcctgc 221340ttaataaata
aagacttgaa tttttttagt tacttcattg aaaagaaaat ctttttattt 221400aaattgtatc
gtcagccagg cgcaattgct cacccctata atcccagcac tttgggaggc 221460cgaggcagac
gaatcatttg agtcaggagt tcaagactag tctggccacc gtggtgaaac 221520cttgtctcta
ccaaaaatac aaaaattagc caggcatggc ggcgggcacc tgtaatccca 221580gctatttggg
aggctgagac aggagaattg cttgaaccca cgaggcggag gctgcagtga 221640gctgagatca
cgctgctgca gtccagcctg ggcgacagag ccagaggcca tctcaaaaat 221700aaataaataa
ataaataaat aaataaataa ataaataaat aataaaattg tgatgtagtt 221760aactggtata
atcttttcag ttatttttaa atttttgaaa caacacgctg tatatcagcc 221820atccatcaga
gaaggaaaga aatatgtaga aggactgtct tctgcttcag atcatgaact 221880gtaggaacaa
aatagatatg tgtaaaaatg ctgctccttt tctgaaccaa aataaaatta 221940gatttttgtt
gttgtggttg ttgttggaaa aggtagtaga tttcaggcaa aatcagaaaa 222000taaattattt
cagaaacaca acaaaactaa gtacaacgat attttggtta agaaaattat 222060ggttagaaat
tatgattcta tcactttatg aaacagaatc tttaataatt tttcagctta 222120taccaaggat
attcttttat ttcttaatga ctagttttaa atatatgact attctctctg 222180tacagatttt
taattaatgc tcataaaggc tattgataga actcattctt tctccaaaat 222240ggagttaacc
cgaaatgggt agagattcag aagcttcatt gtagttaaaa ttagggtcat 222300gaagggcaaa
agcccaggta tttcagcctt tgtagaagaa tccattaatg cagcattaca 222360agtaagtgta
atttagtggc tccttagtta ttttaagcaa gaggcctcct taaagggcac 222420acagataaag
tcaacataga tctccactga ccccaggcgt gcggccagag gtcaactccc 222480aacggctgtc
tgggctcctt tttccctgca gtgctgtatg tctggggggg tgacaatcct 222540ttgataacag
ttttttttat tgggagaaaa tttgtgggaa tgcataaggt cttgctaaca 222600taattccaca
ttatgcttgc tcaacaggag aagttttagt ttaaatgtag acttattgga 222660gatggctgta
tatctgcagg tcagggagtc tcgcgtctgc agattctttg tggtttctcg 222720tctgtcgtcg
cttatcccaa ataagtagct gcacttcttg gaagagatga ttgcgtcact 222780tgaaggggtc
atgggagtct actgattgaa ctcatttttc tcccatctga aagaaagaaa 222840agttgaaaca
agaggaaaga catttcacac tttctggcat ttaatatttc ggctagtttt 222900tgtttgtttg
tttctttttt gagacagatt ctctctctgt tgcccaggct ggagtgcatt 222960ggtgcactct
cggctcactg caacctctgc ctcccgagtt caaacgattc tccttcctcg 223020gcctcccaag
tagctaaaac gacgggtgct cattaccaca cccagctaat tttttttttt 223080ttttttttgt
atttgtagaa gagacagagt ttcgccatgt tggccaggct gacttcaagt 223140gatccgcccg
cctcggcctc ccaaaatgct gggattacag gcatgagcca ctgcgcccgg 223200cctggccagg
tttagatcct ggttttgtga aagaatcaga atgactagaa gattcatgtt 223260ccgatccact
gcagctaaca aatacagctc caggaaccgc acaaatcctg catcgccttt 223320catcagcaga
tagttaaagg cggagaattg ggttttcttg tttgtaaaga tagtgcgtgt 223380aaacaagcat
gcagtaaatt atgttatctg cagtagagtc ttccattggt agctttggtc 223440agacgagctg
aaatacactg ccttcagggt tcttcttggt ttgtcaccgt gagtagagat 223500ttgaatttac
actgcctcat ctgtactgaa ttctgccctt gctgtgaatc atgtcacagt 223560ctcagaagca
gcagcagcca tagagtccag tgacatcctt accagtaaga tacctttgac 223620ctagagagtt
catttcaggg gtggggcagc atcagtgtgg ctaaatgtga atattcattt 223680ctttttctta
aactataaaa tgcttgttct tttccaaatc catagctagg cacagattat 223740agaggcatct
taaaaagcct tctatagctg ggtgcagtgg ctcacgcctg taatcccagc 223800actttgggag
gctgaggcgg gtggatcact tgaggtcagg agtttgagac cagcctagcc 223860aacatggtca
aaccctgtct ctaccaaaaa tacaaaaaat tagccgggca tggtggcagg 223920tgcctataat
cccagctact cgggaggccg aagcaggaga atcgcttgaa cccgggaggt 223980ggagattgca
gtgagccgag atcgtgccac tgcactccag cctgggtgag agtgagactc 224040tgtctcaaaa
acaaaaacct cctataaatg ccataagatg ccggcaaatg ccatctcaaa 224100atgcagttag
gcaggaaaat tacctggtat gagttgtaac ctaggcaact ggaaaaaaat 224160atgtgtccac
cagttaccct tgtgtttgat gtttcaagta gtagaacagt ataagtgtgg 224220ttttctttct
ggggtctccc ccattgtatg cactcactgt agtaaccaga ggaaaatgtt 224280accttcctgc
tgggaacctt aacttccgag gctgtgatac ccatcttcat acctaaagtt 224340ttaacatata
gcaagtcccc acctacaagc aggtcttatg tcaaaagtgt agttgcagtt 224400tattgcctgg
aacccggaat ggattttccc ataggcgtgt tatgaacggc agctgtattc 224460caggcagctg
tggaagcctt tttagcccat ttatatacct gaagtgcagt catgtgcaat 224520tagcaagaca
agaaccagcc atcatccatc acaatgtttt atgtgttccc ccagcctccc 224580cactcccact
ctcagttggc ccagctaggc agggaagggg gtgttaatga ggcctgcagg 224640tgcacagatt
tactccccac ctgctgcatc tgcaggggtt aaaccggctt ttccctcccc 224700ttcgctctca
ccattccttc tctccctctc cccaccccca gtctcccttt ccctgtgctg 224760aaggggctgg
gcagtgaagt gaggaagggt ggctggagct gttgggggct gaggccacat 224820tgtgcagttc
acatcagaac cccctgggga cttgctccag ttgcagattc aaatgcagca 224880ggtctggggt
gggctctgaa attccgaata tctaacaggc tcccaggtaa tgcccatgca 224940caggcccaca
ggccccattt gagtaacagg agtctctaaa caccatgcag ggaggagagg 225000ctggtgaaga
gggaaacctc gagtcacgac tgaaaggcag tggagggtga gagctgagtt 225060ctattctcta
ggccttaggg aggtcccttt tacactgaga actttttgcc acatgggaag 225120aagcaggcaa
accctggatg ccatcaggtg ctggcacatc gcagcctcat ctcatgcaag 225180gtgttcttgg
ccagggcctg tggaggttct acatgatatg gcattaaccc tcgggagagc 225240aggcacagta
gaacctgaat gtctcctcca agcgactggg tgcagcatcg ctccatgtga 225300ctgggtgcag
catcgcccca tgcgactggg tgcagcattg ccccatgcga ctgggtgcag 225360catcgttcca
tgcgactggg tgcagcatcg ttccatgcga ctgggtgcag catcgcccca 225420tgcgactggg
tgcagcatcg ttccatgcga ctgggtgcag catcgttcca tgcgactggg 225480tgcagcatcg
ccccatgcga ctgggtgcag catcgttcca tgcgactggg tgcagcatcg 225540ttccatgcga
ctgggtacag catcgttcta tgtgactggg tgcagcatcg ccccatgtga 225600ctgggtgcag
catcgttcca tgcgactggg tgcagcatcg ttccatgcga ctgggtgcag 225660cattgcccca
tgtgactggg tgcagcatcg ttccatgcga ctgggtgcag cattgctcca 225720tgtgactggg
tgcatattgc tccatgggca tggtttgtat gaaacactca ctattaaagc 225780acagtaggaa
atgctgaaat ggaaaaggaa tgggtcatcc gggagcccca ggatctagac 225840cctggagcag
tgtttcaggg cctgaggatc cgtcaagggg gccagcatca gaagccccag 225900cacacagggg
gcttctgatg caggaggaga gtgtgcggtt tgaagatctt tcatcccaac 225960acagcaaggc
ggtgaagaac cagttttcat cctgtctcca ccactaacta gccatgtgtc 226020cttgagctca
taccctcatg cctaagtgtc ctctgtcctc atccgtaaag taaggctaaa 226080atgagtacct
acctcattag ggtgctggga agctcaatga gattatttct gtaaaaccct 226140tagactgtcc
attaagtgtc acaaaagcag tcgtaggggt ggctgtggcg gtatggtagt 226200ggtgaaatga
gtaatttgag attataggcc taaagtcaaa tttctggttt tgcaactttt 226260tactagtgtg
acctgggcaa gtgacttagc ctctccaaac ctcagtttct tcccttgtaa 226320aatggaaata
ataaaagttc ctacctgcta tattattgag aggataaaat gaggaaagca 226380cgtagcacag
tgcccatcac aaagcaagct ctcagtagat atcagttact gttattgagg 226440gagaccgggg
tgacgactgg catctgaaca ggagttttcc agctgtgctc caggagctat 226500aggttttaca
aaaatgtgtc ttgggggtga gaaggaggcc attattcacc ctgcccacac 226560ctgctgcccc
cttcaacctg atgggcttgt ctttgttggc gttcagcatg aagattttat 226620tgagaagaaa
aggtttggag tttggaagat cactgttttg tagttcgggt gtgttatggg 226680gccacaggga
aggtaaatgg tctcaatttt caggaagttg acatttgcct tttctacttc 226740atttccttaa
acaaaaattg aaatatcaga tgacaaattt aaagagatat atcccatata 226800aaacctaaag
ttctatgagg ctgtattgaa cgatagagtt aatttgcatc atcagatgtt 226860gtggccgctt
tgtagcattt gctaatctgt aacgcttggt tttctccccc agatgagcac 226920catgccagga
ctgcccaccc ggccctgctt ttatgacata gatcttgata ccgaaacaga 226980acaagttaaa
ggcttgttct aagtggacaa ggctctcaca ggacccgatg caggtaggct 227040gatgtgcttc
aactttttct ctctttgttt ttttccagtt catatccttt ttttgtcatt 227100ttttttttct
gataccacag aagtgatctg ataaatcctt tccttccttt gttactcaaa 227160ttcctaccaa
atgattttcc gtgactggag ttcactcctg ggtgtggctg tgagttacat 227220gctcagatgc
tcagtacaca acccttagtg tcctgcgctc gagtccagtg aaagccaagt 227280tatgtcatcc
ctgaatatag atgtgtttag agcatctctg accataacag aactctgtaa 227340actcattgtt
cttttccagg attaaaacca gcgtctgtta taattttaaa agagtaactt 227400cctaccttcc
tgccacgtat ctcttggtac agagctagaa tcatgtgaaa gaggaatctt 227460ggtgtatccg
tttcctgggg ctgccatgac aaagtatcgc aaactgggtg gcttaaaaca 227520gtggaaagtt
tgaaacagtt ctggaggcta gaagtccaaa gtcaaggtgt cagccaggcc 227580atacgccacc
tcgagactct gggtggaatc cttgctggcc tcttcctagc atccagtggt 227640ggctgtcact
ccctggcgtt cccaccttgc agctgcgtca ctcccgtctc agcctctgtg 227700gtcccaccac
gttctccctg tgtctgtgtc ttcacatgat gttcacctct tataaggacc 227760ccagtcctat
tggattaggg cccaccctaa gaacctcctt ttaacttgat tactctgcaa 227820agaccctcct
tccaaataag atcactctca gccaagcact ttgggaggcc gaggtgggtg 227880gatcacgaag
tcaggagttc aagaccagcc tggccaagat ggtgaaaccc cgtctctact 227940aaaaatacga
aaattagctg ggcgtgttgg tgggcacctg taatcccagc tactcgggag 228000gctgaggcag
agaattgctt gaacccggga ggcggaggtt gcagtgagct gagattgcgc 228060cactgcactc
cagcctgggc gacagagcaa gactctcctt ctaaaaaaaa aaaaatcact 228120ctcacaggtg
ctcagggtta ggacttaagc atatctttta ggggacacag ttcaacccag 228180aacactcaac
tgtgtggcac aacaggtgtg ttcggctctc ttgtccctgg acagacacat 228240agccatggga
cagaccattt aactagaggg ccccatgggt tccaggaagg gcagggcagc 228300agaaagtagg
ccacttgatg acttcgggtt tgcacagtct caggttgggc tttgactctg 228360agctgtctta
ggcagcccag acctggggac tctcggtctt attcatgtcc acgccacagc 228420actgcagcca
tacctgctgg cttctatcct gactccccat cgtgaagaaa aatgaccaca 228480ttcctaaacc
tcttccctgg gagacaactc ctaatcagaa gcttgaaagg ctcagggctg 228540cttctcctac
cccagggatc aaacatgtcc tcaccatatt ctaaagacac attctttccc 228600cgcaagcacc
ccgcctcctt ctccccactg actcctgttc ctgagcccag aacctctccc 228660tcttgccagc
atattctcgg cctttgcaga tgagcttttg tggcctaaag accttcttct 228720tcaaccatct
ctgagttacg gctccaagcc aggcacccag gattctcatt tcatttgttt 228780cactaccagg
agtccacttg aggacttgtt gttgttgttg ttgtttgaga cagagtctcg 228840cgctatcacc
taggctggag ttcagtggcg tgatctctgc tcactgcaac ctccgcctcc 228900cgggttcaag
caattctctt gcctcaccct cccaagtagc tgggattaca ggtgtgtgcc 228960accacgcccc
gctgattttt gtattttatt tatgtattta tttatttgcg acagagtctt 229020actctgttgc
caaggctgga gtgcagcggc actgtgtggg ctcactacaa cctctgtctc 229080ctgggttcaa
gcaattctcc tgcctcagcc tcccaagcag ttgggattac aggcgtccac 229140caccatgccc
agctagtttt tttattttta gtagagacag ggtttcacca tgttggccag 229200gctggtctcg
aactcctgac ctcaggtgat ccacccgcct cagcctccca aagtgctgag 229260attacaggcg
ggagccacca cgcccagcct aatttttgta ttttaataga gatggggttt 229320taccgtgttg
gccaggctgg tctcaaactc ctgacctcaa gtgagccacc tcgtctggcc 229380aagttttttt
gttgaggctt tgttgctaaa acactaaaga cctaggctat ggtgagaaag 229440ataagggaac
cataaatgaa attacattaa aacaaagtgt gaaattgttt gttttgagtt 229500gtggatgttg
tttgggggtg tttgtatgtg ttttttgctg aggagcggga gctgtgttaa 229560gtttgagcca
taactgcatc tttgatcttt gttatttcag aaaaaataaa actctagaaa 229620atgaagtaag
ggagactagt ttaacaaaga aacccataaa ctcaagtgtt gataaaatgt 229680tgtttcatct
tttggtgtga gcttgtgtgt tgccaatgtc ttccattccc cagtaaggct 229740gtggggaaat
aattgctatt tccccaattg ctagttggcc tagcaattct ctttttatgt 229800caggaaccca
ttatgttaca tttctgccat ttgtcatttt tctccagaat aagtgtgttt 229860gattctccca
tctctcttgg acagtatttt tccagctttt ggtgagtttt cacatgctct 229920ccaagtaggt
catataagtc caattatacc ttaatcactg tagttaggac tcaggtccta 229980ctcagggata
catccttagc attattaatg cataccctta aatcctttat tacctactcc 230040ctaacaagaa
ttcctccaga tcttatatac ggggcctgca gggcaggaca caggcctatg 230100gtaattactt
ccaatgtatc gggcattgtg ctaagcactt tgcatgggtg gtcacagcac 230160ctctggagcc
aagtgttgtc ccataagggc cacaaggctt caagaggacg gagcctaccc 230220cttggtcact
gaggggtggc cctggtctgc aaggctacag aactgccttg ccaccctgtg 230280gcactgcctc
tcatttcata gctggcacat gtcattttgc cacccccatc acatgtcatg 230340ctggcatcaa
cccagggagg tgggtagtat ttttgttgaa gagctgcaag aactctgccc 230400agagaagtcg
acaacctgct caagttgtgc agctaataac tacaaaagcc aagactcaac 230460tcagatctgt
tgactctcag tattacccca acctttatgc cccaaaccaa aggatgccct 230520tttcatttca
gtcacgggta aaatgaactt tggcccattc cgaaagatga taccagctat 230580ttgaacagac
aggtacttga ggataatgag gtaatttttt ctaataagta aagataattg 230640tttgggtatt
aaaagctatg attttgtcat aattcttttt agtccaagta taatagagcc 230700agctctttcc
taattctcta atggaatggc atttctaccc cttacccacc ccccaatcca 230760aggcctcttt
cttctttgtg aattgctaaa tacagttctg ggctagtggg cagtggttca 230820aaacccatcg
tttgaccctg cttgaacaat caaaaacaga tccagtgtcc taagataaaa 230880ccactataca
aagcaaacat tccctctcag tggctcctcg gtaatgaccc agcaagtatt 230940gtctgtttcc
cgtgcctctc actatagcca tcagggacca cttctactaa taatcagagg 231000tctgcggcta
ttcaaaatgc ataaaataaa aatgttaaag cttaggtgac agcttggaca 231060aacgtgggca
tctgaaagga aatgtctggc ttgctgacgt tgctccttca gggatcacac 231120cgtagatggt
gtgcataccc aggtgcggag cagcacactc attattcccc caggaaagtg 231180ttgaaggcgc
gttccactca ggaagaagat tgtgcttggt tgatgacaac tttctgtcct 231240tcccttcctc
acctttctag cctgtctcac ctcctaggaa accgtgtgaa accaaaggtg 231300aacattatat
gccaaaaaac atcagaattg gagctctcct tgctccctct tcccaagact 231360tctctgtagt
gccttcactg tggttagaat aagggcatga ctcattcttt tggatgtcac 231420taaatagaac
taattaaaat agattttttg tcccagtgaa attgatccgg ctgttgagtc 231480ttttgtttaa
agattattgg aatgaaggat ttgatacttt tccccccatt cttttttctc 231540acttggaaac
tttatcatat ttctgagctt tggcaagatt cattcaaatg aacacacaat 231600aaagtctggc
aaatttcctg aagaattcaa ggactccctt acttctttat tgtactgtgc 231660aaagttacgt
tgagtacgga atcagattca ctccagtcct gctacaaatg tttgttattt 231720acaaaagaat
tgctttttct tgaattacat acacagaaat aagttcagga aagaaattag 231780tttctttcct
aaactggctt tagggcatct acagacaaat ggttaaagaa aatgtggtat 231840gtatagagag
tggaatacta cttagccatt aaaaaagaat gaaataatgt catttacagc 231900agcatggatg
gaagtggaag ttattatgtg aaataagctg ggcacggaaa gactaatgct 231960gcatgttctc
actctcatgt ggaagctaaa aagcttaccc tcatagacag aataaatggt 232020agataccaga
ggctgggaag gatgggtggc tgggaggagg ttatgaagaa aggttggtta 232080tgggtacaaa
catacagtta gacgaagaaa taagctctaa tgtttgatag caaactagga 232140tgactatact
cagcaacaat attttgtata cttcaaagta acaggaagag agtggggtgt 232200acttgaaatg
ataccaacat acagtagtaa tgataaatac tcaagataat ggacacccca 232260ggtaccctga
cttgatcact acacattcca ggcacataat aaacactcat atatacccca 232320taagaatata
aaatattagg ccaggtgtgg tggctcatgc ttctaatctc agcagtttgg 232380gagtccgagt
gggtggatca cttgagccca ggaattcgag accagcctgg gaaacatggc 232440aaaaccctgt
ccctactaaa aatagaaaaa ttagccaggc gtcatggcac gcacctgtaa 232500tctcagctac
tcaggaggct gaggcaggca aatctcttga acccaggagg tggaggctgc 232560agtgagctga
gatcatgcca ctgcactcta gcctgggtga cagagcaaga ctcggtctca 232620aaaatatata
taatgtatca atacaaggga aaaatgggca gaaattttca atgatattta 232680ataaccataa
attaatcttg agttgggttg ctgcctgcca gcccttctgt gaacatgatt 232740tgttcagtcc
caagtcacaa accacagggt tacaaagtac cctgggaggg tgcctgagct 232800ctgtggtttt
agtcttgaaa ctttatcttt tcacaaagca attcaaaaat tgctcatcac 232860cattgtgttc
cgagaaaagg ggcgccaagg catcaccctg agccatccga ggagtgctgt 232920gtgccttctt
cctttctgct tccaagtagg atggaagctc cagctttata cttaaataat 232980cctcatcaac
atctaaaaga tagtgataat taaatctctc ctatctaact aaaatggaat 233040gttttaaagc
ttaggtaaca attctgaaat tcttaattcc ccccaaaaca aactttttct 233100gagcacattt
tgatttctta ttcccatgcc agaatagact gttttgtata tggtatcttc 233160caagatattt
tcccactgag ttaacttctg aaattgcttc ttccaacgca taagcaataa 233220acaacagatt
cgtttttaca ttctcacctt agctattgca tctctgattg ttttcatcta 233280taatgagact
ttctatactt tggccagtcc ttctgttttt cttaaaatgt gggaccacag 233340gccaggcacg
gtagctcgca tctgtaatcc cagcaatttg ggaggccgag gcaggcggat 233400cacctgaggt
caggagttca agaccagcct ggccaacatg gtgaaacccc atctctacta 233460aaaatacaaa
aattagtcgg gcatggtggc gggtacctgt aatcccagct acttgggaag 233520ctgaaccctg
agaattgctt ggacccagga ggcgaaggtt gcagtgagct gatatcgcat 233580cgctgcactc
cagtgtgggc aacagagcaa gactccatct caaaaaaaaa aaaaaaaagt 233640gggaccacat
actctaagca cctttagaca aaaccacagg atacaaatgt aatgtggcta 233700atcagctggt
cttccaccac ccacctctgg agcatggaag gccttgtcca gccttgcctt 233760gccaagccaa
gggtcatctc tagtccactg agcccaccct ctaccctagc acccctcctg 233820tcccaagagt
cattgaccct caggtcaagg gataccagag ccactacagt gaggctggcc 233880gtgtctctga
aatattttct tttttttttt ttgcccttct ctcttcctcg gcgctgccta 233940tggaggtggc
agccatctcc tcctcggcat cttggccgcc ctcagacccc ttgtgaagcc 234000caagatcgtc
aaaaagagaa ccaagaagtt catccggcac cagtcagacc attatgtcaa 234060aattaagcgt
aactggtgga aacccagagg cattgacaac agggttcgta gaagattcag 234120ggtccagatc
ttgatgccca acactggtta tgggagcaac aaaaaaacaa agcacatgct 234180gcccagtggc
ttccggaagt tcctggtcca caacgtcaag gagctggaag tgctgctgat 234240gtgcaacaat
cgtacttgtg ccgagatcgc tcacaatgtt tcctccaaga accgcaaagc 234300catcgtggaa
agagctgccc aactggccat cacagtcacc aaccccagtg ccaggctgtg 234360cagcgaagaa
aatgagtaga cagctcgtgt gcacattttc tgtttaaata aatgtaaaaa 234420cagccatctg
gcatcttcct ttaaaaaaaa agaaaagaaa gaaatatttt ctttcttttc 234480tttttttttt
tttttttgag acgcagtctc gctctgtcac ccaggctgaa gtgcagtggc 234540acgatctcgg
ctcactgcaa gctccgcctc ccgggttcac gccattctcc tgcctcaacc 234600tccagagtag
ctgggactac aggcacctgc caccacgccc agctaatttt tttgtatttt 234660tttaatagac
acggggtttc accgtgttag acaggatggt cttgatctcc tgacctcgtg 234720atccacccgt
gagccaccgc gcctggcctg aaatatttta tttttgttca tctttgttgt 234780gcttataata
ctcctgtttt ctgtttattc ccctgacttg aaccagatgt taggggagta 234840tttgaagctc
ctcgttccct ctcttcatca atatcttctg tctagaaact aaccaagcac 234900cttatctggc
tccagacctc tcagtgatct gtgggaggcc aactcgcagg atttcagcag 234960agtttgcact
ttctaggcct aaaagataag tccttccatg tgtaagttca gattaaaggc 235020aacatattgc
atttacattt tagagaggtg aacacattat tttctgggac ttaagtgttg 235080ttttgtttct
tttttttttt tttaatgtag tttatttctc tttctaagag attacaggat 235140tcaagaatgg
caatgaatct taatagcctt tgggagaaag cagcaaagct ttaatactaa 235200gttccaggac
catgaacttc catgttgtcc taaatgcatt tttcaaacat tgctttctgt 235260gcctactcta
taaaacctaa aaatccaagg tgacagtatt ccttatctaa aatcctaaac 235320aaggctgggc
gtggtggttc atacctgtaa tcccagcact ttgggaggac aaggcaggcg 235380gatcacttga
gcccaggagt tcaagaccag cctggccaac gtggtgaaac ctcgtcttta 235440ctaaaaatac
aaaaattagc cgggcatggt ggtgcacact tgtaatccca gctactccgg 235500aggctgtggc
gggaggattg cttgaacccg ggagacggag gttgcagtga gccgagatct 235560cgccactaca
ctccagcctg ggcgacaaag tgagactctg tctcaaaaaa ttaaaaaaaa 235620atttttttaa
atcctaaata ttctcattcc acaggatcct gcaaactgta gatcccaaac 235680ttggaagttc
tgggaattct gccagtatcc ccaagtaaat ctcttgggaa gtttagaatg 235740ctgcatggcg
tagctgtgcc tgctcttata gagtggtgca gactctgctt ctccacgtct 235800gctaagtaaa
atattccaaa cacagatcag ccttatctag cgaaccatca ctctgctctt 235860gaattttctt
ttggcctcaa actctagagc tataacatgt taacggttta acaccagagg 235920agaaagctat
tcttaaacta ctcaaacatg gttttttttt ctatttttca gactcctgaa 235980acagactact
ctttgccttt ttgctgcagt tggagaagaa actgaatttg aaaaatgtct 236040gttatgcaat
gctggagaca tggtgaaata ggccaaagat ttcttcttcg ttcaagatga 236100attctgttca
cagtggagta tggtgttcgg caaaaggacc tccaccaaga ctgaaagaaa 236160ctaatttatt
tctgtttctg tggagtttcc attatttcta ctgcttacac tttagaatgt 236220ttattttatg
gggactaagg gattaggagt gtgaactaaa aggtaacatt ttccactctc 236280aagttttcta
ctttgtcttt gaactgaaaa taaacatgga tctagaaaac caa 2363331330DNAHomo
sapiensmisc_feature(5)..(5)The number of "att" repeats starting at
position 5 is seven, but there can be a greater number of such "att"
repeats than seven. 13gaagattatt attattatta ttattttctt
30141987DNAHomo
sapiensCDS(159)..(1442)misc_feature(259)..(259)Xaa at position 259 of the
amino acid sequence = Arg or Gln 14gcagacttag ccgtgcattg caggcatgga
ggattaatca gtgacaggaa gctgcgtctc 60tcggagcggt gaccagctgt ggtcaggaga
gcctcagcag ggccagcccc aggagtcttt 120cccgattctt gctcactgct cacccacctg
ctgctgcc atg agg cac ctt ggg gcc 176
Met Arg His Leu Gly Ala 1
5ttc ctc ttc ctt ctg ggg gtc ctg ggg gcc ctc act gag atg
tgt gaa 224Phe Leu Phe Leu Leu Gly Val Leu Gly Ala Leu Thr Glu Met
Cys Glu 10 15 20ata cca gag
atg gac agc cat ctg gta gag aag ttg ggc cag cac ctc 272Ile Pro Glu
Met Asp Ser His Leu Val Glu Lys Leu Gly Gln His Leu 25
30 35tta cct tgg atg gac cgg ctt tcc ctg gag cac
ttg aac ccc agc atc 320Leu Pro Trp Met Asp Arg Leu Ser Leu Glu His
Leu Asn Pro Ser Ile 40 45 50tat gtg
ggc cta cgc ctc tcc agt ctg cag gct ggg acc aag gaa gac 368Tyr Val
Gly Leu Arg Leu Ser Ser Leu Gln Ala Gly Thr Lys Glu Asp55
60 65 70ctc tac ctg cac agc ctc aag
ctt ggt tac cag cag tgc ctc cta ggg 416Leu Tyr Leu His Ser Leu Lys
Leu Gly Tyr Gln Gln Cys Leu Leu Gly 75 80
85tct gcc ttc agc gag gat gac ggt gac tgc cag ggc aag
cct tcc atg 464Ser Ala Phe Ser Glu Asp Asp Gly Asp Cys Gln Gly Lys
Pro Ser Met 90 95 100ggc cag
ctg gcc ctc tac ctg ctc gct ctc aga gcc aac tgt gag ttt 512Gly Gln
Leu Ala Leu Tyr Leu Leu Ala Leu Arg Ala Asn Cys Glu Phe 105
110 115gtc agg ggc cac aag ggg gac agg ctg gtc
tca cag ctc aaa tgg ttc 560Val Arg Gly His Lys Gly Asp Arg Leu Val
Ser Gln Leu Lys Trp Phe 120 125 130ctg
gag gat gag aag aga gcc att ggg cat gat cac aag ggc cac ccc 608Leu
Glu Asp Glu Lys Arg Ala Ile Gly His Asp His Lys Gly His Pro135
140 145 150cac act agc tac tac cag
tat ggc ctg ggc att ctg gcc ctg tgt ctc 656His Thr Ser Tyr Tyr Gln
Tyr Gly Leu Gly Ile Leu Ala Leu Cys Leu 155
160 165cac cag aag cgg gtc cat gac agc gtg gtg gac aaa
ctt ctg tat gct 704His Gln Lys Arg Val His Asp Ser Val Val Asp Lys
Leu Leu Tyr Ala 170 175 180gtg
gaa cct ttc cac cag ggc cac cat tct gtg gac aca gca gcc atg 752Val
Glu Pro Phe His Gln Gly His His Ser Val Asp Thr Ala Ala Met 185
190 195gca ggc ttg gca ttc acc tgt ctg aag
cgc tca aac ttc aac cct ggt 800Ala Gly Leu Ala Phe Thr Cys Leu Lys
Arg Ser Asn Phe Asn Pro Gly 200 205
210cgg aga caa cgg atc acc atg gcc atc aga aca gtg cga gag gag atc
848Arg Arg Gln Arg Ile Thr Met Ala Ile Arg Thr Val Arg Glu Glu Ile215
220 225 230ttg aag gcc cag
acc ccc gag ggc cac ttt ggg aat gtc tac agc acc 896Leu Lys Ala Gln
Thr Pro Glu Gly His Phe Gly Asn Val Tyr Ser Thr 235
240 245cca ttg gca tta cag ttc ctc atg act tcc
ccc atg cnt ggg gca gaa 944Pro Leu Ala Leu Gln Phe Leu Met Thr Ser
Pro Met Xaa Gly Ala Glu 250 255
260ctg gga aca gca tgt ctc aag gcg agg gtt gct ttg ctg gcc agt ctg
992Leu Gly Thr Ala Cys Leu Lys Ala Arg Val Ala Leu Leu Ala Ser Leu
265 270 275cag gat gga gcc ttc cag aat
gct ctc atg att tcc cag ctg ctg ccc 1040Gln Asp Gly Ala Phe Gln Asn
Ala Leu Met Ile Ser Gln Leu Leu Pro 280 285
290gtt ctg aac cac aag acc tac att gat ctg atc ttc cca gac tgt ctg
1088Val Leu Asn His Lys Thr Tyr Ile Asp Leu Ile Phe Pro Asp Cys Leu295
300 305 310gca cca cga gtc
atg ttg gaa cca gct gct gag acc att cct cag acc 1136Ala Pro Arg Val
Met Leu Glu Pro Ala Ala Glu Thr Ile Pro Gln Thr 315
320 325caa gag atc atc agt gtc acg ctg cag gtg
ctt agt ctc ttg ccg ccg 1184Gln Glu Ile Ile Ser Val Thr Leu Gln Val
Leu Ser Leu Leu Pro Pro 330 335
340tac aga cag tcc atc tct gtt ctg gcc ggg tcc acc gtg gaa gat gtc
1232Tyr Arg Gln Ser Ile Ser Val Leu Ala Gly Ser Thr Val Glu Asp Val
345 350 355ctg aag aag gcc cat gag tta
gga gga ttc aca tat gaa aca cag gcc 1280Leu Lys Lys Ala His Glu Leu
Gly Gly Phe Thr Tyr Glu Thr Gln Ala 360 365
370tcc ttg tca ggc ccc tac tta acc tcc gtg atg ggg aaa gcg gcc gga
1328Ser Leu Ser Gly Pro Tyr Leu Thr Ser Val Met Gly Lys Ala Ala Gly375
380 385 390gaa agg gag ttc
tgg cag ctt ctc cga gac ccc aac acc cca ctg ttg 1376Glu Arg Glu Phe
Trp Gln Leu Leu Arg Asp Pro Asn Thr Pro Leu Leu 395
400 405caa ggt att gct gac tac aga ccc aag gat
gga gaa acc att gag ctg 1424Gln Gly Ile Ala Asp Tyr Arg Pro Lys Asp
Gly Glu Thr Ile Glu Leu 410 415
420agg ctg gtt agc tgg tag cccctgagct ccctcatccc agcagcctcg
1472Arg Leu Val Ser Trp 425cacactccct aggcttctac cctccctcct
gatgtccctg gaacaggaac tcgcctgacc 1532ctgctgccac ctcctgtgca ctttgagcaa
tgccccctgg gatcacccca gccacaagcc 1592cttcgagggc cctataccat ggcccacctt
ggagcagaga gccaagcatc ttccctggga 1652agtctttctg gccaagtctg gccagcctgg
ccctgcaggt ctcccatgaa ggccacccca 1712tggtctgatg ggcatgaagc atctcagact
ccttggcaaa aaacggagtc cgcaggccgc 1772aggtgttgtg aagaccactc gttctgtggt
tggggtcctg caagaaggcc tcctcagccc 1832gggggctatg gccctgaccc cagctctcca
ctctgctgtt agagtggcag ctccgagctg 1892gttgtggcac agtagctggg gagacctcag
cagggctgct cagtgcctgc ctctgacaaa 1952attaaagcat tgatggcctg tggacctgca
aaaaa 1987151284DNAHomo
sapiensmisc_feature(776)..(776)n at position 776 of the nucleic sequence
= c or g 15atgaggcacc ttggggcctt cctcttcctt ctgggggtcc tgggggccct
cactgagatg 60tgtgaaatac cagagatgga cagccatctg gtagagaagt tgggccagca
cctcttacct 120tggatggacc ggctttccct ggagcacttg aaccccagca tctatgtggg
cctacgcctc 180tccagtctgc aggctgggac caaggaagac ctctacctgc acagcctcaa
gcttggttac 240cagcagtgcc tcctagggtc tgccttcagc gaggatgacg gtgactgcca
gggcaagcct 300tccatgggcc agctggccct ctacctgctc gctctcagag ccaactgtga
gtttgtcagg 360ggccacaagg gggacaggct ggtctcacag ctcaaatggt tcctggagga
tgagaagaga 420gccattgggc atgatcacaa gggccacccc cacactagct actaccagta
tggcctgggc 480attctggccc tgtgtctcca ccagaagcgg gtccatgaca gcgtggtgga
caaacttctg 540tatgctgtgg aacctttcca ccagggccac cattctgtgg acacagcagc
catggcaggc 600ttggcattca cctgtctgaa gcgctcaaac ttcaaccctg gtcggagaca
acggatcacc 660atggccatca gaacagtgcg agaggagatc ttgaaggccc agacccccga
gggccacttt 720gggaatgtct acagcacccc attggcatta cagttcctca tgacttcccc
catgcntggg 780gcagaactgg gaacagcatg tctcaaggcg agggttgctt tgctggccag
tctgcaggat 840ggagccttcc agaatgctct catgatttcc cagctgctgc ccgttctgaa
ccacaagacc 900tacattgatc tgatcttccc agactgtctg gcaccacgag tcatgttgga
accagctgct 960gagaccattc ctcagaccca agagatcatc agtgtcacgc tgcaggtgct
tagtctcttg 1020ccgccgtaca gacagtccat ctctgttctg gccgggtcca ccgtggaaga
tgtcctgaag 1080aaggcccatg agttaggagg attcacatat gaaacacagg cctccttgtc
aggcccctac 1140ttaacctccg tgatggggaa agcggccgga gaaagggagt tctggcagct
tctccgagac 1200cccaacaccc cactgttgca aggtattgct gactacagac ccaaggatgg
agaaaccatt 1260gagctgaggc tggttagctg gtag
128416427PRTHomo sapiensmisc_feature(259)..(259)Xaa at
position 259 of the amino acid sequence = Pro or Arg 16Met Arg His
Leu Gly Ala Phe Leu Phe Leu Leu Gly Val Leu Gly Ala1 5
10 15Leu Thr Glu Met Cys Glu Ile Pro Glu
Met Asp Ser His Leu Val Glu 20 25
30Lys Leu Gly Gln His Leu Leu Pro Trp Met Asp Arg Leu Ser Leu Glu
35 40 45His Leu Asn Pro Ser Ile Tyr
Val Gly Leu Arg Leu Ser Ser Leu Gln 50 55
60Ala Gly Thr Lys Glu Asp Leu Tyr Leu His Ser Leu Lys Leu Gly Tyr65
70 75 80Gln Gln Cys Leu
Leu Gly Ser Ala Phe Ser Glu Asp Asp Gly Asp Cys 85
90 95Gln Gly Lys Pro Ser Met Gly Gln Leu Ala
Leu Tyr Leu Leu Ala Leu 100 105
110Arg Ala Asn Cys Glu Phe Val Arg Gly His Lys Gly Asp Arg Leu Val
115 120 125Ser Gln Leu Lys Trp Phe Leu
Glu Asp Glu Lys Arg Ala Ile Gly His 130 135
140Asp His Lys Gly His Pro His Thr Ser Tyr Tyr Gln Tyr Gly Leu
Gly145 150 155 160Ile Leu
Ala Leu Cys Leu His Gln Lys Arg Val His Asp Ser Val Val
165 170 175Asp Lys Leu Leu Tyr Ala Val
Glu Pro Phe His Gln Gly His His Ser 180 185
190Val Asp Thr Ala Ala Met Ala Gly Leu Ala Phe Thr Cys Leu
Lys Arg 195 200 205Ser Asn Phe Asn
Pro Gly Arg Arg Gln Arg Ile Thr Met Ala Ile Arg 210
215 220Thr Val Arg Glu Glu Ile Leu Lys Ala Gln Thr Pro
Glu Gly His Phe225 230 235
240Gly Asn Val Tyr Ser Thr Pro Leu Ala Leu Gln Phe Leu Met Thr Ser
245 250 255Pro Met Xaa Gly Ala
Glu Leu Gly Thr Ala Cys Leu Lys Ala Arg Val 260
265 270Ala Leu Leu Ala Ser Leu Gln Asp Gly Ala Phe Gln
Asn Ala Leu Met 275 280 285Ile Ser
Gln Leu Leu Pro Val Leu Asn His Lys Thr Tyr Ile Asp Leu 290
295 300Ile Phe Pro Asp Cys Leu Ala Pro Arg Val Met
Leu Glu Pro Ala Ala305 310 315
320Glu Thr Ile Pro Gln Thr Gln Glu Ile Ile Ser Val Thr Leu Gln Val
325 330 335Leu Ser Leu Leu
Pro Pro Tyr Arg Gln Ser Ile Ser Val Leu Ala Gly 340
345 350Ser Thr Val Glu Asp Val Leu Lys Lys Ala His
Glu Leu Gly Gly Phe 355 360 365Thr
Tyr Glu Thr Gln Ala Ser Leu Ser Gly Pro Tyr Leu Thr Ser Val 370
375 380Met Gly Lys Ala Ala Gly Glu Arg Glu Phe
Trp Gln Leu Leu Arg Asp385 390 395
400Pro Asn Thr Pro Leu Leu Gln Gly Ile Ala Asp Tyr Arg Pro Lys
Asp 405 410 415Gly Glu Thr
Ile Glu Leu Arg Leu Val Ser Trp 420
4251730DNAHomo sapiensmisc_feature(16)..(16)n at position 16 of the
nucleic sequence = c or g 17tgacttcccc catgcntggg gcagaactgg
301819887DNAHomo
sapiensmisc_feature(8450)..(8450)n at position 8450 of the nucleic
sequence = c or g 18gcagacttag ccgtgcattg caggcatgga ggattaatca
gtgacaggaa gctgcgtctc 60tcggagcggt gaccagctgt ggtcaggaga gcctcagcag
ggccagcccc aggagtcttt 120cccgattctt gctcactgct cacccacctg ctgctgccat
gaggcacctt ggggccttcc 180tcttccttct gggggtcctg ggggccctca ctgagatgtg
tggtgagtaa ctcgcctcta 240tcctgtgcct ctttcctcct gggtccttag tggggtggct
agggcatagg atgagggaac 300ttacctgccc ttctaagctc ccatagcagt ttgggcttag
ctggacctca gcatttaaca 360catcctattg tgattgatta tatgtttgac tcctcaccag
acaagatctc cgttaattca 420gtcattcgtt cacacattca ttcagcgcat actgagcctt
ttctgtgtca ggcccagtgt 480tagcctttgg ggaacgtgca aagcatgaga caagtctaat
ccctgccatc ctagagctta 540tgttctaggg aagggggaca gacaaaagaa atggttaggt
gctcccacct gaaatctcag 600cattttggaa ggctgaggcg ggaggggagg atcgcttgag
ctcaacagtt caaggtcagc 660ctgggcaaca tagggagacc ccatctctac aaaaaataaa
aaaaattaaa aaatagctgg 720gcatggggaa gactttctga agaccaagag gacacatggg
agctgaaact cgaaggaaga 780aaaggagctg gcaggaaagg agtgggggac acacattcta
ggcagcagga agtgagcctt 840cggaggtcct gcctgctcca gctctgtgcc ccaaggggtc
tcttggagca cagtctcctg 900ggacctgtct atgagtctga gcttagaggc tcagggctgc
tccttcagac aggaggcaga 960aggcaggctt tgggaacttt gggccgccca cgcgcctttt
ctcctcctct gcacctagga 1020ttacgttgag caatacactt tcacccccat ggtctcttga
gaccctgggg aaaccctgag 1080aggtgggtgc agtcatgtcc aggtgtcaag tgaagaagtc
gagggttgga ggggctgagt 1140gacccactca gggtgctcca ccttttccag agctttgctg
aacttagttt ttagaacttg 1200aagcctcgtt tgttttcgtt ttgttttttg ttgagagagg
ttctccctct gttgcccagg 1260ctggagtgca gtggcacgat cttggctcac tgcagcctct
gccttgtggg ttcaagtgat 1320tcccccacct cagcctccca agtagctgga gactgcatgt
gcatactacc atgcttggct 1380aatttttgta tttttttgta gagacagggt ttcgccatgt
tgcccaggct ggtctcgaac 1440tcctgggctc aagtgaaact cttgcgtcgg cctcccaaat
tgctgagatt acaggcgtga 1500gccaccgtgc ccggccagaa ctccaagcct ctcatctgtg
ttccataaat gcaatcagac 1560acctcaggtc tgggcccagg aaccccagct cttggttcat
gtccggacag tccccagggg 1620agttctgggt tcaaccagca agagctcttc ctcctggctg
atctggtcct cagccttgga 1680cagttagtcc attaacctga ccccacagga gccccaatcc
cttggggtct ggggaatctt 1740gaactggggt ttggggtgca aatatctgca ctgagtcact
taattgcacc cagcctcatt 1800cctttatctg taaagtgggc taagaatgct cccctgcctt
cctcctcggt gtagtacaag 1860gaaggatccc atgacacctg ctctcccagt ttaaagctct
atatgtatgt tgtgaaattg 1920acagggatcg ctgcacaaac gctaatgcaa agtgggctcc
tgtgcttcct tttctctttc 1980ttcttctttt tttttttttt aattttcttc tagagatgag
gtctcactat attgcccagg 2040gttggtttca aactcctagg gtcaagcgat cctcccacct
tggcctccca aactgctggt 2100attacaggcg tgagccactc tgtctggctc ctgtgcttgt
gaatgtcaac agcaatcagc 2160ccttagctgg cagggctggg ttggtagggc gagagctcac
ccaaggctgc ttttattacc 2220ctgcgtgaat ctgcctggcc ccttccttct aaggaggttg
ctctgtggtt gtcagtctct 2280ccctttacag ctggatcctg atctttcagt ttctaaccct
gtgctgactc atcgtgctgg 2340aagtgagagc ccggggtgag gtcagggaac tcccttgcgc
gtttcaagaa aagggaaaag 2400gaaagagagg tgaggagggg ggcagatgac cagagagaca
caggctgaga gagactgaga 2460cagacccaga gagcctcaca cattgagtga cagagacgga
gaaatggaga taggcaccaa 2520aaaatggttc tcagtgacag aaagggaaaa aagcaacccc
ccagtctctc ttaacatctg 2580gtgagaaacc agccatgtgc tttggtctgg gcccacacag
caaaggatta tgtagggttt 2640catgctggtg gatggtcacc ttatagcaac aggtatctgg
ggctgtcggg aaaacagaca 2700cgaggttgtg ggacccagac ccacagagat ggagctgttc
taggagctct ggtcctcgtt 2760ctggtcccct gggatatggc acagtgaagg ccaccatcag
gcagctggag cccagcagca 2820actgggaggc agtaaacagg gaccgaaagt gcaaggttac
ctccgaggca aactactcta 2880agctaccctg tgctgagctc aagtcccttg gaactatccc
taaggcttcc gcttccagag 2940tgtttgagta ttttcgttgc acagcttcga ataaatccca
cagcaacagg taaacggctg 3000caagctgtga ctgttttcta agagctcatc tcacaatctc
aggtcctctt catttaaaca 3060gagatggcag gaaaggcgtt attttgagat ctgcatggag
gaagttcacc aggcagcctc 3120aattcaccag ctggaagttt gcgttgtttg gaaatttgat
gtgtaacacg ttctgcatgt 3180gggctgatgt ttttgtaaac gggtagcaca cacattcagc
agggcaccaa agagcggggg 3240ctttgcagtt aggtccatcc ttggctctgc agccttgtgt
aagacatgac acgactttga 3300acttctgttt cctcttctgt gcaaagcaat gatgacagta
tctacatcac aggactggca 3360tgaggaccaa gtgagattgg gcaaggtgcc cgggcacacc
agtctcactg tcactgctga 3420tgggcagagt ggttgcctgg cagtagcatc ctctatcttc
agcccaccac ctctcttgct 3480ggctcactcc aactgctctt tagagataca cgcttcccct
cttttctcct cccactgcct 3540ttcagtatgg ctgcatttcc ccctgcaagt tggtgtgtgc
tgggtggagg tgggggtgag 3600gacatgtatt ctctggagaa ggccctggta acgtcaaagc
acttctttgc tggtggcctg 3660gccctgtgac ctcatttgta ccattttctt ttctaagaaa
taccagagat ggacagccat 3720ctggtagaga agttgggcca gcacctctta ccttggatgg
accggctttc cctggagcac 3780ttgaacccca gcatctatgt gggcctacgc ctctccagtc
tgcaggctgg gaccaaggaa 3840gacctctacc tgcacagcct caagcttggt taccagcagt
gcctcctagg gtattgccac 3900actctctttt tccatgtctt gctccacata ctaagagatg
ggaaacttgg gtactagttt 3960gggcctgtca ccactttgtg ggcagacctt aggcaaattt
tctccatcta tagaatggag 4020gacctttgtc catctataga atgaaggggt tggttggatt
agatcagaga tgctaatgca 4080aggctccttt tgctactact gtccatcatg tgtctgaggc
agacataact aatccgtgac 4140tatactcttt gatgatgagc ccaggagcag catctgactc
tatgctccct tagtgtgcct 4200gaggcagata tcactaatcg atgactgcag tcttctacat
tgagcttaga agcagcatct 4260gactctgtat gctcccctcc catgcatgag gcagacatca
gtaatccatg accgcattct 4320ttcatactga gcccagaagc agcatctttt cttttctttc
ctctcactct gttgcccagg 4380ctagagtgca gtggcacaat cttggcttgc cccaacctcc
aattcccggg ttcaagtgat 4440tctcgtgcct cagccacctg aatagctggg attacaggcg
tgtgccacca tgcccagctg 4500atttttgtat ttttggtaga gatagggttt caccatgttg
gccaggctgg tcttgaactc 4560ctgacctcag gtgatccgcc tgtcttggct tcccaaagtg
ttgggattat aggcatgagc 4620cactgcacca atccaaaagc agcatctttg tgctcccttt
tcaagaggca tcacagagag 4680gcctgttttg gggtttgaat gagaggcgaa gaatcagcca
tggagtgcct ctttctcaga 4740ctccctcttg agaagtggat gcaggggtgg agagaaaaga
agactaggca tagtggctca 4800tacctgtaat cccaacattt tgggaggctg aggcaggaag
attgcttgag ctcaggagtt 4860tgagaccagc ctaggcaaca tagtgagacc acatctctta
aaaaaaagaa aaagaaaaaa 4920aatgagccag gtgtagtgac tcatgcctgt ggtccccact
tctccggagg caaaggtggg 4980aggatctttt gaggctgaga aatcgaggct acagtgagcc
atggtggcac cactgcactc 5040cagcctggga gacagagaga ccctatctca gtaaaaaaaa
aaaataaaaa tatggctggg 5100tgtggtggct cacgcctgta atcccagcac tttgggaggc
caaggtaggt agatcacatg 5160aggttaggag ttcgaaacca gtctggccaa catagtgaaa
ccctgtctct actgaaaata 5220caaaaaatta gccaagggtg gtggtgggca actgtaatcc
cagctacttg ggaggccgag 5280gcagaagaat cgcttgaact cgggaggcgg aggttgcagt
gagctgagaa catgccactg 5340cactccagcc tgggcaacaa gagcgaaact ctgtctcaaa
gaaaataaat aaataaaata 5400aaaaaataaa aaaggagggg gcatatgggt gaagtatgga
caaaatagtg gggcaggcac 5460agatgatctg gacacaggag cccttggagt ttattcttga
atctaactgt tcatctttat 5520taaatatttg tggcatacac ctcacaacaa catagccaac
acacctcctt ttggagcttt 5580tatcaaagtt tcccactgtt aagatttttt cccgctttgt
gatgcgggtg gggtgggtgc 5640tgtaagcagg cttacggggt ggcagtttct cacaaaggca
ttaactggcc ttgtcctagg 5700tctgccttca gcgaggatga cggtgactgc cagggcaagc
cttccatggg ccagctggcc 5760ctctacctgc tcgctctcag agccaactgt gagtttgtca
ggggccacaa gggggacagg 5820ctggtctcac agctcaaatg gttcctggag gatgagaaga
gagccattgg tgagcagaca 5880ccatccgctg ggggtgggga gcagctggga gggctcatca
gatgatattc tccaatgaga 5940atcagaactt tgggttttct ccccaggcgt ctttcccacc
atccattctg cccatctcac 6000tgcctacgta gaggctcgaa cctgtcccca tagccatcct
tgacccagct tttcccgcgc 6060tgcacacata ctattgacag gtgtgtttcg tggttttttg
ttttttgttt gtttgtttgt 6120tttgagttgg aggtttgctc ttgctgccca ggctggagta
caatggcgca atctcagctc 6180accgcaatct ctgcctcctg ggttcaagca attctcctgc
ctcagcctcc tgagtagctg 6240ggattacagg catgcgccac cacacccagc taattttgta
tttttagtag acgtggggtt 6300tctccatgtt ggtcaggctg gtctcgaact cctgacctca
ggtgatccgc ttgccttagc 6360ctccgaaagt gctgggatta caggcatgag ccactgcgtt
aggcccactg acaagccttg 6420tattggctag ccaccaagat tgacttgatt atccaccttc
gggacaactg gacagcctgc 6480ttatgactta cgccatagtc tgtctctact agctctcctg
ccctgacttg acccagcata 6540caacagccag agccagcctt ttcaatataa acctgatctt
gctggcactg cttaaaccct 6600gcaggggcct cgcactgctc catggcccag cctgtctacc
cttaccttct gcccaggctg 6660tgctcatcca ttctctgcct cccacacacc tgccctctgt
gggctccagc cataccatct 6720ctcaactcat aagccagttt tttcatacag gctccctcca
tctggactgg cttccctgcg 6780tgcagttcac tcctgctcta cctttggctc tgcctccacc
catcctcagc cgtctccagc 6840attacctcct tggagaatcc tgccttgact tcccagccac
ccaaatatca ctacttggtc 6900tgcattctcg ttgcaattgc agtcgcatga gcaattgctg
tggttgaggc ccgaactgcg 6960cagtgcctgt ctgccatggg tctcctgctt cctctaagca
cagtgcctga cacacagtga 7020gacctcagca cgtatgggct gaggcaatga aggaatgaag
gatcccatga cccaaaagag 7080cgtgttggaa agtgcaggcc agggtcccag gtgctggcgg
ggctggctgc tgggtggggg 7140cagagaggca acccctctgt ttttttccct ctcagggcat
gatcacaagg gccaccccca 7200cactagctac taccagtatg gcctgggcat tctggccctg
tgtctccacc agaagcgggt 7260ccatgacagc gtggtggaca aacttctgta tgctgtggaa
cctttccacc agggccacca 7320ttctgtgggt gagtaggtca gaccgtgcca aggccaggct
ggcactccct cagtccccag 7380gtctgcactg atgacctcca taccctggcc cccacactca
cctttccttg gggctcctcc 7440gaatcaagtc ctttagggac gaattggcga gggctcatgg
gtgatgcttc agctgtgagc 7500cagctttgga gctggtaggt ggatctcttg aggccaggag
ttcaagacaa cgtggtgaaa 7560ccccatctct actaaaaata aaaaagttag ccgggcatgg
tggcacatgc ctgtagtccc 7620agctactcgg gaggctgagg caggagaatc acttgaacct
gggaggcgga ggctgcagtg 7680agtggagatc gcaccactgc cctccagcct gggcaacaga
gtgagtgaga ctctgtctca 7740aaaaataaaa aataaaataa aactccccta gtgattccaa
tgtgcagcta agtttggaaa 7800taggtggtat ggggtcaagt cctcttgggc ctccctcctc
cagtccttct ccctaacctc 7860tagccctcaa gttgcagagt gatcagccaa accagtttgc
ccagaaatga gcagtttcct 7920gggacacagg attttcagag tccagacaag gaaagtcttg
ggcagaccag gttgagttgg 7980tgcccttagc tgatctgacc atgttgccct tcttctccaa
gccctcctgt ggttgtccat 8040agctacaagg gcctgaccct caagcccctg cctgtcctgg
cccctttggc tctccagctc 8100attgcatgtt ctgtccccca cttcaagaca cagcagccat
ggcaggcttg gcattcacct 8160gtctgaagcg ctcaaacttc aaccctggtc ggagacaacg
gatcaccatg gccatcagaa 8220cagtgcgaga ggagatcttg aaggcccaga cccccgaggg
ccactttggg aatgtctaca 8280gcaccccatt ggcattacag gtgggaaaga gaccctggag
ccatggccac cctggggaac 8340agtcaggggt ggagtggtca ggtgctggaa cacctagccc
ctccctgccg gctgacttcc 8400tctctctctt cctcactcta tcaccagttc ctcatgactt
cccccatgcn tggggcagaa 8460ctgggaacag catgtctcaa ggcgagggtt gctttgctgg
ccagtctgca ggatggagcc 8520ttccagaatg ctctcatgat ttcccagctg ctgcccgttc
tgaaccacaa gacctacatt 8580gatctgatct tcccagactg tctggcacca cgaggtagcc
caactttttg tggaagcaca 8640gccctttaca atctgctgcg cacccattga cgtcccagtg
aggggaggtt gcttcatcct 8700gatttgctga gtcagcacaa gattgtgggt gtgcatggga
cacagcagcc aaaatgtggt 8760catagcttct agaagctcac agtgtgggga ggaagacagt
aaatggagat ccctgggcat 8820atcgcttgtg tgatacccag tacagaaatg tttggatgga
tggatggatg gatggatgga 8880tggatggatg gatggatgga tgaggagaga cacattttgg
ttaactctaa tacaacatga 8940taagccccag tagcagcatg atccaggctt tctctgagag
agggtctgag gacgtgactg 9000ggatttgcca attaagaatg gagaaagagg ccaggtgcag
tgactcatgc ctgtaatccc 9060aacactttgg gaggccgagg cgggtggctc acctgaggtc
aggagttcga gaccagcctg 9120gctaacatgg cgaaactcca tctattaaaa atacaaaaaa
gtagctgggt gtggtggcga 9180gtgcctgtaa ccccagctaa gctactcagg aggctgaggc
aagagaatca cttgaacctc 9240agaggtggag gttgcagtga gccaagatca tgccactgca
ctccagtctg ggtgacagag 9300taagactatg tctcaaaaaa aaaaaaaaaa aatggagaag
aaggaagctg gacatggtgg 9360ctcgtgctta taatcctagc actctgggaa gctgaggcag
atggattgcc tgagcccagg 9420agtttgagac cagcctgggc aacatggtga aaccctgtct
ttactaaaat acgaaagatt 9480agccaggcat ggtggtagac acctataatc ccagctacta
gggaggctga gccacaagaa 9540tcacttgaac ctgggagaca gaggttgcag tgagccgaga
tcgcgccatt gcacttcagc 9600ctgggcgaca gtgtgagact ctgtctccag aaaaaacaag
aatggataga gtggagccaa 9660gaagaggcag gaagaacaaa gacacagagg tgcacagagt
ttgggggaat tttgaggaat 9720ggtcttgcaa aagagtggga tctgggagaa tgagtgggag
tggaaagcag atgaatgaag 9780agaaggtgag cgcatcaggg taacagagat gcgttgtgaa
caaatgcatg ttctaggaag 9840agccctctgg agtgctaggt gccagagagg tgggaggaag
gatactggaa gcagagaaac 9900cagtgagggg cctgatcttg ggtggtgggg aatgagggac
aggggaggcc gggatggaag 9960ccaggtggtg gggaatgagg gacaggggag gccgggatgg
aagccaggtt tcagctgagc 10020aggtggcggt ggcattgatg gagatgagga catggggaag
gacaaagtcc aggtgtcctt 10080gagggaagac aagaagacaa ataatccagg ctctctgtcc
tcacaccagc tgcccgcccc 10140tttcttcctg gcacagtcat gttggaacca gctgctgaga
ccattcctca gacccaagag 10200atcatcagtg tcacgctgca ggtgcttagt ctcttgccgc
cgtacagaca gtccatctct 10260gttctggccg ggtccaccgt ggaagatgtc ctgaagaagg
cccatgagtt aggaggattc 10320acgtgagact cccacctccc agtcctcacc ccacccaacc
tcacatgcct gataacaggg 10380tcacagaaga gacggggaac agaggagagg gttccctcgg
gagagacact ggccctgctt 10440ctgcttctac ctgctcagct cctttcttgc ccacggtgtt
atggaaacag ggagccatag 10500gccagcattg tcactgagag agcaggcttt ggaggcagag
ccccccagtt ggaatcccaa 10560ctctaaccag ctaggttcca ggtaggcacc cacaattcac
cgaggagaac agttgtgccc 10620cttccctgca gggccagtgt gaagagtcca ggagttagta
cacatagaga tagtggcatg 10680tgctttttat atgtgcaagg tccagcacat agcaagcgct
caacacagcg ttgctttcat 10740cagagtaaga actgtttttt gtttgtttgt ttgtttgttt
ttaagagaca gggtctcaat 10800cttatcaccc aggctggagt gtaattgtgc aatcacgtct
cactgcagtc tcgaactctg 10860gggatgaagc aaccctactg tcctgcctca gcctcccaaa
tagctgagac tataggcacg 10920tgccacacaa ccctgggtaa tttttttttt tttttttttt
gagatagggt ctctgtctgt 10980tgcccaggct ggtctcaaat tcctggcctc aaaccatcct
cacacctgag gcgctcaaaa 11040tattgggatt ataggtgcga gccatcatgc tcagccagaa
taataactgg ttttttttgt 11100tttttttttt gagacagagt ctcactctat tacccaggct
ctggaggccc aactcgtgtt 11160tgtgtatttg tttattttta tttatttatt tatttcgaga
cagagcctct ctctttcacc 11220taggctggag tgcagtggcg caatctcggc tcactgcaac
ctccgtctcc tgggttcaag 11280tgattgtcct gcctcagcct cctgagtagc tggcgctaca
ggcgcgtgcc accatgccca 11340gctaattttt gtatttttag tagagacagg gttttactat
gttggccagc tggtttctaa 11400ctcctgaact cgggtgatct gcctgcctcg gcctcccaaa
gtgctgggat tacaggcatg 11460ggcctccgtg cccggccatg tatttattta ggcaaggtct
ctctctgtta tccaggctga 11520agtgcagtgg cacattcata gctcactgca gcctcaaatt
atccaagtaa cagggactac 11580aggcatgcac caccacaccc atctactttt ttttgagatg
gagtctccct ctgtcgccca 11640gactgggttg cagtggcaca atttcagctc atggcagcat
ctacctccca ggttcaagcg 11700attctccttc ctcagtctcc cgagtagctg ggactatggg
catgcaccac catacctggc 11760taatgtttat attttgagta gagatggaat tttgccattt
tggccaggct ggtcttgagc 11820tcttgacctc aagtgatatg tctgcctcag cctcccaaag
tgccgggatt acaggcatga 11880gccactatgc ctggcccaga ataagaactt ttattttatt
tgagacaggg tcttactctg 11940tcacctaggc tagagtgcag tggcacaatc acagctcact
gtagcctcaa tcttccgggc 12000tctgatcctc ccaccttagc ctcccaggta gctgggaaca
acaggcatgc gccaccacgc 12060ttggcaaata tttttttaaa attttttgta gacatgggat
tgcttcatgt tgcccagact 12120ggtctcaaac tcctggactc aagcgatcta cccaccttgg
cctcccaaag tgctgagata 12180acaggcgtga gccgccgtgg ctggccagaa taagaacttt
tattatgaaa taagtttggc 12240cctatcctgt tattaattac ctcaaagagg ttggacaatg
tccttacact tgcttctcac 12300cctgagctta gaatagtttg agaatgagca tgtctggact
gttgtttgac aggtcctgtc 12360catgatgaaa taagatgaat gtaattctca tgttatcagt
ggggagtcca gcccagagaa 12420gttgaatgcc ttccaccagg tcacgcacac cacagacttc
tcccctagaa tcatgggcct 12480atctctgtcc ccagtgggag ttcctggaaa acagcatcct
gcatccaggg agggcccaca 12540gttggttcct tcgggtcctc taggctagag aatctcagac
cggagaaatg caaagactcc 12600tctgaagtac aaacattctc ccagacccca ggttaagtta
tttcttctat aatatcactt 12660agatgtgtac tgataatttt attaaatata aagtctttgg
ccagagctgg tggctcactc 12720ctgtaattcc agtgatttgg gaggctgagg caggaggatc
acttgagcct aggagtttga 12780gaccagcctg ggcaacatag tgagactgcc atctcttcaa
aaaaattttt ttaattagct 12840gggcatgatg gtgtgcacct ataatcctag ctacccagaa
ggctgaggtg ggaggactgc 12900ttgagcccag gagttcaagg cggcagcaag ctatgatgac
gtcactgtcc tgtagcctgg 12960atgacaaaac tttttttttt tttttaaaag acaaagggct
gagtgtgatg gctaacacct 13020gtaatcccag cactttggga ggccaaggtg ggcagattac
ttgaggccag gagttcaaaa 13080ccagccgggc caacgtggcg aaagcccgtc tctaccacac
acaaaaaaag tcctttctct 13140ttctagctct agacagctat aaatccccaa caactttata
aattaaatag aaaagtataa 13200aaaccaatac cattcaaata aggatgctct gcggaggtgc
tggcctgcac tttagcacct 13260ccctgtattc aagctgcagt gaagcctggg cactcctggc
ctggctcagc cacttcattc 13320actatgaggg ccacctccgt ttccttcctg tggcctatgg
cacataacgc aatgctgctg 13380ggcaccagcg cagtcactga tggccacgca taggctgagt
gcccttggga gggctcagac 13440ccacagactg agattaccca gaaccagctg gtattcatgt
ttgaatattg tcatcaaaat 13500gaaagacaat tagaatttaa cgtttttatt cagagctcgg
ggatctctgg gaagacactg 13560tggactccag tctgagaaat ggttctcttg ttagtgtgca
tggatgcctc aactgggttc 13620aggcatgaag actatgtttg aggcccctga atgcacagca
gagaaagctc aactgcagac 13680acagatagac agatgtggag agaggtagcc ctgtgcagcc
acatggccct gcagagttaa 13740ggaatgggaa aagggctact ctgtgacatt ccttagaaca
gtagtttccc ttcgctaaag 13800tcaccccaaa tagggtggtg gttgttgttg ttgttttgag
acggagttgc attcagtcgc 13860ccaggctgga gtgcagtggc acgatctcgg ctcactgcaa
cctccacctc ctgggttcaa 13920gcgattctcc tgcctcagcc tcctgagcag ctgggattac
aggggcccac catcacacct 13980ggctagtttt tgtattttta gtagaaatag ggtttcacca
tgtgagccag gctggtgtca 14040aactcccaac ctcaggtgat ccacccacct ccgcctcctc
aagtgctggg attacagatg 14100tgacccaccg cgccccacct gtgtttctta aaaccaaaaa
aatccctcta agactgcaac 14160catcgataac ttagggctgt attaccccag agagcacttt
gtagaagatg cgtttgttca 14220catcacttcc ctgttacttt gacggtctca cctccttttc
agtcatttgc attctcccag 14280aacctgctca tttgttgttt tttgtttgtt tgcttggttt
tgtttttgag aagcagtctt 14340gctctgttac ccaggctgga gtgcagtggt gcgacctcag
gtcactgcaa cctccgccta 14400ctaggttcaa gtgattctcc tgcctcagtc tcctgaatag
ctgggattac aggtgtgcac 14460caccatgccc agcctaattt ttgtattatt agtagagatg
gggtttcacc atgttgtcca 14520ggctggtcat gaactcctga cctcaagtga tccacccgct
ttggcctccc aaagtgctgg 14580gattacaagc atgagccaca gtgcctggcc tgaccctgct
cttttgaaag accattcccc 14640caaattctgt gcacctgtgt gcctttcttc tctctgcctc
ctctcagctc tgccccgctc 14700tcctcccttc tcctctggca aatcccactc atctcttgaa
gcccttcttc caggggaagc 14760cctgatcatg ctgctttctc ctgtgggagg gatgaaggac
gtggcccacg gagtttgttt 14820tgttttgttt tgagatggag ttttgctcat gttgcccagg
ctggggtaca atggtacgat 14880ctcagctcac tgcaacctct acgtcccggg ttcaagcggt
tctcctgcct tagcctcccc 14940agtagctggg attactggca tgaaccacca cacctggcta
attttgtgtt tttagtagag 15000atggggtttc ttcatgttgg tcaggctggt ctcgaactcc
caacctcagg tgatctgcct 15060gccttggcct cccaaagtac tgggattaca gggttgagcc
actgtgcctg gcccaggccc 15120acggagtttt aagaggcttc ctgtggcagt ggcatccaga
cggagtgcag aaactcaaag 15180ttgaaggcca gaagctcagg gaagggggag tgtgagttga
ggagtctctt ggctgccagg 15240gccagaaacc gaactccaag cctctccaca acagcgggtg
tagagcatgt agaatcagag 15300aggaggctga gccatgcagc cccgagaaga ggggaatgcc
actgagccac agagacccag 15360tgccactgcc aggtgtctct gcctccactt cccatgaccc
tgcctgtctc tgtatgcagg 15420cttcaccctc tctcgttgta cattgtacac attctaggtg
acaccagcag cttctgattc 15480tcatccccca taacatcagc cccccagaga ggggacaact
gctgagctga taacataata 15540gatgcccctt tcctggaggc catggtcatg gtcagcgtgg
agaggatgaa gcctgagcag 15600gcaggatcgg gggtctggag gggaaggagg tggaagttga
gatcacagac ctgtggccag 15660gtggcctggg aagggtttga cgagtgtcgg cccaaagagc
ttggaaggga ttttgctgct 15720gtgggtgagc actgcctctc cccttaggga caacagccac
ctcttctctc cccatttgcc 15780tttcccttct gtagatatga aacacaggcc tccttgtcag
gcccctactt aacctccgtg 15840atggggaaag cggccggaga aagggagttc tggcagcttc
tccgagaccc caacacccca 15900ctgttgcaag gtgagtcatg gcctgacact ctggatgtgt
cccctacccc aagcttactc 15960agccaagagg cttcatcaac tcaccccagc tttccctagc
accctcctgg gccacacctt 16020cacaaaatca ctgatgctca aagttggata taatatattg
aactgaagcc ttagaatttt 16080tatgcaagtt actgtggaaa ttctaggaaa ccagacagat
tacaaaaaaa aaaaaaaact 16140agaagaaaat taacatcacc taggatatac tacctaggaa
taacgtcttt tattttgaga 16200tggagtttcg ctcttgttgc ccaggctgga gtgcagcggt
atgatctcgg ctcgctgcaa 16260cctccgcctc ctgggttcat gtgattcttc cacctcagcc
ttcctagagc ccaagtggtc 16320tgcctgcctc tgcctcccaa agttctggga ttacaggcat
gagccaccgc acccagccaa 16380aattacttaa cttttcttct agatactttt taaaaatatg
gcagtaagtt tttcataaaa 16440aatggagcca tgctatccag tggaaattta atgttgccca
catgtataac ttaaaaattt 16500catatatgtg tatacatata tatgaaatat atatatatac
agacacacat atatatgtat 16560acatatatat acacacatat atatgtatat atatatacac
acatatatat gtatacatat 16620atacacacat acacatatat acacacacat acacacacat
atacacacac atacatacac 16680atacatacac acatatatac acacatatat acacacatat
atatgtatac atatatatac 16740acacatatat atgtatacat atatacacac acacatatgt
atacatatat acacacacat 16800atatatgtat acatatatac acacatatat atgtatacat
atatacacac atatatatgt 16860atacatatat acacacacac acacacacat atatatatat
atatatattt tttttttttg 16920agatggagtc ttgctctgtt gcccaggccg gagtacaatg
gcgaggtctc agctcactgc 16980agcctctgcc ttctgagctc aagcaattct cctgcctcag
cctcccaagt agctgggatt 17040acaggtgtgt gccaccatgc ccagttattt ttgtattttt
agtagagacg gggtttcacc 17100atgttggcca ggctggtctc aaactcctga cctcaagtga
tctgcctgcc tcagcctccc 17160aaagtgctgg gattacaggc gtgagccacc acgcctggcc
tgcaaacaga cccttaaaag 17220tcatctgttc ctactcttgt tatttaagtt gctttgatac
tccacccctg aagaactttt 17280atggggcact taccatgtgt caagcattgt ggcgagtgct
tatagagaag gtgggacttt 17340ctgagggagc ccacaaaaaa gcattctttt cttgccctca
ttctgaataa attgaggaat 17400aaaaaatagg agctacagat ggagacccca agtctggcca
gttctattac tgataaagac 17460aggagggtga gagctaggtg gacccagcca ggaagcagac
ctgggcagtt gtagcctcta 17520tctccttccc cctctgccct tgagtctaga aggaagcccg
ctcccaggag aaaaatagcg 17580cccaggctac acatgcctag tttattcttc ccaaatcaac
cagttgcata aatcactcct 17640ctatcttcct tggggtggaa agtggatggg agttataatt
tgagttctct tttgtcttag 17700tccattgaag ctgctattac aaaataccat aaactgggtg
gcttataaac agcagaaatg 17760aggccgggtg cggtggctca tgcctataat tccagcactt
tgggaggcca aggcaggtgg 17820atcacctgag atcagtagtt caagactagc ctgaccaaca
tggtgaaacc ctgtctctac 17880taaaaataca aaaaattagc tgggggtggt ggcgggcacc
tgtaatccca gctactcagg 17940aggctgaggc aggagaatcg cttgaaccca ggaggcggag
gttgccgtga gctgagatca 18000cgccattgca tttcagcctg ggcaacaaga gtgaaactcc
atctcaaaat gaaataaaat 18060aacagaaatg tatttcttaa cagttctgga ggttgggtgg
gcagtcccag atcaggacac 18120tgacagattc agtgtctgat gggggcccac tttctggtgt
tacctgctgg ctgtgttctc 18180acatggtgga aggaacatgg caactttctg gggccttgtt
ttttaattta aaaaaaaaaa 18240atattttcct ggcccttgcc tgctgaagga acctctttta
taatggtact taaaaatttt 18300tttttttgag atgggggtct cactctgtca cccacgctga
gtgcagtatc acaatctcag 18360ctcactgcaa cctctgcctc cctggcttaa gcgatcctcc
cacctcagcc tcctgagtac 18420gtgtgaccat aggcccatgg cacaaagccc agctaatttt
ttgtattttt agtagaaatg 18480tggtttcacc atgttgcata ggctggtctc gaacttctga
actcaagtga tctgcctgcc 18540ttggcctccc aaagtgctgg gattctaggt atgagccacc
ctgctcggcc tataatggca 18600ctttcctatc ccattgatga ggctctactc tcatgaccta
atcatctccc aaaggcccta 18660aggcctcctg ataccatcac ctttggggtt aggttttaac
atatacattt tggggggaca 18720cagacatttt agaccatagc acctccattg aaaggaaaca
tttctgacac ctggctatct 18780caaagggccc tttcagttcc cctgcaggct gcattcccac
atcaccaaca agagcagcga 18840cactcactca gaggttaaat aacttgtcca gagtcacagc
agtaatgaat gacagagctg 18900gggcttgaat ccaggcgtcc tcctagagcc tggattctgt
gtagtgagtg aaagctgact 18960cctgggagac ttctgcgtgg tcctggttct ctctccagac
tgcactgcgc aagtttctct 19020tcctgatggt ccctagggta ttacaaagac agtggccctg
cctgtcaggt gtttttatta 19080ccagatgagg tcatggcctc aggaaccctg tgggaagctg
agttcagagt ctttgagcag 19140gctttaggga ggttccagct tcccaccacc aagccccagg
tggattctta cagactctag 19200cctcagggtg gggggtctgg aagatgaggt tgcggggtgc
gatattctgc ccaattcgcc 19260cctccttgct caatctgttt ctgcaggtat tgctgactac
agacccaagg atggagaaac 19320cattgagctg aggctggtta gctggtagcc cctgagctcc
ctcatcccag cagcctcgca 19380cactccctag gcttctaccc tccctcctga tgtccctgga
acaggaactc gcctgaccct 19440gctgccacct cctgtgcact ttgagcaatg ccccctggga
tcaccccagc cacaagccct 19500tcgagggccc tataccatgg cccaccttgg agcagagagc
caagcatctt ccctgggaag 19560tctttctggc caagtctggc cagcctggcc ctgcaggtct
cccatgaagg ccaccccatg 19620gtctgatggg catgaagcat ctcagactcc ttggcaaaaa
acggagtccg caggccgcag 19680gtgttgtgaa gaccactcgt tctgtggttg gggtcctgca
agaaggcctc ctcagcccgg 19740gggctatggc cctgacccca gctctccact ctgctgttag
agtggcagct ccgagctggt 19800tgtggcacag tagctgggga gacctcagca gggctgctca
gtgcctgcct ctgacaaaat 19860taaagcattg atggcctgtg gacctgc
198871920DNAHomo sapiens 19cactccagtg tttgtccatg
202019DNAHomo sapiens
20gcatcttgag agccctgac
192120DNAHomo sapiens 21ttctctttct tagccccacg
202221DNAHomo sapiens 22agagcttgca gtgagcctag a
212322DNAHomo sapiens
23ccatcgtcag agaagtcatt ca
222420DNAHomo sapiens 24ctggttgatt tcctgcatca
202522DNAHomo sapiens 25ggtctttgga agctgctcta ca
222621DNAHomo sapiens
26ttgcagtgag cctagatcac g
212720DNAHomo sapiens 27gatcacaccc acccctcttg
202821DNAHomo sapiens 28cctcctttca ctccaaacgt c
21
User Contributions:
Comment about this patent or add new information about this topic:
People who visited this patent also read: | |
Patent application number | Title |
---|---|
20110174856 | TANK HOLDING MECHANISM FOR GAS TANK AND VEHICLE IN WHICH GAS TANK IS MOUNTED USING THE SAME |
20110174830 | DRINKING CUP |
20110174829 | CONTAINER FOR STORING MOTOR VEHICLE FLUID |
20110174826 | COOKING ITEM COMPRISING A NON-STICK COATING WITH IMPROVED PROPERTIES OF ADHESION TO THE SUBSTRATE |
20110174825 | LIQUID STORAGE SYSTEM |