Inventors list |
Assignees list |
Classification tree browser |
Top 100 Inventors |
Top 100 Assignees |
Patent application title: GALNTS AS MODIFIERS OF THE IGFR PATHWAY AND METHODS OF USE
Inventors:
Lori S. Friedman (San Carlos, CA, US)
Helen Francis-Lang (San Francisco, CA, US)
Annette L. Parks (Newton, MA, US)
Kenneth James Shaw (Brisbane, CA, US)
Timothy S. Heuer (El Granada, CA, US)
Lynn Margaret Bjerke (Surrey, GB)
Assignees:
Exelixis, Inc.
IPC8 Class: AA61K39395FI
USPC Class:
4241301
Class name: IMMUNOGLOBULIN, ANTISERUM, ANTIBODY, OR ANTIBODY FRAGMENT, EXCEPT CONJUGATE OR COMPLEX OF THE SAME WITH NONIMMUNOGLOBULIN MATERIAL
Publication date: 01/29/2009
Patent application number: 20090028844
Sign up to receive free email alerts when patent applications with chosen keywords are published SIGN UP
Abstract:
Human MIGFR genes are identified as modulators of the IGFR pathway, and
thus are therapeutic targets for disorders associated with defective IGFR
function. Methods for identifying modulators of IGFR, comprising
screening for agents that modulate the activity of MIGFR are provided.Claims:
1. A method of identifying a candidate IGFR pathway modulating agent, said
method comprising the steps of:(a) providing an assay system comprising a
GALNT polypeptide or nucleic acid;(b) contacting the assay system with a
test agent under conditions whereby, but for the presence of the test
agent, the system provides a reference activity; and(c) detecting a test
agent-biased activity of the assay system, wherein a difference between
the test agent-biased activity and the reference activity identifies the
test agent as a candidate IGFR pathway modulating agent.
2. The method of claim 1 wherein the assay system comprises cultured cells that express the GALNT polypeptide.
3. The method of claim 2 wherein the cultured cells additionally have defective IGFR function.
4. The method of claim 1 wherein the assay system includes a screening assay comprising a GALNT polypeptide, and the candidate test agent is a small molecule modulator.
5. The method of claim 4 wherein the assay is a transferase assay.
6. The method of claim 1 wherein the assay system is selected from the group consisting of an apoptosis assay system, a cell proliferation assay system, an angiogenesis assay system, and a hypoxic induction assay system.
7. The method of claim 1 wherein the assay system includes a binding assay comprising a GALNT polypeptide and the candidate test agent is an antibody.
8. The method of claim 1 wherein the assay system includes an expression assay comprising a GALNT nucleic acid and the candidate test agent is a nucleic acid modulator.
9. The method of claim 8 wherein the nucleic acid modulator is an antisense oligomer.
10. The method of claim 8 wherein the nucleic acid modulator is a PMO.
11. The method of claim 1 additionally comprising:(d) administering the candidate IGFR pathway modulating agent identified in (c) to a model system comprising cells defective in IGFR function and, detecting a phenotypic change in the model system that indicates that the IGFR function is restored.
12. The method of claim 11 wherein the model system is a mouse model with defective IGFR function.
13. A method for modulating a IGFR pathway of a cell comprising contacting a cell defective in IGFR function with a candidate modulator that specifically binds to a GALNT polypeptide, whereby IGFR function is restored.
14. The method of claim 13 wherein the candidate modulator is administered to a vertebrate animal predetermined to have a disease or disorder resulting from a defect in IGFR function.
15. The method of claim 13 wherein the candidate modulator is selected from the group consisting of an antibody and a small molecule.
16. The method of claim 1, comprising the additional steps of:(d) providing a secondary assay system comprising cultured cells or a non-human animal expressing GALNT,(e) contacting the secondary assay system with the test agent of (b) or an agent derived therefrom under conditions whereby, but for the presence of the test agent or agent derived therefrom, the system provides a reference activity; and(f) detecting an agent-biased activity of the second assay system,wherein a difference between the agent-biased activity and the reference activity of the second assay system confirms the test agent or agent derived therefrom as a candidate IGFR pathway modulating agent,and wherein the second assay detects an agent-biased change in the IGFR pathway.
17. The method of claim 16 wherein the secondary assay system comprises cultured cells.
18. The method of claim 16 wherein the secondary assay system comprises a non-human animal.
19. The method of claim 18 wherein the non-human animal mis-expresses an IGFR pathway gene.
20. A method of modulating IGFR pathway in a mammalian cell comprising contacting the cell with an agent that specifically binds a GALNT polypeptide or nucleic acid.
21. The method of claim 20 wherein the agent is administered to a mammalian animal predetermined to have a pathology associated with the IGFR pathway.
22. The method of claim 20 wherein the agent is a small molecule modulator, a nucleic acid modulator, or an antibody.
23. A method for diagnosing a disease in a patient comprising:obtaining a biological sample from the patient;contacting the sample with a probe for GALNT expression;comparing results from step (b) with a control;determining whether step (c) indicates a likelihood of disease.
24. The method of claim 23 wherein said disease is cancer.
25. The method according to claim 24, wherein said cancer is a cancer as shown in Table 1 as having >25% expression level.
Description:
REFERENCE TO RELATED APPLICATIONS
[0001]This application claims priority to U.S. provisional patent application 60/581,696 filed Jun. 21, 2004. The contents of the prior application are hereby incorporated in their entirety.
BACKGROUND OF THE INVENTION
[0002]Somatic mutations in the PTEN (Phosphatase and Tensin homolog deleted on chromosome 10) gene are known to cause tumors in a variety of human tissues. In addition, germline mutations in PTEN are the cause of human diseases (Cowden disease and Bannayan-Zonana syndrome) associated with increased risk of breast and thyroid cancer (Nelen M R et al. (1997) Hum Mol Genet, 8:1383-1387; Liaw D et al. (1997) Nat Genet, 1:64-67; Marsh D J et al. (1998) Hum Mol Genet, 3:507-515). PTEN is thought to act as a tumor suppressor by regulating several signaling pathways through the second messenger phosphatidylinositol 3,4,5 triphosphate (PIP3). PTEN dephosphorylates the D3 position of PIP3 and downregulates signaling events dependent on PIP3 levels (Maehama T and Dixon J E (1998) J Biol Chem, 22, 13375-8). In particular, pro-survival pathways downstream of the insulin-like growth factor (IGF) pathway are regulated by PTEN activity. Stimulation of the IGF pathway, or loss of PTEN function, elevates PIP3 levels and activates pro-survival pathways associated with tumorigenesis (Stambolic V et al. (1998) Cell, 95:29-39). Consistent with this model, elevated levels of insulin-like growth factors I and II correlate with increased risk of cancer (Yu H et al (1999) J Natl Cancer Inst 91:151-156) and poor prognosis (Takanami I et al, 1996, J Surg Oncol 61(3):205-8). In addition, increased levels or activity of positive effectors of the IGF pathway, such as Akt and PI(3) kinase, have been implicated in several types of human cancer (Nicholson K M and Anderson N G (2002) Cellular Signalling, 14:381-395).
[0003]In Drosophila melanogaster, as in vertebrates, the Insulin Growth Factor Receptor (IGFR) pathway includes the positive effectors PI(3) kinase, Akt, and PDK and the inhibitor, PTEN. These proteins have been implicated in multiple processes, including the regulation of cell growth and size as well as cell division and survival (Oldham S and Hafen E. (2003) Trends Cell Biol. 13:79-85; Garafolo R S. (2002) Trends Endocr. Metab. 13:156-162; Backman S A et al. (2002) Curr. Op. Neurobio. 12:1-7; Tapon N et al. (2001) Curr Op. Cell Biol. 13:731-737). Activation of the pathway in Drosophila can result in increases in cell size, cell number and organ size (Oldham S et al. (2002) Dev. 129:4103-4109; Prober D A and Edgar B A. (2002) Genes & Dev. 16:2286-2299; Potter C J et al. (2001) Cell 105:357-368; Verdu J et al. (1999) Cell Biol. 1:500-506).
[0004]GALNT1 (UDP-N-ACETYL-ALPHA-D-GALACTOSAMINE:POLYPEPTIDE N-ACETYLGALACTOSAMINYLTRANSFERASE 1) transfers an N-acetyl galactosamine (GalNAc) to a serine or threonine residue in the first step of O-linked oligosaccharide biosynthesis. GALNT1 is a member of the UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase (GalNAc-T) family of enzymes. GalNAc-Ts initiate mucin-type O-linked glycosylation in the Golgi apparatus by catalyzing the transfer of GalNAc to serine and threonine residues on target proteins. They are characterized by an N-terminal transmembrane domain, a stem region, a lumenal catalytic domain containing a GT1 motif and Gal/GalNAc transferase motif, and a C-terminal ricin/lectin-like domain. GalNAc-Ts have different, but overlapping, substrate specificities and patterns of expression. Transcript variants derived from this gene that utilize alternative polyA signals have been described in the literature. GALNT13 (UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase 13; GalNAc-T13) is another member of the UDP-N-acetyl-alpha-D-galactosamine:polypeptide N-acetylgalactosaminyltransferase family, which initiate O-linked glycosylation of mucins by the initial transfer of N-acetylgalactosamine (GalNAc) with an alpha-linkage to a serine or threonine residue.
[0005]The ability to manipulate the genomes of model organisms such as Drosophila provides a powerful means to analyze biochemical processes that, due to significant evolutionary conservation, have direct relevance to more complex vertebrate organisms. Due to a high level of gene and pathway conservation, the strong similarity of cellular processes, and the functional conservation of genes between these model organisms and mammals, identification of the involvement of novel genes in particular pathways and their functions in such model organisms can directly contribute to the understanding of the correlative pathways and methods of modulating them in mammals (see, for example, Mechler B M et al., 1985 EMBO J 4:1551-1557; Gateff E. 1982 Adv. Cancer Res. 37: 33-74; Watson K L., et al., 1994 J Cell Sci. 18: 19-33; Miklos G L, and Rubin G M. 1996 Cell 86:521-529; Wassarman D A, et al., 1995 Curr Opin Gen Dev 5: 44-50; and Booth D R. 1999 Cancer Metastasis Rev. 18: 261-284). For example, a genetic screen can be carried out in an invertebrate model organism having underexpression (e.g. knockout) or overexpression of a gene (referred to as a "genetic entry point") that yields a visible phenotype. Additional genes are mutated in a random or targeted manner. When a gene mutation changes the original phenotype caused by the mutation in the genetic entry point, the gene is identified as a "modifier" involved in the same or overlapping pathway as the genetic entry point. When the genetic entry point is an ortholog of a human gene implicated in a disease pathway, such as IGFR, modifier genes can be identified that may be attractive candidate targets for novel therapeutics.
[0006]All references cited herein, including patents, patent applications, publications, and sequence information in referenced Genbank identifier numbers, are incorporated herein in their entireties.
SUMMARY OF THE INVENTION
[0007]We have discovered genes that modify the IGFR pathway in Drosophila, and identified their human orthologs, hereinafter referred to as UDP-N-ACETYL-ALPHA-D-GALACTOSAMINE:POLYPEPTIDE N-ACETYLGALACTOSAMINYLTRANSFERASE (GALNT). The invention provides methods for utilizing these IGFR modifier genes and polypeptides to identify GALNT-modulating agents that are candidate therapeutic agents that can be used in the treatment of disorders associated with defective or impaired IGFR function and/or GALNT function. Preferred GALNT-modulating agents specifically bind to GALNT polypeptides and restore IGFR function. Other preferred GALNT-modulating agents are nucleic acid modulators such as antisense oligomers and RNAI that repress GALNT gene expression or product activity by, for example, binding to and inhibiting the respective nucleic acid (i.e. DNA or mRNA).
[0008]GALNT modulating agents may be evaluated by any convenient in vitro or in vivo assay for molecular interaction with a GALNT polypeptide or nucleic acid. In one embodiment, candidate GALNT modulating agents are tested with an assay system comprising a GALNT polypeptide or nucleic acid. Agents that produce a change in the activity of the assay system relative to controls are identified as candidate IGFR modulating agents. The assay system may be cell-based or cell-free. GALNT-modulating agents include GALNT related proteins (e.g. dominant negative mutants, and biotherapeutics); GALNT-specific antibodies; GALNT-specific antisense oligomers and other nucleic acid modulators; and chemical agents that specifically bind to or interact with GALNT or compete with GALNT binding partner (e.g. by binding to a GALNT binding partner). In one specific embodiment, a small molecule modulator is identified using a transferase assay. In specific embodiments, the screening assay system is selected from a binding assay, an apoptosis assay, a cell proliferation assay, an angiogenesis assay, and a hypoxic induction assay.
[0009]In another embodiment, candidate IGFR pathway modulating agents are further tested using a second assay system that detects changes in the IGFR pathway, such as angiogenic, apoptotic, or cell proliferation changes produced by the originally identified candidate agent or an agent derived from the original agent. The second assay system may use cultured cells or non-human animals. In specific embodiments, the secondary assay system uses non-human animals, including animals predetermined to have a disease or disorder implicating the IGFR pathway, such as an angiogenic, apoptotic, or cell proliferation disorder (e.g. cancer).
[0010]The invention further provides methods for modulating the GALNT function and/or the IGFR pathway in a mammalian cell by contacting the mammalian cell with an agent that specifically binds a GALNT polypeptide or nucleic acid. The agent may be a small molecule modulator, a nucleic acid modulator, or an antibody and may be administered to a mammalian animal predetermined to have a pathology associated with the IGFR pathway.
DETAILED DESCRIPTION OF THE INVENTION
[0011]A dominant loss of function screen was carried out in Drosophila to identify genes that interact with or modulate the IGFR signaling pathway. Modifiers of the IGFR pathway and their orthologs were identified. The CG9152 gene was identified as a modifier of the IGFR pathway. Accordingly, vertebrate orthologs of this modifier, and preferably the human orthologs, GALNT genes (i.e., nucleic acids and polypeptides) are attractive drug targets for the treatment of pathologies associated with a defective IGFR signaling pathway, such as cancer.
[0012]In vitro and in vivo methods of assessing GALNT function are provided herein. Modulation of the GALNT or their respective binding partners is useful for understanding the association of the IGFR pathway and its members in normal and disease conditions and for developing diagnostics and therapeutic modalities for IGFR related pathologies. GALNT-modulating agents that act by inhibiting or enhancing GALNT expression, directly or indirectly, for example, by affecting a GALNT function such as enzymatic (e.g., catalytic) or binding activity, can be identified using methods provided herein. GALNT modulating agents are useful in diagnosis, therapy and pharmaceutical development.
[0013]Nucleic Acids and Polypeptides of the Invention
[0014]Sequences related to GALNT nucleic acids and polypeptides that can be used in the invention are disclosed in Genbank (referenced by Genbank identifier (GI) number) as GI#s 13124890 (SEQ ID NO:1), 1136284 (SEQ ID NO:2), 2292903 (SEQ ID NO:3), 1699343 (SEQ ID NO:4), 42741666 (SEQ ID NO:5), 20535693 (SEQ ID NO:6), 29730439 (SEQ ID NO:7), and 27530992 (SEQ ID NO:8) for nucleic acid, and GI#s 13124891 (SEQ ID NO:9) and 42741667 (SEQ ID NO:10) for polypeptides.
[0015]The term "GALNT polypeptide" refers to a full-length GALNT protein or a functionally active fragment or derivative thereof. A "functionally active" GALNT fragment or derivative exhibits one or more functional activities associated with a full-length, wild-type GALNT protein, such as antigenic or immunogenic activity, enzymatic activity, ability to bind natural cellular substrates, etc. The functional activity of GALNT proteins, derivatives and fragments can be assayed by various methods known to one skilled in the art (Current Protocols in Protein Science (1998) Coligan et al., eds., John Wiley & Sons, Inc., Somerset, N.J.) and as further discussed below. In one embodiment, a functionally active GALNT polypeptide is a GALNT derivative capable of rescuing defective endogenous GALNT activity, such as in cell based or animal assays; the rescuing derivative may be from the same or a different species. For purposes herein, functionally active fragments also include those fragments that comprise one or more structural domains of a GALNT, such as a binding domain. Protein domains can be identified using the PFAM program (Bateman A., et al., Nucleic Acids Res, 1999, 27:260-2). For example, the Ricin-type beta-trefoil lectin domain (PFAM 00652) of GALNT from GI#s13124891 and 42741667 (SEQ ID NOs:9 and 10, respectively) is located respectively at approximately amino acid residues 430 to 548 and 429 to 547. Likewise, the Glycosyl transferase domain of the same sequences is located respectively at approximately amino acid residues 119 to 303 and 118 to 302. Methods for obtaining GALNT polypeptides are also further described below. In some embodiments, preferred fragments are functionally active, domain-containing fragments comprising at least 25 contiguous amino acids, preferably at least 50, more preferably 75, and most preferably at least 100 contiguous amino acids of a GALNT. In further preferred embodiments, the fragment comprises the entire functionally active domain.
[0016]The term "GALNT nucleic acid" refers to a DNA or RNA molecule that encodes a GALNT polypeptide. Preferably, the GALNT polypeptide or nucleic acid or fragment thereof is from a human, but can also be an ortholog, or derivative thereof with at least 70% sequence identity, preferably at least 80%, more preferably 85%, still more preferably 90%, and most preferably at least 95% sequence identity with human GALNT. Methods of identifying orthlogs are known in the art. Normally, orthologs in different species retain the same function, due to presence of one or more protein motifs and/or 3-dimensional structures. Orthologs are generally identified by sequence homology analysis, such as BLAST analysis, usually using protein bait sequences. Sequences are assigned as a potential ortholog if the best hit sequence from the forward BLAST result retrieves the original query sequence in the reverse BLAST (Huynen M A and Bork P, Proc Natl Acad Sci (1998) 95:5849-5856; Huynen M A et al. Genome Research (2000) 10:1204-1210). Programs for multiple sequence alignment, such as CLUSTAL (Thompson J D et al, 1994, Nucleic Acids Res 22:4673-4680) may be used to highlight conserved regions and/or residues of orthologous proteins and to generate phylogenetic trees. In a phylogenetic tree representing multiple homologous sequences from diverse species (e.g., retrieved through BLAST analysis), orthologous sequences from two species generally appear closest on the tree with respect to all other sequences from these two species. Structural threading or other analysis of protein folding (e.g., using software by ProCeryon, Biosciences, Salzburg, Austria) may also identify potential orthologs. In evolution, when a gene duplication event follows speciation, a single gene in one species, such as Drosophila, may correspond to multiple genes (paralogs) in another, such as human. As used herein, the term "orthologs" encompasses paralogs. As used herein, "percent (%) sequence identity" with respect to a subject sequence, or a specified portion of a subject sequence, is defined as the percentage of nucleotides or amino acids in the candidate derivative sequence identical with the nucleotides or amino acids in the subject sequence (or specified portion thereof), after aligning the sequences and introducing gaps, if necessary to achieve the maximum percent sequence identity, as generated by the program WU-BLAST-2.0a19 (Altschul et al., J. Mol. Biol. (1997) 215:403-410) with all the search parameters set to default values. The HSP S and HSP S2 parameters are dynamic values and are established by the program itself depending upon the composition of the particular sequence and composition of the particular database against which the sequence of interest is being searched. A % identity value is determined by the number of matching identical nucleotides or amino acids divided by the sequence length for which the percent identity is being reported. "Percent (%) amino acid sequence similarity" is determined by doing the same calculation as for determining % amino acid sequence identity, but including conservative amino acid substitutions in addition to identical amino acids in the computation.
[0017]A conservative amino acid substitution is one in which an amino acid is substituted for another amino acid having similar properties such that the folding or activity of the protein is not significantly affected. Aromatic amino acids that can be substituted for each other are phenylalanine, tryptophan, and tyrosine; interchangeable hydrophobic amino acids are leucine, isoleucine, methionine, and valine; interchangeable polar amino acids are glutamine and asparagine; interchangeable basic amino acids are arginine, lysine and histidine; interchangeable acidic amino acids are aspartic acid and glutamic acid; and interchangeable small amino acids are alanine, serine, threonine, cysteine and glycine.
[0018]Alternatively, an alignment for nucleic acid sequences is provided by the local homology algorithm of Smith and Waterman (Smith and Waterman, 1981, Advances in Applied Mathematics 2:482-489; database: European Bioinformatics Institute; Smith and Waterman, 1981, J. of Molec. Biol., 147:195-197; Nicholas et al., 1998, "A Tutorial on Searching Sequence Databases and Sequence Scoring Methods" (www.psc.edu) and references cited therein; W. R. Pearson, 1991, Genomics 11:635-650). This algorithm can be applied to amino acid sequences by using the scoring matrix developed by Dayhoff (Dayhoff: Atlas of Protein Sequences and Structure, M. O. Dayhoff ed., 5 suppl. 3:353-358, National Biomedical Research Foundation, Washington, D.C., USA), and normalized by Gribskov (Gribskov 1986 Nucl. Acids Res. 14(6):6745-6763). The Smith-Waterman algorithm may be employed where default parameters are used for scoring (for example, gap open penalty of 12, gap extension penalty of two). From the data generated, the "Match" value reflects "sequence identity."
[0019]Derivative nucleic acid molecules of the subject nucleic acid molecules include sequences that hybridize to the nucleic acid sequence of a GALNT. The stringency of hybridization can be controlled by temperature, ionic strength, pH, and the presence of denaturing agents such as formamide during hybridization and washing. Conditions routinely used are set out in readily available procedure texts, (e.g., Current Protocol in Molecular Biology, Vol. 1, Chap. 2.10, John Wiley & Sons, Publishers (1994); Sambrook et al., Molecular Cloning, Cold Spring Harbor (1989)). In some embodiments, a nucleic acid molecule of the invention is capable of hybridizing to a nucleic acid molecule containing the nucleotide sequence of a GALNT under high stringency hybridization conditions that are: prehybridization of filters containing nucleic acid for 8 hours to overnight at 65° C. in a solution comprising 6× single strength citrate (SSC) (1×SSC is 0.15 M NaCl, 0.015 M Na citrate; pH 7.0), 5×Denhardt's solution, 0.05% sodium pyrophosphate and 100 μg/ml herring sperm DNA; hybridization for 18-20 hours at 65° C. in a solution containing 6×SSC, 1×Denhardt's solution, 100 μg/ml yeast tRNA and 0.05% sodium pyrophosphate; and washing of filters at 65° C. for 1 h in a solution containing 0.1×SSC and 0.1% SDS (sodium dodecyl sulfate).
[0020]In other embodiments, moderately stringent hybridization conditions are used that are: pretreatment of filters containing nucleic acid for 6 h at 40° C. in a solution containing 35% formamide, 5×SSC, 50 mM Tris-HCl (pH7.5), 5 mM EDTA, 0.1% PVP, 0.1% Ficoll, 1% BSA, and 500 μg/ml denatured salmon sperm DNA; hybridization for 18-20 h at 40° C. in a solution containing 35% formamide, 5×SSC, 50 mM Tris-HCl (pH7.5), 5 mM EDTA, 0.02% PVP, 0.02% Ficoll, 0.2% BSA, 100 μg/ml salmon sperm DNA, and 10% (wt/vol) dextran sulfate; followed by washing twice for 1 hour at 55° C. in a solution containing 2×SSC and 0.1% SDS.
[0021]Alternatively, low stringency conditions can be used that are: incubation for 8 hours to overnight at 37° C. in a solution comprising 20% formamide, 5×SSC, 50 mM sodium phosphate (pH 7.6), 5×Denhardt's solution, 10% dextran sulfate, and 20 μg/ml denatured sheared salmon sperm DNA; hybridization in the same buffer for 18 to 20 hours; and washing of filters in 1×SSC at about 37° C. for 1 hour.
[0022]Isolation, Production, Expression, and Mis-Expression of GALNT Nucleic Acids and Polypeptides
[0023]GALNT nucleic acids and polypeptides are useful for identifying and testing agents that modulate GALNT function and for other applications related to the involvement of GALNT in the IGFR pathway. GALNT nucleic acids and derivatives and orthologs thereof may be obtained using any available method. For instance, techniques for isolating cDNA or genomic DNA sequences of interest by screening DNA libraries or by using polymerase chain reaction (PCR) are well known in the art. In general, the particular use for the protein will dictate the particulars of expression, production, and purification methods. For instance, production of proteins for use in screening for modulating agents may require methods that preserve specific biological activities of these proteins, whereas production of proteins for antibody generation may require structural integrity of particular epitopes. Expression of proteins to be purified for screening or antibody production may require the addition of specific tags (e.g., generation of fusion proteins). Overexpression of a GALNT protein for assays used to assess GALNT function, such as involvement in cell cycle regulation or hypoxic response, may require expression in eukaryotic cell lines capable of these cellular activities. Techniques for the expression, production, and purification of proteins are well known in the art; any suitable means therefore may be used (e.g., Higgins S J and Hames B D (eds.) Protein Expression: A Practical Approach, Oxford University Press Inc., New York 1999; Stanbury P F et al., Principles of Fermentation Technology, 2nd edition, Elsevier Science, New York, 1995; Doonan S (ed.) Protein Purification Protocols, Humana Press, New Jersey, 1996; Coligan J E et al, Current Protocols in Protein Science (eds.), 1999, John Wiley & Sons, New York). In particular embodiments, recombinant GALNT is expressed in a cell line known to have defective IGFR function. The recombinant cells are used in cell-based screening assay systems of the invention, as described further below.
[0024]The nucleotide sequence encoding a GALNT polypeptide can be inserted into any appropriate expression vector. The necessary transcriptional and translational signals, including promoter/enhancer element, can derive from the native GALNT gene and/or its flanking regions or can be heterologous. A variety of host-vector expression systems may be utilized, such as mammalian cell systems infected with virus (e.g. vaccinia virus, adenovirus, etc.); insect cell systems infected with virus (e.g. baculovirus); microorganisms such as yeast containing yeast vectors, or bacteria transformed with bacteriophage, plasmid, or cosmid DNA. An isolated host cell strain that modulates the expression of, modifies, and/or specifically processes the gene product may be used.
[0025]To detect expression of the GALNT gene product, the expression vector can comprise a promoter operably linked to a GALNT gene nucleic acid, one or more origins of replication, and, one or more selectable markers (e.g. thymidine kinase activity, resistance to antibiotics, etc.). Alternatively, recombinant expression vectors can be identified by assaying for the expression of the GALNT gene product based on the physical or functional properties of the GALNT protein in in vitro assay systems (e.g. immunoassays).
[0026]The GALNT protein, fragment, or derivative may be optionally expressed as a fusion, or chimeric protein product (i.e. it is joined via a peptide bond to a heterologous protein sequence of a different protein), for example to facilitate purification or detection. A chimeric product can be made by ligating the appropriate nucleic acid sequences encoding the desired amino acid sequences to each other using standard methods and expressing the chimeric product. A chimeric product may also be made by protein synthetic techniques, e.g. by use of a peptide synthesizer (Hunkapiller et al., Nature (1984) 310:105-111).
[0027]Once a recombinant cell that expresses the GALNT gene sequence is identified, the gene product can be isolated and purified using standard methods (e.g. ion exchange, affinity, and gel exclusion chromatography; centrifugation; differential solubility; electrophoresis). Alternatively, native GALNT proteins can be purified from natural sources, by standard methods (e.g. immunoaffinity purification). Once a protein is obtained, it may be quantified and its activity measured by appropriate methods, such as immunoassay, bioassay, or other measurements of physical properties, such as crystallography.
[0028]The methods of this invention may also use cells that have been engineered for altered expression (mis-expression) of GALNT or other genes associated with the IGFR pathway. As used herein, mis-expression encompasses ectopic expression, over-expression, under-expression, and non-expression (e.g. by gene knock-out or blocking expression that would otherwise normally occur).
[0029]Genetically Modified Animals
[0030]Animal models that have been genetically modified to alter GALNT expression may be used in in vivo assays to test for activity of a candidate IGFR modulating agent, or to further assess the role of GALNT in a IGFR pathway process such as apoptosis or cell proliferation. Preferably, the altered GALNT expression results in a detectable phenotype, such as decreased or increased levels of cell proliferation, angiogenesis, or apoptosis compared to control animals having normal GALNT expression. The genetically modified animal may additionally have altered IGFR expression (e.g. IGFR knockout). Preferred genetically modified animals are mammals such as primates, rodents (preferably mice or rats), among others. Preferred non-mammalian species include zebrafish, C. elegans, and Drosophila. Preferred genetically modified animals are transgenic animals having a heterologous nucleic acid sequence present as an extrachromosomal element in a portion of its cells, i.e. mosaic animals (see, for example, techniques described by Jakobovits, 1994, Curr. Biol. 4:761-763.) or stably integrated into its germ line DNA (i.e., in the genomic sequence of most or all of its cells). Heterologous nucleic acid is introduced into the germ line of such transgenic animals by genetic manipulation of, for example, embryos or embryonic stem cells of the host animal.
[0031]Methods of making transgenic animals are well-known in the art (for transgenic mice see Brinster et al., Proc. Nat. Acad. Sci. USA 82: 4438-4442 (1985), U.S. Pat. Nos. 4,736,866 and 4,870,009, both by Leder et al., U.S. Pat. No. 4,873,191 by Wagner et al., and Hogan, B., Manipulating the Mouse Embryo, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., (1986); for particle bombardment see U.S. Pat. No. 4,945,050, by Sandford et al.; for transgenic Drosophila see Rubin and Spradling, Science (1982) 218:348-53 and U.S. Pat. No. 4,670,388; for transgenic insects see Berghammer A. J. et al., A Universal Marker for Transgenic Insects (1999) Nature 402:370-371; for transgenic Zebrafish see Lin S., Transgenic Zebrafish, Methods Mol Biol. (2000); 136:375-3830); for microinjection procedures for fish, amphibian eggs and birds see Houdebine and Chourrout, Experientia (1991) 47:897-905; for transgenic rats see Hammer et al., Cell (1990) 63:1099-1112; and for culturing of embryonic stem (ES) cells and the subsequent production of transgenic animals by the introduction of DNA into ES cells using methods such as electroporation, calcium phosphate/DNA precipitation and direct injection see, e.g., Teratocarcinomas and Embryonic Stem Cells, A Practical Approach, E. J. Robertson, ed., IRL Press (1987)). Clones of the nonhuman transgenic animals can be produced according to available methods (see Wilmut, I. et al. (1997) Nature 385:810-813; and PCT International Publication Nos. WO 97/07668 and WO 97/07669).
[0032]In one embodiment, the transgenic animal is a "knock-out" animal having a heterozygous or homozygous alteration in the sequence of an endogenous GALNT gene that results in a decrease of GALNT function, preferably such that GALNT expression is undetectable or insignificant. Knock-out animals are typically generated by homologous recombination with a vector comprising a transgene having at least a portion of the gene to be knocked out. Typically a deletion, addition or substitution has been introduced into the transgene to functionally disrupt it. The transgene can be a human gene (e.g., from a human genomic clone) but more preferably is an ortholog of the human gene derived from the transgenic host species. For example, a mouse GALNT gene is used to construct a homologous recombination vector suitable for altering an endogenous GALNT gene in the mouse genome. Detailed methodologies for homologous recombination in mice are available (see Capecchi, Science (1989) 244:1288-1292; Joyner et al., Nature (1989) 338:153-156). Procedures for the production of non-rodent transgenic mammals and other animals are also available (Houdebine and Chourrout, supra; Pursel et al., Science (1989) 244:1281-1288; Simms et al., Bio/Technology (1988) 6:179-183). In a preferred embodiment, knock-out animals, such as mice harboring a knockout of a specific gene, may be used to produce antibodies against the human counterpart of the gene that has been knocked out (Claesson M H et al., (1994) Scan J Immunol 40:257-264; Declerck P J et al., (1995) J Biol Chem. 270:8397-400).
[0033]In another embodiment, the transgenic animal is a "knock-in" animal having an alteration in its genome that results in altered expression (e.g., increased (including ectopic) or decreased expression) of the GALNT gene, e.g., by introduction of additional copies of GALNT, or by operatively inserting a regulatory sequence that provides for altered expression of an endogenous copy of the GALNT gene. Such regulatory sequences include inducible, tissue-specific, and constitutive promoters and enhancer elements. The knock-in can be homozygous or heterozygous.
[0034]Transgenic nonhuman animals can also be produced that contain selected systems allowing for regulated expression of the transgene. One example of such a system that may be produced is the cre/loxP recombinase system of bacteriophage P1 (Lakso et al., PNAS (1992) 89:6232-6236; U.S. Pat. No. 4,959,317). If a cre/loxP recombinase system is used to regulate expression of the transgene, animals containing transgenes encoding both the Cre recombinase and a selected protein are required. Such animals can be provided through the construction of "double" transgenic animals, e.g., by mating two transgenic animals, one containing a transgene encoding a selected protein and the other containing a transgene encoding a recombinase. Another example of a recombinase system is the FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et al. (1991) Science 251:1351-1355; U.S. Pat. No. 5,654,182). In a preferred embodiment, both Cre-LoxP and Flp-Frt are used in the same system to regulate expression of the transgene, and for sequential deletion of vector sequences in the same cell (Sun X et al (2000) Nat Genet 25:83-6).
[0035]The genetically modified animals can be used in genetic studies to further elucidate the IGFR pathway, as animal models of disease and disorders implicating defective IGFR function, and for in vivo testing of candidate therapeutic agents, such as those identified in screens described below. The candidate therapeutic agents are administered to a genetically modified animal having altered GALNT function and phenotypic changes are compared with appropriate control animals such as genetically modified animals that receive placebo treatment, and/or animals with unaltered GALNT expression that receive candidate therapeutic agent.
[0036]In addition to the above-described genetically modified animals having altered GALNT function, animal models having defective IGFR function (and otherwise normal GALNT function), can be used in the methods of the present invention. For example, a IGFR knockout mouse can be used to assess, in vivo, the activity of a candidate IGFR modulating agent identified in one of the in vitro assays described below. Preferably, the candidate IGFR modulating agent when administered to a model system with cells defective in IGFR function, produces a detectable phenotypic change in the model system indicating that the IGFR function is restored, i.e., the cells exhibit normal cell cycle progression.
[0037]Modulating Agents
[0038]The invention provides methods to identify agents that interact with and/or modulate the function of GALNT and/or the IGFR pathway. Modulating agents identified by the methods are also part of the invention. Such agents are useful in a variety of diagnostic and therapeutic applications associated with the IGFR pathway, as well as in further analysis of the GALNT protein and its contribution to the IGFR pathway. Accordingly, the invention also provides methods for modulating the IGFR pathway comprising the step of specifically modulating GALNT activity by administering a GALNT-interacting or -modulating agent.
[0039]As used herein, a "GALNT-modulating agent" is any agent that modulates GALNT function, for example, an agent that interacts with GALNT to inhibit or enhance GALNT activity or otherwise affect normal GALNT function. GALNT function can be affected at any level, including transcription, protein expression, protein localization, and cellular or extra-cellular activity. In a preferred embodiment, the GALNT-modulating agent specifically modulates the function of the GALNT. The phrases "specific modulating agent", "specifically modulates", etc., are used herein to refer to modulating agents that directly bind to the GALNT polypeptide or nucleic acid, and preferably inhibit, enhance, or otherwise alter, the function of the GALNT. These phrases also encompass modulating agents that alter the interaction of the GALNT with a binding partner, substrate, or cofactor (e.g. by binding to a binding partner of a GALNT, or to a protein/binding partner complex, and altering GALNT function). In a further preferred embodiment, the GALNT-modulating agent is a modulator of the IGFR pathway (e.g. it restores and/or upregulates IGFR function) and thus is also a IGFR-modulating agent.
[0040]Preferred GALNT-modulating agents include small molecule compounds; GALNT-interacting proteins, including antibodies and other biotherapeutics; and nucleic acid modulators such as antisense and RNA inhibitors. The modulating agents may be formulated in pharmaceutical compositions, for example, as compositions that may comprise other active ingredients, as in combination therapy, and/or suitable carriers or excipients. Techniques for formulation and administration of the compounds may be found in "Remington's Pharmaceutical Sciences" Mack Publishing Co., Easton, Pa., 19th edition.
[0041]Small Molecule Modulators
[0042]Small molecules are often preferred to modulate function of proteins with enzymatic function, and/or containing protein interaction domains. Chemical agents, referred to in the art as "small molecule" compounds are typically organic, non-peptide molecules, having a molecular weight up to 10,000, preferably up to 5,000, more preferably up to 1,000, and most preferably up to 500 daltons. This class of modulators includes chemically synthesized molecules, for instance, compounds from combinatorial chemical libraries. Synthetic compounds may be rationally designed or identified based on known or inferred properties of the GALNT protein or may be identified by screening compound libraries. Alternative appropriate modulators of this class are natural products, particularly secondary metabolites from organisms such as plants or fungi, which can also be identified by screening compound libraries for GALNT-modulating activity. Methods for generating and obtaining compounds are well known in the art (Schreiber S L, Science (2000) 151: 1964-1969; Radmann J and Gunther J, Science (2000) 151:1947-1948).
[0043]Small molecule modulators identified from screening assays, as described below, can be used as lead compounds from which candidate clinical compounds may be designed, optimized, and synthesized. Such clinical compounds may have utility in treating pathologies associated with the IGFR pathway. The activity of candidate small molecule modulating agents may be improved several-fold through iterative secondary functional validation, as further described below, structure determination, and candidate modulator modification and testing. Additionally, candidate clinical compounds are generated with specific regard to clinical and pharmacological properties. For example, the reagents may be derivatized and re-screened using in vitro and in vivo assays to optimize activity and minimize toxicity for pharmaceutical development.
[0044]Protein Modulators
[0045]Specific GALNT-interacting proteins are useful in a variety of diagnostic and therapeutic applications related to the IGFR pathway and related disorders, as well as in validation assays for other GALNT-modulating agents. In a preferred embodiment, GALNT-interacting proteins affect normal GALNT function, including transcription, protein expression, protein localization, and cellular or extra-cellular activity. In another embodiment, GALNT-interacting proteins are useful in detecting and providing information about the function of GALNT proteins, as is relevant to IGFR related disorders, such as cancer (e.g., for diagnostic means).
[0046]A GALNT-interacting protein may be endogenous, i.e. one that naturally interacts genetically or biochemically with a GALNT, such as a member of the GALNT pathway that modulates GALNT expression, localization, and/or activity. GALNT-modulators include dominant negative forms of GALNT-interacting proteins and of GALNT proteins themselves. Yeast two-hybrid and variant screens offer preferred methods for identifying endogenous GALNT-interacting proteins (Finley, R. L. et al. (1996) in DNA Cloning-Expression Systems: A Practical Approach, eds. Glover D. & Hames B. D (Oxford University Press, Oxford, England), pp. 169-203; Fashema S F et al., Gene. (2000) 250:1-14; Drees B L Curr Opin Chem Biol (1999) 3:64-70; Vidal M and Legrain P Nucleic Acids Res (1999) 27:919-29; and U.S. Pat. No. 5,928,868). Mass spectrometry is an alternative preferred method for the elucidation of protein complexes (reviewed in, e.g., Pandley A and Mann M, Nature (2000) 405:837-846; Yates JR 3rd, Trends Genet (2000) 16:5-8).
[0047]A GALNT-interacting protein may be an exogenous protein, such as a GALNT-specific antibody or a T-cell antigen receptor (see, e.g., Harlow and Lane (1988) Antibodies, A Laboratory Manual, Cold Spring Harbor Laboratory; Harlow and Lane (1999) Using antibodies: a laboratory manual. Cold Spring Harbor, N.Y.: Cold Spring Harbor Laboratory Press). GALNT antibodies are further discussed below.
[0048]In preferred embodiments, a GALNT-interacting protein specifically binds a GALNT protein. In alternative preferred embodiments, a GALNT-modulating agent binds a GALNT substrate, binding partner, or cofactor.
[0049]Antibodies
[0050]In another embodiment, the protein modulator is a GALNT specific antibody agonist or antagonist. The antibodies have therapeutic and diagnostic utilities, and can be used in screening assays to identify GALNT modulators. The antibodies can also be used in dissecting the portions of the GALNT pathway responsible for various cellular responses and in the general processing and maturation of the GALNT.
[0051]Antibodies that specifically bind GALNT polypeptides can be generated using known methods. Preferably the antibody is specific to a mammalian ortholog of GALNT polypeptide, and more preferably, to human GALNT. Antibodies may be polyclonal, monoclonal (mAbs), humanized or chimeric antibodies, single chain antibodies, Fab fragments, F(ab')2 fragments, fragments produced by a FAb expression library, anti-idiotypic (anti-Id) antibodies, and epitope-binding fragments of any of the above. Epitopes of GALNT which are particularly antigenic can be selected, for example, by routine screening of GALNT polypeptides for antigenicity or by applying a theoretical method for selecting antigenic regions of a protein (Hopp and Wood (1981), Proc. Nati. Acad. Sci. U.S.A. 78:3824-28; Hopp and Wood, (1983) Mol. Immunol. 20:483-89; Sutcliffe et al., (1983) Science 219:660-66) to the amino acid sequence of a GALNT. Monoclonal antibodies with affinities of 108 M-1 preferably 109 M-1 to 1010 M-1, or stronger can be made by standard procedures as described (Harlow and Lane, supra; Goding (1986) Monoclonal Antibodies: Principles and Practice (2d ed) Academic Press, New York; and U.S. Pat. Nos. 4,381,292; 4,451,570; and 4,618,577). Antibodies may be generated against crude cell extracts of GALNT or substantially purified fragments thereof. If GALNT fragments are used, they preferably comprise at least 10, and more preferably, at least 20 contiguous amino acids of a GALNT protein. In a particular embodiment, GALNT-specific antigens and/or immunogens are coupled to carrier proteins that stimulate the immune response. For example, the subject polypeptides are covalently coupled to the keyhole limpet hemocyanin (KLH) carrier, and the conjugate is emulsified in Freund's complete adjuvant, which enhances the immune response. An appropriate immune system such as a laboratory rabbit or mouse is immunized according to conventional protocols.
[0052]The presence of GALNT-specific antibodies is assayed by an appropriate assay such as a solid phase enzyme-linked immunosorbant assay (ELISA) using immobilized corresponding GALNT polypeptides. Other assays, such as radioimmunoassays or fluorescent assays might also be used.
[0053]Chimeric antibodies specific to GALNT polypeptides can be made that contain different portions from different animal species. For instance, a human immunoglobulin constant region may be linked to a variable region of a murine mAb, such that the antibody derives its biological activity from the human antibody, and its binding specificity from the murine fragment. Chimeric antibodies are produced by splicing together genes that encode the appropriate regions from each species (Morrison et al., Proc. Natl. Acad. Sci. (1984) 81:6851-6855; Neuberger et al., Nature (1984) 312:604-608; Takeda et al., Nature (1985) 31:452-454). Humanized antibodies, which are a form of chimeric antibodies, can be generated by grafting complementary-determining regions (CDRs) (Carlos, T. M., J. M. Harlan. 1994. Blood 84:2068-2101) of mouse antibodies into a background of human framework regions and constant regions by recombinant DNA technology (Riechmann L M, et al., 1988 Nature 323: 323-327). Humanized antibodies contain ˜10% murine sequences and ˜90% human sequences, and thus further reduce or eliminate immunogenicity, while retaining the antibody specificities (Co MS, and Queen C. 1991 Nature 351: 501-501; Morrison S L. 1992 Ann. Rev. Immun. 10:239-265). Humanized antibodies and methods of their production are well-known in the art (U.S. Pat. Nos. 5,530,101, 5,585,089, 5,693,762, and 6,180,370).
[0054]GALNT-specific single chain antibodies which are recombinant, single chain polypeptides formed by linking the heavy and light chain fragments of the Fv regions via an amino acid bridge, can be produced by methods known in the art (U.S. Pat. No. 4,946,778; Bird, Science (1988) 242:423-426; Huston et al., Proc. Natl. Acad. Sci. USA (1988) 85:5879-5883; and Ward et al., Nature (1989) 334:544-546).
[0055]Other suitable techniques for antibody production involve in vitro exposure of lymphocytes to the antigenic polypeptides or alternatively to selection of libraries of antibodies in phage or similar vectors (Huse et al., Science (1989) 246:1275-1281). As used herein, T-cell antigen receptors are included within the scope of antibody modulators (Harlow and Lane, 1988, supra).
[0056]The polypeptides and antibodies of the present invention may be used with or without modification. Frequently, antibodies will be labeled by joining, either covalently or non-covalently, a substance that provides for a detectable signal, or that is toxic to cells that express the targeted protein (Menard S, et al., Int J. Biol Markers (1989) 4:131-134). A wide variety of labels and conjugation techniques are known and are reported extensively in both the scientific and patent literature. Suitable labels include radionuclides, enzymes, substrates, cofactors, inhibitors, fluorescent moieties, fluorescent emitting lanthanide metals, chemiluminescent moieties, bioluminescent moieties, magnetic particles, and the like (U.S. Pat. Nos. 3,817,837; 3,850,752; 3,939,350; 3,996,345; 4,277,437; 4,275,149; and 4,366,241). Also, recombinant immunoglobulins may be produced (U.S. Pat. No. 4,816,567). Antibodies to cytoplasmic polypeptides may be delivered and reach their targets by conjugation with membrane-penetrating toxin proteins (U.S. Pat. No. 6,086,900).
[0057]When used therapeutically in a patient, the antibodies of the subject invention are typically administered parenterally, when possible at the target site, or intravenously. The therapeutically effective dose and dosage regimen is determined by clinical studies. Typically, the amount of antibody administered is in the range of about 0.1 mg/kg--to about 10 mg/kg of patient weight. For parenteral administration, the antibodies are formulated in a unit dosage injectable form (e.g., solution, suspension, emulsion) in association with a pharmaceutically acceptable vehicle. Such vehicles are inherently nontoxic and non-therapeutic. Examples are water, saline, Ringer's solution, dextrose solution, and 5% human serum albumin. Nonaqueous vehicles such as fixed oils, ethyl oleate, or liposome carriers may also be used. The vehicle may contain minor amounts of additives, such as buffers and preservatives, which enhance isotonicity and chemical stability or otherwise enhance therapeutic potential. The antibodies' concentrations in such vehicles are typically in the range of about 1 mg/ml to about 10 mg/ml. Immunotherapeutic methods are further described in the literature (U.S. Pat. No. 5,859,206; WO0073469).
[0058]Nucleic Acid Modulators
[0059]Other preferred GALNT-modulating agents comprise nucleic acid molecules, such as antisense oligomers or double stranded RNA (dsRNA), which generally inhibit GALNT activity. Preferred nucleic acid modulators interfere with the function of the GALNT nucleic acid such as DNA replication, transcription, translocation of the GALNT RNA to the site of protein translation, translation of protein from the GALNT RNA, splicing of the GALNT RNA to yield one or more mRNA species, or catalytic activity which may be engaged in or facilitated by the GALNT RNA.
[0060]In one embodiment, the antisense oligomer is an oligonucleotide that is sufficiently complementary to a GALNT mRNA to bind to and prevent translation, preferably by binding to the 5' untranslated region. GALNT-specific antisense oligonucleotides, preferably range from at least 6 to about 200 nucleotides. In some embodiments the oligonucleotide is preferably at least 10, 15, or 20 nucleotides in length. In other embodiments, the oligonucleotide is preferably less than 50, 40, or 30 nucleotides in length. The oligonucleotide can be DNA or RNA or a chimeric mixture or derivatives or modified versions thereof, single-stranded or double-stranded. The oligonucleotide can be modified at the base moiety, sugar moiety, or phosphate backbone. The oligonucleotide may include other appending groups such as peptides, agents that facilitate transport across the cell membrane, hybridization-triggered cleavage agents, and intercalating agents.
[0061]In another embodiment, the antisense oligomer is a phosphothioate morpholino oligomer (PMO). PMOs are assembled from four different morpholino subunits, each of which contain one of four genetic bases (A, C, G, or T) linked to a six-membered morpholine ring. Polymers of these subunits are joined by non-ionic phosphodiamidate intersubunit linkages. Details of how to make and use PMOs and other antisense oligomers are well known in the art (e.g. see WO99/18193; Probst J C, Antisense Oligodeoxynucleotide and Ribozyme Design, Methods. (2000) 22(3):271-281; Summerton J, and Weller D. 1997 Antisense Nucleic Acid Drug Dev.:7:187-95; U.S. Pat. No. 5,235,033; and U.S. Pat. No. 5,378,841).
[0062]Alternative preferred GALNT nucleic acid modulators are double-stranded RNA species mediating RNA interference (RNAi). RNAi is the process of sequence-specific, post-transcriptional gene silencing in animals and plants, initiated by double-stranded RNA (dsRNA) that is homologous in sequence to the silenced gene. Methods relating to the use of RNAi to silence genes in C. elegans, Drosophila, plants, and humans are known in the art (Fire A, et al., 1998 Nature 391:806-811; Fire, A. Trends Genet. 15, 358-363 (1999); Sharp, P. A. RNA interference 2001. Genes Dev. 15, 485-490 (2001); Hammond, S. M., et al., Nature Rev. Genet. 2, 110-1119 (2001); Tuschl, T. Chem. Biochem. 2, 239-245 (2001); Hamilton, A. et al., Science 286, 950-952 (1999); Hammond, S. M., et al., Nature 404, 293-296 (2000); Zamore, P. D., et al., Cell 101, 25-33 (2000); Bernstein, E., et al., Nature 409, 363-366 (2001); Elbashir, S. M., et al., Genes Dev. 15, 188-200 (2001); WO0129058; WO9932619; Elbashir S M, et al., 2001 Nature 411:494-498; Novina C D and Sharp P. 2004 Nature 430:161-164; Soutschek J et al 2004 Nature 432:173-178; Hsieh A C et al. (2004) NAR 32(3):893-901).
[0063]Nucleic acid modulators are commonly used as research reagents, diagnostics, and therapeutics. For example, antisense oligonucleotides, which are able to inhibit gene expression with exquisite specificity, are often used to elucidate the function of particular genes (see, for example, U.S. Pat. No. 6,165,790). Nucleic acid modulators are also used, for example, to distinguish between functions of various members of a biological pathway. For example, antisense oligomers have been employed as therapeutic moieties in the treatment of disease states in animals and man and have been demonstrated in numerous clinical trials to be safe and effective (Milligan J F, et al, Current Concepts in Antisense Drug Design, J Med Chem. (1993) 36:1923-1937; Tonkinson J L et al., Antisense Oligodeoxynucleotides as Clinical Therapeutic Agents, Cancer Invest. (1996) 14:54-65). Accordingly, in one aspect of the invention, a GALNT-specific nucleic acid modulator is used in an assay to further elucidate the role of the GALNT in the IGFR pathway, and/or its relationship to other members of the pathway. In another aspect of the invention, a GALNT-specific antisense oligomer is used as a therapeutic agent for treatment of IGFR-related disease states.
[0064]Assay Systems
[0065]The invention provides assay systems and screening methods for identifying specific modulators of GALNT activity. As used herein, an "assay system" encompasses all the components required for performing and analyzing results of an assay that detects and/or measures a particular event. In general, primary assays are used to identify or confirm a modulator's specific biochemical or molecular effect with respect to the GALNT nucleic acid or protein. In general, secondary assays further assess the activity of a GALNT modulating agent identified by a primary assay and may confirm that the modulating agent affects GALNT in a manner relevant to the IGFR pathway. In some cases, GALNT modulators will be directly tested in a secondary assay.
[0066]In a preferred embodiment, the screening method comprises contacting a suitable assay system comprising a GALNT polypeptide or nucleic acid with a candidate agent under conditions whereby, but for the presence of the agent, the system provides a reference activity (e.g. transferase activity), which is based on the particular molecular event the screening method detects. A statistically significant difference between the agent-biased activity and the reference activity indicates that the candidate agent modulates GALNT activity, and hence the IGFR pathway. The GALNT polypeptide or nucleic acid used in the assay may comprise any of the nucleic acids or polypeptides described above.
[0067]Primary Assays
[0068]The type of modulator tested generally determines the type of primary assay.
[0069]Primary Assays for Small Molecule Modulators
[0070]For small molecule modulators, screening assays are used to identify candidate modulators. Screening assays may be cell-based or may use a cell-free system that recreates or retains the relevant biochemical reaction of the target protein (reviewed in Sittampalam G S et al., Curr Opin Chem Biol (1997) 1:384-91 and accompanying references). As used herein the term "cell-based" refers to assays using live cells, dead cells, or a particular cellular fraction, such as a membrane, endoplasmic reticulum, or mitochondrial fraction. The term "cell free" encompasses assays using substantially purified protein (either endogenous or recombinantly produced), partially purified or crude cellular extracts. Screening assays may detect a variety of molecular events, including protein-DNA interactions, protein-protein interactions (e.g., receptor-ligand binding), transcriptional activity (e.g., using a reporter gene), enzymatic activity (e.g., via a property of the substrate), activity of second messengers, immunogenicity and changes in cellular morphology or other cellular characteristics. Appropriate screening assays may use a wide range of detection methods including fluorescent, radioactive, calorimetric, spectrophotometric, and amperometric methods, to provide a read-out for the particular molecular event detected.
[0071]Cell-based screening assays usually require systems for recombinant expression of GALNT and any auxiliary proteins demanded by the particular assay. Appropriate methods for generating recombinant proteins produce sufficient quantities of proteins that retain their relevant biological activities and are of sufficient purity to optimize activity and assure assay reproducibility. Yeast two-hybrid and variant screens, and mass spectrometry provide preferred methods for determining protein-protein interactions and elucidation of protein complexes. In certain applications, when GALNT-interacting proteins are used in screens to identify small molecule modulators, the binding specificity of the interacting protein to the GALNT protein may be assayed by various known methods such as substrate processing (e.g. ability of the candidate GALNT-specific binding agents to function as negative effectors in GALNT-expressing cells), binding equilibrium constants (usually at least about 107 M-1, preferably at least about 108M-1, more preferably at least about 109 M-1), and immunogenicity (e.g. ability to elicit GALNT specific antibody in a heterologous host such as a mouse, rat, goat or rabbit). For enzymes and receptors, binding may be assayed by, respectively, substrate and ligand processing.
[0072]The screening assay may measure a candidate agent's ability to specifically bind to or modulate activity of a GALNT polypeptide, a fusion protein thereof, or to cells or membranes bearing the polypeptide or fusion protein. The GALNT polypeptide can be full length or a fragment thereof that retains functional GALNT activity. The GALNT polypeptide may be fused to another polypeptide, such as a peptide tag for detection or anchoring, or to another tag. The GALNT polypeptide is preferably human GALNT, or is an ortholog or derivative thereof as described above. In a preferred embodiment, the screening assay detects candidate agent-based modulation of GALNT interaction with a binding target, such as an endogenous or exogenous protein or other substrate that has GALNT-specific binding activity, and can be used to assess normal GALNT gene function.
[0073]Suitable assay formats that may be adapted to screen for GALNT modulators are known in the art. Preferred screening assays are high throughput or ultra high throughput and thus provide automated, cost-effective means of screening compound libraries for lead compounds (Fernandes P B, Curr Opin Chem Biol (1998) 2:597-603; Sundberg S A, Curr Opin Biotechnol 2000, 11:47-53). In one preferred embodiment, screening assays uses fluorescence technologies, including fluorescence polarization, time-resolved fluorescence, and fluorescence resonance energy transfer. These systems offer means to monitor protein-protein or DNA-protein interactions in which the intensity of the signal emitted from dye-labeled molecules depends upon their interactions with partner molecules (e.g., Selvin P R, Nat Struct Biol (2000) 7:730-4; Fernandes P B, supra; Hertzberg R P and Pope A J, Curr Opin Chem Biol (2000) 4:445-451).
[0074]A variety of suitable assay systems may be used to identify candidate GALNT and IGFR pathway modulators (e.g. U.S. Pat. Nos. 5,550,019 and 6,133,437 (apoptosis assays); and U.S. Pat. Nos. 5,976,782, 6,225,118 and 6,444,434 (angiogenesis assays), among others). Specific preferred assays are described in more detail below.
[0075]Glycosyltransferases mediate changes in glycosylation patterns that, in turn, may affect the function of glycoproteins and/or glycolipids and, further downstream, processes of development, differentiation, transformation and cell-cell recognition. An assay for glycosyltransferase uses scintillation methods to measure the transfer of carbohydrate from radiolabeled sugar-nucleotide donor to a synthetic glycopolymer acceptor that is coupled to polyacrylamide and coated on plastic microtiter plates (Donovan R S et al., Glycoconj J (1999) 16:607-615).
[0076]Apoptosis assays. Apoptosis or programmed cell death is a suicide program is activated within the cell, leading to fragmentation of DNA, shrinkage of the cytoplasm, membrane changes and cell death. Apoptosis is mediated by proteolytic enzymes of the caspase family. Many of the altering parameters of a cell are measurable during apoptosis. Assays for apoptosis may be performed by terminal deoxynucleotidyl transferase-mediated digoxigenin-11-dUTP nick end labeling (TUNEL) assay. The TUNEL assay is used to measure nuclear DNA fragmentation characteristic of apoptosis (Lazebnik et al., 1994, Nature 371, 346), by following the incorporation of fluorescein-dUTP (Yonehara et al., 1989, J. Exp. Med. 169, 1747). Apoptosis may further be assayed by acridine orange staining of tissue culture cells (Lucas, R., et al., 1998, Blood 15:4730-41). Other cell-based apoptosis assays include the caspase-3/7 assay and the cell death nucleosome ELISA assay. The caspase 3/7 assay is based on the activation of the caspase cleavage activity as part of a cascade of events that occur during programmed cell death in many apoptotic pathways. In the caspase 3/7 assay (commercially available Apo-ONE® Homogeneous Caspase-3/7 assay from Promega, cat# 67790), lysis buffer and caspase substrate are mixed and added to cells. The caspase substrate becomes fluorescent when cleaved by active caspase 3/7. The nucleosome ELISA assay is a general cell death assay known to those skilled in the art, and available commercially (Roche, Cat# 1774425). This assay is a quantitative sandwich-enzyme-immunoassay which uses monoclonal antibodies directed against DNA and histones respectively, thus specifically determining amount of mono- and oligonucleosomes in the cytoplasmic fraction of cell lysates. Mono and oligonucleosomes are enriched in the cytoplasm during apoptosis due to the fact that DNA fragmentation occurs several hours before the plasma membrane breaks down, allowing for accumulation in the cytoplasm. Nucleosomes are not present in the cytoplasmic fraction of cells that are not undergoing apoptosis. The Phospho-histone H2B assay is another apoptosis assay, based on phosphorylation of histone H2B as a result of apoptosis. Fluorescent dyes that are associated with phosphohistone H2B may be used to measure the increase of phosphohistone H2B as a result of apoptosis. Apoptosis assays that simultaneously measure multiple parameters associated with apoptosis have also been developed. In such assays, various cellular parameters that can be associated with antibodies or fluorescent dyes, and that mark various stages of apoptosis are labeled, and the results are measured using instruments such as Cellomics® ArrayScan® HCS System. The measurable parameters and their markers include anti-active caspase-3 antibody which marks intermediate stage apoptosis, anti-PARP-p85 antibody (cleaved PARP) which marks late stage apoptosis, Hoechst labels which label the nucleus and are used to measure nuclear swelling as a measure of early apoptosis and nuclear condensation as a measure of late apoptosis, TOTO-3 fluorescent dye which labels DNA of dead cells with high cell membrane permeability, and anti-alpha-tubulin or F-actin labels, which assess cytoskeletal changes in cells and correlate well with TOTO-3 label.
[0077]An apoptosis assay system may comprise a cell that expresses a GALNT, and that optionally has defective IGFR function (e.g. IGFR is over-expressed or under-expressed relative to wild-type cells). A test agent can be added to the apoptosis assay system and changes in induction of apoptosis relative to controls where no test agent is added, identify candidate IGFR modulating agents. In some embodiments of the invention, an apoptosis assay may be used as a secondary assay to test a candidate IGFR modulating agents that is initially identified using a cell-free assay system. An apoptosis assay may also be used to test whether GALNT function plays a direct role in apoptosis. For example, an apoptosis assay may be performed on cells that over- or under-express GALNT relative to wild type cells. Differences in apoptotic response compared to wild type cells suggests that the GALNT plays a direct role in the apoptotic response. Apoptosis assays are described further in U.S. Pat. No. 6,133,437.
[0078]Cell proliferation and cell cycle assays. Cell proliferation may be assayed via bromodeoxyuridine (BRDU) incorporation. This assay identifies a cell population undergoing DNA synthesis by incorporation of BRDU into newly-synthesized DNA. Newly-synthesized DNA may then be detected using an anti-BRDU antibody (Hoshino et al., 1986, Int. J. Cancer 38, 369; Campana et al., 1988, J. Immunol. Meth. 107, 79), or by other means.
[0079]Cell proliferation is also assayed via phospho-histone H3 staining, which identifies a cell population undergoing mitosis by phosphorylation of histone H3. Phosphorylation of histone H3 at serine 10 is detected using an antibody specific to the phosphorylated form of the serine 10 residue of histone H3. (Chadlee, D. N. 1995, J. Biol. Chem 270:20098-105). Cell Proliferation may also be examined using [3H]-thymidine incorporation (Chen, J., 1996, Oncogene 13:1395-403; Jeoung, J., 1995, J. Biol. Chem. 270:18367-73). This assay allows for quantitative characterization of S-phase DNA syntheses. In this assay, cells synthesizing DNA will incorporate [3H]-thymidine into newly synthesized DNA. Incorporation can then be measured by standard techniques such as by counting of radioisotope in a scintillation counter (e.g., Beckman LS 3800 Liquid Scintillation Counter). Another proliferation assay uses the dye Alamar Blue (available from Biosource International), which fluoresces when reduced in living cells and provides an indirect measurement of cell number (Voytik-Harbin S L et al., 1998, In Vitro Cell Dev Biol Anim 34:239-46). Yet another proliferation assay, the MTS assay, is based on in vitro cytotoxicity assessment of industrial chemicals, and uses the soluble tetrazolium salt, MTS. MTS assays are commercially available, for example, the Promega CellTiter 96® AQueous Non-Radioactive Cell Proliferation Assay (Cat.# G5421).
[0080]Cell proliferation may also be assayed by colony formation in soft agar, or clonogenic survival assay (Sambrook et al., Molecular Cloning, Cold Spring Harbor (1989)). For example, cells transformed with GALNT are seeded in soft agar plates, and colonies are measured and counted after two weeks incubation.
[0081]Cell proliferation may also be assayed by measuring ATP levels as indicator of metabolically active cells. Such assays are commercially available, for example Cell Titer-Glo®, which is a luminescent homogeneous assay available from Promega.
[0082]Involvement of a gene in the cell cycle may be assayed by flow cytometry (Gray J W et al. (1986) Int J Radiat Biol Relat Stud Phys Chem Med 49:237-55). Cells transfected with a GALNT may be stained with propidium iodide and evaluated in a flow cytometer (available from Becton Dickinson), which indicates accumulation of cells in different stages of the cell cycle.
[0083]Involvement of a gene in cell cycle may also be assayed by FOXO nuclear translocation assays. The FOXO family of transcription factors are mediators of various cellular functions including cell cycle progression and cell death, and are negatively regulated by activation of the P13 kinase pathway. Akt phosphorylation of FOXO family members leads to FOXO sequestration in the cytoplasm and transcriptional inactivation (Medema, R. H et al (2000) Nature 404: 782-787). PTEN is a negative regulator of P13 kinase pathway. Activation of PTEN, or loss of PI3 kinase or AKT, prevents phosphorylation of FOXO, leading to accumulation of FOXO in the nucleus, transcriptional activation of FOXO regulated genes, and apoptosis. Alternatively, loss of PTEN leads to pathway activation and cell survival (Nakamura, N. et al (2000) Mol Cell Biol 20: 8969-8982). FOXO translocation into the cytoplasm is used in assays and screens to identify members and/or modulators of the PTEN pathway. FOXO translocation assays using GFP or luciferase as detection reagents are known in the art (e.g., Zhang X et al (2002) J Biol Chem 277:45276-45284; and Li et al (2003) Mol Cell Biol 23:104-118).
[0084]Accordingly, a cell proliferation or cell cycle assay system may comprise a cell that expresses a GALNT, and that optionally has defective IGFR function (e.g. IGFR is over-expressed or under-expressed relative to wild-type cells). A test agent can be added to the assay system and changes in cell proliferation or cell cycle relative to controls where no test agent is added, identify candidate IGFR modulating agents. In some embodiments of the invention, the cell proliferation or cell cycle assay may be used as a secondary assay to test a candidate IGFR modulating agents that is initially identified using another assay system such as a cell-free assay system. A cell proliferation assay may also be used to test whether GALNT function plays a direct role in cell proliferation or cell cycle. For example, a cell proliferation or cell cycle assay may be performed on cells that over- or under-express GALNT relative to wild type cells. Differences in proliferation or cell cycle compared to wild type cells suggests that the GALNT plays a direct role in cell proliferation or cell cycle.
[0085]Angiogenesis. Angiogenesis may be assayed using various human endothelial cell systems, such as umbilical vein, coronary artery, or dermal cells. Suitable assays include Alamar Blue based assays (available from Biosource International) to measure proliferation; migration assays using fluorescent molecules, such as the use of Becton Dickinson Falcon HTS FluoroBlock cell culture inserts to measure migration of cells through membranes in presence or absence of angiogenesis enhancer or suppressors; and tubule formation assays based on the formation of tubular structures by endothelial cells on Matrigel® (Becton Dickinson). Accordingly, an angiogenesis assay system may comprise a cell that expresses a GALNT, and that optionally has defective IGFR function (e.g. IGFR is over-expressed or under-expressed relative to wild-type cells). A test agent can be added to the angiogenesis assay system and changes in angiogenesis relative to controls where no test agent is added, identify candidate IGFR modulating agents. In some embodiments of the invention, the angiogenesis assay may be used as a secondary assay to test a candidate IGFR modulating agents that is initially identified using another assay system. An angiogenesis assay may also be used to test whether GALNT function plays a direct role in cell proliferation. For example, an angiogenesis assay may be performed on cells that over- or under-express GALNT relative to wild type cells. Differences in angiogenesis compared to wild type cells suggests that the GALNT plays a direct role in angiogenesis. U.S. Pat. Nos. 5,976,782, 6,225,118 and 6,444,434, among others, describe various angiogenesis assays.
[0086]Hypoxic induction. The alpha subunit of the transcription factor, hypoxia inducible factor-1 (HIF-1), is upregulated in tumor cells following exposure to hypoxia in vitro. Under hypoxic conditions, HIF-1 stimulates the expression of genes known to be important in tumour cell survival, such as those encoding glyolytic enzymes and VEGF. Induction of such genes by hypoxic conditions may be assayed by growing cells transfected with GALNT in hypoxic conditions (such as with 0.1% O2, 5% CO2, and balance N2, generated in a Napco 7001 incubator (Precision Scientific)) and normoxic conditions, followed by assessment of gene activity or expression by Taqman®. For example, a hypoxic induction assay system may comprise a cell that expresses a GALNT, and that optionally has defective IGFR function (e.g. IGFR is over-expressed or under-expressed relative to wild-type cells). A test agent can be added to the hypoxic induction assay system and changes in hypoxic response relative to controls where no test agent is added, identify candidate IGFR modulating agents. In some embodiments of the invention, the hypoxic induction assay may be used as a secondary assay to test a candidate IGFR modulating agents that is initially identified using another assay system. A hypoxic induction assay may also be used to test whether GALNT function plays a direct role in the hypoxic response. For example, a hypoxic induction assay may be performed on cells that over- or under-express GALNT relative to wild type cells. Differences in hypoxic response compared to wild type cells suggests that the GALNT plays a direct role in hypoxic induction.
[0087]Cell adhesion. Cell adhesion assays measure adhesion of cells to purified adhesion proteins, or adhesion of cells to each other, in presence or absence of candidate modulating agents. Cell-protein adhesion assays measure the ability of agents to modulate the adhesion of cells to purified proteins. For example, recombinant proteins are produced, diluted to 2.5 g/mL in PBS, and used to coat the wells of a microtiter plate. The wells used for negative control are not coated. Coated wells are then washed, blocked with 1% BSA, and washed again. Compounds are diluted to 2× final test concentration and added to the blocked, coated wells. Cells are then added to the wells, and the unbound cells are washed off. Retained cells are labeled directly on the plate by adding a membrane-permeable fluorescent dye, such as calcein-AM, and the signal is quantified in a fluorescent microplate reader.
[0088]Cell-cell adhesion assays measure the ability of agents to modulate binding of cell adhesion proteins with their native ligands. These assays use cells that naturally or recombinantly express the adhesion protein of choice. In an exemplary assay, cells expressing the cell adhesion protein are plated in wells of a multiwell plate. Cells expressing the ligand are labeled with a membrane-permeable fluorescent dye, such as BCECF, and allowed to adhere to the monolayers in the presence of candidate agents. Unbound cells are washed off, and bound cells are detected using a fluorescence plate reader.
[0089]High-throughput cell adhesion assays have also been described. In one such assay, small molecule ligands and peptides are bound to the surface of microscope slides using a microarray spotter, intact cells are then contacted with the slides, and unbound cells are washed off. In this assay, not only the binding specificity of the peptides and modulators against cell lines are determined, but also the functional cell signaling of attached cells using immunofluorescence techniques in situ on the microchip is measured (Falsey J R et al., Bioconjug Chem. 2001 May-June; 12(3):346-53).
Primary Assays for Antibody Modulators
[0090]For antibody modulators, appropriate primary assays test is a binding assay that tests the antibody's affinity to and specificity for the GALNT protein. Methods for testing antibody affinity and specificity are well known in the art (Harlow and Lane, 1988, 1999, supra). The enzyme-linked immunosorbant assay (ELISA) is a preferred method for detecting GALNT-specific antibodies; others include FACS assays, radioimmunoassays, and fluorescent assays.
[0091]In some cases, screening assays described for small molecule modulators may also be used to test antibody modulators.
Primary Assays for Nucleic Acid Modulators
[0092]For nucleic acid modulators, primary assays may test the ability of the nucleic acid modulator to inhibit or enhance GALNT gene expression, preferably mRNA expression. In general, expression analysis comprises comparing GALNT expression in like populations of cells (e.g., two pools of cells that endogenously or recombinantly express GALNT) in the presence and absence of the nucleic acid modulator. Methods for analyzing mRNA and protein expression are well known in the art. For instance, Northern blotting, slot blotting, ribonuclease protection, quantitative RT-PCR (e.g., using the TaqMan®, PE Applied Biosystems), or microarray analysis may be used to confirm that GALNT mRNA expression is reduced in cells treated with the nucleic acid modulator (e.g., Current Protocols in Molecular Biology (1994) Ausubel F M et al., eds., John Wiley & Sons, Inc., chapter 4; Freeman W M et al., Biotechniques (1999) 26:112-125; Kallioniemi O P, Ann Med 2001, 33:142-147; Blohm D H and Guiseppi-Elie, A Curr Opin Biotechnol 2001, 12:41-47). Protein expression may also be monitored. Proteins are most commonly detected with specific antibodies or antisera directed against either the GALNT protein or specific peptides. A variety of means including Western blotting, ELISA, or in situ detection, are available (Harlow E and Lane D, 1988 and 1999, supra).
[0093]In some cases, screening assays described for small molecule modulators, particularly in assay systems that involve GALNT mRNA expression, may also be used to test nucleic acid modulators.
Secondary Assays
[0094]Secondary assays may be used to further assess the activity of GALNT-modulating agent identified by any of the above methods to confirm that the modulating agent affects GALNT in a manner relevant to the IGFR pathway. As used herein, GALNT-modulating agents encompass candidate clinical compounds or other agents derived from previously identified modulating agent. Secondary assays can also be used to test the activity of a modulating agent on a particular genetic or biochemical pathway or to test the specificity of the modulating agent's interaction with GALNT.
[0095]Secondary assays generally compare like populations of cells or animals (e.g., two pools of cells or animals that endogenously or recombinantly express GALNT) in the presence and absence of the candidate modulator. In general, such assays test whether treatment of cells or animals with a candidate GALNT-modulating agent results in changes in the IGFR pathway in comparison to untreated (or mock- or placebo-treated) cells or animals. Certain assays use "sensitized genetic backgrounds", which, as used herein, describe cells or animals engineered for altered expression of genes in the IGFR or interacting pathways.
Cell-Based Assays
[0096]Cell based assays may detect endogenous IGFR pathway activity or may rely on recombinant expression of IGFR pathway components. Any of the aforementioned assays may be used in this cell-based format. Candidate modulators are typically added to the cell media but may also be injected into cells or delivered by any other efficacious means.
Animal Assays
[0097]A variety of non-human animal models of normal or defective IGFR pathway may be used to test candidate GALNT modulators. Models for defective IGFR pathway typically use genetically modified animals that have been engineered to mis-express (e.g., over-express or lack expression in) genes involved in the IGFR pathway. Assays generally require systemic delivery of the candidate modulators, such as by oral administration, injection, etc.
[0098]In a preferred embodiment, IGFR pathway activity is assessed by monitoring neovascularization and angiogenesis. Animal models with defective and normal IGFR are used to test the candidate modulator's affect on GALNT in Matrigel® assays. Matrigel® is an extract of basement membrane proteins, and is composed primarily of laminin, collagen IV, and heparin sulfate proteoglycan. It is provided as a sterile liquid at 4° C., but rapidly forms a solid gel at 37° C. Liquid Matrigel® is mixed with various angiogenic agents, such as bFGF and VEGF, or with human tumor cells which over-express the GALNT. The mixture is then injected subcutaneously (SC) into female athymic nude mice (Taconic, Germantown, N.Y.) to support an intense vascular response. Mice with Matrigel® pellets may be dosed via oral (PO), intraperitoneal (IP), or intravenous (IV) routes with the candidate modulator. Mice are euthanized 5-12 days post-injection, and the Matrigel® pellet is harvested for hemoglobin analysis (Sigma plasma hemoglobin kit). Hemoglobin content of the gel is found to correlate the degree of neovascularization in the gel.
[0099]In another preferred embodiment, the effect of the candidate modulator on GALNT is assessed via tumorigenicity assays. Tumor xenograft assays are known in the art (see, e.g., Ogawa K et al., 2000, Oncogene 19:6043-6052). Xenografts are typically implanted SC into female athymic mice, 6-7 week old, as single cell suspensions either from a pre-existing tumor or from in vitro culture. The tumors which express the GALNT endogenously are injected in the flank, 1×105 to 1×107 cells per mouse in a volume of 100 μL using a 27 gauge needle. Mice are then ear tagged and tumors are measured twice weekly. Candidate modulator treatment is initiated on the day the mean tumor weight reaches 100 mg. Candidate modulator is delivered IV, SC, IP, or PO by bolus administration. Depending upon the pharmacokinetics of each unique candidate modulator, dosing can be performed multiple times per day. The tumor weight is assessed by measuring perpendicular diameters with a caliper and calculated by multiplying the measurements of diameters in two dimensions. At the end of the experiment, the excised tumors may be utilized for biomarker identification or further analyses. For immunohistochemistry staining, xenograft tumors are fixed in 4% paraformaldehyde, 0.1M phosphate, pH 7.2, for 6 hours at 4° C., immersed in 30% sucrose in PBS, and rapidly frozen in isopentane cooled with liquid nitrogen.
[0100]In another preferred embodiment, tumorogenicity is monitored using a hollow fiber assay, which is described in U.S. Pat. No. 5,698,413. Briefly, the method comprises implanting into a laboratory animal a biocompatible, semi-permeable encapsulation device containing target cells, treating the laboratory animal with a candidate modulating agent, and evaluating the target cells for reaction to the candidate modulator. Implanted cells are generally human cells from a pre-existing tumor or a tumor cell line. After an appropriate period of time, generally around six days, the implanted samples are harvested for evaluation of the candidate modulator. Tumorogenicity and modulator efficacy may be evaluated by assaying the quantity of viable cells present in the macrocapsule, which can be determined by tests known in the art, for example, MTT dye conversion assay, neutral red dye uptake, trypan blue staining, viable cell counts, the number of colonies formed in soft agar, the capacity of the cells to recover and replicate in vitro, etc.
[0101]In another preferred embodiment, a tumorogenicity assay use a transgenic animal, usually a mouse, carrying a dominant oncogene or tumor suppressor gene knockout under the control of tissue specific regulatory sequences; these assays are generally referred to as transgenic tumor assays. In a preferred application, tumor development in the transgenic model is well characterized or is controlled. In an exemplary model, the "RIP1-Tag2" transgene, comprising the SV40 large T-antigen oncogene under control of the insulin gene regulatory regions is expressed in pancreatic beta cells and results in islet cell carcinomas (Hanahan D, 1985, Nature 315:115-122; Parangi S et al, 1996, Proc Natl Acad Sci USA 93: 2002-2007; Bergers G et al, 1999, Science 284:808-812). An "angiogenic switch," occurs at approximately five weeks, as normally quiescent capillaries in a subset of hyperproliferative islets become angiogenic. The RIP 1-TAG2 mice die by age 14 weeks. Candidate modulators may be administered at a variety of stages, including just prior to the angiogenic switch (e.g., for a model of tumor prevention), during the growth of small tumors (e.g., for a model of intervention), or during the growth of large and/or invasive tumors (e.g., for a model of regression). Tumorogenicity and modulator efficacy can be evaluating life-span extension and/or tumor characteristics, including number of tumors, tumor size, tumor morphology, vessel density, apoptotic index, etc.
Diagnostic and Therapeutic Uses
[0102]Specific GALNT-modulating agents are useful in a variety of diagnostic and therapeutic applications where disease or disease prognosis is related to defects in the IGFR pathway, such as angiogenic, apoptotic, or cell proliferation disorders. Accordingly, the invention also provides methods for modulating the IGFR pathway in a cell, preferably a cell pre-determined to have defective or impaired IGFR function (e.g. due to overexpression, underexpression, or misexpression of IGFR, or due to gene mutations), comprising the step of administering an agent to the cell that specifically modulates GALNT activity. Preferably, the modulating agent produces a detectable phenotypic change in the cell indicating that the IGFR function is restored. The phrase "function is restored", and equivalents, as used herein, means that the desired phenotype is achieved, or is brought closer to normal compared to untreated cells. For example, with restored IGFR function, cell proliferation and/or progression through cell cycle may normalize, or be brought closer to normal relative to untreated cells. The invention also provides methods for treating disorders or disease associated with impaired IGFR function by administering a therapeutically effective amount of a GALNT-modulating agent that modulates the IGFR pathway. The invention further provides methods for modulating GALNT function in a cell, preferably a cell pre-determined to have defective or impaired GALNT function, by administering a GALNT-modulating agent. Additionally, the invention provides a method for treating disorders or disease associated with impaired GALNT function by administering a therapeutically effective amount of a GALNT-modulating agent.
[0103]The discovery that GALNT is implicated in IGFR pathway provides for a variety of methods that can be employed for the diagnostic and prognostic evaluation of diseases and disorders involving defects in the IGFR pathway and for the identification of subjects having a predisposition to such diseases and disorders.
[0104]Various expression analysis methods can be used to diagnose whether GALNT expression occurs in a particular sample, including Northern blotting, slot blotting, ribonuclease protection, quantitative RT-PCR, and microarray analysis. (e.g., Current Protocols in Molecular Biology (1994) Ausubel F M et al., eds., John Wiley & Sons, Inc., chapter 4; Freeman W M et al., Biotechniques (1999) 26:112-125; Kallioniemi O P, Ann Med 2001, 33:142-147; Blohm and Guiseppi-Elie, Curr Opin Biotechnol 2001, 12:41-47). Tissues having a disease or disorder implicating defective IGFR signaling that express a GALNT, are identified as amenable to treatment with a GALNT modulating agent. In a preferred application, the IGFR defective tissue overexpresses a GALNT relative to normal tissue. For example, a Northern blot analysis of mRNA from tumor and normal cell lines, or from tumor and matching normal tissue samples from the same patient, using full or partial GALNT cDNA sequences as probes, can determine whether particular tumors express or overexpress GALNT. Alternatively, the TaqMan® is used for quantitative RT-PCR analysis of GALNT expression in cell lines, normal tissues and tumor samples (PE Applied Biosystems).
[0105]Various other diagnostic methods may be performed, for example, utilizing reagents such as the GALNT oligonucleotides, and antibodies directed against a GALNT, as described above for: (1) the detection of the presence of GALNT gene mutations, or the detection of either over- or under-expression of GALNT mRNA relative to the non-disorder state; (2)--the detection of either an over- or an under-abundance of GALNT gene product relative to the non-disorder state; and (3) the detection of perturbations or abnormalities in the signal transduction pathway mediated by GALNT.
[0106]Kits for detecting expression of GALNT in various samples, comprising at least one antibody specific to GALNT, all reagents and/or devices suitable for the detection of antibodies, the immobilization of antibodies, and the like, and instructions for using such kits in diagnosis or therapy are also provided.
[0107]Thus, in a specific embodiment, the invention is drawn to a method for diagnosing a disease or disorder in a patient that is associated with alterations in GALNT expression, the method comprising: a) obtaining a biological sample from the patient; b) contacting the sample with a probe for GALNT expression; c) comparing results from step (b) with a control; and d) determining whether step (c) indicates a likelihood of the disease or disorder. Preferably, the disease is cancer, most preferably a cancer as shown in TABLE 1. The probe may be either DNA or protein, including an antibody.
EXAMPLES
[0108]The following experimental section and examples are offered by way of illustration and not by way of limitation.
I. Drosophila IGFR Screen
[0109]A dominant loss of function screen was carried out in Drosophila to identify genes that interact with or modulate the IGFR signaling pathway. Activation of the pathway by overexpression of IGFR at early stages in the developing Drosophila eye leads to an increase in cell number which results in a larger and rougher adult eye (Potter C J et al. (2001) Cell 105:357-368; Huang et al., 1999. Dev. 126:5365-5372). We generated a fly stock with an enlarged eye due to overexpression of IGFR and identified modifiers of this phenotype. We then identified human orthologues of these modifiers.
[0110]The screening stock carried two transgenes. The genotype is as follows:
[0111]+; +; P{DmIGFR-pExp-UAS)} P{Gal4-pExp-1Xey}/TM6B
[0112]Screening stock females of the above genotype were crossed to males from a collection of 3 classes of piggyBac-based transposons. The resulting progeny, which contain both the transgenes and the transposon, were scored for the effect of the transposon on the eye overgrowth phenotype (either enhancement, suppression or no effect). All data was recorded and all modifiers were retested with a repeat of the original cross. CG9152 was an enhancer of the eye phenotype. Orthologs of the modifiers are referred to herein as GALNT.
[0113]BLAST analysis (Altschul et al., supra) was employed to identify orthologs of Drosophila modifiers. [For example, representative sequences from GALNT, GI# 13124891 (SEQ ID NO:9), and GI#42741667 (SEQ ID NO:10) share 65% and 67% amino acid identity, respectively, with the Drosophila CG9152.
[0114]Various domains, signals, and functional subunits in proteins were analyzed using the PSORT (Nakai K., and Horton P., Trends Biochem Sci, 1999, 24:34-6; Kenta Nakai, Protein sorting signals and prediction of subcellular localization, Adv. Protein Chem. 54, 277-344 (2000)), PFAM (Bateman A., et al., Nucleic Acids Res, 1999, 27:260-2), SMART (Ponting C P, et al., SMART: identification and annotation of domains from signaling and extracellular protein sequences. Nucleic Acids Res. 1999 Jan. 1; 27(1):229-32), TM-HMM (Erik L. L. Sonnhammer, Gunnar von Heijne, and Anders Krogh: A hidden Markov model for predicting transmembrane helices in protein sequences. In Proc. of Sixth Int. Conf. on Intelligent Systems for Molecular Biology, p 175-182 Ed J. Glasgow, T. Littlejohn, F. Major, R. Lathrop, D. Sankoff, and C. Sensen Menlo Park, Calif.: AAAI Press, 1998), and clust (Remm M, and Sonnhammer E. Classification of transmembrane protein families in the Caenorhabditis elegans genome and identification of human orthologs. Genome Res. 2000 November; 10(11):1679-89) programs. For example, the Ricin-type beta-trefoil lectin domain (PFAM 00652) of GALNT from GI#s13124891 and 4241667 (SEQ ID NOs:9 and 10, respectively) is located respectively at approximately amino acid residues 430 to 548 and 429 to 547. Likewise, the Glycosyl transferase domain of the same sequences is located respectively at approximately amino acid residues 119 to 303 and 118 to 302.
II. High-Throughput In Vitro Fluorescence Polarization Assay
[0115]Fluorescently-labeled GALNT peptide/substrate are added to each well of a 96-well microtiter plate, along with a test agent in a test buffer (10 mM HEPES, 10 mM NaCl, 6 mM magnesium chloride, pH 7.6). Changes in fluorescence polarization, determined by using a Fluorolite FPM-2 Fluorescence Polarization Microtiter System (Dynatech Laboratories, Inc), relative to control values indicates the test compound is a candidate modifier of GALNT activity.
III. High-Throughput In Vitro Binding Assay.
33P-labeled GALNT peptide is added in an assay buffer (100 mM KCl, 20 mM HEPES pH 7.6, 1 mM MgCl2, 1% glycerol, 0.5% NP-40, 50 mM beta-mercaptoethanol, 1 mg/ml BSA, cocktail of protease inhibitors) along with a test agent to the wells of a Neutralite-avidin coated assay plate and incubated at 25° C. for 1 hour. Biotinylated substrate is then added to each well and incubated for 1 hour. Reactions are stopped by washing with PBS, and counted in a scintillation counter. Test agents that cause a difference in activity relative to control without test agent are identified as candidate IGFR modulating agents.
IV. Immunoprecipitations and Immunoblotting
[0117]For coprecipitation of transfected proteins, 3×106 appropriate recombinant cells containing the GALNT proteins are plated on 10-cm dishes and transfected on the following day with expression constructs. The total amount of DNA is kept constant in each transfection by adding empty vector. After 24 h, cells are collected, washed once with phosphate-buffered saline and lysed for 20 min on ice in 1 ml of lysis buffer containing 50 mM Hepes, pH 7.9, 250 mM NaCl, 20 mM-glycerophosphate, 1 mM sodium orthovanadate, 5 mM p-nitrophenyl phosphate, 2 mM dithiothreitol, protease inhibitors (complete, Roche Molecular Biochemicals), and 1% Nonidet P-40. Cellular debris is removed by centrifugation twice at 15,000×g for 15 min. The cell lysate is incubated with 25 μl of M2 beads (Sigma) for 2 h at 4° C. with gentle rocking.
[0118]After extensive washing with lysis buffer, proteins bound to the beads are solubilized by boiling in SDS sample buffer, fractionated by SDS-polyacrylamide gel electrophoresis, transferred to polyvinylidene difluoride membrane and blotted with the indicated antibodies. The reactive bands are visualized with horseradish peroxidase coupled to the appropriate secondary antibodies and the enhanced chemiluminescence (ECL) Western blotting detection system (Amersham Pharmacia Biotech).
V. Expression Analysis
[0119]All cell lines used in the following experiments are NCI (National Cancer Institute) lines, and are available from ATCC (American Type Culture Collection, Manassas, Va. 20110-2209). Normal and tumor tissues were obtained from Impath, U C Davis, Clontech, Stratagene, Ardais, Genome Collaborative, and Ambion.
[0120]TaqMan® analysis was used to assess expression levels of the disclosed genes in various samples.
[0121]RNA was extracted from each tissue sample using Qiagen (Valencia, Calif.) RNeasy kits, following manufacturer's protocols, to a final concentration of 50 ng/μl. Single stranded cDNA was then synthesized by reverse transcribing the RNA samples using random hexamers and 500 ng of total RNA per reaction, following protocol 4304965 of Applied Biosystems (Foster City, Calif.).
[0122]Primers for expression analysis using TaqMan® assay (Applied Biosystems, Foster City, Calif.) were prepared according to the TaqMan® protocols, and the following criteria: a) primer pairs were designed to span introns to eliminate genomic contamination, and b) each primer pair produced only one product. Expression analysis was performed using a 7900HT instrument.
[0123]TaqMan® reactions were carried out following manufacturer's protocols, in 25 μl total volume for 96-well plates and 10 μl total volume for 384-well plates, using 300 nM primer and 250 nM probe, and approximately 25 ng of cDNA. The standard curve for result analysis was prepared using a universal pool of human cDNA samples, which is a mixture of cDNAs from a wide variety of tissues so that the chance that a target will be present in appreciable amounts is good. The raw data were normalized using 18S rRNA (universally expressed in all tissues and cells).
[0124]For each expression analysis, tumor tissue samples were compared with matched normal tissues from the same patient. A gene was considered overexpressed in a tumor when the level of expression of the gene was 2 fold or higher in the tumor compared with its matched normal sample. In cases where normal tissue was not available, a universal pool of cDNA samples was used instead. In these cases, a gene was considered overexpressed in a tumor sample when the difference of expression levels between a tumor sample and the average of all normal samples from the same tissue type was greater than 2 times the standard deviation of all normal samples (i.e., Tumor-average(all normal samples)>2×STDEV(all normal samples)).
[0125]Results are shown in Table 1. Number of pairs of tumor samples and matched normal tissue from the same patient are shown for each tumor type. Percentage of the samples with at least two-fold overexpression for each tumor type is provided. A modulator identified by an assay described herein can be further validated for therapeutic effect by administration to a tumor in which the gene is overexpressed. A decrease in tumor growth confirms therapeutic utility of the modulator. Prior to treating a patient with the modulator, the likelihood that the patient will respond to treatment can be diagnosed by obtaining a tumor sample from the patient, and assaying for expression of the gene targeted by the modulator. The expression data for the gene(s) can also be used as a diagnostic marker for disease progression. The assay can be performed by expression analysis as described above, by antibody directed to the gene target, or by any other available detection method.
TABLE-US-00001 TABLE 1 SEQ ID NO 1 5 Breast 31% 25% # of Pairs 36 36 Colon 8% 22% # of Pairs 40 37 Head And 23% 50% Neck # of Pairs 13 12 Liver 22% 0% # of Pairs 9 1 Lung 30% 15% # of Pairs 40 40 Lymphoma 0% 25% # of Pairs 4 4 Ovary 32% 21% # of Pairs 19 19 Pancreas 58% 75% # of Pairs 12 12 Prostate 21% 12% # of Pairs 24 24 Skin 0% 0% # of Pairs 7 6 Stomach 27% 18% # of Pairs 11 11 Testis 0% 12% # of Pairs 8 8 Thyroid 7% 64% Gland # of Pairs 14 14 Uteris 30% 18% # of Pairs 23 22
VI. GALNT Functional Assays
[0126]RNAi experiments were carried out to knock down expression of GALNT (SEQ ID NO:1) in various cell lines using small interfering RNAs (siRNA, Elbashir et al, supra).
[0127]Effect of GALNT RNAi on cell proliferation and growth. BrdU and Cell Titer-Glo® assays, as described above, were employed to study the effects of decreased GALNT expression on cell proliferation. The results of these experiments indicated that RNAi of GALNT of SEQ ID NO:1 decreased proliferation in 231T breast cancer cells, PC3 prostate cancer cells, and U87MG glioblastoma cells. Further, RNAi of GALNT of SEQ ID NO:1 also decreased cell count in A549 lung cancer cells, as compared with normal controls.
[0128]Effect of GALNT RNAI on apoptosis. The Phospho-histone H2B assay, as described above, was employed to study the effects of decreased GALNT expression on apoptosis. The results of this experiment indicated that RNAi of GALNT of SEQ ID NO:1 increased apoptosis in 231T breast cancer cells, PC3 prostate cancer cells, and U87MG glioblastoma cells.
[0129]Involvement in PTEN/IGFR pathway: GALNT FOXO nuclear translocation assays. FOXO nuclear translocation assays, as described above, were employed to assess involvement of GALNT in the PTEN/IGF pathway. In these experiments, cells with reduced expression of GALNT by RNAi were transiently transfected with a plasmid expressing GFP-tagged FOXO. Automated imaging of cellular components, such as nucleus and cytoplasm were then carried out to assess translocation of FOXO. Alternatively, cells were co-transfected with siRNA directed to GALNT along with a plasmid containing FOXO, and a cassette containing a promoter, a FOXO response element, and luciferase. Cells were then analyzed for luciferase activity and compared with cells with no siRNA. Results indicated that reduced expression of GALNT of SEQ ID NO:1 led translocation of FOXO to the cytoplasm, similar to loss of PTEN in A2780 ovarian cancer cells and PC3 prostate cancer cells. These results suggest involvement of GALNT in the IGFR pathway.
[0130]Pan-AKT assays. This assay was developed to detect involvement of GALNT in the PTEN/IGF pathway. The assay detects changes in phosphorylation for several substrates of AKT, such as PRAS40, BAD, 4EBP1, and RPS6. For this experiment, antibodies were raised against phosphorylated AKT substrates, including the consensus phosphorylated AKT substrate sequence RxRxxS/T. Expression levels of phosphorylated substrates were then quantitated at normal levels, in presence of a negative control, a positive control (AKT), and then with GALNT knockout.
[0131]We used 4EBP1 as the substrate for one subset of the experiments. For this substrate, AKT pathway inhibition causes decreased cytoplasmic staining and increased nuclear staining. Cells were plated in 96 well plates, transfected with RNAi for GALNT of SEQ ID NO:1, fixed, treated with 4EBP1 antibody, and stained. Measurements were based on percentage of population of cells with increased or decreased nuclear/cytoplasmic staining ratio compared with negative or positive control cells. Results of this experiment showed that reduced expression of GALNT resulted in altered levels of phospho 4EBP1 protein in 231T and PC3 cells, thus suggesting an involvement in the IGFR pathway.
[0132]We used BAD as the substrate for another subset of the experiments. For this substrate, AKT pathway inhibition causes decreased cytoplasmic staining and unchanged or increased nuclear staining. Cells were plated in 96 well plates, transfected with RNAi for NEWGENE, fixed, permeabilized and stained with anti-phospho-BAD antibody. Measurements were based on the percentage of the population of cells with a decreased Cytoplasmic/Nuclear staining ratio compared with negative or positive control cells. Results of this experiment showed that reduced expression of GALNT of SEQ ID NO:1 caused a reduction in the level of phospho-BAD protein in the cytoplasm in 231T, A549, and PC3 cells, thus suggesting an involvement in the IGFR pathway.
Sequence CWU
1
1013778DNAHomo sapiens 1tttttaaatt ttgcatttga cttaaagtgc catgagaaaa
tttgcatact gcaaggtggt 60cctagccacc tccttgattt gggtactctt ggatatgttc
ctgctgcttt acttcagtga 120atgcaacaaa tgtgatgaaa aaaaggagag aggacttcct
gctggagatg ttctagagcc 180agtacaaaag cctcatgaag gtcctggaga aatggggaaa
ccagtcgtca ttcctaaaga 240ggatcaagaa aagatgaaag agatgtttaa aatcaatcag
ttcaatttaa tggcaagtga 300gatgattgca ctcaacagat ctttaccaga tgttaggtta
gaagggtgta aaacaaaggt 360gtatccagat aatcttccta caacaagtgt ggtgattgtt
ttccacaatg aggcttggag 420cacacttctg cgaactgtcc atagtgtcat taatcgctca
ccaagacaca tgatagaaga 480aattgttcta gtagatgatg ccagtgaaag agactttttg
aaaaggcctt tagagagtta 540tgtgaaaaaa ctaaaagtac cagttcatgt aattcgaatg
gaacaacgtt ctggattgat 600cagagctaga ttaaaaggag ctgctgtgtc taaaggccaa
gtgatcacct tcctggatgc 660ccattgtgag tgtacagtgg gatggctgga gcctctcttg
gccaggatca aacatgacag 720gagaacagtg gtgtgtccca tcatcgatgt gatcagtgat
gatacttttg agtacatggc 780aggctctgat atgacctatg gtgggttcaa ctggaagctc
aattttcgct ggtatcctgt 840tccccaaaga gaaatggaca gaaggaaagg tgatcggact
cttcctgtca ggacacctac 900catggcagga ggcctttttt caatagacag agattacttt
caggaaattg gaacatatga 960tgctggaatg gatatttggg gaggagaaaa cctagaaatt
tcctttagga tttggcagtg 1020tggaggaact ttggaaattg ttacatgctc acatgttgga
catgtgtttc ggaaagctac 1080accttacacg tttccaggag gcacagggca gattatcaat
aaaaataaca gacgacttgc 1140agaagtgtgg atggatgaat tcaagaattt cttctatata
atttctccag gtgttacaaa 1200ggtagattat ggagatatat cgtcaagagt tggtctaaga
cacaaactac aatgcaaacc 1260tttttcctgg tacctagaga atatatatcc tgattctcaa
attccacgtc actatttctc 1320attgggagag atacgaaatg tggaaacgaa tcagtgtcta
gataacatgg ctagaaaaga 1380gaatgaaaaa gttggaattt ttaattgcca tggtatgggg
ggtaatcagg ttttctctta 1440tactgccaac aaagaaatta gaacagatga cctttgcttg
gatgtttcca aacttaatgg 1500cccagttaca atgctcaaat gccaccacct aaaaggcaac
caactctggg agtatgaccc 1560agtgaaatta accctgcagc atgtgaacag taatcagtgc
ctggataaag ccacagaaga 1620ggatagccag gtgcccagca ttagagactg caatggaagt
cggtcccagc agtggcttct 1680tcgaaacgtc accctgccag aaatattctg agaccaaatt
tacaaaaaaa cgaaaaaaat 1740aaggattgac tgggctacct cagcatacat ttctgccaca
ttcttaagta gcaaaaaagg 1800aaaagtgctt tcctcctctg caggatgtaa ggtttatcag
ccattaaaac ttagacttct 1860ctagcttttc actagctgtg aaccagcctt cctgtccatg
gacgtgaaac tgcatagtaa 1920tgagactgtg cacactgatg tttacaagat tgaaagagtc
tttctccgaa aatcatggta 1980aagaatactg agacaatgaa aaaaaatcaa caaaatatgc
tttctggaga actgtacctt 2040ctatggtttg cttgcacatc agtagtttct gctgaacgtg
ctgtcataat gaagagattt 2100ccaagatttt ttttcctgat tagaacgggt agccagtata
ttaaatattg atagaaaaat 2160aaaagaactg gaaccagatt cagaatcttg aaaacaacat
tttttacaac aaacaaaaaa 2220actatattaa acagggttta aaggaaaatt aaaacagaac
tatgaagaag tacaatttgt 2280tatagtatag tatcaaattt ctatatagat tttatacctc
agtggggaaa aataactgat 2340tccaatgaca ttcattttgt tttcatctgt gatagtcatg
gatgctttta ttttccttgg 2400ggtgctgaaa ttgagctgaa aaaaaaaggc tctttgaata
tagttttaat ttctctctac 2460agtttttttt gtttggtttg tgggctgttg gaattgtaat
ttttaattgc cttctaaaaa 2520atggaaattt aacaatgtct gatctcagct gaacaaatta
gatgtttcag ttgctcttgg 2580gtcaactggc ttacagattt acatgtgcac acacacacaa
atttcttatc acattttcga 2640cttcttcact tgacctaact gattatgcga aatacccaag
attcatgcta ctgttccaca 2700tttgttttca cagcaataaa tcttcagttc tgttgtttat
gattccactt aacaaggggc 2760ctgcaaatgt gatttattat ttgggtattt ggagataata
catttgaggg ttttttggaa 2820aacctttttc actccatact caaatatgct tcattgtcaa
atgcatattt aaattaaatt 2880attgaattgt aatgtttatc tgctgctttt tttaaataaa
atttgactga aaatgtttaa 2940ttggcatttt ttaatgactt acccaagaaa agtgcagcta
ttattccata ttaataggct 3000tgcatttctt ttcctaaatc ttatttaggc taaatcagtt
ttattgtcct ctgatttttt 3060ttaataccac agaaatcacc tgagtgtcaa ttgaaaagtt
gtcaattaaa aggtaacctt 3120ttaactctcg taggaggaat ctcattaaga catttttcct
gatatgtaga gcagtctgtt 3180ggcaaaaatg catatatttt ctttcatatt tgtaaaatta
tatttaatgg aattcttttc 3240tttgattatc aaggactttc actgcaggca gtgctatttc
ttgtgcctaa gaatgtttcc 3300aaaagtcgca tcgctaatga tatttgccaa gttgagtgta
cacaaagttt ctcatatcct 3360gttcaagtta atcaacatca aacacatggg gatgctttag
ggtgagtcta taatacaaaa 3420tgcataaacc atgtccccag gaaatttgaa aggaagcaag
tgctgaatgg aatttttttc 3480cttttccatg agctgtgtta attctatctc cagtaggcct
aatgcttgaa ataagcaaga 3540tgtctaatca ataaattatt ttcatgctca gaatttcagg
tttttgtact ccagcatagc 3600ttggtcttat ttcttactgt atgaaagctt aacagcaatg
tgatttaagg ttttgtttta 3660aatgggagat gtaagtgatt taattcatgg gtacttttag
aacctgatag ataatcccat 3720tgcctttatt tttctaatta aagaatccta aatactttga
aaatacaaaa tattcctg 377822185DNAHomo sapiens 2tttttaaatt ttgcatttga
cttaaagtgc catgagaaaa tttgcatact gcaaggtggt 60cctagccacc tccttgattt
gggtactctt ggatatgttc ctgctgcttt acttcagtga 120atgcaacaaa tgtgatgaaa
aaaaggagag aggacttcct gctggagatg ttctagagcc 180agtacaaaag cctcatgaag
gtcctggaga aatggggaaa ccagtcgtca ttcctaaaga 240ggatcaagaa aagatgaaag
agatgtttaa aatcaatcag ttcaatttaa tggcaagtga 300gatgattgca ctcaacagat
ctttaccaga tgttaggtta gaagggtgta aaacaaaggt 360gtatccagat aatcttccta
caacaagtgt ggtgattgtt ttccacaatg aggcttggag 420cacacttctg cgaactgtcc
atagtgtcat taatcgctca ccaagacaca tgatagaaga 480aattgttcta gtagatgatg
ccagtgaaag agactttttg aaaaggcctt tagagagtta 540tgtgaaaaaa ctaaaagtac
cagttcatgt aattcgaatg gaacaacgtt ctggattgat 600cagagctaga ttaaaaggag
ctgctgtgtc taaaggccaa gtgatcacct tcctggatgc 660ccattgtgag tgtacagtgg
gatggctgga gcctctcttg gccaggatca aacatgacag 720gagaacagtg gtgtgtccca
tcatcgatgt gatcagtgat gatacttttg agtacatggc 780aggctctgat atgacctatg
gtgggttcaa ctggaagctc aattttcgct ggtatcctgt 840tccccaaaga gaaatggaca
gaaggaaagg tgatcggact cttcctgtca ggacacctac 900catggcagga ggcctttttt
caatagacag agattacttt caggaaattg gaacatatga 960tgctggaatg gatatttggg
gaggagaaaa cctagaaatt tcctttagga tttggcagtg 1020tggaggaact ttggaaattg
ttacatgctc acatgttgga catgtgtttc ggaaagctac 1080accttacacg tttccaggag
gcacagggca gattatcaat aaaaataaca gacgacttgc 1140agaagtgtgg atggatgaat
tcaagaattt cttctatata atttctccag gtgttacaaa 1200ggtagattat ggagatatat
cgtcaagagt tggtctaaga cacaaactac aatgcaaacc 1260tttttcctgg tacctagaga
atatatatcc tgattctcaa attccacgtc actatttctc 1320attgggagag atacgaaaag
aggaaacgaa tcagtgtcta gataacatgg ctagaaaaga 1380gaatgaaaaa gttggaattt
ttaattgcca tggtatgggg ggtaatcagg ttttctctta 1440tactgccaac aaagaaatta
gaacagatga cctttgcttg gatgtttcca aacttaatgg 1500cccagttaca atgctcaaat
gccaccacct aaagggcaac caactctggg agtatgaccc 1560agtgaaatta accctgcagc
atgtgaacag taatcagtgc ctggataaag ccacagaaga 1620ggatagccag gtgcccagca
ttagagactg caatggaagt cggtcccagc agtggcttct 1680tcgaaacgtc accctgccag
aaatattctg agaccaaatt tacaaaaaaa cgaaaaaaat 1740aaggattgac tgggctacct
cagcatacat ttctgccaca ttcttaagta gcaaaaaagg 1800aaaagtgctt tcctcctctg
caggatgtaa ggtttatcag ccattaaaac ttagacttct 1860ctagcttttc actagctgtg
aaccagcctt cctgtccatg gacgtgaaac tgcatagtaa 1920tgagactgtg cacactgatg
tttacaagat tgaaagagtc tttctccgaa aatcatggta 1980aagaatactg agacaatgaa
aaaaaatcaa caaaatatgc tttctggaga actgtacctt 2040ctatggtttg cttgcacatc
agtagtttct gctgaacgtg ctgtcataat gaagagattt 2100ccaagatttt ttttcctgat
tagaacgggt agccagtata ttaaatattg atagaaaaat 2160aaaagaactg gaaccagatt
cagaa 218532541DNAHomo sapiens
3tgagaccaaa tttacaaaaa aacgaaaaaa ataaggattg actgggctac ctcagcatac
60atttctgcca cattcttaag tagcaaaaaa ggaaaagtgc tttcctcctc tgcaggatgt
120aaggtttatc agccattaaa acttagactt ctctagcttt tcactagctg tgaaccagcc
180ttcctgtcca tggacgtgaa actgcatagt aatgagactg tgcacactga tgtttacaag
240attgaaagag tctttctccg aaaatcatgg taaagaatac tgagacaatg aaaaaaaatc
300aacaaatatg ctttctggag aactgtacct tttatggttt gcttgcacat cagtagtttc
360tgctgaacgt gctgtcataa tgaagagatt tccaggattt tttttcctga ttagaactgg
420tagccagtat attaaatatt gatataaaaa taaaagaact ggaaccagat tccgaatctt
480gaaaacaaca ttttttacaa caaacaaaaa aactatatta aacagggttt aaaggaaaat
540taaaacagaa ctatgaagaa gtacaatttg ttatagtata gtatcaaatt tctatataga
600ttttatacct cagtggggaa aaataactga ttccaatgac attcattttg ttttcatctg
660tgatagtcat ggatgctttt attttccttg gggtgctgaa attgagctga aaaaaaaagg
720ctctttgaat atagttttaa tttctctcta cagttttttt tgtttggttt gtgggctgtt
780ggaattgtaa tttttaattg ccttctaaaa aatggaaatt taacaatgtc tgatctcagc
840tgaacaaatt agatgtttca gttgctcttg ggtcaactgg cttacagatt tacatgtgca
900cacacacaca aatttcttat cacattttcg acttcttcac ttgacctaac tgattatgcg
960aaatacccaa gattcatgct actgttccac atttgttttc acagcaataa atcttcagtt
1020ctgttgttta tgattccact taacaagggg cctgcaaatg tgatttatta tttgggtatt
1080tggagataat acatttgagg gttttttgga aaaccttttt cactccatac tcaaatatgc
1140ttcattgtca aatgcatatt taaattaaat tattgaattg taatgtttat ctgctgcttt
1200ttttaaataa aatttgactg aaaatgttta attggcattt tttaatgact tacccaagaa
1260aagtgcagct attattccat attaataggc ttgcatttct tttcctaaat cttatttagg
1320ctaaatcagt tttattgtcc tctgattttt tttaatacca cagaaatcac ctgagtgtca
1380attgaaaagt tgtcaattaa aaggtaacct tttaactctc gtaggaggaa tctcattaag
1440acatttttcc tgatatgtag agcagtctgt tggcaaaaat gcatatattt tctttcatat
1500ttgtaaaatt atatttaatg gaattctttt ctttgattat caaggacttt cactgcaggc
1560agtgctattt cttgtgccta agaatgtttc caaaagtcgc atcgctaatg atatttgcca
1620agttgagtgt acacaaagtt tctcatatcc tgttcaagtt aatcaacatc aaacacatgg
1680ggatgcttta gggtgagtct ataatacaaa atgcataaac catgtcccca ggaaatttga
1740aaggaagcaa gtgctgaatg gaattttttt ccttttccat gagctgtgtt aattctatct
1800ccagtaggcc taatgcttga aataagcaag atgtctaatc aataaattat tttcatgctc
1860agaatttcag gtttttgtac tccagcatag cttggtctta tttcttactg tatgaaagct
1920taacagcaat gtgatttaag gttttgtttt aaatgggaga tgtaagtgat ttaattcatg
1980ggtactttta gaacctgata gataatccca ttgcctttat ttttctaatt aaagaatcct
2040aaatactttg aaaatacaaa atattcctga atagttgtgc cttacattgg ttttatttga
2100agttagagtg gatccctgtt tatcataatt acttataatt aggcagatgc ttatcactct
2160attataactc aaattgggaa gaaaagaatg aaaaagaaaa caatggctgg gtactggctc
2220atacctgtaa ttttagcatt ttgggaaggt gagacgatag gaccacttga gggcaggagt
2280ttgaggttgc agtgagctat gatggtgcca ctgcactcca gcctgggcaa cagagaagaa
2340accatgtatc ttaaaaaaaa agaaaaaatg aaggatgaaa aagtgggatt attatttaac
2400cagtgaagaa ttcataaatg tgacaattaa ggataatttg aatctcacaa atctattcac
2460cccaccccag tcaaggtcta tttaatatat ctatcttcat tggcattctt gagaaataat
2520gctgccattg aacattgggg g
254141902DNAHomo sapiens 4aaattaaccc tgcagcatgt gaacagtaat cagtgcctgg
ataaagccac agaagaggat 60agccaggtgc ccagcattag agactgcaat ggaagtcggt
cccagcagtg gcttcttcga 120aacgtcaccc tgccagaaat attctgagac caaatttaca
aaaaaacgaa aaaaataagg 180attgactggg ctacctcagc atacatttct gccacattct
taagtagcaa aaaaggaaaa 240gtgctttcct cctctgcagg atgtaaggtt tatcagccat
taaaacttag acttctctag 300cttttcacta gctgtgaacc agccttcctg tccatggacg
tgaaactgca tagtaatgag 360actgtgcaca ctgatgttta caagattgaa agagtctttc
tccgaaaatc atggtaaaga 420atactgagac aatgaaaaaa aatcaacaaa atatgctttc
tggagaactg taccttctat 480ggtttgcttg cacatcagta gtttctgctg aacgtgctgt
cataatgaag agatttccaa 540gatttttttt cctgattaga acgggtagcc agtatattaa
atattgataa gaaaaataaa 600agaactggaa ccagattcag aatcatgaaa acaacattaa
aacaacaaca aaaaaactat 660attaaacagg gtttaaagga aattaaaaca gaactatgag
aagtacaatt tgtaatagta 720tagtatcaaa tttctatata gatttaatac ctcagtgggg
aaaaataact gagtccaatg 780tcattcattt tgtgtcatct ttgatagtca tggatccgtt
tatttccctt ggggtgctga 840aattcagctg taaaaaaaaa ggctctttga gtatagtttt
aatttctctc tacagttttt 900tttgtttggt ttgtgggctg ttggaattgt aatttttaat
tgccttctaa aaaatggaaa 960tttaacaatg tctgatctca gctgaacaaa ttagatgttt
cagttgctct tgggtcaact 1020ggcttacaga tttacatgtg cacacacaca caaatttctt
atcacatttc gacttcttca 1080cttgacctaa ctgattatgc gaaataccca agattcatgc
tactgtacca cagatttgtg 1140ttcacagcaa taaatcttca gttctttgtt tatgattcca
cttaacaaaa ggcctgcaga 1200agtgatttat tatttgggta tttggagata atacatttga
tggttttttg gaaaaccttt 1260ttcactccat actcagatat gcttcattgt caaatgcata
tttagattag attattgaat 1320tgtaatgttt atctgctgct ttttttaaat aaaatttgac
tgaaaatgtt taattggcat 1380ttttaatgac ttagccaaag aagtgcagct attattccat
attaataggc ttgcatttct 1440tttcctaaat cttatttagg ctaaatcagt tttttttttc
tctgattttt tttaatacca 1500cagaatcacc tgagtgtcaa ttgaaagttg tcaattaaaa
ggtaactttt aatctcgtag 1560gaggaatctc attaagacat ttttcctgat atgtagagca
gtctgttggc aaaaatgcat 1620atatttcctt tcatatttgt aaaattatat ttaatggaat
tcttttactt tgaccatcaa 1680ggactttcac tgcaggcagt gctatttcct tgtgcctaag
aaatgtttcc aaaagtcgca 1740tcgctaatga tatttgccaa gttgagtgta cacaaagttt
ctcatatcct gttcaagtta 1800atcaacatca agcacatggg gatgctttag ggtgagtcta
tagtacaaaa tgcataaacc 1860atgtccccag gaaatttgaa aggaagcagg tgctgaatgg
aa 190255096DNAHomo sapiens 5atgaggagat ctgtctactg
caaggtggtt ctagccactt cgctgatgtg ggttcttgtt 60gatgtcttct tactgctgta
cttcagtgaa tgtaacaaat gtgatgacaa gaaggagaga 120tctctgctgc ctgcattgag
ggctgttatt tcaagaaacc aagaagggcc aggagaaatg 180ggaaaagctg tgttgattcc
taaagatgac caggagaaaa tgaaagagct gtttaaaatc 240aatcagttta accttatggc
cagtgatttg attgccctta atagaagtct gccagatgta 300agattagaag gatgtaagac
aaaagtctac cctgatgaac ttccaaacac aagtgtagtc 360attgtgtttc ataatgaagc
ttggagcact ctccttagaa ctgtttacag tgtgataaat 420cgttccccac actatctact
ctcagaggtc atcttggtag atgatgccag tgaaagagat 480tttctcaagt tgacattaga
gaattacgtg aaaaatttag aagtgccagt aaaaattatt 540aggatggaag aacgctctgg
gttaatacgt gcccgtcttc gaggagcagc tgcttcaaaa 600gggcaggtca taacttttct
tgatgcacac tgtgaatgca cgttaggatg gctggagcct 660ttgctggcaa gaataaagga
agacaggaaa acggttgtct gccctatcat tgatgtgatt 720agtgatgata cttttgaata
tatggctggg tcagacatga cttatggggg ttttaactgg 780aaactgaatt tccgctggta
tcctgttccc caaagagaaa tggacaggag gaaaggagac 840agaacattac ctgtcaggac
ccctactatg gctggtggcc tattttctat tgacagaaac 900tactttgaag agataggaac
ttacgatgca ggaatggata tctggggtgg agagaatctt 960gaaatgtctt ttaggatttg
gcaatgtgga ggctccttgg agattgttac ttgctcccat 1020gttggtcatg tttttcggaa
ggcaactcca tacacttttc ctggtggcac tggtcatgtc 1080atcaacaaga acaacaggag
actggcagaa gtttggatgg atgaatttaa agatttcttc 1140tacatcatat ccccaggtgt
tgtcaaagtg gattatggag atgtgtcagt cagaaaaaca 1200ctaagagaaa atctgaagtg
taagcccttt tcttggtacc tagaaaacat ctatccggac 1260tcccagatcc caagacgtta
ttactcactt ggtgagataa gaaatgttga aaccaatcag 1320tgtttagaca acatgggccg
caaggaaaat gaaaaagtgg gtatattcaa ctgtcatggt 1380atgggaggaa atcaggtatt
ttcttacact gctgacaaag aaatccgaac cgatgacttg 1440tgcttggatg tttctagact
caatggacct gtaatcatgt taaaatgcca ccatatgaga 1500ggaaatcagt tatgggaata
tgatgctgag agactcacgt tgcgacatgt taacagtaac 1560caatgtctcg atgaaccttc
tgaagaagac aaaatggtgc ctacaatgca ggactgtagt 1620ggaagcagat cccaacagtg
gctgctaagg aacatgacct tgggcacatg aagatcatgt 1680cctccaagcc atgaaagtgt
ctacgctttt gtttttccat tatttcaatt gggggaaaat 1740attaactttg ctgaattgaa
agttttaaaa atccttttag tattctaaaa cacaattgtt 1800tctaattcgt ttctagaaat
gtttgcttat ttccctacta aaatttgtat ctgatcaaag 1860cacataagaa tataaataat
agcaaactac tattaaacaa cagaacaact tgtaaaacaa 1920attgtgtttg ctttaagaaa
aatctttatt gcactcatgt catagggtta attggaggtt 1980attttatttt tggttgtcat
ggtgattgaa agagataatg taaatgcctt ataaaatctt 2040cattatgaaa tattatcagt
tgctttataa actcactctt tttatggatc cttcatggaa 2100acatgtttga tttctgtgtc
tacaacaggg ccacatatta aattacttct gaatggtgaa 2160ttcatcttac aaaatgttcc
aagttttgga caaggaaaaa cattacattg gatattgaat 2220tcatggagcc tttacagaag
catcacattt aaaccaatgt gaattcaaaa aagagaagga 2280aactttttaa attcaattta
gagtacataa aaaaagaaat ccgcgaagca ttgaggggaa 2340caagatgagt ctaatccgca
gcacttcgat cactgtgaga gaactcaaag tgggttgcaa 2400tcattcctaa cacaggctga
aactaaacaa tcttgctccc agagtttatg ttggatactc 2460aattcttaac caaaatcttg
gtctccacaa actctaccat cccttttctc ttcactctat 2520agacagtggc atgccattga
tgctgtacag aattgcaggt gaaagggaga attttagact 2580taatttttaa ctctattgca
cttaaagatt ttagttaggt taccactgtc attttcattt 2640tctatgttaa agaatacctt
tcagtgctgt gctccagtat ctaaaaattt tatcaccagg 2700ggaataaact caatacacat
tcattaaact tttgttgtaa ttaaactgct atatattgtt 2760tgccatatat tatggcccaa
ggaatatagt tataatcagg ctacattcac attttctttt 2820ctcatcttaa aaccctttat
gttcaaaata cattagcaga ggccatgaaa gtgaaaaaaa 2880atcagatttt tgctagtaag
tttttatatt tgacatcatt ttaaatgctc aaagagttgc 2940cctatatatg catgttatcc
attataaatg cactcagcag ctaagagaaa tttgagagga 3000aaagtggaag aagtgttttc
tggagcagaa gattccactt ctttggtcct catctttact 3060taaaaagctc ttatagaaag
gattaatcat tttgactgca tacaaactat gtctctgaca 3120tgcacataca ctgctttata
atgaaaatga ggacactttc tgatggtcag taaaatctta 3180aatgattttt gccactggta
ttccttcctc ttttgacttt ttattggtac aactgtaggg 3240aagaatctac aacctggagc
atttcaggtt tgctctttac atatagacac ccctcaccat 3300gagctctctg tgaatccgtg
aatttcctaa aattgtaagc aaaatgttga atatcagggc 3360attttttttt ctgaggtggg
agtatgtcac tcacagcaaa cttggaaaga aggctatgac 3420tcccaagagg cagtgaacca
ttgccttaaa ttaatagcag cctctcaagt ccattgaggc 3480ctcatgtaaa attcccattt
ttcatcttta cactcctttt ctcccctcat ttcctcttct 3540ccttactaag gtaaaaagac
cctgcacagc attattactt aagttgaaaa agcttcaact 3600ctcaaggact catgtatacc
atatgttcca cttattgctt tgttagtcaa aatgcatgtt 3660atgtagaaaa tggtcaatgg
caaattttta taaatacagc taccccatca aagctgaact 3720tgaaacgttt tagcagaatt
ttaaatatgt gaaagttcat tgtggtcatg gtgagaaaac 3780taaaactaca tgagtttcat
gttagaaaag aatttttaaa caatcttcag atggcttgaa 3840cacgtgccat tatagtatct
aacaaaaagt ggcataaaat attacaaaac atccaacact 3900agaaagcaat gactaacatt
taatttgttt acaaaaatgt ttattgtctt cagtacaaaa 3960agttaaaacc atgattggat
tcataatgca cttgtcttat cccttattta tggagctaga 4020ctgacatatt tcaaaccaat
ctcttctgtg gtcttctcca tctcagttaa tgcaacttca 4080accttccaga tgctcaggtc
aaaaactgaa agttgctttt gactcctcta tttctgtttc 4140aactctcgtc taatccaaca
gcaaatcccc ttgatgctac ctttgaaata taactagaaa 4200agaatgactt gccaccatat
ccactaggtg tcactacctc taaccttttt aatcgcaata 4260tgcttattat ttagttccct
aatttcccag tccttgtctt gaacaatctt ttctcagtac 4320aacagccata gtgattctgt
taaccatgaa tttgattgtg tcacttccct gctcacaatc 4380ttccaatgac ttctccctcc
ggtggccttc aaggtccttt atgagtgtta cccctctaca 4440cgtccacacc actcaccaca
catacataca catgcacaca cacacacatg caattctacc 4500tttttctaac accatctctt
atctctcatt attctgcctc ttacttaccc tattatggct 4560cccaagaaag ttcctgcttt
agtgttttgt tccctctgcc ccagagagct ccccttagtg 4620atctcagttc aaatattatc
ttatcagtaa gacaaaccct tctcccatct gccttcccta 4680tcccctgtac ctgacaatct
atgagtttgc ttctttattt ctgtctatac ttccccctcc 4740ccagccccaa acatgcccat
cacatgccct agaaggtaag tcccatgaag gcaggggctt 4800tgtcagtttt gttcatttgg
tatttctggc atatatattc tctaaatatt tcttgaatta 4860atgaactgaa aaatgtgtgt
taaagttgct aagtgtaact gtatcatact ttttttgtat 4920tttaaatttt aaaataaagg
ctaatatatt tagacaggat ttccaaaaac tttgtcttca 4980aattttaatt ttcttcctaa
tattcttcac attaaactat ggactctaat tttctggata 5040aaatatcctg ccttatggaa
atgaaatatg aaattttatt aaaatgctgc tatatt 509664957DNAHomo sapiens
6gctgttattt caagaaacca agaagggcca ggagaaatgg gaaaagctgt gttgattcct
60aaagatgacc aggagaaaat gaaagagctg tttaaaatca atcagtttaa ccttatggcc
120agtgatttga ttgcccttaa tagaagtctg ccagatgtaa gattagaagg atgtaagaca
180aaagtctacc ctgatgaact tccaaacaca agtgtagtca ttgtgtttca taatgaagct
240tggagcactc tccttagaac tgtttacagt gtgataaatc gttccccaca ctatctactc
300tcagaggtca tcttggtaga tgatgccagt gaaagagatt ttctcaagtt gacattagag
360aattacgtga aaaatttaga agtgccagta aaaattatta ggatggaaga acgctctggg
420ttaatacgtg cccgtcttcg aggagcagct gcttcaaaag ggcaggtcat aacttttctt
480gatgcacact gtgaatgcac gttaggatgg ctggagcctt tgctggcaag aataaaggaa
540gacaggaaaa cggttgtctg ccctatcatt gatgtgatta gtgatgatac ttttgaatat
600atggctgggt cagacatgac ttatgggggt tttaactgga aactgaattt ccgctggtat
660cctgttcccc aaagagaaat ggacaggagg aaaggagaca gaacattacc tgtcagagga
720cccctactat ggctggtggc ctattttcta ttgacagaaa ctactttgaa gagataggaa
780cttacgatgc aggaatggat atctggggtg gagagaatct tgaaatgtct tttaggattt
840ggcaatgtgg aggctccttg gagattgtta cttgctccca tgttggtcat gtttttcgga
900aggcaactcc atacactttt cctggtggca ctggtcatgt catcaacaag aacaacagga
960gactggcaga agtttggatg gatgaattta aagatttctt ctacatcata tccccaggtg
1020ttgtcaaagt ggattatgga gatgtgtcag tcagaaaaac actaagagaa aatctgaagt
1080gtaagccctt ttcttggtac ctagaaaaca tctatccgga ctcccagatc ccaagacgtt
1140attactcact tggtgagata agaaatgttg aaaccaatca gtgtttagac aacatgggcc
1200gcaaggaaaa tgaaaaagtg ggtatattca actgtcatgg tatgggagga aatcaggtat
1260tttcttacac tgctgacaaa gaaatccgaa ccgatgactt gtgcttggat gtttctagac
1320tcaatggacc tgtaatcatg ttaaaatgcc accatatgag aggaaatcag ttatgggaat
1380atgatgctga gagactcacg ttgcgacatg ttaacagtaa ccaatgtctc gatgaacctt
1440ctgaagaaga caaaatggtg cctacaatgc aggactgtag tggaagcaga tcccaacagt
1500ggctgctaag gaacatgacc ttgggcacat gaagatcatg tcctccaagc catgaaagtg
1560tctacgcttt tgtttttcca ttatttcaat tgggggaaaa tattaacttt gctgaattga
1620aagttttaaa aatcctttta gtattctaaa acacaattgt ttctaattcg tttctagaaa
1680tgtttgctta tttccctact aaaatttgta tctgatcaaa gcacataaga atataaataa
1740tagcaaacta ctattaaaca acagaacaac ttgtaaaaca aattgtgttt gctttaagaa
1800aaatgtttat tgcactcatg tcatagggtt aattggaggt tattttattt ttggttgtca
1860tggtgattga aagagataat gtaaatgcct tataaaatct tcattatgaa atattatcag
1920ttgctttata aactcactct ttttatggat ccttcatgga aacatgtttg atttctgtgt
1980ctacaacagg gccacatatt aaattacttc tgaatggtga attcatctta caaaatgttc
2040caagttttgg acaaggaaaa acattacatt ggatattgaa ttcatggagc ctttacagaa
2100gcatcacatt taaaccaatg tgaattcaaa aaagagaagg aaacttttta aattcaattt
2160agagtacata aaaaaagaaa tccgcgaagc attgagggga acaagatgag tctaatccgc
2220agcacttcga tcactgtgag agaactcaaa gtgggttgca atcattccta acacaggctg
2280aaactaaaca atcttgctcc tagagtttat gttggatact caattcttaa ccaaaatctt
2340ggtctccaca aactctacca tcccttttct cttcactcta tagacagtgg catgccattg
2400atgctgtaca gaattgcagg tgaaagggag aattttagac ttaattttta actctattgc
2460acttaaagat tttagttagg ttaccactgt cattttcatt ttctatgtta aagaatacct
2520ttcagtgctg tgctccagta tctaaaaatt ttatcaccag gggaataaac tcaatacaca
2580ttcattaaac ttttgttgta attaaactgc tatatattgt ttgccatata ttatggccca
2640aggaatatag ttataatcag gctacattca cattttcttt tctcatctta aaacccttta
2700tgttcaaaat acattagcag aggccatgaa agtgaaaaaa aatcagattt ttgctagtaa
2760gtttttatat ttgacatcat tttaaatgct caaagagttg ccctatatat gcatgttatc
2820cattataaat gcactcagca gctaagagaa atttgagagg aaaagtggaa gaagtgtttt
2880ctggagcaga agattccact tctttggtcc tcatctttac ttaaaaagct cttatagaaa
2940ggattaatca ttttgactgc atacaaacta tgtctctgac atgcacatac actgctttat
3000aatgaaaatg aggacacttt ctgatggtca gtaaaatctt aaatgatttt tgccactggt
3060attccttcct cttttgactt tttattggta caactgtagg gaagaatcta caacctggag
3120catttcaggt ttgctcttta aatatagaca cccctcacca tgagctctct gtgaatccgt
3180gaatttccta aaattgtaag caaaatgttg aatatcaggg catttttttt tctgaggtgg
3240gagtatgtca ctcacagcaa acttggaaag aaggctatga ctcccaagag gcagtgaacc
3300attgccttaa attaatagca gcctctcaag tccattgagg cctcatgtaa aattcccatt
3360tttcatcttt acactccttt tctcccctca tttcctcttc tccttactaa ggtaaaaaga
3420ccctgcacag cattattact taagttgaaa aagcttcaac tctcaaggac tcatgtatac
3480catatgttcc acttattgct ttgttagtca aaatgcatgt tatgtagaaa atggtcaatg
3540gcaaattttt ataaatacag ctaccccatc aaagctgaac ttgaaacgtt ttagcagaat
3600tttaaatatg tgaaagttca ttgtggtcat ggtgagaaaa ctaaaactac atgagtttca
3660tgttagaaaa gaatttttaa acaatcttca gatggcttga acacgtgcca ttatagtatc
3720taacaaaaag tggcataaaa tattacaaaa catccaacac tagaaagcaa tgactaacat
3780ttaatttgtt tacaaaaatg tttattgtct tcagtacaaa aagttaaaac catgattgga
3840ttcataatgc acttgtctta tcccttattt atggagctag actgacatat ttcaaaccaa
3900tctcttctgt ggtcttctcc atctcagtta atgcaacttc aaccttccag atgctcaggt
3960caaaaactga aagttgcttt tgactcctct atttctgttt caactctcgt ctaatccaac
4020agcaaatccc cttgatgcta cctttgaaat ataactagaa aagaatgact tgccaccata
4080tccactaggt gtcactacct ctaacctttt taatcgcaat atgcttatta tttagttccc
4140taatttccca gtccttgtct tgaacaatct tttctcagta caacagccat agtgattctg
4200ttaaccatga atttgattgt gtcacttccc tgctcacaat cttccaatga cttctccctc
4260cggtggcctt caaggtcctt tatgagtgtt acccctctac acgtccacac cactcaccac
4320acatacatac acatgcacac acacacacat gcaattctac ctttttctaa caccatctct
4380tatctctcat tattctgcct cttacttacc ctattatggc tcccaagaaa gttcctgctt
4440tagtgttttg ttccctctgc cccagagagc tccccttagt gatctcagtt caaatattat
4500cttatcagta agacaaaccc ttctcccatc tgccttccct atcccctgta cctgacaatc
4560tatgagtttg cttctttatt tctgtctata cttccccctc cccagcccca aacatgccca
4620tcacatgccc tagaaggtaa gtcccatgaa ggcaggggct ttgtcagttt tgttcatttg
4680gtatttctgg catatatatt ctctaaatat ttcttgaatt aatgaactga aaaatgtgtg
4740ttaaagttgc taagtgtaac tgtatcatac tttttttgta ttttaaattt taaaataaag
4800gctaatatat ttagacagga tttccaaaaa ctttgtcttc aaattttaat tttcttccta
4860atattcttca cattaaacta tggactctaa ttttctggat aaaatatcct gccttatgga
4920aatgaaatat gaaattttat taaaatgctg ctatatt
495775096DNAHomo sapiens 7atgaggagat ttgtctactg caaggtggtt ctagccactt
cgctgatgtg ggttcttgtt 60gatgtcttct tactgctgta cttcagtgaa tgtaacaaat
gtgatgacaa gaaggagaga 120tctctgctgc ctgcattgag ggctgttatt tcaagaaacc
aagaagggcc aggagaaatg 180ggaaaagctg tgttgattcc taaagatgac caggagaaaa
tgaaagagct gtttaaaatc 240aatcagttta accttatggc cagtgatttg attgccctta
atagaagtct gccagatgta 300agattagaag gatgtaagac aaaagtctac cctgatgaac
ttccaaacac aagtgtagtc 360attgtgtttc ataatgaagc ttggagcact ctccttagaa
ctgtttacag tgtgataaat 420cgttccccac actatctact ctcagaggtc atcttggtag
atgatgccag tgaaagagat 480tttctcaagt tgacattaga gaattacgtg aaaaatttag
aagtgccagt aaaaattatt 540aggatggaag aacgctctgg gttaatacgt gcccgtcttc
gaggagcagc tgcttcaaaa 600gggcaggtca taacttttct tgatgcacac tgtgaatgca
cgttaggatg gctggagcct 660ttgctggcaa gaataaagga agacaggaaa acggttgtct
gccctatcat tgatgtgatt 720agtgatgata cttttgaata tatggctggg tcagacatga
cttatggggg ttttaactgg 780aaactgaatt tccgctggta tcctgttccc caaagagaaa
tggacaggag gaaaggagac 840agaacattac ctgtcaggac ccctactatg gctggtggcc
tattttctat tgacagaaac 900tactttgaag agataggaac ttacgatgca ggaatggata
tctggggtgg agagaatctt 960gaaatgtctt ttaggatttg gcaatgtgga ggctccttgg
agattgttac ttgctcccat 1020gttggtcatg tttttcggaa ggcaactcca tacacttttc
ctggtggcac tggtcatgtc 1080atcaacaaga acaacaggag actggcagaa gtttggatgg
atgaatttaa agatttcttc 1140tacatcatat ccccaggtgt tgtcaaagtg gattatggag
atgtgtcagt cagaaaaaca 1200ctaagagaaa atctgaagtg taagcccttt tcttggtacc
tagaaaacat ctatccggac 1260tcccagatcc caagacgtta ttactcactt ggtgagataa
gaaatgttga aaccaatcag 1320tgtttagaca acatgggccg caaggaaaat gaaaaagtgg
gtatattcaa ctgtcatggt 1380atgggaggaa atcaggtatt ttcttacact gctgacaaag
aaatccgaac cgatgacttg 1440tgcttggatg tttctagact caatggacct gtaatcatgt
taaaatgcca ccatatgaga 1500ggaaatcagt tatgggaata tgatgctgag agactcacgt
tgcgacatgt taacagtaac 1560caatgtctcg atgaaccttc tgaagaagac aaaatggtgc
ctacaatgca ggactgtagt 1620ggaagcagat cccaacagtg gctgctaagg aacatgacct
tgggcacatg aagatcatgt 1680cctccaagcc atgaaagtgt ctacgctttt gtttttccat
tatttcaatt gggggaaaat 1740attaactttg ctgaattgaa agttttaaaa atccttttag
tattctaaaa cacaattgtt 1800tctaattcgt ttctagaaat gtttgcttat ttccctacta
aaatttgtat ctgatcaaag 1860cacataagaa tataaataat agcaaactac tattaaacaa
cagaacaact tgtaaaacaa 1920attgtgtttg ctttaagaaa aatgtttatt gcactcatgt
catagggtta attggaggtt 1980attttatttt tggttgtcat ggtgattgaa agagataatg
taaatgcctt ataaaatctt 2040cattatgaaa tattatcagt tgctttataa actcactctt
tttatggatc cttcatggaa 2100acatgtttga tttctgtgtc tacaacaggg ccacatatta
aattacttct gaatggtgaa 2160ttcatcttac aaaatgttcc aagttttgga caaggaaaaa
cattacattg gatattgaat 2220tcatggagcc tttacagaag catcacattt aaaccaatgt
gaattcaaaa aagagaagga 2280aactttttaa attcaattta gagtacataa aaaaagaaat
ccgcgaagca ttgaggggaa 2340caagatgagt ctaatccgca gcacttcgat cactgtgaga
gaactcaaag tgggttgcaa 2400tcattcctaa cacaggctga aactaaacaa tcttgctcct
agagtttatg ttggatactc 2460aattcttaac caaaatcttg gtctccacaa actctaccat
cccttttctc ttcactctat 2520agacagtggc atgccattga tgctgtacag aattgcaggt
gaaagggaga attttagact 2580taatttttaa ctctattgca cttaaagatt ttagttaggt
taccactgtc attttcattt 2640tctatgttaa agaatacctt tcagtgctgt gctccagtat
ctaaaaattt tatcaccagg 2700ggaataaact caatacacat tcattaaact tttgttgtaa
ttaaactgct atatattgtt 2760tgccatatat tatggcccaa ggaatatagt tataatcagg
ctacattcac attttctttt 2820ctcatcttaa aaccctttat gttcaaaata cattagcaga
ggccatgaaa gtgaaaaaaa 2880atcagatttt tgctagtaag tttttatatt tgacatcatt
ttaaatgctc aaagagttgc 2940cctatatatg catgttatcc attataaatg cactcagcag
ctaagagaaa tttgagagga 3000aaagtggaag aagtgttttc tggagcagaa gattccactt
ctttggtcct catctttact 3060taaaaagctc ttatagaaag gattaatcat tttgactgca
tacaaactat gtctctgaca 3120tgcacataca ctgctttata atgaaaatga ggacactttc
tgatggtcag taaaatctta 3180aatgattttt gccactggta ttccttcctc ttttgacttt
ttattggtac aactgtaggg 3240aagaatctac aacctggagc atttcaggtt tgctctttaa
atatagacac ccctcaccat 3300gagctctctg tgaatccgtg aatttcctaa aattgtaagc
aaaatgttga atatcagggc 3360attttttttt ctgaggtggg agtatgtcac tcacagcaaa
cttggaaaga aggctatgac 3420tcccaagagg cagtgaacca ttgccttaaa ttaatagcag
cctctcaagt ccattgaggc 3480ctcatgtaaa attcccattt ttcatcttta cactcctttt
ctcccctcat ttcctcttct 3540ccttactaag gtaaaaagac cctgcacagc attattactt
aagttgaaaa agcttcaact 3600ctcaaggact catgtatacc atatgttcca cttattgctt
tgttagtcaa aatgcatgtt 3660atgtagaaaa tggtcaatgg caaattttta taaatacagc
taccccatca aagctgaact 3720tgaaacgttt tagcagaatt ttaaatatgt gaaagttcat
tgtggtcatg gtgagaaaac 3780taaaactaca tgagtttcat gttagaaaag aatttttaaa
caatcttcag atggcttgaa 3840cacgtgccat tatagtatct aacaaaaagt ggcataaaat
attacaaaac atccaacact 3900agaaagcaat gactaacatt taatttgttt acaaaaatgt
ttattgtctt cagtacaaaa 3960agttaaaacc atgattggat tcataatgca cttgtcttat
cccttattta tggagctaga 4020ctgacatatt tcaaaccaat ctcttctgtg gtcttctcca
tctcagttaa tgcaacttca 4080accttccaga tgctcaggtc aaaaactgaa agttgctttt
gactcctcta tttctgtttc 4140aactctcgtc taatccaaca gcaaatcccc ttgatgctac
ctttgaaata taactagaaa 4200agaatgactt gccaccatat ccactaggtg tcactacctc
taaccttttt aatcgcaata 4260tgcttattat ttagttccct aatttcccag tccttgtctt
gaacaatctt ttctcagtac 4320aacagccata gtgattctgt taaccatgaa tttgattgtg
tcacttccct gctcacaatc 4380ttccaatgac ttctccctcc ggtggccttc aaggtccttt
atgagtgtta cccctctaca 4440cgtccacacc actcaccaca catacataca catgcacaca
cacacacatg caattctacc 4500tttttctaac accatctctt atctctcatt attctgcctc
ttacttaccc tattatggct 4560cccaagaaag ttcctgcttt agtgttttgt tccctctgcc
ccagagagct ccccttagtg 4620atctcagttc aaatattatc ttatcagtaa gacaaaccct
tctcccatct gccttcccta 4680tcccctgtac ctgacaatct atgagtttgc ttctttattt
ctgtctatac ttccccctcc 4740ccagccccaa acatgcccat cacatgccct agaaggtaag
tcccatgaag gcaggggctt 4800tgtcagtttt gttcatttgg tatttctggc atatatattc
tctaaatatt tcttgaatta 4860atgaactgaa aaatgtgtgt taaagttgct aagtgtaact
gtatcatact ttttttgtat 4920tttaaatttt aaaataaagg ctaatatatt tagacaggat
ttccaaaaac tttgtcttca 4980aattttaatt ttcttcctaa tattcttcac attaaactat
ggactctaat tttctggata 5040aaatatcctg ccttatggaa atgaaatatg aaattttatt
aaaatgctgc tatatt 509681671DNAHomo sapiens 8atgaggagat ctgtctactg
caaggtggtt ctagccactt cgctgatgtg ggttcttgtt 60gatgtcttct tactgctgta
cttcagtgaa tgtaacaaat gtgatgacaa gaaggagaga 120tctctgctgc ctgcattgag
ggctgttatt tcaagaaacc aagaagggcc aggagaaatg 180ggaaaagccg tgttgattcc
taaagatgac caggagaaaa tgaaagagct gtttaaaatc 240aatcagttta accttatggc
cagtgatttg attgccctta atagaagtct gccagatgta 300agattagaag gatgtaagac
aaaagtctac cctgatgaac ttccaaacac aagtgtagtc 360attgtgtttc ataatgaagc
ttggagcact ctccttagaa ctgtttacag tgtgataaat 420cgttccccac actatctact
ctcagaggtc atcttggtag atgatgccag tgaaagagat 480tttctcaagt tgacattaga
gaattacgtg aaaaatttag aagtgccagt aaaaattatt 540aggatggaag aacgctctgg
gttaatacgt gcccgtcttc gaggagcagc tgcttcaaaa 600gggcaggtca taacttttct
tgatgcacac tgtgaatgca cgttaggatg gctggagcct 660ttgctggcaa gaataaagga
agacaggaaa acggttgtct gccctatcat tgatgtgatt 720agtgatgata cttttgaata
tatggctggg tcagacatga cttatggggg ttttaactgg 780aaactgaatt tccgctggta
tcctgttccc caaagagaaa tggacaggag gaaaggagac 840agaacattac ctgtcaggac
ccctactatg gctggtggcc tattttctat tgacagaaac 900tactttgaag agataggaac
ttacgatgca ggaatggata tctggggtgg agagaatctt 960gaaatgtctt ttaggatttg
gcaatgtgga ggctccttgg agattgttac ttgctcccat 1020gttggtcatg tttttcggaa
ggcaactcca tacacttttc ctggtggcac tggtcatgtc 1080atcaacaaga acaacaggag
actggcagaa gtttggatgg atgaatttaa agatttcttc 1140tacatcatat ccccaggtgt
tgtcaaagtg gattatggag atgtgtcagt cagaaaaaca 1200ctaagagaaa atctgaagtg
taagcccttt tcttggtacc tagaaaacat ctatccggac 1260tcccagatcc caagacgtta
ttactcactt ggtgagataa gaaatgttga aaccaatcag 1320tgtttagaca acatgggccg
caaggaaaat gaaaaagtgg gtatattcaa ctgtcatggt 1380atgggaggaa atcaggtatt
ttcttacact gctgacaaag aaatccgaac cgatgacttg 1440tgcttggatg tttctagact
caatggacct gtaatcatgt taaaatgcca ccatatgaga 1500ggaaatcagt tatgggaata
tgatgctgag agactcacgt tgcgacatgt taacagtaac 1560caatgtctcg atgaaccttc
tgaagaagac aaaatggtgc ctacaatgca ggactgtagt 1620ggaagcagat cccaacagtg
gctgctaagg aacatgacct tgggcacatg a 16719559PRTHomo sapiens
9Met Arg Lys Phe Ala Tyr Cys Lys Val Val Leu Ala Thr Ser Leu Ile1
5 10 15Trp Val Leu Leu Asp Met
Phe Leu Leu Leu Tyr Phe Ser Glu Cys Asn 20 25
30Lys Cys Asp Glu Lys Lys Glu Arg Gly Leu Pro Ala Gly
Asp Val Leu 35 40 45Glu Pro Val
Gln Lys Pro His Glu Gly Pro Gly Glu Met Gly Lys Pro 50
55 60Val Val Ile Pro Lys Glu Asp Gln Glu Lys Met Lys
Glu Met Phe Lys65 70 75
80Ile Asn Gln Phe Asn Leu Met Ala Ser Glu Met Ile Ala Leu Asn Arg
85 90 95Ser Leu Pro Asp Val Arg
Leu Glu Gly Cys Lys Thr Lys Val Tyr Pro 100
105 110Asp Asn Leu Pro Thr Thr Ser Val Val Ile Val Phe
His Asn Glu Ala 115 120 125Trp Ser
Thr Leu Leu Arg Thr Val His Ser Val Ile Asn Arg Ser Pro 130
135 140Arg His Met Ile Glu Glu Ile Val Leu Val Asp
Asp Ala Ser Glu Arg145 150 155
160Asp Phe Leu Lys Arg Pro Leu Glu Ser Tyr Val Lys Lys Leu Lys Val
165 170 175Pro Val His Val
Ile Arg Met Glu Gln Arg Ser Gly Leu Ile Arg Ala 180
185 190Arg Leu Lys Gly Ala Ala Val Ser Lys Gly Gln
Val Ile Thr Phe Leu 195 200 205Asp
Ala His Cys Glu Cys Thr Val Gly Trp Leu Glu Pro Leu Leu Ala 210
215 220Arg Ile Lys His Asp Arg Arg Thr Val Val
Cys Pro Ile Ile Asp Val225 230 235
240Ile Ser Asp Asp Thr Phe Glu Tyr Met Ala Gly Ser Asp Met Thr
Tyr 245 250 255Gly Gly Phe
Asn Trp Lys Leu Asn Phe Arg Trp Tyr Pro Val Pro Gln 260
265 270Arg Glu Met Asp Arg Arg Lys Gly Asp Arg
Thr Leu Pro Val Arg Thr 275 280
285Pro Thr Met Ala Gly Gly Leu Phe Ser Ile Asp Arg Asp Tyr Phe Gln 290
295 300Glu Ile Gly Thr Tyr Asp Ala Gly
Met Asp Ile Trp Gly Gly Glu Asn305 310
315 320Leu Glu Ile Ser Phe Arg Ile Trp Gln Cys Gly Gly
Thr Leu Glu Ile 325 330
335Val Thr Cys Ser His Val Gly His Val Phe Arg Lys Ala Thr Pro Tyr
340 345 350Thr Phe Pro Gly Gly Thr
Gly Gln Ile Ile Asn Lys Asn Asn Arg Arg 355 360
365Leu Ala Glu Val Trp Met Asp Glu Phe Lys Asn Phe Phe Tyr
Ile Ile 370 375 380Ser Pro Gly Val Thr
Lys Val Asp Tyr Gly Asp Ile Ser Ser Arg Val385 390
395 400Gly Leu Arg His Lys Leu Gln Cys Lys Pro
Phe Ser Trp Tyr Leu Glu 405 410
415Asn Ile Tyr Pro Asp Ser Gln Ile Pro Arg His Tyr Phe Ser Leu Gly
420 425 430Glu Ile Arg Asn Val
Glu Thr Asn Gln Cys Leu Asp Asn Met Ala Arg 435
440 445Lys Glu Asn Glu Lys Val Gly Ile Phe Asn Cys His
Gly Met Gly Gly 450 455 460Asn Gln Val
Phe Ser Tyr Thr Ala Asn Lys Glu Ile Arg Thr Asp Asp465
470 475 480Leu Cys Leu Asp Val Ser Lys
Leu Asn Gly Pro Val Thr Met Leu Lys 485
490 495Cys His His Leu Lys Gly Asn Gln Leu Trp Glu Tyr
Asp Pro Val Lys 500 505 510Leu
Thr Leu Gln His Val Asn Ser Asn Gln Cys Leu Asp Lys Ala Thr 515
520 525Glu Glu Asp Ser Gln Val Pro Ser Ile
Arg Asp Cys Asn Gly Ser Arg 530 535
540Ser Gln Gln Trp Leu Leu Arg Asn Val Thr Leu Pro Glu Ile Phe545
550 55510556PRTHomo sapiens 10Met Arg Arg Ser Val
Tyr Cys Lys Val Val Leu Ala Thr Ser Leu Met1 5
10 15Trp Val Leu Val Asp Val Phe Leu Leu Leu Tyr
Phe Ser Glu Cys Asn 20 25
30Lys Cys Asp Asp Lys Lys Glu Arg Ser Leu Leu Pro Ala Leu Arg Ala
35 40 45Val Ile Ser Arg Asn Gln Glu Gly
Pro Gly Glu Met Gly Lys Ala Val 50 55
60Leu Ile Pro Lys Asp Asp Gln Glu Lys Met Lys Glu Leu Phe Lys Ile65
70 75 80Asn Gln Phe Asn Leu
Met Ala Ser Asp Leu Ile Ala Leu Asn Arg Ser 85
90 95Leu Pro Asp Val Arg Leu Glu Gly Cys Lys Thr
Lys Val Tyr Pro Asp 100 105
110Glu Leu Pro Asn Thr Ser Val Val Ile Val Phe His Asn Glu Ala Trp
115 120 125Ser Thr Leu Leu Arg Thr Val
Tyr Ser Val Ile Asn Arg Ser Pro His 130 135
140Tyr Leu Leu Ser Glu Val Ile Leu Val Asp Asp Ala Ser Glu Arg
Asp145 150 155 160Phe Leu
Lys Leu Thr Leu Glu Asn Tyr Val Lys Asn Leu Glu Val Pro
165 170 175Val Lys Ile Ile Arg Met Glu
Glu Arg Ser Gly Leu Ile Arg Ala Arg 180 185
190Leu Arg Gly Ala Ala Ala Ser Lys Gly Gln Val Ile Thr Phe
Leu Asp 195 200 205Ala His Cys Glu
Cys Thr Leu Gly Trp Leu Glu Pro Leu Leu Ala Arg 210
215 220Ile Lys Glu Asp Arg Lys Thr Val Val Cys Pro Ile
Ile Asp Val Ile225 230 235
240Ser Asp Asp Thr Phe Glu Tyr Met Ala Gly Ser Asp Met Thr Tyr Gly
245 250 255Gly Phe Asn Trp Lys
Leu Asn Phe Arg Trp Tyr Pro Val Pro Gln Arg 260
265 270Glu Met Asp Arg Arg Lys Gly Asp Arg Thr Leu Pro
Val Arg Thr Pro 275 280 285Thr Met
Ala Gly Gly Leu Phe Ser Ile Asp Arg Asn Tyr Phe Glu Glu 290
295 300Ile Gly Thr Tyr Asp Ala Gly Met Asp Ile Trp
Gly Gly Glu Asn Leu305 310 315
320Glu Met Ser Phe Arg Ile Trp Gln Cys Gly Gly Ser Leu Glu Ile Val
325 330 335Thr Cys Ser His
Val Gly His Val Phe Arg Lys Ala Thr Pro Tyr Thr 340
345 350Phe Pro Gly Gly Thr Gly His Val Ile Asn Lys
Asn Asn Arg Arg Leu 355 360 365Ala
Glu Val Trp Met Asp Glu Phe Lys Asp Phe Phe Tyr Ile Ile Ser 370
375 380Pro Gly Val Val Lys Val Asp Tyr Gly Asp
Val Ser Val Arg Lys Thr385 390 395
400Leu Arg Glu Asn Leu Lys Cys Lys Pro Phe Ser Trp Tyr Leu Glu
Asn 405 410 415Ile Tyr Pro
Asp Ser Gln Ile Pro Arg Arg Tyr Tyr Ser Leu Gly Glu 420
425 430Ile Arg Asn Val Glu Thr Asn Gln Cys Leu
Asp Asn Met Gly Arg Lys 435 440
445Glu Asn Glu Lys Val Gly Ile Phe Asn Cys His Gly Met Gly Gly Asn 450
455 460Gln Val Phe Ser Tyr Thr Ala Asp
Lys Glu Ile Arg Thr Asp Asp Leu465 470
475 480Cys Leu Asp Val Ser Arg Leu Asn Gly Pro Val Ile
Met Leu Lys Cys 485 490
495His His Met Arg Gly Asn Gln Leu Trp Glu Tyr Asp Ala Glu Arg Leu
500 505 510Thr Leu Arg His Val Asn
Ser Asn Gln Cys Leu Asp Glu Pro Ser Glu 515 520
525Glu Asp Lys Met Val Pro Thr Met Gln Asp Cys Ser Gly Ser
Arg Ser 530 535 540Gln Gln Trp Leu Leu
Arg Asn Met Thr Leu Gly Thr545 550 555
User Contributions:
Comment about this patent or add new information about this topic:
