Inventors list |
Assignees list |
Classification tree browser |
Top 100 Inventors |
Top 100 Assignees |
Patent application title: Biosynethetic gene cluster for jerangolids
Inventors:
Christopher Reeves (Orinda, CA, US)
Ralph C. Reid (San Rafael, CA, US)
IPC8 Class: AC07K200FI
USPC Class:
530300
Class name: PEPTIDES OF 3 TO 100 AMINO ACID RESIDUES
Publication date: 08/28/2008
Patent application number: 20080207873
Sign up to receive free email alerts when patent applications with chosen keywords are published SIGN UP
Abstract:
Domains of jerangolid polyketide synthase and modification enzymes and
polynucleotides encoding them are provided. Methods to prepare jerangolid
in pharmaceutically useful quantities are described, as are methods to
prepare jerangolid analogs and other polyketides using the
polynucleotides encoding jerangolid synthase domains or modifying
enzymes.Claims:
1. A purified or recombinant nucleic acid comprising a nucleotide sequence
that encodes at least one polypeptide required for the biosynthesis of
jerangolid, wherein the complement of said nucleotide sequence hybridizes
to a sequence selected from the group consisting of nucleotides 1-67323
of SEQ ID NO:1, under conditions of hybridization at 65.degree. C. for 36
hours and washing 3 times at high stringency with 0.1.times.SSC and 0.5%
SDS for 20 minutes at 65.degree. C.
2. A purified or recombinant nucleic acid a nucleotide sequence that encodes at least one module of the jerangolid polyketide synthase, wherein the complement of said nucleotide sequence hybridizes to a sequence selected from the group consisting of nucleotides that encode modules of the jerangolid PKS as listed in Table 1.
3. A purified or recombinant nucleic acid according to claim 1, wherein said polypeptide comprises a β-ketoacylsynthase domain and wherein the complement of said nucleotide sequence hybridizes to a sequence selected from the group consisting of β-ketoacylsynthase domains as listed in Table 1, under conditions of hybridization at 65.degree. C. for 36 hours and washing 3 times at high stringency with 0.1.times.SSC and 0.5% SDS for 20 minutes at 65.degree. C.
4. A purified or recombinant nucleic acid according to claim 1, wherein said polypeptide comprises an acyltransferase domain and wherein the complement of said nucleotide sequence hybridizes to a sequence selected from the group consisting of acyltransferase domains as listed in Table 1, under conditions of hybridization at 65.degree. C. for 36 hours and washing 3 times at high stringency with 0.1.times.SSC and 0.5% SDS for 20 minutes at 65.degree. C.
5. A purified or recombinant nucleic acid according to claim 1, wherein said polypeptide comprises a β-ketoreductase domain and wherein the complement of said nucleotide sequence hybridizes to a sequence selected from the group consisting of β-ketoreductase domains as listed in Table 1, under conditions of hybridization at 65.degree. C. for 36 hours and washing 3 times at high stringency with 0.1.times.SSC and 0.5% SDS for 20 minutes at 65.degree. C.
6. A purified or recombinant nucleic acid according to claim 1, wherein said polypeptide comprises a dehydratase domain and wherein the complement of said nucleotide sequence hybridizes to a sequence selected from the group consisting of dehydratase domains as listed in Table 1, under conditions of hybridization at 65.degree. C. for 36 hours and washing 3 times at high stringency with 0.1.times.SSC and 0.5% SDS for 20 minutes at 65.degree. C.
7. A purified or recombinant nucleic acid according to claim 1, wherein said polypeptide comprises an enoylreductase domain and wherein the complement of said nucleotide sequence hybridizes to enoylreductase domains as listed in Table 1, under conditions of hybridization at 65.degree. C. for 36 hours and washing 3 times at high stringency with 0.1.times.SSC and 0.5% SDS for 20 minutes at 65.degree. C.
8. A purified or recombinant nucleic acid according to claim 1, wherein said polypeptide comprises an acyl carrier protein domain and wherein the complement of said nucleotide sequence hybridizes to a sequence selected from the group consisting of acyl carrier protein domains as listed in Table 1, under conditions of hybridization at 65.degree. C. for 36 hours and washing 3 times at high stringency with 0.1.times.SSC and 0.5% SDS for 20 minutes at 65.degree. C.
9. A purified or recombinant polypeptide involved in the biosynthesis of an jerangolid, wherein said polypeptide has an amino acid sequence that can be encoded by a nucleic acid sequence of claim 1.
10. The polypeptide of claim 9 that can be encoded by the gene jerA.
11. The polypeptide of claim 9 that can be encoded by the gene jerB.
12. The polypeptide of claim 9 that can be encoded by the gene jerC.
13. The polypeptide of claim 9 that can encoded by the gene jerD.
14. The polypeptide of claim 9 that can be encoded by the gene jerE.
15. The polypeptide of claim 9 that can be encoded by the gene jerF.
16. A method of making an jerangolid or jerangolid analog, said method comprising expressing at least one recombinant gene of claim 1 in a host cell capable of producing polyketides.
Description:
CROSS REFERENCE TO RELATED APPLICATIONS
[0001]This application is a divisional of U.S. patent application Ser. No. 11/109,593, filed 18 Apr. 2005, now U.S. Pat. No. 7,285,405, issued 23 Oct. 2007, which claims benefit under 35 U.S.C. §119 to U.S. provisional application Ser. No. 60/563,843, filed 19 Apr. 2005, the entire contents of each prior application being incorporated herein by reference.
[0002]Polyketides are complex natural products that are produced by microorganisms such as fungi and mycelial bacteria. There are about 10,000 known polyketides, from which numerous pharmaceutical products in many therapeutic areas have been derived, including: adriamycin, epothilone, erythromycin, mevacor, rapamycin, tacrolimus, tetracycline, rapamycin, and many others. However, polyketides are made in very small amounts in microorganisms and are difficult to make or modify chemically. For this and other reasons, biosynthetic methods are preferred for production of therapeutically active polyketides. See PCT publication Nos. WO 93/13663; WO 95/08548; WO 96/40968; WO 97/02358; and WO 98/27203; U.S. Pat. Nos. 4,874,748; 5,063,155; 5,098,837; 5,149,639; 5,672,491; 5,712,146 and 6,410,301; Fu et al., 1994, Biochemistry 33:9321-26; McDaniel et al., 1993, Science 262: 1546-1550; Kao et al., 1994, Science, 265:509-12, and Rohr, 1995, Angew. Chem. Int. Ed. Engl. 34: 881-88, each of which is incorporated herein by reference.
[0003]Biosynthesis of polyketides may be accomplished by heterologous expression of Type I or modular polyketide synthase enzymes (PKSs). Type I PKSs are large multifunctional protein complexes, the protein components of which are encoded by multiple open reading frames (ORF) of PKS gene clusters. Each ORF of a Type I PKS gene cluster can encode one, two, or more modules of ketosynthase activity. Each module activates and incorporates a two-carbon (ketide) unit into the polyketide backbone. Each module also contains multiple ketide-modifying enzymatic activities, or domains. In classical Type I PKSs, the number and order of modules, and the types of ketide-modifying domains within each module, determine the structure of the resulting product. Recently, variants of Type I PKSs have been found in which single modules may be used in an iterative fashion to add more than one two-carbon unit to the growing polyketide chain (see, for example, Muller 2004). Polyketide synthesis may also involve the activity of nonribosomal peptide synthetases (NRPSs) to catalyze incorporation of an amino acid-derived building block into the polyketide, as well as post-synthesis modification, or tailoring enzymes. The modification enzymes modify the polyketide by oxidation or reduction, addition of carbohydrate groups or methyl groups, or other modifications.
[0004]In PKS polypeptides, the regions that encode enzymatic activities (domains) are separated by linker regions. These regions collectively can be considered to define boundaries of the various domains. Generally, this organization permits PKS domains of different or identical substrate specificities to be substituted (usually at the level of encoding DNA) from other PKSs by various available methodologies. Using this method, new polyketide synthases (which produce novel polyketides) can be produced. It will be recognized from the foregoing that genetic manipulation of PKS genes and heterologous expression of PKSs can be used for the efficient production of known polyketides, and for production of novel polyketides structurally related to, but distinct from, known polyketides (see references above, and Hutchinson, 1998, Curr. Opin. Microbiol. 1:319-29; Carreras and Santi, 1998, Curr. Opin. Biotech. 9:403-11; and U.S. Pat. Nos. 5,712,146 and 5,672,491, each of which is incorporated herein by reference).
[0005]One valuable class of polyketides includes the jerangolids and their analogs (FIG. 1), produced by various strains of the myxobacterium Sorangium cellulosum. Jerangolid A (1) as produced by Sorangium cellulosum strain So ce 307 was described by Gerth et al. "The Jerangolids: A Family of New Antifungal Compounds from Sorangium cellulosum (Myxobacteria); Production, Pysico-chemical and Biological Properties of Jerangolid A," J. Antibiotics 49: 71-75 (1996), along with four closely related analogs, jerangolids B, C, D, and E.
[0006]The jerangolids are anti-fungal agents showing partial structural resemblance with the ambruticins.
[0007]Given the promise of jerangolids in the treatment of fungal infections, there exists an unmet need for a production system that can provide large quantities of these polyketides. The present invention meets this need by providing the biosynthetic genes responsible for the production of jerangolids and providing for their expression in heterologous hosts.
SUMMARY OF THE INVENTION
[0008]The present invention provides recombinant nucleic acids encoding polyketide synthases and polyketide modification enzymes. The recombinant nucleic acids of the invention are useful in the production of polyketides, including but not limited to jerangolids and jerangolid analogs and derivatives in recombinant host cells. The biosynthesis of the jerangolids is performed by a modular polyketide synthase (PKS) together with polyketide modification enzymes. The jerangolid PKS is made up of several proteins, each having one or more modules. The modules have domains with specific synthetic functions.
[0009]The present invention also provides domains and modules of the jerangolid PKS and corresponding nucleic acid sequences encoding them and/or parts thereof. Such compounds are useful in the production of hybrid PKS enzymes and the recombinant genes that encode them.
[0010]The present invention also provides modifying genes of the jerangolid biosynthetic gene cluster, including but not limited to isolated and recombinant forms and forms incorporated into a vector or the chromosomal DNA of a host cell.
[0011]The present invention also provides recombinant host cells that contain the nucleic acids of the invention. In one embodiment, the host cell provided by the invention is a Streptomyces host cell that produces a jerangolid modification enzyme and/or a domain, module, or protein of the jerangolid PKS. Methods for the genetic manipulation of Streptomyces are described in Kieser et al, "Practical Streptomyces Genetics," The John Innes Foundation, Norwich (2000), which is incorporated herein by reference in its entirety. In other embodiments, the host cells provided by the invention are eubacterial cells such as Escherichia coli, yeast cells such as Saccharomyces cerevisiae, or myxobacterial cells such as Myxococcus xanthus.
[0012]Accordingly, there is provided a recombinant PKS wherein at least 10, 15, 20, or more consecutive amino acids in one or more domains of one or more modules thereof are derived from one or more domains of one or more modules of the jerangolid polyketide synthase. Preferably at least an entire domain of a module of the jerangolid synthase is included. Representative jerangolid PKS domains useful in this aspect of the invention include, for example, KR, DH, ER, AT, ACP and KS domains. In one embodiment of the invention, the PKS is assembled from polypeptides encoded by DNA molecules that comprise coding sequences for PKS domains, wherein at least one encoded domain corresponds to a domain of jerangolid PKS. In such DNA molecules, the coding sequences are operably linked to control sequences so that expression therefrom in host cells is effective. In this manner, jerangolid PKS coding sequences or modules and/or domains can be made to encode PKS to biosynthesize compounds having antibiotic or other useful bioactivity other than jerangolid.
[0013]These and other aspects of the present invention are described in more detail in the Detailed Description of the Invention, below.
BRIEF DESCRIPTION OF THE DRAWINGS
[0014]FIG. 1 shows the chemical structure of Jerangolid A
[0015]FIG. 2 shows the organization of the jerangolid biosynthetic cluster as deduced from SEQ ID NO:1. FIG. 2A shows the organization of the portion of the gene cluster upstream of the polyketide synthase genes. FIG. 2B shows the organization of the portion of the gene cluster containing the polyketide synthase genes. FIG. 2C shows the organization of the portion of the gene cluster downstream of the polyketide synthase genes.
DETAILED DESCRIPTION OF THE INVENTION
[0016]The present invention provides recombinant materials for the production of polyketides. In an aspect, the invention provides recombinant nucleic acids encoding at least one domain of a polyketide synthase required for jerangolid biosynthesis. Methods and host cells for using these genes to produce a polyketide in recombinant host cells are also provided.
[0017]The nucleotide sequences encoding jerangolid PKS domains, modules and polypeptides of the present invention were isolated from Sorangium cellulosum So ce 307 as described in Example 1. Given the valuable properties of jerangolid and its derivatives and analogs, means to produce useful quantities of these molecules in a highly pure form is of great potential value. The compounds produced may be used as antitumor agents or for other therapeutic uses, and/or intermediates for further enzymatic or chemical modification. The nucleotide sequences of the jerangolid biosynthetic gene cluster encoding domains, modules and polypeptides of jerangolid synthase, and modifying enzymes, and other polypeptides can be used, for example, to make both known and novel polyketides.
[0018]In one aspect of the invention, purified and isolated DNA molecules are provided that comprise one or more coding sequences for one or more domains or modules of jerangolid synthase. Examples of such encoded domains include jerangolid synthase KR, DH, ER, AT, ACP, and KS domains. Domains will herein be referred to according to the module in which they are found as "domain(module)"; for example, the module 1 AT domain will be referred to as "AT(1)." In one aspect, the invention provides DNA molecules in which sequences encoding one or more polypeptides of jerangolid synthase are operably linked to expression control sequences that are effective in suitable host cells to produce jerangolid, its analogs or derivatives, or novel polyketides.
[0019]The sequence of the beginning of the jerangolid PKS gene cluster was assembled from sequences deduced from the cosmid 10K10B3 (FIG. 2) and is shown as SEQ ID NO:1. This partial PKS gene cluster is found to comprise five open reading frames (ORFs), named jerA, jerB, jerC, jerD, and jerE. The jerA gene encodes the loading module of the jerangolid PKS, also referred to herein as "module 0," and comprises KS and AT domains. The KS(0) domain is apparently inactive as a ketosynthase, having the active site cysteine residue replaced with a serine, and is thought to act as a decarboxylase to prime the PKS with a propionate group derived from methylmalonate. The AT(0) domain comprises the signature amino acid sequences (GHSQ and YASH) of a methylmalonyl-specific AT domain. The jerB gene encodes modules 1 and 2 of the jerangolid PKS, the jerC gene encodes modules 3 and 4, the jerD gene encodes module 5, and the jerE gene encodes modules 6 and 7 along with a chain terminating thioesterase (TE) domain. Table 1 provides a description of the genes, modules, and domains of the five jerangolid PKS proteins. A further gene, jerF, encodes an O-methyltransferase thought to be involved in addition of the methyl group to O-3 of jerangolide.
TABLE-US-00001 TABLE 1 Genes, modules, and domains of the five proteins of the jerangolid PKS determined from the nucleotide sequence given in SEQ ID NO: 1. Gene Module Domain boundaries JerA 15751-18978 module 0 15859-18831 KS(0) 15859-17133 AT(0) 17461-18513 ACP(0) 18577-18831 JerB 19013-30074 module 1 19134-23507 KS(1) 19134-20408 AT(1) 20715-21767 KR(1) 22398-23219 ACP(1) 23250-23507 module 2 23559-29816 KS(2) 23559-24836 AT(2) 25167-26234 DH(2) 26268-26819 ER(2) 27822-28697 KR(2) 28707-29522 ACP(2) 29559-29816 JerC 30071-41035 module 3 30170-35440 KS(3) 30170-31447 AT(3) 31772-32824 DH(3) 32858-33409 KR(3) 34322-35161 ACP(3) 35183-35440 module 4 35507-40789 KS(4) 35507-36784 AT(4) 37115-38182 DH(4) 38216-38776 KR(4) 39695-40519 ACP(4) 40532-40789 JerD 41032-46674 module 5 41131-46416 KS(5) 41131-42408 AT(5) 42733-43800 DH(5) 43834-44430 KR(5) 45307-46125 ACP(5) 46159-46416 JerE 46671-55280 module 6 46773-51383 KS(6) 46773-48050 AT(6) 48381-49448 KR(6) 50295-50960 ACP(6) 51126-51383 module 7 51462-54443 KS(7) 51462-52742 AT(7) 53052-54098 ACP(7) 54189-54443 TE 54444-55280
[0020]In one aspect, the invention provides an isolated or recombinant DNA molecule comprising a nucleotide sequence that encodes at least one domain, alternatively at least one module, alternatively at least one polypeptide, involved in the biosynthesis of an jerangolid.
[0021]In one aspect, the invention provides an isolated or recombinant DNA molecule comprising a sequence identical or substantially similar to SEQ ID NO:1 or its complement. Hereinafter, each reference to a nucleic acid sequence is also intended to refer to and include the complementary sequence, unless otherwise stated or apparent from context. In an embodiment the subsequence comprises a sequence encoding a complete jerangolid PKS domain, module or polypeptide.
[0022]In one aspect, the present invention provides an isolated or recombinant DNA molecule comprising a nucleotide sequence that encodes an open reading frame, module or domain having an amino acid sequence identical or substantially similar to an ORF, module or domain encoded by SEQ ID NO: 1. Generally, a polypeptide, module or domain having a sequence substantially similar to a reference sequence has substantially the same activity as the reference protein, module or domain (e.g., when integrated into an appropriate PKS framework using methods known in the art). In certain embodiments, one or more activities of a substantially similar polypeptide, module or domain are modified or inactivated as described below.
[0023]In one aspect, the invention provides an isolated or recombinant DNA molecule comprising a nucleotide sequence that encodes at least one polypeptide, module or domain encoded by SEQ ID NO:1, e.g., a polypeptide, module or domain involved in the biosynthesis of an jerangolid, wherein said nucleotide sequence comprises at least 10, 20, 25, 30, 35, 40, 45, or 50 contiguous base pairs identical to a sequence of SEQ ID NO: 1. In one aspect, the invention provides an isolated or recombinant DNA molecule comprising a nucleotide sequence that encodes at least one polypeptide, module or domain encoded by SEQ ID NO:1, e.g., a polypeptide, module or domain involved in the biosynthesis of a jerangolid, wherein said polypeptide, module or domain comprises at least 10, 15, 20, 30, or 40 contiguous residues of a corresponding polypeptide, module or domain comprising a sequence of SEQ ID NO: 1.
[0024]It will be understood that SEQ ID NO: 1 was determined using the inserts of cosmids 307K-3F11, 307K-5G2, and 307K-2C8. Accordingly, the invention provides an isolated or recombinant DNA molecule comprising a sequence identical or substantially similar to an ORF encoding sequence of the insert of cosmids 307K-3F11, 307K-5G2, or 307K-2C8.
[0025]Those of skill will recognize that, due to the degeneracy of the genetic code, a large number of DNA sequences encode the amino acid sequences of the domains, modules, and proteins of the jerangolid PKS, the enzymes involved in jerangolid modification and other polypeptides encoded by the genes of the jerangolid biosynthetic gene cluster. The present invention contemplates all such DNAs. For example, it may be advantageous to optimize sequence to account for the codon preference of a host organism. The invention also contemplates naturally occurring genes encoding the jerangolid PKS that are polymorphic or other variants.
[0026]As used herein, the terms "substantial identity," "substantial sequence identity," or "substantial similarity" in the context of nucleic acids, refers to a measure of sequence similarity between two polynucleotides. Substantial sequence identity can be determined by hybridization under stringent conditions, by direct comparison, or other means. For example, two polynucleotides can be identified as having substantial sequence identity if they are capable of specifically hybridizing to each other under stringent hybridization conditions. Other degrees of sequence identity (e.g., less than "substantial") can be characterized by hybridization under different conditions of stringency. "Stringent hybridization conditions" refers to conditions in a range from about 5° C. to about 20° C. or 25° C. below the melting temperature (Tm) of the target sequence and a probe with exact or nearly exact complementarity to the target. As used herein, the melting temperature is the temperature at which a population of double-stranded nucleic acid molecules becomes half-dissociated into single strands. Methods for calculating the Tm of nucleic acids are well known in the art (see, e.g., Berger and Kimmel, 1987, Methods In Enzymology, Vol. 152: Guide To Molecular Cloning Techniques, San Diego: Academic Press, Inc. and Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, 2nd Ed., Vols. 1-3, Cold Spring Harbor Laboratory). Typically, stringent hybridization conditions for probes greater than 50 nucleotides are salt concentrations less than about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion at pH 7.0 to 8.3, and temperatures at least about 50° C., preferably at least about 60° C. As noted, stringent conditions may also be achieved with the addition of destabilizing agents such as formamide, in which case lower temperatures may be employed. Exemplary conditions include hybridization at 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4 pH 7.0, 1 mM EDTA at 65° C.; wash with 2×SSC, 1% SDS, at 50° C.
[0027]Alternatively, substantial sequence identity can be described as a percentage identity between two nucleotide or amino acid sequences. Two nucleic acid sequences are considered substantially identical when they are at least about 70% identical, or at least about 80% identical, or at least about 90% identical, or at least about 95% or 98% identical. Two amino acid sequences are considered substantially identical when they are at least about 60%, sequence identical, more often at least about 70%, at least about 80%, or at least about 90% sequence identity to the reference sequence. Percentage sequence (nucleotide or amino acid) identity is typically calculated using art known means to determine the optimal alignment between two sequences and comparing the two sequences. Optimal alignment of sequences may be conducted using the local homology algorithm of Smith and Waterman (1981) Adv. Appl. Math. 2: 482, by the homology alignment algorithm of Needleman and Wunsch (1970) J. Mol. Biol. 48: 443, by the search for similarity method of Pearson and Lipman (1988) Proc. Natl. Acad. Sci. U.S.A. 85: 2444, by the BLAST algorithm of Altschul (1990) J. Mol. Biol. 215: 403-410; and Shpaer (1996) Genomics 38:179-191, or by the Needleham et al. (1970) J. Mol. Biol. 48: 443-453; and Sankoff et al., 1983, Time Warps, String Edits, and Macromolecules, The Theory and Practice of Sequence Comparison, Chapter One, Addison-Wesley, Reading, Mass.; generally by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, Wis.; BLAST from the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). In each case default parameters are used (for example the BLAST program uses as defaults a wordlength (W) of 11, the BLOSUM62 scoring matrix (see Henikoff (1992) Proc. Natl. Acad. Sci. USA 89: 10915-10919) alignments (B) of 50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands).
[0028]The invention methods may be directed to the preparation of an individual polyketide. The polyketide may or may not be novel, but the method of preparation permits a more convenient or alternative method of preparing it. The resulting polyketides may be further modified to convert them to other useful compounds. Examples of chemical structures of that can be made using the materials and methods of the present invention include known analogs, such as those described in Kalesse & Christmann, 2002, "The Chemistry and Biology of the Jerangolid Family" Synthesis (8):981-1003 and the refereneces cited therein, and novel molecules produced by modified or chimeric PKSs comprising a portion of the jerangolid PKS sequence, molecules produced by the action of polyketide modifying enzymes from the jerangolid PKS cluster on products of other PKSs, molecules produced by the action on products of the jerangolid PKS of polyketide modifying enzymes from other PKSs, and the like. As noted, in one aspect the invention provides recombinant PKS wherein at least 10, 15, 20, or more consecutive amino acids in one or more domains of one or more modules thereof are derived from one or more domains of one or more modules of the jerangolid polyketide synthase. A polyketide synthase "derived from" a naturally occurring PKS contains the scaffolding encoded by all the portion employed of the naturally occurring synthase gene, contains at least two modules that are functional, and contains mutations, deletions, or replacements of one or more of the activities of these functional modules so that the nature of the resulting polyketide is altered. This definition applies both at the protein and genetic levels. Particular embodiments include those wherein a KS, AT, KR, DH, or ER has been deleted or replaced by a version of the activity from a different PKS or from another location within the same PKS, and derivatives where at least one noncondensation cycle enzymatic activity (KR, DH, or ER) has been deleted or wherein any of these activities has been added or mutated so as to change the ultimate polyketide synthesized. There are at least five degrees of freedom for constructing a polyketide synthase in terms of the polyketide that will be produced. See, U.S. Pat. No. 6,509,455 for a discussion.
[0029]As can be appreciated by those skilled in the art, polyketide biosynthesis can be manipulated to make a product other than the product of a naturally occurring PKS biosynthetic cluster. For example, AT domains can be altered or replaced to change specificity. The variable domains within a module can be deleted and or inactivated or replaced with other variable domains found in other modules of the same PKS or from another PKS. See e.g., Katz & McDaniel, Med Res Rev 19: 543-558 (1999) and WO 98/49315. Similarly, entire modules can be deleted and/or replaced with other modules from the same PKS or another PKS. See e.g., Gokhale et al., Science 284: 482 (1999) and WO 00/47724 each of which are incorporated herein by reference. Protein subunits of different PKSs also can be mixed and matched to make compounds having the desired backbone and modifications. For example, subunits of 1 and 2 (encoding modules 1-4) of the pikromycin PKS were combined with the DEBS3 subunit to make a hybrid PKS product (see Tang et al., Science, 287: 640 (2001), WO 00/26349 and WO 99/6159). Mutations can be introduced into PKS genes such that polypeptides with altered activity are encoded. Polypeptides with "altered activity" include those in which one or more domains are inactivated or deleted, or in which a mutation changes the substrate specificity of a domain, as well as other alterations in activity. Mutations can be made to the native sequences using conventional techniques. The substrates for mutation can be an entire cluster of genes or only one or two of them; the substrate for mutation may also be portions of one or more of these genes. Techniques for mutation include preparing synthetic oligonucleotides including the mutations and inserting the mutated sequence into the gene encoding a PKS subunit using restriction endonuclease digestion. (See, e.g., Kunkel, T. A. Proc Natl Acad Sci USA (1985) 82:448; Geisselsoder et al. BioTechniques (1987) 5:786.) Alternatively, the mutations can be effected using a mismatched primer (generally 10-20 nucleotides in length) that hybridizes to the native nucleotide sequence (generally cDNA corresponding to the RNA sequence), at a temperature below the melting temperature of the mismatched duplex. The primer can be made specific by keeping primer length and base composition within relatively narrow limits and by keeping the mutant base centrally located. (See Zoller and Smith, Methods in Enzymology (1983) 100:468). Primer extension is effected using DNA polymerase. The product of the extension reaction is cloned, and those clones containing the mutated DNA are selected. Selection can be accomplished using the mutant primer as a hybridization probe. The technique is also applicable for generating multiple point mutations. (See, e.g., Dalbie-McFarland et al. Proc Natl Acad Sci USA (1982) 79:6409). PCR mutagenesis can also be used for effecting the desired mutations. Random mutagenesis of selected portions of the nucleotide sequences encoding enzymatic activities can be accomplished by several different techniques known in the art, e.g., by inserting an oligonucleotide linker randomly into a plasmid.
[0030]In addition to providing mutated forms of regions encoding enzymatic activity, regions encoding corresponding activities from different PKS synthases or from different locations in the same PKS synthase can be recovered, for example, using PCR techniques with appropriate primers. By "corresponding" activity encoding regions is meant those regions encoding the same general type of activity--e.g., a ketoreductase activity in one location of a gene cluster would "correspond" to a ketoreductase-encoding activity in another location in the gene cluster or in a different gene cluster; similarly, a complete reductase cycle could be considered corresponding--e.g., KR/DH/ER could correspond to KR alone.
[0031]If replacement of a particular target region in a host polyketide synthase is to be made, this replacement can be conducted in vitro using suitable restriction enzymes or can be effected in vivo using recombinant techniques involving homologous sequences framing the replacement gene. One such system involving plasmids of differing temperature sensitivities is described in PCT application WO 96/40968. Another useful method for modifying a PKS gene (e.g., making domain substitutions or "swaps") is a RED/ET cloning procedure developed for constructing domain swaps or modifications in an expression plasmid without first introducing restriction sites. The method is related to ET cloning methods (see, Datansko & Wanner, 2000, Proc. Natl. Acad. Sci. U.S.A. 97, 6640-45; Muyrers et al, 2000, Genetic Engineering 22:77-98). The RED/ET cloning procedure is used to introduce a unique restriction site in the recipient plasmid at the location of the targeted domain. This restriction site is used to subsequently linearize the recipient plasmid in a subsequent ET cloning step to introduce the modification. This linearization step is necessary in the absence of a selectable marker, which cannot be used for domain substitutions. An advantage of using this method for PKS engineering is that restriction sites do not have to be introduced in the recipient plasmid in order to construct the swap, which makes it faster and more powerful because boundary junctions can be altered more easily.
[0032]In a further aspect, the invention provides methods for expressing chimeric or hybrid PKSs and products of such PKSs. For example, the invention provides (1) encoding DNA for a chimeric PKS that is substantially patterned on a non-jerangolid producing enzyme, but which includes one or more functional domains, modules or polypeptides of jerangolid PKS; and (2) encoding DNA for a chimeric PKS that is substantially patterned on the jerangolid PKS, but which includes one or more functional domains, modules, or polypeptides of another PKS or NRPS.
[0033]With respect to item (1) above, in one embodiment, the invention provides chimeric PKS enzymes in which the genes for a non-jerangolid PKS function as accepting genes, and one or more of the above-identified coding sequences for jerangolid domains or modules are inserted as replacements for one or more domains or modules of comparable function. Construction of chimeric molecules is most effectively achieved by construction of appropriate encoding polynucleotides. In making a chimeric molecule, it is not necessary to replace an entire domain or module accepting of the PKS with an entire domain or module of jerangolid PKS: subsequences of a PKS domain or module that correspond to a peptide subsequence in an accepting domain or module, or which otherwise provide useful function, may be used as replacements. Accordingly, appropriate encoding DNAs for construction of such chimeric PKS include those that encode at least 10, 15, 20 or more amino acids of a selected jerangolid domain or module.
[0034]Recombinant methods for manipulating modular PKS genes to make chimeric PKS enzymes are described in U.S. Pat. Nos. 5,672,491; 5,843,718; 5,830,750; and 5,712,146; and in PCT publication Nos. 98/49315 and 97/02358. A number of genetic engineering strategies have been used with DEBS to demonstrate that the structures of polyketides can be manipulated to produce novel natural products, primarily analogs of the erythromycins (see the patent publications referenced supra and Hutchinson, 1998, Curr Opin Microbiol. 1:319-329, and Baltz, 1998, Trends Microbiol. 6:76-83). In one embodiment, the components of the chimeric PKS are arranged onto polypeptides having interpolypeptide linkers that direct the assembly of the polypeptides into the functional PKS protein, such that it is not required that the PKS have the same arrangement of modules in the polypeptides as observed in natural PKSs. Suitable interpolypeptide linkers to join polypeptides and intrapolypeptide linkers to join modules within a polypeptide are described in PCT publication WO 00/47724.
[0035]A partial list of sources of PKS sequences for use in making chimeric molecules, for illustration and not limitation, includes Avermectin (U.S. Pat. No. 5,252,474; MacNeil et al., 1993, Industrial Microorganisms: Basic and Applied Molecular Genetics, Baltz, Hegeman, & Skatrud, eds. (ASM), pp. 245-256; MacNeil et al., 1992, Gene 115: 119-25); Candicidin (FRO008) (Hu et al., 1994, Mol. Microbiol. 14: 163-72); Epothilone (U.S. Pat. No. 6,303,342); Erythromycin (WO 93/13663; U.S. Pat. No. 5,824,513; Donadio et al., 1991, Science 252:675-79; Cortes et al., 1990, Nature 348:176-8); FK-506 (Motamedi et al., 1998, Eur. J. Biochem. 256:528-34; Motamedi et al., 1997, Eur. J. Biochem. 244:74-80); FK-520 (U.S. Pat. No. 6,503,737; see also Nielsen et al., 1991, Biochem. 30:5789-96); Lovastatin (U.S. Pat. No. 5,744,350); Nemadectin (MacNeil et al., 1993, supra); Niddamycin (Kakavas et al., 1997, J. Bacteriol. 179:7515-22); Oleandomycin (Swan et al., 1994, Mol. Gen. Genet. 242:358-62; U.S. Pat. No. 6,388,099; Olano et al., 1998, Mol. Gen. Genet. 259:299-308); Platenolide (EP Pat. App. 791,656); Rapamycin (Schwecke et al., 1995, Proc. Natl. Acad. Sci. USA 92:7839-43); Aparicio et al., 1996, Gene 169:9-16); Rifamycin (August et al., 1998, Chemistry & Biology, 5: 69-79); Soraphen (U.S. Pat. No. 5,716,849; Schupp et al., 1995, J. Bacteriology 177: 3673-79); Spiramycin (U.S. Pat. No. 5,098,837); Tylosin (EP 0 791,655; Kuhstoss et al., 1996, Gene 183:231-36; U.S. Pat. No. 5,876,991). Additional suitable PKS coding sequences remain to be discovered and characterized, but will be available to those of skill (e.g., by reference to GenBank).
[0036]The jerangolid PKS-encoding polynucleotides of the invention may also be used in the production of libraries of PKSs (i.e., modified and chimeric PKSs comprising at least a portion of the jerangolid PKS sequence. The invention provides libraries of polyketides by generating modifications in, or using a portion of, the jerangolid PKS so that the protein complexes produced by the cluster have altered activities in one or more respects, and thus produce polyketides other than the natural jerangolid product of the PKS. Novel polyketides may thus be prepared, or polyketides in general prepared more readily, using this method. By providing a large number of different genes or gene clusters derived from a naturally occurring PKS gene cluster, each of which has been modified in a different way from the native PKS cluster, an effectively combinatorial library of polyketides can be produced as a result of the multiple variations in these activities. Expression vectors containing nucleotide sequences encoding a variety of PKS systems for the production of different polyketides can be transformed into the appropriate host cells to construct a polyketide library. In one approach, a mixture of such vectors is transformed into the selected host cells and the resulting cells plated into individual colonies and selected for successful transformants. Each individual colony has the ability to produce a particular PKS synthase and ultimately a particular polyketide. A variety of strategies can be devised to obtain a multiplicity of colonies each containing a PKS gene cluster derived from the naturally occurring host gene cluster so that each colony in the library produces a different PKS and ultimately a different polyketide. The number of different polyketides that are produced by the library is typically at least four, more typically at least ten, and preferably at least 20, more preferably at least 50, reflecting similar numbers of different altered PKS gene clusters and PKS gene products. The number of members in the library is arbitrarily chosen; however, the degrees of freedom outlined above with respect to the variation of starter, extender units, stereochemistry, oxidation state, and chain length is quite large. The polyketide producing colonies can be identified and isolated using known techniques and the produced polyketides further characterized. The polyketides produced by these colonies can be used collectively in a panel to represent a library or may be assessed individually for activity.
[0037]Colonies in the library are induced to produce the relevant synthases and thus to produce the relevant polyketides to obtain a library of candidate polyketides. The polyketides secreted into the media can be screened for binding to desired targets, such as receptors, signaling proteins, and the like. The supernatants per se can be used for screening, or partial or complete purification of the polyketides can first be effected. Typically, such screening methods involve detecting the binding of each member of the library to receptor or other target ligand. Binding can be detected either directly or through a competition assay. Means to screen such libraries for binding are well known in the art. Alternatively, individual polyketide members of the library can be tested against a desired target. In this event, screens wherein the biological response of the target is measured can be included.
[0038]As noted above, the DNA compounds of the invention can be expressed in host cells for production of proteins and of known and novel compounds. Preferred hosts include fungal systems such as yeast and procaryotic hosts, but single cell cultures of, for example, mammalian cells could also be used. A variety of methods for heterologous expression of PKS genes and host cells suitable for expression of these genes and production of polyketides are described, for example, in U.S. Pat. Nos. 5,843,718 and 5,830,750; WO 01/31035, WO 01/27306, and WO 02/068613; and U.S. patent application Ser. Nos. 10/087,451 (published as US2002000087451); 60/355,211; and 60/396,513 (corresponding to published application 20020045220).
[0039]Appropriate host cells for the expression of the hybrid PKS genes include those organisms capable of producing the needed precursors, such as malonyl-CoA, methylmalonyl-CoA, ethylmalonyl-CoA, and methoxymalonyl-ACP, and having phosphopantotheinylation systems capable of activating the ACP domains of modular PKSs. See, for example, U.S. Pat. No. 6,579,695. However, as disclosed in U.S. Pat. No. 6,033,883, a wide variety of hosts can be used, even though some hosts natively do not contain the appropriate post-translational mechanisms to activate the acyl carrier proteins of the synthases. Also see WO 97/13845 and WO 98/27203. The host cell may natively produce none, some, or all of the required polyketide precursors, and may be genetically engineered so as to produce the required polyketide precursors. Such hosts can be modified with the appropriate recombinant enzymes to effect these modifications. Suitable host cells include Streptomyces, E. coli, yeast, and other procaryotic hosts which use control sequences compatible with Streptomyces spp. Examples of suitable hosts that either natively produce modular polyketides or have been engineered so as to produce modular polyketides include but are not limited to actinomyctes such as Streptomyces coelicolor, Streptomyces venezuelae, Streptomyces fradiae, Streptomyces ambofaciens, and Saccharopolyspora erythraea, eubacteria such as Escherichia coli, myxobacteria such as Myxococcus xanthus, and yeasts such as Saccharomyces cerevisiae.
In one embodiment, any native modular PKS genes in the host cell have been deleted to produce a "clean host," as described in U.S. Pat. No. 5,672,491, incorporated herein by reference.
[0040]In some embodiments, the host cell expresses, or is engineered to express, a polyketide "tailoring" or "modifying" enzyme. Once a PKS product is released, it is subject to post-PKS tailoring reactions. These reactions are important for biological activity and for the diversity seen among polyketides. Tailoring enzymes normally associated with polyketide biosynthesis include oxygenases, glycosyl- and methyl-transferases, acyltransferases, halogenases, cyclases, aminotransferases, and hydroxylases. In addition to biosynthetic accessory activities, secondary metabolite clusters often code for activities such as transport.
[0041]Tailoring enzymes for modification of a product of the jerangolid PKS, a non-jerangolid PKS, or a chimeric PKS, can be those normally associated with jerangolid biosynthesis or "heterologous" tailoring enzymes. Tailoring enzymes can be expressed in the organism in which they are naturally produced, or as recombinant proteins in heterologous hosts. In some cases, the structure produced by the heterologous or hybrid PKS may be modified with different efficiencies by post-PKS tailoring enzymes from different sources. In such cases, post-PKS tailoring enzymes can be recruited from other pathways to obtain the desired compound. For example, the tailoring enzymes of the jerangolid PKS gene cluster can be expressed heterologously to modify polyketides produced by non-jerangolid synthases or can be inactivated in the Jerangolid producer. Alternatively, the unmodified polyketide compounds can be produced in the recombinant host cell, and the desired modification (e.g., oxidation) steps carried out in vitro (e.g., using purified enzymes, isolated from native sources or recombinantly produced) or in vivo in a converting cell different from the host cell (e.g., by supplying the converting cell with the unmodified polyketide).
[0042]It will be apparent to one of skill in the art that a variety of recombinant vectors can be utilized in the practice of aspects of the invention. As used herein, "vector" refers to polynucleotide elements that are used to introduce recombinant nucleic acid into cells for either expression or replication. Selection and use of such vehicles is routine in the art. An "expression vector" includes vectors capable of expressing DNAs that are operatively linked with regulatory sequences, such as promoter regions. Thus, an expression vector refers to a recombinant DNA or RNA construct, such as a plasmid, a phage, recombinant virus or other vector that, upon introduction into an appropriate host cell, results in expression of the cloned DNA. Appropriate expression vectors are well known to those of skill in the art and include those that are replicable in eukaryotic cells and/or prokaryotic cells and those that remain episomal or those that integrate into the host cell genome.
[0043]The vectors used to perform the various operations to replace the enzymatic activity in the host PKS genes or to support mutations in these regions of the host PKS genes may be chosen to contain control sequences operably linked to the resulting coding sequences in a manner that expression of the coding sequences may be effected in an appropriate host. Suitable control sequences include those that function in eucaryotic and procaryotic host cells. If the cloning vectors employed to obtain PKS genes encoding derived PKS lack control sequences for expression operably linked to the encoding nucleotide sequences, the nucleotide sequences are inserted into appropriate expression vectors. This can be done individually, or using a pool of isolated encoding nucleotide sequences, which can be inserted into host vectors, the resulting vectors transformed or transfected into host cells, and the resulting cells plated out into individual colonies.
[0044]Suitable control sequences for single cell cultures of various types of organisms are well known in the art. Control systems for expression in yeast are widely available and are routinely used. Control elements include promoters, optionally containing operator sequences, and other elements depending on the nature of the host, such as ribosome binding sites. Particularly useful promoters for procaryotic hosts include those from PKS gene clusters that result in the production of polyketides as secondary metabolites, including those from Type I or aromatic (Type II) PKS gene clusters. Examples are act promoters, tcm promoters, spiramycin promoters, and the like. However, other bacterial promoters, such as those derived from sugar metabolizing enzymes, such as galactose, lactose (lac) and maltose, are also useful. Additional examples include promoters derived from biosynthetic enzymes such as for tryptophan (trp), the β-lactamase (bla), bacteriophage lambda PL, and T5. In addition, synthetic promoters, such as the tac promoter (U.S. Pat. No. 4,551,433), can be used.
[0045]As noted, particularly useful control sequences are those which themselves, or with suitable regulatory systems, activate expression during transition from growth to stationary phase in the vegetative mycelium. The system contained in the plasmid identified as pCK7, i.e., the actI/actIII promoter pair and the actII-ORF4 (an activator gene), is particularly preferred. Particularly preferred hosts are those that lack their own means for producing polyketides so that a cleaner result is obtained. Illustrative control sequences, vectors, and host cells of these types include the modified S. coelicolor CH999 and vectors described in PCT publication WO 96/40968 and similar strains of S. lividans. See U.S. Pat. Nos. 5,672,491; 5,830,750, 5,843,718; and 6,177,262, each of which is incorporated herein by reference.
[0046]Other regulatory sequences may also be desirable which allow for regulation of expression of the PKS sequences relative to the growth of the host cell. Regulatory sequences are known to those of skill in the art, and examples include those which cause the expression of a gene to be turned on or off in response to a chemical or physical stimulus, including the presence of a regulatory compound. Other types of regulatory elements may also be present in the vector, for example, enhancer sequences. Selectable markers can also be included in the recombinant expression vectors. A variety of markers are known which are useful in selecting for transformed cell lines and generally comprise a gene whose expression confers a selectable phenotype on transformed cells when the cells are grown in an appropriate selective medium. Such markers include, for example, genes that confer antibiotic resistance or sensitivity to the plasmid. Alternatively, several polyketides are naturally colored, and this characteristic provides a built-in marker for screening cells successfully transformed by the present constructs.
[0047]The various PKS nucleotide sequences, or a mixture of such sequences, can be cloned into one or more recombinant vectors as individual cassettes, with separate control elements or under the control of a single promoter. The PKS subunits or components can include flanking restriction sites to allow for the easy deletion and insertion of other PKS subunits so that hybrid or chimeric PKSs can be generated. The design of such restriction sites is known to those of skill in the art and can be accomplished using the techniques described above, such as site-directed mutagenesis and PCR. Methods for introducing the recombinant vectors of the present invention into suitable hosts are known to those of skill in the art and typically include the use of CaCl2 or other agents, such as divalent cations, lipofection, DMSO, protoplast transformation, conjugation, and electroporation.
[0048]Thus, the present invention provides recombinant DNA molecules and vectors comprising those recombinant DNA molecules that encode at least a portion of the jerangolid PKS and that, when transformed into a host cell and the host cell is cultured under conditions that lead to the expression of said jerangolid PKS enzymes, results in the production of polyketides including but not limited to jerangolid and/or analogs or derivatives thereof in useful quantities. The present invention also provides recombinant host cells comprising those recombinant vectors.
[0049]Suitable culture conditions for production of polyketides using the cells of the invention will vary according to the host cell and the nature of the polyketide being produced, but will be know to those of skill in the art. See, for example, the examples below and WO 98/27203 "Production of Polyketides in Bacteria and Yeast" and WO 01/83803 "Overproduction Hosts For Biosynthesis of Polyketides."
[0050]The polyketide product produced by host cells of the invention can be recovered (i.e., separated from the producing cells and at least partially purified) using routine techniques (e.g., extraction from broth followed by chromatography).
[0051]The compositions, cells and methods of the invention may be directed to the preparation of an individual polyketide or a number of polyketides. The polyketide may or may not be novel, but the method of preparation permits a more convenient or alternative method of preparing it.
[0052]The following Examples are intended to illustrate, but not limit, the scope of the invention.
EXAMPLE 1
Isolation of Jerangolid PKS Cosmids
[0053]Genomic DNA was isolated from Sorangium cellulosum Soce307, the producer of jerangolid using an established protocol (Jaoua, S., Neff, S., and Schupp, T. "Transfer of mobilizable plasmids to Sorangium cellulosum and evidence for their integration into the chromosome," 1992 Plasmid 28:157-165). The DNA was partially digested with Sau3AI using a serial dilution method and libraries were constructed in SuperKOS (a smaller derivative of SuperCos-1) using the protocol for SuperCos-1 from Stratagene. Colonies were picked, cosmid DNA was isolated on the Qiagen robot, and the DNA was submitted for end sequencing. The data was analyzed by BLAST and all PKS positive cosmids were prepared in larger amounts for further analysis.
[0054]End sequencing of cosmid and fosmid libraries of the Soce307 genome gave 13 cosmids with PKS sequence on at least one end. Five of these cosmid/fosmid end sequences were highly similar (>92% identity at the nucleotide level) to sequence from the ambruticin PKS, disclosed in co-pending U.S. application Ser. No. 60/551,103, filed 2 Mar. 2004 and incorporated herein by reference in its entirety, indicating they probably contain the jerangolid cluster.
[0055]All publications and patent documents cited herein are incorporated herein by reference as if each such publication or document was specifically and individually indicated to be incorporated herein by reference.
[0056]Although the present invention has been described in detail with reference to specific embodiments, those of skill in the art will recognize that modifications and improvements are within the scope and spirit of the invention. Citation of publications and patent documents is not intended as an admission that any such document is pertinent prior art, nor does it constitute any admission as to the contents or date of the same. The invention having now been described by way of written description, those of skill in the art will recognize that the invention can be practiced in a variety of embodiments and that the foregoing description are for purposes of illustration and not limitation of the following claims.
Sequence CWU
1
43167323DNASorangium cellulosum 1gatcgtcctg ggcgacacgc tggagcaggt
ggcgacgcgg ctgctcgagg aggacctcgc 60ggcgtgccac acgaccggcg aggcggcgga
cgtgctgctg aacggggtgc tcgcgtcgag 120cgcccgcgcc gtggccgcgg cgctgcgcgc
gtgcgacgag ttcgccgcgg gcgacagcga 180tctgccgtcg ctggcccggg cgtgccgcgc
gttcgcgggg ctcgcgtcgt tcgggtcgtc 240gcggtcgctg tcgtcgctcg gcgacggggt
gatcgcgccg atgctggaga agacgttcgc 300gcgcgcggtc ctgcgcgtcc acgggggctg
cacgggcagc gacgaggcgg tcgccgccgc 360caaggaggcg ctgcgcacgc tgcacgacgt
ggcgctgtcg cagccgatcg tcgaccgcgg 420ggcgtggctc gacgcggcgc gggggctcgt
ggacagcgag gtggtgaacc cgacggcgtc 480cggcctcgcg tgcgggctgc tctacctggc
gcaggcgatc gacgacgccg aggtggcgcg 540ggtcgtcggc ctgcggctcg ggggcgcggc
cgagcccgag gcggcggcgt cgttcctggc 600cgggttcctc gaggtgaacg cgctggtgct
ggtgaagagc cggcccgtgg tcgaggcgct 660ggacgcgttc ctccgggcga tcgcgccgga
gcgcttcaag gacacgctgc cggtccttcg 720gcgcgcgttc gctgggctcg gcgcgacgga
gcggcggtac ctgctcgaga acgtgctcgc 780ggcgcggaag ctgggggaca aggcgcgcgc
ggcgcaggcg gtgctcctgg agaaggaccg 840ggagaagctg aaggagatga gcgaggacct
ctcacaggcg atggacgacc tggacgagtt 900gctctgacac gacccgtgag acgacccgtg
agacgacgac ctggacgacc tggacgacct 960gaaccaggtt gcggacgacc tggacgaccc
ggacgacccg aaccaggttg agttctaggt 1020tgttctaggt tcacggtaca gaacggttcc
cgccggtcaa gcatagcccc ccgacaacac 1080ggcttctgtt ccgcttcttg cacgctcacc
acggcatgac gcatgcgtac ccactctcgt 1140gatgccgcat ggaggatcta ccagttgctc
ggcctcctga aggcacacgc tcgcctgaga 1200gcacgtccat gtcgtcgccc acccctcgat
ccttctggcc atcccacgat cccgctcccg 1260caggcggctc tacaccacaa gcaccacgtc
attcacgctg accccgtgaa acattcgcat 1320ccaccatgtc cccaccggga atacggcgct
gcgtacaccg gcacgccagc gttcgagcgc 1380cgcgcggtac gtcgcgcgga acgcccgcac
ggacgcggca gcggtttgcc acgctcctcc 1440ctgctcgcgg cccaccgcaa aggtaggatt
gcggccaagc agcgcctcga agctcgtggc 1500acgctggtaa ggcgagacgt tgctggctcg
ctcggcacca aagaagcgga gtccttgccg 1560ctccacctcg gcatgggcca gcgcctcctg
tcgttcgagc tcggccaaga tctcgcggcg 1620gaatgcctcc gcggcgtcct gctcgatgat
aggcggtagt gcgagcggca gcgttgcttc 1680ctggggccac tgtggatttt ccgggtcgag
gtatgccgag ggacgagccg cgcgcagcat 1740ggcgtgaccg atctcaccca catctacctt
gacaccgggc cattctctcg aattacggac 1800gagaccggca gcaacagggt ttgcaaggac
ataggcaatc ttttccacaa ccgccgcgcg 1860agtcagcagc cgtaccacgc tcgtggcttc
atggtcccat accgggccct cccacttgcg 1920cagcaccttc gtgccgagcg cgacgattcg
atggaagaac tgcaagaagc gcggcaggac 1980gccccgcaca tcggtcacga cgaggtgcag
atgcgtcgac atcgcgcaaa gggcatgaac 2040ctcgacgccg tagcggtggg ccgcgacagc
gagggcatag accaagaact ggttcatggc 2100cgcatcgggg cgaaacagga aatgtcggcg
cagaacgcga cgggtgatca ggtacgtggc 2160gccgggagcg atctcgcgag gctggctcat
ggccggatca catagacaaa cgctgcgcca 2220tcaagcgatg tgcatcattt caatgactta
ccccgccgcg gggggtggcg gccgccgatt 2280tccgccaatt tccgccacgc tgggccgccg
cttgtcacgc gaccagctgg tgtcggaccg 2340cctcggatta cccccgccgg tagacctccg
ccgatagacc tcaacccggg tcgggtcggc 2400cccccagagt tgggtcgggt cgggtcgaca
acctgggtcg gccaacctgg gtcgggttca 2460ccaaccggcc aacctgggtc aggttgcggc
accaaccggc caacctgggt cgggtcgcgg 2520cggggttgcg gcaggctgca ccccaaccgg
ccaacctggg tcgggtcgcg ccgggctcac 2580tcctcctggc cggcccgcga ccagccgccg
agcttgacgc ccggcgtctg cgccaggaag 2640gcctgcaccg ccggcggcgg cgacgcccac
gccgcgaggc gccccgggct ccgcgggatc 2700ggcgtcgtcg ccgagcgcga gtaggagctc
cacgcgccgt cgctgcaggc atagcaggtc 2760gagcgcgcga gctgcgcggt gagccggagc
gacccccgcg cgtcgtaggc cggcgcgatc 2820ttcacgagct gcggggggtc cttctctcgc
cgcgcggcct ccggatcctc ctcgatgtcg 2880tcgagcgtcg cctcggccgc gcgctgcagc
gacggcacgc cggacacctc ggcgagcagg 2940tcgaccgccg cgccgcgctc ggcgtcccac
acagtgaacg cggccccctc ggagccgtgc 3000gcgccgcaga gatcgctgta cgtgcgctcc
tcgatgaaca ggtacggccc cacgctgccg 3060atcagcgcgg cgctgtgctg gaacacgttg
ctctcgtcct gctccggcac cgtgatcacc 3120gcctggcgct cgccgtcccc gcgcagcacg
agcgccgcgt cggtcgccgt gccctcgccc 3180ggctcgcgcg tgtccccgtc ccagagctcg
cacggcgtcg tctcgacctc cttcgcgcgc 3240gcctcccagc gccactcccc gcgccgggtc
gccacgacga tcccgggctc ctcgcggatc 3300accgcgccgc cctcggcgac gtgccacgtc
ctcggcgtcc cctcgcccgt gctgccccag 3360acgaggacca ccccgtcggc agcaacgttc
ccggccgact cggcctcggg cgggcgcggg 3420gcgggcgcga gctggatgtc ggtggaggcg
gccggcgcag gcccgggccg cgcggcacag 3480ccggagatca gcgccagcga gacgatcaac
gcagcaggtc gggagcgcag catgcggagc 3540cgaaagagca tggtgtgtgc cgcggcccac
ggccagggaa gcccgcgccc cgcgcgccgg 3600aggtgtagcc gcgcgcccac gccgcgtggc
acgagcgccc cgtcaccggc gtcgggaccc 3660gacgatcgag ctccgacgcg ggcagcggca
cgaacggcgc gtcgcccgcg tcggcgagcg 3720cgccgtcctc aggtgccgga acgcgagcgc
gcccggcgca ggagcgcccg ctgcgcggcg 3780ttcagccgga tcccgggcct gcgctgccgg
atccgggcct ccatcgcgtc gacgtcggcg 3840gcgagctcgc gctccagcag cagcgcggcc
gcgagggtcg ccgatcggcc gcggcccgag 3900gcgcagtgca ggtacatgcc gtccacgccc
cgcagccgct cgaggagctc caggagccgc 3960tccacctcgg gccccgtgcc gtcgagcgtc
ggcacgcaga ggtagccggg gtggcgccgc 4020accgcggcgg ccgccggaaa ctcggccgtc
atgtccacga cgagccgcac gcccgccggc 4080agctcgtggg cgagcggccg ccggccgacc
cagagcccgg gcgccacctc gttggcgcag 4140tccgcccgcc cgagcgcccg ctccgcgcgc
cataccgccc aggtcaggag gaagtacgga 4200ccgagcagga cgagcgccca cgcggcctgc
gtgccgtctg gccgcttgcc cagcagcgcg 4260ggccggcgcg cgaggtacgc ggcgccgacc
aggccgaagc tcagcgccgg ccagagcagg 4320gcgagcgcag ccccgccggc caggacggcg
agcgcggcga gcgaggcgct caggacgagg 4380aaggtcaggc cgtatcgcat ggatcggaga
ggcgtctgct cggccatgct ctcacatcgg 4440gcgctcctcg atgaagtcga gcagccgctc
gctcgcgtcg aacgcctcgc tcttccggcc 4500gaacaccggc ggcagatccc ctccgcccgg
ccacgtgcgc ccgcccccca cgacagcgca 4560gcgctcgacc tcggcctcgt cggcgcagcg
cgaccgcgcc gtgcacgtcg tgtcgccgtg 4620cgcgtacgtg atgtgggaga cgtcagcgca
gccgtcgcga tcgcgccaga tcggagagag 4680gacagtgcga tgggccggac atcatcacag
ctctctcttt cgggatctcg agggccatcc 4740ggaggtgcgc tgccggcgcg gacagcgtct
cacacgcccg gcaaagctgg cctcgagtcc 4800accgcgaccg ccctctggat ggcgtcgcgg
agccacccgt ggccctcgtc gtgctcggag 4860cgctccggcc agacgagcgt cagcgtgtag
ccctcgagcg cgaacgggca cggccgcacc 4920acgagatcga gcctccgggc cagggccgcg
gcgacgcgcg cggacacggt gagcagcagg 4980tcggaaccgg agacgatgaa cggcgcgaca
aggaaatggg acacggtcag cgtcacccgc 5040cggcgtgttc cctgctccgc cagcgcccga
tcgatggcgc cgtggtcctc tccgtgcggc 5100gagaccatca ggtgctcgca agcagcgtag
cgcgccgcgg tgagcggcct ccgggacgcc 5160gggtgtccgc ggcgcatcac acagacgatc
tcctcggccg cgagcagcgt ggagcgacag 5220ccgtcgggca ccggtccgcc gcgcccgagc
ttgccgtcga gctcgccgcg gcgcaggagc 5280tcggcgaagt cggccgggat gttccggcag
cgcaggttga cgcgcggcgc ctcgacggcg 5340agcagcgcgg tcagcgccgg gagcacgagc
agctcgaggt tgtcggtcgc gacaagccgg 5400aacgtgcgct gcgaccgccg cgggtcgaac
cgctcgaccg ggcggaagac ctgctcgagc 5460cgctcgacgg cctcggccgc ccgcggggcc
aggtcccgcg cccgctcgct cagcgtcatc 5520tgcctgccga cctggatgag cagcgggtcc
gcgaaatggg cgcgcagccg cgcgagcgcg 5580tggctcatcg agggctgcgt cacgcccacg
cggcgcgcgg cgcgcgtgac gctcttctcc 5640tggagcaggg cgtgcaacgc cacgacgagg
tgggtgtcga ccgactgcag ccgcatggtc 5700gatggatacc acgtcgatcc atcgacggcg
tctatggatc gccgcgccga ctgccgattc 5760gacgcccggg gccgtgggtg cctatctctc
ctctccggac ggcgcatgcc gccgcgcggc 5820gcgcgcctac cccccagccg aggagagcaa
ccccatgatc atcgagtacg ttcgctacac 5880gatccccgcg gagcaagaga aggagttcct
ggccgcctac cgcgacgccg ccgcggagct 5940gcgcgggtcg gagcattgcc tcgactacga
gatctcccgc tgcgtcgagg atccgacgag 6000ctacgtcgtc cgcatctgct gggactcgct
gcaaggccat ctccagggct tccgcaaggc 6060ggcggcgttc ccgtcgttct tcgccaaggt
gaagccgttc tacgagcgta tccaggagat 6120gaggcactac gccttgaccg acgtcgccgc
gcggcaggcg gggacggccg cgacgggctg 6180aagggtagac cctgcggccc tccgaacgtc
gaggccgcct gcgccggcct cggctgctcc 6240ccgccagcct gtccgcgcct cacatcgagc
cccttgcagg cccagcgcgc ccggtgaggt 6300gcggagtgac gccgcgatcc cggaaagccg
ctggggagac cgcgcgggga aagcgatgcg 6360ccgcttccgc cgcggtgcgg gcgggtgcag
gatgcggcca tgggaatgcc tccggcgctc 6420gaccgagacc accgccgccg cgccccgcgc
gcgcccgccg ccgcgctcat cgcgctcctc 6480gcagccggcg ccgcgctcgc ggcctgctcc
aggagcacag gcgggccgaa gcaccgcgag 6540gcggcgccgg agcgcgacag cgcctgcacc
gatccagcga agcccagggc gtacttctat 6600cctgcggaga accggacgga ctacgcgcct
gacgatccct ggaaggacgg ctgcgccatg 6660ctggtgccgg atcacctgtt ctgctgtccg
gagaaggcct ccaccggctc gccctgatcc 6720gcgccgcccc gccccgccgc gcgcgcacat
gccgctcgtc cggagcgcag ccgccccgcg 6780cgcgagcgcc acacaggccg caaacgtccc
acacgctgcg cctgcaggcc gagcgcaggg 6840cgccctgcgg agcgccgcgc gcccacctcg
ggcgccgtcg cgcggcgacc gacgcggccg 6900tcgcgcggcg atcgacgcgc gggcagcgcg
cttcacggcg cgcgtgggga taccctggcc 6960tggccgtgga tctgttgagc tacgccgggg
cgaacctgca ggaccgcggg ccgagctcgc 7020tgcgcgttcg cttccccgca gcctgaagcg
ggcgagcgcg gcgccgcggc gggacggccg 7080acacgggtgc cgcacaacgc ggcatgtcgc
attctgcggc ggcgtcgagc ggatggctgg 7140acgcgcgcac ctgcgcgcgc cacctgcgct
aggacgccgg acatgaagct cgcgcgcaag 7200ctgacgctcg ccctcgtgtt cggggtattc
ctcgtgctcg cgctgagcgc ctacgcccag 7260atccgcagag aggccaggat cttcgagaac
gacgtccagc gcgaccatca cacgatggcc 7320cgcgcgctcg cggccgcggt catggaggtg
tggcgctccg agggaaccgc gcgggcgctg 7380cgcctcgtgg aggacgccaa cgagcgggaa
cagcaggcga acatccgctg ggtctggctc 7440gatggccagg ccgacgagcc ccatcgcccc
cggctggcgc cggagctgct cgcccccgtc 7500gccgaggggc gcgcggtcgt gcgccggatc
ccccagaaag acgcggatct gctcgtgacc 7560tgcgtgccgg tgtccgtgcc cggcgaccgc
gccggcgcgc tcgagctctc cgagtcgctc 7620gcgggcgcgc gccggtacat ccggagcatg
atcctgagca cggcgatcac cacagccgcg 7680ctgacgctgg tatgtgggtt gcttacaacg
ggcctcggag tctggctggt gggacgcccc 7740atgcgcacgt tgatcgacca ggcgcggcgg
atcggcgccg gcgatctctc cgggcggctg 7800tcgctgcgcc aggaagacga gatcggcgag
ctcgggcgcg agatgaacgc catgtgcgat 7860cgcctcgccg cggcgaacca gaagctcgag
tccgaggccg ccgcgcggat cgccgcgctc 7920cagcagctcc gtcacgccga gcggctcgcg
accgtcggca agctcgcgtc cggcatcgcg 7980cacgagctgg gcgcgcccct ccaggtcgtc
acggggcgcg cgcggatgct cgtcgacggc 8040gacgtgtcgg gcgatgaggt gccgatcaat
ggacagatca tcctcgagca gtcgcagcgg 8100atgacccaga tcatccgcca gctgctcgac
ttcgcccggc gccgcagcgc cgagaagcag 8160gagaccgcgc tccgcggcgt catccgcggc
acgttcacga tgctgaagcc gctggcggac 8220aagcagggtg tcacgatcgt cgaggaggga
gacacgccgg atcgggtggt ccacgccgac 8280gccgaccagc tccagcaggc gctcacgaac
gtcgtcgtca acgcgatcca ggccatgccg 8340tccggcggca cgatcacggt gggcgtccgg
accgtccgcg ccagcccccc gcccgaccag 8400ggaggggccg agggcgacta catcgcgctg
tcggtgcgcg acgagggaca gggcatgacg 8460gccgacgtcc tcgagcacgt cttcgagccg
ttcttcacga ccaagcccgt cggcgagggg 8520accgggctcg gcctgccggt cgcctacggc
atcatcaagg agcacggcgg ctggatcgac 8580gtcgacagcc gccccggctc cgggagccag
ttcacgatgt acctgccgca ggagaagcca 8640tgaccggacg cgtcctgatc gtcgacgatg
agcgaggcgt ctgcgagctc ctcgacgccg 8700ggctgaagaa gcggggattc caggcggcgt
ggcgcacgtc ggccgccgag gcgctcgagc 8760tcctcggcgc ggaggacttc gacgtcgtcg
tcaccgacat gaccatgcgc ggcatgaacg 8820gcctcgagct ctgcgagcgc atcgcccaga
accggcccga tctgccggtc atcgtcatca 8880ccgcgttcgg gagcctcgac accgccacgt
cggcgatccg cgccggcgcc tacgacttcg 8940tgaccaagcc gttcgagctc gacgcgctcc
ggctcaccgt cgagcgcgcc ctgcgccacc 9000gcgccctccg cgaggaggtg cgccggctgc
ggcgcgccgt ggacgactcc caccgttacg 9060agcagatcct cggcggcagc ccggcgatga
agggcgtctt cgatctgctc gaccgggtcg 9120ccgactcgga cacctcgatc ctcatcaccg
gcgagagcgg caccggcaag gagctcgtcg 9180cgcgcgccgt gcaccagcgc agccggcgcg
gccagggcgc gttcatcgcg gtgaactgcg 9240cggcggtccc ggacgccctg ctcgagaccg
agctgttcgg ccacgcgcgg ggcgccttca 9300ccgacgccaa gggggcgagg agcggcctgt
tcgcgcgggc ccacggcggc accctgttcc 9360tcgacgagat cggcgagctg ccggtcgggc
tccagccgaa gctcctgcgc gccctccagg 9420agcgcgtcgt ccggcccgtc ggcgcggacg
aggaggtccc cgtggacgtg cggctcatcg 9480cggcgaccaa ccgcgacctg gagaccgcga
tcgaggagcg ccgcttccgc gaggacctct 9540attaccggat caacgtggtc cacgtcgatc
tgccgccgct ccgctcccgc ggcgccgacg 9600tgctgctgct cgcgcagcgc ttcctcgagc
acttcgcgac cgtcaaggag cggcccatca 9660agggcctctc ggcgcccgcg gccgagaagc
tcgtcgccta cgcgtggccc ggcaacgtcc 9720gcgagctcca gaactgcatc gagcgggccg
tcgcgctcgc gcggtacgat cagatcacgg 9780tcgacgatct ccccgagaag atacggagtt
accggcgctc ccacgtcctt gtctcgagcg 9840acgacccgac cgagctcgtc cccatggagg
aggtcgagcg gcgctacatc ctgcgcgtcc 9900tggaggtggt cggcggaaac aagagccagg
cagcccaggt cctgggcttc gatcgagcga 9960ccctgtaccg gaagctcgag cggtacggcc
tgcgcgccgg gcgcgcgggc gacccgaggc 10020cgtgatccgc ccggcgccgc gccggaggtg
attccaggag cgcctcgcgg cggcggcgcc 10080gctcgtcctc acgctgttgc agaacgcgac
actcgcggcg tcgcgcgatc ggcagcgccg 10140cgcgcgggcg cgcgcgatct gcgctggtgg
catgagagct gcctgaaagc agggcgcgaa 10200catgagccac acaacggcgg cgtctgctgc
tccggaccgg agagtgccag tcgactggat 10260cgcgctcgcg aacgcgttcg acaacatcgc
tcgaggcgtg cgccatttcc ttcacctcga 10320cacgggcgcc gtgctccggc tgaacgagcg
gctcgtcgat cccgccacgc gcgcgcgcat 10380cgaggaagac ccggggtgcg tgctcatcga
ggccatcgcc gctcgggatc agtatcgatg 10440gctggaggcg ttcattccca cggtggaaga
cctggagttc cggctggcgc tcctgcacag 10500catgcagggc ccaggatcgt tccggcggtt
caaggccgcg ctctcggcca ggccggagca 10560gctccgccgc tggcgcgcct tccggcagga
gcagatccgg gtcgcgatcc tccggtggtt 10620tcacgcccgc ggactgacgc ccgtcgcgct
cgagctctcc tcgccggacg cgagctccga 10680gccgccgagc tcccgcgccg tcgactcggc
tcgccagcag ctttactccg cggcggacgg 10740cctctgtccg agcgacctcc aggcgctgac
gtcgctcgcg gagtacctgc gcgcggcgcg 10800ctcggcgctg cggatccccg ccgattcatc
catgggagac gccgcgcggg ccgtccgcct 10860ggtcccttga cgacgagcgc ggcagctcga
tccggaagac cgagccggcg ccgggacggc 10920tctcgacgaa gaggcggccg ccgtgcgcct
cgacgatgcg cttcgccacc gcgaggccga 10980ggcccgtgcc cgggatggat ccagacgtgg
acttgagccg ccggaacggc tcgaagaggt 11040gcgccagatc ctcgggctcg atcccgagcc
cgcgatcgcg aacggcgatc tcggccccct 11100cgccgccggc gcggaccgcc acgtcgacct
gccccccggc gggggagtac ttgagcgcgt 11160tcgacaggag gttgttcagc acctgctcga
tccgggtcgc gtcgcagcgg acgagcaccg 11220gtgtctcggg gagcgagagc tcgatggggt
gctccggcga gacagggcga tagaggtcca 11280ccgcctcctg cgcgagatcg cgcaggtcgc
gctcctccac ccggagatcg agcttgcagg 11340cctcgatctg cgacgcgtcg aggaggtccc
cgaccatgcg atcgagccgg tcgacctgcc 11400gcccgacgag cgccatggtc cggcgcacgc
tcgactccag gggccggttg tcggcgtcga 11460ggacgtgcac ggacagccgg agcgccgaca
gcgggttcct gaggtcgtgg gccacgccgc 11520cgaggaacgc gaactgcgcc tcgcgctggc
gctccagcga ctctgccatg tcgttgaagg 11580cgcgcgcgat ctccccgagc tcgcgcggcc
cgatcagcgg cgcgcgcgcg gcgcggtcgc 11640ccgcgccgta gcgcccgatg gcctcctgga
tcgcgacgat ggggcggtag atgagccgcc 11700gcgcgctgag gaggatcgtg gacgcgcccg
cgaggaagaa caccaccgcc gcgaggccgg 11760cgccggtcgt gcgccgggtc aggtgcgcga
cgagcgcctc cgacgcgcgg gcctgctcga 11820ggttgatctc gaccaggtga tcgagcgccc
tgaacgcctc gtcgagcgcg gggtcgtgca 11880cgccgagcag cgcgggatcg tgcgcgccag
gcgccgacgg gagctcgtgg gcgtcggcgg 11940cgcggcgccg ggcgaggtag tcctccacgc
gccgctccgc gtgctcgagg atcctgccct 12000cctccgggct gctcacgtgg tcgcgcgccg
ccgcgaggcc gctcctcagg ccttgctccc 12060acgccgccag ggagggggcc agctccccgc
ggccggagcc gaccgcgcgg ctgctctggt 12120gcgcgtcgag caggaggtcg atctccagcc
tctccacgag ccggacgctc tcgaccgtgg 12180cgccgaggat ccgggtggtc tgttgcatgg
tcgtcgacgc gaccatcagc gcgcccgcga 12240caacgatggc cacgctcgtg agaagaagcg
tggcggcccc gaggagcgcg ctcaggcgca 12300cgggccgcgg aagacggggc cagctcaggc
cctgcggagt tggctgtcgc atcttcctgc 12360gttggttcgg atccgcgacg gatgcaacgt
cgcctcgatg gagagattga actgcagagg 12420cacagagcac atcgaggcag ggagcgataa
gcgcgctgcg cccgcggcgc tccccgcccc 12480tccgcgccgg cctcaccggc ctctcgcgcc
tcgatcagct cggtcctctg gacggtgatc 12540cccgtttcct cgacactgcg cgagatgccc
gcccgcaccc cccgcaagcc cccgccgccc 12600gcctcgcccg ctggtcccgc cggcgcgccg
gacgacctca ccgacagcga tcgcgacgcg 12660ctgctgcgct ggcggctcgc gctcgggccc
gaggccgagc gggtcgaccc gcgcctctcc 12720ctcggcgggc tcgggggcgc ggcgcccgcg
ctcgacgtcg acgcgcggcg gctcggcgac 12780ctcgacaagg cgctctcgtt catctacgac
gagcgcgccg gcggcctcgg cggctcgcgg 12840ccctacgtgc ccgagtggct ctccgccgtg
cgcgagttct tcagccacga ggtcgtcgcc 12900ctcgtccaga aggacgccat cgagcgaaag
gggctgacgc agctcctctt cgagcccgag 12960acgctgccgt tcctcgagaa gaacgtcgag
ctcgtcgcca cgctcatgag cgccaagggc 13020ctcatcccgg acgccgcgcg ggacaccgcc
cggcagatcg tgcgcgaggt cgtcgaggag 13080gtgcggcgcg cgctcgaggc cgaggtccgc
accgccgtcc tcggcgcgct gcgccggaac 13140acgacgagcc cgctgcgcgt cctcaggaac
ctcgactgga agcgcaccat ccgcaagaac 13200ctgaaggggt gggacgcgga gcggcgccgc
ctcgtccccg acaagctcta tttctgggcg 13260aaccagacgc gacggcacga gtgggacgtc
gccatcctcg tcgaccagtc gggctcgatg 13320ggcgagagcg tcgtctacag ctccatcatg
gccgcgatct tcgcgtcgct cgacgtcctc 13380cgcacccggc tcctcttctt cgacaccgag
gtcgtcgacg tgactccgat gctcgtcgat 13440ccggtcgacg tgctgttcac ggcgcagctc
ggcggcggca ccgacatcaa ccgcgccgtg 13500gcctacgccc aggcgaactt catcgagcgg
cccgagaaga cgctgctcat cctgatcacc 13560gacctgttcg agggcggcaa cgccgaggag
ctcgtcgcgc gcatgcgcca gctcgccgac 13620agcaaggtga agtcgatctg cctgctcgcg
ctgtcggacg gcggaaagcc ctcgtacgac 13680cacgagatgg cgcagaagct cgccgcgctc
gggaccccgt gcttcggctg cacgccgaag 13740ctcctcgtca aggtggtgga gcggctcatg
cgaggtcagg acctcggccc gctgctcggc 13800gccgaggcgc ggtgagcgcc ccgcgcggcg
cgggatcacg gaagcacaga ggacgcagag 13860gcacttgtct ctcctctgcg tcctctgcgc
ctccatggcc gcccgtcagg ggccccgaaa 13920ccgactggcg cggctcgcga acctcgtcga
cgtcagcgaa ggcgcgccct ggacatccgc 13980ggccgcgcag gccgcgtcag cgcgcgcgac
ggatcggctt ggcggcgcgc tcgtccgccc 14040gccgggcggc ggcgcgcttt cgcgacgtgg
cgccgtgggc agcgctcgcg gagacgcgac 14100ggcgggcgcc ggccgcgcga accaccgctt
cgagcgaggg tgactggccc acgagaggac 14160cagtgctgat cgaggggccg actaggctga
tagaaagttt cacttgaact accgatgtgg 14220tggcggaccg atcacgtcgc tcagcggagg
gctcgtcgac ctataaactg ttttgatcgt 14280ttgcgcagcg tcacgatgcg gagatcacga
cccctgagcg cccgtccgga cgtgaacttg 14340tccccccggg ggatccacac gccttccgcc
tctcacgacg gacgtacgca cacaccacgg 14400aggcacgaag gcacggttgt gggttcgctc
cgtgccttcg tgtctccgcg gtgcttggcg 14460agggactgcc cccggaggtt gcaccgggcg
ctctgtcacg agctggttgc acgatgcagg 14520ccaacgatgg ccggatgccg gcgtcgcccg
ttttccgggg atggccatgg tccgcctttc 14580atggttgaaa ccactggttg caaccatggc
ggaacggagc ggcgtcgctg cccccgcggc 14640gcctggcgcc ccggggagag cgcctctcgg
cccgcaggac cggtcagcgg ggatccaggg 14700cgctggccca gcggccccga cgatccagcc
gcgcggggcg ggagcggcag cgtggatcac 14760tgcggacttg ccgtcgatcg aggtccgcat
ctggatcggc tcgccgcgac cgtatccgat 14820ggcgttcatg agcgggctgg cgagcaccgg
gtcgacgcgt aggcggcccc accgaccgat 14880cgccgaggtc gagcgcctgg gcgacgttcc
ccggcgagca cgcgacgcgc aggccgggac 14940gcccgctcgt cagcaccccg ccgccgctgc
ccacgccgag acgagctcct gcaggcgatc 15000tcgggttcac cctcgcgcga ggacctgtgc
tccgcggtga gcccgataga cgctgcccgg 15060gcctcccccc tactccgtac ctgcccgaca
aaggaccagc ccacgcgcgc tgtcattcgg 15120ttgagcaccc gccttctgtt cgcagggcgc
gccttgaaga gtcggacagg tcgccttccg 15180gaaaggcagt ggcctggtat ccgccatgtt
tccggtgtgc ttcgctgcta tccggtggcc 15240tggtgtccac catgtttccg gtgtgcttcg
ctgctatccg gtggcctggt gtccgccatg 15300tttccggtgt gcttcgctgc tatccggtgg
cctggtgtcc gccatgtttc cggtgtgcct 15360ctctgctcga gccacgggcc acctctaccc
gagcaactcg acctgatgca atgtagttga 15420gcccgcctgg ctggcagcgg tgccatcccc
gtcctgcctc tgacagcagc ggatcgcaga 15480cccgcctgcg atgccggtag cgggacatcg
gcacagatga ctgttcaccg tgcgggcagt 15540gttcctggct ggaagaataa tcccgtatca
attcaataag atgccctggc ggcgccaagc 15600tcaccacagc ctactcggcg caaccactca
gccctcacga caactatgta atttttctca 15660caacatgagc acttgattga aagattggaa
aagtgaacga cgaaaggttg cgtagattac 15720cgtaggtgct agcctggcgc gcactttcct
atgcccgaca cgtcgtcgtc gagccccgta 15780atggcgatgg ggctatcgga ctcgaaagcc
cggtccgtgg aggatgcacg gcctgcctcg 15840gggcttcctc gtccacccgc gggcatcgct
gtggtgggaa tgggatgtcg cttccccggc 15900ggcatcgatt cgcccggatc cttgtgggcg
gccctatctc aagggcgcga ccttatcagc 15960gaggtcccgc cggaccggtg ggatgtcaat
gcccactacg acgccgacgc aagcgtcccc 16020gggaagattg cgacccgcca tggcggcttc
ctcgccgggg tcgcggcgtt cgacgcgcct 16080ttcttcgacc tctcgccgcg cgaagcgaag
catatggatc cgcagcagcg cctcggcctc 16140gagacggcgt gggaggcgct ggaggacgca
ggcctggacg cgaggagctt gcggggcagc 16200cgggcagggg tgttcgtcgg ctcgatgtgg
gcggagtacg acgtgctcgc gtcgcgacat 16260cccgaatcca tctcgccgca cggggccacg
gggagcgacc cggggatgat cgctgcgcgc 16320atcgcctaca ccttcggcct tcgtgggccg
gccttgtcgg tgaatacggc gtcgtcgtcc 16380tccctcgtgg cggtgcatct cgcattgcag
agcttgcaga gcggagagtg cgagctcgcg 16440ctggccggcg gcgcgaacct catcctgacc
ccatacaaca cgatcaagat gacgaagctc 16500gggacgatgt cgcccgacgg ccggtgcaag
gcgttcgacc accgcgccaa cggctacgtg 16560cgcgccgagg gcgtcgggtt cgtggtcctg
aagccgctgt cgcgagcgac cgcggacggg 16620gatcggatct atgcggtcgt gcgtggctcg
gccgtgaaca acgacgggct caccgacggg 16680ctgaccgcgc cgagcgggga ggcgcaggag
gccgtgctgc gagaggcgta tgcgcgcgcc 16740ggggtgtctc ccgccgaggt ggactacgtc
gaggcgcatg ggacgggaac gccgctcggc 16800gaccgcgtgg aggcgacggc gctgggacgg
gtgctcggcg caggacgcgc ggcggatcgc 16860gcgctgcggg tcggttcggt caagacaaac
ctcggtcacg cggaggcagc cgccggggtc 16920atcggtctga tgaagacagc gctgtcgctg
cgtcacgggt cgcttccggc gagcctgcac 16980gtcgagcgcc cgaaccccga gatacccctc
gaatcgctgg gcctccggct ccagacggcg 17040cacggcgtgt ggccggaggt cgatcggccc
cggcgagcag gcgtgagctc attcggcttc 17100ggcggcacga actgccatgt ggtgatcgag
gagtggcgcg ggggcctcca gcagagcgcc 17160gccgaggcgg gcagcgaccc cggcgccgcc
gtaccgccgc ctggccttcc ccttgtgctg 17220tcggcgaggg accacggggc gctgcgggcg
caggcgggcc ggtgggcggc gtggctcacg 17280gagcaccgcg aggcgcgctg ggcggacgtc
gtccacacgg cggcagtgcg gcggacgcac 17340ctgggcgctc gggccgcggt gatggcggcg
ggcgtggccg aggccgtcga tgcgctgaag 17400gccctggccg acgggcgcgc ccacggggcc
gtgacggtcg gcgaggcgcg cgagcggggc 17460aaggtggtct tcgtgtttcc gggccagggc
agccagtggc cggcgatggg gcgagcgctc 17520ctgtccgcgt cgaaggtgtt cgccgaggcc
gtcgaggcgt gcgacgcggc gctgaggccg 17580ctgacgggct ggtcggtgct ctcgttgctg
cgcggcgacg ccggggaggc agcgccgtcg 17640ctcgaccgcg tcgacgcggt gcagccggcc
ctgttcgcga tggctgtcgg cctggccgct 17700gtctttcgcg cgtggggcct cgatccttcg
gccgtggtgg gccacagcca gggcgaggtc 17760ccggcggcgt acgtcgcggg ggcgctctcg
ctcgacgacg cggcgcgggt cgtggcggtc 17820cgaagcgcgc tcgtgcggcg gctcgcgggc
gcaggggcga tggcggcggt ggagctgccg 17880gccggcgagg tggagcgccg cctggcgccg
ttcggggggg ctctggccat tgcggtggtc 17940aacacgtcga gctcgacggc cgtttctgga
gacgccgagg cggtggacag gctggtcgcg 18000cagctcgagg ccgaaggcat cttctgccga
aaggtgaacg tcgattacgc atcccacagc 18060gcgcacgtgg acgtcgtgct accagagctc
ctggagcgcc tggcgccggt ccggccaggg 18120gccacgagga tccccttcta ttcgacagtg
accggcggtg tgctggaggg gacggcgctc 18180gacggggcgt actggtgccg caacctgcgc
cagccggtgc ggctggaccg cgcgctcgcc 18240cggctgctgg acgacgggca tggcgtcttc
gtggaggtca gtgcgcaccc ggtgctggcg 18300tcgccgctga ccgcggcgtg cgccgagcgc
gagggcgtgg ttgtcggcag cttgcagcgc 18360gacgacggcg ggctcgcgcg gctgctcggc
tcgctgggcg cgctgcatgt gcagggccag 18420ccggtcgact ggcgcgcggt gctggcgccg
ttcggcggca gcctggtgga cctgccgacc 18480tatgcattcc agcgccagcg ttactggttc
gatacggatg agagcgtcgc cctcgcagcg 18540gcgtccagcg tcgcggaaga gtcgtggtca
gaaaagctgg ccgggctgtc ttccgcgcga 18600cgggaagaac ggctgctcga atgggtgcgc
gcagagattg cagcggtgct cgggctggag 18660gcgccggcgg tgccgccaga cgtcttgctg
cgggatctcg gattgaaatc gccgatcgcc 18720gtggagctgg ggagccggct gggacgcagg
acacgccgga agctgcccgt gaccttcgtt 18780tacaaccacc cgacgccacg agcgatcgct
cgcgccctcc tggagggaat gttttcctcg 18840atcaaggact ctgcttcgag cgccgctgac
gaccgccgcc cgccgggggt gctcgaagac 18900gttgcccccc cacaggcgct cgagacgtcc
gagatgtccg acgatgagct gttccagtcc 18960atcgatgcgc tcgtctaggg agaccgcgct
ctcgtcgaag aaggttgttc aacgctgcgg 19020gtcgaggatt gctcgtggat cgaagcgata
aactgcgtgc gtatctggag aagaccacgg 19080cctcgctggt cgaggcgaag ggccggatcc
gggagctgga agcgcgttcg cgcgagccga 19140tcgcgatcgt ggcgatggcg tgccggtttc
cgggcggcgt cgacagcccc gagaagctct 19200gggccctgct ggacgaggag agggacgcca
tcaccgaggt gccgccctcg cgatgggacc 19260tcgagcgctt ctatgacccc gatccggacg
ccgcgggcaa gacctacagc cgctggggcg 19320gcttcgttgg cgatctggac cgtttcgacg
cggcgttttt cgggatcagc ccccgcgagg 19380cccggagcat cgacccgcaa gagcgctggc
tgctggagac cacgtgggag gccctcgagc 19440gggccggcgt gcgcgcagac acgctggaag
ggaccctggg gggcgtttac atcggcctgt 19500ccggctcgga gtaccagacg gaggcattcc
acgatgcgga gcgcatcgac gcctattcgc 19560tgaccggcgc ttcgccgagc acgaccgtgg
ggcgcctcgc ctactggctc gggctacgag 19620gccccgcggt cgccgtggac accgcgtgca
gctcctcgct cgtcgcggtg cacctggcct 19680gccaggcgct gcggaacggg gagtgcgatt
ttgcgctggc aggcggcgtc aatgcgctcc 19740tggcccccga gagctatgtt gccttctgcc
gcctcagggc gctgtccccc accgggcggt 19800gccagacctt ctccgcggac gccgatggct
acgtgcgcgc ggaagggtgc ggggtgctgc 19860tgctcaagcg tctgtcgcac gcgcagcggg
atggagaccg tgtgctcgcg gtcatccggg 19920gcaatgccat caaccaggac ggccgcagcc
aagggttgac ggcgccgaac gggctcgcgc 19980aggaggacgt catccgcagg gcgctgtcgc
aagccgccgt ggagccgacg accgtcgatg 20040tggtcgaatg ccacgggacc ggcacggcgc
tcggcgatcc gatcgaggtc caggcgctcg 20100gggctgttta cggcgatggg cgccccggag
acaggccgct cgtgatcggc tccgtcaaga 20160cgaacatcgg tcataccgag gcggccgcgg
gcatggccgg cctcatcaag gccgtccttt 20220cgctgcagca cgcccaggtc cctcgatcgc
tgcacttcgc ggcgccgagc ccttacatcc 20280cctgggatac cctccccgtc cgcgtggccg
cgcagcgcgt cgcatgggag cggcgcgagc 20340acccgcggcg cgccgggatc tcctcgttcg
ggatcagcgg caccaacgcg cacgtgatcc 20400tcgaggaggc gccggaagcg ccggcgacgg
cgccggaggc ggcggcggtg acgtcgacgc 20460tgccgttgct tgtgtcgggg cgggatgagg
cggcgctcag ggcgcaggcg gagcggtggg 20520cggcgtggct cgcggcgcac ccggaggcgc
gctgggcgga cgtggtgcac acggccgccg 20580tgcggcgcac gcacctggag gcgcgcgcgg
cggtggccgc ggggaacgcc gccgacgccg 20640ccgcggcgct gggggcgctg gccgccgggc
agccgcacaa ggcggtgtcc ctgggcgagg 20700cgcgcgcgcg cggcgatgtc gtgttcgtgg
ttccgggcca ggggagccaa tggccggcga 20760tggggcgggc gctgctggcc gagtccgagg
tgtttgccgc cgctgtcgcg gcctgcgacg 20820cggcgctgcg gccgttcacg ggctggtcgg
tgctctcggt gttgcgcggg gagcagggcg 20880aggcggtgcc gcccgccgac cgcgtggacg
tggtgcagcc ggcgctgttc gcgatggccg 20940tggggctctc ggcggtctgg cgggcgtggg
gcatcgagcc ctcggcggtg gtcggccaca 21000gccagggcga ggtcgcggcg gcgtacgtcg
ccggggcgct gacgctcgag gacgcggcgc 21060gggtggtggc gctgcgcagc cagctcgtgc
ggcgcatcgc cggcggcggc gcgatggccg 21120tgatcgagcg ccccgtcggc gaggtggagc
agcggctttc tcggttcgga gggcagctct 21180cggtggcggc ggtgaacacg ccgggctcga
cggtggtgtc cggggacgcc gcagcggtcg 21240atcgtttgct ggccgagctg gagaccgcgc
gggtgttcgc gcggcggatc aaggtcgatt 21300acgcgtcgca cagcgcgcac gtggacgcga
tcctgccgga gctcgaggcc tgcctggcct 21360cggtcgagcc ccgtacctgc gccatcccgc
tgtactcgac ggtgacggga gaagtgctcg 21420ccggcccgga gctcggcgcg acatactggt
gccgcaacct gcgcgagccg gtgcggctcg 21480accgggcgct ctcgcggctg ctggcggacg
ggcacggggt gttcgtggag gtcagcgcgc 21540atccggtgct ggccatgccg ctgtcggccg
cgagcgccga gcgcggcggc gtggtggtgg 21600gcagcctgca gcgcgacgac ggcggtctgg
ggcggctgac gtcgatgctt ggcgcgctgc 21660acgtgcacgg ccacgccgtg agctggcagc
gggtgctggc gccgtacggc ggggcgctcg 21720tgggcctgcc gacgtacgcg ttccagcgcc
agcgccactg gctcgaggcg ccgcggtacg 21780cggcggagga tacggacggc gcggcgcggc
gcgacccgct gtaccgggtc acgtggatcg 21840aggcggcgct ggaagaagcg ccgtgggcgc
ccgagcgcca cgtcgtgctc ggcgggggcg 21900gcgcgctggc ggcggggctg ggggcgctcg
cgctggcggg gctgccggag ctgctcgagg 21960cgctggagaa cagggcggcg gcgcccgagc
ggctggtgct ggacctgacg gagggccgcc 22020caggcgcggt ggcggagtcc gtgcacgcca
cgacgcgcga cgcgctcgcg ctggtccagg 22080catggcttgc ggcgccgcgg ctctcgggca
ccgagctggt cgtggtgacg cgggaggcgg 22140tggcggccgg cccggacgag ggcgtggcgg
cgctgggccc cgccgctgtc tgggggctgc 22200tgcgcacggc ccgcgtcgag caccccgagc
gcgcggtgcg cgcggtggat ctggggcgcg 22260agccgctgga cgtcgcggtc ttgcggcggg
cgctgggggc ggtggccgag ccggagctcg 22320cgctgcgcgc gggcggggcg cgggctgcgc
gcctgcgcgc tgtcgacgcc ggcgcgggcg 22380ccagggagcc ggcggctgcg ctggacccgc
agggcacggt gtggatcacg ggcggcaccg 22440gggagctggg gcggcagatc gcgcggcacc
tggtcgcggc gcacggcgtg cggcacctcc 22500tgctgacgtc gcggcggggc gcggccgcgc
cggacgccga ggcgctcgtc gagcagctgc 22560gggccgacgg cgccgagacg gtcgaggtcg
tggcgtgcga cgtgacggac ggcgcggcgc 22620tttcggcagc agtccaggcg gctgcggcaa
ggcacccgct gacggccgtg gtgcacaccg 22680ccggggagct ggcggacggg gtgctcacgg
ggctgacggc ggagcagctc gcgcgggtgc 22740tggcgccgaa ggtcgacggg gcgtgccacg
tgtacgccgc cgcgcaggac cagccgctcg 22800cggccttcgt gctgttctcc tcgatcgtgg
gcacgctggg caacgcgggc caggcgaact 22860acggggccgc caatgcgttc ctggacgcgt
tcgcggcgca gcttcgcgcg cgcggcgtgc 22920cggcgacgag cctcgcgtgg ggcttctggg
agcaggcagg gctcggcatg acgtcgcacc 22980tcggcgcggc cgacctggcg cgcctcaggc
ggcagggcct tgcgccgctg tcggtcgcgc 23040agggcctgcg cctgctcgac cgggcgctcg
cgcgcgcgga ggcgacgctg gtgccggcgg 23100cgctcgatct tccggcgctc cagcgtgcgg
cgagcgacgc cggacgggtg cctccactgc 23160tgcgcgggct ggtgcgcacg agtcccggcc
gccccacggc gaccgcgacc cccgaggccg 23220ggccggcggc gtcggcgctg cgcgcacggc
tctcggcgtt gcccgaggcc gagcggccgg 23280gcgcgctgct ggatctggtg cgcacggagg
tggcggtcgt gctgcagctg gcagggccgg 23340cgcaggtgcc cgcggacaag ccgctgaagg
agctggggct cgattcgctc acggccgtcg 23400agctgaggaa ccgcctcggc gcgcgcgccg
agacggtgct gccgacgacc ctcgcgttcg 23460accatccgac gccgcgcgcg atcgcggatc
tgctgcttca gcgtgcgttc tcggagctcg 23520cggcggcgaa ggcgacgcgc gcgcggggag
cgcacgacga gccgatcgcg atcgtgtcga 23580tggcgtgccg gctcccgggc agcgtcgata
cccccgcggc gctgtggaag ctcctggcgg 23640aggggcggga cgcgatcggg ccgttccccg
aggggcgcgg ctgggacgtg gcggggctgt 23700acgatccgga cccggatgtg ccgggcaagt
cgatcaccac gcaaggcggc ttcctctacg 23760acgccgaccg cttcgatccg acgttcttcg
gcatcagccc gcgcgaggcc gagcgcatgg 23820acccgcagca gcgtctgctg ctcgagtgcg
cctgggaggc gctcgagcgc gcgggcctgg 23880cgccccacgc gctcgaggcg agcgccaccg
gcgtcttcgt cgggctcgct cacggtgact 23940acggcgggcg gctcttgcag cagctcgagt
ccttcgacgg ccacgtcctc accggcaact 24000tcctcagcgt cggctcgggg cgcatcgcgt
acacgctggg gctccgcggc cctgcgatga 24060ccgtcgacac ggcgtgctcg tcgtcgctcg
tggcggtcca cctcgcgtgc atgtcgctcc 24120gcgcgggcga gtgcgacatg gcgctcgccg
gcggcgccac cgtgatggcc acgccgatga 24180tcttcgtcga gttcagccgc cagcgcggca
cggcgctgga cggtcgttgc aaggcgttcg 24240gcgccggggc cgatggcgcc ggctggtcgg
aggggtgcgg gatcctggcg ctgaagcggc 24300tgtcggacgc gcagcgcgac ggcgaccgcg
tcctggcggt gatccgcggc tccgccgtca 24360accaggacgg ccgcagccag gggctcaccg
cccccaacgg cccggcccag caggacgtca 24420tccgccaggc cctggccgcg gcggggctca
cgcccgccga cgtcgacgcc gtcgaggcgc 24480acggcaccgg cacgcgcctc ggtgacccca
tcgaggcgca ggcgctgctg gcgacctacg 24540gcgccgcgca cacagcggag cggccgctct
ggctcggctc gctcaagtcg aacctcgggc 24600acacgcaggt cgccgcgggc gtgtcggggc
tgatgaagct cgtgctggcc ttgcagcacg 24660cagagctgcc gaggacgctg cacgccgacc
cgccctcgcc gcacgtcgac tggtcgcagg 24720ggcacgtcaa gctcctgaac gagcccgtgc
cgtggccgcg caccgacagg ccgcggcgcg 24780cggcggtctc gtccttcggc atcagcggca
ccaacgcgca cgtcatcgtc gaggaggcgc 24840cggccgaagc gccggcgaca gcggcggacg
caaagtcggt ggaggcgctt ccgatcctgc 24900cgctgctggt ctcggggtcc gacgagccgg
cgctgcgcgc gcaggtgcgg cggctggtgg 24960agcacctgcg gtcgcacccg gacgagcggc
tgctggacgt ggcagcgagc cttgcgacca 25020cgcgcgcgca tctcgcgatg cggctcgcgc
tgcccgtctc ggcaggggcg ccccgggatg 25080cgtgggtgga tgagctggag gcatttgcca
ggggaggagc ggctccgacg caggcatcgc 25140agacccccgc cgagagcagc gcgggcaagg
tcgcggtgct cttcaccggc cagggcagcc 25200agcgcgccgc catggggcgc gccctgtacg
ccacccaccc cgtcttccgc gccgcgctcg 25260acgccgcatg cgccgagctc gaccgccacc
tcgacaggcc cctccacagc gtcctcttcg 25320cagacgccgg caccgaggcc gccgcgctgc
tcgaccagac aggatgggca cagcccgccc 25380tgttcgctct cgaggtcgcg ctctaccgac
agtgggaggc ctggggtctg cgccccgagc 25440tgctgctcgg ccacagcatc ggcgagctcg
ccgccgccca cgtcgccggc gtgctcgacc 25500tccccgacgc ctccgccctg gtcgccgccc
gcggacggct catgcaggcc ctcccccacg 25560gcggcgccat ggcctccatc gaggccaccg
agcacgagct cctacccctg ctcgaccagc 25620acaccggacg cctctcgctc gccgccctca
acgctccacg ccagtcggtc gtcagcggcg 25680acctgcacgc cgtcgaccag gtctgcgccc
acttcatcgc cctcggccga cgcgccaagc 25740ggctcgacgt cagccacgcc ttccactcgg
cgcacatgca gcccatgctc gacgccttcg 25800ccagcgtcgc ccgcggcctg accttccacc
cgccacggct gcccatcgtc agcagcgtca 25860ccggcgcacg cgccaccacc gaccagctca
cctcgcccga ctactgggtg cagcaggtgc 25920gcgagcccgt gcgcttcctc gacgccatgc
gctccctgca cgccgccggc gccgccacct 25980tcgtcgagtg cgggccgcac ggcgtgctca
ccgccgcagg cgccgagtgc ctcgctcccg 26040agggcgctcg cgacgccggc ttcgtcacca
gcctccgcaa ggaccgcgac gaggccctcg 26100ccctggtcca cgccgcctgc gccgtccatg
tccgcgggca cgccctcgac tggctccgct 26160tcttcgacgc caccggcgct cgccgcgtcg
agctgcccac ctacgccttc cagcgacagc 26220gctactggct cgaggcgcca aggcctcgcc
ccagcctcga gggcgtcggc ctcaccgccg 26280caaaccaccc atggctcggc gccgccgtgc
gcctcgcaga ccgcgatggc tacgtcctca 26340gcggccgcct ctccaccatc gaccacccgt
gggtcctcga ccacgtggtg ctgggcacgg 26400cgctgctccc gggcacgggc ttcgtcgagc
tggcgtgggc ggcggcagag gcggtcgggc 26460tgcccggggt atcggagctg gcgatcgagg
cgccgctggc gctcccggcg cgcggggcgg 26520tggcgctgca gatcgcgatc gaggcgccgg
acccggcggg gcgccgcggc gtcgcgatct 26580acagccgccc cgacggcgca gccgacgcgc
cctggacagc gcacgcgcgc ggcgtgctgg 26640gcgccgcggc gcccgacagg gacgcggcgt
gggcacaggg cgcgtggccg ccgccggggg 26700ccgtgcctgt cgatgtgacg cagcggatcg
agatcgtgga cgcgtgggtc ggcccggcgt 26760tccggggcgt caccgcgctg tggcgcgtcg
ggcggacgat ctacgccgac gttgcgctgc 26820cggacggtgt ggcgagcacg gcgcaggact
tcgggctgca tccggccttg ctcgatgtgg 26880cgctacgcgc gttcctgaga gcggagctcg
gcgccgatcc ctcgccacgg gagggcacgg 26940tggtgccgtt cgcgtggtcg gacgtggtgc
tcgaggcgcg tgggacggcg gcgctgcggg 27000tgcgcgtgga ggtggcggcc gatggggacg
gcgacgcgat cacggcgtcg atccagctgg 27060ccgacgggca gggccgcccc gtcgcgcggg
tgggcgcgct ccagatgcgg tggacgacgg 27120ccgagcgggt gcgcgcggcc gcgggcgcgg
cggagcgcga tctgtaccgc gtcgcgtgga 27180cggacgtggc gctggacgac gcggcgtttg
cgccggagga gcacgtcgtg gtcggcggcg 27240acggcgcgct ggcggcggcg ctcggtgcac
gcgtggtggc ggggctgccc gagctgctcg 27300cgtcgctgcc ggacggcgcg gcggcgccac
gccggctggt ggtggacctc acggcggacg 27360ccgcgggcgc ggtcgtcgac gccgtgcacg
ccgcagcgcg cgacgcgctg tccctggtgc 27420agggatggct ggcggcgccg cagctggcgg
cgacggagct cgtggtcgtg acgcgcggcg 27480cggtggcggt cgcgccggac gagggcgtgg
cggcgctggg ccccgcggcg gtctgggggc 27540tgctccgcgc gacgcgcgtc gagcatgcgg
atcgcacggt ccgcgtgctc gatctggggt 27600ccgcggcgcc ggacatgacg ctcttgcgcc
gggcgctcac ggcggccgag gagccagagc 27660tcgcgctgcg cgcgggcggg gcgcgggcgc
cgcgcctcga cgcggccagc gagaccgaag 27720gagagctggc gccgcccggc ggggcgcgct
ctcttcgcct gtccatccgg acgaagggct 27780cgttcgacgc gctccacctc gcggacgctc
ccgatgcgct gcgcccgctc gggccggggc 27840aggtccggct cgctgtccgc gccacggggc
tcaacttccg cgatgtcttg aacgtcctgg 27900ggacgtaccg cggcgaagcg gggcctctcg
gtctggaggg ggctggggtg gtgctggacg 27960tgggcgaggg agtcaccgcc cttcgacccg
gcgaccgggt gatgggcatg ctgcacgcgg 28020gcatggcgac ccatgcggtc gtcgacgccc
ggctgctgac gcacatcccg cgggggcttt 28080ccttcgtgga agcggcgacg attccagcgg
ccttcctcac cgctctgtac gggctgcgcg 28140acctcggcgc gctgaaggcg gggcagcgcg
tgctggtgca cgccgccgcc ggcggggtgg 28200gcatggcggc ggtccagctt gcgcgcctct
ggggagccga ggtgttcgcg acggcgagcg 28260agggcaagtg gccggcgctg cgtcggatgg
ggatcgacca ggcccatatc gcctcgtcgc 28320ggaccctcca cttcaggaaa gccttcctcg
atgcaacgca gggacagggc gtcgacgtgg 28380tgctcgacgc gctcgcgggc gagttcgtcg
acgcttcgct cgacctgctc ccgcgcgggg 28440gcgcgttcgt ggagatgggc aagagcgatg
tgcgggatcc cgagcgcgtc gccaaggacc 28500acccccgcgt tcgctacacg gccttcgatc
tgctcgacgc ggggccagac cacatccagg 28560cgatgctgcg ggagctcgtc ccgctgttcg
aggagggcgt cctcgctccc cttccctccg 28620tggcctacga cctgcgtcgc gccccgcacg
ccttccgctc catggccaac gcacgccaca 28680taggcaagct cgtgctggtg ccgcccgcga
cgctcgaccc tgacggcacg gcgttgatca 28740cgggcggcac gggagagctc gggcggcaga
tcgcgcggca cctggtggcg gcgcacggcg 28800tgcgccacct ggtgctgacg tcacggcgcg
gcatggacgc gcccgacgcc gcagcgctgg 28860tggaatcgct gcgcgcggcg ggcgccgcga
cggtggaggt cgcggcgtgc gatgtgacgg 28920accgtgacgc gctggcggcc atcgtgcagg
cgatccccgc ggcgcgcccg ctgaccgccg 28980tcgtgcacac ggccgccgtg ctggacgacg
gcaccgtggc ggggctctcg gccgagcagc 29040tcgcgcgcgt gctgcggccg aaggtcgacg
gcgcctggca gctctacgag gcgacgaggg 29100acgcgccgct cgcggcgttc atgctcttct
cgtcggtcgc cggcacgctg ggcagctcgg 29160ggcaggcgaa ctacgccgcc gcgaacgcgt
tcctcgacgg gctggcggca gagctccgcg 29220cgcgcggcgt gccggcgatg agcctcgcgt
ggggcttctg ggagcagggc gggatcggga 29280tgacggcgca cctcggcgcc gccgatctgg
cgcggctgaa gcggcagggc atcgtgccga 29340tgacggtcgc gcacggcctg cggctgctcg
accgcgccct cgagcgcccg gacgcggcgc 29400tggtgcccgc ctccctggac atggcggtga
tccagcggac ggcgagcgac caccgtcagg 29460tgccgcccat gctgcgcggg ctggtccgcg
tcgcgccgcg gcaggcggca ggggcagcca 29520gcggcaggag ccatgaggcc tcgaccctgc
ggcagcagct cgccgcgctg cccgaaccgg 29580agcggcagcg agcgttgctc gatctggtcc
ggaccgaggc agccgccgtc cttgtgctgc 29640gcgggccgga cgctgtcccc gccgacaagc
cgctcaggga gctcgggctc gactcgctca 29700cggcagtgga gctcaggaat cggctcagga
cccgtgcgca gaccgatctc ccatcgaccc 29760tcgccttcga ctacccgacg ccgaaagcgg
tcgccgtgta tctggcccag gagctcgacc 29820ttcacgacgt catgacggag atgcgcggac
cgagcttgcg ctctgacgac gagctcaagt 29880cggccatcgc gagcatccgg atctcgacgc
tacgccaggc ggggctgctc gacagcctgc 29940ttcggctcgc cgccagcgaa gccgtctcca
catccagcga cacgacacct gaaaccgacg 30000agctgacgct gcagcatgtt ggagacgatg
agctggcacg gcttgtcttc gacctcgccg 30060gaggagcgca atgaaagaag agatctccgc
ccgtcaagct ctcgagaaga gcttcattga 30120acttcgccgt atcaagcggg agctcgatca
gctcaaggcg aagtcgagcg agccgatcgc 30180gatcgtgtcg atggcgtgcc ggctcccggg
cggcgtcgat acccccgcgg cgctgtggca 30240gctgctctcg gaggggcggg acgcgatcgg
gccgttcccc gaggggcgcg agtgggacgt 30300ggcggggctg tacgacccgg acccggacgc
gccgggcaag tcgatcactg cgcaaggcgg 30360cttcctctac gacgccgacc gcttcgatcc
ggcgttcttc gccatcagcc cgcgcgaggc 30420cgagcggatg gacccgcagc agcggctgct
gctcgagtgc gcctgggagg cgctcgagcg 30480cgcgggcctg gcgccccacg cgctcgaggc
gagcgccacg ggcgtcttcg tcgggctgtc 30540ggtcacggac tacggcgggc ggctgctgca
cgatcccgag gccctcgacg gctacatcgc 30600caccggcacc ctgcccagcg tcggctcggg
gcgcatcgcc tacacgctgg ggctccgcgg 30660ccccgcgatg accgtcgaca cggcgtgctc
gtcgtcgctc gtgtcgctcc acctcgcgtg 30720catgtcgctc cgcgcgggcg agtgcgacat
ggcgctcgcc ggcggcgcca ccgtgatggc 30780cacgccgatg gccttcatcg agttcagccg
ccagcgcggc acggcgctgg acggtcgttg 30840caaggcgttc ggcgccgggg ccgatggcgc
cggctggtcg gaggggtgcg ggatcctggc 30900gctgaagcgg ctgtcggacg cgcagcgcga
cggcgaccgc gtcctggcgg tgatccgcgg 30960ctccgccgtc aaccaggacg gccgcagcca
ggggctcacc gcccccaacg gcccggccca 31020gcaggacgtc atccgccagg ccctggccgc
ggcggggctc acgcccgccg acgtcgacgc 31080cgtcgaggcg cacggcaccg gcacgcgcct
cggcgacccc atcgaggcgc aggcgctgct 31140ggcgacctac ggcgccgcgc acacagcgga
gcggccgctc tggctcggct cgctcaagtc 31200gaacctcggg cacacgcagg ccgccgcggg
cgtgtcgggg ctgatgaagc tcgtgctggc 31260cttgcagcac gcggagctgc cgaggacgct
gcacgccgac ccgccctcgc cgcacgtcga 31320ctggtcgcgg gggcacgtca agctcctgaa
cgagcccgtg ccgtggccgc gcaccgacag 31380gccgcggcgc gcggcggtct cgtccttcgg
cttcagcggc accaacgcgc acatcatcat 31440cgaggaggcg ccggcggcct ccgccgaggc
gacgagccgc ggggagaaga cgtccgcggc 31500cgcgccgccg tcgatgatgc cgctgctggt
ctcgggggtg gacgaggcgg cgctacgagc 31560gcaggcgggg cggtgggcgg cgtggatcga
ggcgcacccg gaggcaggct gggcggacgt 31620tgtgtacacc gcggcagcgc ggcggacgca
cctgggggcc cgtgcggcgc tgacggcggc 31680ggacgcggcc ggcgctgtcg cggcgctgac
ggcgctctcg caagggcagc cgcacgccgc 31740gctcgccgtg ggcgaggcgc gcgctcgggg
gaaggtcgcc ttcgtgtttc cgggccaggg 31800cagccagtgg ccggcgatgg ggcgggcgct
gctctcgcag tcggaggtgt tcgccgcggc 31860ggtcacggcg tgcgacgcgg cgctgcggcc
gttcaccggc tggtcggtgc tctcggtgct 31920gcgcggcgac tcgggcgcgg aggtgccgcc
gctggagcgc gtcgacgtcg tgcagccggc 31980gctgttcgcg atggcggtgg ggctcgccgc
tgtgtggcgc gcgtggggcc tcgagccgtc 32040ggcggtggtg ggccacagcc agggggaggt
cccggcggcg tacgtcgcgg gggcgctgtc 32100gctcgaggac gcggcgcgga tcgtggcgct
gcgcagccag ctcgtgcggc gcctgtccgg 32160ggctggcgcg atggccgtga tcgagcgccc
ggtaggcgag gtcgagcagc ggctctcgcg 32220gttcggcggc gcgctgtcgg tggcggcggt
caacacgccg cgctcgacgg tggtgtcggg 32280agatatcgag gcggtcgacc gcctgctggc
ggagttcgag ggcgagcagg tcttcgcgcg 32340gaaggtcaac gtcgactacg cgtcgcacag
ccgacacatc gacgggctgc tgccggagct 32400ggagaacggc ctgggcgcgg tgcggccgcg
cgcgagcacg atcccgttct actcgacggt 32460gaccgggacg gtgctgacgg gcgcggagct
ggacgccgcg tactggtgtc gcaacctgcg 32520cgagccggtg cggctcgacc gggcgctctc
gtggctcctg gacgacgggc acggcctgtt 32580cgtcgaggtc agcgcgcacc cggtgctgac
gctgccgctc acaggagcga gcgcggcgag 32640cggcggtgtg gttgtcggca gcctgcagcg
cgacgacggc gggctcgggc ggctcctggg 32700ggtgctggcc gcgctgcacg tgcacggcca
cgacgtcgac tggcgcgcgg tgctggctcc 32760gtggggcgga ggcgtggcgg acttgccgac
ctacgcgttc cagcggcagc gctactggct 32820cgaggcaccg cgcggccggg cagggctgga
gagcggaggg ctcctggccg tgaatcaccc 32880gtggctcagc gcggcggtgc ggctggccga
ccgcgacggc tatgtgctga gcggacggct 32940gtcgacggtc gagcacgcgt gggtcctgga
ccacgtggtg ctgggcacgg tgatcctccc 33000gggcacggcg ttcgtcgagc tggcgctcgc
ggcggccgat gcggtcggac tgccctcggt 33060gtcagagctc acgatcgagg cgccgctggc
gctgccggcg cgaggggcgg tggcgctgca 33120ggtgacggtc gaggcgccgg acgcgacggg
gcggcggggc ttcgcggtct acagccggcc 33180cgacggcgcg cacgacgcgc cgtggacggc
gcacgcgcgc ggcgtgctcg gcgcagcgcc 33240cgcggcggcc acgacggcgt gggcggcggg
cgcgtggccg ccggcggggg ccgagccggt 33300cgacgtcacg cggtgggtcg aggcgctgga
cgcgtgggtc ggcccggcgt tccggggcgt 33360gacggcggcg tggcgcgtgg ggcggtcgat
ctacgccgac ctggcgttgc ccgagggggt 33420ctcggagcgg gcgcaggact tcggcctgca
tccggccttg ctcgatgcag cgctccaggc 33480cctcctgagg gcggagctcg gcgcaggcgc
gtcgccgcgg gagggcatcc cgatgccctt 33540cgcgtggtcg gacgtggcgc tcgaggcgcg
gggggcagcg gcgctgcggg cgcgcgtgga 33600ggtcgaggac gccagcgatg gggaccagct
cgcggcgtcg atcgagctgg ccgacgcgca 33660ggggcagccg gtcgcgcgcg cagggacgtt
ccgggcgcgg tgggcgacgg cggagcacgt 33720gcgcatggct gcggcgggct cgagcgagcg
tgacctgtac cgggtcacgt gggcggacgt 33780ggtgctggaa gaagcggcgt gggcgccgga
ggagcacgtc gtgctcggcg gcgacggcgc 33840gctcgcggcg gcgctgggcg cgcgcacggc
ggcgctgccg gagctcatcg cggcgctgcc 33900ggagggcgcg gccgcgccgc gccggctggt
gatcgacgcg gccgcgggcg accccggcga 33960cggcctggtc gcggcggcgc acgcggcggc
gcagcgggtc ctgtcgctgg tgcaggggtg 34020gctctcggag gcgcggctcg cggacagcga
gctggtggtg gtgacgcgcg gcgctgtggc 34080cgccgggccc gacgacggcg tcgcggcgtt
gagccacgcg ccgctgtggg gactcgtgcg 34140cacggcgcgc caggagaacc ccggccgggc
ggtgcgcctc gtggacctgg ggcccgagcc 34200gctggacgga gcgctcctgc gccgggtggt
ggcggcggcc gaggagccgg agctcgcgct 34260gcgcgggggc gcggcgcgcg cgccacgcct
gcgcgaggtg cgcgcgggcg cggccgacgc 34320ggcgcggccg acgcggctgg atcccggcgg
gacggtgctg atcacgggcg gcaccgggga 34380gctcgggcgg caggtcgcgc ggcacctcgt
ggcgtcgcac ggcgtgcggc acctcgtgct 34440cacgtcgcgg cgcgggatgg gtgcgccgga
cgccgcggcg ctggtggacg agctgcgcgc 34500cgcgggcgcc gcgacggtcg acgtcgcggc
gtgcgacgtc gccgacggcg cggcgctggg 34560ggcggtcatc gcggcgatcc cggctgcaca
ccccctcacg gcggtcgtgc acatggcggg 34620cgtgctggac gacgtcatcg tgacgaagct
ctcggccgag cagctcacgc gcgtgctgcg 34680gccgaagatc gacggcggct ggcacctggc
cgcggcgacg cgaggccatc ggctcgcggc 34740cttcgtgctg ttctcgtcgg cggccggcac
gctgggcagc ccggggcagg cgaactacgc 34800cgcggccaac acgttccttg acgcgctcgc
ggcgcagctc cgcgcgcgcg gcgtgcccgc 34860gatgagcctc gcgtggggct tctgggagca
ggcagggctc ggcatgacgg cgcacctcgg 34920cgcggccgac ctggcacgcc tcaggcggca
gggcatcgcg ccgatcgcgc tcgcgcaggg 34980catgcagctg ctggaccggg cgctcgcgcg
cccggaggcg gcgctggtgc cggcggcgct 35040cgaccttccg gcgctccagc gtgcggcgag
cgacgccggg caggtgccgg cgctgctgcg 35100cgggctcgtg cgcccggcgg tcgggcggcg
cgcggcggcg cctgcggccg ccgcgaccgg 35160agcggcggcg ctgcgcgcgc ggctcgcgcc
gctgcccgag gccgagcggc acgacgtggt 35220gctcgacctg gtgcgcgccg aggcggcggc
cgtgctgcag ctggcggggc cggcgcaggt 35280ccccgcggac aagccgctga aggagctggg
gctcacctcg ctcacggcgg tcgagctgag 35340gaaccgcctc ggcgcgcgcg ccgagacggc
gctgccggcg accctcgcgt tcgaccatcc 35400gacgccgcgc gcgatcgcgg gtctgctgct
tcagcgtgcg ttctcggagc tcgcggcggc 35460ggtggcgacg cgcgcacagg cgccacgcgc
gcagggggcg cacgacgagc cgatcgcgat 35520cgtgtcgatg gcgtgccggc tcccgggcgg
cgtcgatacg cccgcccgga tgtggcagct 35580cctggcggag gggcgggacg cgatcgggcc
gttccccgag gggcgcggct gggacgtggc 35640ggggctgtac gaccccgacc cggacgcgcc
gggcaagtcg gtcaccaacc tgggcggctt 35700cctctacgac gccgaccact tcgatccgac
gttcttcggc atcagcccgc gcgaggccga 35760gcgcatcgac ccgcagcagc ggctgctgct
cgagtgcgcc tgggaggcgc tcgagcgcgc 35820gggcctggcg ccccacacgc tcgaggcgag
cgccaccggc gtctttgtcg ggctggtgta 35880cagcgactac ggcgggcggt tgctggagca
cctcgagtcc ttcgacggct acatcgccac 35940cggcagcttt cccagcgtcg gctcggggcg
catcgcctac acgctggggc tccgcggccc 36000tgcgatgacc gtcgacacgg cgtgctcgtc
gtcgctcgtg tcgctccacc tcgcgtgcat 36060gtcgctccgc gcgggcgagt gcgacatggc
gctcgccggc ggcgccaccg tgatggccac 36120gccgatggcc ttcatcgagt tcagccgcca
gcgcggcatg gcccccgacg cacggtgcaa 36180ggccttcggg gcggaggcga acggcatcgg
ccccgcggag ggctgcggga tcctggtgct 36240caagcggctg tcggacgcgc ggcgcgacgg
cgaccgcgtc ctggcggtga tccgcggctc 36300cgccgtcaac caggacggcc gcagccaggg
gctcaccgcc cccaacggcc cggcccagca 36360ggacgtcatc cgccaggccc tggccgcggc
ggggctcacg cccgccgacg tcgacgccgt 36420cgaggcgcac ggcaccggca cgcgcctcgg
cgatcccatc gaggcgcagg cgttgctggc 36480gacctacggc accgcgcaca cagcggagcg
gccgctctgg ctcggctcga tcaagtcgaa 36540cctcgggcac acgcaggccg ccgcgggggt
tgtggggctg atgaagctcg tgctggcgat 36600gcagcacgcg gagctgccga ggacgctgta
tgcggagccc cgatcgccgc acatcgactg 36660gtcgcagggg cacatcaacc tcctgaacga
gcccgtgccg tggccgcgca ccgacaggcc 36720gcggcgcgcg gcggtctcgt ccttcggcat
cagcggcacc aacgcgcacg tcatcatcga 36780ggaggcgccg gccgaagcgc cggcgacagc
ggcggacgca aagtcggtgg aggcgcttcc 36840gatcctgccg ctgctcctgt cgggtcgcga
cgagccggcg ctgcgcgccc aggccgggcg 36900gctcgccgag cacctgcgcg cccacccggg
cgagcggctg ctcgacatcg ccgcgggcct 36960ggccacgacg cgcacgcacc tcgccacgcg
gctcgcgctg ccggtcgccg cggacgcagc 37020cgcggaggag ctgggcgccc gccttgcgca
gttcgccgcc ggcggcccgg cgcccagcgg 37080cgccgccgtg accgcgccgg ggcagccgcc
cggcaaggtc gcggtgctct tcaccggcca 37140gggcagccag cgcgccggca tggggcgcgc
cctgtacgcc acccaccccg tcttccgcgc 37200cgcgctcgac gccgcatgcg ccgagctcga
ccgccacctc gacaggcccc tccacagcgt 37260cctcttcgca gacgccggca ccgaggccgc
cgcgctgctc gaccagacag gatgggcgca 37320gcccgccctg ttcgctctcg aggtcgcgct
ctaccgacag tgggaggcct ggggtctgcg 37380ccccgagctg ctgctcggcc acagcatcgg
cgagctcgcc gccgcccacg tcgccggcgt 37440gctcgacctc cccgacgcct ccgccctggt
cgccgcccgc ggacggctca tgcaggccct 37500cccccacggc ggcgccatgg cctccatcga
ggccaccgag cacgagctcc tacccctgct 37560cgaccagcac acggggcgcc tctcgctcgc
cgccctcaac gctccacgcc agtcggtcgt 37620cagcggcgac cagcccgccg tcgaccatgt
ctgcgctcac ttcatcgccc tcggccgacg 37680cgccaagcgg ctcgacgtca gccacgcctt
ccactcggcg cacatgcaac ccatgctcga 37740cgccttcgcc agcgtcgccc gcggcctgac
cttccacccg ccacggctgc ccatcgtcag 37800cagcgtcacc ggcgcacgcg ccaccaccga
ccagctcacc tcgcccgact actgggtgca 37860gcaggtgcgc gagcccgtgc gcttcctcga
cgccatgcgc tccctgcacg ccgccggcgc 37920cgccaccttc gtcgagtgcg ggccgcacgg
cgtgctcacc gccgcaggcg ccgagtgcct 37980cgctcccgag ggcgctcgcg acgccggctt
cgtcaccagc ctccgcaagg accgcgacga 38040ggccctcgcc ctggtccacg ccgcctgcgc
cgtccatgtc cgcgggcacg ccctcgactg 38100gctccgcttc ttcgacgcca ccggcgctcg
ccgcgtcgag ctgcccacct acgccttcca 38160gcgacagcgc tactggctcg aggcgccaag
gcctcgcccc agcctcgagg gtgtcggcct 38220caccgccgca aaccacccat ggctcggcgc
cgccgtgcgc ctcgcagacc gcgatggcta 38280cgtcctcagc ggccgcctct ccaccatcga
ccacccgtgg gtcctcgacc acgtggtggc 38340aggcacagtg atcttgccag gaacggcgtt
cgtcgagctg gcgtgggcgg cggccgaggt 38400ggtgggcgcc gccgcggtgt ccgaggtgac
cttcacgacg ccgctcgtgc tgccgccgcg 38460cagcgtggtg gagctgcagg tgaggatcgg
cgagccggac gcgtccgggc ggcggacgtt 38520cgccgcgtac agccgcgcgg acgcggcgat
cgaggcggag tggacgcaac acgcgaccgg 38580cgtgctgagc gcgcaggcgg cggccggggc
cgacgtggcg gacctttcgg tgtggccacc 38640gccgggcgcc gaggtggtgg cgctcgacgg
cggctacgcc tggctggcgg cgcagggcta 38700cggctacggc ccggcgttcc aggcgctgcg
cgaggtgtgg cgcgcgggca cgacgctgta 38760cgcgcgggtc gcgctgccgg acgcggtggc
ggacacggcg cggggcttcg ggatccatcc 38820ggcgctgctc gacgcggtgc tgcactcgtt
gctggcgccg tcggcgcagg aggaggcgtc 38880cgacgacgac aaggtgctgc tggcgttcgc
gttctcggac gtggtgatcg aggcgcgcgg 38940ggcagcggag gtgcgcgtcc gcctgaacaa
gcaggccgga gacgacgggg agggggtcac 39000ggcgtcgatt cacctcgccg acgcgcaggg
gcggccggtc gcgcgcgtgg gggcgttcca 39060ggcgcgggcg acgaccacgg agcgggtgcg
cgcgctcgcg ggcgcgagcg agcgcgacct 39120gcaccgggtc acgtggacgg acgtgacgct
ggaagagacg ccgtgggcgc acgaggacag 39180cgtcgtggtc ggcggcgacg gcgcgctggc
ggcggcgctg ggcgtgcgcg cggtggccgg 39240gctgcccgag ctgctcgcgg gcggcgcggc
ggcgccgcgt cgtctggtga tcgacgcgac 39300cgcgggcgac cccggcgacg gcctggtcgc
ggcgacgcac gcggcgacgc agcggggcct 39360cgcgctcttg cagggatggc tctcggaggc
gcggctcgcg gcgacggagc tggtgctcgt 39420gacgcgcggc gcggcggcgg ccgagccgga
cgagggtgtg gcggcgctga gccacgcgcc 39480gctctggggg ctcgtgcgcg cggcgcgcga
agagcacccg gcgcgcgcgc tgcgccttgt 39540cgacctgggg cgcgaggcgc cggacggggc
gatcctgcgc cgggcgatcg cggcggacga 39600cgagccggag ctcgtggtcc gccgcggggc
gctgcgggcc gcgcgcctga gcctcgccca 39660cgctggcccg gacaccgcgg ggcaagcgac
gcggctggcc cccggcggga cggtgctgat 39720cacgggcggc acgggagagc tcggacggca
ggtcgcgcgg cacctggtgg cggcgcacgg 39780cgttcgccac ctggtgctga cgtcacggcg
cggaatggac gcgcccgacg ccgcggcgct 39840ggtggagtcg ctgcgcgcgg cgggcgccgc
gacggtggag atcgcggcgt gcgacgtggc 39900ggacgggcat gcgctggcgg cggtgctccg
gaccatcccg gcggagcatc cgctgaccgc 39960ggtcgtgcac acggcgggcg tgctcgaaga
cggcgtcgtg accgggctct cggccgagca 40020gctcgcgcgc gtgctgcggc cgaaggtcga
cggcgcctgg cagctctacg aggcgacgaa 40080ggacgcgccg ctcgcggcgt tcatgctctt
ctcgtcggcg gcgggcacgc tgggcagcgc 40140ggggcaggcg aactacgccg ctgcgaacgc
gttcctcgat gcgctggcgg cagagctccg 40200cgcgcgcggc gtgccggcga tgagcctggc
ctggggcttc tgggagcaag gcgggatcgg 40260catgacggcg cacctcggcg ccgccgacat
ggcgcgggtc aagcggcagg gcatcgtacc 40320gatgacggtc gcgcacggcc tgcggctgct
cgaccgcgcg ctggagcggc ccgaggcgac 40380gctggtgccc ctatcgctcg acgtggcggc
gcttcagcgc gcggcgagcg acgccggacg 40440ggtgccggcg ctgctgcgtg gcctggtgcg
cccggcggcc gcccggcgca cggcggcgcc 40500ggcggccgcg gcgacagggc tccgcgcgcg
gctcttgccg ttgtccgagg ccgagcgcca 40560ggacgtcttg ctcgatctgg tgcgcacgga
gatcgcggat atcctcgcgc tgtccgggcc 40620agcggcggtg cctcccgatc aacccatcag
ggagctgggg ctcgattcgc tcacggcggt 40680ggacgttcgg agccggcttg tgcagaggag
cgagatcgac ctcgccgtga ccctcgcgta 40740cgattacccg accgcgcgag cgatcgcggg
acatctgagc gagcagatgg gactcgaagg 40800agcgccggaa gatcgtgagt cggcgctcga
cgagagccag atccgcgccc tgctcatgca 40860gattcctatc cccacgttgc gccagtcggg
gctgctcgga gacctggttc gcctggcctc 40920cccgcaagcg cccccgcgcg aagaaggtga
gagcgagacg ttgagcttcg atcaccttgg 40980aaatgaagag ttcctcagcc tcgcgtcgaa
gctcattgca gaggagggat catgaaccaa 41040gagactgttc ttcggcagac actcgagaag
agtctccaca agatccagca cctcaatcgg 41100gagctcgagc gtctcaaggc gaagtcgagc
gagccgatcg cgatcgtgtc gatggcgtgc 41160cgctacccgg gcggcgtcga cggtcccgca
cggctgtggg agctgctctc ggaggggcgg 41220gacgcgatcg ggccgttccc cgaggggcgc
ggctgggacg tggcggggct gtacgacccc 41280gacccggacg cgccgggcaa gtcggtcacc
acgcagggcg gcttcctcta cgacgccgac 41340cgcttcgatc cgacgttctt cggcatcagc
ccgcgcgagg ccgagcggat ggacccgcag 41400cagcggctgc tgctcgagtg cgcctgggag
gcgctcgagc gcgcgggcgt cgcgccccac 41460acgctcgagg cgagcgccac cggcgtcttc
gtcgggctgg tgtacagcga ctacggcggg 41520cggctgctgg agcacctcga ggtcttcgac
ggctacgtcg ccaccggcag ctttcccagc 41580gtcggctcgg ggcgcatcgc ctatacgctg
gggctccgcg gccctgcggt gaccgtcgac 41640acggcgtgct cgtcgtcgct cgtgtcgctc
cacctcgcgt gcatgtcgct ccgcgcgggc 41700gagtgcgaca tggcgctcgc cggcggcgcc
accgtgatgg ccacgccgat ggccttcatc 41760gagttcagcc gccagcgcgg catggccccg
gacgcacggt gcaaggcctt cggggcggcg 41820gcgaacggca tcggccccgc ggagggctgc
gggatcctgg tgctcaagcg gctgtcggac 41880gcgcggcgcg acggcgaccg cgtcctggca
gtgatccgcg gctccgccgt caaccaggac 41940ggccgcagcc aggggctcac cgcccccaac
ggcccggccc agcaggacgt catccgccag 42000gccctggccg cggcggggct cacgcccgcc
gacgtcgacg ccgtcgaggc gcacggcacc 42060ggcacgcccc tcggcgatcc catcgaggcg
caggcgctgc tggcgaccta cggcaagacg 42120cacacagcgg agcggccgct ctggctcggc
tcgatcaagt ccaacttcgg gcacacgcag 42180gccgccgcag gggtggcggg catcatcaag
ctggtgctgg cgatgcagca cgcggagctg 42240ccgaggacgc tgtatgcgga gccccgatcg
ccgcacgtcg actggtcgca ggggcacgtc 42300aagctcctca acgagcccgt gccgtggccg
cgcaccgaca ggccgcggcg cgcggcggtc 42360tcgtccttcg gcgtcagcgg caccaacgcg
cacgtcatcc tcgaggaggc gccggccgaa 42420gcgcccgcgg ccgcgcaaac agcggcgggg
gtgccgtcga cgctgccgct gctcctgtcg 42480ggtcgcgacg agccggcgct gcgcgcccag
gccgggcggc tcgccgagca cctgcgcgcc 42540cacccggacg agcggctgct cgacatcgcc
gcgggcctgg ccacgacgcg cacgcacctc 42600gccacgcggc tcgcgctgcc ggtcgccgcg
gacgcagccg cggaggagct gagcgcccgc 42660cttgcgcagt tcgccgccgg cggcccggcg
cccagcggcg ccgccgtgac cgcgccgggg 42720cagccgcccg gcaaggtcgc ggtgctcttc
accggccagg gcagccagcg cgccgccatg 42780gggcgcgccc tgtacgccac ccaccccgtc
ttccgcgccg cgctcgacgc cgcatgcgcc 42840gagctcgacc gccacctcga caggcccctc
cacagcgtcc tcttcgcaga cgccggcacc 42900gaggccgccg cgctgctcga ccagacaggc
tgggcacagc ccgccctgtt cgctctcgag 42960gtcgcgctct accgacagtg ggaggcctgg
ggcctgcgcg cccacgcgct gctcggccac 43020agcctcggcg agatcgtcgc cgcccacatc
gccggcgtgc tcgacctccc cgacgcctcc 43080gccctggtcg ccgcccgcgg acggctcatg
caggccctcc cccacggcgg cgccatggcc 43140tccatcgagg ccaccgagca cgagctccta
cccctgctcg accagcacac cggacgcctc 43200tcgctcgccg ccctcaacgc tccacgccag
tcggtcgtca gcggcgacca gcccgccgtc 43260gaccatgtct gcgctcactt caaggccctc
ggccggcgcg ccaagcggct cgacgtcagc 43320cacgccttcc actcggcccg catggaaccc
atgctcgacg ccttcgcccg cgtcgcccgc 43380ggcctgacct accgcgcccc gcgcctgccc
gtcgtgagca atgtcaccgg ccgcatggcc 43440accgccgacg agctcacctc gcccgactac
tgggtgcgcc acgtgcgcga gcccgtgcgc 43500ttcgtcgccg gcgtgcgcgc gctgcacgcc
accggcgtcg ccacctacct cgagtgcggg 43560cccgatccgg tgctcggcgg catggccgca
gactgcctca cctccgacga gagccgcgac 43620ccaggcctga tccccagcct ccgcaaggac
cgcgacgagg ccctcgccat cgcccaggcc 43680gcctgcgccc tgcacgtccg cggacacgcc
ctcgactggc cccgcctctt cgacgccacc 43740ggcgctcgcc gcgtcgagct gccaacctac
gccttccagc ggcagcgcta ctggatcgat 43800gcgccgcggc gcgcggcggg gctcgaaagc
gtcggcctca cggccgcaga ccacccctgg 43860ctgggcgcgg cggtgcggct cgccgaccgg
gacgtctacg tgctgagcgg gcggctgtcg 43920acggtcgacc acccgtggat cctggaccac
gtggtgacgg gcacggcgct gatgccagga 43980acggggttcg tcgagctggc gtgggcgacg
gcccaggcgg tgaacgccgc cgcgatcgcg 44040gagctcaccc tgacgactcc actcgtgttg
ccggcgcgcg gcgcggtgca gctccaggtg 44100acggtcgacg aggccgacgc ggatggccgg
cgggcattcg cgatccacag ccggccgcat 44160gggcccgtcg acctcgagtg gacgcaacac
gcgaccggcg tgctgagcgc ggaggcgccg 44220gcgggagccg acgaggcggc ggggctctcg
gagtggccgc cgccgggcgc ggaggcggtg 44280gcgctcgacg gcgggtatga gcagctgtcc
gagcacggct acggccatgg cccggcgttc 44340caggggctcc gcgggctctg gcgcgcggac
cagacgctgt acgcgcacgt cgcgctgccg 44400gacgctgtcg cgggcacgga gcagggcttc
gggctccatc cggcgctctt cgatgcggcg 44460ctgcagtcgc tggcgcggct gtcgcgcgag
gaggcggccg ctggcgaccc ggtgctggtg 44520ccgttcgcgt ggacggacgt ggcgctgtac
gcggccggcg cgaccgagct gcgggcgcgc 44580atcgcgctgg agcaggcgga gggcggcgcg
ccggcggtgg cgtcgctgct gctggccgac 44640gcgcacggac gaaccgtggc gacgacaggg
cgggtgcgcg gggcgagcgc ggcgcagacg 44700cggtccgccg cgagccgtgc ggagccgatg
tacagggtcg cgtggacgga cgtggcgctg 44760gaggcggcgg cgtgggcgcc cgaagagcac
gtcgtgctcg gcggtgacgg tgcgctggcg 44820tcggcgctgg gcgtgcgcgc ggcggccggg
ctgccggagc tgctcgaggc gctggcggac 44880ggcgcggccg cgccgcggcg gcttgtcgtg
gacctgacgg cgggcgacgc gggcgctgtc 44940gtcgcggccg tgcacgccgc ggcgcgcggc
gcgctggccc tggtgcaggg atggctcgcc 45000gcgccgcagc tgacggcgac ggagctcctc
gtggtgacgc gctgcgccgt ggcgacaggg 45060ccggacgagg gcgttgacgc gctggggccg
gcggccgtct gggggctgct gcgggccacg 45120cgcgccgagc accccgaccg cgcggtccgg
gtgctggacc tggggcgcga gccgctggac 45180ggggcgctcc tgcgcagggc gctggccgcg
gtggcggagc cggagctgtc gttgcgccgc 45240ggcgaggcgc gcgcgcctcg cctgcgcgag
gcaaagcccg ccgcggcgcc ggcgacacgg 45300ctggaccctg aagggacggt gctggtcacg
ggcggcaccg gggagctggg gcggcaggtc 45360gcccggcacc tggtggcggc gcacggcgtg
cggcacctcg tgctgacgtc gcggcgcggg 45420atggacgcgc ccgacgccgc ggcgctggta
gaagagctgc gcgcggcggg cgcggcgacg 45480gtcgacgtcg ccgcgtgcga cgtcgccgct
ggcccggccc tggcggcggt cgtggaggcg 45540atcccggcgg cgcatcccct gaccgcggtc
gtgcacatgg cgggcgtgct ggacgacggc 45600atcgtgacga agctctcggc cgagcagctc
acgcgcgtgc tgcggccgaa ggtcgacggc 45660gccattcatc tccacgagct cacgaagcac
gcgccgctcg cggccttcgt gatgttctcg 45720tccgcggcgg gcacgctggg cagcccgggg
caggcgaact acacggcggc caacgtgttc 45780ctggacgcgc tggcggcgcg actgcgcgcg
cgcggcgtgc ccgcgatgag cctggcgtgg 45840ggcttctggg agcaaggcgg gatcggcatg
acggcgcacc tcggcgccgc cgatcgggcg 45900cggatgaagc gacacggcgt cgtggcgatg
tcggtcgcgc agggcctgcg gctgctcgat 45960cgcgcgctcg cgcaccccga ggcggcgctg
gtgccgctcg cgctcgacct ctcgtcgctg 46020cacgcggggg ccagcggcgc cggaccggtg
ccgccgctgc tgcgcgggct ggtacgcgcg 46080cccgccggcc ggcgcacggc ggcgtccgcg
gcccggacga acgggaaggg cacggcattg 46140gcggcgctcc gcgcgcggct cttgccgttg
ccgcaggccg agcgcgagga cctcttgctc 46200gagctcgtgt gcaccgaggt cgcggaggtg
ctgcagttgc cggggccggc gcacgtcccg 46260gcggatcagc cgctccgcga cctggggctc
gactcgctca tgaccgtgga gctgcgcaac 46320cgtctcggcg cgcgcgccga gacgacgctg
cccaccacgc tcgcgttcga ctacccgacg 46380cccagggccc ttgcgtccta tctggagacg
ttgctcggca tctccgacga gaacgggcat 46440tcgggtgagt tgctgcacgt tccgcagaac
gaggacgaga tccgctccgc gatagcgcgc 46500atcccgatag cgaccctgcg cgaggcgggg
ctcctccaga gcttgctgcg gctcgccccc 46560ggcaaggcgg tggccggtga cgtcacgcac
ccggtcgatg agctgctggt cgagcacatc 46620gaggatgaag agctgcttcg actcgctttc
gaggccaccg gaggtatcaa gtgaaagacg 46680aggctctctc gtttcgccga gccctggaga
agacggtcgt cgagatccgc cgtctcaatc 46740gggagatcga cgacctgcgg gcgaagtcga
gcgagcccat cgcgatcgtg tcgatggcgt 46800gccggttccc cggcggcgtc gagaaccccg
aggcattgtg gcggctggtc tccgaggggc 46860aggacgcgat cgggccgttc cccgaggggc
gcggctggga cgtggcgggg ctgtacgacc 46920ccgacccgga tgtgccgggc aagtcgatca
ccgcgcgggg cggcttcctc tacgacgccg 46980atcgcttcga tccggagttc ttcggcatca
gcccgcgcga ggccgagcgc atcgatccgc 47040agcagcggct gctgctcgag tgcgcctggg
aggcgctcga gcgcgcgggc gtcgcgcccc 47100acacgaagga ggcgagcgcc accggcgtct
tcgtcgggct gatgtacacg gactacggcc 47160tgcggctgct gaaccacccc gaggccctcg
acggctacat cggcatcggc agcacgggga 47220gcacgggctc ggggcgcatc gcctacacgc
tgggcctgca gggacctgcg atcacggtgg 47280acacggcgtg ctcgtcatcg ctcgtggcgc
tccacatggc ctgcgcgtcc ctgcgcgggg 47340gagagtgcaa cctggcgctt gtcggaggcg
tcgccgtgat gacgacgccg acaacgttca 47400tcgagttcag ccggcagcgg ggcctctcgc
tcgacggccg gtgcaagtca ttcggtgccg 47460aggccgaggg cgtcggctgg ggcgaaggct
gcggaatcct ggcgctgaag cggctgtcgg 47520acgcgcggcg cgacggcgac cgcgtgctcg
cgatcatccg cggctccgcc gtcaaccagg 47580acggccgcag ccaggggttc accgccccca
acggcccgag ccagagggcg gtcatccagc 47640gggcgctggc ggcggcgggg ctgaccgcgg
cggacgtcga cgccgtcgag gggcacggca 47700ccggcacgcg cctcggcgac cccatcgagg
cgcaggcgct gctggcgacc tacggcaagg 47760cgcacacagc ggagcggccg ctctggctcg
gctcgatcaa gtccaacttc gggcacacgc 47820aggccgccgc aggggtggcg ggcatcatca
agctggtgct ggcgatgcag cacgcggagc 47880tcccgaggac gctgcacgcc gacacgccct
cgccgcacgt cgactggtcg caggggcacg 47940tcaagctcct caacgagccc gtgccgtggc
cgcgcaccga caggccgcgg cgcgcggcgg 48000tctcgtcctt cggcatcagc ggcaccaacg
cgcacgtcat cctcgaggag gcgccggccg 48060aagcgcccgc ggccgcgcaa acaccagcgg
cggcgggggt gccgtcaacg ctgccgctgc 48120tcctgtcggg tcgcgacgag ccggcgctgc
gcgcccaggc cgggcggctc gccgagcacc 48180tgcgcgccca cccgggcgag cggctgctcg
acatcgccgc gggcctggcc acgacgcgca 48240cgcacctcgc cacgcggctc gcgctgccgg
tcgccgcgga cgcagccgcg gaggagctga 48300gcgcccgcct tgcgcagttc gccgccggcg
gcccggcgcc cagcggcgcc gccgtgaccg 48360cgccggggca gccgcccggc aaggtcgcgg
tgctcttcac cggccagggc agccagcgcg 48420ccgccatggg gcgcgccctg tacgccaccc
accccgtctt ccgcgccgcg ctcgacgccg 48480catgcgccga gctcgaccgc cacctcgaca
ggcccctcca cagcgtcctc ttcgcagacg 48540ccggcaccga ggccgccgcg ctgctcgacc
agacaggctg ggcacagccc gccctgttcg 48600ctctcgaggt cgcgctctac cgacagtggg
aggcctgggg cctgcgcgcc cacgcgctgc 48660tcggccacag cctcggcgag atcgtcgccg
cccacatcgc cggcgtgttc gacctccccg 48720acgcctccgc cctggtcgcc gcccgcggac
ggctcatgca ggccctcccc cacggcggcg 48780ccatggcctc catcgaggcc accgagcacg
agctcctacc cctgctcgac cagcacaccg 48840gacgcctctc gctcgccgcc ctcaacgctc
cacgccagtc ggtcgtcagc ggcgaccagc 48900ccgccgtcga ccaggtctgc gcccacttca
aggccctcgg ccggcgcgcc aagcggctcg 48960acgtcagcca cgccttccac tcggcccgca
tggaacccat gctcgacgcc ttcgcccgcg 49020tcgcccgcgg cctgacctac cgcgccccgc
gcctgcccgt cgtgagcaat gtcaccggcc 49080gcatggccac cgccgacgag ctcacctcgc
ccgactactg ggtgcgccac gtgcgcgagc 49140ccgtgcgctt cgtcgccggc gtgcgcgcgc
tgcacgccac cggcgtcgcc acctacctcg 49200agtgcgggcc cgatccggtg ctcggcggca
tggccgcaga ctgcctcacc tccgacgaga 49260gccgcgaccc aggcctgatc cccagcctcc
gcaaggaccg cgacgaggcc ctcgccatcg 49320cccaggccgc ctgcgccctg cacgtccgcg
gacacgccct cgactggccc cgcctcttcg 49380acgccaccgg cgctcgccgc gtcgagctgc
caacctacgc cttccagcgg cagcgctact 49440ggctcgagac gccccagacg ccgggcgccg
acggggcctc caacctatct tcgcccgccg 49500aaagccgctt ctgggaggct gtcgagagag
cggacatcat ccccctcgcc gaggcgctgc 49560gcctcgagga tgaggcgcaa cgcgcttcgc
tggcgaccct gctgcccgcg ctctcgacct 49620ggcgccgccg acgccacgag cagagcaccg
ccgacgcctg gcgttaccgc gttgcctgga 49680aaccccttgc catcgacgcc cggagcgatc
tctcgggggt ctggctgttc ctcgcgcctc 49740cggatcacgc gaaggacgac ctcgcgcgcg
cggtccttcg cgcgctcgcc gagagcggcg 49800cgacggtcgt ccctgtgctg gtggccgagg
gcgacgtcga ccgcgccctc ctgagcgcgc 49860ggctgcgcga gcaggtcggc gacggcggcg
cgatccgcgg cgtgatctcg ctcctcgccc 49920tggacgagac ctcgctgccg cagcacgacg
ggctgccccg gggcctcgcc ttcacgctcg 49980cgctcgtcca ggccctggga gacacggcga
tcgcagcgcc tctatggctg ctcacccgtg 50040gcgccgtctc cgtgggtcgt tccgaccgcc
tcgagcgccc gctgcaggcg ctgacgtggg 50100gcctcgggcg cgtggtggcg ctggagcacc
ccgagcgctg gggtggactc atcgatctcg 50160ccggcgcgct cgacgaaaag gcgctcaagc
ggctcgtcgc cgccctcggt ggtcgcgacg 50220ccgaggatca gctcgccctg cgcccctccg
gactcttcgc gcgacggctg gtcagagcgc 50280ccctgggtga agcgaccgcg gttcgcgcct
ggaaggcgcg cggcaccgcg ctcgtcaccg 50340gcggcacggg ggacctgggc gcccacgtcg
cccggtggct cgcccagaat ggcgccgagc 50400acctcgtcct caccagccgc cgcggacagg
acgcccccgg agcggccgag ctcacggccg 50460agctcacggc gctcggcgcc cgcgtcacca
tcgccgcctg cgactcgtcc gaccgacagg 50520cgctcgcggc cctgctccag cgcctgaggg
ccgaaggccc ccccctccgc gccgtcgtcc 50580acgctgcggg tgtcgaccag gtcaccccgc
tggccaggac cagcctggcc gagttcgcag 50640gcatcgcctc cggcaaggtc gcaggtgctc
ggcacctcga cgacttgctc ggcaatgccc 50700ccctcgacgc cttcatcctc ttctcctcgg
tcgcaggcgt ctgggggagc ggctttcagg 50760gcgcttacgc ggcggccaac gccttcctgg
acgcgctggc cgagcagcgc cgcgccctgg 50820gctcgacggc cacgtcgatc gcctggggcc
tctggggcgg caaaagcatg gccgacgacg 50880ccgccaaaga tcatctcagc aagcgcggcg
tgtccccgat gccgccccag ctcgcgatcg 50940cggccctgca gcgggcgctc gaccacgacg
agaccacact caccctcgcc gacgtcaact 51000ggtcacgctt tgccccggcc tttgccgccg
cccgcccgcg cccgttgctg cacgatctcc 51060cggaagcccg gagcgctctc gagtccccct
cgccggcgcc ccgcgaggcc gagctgctca 51120cccggctcca gggcctctcc agcaccgagc
gcgtccgcca cctcgtctcc ctcgtgctgg 51180cggagaccgc cgtcgtcctc ggccatcctg
acgcctcccg cctcgaccct cacacaggct 51240tcgcggatct cggcctcgac tcgctgatgg
ccgtcgagat gcgccggcgg ctccagcagg 51300caacgggggt gagcctgccg gcgaccctga
ccttcgacca cccctcgccc caccacatcg 51360cgaccttcct cctcgacgag gtcttcgcgc
cggccctcgg ccaggccccc ggcgccgagg 51420aagacgaagc gatcgcccag gccgggctcg
cctcgggcga cgagcccgtc gccctcatcg 51480gcgtggggct gcgtctcccc ggcggagcca
ccgacctcga cgggctctgg cgccttctgg 51540agcaggggat cgacgttgtc ggccccgtcc
ctgaagaccg cggctggagc atggacgagc 51600tctacgatcc cgaccccgac tccctcggca
agagctacgt gcgcgaagcg gctttcctcg 51660atcgcatcga cctcttcgac gcgggcttct
tcggcatcag cccccgcgag gcgagccacg 51720tggacccgca gcaccgcctc ctgctcgagg
ccgcgtggca ggccctcgag cacgcaggca 51780tcgtcccggc ctcgctccag gactcccaga
ccggcgtctt cgtgggctca ggcccgagcg 51840actacgcctt gctccacaac ccggcccagg
aggatgaagc ctacaggctt acggggacgc 51900agccctcgtt cgcgccaggc cggctctcgt
tcagcctggg attgcaggga ccggcgctct 51960ccgtggacac cgcctgctcc tcctcgctcg
tcgcgctcca cctcgccgcc caggccctgc 52020gccgcggcga gtgcgggctc gccctcgtcg
gcagcgcgca ggtgatggct gctcccgacg 52080ccttcgtgac gctctcccgc gctcgcgcca
tcgctcccga cggccgctcg aagaccttct 52140ccgcccaggc cgatggctac ggccgcggcg
agggggtcat cgtcttcgtc ctcgagcgcc 52200tgagcgacgc ccgcgcgaga gggcgcgacg
tcctcgcggt cctccgcggc agcgccgtca 52260accacgacgg cgccagcagc ggcatcaccg
cgccgaacgg cacctcccag cagaaggtgc 52320ttcgtgccgc gctccacgat gcgcggctca
cgccagcgga cgtcgacgtg gtggagtgcc 52380acggcacggg cacttccctc ggcgacccca
tcgaggtgca agccctggcc gccgtctacg 52440gaaaggagcg ctccgccgat cggccgctga
tgctcggcgc gctcaagacc aacgtcggcc 52500acctcgaggc cgcgtccggt ctcgccggcg
tcgcgaaggt cgtcgcggcg ttgcgccacg 52560aggcgctgcc ggcgacgctg cacaccgccg
cgcgcaaccc tcatatccag tgggatacgc 52620tgcccgtcca ggtcgtcgac accttgcgtc
cctggccgcg gcgcgaggac ggcacccccc 52680gccgcgccgg cgtgtcggcg ttcgggctct
ccggcaccaa cgcccacgtc ctcctcgagg 52740aagctccgcc tgtccagccg agcacacagg
cggagcagcc tgccgcgccg ccgtggttgc 52800cgctgctcct gtcgggcaag acggacgcgg
ccctgcgagc gcaggccgag cggctgcggg 52860cgcacctcga cgcccatgcc gacctcgggc
ttgccgacgt cgcctattcc ctcgccacga 52920cgcggacgca tttcgcgcat cgggcggtgg
tcgtcgcgga cgctggcgcg accctcttcg 52980aagggctgga cgccatcgcg cgcggcaacg
ccgcttccca cgtggtggtc gacgaggcca 53040agatcgacgg caagaccgtc ttcgtcttcc
cgggacaggg ctcgcagtgg gcccagatgg 53100cgcagccgct gctcgagacc tccgagctct
ttcgcgagcg tatcgaggcg tgcgcgcacg 53160ccctcgcgcc tcacgtcgac tggtcgctgc
tcgccgtcct ccgcggcgaa gaaggcgccc 53220cctcactgga gcgggtcgac gtggtgcagc
cggtgctctt cgccgtgatg gtctcgctcg 53280ctgccctctg gcgctcgatg ggcgtcgagc
cggacgccgt cgtcggccat agccagggcg 53340agatcgccgc cgcctgcgtg gcgggcgcgc
tgtcgctcgc ggacgccgcc aaggtggtgg 53400cgctgcgcag ccgcgcgctc gcgcggctcg
ccggccgggg cgccatggcc gtcgtggagc 53460tccccgccgc cgagctcgcc gagcgcatga
agcgctgggg cgagcggctg tccatcgcag 53520cgctcaacag ccctcgttcc accgtgatct
ccggcgatcc ggacgccgtc gacgcgctgc 53580tccgggagct cgactcggcg gagatcttcg
cccgcaaggt gcgcgtcgac tacgcctccc 53640actgctccca tgtggaggcg attcgccacc
agctcctggc cgagctcgcg ggcatcgagc 53700cgctcccgtc cacgctcccg ctctactcca
cggtgagcgg ggacaagctc gatggcgtcg 53760cgctcgacgc ctcgtactgg taccggaacc
tccggcagac cgtccgcttc tcggacgcca 53820cgcagcggct cgtctccgcg ggacatcgct
tcttcgtcga ggtcagcccg catccggtgc 53880tgacgttcgc cgtgcaggat gtcctcgatg
ccgagggggt gcccgccgct gtcgtcggct 53940cgctacggcg cggcgagggc gacctgcggc
ggttccttgt gtcgctgtcc gagctcttca 54000cccgcggcct cgccctggat tggtccaggg
ttctgcccag cggccggcgc gtatcgctgc 54060ccacctacgc cttccagcgc gagcgctact
ggctcggggc tcacagggct cgcggcaccg 54120acgcgacatc cgccggcctg gcatcggacg
agcccacgcg cggcgcgtcg atgccagtgc 54180ggctctcgtt gcgggacgtg ccgcccgagg
agcgccaggg agcgctggag cggttcgtcc 54240gggagcagct cgcggccgtc ctgcgcatgg
atgcggcgcg gatcgagggg cagacgacga 54300tcaagacgct cgggatcgac tcgctcatgg
cgctcgagat ccgcaaacgg ctggaagccg 54360gactggccgt gaccttgcca tcgacgctca
tctggcagtt cccgcacgcc gaagggctcg 54420cacggcacct catgacgcgg ctccccgcgg
gggacggaga aggatctgcc gtggtccagc 54480ccgtggagca gccgcgcgcg ccgaaggagg
tgcccgtatc catggatccc tcggcgtggg 54540tgcaccgccc gcgccccagg gccgacgcgc
gcgttcgact gttctgcctt ccctacgccg 54600gcgcgggcgc ctcgcgcttc cgggcgtggc
cagagctgct cccctcctgg gtggaggtct 54660gcccgatcca gctccccggc agggaagagc
gcctccacga gccggccttc gagacgatgg 54720acgcgctcgt cgacgcgctc gttcccgccg
tcgaggcgca catcgatcgg ccctttgcgc 54780tgttcggctg cagcatgggt gccctcctgg
ccttcgagct cgcccgggcg cttcaatccc 54840gtcatcgctt ggtggcgcgg catctgttcg
gcgcggcgag ctcctcacct cggcgcgtga 54900gcccggtacg ggagcagctc tccgcggtgg
tctcccctgg aacggtgcga tcggacgcga 54960tggcctcgct gcgccagctc ggtctgctgt
cgtcctcgtc cctccaggac gaagagatgc 55020tggacgaggt gtggcccgcg ttccgtgcgg
atctatccct gacgctgaag tacacgtgca 55080gggacgcaac ccccctcgac gcccccatct
cggtcttcgg gggcaccgag gaccggaccg 55140tagggcgcga ggatctcgtc gcctggcata
cgctgacgaa ggacgcgttc caggtcgcca 55200tgctgcccgg gggtcacctg ttcatggacg
cgacgccgaa gcggctcttc catcacatcg 55260agcacgcgct ccagctctag tggaccgtcc
gacaggccct tcgacatcgt cctcggcgga 55320gggcggcgac tccgcgcgga gagcgagccg
cgatcgcgcg gcgccgtcca cgatcttcct 55380gggatttttt ttggacagtt caccagaagc
tgcgggatac caaacagaag cgaccatggg 55440aagcaacgaa gggagtatcg cttgacgatc
aacgacgagg tgcggaccag cgacgccgtg 55500tgggctggtg ccgcgggcta taccagggcg
cgtcttcagg tctatgactt cttcatctac 55560ggcttcaaca gccctgtcgc atggaagtgc
ccgggcgagg agctcctcga gaactacaat 55620cggcacgtct cgggcaatca cctcgacgtc
ggcgtgggga cggggtacct gctcgaccgc 55680tgccgcttcc ccaccgccaa gccgcgtgtg
tttctgatgg atctgaaccc ggacgctctg 55740caggtgacgg cgcagcgact gcaccgcttt
cagcctcaga ccttgcggcg gaacgtcctt 55800gatcccatcc gcttcgacgg agagcccttc
gactccatcg ggatgaacta cctcatgcac 55860tgcgtccctg gatccatccc ggagaaggcc
gtgatgttcg accacctgag cgccttgctg 55920aagccgggcg gcgtgatctt cggcagcacg
gtgctctcgg agggcgtgga caaggggatc 55980gtggcgcgag ccatcatgga ccgcttcaac
aagaagggga tcttctcgaa cacccgagac 56040gccgcctccg atctgacgcg agcgctggag
gagcgcttcg acgacgtctc ggtccgcgtc 56100gtcggctgcg tcgggctgtt ctcagccagg
aagcgtacct gcgcgggaac cgagtcgccg 56160gcgtgaggtg agcggggacg gcgctcaggg
cgcggcgagc ggcagcctgc gtgccgggcg 56220cgcggcctcg tgtccgtccc ccgcctcggc
cacccgcccg cggtagatgc gatcgatccg 56280atcgcgcgcg atgaccaggg gcttgtcgaa
ccggccaagc acgttgccct tcaggatccc 56340gcgcttgtcc gtcaagcggt ccagcaaccg
catatcgagg cgcagctcga tgttcatggc 56400cacctgcatg gcgggccaga ggacggcgcc
ggccccgaac ttgctccagg gcgccaggct 56460cgcgaagaga aacgtataca tctccgacga
ctccgggccc accgggttga agaagaccgc 56520tgaccggagc gggaaggtga cgggctgatt
ggtcttcgga tccctgaggg agtggttgta 56580gatcgtgtag accggcgaga agtaggatgt
ccagtccacc acgaatatcg catcctccgg 56640gatgccgagc agcttctcca tcgcccgcgg
catgggccgc ctcggacccg aatgcacgac 56700ccggatcgtt tcgtcggtca gggtcacccg
cgcctcgacc tctggcatcc gctcgagcgg 56760gtagccgagc atgaagtgga cgaagggcgt
gtgctcgatc tcgatgaaat tgtcgagcgc 56820cagctcgaac ggcacggtcg cgcggtggcg
gaggagaccg cgcggcacat atccctcgcc 56880ctcgaggcgc gggaacgctg cctgcgaccc
cgcccgcttc acccagatgg caccgtaccg 56940ctccacggcc tcgaacatgt cctcgcgccg
cgcgcacggc cgcgccgccg gggtagccgg 57000gatctcgccg cggccgtcca cggcccaacg
ccagccatgg taggcgcaca ccagccgatc 57060gccctcgacc cacccctcgc tcaggcgcat
gctgcggtgg gggcaacgat ccgtgaatgc 57120accgaggccg cccgacgagg tccgaaacac
cacgatctca tgccccgcga gccgcacatt 57180gcggggcttg cggcggagct cgtggctcag
cagtacaggg tgccagtggt cgagctcagc 57240catgatcagt tcaccccttg gatgtgccgc
gcaatccgcg gcgcctcggc tgcgatgtcg 57300cggatctgcc ccgtgatggg attgcggaag
ccgatgaaga acagcccagg cgccggcgtc 57360ggcgcgccgt gccaccgcgg gcagccgtgc
tcgtccgtgt agcgcgttgc attctcgaga 57420aaatcatcga gcccgggccg gtaccccgtg
gcgagcacca cgacgtcgaa gggcagccca 57480cggccgtccg tgaacgtcac gcccgtttcc
gtgaatgccc gcgggccggg caccaccttg 57540atcttgccct gctggatcag cgccaccgtg
ccgatgtcga tcaacggcat gcggccttcc 57600ttcaacgccc gggtaccggg gccgaccgcg
ggccgacgga tcccccagcg cgacagatcc 57660cccacggcgc gagacaggat cgcggtcgcg
aggcgatccc cgacggccag cgggaggcgc 57720tcgaagaggg caagggcgtt gaactgcgca
ggcagcttga acagctcgcg ggggatcacg 57780tggttgccgc tgcggaccga gagggtcgtc
tccgcgcaat gctcccacag atccagcgcg 57840atctcgctgg cggagttgcc ggcgcccacc
acgagcacgc gctggccccg gaattccgca 57900ccagatcggt aggcagagct atgaaggatg
cgaccgcgga agcgctcctg gtcgggccag 57960gtggggacgt tgggatgacg gctgtagccg
gtggccacga cgagcgcctg gctcctgagc 58020tcccccgcgt gcgttcgggt cacccaccgc
gatccgtcgt ggtacgcgcg ctccacctcg 58080acacccaggc gcggctccag gcggaatcgc
tcggcgtaac gctcgaggta atcgaccatc 58140tccacccggg agggatacgg cgcagaatac
tcgggccagg gctgcccggg cagcgcggag 58200agctgcttga tcgtgttgag gtgcagccgg
tcgtagtggc gccgccacgt ggcgccgacg 58260gcctccgact tctcgaggag aacgaacggg
attccctgct cgcgcaggca tgcgcccacc 58320gctagcccag acggaccagc gccgacgata
accacatggc actcttcaac gtgcacgcat 58380gaagtctaac caaaattcgc cccggatgcc
aactccactt gtgcgggcgt cgcttccggc 58440aactcgtatg ctggtgagcg gcttcggatc
gtgatggaaa gctctgagct cgcccgcagc 58500tccggagatc ccccgcgtct tcgcgaggag
cctggcggac gcgcgcgccc cgcgagcgga 58560cacggcgacg ctacagcgcg cggacgtcac
gcactcgcat gcccgacgcc cgtgccttct 58620gcctcgcccc gcgtctcgcc gaagtagatg
gagcgcatca ggcggtggtt gtgcacgagc 58680gtcgcgtcgt atttgttgag ccgcatccct
ttcatctcga agggcgtatc ggccacgtgc 58740gggatgaact tcacatcgtc gcggatctcc
ttccaggaga gcgctatcgc cgccgatttg 58800acgaccggaa gcagcggacg gaagcgggga
tcggtgatct tgacgaacag gaacgcgcgc 58860acgaacgtgg tgcgctccgt ctctggcacg
aagaagatgc cggcgcgcgc cacgacagga 58920cgctccatcc cgttctgcgc cgtccaccag
gacgtgtaca cggtgtagac ggggctgaag 58980cgggtcaccc actggttgtg aaatgtgtcg
cctggctgga gcagcatcag ccgcgcgagc 59040gtcgaggggc gctgcggcgc cgagtacttg
acctcggtgc ggtcctcgaa gacgtcgcac 59100gagaagtcga tgcgcgccgc gtcctcgggc
gtccagccga ggcggccgtg aacgaacggc 59160gtgtgctcgt cctcggagga attgtcgaag
atgacgtgca ggggcgccgg cgcgaggtgc 59220gagaaggtgc cggcatattc gaagccatcg
ctgctgaagt cgagctcggg cagcgccgag 59280cgcggcgtat cccggtgggc tagccacagg
tatccaagct gctcgacgag ctgaaaggag 59340cgtgtatcgc atcgggtgag cgacggttgc
gaggggcagg ctccccgccc ctcggcgtcg 59400aaatgccacc cgtgataggg gcattccagg
cgcccgtccg gccggacacg cccctgcgat 59460agcggcgcga gccggtgggg gcacgcatcg
gcgagcgcgg cggggcggcc ctgctcatcg 59520cggaagagag cgtaagcatt gcccgcaagg
acaacgcgaa ccggcttccg gccgagtttc 59580gaggccggca agacggggtg aaaatggcgg
atgaggtcgc gagcaggcgc ggcgtgcatt 59640gcgagaccat aacacatccg cgacgccggt
tggaaggagc tcccgcgcgc gcgcgacgcc 59700gatccgcttc cgaaacctcc tgcgcgatgg
cgtcgagcga ccgaagtacg aggatctcct 59760atcggtaggc gacgatgcca ccgaacggcc
acttcgcgtg gtcctcgggc gccggataga 59820cctcccattc ggagaacccg gccgcgcgga
gcgacagctc ccactcttgc agcgtcaggt 59880aaccgacatg ctggcggcga ggcggatcga
gcttggcctt gctgtaggtg tgcagcatcg 59940actgaaaaaa ttcattgggg aagaacaccc
cgggccgatc gcggaacgac atggtgaacg 60000cgagctgacc gcccggcttc agcatcgtgt
ggaacgcctg gagggtggcg tgaagatcgc 60060gcacgtcgta gagcacgtgc tcgaggacga
tcagatcgac cgacgcggcc cgggcgaacg 60120tgctgccagc ggagggcagc gtgtccaggt
ccaggcgctg gaaatgaatg cgctgaaaca 60180cgtcggccgg cgcgtgggtc cgcagccact
gcttccccgt ctccatcaac agggcgctga 60240tgtcggtgta atcgtagcgg gcgaggttct
tgctcagcgg gaggaaccgc ggatcggaca 60300acgcctgccg cagcaccacg ccgagccccg
cgcccccctc gaatacagag atccccggcc 60360cctctgcgag cttggccatc agcgcccgcg
ccagcatcac gttgcatggc ttcttggcgg 60420gaaggctgat catcgagtat tcccagaatt
tcagcgaggc ctgcatcccg tactggagat 60480ccatggtggc cagcgcgtcc ttgcccgcca
gcaccggccc ggccaggccc cgatagcgct 60540ggaggaactc gaccatctcg cccaggatcg
cgcggtctgc gagcgcgatg gactccttct 60600cggcgacgcg ctttcgcacc gcctcgctgg
gcaccagccg cccgctgggg tcctgggtga 60660ggtctccctt gtcgctgaag tagtcgagca
gcttcctgcg aaactgatag gcggtgaccg 60720acggagccga ctccggacga tcgtcgagcc
cccggacagc gccgctcggg tcgacgaggt 60780gctcgagcag gatctcgctg gcaacaagct
cggtctgacg acggaatgct tctatgtaag 60840cggtgtaagc gtcgttgtag agatcggtca
cgtccaatcg ttgtcgcatg caggtcctcg 60900cgggtgtggc gcccatcctg cgcagcgcag
ggacgaagca ggtcatggaa tggtccagct 60960cgcccgggaa cgcaaggacg gaccgtccgc
tgccggcggg cgccgcgcct ccgagcgcct 61020cgcgcgccgc gcgtcacctg gagctcagcg
cctgcccgtc gttcccgcgg ttcttgtgca 61080caatggcgta caggatgagc atgtaggcga
agagccggaa caggtacagg taatggatgg 61140cgtcttcctc gacgcgattc agggcgacgg
cgatgcggcc cagcatcatc agccagaacg 61200ccgccgagaa cttcgcgaac agccggtcgc
ccgtcttctt ccagaagcgg aggaagaaga 61260gcgcgacggt cgcgtacccg aacgtcatcg
aaccgatcag gaagtcgttc aaaggtccta 61320cctcgcctct acgcgcgttt actcgcgcag
gtcccagatg aggccataaa ggagcagggc 61380cagcccgatg agcgcggtga ggtggcgcag
cgatgataga tcgacgctcc ggatcacgac 61440gaggtccacg aagagcagga tgttgttcgc
tgcgagcgcg gcgaagcaga gcccgctcca 61500caagaggaga cggaccttgc gctgcgcgta
tccgcgcagg agcagcacgg cgcacgcgat 61560gctggtcagg gcgcagagga tgtagaccgc
cgctgccatg gctagccgcc ctttcccttc 61620ttcgtgatca ggaatgcgtc cgagaagctc
tggatgtcgc tcggcggggg cgtggcgtag 61680atgtgattga tcacgctcag ccggcgctcc
ttgtacgcct gcgccaggtc gtcgatcgtc 61740cggcgggtct catcgtctgc cggggcgtac
cggtagaaga tgtcctcccc gtcctcccgg 61800gccacgatca ggcccctgct ggccaggcct
ccgaaccggt cctggatcga catcatgctg 61860gaccctatct cgcgcgccat cgcggccgcg
ctccactcgc gctccgccgt gcgacgcatg 61920agcagaagca cttcgagttg ctcgatcgag
gagatgtgcg cgccgaggaa gcgctggacc 61980cggtcgggga gcccgctaga cacgagctcc
tcgccggccg agggtccctc cggtcaccgg 62040tgcaaccata gccgcagcat agcgagcagg
tgctcgggat ccaccggctt cgagatgtaa 62100tcgttcgcgc ccgcctcgaa gcacttctcc
cggtcgccct tcatcgcctt ggccgtgacc 62160gcgatgatgg gcagcgcatg gtgctcgggc
ttcgcgcgga tggcacggat cgtgtcgtag 62220ccgtccatct ctggcatcat gatgtccatg
agcacgatct cgatgtccgg cgtccgctgc 62280agcatctcga tcgccgctct gcccgtctcc
acgtagaccg tcttcatctg ctgggcgtcg 62340aggatggtcg tcatcgcgaa gatgttccgg
acgtcgtcgt cgacgaccag caccttcttg 62400cccgcgagca ccttgttcga ctggtgcagc
tcctggaggg tctgccgctg tcgctcggag 62460agcgccgcca cagggcggtg caggaacagg
gagacgtcgt cgaagagccg ctccttggag 62520cggacgtgct tgagcaccat cagctggctg
aagcggctca gctgcgcctc gtccgcggcc 62580gagatctcct ccggcgcgta gaccaggacg
ggcagctccg tcggcccgct gccctgcgcg 62640agctgcccga tcagatcgaa gcagcgcatg
tcgggcaggt cgaggtgcag gatgaggaca 62700tcggccccct cggtgaggag cgcgtcgagc
gcctcctccc cggaggccac gctccggatc 62760gtgacgtcgt cgccgccgag gagctcgacg
agctcctggc gctcggcctc gtccggctcg 62820gcgagcacga ccgtccgccg gcgcgacacc
atgaactgcg agaggcgcct gaaggtctcg 62880tcgagcgcgt cccgggtctt gagcggcttg
cagagcaccc ccgtcgcgcc catccggagc 62940gcgcgctcgc gctcctcgtc cgtcgtgatc
acctggacgg ggatgtgccg cgtcgcgagg 63000tcgcgcttca cccggtcgag cacgcgccag
ccgtccatgt ccggcaggtt gatgtcgagc 63060gtgatcgcgt tcacccgccg ctcgcggacg
atggagagcg ccgccccgcc gcggtaggcg 63120aggatcgcct tgaacccgtg gtcgtgcgcg
acatccatga cgaagtgcgc gaagctcgcg 63180tcgttctcga cgatgagcac cacggagtcg
ctgggctgga ggctcgcgct gtcgtcgacg 63240ctctggttga gcaggtgcgg cggcggctcg
gccgccgacc gcggcgcgac gtcgcccgag 63300acgagggccg gcggcgccga gggcacctcc
gcggcctgct ccttcctgcg cgggcgcgcc 63360ggcgtgtacg tgagcggcag gtaaagcgtg
aaggtgctcc cgctccccgg cctgctcgag 63420agcttgatct cgccgccgag catccacgcg
atctcgcggc tgatcgcgag cccgaggccg 63480gtgccgccgt acttccggct cgtcgagccg
tccgcctgct ggaaggcctc gaagatgatc 63540tgctgcttgt cgtgcgggat gccgatgccc
gtgtcccgca ccgacatggc gatcgccgcg 63600ccggcgcgcg agaggccctc gttctcgatg
gtccaccccg aggtgaccag atcgacgtcg 63660agcgcgacgc tgccgcgctc cgtgaacttg
aaggagttcg agagcaggtt cttgagcacc 63720tgctgtacgc gcttcgcgtc cgtgtagatg
acctgcggca ggttctgcgc gaagttgagc 63780tcgaactcga gcctcttcga ctcggcgacg
tgctggaacg tgcgctcgac gtagtcttgc 63840aggtcgctga acgacagctc gcccacgtcg
acgatcacgg tccccgactc gatcttggac 63900aggtccagga tgtcgttgat cagcgcgagc
aggtcgttgc ccgacgagtg gatcgtcttg 63960gcgaactcga cctgccgccc cgtgaggttg
cggtcggtgt tcttcgagag ctgatcggac 64020aggatgagga ggctgttcag cggcgtccgg
agctcgtgcg acatgttcgc gaggaactcc 64080gacttgtact tggaggtgat ggcgagctgc
cgcgccttct cctcgagcgc ctgccgcgcc 64140tgctcgacct cgcggttctt ccgctcgacc
tcgacgttct gctgggcgag caggcgagcc 64200ttctccccga gctcggcgtt cgtctgctgc
agctcctcct gctggctctg gagctcgcgc 64260gcgagggact gcgactgctt gagcaggtcc
tctgtgcgca tgttcgcctc gatcgtgttg 64320agcacgatcc cgatcgactc cgtgagctgg
tcgaggaacg cctggtgggt cgggctgaat 64380cgctcgaacg acgcgagctc gatgaccgcc
ttgacctgcc cctcgaagag cacggggatg 64440acgatgatgt tgaccggcgg cgcctcgccg
agcccgctcg tgatgcggat gtagtcgggg 64500ggcgcgttga cgaggaggat cttctccttc
tcgagcgcgc attgcccgac gagcccttcg 64560ccgagcttga aatggttgtc gacgtgcttc
cgcaccttgt acgcgtagct cgcgaggagc 64620ttgaggatcg gctcctcctt cgccacgtcc
atcgtgaaga acacgccctg ctgcgcgccg 64680acgaccgggg ccagctcgga caggatgagc
cgaccgacag tgagcagatc cttctgcccc 64740tggagcatgc gcgagaactt ggcgaggttg
gtcttgagcc agtcctgctc gctgttcttc 64800agcgtcgtgt ccttgaggtt ccggatcatc
tcattgatgg tgtccttgag cgccgcgacc 64860tccccctgcg cctcgacctt gatggaccgg
gtgaggtcgc ccttggtcac ggcggtggcg 64920acctcggcga tcgcgcgcac ctgcgtggtg
aggttcgcgg cgagccggtt cacgttgtcg 64980gtcaggtcct tccacgtgcc ggccgcgccg
gggacgctcg cctgaccgcc gagcttgccc 65040tcgacgccga cctcgcgcgc caccgttgtc
acctggtcgg cgaaggtcgc gagcgtctcg 65100atcacgccgt tgatcgtgtc cgccagcgcc
gcgatctcgc ccttcgcgtc gaaggccagc 65160ttgcgcttca ggtcgccgtt cgcgaccgcg
gtcacgacct tggcgatgcc gcgcacctgg 65220ttcgtcaggt tgccggccat gaagttcacg
ttgtcggtca ggtccttcca cgtgccggcg 65280acgccgggga cgctggcctg cccgccgagc
ttgccctcgg tgcccacctc gcgcgccacg 65340cgcgtcacct ccgacgcgaa cgcgttgagc
tggtccacca tcgtgtagtt gatggtgttc 65400ttcagctcca ggatctcgcc gcggacatcg
acggtgatct tcttcgacag gtcgccgttg 65460gccacggccg ttgtgacggc ggcgatgttg
cgcacctgcg cggtcaggtt cgacgccatc 65520gagttgacgg agtcggtcag gtccttccac
gtgccggcga cgccggggac gctggcctgg 65580ccgccgagct tgccctcggt gcccacctcg
cgcgccacgc gcgtcacctc cgacgcgaac 65640gagcggagct gatccaccat cgtgttgaag
gtgtccttca gctccaggat ctcgccgcgg 65700acatcgacgg tgatcttctt cgacaggtcg
ccgttggcca cggccgttgt gacggcggcg 65760atgttgcgca cctgcgcggt caggttcgac
gccatcgagt tgacggagtc ggtcaggtcc 65820ttccacgtgc cggcgacgcc cttcacctcg
gcctgcccgc cgagcttgcc ctcggtgcct 65880acctcgcgcg cgacgcgcgt cacctcggcc
gcgaaggagc tgagctgatc caccatcgtg 65940ttgaaggtgt tcttcagctc caggatctcg
cccttgacgt cgacggtgat cttcttcgac 66000aggtcgccgc gggccacggc cgtggtcacg
tcggcgatgt tgcgcacctg cgcggtcagg 66060ttcgacgcca tcgaattgac ggagtcggtc
aggtccttcc acgtgccggc gacgccgggg 66120acgctggcct ggccgccgag ctttccctcg
gtgcccacct cgcgcgccac gcgcgtcacc 66180tccgacgcga acgagcggag ctgatccacc
atcgtgttga aggtgtcctt cagctccagg 66240atctcgccgc ggacatcgac ggtgatcttc
ttcgacaggt cgccgttggc gacggccgtg 66300gtgacggcgg cgatgttgcg cacctgcgcg
gtcaggttcg acgccatcga gttgacggag 66360tcggtcaggt ccttccacgt gccggcgacg
ccggggacgc tggcctggcc gccgagcttg 66420ccctcggtgc ccacctcgcg cgccacgcgc
gtcacctccg acgcgaacga gcggagctga 66480tccaccatcg tgttgaaggt gtccttcagc
tccaggatct tcttcgacag gtcgccgttg 66540gccacggccg ttgtgacggc ggcgatgttg
cgcacctgcg cggtcaggtt cgacgccatc 66600gagttgacgg agtcggtcag gtccttccac
gtgccggcga cgcccttcac ctcggcctgc 66660ccgccgagct tgccctcggt gcctacctcg
cgcgcgacgc gcgtcacctc ggccgcgaag 66720gagctgagct gatccaccat cgtgttgaag
gtgttcttca gctccaggat ctcgcccttg 66780acgtcgacgg tgatcttctt cgacaggtcg
ccgcgggcca cggccgtggt cacgtcggcg 66840atgttgcgca cctgcgcggt caggttcgac
gccatcgaat tgacggagtc ggtcaggtcc 66900ttccacgtgc cggcgacgcc ggggacgctg
gcctggccgc cgagctttcc ctcggtgccc 66960acctcgcgcg ccacgcgcgt cacctccgac
gcgaacgagc ggagctgatc caccatcgtg 67020ttgaaggtgt ccttcagctc caggatctcg
ccgcggacat cgacggtgat cttcttcgac 67080aggtcgccgt tggcgacggc cgtggtgacg
tcggcgatgt tgcggacctg cgcggtcagg 67140ttcgacgcca tcgagttgac ggagtcggtc
aggtccttcc acgtgccggc gacgcctgtc 67200acctcggcct gcccgccgag cttgccctcg
gtgcctacct cgcgcgccac gcgcgtcacc 67260tgggccgcga aggagcggag ctgatccacc
atcgtgttga aggtgttctt cagctccagg 67320atc
6732323228DNASorangium cellulosum
2atgcccgaca cgtcgtcgtc gagccccgta atggcgatgg ggctatcgga ctcgaaagcc
60cggtccgtgg aggatgcacg gcctgcctcg gggcttcctc gtccacccgc gggcatcgct
120gtggtgggaa tgggatgtcg cttccccggc ggcatcgatt cgcccggatc cttgtgggcg
180gccctatctc aagggcgcga ccttatcagc gaggtcccgc cggaccggtg ggatgtcaat
240gcccactacg acgccgacgc aagcgtcccc gggaagattg cgacccgcca tggcggcttc
300ctcgccgggg tcgcggcgtt cgacgcgcct ttcttcgacc tctcgccgcg cgaagcgaag
360catatggatc cgcagcagcg cctcggcctc gagacggcgt gggaggcgct ggaggacgca
420ggcctggacg cgaggagctt gcggggcagc cgggcagggg tgttcgtcgg ctcgatgtgg
480gcggagtacg acgtgctcgc gtcgcgacat cccgaatcca tctcgccgca cggggccacg
540gggagcgacc cggggatgat cgctgcgcgc atcgcctaca ccttcggcct tcgtgggccg
600gccttgtcgg tgaatacggc gtcgtcgtcc tccctcgtgg cggtgcatct cgcattgcag
660agcttgcaga gcggagagtg cgagctcgcg ctggccggcg gcgcgaacct catcctgacc
720ccatacaaca cgatcaagat gacgaagctc gggacgatgt cgcccgacgg ccggtgcaag
780gcgttcgacc accgcgccaa cggctacgtg cgcgccgagg gcgtcgggtt cgtggtcctg
840aagccgctgt cgcgagcgac cgcggacggg gatcggatct atgcggtcgt gcgtggctcg
900gccgtgaaca acgacgggct caccgacggg ctgaccgcgc cgagcgggga ggcgcaggag
960gccgtgctgc gagaggcgta tgcgcgcgcc ggggtgtctc ccgccgaggt ggactacgtc
1020gaggcgcatg ggacgggaac gccgctcggc gaccgcgtgg aggcgacggc gctgggacgg
1080gtgctcggcg caggacgcgc ggcggatcgc gcgctgcggg tcggttcggt caagacaaac
1140ctcggtcacg cggaggcagc cgccggggtc atcggtctga tgaagacagc gctgtcgctg
1200cgtcacgggt cgcttccggc gagcctgcac gtcgagcgcc cgaaccccga gatacccctc
1260gaatcgctgg gcctccggct ccagacggcg cacggcgtgt ggccggaggt cgatcggccc
1320cggcgagcag gcgtgagctc attcggcttc ggcggcacga actgccatgt ggtgatcgag
1380gagtggcgcg ggggcctcca gcagagcgcc gccgaggcgg gcagcgaccc cggcgccgcc
1440gtaccgccgc ctggccttcc ccttgtgctg tcggcgaggg accacggggc gctgcgggcg
1500caggcgggcc ggtgggcggc gtggctcacg gagcaccgcg aggcgcgctg ggcggacgtc
1560gtccacacgg cggcagtgcg gcggacgcac ctgggcgctc gggccgcggt gatggcggcg
1620ggcgtggccg aggccgtcga tgcgctgaag gccctggccg acgggcgcgc ccacggggcc
1680gtgacggtcg gcgaggcgcg cgagcggggc aaggtggtct tcgtgtttcc gggccagggc
1740agccagtggc cggcgatggg gcgagcgctc ctgtccgcgt cgaaggtgtt cgccgaggcc
1800gtcgaggcgt gcgacgcggc gctgaggccg ctgacgggct ggtcggtgct ctcgttgctg
1860cgcggcgacg ccggggaggc agcgccgtcg ctcgaccgcg tcgacgcggt gcagccggcc
1920ctgttcgcga tggctgtcgg cctggccgct gtctttcgcg cgtggggcct cgatccttcg
1980gccgtggtgg gccacagcca gggcgaggtc ccggcggcgt acgtcgcggg ggcgctctcg
2040ctcgacgacg cggcgcgggt cgtggcggtc cgaagcgcgc tcgtgcggcg gctcgcgggc
2100gcaggggcga tggcggcggt ggagctgccg gccggcgagg tggagcgccg cctggcgccg
2160ttcggggggg ctctggccat tgcggtggtc aacacgtcga gctcgacggc cgtttctgga
2220gacgccgagg cggtggacag gctggtcgcg cagctcgagg ccgaaggcat cttctgccga
2280aaggtgaacg tcgattacgc atcccacagc gcgcacgtgg acgtcgtgct accagagctc
2340ctggagcgcc tggcgccggt ccggccaggg gccacgagga tccccttcta ttcgacagtg
2400accggcggtg tgctggaggg gacggcgctc gacggggcgt actggtgccg caacctgcgc
2460cagccggtgc ggctggaccg cgcgctcgcc cggctgctgg acgacgggca tggcgtcttc
2520gtggaggtca gtgcgcaccc ggtgctggcg tcgccgctga ccgcggcgtg cgccgagcgc
2580gagggcgtgg ttgtcggcag cttgcagcgc gacgacggcg ggctcgcgcg gctgctcggc
2640tcgctgggcg cgctgcatgt gcagggccag ccggtcgact ggcgcgcggt gctggcgccg
2700ttcggcggca gcctggtgga cctgccgacc tatgcattcc agcgccagcg ttactggttc
2760gatacggatg agagcgtcgc cctcgcagcg gcgtccagcg tcgcggaaga gtcgtggtca
2820gaaaagctgg ccgggctgtc ttccgcgcga cgggaagaac ggctgctcga atgggtgcgc
2880gcagagattg cagcggtgct cgggctggag gcgccggcgg tgccgccaga cgtcttgctg
2940cgggatctcg gattgaaatc gccgatcgcc gtggagctgg ggagccggct gggacgcagg
3000acacgccgga agctgcccgt gaccttcgtt tacaaccacc cgacgccacg agcgatcgct
3060cgcgccctcc tggagggaat gttttcctcg atcaaggact ctgcttcgag cgccgctgac
3120gaccgccgcc cgccgggggt gctcgaagac gttgcccccc cacaggcgct cgagacgtcc
3180gagatgtccg acgatgagct gttccagtcc atcgatgcgc tcgtctag
3228311040DNASorangium cellulosum 3gtggatcgaa gcgataaact gcgtgcgtat
ctggagaaga ccacggcctc gctggtcgag 60gcgaagggcc ggatccggga gctggaagcg
cgttcgcgcg agccgatcgc gatcgtggcg 120atggcgtgcc ggtttccggg cggcgtcgac
agccccgaga agctctgggc cctgctggac 180gaggagaggg acgccatcac cgaggtgccg
ccctcgcgat gggacctcga gcgcttctat 240gaccccgatc cggacgccgc gggcaagacc
tacagccgct ggggcggctt cgttggcgat 300ctggaccgtt tcgacgcggc gtttttcggg
atcagccccc gcgaggcccg gagcatcgac 360ccgcaagagc gctggctgct ggagaccacg
tgggaggccc tcgagcgggc cggcgtgcgc 420gcagacacgc tggaagggac cctggggggc
gtttacatcg gcctgtccgg ctcggagtac 480cagacggagg cattccacga tgcggagcgc
atcgacgcct attcgctgac cggcgcttcg 540ccgagcacga ccgtggggcg cctcgcctac
tggctcgggc tacgaggccc cgcggtcgcc 600gtggacaccg cgtgcagctc ctcgctcgtc
gcggtgcacc tggcctgcca ggcgctgcgg 660aacggggagt gcgattttgc gctggcaggc
ggcgtcaatg cgctcctggc ccccgagagc 720tatgttgcct tctgccgcct cagggcgctg
tcccccaccg ggcggtgcca gaccttctcc 780gcggacgccg atggctacgt gcgcgcggaa
gggtgcgggg tgctgctgct caagcgtctg 840tcgcacgcgc agcgggatgg agaccgtgtg
ctcgcggtca tccggggcaa tgccatcaac 900caggacggcc gcagccaagg gttgacggcg
ccgaacgggc tcgcgcagga ggacgtcatc 960cgcagggcgc tgtcgcaagc cgccgtggag
ccgacgaccg tcgatgtggt cgaatgccac 1020gggaccggca cggcgctcgg cgatccgatc
gaggtccagg cgctcggggc tgtttacggc 1080gatgggcgcc ccggagacag gccgctcgtg
atcggctccg tcaagacgaa catcggtcat 1140accgaggcgg ccgcgggcat ggccggcctc
atcaaggccg tcctttcgct gcagcacgcc 1200caggtccctc gatcgctgca cttcgcggcg
ccgagccctt acatcccctg ggataccctc 1260cccgtccgcg tggccgcgca gcgcgtcgca
tgggagcggc gcgagcaccc gcggcgcgcc 1320gggatctcct cgttcgggat cagcggcacc
aacgcgcacg tgatcctcga ggaggcgccg 1380gaagcgccgg cgacggcgcc ggaggcggcg
gcggtgacgt cgacgctgcc gttgcttgtg 1440tcggggcggg atgaggcggc gctcagggcg
caggcggagc ggtgggcggc gtggctcgcg 1500gcgcacccgg aggcgcgctg ggcggacgtg
gtgcacacgg ccgccgtgcg gcgcacgcac 1560ctggaggcgc gcgcggcggt ggccgcgggg
aacgccgccg acgccgccgc ggcgctgggg 1620gcgctggccg ccgggcagcc gcacaaggcg
gtgtccctgg gcgaggcgcg cgcgcgcggc 1680gatgtcgtgt tcgtggttcc gggccagggg
agccaatggc cggcgatggg gcgggcgctg 1740ctggccgagt ccgaggtgtt tgccgccgct
gtcgcggcct gcgacgcggc gctgcggccg 1800ttcacgggct ggtcggtgct ctcggtgttg
cgcggggagc agggcgaggc ggtgccgccc 1860gccgaccgcg tggacgtggt gcagccggcg
ctgttcgcga tggccgtggg gctctcggcg 1920gtctggcggg cgtggggcat cgagccctcg
gcggtggtcg gccacagcca gggcgaggtc 1980gcggcggcgt acgtcgccgg ggcgctgacg
ctcgaggacg cggcgcgggt ggtggcgctg 2040cgcagccagc tcgtgcggcg catcgccggc
ggcggcgcga tggccgtgat cgagcgcccc 2100gtcggcgagg tggagcagcg gctttctcgg
ttcggagggc agctctcggt ggcggcggtg 2160aacacgccgg gctcgacggt ggtgtccggg
gacgccgcag cggtcgatcg tttgctggcc 2220gagctggaga ccgcgcgggt gttcgcgcgg
cggatcaagg tcgattacgc gtcgcacagc 2280gcgcacgtgg acgcgatcct gccggagctc
gaggcctgcc tggcctcggt cgagccccgt 2340acctgcgcca tcccgctgta ctcgacggtg
acgggagaag tgctcgccgg cccggagctc 2400ggcgcgacat actggtgccg caacctgcgc
gagccggtgc ggctcgaccg ggcgctctcg 2460cggctgctgg cggacgggca cggggtgttc
gtggaggtca gcgcgcatcc ggtgctggcc 2520atgccgctgt cggccgcgag cgccgagcgc
ggcggcgtgg tggtgggcag cctgcagcgc 2580gacgacggcg gtctggggcg gctgacgtcg
atgcttggcg cgctgcacgt gcacggccac 2640gccgtgagct ggcagcgggt gctggcgccg
tacggcgggg cgctcgtggg cctgccgacg 2700tacgcgttcc agcgccagcg ccactggctc
gaggcgccgc ggtacgcggc ggaggatacg 2760gacggcgcgg cgcggcgcga cccgctgtac
cgggtcacgt ggatcgaggc ggcgctggaa 2820gaagcgccgt gggcgcccga gcgccacgtc
gtgctcggcg ggggcggcgc gctggcggcg 2880gggctggggg cgctcgcgct ggcggggctg
ccggagctgc tcgaggcgct ggagaacagg 2940gcggcggcgc ccgagcggct ggtgctggac
ctgacggagg gccgcccagg cgcggtggcg 3000gagtccgtgc acgccacgac gcgcgacgcg
ctcgcgctgg tccaggcatg gcttgcggcg 3060ccgcggctct cgggcaccga gctggtcgtg
gtgacgcggg aggcggtggc ggccggcccg 3120gacgagggcg tggcggcgct gggccccgcc
gctgtctggg ggctgctgcg cacggcccgc 3180gtcgagcacc ccgagcgcgc ggtgcgcgcg
gtggatctgg ggcgcgagcc gctggacgtc 3240gcggtcttgc ggcgggcgct gggggcggtg
gccgagccgg agctcgcgct gcgcgcgggc 3300ggggcgcggg ctgcgcgcct gcgcgctgtc
gacgccggcg cgggcgccag ggagccggcg 3360gctgcgctgg acccgcaggg cacggtgtgg
atcacgggcg gcaccgggga gctggggcgg 3420cagatcgcgc ggcacctggt cgcggcgcac
ggcgtgcggc acctcctgct gacgtcgcgg 3480cggggcgcgg ccgcgccgga cgccgaggcg
ctcgtcgagc agctgcgggc cgacggcgcc 3540gagacggtcg aggtcgtggc gtgcgacgtg
acggacggcg cggcgctttc ggcagcagtc 3600caggcggctg cggcaaggca cccgctgacg
gccgtggtgc acaccgccgg ggagctggcg 3660gacggggtgc tcacggggct gacggcggag
cagctcgcgc gggtgctggc gccgaaggtc 3720gacggggcgt gccacgtgta cgccgccgcg
caggaccagc cgctcgcggc cttcgtgctg 3780ttctcctcga tcgtgggcac gctgggcaac
gcgggccagg cgaactacgg ggccgccaat 3840gcgttcctgg acgcgttcgc ggcgcagctt
cgcgcgcgcg gcgtgccggc gacgagcctc 3900gcgtggggct tctgggagca ggcagggctc
ggcatgacgt cgcacctcgg cgcggccgac 3960ctggcgcgcc tcaggcggca gggccttgcg
ccgctgtcgg tcgcgcaggg cctgcgcctg 4020ctcgaccggg cgctcgcgcg cgcggaggcg
acgctggtgc cggcggcgct cgatcttccg 4080gcgctccagc gtgcggcgag cgacgccgga
cgggtgcctc cactgctgcg cgggctggtg 4140cgcacgagtc ccggccgccc cacggcgacc
gcgacccccg aggccgggcc ggcggcgtcg 4200gcgctgcgcg cacggctctc ggcgttgccc
gaggccgagc ggccgggcgc gctgctggat 4260ctggtgcgca cggaggtggc ggtcgtgctg
cagctggcag ggccggcgca ggtgcccgcg 4320gacaagccgc tgaaggagct ggggctcgat
tcgctcacgg ccgtcgagct gaggaaccgc 4380ctcggcgcgc gcgccgagac ggtgctgccg
acgaccctcg cgttcgacca tccgacgccg 4440cgcgcgatcg cggatctgct gcttcagcgt
gcgttctcgg agctcgcggc ggcgaaggcg 4500acgcgcgcgc ggggagcgca cgacgagccg
atcgcgatcg tgtcgatggc gtgccggctc 4560ccgggcagcg tcgatacccc cgcggcgctg
tggaagctcc tggcggaggg gcgggacgcg 4620atcgggccgt tccccgaggg gcgcggctgg
gacgtggcgg ggctgtacga tccggacccg 4680gatgtgccgg gcaagtcgat caccacgcaa
ggcggcttcc tctacgacgc cgaccgcttc 4740gatccgacgt tcttcggcat cagcccgcgc
gaggccgagc gcatggaccc gcagcagcgt 4800ctgctgctcg agtgcgcctg ggaggcgctc
gagcgcgcgg gcctggcgcc ccacgcgctc 4860gaggcgagcg ccaccggcgt cttcgtcggg
ctcgctcacg gtgactacgg cgggcggctc 4920ttgcagcagc tcgagtcctt cgacggccac
gtcctcaccg gcaacttcct cagcgtcggc 4980tcggggcgca tcgcgtacac gctggggctc
cgcggccctg cgatgaccgt cgacacggcg 5040tgctcgtcgt cgctcgtggc ggtccacctc
gcgtgcatgt cgctccgcgc gggcgagtgc 5100gacatggcgc tcgccggcgg cgccaccgtg
atggccacgc cgatgatctt cgtcgagttc 5160agccgccagc gcggcacggc gctggacggt
cgttgcaagg cgttcggcgc cggggccgat 5220ggcgccggct ggtcggaggg gtgcgggatc
ctggcgctga agcggctgtc ggacgcgcag 5280cgcgacggcg accgcgtcct ggcggtgatc
cgcggctccg ccgtcaacca ggacggccgc 5340agccaggggc tcaccgcccc caacggcccg
gcccagcagg acgtcatccg ccaggccctg 5400gccgcggcgg ggctcacgcc cgccgacgtc
gacgccgtcg aggcgcacgg caccggcacg 5460cgcctcggtg accccatcga ggcgcaggcg
ctgctggcga cctacggcgc cgcgcacaca 5520gcggagcggc cgctctggct cggctcgctc
aagtcgaacc tcgggcacac gcaggtcgcc 5580gcgggcgtgt cggggctgat gaagctcgtg
ctggccttgc agcacgcaga gctgccgagg 5640acgctgcacg ccgacccgcc ctcgccgcac
gtcgactggt cgcaggggca cgtcaagctc 5700ctgaacgagc ccgtgccgtg gccgcgcacc
gacaggccgc ggcgcgcggc ggtctcgtcc 5760ttcggcatca gcggcaccaa cgcgcacgtc
atcgtcgagg aggcgccggc cgaagcgccg 5820gcgacagcgg cggacgcaaa gtcggtggag
gcgcttccga tcctgccgct gctggtctcg 5880gggtccgacg agccggcgct gcgcgcgcag
gtgcggcggc tggtggagca cctgcggtcg 5940cacccggacg agcggctgct ggacgtggca
gcgagccttg cgaccacgcg cgcgcatctc 6000gcgatgcggc tcgcgctgcc cgtctcggca
ggggcgcccc gggatgcgtg ggtggatgag 6060ctggaggcat ttgccagggg aggagcggct
ccgacgcagg catcgcagac ccccgccgag 6120agcagcgcgg gcaaggtcgc ggtgctcttc
accggccagg gcagccagcg cgccgccatg 6180gggcgcgccc tgtacgccac ccaccccgtc
ttccgcgccg cgctcgacgc cgcatgcgcc 6240gagctcgacc gccacctcga caggcccctc
cacagcgtcc tcttcgcaga cgccggcacc 6300gaggccgccg cgctgctcga ccagacagga
tgggcacagc ccgccctgtt cgctctcgag 6360gtcgcgctct accgacagtg ggaggcctgg
ggtctgcgcc ccgagctgct gctcggccac 6420agcatcggcg agctcgccgc cgcccacgtc
gccggcgtgc tcgacctccc cgacgcctcc 6480gccctggtcg ccgcccgcgg acggctcatg
caggccctcc cccacggcgg cgccatggcc 6540tccatcgagg ccaccgagca cgagctccta
cccctgctcg accagcacac cggacgcctc 6600tcgctcgccg ccctcaacgc tccacgccag
tcggtcgtca gcggcgacct gcacgccgtc 6660gaccaggtct gcgcccactt catcgccctc
ggccgacgcg ccaagcggct cgacgtcagc 6720cacgccttcc actcggcgca catgcagccc
atgctcgacg ccttcgccag cgtcgcccgc 6780ggcctgacct tccacccgcc acggctgccc
atcgtcagca gcgtcaccgg cgcacgcgcc 6840accaccgacc agctcacctc gcccgactac
tgggtgcagc aggtgcgcga gcccgtgcgc 6900ttcctcgacg ccatgcgctc cctgcacgcc
gccggcgccg ccaccttcgt cgagtgcggg 6960ccgcacggcg tgctcaccgc cgcaggcgcc
gagtgcctcg ctcccgaggg cgctcgcgac 7020gccggcttcg tcaccagcct ccgcaaggac
cgcgacgagg ccctcgccct ggtccacgcc 7080gcctgcgccg tccatgtccg cgggcacgcc
ctcgactggc tccgcttctt cgacgccacc 7140ggcgctcgcc gcgtcgagct gcccacctac
gccttccagc gacagcgcta ctggctcgag 7200gcgccaaggc ctcgccccag cctcgagggc
gtcggcctca ccgccgcaaa ccacccatgg 7260ctcggcgccg ccgtgcgcct cgcagaccgc
gatggctacg tcctcagcgg ccgcctctcc 7320accatcgacc acccgtgggt cctcgaccac
gtggtgctgg gcacggcgct gctcccgggc 7380acgggcttcg tcgagctggc gtgggcggcg
gcagaggcgg tcgggctgcc cggggtatcg 7440gagctggcga tcgaggcgcc gctggcgctc
ccggcgcgcg gggcggtggc gctgcagatc 7500gcgatcgagg cgccggaccc ggcggggcgc
cgcggcgtcg cgatctacag ccgccccgac 7560ggcgcagccg acgcgccctg gacagcgcac
gcgcgcggcg tgctgggcgc cgcggcgccc 7620gacagggacg cggcgtgggc acagggcgcg
tggccgccgc cgggggccgt gcctgtcgat 7680gtgacgcagc ggatcgagat cgtggacgcg
tgggtcggcc cggcgttccg gggcgtcacc 7740gcgctgtggc gcgtcgggcg gacgatctac
gccgacgttg cgctgccgga cggtgtggcg 7800agcacggcgc aggacttcgg gctgcatccg
gccttgctcg atgtggcgct acgcgcgttc 7860ctgagagcgg agctcggcgc cgatccctcg
ccacgggagg gcacggtggt gccgttcgcg 7920tggtcggacg tggtgctcga ggcgcgtggg
acggcggcgc tgcgggtgcg cgtggaggtg 7980gcggccgatg gggacggcga cgcgatcacg
gcgtcgatcc agctggccga cgggcagggc 8040cgccccgtcg cgcgggtggg cgcgctccag
atgcggtgga cgacggccga gcgggtgcgc 8100gcggccgcgg gcgcggcgga gcgcgatctg
taccgcgtcg cgtggacgga cgtggcgctg 8160gacgacgcgg cgtttgcgcc ggaggagcac
gtcgtggtcg gcggcgacgg cgcgctggcg 8220gcggcgctcg gtgcacgcgt ggtggcgggg
ctgcccgagc tgctcgcgtc gctgccggac 8280ggcgcggcgg cgccacgccg gctggtggtg
gacctcacgg cggacgccgc gggcgcggtc 8340gtcgacgccg tgcacgccgc agcgcgcgac
gcgctgtccc tggtgcaggg atggctggcg 8400gcgccgcagc tggcggcgac ggagctcgtg
gtcgtgacgc gcggcgcggt ggcggtcgcg 8460ccggacgagg gcgtggcggc gctgggcccc
gcggcggtct gggggctgct ccgcgcgacg 8520cgcgtcgagc atgcggatcg cacggtccgc
gtgctcgatc tggggtccgc ggcgccggac 8580atgacgctct tgcgccgggc gctcacggcg
gccgaggagc cagagctcgc gctgcgcgcg 8640ggcggggcgc gggcgccgcg cctcgacgcg
gccagcgaga ccgaaggaga gctggcgccg 8700cccggcgggg cgcgctctct tcgcctgtcc
atccggacga agggctcgtt cgacgcgctc 8760cacctcgcgg acgctcccga tgcgctgcgc
ccgctcgggc cggggcaggt ccggctcgct 8820gtccgcgcca cggggctcaa cttccgcgat
gtcttgaacg tcctggggac gtaccgcggc 8880gaagcggggc ctctcggtct ggagggggct
ggggtggtgc tggacgtggg cgagggagtc 8940accgcccttc gacccggcga ccgggtgatg
ggcatgctgc acgcgggcat ggcgacccat 9000gcggtcgtcg acgcccggct gctgacgcac
atcccgcggg ggctttcctt cgtggaagcg 9060gcgacgattc cagcggcctt cctcaccgct
ctgtacgggc tgcgcgacct cggcgcgctg 9120aaggcggggc agcgcgtgct ggtgcacgcc
gccgccggcg gggtgggcat ggcggcggtc 9180cagcttgcgc gcctctgggg agccgaggtg
ttcgcgacgg cgagcgaggg caagtggccg 9240gcgctgcgtc ggatggggat cgaccaggcc
catatcgcct cgtcgcggac cctccacttc 9300aggaaagcct tcctcgatgc aacgcaggga
cagggcgtcg acgtggtgct cgacgcgctc 9360gcgggcgagt tcgtcgacgc ttcgctcgac
ctgctcccgc gcgggggcgc gttcgtggag 9420atgggcaaga gcgatgtgcg ggatcccgag
cgcgtcgcca aggaccaccc ccgcgttcgc 9480tacacggcct tcgatctgct cgacgcgggg
ccagaccaca tccaggcgat gctgcgggag 9540ctcgtcccgc tgttcgagga gggcgtcctc
gctccccttc cctccgtggc ctacgacctg 9600cgtcgcgccc cgcacgcctt ccgctccatg
gccaacgcac gccacatagg caagctcgtg 9660ctggtgccgc ccgcgacgct cgaccctgac
ggcacggcgt tgatcacggg cggcacggga 9720gagctcgggc ggcagatcgc gcggcacctg
gtggcggcgc acggcgtgcg ccacctggtg 9780ctgacgtcac ggcgcggcat ggacgcgccc
gacgccgcag cgctggtgga atcgctgcgc 9840gcggcgggcg ccgcgacggt ggaggtcgcg
gcgtgcgatg tgacggaccg tgacgcgctg 9900gcggccatcg tgcaggcgat ccccgcggcg
cgcccgctga ccgccgtcgt gcacacggcc 9960gccgtgctgg acgacggcac cgtggcgggg
ctctcggccg agcagctcgc gcgcgtgctg 10020cggccgaagg tcgacggcgc ctggcagctc
tacgaggcga cgagggacgc gccgctcgcg 10080gcgttcatgc tcttctcgtc ggtcgccggc
acgctgggca gctcggggca ggcgaactac 10140gccgccgcga acgcgttcct cgacgggctg
gcggcagagc tccgcgcgcg cggcgtgccg 10200gcgatgagcc tcgcgtgggg cttctgggag
cagggcggga tcgggatgac ggcgcacctc 10260ggcgccgccg atctggcgcg gctgaagcgg
cagggcatcg tgccgatgac ggtcgcgcac 10320ggcctgcggc tgctcgaccg cgccctcgag
cgcccggacg cggcgctggt gcccgcctcc 10380ctggacatgg cggtgatcca gcggacggcg
agcgaccacc gtcaggtgcc gcccatgctg 10440cgcgggctgg tccgcgtcgc gccgcggcag
gcggcagggg cagccagcgg caggagccat 10500gaggcctcga ccctgcggca gcagctcgcc
gcgctgcccg aaccggagcg gcagcgagcg 10560ttgctcgatc tggtccggac cgaggcagcc
gccgtccttg tgctgcgcgg gccggacgct 10620gtccccgccg acaagccgct cagggagctc
gggctcgact cgctcacggc agtggagctc 10680aggaatcggc tcaggacccg tgcgcagacc
gatctcccat cgaccctcgc cttcgactac 10740ccgacgccga aagcggtcgc cgtgtatctg
gcccaggagc tcgaccttca cgacgtcatg 10800acggagatgc gcggaccgag cttgcgctct
gacgacgagc tcaagtcggc catcgcgagc 10860atccggatct cgacgctacg ccaggcgggg
ctgctcgaca gcctgcttcg gctcgccgcc 10920agcgaagccg tctccacatc cagcgacacg
acacctgaaa ccgacgagct gacgctgcag 10980catgttggag acgatgagct ggcacggctt
gtcttcgacc tcgccggagg agcgcaatga 11040410965DNASorangium cellulosum
4atgaaagaag agatctccgc ccgtcaagct ctcgagaaga gcttcattga acttcgccgt
60atcaagcggg agctcgatca gctcaaggcg aagtcgagcg agccgatcgc gatcgtgtcg
120atggcgtgcc ggctcccggg cggcgtcgat acccccgcgg cgctgtggca gctgctctcg
180gaggggcggg acgcgatcgg gccgttcccc gaggggcgcg agtgggacgt ggcggggctg
240tacgacccgg acccggacgc gccgggcaag tcgatcactg cgcaaggcgg cttcctctac
300gacgccgacc gcttcgatcc ggcgttcttc gccatcagcc cgcgcgaggc cgagcggatg
360gacccgcagc agcggctgct gctcgagtgc gcctgggagg cgctcgagcg cgcgggcctg
420gcgccccacg cgctcgaggc gagcgccacg ggcgtcttcg tcgggctgtc ggtcacggac
480tacggcgggc ggctgctgca cgatcccgag gccctcgacg gctacatcgc caccggcacc
540ctgcccagcg tcggctcggg gcgcatcgcc tacacgctgg ggctccgcgg ccccgcgatg
600accgtcgaca cggcgtgctc gtcgtcgctc gtgtcgctcc acctcgcgtg catgtcgctc
660cgcgcgggcg agtgcgacat ggcgctcgcc ggcggcgcca ccgtgatggc cacgccgatg
720gccttcatcg agttcagccg ccagcgcggc acggcgctgg acggtcgttg caaggcgttc
780ggcgccgggg ccgatggcgc cggctggtcg gaggggtgcg ggatcctggc gctgaagcgg
840ctgtcggacg cgcagcgcga cggcgaccgc gtcctggcgg tgatccgcgg ctccgccgtc
900aaccaggacg gccgcagcca ggggctcacc gcccccaacg gcccggccca gcaggacgtc
960atccgccagg ccctggccgc ggcggggctc acgcccgccg acgtcgacgc cgtcgaggcg
1020cacggcaccg gcacgcgcct cggcgacccc atcgaggcgc aggcgctgct ggcgacctac
1080ggcgccgcgc acacagcgga gcggccgctc tggctcggct cgctcaagtc gaacctcggg
1140cacacgcagg ccgccgcggg cgtgtcgggg ctgatgaagc tcgtgctggc cttgcagcac
1200gcggagctgc cgaggacgct gcacgccgac ccgccctcgc cgcacgtcga ctggtcgcgg
1260gggcacgtca agctcctgaa cgagcccgtg ccgtggccgc gcaccgacag gccgcggcgc
1320gcggcggtct cgtccttcgg cttcagcggc accaacgcgc acatcatcat cgaggaggcg
1380ccggcggcct ccgccgaggc gacgagccgc ggggagaaga cgtccgcggc cgcgccgccg
1440tcgatgatgc cgctgctggt ctcgggggtg gacgaggcgg cgctacgagc gcaggcgggg
1500cggtgggcgg cgtggatcga ggcgcacccg gaggcaggct gggcggacgt tgtgtacacc
1560gcggcagcgc ggcggacgca cctgggggcc cgtgcggcgc tgacggcggc ggacgcggcc
1620ggcgctgtcg cggcgctgac ggcgctctcg caagggcagc cgcacgccgc gctcgccgtg
1680ggcgaggcgc gcgctcgggg gaaggtcgcc ttcgtgtttc cgggccaggg cagccagtgg
1740ccggcgatgg ggcgggcgct gctctcgcag tcggaggtgt tcgccgcggc ggtcacggcg
1800tgcgacgcgg cgctgcggcc gttcaccggc tggtcggtgc tctcggtgct gcgcggcgac
1860tcgggcgcgg aggtgccgcc gctggagcgc gtcgacgtcg tgcagccggc gctgttcgcg
1920atggcggtgg ggctcgccgc tgtgtggcgc gcgtggggcc tcgagccgtc ggcggtggtg
1980ggccacagcc agggggaggt cccggcggcg tacgtcgcgg gggcgctgtc gctcgaggac
2040gcggcgcgga tcgtggcgct gcgcagccag ctcgtgcggc gcctgtccgg ggctggcgcg
2100atggccgtga tcgagcgccc ggtaggcgag gtcgagcagc ggctctcgcg gttcggcggc
2160gcgctgtcgg tggcggcggt caacacgccg cgctcgacgg tggtgtcggg agatatcgag
2220gcggtcgacc gcctgctggc ggagttcgag ggcgagcagg tcttcgcgcg gaaggtcaac
2280gtcgactacg cgtcgcacag ccgacacatc gacgggctgc tgccggagct ggagaacggc
2340ctgggcgcgg tgcggccgcg cgcgagcacg atcccgttct actcgacggt gaccgggacg
2400gtgctgacgg gcgcggagct ggacgccgcg tactggtgtc gcaacctgcg cgagccggtg
2460cggctcgacc gggcgctctc gtggctcctg gacgacgggc acggcctgtt cgtcgaggtc
2520agcgcgcacc cggtgctgac gctgccgctc acaggagcga gcgcggcgag cggcggtgtg
2580gttgtcggca gcctgcagcg cgacgacggc gggctcgggc ggctcctggg ggtgctggcc
2640gcgctgcacg tgcacggcca cgacgtcgac tggcgcgcgg tgctggctcc gtggggcgga
2700ggcgtggcgg acttgccgac ctacgcgttc cagcggcagc gctactggct cgaggcaccg
2760cgcggccggg cagggctgga gagcggaggg ctcctggccg tgaatcaccc gtggctcagc
2820gcggcggtgc ggctggccga ccgcgacggc tatgtgctga gcggacggct gtcgacggtc
2880gagcacgcgt gggtcctgga ccacgtggtg ctgggcacgg tgatcctccc gggcacggcg
2940ttcgtcgagc tggcgctcgc ggcggccgat gcggtcggac tgccctcggt gtcagagctc
3000acgatcgagg cgccgctggc gctgccggcg cgaggggcgg tggcgctgca ggtgacggtc
3060gaggcgccgg acgcgacggg gcggcggggc ttcgcggtct acagccggcc cgacggcgcg
3120cacgacgcgc cgtggacggc gcacgcgcgc ggcgtgctcg gcgcagcgcc cgcggcggcc
3180acgacggcgt gggcggcggg cgcgtggccg ccggcggggg ccgagccggt cgacgtcacg
3240cggtgggtcg aggcgctgga cgcgtgggtc ggcccggcgt tccggggcgt gacggcggcg
3300tggcgcgtgg ggcggtcgat ctacgccgac ctggcgttgc ccgagggggt ctcggagcgg
3360gcgcaggact tcggcctgca tccggccttg ctcgatgcag cgctccaggc cctcctgagg
3420gcggagctcg gcgcaggcgc gtcgccgcgg gagggcatcc cgatgccctt cgcgtggtcg
3480gacgtggcgc tcgaggcgcg gggggcagcg gcgctgcggg cgcgcgtgga ggtcgaggac
3540gccagcgatg gggaccagct cgcggcgtcg atcgagctgg ccgacgcgca ggggcagccg
3600gtcgcgcgcg cagggacgtt ccgggcgcgg tgggcgacgg cggagcacgt gcgcatggct
3660gcggcgggct cgagcgagcg tgacctgtac cgggtcacgt gggcggacgt ggtgctggaa
3720gaagcggcgt gggcgccgga ggagcacgtc gtgctcggcg gcgacggcgc gctcgcggcg
3780gcgctgggcg cgcgcacggc ggcgctgccg gagctcatcg cggcgctgcc ggagggcgcg
3840gccgcgccgc gccggctggt gatcgacgcg gccgcgggcg accccggcga cggcctggtc
3900gcggcggcgc acgcggcggc gcagcgggtc ctgtcgctgg tgcaggggtg gctctcggag
3960gcgcggctcg cggacagcga gctggtggtg gtgacgcgcg gcgctgtggc cgccgggccc
4020gacgacggcg tcgcggcgtt gagccacgcg ccgctgtggg gactcgtgcg cacggcgcgc
4080caggagaacc ccggccgggc ggtgcgcctc gtggacctgg ggcccgagcc gctggacgga
4140gcgctcctgc gccgggtggt ggcggcggcc gaggagccgg agctcgcgct gcgcgggggc
4200gcggcgcgcg cgccacgcct gcgcgaggtg cgcgcgggcg cggccgacgc ggcgcggccg
4260acgcggctgg atcccggcgg gacggtgctg atcacgggcg gcaccgggga gctcgggcgg
4320caggtcgcgc ggcacctcgt ggcgtcgcac ggcgtgcggc acctcgtgct cacgtcgcgg
4380cgcgggatgg gtgcgccgga cgccgcggcg ctggtggacg agctgcgcgc cgcgggcgcc
4440gcgacggtcg acgtcgcggc gtgcgacgtc gccgacggcg cggcgctggg ggcggtcatc
4500gcggcgatcc cggctgcaca ccccctcacg gcggtcgtgc acatggcggg cgtgctggac
4560gacgtcatcg tgacgaagct ctcggccgag cagctcacgc gcgtgctgcg gccgaagatc
4620gacggcggct ggcacctggc cgcggcgacg cgaggccatc ggctcgcggc cttcgtgctg
4680ttctcgtcgg cggccggcac gctgggcagc ccggggcagg cgaactacgc cgcggccaac
4740acgttccttg acgcgctcgc ggcgcagctc cgcgcgcgcg gcgtgcccgc gatgagcctc
4800gcgtggggct tctgggagca ggcagggctc ggcatgacgg cgcacctcgg cgcggccgac
4860ctggcacgcc tcaggcggca gggcatcgcg ccgatcgcgc tcgcgcaggg catgcagctg
4920ctggaccggg cgctcgcgcg cccggaggcg gcgctggtgc cggcggcgct cgaccttccg
4980gcgctccagc gtgcggcgag cgacgccggg caggtgccgg cgctgctgcg cgggctcgtg
5040cgcccggcgg tcgggcggcg cgcggcggcg cctgcggccg ccgcgaccgg agcggcggcg
5100ctgcgcgcgc ggctcgcgcc gctgcccgag gccgagcggc acgacgtggt gctcgacctg
5160gtgcgcgccg aggcggcggc cgtgctgcag ctggcggggc cggcgcaggt ccccgcggac
5220aagccgctga aggagctggg gctcacctcg ctcacggcgg tcgagctgag gaaccgcctc
5280ggcgcgcgcg ccgagacggc gctgccggcg accctcgcgt tcgaccatcc gacgccgcgc
5340gcgatcgcgg gtctgctgct tcagcgtgcg ttctcggagc tcgcggcggc ggtggcgacg
5400cgcgcacagg cgccacgcgc gcagggggcg cacgacgagc cgatcgcgat cgtgtcgatg
5460gcgtgccggc tcccgggcgg cgtcgatacg cccgcccgga tgtggcagct cctggcggag
5520gggcgggacg cgatcgggcc gttccccgag gggcgcggct gggacgtggc ggggctgtac
5580gaccccgacc cggacgcgcc gggcaagtcg gtcaccaacc tgggcggctt cctctacgac
5640gccgaccact tcgatccgac gttcttcggc atcagcccgc gcgaggccga gcgcatcgac
5700ccgcagcagc ggctgctgct cgagtgcgcc tgggaggcgc tcgagcgcgc gggcctggcg
5760ccccacacgc tcgaggcgag cgccaccggc gtctttgtcg ggctggtgta cagcgactac
5820ggcgggcggt tgctggagca cctcgagtcc ttcgacggct acatcgccac cggcagcttt
5880cccagcgtcg gctcggggcg catcgcctac acgctggggc tccgcggccc tgcgatgacc
5940gtcgacacgg cgtgctcgtc gtcgctcgtg tcgctccacc tcgcgtgcat gtcgctccgc
6000gcgggcgagt gcgacatggc gctcgccggc ggcgccaccg tgatggccac gccgatggcc
6060ttcatcgagt tcagccgcca gcgcggcatg gcccccgacg cacggtgcaa ggccttcggg
6120gcggaggcga acggcatcgg ccccgcggag ggctgcggga tcctggtgct caagcggctg
6180tcggacgcgc ggcgcgacgg cgaccgcgtc ctggcggtga tccgcggctc cgccgtcaac
6240caggacggcc gcagccaggg gctcaccgcc cccaacggcc cggcccagca ggacgtcatc
6300cgccaggccc tggccgcggc ggggctcacg cccgccgacg tcgacgccgt cgaggcgcac
6360ggcaccggca cgcgcctcgg cgatcccatc gaggcgcagg cgttgctggc gacctacggc
6420accgcgcaca cagcggagcg gccgctctgg ctcggctcga tcaagtcgaa cctcgggcac
6480acgcaggccg ccgcgggggt tgtggggctg atgaagctcg tgctggcgat gcagcacgcg
6540gagctgccga ggacgctgta tgcggagccc cgatcgccgc acatcgactg gtcgcagggg
6600cacatcaacc tcctgaacga gcccgtgccg tggccgcgca ccgacaggcc gcggcgcgcg
6660gcggtctcgt ccttcggcat cagcggcacc aacgcgcacg tcatcatcga ggaggcgccg
6720gccgaagcgc cggcgacagc ggcggacgca aagtcggtgg aggcgcttcc gatcctgccg
6780ctgctcctgt cgggtcgcga cgagccggcg ctgcgcgccc aggccgggcg gctcgccgag
6840cacctgcgcg cccacccggg cgagcggctg ctcgacatcg ccgcgggcct ggccacgacg
6900cgcacgcacc tcgccacgcg gctcgcgctg ccggtcgccg cggacgcagc cgcggaggag
6960ctgggcgccc gccttgcgca gttcgccgcc ggcggcccgg cgcccagcgg cgccgccgtg
7020accgcgccgg ggcagccgcc cggcaaggtc gcggtgctct tcaccggcca gggcagccag
7080cgcgccggca tggggcgcgc cctgtacgcc acccaccccg tcttccgcgc cgcgctcgac
7140gccgcatgcg ccgagctcga ccgccacctc gacaggcccc tccacagcgt cctcttcgca
7200gacgccggca ccgaggccgc cgcgctgctc gaccagacag gatgggcgca gcccgccctg
7260ttcgctctcg aggtcgcgct ctaccgacag tgggaggcct ggggtctgcg ccccgagctg
7320ctgctcggcc acagcatcgg cgagctcgcc gccgcccacg tcgccggcgt gctcgacctc
7380cccgacgcct ccgccctggt cgccgcccgc ggacggctca tgcaggccct cccccacggc
7440ggcgccatgg cctccatcga ggccaccgag cacgagctcc tacccctgct cgaccagcac
7500acggggcgcc tctcgctcgc cgccctcaac gctccacgcc agtcggtcgt cagcggcgac
7560cagcccgccg tcgaccatgt ctgcgctcac ttcatcgccc tcggccgacg cgccaagcgg
7620ctcgacgtca gccacgcctt ccactcggcg cacatgcaac ccatgctcga cgccttcgcc
7680agcgtcgccc gcggcctgac cttccacccg ccacggctgc ccatcgtcag cagcgtcacc
7740ggcgcacgcg ccaccaccga ccagctcacc tcgcccgact actgggtgca gcaggtgcgc
7800gagcccgtgc gcttcctcga cgccatgcgc tccctgcacg ccgccggcgc cgccaccttc
7860gtcgagtgcg ggccgcacgg cgtgctcacc gccgcaggcg ccgagtgcct cgctcccgag
7920ggcgctcgcg acgccggctt cgtcaccagc ctccgcaagg accgcgacga ggccctcgcc
7980ctggtccacg ccgcctgcgc cgtccatgtc cgcgggcacg ccctcgactg gctccgcttc
8040ttcgacgcca ccggcgctcg ccgcgtcgag ctgcccacct acgccttcca gcgacagcgc
8100tactggctcg aggcgccaag gcctcgcccc agcctcgagg gtgtcggcct caccgccgca
8160aaccacccat ggctcggcgc cgccgtgcgc ctcgcagacc gcgatggcta cgtcctcagc
8220ggccgcctct ccaccatcga ccacccgtgg gtcctcgacc acgtggtggc aggcacagtg
8280atcttgccag gaacggcgtt cgtcgagctg gcgtgggcgg cggccgaggt ggtgggcgcc
8340gccgcggtgt ccgaggtgac cttcacgacg ccgctcgtgc tgccgccgcg cagcgtggtg
8400gagctgcagg tgaggatcgg cgagccggac gcgtccgggc ggcggacgtt cgccgcgtac
8460agccgcgcgg acgcggcgat cgaggcggag tggacgcaac acgcgaccgg cgtgctgagc
8520gcgcaggcgg cggccggggc cgacgtggcg gacctttcgg tgtggccacc gccgggcgcc
8580gaggtggtgg cgctcgacgg cggctacgcc tggctggcgg cgcagggcta cggctacggc
8640ccggcgttcc aggcgctgcg cgaggtgtgg cgcgcgggca cgacgctgta cgcgcgggtc
8700gcgctgccgg acgcggtggc ggacacggcg cggggcttcg ggatccatcc ggcgctgctc
8760gacgcggtgc tgcactcgtt gctggcgccg tcggcgcagg aggaggcgtc cgacgacgac
8820aaggtgctgc tggcgttcgc gttctcggac gtggtgatcg aggcgcgcgg ggcagcggag
8880gtgcgcgtcc gcctgaacaa gcaggccgga gacgacgggg agggggtcac ggcgtcgatt
8940cacctcgccg acgcgcaggg gcggccggtc gcgcgcgtgg gggcgttcca ggcgcgggcg
9000acgaccacgg agcgggtgcg cgcgctcgcg ggcgcgagcg agcgcgacct gcaccgggtc
9060acgtggacgg acgtgacgct ggaagagacg ccgtgggcgc acgaggacag cgtcgtggtc
9120ggcggcgacg gcgcgctggc ggcggcgctg ggcgtgcgcg cggtggccgg gctgcccgag
9180ctgctcgcgg gcggcgcggc ggcgccgcgt cgtctggtga tcgacgcgac cgcgggcgac
9240cccggcgacg gcctggtcgc ggcgacgcac gcggcgacgc agcggggcct cgcgctcttg
9300cagggatggc tctcggaggc gcggctcgcg gcgacggagc tggtgctcgt gacgcgcggc
9360gcggcggcgg ccgagccgga cgagggtgtg gcggcgctga gccacgcgcc gctctggggg
9420ctcgtgcgcg cggcgcgcga agagcacccg gcgcgcgcgc tgcgccttgt cgacctgggg
9480cgcgaggcgc cggacggggc gatcctgcgc cgggcgatcg cggcggacga cgagccggag
9540ctcgtggtcc gccgcggggc gctgcgggcc gcgcgcctga gcctcgccca cgctggcccg
9600gacaccgcgg ggcaagcgac gcggctggcc cccggcggga cggtgctgat cacgggcggc
9660acgggagagc tcggacggca ggtcgcgcgg cacctggtgg cggcgcacgg cgttcgccac
9720ctggtgctga cgtcacggcg cggaatggac gcgcccgacg ccgcggcgct ggtggagtcg
9780ctgcgcgcgg cgggcgccgc gacggtggag atcgcggcgt gcgacgtggc ggacgggcat
9840gcgctggcgg cggtgctccg gaccatcccg gcggagcatc cgctgaccgc ggtcgtgcac
9900acggcgggcg tgctcgaaga cggcgtcgtg accgggctct cggccgagca gctcgcgcgc
9960gtgctgcggc cgaaggtcga cggcgcctgg cagctctacg aggcgacgaa ggacgcgccg
10020ctcgcggcgt tcatgctctt ctcgtcggcg gcgggcacgc tgggcagcgc ggggcaggcg
10080aactacgccg ctgcgaacgc gttcctcgat gcgctggcgg cagagctccg cgcgcgcggc
10140gtgccggcga tgagcctggc ctggggcttc tgggagcaag gcgggatcgg catgacggcg
10200cacctcggcg ccgccgacat ggcgcgggtc aagcggcagg gcatcgtacc gatgacggtc
10260gcgcacggcc tgcggctgct cgaccgcgcg ctggagcggc ccgaggcgac gctggtgccc
10320ctatcgctcg acgtggcggc gcttcagcgc gcggcgagcg acgccggacg ggtgccggcg
10380ctgctgcgtg gcctggtgcg cccggcggcc gcccggcgca cggcggcgcc ggcggccgcg
10440gcgacagggc tccgcgcgcg gctcttgccg ttgtccgagg ccgagcgcca ggacgtcttg
10500ctcgatctgg tgcgcacgga gatcgcggat atcctcgcgc tgtccgggcc agcggcggtg
10560cctcccgatc aacccatcag ggagctgggg ctcgattcgc tcacggcggt ggacgttcgg
10620agccggcttg tgcagaggag cgagatcgac ctcgccgtga ccctcgcgta cgattacccg
10680accgcgcgag cgatcgcggg acatctgagc gagcagatgg gactcgaagg agcgccggaa
10740gatcgtgagt cggcgctcga cgagagccag atccgcgccc tgctcatgca gattcctatc
10800cccacgttgc gccagtcggg gctgctcgga gacctggttc gcctggcctc cccgcaagcg
10860cccccgcgcg aagaaggtga gagcgagacg ttgagcttcg atcaccttgg aaatgaagag
10920ttcctcagcc tcgcgtcgaa gctcattgca gaggagggat catga
1096555643DNASorangium cellulosum 5atgaaccaag agactgttct tcggcagaca
ctcgagaaga gtctccacaa gatccagcac 60ctcaatcggg agctcgagcg tctcaaggcg
aagtcgagcg agccgatcgc gatcgtgtcg 120atggcgtgcc gctacccggg cggcgtcgac
ggtcccgcac ggctgtggga gctgctctcg 180gaggggcggg acgcgatcgg gccgttcccc
gaggggcgcg gctgggacgt ggcggggctg 240tacgaccccg acccggacgc gccgggcaag
tcggtcacca cgcagggcgg cttcctctac 300gacgccgacc gcttcgatcc gacgttcttc
ggcatcagcc cgcgcgaggc cgagcggatg 360gacccgcagc agcggctgct gctcgagtgc
gcctgggagg cgctcgagcg cgcgggcgtc 420gcgccccaca cgctcgaggc gagcgccacc
ggcgtcttcg tcgggctggt gtacagcgac 480tacggcgggc ggctgctgga gcacctcgag
gtcttcgacg gctacgtcgc caccggcagc 540tttcccagcg tcggctcggg gcgcatcgcc
tatacgctgg ggctccgcgg ccctgcggtg 600accgtcgaca cggcgtgctc gtcgtcgctc
gtgtcgctcc acctcgcgtg catgtcgctc 660cgcgcgggcg agtgcgacat ggcgctcgcc
ggcggcgcca ccgtgatggc cacgccgatg 720gccttcatcg agttcagccg ccagcgcggc
atggccccgg acgcacggtg caaggccttc 780ggggcggcgg cgaacggcat cggccccgcg
gagggctgcg ggatcctggt gctcaagcgg 840ctgtcggacg cgcggcgcga cggcgaccgc
gtcctggcag tgatccgcgg ctccgccgtc 900aaccaggacg gccgcagcca ggggctcacc
gcccccaacg gcccggccca gcaggacgtc 960atccgccagg ccctggccgc ggcggggctc
acgcccgccg acgtcgacgc cgtcgaggcg 1020cacggcaccg gcacgcccct cggcgatccc
atcgaggcgc aggcgctgct ggcgacctac 1080ggcaagacgc acacagcgga gcggccgctc
tggctcggct cgatcaagtc caacttcggg 1140cacacgcagg ccgccgcagg ggtggcgggc
atcatcaagc tggtgctggc gatgcagcac 1200gcggagctgc cgaggacgct gtatgcggag
ccccgatcgc cgcacgtcga ctggtcgcag 1260gggcacgtca agctcctcaa cgagcccgtg
ccgtggccgc gcaccgacag gccgcggcgc 1320gcggcggtct cgtccttcgg cgtcagcggc
accaacgcgc acgtcatcct cgaggaggcg 1380ccggccgaag cgcccgcggc cgcgcaaaca
gcggcggggg tgccgtcgac gctgccgctg 1440ctcctgtcgg gtcgcgacga gccggcgctg
cgcgcccagg ccgggcggct cgccgagcac 1500ctgcgcgccc acccggacga gcggctgctc
gacatcgccg cgggcctggc cacgacgcgc 1560acgcacctcg ccacgcggct cgcgctgccg
gtcgccgcgg acgcagccgc ggaggagctg 1620agcgcccgcc ttgcgcagtt cgccgccggc
ggcccggcgc ccagcggcgc cgccgtgacc 1680gcgccggggc agccgcccgg caaggtcgcg
gtgctcttca ccggccaggg cagccagcgc 1740gccgccatgg ggcgcgccct gtacgccacc
caccccgtct tccgcgccgc gctcgacgcc 1800gcatgcgccg agctcgaccg ccacctcgac
aggcccctcc acagcgtcct cttcgcagac 1860gccggcaccg aggccgccgc gctgctcgac
cagacaggct gggcacagcc cgccctgttc 1920gctctcgagg tcgcgctcta ccgacagtgg
gaggcctggg gcctgcgcgc ccacgcgctg 1980ctcggccaca gcctcggcga gatcgtcgcc
gcccacatcg ccggcgtgct cgacctcccc 2040gacgcctccg ccctggtcgc cgcccgcgga
cggctcatgc aggccctccc ccacggcggc 2100gccatggcct ccatcgaggc caccgagcac
gagctcctac ccctgctcga ccagcacacc 2160ggacgcctct cgctcgccgc cctcaacgct
ccacgccagt cggtcgtcag cggcgaccag 2220cccgccgtcg accatgtctg cgctcacttc
aaggccctcg gccggcgcgc caagcggctc 2280gacgtcagcc acgccttcca ctcggcccgc
atggaaccca tgctcgacgc cttcgcccgc 2340gtcgcccgcg gcctgaccta ccgcgccccg
cgcctgcccg tcgtgagcaa tgtcaccggc 2400cgcatggcca ccgccgacga gctcacctcg
cccgactact gggtgcgcca cgtgcgcgag 2460cccgtgcgct tcgtcgccgg cgtgcgcgcg
ctgcacgcca ccggcgtcgc cacctacctc 2520gagtgcgggc ccgatccggt gctcggcggc
atggccgcag actgcctcac ctccgacgag 2580agccgcgacc caggcctgat ccccagcctc
cgcaaggacc gcgacgaggc cctcgccatc 2640gcccaggccg cctgcgccct gcacgtccgc
ggacacgccc tcgactggcc ccgcctcttc 2700gacgccaccg gcgctcgccg cgtcgagctg
ccaacctacg ccttccagcg gcagcgctac 2760tggatcgatg cgccgcggcg cgcggcgggg
ctcgaaagcg tcggcctcac ggccgcagac 2820cacccctggc tgggcgcggc ggtgcggctc
gccgaccggg acgtctacgt gctgagcggg 2880cggctgtcga cggtcgacca cccgtggatc
ctggaccacg tggtgacggg cacggcgctg 2940atgccaggaa cggggttcgt cgagctggcg
tgggcgacgg cccaggcggt gaacgccgcc 3000gcgatcgcgg agctcaccct gacgactcca
ctcgtgttgc cggcgcgcgg cgcggtgcag 3060ctccaggtga cggtcgacga ggccgacgcg
gatggccggc gggcattcgc gatccacagc 3120cggccgcatg ggcccgtcga cctcgagtgg
acgcaacacg cgaccggcgt gctgagcgcg 3180gaggcgccgg cgggagccga cgaggcggcg
gggctctcgg agtggccgcc gccgggcgcg 3240gaggcggtgg cgctcgacgg cgggtatgag
cagctgtccg agcacggcta cggccatggc 3300ccggcgttcc aggggctccg cgggctctgg
cgcgcggacc agacgctgta cgcgcacgtc 3360gcgctgccgg acgctgtcgc gggcacggag
cagggcttcg ggctccatcc ggcgctcttc 3420gatgcggcgc tgcagtcgct ggcgcggctg
tcgcgcgagg aggcggccgc tggcgacccg 3480gtgctggtgc cgttcgcgtg gacggacgtg
gcgctgtacg cggccggcgc gaccgagctg 3540cgggcgcgca tcgcgctgga gcaggcggag
ggcggcgcgc cggcggtggc gtcgctgctg 3600ctggccgacg cgcacggacg aaccgtggcg
acgacagggc gggtgcgcgg ggcgagcgcg 3660gcgcagacgc ggtccgccgc gagccgtgcg
gagccgatgt acagggtcgc gtggacggac 3720gtggcgctgg aggcggcggc gtgggcgccc
gaagagcacg tcgtgctcgg cggtgacggt 3780gcgctggcgt cggcgctggg cgtgcgcgcg
gcggccgggc tgccggagct gctcgaggcg 3840ctggcggacg gcgcggccgc gccgcggcgg
cttgtcgtgg acctgacggc gggcgacgcg 3900ggcgctgtcg tcgcggccgt gcacgccgcg
gcgcgcggcg cgctggccct ggtgcaggga 3960tggctcgccg cgccgcagct gacggcgacg
gagctcctcg tggtgacgcg ctgcgccgtg 4020gcgacagggc cggacgaggg cgttgacgcg
ctggggccgg cggccgtctg ggggctgctg 4080cgggccacgc gcgccgagca ccccgaccgc
gcggtccggg tgctggacct ggggcgcgag 4140ccgctggacg gggcgctcct gcgcagggcg
ctggccgcgg tggcggagcc ggagctgtcg 4200ttgcgccgcg gcgaggcgcg cgcgcctcgc
ctgcgcgagg caaagcccgc cgcggcgccg 4260gcgacacggc tggaccctga agggacggtg
ctggtcacgg gcggcaccgg ggagctgggg 4320cggcaggtcg cccggcacct ggtggcggcg
cacggcgtgc ggcacctcgt gctgacgtcg 4380cggcgcggga tggacgcgcc cgacgccgcg
gcgctggtag aagagctgcg cgcggcgggc 4440gcggcgacgg tcgacgtcgc cgcgtgcgac
gtcgccgctg gcccggccct ggcggcggtc 4500gtggaggcga tcccggcggc gcatcccctg
accgcggtcg tgcacatggc gggcgtgctg 4560gacgacggca tcgtgacgaa gctctcggcc
gagcagctca cgcgcgtgct gcggccgaag 4620gtcgacggcg ccattcatct ccacgagctc
acgaagcacg cgccgctcgc ggccttcgtg 4680atgttctcgt ccgcggcggg cacgctgggc
agcccggggc aggcgaacta cacggcggcc 4740aacgtgttcc tggacgcgct ggcggcgcga
ctgcgcgcgc gcggcgtgcc cgcgatgagc 4800ctggcgtggg gcttctggga gcaaggcggg
atcggcatga cggcgcacct cggcgccgcc 4860gatcgggcgc ggatgaagcg acacggcgtc
gtggcgatgt cggtcgcgca gggcctgcgg 4920ctgctcgatc gcgcgctcgc gcaccccgag
gcggcgctgg tgccgctcgc gctcgacctc 4980tcgtcgctgc acgcgggggc cagcggcgcc
ggaccggtgc cgccgctgct gcgcgggctg 5040gtacgcgcgc ccgccggccg gcgcacggcg
gcgtccgcgg cccggacgaa cgggaagggc 5100acggcattgg cggcgctccg cgcgcggctc
ttgccgttgc cgcaggccga gcgcgaggac 5160ctcttgctcg agctcgtgtg caccgaggtc
gcggaggtgc tgcagttgcc ggggccggcg 5220cacgtcccgg cggatcagcc gctccgcgac
ctggggctcg actcgctcat gaccgtggag 5280ctgcgcaacc gtctcggcgc gcgcgccgag
acgacgctgc ccaccacgct cgcgttcgac 5340tacccgacgc ccagggccct tgcgtcctat
ctggagacgt tgctcggcat ctccgacgag 5400aacgggcatt cgggtgagtt gctgcacgtt
ccgcagaacg aggacgagat ccgctccgcg 5460atagcgcgca tcccgatagc gaccctgcgc
gaggcggggc tcctccagag cttgctgcgg 5520ctcgcccccg gcaaggcggt ggccggtgac
gtcacgcacc cggtcgatga gctgctggtc 5580gagcacatcg aggatgaaga gctgcttcga
ctcgctttcg aggccaccgg aggtatcaag 5640tga
564368610DNASorangium cellulosum
6gtgaaagacg aggctctctc gtttcgccga gccctggaga agacggtcgt cgagatccgc
60cgtctcaatc gggagatcga cgacctgcgg gcgaagtcga gcgagcccat cgcgatcgtg
120tcgatggcgt gccggttccc cggcggcgtc gagaaccccg aggcattgtg gcggctggtc
180tccgaggggc aggacgcgat cgggccgttc cccgaggggc gcggctggga cgtggcgggg
240ctgtacgacc ccgacccgga tgtgccgggc aagtcgatca ccgcgcgggg cggcttcctc
300tacgacgccg atcgcttcga tccggagttc ttcggcatca gcccgcgcga ggccgagcgc
360atcgatccgc agcagcggct gctgctcgag tgcgcctggg aggcgctcga gcgcgcgggc
420gtcgcgcccc acacgaagga ggcgagcgcc accggcgtct tcgtcgggct gatgtacacg
480gactacggcc tgcggctgct gaaccacccc gaggccctcg acggctacat cggcatcggc
540agcacgggga gcacgggctc ggggcgcatc gcctacacgc tgggcctgca gggacctgcg
600atcacggtgg acacggcgtg ctcgtcatcg ctcgtggcgc tccacatggc ctgcgcgtcc
660ctgcgcgggg gagagtgcaa cctggcgctt gtcggaggcg tcgccgtgat gacgacgccg
720acaacgttca tcgagttcag ccggcagcgg ggcctctcgc tcgacggccg gtgcaagtca
780ttcggtgccg aggccgaggg cgtcggctgg ggcgaaggct gcggaatcct ggcgctgaag
840cggctgtcgg acgcgcggcg cgacggcgac cgcgtgctcg cgatcatccg cggctccgcc
900gtcaaccagg acggccgcag ccaggggttc accgccccca acggcccgag ccagagggcg
960gtcatccagc gggcgctggc ggcggcgggg ctgaccgcgg cggacgtcga cgccgtcgag
1020gggcacggca ccggcacgcg cctcggcgac cccatcgagg cgcaggcgct gctggcgacc
1080tacggcaagg cgcacacagc ggagcggccg ctctggctcg gctcgatcaa gtccaacttc
1140gggcacacgc aggccgccgc aggggtggcg ggcatcatca agctggtgct ggcgatgcag
1200cacgcggagc tcccgaggac gctgcacgcc gacacgccct cgccgcacgt cgactggtcg
1260caggggcacg tcaagctcct caacgagccc gtgccgtggc cgcgcaccga caggccgcgg
1320cgcgcggcgg tctcgtcctt cggcatcagc ggcaccaacg cgcacgtcat cctcgaggag
1380gcgccggccg aagcgcccgc ggccgcgcaa acaccagcgg cggcgggggt gccgtcaacg
1440ctgccgctgc tcctgtcggg tcgcgacgag ccggcgctgc gcgcccaggc cgggcggctc
1500gccgagcacc tgcgcgccca cccgggcgag cggctgctcg acatcgccgc gggcctggcc
1560acgacgcgca cgcacctcgc cacgcggctc gcgctgccgg tcgccgcgga cgcagccgcg
1620gaggagctga gcgcccgcct tgcgcagttc gccgccggcg gcccggcgcc cagcggcgcc
1680gccgtgaccg cgccggggca gccgcccggc aaggtcgcgg tgctcttcac cggccagggc
1740agccagcgcg ccgccatggg gcgcgccctg tacgccaccc accccgtctt ccgcgccgcg
1800ctcgacgccg catgcgccga gctcgaccgc cacctcgaca ggcccctcca cagcgtcctc
1860ttcgcagacg ccggcaccga ggccgccgcg ctgctcgacc agacaggctg ggcacagccc
1920gccctgttcg ctctcgaggt cgcgctctac cgacagtggg aggcctgggg cctgcgcgcc
1980cacgcgctgc tcggccacag cctcggcgag atcgtcgccg cccacatcgc cggcgtgttc
2040gacctccccg acgcctccgc cctggtcgcc gcccgcggac ggctcatgca ggccctcccc
2100cacggcggcg ccatggcctc catcgaggcc accgagcacg agctcctacc cctgctcgac
2160cagcacaccg gacgcctctc gctcgccgcc ctcaacgctc cacgccagtc ggtcgtcagc
2220ggcgaccagc ccgccgtcga ccaggtctgc gcccacttca aggccctcgg ccggcgcgcc
2280aagcggctcg acgtcagcca cgccttccac tcggcccgca tggaacccat gctcgacgcc
2340ttcgcccgcg tcgcccgcgg cctgacctac cgcgccccgc gcctgcccgt cgtgagcaat
2400gtcaccggcc gcatggccac cgccgacgag ctcacctcgc ccgactactg ggtgcgccac
2460gtgcgcgagc ccgtgcgctt cgtcgccggc gtgcgcgcgc tgcacgccac cggcgtcgcc
2520acctacctcg agtgcgggcc cgatccggtg ctcggcggca tggccgcaga ctgcctcacc
2580tccgacgaga gccgcgaccc aggcctgatc cccagcctcc gcaaggaccg cgacgaggcc
2640ctcgccatcg cccaggccgc ctgcgccctg cacgtccgcg gacacgccct cgactggccc
2700cgcctcttcg acgccaccgg cgctcgccgc gtcgagctgc caacctacgc cttccagcgg
2760cagcgctact ggctcgagac gccccagacg ccgggcgccg acggggcctc caacctatct
2820tcgcccgccg aaagccgctt ctgggaggct gtcgagagag cggacatcat ccccctcgcc
2880gaggcgctgc gcctcgagga tgaggcgcaa cgcgcttcgc tggcgaccct gctgcccgcg
2940ctctcgacct ggcgccgccg acgccacgag cagagcaccg ccgacgcctg gcgttaccgc
3000gttgcctgga aaccccttgc catcgacgcc cggagcgatc tctcgggggt ctggctgttc
3060ctcgcgcctc cggatcacgc gaaggacgac ctcgcgcgcg cggtccttcg cgcgctcgcc
3120gagagcggcg cgacggtcgt ccctgtgctg gtggccgagg gcgacgtcga ccgcgccctc
3180ctgagcgcgc ggctgcgcga gcaggtcggc gacggcggcg cgatccgcgg cgtgatctcg
3240ctcctcgccc tggacgagac ctcgctgccg cagcacgacg ggctgccccg gggcctcgcc
3300ttcacgctcg cgctcgtcca ggccctggga gacacggcga tcgcagcgcc tctatggctg
3360ctcacccgtg gcgccgtctc cgtgggtcgt tccgaccgcc tcgagcgccc gctgcaggcg
3420ctgacgtggg gcctcgggcg cgtggtggcg ctggagcacc ccgagcgctg gggtggactc
3480atcgatctcg ccggcgcgct cgacgaaaag gcgctcaagc ggctcgtcgc cgccctcggt
3540ggtcgcgacg ccgaggatca gctcgccctg cgcccctccg gactcttcgc gcgacggctg
3600gtcagagcgc ccctgggtga agcgaccgcg gttcgcgcct ggaaggcgcg cggcaccgcg
3660ctcgtcaccg gcggcacggg ggacctgggc gcccacgtcg cccggtggct cgcccagaat
3720ggcgccgagc acctcgtcct caccagccgc cgcggacagg acgcccccgg agcggccgag
3780ctcacggccg agctcacggc gctcggcgcc cgcgtcacca tcgccgcctg cgactcgtcc
3840gaccgacagg cgctcgcggc cctgctccag cgcctgaggg ccgaaggccc ccccctccgc
3900gccgtcgtcc acgctgcggg tgtcgaccag gtcaccccgc tggccaggac cagcctggcc
3960gagttcgcag gcatcgcctc cggcaaggtc gcaggtgctc ggcacctcga cgacttgctc
4020ggcaatgccc ccctcgacgc cttcatcctc ttctcctcgg tcgcaggcgt ctgggggagc
4080ggctttcagg gcgcttacgc ggcggccaac gccttcctgg acgcgctggc cgagcagcgc
4140cgcgccctgg gctcgacggc cacgtcgatc gcctggggcc tctggggcgg caaaagcatg
4200gccgacgacg ccgccaaaga tcatctcagc aagcgcggcg tgtccccgat gccgccccag
4260ctcgcgatcg cggccctgca gcgggcgctc gaccacgacg agaccacact caccctcgcc
4320gacgtcaact ggtcacgctt tgccccggcc tttgccgccg cccgcccgcg cccgttgctg
4380cacgatctcc cggaagcccg gagcgctctc gagtccccct cgccggcgcc ccgcgaggcc
4440gagctgctca cccggctcca gggcctctcc agcaccgagc gcgtccgcca cctcgtctcc
4500ctcgtgctgg cggagaccgc cgtcgtcctc ggccatcctg acgcctcccg cctcgaccct
4560cacacaggct tcgcggatct cggcctcgac tcgctgatgg ccgtcgagat gcgccggcgg
4620ctccagcagg caacgggggt gagcctgccg gcgaccctga ccttcgacca cccctcgccc
4680caccacatcg cgaccttcct cctcgacgag gtcttcgcgc cggccctcgg ccaggccccc
4740ggcgccgagg aagacgaagc gatcgcccag gccgggctcg cctcgggcga cgagcccgtc
4800gccctcatcg gcgtggggct gcgtctcccc ggcggagcca ccgacctcga cgggctctgg
4860cgccttctgg agcaggggat cgacgttgtc ggccccgtcc ctgaagaccg cggctggagc
4920atggacgagc tctacgatcc cgaccccgac tccctcggca agagctacgt gcgcgaagcg
4980gctttcctcg atcgcatcga cctcttcgac gcgggcttct tcggcatcag cccccgcgag
5040gcgagccacg tggacccgca gcaccgcctc ctgctcgagg ccgcgtggca ggccctcgag
5100cacgcaggca tcgtcccggc ctcgctccag gactcccaga ccggcgtctt cgtgggctca
5160ggcccgagcg actacgcctt gctccacaac ccggcccagg aggatgaagc ctacaggctt
5220acggggacgc agccctcgtt cgcgccaggc cggctctcgt tcagcctggg attgcaggga
5280ccggcgctct ccgtggacac cgcctgctcc tcctcgctcg tcgcgctcca cctcgccgcc
5340caggccctgc gccgcggcga gtgcgggctc gccctcgtcg gcagcgcgca ggtgatggct
5400gctcccgacg ccttcgtgac gctctcccgc gctcgcgcca tcgctcccga cggccgctcg
5460aagaccttct ccgcccaggc cgatggctac ggccgcggcg agggggtcat cgtcttcgtc
5520ctcgagcgcc tgagcgacgc ccgcgcgaga gggcgcgacg tcctcgcggt cctccgcggc
5580agcgccgtca accacgacgg cgccagcagc ggcatcaccg cgccgaacgg cacctcccag
5640cagaaggtgc ttcgtgccgc gctccacgat gcgcggctca cgccagcgga cgtcgacgtg
5700gtggagtgcc acggcacggg cacttccctc ggcgacccca tcgaggtgca agccctggcc
5760gccgtctacg gaaaggagcg ctccgccgat cggccgctga tgctcggcgc gctcaagacc
5820aacgtcggcc acctcgaggc cgcgtccggt ctcgccggcg tcgcgaaggt cgtcgcggcg
5880ttgcgccacg aggcgctgcc ggcgacgctg cacaccgccg cgcgcaaccc tcatatccag
5940tgggatacgc tgcccgtcca ggtcgtcgac accttgcgtc cctggccgcg gcgcgaggac
6000ggcacccccc gccgcgccgg cgtgtcggcg ttcgggctct ccggcaccaa cgcccacgtc
6060ctcctcgagg aagctccgcc tgtccagccg agcacacagg cggagcagcc tgccgcgccg
6120ccgtggttgc cgctgctcct gtcgggcaag acggacgcgg ccctgcgagc gcaggccgag
6180cggctgcggg cgcacctcga cgcccatgcc gacctcgggc ttgccgacgt cgcctattcc
6240ctcgccacga cgcggacgca tttcgcgcat cgggcggtgg tcgtcgcgga cgctggcgcg
6300accctcttcg aagggctgga cgccatcgcg cgcggcaacg ccgcttccca cgtggtggtc
6360gacgaggcca agatcgacgg caagaccgtc ttcgtcttcc cgggacaggg ctcgcagtgg
6420gcccagatgg cgcagccgct gctcgagacc tccgagctct ttcgcgagcg tatcgaggcg
6480tgcgcgcacg ccctcgcgcc tcacgtcgac tggtcgctgc tcgccgtcct ccgcggcgaa
6540gaaggcgccc cctcactgga gcgggtcgac gtggtgcagc cggtgctctt cgccgtgatg
6600gtctcgctcg ctgccctctg gcgctcgatg ggcgtcgagc cggacgccgt cgtcggccat
6660agccagggcg agatcgccgc cgcctgcgtg gcgggcgcgc tgtcgctcgc ggacgccgcc
6720aaggtggtgg cgctgcgcag ccgcgcgctc gcgcggctcg ccggccgggg cgccatggcc
6780gtcgtggagc tccccgccgc cgagctcgcc gagcgcatga agcgctgggg cgagcggctg
6840tccatcgcag cgctcaacag ccctcgttcc accgtgatct ccggcgatcc ggacgccgtc
6900gacgcgctgc tccgggagct cgactcggcg gagatcttcg cccgcaaggt gcgcgtcgac
6960tacgcctccc actgctccca tgtggaggcg attcgccacc agctcctggc cgagctcgcg
7020ggcatcgagc cgctcccgtc cacgctcccg ctctactcca cggtgagcgg ggacaagctc
7080gatggcgtcg cgctcgacgc ctcgtactgg taccggaacc tccggcagac cgtccgcttc
7140tcggacgcca cgcagcggct cgtctccgcg ggacatcgct tcttcgtcga ggtcagcccg
7200catccggtgc tgacgttcgc cgtgcaggat gtcctcgatg ccgagggggt gcccgccgct
7260gtcgtcggct cgctacggcg cggcgagggc gacctgcggc ggttccttgt gtcgctgtcc
7320gagctcttca cccgcggcct cgccctggat tggtccaggg ttctgcccag cggccggcgc
7380gtatcgctgc ccacctacgc cttccagcgc gagcgctact ggctcggggc tcacagggct
7440cgcggcaccg acgcgacatc cgccggcctg gcatcggacg agcccacgcg cggcgcgtcg
7500atgccagtgc ggctctcgtt gcgggacgtg ccgcccgagg agcgccaggg agcgctggag
7560cggttcgtcc gggagcagct cgcggccgtc ctgcgcatgg atgcggcgcg gatcgagggg
7620cagacgacga tcaagacgct cgggatcgac tcgctcatgg cgctcgagat ccgcaaacgg
7680ctggaagccg gactggccgt gaccttgcca tcgacgctca tctggcagtt cccgcacgcc
7740gaagggctcg cacggcacct catgacgcgg ctccccgcgg gggacggaga aggatctgcc
7800gtggtccagc ccgtggagca gccgcgcgcg ccgaaggagg tgcccgtatc catggatccc
7860tcggcgtggg tgcaccgccc gcgccccagg gccgacgcgc gcgttcgact gttctgcctt
7920ccctacgccg gcgcgggcgc ctcgcgcttc cgggcgtggc cagagctgct cccctcctgg
7980gtggaggtct gcccgatcca gctccccggc agggaagagc gcctccacga gccggccttc
8040gagacgatgg acgcgctcgt cgacgcgctc gttcccgccg tcgaggcgca catcgatcgg
8100ccctttgcgc tgttcggctg cagcatgggt gccctcctgg ccttcgagct cgcccgggcg
8160cttcaatccc gtcatcgctt ggtggcgcgg catctgttcg gcgcggcgag ctcctcacct
8220cggcgcgtga gcccggtacg ggagcagctc tccgcggtgg tctcccctgg aacggtgcga
8280tcggacgcga tggcctcgct gcgccagctc ggtctgctgt cgtcctcgtc cctccaggac
8340gaagagatgc tggacgaggt gtggcccgcg ttccgtgcgg atctatccct gacgctgaag
8400tacacgtgca gggacgcaac ccccctcgac gcccccatct cggtcttcgg gggcaccgag
8460gaccggaccg tagggcgcga ggatctcgtc gcctggcata cgctgacgaa ggacgcgttc
8520caggtcgcca tgctgcccgg gggtcacctg ttcatggacg cgacgccgaa gcggctcttc
8580catcacatcg agcacgcgct ccagctctag
86107687DNASorangium cellulosum 7gtgcggacca gcgacgccgt gtgggctggt
gccgcgggct ataccagggc gcgtcttcag 60gtctatgact tcttcatcta cggcttcaac
agccctgtcg catggaagtg cccgggcgag 120gagctcctcg agaactacaa tcggcacgtc
tcgggcaatc acctcgacgt cggcgtgggg 180acggggtacc tgctcgaccg ctgccgcttc
cccaccgcca agccgcgtgt gtttctgatg 240gatctgaacc cggacgctct gcaggtgacg
gcgcagcgac tgcaccgctt tcagcctcag 300accttgcggc ggaacgtcct tgatcccatc
cgcttcgacg gagagccctt cgactccatc 360gggatgaact acctcatgca ctgcgtccct
ggatccatcc cggagaaggc cgtgatgttc 420gaccacctga gcgccttgct gaagccgggc
ggcgtgatct tcggcagcac ggtgctctcg 480gagggcgtgg acaaggggat cgtggcgcga
gccatcatgg accgcttcaa caagaagggg 540atcttctcga acacccgaga cgccgcctcc
gatctgacgc gagcgctgga ggagcgcttc 600gacgacgtct cggtccgcgt cgtcggctgc
gtcgggctgt tctcagccag gaagcgtacc 660tgcgcgggaa ccgagtcgcc ggcgtga
6878906DNASorangium cellulosum
8atcgtcctgg gcgacacgct ggagcaggtg gcgacgcggc tgctcgagga ggacctcgcg
60gcgtgccaca cgaccggcga ggcggcggac gtgctgctga acggggtgct cgcgtcgagc
120gcccgcgccg tggccgcggc gctgcgcgcg tgcgacgagt tcgccgcggg cgacagcgat
180ctgccgtcgc tggcccgggc gtgccgcgcg ttcgcggggc tcgcgtcgtt cgggtcgtcg
240cggtcgctgt cgtcgctcgg cgacggggtg atcgcgccga tgctggagaa gacgttcgcg
300cgcgcggtcc tgcgcgtcca cgggggctgc acgggcagcg acgaggcggt cgccgccgcc
360aaggaggcgc tgcgcacgct gcacgacgtg gcgctgtcgc agccgatcgt cgaccgcggg
420gcgtggctcg acgcggcgcg ggggctcgtg gacagcgagg tggtgaaccc gacggcgtcc
480ggcctcgcgt gcgggctgct ctacctggcg caggcgatcg acgacgccga ggtggcgcgg
540gtcgtcggcc tgcggctcgg gggcgcggcc gagcccgagg cggcggcgtc gttcctggcc
600gggttcctcg aggtgaacgc gctggtgctg gtgaagagcc ggcccgtggt cgaggcgctg
660gacgcgttcc tccgggcgat cgcgccggag cgcttcaagg acacgctgcc ggtccttcgg
720cgcgcgttcg ctgggctcgg cgcgacggag cggcggtacc tgctcgagaa cgtgctcgcg
780gcgcggaagc tgggggacaa ggcgcgcgcg gcgcaggcgg tgctcctgga gaaggaccgg
840gagaagctga aggagatgag cgaggacctc tcacaggcga tggacgacct ggacgagttg
900ctctga
90691038DNASorangium cellulosum 9tcacacgccc ggcaaagctg gcctcgagtc
caccgcgacc gccctctgga tggcgtcgcg 60gagccacccg tggccctcgt cgtgctcgga
gcgctccggc cagacgagcg tcagcgtgta 120gccctcgagc gcgaacgggc acggccgcac
cacgagatcg agcctccggg ccagggccgc 180ggcgacgcgc gcggacacgg tgagcagcag
gtcggaaccg gagacgatga acggcgcgac 240aaggaaatgg gacacggtca gcgtcacccg
ccggcgtgtt ccctgctccg ccagcgcccg 300atcgatggcg ccgtggtcct ctccgtgcgg
cgagaccatc aggtgctcgc aagcagcgta 360gcgcgccgcg gtgagcggcc tccgggacgc
cgggtgtccg cggcgcatca cacagacgat 420ctcctcggcc gcgagcagcg tggagcgaca
gccgtcgggc accggtccgc cgcgcccgag 480cttgccgtcg agctcgccgc ggcgcaggag
ctcggcgaag tcggccggga tgttccggca 540gcgcaggttg acgcgcggcg cctcgacggc
gagcagcgcg gtcagcgccg ggagcacgag 600cagctcgagg ttgtcggtcg cgacaagccg
gaacgtgcgc tgcgaccgcc gcgggtcgaa 660ccgctcgacc gggcggaaga cctgctcgag
ccgctcgacg gcctcggccg cccgcggggc 720caggtcccgc gcccgctcgc tcagcgtcat
ctgcctgccg acctggatga gcagcgggtc 780cgcgaaatgg gcgcgcagcc gcgcgagcgc
gtggctcatc gagggctgcg tcacgcccac 840gcggcgcgcg gcgcgcgtga cgctcttctc
ctggagcagg gcgtgcaacg ccacgacgag 900gtgggtgtcg accgactgca gccgcatggt
cgatggatac cacgtcgatc catcgacggc 960gtctatggat cgccgcgccg actgccgatt
cgacgcccgg ggccgtgggt gcctatctct 1020cctctccgga cggcgcat
103810327DNASorangium cellulosum
10atgatcatcg agtacgttcg ctacacgatc cccgcggagc aagagaagga gttcctggcc
60gcctaccgcg acgccgccgc ggagctgcgc gggtcggagc attgcctcga ctacgagatc
120tcccgctgcg tcgaggatcc gacgagctac gtcgtccgca tctgctggga ctcgctgcaa
180ggccatctcc agggcttccg caaggcggcg gcgttcccgt cgttcttcgc caaggtgaag
240ccgttctacg agcgtatcca ggagatgagg cactacgcct tgaccgacgt cgccgcgcgg
300caggcgggga cggccgcgac gggctga
327111461DNASorangium cellulosum 11atgaagctcg cgcgcaagct gacgctcgcc
ctcgtgttcg gggtattcct cgtgctcgcg 60ctgagcgcct acgcccagat ccgcagagag
gccaggatct tcgagaacga cgtccagcgc 120gaccatcaca cgatggcccg cgcgctcgcg
gccgcggtca tggaggtgtg gcgctccgag 180ggaaccgcgc gggcgctgcg cctcgtggag
gacgccaacg agcgggaaca gcaggcgaac 240atccgctggg tctggctcga tggccaggcc
gacgagcccc atcgcccccg gctggcgccg 300gagctgctcg cccccgtcgc cgaggggcgc
gcggtcgtgc gccggatccc ccagaaagac 360gcggatctgc tcgtgacctg cgtgccggtg
tccgtgcccg gcgaccgcgc cggcgcgctc 420gagctctccg agtcgctcgc gggcgcgcgc
cggtacatcc ggagcatgat cctgagcacg 480gcgatcacca cagccgcgct gacgctggta
tgtgggttgc ttacaacggg cctcggagtc 540tggctggtgg gacgccccat gcgcacgttg
atcgaccagg cgcggcggat cggcgccggc 600gatctctccg ggcggctgtc gctgcgccag
gaagacgaga tcggcgagct cgggcgcgag 660atgaacgcca tgtgcgatcg cctcgccgcg
gcgaaccaga agctcgagtc cgaggccgcc 720gcgcggatcg ccgcgctcca gcagctccgt
cacgccgagc ggctcgcgac cgtcggcaag 780ctcgcgtccg gcatcgcgca cgagctgggc
gcgcccctcc aggtcgtcac ggggcgcgcg 840cggatgctcg tcgacggcga cgtgtcgggc
gatgaggtgc cgatcaatgg acagatcatc 900ctcgagcagt cgcagcggat gacccagatc
atccgccagc tgctcgactt cgcccggcgc 960cgcagcgccg agaagcagga gaccgcgctc
cgcggcgtca tccgcggcac gttcacgatg 1020ctgaagccgc tggcggacaa gcagggtgtc
acgatcgtcg aggagggaga cacgccggat 1080cgggtggtcc acgccgacgc cgaccagctc
cagcaggcgc tcacgaacgt cgtcgtcaac 1140gcgatccagg ccatgccgtc cggcggcacg
atcacggtgg gcgtccggac cgtccgcgcc 1200agccccccgc ccgaccaggg aggggccgag
ggcgactaca tcgcgctgtc ggtgcgcgac 1260gagggacagg gcatgacggc cgacgtcctc
gagcacgtct tcgagccgtt cttcacgacc 1320aagcccgtcg gcgaggggac cgggctcggc
ctgccggtcg cctacggcat catcaaggag 1380cacggcggct ggatcgacgt cgacagccgc
cccggctccg ggagccagtt cacgatgtac 1440ctgccgcagg agaagccatg a
1461121386DNASorangium cellulosum
12atgaccggac gcgtcctgat cgtcgacgat gagcgaggcg tctgcgagct cctcgacgcc
60gggctgaaga agcggggatt ccaggcggcg tggcgcacgt cggccgccga ggcgctcgag
120ctcctcggcg cggaggactt cgacgtcgtc gtcaccgaca tgaccatgcg cggcatgaac
180ggcctcgagc tctgcgagcg catcgcccag aaccggcccg atctgccggt catcgtcatc
240accgcgttcg ggagcctcga caccgccacg tcggcgatcc gcgccggcgc ctacgacttc
300gtgaccaagc cgttcgagct cgacgcgctc cggctcaccg tcgagcgcgc cctgcgccac
360cgcgccctcc gcgaggaggt gcgccggctg cggcgcgccg tggacgactc ccaccgttac
420gagcagatcc tcggcggcag cccggcgatg aagggcgtct tcgatctgct cgaccgggtc
480gccgactcgg acacctcgat cctcatcacc ggcgagagcg gcaccggcaa ggagctcgtc
540gcgcgcgccg tgcaccagcg cagccggcgc ggccagggcg cgttcatcgc ggtgaactgc
600gcggcggtcc cggacgccct gctcgagacc gagctgttcg gccacgcgcg gggcgccttc
660accgacgcca agggggcgag gagcggcctg ttcgcgcggg cccacggcgg caccctgttc
720ctcgacgaga tcggcgagct gccggtcggg ctccagccga agctcctgcg cgccctccag
780gagcgcgtcg tccggcccgt cggcgcggac gaggaggtcc ccgtggacgt gcggctcatc
840gcggcgacca accgcgacct ggagaccgcg atcgaggagc gccgcttccg cgaggacctc
900tattaccgga tcaacgtggt ccacgtcgat ctgccgccgc tccgctcccg cggcgccgac
960gtgctgctgc tcgcgcagcg cttcctcgag cacttcgcga ccgtcaagga gcggcccatc
1020aagggcctct cggcgcccgc ggccgagaag ctcgtcgcct acgcgtggcc cggcaacgtc
1080cgcgagctcc agaactgcat cgagcgggcc gtcgcgctcg cgcggtacga tcagatcacg
1140gtcgacgatc tccccgagaa gatacggagt taccggcgct cccacgtcct tgtctcgagc
1200gacgacccga ccgagctcgt ccccatggag gaggtcgagc ggcgctacat cctgcgcgtc
1260ctggaggtgg tcggcggaaa caagagccag gcagcccagg tcctgggctt cgatcgagcg
1320accctgtacc ggaagctcga gcggtacggc ctgcgcgccg ggcgcgcggg cgacccgagg
1380ccgtga
1386131527DNASorangium cellulosum 13tcatccatgg gagacgccgc gcgggccgtc
cgcctggtcc cttgacgacg agcgcggcag 60ctcgatccgg aagaccgagc cggcgccggg
acggctctcg acgaagaggc ggccgccgtg 120cgcctcgacg atgcgcttcg ccaccgcgag
gccgaggccc gtgcccggga tggatccaga 180cgtggacttg agccgccgga acggctcgaa
gaggtgcgcc agatcctcgg gctcgatccc 240gagcccgcga tcgcgaacgg cgatctcggc
cccctcgccg ccggcgcgga ccgccacgtc 300gacctgcccc ccggcggggg agtacttgag
cgcgttcgac aggaggttgt tcagcacctg 360ctcgatccgg gtcgcgtcgc agcggacgag
caccggtgtc tcggggagcg agagctcgat 420ggggtgctcc ggcgagacag ggcgatagag
gtccaccgcc tcctgcgcga gatcgcgcag 480gtcgcgctcc tccacccgga gatcgagctt
gcaggcctcg atctgcgacg cgtcgaggag 540gtccccgacc atgcgatcga gccggtcgac
ctgccgcccg acgagcgcca tggtccggcg 600cacgctcgac tccaggggcc ggttgtcggc
gtcgaggacg tgcacggaca gccggagcgc 660cgacagcggg ttcctgaggt cgtgggccac
gccgccgagg aacgcgaact gcgcctcgcg 720ctggcgctcc agcgactctg ccatgtcgtt
gaaggcgcgc gcgatctccc cgagctcgcg 780cggcccgatc agcggcgcgc gcgcggcgcg
gtcgcccgcg ccgtagcgcc cgatggcctc 840ctggatcgcg acgatggggc ggtagatgag
ccgccgcgcg ctgaggagga tcgtggacgc 900gcccgcgagg aagaacacca ccgccgcgag
gccggcgccg gtcgtgcgcc gggtcaggtg 960cgcgacgagc gcctccgacg cgcgggcctg
ctcgaggttg atctcgacca ggtgatcgag 1020cgccctgaac gcctcgtcga gcgcggggtc
gtgcacgccg agcagcgcgg gatcgtgcgc 1080gccaggcgcc gacgggagct cgtgggcgtc
ggcggcgcgg cgccgggcga ggtagtcctc 1140cacgcgccgc tccgcgtgct cgaggatcct
gccctcctcc gggctgctca cgtggtcgcg 1200cgccgccgcg aggccgctcc tcaggccttg
ctcccacgcc gccagggagg gggccagctc 1260cccgcggccg gagccgaccg cgcggctgct
ctggtgcgcg tcgagcagga ggtcgatctc 1320cagcctctcc acgagccgga cgctctcgac
cgtggcgccg aggatccggg tggtctgttg 1380catggtcgtc gacgcgacca tcagcgcgcc
cgcgacaacg atggccacgc tcgtgagaag 1440aagcgtggcg gccccgagga gcgcgctcag
gcgcacgggc cgcggaagac ggggccagct 1500caggccctgc ggagttggct gtcgcat
1527141251DNASorangium cellulosum
14atgcccgccc gcaccccccg caagcccccg ccgcccgcct cgcccgctgg tcccgccggc
60gcgccggacg acctcaccga cagcgatcgc gacgcgctgc tgcgctggcg gctcgcgctc
120gggcccgagg ccgagcgggt cgacccgcgc ctctccctcg gcgggctcgg gggcgcggcg
180cccgcgctcg acgtcgacgc gcggcggctc ggcgacctcg acaaggcgct ctcgttcatc
240tacgacgagc gcgccggcgg cctcggcggc tcgcggccct acgtgcccga gtggctctcc
300gccgtgcgcg agttcttcag ccacgaggtc gtcgccctcg tccagaagga cgccatcgag
360cgaaaggggc tgacgcagct cctcttcgag cccgagacgc tgccgttcct cgagaagaac
420gtcgagctcg tcgccacgct catgagcgcc aagggcctca tcccggacgc cgcgcgggac
480accgcccggc agatcgtgcg cgaggtcgtc gaggaggtgc ggcgcgcgct cgaggccgag
540gtccgcaccg ccgtcctcgg cgcgctgcgc cggaacacga cgagcccgct gcgcgtcctc
600aggaacctcg actggaagcg caccatccgc aagaacctga aggggtggga cgcggagcgg
660cgccgcctcg tccccgacaa gctctatttc tgggcgaacc agacgcgacg gcacgagtgg
720gacgtcgcca tcctcgtcga ccagtcgggc tcgatgggcg agagcgtcgt ctacagctcc
780atcatggccg cgatcttcgc gtcgctcgac gtcctccgca cccggctcct cttcttcgac
840accgaggtcg tcgacgtgac tccgatgctc gtcgatccgg tcgacgtgct gttcacggcg
900cagctcggcg gcggcaccga catcaaccgc gccgtggcct acgcccaggc gaacttcatc
960gagcggcccg agaagacgct gctcatcctg atcaccgacc tgttcgaggg cggcaacgcc
1020gaggagctcg tcgcgcgcat gcgccagctc gccgacagca aggtgaagtc gatctgcctg
1080ctcgcgctgt cggacggcgg aaagccctcg tacgaccacg agatggcgca gaagctcgcc
1140gcgctcggga ccccgtgctt cggctgcacg ccgaagctcc tcgtcaaggt ggtggagcgg
1200ctcatgcgag gtcaggacct cggcccgctg ctcggcgccg aggcgcggtg a
1251151059DNASorangium cellulosum 15tcagggcgcg gcgagcggca gcctgcgtgc
cgggcgcgcg gcctcgtgtc cgtcccccgc 60ctcggccacc cgcccgcggt agatgcgatc
gatccgatcg cgcgcgatga ccaggggctt 120gtcgaaccgg ccaagcacgt tgcccttcag
gatcccgcgc ttgtccgtca agcggtccag 180caaccgcata tcgaggcgca gctcgatgtt
catggccacc tgcatggcgg gccagaggac 240ggcgccggcc ccgaacttgc tccagggcgc
caggctcgcg aagagaaacg tatacatctc 300cgacgactcc gggcccaccg ggttgaagaa
gaccgctgac cggagcggga aggtgacggg 360ctgattggtc ttcggatccc tgagggagtg
gttgtagatc gtgtagaccg gcgagaagta 420ggatgtccag tccaccacga atatcgcatc
ctccgggatg ccgagcagct tctccatcgc 480ccgcggcatg ggccgcctcg gacccgaatg
cacgacccgg atcgtttcgt cggtcagggt 540cacccgcgcc tcgacctctg gcatccgctc
gagcgggtag ccgagcatga agtggacgaa 600gggcgtgtgc tcgatctcga tgaaattgtc
gagcgccagc tcgaacggca cggtcgcgcg 660gtggcggagg agaccgcgcg gcacatatcc
ctcgccctcg aggcgcggga acgctgcctg 720cgaccccgcc cgcttcaccc agatggcacc
gtaccgctcc acggcctcga acatgtcctc 780gcgccgcgcg cacggccgcg ccgccggggt
agccgggatc tcgccgcggc cgtccacggc 840ccaacgccag ccatggtagg cgcacaccag
ccgatcgccc tcgacccacc cctcgctcag 900gcgcatgctg cggtgggggc aacgatccgt
gaatgcaccg aggccgcccg acgaggtccg 960aaacaccacg atctcatgcc ccgcgagccg
cacattgcgg ggcttgcggc ggagctcgtg 1020gctcagcagt acagggtgcc agtggtcgag
ctcagccat 1059161131DNASorangium cellulosum
16tcagttcacc ccttggatgt gccgcgcaat ccgcggcgcc tcggctgcga tgtcgcggat
60ctgccccgtg atgggattgc ggaagccgat gaagaacagc ccaggcgccg gcgtcggcgc
120gccgtgccac cgcgggcagc cgtgctcgtc cgtgtagcgc gttgcattct cgagaaaatc
180atcgagcccg ggccggtacc ccgtggcgag caccacgacg tcgaagggca gcccacggcc
240gtccgtgaac gtcacgcccg tttccgtgaa tgcccgcggg ccgggcacca ccttgatctt
300gccctgctgg atcagcgcca ccgtgccgat gtcgatcaac ggcatgcggc cttccttcaa
360cgcccgggta ccggggccga ccgcgggccg acggatcccc cagcgcgaca gatcccccac
420ggcgcgagac aggatcgcgg tcgcgaggcg atccccgacg gccagcggga ggcgctcgaa
480gagggcaagg gcgttgaact gcgcaggcag cttgaacagc tcgcggggga tcacgtggtt
540gccgctgcgg accgagaggg tcgtctccgc gcaatgctcc cacagatcca gcgcgatctc
600gctggcggag ttgccggcgc ccaccacgag cacgcgctgg ccccggaatt ccgcaccaga
660tcggtaggca gagctatgaa ggatgcgacc gcggaagcgc tcctggtcgg gccaggtggg
720gacgttggga tgacggctgt agccggtggc cacgacgagc gcctggctcc tgagctcccc
780cgcgtgcgtt cgggtcaccc accgcgatcc gtcgtggtac gcgcgctcca cctcgacacc
840caggcgcggc tccaggcgga atcgctcggc gtaacgctcg aggtaatcga ccatctccac
900ccgggaggga tacggcgcag aatactcggg ccagggctgc ccgggcagcg cggagagctg
960cttgatcgtg ttgaggtgca gccggtcgta gtggcgccgc cacgtggcgc cgacggcctc
1020cgacttctcg aggagaacga acgggattcc ctgctcgcgc aggcatgcgc ccaccgctag
1080cccagacgga ccagcgccga cgataaccac atggcactct tcaacgtgca c
1131171071DNASorangium cellulosum 17tcacgcactc gcatgcccga cgcccgtgcc
ttctgcctcg ccccgcgtct cgccgaagta 60gatggagcgc atcaggcggt ggttgtgcac
gagcgtcgcg tcgtatttgt tgagccgcat 120ccctttcatc tcgaagggcg tatcggccac
gtgcgggatg aacttcacat cgtcgcggat 180ctccttccag gagagcgcta tcgccgccga
tttgacgacc ggaagcagcg gacggaagcg 240gggatcggtg atcttgacga acaggaacgc
gcgcacgaac gtggtgcgct ccgtctctgg 300cacgaagaag atgccggcgc gcgccacgac
aggacgctcc atcccgttct gcgccgtcca 360ccaggacgtg tacacggtgt agacggggct
gaagcgggtc acccactggt tgtgaaatgt 420gtcgcctggc tggagcagca tcagccgcgc
gagcgtcgag gggcgctgcg gcgccgagta 480cttgacctcg gtgcggtcct cgaagacgtc
gcacgagaag tcgatgcgcg ccgcgtcctc 540gggcgtccag ccgaggcggc cgtgaacgaa
cggcgtgtgc tcgtcctcgg aggaattgtc 600gaagatgacg tgcaggggcg ccggcgcgag
gtgcgagaag gtgccggcat attcgaagcc 660atcgctgctg aagtcgagct cgggcagcgc
cgagcgcggc gtatcccggt gggctagcca 720caggtatcca agctgctcga cgagctgaaa
ggagcgtgta tcgcatcggg tgagcgacgg 780ttgcgagggg caggctcccc gcccctcggc
gtcgaaatgc cacccgtgat aggggcattc 840caggcgcccg tccggccgga cacgcccctg
cgatagcggc gcgagccggt gggggcacgc 900atcggcgagc gcggcggggc ggccctgctc
atcgcggaag agagcgtaag cattgcccgc 960aaggacaacg cgaaccggct tccggccgag
tttcgaggcc ggcaagacgg ggtgaaaatg 1020gcggatgagg tcgcgagcag gcgcggcgtg
cattgcgaga ccataacaca t 1071181188DNASorangium cellulosum
18ctatcggtag gcgacgatgc caccgaacgg ccacttcgcg tggtcctcgg gcgccggata
60gacctcccat tcggagaacc cggccgcgcg gagcgacagc tcccactctt gcagcgtcag
120gtaaccgaca tgctggcggc gaggcggatc gagcttggcc ttgctgtagg tgtgcagcat
180cgactgaaaa aattcattgg ggaagaacac cccgggccga tcgcggaacg acatggtgaa
240cgcgagctga ccgcccggct tcagcatcgt gtggaacgcc tggagggtgg cgtgaagatc
300gcgcacgtcg tagagcacgt gctcgaggac gatcagatcg accgacgcgg cccgggcgaa
360cgtgctgcca gcggagggca gcgtgtccag gtccaggcgc tggaaatgaa tgcgctgaaa
420cacgtcggcc ggcgcgtggg tccgcagcca ctgcttcccc gtctccatca acagggcgct
480gatgtcggtg taatcgtagc gggcgaggtt cttgctcagc gggaggaacc gcggatcgga
540caacgcctgc cgcagcacca cgccgagccc cgcgcccccc tcgaatacag agatccccgg
600cccctctgcg agcttggcca tcagcgcccg cgccagcatc acgttgcatg gcttcttggc
660gggaaggctg atcatcgagt attcccagaa tttcagcgag gcctgcatcc cgtactggag
720atccatggtg gccagcgcgt ccttgcccgc cagcaccggc ccggccaggc cccgatagcg
780ctggaggaac tcgaccatct cgcccaggat cgcgcggtct gcgagcgcga tggactcctt
840ctcggcgacg cgctttcgca ccgcctcgct gggcaccagc cgcccgctgg ggtcctgggt
900gaggtctccc ttgtcgctga agtagtcgag cagcttcctg cgaaactgat aggcggtgac
960cgacggagcc gactccggac gatcgtcgag cccccggaca gcgccgctcg ggtcgacgag
1020gtgctcgagc aggatctcgc tggcaacaag ctcggtctga cgacggaatg cttctatgta
1080agcggtgtaa gcgtcgttgt agagatcggt cacgtccaat cgttgtcgca tgcaggtcct
1140cgcgggtgtg gcgcccatcc tgcgcagcgc agggacgaag caggtcat
118819255DNASorangium cellulosum 19tcacctggag ctcagcgcct gcccgtcgtt
cccgcggttc ttgtgcacaa tggcgtacag 60gatgagcatg taggcgaaga gccggaacag
gtacaggtaa tggatggcgt cttcctcgac 120gcgattcagg gcgacggcga tgcggcccag
catcatcagc cagaacgccg ccgagaactt 180cgcgaacagc cggtcgcccg tcttcttcca
gaagcggagg aagaagagcg cgacggtcgc 240gtacccgaac gtcat
25520261DNASorangium cellulosum
20ttactcgcgc aggtcccaga tgaggccata aaggagcagg gccagcccga tgagcgcggt
60gaggtggcgc agcgatgata gatcgacgct ccggatcacg acgaggtcca cgaagagcag
120gatgttgttc gctgcgagcg cggcgaagca gagcccgctc cacaagagga gacggacctt
180gcgctgcgcg tatccgcgca ggagcagcac ggcgcacgcg atgctggtca gggcgcagag
240gatgtagacc gccgctgcca t
26121402DNASorangium cellulosum 21ctagccgccc tttcccttct tcgtgatcag
gaatgcgtcc gagaagctct ggatgtcgct 60cggcgggggc gtggcgtaga tgtgattgat
cacgctcagc cggcgctcct tgtacgcctg 120cgccaggtcg tcgatcgtcc ggcgggtctc
atcgtctgcc ggggcgtacc ggtagaagat 180gtcctccccg tcctcccggg ccacgatcag
gcccctgctg gccaggcctc cgaaccggtc 240ctggatcgac atcatgctgg accctatctc
gcgcgccatc gcggccgcgc tccactcgcg 300ctccgccgtg cgacgcatga gcagaagcac
ttcgagttgc tcgatcgagg agatgtgcgc 360gccgaggaag cgctggaccc ggtcggggag
cccgctagac ac 402225289DNASorangium cellulosum
22tcaccggtgc aaccatagcc gcagcatagc gagcaggtgc tcgggatcca ccggcttcga
60gatgtaatcg ttcgcgcccg cctcgaagca cttctcccgg tcgcccttca tcgccttggc
120cgtgaccgcg atgatgggca gcgcatggtg ctcgggcttc gcgcggatgg cacggatcgt
180gtcgtagccg tccatctctg gcatcatgat gtccatgagc acgatctcga tgtccggcgt
240ccgctgcagc atctcgatcg ccgctctgcc cgtctccacg tagaccgtct tcatctgctg
300ggcgtcgagg atggtcgtca tcgcgaagat gttccggacg tcgtcgtcga cgaccagcac
360cttcttgccc gcgagcacct tgttcgactg gtgcagctcc tggagggtct gccgctgtcg
420ctcggagagc gccgccacag ggcggtgcag gaacagggag acgtcgtcga agagccgctc
480cttggagcgg acgtgcttga gcaccatcag ctggctgaag cggctcagct gcgcctcgtc
540cgcggccgag atctcctccg gcgcgtagac caggacgggc agctccgtcg gcccgctgcc
600ctgcgcgagc tgcccgatca gatcgaagca gcgcatgtcg ggcaggtcga ggtgcaggat
660gaggacatcg gccccctcgg tgaggagcgc gtcgagcgcc tcctccccgg aggccacgct
720ccggatcgtg acgtcgtcgc cgccgaggag ctcgacgagc tcctggcgct cggcctcgtc
780cggctcggcg agcacgaccg tccgccggcg cgacaccatg aactgcgaga ggcgcctgaa
840ggtctcgtcg agcgcgtccc gggtcttgag cggcttgcag agcacccccg tcgcgcccat
900ccggagcgcg cgctcgcgct cctcgtccgt cgtgatcacc tggacgggga tgtgccgcgt
960cgcgaggtcg cgcttcaccc ggtcgagcac gcgccagccg tccatgtccg gcaggttgat
1020gtcgagcgtg atcgcgttca cccgccgctc gcggacgatg gagagcgccg ccccgccgcg
1080gtaggcgagg atcgccttga acccgtggtc gtgcgcgaca tccatgacga agtgcgcgaa
1140gctcgcgtcg ttctcgacga tgagcaccac ggagtcgctg ggctggaggc tcgcgctgtc
1200gtcgacgctc tggttgagca ggtgcggcgg cggctcggcc gccgaccgcg gcgcgacgtc
1260gcccgagacg agggccggcg gcgccgaggg cacctccgcg gcctgctcct tcctgcgcgg
1320gcgcgccggc gtgtacgtga gcggcaggta aagcgtgaag gtgctcccgc tccccggcct
1380gctcgagagc ttgatctcgc cgccgagcat ccacgcgatc tcgcggctga tcgcgagccc
1440gaggccggtg ccgccgtact tccggctcgt cgagccgtcc gcctgctgga aggcctcgaa
1500gatgatctgc tgcttgtcgt gcgggatgcc gatgcccgtg tcccgcaccg acatggcgat
1560cgccgcgccg gcgcgcgaga ggccctcgtt ctcgatggtc caccccgagg tgaccagatc
1620gacgtcgagc gcgacgctgc cgcgctccgt gaacttgaag gagttcgaga gcaggttctt
1680gagcacctgc tgtacgcgct tcgcgtccgt gtagatgacc tgcggcaggt tctgcgcgaa
1740gttgagctcg aactcgagcc tcttcgactc ggcgacgtgc tggaacgtgc gctcgacgta
1800gtcttgcagg tcgctgaacg acagctcgcc cacgtcgacg atcacggtcc ccgactcgat
1860cttggacagg tccaggatgt cgttgatcag cgcgagcagg tcgttgcccg acgagtggat
1920cgtcttggcg aactcgacct gccgccccgt gaggttgcgg tcggtgttct tcgagagctg
1980atcggacagg atgaggaggc tgttcagcgg cgtccggagc tcgtgcgaca tgttcgcgag
2040gaactccgac ttgtacttgg aggtgatggc gagctgccgc gccttctcct cgagcgcctg
2100ccgcgcctgc tcgacctcgc ggttcttccg ctcgacctcg acgttctgct gggcgagcag
2160gcgagccttc tccccgagct cggcgttcgt ctgctgcagc tcctcctgct ggctctggag
2220ctcgcgcgcg agggactgcg actgcttgag caggtcctct gtgcgcatgt tcgcctcgat
2280cgtgttgagc acgatcccga tcgactccgt gagctggtcg aggaacgcct ggtgggtcgg
2340gctgaatcgc tcgaacgacg cgagctcgat gaccgccttg acctgcccct cgaagagcac
2400ggggatgacg atgatgttga ccggcggcgc ctcgccgagc ccgctcgtga tgcggatgta
2460gtcggggggc gcgttgacga ggaggatctt ctccttctcg agcgcgcatt gcccgacgag
2520cccttcgccg agcttgaaat ggttgtcgac gtgcttccgc accttgtacg cgtagctcgc
2580gaggagcttg aggatcggct cctccttcgc cacgtccatc gtgaagaaca cgccctgctg
2640cgcgccgacg accggggcca gctcggacag gatgagccga ccgacagtga gcagatcctt
2700ctgcccctgg agcatgcgcg agaacttggc gaggttggtc ttgagccagt cctgctcgct
2760gttcttcagc gtcgtgtcct tgaggttccg gatcatctca ttgatggtgt ccttgagcgc
2820cgcgacctcc ccctgcgcct cgaccttgat ggaccgggtg aggtcgccct tggtcacggc
2880ggtggcgacc tcggcgatcg cgcgcacctg cgtggtgagg ttcgcggcga gccggttcac
2940gttgtcggtc aggtccttcc acgtgccggc cgcgccgggg acgctcgcct gaccgccgag
3000cttgccctcg acgccgacct cgcgcgccac cgttgtcacc tggtcggcga aggtcgcgag
3060cgtctcgatc acgccgttga tcgtgtccgc cagcgccgcg atctcgccct tcgcgtcgaa
3120ggccagcttg cgcttcaggt cgccgttcgc gaccgcggtc acgaccttgg cgatgccgcg
3180cacctggttc gtcaggttgc cggccatgaa gttcacgttg tcggtcaggt ccttccacgt
3240gccggcgacg ccggggacgc tggcctgccc gccgagcttg ccctcggtgc ccacctcgcg
3300cgccacgcgc gtcacctccg acgcgaacgc gttgagctgg tccaccatcg tgtagttgat
3360ggtgttcttc agctccagga tctcgccgcg gacatcgacg gtgatcttct tcgacaggtc
3420gccgttggcc acggccgttg tgacggcggc gatgttgcgc acctgcgcgg tcaggttcga
3480cgccatcgag ttgacggagt cggtcaggtc cttccacgtg ccggcgacgc cggggacgct
3540ggcctggccg ccgagcttgc cctcggtgcc cacctcgcgc gccacgcgcg tcacctccga
3600cgcgaacgag cggagctgat ccaccatcgt gttgaaggtg tccttcagct ccaggatctc
3660gccgcggaca tcgacggtga tcttcttcga caggtcgccg ttggccacgg ccgttgtgac
3720ggcggcgatg ttgcgcacct gcgcggtcag gttcgacgcc atcgagttga cggagtcggt
3780caggtccttc cacgtgccgg cgacgccctt cacctcggcc tgcccgccga gcttgccctc
3840ggtgcctacc tcgcgcgcga cgcgcgtcac ctcggccgcg aaggagctga gctgatccac
3900catcgtgttg aaggtgttct tcagctccag gatctcgccc ttgacgtcga cggtgatctt
3960cttcgacagg tcgccgcggg ccacggccgt ggtcacgtcg gcgatgttgc gcacctgcgc
4020ggtcaggttc gacgccatcg aattgacgga gtcggtcagg tccttccacg tgccggcgac
4080gccggggacg ctggcctggc cgccgagctt tccctcggtg cccacctcgc gcgccacgcg
4140cgtcacctcc gacgcgaacg agcggagctg atccaccatc gtgttgaagg tgtccttcag
4200ctccaggatc tcgccgcgga catcgacggt gatcttcttc gacaggtcgc cgttggcgac
4260ggccgtggtg acggcggcga tgttgcgcac ctgcgcggtc aggttcgacg ccatcgagtt
4320gacggagtcg gtcaggtcct tccacgtgcc ggcgacgccg gggacgctgg cctggccgcc
4380gagcttgccc tcggtgccca cctcgcgcgc cacgcgcgtc acctccgacg cgaacgagcg
4440gagctgatcc accatcgtgt tgaaggtgtc cttcagctcc aggatcttct tcgacaggtc
4500gccgttggcc acggccgttg tgacggcggc gatgttgcgc acctgcgcgg tcaggttcga
4560cgccatcgag ttgacggagt cggtcaggtc cttccacgtg ccggcgacgc ccttcacctc
4620ggcctgcccg ccgagcttgc cctcggtgcc tacctcgcgc gcgacgcgcg tcacctcggc
4680cgcgaaggag ctgagctgat ccaccatcgt gttgaaggtg ttcttcagct ccaggatctc
4740gcccttgacg tcgacggtga tcttcttcga caggtcgccg cgggccacgg ccgtggtcac
4800gtcggcgatg ttgcgcacct gcgcggtcag gttcgacgcc atcgaattga cggagtcggt
4860caggtccttc cacgtgccgg cgacgccggg gacgctggcc tggccgccga gctttccctc
4920ggtgcccacc tcgcgcgcca cgcgcgtcac ctccgacgcg aacgagcgga gctgatccac
4980catcgtgttg aaggtgtcct tcagctccag gatctcgccg cggacatcga cggtgatctt
5040cttcgacagg tcgccgttgg cgacggccgt ggtgacgtcg gcgatgttgc ggacctgcgc
5100ggtcaggttc gacgccatcg agttgacgga gtcggtcagg tccttccacg tgccggcgac
5160gcctgtcacc tcggcctgcc cgccgagctt gccctcggtg cctacctcgc gcgccacgcg
5220cgtcacctgg gccgcgaagg agcggagctg atccaccatc gtgttgaagg tgttcttcag
5280ctccaggat
5289231075PRTSorangium cellulosum 23Met Pro Asp Thr Ser Ser Ser Ser Pro
Val Met Ala Met Gly Leu Ser1 5 10
15Asp Ser Lys Ala Arg Ser Val Glu Asp Ala Arg Pro Ala Ser Gly
Leu 20 25 30Pro Arg Pro Pro
Ala Gly Ile Ala Val Val Gly Met Gly Cys Arg Phe 35
40 45Pro Gly Gly Ile Asp Ser Pro Gly Ser Leu Trp Ala
Ala Leu Ser Gln 50 55 60Gly Arg Asp
Leu Ile Ser Glu Val Pro Pro Asp Arg Trp Asp Val Asn65 70
75 80Ala His Tyr Asp Ala Asp Ala Ser
Val Pro Gly Lys Ile Ala Thr Arg 85 90
95His Gly Gly Phe Leu Ala Gly Val Ala Ala Phe Asp Ala Pro
Phe Phe 100 105 110Asp Leu Ser
Pro Arg Glu Ala Lys His Met Asp Pro Gln Gln Arg Leu 115
120 125Gly Leu Glu Thr Ala Trp Glu Ala Leu Glu Asp
Ala Gly Leu Asp Ala 130 135 140Arg Ser
Leu Arg Gly Ser Arg Ala Gly Val Phe Val Gly Ser Met Trp145
150 155 160Ala Glu Tyr Asp Val Leu Ala
Ser Arg His Pro Glu Ser Ile Ser Pro 165
170 175His Gly Ala Thr Gly Ser Asp Pro Gly Met Ile Ala
Ala Arg Ile Ala 180 185 190Tyr
Thr Phe Gly Leu Arg Gly Pro Ala Leu Ser Val Asn Thr Ala Ser 195
200 205Ser Ser Ser Leu Val Ala Val His Leu
Ala Leu Gln Ser Leu Gln Ser 210 215
220Gly Glu Cys Glu Leu Ala Leu Ala Gly Gly Ala Asn Leu Ile Leu Thr225
230 235 240Pro Tyr Asn Thr
Ile Lys Met Thr Lys Leu Gly Thr Met Ser Pro Asp 245
250 255Gly Arg Cys Lys Ala Phe Asp His Arg Ala
Asn Gly Tyr Val Arg Ala 260 265
270Glu Gly Val Gly Phe Val Val Leu Lys Pro Leu Ser Arg Ala Thr Ala
275 280 285Asp Gly Asp Arg Ile Tyr Ala
Val Val Arg Gly Ser Ala Val Asn Asn 290 295
300Asp Gly Leu Thr Asp Gly Leu Thr Ala Pro Ser Gly Glu Ala Gln
Glu305 310 315 320Ala Val
Leu Arg Glu Ala Tyr Ala Arg Ala Gly Val Ser Pro Ala Glu
325 330 335Val Asp Tyr Val Glu Ala His
Gly Thr Gly Thr Pro Leu Gly Asp Arg 340 345
350Val Glu Ala Thr Ala Leu Gly Arg Val Leu Gly Ala Gly Arg
Ala Ala 355 360 365Asp Arg Ala Leu
Arg Val Gly Ser Val Lys Thr Asn Leu Gly His Ala 370
375 380Glu Ala Ala Ala Gly Val Ile Gly Leu Met Lys Thr
Ala Leu Ser Leu385 390 395
400Arg His Gly Ser Leu Pro Ala Ser Leu His Val Glu Arg Pro Asn Pro
405 410 415Glu Ile Pro Leu Glu
Ser Leu Gly Leu Arg Leu Gln Thr Ala His Gly 420
425 430Val Trp Pro Glu Val Asp Arg Pro Arg Arg Ala Gly
Val Ser Ser Phe 435 440 445Gly Phe
Gly Gly Thr Asn Cys His Val Val Ile Glu Glu Trp Arg Gly 450
455 460Gly Leu Gln Gln Ser Ala Ala Glu Ala Gly Ser
Asp Pro Gly Ala Ala465 470 475
480Val Pro Pro Pro Gly Leu Pro Leu Val Leu Ser Ala Arg Asp His Gly
485 490 495Ala Leu Arg Ala
Gln Ala Gly Arg Trp Ala Ala Trp Leu Thr Glu His 500
505 510Arg Glu Ala Arg Trp Ala Asp Val Val His Thr
Ala Ala Val Arg Arg 515 520 525Thr
His Leu Gly Ala Arg Ala Ala Val Met Ala Ala Gly Val Ala Glu 530
535 540Ala Val Asp Ala Leu Lys Ala Leu Ala Asp
Gly Arg Ala His Gly Ala545 550 555
560Val Thr Val Gly Glu Ala Arg Glu Arg Gly Lys Val Val Phe Val
Phe 565 570 575Pro Gly Gln
Gly Ser Gln Trp Pro Ala Met Gly Arg Ala Leu Leu Ser 580
585 590Ala Ser Lys Val Phe Ala Glu Ala Val Glu
Ala Cys Asp Ala Ala Leu 595 600
605Arg Pro Leu Thr Gly Trp Ser Val Leu Ser Leu Leu Arg Gly Asp Ala 610
615 620Gly Glu Ala Ala Pro Ser Leu Asp
Arg Val Asp Ala Val Gln Pro Ala625 630
635 640Leu Phe Ala Met Ala Val Gly Leu Ala Ala Val Phe
Arg Ala Trp Gly 645 650
655Leu Asp Pro Ser Ala Val Val Gly His Ser Gln Gly Glu Val Pro Ala
660 665 670Ala Tyr Val Ala Gly Ala
Leu Ser Leu Asp Asp Ala Ala Arg Val Val 675 680
685Ala Val Arg Ser Ala Leu Val Arg Arg Leu Ala Gly Ala Gly
Ala Met 690 695 700Ala Ala Val Glu Leu
Pro Ala Gly Glu Val Glu Arg Arg Leu Ala Pro705 710
715 720Phe Gly Gly Ala Leu Ala Ile Ala Val Val
Asn Thr Ser Ser Ser Thr 725 730
735Ala Val Ser Gly Asp Ala Glu Ala Val Asp Arg Leu Val Ala Gln Leu
740 745 750Glu Ala Glu Gly Ile
Phe Cys Arg Lys Val Asn Val Asp Tyr Ala Ser 755
760 765His Ser Ala His Val Asp Val Val Leu Pro Glu Leu
Leu Glu Arg Leu 770 775 780Ala Pro Val
Arg Pro Gly Ala Thr Arg Ile Pro Phe Tyr Ser Thr Val785
790 795 800Thr Gly Gly Val Leu Glu Gly
Thr Ala Leu Asp Gly Ala Tyr Trp Cys 805
810 815Arg Asn Leu Arg Gln Pro Val Arg Leu Asp Arg Ala
Leu Ala Arg Leu 820 825 830Leu
Asp Asp Gly His Gly Val Phe Val Glu Val Ser Ala His Pro Val 835
840 845Leu Ala Ser Pro Leu Thr Ala Ala Cys
Ala Glu Arg Glu Gly Val Val 850 855
860Val Gly Ser Leu Gln Arg Asp Asp Gly Gly Leu Ala Arg Leu Leu Gly865
870 875 880Ser Leu Gly Ala
Leu His Val Gln Gly Gln Pro Val Asp Trp Arg Ala 885
890 895Val Leu Ala Pro Phe Gly Gly Ser Leu Val
Asp Leu Pro Thr Tyr Ala 900 905
910Phe Gln Arg Gln Arg Tyr Trp Phe Asp Thr Asp Glu Ser Val Ala Leu
915 920 925Ala Ala Ala Ser Ser Val Ala
Glu Glu Ser Trp Ser Glu Lys Leu Ala 930 935
940Gly Leu Ser Ser Ala Arg Arg Glu Glu Arg Leu Leu Glu Trp Val
Arg945 950 955 960Ala Glu
Ile Ala Ala Val Leu Gly Leu Glu Ala Pro Ala Val Pro Pro
965 970 975Asp Val Leu Leu Arg Asp Leu
Gly Leu Lys Ser Pro Ile Ala Val Glu 980 985
990Leu Gly Ser Arg Leu Gly Arg Arg Thr Arg Arg Lys Leu Pro
Val Thr 995 1000 1005Phe Val Tyr
Asn His Pro Thr Pro Arg Ala Ile Ala Arg Ala Leu 1010
1015 1020Leu Glu Gly Met Phe Ser Ser Ile Lys Asp Ser
Ala Ser Ser Ala 1025 1030 1035Ala Asp
Asp Arg Arg Pro Pro Gly Val Leu Glu Asp Val Ala Pro 1040
1045 1050Pro Gln Ala Leu Glu Thr Ser Glu Met Ser
Asp Asp Glu Leu Phe 1055 1060 1065Gln
Ser Ile Asp Ala Leu Val 1070 1075243679PRTSorangium
cellulosum 24Met Asp Arg Ser Asp Lys Leu Arg Ala Tyr Leu Glu Lys Thr Thr
Ala1 5 10 15Ser Leu Val
Glu Ala Lys Gly Arg Ile Arg Glu Leu Glu Ala Arg Ser 20
25 30Arg Glu Pro Ile Ala Ile Val Ala Met Ala
Cys Arg Phe Pro Gly Gly 35 40
45Val Asp Ser Pro Glu Lys Leu Trp Ala Leu Leu Asp Glu Glu Arg Asp 50
55 60Ala Ile Thr Glu Val Pro Pro Ser Arg
Trp Asp Leu Glu Arg Phe Tyr65 70 75
80Asp Pro Asp Pro Asp Ala Ala Gly Lys Thr Tyr Ser Arg Trp
Gly Gly 85 90 95Phe Val
Gly Asp Leu Asp Arg Phe Asp Ala Ala Phe Phe Gly Ile Ser 100
105 110Pro Arg Glu Ala Arg Ser Ile Asp Pro
Gln Glu Arg Trp Leu Leu Glu 115 120
125Thr Thr Trp Glu Ala Leu Glu Arg Ala Gly Val Arg Ala Asp Thr Leu
130 135 140Glu Gly Thr Leu Gly Gly Val
Tyr Ile Gly Leu Ser Gly Ser Glu Tyr145 150
155 160Gln Thr Glu Ala Phe His Asp Ala Glu Arg Ile Asp
Ala Tyr Ser Leu 165 170
175Thr Gly Ala Ser Pro Ser Thr Thr Val Gly Arg Leu Ala Tyr Trp Leu
180 185 190Gly Leu Arg Gly Pro Ala
Val Ala Val Asp Thr Ala Cys Ser Ser Ser 195 200
205Leu Val Ala Val His Leu Ala Cys Gln Ala Leu Arg Asn Gly
Glu Cys 210 215 220Asp Phe Ala Leu Ala
Gly Gly Val Asn Ala Leu Leu Ala Pro Glu Ser225 230
235 240Tyr Val Ala Phe Cys Arg Leu Arg Ala Leu
Ser Pro Thr Gly Arg Cys 245 250
255Gln Thr Phe Ser Ala Asp Ala Asp Gly Tyr Val Arg Ala Glu Gly Cys
260 265 270Gly Val Leu Leu Leu
Lys Arg Leu Ser His Ala Gln Arg Asp Gly Asp 275
280 285Arg Val Leu Ala Val Ile Arg Gly Asn Ala Ile Asn
Gln Asp Gly Arg 290 295 300Ser Gln Gly
Leu Thr Ala Pro Asn Gly Leu Ala Gln Glu Asp Val Ile305
310 315 320Arg Arg Ala Leu Ser Gln Ala
Ala Val Glu Pro Thr Thr Val Asp Val 325
330 335Val Glu Cys His Gly Thr Gly Thr Ala Leu Gly Asp
Pro Ile Glu Val 340 345 350Gln
Ala Leu Gly Ala Val Tyr Gly Asp Gly Arg Pro Gly Asp Arg Pro 355
360 365Leu Val Ile Gly Ser Val Lys Thr Asn
Ile Gly His Thr Glu Ala Ala 370 375
380Ala Gly Met Ala Gly Leu Ile Lys Ala Val Leu Ser Leu Gln His Ala385
390 395 400Gln Val Pro Arg
Ser Leu His Phe Ala Ala Pro Ser Pro Tyr Ile Pro 405
410 415Trp Asp Thr Leu Pro Val Arg Val Ala Ala
Gln Arg Val Ala Trp Glu 420 425
430Arg Arg Glu His Pro Arg Arg Ala Gly Ile Ser Ser Phe Gly Ile Ser
435 440 445Gly Thr Asn Ala His Val Ile
Leu Glu Glu Ala Pro Glu Ala Pro Ala 450 455
460Thr Ala Pro Glu Ala Ala Ala Val Thr Ser Thr Leu Pro Leu Leu
Val465 470 475 480Ser Gly
Arg Asp Glu Ala Ala Leu Arg Ala Gln Ala Glu Arg Trp Ala
485 490 495Ala Trp Leu Ala Ala His Pro
Glu Ala Arg Trp Ala Asp Val Val His 500 505
510Thr Ala Ala Val Arg Arg Thr His Leu Glu Ala Arg Ala Ala
Val Ala 515 520 525Ala Gly Asn Ala
Ala Asp Ala Ala Ala Ala Leu Gly Ala Leu Ala Ala 530
535 540Gly Gln Pro His Lys Ala Val Ser Leu Gly Glu Ala
Arg Ala Arg Gly545 550 555
560Asp Val Val Phe Val Val Pro Gly Gln Gly Ser Gln Trp Pro Ala Met
565 570 575Gly Arg Ala Leu Leu
Ala Glu Ser Glu Val Phe Ala Ala Ala Val Ala 580
585 590Ala Cys Asp Ala Ala Leu Arg Pro Phe Thr Gly Trp
Ser Val Leu Ser 595 600 605Val Leu
Arg Gly Glu Gln Gly Glu Ala Val Pro Pro Ala Asp Arg Val 610
615 620Asp Val Val Gln Pro Ala Leu Phe Ala Met Ala
Val Gly Leu Ser Ala625 630 635
640Val Trp Arg Ala Trp Gly Ile Glu Pro Ser Ala Val Val Gly His Ser
645 650 655Gln Gly Glu Val
Ala Ala Ala Tyr Val Ala Gly Ala Leu Thr Leu Glu 660
665 670Asp Ala Ala Arg Val Val Ala Leu Arg Ser Gln
Leu Val Arg Arg Ile 675 680 685Ala
Gly Gly Gly Ala Met Ala Val Ile Glu Arg Pro Val Gly Glu Val 690
695 700Glu Gln Arg Leu Ser Arg Phe Gly Gly Gln
Leu Ser Val Ala Ala Val705 710 715
720Asn Thr Pro Gly Ser Thr Val Val Ser Gly Asp Ala Ala Ala Val
Asp 725 730 735Arg Leu Leu
Ala Glu Leu Glu Thr Ala Arg Val Phe Ala Arg Arg Ile 740
745 750Lys Val Asp Tyr Ala Ser His Ser Ala His
Val Asp Ala Ile Leu Pro 755 760
765Glu Leu Glu Ala Cys Leu Ala Ser Val Glu Pro Arg Thr Cys Ala Ile 770
775 780Pro Leu Tyr Ser Thr Val Thr Gly
Glu Val Leu Ala Gly Pro Glu Leu785 790
795 800Gly Ala Thr Tyr Trp Cys Arg Asn Leu Arg Glu Pro
Val Arg Leu Asp 805 810
815Arg Ala Leu Ser Arg Leu Leu Ala Asp Gly His Gly Val Phe Val Glu
820 825 830Val Ser Ala His Pro Val
Leu Ala Met Pro Leu Ser Ala Ala Ser Ala 835 840
845Glu Arg Gly Gly Val Val Val Gly Ser Leu Gln Arg Asp Asp
Gly Gly 850 855 860Leu Gly Arg Leu Thr
Ser Met Leu Gly Ala Leu His Val His Gly His865 870
875 880Ala Val Ser Trp Gln Arg Val Leu Ala Pro
Tyr Gly Gly Ala Leu Val 885 890
895Gly Leu Pro Thr Tyr Ala Phe Gln Arg Gln Arg His Trp Leu Glu Ala
900 905 910Pro Arg Tyr Ala Ala
Glu Asp Thr Asp Gly Ala Ala Arg Arg Asp Pro 915
920 925Leu Tyr Arg Val Thr Trp Ile Glu Ala Ala Leu Glu
Glu Ala Pro Trp 930 935 940Ala Pro Glu
Arg His Val Val Leu Gly Gly Gly Gly Ala Leu Ala Ala945
950 955 960Gly Leu Gly Ala Leu Ala Leu
Ala Gly Leu Pro Glu Leu Leu Glu Ala 965
970 975Leu Glu Asn Arg Ala Ala Ala Pro Glu Arg Leu Val
Leu Asp Leu Thr 980 985 990Glu
Gly Arg Pro Gly Ala Val Ala Glu Ser Val His Ala Thr Thr Arg 995
1000 1005Asp Ala Leu Ala Leu Val Gln Ala
Trp Leu Ala Ala Pro Arg Leu 1010 1015
1020Ser Gly Thr Glu Leu Val Val Val Thr Arg Glu Ala Val Ala Ala
1025 1030 1035Gly Pro Asp Glu Gly Val
Ala Ala Leu Gly Pro Ala Ala Val Trp 1040 1045
1050Gly Leu Leu Arg Thr Ala Arg Val Glu His Pro Glu Arg Ala
Val 1055 1060 1065Arg Ala Val Asp Leu
Gly Arg Glu Pro Leu Asp Val Ala Val Leu 1070 1075
1080Arg Arg Ala Leu Gly Ala Val Ala Glu Pro Glu Leu Ala
Leu Arg 1085 1090 1095Ala Gly Gly Ala
Arg Ala Ala Arg Leu Arg Ala Val Asp Ala Gly 1100
1105 1110Ala Gly Ala Arg Glu Pro Ala Ala Ala Leu Asp
Pro Gln Gly Thr 1115 1120 1125Val Trp
Ile Thr Gly Gly Thr Gly Glu Leu Gly Arg Gln Ile Ala 1130
1135 1140Arg His Leu Val Ala Ala His Gly Val Arg
His Leu Leu Leu Thr 1145 1150 1155Ser
Arg Arg Gly Ala Ala Ala Pro Asp Ala Glu Ala Leu Val Glu 1160
1165 1170Gln Leu Arg Ala Asp Gly Ala Glu Thr
Val Glu Val Val Ala Cys 1175 1180
1185Asp Val Thr Asp Gly Ala Ala Leu Ser Ala Ala Val Gln Ala Ala
1190 1195 1200Ala Ala Arg His Pro Leu
Thr Ala Val Val His Thr Ala Gly Glu 1205 1210
1215Leu Ala Asp Gly Val Leu Thr Gly Leu Thr Ala Glu Gln Leu
Ala 1220 1225 1230Arg Val Leu Ala Pro
Lys Val Asp Gly Ala Cys His Val Tyr Ala 1235 1240
1245Ala Ala Gln Asp Gln Pro Leu Ala Ala Phe Val Leu Phe
Ser Ser 1250 1255 1260Ile Val Gly Thr
Leu Gly Asn Ala Gly Gln Ala Asn Tyr Gly Ala 1265
1270 1275Ala Asn Ala Phe Leu Asp Ala Phe Ala Ala Gln
Leu Arg Ala Arg 1280 1285 1290Gly Val
Pro Ala Thr Ser Leu Ala Trp Gly Phe Trp Glu Gln Ala 1295
1300 1305Gly Leu Gly Met Thr Ser His Leu Gly Ala
Ala Asp Leu Ala Arg 1310 1315 1320Leu
Arg Arg Gln Gly Leu Ala Pro Leu Ser Val Ala Gln Gly Leu 1325
1330 1335Arg Leu Leu Asp Arg Ala Leu Ala Arg
Ala Glu Ala Thr Leu Val 1340 1345
1350Pro Ala Ala Leu Asp Leu Pro Ala Leu Gln Arg Ala Ala Ser Asp
1355 1360 1365Ala Gly Arg Val Pro Pro
Leu Leu Arg Gly Leu Val Arg Thr Ser 1370 1375
1380Pro Gly Arg Pro Thr Ala Thr Ala Thr Pro Glu Ala Gly Pro
Ala 1385 1390 1395Ala Ser Ala Leu Arg
Ala Arg Leu Ser Ala Leu Pro Glu Ala Glu 1400 1405
1410Arg Pro Gly Ala Leu Leu Asp Leu Val Arg Thr Glu Val
Ala Val 1415 1420 1425Val Leu Gln Leu
Ala Gly Pro Ala Gln Val Pro Ala Asp Lys Pro 1430
1435 1440Leu Lys Glu Leu Gly Leu Asp Ser Leu Thr Ala
Val Glu Leu Arg 1445 1450 1455Asn Arg
Leu Gly Ala Arg Ala Glu Thr Val Leu Pro Thr Thr Leu 1460
1465 1470Ala Phe Asp His Pro Thr Pro Arg Ala Ile
Ala Asp Leu Leu Leu 1475 1480 1485Gln
Arg Ala Phe Ser Glu Leu Ala Ala Ala Lys Ala Thr Arg Ala 1490
1495 1500Arg Gly Ala His Asp Glu Pro Ile Ala
Ile Val Ser Met Ala Cys 1505 1510
1515Arg Leu Pro Gly Ser Val Asp Thr Pro Ala Ala Leu Trp Lys Leu
1520 1525 1530Leu Ala Glu Gly Arg Asp
Ala Ile Gly Pro Phe Pro Glu Gly Arg 1535 1540
1545Gly Trp Asp Val Ala Gly Leu Tyr Asp Pro Asp Pro Asp Val
Pro 1550 1555 1560Gly Lys Ser Ile Thr
Thr Gln Gly Gly Phe Leu Tyr Asp Ala Asp 1565 1570
1575Arg Phe Asp Pro Thr Phe Phe Gly Ile Ser Pro Arg Glu
Ala Glu 1580 1585 1590Arg Met Asp Pro
Gln Gln Arg Leu Leu Leu Glu Cys Ala Trp Glu 1595
1600 1605Ala Leu Glu Arg Ala Gly Leu Ala Pro His Ala
Leu Glu Ala Ser 1610 1615 1620Ala Thr
Gly Val Phe Val Gly Leu Ala His Gly Asp Tyr Gly Gly 1625
1630 1635Arg Leu Leu Gln Gln Leu Glu Ser Phe Asp
Gly His Val Leu Thr 1640 1645 1650Gly
Asn Phe Leu Ser Val Gly Ser Gly Arg Ile Ala Tyr Thr Leu 1655
1660 1665Gly Leu Arg Gly Pro Ala Met Thr Val
Asp Thr Ala Cys Ser Ser 1670 1675
1680Ser Leu Val Ala Val His Leu Ala Cys Met Ser Leu Arg Ala Gly
1685 1690 1695Glu Cys Asp Met Ala Leu
Ala Gly Gly Ala Thr Val Met Ala Thr 1700 1705
1710Pro Met Ile Phe Val Glu Phe Ser Arg Gln Arg Gly Thr Ala
Leu 1715 1720 1725Asp Gly Arg Cys Lys
Ala Phe Gly Ala Gly Ala Asp Gly Ala Gly 1730 1735
1740Trp Ser Glu Gly Cys Gly Ile Leu Ala Leu Lys Arg Leu
Ser Asp 1745 1750 1755Ala Gln Arg Asp
Gly Asp Arg Val Leu Ala Val Ile Arg Gly Ser 1760
1765 1770Ala Val Asn Gln Asp Gly Arg Ser Gln Gly Leu
Thr Ala Pro Asn 1775 1780 1785Gly Pro
Ala Gln Gln Asp Val Ile Arg Gln Ala Leu Ala Ala Ala 1790
1795 1800Gly Leu Thr Pro Ala Asp Val Asp Ala Val
Glu Ala His Gly Thr 1805 1810 1815Gly
Thr Arg Leu Gly Asp Pro Ile Glu Ala Gln Ala Leu Leu Ala 1820
1825 1830Thr Tyr Gly Ala Ala His Thr Ala Glu
Arg Pro Leu Trp Leu Gly 1835 1840
1845Ser Leu Lys Ser Asn Leu Gly His Thr Gln Val Ala Ala Gly Val
1850 1855 1860Ser Gly Leu Met Lys Leu
Val Leu Ala Leu Gln His Ala Glu Leu 1865 1870
1875Pro Arg Thr Leu His Ala Asp Pro Pro Ser Pro His Val Asp
Trp 1880 1885 1890Ser Gln Gly His Val
Lys Leu Leu Asn Glu Pro Val Pro Trp Pro 1895 1900
1905Arg Thr Asp Arg Pro Arg Arg Ala Ala Val Ser Ser Phe
Gly Ile 1910 1915 1920Ser Gly Thr Asn
Ala His Val Ile Val Glu Glu Ala Pro Ala Glu 1925
1930 1935Ala Pro Ala Thr Ala Ala Asp Ala Lys Ser Val
Glu Ala Leu Pro 1940 1945 1950Ile Leu
Pro Leu Leu Val Ser Gly Ser Asp Glu Pro Ala Leu Arg 1955
1960 1965Ala Gln Val Arg Arg Leu Val Glu His Leu
Arg Ser His Pro Asp 1970 1975 1980Glu
Arg Leu Leu Asp Val Ala Ala Ser Leu Ala Thr Thr Arg Ala 1985
1990 1995His Leu Ala Met Arg Leu Ala Leu Pro
Val Ser Ala Gly Ala Pro 2000 2005
2010Arg Asp Ala Trp Val Asp Glu Leu Glu Ala Phe Ala Arg Gly Gly
2015 2020 2025Ala Ala Pro Thr Gln Ala
Ser Gln Thr Pro Ala Glu Ser Ser Ala 2030 2035
2040Gly Lys Val Ala Val Leu Phe Thr Gly Gln Gly Ser Gln Arg
Ala 2045 2050 2055Ala Met Gly Arg Ala
Leu Tyr Ala Thr His Pro Val Phe Arg Ala 2060 2065
2070Ala Leu Asp Ala Ala Cys Ala Glu Leu Asp Arg His Leu
Asp Arg 2075 2080 2085Pro Leu His Ser
Val Leu Phe Ala Asp Ala Gly Thr Glu Ala Ala 2090
2095 2100Ala Leu Leu Asp Gln Thr Gly Trp Ala Gln Pro
Ala Leu Phe Ala 2105 2110 2115Leu Glu
Val Ala Leu Tyr Arg Gln Trp Glu Ala Trp Gly Leu Arg 2120
2125 2130Pro Glu Leu Leu Leu Gly His Ser Ile Gly
Glu Leu Ala Ala Ala 2135 2140 2145His
Val Ala Gly Val Leu Asp Leu Pro Asp Ala Ser Ala Leu Val 2150
2155 2160Ala Ala Arg Gly Arg Leu Met Gln Ala
Leu Pro His Gly Gly Ala 2165 2170
2175Met Ala Ser Ile Glu Ala Thr Glu His Glu Leu Leu Pro Leu Leu
2180 2185 2190Asp Gln His Thr Gly Arg
Leu Ser Leu Ala Ala Leu Asn Ala Pro 2195 2200
2205Arg Gln Ser Val Val Ser Gly Asp Leu His Ala Val Asp Gln
Val 2210 2215 2220Cys Ala His Phe Ile
Ala Leu Gly Arg Arg Ala Lys Arg Leu Asp 2225 2230
2235Val Ser His Ala Phe His Ser Ala His Met Gln Pro Met
Leu Asp 2240 2245 2250Ala Phe Ala Ser
Val Ala Arg Gly Leu Thr Phe His Pro Pro Arg 2255
2260 2265Leu Pro Ile Val Ser Ser Val Thr Gly Ala Arg
Ala Thr Thr Asp 2270 2275 2280Gln Leu
Thr Ser Pro Asp Tyr Trp Val Gln Gln Val Arg Glu Pro 2285
2290 2295Val Arg Phe Leu Asp Ala Met Arg Ser Leu
His Ala Ala Gly Ala 2300 2305 2310Ala
Thr Phe Val Glu Cys Gly Pro His Gly Val Leu Thr Ala Ala 2315
2320 2325Gly Ala Glu Cys Leu Ala Pro Glu Gly
Ala Arg Asp Ala Gly Phe 2330 2335
2340Val Thr Ser Leu Arg Lys Asp Arg Asp Glu Ala Leu Ala Leu Val
2345 2350 2355His Ala Ala Cys Ala Val
His Val Arg Gly His Ala Leu Asp Trp 2360 2365
2370Leu Arg Phe Phe Asp Ala Thr Gly Ala Arg Arg Val Glu Leu
Pro 2375 2380 2385Thr Tyr Ala Phe Gln
Arg Gln Arg Tyr Trp Leu Glu Ala Pro Arg 2390 2395
2400Pro Arg Pro Ser Leu Glu Gly Val Gly Leu Thr Ala Ala
Asn His 2405 2410 2415Pro Trp Leu Gly
Ala Ala Val Arg Leu Ala Asp Arg Asp Gly Tyr 2420
2425 2430Val Leu Ser Gly Arg Leu Ser Thr Ile Asp His
Pro Trp Val Leu 2435 2440 2445Asp His
Val Val Leu Gly Thr Ala Leu Leu Pro Gly Thr Gly Phe 2450
2455 2460Val Glu Leu Ala Trp Ala Ala Ala Glu Ala
Val Gly Leu Pro Gly 2465 2470 2475Val
Ser Glu Leu Ala Ile Glu Ala Pro Leu Ala Leu Pro Ala Arg 2480
2485 2490Gly Ala Val Ala Leu Gln Ile Ala Ile
Glu Ala Pro Asp Pro Ala 2495 2500
2505Gly Arg Arg Gly Val Ala Ile Tyr Ser Arg Pro Asp Gly Ala Ala
2510 2515 2520Asp Ala Pro Trp Thr Ala
His Ala Arg Gly Val Leu Gly Ala Ala 2525 2530
2535Ala Pro Asp Arg Asp Ala Ala Trp Ala Gln Gly Ala Trp Pro
Pro 2540 2545 2550Pro Gly Ala Val Pro
Val Asp Val Thr Gln Arg Ile Glu Ile Val 2555 2560
2565Asp Ala Trp Val Gly Pro Ala Phe Arg Gly Val Thr Ala
Leu Trp 2570 2575 2580Arg Val Gly Arg
Thr Ile Tyr Ala Asp Val Ala Leu Pro Asp Gly 2585
2590 2595Val Ala Ser Thr Ala Gln Asp Phe Gly Leu His
Pro Ala Leu Leu 2600 2605 2610Asp Val
Ala Leu Arg Ala Phe Leu Arg Ala Glu Leu Gly Ala Asp 2615
2620 2625Pro Ser Pro Arg Glu Gly Thr Val Val Pro
Phe Ala Trp Ser Asp 2630 2635 2640Val
Val Leu Glu Ala Arg Gly Thr Ala Ala Leu Arg Val Arg Val 2645
2650 2655Glu Val Ala Ala Asp Gly Asp Gly Asp
Ala Ile Thr Ala Ser Ile 2660 2665
2670Gln Leu Ala Asp Gly Gln Gly Arg Pro Val Ala Arg Val Gly Ala
2675 2680 2685Leu Gln Met Arg Trp Thr
Thr Ala Glu Arg Val Arg Ala Ala Ala 2690 2695
2700Gly Ala Ala Glu Arg Asp Leu Tyr Arg Val Ala Trp Thr Asp
Val 2705 2710 2715Ala Leu Asp Asp Ala
Ala Phe Ala Pro Glu Glu His Val Val Val 2720 2725
2730Gly Gly Asp Gly Ala Leu Ala Ala Ala Leu Gly Ala Arg
Val Val 2735 2740 2745Ala Gly Leu Pro
Glu Leu Leu Ala Ser Leu Pro Asp Gly Ala Ala 2750
2755 2760Ala Pro Arg Arg Leu Val Val Asp Leu Thr Ala
Asp Ala Ala Gly 2765 2770 2775Ala Val
Val Asp Ala Val His Ala Ala Ala Arg Asp Ala Leu Ser 2780
2785 2790Leu Val Gln Gly Trp Leu Ala Ala Pro Gln
Leu Ala Ala Thr Glu 2795 2800 2805Leu
Val Val Val Thr Arg Gly Ala Val Ala Val Ala Pro Asp Glu 2810
2815 2820Gly Val Ala Ala Leu Gly Pro Ala Ala
Val Trp Gly Leu Leu Arg 2825 2830
2835Ala Thr Arg Val Glu His Ala Asp Arg Thr Val Arg Val Leu Asp
2840 2845 2850Leu Gly Ser Ala Ala Pro
Asp Met Thr Leu Leu Arg Arg Ala Leu 2855 2860
2865Thr Ala Ala Glu Glu Pro Glu Leu Ala Leu Arg Ala Gly Gly
Ala 2870 2875 2880Arg Ala Pro Arg Leu
Asp Ala Ala Ser Glu Thr Glu Gly Glu Leu 2885 2890
2895Ala Pro Pro Gly Gly Ala Arg Ser Leu Arg Leu Ser Ile
Arg Thr 2900 2905 2910Lys Gly Ser Phe
Asp Ala Leu His Leu Ala Asp Ala Pro Asp Ala 2915
2920 2925Leu Arg Pro Leu Gly Pro Gly Gln Val Arg Leu
Ala Val Arg Ala 2930 2935 2940Thr Gly
Leu Asn Phe Arg Asp Val Leu Asn Val Leu Gly Thr Tyr 2945
2950 2955Arg Gly Glu Ala Gly Pro Leu Gly Leu Glu
Gly Ala Gly Val Val 2960 2965 2970Leu
Asp Val Gly Glu Gly Val Thr Ala Leu Arg Pro Gly Asp Arg 2975
2980 2985Val Met Gly Met Leu His Ala Gly Met
Ala Thr His Ala Val Val 2990 2995
3000Asp Ala Arg Leu Leu Thr His Ile Pro Arg Gly Leu Ser Phe Val
3005 3010 3015Glu Ala Ala Thr Ile Pro
Ala Ala Phe Leu Thr Ala Leu Tyr Gly 3020 3025
3030Leu Arg Asp Leu Gly Ala Leu Lys Ala Gly Gln Arg Val Leu
Val 3035 3040 3045His Ala Ala Ala Gly
Gly Val Gly Met Ala Ala Val Gln Leu Ala 3050 3055
3060Arg Leu Trp Gly Ala Glu Val Phe Ala Thr Ala Ser Glu
Gly Lys 3065 3070 3075Trp Pro Ala Leu
Arg Arg Met Gly Ile Asp Gln Ala His Ile Ala 3080
3085 3090Ser Ser Arg Thr Leu His Phe Arg Lys Ala Phe
Leu Asp Ala Thr 3095 3100 3105Gln Gly
Gln Gly Val Asp Val Val Leu Asp Ala Leu Ala Gly Glu 3110
3115 3120Phe Val Asp Ala Ser Leu Asp Leu Leu Pro
Arg Gly Gly Ala Phe 3125 3130 3135Val
Glu Met Gly Lys Ser Asp Val Arg Asp Pro Glu Arg Val Ala 3140
3145 3150Lys Asp His Pro Arg Val Arg Tyr Thr
Ala Phe Asp Leu Leu Asp 3155 3160
3165Ala Gly Pro Asp His Ile Gln Ala Met Leu Arg Glu Leu Val Pro
3170 3175 3180Leu Phe Glu Glu Gly Val
Leu Ala Pro Leu Pro Ser Val Ala Tyr 3185 3190
3195Asp Leu Arg Arg Ala Pro His Ala Phe Arg Ser Met Ala Asn
Ala 3200 3205 3210Arg His Ile Gly Lys
Leu Val Leu Val Pro Pro Ala Thr Leu Asp 3215 3220
3225Pro Asp Gly Thr Ala Leu Ile Thr Gly Gly Thr Gly Glu
Leu Gly 3230 3235 3240Arg Gln Ile Ala
Arg His Leu Val Ala Ala His Gly Val Arg His 3245
3250 3255Leu Val Leu Thr Ser Arg Arg Gly Met Asp Ala
Pro Asp Ala Ala 3260 3265 3270Ala Leu
Val Glu Ser Leu Arg Ala Ala Gly Ala Ala Thr Val Glu 3275
3280 3285Val Ala Ala Cys Asp Val Thr Asp Arg Asp
Ala Leu Ala Ala Ile 3290 3295 3300Val
Gln Ala Ile Pro Ala Ala Arg Pro Leu Thr Ala Val Val His 3305
3310 3315Thr Ala Ala Val Leu Asp Asp Gly Thr
Val Ala Gly Leu Ser Ala 3320 3325
3330Glu Gln Leu Ala Arg Val Leu Arg Pro Lys Val Asp Gly Ala Trp
3335 3340 3345Gln Leu Tyr Glu Ala Thr
Arg Asp Ala Pro Leu Ala Ala Phe Met 3350 3355
3360Leu Phe Ser Ser Val Ala Gly Thr Leu Gly Ser Ser Gly Gln
Ala 3365 3370 3375Asn Tyr Ala Ala Ala
Asn Ala Phe Leu Asp Gly Leu Ala Ala Glu 3380 3385
3390Leu Arg Ala Arg Gly Val Pro Ala Met Ser Leu Ala Trp
Gly Phe 3395 3400 3405Trp Glu Gln Gly
Gly Ile Gly Met Thr Ala His Leu Gly Ala Ala 3410
3415 3420Asp Leu Ala Arg Leu Lys Arg Gln Gly Ile Val
Pro Met Thr Val 3425 3430 3435Ala His
Gly Leu Arg Leu Leu Asp Arg Ala Leu Glu Arg Pro Asp 3440
3445 3450Ala Ala Leu Val Pro Ala Ser Leu Asp Met
Ala Val Ile Gln Arg 3455 3460 3465Thr
Ala Ser Asp His Arg Gln Val Pro Pro Met Leu Arg Gly Leu 3470
3475 3480Val Arg Val Ala Pro Arg Gln Ala Ala
Gly Ala Ala Ser Gly Arg 3485 3490
3495Ser His Glu Ala Ser Thr Leu Arg Gln Gln Leu Ala Ala Leu Pro
3500 3505 3510Glu Pro Glu Arg Gln Arg
Ala Leu Leu Asp Leu Val Arg Thr Glu 3515 3520
3525Ala Ala Ala Val Leu Val Leu Arg Gly Pro Asp Ala Val Pro
Ala 3530 3535 3540Asp Lys Pro Leu Arg
Glu Leu Gly Leu Asp Ser Leu Thr Ala Val 3545 3550
3555Glu Leu Arg Asn Arg Leu Arg Thr Arg Ala Gln Thr Asp
Leu Pro 3560 3565 3570Ser Thr Leu Ala
Phe Asp Tyr Pro Thr Pro Lys Ala Val Ala Val 3575
3580 3585Tyr Leu Ala Gln Glu Leu Asp Leu His Asp Val
Met Thr Glu Met 3590 3595 3600Arg Gly
Pro Ser Leu Arg Ser Asp Asp Glu Leu Lys Ser Ala Ile 3605
3610 3615Ala Ser Ile Arg Ile Ser Thr Leu Arg Gln
Ala Gly Leu Leu Asp 3620 3625 3630Ser
Leu Leu Arg Leu Ala Ala Ser Glu Ala Val Ser Thr Ser Ser 3635
3640 3645Asp Thr Thr Pro Glu Thr Asp Glu Leu
Thr Leu Gln His Val Gly 3650 3655
3660Asp Asp Glu Leu Ala Arg Leu Val Phe Asp Leu Ala Gly Gly Ala
3665 3670 3675Gln253654PRTSorangium
cellulosum 25Met Lys Glu Glu Ile Ser Ala Arg Gln Ala Leu Glu Lys Ser Phe
Ile1 5 10 15Glu Leu Arg
Arg Ile Lys Arg Glu Leu Asp Gln Leu Lys Ala Lys Ser 20
25 30Ser Glu Pro Ile Ala Ile Val Ser Met Ala
Cys Arg Leu Pro Gly Gly 35 40
45Val Asp Thr Pro Ala Ala Leu Trp Gln Leu Leu Ser Glu Gly Arg Asp 50
55 60Ala Ile Gly Pro Phe Pro Glu Gly Arg
Glu Trp Asp Val Ala Gly Leu65 70 75
80Tyr Asp Pro Asp Pro Asp Ala Pro Gly Lys Ser Ile Thr Ala
Gln Gly 85 90 95Gly Phe
Leu Tyr Asp Ala Asp Arg Phe Asp Pro Ala Phe Phe Ala Ile 100
105 110Ser Pro Arg Glu Ala Glu Arg Met Asp
Pro Gln Gln Arg Leu Leu Leu 115 120
125Glu Cys Ala Trp Glu Ala Leu Glu Arg Ala Gly Leu Ala Pro His Ala
130 135 140Leu Glu Ala Ser Ala Thr Gly
Val Phe Val Gly Leu Ser Val Thr Asp145 150
155 160Tyr Gly Gly Arg Leu Leu His Asp Pro Glu Ala Leu
Asp Gly Tyr Ile 165 170
175Ala Thr Gly Thr Leu Pro Ser Val Gly Ser Gly Arg Ile Ala Tyr Thr
180 185 190Leu Gly Leu Arg Gly Pro
Ala Met Thr Val Asp Thr Ala Cys Ser Ser 195 200
205Ser Leu Val Ser Leu His Leu Ala Cys Met Ser Leu Arg Ala
Gly Glu 210 215 220Cys Asp Met Ala Leu
Ala Gly Gly Ala Thr Val Met Ala Thr Pro Met225 230
235 240Ala Phe Ile Glu Phe Ser Arg Gln Arg Gly
Thr Ala Leu Asp Gly Arg 245 250
255Cys Lys Ala Phe Gly Ala Gly Ala Asp Gly Ala Gly Trp Ser Glu Gly
260 265 270Cys Gly Ile Leu Ala
Leu Lys Arg Leu Ser Asp Ala Gln Arg Asp Gly 275
280 285Asp Arg Val Leu Ala Val Ile Arg Gly Ser Ala Val
Asn Gln Asp Gly 290 295 300Arg Ser Gln
Gly Leu Thr Ala Pro Asn Gly Pro Ala Gln Gln Asp Val305
310 315 320Ile Arg Gln Ala Leu Ala Ala
Ala Gly Leu Thr Pro Ala Asp Val Asp 325
330 335Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly
Asp Pro Ile Glu 340 345 350Ala
Gln Ala Leu Leu Ala Thr Tyr Gly Ala Ala His Thr Ala Glu Arg 355
360 365Pro Leu Trp Leu Gly Ser Leu Lys Ser
Asn Leu Gly His Thr Gln Ala 370 375
380Ala Ala Gly Val Ser Gly Leu Met Lys Leu Val Leu Ala Leu Gln His385
390 395 400Ala Glu Leu Pro
Arg Thr Leu His Ala Asp Pro Pro Ser Pro His Val 405
410 415Asp Trp Ser Arg Gly His Val Lys Leu Leu
Asn Glu Pro Val Pro Trp 420 425
430Pro Arg Thr Asp Arg Pro Arg Arg Ala Ala Val Ser Ser Phe Gly Phe
435 440 445Ser Gly Thr Asn Ala His Ile
Ile Ile Glu Glu Ala Pro Ala Ala Ser 450 455
460Ala Glu Ala Thr Ser Arg Gly Glu Lys Thr Ser Ala Ala Ala Pro
Pro465 470 475 480Ser Met
Met Pro Leu Leu Val Ser Gly Val Asp Glu Ala Ala Leu Arg
485 490 495Ala Gln Ala Gly Arg Trp Ala
Ala Trp Ile Glu Ala His Pro Glu Ala 500 505
510Gly Trp Ala Asp Val Val Tyr Thr Ala Ala Ala Arg Arg Thr
His Leu 515 520 525Gly Ala Arg Ala
Ala Leu Thr Ala Ala Asp Ala Ala Gly Ala Val Ala 530
535 540Ala Leu Thr Ala Leu Ser Gln Gly Gln Pro His Ala
Ala Leu Ala Val545 550 555
560Gly Glu Ala Arg Ala Arg Gly Lys Val Ala Phe Val Phe Pro Gly Gln
565 570 575Gly Ser Gln Trp Pro
Ala Met Gly Arg Ala Leu Leu Ser Gln Ser Glu 580
585 590Val Phe Ala Ala Ala Val Thr Ala Cys Asp Ala Ala
Leu Arg Pro Phe 595 600 605Thr Gly
Trp Ser Val Leu Ser Val Leu Arg Gly Asp Ser Gly Ala Glu 610
615 620Val Pro Pro Leu Glu Arg Val Asp Val Val Gln
Pro Ala Leu Phe Ala625 630 635
640Met Ala Val Gly Leu Ala Ala Val Trp Arg Ala Trp Gly Leu Glu Pro
645 650 655Ser Ala Val Val
Gly His Ser Gln Gly Glu Val Pro Ala Ala Tyr Val 660
665 670Ala Gly Ala Leu Ser Leu Glu Asp Ala Ala Arg
Ile Val Ala Leu Arg 675 680 685Ser
Gln Leu Val Arg Arg Leu Ser Gly Ala Gly Ala Met Ala Val Ile 690
695 700Glu Arg Pro Val Gly Glu Val Glu Gln Arg
Leu Ser Arg Phe Gly Gly705 710 715
720Ala Leu Ser Val Ala Ala Val Asn Thr Pro Arg Ser Thr Val Val
Ser 725 730 735Gly Asp Ile
Glu Ala Val Asp Arg Leu Leu Ala Glu Phe Glu Gly Glu 740
745 750Gln Val Phe Ala Arg Lys Val Asn Val Asp
Tyr Ala Ser His Ser Arg 755 760
765His Ile Asp Gly Leu Leu Pro Glu Leu Glu Asn Gly Leu Gly Ala Val 770
775 780Arg Pro Arg Ala Ser Thr Ile Pro
Phe Tyr Ser Thr Val Thr Gly Thr785 790
795 800Val Leu Thr Gly Ala Glu Leu Asp Ala Ala Tyr Trp
Cys Arg Asn Leu 805 810
815Arg Glu Pro Val Arg Leu Asp Arg Ala Leu Ser Trp Leu Leu Asp Asp
820 825 830Gly His Gly Leu Phe Val
Glu Val Ser Ala His Pro Val Leu Thr Leu 835 840
845Pro Leu Thr Gly Ala Ser Ala Ala Ser Gly Gly Val Val Val
Gly Ser 850 855 860Leu Gln Arg Asp Asp
Gly Gly Leu Gly Arg Leu Leu Gly Val Leu Ala865 870
875 880Ala Leu His Val His Gly His Asp Val Asp
Trp Arg Ala Val Leu Ala 885 890
895Pro Trp Gly Gly Gly Val Ala Asp Leu Pro Thr Tyr Ala Phe Gln Arg
900 905 910Gln Arg Tyr Trp Leu
Glu Ala Pro Arg Gly Arg Ala Gly Leu Glu Ser 915
920 925Gly Gly Leu Leu Ala Val Asn His Pro Trp Leu Ser
Ala Ala Val Arg 930 935 940Leu Ala Asp
Arg Asp Gly Tyr Val Leu Ser Gly Arg Leu Ser Thr Val945
950 955 960Glu His Ala Trp Val Leu Asp
His Val Val Leu Gly Thr Val Ile Leu 965
970 975Pro Gly Thr Ala Phe Val Glu Leu Ala Leu Ala Ala
Ala Asp Ala Val 980 985 990Gly
Leu Pro Ser Val Ser Glu Leu Thr Ile Glu Ala Pro Leu Ala Leu 995
1000 1005Pro Ala Arg Gly Ala Val Ala Leu
Gln Val Thr Val Glu Ala Pro 1010 1015
1020Asp Ala Thr Gly Arg Arg Gly Phe Ala Val Tyr Ser Arg Pro Asp
1025 1030 1035Gly Ala His Asp Ala Pro
Trp Thr Ala His Ala Arg Gly Val Leu 1040 1045
1050Gly Ala Ala Pro Ala Ala Ala Thr Thr Ala Trp Ala Ala Gly
Ala 1055 1060 1065Trp Pro Pro Ala Gly
Ala Glu Pro Val Asp Val Thr Arg Trp Val 1070 1075
1080Glu Ala Leu Asp Ala Trp Val Gly Pro Ala Phe Arg Gly
Val Thr 1085 1090 1095Ala Ala Trp Arg
Val Gly Arg Ser Ile Tyr Ala Asp Leu Ala Leu 1100
1105 1110Pro Glu Gly Val Ser Glu Arg Ala Gln Asp Phe
Gly Leu His Pro 1115 1120 1125Ala Leu
Leu Asp Ala Ala Leu Gln Ala Leu Leu Arg Ala Glu Leu 1130
1135 1140Gly Ala Gly Ala Ser Pro Arg Glu Gly Ile
Pro Met Pro Phe Ala 1145 1150 1155Trp
Ser Asp Val Ala Leu Glu Ala Arg Gly Ala Ala Ala Leu Arg 1160
1165 1170Ala Arg Val Glu Val Glu Asp Ala Ser
Asp Gly Asp Gln Leu Ala 1175 1180
1185Ala Ser Ile Glu Leu Ala Asp Ala Gln Gly Gln Pro Val Ala Arg
1190 1195 1200Ala Gly Thr Phe Arg Ala
Arg Trp Ala Thr Ala Glu His Val Arg 1205 1210
1215Met Ala Ala Ala Gly Ser Ser Glu Arg Asp Leu Tyr Arg Val
Thr 1220 1225 1230Trp Ala Asp Val Val
Leu Glu Glu Ala Ala Trp Ala Pro Glu Glu 1235 1240
1245His Val Val Leu Gly Gly Asp Gly Ala Leu Ala Ala Ala
Leu Gly 1250 1255 1260Ala Arg Thr Ala
Ala Leu Pro Glu Leu Ile Ala Ala Leu Pro Glu 1265
1270 1275Gly Ala Ala Ala Pro Arg Arg Leu Val Ile Asp
Ala Ala Ala Gly 1280 1285 1290Asp Pro
Gly Asp Gly Leu Val Ala Ala Ala His Ala Ala Ala Gln 1295
1300 1305Arg Val Leu Ser Leu Val Gln Gly Trp Leu
Ser Glu Ala Arg Leu 1310 1315 1320Ala
Asp Ser Glu Leu Val Val Val Thr Arg Gly Ala Val Ala Ala 1325
1330 1335Gly Pro Asp Asp Gly Val Ala Ala Leu
Ser His Ala Pro Leu Trp 1340 1345
1350Gly Leu Val Arg Thr Ala Arg Gln Glu Asn Pro Gly Arg Ala Val
1355 1360 1365Arg Leu Val Asp Leu Gly
Pro Glu Pro Leu Asp Gly Ala Leu Leu 1370 1375
1380Arg Arg Val Val Ala Ala Ala Glu Glu Pro Glu Leu Ala Leu
Arg 1385 1390 1395Gly Gly Ala Ala Arg
Ala Pro Arg Leu Arg Glu Val Arg Ala Gly 1400 1405
1410Ala Ala Asp Ala Ala Arg Pro Thr Arg Leu Asp Pro Gly
Gly Thr 1415 1420 1425Val Leu Ile Thr
Gly Gly Thr Gly Glu Leu Gly Arg Gln Val Ala 1430
1435 1440Arg His Leu Val Ala Ser His Gly Val Arg His
Leu Val Leu Thr 1445 1450 1455Ser Arg
Arg Gly Met Gly Ala Pro Asp Ala Ala Ala Leu Val Asp 1460
1465 1470Glu Leu Arg Ala Ala Gly Ala Ala Thr Val
Asp Val Ala Ala Cys 1475 1480 1485Asp
Val Ala Asp Gly Ala Ala Leu Gly Ala Val Ile Ala Ala Ile 1490
1495 1500Pro Ala Ala His Pro Leu Thr Ala Val
Val His Met Ala Gly Val 1505 1510
1515Leu Asp Asp Val Ile Val Thr Lys Leu Ser Ala Glu Gln Leu Thr
1520 1525 1530Arg Val Leu Arg Pro Lys
Ile Asp Gly Gly Trp His Leu Ala Ala 1535 1540
1545Ala Thr Arg Gly His Arg Leu Ala Ala Phe Val Leu Phe Ser
Ser 1550 1555 1560Ala Ala Gly Thr Leu
Gly Ser Pro Gly Gln Ala Asn Tyr Ala Ala 1565 1570
1575Ala Asn Thr Phe Leu Asp Ala Leu Ala Ala Gln Leu Arg
Ala Arg 1580 1585 1590Gly Val Pro Ala
Met Ser Leu Ala Trp Gly Phe Trp Glu Gln Ala 1595
1600 1605Gly Leu Gly Met Thr Ala His Leu Gly Ala Ala
Asp Leu Ala Arg 1610 1615 1620Leu Arg
Arg Gln Gly Ile Ala Pro Ile Ala Leu Ala Gln Gly Met 1625
1630 1635Gln Leu Leu Asp Arg Ala Leu Ala Arg Pro
Glu Ala Ala Leu Val 1640 1645 1650Pro
Ala Ala Leu Asp Leu Pro Ala Leu Gln Arg Ala Ala Ser Asp 1655
1660 1665Ala Gly Gln Val Pro Ala Leu Leu Arg
Gly Leu Val Arg Pro Ala 1670 1675
1680Val Gly Arg Arg Ala Ala Ala Pro Ala Ala Ala Ala Thr Gly Ala
1685 1690 1695Ala Ala Leu Arg Ala Arg
Leu Ala Pro Leu Pro Glu Ala Glu Arg 1700 1705
1710His Asp Val Val Leu Asp Leu Val Arg Ala Glu Ala Ala Ala
Val 1715 1720 1725Leu Gln Leu Ala Gly
Pro Ala Gln Val Pro Ala Asp Lys Pro Leu 1730 1735
1740Lys Glu Leu Gly Leu Thr Ser Leu Thr Ala Val Glu Leu
Arg Asn 1745 1750 1755Arg Leu Gly Ala
Arg Ala Glu Thr Ala Leu Pro Ala Thr Leu Ala 1760
1765 1770Phe Asp His Pro Thr Pro Arg Ala Ile Ala Gly
Leu Leu Leu Gln 1775 1780 1785Arg Ala
Phe Ser Glu Leu Ala Ala Ala Val Ala Thr Arg Ala Gln 1790
1795 1800Ala Pro Arg Ala Gln Gly Ala His Asp Glu
Pro Ile Ala Ile Val 1805 1810 1815Ser
Met Ala Cys Arg Leu Pro Gly Gly Val Asp Thr Pro Ala Arg 1820
1825 1830Met Trp Gln Leu Leu Ala Glu Gly Arg
Asp Ala Ile Gly Pro Phe 1835 1840
1845Pro Glu Gly Arg Gly Trp Asp Val Ala Gly Leu Tyr Asp Pro Asp
1850 1855 1860Pro Asp Ala Pro Gly Lys
Ser Val Thr Asn Leu Gly Gly Phe Leu 1865 1870
1875Tyr Asp Ala Asp His Phe Asp Pro Thr Phe Phe Gly Ile Ser
Pro 1880 1885 1890Arg Glu Ala Glu Arg
Ile Asp Pro Gln Gln Arg Leu Leu Leu Glu 1895 1900
1905Cys Ala Trp Glu Ala Leu Glu Arg Ala Gly Leu Ala Pro
His Thr 1910 1915 1920Leu Glu Ala Ser
Ala Thr Gly Val Phe Val Gly Leu Val Tyr Ser 1925
1930 1935Asp Tyr Gly Gly Arg Leu Leu Glu His Leu Glu
Ser Phe Asp Gly 1940 1945 1950Tyr Ile
Ala Thr Gly Ser Phe Pro Ser Val Gly Ser Gly Arg Ile 1955
1960 1965Ala Tyr Thr Leu Gly Leu Arg Gly Pro Ala
Met Thr Val Asp Thr 1970 1975 1980Ala
Cys Ser Ser Ser Leu Val Ser Leu His Leu Ala Cys Met Ser 1985
1990 1995Leu Arg Ala Gly Glu Cys Asp Met Ala
Leu Ala Gly Gly Ala Thr 2000 2005
2010Val Met Ala Thr Pro Met Ala Phe Ile Glu Phe Ser Arg Gln Arg
2015 2020 2025Gly Met Ala Pro Asp Ala
Arg Cys Lys Ala Phe Gly Ala Glu Ala 2030 2035
2040Asn Gly Ile Gly Pro Ala Glu Gly Cys Gly Ile Leu Val Leu
Lys 2045 2050 2055Arg Leu Ser Asp Ala
Arg Arg Asp Gly Asp Arg Val Leu Ala Val 2060 2065
2070Ile Arg Gly Ser Ala Val Asn Gln Asp Gly Arg Ser Gln
Gly Leu 2075 2080 2085Thr Ala Pro Asn
Gly Pro Ala Gln Gln Asp Val Ile Arg Gln Ala 2090
2095 2100Leu Ala Ala Ala Gly Leu Thr Pro Ala Asp Val
Asp Ala Val Glu 2105 2110 2115Ala His
Gly Thr Gly Thr Arg Leu Gly Asp Pro Ile Glu Ala Gln 2120
2125 2130Ala Leu Leu Ala Thr Tyr Gly Thr Ala His
Thr Ala Glu Arg Pro 2135 2140 2145Leu
Trp Leu Gly Ser Ile Lys Ser Asn Leu Gly His Thr Gln Ala 2150
2155 2160Ala Ala Gly Val Val Gly Leu Met Lys
Leu Val Leu Ala Met Gln 2165 2170
2175His Ala Glu Leu Pro Arg Thr Leu Tyr Ala Glu Pro Arg Ser Pro
2180 2185 2190His Ile Asp Trp Ser Gln
Gly His Ile Asn Leu Leu Asn Glu Pro 2195 2200
2205Val Pro Trp Pro Arg Thr Asp Arg Pro Arg Arg Ala Ala Val
Ser 2210 2215 2220Ser Phe Gly Ile Ser
Gly Thr Asn Ala His Val Ile Ile Glu Glu 2225 2230
2235Ala Pro Ala Glu Ala Pro Ala Thr Ala Ala Asp Ala Lys
Ser Val 2240 2245 2250Glu Ala Leu Pro
Ile Leu Pro Leu Leu Leu Ser Gly Arg Asp Glu 2255
2260 2265Pro Ala Leu Arg Ala Gln Ala Gly Arg Leu Ala
Glu His Leu Arg 2270 2275 2280Ala His
Pro Gly Glu Arg Leu Leu Asp Ile Ala Ala Gly Leu Ala 2285
2290 2295Thr Thr Arg Thr His Leu Ala Thr Arg Leu
Ala Leu Pro Val Ala 2300 2305 2310Ala
Asp Ala Ala Ala Glu Glu Leu Gly Ala Arg Leu Ala Gln Phe 2315
2320 2325Ala Ala Gly Gly Pro Ala Pro Ser Gly
Ala Ala Val Thr Ala Pro 2330 2335
2340Gly Gln Pro Pro Gly Lys Val Ala Val Leu Phe Thr Gly Gln Gly
2345 2350 2355Ser Gln Arg Ala Gly Met
Gly Arg Ala Leu Tyr Ala Thr His Pro 2360 2365
2370Val Phe Arg Ala Ala Leu Asp Ala Ala Cys Ala Glu Leu Asp
Arg 2375 2380 2385His Leu Asp Arg Pro
Leu His Ser Val Leu Phe Ala Asp Ala Gly 2390 2395
2400Thr Glu Ala Ala Ala Leu Leu Asp Gln Thr Gly Trp Ala
Gln Pro 2405 2410 2415Ala Leu Phe Ala
Leu Glu Val Ala Leu Tyr Arg Gln Trp Glu Ala 2420
2425 2430Trp Gly Leu Arg Pro Glu Leu Leu Leu Gly His
Ser Ile Gly Glu 2435 2440 2445Leu Ala
Ala Ala His Val Ala Gly Val Leu Asp Leu Pro Asp Ala 2450
2455 2460Ser Ala Leu Val Ala Ala Arg Gly Arg Leu
Met Gln Ala Leu Pro 2465 2470 2475His
Gly Gly Ala Met Ala Ser Ile Glu Ala Thr Glu His Glu Leu 2480
2485 2490Leu Pro Leu Leu Asp Gln His Thr Gly
Arg Leu Ser Leu Ala Ala 2495 2500
2505Leu Asn Ala Pro Arg Gln Ser Val Val Ser Gly Asp Gln Pro Ala
2510 2515 2520Val Asp His Val Cys Ala
His Phe Ile Ala Leu Gly Arg Arg Ala 2525 2530
2535Lys Arg Leu Asp Val Ser His Ala Phe His Ser Ala His Met
Gln 2540 2545 2550Pro Met Leu Asp Ala
Phe Ala Ser Val Ala Arg Gly Leu Thr Phe 2555 2560
2565His Pro Pro Arg Leu Pro Ile Val Ser Ser Val Thr Gly
Ala Arg 2570 2575 2580Ala Thr Thr Asp
Gln Leu Thr Ser Pro Asp Tyr Trp Val Gln Gln 2585
2590 2595Val Arg Glu Pro Val Arg Phe Leu Asp Ala Met
Arg Ser Leu His 2600 2605 2610Ala Ala
Gly Ala Ala Thr Phe Val Glu Cys Gly Pro His Gly Val 2615
2620 2625Leu Thr Ala Ala Gly Ala Glu Cys Leu Ala
Pro Glu Gly Ala Arg 2630 2635 2640Asp
Ala Gly Phe Val Thr Ser Leu Arg Lys Asp Arg Asp Glu Ala 2645
2650 2655Leu Ala Leu Val His Ala Ala Cys Ala
Val His Val Arg Gly His 2660 2665
2670Ala Leu Asp Trp Leu Arg Phe Phe Asp Ala Thr Gly Ala Arg Arg
2675 2680 2685Val Glu Leu Pro Thr Tyr
Ala Phe Gln Arg Gln Arg Tyr Trp Leu 2690 2695
2700Glu Ala Pro Arg Pro Arg Pro Ser Leu Glu Gly Val Gly Leu
Thr 2705 2710 2715Ala Ala Asn His Pro
Trp Leu Gly Ala Ala Val Arg Leu Ala Asp 2720 2725
2730Arg Asp Gly Tyr Val Leu Ser Gly Arg Leu Ser Thr Ile
Asp His 2735 2740 2745Pro Trp Val Leu
Asp His Val Val Ala Gly Thr Val Ile Leu Pro 2750
2755 2760Gly Thr Ala Phe Val Glu Leu Ala Trp Ala Ala
Ala Glu Val Val 2765 2770 2775Gly Ala
Ala Ala Val Ser Glu Val Thr Phe Thr Thr Pro Leu Val 2780
2785 2790Leu Pro Pro Arg Ser Val Val Glu Leu Gln
Val Arg Ile Gly Glu 2795 2800 2805Pro
Asp Ala Ser Gly Arg Arg Thr Phe Ala Ala Tyr Ser Arg Ala 2810
2815 2820Asp Ala Ala Ile Glu Ala Glu Trp Thr
Gln His Ala Thr Gly Val 2825 2830
2835Leu Ser Ala Gln Ala Ala Ala Gly Ala Asp Val Ala Asp Leu Ser
2840 2845 2850Val Trp Pro Pro Pro Gly
Ala Glu Val Val Ala Leu Asp Gly Gly 2855 2860
2865Tyr Ala Trp Leu Ala Ala Gln Gly Tyr Gly Tyr Gly Pro Ala
Phe 2870 2875 2880Gln Ala Leu Arg Glu
Val Trp Arg Ala Gly Thr Thr Leu Tyr Ala 2885 2890
2895Arg Val Ala Leu Pro Asp Ala Val Ala Asp Thr Ala Arg
Gly Phe 2900 2905 2910Gly Ile His Pro
Ala Leu Leu Asp Ala Val Leu His Ser Leu Leu 2915
2920 2925Ala Pro Ser Ala Gln Glu Glu Ala Ser Asp Asp
Asp Lys Val Leu 2930 2935 2940Leu Ala
Phe Ala Phe Ser Asp Val Val Ile Glu Ala Arg Gly Ala 2945
2950 2955Ala Glu Val Arg Val Arg Leu Asn Lys Gln
Ala Gly Asp Asp Gly 2960 2965 2970Glu
Gly Val Thr Ala Ser Ile His Leu Ala Asp Ala Gln Gly Arg 2975
2980 2985Pro Val Ala Arg Val Gly Ala Phe Gln
Ala Arg Ala Thr Thr Thr 2990 2995
3000Glu Arg Val Arg Ala Leu Ala Gly Ala Ser Glu Arg Asp Leu His
3005 3010 3015Arg Val Thr Trp Thr Asp
Val Thr Leu Glu Glu Thr Pro Trp Ala 3020 3025
3030His Glu Asp Ser Val Val Val Gly Gly Asp Gly Ala Leu Ala
Ala 3035 3040 3045Ala Leu Gly Val Arg
Ala Val Ala Gly Leu Pro Glu Leu Leu Ala 3050 3055
3060Gly Gly Ala Ala Ala Pro Arg Arg Leu Val Ile Asp Ala
Thr Ala 3065 3070 3075Gly Asp Pro Gly
Asp Gly Leu Val Ala Ala Thr His Ala Ala Thr 3080
3085 3090Gln Arg Gly Leu Ala Leu Leu Gln Gly Trp Leu
Ser Glu Ala Arg 3095 3100 3105Leu Ala
Ala Thr Glu Leu Val Leu Val Thr Arg Gly Ala Ala Ala 3110
3115 3120Ala Glu Pro Asp Glu Gly Val Ala Ala Leu
Ser His Ala Pro Leu 3125 3130 3135Trp
Gly Leu Val Arg Ala Ala Arg Glu Glu His Pro Ala Arg Ala 3140
3145 3150Leu Arg Leu Val Asp Leu Gly Arg Glu
Ala Pro Asp Gly Ala Ile 3155 3160
3165Leu Arg Arg Ala Ile Ala Ala Asp Asp Glu Pro Glu Leu Val Val
3170 3175 3180Arg Arg Gly Ala Leu Arg
Ala Ala Arg Leu Ser Leu Ala His Ala 3185 3190
3195Gly Pro Asp Thr Ala Gly Gln Ala Thr Arg Leu Ala Pro Gly
Gly 3200 3205 3210Thr Val Leu Ile Thr
Gly Gly Thr Gly Glu Leu Gly Arg Gln Val 3215 3220
3225Ala Arg His Leu Val Ala Ala His Gly Val Arg His Leu
Val Leu 3230 3235 3240Thr Ser Arg Arg
Gly Met Asp Ala Pro Asp Ala Ala Ala Leu Val 3245
3250 3255Glu Ser Leu Arg Ala Ala Gly Ala Ala Thr Val
Glu Ile Ala Ala 3260 3265 3270Cys Asp
Val Ala Asp Gly His Ala Leu Ala Ala Val Leu Arg Thr 3275
3280 3285Ile Pro Ala Glu His Pro Leu Thr Ala Val
Val His Thr Ala Gly 3290 3295 3300Val
Leu Glu Asp Gly Val Val Thr Gly Leu Ser Ala Glu Gln Leu 3305
3310 3315Ala Arg Val Leu Arg Pro Lys Val Asp
Gly Ala Trp Gln Leu Tyr 3320 3325
3330Glu Ala Thr Lys Asp Ala Pro Leu Ala Ala Phe Met Leu Phe Ser
3335 3340 3345Ser Ala Ala Gly Thr Leu
Gly Ser Ala Gly Gln Ala Asn Tyr Ala 3350 3355
3360Ala Ala Asn Ala Phe Leu Asp Ala Leu Ala Ala Glu Leu Arg
Ala 3365 3370 3375Arg Gly Val Pro Ala
Met Ser Leu Ala Trp Gly Phe Trp Glu Gln 3380 3385
3390Gly Gly Ile Gly Met Thr Ala His Leu Gly Ala Ala Asp
Met Ala 3395 3400 3405Arg Val Lys Arg
Gln Gly Ile Val Pro Met Thr Val Ala His Gly 3410
3415 3420Leu Arg Leu Leu Asp Arg Ala Leu Glu Arg Pro
Glu Ala Thr Leu 3425 3430 3435Val Pro
Leu Ser Leu Asp Val Ala Ala Leu Gln Arg Ala Ala Ser 3440
3445 3450Asp Ala Gly Arg Val Pro Ala Leu Leu Arg
Gly Leu Val Arg Pro 3455 3460 3465Ala
Ala Ala Arg Arg Thr Ala Ala Pro Ala Ala Ala Ala Thr Gly 3470
3475 3480Leu Arg Ala Arg Leu Leu Pro Leu Ser
Glu Ala Glu Arg Gln Asp 3485 3490
3495Val Leu Leu Asp Leu Val Arg Thr Glu Ile Ala Asp Ile Leu Ala
3500 3505 3510Leu Ser Gly Pro Ala Ala
Val Pro Pro Asp Gln Pro Ile Arg Glu 3515 3520
3525Leu Gly Leu Asp Ser Leu Thr Ala Val Asp Val Arg Ser Arg
Leu 3530 3535 3540Val Gln Arg Ser Glu
Ile Asp Leu Ala Val Thr Leu Ala Tyr Asp 3545 3550
3555Tyr Pro Thr Ala Arg Ala Ile Ala Gly His Leu Ser Glu
Gln Met 3560 3565 3570Gly Leu Glu Gly
Ala Pro Glu Asp Arg Glu Ser Ala Leu Asp Glu 3575
3580 3585Ser Gln Ile Arg Ala Leu Leu Met Gln Ile Pro
Ile Pro Thr Leu 3590 3595 3600Arg Gln
Ser Gly Leu Leu Gly Asp Leu Val Arg Leu Ala Ser Pro 3605
3610 3615Gln Ala Pro Pro Arg Glu Glu Gly Glu Ser
Glu Thr Leu Ser Phe 3620 3625 3630Asp
His Leu Gly Asn Glu Glu Phe Leu Ser Leu Ala Ser Lys Leu 3635
3640 3645Ile Ala Glu Glu Gly Ser
3650261880PRTSorangium cellulosum 26Met Asn Gln Glu Thr Val Leu Arg Gln
Thr Leu Glu Lys Ser Leu His1 5 10
15Lys Ile Gln His Leu Asn Arg Glu Leu Glu Arg Leu Lys Ala Lys
Ser 20 25 30Ser Glu Pro Ile
Ala Ile Val Ser Met Ala Cys Arg Tyr Pro Gly Gly 35
40 45Val Asp Gly Pro Ala Arg Leu Trp Glu Leu Leu Ser
Glu Gly Arg Asp 50 55 60Ala Ile Gly
Pro Phe Pro Glu Gly Arg Gly Trp Asp Val Ala Gly Leu65 70
75 80Tyr Asp Pro Asp Pro Asp Ala Pro
Gly Lys Ser Val Thr Thr Gln Gly 85 90
95Gly Phe Leu Tyr Asp Ala Asp Arg Phe Asp Pro Thr Phe Phe
Gly Ile 100 105 110Ser Pro Arg
Glu Ala Glu Arg Met Asp Pro Gln Gln Arg Leu Leu Leu 115
120 125Glu Cys Ala Trp Glu Ala Leu Glu Arg Ala Gly
Val Ala Pro His Thr 130 135 140Leu Glu
Ala Ser Ala Thr Gly Val Phe Val Gly Leu Val Tyr Ser Asp145
150 155 160Tyr Gly Gly Arg Leu Leu Glu
His Leu Glu Val Phe Asp Gly Tyr Val 165
170 175Ala Thr Gly Ser Phe Pro Ser Val Gly Ser Gly Arg
Ile Ala Tyr Thr 180 185 190Leu
Gly Leu Arg Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser 195
200 205Ser Leu Val Ser Leu His Leu Ala Cys
Met Ser Leu Arg Ala Gly Glu 210 215
220Cys Asp Met Ala Leu Ala Gly Gly Ala Thr Val Met Ala Thr Pro Met225
230 235 240Ala Phe Ile Glu
Phe Ser Arg Gln Arg Gly Met Ala Pro Asp Ala Arg 245
250 255Cys Lys Ala Phe Gly Ala Ala Ala Asn Gly
Ile Gly Pro Ala Glu Gly 260 265
270Cys Gly Ile Leu Val Leu Lys Arg Leu Ser Asp Ala Arg Arg Asp Gly
275 280 285Asp Arg Val Leu Ala Val Ile
Arg Gly Ser Ala Val Asn Gln Asp Gly 290 295
300Arg Ser Gln Gly Leu Thr Ala Pro Asn Gly Pro Ala Gln Gln Asp
Val305 310 315 320Ile Arg
Gln Ala Leu Ala Ala Ala Gly Leu Thr Pro Ala Asp Val Asp
325 330 335Ala Val Glu Ala His Gly Thr
Gly Thr Pro Leu Gly Asp Pro Ile Glu 340 345
350Ala Gln Ala Leu Leu Ala Thr Tyr Gly Lys Thr His Thr Ala
Glu Arg 355 360 365Pro Leu Trp Leu
Gly Ser Ile Lys Ser Asn Phe Gly His Thr Gln Ala 370
375 380Ala Ala Gly Val Ala Gly Ile Ile Lys Leu Val Leu
Ala Met Gln His385 390 395
400Ala Glu Leu Pro Arg Thr Leu Tyr Ala Glu Pro Arg Ser Pro His Val
405 410 415Asp Trp Ser Gln Gly
His Val Lys Leu Leu Asn Glu Pro Val Pro Trp 420
425 430Pro Arg Thr Asp Arg Pro Arg Arg Ala Ala Val Ser
Ser Phe Gly Val 435 440 445Ser Gly
Thr Asn Ala His Val Ile Leu Glu Glu Ala Pro Ala Glu Ala 450
455 460Pro Ala Ala Ala Gln Thr Ala Ala Gly Val Pro
Ser Thr Leu Pro Leu465 470 475
480Leu Leu Ser Gly Arg Asp Glu Pro Ala Leu Arg Ala Gln Ala Gly Arg
485 490 495Leu Ala Glu His
Leu Arg Ala His Pro Asp Glu Arg Leu Leu Asp Ile 500
505 510Ala Ala Gly Leu Ala Thr Thr Arg Thr His Leu
Ala Thr Arg Leu Ala 515 520 525Leu
Pro Val Ala Ala Asp Ala Ala Ala Glu Glu Leu Ser Ala Arg Leu 530
535 540Ala Gln Phe Ala Ala Gly Gly Pro Ala Pro
Ser Gly Ala Ala Val Thr545 550 555
560Ala Pro Gly Gln Pro Pro Gly Lys Val Ala Val Leu Phe Thr Gly
Gln 565 570 575Gly Ser Gln
Arg Ala Ala Met Gly Arg Ala Leu Tyr Ala Thr His Pro 580
585 590Val Phe Arg Ala Ala Leu Asp Ala Ala Cys
Ala Glu Leu Asp Arg His 595 600
605Leu Asp Arg Pro Leu His Ser Val Leu Phe Ala Asp Ala Gly Thr Glu 610
615 620Ala Ala Ala Leu Leu Asp Gln Thr
Gly Trp Ala Gln Pro Ala Leu Phe625 630
635 640Ala Leu Glu Val Ala Leu Tyr Arg Gln Trp Glu Ala
Trp Gly Leu Arg 645 650
655Ala His Ala Leu Leu Gly His Ser Leu Gly Glu Ile Val Ala Ala His
660 665 670Ile Ala Gly Val Leu Asp
Leu Pro Asp Ala Ser Ala Leu Val Ala Ala 675 680
685Arg Gly Arg Leu Met Gln Ala Leu Pro His Gly Gly Ala Met
Ala Ser 690 695 700Ile Glu Ala Thr Glu
His Glu Leu Leu Pro Leu Leu Asp Gln His Thr705 710
715 720Gly Arg Leu Ser Leu Ala Ala Leu Asn Ala
Pro Arg Gln Ser Val Val 725 730
735Ser Gly Asp Gln Pro Ala Val Asp His Val Cys Ala His Phe Lys Ala
740 745 750Leu Gly Arg Arg Ala
Lys Arg Leu Asp Val Ser His Ala Phe His Ser 755
760 765Ala Arg Met Glu Pro Met Leu Asp Ala Phe Ala Arg
Val Ala Arg Gly 770 775 780Leu Thr Tyr
Arg Ala Pro Arg Leu Pro Val Val Ser Asn Val Thr Gly785
790 795 800Arg Met Ala Thr Ala Asp Glu
Leu Thr Ser Pro Asp Tyr Trp Val Arg 805
810 815His Val Arg Glu Pro Val Arg Phe Val Ala Gly Val
Arg Ala Leu His 820 825 830Ala
Thr Gly Val Ala Thr Tyr Leu Glu Cys Gly Pro Asp Pro Val Leu 835
840 845Gly Gly Met Ala Ala Asp Cys Leu Thr
Ser Asp Glu Ser Arg Asp Pro 850 855
860Gly Leu Ile Pro Ser Leu Arg Lys Asp Arg Asp Glu Ala Leu Ala Ile865
870 875 880Ala Gln Ala Ala
Cys Ala Leu His Val Arg Gly His Ala Leu Asp Trp 885
890 895Pro Arg Leu Phe Asp Ala Thr Gly Ala Arg
Arg Val Glu Leu Pro Thr 900 905
910Tyr Ala Phe Gln Arg Gln Arg Tyr Trp Ile Asp Ala Pro Arg Arg Ala
915 920 925Ala Gly Leu Glu Ser Val Gly
Leu Thr Ala Ala Asp His Pro Trp Leu 930 935
940Gly Ala Ala Val Arg Leu Ala Asp Arg Asp Val Tyr Val Leu Ser
Gly945 950 955 960Arg Leu
Ser Thr Val Asp His Pro Trp Ile Leu Asp His Val Val Thr
965 970 975Gly Thr Ala Leu Met Pro Gly
Thr Gly Phe Val Glu Leu Ala Trp Ala 980 985
990Thr Ala Gln Ala Val Asn Ala Ala Ala Ile Ala Glu Leu Thr
Leu Thr 995 1000 1005Thr Pro Leu
Val Leu Pro Ala Arg Gly Ala Val Gln Leu Gln Val 1010
1015 1020Thr Val Asp Glu Ala Asp Ala Asp Gly Arg Arg
Ala Phe Ala Ile 1025 1030 1035His Ser
Arg Pro His Gly Pro Val Asp Leu Glu Trp Thr Gln His 1040
1045 1050Ala Thr Gly Val Leu Ser Ala Glu Ala Pro
Ala Gly Ala Asp Glu 1055 1060 1065Ala
Ala Gly Leu Ser Glu Trp Pro Pro Pro Gly Ala Glu Ala Val 1070
1075 1080Ala Leu Asp Gly Gly Tyr Glu Gln Leu
Ser Glu His Gly Tyr Gly 1085 1090
1095His Gly Pro Ala Phe Gln Gly Leu Arg Gly Leu Trp Arg Ala Asp
1100 1105 1110Gln Thr Leu Tyr Ala His
Val Ala Leu Pro Asp Ala Val Ala Gly 1115 1120
1125Thr Glu Gln Gly Phe Gly Leu His Pro Ala Leu Phe Asp Ala
Ala 1130 1135 1140Leu Gln Ser Leu Ala
Arg Leu Ser Arg Glu Glu Ala Ala Ala Gly 1145 1150
1155Asp Pro Val Leu Val Pro Phe Ala Trp Thr Asp Val Ala
Leu Tyr 1160 1165 1170Ala Ala Gly Ala
Thr Glu Leu Arg Ala Arg Ile Ala Leu Glu Gln 1175
1180 1185Ala Glu Gly Gly Ala Pro Ala Val Ala Ser Leu
Leu Leu Ala Asp 1190 1195 1200Ala His
Gly Arg Thr Val Ala Thr Thr Gly Arg Val Arg Gly Ala 1205
1210 1215Ser Ala Ala Gln Thr Arg Ser Ala Ala Ser
Arg Ala Glu Pro Met 1220 1225 1230Tyr
Arg Val Ala Trp Thr Asp Val Ala Leu Glu Ala Ala Ala Trp 1235
1240 1245Ala Pro Glu Glu His Val Val Leu Gly
Gly Asp Gly Ala Leu Ala 1250 1255
1260Ser Ala Leu Gly Val Arg Ala Ala Ala Gly Leu Pro Glu Leu Leu
1265 1270 1275Glu Ala Leu Ala Asp Gly
Ala Ala Ala Pro Arg Arg Leu Val Val 1280 1285
1290Asp Leu Thr Ala Gly Asp Ala Gly Ala Val Val Ala Ala Val
His 1295 1300 1305Ala Ala Ala Arg Gly
Ala Leu Ala Leu Val Gln Gly Trp Leu Ala 1310 1315
1320Ala Pro Gln Leu Thr Ala Thr Glu Leu Leu Val Val Thr
Arg Cys 1325 1330 1335Ala Val Ala Thr
Gly Pro Asp Glu Gly Val Asp Ala Leu Gly Pro 1340
1345 1350Ala Ala Val Trp Gly Leu Leu Arg Ala Thr Arg
Ala Glu His Pro 1355 1360 1365Asp Arg
Ala Val Arg Val Leu Asp Leu Gly Arg Glu Pro Leu Asp 1370
1375 1380Gly Ala Leu Leu Arg Arg Ala Leu Ala Ala
Val Ala Glu Pro Glu 1385 1390 1395Leu
Ser Leu Arg Arg Gly Glu Ala Arg Ala Pro Arg Leu Arg Glu 1400
1405 1410Ala Lys Pro Ala Ala Ala Pro Ala Thr
Arg Leu Asp Pro Glu Gly 1415 1420
1425Thr Val Leu Val Thr Gly Gly Thr Gly Glu Leu Gly Arg Gln Val
1430 1435 1440Ala Arg His Leu Val Ala
Ala His Gly Val Arg His Leu Val Leu 1445 1450
1455Thr Ser Arg Arg Gly Met Asp Ala Pro Asp Ala Ala Ala Leu
Val 1460 1465 1470Glu Glu Leu Arg Ala
Ala Gly Ala Ala Thr Val Asp Val Ala Ala 1475 1480
1485Cys Asp Val Ala Ala Gly Pro Ala Leu Ala Ala Val Val
Glu Ala 1490 1495 1500Ile Pro Ala Ala
His Pro Leu Thr Ala Val Val His Met Ala Gly 1505
1510 1515Val Leu Asp Asp Gly Ile Val Thr Lys Leu Ser
Ala Glu Gln Leu 1520 1525 1530Thr Arg
Val Leu Arg Pro Lys Val Asp Gly Ala Ile His Leu His 1535
1540 1545Glu Leu Thr Lys His Ala Pro Leu Ala Ala
Phe Val Met Phe Ser 1550 1555 1560Ser
Ala Ala Gly Thr Leu Gly Ser Pro Gly Gln Ala Asn Tyr Thr 1565
1570 1575Ala Ala Asn Val Phe Leu Asp Ala Leu
Ala Ala Arg Leu Arg Ala 1580 1585
1590Arg Gly Val Pro Ala Met Ser Leu Ala Trp Gly Phe Trp Glu Gln
1595 1600 1605Gly Gly Ile Gly Met Thr
Ala His Leu Gly Ala Ala Asp Arg Ala 1610 1615
1620Arg Met Lys Arg His Gly Val Val Ala Met Ser Val Ala Gln
Gly 1625 1630 1635Leu Arg Leu Leu Asp
Arg Ala Leu Ala His Pro Glu Ala Ala Leu 1640 1645
1650Val Pro Leu Ala Leu Asp Leu Ser Ser Leu His Ala Gly
Ala Ser 1655 1660 1665Gly Ala Gly Pro
Val Pro Pro Leu Leu Arg Gly Leu Val Arg Ala 1670
1675 1680Pro Ala Gly Arg Arg Thr Ala Ala Ser Ala Ala
Arg Thr Asn Gly 1685 1690 1695Lys Gly
Thr Ala Leu Ala Ala Leu Arg Ala Arg Leu Leu Pro Leu 1700
1705 1710Pro Gln Ala Glu Arg Glu Asp Leu Leu Leu
Glu Leu Val Cys Thr 1715 1720 1725Glu
Val Ala Glu Val Leu Gln Leu Pro Gly Pro Ala His Val Pro 1730
1735 1740Ala Asp Gln Pro Leu Arg Asp Leu Gly
Leu Asp Ser Leu Met Thr 1745 1750
1755Val Glu Leu Arg Asn Arg Leu Gly Ala Arg Ala Glu Thr Thr Leu
1760 1765 1770Pro Thr Thr Leu Ala Phe
Asp Tyr Pro Thr Pro Arg Ala Leu Ala 1775 1780
1785Ser Tyr Leu Glu Thr Leu Leu Gly Ile Ser Asp Glu Asn Gly
His 1790 1795 1800Ser Gly Glu Leu Leu
His Val Pro Gln Asn Glu Asp Glu Ile Arg 1805 1810
1815Ser Ala Ile Ala Arg Ile Pro Ile Ala Thr Leu Arg Glu
Ala Gly 1820 1825 1830Leu Leu Gln Ser
Leu Leu Arg Leu Ala Pro Gly Lys Ala Val Ala 1835
1840 1845Gly Asp Val Thr His Pro Val Asp Glu Leu Leu
Val Glu His Ile 1850 1855 1860Glu Asp
Glu Glu Leu Leu Arg Leu Ala Phe Glu Ala Thr Gly Gly 1865
1870 1875Ile Lys 1880272869PRTSorangium
cellulosum 27Met Lys Asp Glu Ala Leu Ser Phe Arg Arg Ala Leu Glu Lys Thr
Val1 5 10 15Val Glu Ile
Arg Arg Leu Asn Arg Glu Ile Asp Asp Leu Arg Ala Lys 20
25 30Ser Ser Glu Pro Ile Ala Ile Val Ser Met
Ala Cys Arg Phe Pro Gly 35 40
45Gly Val Glu Asn Pro Glu Ala Leu Trp Arg Leu Val Ser Glu Gly Gln 50
55 60Asp Ala Ile Gly Pro Phe Pro Glu Gly
Arg Gly Trp Asp Val Ala Gly65 70 75
80Leu Tyr Asp Pro Asp Pro Asp Val Pro Gly Lys Ser Ile Thr
Ala Arg 85 90 95Gly Gly
Phe Leu Tyr Asp Ala Asp Arg Phe Asp Pro Glu Phe Phe Gly 100
105 110Ile Ser Pro Arg Glu Ala Glu Arg Ile
Asp Pro Gln Gln Arg Leu Leu 115 120
125Leu Glu Cys Ala Trp Glu Ala Leu Glu Arg Ala Gly Val Ala Pro His
130 135 140Thr Lys Glu Ala Ser Ala Thr
Gly Val Phe Val Gly Leu Met Tyr Thr145 150
155 160Asp Tyr Gly Leu Arg Leu Leu Asn His Pro Glu Ala
Leu Asp Gly Tyr 165 170
175Ile Gly Ile Gly Ser Thr Gly Ser Thr Gly Ser Gly Arg Ile Ala Tyr
180 185 190Thr Leu Gly Leu Gln Gly
Pro Ala Ile Thr Val Asp Thr Ala Cys Ser 195 200
205Ser Ser Leu Val Ala Leu His Met Ala Cys Ala Ser Leu Arg
Gly Gly 210 215 220Glu Cys Asn Leu Ala
Leu Val Gly Gly Val Ala Val Met Thr Thr Pro225 230
235 240Thr Thr Phe Ile Glu Phe Ser Arg Gln Arg
Gly Leu Ser Leu Asp Gly 245 250
255Arg Cys Lys Ser Phe Gly Ala Glu Ala Glu Gly Val Gly Trp Gly Glu
260 265 270Gly Cys Gly Ile Leu
Ala Leu Lys Arg Leu Ser Asp Ala Arg Arg Asp 275
280 285Gly Asp Arg Val Leu Ala Ile Ile Arg Gly Ser Ala
Val Asn Gln Asp 290 295 300Gly Arg Ser
Gln Gly Phe Thr Ala Pro Asn Gly Pro Ser Gln Arg Ala305
310 315 320Val Ile Gln Arg Ala Leu Ala
Ala Ala Gly Leu Thr Ala Ala Asp Val 325
330 335Asp Ala Val Glu Gly His Gly Thr Gly Thr Arg Leu
Gly Asp Pro Ile 340 345 350Glu
Ala Gln Ala Leu Leu Ala Thr Tyr Gly Lys Ala His Thr Ala Glu 355
360 365Arg Pro Leu Trp Leu Gly Ser Ile Lys
Ser Asn Phe Gly His Thr Gln 370 375
380Ala Ala Ala Gly Val Ala Gly Ile Ile Lys Leu Val Leu Ala Met Gln385
390 395 400His Ala Glu Leu
Pro Arg Thr Leu His Ala Asp Thr Pro Ser Pro His 405
410 415Val Asp Trp Ser Gln Gly His Val Lys Leu
Leu Asn Glu Pro Val Pro 420 425
430Trp Pro Arg Thr Asp Arg Pro Arg Arg Ala Ala Val Ser Ser Phe Gly
435 440 445Ile Ser Gly Thr Asn Ala His
Val Ile Leu Glu Glu Ala Pro Ala Glu 450 455
460Ala Pro Ala Ala Ala Gln Thr Pro Ala Ala Ala Gly Val Pro Ser
Thr465 470 475 480Leu Pro
Leu Leu Leu Ser Gly Arg Asp Glu Pro Ala Leu Arg Ala Gln
485 490 495Ala Gly Arg Leu Ala Glu His
Leu Arg Ala His Pro Gly Glu Arg Leu 500 505
510Leu Asp Ile Ala Ala Gly Leu Ala Thr Thr Arg Thr His Leu
Ala Thr 515 520 525Arg Leu Ala Leu
Pro Val Ala Ala Asp Ala Ala Ala Glu Glu Leu Ser 530
535 540Ala Arg Leu Ala Gln Phe Ala Ala Gly Gly Pro Ala
Pro Ser Gly Ala545 550 555
560Ala Val Thr Ala Pro Gly Gln Pro Pro Gly Lys Val Ala Val Leu Phe
565 570 575Thr Gly Gln Gly Ser
Gln Arg Ala Ala Met Gly Arg Ala Leu Tyr Ala 580
585 590Thr His Pro Val Phe Arg Ala Ala Leu Asp Ala Ala
Cys Ala Glu Leu 595 600 605Asp Arg
His Leu Asp Arg Pro Leu His Ser Val Leu Phe Ala Asp Ala 610
615 620Gly Thr Glu Ala Ala Ala Leu Leu Asp Gln Thr
Gly Trp Ala Gln Pro625 630 635
640Ala Leu Phe Ala Leu Glu Val Ala Leu Tyr Arg Gln Trp Glu Ala Trp
645 650 655Gly Leu Arg Ala
His Ala Leu Leu Gly His Ser Leu Gly Glu Ile Val 660
665 670Ala Ala His Ile Ala Gly Val Phe Asp Leu Pro
Asp Ala Ser Ala Leu 675 680 685Val
Ala Ala Arg Gly Arg Leu Met Gln Ala Leu Pro His Gly Gly Ala 690
695 700Met Ala Ser Ile Glu Ala Thr Glu His Glu
Leu Leu Pro Leu Leu Asp705 710 715
720Gln His Thr Gly Arg Leu Ser Leu Ala Ala Leu Asn Ala Pro Arg
Gln 725 730 735Ser Val Val
Ser Gly Asp Gln Pro Ala Val Asp Gln Val Cys Ala His 740
745 750Phe Lys Ala Leu Gly Arg Arg Ala Lys Arg
Leu Asp Val Ser His Ala 755 760
765Phe His Ser Ala Arg Met Glu Pro Met Leu Asp Ala Phe Ala Arg Val 770
775 780Ala Arg Gly Leu Thr Tyr Arg Ala
Pro Arg Leu Pro Val Val Ser Asn785 790
795 800Val Thr Gly Arg Met Ala Thr Ala Asp Glu Leu Thr
Ser Pro Asp Tyr 805 810
815Trp Val Arg His Val Arg Glu Pro Val Arg Phe Val Ala Gly Val Arg
820 825 830Ala Leu His Ala Thr Gly
Val Ala Thr Tyr Leu Glu Cys Gly Pro Asp 835 840
845Pro Val Leu Gly Gly Met Ala Ala Asp Cys Leu Thr Ser Asp
Glu Ser 850 855 860Arg Asp Pro Gly Leu
Ile Pro Ser Leu Arg Lys Asp Arg Asp Glu Ala865 870
875 880Leu Ala Ile Ala Gln Ala Ala Cys Ala Leu
His Val Arg Gly His Ala 885 890
895Leu Asp Trp Pro Arg Leu Phe Asp Ala Thr Gly Ala Arg Arg Val Glu
900 905 910Leu Pro Thr Tyr Ala
Phe Gln Arg Gln Arg Tyr Trp Leu Glu Thr Pro 915
920 925Gln Thr Pro Gly Ala Asp Gly Ala Ser Asn Leu Ser
Ser Pro Ala Glu 930 935 940Ser Arg Phe
Trp Glu Ala Val Glu Arg Ala Asp Ile Ile Pro Leu Ala945
950 955 960Glu Ala Leu Arg Leu Glu Asp
Glu Ala Gln Arg Ala Ser Leu Ala Thr 965
970 975Leu Leu Pro Ala Leu Ser Thr Trp Arg Arg Arg Arg
His Glu Gln Ser 980 985 990Thr
Ala Asp Ala Trp Arg Tyr Arg Val Ala Trp Lys Pro Leu Ala Ile 995
1000 1005Asp Ala Arg Ser Asp Leu Ser Gly
Val Trp Leu Phe Leu Ala Pro 1010 1015
1020Pro Asp His Ala Lys Asp Asp Leu Ala Arg Ala Val Leu Arg Ala
1025 1030 1035Leu Ala Glu Ser Gly Ala
Thr Val Val Pro Val Leu Val Ala Glu 1040 1045
1050Gly Asp Val Asp Arg Ala Leu Leu Ser Ala Arg Leu Arg Glu
Gln 1055 1060 1065Val Gly Asp Gly Gly
Ala Ile Arg Gly Val Ile Ser Leu Leu Ala 1070 1075
1080Leu Asp Glu Thr Ser Leu Pro Gln His Asp Gly Leu Pro
Arg Gly 1085 1090 1095Leu Ala Phe Thr
Leu Ala Leu Val Gln Ala Leu Gly Asp Thr Ala 1100
1105 1110Ile Ala Ala Pro Leu Trp Leu Leu Thr Arg Gly
Ala Val Ser Val 1115 1120 1125Gly Arg
Ser Asp Arg Leu Glu Arg Pro Leu Gln Ala Leu Thr Trp 1130
1135 1140Gly Leu Gly Arg Val Val Ala Leu Glu His
Pro Glu Arg Trp Gly 1145 1150 1155Gly
Leu Ile Asp Leu Ala Gly Ala Leu Asp Glu Lys Ala Leu Lys 1160
1165 1170Arg Leu Val Ala Ala Leu Gly Gly Arg
Asp Ala Glu Asp Gln Leu 1175 1180
1185Ala Leu Arg Pro Ser Gly Leu Phe Ala Arg Arg Leu Val Arg Ala
1190 1195 1200Pro Leu Gly Glu Ala Thr
Ala Val Arg Ala Trp Lys Ala Arg Gly 1205 1210
1215Thr Ala Leu Val Thr Gly Gly Thr Gly Asp Leu Gly Ala His
Val 1220 1225 1230Ala Arg Trp Leu Ala
Gln Asn Gly Ala Glu His Leu Val Leu Thr 1235 1240
1245Ser Arg Arg Gly Gln Asp Ala Pro Gly Ala Ala Glu Leu
Thr Ala 1250 1255 1260Glu Leu Thr Ala
Leu Gly Ala Arg Val Thr Ile Ala Ala Cys Asp 1265
1270 1275Ser Ser Asp Arg Gln Ala Leu Ala Ala Leu Leu
Gln Arg Leu Arg 1280 1285 1290Ala Glu
Gly Pro Pro Leu Arg Ala Val Val His Ala Ala Gly Val 1295
1300 1305Asp Gln Val Thr Pro Leu Ala Arg Thr Ser
Leu Ala Glu Phe Ala 1310 1315 1320Gly
Ile Ala Ser Gly Lys Val Ala Gly Ala Arg His Leu Asp Asp 1325
1330 1335Leu Leu Gly Asn Ala Pro Leu Asp Ala
Phe Ile Leu Phe Ser Ser 1340 1345
1350Val Ala Gly Val Trp Gly Ser Gly Phe Gln Gly Ala Tyr Ala Ala
1355 1360 1365Ala Asn Ala Phe Leu Asp
Ala Leu Ala Glu Gln Arg Arg Ala Leu 1370 1375
1380Gly Ser Thr Ala Thr Ser Ile Ala Trp Gly Leu Trp Gly Gly
Lys 1385 1390 1395Ser Met Ala Asp Asp
Ala Ala Lys Asp His Leu Ser Lys Arg Gly 1400 1405
1410Val Ser Pro Met Pro Pro Gln Leu Ala Ile Ala Ala Leu
Gln Arg 1415 1420 1425Ala Leu Asp His
Asp Glu Thr Thr Leu Thr Leu Ala Asp Val Asn 1430
1435 1440Trp Ser Arg Phe Ala Pro Ala Phe Ala Ala Ala
Arg Pro Arg Pro 1445 1450 1455Leu Leu
His Asp Leu Pro Glu Ala Arg Ser Ala Leu Glu Ser Pro 1460
1465 1470Ser Pro Ala Pro Arg Glu Ala Glu Leu Leu
Thr Arg Leu Gln Gly 1475 1480 1485Leu
Ser Ser Thr Glu Arg Val Arg His Leu Val Ser Leu Val Leu 1490
1495 1500Ala Glu Thr Ala Val Val Leu Gly His
Pro Asp Ala Ser Arg Leu 1505 1510
1515Asp Pro His Thr Gly Phe Ala Asp Leu Gly Leu Asp Ser Leu Met
1520 1525 1530Ala Val Glu Met Arg Arg
Arg Leu Gln Gln Ala Thr Gly Val Ser 1535 1540
1545Leu Pro Ala Thr Leu Thr Phe Asp His Pro Ser Pro His His
Ile 1550 1555 1560Ala Thr Phe Leu Leu
Asp Glu Val Phe Ala Pro Ala Leu Gly Gln 1565 1570
1575Ala Pro Gly Ala Glu Glu Asp Glu Ala Ile Ala Gln Ala
Gly Leu 1580 1585 1590Ala Ser Gly Asp
Glu Pro Val Ala Leu Ile Gly Val Gly Leu Arg 1595
1600 1605Leu Pro Gly Gly Ala Thr Asp Leu Asp Gly Leu
Trp Arg Leu Leu 1610 1615 1620Glu Gln
Gly Ile Asp Val Val Gly Pro Val Pro Glu Asp Arg Gly 1625
1630 1635Trp Ser Met Asp Glu Leu Tyr Asp Pro Asp
Pro Asp Ser Leu Gly 1640 1645 1650Lys
Ser Tyr Val Arg Glu Ala Ala Phe Leu Asp Arg Ile Asp Leu 1655
1660 1665Phe Asp Ala Gly Phe Phe Gly Ile Ser
Pro Arg Glu Ala Ser His 1670 1675
1680Val Asp Pro Gln His Arg Leu Leu Leu Glu Ala Ala Trp Gln Ala
1685 1690 1695Leu Glu His Ala Gly Ile
Val Pro Ala Ser Leu Gln Asp Ser Gln 1700 1705
1710Thr Gly Val Phe Val Gly Ser Gly Pro Ser Asp Tyr Ala Leu
Leu 1715 1720 1725His Asn Pro Ala Gln
Glu Asp Glu Ala Tyr Arg Leu Thr Gly Thr 1730 1735
1740Gln Pro Ser Phe Ala Pro Gly Arg Leu Ser Phe Ser Leu
Gly Leu 1745 1750 1755Gln Gly Pro Ala
Leu Ser Val Asp Thr Ala Cys Ser Ser Ser Leu 1760
1765 1770Val Ala Leu His Leu Ala Ala Gln Ala Leu Arg
Arg Gly Glu Cys 1775 1780 1785Gly Leu
Ala Leu Val Gly Ser Ala Gln Val Met Ala Ala Pro Asp 1790
1795 1800Ala Phe Val Thr Leu Ser Arg Ala Arg Ala
Ile Ala Pro Asp Gly 1805 1810 1815Arg
Ser Lys Thr Phe Ser Ala Gln Ala Asp Gly Tyr Gly Arg Gly 1820
1825 1830Glu Gly Val Ile Val Phe Val Leu Glu
Arg Leu Ser Asp Ala Arg 1835 1840
1845Ala Arg Gly Arg Asp Val Leu Ala Val Leu Arg Gly Ser Ala Val
1850 1855 1860Asn His Asp Gly Ala Ser
Ser Gly Ile Thr Ala Pro Asn Gly Thr 1865 1870
1875Ser Gln Gln Lys Val Leu Arg Ala Ala Leu His Asp Ala Arg
Leu 1880 1885 1890Thr Pro Ala Asp Val
Asp Val Val Glu Cys His Gly Thr Gly Thr 1895 1900
1905Ser Leu Gly Asp Pro Ile Glu Val Gln Ala Leu Ala Ala
Val Tyr 1910 1915 1920Gly Lys Glu Arg
Ser Ala Asp Arg Pro Leu Met Leu Gly Ala Leu 1925
1930 1935Lys Thr Asn Val Gly His Leu Glu Ala Ala Ser
Gly Leu Ala Gly 1940 1945 1950Val Ala
Lys Val Val Ala Ala Leu Arg His Glu Ala Leu Pro Ala 1955
1960 1965Thr Leu His Thr Ala Ala Arg Asn Pro His
Ile Gln Trp Asp Thr 1970 1975 1980Leu
Pro Val Gln Val Val Asp Thr Leu Arg Pro Trp Pro Arg Arg 1985
1990 1995Glu Asp Gly Thr Pro Arg Arg Ala Gly
Val Ser Ala Phe Gly Leu 2000 2005
2010Ser Gly Thr Asn Ala His Val Leu Leu Glu Glu Ala Pro Pro Val
2015 2020 2025Gln Pro Ser Thr Gln Ala
Glu Gln Pro Ala Ala Pro Pro Trp Leu 2030 2035
2040Pro Leu Leu Leu Ser Gly Lys Thr Asp Ala Ala Leu Arg Ala
Gln 2045 2050 2055Ala Glu Arg Leu Arg
Ala His Leu Asp Ala His Ala Asp Leu Gly 2060 2065
2070Leu Ala Asp Val Ala Tyr Ser Leu Ala Thr Thr Arg Thr
His Phe 2075 2080 2085Ala His Arg Ala
Val Val Val Ala Asp Ala Gly Ala Thr Leu Phe 2090
2095 2100Glu Gly Leu Asp Ala Ile Ala Arg Gly Asn Ala
Ala Ser His Val 2105 2110 2115Val Val
Asp Glu Ala Lys Ile Asp Gly Lys Thr Val Phe Val Phe 2120
2125 2130Pro Gly Gln Gly Ser Gln Trp Ala Gln Met
Ala Gln Pro Leu Leu 2135 2140 2145Glu
Thr Ser Glu Leu Phe Arg Glu Arg Ile Glu Ala Cys Ala His 2150
2155 2160Ala Leu Ala Pro His Val Asp Trp Ser
Leu Leu Ala Val Leu Arg 2165 2170
2175Gly Glu Glu Gly Ala Pro Ser Leu Glu Arg Val Asp Val Val Gln
2180 2185 2190Pro Val Leu Phe Ala Val
Met Val Ser Leu Ala Ala Leu Trp Arg 2195 2200
2205Ser Met Gly Val Glu Pro Asp Ala Val Val Gly His Ser Gln
Gly 2210 2215 2220Glu Ile Ala Ala Ala
Cys Val Ala Gly Ala Leu Ser Leu Ala Asp 2225 2230
2235Ala Ala Lys Val Val Ala Leu Arg Ser Arg Ala Leu Ala
Arg Leu 2240 2245 2250Ala Gly Arg Gly
Ala Met Ala Val Val Glu Leu Pro Ala Ala Glu 2255
2260 2265Leu Ala Glu Arg Met Lys Arg Trp Gly Glu Arg
Leu Ser Ile Ala 2270 2275 2280Ala Leu
Asn Ser Pro Arg Ser Thr Val Ile Ser Gly Asp Pro Asp 2285
2290 2295Ala Val Asp Ala Leu Leu Arg Glu Leu Asp
Ser Ala Glu Ile Phe 2300 2305 2310Ala
Arg Lys Val Arg Val Asp Tyr Ala Ser His Cys Ser His Val 2315
2320 2325Glu Ala Ile Arg His Gln Leu Leu Ala
Glu Leu Ala Gly Ile Glu 2330 2335
2340Pro Leu Pro Ser Thr Leu Pro Leu Tyr Ser Thr Val Ser Gly Asp
2345 2350 2355Lys Leu Asp Gly Val Ala
Leu Asp Ala Ser Tyr Trp Tyr Arg Asn 2360 2365
2370Leu Arg Gln Thr Val Arg Phe Ser Asp Ala Thr Gln Arg Leu
Val 2375 2380 2385Ser Ala Gly His Arg
Phe Phe Val Glu Val Ser Pro His Pro Val 2390 2395
2400Leu Thr Phe Ala Val Gln Asp Val Leu Asp Ala Glu Gly
Val Pro 2405 2410 2415Ala Ala Val Val
Gly Ser Leu Arg Arg Gly Glu Gly Asp Leu Arg 2420
2425 2430Arg Phe Leu Val Ser Leu Ser Glu Leu Phe Thr
Arg Gly Leu Ala 2435 2440 2445Leu Asp
Trp Ser Arg Val Leu Pro Ser Gly Arg Arg Val Ser Leu 2450
2455 2460Pro Thr Tyr Ala Phe Gln Arg Glu Arg Tyr
Trp Leu Gly Ala His 2465 2470 2475Arg
Ala Arg Gly Thr Asp Ala Thr Ser Ala Gly Leu Ala Ser Asp 2480
2485 2490Glu Pro Thr Arg Gly Ala Ser Met Pro
Val Arg Leu Ser Leu Arg 2495 2500
2505Asp Val Pro Pro Glu Glu Arg Gln Gly Ala Leu Glu Arg Phe Val
2510 2515 2520Arg Glu Gln Leu Ala Ala
Val Leu Arg Met Asp Ala Ala Arg Ile 2525 2530
2535Glu Gly Gln Thr Thr Ile Lys Thr Leu Gly Ile Asp Ser Leu
Met 2540 2545 2550Ala Leu Glu Ile Arg
Lys Arg Leu Glu Ala Gly Leu Ala Val Thr 2555 2560
2565Leu Pro Ser Thr Leu Ile Trp Gln Phe Pro His Ala Glu
Gly Leu 2570 2575 2580Ala Arg His Leu
Met Thr Arg Leu Pro Ala Gly Asp Gly Glu Gly 2585
2590 2595Ser Ala Val Val Gln Pro Val Glu Gln Pro Arg
Ala Pro Lys Glu 2600 2605 2610Val Pro
Val Ser Met Asp Pro Ser Ala Trp Val His Arg Pro Arg 2615
2620 2625Pro Arg Ala Asp Ala Arg Val Arg Leu Phe
Cys Leu Pro Tyr Ala 2630 2635 2640Gly
Ala Gly Ala Ser Arg Phe Arg Ala Trp Pro Glu Leu Leu Pro 2645
2650 2655Ser Trp Val Glu Val Cys Pro Ile Gln
Leu Pro Gly Arg Glu Glu 2660 2665
2670Arg Leu His Glu Pro Ala Phe Glu Thr Met Asp Ala Leu Val Asp
2675 2680 2685Ala Leu Val Pro Ala Val
Glu Ala His Ile Asp Arg Pro Phe Ala 2690 2695
2700Leu Phe Gly Cys Ser Met Gly Ala Leu Leu Ala Phe Glu Leu
Ala 2705 2710 2715Arg Ala Leu Gln Ser
Arg His Arg Leu Val Ala Arg His Leu Phe 2720 2725
2730Gly Ala Ala Ser Ser Ser Pro Arg Arg Val Ser Pro Val
Arg Glu 2735 2740 2745Gln Leu Ser Ala
Val Val Ser Pro Gly Thr Val Arg Ser Asp Ala 2750
2755 2760Met Ala Ser Leu Arg Gln Leu Gly Leu Leu Ser
Ser Ser Ser Leu 2765 2770 2775Gln Asp
Glu Glu Met Leu Asp Glu Val Trp Pro Ala Phe Arg Ala 2780
2785 2790Asp Leu Ser Leu Thr Leu Lys Tyr Thr Cys
Arg Asp Ala Thr Pro 2795 2800 2805Leu
Asp Ala Pro Ile Ser Val Phe Gly Gly Thr Glu Asp Arg Thr 2810
2815 2820Val Gly Arg Glu Asp Leu Val Ala Trp
His Thr Leu Thr Lys Asp 2825 2830
2835Ala Phe Gln Val Ala Met Leu Pro Gly Gly His Leu Phe Met Asp
2840 2845 2850Ala Thr Pro Lys Arg Leu
Phe His His Ile Glu His Ala Leu Gln 2855 2860
2865Leu28228PRTSorangium cellulosum 28Met Arg Thr Ser Asp Ala
Val Trp Ala Gly Ala Ala Gly Tyr Thr Arg1 5
10 15Ala Arg Leu Gln Val Tyr Asp Phe Phe Ile Tyr Gly
Phe Asn Ser Pro 20 25 30Val
Ala Trp Lys Cys Pro Gly Glu Glu Leu Leu Glu Asn Tyr Asn Arg 35
40 45His Val Ser Gly Asn His Leu Asp Val
Gly Val Gly Thr Gly Tyr Leu 50 55
60Leu Asp Arg Cys Arg Phe Pro Thr Ala Lys Pro Arg Val Phe Leu Met65
70 75 80Asp Leu Asn Pro Asp
Ala Leu Gln Val Thr Ala Gln Arg Leu His Arg 85
90 95Phe Gln Pro Gln Thr Leu Arg Arg Asn Val Leu
Asp Pro Ile Arg Phe 100 105
110Asp Gly Glu Pro Phe Asp Ser Ile Gly Met Asn Tyr Leu Met His Cys
115 120 125Val Pro Gly Ser Ile Pro Glu
Lys Ala Val Met Phe Asp His Leu Ser 130 135
140Ala Leu Leu Lys Pro Gly Gly Val Ile Phe Gly Ser Thr Val Leu
Ser145 150 155 160Glu Gly
Val Asp Lys Gly Ile Val Ala Arg Ala Ile Met Asp Arg Phe
165 170 175Asn Lys Lys Gly Ile Phe Ser
Asn Thr Arg Asp Ala Ala Ser Asp Leu 180 185
190Thr Arg Ala Leu Glu Glu Arg Phe Asp Asp Val Ser Val Arg
Val Val 195 200 205Gly Cys Val Gly
Leu Phe Ser Ala Arg Lys Arg Thr Cys Ala Gly Thr 210
215 220Glu Ser Pro Ala22529301PRTSorangium cellulosum
29Ile Val Leu Gly Asp Thr Leu Glu Gln Val Ala Thr Arg Leu Leu Glu1
5 10 15Glu Asp Leu Ala Ala Cys
His Thr Thr Gly Glu Ala Ala Asp Val Leu 20 25
30Leu Asn Gly Val Leu Ala Ser Ser Ala Arg Ala Val Ala
Ala Ala Leu 35 40 45Arg Ala Cys
Asp Glu Phe Ala Ala Gly Asp Ser Asp Leu Pro Ser Leu 50
55 60Ala Arg Ala Cys Arg Ala Phe Ala Gly Leu Ala Ser
Phe Gly Ser Ser65 70 75
80Arg Ser Leu Ser Ser Leu Gly Asp Gly Val Ile Ala Pro Met Leu Glu
85 90 95Lys Thr Phe Ala Arg Ala
Val Leu Arg Val His Gly Gly Cys Thr Gly 100
105 110Ser Asp Glu Ala Val Ala Ala Ala Lys Glu Ala Leu
Arg Thr Leu His 115 120 125Asp Val
Ala Leu Ser Gln Pro Ile Val Asp Arg Gly Ala Trp Leu Asp 130
135 140Ala Ala Arg Gly Leu Val Asp Ser Glu Val Val
Asn Pro Thr Ala Ser145 150 155
160Gly Leu Ala Cys Gly Leu Leu Tyr Leu Ala Gln Ala Ile Asp Asp Ala
165 170 175Glu Val Ala Arg
Val Val Gly Leu Arg Leu Gly Gly Ala Ala Glu Pro 180
185 190Glu Ala Ala Ala Ser Phe Leu Ala Gly Phe Leu
Glu Val Asn Ala Leu 195 200 205Val
Leu Val Lys Ser Arg Pro Val Val Glu Ala Leu Asp Ala Phe Leu 210
215 220Arg Ala Ile Ala Pro Glu Arg Phe Lys Asp
Thr Leu Pro Val Leu Arg225 230 235
240Arg Ala Phe Ala Gly Leu Gly Ala Thr Glu Arg Arg Tyr Leu Leu
Glu 245 250 255Asn Val Leu
Ala Ala Arg Lys Leu Gly Asp Lys Ala Arg Ala Ala Gln 260
265 270Ala Val Leu Leu Glu Lys Asp Arg Glu Lys
Leu Lys Glu Met Ser Glu 275 280
285Asp Leu Ser Gln Ala Met Asp Asp Leu Asp Glu Leu Leu 290
295 30030345PRTSorangium cellulosum 30Met Arg Arg Pro
Glu Arg Arg Asp Arg His Pro Arg Pro Arg Ala Ser1 5
10 15Asn Arg Gln Ser Ala Arg Arg Ser Ile Asp
Ala Val Asp Gly Ser Thr 20 25
30Trp Tyr Pro Ser Thr Met Arg Leu Gln Ser Val Asp Thr His Leu Val
35 40 45Val Ala Leu His Ala Leu Leu Gln
Glu Lys Ser Val Thr Arg Ala Ala 50 55
60Arg Arg Val Gly Val Thr Gln Pro Ser Met Ser His Ala Leu Ala Arg65
70 75 80Leu Arg Ala His Phe
Ala Asp Pro Leu Leu Ile Gln Val Gly Arg Gln 85
90 95Met Thr Leu Ser Glu Arg Ala Arg Asp Leu Ala
Pro Arg Ala Ala Glu 100 105
110Ala Val Glu Arg Leu Glu Gln Val Phe Arg Pro Val Glu Arg Phe Asp
115 120 125Pro Arg Arg Ser Gln Arg Thr
Phe Arg Leu Val Ala Thr Asp Asn Leu 130 135
140Glu Leu Leu Val Leu Pro Ala Leu Thr Ala Leu Leu Ala Val Glu
Ala145 150 155 160Pro Arg
Val Asn Leu Arg Cys Arg Asn Ile Pro Ala Asp Phe Ala Glu
165 170 175Leu Leu Arg Arg Gly Glu Leu
Asp Gly Lys Leu Gly Arg Gly Gly Pro 180 185
190Val Pro Asp Gly Cys Arg Ser Thr Leu Leu Ala Ala Glu Glu
Ile Val 195 200 205Cys Val Met Arg
Arg Gly His Pro Ala Ser Arg Arg Pro Leu Thr Ala 210
215 220Ala Arg Tyr Ala Ala Cys Glu His Leu Met Val Ser
Pro His Gly Glu225 230 235
240Asp His Gly Ala Ile Asp Arg Ala Leu Ala Glu Gln Gly Thr Arg Arg
245 250 255Arg Val Thr Leu Thr
Val Ser His Phe Leu Val Ala Pro Phe Ile Val 260
265 270Ser Gly Ser Asp Leu Leu Leu Thr Val Ser Ala Arg
Val Ala Ala Ala 275 280 285Leu Ala
Arg Arg Leu Asp Leu Val Val Arg Pro Cys Pro Phe Ala Leu 290
295 300Glu Gly Tyr Thr Leu Thr Leu Val Trp Pro Glu
Arg Ser Glu His Asp305 310 315
320Glu Gly His Gly Trp Leu Arg Asp Ala Ile Gln Arg Ala Val Ala Val
325 330 335Asp Ser Arg Pro
Ala Leu Pro Gly Val 340 34531108PRTSorangium
cellulosum 31Met Ile Ile Glu Tyr Val Arg Tyr Thr Ile Pro Ala Glu Gln Glu
Lys1 5 10 15Glu Phe Leu
Ala Ala Tyr Arg Asp Ala Ala Ala Glu Leu Arg Gly Ser 20
25 30Glu His Cys Leu Asp Tyr Glu Ile Ser Arg
Cys Val Glu Asp Pro Thr 35 40
45Ser Tyr Val Val Arg Ile Cys Trp Asp Ser Leu Gln Gly His Leu Gln 50
55 60Gly Phe Arg Lys Ala Ala Ala Phe Pro
Ser Phe Phe Ala Lys Val Lys65 70 75
80Pro Phe Tyr Glu Arg Ile Gln Glu Met Arg His Tyr Ala Leu
Thr Asp 85 90 95Val Ala
Ala Arg Gln Ala Gly Thr Ala Ala Thr Gly 100
10532486PRTSorangium cellulosum 32Met Lys Leu Ala Arg Lys Leu Thr Leu Ala
Leu Val Phe Gly Val Phe1 5 10
15Leu Val Leu Ala Leu Ser Ala Tyr Ala Gln Ile Arg Arg Glu Ala Arg
20 25 30Ile Phe Glu Asn Asp Val
Gln Arg Asp His His Thr Met Ala Arg Ala 35 40
45Leu Ala Ala Ala Val Met Glu Val Trp Arg Ser Glu Gly Thr
Ala Arg 50 55 60Ala Leu Arg Leu Val
Glu Asp Ala Asn Glu Arg Glu Gln Gln Ala Asn65 70
75 80Ile Arg Trp Val Trp Leu Asp Gly Gln Ala
Asp Glu Pro His Arg Pro 85 90
95Arg Leu Ala Pro Glu Leu Leu Ala Pro Val Ala Glu Gly Arg Ala Val
100 105 110Val Arg Arg Ile Pro
Gln Lys Asp Ala Asp Leu Leu Val Thr Cys Val 115
120 125Pro Val Ser Val Pro Gly Asp Arg Ala Gly Ala Leu
Glu Leu Ser Glu 130 135 140Ser Leu Ala
Gly Ala Arg Arg Tyr Ile Arg Ser Met Ile Leu Ser Thr145
150 155 160Ala Ile Thr Thr Ala Ala Leu
Thr Leu Val Cys Gly Leu Leu Thr Thr 165
170 175Gly Leu Gly Val Trp Leu Val Gly Arg Pro Met Arg
Thr Leu Ile Asp 180 185 190Gln
Ala Arg Arg Ile Gly Ala Gly Asp Leu Ser Gly Arg Leu Ser Leu 195
200 205Arg Gln Glu Asp Glu Ile Gly Glu Leu
Gly Arg Glu Met Asn Ala Met 210 215
220Cys Asp Arg Leu Ala Ala Ala Asn Gln Lys Leu Glu Ser Glu Ala Ala225
230 235 240Ala Arg Ile Ala
Ala Leu Gln Gln Leu Arg His Ala Glu Arg Leu Ala 245
250 255Thr Val Gly Lys Leu Ala Ser Gly Ile Ala
His Glu Leu Gly Ala Pro 260 265
270Leu Gln Val Val Thr Gly Arg Ala Arg Met Leu Val Asp Gly Asp Val
275 280 285Ser Gly Asp Glu Val Pro Ile
Asn Gly Gln Ile Ile Leu Glu Gln Ser 290 295
300Gln Arg Met Thr Gln Ile Ile Arg Gln Leu Leu Asp Phe Ala Arg
Arg305 310 315 320Arg Ser
Ala Glu Lys Gln Glu Thr Ala Leu Arg Gly Val Ile Arg Gly
325 330 335Thr Phe Thr Met Leu Lys Pro
Leu Ala Asp Lys Gln Gly Val Thr Ile 340 345
350Val Glu Glu Gly Asp Thr Pro Asp Arg Val Val His Ala Asp
Ala Asp 355 360 365Gln Leu Gln Gln
Ala Leu Thr Asn Val Val Val Asn Ala Ile Gln Ala 370
375 380Met Pro Ser Gly Gly Thr Ile Thr Val Gly Val Arg
Thr Val Arg Ala385 390 395
400Ser Pro Pro Pro Asp Gln Gly Gly Ala Glu Gly Asp Tyr Ile Ala Leu
405 410 415Ser Val Arg Asp Glu
Gly Gln Gly Met Thr Ala Asp Val Leu Glu His 420
425 430Val Phe Glu Pro Phe Phe Thr Thr Lys Pro Val Gly
Glu Gly Thr Gly 435 440 445Leu Gly
Leu Pro Val Ala Tyr Gly Ile Ile Lys Glu His Gly Gly Trp 450
455 460Ile Asp Val Asp Ser Arg Pro Gly Ser Gly Ser
Gln Phe Thr Met Tyr465 470 475
480Leu Pro Gln Glu Lys Pro 48533461PRTSorangium
cellulosum 33Met Thr Gly Arg Val Leu Ile Val Asp Asp Glu Arg Gly Val Cys
Glu1 5 10 15Leu Leu Asp
Ala Gly Leu Lys Lys Arg Gly Phe Gln Ala Ala Trp Arg 20
25 30Thr Ser Ala Ala Glu Ala Leu Glu Leu Leu
Gly Ala Glu Asp Phe Asp 35 40
45Val Val Val Thr Asp Met Thr Met Arg Gly Met Asn Gly Leu Glu Leu 50
55 60Cys Glu Arg Ile Ala Gln Asn Arg Pro
Asp Leu Pro Val Ile Val Ile65 70 75
80Thr Ala Phe Gly Ser Leu Asp Thr Ala Thr Ser Ala Ile Arg
Ala Gly 85 90 95Ala Tyr
Asp Phe Val Thr Lys Pro Phe Glu Leu Asp Ala Leu Arg Leu 100
105 110Thr Val Glu Arg Ala Leu Arg His Arg
Ala Leu Arg Glu Glu Val Arg 115 120
125Arg Leu Arg Arg Ala Val Asp Asp Ser His Arg Tyr Glu Gln Ile Leu
130 135 140Gly Gly Ser Pro Ala Met Lys
Gly Val Phe Asp Leu Leu Asp Arg Val145 150
155 160Ala Asp Ser Asp Thr Ser Ile Leu Ile Thr Gly Glu
Ser Gly Thr Gly 165 170
175Lys Glu Leu Val Ala Arg Ala Val His Gln Arg Ser Arg Arg Gly Gln
180 185 190Gly Ala Phe Ile Ala Val
Asn Cys Ala Ala Val Pro Asp Ala Leu Leu 195 200
205Glu Thr Glu Leu Phe Gly His Ala Arg Gly Ala Phe Thr Asp
Ala Lys 210 215 220Gly Ala Arg Ser Gly
Leu Phe Ala Arg Ala His Gly Gly Thr Leu Phe225 230
235 240Leu Asp Glu Ile Gly Glu Leu Pro Val Gly
Leu Gln Pro Lys Leu Leu 245 250
255Arg Ala Leu Gln Glu Arg Val Val Arg Pro Val Gly Ala Asp Glu Glu
260 265 270Val Pro Val Asp Val
Arg Leu Ile Ala Ala Thr Asn Arg Asp Leu Glu 275
280 285Thr Ala Ile Glu Glu Arg Arg Phe Arg Glu Asp Leu
Tyr Tyr Arg Ile 290 295 300Asn Val Val
His Val Asp Leu Pro Pro Leu Arg Ser Arg Gly Ala Asp305
310 315 320Val Leu Leu Leu Ala Gln Arg
Phe Leu Glu His Phe Ala Thr Val Lys 325
330 335Glu Arg Pro Ile Lys Gly Leu Ser Ala Pro Ala Ala
Glu Lys Leu Val 340 345 350Ala
Tyr Ala Trp Pro Gly Asn Val Arg Glu Leu Gln Asn Cys Ile Glu 355
360 365Arg Ala Val Ala Leu Ala Arg Tyr Asp
Gln Ile Thr Val Asp Asp Leu 370 375
380Pro Glu Lys Ile Arg Ser Tyr Arg Arg Ser His Val Leu Val Ser Ser385
390 395 400Asp Asp Pro Thr
Glu Leu Val Pro Met Glu Glu Val Glu Arg Arg Tyr 405
410 415Ile Leu Arg Val Leu Glu Val Val Gly Gly
Asn Lys Ser Gln Ala Ala 420 425
430Gln Val Leu Gly Phe Asp Arg Ala Thr Leu Tyr Arg Lys Leu Glu Arg
435 440 445Tyr Gly Leu Arg Ala Gly Arg
Ala Gly Asp Pro Arg Pro 450 455
46034508PRTSorangium cellulosum 34Met Arg Gln Pro Thr Pro Gln Gly Leu Ser
Trp Pro Arg Leu Pro Arg1 5 10
15Pro Val Arg Leu Ser Ala Leu Leu Gly Ala Ala Thr Leu Leu Leu Thr
20 25 30Ser Val Ala Ile Val Val
Ala Gly Ala Leu Met Val Ala Ser Thr Thr 35 40
45Met Gln Gln Thr Thr Arg Ile Leu Gly Ala Thr Val Glu Ser
Val Arg 50 55 60Leu Val Glu Arg Leu
Glu Ile Asp Leu Leu Leu Asp Ala His Gln Ser65 70
75 80Ser Arg Ala Val Gly Ser Gly Arg Gly Glu
Leu Ala Pro Ser Leu Ala 85 90
95Ala Trp Glu Gln Gly Leu Arg Ser Gly Leu Ala Ala Ala Arg Asp His
100 105 110Val Ser Ser Pro Glu
Glu Gly Arg Ile Leu Glu His Ala Glu Arg Arg 115
120 125Val Glu Asp Tyr Leu Ala Arg Arg Arg Ala Ala Asp
Ala His Glu Leu 130 135 140Pro Ser Ala
Pro Gly Ala His Asp Pro Ala Leu Leu Gly Val His Asp145
150 155 160Pro Ala Leu Asp Glu Ala Phe
Arg Ala Leu Asp His Leu Val Glu Ile 165
170 175Asn Leu Glu Gln Ala Arg Ala Ser Glu Ala Leu Val
Ala His Leu Thr 180 185 190Arg
Arg Thr Thr Gly Ala Gly Leu Ala Ala Val Val Phe Phe Leu Ala 195
200 205Gly Ala Ser Thr Ile Leu Leu Ser Ala
Arg Arg Leu Ile Tyr Arg Pro 210 215
220Ile Val Ala Ile Gln Glu Ala Ile Gly Arg Tyr Gly Ala Gly Asp Arg225
230 235 240Ala Ala Arg Ala
Pro Leu Ile Gly Pro Arg Glu Leu Gly Glu Ile Ala 245
250 255Arg Ala Phe Asn Asp Met Ala Glu Ser Leu
Glu Arg Gln Arg Glu Ala 260 265
270Gln Phe Ala Phe Leu Gly Gly Val Ala His Asp Leu Arg Asn Pro Leu
275 280 285Ser Ala Leu Arg Leu Ser Val
His Val Leu Asp Ala Asp Asn Arg Pro 290 295
300Leu Glu Ser Ser Val Arg Arg Thr Met Ala Leu Val Gly Arg Gln
Val305 310 315 320Asp Arg
Leu Asp Arg Met Val Gly Asp Leu Leu Asp Ala Ser Gln Ile
325 330 335Glu Ala Cys Lys Leu Asp Leu
Arg Val Glu Glu Arg Asp Leu Arg Asp 340 345
350Leu Ala Gln Glu Ala Val Asp Leu Tyr Arg Pro Val Ser Pro
Glu His 355 360 365Pro Ile Glu Leu
Ser Leu Pro Glu Thr Pro Val Leu Val Arg Cys Asp 370
375 380Ala Thr Arg Ile Glu Gln Val Leu Asn Asn Leu Leu
Ser Asn Ala Leu385 390 395
400Lys Tyr Ser Pro Ala Gly Gly Gln Val Asp Val Ala Val Arg Ala Gly
405 410 415Gly Glu Gly Ala Glu
Ile Ala Val Arg Asp Arg Gly Leu Gly Ile Glu 420
425 430Pro Glu Asp Leu Ala His Leu Phe Glu Pro Phe Arg
Arg Leu Lys Ser 435 440 445Thr Ser
Gly Ser Ile Pro Gly Thr Gly Leu Gly Leu Ala Val Ala Lys 450
455 460Arg Ile Val Glu Ala His Gly Gly Arg Leu Phe
Val Glu Ser Arg Pro465 470 475
480Gly Ala Gly Ser Val Phe Arg Ile Glu Leu Pro Arg Ser Ser Ser Arg
485 490 495Asp Gln Ala Asp
Gly Pro Arg Gly Val Ser His Gly 500
50535416PRTSorangium cellulosum 35Met Pro Ala Arg Thr Pro Arg Lys Pro Pro
Pro Pro Ala Ser Pro Ala1 5 10
15Gly Pro Ala Gly Ala Pro Asp Asp Leu Thr Asp Ser Asp Arg Asp Ala
20 25 30Leu Leu Arg Trp Arg Leu
Ala Leu Gly Pro Glu Ala Glu Arg Val Asp 35 40
45Pro Arg Leu Ser Leu Gly Gly Leu Gly Gly Ala Ala Pro Ala
Leu Asp 50 55 60Val Asp Ala Arg Arg
Leu Gly Asp Leu Asp Lys Ala Leu Ser Phe Ile65 70
75 80Tyr Asp Glu Arg Ala Gly Gly Leu Gly Gly
Ser Arg Pro Tyr Val Pro 85 90
95Glu Trp Leu Ser Ala Val Arg Glu Phe Phe Ser His Glu Val Val Ala
100 105 110Leu Val Gln Lys Asp
Ala Ile Glu Arg Lys Gly Leu Thr Gln Leu Leu 115
120 125Phe Glu Pro Glu Thr Leu Pro Phe Leu Glu Lys Asn
Val Glu Leu Val 130 135 140Ala Thr Leu
Met Ser Ala Lys Gly Leu Ile Pro Asp Ala Ala Arg Asp145
150 155 160Thr Ala Arg Gln Ile Val Arg
Glu Val Val Glu Glu Val Arg Arg Ala 165
170 175Leu Glu Ala Glu Val Arg Thr Ala Val Leu Gly Ala
Leu Arg Arg Asn 180 185 190Thr
Thr Ser Pro Leu Arg Val Leu Arg Asn Leu Asp Trp Lys Arg Thr 195
200 205Ile Arg Lys Asn Leu Lys Gly Trp Asp
Ala Glu Arg Arg Arg Leu Val 210 215
220Pro Asp Lys Leu Tyr Phe Trp Ala Asn Gln Thr Arg Arg His Glu Trp225
230 235 240Asp Val Ala Ile
Leu Val Asp Gln Ser Gly Ser Met Gly Glu Ser Val 245
250 255Val Tyr Ser Ser Ile Met Ala Ala Ile Phe
Ala Ser Leu Asp Val Leu 260 265
270Arg Thr Arg Leu Leu Phe Phe Asp Thr Glu Val Val Asp Val Thr Pro
275 280 285Met Leu Val Asp Pro Val Asp
Val Leu Phe Thr Ala Gln Leu Gly Gly 290 295
300Gly Thr Asp Ile Asn Arg Ala Val Ala Tyr Ala Gln Ala Asn Phe
Ile305 310 315 320Glu Arg
Pro Glu Lys Thr Leu Leu Ile Leu Ile Thr Asp Leu Phe Glu
325 330 335Gly Gly Asn Ala Glu Glu Leu
Val Ala Arg Met Arg Gln Leu Ala Asp 340 345
350Ser Lys Val Lys Ser Ile Cys Leu Leu Ala Leu Ser Asp Gly
Gly Lys 355 360 365Pro Ser Tyr Asp
His Glu Met Ala Gln Lys Leu Ala Ala Leu Gly Thr 370
375 380Pro Cys Phe Gly Cys Thr Pro Lys Leu Leu Val Lys
Val Val Glu Arg385 390 395
400Leu Met Arg Gly Gln Asp Leu Gly Pro Leu Leu Gly Ala Glu Ala Arg
405 410 41536352PRTSorangium
cellulosum 36Met Ala Glu Leu Asp His Trp His Pro Val Leu Leu Ser His Glu
Leu1 5 10 15Arg Arg Lys
Pro Arg Asn Val Arg Leu Ala Gly His Glu Ile Val Val 20
25 30Phe Arg Thr Ser Ser Gly Gly Leu Gly Ala
Phe Thr Asp Arg Cys Pro 35 40
45His Arg Ser Met Arg Leu Ser Glu Gly Trp Val Glu Gly Asp Arg Leu 50
55 60Val Cys Ala Tyr His Gly Trp Arg Trp
Ala Val Asp Gly Arg Gly Glu65 70 75
80Ile Pro Ala Thr Pro Ala Ala Arg Pro Cys Ala Arg Arg Glu
Asp Met 85 90 95Phe Glu
Ala Val Glu Arg Tyr Gly Ala Ile Trp Val Lys Arg Ala Gly 100
105 110Ser Gln Ala Ala Phe Pro Arg Leu Glu
Gly Glu Gly Tyr Val Pro Arg 115 120
125Gly Leu Leu Arg His Arg Ala Thr Val Pro Phe Glu Leu Ala Leu Asp
130 135 140Asn Phe Ile Glu Ile Glu His
Thr Pro Phe Val His Phe Met Leu Gly145 150
155 160Tyr Pro Leu Glu Arg Met Pro Glu Val Glu Ala Arg
Val Thr Leu Thr 165 170
175Asp Glu Thr Ile Arg Val Val His Ser Gly Pro Arg Arg Pro Met Pro
180 185 190Arg Ala Met Glu Lys Leu
Leu Gly Ile Pro Glu Asp Ala Ile Phe Val 195 200
205Val Asp Trp Thr Ser Tyr Phe Ser Pro Val Tyr Thr Ile Tyr
Asn His 210 215 220Ser Leu Arg Asp Pro
Lys Thr Asn Gln Pro Val Thr Phe Pro Leu Arg225 230
235 240Ser Ala Val Phe Phe Asn Pro Val Gly Pro
Glu Ser Ser Glu Met Tyr 245 250
255Thr Phe Leu Phe Ala Ser Leu Ala Pro Trp Ser Lys Phe Gly Ala Gly
260 265 270Ala Val Leu Trp Pro
Ala Met Gln Val Ala Met Asn Ile Glu Leu Arg 275
280 285Leu Asp Met Arg Leu Leu Asp Arg Leu Thr Asp Lys
Arg Gly Ile Leu 290 295 300Lys Gly Asn
Val Leu Gly Arg Phe Asp Lys Pro Leu Val Ile Ala Arg305
310 315 320Asp Arg Ile Asp Arg Ile Tyr
Arg Gly Arg Val Ala Glu Ala Gly Asp 325
330 335Gly His Glu Ala Ala Arg Pro Ala Arg Arg Leu Pro
Leu Ala Ala Pro 340 345
35037376PRTSorangium cellulosum 37Met His Val Glu Glu Cys His Val Val Ile
Val Gly Ala Gly Pro Ser1 5 10
15Gly Leu Ala Val Gly Ala Cys Leu Arg Glu Gln Gly Ile Pro Phe Val
20 25 30Leu Leu Glu Lys Ser Glu
Ala Val Gly Ala Thr Trp Arg Arg His Tyr 35 40
45Asp Arg Leu His Leu Asn Thr Ile Lys Gln Leu Ser Ala Leu
Pro Gly 50 55 60Gln Pro Trp Pro Glu
Tyr Ser Ala Pro Tyr Pro Ser Arg Val Glu Met65 70
75 80Val Asp Tyr Leu Glu Arg Tyr Ala Glu Arg
Phe Arg Leu Glu Pro Arg 85 90
95Leu Gly Val Glu Val Glu Arg Ala Tyr His Asp Gly Ser Arg Trp Val
100 105 110Thr Arg Thr His Ala
Gly Glu Leu Arg Ser Gln Ala Leu Val Val Ala 115
120 125Thr Gly Tyr Ser Arg His Pro Asn Val Pro Thr Trp
Pro Asp Gln Glu 130 135 140Arg Phe Arg
Gly Arg Ile Leu His Ser Ser Ala Tyr Arg Ser Gly Ala145
150 155 160Glu Phe Arg Gly Gln Arg Val
Leu Val Val Gly Ala Gly Asn Ser Ala 165
170 175Ser Glu Ile Ala Leu Asp Leu Trp Glu His Cys Ala
Glu Thr Thr Leu 180 185 190Ser
Val Arg Ser Gly Asn His Val Ile Pro Arg Glu Leu Phe Lys Leu 195
200 205Pro Ala Gln Phe Asn Ala Leu Ala Leu
Phe Glu Arg Leu Pro Leu Ala 210 215
220Val Gly Asp Arg Leu Ala Thr Ala Ile Leu Ser Arg Ala Val Gly Asp225
230 235 240Leu Ser Arg Trp
Gly Ile Arg Arg Pro Ala Val Gly Pro Gly Thr Arg 245
250 255Ala Leu Lys Glu Gly Arg Met Pro Leu Ile
Asp Ile Gly Thr Val Ala 260 265
270Leu Ile Gln Gln Gly Lys Ile Lys Val Val Pro Gly Pro Arg Ala Phe
275 280 285Thr Glu Thr Gly Val Thr Phe
Thr Asp Gly Arg Gly Leu Pro Phe Asp 290 295
300Val Val Val Leu Ala Thr Gly Tyr Arg Pro Gly Leu Asp Asp Phe
Leu305 310 315 320Glu Asn
Ala Thr Arg Tyr Thr Asp Glu His Gly Cys Pro Arg Trp His
325 330 335Gly Ala Pro Thr Pro Ala Pro
Gly Leu Phe Phe Ile Gly Phe Arg Asn 340 345
350Pro Ile Thr Gly Gln Ile Arg Asp Ile Ala Ala Glu Ala Pro
Arg Ile 355 360 365Ala Arg His Ile
Gln Gly Val Asn 370 37538356PRTSorangium cellulosum
38Met Cys Tyr Gly Leu Ala Met His Ala Ala Pro Ala Arg Asp Leu Ile1
5 10 15Arg His Phe His Pro Val
Leu Pro Ala Ser Lys Leu Gly Arg Lys Pro 20 25
30Val Arg Val Val Leu Ala Gly Asn Ala Tyr Ala Leu Phe
Arg Asp Glu 35 40 45Gln Gly Arg
Pro Ala Ala Leu Ala Asp Ala Cys Pro His Arg Leu Ala 50
55 60Pro Leu Ser Gln Gly Arg Val Arg Pro Asp Gly Arg
Leu Glu Cys Pro65 70 75
80Tyr His Gly Trp His Phe Asp Ala Glu Gly Arg Gly Ala Cys Pro Ser
85 90 95Gln Pro Ser Leu Thr Arg
Cys Asp Thr Arg Ser Phe Gln Leu Val Glu 100
105 110Gln Leu Gly Tyr Leu Trp Leu Ala His Arg Asp Thr
Pro Arg Ser Ala 115 120 125Leu Pro
Glu Leu Asp Phe Ser Ser Asp Gly Phe Glu Tyr Ala Gly Thr 130
135 140Phe Ser His Leu Ala Pro Ala Pro Leu His Val
Ile Phe Asp Asn Ser145 150 155
160Ser Glu Asp Glu His Thr Pro Phe Val His Gly Arg Leu Gly Trp Thr
165 170 175Pro Glu Asp Ala
Ala Arg Ile Asp Phe Ser Cys Asp Val Phe Glu Asp 180
185 190Arg Thr Glu Val Lys Tyr Ser Ala Pro Gln Arg
Pro Ser Thr Leu Ala 195 200 205Arg
Leu Met Leu Leu Gln Pro Gly Asp Thr Phe His Asn Gln Trp Val 210
215 220Thr Arg Phe Ser Pro Val Tyr Thr Val Tyr
Thr Ser Trp Trp Thr Ala225 230 235
240Gln Asn Gly Met Glu Arg Pro Val Val Ala Arg Ala Gly Ile Phe
Phe 245 250 255Val Pro Glu
Thr Glu Arg Thr Thr Phe Val Arg Ala Phe Leu Phe Val 260
265 270Lys Ile Thr Asp Pro Arg Phe Arg Pro Leu
Leu Pro Val Val Lys Ser 275 280
285Ala Ala Ile Ala Leu Ser Trp Lys Glu Ile Arg Asp Asp Val Lys Phe 290
295 300Ile Pro His Val Ala Asp Thr Pro
Phe Glu Met Lys Gly Met Arg Leu305 310
315 320Asn Lys Tyr Asp Ala Thr Leu Val His Asn His Arg
Leu Met Arg Ser 325 330
335Ile Tyr Phe Gly Glu Thr Arg Gly Glu Ala Glu Gly Thr Gly Val Gly
340 345 350His Ala Ser Ala
35539395PRTSorangium cellulosum 39Met Thr Cys Phe Val Pro Ala Leu Arg Arg
Met Gly Ala Thr Pro Ala1 5 10
15Arg Thr Cys Met Arg Gln Arg Leu Asp Val Thr Asp Leu Tyr Asn Asp
20 25 30Ala Tyr Thr Ala Tyr Ile
Glu Ala Phe Arg Arg Gln Thr Glu Leu Val 35 40
45Ala Ser Glu Ile Leu Leu Glu His Leu Val Asp Pro Ser Gly
Ala Val 50 55 60Arg Gly Leu Asp Asp
Arg Pro Glu Ser Ala Pro Ser Val Thr Ala Tyr65 70
75 80Gln Phe Arg Arg Lys Leu Leu Asp Tyr Phe
Ser Asp Lys Gly Asp Leu 85 90
95Thr Gln Asp Pro Ser Gly Arg Leu Val Pro Ser Glu Ala Val Arg Lys
100 105 110Arg Val Ala Glu Lys
Glu Ser Ile Ala Leu Ala Asp Arg Ala Ile Leu 115
120 125Gly Glu Met Val Glu Phe Leu Gln Arg Tyr Arg Gly
Leu Ala Gly Pro 130 135 140Val Leu Ala
Gly Lys Asp Ala Leu Ala Thr Met Asp Leu Gln Tyr Gly145
150 155 160Met Gln Ala Ser Leu Lys Phe
Trp Glu Tyr Ser Met Ile Ser Leu Pro 165
170 175Ala Lys Lys Pro Cys Asn Val Met Leu Ala Arg Ala
Leu Met Ala Lys 180 185 190Leu
Ala Glu Gly Pro Gly Ile Ser Val Phe Glu Gly Gly Ala Gly Leu 195
200 205Gly Val Val Leu Arg Gln Ala Leu Ser
Asp Pro Arg Phe Leu Pro Leu 210 215
220Ser Lys Asn Leu Ala Arg Tyr Asp Tyr Thr Asp Ile Ser Ala Leu Leu225
230 235 240Met Glu Thr Gly
Lys Gln Trp Leu Arg Thr His Ala Pro Ala Asp Val 245
250 255Phe Gln Arg Ile His Phe Gln Arg Leu Asp
Leu Asp Thr Leu Pro Ser 260 265
270Ala Gly Ser Thr Phe Ala Arg Ala Ala Ser Val Asp Leu Ile Val Leu
275 280 285Glu His Val Leu Tyr Asp Val
Arg Asp Leu His Ala Thr Leu Gln Ala 290 295
300Phe His Thr Met Leu Lys Pro Gly Gly Gln Leu Ala Phe Thr Met
Ser305 310 315 320Phe Arg
Asp Arg Pro Gly Val Phe Phe Pro Asn Glu Phe Phe Gln Ser
325 330 335Met Leu His Thr Tyr Ser Lys
Ala Lys Leu Asp Pro Pro Arg Arg Gln 340 345
350His Val Gly Tyr Leu Thr Leu Gln Glu Trp Glu Leu Ser Leu
Arg Ala 355 360 365Ala Gly Phe Ser
Glu Trp Glu Val Tyr Pro Ala Pro Glu Asp His Ala 370
375 380Lys Trp Pro Phe Gly Gly Ile Val Ala Tyr Arg385
390 3954084PRTSorangium cellulosum 40Met Thr
Phe Gly Tyr Ala Thr Val Ala Leu Phe Phe Leu Arg Phe Trp1 5
10 15Lys Lys Thr Gly Asp Arg Leu Phe
Ala Lys Phe Ser Ala Ala Phe Trp 20 25
30Leu Met Met Leu Gly Arg Ile Ala Val Ala Leu Asn Arg Val Glu
Glu 35 40 45Asp Ala Ile His Tyr
Leu Tyr Leu Phe Arg Leu Phe Ala Tyr Met Leu 50 55
60Ile Leu Tyr Ala Ile Val His Lys Asn Arg Gly Asn Asp Gly
Gln Ala65 70 75 80Leu
Ser Ser Arg4186PRTSorangium cellulosum 41Met Ala Ala Ala Val Tyr Ile Leu
Cys Ala Leu Thr Ser Ile Ala Cys1 5 10
15Ala Val Leu Leu Leu Arg Gly Tyr Ala Gln Arg Lys Val Arg
Leu Leu 20 25 30Leu Trp Ser
Gly Leu Cys Phe Ala Ala Leu Ala Ala Asn Asn Ile Leu 35
40 45Leu Phe Val Asp Leu Val Val Ile Arg Ser Val
Asp Leu Ser Ser Leu 50 55 60Arg His
Leu Thr Ala Leu Ile Gly Leu Ala Leu Leu Leu Tyr Gly Leu65
70 75 80Ile Trp Asp Leu Arg Glu
8542125PRTSorangium cellulosum 42Met Gln Arg Phe Leu Gly Ala His
Ile Ser Ser Ile Glu Gln Leu Glu1 5 10
15Val Leu Leu Leu Met Arg Arg Thr Ala Glu Arg Glu Trp Ser
Ala Ala 20 25 30Ala Met Ala
Arg Glu Ile Gly Ser Ser Met Met Ser Ile Gln Asp Arg 35
40 45Phe Gly Gly Leu Ala Ser Arg Gly Leu Ile Val
Ala Arg Glu Asp Gly 50 55 60Glu Asp
Ile Phe Tyr Arg Tyr Ala Pro Ala Asp Asp Glu Thr Arg Arg65
70 75 80Thr Ile Asp Asp Leu Ala Gln
Ala Tyr Lys Glu Arg Arg Leu Ser Val 85 90
95Ile Asn His Ile Tyr Ala Thr Pro Pro Pro Ser Asp Ile
Gln Ser Phe 100 105 110Ser Asp
Ala Phe Leu Ile Thr Lys Lys Gly Lys Gly Gly 115
120 125431762PRTSorangium cellulosum 43Ile Leu Glu Leu
Lys Asn Thr Phe Asn Thr Met Val Asp Gln Leu Arg1 5
10 15Ser Phe Ala Ala Gln Val Thr Arg Val Ala
Arg Glu Val Gly Thr Glu 20 25
30Gly Lys Leu Gly Gly Gln Ala Glu Val Thr Gly Val Ala Gly Thr Trp
35 40 45Lys Asp Leu Thr Asp Ser Val Asn
Ser Met Ala Ser Asn Leu Thr Ala 50 55
60Gln Val Arg Asn Ile Ala Asp Val Thr Thr Ala Val Ala Asn Gly Asp65
70 75 80Leu Ser Lys Lys Ile
Thr Val Asp Val Arg Gly Glu Ile Leu Glu Leu 85
90 95Lys Asp Thr Phe Asn Thr Met Val Asp Gln Leu
Arg Ser Phe Ala Ser 100 105
110Glu Val Thr Arg Val Ala Arg Glu Val Gly Thr Glu Gly Lys Leu Gly
115 120 125Gly Gln Ala Ser Val Pro Gly
Val Ala Gly Thr Trp Lys Asp Leu Thr 130 135
140Asp Ser Val Asn Ser Met Ala Ser Asn Leu Thr Ala Gln Val Arg
Asn145 150 155 160Ile Ala
Asp Val Thr Thr Ala Val Ala Arg Gly Asp Leu Ser Lys Lys
165 170 175Ile Thr Val Asp Val Lys Gly
Glu Ile Leu Glu Leu Lys Asn Thr Phe 180 185
190Asn Thr Met Val Asp Gln Leu Ser Ser Phe Ala Ala Glu Val
Thr Arg 195 200 205Val Ala Arg Glu
Val Gly Thr Glu Gly Lys Leu Gly Gly Gln Ala Glu 210
215 220Val Lys Gly Val Ala Gly Thr Trp Lys Asp Leu Thr
Asp Ser Val Asn225 230 235
240Ser Met Ala Ser Asn Leu Thr Ala Gln Val Arg Asn Ile Ala Ala Val
245 250 255Thr Thr Ala Val Ala
Asn Gly Asp Leu Ser Lys Lys Ile Leu Glu Leu 260
265 270Lys Asp Thr Phe Asn Thr Met Val Asp Gln Leu Arg
Ser Phe Ala Ser 275 280 285Glu Val
Thr Arg Val Ala Arg Glu Val Gly Thr Glu Gly Lys Leu Gly 290
295 300Gly Gln Ala Ser Val Pro Gly Val Ala Gly Thr
Trp Lys Asp Leu Thr305 310 315
320Asp Ser Val Asn Ser Met Ala Ser Asn Leu Thr Ala Gln Val Arg Asn
325 330 335Ile Ala Ala Val
Thr Thr Ala Val Ala Asn Gly Asp Leu Ser Lys Lys 340
345 350Ile Thr Val Asp Val Arg Gly Glu Ile Leu Glu
Leu Lys Asp Thr Phe 355 360 365Asn
Thr Met Val Asp Gln Leu Arg Ser Phe Ala Ser Glu Val Thr Arg 370
375 380Val Ala Arg Glu Val Gly Thr Glu Gly Lys
Leu Gly Gly Gln Ala Ser385 390 395
400Val Pro Gly Val Ala Gly Thr Trp Lys Asp Leu Thr Asp Ser Val
Asn 405 410 415Ser Met Ala
Ser Asn Leu Thr Ala Gln Val Arg Asn Ile Ala Asp Val 420
425 430Thr Thr Ala Val Ala Arg Gly Asp Leu Ser
Lys Lys Ile Thr Val Asp 435 440
445Val Lys Gly Glu Ile Leu Glu Leu Lys Asn Thr Phe Asn Thr Met Val 450
455 460Asp Gln Leu Ser Ser Phe Ala Ala
Glu Val Thr Arg Val Ala Arg Glu465 470
475 480Val Gly Thr Glu Gly Lys Leu Gly Gly Gln Ala Glu
Val Lys Gly Val 485 490
495Ala Gly Thr Trp Lys Asp Leu Thr Asp Ser Val Asn Ser Met Ala Ser
500 505 510Asn Leu Thr Ala Gln Val
Arg Asn Ile Ala Ala Val Thr Thr Ala Val 515 520
525Ala Asn Gly Asp Leu Ser Lys Lys Ile Thr Val Asp Val Arg
Gly Glu 530 535 540Ile Leu Glu Leu Lys
Asp Thr Phe Asn Thr Met Val Asp Gln Leu Arg545 550
555 560Ser Phe Ala Ser Glu Val Thr Arg Val Ala
Arg Glu Val Gly Thr Glu 565 570
575Gly Lys Leu Gly Gly Gln Ala Ser Val Pro Gly Val Ala Gly Thr Trp
580 585 590Lys Asp Leu Thr Asp
Ser Val Asn Ser Met Ala Ser Asn Leu Thr Ala 595
600 605Gln Val Arg Asn Ile Ala Ala Val Thr Thr Ala Val
Ala Asn Gly Asp 610 615 620Leu Ser Lys
Lys Ile Thr Val Asp Val Arg Gly Glu Ile Leu Glu Leu625
630 635 640Lys Asn Thr Ile Asn Tyr Thr
Met Val Asp Gln Leu Asn Ala Phe Ala 645
650 655Ser Glu Val Thr Arg Val Ala Arg Glu Val Gly Thr
Glu Gly Lys Leu 660 665 670Gly
Gly Gln Ala Ser Val Pro Gly Val Ala Gly Thr Trp Lys Asp Leu 675
680 685Thr Asp Asn Val Asn Phe Met Ala Gly
Asn Leu Thr Asn Gln Val Arg 690 695
700Gly Ile Ala Lys Val Val Thr Ala Val Ala Asn Gly Asp Leu Lys Arg705
710 715 720Lys Leu Ala Phe
Asp Ala Lys Gly Glu Ile Ala Ala Leu Ala Asp Thr 725
730 735Ile Asn Gly Val Ile Glu Thr Leu Ala Thr
Phe Ala Asp Gln Val Thr 740 745
750Thr Val Ala Arg Glu Val Gly Val Glu Gly Lys Leu Gly Gly Gln Ala
755 760 765Ser Val Pro Gly Ala Ala Gly
Thr Trp Lys Asp Leu Thr Asp Asn Val 770 775
780Asn Arg Leu Ala Ala Asn Leu Thr Thr Gln Val Arg Ala Ile Ala
Glu785 790 795 800Val Ala
Thr Ala Val Thr Lys Gly Asp Leu Thr Arg Ser Ile Lys Val
805 810 815Glu Ala Gln Gly Glu Val Ala
Ala Leu Lys Asp Thr Ile Asn Glu Met 820 825
830Ile Arg Asn Leu Lys Asp Thr Thr Leu Lys Asn Ser Glu Gln
Asp Trp 835 840 845Leu Lys Thr Asn
Leu Ala Lys Phe Ser Arg Met Leu Gln Gly Gln Lys 850
855 860Asp Leu Leu Thr Val Gly Arg Leu Ile Leu Ser Glu
Leu Ala Pro Val865 870 875
880Val Gly Ala Gln Gln Gly Val Phe Phe Thr Met Asp Val Ala Lys Glu
885 890 895Glu Pro Ile Leu Lys
Leu Leu Ala Ser Tyr Ala Tyr Lys Val Arg Lys 900
905 910His Val Asp Asn His Phe Lys Leu Gly Glu Gly Leu
Val Gly Gln Cys 915 920 925Ala Leu
Glu Lys Glu Lys Ile Leu Leu Val Asn Ala Pro Pro Asp Tyr 930
935 940Ile Arg Ile Thr Ser Gly Leu Gly Glu Ala Pro
Pro Val Asn Ile Ile945 950 955
960Val Ile Pro Val Leu Phe Glu Gly Gln Val Lys Ala Val Ile Glu Leu
965 970 975Ala Ser Phe Glu
Arg Phe Ser Pro Thr His Gln Ala Phe Leu Asp Gln 980
985 990Leu Thr Glu Ser Ile Gly Ile Val Leu Asn Thr
Ile Glu Ala Asn Met 995 1000
1005Arg Thr Glu Asp Leu Leu Lys Gln Ser Gln Ser Leu Ala Arg Glu
1010 1015 1020Leu Gln Ser Gln Gln Glu
Glu Leu Gln Gln Thr Asn Ala Glu Leu 1025 1030
1035Gly Glu Lys Ala Arg Leu Leu Ala Gln Gln Asn Val Glu Val
Glu 1040 1045 1050Arg Lys Asn Arg Glu
Val Glu Gln Ala Arg Gln Ala Leu Glu Glu 1055 1060
1065Lys Ala Arg Gln Leu Ala Ile Thr Ser Lys Tyr Lys Ser
Glu Phe 1070 1075 1080Leu Ala Asn Met
Ser His Glu Leu Arg Thr Pro Leu Asn Ser Leu 1085
1090 1095Leu Ile Leu Ser Asp Gln Leu Ser Lys Asn Thr
Asp Arg Asn Leu 1100 1105 1110Thr Gly
Arg Gln Val Glu Phe Ala Lys Thr Ile His Ser Ser Gly 1115
1120 1125Asn Asp Leu Leu Ala Leu Ile Asn Asp Ile
Leu Asp Leu Ser Lys 1130 1135 1140Ile
Glu Ser Gly Thr Val Ile Val Asp Val Gly Glu Leu Ser Phe 1145
1150 1155Ser Asp Leu Gln Asp Tyr Val Glu Arg
Thr Phe Gln His Val Ala 1160 1165
1170Glu Ser Lys Arg Leu Glu Phe Glu Leu Asn Phe Ala Gln Asn Leu
1175 1180 1185Pro Gln Val Ile Tyr Thr
Asp Ala Lys Arg Val Gln Gln Val Leu 1190 1195
1200Lys Asn Leu Leu Ser Asn Ser Phe Lys Phe Thr Glu Arg Gly
Ser 1205 1210 1215Val Ala Leu Asp Val
Asp Leu Val Thr Ser Gly Trp Thr Ile Glu 1220 1225
1230Asn Glu Gly Leu Ser Arg Ala Gly Ala Ala Ile Ala Met
Ser Val 1235 1240 1245Arg Asp Thr Gly
Ile Gly Ile Pro His Asp Lys Gln Gln Ile Ile 1250
1255 1260Phe Glu Ala Phe Gln Gln Ala Asp Gly Ser Thr
Ser Arg Lys Tyr 1265 1270 1275Gly Gly
Thr Gly Leu Gly Leu Ala Ile Ser Arg Glu Ile Ala Trp 1280
1285 1290Met Leu Gly Gly Glu Ile Lys Leu Ser Ser
Arg Pro Gly Ser Gly 1295 1300 1305Ser
Thr Phe Thr Leu Tyr Leu Pro Leu Thr Tyr Thr Pro Ala Arg 1310
1315 1320Pro Arg Arg Lys Glu Gln Ala Ala Glu
Val Pro Ser Ala Pro Pro 1325 1330
1335Ala Leu Val Ser Gly Asp Val Ala Pro Arg Ser Ala Ala Glu Pro
1340 1345 1350Pro Pro His Leu Leu Asn
Gln Ser Val Asp Asp Ser Ala Ser Leu 1355 1360
1365Gln Pro Ser Asp Ser Val Val Leu Ile Val Glu Asn Asp Ala
Ser 1370 1375 1380Phe Ala His Phe Val
Met Asp Val Ala His Asp His Gly Phe Lys 1385 1390
1395Ala Ile Leu Ala Tyr Arg Gly Gly Ala Ala Leu Ser Ile
Val Arg 1400 1405 1410Glu Arg Arg Val
Asn Ala Ile Thr Leu Asp Ile Asn Leu Pro Asp 1415
1420 1425Met Asp Gly Trp Arg Val Leu Asp Arg Val Lys
Arg Asp Leu Ala 1430 1435 1440Thr Arg
His Ile Pro Val Gln Val Ile Thr Thr Asp Glu Glu Arg 1445
1450 1455Glu Arg Ala Leu Arg Met Gly Ala Thr Gly
Val Leu Cys Lys Pro 1460 1465 1470Leu
Lys Thr Arg Asp Ala Leu Asp Glu Thr Phe Arg Arg Leu Ser 1475
1480 1485Gln Phe Met Val Ser Arg Arg Arg Thr
Val Val Leu Ala Glu Pro 1490 1495
1500Asp Glu Ala Glu Arg Gln Glu Leu Val Glu Leu Leu Gly Gly Asp
1505 1510 1515Asp Val Thr Ile Arg Ser
Val Ala Ser Gly Glu Glu Ala Leu Asp 1520 1525
1530Ala Leu Leu Thr Glu Gly Ala Asp Val Leu Ile Leu His Leu
Asp 1535 1540 1545Leu Pro Asp Met Arg
Cys Phe Asp Leu Ile Gly Gln Leu Ala Gln 1550 1555
1560Gly Ser Gly Pro Thr Glu Leu Pro Val Leu Val Tyr Ala
Pro Glu 1565 1570 1575Glu Ile Ser Ala
Ala Asp Glu Ala Gln Leu Ser Arg Phe Ser Gln 1580
1585 1590Leu Met Val Leu Lys His Val Arg Ser Lys Glu
Arg Leu Phe Asp 1595 1600 1605Asp Val
Ser Leu Phe Leu His Arg Pro Val Ala Ala Leu Ser Glu 1610
1615 1620Arg Gln Arg Gln Thr Leu Gln Glu Leu His
Gln Ser Asn Lys Val 1625 1630 1635Leu
Ala Gly Lys Lys Val Leu Val Val Asp Asp Asp Val Arg Asn 1640
1645 1650Ile Phe Ala Met Thr Thr Ile Leu Asp
Ala Gln Gln Met Lys Thr 1655 1660
1665Val Tyr Val Glu Thr Gly Arg Ala Ala Ile Glu Met Leu Gln Arg
1670 1675 1680Thr Pro Asp Ile Glu Ile
Val Leu Met Asp Ile Met Met Pro Glu 1685 1690
1695Met Asp Gly Tyr Asp Thr Ile Arg Ala Ile Arg Ala Lys Pro
Glu 1700 1705 1710His His Ala Leu Pro
Ile Ile Ala Val Thr Ala Lys Ala Met Lys 1715 1720
1725Gly Asp Arg Glu Lys Cys Phe Glu Ala Gly Ala Asn Asp
Tyr Ile 1730 1735 1740Ser Lys Pro Val
Asp Pro Glu His Leu Leu Ala Met Leu Arg Leu 1745
1750 1755Trp Leu His Arg 1760
User Contributions:
Comment about this patent or add new information about this topic:
