Inventors list

Assignees list

Classification tree browser

Top 100 Inventors

Top 100 Assignees

Patent application title: Biosynethetic gene cluster for jerangolids

Inventors:  Christopher Reeves (Orinda, CA, US)  Ralph C. Reid (San Rafael, CA, US)
IPC8 Class: AC07K200FI
USPC Class: 530300
Class name: PEPTIDES OF 3 TO 100 AMINO ACID RESIDUES
Publication date: 08/28/2008
Patent application number: 20080207873






Sign up to receive free email alerts when patent applications with chosen keywords are published SIGN UP

Abstract:

Domains of jerangolid polyketide synthase and modification enzymes and polynucleotides encoding them are provided. Methods to prepare jerangolid in pharmaceutically useful quantities are described, as are methods to prepare jerangolid analogs and other polyketides using the polynucleotides encoding jerangolid synthase domains or modifying enzymes.

Claims:

1. A purified or recombinant nucleic acid comprising a nucleotide sequence that encodes at least one polypeptide required for the biosynthesis of jerangolid, wherein the complement of said nucleotide sequence hybridizes to a sequence selected from the group consisting of nucleotides 1-67323 of SEQ ID NO:1, under conditions of hybridization at 65.degree. C. for 36 hours and washing 3 times at high stringency with 0.1.times.SSC and 0.5% SDS for 20 minutes at 65.degree. C.

2. A purified or recombinant nucleic acid a nucleotide sequence that encodes at least one module of the jerangolid polyketide synthase, wherein the complement of said nucleotide sequence hybridizes to a sequence selected from the group consisting of nucleotides that encode modules of the jerangolid PKS as listed in Table 1.

3. A purified or recombinant nucleic acid according to claim 1, wherein said polypeptide comprises a β-ketoacylsynthase domain and wherein the complement of said nucleotide sequence hybridizes to a sequence selected from the group consisting of β-ketoacylsynthase domains as listed in Table 1, under conditions of hybridization at 65.degree. C. for 36 hours and washing 3 times at high stringency with 0.1.times.SSC and 0.5% SDS for 20 minutes at 65.degree. C.

4. A purified or recombinant nucleic acid according to claim 1, wherein said polypeptide comprises an acyltransferase domain and wherein the complement of said nucleotide sequence hybridizes to a sequence selected from the group consisting of acyltransferase domains as listed in Table 1, under conditions of hybridization at 65.degree. C. for 36 hours and washing 3 times at high stringency with 0.1.times.SSC and 0.5% SDS for 20 minutes at 65.degree. C.

5. A purified or recombinant nucleic acid according to claim 1, wherein said polypeptide comprises a β-ketoreductase domain and wherein the complement of said nucleotide sequence hybridizes to a sequence selected from the group consisting of β-ketoreductase domains as listed in Table 1, under conditions of hybridization at 65.degree. C. for 36 hours and washing 3 times at high stringency with 0.1.times.SSC and 0.5% SDS for 20 minutes at 65.degree. C.

6. A purified or recombinant nucleic acid according to claim 1, wherein said polypeptide comprises a dehydratase domain and wherein the complement of said nucleotide sequence hybridizes to a sequence selected from the group consisting of dehydratase domains as listed in Table 1, under conditions of hybridization at 65.degree. C. for 36 hours and washing 3 times at high stringency with 0.1.times.SSC and 0.5% SDS for 20 minutes at 65.degree. C.

7. A purified or recombinant nucleic acid according to claim 1, wherein said polypeptide comprises an enoylreductase domain and wherein the complement of said nucleotide sequence hybridizes to enoylreductase domains as listed in Table 1, under conditions of hybridization at 65.degree. C. for 36 hours and washing 3 times at high stringency with 0.1.times.SSC and 0.5% SDS for 20 minutes at 65.degree. C.

8. A purified or recombinant nucleic acid according to claim 1, wherein said polypeptide comprises an acyl carrier protein domain and wherein the complement of said nucleotide sequence hybridizes to a sequence selected from the group consisting of acyl carrier protein domains as listed in Table 1, under conditions of hybridization at 65.degree. C. for 36 hours and washing 3 times at high stringency with 0.1.times.SSC and 0.5% SDS for 20 minutes at 65.degree. C.

9. A purified or recombinant polypeptide involved in the biosynthesis of an jerangolid, wherein said polypeptide has an amino acid sequence that can be encoded by a nucleic acid sequence of claim 1.

10. The polypeptide of claim 9 that can be encoded by the gene jerA.

11. The polypeptide of claim 9 that can be encoded by the gene jerB.

12. The polypeptide of claim 9 that can be encoded by the gene jerC.

13. The polypeptide of claim 9 that can encoded by the gene jerD.

14. The polypeptide of claim 9 that can be encoded by the gene jerE.

15. The polypeptide of claim 9 that can be encoded by the gene jerF.

16. A method of making an jerangolid or jerangolid analog, said method comprising expressing at least one recombinant gene of claim 1 in a host cell capable of producing polyketides.

Description:

CROSS REFERENCE TO RELATED APPLICATIONS

[0001]This application is a divisional of U.S. patent application Ser. No. 11/109,593, filed 18 Apr. 2005, now U.S. Pat. No. 7,285,405, issued 23 Oct. 2007, which claims benefit under 35 U.S.C. §119 to U.S. provisional application Ser. No. 60/563,843, filed 19 Apr. 2005, the entire contents of each prior application being incorporated herein by reference.

[0002]Polyketides are complex natural products that are produced by microorganisms such as fungi and mycelial bacteria. There are about 10,000 known polyketides, from which numerous pharmaceutical products in many therapeutic areas have been derived, including: adriamycin, epothilone, erythromycin, mevacor, rapamycin, tacrolimus, tetracycline, rapamycin, and many others. However, polyketides are made in very small amounts in microorganisms and are difficult to make or modify chemically. For this and other reasons, biosynthetic methods are preferred for production of therapeutically active polyketides. See PCT publication Nos. WO 93/13663; WO 95/08548; WO 96/40968; WO 97/02358; and WO 98/27203; U.S. Pat. Nos. 4,874,748; 5,063,155; 5,098,837; 5,149,639; 5,672,491; 5,712,146 and 6,410,301; Fu et al., 1994, Biochemistry 33:9321-26; McDaniel et al., 1993, Science 262: 1546-1550; Kao et al., 1994, Science, 265:509-12, and Rohr, 1995, Angew. Chem. Int. Ed. Engl. 34: 881-88, each of which is incorporated herein by reference.

[0003]Biosynthesis of polyketides may be accomplished by heterologous expression of Type I or modular polyketide synthase enzymes (PKSs). Type I PKSs are large multifunctional protein complexes, the protein components of which are encoded by multiple open reading frames (ORF) of PKS gene clusters. Each ORF of a Type I PKS gene cluster can encode one, two, or more modules of ketosynthase activity. Each module activates and incorporates a two-carbon (ketide) unit into the polyketide backbone. Each module also contains multiple ketide-modifying enzymatic activities, or domains. In classical Type I PKSs, the number and order of modules, and the types of ketide-modifying domains within each module, determine the structure of the resulting product. Recently, variants of Type I PKSs have been found in which single modules may be used in an iterative fashion to add more than one two-carbon unit to the growing polyketide chain (see, for example, Muller 2004). Polyketide synthesis may also involve the activity of nonribosomal peptide synthetases (NRPSs) to catalyze incorporation of an amino acid-derived building block into the polyketide, as well as post-synthesis modification, or tailoring enzymes. The modification enzymes modify the polyketide by oxidation or reduction, addition of carbohydrate groups or methyl groups, or other modifications.

[0004]In PKS polypeptides, the regions that encode enzymatic activities (domains) are separated by linker regions. These regions collectively can be considered to define boundaries of the various domains. Generally, this organization permits PKS domains of different or identical substrate specificities to be substituted (usually at the level of encoding DNA) from other PKSs by various available methodologies. Using this method, new polyketide synthases (which produce novel polyketides) can be produced. It will be recognized from the foregoing that genetic manipulation of PKS genes and heterologous expression of PKSs can be used for the efficient production of known polyketides, and for production of novel polyketides structurally related to, but distinct from, known polyketides (see references above, and Hutchinson, 1998, Curr. Opin. Microbiol. 1:319-29; Carreras and Santi, 1998, Curr. Opin. Biotech. 9:403-11; and U.S. Pat. Nos. 5,712,146 and 5,672,491, each of which is incorporated herein by reference).

[0005]One valuable class of polyketides includes the jerangolids and their analogs (FIG. 1), produced by various strains of the myxobacterium Sorangium cellulosum. Jerangolid A (1) as produced by Sorangium cellulosum strain So ce 307 was described by Gerth et al. "The Jerangolids: A Family of New Antifungal Compounds from Sorangium cellulosum (Myxobacteria); Production, Pysico-chemical and Biological Properties of Jerangolid A," J. Antibiotics 49: 71-75 (1996), along with four closely related analogs, jerangolids B, C, D, and E.

[0006]The jerangolids are anti-fungal agents showing partial structural resemblance with the ambruticins.

[0007]Given the promise of jerangolids in the treatment of fungal infections, there exists an unmet need for a production system that can provide large quantities of these polyketides. The present invention meets this need by providing the biosynthetic genes responsible for the production of jerangolids and providing for their expression in heterologous hosts.

SUMMARY OF THE INVENTION

[0008]The present invention provides recombinant nucleic acids encoding polyketide synthases and polyketide modification enzymes. The recombinant nucleic acids of the invention are useful in the production of polyketides, including but not limited to jerangolids and jerangolid analogs and derivatives in recombinant host cells. The biosynthesis of the jerangolids is performed by a modular polyketide synthase (PKS) together with polyketide modification enzymes. The jerangolid PKS is made up of several proteins, each having one or more modules. The modules have domains with specific synthetic functions.

[0009]The present invention also provides domains and modules of the jerangolid PKS and corresponding nucleic acid sequences encoding them and/or parts thereof. Such compounds are useful in the production of hybrid PKS enzymes and the recombinant genes that encode them.

[0010]The present invention also provides modifying genes of the jerangolid biosynthetic gene cluster, including but not limited to isolated and recombinant forms and forms incorporated into a vector or the chromosomal DNA of a host cell.

[0011]The present invention also provides recombinant host cells that contain the nucleic acids of the invention. In one embodiment, the host cell provided by the invention is a Streptomyces host cell that produces a jerangolid modification enzyme and/or a domain, module, or protein of the jerangolid PKS. Methods for the genetic manipulation of Streptomyces are described in Kieser et al, "Practical Streptomyces Genetics," The John Innes Foundation, Norwich (2000), which is incorporated herein by reference in its entirety. In other embodiments, the host cells provided by the invention are eubacterial cells such as Escherichia coli, yeast cells such as Saccharomyces cerevisiae, or myxobacterial cells such as Myxococcus xanthus.

[0012]Accordingly, there is provided a recombinant PKS wherein at least 10, 15, 20, or more consecutive amino acids in one or more domains of one or more modules thereof are derived from one or more domains of one or more modules of the jerangolid polyketide synthase. Preferably at least an entire domain of a module of the jerangolid synthase is included. Representative jerangolid PKS domains useful in this aspect of the invention include, for example, KR, DH, ER, AT, ACP and KS domains. In one embodiment of the invention, the PKS is assembled from polypeptides encoded by DNA molecules that comprise coding sequences for PKS domains, wherein at least one encoded domain corresponds to a domain of jerangolid PKS. In such DNA molecules, the coding sequences are operably linked to control sequences so that expression therefrom in host cells is effective. In this manner, jerangolid PKS coding sequences or modules and/or domains can be made to encode PKS to biosynthesize compounds having antibiotic or other useful bioactivity other than jerangolid.

[0013]These and other aspects of the present invention are described in more detail in the Detailed Description of the Invention, below.

BRIEF DESCRIPTION OF THE DRAWINGS

[0014]FIG. 1 shows the chemical structure of Jerangolid A

[0015]FIG. 2 shows the organization of the jerangolid biosynthetic cluster as deduced from SEQ ID NO:1. FIG. 2A shows the organization of the portion of the gene cluster upstream of the polyketide synthase genes. FIG. 2B shows the organization of the portion of the gene cluster containing the polyketide synthase genes. FIG. 2C shows the organization of the portion of the gene cluster downstream of the polyketide synthase genes.

DETAILED DESCRIPTION OF THE INVENTION

[0016]The present invention provides recombinant materials for the production of polyketides. In an aspect, the invention provides recombinant nucleic acids encoding at least one domain of a polyketide synthase required for jerangolid biosynthesis. Methods and host cells for using these genes to produce a polyketide in recombinant host cells are also provided.

[0017]The nucleotide sequences encoding jerangolid PKS domains, modules and polypeptides of the present invention were isolated from Sorangium cellulosum So ce 307 as described in Example 1. Given the valuable properties of jerangolid and its derivatives and analogs, means to produce useful quantities of these molecules in a highly pure form is of great potential value. The compounds produced may be used as antitumor agents or for other therapeutic uses, and/or intermediates for further enzymatic or chemical modification. The nucleotide sequences of the jerangolid biosynthetic gene cluster encoding domains, modules and polypeptides of jerangolid synthase, and modifying enzymes, and other polypeptides can be used, for example, to make both known and novel polyketides.

[0018]In one aspect of the invention, purified and isolated DNA molecules are provided that comprise one or more coding sequences for one or more domains or modules of jerangolid synthase. Examples of such encoded domains include jerangolid synthase KR, DH, ER, AT, ACP, and KS domains. Domains will herein be referred to according to the module in which they are found as "domain(module)"; for example, the module 1 AT domain will be referred to as "AT(1)." In one aspect, the invention provides DNA molecules in which sequences encoding one or more polypeptides of jerangolid synthase are operably linked to expression control sequences that are effective in suitable host cells to produce jerangolid, its analogs or derivatives, or novel polyketides.

[0019]The sequence of the beginning of the jerangolid PKS gene cluster was assembled from sequences deduced from the cosmid 10K10B3 (FIG. 2) and is shown as SEQ ID NO:1. This partial PKS gene cluster is found to comprise five open reading frames (ORFs), named jerA, jerB, jerC, jerD, and jerE. The jerA gene encodes the loading module of the jerangolid PKS, also referred to herein as "module 0," and comprises KS and AT domains. The KS(0) domain is apparently inactive as a ketosynthase, having the active site cysteine residue replaced with a serine, and is thought to act as a decarboxylase to prime the PKS with a propionate group derived from methylmalonate. The AT(0) domain comprises the signature amino acid sequences (GHSQ and YASH) of a methylmalonyl-specific AT domain. The jerB gene encodes modules 1 and 2 of the jerangolid PKS, the jerC gene encodes modules 3 and 4, the jerD gene encodes module 5, and the jerE gene encodes modules 6 and 7 along with a chain terminating thioesterase (TE) domain. Table 1 provides a description of the genes, modules, and domains of the five jerangolid PKS proteins. A further gene, jerF, encodes an O-methyltransferase thought to be involved in addition of the methyl group to O-3 of jerangolide.

TABLE-US-00001 TABLE 1 Genes, modules, and domains of the five proteins of the jerangolid PKS determined from the nucleotide sequence given in SEQ ID NO: 1. Gene Module Domain boundaries JerA 15751-18978 module 0 15859-18831 KS(0) 15859-17133 AT(0) 17461-18513 ACP(0) 18577-18831 JerB 19013-30074 module 1 19134-23507 KS(1) 19134-20408 AT(1) 20715-21767 KR(1) 22398-23219 ACP(1) 23250-23507 module 2 23559-29816 KS(2) 23559-24836 AT(2) 25167-26234 DH(2) 26268-26819 ER(2) 27822-28697 KR(2) 28707-29522 ACP(2) 29559-29816 JerC 30071-41035 module 3 30170-35440 KS(3) 30170-31447 AT(3) 31772-32824 DH(3) 32858-33409 KR(3) 34322-35161 ACP(3) 35183-35440 module 4 35507-40789 KS(4) 35507-36784 AT(4) 37115-38182 DH(4) 38216-38776 KR(4) 39695-40519 ACP(4) 40532-40789 JerD 41032-46674 module 5 41131-46416 KS(5) 41131-42408 AT(5) 42733-43800 DH(5) 43834-44430 KR(5) 45307-46125 ACP(5) 46159-46416 JerE 46671-55280 module 6 46773-51383 KS(6) 46773-48050 AT(6) 48381-49448 KR(6) 50295-50960 ACP(6) 51126-51383 module 7 51462-54443 KS(7) 51462-52742 AT(7) 53052-54098 ACP(7) 54189-54443 TE 54444-55280

[0020]In one aspect, the invention provides an isolated or recombinant DNA molecule comprising a nucleotide sequence that encodes at least one domain, alternatively at least one module, alternatively at least one polypeptide, involved in the biosynthesis of an jerangolid.

[0021]In one aspect, the invention provides an isolated or recombinant DNA molecule comprising a sequence identical or substantially similar to SEQ ID NO:1 or its complement. Hereinafter, each reference to a nucleic acid sequence is also intended to refer to and include the complementary sequence, unless otherwise stated or apparent from context. In an embodiment the subsequence comprises a sequence encoding a complete jerangolid PKS domain, module or polypeptide.

[0022]In one aspect, the present invention provides an isolated or recombinant DNA molecule comprising a nucleotide sequence that encodes an open reading frame, module or domain having an amino acid sequence identical or substantially similar to an ORF, module or domain encoded by SEQ ID NO: 1. Generally, a polypeptide, module or domain having a sequence substantially similar to a reference sequence has substantially the same activity as the reference protein, module or domain (e.g., when integrated into an appropriate PKS framework using methods known in the art). In certain embodiments, one or more activities of a substantially similar polypeptide, module or domain are modified or inactivated as described below.

[0023]In one aspect, the invention provides an isolated or recombinant DNA molecule comprising a nucleotide sequence that encodes at least one polypeptide, module or domain encoded by SEQ ID NO:1, e.g., a polypeptide, module or domain involved in the biosynthesis of an jerangolid, wherein said nucleotide sequence comprises at least 10, 20, 25, 30, 35, 40, 45, or 50 contiguous base pairs identical to a sequence of SEQ ID NO: 1. In one aspect, the invention provides an isolated or recombinant DNA molecule comprising a nucleotide sequence that encodes at least one polypeptide, module or domain encoded by SEQ ID NO:1, e.g., a polypeptide, module or domain involved in the biosynthesis of a jerangolid, wherein said polypeptide, module or domain comprises at least 10, 15, 20, 30, or 40 contiguous residues of a corresponding polypeptide, module or domain comprising a sequence of SEQ ID NO: 1.

[0024]It will be understood that SEQ ID NO: 1 was determined using the inserts of cosmids 307K-3F11, 307K-5G2, and 307K-2C8. Accordingly, the invention provides an isolated or recombinant DNA molecule comprising a sequence identical or substantially similar to an ORF encoding sequence of the insert of cosmids 307K-3F11, 307K-5G2, or 307K-2C8.

[0025]Those of skill will recognize that, due to the degeneracy of the genetic code, a large number of DNA sequences encode the amino acid sequences of the domains, modules, and proteins of the jerangolid PKS, the enzymes involved in jerangolid modification and other polypeptides encoded by the genes of the jerangolid biosynthetic gene cluster. The present invention contemplates all such DNAs. For example, it may be advantageous to optimize sequence to account for the codon preference of a host organism. The invention also contemplates naturally occurring genes encoding the jerangolid PKS that are polymorphic or other variants.

[0026]As used herein, the terms "substantial identity," "substantial sequence identity," or "substantial similarity" in the context of nucleic acids, refers to a measure of sequence similarity between two polynucleotides. Substantial sequence identity can be determined by hybridization under stringent conditions, by direct comparison, or other means. For example, two polynucleotides can be identified as having substantial sequence identity if they are capable of specifically hybridizing to each other under stringent hybridization conditions. Other degrees of sequence identity (e.g., less than "substantial") can be characterized by hybridization under different conditions of stringency. "Stringent hybridization conditions" refers to conditions in a range from about 5° C. to about 20° C. or 25° C. below the melting temperature (Tm) of the target sequence and a probe with exact or nearly exact complementarity to the target. As used herein, the melting temperature is the temperature at which a population of double-stranded nucleic acid molecules becomes half-dissociated into single strands. Methods for calculating the Tm of nucleic acids are well known in the art (see, e.g., Berger and Kimmel, 1987, Methods In Enzymology, Vol. 152: Guide To Molecular Cloning Techniques, San Diego: Academic Press, Inc. and Sambrook et al., 1989, Molecular Cloning: A Laboratory Manual, 2nd Ed., Vols. 1-3, Cold Spring Harbor Laboratory). Typically, stringent hybridization conditions for probes greater than 50 nucleotides are salt concentrations less than about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion at pH 7.0 to 8.3, and temperatures at least about 50° C., preferably at least about 60° C. As noted, stringent conditions may also be achieved with the addition of destabilizing agents such as formamide, in which case lower temperatures may be employed. Exemplary conditions include hybridization at 7% sodium dodecyl sulfate (SDS), 0.5 M NaPO4 pH 7.0, 1 mM EDTA at 65° C.; wash with 2×SSC, 1% SDS, at 50° C.

[0027]Alternatively, substantial sequence identity can be described as a percentage identity between two nucleotide or amino acid sequences. Two nucleic acid sequences are considered substantially identical when they are at least about 70% identical, or at least about 80% identical, or at least about 90% identical, or at least about 95% or 98% identical. Two amino acid sequences are considered substantially identical when they are at least about 60%, sequence identical, more often at least about 70%, at least about 80%, or at least about 90% sequence identity to the reference sequence. Percentage sequence (nucleotide or amino acid) identity is typically calculated using art known means to determine the optimal alignment between two sequences and comparing the two sequences. Optimal alignment of sequences may be conducted using the local homology algorithm of Smith and Waterman (1981) Adv. Appl. Math. 2: 482, by the homology alignment algorithm of Needleman and Wunsch (1970) J. Mol. Biol. 48: 443, by the search for similarity method of Pearson and Lipman (1988) Proc. Natl. Acad. Sci. U.S.A. 85: 2444, by the BLAST algorithm of Altschul (1990) J. Mol. Biol. 215: 403-410; and Shpaer (1996) Genomics 38:179-191, or by the Needleham et al. (1970) J. Mol. Biol. 48: 443-453; and Sankoff et al., 1983, Time Warps, String Edits, and Macromolecules, The Theory and Practice of Sequence Comparison, Chapter One, Addison-Wesley, Reading, Mass.; generally by computerized implementations of these algorithms (GAP, BESTFIT, FASTA, and TFASTA in the Wisconsin Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, Wis.; BLAST from the National Center for Biotechnology Information (http://www.ncbi.nlm.nih.gov/). In each case default parameters are used (for example the BLAST program uses as defaults a wordlength (W) of 11, the BLOSUM62 scoring matrix (see Henikoff (1992) Proc. Natl. Acad. Sci. USA 89: 10915-10919) alignments (B) of 50, expectation (E) of 10, M=5, N=-4, and a comparison of both strands).

[0028]The invention methods may be directed to the preparation of an individual polyketide. The polyketide may or may not be novel, but the method of preparation permits a more convenient or alternative method of preparing it. The resulting polyketides may be further modified to convert them to other useful compounds. Examples of chemical structures of that can be made using the materials and methods of the present invention include known analogs, such as those described in Kalesse & Christmann, 2002, "The Chemistry and Biology of the Jerangolid Family" Synthesis (8):981-1003 and the refereneces cited therein, and novel molecules produced by modified or chimeric PKSs comprising a portion of the jerangolid PKS sequence, molecules produced by the action of polyketide modifying enzymes from the jerangolid PKS cluster on products of other PKSs, molecules produced by the action on products of the jerangolid PKS of polyketide modifying enzymes from other PKSs, and the like. As noted, in one aspect the invention provides recombinant PKS wherein at least 10, 15, 20, or more consecutive amino acids in one or more domains of one or more modules thereof are derived from one or more domains of one or more modules of the jerangolid polyketide synthase. A polyketide synthase "derived from" a naturally occurring PKS contains the scaffolding encoded by all the portion employed of the naturally occurring synthase gene, contains at least two modules that are functional, and contains mutations, deletions, or replacements of one or more of the activities of these functional modules so that the nature of the resulting polyketide is altered. This definition applies both at the protein and genetic levels. Particular embodiments include those wherein a KS, AT, KR, DH, or ER has been deleted or replaced by a version of the activity from a different PKS or from another location within the same PKS, and derivatives where at least one noncondensation cycle enzymatic activity (KR, DH, or ER) has been deleted or wherein any of these activities has been added or mutated so as to change the ultimate polyketide synthesized. There are at least five degrees of freedom for constructing a polyketide synthase in terms of the polyketide that will be produced. See, U.S. Pat. No. 6,509,455 for a discussion.

[0029]As can be appreciated by those skilled in the art, polyketide biosynthesis can be manipulated to make a product other than the product of a naturally occurring PKS biosynthetic cluster. For example, AT domains can be altered or replaced to change specificity. The variable domains within a module can be deleted and or inactivated or replaced with other variable domains found in other modules of the same PKS or from another PKS. See e.g., Katz & McDaniel, Med Res Rev 19: 543-558 (1999) and WO 98/49315. Similarly, entire modules can be deleted and/or replaced with other modules from the same PKS or another PKS. See e.g., Gokhale et al., Science 284: 482 (1999) and WO 00/47724 each of which are incorporated herein by reference. Protein subunits of different PKSs also can be mixed and matched to make compounds having the desired backbone and modifications. For example, subunits of 1 and 2 (encoding modules 1-4) of the pikromycin PKS were combined with the DEBS3 subunit to make a hybrid PKS product (see Tang et al., Science, 287: 640 (2001), WO 00/26349 and WO 99/6159). Mutations can be introduced into PKS genes such that polypeptides with altered activity are encoded. Polypeptides with "altered activity" include those in which one or more domains are inactivated or deleted, or in which a mutation changes the substrate specificity of a domain, as well as other alterations in activity. Mutations can be made to the native sequences using conventional techniques. The substrates for mutation can be an entire cluster of genes or only one or two of them; the substrate for mutation may also be portions of one or more of these genes. Techniques for mutation include preparing synthetic oligonucleotides including the mutations and inserting the mutated sequence into the gene encoding a PKS subunit using restriction endonuclease digestion. (See, e.g., Kunkel, T. A. Proc Natl Acad Sci USA (1985) 82:448; Geisselsoder et al. BioTechniques (1987) 5:786.) Alternatively, the mutations can be effected using a mismatched primer (generally 10-20 nucleotides in length) that hybridizes to the native nucleotide sequence (generally cDNA corresponding to the RNA sequence), at a temperature below the melting temperature of the mismatched duplex. The primer can be made specific by keeping primer length and base composition within relatively narrow limits and by keeping the mutant base centrally located. (See Zoller and Smith, Methods in Enzymology (1983) 100:468). Primer extension is effected using DNA polymerase. The product of the extension reaction is cloned, and those clones containing the mutated DNA are selected. Selection can be accomplished using the mutant primer as a hybridization probe. The technique is also applicable for generating multiple point mutations. (See, e.g., Dalbie-McFarland et al. Proc Natl Acad Sci USA (1982) 79:6409). PCR mutagenesis can also be used for effecting the desired mutations. Random mutagenesis of selected portions of the nucleotide sequences encoding enzymatic activities can be accomplished by several different techniques known in the art, e.g., by inserting an oligonucleotide linker randomly into a plasmid.

[0030]In addition to providing mutated forms of regions encoding enzymatic activity, regions encoding corresponding activities from different PKS synthases or from different locations in the same PKS synthase can be recovered, for example, using PCR techniques with appropriate primers. By "corresponding" activity encoding regions is meant those regions encoding the same general type of activity--e.g., a ketoreductase activity in one location of a gene cluster would "correspond" to a ketoreductase-encoding activity in another location in the gene cluster or in a different gene cluster; similarly, a complete reductase cycle could be considered corresponding--e.g., KR/DH/ER could correspond to KR alone.

[0031]If replacement of a particular target region in a host polyketide synthase is to be made, this replacement can be conducted in vitro using suitable restriction enzymes or can be effected in vivo using recombinant techniques involving homologous sequences framing the replacement gene. One such system involving plasmids of differing temperature sensitivities is described in PCT application WO 96/40968. Another useful method for modifying a PKS gene (e.g., making domain substitutions or "swaps") is a RED/ET cloning procedure developed for constructing domain swaps or modifications in an expression plasmid without first introducing restriction sites. The method is related to ET cloning methods (see, Datansko & Wanner, 2000, Proc. Natl. Acad. Sci. U.S.A. 97, 6640-45; Muyrers et al, 2000, Genetic Engineering 22:77-98). The RED/ET cloning procedure is used to introduce a unique restriction site in the recipient plasmid at the location of the targeted domain. This restriction site is used to subsequently linearize the recipient plasmid in a subsequent ET cloning step to introduce the modification. This linearization step is necessary in the absence of a selectable marker, which cannot be used for domain substitutions. An advantage of using this method for PKS engineering is that restriction sites do not have to be introduced in the recipient plasmid in order to construct the swap, which makes it faster and more powerful because boundary junctions can be altered more easily.

[0032]In a further aspect, the invention provides methods for expressing chimeric or hybrid PKSs and products of such PKSs. For example, the invention provides (1) encoding DNA for a chimeric PKS that is substantially patterned on a non-jerangolid producing enzyme, but which includes one or more functional domains, modules or polypeptides of jerangolid PKS; and (2) encoding DNA for a chimeric PKS that is substantially patterned on the jerangolid PKS, but which includes one or more functional domains, modules, or polypeptides of another PKS or NRPS.

[0033]With respect to item (1) above, in one embodiment, the invention provides chimeric PKS enzymes in which the genes for a non-jerangolid PKS function as accepting genes, and one or more of the above-identified coding sequences for jerangolid domains or modules are inserted as replacements for one or more domains or modules of comparable function. Construction of chimeric molecules is most effectively achieved by construction of appropriate encoding polynucleotides. In making a chimeric molecule, it is not necessary to replace an entire domain or module accepting of the PKS with an entire domain or module of jerangolid PKS: subsequences of a PKS domain or module that correspond to a peptide subsequence in an accepting domain or module, or which otherwise provide useful function, may be used as replacements. Accordingly, appropriate encoding DNAs for construction of such chimeric PKS include those that encode at least 10, 15, 20 or more amino acids of a selected jerangolid domain or module.

[0034]Recombinant methods for manipulating modular PKS genes to make chimeric PKS enzymes are described in U.S. Pat. Nos. 5,672,491; 5,843,718; 5,830,750; and 5,712,146; and in PCT publication Nos. 98/49315 and 97/02358. A number of genetic engineering strategies have been used with DEBS to demonstrate that the structures of polyketides can be manipulated to produce novel natural products, primarily analogs of the erythromycins (see the patent publications referenced supra and Hutchinson, 1998, Curr Opin Microbiol. 1:319-329, and Baltz, 1998, Trends Microbiol. 6:76-83). In one embodiment, the components of the chimeric PKS are arranged onto polypeptides having interpolypeptide linkers that direct the assembly of the polypeptides into the functional PKS protein, such that it is not required that the PKS have the same arrangement of modules in the polypeptides as observed in natural PKSs. Suitable interpolypeptide linkers to join polypeptides and intrapolypeptide linkers to join modules within a polypeptide are described in PCT publication WO 00/47724.

[0035]A partial list of sources of PKS sequences for use in making chimeric molecules, for illustration and not limitation, includes Avermectin (U.S. Pat. No. 5,252,474; MacNeil et al., 1993, Industrial Microorganisms: Basic and Applied Molecular Genetics, Baltz, Hegeman, & Skatrud, eds. (ASM), pp. 245-256; MacNeil et al., 1992, Gene 115: 119-25); Candicidin (FRO008) (Hu et al., 1994, Mol. Microbiol. 14: 163-72); Epothilone (U.S. Pat. No. 6,303,342); Erythromycin (WO 93/13663; U.S. Pat. No. 5,824,513; Donadio et al., 1991, Science 252:675-79; Cortes et al., 1990, Nature 348:176-8); FK-506 (Motamedi et al., 1998, Eur. J. Biochem. 256:528-34; Motamedi et al., 1997, Eur. J. Biochem. 244:74-80); FK-520 (U.S. Pat. No. 6,503,737; see also Nielsen et al., 1991, Biochem. 30:5789-96); Lovastatin (U.S. Pat. No. 5,744,350); Nemadectin (MacNeil et al., 1993, supra); Niddamycin (Kakavas et al., 1997, J. Bacteriol. 179:7515-22); Oleandomycin (Swan et al., 1994, Mol. Gen. Genet. 242:358-62; U.S. Pat. No. 6,388,099; Olano et al., 1998, Mol. Gen. Genet. 259:299-308); Platenolide (EP Pat. App. 791,656); Rapamycin (Schwecke et al., 1995, Proc. Natl. Acad. Sci. USA 92:7839-43); Aparicio et al., 1996, Gene 169:9-16); Rifamycin (August et al., 1998, Chemistry & Biology, 5: 69-79); Soraphen (U.S. Pat. No. 5,716,849; Schupp et al., 1995, J. Bacteriology 177: 3673-79); Spiramycin (U.S. Pat. No. 5,098,837); Tylosin (EP 0 791,655; Kuhstoss et al., 1996, Gene 183:231-36; U.S. Pat. No. 5,876,991). Additional suitable PKS coding sequences remain to be discovered and characterized, but will be available to those of skill (e.g., by reference to GenBank).

[0036]The jerangolid PKS-encoding polynucleotides of the invention may also be used in the production of libraries of PKSs (i.e., modified and chimeric PKSs comprising at least a portion of the jerangolid PKS sequence. The invention provides libraries of polyketides by generating modifications in, or using a portion of, the jerangolid PKS so that the protein complexes produced by the cluster have altered activities in one or more respects, and thus produce polyketides other than the natural jerangolid product of the PKS. Novel polyketides may thus be prepared, or polyketides in general prepared more readily, using this method. By providing a large number of different genes or gene clusters derived from a naturally occurring PKS gene cluster, each of which has been modified in a different way from the native PKS cluster, an effectively combinatorial library of polyketides can be produced as a result of the multiple variations in these activities. Expression vectors containing nucleotide sequences encoding a variety of PKS systems for the production of different polyketides can be transformed into the appropriate host cells to construct a polyketide library. In one approach, a mixture of such vectors is transformed into the selected host cells and the resulting cells plated into individual colonies and selected for successful transformants. Each individual colony has the ability to produce a particular PKS synthase and ultimately a particular polyketide. A variety of strategies can be devised to obtain a multiplicity of colonies each containing a PKS gene cluster derived from the naturally occurring host gene cluster so that each colony in the library produces a different PKS and ultimately a different polyketide. The number of different polyketides that are produced by the library is typically at least four, more typically at least ten, and preferably at least 20, more preferably at least 50, reflecting similar numbers of different altered PKS gene clusters and PKS gene products. The number of members in the library is arbitrarily chosen; however, the degrees of freedom outlined above with respect to the variation of starter, extender units, stereochemistry, oxidation state, and chain length is quite large. The polyketide producing colonies can be identified and isolated using known techniques and the produced polyketides further characterized. The polyketides produced by these colonies can be used collectively in a panel to represent a library or may be assessed individually for activity.

[0037]Colonies in the library are induced to produce the relevant synthases and thus to produce the relevant polyketides to obtain a library of candidate polyketides. The polyketides secreted into the media can be screened for binding to desired targets, such as receptors, signaling proteins, and the like. The supernatants per se can be used for screening, or partial or complete purification of the polyketides can first be effected. Typically, such screening methods involve detecting the binding of each member of the library to receptor or other target ligand. Binding can be detected either directly or through a competition assay. Means to screen such libraries for binding are well known in the art. Alternatively, individual polyketide members of the library can be tested against a desired target. In this event, screens wherein the biological response of the target is measured can be included.

[0038]As noted above, the DNA compounds of the invention can be expressed in host cells for production of proteins and of known and novel compounds. Preferred hosts include fungal systems such as yeast and procaryotic hosts, but single cell cultures of, for example, mammalian cells could also be used. A variety of methods for heterologous expression of PKS genes and host cells suitable for expression of these genes and production of polyketides are described, for example, in U.S. Pat. Nos. 5,843,718 and 5,830,750; WO 01/31035, WO 01/27306, and WO 02/068613; and U.S. patent application Ser. Nos. 10/087,451 (published as US2002000087451); 60/355,211; and 60/396,513 (corresponding to published application 20020045220).

[0039]Appropriate host cells for the expression of the hybrid PKS genes include those organisms capable of producing the needed precursors, such as malonyl-CoA, methylmalonyl-CoA, ethylmalonyl-CoA, and methoxymalonyl-ACP, and having phosphopantotheinylation systems capable of activating the ACP domains of modular PKSs. See, for example, U.S. Pat. No. 6,579,695. However, as disclosed in U.S. Pat. No. 6,033,883, a wide variety of hosts can be used, even though some hosts natively do not contain the appropriate post-translational mechanisms to activate the acyl carrier proteins of the synthases. Also see WO 97/13845 and WO 98/27203. The host cell may natively produce none, some, or all of the required polyketide precursors, and may be genetically engineered so as to produce the required polyketide precursors. Such hosts can be modified with the appropriate recombinant enzymes to effect these modifications. Suitable host cells include Streptomyces, E. coli, yeast, and other procaryotic hosts which use control sequences compatible with Streptomyces spp. Examples of suitable hosts that either natively produce modular polyketides or have been engineered so as to produce modular polyketides include but are not limited to actinomyctes such as Streptomyces coelicolor, Streptomyces venezuelae, Streptomyces fradiae, Streptomyces ambofaciens, and Saccharopolyspora erythraea, eubacteria such as Escherichia coli, myxobacteria such as Myxococcus xanthus, and yeasts such as Saccharomyces cerevisiae.

In one embodiment, any native modular PKS genes in the host cell have been deleted to produce a "clean host," as described in U.S. Pat. No. 5,672,491, incorporated herein by reference.

[0040]In some embodiments, the host cell expresses, or is engineered to express, a polyketide "tailoring" or "modifying" enzyme. Once a PKS product is released, it is subject to post-PKS tailoring reactions. These reactions are important for biological activity and for the diversity seen among polyketides. Tailoring enzymes normally associated with polyketide biosynthesis include oxygenases, glycosyl- and methyl-transferases, acyltransferases, halogenases, cyclases, aminotransferases, and hydroxylases. In addition to biosynthetic accessory activities, secondary metabolite clusters often code for activities such as transport.

[0041]Tailoring enzymes for modification of a product of the jerangolid PKS, a non-jerangolid PKS, or a chimeric PKS, can be those normally associated with jerangolid biosynthesis or "heterologous" tailoring enzymes. Tailoring enzymes can be expressed in the organism in which they are naturally produced, or as recombinant proteins in heterologous hosts. In some cases, the structure produced by the heterologous or hybrid PKS may be modified with different efficiencies by post-PKS tailoring enzymes from different sources. In such cases, post-PKS tailoring enzymes can be recruited from other pathways to obtain the desired compound. For example, the tailoring enzymes of the jerangolid PKS gene cluster can be expressed heterologously to modify polyketides produced by non-jerangolid synthases or can be inactivated in the Jerangolid producer. Alternatively, the unmodified polyketide compounds can be produced in the recombinant host cell, and the desired modification (e.g., oxidation) steps carried out in vitro (e.g., using purified enzymes, isolated from native sources or recombinantly produced) or in vivo in a converting cell different from the host cell (e.g., by supplying the converting cell with the unmodified polyketide).

[0042]It will be apparent to one of skill in the art that a variety of recombinant vectors can be utilized in the practice of aspects of the invention. As used herein, "vector" refers to polynucleotide elements that are used to introduce recombinant nucleic acid into cells for either expression or replication. Selection and use of such vehicles is routine in the art. An "expression vector" includes vectors capable of expressing DNAs that are operatively linked with regulatory sequences, such as promoter regions. Thus, an expression vector refers to a recombinant DNA or RNA construct, such as a plasmid, a phage, recombinant virus or other vector that, upon introduction into an appropriate host cell, results in expression of the cloned DNA. Appropriate expression vectors are well known to those of skill in the art and include those that are replicable in eukaryotic cells and/or prokaryotic cells and those that remain episomal or those that integrate into the host cell genome.

[0043]The vectors used to perform the various operations to replace the enzymatic activity in the host PKS genes or to support mutations in these regions of the host PKS genes may be chosen to contain control sequences operably linked to the resulting coding sequences in a manner that expression of the coding sequences may be effected in an appropriate host. Suitable control sequences include those that function in eucaryotic and procaryotic host cells. If the cloning vectors employed to obtain PKS genes encoding derived PKS lack control sequences for expression operably linked to the encoding nucleotide sequences, the nucleotide sequences are inserted into appropriate expression vectors. This can be done individually, or using a pool of isolated encoding nucleotide sequences, which can be inserted into host vectors, the resulting vectors transformed or transfected into host cells, and the resulting cells plated out into individual colonies.

[0044]Suitable control sequences for single cell cultures of various types of organisms are well known in the art. Control systems for expression in yeast are widely available and are routinely used. Control elements include promoters, optionally containing operator sequences, and other elements depending on the nature of the host, such as ribosome binding sites. Particularly useful promoters for procaryotic hosts include those from PKS gene clusters that result in the production of polyketides as secondary metabolites, including those from Type I or aromatic (Type II) PKS gene clusters. Examples are act promoters, tcm promoters, spiramycin promoters, and the like. However, other bacterial promoters, such as those derived from sugar metabolizing enzymes, such as galactose, lactose (lac) and maltose, are also useful. Additional examples include promoters derived from biosynthetic enzymes such as for tryptophan (trp), the β-lactamase (bla), bacteriophage lambda PL, and T5. In addition, synthetic promoters, such as the tac promoter (U.S. Pat. No. 4,551,433), can be used.

[0045]As noted, particularly useful control sequences are those which themselves, or with suitable regulatory systems, activate expression during transition from growth to stationary phase in the vegetative mycelium. The system contained in the plasmid identified as pCK7, i.e., the actI/actIII promoter pair and the actII-ORF4 (an activator gene), is particularly preferred. Particularly preferred hosts are those that lack their own means for producing polyketides so that a cleaner result is obtained. Illustrative control sequences, vectors, and host cells of these types include the modified S. coelicolor CH999 and vectors described in PCT publication WO 96/40968 and similar strains of S. lividans. See U.S. Pat. Nos. 5,672,491; 5,830,750, 5,843,718; and 6,177,262, each of which is incorporated herein by reference.

[0046]Other regulatory sequences may also be desirable which allow for regulation of expression of the PKS sequences relative to the growth of the host cell. Regulatory sequences are known to those of skill in the art, and examples include those which cause the expression of a gene to be turned on or off in response to a chemical or physical stimulus, including the presence of a regulatory compound. Other types of regulatory elements may also be present in the vector, for example, enhancer sequences. Selectable markers can also be included in the recombinant expression vectors. A variety of markers are known which are useful in selecting for transformed cell lines and generally comprise a gene whose expression confers a selectable phenotype on transformed cells when the cells are grown in an appropriate selective medium. Such markers include, for example, genes that confer antibiotic resistance or sensitivity to the plasmid. Alternatively, several polyketides are naturally colored, and this characteristic provides a built-in marker for screening cells successfully transformed by the present constructs.

[0047]The various PKS nucleotide sequences, or a mixture of such sequences, can be cloned into one or more recombinant vectors as individual cassettes, with separate control elements or under the control of a single promoter. The PKS subunits or components can include flanking restriction sites to allow for the easy deletion and insertion of other PKS subunits so that hybrid or chimeric PKSs can be generated. The design of such restriction sites is known to those of skill in the art and can be accomplished using the techniques described above, such as site-directed mutagenesis and PCR. Methods for introducing the recombinant vectors of the present invention into suitable hosts are known to those of skill in the art and typically include the use of CaCl2 or other agents, such as divalent cations, lipofection, DMSO, protoplast transformation, conjugation, and electroporation.

[0048]Thus, the present invention provides recombinant DNA molecules and vectors comprising those recombinant DNA molecules that encode at least a portion of the jerangolid PKS and that, when transformed into a host cell and the host cell is cultured under conditions that lead to the expression of said jerangolid PKS enzymes, results in the production of polyketides including but not limited to jerangolid and/or analogs or derivatives thereof in useful quantities. The present invention also provides recombinant host cells comprising those recombinant vectors.

[0049]Suitable culture conditions for production of polyketides using the cells of the invention will vary according to the host cell and the nature of the polyketide being produced, but will be know to those of skill in the art. See, for example, the examples below and WO 98/27203 "Production of Polyketides in Bacteria and Yeast" and WO 01/83803 "Overproduction Hosts For Biosynthesis of Polyketides."

[0050]The polyketide product produced by host cells of the invention can be recovered (i.e., separated from the producing cells and at least partially purified) using routine techniques (e.g., extraction from broth followed by chromatography).

[0051]The compositions, cells and methods of the invention may be directed to the preparation of an individual polyketide or a number of polyketides. The polyketide may or may not be novel, but the method of preparation permits a more convenient or alternative method of preparing it.

[0052]The following Examples are intended to illustrate, but not limit, the scope of the invention.

EXAMPLE 1

Isolation of Jerangolid PKS Cosmids

[0053]Genomic DNA was isolated from Sorangium cellulosum Soce307, the producer of jerangolid using an established protocol (Jaoua, S., Neff, S., and Schupp, T. "Transfer of mobilizable plasmids to Sorangium cellulosum and evidence for their integration into the chromosome," 1992 Plasmid 28:157-165). The DNA was partially digested with Sau3AI using a serial dilution method and libraries were constructed in SuperKOS (a smaller derivative of SuperCos-1) using the protocol for SuperCos-1 from Stratagene. Colonies were picked, cosmid DNA was isolated on the Qiagen robot, and the DNA was submitted for end sequencing. The data was analyzed by BLAST and all PKS positive cosmids were prepared in larger amounts for further analysis.

[0054]End sequencing of cosmid and fosmid libraries of the Soce307 genome gave 13 cosmids with PKS sequence on at least one end. Five of these cosmid/fosmid end sequences were highly similar (>92% identity at the nucleotide level) to sequence from the ambruticin PKS, disclosed in co-pending U.S. application Ser. No. 60/551,103, filed 2 Mar. 2004 and incorporated herein by reference in its entirety, indicating they probably contain the jerangolid cluster.

[0055]All publications and patent documents cited herein are incorporated herein by reference as if each such publication or document was specifically and individually indicated to be incorporated herein by reference.

[0056]Although the present invention has been described in detail with reference to specific embodiments, those of skill in the art will recognize that modifications and improvements are within the scope and spirit of the invention. Citation of publications and patent documents is not intended as an admission that any such document is pertinent prior art, nor does it constitute any admission as to the contents or date of the same. The invention having now been described by way of written description, those of skill in the art will recognize that the invention can be practiced in a variety of embodiments and that the foregoing description are for purposes of illustration and not limitation of the following claims.

Sequence CWU 1

43167323DNASorangium cellulosum 1gatcgtcctg ggcgacacgc tggagcaggt ggcgacgcgg ctgctcgagg aggacctcgc 60ggcgtgccac acgaccggcg aggcggcgga cgtgctgctg aacggggtgc tcgcgtcgag 120cgcccgcgcc gtggccgcgg cgctgcgcgc gtgcgacgag ttcgccgcgg gcgacagcga 180tctgccgtcg ctggcccggg cgtgccgcgc gttcgcgggg ctcgcgtcgt tcgggtcgtc 240gcggtcgctg tcgtcgctcg gcgacggggt gatcgcgccg atgctggaga agacgttcgc 300gcgcgcggtc ctgcgcgtcc acgggggctg cacgggcagc gacgaggcgg tcgccgccgc 360caaggaggcg ctgcgcacgc tgcacgacgt ggcgctgtcg cagccgatcg tcgaccgcgg 420ggcgtggctc gacgcggcgc gggggctcgt ggacagcgag gtggtgaacc cgacggcgtc 480cggcctcgcg tgcgggctgc tctacctggc gcaggcgatc gacgacgccg aggtggcgcg 540ggtcgtcggc ctgcggctcg ggggcgcggc cgagcccgag gcggcggcgt cgttcctggc 600cgggttcctc gaggtgaacg cgctggtgct ggtgaagagc cggcccgtgg tcgaggcgct 660ggacgcgttc ctccgggcga tcgcgccgga gcgcttcaag gacacgctgc cggtccttcg 720gcgcgcgttc gctgggctcg gcgcgacgga gcggcggtac ctgctcgaga acgtgctcgc 780ggcgcggaag ctgggggaca aggcgcgcgc ggcgcaggcg gtgctcctgg agaaggaccg 840ggagaagctg aaggagatga gcgaggacct ctcacaggcg atggacgacc tggacgagtt 900gctctgacac gacccgtgag acgacccgtg agacgacgac ctggacgacc tggacgacct 960gaaccaggtt gcggacgacc tggacgaccc ggacgacccg aaccaggttg agttctaggt 1020tgttctaggt tcacggtaca gaacggttcc cgccggtcaa gcatagcccc ccgacaacac 1080ggcttctgtt ccgcttcttg cacgctcacc acggcatgac gcatgcgtac ccactctcgt 1140gatgccgcat ggaggatcta ccagttgctc ggcctcctga aggcacacgc tcgcctgaga 1200gcacgtccat gtcgtcgccc acccctcgat ccttctggcc atcccacgat cccgctcccg 1260caggcggctc tacaccacaa gcaccacgtc attcacgctg accccgtgaa acattcgcat 1320ccaccatgtc cccaccggga atacggcgct gcgtacaccg gcacgccagc gttcgagcgc 1380cgcgcggtac gtcgcgcgga acgcccgcac ggacgcggca gcggtttgcc acgctcctcc 1440ctgctcgcgg cccaccgcaa aggtaggatt gcggccaagc agcgcctcga agctcgtggc 1500acgctggtaa ggcgagacgt tgctggctcg ctcggcacca aagaagcgga gtccttgccg 1560ctccacctcg gcatgggcca gcgcctcctg tcgttcgagc tcggccaaga tctcgcggcg 1620gaatgcctcc gcggcgtcct gctcgatgat aggcggtagt gcgagcggca gcgttgcttc 1680ctggggccac tgtggatttt ccgggtcgag gtatgccgag ggacgagccg cgcgcagcat 1740ggcgtgaccg atctcaccca catctacctt gacaccgggc cattctctcg aattacggac 1800gagaccggca gcaacagggt ttgcaaggac ataggcaatc ttttccacaa ccgccgcgcg 1860agtcagcagc cgtaccacgc tcgtggcttc atggtcccat accgggccct cccacttgcg 1920cagcaccttc gtgccgagcg cgacgattcg atggaagaac tgcaagaagc gcggcaggac 1980gccccgcaca tcggtcacga cgaggtgcag atgcgtcgac atcgcgcaaa gggcatgaac 2040ctcgacgccg tagcggtggg ccgcgacagc gagggcatag accaagaact ggttcatggc 2100cgcatcgggg cgaaacagga aatgtcggcg cagaacgcga cgggtgatca ggtacgtggc 2160gccgggagcg atctcgcgag gctggctcat ggccggatca catagacaaa cgctgcgcca 2220tcaagcgatg tgcatcattt caatgactta ccccgccgcg gggggtggcg gccgccgatt 2280tccgccaatt tccgccacgc tgggccgccg cttgtcacgc gaccagctgg tgtcggaccg 2340cctcggatta cccccgccgg tagacctccg ccgatagacc tcaacccggg tcgggtcggc 2400cccccagagt tgggtcgggt cgggtcgaca acctgggtcg gccaacctgg gtcgggttca 2460ccaaccggcc aacctgggtc aggttgcggc accaaccggc caacctgggt cgggtcgcgg 2520cggggttgcg gcaggctgca ccccaaccgg ccaacctggg tcgggtcgcg ccgggctcac 2580tcctcctggc cggcccgcga ccagccgccg agcttgacgc ccggcgtctg cgccaggaag 2640gcctgcaccg ccggcggcgg cgacgcccac gccgcgaggc gccccgggct ccgcgggatc 2700ggcgtcgtcg ccgagcgcga gtaggagctc cacgcgccgt cgctgcaggc atagcaggtc 2760gagcgcgcga gctgcgcggt gagccggagc gacccccgcg cgtcgtaggc cggcgcgatc 2820ttcacgagct gcggggggtc cttctctcgc cgcgcggcct ccggatcctc ctcgatgtcg 2880tcgagcgtcg cctcggccgc gcgctgcagc gacggcacgc cggacacctc ggcgagcagg 2940tcgaccgccg cgccgcgctc ggcgtcccac acagtgaacg cggccccctc ggagccgtgc 3000gcgccgcaga gatcgctgta cgtgcgctcc tcgatgaaca ggtacggccc cacgctgccg 3060atcagcgcgg cgctgtgctg gaacacgttg ctctcgtcct gctccggcac cgtgatcacc 3120gcctggcgct cgccgtcccc gcgcagcacg agcgccgcgt cggtcgccgt gccctcgccc 3180ggctcgcgcg tgtccccgtc ccagagctcg cacggcgtcg tctcgacctc cttcgcgcgc 3240gcctcccagc gccactcccc gcgccgggtc gccacgacga tcccgggctc ctcgcggatc 3300accgcgccgc cctcggcgac gtgccacgtc ctcggcgtcc cctcgcccgt gctgccccag 3360acgaggacca ccccgtcggc agcaacgttc ccggccgact cggcctcggg cgggcgcggg 3420gcgggcgcga gctggatgtc ggtggaggcg gccggcgcag gcccgggccg cgcggcacag 3480ccggagatca gcgccagcga gacgatcaac gcagcaggtc gggagcgcag catgcggagc 3540cgaaagagca tggtgtgtgc cgcggcccac ggccagggaa gcccgcgccc cgcgcgccgg 3600aggtgtagcc gcgcgcccac gccgcgtggc acgagcgccc cgtcaccggc gtcgggaccc 3660gacgatcgag ctccgacgcg ggcagcggca cgaacggcgc gtcgcccgcg tcggcgagcg 3720cgccgtcctc aggtgccgga acgcgagcgc gcccggcgca ggagcgcccg ctgcgcggcg 3780ttcagccgga tcccgggcct gcgctgccgg atccgggcct ccatcgcgtc gacgtcggcg 3840gcgagctcgc gctccagcag cagcgcggcc gcgagggtcg ccgatcggcc gcggcccgag 3900gcgcagtgca ggtacatgcc gtccacgccc cgcagccgct cgaggagctc caggagccgc 3960tccacctcgg gccccgtgcc gtcgagcgtc ggcacgcaga ggtagccggg gtggcgccgc 4020accgcggcgg ccgccggaaa ctcggccgtc atgtccacga cgagccgcac gcccgccggc 4080agctcgtggg cgagcggccg ccggccgacc cagagcccgg gcgccacctc gttggcgcag 4140tccgcccgcc cgagcgcccg ctccgcgcgc cataccgccc aggtcaggag gaagtacgga 4200ccgagcagga cgagcgccca cgcggcctgc gtgccgtctg gccgcttgcc cagcagcgcg 4260ggccggcgcg cgaggtacgc ggcgccgacc aggccgaagc tcagcgccgg ccagagcagg 4320gcgagcgcag ccccgccggc caggacggcg agcgcggcga gcgaggcgct caggacgagg 4380aaggtcaggc cgtatcgcat ggatcggaga ggcgtctgct cggccatgct ctcacatcgg 4440gcgctcctcg atgaagtcga gcagccgctc gctcgcgtcg aacgcctcgc tcttccggcc 4500gaacaccggc ggcagatccc ctccgcccgg ccacgtgcgc ccgcccccca cgacagcgca 4560gcgctcgacc tcggcctcgt cggcgcagcg cgaccgcgcc gtgcacgtcg tgtcgccgtg 4620cgcgtacgtg atgtgggaga cgtcagcgca gccgtcgcga tcgcgccaga tcggagagag 4680gacagtgcga tgggccggac atcatcacag ctctctcttt cgggatctcg agggccatcc 4740ggaggtgcgc tgccggcgcg gacagcgtct cacacgcccg gcaaagctgg cctcgagtcc 4800accgcgaccg ccctctggat ggcgtcgcgg agccacccgt ggccctcgtc gtgctcggag 4860cgctccggcc agacgagcgt cagcgtgtag ccctcgagcg cgaacgggca cggccgcacc 4920acgagatcga gcctccgggc cagggccgcg gcgacgcgcg cggacacggt gagcagcagg 4980tcggaaccgg agacgatgaa cggcgcgaca aggaaatggg acacggtcag cgtcacccgc 5040cggcgtgttc cctgctccgc cagcgcccga tcgatggcgc cgtggtcctc tccgtgcggc 5100gagaccatca ggtgctcgca agcagcgtag cgcgccgcgg tgagcggcct ccgggacgcc 5160gggtgtccgc ggcgcatcac acagacgatc tcctcggccg cgagcagcgt ggagcgacag 5220ccgtcgggca ccggtccgcc gcgcccgagc ttgccgtcga gctcgccgcg gcgcaggagc 5280tcggcgaagt cggccgggat gttccggcag cgcaggttga cgcgcggcgc ctcgacggcg 5340agcagcgcgg tcagcgccgg gagcacgagc agctcgaggt tgtcggtcgc gacaagccgg 5400aacgtgcgct gcgaccgccg cgggtcgaac cgctcgaccg ggcggaagac ctgctcgagc 5460cgctcgacgg cctcggccgc ccgcggggcc aggtcccgcg cccgctcgct cagcgtcatc 5520tgcctgccga cctggatgag cagcgggtcc gcgaaatggg cgcgcagccg cgcgagcgcg 5580tggctcatcg agggctgcgt cacgcccacg cggcgcgcgg cgcgcgtgac gctcttctcc 5640tggagcaggg cgtgcaacgc cacgacgagg tgggtgtcga ccgactgcag ccgcatggtc 5700gatggatacc acgtcgatcc atcgacggcg tctatggatc gccgcgccga ctgccgattc 5760gacgcccggg gccgtgggtg cctatctctc ctctccggac ggcgcatgcc gccgcgcggc 5820gcgcgcctac cccccagccg aggagagcaa ccccatgatc atcgagtacg ttcgctacac 5880gatccccgcg gagcaagaga aggagttcct ggccgcctac cgcgacgccg ccgcggagct 5940gcgcgggtcg gagcattgcc tcgactacga gatctcccgc tgcgtcgagg atccgacgag 6000ctacgtcgtc cgcatctgct gggactcgct gcaaggccat ctccagggct tccgcaaggc 6060ggcggcgttc ccgtcgttct tcgccaaggt gaagccgttc tacgagcgta tccaggagat 6120gaggcactac gccttgaccg acgtcgccgc gcggcaggcg gggacggccg cgacgggctg 6180aagggtagac cctgcggccc tccgaacgtc gaggccgcct gcgccggcct cggctgctcc 6240ccgccagcct gtccgcgcct cacatcgagc cccttgcagg cccagcgcgc ccggtgaggt 6300gcggagtgac gccgcgatcc cggaaagccg ctggggagac cgcgcgggga aagcgatgcg 6360ccgcttccgc cgcggtgcgg gcgggtgcag gatgcggcca tgggaatgcc tccggcgctc 6420gaccgagacc accgccgccg cgccccgcgc gcgcccgccg ccgcgctcat cgcgctcctc 6480gcagccggcg ccgcgctcgc ggcctgctcc aggagcacag gcgggccgaa gcaccgcgag 6540gcggcgccgg agcgcgacag cgcctgcacc gatccagcga agcccagggc gtacttctat 6600cctgcggaga accggacgga ctacgcgcct gacgatccct ggaaggacgg ctgcgccatg 6660ctggtgccgg atcacctgtt ctgctgtccg gagaaggcct ccaccggctc gccctgatcc 6720gcgccgcccc gccccgccgc gcgcgcacat gccgctcgtc cggagcgcag ccgccccgcg 6780cgcgagcgcc acacaggccg caaacgtccc acacgctgcg cctgcaggcc gagcgcaggg 6840cgccctgcgg agcgccgcgc gcccacctcg ggcgccgtcg cgcggcgacc gacgcggccg 6900tcgcgcggcg atcgacgcgc gggcagcgcg cttcacggcg cgcgtgggga taccctggcc 6960tggccgtgga tctgttgagc tacgccgggg cgaacctgca ggaccgcggg ccgagctcgc 7020tgcgcgttcg cttccccgca gcctgaagcg ggcgagcgcg gcgccgcggc gggacggccg 7080acacgggtgc cgcacaacgc ggcatgtcgc attctgcggc ggcgtcgagc ggatggctgg 7140acgcgcgcac ctgcgcgcgc cacctgcgct aggacgccgg acatgaagct cgcgcgcaag 7200ctgacgctcg ccctcgtgtt cggggtattc ctcgtgctcg cgctgagcgc ctacgcccag 7260atccgcagag aggccaggat cttcgagaac gacgtccagc gcgaccatca cacgatggcc 7320cgcgcgctcg cggccgcggt catggaggtg tggcgctccg agggaaccgc gcgggcgctg 7380cgcctcgtgg aggacgccaa cgagcgggaa cagcaggcga acatccgctg ggtctggctc 7440gatggccagg ccgacgagcc ccatcgcccc cggctggcgc cggagctgct cgcccccgtc 7500gccgaggggc gcgcggtcgt gcgccggatc ccccagaaag acgcggatct gctcgtgacc 7560tgcgtgccgg tgtccgtgcc cggcgaccgc gccggcgcgc tcgagctctc cgagtcgctc 7620gcgggcgcgc gccggtacat ccggagcatg atcctgagca cggcgatcac cacagccgcg 7680ctgacgctgg tatgtgggtt gcttacaacg ggcctcggag tctggctggt gggacgcccc 7740atgcgcacgt tgatcgacca ggcgcggcgg atcggcgccg gcgatctctc cgggcggctg 7800tcgctgcgcc aggaagacga gatcggcgag ctcgggcgcg agatgaacgc catgtgcgat 7860cgcctcgccg cggcgaacca gaagctcgag tccgaggccg ccgcgcggat cgccgcgctc 7920cagcagctcc gtcacgccga gcggctcgcg accgtcggca agctcgcgtc cggcatcgcg 7980cacgagctgg gcgcgcccct ccaggtcgtc acggggcgcg cgcggatgct cgtcgacggc 8040gacgtgtcgg gcgatgaggt gccgatcaat ggacagatca tcctcgagca gtcgcagcgg 8100atgacccaga tcatccgcca gctgctcgac ttcgcccggc gccgcagcgc cgagaagcag 8160gagaccgcgc tccgcggcgt catccgcggc acgttcacga tgctgaagcc gctggcggac 8220aagcagggtg tcacgatcgt cgaggaggga gacacgccgg atcgggtggt ccacgccgac 8280gccgaccagc tccagcaggc gctcacgaac gtcgtcgtca acgcgatcca ggccatgccg 8340tccggcggca cgatcacggt gggcgtccgg accgtccgcg ccagcccccc gcccgaccag 8400ggaggggccg agggcgacta catcgcgctg tcggtgcgcg acgagggaca gggcatgacg 8460gccgacgtcc tcgagcacgt cttcgagccg ttcttcacga ccaagcccgt cggcgagggg 8520accgggctcg gcctgccggt cgcctacggc atcatcaagg agcacggcgg ctggatcgac 8580gtcgacagcc gccccggctc cgggagccag ttcacgatgt acctgccgca ggagaagcca 8640tgaccggacg cgtcctgatc gtcgacgatg agcgaggcgt ctgcgagctc ctcgacgccg 8700ggctgaagaa gcggggattc caggcggcgt ggcgcacgtc ggccgccgag gcgctcgagc 8760tcctcggcgc ggaggacttc gacgtcgtcg tcaccgacat gaccatgcgc ggcatgaacg 8820gcctcgagct ctgcgagcgc atcgcccaga accggcccga tctgccggtc atcgtcatca 8880ccgcgttcgg gagcctcgac accgccacgt cggcgatccg cgccggcgcc tacgacttcg 8940tgaccaagcc gttcgagctc gacgcgctcc ggctcaccgt cgagcgcgcc ctgcgccacc 9000gcgccctccg cgaggaggtg cgccggctgc ggcgcgccgt ggacgactcc caccgttacg 9060agcagatcct cggcggcagc ccggcgatga agggcgtctt cgatctgctc gaccgggtcg 9120ccgactcgga cacctcgatc ctcatcaccg gcgagagcgg caccggcaag gagctcgtcg 9180cgcgcgccgt gcaccagcgc agccggcgcg gccagggcgc gttcatcgcg gtgaactgcg 9240cggcggtccc ggacgccctg ctcgagaccg agctgttcgg ccacgcgcgg ggcgccttca 9300ccgacgccaa gggggcgagg agcggcctgt tcgcgcgggc ccacggcggc accctgttcc 9360tcgacgagat cggcgagctg ccggtcgggc tccagccgaa gctcctgcgc gccctccagg 9420agcgcgtcgt ccggcccgtc ggcgcggacg aggaggtccc cgtggacgtg cggctcatcg 9480cggcgaccaa ccgcgacctg gagaccgcga tcgaggagcg ccgcttccgc gaggacctct 9540attaccggat caacgtggtc cacgtcgatc tgccgccgct ccgctcccgc ggcgccgacg 9600tgctgctgct cgcgcagcgc ttcctcgagc acttcgcgac cgtcaaggag cggcccatca 9660agggcctctc ggcgcccgcg gccgagaagc tcgtcgccta cgcgtggccc ggcaacgtcc 9720gcgagctcca gaactgcatc gagcgggccg tcgcgctcgc gcggtacgat cagatcacgg 9780tcgacgatct ccccgagaag atacggagtt accggcgctc ccacgtcctt gtctcgagcg 9840acgacccgac cgagctcgtc cccatggagg aggtcgagcg gcgctacatc ctgcgcgtcc 9900tggaggtggt cggcggaaac aagagccagg cagcccaggt cctgggcttc gatcgagcga 9960ccctgtaccg gaagctcgag cggtacggcc tgcgcgccgg gcgcgcgggc gacccgaggc 10020cgtgatccgc ccggcgccgc gccggaggtg attccaggag cgcctcgcgg cggcggcgcc 10080gctcgtcctc acgctgttgc agaacgcgac actcgcggcg tcgcgcgatc ggcagcgccg 10140cgcgcgggcg cgcgcgatct gcgctggtgg catgagagct gcctgaaagc agggcgcgaa 10200catgagccac acaacggcgg cgtctgctgc tccggaccgg agagtgccag tcgactggat 10260cgcgctcgcg aacgcgttcg acaacatcgc tcgaggcgtg cgccatttcc ttcacctcga 10320cacgggcgcc gtgctccggc tgaacgagcg gctcgtcgat cccgccacgc gcgcgcgcat 10380cgaggaagac ccggggtgcg tgctcatcga ggccatcgcc gctcgggatc agtatcgatg 10440gctggaggcg ttcattccca cggtggaaga cctggagttc cggctggcgc tcctgcacag 10500catgcagggc ccaggatcgt tccggcggtt caaggccgcg ctctcggcca ggccggagca 10560gctccgccgc tggcgcgcct tccggcagga gcagatccgg gtcgcgatcc tccggtggtt 10620tcacgcccgc ggactgacgc ccgtcgcgct cgagctctcc tcgccggacg cgagctccga 10680gccgccgagc tcccgcgccg tcgactcggc tcgccagcag ctttactccg cggcggacgg 10740cctctgtccg agcgacctcc aggcgctgac gtcgctcgcg gagtacctgc gcgcggcgcg 10800ctcggcgctg cggatccccg ccgattcatc catgggagac gccgcgcggg ccgtccgcct 10860ggtcccttga cgacgagcgc ggcagctcga tccggaagac cgagccggcg ccgggacggc 10920tctcgacgaa gaggcggccg ccgtgcgcct cgacgatgcg cttcgccacc gcgaggccga 10980ggcccgtgcc cgggatggat ccagacgtgg acttgagccg ccggaacggc tcgaagaggt 11040gcgccagatc ctcgggctcg atcccgagcc cgcgatcgcg aacggcgatc tcggccccct 11100cgccgccggc gcggaccgcc acgtcgacct gccccccggc gggggagtac ttgagcgcgt 11160tcgacaggag gttgttcagc acctgctcga tccgggtcgc gtcgcagcgg acgagcaccg 11220gtgtctcggg gagcgagagc tcgatggggt gctccggcga gacagggcga tagaggtcca 11280ccgcctcctg cgcgagatcg cgcaggtcgc gctcctccac ccggagatcg agcttgcagg 11340cctcgatctg cgacgcgtcg aggaggtccc cgaccatgcg atcgagccgg tcgacctgcc 11400gcccgacgag cgccatggtc cggcgcacgc tcgactccag gggccggttg tcggcgtcga 11460ggacgtgcac ggacagccgg agcgccgaca gcgggttcct gaggtcgtgg gccacgccgc 11520cgaggaacgc gaactgcgcc tcgcgctggc gctccagcga ctctgccatg tcgttgaagg 11580cgcgcgcgat ctccccgagc tcgcgcggcc cgatcagcgg cgcgcgcgcg gcgcggtcgc 11640ccgcgccgta gcgcccgatg gcctcctgga tcgcgacgat ggggcggtag atgagccgcc 11700gcgcgctgag gaggatcgtg gacgcgcccg cgaggaagaa caccaccgcc gcgaggccgg 11760cgccggtcgt gcgccgggtc aggtgcgcga cgagcgcctc cgacgcgcgg gcctgctcga 11820ggttgatctc gaccaggtga tcgagcgccc tgaacgcctc gtcgagcgcg gggtcgtgca 11880cgccgagcag cgcgggatcg tgcgcgccag gcgccgacgg gagctcgtgg gcgtcggcgg 11940cgcggcgccg ggcgaggtag tcctccacgc gccgctccgc gtgctcgagg atcctgccct 12000cctccgggct gctcacgtgg tcgcgcgccg ccgcgaggcc gctcctcagg ccttgctccc 12060acgccgccag ggagggggcc agctccccgc ggccggagcc gaccgcgcgg ctgctctggt 12120gcgcgtcgag caggaggtcg atctccagcc tctccacgag ccggacgctc tcgaccgtgg 12180cgccgaggat ccgggtggtc tgttgcatgg tcgtcgacgc gaccatcagc gcgcccgcga 12240caacgatggc cacgctcgtg agaagaagcg tggcggcccc gaggagcgcg ctcaggcgca 12300cgggccgcgg aagacggggc cagctcaggc cctgcggagt tggctgtcgc atcttcctgc 12360gttggttcgg atccgcgacg gatgcaacgt cgcctcgatg gagagattga actgcagagg 12420cacagagcac atcgaggcag ggagcgataa gcgcgctgcg cccgcggcgc tccccgcccc 12480tccgcgccgg cctcaccggc ctctcgcgcc tcgatcagct cggtcctctg gacggtgatc 12540cccgtttcct cgacactgcg cgagatgccc gcccgcaccc cccgcaagcc cccgccgccc 12600gcctcgcccg ctggtcccgc cggcgcgccg gacgacctca ccgacagcga tcgcgacgcg 12660ctgctgcgct ggcggctcgc gctcgggccc gaggccgagc gggtcgaccc gcgcctctcc 12720ctcggcgggc tcgggggcgc ggcgcccgcg ctcgacgtcg acgcgcggcg gctcggcgac 12780ctcgacaagg cgctctcgtt catctacgac gagcgcgccg gcggcctcgg cggctcgcgg 12840ccctacgtgc ccgagtggct ctccgccgtg cgcgagttct tcagccacga ggtcgtcgcc 12900ctcgtccaga aggacgccat cgagcgaaag gggctgacgc agctcctctt cgagcccgag 12960acgctgccgt tcctcgagaa gaacgtcgag ctcgtcgcca cgctcatgag cgccaagggc 13020ctcatcccgg acgccgcgcg ggacaccgcc cggcagatcg tgcgcgaggt cgtcgaggag 13080gtgcggcgcg cgctcgaggc cgaggtccgc accgccgtcc tcggcgcgct gcgccggaac 13140acgacgagcc cgctgcgcgt cctcaggaac ctcgactgga agcgcaccat ccgcaagaac 13200ctgaaggggt gggacgcgga gcggcgccgc ctcgtccccg acaagctcta tttctgggcg 13260aaccagacgc gacggcacga gtgggacgtc gccatcctcg tcgaccagtc gggctcgatg 13320ggcgagagcg tcgtctacag ctccatcatg gccgcgatct tcgcgtcgct cgacgtcctc 13380cgcacccggc tcctcttctt cgacaccgag gtcgtcgacg tgactccgat gctcgtcgat 13440ccggtcgacg tgctgttcac ggcgcagctc ggcggcggca ccgacatcaa ccgcgccgtg 13500gcctacgccc aggcgaactt catcgagcgg cccgagaaga cgctgctcat cctgatcacc 13560gacctgttcg agggcggcaa cgccgaggag ctcgtcgcgc gcatgcgcca gctcgccgac 13620agcaaggtga agtcgatctg cctgctcgcg ctgtcggacg gcggaaagcc ctcgtacgac 13680cacgagatgg cgcagaagct cgccgcgctc gggaccccgt gcttcggctg cacgccgaag 13740ctcctcgtca aggtggtgga gcggctcatg cgaggtcagg acctcggccc gctgctcggc 13800gccgaggcgc ggtgagcgcc ccgcgcggcg cgggatcacg gaagcacaga ggacgcagag 13860gcacttgtct ctcctctgcg tcctctgcgc ctccatggcc gcccgtcagg ggccccgaaa 13920ccgactggcg cggctcgcga acctcgtcga cgtcagcgaa ggcgcgccct ggacatccgc 13980ggccgcgcag gccgcgtcag cgcgcgcgac ggatcggctt ggcggcgcgc tcgtccgccc 14040gccgggcggc ggcgcgcttt cgcgacgtgg cgccgtgggc agcgctcgcg gagacgcgac 14100ggcgggcgcc ggccgcgcga accaccgctt cgagcgaggg tgactggccc acgagaggac 14160cagtgctgat cgaggggccg actaggctga tagaaagttt cacttgaact accgatgtgg 14220tggcggaccg atcacgtcgc tcagcggagg gctcgtcgac ctataaactg ttttgatcgt 14280ttgcgcagcg tcacgatgcg gagatcacga cccctgagcg cccgtccgga cgtgaacttg 14340tccccccggg ggatccacac gccttccgcc tctcacgacg gacgtacgca cacaccacgg 14400aggcacgaag gcacggttgt gggttcgctc cgtgccttcg tgtctccgcg gtgcttggcg 14460agggactgcc cccggaggtt gcaccgggcg ctctgtcacg agctggttgc acgatgcagg 14520ccaacgatgg ccggatgccg gcgtcgcccg ttttccgggg atggccatgg tccgcctttc 14580atggttgaaa ccactggttg caaccatggc ggaacggagc ggcgtcgctg cccccgcggc 14640gcctggcgcc ccggggagag cgcctctcgg cccgcaggac cggtcagcgg ggatccaggg 14700cgctggccca gcggccccga cgatccagcc gcgcggggcg ggagcggcag cgtggatcac 14760tgcggacttg ccgtcgatcg aggtccgcat ctggatcggc tcgccgcgac cgtatccgat 14820ggcgttcatg agcgggctgg cgagcaccgg gtcgacgcgt aggcggcccc accgaccgat 14880cgccgaggtc gagcgcctgg gcgacgttcc ccggcgagca cgcgacgcgc aggccgggac 14940gcccgctcgt cagcaccccg ccgccgctgc ccacgccgag acgagctcct gcaggcgatc 15000tcgggttcac cctcgcgcga ggacctgtgc

tccgcggtga gcccgataga cgctgcccgg 15060gcctcccccc tactccgtac ctgcccgaca aaggaccagc ccacgcgcgc tgtcattcgg 15120ttgagcaccc gccttctgtt cgcagggcgc gccttgaaga gtcggacagg tcgccttccg 15180gaaaggcagt ggcctggtat ccgccatgtt tccggtgtgc ttcgctgcta tccggtggcc 15240tggtgtccac catgtttccg gtgtgcttcg ctgctatccg gtggcctggt gtccgccatg 15300tttccggtgt gcttcgctgc tatccggtgg cctggtgtcc gccatgtttc cggtgtgcct 15360ctctgctcga gccacgggcc acctctaccc gagcaactcg acctgatgca atgtagttga 15420gcccgcctgg ctggcagcgg tgccatcccc gtcctgcctc tgacagcagc ggatcgcaga 15480cccgcctgcg atgccggtag cgggacatcg gcacagatga ctgttcaccg tgcgggcagt 15540gttcctggct ggaagaataa tcccgtatca attcaataag atgccctggc ggcgccaagc 15600tcaccacagc ctactcggcg caaccactca gccctcacga caactatgta atttttctca 15660caacatgagc acttgattga aagattggaa aagtgaacga cgaaaggttg cgtagattac 15720cgtaggtgct agcctggcgc gcactttcct atgcccgaca cgtcgtcgtc gagccccgta 15780atggcgatgg ggctatcgga ctcgaaagcc cggtccgtgg aggatgcacg gcctgcctcg 15840gggcttcctc gtccacccgc gggcatcgct gtggtgggaa tgggatgtcg cttccccggc 15900ggcatcgatt cgcccggatc cttgtgggcg gccctatctc aagggcgcga ccttatcagc 15960gaggtcccgc cggaccggtg ggatgtcaat gcccactacg acgccgacgc aagcgtcccc 16020gggaagattg cgacccgcca tggcggcttc ctcgccgggg tcgcggcgtt cgacgcgcct 16080ttcttcgacc tctcgccgcg cgaagcgaag catatggatc cgcagcagcg cctcggcctc 16140gagacggcgt gggaggcgct ggaggacgca ggcctggacg cgaggagctt gcggggcagc 16200cgggcagggg tgttcgtcgg ctcgatgtgg gcggagtacg acgtgctcgc gtcgcgacat 16260cccgaatcca tctcgccgca cggggccacg gggagcgacc cggggatgat cgctgcgcgc 16320atcgcctaca ccttcggcct tcgtgggccg gccttgtcgg tgaatacggc gtcgtcgtcc 16380tccctcgtgg cggtgcatct cgcattgcag agcttgcaga gcggagagtg cgagctcgcg 16440ctggccggcg gcgcgaacct catcctgacc ccatacaaca cgatcaagat gacgaagctc 16500gggacgatgt cgcccgacgg ccggtgcaag gcgttcgacc accgcgccaa cggctacgtg 16560cgcgccgagg gcgtcgggtt cgtggtcctg aagccgctgt cgcgagcgac cgcggacggg 16620gatcggatct atgcggtcgt gcgtggctcg gccgtgaaca acgacgggct caccgacggg 16680ctgaccgcgc cgagcgggga ggcgcaggag gccgtgctgc gagaggcgta tgcgcgcgcc 16740ggggtgtctc ccgccgaggt ggactacgtc gaggcgcatg ggacgggaac gccgctcggc 16800gaccgcgtgg aggcgacggc gctgggacgg gtgctcggcg caggacgcgc ggcggatcgc 16860gcgctgcggg tcggttcggt caagacaaac ctcggtcacg cggaggcagc cgccggggtc 16920atcggtctga tgaagacagc gctgtcgctg cgtcacgggt cgcttccggc gagcctgcac 16980gtcgagcgcc cgaaccccga gatacccctc gaatcgctgg gcctccggct ccagacggcg 17040cacggcgtgt ggccggaggt cgatcggccc cggcgagcag gcgtgagctc attcggcttc 17100ggcggcacga actgccatgt ggtgatcgag gagtggcgcg ggggcctcca gcagagcgcc 17160gccgaggcgg gcagcgaccc cggcgccgcc gtaccgccgc ctggccttcc ccttgtgctg 17220tcggcgaggg accacggggc gctgcgggcg caggcgggcc ggtgggcggc gtggctcacg 17280gagcaccgcg aggcgcgctg ggcggacgtc gtccacacgg cggcagtgcg gcggacgcac 17340ctgggcgctc gggccgcggt gatggcggcg ggcgtggccg aggccgtcga tgcgctgaag 17400gccctggccg acgggcgcgc ccacggggcc gtgacggtcg gcgaggcgcg cgagcggggc 17460aaggtggtct tcgtgtttcc gggccagggc agccagtggc cggcgatggg gcgagcgctc 17520ctgtccgcgt cgaaggtgtt cgccgaggcc gtcgaggcgt gcgacgcggc gctgaggccg 17580ctgacgggct ggtcggtgct ctcgttgctg cgcggcgacg ccggggaggc agcgccgtcg 17640ctcgaccgcg tcgacgcggt gcagccggcc ctgttcgcga tggctgtcgg cctggccgct 17700gtctttcgcg cgtggggcct cgatccttcg gccgtggtgg gccacagcca gggcgaggtc 17760ccggcggcgt acgtcgcggg ggcgctctcg ctcgacgacg cggcgcgggt cgtggcggtc 17820cgaagcgcgc tcgtgcggcg gctcgcgggc gcaggggcga tggcggcggt ggagctgccg 17880gccggcgagg tggagcgccg cctggcgccg ttcggggggg ctctggccat tgcggtggtc 17940aacacgtcga gctcgacggc cgtttctgga gacgccgagg cggtggacag gctggtcgcg 18000cagctcgagg ccgaaggcat cttctgccga aaggtgaacg tcgattacgc atcccacagc 18060gcgcacgtgg acgtcgtgct accagagctc ctggagcgcc tggcgccggt ccggccaggg 18120gccacgagga tccccttcta ttcgacagtg accggcggtg tgctggaggg gacggcgctc 18180gacggggcgt actggtgccg caacctgcgc cagccggtgc ggctggaccg cgcgctcgcc 18240cggctgctgg acgacgggca tggcgtcttc gtggaggtca gtgcgcaccc ggtgctggcg 18300tcgccgctga ccgcggcgtg cgccgagcgc gagggcgtgg ttgtcggcag cttgcagcgc 18360gacgacggcg ggctcgcgcg gctgctcggc tcgctgggcg cgctgcatgt gcagggccag 18420ccggtcgact ggcgcgcggt gctggcgccg ttcggcggca gcctggtgga cctgccgacc 18480tatgcattcc agcgccagcg ttactggttc gatacggatg agagcgtcgc cctcgcagcg 18540gcgtccagcg tcgcggaaga gtcgtggtca gaaaagctgg ccgggctgtc ttccgcgcga 18600cgggaagaac ggctgctcga atgggtgcgc gcagagattg cagcggtgct cgggctggag 18660gcgccggcgg tgccgccaga cgtcttgctg cgggatctcg gattgaaatc gccgatcgcc 18720gtggagctgg ggagccggct gggacgcagg acacgccgga agctgcccgt gaccttcgtt 18780tacaaccacc cgacgccacg agcgatcgct cgcgccctcc tggagggaat gttttcctcg 18840atcaaggact ctgcttcgag cgccgctgac gaccgccgcc cgccgggggt gctcgaagac 18900gttgcccccc cacaggcgct cgagacgtcc gagatgtccg acgatgagct gttccagtcc 18960atcgatgcgc tcgtctaggg agaccgcgct ctcgtcgaag aaggttgttc aacgctgcgg 19020gtcgaggatt gctcgtggat cgaagcgata aactgcgtgc gtatctggag aagaccacgg 19080cctcgctggt cgaggcgaag ggccggatcc gggagctgga agcgcgttcg cgcgagccga 19140tcgcgatcgt ggcgatggcg tgccggtttc cgggcggcgt cgacagcccc gagaagctct 19200gggccctgct ggacgaggag agggacgcca tcaccgaggt gccgccctcg cgatgggacc 19260tcgagcgctt ctatgacccc gatccggacg ccgcgggcaa gacctacagc cgctggggcg 19320gcttcgttgg cgatctggac cgtttcgacg cggcgttttt cgggatcagc ccccgcgagg 19380cccggagcat cgacccgcaa gagcgctggc tgctggagac cacgtgggag gccctcgagc 19440gggccggcgt gcgcgcagac acgctggaag ggaccctggg gggcgtttac atcggcctgt 19500ccggctcgga gtaccagacg gaggcattcc acgatgcgga gcgcatcgac gcctattcgc 19560tgaccggcgc ttcgccgagc acgaccgtgg ggcgcctcgc ctactggctc gggctacgag 19620gccccgcggt cgccgtggac accgcgtgca gctcctcgct cgtcgcggtg cacctggcct 19680gccaggcgct gcggaacggg gagtgcgatt ttgcgctggc aggcggcgtc aatgcgctcc 19740tggcccccga gagctatgtt gccttctgcc gcctcagggc gctgtccccc accgggcggt 19800gccagacctt ctccgcggac gccgatggct acgtgcgcgc ggaagggtgc ggggtgctgc 19860tgctcaagcg tctgtcgcac gcgcagcggg atggagaccg tgtgctcgcg gtcatccggg 19920gcaatgccat caaccaggac ggccgcagcc aagggttgac ggcgccgaac gggctcgcgc 19980aggaggacgt catccgcagg gcgctgtcgc aagccgccgt ggagccgacg accgtcgatg 20040tggtcgaatg ccacgggacc ggcacggcgc tcggcgatcc gatcgaggtc caggcgctcg 20100gggctgttta cggcgatggg cgccccggag acaggccgct cgtgatcggc tccgtcaaga 20160cgaacatcgg tcataccgag gcggccgcgg gcatggccgg cctcatcaag gccgtccttt 20220cgctgcagca cgcccaggtc cctcgatcgc tgcacttcgc ggcgccgagc ccttacatcc 20280cctgggatac cctccccgtc cgcgtggccg cgcagcgcgt cgcatgggag cggcgcgagc 20340acccgcggcg cgccgggatc tcctcgttcg ggatcagcgg caccaacgcg cacgtgatcc 20400tcgaggaggc gccggaagcg ccggcgacgg cgccggaggc ggcggcggtg acgtcgacgc 20460tgccgttgct tgtgtcgggg cgggatgagg cggcgctcag ggcgcaggcg gagcggtggg 20520cggcgtggct cgcggcgcac ccggaggcgc gctgggcgga cgtggtgcac acggccgccg 20580tgcggcgcac gcacctggag gcgcgcgcgg cggtggccgc ggggaacgcc gccgacgccg 20640ccgcggcgct gggggcgctg gccgccgggc agccgcacaa ggcggtgtcc ctgggcgagg 20700cgcgcgcgcg cggcgatgtc gtgttcgtgg ttccgggcca ggggagccaa tggccggcga 20760tggggcgggc gctgctggcc gagtccgagg tgtttgccgc cgctgtcgcg gcctgcgacg 20820cggcgctgcg gccgttcacg ggctggtcgg tgctctcggt gttgcgcggg gagcagggcg 20880aggcggtgcc gcccgccgac cgcgtggacg tggtgcagcc ggcgctgttc gcgatggccg 20940tggggctctc ggcggtctgg cgggcgtggg gcatcgagcc ctcggcggtg gtcggccaca 21000gccagggcga ggtcgcggcg gcgtacgtcg ccggggcgct gacgctcgag gacgcggcgc 21060gggtggtggc gctgcgcagc cagctcgtgc ggcgcatcgc cggcggcggc gcgatggccg 21120tgatcgagcg ccccgtcggc gaggtggagc agcggctttc tcggttcgga gggcagctct 21180cggtggcggc ggtgaacacg ccgggctcga cggtggtgtc cggggacgcc gcagcggtcg 21240atcgtttgct ggccgagctg gagaccgcgc gggtgttcgc gcggcggatc aaggtcgatt 21300acgcgtcgca cagcgcgcac gtggacgcga tcctgccgga gctcgaggcc tgcctggcct 21360cggtcgagcc ccgtacctgc gccatcccgc tgtactcgac ggtgacggga gaagtgctcg 21420ccggcccgga gctcggcgcg acatactggt gccgcaacct gcgcgagccg gtgcggctcg 21480accgggcgct ctcgcggctg ctggcggacg ggcacggggt gttcgtggag gtcagcgcgc 21540atccggtgct ggccatgccg ctgtcggccg cgagcgccga gcgcggcggc gtggtggtgg 21600gcagcctgca gcgcgacgac ggcggtctgg ggcggctgac gtcgatgctt ggcgcgctgc 21660acgtgcacgg ccacgccgtg agctggcagc gggtgctggc gccgtacggc ggggcgctcg 21720tgggcctgcc gacgtacgcg ttccagcgcc agcgccactg gctcgaggcg ccgcggtacg 21780cggcggagga tacggacggc gcggcgcggc gcgacccgct gtaccgggtc acgtggatcg 21840aggcggcgct ggaagaagcg ccgtgggcgc ccgagcgcca cgtcgtgctc ggcgggggcg 21900gcgcgctggc ggcggggctg ggggcgctcg cgctggcggg gctgccggag ctgctcgagg 21960cgctggagaa cagggcggcg gcgcccgagc ggctggtgct ggacctgacg gagggccgcc 22020caggcgcggt ggcggagtcc gtgcacgcca cgacgcgcga cgcgctcgcg ctggtccagg 22080catggcttgc ggcgccgcgg ctctcgggca ccgagctggt cgtggtgacg cgggaggcgg 22140tggcggccgg cccggacgag ggcgtggcgg cgctgggccc cgccgctgtc tgggggctgc 22200tgcgcacggc ccgcgtcgag caccccgagc gcgcggtgcg cgcggtggat ctggggcgcg 22260agccgctgga cgtcgcggtc ttgcggcggg cgctgggggc ggtggccgag ccggagctcg 22320cgctgcgcgc gggcggggcg cgggctgcgc gcctgcgcgc tgtcgacgcc ggcgcgggcg 22380ccagggagcc ggcggctgcg ctggacccgc agggcacggt gtggatcacg ggcggcaccg 22440gggagctggg gcggcagatc gcgcggcacc tggtcgcggc gcacggcgtg cggcacctcc 22500tgctgacgtc gcggcggggc gcggccgcgc cggacgccga ggcgctcgtc gagcagctgc 22560gggccgacgg cgccgagacg gtcgaggtcg tggcgtgcga cgtgacggac ggcgcggcgc 22620tttcggcagc agtccaggcg gctgcggcaa ggcacccgct gacggccgtg gtgcacaccg 22680ccggggagct ggcggacggg gtgctcacgg ggctgacggc ggagcagctc gcgcgggtgc 22740tggcgccgaa ggtcgacggg gcgtgccacg tgtacgccgc cgcgcaggac cagccgctcg 22800cggccttcgt gctgttctcc tcgatcgtgg gcacgctggg caacgcgggc caggcgaact 22860acggggccgc caatgcgttc ctggacgcgt tcgcggcgca gcttcgcgcg cgcggcgtgc 22920cggcgacgag cctcgcgtgg ggcttctggg agcaggcagg gctcggcatg acgtcgcacc 22980tcggcgcggc cgacctggcg cgcctcaggc ggcagggcct tgcgccgctg tcggtcgcgc 23040agggcctgcg cctgctcgac cgggcgctcg cgcgcgcgga ggcgacgctg gtgccggcgg 23100cgctcgatct tccggcgctc cagcgtgcgg cgagcgacgc cggacgggtg cctccactgc 23160tgcgcgggct ggtgcgcacg agtcccggcc gccccacggc gaccgcgacc cccgaggccg 23220ggccggcggc gtcggcgctg cgcgcacggc tctcggcgtt gcccgaggcc gagcggccgg 23280gcgcgctgct ggatctggtg cgcacggagg tggcggtcgt gctgcagctg gcagggccgg 23340cgcaggtgcc cgcggacaag ccgctgaagg agctggggct cgattcgctc acggccgtcg 23400agctgaggaa ccgcctcggc gcgcgcgccg agacggtgct gccgacgacc ctcgcgttcg 23460accatccgac gccgcgcgcg atcgcggatc tgctgcttca gcgtgcgttc tcggagctcg 23520cggcggcgaa ggcgacgcgc gcgcggggag cgcacgacga gccgatcgcg atcgtgtcga 23580tggcgtgccg gctcccgggc agcgtcgata cccccgcggc gctgtggaag ctcctggcgg 23640aggggcggga cgcgatcggg ccgttccccg aggggcgcgg ctgggacgtg gcggggctgt 23700acgatccgga cccggatgtg ccgggcaagt cgatcaccac gcaaggcggc ttcctctacg 23760acgccgaccg cttcgatccg acgttcttcg gcatcagccc gcgcgaggcc gagcgcatgg 23820acccgcagca gcgtctgctg ctcgagtgcg cctgggaggc gctcgagcgc gcgggcctgg 23880cgccccacgc gctcgaggcg agcgccaccg gcgtcttcgt cgggctcgct cacggtgact 23940acggcgggcg gctcttgcag cagctcgagt ccttcgacgg ccacgtcctc accggcaact 24000tcctcagcgt cggctcgggg cgcatcgcgt acacgctggg gctccgcggc cctgcgatga 24060ccgtcgacac ggcgtgctcg tcgtcgctcg tggcggtcca cctcgcgtgc atgtcgctcc 24120gcgcgggcga gtgcgacatg gcgctcgccg gcggcgccac cgtgatggcc acgccgatga 24180tcttcgtcga gttcagccgc cagcgcggca cggcgctgga cggtcgttgc aaggcgttcg 24240gcgccggggc cgatggcgcc ggctggtcgg aggggtgcgg gatcctggcg ctgaagcggc 24300tgtcggacgc gcagcgcgac ggcgaccgcg tcctggcggt gatccgcggc tccgccgtca 24360accaggacgg ccgcagccag gggctcaccg cccccaacgg cccggcccag caggacgtca 24420tccgccaggc cctggccgcg gcggggctca cgcccgccga cgtcgacgcc gtcgaggcgc 24480acggcaccgg cacgcgcctc ggtgacccca tcgaggcgca ggcgctgctg gcgacctacg 24540gcgccgcgca cacagcggag cggccgctct ggctcggctc gctcaagtcg aacctcgggc 24600acacgcaggt cgccgcgggc gtgtcggggc tgatgaagct cgtgctggcc ttgcagcacg 24660cagagctgcc gaggacgctg cacgccgacc cgccctcgcc gcacgtcgac tggtcgcagg 24720ggcacgtcaa gctcctgaac gagcccgtgc cgtggccgcg caccgacagg ccgcggcgcg 24780cggcggtctc gtccttcggc atcagcggca ccaacgcgca cgtcatcgtc gaggaggcgc 24840cggccgaagc gccggcgaca gcggcggacg caaagtcggt ggaggcgctt ccgatcctgc 24900cgctgctggt ctcggggtcc gacgagccgg cgctgcgcgc gcaggtgcgg cggctggtgg 24960agcacctgcg gtcgcacccg gacgagcggc tgctggacgt ggcagcgagc cttgcgacca 25020cgcgcgcgca tctcgcgatg cggctcgcgc tgcccgtctc ggcaggggcg ccccgggatg 25080cgtgggtgga tgagctggag gcatttgcca ggggaggagc ggctccgacg caggcatcgc 25140agacccccgc cgagagcagc gcgggcaagg tcgcggtgct cttcaccggc cagggcagcc 25200agcgcgccgc catggggcgc gccctgtacg ccacccaccc cgtcttccgc gccgcgctcg 25260acgccgcatg cgccgagctc gaccgccacc tcgacaggcc cctccacagc gtcctcttcg 25320cagacgccgg caccgaggcc gccgcgctgc tcgaccagac aggatgggca cagcccgccc 25380tgttcgctct cgaggtcgcg ctctaccgac agtgggaggc ctggggtctg cgccccgagc 25440tgctgctcgg ccacagcatc ggcgagctcg ccgccgccca cgtcgccggc gtgctcgacc 25500tccccgacgc ctccgccctg gtcgccgccc gcggacggct catgcaggcc ctcccccacg 25560gcggcgccat ggcctccatc gaggccaccg agcacgagct cctacccctg ctcgaccagc 25620acaccggacg cctctcgctc gccgccctca acgctccacg ccagtcggtc gtcagcggcg 25680acctgcacgc cgtcgaccag gtctgcgccc acttcatcgc cctcggccga cgcgccaagc 25740ggctcgacgt cagccacgcc ttccactcgg cgcacatgca gcccatgctc gacgccttcg 25800ccagcgtcgc ccgcggcctg accttccacc cgccacggct gcccatcgtc agcagcgtca 25860ccggcgcacg cgccaccacc gaccagctca cctcgcccga ctactgggtg cagcaggtgc 25920gcgagcccgt gcgcttcctc gacgccatgc gctccctgca cgccgccggc gccgccacct 25980tcgtcgagtg cgggccgcac ggcgtgctca ccgccgcagg cgccgagtgc ctcgctcccg 26040agggcgctcg cgacgccggc ttcgtcacca gcctccgcaa ggaccgcgac gaggccctcg 26100ccctggtcca cgccgcctgc gccgtccatg tccgcgggca cgccctcgac tggctccgct 26160tcttcgacgc caccggcgct cgccgcgtcg agctgcccac ctacgccttc cagcgacagc 26220gctactggct cgaggcgcca aggcctcgcc ccagcctcga gggcgtcggc ctcaccgccg 26280caaaccaccc atggctcggc gccgccgtgc gcctcgcaga ccgcgatggc tacgtcctca 26340gcggccgcct ctccaccatc gaccacccgt gggtcctcga ccacgtggtg ctgggcacgg 26400cgctgctccc gggcacgggc ttcgtcgagc tggcgtgggc ggcggcagag gcggtcgggc 26460tgcccggggt atcggagctg gcgatcgagg cgccgctggc gctcccggcg cgcggggcgg 26520tggcgctgca gatcgcgatc gaggcgccgg acccggcggg gcgccgcggc gtcgcgatct 26580acagccgccc cgacggcgca gccgacgcgc cctggacagc gcacgcgcgc ggcgtgctgg 26640gcgccgcggc gcccgacagg gacgcggcgt gggcacaggg cgcgtggccg ccgccggggg 26700ccgtgcctgt cgatgtgacg cagcggatcg agatcgtgga cgcgtgggtc ggcccggcgt 26760tccggggcgt caccgcgctg tggcgcgtcg ggcggacgat ctacgccgac gttgcgctgc 26820cggacggtgt ggcgagcacg gcgcaggact tcgggctgca tccggccttg ctcgatgtgg 26880cgctacgcgc gttcctgaga gcggagctcg gcgccgatcc ctcgccacgg gagggcacgg 26940tggtgccgtt cgcgtggtcg gacgtggtgc tcgaggcgcg tgggacggcg gcgctgcggg 27000tgcgcgtgga ggtggcggcc gatggggacg gcgacgcgat cacggcgtcg atccagctgg 27060ccgacgggca gggccgcccc gtcgcgcggg tgggcgcgct ccagatgcgg tggacgacgg 27120ccgagcgggt gcgcgcggcc gcgggcgcgg cggagcgcga tctgtaccgc gtcgcgtgga 27180cggacgtggc gctggacgac gcggcgtttg cgccggagga gcacgtcgtg gtcggcggcg 27240acggcgcgct ggcggcggcg ctcggtgcac gcgtggtggc ggggctgccc gagctgctcg 27300cgtcgctgcc ggacggcgcg gcggcgccac gccggctggt ggtggacctc acggcggacg 27360ccgcgggcgc ggtcgtcgac gccgtgcacg ccgcagcgcg cgacgcgctg tccctggtgc 27420agggatggct ggcggcgccg cagctggcgg cgacggagct cgtggtcgtg acgcgcggcg 27480cggtggcggt cgcgccggac gagggcgtgg cggcgctggg ccccgcggcg gtctgggggc 27540tgctccgcgc gacgcgcgtc gagcatgcgg atcgcacggt ccgcgtgctc gatctggggt 27600ccgcggcgcc ggacatgacg ctcttgcgcc gggcgctcac ggcggccgag gagccagagc 27660tcgcgctgcg cgcgggcggg gcgcgggcgc cgcgcctcga cgcggccagc gagaccgaag 27720gagagctggc gccgcccggc ggggcgcgct ctcttcgcct gtccatccgg acgaagggct 27780cgttcgacgc gctccacctc gcggacgctc ccgatgcgct gcgcccgctc gggccggggc 27840aggtccggct cgctgtccgc gccacggggc tcaacttccg cgatgtcttg aacgtcctgg 27900ggacgtaccg cggcgaagcg gggcctctcg gtctggaggg ggctggggtg gtgctggacg 27960tgggcgaggg agtcaccgcc cttcgacccg gcgaccgggt gatgggcatg ctgcacgcgg 28020gcatggcgac ccatgcggtc gtcgacgccc ggctgctgac gcacatcccg cgggggcttt 28080ccttcgtgga agcggcgacg attccagcgg ccttcctcac cgctctgtac gggctgcgcg 28140acctcggcgc gctgaaggcg gggcagcgcg tgctggtgca cgccgccgcc ggcggggtgg 28200gcatggcggc ggtccagctt gcgcgcctct ggggagccga ggtgttcgcg acggcgagcg 28260agggcaagtg gccggcgctg cgtcggatgg ggatcgacca ggcccatatc gcctcgtcgc 28320ggaccctcca cttcaggaaa gccttcctcg atgcaacgca gggacagggc gtcgacgtgg 28380tgctcgacgc gctcgcgggc gagttcgtcg acgcttcgct cgacctgctc ccgcgcgggg 28440gcgcgttcgt ggagatgggc aagagcgatg tgcgggatcc cgagcgcgtc gccaaggacc 28500acccccgcgt tcgctacacg gccttcgatc tgctcgacgc ggggccagac cacatccagg 28560cgatgctgcg ggagctcgtc ccgctgttcg aggagggcgt cctcgctccc cttccctccg 28620tggcctacga cctgcgtcgc gccccgcacg ccttccgctc catggccaac gcacgccaca 28680taggcaagct cgtgctggtg ccgcccgcga cgctcgaccc tgacggcacg gcgttgatca 28740cgggcggcac gggagagctc gggcggcaga tcgcgcggca cctggtggcg gcgcacggcg 28800tgcgccacct ggtgctgacg tcacggcgcg gcatggacgc gcccgacgcc gcagcgctgg 28860tggaatcgct gcgcgcggcg ggcgccgcga cggtggaggt cgcggcgtgc gatgtgacgg 28920accgtgacgc gctggcggcc atcgtgcagg cgatccccgc ggcgcgcccg ctgaccgccg 28980tcgtgcacac ggccgccgtg ctggacgacg gcaccgtggc ggggctctcg gccgagcagc 29040tcgcgcgcgt gctgcggccg aaggtcgacg gcgcctggca gctctacgag gcgacgaggg 29100acgcgccgct cgcggcgttc atgctcttct cgtcggtcgc cggcacgctg ggcagctcgg 29160ggcaggcgaa ctacgccgcc gcgaacgcgt tcctcgacgg gctggcggca gagctccgcg 29220cgcgcggcgt gccggcgatg agcctcgcgt ggggcttctg ggagcagggc gggatcggga 29280tgacggcgca cctcggcgcc gccgatctgg cgcggctgaa gcggcagggc atcgtgccga 29340tgacggtcgc gcacggcctg cggctgctcg accgcgccct cgagcgcccg gacgcggcgc 29400tggtgcccgc ctccctggac atggcggtga tccagcggac ggcgagcgac caccgtcagg 29460tgccgcccat gctgcgcggg ctggtccgcg tcgcgccgcg gcaggcggca ggggcagcca 29520gcggcaggag ccatgaggcc tcgaccctgc ggcagcagct cgccgcgctg cccgaaccgg 29580agcggcagcg agcgttgctc gatctggtcc ggaccgaggc agccgccgtc cttgtgctgc 29640gcgggccgga cgctgtcccc gccgacaagc cgctcaggga gctcgggctc gactcgctca 29700cggcagtgga gctcaggaat cggctcagga cccgtgcgca gaccgatctc ccatcgaccc 29760tcgccttcga ctacccgacg ccgaaagcgg tcgccgtgta tctggcccag gagctcgacc 29820ttcacgacgt catgacggag atgcgcggac cgagcttgcg ctctgacgac gagctcaagt 29880cggccatcgc gagcatccgg atctcgacgc tacgccaggc ggggctgctc gacagcctgc 29940ttcggctcgc cgccagcgaa gccgtctcca catccagcga cacgacacct gaaaccgacg 30000agctgacgct gcagcatgtt ggagacgatg agctggcacg gcttgtcttc gacctcgccg 30060gaggagcgca atgaaagaag agatctccgc

ccgtcaagct ctcgagaaga gcttcattga 30120acttcgccgt atcaagcggg agctcgatca gctcaaggcg aagtcgagcg agccgatcgc 30180gatcgtgtcg atggcgtgcc ggctcccggg cggcgtcgat acccccgcgg cgctgtggca 30240gctgctctcg gaggggcggg acgcgatcgg gccgttcccc gaggggcgcg agtgggacgt 30300ggcggggctg tacgacccgg acccggacgc gccgggcaag tcgatcactg cgcaaggcgg 30360cttcctctac gacgccgacc gcttcgatcc ggcgttcttc gccatcagcc cgcgcgaggc 30420cgagcggatg gacccgcagc agcggctgct gctcgagtgc gcctgggagg cgctcgagcg 30480cgcgggcctg gcgccccacg cgctcgaggc gagcgccacg ggcgtcttcg tcgggctgtc 30540ggtcacggac tacggcgggc ggctgctgca cgatcccgag gccctcgacg gctacatcgc 30600caccggcacc ctgcccagcg tcggctcggg gcgcatcgcc tacacgctgg ggctccgcgg 30660ccccgcgatg accgtcgaca cggcgtgctc gtcgtcgctc gtgtcgctcc acctcgcgtg 30720catgtcgctc cgcgcgggcg agtgcgacat ggcgctcgcc ggcggcgcca ccgtgatggc 30780cacgccgatg gccttcatcg agttcagccg ccagcgcggc acggcgctgg acggtcgttg 30840caaggcgttc ggcgccgggg ccgatggcgc cggctggtcg gaggggtgcg ggatcctggc 30900gctgaagcgg ctgtcggacg cgcagcgcga cggcgaccgc gtcctggcgg tgatccgcgg 30960ctccgccgtc aaccaggacg gccgcagcca ggggctcacc gcccccaacg gcccggccca 31020gcaggacgtc atccgccagg ccctggccgc ggcggggctc acgcccgccg acgtcgacgc 31080cgtcgaggcg cacggcaccg gcacgcgcct cggcgacccc atcgaggcgc aggcgctgct 31140ggcgacctac ggcgccgcgc acacagcgga gcggccgctc tggctcggct cgctcaagtc 31200gaacctcggg cacacgcagg ccgccgcggg cgtgtcgggg ctgatgaagc tcgtgctggc 31260cttgcagcac gcggagctgc cgaggacgct gcacgccgac ccgccctcgc cgcacgtcga 31320ctggtcgcgg gggcacgtca agctcctgaa cgagcccgtg ccgtggccgc gcaccgacag 31380gccgcggcgc gcggcggtct cgtccttcgg cttcagcggc accaacgcgc acatcatcat 31440cgaggaggcg ccggcggcct ccgccgaggc gacgagccgc ggggagaaga cgtccgcggc 31500cgcgccgccg tcgatgatgc cgctgctggt ctcgggggtg gacgaggcgg cgctacgagc 31560gcaggcgggg cggtgggcgg cgtggatcga ggcgcacccg gaggcaggct gggcggacgt 31620tgtgtacacc gcggcagcgc ggcggacgca cctgggggcc cgtgcggcgc tgacggcggc 31680ggacgcggcc ggcgctgtcg cggcgctgac ggcgctctcg caagggcagc cgcacgccgc 31740gctcgccgtg ggcgaggcgc gcgctcgggg gaaggtcgcc ttcgtgtttc cgggccaggg 31800cagccagtgg ccggcgatgg ggcgggcgct gctctcgcag tcggaggtgt tcgccgcggc 31860ggtcacggcg tgcgacgcgg cgctgcggcc gttcaccggc tggtcggtgc tctcggtgct 31920gcgcggcgac tcgggcgcgg aggtgccgcc gctggagcgc gtcgacgtcg tgcagccggc 31980gctgttcgcg atggcggtgg ggctcgccgc tgtgtggcgc gcgtggggcc tcgagccgtc 32040ggcggtggtg ggccacagcc agggggaggt cccggcggcg tacgtcgcgg gggcgctgtc 32100gctcgaggac gcggcgcgga tcgtggcgct gcgcagccag ctcgtgcggc gcctgtccgg 32160ggctggcgcg atggccgtga tcgagcgccc ggtaggcgag gtcgagcagc ggctctcgcg 32220gttcggcggc gcgctgtcgg tggcggcggt caacacgccg cgctcgacgg tggtgtcggg 32280agatatcgag gcggtcgacc gcctgctggc ggagttcgag ggcgagcagg tcttcgcgcg 32340gaaggtcaac gtcgactacg cgtcgcacag ccgacacatc gacgggctgc tgccggagct 32400ggagaacggc ctgggcgcgg tgcggccgcg cgcgagcacg atcccgttct actcgacggt 32460gaccgggacg gtgctgacgg gcgcggagct ggacgccgcg tactggtgtc gcaacctgcg 32520cgagccggtg cggctcgacc gggcgctctc gtggctcctg gacgacgggc acggcctgtt 32580cgtcgaggtc agcgcgcacc cggtgctgac gctgccgctc acaggagcga gcgcggcgag 32640cggcggtgtg gttgtcggca gcctgcagcg cgacgacggc gggctcgggc ggctcctggg 32700ggtgctggcc gcgctgcacg tgcacggcca cgacgtcgac tggcgcgcgg tgctggctcc 32760gtggggcgga ggcgtggcgg acttgccgac ctacgcgttc cagcggcagc gctactggct 32820cgaggcaccg cgcggccggg cagggctgga gagcggaggg ctcctggccg tgaatcaccc 32880gtggctcagc gcggcggtgc ggctggccga ccgcgacggc tatgtgctga gcggacggct 32940gtcgacggtc gagcacgcgt gggtcctgga ccacgtggtg ctgggcacgg tgatcctccc 33000gggcacggcg ttcgtcgagc tggcgctcgc ggcggccgat gcggtcggac tgccctcggt 33060gtcagagctc acgatcgagg cgccgctggc gctgccggcg cgaggggcgg tggcgctgca 33120ggtgacggtc gaggcgccgg acgcgacggg gcggcggggc ttcgcggtct acagccggcc 33180cgacggcgcg cacgacgcgc cgtggacggc gcacgcgcgc ggcgtgctcg gcgcagcgcc 33240cgcggcggcc acgacggcgt gggcggcggg cgcgtggccg ccggcggggg ccgagccggt 33300cgacgtcacg cggtgggtcg aggcgctgga cgcgtgggtc ggcccggcgt tccggggcgt 33360gacggcggcg tggcgcgtgg ggcggtcgat ctacgccgac ctggcgttgc ccgagggggt 33420ctcggagcgg gcgcaggact tcggcctgca tccggccttg ctcgatgcag cgctccaggc 33480cctcctgagg gcggagctcg gcgcaggcgc gtcgccgcgg gagggcatcc cgatgccctt 33540cgcgtggtcg gacgtggcgc tcgaggcgcg gggggcagcg gcgctgcggg cgcgcgtgga 33600ggtcgaggac gccagcgatg gggaccagct cgcggcgtcg atcgagctgg ccgacgcgca 33660ggggcagccg gtcgcgcgcg cagggacgtt ccgggcgcgg tgggcgacgg cggagcacgt 33720gcgcatggct gcggcgggct cgagcgagcg tgacctgtac cgggtcacgt gggcggacgt 33780ggtgctggaa gaagcggcgt gggcgccgga ggagcacgtc gtgctcggcg gcgacggcgc 33840gctcgcggcg gcgctgggcg cgcgcacggc ggcgctgccg gagctcatcg cggcgctgcc 33900ggagggcgcg gccgcgccgc gccggctggt gatcgacgcg gccgcgggcg accccggcga 33960cggcctggtc gcggcggcgc acgcggcggc gcagcgggtc ctgtcgctgg tgcaggggtg 34020gctctcggag gcgcggctcg cggacagcga gctggtggtg gtgacgcgcg gcgctgtggc 34080cgccgggccc gacgacggcg tcgcggcgtt gagccacgcg ccgctgtggg gactcgtgcg 34140cacggcgcgc caggagaacc ccggccgggc ggtgcgcctc gtggacctgg ggcccgagcc 34200gctggacgga gcgctcctgc gccgggtggt ggcggcggcc gaggagccgg agctcgcgct 34260gcgcgggggc gcggcgcgcg cgccacgcct gcgcgaggtg cgcgcgggcg cggccgacgc 34320ggcgcggccg acgcggctgg atcccggcgg gacggtgctg atcacgggcg gcaccgggga 34380gctcgggcgg caggtcgcgc ggcacctcgt ggcgtcgcac ggcgtgcggc acctcgtgct 34440cacgtcgcgg cgcgggatgg gtgcgccgga cgccgcggcg ctggtggacg agctgcgcgc 34500cgcgggcgcc gcgacggtcg acgtcgcggc gtgcgacgtc gccgacggcg cggcgctggg 34560ggcggtcatc gcggcgatcc cggctgcaca ccccctcacg gcggtcgtgc acatggcggg 34620cgtgctggac gacgtcatcg tgacgaagct ctcggccgag cagctcacgc gcgtgctgcg 34680gccgaagatc gacggcggct ggcacctggc cgcggcgacg cgaggccatc ggctcgcggc 34740cttcgtgctg ttctcgtcgg cggccggcac gctgggcagc ccggggcagg cgaactacgc 34800cgcggccaac acgttccttg acgcgctcgc ggcgcagctc cgcgcgcgcg gcgtgcccgc 34860gatgagcctc gcgtggggct tctgggagca ggcagggctc ggcatgacgg cgcacctcgg 34920cgcggccgac ctggcacgcc tcaggcggca gggcatcgcg ccgatcgcgc tcgcgcaggg 34980catgcagctg ctggaccggg cgctcgcgcg cccggaggcg gcgctggtgc cggcggcgct 35040cgaccttccg gcgctccagc gtgcggcgag cgacgccggg caggtgccgg cgctgctgcg 35100cgggctcgtg cgcccggcgg tcgggcggcg cgcggcggcg cctgcggccg ccgcgaccgg 35160agcggcggcg ctgcgcgcgc ggctcgcgcc gctgcccgag gccgagcggc acgacgtggt 35220gctcgacctg gtgcgcgccg aggcggcggc cgtgctgcag ctggcggggc cggcgcaggt 35280ccccgcggac aagccgctga aggagctggg gctcacctcg ctcacggcgg tcgagctgag 35340gaaccgcctc ggcgcgcgcg ccgagacggc gctgccggcg accctcgcgt tcgaccatcc 35400gacgccgcgc gcgatcgcgg gtctgctgct tcagcgtgcg ttctcggagc tcgcggcggc 35460ggtggcgacg cgcgcacagg cgccacgcgc gcagggggcg cacgacgagc cgatcgcgat 35520cgtgtcgatg gcgtgccggc tcccgggcgg cgtcgatacg cccgcccgga tgtggcagct 35580cctggcggag gggcgggacg cgatcgggcc gttccccgag gggcgcggct gggacgtggc 35640ggggctgtac gaccccgacc cggacgcgcc gggcaagtcg gtcaccaacc tgggcggctt 35700cctctacgac gccgaccact tcgatccgac gttcttcggc atcagcccgc gcgaggccga 35760gcgcatcgac ccgcagcagc ggctgctgct cgagtgcgcc tgggaggcgc tcgagcgcgc 35820gggcctggcg ccccacacgc tcgaggcgag cgccaccggc gtctttgtcg ggctggtgta 35880cagcgactac ggcgggcggt tgctggagca cctcgagtcc ttcgacggct acatcgccac 35940cggcagcttt cccagcgtcg gctcggggcg catcgcctac acgctggggc tccgcggccc 36000tgcgatgacc gtcgacacgg cgtgctcgtc gtcgctcgtg tcgctccacc tcgcgtgcat 36060gtcgctccgc gcgggcgagt gcgacatggc gctcgccggc ggcgccaccg tgatggccac 36120gccgatggcc ttcatcgagt tcagccgcca gcgcggcatg gcccccgacg cacggtgcaa 36180ggccttcggg gcggaggcga acggcatcgg ccccgcggag ggctgcggga tcctggtgct 36240caagcggctg tcggacgcgc ggcgcgacgg cgaccgcgtc ctggcggtga tccgcggctc 36300cgccgtcaac caggacggcc gcagccaggg gctcaccgcc cccaacggcc cggcccagca 36360ggacgtcatc cgccaggccc tggccgcggc ggggctcacg cccgccgacg tcgacgccgt 36420cgaggcgcac ggcaccggca cgcgcctcgg cgatcccatc gaggcgcagg cgttgctggc 36480gacctacggc accgcgcaca cagcggagcg gccgctctgg ctcggctcga tcaagtcgaa 36540cctcgggcac acgcaggccg ccgcgggggt tgtggggctg atgaagctcg tgctggcgat 36600gcagcacgcg gagctgccga ggacgctgta tgcggagccc cgatcgccgc acatcgactg 36660gtcgcagggg cacatcaacc tcctgaacga gcccgtgccg tggccgcgca ccgacaggcc 36720gcggcgcgcg gcggtctcgt ccttcggcat cagcggcacc aacgcgcacg tcatcatcga 36780ggaggcgccg gccgaagcgc cggcgacagc ggcggacgca aagtcggtgg aggcgcttcc 36840gatcctgccg ctgctcctgt cgggtcgcga cgagccggcg ctgcgcgccc aggccgggcg 36900gctcgccgag cacctgcgcg cccacccggg cgagcggctg ctcgacatcg ccgcgggcct 36960ggccacgacg cgcacgcacc tcgccacgcg gctcgcgctg ccggtcgccg cggacgcagc 37020cgcggaggag ctgggcgccc gccttgcgca gttcgccgcc ggcggcccgg cgcccagcgg 37080cgccgccgtg accgcgccgg ggcagccgcc cggcaaggtc gcggtgctct tcaccggcca 37140gggcagccag cgcgccggca tggggcgcgc cctgtacgcc acccaccccg tcttccgcgc 37200cgcgctcgac gccgcatgcg ccgagctcga ccgccacctc gacaggcccc tccacagcgt 37260cctcttcgca gacgccggca ccgaggccgc cgcgctgctc gaccagacag gatgggcgca 37320gcccgccctg ttcgctctcg aggtcgcgct ctaccgacag tgggaggcct ggggtctgcg 37380ccccgagctg ctgctcggcc acagcatcgg cgagctcgcc gccgcccacg tcgccggcgt 37440gctcgacctc cccgacgcct ccgccctggt cgccgcccgc ggacggctca tgcaggccct 37500cccccacggc ggcgccatgg cctccatcga ggccaccgag cacgagctcc tacccctgct 37560cgaccagcac acggggcgcc tctcgctcgc cgccctcaac gctccacgcc agtcggtcgt 37620cagcggcgac cagcccgccg tcgaccatgt ctgcgctcac ttcatcgccc tcggccgacg 37680cgccaagcgg ctcgacgtca gccacgcctt ccactcggcg cacatgcaac ccatgctcga 37740cgccttcgcc agcgtcgccc gcggcctgac cttccacccg ccacggctgc ccatcgtcag 37800cagcgtcacc ggcgcacgcg ccaccaccga ccagctcacc tcgcccgact actgggtgca 37860gcaggtgcgc gagcccgtgc gcttcctcga cgccatgcgc tccctgcacg ccgccggcgc 37920cgccaccttc gtcgagtgcg ggccgcacgg cgtgctcacc gccgcaggcg ccgagtgcct 37980cgctcccgag ggcgctcgcg acgccggctt cgtcaccagc ctccgcaagg accgcgacga 38040ggccctcgcc ctggtccacg ccgcctgcgc cgtccatgtc cgcgggcacg ccctcgactg 38100gctccgcttc ttcgacgcca ccggcgctcg ccgcgtcgag ctgcccacct acgccttcca 38160gcgacagcgc tactggctcg aggcgccaag gcctcgcccc agcctcgagg gtgtcggcct 38220caccgccgca aaccacccat ggctcggcgc cgccgtgcgc ctcgcagacc gcgatggcta 38280cgtcctcagc ggccgcctct ccaccatcga ccacccgtgg gtcctcgacc acgtggtggc 38340aggcacagtg atcttgccag gaacggcgtt cgtcgagctg gcgtgggcgg cggccgaggt 38400ggtgggcgcc gccgcggtgt ccgaggtgac cttcacgacg ccgctcgtgc tgccgccgcg 38460cagcgtggtg gagctgcagg tgaggatcgg cgagccggac gcgtccgggc ggcggacgtt 38520cgccgcgtac agccgcgcgg acgcggcgat cgaggcggag tggacgcaac acgcgaccgg 38580cgtgctgagc gcgcaggcgg cggccggggc cgacgtggcg gacctttcgg tgtggccacc 38640gccgggcgcc gaggtggtgg cgctcgacgg cggctacgcc tggctggcgg cgcagggcta 38700cggctacggc ccggcgttcc aggcgctgcg cgaggtgtgg cgcgcgggca cgacgctgta 38760cgcgcgggtc gcgctgccgg acgcggtggc ggacacggcg cggggcttcg ggatccatcc 38820ggcgctgctc gacgcggtgc tgcactcgtt gctggcgccg tcggcgcagg aggaggcgtc 38880cgacgacgac aaggtgctgc tggcgttcgc gttctcggac gtggtgatcg aggcgcgcgg 38940ggcagcggag gtgcgcgtcc gcctgaacaa gcaggccgga gacgacgggg agggggtcac 39000ggcgtcgatt cacctcgccg acgcgcaggg gcggccggtc gcgcgcgtgg gggcgttcca 39060ggcgcgggcg acgaccacgg agcgggtgcg cgcgctcgcg ggcgcgagcg agcgcgacct 39120gcaccgggtc acgtggacgg acgtgacgct ggaagagacg ccgtgggcgc acgaggacag 39180cgtcgtggtc ggcggcgacg gcgcgctggc ggcggcgctg ggcgtgcgcg cggtggccgg 39240gctgcccgag ctgctcgcgg gcggcgcggc ggcgccgcgt cgtctggtga tcgacgcgac 39300cgcgggcgac cccggcgacg gcctggtcgc ggcgacgcac gcggcgacgc agcggggcct 39360cgcgctcttg cagggatggc tctcggaggc gcggctcgcg gcgacggagc tggtgctcgt 39420gacgcgcggc gcggcggcgg ccgagccgga cgagggtgtg gcggcgctga gccacgcgcc 39480gctctggggg ctcgtgcgcg cggcgcgcga agagcacccg gcgcgcgcgc tgcgccttgt 39540cgacctgggg cgcgaggcgc cggacggggc gatcctgcgc cgggcgatcg cggcggacga 39600cgagccggag ctcgtggtcc gccgcggggc gctgcgggcc gcgcgcctga gcctcgccca 39660cgctggcccg gacaccgcgg ggcaagcgac gcggctggcc cccggcggga cggtgctgat 39720cacgggcggc acgggagagc tcggacggca ggtcgcgcgg cacctggtgg cggcgcacgg 39780cgttcgccac ctggtgctga cgtcacggcg cggaatggac gcgcccgacg ccgcggcgct 39840ggtggagtcg ctgcgcgcgg cgggcgccgc gacggtggag atcgcggcgt gcgacgtggc 39900ggacgggcat gcgctggcgg cggtgctccg gaccatcccg gcggagcatc cgctgaccgc 39960ggtcgtgcac acggcgggcg tgctcgaaga cggcgtcgtg accgggctct cggccgagca 40020gctcgcgcgc gtgctgcggc cgaaggtcga cggcgcctgg cagctctacg aggcgacgaa 40080ggacgcgccg ctcgcggcgt tcatgctctt ctcgtcggcg gcgggcacgc tgggcagcgc 40140ggggcaggcg aactacgccg ctgcgaacgc gttcctcgat gcgctggcgg cagagctccg 40200cgcgcgcggc gtgccggcga tgagcctggc ctggggcttc tgggagcaag gcgggatcgg 40260catgacggcg cacctcggcg ccgccgacat ggcgcgggtc aagcggcagg gcatcgtacc 40320gatgacggtc gcgcacggcc tgcggctgct cgaccgcgcg ctggagcggc ccgaggcgac 40380gctggtgccc ctatcgctcg acgtggcggc gcttcagcgc gcggcgagcg acgccggacg 40440ggtgccggcg ctgctgcgtg gcctggtgcg cccggcggcc gcccggcgca cggcggcgcc 40500ggcggccgcg gcgacagggc tccgcgcgcg gctcttgccg ttgtccgagg ccgagcgcca 40560ggacgtcttg ctcgatctgg tgcgcacgga gatcgcggat atcctcgcgc tgtccgggcc 40620agcggcggtg cctcccgatc aacccatcag ggagctgggg ctcgattcgc tcacggcggt 40680ggacgttcgg agccggcttg tgcagaggag cgagatcgac ctcgccgtga ccctcgcgta 40740cgattacccg accgcgcgag cgatcgcggg acatctgagc gagcagatgg gactcgaagg 40800agcgccggaa gatcgtgagt cggcgctcga cgagagccag atccgcgccc tgctcatgca 40860gattcctatc cccacgttgc gccagtcggg gctgctcgga gacctggttc gcctggcctc 40920cccgcaagcg cccccgcgcg aagaaggtga gagcgagacg ttgagcttcg atcaccttgg 40980aaatgaagag ttcctcagcc tcgcgtcgaa gctcattgca gaggagggat catgaaccaa 41040gagactgttc ttcggcagac actcgagaag agtctccaca agatccagca cctcaatcgg 41100gagctcgagc gtctcaaggc gaagtcgagc gagccgatcg cgatcgtgtc gatggcgtgc 41160cgctacccgg gcggcgtcga cggtcccgca cggctgtggg agctgctctc ggaggggcgg 41220gacgcgatcg ggccgttccc cgaggggcgc ggctgggacg tggcggggct gtacgacccc 41280gacccggacg cgccgggcaa gtcggtcacc acgcagggcg gcttcctcta cgacgccgac 41340cgcttcgatc cgacgttctt cggcatcagc ccgcgcgagg ccgagcggat ggacccgcag 41400cagcggctgc tgctcgagtg cgcctgggag gcgctcgagc gcgcgggcgt cgcgccccac 41460acgctcgagg cgagcgccac cggcgtcttc gtcgggctgg tgtacagcga ctacggcggg 41520cggctgctgg agcacctcga ggtcttcgac ggctacgtcg ccaccggcag ctttcccagc 41580gtcggctcgg ggcgcatcgc ctatacgctg gggctccgcg gccctgcggt gaccgtcgac 41640acggcgtgct cgtcgtcgct cgtgtcgctc cacctcgcgt gcatgtcgct ccgcgcgggc 41700gagtgcgaca tggcgctcgc cggcggcgcc accgtgatgg ccacgccgat ggccttcatc 41760gagttcagcc gccagcgcgg catggccccg gacgcacggt gcaaggcctt cggggcggcg 41820gcgaacggca tcggccccgc ggagggctgc gggatcctgg tgctcaagcg gctgtcggac 41880gcgcggcgcg acggcgaccg cgtcctggca gtgatccgcg gctccgccgt caaccaggac 41940ggccgcagcc aggggctcac cgcccccaac ggcccggccc agcaggacgt catccgccag 42000gccctggccg cggcggggct cacgcccgcc gacgtcgacg ccgtcgaggc gcacggcacc 42060ggcacgcccc tcggcgatcc catcgaggcg caggcgctgc tggcgaccta cggcaagacg 42120cacacagcgg agcggccgct ctggctcggc tcgatcaagt ccaacttcgg gcacacgcag 42180gccgccgcag gggtggcggg catcatcaag ctggtgctgg cgatgcagca cgcggagctg 42240ccgaggacgc tgtatgcgga gccccgatcg ccgcacgtcg actggtcgca ggggcacgtc 42300aagctcctca acgagcccgt gccgtggccg cgcaccgaca ggccgcggcg cgcggcggtc 42360tcgtccttcg gcgtcagcgg caccaacgcg cacgtcatcc tcgaggaggc gccggccgaa 42420gcgcccgcgg ccgcgcaaac agcggcgggg gtgccgtcga cgctgccgct gctcctgtcg 42480ggtcgcgacg agccggcgct gcgcgcccag gccgggcggc tcgccgagca cctgcgcgcc 42540cacccggacg agcggctgct cgacatcgcc gcgggcctgg ccacgacgcg cacgcacctc 42600gccacgcggc tcgcgctgcc ggtcgccgcg gacgcagccg cggaggagct gagcgcccgc 42660cttgcgcagt tcgccgccgg cggcccggcg cccagcggcg ccgccgtgac cgcgccgggg 42720cagccgcccg gcaaggtcgc ggtgctcttc accggccagg gcagccagcg cgccgccatg 42780gggcgcgccc tgtacgccac ccaccccgtc ttccgcgccg cgctcgacgc cgcatgcgcc 42840gagctcgacc gccacctcga caggcccctc cacagcgtcc tcttcgcaga cgccggcacc 42900gaggccgccg cgctgctcga ccagacaggc tgggcacagc ccgccctgtt cgctctcgag 42960gtcgcgctct accgacagtg ggaggcctgg ggcctgcgcg cccacgcgct gctcggccac 43020agcctcggcg agatcgtcgc cgcccacatc gccggcgtgc tcgacctccc cgacgcctcc 43080gccctggtcg ccgcccgcgg acggctcatg caggccctcc cccacggcgg cgccatggcc 43140tccatcgagg ccaccgagca cgagctccta cccctgctcg accagcacac cggacgcctc 43200tcgctcgccg ccctcaacgc tccacgccag tcggtcgtca gcggcgacca gcccgccgtc 43260gaccatgtct gcgctcactt caaggccctc ggccggcgcg ccaagcggct cgacgtcagc 43320cacgccttcc actcggcccg catggaaccc atgctcgacg ccttcgcccg cgtcgcccgc 43380ggcctgacct accgcgcccc gcgcctgccc gtcgtgagca atgtcaccgg ccgcatggcc 43440accgccgacg agctcacctc gcccgactac tgggtgcgcc acgtgcgcga gcccgtgcgc 43500ttcgtcgccg gcgtgcgcgc gctgcacgcc accggcgtcg ccacctacct cgagtgcggg 43560cccgatccgg tgctcggcgg catggccgca gactgcctca cctccgacga gagccgcgac 43620ccaggcctga tccccagcct ccgcaaggac cgcgacgagg ccctcgccat cgcccaggcc 43680gcctgcgccc tgcacgtccg cggacacgcc ctcgactggc cccgcctctt cgacgccacc 43740ggcgctcgcc gcgtcgagct gccaacctac gccttccagc ggcagcgcta ctggatcgat 43800gcgccgcggc gcgcggcggg gctcgaaagc gtcggcctca cggccgcaga ccacccctgg 43860ctgggcgcgg cggtgcggct cgccgaccgg gacgtctacg tgctgagcgg gcggctgtcg 43920acggtcgacc acccgtggat cctggaccac gtggtgacgg gcacggcgct gatgccagga 43980acggggttcg tcgagctggc gtgggcgacg gcccaggcgg tgaacgccgc cgcgatcgcg 44040gagctcaccc tgacgactcc actcgtgttg ccggcgcgcg gcgcggtgca gctccaggtg 44100acggtcgacg aggccgacgc ggatggccgg cgggcattcg cgatccacag ccggccgcat 44160gggcccgtcg acctcgagtg gacgcaacac gcgaccggcg tgctgagcgc ggaggcgccg 44220gcgggagccg acgaggcggc ggggctctcg gagtggccgc cgccgggcgc ggaggcggtg 44280gcgctcgacg gcgggtatga gcagctgtcc gagcacggct acggccatgg cccggcgttc 44340caggggctcc gcgggctctg gcgcgcggac cagacgctgt acgcgcacgt cgcgctgccg 44400gacgctgtcg cgggcacgga gcagggcttc gggctccatc cggcgctctt cgatgcggcg 44460ctgcagtcgc tggcgcggct gtcgcgcgag gaggcggccg ctggcgaccc ggtgctggtg 44520ccgttcgcgt ggacggacgt ggcgctgtac gcggccggcg cgaccgagct gcgggcgcgc 44580atcgcgctgg agcaggcgga gggcggcgcg ccggcggtgg cgtcgctgct gctggccgac 44640gcgcacggac gaaccgtggc gacgacaggg cgggtgcgcg gggcgagcgc ggcgcagacg 44700cggtccgccg cgagccgtgc ggagccgatg tacagggtcg cgtggacgga cgtggcgctg 44760gaggcggcgg cgtgggcgcc cgaagagcac gtcgtgctcg gcggtgacgg tgcgctggcg 44820tcggcgctgg gcgtgcgcgc ggcggccggg ctgccggagc tgctcgaggc gctggcggac 44880ggcgcggccg cgccgcggcg gcttgtcgtg gacctgacgg cgggcgacgc gggcgctgtc 44940gtcgcggccg tgcacgccgc ggcgcgcggc gcgctggccc tggtgcaggg atggctcgcc 45000gcgccgcagc tgacggcgac ggagctcctc gtggtgacgc gctgcgccgt ggcgacaggg 45060ccggacgagg gcgttgacgc gctggggccg gcggccgtct gggggctgct gcgggccacg 45120cgcgccgagc accccgaccg cgcggtccgg

gtgctggacc tggggcgcga gccgctggac 45180ggggcgctcc tgcgcagggc gctggccgcg gtggcggagc cggagctgtc gttgcgccgc 45240ggcgaggcgc gcgcgcctcg cctgcgcgag gcaaagcccg ccgcggcgcc ggcgacacgg 45300ctggaccctg aagggacggt gctggtcacg ggcggcaccg gggagctggg gcggcaggtc 45360gcccggcacc tggtggcggc gcacggcgtg cggcacctcg tgctgacgtc gcggcgcggg 45420atggacgcgc ccgacgccgc ggcgctggta gaagagctgc gcgcggcggg cgcggcgacg 45480gtcgacgtcg ccgcgtgcga cgtcgccgct ggcccggccc tggcggcggt cgtggaggcg 45540atcccggcgg cgcatcccct gaccgcggtc gtgcacatgg cgggcgtgct ggacgacggc 45600atcgtgacga agctctcggc cgagcagctc acgcgcgtgc tgcggccgaa ggtcgacggc 45660gccattcatc tccacgagct cacgaagcac gcgccgctcg cggccttcgt gatgttctcg 45720tccgcggcgg gcacgctggg cagcccgggg caggcgaact acacggcggc caacgtgttc 45780ctggacgcgc tggcggcgcg actgcgcgcg cgcggcgtgc ccgcgatgag cctggcgtgg 45840ggcttctggg agcaaggcgg gatcggcatg acggcgcacc tcggcgccgc cgatcgggcg 45900cggatgaagc gacacggcgt cgtggcgatg tcggtcgcgc agggcctgcg gctgctcgat 45960cgcgcgctcg cgcaccccga ggcggcgctg gtgccgctcg cgctcgacct ctcgtcgctg 46020cacgcggggg ccagcggcgc cggaccggtg ccgccgctgc tgcgcgggct ggtacgcgcg 46080cccgccggcc ggcgcacggc ggcgtccgcg gcccggacga acgggaaggg cacggcattg 46140gcggcgctcc gcgcgcggct cttgccgttg ccgcaggccg agcgcgagga cctcttgctc 46200gagctcgtgt gcaccgaggt cgcggaggtg ctgcagttgc cggggccggc gcacgtcccg 46260gcggatcagc cgctccgcga cctggggctc gactcgctca tgaccgtgga gctgcgcaac 46320cgtctcggcg cgcgcgccga gacgacgctg cccaccacgc tcgcgttcga ctacccgacg 46380cccagggccc ttgcgtccta tctggagacg ttgctcggca tctccgacga gaacgggcat 46440tcgggtgagt tgctgcacgt tccgcagaac gaggacgaga tccgctccgc gatagcgcgc 46500atcccgatag cgaccctgcg cgaggcgggg ctcctccaga gcttgctgcg gctcgccccc 46560ggcaaggcgg tggccggtga cgtcacgcac ccggtcgatg agctgctggt cgagcacatc 46620gaggatgaag agctgcttcg actcgctttc gaggccaccg gaggtatcaa gtgaaagacg 46680aggctctctc gtttcgccga gccctggaga agacggtcgt cgagatccgc cgtctcaatc 46740gggagatcga cgacctgcgg gcgaagtcga gcgagcccat cgcgatcgtg tcgatggcgt 46800gccggttccc cggcggcgtc gagaaccccg aggcattgtg gcggctggtc tccgaggggc 46860aggacgcgat cgggccgttc cccgaggggc gcggctggga cgtggcgggg ctgtacgacc 46920ccgacccgga tgtgccgggc aagtcgatca ccgcgcgggg cggcttcctc tacgacgccg 46980atcgcttcga tccggagttc ttcggcatca gcccgcgcga ggccgagcgc atcgatccgc 47040agcagcggct gctgctcgag tgcgcctggg aggcgctcga gcgcgcgggc gtcgcgcccc 47100acacgaagga ggcgagcgcc accggcgtct tcgtcgggct gatgtacacg gactacggcc 47160tgcggctgct gaaccacccc gaggccctcg acggctacat cggcatcggc agcacgggga 47220gcacgggctc ggggcgcatc gcctacacgc tgggcctgca gggacctgcg atcacggtgg 47280acacggcgtg ctcgtcatcg ctcgtggcgc tccacatggc ctgcgcgtcc ctgcgcgggg 47340gagagtgcaa cctggcgctt gtcggaggcg tcgccgtgat gacgacgccg acaacgttca 47400tcgagttcag ccggcagcgg ggcctctcgc tcgacggccg gtgcaagtca ttcggtgccg 47460aggccgaggg cgtcggctgg ggcgaaggct gcggaatcct ggcgctgaag cggctgtcgg 47520acgcgcggcg cgacggcgac cgcgtgctcg cgatcatccg cggctccgcc gtcaaccagg 47580acggccgcag ccaggggttc accgccccca acggcccgag ccagagggcg gtcatccagc 47640gggcgctggc ggcggcgggg ctgaccgcgg cggacgtcga cgccgtcgag gggcacggca 47700ccggcacgcg cctcggcgac cccatcgagg cgcaggcgct gctggcgacc tacggcaagg 47760cgcacacagc ggagcggccg ctctggctcg gctcgatcaa gtccaacttc gggcacacgc 47820aggccgccgc aggggtggcg ggcatcatca agctggtgct ggcgatgcag cacgcggagc 47880tcccgaggac gctgcacgcc gacacgccct cgccgcacgt cgactggtcg caggggcacg 47940tcaagctcct caacgagccc gtgccgtggc cgcgcaccga caggccgcgg cgcgcggcgg 48000tctcgtcctt cggcatcagc ggcaccaacg cgcacgtcat cctcgaggag gcgccggccg 48060aagcgcccgc ggccgcgcaa acaccagcgg cggcgggggt gccgtcaacg ctgccgctgc 48120tcctgtcggg tcgcgacgag ccggcgctgc gcgcccaggc cgggcggctc gccgagcacc 48180tgcgcgccca cccgggcgag cggctgctcg acatcgccgc gggcctggcc acgacgcgca 48240cgcacctcgc cacgcggctc gcgctgccgg tcgccgcgga cgcagccgcg gaggagctga 48300gcgcccgcct tgcgcagttc gccgccggcg gcccggcgcc cagcggcgcc gccgtgaccg 48360cgccggggca gccgcccggc aaggtcgcgg tgctcttcac cggccagggc agccagcgcg 48420ccgccatggg gcgcgccctg tacgccaccc accccgtctt ccgcgccgcg ctcgacgccg 48480catgcgccga gctcgaccgc cacctcgaca ggcccctcca cagcgtcctc ttcgcagacg 48540ccggcaccga ggccgccgcg ctgctcgacc agacaggctg ggcacagccc gccctgttcg 48600ctctcgaggt cgcgctctac cgacagtggg aggcctgggg cctgcgcgcc cacgcgctgc 48660tcggccacag cctcggcgag atcgtcgccg cccacatcgc cggcgtgttc gacctccccg 48720acgcctccgc cctggtcgcc gcccgcggac ggctcatgca ggccctcccc cacggcggcg 48780ccatggcctc catcgaggcc accgagcacg agctcctacc cctgctcgac cagcacaccg 48840gacgcctctc gctcgccgcc ctcaacgctc cacgccagtc ggtcgtcagc ggcgaccagc 48900ccgccgtcga ccaggtctgc gcccacttca aggccctcgg ccggcgcgcc aagcggctcg 48960acgtcagcca cgccttccac tcggcccgca tggaacccat gctcgacgcc ttcgcccgcg 49020tcgcccgcgg cctgacctac cgcgccccgc gcctgcccgt cgtgagcaat gtcaccggcc 49080gcatggccac cgccgacgag ctcacctcgc ccgactactg ggtgcgccac gtgcgcgagc 49140ccgtgcgctt cgtcgccggc gtgcgcgcgc tgcacgccac cggcgtcgcc acctacctcg 49200agtgcgggcc cgatccggtg ctcggcggca tggccgcaga ctgcctcacc tccgacgaga 49260gccgcgaccc aggcctgatc cccagcctcc gcaaggaccg cgacgaggcc ctcgccatcg 49320cccaggccgc ctgcgccctg cacgtccgcg gacacgccct cgactggccc cgcctcttcg 49380acgccaccgg cgctcgccgc gtcgagctgc caacctacgc cttccagcgg cagcgctact 49440ggctcgagac gccccagacg ccgggcgccg acggggcctc caacctatct tcgcccgccg 49500aaagccgctt ctgggaggct gtcgagagag cggacatcat ccccctcgcc gaggcgctgc 49560gcctcgagga tgaggcgcaa cgcgcttcgc tggcgaccct gctgcccgcg ctctcgacct 49620ggcgccgccg acgccacgag cagagcaccg ccgacgcctg gcgttaccgc gttgcctgga 49680aaccccttgc catcgacgcc cggagcgatc tctcgggggt ctggctgttc ctcgcgcctc 49740cggatcacgc gaaggacgac ctcgcgcgcg cggtccttcg cgcgctcgcc gagagcggcg 49800cgacggtcgt ccctgtgctg gtggccgagg gcgacgtcga ccgcgccctc ctgagcgcgc 49860ggctgcgcga gcaggtcggc gacggcggcg cgatccgcgg cgtgatctcg ctcctcgccc 49920tggacgagac ctcgctgccg cagcacgacg ggctgccccg gggcctcgcc ttcacgctcg 49980cgctcgtcca ggccctggga gacacggcga tcgcagcgcc tctatggctg ctcacccgtg 50040gcgccgtctc cgtgggtcgt tccgaccgcc tcgagcgccc gctgcaggcg ctgacgtggg 50100gcctcgggcg cgtggtggcg ctggagcacc ccgagcgctg gggtggactc atcgatctcg 50160ccggcgcgct cgacgaaaag gcgctcaagc ggctcgtcgc cgccctcggt ggtcgcgacg 50220ccgaggatca gctcgccctg cgcccctccg gactcttcgc gcgacggctg gtcagagcgc 50280ccctgggtga agcgaccgcg gttcgcgcct ggaaggcgcg cggcaccgcg ctcgtcaccg 50340gcggcacggg ggacctgggc gcccacgtcg cccggtggct cgcccagaat ggcgccgagc 50400acctcgtcct caccagccgc cgcggacagg acgcccccgg agcggccgag ctcacggccg 50460agctcacggc gctcggcgcc cgcgtcacca tcgccgcctg cgactcgtcc gaccgacagg 50520cgctcgcggc cctgctccag cgcctgaggg ccgaaggccc ccccctccgc gccgtcgtcc 50580acgctgcggg tgtcgaccag gtcaccccgc tggccaggac cagcctggcc gagttcgcag 50640gcatcgcctc cggcaaggtc gcaggtgctc ggcacctcga cgacttgctc ggcaatgccc 50700ccctcgacgc cttcatcctc ttctcctcgg tcgcaggcgt ctgggggagc ggctttcagg 50760gcgcttacgc ggcggccaac gccttcctgg acgcgctggc cgagcagcgc cgcgccctgg 50820gctcgacggc cacgtcgatc gcctggggcc tctggggcgg caaaagcatg gccgacgacg 50880ccgccaaaga tcatctcagc aagcgcggcg tgtccccgat gccgccccag ctcgcgatcg 50940cggccctgca gcgggcgctc gaccacgacg agaccacact caccctcgcc gacgtcaact 51000ggtcacgctt tgccccggcc tttgccgccg cccgcccgcg cccgttgctg cacgatctcc 51060cggaagcccg gagcgctctc gagtccccct cgccggcgcc ccgcgaggcc gagctgctca 51120cccggctcca gggcctctcc agcaccgagc gcgtccgcca cctcgtctcc ctcgtgctgg 51180cggagaccgc cgtcgtcctc ggccatcctg acgcctcccg cctcgaccct cacacaggct 51240tcgcggatct cggcctcgac tcgctgatgg ccgtcgagat gcgccggcgg ctccagcagg 51300caacgggggt gagcctgccg gcgaccctga ccttcgacca cccctcgccc caccacatcg 51360cgaccttcct cctcgacgag gtcttcgcgc cggccctcgg ccaggccccc ggcgccgagg 51420aagacgaagc gatcgcccag gccgggctcg cctcgggcga cgagcccgtc gccctcatcg 51480gcgtggggct gcgtctcccc ggcggagcca ccgacctcga cgggctctgg cgccttctgg 51540agcaggggat cgacgttgtc ggccccgtcc ctgaagaccg cggctggagc atggacgagc 51600tctacgatcc cgaccccgac tccctcggca agagctacgt gcgcgaagcg gctttcctcg 51660atcgcatcga cctcttcgac gcgggcttct tcggcatcag cccccgcgag gcgagccacg 51720tggacccgca gcaccgcctc ctgctcgagg ccgcgtggca ggccctcgag cacgcaggca 51780tcgtcccggc ctcgctccag gactcccaga ccggcgtctt cgtgggctca ggcccgagcg 51840actacgcctt gctccacaac ccggcccagg aggatgaagc ctacaggctt acggggacgc 51900agccctcgtt cgcgccaggc cggctctcgt tcagcctggg attgcaggga ccggcgctct 51960ccgtggacac cgcctgctcc tcctcgctcg tcgcgctcca cctcgccgcc caggccctgc 52020gccgcggcga gtgcgggctc gccctcgtcg gcagcgcgca ggtgatggct gctcccgacg 52080ccttcgtgac gctctcccgc gctcgcgcca tcgctcccga cggccgctcg aagaccttct 52140ccgcccaggc cgatggctac ggccgcggcg agggggtcat cgtcttcgtc ctcgagcgcc 52200tgagcgacgc ccgcgcgaga gggcgcgacg tcctcgcggt cctccgcggc agcgccgtca 52260accacgacgg cgccagcagc ggcatcaccg cgccgaacgg cacctcccag cagaaggtgc 52320ttcgtgccgc gctccacgat gcgcggctca cgccagcgga cgtcgacgtg gtggagtgcc 52380acggcacggg cacttccctc ggcgacccca tcgaggtgca agccctggcc gccgtctacg 52440gaaaggagcg ctccgccgat cggccgctga tgctcggcgc gctcaagacc aacgtcggcc 52500acctcgaggc cgcgtccggt ctcgccggcg tcgcgaaggt cgtcgcggcg ttgcgccacg 52560aggcgctgcc ggcgacgctg cacaccgccg cgcgcaaccc tcatatccag tgggatacgc 52620tgcccgtcca ggtcgtcgac accttgcgtc cctggccgcg gcgcgaggac ggcacccccc 52680gccgcgccgg cgtgtcggcg ttcgggctct ccggcaccaa cgcccacgtc ctcctcgagg 52740aagctccgcc tgtccagccg agcacacagg cggagcagcc tgccgcgccg ccgtggttgc 52800cgctgctcct gtcgggcaag acggacgcgg ccctgcgagc gcaggccgag cggctgcggg 52860cgcacctcga cgcccatgcc gacctcgggc ttgccgacgt cgcctattcc ctcgccacga 52920cgcggacgca tttcgcgcat cgggcggtgg tcgtcgcgga cgctggcgcg accctcttcg 52980aagggctgga cgccatcgcg cgcggcaacg ccgcttccca cgtggtggtc gacgaggcca 53040agatcgacgg caagaccgtc ttcgtcttcc cgggacaggg ctcgcagtgg gcccagatgg 53100cgcagccgct gctcgagacc tccgagctct ttcgcgagcg tatcgaggcg tgcgcgcacg 53160ccctcgcgcc tcacgtcgac tggtcgctgc tcgccgtcct ccgcggcgaa gaaggcgccc 53220cctcactgga gcgggtcgac gtggtgcagc cggtgctctt cgccgtgatg gtctcgctcg 53280ctgccctctg gcgctcgatg ggcgtcgagc cggacgccgt cgtcggccat agccagggcg 53340agatcgccgc cgcctgcgtg gcgggcgcgc tgtcgctcgc ggacgccgcc aaggtggtgg 53400cgctgcgcag ccgcgcgctc gcgcggctcg ccggccgggg cgccatggcc gtcgtggagc 53460tccccgccgc cgagctcgcc gagcgcatga agcgctgggg cgagcggctg tccatcgcag 53520cgctcaacag ccctcgttcc accgtgatct ccggcgatcc ggacgccgtc gacgcgctgc 53580tccgggagct cgactcggcg gagatcttcg cccgcaaggt gcgcgtcgac tacgcctccc 53640actgctccca tgtggaggcg attcgccacc agctcctggc cgagctcgcg ggcatcgagc 53700cgctcccgtc cacgctcccg ctctactcca cggtgagcgg ggacaagctc gatggcgtcg 53760cgctcgacgc ctcgtactgg taccggaacc tccggcagac cgtccgcttc tcggacgcca 53820cgcagcggct cgtctccgcg ggacatcgct tcttcgtcga ggtcagcccg catccggtgc 53880tgacgttcgc cgtgcaggat gtcctcgatg ccgagggggt gcccgccgct gtcgtcggct 53940cgctacggcg cggcgagggc gacctgcggc ggttccttgt gtcgctgtcc gagctcttca 54000cccgcggcct cgccctggat tggtccaggg ttctgcccag cggccggcgc gtatcgctgc 54060ccacctacgc cttccagcgc gagcgctact ggctcggggc tcacagggct cgcggcaccg 54120acgcgacatc cgccggcctg gcatcggacg agcccacgcg cggcgcgtcg atgccagtgc 54180ggctctcgtt gcgggacgtg ccgcccgagg agcgccaggg agcgctggag cggttcgtcc 54240gggagcagct cgcggccgtc ctgcgcatgg atgcggcgcg gatcgagggg cagacgacga 54300tcaagacgct cgggatcgac tcgctcatgg cgctcgagat ccgcaaacgg ctggaagccg 54360gactggccgt gaccttgcca tcgacgctca tctggcagtt cccgcacgcc gaagggctcg 54420cacggcacct catgacgcgg ctccccgcgg gggacggaga aggatctgcc gtggtccagc 54480ccgtggagca gccgcgcgcg ccgaaggagg tgcccgtatc catggatccc tcggcgtggg 54540tgcaccgccc gcgccccagg gccgacgcgc gcgttcgact gttctgcctt ccctacgccg 54600gcgcgggcgc ctcgcgcttc cgggcgtggc cagagctgct cccctcctgg gtggaggtct 54660gcccgatcca gctccccggc agggaagagc gcctccacga gccggccttc gagacgatgg 54720acgcgctcgt cgacgcgctc gttcccgccg tcgaggcgca catcgatcgg ccctttgcgc 54780tgttcggctg cagcatgggt gccctcctgg ccttcgagct cgcccgggcg cttcaatccc 54840gtcatcgctt ggtggcgcgg catctgttcg gcgcggcgag ctcctcacct cggcgcgtga 54900gcccggtacg ggagcagctc tccgcggtgg tctcccctgg aacggtgcga tcggacgcga 54960tggcctcgct gcgccagctc ggtctgctgt cgtcctcgtc cctccaggac gaagagatgc 55020tggacgaggt gtggcccgcg ttccgtgcgg atctatccct gacgctgaag tacacgtgca 55080gggacgcaac ccccctcgac gcccccatct cggtcttcgg gggcaccgag gaccggaccg 55140tagggcgcga ggatctcgtc gcctggcata cgctgacgaa ggacgcgttc caggtcgcca 55200tgctgcccgg gggtcacctg ttcatggacg cgacgccgaa gcggctcttc catcacatcg 55260agcacgcgct ccagctctag tggaccgtcc gacaggccct tcgacatcgt cctcggcgga 55320gggcggcgac tccgcgcgga gagcgagccg cgatcgcgcg gcgccgtcca cgatcttcct 55380gggatttttt ttggacagtt caccagaagc tgcgggatac caaacagaag cgaccatggg 55440aagcaacgaa gggagtatcg cttgacgatc aacgacgagg tgcggaccag cgacgccgtg 55500tgggctggtg ccgcgggcta taccagggcg cgtcttcagg tctatgactt cttcatctac 55560ggcttcaaca gccctgtcgc atggaagtgc ccgggcgagg agctcctcga gaactacaat 55620cggcacgtct cgggcaatca cctcgacgtc ggcgtgggga cggggtacct gctcgaccgc 55680tgccgcttcc ccaccgccaa gccgcgtgtg tttctgatgg atctgaaccc ggacgctctg 55740caggtgacgg cgcagcgact gcaccgcttt cagcctcaga ccttgcggcg gaacgtcctt 55800gatcccatcc gcttcgacgg agagcccttc gactccatcg ggatgaacta cctcatgcac 55860tgcgtccctg gatccatccc ggagaaggcc gtgatgttcg accacctgag cgccttgctg 55920aagccgggcg gcgtgatctt cggcagcacg gtgctctcgg agggcgtgga caaggggatc 55980gtggcgcgag ccatcatgga ccgcttcaac aagaagggga tcttctcgaa cacccgagac 56040gccgcctccg atctgacgcg agcgctggag gagcgcttcg acgacgtctc ggtccgcgtc 56100gtcggctgcg tcgggctgtt ctcagccagg aagcgtacct gcgcgggaac cgagtcgccg 56160gcgtgaggtg agcggggacg gcgctcaggg cgcggcgagc ggcagcctgc gtgccgggcg 56220cgcggcctcg tgtccgtccc ccgcctcggc cacccgcccg cggtagatgc gatcgatccg 56280atcgcgcgcg atgaccaggg gcttgtcgaa ccggccaagc acgttgccct tcaggatccc 56340gcgcttgtcc gtcaagcggt ccagcaaccg catatcgagg cgcagctcga tgttcatggc 56400cacctgcatg gcgggccaga ggacggcgcc ggccccgaac ttgctccagg gcgccaggct 56460cgcgaagaga aacgtataca tctccgacga ctccgggccc accgggttga agaagaccgc 56520tgaccggagc gggaaggtga cgggctgatt ggtcttcgga tccctgaggg agtggttgta 56580gatcgtgtag accggcgaga agtaggatgt ccagtccacc acgaatatcg catcctccgg 56640gatgccgagc agcttctcca tcgcccgcgg catgggccgc ctcggacccg aatgcacgac 56700ccggatcgtt tcgtcggtca gggtcacccg cgcctcgacc tctggcatcc gctcgagcgg 56760gtagccgagc atgaagtgga cgaagggcgt gtgctcgatc tcgatgaaat tgtcgagcgc 56820cagctcgaac ggcacggtcg cgcggtggcg gaggagaccg cgcggcacat atccctcgcc 56880ctcgaggcgc gggaacgctg cctgcgaccc cgcccgcttc acccagatgg caccgtaccg 56940ctccacggcc tcgaacatgt cctcgcgccg cgcgcacggc cgcgccgccg gggtagccgg 57000gatctcgccg cggccgtcca cggcccaacg ccagccatgg taggcgcaca ccagccgatc 57060gccctcgacc cacccctcgc tcaggcgcat gctgcggtgg gggcaacgat ccgtgaatgc 57120accgaggccg cccgacgagg tccgaaacac cacgatctca tgccccgcga gccgcacatt 57180gcggggcttg cggcggagct cgtggctcag cagtacaggg tgccagtggt cgagctcagc 57240catgatcagt tcaccccttg gatgtgccgc gcaatccgcg gcgcctcggc tgcgatgtcg 57300cggatctgcc ccgtgatggg attgcggaag ccgatgaaga acagcccagg cgccggcgtc 57360ggcgcgccgt gccaccgcgg gcagccgtgc tcgtccgtgt agcgcgttgc attctcgaga 57420aaatcatcga gcccgggccg gtaccccgtg gcgagcacca cgacgtcgaa gggcagccca 57480cggccgtccg tgaacgtcac gcccgtttcc gtgaatgccc gcgggccggg caccaccttg 57540atcttgccct gctggatcag cgccaccgtg ccgatgtcga tcaacggcat gcggccttcc 57600ttcaacgccc gggtaccggg gccgaccgcg ggccgacgga tcccccagcg cgacagatcc 57660cccacggcgc gagacaggat cgcggtcgcg aggcgatccc cgacggccag cgggaggcgc 57720tcgaagaggg caagggcgtt gaactgcgca ggcagcttga acagctcgcg ggggatcacg 57780tggttgccgc tgcggaccga gagggtcgtc tccgcgcaat gctcccacag atccagcgcg 57840atctcgctgg cggagttgcc ggcgcccacc acgagcacgc gctggccccg gaattccgca 57900ccagatcggt aggcagagct atgaaggatg cgaccgcgga agcgctcctg gtcgggccag 57960gtggggacgt tgggatgacg gctgtagccg gtggccacga cgagcgcctg gctcctgagc 58020tcccccgcgt gcgttcgggt cacccaccgc gatccgtcgt ggtacgcgcg ctccacctcg 58080acacccaggc gcggctccag gcggaatcgc tcggcgtaac gctcgaggta atcgaccatc 58140tccacccggg agggatacgg cgcagaatac tcgggccagg gctgcccggg cagcgcggag 58200agctgcttga tcgtgttgag gtgcagccgg tcgtagtggc gccgccacgt ggcgccgacg 58260gcctccgact tctcgaggag aacgaacggg attccctgct cgcgcaggca tgcgcccacc 58320gctagcccag acggaccagc gccgacgata accacatggc actcttcaac gtgcacgcat 58380gaagtctaac caaaattcgc cccggatgcc aactccactt gtgcgggcgt cgcttccggc 58440aactcgtatg ctggtgagcg gcttcggatc gtgatggaaa gctctgagct cgcccgcagc 58500tccggagatc ccccgcgtct tcgcgaggag cctggcggac gcgcgcgccc cgcgagcgga 58560cacggcgacg ctacagcgcg cggacgtcac gcactcgcat gcccgacgcc cgtgccttct 58620gcctcgcccc gcgtctcgcc gaagtagatg gagcgcatca ggcggtggtt gtgcacgagc 58680gtcgcgtcgt atttgttgag ccgcatccct ttcatctcga agggcgtatc ggccacgtgc 58740gggatgaact tcacatcgtc gcggatctcc ttccaggaga gcgctatcgc cgccgatttg 58800acgaccggaa gcagcggacg gaagcgggga tcggtgatct tgacgaacag gaacgcgcgc 58860acgaacgtgg tgcgctccgt ctctggcacg aagaagatgc cggcgcgcgc cacgacagga 58920cgctccatcc cgttctgcgc cgtccaccag gacgtgtaca cggtgtagac ggggctgaag 58980cgggtcaccc actggttgtg aaatgtgtcg cctggctgga gcagcatcag ccgcgcgagc 59040gtcgaggggc gctgcggcgc cgagtacttg acctcggtgc ggtcctcgaa gacgtcgcac 59100gagaagtcga tgcgcgccgc gtcctcgggc gtccagccga ggcggccgtg aacgaacggc 59160gtgtgctcgt cctcggagga attgtcgaag atgacgtgca ggggcgccgg cgcgaggtgc 59220gagaaggtgc cggcatattc gaagccatcg ctgctgaagt cgagctcggg cagcgccgag 59280cgcggcgtat cccggtgggc tagccacagg tatccaagct gctcgacgag ctgaaaggag 59340cgtgtatcgc atcgggtgag cgacggttgc gaggggcagg ctccccgccc ctcggcgtcg 59400aaatgccacc cgtgataggg gcattccagg cgcccgtccg gccggacacg cccctgcgat 59460agcggcgcga gccggtgggg gcacgcatcg gcgagcgcgg cggggcggcc ctgctcatcg 59520cggaagagag cgtaagcatt gcccgcaagg acaacgcgaa ccggcttccg gccgagtttc 59580gaggccggca agacggggtg aaaatggcgg atgaggtcgc gagcaggcgc ggcgtgcatt 59640gcgagaccat aacacatccg cgacgccggt tggaaggagc tcccgcgcgc gcgcgacgcc 59700gatccgcttc cgaaacctcc tgcgcgatgg cgtcgagcga ccgaagtacg aggatctcct 59760atcggtaggc gacgatgcca ccgaacggcc acttcgcgtg gtcctcgggc gccggataga 59820cctcccattc ggagaacccg gccgcgcgga gcgacagctc ccactcttgc agcgtcaggt 59880aaccgacatg ctggcggcga ggcggatcga gcttggcctt gctgtaggtg tgcagcatcg 59940actgaaaaaa ttcattgggg aagaacaccc cgggccgatc gcggaacgac atggtgaacg 60000cgagctgacc gcccggcttc agcatcgtgt ggaacgcctg gagggtggcg tgaagatcgc 60060gcacgtcgta gagcacgtgc tcgaggacga tcagatcgac cgacgcggcc cgggcgaacg 60120tgctgccagc ggagggcagc gtgtccaggt ccaggcgctg gaaatgaatg cgctgaaaca 60180cgtcggccgg cgcgtgggtc cgcagccact

gcttccccgt ctccatcaac agggcgctga 60240tgtcggtgta atcgtagcgg gcgaggttct tgctcagcgg gaggaaccgc ggatcggaca 60300acgcctgccg cagcaccacg ccgagccccg cgcccccctc gaatacagag atccccggcc 60360cctctgcgag cttggccatc agcgcccgcg ccagcatcac gttgcatggc ttcttggcgg 60420gaaggctgat catcgagtat tcccagaatt tcagcgaggc ctgcatcccg tactggagat 60480ccatggtggc cagcgcgtcc ttgcccgcca gcaccggccc ggccaggccc cgatagcgct 60540ggaggaactc gaccatctcg cccaggatcg cgcggtctgc gagcgcgatg gactccttct 60600cggcgacgcg ctttcgcacc gcctcgctgg gcaccagccg cccgctgggg tcctgggtga 60660ggtctccctt gtcgctgaag tagtcgagca gcttcctgcg aaactgatag gcggtgaccg 60720acggagccga ctccggacga tcgtcgagcc cccggacagc gccgctcggg tcgacgaggt 60780gctcgagcag gatctcgctg gcaacaagct cggtctgacg acggaatgct tctatgtaag 60840cggtgtaagc gtcgttgtag agatcggtca cgtccaatcg ttgtcgcatg caggtcctcg 60900cgggtgtggc gcccatcctg cgcagcgcag ggacgaagca ggtcatggaa tggtccagct 60960cgcccgggaa cgcaaggacg gaccgtccgc tgccggcggg cgccgcgcct ccgagcgcct 61020cgcgcgccgc gcgtcacctg gagctcagcg cctgcccgtc gttcccgcgg ttcttgtgca 61080caatggcgta caggatgagc atgtaggcga agagccggaa caggtacagg taatggatgg 61140cgtcttcctc gacgcgattc agggcgacgg cgatgcggcc cagcatcatc agccagaacg 61200ccgccgagaa cttcgcgaac agccggtcgc ccgtcttctt ccagaagcgg aggaagaaga 61260gcgcgacggt cgcgtacccg aacgtcatcg aaccgatcag gaagtcgttc aaaggtccta 61320cctcgcctct acgcgcgttt actcgcgcag gtcccagatg aggccataaa ggagcagggc 61380cagcccgatg agcgcggtga ggtggcgcag cgatgataga tcgacgctcc ggatcacgac 61440gaggtccacg aagagcagga tgttgttcgc tgcgagcgcg gcgaagcaga gcccgctcca 61500caagaggaga cggaccttgc gctgcgcgta tccgcgcagg agcagcacgg cgcacgcgat 61560gctggtcagg gcgcagagga tgtagaccgc cgctgccatg gctagccgcc ctttcccttc 61620ttcgtgatca ggaatgcgtc cgagaagctc tggatgtcgc tcggcggggg cgtggcgtag 61680atgtgattga tcacgctcag ccggcgctcc ttgtacgcct gcgccaggtc gtcgatcgtc 61740cggcgggtct catcgtctgc cggggcgtac cggtagaaga tgtcctcccc gtcctcccgg 61800gccacgatca ggcccctgct ggccaggcct ccgaaccggt cctggatcga catcatgctg 61860gaccctatct cgcgcgccat cgcggccgcg ctccactcgc gctccgccgt gcgacgcatg 61920agcagaagca cttcgagttg ctcgatcgag gagatgtgcg cgccgaggaa gcgctggacc 61980cggtcgggga gcccgctaga cacgagctcc tcgccggccg agggtccctc cggtcaccgg 62040tgcaaccata gccgcagcat agcgagcagg tgctcgggat ccaccggctt cgagatgtaa 62100tcgttcgcgc ccgcctcgaa gcacttctcc cggtcgccct tcatcgcctt ggccgtgacc 62160gcgatgatgg gcagcgcatg gtgctcgggc ttcgcgcgga tggcacggat cgtgtcgtag 62220ccgtccatct ctggcatcat gatgtccatg agcacgatct cgatgtccgg cgtccgctgc 62280agcatctcga tcgccgctct gcccgtctcc acgtagaccg tcttcatctg ctgggcgtcg 62340aggatggtcg tcatcgcgaa gatgttccgg acgtcgtcgt cgacgaccag caccttcttg 62400cccgcgagca ccttgttcga ctggtgcagc tcctggaggg tctgccgctg tcgctcggag 62460agcgccgcca cagggcggtg caggaacagg gagacgtcgt cgaagagccg ctccttggag 62520cggacgtgct tgagcaccat cagctggctg aagcggctca gctgcgcctc gtccgcggcc 62580gagatctcct ccggcgcgta gaccaggacg ggcagctccg tcggcccgct gccctgcgcg 62640agctgcccga tcagatcgaa gcagcgcatg tcgggcaggt cgaggtgcag gatgaggaca 62700tcggccccct cggtgaggag cgcgtcgagc gcctcctccc cggaggccac gctccggatc 62760gtgacgtcgt cgccgccgag gagctcgacg agctcctggc gctcggcctc gtccggctcg 62820gcgagcacga ccgtccgccg gcgcgacacc atgaactgcg agaggcgcct gaaggtctcg 62880tcgagcgcgt cccgggtctt gagcggcttg cagagcaccc ccgtcgcgcc catccggagc 62940gcgcgctcgc gctcctcgtc cgtcgtgatc acctggacgg ggatgtgccg cgtcgcgagg 63000tcgcgcttca cccggtcgag cacgcgccag ccgtccatgt ccggcaggtt gatgtcgagc 63060gtgatcgcgt tcacccgccg ctcgcggacg atggagagcg ccgccccgcc gcggtaggcg 63120aggatcgcct tgaacccgtg gtcgtgcgcg acatccatga cgaagtgcgc gaagctcgcg 63180tcgttctcga cgatgagcac cacggagtcg ctgggctgga ggctcgcgct gtcgtcgacg 63240ctctggttga gcaggtgcgg cggcggctcg gccgccgacc gcggcgcgac gtcgcccgag 63300acgagggccg gcggcgccga gggcacctcc gcggcctgct ccttcctgcg cgggcgcgcc 63360ggcgtgtacg tgagcggcag gtaaagcgtg aaggtgctcc cgctccccgg cctgctcgag 63420agcttgatct cgccgccgag catccacgcg atctcgcggc tgatcgcgag cccgaggccg 63480gtgccgccgt acttccggct cgtcgagccg tccgcctgct ggaaggcctc gaagatgatc 63540tgctgcttgt cgtgcgggat gccgatgccc gtgtcccgca ccgacatggc gatcgccgcg 63600ccggcgcgcg agaggccctc gttctcgatg gtccaccccg aggtgaccag atcgacgtcg 63660agcgcgacgc tgccgcgctc cgtgaacttg aaggagttcg agagcaggtt cttgagcacc 63720tgctgtacgc gcttcgcgtc cgtgtagatg acctgcggca ggttctgcgc gaagttgagc 63780tcgaactcga gcctcttcga ctcggcgacg tgctggaacg tgcgctcgac gtagtcttgc 63840aggtcgctga acgacagctc gcccacgtcg acgatcacgg tccccgactc gatcttggac 63900aggtccagga tgtcgttgat cagcgcgagc aggtcgttgc ccgacgagtg gatcgtcttg 63960gcgaactcga cctgccgccc cgtgaggttg cggtcggtgt tcttcgagag ctgatcggac 64020aggatgagga ggctgttcag cggcgtccgg agctcgtgcg acatgttcgc gaggaactcc 64080gacttgtact tggaggtgat ggcgagctgc cgcgccttct cctcgagcgc ctgccgcgcc 64140tgctcgacct cgcggttctt ccgctcgacc tcgacgttct gctgggcgag caggcgagcc 64200ttctccccga gctcggcgtt cgtctgctgc agctcctcct gctggctctg gagctcgcgc 64260gcgagggact gcgactgctt gagcaggtcc tctgtgcgca tgttcgcctc gatcgtgttg 64320agcacgatcc cgatcgactc cgtgagctgg tcgaggaacg cctggtgggt cgggctgaat 64380cgctcgaacg acgcgagctc gatgaccgcc ttgacctgcc cctcgaagag cacggggatg 64440acgatgatgt tgaccggcgg cgcctcgccg agcccgctcg tgatgcggat gtagtcgggg 64500ggcgcgttga cgaggaggat cttctccttc tcgagcgcgc attgcccgac gagcccttcg 64560ccgagcttga aatggttgtc gacgtgcttc cgcaccttgt acgcgtagct cgcgaggagc 64620ttgaggatcg gctcctcctt cgccacgtcc atcgtgaaga acacgccctg ctgcgcgccg 64680acgaccgggg ccagctcgga caggatgagc cgaccgacag tgagcagatc cttctgcccc 64740tggagcatgc gcgagaactt ggcgaggttg gtcttgagcc agtcctgctc gctgttcttc 64800agcgtcgtgt ccttgaggtt ccggatcatc tcattgatgg tgtccttgag cgccgcgacc 64860tccccctgcg cctcgacctt gatggaccgg gtgaggtcgc ccttggtcac ggcggtggcg 64920acctcggcga tcgcgcgcac ctgcgtggtg aggttcgcgg cgagccggtt cacgttgtcg 64980gtcaggtcct tccacgtgcc ggccgcgccg gggacgctcg cctgaccgcc gagcttgccc 65040tcgacgccga cctcgcgcgc caccgttgtc acctggtcgg cgaaggtcgc gagcgtctcg 65100atcacgccgt tgatcgtgtc cgccagcgcc gcgatctcgc ccttcgcgtc gaaggccagc 65160ttgcgcttca ggtcgccgtt cgcgaccgcg gtcacgacct tggcgatgcc gcgcacctgg 65220ttcgtcaggt tgccggccat gaagttcacg ttgtcggtca ggtccttcca cgtgccggcg 65280acgccgggga cgctggcctg cccgccgagc ttgccctcgg tgcccacctc gcgcgccacg 65340cgcgtcacct ccgacgcgaa cgcgttgagc tggtccacca tcgtgtagtt gatggtgttc 65400ttcagctcca ggatctcgcc gcggacatcg acggtgatct tcttcgacag gtcgccgttg 65460gccacggccg ttgtgacggc ggcgatgttg cgcacctgcg cggtcaggtt cgacgccatc 65520gagttgacgg agtcggtcag gtccttccac gtgccggcga cgccggggac gctggcctgg 65580ccgccgagct tgccctcggt gcccacctcg cgcgccacgc gcgtcacctc cgacgcgaac 65640gagcggagct gatccaccat cgtgttgaag gtgtccttca gctccaggat ctcgccgcgg 65700acatcgacgg tgatcttctt cgacaggtcg ccgttggcca cggccgttgt gacggcggcg 65760atgttgcgca cctgcgcggt caggttcgac gccatcgagt tgacggagtc ggtcaggtcc 65820ttccacgtgc cggcgacgcc cttcacctcg gcctgcccgc cgagcttgcc ctcggtgcct 65880acctcgcgcg cgacgcgcgt cacctcggcc gcgaaggagc tgagctgatc caccatcgtg 65940ttgaaggtgt tcttcagctc caggatctcg cccttgacgt cgacggtgat cttcttcgac 66000aggtcgccgc gggccacggc cgtggtcacg tcggcgatgt tgcgcacctg cgcggtcagg 66060ttcgacgcca tcgaattgac ggagtcggtc aggtccttcc acgtgccggc gacgccgggg 66120acgctggcct ggccgccgag ctttccctcg gtgcccacct cgcgcgccac gcgcgtcacc 66180tccgacgcga acgagcggag ctgatccacc atcgtgttga aggtgtcctt cagctccagg 66240atctcgccgc ggacatcgac ggtgatcttc ttcgacaggt cgccgttggc gacggccgtg 66300gtgacggcgg cgatgttgcg cacctgcgcg gtcaggttcg acgccatcga gttgacggag 66360tcggtcaggt ccttccacgt gccggcgacg ccggggacgc tggcctggcc gccgagcttg 66420ccctcggtgc ccacctcgcg cgccacgcgc gtcacctccg acgcgaacga gcggagctga 66480tccaccatcg tgttgaaggt gtccttcagc tccaggatct tcttcgacag gtcgccgttg 66540gccacggccg ttgtgacggc ggcgatgttg cgcacctgcg cggtcaggtt cgacgccatc 66600gagttgacgg agtcggtcag gtccttccac gtgccggcga cgcccttcac ctcggcctgc 66660ccgccgagct tgccctcggt gcctacctcg cgcgcgacgc gcgtcacctc ggccgcgaag 66720gagctgagct gatccaccat cgtgttgaag gtgttcttca gctccaggat ctcgcccttg 66780acgtcgacgg tgatcttctt cgacaggtcg ccgcgggcca cggccgtggt cacgtcggcg 66840atgttgcgca cctgcgcggt caggttcgac gccatcgaat tgacggagtc ggtcaggtcc 66900ttccacgtgc cggcgacgcc ggggacgctg gcctggccgc cgagctttcc ctcggtgccc 66960acctcgcgcg ccacgcgcgt cacctccgac gcgaacgagc ggagctgatc caccatcgtg 67020ttgaaggtgt ccttcagctc caggatctcg ccgcggacat cgacggtgat cttcttcgac 67080aggtcgccgt tggcgacggc cgtggtgacg tcggcgatgt tgcggacctg cgcggtcagg 67140ttcgacgcca tcgagttgac ggagtcggtc aggtccttcc acgtgccggc gacgcctgtc 67200acctcggcct gcccgccgag cttgccctcg gtgcctacct cgcgcgccac gcgcgtcacc 67260tgggccgcga aggagcggag ctgatccacc atcgtgttga aggtgttctt cagctccagg 67320atc 6732323228DNASorangium cellulosum 2atgcccgaca cgtcgtcgtc gagccccgta atggcgatgg ggctatcgga ctcgaaagcc 60cggtccgtgg aggatgcacg gcctgcctcg gggcttcctc gtccacccgc gggcatcgct 120gtggtgggaa tgggatgtcg cttccccggc ggcatcgatt cgcccggatc cttgtgggcg 180gccctatctc aagggcgcga ccttatcagc gaggtcccgc cggaccggtg ggatgtcaat 240gcccactacg acgccgacgc aagcgtcccc gggaagattg cgacccgcca tggcggcttc 300ctcgccgggg tcgcggcgtt cgacgcgcct ttcttcgacc tctcgccgcg cgaagcgaag 360catatggatc cgcagcagcg cctcggcctc gagacggcgt gggaggcgct ggaggacgca 420ggcctggacg cgaggagctt gcggggcagc cgggcagggg tgttcgtcgg ctcgatgtgg 480gcggagtacg acgtgctcgc gtcgcgacat cccgaatcca tctcgccgca cggggccacg 540gggagcgacc cggggatgat cgctgcgcgc atcgcctaca ccttcggcct tcgtgggccg 600gccttgtcgg tgaatacggc gtcgtcgtcc tccctcgtgg cggtgcatct cgcattgcag 660agcttgcaga gcggagagtg cgagctcgcg ctggccggcg gcgcgaacct catcctgacc 720ccatacaaca cgatcaagat gacgaagctc gggacgatgt cgcccgacgg ccggtgcaag 780gcgttcgacc accgcgccaa cggctacgtg cgcgccgagg gcgtcgggtt cgtggtcctg 840aagccgctgt cgcgagcgac cgcggacggg gatcggatct atgcggtcgt gcgtggctcg 900gccgtgaaca acgacgggct caccgacggg ctgaccgcgc cgagcgggga ggcgcaggag 960gccgtgctgc gagaggcgta tgcgcgcgcc ggggtgtctc ccgccgaggt ggactacgtc 1020gaggcgcatg ggacgggaac gccgctcggc gaccgcgtgg aggcgacggc gctgggacgg 1080gtgctcggcg caggacgcgc ggcggatcgc gcgctgcggg tcggttcggt caagacaaac 1140ctcggtcacg cggaggcagc cgccggggtc atcggtctga tgaagacagc gctgtcgctg 1200cgtcacgggt cgcttccggc gagcctgcac gtcgagcgcc cgaaccccga gatacccctc 1260gaatcgctgg gcctccggct ccagacggcg cacggcgtgt ggccggaggt cgatcggccc 1320cggcgagcag gcgtgagctc attcggcttc ggcggcacga actgccatgt ggtgatcgag 1380gagtggcgcg ggggcctcca gcagagcgcc gccgaggcgg gcagcgaccc cggcgccgcc 1440gtaccgccgc ctggccttcc ccttgtgctg tcggcgaggg accacggggc gctgcgggcg 1500caggcgggcc ggtgggcggc gtggctcacg gagcaccgcg aggcgcgctg ggcggacgtc 1560gtccacacgg cggcagtgcg gcggacgcac ctgggcgctc gggccgcggt gatggcggcg 1620ggcgtggccg aggccgtcga tgcgctgaag gccctggccg acgggcgcgc ccacggggcc 1680gtgacggtcg gcgaggcgcg cgagcggggc aaggtggtct tcgtgtttcc gggccagggc 1740agccagtggc cggcgatggg gcgagcgctc ctgtccgcgt cgaaggtgtt cgccgaggcc 1800gtcgaggcgt gcgacgcggc gctgaggccg ctgacgggct ggtcggtgct ctcgttgctg 1860cgcggcgacg ccggggaggc agcgccgtcg ctcgaccgcg tcgacgcggt gcagccggcc 1920ctgttcgcga tggctgtcgg cctggccgct gtctttcgcg cgtggggcct cgatccttcg 1980gccgtggtgg gccacagcca gggcgaggtc ccggcggcgt acgtcgcggg ggcgctctcg 2040ctcgacgacg cggcgcgggt cgtggcggtc cgaagcgcgc tcgtgcggcg gctcgcgggc 2100gcaggggcga tggcggcggt ggagctgccg gccggcgagg tggagcgccg cctggcgccg 2160ttcggggggg ctctggccat tgcggtggtc aacacgtcga gctcgacggc cgtttctgga 2220gacgccgagg cggtggacag gctggtcgcg cagctcgagg ccgaaggcat cttctgccga 2280aaggtgaacg tcgattacgc atcccacagc gcgcacgtgg acgtcgtgct accagagctc 2340ctggagcgcc tggcgccggt ccggccaggg gccacgagga tccccttcta ttcgacagtg 2400accggcggtg tgctggaggg gacggcgctc gacggggcgt actggtgccg caacctgcgc 2460cagccggtgc ggctggaccg cgcgctcgcc cggctgctgg acgacgggca tggcgtcttc 2520gtggaggtca gtgcgcaccc ggtgctggcg tcgccgctga ccgcggcgtg cgccgagcgc 2580gagggcgtgg ttgtcggcag cttgcagcgc gacgacggcg ggctcgcgcg gctgctcggc 2640tcgctgggcg cgctgcatgt gcagggccag ccggtcgact ggcgcgcggt gctggcgccg 2700ttcggcggca gcctggtgga cctgccgacc tatgcattcc agcgccagcg ttactggttc 2760gatacggatg agagcgtcgc cctcgcagcg gcgtccagcg tcgcggaaga gtcgtggtca 2820gaaaagctgg ccgggctgtc ttccgcgcga cgggaagaac ggctgctcga atgggtgcgc 2880gcagagattg cagcggtgct cgggctggag gcgccggcgg tgccgccaga cgtcttgctg 2940cgggatctcg gattgaaatc gccgatcgcc gtggagctgg ggagccggct gggacgcagg 3000acacgccgga agctgcccgt gaccttcgtt tacaaccacc cgacgccacg agcgatcgct 3060cgcgccctcc tggagggaat gttttcctcg atcaaggact ctgcttcgag cgccgctgac 3120gaccgccgcc cgccgggggt gctcgaagac gttgcccccc cacaggcgct cgagacgtcc 3180gagatgtccg acgatgagct gttccagtcc atcgatgcgc tcgtctag 3228311040DNASorangium cellulosum 3gtggatcgaa gcgataaact gcgtgcgtat ctggagaaga ccacggcctc gctggtcgag 60gcgaagggcc ggatccggga gctggaagcg cgttcgcgcg agccgatcgc gatcgtggcg 120atggcgtgcc ggtttccggg cggcgtcgac agccccgaga agctctgggc cctgctggac 180gaggagaggg acgccatcac cgaggtgccg ccctcgcgat gggacctcga gcgcttctat 240gaccccgatc cggacgccgc gggcaagacc tacagccgct ggggcggctt cgttggcgat 300ctggaccgtt tcgacgcggc gtttttcggg atcagccccc gcgaggcccg gagcatcgac 360ccgcaagagc gctggctgct ggagaccacg tgggaggccc tcgagcgggc cggcgtgcgc 420gcagacacgc tggaagggac cctggggggc gtttacatcg gcctgtccgg ctcggagtac 480cagacggagg cattccacga tgcggagcgc atcgacgcct attcgctgac cggcgcttcg 540ccgagcacga ccgtggggcg cctcgcctac tggctcgggc tacgaggccc cgcggtcgcc 600gtggacaccg cgtgcagctc ctcgctcgtc gcggtgcacc tggcctgcca ggcgctgcgg 660aacggggagt gcgattttgc gctggcaggc ggcgtcaatg cgctcctggc ccccgagagc 720tatgttgcct tctgccgcct cagggcgctg tcccccaccg ggcggtgcca gaccttctcc 780gcggacgccg atggctacgt gcgcgcggaa gggtgcgggg tgctgctgct caagcgtctg 840tcgcacgcgc agcgggatgg agaccgtgtg ctcgcggtca tccggggcaa tgccatcaac 900caggacggcc gcagccaagg gttgacggcg ccgaacgggc tcgcgcagga ggacgtcatc 960cgcagggcgc tgtcgcaagc cgccgtggag ccgacgaccg tcgatgtggt cgaatgccac 1020gggaccggca cggcgctcgg cgatccgatc gaggtccagg cgctcggggc tgtttacggc 1080gatgggcgcc ccggagacag gccgctcgtg atcggctccg tcaagacgaa catcggtcat 1140accgaggcgg ccgcgggcat ggccggcctc atcaaggccg tcctttcgct gcagcacgcc 1200caggtccctc gatcgctgca cttcgcggcg ccgagccctt acatcccctg ggataccctc 1260cccgtccgcg tggccgcgca gcgcgtcgca tgggagcggc gcgagcaccc gcggcgcgcc 1320gggatctcct cgttcgggat cagcggcacc aacgcgcacg tgatcctcga ggaggcgccg 1380gaagcgccgg cgacggcgcc ggaggcggcg gcggtgacgt cgacgctgcc gttgcttgtg 1440tcggggcggg atgaggcggc gctcagggcg caggcggagc ggtgggcggc gtggctcgcg 1500gcgcacccgg aggcgcgctg ggcggacgtg gtgcacacgg ccgccgtgcg gcgcacgcac 1560ctggaggcgc gcgcggcggt ggccgcgggg aacgccgccg acgccgccgc ggcgctgggg 1620gcgctggccg ccgggcagcc gcacaaggcg gtgtccctgg gcgaggcgcg cgcgcgcggc 1680gatgtcgtgt tcgtggttcc gggccagggg agccaatggc cggcgatggg gcgggcgctg 1740ctggccgagt ccgaggtgtt tgccgccgct gtcgcggcct gcgacgcggc gctgcggccg 1800ttcacgggct ggtcggtgct ctcggtgttg cgcggggagc agggcgaggc ggtgccgccc 1860gccgaccgcg tggacgtggt gcagccggcg ctgttcgcga tggccgtggg gctctcggcg 1920gtctggcggg cgtggggcat cgagccctcg gcggtggtcg gccacagcca gggcgaggtc 1980gcggcggcgt acgtcgccgg ggcgctgacg ctcgaggacg cggcgcgggt ggtggcgctg 2040cgcagccagc tcgtgcggcg catcgccggc ggcggcgcga tggccgtgat cgagcgcccc 2100gtcggcgagg tggagcagcg gctttctcgg ttcggagggc agctctcggt ggcggcggtg 2160aacacgccgg gctcgacggt ggtgtccggg gacgccgcag cggtcgatcg tttgctggcc 2220gagctggaga ccgcgcgggt gttcgcgcgg cggatcaagg tcgattacgc gtcgcacagc 2280gcgcacgtgg acgcgatcct gccggagctc gaggcctgcc tggcctcggt cgagccccgt 2340acctgcgcca tcccgctgta ctcgacggtg acgggagaag tgctcgccgg cccggagctc 2400ggcgcgacat actggtgccg caacctgcgc gagccggtgc ggctcgaccg ggcgctctcg 2460cggctgctgg cggacgggca cggggtgttc gtggaggtca gcgcgcatcc ggtgctggcc 2520atgccgctgt cggccgcgag cgccgagcgc ggcggcgtgg tggtgggcag cctgcagcgc 2580gacgacggcg gtctggggcg gctgacgtcg atgcttggcg cgctgcacgt gcacggccac 2640gccgtgagct ggcagcgggt gctggcgccg tacggcgggg cgctcgtggg cctgccgacg 2700tacgcgttcc agcgccagcg ccactggctc gaggcgccgc ggtacgcggc ggaggatacg 2760gacggcgcgg cgcggcgcga cccgctgtac cgggtcacgt ggatcgaggc ggcgctggaa 2820gaagcgccgt gggcgcccga gcgccacgtc gtgctcggcg ggggcggcgc gctggcggcg 2880gggctggggg cgctcgcgct ggcggggctg ccggagctgc tcgaggcgct ggagaacagg 2940gcggcggcgc ccgagcggct ggtgctggac ctgacggagg gccgcccagg cgcggtggcg 3000gagtccgtgc acgccacgac gcgcgacgcg ctcgcgctgg tccaggcatg gcttgcggcg 3060ccgcggctct cgggcaccga gctggtcgtg gtgacgcggg aggcggtggc ggccggcccg 3120gacgagggcg tggcggcgct gggccccgcc gctgtctggg ggctgctgcg cacggcccgc 3180gtcgagcacc ccgagcgcgc ggtgcgcgcg gtggatctgg ggcgcgagcc gctggacgtc 3240gcggtcttgc ggcgggcgct gggggcggtg gccgagccgg agctcgcgct gcgcgcgggc 3300ggggcgcggg ctgcgcgcct gcgcgctgtc gacgccggcg cgggcgccag ggagccggcg 3360gctgcgctgg acccgcaggg cacggtgtgg atcacgggcg gcaccgggga gctggggcgg 3420cagatcgcgc ggcacctggt cgcggcgcac ggcgtgcggc acctcctgct gacgtcgcgg 3480cggggcgcgg ccgcgccgga cgccgaggcg ctcgtcgagc agctgcgggc cgacggcgcc 3540gagacggtcg aggtcgtggc gtgcgacgtg acggacggcg cggcgctttc ggcagcagtc 3600caggcggctg cggcaaggca cccgctgacg gccgtggtgc acaccgccgg ggagctggcg 3660gacggggtgc tcacggggct gacggcggag cagctcgcgc gggtgctggc gccgaaggtc 3720gacggggcgt gccacgtgta cgccgccgcg caggaccagc cgctcgcggc cttcgtgctg 3780ttctcctcga tcgtgggcac gctgggcaac gcgggccagg cgaactacgg ggccgccaat 3840gcgttcctgg acgcgttcgc ggcgcagctt cgcgcgcgcg gcgtgccggc gacgagcctc 3900gcgtggggct tctgggagca ggcagggctc ggcatgacgt cgcacctcgg cgcggccgac 3960ctggcgcgcc tcaggcggca gggccttgcg ccgctgtcgg tcgcgcaggg cctgcgcctg 4020ctcgaccggg cgctcgcgcg cgcggaggcg acgctggtgc cggcggcgct cgatcttccg 4080gcgctccagc gtgcggcgag cgacgccgga cgggtgcctc cactgctgcg cgggctggtg 4140cgcacgagtc ccggccgccc cacggcgacc gcgacccccg aggccgggcc ggcggcgtcg 4200gcgctgcgcg cacggctctc ggcgttgccc gaggccgagc ggccgggcgc gctgctggat 4260ctggtgcgca cggaggtggc ggtcgtgctg cagctggcag ggccggcgca ggtgcccgcg 4320gacaagccgc tgaaggagct ggggctcgat tcgctcacgg ccgtcgagct gaggaaccgc 4380ctcggcgcgc gcgccgagac ggtgctgccg acgaccctcg cgttcgacca tccgacgccg 4440cgcgcgatcg cggatctgct gcttcagcgt gcgttctcgg agctcgcggc ggcgaaggcg 4500acgcgcgcgc ggggagcgca cgacgagccg atcgcgatcg tgtcgatggc gtgccggctc 4560ccgggcagcg tcgatacccc cgcggcgctg

tggaagctcc tggcggaggg gcgggacgcg 4620atcgggccgt tccccgaggg gcgcggctgg gacgtggcgg ggctgtacga tccggacccg 4680gatgtgccgg gcaagtcgat caccacgcaa ggcggcttcc tctacgacgc cgaccgcttc 4740gatccgacgt tcttcggcat cagcccgcgc gaggccgagc gcatggaccc gcagcagcgt 4800ctgctgctcg agtgcgcctg ggaggcgctc gagcgcgcgg gcctggcgcc ccacgcgctc 4860gaggcgagcg ccaccggcgt cttcgtcggg ctcgctcacg gtgactacgg cgggcggctc 4920ttgcagcagc tcgagtcctt cgacggccac gtcctcaccg gcaacttcct cagcgtcggc 4980tcggggcgca tcgcgtacac gctggggctc cgcggccctg cgatgaccgt cgacacggcg 5040tgctcgtcgt cgctcgtggc ggtccacctc gcgtgcatgt cgctccgcgc gggcgagtgc 5100gacatggcgc tcgccggcgg cgccaccgtg atggccacgc cgatgatctt cgtcgagttc 5160agccgccagc gcggcacggc gctggacggt cgttgcaagg cgttcggcgc cggggccgat 5220ggcgccggct ggtcggaggg gtgcgggatc ctggcgctga agcggctgtc ggacgcgcag 5280cgcgacggcg accgcgtcct ggcggtgatc cgcggctccg ccgtcaacca ggacggccgc 5340agccaggggc tcaccgcccc caacggcccg gcccagcagg acgtcatccg ccaggccctg 5400gccgcggcgg ggctcacgcc cgccgacgtc gacgccgtcg aggcgcacgg caccggcacg 5460cgcctcggtg accccatcga ggcgcaggcg ctgctggcga cctacggcgc cgcgcacaca 5520gcggagcggc cgctctggct cggctcgctc aagtcgaacc tcgggcacac gcaggtcgcc 5580gcgggcgtgt cggggctgat gaagctcgtg ctggccttgc agcacgcaga gctgccgagg 5640acgctgcacg ccgacccgcc ctcgccgcac gtcgactggt cgcaggggca cgtcaagctc 5700ctgaacgagc ccgtgccgtg gccgcgcacc gacaggccgc ggcgcgcggc ggtctcgtcc 5760ttcggcatca gcggcaccaa cgcgcacgtc atcgtcgagg aggcgccggc cgaagcgccg 5820gcgacagcgg cggacgcaaa gtcggtggag gcgcttccga tcctgccgct gctggtctcg 5880gggtccgacg agccggcgct gcgcgcgcag gtgcggcggc tggtggagca cctgcggtcg 5940cacccggacg agcggctgct ggacgtggca gcgagccttg cgaccacgcg cgcgcatctc 6000gcgatgcggc tcgcgctgcc cgtctcggca ggggcgcccc gggatgcgtg ggtggatgag 6060ctggaggcat ttgccagggg aggagcggct ccgacgcagg catcgcagac ccccgccgag 6120agcagcgcgg gcaaggtcgc ggtgctcttc accggccagg gcagccagcg cgccgccatg 6180gggcgcgccc tgtacgccac ccaccccgtc ttccgcgccg cgctcgacgc cgcatgcgcc 6240gagctcgacc gccacctcga caggcccctc cacagcgtcc tcttcgcaga cgccggcacc 6300gaggccgccg cgctgctcga ccagacagga tgggcacagc ccgccctgtt cgctctcgag 6360gtcgcgctct accgacagtg ggaggcctgg ggtctgcgcc ccgagctgct gctcggccac 6420agcatcggcg agctcgccgc cgcccacgtc gccggcgtgc tcgacctccc cgacgcctcc 6480gccctggtcg ccgcccgcgg acggctcatg caggccctcc cccacggcgg cgccatggcc 6540tccatcgagg ccaccgagca cgagctccta cccctgctcg accagcacac cggacgcctc 6600tcgctcgccg ccctcaacgc tccacgccag tcggtcgtca gcggcgacct gcacgccgtc 6660gaccaggtct gcgcccactt catcgccctc ggccgacgcg ccaagcggct cgacgtcagc 6720cacgccttcc actcggcgca catgcagccc atgctcgacg ccttcgccag cgtcgcccgc 6780ggcctgacct tccacccgcc acggctgccc atcgtcagca gcgtcaccgg cgcacgcgcc 6840accaccgacc agctcacctc gcccgactac tgggtgcagc aggtgcgcga gcccgtgcgc 6900ttcctcgacg ccatgcgctc cctgcacgcc gccggcgccg ccaccttcgt cgagtgcggg 6960ccgcacggcg tgctcaccgc cgcaggcgcc gagtgcctcg ctcccgaggg cgctcgcgac 7020gccggcttcg tcaccagcct ccgcaaggac cgcgacgagg ccctcgccct ggtccacgcc 7080gcctgcgccg tccatgtccg cgggcacgcc ctcgactggc tccgcttctt cgacgccacc 7140ggcgctcgcc gcgtcgagct gcccacctac gccttccagc gacagcgcta ctggctcgag 7200gcgccaaggc ctcgccccag cctcgagggc gtcggcctca ccgccgcaaa ccacccatgg 7260ctcggcgccg ccgtgcgcct cgcagaccgc gatggctacg tcctcagcgg ccgcctctcc 7320accatcgacc acccgtgggt cctcgaccac gtggtgctgg gcacggcgct gctcccgggc 7380acgggcttcg tcgagctggc gtgggcggcg gcagaggcgg tcgggctgcc cggggtatcg 7440gagctggcga tcgaggcgcc gctggcgctc ccggcgcgcg gggcggtggc gctgcagatc 7500gcgatcgagg cgccggaccc ggcggggcgc cgcggcgtcg cgatctacag ccgccccgac 7560ggcgcagccg acgcgccctg gacagcgcac gcgcgcggcg tgctgggcgc cgcggcgccc 7620gacagggacg cggcgtgggc acagggcgcg tggccgccgc cgggggccgt gcctgtcgat 7680gtgacgcagc ggatcgagat cgtggacgcg tgggtcggcc cggcgttccg gggcgtcacc 7740gcgctgtggc gcgtcgggcg gacgatctac gccgacgttg cgctgccgga cggtgtggcg 7800agcacggcgc aggacttcgg gctgcatccg gccttgctcg atgtggcgct acgcgcgttc 7860ctgagagcgg agctcggcgc cgatccctcg ccacgggagg gcacggtggt gccgttcgcg 7920tggtcggacg tggtgctcga ggcgcgtggg acggcggcgc tgcgggtgcg cgtggaggtg 7980gcggccgatg gggacggcga cgcgatcacg gcgtcgatcc agctggccga cgggcagggc 8040cgccccgtcg cgcgggtggg cgcgctccag atgcggtgga cgacggccga gcgggtgcgc 8100gcggccgcgg gcgcggcgga gcgcgatctg taccgcgtcg cgtggacgga cgtggcgctg 8160gacgacgcgg cgtttgcgcc ggaggagcac gtcgtggtcg gcggcgacgg cgcgctggcg 8220gcggcgctcg gtgcacgcgt ggtggcgggg ctgcccgagc tgctcgcgtc gctgccggac 8280ggcgcggcgg cgccacgccg gctggtggtg gacctcacgg cggacgccgc gggcgcggtc 8340gtcgacgccg tgcacgccgc agcgcgcgac gcgctgtccc tggtgcaggg atggctggcg 8400gcgccgcagc tggcggcgac ggagctcgtg gtcgtgacgc gcggcgcggt ggcggtcgcg 8460ccggacgagg gcgtggcggc gctgggcccc gcggcggtct gggggctgct ccgcgcgacg 8520cgcgtcgagc atgcggatcg cacggtccgc gtgctcgatc tggggtccgc ggcgccggac 8580atgacgctct tgcgccgggc gctcacggcg gccgaggagc cagagctcgc gctgcgcgcg 8640ggcggggcgc gggcgccgcg cctcgacgcg gccagcgaga ccgaaggaga gctggcgccg 8700cccggcgggg cgcgctctct tcgcctgtcc atccggacga agggctcgtt cgacgcgctc 8760cacctcgcgg acgctcccga tgcgctgcgc ccgctcgggc cggggcaggt ccggctcgct 8820gtccgcgcca cggggctcaa cttccgcgat gtcttgaacg tcctggggac gtaccgcggc 8880gaagcggggc ctctcggtct ggagggggct ggggtggtgc tggacgtggg cgagggagtc 8940accgcccttc gacccggcga ccgggtgatg ggcatgctgc acgcgggcat ggcgacccat 9000gcggtcgtcg acgcccggct gctgacgcac atcccgcggg ggctttcctt cgtggaagcg 9060gcgacgattc cagcggcctt cctcaccgct ctgtacgggc tgcgcgacct cggcgcgctg 9120aaggcggggc agcgcgtgct ggtgcacgcc gccgccggcg gggtgggcat ggcggcggtc 9180cagcttgcgc gcctctgggg agccgaggtg ttcgcgacgg cgagcgaggg caagtggccg 9240gcgctgcgtc ggatggggat cgaccaggcc catatcgcct cgtcgcggac cctccacttc 9300aggaaagcct tcctcgatgc aacgcaggga cagggcgtcg acgtggtgct cgacgcgctc 9360gcgggcgagt tcgtcgacgc ttcgctcgac ctgctcccgc gcgggggcgc gttcgtggag 9420atgggcaaga gcgatgtgcg ggatcccgag cgcgtcgcca aggaccaccc ccgcgttcgc 9480tacacggcct tcgatctgct cgacgcgggg ccagaccaca tccaggcgat gctgcgggag 9540ctcgtcccgc tgttcgagga gggcgtcctc gctccccttc cctccgtggc ctacgacctg 9600cgtcgcgccc cgcacgcctt ccgctccatg gccaacgcac gccacatagg caagctcgtg 9660ctggtgccgc ccgcgacgct cgaccctgac ggcacggcgt tgatcacggg cggcacggga 9720gagctcgggc ggcagatcgc gcggcacctg gtggcggcgc acggcgtgcg ccacctggtg 9780ctgacgtcac ggcgcggcat ggacgcgccc gacgccgcag cgctggtgga atcgctgcgc 9840gcggcgggcg ccgcgacggt ggaggtcgcg gcgtgcgatg tgacggaccg tgacgcgctg 9900gcggccatcg tgcaggcgat ccccgcggcg cgcccgctga ccgccgtcgt gcacacggcc 9960gccgtgctgg acgacggcac cgtggcgggg ctctcggccg agcagctcgc gcgcgtgctg 10020cggccgaagg tcgacggcgc ctggcagctc tacgaggcga cgagggacgc gccgctcgcg 10080gcgttcatgc tcttctcgtc ggtcgccggc acgctgggca gctcggggca ggcgaactac 10140gccgccgcga acgcgttcct cgacgggctg gcggcagagc tccgcgcgcg cggcgtgccg 10200gcgatgagcc tcgcgtgggg cttctgggag cagggcggga tcgggatgac ggcgcacctc 10260ggcgccgccg atctggcgcg gctgaagcgg cagggcatcg tgccgatgac ggtcgcgcac 10320ggcctgcggc tgctcgaccg cgccctcgag cgcccggacg cggcgctggt gcccgcctcc 10380ctggacatgg cggtgatcca gcggacggcg agcgaccacc gtcaggtgcc gcccatgctg 10440cgcgggctgg tccgcgtcgc gccgcggcag gcggcagggg cagccagcgg caggagccat 10500gaggcctcga ccctgcggca gcagctcgcc gcgctgcccg aaccggagcg gcagcgagcg 10560ttgctcgatc tggtccggac cgaggcagcc gccgtccttg tgctgcgcgg gccggacgct 10620gtccccgccg acaagccgct cagggagctc gggctcgact cgctcacggc agtggagctc 10680aggaatcggc tcaggacccg tgcgcagacc gatctcccat cgaccctcgc cttcgactac 10740ccgacgccga aagcggtcgc cgtgtatctg gcccaggagc tcgaccttca cgacgtcatg 10800acggagatgc gcggaccgag cttgcgctct gacgacgagc tcaagtcggc catcgcgagc 10860atccggatct cgacgctacg ccaggcgggg ctgctcgaca gcctgcttcg gctcgccgcc 10920agcgaagccg tctccacatc cagcgacacg acacctgaaa ccgacgagct gacgctgcag 10980catgttggag acgatgagct ggcacggctt gtcttcgacc tcgccggagg agcgcaatga 11040410965DNASorangium cellulosum 4atgaaagaag agatctccgc ccgtcaagct ctcgagaaga gcttcattga acttcgccgt 60atcaagcggg agctcgatca gctcaaggcg aagtcgagcg agccgatcgc gatcgtgtcg 120atggcgtgcc ggctcccggg cggcgtcgat acccccgcgg cgctgtggca gctgctctcg 180gaggggcggg acgcgatcgg gccgttcccc gaggggcgcg agtgggacgt ggcggggctg 240tacgacccgg acccggacgc gccgggcaag tcgatcactg cgcaaggcgg cttcctctac 300gacgccgacc gcttcgatcc ggcgttcttc gccatcagcc cgcgcgaggc cgagcggatg 360gacccgcagc agcggctgct gctcgagtgc gcctgggagg cgctcgagcg cgcgggcctg 420gcgccccacg cgctcgaggc gagcgccacg ggcgtcttcg tcgggctgtc ggtcacggac 480tacggcgggc ggctgctgca cgatcccgag gccctcgacg gctacatcgc caccggcacc 540ctgcccagcg tcggctcggg gcgcatcgcc tacacgctgg ggctccgcgg ccccgcgatg 600accgtcgaca cggcgtgctc gtcgtcgctc gtgtcgctcc acctcgcgtg catgtcgctc 660cgcgcgggcg agtgcgacat ggcgctcgcc ggcggcgcca ccgtgatggc cacgccgatg 720gccttcatcg agttcagccg ccagcgcggc acggcgctgg acggtcgttg caaggcgttc 780ggcgccgggg ccgatggcgc cggctggtcg gaggggtgcg ggatcctggc gctgaagcgg 840ctgtcggacg cgcagcgcga cggcgaccgc gtcctggcgg tgatccgcgg ctccgccgtc 900aaccaggacg gccgcagcca ggggctcacc gcccccaacg gcccggccca gcaggacgtc 960atccgccagg ccctggccgc ggcggggctc acgcccgccg acgtcgacgc cgtcgaggcg 1020cacggcaccg gcacgcgcct cggcgacccc atcgaggcgc aggcgctgct ggcgacctac 1080ggcgccgcgc acacagcgga gcggccgctc tggctcggct cgctcaagtc gaacctcggg 1140cacacgcagg ccgccgcggg cgtgtcgggg ctgatgaagc tcgtgctggc cttgcagcac 1200gcggagctgc cgaggacgct gcacgccgac ccgccctcgc cgcacgtcga ctggtcgcgg 1260gggcacgtca agctcctgaa cgagcccgtg ccgtggccgc gcaccgacag gccgcggcgc 1320gcggcggtct cgtccttcgg cttcagcggc accaacgcgc acatcatcat cgaggaggcg 1380ccggcggcct ccgccgaggc gacgagccgc ggggagaaga cgtccgcggc cgcgccgccg 1440tcgatgatgc cgctgctggt ctcgggggtg gacgaggcgg cgctacgagc gcaggcgggg 1500cggtgggcgg cgtggatcga ggcgcacccg gaggcaggct gggcggacgt tgtgtacacc 1560gcggcagcgc ggcggacgca cctgggggcc cgtgcggcgc tgacggcggc ggacgcggcc 1620ggcgctgtcg cggcgctgac ggcgctctcg caagggcagc cgcacgccgc gctcgccgtg 1680ggcgaggcgc gcgctcgggg gaaggtcgcc ttcgtgtttc cgggccaggg cagccagtgg 1740ccggcgatgg ggcgggcgct gctctcgcag tcggaggtgt tcgccgcggc ggtcacggcg 1800tgcgacgcgg cgctgcggcc gttcaccggc tggtcggtgc tctcggtgct gcgcggcgac 1860tcgggcgcgg aggtgccgcc gctggagcgc gtcgacgtcg tgcagccggc gctgttcgcg 1920atggcggtgg ggctcgccgc tgtgtggcgc gcgtggggcc tcgagccgtc ggcggtggtg 1980ggccacagcc agggggaggt cccggcggcg tacgtcgcgg gggcgctgtc gctcgaggac 2040gcggcgcgga tcgtggcgct gcgcagccag ctcgtgcggc gcctgtccgg ggctggcgcg 2100atggccgtga tcgagcgccc ggtaggcgag gtcgagcagc ggctctcgcg gttcggcggc 2160gcgctgtcgg tggcggcggt caacacgccg cgctcgacgg tggtgtcggg agatatcgag 2220gcggtcgacc gcctgctggc ggagttcgag ggcgagcagg tcttcgcgcg gaaggtcaac 2280gtcgactacg cgtcgcacag ccgacacatc gacgggctgc tgccggagct ggagaacggc 2340ctgggcgcgg tgcggccgcg cgcgagcacg atcccgttct actcgacggt gaccgggacg 2400gtgctgacgg gcgcggagct ggacgccgcg tactggtgtc gcaacctgcg cgagccggtg 2460cggctcgacc gggcgctctc gtggctcctg gacgacgggc acggcctgtt cgtcgaggtc 2520agcgcgcacc cggtgctgac gctgccgctc acaggagcga gcgcggcgag cggcggtgtg 2580gttgtcggca gcctgcagcg cgacgacggc gggctcgggc ggctcctggg ggtgctggcc 2640gcgctgcacg tgcacggcca cgacgtcgac tggcgcgcgg tgctggctcc gtggggcgga 2700ggcgtggcgg acttgccgac ctacgcgttc cagcggcagc gctactggct cgaggcaccg 2760cgcggccggg cagggctgga gagcggaggg ctcctggccg tgaatcaccc gtggctcagc 2820gcggcggtgc ggctggccga ccgcgacggc tatgtgctga gcggacggct gtcgacggtc 2880gagcacgcgt gggtcctgga ccacgtggtg ctgggcacgg tgatcctccc gggcacggcg 2940ttcgtcgagc tggcgctcgc ggcggccgat gcggtcggac tgccctcggt gtcagagctc 3000acgatcgagg cgccgctggc gctgccggcg cgaggggcgg tggcgctgca ggtgacggtc 3060gaggcgccgg acgcgacggg gcggcggggc ttcgcggtct acagccggcc cgacggcgcg 3120cacgacgcgc cgtggacggc gcacgcgcgc ggcgtgctcg gcgcagcgcc cgcggcggcc 3180acgacggcgt gggcggcggg cgcgtggccg ccggcggggg ccgagccggt cgacgtcacg 3240cggtgggtcg aggcgctgga cgcgtgggtc ggcccggcgt tccggggcgt gacggcggcg 3300tggcgcgtgg ggcggtcgat ctacgccgac ctggcgttgc ccgagggggt ctcggagcgg 3360gcgcaggact tcggcctgca tccggccttg ctcgatgcag cgctccaggc cctcctgagg 3420gcggagctcg gcgcaggcgc gtcgccgcgg gagggcatcc cgatgccctt cgcgtggtcg 3480gacgtggcgc tcgaggcgcg gggggcagcg gcgctgcggg cgcgcgtgga ggtcgaggac 3540gccagcgatg gggaccagct cgcggcgtcg atcgagctgg ccgacgcgca ggggcagccg 3600gtcgcgcgcg cagggacgtt ccgggcgcgg tgggcgacgg cggagcacgt gcgcatggct 3660gcggcgggct cgagcgagcg tgacctgtac cgggtcacgt gggcggacgt ggtgctggaa 3720gaagcggcgt gggcgccgga ggagcacgtc gtgctcggcg gcgacggcgc gctcgcggcg 3780gcgctgggcg cgcgcacggc ggcgctgccg gagctcatcg cggcgctgcc ggagggcgcg 3840gccgcgccgc gccggctggt gatcgacgcg gccgcgggcg accccggcga cggcctggtc 3900gcggcggcgc acgcggcggc gcagcgggtc ctgtcgctgg tgcaggggtg gctctcggag 3960gcgcggctcg cggacagcga gctggtggtg gtgacgcgcg gcgctgtggc cgccgggccc 4020gacgacggcg tcgcggcgtt gagccacgcg ccgctgtggg gactcgtgcg cacggcgcgc 4080caggagaacc ccggccgggc ggtgcgcctc gtggacctgg ggcccgagcc gctggacgga 4140gcgctcctgc gccgggtggt ggcggcggcc gaggagccgg agctcgcgct gcgcgggggc 4200gcggcgcgcg cgccacgcct gcgcgaggtg cgcgcgggcg cggccgacgc ggcgcggccg 4260acgcggctgg atcccggcgg gacggtgctg atcacgggcg gcaccgggga gctcgggcgg 4320caggtcgcgc ggcacctcgt ggcgtcgcac ggcgtgcggc acctcgtgct cacgtcgcgg 4380cgcgggatgg gtgcgccgga cgccgcggcg ctggtggacg agctgcgcgc cgcgggcgcc 4440gcgacggtcg acgtcgcggc gtgcgacgtc gccgacggcg cggcgctggg ggcggtcatc 4500gcggcgatcc cggctgcaca ccccctcacg gcggtcgtgc acatggcggg cgtgctggac 4560gacgtcatcg tgacgaagct ctcggccgag cagctcacgc gcgtgctgcg gccgaagatc 4620gacggcggct ggcacctggc cgcggcgacg cgaggccatc ggctcgcggc cttcgtgctg 4680ttctcgtcgg cggccggcac gctgggcagc ccggggcagg cgaactacgc cgcggccaac 4740acgttccttg acgcgctcgc ggcgcagctc cgcgcgcgcg gcgtgcccgc gatgagcctc 4800gcgtggggct tctgggagca ggcagggctc ggcatgacgg cgcacctcgg cgcggccgac 4860ctggcacgcc tcaggcggca gggcatcgcg ccgatcgcgc tcgcgcaggg catgcagctg 4920ctggaccggg cgctcgcgcg cccggaggcg gcgctggtgc cggcggcgct cgaccttccg 4980gcgctccagc gtgcggcgag cgacgccggg caggtgccgg cgctgctgcg cgggctcgtg 5040cgcccggcgg tcgggcggcg cgcggcggcg cctgcggccg ccgcgaccgg agcggcggcg 5100ctgcgcgcgc ggctcgcgcc gctgcccgag gccgagcggc acgacgtggt gctcgacctg 5160gtgcgcgccg aggcggcggc cgtgctgcag ctggcggggc cggcgcaggt ccccgcggac 5220aagccgctga aggagctggg gctcacctcg ctcacggcgg tcgagctgag gaaccgcctc 5280ggcgcgcgcg ccgagacggc gctgccggcg accctcgcgt tcgaccatcc gacgccgcgc 5340gcgatcgcgg gtctgctgct tcagcgtgcg ttctcggagc tcgcggcggc ggtggcgacg 5400cgcgcacagg cgccacgcgc gcagggggcg cacgacgagc cgatcgcgat cgtgtcgatg 5460gcgtgccggc tcccgggcgg cgtcgatacg cccgcccgga tgtggcagct cctggcggag 5520gggcgggacg cgatcgggcc gttccccgag gggcgcggct gggacgtggc ggggctgtac 5580gaccccgacc cggacgcgcc gggcaagtcg gtcaccaacc tgggcggctt cctctacgac 5640gccgaccact tcgatccgac gttcttcggc atcagcccgc gcgaggccga gcgcatcgac 5700ccgcagcagc ggctgctgct cgagtgcgcc tgggaggcgc tcgagcgcgc gggcctggcg 5760ccccacacgc tcgaggcgag cgccaccggc gtctttgtcg ggctggtgta cagcgactac 5820ggcgggcggt tgctggagca cctcgagtcc ttcgacggct acatcgccac cggcagcttt 5880cccagcgtcg gctcggggcg catcgcctac acgctggggc tccgcggccc tgcgatgacc 5940gtcgacacgg cgtgctcgtc gtcgctcgtg tcgctccacc tcgcgtgcat gtcgctccgc 6000gcgggcgagt gcgacatggc gctcgccggc ggcgccaccg tgatggccac gccgatggcc 6060ttcatcgagt tcagccgcca gcgcggcatg gcccccgacg cacggtgcaa ggccttcggg 6120gcggaggcga acggcatcgg ccccgcggag ggctgcggga tcctggtgct caagcggctg 6180tcggacgcgc ggcgcgacgg cgaccgcgtc ctggcggtga tccgcggctc cgccgtcaac 6240caggacggcc gcagccaggg gctcaccgcc cccaacggcc cggcccagca ggacgtcatc 6300cgccaggccc tggccgcggc ggggctcacg cccgccgacg tcgacgccgt cgaggcgcac 6360ggcaccggca cgcgcctcgg cgatcccatc gaggcgcagg cgttgctggc gacctacggc 6420accgcgcaca cagcggagcg gccgctctgg ctcggctcga tcaagtcgaa cctcgggcac 6480acgcaggccg ccgcgggggt tgtggggctg atgaagctcg tgctggcgat gcagcacgcg 6540gagctgccga ggacgctgta tgcggagccc cgatcgccgc acatcgactg gtcgcagggg 6600cacatcaacc tcctgaacga gcccgtgccg tggccgcgca ccgacaggcc gcggcgcgcg 6660gcggtctcgt ccttcggcat cagcggcacc aacgcgcacg tcatcatcga ggaggcgccg 6720gccgaagcgc cggcgacagc ggcggacgca aagtcggtgg aggcgcttcc gatcctgccg 6780ctgctcctgt cgggtcgcga cgagccggcg ctgcgcgccc aggccgggcg gctcgccgag 6840cacctgcgcg cccacccggg cgagcggctg ctcgacatcg ccgcgggcct ggccacgacg 6900cgcacgcacc tcgccacgcg gctcgcgctg ccggtcgccg cggacgcagc cgcggaggag 6960ctgggcgccc gccttgcgca gttcgccgcc ggcggcccgg cgcccagcgg cgccgccgtg 7020accgcgccgg ggcagccgcc cggcaaggtc gcggtgctct tcaccggcca gggcagccag 7080cgcgccggca tggggcgcgc cctgtacgcc acccaccccg tcttccgcgc cgcgctcgac 7140gccgcatgcg ccgagctcga ccgccacctc gacaggcccc tccacagcgt cctcttcgca 7200gacgccggca ccgaggccgc cgcgctgctc gaccagacag gatgggcgca gcccgccctg 7260ttcgctctcg aggtcgcgct ctaccgacag tgggaggcct ggggtctgcg ccccgagctg 7320ctgctcggcc acagcatcgg cgagctcgcc gccgcccacg tcgccggcgt gctcgacctc 7380cccgacgcct ccgccctggt cgccgcccgc ggacggctca tgcaggccct cccccacggc 7440ggcgccatgg cctccatcga ggccaccgag cacgagctcc tacccctgct cgaccagcac 7500acggggcgcc tctcgctcgc cgccctcaac gctccacgcc agtcggtcgt cagcggcgac 7560cagcccgccg tcgaccatgt ctgcgctcac ttcatcgccc tcggccgacg cgccaagcgg 7620ctcgacgtca gccacgcctt ccactcggcg cacatgcaac ccatgctcga cgccttcgcc 7680agcgtcgccc gcggcctgac cttccacccg ccacggctgc ccatcgtcag cagcgtcacc 7740ggcgcacgcg ccaccaccga ccagctcacc tcgcccgact actgggtgca gcaggtgcgc 7800gagcccgtgc gcttcctcga cgccatgcgc tccctgcacg ccgccggcgc cgccaccttc 7860gtcgagtgcg ggccgcacgg cgtgctcacc gccgcaggcg ccgagtgcct cgctcccgag 7920ggcgctcgcg acgccggctt cgtcaccagc ctccgcaagg accgcgacga ggccctcgcc 7980ctggtccacg ccgcctgcgc cgtccatgtc cgcgggcacg ccctcgactg gctccgcttc 8040ttcgacgcca ccggcgctcg ccgcgtcgag ctgcccacct acgccttcca gcgacagcgc 8100tactggctcg aggcgccaag gcctcgcccc agcctcgagg gtgtcggcct caccgccgca 8160aaccacccat ggctcggcgc cgccgtgcgc ctcgcagacc gcgatggcta cgtcctcagc 8220ggccgcctct ccaccatcga ccacccgtgg gtcctcgacc acgtggtggc aggcacagtg 8280atcttgccag gaacggcgtt cgtcgagctg gcgtgggcgg cggccgaggt ggtgggcgcc 8340gccgcggtgt ccgaggtgac cttcacgacg ccgctcgtgc tgccgccgcg cagcgtggtg 8400gagctgcagg tgaggatcgg cgagccggac gcgtccgggc ggcggacgtt cgccgcgtac 8460agccgcgcgg acgcggcgat cgaggcggag tggacgcaac acgcgaccgg cgtgctgagc 8520gcgcaggcgg cggccggggc cgacgtggcg gacctttcgg tgtggccacc gccgggcgcc

8580gaggtggtgg cgctcgacgg cggctacgcc tggctggcgg cgcagggcta cggctacggc 8640ccggcgttcc aggcgctgcg cgaggtgtgg cgcgcgggca cgacgctgta cgcgcgggtc 8700gcgctgccgg acgcggtggc ggacacggcg cggggcttcg ggatccatcc ggcgctgctc 8760gacgcggtgc tgcactcgtt gctggcgccg tcggcgcagg aggaggcgtc cgacgacgac 8820aaggtgctgc tggcgttcgc gttctcggac gtggtgatcg aggcgcgcgg ggcagcggag 8880gtgcgcgtcc gcctgaacaa gcaggccgga gacgacgggg agggggtcac ggcgtcgatt 8940cacctcgccg acgcgcaggg gcggccggtc gcgcgcgtgg gggcgttcca ggcgcgggcg 9000acgaccacgg agcgggtgcg cgcgctcgcg ggcgcgagcg agcgcgacct gcaccgggtc 9060acgtggacgg acgtgacgct ggaagagacg ccgtgggcgc acgaggacag cgtcgtggtc 9120ggcggcgacg gcgcgctggc ggcggcgctg ggcgtgcgcg cggtggccgg gctgcccgag 9180ctgctcgcgg gcggcgcggc ggcgccgcgt cgtctggtga tcgacgcgac cgcgggcgac 9240cccggcgacg gcctggtcgc ggcgacgcac gcggcgacgc agcggggcct cgcgctcttg 9300cagggatggc tctcggaggc gcggctcgcg gcgacggagc tggtgctcgt gacgcgcggc 9360gcggcggcgg ccgagccgga cgagggtgtg gcggcgctga gccacgcgcc gctctggggg 9420ctcgtgcgcg cggcgcgcga agagcacccg gcgcgcgcgc tgcgccttgt cgacctgggg 9480cgcgaggcgc cggacggggc gatcctgcgc cgggcgatcg cggcggacga cgagccggag 9540ctcgtggtcc gccgcggggc gctgcgggcc gcgcgcctga gcctcgccca cgctggcccg 9600gacaccgcgg ggcaagcgac gcggctggcc cccggcggga cggtgctgat cacgggcggc 9660acgggagagc tcggacggca ggtcgcgcgg cacctggtgg cggcgcacgg cgttcgccac 9720ctggtgctga cgtcacggcg cggaatggac gcgcccgacg ccgcggcgct ggtggagtcg 9780ctgcgcgcgg cgggcgccgc gacggtggag atcgcggcgt gcgacgtggc ggacgggcat 9840gcgctggcgg cggtgctccg gaccatcccg gcggagcatc cgctgaccgc ggtcgtgcac 9900acggcgggcg tgctcgaaga cggcgtcgtg accgggctct cggccgagca gctcgcgcgc 9960gtgctgcggc cgaaggtcga cggcgcctgg cagctctacg aggcgacgaa ggacgcgccg 10020ctcgcggcgt tcatgctctt ctcgtcggcg gcgggcacgc tgggcagcgc ggggcaggcg 10080aactacgccg ctgcgaacgc gttcctcgat gcgctggcgg cagagctccg cgcgcgcggc 10140gtgccggcga tgagcctggc ctggggcttc tgggagcaag gcgggatcgg catgacggcg 10200cacctcggcg ccgccgacat ggcgcgggtc aagcggcagg gcatcgtacc gatgacggtc 10260gcgcacggcc tgcggctgct cgaccgcgcg ctggagcggc ccgaggcgac gctggtgccc 10320ctatcgctcg acgtggcggc gcttcagcgc gcggcgagcg acgccggacg ggtgccggcg 10380ctgctgcgtg gcctggtgcg cccggcggcc gcccggcgca cggcggcgcc ggcggccgcg 10440gcgacagggc tccgcgcgcg gctcttgccg ttgtccgagg ccgagcgcca ggacgtcttg 10500ctcgatctgg tgcgcacgga gatcgcggat atcctcgcgc tgtccgggcc agcggcggtg 10560cctcccgatc aacccatcag ggagctgggg ctcgattcgc tcacggcggt ggacgttcgg 10620agccggcttg tgcagaggag cgagatcgac ctcgccgtga ccctcgcgta cgattacccg 10680accgcgcgag cgatcgcggg acatctgagc gagcagatgg gactcgaagg agcgccggaa 10740gatcgtgagt cggcgctcga cgagagccag atccgcgccc tgctcatgca gattcctatc 10800cccacgttgc gccagtcggg gctgctcgga gacctggttc gcctggcctc cccgcaagcg 10860cccccgcgcg aagaaggtga gagcgagacg ttgagcttcg atcaccttgg aaatgaagag 10920ttcctcagcc tcgcgtcgaa gctcattgca gaggagggat catga 1096555643DNASorangium cellulosum 5atgaaccaag agactgttct tcggcagaca ctcgagaaga gtctccacaa gatccagcac 60ctcaatcggg agctcgagcg tctcaaggcg aagtcgagcg agccgatcgc gatcgtgtcg 120atggcgtgcc gctacccggg cggcgtcgac ggtcccgcac ggctgtggga gctgctctcg 180gaggggcggg acgcgatcgg gccgttcccc gaggggcgcg gctgggacgt ggcggggctg 240tacgaccccg acccggacgc gccgggcaag tcggtcacca cgcagggcgg cttcctctac 300gacgccgacc gcttcgatcc gacgttcttc ggcatcagcc cgcgcgaggc cgagcggatg 360gacccgcagc agcggctgct gctcgagtgc gcctgggagg cgctcgagcg cgcgggcgtc 420gcgccccaca cgctcgaggc gagcgccacc ggcgtcttcg tcgggctggt gtacagcgac 480tacggcgggc ggctgctgga gcacctcgag gtcttcgacg gctacgtcgc caccggcagc 540tttcccagcg tcggctcggg gcgcatcgcc tatacgctgg ggctccgcgg ccctgcggtg 600accgtcgaca cggcgtgctc gtcgtcgctc gtgtcgctcc acctcgcgtg catgtcgctc 660cgcgcgggcg agtgcgacat ggcgctcgcc ggcggcgcca ccgtgatggc cacgccgatg 720gccttcatcg agttcagccg ccagcgcggc atggccccgg acgcacggtg caaggccttc 780ggggcggcgg cgaacggcat cggccccgcg gagggctgcg ggatcctggt gctcaagcgg 840ctgtcggacg cgcggcgcga cggcgaccgc gtcctggcag tgatccgcgg ctccgccgtc 900aaccaggacg gccgcagcca ggggctcacc gcccccaacg gcccggccca gcaggacgtc 960atccgccagg ccctggccgc ggcggggctc acgcccgccg acgtcgacgc cgtcgaggcg 1020cacggcaccg gcacgcccct cggcgatccc atcgaggcgc aggcgctgct ggcgacctac 1080ggcaagacgc acacagcgga gcggccgctc tggctcggct cgatcaagtc caacttcggg 1140cacacgcagg ccgccgcagg ggtggcgggc atcatcaagc tggtgctggc gatgcagcac 1200gcggagctgc cgaggacgct gtatgcggag ccccgatcgc cgcacgtcga ctggtcgcag 1260gggcacgtca agctcctcaa cgagcccgtg ccgtggccgc gcaccgacag gccgcggcgc 1320gcggcggtct cgtccttcgg cgtcagcggc accaacgcgc acgtcatcct cgaggaggcg 1380ccggccgaag cgcccgcggc cgcgcaaaca gcggcggggg tgccgtcgac gctgccgctg 1440ctcctgtcgg gtcgcgacga gccggcgctg cgcgcccagg ccgggcggct cgccgagcac 1500ctgcgcgccc acccggacga gcggctgctc gacatcgccg cgggcctggc cacgacgcgc 1560acgcacctcg ccacgcggct cgcgctgccg gtcgccgcgg acgcagccgc ggaggagctg 1620agcgcccgcc ttgcgcagtt cgccgccggc ggcccggcgc ccagcggcgc cgccgtgacc 1680gcgccggggc agccgcccgg caaggtcgcg gtgctcttca ccggccaggg cagccagcgc 1740gccgccatgg ggcgcgccct gtacgccacc caccccgtct tccgcgccgc gctcgacgcc 1800gcatgcgccg agctcgaccg ccacctcgac aggcccctcc acagcgtcct cttcgcagac 1860gccggcaccg aggccgccgc gctgctcgac cagacaggct gggcacagcc cgccctgttc 1920gctctcgagg tcgcgctcta ccgacagtgg gaggcctggg gcctgcgcgc ccacgcgctg 1980ctcggccaca gcctcggcga gatcgtcgcc gcccacatcg ccggcgtgct cgacctcccc 2040gacgcctccg ccctggtcgc cgcccgcgga cggctcatgc aggccctccc ccacggcggc 2100gccatggcct ccatcgaggc caccgagcac gagctcctac ccctgctcga ccagcacacc 2160ggacgcctct cgctcgccgc cctcaacgct ccacgccagt cggtcgtcag cggcgaccag 2220cccgccgtcg accatgtctg cgctcacttc aaggccctcg gccggcgcgc caagcggctc 2280gacgtcagcc acgccttcca ctcggcccgc atggaaccca tgctcgacgc cttcgcccgc 2340gtcgcccgcg gcctgaccta ccgcgccccg cgcctgcccg tcgtgagcaa tgtcaccggc 2400cgcatggcca ccgccgacga gctcacctcg cccgactact gggtgcgcca cgtgcgcgag 2460cccgtgcgct tcgtcgccgg cgtgcgcgcg ctgcacgcca ccggcgtcgc cacctacctc 2520gagtgcgggc ccgatccggt gctcggcggc atggccgcag actgcctcac ctccgacgag 2580agccgcgacc caggcctgat ccccagcctc cgcaaggacc gcgacgaggc cctcgccatc 2640gcccaggccg cctgcgccct gcacgtccgc ggacacgccc tcgactggcc ccgcctcttc 2700gacgccaccg gcgctcgccg cgtcgagctg ccaacctacg ccttccagcg gcagcgctac 2760tggatcgatg cgccgcggcg cgcggcgggg ctcgaaagcg tcggcctcac ggccgcagac 2820cacccctggc tgggcgcggc ggtgcggctc gccgaccggg acgtctacgt gctgagcggg 2880cggctgtcga cggtcgacca cccgtggatc ctggaccacg tggtgacggg cacggcgctg 2940atgccaggaa cggggttcgt cgagctggcg tgggcgacgg cccaggcggt gaacgccgcc 3000gcgatcgcgg agctcaccct gacgactcca ctcgtgttgc cggcgcgcgg cgcggtgcag 3060ctccaggtga cggtcgacga ggccgacgcg gatggccggc gggcattcgc gatccacagc 3120cggccgcatg ggcccgtcga cctcgagtgg acgcaacacg cgaccggcgt gctgagcgcg 3180gaggcgccgg cgggagccga cgaggcggcg gggctctcgg agtggccgcc gccgggcgcg 3240gaggcggtgg cgctcgacgg cgggtatgag cagctgtccg agcacggcta cggccatggc 3300ccggcgttcc aggggctccg cgggctctgg cgcgcggacc agacgctgta cgcgcacgtc 3360gcgctgccgg acgctgtcgc gggcacggag cagggcttcg ggctccatcc ggcgctcttc 3420gatgcggcgc tgcagtcgct ggcgcggctg tcgcgcgagg aggcggccgc tggcgacccg 3480gtgctggtgc cgttcgcgtg gacggacgtg gcgctgtacg cggccggcgc gaccgagctg 3540cgggcgcgca tcgcgctgga gcaggcggag ggcggcgcgc cggcggtggc gtcgctgctg 3600ctggccgacg cgcacggacg aaccgtggcg acgacagggc gggtgcgcgg ggcgagcgcg 3660gcgcagacgc ggtccgccgc gagccgtgcg gagccgatgt acagggtcgc gtggacggac 3720gtggcgctgg aggcggcggc gtgggcgccc gaagagcacg tcgtgctcgg cggtgacggt 3780gcgctggcgt cggcgctggg cgtgcgcgcg gcggccgggc tgccggagct gctcgaggcg 3840ctggcggacg gcgcggccgc gccgcggcgg cttgtcgtgg acctgacggc gggcgacgcg 3900ggcgctgtcg tcgcggccgt gcacgccgcg gcgcgcggcg cgctggccct ggtgcaggga 3960tggctcgccg cgccgcagct gacggcgacg gagctcctcg tggtgacgcg ctgcgccgtg 4020gcgacagggc cggacgaggg cgttgacgcg ctggggccgg cggccgtctg ggggctgctg 4080cgggccacgc gcgccgagca ccccgaccgc gcggtccggg tgctggacct ggggcgcgag 4140ccgctggacg gggcgctcct gcgcagggcg ctggccgcgg tggcggagcc ggagctgtcg 4200ttgcgccgcg gcgaggcgcg cgcgcctcgc ctgcgcgagg caaagcccgc cgcggcgccg 4260gcgacacggc tggaccctga agggacggtg ctggtcacgg gcggcaccgg ggagctgggg 4320cggcaggtcg cccggcacct ggtggcggcg cacggcgtgc ggcacctcgt gctgacgtcg 4380cggcgcggga tggacgcgcc cgacgccgcg gcgctggtag aagagctgcg cgcggcgggc 4440gcggcgacgg tcgacgtcgc cgcgtgcgac gtcgccgctg gcccggccct ggcggcggtc 4500gtggaggcga tcccggcggc gcatcccctg accgcggtcg tgcacatggc gggcgtgctg 4560gacgacggca tcgtgacgaa gctctcggcc gagcagctca cgcgcgtgct gcggccgaag 4620gtcgacggcg ccattcatct ccacgagctc acgaagcacg cgccgctcgc ggccttcgtg 4680atgttctcgt ccgcggcggg cacgctgggc agcccggggc aggcgaacta cacggcggcc 4740aacgtgttcc tggacgcgct ggcggcgcga ctgcgcgcgc gcggcgtgcc cgcgatgagc 4800ctggcgtggg gcttctggga gcaaggcggg atcggcatga cggcgcacct cggcgccgcc 4860gatcgggcgc ggatgaagcg acacggcgtc gtggcgatgt cggtcgcgca gggcctgcgg 4920ctgctcgatc gcgcgctcgc gcaccccgag gcggcgctgg tgccgctcgc gctcgacctc 4980tcgtcgctgc acgcgggggc cagcggcgcc ggaccggtgc cgccgctgct gcgcgggctg 5040gtacgcgcgc ccgccggccg gcgcacggcg gcgtccgcgg cccggacgaa cgggaagggc 5100acggcattgg cggcgctccg cgcgcggctc ttgccgttgc cgcaggccga gcgcgaggac 5160ctcttgctcg agctcgtgtg caccgaggtc gcggaggtgc tgcagttgcc ggggccggcg 5220cacgtcccgg cggatcagcc gctccgcgac ctggggctcg actcgctcat gaccgtggag 5280ctgcgcaacc gtctcggcgc gcgcgccgag acgacgctgc ccaccacgct cgcgttcgac 5340tacccgacgc ccagggccct tgcgtcctat ctggagacgt tgctcggcat ctccgacgag 5400aacgggcatt cgggtgagtt gctgcacgtt ccgcagaacg aggacgagat ccgctccgcg 5460atagcgcgca tcccgatagc gaccctgcgc gaggcggggc tcctccagag cttgctgcgg 5520ctcgcccccg gcaaggcggt ggccggtgac gtcacgcacc cggtcgatga gctgctggtc 5580gagcacatcg aggatgaaga gctgcttcga ctcgctttcg aggccaccgg aggtatcaag 5640tga 564368610DNASorangium cellulosum 6gtgaaagacg aggctctctc gtttcgccga gccctggaga agacggtcgt cgagatccgc 60cgtctcaatc gggagatcga cgacctgcgg gcgaagtcga gcgagcccat cgcgatcgtg 120tcgatggcgt gccggttccc cggcggcgtc gagaaccccg aggcattgtg gcggctggtc 180tccgaggggc aggacgcgat cgggccgttc cccgaggggc gcggctggga cgtggcgggg 240ctgtacgacc ccgacccgga tgtgccgggc aagtcgatca ccgcgcgggg cggcttcctc 300tacgacgccg atcgcttcga tccggagttc ttcggcatca gcccgcgcga ggccgagcgc 360atcgatccgc agcagcggct gctgctcgag tgcgcctggg aggcgctcga gcgcgcgggc 420gtcgcgcccc acacgaagga ggcgagcgcc accggcgtct tcgtcgggct gatgtacacg 480gactacggcc tgcggctgct gaaccacccc gaggccctcg acggctacat cggcatcggc 540agcacgggga gcacgggctc ggggcgcatc gcctacacgc tgggcctgca gggacctgcg 600atcacggtgg acacggcgtg ctcgtcatcg ctcgtggcgc tccacatggc ctgcgcgtcc 660ctgcgcgggg gagagtgcaa cctggcgctt gtcggaggcg tcgccgtgat gacgacgccg 720acaacgttca tcgagttcag ccggcagcgg ggcctctcgc tcgacggccg gtgcaagtca 780ttcggtgccg aggccgaggg cgtcggctgg ggcgaaggct gcggaatcct ggcgctgaag 840cggctgtcgg acgcgcggcg cgacggcgac cgcgtgctcg cgatcatccg cggctccgcc 900gtcaaccagg acggccgcag ccaggggttc accgccccca acggcccgag ccagagggcg 960gtcatccagc gggcgctggc ggcggcgggg ctgaccgcgg cggacgtcga cgccgtcgag 1020gggcacggca ccggcacgcg cctcggcgac cccatcgagg cgcaggcgct gctggcgacc 1080tacggcaagg cgcacacagc ggagcggccg ctctggctcg gctcgatcaa gtccaacttc 1140gggcacacgc aggccgccgc aggggtggcg ggcatcatca agctggtgct ggcgatgcag 1200cacgcggagc tcccgaggac gctgcacgcc gacacgccct cgccgcacgt cgactggtcg 1260caggggcacg tcaagctcct caacgagccc gtgccgtggc cgcgcaccga caggccgcgg 1320cgcgcggcgg tctcgtcctt cggcatcagc ggcaccaacg cgcacgtcat cctcgaggag 1380gcgccggccg aagcgcccgc ggccgcgcaa acaccagcgg cggcgggggt gccgtcaacg 1440ctgccgctgc tcctgtcggg tcgcgacgag ccggcgctgc gcgcccaggc cgggcggctc 1500gccgagcacc tgcgcgccca cccgggcgag cggctgctcg acatcgccgc gggcctggcc 1560acgacgcgca cgcacctcgc cacgcggctc gcgctgccgg tcgccgcgga cgcagccgcg 1620gaggagctga gcgcccgcct tgcgcagttc gccgccggcg gcccggcgcc cagcggcgcc 1680gccgtgaccg cgccggggca gccgcccggc aaggtcgcgg tgctcttcac cggccagggc 1740agccagcgcg ccgccatggg gcgcgccctg tacgccaccc accccgtctt ccgcgccgcg 1800ctcgacgccg catgcgccga gctcgaccgc cacctcgaca ggcccctcca cagcgtcctc 1860ttcgcagacg ccggcaccga ggccgccgcg ctgctcgacc agacaggctg ggcacagccc 1920gccctgttcg ctctcgaggt cgcgctctac cgacagtggg aggcctgggg cctgcgcgcc 1980cacgcgctgc tcggccacag cctcggcgag atcgtcgccg cccacatcgc cggcgtgttc 2040gacctccccg acgcctccgc cctggtcgcc gcccgcggac ggctcatgca ggccctcccc 2100cacggcggcg ccatggcctc catcgaggcc accgagcacg agctcctacc cctgctcgac 2160cagcacaccg gacgcctctc gctcgccgcc ctcaacgctc cacgccagtc ggtcgtcagc 2220ggcgaccagc ccgccgtcga ccaggtctgc gcccacttca aggccctcgg ccggcgcgcc 2280aagcggctcg acgtcagcca cgccttccac tcggcccgca tggaacccat gctcgacgcc 2340ttcgcccgcg tcgcccgcgg cctgacctac cgcgccccgc gcctgcccgt cgtgagcaat 2400gtcaccggcc gcatggccac cgccgacgag ctcacctcgc ccgactactg ggtgcgccac 2460gtgcgcgagc ccgtgcgctt cgtcgccggc gtgcgcgcgc tgcacgccac cggcgtcgcc 2520acctacctcg agtgcgggcc cgatccggtg ctcggcggca tggccgcaga ctgcctcacc 2580tccgacgaga gccgcgaccc aggcctgatc cccagcctcc gcaaggaccg cgacgaggcc 2640ctcgccatcg cccaggccgc ctgcgccctg cacgtccgcg gacacgccct cgactggccc 2700cgcctcttcg acgccaccgg cgctcgccgc gtcgagctgc caacctacgc cttccagcgg 2760cagcgctact ggctcgagac gccccagacg ccgggcgccg acggggcctc caacctatct 2820tcgcccgccg aaagccgctt ctgggaggct gtcgagagag cggacatcat ccccctcgcc 2880gaggcgctgc gcctcgagga tgaggcgcaa cgcgcttcgc tggcgaccct gctgcccgcg 2940ctctcgacct ggcgccgccg acgccacgag cagagcaccg ccgacgcctg gcgttaccgc 3000gttgcctgga aaccccttgc catcgacgcc cggagcgatc tctcgggggt ctggctgttc 3060ctcgcgcctc cggatcacgc gaaggacgac ctcgcgcgcg cggtccttcg cgcgctcgcc 3120gagagcggcg cgacggtcgt ccctgtgctg gtggccgagg gcgacgtcga ccgcgccctc 3180ctgagcgcgc ggctgcgcga gcaggtcggc gacggcggcg cgatccgcgg cgtgatctcg 3240ctcctcgccc tggacgagac ctcgctgccg cagcacgacg ggctgccccg gggcctcgcc 3300ttcacgctcg cgctcgtcca ggccctggga gacacggcga tcgcagcgcc tctatggctg 3360ctcacccgtg gcgccgtctc cgtgggtcgt tccgaccgcc tcgagcgccc gctgcaggcg 3420ctgacgtggg gcctcgggcg cgtggtggcg ctggagcacc ccgagcgctg gggtggactc 3480atcgatctcg ccggcgcgct cgacgaaaag gcgctcaagc ggctcgtcgc cgccctcggt 3540ggtcgcgacg ccgaggatca gctcgccctg cgcccctccg gactcttcgc gcgacggctg 3600gtcagagcgc ccctgggtga agcgaccgcg gttcgcgcct ggaaggcgcg cggcaccgcg 3660ctcgtcaccg gcggcacggg ggacctgggc gcccacgtcg cccggtggct cgcccagaat 3720ggcgccgagc acctcgtcct caccagccgc cgcggacagg acgcccccgg agcggccgag 3780ctcacggccg agctcacggc gctcggcgcc cgcgtcacca tcgccgcctg cgactcgtcc 3840gaccgacagg cgctcgcggc cctgctccag cgcctgaggg ccgaaggccc ccccctccgc 3900gccgtcgtcc acgctgcggg tgtcgaccag gtcaccccgc tggccaggac cagcctggcc 3960gagttcgcag gcatcgcctc cggcaaggtc gcaggtgctc ggcacctcga cgacttgctc 4020ggcaatgccc ccctcgacgc cttcatcctc ttctcctcgg tcgcaggcgt ctgggggagc 4080ggctttcagg gcgcttacgc ggcggccaac gccttcctgg acgcgctggc cgagcagcgc 4140cgcgccctgg gctcgacggc cacgtcgatc gcctggggcc tctggggcgg caaaagcatg 4200gccgacgacg ccgccaaaga tcatctcagc aagcgcggcg tgtccccgat gccgccccag 4260ctcgcgatcg cggccctgca gcgggcgctc gaccacgacg agaccacact caccctcgcc 4320gacgtcaact ggtcacgctt tgccccggcc tttgccgccg cccgcccgcg cccgttgctg 4380cacgatctcc cggaagcccg gagcgctctc gagtccccct cgccggcgcc ccgcgaggcc 4440gagctgctca cccggctcca gggcctctcc agcaccgagc gcgtccgcca cctcgtctcc 4500ctcgtgctgg cggagaccgc cgtcgtcctc ggccatcctg acgcctcccg cctcgaccct 4560cacacaggct tcgcggatct cggcctcgac tcgctgatgg ccgtcgagat gcgccggcgg 4620ctccagcagg caacgggggt gagcctgccg gcgaccctga ccttcgacca cccctcgccc 4680caccacatcg cgaccttcct cctcgacgag gtcttcgcgc cggccctcgg ccaggccccc 4740ggcgccgagg aagacgaagc gatcgcccag gccgggctcg cctcgggcga cgagcccgtc 4800gccctcatcg gcgtggggct gcgtctcccc ggcggagcca ccgacctcga cgggctctgg 4860cgccttctgg agcaggggat cgacgttgtc ggccccgtcc ctgaagaccg cggctggagc 4920atggacgagc tctacgatcc cgaccccgac tccctcggca agagctacgt gcgcgaagcg 4980gctttcctcg atcgcatcga cctcttcgac gcgggcttct tcggcatcag cccccgcgag 5040gcgagccacg tggacccgca gcaccgcctc ctgctcgagg ccgcgtggca ggccctcgag 5100cacgcaggca tcgtcccggc ctcgctccag gactcccaga ccggcgtctt cgtgggctca 5160ggcccgagcg actacgcctt gctccacaac ccggcccagg aggatgaagc ctacaggctt 5220acggggacgc agccctcgtt cgcgccaggc cggctctcgt tcagcctggg attgcaggga 5280ccggcgctct ccgtggacac cgcctgctcc tcctcgctcg tcgcgctcca cctcgccgcc 5340caggccctgc gccgcggcga gtgcgggctc gccctcgtcg gcagcgcgca ggtgatggct 5400gctcccgacg ccttcgtgac gctctcccgc gctcgcgcca tcgctcccga cggccgctcg 5460aagaccttct ccgcccaggc cgatggctac ggccgcggcg agggggtcat cgtcttcgtc 5520ctcgagcgcc tgagcgacgc ccgcgcgaga gggcgcgacg tcctcgcggt cctccgcggc 5580agcgccgtca accacgacgg cgccagcagc ggcatcaccg cgccgaacgg cacctcccag 5640cagaaggtgc ttcgtgccgc gctccacgat gcgcggctca cgccagcgga cgtcgacgtg 5700gtggagtgcc acggcacggg cacttccctc ggcgacccca tcgaggtgca agccctggcc 5760gccgtctacg gaaaggagcg ctccgccgat cggccgctga tgctcggcgc gctcaagacc 5820aacgtcggcc acctcgaggc cgcgtccggt ctcgccggcg tcgcgaaggt cgtcgcggcg 5880ttgcgccacg aggcgctgcc ggcgacgctg cacaccgccg cgcgcaaccc tcatatccag 5940tgggatacgc tgcccgtcca ggtcgtcgac accttgcgtc cctggccgcg gcgcgaggac 6000ggcacccccc gccgcgccgg cgtgtcggcg ttcgggctct ccggcaccaa cgcccacgtc 6060ctcctcgagg aagctccgcc tgtccagccg agcacacagg cggagcagcc tgccgcgccg 6120ccgtggttgc cgctgctcct gtcgggcaag acggacgcgg ccctgcgagc gcaggccgag 6180cggctgcggg cgcacctcga cgcccatgcc gacctcgggc ttgccgacgt cgcctattcc 6240ctcgccacga cgcggacgca tttcgcgcat cgggcggtgg tcgtcgcgga cgctggcgcg 6300accctcttcg aagggctgga cgccatcgcg cgcggcaacg ccgcttccca cgtggtggtc 6360gacgaggcca agatcgacgg caagaccgtc ttcgtcttcc cgggacaggg ctcgcagtgg 6420gcccagatgg cgcagccgct gctcgagacc tccgagctct ttcgcgagcg tatcgaggcg 6480tgcgcgcacg ccctcgcgcc tcacgtcgac tggtcgctgc tcgccgtcct ccgcggcgaa 6540gaaggcgccc cctcactgga gcgggtcgac gtggtgcagc cggtgctctt cgccgtgatg 6600gtctcgctcg ctgccctctg gcgctcgatg ggcgtcgagc cggacgccgt cgtcggccat 6660agccagggcg agatcgccgc cgcctgcgtg gcgggcgcgc tgtcgctcgc ggacgccgcc 6720aaggtggtgg cgctgcgcag ccgcgcgctc gcgcggctcg ccggccgggg cgccatggcc 6780gtcgtggagc tccccgccgc cgagctcgcc gagcgcatga agcgctgggg cgagcggctg 6840tccatcgcag cgctcaacag ccctcgttcc accgtgatct ccggcgatcc ggacgccgtc

6900gacgcgctgc tccgggagct cgactcggcg gagatcttcg cccgcaaggt gcgcgtcgac 6960tacgcctccc actgctccca tgtggaggcg attcgccacc agctcctggc cgagctcgcg 7020ggcatcgagc cgctcccgtc cacgctcccg ctctactcca cggtgagcgg ggacaagctc 7080gatggcgtcg cgctcgacgc ctcgtactgg taccggaacc tccggcagac cgtccgcttc 7140tcggacgcca cgcagcggct cgtctccgcg ggacatcgct tcttcgtcga ggtcagcccg 7200catccggtgc tgacgttcgc cgtgcaggat gtcctcgatg ccgagggggt gcccgccgct 7260gtcgtcggct cgctacggcg cggcgagggc gacctgcggc ggttccttgt gtcgctgtcc 7320gagctcttca cccgcggcct cgccctggat tggtccaggg ttctgcccag cggccggcgc 7380gtatcgctgc ccacctacgc cttccagcgc gagcgctact ggctcggggc tcacagggct 7440cgcggcaccg acgcgacatc cgccggcctg gcatcggacg agcccacgcg cggcgcgtcg 7500atgccagtgc ggctctcgtt gcgggacgtg ccgcccgagg agcgccaggg agcgctggag 7560cggttcgtcc gggagcagct cgcggccgtc ctgcgcatgg atgcggcgcg gatcgagggg 7620cagacgacga tcaagacgct cgggatcgac tcgctcatgg cgctcgagat ccgcaaacgg 7680ctggaagccg gactggccgt gaccttgcca tcgacgctca tctggcagtt cccgcacgcc 7740gaagggctcg cacggcacct catgacgcgg ctccccgcgg gggacggaga aggatctgcc 7800gtggtccagc ccgtggagca gccgcgcgcg ccgaaggagg tgcccgtatc catggatccc 7860tcggcgtggg tgcaccgccc gcgccccagg gccgacgcgc gcgttcgact gttctgcctt 7920ccctacgccg gcgcgggcgc ctcgcgcttc cgggcgtggc cagagctgct cccctcctgg 7980gtggaggtct gcccgatcca gctccccggc agggaagagc gcctccacga gccggccttc 8040gagacgatgg acgcgctcgt cgacgcgctc gttcccgccg tcgaggcgca catcgatcgg 8100ccctttgcgc tgttcggctg cagcatgggt gccctcctgg ccttcgagct cgcccgggcg 8160cttcaatccc gtcatcgctt ggtggcgcgg catctgttcg gcgcggcgag ctcctcacct 8220cggcgcgtga gcccggtacg ggagcagctc tccgcggtgg tctcccctgg aacggtgcga 8280tcggacgcga tggcctcgct gcgccagctc ggtctgctgt cgtcctcgtc cctccaggac 8340gaagagatgc tggacgaggt gtggcccgcg ttccgtgcgg atctatccct gacgctgaag 8400tacacgtgca gggacgcaac ccccctcgac gcccccatct cggtcttcgg gggcaccgag 8460gaccggaccg tagggcgcga ggatctcgtc gcctggcata cgctgacgaa ggacgcgttc 8520caggtcgcca tgctgcccgg gggtcacctg ttcatggacg cgacgccgaa gcggctcttc 8580catcacatcg agcacgcgct ccagctctag 86107687DNASorangium cellulosum 7gtgcggacca gcgacgccgt gtgggctggt gccgcgggct ataccagggc gcgtcttcag 60gtctatgact tcttcatcta cggcttcaac agccctgtcg catggaagtg cccgggcgag 120gagctcctcg agaactacaa tcggcacgtc tcgggcaatc acctcgacgt cggcgtgggg 180acggggtacc tgctcgaccg ctgccgcttc cccaccgcca agccgcgtgt gtttctgatg 240gatctgaacc cggacgctct gcaggtgacg gcgcagcgac tgcaccgctt tcagcctcag 300accttgcggc ggaacgtcct tgatcccatc cgcttcgacg gagagccctt cgactccatc 360gggatgaact acctcatgca ctgcgtccct ggatccatcc cggagaaggc cgtgatgttc 420gaccacctga gcgccttgct gaagccgggc ggcgtgatct tcggcagcac ggtgctctcg 480gagggcgtgg acaaggggat cgtggcgcga gccatcatgg accgcttcaa caagaagggg 540atcttctcga acacccgaga cgccgcctcc gatctgacgc gagcgctgga ggagcgcttc 600gacgacgtct cggtccgcgt cgtcggctgc gtcgggctgt tctcagccag gaagcgtacc 660tgcgcgggaa ccgagtcgcc ggcgtga 6878906DNASorangium cellulosum 8atcgtcctgg gcgacacgct ggagcaggtg gcgacgcggc tgctcgagga ggacctcgcg 60gcgtgccaca cgaccggcga ggcggcggac gtgctgctga acggggtgct cgcgtcgagc 120gcccgcgccg tggccgcggc gctgcgcgcg tgcgacgagt tcgccgcggg cgacagcgat 180ctgccgtcgc tggcccgggc gtgccgcgcg ttcgcggggc tcgcgtcgtt cgggtcgtcg 240cggtcgctgt cgtcgctcgg cgacggggtg atcgcgccga tgctggagaa gacgttcgcg 300cgcgcggtcc tgcgcgtcca cgggggctgc acgggcagcg acgaggcggt cgccgccgcc 360aaggaggcgc tgcgcacgct gcacgacgtg gcgctgtcgc agccgatcgt cgaccgcggg 420gcgtggctcg acgcggcgcg ggggctcgtg gacagcgagg tggtgaaccc gacggcgtcc 480ggcctcgcgt gcgggctgct ctacctggcg caggcgatcg acgacgccga ggtggcgcgg 540gtcgtcggcc tgcggctcgg gggcgcggcc gagcccgagg cggcggcgtc gttcctggcc 600gggttcctcg aggtgaacgc gctggtgctg gtgaagagcc ggcccgtggt cgaggcgctg 660gacgcgttcc tccgggcgat cgcgccggag cgcttcaagg acacgctgcc ggtccttcgg 720cgcgcgttcg ctgggctcgg cgcgacggag cggcggtacc tgctcgagaa cgtgctcgcg 780gcgcggaagc tgggggacaa ggcgcgcgcg gcgcaggcgg tgctcctgga gaaggaccgg 840gagaagctga aggagatgag cgaggacctc tcacaggcga tggacgacct ggacgagttg 900ctctga 90691038DNASorangium cellulosum 9tcacacgccc ggcaaagctg gcctcgagtc caccgcgacc gccctctgga tggcgtcgcg 60gagccacccg tggccctcgt cgtgctcgga gcgctccggc cagacgagcg tcagcgtgta 120gccctcgagc gcgaacgggc acggccgcac cacgagatcg agcctccggg ccagggccgc 180ggcgacgcgc gcggacacgg tgagcagcag gtcggaaccg gagacgatga acggcgcgac 240aaggaaatgg gacacggtca gcgtcacccg ccggcgtgtt ccctgctccg ccagcgcccg 300atcgatggcg ccgtggtcct ctccgtgcgg cgagaccatc aggtgctcgc aagcagcgta 360gcgcgccgcg gtgagcggcc tccgggacgc cgggtgtccg cggcgcatca cacagacgat 420ctcctcggcc gcgagcagcg tggagcgaca gccgtcgggc accggtccgc cgcgcccgag 480cttgccgtcg agctcgccgc ggcgcaggag ctcggcgaag tcggccggga tgttccggca 540gcgcaggttg acgcgcggcg cctcgacggc gagcagcgcg gtcagcgccg ggagcacgag 600cagctcgagg ttgtcggtcg cgacaagccg gaacgtgcgc tgcgaccgcc gcgggtcgaa 660ccgctcgacc gggcggaaga cctgctcgag ccgctcgacg gcctcggccg cccgcggggc 720caggtcccgc gcccgctcgc tcagcgtcat ctgcctgccg acctggatga gcagcgggtc 780cgcgaaatgg gcgcgcagcc gcgcgagcgc gtggctcatc gagggctgcg tcacgcccac 840gcggcgcgcg gcgcgcgtga cgctcttctc ctggagcagg gcgtgcaacg ccacgacgag 900gtgggtgtcg accgactgca gccgcatggt cgatggatac cacgtcgatc catcgacggc 960gtctatggat cgccgcgccg actgccgatt cgacgcccgg ggccgtgggt gcctatctct 1020cctctccgga cggcgcat 103810327DNASorangium cellulosum 10atgatcatcg agtacgttcg ctacacgatc cccgcggagc aagagaagga gttcctggcc 60gcctaccgcg acgccgccgc ggagctgcgc gggtcggagc attgcctcga ctacgagatc 120tcccgctgcg tcgaggatcc gacgagctac gtcgtccgca tctgctggga ctcgctgcaa 180ggccatctcc agggcttccg caaggcggcg gcgttcccgt cgttcttcgc caaggtgaag 240ccgttctacg agcgtatcca ggagatgagg cactacgcct tgaccgacgt cgccgcgcgg 300caggcgggga cggccgcgac gggctga 327111461DNASorangium cellulosum 11atgaagctcg cgcgcaagct gacgctcgcc ctcgtgttcg gggtattcct cgtgctcgcg 60ctgagcgcct acgcccagat ccgcagagag gccaggatct tcgagaacga cgtccagcgc 120gaccatcaca cgatggcccg cgcgctcgcg gccgcggtca tggaggtgtg gcgctccgag 180ggaaccgcgc gggcgctgcg cctcgtggag gacgccaacg agcgggaaca gcaggcgaac 240atccgctggg tctggctcga tggccaggcc gacgagcccc atcgcccccg gctggcgccg 300gagctgctcg cccccgtcgc cgaggggcgc gcggtcgtgc gccggatccc ccagaaagac 360gcggatctgc tcgtgacctg cgtgccggtg tccgtgcccg gcgaccgcgc cggcgcgctc 420gagctctccg agtcgctcgc gggcgcgcgc cggtacatcc ggagcatgat cctgagcacg 480gcgatcacca cagccgcgct gacgctggta tgtgggttgc ttacaacggg cctcggagtc 540tggctggtgg gacgccccat gcgcacgttg atcgaccagg cgcggcggat cggcgccggc 600gatctctccg ggcggctgtc gctgcgccag gaagacgaga tcggcgagct cgggcgcgag 660atgaacgcca tgtgcgatcg cctcgccgcg gcgaaccaga agctcgagtc cgaggccgcc 720gcgcggatcg ccgcgctcca gcagctccgt cacgccgagc ggctcgcgac cgtcggcaag 780ctcgcgtccg gcatcgcgca cgagctgggc gcgcccctcc aggtcgtcac ggggcgcgcg 840cggatgctcg tcgacggcga cgtgtcgggc gatgaggtgc cgatcaatgg acagatcatc 900ctcgagcagt cgcagcggat gacccagatc atccgccagc tgctcgactt cgcccggcgc 960cgcagcgccg agaagcagga gaccgcgctc cgcggcgtca tccgcggcac gttcacgatg 1020ctgaagccgc tggcggacaa gcagggtgtc acgatcgtcg aggagggaga cacgccggat 1080cgggtggtcc acgccgacgc cgaccagctc cagcaggcgc tcacgaacgt cgtcgtcaac 1140gcgatccagg ccatgccgtc cggcggcacg atcacggtgg gcgtccggac cgtccgcgcc 1200agccccccgc ccgaccaggg aggggccgag ggcgactaca tcgcgctgtc ggtgcgcgac 1260gagggacagg gcatgacggc cgacgtcctc gagcacgtct tcgagccgtt cttcacgacc 1320aagcccgtcg gcgaggggac cgggctcggc ctgccggtcg cctacggcat catcaaggag 1380cacggcggct ggatcgacgt cgacagccgc cccggctccg ggagccagtt cacgatgtac 1440ctgccgcagg agaagccatg a 1461121386DNASorangium cellulosum 12atgaccggac gcgtcctgat cgtcgacgat gagcgaggcg tctgcgagct cctcgacgcc 60gggctgaaga agcggggatt ccaggcggcg tggcgcacgt cggccgccga ggcgctcgag 120ctcctcggcg cggaggactt cgacgtcgtc gtcaccgaca tgaccatgcg cggcatgaac 180ggcctcgagc tctgcgagcg catcgcccag aaccggcccg atctgccggt catcgtcatc 240accgcgttcg ggagcctcga caccgccacg tcggcgatcc gcgccggcgc ctacgacttc 300gtgaccaagc cgttcgagct cgacgcgctc cggctcaccg tcgagcgcgc cctgcgccac 360cgcgccctcc gcgaggaggt gcgccggctg cggcgcgccg tggacgactc ccaccgttac 420gagcagatcc tcggcggcag cccggcgatg aagggcgtct tcgatctgct cgaccgggtc 480gccgactcgg acacctcgat cctcatcacc ggcgagagcg gcaccggcaa ggagctcgtc 540gcgcgcgccg tgcaccagcg cagccggcgc ggccagggcg cgttcatcgc ggtgaactgc 600gcggcggtcc cggacgccct gctcgagacc gagctgttcg gccacgcgcg gggcgccttc 660accgacgcca agggggcgag gagcggcctg ttcgcgcggg cccacggcgg caccctgttc 720ctcgacgaga tcggcgagct gccggtcggg ctccagccga agctcctgcg cgccctccag 780gagcgcgtcg tccggcccgt cggcgcggac gaggaggtcc ccgtggacgt gcggctcatc 840gcggcgacca accgcgacct ggagaccgcg atcgaggagc gccgcttccg cgaggacctc 900tattaccgga tcaacgtggt ccacgtcgat ctgccgccgc tccgctcccg cggcgccgac 960gtgctgctgc tcgcgcagcg cttcctcgag cacttcgcga ccgtcaagga gcggcccatc 1020aagggcctct cggcgcccgc ggccgagaag ctcgtcgcct acgcgtggcc cggcaacgtc 1080cgcgagctcc agaactgcat cgagcgggcc gtcgcgctcg cgcggtacga tcagatcacg 1140gtcgacgatc tccccgagaa gatacggagt taccggcgct cccacgtcct tgtctcgagc 1200gacgacccga ccgagctcgt ccccatggag gaggtcgagc ggcgctacat cctgcgcgtc 1260ctggaggtgg tcggcggaaa caagagccag gcagcccagg tcctgggctt cgatcgagcg 1320accctgtacc ggaagctcga gcggtacggc ctgcgcgccg ggcgcgcggg cgacccgagg 1380ccgtga 1386131527DNASorangium cellulosum 13tcatccatgg gagacgccgc gcgggccgtc cgcctggtcc cttgacgacg agcgcggcag 60ctcgatccgg aagaccgagc cggcgccggg acggctctcg acgaagaggc ggccgccgtg 120cgcctcgacg atgcgcttcg ccaccgcgag gccgaggccc gtgcccggga tggatccaga 180cgtggacttg agccgccgga acggctcgaa gaggtgcgcc agatcctcgg gctcgatccc 240gagcccgcga tcgcgaacgg cgatctcggc cccctcgccg ccggcgcgga ccgccacgtc 300gacctgcccc ccggcggggg agtacttgag cgcgttcgac aggaggttgt tcagcacctg 360ctcgatccgg gtcgcgtcgc agcggacgag caccggtgtc tcggggagcg agagctcgat 420ggggtgctcc ggcgagacag ggcgatagag gtccaccgcc tcctgcgcga gatcgcgcag 480gtcgcgctcc tccacccgga gatcgagctt gcaggcctcg atctgcgacg cgtcgaggag 540gtccccgacc atgcgatcga gccggtcgac ctgccgcccg acgagcgcca tggtccggcg 600cacgctcgac tccaggggcc ggttgtcggc gtcgaggacg tgcacggaca gccggagcgc 660cgacagcggg ttcctgaggt cgtgggccac gccgccgagg aacgcgaact gcgcctcgcg 720ctggcgctcc agcgactctg ccatgtcgtt gaaggcgcgc gcgatctccc cgagctcgcg 780cggcccgatc agcggcgcgc gcgcggcgcg gtcgcccgcg ccgtagcgcc cgatggcctc 840ctggatcgcg acgatggggc ggtagatgag ccgccgcgcg ctgaggagga tcgtggacgc 900gcccgcgagg aagaacacca ccgccgcgag gccggcgccg gtcgtgcgcc gggtcaggtg 960cgcgacgagc gcctccgacg cgcgggcctg ctcgaggttg atctcgacca ggtgatcgag 1020cgccctgaac gcctcgtcga gcgcggggtc gtgcacgccg agcagcgcgg gatcgtgcgc 1080gccaggcgcc gacgggagct cgtgggcgtc ggcggcgcgg cgccgggcga ggtagtcctc 1140cacgcgccgc tccgcgtgct cgaggatcct gccctcctcc gggctgctca cgtggtcgcg 1200cgccgccgcg aggccgctcc tcaggccttg ctcccacgcc gccagggagg gggccagctc 1260cccgcggccg gagccgaccg cgcggctgct ctggtgcgcg tcgagcagga ggtcgatctc 1320cagcctctcc acgagccgga cgctctcgac cgtggcgccg aggatccggg tggtctgttg 1380catggtcgtc gacgcgacca tcagcgcgcc cgcgacaacg atggccacgc tcgtgagaag 1440aagcgtggcg gccccgagga gcgcgctcag gcgcacgggc cgcggaagac ggggccagct 1500caggccctgc ggagttggct gtcgcat 1527141251DNASorangium cellulosum 14atgcccgccc gcaccccccg caagcccccg ccgcccgcct cgcccgctgg tcccgccggc 60gcgccggacg acctcaccga cagcgatcgc gacgcgctgc tgcgctggcg gctcgcgctc 120gggcccgagg ccgagcgggt cgacccgcgc ctctccctcg gcgggctcgg gggcgcggcg 180cccgcgctcg acgtcgacgc gcggcggctc ggcgacctcg acaaggcgct ctcgttcatc 240tacgacgagc gcgccggcgg cctcggcggc tcgcggccct acgtgcccga gtggctctcc 300gccgtgcgcg agttcttcag ccacgaggtc gtcgccctcg tccagaagga cgccatcgag 360cgaaaggggc tgacgcagct cctcttcgag cccgagacgc tgccgttcct cgagaagaac 420gtcgagctcg tcgccacgct catgagcgcc aagggcctca tcccggacgc cgcgcgggac 480accgcccggc agatcgtgcg cgaggtcgtc gaggaggtgc ggcgcgcgct cgaggccgag 540gtccgcaccg ccgtcctcgg cgcgctgcgc cggaacacga cgagcccgct gcgcgtcctc 600aggaacctcg actggaagcg caccatccgc aagaacctga aggggtggga cgcggagcgg 660cgccgcctcg tccccgacaa gctctatttc tgggcgaacc agacgcgacg gcacgagtgg 720gacgtcgcca tcctcgtcga ccagtcgggc tcgatgggcg agagcgtcgt ctacagctcc 780atcatggccg cgatcttcgc gtcgctcgac gtcctccgca cccggctcct cttcttcgac 840accgaggtcg tcgacgtgac tccgatgctc gtcgatccgg tcgacgtgct gttcacggcg 900cagctcggcg gcggcaccga catcaaccgc gccgtggcct acgcccaggc gaacttcatc 960gagcggcccg agaagacgct gctcatcctg atcaccgacc tgttcgaggg cggcaacgcc 1020gaggagctcg tcgcgcgcat gcgccagctc gccgacagca aggtgaagtc gatctgcctg 1080ctcgcgctgt cggacggcgg aaagccctcg tacgaccacg agatggcgca gaagctcgcc 1140gcgctcggga ccccgtgctt cggctgcacg ccgaagctcc tcgtcaaggt ggtggagcgg 1200ctcatgcgag gtcaggacct cggcccgctg ctcggcgccg aggcgcggtg a 1251151059DNASorangium cellulosum 15tcagggcgcg gcgagcggca gcctgcgtgc cgggcgcgcg gcctcgtgtc cgtcccccgc 60ctcggccacc cgcccgcggt agatgcgatc gatccgatcg cgcgcgatga ccaggggctt 120gtcgaaccgg ccaagcacgt tgcccttcag gatcccgcgc ttgtccgtca agcggtccag 180caaccgcata tcgaggcgca gctcgatgtt catggccacc tgcatggcgg gccagaggac 240ggcgccggcc ccgaacttgc tccagggcgc caggctcgcg aagagaaacg tatacatctc 300cgacgactcc gggcccaccg ggttgaagaa gaccgctgac cggagcggga aggtgacggg 360ctgattggtc ttcggatccc tgagggagtg gttgtagatc gtgtagaccg gcgagaagta 420ggatgtccag tccaccacga atatcgcatc ctccgggatg ccgagcagct tctccatcgc 480ccgcggcatg ggccgcctcg gacccgaatg cacgacccgg atcgtttcgt cggtcagggt 540cacccgcgcc tcgacctctg gcatccgctc gagcgggtag ccgagcatga agtggacgaa 600gggcgtgtgc tcgatctcga tgaaattgtc gagcgccagc tcgaacggca cggtcgcgcg 660gtggcggagg agaccgcgcg gcacatatcc ctcgccctcg aggcgcggga acgctgcctg 720cgaccccgcc cgcttcaccc agatggcacc gtaccgctcc acggcctcga acatgtcctc 780gcgccgcgcg cacggccgcg ccgccggggt agccgggatc tcgccgcggc cgtccacggc 840ccaacgccag ccatggtagg cgcacaccag ccgatcgccc tcgacccacc cctcgctcag 900gcgcatgctg cggtgggggc aacgatccgt gaatgcaccg aggccgcccg acgaggtccg 960aaacaccacg atctcatgcc ccgcgagccg cacattgcgg ggcttgcggc ggagctcgtg 1020gctcagcagt acagggtgcc agtggtcgag ctcagccat 1059161131DNASorangium cellulosum 16tcagttcacc ccttggatgt gccgcgcaat ccgcggcgcc tcggctgcga tgtcgcggat 60ctgccccgtg atgggattgc ggaagccgat gaagaacagc ccaggcgccg gcgtcggcgc 120gccgtgccac cgcgggcagc cgtgctcgtc cgtgtagcgc gttgcattct cgagaaaatc 180atcgagcccg ggccggtacc ccgtggcgag caccacgacg tcgaagggca gcccacggcc 240gtccgtgaac gtcacgcccg tttccgtgaa tgcccgcggg ccgggcacca ccttgatctt 300gccctgctgg atcagcgcca ccgtgccgat gtcgatcaac ggcatgcggc cttccttcaa 360cgcccgggta ccggggccga ccgcgggccg acggatcccc cagcgcgaca gatcccccac 420ggcgcgagac aggatcgcgg tcgcgaggcg atccccgacg gccagcggga ggcgctcgaa 480gagggcaagg gcgttgaact gcgcaggcag cttgaacagc tcgcggggga tcacgtggtt 540gccgctgcgg accgagaggg tcgtctccgc gcaatgctcc cacagatcca gcgcgatctc 600gctggcggag ttgccggcgc ccaccacgag cacgcgctgg ccccggaatt ccgcaccaga 660tcggtaggca gagctatgaa ggatgcgacc gcggaagcgc tcctggtcgg gccaggtggg 720gacgttggga tgacggctgt agccggtggc cacgacgagc gcctggctcc tgagctcccc 780cgcgtgcgtt cgggtcaccc accgcgatcc gtcgtggtac gcgcgctcca cctcgacacc 840caggcgcggc tccaggcgga atcgctcggc gtaacgctcg aggtaatcga ccatctccac 900ccgggaggga tacggcgcag aatactcggg ccagggctgc ccgggcagcg cggagagctg 960cttgatcgtg ttgaggtgca gccggtcgta gtggcgccgc cacgtggcgc cgacggcctc 1020cgacttctcg aggagaacga acgggattcc ctgctcgcgc aggcatgcgc ccaccgctag 1080cccagacgga ccagcgccga cgataaccac atggcactct tcaacgtgca c 1131171071DNASorangium cellulosum 17tcacgcactc gcatgcccga cgcccgtgcc ttctgcctcg ccccgcgtct cgccgaagta 60gatggagcgc atcaggcggt ggttgtgcac gagcgtcgcg tcgtatttgt tgagccgcat 120ccctttcatc tcgaagggcg tatcggccac gtgcgggatg aacttcacat cgtcgcggat 180ctccttccag gagagcgcta tcgccgccga tttgacgacc ggaagcagcg gacggaagcg 240gggatcggtg atcttgacga acaggaacgc gcgcacgaac gtggtgcgct ccgtctctgg 300cacgaagaag atgccggcgc gcgccacgac aggacgctcc atcccgttct gcgccgtcca 360ccaggacgtg tacacggtgt agacggggct gaagcgggtc acccactggt tgtgaaatgt 420gtcgcctggc tggagcagca tcagccgcgc gagcgtcgag gggcgctgcg gcgccgagta 480cttgacctcg gtgcggtcct cgaagacgtc gcacgagaag tcgatgcgcg ccgcgtcctc 540gggcgtccag ccgaggcggc cgtgaacgaa cggcgtgtgc tcgtcctcgg aggaattgtc 600gaagatgacg tgcaggggcg ccggcgcgag gtgcgagaag gtgccggcat attcgaagcc 660atcgctgctg aagtcgagct cgggcagcgc cgagcgcggc gtatcccggt gggctagcca 720caggtatcca agctgctcga cgagctgaaa ggagcgtgta tcgcatcggg tgagcgacgg 780ttgcgagggg caggctcccc gcccctcggc gtcgaaatgc cacccgtgat aggggcattc 840caggcgcccg tccggccgga cacgcccctg cgatagcggc gcgagccggt gggggcacgc 900atcggcgagc gcggcggggc ggccctgctc atcgcggaag agagcgtaag cattgcccgc 960aaggacaacg cgaaccggct tccggccgag tttcgaggcc ggcaagacgg ggtgaaaatg 1020gcggatgagg tcgcgagcag gcgcggcgtg cattgcgaga ccataacaca t 1071181188DNASorangium cellulosum 18ctatcggtag gcgacgatgc caccgaacgg ccacttcgcg tggtcctcgg gcgccggata 60gacctcccat tcggagaacc cggccgcgcg gagcgacagc tcccactctt gcagcgtcag 120gtaaccgaca tgctggcggc gaggcggatc gagcttggcc ttgctgtagg tgtgcagcat 180cgactgaaaa aattcattgg ggaagaacac cccgggccga tcgcggaacg acatggtgaa 240cgcgagctga ccgcccggct tcagcatcgt gtggaacgcc tggagggtgg cgtgaagatc 300gcgcacgtcg tagagcacgt gctcgaggac gatcagatcg accgacgcgg cccgggcgaa 360cgtgctgcca gcggagggca gcgtgtccag gtccaggcgc tggaaatgaa tgcgctgaaa 420cacgtcggcc ggcgcgtggg tccgcagcca ctgcttcccc gtctccatca acagggcgct 480gatgtcggtg taatcgtagc gggcgaggtt cttgctcagc gggaggaacc gcggatcgga 540caacgcctgc cgcagcacca cgccgagccc cgcgcccccc tcgaatacag agatccccgg 600cccctctgcg agcttggcca tcagcgcccg cgccagcatc acgttgcatg gcttcttggc 660gggaaggctg atcatcgagt attcccagaa tttcagcgag gcctgcatcc cgtactggag 720atccatggtg gccagcgcgt ccttgcccgc cagcaccggc ccggccaggc cccgatagcg

780ctggaggaac tcgaccatct cgcccaggat cgcgcggtct gcgagcgcga tggactcctt 840ctcggcgacg cgctttcgca ccgcctcgct gggcaccagc cgcccgctgg ggtcctgggt 900gaggtctccc ttgtcgctga agtagtcgag cagcttcctg cgaaactgat aggcggtgac 960cgacggagcc gactccggac gatcgtcgag cccccggaca gcgccgctcg ggtcgacgag 1020gtgctcgagc aggatctcgc tggcaacaag ctcggtctga cgacggaatg cttctatgta 1080agcggtgtaa gcgtcgttgt agagatcggt cacgtccaat cgttgtcgca tgcaggtcct 1140cgcgggtgtg gcgcccatcc tgcgcagcgc agggacgaag caggtcat 118819255DNASorangium cellulosum 19tcacctggag ctcagcgcct gcccgtcgtt cccgcggttc ttgtgcacaa tggcgtacag 60gatgagcatg taggcgaaga gccggaacag gtacaggtaa tggatggcgt cttcctcgac 120gcgattcagg gcgacggcga tgcggcccag catcatcagc cagaacgccg ccgagaactt 180cgcgaacagc cggtcgcccg tcttcttcca gaagcggagg aagaagagcg cgacggtcgc 240gtacccgaac gtcat 25520261DNASorangium cellulosum 20ttactcgcgc aggtcccaga tgaggccata aaggagcagg gccagcccga tgagcgcggt 60gaggtggcgc agcgatgata gatcgacgct ccggatcacg acgaggtcca cgaagagcag 120gatgttgttc gctgcgagcg cggcgaagca gagcccgctc cacaagagga gacggacctt 180gcgctgcgcg tatccgcgca ggagcagcac ggcgcacgcg atgctggtca gggcgcagag 240gatgtagacc gccgctgcca t 26121402DNASorangium cellulosum 21ctagccgccc tttcccttct tcgtgatcag gaatgcgtcc gagaagctct ggatgtcgct 60cggcgggggc gtggcgtaga tgtgattgat cacgctcagc cggcgctcct tgtacgcctg 120cgccaggtcg tcgatcgtcc ggcgggtctc atcgtctgcc ggggcgtacc ggtagaagat 180gtcctccccg tcctcccggg ccacgatcag gcccctgctg gccaggcctc cgaaccggtc 240ctggatcgac atcatgctgg accctatctc gcgcgccatc gcggccgcgc tccactcgcg 300ctccgccgtg cgacgcatga gcagaagcac ttcgagttgc tcgatcgagg agatgtgcgc 360gccgaggaag cgctggaccc ggtcggggag cccgctagac ac 402225289DNASorangium cellulosum 22tcaccggtgc aaccatagcc gcagcatagc gagcaggtgc tcgggatcca ccggcttcga 60gatgtaatcg ttcgcgcccg cctcgaagca cttctcccgg tcgcccttca tcgccttggc 120cgtgaccgcg atgatgggca gcgcatggtg ctcgggcttc gcgcggatgg cacggatcgt 180gtcgtagccg tccatctctg gcatcatgat gtccatgagc acgatctcga tgtccggcgt 240ccgctgcagc atctcgatcg ccgctctgcc cgtctccacg tagaccgtct tcatctgctg 300ggcgtcgagg atggtcgtca tcgcgaagat gttccggacg tcgtcgtcga cgaccagcac 360cttcttgccc gcgagcacct tgttcgactg gtgcagctcc tggagggtct gccgctgtcg 420ctcggagagc gccgccacag ggcggtgcag gaacagggag acgtcgtcga agagccgctc 480cttggagcgg acgtgcttga gcaccatcag ctggctgaag cggctcagct gcgcctcgtc 540cgcggccgag atctcctccg gcgcgtagac caggacgggc agctccgtcg gcccgctgcc 600ctgcgcgagc tgcccgatca gatcgaagca gcgcatgtcg ggcaggtcga ggtgcaggat 660gaggacatcg gccccctcgg tgaggagcgc gtcgagcgcc tcctccccgg aggccacgct 720ccggatcgtg acgtcgtcgc cgccgaggag ctcgacgagc tcctggcgct cggcctcgtc 780cggctcggcg agcacgaccg tccgccggcg cgacaccatg aactgcgaga ggcgcctgaa 840ggtctcgtcg agcgcgtccc gggtcttgag cggcttgcag agcacccccg tcgcgcccat 900ccggagcgcg cgctcgcgct cctcgtccgt cgtgatcacc tggacgggga tgtgccgcgt 960cgcgaggtcg cgcttcaccc ggtcgagcac gcgccagccg tccatgtccg gcaggttgat 1020gtcgagcgtg atcgcgttca cccgccgctc gcggacgatg gagagcgccg ccccgccgcg 1080gtaggcgagg atcgccttga acccgtggtc gtgcgcgaca tccatgacga agtgcgcgaa 1140gctcgcgtcg ttctcgacga tgagcaccac ggagtcgctg ggctggaggc tcgcgctgtc 1200gtcgacgctc tggttgagca ggtgcggcgg cggctcggcc gccgaccgcg gcgcgacgtc 1260gcccgagacg agggccggcg gcgccgaggg cacctccgcg gcctgctcct tcctgcgcgg 1320gcgcgccggc gtgtacgtga gcggcaggta aagcgtgaag gtgctcccgc tccccggcct 1380gctcgagagc ttgatctcgc cgccgagcat ccacgcgatc tcgcggctga tcgcgagccc 1440gaggccggtg ccgccgtact tccggctcgt cgagccgtcc gcctgctgga aggcctcgaa 1500gatgatctgc tgcttgtcgt gcgggatgcc gatgcccgtg tcccgcaccg acatggcgat 1560cgccgcgccg gcgcgcgaga ggccctcgtt ctcgatggtc caccccgagg tgaccagatc 1620gacgtcgagc gcgacgctgc cgcgctccgt gaacttgaag gagttcgaga gcaggttctt 1680gagcacctgc tgtacgcgct tcgcgtccgt gtagatgacc tgcggcaggt tctgcgcgaa 1740gttgagctcg aactcgagcc tcttcgactc ggcgacgtgc tggaacgtgc gctcgacgta 1800gtcttgcagg tcgctgaacg acagctcgcc cacgtcgacg atcacggtcc ccgactcgat 1860cttggacagg tccaggatgt cgttgatcag cgcgagcagg tcgttgcccg acgagtggat 1920cgtcttggcg aactcgacct gccgccccgt gaggttgcgg tcggtgttct tcgagagctg 1980atcggacagg atgaggaggc tgttcagcgg cgtccggagc tcgtgcgaca tgttcgcgag 2040gaactccgac ttgtacttgg aggtgatggc gagctgccgc gccttctcct cgagcgcctg 2100ccgcgcctgc tcgacctcgc ggttcttccg ctcgacctcg acgttctgct gggcgagcag 2160gcgagccttc tccccgagct cggcgttcgt ctgctgcagc tcctcctgct ggctctggag 2220ctcgcgcgcg agggactgcg actgcttgag caggtcctct gtgcgcatgt tcgcctcgat 2280cgtgttgagc acgatcccga tcgactccgt gagctggtcg aggaacgcct ggtgggtcgg 2340gctgaatcgc tcgaacgacg cgagctcgat gaccgccttg acctgcccct cgaagagcac 2400ggggatgacg atgatgttga ccggcggcgc ctcgccgagc ccgctcgtga tgcggatgta 2460gtcggggggc gcgttgacga ggaggatctt ctccttctcg agcgcgcatt gcccgacgag 2520cccttcgccg agcttgaaat ggttgtcgac gtgcttccgc accttgtacg cgtagctcgc 2580gaggagcttg aggatcggct cctccttcgc cacgtccatc gtgaagaaca cgccctgctg 2640cgcgccgacg accggggcca gctcggacag gatgagccga ccgacagtga gcagatcctt 2700ctgcccctgg agcatgcgcg agaacttggc gaggttggtc ttgagccagt cctgctcgct 2760gttcttcagc gtcgtgtcct tgaggttccg gatcatctca ttgatggtgt ccttgagcgc 2820cgcgacctcc ccctgcgcct cgaccttgat ggaccgggtg aggtcgccct tggtcacggc 2880ggtggcgacc tcggcgatcg cgcgcacctg cgtggtgagg ttcgcggcga gccggttcac 2940gttgtcggtc aggtccttcc acgtgccggc cgcgccgggg acgctcgcct gaccgccgag 3000cttgccctcg acgccgacct cgcgcgccac cgttgtcacc tggtcggcga aggtcgcgag 3060cgtctcgatc acgccgttga tcgtgtccgc cagcgccgcg atctcgccct tcgcgtcgaa 3120ggccagcttg cgcttcaggt cgccgttcgc gaccgcggtc acgaccttgg cgatgccgcg 3180cacctggttc gtcaggttgc cggccatgaa gttcacgttg tcggtcaggt ccttccacgt 3240gccggcgacg ccggggacgc tggcctgccc gccgagcttg ccctcggtgc ccacctcgcg 3300cgccacgcgc gtcacctccg acgcgaacgc gttgagctgg tccaccatcg tgtagttgat 3360ggtgttcttc agctccagga tctcgccgcg gacatcgacg gtgatcttct tcgacaggtc 3420gccgttggcc acggccgttg tgacggcggc gatgttgcgc acctgcgcgg tcaggttcga 3480cgccatcgag ttgacggagt cggtcaggtc cttccacgtg ccggcgacgc cggggacgct 3540ggcctggccg ccgagcttgc cctcggtgcc cacctcgcgc gccacgcgcg tcacctccga 3600cgcgaacgag cggagctgat ccaccatcgt gttgaaggtg tccttcagct ccaggatctc 3660gccgcggaca tcgacggtga tcttcttcga caggtcgccg ttggccacgg ccgttgtgac 3720ggcggcgatg ttgcgcacct gcgcggtcag gttcgacgcc atcgagttga cggagtcggt 3780caggtccttc cacgtgccgg cgacgccctt cacctcggcc tgcccgccga gcttgccctc 3840ggtgcctacc tcgcgcgcga cgcgcgtcac ctcggccgcg aaggagctga gctgatccac 3900catcgtgttg aaggtgttct tcagctccag gatctcgccc ttgacgtcga cggtgatctt 3960cttcgacagg tcgccgcggg ccacggccgt ggtcacgtcg gcgatgttgc gcacctgcgc 4020ggtcaggttc gacgccatcg aattgacgga gtcggtcagg tccttccacg tgccggcgac 4080gccggggacg ctggcctggc cgccgagctt tccctcggtg cccacctcgc gcgccacgcg 4140cgtcacctcc gacgcgaacg agcggagctg atccaccatc gtgttgaagg tgtccttcag 4200ctccaggatc tcgccgcgga catcgacggt gatcttcttc gacaggtcgc cgttggcgac 4260ggccgtggtg acggcggcga tgttgcgcac ctgcgcggtc aggttcgacg ccatcgagtt 4320gacggagtcg gtcaggtcct tccacgtgcc ggcgacgccg gggacgctgg cctggccgcc 4380gagcttgccc tcggtgccca cctcgcgcgc cacgcgcgtc acctccgacg cgaacgagcg 4440gagctgatcc accatcgtgt tgaaggtgtc cttcagctcc aggatcttct tcgacaggtc 4500gccgttggcc acggccgttg tgacggcggc gatgttgcgc acctgcgcgg tcaggttcga 4560cgccatcgag ttgacggagt cggtcaggtc cttccacgtg ccggcgacgc ccttcacctc 4620ggcctgcccg ccgagcttgc cctcggtgcc tacctcgcgc gcgacgcgcg tcacctcggc 4680cgcgaaggag ctgagctgat ccaccatcgt gttgaaggtg ttcttcagct ccaggatctc 4740gcccttgacg tcgacggtga tcttcttcga caggtcgccg cgggccacgg ccgtggtcac 4800gtcggcgatg ttgcgcacct gcgcggtcag gttcgacgcc atcgaattga cggagtcggt 4860caggtccttc cacgtgccgg cgacgccggg gacgctggcc tggccgccga gctttccctc 4920ggtgcccacc tcgcgcgcca cgcgcgtcac ctccgacgcg aacgagcgga gctgatccac 4980catcgtgttg aaggtgtcct tcagctccag gatctcgccg cggacatcga cggtgatctt 5040cttcgacagg tcgccgttgg cgacggccgt ggtgacgtcg gcgatgttgc ggacctgcgc 5100ggtcaggttc gacgccatcg agttgacgga gtcggtcagg tccttccacg tgccggcgac 5160gcctgtcacc tcggcctgcc cgccgagctt gccctcggtg cctacctcgc gcgccacgcg 5220cgtcacctgg gccgcgaagg agcggagctg atccaccatc gtgttgaagg tgttcttcag 5280ctccaggat 5289231075PRTSorangium cellulosum 23Met Pro Asp Thr Ser Ser Ser Ser Pro Val Met Ala Met Gly Leu Ser1 5 10 15Asp Ser Lys Ala Arg Ser Val Glu Asp Ala Arg Pro Ala Ser Gly Leu 20 25 30Pro Arg Pro Pro Ala Gly Ile Ala Val Val Gly Met Gly Cys Arg Phe 35 40 45Pro Gly Gly Ile Asp Ser Pro Gly Ser Leu Trp Ala Ala Leu Ser Gln 50 55 60Gly Arg Asp Leu Ile Ser Glu Val Pro Pro Asp Arg Trp Asp Val Asn65 70 75 80Ala His Tyr Asp Ala Asp Ala Ser Val Pro Gly Lys Ile Ala Thr Arg 85 90 95His Gly Gly Phe Leu Ala Gly Val Ala Ala Phe Asp Ala Pro Phe Phe 100 105 110Asp Leu Ser Pro Arg Glu Ala Lys His Met Asp Pro Gln Gln Arg Leu 115 120 125Gly Leu Glu Thr Ala Trp Glu Ala Leu Glu Asp Ala Gly Leu Asp Ala 130 135 140Arg Ser Leu Arg Gly Ser Arg Ala Gly Val Phe Val Gly Ser Met Trp145 150 155 160Ala Glu Tyr Asp Val Leu Ala Ser Arg His Pro Glu Ser Ile Ser Pro 165 170 175His Gly Ala Thr Gly Ser Asp Pro Gly Met Ile Ala Ala Arg Ile Ala 180 185 190Tyr Thr Phe Gly Leu Arg Gly Pro Ala Leu Ser Val Asn Thr Ala Ser 195 200 205Ser Ser Ser Leu Val Ala Val His Leu Ala Leu Gln Ser Leu Gln Ser 210 215 220Gly Glu Cys Glu Leu Ala Leu Ala Gly Gly Ala Asn Leu Ile Leu Thr225 230 235 240Pro Tyr Asn Thr Ile Lys Met Thr Lys Leu Gly Thr Met Ser Pro Asp 245 250 255Gly Arg Cys Lys Ala Phe Asp His Arg Ala Asn Gly Tyr Val Arg Ala 260 265 270Glu Gly Val Gly Phe Val Val Leu Lys Pro Leu Ser Arg Ala Thr Ala 275 280 285Asp Gly Asp Arg Ile Tyr Ala Val Val Arg Gly Ser Ala Val Asn Asn 290 295 300Asp Gly Leu Thr Asp Gly Leu Thr Ala Pro Ser Gly Glu Ala Gln Glu305 310 315 320Ala Val Leu Arg Glu Ala Tyr Ala Arg Ala Gly Val Ser Pro Ala Glu 325 330 335Val Asp Tyr Val Glu Ala His Gly Thr Gly Thr Pro Leu Gly Asp Arg 340 345 350Val Glu Ala Thr Ala Leu Gly Arg Val Leu Gly Ala Gly Arg Ala Ala 355 360 365Asp Arg Ala Leu Arg Val Gly Ser Val Lys Thr Asn Leu Gly His Ala 370 375 380Glu Ala Ala Ala Gly Val Ile Gly Leu Met Lys Thr Ala Leu Ser Leu385 390 395 400Arg His Gly Ser Leu Pro Ala Ser Leu His Val Glu Arg Pro Asn Pro 405 410 415Glu Ile Pro Leu Glu Ser Leu Gly Leu Arg Leu Gln Thr Ala His Gly 420 425 430Val Trp Pro Glu Val Asp Arg Pro Arg Arg Ala Gly Val Ser Ser Phe 435 440 445Gly Phe Gly Gly Thr Asn Cys His Val Val Ile Glu Glu Trp Arg Gly 450 455 460Gly Leu Gln Gln Ser Ala Ala Glu Ala Gly Ser Asp Pro Gly Ala Ala465 470 475 480Val Pro Pro Pro Gly Leu Pro Leu Val Leu Ser Ala Arg Asp His Gly 485 490 495Ala Leu Arg Ala Gln Ala Gly Arg Trp Ala Ala Trp Leu Thr Glu His 500 505 510Arg Glu Ala Arg Trp Ala Asp Val Val His Thr Ala Ala Val Arg Arg 515 520 525Thr His Leu Gly Ala Arg Ala Ala Val Met Ala Ala Gly Val Ala Glu 530 535 540Ala Val Asp Ala Leu Lys Ala Leu Ala Asp Gly Arg Ala His Gly Ala545 550 555 560Val Thr Val Gly Glu Ala Arg Glu Arg Gly Lys Val Val Phe Val Phe 565 570 575Pro Gly Gln Gly Ser Gln Trp Pro Ala Met Gly Arg Ala Leu Leu Ser 580 585 590Ala Ser Lys Val Phe Ala Glu Ala Val Glu Ala Cys Asp Ala Ala Leu 595 600 605Arg Pro Leu Thr Gly Trp Ser Val Leu Ser Leu Leu Arg Gly Asp Ala 610 615 620Gly Glu Ala Ala Pro Ser Leu Asp Arg Val Asp Ala Val Gln Pro Ala625 630 635 640Leu Phe Ala Met Ala Val Gly Leu Ala Ala Val Phe Arg Ala Trp Gly 645 650 655Leu Asp Pro Ser Ala Val Val Gly His Ser Gln Gly Glu Val Pro Ala 660 665 670Ala Tyr Val Ala Gly Ala Leu Ser Leu Asp Asp Ala Ala Arg Val Val 675 680 685Ala Val Arg Ser Ala Leu Val Arg Arg Leu Ala Gly Ala Gly Ala Met 690 695 700Ala Ala Val Glu Leu Pro Ala Gly Glu Val Glu Arg Arg Leu Ala Pro705 710 715 720Phe Gly Gly Ala Leu Ala Ile Ala Val Val Asn Thr Ser Ser Ser Thr 725 730 735Ala Val Ser Gly Asp Ala Glu Ala Val Asp Arg Leu Val Ala Gln Leu 740 745 750Glu Ala Glu Gly Ile Phe Cys Arg Lys Val Asn Val Asp Tyr Ala Ser 755 760 765His Ser Ala His Val Asp Val Val Leu Pro Glu Leu Leu Glu Arg Leu 770 775 780Ala Pro Val Arg Pro Gly Ala Thr Arg Ile Pro Phe Tyr Ser Thr Val785 790 795 800Thr Gly Gly Val Leu Glu Gly Thr Ala Leu Asp Gly Ala Tyr Trp Cys 805 810 815Arg Asn Leu Arg Gln Pro Val Arg Leu Asp Arg Ala Leu Ala Arg Leu 820 825 830Leu Asp Asp Gly His Gly Val Phe Val Glu Val Ser Ala His Pro Val 835 840 845Leu Ala Ser Pro Leu Thr Ala Ala Cys Ala Glu Arg Glu Gly Val Val 850 855 860Val Gly Ser Leu Gln Arg Asp Asp Gly Gly Leu Ala Arg Leu Leu Gly865 870 875 880Ser Leu Gly Ala Leu His Val Gln Gly Gln Pro Val Asp Trp Arg Ala 885 890 895Val Leu Ala Pro Phe Gly Gly Ser Leu Val Asp Leu Pro Thr Tyr Ala 900 905 910Phe Gln Arg Gln Arg Tyr Trp Phe Asp Thr Asp Glu Ser Val Ala Leu 915 920 925Ala Ala Ala Ser Ser Val Ala Glu Glu Ser Trp Ser Glu Lys Leu Ala 930 935 940Gly Leu Ser Ser Ala Arg Arg Glu Glu Arg Leu Leu Glu Trp Val Arg945 950 955 960Ala Glu Ile Ala Ala Val Leu Gly Leu Glu Ala Pro Ala Val Pro Pro 965 970 975Asp Val Leu Leu Arg Asp Leu Gly Leu Lys Ser Pro Ile Ala Val Glu 980 985 990Leu Gly Ser Arg Leu Gly Arg Arg Thr Arg Arg Lys Leu Pro Val Thr 995 1000 1005Phe Val Tyr Asn His Pro Thr Pro Arg Ala Ile Ala Arg Ala Leu 1010 1015 1020Leu Glu Gly Met Phe Ser Ser Ile Lys Asp Ser Ala Ser Ser Ala 1025 1030 1035Ala Asp Asp Arg Arg Pro Pro Gly Val Leu Glu Asp Val Ala Pro 1040 1045 1050Pro Gln Ala Leu Glu Thr Ser Glu Met Ser Asp Asp Glu Leu Phe 1055 1060 1065Gln Ser Ile Asp Ala Leu Val 1070 1075243679PRTSorangium cellulosum 24Met Asp Arg Ser Asp Lys Leu Arg Ala Tyr Leu Glu Lys Thr Thr Ala1 5 10 15Ser Leu Val Glu Ala Lys Gly Arg Ile Arg Glu Leu Glu Ala Arg Ser 20 25 30Arg Glu Pro Ile Ala Ile Val Ala Met Ala Cys Arg Phe Pro Gly Gly 35 40 45Val Asp Ser Pro Glu Lys Leu Trp Ala Leu Leu Asp Glu Glu Arg Asp 50 55 60Ala Ile Thr Glu Val Pro Pro Ser Arg Trp Asp Leu Glu Arg Phe Tyr65 70 75 80Asp Pro Asp Pro Asp Ala Ala Gly Lys Thr Tyr Ser Arg Trp Gly Gly 85 90 95Phe Val Gly Asp Leu Asp Arg Phe Asp Ala Ala Phe Phe Gly Ile Ser 100 105 110Pro Arg Glu Ala Arg Ser Ile Asp Pro Gln Glu Arg Trp Leu Leu Glu 115 120 125Thr Thr Trp Glu Ala Leu Glu Arg Ala Gly Val Arg Ala Asp Thr Leu 130 135 140Glu Gly Thr Leu Gly Gly Val Tyr Ile Gly Leu Ser Gly Ser Glu Tyr145 150 155 160Gln Thr Glu Ala Phe His Asp Ala Glu Arg Ile Asp Ala Tyr Ser Leu 165 170 175Thr Gly Ala Ser Pro Ser Thr Thr Val Gly Arg Leu Ala Tyr Trp Leu 180 185 190Gly Leu Arg Gly Pro Ala Val Ala Val Asp Thr Ala Cys Ser Ser Ser 195 200 205Leu Val Ala Val His Leu Ala Cys Gln Ala Leu Arg Asn Gly Glu Cys 210 215 220Asp Phe Ala Leu Ala Gly Gly Val Asn Ala Leu Leu Ala Pro Glu Ser225 230 235 240Tyr Val Ala Phe Cys Arg Leu Arg Ala Leu Ser Pro Thr Gly Arg Cys 245 250

255Gln Thr Phe Ser Ala Asp Ala Asp Gly Tyr Val Arg Ala Glu Gly Cys 260 265 270Gly Val Leu Leu Leu Lys Arg Leu Ser His Ala Gln Arg Asp Gly Asp 275 280 285Arg Val Leu Ala Val Ile Arg Gly Asn Ala Ile Asn Gln Asp Gly Arg 290 295 300Ser Gln Gly Leu Thr Ala Pro Asn Gly Leu Ala Gln Glu Asp Val Ile305 310 315 320Arg Arg Ala Leu Ser Gln Ala Ala Val Glu Pro Thr Thr Val Asp Val 325 330 335Val Glu Cys His Gly Thr Gly Thr Ala Leu Gly Asp Pro Ile Glu Val 340 345 350Gln Ala Leu Gly Ala Val Tyr Gly Asp Gly Arg Pro Gly Asp Arg Pro 355 360 365Leu Val Ile Gly Ser Val Lys Thr Asn Ile Gly His Thr Glu Ala Ala 370 375 380Ala Gly Met Ala Gly Leu Ile Lys Ala Val Leu Ser Leu Gln His Ala385 390 395 400Gln Val Pro Arg Ser Leu His Phe Ala Ala Pro Ser Pro Tyr Ile Pro 405 410 415Trp Asp Thr Leu Pro Val Arg Val Ala Ala Gln Arg Val Ala Trp Glu 420 425 430Arg Arg Glu His Pro Arg Arg Ala Gly Ile Ser Ser Phe Gly Ile Ser 435 440 445Gly Thr Asn Ala His Val Ile Leu Glu Glu Ala Pro Glu Ala Pro Ala 450 455 460Thr Ala Pro Glu Ala Ala Ala Val Thr Ser Thr Leu Pro Leu Leu Val465 470 475 480Ser Gly Arg Asp Glu Ala Ala Leu Arg Ala Gln Ala Glu Arg Trp Ala 485 490 495Ala Trp Leu Ala Ala His Pro Glu Ala Arg Trp Ala Asp Val Val His 500 505 510Thr Ala Ala Val Arg Arg Thr His Leu Glu Ala Arg Ala Ala Val Ala 515 520 525Ala Gly Asn Ala Ala Asp Ala Ala Ala Ala Leu Gly Ala Leu Ala Ala 530 535 540Gly Gln Pro His Lys Ala Val Ser Leu Gly Glu Ala Arg Ala Arg Gly545 550 555 560Asp Val Val Phe Val Val Pro Gly Gln Gly Ser Gln Trp Pro Ala Met 565 570 575Gly Arg Ala Leu Leu Ala Glu Ser Glu Val Phe Ala Ala Ala Val Ala 580 585 590Ala Cys Asp Ala Ala Leu Arg Pro Phe Thr Gly Trp Ser Val Leu Ser 595 600 605Val Leu Arg Gly Glu Gln Gly Glu Ala Val Pro Pro Ala Asp Arg Val 610 615 620Asp Val Val Gln Pro Ala Leu Phe Ala Met Ala Val Gly Leu Ser Ala625 630 635 640Val Trp Arg Ala Trp Gly Ile Glu Pro Ser Ala Val Val Gly His Ser 645 650 655Gln Gly Glu Val Ala Ala Ala Tyr Val Ala Gly Ala Leu Thr Leu Glu 660 665 670Asp Ala Ala Arg Val Val Ala Leu Arg Ser Gln Leu Val Arg Arg Ile 675 680 685Ala Gly Gly Gly Ala Met Ala Val Ile Glu Arg Pro Val Gly Glu Val 690 695 700Glu Gln Arg Leu Ser Arg Phe Gly Gly Gln Leu Ser Val Ala Ala Val705 710 715 720Asn Thr Pro Gly Ser Thr Val Val Ser Gly Asp Ala Ala Ala Val Asp 725 730 735Arg Leu Leu Ala Glu Leu Glu Thr Ala Arg Val Phe Ala Arg Arg Ile 740 745 750Lys Val Asp Tyr Ala Ser His Ser Ala His Val Asp Ala Ile Leu Pro 755 760 765Glu Leu Glu Ala Cys Leu Ala Ser Val Glu Pro Arg Thr Cys Ala Ile 770 775 780Pro Leu Tyr Ser Thr Val Thr Gly Glu Val Leu Ala Gly Pro Glu Leu785 790 795 800Gly Ala Thr Tyr Trp Cys Arg Asn Leu Arg Glu Pro Val Arg Leu Asp 805 810 815Arg Ala Leu Ser Arg Leu Leu Ala Asp Gly His Gly Val Phe Val Glu 820 825 830Val Ser Ala His Pro Val Leu Ala Met Pro Leu Ser Ala Ala Ser Ala 835 840 845Glu Arg Gly Gly Val Val Val Gly Ser Leu Gln Arg Asp Asp Gly Gly 850 855 860Leu Gly Arg Leu Thr Ser Met Leu Gly Ala Leu His Val His Gly His865 870 875 880Ala Val Ser Trp Gln Arg Val Leu Ala Pro Tyr Gly Gly Ala Leu Val 885 890 895Gly Leu Pro Thr Tyr Ala Phe Gln Arg Gln Arg His Trp Leu Glu Ala 900 905 910Pro Arg Tyr Ala Ala Glu Asp Thr Asp Gly Ala Ala Arg Arg Asp Pro 915 920 925Leu Tyr Arg Val Thr Trp Ile Glu Ala Ala Leu Glu Glu Ala Pro Trp 930 935 940Ala Pro Glu Arg His Val Val Leu Gly Gly Gly Gly Ala Leu Ala Ala945 950 955 960Gly Leu Gly Ala Leu Ala Leu Ala Gly Leu Pro Glu Leu Leu Glu Ala 965 970 975Leu Glu Asn Arg Ala Ala Ala Pro Glu Arg Leu Val Leu Asp Leu Thr 980 985 990Glu Gly Arg Pro Gly Ala Val Ala Glu Ser Val His Ala Thr Thr Arg 995 1000 1005Asp Ala Leu Ala Leu Val Gln Ala Trp Leu Ala Ala Pro Arg Leu 1010 1015 1020Ser Gly Thr Glu Leu Val Val Val Thr Arg Glu Ala Val Ala Ala 1025 1030 1035Gly Pro Asp Glu Gly Val Ala Ala Leu Gly Pro Ala Ala Val Trp 1040 1045 1050Gly Leu Leu Arg Thr Ala Arg Val Glu His Pro Glu Arg Ala Val 1055 1060 1065Arg Ala Val Asp Leu Gly Arg Glu Pro Leu Asp Val Ala Val Leu 1070 1075 1080Arg Arg Ala Leu Gly Ala Val Ala Glu Pro Glu Leu Ala Leu Arg 1085 1090 1095Ala Gly Gly Ala Arg Ala Ala Arg Leu Arg Ala Val Asp Ala Gly 1100 1105 1110Ala Gly Ala Arg Glu Pro Ala Ala Ala Leu Asp Pro Gln Gly Thr 1115 1120 1125Val Trp Ile Thr Gly Gly Thr Gly Glu Leu Gly Arg Gln Ile Ala 1130 1135 1140Arg His Leu Val Ala Ala His Gly Val Arg His Leu Leu Leu Thr 1145 1150 1155Ser Arg Arg Gly Ala Ala Ala Pro Asp Ala Glu Ala Leu Val Glu 1160 1165 1170Gln Leu Arg Ala Asp Gly Ala Glu Thr Val Glu Val Val Ala Cys 1175 1180 1185Asp Val Thr Asp Gly Ala Ala Leu Ser Ala Ala Val Gln Ala Ala 1190 1195 1200Ala Ala Arg His Pro Leu Thr Ala Val Val His Thr Ala Gly Glu 1205 1210 1215Leu Ala Asp Gly Val Leu Thr Gly Leu Thr Ala Glu Gln Leu Ala 1220 1225 1230Arg Val Leu Ala Pro Lys Val Asp Gly Ala Cys His Val Tyr Ala 1235 1240 1245Ala Ala Gln Asp Gln Pro Leu Ala Ala Phe Val Leu Phe Ser Ser 1250 1255 1260Ile Val Gly Thr Leu Gly Asn Ala Gly Gln Ala Asn Tyr Gly Ala 1265 1270 1275Ala Asn Ala Phe Leu Asp Ala Phe Ala Ala Gln Leu Arg Ala Arg 1280 1285 1290Gly Val Pro Ala Thr Ser Leu Ala Trp Gly Phe Trp Glu Gln Ala 1295 1300 1305Gly Leu Gly Met Thr Ser His Leu Gly Ala Ala Asp Leu Ala Arg 1310 1315 1320Leu Arg Arg Gln Gly Leu Ala Pro Leu Ser Val Ala Gln Gly Leu 1325 1330 1335Arg Leu Leu Asp Arg Ala Leu Ala Arg Ala Glu Ala Thr Leu Val 1340 1345 1350Pro Ala Ala Leu Asp Leu Pro Ala Leu Gln Arg Ala Ala Ser Asp 1355 1360 1365Ala Gly Arg Val Pro Pro Leu Leu Arg Gly Leu Val Arg Thr Ser 1370 1375 1380Pro Gly Arg Pro Thr Ala Thr Ala Thr Pro Glu Ala Gly Pro Ala 1385 1390 1395Ala Ser Ala Leu Arg Ala Arg Leu Ser Ala Leu Pro Glu Ala Glu 1400 1405 1410Arg Pro Gly Ala Leu Leu Asp Leu Val Arg Thr Glu Val Ala Val 1415 1420 1425Val Leu Gln Leu Ala Gly Pro Ala Gln Val Pro Ala Asp Lys Pro 1430 1435 1440Leu Lys Glu Leu Gly Leu Asp Ser Leu Thr Ala Val Glu Leu Arg 1445 1450 1455Asn Arg Leu Gly Ala Arg Ala Glu Thr Val Leu Pro Thr Thr Leu 1460 1465 1470Ala Phe Asp His Pro Thr Pro Arg Ala Ile Ala Asp Leu Leu Leu 1475 1480 1485Gln Arg Ala Phe Ser Glu Leu Ala Ala Ala Lys Ala Thr Arg Ala 1490 1495 1500Arg Gly Ala His Asp Glu Pro Ile Ala Ile Val Ser Met Ala Cys 1505 1510 1515Arg Leu Pro Gly Ser Val Asp Thr Pro Ala Ala Leu Trp Lys Leu 1520 1525 1530Leu Ala Glu Gly Arg Asp Ala Ile Gly Pro Phe Pro Glu Gly Arg 1535 1540 1545Gly Trp Asp Val Ala Gly Leu Tyr Asp Pro Asp Pro Asp Val Pro 1550 1555 1560Gly Lys Ser Ile Thr Thr Gln Gly Gly Phe Leu Tyr Asp Ala Asp 1565 1570 1575Arg Phe Asp Pro Thr Phe Phe Gly Ile Ser Pro Arg Glu Ala Glu 1580 1585 1590Arg Met Asp Pro Gln Gln Arg Leu Leu Leu Glu Cys Ala Trp Glu 1595 1600 1605Ala Leu Glu Arg Ala Gly Leu Ala Pro His Ala Leu Glu Ala Ser 1610 1615 1620Ala Thr Gly Val Phe Val Gly Leu Ala His Gly Asp Tyr Gly Gly 1625 1630 1635Arg Leu Leu Gln Gln Leu Glu Ser Phe Asp Gly His Val Leu Thr 1640 1645 1650Gly Asn Phe Leu Ser Val Gly Ser Gly Arg Ile Ala Tyr Thr Leu 1655 1660 1665Gly Leu Arg Gly Pro Ala Met Thr Val Asp Thr Ala Cys Ser Ser 1670 1675 1680Ser Leu Val Ala Val His Leu Ala Cys Met Ser Leu Arg Ala Gly 1685 1690 1695Glu Cys Asp Met Ala Leu Ala Gly Gly Ala Thr Val Met Ala Thr 1700 1705 1710Pro Met Ile Phe Val Glu Phe Ser Arg Gln Arg Gly Thr Ala Leu 1715 1720 1725Asp Gly Arg Cys Lys Ala Phe Gly Ala Gly Ala Asp Gly Ala Gly 1730 1735 1740Trp Ser Glu Gly Cys Gly Ile Leu Ala Leu Lys Arg Leu Ser Asp 1745 1750 1755Ala Gln Arg Asp Gly Asp Arg Val Leu Ala Val Ile Arg Gly Ser 1760 1765 1770Ala Val Asn Gln Asp Gly Arg Ser Gln Gly Leu Thr Ala Pro Asn 1775 1780 1785Gly Pro Ala Gln Gln Asp Val Ile Arg Gln Ala Leu Ala Ala Ala 1790 1795 1800Gly Leu Thr Pro Ala Asp Val Asp Ala Val Glu Ala His Gly Thr 1805 1810 1815Gly Thr Arg Leu Gly Asp Pro Ile Glu Ala Gln Ala Leu Leu Ala 1820 1825 1830Thr Tyr Gly Ala Ala His Thr Ala Glu Arg Pro Leu Trp Leu Gly 1835 1840 1845Ser Leu Lys Ser Asn Leu Gly His Thr Gln Val Ala Ala Gly Val 1850 1855 1860Ser Gly Leu Met Lys Leu Val Leu Ala Leu Gln His Ala Glu Leu 1865 1870 1875Pro Arg Thr Leu His Ala Asp Pro Pro Ser Pro His Val Asp Trp 1880 1885 1890Ser Gln Gly His Val Lys Leu Leu Asn Glu Pro Val Pro Trp Pro 1895 1900 1905Arg Thr Asp Arg Pro Arg Arg Ala Ala Val Ser Ser Phe Gly Ile 1910 1915 1920Ser Gly Thr Asn Ala His Val Ile Val Glu Glu Ala Pro Ala Glu 1925 1930 1935Ala Pro Ala Thr Ala Ala Asp Ala Lys Ser Val Glu Ala Leu Pro 1940 1945 1950Ile Leu Pro Leu Leu Val Ser Gly Ser Asp Glu Pro Ala Leu Arg 1955 1960 1965Ala Gln Val Arg Arg Leu Val Glu His Leu Arg Ser His Pro Asp 1970 1975 1980Glu Arg Leu Leu Asp Val Ala Ala Ser Leu Ala Thr Thr Arg Ala 1985 1990 1995His Leu Ala Met Arg Leu Ala Leu Pro Val Ser Ala Gly Ala Pro 2000 2005 2010Arg Asp Ala Trp Val Asp Glu Leu Glu Ala Phe Ala Arg Gly Gly 2015 2020 2025Ala Ala Pro Thr Gln Ala Ser Gln Thr Pro Ala Glu Ser Ser Ala 2030 2035 2040Gly Lys Val Ala Val Leu Phe Thr Gly Gln Gly Ser Gln Arg Ala 2045 2050 2055Ala Met Gly Arg Ala Leu Tyr Ala Thr His Pro Val Phe Arg Ala 2060 2065 2070Ala Leu Asp Ala Ala Cys Ala Glu Leu Asp Arg His Leu Asp Arg 2075 2080 2085Pro Leu His Ser Val Leu Phe Ala Asp Ala Gly Thr Glu Ala Ala 2090 2095 2100Ala Leu Leu Asp Gln Thr Gly Trp Ala Gln Pro Ala Leu Phe Ala 2105 2110 2115Leu Glu Val Ala Leu Tyr Arg Gln Trp Glu Ala Trp Gly Leu Arg 2120 2125 2130Pro Glu Leu Leu Leu Gly His Ser Ile Gly Glu Leu Ala Ala Ala 2135 2140 2145His Val Ala Gly Val Leu Asp Leu Pro Asp Ala Ser Ala Leu Val 2150 2155 2160Ala Ala Arg Gly Arg Leu Met Gln Ala Leu Pro His Gly Gly Ala 2165 2170 2175Met Ala Ser Ile Glu Ala Thr Glu His Glu Leu Leu Pro Leu Leu 2180 2185 2190Asp Gln His Thr Gly Arg Leu Ser Leu Ala Ala Leu Asn Ala Pro 2195 2200 2205Arg Gln Ser Val Val Ser Gly Asp Leu His Ala Val Asp Gln Val 2210 2215 2220Cys Ala His Phe Ile Ala Leu Gly Arg Arg Ala Lys Arg Leu Asp 2225 2230 2235Val Ser His Ala Phe His Ser Ala His Met Gln Pro Met Leu Asp 2240 2245 2250Ala Phe Ala Ser Val Ala Arg Gly Leu Thr Phe His Pro Pro Arg 2255 2260 2265Leu Pro Ile Val Ser Ser Val Thr Gly Ala Arg Ala Thr Thr Asp 2270 2275 2280Gln Leu Thr Ser Pro Asp Tyr Trp Val Gln Gln Val Arg Glu Pro 2285 2290 2295Val Arg Phe Leu Asp Ala Met Arg Ser Leu His Ala Ala Gly Ala 2300 2305 2310Ala Thr Phe Val Glu Cys Gly Pro His Gly Val Leu Thr Ala Ala 2315 2320 2325Gly Ala Glu Cys Leu Ala Pro Glu Gly Ala Arg Asp Ala Gly Phe 2330 2335 2340Val Thr Ser Leu Arg Lys Asp Arg Asp Glu Ala Leu Ala Leu Val 2345 2350 2355His Ala Ala Cys Ala Val His Val Arg Gly His Ala Leu Asp Trp 2360 2365 2370Leu Arg Phe Phe Asp Ala Thr Gly Ala Arg Arg Val Glu Leu Pro 2375 2380 2385Thr Tyr Ala Phe Gln Arg Gln Arg Tyr Trp Leu Glu Ala Pro Arg 2390 2395 2400Pro Arg Pro Ser Leu Glu Gly Val Gly Leu Thr Ala Ala Asn His 2405 2410 2415Pro Trp Leu Gly Ala Ala Val Arg Leu Ala Asp Arg Asp Gly Tyr 2420 2425 2430Val Leu Ser Gly Arg Leu Ser Thr Ile Asp His Pro Trp Val Leu 2435 2440 2445Asp His Val Val Leu Gly Thr Ala Leu Leu Pro Gly Thr Gly Phe 2450 2455 2460Val Glu Leu Ala Trp Ala Ala Ala Glu Ala Val Gly Leu Pro Gly 2465 2470 2475Val Ser Glu Leu Ala Ile Glu Ala Pro Leu Ala Leu Pro Ala Arg 2480 2485 2490Gly Ala Val Ala Leu Gln Ile Ala Ile Glu Ala Pro Asp Pro Ala 2495 2500 2505Gly Arg Arg Gly Val Ala Ile Tyr Ser Arg Pro Asp Gly Ala Ala 2510 2515 2520Asp Ala Pro Trp Thr Ala His Ala Arg Gly Val Leu Gly Ala Ala 2525 2530 2535Ala Pro Asp Arg Asp Ala Ala Trp Ala Gln Gly Ala Trp Pro Pro 2540 2545 2550Pro Gly Ala Val Pro Val Asp Val Thr Gln Arg Ile Glu Ile Val 2555 2560 2565Asp Ala Trp Val Gly Pro Ala Phe Arg Gly Val Thr Ala Leu Trp 2570 2575 2580Arg Val Gly Arg Thr Ile Tyr Ala Asp Val Ala Leu Pro Asp Gly 2585 2590 2595Val Ala Ser Thr Ala Gln Asp Phe Gly Leu His Pro Ala Leu Leu 2600 2605 2610Asp Val Ala Leu Arg Ala Phe Leu Arg Ala Glu Leu Gly Ala Asp 2615 2620 2625Pro Ser Pro Arg Glu Gly Thr Val Val Pro Phe Ala Trp Ser Asp 2630 2635 2640Val Val Leu Glu Ala Arg Gly Thr Ala Ala Leu Arg Val Arg Val 2645 2650 2655Glu Val Ala Ala Asp Gly Asp Gly Asp Ala Ile Thr Ala Ser Ile 2660 2665 2670Gln Leu Ala Asp Gly Gln Gly Arg Pro Val Ala Arg Val Gly Ala 2675 2680 2685Leu Gln Met Arg Trp Thr Thr Ala Glu Arg Val Arg Ala Ala Ala 2690 2695

2700Gly Ala Ala Glu Arg Asp Leu Tyr Arg Val Ala Trp Thr Asp Val 2705 2710 2715Ala Leu Asp Asp Ala Ala Phe Ala Pro Glu Glu His Val Val Val 2720 2725 2730Gly Gly Asp Gly Ala Leu Ala Ala Ala Leu Gly Ala Arg Val Val 2735 2740 2745Ala Gly Leu Pro Glu Leu Leu Ala Ser Leu Pro Asp Gly Ala Ala 2750 2755 2760Ala Pro Arg Arg Leu Val Val Asp Leu Thr Ala Asp Ala Ala Gly 2765 2770 2775Ala Val Val Asp Ala Val His Ala Ala Ala Arg Asp Ala Leu Ser 2780 2785 2790Leu Val Gln Gly Trp Leu Ala Ala Pro Gln Leu Ala Ala Thr Glu 2795 2800 2805Leu Val Val Val Thr Arg Gly Ala Val Ala Val Ala Pro Asp Glu 2810 2815 2820Gly Val Ala Ala Leu Gly Pro Ala Ala Val Trp Gly Leu Leu Arg 2825 2830 2835Ala Thr Arg Val Glu His Ala Asp Arg Thr Val Arg Val Leu Asp 2840 2845 2850Leu Gly Ser Ala Ala Pro Asp Met Thr Leu Leu Arg Arg Ala Leu 2855 2860 2865Thr Ala Ala Glu Glu Pro Glu Leu Ala Leu Arg Ala Gly Gly Ala 2870 2875 2880Arg Ala Pro Arg Leu Asp Ala Ala Ser Glu Thr Glu Gly Glu Leu 2885 2890 2895Ala Pro Pro Gly Gly Ala Arg Ser Leu Arg Leu Ser Ile Arg Thr 2900 2905 2910Lys Gly Ser Phe Asp Ala Leu His Leu Ala Asp Ala Pro Asp Ala 2915 2920 2925Leu Arg Pro Leu Gly Pro Gly Gln Val Arg Leu Ala Val Arg Ala 2930 2935 2940Thr Gly Leu Asn Phe Arg Asp Val Leu Asn Val Leu Gly Thr Tyr 2945 2950 2955Arg Gly Glu Ala Gly Pro Leu Gly Leu Glu Gly Ala Gly Val Val 2960 2965 2970Leu Asp Val Gly Glu Gly Val Thr Ala Leu Arg Pro Gly Asp Arg 2975 2980 2985Val Met Gly Met Leu His Ala Gly Met Ala Thr His Ala Val Val 2990 2995 3000Asp Ala Arg Leu Leu Thr His Ile Pro Arg Gly Leu Ser Phe Val 3005 3010 3015Glu Ala Ala Thr Ile Pro Ala Ala Phe Leu Thr Ala Leu Tyr Gly 3020 3025 3030Leu Arg Asp Leu Gly Ala Leu Lys Ala Gly Gln Arg Val Leu Val 3035 3040 3045His Ala Ala Ala Gly Gly Val Gly Met Ala Ala Val Gln Leu Ala 3050 3055 3060Arg Leu Trp Gly Ala Glu Val Phe Ala Thr Ala Ser Glu Gly Lys 3065 3070 3075Trp Pro Ala Leu Arg Arg Met Gly Ile Asp Gln Ala His Ile Ala 3080 3085 3090Ser Ser Arg Thr Leu His Phe Arg Lys Ala Phe Leu Asp Ala Thr 3095 3100 3105Gln Gly Gln Gly Val Asp Val Val Leu Asp Ala Leu Ala Gly Glu 3110 3115 3120Phe Val Asp Ala Ser Leu Asp Leu Leu Pro Arg Gly Gly Ala Phe 3125 3130 3135Val Glu Met Gly Lys Ser Asp Val Arg Asp Pro Glu Arg Val Ala 3140 3145 3150Lys Asp His Pro Arg Val Arg Tyr Thr Ala Phe Asp Leu Leu Asp 3155 3160 3165Ala Gly Pro Asp His Ile Gln Ala Met Leu Arg Glu Leu Val Pro 3170 3175 3180Leu Phe Glu Glu Gly Val Leu Ala Pro Leu Pro Ser Val Ala Tyr 3185 3190 3195Asp Leu Arg Arg Ala Pro His Ala Phe Arg Ser Met Ala Asn Ala 3200 3205 3210Arg His Ile Gly Lys Leu Val Leu Val Pro Pro Ala Thr Leu Asp 3215 3220 3225Pro Asp Gly Thr Ala Leu Ile Thr Gly Gly Thr Gly Glu Leu Gly 3230 3235 3240Arg Gln Ile Ala Arg His Leu Val Ala Ala His Gly Val Arg His 3245 3250 3255Leu Val Leu Thr Ser Arg Arg Gly Met Asp Ala Pro Asp Ala Ala 3260 3265 3270Ala Leu Val Glu Ser Leu Arg Ala Ala Gly Ala Ala Thr Val Glu 3275 3280 3285Val Ala Ala Cys Asp Val Thr Asp Arg Asp Ala Leu Ala Ala Ile 3290 3295 3300Val Gln Ala Ile Pro Ala Ala Arg Pro Leu Thr Ala Val Val His 3305 3310 3315Thr Ala Ala Val Leu Asp Asp Gly Thr Val Ala Gly Leu Ser Ala 3320 3325 3330Glu Gln Leu Ala Arg Val Leu Arg Pro Lys Val Asp Gly Ala Trp 3335 3340 3345Gln Leu Tyr Glu Ala Thr Arg Asp Ala Pro Leu Ala Ala Phe Met 3350 3355 3360Leu Phe Ser Ser Val Ala Gly Thr Leu Gly Ser Ser Gly Gln Ala 3365 3370 3375Asn Tyr Ala Ala Ala Asn Ala Phe Leu Asp Gly Leu Ala Ala Glu 3380 3385 3390Leu Arg Ala Arg Gly Val Pro Ala Met Ser Leu Ala Trp Gly Phe 3395 3400 3405Trp Glu Gln Gly Gly Ile Gly Met Thr Ala His Leu Gly Ala Ala 3410 3415 3420Asp Leu Ala Arg Leu Lys Arg Gln Gly Ile Val Pro Met Thr Val 3425 3430 3435Ala His Gly Leu Arg Leu Leu Asp Arg Ala Leu Glu Arg Pro Asp 3440 3445 3450Ala Ala Leu Val Pro Ala Ser Leu Asp Met Ala Val Ile Gln Arg 3455 3460 3465Thr Ala Ser Asp His Arg Gln Val Pro Pro Met Leu Arg Gly Leu 3470 3475 3480Val Arg Val Ala Pro Arg Gln Ala Ala Gly Ala Ala Ser Gly Arg 3485 3490 3495Ser His Glu Ala Ser Thr Leu Arg Gln Gln Leu Ala Ala Leu Pro 3500 3505 3510Glu Pro Glu Arg Gln Arg Ala Leu Leu Asp Leu Val Arg Thr Glu 3515 3520 3525Ala Ala Ala Val Leu Val Leu Arg Gly Pro Asp Ala Val Pro Ala 3530 3535 3540Asp Lys Pro Leu Arg Glu Leu Gly Leu Asp Ser Leu Thr Ala Val 3545 3550 3555Glu Leu Arg Asn Arg Leu Arg Thr Arg Ala Gln Thr Asp Leu Pro 3560 3565 3570Ser Thr Leu Ala Phe Asp Tyr Pro Thr Pro Lys Ala Val Ala Val 3575 3580 3585Tyr Leu Ala Gln Glu Leu Asp Leu His Asp Val Met Thr Glu Met 3590 3595 3600Arg Gly Pro Ser Leu Arg Ser Asp Asp Glu Leu Lys Ser Ala Ile 3605 3610 3615Ala Ser Ile Arg Ile Ser Thr Leu Arg Gln Ala Gly Leu Leu Asp 3620 3625 3630Ser Leu Leu Arg Leu Ala Ala Ser Glu Ala Val Ser Thr Ser Ser 3635 3640 3645Asp Thr Thr Pro Glu Thr Asp Glu Leu Thr Leu Gln His Val Gly 3650 3655 3660Asp Asp Glu Leu Ala Arg Leu Val Phe Asp Leu Ala Gly Gly Ala 3665 3670 3675Gln253654PRTSorangium cellulosum 25Met Lys Glu Glu Ile Ser Ala Arg Gln Ala Leu Glu Lys Ser Phe Ile1 5 10 15Glu Leu Arg Arg Ile Lys Arg Glu Leu Asp Gln Leu Lys Ala Lys Ser 20 25 30Ser Glu Pro Ile Ala Ile Val Ser Met Ala Cys Arg Leu Pro Gly Gly 35 40 45Val Asp Thr Pro Ala Ala Leu Trp Gln Leu Leu Ser Glu Gly Arg Asp 50 55 60Ala Ile Gly Pro Phe Pro Glu Gly Arg Glu Trp Asp Val Ala Gly Leu65 70 75 80Tyr Asp Pro Asp Pro Asp Ala Pro Gly Lys Ser Ile Thr Ala Gln Gly 85 90 95Gly Phe Leu Tyr Asp Ala Asp Arg Phe Asp Pro Ala Phe Phe Ala Ile 100 105 110Ser Pro Arg Glu Ala Glu Arg Met Asp Pro Gln Gln Arg Leu Leu Leu 115 120 125Glu Cys Ala Trp Glu Ala Leu Glu Arg Ala Gly Leu Ala Pro His Ala 130 135 140Leu Glu Ala Ser Ala Thr Gly Val Phe Val Gly Leu Ser Val Thr Asp145 150 155 160Tyr Gly Gly Arg Leu Leu His Asp Pro Glu Ala Leu Asp Gly Tyr Ile 165 170 175Ala Thr Gly Thr Leu Pro Ser Val Gly Ser Gly Arg Ile Ala Tyr Thr 180 185 190Leu Gly Leu Arg Gly Pro Ala Met Thr Val Asp Thr Ala Cys Ser Ser 195 200 205Ser Leu Val Ser Leu His Leu Ala Cys Met Ser Leu Arg Ala Gly Glu 210 215 220Cys Asp Met Ala Leu Ala Gly Gly Ala Thr Val Met Ala Thr Pro Met225 230 235 240Ala Phe Ile Glu Phe Ser Arg Gln Arg Gly Thr Ala Leu Asp Gly Arg 245 250 255Cys Lys Ala Phe Gly Ala Gly Ala Asp Gly Ala Gly Trp Ser Glu Gly 260 265 270Cys Gly Ile Leu Ala Leu Lys Arg Leu Ser Asp Ala Gln Arg Asp Gly 275 280 285Asp Arg Val Leu Ala Val Ile Arg Gly Ser Ala Val Asn Gln Asp Gly 290 295 300Arg Ser Gln Gly Leu Thr Ala Pro Asn Gly Pro Ala Gln Gln Asp Val305 310 315 320Ile Arg Gln Ala Leu Ala Ala Ala Gly Leu Thr Pro Ala Asp Val Asp 325 330 335Ala Val Glu Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro Ile Glu 340 345 350Ala Gln Ala Leu Leu Ala Thr Tyr Gly Ala Ala His Thr Ala Glu Arg 355 360 365Pro Leu Trp Leu Gly Ser Leu Lys Ser Asn Leu Gly His Thr Gln Ala 370 375 380Ala Ala Gly Val Ser Gly Leu Met Lys Leu Val Leu Ala Leu Gln His385 390 395 400Ala Glu Leu Pro Arg Thr Leu His Ala Asp Pro Pro Ser Pro His Val 405 410 415Asp Trp Ser Arg Gly His Val Lys Leu Leu Asn Glu Pro Val Pro Trp 420 425 430Pro Arg Thr Asp Arg Pro Arg Arg Ala Ala Val Ser Ser Phe Gly Phe 435 440 445Ser Gly Thr Asn Ala His Ile Ile Ile Glu Glu Ala Pro Ala Ala Ser 450 455 460Ala Glu Ala Thr Ser Arg Gly Glu Lys Thr Ser Ala Ala Ala Pro Pro465 470 475 480Ser Met Met Pro Leu Leu Val Ser Gly Val Asp Glu Ala Ala Leu Arg 485 490 495Ala Gln Ala Gly Arg Trp Ala Ala Trp Ile Glu Ala His Pro Glu Ala 500 505 510Gly Trp Ala Asp Val Val Tyr Thr Ala Ala Ala Arg Arg Thr His Leu 515 520 525Gly Ala Arg Ala Ala Leu Thr Ala Ala Asp Ala Ala Gly Ala Val Ala 530 535 540Ala Leu Thr Ala Leu Ser Gln Gly Gln Pro His Ala Ala Leu Ala Val545 550 555 560Gly Glu Ala Arg Ala Arg Gly Lys Val Ala Phe Val Phe Pro Gly Gln 565 570 575Gly Ser Gln Trp Pro Ala Met Gly Arg Ala Leu Leu Ser Gln Ser Glu 580 585 590Val Phe Ala Ala Ala Val Thr Ala Cys Asp Ala Ala Leu Arg Pro Phe 595 600 605Thr Gly Trp Ser Val Leu Ser Val Leu Arg Gly Asp Ser Gly Ala Glu 610 615 620Val Pro Pro Leu Glu Arg Val Asp Val Val Gln Pro Ala Leu Phe Ala625 630 635 640Met Ala Val Gly Leu Ala Ala Val Trp Arg Ala Trp Gly Leu Glu Pro 645 650 655Ser Ala Val Val Gly His Ser Gln Gly Glu Val Pro Ala Ala Tyr Val 660 665 670Ala Gly Ala Leu Ser Leu Glu Asp Ala Ala Arg Ile Val Ala Leu Arg 675 680 685Ser Gln Leu Val Arg Arg Leu Ser Gly Ala Gly Ala Met Ala Val Ile 690 695 700Glu Arg Pro Val Gly Glu Val Glu Gln Arg Leu Ser Arg Phe Gly Gly705 710 715 720Ala Leu Ser Val Ala Ala Val Asn Thr Pro Arg Ser Thr Val Val Ser 725 730 735Gly Asp Ile Glu Ala Val Asp Arg Leu Leu Ala Glu Phe Glu Gly Glu 740 745 750Gln Val Phe Ala Arg Lys Val Asn Val Asp Tyr Ala Ser His Ser Arg 755 760 765His Ile Asp Gly Leu Leu Pro Glu Leu Glu Asn Gly Leu Gly Ala Val 770 775 780Arg Pro Arg Ala Ser Thr Ile Pro Phe Tyr Ser Thr Val Thr Gly Thr785 790 795 800Val Leu Thr Gly Ala Glu Leu Asp Ala Ala Tyr Trp Cys Arg Asn Leu 805 810 815Arg Glu Pro Val Arg Leu Asp Arg Ala Leu Ser Trp Leu Leu Asp Asp 820 825 830Gly His Gly Leu Phe Val Glu Val Ser Ala His Pro Val Leu Thr Leu 835 840 845Pro Leu Thr Gly Ala Ser Ala Ala Ser Gly Gly Val Val Val Gly Ser 850 855 860Leu Gln Arg Asp Asp Gly Gly Leu Gly Arg Leu Leu Gly Val Leu Ala865 870 875 880Ala Leu His Val His Gly His Asp Val Asp Trp Arg Ala Val Leu Ala 885 890 895Pro Trp Gly Gly Gly Val Ala Asp Leu Pro Thr Tyr Ala Phe Gln Arg 900 905 910Gln Arg Tyr Trp Leu Glu Ala Pro Arg Gly Arg Ala Gly Leu Glu Ser 915 920 925Gly Gly Leu Leu Ala Val Asn His Pro Trp Leu Ser Ala Ala Val Arg 930 935 940Leu Ala Asp Arg Asp Gly Tyr Val Leu Ser Gly Arg Leu Ser Thr Val945 950 955 960Glu His Ala Trp Val Leu Asp His Val Val Leu Gly Thr Val Ile Leu 965 970 975Pro Gly Thr Ala Phe Val Glu Leu Ala Leu Ala Ala Ala Asp Ala Val 980 985 990Gly Leu Pro Ser Val Ser Glu Leu Thr Ile Glu Ala Pro Leu Ala Leu 995 1000 1005Pro Ala Arg Gly Ala Val Ala Leu Gln Val Thr Val Glu Ala Pro 1010 1015 1020Asp Ala Thr Gly Arg Arg Gly Phe Ala Val Tyr Ser Arg Pro Asp 1025 1030 1035Gly Ala His Asp Ala Pro Trp Thr Ala His Ala Arg Gly Val Leu 1040 1045 1050Gly Ala Ala Pro Ala Ala Ala Thr Thr Ala Trp Ala Ala Gly Ala 1055 1060 1065Trp Pro Pro Ala Gly Ala Glu Pro Val Asp Val Thr Arg Trp Val 1070 1075 1080Glu Ala Leu Asp Ala Trp Val Gly Pro Ala Phe Arg Gly Val Thr 1085 1090 1095Ala Ala Trp Arg Val Gly Arg Ser Ile Tyr Ala Asp Leu Ala Leu 1100 1105 1110Pro Glu Gly Val Ser Glu Arg Ala Gln Asp Phe Gly Leu His Pro 1115 1120 1125Ala Leu Leu Asp Ala Ala Leu Gln Ala Leu Leu Arg Ala Glu Leu 1130 1135 1140Gly Ala Gly Ala Ser Pro Arg Glu Gly Ile Pro Met Pro Phe Ala 1145 1150 1155Trp Ser Asp Val Ala Leu Glu Ala Arg Gly Ala Ala Ala Leu Arg 1160 1165 1170Ala Arg Val Glu Val Glu Asp Ala Ser Asp Gly Asp Gln Leu Ala 1175 1180 1185Ala Ser Ile Glu Leu Ala Asp Ala Gln Gly Gln Pro Val Ala Arg 1190 1195 1200Ala Gly Thr Phe Arg Ala Arg Trp Ala Thr Ala Glu His Val Arg 1205 1210 1215Met Ala Ala Ala Gly Ser Ser Glu Arg Asp Leu Tyr Arg Val Thr 1220 1225 1230Trp Ala Asp Val Val Leu Glu Glu Ala Ala Trp Ala Pro Glu Glu 1235 1240 1245His Val Val Leu Gly Gly Asp Gly Ala Leu Ala Ala Ala Leu Gly 1250 1255 1260Ala Arg Thr Ala Ala Leu Pro Glu Leu Ile Ala Ala Leu Pro Glu 1265 1270 1275Gly Ala Ala Ala Pro Arg Arg Leu Val Ile Asp Ala Ala Ala Gly 1280 1285 1290Asp Pro Gly Asp Gly Leu Val Ala Ala Ala His Ala Ala Ala Gln 1295 1300 1305Arg Val Leu Ser Leu Val Gln Gly Trp Leu Ser Glu Ala Arg Leu 1310 1315 1320Ala Asp Ser Glu Leu Val Val Val Thr Arg Gly Ala Val Ala Ala 1325 1330 1335Gly Pro Asp Asp Gly Val Ala Ala Leu Ser His Ala Pro Leu Trp 1340 1345 1350Gly Leu Val Arg Thr Ala Arg Gln Glu Asn Pro Gly Arg Ala Val 1355 1360 1365Arg Leu Val Asp Leu Gly Pro Glu Pro Leu Asp Gly Ala Leu Leu 1370 1375 1380Arg Arg Val Val Ala Ala Ala Glu Glu Pro Glu Leu Ala Leu Arg 1385 1390 1395Gly Gly Ala Ala Arg Ala Pro Arg Leu Arg Glu Val Arg Ala Gly 1400 1405 1410Ala Ala Asp Ala Ala Arg Pro Thr Arg Leu Asp Pro Gly Gly Thr 1415 1420 1425Val Leu Ile Thr Gly Gly Thr Gly Glu Leu Gly Arg Gln Val Ala 1430 1435 1440Arg His Leu Val Ala Ser His Gly Val Arg His Leu Val Leu Thr 1445 1450 1455Ser Arg Arg Gly Met Gly Ala Pro Asp Ala Ala Ala Leu Val Asp 1460

1465 1470Glu Leu Arg Ala Ala Gly Ala Ala Thr Val Asp Val Ala Ala Cys 1475 1480 1485Asp Val Ala Asp Gly Ala Ala Leu Gly Ala Val Ile Ala Ala Ile 1490 1495 1500Pro Ala Ala His Pro Leu Thr Ala Val Val His Met Ala Gly Val 1505 1510 1515Leu Asp Asp Val Ile Val Thr Lys Leu Ser Ala Glu Gln Leu Thr 1520 1525 1530Arg Val Leu Arg Pro Lys Ile Asp Gly Gly Trp His Leu Ala Ala 1535 1540 1545Ala Thr Arg Gly His Arg Leu Ala Ala Phe Val Leu Phe Ser Ser 1550 1555 1560Ala Ala Gly Thr Leu Gly Ser Pro Gly Gln Ala Asn Tyr Ala Ala 1565 1570 1575Ala Asn Thr Phe Leu Asp Ala Leu Ala Ala Gln Leu Arg Ala Arg 1580 1585 1590Gly Val Pro Ala Met Ser Leu Ala Trp Gly Phe Trp Glu Gln Ala 1595 1600 1605Gly Leu Gly Met Thr Ala His Leu Gly Ala Ala Asp Leu Ala Arg 1610 1615 1620Leu Arg Arg Gln Gly Ile Ala Pro Ile Ala Leu Ala Gln Gly Met 1625 1630 1635Gln Leu Leu Asp Arg Ala Leu Ala Arg Pro Glu Ala Ala Leu Val 1640 1645 1650Pro Ala Ala Leu Asp Leu Pro Ala Leu Gln Arg Ala Ala Ser Asp 1655 1660 1665Ala Gly Gln Val Pro Ala Leu Leu Arg Gly Leu Val Arg Pro Ala 1670 1675 1680Val Gly Arg Arg Ala Ala Ala Pro Ala Ala Ala Ala Thr Gly Ala 1685 1690 1695Ala Ala Leu Arg Ala Arg Leu Ala Pro Leu Pro Glu Ala Glu Arg 1700 1705 1710His Asp Val Val Leu Asp Leu Val Arg Ala Glu Ala Ala Ala Val 1715 1720 1725Leu Gln Leu Ala Gly Pro Ala Gln Val Pro Ala Asp Lys Pro Leu 1730 1735 1740Lys Glu Leu Gly Leu Thr Ser Leu Thr Ala Val Glu Leu Arg Asn 1745 1750 1755Arg Leu Gly Ala Arg Ala Glu Thr Ala Leu Pro Ala Thr Leu Ala 1760 1765 1770Phe Asp His Pro Thr Pro Arg Ala Ile Ala Gly Leu Leu Leu Gln 1775 1780 1785Arg Ala Phe Ser Glu Leu Ala Ala Ala Val Ala Thr Arg Ala Gln 1790 1795 1800Ala Pro Arg Ala Gln Gly Ala His Asp Glu Pro Ile Ala Ile Val 1805 1810 1815Ser Met Ala Cys Arg Leu Pro Gly Gly Val Asp Thr Pro Ala Arg 1820 1825 1830Met Trp Gln Leu Leu Ala Glu Gly Arg Asp Ala Ile Gly Pro Phe 1835 1840 1845Pro Glu Gly Arg Gly Trp Asp Val Ala Gly Leu Tyr Asp Pro Asp 1850 1855 1860Pro Asp Ala Pro Gly Lys Ser Val Thr Asn Leu Gly Gly Phe Leu 1865 1870 1875Tyr Asp Ala Asp His Phe Asp Pro Thr Phe Phe Gly Ile Ser Pro 1880 1885 1890Arg Glu Ala Glu Arg Ile Asp Pro Gln Gln Arg Leu Leu Leu Glu 1895 1900 1905Cys Ala Trp Glu Ala Leu Glu Arg Ala Gly Leu Ala Pro His Thr 1910 1915 1920Leu Glu Ala Ser Ala Thr Gly Val Phe Val Gly Leu Val Tyr Ser 1925 1930 1935Asp Tyr Gly Gly Arg Leu Leu Glu His Leu Glu Ser Phe Asp Gly 1940 1945 1950Tyr Ile Ala Thr Gly Ser Phe Pro Ser Val Gly Ser Gly Arg Ile 1955 1960 1965Ala Tyr Thr Leu Gly Leu Arg Gly Pro Ala Met Thr Val Asp Thr 1970 1975 1980Ala Cys Ser Ser Ser Leu Val Ser Leu His Leu Ala Cys Met Ser 1985 1990 1995Leu Arg Ala Gly Glu Cys Asp Met Ala Leu Ala Gly Gly Ala Thr 2000 2005 2010Val Met Ala Thr Pro Met Ala Phe Ile Glu Phe Ser Arg Gln Arg 2015 2020 2025Gly Met Ala Pro Asp Ala Arg Cys Lys Ala Phe Gly Ala Glu Ala 2030 2035 2040Asn Gly Ile Gly Pro Ala Glu Gly Cys Gly Ile Leu Val Leu Lys 2045 2050 2055Arg Leu Ser Asp Ala Arg Arg Asp Gly Asp Arg Val Leu Ala Val 2060 2065 2070Ile Arg Gly Ser Ala Val Asn Gln Asp Gly Arg Ser Gln Gly Leu 2075 2080 2085Thr Ala Pro Asn Gly Pro Ala Gln Gln Asp Val Ile Arg Gln Ala 2090 2095 2100Leu Ala Ala Ala Gly Leu Thr Pro Ala Asp Val Asp Ala Val Glu 2105 2110 2115Ala His Gly Thr Gly Thr Arg Leu Gly Asp Pro Ile Glu Ala Gln 2120 2125 2130Ala Leu Leu Ala Thr Tyr Gly Thr Ala His Thr Ala Glu Arg Pro 2135 2140 2145Leu Trp Leu Gly Ser Ile Lys Ser Asn Leu Gly His Thr Gln Ala 2150 2155 2160Ala Ala Gly Val Val Gly Leu Met Lys Leu Val Leu Ala Met Gln 2165 2170 2175His Ala Glu Leu Pro Arg Thr Leu Tyr Ala Glu Pro Arg Ser Pro 2180 2185 2190His Ile Asp Trp Ser Gln Gly His Ile Asn Leu Leu Asn Glu Pro 2195 2200 2205Val Pro Trp Pro Arg Thr Asp Arg Pro Arg Arg Ala Ala Val Ser 2210 2215 2220Ser Phe Gly Ile Ser Gly Thr Asn Ala His Val Ile Ile Glu Glu 2225 2230 2235Ala Pro Ala Glu Ala Pro Ala Thr Ala Ala Asp Ala Lys Ser Val 2240 2245 2250Glu Ala Leu Pro Ile Leu Pro Leu Leu Leu Ser Gly Arg Asp Glu 2255 2260 2265Pro Ala Leu Arg Ala Gln Ala Gly Arg Leu Ala Glu His Leu Arg 2270 2275 2280Ala His Pro Gly Glu Arg Leu Leu Asp Ile Ala Ala Gly Leu Ala 2285 2290 2295Thr Thr Arg Thr His Leu Ala Thr Arg Leu Ala Leu Pro Val Ala 2300 2305 2310Ala Asp Ala Ala Ala Glu Glu Leu Gly Ala Arg Leu Ala Gln Phe 2315 2320 2325Ala Ala Gly Gly Pro Ala Pro Ser Gly Ala Ala Val Thr Ala Pro 2330 2335 2340Gly Gln Pro Pro Gly Lys Val Ala Val Leu Phe Thr Gly Gln Gly 2345 2350 2355Ser Gln Arg Ala Gly Met Gly Arg Ala Leu Tyr Ala Thr His Pro 2360 2365 2370Val Phe Arg Ala Ala Leu Asp Ala Ala Cys Ala Glu Leu Asp Arg 2375 2380 2385His Leu Asp Arg Pro Leu His Ser Val Leu Phe Ala Asp Ala Gly 2390 2395 2400Thr Glu Ala Ala Ala Leu Leu Asp Gln Thr Gly Trp Ala Gln Pro 2405 2410 2415Ala Leu Phe Ala Leu Glu Val Ala Leu Tyr Arg Gln Trp Glu Ala 2420 2425 2430Trp Gly Leu Arg Pro Glu Leu Leu Leu Gly His Ser Ile Gly Glu 2435 2440 2445Leu Ala Ala Ala His Val Ala Gly Val Leu Asp Leu Pro Asp Ala 2450 2455 2460Ser Ala Leu Val Ala Ala Arg Gly Arg Leu Met Gln Ala Leu Pro 2465 2470 2475His Gly Gly Ala Met Ala Ser Ile Glu Ala Thr Glu His Glu Leu 2480 2485 2490Leu Pro Leu Leu Asp Gln His Thr Gly Arg Leu Ser Leu Ala Ala 2495 2500 2505Leu Asn Ala Pro Arg Gln Ser Val Val Ser Gly Asp Gln Pro Ala 2510 2515 2520Val Asp His Val Cys Ala His Phe Ile Ala Leu Gly Arg Arg Ala 2525 2530 2535Lys Arg Leu Asp Val Ser His Ala Phe His Ser Ala His Met Gln 2540 2545 2550Pro Met Leu Asp Ala Phe Ala Ser Val Ala Arg Gly Leu Thr Phe 2555 2560 2565His Pro Pro Arg Leu Pro Ile Val Ser Ser Val Thr Gly Ala Arg 2570 2575 2580Ala Thr Thr Asp Gln Leu Thr Ser Pro Asp Tyr Trp Val Gln Gln 2585 2590 2595Val Arg Glu Pro Val Arg Phe Leu Asp Ala Met Arg Ser Leu His 2600 2605 2610Ala Ala Gly Ala Ala Thr Phe Val Glu Cys Gly Pro His Gly Val 2615 2620 2625Leu Thr Ala Ala Gly Ala Glu Cys Leu Ala Pro Glu Gly Ala Arg 2630 2635 2640Asp Ala Gly Phe Val Thr Ser Leu Arg Lys Asp Arg Asp Glu Ala 2645 2650 2655Leu Ala Leu Val His Ala Ala Cys Ala Val His Val Arg Gly His 2660 2665 2670Ala Leu Asp Trp Leu Arg Phe Phe Asp Ala Thr Gly Ala Arg Arg 2675 2680 2685Val Glu Leu Pro Thr Tyr Ala Phe Gln Arg Gln Arg Tyr Trp Leu 2690 2695 2700Glu Ala Pro Arg Pro Arg Pro Ser Leu Glu Gly Val Gly Leu Thr 2705 2710 2715Ala Ala Asn His Pro Trp Leu Gly Ala Ala Val Arg Leu Ala Asp 2720 2725 2730Arg Asp Gly Tyr Val Leu Ser Gly Arg Leu Ser Thr Ile Asp His 2735 2740 2745Pro Trp Val Leu Asp His Val Val Ala Gly Thr Val Ile Leu Pro 2750 2755 2760Gly Thr Ala Phe Val Glu Leu Ala Trp Ala Ala Ala Glu Val Val 2765 2770 2775Gly Ala Ala Ala Val Ser Glu Val Thr Phe Thr Thr Pro Leu Val 2780 2785 2790Leu Pro Pro Arg Ser Val Val Glu Leu Gln Val Arg Ile Gly Glu 2795 2800 2805Pro Asp Ala Ser Gly Arg Arg Thr Phe Ala Ala Tyr Ser Arg Ala 2810 2815 2820Asp Ala Ala Ile Glu Ala Glu Trp Thr Gln His Ala Thr Gly Val 2825 2830 2835Leu Ser Ala Gln Ala Ala Ala Gly Ala Asp Val Ala Asp Leu Ser 2840 2845 2850Val Trp Pro Pro Pro Gly Ala Glu Val Val Ala Leu Asp Gly Gly 2855 2860 2865Tyr Ala Trp Leu Ala Ala Gln Gly Tyr Gly Tyr Gly Pro Ala Phe 2870 2875 2880Gln Ala Leu Arg Glu Val Trp Arg Ala Gly Thr Thr Leu Tyr Ala 2885 2890 2895Arg Val Ala Leu Pro Asp Ala Val Ala Asp Thr Ala Arg Gly Phe 2900 2905 2910Gly Ile His Pro Ala Leu Leu Asp Ala Val Leu His Ser Leu Leu 2915 2920 2925Ala Pro Ser Ala Gln Glu Glu Ala Ser Asp Asp Asp Lys Val Leu 2930 2935 2940Leu Ala Phe Ala Phe Ser Asp Val Val Ile Glu Ala Arg Gly Ala 2945 2950 2955Ala Glu Val Arg Val Arg Leu Asn Lys Gln Ala Gly Asp Asp Gly 2960 2965 2970Glu Gly Val Thr Ala Ser Ile His Leu Ala Asp Ala Gln Gly Arg 2975 2980 2985Pro Val Ala Arg Val Gly Ala Phe Gln Ala Arg Ala Thr Thr Thr 2990 2995 3000Glu Arg Val Arg Ala Leu Ala Gly Ala Ser Glu Arg Asp Leu His 3005 3010 3015Arg Val Thr Trp Thr Asp Val Thr Leu Glu Glu Thr Pro Trp Ala 3020 3025 3030His Glu Asp Ser Val Val Val Gly Gly Asp Gly Ala Leu Ala Ala 3035 3040 3045Ala Leu Gly Val Arg Ala Val Ala Gly Leu Pro Glu Leu Leu Ala 3050 3055 3060Gly Gly Ala Ala Ala Pro Arg Arg Leu Val Ile Asp Ala Thr Ala 3065 3070 3075Gly Asp Pro Gly Asp Gly Leu Val Ala Ala Thr His Ala Ala Thr 3080 3085 3090Gln Arg Gly Leu Ala Leu Leu Gln Gly Trp Leu Ser Glu Ala Arg 3095 3100 3105Leu Ala Ala Thr Glu Leu Val Leu Val Thr Arg Gly Ala Ala Ala 3110 3115 3120Ala Glu Pro Asp Glu Gly Val Ala Ala Leu Ser His Ala Pro Leu 3125 3130 3135Trp Gly Leu Val Arg Ala Ala Arg Glu Glu His Pro Ala Arg Ala 3140 3145 3150Leu Arg Leu Val Asp Leu Gly Arg Glu Ala Pro Asp Gly Ala Ile 3155 3160 3165Leu Arg Arg Ala Ile Ala Ala Asp Asp Glu Pro Glu Leu Val Val 3170 3175 3180Arg Arg Gly Ala Leu Arg Ala Ala Arg Leu Ser Leu Ala His Ala 3185 3190 3195Gly Pro Asp Thr Ala Gly Gln Ala Thr Arg Leu Ala Pro Gly Gly 3200 3205 3210Thr Val Leu Ile Thr Gly Gly Thr Gly Glu Leu Gly Arg Gln Val 3215 3220 3225Ala Arg His Leu Val Ala Ala His Gly Val Arg His Leu Val Leu 3230 3235 3240Thr Ser Arg Arg Gly Met Asp Ala Pro Asp Ala Ala Ala Leu Val 3245 3250 3255Glu Ser Leu Arg Ala Ala Gly Ala Ala Thr Val Glu Ile Ala Ala 3260 3265 3270Cys Asp Val Ala Asp Gly His Ala Leu Ala Ala Val Leu Arg Thr 3275 3280 3285Ile Pro Ala Glu His Pro Leu Thr Ala Val Val His Thr Ala Gly 3290 3295 3300Val Leu Glu Asp Gly Val Val Thr Gly Leu Ser Ala Glu Gln Leu 3305 3310 3315Ala Arg Val Leu Arg Pro Lys Val Asp Gly Ala Trp Gln Leu Tyr 3320 3325 3330Glu Ala Thr Lys Asp Ala Pro Leu Ala Ala Phe Met Leu Phe Ser 3335 3340 3345Ser Ala Ala Gly Thr Leu Gly Ser Ala Gly Gln Ala Asn Tyr Ala 3350 3355 3360Ala Ala Asn Ala Phe Leu Asp Ala Leu Ala Ala Glu Leu Arg Ala 3365 3370 3375Arg Gly Val Pro Ala Met Ser Leu Ala Trp Gly Phe Trp Glu Gln 3380 3385 3390Gly Gly Ile Gly Met Thr Ala His Leu Gly Ala Ala Asp Met Ala 3395 3400 3405Arg Val Lys Arg Gln Gly Ile Val Pro Met Thr Val Ala His Gly 3410 3415 3420Leu Arg Leu Leu Asp Arg Ala Leu Glu Arg Pro Glu Ala Thr Leu 3425 3430 3435Val Pro Leu Ser Leu Asp Val Ala Ala Leu Gln Arg Ala Ala Ser 3440 3445 3450Asp Ala Gly Arg Val Pro Ala Leu Leu Arg Gly Leu Val Arg Pro 3455 3460 3465Ala Ala Ala Arg Arg Thr Ala Ala Pro Ala Ala Ala Ala Thr Gly 3470 3475 3480Leu Arg Ala Arg Leu Leu Pro Leu Ser Glu Ala Glu Arg Gln Asp 3485 3490 3495Val Leu Leu Asp Leu Val Arg Thr Glu Ile Ala Asp Ile Leu Ala 3500 3505 3510Leu Ser Gly Pro Ala Ala Val Pro Pro Asp Gln Pro Ile Arg Glu 3515 3520 3525Leu Gly Leu Asp Ser Leu Thr Ala Val Asp Val Arg Ser Arg Leu 3530 3535 3540Val Gln Arg Ser Glu Ile Asp Leu Ala Val Thr Leu Ala Tyr Asp 3545 3550 3555Tyr Pro Thr Ala Arg Ala Ile Ala Gly His Leu Ser Glu Gln Met 3560 3565 3570Gly Leu Glu Gly Ala Pro Glu Asp Arg Glu Ser Ala Leu Asp Glu 3575 3580 3585Ser Gln Ile Arg Ala Leu Leu Met Gln Ile Pro Ile Pro Thr Leu 3590 3595 3600Arg Gln Ser Gly Leu Leu Gly Asp Leu Val Arg Leu Ala Ser Pro 3605 3610 3615Gln Ala Pro Pro Arg Glu Glu Gly Glu Ser Glu Thr Leu Ser Phe 3620 3625 3630Asp His Leu Gly Asn Glu Glu Phe Leu Ser Leu Ala Ser Lys Leu 3635 3640 3645Ile Ala Glu Glu Gly Ser 3650261880PRTSorangium cellulosum 26Met Asn Gln Glu Thr Val Leu Arg Gln Thr Leu Glu Lys Ser Leu His1 5 10 15Lys Ile Gln His Leu Asn Arg Glu Leu Glu Arg Leu Lys Ala Lys Ser 20 25 30Ser Glu Pro Ile Ala Ile Val Ser Met Ala Cys Arg Tyr Pro Gly Gly 35 40 45Val Asp Gly Pro Ala Arg Leu Trp Glu Leu Leu Ser Glu Gly Arg Asp 50 55 60Ala Ile Gly Pro Phe Pro Glu Gly Arg Gly Trp Asp Val Ala Gly Leu65 70 75 80Tyr Asp Pro Asp Pro Asp Ala Pro Gly Lys Ser Val Thr Thr Gln Gly 85 90 95Gly Phe Leu Tyr Asp Ala Asp Arg Phe Asp Pro Thr Phe Phe Gly Ile 100 105 110Ser Pro Arg Glu Ala Glu Arg Met Asp Pro Gln Gln Arg Leu Leu Leu 115 120 125Glu Cys Ala Trp Glu Ala Leu Glu Arg Ala Gly Val Ala Pro His Thr 130 135 140Leu Glu Ala Ser Ala Thr Gly Val Phe Val Gly Leu Val Tyr Ser Asp145 150 155 160Tyr Gly Gly Arg Leu Leu Glu His Leu Glu Val Phe Asp Gly Tyr Val 165 170 175Ala Thr Gly Ser Phe Pro Ser Val Gly Ser Gly Arg Ile Ala Tyr Thr 180 185 190Leu Gly Leu Arg Gly Pro Ala Val Thr Val Asp Thr Ala Cys Ser Ser 195 200 205Ser Leu Val Ser Leu His Leu Ala Cys Met Ser Leu Arg Ala Gly Glu 210 215 220Cys Asp Met Ala Leu Ala Gly Gly Ala Thr Val Met Ala Thr Pro Met225 230 235 240Ala Phe Ile Glu Phe Ser Arg Gln Arg Gly Met Ala Pro Asp Ala Arg 245

250 255Cys Lys Ala Phe Gly Ala Ala Ala Asn Gly Ile Gly Pro Ala Glu Gly 260 265 270Cys Gly Ile Leu Val Leu Lys Arg Leu Ser Asp Ala Arg Arg Asp Gly 275 280 285Asp Arg Val Leu Ala Val Ile Arg Gly Ser Ala Val Asn Gln Asp Gly 290 295 300Arg Ser Gln Gly Leu Thr Ala Pro Asn Gly Pro Ala Gln Gln Asp Val305 310 315 320Ile Arg Gln Ala Leu Ala Ala Ala Gly Leu Thr Pro Ala Asp Val Asp 325 330 335Ala Val Glu Ala His Gly Thr Gly Thr Pro Leu Gly Asp Pro Ile Glu 340 345 350Ala Gln Ala Leu Leu Ala Thr Tyr Gly Lys Thr His Thr Ala Glu Arg 355 360 365Pro Leu Trp Leu Gly Ser Ile Lys Ser Asn Phe Gly His Thr Gln Ala 370 375 380Ala Ala Gly Val Ala Gly Ile Ile Lys Leu Val Leu Ala Met Gln His385 390 395 400Ala Glu Leu Pro Arg Thr Leu Tyr Ala Glu Pro Arg Ser Pro His Val 405 410 415Asp Trp Ser Gln Gly His Val Lys Leu Leu Asn Glu Pro Val Pro Trp 420 425 430Pro Arg Thr Asp Arg Pro Arg Arg Ala Ala Val Ser Ser Phe Gly Val 435 440 445Ser Gly Thr Asn Ala His Val Ile Leu Glu Glu Ala Pro Ala Glu Ala 450 455 460Pro Ala Ala Ala Gln Thr Ala Ala Gly Val Pro Ser Thr Leu Pro Leu465 470 475 480Leu Leu Ser Gly Arg Asp Glu Pro Ala Leu Arg Ala Gln Ala Gly Arg 485 490 495Leu Ala Glu His Leu Arg Ala His Pro Asp Glu Arg Leu Leu Asp Ile 500 505 510Ala Ala Gly Leu Ala Thr Thr Arg Thr His Leu Ala Thr Arg Leu Ala 515 520 525Leu Pro Val Ala Ala Asp Ala Ala Ala Glu Glu Leu Ser Ala Arg Leu 530 535 540Ala Gln Phe Ala Ala Gly Gly Pro Ala Pro Ser Gly Ala Ala Val Thr545 550 555 560Ala Pro Gly Gln Pro Pro Gly Lys Val Ala Val Leu Phe Thr Gly Gln 565 570 575Gly Ser Gln Arg Ala Ala Met Gly Arg Ala Leu Tyr Ala Thr His Pro 580 585 590Val Phe Arg Ala Ala Leu Asp Ala Ala Cys Ala Glu Leu Asp Arg His 595 600 605Leu Asp Arg Pro Leu His Ser Val Leu Phe Ala Asp Ala Gly Thr Glu 610 615 620Ala Ala Ala Leu Leu Asp Gln Thr Gly Trp Ala Gln Pro Ala Leu Phe625 630 635 640Ala Leu Glu Val Ala Leu Tyr Arg Gln Trp Glu Ala Trp Gly Leu Arg 645 650 655Ala His Ala Leu Leu Gly His Ser Leu Gly Glu Ile Val Ala Ala His 660 665 670Ile Ala Gly Val Leu Asp Leu Pro Asp Ala Ser Ala Leu Val Ala Ala 675 680 685Arg Gly Arg Leu Met Gln Ala Leu Pro His Gly Gly Ala Met Ala Ser 690 695 700Ile Glu Ala Thr Glu His Glu Leu Leu Pro Leu Leu Asp Gln His Thr705 710 715 720Gly Arg Leu Ser Leu Ala Ala Leu Asn Ala Pro Arg Gln Ser Val Val 725 730 735Ser Gly Asp Gln Pro Ala Val Asp His Val Cys Ala His Phe Lys Ala 740 745 750Leu Gly Arg Arg Ala Lys Arg Leu Asp Val Ser His Ala Phe His Ser 755 760 765Ala Arg Met Glu Pro Met Leu Asp Ala Phe Ala Arg Val Ala Arg Gly 770 775 780Leu Thr Tyr Arg Ala Pro Arg Leu Pro Val Val Ser Asn Val Thr Gly785 790 795 800Arg Met Ala Thr Ala Asp Glu Leu Thr Ser Pro Asp Tyr Trp Val Arg 805 810 815His Val Arg Glu Pro Val Arg Phe Val Ala Gly Val Arg Ala Leu His 820 825 830Ala Thr Gly Val Ala Thr Tyr Leu Glu Cys Gly Pro Asp Pro Val Leu 835 840 845Gly Gly Met Ala Ala Asp Cys Leu Thr Ser Asp Glu Ser Arg Asp Pro 850 855 860Gly Leu Ile Pro Ser Leu Arg Lys Asp Arg Asp Glu Ala Leu Ala Ile865 870 875 880Ala Gln Ala Ala Cys Ala Leu His Val Arg Gly His Ala Leu Asp Trp 885 890 895Pro Arg Leu Phe Asp Ala Thr Gly Ala Arg Arg Val Glu Leu Pro Thr 900 905 910Tyr Ala Phe Gln Arg Gln Arg Tyr Trp Ile Asp Ala Pro Arg Arg Ala 915 920 925Ala Gly Leu Glu Ser Val Gly Leu Thr Ala Ala Asp His Pro Trp Leu 930 935 940Gly Ala Ala Val Arg Leu Ala Asp Arg Asp Val Tyr Val Leu Ser Gly945 950 955 960Arg Leu Ser Thr Val Asp His Pro Trp Ile Leu Asp His Val Val Thr 965 970 975Gly Thr Ala Leu Met Pro Gly Thr Gly Phe Val Glu Leu Ala Trp Ala 980 985 990Thr Ala Gln Ala Val Asn Ala Ala Ala Ile Ala Glu Leu Thr Leu Thr 995 1000 1005Thr Pro Leu Val Leu Pro Ala Arg Gly Ala Val Gln Leu Gln Val 1010 1015 1020Thr Val Asp Glu Ala Asp Ala Asp Gly Arg Arg Ala Phe Ala Ile 1025 1030 1035His Ser Arg Pro His Gly Pro Val Asp Leu Glu Trp Thr Gln His 1040 1045 1050Ala Thr Gly Val Leu Ser Ala Glu Ala Pro Ala Gly Ala Asp Glu 1055 1060 1065Ala Ala Gly Leu Ser Glu Trp Pro Pro Pro Gly Ala Glu Ala Val 1070 1075 1080Ala Leu Asp Gly Gly Tyr Glu Gln Leu Ser Glu His Gly Tyr Gly 1085 1090 1095His Gly Pro Ala Phe Gln Gly Leu Arg Gly Leu Trp Arg Ala Asp 1100 1105 1110Gln Thr Leu Tyr Ala His Val Ala Leu Pro Asp Ala Val Ala Gly 1115 1120 1125Thr Glu Gln Gly Phe Gly Leu His Pro Ala Leu Phe Asp Ala Ala 1130 1135 1140Leu Gln Ser Leu Ala Arg Leu Ser Arg Glu Glu Ala Ala Ala Gly 1145 1150 1155Asp Pro Val Leu Val Pro Phe Ala Trp Thr Asp Val Ala Leu Tyr 1160 1165 1170Ala Ala Gly Ala Thr Glu Leu Arg Ala Arg Ile Ala Leu Glu Gln 1175 1180 1185Ala Glu Gly Gly Ala Pro Ala Val Ala Ser Leu Leu Leu Ala Asp 1190 1195 1200Ala His Gly Arg Thr Val Ala Thr Thr Gly Arg Val Arg Gly Ala 1205 1210 1215Ser Ala Ala Gln Thr Arg Ser Ala Ala Ser Arg Ala Glu Pro Met 1220 1225 1230Tyr Arg Val Ala Trp Thr Asp Val Ala Leu Glu Ala Ala Ala Trp 1235 1240 1245Ala Pro Glu Glu His Val Val Leu Gly Gly Asp Gly Ala Leu Ala 1250 1255 1260Ser Ala Leu Gly Val Arg Ala Ala Ala Gly Leu Pro Glu Leu Leu 1265 1270 1275Glu Ala Leu Ala Asp Gly Ala Ala Ala Pro Arg Arg Leu Val Val 1280 1285 1290Asp Leu Thr Ala Gly Asp Ala Gly Ala Val Val Ala Ala Val His 1295 1300 1305Ala Ala Ala Arg Gly Ala Leu Ala Leu Val Gln Gly Trp Leu Ala 1310 1315 1320Ala Pro Gln Leu Thr Ala Thr Glu Leu Leu Val Val Thr Arg Cys 1325 1330 1335Ala Val Ala Thr Gly Pro Asp Glu Gly Val Asp Ala Leu Gly Pro 1340 1345 1350Ala Ala Val Trp Gly Leu Leu Arg Ala Thr Arg Ala Glu His Pro 1355 1360 1365Asp Arg Ala Val Arg Val Leu Asp Leu Gly Arg Glu Pro Leu Asp 1370 1375 1380Gly Ala Leu Leu Arg Arg Ala Leu Ala Ala Val Ala Glu Pro Glu 1385 1390 1395Leu Ser Leu Arg Arg Gly Glu Ala Arg Ala Pro Arg Leu Arg Glu 1400 1405 1410Ala Lys Pro Ala Ala Ala Pro Ala Thr Arg Leu Asp Pro Glu Gly 1415 1420 1425Thr Val Leu Val Thr Gly Gly Thr Gly Glu Leu Gly Arg Gln Val 1430 1435 1440Ala Arg His Leu Val Ala Ala His Gly Val Arg His Leu Val Leu 1445 1450 1455Thr Ser Arg Arg Gly Met Asp Ala Pro Asp Ala Ala Ala Leu Val 1460 1465 1470Glu Glu Leu Arg Ala Ala Gly Ala Ala Thr Val Asp Val Ala Ala 1475 1480 1485Cys Asp Val Ala Ala Gly Pro Ala Leu Ala Ala Val Val Glu Ala 1490 1495 1500Ile Pro Ala Ala His Pro Leu Thr Ala Val Val His Met Ala Gly 1505 1510 1515Val Leu Asp Asp Gly Ile Val Thr Lys Leu Ser Ala Glu Gln Leu 1520 1525 1530Thr Arg Val Leu Arg Pro Lys Val Asp Gly Ala Ile His Leu His 1535 1540 1545Glu Leu Thr Lys His Ala Pro Leu Ala Ala Phe Val Met Phe Ser 1550 1555 1560Ser Ala Ala Gly Thr Leu Gly Ser Pro Gly Gln Ala Asn Tyr Thr 1565 1570 1575Ala Ala Asn Val Phe Leu Asp Ala Leu Ala Ala Arg Leu Arg Ala 1580 1585 1590Arg Gly Val Pro Ala Met Ser Leu Ala Trp Gly Phe Trp Glu Gln 1595 1600 1605Gly Gly Ile Gly Met Thr Ala His Leu Gly Ala Ala Asp Arg Ala 1610 1615 1620Arg Met Lys Arg His Gly Val Val Ala Met Ser Val Ala Gln Gly 1625 1630 1635Leu Arg Leu Leu Asp Arg Ala Leu Ala His Pro Glu Ala Ala Leu 1640 1645 1650Val Pro Leu Ala Leu Asp Leu Ser Ser Leu His Ala Gly Ala Ser 1655 1660 1665Gly Ala Gly Pro Val Pro Pro Leu Leu Arg Gly Leu Val Arg Ala 1670 1675 1680Pro Ala Gly Arg Arg Thr Ala Ala Ser Ala Ala Arg Thr Asn Gly 1685 1690 1695Lys Gly Thr Ala Leu Ala Ala Leu Arg Ala Arg Leu Leu Pro Leu 1700 1705 1710Pro Gln Ala Glu Arg Glu Asp Leu Leu Leu Glu Leu Val Cys Thr 1715 1720 1725Glu Val Ala Glu Val Leu Gln Leu Pro Gly Pro Ala His Val Pro 1730 1735 1740Ala Asp Gln Pro Leu Arg Asp Leu Gly Leu Asp Ser Leu Met Thr 1745 1750 1755Val Glu Leu Arg Asn Arg Leu Gly Ala Arg Ala Glu Thr Thr Leu 1760 1765 1770Pro Thr Thr Leu Ala Phe Asp Tyr Pro Thr Pro Arg Ala Leu Ala 1775 1780 1785Ser Tyr Leu Glu Thr Leu Leu Gly Ile Ser Asp Glu Asn Gly His 1790 1795 1800Ser Gly Glu Leu Leu His Val Pro Gln Asn Glu Asp Glu Ile Arg 1805 1810 1815Ser Ala Ile Ala Arg Ile Pro Ile Ala Thr Leu Arg Glu Ala Gly 1820 1825 1830Leu Leu Gln Ser Leu Leu Arg Leu Ala Pro Gly Lys Ala Val Ala 1835 1840 1845Gly Asp Val Thr His Pro Val Asp Glu Leu Leu Val Glu His Ile 1850 1855 1860Glu Asp Glu Glu Leu Leu Arg Leu Ala Phe Glu Ala Thr Gly Gly 1865 1870 1875Ile Lys 1880272869PRTSorangium cellulosum 27Met Lys Asp Glu Ala Leu Ser Phe Arg Arg Ala Leu Glu Lys Thr Val1 5 10 15Val Glu Ile Arg Arg Leu Asn Arg Glu Ile Asp Asp Leu Arg Ala Lys 20 25 30Ser Ser Glu Pro Ile Ala Ile Val Ser Met Ala Cys Arg Phe Pro Gly 35 40 45Gly Val Glu Asn Pro Glu Ala Leu Trp Arg Leu Val Ser Glu Gly Gln 50 55 60Asp Ala Ile Gly Pro Phe Pro Glu Gly Arg Gly Trp Asp Val Ala Gly65 70 75 80Leu Tyr Asp Pro Asp Pro Asp Val Pro Gly Lys Ser Ile Thr Ala Arg 85 90 95Gly Gly Phe Leu Tyr Asp Ala Asp Arg Phe Asp Pro Glu Phe Phe Gly 100 105 110Ile Ser Pro Arg Glu Ala Glu Arg Ile Asp Pro Gln Gln Arg Leu Leu 115 120 125Leu Glu Cys Ala Trp Glu Ala Leu Glu Arg Ala Gly Val Ala Pro His 130 135 140Thr Lys Glu Ala Ser Ala Thr Gly Val Phe Val Gly Leu Met Tyr Thr145 150 155 160Asp Tyr Gly Leu Arg Leu Leu Asn His Pro Glu Ala Leu Asp Gly Tyr 165 170 175Ile Gly Ile Gly Ser Thr Gly Ser Thr Gly Ser Gly Arg Ile Ala Tyr 180 185 190Thr Leu Gly Leu Gln Gly Pro Ala Ile Thr Val Asp Thr Ala Cys Ser 195 200 205Ser Ser Leu Val Ala Leu His Met Ala Cys Ala Ser Leu Arg Gly Gly 210 215 220Glu Cys Asn Leu Ala Leu Val Gly Gly Val Ala Val Met Thr Thr Pro225 230 235 240Thr Thr Phe Ile Glu Phe Ser Arg Gln Arg Gly Leu Ser Leu Asp Gly 245 250 255Arg Cys Lys Ser Phe Gly Ala Glu Ala Glu Gly Val Gly Trp Gly Glu 260 265 270Gly Cys Gly Ile Leu Ala Leu Lys Arg Leu Ser Asp Ala Arg Arg Asp 275 280 285Gly Asp Arg Val Leu Ala Ile Ile Arg Gly Ser Ala Val Asn Gln Asp 290 295 300Gly Arg Ser Gln Gly Phe Thr Ala Pro Asn Gly Pro Ser Gln Arg Ala305 310 315 320Val Ile Gln Arg Ala Leu Ala Ala Ala Gly Leu Thr Ala Ala Asp Val 325 330 335Asp Ala Val Glu Gly His Gly Thr Gly Thr Arg Leu Gly Asp Pro Ile 340 345 350Glu Ala Gln Ala Leu Leu Ala Thr Tyr Gly Lys Ala His Thr Ala Glu 355 360 365Arg Pro Leu Trp Leu Gly Ser Ile Lys Ser Asn Phe Gly His Thr Gln 370 375 380Ala Ala Ala Gly Val Ala Gly Ile Ile Lys Leu Val Leu Ala Met Gln385 390 395 400His Ala Glu Leu Pro Arg Thr Leu His Ala Asp Thr Pro Ser Pro His 405 410 415Val Asp Trp Ser Gln Gly His Val Lys Leu Leu Asn Glu Pro Val Pro 420 425 430Trp Pro Arg Thr Asp Arg Pro Arg Arg Ala Ala Val Ser Ser Phe Gly 435 440 445Ile Ser Gly Thr Asn Ala His Val Ile Leu Glu Glu Ala Pro Ala Glu 450 455 460Ala Pro Ala Ala Ala Gln Thr Pro Ala Ala Ala Gly Val Pro Ser Thr465 470 475 480Leu Pro Leu Leu Leu Ser Gly Arg Asp Glu Pro Ala Leu Arg Ala Gln 485 490 495Ala Gly Arg Leu Ala Glu His Leu Arg Ala His Pro Gly Glu Arg Leu 500 505 510Leu Asp Ile Ala Ala Gly Leu Ala Thr Thr Arg Thr His Leu Ala Thr 515 520 525Arg Leu Ala Leu Pro Val Ala Ala Asp Ala Ala Ala Glu Glu Leu Ser 530 535 540Ala Arg Leu Ala Gln Phe Ala Ala Gly Gly Pro Ala Pro Ser Gly Ala545 550 555 560Ala Val Thr Ala Pro Gly Gln Pro Pro Gly Lys Val Ala Val Leu Phe 565 570 575Thr Gly Gln Gly Ser Gln Arg Ala Ala Met Gly Arg Ala Leu Tyr Ala 580 585 590Thr His Pro Val Phe Arg Ala Ala Leu Asp Ala Ala Cys Ala Glu Leu 595 600 605Asp Arg His Leu Asp Arg Pro Leu His Ser Val Leu Phe Ala Asp Ala 610 615 620Gly Thr Glu Ala Ala Ala Leu Leu Asp Gln Thr Gly Trp Ala Gln Pro625 630 635 640Ala Leu Phe Ala Leu Glu Val Ala Leu Tyr Arg Gln Trp Glu Ala Trp 645 650 655Gly Leu Arg Ala His Ala Leu Leu Gly His Ser Leu Gly Glu Ile Val 660 665 670Ala Ala His Ile Ala Gly Val Phe Asp Leu Pro Asp Ala Ser Ala Leu 675 680 685Val Ala Ala Arg Gly Arg Leu Met Gln Ala Leu Pro His Gly Gly Ala 690 695 700Met Ala Ser Ile Glu Ala Thr Glu His Glu Leu Leu Pro Leu Leu Asp705 710 715 720Gln His Thr Gly Arg Leu Ser Leu Ala Ala Leu Asn Ala Pro Arg Gln 725 730 735Ser Val Val Ser Gly Asp Gln Pro Ala Val Asp Gln Val Cys Ala His 740 745 750Phe Lys Ala Leu Gly Arg Arg Ala Lys Arg Leu Asp Val Ser His Ala 755 760 765Phe His Ser Ala Arg Met Glu Pro Met Leu Asp Ala Phe Ala Arg Val 770 775 780Ala Arg Gly Leu Thr Tyr Arg Ala Pro Arg Leu Pro Val Val Ser Asn785 790 795 800Val Thr Gly Arg Met Ala Thr Ala Asp Glu Leu Thr Ser Pro Asp Tyr 805 810 815Trp Val Arg His Val Arg Glu Pro Val Arg Phe Val Ala Gly Val Arg

820 825 830Ala Leu His Ala Thr Gly Val Ala Thr Tyr Leu Glu Cys Gly Pro Asp 835 840 845Pro Val Leu Gly Gly Met Ala Ala Asp Cys Leu Thr Ser Asp Glu Ser 850 855 860Arg Asp Pro Gly Leu Ile Pro Ser Leu Arg Lys Asp Arg Asp Glu Ala865 870 875 880Leu Ala Ile Ala Gln Ala Ala Cys Ala Leu His Val Arg Gly His Ala 885 890 895Leu Asp Trp Pro Arg Leu Phe Asp Ala Thr Gly Ala Arg Arg Val Glu 900 905 910Leu Pro Thr Tyr Ala Phe Gln Arg Gln Arg Tyr Trp Leu Glu Thr Pro 915 920 925Gln Thr Pro Gly Ala Asp Gly Ala Ser Asn Leu Ser Ser Pro Ala Glu 930 935 940Ser Arg Phe Trp Glu Ala Val Glu Arg Ala Asp Ile Ile Pro Leu Ala945 950 955 960Glu Ala Leu Arg Leu Glu Asp Glu Ala Gln Arg Ala Ser Leu Ala Thr 965 970 975Leu Leu Pro Ala Leu Ser Thr Trp Arg Arg Arg Arg His Glu Gln Ser 980 985 990Thr Ala Asp Ala Trp Arg Tyr Arg Val Ala Trp Lys Pro Leu Ala Ile 995 1000 1005Asp Ala Arg Ser Asp Leu Ser Gly Val Trp Leu Phe Leu Ala Pro 1010 1015 1020Pro Asp His Ala Lys Asp Asp Leu Ala Arg Ala Val Leu Arg Ala 1025 1030 1035Leu Ala Glu Ser Gly Ala Thr Val Val Pro Val Leu Val Ala Glu 1040 1045 1050Gly Asp Val Asp Arg Ala Leu Leu Ser Ala Arg Leu Arg Glu Gln 1055 1060 1065Val Gly Asp Gly Gly Ala Ile Arg Gly Val Ile Ser Leu Leu Ala 1070 1075 1080Leu Asp Glu Thr Ser Leu Pro Gln His Asp Gly Leu Pro Arg Gly 1085 1090 1095Leu Ala Phe Thr Leu Ala Leu Val Gln Ala Leu Gly Asp Thr Ala 1100 1105 1110Ile Ala Ala Pro Leu Trp Leu Leu Thr Arg Gly Ala Val Ser Val 1115 1120 1125Gly Arg Ser Asp Arg Leu Glu Arg Pro Leu Gln Ala Leu Thr Trp 1130 1135 1140Gly Leu Gly Arg Val Val Ala Leu Glu His Pro Glu Arg Trp Gly 1145 1150 1155Gly Leu Ile Asp Leu Ala Gly Ala Leu Asp Glu Lys Ala Leu Lys 1160 1165 1170Arg Leu Val Ala Ala Leu Gly Gly Arg Asp Ala Glu Asp Gln Leu 1175 1180 1185Ala Leu Arg Pro Ser Gly Leu Phe Ala Arg Arg Leu Val Arg Ala 1190 1195 1200Pro Leu Gly Glu Ala Thr Ala Val Arg Ala Trp Lys Ala Arg Gly 1205 1210 1215Thr Ala Leu Val Thr Gly Gly Thr Gly Asp Leu Gly Ala His Val 1220 1225 1230Ala Arg Trp Leu Ala Gln Asn Gly Ala Glu His Leu Val Leu Thr 1235 1240 1245Ser Arg Arg Gly Gln Asp Ala Pro Gly Ala Ala Glu Leu Thr Ala 1250 1255 1260Glu Leu Thr Ala Leu Gly Ala Arg Val Thr Ile Ala Ala Cys Asp 1265 1270 1275Ser Ser Asp Arg Gln Ala Leu Ala Ala Leu Leu Gln Arg Leu Arg 1280 1285 1290Ala Glu Gly Pro Pro Leu Arg Ala Val Val His Ala Ala Gly Val 1295 1300 1305Asp Gln Val Thr Pro Leu Ala Arg Thr Ser Leu Ala Glu Phe Ala 1310 1315 1320Gly Ile Ala Ser Gly Lys Val Ala Gly Ala Arg His Leu Asp Asp 1325 1330 1335Leu Leu Gly Asn Ala Pro Leu Asp Ala Phe Ile Leu Phe Ser Ser 1340 1345 1350Val Ala Gly Val Trp Gly Ser Gly Phe Gln Gly Ala Tyr Ala Ala 1355 1360 1365Ala Asn Ala Phe Leu Asp Ala Leu Ala Glu Gln Arg Arg Ala Leu 1370 1375 1380Gly Ser Thr Ala Thr Ser Ile Ala Trp Gly Leu Trp Gly Gly Lys 1385 1390 1395Ser Met Ala Asp Asp Ala Ala Lys Asp His Leu Ser Lys Arg Gly 1400 1405 1410Val Ser Pro Met Pro Pro Gln Leu Ala Ile Ala Ala Leu Gln Arg 1415 1420 1425Ala Leu Asp His Asp Glu Thr Thr Leu Thr Leu Ala Asp Val Asn 1430 1435 1440Trp Ser Arg Phe Ala Pro Ala Phe Ala Ala Ala Arg Pro Arg Pro 1445 1450 1455Leu Leu His Asp Leu Pro Glu Ala Arg Ser Ala Leu Glu Ser Pro 1460 1465 1470Ser Pro Ala Pro Arg Glu Ala Glu Leu Leu Thr Arg Leu Gln Gly 1475 1480 1485Leu Ser Ser Thr Glu Arg Val Arg His Leu Val Ser Leu Val Leu 1490 1495 1500Ala Glu Thr Ala Val Val Leu Gly His Pro Asp Ala Ser Arg Leu 1505 1510 1515Asp Pro His Thr Gly Phe Ala Asp Leu Gly Leu Asp Ser Leu Met 1520 1525 1530Ala Val Glu Met Arg Arg Arg Leu Gln Gln Ala Thr Gly Val Ser 1535 1540 1545Leu Pro Ala Thr Leu Thr Phe Asp His Pro Ser Pro His His Ile 1550 1555 1560Ala Thr Phe Leu Leu Asp Glu Val Phe Ala Pro Ala Leu Gly Gln 1565 1570 1575Ala Pro Gly Ala Glu Glu Asp Glu Ala Ile Ala Gln Ala Gly Leu 1580 1585 1590Ala Ser Gly Asp Glu Pro Val Ala Leu Ile Gly Val Gly Leu Arg 1595 1600 1605Leu Pro Gly Gly Ala Thr Asp Leu Asp Gly Leu Trp Arg Leu Leu 1610 1615 1620Glu Gln Gly Ile Asp Val Val Gly Pro Val Pro Glu Asp Arg Gly 1625 1630 1635Trp Ser Met Asp Glu Leu Tyr Asp Pro Asp Pro Asp Ser Leu Gly 1640 1645 1650Lys Ser Tyr Val Arg Glu Ala Ala Phe Leu Asp Arg Ile Asp Leu 1655 1660 1665Phe Asp Ala Gly Phe Phe Gly Ile Ser Pro Arg Glu Ala Ser His 1670 1675 1680Val Asp Pro Gln His Arg Leu Leu Leu Glu Ala Ala Trp Gln Ala 1685 1690 1695Leu Glu His Ala Gly Ile Val Pro Ala Ser Leu Gln Asp Ser Gln 1700 1705 1710Thr Gly Val Phe Val Gly Ser Gly Pro Ser Asp Tyr Ala Leu Leu 1715 1720 1725His Asn Pro Ala Gln Glu Asp Glu Ala Tyr Arg Leu Thr Gly Thr 1730 1735 1740Gln Pro Ser Phe Ala Pro Gly Arg Leu Ser Phe Ser Leu Gly Leu 1745 1750 1755Gln Gly Pro Ala Leu Ser Val Asp Thr Ala Cys Ser Ser Ser Leu 1760 1765 1770Val Ala Leu His Leu Ala Ala Gln Ala Leu Arg Arg Gly Glu Cys 1775 1780 1785Gly Leu Ala Leu Val Gly Ser Ala Gln Val Met Ala Ala Pro Asp 1790 1795 1800Ala Phe Val Thr Leu Ser Arg Ala Arg Ala Ile Ala Pro Asp Gly 1805 1810 1815Arg Ser Lys Thr Phe Ser Ala Gln Ala Asp Gly Tyr Gly Arg Gly 1820 1825 1830Glu Gly Val Ile Val Phe Val Leu Glu Arg Leu Ser Asp Ala Arg 1835 1840 1845Ala Arg Gly Arg Asp Val Leu Ala Val Leu Arg Gly Ser Ala Val 1850 1855 1860Asn His Asp Gly Ala Ser Ser Gly Ile Thr Ala Pro Asn Gly Thr 1865 1870 1875Ser Gln Gln Lys Val Leu Arg Ala Ala Leu His Asp Ala Arg Leu 1880 1885 1890Thr Pro Ala Asp Val Asp Val Val Glu Cys His Gly Thr Gly Thr 1895 1900 1905Ser Leu Gly Asp Pro Ile Glu Val Gln Ala Leu Ala Ala Val Tyr 1910 1915 1920Gly Lys Glu Arg Ser Ala Asp Arg Pro Leu Met Leu Gly Ala Leu 1925 1930 1935Lys Thr Asn Val Gly His Leu Glu Ala Ala Ser Gly Leu Ala Gly 1940 1945 1950Val Ala Lys Val Val Ala Ala Leu Arg His Glu Ala Leu Pro Ala 1955 1960 1965Thr Leu His Thr Ala Ala Arg Asn Pro His Ile Gln Trp Asp Thr 1970 1975 1980Leu Pro Val Gln Val Val Asp Thr Leu Arg Pro Trp Pro Arg Arg 1985 1990 1995Glu Asp Gly Thr Pro Arg Arg Ala Gly Val Ser Ala Phe Gly Leu 2000 2005 2010Ser Gly Thr Asn Ala His Val Leu Leu Glu Glu Ala Pro Pro Val 2015 2020 2025Gln Pro Ser Thr Gln Ala Glu Gln Pro Ala Ala Pro Pro Trp Leu 2030 2035 2040Pro Leu Leu Leu Ser Gly Lys Thr Asp Ala Ala Leu Arg Ala Gln 2045 2050 2055Ala Glu Arg Leu Arg Ala His Leu Asp Ala His Ala Asp Leu Gly 2060 2065 2070Leu Ala Asp Val Ala Tyr Ser Leu Ala Thr Thr Arg Thr His Phe 2075 2080 2085Ala His Arg Ala Val Val Val Ala Asp Ala Gly Ala Thr Leu Phe 2090 2095 2100Glu Gly Leu Asp Ala Ile Ala Arg Gly Asn Ala Ala Ser His Val 2105 2110 2115Val Val Asp Glu Ala Lys Ile Asp Gly Lys Thr Val Phe Val Phe 2120 2125 2130Pro Gly Gln Gly Ser Gln Trp Ala Gln Met Ala Gln Pro Leu Leu 2135 2140 2145Glu Thr Ser Glu Leu Phe Arg Glu Arg Ile Glu Ala Cys Ala His 2150 2155 2160Ala Leu Ala Pro His Val Asp Trp Ser Leu Leu Ala Val Leu Arg 2165 2170 2175Gly Glu Glu Gly Ala Pro Ser Leu Glu Arg Val Asp Val Val Gln 2180 2185 2190Pro Val Leu Phe Ala Val Met Val Ser Leu Ala Ala Leu Trp Arg 2195 2200 2205Ser Met Gly Val Glu Pro Asp Ala Val Val Gly His Ser Gln Gly 2210 2215 2220Glu Ile Ala Ala Ala Cys Val Ala Gly Ala Leu Ser Leu Ala Asp 2225 2230 2235Ala Ala Lys Val Val Ala Leu Arg Ser Arg Ala Leu Ala Arg Leu 2240 2245 2250Ala Gly Arg Gly Ala Met Ala Val Val Glu Leu Pro Ala Ala Glu 2255 2260 2265Leu Ala Glu Arg Met Lys Arg Trp Gly Glu Arg Leu Ser Ile Ala 2270 2275 2280Ala Leu Asn Ser Pro Arg Ser Thr Val Ile Ser Gly Asp Pro Asp 2285 2290 2295Ala Val Asp Ala Leu Leu Arg Glu Leu Asp Ser Ala Glu Ile Phe 2300 2305 2310Ala Arg Lys Val Arg Val Asp Tyr Ala Ser His Cys Ser His Val 2315 2320 2325Glu Ala Ile Arg His Gln Leu Leu Ala Glu Leu Ala Gly Ile Glu 2330 2335 2340Pro Leu Pro Ser Thr Leu Pro Leu Tyr Ser Thr Val Ser Gly Asp 2345 2350 2355Lys Leu Asp Gly Val Ala Leu Asp Ala Ser Tyr Trp Tyr Arg Asn 2360 2365 2370Leu Arg Gln Thr Val Arg Phe Ser Asp Ala Thr Gln Arg Leu Val 2375 2380 2385Ser Ala Gly His Arg Phe Phe Val Glu Val Ser Pro His Pro Val 2390 2395 2400Leu Thr Phe Ala Val Gln Asp Val Leu Asp Ala Glu Gly Val Pro 2405 2410 2415Ala Ala Val Val Gly Ser Leu Arg Arg Gly Glu Gly Asp Leu Arg 2420 2425 2430Arg Phe Leu Val Ser Leu Ser Glu Leu Phe Thr Arg Gly Leu Ala 2435 2440 2445Leu Asp Trp Ser Arg Val Leu Pro Ser Gly Arg Arg Val Ser Leu 2450 2455 2460Pro Thr Tyr Ala Phe Gln Arg Glu Arg Tyr Trp Leu Gly Ala His 2465 2470 2475Arg Ala Arg Gly Thr Asp Ala Thr Ser Ala Gly Leu Ala Ser Asp 2480 2485 2490Glu Pro Thr Arg Gly Ala Ser Met Pro Val Arg Leu Ser Leu Arg 2495 2500 2505Asp Val Pro Pro Glu Glu Arg Gln Gly Ala Leu Glu Arg Phe Val 2510 2515 2520Arg Glu Gln Leu Ala Ala Val Leu Arg Met Asp Ala Ala Arg Ile 2525 2530 2535Glu Gly Gln Thr Thr Ile Lys Thr Leu Gly Ile Asp Ser Leu Met 2540 2545 2550Ala Leu Glu Ile Arg Lys Arg Leu Glu Ala Gly Leu Ala Val Thr 2555 2560 2565Leu Pro Ser Thr Leu Ile Trp Gln Phe Pro His Ala Glu Gly Leu 2570 2575 2580Ala Arg His Leu Met Thr Arg Leu Pro Ala Gly Asp Gly Glu Gly 2585 2590 2595Ser Ala Val Val Gln Pro Val Glu Gln Pro Arg Ala Pro Lys Glu 2600 2605 2610Val Pro Val Ser Met Asp Pro Ser Ala Trp Val His Arg Pro Arg 2615 2620 2625Pro Arg Ala Asp Ala Arg Val Arg Leu Phe Cys Leu Pro Tyr Ala 2630 2635 2640Gly Ala Gly Ala Ser Arg Phe Arg Ala Trp Pro Glu Leu Leu Pro 2645 2650 2655Ser Trp Val Glu Val Cys Pro Ile Gln Leu Pro Gly Arg Glu Glu 2660 2665 2670Arg Leu His Glu Pro Ala Phe Glu Thr Met Asp Ala Leu Val Asp 2675 2680 2685Ala Leu Val Pro Ala Val Glu Ala His Ile Asp Arg Pro Phe Ala 2690 2695 2700Leu Phe Gly Cys Ser Met Gly Ala Leu Leu Ala Phe Glu Leu Ala 2705 2710 2715Arg Ala Leu Gln Ser Arg His Arg Leu Val Ala Arg His Leu Phe 2720 2725 2730Gly Ala Ala Ser Ser Ser Pro Arg Arg Val Ser Pro Val Arg Glu 2735 2740 2745Gln Leu Ser Ala Val Val Ser Pro Gly Thr Val Arg Ser Asp Ala 2750 2755 2760Met Ala Ser Leu Arg Gln Leu Gly Leu Leu Ser Ser Ser Ser Leu 2765 2770 2775Gln Asp Glu Glu Met Leu Asp Glu Val Trp Pro Ala Phe Arg Ala 2780 2785 2790Asp Leu Ser Leu Thr Leu Lys Tyr Thr Cys Arg Asp Ala Thr Pro 2795 2800 2805Leu Asp Ala Pro Ile Ser Val Phe Gly Gly Thr Glu Asp Arg Thr 2810 2815 2820Val Gly Arg Glu Asp Leu Val Ala Trp His Thr Leu Thr Lys Asp 2825 2830 2835Ala Phe Gln Val Ala Met Leu Pro Gly Gly His Leu Phe Met Asp 2840 2845 2850Ala Thr Pro Lys Arg Leu Phe His His Ile Glu His Ala Leu Gln 2855 2860 2865Leu28228PRTSorangium cellulosum 28Met Arg Thr Ser Asp Ala Val Trp Ala Gly Ala Ala Gly Tyr Thr Arg1 5 10 15Ala Arg Leu Gln Val Tyr Asp Phe Phe Ile Tyr Gly Phe Asn Ser Pro 20 25 30Val Ala Trp Lys Cys Pro Gly Glu Glu Leu Leu Glu Asn Tyr Asn Arg 35 40 45His Val Ser Gly Asn His Leu Asp Val Gly Val Gly Thr Gly Tyr Leu 50 55 60Leu Asp Arg Cys Arg Phe Pro Thr Ala Lys Pro Arg Val Phe Leu Met65 70 75 80Asp Leu Asn Pro Asp Ala Leu Gln Val Thr Ala Gln Arg Leu His Arg 85 90 95Phe Gln Pro Gln Thr Leu Arg Arg Asn Val Leu Asp Pro Ile Arg Phe 100 105 110Asp Gly Glu Pro Phe Asp Ser Ile Gly Met Asn Tyr Leu Met His Cys 115 120 125Val Pro Gly Ser Ile Pro Glu Lys Ala Val Met Phe Asp His Leu Ser 130 135 140Ala Leu Leu Lys Pro Gly Gly Val Ile Phe Gly Ser Thr Val Leu Ser145 150 155 160Glu Gly Val Asp Lys Gly Ile Val Ala Arg Ala Ile Met Asp Arg Phe 165 170 175Asn Lys Lys Gly Ile Phe Ser Asn Thr Arg Asp Ala Ala Ser Asp Leu 180 185 190Thr Arg Ala Leu Glu Glu Arg Phe Asp Asp Val Ser Val Arg Val Val 195 200 205Gly Cys Val Gly Leu Phe Ser Ala Arg Lys Arg Thr Cys Ala Gly Thr 210 215 220Glu Ser Pro Ala22529301PRTSorangium cellulosum 29Ile Val Leu Gly Asp Thr Leu Glu Gln Val Ala Thr Arg Leu Leu Glu1 5 10 15Glu Asp Leu Ala Ala Cys His Thr Thr Gly Glu Ala Ala Asp Val Leu 20 25 30Leu Asn Gly Val Leu Ala Ser Ser Ala Arg Ala Val Ala Ala Ala Leu 35 40 45Arg Ala Cys Asp Glu Phe Ala Ala Gly Asp Ser Asp Leu Pro Ser Leu 50 55 60Ala Arg Ala Cys Arg Ala Phe Ala Gly Leu Ala Ser Phe Gly Ser Ser65 70 75 80Arg Ser Leu Ser Ser Leu Gly Asp Gly Val Ile Ala Pro Met Leu Glu 85 90 95Lys Thr Phe Ala Arg Ala Val Leu Arg Val His Gly Gly Cys Thr Gly 100 105 110Ser Asp Glu Ala Val Ala Ala Ala Lys Glu Ala Leu Arg Thr Leu His 115 120 125Asp Val Ala Leu Ser Gln Pro Ile Val Asp Arg Gly Ala Trp Leu Asp 130 135 140Ala Ala Arg Gly Leu Val Asp Ser Glu Val Val Asn Pro Thr Ala Ser145 150 155 160Gly Leu Ala Cys Gly Leu Leu Tyr Leu Ala Gln Ala Ile Asp Asp Ala

165 170 175Glu Val Ala Arg Val Val Gly Leu Arg Leu Gly Gly Ala Ala Glu Pro 180 185 190Glu Ala Ala Ala Ser Phe Leu Ala Gly Phe Leu Glu Val Asn Ala Leu 195 200 205Val Leu Val Lys Ser Arg Pro Val Val Glu Ala Leu Asp Ala Phe Leu 210 215 220Arg Ala Ile Ala Pro Glu Arg Phe Lys Asp Thr Leu Pro Val Leu Arg225 230 235 240Arg Ala Phe Ala Gly Leu Gly Ala Thr Glu Arg Arg Tyr Leu Leu Glu 245 250 255Asn Val Leu Ala Ala Arg Lys Leu Gly Asp Lys Ala Arg Ala Ala Gln 260 265 270Ala Val Leu Leu Glu Lys Asp Arg Glu Lys Leu Lys Glu Met Ser Glu 275 280 285Asp Leu Ser Gln Ala Met Asp Asp Leu Asp Glu Leu Leu 290 295 30030345PRTSorangium cellulosum 30Met Arg Arg Pro Glu Arg Arg Asp Arg His Pro Arg Pro Arg Ala Ser1 5 10 15Asn Arg Gln Ser Ala Arg Arg Ser Ile Asp Ala Val Asp Gly Ser Thr 20 25 30Trp Tyr Pro Ser Thr Met Arg Leu Gln Ser Val Asp Thr His Leu Val 35 40 45Val Ala Leu His Ala Leu Leu Gln Glu Lys Ser Val Thr Arg Ala Ala 50 55 60Arg Arg Val Gly Val Thr Gln Pro Ser Met Ser His Ala Leu Ala Arg65 70 75 80Leu Arg Ala His Phe Ala Asp Pro Leu Leu Ile Gln Val Gly Arg Gln 85 90 95Met Thr Leu Ser Glu Arg Ala Arg Asp Leu Ala Pro Arg Ala Ala Glu 100 105 110Ala Val Glu Arg Leu Glu Gln Val Phe Arg Pro Val Glu Arg Phe Asp 115 120 125Pro Arg Arg Ser Gln Arg Thr Phe Arg Leu Val Ala Thr Asp Asn Leu 130 135 140Glu Leu Leu Val Leu Pro Ala Leu Thr Ala Leu Leu Ala Val Glu Ala145 150 155 160Pro Arg Val Asn Leu Arg Cys Arg Asn Ile Pro Ala Asp Phe Ala Glu 165 170 175Leu Leu Arg Arg Gly Glu Leu Asp Gly Lys Leu Gly Arg Gly Gly Pro 180 185 190Val Pro Asp Gly Cys Arg Ser Thr Leu Leu Ala Ala Glu Glu Ile Val 195 200 205Cys Val Met Arg Arg Gly His Pro Ala Ser Arg Arg Pro Leu Thr Ala 210 215 220Ala Arg Tyr Ala Ala Cys Glu His Leu Met Val Ser Pro His Gly Glu225 230 235 240Asp His Gly Ala Ile Asp Arg Ala Leu Ala Glu Gln Gly Thr Arg Arg 245 250 255Arg Val Thr Leu Thr Val Ser His Phe Leu Val Ala Pro Phe Ile Val 260 265 270Ser Gly Ser Asp Leu Leu Leu Thr Val Ser Ala Arg Val Ala Ala Ala 275 280 285Leu Ala Arg Arg Leu Asp Leu Val Val Arg Pro Cys Pro Phe Ala Leu 290 295 300Glu Gly Tyr Thr Leu Thr Leu Val Trp Pro Glu Arg Ser Glu His Asp305 310 315 320Glu Gly His Gly Trp Leu Arg Asp Ala Ile Gln Arg Ala Val Ala Val 325 330 335Asp Ser Arg Pro Ala Leu Pro Gly Val 340 34531108PRTSorangium cellulosum 31Met Ile Ile Glu Tyr Val Arg Tyr Thr Ile Pro Ala Glu Gln Glu Lys1 5 10 15Glu Phe Leu Ala Ala Tyr Arg Asp Ala Ala Ala Glu Leu Arg Gly Ser 20 25 30Glu His Cys Leu Asp Tyr Glu Ile Ser Arg Cys Val Glu Asp Pro Thr 35 40 45Ser Tyr Val Val Arg Ile Cys Trp Asp Ser Leu Gln Gly His Leu Gln 50 55 60Gly Phe Arg Lys Ala Ala Ala Phe Pro Ser Phe Phe Ala Lys Val Lys65 70 75 80Pro Phe Tyr Glu Arg Ile Gln Glu Met Arg His Tyr Ala Leu Thr Asp 85 90 95Val Ala Ala Arg Gln Ala Gly Thr Ala Ala Thr Gly 100 10532486PRTSorangium cellulosum 32Met Lys Leu Ala Arg Lys Leu Thr Leu Ala Leu Val Phe Gly Val Phe1 5 10 15Leu Val Leu Ala Leu Ser Ala Tyr Ala Gln Ile Arg Arg Glu Ala Arg 20 25 30Ile Phe Glu Asn Asp Val Gln Arg Asp His His Thr Met Ala Arg Ala 35 40 45Leu Ala Ala Ala Val Met Glu Val Trp Arg Ser Glu Gly Thr Ala Arg 50 55 60Ala Leu Arg Leu Val Glu Asp Ala Asn Glu Arg Glu Gln Gln Ala Asn65 70 75 80Ile Arg Trp Val Trp Leu Asp Gly Gln Ala Asp Glu Pro His Arg Pro 85 90 95Arg Leu Ala Pro Glu Leu Leu Ala Pro Val Ala Glu Gly Arg Ala Val 100 105 110Val Arg Arg Ile Pro Gln Lys Asp Ala Asp Leu Leu Val Thr Cys Val 115 120 125Pro Val Ser Val Pro Gly Asp Arg Ala Gly Ala Leu Glu Leu Ser Glu 130 135 140Ser Leu Ala Gly Ala Arg Arg Tyr Ile Arg Ser Met Ile Leu Ser Thr145 150 155 160Ala Ile Thr Thr Ala Ala Leu Thr Leu Val Cys Gly Leu Leu Thr Thr 165 170 175Gly Leu Gly Val Trp Leu Val Gly Arg Pro Met Arg Thr Leu Ile Asp 180 185 190Gln Ala Arg Arg Ile Gly Ala Gly Asp Leu Ser Gly Arg Leu Ser Leu 195 200 205Arg Gln Glu Asp Glu Ile Gly Glu Leu Gly Arg Glu Met Asn Ala Met 210 215 220Cys Asp Arg Leu Ala Ala Ala Asn Gln Lys Leu Glu Ser Glu Ala Ala225 230 235 240Ala Arg Ile Ala Ala Leu Gln Gln Leu Arg His Ala Glu Arg Leu Ala 245 250 255Thr Val Gly Lys Leu Ala Ser Gly Ile Ala His Glu Leu Gly Ala Pro 260 265 270Leu Gln Val Val Thr Gly Arg Ala Arg Met Leu Val Asp Gly Asp Val 275 280 285Ser Gly Asp Glu Val Pro Ile Asn Gly Gln Ile Ile Leu Glu Gln Ser 290 295 300Gln Arg Met Thr Gln Ile Ile Arg Gln Leu Leu Asp Phe Ala Arg Arg305 310 315 320Arg Ser Ala Glu Lys Gln Glu Thr Ala Leu Arg Gly Val Ile Arg Gly 325 330 335Thr Phe Thr Met Leu Lys Pro Leu Ala Asp Lys Gln Gly Val Thr Ile 340 345 350Val Glu Glu Gly Asp Thr Pro Asp Arg Val Val His Ala Asp Ala Asp 355 360 365Gln Leu Gln Gln Ala Leu Thr Asn Val Val Val Asn Ala Ile Gln Ala 370 375 380Met Pro Ser Gly Gly Thr Ile Thr Val Gly Val Arg Thr Val Arg Ala385 390 395 400Ser Pro Pro Pro Asp Gln Gly Gly Ala Glu Gly Asp Tyr Ile Ala Leu 405 410 415Ser Val Arg Asp Glu Gly Gln Gly Met Thr Ala Asp Val Leu Glu His 420 425 430Val Phe Glu Pro Phe Phe Thr Thr Lys Pro Val Gly Glu Gly Thr Gly 435 440 445Leu Gly Leu Pro Val Ala Tyr Gly Ile Ile Lys Glu His Gly Gly Trp 450 455 460Ile Asp Val Asp Ser Arg Pro Gly Ser Gly Ser Gln Phe Thr Met Tyr465 470 475 480Leu Pro Gln Glu Lys Pro 48533461PRTSorangium cellulosum 33Met Thr Gly Arg Val Leu Ile Val Asp Asp Glu Arg Gly Val Cys Glu1 5 10 15Leu Leu Asp Ala Gly Leu Lys Lys Arg Gly Phe Gln Ala Ala Trp Arg 20 25 30Thr Ser Ala Ala Glu Ala Leu Glu Leu Leu Gly Ala Glu Asp Phe Asp 35 40 45Val Val Val Thr Asp Met Thr Met Arg Gly Met Asn Gly Leu Glu Leu 50 55 60Cys Glu Arg Ile Ala Gln Asn Arg Pro Asp Leu Pro Val Ile Val Ile65 70 75 80Thr Ala Phe Gly Ser Leu Asp Thr Ala Thr Ser Ala Ile Arg Ala Gly 85 90 95Ala Tyr Asp Phe Val Thr Lys Pro Phe Glu Leu Asp Ala Leu Arg Leu 100 105 110Thr Val Glu Arg Ala Leu Arg His Arg Ala Leu Arg Glu Glu Val Arg 115 120 125Arg Leu Arg Arg Ala Val Asp Asp Ser His Arg Tyr Glu Gln Ile Leu 130 135 140Gly Gly Ser Pro Ala Met Lys Gly Val Phe Asp Leu Leu Asp Arg Val145 150 155 160Ala Asp Ser Asp Thr Ser Ile Leu Ile Thr Gly Glu Ser Gly Thr Gly 165 170 175Lys Glu Leu Val Ala Arg Ala Val His Gln Arg Ser Arg Arg Gly Gln 180 185 190Gly Ala Phe Ile Ala Val Asn Cys Ala Ala Val Pro Asp Ala Leu Leu 195 200 205Glu Thr Glu Leu Phe Gly His Ala Arg Gly Ala Phe Thr Asp Ala Lys 210 215 220Gly Ala Arg Ser Gly Leu Phe Ala Arg Ala His Gly Gly Thr Leu Phe225 230 235 240Leu Asp Glu Ile Gly Glu Leu Pro Val Gly Leu Gln Pro Lys Leu Leu 245 250 255Arg Ala Leu Gln Glu Arg Val Val Arg Pro Val Gly Ala Asp Glu Glu 260 265 270Val Pro Val Asp Val Arg Leu Ile Ala Ala Thr Asn Arg Asp Leu Glu 275 280 285Thr Ala Ile Glu Glu Arg Arg Phe Arg Glu Asp Leu Tyr Tyr Arg Ile 290 295 300Asn Val Val His Val Asp Leu Pro Pro Leu Arg Ser Arg Gly Ala Asp305 310 315 320Val Leu Leu Leu Ala Gln Arg Phe Leu Glu His Phe Ala Thr Val Lys 325 330 335Glu Arg Pro Ile Lys Gly Leu Ser Ala Pro Ala Ala Glu Lys Leu Val 340 345 350Ala Tyr Ala Trp Pro Gly Asn Val Arg Glu Leu Gln Asn Cys Ile Glu 355 360 365Arg Ala Val Ala Leu Ala Arg Tyr Asp Gln Ile Thr Val Asp Asp Leu 370 375 380Pro Glu Lys Ile Arg Ser Tyr Arg Arg Ser His Val Leu Val Ser Ser385 390 395 400Asp Asp Pro Thr Glu Leu Val Pro Met Glu Glu Val Glu Arg Arg Tyr 405 410 415Ile Leu Arg Val Leu Glu Val Val Gly Gly Asn Lys Ser Gln Ala Ala 420 425 430Gln Val Leu Gly Phe Asp Arg Ala Thr Leu Tyr Arg Lys Leu Glu Arg 435 440 445Tyr Gly Leu Arg Ala Gly Arg Ala Gly Asp Pro Arg Pro 450 455 46034508PRTSorangium cellulosum 34Met Arg Gln Pro Thr Pro Gln Gly Leu Ser Trp Pro Arg Leu Pro Arg1 5 10 15Pro Val Arg Leu Ser Ala Leu Leu Gly Ala Ala Thr Leu Leu Leu Thr 20 25 30Ser Val Ala Ile Val Val Ala Gly Ala Leu Met Val Ala Ser Thr Thr 35 40 45Met Gln Gln Thr Thr Arg Ile Leu Gly Ala Thr Val Glu Ser Val Arg 50 55 60Leu Val Glu Arg Leu Glu Ile Asp Leu Leu Leu Asp Ala His Gln Ser65 70 75 80Ser Arg Ala Val Gly Ser Gly Arg Gly Glu Leu Ala Pro Ser Leu Ala 85 90 95Ala Trp Glu Gln Gly Leu Arg Ser Gly Leu Ala Ala Ala Arg Asp His 100 105 110Val Ser Ser Pro Glu Glu Gly Arg Ile Leu Glu His Ala Glu Arg Arg 115 120 125Val Glu Asp Tyr Leu Ala Arg Arg Arg Ala Ala Asp Ala His Glu Leu 130 135 140Pro Ser Ala Pro Gly Ala His Asp Pro Ala Leu Leu Gly Val His Asp145 150 155 160Pro Ala Leu Asp Glu Ala Phe Arg Ala Leu Asp His Leu Val Glu Ile 165 170 175Asn Leu Glu Gln Ala Arg Ala Ser Glu Ala Leu Val Ala His Leu Thr 180 185 190Arg Arg Thr Thr Gly Ala Gly Leu Ala Ala Val Val Phe Phe Leu Ala 195 200 205Gly Ala Ser Thr Ile Leu Leu Ser Ala Arg Arg Leu Ile Tyr Arg Pro 210 215 220Ile Val Ala Ile Gln Glu Ala Ile Gly Arg Tyr Gly Ala Gly Asp Arg225 230 235 240Ala Ala Arg Ala Pro Leu Ile Gly Pro Arg Glu Leu Gly Glu Ile Ala 245 250 255Arg Ala Phe Asn Asp Met Ala Glu Ser Leu Glu Arg Gln Arg Glu Ala 260 265 270Gln Phe Ala Phe Leu Gly Gly Val Ala His Asp Leu Arg Asn Pro Leu 275 280 285Ser Ala Leu Arg Leu Ser Val His Val Leu Asp Ala Asp Asn Arg Pro 290 295 300Leu Glu Ser Ser Val Arg Arg Thr Met Ala Leu Val Gly Arg Gln Val305 310 315 320Asp Arg Leu Asp Arg Met Val Gly Asp Leu Leu Asp Ala Ser Gln Ile 325 330 335Glu Ala Cys Lys Leu Asp Leu Arg Val Glu Glu Arg Asp Leu Arg Asp 340 345 350Leu Ala Gln Glu Ala Val Asp Leu Tyr Arg Pro Val Ser Pro Glu His 355 360 365Pro Ile Glu Leu Ser Leu Pro Glu Thr Pro Val Leu Val Arg Cys Asp 370 375 380Ala Thr Arg Ile Glu Gln Val Leu Asn Asn Leu Leu Ser Asn Ala Leu385 390 395 400Lys Tyr Ser Pro Ala Gly Gly Gln Val Asp Val Ala Val Arg Ala Gly 405 410 415Gly Glu Gly Ala Glu Ile Ala Val Arg Asp Arg Gly Leu Gly Ile Glu 420 425 430Pro Glu Asp Leu Ala His Leu Phe Glu Pro Phe Arg Arg Leu Lys Ser 435 440 445Thr Ser Gly Ser Ile Pro Gly Thr Gly Leu Gly Leu Ala Val Ala Lys 450 455 460Arg Ile Val Glu Ala His Gly Gly Arg Leu Phe Val Glu Ser Arg Pro465 470 475 480Gly Ala Gly Ser Val Phe Arg Ile Glu Leu Pro Arg Ser Ser Ser Arg 485 490 495Asp Gln Ala Asp Gly Pro Arg Gly Val Ser His Gly 500 50535416PRTSorangium cellulosum 35Met Pro Ala Arg Thr Pro Arg Lys Pro Pro Pro Pro Ala Ser Pro Ala1 5 10 15Gly Pro Ala Gly Ala Pro Asp Asp Leu Thr Asp Ser Asp Arg Asp Ala 20 25 30Leu Leu Arg Trp Arg Leu Ala Leu Gly Pro Glu Ala Glu Arg Val Asp 35 40 45Pro Arg Leu Ser Leu Gly Gly Leu Gly Gly Ala Ala Pro Ala Leu Asp 50 55 60Val Asp Ala Arg Arg Leu Gly Asp Leu Asp Lys Ala Leu Ser Phe Ile65 70 75 80Tyr Asp Glu Arg Ala Gly Gly Leu Gly Gly Ser Arg Pro Tyr Val Pro 85 90 95Glu Trp Leu Ser Ala Val Arg Glu Phe Phe Ser His Glu Val Val Ala 100 105 110Leu Val Gln Lys Asp Ala Ile Glu Arg Lys Gly Leu Thr Gln Leu Leu 115 120 125Phe Glu Pro Glu Thr Leu Pro Phe Leu Glu Lys Asn Val Glu Leu Val 130 135 140Ala Thr Leu Met Ser Ala Lys Gly Leu Ile Pro Asp Ala Ala Arg Asp145 150 155 160Thr Ala Arg Gln Ile Val Arg Glu Val Val Glu Glu Val Arg Arg Ala 165 170 175Leu Glu Ala Glu Val Arg Thr Ala Val Leu Gly Ala Leu Arg Arg Asn 180 185 190Thr Thr Ser Pro Leu Arg Val Leu Arg Asn Leu Asp Trp Lys Arg Thr 195 200 205Ile Arg Lys Asn Leu Lys Gly Trp Asp Ala Glu Arg Arg Arg Leu Val 210 215 220Pro Asp Lys Leu Tyr Phe Trp Ala Asn Gln Thr Arg Arg His Glu Trp225 230 235 240Asp Val Ala Ile Leu Val Asp Gln Ser Gly Ser Met Gly Glu Ser Val 245 250 255Val Tyr Ser Ser Ile Met Ala Ala Ile Phe Ala Ser Leu Asp Val Leu 260 265 270Arg Thr Arg Leu Leu Phe Phe Asp Thr Glu Val Val Asp Val Thr Pro 275 280 285Met Leu Val Asp Pro Val Asp Val Leu Phe Thr Ala Gln Leu Gly Gly 290 295 300Gly Thr Asp Ile Asn Arg Ala Val Ala Tyr Ala Gln Ala Asn Phe Ile305 310 315 320Glu Arg Pro Glu Lys Thr Leu Leu Ile Leu Ile Thr Asp Leu Phe Glu 325 330 335Gly Gly Asn Ala Glu Glu Leu Val Ala Arg Met Arg Gln Leu Ala Asp 340 345 350Ser Lys Val Lys Ser Ile Cys Leu Leu Ala Leu Ser Asp Gly Gly Lys 355 360 365Pro Ser Tyr Asp His Glu Met Ala Gln Lys Leu Ala Ala Leu Gly Thr 370 375 380Pro Cys Phe Gly Cys Thr Pro Lys Leu Leu Val Lys Val Val Glu Arg385 390 395 400Leu Met Arg Gly Gln Asp Leu Gly Pro Leu Leu Gly Ala Glu Ala Arg

405 410 41536352PRTSorangium cellulosum 36Met Ala Glu Leu Asp His Trp His Pro Val Leu Leu Ser His Glu Leu1 5 10 15Arg Arg Lys Pro Arg Asn Val Arg Leu Ala Gly His Glu Ile Val Val 20 25 30Phe Arg Thr Ser Ser Gly Gly Leu Gly Ala Phe Thr Asp Arg Cys Pro 35 40 45His Arg Ser Met Arg Leu Ser Glu Gly Trp Val Glu Gly Asp Arg Leu 50 55 60Val Cys Ala Tyr His Gly Trp Arg Trp Ala Val Asp Gly Arg Gly Glu65 70 75 80Ile Pro Ala Thr Pro Ala Ala Arg Pro Cys Ala Arg Arg Glu Asp Met 85 90 95Phe Glu Ala Val Glu Arg Tyr Gly Ala Ile Trp Val Lys Arg Ala Gly 100 105 110Ser Gln Ala Ala Phe Pro Arg Leu Glu Gly Glu Gly Tyr Val Pro Arg 115 120 125Gly Leu Leu Arg His Arg Ala Thr Val Pro Phe Glu Leu Ala Leu Asp 130 135 140Asn Phe Ile Glu Ile Glu His Thr Pro Phe Val His Phe Met Leu Gly145 150 155 160Tyr Pro Leu Glu Arg Met Pro Glu Val Glu Ala Arg Val Thr Leu Thr 165 170 175Asp Glu Thr Ile Arg Val Val His Ser Gly Pro Arg Arg Pro Met Pro 180 185 190Arg Ala Met Glu Lys Leu Leu Gly Ile Pro Glu Asp Ala Ile Phe Val 195 200 205Val Asp Trp Thr Ser Tyr Phe Ser Pro Val Tyr Thr Ile Tyr Asn His 210 215 220Ser Leu Arg Asp Pro Lys Thr Asn Gln Pro Val Thr Phe Pro Leu Arg225 230 235 240Ser Ala Val Phe Phe Asn Pro Val Gly Pro Glu Ser Ser Glu Met Tyr 245 250 255Thr Phe Leu Phe Ala Ser Leu Ala Pro Trp Ser Lys Phe Gly Ala Gly 260 265 270Ala Val Leu Trp Pro Ala Met Gln Val Ala Met Asn Ile Glu Leu Arg 275 280 285Leu Asp Met Arg Leu Leu Asp Arg Leu Thr Asp Lys Arg Gly Ile Leu 290 295 300Lys Gly Asn Val Leu Gly Arg Phe Asp Lys Pro Leu Val Ile Ala Arg305 310 315 320Asp Arg Ile Asp Arg Ile Tyr Arg Gly Arg Val Ala Glu Ala Gly Asp 325 330 335Gly His Glu Ala Ala Arg Pro Ala Arg Arg Leu Pro Leu Ala Ala Pro 340 345 35037376PRTSorangium cellulosum 37Met His Val Glu Glu Cys His Val Val Ile Val Gly Ala Gly Pro Ser1 5 10 15Gly Leu Ala Val Gly Ala Cys Leu Arg Glu Gln Gly Ile Pro Phe Val 20 25 30Leu Leu Glu Lys Ser Glu Ala Val Gly Ala Thr Trp Arg Arg His Tyr 35 40 45Asp Arg Leu His Leu Asn Thr Ile Lys Gln Leu Ser Ala Leu Pro Gly 50 55 60Gln Pro Trp Pro Glu Tyr Ser Ala Pro Tyr Pro Ser Arg Val Glu Met65 70 75 80Val Asp Tyr Leu Glu Arg Tyr Ala Glu Arg Phe Arg Leu Glu Pro Arg 85 90 95Leu Gly Val Glu Val Glu Arg Ala Tyr His Asp Gly Ser Arg Trp Val 100 105 110Thr Arg Thr His Ala Gly Glu Leu Arg Ser Gln Ala Leu Val Val Ala 115 120 125Thr Gly Tyr Ser Arg His Pro Asn Val Pro Thr Trp Pro Asp Gln Glu 130 135 140Arg Phe Arg Gly Arg Ile Leu His Ser Ser Ala Tyr Arg Ser Gly Ala145 150 155 160Glu Phe Arg Gly Gln Arg Val Leu Val Val Gly Ala Gly Asn Ser Ala 165 170 175Ser Glu Ile Ala Leu Asp Leu Trp Glu His Cys Ala Glu Thr Thr Leu 180 185 190Ser Val Arg Ser Gly Asn His Val Ile Pro Arg Glu Leu Phe Lys Leu 195 200 205Pro Ala Gln Phe Asn Ala Leu Ala Leu Phe Glu Arg Leu Pro Leu Ala 210 215 220Val Gly Asp Arg Leu Ala Thr Ala Ile Leu Ser Arg Ala Val Gly Asp225 230 235 240Leu Ser Arg Trp Gly Ile Arg Arg Pro Ala Val Gly Pro Gly Thr Arg 245 250 255Ala Leu Lys Glu Gly Arg Met Pro Leu Ile Asp Ile Gly Thr Val Ala 260 265 270Leu Ile Gln Gln Gly Lys Ile Lys Val Val Pro Gly Pro Arg Ala Phe 275 280 285Thr Glu Thr Gly Val Thr Phe Thr Asp Gly Arg Gly Leu Pro Phe Asp 290 295 300Val Val Val Leu Ala Thr Gly Tyr Arg Pro Gly Leu Asp Asp Phe Leu305 310 315 320Glu Asn Ala Thr Arg Tyr Thr Asp Glu His Gly Cys Pro Arg Trp His 325 330 335Gly Ala Pro Thr Pro Ala Pro Gly Leu Phe Phe Ile Gly Phe Arg Asn 340 345 350Pro Ile Thr Gly Gln Ile Arg Asp Ile Ala Ala Glu Ala Pro Arg Ile 355 360 365Ala Arg His Ile Gln Gly Val Asn 370 37538356PRTSorangium cellulosum 38Met Cys Tyr Gly Leu Ala Met His Ala Ala Pro Ala Arg Asp Leu Ile1 5 10 15Arg His Phe His Pro Val Leu Pro Ala Ser Lys Leu Gly Arg Lys Pro 20 25 30Val Arg Val Val Leu Ala Gly Asn Ala Tyr Ala Leu Phe Arg Asp Glu 35 40 45Gln Gly Arg Pro Ala Ala Leu Ala Asp Ala Cys Pro His Arg Leu Ala 50 55 60Pro Leu Ser Gln Gly Arg Val Arg Pro Asp Gly Arg Leu Glu Cys Pro65 70 75 80Tyr His Gly Trp His Phe Asp Ala Glu Gly Arg Gly Ala Cys Pro Ser 85 90 95Gln Pro Ser Leu Thr Arg Cys Asp Thr Arg Ser Phe Gln Leu Val Glu 100 105 110Gln Leu Gly Tyr Leu Trp Leu Ala His Arg Asp Thr Pro Arg Ser Ala 115 120 125Leu Pro Glu Leu Asp Phe Ser Ser Asp Gly Phe Glu Tyr Ala Gly Thr 130 135 140Phe Ser His Leu Ala Pro Ala Pro Leu His Val Ile Phe Asp Asn Ser145 150 155 160Ser Glu Asp Glu His Thr Pro Phe Val His Gly Arg Leu Gly Trp Thr 165 170 175Pro Glu Asp Ala Ala Arg Ile Asp Phe Ser Cys Asp Val Phe Glu Asp 180 185 190Arg Thr Glu Val Lys Tyr Ser Ala Pro Gln Arg Pro Ser Thr Leu Ala 195 200 205Arg Leu Met Leu Leu Gln Pro Gly Asp Thr Phe His Asn Gln Trp Val 210 215 220Thr Arg Phe Ser Pro Val Tyr Thr Val Tyr Thr Ser Trp Trp Thr Ala225 230 235 240Gln Asn Gly Met Glu Arg Pro Val Val Ala Arg Ala Gly Ile Phe Phe 245 250 255Val Pro Glu Thr Glu Arg Thr Thr Phe Val Arg Ala Phe Leu Phe Val 260 265 270Lys Ile Thr Asp Pro Arg Phe Arg Pro Leu Leu Pro Val Val Lys Ser 275 280 285Ala Ala Ile Ala Leu Ser Trp Lys Glu Ile Arg Asp Asp Val Lys Phe 290 295 300Ile Pro His Val Ala Asp Thr Pro Phe Glu Met Lys Gly Met Arg Leu305 310 315 320Asn Lys Tyr Asp Ala Thr Leu Val His Asn His Arg Leu Met Arg Ser 325 330 335Ile Tyr Phe Gly Glu Thr Arg Gly Glu Ala Glu Gly Thr Gly Val Gly 340 345 350His Ala Ser Ala 35539395PRTSorangium cellulosum 39Met Thr Cys Phe Val Pro Ala Leu Arg Arg Met Gly Ala Thr Pro Ala1 5 10 15Arg Thr Cys Met Arg Gln Arg Leu Asp Val Thr Asp Leu Tyr Asn Asp 20 25 30Ala Tyr Thr Ala Tyr Ile Glu Ala Phe Arg Arg Gln Thr Glu Leu Val 35 40 45Ala Ser Glu Ile Leu Leu Glu His Leu Val Asp Pro Ser Gly Ala Val 50 55 60Arg Gly Leu Asp Asp Arg Pro Glu Ser Ala Pro Ser Val Thr Ala Tyr65 70 75 80Gln Phe Arg Arg Lys Leu Leu Asp Tyr Phe Ser Asp Lys Gly Asp Leu 85 90 95Thr Gln Asp Pro Ser Gly Arg Leu Val Pro Ser Glu Ala Val Arg Lys 100 105 110Arg Val Ala Glu Lys Glu Ser Ile Ala Leu Ala Asp Arg Ala Ile Leu 115 120 125Gly Glu Met Val Glu Phe Leu Gln Arg Tyr Arg Gly Leu Ala Gly Pro 130 135 140Val Leu Ala Gly Lys Asp Ala Leu Ala Thr Met Asp Leu Gln Tyr Gly145 150 155 160Met Gln Ala Ser Leu Lys Phe Trp Glu Tyr Ser Met Ile Ser Leu Pro 165 170 175Ala Lys Lys Pro Cys Asn Val Met Leu Ala Arg Ala Leu Met Ala Lys 180 185 190Leu Ala Glu Gly Pro Gly Ile Ser Val Phe Glu Gly Gly Ala Gly Leu 195 200 205Gly Val Val Leu Arg Gln Ala Leu Ser Asp Pro Arg Phe Leu Pro Leu 210 215 220Ser Lys Asn Leu Ala Arg Tyr Asp Tyr Thr Asp Ile Ser Ala Leu Leu225 230 235 240Met Glu Thr Gly Lys Gln Trp Leu Arg Thr His Ala Pro Ala Asp Val 245 250 255Phe Gln Arg Ile His Phe Gln Arg Leu Asp Leu Asp Thr Leu Pro Ser 260 265 270Ala Gly Ser Thr Phe Ala Arg Ala Ala Ser Val Asp Leu Ile Val Leu 275 280 285Glu His Val Leu Tyr Asp Val Arg Asp Leu His Ala Thr Leu Gln Ala 290 295 300Phe His Thr Met Leu Lys Pro Gly Gly Gln Leu Ala Phe Thr Met Ser305 310 315 320Phe Arg Asp Arg Pro Gly Val Phe Phe Pro Asn Glu Phe Phe Gln Ser 325 330 335Met Leu His Thr Tyr Ser Lys Ala Lys Leu Asp Pro Pro Arg Arg Gln 340 345 350His Val Gly Tyr Leu Thr Leu Gln Glu Trp Glu Leu Ser Leu Arg Ala 355 360 365Ala Gly Phe Ser Glu Trp Glu Val Tyr Pro Ala Pro Glu Asp His Ala 370 375 380Lys Trp Pro Phe Gly Gly Ile Val Ala Tyr Arg385 390 3954084PRTSorangium cellulosum 40Met Thr Phe Gly Tyr Ala Thr Val Ala Leu Phe Phe Leu Arg Phe Trp1 5 10 15Lys Lys Thr Gly Asp Arg Leu Phe Ala Lys Phe Ser Ala Ala Phe Trp 20 25 30Leu Met Met Leu Gly Arg Ile Ala Val Ala Leu Asn Arg Val Glu Glu 35 40 45Asp Ala Ile His Tyr Leu Tyr Leu Phe Arg Leu Phe Ala Tyr Met Leu 50 55 60Ile Leu Tyr Ala Ile Val His Lys Asn Arg Gly Asn Asp Gly Gln Ala65 70 75 80Leu Ser Ser Arg4186PRTSorangium cellulosum 41Met Ala Ala Ala Val Tyr Ile Leu Cys Ala Leu Thr Ser Ile Ala Cys1 5 10 15Ala Val Leu Leu Leu Arg Gly Tyr Ala Gln Arg Lys Val Arg Leu Leu 20 25 30Leu Trp Ser Gly Leu Cys Phe Ala Ala Leu Ala Ala Asn Asn Ile Leu 35 40 45Leu Phe Val Asp Leu Val Val Ile Arg Ser Val Asp Leu Ser Ser Leu 50 55 60Arg His Leu Thr Ala Leu Ile Gly Leu Ala Leu Leu Leu Tyr Gly Leu65 70 75 80Ile Trp Asp Leu Arg Glu 8542125PRTSorangium cellulosum 42Met Gln Arg Phe Leu Gly Ala His Ile Ser Ser Ile Glu Gln Leu Glu1 5 10 15Val Leu Leu Leu Met Arg Arg Thr Ala Glu Arg Glu Trp Ser Ala Ala 20 25 30Ala Met Ala Arg Glu Ile Gly Ser Ser Met Met Ser Ile Gln Asp Arg 35 40 45Phe Gly Gly Leu Ala Ser Arg Gly Leu Ile Val Ala Arg Glu Asp Gly 50 55 60Glu Asp Ile Phe Tyr Arg Tyr Ala Pro Ala Asp Asp Glu Thr Arg Arg65 70 75 80Thr Ile Asp Asp Leu Ala Gln Ala Tyr Lys Glu Arg Arg Leu Ser Val 85 90 95Ile Asn His Ile Tyr Ala Thr Pro Pro Pro Ser Asp Ile Gln Ser Phe 100 105 110Ser Asp Ala Phe Leu Ile Thr Lys Lys Gly Lys Gly Gly 115 120 125431762PRTSorangium cellulosum 43Ile Leu Glu Leu Lys Asn Thr Phe Asn Thr Met Val Asp Gln Leu Arg1 5 10 15Ser Phe Ala Ala Gln Val Thr Arg Val Ala Arg Glu Val Gly Thr Glu 20 25 30Gly Lys Leu Gly Gly Gln Ala Glu Val Thr Gly Val Ala Gly Thr Trp 35 40 45Lys Asp Leu Thr Asp Ser Val Asn Ser Met Ala Ser Asn Leu Thr Ala 50 55 60Gln Val Arg Asn Ile Ala Asp Val Thr Thr Ala Val Ala Asn Gly Asp65 70 75 80Leu Ser Lys Lys Ile Thr Val Asp Val Arg Gly Glu Ile Leu Glu Leu 85 90 95Lys Asp Thr Phe Asn Thr Met Val Asp Gln Leu Arg Ser Phe Ala Ser 100 105 110Glu Val Thr Arg Val Ala Arg Glu Val Gly Thr Glu Gly Lys Leu Gly 115 120 125Gly Gln Ala Ser Val Pro Gly Val Ala Gly Thr Trp Lys Asp Leu Thr 130 135 140Asp Ser Val Asn Ser Met Ala Ser Asn Leu Thr Ala Gln Val Arg Asn145 150 155 160Ile Ala Asp Val Thr Thr Ala Val Ala Arg Gly Asp Leu Ser Lys Lys 165 170 175Ile Thr Val Asp Val Lys Gly Glu Ile Leu Glu Leu Lys Asn Thr Phe 180 185 190Asn Thr Met Val Asp Gln Leu Ser Ser Phe Ala Ala Glu Val Thr Arg 195 200 205Val Ala Arg Glu Val Gly Thr Glu Gly Lys Leu Gly Gly Gln Ala Glu 210 215 220Val Lys Gly Val Ala Gly Thr Trp Lys Asp Leu Thr Asp Ser Val Asn225 230 235 240Ser Met Ala Ser Asn Leu Thr Ala Gln Val Arg Asn Ile Ala Ala Val 245 250 255Thr Thr Ala Val Ala Asn Gly Asp Leu Ser Lys Lys Ile Leu Glu Leu 260 265 270Lys Asp Thr Phe Asn Thr Met Val Asp Gln Leu Arg Ser Phe Ala Ser 275 280 285Glu Val Thr Arg Val Ala Arg Glu Val Gly Thr Glu Gly Lys Leu Gly 290 295 300Gly Gln Ala Ser Val Pro Gly Val Ala Gly Thr Trp Lys Asp Leu Thr305 310 315 320Asp Ser Val Asn Ser Met Ala Ser Asn Leu Thr Ala Gln Val Arg Asn 325 330 335Ile Ala Ala Val Thr Thr Ala Val Ala Asn Gly Asp Leu Ser Lys Lys 340 345 350Ile Thr Val Asp Val Arg Gly Glu Ile Leu Glu Leu Lys Asp Thr Phe 355 360 365Asn Thr Met Val Asp Gln Leu Arg Ser Phe Ala Ser Glu Val Thr Arg 370 375 380Val Ala Arg Glu Val Gly Thr Glu Gly Lys Leu Gly Gly Gln Ala Ser385 390 395 400Val Pro Gly Val Ala Gly Thr Trp Lys Asp Leu Thr Asp Ser Val Asn 405 410 415Ser Met Ala Ser Asn Leu Thr Ala Gln Val Arg Asn Ile Ala Asp Val 420 425 430Thr Thr Ala Val Ala Arg Gly Asp Leu Ser Lys Lys Ile Thr Val Asp 435 440 445Val Lys Gly Glu Ile Leu Glu Leu Lys Asn Thr Phe Asn Thr Met Val 450 455 460Asp Gln Leu Ser Ser Phe Ala Ala Glu Val Thr Arg Val Ala Arg Glu465 470 475 480Val Gly Thr Glu Gly Lys Leu Gly Gly Gln Ala Glu Val Lys Gly Val 485 490 495Ala Gly Thr Trp Lys Asp Leu Thr Asp Ser Val Asn Ser Met Ala Ser 500 505 510Asn Leu Thr Ala Gln Val Arg Asn Ile Ala Ala Val Thr Thr Ala Val 515 520 525Ala Asn Gly Asp Leu Ser Lys Lys Ile Thr Val Asp Val Arg Gly Glu 530 535 540Ile Leu Glu Leu Lys Asp Thr Phe Asn Thr Met Val Asp Gln Leu Arg545 550 555 560Ser Phe Ala Ser Glu Val Thr Arg Val Ala Arg Glu Val Gly Thr Glu 565 570 575Gly Lys Leu Gly Gly Gln Ala Ser Val Pro Gly Val Ala Gly Thr Trp 580 585 590Lys Asp Leu Thr Asp Ser Val Asn Ser Met Ala Ser Asn Leu Thr Ala 595 600 605Gln Val Arg Asn Ile Ala Ala Val Thr Thr Ala Val Ala Asn Gly Asp 610 615 620Leu Ser Lys Lys Ile Thr Val Asp Val Arg Gly Glu Ile Leu Glu Leu625 630 635 640Lys Asn Thr Ile Asn Tyr Thr Met Val Asp Gln Leu Asn Ala Phe Ala 645 650 655Ser Glu Val Thr Arg Val Ala Arg Glu Val Gly Thr

Glu Gly Lys Leu 660 665 670Gly Gly Gln Ala Ser Val Pro Gly Val Ala Gly Thr Trp Lys Asp Leu 675 680 685Thr Asp Asn Val Asn Phe Met Ala Gly Asn Leu Thr Asn Gln Val Arg 690 695 700Gly Ile Ala Lys Val Val Thr Ala Val Ala Asn Gly Asp Leu Lys Arg705 710 715 720Lys Leu Ala Phe Asp Ala Lys Gly Glu Ile Ala Ala Leu Ala Asp Thr 725 730 735Ile Asn Gly Val Ile Glu Thr Leu Ala Thr Phe Ala Asp Gln Val Thr 740 745 750Thr Val Ala Arg Glu Val Gly Val Glu Gly Lys Leu Gly Gly Gln Ala 755 760 765Ser Val Pro Gly Ala Ala Gly Thr Trp Lys Asp Leu Thr Asp Asn Val 770 775 780Asn Arg Leu Ala Ala Asn Leu Thr Thr Gln Val Arg Ala Ile Ala Glu785 790 795 800Val Ala Thr Ala Val Thr Lys Gly Asp Leu Thr Arg Ser Ile Lys Val 805 810 815Glu Ala Gln Gly Glu Val Ala Ala Leu Lys Asp Thr Ile Asn Glu Met 820 825 830Ile Arg Asn Leu Lys Asp Thr Thr Leu Lys Asn Ser Glu Gln Asp Trp 835 840 845Leu Lys Thr Asn Leu Ala Lys Phe Ser Arg Met Leu Gln Gly Gln Lys 850 855 860Asp Leu Leu Thr Val Gly Arg Leu Ile Leu Ser Glu Leu Ala Pro Val865 870 875 880Val Gly Ala Gln Gln Gly Val Phe Phe Thr Met Asp Val Ala Lys Glu 885 890 895Glu Pro Ile Leu Lys Leu Leu Ala Ser Tyr Ala Tyr Lys Val Arg Lys 900 905 910His Val Asp Asn His Phe Lys Leu Gly Glu Gly Leu Val Gly Gln Cys 915 920 925Ala Leu Glu Lys Glu Lys Ile Leu Leu Val Asn Ala Pro Pro Asp Tyr 930 935 940Ile Arg Ile Thr Ser Gly Leu Gly Glu Ala Pro Pro Val Asn Ile Ile945 950 955 960Val Ile Pro Val Leu Phe Glu Gly Gln Val Lys Ala Val Ile Glu Leu 965 970 975Ala Ser Phe Glu Arg Phe Ser Pro Thr His Gln Ala Phe Leu Asp Gln 980 985 990Leu Thr Glu Ser Ile Gly Ile Val Leu Asn Thr Ile Glu Ala Asn Met 995 1000 1005Arg Thr Glu Asp Leu Leu Lys Gln Ser Gln Ser Leu Ala Arg Glu 1010 1015 1020Leu Gln Ser Gln Gln Glu Glu Leu Gln Gln Thr Asn Ala Glu Leu 1025 1030 1035Gly Glu Lys Ala Arg Leu Leu Ala Gln Gln Asn Val Glu Val Glu 1040 1045 1050Arg Lys Asn Arg Glu Val Glu Gln Ala Arg Gln Ala Leu Glu Glu 1055 1060 1065Lys Ala Arg Gln Leu Ala Ile Thr Ser Lys Tyr Lys Ser Glu Phe 1070 1075 1080Leu Ala Asn Met Ser His Glu Leu Arg Thr Pro Leu Asn Ser Leu 1085 1090 1095Leu Ile Leu Ser Asp Gln Leu Ser Lys Asn Thr Asp Arg Asn Leu 1100 1105 1110Thr Gly Arg Gln Val Glu Phe Ala Lys Thr Ile His Ser Ser Gly 1115 1120 1125Asn Asp Leu Leu Ala Leu Ile Asn Asp Ile Leu Asp Leu Ser Lys 1130 1135 1140Ile Glu Ser Gly Thr Val Ile Val Asp Val Gly Glu Leu Ser Phe 1145 1150 1155Ser Asp Leu Gln Asp Tyr Val Glu Arg Thr Phe Gln His Val Ala 1160 1165 1170Glu Ser Lys Arg Leu Glu Phe Glu Leu Asn Phe Ala Gln Asn Leu 1175 1180 1185Pro Gln Val Ile Tyr Thr Asp Ala Lys Arg Val Gln Gln Val Leu 1190 1195 1200Lys Asn Leu Leu Ser Asn Ser Phe Lys Phe Thr Glu Arg Gly Ser 1205 1210 1215Val Ala Leu Asp Val Asp Leu Val Thr Ser Gly Trp Thr Ile Glu 1220 1225 1230Asn Glu Gly Leu Ser Arg Ala Gly Ala Ala Ile Ala Met Ser Val 1235 1240 1245Arg Asp Thr Gly Ile Gly Ile Pro His Asp Lys Gln Gln Ile Ile 1250 1255 1260Phe Glu Ala Phe Gln Gln Ala Asp Gly Ser Thr Ser Arg Lys Tyr 1265 1270 1275Gly Gly Thr Gly Leu Gly Leu Ala Ile Ser Arg Glu Ile Ala Trp 1280 1285 1290Met Leu Gly Gly Glu Ile Lys Leu Ser Ser Arg Pro Gly Ser Gly 1295 1300 1305Ser Thr Phe Thr Leu Tyr Leu Pro Leu Thr Tyr Thr Pro Ala Arg 1310 1315 1320Pro Arg Arg Lys Glu Gln Ala Ala Glu Val Pro Ser Ala Pro Pro 1325 1330 1335Ala Leu Val Ser Gly Asp Val Ala Pro Arg Ser Ala Ala Glu Pro 1340 1345 1350Pro Pro His Leu Leu Asn Gln Ser Val Asp Asp Ser Ala Ser Leu 1355 1360 1365Gln Pro Ser Asp Ser Val Val Leu Ile Val Glu Asn Asp Ala Ser 1370 1375 1380Phe Ala His Phe Val Met Asp Val Ala His Asp His Gly Phe Lys 1385 1390 1395Ala Ile Leu Ala Tyr Arg Gly Gly Ala Ala Leu Ser Ile Val Arg 1400 1405 1410Glu Arg Arg Val Asn Ala Ile Thr Leu Asp Ile Asn Leu Pro Asp 1415 1420 1425Met Asp Gly Trp Arg Val Leu Asp Arg Val Lys Arg Asp Leu Ala 1430 1435 1440Thr Arg His Ile Pro Val Gln Val Ile Thr Thr Asp Glu Glu Arg 1445 1450 1455Glu Arg Ala Leu Arg Met Gly Ala Thr Gly Val Leu Cys Lys Pro 1460 1465 1470Leu Lys Thr Arg Asp Ala Leu Asp Glu Thr Phe Arg Arg Leu Ser 1475 1480 1485Gln Phe Met Val Ser Arg Arg Arg Thr Val Val Leu Ala Glu Pro 1490 1495 1500Asp Glu Ala Glu Arg Gln Glu Leu Val Glu Leu Leu Gly Gly Asp 1505 1510 1515Asp Val Thr Ile Arg Ser Val Ala Ser Gly Glu Glu Ala Leu Asp 1520 1525 1530Ala Leu Leu Thr Glu Gly Ala Asp Val Leu Ile Leu His Leu Asp 1535 1540 1545Leu Pro Asp Met Arg Cys Phe Asp Leu Ile Gly Gln Leu Ala Gln 1550 1555 1560Gly Ser Gly Pro Thr Glu Leu Pro Val Leu Val Tyr Ala Pro Glu 1565 1570 1575Glu Ile Ser Ala Ala Asp Glu Ala Gln Leu Ser Arg Phe Ser Gln 1580 1585 1590Leu Met Val Leu Lys His Val Arg Ser Lys Glu Arg Leu Phe Asp 1595 1600 1605Asp Val Ser Leu Phe Leu His Arg Pro Val Ala Ala Leu Ser Glu 1610 1615 1620Arg Gln Arg Gln Thr Leu Gln Glu Leu His Gln Ser Asn Lys Val 1625 1630 1635Leu Ala Gly Lys Lys Val Leu Val Val Asp Asp Asp Val Arg Asn 1640 1645 1650Ile Phe Ala Met Thr Thr Ile Leu Asp Ala Gln Gln Met Lys Thr 1655 1660 1665Val Tyr Val Glu Thr Gly Arg Ala Ala Ile Glu Met Leu Gln Arg 1670 1675 1680Thr Pro Asp Ile Glu Ile Val Leu Met Asp Ile Met Met Pro Glu 1685 1690 1695Met Asp Gly Tyr Asp Thr Ile Arg Ala Ile Arg Ala Lys Pro Glu 1700 1705 1710His His Ala Leu Pro Ile Ile Ala Val Thr Ala Lys Ala Met Lys 1715 1720 1725Gly Asp Arg Glu Lys Cys Phe Glu Ala Gly Ala Asn Asp Tyr Ile 1730 1735 1740Ser Lys Pro Val Asp Pro Glu His Leu Leu Ala Met Leu Arg Leu 1745 1750 1755Trp Leu His Arg 1760


Patent applications by Christopher Reeves, Orinda, CA US

Patent applications by Ralph C. Reid, San Rafael, CA US

Patent applications in class PEPTIDES OF 3 TO 100 AMINO ACID RESIDUES

Patent applications in all subclasses PEPTIDES OF 3 TO 100 AMINO ACID RESIDUES


User Contributions:

Comment about this patent or add new information about this topic:

CAPTCHA