Patent application title: Genetically Modified Organisms for Increased Microbial Production of 3-Hydroxypropionic Acid Involving an Oxaloacetate alpha-decarboxylase
Inventors:
Michael D. Lynch (Durham, NC, US)
Michael D. Lynch (Durham, NC, US)
Christopher P. Mercogliano (Minneapolis, MN, US)
IPC8 Class: AC12P742FI
USPC Class:
435135
Class name: Micro-organism, tissue cell culture or enzyme using process to synthesize a desired chemical compound or composition preparing oxygen-containing organic compound carboxylic acid ester
Publication date: 2015-03-12
Patent application number: 20150072384
Abstract:
Microorganism compositions are described that comprise combinations of
genetic modifications that include a genetic modification to increase
oxaloacetate alpha-decarboxylase enzymatic activity. By such genetic
modification a 3-hydroxypropionic acid ("3-HP") production pathway is
provided or improved. In various embodiments, comprising other genetic
modifications, including selected gene disruptions, 3-HP production is
greater than in a control microorganism lacking such combinations of
genetic modificationsClaims:
1-43. (canceled)
44. A method for producing an acrylic acid-based consumer product comprising i) combining a carbon source and a microorganism to produce 3-hydroxypropionic acid, wherein said microorganism comprises a) a heterologous nucleic acid sequence encoding an enzyme selected from the group consisting of: a native or mutated form of a mmsB protein, and a native or mutated form of a ydfG protein; b) a genetic modification to decrease or eliminate an enzymatic activity selected from the group consisting of: lactate dehydrogenase, phosphate acetyltransferase, and pyruvate-formate lyase; c) a heterologous nucleic acid sequence encoding an acetyl-CoA carboxylase; and d) a heterologous nucleic acid sequence encoding a cyanase, ii) converting said 3-hydroxypropionic acid to acrylic acid; and iii) processing said acrylic acid into a consumer product.
45. The method of claim 44, wherein said carbon source has a ratio of carbon-14 to carbon-12 of about 1.0.times.10.sup.-14 or greater.
46. The method of claim 44, wherein following (i) said method further comprises separating or purifying 3-hydroxypropionic acid from said cell culture by extraction of 3-hydroxypropionic acid from said culture in the presence of a tertiary amine.
47. The method of claim 44, wherein said 3-hydroxypropionic acid is produced at a higher level than in a non-genetically modified microorganism.
48. The method of claim 44, wherein said consumer product is selected from the group consisting of diapers, carpet, paint, adhesives, and acrylic glass.
49. The method of claim 48, wherein said consumer product is diapers.
50. The method of claim 44, wherein said carbon source is predominantly glucose, sucrose, fructose, dextrose, lactose, or a combination thereof.
51. The method of claim 44, wherein said carbon source is less than 50% glycerol.
52. The method of claim 44, wherein said lactate dehydrogenase is encoded by a nucleic acid sequence having at least 90% sequence identity to SEQ ID NO:19 or has an amino acid sequence having at least 90% sequence identity to SEQ ID NO: 20.
53. The method of claim 44 wherein said pyruvate-formate lyase is encoded by a nucleic acid sequence having at least 90% sequence identity to SEQ ID NOs: 21 or has an amino acid sequence having at least 90% sequence identity to SEQ ID NO: 22.
54. The method of claim 44, wherein said genetically modified microorganism further comprises a heterologous nucleic acid sequence encoding an acetyl-CoA carboxylase.
55. A method for producing an acrylic acid-based consumer product comprising i) combining a carbon source and a microorganism to produce 3-hydroxypropionic acid, wherein said microorganism comprises a) a heterologous nucleic acid sequence encoding a 3-hydroxypropionate dehydrogenase; b) a heterologous nucleic acid sequence encoding a cyanase or a carbonic anhydrase; and c) a heterologous nucleic acid sequence encoding an NADPH-dependent transhydrogenase or an acetyl-CoA carboxylase; ii) converting said 3-hydroxypropionic acid to acrylic acid; and iii) processing said acrylic acid into a consumer product.
56. The method of claim 55, wherein said genetically modified microorganism further comprises at least one heterologous nucleic acid sequence encoding an oxaloacetate alpha-decarboxylase.
57. The method of claim 56, wherein said oxaloacetate alpha-decarboxylase is encoded by a sequence selected from the group consisting of: SEQ ID NOs: 54, 55, and 56.
58. The method of claim 56, wherein said genetically modified microorganism further comprises at least one heterologous nucleic acid sequence encoding a phosphoenolpyruvate carboxykinase or a phosphoenol pyruvate carboxylase.
59. The method of claims 56, wherein said genetically modified microorganism further comprises at least one genetic modification to reduce enzymatic activity of a protein selected from the group consisting of: lactate dehydrogenase, pyruvate formate lyase, phosphate acetyltransferase, heat stable, histidyl phosphorylatable protein, phosphoryl transfer protein, polypeptide chain, pyruvate kinase I, and pyruvate kinase II.
60. The method of claim 55, wherein said genetically modified microorganism further comprises a modification of a gene to increase the NADPH/NADP.sup.+ ratio, wherein the modification is selected from the group consisting of: increasing activity of pgi, increasing activity of pntAB, gapA:gapN substitution or replacement, and disrupting sthA.
61. The method of claim 55, wherein the 3-hydroxypropionate dehydrogenase is a native or mutated form of a mmsB protein or a native or mutated form of a ydfG protein.
62. The method of claim 55, wherein the said genetically modified microorganism further comprises a heterologous nucleic acid sequence encoding an NADPH-dependent transhydrogenase.
63. The method of claim 55, wherein said genetically modified microorganism further comprises a heterologous nucleic acid sequence encoding a cyanase or a carbonic anhydrase that increases intracellular bicarbonate levels.
64. The method of claim 55, wherein said genetically modified microorganism further comprises a heterologous nucleic acid sequence encoding an acetyl-CoA carboxylase.
65. The method of claim 55, wherein said genetically modified microorganism is selected from the group consisting of: Zymomonas, Escherichia, Pseudomonas, Alcaligenes, Salmonella, Shigella, Burkholderia, Oligotropha, and Klebsiella.
66. The method of claim 55, wherein said genetically modified microorganism is selected from the group consisting of: Escherichia coli, Cupriavidus necator, Oligotropha carboxidovorans, and Pseudomonas putida.
67. The method of claim 55, wherein said genetically modified microorganism is an Escherichia coli.
68. The method of claim 55, wherein said genetically modified microorganism is a gram-positive bacterium.
69. The method of claim 55, wherein said genetically modified microorganism is selected from the group consisting of: Clostridium, Rhodococcus, Bacillus, Lactobacillus, Enterococcus, Paenibacillus, Arthrobacter, Corynebacterium, and Brevibacterium.
70. The method of claim 55, wherein said genetically modified microorganism is selected from the group consisting of: Bacillus licheniformis, Paenibacillus macerans, Rhodococcus erythropolis, Lactobacillus plantarum, Enterococcus faecium, Enterococcus gallinarium, Enterococcus faecalis, and Bacillus subtilis.
71. The method of claim 55, wherein said genetically modified microorganism is a fungus.
72. The method of claim 55, wherein said genetically modified microorganism is a yeast.
73. The method of claim 55, wherein said yeast is selected from the group consisting of: Pichia, Candida, Hansenula, and Saccharomyces.
74. A method for producing an acrylic acid-based consumer product comprising i) combining a carbon source and a microorganism to produce 3-hydroxypropionic acid, wherein said microorganism comprises a) a heterologous nucleic acid sequence encoding a 3-hydroxypropionate dehydrogenase; and b) a heterologous nucleic acid sequence encoding a cyanase or a carbonic anhydrase, ii) converting said 3-hydroxypropionic acid to acrylic acid; and iii) processing said acrylic acid into a consumer product.
75. The method of claim 72, wherein said genetically modified microorganism further comprises a heterologous nucleic acid sequence encoding an acetyl-CoA carboxylase.
Description:
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001] This application claims priority from U.S. Provisional Application 61/246,140, filed Sep. 27, 2009. The entire contents of this application are hereby incorporated by reference in its entirety.
SEQUENCE LISTING
[0002] The instant application contains a Sequence Listing which has been submitted in ASCII format via EFS-Web and is hereby incorporated by reference in its entirety. Said ASCII copy, created on Feb. 10, 2011, is named OPXX2005.txt and is 234,626 bytes in size.
FIELD OF THE INVENTION
[0003] This invention relates to metabolically engineered microorganisms, such as bacterial strains, in which an identified functional variant of oxaloacetate alpha-decarboxylase is provided for production of a chemical product, 3-hydroxypropionic acid (3-HP) and products made from 3-HP. The metabolically engineered microorganisms may be adapted to exhibit increased tolerance to 3-HP. Production of products made from 3-HP so produced also is disclosed.
SEQUENCE LISTING
[0004] This provisional patent application provides a paper copy of sequence listings that are to be provided on compact disk in appropriate format in a later filing or submission.
BACKGROUND OF THE INVENTION
[0005] There are various approaches to increasing a genetically modified microorganism's productivity of 3-hydroxypropionic acid ("3-HP"). These approaches may be applied to a microorganism intended to be used in a production strain having the purpose of 3-HP production in an industrial microbial production system, whether or not 3-HP is the intended final product.
[0006] For example, without being limiting, various 3-HP production pathways are described in U.S. Pat. No. 6,852,517, WO2002/042418 (PCT/US01/43607), and U.S. Patent Publication No. US2008/0199926.
[0007] Nonetheless, notwithstanding these and other various microbial 3-HP production pathways, there remains a need in the art for alternative 3-HP production pathways.
SUMMARY OF THE INVENTION
[0008] According to one embodiment, the invention is directed to a method for producing an acrylic acid-based consumer product, said method comprising i) combining a carbon source and a microorganism cell culture to produce 3-hydroxypropionic acid, wherein a) wherein said microorganism is genetically modified for increased enzymatic activity in the organism by introduction of a heterologous nucleic acid sequence coding for a polypeptide having oxaloacetate alpha-decarboxylase enzymatic activity, the heterologous nucleic acid sequence expressing a mutated oxaloacetate alpha-decarboxylaseoxaloacetate alpha-decarboxylase.
[0009] The carbon source according to the invention may be predominantly glucose, sucrose, fructose, dextrose, lactose, or a combination thereof. Alternatively, the carbon source is glycerol.
[0010] Included within the invention are embodiments where the cell culture comprises a genetically modified microorganism. The genetically modified microorganism may be modified for increased activity and specificity to convert oxaloacetate to malonate semialdehyde, increased tolerance to 3-hydroxypropionic acid, increased enzymatic activity in the organism's NADPH-dependent transhydrogenase pathway, increased intracellular bicarbonate levels, and combinations thereof.
[0011] In various embodiments, the genetically modified microorganism is modified for increased tolerance to 3-hydroxypropionic acid. The increase in tolerance to 3-hydroxypropionic acid may occur in one or more components of the 3-HP toleragenic complex (3HPTGC) complex.
[0012] The genetically modified bacteria may be further modified to decrease activity of, lactate dehydrogenase, phosphate acetyltransferase, pyruvate oxidase, or pyruvate-formate lyase, alcohol dehydrogenase, and combinations thereof.
[0013] The method according to the invention may further comprise separating and/or purifying 3-hydroxypropionic acid from said cell culture by extraction of 3-hydroxypropionic acid from said culture in the presence of a tertiary amine.
[0014] The method of the invention may include production of a consumer product, such as diapers, carpet, paint, adhesives, and acrylic glass. The invention includes biologically-produced 3-hydroxypropionic acid, where the 3-hydroxypropionic acid is produced according to the method of the invention. Such 3-hydroxypropionic acid may be essentially free of chemical catalyst, including a molybdenum and/or vanadium based catalyst. The 3-hydroxypropionic acid is produced according to the method of the invention may have a ratio of carbon-14 to carbon-12 of about 1.0×10-14 or greater. In various aspects, the 3-hydroxypropionic acid contains less than about 10% carbon derived from petroleum. In addition, 3-hydroxypropionic acid according to the invention may contain a residual amount of organic material related to its method of production. In various embodiments, the 3-hydroxypropionic acid contains a residual amount of organic material in an amount between 1 and 1,000 parts per million of the 3-hydroxypropionic acid.
[0015] Acrylic acid and a polymer produced from acrylic acid, where such are produced according to the method of the invention, are also included within the invention. Products, including commercial and consumer products, obtained from the polymers are also encompassed. For example, diapers, carpet, paint, adhesives, and acrylic glass are encompassed.
[0016] In addition, the invention encompasses a system for bioproduction of acrylic acid according to claim 40, said system comprising:a tank for saccharification of biomass; a line for passing the product of saccharification to a fermentation tank optionally via a pre-fermentation tank; a fermentation tank suitable for microorganism cell culture; a line for discharging contents from the fermentation tank to an extraction and/or separation vessel; an extraction and/or separation vessel suitable for removal of 3-hydroxypropionic acid from cell culture waste; a line for transferring 3-hydroxypropionic acid to a dehydration vessel; and a dehydration vessel suitable for conversion of 3-hydroxypropionic acid to acrylic acid. In various embodiments, the system further comprises one or more pre-fermentation tanks, distillation columns, centrifuge vessels, back extraction columns, mixing vessels, or combinations thereof. In various embodiments, the system has a minimum production capacity of at least 1 ton acrylic acid per year.
[0017] In various embodiments, a further genetic modification has been made that increases NADH/NADPH transhydrogenase activity. For example, the transhydrogenase activity may be soluble, may be membrane bound, may have a further genetic modification that has been made that increases cyanase activity, may include a further genetic modification that increases carbonic anhydrase activity, and/or may include a further genetic modification that increases pyruvate dehydrogenase activity.
[0018] In various embodiments, the invention includes a culture system comprising a carbon source in an aqueous medium and a genetically modified microorganism according to any one of claims, wherein said genetically modified microorganism is present in an amount selected from greater than 0.05 gDCW/L, 0.1 gDCW/L, greater than 1 gDCW/L, greater than 5 gDCW/L, greater than 10 gDCW/L, greater than 15 gDCW/L or greater than 20 gDCW/L, such as when the volume of the aqueous medium is selected from greater than 5 mL, greater than 100 mL, greater than 0.5 L, greater than 1 L, greater than 2 L, greater than 10 L, greater than 250 L, greater than 1000 L, greater than 10,000 L, greater than 50,000 L, greater than 100,000 L or greater than 200,000 L, and such as when the volume of the aqueous medium is greater than 250 L and contained within a steel vessel.
[0019] Variously, the carbon source for such culture systems is selected from dextrose, sucrose, a pentose, a polyol, a hexose, both a hexose and a pentose, and combinations thereof.
[0020] In various embodiments, the invention is an aqueous broth obtained from a culture system according to any one of claims, wherein said aqueous broth comprises i) a concentration of 3-hydroxypropionate selected from greater than 5 g/L, greater than 10 g/L, greater than 15 g/L, greater than 20 g/L, greater than 25 g/L, greater than 30 g/L, greater than 35 g/L, greater than 40 g/L, greater than 50 g/L, greater than 60 g/L, greater than 70 g/L, greater than 80 g/L, greater than 90 g/L, or greater than 100 g/L 3-hydroxypropionate; and ii) a concentration of 1,3-propanediol selected from less than 30 g/L; less than 20 g/L; less than 10 g/L; less than 5 g/L; less than 1 g/L; or less than 0.5 g/L. In some aspects, the aqueous broth comprises an amount of biomass selected from less than 20 gDCW/L biomass, less than 15 gDCW/L biomass, less than 10 gDCW/L biomass, less than 5 gDCW/L biomass or less than 1 gDCW/L biomass. Alternatively, the aqueous broth according to the invention is such that the 3-HP/succinate ratio (g3-HP/g succinate) is greater than 3, greater than 10 greater than 30, greater than 60, greater than 100, greater than 150 or greater than 200. In various aspects, the 3-HP/fumarate ratio (g3-HP/g fumarate) is greater than 3, greater than 10 greater than 30, greater than 60, greater than 100, greater than 150 or greater than 200, or the 3-HP/glycerol ratio (g3-HP/g glycerol) is greater than 3, greater than 10, greater than 30, greater than 60, greater than 100, greater than 150 or greater than 200, or the 3-HP/acetate ratio (g3-HP/g acetate) is greater than 1.5, greater than 3, greater than 10, greater than 30, greater than 60, greater than 100, greater than 150 or greater than 200, or the 3-HP/alanine ratio (g3-HP/g alanine) is greater than 3, greater than 10, greater than 30, greater than 60, greater than 100, greater than 150 or greater than 200, or the 3-HP/beta-alanine ratio (g3-HP/g beta-alanine) is greater than 1.5, greater than 3, greater than 10, greater than 30, greater than 60, greater than 100, greater than 150 or greater than 200, or the 3-HP/glutamate ratio (g3-HP/g glutamate) is greater than 3, greater than 10, greater than 30, greater than 60, greater than 100, greater than 150 or greater than 200, or the 3-HP/glutamine ratio (g3-HP/g glutamine) is greater than 3, greater than 10, greater than 30, greater than 60, greater than 100, greater than 150 or greater than 200, or the 3-HP/3-hydroxypropionaldehyde ratio (g3-HP/g 3-hydroxypropioaldehyde) is greater than 1.5, greater than 3, greater than 10, greater than 30, greater than 60, greater than 100, greater than 150 or greater than 200, or the 3-HP/1,3-propanediol ratio (g3-HP/g 1,3-propanediol) is greater than 1.5, greater than 3, greater than 10, greater than 30, greater than 60, greater than 100, greater than 150 or greater than 200, and/or the 3-HP/lactate ratio (g3-HP/g lactate) is greater than 3, greater than 10, greater than 30, greater than 60, greater than 100, greater than 150 or greater than 200.
BRIEF DESCRIPTION OF THE DRAWINGS
[0021] The novel features of the invention are set forth with particularity in the appended claims. A better understanding of the features and advantages of the present invention will be obtained by reference to the following detailed description that sets forth illustrative embodiments, in which the principles of the invention are utilized, and the accompanying drawings of which:
[0022] FIG. 1 depicts a CLUSTAL 2.0.12 multiple sequence alignment of Carbonic Anhydrase Polypeptides.
[0023] FIG. 2 depicts the reaction catalyzed by an oxaloacetate alpha-decarboxylase enzyme.
[0024] FIG. 3A depicts the natural mixed acid fermentation routes in E. coli
[0025] FIG. 3B depicts the proposed 3-HP fermentation pathway of the present invention
[0026] FIG. 4 depicts metabolic pathways of a microorganism related to aspects of the present invention, more particularly related to 3-HP production, with gene names of E. coli shown at certain enzymatic steps, the latter for example and not meant to be limiting.
[0027] FIG. 5 depicts the reaction catalyzed by the YdfG enzyme.
[0028] FIG. 6A depicts diaper manufacture.
[0029] FIG. 6B depicts diaper manufacture and diaper wear.
[0030] FIG. 7 provides a schematic of processing from biomass to polymerized acrylic acid.
[0031] FIG. 8 is a Western blot of purified Kgd, Oad, and Pdc proteins
[0032] FIG. 9 are the results of GC-MS analysis of 3-HP production by various Oad-2 candidate enzymes
[0033] Tables also are provided herein and are part of the specification.
DETAILED DESCRIPTION OF THE INVENTION
[0034] The present invention is related to various production methods and/or genetically modified microorganisms that have utility for fermentative production of 3-hydroxypropionic acid ("3-HP", CAS No. 503-66-2), which may be converted to a number of basic building blocks for polymers used in a wide range of industrial and consumer products. The present invention also is directed to methods of making 3-HP that utilize populations of these microorganisms in vessels, and to systems for chemical production that employ these microorganisms and methods. As noted herein, various aspects of the present invention are directed to a microorganism cell comprises a metabolic pathway from oxaloacetate to malonate semialdehyde, and also in various embodiments the capability to convert malonate semialdehyde to 3-HP, whether native and/or provided by a heterologous nucleic acid sequence encoding a protein having such activity.
[0035] Functional variants of an oxaloacetate alpha-decarboxylaseoxaloacetate alpha-decarboxylase are provided herein, with a demonstration of improved enzymatic conversion using these variant forms. In particular, three identified functional variant forms of an oxaloacetate alpha-decarboxylase from Leuconostoc mesenteroides are shown to have increased enzymatic activity. These may be provided in microorganisms that may also be provided with other genetic modifications described herein, resulting in improved capacity to produce 3-HP, which thereafter is converted to other chemicals, including acrylic acid, which is utilized for a number of industrial and consumer products.
[0036] Further as to the microorganisms of the present invention, in various embodiments additional genetic modifications may be made, such as to 1) increase intracellular bicarbonate levels, such as by increasing carbonic anhydrase, 2) increase enzymatic activity of NADPH-dependent transhydrogenase.
[0037] Additionally, genetic modifications for increasing tolerance may be combined with the present invention. Moreover, genetic modifications to increase expression and/or enzymatic activity of carbonic anhydrase and/or cyanase may provide dual-functions to advantageously improve both 3-HP production and 3-HP tolerance.
[0038] Other additional genetic modifications are disclosed herein for various embodiments.
Definitions
[0039] As used in the specification and the claims, the singular forms "a," "an," and "the" include plural referents unless the context clearly dictates otherwise. Thus, for example, reference to an "expression vector" includes a single expression vector as well as a plurality of expression vectors, either the same (e.g., the same operon) or different; reference to "microorganism" includes a single microorganism as well as a plurality of microorganisms; and the like.
[0040] As used herein, dry cell weight (DCW) for E. coli strains is calculated as 0.33 times the measured OD600 value, based on baseline DCW to OD600 determinations.
[0041] As used herein, "reduced enzymatic activity," "reducing enzymatic activity," and the like is meant to indicate that a microorganism cell's, or an isolated enzyme, exhibits a lower level of activity than that measured in a comparable cell of the same species or its native enzyme. That is, enzymatic conversion of the indicated substrate(s) to indicated product(s) under known standard conditions for that enzyme is at least 10, at least 20, at least 30, at least 40, at least 50, at least 60, at least 70, at least 80, or at least 90 percent less than the enzymatic activity for the same biochemical conversion by a native (non-modified) enzyme under a standard specified condition. This term also can include elimination of that enzymatic activity. A cell having reduced enzymatic activity of an enzyme can be identified using any method known in the art. For example, enzyme activity assays can be used to identify cells having reduced enzyme activity. See, for example, Enzyme Nomenclature, Academic Press, Inc., New York 2007.
[0042] The term "heterologous DNA," "heterologous nucleic acid sequence," and the like as used herein refers to a nucleic acid sequence wherein at least one of the following is true: (a) the sequence of nucleic acids is foreign to (i.e., not naturally found in) a given host microorganism; (b) the sequence may be naturally found in a given host microorganism, but in an unnatural (e.g., greater than expected) amount; or (c) the sequence of nucleic acids comprises two or more subsequences that are not found in the same relationship to each other in nature. For example, regarding instance (c), a heterologous nucleic acid sequence that is recombinantly produced will have two or more sequences from unrelated genes arranged to make a new functional nucleic acid.
[0043] The term "heterologous" is intended to include the term "exogenous" as the latter term is generally used in the art. With reference to the host microorganism's genome prior to the introduction of a heterologous nucleic acid sequence, the nucleic acid sequence that codes for the enzyme is heterologous (whether or not the heterologous nucleic acid sequence is introduced into that genome).
[0044] As used herein, the term "gene disruption," or grammatical equivalents thereof (and including "to disrupt enzymatic function," "disruption of enzymatic function," and the like), is intended to mean a genetic modification to a microorganism that renders the encoded gene product as having a reduced polypeptide activity compared with polypeptide activity in or from a microorganism cell not so modified. The genetic modification can be, for example, deletion of the entire gene, deletion or other modification of a regulatory sequence required for transcription or translation, deletion of a portion of the gene which results in a truncated gene product (e.g., enzyme) or by any of various mutation strategies that reduces activity (including to no detectable activity level) the encoded gene product. A disruption may broadly include a deletion of all or part of the nucleic acid sequence encoding the enzyme, and also includes, but is not limited to other types of genetic modifications, e.g., introduction of stop codons, frame shift mutations, introduction or removal of portions of the gene, and introduction of a degradation signal, those genetic modifications affecting mRNA transcription levels and/or stability, and altering the promoter or repressor upstream of the gene encoding the enzyme.
[0045] In various contexts, a gene disruption is taken to mean any genetic modification to the DNA, mRNA encoded from the DNA, and the corresponding amino acid sequence that results in reduced polypeptide activity. Many different methods can be used to make a cell having reduced polypeptide activity. For example, a cell can be engineered to have a disrupted regulatory sequence or polypeptide-encoding sequence using common mutagenesis or knock-out technology. See, e.g., Methods in Yeast Genetics (1997 edition), Adams et al., Cold Spring Harbor Press (1998). One particularly useful method of gene disruption is complete gene deletion because it reduces or eliminates the occurrence of genetic reversions in the genetically modified microorganisms of the invention. Accordingly, a disruption of a gene whose product is an enzyme thereby disrupts enzymatic function. Alternatively, antisense technology can be used to reduce the activity of a particular polypeptide. For example, a cell can be engineered to contain a cDNA that encodes an antisense molecule that prevents a polypeptide from being translated. Further, gene silencing can be used to reduce the activity of a particular polypeptide.
[0046] The term "antisense molecule" as used herein encompasses any nucleic acid molecule or nucleic acid analog (e.g., peptide nucleic acids) that contains a sequence that corresponds to the coding strand of an endogenous polypeptide. An antisense molecule also can have flanking sequences (e.g., regulatory sequences). Thus, antisense molecules can be ribozymes or antisense oligonucleotides.
[0047] As used herein, a ribozyme can have any general structure including, without limitation, hairpin, hammerhead, or axhead structures, provided the molecule cleaves RNA.
[0048] The term "reduction" or "to reduce" when used in such phrase and its grammatical equivalents are intended to encompass a complete elimination of such conversion(s).
[0049] Bio-production, as used herein, may be aerobic, microaerobic, or anaerobic. Also as used herein, the terms "production" and "bio-production" are used interchangeably when referring to microbial synthesis of 3-HP.
[0050] As used herein, the language "sufficiently homologous" refers to proteins or portions thereof that have amino acid sequences that include a minimum number of identical or equivalent amino acid residues when compared to an amino acid sequence of the amino acid sequences provided in this application (including the SEQ ID Nos./sequence listings) such that the protein or portion thereof is able to achieve the respective enzymatic reaction and/or other function. To determine whether a particular protein or portion thereof is sufficiently homologous may be determined by an assay of enzymatic activity, such as those commonly known in the art.
[0051] Descriptions and methods for sequence identity and homology are intended to be exemplary and it is recognized that these concepts are well-understood in the art. Further, it is appreciated that nucleic acid sequences may be varied and still encode an enzyme or other polypeptide exhibiting a desired functionality, and such variations are within the scope of the present invention.
[0052] Further to nucleic acid sequences, "hybridization" refers to the process in which two single-stranded polynucleotides bind non-covalently to form a stable double-stranded polynucleotide. The term "hybridization" may also refer to triple-stranded hybridization. The resulting (usually) double-stranded polynucleotide is a "hybrid" or "duplex." "Hybridization conditions" will typically include salt concentrations of less than about 1M, more usually less than about 500 mM and less than about 200 mM. Hybridization temperatures can be as low as 5° C., but are typically greater than 22° C., more typically greater than about 30° C., and often are in excess of about 37° C. Hybridizations are usually performed under stringent conditions, i.e. conditions under which a probe will hybridize to its target subsequence. Stringent conditions are sequence-dependent and are different in different circumstances. Longer fragments may require higher hybridization temperatures for specific hybridization. As other factors may affect the stringency of hybridization, including base composition and length of the complementary strands, presence of organic solvents and extent of base mismatching, the combination of parameters is more important than the absolute measure of any one alone. Generally, stringent conditions are selected to be about 5° C. lower than the Tm for the specific sequence at a defined ionic strength and pH. Exemplary stringent conditions include salt concentration of at least 0.01 M to no more than 1 M Na ion concentration (or other salts) at a pH 7.0 to 8.3 and a temperature of at least 25° C. For example, conditions of 5×SSPE (750 mM NaCl, 50 mM NaPhosphate, 5 mM EDTA, pH 7.4) and a temperature of 25-30° C. are suitable for allele-specific probe hybridizations. For stringent conditions, see for example, Sambrook and Russell and Anderson "Nucleic Acid Hybridization" 1st Ed., BIOS Scientific Publishers Limited (1999), which are hereby incorporated by reference for hybridization protocols. "Hybridizing specifically to" or "specifically hybridizing to" or like expressions refer to the binding, duplexing, or hybridizing of a molecule substantially to or only to a particular nucleotide sequence or sequences under stringent conditions when that sequence is present in a complex mixture (e.g., total cellular) DNA or RNA.
[0053] The term "identified enzymatic functional variant" means a polypeptide that is determined to possess an enzymatic activity and specificity of an enzyme of interest but which has an amino acid sequence different from such enzyme of interest. A corresponding "variant nucleic acid sequence" may be constructed that is determined to encode such an identified enzymatic functional variant. For a particular purpose, such as increased tolerance to 3-HP via genetic modification to increase enzymatic conversion at one or more of the enzymatic conversion steps of the 3HPTGC in a microorganism, one or more genetic modifications may be made to provide one or more heterologous nucleic acid sequence(s) that encode one or more identified 3HPTGC enzymatic functional variant(s). That is, each such nucleic acid sequence encodes a polypeptide that is not exactly the known polypeptide of an enzyme of the 3HPTGC, but which nonetheless is shown to exhibit enzymatic activity of such enzyme. Such nucleic acid sequence, and the polypeptide it encodes, may not fall within a specified limit of homology or identity yet by its provision in a cell nonetheless provide for a desired enzymatic activity and specificity. The ability to obtain such variant nucleic acid sequences and identified enzymatic functional variants is supported by recent advances in the states of the art in bioinformatics and protein engineering and design, including advances in computational, predictive and high-throughput methodologies. Functional variants more generally include enzymatic functional variants, and the nucleic acids sequences that encode them, as well as variants of non-enzymatic polypeptides, wherein the variant exhibits the function of the original (target) sequence.
[0054] The use of the phrase "segment of interest" is meant to include both a gene and any other nucleic acid sequence segment of interest. One example of a method used to obtain a segment of interest is to acquire a culture of a microorganism, where that microorganism's genome includes the gene or nucleic acid sequence segment of interest.
[0055] When the genetic modification of a gene product, i.e., an enzyme, is referred to herein, including the claims, it is understood that the genetic modification is of a nucleic acid sequence, such as or including the gene, that normally encodes the stated gene product, i.e., the enzyme.
[0056] In some embodiments a truncated respective polypeptide has at least about 90% of the full length of a polypeptide encoded by a nucleic acid sequence encoding the respective native enzyme, and more particularly at least 95% of the full length of a polypeptide encoded by a nucleic acid sequence encoding the respective native enzyme. By a polypeptide having an amino acid sequence at least, for example, 95% "identical" to a reference amino acid sequence of a polypeptide is intended that the amino acid sequence of the claimed polypeptide is identical to the reference sequence except that the claimed polypeptide sequence can include up to five amino acid alterations per each 100 amino acids of the reference amino acid of the polypeptide. In other words, to obtain a polypeptide having an amino acid sequence at least 95% identical to a reference amino acid sequence, up to 5% of the amino acid residues in the reference sequence can be deleted or substituted with another amino acid, or a number of amino acids up to 5% of the total amino acid residues in the reference sequence can be inserted into the reference sequence. These alterations of the reference sequence can occur at the amino or carboxy terminal positions of the reference amino acid sequence or anywhere between those terminal positions, interspersed either individually among residues in the reference sequence or in one or more contiguous groups within the reference sequence. In other embodiments truncation may be more substantial, as described elsewhere herein.
[0057] Species and other phylogenic identifications are according to the classification known to a person skilled in the art of microbiology.
[0058] Where methods and steps described herein indicate certain events occurring in certain order, those of ordinary skill in the art will recognize that the ordering of certain steps may be modified and that such modifications are in accordance with the variations of the invention. Additionally, certain steps may be performed concurrently in a parallel process when possible, as well as performed sequentially.
[0059] Prophetic examples provided herein are meant to be broadly exemplary and not limiting in any way. This applies to the examples regarding separation and purification of 3-HP, and conversions of 3-HP to downstream compounds, since there are numerous possible approaches to such steps and conversions, including those disclosed in references recited and incorporated herein.
[0060] The meaning of abbreviations is as follows: "C" means Celsius or degrees Celsius, as is clear from its usage, DCW means dry cell weight, "s" means second(s), "min" means minute(s), "h," "hr," or "hrs" means hour(s), "psi" means pounds per square inch, "nm" means nanometers, "d" means day(s), "μL" or "uL" or "ul" means microliter(s), "mL" means milliliter(s), "L" means liter(s), "mm" means millimeter(s), "nm" means nanometers, "mM" means millimolar, "μM" or "uM" means micromolar, "M" means molar, "mmol" means millimole(s), "μmol" or "uMol" means micromole(s)", "g" means gram(s), "μg" or "ug" means microgram(s) and "ng" means nanogram(s), "PCR" means polymerase chain reaction, "OD" means optical density, "OD600" means the optical density measured at a photon wavelength of 600 nm, "kDa" means kilodaltons, "g" means the gravitation constant, "bp" means base pair(s), "kbp" means kilobase pair(s), "% w/v" means weight/volume percent, "% v/v" means volume/volume percent, "IPTG" means isopropyl-μ-D-thiogalactopyranoiside, "RBS" means ribosome binding site, "rpm" means revolutions per minute, "HPLC" means high performance liquid chromatography, and "GC" means gas chromatography. As disclosed herein, "3-HP" means 3-hydroxypropionic acid and "3HPTGC" means the 3-HP toleragenic complex. Also, 10 5 and the like are taken to mean 105 and the like.
[0061] I. Carbon Sources
[0062] Bio-production media, which is used in the present invention with recombinant microorganisms having a biosynthetic pathway for 3-HP, must contain suitable carbon sources or substrates for the intended metabolic pathways. Suitable substrates may include, but are not limited to, monosaccharides such as glucose and fructose, oligosaccharides such as lactose or sucrose, polysaccharides such as starch or cellulose or mixtures thereof and unpurified mixtures from renewable feedstocks such as cheese whey permeate, cornsteep liquor, sugar beet molasses, and barley malt. Additionally the carbon substrate may also be one-carbon substrates such as carbon dioxide, carbon monoxide, or methanol for which metabolic conversion into key biochemical intermediates has been demonstrated. In addition to one and two carbon substrates methylotrophic organisms are also known to utilize a number of other carbon containing compounds such as methylamine, glucosamine and a variety of amino acids for metabolic activity.
[0063] Although it is contemplated that all of the above mentioned carbon substrates and mixtures thereof are suitable in the present invention as a carbon source, common carbon substrates used as carbon sources are glucose, fructose, and sucrose, as well as mixtures of any of these sugars. Other suitable substrates include xylose, arabinose, other cellulose-based C-5 sugars, high-fructose corn syrup, and various other sugars and sugar mixtures as are available commercially. Sucrose may be obtained from feedstocks such as sugar cane, sugar beets, cassava, bananas or other fruit, and sweet sorghum. Glucose and dextrose may be obtained through saccharification of starch based feedstocks including grains such as corn, wheat, rye, barley, and oats. Also, in some embodiments all or a portion of the carbon source may be glycerol. Alternatively, glycerol may be excluded as an added carbon source.
[0064] In one embodiment, the carbon source is selected from glucose, fructose, sucrose, dextrose, lactose, glycerol, and mixtures thereof. Variously, the amount of these components in the carbon source may be greater than about 50%, greater than about 60%, greater than about 70%, greater than about 80%, greater than about 90%, or more, up to 100% or essentially 100% of the carbon source.
[0065] In addition, methylotrophic organisms are known to utilize a number of other carbon containing compounds such as methylamine, glucosamine and a variety of amino acids for metabolic activity. For example, methylotrophic yeast are known to utilize the carbon from methylamine to form trehalose or glycerol (Hellion et al., Microb. Growth Cl Compd. (Int. Symp.), 7th (1993), 415-32. Editor(s): Murrell, J. Collin; Kelly, Don P. Publisher: Intercept, Andover, UK). Similarly, various species of Candida will metabolize alanine or oleic acid (Sulter et al., Arch. Microbiol. 153:485-489 (1990)). Hence it is contemplated that the source of carbon utilized in embodiments of the present invention may encompass a wide variety of carbon-containing substrates.
[0066] In addition, fermentable sugars may be obtained from cellulosic and lignocellulosic biomass through processes of pretreatment and saccharification, as described, for example, in U.S. Patent Publication No. 2007/0031918A1, which is herein incorporated by reference. Biomass refers to any cellulosic or lignocellulosic material and includes materials comprising cellulose, and optionally further comprising hemicellulose, lignin, starch, oligosaccharides and/or monosaccharides. Biomass may also comprise additional components, such as protein and/or lipid. Biomass may be derived from a single source, or biomass can comprise a mixture derived from more than one source; for example, biomass could comprise a mixture of corn cobs and corn stover, or a mixture of grass and leaves. Biomass includes, but is not limited to, bioenergy crops, agricultural residues, municipal solid waste, industrial solid waste, sludge from paper manufacture, yard waste, wood and forestry waste. Examples of biomass include, but are not limited to, corn grain, corn cobs, crop residues such as corn husks, corn stover, grasses, wheat, wheat straw, barley, barley straw, hay, rice straw, switchgrass, waste paper, sugar cane bagasse, sorghum, soy, components obtained from milling of grains, trees, branches, roots, leaves, wood chips, sawdust, shrubs and bushes, vegetables, fruits, flowers and animal manure. Any such biomass may be used in a bio-production method or system to provide a carbon source. Various approaches to breaking down cellulosic biomass to mixtures of more available and utilizable carbon molecules, including sugars, include: heating in the presence of concentrated or dilute acid (e.g., <1% sulfuric acid); treating with ammonia; treatment with ionic salts; enzymatic degradation; and combinations of these. These methods normally follow mechanical separation and milling, and are followed by appropriate separation processes.
[0067] In various embodiments, any of a wide range of sugars, including, but not limited to sucrose, glucose, xylose, cellulose or hemicellulose, are provided to a microorganism, such as in an industrial system comprising a reactor vessel in which a defined media (such as a minimal salts media including but not limited to M9 minimal media, potassium sulfate minimal media, yeast synthetic minimal media and many others or variations of these), an inoculum of a microorganism providing one or more of the 3-HP biosynthetic pathway alternatives, and the a carbon source may be combined. The carbon source enters the cell and is cataboliized by well-known and common metabolic pathways to yield common metabolic intermediates, including phosphoenolpyruvate (PEP). (See Molecular Biology of the Cell, 3rd Ed., B. Alberts et al. Garland Publishing, New York, 1994, pp. 42-45, 66-74, incorporated by reference for the teachings of basic metabolic catabolic pathways for sugars; Principles of Biochemistry, 3rd Ed., D. L. Nelson & M. M. Cox, Worth Publishers, New York, 2000, pp 527-658, incorporated by reference for the teachings of major metabolic pathways; and Biochemistry, 4th Ed., L. Stryer, W. H. Freeman and Co., New York, 1995, pp. 463-650, also incorporated by reference for the teachings of major metabolic pathways.)
[0068] Bio-based carbon can be distinguished from petroleum-based carbon according to a variety of methods, including without limitation ASTM D6866, or various other techniques. For example, carbon-14 and carbon-12 ratios differ in bio-based carbon sources versus petroleum-based sources, where higher carbon-14 ratios are found in bio-based carbon sources. In various embodiments, the carbon source is not petroleum-based, or is not predominantly petroleum based. In various embodiments, the carbon source is greater than about 50% non-petroleum based, greater than about 60% non-petroleum based, greater than about 70% non-petroleum based, greater than about 80% non-petroleum based, greater than about 90% non-petroleum based, or more. In various embodiments, the carbon source has a carbon-14 to carbon-12 ratio of about 1.0×10-14 or greater.
[0069] Various components may be excluded from the carbon source. For example, in some embodiments, acrylic acid, 1,4-butanediol, and/or glycerol are excluded or essentially excluded from the carbon source. As such, the carbon source according to some embodiments of the invention may be less than about 50% glycerol, less than about 40% glycerol, less than about 30% glycerol, less than about 20% glycerol, less than about 10% glycerol, less than about 5% glycerol, less than about 1% glycerol, or less. For example, the carbon source may be essentially glycerol-free. By essentially glycerol-free is meant that any glycerol that may be present in a residual amount does not contribute substantially to the production of the target chemical compound.
[0070] II. Microorganisms
[0071] Features as described and claimed herein may be provided in a microorganism selected from the listing herein, or another suitable microorganism, that also comprises one or more natural, introduced, or enhanced 3-HP bio-production pathways. Thus, in some embodiments the microorganism comprises an endogenous 3-HP production pathway (which may, in some such embodiments, be enhanced), whereas in other embodiments the microorganism does not comprise an endogenous 3-HP production pathway.
[0072] Varieties of these genetically modified microorganisms may comprise genetic modifications and/or other system alterations as may be described in other patent applications of one or more of the present inventor(s) and/or subject to assignment to the owner of the present patent application.
[0073] The examples describe specific modifications and evaluations to certain bacterial and yeast microorganisms. The scope of the invention is not meant to be limited to such species, but to be generally applicable to a wide range of suitable microorganisms. Generally, a microorganism used for the present invention may be selected from bacteria, cyanobacteria, filamentous fungi and yeasts.
[0074] For some embodiments, microbial hosts initially selected for 3-HP toleragenic bio-production should also utilize sugars including glucose at a high rate. Most microbes are capable of utilizing carbohydrates. However, certain environmental microbes cannot utilize carbohydrates to high efficiency, and therefore would not be suitable hosts for such embodiments that are intended for glucose or other carbohydrates as the principal added carbon source.
[0075] As the genomes of various species become known, the present invention easily may be applied to an ever-increasing range of suitable microorganisms. Further, given the relatively low cost of genetic sequencing, the genetic sequence of a species of interest may readily be determined to make application of aspects of the present invention more readily obtainable (based on the ease of application of genetic modifications to an organism having a known genomic sequence).
[0076] More particularly, based on the various criteria described herein, suitable microbial hosts for the bio-production of 3-HP that comprise tolerance aspects provided herein generally may include, but are not limited to, any gram negative organisms, more particularly a member of the family Enterobacteriaceae, such as E. coli, or Oligotropha carboxidovorans, or Pseudomonas sp.; any gram positive microorganism, for example Bacillus subtilis, Lactobaccilus sp. or Lactococcus sp.; a yeast, for example Saccharomyces cerevisiae, Pichia pastoris or Pichia stipitis; and other groups or microbial species. More particularly, suitable microbial hosts for the bio-production of 3-HP generally include, but are not limited to, members of the genera Clostridium, Zymomonas, Escherichia, Salmonella, Rhodococcus, Pseudomonas, Bacillus, Lactobacillus, Enterococcus, Alcaligenes, Klebsiella, Paenibacillus, Arthrobacter, Corynebacterium, Brevibacterium, Pichia, Candida, Hansenula and Saccharomyces. Hosts that may be particularly of interest include: Oligotropha carboxidovorans (such as strain OM5), Escherichia coli, Alcaligenes eutrophus (Cupriavidus necator), Bacillus licheniformis, Paenibacillus macerans, Rhodococcus erythropolis, Pseudomonas putida, Lactobacillus plantarum, Enterococcus faecium, Enterococcus gallinarium, Enterococcus faecalis, Bacillus subtilis and Saccharomyces cerevisiae.
[0077] More particularly, suitable microbial hosts for the bio-production of 3-HP generally include, but are not limited to, members of the genera Clostridium, Zymomonas, Escherichia, Salmonella, Rhodococcus, Pseudomonas, Bacillus, Lactobacillus, Enterococcus, Alcaligenes, Klebsiella, Paenibacillus, Arthrobacter, Corynebacterium, Brevibacterium, Pichia, Candida, Hansenula and Saccharomyces.
[0078] Hosts that may be particularly of interest include: Oligotropha carboxidovorans (such as strain OM5T), Escherichia coli, Alcaligenes eutrophus (Cupriavidus necator), Bacillus licheniformis, Paenibacillus macerans, Rhodococcus erythropolis, Pseudomonas putida, Lactobacillus plantarum, Enterococcus faecium, Enterococcus gallinarium, Enterococcus faecalis, Bacillus subtilis and Saccharomyces cerevisiae. Also, any of the known strains of these species may be utilized as a starting microorganism, as may any of the following species including respective strains thereof--Cupriavidus basilensis, Cupriavidus campinensis, Cupriavidus gilardi, Cupriavidus laharsis, Cupriavidus metallidurans, Cupriavidus oxalaticus, Cupriavidus pauculus, Cupriavidus pinatubonensis, Cupriavidus respiraculi, and Cupriavidus taiwanensis.
[0079] In some embodiments, the recombinant microorganism is a gram-negative bacterium. In some embodiments, the recombinant microorganism is selected from the genera Zymomonas, Escherichia, Pseudomonas, Alcaligenes, and Klebsiella. In some embodiments, the recombinant microorganism is selected from the species Escherichia coli, Cupriavidus necator, Oligotropha carboxidovorans, and Pseudomonas putida. In some embodiments, the recombinant microorganism is an E. coli strain.
[0080] In some embodiments, the recombinant microorganism is a gram-positive bacterium. In some embodiments, the recombinant microorganism is selected from the genera Clostridium, Salmonella, Rhodococcus, Bacillus, Lactobacillus, Enterococcus, Paenibacillus, Arthrobacter, Corynebacterium, and Brevibacterium. In some embodiments, the recombinant microorganism is selected from the species Bacillus licheniformis, Paenibacillus macerans, Rhodococcus erythropolis, Lactobacillus plantarum, Enterococcus faecium, Enterococcus gallinarium, Enterococcus faecalis, and Bacillus subtilis. In particular embodiments, the recombinant microorganism is a B. subtilis strain.
[0081] In some embodiments, the recombinant microorganism is a yeast. In some embodiments, the recombinant microorganism is selected from the genera Pichia, Candida, Hansenula and Saccharomyces. In particular embodiments, the recombinant microorganism is Saccharomyces cerevisiae.
[0082] III. Media and Culture Conditions
[0083] In addition to an appropriate carbon source, such as selected from one of the herein-disclosed types, bio-production media must contain suitable minerals, salts, cofactors, buffers and other components, known to those skilled in the art, suitable for the growth of the cultures and promotion of the enzymatic pathway necessary for 3-HP production, or other products made under the present invention.
[0084] Another aspect of the invention regards media and culture conditions that comprise genetically modified microorganisms of the invention and optionally supplements.
[0085] Typically cells are grown at a temperature in the range of about 25° C. to about 40° C. in an appropriate medium, as well as up to 70° C. for thermophilic microorganisms. Suitable growth media in the present invention are common commercially prepared media such as Luria Bertani (LB) broth, M9 minimal media, Sabouraud Dextrose (SD) broth, Yeast medium (YM) broth, (Ymin) yeast synthetic minimal media, and minimal media as described herein, such as M9 minimal media. Other defined or synthetic growth media may also be used, and the appropriate medium for growth of the particular microorganism will be known by one skilled in the art of microbiology or bio-production science. In various embodiments a minimal media may be developed and used that does not comprise, or that has a low level of addition of various components, for example less than 10, 5, 2 or 1 g/L of a complex nitrogen source including but not limited to yeast extract, peptone, tryptone, soy flour, corn steep liquor, or casein. These minimal medias may also have limited supplementation of vitamin mixtures including biotin, vitamin B12 and derivatives of vitamin B12, thiamin, pantothenate and other vitamins. Minimal medias may also have limited simple inorganic nutrient sources containing less than 28, 17, or 2.5 mM phosphate, less than 25 or 4 mM sulfate, and less than 130 or 50 mM total nitrogen.
[0086] Bio-production media, which is used in embodiments of the present invention with genetically modified microorganisms, must contain suitable carbon substrates for the intended metabolic pathways. As described hereinbefore, suitable carbon substrates include carbon monoxide, carbon dioxide, and various monomeric and oligomeric sugars.
[0087] Suitable pH ranges for the bio-production are between pH 3.0 to pH 10.0, where pH 6.0 to pH 8.0 is a typical pH range for the initial condition. However, the actual culture conditions for a particular embodiment are not meant to be limited by these pH ranges.
[0088] Bio-productions may be performed under aerobic, microaerobic, or anaerobic conditions, with or without agitation.
[0089] The amount of 3-HP or other product(s) produced in a bio-production media generally can be determined using a number of methods known in the art, for example, high performance liquid chromatography (HPLC), gas chromatography (GC), or GC/Mass Spectroscopy (MS). Specific HPLC methods for the specific examples are provided herein.
[0090] IV. Bio-Production Reactors and Systems
[0091] Fermentation systems utilizing methods and/or compositions according to the invention are also within the scope of the invention.
[0092] Any of the recombinant microorganisms as described and/or referred to herein may be introduced into an industrial bio-production system where the microorganisms convert a carbon source into 3-HP in a commercially viable operation. The bio-production system includes the introduction of such a recombinant microorganism into a bioreactor vessel, with a carbon source substrate and bio-production media suitable for growing the recombinant microorganism, and maintaining the bio-production system within a suitable temperature range (and dissolved oxygen concentration range if the reaction is aerobic or microaerobic) for a suitable time to obtain a desired conversion of a portion of the substrate molecules to 3-HP. Industrial bio-production systems and their operation are well-known to those skilled in the arts of chemical engineering and bioprocess engineering.
[0093] Bio-productions may be performed under aerobic, microaerobic, or anaerobic conditions, with or without agitation. The operation of cultures and populations of microorganisms to achieve aerobic, microaerobic and anaerobic conditions are known in the art, and dissolved oxygen levels of a liquid culture comprising a nutrient media and such microorganism populations may be monitored to maintain or confirm a desired aerobic, microaerobic or anaerobic condition. When syngas is used as a feedstock, aerobic, microaerobic, or anaerobic conditions may be utilized. When sugars are used, anaerobic, aerobic or microaerobic conditions can be implemented in various embodiments.
[0094] Any of the recombinant microorganisms as described and/or referred to herein may be introduced into an industrial bio-production system where the microorganisms convert a carbon source into 3-HP, and optionally in various embodiments also to one or more downstream compounds of 3-HP in a commercially viable operation. The bio-production system includes the introduction of such a recombinant microorganism into a bioreactor vessel, with a carbon source substrate and bio-production media suitable for growing the recombinant microorganism, and maintaining the bio-production system within a suitable temperature range (and dissolved oxygen concentration range if the reaction is aerobic or microaerobic) for a suitable time to obtain a desired conversion of a portion of the substrate molecules to 3-HP.
[0095] In various embodiments, syngas components or sugars are provided to a microorganism, such as in an industrial system comprising a reactor vessel in which a defined media (such as a minimal salts media including but not limited to M9 minimal media, potassium sulfate minimal media, yeast synthetic minimal media and many others or variations of these), an inoculum of a microorganism providing an embodiment of the biosynthetic pathway(s) taught herein, and the carbon source may be combined. The carbon source enters the cell and is catabolized by well-known and common metabolic pathways to yield common metabolic intermediates, including phosphoenolpyruvate (PEP). (See Molecular Biology of the Cell, 3rd Ed., B. Alberts et al. Garland Publishing, New York, 1994, pp. 42-45, 66-74, incorporated by reference for the teachings of basic metabolic catabolic pathways for sugars; Principles of Biochemistry, 3rd Ed., D. L. Nelson & M. M. Cox, Worth Publishers, New York, 2000, pp. 527-658, incorporated by reference for the teachings of major metabolic pathways; and Biochemistry, 4th Ed., L. Stryer, W. H. Freeman and Co., New York, 1995, pp. 463-650, also incorporated by reference for the teachings of major metabolic pathways.).
[0096] Further to types of industrial bio-production, various embodiments of the present invention may employ a batch type of industrial bioreactor. A classical batch bioreactor system is considered "closed" meaning that the composition of the medium is established at the beginning of a respective bio-production event and not subject to artificial alterations and additions during the time period ending substantially with the end of the bio-production event. Thus, at the beginning of the bio-production event the medium is inoculated with the desired organism or organisms, and bio-production is permitted to occur without adding anything to the system. Typically, however, a "batch" type of bio-production event is batch with respect to the addition of carbon source and attempts are often made at controlling factors such as pH and oxygen concentration. In batch systems the metabolite and biomass compositions of the system change constantly up to the time the bio-production event is stopped. Within batch cultures cells moderate through a static lag phase to a high growth log phase and finally to a stationary phase where growth rate is diminished or halted. If untreated, cells in the stationary phase will eventually die. Cells in log phase generally are responsible for the bulk of production of a desired end product or intermediate.
[0097] A variation on the standard batch system is the fed-batch system. Fed-batch bio-production processes are also suitable in the present invention and comprise a typical batch system with the exception that the nutrients, including the substrate, are added in increments as the bio-production progresses. Fed-Batch systems are useful when catabolite repression is apt to inhibit the metabolism of the cells and where it is desirable to have limited amounts of substrate in the media. Measurement of the actual nutrient concentration in Fed-Batch systems may be measured directly, such as by sample analysis at different times, or estimated on the basis of the changes of measurable factors such as pH, dissolved oxygen and the partial pressure of waste gases such as CO2. Batch and fed-batch approaches are common and well known in the art and examples may be found in Thomas D. Brock in Biotechnology: A Textbook of Industrial Microbiology, Second Edition (1989) Sinauer Associates, Inc., Sunderland, Mass., Deshpande, Mukund V., Appl. Biochem. Biotechnol., 36:227, (1992), and Biochemical Engineering Fundamentals, 2nd Ed. J. E. Bailey and D. F. Ollis, McGraw Hill, New York, 1986, herein incorporated by reference for general instruction on bio-production.
[0098] Although embodiments of the present invention may be performed in batch mode, or in fed-batch mode, it is contemplated that the invention would be adaptable to continuous bio-production methods. Continuous bio-production is considered an "open" system where a defined bio-production medium is added continuously to a bioreactor and an equal amount of conditioned media is removed simultaneously for processing. Continuous bio-production generally maintains the cultures within a controlled density range where cells are primarily in log phase growth. Two types of continuous bioreactor operation include a chemostat, wherein fresh media is fed to the vessel while simultaneously removing an equal rate of the vessel contents. The limitation of this approach is that cells are lost and high cell density generally is not achievable. In fact, typically one can obtain much higher cell density with a fed-batch process. Another continuous bioreactor utilizes perfusion culture, which is similar to the chemostat approach except that the stream that is removed from the vessel is subjected to a separation technique which recycles viable cells back to the vessel. This type of continuous bioreactor operation has been shown to yield significantly higher cell densities than fed-batch and can be operated continuously. Continuous bio-production is particularly advantageous for industrial operations because it has less down time associated with draining, cleaning and preparing the equipment for the next bio-production event. Furthermore, it is typically more economical to continuously operate downstream unit operations, such as distillation, than to run them in batch mode.
[0099] Continuous bio-production allows for the modulation of one factor or any number of factors that affect cell growth or end product concentration. For example, one method will maintain a limiting nutrient such as the carbon source or nitrogen level at a fixed rate and allow all other parameters to moderate. In other systems a number of factors affecting growth can be altered continuously while the cell concentration, measured by media turbidity, is kept constant. Methods of modulating nutrients and growth factors for continuous bio-production processes as well as techniques for maximizing the rate of product formation are well known in the art of industrial microbiology and a variety of methods are detailed by Brock, supra.
[0100] It is contemplated that embodiments of the present invention may be practiced using either batch, fed-batch or continuous processes and that any known mode of bio-production would be suitable. It is contemplated that cells may be immobilized on an inert scaffold as whole cell catalysts and subjected to suitable bio-production conditions for 3-HP production, or be cultured in liquid media in a vessel, such as a culture vessel. Thus, embodiments used in such processes, and in bio-production systems using these processes, include a population of genetically modified microorganisms of the present invention, a culture system comprising such population in a media comprising nutrients for the population, and methods of making 3-HP and thereafter, a downstream product of 3-HP.
[0101] Embodiments of the invention include methods of making 3-HP in a bio-production system, some of which methods may include obtaining 3-HP after such bio-production event. For example, a method of making 3-HP may comprise: providing to a culture vessel a media comprising suitable nutrients; providing to the culture vessel an inoculum of a genetically modified microorganism comprising genetic modifications described herein such that the microorganism produces 3-HP from syngas and/or a sugar molecule; and maintaining the culture vessel under suitable conditions for the genetically modified microorganism to produce 3-HP.
[0102] It is within the scope of the present invention to produce, and to utilize in bio-production methods and systems, including industrial bio-production systems for production of 3-HP, a recombinant microorganism genetically engineered to modify one or more aspects effective to increase tolerance to 3-HP (and, in some embodiments, also 3-HP bio-production) by at least 20 percent over control microorganism lacking the one or more modifications.
[0103] In various embodiments, the invention is directed to a system for bioproduction of acrylic acid as described herein, said system comprising: a tank for saccharification of biomass; a line for passing the product of saccharification to a fermentation tank optionally via a pre-fermentation tank; a fermentation tank suitable for microorganism cell culture; a line for discharging contents from the fermentation tank to an extraction and/or separation vessel; an extraction and/or separation vessel suitable for removal of 3-hydroxypropionic acid from cell culture waste; a line for transferring 3-hydroxypropionic acid to a dehydration vessel; and a dehydration vessel suitable for conversion of 3-hydroxypropionic acid to acrylic acid. In various embodiments, the system includes one or more pre-fermentation tanks, distillation columns, centrifuge vessels, back extraction columns, mixing vessels, or combinations thereof.
[0104] The following published resources are incorporated by reference herein for their respective teachings to indicate the level of skill in these relevant arts, and as needed to support a disclosure that teaches how to make and use methods of industrial bio-production of 3-HP, or other product(s) produced under the invention, from sugar sources, and also industrial systems that may be used to achieve such conversion with any of the recombinant microorganisms of the present invention (Biochemical Engineering Fundamentals, 2nd Ed. J. E. Bailey and D. F. Ollis, McGraw Hill, New York, 1986, entire book for purposes indicated and Chapter 9, pages 533-657 in particular for biological reactor design; Unit Operations of Chemical Engineering, 5th Ed., W. L. McCabe et al., McGraw Hill, New York 1993, entire book for purposes indicated, and particularly for process and separation technologies analyses; Equilibrium Staged Separations, P. C. Wankat, Prentice Hall, Englewood Cliffs, N.J. USA, 1988, entire book for separation technologies teachings).
[0105] V. Genetic Modifications, Nucleotide Sequences, and Amino Acid Sequences
[0106] Embodiments of the present invention may result from introduction of an expression vector into a host microorganism, wherein the expression vector contains a nucleic acid sequence coding for an enzyme that is, or is not, normally found in a host microorganism.
[0107] The ability to genetically modify a host cell is essential for the production of any genetically modified (recombinant) microorganism. The mode of gene transfer technology may be by electroporation, conjugation, transduction, or natural transformation. A broad range of host conjugative plasmids and drug resistance markers are available. The cloning vectors are tailored to the host organisms based on the nature of antibiotic resistance markers that can function in that host. Also, as disclosed herein, a genetically modified (recombinant) microorganism may comprise modifications other than via plasmid introduction, including modifications to its genomic DNA.
[0108] It has long been recognized in the art that some amino acids in amino acid sequences can be varied without significant effect on the structure or function of proteins. Variants included can constitute deletions, insertions, inversions, repeats, and type substitutions so long as the indicated enzyme activity is not significantly adversely affected. Guidance concerning which amino acid changes are likely to be phenotypically silent can be found, inter alia, in Bowie, J. U., et al., "Deciphering the Message in Protein Sequences: Tolerance to Amino Acid Substitutions," Science 247:1306-1310 (1990). This reference is incorporated by reference for such teachings, which are, however, also generally known to those skilled in the art.
[0109] In various embodiments polypeptides obtained by the expression of the polynucleotide molecules of the present invention may have at least approximately 50%, 60%, 70%, 80%, 90%, 95%, 96%, 97%, 98%, 99% or 100% identity to one or more amino acid sequences encoded by the genes and/or nucleic acid sequences described herein for the 3-HP tolerance-related and biosynthesis pathways.
[0110] As a practical matter, whether any particular polypeptide is at least 50%, 60%, 70%, 80%, 85%, 90%, 92%, 95%, 96%, 97%, 98% or 99% identical to any reference amino acid sequence of any polypeptide described herein (which may correspond with a particular nucleic acid sequence described herein), such particular polypeptide sequence can be determined conventionally using known computer programs such the Bestfit program (Wisconsin Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, University Research Park, 575 Science Drive, Madison, Wis. 53711). When using Bestfit or any other sequence alignment program to determine whether a particular sequence is, for instance, 95% identical to a reference sequence according to the present invention, the parameters are set such that the percentage of identity is calculated over the full length of the reference amino acid sequence and that gaps in homology of up to 5% of the total number of amino acid residues in the reference sequence are allowed.
[0111] For example, in a specific embodiment the identity between a reference sequence (query sequence, i.e., a sequence of the present invention) and a subject sequence, also referred to as a global sequence alignment, may be determined using the FASTDB computer program based on the algorithm of Brutlag et al. (Comp. App. Biosci. 6:237-245 (1990)). Preferred parameters for a particular embodiment in which identity is narrowly construed, used in a FASTDB amino acid alignment, are: Scoring Scheme=PAM (Percent Accepted Mutations) 0, k-tuple=2, Mismatch Penalty=1, Joining Penalty=20, Randomization Group Length=0, Cutoff Score=1, Window Size=sequence length, Gap Penalty=5, Gap Size Penalty=0.05, Window Size=500 or the length of the subject amino acid sequence, whichever is shorter. According to this embodiment, if the subject sequence is shorter than the query sequence due to N- or C-terminal deletions, not because of internal deletions, a manual correction is made to the results to take into consideration the fact that the FASTDB program does not account for N- and C-terminal truncations of the subject sequence when calculating global percent identity. For subject sequences truncated at the N- and C-termini, relative to the query sequence, the percent identity is corrected by calculating the number of residues of the query sequence that are lateral to the N- and C-terminal of the subject sequence, which are not matched/aligned with a corresponding subject residue, as a percent of the total bases of the query sequence. A determination of whether a residue is matched/aligned is determined by results of the FASTDB sequence alignment. This percentage is then subtracted from the percent identity, calculated by the FASTDB program using the specified parameters, to arrive at a final percent identity score. This final percent identity score is what is used for the purposes of this embodiment. Only residues to the N- and C-termini of the subject sequence, which are not matched/aligned with the query sequence, are considered for the purposes of manually adjusting the percent identity score. That is, only query residue positions outside the farthest N- and C-terminal residues of the subject sequence are considered for this manual correction. For example, a 90 amino acid residue subject sequence is aligned with a 100 residue query sequence to determine percent identity. The deletion occurs at the N-terminus of the subject sequence and therefore, the FASTDB alignment does not show a matching/alignment of the first 10 residues at the N-terminus. The 10 unpaired residues represent 10% of the sequence (number of residues at the N- and C-termini not matched/total number of residues in the query sequence) so 10% is subtracted from the percent identity score calculated by the FASTDB program. If the remaining 90 residues were perfectly matched the final percent identity would be 90%. In another example, a 90 residue subject sequence is compared with a 100 residue query sequence. This time the deletions are internal deletions so there are no residues at the N- or C-termini of the subject sequence which are not matched/aligned with the query. In this case the percent identity calculated by FASTDB is not manually corrected. Once again, only residue positions outside the N- and C-terminal ends of the subject sequence, as displayed in the FASTDB alignment, which are not matched/aligned with the query sequence are manually corrected for.
[0112] More generally, nucleic acid constructs can be prepared comprising an isolated polynucleotide encoding a polypeptide having enzyme activity operably linked to one or more (several) control sequences that direct the expression of the coding sequence in a microorganism, such as E. coli, under conditions compatible with the control sequences. The isolated polynucleotide may be manipulated to provide for expression of the polypeptide. Manipulation of the polynucleotide's sequence prior to its insertion into a vector may be desirable or necessary depending on the expression vector. The techniques for modifying polynucleotide sequences utilizing recombinant DNA methods are well established in the art.
[0113] The control sequence may be an appropriate promoter sequence, a nucleotide sequence that is recognized by a host cell for expression of a polynucleotide encoding a polypeptide of the present invention. The promoter sequence contains transcriptional control sequences that mediate the expression of the polypeptide. The promoter may be any nucleotide sequence that shows transcriptional activity in the host cell of choice including mutant, truncated, and hybrid promoters, and may be obtained from genes encoding extracellular or intracellular polypeptides either homologous or heterologous to the host cell. Examples of suitable promoters for directing transcription of the nucleic acid constructs, especially in an E. coli host cell, are the lac promoter (Gronenborn, 1976, Mol. Gen. Genet. 148: 243-250), tac promoter (DeBoer et a/., 1983, Proceedings of the National Academy of Sciences USA 80: 21-25), trc promoter (Brosius et al, 1985, J. Biol. Chem. 260: 3539-3541), T7 RNA polymerase promoter (Studier and Moffatt, 1986, J. Mol. Biol. 189: 113-130), phage promoter pL (Elvin et al., 1990, Gene 87: 123-126), tetA prmoter (Skerra, 1994, Gene 151: 131-135), araBAD promoter (Guzman et al., 1995, J. Bacteriol. 177: 4121-4130), and rhaPBAD promoter (Haldimann et al., 1998, J. Bacteriol. 180: 1277-1286). Other promoters are described in "Useful proteins from recombinant bacteria" in Scientific American, 1980, 242: 74-94; and in Sambrook and Russell, "Molecular Cloning: A Laboratory Manual," Third Edition 2001 (volumes 1-3), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.
[0114] The control sequence may also be a suitable transcription terminator sequence, a sequence recognized by a host cell to terminate transcription. The terminator sequence is operably linked to the 3' terminus of the nucleotide sequence encoding the polypeptide. Any terminator that is functional in an E. coli cell may be used in the present invention. It may also be desirable to add regulatory sequences that allow regulation of the expression of the polypeptide relative to the growth of the host cell. Examples of regulatory systems are those that cause the expression of the gene to be turned on or off in response to a chemical or physical stimulus, including the presence of a regulatory compound. Regulatory systems in prokaryotic systems include the lac, tac, and trp operator systems.
[0115] For various embodiments of the invention the genetic manipulations may be described to include various genetic manipulations, including those directed to change regulation of, and therefore ultimate activity of, an enzyme or enzymatic activity of an enzyme identified in any of the respective pathways. Such genetic modifications may be directed to transcriptional, translational, and post-translational modifications that result in a change of enzyme activity and/or selectivity under selected and/or identified culture conditions and/or to provision of additional nucleic acid sequences such as to increase copy number and/or mutants of an enzyme related to 3-HP production. Specific methodologies and approaches to achieve such genetic modification are well known to one skilled in the art, and include, but are not limited to: increasing expression of an endogenous genetic element; decreasing functionality of a repressor gene; introducing a heterologous genetic element; increasing copy number of a nucleic acid sequence encoding a polypeptide catalyzing an enzymatic conversion step to produce 3-HP; mutating a genetic element to provide a mutated protein to increase specific enzymatic activity; over-expressing; under-expressing; over-expressing a chaperone; knocking out a protease; altering or modifying feedback inhibition; providing an enzyme variant comprising one or more of an impaired binding site for a repressor and/or competitive inhibitor; knocking out a repressor gene; evolution, selection and/or other approaches to improve mRNA stability as well as use of plasmids having an effective copy number and promoters to achieve an effective level of improvement. Random mutagenesis may be practiced to provide genetic modifications that may fall into any of these or other stated approaches. The genetic modifications further broadly fall into additions (including insertions), deletions (such as by a mutation) and substitutions of one or more nucleic acids in a nucleic acid of interest. In various embodiments a genetic modification results in improved enzymatic specific activity and/or turnover number of an enzyme. Without being limited, changes may be measured by one or more of the following: KM; kcat; and Kavidity.
[0116] In various embodiments, to function more efficiently, a microorganism may comprise one or more gene deletions. For example, in E. coli, the genes encoding the lactate dehydrogenase (ldhA), phosphate acetyltransferase (pta), pyruvate oxidase (poxB), and pyruvate-formate lyase (pflB) may be disrupted, including deleted. Such gene disruptions, including deletions, are not meant to be limiting, and may be implemented in various combinations in various embodiments. Gene deletions may be accomplished by mutational gene deletion approaches, and/or starting with a mutant strain having reduced or no expression of one or more of these enzymes, and/or other methods known to those skilled in the art. Gene deletions may be effectuated by any of a number of known specific methodologies, including but not limited to the RED/ET methods using kits and other reagents sold by Gene Bridges (Gene Bridges GmbH, Dresden, Germany, <<www.genebridges.com>>).
[0117] More particularly as to the latter method, use of Red/ET recombination, is known to those of ordinary skill in the art and described in U.S. Pat. Nos. 6,355,412 and 6,509,156, issued to Stewart et al. and incorporated by reference herein for its teachings of this method. Material and kits for such method are available from Gene Bridges (Gene Bridges GmbH, Dresden, Germany, <<www.genebridges.com>>), and the method may proceed by following the manufacturer's instructions. The method involves replacement of the target gene by a selectable marker via homologous recombination performed by the recombinase from X-phage. The host organism expressing λ-red recombinase is transformed with a linear DNA product coding for a selectable marker flanked by the terminal regions (generally ˜50 bp, and alternatively up to about ˜300 bp) homologous with the target gene. The marker could then be removed by another recombination step performed by a plasmid vector carrying the FLP-recombinase, or another recombinase, such as Cre.
[0118] Targeted deletion of parts of microbial chromosomal DNA or the addition of foreign genetic material to microbial chromomes may be practiced to alter a host cell's metabolism so as to reduce or eliminate production of undesired metabolic products. This may be used in combination with other genetic modifications such as described herein in this general example. In this detailed description, reference has been made to multiple embodiments and to the accompanying drawings in which is shown by way of illustration specific exemplary embodiments in which the invention may be practiced. These embodiments are described in sufficient detail to enable those skilled in the art to practice the invention, and it is to be understood that modifications to the various disclosed embodiments may be made by a skilled artisan.
[0119] Further, for 3-HP production, such genetic modifications may be chosen and/or selected for to achieve a higher flux rate through certain enzymatic conversion steps within the respective 3-HP production pathway and so may affect general cellular metabolism in fundamental and/or major ways.
[0120] It will be appreciated that amino acid "homology" includes conservative substitutions, i.e. those that substitute a given amino acid in a polypeptide by another amino acid of similar characteristics. Typically seen as conservative substitutions are the following replacements: replacements of an aliphatic amino acid such as Ala, Val, Leu and Ile with another aliphatic amino acid; replacement of a Ser with a Thr or vice versa; replacement of an acidic residue such as Asp or Glu with another acidic residue; replacement of a residue bearing an amide group, such as Asn or Gln, with another residue bearing an amide group; exchange of a basic residue such as Lys or Arg with another basic residue; and replacement of an aromatic residue such as Phe or Tyr with another aromatic residue.
[0121] For all nucleic acid and amino acid sequences provided herein, it is appreciated that conservatively modified variants of these sequences are included, and are within the scope of the invention in its various embodiments. Functionally equivalent nucleic acid and amino acid sequences (functional variants), which may include conservatively modified variants as well as more extensively varied sequences, which are well within the skill of the person of ordinary skill in the art, and microorganisms comprising these, also are within the scope of various embodiments of the invention, as are methods and systems comprising such sequences and/or microorganisms. In various embodiments, nucleic acid sequences encoding sufficiently homologous proteins or portions thereof are within the scope of the invention. More generally, nucleic acids sequences that encode a particular amino acid sequence employed in the invention may vary due to the degeneracy of the genetic code, and nonetheless fall within the scope of the invention. The following table provides a summary of similarities among amino acids, upon which conservative and less conservative substitutions may be based, and also various codon redundancies that reflect this degeneracy.
[0122] Degeneracy of the Amino Acid Code
TABLE-US-00001 Amino Acid Relationships DNA codons Alanine N, Ali GCT, GCC, GCA, GCG Proline N CCT, CCC, CCA, CCG Valine N, Ali GTT, GTC, GTA, GTG Leucine N, Ali CTT, CTC, CTA, CTG, TTA, TTG Isoleucine N, Ali ATT, ATC, ATA Methionine N ATG Phenylalanine N, Aro TTT, TTC Tryptophan N TGG Glycine PU GGT, GGC, GGA, GGG Serine PU TCT, TCC, TCA, TCG, AGT, AGC Threonine PU ACT, ACC, ACA, ACG Asparagine PU, Ami AAT, AAC Glutamine PU, Ami CAA, CAG Cysteine PU TGT, TGC Aspartic acid NEG, A GAT, GAC Glutamic acid NEG, A GAA, GAG Arginine POS, B CGT, CGC, CGA, CGG, AGA, AGG Lysine POS, B AAA, AAG Histidine POS CAT, CAC Tyrosine Aro TAT, TAC Stop Codons TAA, TAG, TGA Legend: side groups and other related properties: A = acidic; B = basic; Ali = aliphatic; Ami = amine; Aro = aromatic; N = nonpolar; PU = polar uncharged; NEG = negatively charged; POS = positively charged.
[0123] Also, variants and portions of particular nucleic acid sequences, and respective encoded amino acid sequences recited herein may be exhibit a desired functionality, e.g., enzymatic activity at a selected level, when such nucleic acid sequence variant and/or portion contains a 15 nucleotide sequence identical to any 15 nucleotide sequence set forth in the nucleic acid sequences recited herein including, without limitation, the sequence starting at nucleotide number 1 and ending at nucleotide number 15, the sequence starting at nucleotide number 2 and ending at nucleotide number 16, the sequence starting at nucleotide number 3 and ending at nucleotide number 17, and so forth. It will be appreciated that the invention also provides isolated nucleic acid that contains a nucleotide sequence that is greater than 15 nucleotides (e.g., 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30 or more nucleotides) in length and identical to any portion of the sequence set forth in nucleic acid sequences recited herein. For example, the invention provides isolated nucleic acid that contains a 25 nucleotide sequence identical to any 25 nucleotide sequence set forth in any one or more (including any grouping of) nucleic acid sequences recited herein including, without limitation, the sequence starting at nucleotide number 1 and ending at nucleotide number 25, the sequence starting at nucleotide number 2 and ending at nucleotide number 26, the sequence starting at nucleotide number 3 and ending at nucleotide number 27, and so forth. Additional examples include, without limitation, isolated nucleic acids that contain a nucleotide sequence that is 50 or more nucleotides (e.g., 100, 150, 200, 250, 300, or more nucleotides) in length and identical to any portion of any of the sequences disclosed herein. Such isolated nucleic acids can include, without limitation, those isolated nucleic acids containing a nucleic acid sequence represented in any one section of discussion and/or examples, such as regarding 3-HP production pathways, nucleic acid sequences encoding enzymes of the fatty acid synthase system, or 3-HP tolerance. For example, the invention provides an isolated nucleic acid containing a nucleic acid sequence listed herein that contains a single insertion, a single deletion, a single substitution, multiple insertions, multiple deletions, multiple substitutions, or any combination thereof (e.g., single deletion together with multiple insertions). Such isolated nucleic acid molecules can share at least 60, 65, 70, 75, 80, 85, 90, 95, 97, 98, or 99 percent sequence identity with a nucleic acid sequence listed herein (i.e., in the sequence listing).
[0124] Additional examples include, without limitation, isolated nucleic acids that contain a nucleic acid sequence that encodes an amino acid sequence that is 50 or more amino acid residues (e.g., 100, 150, 200, 250, 300, or more amino acid residues) in length and identical to any portion of an amino acid sequence listed or otherwise disclosed herein.
[0125] In addition, the invention provides isolated nucleic acid that contains a nucleic acid sequence that encodes an amino acid sequence having a variation of an amino acid sequence listed or otherwise disclosed herein. For example, the invention provides isolated nucleic acid containing a nucleic acid sequence encoding an amino acid sequence listed or otherwise disclosed herein that contains a single insertion, a single deletion, a single substitution, multiple insertions, multiple deletions, multiple substitutions, or any combination thereof (e.g., single deletion together with multiple insertions). Such isolated nucleic acid molecules can contain a nucleic acid sequence encoding an amino acid sequence that shares at least 60, 65, 70, 75, 80, 85, 90, 95, 97, 98, or 99 percent sequence identity with an amino acid sequence listed or otherwise disclosed herein.
[0126] Examples of properties that provide the bases for conservative and other amino acid substitutions are exemplified in Table 1. Accordingly, one skilled in the art may make numerous substitutions to obtain an amino acid sequence variant that exhibits a desired functionality. BLASTP, CLUSTALP, and other alignment and comparison tools may be used to assess highly conserved regions, to which fewer substitutions may be made (unless directed to alter activity to a selected level, which may require multiple substitutions). More substitutions may be made in regions recognized or believed to not be involved with an active site or other binding or structural motif. In accordance with Table 1, for example, substitutions may be made of one polar uncharged (PU) amino acid for a polar uncharged amino acid of a listed sequence, optionally considering size/molecular weight (i.e., substituting a serine for a threonine). Guidance concerning which amino acid changes are likely to be phenotypically silent can be found, inter alia, in Bowie, J. U., et Al., "Deciphering the Message in Protein Sequences: Tolerance to Amino Acid Substitutions," Science 247:1306-1310 (1990). This reference is incorporated by reference for such teachings, which are, however, also generally known to those skilled in the art. Recognized conservative amino acid substitutions comprise (substitutable amino acids following each colon of a set): ala:ser; arg:lys; asn:gln or his; asp:glu; cys:ser; gln:asn; glu:asp; gly:pro; his:asn or gln; ile:leu or val; leu:ile or val; lys:arg or gln or glu; met:leu or ile; phe:met or leu or tyr; ser:thr; thr:ser; trp:tyr; tyr:trp or phe; val:ile or leu.
[0127] It is noted that codon preferences and codon usage tables for a particular species can be used to engineer isolated nucleic acid molecules that take advantage of the codon usage preferences of that particular species. For example, the isolated nucleic acid provided herein can be designed to have codons that are preferentially used by a particular organism of interest. Numerous software and sequencing services are available for such codon-optimizing of sequences.
[0128] The invention provides polypeptides that contain the entire amino acid sequence of an amino acid sequence listed or otherwise disclosed herein. In addition, the invention provides polypeptides that contain a portion of an amino acid sequence listed or otherwise disclosed herein. For example, the invention provides polypeptides that contain a 15 amino acid sequence identical to any 15 amino acid sequence of an amino acid sequence listed or otherwise disclosed herein including, without limitation, the sequence starting at amino acid residue number 1 and ending at amino acid residue number 15, the sequence starting at amino acid residue number 2 and ending at amino acid residue number 16, the sequence starting at amino acid residue number 3 and ending at amino acid residue number 17, and so forth. It will be appreciated that the invention also provides polypeptides that contain an amino acid sequence that is greater than 15 amino acid residues (e. g., 16, 17, 18, 19, 20, 21, 22, 23, 24, 25, 26, 27, 28, 29, 30 or more amino acid residues) in length and identical to any portion of an amino acid sequence listed or otherwise disclosed herein For example, the invention provides polypeptides that contain a 25 amino acid sequence identical to any 25 amino acid sequence of an amino acid sequence listed or otherwise disclosed herein including, without limitation, the sequence starting at amino acid residue number 1 and ending at amino acid residue number 25, the sequence starting at amino acid residue number 2 and ending at amino acid residue number 26, the sequence starting at amino acid residue number 3 and ending at amino acid residue number 27, and so forth. Additional examples include, without limitation, polypeptides that contain an amino acid sequence that is 50 or more amino acid residues (e.g., 100, 150, 200, 250, 300 or more amino acid residues) in length and identical to any portion of an amino acid sequence listed or otherwise disclosed herein. Further, it is appreciated that, per above, a 15 nucleotide sequence will provide a 5 amino acid sequence, so that the latter, and higher-length amino acid sequences, may be defined by the above-described nucleotide sequence lengths having identity with a sequence provided herein.
[0129] In addition, the invention provides polypeptides that an amino acid sequence having a variation of the amino acid sequence set forth in an amino acid sequence listed or otherwise disclosed herein. For example, the invention provides polypeptides containing an amino acid sequence listed or otherwise disclosed herein that contains a single insertion, a single deletion, a single substitution, multiple insertions, multiple deletions, multiple substitutions, or any combination thereof (e.g., single deletion together with multiple insertions). Such polypeptides can contain an amino acid sequence that shares at least 60, 65, 70, 75, 80, 85, 90, 95, 97, 98 or 99 percent sequence identity with an amino acid sequence listed or otherwise disclosed herein. A particular variant amino acid sequence may comprise any number of variations as well as any combination of types of variations.
[0130] The invention includes, in various embodiments, an amino acid sequence having a variation of any of the polynucleotide and polypeptide sequences disclosed herein. As one example, variations are exemplified for the carbonic anhydrase (E. coli cynT) amino acid sequence set forth in SEQ ID NO:57. FIG. 1 provides a CLUSTAL multiple sequence alignment of the E. coli carbonic anhydrase aligned with carbonic anhydrases of eleven other species that had relatively high homology, based on low E values, in a BLASTP comparison. SEQ ID NO:57 is the fifth sequence shown. Multiple conservative and less conservative substitutions are shown (i.e., by the ":" and "." designations, respectively), which can lead to additional modifications by one skilled in the art. Thus, examples of variations of the sequence set forth in SEQ ID NO:57 include, without limitation, any variation of the sequences as set forth in FIG. 1. Such variations are provided in FIG. 1 in that a comparison of the amino acid residue (or lack thereof) at a particular position of the sequence set forth in SEQ ID NO:57 with the amino acid residue (or lack thereof) at the same aligned position of any of the other eleven amino acid sequences of FIG. 1 provides a list of specific changes for the sequence set forth in SEQ ID NO:57. For example, the "E" glutamic acid at position 14 of SEQ ID NO:57 can be substituted with a "D" aspartic acid or "N" asparagine as indicated in FIG. 1. It will be appreciated that the sequence set forth in SEQ ID NO:57 can contain any number of variations as well as any combination of types of variations. It is noted that the amino acid sequences provided in FIG. 1 can be polypeptides having carbonic anhydrase activity.
[0131] As indicated herein, polypeptides having a variant amino acid sequence can retain enzymatic activity. Such polypeptides can be produced by manipulating the nucleotide sequence encoding a polypeptide using standard procedures such as site-directed mutagenesis or various PCR techniques. As noted herein, one type of modification includes the substitution of one or more amino acid residues for amino acid residues having a similar chemical and/or biochemical property. For example, a polypeptide can have an amino acid sequence set forth in an amino acid sequence listed or otherwise disclosed herein comprising one or more conservative substitutions.
[0132] More substantial changes can be obtained by selecting substitutions that are less conservative, and/or in areas of the sequence that may be more critical, for example selecting residues that differ more significantly in their effect on maintaining: (a) the structure of the polypeptide backbone in the area of the substitution, for example, as a sheet or helical conformation; (b) the charge or hydrophobicity of the polypeptide at the target site; or (c) the bulk of the side chain. The substitutions that in general are expected to produce the greatest changes in polypeptide function are those in which: (a) a hydrophilic residue, e.g., serine or threonine, is substituted for (or by) a hydrophobic residue, e.g., leucine, isoleucine, phenylalanine, valine or alanine; (b) a cysteine or proline is substituted for (or by) any other residue; (c) a residue having an electropositive side chain, e.g., lysine, arginine, or histidine, is substituted for (or by) an electronegative residue, e.g., glutamic acid or aspartic acid; or (d) a residue having a bulky side chain, e.g., phenylalanine, is substituted for (or by) one not having a side chain, e.g., glycine. The effects of these amino acid substitutions (or other deletions or additions) can be assessed for polypeptides having enzymatic activity by analyzing the ability of the polypeptide to catalyze the conversion of the same substrate as the related native polypeptide to the same product as the related native polypeptide. Accordingly, polypeptides having 5, 10, 20, 30, 40, 50 or less conservative substitutions are provided by the invention.
[0133] Polypeptides and nucleic acids encoding polypeptides can be produced by standard DNA mutagenesis techniques, for example, M13 primer mutagenesis. Details of these techniques are provided in Sambrook and Russell, 2001. Nucleic acid molecules can contain changes of a coding region to fit the codon usage bias of the particular organism into which the molecule is to be introduced.
[0134] Alternatively, the coding region can be altered by taking advantage of the degeneracy of the genetic code to alter the coding sequence in such a way that, while the nucleic acid sequence is substantially altered, it nevertheless encodes a polypeptide having an amino acid sequence identical or substantially similar to the native amino acid sequence. For example, alanine is encoded in the open reading frame by the nucleotide codon triplet GCT. Because of the degeneracy of the genetic code, three other nucleotide codon triplets--GCA, GCC, and GCG--also code for alanine. Thus, the nucleic acid sequence of the open reading frame can be changed at this position to any of these three codons without affecting the amino acid sequence of the encoded polypeptide or the characteristics of the polypeptide. Based upon the degeneracy of the genetic code, nucleic acid variants can be derived from a nucleic acid sequence disclosed herein using standard DNA mutagenesis techniques as described herein, or by synthesis of nucleic acid sequences. Thus, for various embodiments the invention encompasses nucleic acid molecules that encode the same polypeptide but vary in nucleic acid sequence by virtue of the degeneracy of the genetic code.
[0135] The invention also provides an isolated nucleic acid that is at least about 12 bases in length (e.g., at least about 13, 14, 15, 16, 17, 18, 19, 20, 25, 30, 40, 50, 60, 100, 250, 500, 750, 1000, 1500, 2000, 3000, 4000, or 5000 bases in length) and hybridizes, under hybridization conditions, to the sense or antisense strand of a nucleic acid having a sequence listed or otherwise disclosed herein. The hybridization conditions can be moderately or highly stringent hybridization conditions. Also, in some embodiments the microorganism comprises an endogenous 3-HP production pathway (which may, in some such embodiments, be enhanced), whereas in other embodiments the microorganism does not comprise a 3-HP production pathway, but is provided with one or more nucleic acid sequences encoding polypeptides having enzymatic activity or activities to complete a pathway, described herein, resulting in production of 3-HP. In some embodiments, the particular sequences disclosed herein, or conservatively modified variants thereof, are provided to a selected microorganism, such as selected from one or more of the species and groups of species or other taxonomic groups listed herein.
[0136] VI. Specific Mutations, Genetic Modifications, and Combinations Thereof
[0137] The present invention, in its various embodiments, is directed to making one, or a combination of, genetic modifications in a microorganism to increase production therein of the commercially important compound 3-hydroxypropionic acid ("3-HP," CAS No. 503-66-2). More generally the present invention relates to methods, systems and compositions, including genetically modified microorganisms, e.g., recombinant microorganisms, comprising one or more genetic modifications directed to increased production of 3-HP based on introduction of a polynucleotide encoding identified functional variant polypeptides exhibiting increased oxaloacetate alpha-decarboxylaseoxaloacetate alpha-decarboxylase activity, and methods of making various products from 3-HP made by cultures of such microorganisms.
[0138] Choice of candidate oxaloacetate alpha-decarboxylase enzymes: The desired oxaloacetate alpha-decarboxylase activity catalyzes the conversion of oxaloacetate to malonate semialdehyde (see FIG. 2), and may be referred to more explicitly as an oxaloacetate alpha-decarboxylase. Candidate enzymes were identified and chosen according to their reported activities as decarboxylases of alpha-ketoacids and by their reported substrate preference for shorter chain molecules. The genes for pyruvate decarboxylase from Zymomonas mobilis (pdc), 2-oxoglutarate decarboxylase (SEQ ID NO:035), (oad) from Leuconostoc mesenteroides (SEQ ID NO:036) and alpha-ketoglutarate decarboxylase (kgd) from Mycobacterium tuberculosis (SEQ ID NO:037)were chosen for further evaluation.
[0139] Through such evaluation, three functional variants were identified from the 2-oxoglutarate decarboxylase (oad) of Leuconostoc mesenteroides (SEQ ID NO:036) These, respectively, have the following mutations of that sequence: N45T, R249L, D302G, V418A and L476Q (SEQ ID NO:054), T479N (SEQ ID NO:055), and R394C, D434G and T511A (SEQ ID NO:056). These respectively showed a 1.6, 2.2 and 2.8 fold improvement in enzymatic conversion of oxaloacetate to malonate semialdehyde compared with a control native enzyme lacking mutations in an in vitro system (see Examples).
[0140] In various embodiments additional genetic modifications may be made to increase metabolic flux at one or more specified enzymatic conversion steps, and/or to reduce or eliminate an enzymatic conversion along certain metabolic pathways. In some embodiments, a genetic modification is provided to a selected microorganism cell to increase an oxaloacetate alpha-decarboxylaseoxaloacetate alpha-decarboxylase enzyme activity. Such genetic modification may supplement an existing enzymatic activity, or may provide such activity in a selected microorganism that previously lacked such enzymatic activity (whether inherently or due or other genetic modifications).
[0141] As disclosed in U.S. Provisional Patent Application No. 61/246,140, filed Sep. 27, 2009 and to which priority is claimed, SEQ ID NO:001 provides one non-limiting example of an amino acid sequence of an oxaloacetate alpha-decarboxylaseoxaloacetate alpha-decarboxylase enzyme, obtained from Leuconostoc mesenteroides. This sequence is observed to have lacked a small portion of the N-terminal, and SEQ ID NO:036 is provided herein as the entire sequence.
[0142] FIG. 2 graphically depicts a reaction catalyzed by a oxaloacetate alpha-decarboxylase. A polynucleotide encoding such polypeptide sequence, or variants, including conservative variants thereof, exhibiting such enzymatic activity (hereinafter referred to as "oad-2"), may be provided into a selected microorganism to increase 3-HP production. This is described in more detail below. In various embodiments, such provision, by genetic modification, is practiced in combination with other genetic modifications so as to further increase 3-HP production.
[0143] Such sequence may be encoded by a polynucleotide that may be provided to a selected microorganism by a genetic modification, so as to provide or increase such enzymatic activity in the selected microorganism. For example, in some embodiments such genetic modification may comprise providing a plasmid, or other vector (e.g., cosmids, bacteria artificial chromosome ("BAC"), viruses (e.g., bacteriophage, animal viruses, plant viruses), and artificial chromosomes (e.g., yeast artificial chromosomes (YAC)), that comprises a polynucleotide that encodes a sequence as described herein and further comprises appropriate promoter(s), binding site(s), and stop codon, such that the amino acid sequence is expressed to a desirable level in the selected microorganism.
[0144] FIG. 3A depicts the natural metabolic pathways utilize by E. coli during bio-production which results in the natural products lactate, formate and acetate. By disruptions of appropriate genes, the production of these natural products are reduced or eliminated.
[0145] FIG. 3B depicts a proposed metabolic pathway to produce 3-HP as a bio-production product. Arrows represent enzymatic activities. Enzyme activities for FIGS. 3A and 3B are as follows: [i] glucokinase, [ii] phosphoglucose isomerase, [iii] 6-phosphofructose kinase, [iv] fructose bisphosphate aldolase, [v] triose-phosphate isomerase, [vi] glyceraldehyde 3-phosphate dehydrogenase, [vii] phosphoglycerate kinase, [viii] phosphoglycerate mutase, [ix] enolase, [x] pyruvate dehydrogenase, -, [xi] lactate dehydrogenase, [xii] pyruvate oxidase, [xiii] pyruvate-formate lyase, [xiv] phosphate acetyltransferase, [xv] acetate kinase, [xvi] phosphoenolpyruvate carboxykinase, [xvii] the proposed oxaloacetate alpha-decarboxylase (oad-2), [xviii] 3-hydroxypropionate dehydrogenase, [xix] pyruvate carboxylase.
[0146] FIG. 4 depicts a generalized embodiment comprising various metabolic modifications, some of which are optional for various embodiments of a genetically modified microorganism of the present invention. For various embodiments, an underlined gene name indicates that genetic modification is made to increase expression and/or enzymatic activity of the respective gene product (i.e., enzyme), a "Δ" before a gene name signifies that this gene is disrupted to decrease or eliminate expression and/or enzymatic activity of the respective gene product, and an asterisk indicates that this gene may be mutated to obtain a mutated form of the encoded gene product that exhibits increased enzymatic activity and/or specificity. Not all such genetic modifications need be made in a particular embodiment, and this full set of genetic modifications would be made for culture under anaerobic conditions.
[0147] As noted, the oxaloacetate alpha-decarboxylase is identified as "oad-2" or a grammatically equivalent identification. As shown in FIG. 2, oad-2 converts an oxaloacetate molecule to form malonate semialdehyde ("MSA"). The MSA is converted to 3-HP by a suitable enzyme (native, supplemented, or added), such as mmsB from Pseudomonas aeruginosa (SEQ ID NO:002) noting that the predominant reported reaction for this enzyme is conversion of 3-hydroxyisobutyrate to methylmalonate semialdehyde, and that in some embodiments mutant forms of mmsB may be produced, evaluated and identified that have greater, or more specific, activity for the conversion of MSA to 3-HP, such as by use of mutation and selection approaches described herein and/or known to those skilled in the art. Other enzymes for this latter conversion may be selected from enzymes capable of reductive conversions from MSA to 3-HP, such as a native or mutated form of ydfG or its functional equivalent in other species. This reaction is depicted in FIG.
[0148] The types of additional genetic modifications are generally divided into two groups--those that are made to increase enzymatic activity and/or specificity, generally so as to increase flux through a particular pathway and/or enzymatic conversion step, and those that are made to decrease or eliminate enzymatic activity at particular enzymatic conversion steps, thereby decreasing or eliminating conversion to particular metabolic intermediates or products. Various embodiments may include combinations of such genetic modifications from these two groups, in combination with the above-described genetic modification of a decarboxylase able to convert oxaloacetate to MSA.
[0149] Accordingly, in some embodiments, one or more of the genetic modifications in Table 1 may be provided to the selected microorganism that also is provided with the oxaloacetate alpha-decarboxylase enzymatic activity. Genetic modifications to the microorganism of these enzymes are made to increase enzymatic activity and/or specificity. One or more of these enzymatic functions may be native, and/or genetic modifications may be provided to supplement such native activity, or to provide such activity to a microorganism not previously demonstrating such activity. As described in a section below, there are various approaches to obtaining homologies that may be determined to exhibit a desired functional equivalence.
[0150] In some embodiments, one of the enzymatic activities of Table 1, the conversion of phosphoenolpyruvate ("PEP") to OAA, is provided via providing a mutated enzyme exhibiting such activity to a great level than a non-mutated enzyme. For example, the Ppc enzyme of E. coli, phosphoenolpyruvate carboxylase, may be mutated, such as by constructing a mutant library of ppc by use of an error-inducing PCR site-directed mutagenesis method. For example, use of the XL1-Red mutator strain, which is deficient in several repair mechanisms necessary for accurate DNA replication and generates mutations in plasmids at a rate 5,000 times that of the wild-type mutation rate, may be employed using appropriate materials following a manufacturer's instructions (see Stratagene QuikChange Mutagenesis Kit, Stratagene, La Jolla, Calif. USA). This technique or other techniques known to those skilled in the art, may be employed and then a population of such mutants, e.g., in a library, is evaluated, such as by a screening or selection method, to identify clones having a suitable or favorable mutation.
[0151] In other cases, such as for galP and pckA, genetic modifications can be made to increase overall expression of these protein functions in a microorganism cell. Various methods are known in the art for such types of genetic modifications, and are described in a section below.
[0152] In some embodiments a genetic modification provides increased enzymatic activity of pyruvate carboxylase (e.g., pyc from Corynebacterium glutamicum or Rhizobium etli).
[0153] Table 2 lists a number of protein functions, also providing enzyme classes and specific examples (with corresponding SEQ ID NOs.) in the group for which genetic modifications are made to reduce or eliminate the respective enzymatic activity. Once these protein functions are reduced or eliminated, more carbon and energy may flow to production of 3-HP (and also to biomass formation, to an extent, in some embodiments) rather than to the formation of metabolic products such as lactate, acetate, and formate.
[0154] Also, as gleaned from Tables 1 and 2, in various embodiments genetic modifications are made to increase enzymatic expression of galP and to decrease or eliminate enzymatic expression of all or part of the ptsHlcrr operon, or their equivalents in a selected species.
[0155] Deletions of the polynucleotides encoding the polypeptides exhibiting enzymatic activities or other protein functions in Table 2 may be made in a selected E. coli strain using a RED/ET homologous recombination method, such as with kits supplied by Gene Bridges (Gene Bridges GmbH, Heidelberg, Germany, www.genebridges.com) according to manufacturer's instructions. The successful deletion of these genes, as confirmed by standard methodologies, such as PCR, or DNA sequencing, results in a genetically modified microorganism having reduced or eliminated respective enzymatic activities or other protein functions. Methods for gene disruption in other species are known to those skilled in the art.
[0156] In view of Tables 1 and 2, it will be recognized that there are many possible combinations of increases in one or more protein functions, optionally with reductions in one or more protein functions. Protein functions can be independently varied, and any combination (i.e., a full factorial) of protein functions in Table 1 and Table 2 herein can be adjusted in various embodiments. In various embodiments of the present invention, these various combinations are provided in combination with one or more genetic modifications to provide, or to increase existing levels of, oxaloacetate alpha-decarboxylase enzymatic activity.
[0157] In various independent groupings of such embodiments, one or more protein functions selected from Table 1 may be added or increased, however excluding any substantial addition or change to any one of galP and its homologues, Ppc and its homologues, pckA and its homologues, or excluding two, three, four, or more, of such protein functions and their homologues from such smaller list or sub-list. In other independent groupings of embodiments, the various sub-lists developed from the list of Table 1 exclude one or more of the above-indicated protein functions but not their homologues.
[0158] In various independent groupings of such embodiments, one or more protein functions selected from Table 2 may be deleted or disrupted, however excluding any substantial reduction or change to any one of aceE and its homologues, aceF and its homologues, lpd and its homologues, ldhA and its homologues, pflB and its homologues, poxB and its homologues, Pta and its homologues, ptsH and its homologues, ptsI and its homologues, Crr and its homologues, pykA and its homologues, pykF and its homologues, or excluding two, three, four, five, six, seven, eight, nine, ten, eleven twelve, or more, of such protein functions and their homologues from such smaller list or sub-list. In other independent groupings of embodiments, the various sub-lists developed from the list of Table 2 exclude one or more of the above-indicated protein functions but not their homologues.
[0159] In addition to the above-described genetic modifications, in various embodiments genetic modifications also are provided to increase the pool and availability of the cofactor NADPH, and/or, consequently, the NADPH/NADP.sup.+ ratio. For example, in various embodiments for E. coli, this may be done by increasing activity, such as by genetic modification, of one or more of the following genes: pgi (in a mutated form), pntAB, overexpressed, gapA:gapN substitution/replacement, and disrupting or modifying a soluble transhydrogenase such as sthA, and/or genetic modifications of one or more of zwf, gnd, and edd.
[0160] Specific nucleic acid and amino acid sequences corresponding to the enzyme names and activities provided herein (e.g., for 3-HP production, and in Tables 1 and 2), including the claims, are readily found at widely used databases including www.metacyc.org, www.brenda-enzymes.org, and www.ncbi.gov. Also, the particular enzymatic activities by enzymes, and other protein functions (and the nucleic acid sequences encoding them) that are disclosed herein are not meant to be limiting particularly in view of the various known approaches, standard in the art, to achieve desired metabolic conversions and to identify functionally analogous enzymes in different species. Different enzymes for different species may also be found on these web sites. The enzyme functions disclosed herein may be related to the stated E.C. numbers provided herein, including Table 1 and 2, which are incorporated into this section. However, it is noted that some protein functions disclosed herein, including in the Tables 1 and 2, are not enzyme functions, and those functions are only stated in the "Protein Function" columns, and in various embodiments may be further limited by the function of the stated E. coli gene (which may be applied, in various embodiments, to functionally equivalent homologues in other species). Also, all information of Tables 1 and 2 are incorporated into the claims referring to these tables. Also, based on the teachings provided herein, it is appreciated that the members of a respective table are functionally related as to their overall metabolic effects on production of 3-HP through the 3-HP production pathway that includes oxaloacetate alpha-decarboxylase.
[0161] Thus, in various embodiments a genetically modified microorganism comprises a first set of genetic modification(s) to introduce and/or increase enzymatic activity of an oxaloacetate alpha-decarboxylase (such as SEQ ID NO:001 or SEQ ID NO:036), and optionally one or more enzymes capable of converting MSA to 3-HP (such as mmsB and a native or mutated ydfG). In some embodiments, a genetically modified microorganism demonstrates increased production of 3-HP compared to a control microorganism lacking such genetic modification(s). In some embodiments, a genetically modified microorganism comprises one or more genetic modifications to increase enzymatic activities or other protein functions identified in Table 1, and/or comprising one or more genetic modifications to reduce or eliminate enzymatic activities or other protein functions identified in Table 2, and optionally also comprising genetic modifications disclosed herein to increase microorganism NADPH pools and/or availability. Using approaches described herein, genes encoding enzymes having the enzyme activities described herein may be identified in other species, and evaluated as may be appropriate to the circumstance, toward obtaining a genetically modified microorganism of that species that comprises the genetic modifications taught herein to obtain a microorganism that exhibits higher production of 3-HP compared to a relevant control microorganism lacking the genetic modifications.
[0162] In some particular embodiments, a combination of genetic modifications are made to an E. coli strain, the combination comprising providing a polynucleotide encoding oad-2 and the combination also comprising other genetic modifications shown in FIGS. 3B and 4, as well as described for Tables 1 and 2. In some of these embodiments, at least one of additions of mmsB or a native or mutated ydfG are provided to increase enzymatic conversion of MSA to 3-HP.
[0163] Other aspects of the scope of the invention are described in the following paragraphs.
[0164] In various embodiments, to function more efficiently, a microorganism may comprise one or more gene deletions. These are summarized in Table 2, described above. For example, in E. coli, the genes encoding lactate dehydrogenase (ldhA), phosphate acetyltransferase (pta), pyruvate oxidase (poxB) and pyruvate-formate lyase (pflB), may be disrupted, including deleted. Such gene disruptions, including deletions, are not meant to be limiting, and may be implemented in various combinations in various embodiments. Gene deletions may be accomplished by mutational gene deletion approaches, and/or starting with a mutant strain having reduced or no expression of one or more of these enzymes, and/or other methods known to those skilled in the art. Gene deletions may be effectuated by any of a number of known specific methodologies, including but not limited to the RED/ET methods using kits and other reagents sold by Gene Bridges (Gene Bridges GmbH, Heidelberg, Germany, www.genebridges.com). Further, for 3-HP production, such genetic modifications may be chosen and/or selected for to achieve a higher flux rate through certain basic pathways within the respective 3-HP production pathway and so may affect general cellular metabolism in fundamental and/or major ways.
[0165] In some embodiments, the genetically modified microorganism that so possesses oxaloacetate alpha-decarboxylase genetic modification(s) additionally comprises at least one genetic modification to increase, in the genetically modified microorganism, a protein function selected from the protein functions of Table 1 (Glucose transporter function (such as by galP), phosphoenolpyruvate carboxylase, and phosphoenolpyruvate carboxykinase). In certain embodiments, the genetically modified microorganism comprises at least one genetic modification to increase two or three protein functions selected from the protein functions of Table 1.
[0166] In some embodiments, such genetically modified microorganism additionally comprises at least one genetic modification to decrease protein functions selected from the protein functions of Table 2 (pyruvate dehydrogenase E1p, dihydrolipoamide acetyltransferase, pyruvate dehydrogenase E3, lactate dehydrogenase, pyruvate formate lyase, pyruvate oxidase, phosphate acetyltransferase, histidyl phosphorylatable protein (of PTS), phosphoryl transfer protein (of PTS), polypeptide chain (of PTS), pyruvate kinase I, and pyruvate kinase II).
[0167] In various embodiments, such genetically modified microorganism comprises at least one genetic modification to decrease enzymatic activity of two, three, four, five, six, seven, eight, nine, ten, eleven, or twelve protein functions selected from the protein functions of Table 2.
[0168] It will be appreciated that, in various embodiments, there can be many possible combinations of increases in one or more protein functions of Table 1, with reductions in one or more protein functions of Table 2 in the genetically modified microorganism comprising at least one genetic modification to provide or increase oxaloacetate alpha-decarboxylase protein function (i.e, enzymatic activity). Protein functions can be independently varied, and any combination (i.e., a full factorial) of genetic modifications of protein functions in Table 1 and Table 2 herein can be adjusted by the methods taught and provided into the genetically modified microorganism.
[0169] In some embodiments, at least one genetic modification to decrease enzymatic activity is a gene disruption. In some embodiments, at least one genetic modification to decrease enzymatic activity is a gene deletion.
[0170] In various embodiments, to obtain 3-hydroxypropionic acid (3-HP) as a desired product, the genetically modified microorganism comprises a protein function effective for converting malonate semialdehyde to 3-HP. The protein function effective for converting malonate semialdehyde to 3-HP can be native to the microorganism, but that is by no means necessary.
[0171] In some embodiments, the protein function effective for converting malonate semialdehyde to 3-HP is a native or mutated form of mmsB from Pseudomonas aeruginosa, or a functional equivalent thereof. Alternatively, or additionally, this protein function can be a native or mutated form of ydfG, or a functional equivalent thereof.
[0172] Certain embodiments of the invention additionally comprise a genetic modification to increase the availability of the cofactor NADPH, which can increase the NADPH/NADP+ ratio as may be desired. Non-limiting examples for such genetic modification are pgi (E.C. 5,3.1.9, in a mutated form), pntAB (E.C. 1.6.1.2), overexpressed, gapA (E.C. 1.2.1.12):gapN (E.C. 1.2.1.9, from Streptococcus mutans) substitution/replacement, and disrupting or modifying a soluble transhydrogenase such as sthA (E.C. 1.6.1.2), and/or genetic modifications of one or more of zwf (E.C. 1.1.1.49), gnd (E.C. 1.1.1.44), and edd (E.C. 4.2.1.12). Sequences of these genes are available at www.metacyc.org.
[0173] In some embodiments, the genetic modification increases microbial synthesis of 3-HP above a rate or titer of a control microorganism lacking said at least one genetic modification to produce 3-HP. In some embodiments, the genetic modification is effective to increase enzymatic conversions to 3-HP by at least about 5 percent, at least about 10 percent, at least about 20 percent, at least about 30 percent, or at least about 50 percent above the enzymatic conversion of a control microorganism lacking the genetic modification.
[0174] Variations of the present invention also provide a culture system comprising a population of a genetically modified microorganism as described herein, and a media comprising nutrients for the population.
[0175] Variations of this invention additionally include methods. In some variations, the invention provides a method of making a genetically modified microorganism comprising providing to a selected microorganism at least one genetic modification to introduce oxaloacetate alpha-decarboxylase enzymatic activity.
[0176] In some embodiments, the method additionally comprises introducing at least one genetic modification to increase enzymatic activity of a protein function selected from the protein functions of Table 1. In various embodiments, the method comprises introducing at least one genetic modification to increase two or three protein functions selected from the protein functions of Table 1 (Glucose transporter function (such as by galP), phosphoenolpyruvate carboxylase, and phosphoenolpyruvate carboxykinase).
[0177] In some embodiments, the method additionally comprises introducing at least one genetic modification to decrease a protein function selected from the protein functions of Table 2. In various embodiments, the method comprises introducing at least one genetic modification to decrease two, three, four, five, six, seven, eight, nine, ten, eleven, or twelve protein functions selected from the protein functions of Table 2 (pyruvate dehydrogenase E1p, dihydrolipoamide acetyltransferase, pyruvate dehydrogenase E3, lactate dehydrogenase, pyruvate formate lyase, pyruvate oxidase, phosphate acetyltransferase, histidyl phosphorylatable protein (of PTS), phosphoryl transfer protein (of PTS), polypeptide chain (of PTS), pyruvate kinase I, and pyruvate kinase II).
[0178] Again, there are many possible combinations of increases in one or more protein functions, optionally with reductions in one or more protein functions, that may be provided in combination with at least one genetic modification to provide or increase oxaloacetate alpha-decarboxylase enzymatic function, so as to increase 3-HP production in a genetically modified microorganism. Protein functions can be independently varied, and any combination (i.e., a full factorial) of protein functions in Table 1 and Table 2 herein can be adjusted in various methods of the invention. Enzyme-activity and other protein function reductions can be accomplished by gene disruptions, such as gene deletions, or other modifications.
[0179] Additionally, genetic modifications and/or media supplements directed to improving tolerance to 3-HP may be provided, such as are taught in PCT Patent Publication No. US/2010/052748, published Jan. 28, 2010, and incorporated by reference for its teachings of tolerance-related genetic modifications and media supplements.
[0180] In some embodiments, the invention contemplates a culture system comprising: a) a population of a genetically modified microorganism as described herein; and b) a media comprising nutrients for the population. In some such embodiments the media additionally comprises at least 1 gram/liter of 3-HP.
[0181] As described in U.S. Provisional Patent Application No. 61/246,140, filed Sep. 27, 2009, incorporated by reference and to which priority is claimed, various combinations of genetic modifications may be implemented in various embodiments of the invention. These are described in the following paragraphs and also in Tables 1 and 2.
[0182] In some embodiments, the genetically modified microorganism that so possesses oxaloacetate alpha-decarboxylase genetic modification(s) additionally comprises at least one genetic modification to increase, in the genetically modified microorganism, a protein function selected from the protein functions of Table 1 (Glucose transporter function (such as by galP in E. coli), phosphoenolpyruvate carboxykinase (such as by pckA in E. coli), and phosphoenolpyruvate carboxylase (such as by ppc in E. coli). In certain embodiments, the genetically modified microorganism comprises at least one genetic modification to increase two, three, or four protein functions selected from the protein functions of Table 2.
[0183] In some embodiments, such genetically modified microorganism additionally comprises at least one genetic modification to decrease protein functions selected from the protein functions of Table 2, pyruvate dehydrogenase E1p, lipoate acetyltransferase/dihydrolipoamide acetyltransferase, pyruvate dehydrogenase E3 (lipoamide dehydrogenase), lactate dehydrogenase, pyruvate formate lyase, pyruvate oxidase, phosphate acetyltransferase, histidyl phosphorylatable protein (of PTS), phosphoryl transfer protein (of PTS), the polypeptide chain (of PTS), pyruvate kinase I, and pyruvate kinase II.
[0184] In various embodiments, such genetically modified microorganism comprises at least one genetic modification to decrease enzymatic activity of two, three, four, five, six, or seven protein functions selected from the protein functions of Table 2.
[0185] It will be appreciated that, in various embodiments, there can be many possible combinations of increases in one or more protein functions of Table 1, with reductions in one or more protein functions of Table 2 in the genetically modified microorganism comprising at least one genetic modification to provide or increase oxaloacetate alpha-decarboxylase protein function (i.e, enzymatic activity). Protein functions can be independently varied, and any combination (i.e., a full factorial) of genetic modifications of protein functions in Tables 1 and 2 herein can be adjusted by the methods taught and provided into said genetically modified microorganism.
[0186] In some embodiments, at least one genetic modification to decrease enzymatic activity is a gene disruption. In some embodiments, at least one genetic modification to decrease enzymatic activity is a gene deletion.
[0187] In various embodiments, to obtain 3-hydroxypropionic acid (3-HP) as a desired product, the genetically modified microorganism comprises a protein function effective for converting oxaloacetate to malonate semialdehyde, i.e., an oxaloacetate alpha-decarboxylase, and also a protein function effective for converting malonate semialdehyde to 3-HP. The latter can be native to the microorganism, but that is by no means necessary.
[0188] In some embodiments, the protein function effective for converting malonate semialdehyde to 3-HP is a native or mutated form of mmsB from Pseudomonas aeruginosa, or a functional equivalent thereof. Alternatively, or additionally, this protein function can be a native or mutated form of ydfG, or a functional equivalent thereof.
[0189] Certain embodiments of the invention additionally comprise a genetic modification to increase the availability of the cofactor NADPH, which can increase the NADPH/NADP+ ratio as may be desired. Non-limiting examples for such genetic modification are pgi (E.C. 5,3.1.9, in a mutated form), pntAB (E.C. 1.6.1.2), overexpressed, gapA (E.C. 1.2.1.12):gapN (E.C. 1.2.1.9, from Streptococcus mutans) substitution/replacement, and disrupting or modifying a soluble transhydrogenase such as sthA (E.C. 1.6.1.2), and/or genetic modifications of one or more of zwf (E.C. 1.1.1.49), gnd (E.C. 1.1.1.44), and edd (E.C. 4.2.1.12). Sequences of these genes are available at www.metacyc.org, and also are available at www.ncbi.gov.
[0190] It is appreciated that any combination of genetic modifications described herein may be used in embodiments of the present invention. Also included within the scope of the invention are isolated nucleic acid and amino acid sequences encoding or comprising the mutations to the polypeptide sequences described herein.
[0191] Additional genetic modifications may be provided in a microorganism strain of the present invention. Many such modifications may be provided to impart a particular phenotype.
[0192] As one example, a deletion, of multifunctional 2-keto-3-deoxygluconate 6-phosphate aldolase and 2-keto-4-hydroxyglutarate aldolase and oxaloacetate alpha-decarboxylase (eda in E. coli), may be provided to various strains.
[0193] For example, the ability to utilize sucrose may be provided, and this would expand the range of feed stocks that can be utilized to produce 3-HP. Common laboratory and industrial strains of E. coli, such as the strains described herein, are not capable of utilizing sucrose as the sole carbon source. Since sucrose, and sucrose-containing feed stocks such as molasses, are abundant and often used as feed stocks for the production by microbial fermentation, adding appropriate genetic modifications to permit uptake and use of sucrose may be practiced in strains having other features as provided herein. Various sucrose uptake and metabolism systems are known in the art (for example, U.S. Pat. No. 6,960,455), incorporated by reference for such teachings. These and other approaches may be provided in strains of the present invention. The examples provide at least two approaches.
[0194] Also, genetic modifications may be provided to add functionality for breakdown of more complex carbon sources, such as cellulosic biomass or products thereof, for uptake, and/or for utilization of such carbon sources. For example, numerous cellulases and cellulase-based cellulose degradation systems have been studied and characterized (see, for example, and incorporated by reference herein for such teachings, Beguin, P and Aubert, J-P (1994) FEMS Microbial. Rev. 13: 25-58; Ohima, K. et al. (1997) Biotechnol. Genet. Eng. Rev. 14: 365414).
[0195] In addition to the above-described genetic modifications, in various embodiments genetic modifications also are provided to increase the pool and availability of the cofactor NADPH, and/or, consequently, the NADPH/NADP+ ratio. For example, in various embodiments for E. coli, this may be done by increasing activity, such as by genetic modification, of one or more of the following genes--pgi (in a mutated form), pntAB, overexpressed, gapA:gapN substitution/replacement, and disrupting or modifying a soluble transhydrogenase such as sthA, and/or genetic modifications of one or more of zwf, gnd, and edd.
[0196] Any such genetic modifications may be provided to species not having such functionality, or having a less than desired level of such functionality.
[0197] More generally, and depending on the particular metabolic pathways of a microorganism selected for genetic modification, any subgroup of genetic modifications may be made to decrease cellular production of fermentation product(s) selected from the group consisting of acetate, acetoin, acetone, acrylic, malate, fatty acid ethyl esters, isoprenoids, glycerol, ethylene glycol, ethylene, propylene, butylene, isobutylene, ethyl acetate, vinyl acetate, other acetates, 1,4-butanediol, 2,3-butanediol, butanol, isobutanol, sec-butanol, butyrate, isobutyrate, 2-OH-isobutryate, 3-OH-butyrate, ethanol, isopropanol, D-lactate, L-lactate, pyruvate, itaconate, levulinate, glucarate, glutarate, caprolactam, adipic acid, propanol, isopropanol, fusel alcohols, and 1,2-propanediol, 1,3-propanediol, formate, fumaric acid, propionic acid, succinic acid, valeric acid, and maleic acid. Gene deletions may be made as disclosed generally herein, and other approaches may also be used to achieve a desired decreased cellular production of selected fermentation products.
[0198] VII. Separation and Purification of the Chemical Product 3-HP
[0199] When 3-HP is the chemical product, the 3-HP may be separated and purified by the approaches described in the following paragraphs, taking into account that many methods of separation and purification are known in the art and the following disclosure is not meant to be limiting. Osmotic shock, sonication, homogenization, and/or a repeated freeze-thaw cycle followed by filtration and/or centrifugation, among other methods, such as pH adjustment and heat treatment, may be used to produce a cell-free extract from intact cells. Any one or more of these methods also may be employed to release 3-HP from cells as an extraction step.
[0200] Further as to general processing of a bio-production broth comprising 3-HP, various methods may be practiced to remove biomass and/or separate 3-HP from the culture broth and its components. Methods to separate and/or concentrate the 3-HP include centrifugation, filtration, extraction, chemical conversion such as esterification, distillation (which may result in chemical conversion, such as dehydration to acrylic acid, under some reactive-distillation conditions), crystallization, chromatography, and ion-exchange, in various forms. Additionally, cell rupture may be conducted as needed to release 3-HP from the cell mass, such as by sonication, homogenization, pH adjustment or heating. 3-HP may be further separated and/or purified by methods known in the art, including any combination of one or more of centrifugation, liquid-liquid separations, including extractions such as solvent extraction, reactive extraction, two-phase aqueous extraction and two-phase solvent extraction, membrane separation technologies, distillation, evaporation, ion-exchange chromatography, adsorption chromatography, reverse phase chromatography and crystallization. Any of the above methods may be applied to a portion of a bio-production broth (i.e., a fermentation broth, whether made under aerobic, anaerobic, or microaerobic conditions), such as may be removed from a bio-production event gradually or periodically, or to the broth at termination of a bio-production event. Conversion of 3-HP to downstream products, such as described herein, may proceed after separation and purification, or, such as with distillation, thin-film evaporation, or wiped-film evaporation optionally also in part as a separation means.
[0201] For various of these approaches, one may apply a counter-current strategy, or a sequential or iterative strategy, such as multi-pass extractions. For example, a given aqueous solution comprising 3-HP may be repeatedly extracted with a non-polar phase comprising an amine to achieve multiple reactive extractions.
[0202] When a culture event (fermentation event) is at a point of completion, the spent broth may transferred to a separate tank, or remain in the culture vessel, and in either case the temperature may be elevated to at least 60° C. for a minimum of one hour in order to kill the microorganisms. (Alternatively, other approaches to killing the microorganisms may be practiced.) By spent broth is meant the final liquid volume comprising the initial nutrient media, cells grown from the microorganism inoculum (and possibly including some original cells of the inoculum), 3-HP, and optionally liquid additions made after providing the initial nutrient media, such as periodic additions to provide additional carbon source, etc. It is noted that the spent broth may comprise organic acids other than 3-HP, such as for example acetic acid and/or lactic acid.
[0203] A centrifugation step may then be practiced to filter out the biomass solids (e.g., dead microorganism cells). This may be achieved in a continuous or batch centrifuge, and solids removal may be at least about 80%, 85%, 90%, or 95% in a single pass, or cumulatively after two or more serial centrifugations.
[0204] An optional step is to polish the centrifuged liquid through a filter, such as microfiltration or ultrafiltration, or may comprise a filter press or other filter device to which is added a filter aid such as diatomaceous earth. Alternative or supplemental approaches to this and the centrifugation may include removal of cells by a flocculent, where the cells floc and are allowed to settle, and the liquid is drawn off or otherwise removed. A flocculent may be added to a fermentation broth after which settling of material is allowed for a time, and then separations may be applied, including but not limited to centrifugation.
[0205] After such steps, a spent broth comprising 3-HP and substantially free of solids is obtained for further processing. By "substantially free of solids" is meant that greater than 98%, 99%, or 99.5% of the solids have been removed.
[0206] In various embodiments this spent broth comprises various ions of salts, such as Na, Cl, SO4, and PO4. In some embodiments these ions may be removed by passing this spent broth through ion exchange columns, or otherwise contacting the spent broth with appropriate ion exchange material. Here and elsewhere in this document, "contacting" is taken to mean a contacting for the stated purpose by any way known to persons skilled in the art, such as, for example, in a column, under appropriate conditions that are well within the ability of persons of ordinary skill in the relevant art to determine. As but one example, these may comprise sequential contacting with anion and cation exchange materials (in any order), or with a mixed anion/cation material. This demineralization step should remove most such inorganic ions without removing the 3-HP. This may be achieved, for example, by lowering the pH sufficiently to protonate 3-HP and similar organic acids so that these acids are not bound to the anion exchange material, whereas anions, such as Cl and SO4, that remain charged at such pH are removed from the solution by binding to the resin. Likewise, positively charged ions are removed by contacting with cation exchange material. Such removal of ions may be assessed by a decrease in conductivity of the solution. Such ion exchange materials may be regenerated by methods known to those skilled in the art.
[0207] In some embodiments, the spent broth (such as but not necessarily after the previous demineralization step) is subjected to a pH elevation, after which it is passed through an ion exchange column, or otherwise contacted with an ion exchange resin, that comprises anionic groups, such as amines, to which organic acids, ionic at this pH, associate. Other organics that do not so associate with amines at this pH (which may be over 6.5, over 7.5, over 8.5, over 9.5, over 10.5, or higher pH) may be separated from the organic acids at this stage, such as by flushing with an elevated pH rinse. Thereafter elution with a lower pH and/or elevated salt content rinse may remove the organic acids. Eluting with a gradient of decreasing pH and/or increasing salt content rinses may allow more distinct separation of 3-HP from other organic acids, thereafter simplifying further processing.
[0208] This latter step of anion-exchange resin retention of organic acids may be practiced before or after the demineralization step. However, the following two approaches are alternatives to the anion-exchange resin step.
[0209] A first alternative approach comprises reactive extraction (a form of liquid-liquid extraction) as exemplified in this and the following paragraphs. The spent broth, which may be at a stage before or after the demineralization step above, is combined with a quantity of a tertiary amine such as Alamine336® (Cognis Corp., Cincinnati, Ohio USA) at low pH. Co-solvents for the Alamine336 or other tertiary amine may be added and include, but are not limited to benzene, carbon tetrachloride, chloroform, cyclohexane, disobutyl ketone, ethanol, #2 fuel oil, isopropanol, kerosene, n-butanol, isobutanol, octanol, and n-decanol that increase the partition coefficient when combined with the amine. After appropriate mixing a period of time for phase separation transpires, after which the non-polar phase, which comprises 3-HP associated with the Alamine336 or other tertiary amine, is separated from the aqueous phase.
[0210] When a co-solvent is used that has a lower boiling point than the 3-HP/tertiary amine, a distilling step may be used to remove the co-solvent, thereby leaving the 3-HP-tertiary amine complex in the non-polar phase.
[0211] Whether or not there is such a distillation step, a stripping or recovery step may be used to separate the 3-HP from the tertiary amine. An inorganic salt, such as ammonium sulfate, sodium chloride, or sodium carbonate, or a base such as sodium hydroxide or ammonium hydroxide, is added to the 3-HP/tertiary amine to reverse the amine protonation reaction, and a second phase is provided by addition of an aqueous solution (which may be the vehicle for provision of the inorganic salt). After suitable mixing, two phases result and this allows for tertiary amine regeneration and re-use, and provides the 3-HP in an aqueous solution. Alternatively, hot water may also be used without a salt or base to recover the 3HP from the amine.
[0212] In the above approach the phase separation and extraction of 3-HP to the aqueous phase can serve to concentrate the 3-HP. It is noted that chromatographic separation of respective organic acids also can serve to concentrate such acids, such as 3-HP. In similar approaches other suitable, non-polar amines, which may include primary, secondary and quaternary amines, may be used instead of and/or in combination with a tertiary amine.
[0213] A second alternative approach is crystallization. For example, the spent broth (such as free of biomass solids) may be contacted with a strong base such as ammonium hydroxide, which results in formation of an ammonium salt of 3-HP. This may be concentrated, and then ammonium-3-HP crystals are formed and may be separated, such as by filtration, from the aqueous phase. Once collected, ammonium-3-HP crystals may be treated with an acid, such as sulfuric acid, so that ammonium sulfate is regenerated, so that 3-HP and ammonium sulfate result.
[0214] Also, various aqueous two-phase extraction methods may be utilized to separate and/or concentrate a desired chemical product from a fermentation broth or later-obtained solution. It is known that the addition of polymers, such as dextran and glycol polymers, such as polyethylene glycol (PEG) and polypropylene glycol (PPG) to an aqueous solution may result in formation of two aqueous phases. In such systems a desired chemical product may segregate to one phase while cells and other chemicals partition to the other phase, thus providing for a separation without use of organic solvents. This approach has been demonstrated for some chemical products, but challenges associated with chemical product recovery from a polymer solution and low selectivities are recognized (See "Extractive Recovery of Products from Fermentation Broths," Joong Kyun Kim et al., Biotechnol. Bioprocess Eng., 1999(4)1-11, incorporated by reference for all of its teachings of extractive recovery methods).
[0215] Various substitutions and combinations of the above steps and processes may be made to obtain a relatively purified 3-HP solution. Also, methods of separation and purification disclosed in U.S. Pat. No. 6,534,679, issued Mar. 18, 2003, and incorporated by reference herein for such methods disclosures, may be considered based on a particular processing scheme. Also, in some culture events periodic removal of a portion of the liquid volume may be made, and processing of such portion(s) may be made to recover the 3-HP, including by any combination of the approaches disclosed above.
[0216] As noted, solvent extraction is another alternative. This may use any of a number of and/or combinations of solvents, including alcohols, esters, ketones, and various organic solvents. Without being limiting, after phase separation a distillation step or a secondary extraction may be employed to separate 3-HP from the organic phase.
[0217] The following published resources are incorporated by reference herein for their respective teachings to indicate the level of skill in these relevant arts, and as needed to support a disclosure that teaches how to make and use methods of industrial bio-production of 3-HP, and also industrial systems that may be used to achieve such conversion with any of the recombinant microorganisms of the present invention (Biochemical Engineering Fundamentals, 2nd Ed. J. E. Bailey and D. F. Ollis, McGraw Hill, New York, 1986, entire book for purposes indicated and Chapter 9, pp. 533-657 in particular for biological reactor design; Unit Operations of Chemical Engineering, 5th Ed., W. L. McCabe et al., McGraw Hill, New York 1993, entire book for purposes indicated, and particularly for process and separation technologies analyses; Equilibrium Staged Separations, P. C. Wankat, Prentice Hall, Englewood Cliffs, N.J. USA, 1988, entire book for separation technologies teachings).
[0218] VII. Conversion of 3-HP to Acrylic Acid and Downstream Products
[0219] As discussed herein, various embodiments described herein are related to production of a particular chemical product, 3-hydroxypropionic acid (3-HP). This organic acid, 3-HP, may be converted to various other products having industrial uses, such as but not limited to acrylic acid, esters of acrylic acid, and other chemicals obtained from 3-HP, referred to as "downstream products." Under some approaches the 3-HP may be converted to acrylic acid, acrylamide, and/or other downstream chemical products, in some instances the conversion being associated with the separation and/or purification steps. Many conversions to such downstream products are described herein. The methods of the invention include steps to produce downstream products of 3-HP.
[0220] As a C3 building block, 3-HP offers much potential in a variety of chemical conversions to commercially important intermediates, industrial end products, and consumer products. For example, 3-HP may be converted to acrylic acid, acrylates (e.g., acrylic acid salts and esters), 1,3-propanediol, malonic acid, ethyl-3-hydroxypropionate, ethyl ethoxy propionate, propiolactone, acrylamide, or acrylonitrile.
[0221] For example, methyl acrylate may be made from 3-HP via dehydration and esterification, the latter to add a methyl group (such as using methanol); acrylamide may be made from 3-HP via dehydration and amidation reactions; acrylonitrile may be made via a dehydration reaction and forming a nitrile moiety; propriolactone may be made from 3-HP via a ring-forming internal esterification reaction (eliminating a water molecule); ethyl-3-HP may be made from 3-HP via esterification with ethanol; malonic acid may be made from 3-HP via an oxidation reaction; and 1,3-propanediol may be made from 3-HP via a reduction reaction. Also, acrylic acid, first converted from 3-HP by dehydration, may be esterified with appropriate compounds to form a number of commercially important acrylate-based esters, including but not limited to methyl acrylate, ethyl acrylate, methyl acrylate, 2-ethylhexyl acrylate, butyl acrylate, and lauryl acrylate. Alternatively, 3HP may be esterified to form an ester of 3HP and then dehydrated to form the acrylate ester.
[0222] Additionally, 3-HP may be oligomerized or polymerized to form poly(3-hydroxypropionate) homopolymers, or co-polymerized with one or more other monomers to form various co-polymers. Because 3-HP has only a single stereoisomer, polymerization of 3-HP is not complicated by the stereo-specificity of monomers during chain growth. This is in contrast to (S)-2-Hydroxypropanoic acid (also known as lactic acid), which has two (D, L) stereoisomers that must be considered during its polymerizations.
[0223] As will be further described, 3-HP can be converted into derivatives starting (i) substantially as the protonated form of 3-hydroxypropionic acid; (ii) substantially as the deprotonated form, 3-hydroxypropionate; or (iii) as mixtures of the protonated and deprotonated forms. Generally, the fraction of 3-HP present as the acid versus the salt will depend on the pH, the presence of other ionic species in solution, temperature (which changes the equilibrium constant relating the acid and salt forms), and to some extent pressure. Many chemical conversions may be carried out from either of the 3-HP forms, and overall process economics will typically dictate the form of 3-HP for downstream conversion.
[0224] Also, as an example of a conversion during separation, 3-HP in an amine salt form, such as in the extraction step herein disclosed using Alamine 336 as the amine, may be converted to acrylic acid by contacting a solution comprising the 3-HP amine salt with a dehydration catalyst, such as aluminum oxide, at an elevated temperature, such as 170 to 180 C, or 180 to 190 C, or 190 to 200 C, and passing the collected vapor phase over a low temperature condenser. Operating conditions, including 3-HP concentration, organic amine, co-solvent (if any), temperature, flow rates, dehydration catalyst, and condenser temperature, are evaluated and improved for commercial purposes. Conversion of 3-HP to acrylic acid is expected to exceed at least 80 percent, or at least 90 percent, in a single conversion event. The amine may be re-used, optionally after clean-up. Other dehydration catalysts, as provided herein, may be evaluated. It is noted that U.S. Pat. No. 7,186,856 discloses data regarding this conversion approach, albeit as part of an extractive salt-splitting conversion that differs from the teachings herein. However, U.S. Pat. No. 7,186,856 is incorporated by reference for its methods, including extractive salt-splitting, the latter to further indicate the various ways 3-HP may be extracted from a microbial fermentation broth.
[0225] Further as to embodiments in which the chemical product being synthesized by the microorganism host cell is 3-HP, made as provided herein and optionally purified to a selected purity prior to conversion, the methods of the present invention can also be used to produce "downstream" compounds derived from 3-HP, such as polymerized-3-HP (poly-3-HP), acrylic acid, polyacrylic acid (polymerized acrylic acid, in various forms), methyl acrylate, acrylamide, acrylonitrile, propiolactone, ethyl 3-HP, malonic acid, and 1,3-propanediol. Numerous approaches may be employed for such downstream conversions, generally falling into enzymatic, catalytic (chemical conversion process using a catalyst), thermal, and combinations thereof (including some wherein a desired pressure is applied to accelerate a reaction).
[0226] As noted, an important industrial chemical product that may be produced from 3-HP is acrylic acid. Chemically, one of the carbon-carbon single bonds in 3-HP must undergo a dehydration reaction, converting to a carbon-carbon double bond and rejecting a water molecule. Dehydration of 3-HP in principle can be carried out in the liquid phase or in the gas phase. In some embodiments, the dehydration takes place in the presence of a suitable homogeneous or heterogeneous catalyst. Suitable dehydration catalysts are both acid and alkaline catalysts. Following dehydration, an acrylic acid-containing phase is obtained and can be purified where appropriate by further purification steps, such as by distillation methods, extraction methods, or crystallization methods, or combinations thereof.
[0227] Making acrylic acid from 3-HP via a dehydration reaction may be achieved by a number of commercial methodologies including via a distillation process, which may be part of the separation regime and which may include an acid and/or a metal ion as catalyst. More broadly, incorporated herein for its teachings of conversion of 3-HP, and other β-hydroxy carbonyl compounds, to acrylic acid and other related downstream compounds, is U.S. Patent Publication No. 2007/0219390 A1, published Sep. 20, 2007, now abandoned. This publication lists numerous catalysts and provides examples of conversions, which are specifically incorporated herein. Also among the various specific methods to dehydrate 3-HP to produce acrylic acid is an older method, described in U.S. Pat. No. 2,469,701 (Redmon). This reference teaches a method for the preparation of acrylic acid by heating 3-HP to a temperature between 130 and 190° C., in the presence of a dehydration catalyst, such as sulfuric acid or phosphoric acid, under reduced pressure. U.S. Patent Publication No. 2005/0222458 A1 (Craciun et al.) also provides a process for the preparation of acrylic acid by heating 3-HP or its derivatives. Vapor-phase dehydration of 3-HP occurs in the presence of dehydration catalysts, such as packed beds of silica, alumina, or titania. These patent publications are incorporated by reference for their methods relating to converting 3-HP to acrylic acid.
[0228] The dehydration catalyst may comprise one or more metal oxides, such as Al2O3, SiO2, or TiO2. In some embodiments, the dehydration catalyst is a high surface area Al2O3 or a high surface area silica wherein the silica is substantially SiO2. High surface area for the purposes of the invention means a surface area of at least about 50, 75, 100 m2/g, or more. In some embodiments, the dehydration catalyst may comprise an aluminosilicate, such as a zeolite.
[0229] For example, including as exemplified from such incorporated references, 3-HP may be dehydrated to acrylic acid via various specific methods, each often involving one or more dehydration catalysts. One catalyst of particular apparent value is titanium, such as in the form of titanium oxide, TiO(2). A titanium dioxide catalyst may be provided in a dehydration system that distills an aqueous solution comprising 3-HP, wherein the 3-HP dehydrates, such as upon volatilization, converting to acrylic acid, and the acrylic acid is collected by condensation from the vapor phase.
[0230] As but one specific method, an aqueous solution of 3-HP is passed through a reactor column packed with a titanium oxide catalyst maintained at a temperature between 170 and 190 C and at ambient atmospheric pressure. Vapors leaving the reactor column are passed over a low temperature condenser, where acrylic acid is collected. The low temperature condenser may be cooled to 30 C or less, 2 C or less, or at any suitable temperature for efficient condensation based on the flow rate and design of the system. Also, the reactor column temperatures may be lower, for instance when operating at a pressure lower than ambient atmospheric pressure. It is noted that Example 1 of U.S. Patent Publication No. 2007/0219390, published Sep. 20, 2007, now abandoned, provides specific parameters that employs the approach of this method. As noted, this publication is incorporated by reference for this teaching and also for its listing of catalysts that may be used in a 3-HP to acrylic acid dehydration reaction.
[0231] Further as to dehydration catalysts, the following table summarizes a number of catalysts (including chemical classes) that may be used in a dehydration reaction from 3-HP (or its esters) to acrylic acid (or acrylate esters). Such catalysts, some of which may be used in any of solid, liquid or gaseous forms, may be used individually or in any combination. This listing of catalysts is not intended to be limiting, and many specific catalysts not listed may be used for specific dehydration reactions. Further without being limiting, catalyst selection may depend on the solution pH and/or the form of 3-HP in a particular conversion, so that an acidic catalyst may be used when 3-HP is in acidic form, and a basic catalyst may be used when the ammonium salt of 3-HP is being converted to acrylic acid. Also, some catalysts may be in the form of ion exchange resins.
TABLE-US-00002 TABLE 4 Dehydration Catalysts Catalyst by Chemical Class Non-limiting Examples Acids (including H2SO4, HC1, titanic acids, metal oxide hydrates, weak and strong) metal sulfates (MSO4, where M = Zn, Sn, Ca, Ba, Ni, Co, or other transition metals), metal oxide sulfates, metal phosphates (e.g., M3, (PO4)2, where M = Ca, Ba), metal phosphates, metal oxide phosphates, carbon (e.g., transition metals on a carbon support), mineral acids, carboxylic acids, salts thereof, acidic resins, acidic zeolites, clays, SiO2/H3PO4, fluorinated Al2O3, Nb2O3/PO5-3, Nb2O3/SO4-2, Nb2O5H2O, phosphotungstic acids, phosphomolybdic acids, silicomolybdic acids, silicotungstic acids, carbon dioxide Bases (including NaOH, ammonia, polyvinylpyridine, metal weak and strong) hydroxides, Zr(OH)4, and substituted amines Oxides (generally TiO2, ZrO2, Al2O3, SiO2, ZnO2, SnO2, WO3, MnO2, metal oxides) Fe2O3, V2O5
[0232] As to another specific method using one of these catalysts, concentrated sulfuric acid and an aqueous solution comprising 3-HP are separately flowed into a reactor maintained at 150 to 165° C. at a reduced pressure of 100 mm Hg. Flowing from the reactor is a solution comprising acrylic acid. A specific embodiment of this method, disclosed in Example 1 of US2009/0076297, incorporated by reference herein, indicates a yield of acrylic acid exceeding 95 percent.
[0233] Based on the wide range of possible catalysts and knowledge in the art of dehydration reactions of this type, numerous other specific dehydration methods may be evaluated and implemented for commercial production.
[0234] The dehydration of 3-HP may also take place in the absence of a dehydration catalyst. For example, the reaction may be run in the vapor phase in the presence of a nominally inert packing such as glass, ceramic, a resin, porcelain, plastic, metallic or brick dust packing and still form acrylic acid in reasonable yields and purity. The catalyst particles can be sized and configured such that the chemistry is, in some embodiments, mass-transfer-limited or kinetically limited. The catalyst can take the form of powder, pellets, granules, beads, extrudates, and so on. When a catalyst support is optionally employed, the support may assume any physical form such as pellets, spheres, monolithic channels, etc. The supports may be co-precipitated with active metal species; or the support may be treated with the catalytic metal species and then used as is or formed into the aforementioned shapes; or the support may be formed into the aforementioned shapes and then treated with the catalytic species.
[0235] A reactor for dehydration of 3-HP may be engineered and operated in a wide variety of ways. The reactor operation can be continuous, semi-continuous, or batch. It is perceived that an operation that is substantially continuous and at steady state is advantageous from operations and economics perspectives. The flow pattern can be substantially plug flow, substantially well-mixed, or a flow pattern between these extremes. A "reactor" can actually be a series or network of several reactors in various arrangements.
[0236] For example, without being limiting, acrylic acid may be made from 3-HP via a dehydration reaction, which may be achieved by a number of commercial methodologies including via a distillation process, which may be part of the separation regime and which may include an acid and/or a metal ion as catalyst. More broadly, incorporated herein for its teachings of conversion of 3-HP, and other 0-hydroxy carbonyl compounds, to acrylic acid and other related downstream compounds, is U.S. Patent Publication No. 2007/0219390 A1, published Sep. 20, 2007, now abandoned. This publication lists numerous catalysts and provides examples of conversions, which are specifically incorporated herein.
[0237] For example, including as exemplified from such incorporated references, 3-HP may be dehydrated to acrylic acid via various specific methods, each often involving one or more dehydration catalysts. One catalyst of particular apparent value is titanium, such as in the form of titanium oxide, TiO2. A titanium dioxide catalyst may be provided in a dehydration system that distills an aqueous solution comprising 3-HP, wherein the 3-HP dehydrates, such as upon volatilization, converting to acrylic acid, and the acrylic acid is collected by condensation from the vapor phase.
[0238] As but one specific method, an aqueous solution of 3-HP is passed through a reactor column packed with a titanium oxide catalyst maintained at a temperature between 170 and 190° C. and at ambient atmospheric pressure. Vapors leaving the reactor column are passed over a low temperature condenser, where acrylic acid is collected. The low temperature condenser may be cooled to 30° C. or less, 20° C. or less, 2° C. or less, or at any suitable temperature for efficient condensation based on the flow rate and design of the system. Also, the reactor column temperatures may be lower, for instance when operating at a pressure lower than ambient atmospheric pressure. It is noted that Example 1 of U.S. Patent Publication No. 2007/0219390, published Sep. 20, 2007, now abandoned, provides specific parameters that employs the approach of this method. As noted, this publication is incorporated by reference for this teaching and also for its listing of catalysts that may be used in a 3-HP to acrylic acid dehydration reaction.
[0239] Crystallization of the acrylic acid obtained by dehydration of 3-HP may be used as one of the final separation/purification steps. Various approaches to crystallization are known in the art, including crystallization of esters.
[0240] As noted above, in some embodiments, a salt of 3-HP is converted to acrylic acid or an ester or salt thereof. For example, U.S. Pat. No. 7,186,856 (Meng et al.) teaches a process for producing acrylic acid from the ammonium salt of 3-HP, which involves a first step of heating the ammonium salt of 3-HP in the presence of an organic amine or solvent that is immiscible with water, to form a two-phase solution and split the 3-HP salt into its respective ionic constituents under conditions which transfer 3-HP from the aqueous phase to the organic phase of the solution, leaving ammonia and ammonium cations in the aqueous phase. The organic phase is then back-extracted to separate the 3-HP, followed by a second step of heating the 3-HP-containing solution in the presence of a dehydration catalyst to produce acrylic acid. U.S. Pat. No. 7,186,856 is incorporated by reference for its methods for producing acrylic acid from salts of 3-HP. Various alternatives to the particular approach disclosed in this patent may be developed for suitable extraction and conversion processes.
[0241] Methyl acrylate may be made from 3-HP via dehydration and esterification, the latter to add a methyl group (such as using methanol), acrylamide may be made from 3-HP via dehydration and amidation reactions, acrylonitrile may be made via a dehydration reaction and forming a nitrile moiety, propriolactone may be made from 3-HP via a ring-forming internal esterification reaction (eliminating a water molecule), ethyl-3-HP may be made from 3-HP via esterification with ethanol, malonic acid may be made from 3-HP via an oxidation reaction, and 1,3-propanediol may be made from 3-HP via a reduction reaction.
[0242] Malonic acid may be produced from oxidation of 3-HP as produced herein. U.S. Pat. No. 5,817,870 (Haas et al.) discloses catalytic oxidation of 3-HP by a precious metal selected from Ru, Rh, Pd, Os, Ir or Pt. These can be pure metal catalysts or supported catalysts. The catalytic oxidation can be carried out using a suspension catalyst in a suspension reactor or using a fixed-bed catalyst in a fixed-bed reactor. If the catalyst, preferably a supported catalyst, is disposed in a fixed-bed reactor, the latter can be operated in a trickle-bed procedure as well as also in a liquid-phase procedure. In the trickle-bed procedure the aqueous phase comprising the 3-HP starting material, as well as the oxidation products of the same and means for the adjustment of pH, and oxygen or an oxygen-containing gas can be conducted in parallel flow or counter-flow. In the liquid-phase procedure the liquid phase and the gas phase are conveniently conducted in parallel flow.
[0243] In order to achieve a sufficiently short reaction time, the conversion is carried out at a pH equal or greater than 6, preferably at least 7, and in particular between 7.5 and 9. According to a preferred embodiment, during the oxidation reaction the pH is kept constant, preferably at a pH in the range between 7.5 and 9, by adding a base, such as an alkaline or alkaline earth hydroxide solution. The oxidation is usefully carried out at a temperature of at least 10° C. and maximally 70° C. The flow of oxygen is not limited. In the suspension method it is important that the liquid and the gaseous phase are brought into contact by stirring vigorously. Malonic acid can be obtained in nearly quantitative yields. U.S. Pat. No. 5,817,870 is incorporated by reference herein for its methods to oxidize 3-HP to malonic acid.
[0244] 1,3-Propanediol may be produced from hydrogenation of 3-HP as produced herein. U.S. Patent Publication No. 2005/0283029 (Meng et al.) is incorporated by reference herein for its methods to hydrogenation of 3-HP, or esters of the acid or mixtures, in the presence of a specific catalyst, in a liquid phase, to prepare 1,3-propanediol. Possible catalysts include ruthenium metal, or compounds of ruthenium, supported or unsupported, alone or in combination with at least one or more additional metal(s) selected from molybdenum, tungsten, titanium, zirconium, niobium, vanadium or chromium. The ruthenium metal or compound thereof, and/or the additional metal(s), or compound thereof, may be utilized in supported or unsupported form. If utilized in supported form, the method of preparing the supported catalyst is not critical and can be any technique such as impregnation of the support or deposition on the support. Any suitable support may be utilized. Supports that may be used include, but are not limited to, alumina, titania, silica, zirconia, carbons, carbon blacks, graphites, silicates, zeolites, aluminosilicate zeolites, aluminosilicate clays, and the like.
[0245] The hydrogenation process may be carried out in liquid phase. The liquid phase includes water, organic solvents that are not hydrogenatable, such as any aliphatic or aromatic hydrocarbon, alcohols, ethers, toluene, decalin, dioxane, diglyme, n-heptane, hexane, xylene, benzene, tetrahydrofuran, cyclohexane, methylcyclohexane, and the like, and mixtures of water and organic solvent(s). The hydrogenation process may be carried out batch wise, semi-continuously, or continuously. The hydrogenation process may be carried out in any suitable apparatus. Exemplary of such apparatus are stirred tank reactors, trickle-bed reactors, high pressure hydrogenation reactors, and the like.
[0246] The hydrogenation process is generally carried out at a temperature ranging from about 20 to about 250° C., more particularly from about 100 to about 200° C. Further, the hydrogenation process is generally carried out in a pressure range of from about 20 psi to about 4000 psi. The hydrogen containing gas utilized in the hydrogenation process is, optionally, commercially pure hydrogen. The hydrogen containing gas is usable if nitrogen, gaseous hydrocarbons, or oxides of carbon, and similar materials, are present in the hydrogen containing gas. For example, hydrogen from synthesis gas (hydrogen and carbon monoxide) may be employed, such synthesis gas potentially further including carbon dioxide, water, and various impurities.
[0247] As is known in the art, it is also possible to convert 3-HP to 1,3-propanediol using biological methods. For example, 1,3-propanediol can be created from either 3-HP-CoA or 3-HP via the use of polypeptides having enzymatic activity. These polypeptides can be used either in vitro or in vivo. When converting 3-HP-CoA to 1,3-propanediol, polypeptides having oxidoreductase activity or reductase activity (e.g., enzymes from the 1.1.1.-class of enzymes) can be used. Alternatively, when creating 1,3-propanediol from 3-HP, a combination of a polypeptide having aldehyde dehydrogenase activity (e.g., an enzyme from the 1.1.1.34 class) and a polypeptide having alcohol dehydrogenase activity (e.g., an enzyme from the 1.1.1.32 class) can be used.
[0248] Another downstream production of 3-HP, acrylonitrile, may be converted from acrylic acid by various organic syntheses, including by not limited to the Sohio acrylonitrile process, a single-step method of production known in the chemical manufacturing industry
[0249] Also, addition reactions may yield acrylic acid or acrylate derivatives having alkyl or aryl groups at the carbonyl hydroxyl group. Such additions may be catalyzed chemically, such as by hydrogen, hydrogen halides, hydrogen cyanide, or Michael additions under alkaline conditions optionally in the presence of basic catalysts. Alcohols, phenols, hydrogen sulfide, and thiols are known to add under basic conditions. Aromatic amines or amides, and aromatic hydrocarbons, may be added under acidic conditions. These and other reactions are described in Ulmann's Encyclopedia of Industrial Chemistry, Acrylic Acid and Derivatives, WileyVCH Verlag GmbH, Wienham (2005), incorporated by reference for its teachings of conversion reactions for acrylic acid and its derivatives.
[0250] Acrylic acid obtained from 3-HP made by the present invention may be further converted to various chemicals, including polymers, which are also considered downstream products in some embodiments. Acrylic acid esters may be formed from acrylic acid (or directly from 3-HP) such as by condensation esterification reactions with an alcohol, releasing water. This chemistry described in Monomeric Acrylic Esters, E. H. Riddle, Reinhold, N.Y. (1954), incorporated by reference for its esterification teachings. Among esters that are formed are methyl acrylate, ethyl acrylate, n-butyl acrylate, hydroxypropyl acrylate, hydroxyethyl acrylate, isobutyl acrylate, and 2-ethylhexyl acrylate, and these and/or other acrylic acid and/or other acrylate esters may be combined, including with other compounds, to form various known acrylic acid-based polymers. Although acrylamide is produced in chemical syntheses by hydration of acrylonitrile, herein a conversion may convert acrylic acid to acrylamide by amidation.
[0251] Acrylic acid obtained from 3-HP made by the present invention may be further converted to various chemicals, including polymers, which are also considered downstream products in some embodiments. Acrylic acid esters may be formed from acrylic acid (or directly from 3-HP) such as by condensation esterification reactions with an alcohol, releasing water. This chemistry is described in Monomeric Acrylic Esters, E. H. Riddle, Reinhold, N.Y. (1954), incorporated by reference for its esterification teachings. Among esters that are formed are methyl acrylate, ethyl acrylate, n-butyl acrylate, hydroxypropyl acrylate, hydroxyethyl acrylate, isobutyl acrylate, and 2-ethylhexyl acrylate, and these and/or other acrylic acid and/or other acrylate esters may be combined, including with other compounds, to form various known acrylic acid-based polymers. Although acrylamide is produced in chemical syntheses by hydration of acrylonitrile, herein a conversion may convert acrylic acid to acrylamide by amidation.
[0252] Direct esterification of acrylic acid can take place by esterification methods known to the person skilled in the art, by contacting the acrylic acid obtained from 3-HP dehydration with one or more alcohols, such as methanol, ethanol, 1-propanol, 2-propanol, n-butanol, tert-butanol or isobutanol, and heating to a temperature of at least 50, 75, 100, 125, or 150° C. The water formed during esterification may be removed from the reaction mixture, such as by azeotropic distillation through the addition of suitable separation aids, or by another means of separation. Conversions up to 95%, or more, may be realized, as is known in the art.
[0253] Several suitable esterification catalysts are commercially available, such as from Dow Chemical (Midland, Mich. US). For example, Amberlyst® 131Wet Monodisperse gel catalyst confers enhanced hydraulic and reactivity properties and is suitable for fixed bed reactors. Amberlyst® 39Wet is a macroreticular catalyst suitable particularly for stirred and slurry loop reactors. Amberlyst® 46 is a macroporous catalyst producing less ether byproducts than conventional catalyst (as described in U.S. Pat. No. 5,426,199 to Rohm and Haas, which patent is incorporated by reference for its teachings of esterification catalyst compositions and selection considerations).
[0254] Acrylic acid, and any of its esters, may be further converted into various polymers. Polymerization may proceed by any of heat, light, other radiation of sufficient energy, and free radical generating compounds, such as azo compounds or peroxides, to produce a desired polymer of acrylic acid or acrylic acid esters. As one example, an aqueous acrylic acid solution's temperature raised to a temperature known to start polymerization (in part based on the initial acrylic acid concentration), and the reaction proceeds, the process frequently involving heat removal given the high exothermicity of the reaction. Many other methods of polymerization are known in the art. Some are described in Ulmann's Encyclopedia of Industrial Chemistry, Polyacrylamides and Poly(Acrylic Acids), WileyVCH Verlag GmbH, Wienham (2005), incorporated by reference for its teachings of polymerization reactions.
[0255] For example, the free-radical polymerization of acrylic acid takes place by polymerization methods known to the skilled worker and can be carried out either in an emulsion or suspension in aqueous solution or another solvent. Initiators, such as but not limited to organic peroxides, often are added to aid in the polymerization. Among the classes of organic peroxides that may be used as initiators are diacyls, peroxydicarbonates, monoperoxycarbonates, peroxyketals, peroxyesters, dialkyls, and hydroperoxides. Another class of initiators is azo initiators, which may be used for acrylate polymerization as well as co-polymerization with other monomers. U.S. Pat. Nos. 5,470,928; 5,510,307; 6,709,919; and 7,678,869 teach various approaches to polymerization using a number of initiators, including organic peroxides, azo compounds, and other chemical types, and are incorporated by reference for such teachings as applicable to the polymers described herein.
[0256] Accordingly, it is further possible for co-monomers, such as crosslinkers, to be present during the polymerization. The free-radical polymerization of the acrylic acid obtained from dehydration of 3-HP, as produced herein, in at least partly neutralized form and in the presence of crosslinkers is practiced in certain embodiments. This polymerization may result in hydrogels which can then be comminuted, ground and, where appropriate, surface-modified, by known techniques.
[0257] An important commercial use of polyacrylic acid is for superabsorbent polymers. This specification hereby incorporates by reference Modern Superabsorbent Polymer Technology, Buchholz and Graham (Editors), Wiley-VCH, 1997, in its entirety for its teachings regarding superabsorbent polymers components, manufacture, properties and uses. Superabsorbent polymers are primarily used as absorbents for water and aqueous solutions for diapers, adult incontinence products, feminine hygiene products, and similar consumer products. In such consumer products, superabsorbent materials can replace traditional absorbent materials such as cloth, cotton, paper wadding, and cellulose fiber. Superabsorbent polymers absorb, and retain under a slight mechanical pressure, up to 25 times or their weight in liquid. The swollen gel holds the liquid in a solid, rubbery state and prevents the liquid from leaking. Superabsorbent polymer particles can be surface-modified to produce a shell structure with the shell being more highly crosslinked. This technique improves the balance of absorption, absorption under load, and resistance to gel-blocking. It is recognized that superabsorbent polymers have uses in fields other than consumer products, including agriculture, horticulture, and medicine.
[0258] Superabsorbent polymers are prepared from acrylic acid (such as acrylic acid derived from 3-HP provided herein) and a crosslinker, by solution or suspension polymerization. Exemplary methods include U.S. Pat. Nos. 5,145,906; 5,350,799; 5,342,899; 4,857,610; 4,985,518; 4,708, 997; 5,180,798; 4,666,983; 4,734,478; and 5,331,059, each incorporated by reference for their teachings relating to superabsorbent polymers.
[0259] Among consumer products, a diaper, a feminine hygiene product, and an adult incontinence product are made with superabsorbent polymer that itself is made substantially from acrylic acid converted from 3-HP made in accordance with the present invention.
[0260] Diapers and other personal hygiene products may be produced that incorporate superabsorbent polymer made from acrylic acid made from 3-HP which is bio-produced by the teachings of the present application. The following provides general guidance for making a diaper that incorporates such superabsorbent polymer. The superabsorbent polymer first is prepared into an absorbent pad that may be vacuum formed, and in which other materials, such as a fibrous material (e.g., wood pulp) are added. The absorbent pad then is assembled with sheet(s) of fabric, generally a nonwoven fabric (e.g., made from one or more of nylon, polyester, polyethylene, and polypropylene plastics) to form diapers.
[0261] More particularly, in one non-limiting process, above a conveyor belt multiple pressurized nozzles spray superabsorbent polymer particles (such as about 400 micron size or larger), fibrous material, and/or a combination of these onto the conveyor belt at designated spaces/intervals. The conveyor belt is perforated and under vacuum from below, so that the sprayed on materials are pulled toward the belt surface to form a flat pad. In various embodiments, fibrous material is applied first on the belt, followed by a mixture of fibrous material and the superabsorbent polymer particles, followed by fibrous material, so that the superabsorbent polymer is concentrated in the middle of the pad. A leveling roller may be used toward the end of the belt path to yield pads of uniform thickness. Each pad thereafter may be further processed, such as to cut it to a proper shape for the diaper, or the pad may be in the form of a long roll sufficient for multiple diapers. Thereafter, the pad is sandwiched between a top sheet and a bottom sheet of fabric (one generally being liquid pervious, the other liquid impervious), such as on a conveyor belt, and these are attached together such as by gluing, heating or ultrasonic welding, and cut into diaper-sized units (if not previously so cut). Additional features may be provided, such as elastic components, strips of tape, etc., for fit and ease of wearing by a person. FIG. 6A depicts aspects of diaper manufacture and FIG. B depicts aspects of diaper manufacture and diaper wear. FIG. 7 provides a general, non-limiting depiction of the overall processes leading from biomass to a polymerized acrylic acid-based polymer such as may be used in diapers in accordance with embodiments of the present invention.
[0262] The ratio of the fibrous material to polymer particles is known to effect performance characteristics. In some embodiments, this ratio is between 75:25 and 90:10 (see U.S. Pat. No. 4,685,915, incorporated by reference for its teachings of diaper manufacture). Other disposable absorbent articles may be constructed in a similar fashion, such as for adult incontinence, feminine hygiene (sanitary napkins), tampons, etc. (see, for example, U.S. Pat. Nos. 5,009,653, 5,558,656, and 5,827,255 incorporated by reference for their teachings of sanitary napkin manufacture).
[0263] Low molecular-weight polyacrylic acid has uses for water treatment, flocculants, and thickeners for various applications including cosmetics and pharmaceutical preparations. For these applications, the polymer may be uncrosslinked or lightly crosslinked, depending on the specific application. The molecular weights are typically from about 200 to about 1,000,000 g/mol. Preparation of these low molecular-weight polyacrylic acid polymers is described in U.S. Pat. Nos. 3,904,685; 4,301,266; 2,798,053; and 5,093,472, each of which is incorporated by reference for its teachings relating to methods to produce these polymers.
[0264] Acrylic acid may be co-polymerized with one or more other monomers selected from acrylamide, 2-acrylamido-2-methylpropanesulfonic acid, N,N-dimethylacrylamide, N-isopropylacrylamide, methacrylic acid, and methacrylamide, to name a few. The relative reactivities of the monomers affect the microstructure and thus the physical properties of the polymer. Co-monomers may be derived from 3-HP, or otherwise provided, to produce co-polymers. Ulmann's Encyclopedia of Industrial Chemistry, Polyacrylamides and Poly(Acrylic Acids), WileyVCH Verlag GmbH, Wienham (2005), is incorporated by reference herein for its teachings of polymer and co-polymer processing.
[0265] Acrylic acid can in principle be copolymerized with almost any free-radically polymerizable monomers including styrene, butadiene, acrylonitrile, acrylic esters, maleic acid, maleic anhydride, vinyl chloride, acrylamide, itaconic acid, and so on. End-use applications typically dictate the co-polymer composition, which influences properties. Acrylic acid also may have a number of optional substitutions on it, and after such substitutions be used as a monomer for polymerization, or co-polymerization reactions. As a general rule, acrylic acid (or one of its co-polymerization monomers) may be substituted by any substituent that does not interfere with the polymerization process, such as alkyl, alkoxy, aryl, heteroaryl, benzyl, vinyl, allyl, hydroxy, epoxy, amide, ethers, esters, ketones, maleimides, succinimides, sulfoxides, glycidyl and silyl (see U.S. Pat. No. 7,678,869, incorporated by reference above, for further discussion). The following paragraphs provide a few non-limiting examples of copolymerization applications.
[0266] Paints that comprise polymers and copolymers of acrylic acid and its esters are in wide use as industrial and consumer products. Aspects of the technology for making such paints can be found in U.S. Pat. Nos. 3,687,885 and 3,891,591, incorporated by reference for its teachings of such paint manufacture. Generally, acrylic acid and its esters may form homopolymers or copolymers among themselves or with other monomers, such as amides, methacrylates, acrylonitrile, vinyl, styrene and butadiene. A desired mixture of homopolymers and/or copolymers, referred to in the paint industry as `vehicle` (or `binder`) are added to an aqueous solution and agitated sufficiently to form an aqueous dispersion that includes sub-micrometer sized polymer particles. The paint cures by coalescence of these `vehicle` particles as the water and any other solvent evaporate. Other additives to the aqueous dispersion may include pigment, filler (e.g., calcium carbonate, aluminum silicate), solvent (e.g., acetone, benzol, alcohols, etc., although these are not found in certain no VOC paints), thickener, and additional additives depending on the conditions, applications, intended surfaces, etc. In many paints, the weight percent of the vehicle portion may range from about nine to about 26 percent, but for other paints the weight percent may vary beyond this range.
[0267] Acrylic-based polymers are used for many coatings in addition to paints. For example, for paper coating latexes, acrylic acid is used from 0.1-5.0%, along with styrene and butadiene, to enhance binding to the paper and modify rheology, freeze-thaw stability and shear stability. In this context, U.S. Pat. Nos. 3,875,101 and 3,872,037 are incorporated by reference for their teachings regarding such latexes. Acrylate-based polymers also are used in many inks, particularly UV curable printing inks. For water treatment, acrylamide and/or hydroxy ethyl acrylate are commonly co-polymerized with acrylic acid to produce low molecular-weight linear polymers. In this context, U.S. Pat. Nos. 4,431,547 and 4,029,577 are incorporated by reference for their teachings of such polymers. Co-polymers of acrylic acid with maleic acid or itaconic acid are also produced for water-treatment applications, as described in U.S. Pat. No. 5,135,677, incorporated by reference for that teaching. Sodium acrylate (the sodium salt of glacial acrylic acid) can be co-polymerized with acrylamide (which may be derived from acrylic acid via amidation chemistry) to make an anionic co-polymer that is used as a flocculant in water treatment.
[0268] For thickening agents, a variety of co-monomers can be used, such as described in U.S. Pat. Nos. 4,268,641 and 3,915,921, incorporated by reference for description of these co-monomers. U.S. Pat. No. 5,135,677 describes a number of co-monomers that can be used with acrylic acid to produce water-soluble polymers, and is incorporated by reference for such description.
[0269] Also as noted, some conversions to downstream products may be made enzymatically. For example, 3-HP may be converted to 3-HP-CoA, which then may be converted into polymerized 3-HP with an enzyme having polyhydroxyacid synthase activity (EC 2.3.1.-). Also, 1,3-propanediol can be made using polypeptides having oxidoreductase activity or reductase activity (e.g., enzymes in the EC 1.1.1.-class of enzymes). Alternatively, when creating 1,3-propanediol from 3HP, a combination of (1) a polypeptide having aldehyde dehydrogenase activity (e.g., an enzyme from the 1.1.1.34 class) and (2) a polypeptide having alcohol dehydrogenase activity (e.g., an enzyme from the 1.1.1.32 class) can be used. Polypeptides having lipase activity may be used to form esters. Enzymatic reactions such as these may be conducted in vitro, such as using cell-free extracts, or in vivo.
[0270] Thus, various embodiments of the present invention, such as methods of making a chemical, include conversion steps to any such noted downstream products of microbially produced 3-HP, including but not limited to those chemicals described herein and in the incorporated references (the latter for jurisdictions allowing this). For example, one embodiment is making 3-HP molecules by the teachings herein and further converting the 3-HP molecules to polymerized-3-HP (poly-3-HP) or acrylic acid, and such as from acrylic acid then producing from the 3-HP molecules any one of polyacrylic acid (polymerized acrylic acid, in various forms), methyl acrylate, acrylamide, acrylonitrile, propiolactone, ethyl 3-HP, malonic acid, 1,3-propanediol, ethyl acrylate, n-butyl acrylate, hydroxypropyl acrylate, hydroxyethyl acrylate, isobutyl acrylate, 2-ethylhexyl acrylate, and acrylic acid or an acrylic acid ester to which an alkyl or aryl addition is made, and/or to which halogens, aromatic amines or amides, and aromatic hydrocarbons are added.
[0271] Also as noted, some conversions to downstream products may be made enzymatically. For example, 3-HP may be converted to 3-HP-CoA, which then may be converted into polymerized 3-HP with an enzyme having polyhydroxyacid synthase activity (EC 2.3.1.-). Also, 1,3-propanediol can be made using polypeptides having oxidoreductase activity or reductase activity (e.g., enzymes in the EC 1.1.1.-class of enzymes). Alternatively, when creating 1,3-propanediol from 3HP, a combination of (1) a polypeptide having aldehyde dehydrogenase activity (e.g., an enzyme from the 1.1.1.34 class) and (2) a polypeptide having alcohol dehydrogenase activity (e.g., an enzyme from the 1.1.1.32 class) can be used. Polypeptides having lipase activity may be used to form esters. Enzymatic reactions such as these may be conducted in vitro, such as using cell-free extracts, or in vivo.
[0272] Thus, various embodiments of the present invention, such as methods of making a chemical, include conversion steps to any such noted downstream products of microbially produced 3-HP, including but not limited to those chemicals described herein and in the incorporated references (the latter for jurisdictions allowing this). For example, one embodiment is making 3-HP molecules by the teachings herein and further converting the 3-HP molecules to polymerized-3-HP (poly-3-HP) or acrylic acid, and such as from acrylic acid then producing from the 3-HP molecules any one of polyacrylic acid (polymerized acrylic acid, in various forms), methyl acrylate, acrylamide, acrylonitrile, propiolactone, ethyl 3-HP, malonic acid, 1,3-propanediol, ethyl acrylate, n-butyl acrylate, hydroxypropyl acrylate, hydroxyethyl acrylate, isobutyl acrylate, 2-ethylhexyl acrylate, and acrylic acid or an acrylic acid ester to which an alkyl or aryl addition is made, and/or to which halogens, aromatic amines or amides, and aromatic hydrocarbons are added.
[0273] Reactions that form downstream compounds such as acrylates or acrylamides can be conducted in conjunction with use of suitable stabilizing agents or inhibiting agents reducing likelihood of polymer formation. See, for example, U.S. Patent Publication No. 2007/0219390 A1. Stabilizing agents and/or inhibiting agents include, but are not limited to, e.g., phenolic compounds (e.g., dimethoxyphenol (DMP) or alkylated phenolic compounds such as di-tert-butyl phenol), quinones (e.g., t-butyl hydroquinone or the monomethyl ether of hydroquinone (MEHQ)), and/or metallic copper or copper salts (e.g., copper sulfate, copper chloride, or copper acetate). Inhibitors and/or stabilizers can be used individually or in combinations as will be known by those of skill in the art. Also, in various embodiments, the one or more downstream compounds is/are recovered at a molar yield of up to about 100 percent, or a molar yield in the range from about 70 percent to about 90 percent, or a molar yield in the range from about 80 percent to about 100 percent, or a molar yield in the range from about 90 percent to about 100 percent. Such yields may be the result of single-pass (batch or continuous) or iterative separation and purification steps in a particular process.
[0274] Acrylic acid and other downstream products are useful as commodities in manufacturing, such as in the manufacture of consumer goods, including diapers, textiles, carpets, paint, adhesives, and acrylic glass.
[0275] In some embodiments, the invention contemplates a culture system comprising: a) a population of a genetically modified microorganism as described herein; and b) a media comprising nutrients for the population. In some such embodiments the media additionally comprises at least 1 gram/liter of 3-HP.
[0276] The teachings and results of the Examples that follow are hereby incorporated into this section for all applicable purposes.
[0277] Also, and more generally, in accordance with disclosures, discussions, examples and embodiments herein, there may be employed conventional molecular biology, cellular biology, microbiology, and recombinant DNA techniques within the skill of the art. Such techniques are explained fully in the literature. (See, e.g., Sambrook and Russell, Molecular Cloning: A Laboratory Manual, Third Edition 2001 (volumes 1-3), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y.; Animal Cell Culture, R. I. Freshney, ed., 1986). These published resources are incorporated by reference herein for their respective teachings of standard laboratory methods found therein. Further, all patents, patent applications, patent publications, and other publications referenced herein (collectively, "published resource(s)") are hereby incorporated by reference in this application. Such incorporation, at a minimum, is for the specific teaching and/or other purpose that may be noted when citing the reference herein. If a specific teaching and/or other purpose is not so noted, then the published resource is specifically incorporated for the teaching(s) indicated by one or more of the title, abstract, and/or summary of the reference. If no such specifically identified teaching and/or other purpose may be so relevant, then the published resource is incorporated in order to more fully describe the state of the art to which the present invention pertains, and/or to provide such teachings as are generally known to those skilled in the art, as may be applicable. However, it is specifically stated that a citation of a published resource herein shall not be construed as an admission that such is prior art to the present invention. Also, in the event that one or more of the incorporated published resources differs from or contradicts this application, including but not limited to defined terms, term usage, described techniques, or the like, this application controls.
[0278] While various embodiments of the present invention have been shown and described herein, it is emphasized that such embodiments are provided by way of example only. Numerous variations, changes and substitutions may be made without departing from the invention herein in its various embodiments. Specifically, and for whatever reason, for any grouping of compounds, nucleic acid sequences, polypeptides including specific proteins including functional enzymes, metabolic pathway enzymes or intermediates, elements, or other compositions, or concentrations stated or otherwise presented herein in a list, table, or other grouping (such as metabolic pathway enzymes shown in a figure), unless clearly stated otherwise, it is intended that each such grouping provides the basis for and serves to identify various subset embodiments, the subset embodiments in their broadest scope comprising every subset of such grouping by exclusion of one or more members (or subsets) of the respective stated grouping. Moreover, when any range is described herein, unless clearly stated otherwise, that range includes all values therein and all sub-ranges therein. Accordingly, it is intended that the invention be limited only by the spirit and scope of appended claims, and of later claims, and of either such claims as they may be amended during prosecution of this or a later application claiming priority hereto.
EXAMPLES
Example 1
Preparing a Genetically Modified E. Coli Host Cell Comprising an Oxaloacetate Alpha-Decarboxylase (oad-2) in Combination with Other Genetic Modifications to Increase 3-HP Production Relative to a Control E. Coli Cell (Prophetic)
[0279] Referring to FIG. 4 and Tables 1 and 2, genetic modifications are made to introduce a polypeptide that encodes SEQ ID NO:001 or SEQ ID NO:036, or its functional equivalents, and also to introduce a vector that encodes mmsB (SEQ ID NO:002). Vectors comprising galP and a native or mutated Ppc also may be introduced by methods known to those skilled in the art (see, e.g., Sambrook and Russell, Molecular Cloning: A Laboratory Manual, Third Edition 2001 (volumes 1-3), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y., "Sambrook and Russell, 2001"), additionally recognizing that mutations may be made by a method using the XL1-Red mutator strain, using appropriate materials following a manufacturer's instructions (Stratagene QuikChange Mutagenesis Kit, Stratagene, La Jolla, Calif. USA) and selected for or screened under standard protocols.
[0280] Also, genetic modifications are made to reduce or eliminate the enzymatic activities of the E. coli genes listed in Table 2. These genetic modifications are achieved by using the RED/ET homologous recombination method with kits supplied by Gene Bridges (Gene Bridges GmbH, Dresden, Germany, www.genebridges.com) according to manufacturer's instructions.
[0281] The so-genetically modified microorganism is evaluated and found to exhibit higher productivity of 3-HP compared with a control E. coli lacking said genetic modifications. Productivity is measured by standard metrics, such as volumetric productivity (grams of 3-HP/hour) under similar culture conditions.
Example 2
General Example of Genetic Modification to a Host Cell (Prophetic and Non-Specific)
[0282] This example is meant to describe a non-limiting approach to genetic modification of a selected microorganism to introduce a nucleic acid sequence of interest. Alternatives and variations are provided within this general example. The methods of this Example are conducted to achieve a combination of desired genetic modifications in a selected microorganism species, such as a combination of genetic modifications selected from those shown in FIG. 4, and their equivalents in species other than E. coli.
[0283] A gene or other nucleic acid sequence segment of interest is identified in a particular species (such as E. coli as described above) and a nucleic acid sequence comprising that gene or segment is obtained. For clarity below the use of the term "segment of interest" below is meant to include both a gene and any other nucleic acid sequence segment of interest. One example of a method used to obtain a segment of interest is to acquire a culture of a microorganism, where that microorganism's genome includes the gene or nucleic acid sequence segment of interest.
[0284] Based on the nucleic acid sequences at the ends of or adjacent the ends of the segment of interest, 5' and 3' nucleic acid primers are prepared. Each primer is designed to have a sufficient overlap section that hybridizes with such ends or adjacent regions. Such primers may include enzyme recognition sites for restriction digest of transposase insertion that could be used for subsequent vector incorporation or genomic insertion. These sites are typically designed to be outward of the hybridizing overlap sections. Numerous contract services are known that prepare primer sequences to order (e.g., Integrated DNA Technologies, Coralville, Iowa USA).
[0285] Once primers are designed and prepared, polymerase chain reaction (PCR) is conducted to specifically amplify the desired segment of interest. This method results in multiple copies of the region of interest separated from the microorganism's genome. The microorganism's DNA, the primers, and a thermophilic polymerase are combined in a buffer solution with potassium and divalent cations (e.g., Mg or Mn) and with sufficient quantities of deoxynucleoside triphosphate molecules. This mixture is exposed to a standard regimen of temperature increases and decreases. However, temperatures, components, concentrations, and cycle times may vary according to the reaction according to length of the sequence to be copied, annealing temperature approximations and other factors known or readily learned through routine experimentation by one skilled in the art.
[0286] In an alternative embodiment the segment of interest may be synthesized, such as by a commercial vendor, and prepared via PCR per above, rather than obtaining from a microorganism or other natural source of DNA.
[0287] The nucleic acid sequences then are purified and separated, such as on an agarose gel via electrophoresis. Optionally, once the region is purified it can be validated by standard DNA sequencing methodology and may be introduced into a vector. Any of a number of vectors may be used, which generally comprise markers known to those skilled in the art, and standard methodologies are routinely employed for such introduction. Commonly used vector systems are pSMART (Lucigen, Middleton, Wis.), pET E. coli EXPRESSION SYSTEM (Stratagene, La Jolla, Calif.), pSC-B StrataClone Vector (Stratagene, La Jolla, Calif.), pRANGER-BTB vectors (Lucigen, Middleton, Wis.), and TOPO vector (Invitrogen Corp, Carlsbad, Calif., USA). Similarly, the vector then is introduced into any of a number of host cells. Commonly used host cells are E. cloni 10G (Lucigen, Middleton, Wis.), E. cloni 10GF' (Lucigen, Middleton, Wis.), StrataClone Competent cells (Stratagene, La Jolla, Calif.), E. coli BL21, E. coli BW25113, and E. coli K12 MG1655. Some of these vectors possess promoters, such as inducible promoters, adjacent the region into which the sequence of interest is inserted (such as into a multiple cloning site), while other vectors, such as pSMART vectors (Lucigen, Middleton, Wis.), are provided without promoters and with dephosporylated blunt ends. The culturing of such plasmid-laden cells permits plasmid replication and thus replication of the segment of interest, which often corresponds to expression of the segment of interest.
[0288] Various vector systems comprise a selectable marker, such as an expressible gene encoding a protein needed for growth or survival under defined conditions. Common selectable markers contained on backbone vector sequences include genes that encode for one or more proteins required for antibiotic resistance as well as genes required to complement auxotrophic deficiencies or supply critical nutrients not present or available in a particular culture media. Vectors also comprise a replication system suitable for a host cell of interest.
[0289] The plasmids containing the segment of interest can then be isolated by routine methods and are available for introduction into other microorganism host cells of interest. Various methods of introduction are known in the art and can include vector introduction or genomic integration. In various alternative embodiments the DNA segment of interest may be separated from other plasmid DNA if the former will be introduced into a host cell of interest by means other than such plasmid.
[0290] While steps of the above general prophetic example involve use of plasmids, other vectors known in the art may be used instead. These include cosmids, viruses (e.g., bacteriophage, animal viruses, plant viruses), and artificial chromosomes (e.g., yeast artificial chromosomes (YAC) and bacteria artificial chromosomes (BAC)).
[0291] Host cells into which the segment of interest is introduced may be evaluated for performance as to a particular enzymatic step, and/or tolerance or bio-production of a chemical compound of interest. Selections of better performing genetically modified host cells may be made, selecting for overall performance, tolerance, or production or accumulation of the chemical of interest.
[0292] It is noted that this procedure may incorporate a nucleic acid sequence for a single gene (or other nucleic acid sequence segment of interest), or multiple genes (under control of separate promoters or a single promoter), and the procedure may be repeated to create the desired heterologous nucleic acid sequences in expression vectors, which are then supplied to a selected microorganism so as to have, for example, a desired complement of enzymatic conversion step functionality for any of the herein-disclosed metabolic pathways. However, it is noted that although many approaches rely on expression via transcription of all or part of the sequence of interest, and then translation of the transcribed mRNA to yield a polypeptide such as an enzyme, certain sequences of interest may exert an effect by means other than such expression.
[0293] The specific laboratory methods used for the above approaches are well-known in the art and may be found in various references known to those skilled in the art, such as Sambrook and Russell, Molecular Cloning: A Laboratory Manual, Third Edition 2001 (volumes 1-3), Cold Spring Harbor Laboratory Press, Cold Spring Harbor, N.Y. (hereinafter, Sambrook and Russell, 2001).
[0294] As an alternative to the above, other genetic modifications may also be practiced, such as a deletion of a nucleic acid sequence of the host cell's genome. One non-limiting method to achieve this is by use of Red/ET recombination, known to those of ordinary skill in the art and described in U.S. Pat. Nos. 6,355,412 and 6,509,156, issued to Stewart et al. and incorporated by reference herein for its teachings of this method. Material and kits for such method are available from Gene Bridges (Gene Bridges GmbH, Dresden, Germany, www.genebridges.com), and the method may proceed by following the manufacturer's instructions. Targeted deletion of genomic DNA may be practiced to alter a host cell's metabolism so as to reduce or eliminate production of undesired metabolic products, such as gene deletions for particular genes shown in FIG. 4. This may be used in combination with other genetic modifications such as described above in this general example.
[0295] The above-described approaches and compositions may be combined with other approaches and compositions that are directed to providing, or to improving, a desired property (e.g. productivity improvements in a microorganism). Various bio-production pathways, and methods of incorporating them into a microorganism, are known in the art and also are described in other patent applications having a common co-inventor with the present patent application. Any of such may be combined with any of the above-described approaches and combinations.
Example 3
Mutational Development of Selected Polynucleotides (Prophetic)
[0296] A selected gene sequence is subjected to a mutation development protocol, starting by constructing a mutant library of a native or previously evolved and/or codon-optimized polynucleotide by use of an error-inducing PCR site-directed mutagenesis method.
[0297] A polynucleotide exhibiting enzymatic activity of the selected gene (which may be any disclosed herein, e.g., a decarboxylase or mmsB) will be cloned into an appropriate expression system for E. coli. This sequence may be codon optimized. Cloning of a codon-optimized polynucleotide and its adequate expression of the will be accomplished via gene synthesis supplied from a commercial supplier using standard techniques. The gene will be synthesized with an eight amino acid C-terminal tag to enable affinity based protein purification. Once obtained using standard methodology, the gene will be cloned into an expression system using standard techniques.
[0298] The plasmid containing the above-described polynucleotide will be mutated by standard methods resulting in a large library of mutants (>106). The mutant sequences will be excised from these plasmids and again cloned into an expression vector, generating a final library of greater than 106 clones for subsequent screening. These numbers ensure a greater than 99% probability that the library will contain a mutation in every amino acid encoded by sequence. It is acknowledged that each method of creating a mutational library has its own biases, including transformation into mutator strains of E. coli, error prone PCR, and in addition more site directed mutagenesis. In some embodiments, various methods may be considered and possibly several explored in parallel.
[0299] One such method is the use of the XL1-Red mutator strain, which is deficient in several repair mechanisms necessary for accurate DNA replication and generates mutations in plasmids at a rate 5,000 times that of the wild-type mutation rate, may be employed using appropriate materials following a manufacturer's instructions (See Stratagene QuikChange Mutagenesis Kit, Stratagene, La Jolla, Calif. USA). This technique or other techniques known to those skilled in the art, may be employed and then a population of such mutants, e.g., in a library, is evaluated, such as by a screening or selection method, to identify clones having a suitable or favorable mutation.
[0300] With the successful construction of a mutant library, it will be possible to screen this library for increased activity. The screening process will be designed to screen the entire library of greater than 106 mutants. This is done by screening methods suited to the particular enzymatic reaction.
Example 4
Cloning of Oxaloacetate Alpha-Decarboxylase Candidate Enzymes
[0301] Candidate oxaloacetate alpha-decarboxylase genes were synthesized using codons optimized for expression in E. coli and sub-cloned into expression vectors to provide protein for oxaloacetate alpha-decarboxylase assays. For gene synthesis, the protein sequences for pyruvate decarboxylase (pdc) from Zymomonas mobilis (SEQ ID NO:035), 2-oxoglutarate decarboxylase (oad) from Leuconostoc mesenteroides (SEQ ID NO:036), and alpha-ketoglutarate decarboxylase (kgd) from Mycobacterium tuberculosis (SEQ ID NO:037) were used to create codon optimized genes for expression in E. coli by the service provided by DNA 2.0 (Menlo Park, Calif. USA). Additionally, the protein coding regions of the codon optimized PDC and OAD genes were augmented with six histidines and five histidines, respectively. These additional amino acids provide a C-terminal histidine-tag for protein purification via immobilized metal-affinity chromatography. These plasmids were designated pJ201:pdc (SEQ ID NO:038), pJ251:oad (SEQ ID NO:039), and pJ206:kgd (SEQ ID NO:040), respectively.
[0302] The pdc and oad genes were individually sub-cloned into a pTrc-HisB expression vector obtained from Invitrogen (SEQ ID NO:041) (Carlsbad, Calif.). Sub-cloning for pdc gene was accomplished as follows: First, the expression vector was subjected to enzymatic restriction digestion with the restriction enzymes NcoI and NheI. Restriction enzymes were obtained from New England BioLabs (Ipswich, Mass. USA), and used according to manufacturer's instructions. The digestion mixtures were separated by agarose gel electrophoresis, and visualized under UV transillumination as described in the Common Methods Section. Agarose gel slices containing the DNA piece corresponding to the cut vector product were cut from the gel and the DNA recovered with a standard gel extraction protocol and components from Qiagen according to manufacturer's instructions. The gene inserts were amplified by polymerase chain reactions, cut with restriction enzymes, purified, and ligated to the cut vector. For the pdc gene containing insert, a polymerase chain reaction was performed with the forward primer being GGGATATCAT GAGCTATACC GTTGG (SEQ ID NO:042), and the reverse primer being GAAATAGTTC TCTAGAGAAG CTTC (SEQ ID NO:043) and the pJ201:pdc plasmid was used as template. These primers provide a BspHI site at the start codon and an XbaI site after the stop codon. This PCR product was prepared for restriction digest using a PCR purification kit from Qiagen Corporation (Valencia, Calif., USA) using the manufacturer's instructions. The PCR product was prepared by restriction digest with BspHI and XbaI obtained from New England BioLabs (Ipswich, Mass. USA), and used according to manufacturer's instructions. After digestion, mixtures were separated by agarose gel electrophoresis, and visualized under UV transillumination as described in the Common Methods Section. An agarose gel slice containing the DNA piece corresponding to the amplified pdc gene fusion was cut from the gel and the DNA recovered with a standard gel extraction protocol and components from Qiagen according to manufacturer's instructions. The recovered product and the prepared vector backbone were ligated together with T4 DNA ligase obtained from New England BioLabs (Ipswich, Mass. USA) according to manufacturer's instructions. Ligation mixtures were transformed into chemically competent 100 cells (Lucigen, Middleton, Wis.) according to the manufacturer's instructions. Colonies possibly containing the new plasmid were cultured, and their DNA was isolated using a standard miniprep protocol and components from Qiagen (Valencia, Calif. USA) according to the manufacturer's instruction. Isolated plasmids were checked by restriction digests and confirmed by sequencing. The sequenced-verified isolated plasmids produced with this procedure were designated pTrc:pdc-his (SEQ ID NO:044).
[0303] Sub-cloning for oad gene was accomplished as follows: First, the expression vector was subjected to enzymatic restriction digestion with the restriction enzymes NcoI and NheI. Restriction enzymes were obtained from New England BioLabs (Ipswich, Mass. USA), and used according to manufacturer's instructions. The digestion mixtures were separated by agarose gel electrophoresis, and visualized under UV transillumination as described in the Common Methods Section. Agarose gel slices containing the DNA piece corresponding to the cut vector product were cut from the gel and the DNA recovered with a standard gel extraction protocol and components from Qiagen according to manufacturer's instructions. The gene inserts were amplified by polymerase chain reactions, cut with restriction enzymes, purified, and ligated to the cut vector. For the oad gene containing insert, a polymerase chain reaction was performed with the forward primer being GGAGAATTACCATGGCGGATACCCTG (SEQ ID NO:045), and the reverse primer being GGGAATCTAG ACTAATGATG ATGGTGG (SEQ ID NO:046), and the pJ251:oad plasmid was used as template. These primers provide an NcoI site and the start codon and an XbaI site after the stop codon. This PCR product was prepared for restriction digest purified using a PCR purification kit from Qiagen Corporation (Valencia, Calif., USA) using the manufacturer's instructions. The PCR product was prepared by restriction digest with NcoI and XbaI obtained from New England BioLabs (Ipswich, Mass. USA), and used according to manufacturer's instructions. After digestion, mixtures were separated by agarose gel electrophoresis, and visualized under UV transillumination as described in the Common Methods Section. An agarose gel slice containing the DNA piece corresponding to the amplified pdc gene fusion was cut from the gel and the DNA recovered with a standard gel extraction protocol and components from Qiagen according to manufacturer's instructions. The recovered product and the prepared vector backbone were ligated together with T4 DNA ligase obtained from New England BioLabs (Ipswich, Mass. USA) according to manufacturer's instructions. Ligations were transformed into chemically competent 100 cells from Lucigen according to the manufacturer's instructions. Colonies possibly containing the new plasmid were cultured, and their DNA was isolated using a standard miniprep protocol and components from Qiagen (Valencia, Calif. USA) according to the manufacturer's instruction. Isolated plasmids were checked by restriction digests and confirmed by sequencing. The sequenced-verified isolated plasmids produced with this procedure were designated pTrc:oad-his (SEQ ID NO:047).
[0304] The kgd gene was sub-cloned into a pKK223:cterm-His expression vector. Sub-cloning for kdg gene was accomplished as follows: First, the pKK223:c-term-His expression vector (SEQ ID NO:050) was subjected to enzymatic restriction digestion with the restriction enzymes NcoI and HindIII. Restriction enzymes were obtained from New England BioLabs (Ipswich, Mass. USA), and used according to manufacturer's instructions. The digestion mixtures were separated by agarose gel electrophoresis, and visualized under UV transillumination as described in the Common Methods Section. Agarose gel slices containing the DNA piece corresponding to the cut vector product were cut from the gel and the DNA recovered with a standard gel extraction protocol and components from Qiagen according to manufacturer's instructions. The gene inserts were amplified by polymerase chain reactions, cut with restriction enzymes, purified, and ligated to the cut vector. For the kgd gene containing insert, a polymerase chain reaction was performed with the forward primer being AGGTTCCCA TGGTGACTCA GGACCCG (SEQ ID NO:048), and the reverse primer being GTAAGCTTAG TGGTGATGGT GATGACCGAA CGCTTCGTCC (SEQ ID NO:049), and the pJ206:kgd plasmid was used as template. These primers provide an NcoI site and the start codon, and penta-histidine coding sequence for affinity purification of protein followed by a stop codon as well as a HindIll site. This PCR product was prepared for restriction digest purified using a PCR purification kit from Qiagen Corporation (Valencia, Calif., USA) using the manufacturer's instructions. The PCR product was prepared by restriction digest with NcoI and HindIIII obtained from New England BioLabs (Ipswich, Mass. USA), and used according to manufacturer's instructions. After digestion, mixtures were separated by agarose gel electrophoresis, and visualized under UV transillumination as described in the Common Methods Section. An agarose gel slice containing the DNA piece corresponding to the amplified pdc gene fusion was cut from the gel and the DNA recovered with a standard gel extraction protocol and components from Qiagen according to manufacturer's instructions. The recovered product and the prepared vector backbone were ligated together with T4 DNA ligase obtained from New England BioLabs (Ipswich, Mass. USA) according to manufacturer's instructions. Ligations were transformed into chemically competent 100 cells from Lucigen according to the manufacturer's instructions. Colonies possibly containing the new plasmid were cultured, and their DNA was isolated using a standard miniprep protocol and components from Qiagen (Valencia, Calif. USA) according to the manufacturer's instruction. Isolated plasmids were checked by restriction digests and confirmed by sequencing. The sequenced-verified isolated plasmids produced with this procedure were designated pKK223:kgd-his (SEQ ID NO:053).
Example 5
Preparation and Evaluation of Over-Expressed Oxaloacetate Alpha-Decarboxylase Candidate Enzymes and Selected Mutant Enzymes
[0305] In order to evaluate candidate enzymes for native oxaloacetate alpha-decarboxylase function, each of the protein was purified. Plasmids containing each of the genes were individually transformed into electro-competent BW25113 cells. For each enzyme that was tested, starter cultures from single colonies were grown overnight at 37° C. with 250 rpm agitation in 20 mL of LB media with 200 ug/mL ampicillin selection. The next morning, these cultures were used to inoculate a 1 L of TB media with 200 ug/mL ampicillin selection. Cultures were grown at 37° C. with 250 rpm agitation. Protein induction was initiated when the optical density of the culture measured at 600 nm reached between 0.5 and 0.7--(about 4 to 6 hr) with addition of IPTG to a final concentration of 1 mM. The cultures allowed to grow overnight at 30° C. The cells were harvested by centrifugation (6000×g for 15 minutes) and kept frozen at -70° C. until purification.
[0306] Purification was performed as follows: Cells were thawed on ice and then resuspended in 40 mL of Buffer A (25 mM Trizma base pH 8.0, 500 mM NaCl, and 1 mM Imidazole) supplemented with lysozyme (Sigma-Aldrich, St. Louis, Mo.), Dnase I (Sigma-Aldrich, St. Louis, Mo.), and Complete EDTA-free protease inhibitor cocktail (EMD, Gibbstown, N.J.). Resuspend cells were incubated on ice for at least 5 minutes or until the culture showed signs of lysis. To completely lyse the cells, the suspension was sonicated (85% power on a Branson probe tip sonicator) with five 10 second burst intervened by resting on ice for at least a minute. The cell lysate was diluted to 50 mL with Buffer A and clarified by centrifugation at 10000×g for 15 minutes.
[0307] Proteins were purified from the lysate as follows: The clarified lysate was subjected to affinity purification with a nickel-charged fast flow column (Pharmacia). Clarified cell lysates was loaded on to the column at 1 mL per minute. The column was then washed with at least 50 column volumes of Buffer A until the flow through had an absorption at 260 nm equal to that of Buffer A. The protein was eluted with Elution Buffer (25 mM Trizma base pH 8.0, 100 mM NaCl, and 300 mM Imidazole) at 1 mL per minute and collected in 1.0 mL fractions. Fractions containing protein were identified using the BioRad Total Protein Assay method. Fractions showing protein were combined and concentrated by centrifugation using a Amicon ultra concentration unit (10 kDa cutoff) (Millipore, Milford Mass.). Concentrated proteins were further purified to remove imidazole using a pre-equilibrated (25 mM Trizma base pH 8.0, 100 mM NaCl) PD Miditrap G25 column (GE Lifesciences, Piscataway, N.J.) according the manufacturer's instructions. Proteins were concentrated a final time, glycerol was added to about 10% volume, protein aliquots were snap frozen on dry ice, and the proteins were stored at -70° C. until use. Final protein concentrations were measured using BioRad Total Protein Assay (BioRad, Hercules, Calif.)
[0308] The purification of each candidate protein was confirmed by SDS-PAGE and by SDS-PAGE followed by visualization via western blot analysis. Protein samples were prepared for SDS-PAGE by dilution in 100 uL of SDS sample buffer (Tris-Cl pH6.8, SDS, glycerol, β-mercaptoethanol, Bromophenol blue), boiled for 5 minutes and spun at 17,000G for 5 minutes. A portion of the sample was loaded on a 10% pre-cast SDS-PAGE gel (BioRad Ready Gel Tris-HCl Gel-161-1101). Electrophoresis was carried out using a BioRad Mini-Protean II system according to manufacturer's instructions. SDS gels were either stained with Coomassie Stain or transferred to nitrocellulose membrane using the same BioRad Mini-Protean II wet transfer system according to manufacturer's specifications for western blot visualization. Membranes were blocked for 1 hour at room temperature using PBST (NaCl, KCl, Na2HPO4, KH2PO4, Tween 20)+5% w/v nonfat dry milk Blots were then probed with a rabbit polyclonal anti-6× HIS-HRP antibody (AbCam Ab1187, 1:5000 dilution) in PBST+5% w/v nonfat dry milk for 1 hour at room temperature, washed 4 times in PBST for 5 minutes, and followed by developing with TMB substrate (Promega TMB Stabilized Substrate for HRP, cat #W4121). Protein expression was assessed by the presence or absence of bands at the expected molecular weight for each proteins of interest. The presence of each enzyme was confirmed in this manner (FIG. 8).
Example 6
Evaluation of Native Oxaloacetate Alpha-Decarboxylase Function Via a Spectrophotometric Assay to Determine the Specific Activities of the Native Candidate Enzymes
[0309] A coupled assay was developed in order to biochemically assess the oxaloacetate alpha-decarboxylase activity of the candidate enzymes. The coupled assay used exogenously-added purified E. coli YdfG protein, which converts malonate semialdehyde to 3-HP with the concomitant conversion of NADPH to NADP+ (FIG. 5). As the reduced form, NADPH, has a strong absorbance peak at 340 nm and the oxidized form does not, it is possible to monitor the progress of these coupled reactions via a spectrophotometric assay. 3-HP is also detectable by standard analytical methods, for example HPLC and GS-MS.
[0310] Assays were performed as 200 microliter reactions in a 96-well plate format using a Molecular Dynamics SpectraMax 384 microplate reader with SoftmaxPro software (Molecular Dynamics, Sunnyvale, Calif.) to quantitate the rate of change in the 340 nm absorbance. All assays were conducted at 37° C., and the instrument was allowed to mix the plate for 1 second prior to each measurements. The progress of each reaction was monitored for 20 minutes during which measurements were made every 20 seconds. Reaction conditions consisted of 50 mM PIPES pH 6.8 (unless otherwise stated), 2 mM thymidine pyrophosphate, 10 mM dithiothreitol, 4 mM magnesium chloride, 1 mM NADPH (EMD Bioscience), 0.075 mg/mL purified YdfG (Molecular Throughput), and 5 mM oxaloacetate (unless otherwise stated). All chemicals were obtained from Sigma-Aldrich (St. Louis, Mo.) unless otherwise stated. Reactions were initiated with the addition of 10 microliters of concentrated purified protein to each reaction well. In addition, a negative control was performed with each experimental set to control for the rate of background NADPH oxidation. Once the reaction time course was read and the slopes of each well were calculated, the specific activities over the negative control of each candidate protein were calculated. All values reported are the average specific activities measured in triplicate. The following table shows the native specific activities of PDC and OAD determined by this method at varying oxaloacetate concentrations.
TABLE-US-00003 Specific Activities as Measured by the Spectrophotometric Assay at pH 6.8 average PDC Average OAD [OAA] Specific activity standard Specific activity standard (mM) (units/mg) deviation (units/mg) deviation 10 0.0076 0.0024 0.0144 0.0012. 5 0.003 0.0001 0.006 0.0003 2.5 m 0.0025 0.0003 0.003 0.0007 1 0.0015 0.0002 0.0017 0.0009 0.5 0.002 0.0002 0 0.0004 0.1 0.0004 0.0003 0.0014 0.001
[0311] Activity of KGD was not measured by the spectrophotometric assay. These reactions were subjected to GC-MS analysis to confirm that 3-HP was produced, and that the candidate oxaloacetate alpha-decarboxylase enzymes were able to convert oxaloacetate to malonate semialdehyde. These results are shown in the following table. 3-HP was produced by PDC, OAD, and KGD in the presence of 5 mM oxaloacetate. These results show that these enzymes possess low levels of native oxaloacetate alpha-decarboxylase function.
TABLE-US-00004 Production of 3HP from spectrophotometric assay samples Enzyme Beginning [OAA] Final [3HP] (g/L) Pdc (pH 6.0) 5 mM 0.041 +/- 0.003 Oad2 (pH 6.8) 5 mM 0.004 +/- 0.001 Kgd (pH 6.8) 5 mM 0.004 +/- 0.001
Example 7
Confirmation of Oxaloacetate Alpha-Decarboxylase Enzyme Activity--Using GC-MS--
[0312] Conformation of 3-HP production as well as assessment of the pH dependence of each candidate enzyme was performed as described above, except that coupled reactions with YdfG were evaluated for 3-HP production by GC-MS. Reaction conditions consisted of 50 mM PIPES pH 6.8, 2 mM thymidine pyrophosphate, 10 mM dithiothreitol, 4 mM magnesium chloride, 1 mM NADPH (EMD Bioscience), 0.075 mg/mL purified YdfG (Molecular Throughput), and 5 mM oxaloacetate (unless otherwise stated). All chemicals were obtained from Sigma-Aldrich unless otherwise stated. Reactions were performed at 0.6 mL volumes in triplicate and were initiated with the addition of 600 micrograms of purified protein. In addition, a negative control lacking protein was also performed with each set. Reactions were allowed to proceed at 37° C. for at least 12 hours.
[0313] The following method is used for GC-MS analysis of 3-HP. Soluble monomeric 3-HP is quantified using GC-MS after a single exaction with ethyl acetate from a reaction sample. Once the 3-HP has been extracted into the ethyl acetate, the active hydrogens on the 3-HP are replaced with trimethylsilyl groups using N,O-Bis-(Trimethylsilyl)trifluoroacetamide to make the compound volatile for GC analysis. A standard curve of known 3-HP concentrations is prepared at the beginning of the run and a known quantity of ketohexanoic acid (1 g/L) is added to both the standards and the samples to act as an internal standard for Quantitation, with tropic acid as an additional internal standard. The 3-HP content of individual samples is then assayed by examining the ratio of the ketohexanoic acid ion (m/z=247) to the 3-HP ion (219) and compared to the standard curve. 3-HP is quantified using a 3HP standard curve at the beginning of the run and the data are analyzed using HP Chemstation. The GC-MS system consists of a Hewlett Packard model 5890 GC and Hewlett Packard model 5972 MS. The column is Supelco SPB-1 (60 m×0.32 mm×0.25 um film thickness). The capillary coating is a non-polar methylsilicone. The carrier gas is helium at a flow rate of 1 mL/min. The 3-HP as derivatized is separated from other components in the ethyl acetate extract using either of two similar temperature regimes. In a first temperature gradient regime, the column temperature starts with 40° C. for 1 minute, then is raised at a rate of 10° C./minute to 235° C., and then is raised at a rate of 50° C./minute to 300° C. In a second temperature regime, which was demonstrated to process samples more quickly, the column temperature starts with 70° C. which is held for 1 min, followed by a ramp-up of 10° C./minute to 235° C. which is followed by a ramp-up of 50° C./minute to 300° C. FIG. 9 shows the detected amount of each candidate enzyme at pH 5.5, pH 6.0, and pH 6.8. PDC showed optimal activity at pH 5.5 while OAD and KGD showed optimal activity at pH 6.8.
Example 8
Variants with Increased Oxaloacetate Alpha-Decarboxylase Activity
[0314] Three variants derived from the oad gene were identified that had an increase in oxaloacetate alpha-decarboxylase activity when evaluated in the spectrophotometric assay. Results are provided in the following table. The variants had 1.6 fold, 2.2 fold, and 2.8 fold increases in oxaloacetate alpha-decarboxylase specific activity, respectively. The variants carried the following changes relative to the parent oad gene: N45T, R249L, D302G, V418A, and L476Q; T479N; and: R394C, D434G, and T511A, respectively, where the first amino acid is the one found in the parent sequence at the specified site, and the second amino acid is the one found at that site in the variant (using single-letter codes for the amino acid). It is known in the art that various combinations of any and all of these mutations may be used to produce enzymes with increased oxaloacetate alpha-decarboxylase activity over the parental OAD enzyme. As is also known in the art, substitutions of other amino acids at these sites could also increase oxaloacetate alpha-decarboxylase specific activity relative to that of the parental enzyme.
TABLE-US-00005 Average Specific Activity Standard Fold Enzymes Mutations (units/mg) Deviation Increase Native Oad None 0.0078 0.0008 -- Mutant Oad 1 N45T, R249L, 0.0121 0.0001 1.6 D302G, V418A, and L476Q Mutant Oad2 T479N 0.0173 0.0009 2.2 Mutant Oad 3 R394C, D434G, and 0.0222 0.0012 2.8 T511A
[0315] The following are non-limiting general prophetic examples directed to practicing the present invention in other microorganism species.
[0316] General Prophetic Example 9
Improvement of 3-HP Tolerance and/or Bio-Production in Rhodococcus Erythropolis
[0317] A series of E. coli-Rhodococcus shuttle vectors are available for expression in R. erythropolis, including, but not limited to, pRhBR17 and pDA71 (Kostichka et al., Appl. Microbiol. Biotechnol. 62:61-68(2003)). Additionally, a series of promoters are available for heterologous gene expression in R. erythropolis (see for example Nakashima et al., Appl. Environ. Microbiol. 70:5557-5568 (2004), and Tao et al., Appl. Microbiol. Biotechnol. 2005, DOI 10.1007/s00253-005-0064). Targeted gene disruption of chromosomal genes in R. erythropolis may be created using the method described by Tao et al., supra, and Brans et al. (Appl. Environ. Microbiol. 66: 2029-2036 (2000)). These published resources are incorporated by reference for their respective indicated teachings and compositions.
[0318] The nucleic acid sequences required for providing an increase in 3-HP tolerance, as described herein, optionally with nucleic acid sequences to provide and/or improve a 3-HP biosynthesis pathway, are cloned initially in pDA71 or pRhBR71 and transformed into E. coli. The vectors are then transformed into R. erythropolis by electroporation, as described by Kostichka et al., supra. The recombinants are grown in synthetic medium containing glucose and the tolerance to and/or bio-production of 3-HP are followed using methods known in the art or described herein.
General Prophetic Example 10
Improvement of 3-HP Tolerance and/or Bio-Production in B. Licheniformis
[0319] Most of the plasmids and shuttle vectors that replicate in B. subtilis are used to transform B. licheniformis by either protoplast transformation or electroporation. The nucleic acid sequences required for improvement of 3-HP tolerance, and/or for 3-HP biosynthesis are isolated from various sources, codon optimized as appropriate, and cloned in plasmids pBE20 or pBE60 derivatives (Nagarajan et al., Gene 114:121-126 (1992)). Methods to transform B. licheniformis are known in the art (for example see Fleming et al. Appl. Environ. Microbiol., 61(11):3775-3780 (1995)). These published resources are incorporated by reference for their respective indicated teachings and compositions.
[0320] The plasmids constructed for expression in B. subtilis are transformed into B. licheniformis to produce a recombinant microorganism that then demonstrates improved 3-HP tolerance, and, optionally, 3-HP bio-production.
General Prophetic Example 11
Improvement of 3-HP Tolerance and/or Bio-Production in Paenibacillus Macerans
[0321] Plasmids are constructed as described herein for expression in B. subtilis and used to transform Paenibacillus macerans by protoplast transformation to produce a recombinant microorganism that demonstrates improved 3-HP tolerance, and, optionally, 3-HP bio-production.
General Prophetic Example 12
Expression of 3-HP Tolerance and/or Bio-Production in Alcaligenes (Ralstonia) Eutrophus (Currently Referred to as Cupriavidus Necator)
[0322] Methods for gene expression and creation of mutations in Alcaligenes eutrophus are known in the art (see for example Taghavi et al., Appl. Environ. Microbiol., 60(10):3585-3591 (1994)). This published resource is incorporated by reference for its indicated teachings and compositions. Any of the nucleic acid sequences identified to improve 3-HP tolerance, and/or for 3-HP biosynthesis are isolated from various sources, codon optimized as appropriate, and cloned in any of the broad host range vectors described herein, and electroporated to generate recombinant microorganisms that demonstrate improved 3-HP tolerance, and, optionally, 3-HP bio-production. The poly(hydroxybutyrate) pathway in Alcaligenes has been described in detail, a variety of genetic techniques to modify the Alcaligenes eutrophus genome is known, and those tools can be applied for engineering a 3-HP toleragenic or, optionally, a 3-HP-gena-toleragenic recombinant microorganism.
General Prophetic Example 13
Improvement of 3-HP Tolerance and/or Bio-Production in Pseudomonas Putida
[0323] Methods for gene expression in Pseudomonas putida are known in the art (see for example Ben-Bassat et al., U.S. Pat. No. 6,586,229, which is incorporated herein by reference for these teachings). Any of the nucleic acid sequences identified to improve 3-HP tolerance, and/or for 3-HP biosynthesis are isolated from various sources, codon optimized as appropriate, and cloned in any of the broad host range vectors described herein, and electroporated to generate recombinant microorganisms that demonstrate improved 3-HP tolerance, and, optionally, 3-HP biosynthetic production. For example, these nucleic acid sequences are inserted into pUCP18 and this ligated DNA are electroporated into electrocompetent Pseudomonas putida KT2440 cells to generate recombinant P. putida microorganisms that exhibit increased 3-HP tolerance and optionally also comprise 3-HP biosynthesis pathways comprised at least in part of introduced nucleic acid sequences.
General Prophetic Example 14
Improvement of 3-HP Tolerance and/or Bio-Production in Lactobacillus Plantarum
[0324] The Lactobacillus genus belongs to the Lactobacillales family and many plasmids and vectors used in the transformation of Bacillus subtilis and Streptococcus are used for Lactobacillus. Non-limiting examples of suitable vectors include pAMβ1 and derivatives thereof (Renault et al., Gene 183:175-182 (1996); and O'Sullivan et al., Gene 137:227-231 (1993)); pMBB1 and pHW800, a derivative of pMBB1 (Wyckoff et al. Appl. Environ. Microbiol 62:1481-1486 (1996)); pMG1, a conjugative plasmid (Tanimoto et al., J. Bacteriol. 184:5800-5804 (2002)); pNZ9520 (Kleerebezem et al., Appl. Environ. Microbiol. 63:4581-4584 (1997)); pAM401 (Fujimoto et al., Appl. Environ. Microbiol. 67:1262-1267 (2001)); and pAT392 (Arthur et al., Antimicrob. Agents Chemother. 38:1899-1903 (1994)). Several plasmids from Lactobacillus plantarum have also been reported (e.g., van Kranenburg R, Golic N, Bongers R, Leer R J, de Vos W M, Siezen R J, Kleerebezem M. Appl. Environ. Microbiol. 2005 March; 71(3): 1223-1230).
General Prophetic Example 15
Improvement of 3-HP Tolerance and/or Bio-Production in Enterococcus faecium, Enterococcus gallinarium, and Enterococcus faecalis
[0325] The Enterococcus genus belongs to the Lactobacillales family and many plasmids and vectors used in the transformation of Lactobacillus, Bacillus subtilis, and Streptococcus are used for Enterococcus. Non-limiting examples of suitable vectors include pAMβ1 and derivatives thereof (Renault et al., Gene 183:175-182 (1996); and O'Sullivan et al., Gene 137:227-231 (1993)); pMBB1 and pHW800, a derivative of pMBB1 (Wyckoff et al. Appl. Environ. Microbiol. 62:1481-1486 (1996)); pMG1, a conjugative plasmid (Tanimoto et al., J. Bacteriol. 184:5800-5804 (2002)); pNZ9520 (Kleerebezem et al., Appl. Environ. Microbiol. 63:4581-4584 (1997)); pAM401 (Fujimoto et al., Appl. Environ. Microbiol. 67:1262-1267 (2001)); and pAT392 (Arthur et al., Antimicrob. Agents Chemother. 38:1899-1903 (1994)). Expression vectors for E. faecalis using the nisA gene from Lactococcus may also be used (Eichenbaum et al., Appl. Environ. Microbiol. 64:2763-2769 (1998). Additionally, vectors for gene replacement in the E. faecium chromosome are used (Nallaapareddy et al., Appl. Environ. Microbiol. 72:334-345 (2006)).
[0326] For each of the General Prophetic Examples 65-71, the following 3-HP bio-production comparison may be incorporated thereto: Using analytical methods for 3-HP such as are described in Subsection III of Common Methods Section, 3-HP is obtained in a measurable quantity at the conclusion of a respective bio-production event conducted with the respective recombinant microorganism (see types of bio-production events, incorporated by reference into each respective General Prophetic Example). That measurable quantity is substantially greater than a quantity of 3-HP produced in a control bio-production event using a suitable respective control microorganism lacking the functional 3-HP pathway so provided in the respective General Prophetic Example. Tolerance improvements also may be assessed by any recognized comparative measurement technique, such as by using a MIC protocol provided in the Common Methods Section.
[0327] Common Methods Section
[0328] All methods in this Section are provided for incorporation into the Examples where so referenced.
[0329] Subsection I. Microorganism Species and Strains, Cultures, and Growth Media
[0330] Bacterial species, that may be utilized as needed, are as follows:
[0331] Acinetobacter calcoaceticus (DSMZ #1139) is obtained from the German Collection of Microorganisms and Cell Cultures (Braunschweig, Germany) as a vacuum dried culture. Cultures are then resuspended in Brain Heart Infusion (BHI) Broth (RPI Corp, Mt. Prospect, Ill., USA). Serial dilutions of the resuspended A. calcoaceticus culture are made into BHI and are allowed to grow for aerobically for 48 hours at 37° C. at 250 rpm until saturated.
[0332] Bacillus subtilis is a gift from the Gill lab (University of Colorado at Boulder) and is obtained as an actively growing culture. Serial dilutions of the actively growing B. subtilis culture are made into Luria Broth (RPI Corp, Mt. Prospect, Ill., USA) and are allowed to grow for aerobically for 24 hours at 37° C. at 250 rpm until saturated.
[0333] Chlorobium limicola (DSMZ #245) is obtained from the German Collection of Microorganisms and Cell Cultures (Braunschweig, Germany) as a vacuum dried culture. Cultures are then resuspended using Pfennig's Medium I and II (#28 and 29) as described per DSMZ instructions. C. limicola is grown at 25° C. under constant vortexing.
[0334] Citrobacter braakii (DSMZ #30040) is obtained from the German Collection of Microorganisms and Cell Cultures (Braunschweig, Germany) as a vacuum dried culture. Cultures are then resuspended in Brain Heart Infusion (BHI) Broth (RPI Corp, Mt. Prospect, Ill., USA). Serial dilutions of the resuspended C. braakii culture are made into BHI and are allowed to grow for aerobically for 48 hours at 30° C. at 250 rpm until saturated.
[0335] Clostridium acetobutylicum (DSMZ #792) is obtained from the German Collection of Microorganisms and Cell Cultures (Braunschweig, Germany) as a vacuum dried culture. Cultures are then resuspended in Clostridium acetobutylicum medium (#411) as described per DSMZ instructions. C. acetobutylicum is grown anaerobically at 37° C. at 250 rpm until saturated.
[0336] Clostridium aminobutyricum (DSMZ #2634) is obtained from the German Collection of Microorganisms and Cell Cultures (Braunschweig, Germany) as a vacuum dried culture. Cultures are then resuspended in Clostridium aminobutyricum medium (#286) as described per DSMZ instructions. C. aminobutyricum is grown anaerobically at 37° C. at 250 rpm until saturated.
[0337] Clostridium kluyveri (DSMZ #555) is obtained from the German Collection of Microorganisms and Cell Cultures (Braunschweig, Germany) as an actively growing culture. Serial dilutions of C. kluyveri culture are made into Clostridium kluyveri medium (#286) as described per DSMZ instructions. C. kluyveri is grown anaerobically at 37° C. at 250 rpm until saturated.
[0338] Cupriavidus metallidurans (DMSZ #2839) is obtained from the German Collection of Microorganisms and Cell Cultures (Braunschweig, Germany) as a vacuum dried culture. Cultures are then resuspended in Brain Heart Infusion (BHI) Broth (RPI Corp, Mt. Prospect, Ill., USA). Serial dilutions of the resuspended C. metallidurans culture are made into BHI and are allowed to grow for aerobically for 48 hours at 30° C. at 250 rpm until saturated.
[0339] Cupriavidus necator (DSMZ #428) is obtained from the German Collection of Microorganisms and Cell Cultures (Braunschweig, Germany) as a vacuum dried culture. Cultures are then resuspended in Brain Heart Infusion (BHI) Broth (RPI Corp, Mt. Prospect, Ill., USA). Serial dilutions of the resuspended C. necator culture are made into BHI and are allowed to grow for aerobically for 48 hours at 30° C. at 250 rpm until saturated. As noted elsewhere, previous names for this species are Alcaligenes eutrophus and Ralstonia eutrophus.
[0340] Desulfovibrio fructosovorans (DSMZ #3604) is obtained from the German Collection of Microorganisms and Cell Cultures (Braunschweig, Germany) as a vacuum dried culture. Cultures are then resuspended in Desulfovibrio fructosovorans medium (#63) as described per DSMZ instructions. D. fructosovorans is grown anaerobically at 37° C. at 250 rpm until saturated.
[0341] Escherichia coli Crooks (DSMZ #1576) is obtained from the German Collection of Microorganisms and Cell Cultures (Braunschweig, Germany) as a vacuum dried culture. Cultures are then resuspended in Brain Heart Infusion (BHI) Broth (RPI Corp, Mt. Prospect, Ill., USA). Serial dilutions of the resuspended E. coli Crooks culture are made into BHI and are allowed to grow for aerobically for 48 hours at 37° C. at 250 rpm until saturated.
[0342] Escherichia coli K12 is a gift from the Gill lab (University of Colorado at Boulder) and is obtained as an actively growing culture. Serial dilutions of the actively growing E. coli K12 culture are made into Luria Broth (RPI Corp, Mt. Prospect, Ill., USA) and are allowed to grow for aerobically for 24 hours at 37° C. at 250 rpm until saturated.
[0343] Halobacterium salinarum (DSMZ #1576) is obtained from the German Collection of Microorganisms and Cell Cultures (Braunschweig, Germany) as a vacuum dried culture. Cultures are then resuspended in Halobacterium medium (#97) as described per DSMZ instructions. H. salinarum is grown aerobically at 37° C. at 250 rpm until saturated.
[0344] Lactobacillus delbrueckii (#4335) is obtained from WYEAST USA (Odell, Oreg., USA) as an actively growing culture. Serial dilutions of the actively growing L. delbrueckii culture are made into Brain Heart Infusion (BHI) broth (RPI Corp, Mt. Prospect, Ill., USA) and are allowed to grow for aerobically for 24 hours at 30° C. at 250 rpm until saturated.
[0345] Metallosphaera sedula (DSMZ #5348) is obtained from the German Collection of Microorganisms and Cell Cultures (Braunschweig, Germany) as an actively growing culture. Serial dilutions of M. sedula culture are made into Metallosphaera medium (#485) as described per DSMZ instructions. M. sedula is grown aerobically at 65° C. at 250 rpm until saturated.
[0346] Propionibacterium freudenreichii subsp. shermanii (DSMZ #4902) is obtained from the German Collection of Microorganisms and Cell Cultures (Braunschweig, Germany) as a vacuum dried culture. Cultures are then resuspended in PYG-medium (#104) as described per DSMZ instructions. P. freudenreichii subsp. shermanii is grown anaerobically at 30° C. at 250 rpm until saturated.
[0347] Pseudomonas putida is a gift from the Gill lab (University of Colorado at Boulder) and is obtained as an actively growing culture. Serial dilutions of the actively growing P. putida culture are made into Luria Broth (RPI Corp, Mt. Prospect, Ill., USA) and are allowed to grow for aerobically for 24 hours at 37° C. at 250 rpm until saturated.
[0348] Streptococcus mutans (DSMZ #6178) is obtained from the German Collection of Microorganisms and Cell Cultures (Braunschweig, Germany) as a vacuum dried culture. Cultures are then resuspended in Luria Broth (RPI Corp, Mt. Prospect, Ill., USA). S. mutans is grown aerobically at 37° C. at 250 rpm until saturated.
[0349] Subsection II: Gel Preparation, DNA Separation, Extraction, Ligation, and Transformation Methods:
[0350] Molecular biology grade agarose (RPI Corp, Mt. Prospect, Ill., USA) is added to 1× TAE to make a 1% Agarose in TAE. To obtain 50× TAE add the following to 900 ml distilled H2O:242 g Tris base (RPI Corp, Mt. Prospect, Ill., USA), 57.1 ml Glacial Acetic Acid (Sigma-Aldrich, St. Louis, Mo., USA), 18.6 g EDTA (Fisher Scientific, Pittsburgh, Pa. USA), and adjust volume to 1 L with additional distilled water. To obtain 1× TAE, add 20 mL of 50× TAE to 980 mL of distilled water. The agarose-TAE solution is then heated until boiling occurred and the agarose is fully dissolved. The solution is allowed to cool to 50° C. before 10 mg/mL ethidium bromide (Acros Organics, Morris Plains, N.J., USA) is added at a concentration of 5 ul per 100 mL of 1% agarose solution. Once the ethidium bromide is added, the solution is briefly mixed and poured into a gel casting tray with the appropriate number of combs (Idea Scientific Co., Minneapolis, Minn., USA) per sample analysis. DNA samples are then mixed accordingly with 5× TAE loading buffer. 5× TAE loading buffer consists of 5× TAE (diluted from 50× TAE as described herein), 20% glycerol (Acros Organics, Morris Plains, N.J., USA), 0.125% Bromophenol Blue (Alfa Aesar, Ward Hill, Mass., USA), and adjust volume to 50 mL with distilled water. Loaded gels are then run in gel rigs (Idea Scientific Co., Minneapolis, Minn., USA) filled with 1× TAE at a constant voltage of 125 volts for 25-30 minutes. At this point, the gels are removed from the gel boxes with voltage and visualized under a UV transilluminator (FOTODYNE Inc., Hartland, Wis., USA).
[0351] The DNA isolated through gel extraction is then extracted using the QIAquick Gel Extraction Kit following manufacturer's instructions (Qiagen (Valencia, Calif. USA)). Similar methods are known to those skilled in the art.
[0352] The thus-extracted DNA then may be ligated into pSMART (Lucigen Corp, Middleton, Wis., USA), StrataClone (Stratagene, La Jolla, Calif., USA) or pCR2.1-TOPO TA (Invitrogen Corp, Carlsbad, Calif., USA) according to manufacturer's instructions. These methods are described in the next subsection of Common Methods.
[0353] Ligation Methods:
[0354] For ligations into pSMART Vectors:
[0355] Gel extracted DNA is blunted using PCRTerminator (Lucigen Corp, Middleton, Wis., USA) according to manufacturer's instructions. Then 500 ng of DNA is added to 2.5 uL 4× CloneSmart vector premix, 1 ul CloneSmart DNA ligase (Lucigen Corp, Middleton, Wis., USA) and distilled water is added for a total volume of 10 ul. The reaction is then allowed to sit at room temperature for 30 minutes and then heat inactivated at 70° C. for 15 minutes and then placed on ice. E. cloni 10G Chemically Competent cells (Lucigen Corp, Middleton, Wis., USA) are thawed for 20 minutes on ice. 40 ul of chemically competent cells are placed into a microcentrifuge tube and 1 ul of heat inactivated CloneSmart Ligation is added to the tube. The whole reaction is stirred briefly with a pipette tip. The ligation and cells are incubated on ice for 30 minutes and then the cells are heat shocked for 45 seconds at 42° C. and then put back onto ice for 2 minutes. 960 ul of room temperature Recovery media (Lucigen Corp, Middleton, Wis., USA) and places into microcentrifuge tubes. Shake tubes at 250 rpm for 1 hour at 37° C. Plate 100 ul of transformed cells on Luria Broth plates (RPI Corp, Mt. Prospect, Ill., USA) plus appropriate antibiotics depending on the pSMART vector used. Incubate plates overnight at 37° C.
[0356] General Transformation and Related Culture Methodologies:
[0357] Chemically competent transformation protocols are carried out according to the manufacturer's instructions or according to the literature contained in Molecular Cloning (Sambrook and Russell, 2001). Generally, plasmid DNA or ligation products are chilled on ice for 5 to 30 min. in solution with chemically competent cells. Chemically competent cells are a widely used product in the field of biotechnology and are available from multiple vendors, such as those indicated in this Subsection. Following the chilling period cells generally are heat-shocked for 30 seconds at 42° C. without shaking, re-chilled and combined with 250 microliters of rich media, such as S.O.C. Cells are then incubated at 37° C. while shaking at 250 rpm for 1 hour. Finally, the cells are screened for successful transformations by plating on media containing the appropriate antibiotics.
[0358] Alternatively, selected cells may be transformed by electroporation methods such as are known to those skilled in the art.
[0359] The choice of an E. coli host strain for plasmid transformation is determined by considering factors such as plasmid stability, plasmid compatibility, plasmid screening methods and protein expression. Strain backgrounds can be changed by simply purifying plasmid DNA as described herein and transforming the plasmid into a desired or otherwise appropriate E. coli host strain such as determined by experimental necessities, such as any commonly used cloning strain (e.g., DH5α, Top10F', E. cloni 10G, etc.).
[0360] Plasmid DNA was prepared using the commercial miniprep kit from Qiagen (Valencia, Calif. USA) according to manufacturer's instructions.
[0361] Subsection IIIa. 3-HP Preparation
[0362] A 3-HP stock solution was prepared as follows. A vial of β-propriolactone (Sigma-Aldrich, St. Louis, Mo., USA) was opened under a fume hood and the entire bottle contents was transferred to a new container sequentially using a 25-mL glass pipette. The vial was rinsed with 50 mL of HPLC grade water and this rinse was poured into the new container. Two additional rinses were performed and added to the new container. Additional HPLC grade water was added to the new container to reach a ratio of 50 mL water per 5 mL β-propriolactone. The new container was capped tightly and allowed to remain in the fume hood at room temperature for 72 hours. After 72 hours the contents were transferred to centrifuge tubes and centrifuged for 10 minutes at 4,000 rpm. Then the solution was filtered to remove particulates and, as needed, concentrated by use of a rotary evaporator at room temperature. Assay for concentration was conducted, and dilution to make a standard concentration stock solution was made as needed.
[0363] Subsection IIIb. HPLC, GC/MS and Other Analytical Methods for 3-HP Detection (Analysis of Cultures for 3-HP Production)
[0364] For HPLC analysis of 3-HP, the Waters chromatography system (Milford, Mass.) consisted of the following: 600S Controller, 616 Pump, 717 Plus Autosampler, 486 Tunable UV Detector, and an in-line mobile phase Degas ser. In addition, an Eppendorf external column heater is used and the data are collected using an SRI (Torrance, Calif.) analog-to-digital converter linked to a standard desk top computer. Data are analyzed using the SRI Peak Simple software. A Coregel 64H ion exclusion column (Transgenomic, Inc., San Jose, Calif.) is employed. The column resin is a sulfonated polystyrene divinyl benzene with a particle size of 10 um and column dimensions are 300×7.8 mm. The mobile phase consisted of sulfuric acid (Fisher Scientific, Pittsburgh, Pa. USA) diluted with deionized (18 MSkm) water to a concentration of 0.02 N and vacuum filtered through a 0.2 μm nylon filter. The flow rate of the mobile phase is 0.6 mL/min. The UV detector is operated at a wavelength of 210 nm and the column is heated to 60° C. The same equipment and method as described herein is used for 3-HP analyses for relevant prophetic examples.
[0365] The following method is used for GC-MS analysis of 3-HP. Soluble monomeric 3-HP is quantified using GC-MS after a single extraction of the fermentation media with ethyl acetate. Once the 3-HP has been extracted into the ethyl acetate, the active hydrogens on the 3-HP are replaced with trimethylsilyl groups using N,O-Bis-(Trimethylsilyl)trifluoroacetamide to make the compound volatile for GC analysis. A standard curve of known 3-HP concentrations is prepared at the beginning of the run and a known quantity of ketohexanoic acid (1 g/L) is added to both the standards and the samples to act as an internal standard for Quantitation, with tropic acid as an additional internal standard. The 3-HP content of individual samples is then assayed by examining the ratio of the ketohexanoic acid ion (m/z=247) to the 3-HP ion (219) and compared to the standard curve. 3-HP is quantified using a 3HP standard curve at the beginning of the run and the data are analyzed using HP Chemstation. The GC-MS system consists of a Hewlett Packard model 5890 GC and Hewlett Packard model 5972 MS. The column is Supelco SPB-1 (60 m×0.32 mm×0.25 μm film thickness). The capillary coating is a non-polar methylsilicone. The carrier gas is helium at a flow rate of 1 mL/min. The 3-HP as derivatized is separated from other components in the ethyl acetate extract using either of two similar temperature regimes. In a first temperature gradient regime, the column temperature starts with 40° C. for 1 minute, then is raised at a rate of 10° C./minute to 235° C., and then is raised at a rate of 50° C./minute to 300° C. In a second temperature regime, which was demonstrated to process samples more quickly, the column temperature starts with 70° C. which is held for 1 min, followed by a ramp-up of 10° C./minute to 235° C. which is followed by a ramp-up of 50° C./minute to 300° C.
[0366] The embodiments, variations, sequences, and figures described herein should provide an indication of the utility and versatility of the present invention. Other embodiments that do not provide all of the features and advantages set forth herein may also be utilized, without departing from the spirit and scope of the present invention. Such modifications and variations are considered to be within the scope of the invention.
TABLE-US-00006 TABLE 1 E.C. Gene SEQ ID NO. SEQ ID NO. Protein Classi- Name in of E. coli of Expressed Function fication E. coli gene Enzyme Glucose N/A galP 003 004 transport Phosphoenol- 4.1.1.49 pckA 007 008 pyruvate carboxykinase Phosphoenol- 4.1.1.49 ppc 009 010 pyruvate carboxylase
TABLE-US-00007 TABLE 2 E.C. Gene SEQ ID NO. SEQ ID NO. Protein Classi- Name in of E. coli of Expressed Function fication E. coli gene Enzyme Pyruvate 1.2.4.1 aceE 011 012 dehydrogenase E1p l ipoate 2.3.1.12 aceF 013 014 acetyltransferase/ dihydrolipoamide acetyltransferase Pyruvate 1.8.1.4 l pd 015 016 dehydrogenase E3 (lipoamide dehydrogenase) Lactate 1.1.1.28 ldhA 017 018 dehydrogenase Pyruvate formate 2.3.1.-- pflB 019 020 lyase (B "inactive") Pyruvate oxidase 1.2.2.2 poxB 021 022 Phosphate 2.3.1.8 Pta 023 024 acetyltransferase Heat stable, N/A ptsH 025 026 histidyl (HPr) phosphorylatable protein (of PTS) Phosphoryl N/A ptsI 027 028 transfer protein (of PTS) Polypeptide chain N/A Crr 029 030 (of PTS) Pyruvate kinase I 2.7.1.40 pykA 031 032 Pyruvate kinase II 2.7.1.40 pykF 033 034
Sequence CWU
1
1
681452PRTBacillus cereus 1Met Met Met Met Lys Thr Lys Gln Thr Asp Glu Leu
Leu Ala Lys Asp 1 5 10
15 Glu Gln Tyr Val Trp His Gly Met Arg Pro Phe Ser Pro Asn Ser Thr
20 25 30 Met Val Gly
Ala Lys Ala Glu Gly Cys Trp Val Glu Asp Ile Gln Gly 35
40 45 Lys Arg Tyr Leu Asp Gly Met Ser
Gly Leu Trp Cys Val Asn Ser Gly 50 55
60 Tyr Gly Arg Lys Glu Leu Ala Glu Ala Ala Tyr Lys Gln
Leu Gln Thr 65 70 75
80 Leu Ser Tyr Phe Pro Met Ser Gln Ser His Glu Pro Ala Ile Lys Leu
85 90 95 Ala Glu Lys Leu
Asn Glu Trp Leu Gly Gly Glu Tyr Val Ile Phe Phe 100
105 110 Ser Asn Ser Gly Ser Glu Ala Asn Glu
Thr Ala Phe Lys Ile Ala Arg 115 120
125 Gln Tyr Tyr Ala Gln Lys Gly Glu Pro His Arg Tyr Lys Phe
Met Ser 130 135 140
Arg Tyr Arg Gly Tyr His Gly Asn Thr Met Ala Thr Met Ala Ala Thr 145
150 155 160 Gly Gln Ala Gln Arg
Arg Tyr Gln Tyr Glu Pro Phe Ala Ser Gly Phe 165
170 175 Leu His Val Thr Pro Pro Asp Cys Tyr Arg
Met Pro Glu Ile Glu Gly 180 185
190 Gln His Ile Tyr Asp Val Glu Cys Val Lys Glu Val Asp Arg Val
Met 195 200 205 Thr
Trp Glu Leu Ser Glu Thr Ile Ala Ala Phe Ile Met Glu Pro Ile 210
215 220 Ile Thr Gly Gly Gly Ile
Leu Met Pro Pro Gln Asp Tyr Met Lys Ala 225 230
235 240 Val His Glu Met Cys Gln Lys His Gly Ala Leu
Leu Ile Ser Asp Glu 245 250
255 Val Ile Cys Gly Phe Gly Arg Thr Gly Lys Ala Phe Gly Phe Met Asn
260 265 270 Tyr Asp
Val Lys Pro Asp Ile Ile Thr Met Ala Lys Gly Ile Thr Ser 275
280 285 Ala Tyr Leu Pro Leu Ser Ala
Thr Ala Val Lys Lys Glu Ile Tyr Glu 290 295
300 Ala Phe Lys Gly Lys Gly Glu Tyr Glu Phe Phe Arg
His Ile Asn Thr 305 310 315
320 Phe Gly Gly Asn Pro Ala Ala Cys Ala Leu Ala Leu Lys Asn Leu Glu
325 330 335 Ile Met Glu
Asn Glu Asn Leu Ile Glu Arg Ser Ala Gln Met Gly Ser 340
345 350 Leu Leu Leu Glu Gln Leu Lys Asp
Glu Ile Gly Glu His Pro Leu Val 355 360
365 Gly Asn Ile Arg Gly Lys Gly Leu Leu Val Gly Ile Glu
Leu Val Asn 370 375 380
Asp Lys Glu Thr Lys Glu Pro Ile Asp Asn Asp Lys Ile Ala Ser Val 385
390 395 400 Val Asn Ala Cys
Lys Glu Lys Gly Leu Ile Ile Gly Arg Asn Gly Met 405
410 415 Thr Thr Ala Gly Tyr Asn Asn Val Leu
Thr Leu Ala Pro Pro Leu Val 420 425
430 Ile Ser Ser Glu Glu Ile Ala Phe Val Val Gly Thr Leu Lys
Thr Ala 435 440 445
Met Glu Arg Ile 450 21371DNAArtificial SequenceDescription of
Artificial Sequence Codon-optimized (for E. Coli) synthetic
polynucleotide 2tgatgatgat gaaaaccaag cagacggatg agctgctggc gaaggacgaa
cagtatgtgt 60ggcatggcat gcgcccgttt agcccgaatt ccactatggt tggtgcaaaa
gcagaaggtt 120gctgggtcga ggacattcag ggcaagcgtt atctggatgg catgagcggt
ctgtggtgcg 180tgaattcggg ttatggccgt aaagagttgg cggaggcggc gtacaagcag
ctgcaaaccc 240tgagctattt cccgatgtct cagagccacg aaccggcgat caaactggcg
gagaaactga 300atgaatggtt gggtggtgaa tatgtgatct ttttcagcaa ttccggtagc
gaggcaaacg 360aaacggcctt caagattgcg cgccaatact acgcgcaaaa aggcgaaccg
catcgttaca 420agttcatgtc tcgttaccgc ggctatcacg gcaataccat ggccaccatg
gccgccaccg 480gtcaagcgca acgccgttac caatacgagc cgtttgcttc tggttttctg
catgttacgc 540ctccggattg ttaccgcatg ccggaaatcg aaggccagca catctatgac
gtcgaatgcg 600tgaaagaagt ggatcgtgtg atgacctggg aactgtccga aaccatcgca
gccttcatca 660tggagccgat tatcacgggc ggtggtattc tgatgccacc gcaggactac
atgaaggcag 720ttcacgagat gtgccagaag cacggcgctc tgctgattag cgacgaagtc
atctgcggtt 780tcggccgtac gggtaaggcg tttggtttca tgaactacga tgttaaaccg
gacatcatta 840cgatggcgaa aggtattacg agcgcatatc tgcctctgag cgcgactgcg
gttaagaaag 900agatctacga agccttcaag ggtaagggtg agtatgaatt ctttcgtcac
atcaatacct 960ttggtggtaa tccagccgcg tgtgcattgg cactgaaaaa cttggagatt
atggagaatg 1020aaaacctgat tgaacgcagc gcccaaatgg gtagcctgct gctggagcag
ctgaaggatg 1080agatcggcga gcacccgctg gttggtaaca tccgcggcaa gggcctgctg
gtcggcatcg 1140agctggtcaa cgacaaagaa accaaagaac cgatcgataa tgacaagatt
gctagcgtcg 1200tgaatgcttg taaagagaag ggtctgatta tcggtcgtaa cggcatgacc
accgcgggtt 1260acaacaacgt tctgaccttg gcaccgccgc tggtgatcag ctccgaagag
attgcgtttg 1320tggttggtac gctgaaaacc gcaatggagc gtattcacca tcatcatcac t
13713298PRTPseudomonas aeruginosa 3Met Thr Asp Ile Ala Phe Leu
Gly Leu Gly Asn Met Gly Gly Pro Met 1 5
10 15 Ala Ala Asn Leu Leu Lys Ala Gly His Arg Val
Asn Val Phe Asp Leu 20 25
30 Gln Pro Lys Ala Val Leu Gly Leu Val Glu Gln Gly Ala Gln Gly
Ala 35 40 45 Asp
Ser Ala Leu Gln Cys Cys Glu Gly Ala Glu Val Val Ile Ser Met 50
55 60 Leu Pro Ala Gly Gln His
Val Glu Ser Leu Tyr Leu Gly Asp Asp Gly 65 70
75 80 Leu Leu Ala Arg Val Ala Gly Lys Pro Leu Leu
Ile Asp Cys Ser Thr 85 90
95 Ile Ala Pro Glu Thr Ala Arg Lys Val Ala Glu Ala Ala Ala Ala Lys
100 105 110 Gly Leu
Thr Leu Leu Asp Ala Pro Val Ser Gly Gly Val Gly Gly Ala 115
120 125 Arg Ala Gly Thr Leu Ser Phe
Ile Val Gly Gly Pro Ala Glu Gly Phe 130 135
140 Ala Arg Ala Arg Pro Val Leu Glu Asn Met Gly Arg
Asn Ile Phe His 145 150 155
160 Ala Gly Asp His Gly Ala Gly Gln Val Ala Lys Ile Cys Asn Asn Met
165 170 175 Leu Leu Gly
Ile Leu Met Ala Gly Thr Ala Glu Ala Leu Ala Leu Gly 180
185 190 Val Lys Asn Gly Leu Asp Pro Ala
Val Leu Ser Glu Val Met Lys Gln 195 200
205 Ser Ser Gly Gly Asn Trp Ala Leu Asn Leu Tyr Asn Pro
Trp Pro Gly 210 215 220
Val Met Pro Gln Ala Pro Ala Ser Asn Gly Tyr Ala Gly Gly Phe Gln 225
230 235 240 Val Arg Leu Met
Asn Lys Asp Leu Gly Leu Ala Leu Ala Asn Ala Gln 245
250 255 Ala Val Gln Ala Ser Thr Pro Leu Gly
Ala Leu Ala Arg Asn Leu Phe 260 265
270 Ser Leu His Ala Gln Ala Asp Ala Glu His Glu Gly Leu Asp
Phe Ser 275 280 285
Ser Ile Gln Lys Leu Tyr Arg Gly Lys Asp 290 295
4895DNAPseudomonas aeruginosa 4atgaccgaca ttgcgtttct gggtctgggc
aatatgggcg gtccgatggc cgcgaacctg 60ctgaaagccg gccaccgtgt gaatgtgttc
gacctgcaac caaaagcggt cctgggcttg 120gttgagcaag gcgcgcaggg cgcagactct
gctctgcaat gttgtgaggg tgcggaggtc 180gtgatttcta tgctgccagc aggccagcat
gtggaaagcc tgtacctggg cgatgatggt 240ctgctggcac gcgtggcggg caagcctttg
ctgattgact gtagcaccat cgcaccggaa 300acggcgcgta aggtggcgga ggcagccgca
gcaaagggcc tgacgctgct ggatgccccg 360gtttcgggcg gtgtcggtgg tgcccgtgca
ggtacgctgt cgtttatcgt gggtggtccg 420gcggagggtt ttgcgcgtgc gcgtccggtt
ctggagaata tgggtcgcaa cattttccac 480gcgggtgatc acggcgctgg tcaggtggcg
aaaatctgta acaacatgct gctgggtatc 540ttgatggcgg gcaccgccga agccttggcg
ctgggcgtca aaaacggtct ggacccggca 600gtgctgtccg aagtgatgaa acagagcagc
ggtggtaact gggcgctgaa tctgtacaat 660ccgtggccgg gtgtgatgcc gcaggcccca
gcctctaatg gctacgcagg cggcttccaa 720gtgcgcctga tgaacaaaga cctgggcctg
gcgctggcga atgcgcaagc ggtccaagcg 780agcaccccgc tgggcgcact ggcccgtaac
ctgtttagcc tgcacgctca agccgacgcc 840gagcacgaag gtctggactt cagctctatt
caaaaactgt atcgcggtaa ggatt 89551392DNAEscherichia coli
5cctgacgcta aaaaacaggg gcggtcaaac aaggcaatga cgtttttcgt ctgcttcctt
60gccgctctgg cgggattact ctttggcctg gatatcggtg taattgctgg cgcactgccg
120tttattgcag atgaattcca gattacttcg cacacgcaag aatgggtcgt aagctccatg
180atgttcggtg cggcagtcgg tgcggtgggc agcggctggc tctcctttaa actcgggcgc
240aaaaagagcc tgatgatcgg cgcaattttg tttgttgccg gttcgctgtt ctctgcggct
300gcgccaaacg ttgaagtact gattctttcc cgcgttctac tggggctggc ggtgggtgtg
360gcctcttata ccgcaccgct gtacctctct gaaattgcgc cggaaaaaat tcgtggcagt
420atgatctcga tgtatcagtt gatgatcact atcgggatcc tcggtgctta tctttctgat
480accgccttca gctacaccgg tgcatggcgc tggatgctgg gtgtgattat catcccggca
540attttgctgc tgattggtgt cttcttcctg ccagacagcc cacgttggtt tgccgccaaa
600cgccgttttg ttgatgccga acgcgtgctg ctacgcctgc gtgacaccag cgcggaagcg
660aaacgcgaac tggatgaaat ccgtgaaagt ttgcaggtta aacagagtgg ctgggcgctg
720tttaaagaga acagcaactt ccgccgcgcg gtgttccttg gcgtactgtt gcaggtaatg
780cagcaattca ccgggatgaa cgtcatcatg tattacgcgc cgaaaatctt cgaactggcg
840ggttatacca acactaccga gcaaatgtgg gggaccgtga ttgtcggcct gaccaacgta
900cttgccacct ttatcgcaat cggccttgtt gaccgctggg gacgtaaacc aacgctaacg
960ctgggcttcc tggtgatggc tgctggcatg ggcgtactcg gtacaatgat gcatatcggt
1020attcactctc cgtcggcgca gtatttcgcc atcgccatgc tgctgatgtt tattgtcggt
1080tttgccatga gtgccggtcc gctgatttgg gtactgtgct ccgaaattca gccgctgaaa
1140ggccgcgatt ttggcatcac ctgctccact gccaccaact ggattgccaa catgatcgtt
1200ggcgcaacgt tcctgaccat gctcaacacg ctgggtaacg ccaacacctt ctgggtgtat
1260gcggctctga acgtactgtt tatcctgctg acattgtggc tggtaccgga aaccaaacac
1320gtttcgctgg aacatattga acgtaatctg atgaaaggtc gtaaactgcg cgaaataggc
1380gctcacgatt aa
13926464PRTEscherichia coli 6Met Pro Asp Ala Lys Lys Gln Gly Arg Ser Asn
Lys Ala Met Thr Phe 1 5 10
15 Phe Val Cys Phe Leu Ala Ala Leu Ala Gly Leu Leu Phe Gly Leu Asp
20 25 30 Ile Gly
Val Ile Ala Gly Ala Leu Pro Phe Ile Ala Asp Glu Phe Gln 35
40 45 Ile Thr Ser His Thr Gln Glu
Trp Val Val Ser Ser Met Met Phe Gly 50 55
60 Ala Ala Val Gly Ala Val Gly Ser Gly Trp Leu Ser
Phe Lys Leu Gly 65 70 75
80 Arg Lys Lys Ser Leu Met Ile Gly Ala Ile Leu Phe Val Ala Gly Ser
85 90 95 Leu Phe Ser
Ala Ala Ala Pro Asn Val Glu Val Leu Ile Leu Ser Arg 100
105 110 Val Leu Leu Gly Leu Ala Val Gly
Val Ala Ser Tyr Thr Ala Pro Leu 115 120
125 Tyr Leu Ser Glu Ile Ala Pro Glu Lys Ile Arg Gly Ser
Met Ile Ser 130 135 140
Met Tyr Gln Leu Met Ile Thr Ile Gly Ile Leu Gly Ala Tyr Leu Ser 145
150 155 160 Asp Thr Ala Phe
Ser Tyr Thr Gly Ala Trp Arg Trp Met Leu Gly Val 165
170 175 Ile Ile Ile Pro Ala Ile Leu Leu Leu
Ile Gly Val Phe Phe Leu Pro 180 185
190 Asp Ser Pro Arg Trp Phe Ala Ala Lys Arg Arg Phe Val Asp
Ala Glu 195 200 205
Arg Val Leu Leu Arg Leu Arg Asp Thr Ser Ala Glu Ala Lys Arg Glu 210
215 220 Leu Asp Glu Ile Arg
Glu Ser Leu Gln Val Lys Gln Ser Gly Trp Ala 225 230
235 240 Leu Phe Lys Glu Asn Ser Asn Phe Arg Arg
Ala Val Phe Leu Gly Val 245 250
255 Leu Leu Gln Val Met Gln Gln Phe Thr Gly Met Asn Val Ile Met
Tyr 260 265 270 Tyr
Ala Pro Lys Ile Phe Glu Leu Ala Gly Tyr Thr Asn Thr Thr Glu 275
280 285 Gln Met Trp Gly Thr Val
Ile Val Gly Leu Thr Asn Val Leu Ala Thr 290 295
300 Phe Ile Ala Ile Gly Leu Val Asp Arg Trp Gly
Arg Lys Pro Thr Leu 305 310 315
320 Thr Leu Gly Phe Leu Val Met Ala Ala Gly Met Gly Val Leu Gly Thr
325 330 335 Met Met
His Ile Gly Ile His Ser Pro Ser Ala Gln Tyr Phe Ala Ile 340
345 350 Ala Met Leu Leu Met Phe Ile
Val Gly Phe Ala Met Ser Ala Gly Pro 355 360
365 Leu Ile Trp Val Leu Cys Ser Glu Ile Gln Pro Leu
Lys Gly Arg Asp 370 375 380
Phe Gly Ile Thr Cys Ser Thr Ala Thr Asn Trp Ile Ala Asn Met Ile 385
390 395 400 Val Gly Ala
Thr Phe Leu Thr Met Leu Asn Thr Leu Gly Asn Ala Asn 405
410 415 Thr Phe Trp Val Tyr Ala Ala Leu
Asn Val Leu Phe Ile Leu Leu Thr 420 425
430 Leu Trp Leu Val Pro Glu Thr Lys His Val Ser Leu Glu
His Ile Glu 435 440 445
Arg Asn Leu Met Lys Gly Arg Lys Leu Arg Glu Ile Gly Ala His Asp 450
455 460
71620DNAEscherichia coli 7cgcgttaaca atggtttgac cccgcaagaa ctcgaggctt
atggtatcag tgacgtacat 60gatatcgttt acaacccaag ctacgacctg ctgtatcagg
aagagctcga tccgagcctg 120acaggttatg agcgcggggt gttaactaat ctgggtgccg
ttgccgtcga taccgggatc 180ttcaccggtc gttcaccaaa agataagtat atcgtccgtg
acgataccac tcgcgatact 240ttctggtggg cagacaaagg caaaggtaag aacgacaaca
aacctctctc tccggaaacc 300tggcagcatc tgaaaggcct ggtgaccagg cagctttccg
gcaaacgtct gttcgttgtc 360gacgctttct gtggtgcgaa cccggatact cgtctttccg
tccgtttcat caccgaagtg 420gcctggcagg cgcattttgt caaaaacatg tttattcgcc
cgagcgatga agaactggca 480ggtttcaaac cagactttat cgttatgaac ggcgcgaagt
gcactaaccc gcagtggaaa 540gaacagggtc tcaactccga aaacttcgtg gcgtttaacc
tgaccgagcg catgcagctg 600attggcggca cctggtacgg cggcgaaatg aagaaaggga
tgttctcgat gatgaactac 660ctgctgccgc tgaaaggtat cgcttctatg cactgctccg
ccaacgttgg tgagaaaggc 720gatgttgcgg tgttcttcgg cctttccggc accggtaaaa
ccaccctttc caccgacccg 780aaacgtcgcc tgattggcga tgacgaacac ggctgggacg
atgacggcgt gtttaacttc 840gaaggcggct gctacgcaaa aactatcaag ctgtcgaaag
aagcggaacc tgaaatctac 900aacgctatcc gtcgtgatgc gttgctggaa aacgtcaccg
tgcgtgaaga tggcactatc 960gactttgatg atggttcaaa aaccgagaac acccgcgttt
cttatccgat ctatcacatc 1020gataacattg ttaagccggt ttccaaagcg ggccacgcga
ctaaggttat cttcctgact 1080gctgatgctt tcggcgtgtt gccgccggtt tctcgcctga
ctgccgatca aacccagtat 1140cacttcctct ctggcttcac cgccaaactg gccggtactg
agcgtggcat caccgaaccg 1200acgccaacct tctccgcttg cttcggcgcg gcattcctgt
cgctgcaccc gactcagtac 1260gcagaagtgc tggtgaaacg tatgcaggcg gcgggcgcgc
aggcttatct ggttaacact 1320ggctggaacg gcactggcaa acgtatctcg attaaagata
cccgcgccat tatcgacgcc 1380atcctcaacg gttcgctgga taatgcagaa accttcactc
tgccgatgtt taacctggcg 1440atcccaaccg aactgccggg cgtagacacg aagattctcg
atccgcgtaa cacctacgct 1500tctccggaac agtggcagga aaaagccgaa accctggcga
aactgtttat cgacaacttc 1560gataaataca ccgacacccc tgcgggtgcc gcgctggtag
cggctggtcc gaaactgtaa 16208540PRTEscherichia coli 8Met Arg Val Asn Asn
Gly Leu Thr Pro Gln Glu Leu Glu Ala Tyr Gly 1 5
10 15 Ile Ser Asp Val His Asp Ile Val Tyr Asn
Pro Ser Tyr Asp Leu Leu 20 25
30 Tyr Gln Glu Glu Leu Asp Pro Ser Leu Thr Gly Tyr Glu Arg Gly
Val 35 40 45 Leu
Thr Asn Leu Gly Ala Val Ala Val Asp Thr Gly Ile Phe Thr Gly 50
55 60 Arg Ser Pro Lys Asp Lys
Tyr Ile Val Arg Asp Asp Thr Thr Arg Asp 65 70
75 80 Thr Phe Trp Trp Ala Asp Lys Gly Lys Gly Lys
Asn Asp Asn Lys Pro 85 90
95 Leu Ser Pro Glu Thr Trp Gln His Leu Lys Gly Leu Val Thr Arg Gln
100 105 110 Leu Ser
Gly Lys Arg Leu Phe Val Val Asp Ala Phe Cys Gly Ala Asn 115
120 125 Pro Asp Thr Arg Leu Ser Val
Arg Phe Ile Thr Glu Val Ala Trp Gln 130 135
140 Ala His Phe Val Lys Asn Met Phe Ile Arg Pro Ser
Asp Glu Glu Leu 145 150 155
160 Ala Gly Phe Lys Pro Asp Phe Ile Val Met Asn Gly Ala Lys Cys Thr
165 170 175 Asn Pro Gln
Trp Lys Glu Gln Gly Leu Asn Ser Glu Asn Phe Val Ala 180
185 190 Phe Asn Leu Thr Glu Arg Met Gln
Leu Ile Gly Gly Thr Trp Tyr Gly 195 200
205 Gly Glu Met Lys Lys Gly Met Phe Ser Met Met Asn Tyr
Leu Leu Pro 210 215 220
Leu Lys Gly Ile Ala Ser Met His Cys Ser Ala Asn Val Gly Glu Lys 225
230 235 240 Gly Asp Val Ala
Val Phe Phe Gly Leu Ser Gly Thr Gly Lys Thr Thr 245
250 255 Leu Ser Thr Asp Pro Lys Arg Arg Leu
Ile Gly Asp Asp Glu His Gly 260 265
270 Trp Asp Asp Asp Gly Val Phe Asn Phe Glu Gly Gly Cys Tyr
Ala Lys 275 280 285
Thr Ile Lys Leu Ser Lys Glu Ala Glu Pro Glu Ile Tyr Asn Ala Ile 290
295 300 Arg Arg Asp Ala Leu
Leu Glu Asn Val Thr Val Arg Glu Asp Gly Thr 305 310
315 320 Ile Asp Phe Asp Asp Gly Ser Lys Thr Glu
Asn Thr Arg Val Ser Tyr 325 330
335 Pro Ile Tyr His Ile Asp Asn Ile Val Lys Pro Val Ser Lys Ala
Gly 340 345 350 His
Ala Thr Lys Val Ile Phe Leu Thr Ala Asp Ala Phe Gly Val Leu 355
360 365 Pro Pro Val Ser Arg Leu
Thr Ala Asp Gln Thr Gln Tyr His Phe Leu 370 375
380 Ser Gly Phe Thr Ala Lys Leu Ala Gly Thr Glu
Arg Gly Ile Thr Glu 385 390 395
400 Pro Thr Pro Thr Phe Ser Ala Cys Phe Gly Ala Ala Phe Leu Ser Leu
405 410 415 His Pro
Thr Gln Tyr Ala Glu Val Leu Val Lys Arg Met Gln Ala Ala 420
425 430 Gly Ala Gln Ala Tyr Leu Val
Asn Thr Gly Trp Asn Gly Thr Gly Lys 435 440
445 Arg Ile Ser Ile Lys Asp Thr Arg Ala Ile Ile Asp
Ala Ile Leu Asn 450 455 460
Gly Ser Leu Asp Asn Ala Glu Thr Phe Thr Leu Pro Met Phe Asn Leu 465
470 475 480 Ala Ile Pro
Thr Glu Leu Pro Gly Val Asp Thr Lys Ile Leu Asp Pro 485
490 495 Arg Asn Thr Tyr Ala Ser Pro Glu
Gln Trp Gln Glu Lys Ala Glu Thr 500 505
510 Leu Ala Lys Leu Phe Ile Asp Asn Phe Asp Lys Tyr Thr
Asp Thr Pro 515 520 525
Ala Gly Ala Ala Leu Val Ala Ala Gly Pro Lys Leu 530
535 540 92647DNAEscherichia coli 9aacgaacaat attccgcatt
gcgtagtaat gtcagtatgc tcggcaaagt gctgggagaa 60accatcaagg atgcgttggg
agaacacatt cttgaacgcg tagaaactat ccgtaagttg 120tcgaaatctt cacgcgctgg
caatgatgct aaccgccagg agttgctcac caccttacaa 180aatttgtcga acgacgagct
gctgcccgtt gcgcgtgcgt ttagtcagtt cctgaacctg 240gccaacaccg ccgagcaata
ccacagcatt tcgccgaaag gcgaagctgc cagcaacccg 300gaagtgatcg cccgcaccct
gcgtaaactg aaaaaccagc cggaactgag cgaagacacc 360atcaaaaaag cagtggaatc
gctgtcgctg gaactggtcc tcacggctca cccaaccgaa 420attacccgtc gtacactgat
ccacaaaatg gtggaagtga acgcctgttt aaaacagctc 480gataacaaag atatcgctga
ctacgaacac aaccagctga tgcgtcgcct gcgccagttg 540atcgcccagt catggcatac
cgatgaaatc cgtaagctgc gtccaagccc ggtagatgaa 600gccaaatggg gctttgccgt
agtggaaaac agcctgtggc aaggcgtacc aaattacctg 660cgcgaactga acgaacaact
ggaagagaac ctcggctaca aactgcccgt cgaatttgtt 720ccggtccgtt ttacttcgtg
gatgggcggc gaccgcgacg gcaacccgaa cgtcactgcc 780gatatcaccc gccacgtcct
gctactcagc cgctggaaag ccaccgattt gttcctgaaa 840gatattcagg tgctggtttc
tgaactgtcg atggttgaag cgacccctga actgctggcg 900ctggttggcg aagaaggtgc
cgcagaaccg tatcgctatc tgatgaaaaa cctgcgttct 960cgcctgatgg cgacacaggc
atggctggaa gcgcgcctga aaggcgaaga actgccaaaa 1020ccagaaggcc tgctgacaca
aaacgaagaa ctgtgggaac cgctctacgc ttgctaccag 1080tcacttcagg cgtgtggcat
gggtattatc gccaacggcg atctgctcga caccctgcgc 1140cgcgtgaaat gtttcggcgt
accgctggtc cgtattgata tccgtcagga gagcacgcgt 1200cataccgaag cgctgggcga
gctgacccgc tacctcggta tcggcgacta cgaaagctgg 1260tcagaggccg acaaacaggc
gttcctgatc cgcgaactga actccaaacg tccgcttctg 1320ccgcgcaact ggcaaccaag
cgccgaaacg cgcgaagtgc tcgatacctg ccaggtgatt 1380gccgaagcac cgcaaggctc
cattgccgcc tacgtgatct cgatggcgaa aacgccgtcc 1440gacgtactgg ctgtccacct
gctgctgaaa gaagcgggta tcgggtttgc gatgccggtt 1500gctccgctgt ttgaaaccct
cgatgatctg aacaacgcca acgatgtcat gacccagctg 1560ctcaatattg actggtatcg
tggcctgatt cagggcaaac agatggtgat gattggctat 1620tccgactcag caaaagatgc
gggagtgatg gcagcttcct gggcgcaata tcaggcacag 1680gatgcattaa tcaaaacctg
cgaaaaagcg ggtattgagc tgacgttgtt ccacggtcgc 1740ggcggttcca ttggtcgcgg
cggcgcacct gctcatgcgg cgctgctgtc acaaccgcca 1800ggaagcctga aaggcggcct
gcgcgtaacc gaacagggcg agatgatccg ctttaaatat 1860ggtctgccag aaatcaccgt
cagcagcctg tcgctttata ccggggcgat tctggaagcc 1920aacctgctgc caccgccgga
gccgaaagag agctggcgtc gcattatgga tgaactgtca 1980gtcatctcct gcgatgtcta
ccgcggctac gtacgtgaaa acaaagattt tgtgccttac 2040ttccgctccg ctacgccgga
acaagaactg ggcaaactgc cgttgggttc acgtccggcg 2100aaacgtcgcc caaccggcgg
cgtcgagtca ctacgcgcca ttccgtggat cttcgcctgg 2160acgcaaaacc gtctgatgct
ccccgcctgg ctgggtgcag gtacggcgct gcaaaaagtg 2220gtcgaagacg gcaaacagag
cgagctggag gctatgtgcc gcgattggcc attcttctcg 2280acgcgtctcg gcatgctgga
gatggtcttc gccaaagcag acctgtggct ggcggaatac 2340tatgaccaac gcctggtaga
caaagcactg tggccgttag gtaaagagtt acgcaacctg 2400caagaagaag acatcaaagt
ggtgctggcg attgccaacg attcccatct gatggccgat 2460ctgccgtgga ttgcagagtc
tattcagcta cggaatattt acaccgaccc gctgaacgta 2520ttgcaggccg agttgctgca
ccgctcccgc caggcagaaa aagaaggcca ggaaccggat 2580cctcgcgtcg aacaagcgtt
aatggtcact attgccggga ttgcggcagg tatgcgtaat 2640accggct
264710883PRTEscherichia coli
10Met Asn Glu Gln Tyr Ser Ala Leu Arg Ser Asn Val Ser Met Leu Gly 1
5 10 15 Lys Val Leu Gly
Glu Thr Ile Lys Asp Ala Leu Gly Glu His Ile Leu 20
25 30 Glu Arg Val Glu Thr Ile Arg Lys Leu
Ser Lys Ser Ser Arg Ala Gly 35 40
45 Asn Asp Ala Asn Arg Gln Glu Leu Leu Thr Thr Leu Gln Asn
Leu Ser 50 55 60
Asn Asp Glu Leu Leu Pro Val Ala Arg Ala Phe Ser Gln Phe Leu Asn 65
70 75 80 Leu Ala Asn Thr Ala
Glu Gln Tyr His Ser Ile Ser Pro Lys Gly Glu 85
90 95 Ala Ala Ser Asn Pro Glu Val Ile Ala Arg
Thr Leu Arg Lys Leu Lys 100 105
110 Asn Gln Pro Glu Leu Ser Glu Asp Thr Ile Lys Lys Ala Val Glu
Ser 115 120 125 Leu
Ser Leu Glu Leu Val Leu Thr Ala His Pro Thr Glu Ile Thr Arg 130
135 140 Arg Thr Leu Ile His Lys
Met Val Glu Val Asn Ala Cys Leu Lys Gln 145 150
155 160 Leu Asp Asn Lys Asp Ile Ala Asp Tyr Glu His
Asn Gln Leu Met Arg 165 170
175 Arg Leu Arg Gln Leu Ile Ala Gln Ser Trp His Thr Asp Glu Ile Arg
180 185 190 Lys Leu
Arg Pro Ser Pro Val Asp Glu Ala Lys Trp Gly Phe Ala Val 195
200 205 Val Glu Asn Ser Leu Trp Gln
Gly Val Pro Asn Tyr Leu Arg Glu Leu 210 215
220 Asn Glu Gln Leu Glu Glu Asn Leu Gly Tyr Lys Leu
Pro Val Glu Phe 225 230 235
240 Val Pro Val Arg Phe Thr Ser Trp Met Gly Gly Asp Arg Asp Gly Asn
245 250 255 Pro Asn Val
Thr Ala Asp Ile Thr Arg His Val Leu Leu Leu Ser Arg 260
265 270 Trp Lys Ala Thr Asp Leu Phe Leu
Lys Asp Ile Gln Val Leu Val Ser 275 280
285 Glu Leu Ser Met Val Glu Ala Thr Pro Glu Leu Leu Ala
Leu Val Gly 290 295 300
Glu Glu Gly Ala Ala Glu Pro Tyr Arg Tyr Leu Met Lys Asn Leu Arg 305
310 315 320 Ser Arg Leu Met
Ala Thr Gln Ala Trp Leu Glu Ala Arg Leu Lys Gly 325
330 335 Glu Glu Leu Pro Lys Pro Glu Gly Leu
Leu Thr Gln Asn Glu Glu Leu 340 345
350 Trp Glu Pro Leu Tyr Ala Cys Tyr Gln Ser Leu Gln Ala Cys
Gly Met 355 360 365
Gly Ile Ile Ala Asn Gly Asp Leu Leu Asp Thr Leu Arg Arg Val Lys 370
375 380 Cys Phe Gly Val Pro
Leu Val Arg Ile Asp Ile Arg Gln Glu Ser Thr 385 390
395 400 Arg His Thr Glu Ala Leu Gly Glu Leu Thr
Arg Tyr Leu Gly Ile Gly 405 410
415 Asp Tyr Glu Ser Trp Ser Glu Ala Asp Lys Gln Ala Phe Leu Ile
Arg 420 425 430 Glu
Leu Asn Ser Lys Arg Pro Leu Leu Pro Arg Asn Trp Gln Pro Ser 435
440 445 Ala Glu Thr Arg Glu Val
Leu Asp Thr Cys Gln Val Ile Ala Glu Ala 450 455
460 Pro Gln Gly Ser Ile Ala Ala Tyr Val Ile Ser
Met Ala Lys Thr Pro 465 470 475
480 Ser Asp Val Leu Ala Val His Leu Leu Leu Lys Glu Ala Gly Ile Gly
485 490 495 Phe Ala
Met Pro Val Ala Pro Leu Phe Glu Thr Leu Asp Asp Leu Asn 500
505 510 Asn Ala Asn Asp Val Met Thr
Gln Leu Leu Asn Ile Asp Trp Tyr Arg 515 520
525 Gly Leu Ile Gln Gly Lys Gln Met Val Met Ile Gly
Tyr Ser Asp Ser 530 535 540
Ala Lys Asp Ala Gly Val Met Ala Ala Ser Trp Ala Gln Tyr Gln Ala 545
550 555 560 Gln Asp Ala
Leu Ile Lys Thr Cys Glu Lys Ala Gly Ile Glu Leu Thr 565
570 575 Leu Phe His Gly Arg Gly Gly Ser
Ile Gly Arg Gly Gly Ala Pro Ala 580 585
590 His Ala Ala Leu Leu Ser Gln Pro Pro Gly Ser Leu Lys
Gly Gly Leu 595 600 605
Arg Val Thr Glu Gln Gly Glu Met Ile Arg Phe Lys Tyr Gly Leu Pro 610
615 620 Glu Ile Thr Val
Ser Ser Leu Ser Leu Tyr Thr Gly Ala Ile Leu Glu 625 630
635 640 Ala Asn Leu Leu Pro Pro Pro Glu Pro
Lys Glu Ser Trp Arg Arg Ile 645 650
655 Met Asp Glu Leu Ser Val Ile Ser Cys Asp Val Tyr Arg Gly
Tyr Val 660 665 670
Arg Glu Asn Lys Asp Phe Val Pro Tyr Phe Arg Ser Ala Thr Pro Glu
675 680 685 Gln Glu Leu Gly
Lys Leu Pro Leu Gly Ser Arg Pro Ala Lys Arg Arg 690
695 700 Pro Thr Gly Gly Val Glu Ser Leu
Arg Ala Ile Pro Trp Ile Phe Ala 705 710
715 720 Trp Thr Gln Asn Arg Leu Met Leu Pro Ala Trp Leu
Gly Ala Gly Thr 725 730
735 Ala Leu Gln Lys Val Val Glu Asp Gly Lys Gln Ser Glu Leu Glu Ala
740 745 750 Met Cys Arg
Asp Trp Pro Phe Phe Ser Thr Arg Leu Gly Met Leu Glu 755
760 765 Met Val Phe Ala Lys Ala Asp Leu
Trp Leu Ala Glu Tyr Tyr Asp Gln 770 775
780 Arg Leu Val Asp Lys Ala Leu Trp Pro Leu Gly Lys Glu
Leu Arg Asn 785 790 795
800 Leu Gln Glu Glu Asp Ile Lys Val Val Leu Ala Ile Ala Asn Asp Ser
805 810 815 His Leu Met Ala
Asp Leu Pro Trp Ile Ala Glu Ser Ile Gln Leu Arg 820
825 830 Asn Ile Tyr Thr Asp Pro Leu Asn Val
Leu Gln Ala Glu Leu Leu His 835 840
845 Arg Ser Arg Gln Ala Glu Lys Glu Gly Gln Glu Pro Asp Pro
Arg Val 850 855 860
Glu Gln Ala Leu Met Val Thr Ile Ala Gly Ile Ala Ala Gly Met Arg 865
870 875 880 Asn Thr Gly
112661DNAEscherichia coli 11tcagaacgtt tcccaaatga cgtggatccg atcgaaactc
gcgactggct ccaggcgatc 60gaatcggtca tccgtgaaga aggtgttgag cgtgctcagt
atctgatcga ccaactgctt 120gctgaagccc gcaaaggcgg tgtaaacgta gccgcaggca
caggtatcag caactacatc 180aacaccatcc ccgttgaaga acaaccggag tatccgggta
atctggaact ggaacgccgt 240attcgttcag ctatccgctg gaacgccatc atgacggtgc
tgcgtgcgtc gaaaaaagac 300ctcgaactgg gcggccatat ggcgtccttc cagtcttccg
caaccattta tgatgtgtgc 360tttaaccact tcttccgtgc acgcaacgag caggatggcg
gcgacctggt ttacttccag 420ggccacatct ccccgggcgt gtacgctcgt gctttcctgg
aaggtcgtct gactcaggag 480cagctggata acttccgtca ggaagttcac ggcaatggcc
tctcttccta tccgcacccg 540aaactgatgc cggaattctg gcagttcccg accgtatcta
tgggtctggg tccgattggt 600gctatttacc aggctaaatt cctgaaatat ctggaacacc
gtggcctgaa agatacctct 660aaacaaaccg tttacgcgtt cctcggtgac ggtgaaatgg
acgaaccgga atccaaaggt 720gcgatcacca tcgctacccg tgaaaaactg gataacctgg
tcttcgttat caactgtaac 780ctgcagcgtc ttgacggccc ggtcaccggt aacggcaaga
tcatcaacga actggaaggc 840atcttcgaag gtgctggctg gaacgtgatc aaagtgatgt
ggggtagccg ttgggatgaa 900ctgctgcgta aggataccag cggtaaactg atccagctga
tgaacgaaac cgttgacggc 960gactaccaga ccttcaaatc gaaagatggt gcgtacgttc
gtgaacactt cttcggtaaa 1020tatcctgaaa ccgcagcact ggttgcagac tggactgacg
agcagatctg ggcactgaac 1080cgtggtggtc acgatccgaa gaaaatctac gctgcattca
agaaagcgca ggaaaccaaa 1140ggcaaagcga cagtaatcct tgctcatacc attaaaggtt
acggcatggg cgacgcggct 1200gaaggtaaaa acatcgcgca ccaggttaag aaaatgaaca
tggacggtgt gcgtcatatc 1260cgcgaccgtt tcaatgtgcc ggtgtctgat gcagatatcg
aaaaactgcc gtacatcacc 1320ttcccggaag gttctgaaga gcatacctat ctgcacgctc
agcgtcagaa actgcacggt 1380tatctgccaa gccgtcagcc gaacttcacc gagaagcttg
agctgccgag cctgcaagac 1440ttcggcgcgc tgttggaaga gcagagcaaa gagatctcta
ccactatcgc tttcgttcgt 1500gctctgaacg tgatgctgaa gaacaagtcg atcaaagatc
gtctggtacc gatcatcgcc 1560gacgaagcgc gtactttcgg tatggaaggt ctgttccgtc
agattggtat ttacagcccg 1620aacggtcagc agtacacccc gcaggaccgc gagcaggttg
cttactataa agaagacgag 1680aaaggtcaga ttctgcagga agggatcaac gagctgggcg
caggttgttc ctggctggca 1740gcggcgacct cttacagcac caacaatctg ccgatgatcc
cgttctacat ctattactcg 1800atgttcggct tccagcgtat tggcgatctg tgctgggcgg
ctggcgacca gcaagcgcgt 1860ggcttcctga tcggcggtac ttccggtcgt accaccctga
acggcgaagg tctgcagcac 1920gaagatggtc acagccacat tcagtcgctg actatcccga
actgtatctc ttacgacccg 1980gcttacgctt acgaagttgc tgtcatcatg catgacggtc
tggagcgtat gtacggtgaa 2040aaacaagaga acgtttacta ctacatcact acgctgaacg
aaaactacca catgccggca 2100atgccggaag gtgctgagga aggtatccgt aaaggtatct
acaaactcga aactattgaa 2160ggtagcaaag gtaaagttca gctgctcggc tccggttcta
tcctgcgtca cgtccgtgaa 2220gcagctgaga tcctggcgaa agattacggc gtaggttctg
acgtttatag cgtgacctcc 2280ttcaccgagc tggcgcgtga tggtcaggat tgtgaacgct
ggaacatgct gcacccgctg 2340gaaactccgc gcgttccgta tatcgctcag gtgatgaacg
acgctccggc agtggcatct 2400accgactata tgaaactgtt cgctgagcag gtccgtactt
acgtaccggc tgacgactac 2460cgcgtactgg gtactgatgg cttcggtcgt tccgacagcc
gtgagaacct gcgtcaccac 2520ttcgaagttg atgcttctta tgtcgtggtt gcggcgctgg
gcgaactggc taaacgtggc 2580gaaatcgata agaaagtggt tgctgacgca atcgccaaat
tcaacatcga tgcagataaa 2640gttaacccgc gtctggcgta a
266112887PRTEscherichia coli 12Met Ser Glu Arg Phe
Pro Asn Asp Val Asp Pro Ile Glu Thr Arg Asp 1 5
10 15 Trp Leu Gln Ala Ile Glu Ser Val Ile Arg
Glu Glu Gly Val Glu Arg 20 25
30 Ala Gln Tyr Leu Ile Asp Gln Leu Leu Ala Glu Ala Arg Lys Gly
Gly 35 40 45 Val
Asn Val Ala Ala Gly Thr Gly Ile Ser Asn Tyr Ile Asn Thr Ile 50
55 60 Pro Val Glu Glu Gln Pro
Glu Tyr Pro Gly Asn Leu Glu Leu Glu Arg 65 70
75 80 Arg Ile Arg Ser Ala Ile Arg Trp Asn Ala Ile
Met Thr Val Leu Arg 85 90
95 Ala Ser Lys Lys Asp Leu Glu Leu Gly Gly His Met Ala Ser Phe Gln
100 105 110 Ser Ser
Ala Thr Ile Tyr Asp Val Cys Phe Asn His Phe Phe Arg Ala 115
120 125 Arg Asn Glu Gln Asp Gly Gly
Asp Leu Val Tyr Phe Gln Gly His Ile 130 135
140 Ser Pro Gly Val Tyr Ala Arg Ala Phe Leu Glu Gly
Arg Leu Thr Gln 145 150 155
160 Glu Gln Leu Asp Asn Phe Arg Gln Glu Val His Gly Asn Gly Leu Ser
165 170 175 Ser Tyr Pro
His Pro Lys Leu Met Pro Glu Phe Trp Gln Phe Pro Thr 180
185 190 Val Ser Met Gly Leu Gly Pro Ile
Gly Ala Ile Tyr Gln Ala Lys Phe 195 200
205 Leu Lys Tyr Leu Glu His Arg Gly Leu Lys Asp Thr Ser
Lys Gln Thr 210 215 220
Val Tyr Ala Phe Leu Gly Asp Gly Glu Met Asp Glu Pro Glu Ser Lys 225
230 235 240 Gly Ala Ile Thr
Ile Ala Thr Arg Glu Lys Leu Asp Asn Leu Val Phe 245
250 255 Val Ile Asn Cys Asn Leu Gln Arg Leu
Asp Gly Pro Val Thr Gly Asn 260 265
270 Gly Lys Ile Ile Asn Glu Leu Glu Gly Ile Phe Glu Gly Ala
Gly Trp 275 280 285
Asn Val Ile Lys Val Met Trp Gly Ser Arg Trp Asp Glu Leu Leu Arg 290
295 300 Lys Asp Thr Ser Gly
Lys Leu Ile Gln Leu Met Asn Glu Thr Val Asp 305 310
315 320 Gly Asp Tyr Gln Thr Phe Lys Ser Lys Asp
Gly Ala Tyr Val Arg Glu 325 330
335 His Phe Phe Gly Lys Tyr Pro Glu Thr Ala Ala Leu Val Ala Asp
Trp 340 345 350 Thr
Asp Glu Gln Ile Trp Ala Leu Asn Arg Gly Gly His Asp Pro Lys 355
360 365 Lys Ile Tyr Ala Ala Phe
Lys Lys Ala Gln Glu Thr Lys Gly Lys Ala 370 375
380 Thr Val Ile Leu Ala His Thr Ile Lys Gly Tyr
Gly Met Gly Asp Ala 385 390 395
400 Ala Glu Gly Lys Asn Ile Ala His Gln Val Lys Lys Met Asn Met Asp
405 410 415 Gly Val
Arg His Ile Arg Asp Arg Phe Asn Val Pro Val Ser Asp Ala 420
425 430 Asp Ile Glu Lys Leu Pro Tyr
Ile Thr Phe Pro Glu Gly Ser Glu Glu 435 440
445 His Thr Tyr Leu His Ala Gln Arg Gln Lys Leu His
Gly Tyr Leu Pro 450 455 460
Ser Arg Gln Pro Asn Phe Thr Glu Lys Leu Glu Leu Pro Ser Leu Gln 465
470 475 480 Asp Phe Gly
Ala Leu Leu Glu Glu Gln Ser Lys Glu Ile Ser Thr Thr 485
490 495 Ile Ala Phe Val Arg Ala Leu Asn
Val Met Leu Lys Asn Lys Ser Ile 500 505
510 Lys Asp Arg Leu Val Pro Ile Ile Ala Asp Glu Ala Arg
Thr Phe Gly 515 520 525
Met Glu Gly Leu Phe Arg Gln Ile Gly Ile Tyr Ser Pro Asn Gly Gln 530
535 540 Gln Tyr Thr Pro
Gln Asp Arg Glu Gln Val Ala Tyr Tyr Lys Glu Asp 545 550
555 560 Glu Lys Gly Gln Ile Leu Gln Glu Gly
Ile Asn Glu Leu Gly Ala Gly 565 570
575 Cys Ser Trp Leu Ala Ala Ala Thr Ser Tyr Ser Thr Asn Asn
Leu Pro 580 585 590
Met Ile Pro Phe Tyr Ile Tyr Tyr Ser Met Phe Gly Phe Gln Arg Ile
595 600 605 Gly Asp Leu Cys
Trp Ala Ala Gly Asp Gln Gln Ala Arg Gly Phe Leu 610
615 620 Ile Gly Gly Thr Ser Gly Arg Thr
Thr Leu Asn Gly Glu Gly Leu Gln 625 630
635 640 His Glu Asp Gly His Ser His Ile Gln Ser Leu Thr
Ile Pro Asn Cys 645 650
655 Ile Ser Tyr Asp Pro Ala Tyr Ala Tyr Glu Val Ala Val Ile Met His
660 665 670 Asp Gly Leu
Glu Arg Met Tyr Gly Glu Lys Gln Glu Asn Val Tyr Tyr 675
680 685 Tyr Ile Thr Thr Leu Asn Glu Asn
Tyr His Met Pro Ala Met Pro Glu 690 695
700 Gly Ala Glu Glu Gly Ile Arg Lys Gly Ile Tyr Lys Leu
Glu Thr Ile 705 710 715
720 Glu Gly Ser Lys Gly Lys Val Gln Leu Leu Gly Ser Gly Ser Ile Leu
725 730 735 Arg His Val Arg
Glu Ala Ala Glu Ile Leu Ala Lys Asp Tyr Gly Val 740
745 750 Gly Ser Asp Val Tyr Ser Val Thr Ser
Phe Thr Glu Leu Ala Arg Asp 755 760
765 Gly Gln Asp Cys Glu Arg Trp Asn Met Leu His Pro Leu Glu
Thr Pro 770 775 780
Arg Val Pro Tyr Ile Ala Gln Val Met Asn Asp Ala Pro Ala Val Ala 785
790 795 800 Ser Thr Asp Tyr Met
Lys Leu Phe Ala Glu Gln Val Arg Thr Tyr Val 805
810 815 Pro Ala Asp Asp Tyr Arg Val Leu Gly Thr
Asp Gly Phe Gly Arg Ser 820 825
830 Asp Ser Arg Glu Asn Leu Arg His His Phe Glu Val Asp Ala Ser
Tyr 835 840 845 Val
Val Val Ala Ala Leu Gly Glu Leu Ala Lys Arg Gly Glu Ile Asp 850
855 860 Lys Lys Val Val Ala Asp
Ala Ile Ala Lys Phe Asn Ile Asp Ala Asp 865 870
875 880 Lys Val Asn Pro Arg Leu Ala
885 131887DNAEscherichia coli 13gctatcgaaa tcaaagtacc ggacatcggg
gctgatgaag ttgaaatcac cgagatcctg 60gtcaaagtgg gcgacaaagt tgaagccgaa
cagtcgctga tcaccgtaga aggcgacaaa 120gcctctatgg aagttccgtc tccgcaggcg
ggtatcgtta aagagatcaa agtctctgtt 180ggcgataaaa cccagaccgg cgcactgatt
atgattttcg attccgccga cggtgcagca 240gacgctgcac ctgctcaggc agaagagaag
aaagaagcag ctccggcagc agcaccagcg 300gctgcggcgg caaaagacgt taacgttccg
gatatcggca gcgacgaagt tgaagtgacc 360gaaatcctgg tgaaagttgg cgataaagtt
gaagctgaac agtcgctgat caccgtagaa 420ggcgacaagg cttctatgga agttccggct
ccgtttgctg gcaccgtgaa agagatcaaa 480gtgaacgtgg gtgacaaagt gtctaccggc
tcgctgatta tggtcttcga agtcgcgggt 540gaagcaggcg cggcagctcc ggccgctaaa
caggaagcag ctccggcagc ggcccctgca 600ccagcggctg gcgtgaaaga agttaacgtt
ccggatatcg gcggtgacga agttgaagtg 660actgaagtga tggtgaaagt gggcgacaaa
gttgccgctg aacagtcact gatcaccgta 720gaaggcgaca aagcttctat ggaagttccg
gcgccgtttg caggcgtcgt gaaggaactg 780aaagtcaacg ttggcgataa agtgaaaact
ggctcgctga ttatgatctt cgaagttgaa 840ggcgcagcgc ctgcggcagc tcctgcgaaa
caggaagcgg cagcgccggc accggcagca 900aaagctgaag ccccggcagc agcaccagct
gcgaaagcgg aaggcaaatc tgaatttgct 960gaaaacgacg cttatgttca cgcgactccg
ctgatccgcc gtctggcacg cgagtttggt 1020gttaaccttg cgaaagtgaa gggcactggc
cgtaaaggtc gtatcctgcg cgaagacgtt 1080caggcttacg tgaaagaagc tatcaaacgt
gcagaagcag ctccggcagc gactggcggt 1140ggtatccctg gcatgctgcc gtggccgaag
gtggacttca gcaagtttgg tgaaatcgaa 1200gaagtggaac tgggccgcat ccagaaaatc
tctggtgcga acctgagccg taactgggta 1260atgatcccgc atgttactca cttcgacaaa
accgatatca ccgagttgga agcgttccgt 1320aaacagcaga acgaagaagc ggcgaaacgt
aagctggatg tgaagatcac cccggttgtc 1380ttcatcatga aagccgttgc tgcagctctt
gagcagatgc ctcgcttcaa tagttcgctg 1440tcggaagacg gtcagcgtct gaccctgaag
aaatacatca acatcggtgt ggcggtggat 1500accccgaacg gtctggttgt tccggtattc
aaagacgtca acaagaaagg catcatcgag 1560ctgtctcgcg agctgatgac tatttctaag
aaagcgcgtg acggtaagct gactgcgggc 1620gaaatgcagg gcggttgctt caccatctcc
agcatcggcg gcctgggtac tacccacttc 1680gcgccgattg tgaacgcgcc ggaagtggct
atcctcggcg tttccaagtc cgcgatggag 1740ccggtgtgga atggtaaaga gttcgtgccg
cgtctgatgc tgccgatttc tctctccttc 1800gaccaccgcg tgatcgacgg tgctgatggt
gcccgtttca ttaccatcat taacaacacg 1860ctgtctgaca ttcgccgtct ggtgatg
188714630PRTEscherichia coli 14Met Ala
Ile Glu Ile Lys Val Pro Asp Ile Gly Ala Asp Glu Val Glu 1 5
10 15 Ile Thr Glu Ile Leu Val Lys
Val Gly Asp Lys Val Glu Ala Glu Gln 20 25
30 Ser Leu Ile Thr Val Glu Gly Asp Lys Ala Ser Met
Glu Val Pro Ser 35 40 45
Pro Gln Ala Gly Ile Val Lys Glu Ile Lys Val Ser Val Gly Asp Lys
50 55 60 Thr Gln Thr
Gly Ala Leu Ile Met Ile Phe Asp Ser Ala Asp Gly Ala 65
70 75 80 Ala Asp Ala Ala Pro Ala Gln
Ala Glu Glu Lys Lys Glu Ala Ala Pro 85
90 95 Ala Ala Ala Pro Ala Ala Ala Ala Ala Lys Asp
Val Asn Val Pro Asp 100 105
110 Ile Gly Ser Asp Glu Val Glu Val Thr Glu Ile Leu Val Lys Val
Gly 115 120 125 Asp
Lys Val Glu Ala Glu Gln Ser Leu Ile Thr Val Glu Gly Asp Lys 130
135 140 Ala Ser Met Glu Val Pro
Ala Pro Phe Ala Gly Thr Val Lys Glu Ile 145 150
155 160 Lys Val Asn Val Gly Asp Lys Val Ser Thr Gly
Ser Leu Ile Met Val 165 170
175 Phe Glu Val Ala Gly Glu Ala Gly Ala Ala Ala Pro Ala Ala Lys Gln
180 185 190 Glu Ala
Ala Pro Ala Ala Ala Pro Ala Pro Ala Ala Gly Val Lys Glu 195
200 205 Val Asn Val Pro Asp Ile Gly
Gly Asp Glu Val Glu Val Thr Glu Val 210 215
220 Met Val Lys Val Gly Asp Lys Val Ala Ala Glu Gln
Ser Leu Ile Thr 225 230 235
240 Val Glu Gly Asp Lys Ala Ser Met Glu Val Pro Ala Pro Phe Ala Gly
245 250 255 Val Val Lys
Glu Leu Lys Val Asn Val Gly Asp Lys Val Lys Thr Gly 260
265 270 Ser Leu Ile Met Ile Phe Glu Val
Glu Gly Ala Ala Pro Ala Ala Ala 275 280
285 Pro Ala Lys Gln Glu Ala Ala Ala Pro Ala Pro Ala Ala
Lys Ala Glu 290 295 300
Ala Pro Ala Ala Ala Pro Ala Ala Lys Ala Glu Gly Lys Ser Glu Phe 305
310 315 320 Ala Glu Asn Asp
Ala Tyr Val His Ala Thr Pro Leu Ile Arg Arg Leu 325
330 335 Ala Arg Glu Phe Gly Val Asn Leu Ala
Lys Val Lys Gly Thr Gly Arg 340 345
350 Lys Gly Arg Ile Leu Arg Glu Asp Val Gln Ala Tyr Val Lys
Glu Ala 355 360 365
Ile Lys Arg Ala Glu Ala Ala Pro Ala Ala Thr Gly Gly Gly Ile Pro 370
375 380 Gly Met Leu Pro Trp
Pro Lys Val Asp Phe Ser Lys Phe Gly Glu Ile 385 390
395 400 Glu Glu Val Glu Leu Gly Arg Ile Gln Lys
Ile Ser Gly Ala Asn Leu 405 410
415 Ser Arg Asn Trp Val Met Ile Pro His Val Thr His Phe Asp Lys
Thr 420 425 430 Asp
Ile Thr Glu Leu Glu Ala Phe Arg Lys Gln Gln Asn Glu Glu Ala 435
440 445 Ala Lys Arg Lys Leu Asp
Val Lys Ile Thr Pro Val Val Phe Ile Met 450 455
460 Lys Ala Val Ala Ala Ala Leu Glu Gln Met Pro
Arg Phe Asn Ser Ser 465 470 475
480 Leu Ser Glu Asp Gly Gln Arg Leu Thr Leu Lys Lys Tyr Ile Asn Ile
485 490 495 Gly Val
Ala Val Asp Thr Pro Asn Gly Leu Val Val Pro Val Phe Lys 500
505 510 Asp Val Asn Lys Lys Gly Ile
Ile Glu Leu Ser Arg Glu Leu Met Thr 515 520
525 Ile Ser Lys Lys Ala Arg Asp Gly Lys Leu Thr Ala
Gly Glu Met Gln 530 535 540
Gly Gly Cys Phe Thr Ile Ser Ser Ile Gly Gly Leu Gly Thr Thr His 545
550 555 560 Phe Ala Pro
Ile Val Asn Ala Pro Glu Val Ala Ile Leu Gly Val Ser 565
570 575 Lys Ser Ala Met Glu Pro Val Trp
Asn Gly Lys Glu Phe Val Pro Arg 580 585
590 Leu Met Leu Pro Ile Ser Leu Ser Phe Asp His Arg Val
Ile Asp Gly 595 600 605
Ala Asp Gly Ala Arg Phe Ile Thr Ile Ile Asn Asn Thr Leu Ser Asp 610
615 620 Ile Arg Arg Leu
Val Met 625 630 151422DNAEscherichia coli 15agtactgaaa
tcaaaactca ggtcgtggta cttggggcag gccccgcagg ttactccgct 60gccttccgtt
gcgctgattt aggtctggaa accgtaatcg tagaacgtta caacaccctt 120ggcggtgttt
gcctgaacgt cggctgtatc ccttctaaag cactgctgca cgtagcaaaa 180gttatcgaag
aagccaaagc gctggctgaa cacggtatcg tcttcggcga accgaaaacc 240gatatcgaca
agattcgtac ctggaaagag aaagtgatca atcagctgac cggtggtctg 300gctggtatgg
cgaaaggccg caaagtcaaa gtggtcaacg gtctgggtaa attcaccggg 360gctaacaccc
tggaagttga aggtgagaac ggcaaaaccg tgatcaactt cgacaacgcg 420atcattgcag
cgggttctcg cccgatccaa ctgccgttta ttccgcatga agatccgcgt 480atctgggact
ccactgacgc gctggaactg aaagaagtac cagaacgcct gctggtaatg 540ggtggcggta
tcatcggtct ggaaatgggc accgtttacc acgcgctggg ttcacagatt 600gacgtggttg
aaatgttcga ccaggttatc ccggcagctg acaaagacat cgttaaagtc 660ttcaccaagc
gtatcagcaa gaaattcaac ctgatgctgg aaaccaaagt taccgccgtt 720gaagcgaaag
aagacggcat ttatgtgacg atggaaggca aaaaagcacc cgctgaaccg 780cagcgttacg
acgccgtgct ggtagcgatt ggtcgtgtgc cgaacggtaa aaacctcgac 840gcaggcaaag
caggcgtgga agttgacgac cgtggtttca tccgcgttga caaacagctg 900cgtaccaacg
taccgcacat ctttgctatc ggcgatatcg tcggtcaacc gatgctggca 960cacaaaggtg
ttcacgaagg tcacgttgcc gctgaagtta tcgccggtaa gaaacactac 1020ttcgatccga
aagttatccc gtccatcgcc tataccgaac cagaagttgc atgggtgggt 1080ctgactgaga
aagaagcgaa agagaaaggc atcagctatg aaaccgccac cttcccgtgg 1140gctgcttctg
gtcgtgctat cgcttccgac tgcgcagacg gtatgaccaa gctgattttc 1200gacaaagaat
ctcaccgtgt gatcggtggt gcgattgtcg gtactaacgg cggcgagctg 1260ctgggtgaaa
tcggcctggc aatcgaaatg ggttgtgatg ctgaagacat cgcactgacc 1320atccacgcgc
acccgactct gcacgagtct gtgggcctgg cggcagaagt gttcgaaggt 1380agcattaccg
acctgccgaa cccgaaagcg aagaagaagt aa
142216474PRTEscherichia coli 16Met Ser Thr Glu Ile Lys Thr Gln Val Val
Val Leu Gly Ala Gly Pro 1 5 10
15 Ala Gly Tyr Ser Ala Ala Phe Arg Cys Ala Asp Leu Gly Leu Glu
Thr 20 25 30 Val
Ile Val Glu Arg Tyr Asn Thr Leu Gly Gly Val Cys Leu Asn Val 35
40 45 Gly Cys Ile Pro Ser Lys
Ala Leu Leu His Val Ala Lys Val Ile Glu 50 55
60 Glu Ala Lys Ala Leu Ala Glu His Gly Ile Val
Phe Gly Glu Pro Lys 65 70 75
80 Thr Asp Ile Asp Lys Ile Arg Thr Trp Lys Glu Lys Val Ile Asn Gln
85 90 95 Leu Thr
Gly Gly Leu Ala Gly Met Ala Lys Gly Arg Lys Val Lys Val 100
105 110 Val Asn Gly Leu Gly Lys Phe
Thr Gly Ala Asn Thr Leu Glu Val Glu 115 120
125 Gly Glu Asn Gly Lys Thr Val Ile Asn Phe Asp Asn
Ala Ile Ile Ala 130 135 140
Ala Gly Ser Arg Pro Ile Gln Leu Pro Phe Ile Pro His Glu Asp Pro 145
150 155 160 Arg Ile Trp
Asp Ser Thr Asp Ala Leu Glu Leu Lys Glu Val Pro Glu 165
170 175 Arg Leu Leu Val Met Gly Gly Gly
Ile Ile Gly Leu Glu Met Gly Thr 180 185
190 Val Tyr His Ala Leu Gly Ser Gln Ile Asp Val Val Glu
Met Phe Asp 195 200 205
Gln Val Ile Pro Ala Ala Asp Lys Asp Ile Val Lys Val Phe Thr Lys 210
215 220 Arg Ile Ser Lys
Lys Phe Asn Leu Met Leu Glu Thr Lys Val Thr Ala 225 230
235 240 Val Glu Ala Lys Glu Asp Gly Ile Tyr
Val Thr Met Glu Gly Lys Lys 245 250
255 Ala Pro Ala Glu Pro Gln Arg Tyr Asp Ala Val Leu Val Ala
Ile Gly 260 265 270
Arg Val Pro Asn Gly Lys Asn Leu Asp Ala Gly Lys Ala Gly Val Glu
275 280 285 Val Asp Asp Arg
Gly Phe Ile Arg Val Asp Lys Gln Leu Arg Thr Asn 290
295 300 Val Pro His Ile Phe Ala Ile Gly
Asp Ile Val Gly Gln Pro Met Leu 305 310
315 320 Ala His Lys Gly Val His Glu Gly His Val Ala Ala
Glu Val Ile Ala 325 330
335 Gly Lys Lys His Tyr Phe Asp Pro Lys Val Ile Pro Ser Ile Ala Tyr
340 345 350 Thr Glu Pro
Glu Val Ala Trp Val Gly Leu Thr Glu Lys Glu Ala Lys 355
360 365 Glu Lys Gly Ile Ser Tyr Glu Thr
Ala Thr Phe Pro Trp Ala Ala Ser 370 375
380 Gly Arg Ala Ile Ala Ser Asp Cys Ala Asp Gly Met Thr
Lys Leu Ile 385 390 395
400 Phe Asp Lys Glu Ser His Arg Val Ile Gly Gly Ala Ile Val Gly Thr
405 410 415 Asn Gly Gly Glu
Leu Leu Gly Glu Ile Gly Leu Ala Ile Glu Met Gly 420
425 430 Cys Asp Ala Glu Asp Ile Ala Leu Thr
Ile His Ala His Pro Thr Leu 435 440
445 His Glu Ser Val Gly Leu Ala Ala Glu Val Phe Glu Gly Ser
Ile Thr 450 455 460
Asp Leu Pro Asn Pro Lys Ala Lys Lys Lys 465 470
17987DNAEscherichia coli 17aaactcgccg tttatagcac aaaacagtac
gacaagaagt acctgcaaca ggtgaacgag 60tcctttggct ttgagctgga attttttgac
tttctgctga cggaaaaaac cgctaaaact 120gccaatggct gcgaagcggt atgtattttc
gtaaacgatg acggcagccg cccggtgctg 180gaagagctga aaaagcacgg cgttaaatat
atcgccctgc gctgtgccgg tttcaataac 240gtcgaccttg acgcggcaaa agaactgggg
ctgaaagtag tccgtgttcc agcctatgat 300ccagaggccg ttgctgaaca cgccatcggt
atgatgatga cgctgaaccg ccgtattcac 360cgcgcgtatc agcgtacccg tgatgctaac
ttctctctgg aaggtctgac cggctttact 420atgtatggca aaacggcagg cgttatcggt
accggtaaaa tcggtgtggc gatgctgcgc 480attctgaaag gttttggtat gcgtctgctg
gcgttcgatc cgtatccaag tgcagcggcg 540ctggaactcg gtgtggagta tgtcgatctg
ccaaccctgt tctctgaatc agacgttatc 600tctctgcact gcccgctgac accggaaaac
tatcatctgt tgaacgaagc cgccttcgaa 660cagatgaaaa atggcgtgat gatcgtcaat
accagtcgcg gtgcattgat tgattctcag 720gcagcaattg aagcgctgaa aaatcagaaa
attggttcgt tgggtatgga cgtgtatgag 780aacgaacgcg atctattctt tgaagataaa
tccaacgacg tgatccagga tgacgtattc 840cgtcgcctgt ctgcctgcca caacgtgctg
tttaccgggc accaggcatt cctgacagca 900gaagctctga ccagtatttc tcagactacg
ctgcaaaact taagcaatct ggaaaaaggc 960gaaacctgcc cgaacgaact ggtttaa
98718329PRTEscherichia coli 18Met Lys
Leu Ala Val Tyr Ser Thr Lys Gln Tyr Asp Lys Lys Tyr Leu 1 5
10 15 Gln Gln Val Asn Glu Ser Phe
Gly Phe Glu Leu Glu Phe Phe Asp Phe 20 25
30 Leu Leu Thr Glu Lys Thr Ala Lys Thr Ala Asn Gly
Cys Glu Ala Val 35 40 45
Cys Ile Phe Val Asn Asp Asp Gly Ser Arg Pro Val Leu Glu Glu Leu
50 55 60 Lys Lys His
Gly Val Lys Tyr Ile Ala Leu Arg Cys Ala Gly Phe Asn 65
70 75 80 Asn Val Asp Leu Asp Ala Ala
Lys Glu Leu Gly Leu Lys Val Val Arg 85
90 95 Val Pro Ala Tyr Asp Pro Glu Ala Val Ala Glu
His Ala Ile Gly Met 100 105
110 Met Met Thr Leu Asn Arg Arg Ile His Arg Ala Tyr Gln Arg Thr
Arg 115 120 125 Asp
Ala Asn Phe Ser Leu Glu Gly Leu Thr Gly Phe Thr Met Tyr Gly 130
135 140 Lys Thr Ala Gly Val Ile
Gly Thr Gly Lys Ile Gly Val Ala Met Leu 145 150
155 160 Arg Ile Leu Lys Gly Phe Gly Met Arg Leu Leu
Ala Phe Asp Pro Tyr 165 170
175 Pro Ser Ala Ala Ala Leu Glu Leu Gly Val Glu Tyr Val Asp Leu Pro
180 185 190 Thr Leu
Phe Ser Glu Ser Asp Val Ile Ser Leu His Cys Pro Leu Thr 195
200 205 Pro Glu Asn Tyr His Leu Leu
Asn Glu Ala Ala Phe Glu Gln Met Lys 210 215
220 Asn Gly Val Met Ile Val Asn Thr Ser Arg Gly Ala
Leu Ile Asp Ser 225 230 235
240 Gln Ala Ala Ile Glu Ala Leu Lys Asn Gln Lys Ile Gly Ser Leu Gly
245 250 255 Met Asp Val
Tyr Glu Asn Glu Arg Asp Leu Phe Phe Glu Asp Lys Ser 260
265 270 Asn Asp Val Ile Gln Asp Asp Val
Phe Arg Arg Leu Ser Ala Cys His 275 280
285 Asn Val Leu Phe Thr Gly His Gln Ala Phe Leu Thr Ala
Glu Ala Leu 290 295 300
Thr Ser Ile Ser Gln Thr Thr Leu Gln Asn Leu Ser Asn Leu Glu Lys 305
310 315 320 Gly Glu Thr Cys
Pro Asn Glu Leu Val 325
192277DNAEscherichia coli 19tccgagctta atgaaaagtt agccacagcc tgggaaggtt
ttaccaaagg tgactggcag 60aatgaagtaa acgtccgtga cttcattcag aaaaactaca
ctccgtacga gggtgacgag 120tccttcctgg ctggcgctac tgaagcgacc accaccctgt
gggacaaagt aatggaaggc 180gttaaactgg aaaaccgcac tcacgcgcca gttgactttg
acaccgctgt tgcttccacc 240atcacctctc acgacgctgg ctacatcaac aagcagcttg
agaaaatcgt tggtctgcag 300actgaagctc cgctgaaacg tgctcttatc ccgttcggtg
gtatcaaaat gatcgaaggt 360tcctgcaaag cgtacaaccg cgaactggat ccgatgatca
aaaaaatctt cactgaatac 420cgtaaaactc acaaccaggg cgtgttcgac gtttacactc
cggacatcct gcgttgccgt 480aaatctggtg ttctgaccgg tctgccagat gcatatggcc
gtggccgtat catcggtgac 540taccgtcgcg ttgcgctgta cggtatcgac tacctgatga
aagacaaact ggcacagttc 600acttctctgc aggctgatct ggaaaacggc gtaaacctgg
aacagactat ccgtctgcgc 660gaagaaatcg ctgaacagca ccgcgctctg ggtcagatga
aagaaatggc tgcgaaatac 720ggctacgaca tctctggtcc ggctaccaac gctcaggaag
ctatccagtg gacttacttc 780ggctacctgg ctgctgttaa gtctcagaac ggtgctgcaa
tgtccttcgg tcgtacctcc 840accttcctgg atgtgtacat cgaacgtgac ctgaaagctg
gcaagatcac cgaacaagaa 900gcgcaggaaa tggttgacca cctggtcatg aaactgcgta
tggttcgctt cctgcgtact 960ccggaatacg atgaactgtt ctctggcgac ccgatctggg
caaccgaatc tatcggtggt 1020atgggcctcg acggtcgtac cctggttacc aaaaacagct
tccgtttcct gaacaccctg 1080tacaccatgg gtccgtctcc ggaaccgaac atgaccattc
tgtggtctga aaaactgccg 1140ctgaacttca agaaattcgc cgctaaagtg tccatcgaca
cctcttctct gcagtatgag 1200aacgatgacc tgatgcgtcc ggacttcaac aacgatgact
acgctattgc ttgctgcgta 1260agcccgatga tcgttggtaa acaaatgcag ttcttcggtg
cgcgtgcaaa cctggcgaaa 1320accatgctgt acgcaatcaa cggcggcgtt gacgaaaaac
tgaaaatgca ggttggtccg 1380aagtctgaac cgatcaaagg cgatgtcctg aactatgatg
aagtgatgga gcgcatggat 1440cacttcatgg actggctggc taaacagtac atcactgcac
tgaacatcat ccactacatg 1500cacgacaagt acagctacga agcctctctg atggcgctgc
acgaccgtga cgttatccgc 1560accatggcgt gtggtatcgc tggtctgtcc gttgctgctg
actccctgtc tgcaatcaaa 1620tatgcgaaag ttaaaccgat tcgtgacgaa gacggtctgg
ctatcgactt cgaaatcgaa 1680ggcgaatacc cgcagtttgg taacaatgat ccgcgtgtag
atgacctggc tgttgacctg 1740gtagaacgtt tcatgaagaa aattcagaaa ctgcacacct
accgtgacgc tatcccgact 1800cagtctgttc tgaccatcac ttctaacgtt gtgtatggta
agaaaacggg taacacccca 1860gacggtcgtc gtgctggcgc gccgttcgga ccgggtgcta
acccgatgca cggtcgtgac 1920cagaaaggtg cagtagcctc tctgacttcc gttgctaaac
tgccgtttgc ttacgctaaa 1980gatggtatct cctacacctt ctctatcgtt ccgaacgcac
tgggtaaaga cgacgaagtt 2040cgtaagacca acctggctgg tctgatggat ggttacttcc
accacgaagc atccatcgaa 2100ggtggtcagc acctgaacgt taacgtgatg aaccgtgaaa
tgctgctcga cgcgatggaa 2160aacccggaaa aatatccgca gctgaccatc cgtgtatctg
gctacgcagt acgtttcaac 2220tcgctgacta aagaacagca gcaggacgtt attactcgta
ccttcactca atctatg 227720760PRTEscherichia coli 20Met Ser Glu Leu
Asn Glu Lys Leu Ala Thr Ala Trp Glu Gly Phe Thr 1 5
10 15 Lys Gly Asp Trp Gln Asn Glu Val Asn
Val Arg Asp Phe Ile Gln Lys 20 25
30 Asn Tyr Thr Pro Tyr Glu Gly Asp Glu Ser Phe Leu Ala Gly
Ala Thr 35 40 45
Glu Ala Thr Thr Thr Leu Trp Asp Lys Val Met Glu Gly Val Lys Leu 50
55 60 Glu Asn Arg Thr His
Ala Pro Val Asp Phe Asp Thr Ala Val Ala Ser 65 70
75 80 Thr Ile Thr Ser His Asp Ala Gly Tyr Ile
Asn Lys Gln Leu Glu Lys 85 90
95 Ile Val Gly Leu Gln Thr Glu Ala Pro Leu Lys Arg Ala Leu Ile
Pro 100 105 110 Phe
Gly Gly Ile Lys Met Ile Glu Gly Ser Cys Lys Ala Tyr Asn Arg 115
120 125 Glu Leu Asp Pro Met Ile
Lys Lys Ile Phe Thr Glu Tyr Arg Lys Thr 130 135
140 His Asn Gln Gly Val Phe Asp Val Tyr Thr Pro
Asp Ile Leu Arg Cys 145 150 155
160 Arg Lys Ser Gly Val Leu Thr Gly Leu Pro Asp Ala Tyr Gly Arg Gly
165 170 175 Arg Ile
Ile Gly Asp Tyr Arg Arg Val Ala Leu Tyr Gly Ile Asp Tyr 180
185 190 Leu Met Lys Asp Lys Leu Ala
Gln Phe Thr Ser Leu Gln Ala Asp Leu 195 200
205 Glu Asn Gly Val Asn Leu Glu Gln Thr Ile Arg Leu
Arg Glu Glu Ile 210 215 220
Ala Glu Gln His Arg Ala Leu Gly Gln Met Lys Glu Met Ala Ala Lys 225
230 235 240 Tyr Gly Tyr
Asp Ile Ser Gly Pro Ala Thr Asn Ala Gln Glu Ala Ile 245
250 255 Gln Trp Thr Tyr Phe Gly Tyr Leu
Ala Ala Val Lys Ser Gln Asn Gly 260 265
270 Ala Ala Met Ser Phe Gly Arg Thr Ser Thr Phe Leu Asp
Val Tyr Ile 275 280 285
Glu Arg Asp Leu Lys Ala Gly Lys Ile Thr Glu Gln Glu Ala Gln Glu 290
295 300 Met Val Asp His
Leu Val Met Lys Leu Arg Met Val Arg Phe Leu Arg 305 310
315 320 Thr Pro Glu Tyr Asp Glu Leu Phe Ser
Gly Asp Pro Ile Trp Ala Thr 325 330
335 Glu Ser Ile Gly Gly Met Gly Leu Asp Gly Arg Thr Leu Val
Thr Lys 340 345 350
Asn Ser Phe Arg Phe Leu Asn Thr Leu Tyr Thr Met Gly Pro Ser Pro
355 360 365 Glu Pro Asn Met
Thr Ile Leu Trp Ser Glu Lys Leu Pro Leu Asn Phe 370
375 380 Lys Lys Phe Ala Ala Lys Val Ser
Ile Asp Thr Ser Ser Leu Gln Tyr 385 390
395 400 Glu Asn Asp Asp Leu Met Arg Pro Asp Phe Asn Asn
Asp Asp Tyr Ala 405 410
415 Ile Ala Cys Cys Val Ser Pro Met Ile Val Gly Lys Gln Met Gln Phe
420 425 430 Phe Gly Ala
Arg Ala Asn Leu Ala Lys Thr Met Leu Tyr Ala Ile Asn 435
440 445 Gly Gly Val Asp Glu Lys Leu Lys
Met Gln Val Gly Pro Lys Ser Glu 450 455
460 Pro Ile Lys Gly Asp Val Leu Asn Tyr Asp Glu Val Met
Glu Arg Met 465 470 475
480 Asp His Phe Met Asp Trp Leu Ala Lys Gln Tyr Ile Thr Ala Leu Asn
485 490 495 Ile Ile His Tyr
Met His Asp Lys Tyr Ser Tyr Glu Ala Ser Leu Met 500
505 510 Ala Leu His Asp Arg Asp Val Ile Arg
Thr Met Ala Cys Gly Ile Ala 515 520
525 Gly Leu Ser Val Ala Ala Asp Ser Leu Ser Ala Ile Lys Tyr
Ala Lys 530 535 540
Val Lys Pro Ile Arg Asp Glu Asp Gly Leu Ala Ile Asp Phe Glu Ile 545
550 555 560 Glu Gly Glu Tyr Pro
Gln Phe Gly Asn Asn Asp Pro Arg Val Asp Asp 565
570 575 Leu Ala Val Asp Leu Val Glu Arg Phe Met
Lys Lys Ile Gln Lys Leu 580 585
590 His Thr Tyr Arg Asp Ala Ile Pro Thr Gln Ser Val Leu Thr Ile
Thr 595 600 605 Ser
Asn Val Val Tyr Gly Lys Lys Thr Gly Asn Thr Pro Asp Gly Arg 610
615 620 Arg Ala Gly Ala Pro Phe
Gly Pro Gly Ala Asn Pro Met His Gly Arg 625 630
635 640 Asp Gln Lys Gly Ala Val Ala Ser Leu Thr Ser
Val Ala Lys Leu Pro 645 650
655 Phe Ala Tyr Ala Lys Asp Gly Ile Ser Tyr Thr Phe Ser Ile Val Pro
660 665 670 Asn Ala
Leu Gly Lys Asp Asp Glu Val Arg Lys Thr Asn Leu Ala Gly 675
680 685 Leu Met Asp Gly Tyr Phe His
His Glu Ala Ser Ile Glu Gly Gly Gln 690 695
700 His Leu Asn Val Asn Val Met Asn Arg Glu Met Leu
Leu Asp Ala Met 705 710 715
720 Glu Asn Pro Glu Lys Tyr Pro Gln Leu Thr Ile Arg Val Ser Gly Tyr
725 730 735 Ala Val Arg
Phe Asn Ser Leu Thr Lys Glu Gln Gln Gln Asp Val Ile 740
745 750 Thr Arg Thr Phe Thr Gln Ser Met
755 760 211716DNAEscherichia coli 21aaacaaacgg
ttgcagctta tatcgccaaa acactcgaat cggcaggggt gaaacgcatc 60tggggagtca
caggcgactc tctgaacggt cttagtgaca gtcttaatcg catgggcacc 120atcgagtgga
tgtccacccg ccacgaagaa gtggcggcct ttgccgctgg cgctgaagca 180caacttagcg
gagaactggc ggtctgcgcc ggatcgtgcg gccccggcaa cctgcactta 240atcaacggcc
tgttcgattg ccaccgcaat cacgttccgg tactggcgat tgccgctcat 300attccctcca
gcgaaattgg cagcggctat ttccaggaaa cccacccaca agagctattc 360cgcgaatgta
gtcactattg cgagctggtt tccagcccgg agcagatccc acaagtactg 420gcgattgcca
tgcgcaaagc ggtgcttaac cgtggcgttt cggttgtcgt gttaccaggc 480gacgtggcgt
taaaacctgc gccagaaggg gcaaccatgc actggtatca tgcgccacaa 540ccagtcgtga
cgccggaaga agaagagtta cgcaaactgg cgcaactgct gcgttattcc 600agcaatatcg
ccctgatgtg tggcagcggc tgcgcggggg cgcataaaga gttagttgag 660tttgccggga
aaattaaagc gcctattgtt catgccctgc gcggtaaaga acatgtcgaa 720tacgataatc
cgtatgatgt tggaatgacc gggttaatcg gcttctcgtc aggtttccat 780accatgatga
acgccgacac gttagtgcta ctcggcacgc aatttcccta ccgcgccttc 840tacccgaccg
atgccaaaat cattcagatt gatatcaacc cagccagcat cggcgctcac 900agcaaggtgg
atatggcact ggtcggcgat atcaagtcga ctctgcgtgc attgcttcca 960ttggtggaag
aaaaagccga tcgcaagttt ctggataaag cgctggaaga ttaccgcgac 1020gcccgcaaag
ggctggacga tttagctaaa ccgagcgaga aagccattca cccgcaatat 1080ctggcgcagc
aaattagtca ttttgccgcc gatgacgcta ttttcacctg tgacgttggt 1140acgccaacgg
tgtgggcggc acgttatcta aaaatgaacg gcaagcgtcg cctgttaggt 1200tcgtttaacc
acggttcgat ggctaacgcc atgccgcagg cgctgggtgc gcaggcgaca 1260gagccagaac
gtcaggtggt cgccatgtgc ggcgatggcg gttttagcat gttgatgggc 1320gatttcctct
cagtagtgca gatgaaactg ccagtgaaaa ttgtcgtctt taacaacagc 1380gtgctgggct
ttgtggcgat ggagatgaaa gctggtggct atttgactga cggcaccgaa 1440ctacacgaca
caaactttgc ccgcattgcc gaagcgtgcg gcattacggg tatccgtgta 1500gaaaaagcgt
ctgaagttga tgaagccctg caacgcgcct tctccatcga cggtccggtg 1560ttggtggatg
tggtggtcgc caaagaagag ttagccattc caccgcagat caaactcgaa 1620caggccaaag
gtttcagcct gtatatgctg cgcgcaatca tcagcggacg cggtgatgaa 1680gtgatcgaac
tggcgaaaac aaactggcta aggtaa
171622572PRTEscherichia coli 22Met Lys Gln Thr Val Ala Ala Tyr Ile Ala
Lys Thr Leu Glu Ser Ala 1 5 10
15 Gly Val Lys Arg Ile Trp Gly Val Thr Gly Asp Ser Leu Asn Gly
Leu 20 25 30 Ser
Asp Ser Leu Asn Arg Met Gly Thr Ile Glu Trp Met Ser Thr Arg 35
40 45 His Glu Glu Val Ala Ala
Phe Ala Ala Gly Ala Glu Ala Gln Leu Ser 50 55
60 Gly Glu Leu Ala Val Cys Ala Gly Ser Cys Gly
Pro Gly Asn Leu His 65 70 75
80 Leu Ile Asn Gly Leu Phe Asp Cys His Arg Asn His Val Pro Val Leu
85 90 95 Ala Ile
Ala Ala His Ile Pro Ser Ser Glu Ile Gly Ser Gly Tyr Phe 100
105 110 Gln Glu Thr His Pro Gln Glu
Leu Phe Arg Glu Cys Ser His Tyr Cys 115 120
125 Glu Leu Val Ser Ser Pro Glu Gln Ile Pro Gln Val
Leu Ala Ile Ala 130 135 140
Met Arg Lys Ala Val Leu Asn Arg Gly Val Ser Val Val Val Leu Pro 145
150 155 160 Gly Asp Val
Ala Leu Lys Pro Ala Pro Glu Gly Ala Thr Met His Trp 165
170 175 Tyr His Ala Pro Gln Pro Val Val
Thr Pro Glu Glu Glu Glu Leu Arg 180 185
190 Lys Leu Ala Gln Leu Leu Arg Tyr Ser Ser Asn Ile Ala
Leu Met Cys 195 200 205
Gly Ser Gly Cys Ala Gly Ala His Lys Glu Leu Val Glu Phe Ala Gly 210
215 220 Lys Ile Lys Ala
Pro Ile Val His Ala Leu Arg Gly Lys Glu His Val 225 230
235 240 Glu Tyr Asp Asn Pro Tyr Asp Val Gly
Met Thr Gly Leu Ile Gly Phe 245 250
255 Ser Ser Gly Phe His Thr Met Met Asn Ala Asp Thr Leu Val
Leu Leu 260 265 270
Gly Thr Gln Phe Pro Tyr Arg Ala Phe Tyr Pro Thr Asp Ala Lys Ile
275 280 285 Ile Gln Ile Asp
Ile Asn Pro Ala Ser Ile Gly Ala His Ser Lys Val 290
295 300 Asp Met Ala Leu Val Gly Asp Ile
Lys Ser Thr Leu Arg Ala Leu Leu 305 310
315 320 Pro Leu Val Glu Glu Lys Ala Asp Arg Lys Phe Leu
Asp Lys Ala Leu 325 330
335 Glu Asp Tyr Arg Asp Ala Arg Lys Gly Leu Asp Asp Leu Ala Lys Pro
340 345 350 Ser Glu Lys
Ala Ile His Pro Gln Tyr Leu Ala Gln Gln Ile Ser His 355
360 365 Phe Ala Ala Asp Asp Ala Ile Phe
Thr Cys Asp Val Gly Thr Pro Thr 370 375
380 Val Trp Ala Ala Arg Tyr Leu Lys Met Asn Gly Lys Arg
Arg Leu Leu 385 390 395
400 Gly Ser Phe Asn His Gly Ser Met Ala Asn Ala Met Pro Gln Ala Leu
405 410 415 Gly Ala Gln Ala
Thr Glu Pro Glu Arg Gln Val Val Ala Met Cys Gly 420
425 430 Asp Gly Gly Phe Ser Met Leu Met Gly
Asp Phe Leu Ser Val Val Gln 435 440
445 Met Lys Leu Pro Val Lys Ile Val Val Phe Asn Asn Ser Val
Leu Gly 450 455 460
Phe Val Ala Met Glu Met Lys Ala Gly Gly Tyr Leu Thr Asp Gly Thr 465
470 475 480 Glu Leu His Asp Thr
Asn Phe Ala Arg Ile Ala Glu Ala Cys Gly Ile 485
490 495 Thr Gly Ile Arg Val Glu Lys Ala Ser Glu
Val Asp Glu Ala Leu Gln 500 505
510 Arg Ala Phe Ser Ile Asp Gly Pro Val Leu Val Asp Val Val Val
Ala 515 520 525 Lys
Glu Glu Leu Ala Ile Pro Pro Gln Ile Lys Leu Glu Gln Ala Lys 530
535 540 Gly Phe Ser Leu Tyr Met
Leu Arg Ala Ile Ile Ser Gly Arg Gly Asp 545 550
555 560 Glu Val Ile Glu Leu Ala Lys Thr Asn Trp Leu
Arg 565 570 232142DNAEscherichia
coli 23tcccgtatta ttatgctgat ccctaccgga accagcgtcg gtctgaccag cgtcagcctt
60ggcgtgatcc gtgcaatgga acgcaaaggc gttcgtctga gcgttttcaa acctatcgct
120cagccgcgta ccggtggcga tgcgcccgat cagactacga ctatcgtgcg tgcgaactct
180tccaccacga cggccgctga accgctgaaa atgagctacg ttgaaggtct gctttccagc
240aatcagaaag atgtgctgat ggaagagatc gtcgcaaact accacgctaa caccaaagac
300gctgaagtcg ttctggttga aggtctggtc ccgacacgta agcaccagtt tgcccagtct
360ctgaactacg aaatcgctaa aacgctgaat gcggaaatcg tcttcgttat gtctcagggc
420actgacaccc cggaacagct gaaagagcgt atcgaactga cccgcaacag cttcggcggt
480gccaaaaaca ccaacatcac cggcgttatc gttaacaaac tgaacgcacc ggttgatgaa
540cagggtcgta ctcgcccgga tctgtccgag attttcgacg actcttccaa agctaaagta
600aacaatgttg atccggcgaa gctgcaagaa tccagcccgc tgccggttct cggcgctgtg
660ccgtggagct ttgacctgat cgcgactcgt gcgatcgata tggctcgcca cctgaatgcg
720accatcatca acgaaggcga catcaatact cgccgcgtta aatccgtcac tttctgcgca
780cgcagcattc cgcacatgct ggagcacttc cgtgccggtt ctctgctggt gacttccgca
840gaccgtcctg acgtgctggt ggccgcttgc ctggcagcca tgaacggcgt agaaatcggt
900gccctgctgc tgactggcgg ttacgaaatg gacgcgcgca tttctaaact gtgcgaacgt
960gctttcgcta ccggcctgcc ggtatttatg gtgaacacca acacctggca gacctctctg
1020agcctgcaga gcttcaacct ggaagttccg gttgacgatc acgaacgtat cgagaaagtt
1080caggaatacg ttgctaacta catcaacgct gactggatcg aatctctgac tgccacttct
1140gagcgcagcc gtcgtctgtc tccgcctgcg ttccgttatc agctgactga acttgcgcgc
1200aaagcgggca aacgtatcgt actgccggaa ggtgacgaac cgcgtaccgt taaagcagcc
1260gctatctgtg ctgaacgtgg tatcgcaact tgcgtactgc tgggtaatcc ggcagagatc
1320aaccgtgttg cagcgtctca gggtgtagaa ctgggtgcag ggattgaaat cgttgatcca
1380gaagtggttc gcgaaagcta tgttggtcgt ctggtcgaac tgcgtaagaa caaaggcatg
1440accgaaaccg ttgcccgcga acagctggaa gacaacgtgg tgctcggtac gctgatgctg
1500gaacaggatg aagttgatgg tctggtttcc ggtgctgttc acactaccgc aaacaccatc
1560cgtccgccgc tgcagctgat caaaactgca ccgggcagct ccctggtatc ttccgtgttc
1620ttcatgctgc tgccggaaca ggtttacgtt tacggtgact gtgcgatcaa cccggatccg
1680accgctgaac agctggcaga aatcgcgatt cagtccgctg attccgctgc ggccttcggt
1740atcgaaccgc gcgttgctat gctctcctac tccaccggta cttctggtgc aggtagcgac
1800gtagaaaaag ttcgcgaagc aactcgtctg gcgcaggaaa aacgtcctga cctgatgatc
1860gacggtccgc tgcagtacga cgctgcggta atggctgacg ttgcgaaatc caaagcgccg
1920aactctccgg ttgcaggtcg cgctaccgtg ttcatcttcc cggatctgaa caccggtaac
1980accacctaca aagcggtaca gcgttctgcc gacctgatct ccatcgggcc gatgctgcag
2040ggtatgcgca agccggttaa cgacctgtcc cgtggcgcac tggttgacga tatcgtctac
2100accatcgcgc tgactgcgat tcagtctgca cagcagcagt aa
214224714PRTEscherichia coli 24Met Ser Arg Ile Ile Met Leu Ile Pro Thr
Gly Thr Ser Val Gly Leu 1 5 10
15 Thr Ser Val Ser Leu Gly Val Ile Arg Ala Met Glu Arg Lys Gly
Val 20 25 30 Arg
Leu Ser Val Phe Lys Pro Ile Ala Gln Pro Arg Thr Gly Gly Asp 35
40 45 Ala Pro Asp Gln Thr Thr
Thr Ile Val Arg Ala Asn Ser Ser Thr Thr 50 55
60 Thr Ala Ala Glu Pro Leu Lys Met Ser Tyr Val
Glu Gly Leu Leu Ser 65 70 75
80 Ser Asn Gln Lys Asp Val Leu Met Glu Glu Ile Val Ala Asn Tyr His
85 90 95 Ala Asn
Thr Lys Asp Ala Glu Val Val Leu Val Glu Gly Leu Val Pro 100
105 110 Thr Arg Lys His Gln Phe Ala
Gln Ser Leu Asn Tyr Glu Ile Ala Lys 115 120
125 Thr Leu Asn Ala Glu Ile Val Phe Val Met Ser Gln
Gly Thr Asp Thr 130 135 140
Pro Glu Gln Leu Lys Glu Arg Ile Glu Leu Thr Arg Asn Ser Phe Gly 145
150 155 160 Gly Ala Lys
Asn Thr Asn Ile Thr Gly Val Ile Val Asn Lys Leu Asn 165
170 175 Ala Pro Val Asp Glu Gln Gly Arg
Thr Arg Pro Asp Leu Ser Glu Ile 180 185
190 Phe Asp Asp Ser Ser Lys Ala Lys Val Asn Asn Val Asp
Pro Ala Lys 195 200 205
Leu Gln Glu Ser Ser Pro Leu Pro Val Leu Gly Ala Val Pro Trp Ser 210
215 220 Phe Asp Leu Ile
Ala Thr Arg Ala Ile Asp Met Ala Arg His Leu Asn 225 230
235 240 Ala Thr Ile Ile Asn Glu Gly Asp Ile
Asn Thr Arg Arg Val Lys Ser 245 250
255 Val Thr Phe Cys Ala Arg Ser Ile Pro His Met Leu Glu His
Phe Arg 260 265 270
Ala Gly Ser Leu Leu Val Thr Ser Ala Asp Arg Pro Asp Val Leu Val
275 280 285 Ala Ala Cys Leu
Ala Ala Met Asn Gly Val Glu Ile Gly Ala Leu Leu 290
295 300 Leu Thr Gly Gly Tyr Glu Met Asp
Ala Arg Ile Ser Lys Leu Cys Glu 305 310
315 320 Arg Ala Phe Ala Thr Gly Leu Pro Val Phe Met Val
Asn Thr Asn Thr 325 330
335 Trp Gln Thr Ser Leu Ser Leu Gln Ser Phe Asn Leu Glu Val Pro Val
340 345 350 Asp Asp His
Glu Arg Ile Glu Lys Val Gln Glu Tyr Val Ala Asn Tyr 355
360 365 Ile Asn Ala Asp Trp Ile Glu Ser
Leu Thr Ala Thr Ser Glu Arg Ser 370 375
380 Arg Arg Leu Ser Pro Pro Ala Phe Arg Tyr Gln Leu Thr
Glu Leu Ala 385 390 395
400 Arg Lys Ala Gly Lys Arg Ile Val Leu Pro Glu Gly Asp Glu Pro Arg
405 410 415 Thr Val Lys Ala
Ala Ala Ile Cys Ala Glu Arg Gly Ile Ala Thr Cys 420
425 430 Val Leu Leu Gly Asn Pro Ala Glu Ile
Asn Arg Val Ala Ala Ser Gln 435 440
445 Gly Val Glu Leu Gly Ala Gly Ile Glu Ile Val Asp Pro Glu
Val Val 450 455 460
Arg Glu Ser Tyr Val Gly Arg Leu Val Glu Leu Arg Lys Asn Lys Gly 465
470 475 480 Met Thr Glu Thr Val
Ala Arg Glu Gln Leu Glu Asp Asn Val Val Leu 485
490 495 Gly Thr Leu Met Leu Glu Gln Asp Glu Val
Asp Gly Leu Val Ser Gly 500 505
510 Ala Val His Thr Thr Ala Asn Thr Ile Arg Pro Pro Leu Gln Leu
Ile 515 520 525 Lys
Thr Ala Pro Gly Ser Ser Leu Val Ser Ser Val Phe Phe Met Leu 530
535 540 Leu Pro Glu Gln Val Tyr
Val Tyr Gly Asp Cys Ala Ile Asn Pro Asp 545 550
555 560 Pro Thr Ala Glu Gln Leu Ala Glu Ile Ala Ile
Gln Ser Ala Asp Ser 565 570
575 Ala Ala Ala Phe Gly Ile Glu Pro Arg Val Ala Met Leu Ser Tyr Ser
580 585 590 Thr Gly
Thr Ser Gly Ala Gly Ser Asp Val Glu Lys Val Arg Glu Ala 595
600 605 Thr Arg Leu Ala Gln Glu Lys
Arg Pro Asp Leu Met Ile Asp Gly Pro 610 615
620 Leu Gln Tyr Asp Ala Ala Val Met Ala Asp Val Ala
Lys Ser Lys Ala 625 630 635
640 Pro Asn Ser Pro Val Ala Gly Arg Ala Thr Val Phe Ile Phe Pro Asp
645 650 655 Leu Asn Thr
Gly Asn Thr Thr Tyr Lys Ala Val Gln Arg Ser Ala Asp 660
665 670 Leu Ile Ser Ile Gly Pro Met Leu
Gln Gly Met Arg Lys Pro Val Asn 675 680
685 Asp Leu Ser Arg Gly Ala Leu Val Asp Asp Ile Val Tyr
Thr Ile Ala 690 695 700
Leu Thr Ala Ile Gln Ser Ala Gln Gln Gln 705 710
25255DNAEscherichia coli 25ttccagcaag aagttaccat taccgctccg
aacggtctgc acacccgccc tgctgcccag 60tttgtaaaag aagctaaggg cttcacttct
gaaattactg tgacttccaa cggcaaaagc 120gccagcgcga aaagcctgtt taaactgcag
actctgggcc tgactcaagg taccgttgtg 180actatctccg cagaaggcga agacgagcag
aaagcggttg aacatctggt taaactgatg 240gcggaactcg agtaa
2552685PRTEscherichia coli 26Met Phe
Gln Gln Glu Val Thr Ile Thr Ala Pro Asn Gly Leu His Thr 1 5
10 15 Arg Pro Ala Ala Gln Phe Val
Lys Glu Ala Lys Gly Phe Thr Ser Glu 20 25
30 Ile Thr Val Thr Ser Asn Gly Lys Ser Ala Ser Ala
Lys Ser Leu Phe 35 40 45
Lys Leu Gln Thr Leu Gly Leu Thr Gln Gly Thr Val Val Thr Ile Ser
50 55 60 Ala Glu Gly
Glu Asp Glu Gln Lys Ala Val Glu His Leu Val Lys Leu 65
70 75 80 Met Ala Glu Leu Glu
85 271725DNAEscherichia coli 27atttcaggca ttttagcatc cccgggtatc
gctttcggta aagctctgct tctgaaagaa 60gacgaaattg tcattgaccg gaaaaaaatt
tctgccgacc aggttgatca ggaagttgaa 120cgttttctga gcggtcgtgc caaggcatca
gcccagctgg aaacgatcaa aacgaaagct 180ggtgaaacgt tcggtgaaga aaaagaagcc
atctttgaag ggcatattat gctgctcgaa 240gatgaggagc tggagcagga aatcatagcc
ctgattaaag ataagcacat gacagctgac 300gcagctgctc atgaagttat cgaaggtcag
gcttctgccc tggaagagct ggatgatgaa 360tacctgaaag aacgtgcggc tgacgtacgt
gatatcggta agcgcctgct gcgcaacatc 420ctgggcctga agattatcga cctgagcgcc
attcaggatg aagtcattct ggttgccgct 480gacctgacgc cgtccgaaac cgcacagctg
aacctgaaga aggtgctggg tttcatcacc 540gacgcgggtg gccgtacttc ccacacctct
atcatggcgc gttctctgga actacctgct 600atcgtgggta ccggtagcgt cacctctcag
gtgaaaaatg acgactatct gattctggat 660gccgtaaata atcaggttta cgtcaatcca
accaacgaag ttattgataa aatgcgcgct 720gttcaggagc aagtggcttc tgaaaaagca
gagcttgcta aactgaaaga tctgccagct 780attacgctgg acggtcacca ggtagaagta
tgcgctaaca ttggtacggt tcgtgacgtt 840gaaggtgcag agcgtaacgg cgctgaaggc
gttggtctgt atcgtactga gttcctgttc 900atggaccgcg acgcactgcc cactgaagaa
gaacagtttg ctgcttacaa agcagtggct 960gaagcgtgtg gctcgcaagc ggttatcgtt
cgtaccatgg acatcggcgg cgacaaagag 1020ctgccataca tgaacttccc gaaagaagag
aacccgttcc tcggctggcg cgctatccgt 1080atcgcgatgg atcgtagaga gatcctgcgc
gatcagctcc gcgctatcct gcgtgcctcg 1140gctttcggta aattgcgcat tatgttcccg
atgatcatct ctgttgaaga agtgcgtgca 1200ctgcgcaaag agatcgaaat ctacaaacag
gaactgcgcg acgaaggtaa agcgtttgac 1260gagtcaattg aaatcggcgt aatggtggaa
acaccggctg ccgcaacaat tgcacgtcat 1320ttagccaaag aagttgattt ctttagtatc
ggcaccaatg atttaacgca gtacactctg 1380gcagttgacc gtggtaatga tatgatttca
cacctttacc agccaatgtc accgtccgtg 1440ctgaacttga tcaagcaagt tattgatgct
tctcatgctg aaggcaaatg gactggcatg 1500tgtggtgagc ttgctggcga tgaacgtgct
acacttctgt tgctggggat gggtctggac 1560gaattctcta tgagcgccat ttctatcccg
cgcattaaga agattatccg taacacgaac 1620ttcgaagatg cgaaggtgtt agcagagcag
gctcttgctc aaccgacaac ggacgagtta 1680atgacgctgg ttaacaagtt cattgaagaa
aaaacaatct gctaa 172528575PRTEscherichia coli 28Met Ile
Ser Gly Ile Leu Ala Ser Pro Gly Ile Ala Phe Gly Lys Ala 1 5
10 15 Leu Leu Leu Lys Glu Asp Glu
Ile Val Ile Asp Arg Lys Lys Ile Ser 20 25
30 Ala Asp Gln Val Asp Gln Glu Val Glu Arg Phe Leu
Ser Gly Arg Ala 35 40 45
Lys Ala Ser Ala Gln Leu Glu Thr Ile Lys Thr Lys Ala Gly Glu Thr
50 55 60 Phe Gly Glu
Glu Lys Glu Ala Ile Phe Glu Gly His Ile Met Leu Leu 65
70 75 80 Glu Asp Glu Glu Leu Glu Gln
Glu Ile Ile Ala Leu Ile Lys Asp Lys 85
90 95 His Met Thr Ala Asp Ala Ala Ala His Glu Val
Ile Glu Gly Gln Ala 100 105
110 Ser Ala Leu Glu Glu Leu Asp Asp Glu Tyr Leu Lys Glu Arg Ala
Ala 115 120 125 Asp
Val Arg Asp Ile Gly Lys Arg Leu Leu Arg Asn Ile Leu Gly Leu 130
135 140 Lys Ile Ile Asp Leu Ser
Ala Ile Gln Asp Glu Val Ile Leu Val Ala 145 150
155 160 Ala Asp Leu Thr Pro Ser Glu Thr Ala Gln Leu
Asn Leu Lys Lys Val 165 170
175 Leu Gly Phe Ile Thr Asp Ala Gly Gly Arg Thr Ser His Thr Ser Ile
180 185 190 Met Ala
Arg Ser Leu Glu Leu Pro Ala Ile Val Gly Thr Gly Ser Val 195
200 205 Thr Ser Gln Val Lys Asn Asp
Asp Tyr Leu Ile Leu Asp Ala Val Asn 210 215
220 Asn Gln Val Tyr Val Asn Pro Thr Asn Glu Val Ile
Asp Lys Met Arg 225 230 235
240 Ala Val Gln Glu Gln Val Ala Ser Glu Lys Ala Glu Leu Ala Lys Leu
245 250 255 Lys Asp Leu
Pro Ala Ile Thr Leu Asp Gly His Gln Val Glu Val Cys 260
265 270 Ala Asn Ile Gly Thr Val Arg Asp
Val Glu Gly Ala Glu Arg Asn Gly 275 280
285 Ala Glu Gly Val Gly Leu Tyr Arg Thr Glu Phe Leu Phe
Met Asp Arg 290 295 300
Asp Ala Leu Pro Thr Glu Glu Glu Gln Phe Ala Ala Tyr Lys Ala Val 305
310 315 320 Ala Glu Ala Cys
Gly Ser Gln Ala Val Ile Val Arg Thr Met Asp Ile 325
330 335 Gly Gly Asp Lys Glu Leu Pro Tyr Met
Asn Phe Pro Lys Glu Glu Asn 340 345
350 Pro Phe Leu Gly Trp Arg Ala Ile Arg Ile Ala Met Asp Arg
Arg Glu 355 360 365
Ile Leu Arg Asp Gln Leu Arg Ala Ile Leu Arg Ala Ser Ala Phe Gly 370
375 380 Lys Leu Arg Ile Met
Phe Pro Met Ile Ile Ser Val Glu Glu Val Arg 385 390
395 400 Ala Leu Arg Lys Glu Ile Glu Ile Tyr Lys
Gln Glu Leu Arg Asp Glu 405 410
415 Gly Lys Ala Phe Asp Glu Ser Ile Glu Ile Gly Val Met Val Glu
Thr 420 425 430 Pro
Ala Ala Ala Thr Ile Ala Arg His Leu Ala Lys Glu Val Asp Phe 435
440 445 Phe Ser Ile Gly Thr Asn
Asp Leu Thr Gln Tyr Thr Leu Ala Val Asp 450 455
460 Arg Gly Asn Asp Met Ile Ser His Leu Tyr Gln
Pro Met Ser Pro Ser 465 470 475
480 Val Leu Asn Leu Ile Lys Gln Val Ile Asp Ala Ser His Ala Glu Gly
485 490 495 Lys Trp
Thr Gly Met Cys Gly Glu Leu Ala Gly Asp Glu Arg Ala Thr 500
505 510 Leu Leu Leu Leu Gly Met Gly
Leu Asp Glu Phe Ser Met Ser Ala Ile 515 520
525 Ser Ile Pro Arg Ile Lys Lys Ile Ile Arg Asn Thr
Asn Phe Glu Asp 530 535 540
Ala Lys Val Leu Ala Glu Gln Ala Leu Ala Gln Pro Thr Thr Asp Glu 545
550 555 560 Leu Met Thr
Leu Val Asn Lys Phe Ile Glu Glu Lys Thr Ile Cys 565
570 575 29507DNAEscherichia coli 29 ggtttgttcg
ataaactgaa atctctggtt tccgacgaca agaaggatac cggaactatt 60gagatcattg
ctccgctctc tggcgagatc gtcaatatcg aagacgtgcc ggatgtcgtt 120tttgcggaaa
aaatcgttgg tgatggtatt gctatcaaac caacgggtaa caaaatggtc 180gcgccagtag
acggcaccat tggtaaaatc tttgaaacca accacgcatt ctctatcgaa 240tctgatagcg
gcgttgaact gttcgtccac ttcggtatcg acaccgttga actgaaaggc 300gaaggcttca
agcgtattgc tgaagaaggt cagcgcgtga aagttggcga tactgtcatt 360gaatttgatc
tgccgctgct ggaagagaaa gccaagtcta ccctgactcc ggttgttatc 420tccaacatgg
acgaaatcaa agaactgatc aaactgtccg gtagcgtaac cgtgggtgaa 480accccggtta
tccgcatcaa gaagtaa
50730169PRTEscherichia coli 30Met Gly Leu Phe Asp Lys Leu Lys Ser Leu Val
Ser Asp Asp Lys Lys 1 5 10
15 Asp Thr Gly Thr Ile Glu Ile Ile Ala Pro Leu Ser Gly Glu Ile Val
20 25 30 Asn Ile
Glu Asp Val Pro Asp Val Val Phe Ala Glu Lys Ile Val Gly 35
40 45 Asp Gly Ile Ala Ile Lys Pro
Thr Gly Asn Lys Met Val Ala Pro Val 50 55
60 Asp Gly Thr Ile Gly Lys Ile Phe Glu Thr Asn His
Ala Phe Ser Ile 65 70 75
80 Glu Ser Asp Ser Gly Val Glu Leu Phe Val His Phe Gly Ile Asp Thr
85 90 95 Val Glu Leu
Lys Gly Glu Gly Phe Lys Arg Ile Ala Glu Glu Gly Gln 100
105 110 Arg Val Lys Val Gly Asp Thr Val
Ile Glu Phe Asp Leu Pro Leu Leu 115 120
125 Glu Glu Lys Ala Lys Ser Thr Leu Thr Pro Val Val Ile
Ser Asn Met 130 135 140
Asp Glu Ile Lys Glu Leu Ile Lys Leu Ser Gly Ser Val Thr Val Gly 145
150 155 160 Glu Thr Pro Val
Ile Arg Ile Lys Lys 165
311443DNAEscherichia coli 31atgtccagaa ggcttcgcag aacaaaaatc gttaccacgt
taggcccagc aacagatcgc 60gataataatc ttgaaaaagt tatcgcggcg ggtgccaacg
ttgtacgtat gaacttttct 120cacggctcgc ctgaagatca caaaatgcgc gcggataaag
ttcgtgagat tgccgcaaaa 180ctggggcgtc atgtggctat tctgggtgac ctccaggggc
ccaaaatccg tgtatccacc 240tttaaagaag gcaaagtttt cctcaatatt ggggataaat
tcctgctcga cgccaacctg 300ggtaaaggtg aaggcgacaa agaaaaagtc ggtatcgact
acaaaggcct gcctgctgac 360gtcgtgcctg gtgacatcct gctgctggac gatggtcgcg
tccagttaaa agtactggaa 420gttcagggca tgaaagtgtt caccgaagtc accgtcggtg
gtcccctctc caacaataaa 480ggtatcaaca aacttggcgg cggtttgtcg gctgaagcgc
tgaccgaaaa agacaaagca 540gacattaaga ctgcggcgtt gattggcgta gattacctgg
ctgtctcctt cccacgctgt 600ggcgaagatc tgaactatgc ccgtcgcctg gcacgcgatg
caggatgtga tgcgaaaatt 660gttgccaagg ttgaacgtgc ggaagccgtt tgcagccagg
atgcaatgga tgacatcatc 720ctcgcctctg acgtggtaat ggttgcacgt ggcgacctcg
gtgtggaaat tggcgacccg 780gaactggtcg gcattcagaa agcgttgatc cgtcgtgcgc
gtcagctaaa ccgagcggta 840atcacggcga cccagatgat ggagtcaatg attactaacc
cgatgccgac gcgtgcagaa 900gtcatggacg tagcaaacgc cgttctggat ggtactgacg
ctgtgatgct gtctgcagaa 960actgccgctg ggcagtatcc gtcagaaacc gttgcagcca
tggcgcgcgt ttgcctgggt 1020gcggaaaaaa tcccgagcat caacgtttct aaacaccgtc
tggacgttca gttcgacaat 1080gtggaagaag ctattgccat gtcagcaatg tacgcagcta
accacctgaa aggcgttacg 1140gcgatcatca ccatgaccga atcgggtcgt accgcgctga
tgacctcccg tatcagctct 1200ggtctgccaa ttttcgccat gtcgcgccat gaacgtacgc
tgaacctgac tgctctctat 1260cgtggcgtta cgccggtgca ctttgatagc gctaatgacg
gcgtagcagc tgccagcgaa 1320gcggttaatc tgctgcgcga taaaggttac ttgatgtctg
gtgacctggt gattgtcacc 1380cagggcgacg tgatgagtac cgtgggttct actaatacca
cgcgtatttt aacggtagag 1440taa
144332480PRTEscherichia coli 32Met Ser Arg Arg Leu
Arg Arg Thr Lys Ile Val Thr Thr Leu Gly Pro 1 5
10 15 Ala Thr Asp Arg Asp Asn Asn Leu Glu Lys
Val Ile Ala Ala Gly Ala 20 25
30 Asn Val Val Arg Met Asn Phe Ser His Gly Ser Pro Glu Asp His
Lys 35 40 45 Met
Arg Ala Asp Lys Val Arg Glu Ile Ala Ala Lys Leu Gly Arg His 50
55 60 Val Ala Ile Leu Gly Asp
Leu Gln Gly Pro Lys Ile Arg Val Ser Thr 65 70
75 80 Phe Lys Glu Gly Lys Val Phe Leu Asn Ile Gly
Asp Lys Phe Leu Leu 85 90
95 Asp Ala Asn Leu Gly Lys Gly Glu Gly Asp Lys Glu Lys Val Gly Ile
100 105 110 Asp Tyr
Lys Gly Leu Pro Ala Asp Val Val Pro Gly Asp Ile Leu Leu 115
120 125 Leu Asp Asp Gly Arg Val Gln
Leu Lys Val Leu Glu Val Gln Gly Met 130 135
140 Lys Val Phe Thr Glu Val Thr Val Gly Gly Pro Leu
Ser Asn Asn Lys 145 150 155
160 Gly Ile Asn Lys Leu Gly Gly Gly Leu Ser Ala Glu Ala Leu Thr Glu
165 170 175 Lys Asp Lys
Ala Asp Ile Lys Thr Ala Ala Leu Ile Gly Val Asp Tyr 180
185 190 Leu Ala Val Ser Phe Pro Arg Cys
Gly Glu Asp Leu Asn Tyr Ala Arg 195 200
205 Arg Leu Ala Arg Asp Ala Gly Cys Asp Ala Lys Ile Val
Ala Lys Val 210 215 220
Glu Arg Ala Glu Ala Val Cys Ser Gln Asp Ala Met Asp Asp Ile Ile 225
230 235 240 Leu Ala Ser Asp
Val Val Met Val Ala Arg Gly Asp Leu Gly Val Glu 245
250 255 Ile Gly Asp Pro Glu Leu Val Gly Ile
Gln Lys Ala Leu Ile Arg Arg 260 265
270 Ala Arg Gln Leu Asn Arg Ala Val Ile Thr Ala Thr Gln Met
Met Glu 275 280 285
Ser Met Ile Thr Asn Pro Met Pro Thr Arg Ala Glu Val Met Asp Val 290
295 300 Ala Asn Ala Val Leu
Asp Gly Thr Asp Ala Val Met Leu Ser Ala Glu 305 310
315 320 Thr Ala Ala Gly Gln Tyr Pro Ser Glu Thr
Val Ala Ala Met Ala Arg 325 330
335 Val Cys Leu Gly Ala Glu Lys Ile Pro Ser Ile Asn Val Ser Lys
His 340 345 350 Arg
Leu Asp Val Gln Phe Asp Asn Val Glu Glu Ala Ile Ala Met Ser 355
360 365 Ala Met Tyr Ala Ala Asn
His Leu Lys Gly Val Thr Ala Ile Ile Thr 370 375
380 Met Thr Glu Ser Gly Arg Thr Ala Leu Met Thr
Ser Arg Ile Ser Ser 385 390 395
400 Gly Leu Pro Ile Phe Ala Met Ser Arg His Glu Arg Thr Leu Asn Leu
405 410 415 Thr Ala
Leu Tyr Arg Gly Val Thr Pro Val His Phe Asp Ser Ala Asn 420
425 430 Asp Gly Val Ala Ala Ala Ser
Glu Ala Val Asn Leu Leu Arg Asp Lys 435 440
445 Gly Tyr Leu Met Ser Gly Asp Leu Val Ile Val Thr
Gln Gly Asp Val 450 455 460
Met Ser Thr Val Gly Ser Thr Asn Thr Thr Arg Ile Leu Thr Val Glu 465
470 475 480
331410DNAEscherichia coli 33aaaaagacca aaattgtttg caccatcgga ccgaaaaccg
aatctgaaga gatgttagct 60aaaatgctgg acgctggcat gaacgttatg cgtctgaact
tctctcatgg tgactatgca 120gaacacggtc agcgcattca gaatctgcgc aacgtgatga
gcaaaactgg taaaaccgcc 180gctatcctgc ttgataccaa aggtccggaa atccgcacca
tgaaactgga aggcggtaac 240gacgtttctc tgaaagctgg tcagaccttt actttcacca
ctgataaatc tgttatcggc 300aacagcgaaa tggttgcggt aacgtatgaa ggtttcacta
ctgacctgtc tgttggcaac 360accgtactgg ttgacgatgg tctgatcggt atggaagtta
ccgccattga aggtaacaaa 420gttatctgta aagtgctgaa caacggtgac ctgggcgaaa
acaaaggtgt gaacctgcct 480ggcgtttcca ttgctctgcc agcactggct gaaaaagaca
aacaggacct gatctttggt 540tgcgaacaag gcgtagactt tgttgctgct tcctttattc
gtaagcgttc tgacgttatc 600gaaatccgtg agcacctgaa agcgcacggc ggcgaaaaca
tccacatcat ctccaaaatc 660gaaaaccagg aaggcctcaa caacttcgac gaaatcctcg
aagcctctga cggcatcatg 720gttgcgcgtg gcgacctggg tgtagaaatc ccggtagaag
aagttatctt cgcccagaag 780atgatgatcg aaaaatgtat ccgtgcacgt aaagtcgtta
tcactgcgac ccagatgctg 840gattccatga tcaaaaaccc acgcccgact cgcgcagaag
ccggtgacgt tgcaaacgcc 900atcctcgacg gtactgacgc agtgatgctg tctggtgaat
ccgcaaaagg taaatacccg 960ctggaagcgg tttctatcat ggcgaccatc tgcgaacgta
ccgaccgcgt gatgaacagc 1020cgtctcgagt tcaacaatga caaccgtaaa ctgcgcatta
ccgaagcggt atgccgtggt 1080gccgttgaaa ctgctgaaaa actggatgct ccgctgatcg
tggttgctac tcagggcggt 1140aaatctgctc gcgcagtacg taaatacttc ccggatgcca
ccatcctggc actgaccacc 1200aacgaaaaaa cggctcatca gttggtactg agcaaaggcg
ttgtgccgca gcttgttaaa 1260gagatcactt ctactgatga tttctaccgt ctgggtaaag
aactggctct gcagagcggt 1320ctggcacaca aaggtgacgt tgtagttatg gtttctggtg
cactggtacc gagcggcact 1380actaacaccg catctgttca cgtcctgtaa
141034470PRTEscherichia coli 34Met Lys Lys Thr Lys
Ile Val Cys Thr Ile Gly Pro Lys Thr Glu Ser 1 5
10 15 Glu Glu Met Leu Ala Lys Met Leu Asp Ala
Gly Met Asn Val Met Arg 20 25
30 Leu Asn Phe Ser His Gly Asp Tyr Ala Glu His Gly Gln Arg Ile
Gln 35 40 45 Asn
Leu Arg Asn Val Met Ser Lys Thr Gly Lys Thr Ala Ala Ile Leu 50
55 60 Leu Asp Thr Lys Gly Pro
Glu Ile Arg Thr Met Lys Leu Glu Gly Gly 65 70
75 80 Asn Asp Val Ser Leu Lys Ala Gly Gln Thr Phe
Thr Phe Thr Thr Asp 85 90
95 Lys Ser Val Ile Gly Asn Ser Glu Met Val Ala Val Thr Tyr Glu Gly
100 105 110 Phe Thr
Thr Asp Leu Ser Val Gly Asn Thr Val Leu Val Asp Asp Gly 115
120 125 Leu Ile Gly Met Glu Val Thr
Ala Ile Glu Gly Asn Lys Val Ile Cys 130 135
140 Lys Val Leu Asn Asn Gly Asp Leu Gly Glu Asn Lys
Gly Val Asn Leu 145 150 155
160 Pro Gly Val Ser Ile Ala Leu Pro Ala Leu Ala Glu Lys Asp Lys Gln
165 170 175 Asp Leu Ile
Phe Gly Cys Glu Gln Gly Val Asp Phe Val Ala Ala Ser 180
185 190 Phe Ile Arg Lys Arg Ser Asp Val
Ile Glu Ile Arg Glu His Leu Lys 195 200
205 Ala His Gly Gly Glu Asn Ile His Ile Ile Ser Lys Ile
Glu Asn Gln 210 215 220
Glu Gly Leu Asn Asn Phe Asp Glu Ile Leu Glu Ala Ser Asp Gly Ile 225
230 235 240 Met Val Ala Arg
Gly Asp Leu Gly Val Glu Ile Pro Val Glu Glu Val 245
250 255 Ile Phe Ala Gln Lys Met Met Ile Glu
Lys Cys Ile Arg Ala Arg Lys 260 265
270 Val Val Ile Thr Ala Thr Gln Met Leu Asp Ser Met Ile Lys
Asn Pro 275 280 285
Arg Pro Thr Arg Ala Glu Ala Gly Asp Val Ala Asn Ala Ile Leu Asp 290
295 300 Gly Thr Asp Ala Val
Met Leu Ser Gly Glu Ser Ala Lys Gly Lys Tyr 305 310
315 320 Pro Leu Glu Ala Val Ser Ile Met Ala Thr
Ile Cys Glu Arg Thr Asp 325 330
335 Arg Val Met Asn Ser Arg Leu Glu Phe Asn Asn Asp Asn Arg Lys
Leu 340 345 350 Arg
Ile Thr Glu Ala Val Cys Arg Gly Ala Val Glu Thr Ala Glu Lys 355
360 365 Leu Asp Ala Pro Leu Ile
Val Val Ala Thr Gln Gly Gly Lys Ser Ala 370 375
380 Arg Ala Val Arg Lys Tyr Phe Pro Asp Ala Thr
Ile Leu Ala Leu Thr 385 390 395
400 Thr Asn Glu Lys Thr Ala His Gln Leu Val Leu Ser Lys Gly Val Val
405 410 415 Pro Gln
Leu Val Lys Glu Ile Thr Ser Thr Asp Asp Phe Tyr Arg Leu 420
425 430 Gly Lys Glu Leu Ala Leu Gln
Ser Gly Leu Ala His Lys Gly Asp Val 435 440
445 Val Val Met Val Ser Gly Ala Leu Val Pro Ser Gly
Thr Thr Asn Thr 450 455 460
Ala Ser Val His Val Leu 465 470
35569PRTZymomonasmobilis 35Met Ser Tyr Thr Val Gly Thr Tyr Leu Ala Glu
Arg Leu Val Gln Ile 1 5 10
15 Gly Leu Lys His His Phe Ala Val Ala Gly Asp Tyr Asn Leu Val Leu
20 25 30 Leu Asp
Asn Leu Leu Leu Asn Lys Asn Met Glu Gln Val Tyr Cys Cys 35
40 45 Asn Glu Leu Asn Cys Gly Phe
Ser Ala Glu Gly Tyr Ala Arg Ala Lys 50 55
60 Gly Ala Ala Ala Ala Val Val Thr Tyr Ser Val Gly
Ala Leu Ser Ala 65 70 75
80 Phe Asp Ala Ile Gly Gly Ala Tyr Ala Glu Asn Leu Pro Val Ile Leu
85 90 95 Ile Ser Gly
Ala Pro Asn Asn Asn Asp His Ala Ala Gly His Val Leu 100
105 110 His His Ala Leu Gly Lys Thr Asp
Tyr His Tyr Gln Leu Glu Met Ala 115 120
125 Lys Asn Ile Thr Ala Ala Ala Glu Ala Ile Tyr Thr Pro
Glu Glu Ala 130 135 140
Pro Ala Lys Ile Asp His Val Ile Lys Thr Ala Leu Arg Glu Lys Lys 145
150 155 160 Pro Val Tyr Leu
Glu Ile Ala Cys Asn Ile Ala Ser Met Pro Cys Ala 165
170 175 Ala Pro Gly Pro Ala Ser Ala Leu Phe
Asn Asp Glu Ala Ser Asp Glu 180 185
190 Ala Ser Leu Asn Ala Ala Val Glu Glu Thr Leu Lys Phe Ile
Ala Asn 195 200 205
Arg Asp Lys Val Ala Val Leu Val Gly Ser Lys Leu Arg Ala Ala Gly 210
215 220 Ala Glu Glu Ala Ala
Val Lys Phe Ala Asp Ala Leu Gly Gly Ala Val 225 230
235 240 Ala Thr Met Ala Ala Ala Lys Ser Phe Phe
Pro Glu Glu Asn Pro His 245 250
255 Tyr Ile Gly Thr Ser Trp Gly Glu Val Ser Tyr Pro Gly Val Glu
Lys 260 265 270 Thr
Met Lys Glu Ala Asp Ala Val Ile Ala Leu Ala Pro Val Phe Asn 275
280 285 Asp Tyr Ser Thr Thr Gly
Trp Thr Asp Ile Pro Asp Pro Lys Lys Leu 290 295
300 Val Leu Ala Glu Pro Arg Ser Val Val Val Asn
Gly Ile Arg Phe Pro 305 310 315
320 Ser Val His Leu Lys Asp Tyr Leu Thr Arg Leu Ala Gln Lys Val Ser
325 330 335 Lys Lys
Thr Gly Ala Leu Asp Phe Phe Lys Ser Leu Asn Ala Gly Glu 340
345 350 Leu Lys Lys Ala Ala Pro Ala
Asp Pro Ser Ala Pro Leu Val Asn Ala 355 360
365 Glu Ile Ala Arg Gln Val Glu Ala Leu Leu Thr Pro
Asn Thr Thr Val 370 375 380
Ile Ala Glu Thr Gly Asp Ser Trp Phe Asn Ala Gln Arg Met Lys Leu 385
390 395 400 Pro Asn Gly
Ala Arg Val Glu Tyr Glu Met Gln Trp Gly His Ile Gly 405
410 415 Trp Ser Val Pro Ala Ala Phe Gly
Tyr Ala Val Gly Ala Pro Glu Arg 420 425
430 Arg Asn Ile Leu Met Val Gly Asp Gly Ser Phe Gln Leu
Thr Ala Gln 435 440 445
Glu Val Ala Gln Met Val Arg Leu Lys Leu Pro Val Ile Ile Phe Leu 450
455 460 Ile Asn Asn Tyr
Gly Tyr Thr Ile Glu Val Met Ile His Asp Gly Pro 465 470
475 480 Tyr Asn Asn Ile Lys Asn Trp Asp Tyr
Ala Gly Leu Met Glu Val Phe 485 490
495 Asn Gly Asn Gly Gly Tyr Asp Ser Gly Ala Gly Lys Gly Leu
Lys Ala 500 505 510
Lys Thr Gly Gly Glu Leu Ala Glu Ala Ile Lys Val Ala Leu Ala Asn
515 520 525 Thr Asp Gly Pro
Thr Leu Ile Glu Cys Phe Ile Gly Arg Glu Asp Cys 530
535 540 Thr Glu Glu Leu Val Lys Trp Gly
Lys Arg Val Ala Ala Ala Asn Ser 545 550
555 560 Arg Lys Pro Val Asn Lys Leu Leu Gly
565 36541PRTLeuconostoc mesenteroides 36Met Thr Asp Thr
Leu Thr Phe Asn Thr Lys His Leu Leu Glu Ala Leu 1 5
10 15 Phe Glu Ser Gly Ile Arg His Phe Ile
Val Ser Pro Gly Ser Arg Ser 20 25
30 Thr Pro Ile Ala Leu Leu Leu Ala Glu Tyr Ala Glu Gln Asn
Asn Glu 35 40 45
Ile Lys Leu Phe Val Asp Val Asp Glu Arg Ser Ala Gly Phe Phe Ala 50
55 60 Leu Gly Ile Ala Lys
Thr Leu Leu Glu Pro Val Val Leu Leu Gly Thr 65 70
75 80 Ser Gly Thr Ala Ile Ala Glu Tyr Met Pro
Ala Val Ala Glu Ala Tyr 85 90
95 Ala Ala Asn Ile Pro Leu Val Val Leu Ser Thr Asp Arg Pro Gln
Glu 100 105 110 Leu
Gln Phe Asn Gly Ala Pro Gln Thr Ile Pro Gln Ser Asn Leu Phe 115
120 125 Gly Gln Leu Thr Lys Gln
Ala Val Leu Ile Arg Leu Gln Asp Met His 130 135
140 Ser Asp Val Thr Glu Tyr Ile Asp Phe Ile Val
Gln Lys Val Val Asn 145 150 155
160 Leu Ser Ile Thr Ala Pro Arg Gly Pro Ile Gln Ile Asn Leu Pro Leu
165 170 175 Arg Lys
Pro Leu Met Pro Val Leu Asp Arg Gln Asp Glu Val His Val 180
185 190 Gln Arg Val Val Phe Asp Lys
Gln Ser Val Gln Tyr Arg Leu Pro Pro 195 200
205 Ile Thr Ala Lys Arg Leu Leu Ile Leu Ala Gly Pro
Asn Val Leu Asn 210 215 220
Ser Tyr Asp Asp Ser Leu Lys Lys Phe Ala Ile Lys Asn Asn Val Pro 225
230 235 240 Val Ile Ala
Asp Val Leu Ser Gln Ser Arg His Thr Tyr Thr Ile His 245
250 255 Gly Ile Asp Val Leu Leu Gln Ala
His Lys Ile Asn Ala Asp Leu Lys 260 265
270 Pro Asp Leu Val Val Arg Phe Gly Lys Thr Pro Val Ser
Ala Arg Val 275 280 285
Leu Gln Trp Leu Lys Glu Glu Asn Ile Leu Thr Trp His Val Asp Glu 290
295 300 Asp Ala Gly Val
Asp His Thr Arg His Ile Val Arg Ala Ile Lys Met 305 310
315 320 Ala Pro His Asp Phe Leu Glu Ser Met
His Leu Thr Leu Ser Lys Asn 325 330
335 Gln Ile Asp Phe Asn Gln Lys Trp Leu Ser Leu Pro Lys Val
Ile Lys 340 345 350
Thr Arg Asn Glu Met Asn Ile Ile Thr Ala Leu Asp Asp Ala Val Pro
355 360 365 Asp Asp Thr His
Ile Phe Val Ala Asn Ser Met Pro Ile Arg Asp Met 370
375 380 Asp Asn Phe Phe Thr Gly Asn His
Thr Gln Arg Ile Tyr Ala Asn Arg 385 390
395 400 Gly Ala Asn Gly Ile Asp Gly Val Ile Ser Ser Ala
Leu Gly Met Ser 405 410
415 Ala Val Val Lys Gln Arg Ser Val Leu Leu Thr Gly Asp Leu Thr Leu
420 425 430 Phe His Asp
Met Asn Gly Leu Met Met Ala Lys Asn Tyr Gln Leu Pro 435
440 445 Leu Asp Ile Ile Val Ile Asn Asn
Asn Gly Gly Gly Ile Phe Ser Phe 450 455
460 Leu Pro Gln Ala Gly Ala Pro Lys Tyr Phe Glu Gln Leu
Phe Gly Thr 465 470 475
480 Pro Leu Asn Ile Asp Ile Lys Lys Ile Ala Asp Leu Tyr Tyr Ile Asp
485 490 495 Tyr His Gln Leu
Asn Val Pro Glu Ala Leu Ser Gln Ile Leu Gln Thr 500
505 510 Pro Ser Lys Thr Thr Arg Leu Ile Glu
Tyr Lys Ser Asp His Gln Arg 515 520
525 Asn Arg Asp Asp His Arg Glu Val Leu Glu Met Leu Lys
530 535 540 371242PRTMycobacterium
tuberculosis 37Met Asn Leu Arg Gly Thr Ala Ala Arg Arg Arg His Arg Ala
Thr Gly 1 5 10 15
Asp Thr Leu Asp Ala Arg Gln Leu Val Arg Gln Gln Leu Arg Gln Leu
20 25 30 Gln Thr Glu Thr Arg
Pro Gly Ala Leu Val Arg Trp Phe Leu Leu Asp 35
40 45 Pro Lys Glu Leu Phe Asp Ile Phe Val
Thr Ile Gln Gly Phe Thr Gln 50 55
60 Thr Thr Thr Arg Gln Arg Arg Gln Leu Leu Gln Thr His
Asn Cys Gln 65 70 75
80 Ile Ile Thr Val Val Phe Arg Leu Ala Cys Arg Gln Phe Ile Ile Gln
85 90 95 Leu Thr Gly Gly
Gln Gln Asn Thr Ala Asn Phe Ile Ala Val Ala Asp 100
105 110 Thr Ile Leu Val Gly Arg Phe Leu Gln
Asp Ala Thr Glu Phe Asn Leu 115 120
125 Gly Lys Ile Leu Asn Leu Arg Asn Arg Arg Leu Val Ala Gln
His Gly 130 135 140
Phe Arg Gly Lys His Asp Gln Arg Thr Leu Asp Ala Val Gln Arg Val 145
150 155 160 Ala Ala Gln Gln Val
Lys Val Ile Gly Arg Gly Ala Arg His Arg Asp 165
170 175 Arg His Ala Thr Leu Ser Pro Gln Leu Gln
Lys Ala Leu Asn Ala Arg 180 185
190 Arg Arg Val Val Arg Ala Leu Thr Leu Val Ala Val Trp Gln Gln
Gln 195 200 205 Asn
Asn Val Gly Gln Leu Thr Pro Phe Arg Phe Ala Arg Ala Asp Glu 210
215 220 Leu Val Asn Asp Arg Leu
Gly Thr Ile Asn Lys Val Ala Glu Leu Arg 225 230
235 240 Leu Pro Gln His Asn Arg Val Trp Ile Ala Asn
Gly Ile Thr Val Leu 245 250
255 Lys Thr His Ser Arg Val Phe Arg Gln Arg Arg Ile Ile Asn Gln Glu
260 265 270 Leu Thr
Thr Gly Arg Ala Ala Val Ala Val Ser Arg Gln Gln Leu Gln 275
280 285 Arg Gly Glu Leu Leu Ala Gly
Val Thr Val Asp Gln His Arg Met Thr 290 295
300 Leu Ala Lys Gly Ala Thr Thr Arg Val Leu Thr Ala
Gln Thr Asp Gln 305 310 315
320 Leu Ala Leu Ser Asn Gln Ala Ala Gln Arg Gln Gln Leu Thr Lys Arg
325 330 335 Pro Val Asn
Leu Thr Leu Ile Arg His Leu Thr Thr Leu Phe Gln His 340
345 350 Arg Leu Asn Ala Arg Val Cys Arg
Glu Thr Val Arg Gln Arg Gln Glu 355 360
365 Gly Ile Thr Asp Thr Cys Gln Gln Gly Phe Val Asn Ser
Gly Arg Gln 370 375 380
Ala Arg Trp Asp His Leu Val Arg Leu Asp Ala Leu Arg Arg Leu Asp 385
390 395 400 Ala Val Leu Leu
Gln Phe Ala His Phe Val Glu His Thr Leu Gln Leu 405
410 415 Ala Leu Val Ile Ala Gln Arg Ile Leu
Arg Leu Phe His Ala Asp Val 420 425
430 Ala Thr Thr Asp Gln Ser Leu Gly Val Gly Phe Thr Gly Ala
Thr Leu 435 440 445
Gly Val His Asp Val Ile Asp Ile Arg Ile Gly His Arg Trp Ile Val 450
455 460 Thr Leu Ile Met Thr
Thr Thr Thr Ile Ala Gln His Val Asn Asn Asn 465 470
475 480 Val Leu Phe Lys Ala Leu Ala Glu Val Asn
Cys Gln Thr Arg Asn Pro 485 490
495 Asp Thr Arg Phe Trp Ile Val Ala Val His Val Glu Asp Trp Arg
Thr 500 505 510 Asp
His Leu Arg His Ile Arg Ala Val Leu Ala Arg Thr Arg Val Phe 515
520 525 Arg Ser Gly Gly Glu Ala
Asp Leu Val Val His Asn Asp Val Asn Arg 530 535
540 Thr Thr His Thr Ile Ala Arg Gln Ile Cys Gln
Ile Gln Arg Leu Arg 545 550 555
560 Asn Asn Ala Leu Thr Arg Glu Gly Arg Ile Thr Val Gln His Gln Arg
565 570 575 Asn Asp
Gly Glu Cys Thr Leu Ala Val Gly Val Asn Gly Thr Val Val 580
585 590 Gln Gln Val Leu Leu Arg Thr
His Gln Thr Phe Gln His Trp Ile Asp 595 600
605 Arg Phe Gln Met Gly Arg Ile Cys Arg Gln Gly Asn
Leu Asn Ile Val 610 615 620
Ile Ala Lys His Leu Gln Ile Gln Thr Arg Arg Thr Gln Val Val Leu 625
630 635 640 Asp Ile Ala
Gly Thr Val Ser Leu Gly Arg Val Gln Ile Ala Phe Lys 645
650 655 Leu Arg Lys Asp Leu Arg Ile Arg
Phe Ala His Asp Val Arg Gln Asp 660 665
670 Ile Gln Ala Thr Thr Val Arg His Ala Asn Asp His Phe
Ile Gln Thr 675 680 685
Met Leu Gly Thr Leu Val Asn Arg Arg Val His His Trp Asn Asn Arg 690
695 700 Phe Arg Thr Leu
Gln Ala Lys Thr Leu Leu Ala His Ile Leu Gly Leu 705 710
715 720 Gln Glu Gly Phe Lys Arg Leu Arg Cys
Val Gln Phe Arg Gln Asp Val 725 730
735 Leu Leu Leu Ser Asn Gly Arg Phe Asp Val Leu Arg Leu Asp
Thr Leu 740 745 750
Leu Gln Pro Leu Leu Leu Phe Arg Val Gln Asn Val Arg Val Leu Asn
755 760 765 Thr Asp Val Thr
Ala Val Arg Val Ala Gln Gln Thr Gln His Val Thr 770
775 780 Gln Leu Phe Val Leu Ser Thr Arg
Glu Thr Val Asn Leu Lys Asp Ala 785 790
795 800 Val Gln Val Pro Gln Ser Gln Ala Met Arg Gln His
Leu Gln Ile Arg 805 810
815 Met Arg Thr Lys Ala Arg Leu Ile Gln Ala Gln Arg Val Gly Val Arg
820 825 830 His Gln Met
Ala Ala Val Ala Ile Gly Arg Asp Gln Val His His Thr 835
840 845 Cys Val Leu Val Asn Asp Arg Val
Arg Ile Ile Gly Ala Pro Thr His 850 855
860 Trp Gln Val Arg Asp Ala Gln Leu Ala Glu Asp Leu Ile
Pro Glu Ala 865 870 875
880 Ile Arg Gln Gln Gln Phe Met Asn Gly Thr Gln Glu Val Thr Ala Phe
885 890 895 Arg Thr Leu Asn
Asp Thr Val Val Val Gly Arg Ser Gln Gly Asn Gln 900
905 910 Phe Ala Asn Thr Gln Leu Ser Asp Ala
Phe Leu Gly Arg Ala Leu Glu 915 920
925 Leu Arg Arg Ile Phe His Arg Thr Asp Thr Asp Asp Ser Thr
Leu Thr 930 935 940
Trp His Gln Ala Trp His Arg Val His Arg Ala Asn Arg Ala Arg Ile 945
950 955 960 Arg Gln Arg Asn Arg
Asn Ala Ser Glu Val Leu Gly Gly Gln Phe Thr 965
970 975 Ile Thr Ser Thr Thr Asp Asp Val Leu Val
Arg Gly Asn Glu Leu Arg 980 985
990 Glu Ala His Arg Leu Ala Thr Phe Asp Thr Gly His His Gln
Arg Ala 995 1000 1005
Leu Ala Ile Phe Thr Leu Gln Val Asn Ser Gln Thr Gln Ile Gly 1010
1015 1020 Val Arg Arg Ser Asn
Arg Arg Arg Leu Thr Val His Leu Arg Val 1025 1030
1035 Val Ala Ile His Ile Arg Glu Leu Leu Asn
Arg Leu Asn Gln Ser 1040 1045 1050
Ile Thr Gln Gln Met Gly Lys Ala Asp Phe Thr Thr Thr Arg Ala
1055 1060 1065 Phe Gln
Leu Ile Ile Asp His Asp Thr Ile Ile Asp Gln Gln Phe 1070
1075 1080 Arg Trp His Ser Thr His Arg
Gly Arg Arg Arg His Phe Gln Arg 1085 1090
1095 Arg Ala His Val Leu His Asp Ser Ser Arg Arg Thr
Thr Gln Asp 1100 1105 1110
Ser Asn Phe Ile Ala Phe Gly Trp Arg Arg His Arg Gly Leu Gly 1115
1120 1125 Gly Gln Ser Arg His
Asp Thr Val Ala Arg Cys Gly Arg Val Gly 1130 1135
1140 Arg Phe Arg Arg Arg Leu Arg Ser Arg Arg
Cys Arg Thr Leu Ser 1145 1150 1155
Asp Gln Arg Ala Gly His Thr Ser Arg Leu Ser Ser Arg Leu Gly
1160 1165 1170 Ser Arg
Leu Arg Ala Ile Val His Gln Lys Leu Met Pro Ala Trp 1175
1180 1185 Val Asn Gly Arg Arg Ile Ile
Thr Lys Phe Thr Ile His Thr Asn 1190 1195
1200 Thr Arg Arg Val Leu Ser His His Gly Tyr Ile Ser
Phe Leu Lys 1205 1210 1215
Asn Ser Ile Ser Gln Arg Gln Gly Arg His Lys Ile Tyr Ser Lys 1220
1225 1230 Cys Ile Ile Asn Thr
Asp Asn Ile Leu 1235 1240
384412DNAArtificial SequenceDescription of Artificial Sequence Synthetic
plasmid pJ201pdc 38tagaaaaact catcgagcat caaatgaaac tgcaatttat
tcatatcagg attatcaata 60ccatattttt gaaaaagccg tttctgtaat gaaggagaaa
actcaccgag gcagttccat 120aggatggcaa gatcctggta tcggtctgcg attccgactc
gtccaacatc aatacaacct 180attaatttcc cctcgtcaaa aataaggtta tcaagtgaga
aatcaccatg agtgacgact 240gaatccggtg agaatggcaa aagtttatgc atttctttcc
agacttgttc aacaggccag 300ccattacgct cgtcatcaaa atcactcgca tcaaccaaac
cgttattcat tcgtgattgc 360gcctgagcga ggcgaaatac gcgatcgctg ttaaaaggac
aattacaaac aggaatcgag 420tgcaaccggc gcaggaacac tgccagcgca tcaacaatat
tttcacctga atcaggatat 480tcttctaata cctggaacgc tgtttttccg gggatcgcag
tggtgagtaa ccatgcatca 540tcaggagtac ggataaaatg cttgatggtc ggaagtggca
taaattccgt cagccagttt 600agtctgacca tctcatctgt aacatcattg gcaacgctac
ctttgccatg tttcagaaac 660aactctggcg catcgggctt cccatacaag cgatagattg
tcgcacctga ttgcccgaca 720ttatcgcgag cccatttata cccatataaa tcagcatcca
tgttggaatt taatcgcggc 780ctcgacgttt cccgttgaat atggctcata ttcttccttt
ttcaatatta ttgaagcatt 840tatcagggtt attgtctcat gagcggatac atatttgaat
gtatttagaa aaataaacaa 900ataggggtca gtgttacaac caattaacca attctgaaca
ttatcgcgag cccatttata 960cctgaatatg gctcataaca ccccttgttt gcctggcggc
agtagcgcgg tggtcccacc 1020tgaccccatg ccgaactcag aagtgaaacg ccgtagcgcc
gatggtagtg tggggactcc 1080ccatgcgaga gtagggaact gccaggcatc aaataaaacg
aaaggctcag tcgaaagact 1140gggcctttcg cccgggctaa ttagggggtg tcgcccttcg
ctgaacatat gagctatacc 1200gttggcacct atctggcaga acgcctggtg cagatcggcc
tgaaacacca cttcgcggtg 1260gcgggtgact acaacctggt gctgctggac aacttgctgt
tgaacaagaa tatggaacag 1320gtgtactgct gtaatgagct gaattgtggc ttcagcgcgg
agggttacgc ccgggccaaa 1380ggtgcagccg cagcggttgt gacctatagc gttggcgcgc
tgagcgcatt tgatgcgatt 1440ggtggtgcgt acgctgagaa tttgccggtc atcctgatca
gcggcgcacc gaacaacaac 1500gaccatgctg cgggccacgt gctgcatcac gcgctgggta
agaccgacta tcattatcag 1560ttggagatgg cgaagaatat cacggccgct gcggaagcta
tctatacgcc ggaagaggct 1620cctgcaaaaa tcgatcacgt tatcaaaacc gcactgcgtg
agaagaagcc agtgtacctg 1680gagattgcct gcaacattgc gtctatgccg tgcgcagcac
cgggtcctgc gagcgcgttg 1740tttaacgatg aggccagcga cgaagcgagc ctgaacgcgg
ctgttgaaga aacgctgaaa 1800ttcattgcaa atcgtgataa ggttgctgtc ctggttggca
gcaagctgcg tgccgcaggc 1860gcggaagaag cagcggtgaa gttcgcagat gccctgggtg
gcgcggtcgc aaccatggct 1920gccgcaaaat ctttctttcc ggaagagaat ccacactata
tcggcactag ctggggtgag 1980gtgtcttatc cgggcgtcga gaaaaccatg aaagaagcgg
atgcggtgat cgccctggct 2040ccggttttca acgattacag caccactggc tggacggata
ttccggaccc gaaaaagctg 2100gtgctggcgg agccgcgtag cgtggtggtc aatggtatcc
gttttccgtc cgtccatctg 2160aaggactacc tgacccgcct ggcgcagaag gttagcaaga
aaacgggtgc gctggatttc 2220ttcaaatcgc tgaatgcagg tgaactgaaa aaggcagccc
cagcagatcc gtccgcaccg 2280ctggtgaatg cggagatcgc gcgtcaagtc gaagctctgc
tgacgccgaa cacgaccgtt 2340atcgcggaga cgggtgacag ctggtttaac gcgcaacgta
tgaagttgcc gaatggtgct 2400cgagttgagt acgagatgca atggggccac attggctggt
ccgtgcctgc agcctttggt 2460tacgccgttg gtgccccgga gcgtcgcaac attctgatgg
tgggtgacgg tagctttcaa 2520ctgaccgccc aagaggtcgc tcagatggtc cgcctgaaac
tgcctgtcat cattttcctg 2580atcaataact acggttacac gatcgaggtt atgatccatg
atggcccgta caacaacatt 2640aagaattggg actacgcggg tttgatggag gttttcaatg
gcaacggtgg ttatgactcc 2700ggtgcgggta aaggtctgaa agcaaagacg ggtggcgagc
tcgcagaggc gattaaggtt 2760gcgctggcga ataccgacgg cccgaccttg attgagtgtt
tcattggtcg cgaggattgc 2820accgaagaac tggtcaaatg gggcaagcgc gtcgcagccg
cgaattcccg taaaccggtt 2880aataagcttc tcggccatca ccatcaccat cactagaagc
ttctctagag aactatttcc 2940gtcaatcgag ttcgtaccta agggcgacac aaaatttatt
ctaaatgcat aataaatact 3000gataacatct tatagtttgt attatatttt gtattatcgt
tgacatgtat aattttgata 3060tcaaaaactg attttccctt tattattttc gagatttatt
ttcttaattc tctttaacaa 3120actagaaata ttgtatatac aaaaaatcat aaataataga
tgaatagttt aattataggt 3180gttcatcaat cgaaaaagca acgtatctta tttaaagtgc
gttgcttttt tctcatttat 3240aaggttaaat aattctcata tatcaagcaa agtgacaggc
gcccttaaat attctgacaa 3300atgctctttc cctaaactcc ccccataaaa aaacccgccg
aagcgggttt ttacgttatt 3360tgcggattaa cgattactcg ttatcagaac cgcccagggg
gcccgagctt aagactggcc 3420gtcgttttac aacacagaaa gagtttgtag aaacgcaaaa
aggccatccg tcaggggcct 3480tctgcttagt ttgatgcctg gcagttccct actctcgcct
tccgcttcct cgctcactga 3540ctcgctgcgc tcggtcgttc ggctgcggcg agcggtatca
gctcactcaa aggcggtaat 3600acggttatcc acagaatcag gggataacgc aggaaagaac
atgtgagcaa aaggccagca 3660aaaggccagg aaccgtaaaa aggccgcgtt gctggcgttt
ttccataggc tccgcccccc 3720tgacgagcat cacaaaaatc gacgctcaag tcagaggtgg
cgaaacccga caggactata 3780aagataccag gcgtttcccc ctggaagctc cctcgtgcgc
tctcctgttc cgaccctgcc 3840gcttaccgga tacctgtccg cctttctccc ttcgggaagc
gtggcgcttt ctcatagctc 3900acgctgtagg tatctcagtt cggtgtaggt cgttcgctcc
aagctgggct gtgtgcacga 3960accccccgtt cagcccgacc gctgcgcctt atccggtaac
tatcgtcttg agtccaaccc 4020ggtaagacac gacttatcgc cactggcagc agccactggt
aacaggatta gcagagcgag 4080gtatgtaggc ggtgctacag agttcttgaa gtggtgggct
aactacggct acactagaag 4140aacagtattt ggtatctgcg ctctgctgaa gccagttacc
ttcggaaaaa gagttggtag 4200ctcttgatcc ggcaaacaaa ccaccgctgg tagcggtggt
ttttttgttt gcaagcagca 4260gattacgcgc agaaaaaaag gatctcaaga agatcctttg
atcttttcta cggggtctga 4320cgctcagtgg aacgacgcgc gcgtaactca cgttaaggga
ttttggtcat gagcttgcgc 4380cgtcccgtca agtcagcgta atgctctgct tt
4412394418DNAArtificial SequenceDescription of
Artificial Sequence Synthetic plasmid pJ251oad 39gcttttagaa
aaactcatcg agcatcaaat gaaactgcaa tttattcata tcaggattat 60caataccata
tttttgaaaa agccgtttct gtaatgaagg agaaaactca ccgaggcagt 120tccataggat
ggcaagatcc tggtatcggt ctgcgattcc gactcgtcca acatcaatac 180aacctattaa
tttcccctcg tcaaaaataa ggttatcaag tgagaaatca ccatgagtga 240cgactgaatc
cggtgagaat ggcaaaagtt tatgcatttc tttccagact tgttcaacag 300gccagccatt
acgctcgtca tcaaaatcac tcgcatcaac caaaccgtta ttcattcgtg 360attgcgcctg
agcgaggcga aatacgcgat cgctgttaaa aggacaatta caaacaggaa 420tcgagtgcaa
ccggcgcagg aacactgcca gcgcatcaac aatattttca cctgaatcag 480gatattcttc
taatacctgg aacgctgttt ttccggggat cgcagtggtg agtaaccatg 540catcatcagg
agtacggata aaatgcttga tggtcggaag tggcataaat tccgtcagcc 600agtttagtct
gaccatctca tctgtaacat cattggcaac gctacctttg ccatgtttca 660gaaacaactc
tggcgcatcg ggcttcccat acaagcgata gattgtcgca cctgattgcc 720cgacattatc
gcgagcccat ttatacccat ataaatcagc atccatgttg gaatttaatc 780gcggcctcga
cgtttcccgt tgaatatggc tcatattctt cctttttcaa tattattgaa 840gcatttatca
gggttattgt ctcatgagcg gatacatatt tgaatgtatt tagaaaaata 900aacaaatagg
ggtcagtgtt acaaccaatt aaccaattct gaacattatc gcgagcccat 960ttatacctga
atatggctca taacacccct tgtttgcctg gcggcagtag cgcggtggtc 1020ccacctgacc
ccatgccgaa ctcagaagtg aaacgccgta gcgccgatgg tagtgtgggg 1080actccccatg
cgagagtagg gaactgccag gcatcaaata aaacgaaagg ctcagtcgaa 1140agactgggcc
tttcgcccgg gctaattagg gggtgtcgcc cttattcgac tctataggga 1200agttcctatt
ctctagaaag tataggaact tctgaagggg ggtttcgccc ggggacagct 1260gtcggtttga
ataaatgaca aaaagcaaag cctttgtgcc gatgaatctc tatactgttt 1320cacagacctg
ctgccctgcg gggcggccat cttcctttat tcgcttataa gcgtggagaa 1380ttaaaatgac
ggataccctg accttcaaca ccaaacattt gttggaagcg ctgtttgaaa 1440gcggtattcg
tcacttcatt gtttctccgg gttctcgtag caccccgatt gcactgctgc 1500tggcagaata
cgcagagcag aataacgaga tcaagctgtt cgttgacgtg gatgagcgca 1560gcgcgggttt
cttcgcactg ggcatcgcga aaacgttgct ggagccggtt gttttgctgg 1620gtacctcggg
taccgccatc gctgaataca tgcctgcggt ggcggaagcc tatgcggcga 1680atatcccgct
ggtcgtgttg agcaccgatc gtccacagga attgcagttc aacggcgcac 1740cgcaaactat
tccgcaaagc aacctgttcg gtcaactgac gaaacaggca gtcctgattc 1800gcctgcaaga
catgcacagc gatgtgaccg agtacatcga ctttatcgtt cagaaggtcg 1860ttaatctgtc
cattaccgcg cctcgtggcc cgattcagat caacctgccg ctgcgcaaac 1920cgctgatgcc
ggtcctggat cgtcaagacg aggtgcacgt tcagcgtgtc gtgtttgaca 1980aacaaagcgt
gcaatatcgc ttgccgccga ttacggcgaa acgcctgctg atcctggcag 2040gtccgaatgt
cctgaatagc tacgatgata gcctgaagaa gtttgcgatc aaaaacaatg 2100ttccagttat
cgcggatgtt ctgtcccaaa gccgccacac gtataccatt cacggtattg 2160acgttctgct
gcaagcccac aagatcaacg ccgacctgaa accggatctg gtggttcgtt 2220ttggcaagac
gccggttagc gcgcgtgtcc tgcaatggct gaaagaagag aacattctga 2280cctggcacgt
ggacgaggac gcaggcgtgg atcacacccg tcatatcgtc cgtgccatca 2340agatggcgcc
tcatgatttt ctggaatcca tgcatctgac gttgagcaaa aaccagattg 2400acttcaacca
gaagtggctg tctctgccga aagtgatcaa aactcgtaac gagatgaata 2460tcatcaccgc
attggacgac gcggtcccgg atgacactca catcttcgtg gcgaacagca 2520tgccgattcg
tgacatggac aactttttca cgggtaatca cacccagcgc atttacgcta 2580atcgtggtgc
aaatggtatt gatggcgtga ttagctccgc cctgggcatg agcgcggtcg 2640tcaagcaacg
ttcggtcctg ctgacgggtg acctgacgct gtttcatgac atgaacggtc 2700tgatgatggc
gaagaattac cagctgccgc tggacatcat tgtgattaac aacaatggcg 2760gtggcatttt
cagcttcttg ccacaagccg gtgctccgaa gtactttgag cagctgttcg 2820gtaccccgct
gaacattgac atcaaaaaga ttgctgacct gtattacatc gattaccatc 2880agttgaatgt
gccggaggcc ctgagccaga tcctgcagac gccgagcaaa accacccgcc 2940tgatcgagta
taagtccgat caccagcgta accgtgacga tcatcgcgaa gtgctggaga 3000tgctgaaaca
ccaccatcat cattagcacg tgaaaatgaa gggaagttcc tatactttct 3060agagaatagg
aacttctata gggagtcgaa taagggcgac acaaaaggta ttctaaatgc 3120ataataaata
ctgataacat cttatagttt gtattatatt ttgtattatc gttgacatgt 3180ataattttga
tatcaaaaac tgattttccc tttattattt tcgagattta ttttcttaat 3240tctctttaac
aaactagaaa tattgtatat acaaaaaatc ataaataata gatgaatagt 3300ttaattatag
gtgttcatca atcgaaaaag caacgtatct tatttaaagt gcgttgcttt 3360tttctcattt
ataaggttaa ataattctca tatatcaagc aaagtgacag gcgcccttaa 3420atattctgac
aaatgctctt tccctaaact ccccccataa aaaaacccgc cgaagcgggt 3480ttttacgtta
tttgcggatt aacgattact cgttatcaga accgcccagg atgcctggca 3540gttccctact
ctcgccgctg cgctcggtcg ttcggctgcg ggacctcagc gctagcggag 3600tgtatactgg
cttactatgt tggcactgat gagggtgtca gtgaagtgct tcatgtggca 3660ggagaaaaaa
ggctgcaccg gtgcgtcagc agaatatgtg atacaggata tattccgctt 3720cctcgctcac
tgactcgcta cgctcggtcg ttcgactgcg gcgagcggaa atggcttacg 3780aacggggcgg
agatttcctg gaagatgcca ggaagatact taacagggaa gtgagagggc 3840cgcggcaaag
ccgtttttcc ataggctccg cccccctgac aagcatcacg aaatctgacg 3900ctcaaatcag
tggtggcgaa acccgacagg actataaaga taccaggcgt ttccccctgg 3960cggctccctc
gtgcgctctc ctgttcctgc ctttcggttt accggtgtca ttccgctgtt 4020atggccgcgt
ttgtctcatt ccacgcctga cactcagttc cgggtaggca gttcgctcca 4080agctggactg
tatgcacgaa ccccccgttc agtccgaccg ctgcgcctta tccggtaact 4140atcgtcttga
gtccaacccg gaaagacatg caaaagcacc actggcagca gccactggta 4200attgatttag
aggagttagt cttgaagtca tgcgccggtt aaggctaaac tgaaaggaca 4260agttttggtg
actgcgctcc tccaagccag ttacctcggt tcaaagagtt ggtagctcag 4320agaaccttcg
aaaaaccgcc ctgcaaggcg gttttttcgt tttcagagca agagattacg 4380cgcagaccaa
aacgatctca agaagatcat cttattaa
4418408621DNAArtificial SequenceDescription of Artificial Sequence
Synthetic plasmid pJ206kgd 40ggtggcggta cttgggtcga tatcaaagtg
catcacttct tcccgtatgc ccaactttgt 60atagagagcc actgcgggat cgtcaccgta
atctgcttgc acgtagatca cataagcacc 120aagcgcgttg gcctcatgct tgaggagatt
gatgagcgcg gtggcaatgc cctgcctccg 180gtgctcgccg gagactgcga gatcatagat
atagatctca ctacgcggct gctcaaactt 240gggcagaacg taagccgcga gagcgccaac
aaccgcttct tggtcgaagg cagcaagcgc 300gatgaatgtc ttactacgga gcaagttccc
gaggtaatcg gagtccggct gatgttggga 360gtaggtggct acgtcaccga actcacgacc
gaaaagatca agagcagccc gcatggattt 420gacttggtca gggccgagcc tacatgtgcg
aatgatgccc atacttgagc cacctaactt 480tgttttaggg cgactgccct gctgcgtaac
atcgttgctg ctccataaca tcaaacatcg 540acccacggcg taacgcgctt gctgcttgga
tgcccgaggc atagactgta caaaaaaaca 600gtcataacaa gccatgaaaa ccgccactgc
gccgttacca ccgctgcgtt cggtcaaggt 660tctggaccag ttgcgtgagc gcattttttt
ttcctcctcg gcgtttacgc cccgccctgc 720cactcatcgc agtactgttg taattcatta
agcattctgc cgacatggaa gccatcacag 780acggcatgat gaacctgaat cgccagcggc
atcagcacct tgtcgccttg cgtataatat 840ttgcccatag tgaaaacggg ggcgaagaag
ttgtccatat tggccacgtt taaatcaaaa 900ctggtgaaac tcacccaggg attggcgctg
acgaaaaaca tattctcaat aaacccttta 960gggaaatagg ccaggttttc accgtaacac
gccacatctt gcgaatatat gtgtagaaac 1020tgccggaaat cgtcgtggta ttcactccag
agcgatgaaa acgtttcagt ttgctcatgg 1080aaaacggtgt aacaagggtg aacactatcc
catatcacca gctcaccgtc tttcattgcc 1140atacggaact ccggatgagc attcatcagg
cgggcaagaa tgtgaataaa ggccggataa 1200aacttgtgct tatttttctt tacggtcttt
aaaaaggccg taatatccag ctgaacggtc 1260tggttatagg tacattgagc aactgactga
aatgcctcaa aatgttcttt acgatgccat 1320tgggatatat caacggtggt atatccagtg
atttttttct ccattttttt ttcctccttt 1380agaaaaactc atcgagcatc aaatgaaact
gcaatttatt catatcagga ttatcaatac 1440catatttttg aaaaagccgt ttctgtaatg
aaggagaaaa ctcaccgagg cagttccata 1500ggatggcaag atcctggtat cggtctgcga
ttccgactcg tccaacatca atacaaccta 1560ttaatttccc ctcgtcaaaa ataaggttat
caagtgagaa atcaccatga gtgacgactg 1620aatccggtga gaatggcaaa agtttatgca
tttctttcca gacttgttca acaggccagc 1680cattacgctc gtcatcaaaa tcactcgcat
caaccaaacc gttattcatt cgtgattgcg 1740cctgagcgag gcgaaatacg cgatcgctgt
taaaaggaca attacaaaca ggaatcgagt 1800gcaaccggcg caggaacact gccagcgcat
caacaatatt ttcacctgaa tcaggatatt 1860cttctaatac ctggaacgct gtttttccgg
ggatcgcagt ggtgagtaac catgcatcat 1920caggagtacg gataaaatgc ttgatggtcg
gaagtggcat aaattccgtc agccagttta 1980gtctgaccat ctcatctgta acatcattgg
caacgctacc tttgccatgt ttcagaaaca 2040actctggcgc atcgggcttc ccatacaagc
gatagattgt cgcacctgat tgcccgacat 2100tatcgcgagc ccatttatac ccatataaat
cagcatccat gttggaattt aatcgcggcc 2160tcgacgtttc ccgttgaata tggctcattt
ttttttcctc ctttaccaat gcttaatcag 2220tgaggcacct atctcagcga tctgtctatt
tcgttcatcc atagttgcct gactccccgt 2280cgtgtagata actacgatac gggagggctt
accatctggc cccagcgctg cgatgatacc 2340gcgagaacca cgctcaccgg ctccggattt
atcagcaata aaccagccag ccggaagggc 2400cgagcgcaga agtggtcctg caactttatc
cgcctccatc cagtctatta attgttgccg 2460ggaagctaga gtaagtagtt cgccagttaa
tagtttgcgc aacgttgttg ccatcgctac 2520aggcatcgtg gtgtcacgct cgtcgtttgg
tatggcttca ttcagctccg gttcccaacg 2580atcaaggcga gttacatgat cccccatgtt
gtgcaaaaaa gcggttagct ccttcggtcc 2640tccgatcgtt gtcagaagta agttggccgc
agtgttatca ctcatggtta tggcagcact 2700gcataattct cttactgtca tgccatccgt
aagatgcttt tctgtgactg gtgagtactc 2760aaccaagtca ttctgagaat agtgtatgcg
gcgaccgagt tgctcttgcc cggcgtcaat 2820acgggataat accgcgccac atagcagaac
tttaaaagtg ctcatcattg gaaaacgttc 2880ttcggggcga aaactctcaa ggatcttacc
gctgttgaga tccagttcga tgtaacccac 2940tcgtgcaccc aactgatctt cagcatcttt
tactttcacc agcgtttctg ggtgagcaaa 3000aacaggaagg caaaatgccg caaaaaaggg
aataagggcg acacggaaat gttgaatact 3060catattcttc ctttttcaat attattgaag
catttatcag ggttattgtc tcatgagcgg 3120atacatattt gaatgtattt agaaaaataa
acaaataggg gtcagtgtta caaccaatta 3180accaattctg aacattatcg cgagcccatt
tatacctgaa tatggctcat aacacccctt 3240gtttgcctgg cggcagtagc gcggtggtcc
cacctgaccc catgccgaac tcagaagtga 3300aacgccgtag cgccgatggt agtgtgggga
ctccccatgc gagagtaggg aactgccagg 3360catcaaataa aacgaaaggc tcagtcgaaa
gactgggcct ttcgcccggg ctaattgagg 3420ggtgtcgccc ttttgacgga tatcaagctt
ctattaaccg aacgcttcgt ccaagatttc 3480ttgctgctcc acggcatgaa ccttcgagga
accgctgctc ggcgcagaca tcgcgcgacg 3540ggagatacgc ttgatgcccg ccaacttgtc
cggcagcaac tccggcaact ccaaaccgaa 3600acgcggccag gcgccctggt tcgctggttc
ctcctggacc caaaagaact ctttgacatt 3660ttcgtaacga tccagggttt cacgcagacg
acgacgcggc aacggcgcca gttgctccag 3720acgcacaatt gccagatcat tacggttgtc
tttcgccttg cgtgccgcca attcataata 3780caacttaccg gaggtcaaca gaatacggct
aactttattg cggtcgccga taccatcctc 3840gtaggtcggt tcctccagga cgctacggaa
tttaatctcg gtaaaatcct taatctccga 3900aaccgccgcc ttgtggcgca gcatggattt
cggggtaaac acgatcaacg gacgttggat 3960gccgtccaac gcgtggcggc gcagcaagtg
aaagtaattg gacggggtgc tcggcatcgc 4020gatcgtcatg ctaccctcag cccacagctg
caaaaagcgc tcaatgcgcg ccgacgtgtg 4080gtccgggccc tgaccctcgt ggccgtgtgg
cagcaacaga acaacgttgg acagctgacc 4140ccatttcgct tcgcccgagc tgatgaactc
gtcaatgatc gactgggcac cattaacaaa 4200gtcgccgaat tgcgcctccc acagcacaac
cgcgtctgga ttgccaacgg tataaccgta 4260ctcaaaaccc acagccgcgt attccgacaa
cggcgaatca taaaccagga acttaccacc 4320ggtcgggctg ccgtcgctgt tagtcgccag
cagctgcagc ggggtgaact cctcgccggt 4380gtgacggtcg atcagcacag aatgacgctg
gctaaaggtg ccacgacgag agtcttgacc 4440gctcagacgg accagcttgc cctcagcaac
caggctgccc agcgccagca actcaccaaa 4500cgcccagtca atcttaccct catacgccat
ctcacgacgc ttttccagca ccggctgaac 4560gcgcgggtgt gccgtgaaac cgttaggcag
cgccaggaag gcatcaccga tacgtgccag 4620cagggatttg tcaacagcgg tcgccaggcc
cgctgggatc atttggtccg actcgacgct 4680ctccgacggt tggacgccgt gcttctccag
ttcgcgcact tcgttgaaca cacgctccag 4740ttggccctgg taatcgcgca gcgcatcctc
cgcctctttc atgctgatgt cgccacgacc 4800gatcagagcc tcggtgtagg atttacgggc
gccacgcttg gtgtccacga cgtcatagac 4860atacggattg gtcatagatg gatcgtcacc
ctcattatga ccacgacgac gatagcacag 4920catgtcaata acaacgtcct ttttaaagcg
ttggcggaag tcaactgcca aacgcgcaac 4980ccagacacac gcttctggat cgtcgccgtt
cacgtggaag attggcgcac cgatcatctt 5040cgccacatcc gtgcagtact cgctcgaacg
agagtattcc ggagcggtgg tgaagccgat 5100ttggttgttc acaatgatgt gaatcgtacc
acccacacga tagccaggca gatttgccag 5160attcagcgtc tccgcaacaa cgccttgacc
cgcgaaggcc gcatcaccgt gcagcatcaa 5220cggaacgacg gagaatgcac gttggccgtc
ggagtcaatg gaaccgtggt ccagcaggtc 5280ttgcttcgca cgcaccaaac cttccagcac
tggatcgacc gcttccaaat gggacggatt 5340tgccgtcagg gaaacctgaa tatcgttatc
gccaaacatt tgcagataca gacccgtcgc 5400acccaggtgg tacttgacat cgccggaacc
gtgagcctgg gacgggttca gattgccttc 5460aaactccgta aagatttgcg aatacggttt
gcccacgatg ttcgccagga cattcaagcg 5520accacggtgc ggcatgccaa tgaccacttc
atccaaacca tgctcggcac attggtcaat 5580cgccgcgtcc atcattggaa taacagattc
cgcaccctcc aggctaaaac gcttttggcc 5640cacatacttg gtttgcagga aggtttcaaa
cgcctccgct gcgttcagtt tcgacaagat 5700gtacttttgt tgagcaacgg tcggtttgac
gtgcttcgtc tcgacacgct gctccagcca 5760ctccttttgt tccgggtcca gaatgtgcgc
gtactcaaca ccgatgtgac ggcagtacgc 5820gtcgcgcagc aaacccagca cgtcacgcag
ctttttgtat tgagcacccg cgaaaccgtc 5880aaccttaaag acgcggtcca ggtcccacag
agtcaggcca tgcgtcaaca cctccaaatc 5940cggatgcgaa cgaaagcgcg ccttatccaa
gcgcaacggg tcggtgtccg ccatcagatg 6000gccgcggttg cgataggccg cgatcaggtt
catcacacgt gcgttcttgt caacgatcga 6060gtccggatta tcggtgctcc aacgcactgg
caggtacggg atgctcagct cgcggaagac 6120ctcatcccag aagccatcag acaacagcag
ttcatgaatg gtacgcagga agtcaccgct 6180ttccgcacct tgaatgatac ggtggtcgta
ggtagaagtc agggtaatca gtttgccaat 6240acccagctca gcgatgcgtt cctcggacgc
gccttggaac tccgccggat attccatcgc 6300accgacaccg atgatagcac cttgacctgg
catcaggcgt ggcacagagt gcaccgtgcc 6360aatcgtgccc ggattcgtca gcgaaatcgt
aacgccagcg aagtcctcgg tggtcagttt 6420accatcacga gcacgacgga cgatgtcctc
gtacgcggta acgaactgcg cgaagcgcat 6480cgtctcgcaa cgtttgatac cggccaccac
cagagagcgc ttgccatctt taccttgcag 6540gtcaatagcc agacccagat tggtgtgcgc
aggagtaacc gccgtcggct taccgtccac 6600ctccgtgtag tggcgattca tattcgggaa
cttcttaacc gcctgaacca gagcataacc 6660cagcaaatgg gtaaagctga ttttaccacc
acgcgtgcgt ttcaactgat tattgatcac 6720gatacgatta tcgatcaaca atttcgctgg
cacagcacgc accgaggtcg ccgtaggcac 6780ttccagcgac gcgctcatgt tcttcacgac
agcagccgcc gcaccacgca ggacagcaac 6840ttcatcgcct tcggctggcg gaggcaccgc
ggtcttggcg gccagagccg ccacgacacc 6900gttgcccgct gcggccgtgt cggccggttt
cggcggcgct tgcggagccg ccgctgccgc 6960acgctcagcg accaaagggc tggtcacacg
agtcggctca gcagccggtt gggaagtcgg 7020ctccgggcta tagtccacca gaaactcatg
ccagcttggg tcaacggaag acggatcatc 7080acgaaattta cgatacatac caacacgaga
cgggtcctga gtcaccatgg atatatctcc 7140ttcttaaaga attcgatatc tcagcgacaa
gggcgacaca aaatttattc taaatgcata 7200ataaatactg ataacatctt atagtttgta
ttatattttg tattatcgtt gacatgtata 7260attttgatat caaaaactga ttttcccttt
attattttcg agatttattt tcttaattct 7320ctttaacaaa ctagaaatat tgtatataca
aaaaatcata aataatagat gaatagttta 7380attataggtg ttcatcaatc gaaaaagcaa
cgtatcttat ttaaagtgcg ttgctttttt 7440ctcatttata aggttaaata attctcatat
atcaagcaaa gtgacaggcg cccttaaata 7500ttctgacaaa tgctctttcc ctaaactccc
cccataaaaa aacccgccga agcgggtttt 7560tacgttattt gcggattaac gattactcgt
tatcagaacc gcccaggggg cccgagctta 7620agactggccg tcgttttaca acacagaaag
agtttgtaga aacgcaaaaa ggccatccgt 7680caggggcctt ctgcttagtt tgatgcctgg
cagttcccta ctctcgcctt ccgcttcctc 7740gctcactgac tcgctgcgct cggtcgttcg
gctgcggcga gcggtatcag ctcactcaaa 7800ggcggtaata cggttatcca cagaatcagg
ggataacgca ggaaagaaca tgtgagcaaa 7860aggccagcaa aaggccagga accgtaaaaa
ggccgcgttg ctggcgtttt tccataggct 7920ccgcccccct gacgagcatc acaaaaatcg
acgctcaagt cagaggtggc gaaacccgac 7980aggactataa agataccagg cgtttccccc
tggaagctcc ctcgtgcgct ctcctgttcc 8040gaccctgccg cttaccggat acctgtccgc
ctttctccct tcgggaagcg tggcgctttc 8100tcatagctca cgctgtaggt atctcagttc
ggtgtaggtc gttcgctcca agctgggctg 8160tgtgcacgaa ccccccgttc agcccgaccg
ctgcgcctta tccggtaact atcgtcttga 8220gtccaacccg gtaagacacg acttatcgcc
actggcagca gccactggta acaggattag 8280cagagcgagg tatgtaggcg gtgctacaga
gttcttgaag tggtgggcta actacggcta 8340cactagaaga acagtatttg gtatctgcgc
tctgctgaag ccagttacct tcggaaaaag 8400agttggtagc tcttgatccg gcaaacaaac
caccgctggt agcggtggtt tttttgtttg 8460caagcagcag attacgcgca gaaaaaaagg
atctcaagaa gatcctttga tcttttctac 8520ggggtctgac gctcagtgga acgacgcgcg
cgtaactcac gttaagggat tttggtcatg 8580agcttgcgcc gtcccgtcaa gtcagcgtaa
tgctctgctt a 8621414401DNAArtificial
SequenceDescription of Artificial Sequence Synthetic plasmid
pTrc-HisB 41gtttgacagc ttatcatcga ctgcacggtg caccaatgct tctggcgtca
ggcagccatc 60ggaagctgtg gtatggctgt gcaggtcgta aatcactgca taattcgtgt
cgctcaaggc 120gcactcccgt tctggataat gttttttgcg ccgacatcat aacggttctg
gcaaatattc 180tgaaatgagc tgttgacaat taatcatccg gctcgtataa tgtgtggaat
tgtgagcgga 240taacaatttc acacaggaaa cagcgccgct gagaaaaagc gaagcggcac
tgctctttaa 300caatttatca gacaatctgt gtgggcactc gaccggaatt atcgattaac
tttattatta 360aaaattaaag aggtatatat taatgtatcg attaaataag gaggaataaa
ccatgggggg 420ttctcatcat catcatcatc atggtatggc tagcatgact ggtggacagc
aaatgggtcg 480ggatctgtac gacgatgacg ataaggatcc gagctcgaga tctgcagctg
gtaccatatg 540ggaattcgaa gcttggctgt tttggcggat gagagaagat tttcagcctg
atacagatta 600aatcagaacg cagaagcggt ctgataaaac agaatttgcc tggcggcagt
agcgcggtgg 660tcccacctga ccccatgccg aactcagaag tgaaacgccg tagcgccgat
ggtagtgtgg 720ggtctcccca tgcgagagta gggaactgcc aggcatcaaa taaaacgaaa
ggctcagtcg 780aaagactggg cctttcgttt tatctgttgt ttgtcggtga acgctctcct
gagtaggaca 840aatccgccgg gagcggattt gaacgttgcg aagcaacggc ccggagggtg
gcgggcagga 900cgcccgccat aaactgccag gcatcaaatt aagcagaagg ccatcctgac
ggatggcctt 960tttgcgtttc tacaaactct ttttgtttat ttttctaaat acattcaaat
atgtatccgc 1020tcatgagaca ataaccctga taaatgcttc aataatattg aaaaaggaag
agtatgagta 1080ttcaacattt ccgtgtcgcc cttattccct tttttgcggc attttgcctt
cctgtttttg 1140ctcacccaga aacgctggtg aaagtaaaag atgctgaaga tcagttgggt
gcacgagtgg 1200gttacatcga actggatctc aacagcggta agatccttga gagttttcgc
cccgaagaac 1260gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta
tcccgtgttg 1320acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac
ttggttgagt 1380actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa
ttatgcagtg 1440ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg
atcggaggac 1500cgaaggagct aaccgctttt ttgcacaaca tgggggatca tgtaactcgc
cttgatcgtt 1560gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg
atgcctgtag 1620caatggcaac aacgttgcgc aaactattaa ctggcgaact acttactcta
gcttcccggc 1680aacaattaat agactggatg gaggcggata aagttgcagg accacttctg
cgctcggccc 1740ttccggctgg ctggtttatt gctgataaat ctggagccgg tgagcgtggg
tctcgcggta 1800tcattgcagc actggggcca gatggtaagc cctcccgtat cgtagttatc
tacacgacgg 1860ggagtcaggc aactatggat gaacgaaata gacagatcgc tgagataggt
gcctcactga 1920ttaagcattg gtaactgtca gaccaagttt actcatatat actttagatt
gatttaaaac 1980ttcattttta atttaaaagg atctaggtga agatcctttt tgataatctc
atgaccaaaa 2040tcccttaacg tgagttttcg ttccactgag cgtcagaccc cgtagaaaag
atcaaaggat 2100cttcttgaga tccttttttt ctgcgcgtaa tctgctgctt gcaaacaaaa
aaaccaccgc 2160taccagcggt ggtttgtttg ccggatcaag agctaccaac tctttttccg
aaggtaactg 2220gcttcagcag agcgcagata ccaaatactg tccttctagt gtagccgtag
ttaggccacc 2280acttcaagaa ctctgtagca ccgcctacat acctcgctct gctaatcctg
ttaccagtgg 2340ctgctgccag tggcgataag tcgtgtctta ccgggttgga ctcaagacga
tagttaccgg 2400ataaggcgca gcggtcgggc tgaacggggg gttcgtgcac acagcccagc
ttggagcgaa 2460cgacctacac cgaactgaga tacctacagc gtgagctatg agaaagcgcc
acgcttcccg 2520aagggagaaa ggcggacagg tatccggtaa gcggcagggt cggaacagga
gagcgcacga 2580gggagcttcc agggggaaac gcctggtatc tttatagtcc tgtcgggttt
cgccacctct 2640gacttgagcg tcgatttttg tgatgctcgt caggggggcg gagcctatgg
aaaaacgcca 2700gcaacgcggc ctttttacgg ttcctggcct tttgctggcc ttttgctcac
atgttctttc 2760ctgcgttatc ccctgattct gtggataacc gtattaccgc ctttgagtga
gctgataccg 2820ctcgccgcag ccgaacgacc gagcgcagcg agtcagtgag cgaggaagcg
gaagagcgcc 2880tgatgcggta ttttctcctt acgcatctgt gcggtatttc acaccgcata
tggtgcactc 2940tcagtacaat ctgctctgat gccgcatagt taagccagta tacactccgc
tatcgctacg 3000tgactgggtc atggctgcgc cccgacaccc gccaacaccc gctgacgcgc
cctgacgggc 3060ttgtctgctc ccggcatccg cttacagaca agctgtgacc gtctccggga
gctgcatgtg 3120tcagaggttt tcaccgtcat caccgaaacg cgcgaggcag cagatcaatt
cgcgcgcgaa 3180ggcgaagcgg catgcattta cgttgacacc atcgaatggc gcaaaacctt
tcgcggtatg 3240gcatgatagc gcccggaaga gagtcaattc agggtggtga atgtgaaacc
agtaacgtta 3300tacgatgtcg cagagtatgc cggtgtctct tatcagaccg tttcccgcgt
ggtgaaccag 3360gccagccacg tttctgcgaa aacgcgggaa aaagtggaag cggcgatggc
ggagctgaat 3420tacattccca accgcgtggc acaacaactg gcgggcaaac agtcgttgct
gattggcgtt 3480gccacctcca gtctggccct gcacgcgccg tcgcaaattg tcgcggcgat
taaatctcgc 3540gccgatcaac tgggtgccag cgtggtggtg tcgatggtag aacgaagcgg
cgtcgaagcc 3600tgtaaagcgg cggtgcacaa tcttctcgcg caacgcgtca gtgggctgat
cattaactat 3660ccgctggatg accaggatgc cattgctgtg gaagctgcct gcactaatgt
tccggcgtta 3720tttcttgatg tctctgacca gacacccatc aacagtatta ttttctccca
tgaagacggt 3780acgcgactgg gcgtggagca tctggtcgca ttgggtcacc agcaaatcgc
gctgttagcg 3840ggcccattaa gttctgtctc ggcgcgtctg cgtctggctg gctggcataa
atatctcact 3900cgcaatcaaa ttcagccgat agcggaacgg gaaggcgact ggagtgccat
gtccggtttt 3960caacaaacca tgcaaatgct gaatgagggc atcgttccca ctgcgatgct
ggttgccaac 4020gatcagatgg cgctgggcgc aatgcgcgcc attaccgagt ccgggctgcg
cgttggtgcg 4080gatatctcgg tagtgggata cgacgatacc gaagacagct catgttatat
cccgccgtca 4140accaccatca aacaggattt tcgcctgctg gggcaaacca gcgtggaccg
cttgctgcaa 4200ctctctcagg gccaggcggt gaagggcaat cagctgttgc ccgtctcact
ggtgaaaaga 4260aaaaccaccc tggcgcccaa tacgcaaacc gcctctcccc gcgcgttggc
cgattcatta 4320atgcagctgg cacgacaggt ttcccgactg gaaagcgggc agtgagcgca
acgcaattaa 4380tgtgagttag cgcgaattga g
44014225DNAArtificial SequenceDescription of Artificial
Sequence Synthetic primer 42gggatatcat gagctatacc gttgg
254324DNAArtificial SequenceDescription of
Artificial Sequence Synthetic primer 43gaaatagttc tctagagaag cttc
24446123DNAArtificial
SequenceDescription of Artificial Sequence Synthetic plasmid
pTrcpdc-his 44gtttgacagc ttatcatcga ctgcacggtg caccaatgct tctggcgtca
ggcagccatc 60ggaagctgtg gtatggctgt gcaggtcgta aatcactgca taattcgtgt
cgctcaaggc 120gcactcccgt tctggataat gttttttgcg ccgacatcat aacggttctg
gcaaatattc 180tgaaatgagc tgttgacaat taatcatccg gctcgtataa tgtgtggaat
tgtgagcgga 240taacaatttc acacaggaaa cagcgccgct gagaaaaagc gaagcggcac
tgctctttaa 300caatttatca gacaatctgt gtgggcactc gaccggaatt atcgattaac
tttattatta 360aaaattaaag aggtatatat taatgtatcg attaaataag gaggaataaa
ccatgagcta 420taccgttggc acctatctgg cagaacgcct ggtgcagatc ggcctgaaac
accacttcgc 480ggtggcgggt gactacaacc tggtgctgct ggacaacttg ctgttgaaca
agaatatgga 540acaggtgtac tgctgtaatg agctgaattg tggcttcagc gcggagggtt
acgcccgggc 600caaaggtgca gccgcagcgg ttgtgaccta tagcgttggc gcgctgagcg
catttgatgc 660gattggtggt gcgtacgctg agaatttgcc ggtcatcctg atcagcggcg
caccgaacaa 720caacgaccat gctgcgggcc acgtgctgca tcacgcgctg ggtaagaccg
actatcatta 780tcagttggag atggcgaaga atatcacggc cgctgcggaa gctatctata
cgccggaaga 840ggctcctgca aaaatcgatc acgttatcaa aaccgcactg cgtgagaaga
agccagtgta 900cctggagatt gcctgcaaca ttgcgtctat gccgtgcgca gcaccgggtc
ctgcgagcgc 960gttgtttaac gatgaggcca gcgacgaagc gagcctgaac gcggctgttg
aagaaacgct 1020gaaattcatt gcaaatcgtg ataaggttgc tgtcctggtt ggcagcaagc
tgcgtgccgc 1080aggcgcggaa gaagcagcgg tgaagttcgc agatgccctg ggtggcgcgg
tcgcaaccat 1140ggctgccgca aaatctttct ttccggaaga gaatccacac tatatcggca
ctagctgggg 1200tgaggtgtct tatccgggcg tcgagaaaac catgaaagaa gcggatgcgg
tgatcgccct 1260ggctccggtt ttcaacgatt acagcaccac tggctggacg gatattccgg
acccgaaaaa 1320gctggtgctg gcggagccgc gtagcgtggt ggtcaatggt atccgttttc
cgtccgtcca 1380tctgaaggac tacctgaccc gcctggcgca gaaggttagc aagaaaacgg
gtgcgctgga 1440tttcttcaaa tcgctgaatg caggtgaact gaaaaaggca gccccagcag
atccgtccgc 1500accgctggtg aatgcggaga tcgcgcgtca agtcgaagct ctgctgacgc
cgaacacgac 1560cgttatcgcg gagacgggtg acagctggtt taacgcgcaa cgtatgaagt
tgccgaatgg 1620tgctcgagtt gagtacgaga tgcaatgggg ccacattggc tggtccgtgc
ctgcagcctt 1680tggttacgcc gttggtgccc cggagcgtcg caacattctg atggtgggtg
acggtagctt 1740tcaactgacc gcccaagagg tcgctcagat ggtccgcctg aaactgcctg
tcatcatttt 1800cctgatcaat aactacggtt acacgatcga ggttatgatc catgatggcc
cgtacaacaa 1860cattaagaat tgggactacg cgggtttgat ggaggttttc aatggcaacg
gtggttatga 1920ctccggtgcg ggtaaaggtc tgaaagcaaa gacgggtggc gagctcgcag
aggcgattaa 1980ggttgcgctg gcgaataccg acggcccgac cttgattgag tgtttcattg
gtcgcgagga 2040ttgcaccgaa gaactggtca aatggggcaa gcgcgtcgca gccgcgaatt
cccgtaaacc 2100ggttaataag cttctcggcc atcaccatca ccatcactag aagcttctct
aggtatggct 2160agcatgactg gtggacagca aatgggtcgg gatctgtacg acgatgacga
taaggatcga 2220tggggatccg agctcgagat ctgcagctgg taccatatgg gaattcgaag
cttggctgtt 2280ttggcggatg agagaagatt ttcagcctga tacagattaa atcagaacgc
agaagcggtc 2340tgataaaaca gaatttgcct ggcggcagta gcgcggtggt cccacctgac
cccatgccga 2400actcagaagt gaaacgccgt agcgccgatg gtagtgtggg gtctccccat
gcgagagtag 2460ggaactgcca ggcatcaaat aaaacgaaag gctcagtcga aagactgggc
ctttcgtttt 2520atctgttgtt tgtcggtgaa cgctctcctg agtaggacaa atccgccggg
agcggatttg 2580aacgttgcga agcaacggcc cggagggtgg cgggcaggac gcccgccata
aactgccagg 2640catcaaatta agcagaaggc catcctgacg gatggccttt ttgcgtttct
acaaactctt 2700tttgtttatt tttctaaata cattcaaata tgtatccgct catgagacaa
taaccctgat 2760aaatgcttca ataatattga aaaaggaaga gtatgagtat tcaacatttc
cgtgtcgccc 2820ttattccctt ttttgcggca ttttgccttc ctgtttttgc tcacccagaa
acgctggtga 2880aagtaaaaga tgctgaagat cagttgggtg cacgagtggg ttacatcgaa
ctggatctca 2940acagcggtaa gatccttgag agttttcgcc ccgaagaacg ttttccaatg
atgagcactt 3000ttaaagttct gctatgtggc gcggtattat cccgtgttga cgccgggcaa
gagcaactcg 3060gtcgccgcat acactattct cagaatgact tggttgagta ctcaccagtc
acagaaaagc 3120atcttacgga tggcatgaca gtaagagaat tatgcagtgc tgccataacc
atgagtgata 3180acactgcggc caacttactt ctgacaacga tcggaggacc gaaggagcta
accgcttttt 3240tgcacaacat gggggatcat gtaactcgcc ttgatcgttg ggaaccggag
ctgaatgaag 3300ccataccaaa cgacgagcgt gacaccacga tgcctgtagc aatggcaaca
acgttgcgca 3360aactattaac tggcgaacta cttactctag cttcccggca acaattaata
gactggatgg 3420aggcggataa agttgcagga ccacttctgc gctcggccct tccggctggc
tggtttattg 3480ctgataaatc tggagccggt gagcgtgggt ctcgcggtat cattgcagca
ctggggccag 3540atggtaagcc ctcccgtatc gtagttatct acacgacggg gagtcaggca
actatggatg 3600aacgaaatag acagatcgct gagataggtg cctcactgat taagcattgg
taactgtcag 3660accaagttta ctcatatata ctttagattg atttaaaact tcatttttaa
tttaaaagga 3720tctaggtgaa gatccttttt gataatctca tgaccaaaat cccttaacgt
gagttttcgt 3780tccactgagc gtcagacccc gtagaaaaga tcaaaggatc ttcttgagat
cctttttttc 3840tgcgcgtaat ctgctgcttg caaacaaaaa aaccaccgct accagcggtg
gtttgtttgc 3900cggatcaaga gctaccaact ctttttccga aggtaactgg cttcagcaga
gcgcagatac 3960caaatactgt ccttctagtg tagccgtagt taggccacca cttcaagaac
tctgtagcac 4020cgcctacata cctcgctctg ctaatcctgt taccagtggc tgctgccagt
ggcgataagt 4080cgtgtcttac cgggttggac tcaagacgat agttaccgga taaggcgcag
cggtcgggct 4140gaacgggggg ttcgtgcaca cagcccagct tggagcgaac gacctacacc
gaactgagat 4200acctacagcg tgagctatga gaaagcgcca cgcttcccga agggagaaag
gcggacaggt 4260atccggtaag cggcagggtc ggaacaggag agcgcacgag ggagcttcca
gggggaaacg 4320cctggtatct ttatagtcct gtcgggtttc gccacctctg acttgagcgt
cgatttttgt 4380gatgctcgtc aggggggcgg agcctatgga aaaacgccag caacgcggcc
tttttacggt 4440tcctggcctt ttgctggcct tttgctcaca tgttctttcc tgcgttatcc
cctgattctg 4500tggataaccg tattaccgcc tttgagtgag ctgataccgc tcgccgcagc
cgaacgaccg 4560agcgcagcga gtcagtgagc gaggaagcgg aagagcgcct gatgcggtat
tttctcctta 4620cgcatctgtg cggtatttca caccgcatat ggtgcactct cagtacaatc
tgctctgatg 4680ccgcatagtt aagccagtat acactccgct atcgctacgt gactgggtca
tggctgcgcc 4740ccgacacccg ccaacacccg ctgacgcgcc ctgacgggct tgtctgctcc
cggcatccgc 4800ttacagacaa gctgtgaccg tctccgggag ctgcatgtgt cagaggtttt
caccgtcatc 4860accgaaacgc gcgaggcagc agatcaattc gcgcgcgaag gcgaagcggc
atgcatttac 4920gttgacacca tcgaatggcg caaaaccttt cgcggtatgg catgatagcg
cccggaagag 4980agtcaattca gggtggtgaa tgtgaaacca gtaacgttat acgatgtcgc
agagtatgcc 5040ggtgtctctt atcagaccgt ttcccgcgtg gtgaaccagg ccagccacgt
ttctgcgaaa 5100acgcgggaaa aagtggaagc ggcgatggcg gagctgaatt acattcccaa
ccgcgtggca 5160caacaactgg cgggcaaaca gtcgttgctg attggcgttg ccacctccag
tctggccctg 5220cacgcgccgt cgcaaattgt cgcggcgatt aaatctcgcg ccgatcaact
gggtgccagc 5280gtggtggtgt cgatggtaga acgaagcggc gtcgaagcct gtaaagcggc
ggtgcacaat 5340cttctcgcgc aacgcgtcag tgggctgatc attaactatc cgctggatga
ccaggatgcc 5400attgctgtgg aagctgcctg cactaatgtt ccggcgttat ttcttgatgt
ctctgaccag 5460acacccatca acagtattat tttctcccat gaagacggta cgcgactggg
cgtggagcat 5520ctggtcgcat tgggtcacca gcaaatcgcg ctgttagcgg gcccattaag
ttctgtctcg 5580gcgcgtctgc gtctggctgg ctggcataaa tatctcactc gcaatcaaat
tcagccgata 5640gcggaacggg aaggcgactg gagtgccatg tccggttttc aacaaaccat
gcaaatgctg 5700aatgagggca tcgttcccac tgcgatgctg gttgccaacg atcagatggc
gctgggcgca 5760atgcgcgcca ttaccgagtc cgggctgcgc gttggtgcgg atatctcggt
agtgggatac 5820gacgataccg aagacagctc atgttatatc ccgccgtcaa ccaccatcaa
acaggatttt 5880cgcctgctgg ggcaaaccag cgtggaccgc ttgctgcaac tctctcaggg
ccaggcggtg 5940aagggcaatc agctgttgcc cgtctcactg gtgaaaagaa aaaccaccct
ggcgcccaat 6000acgcaaaccg cctctccccg cgcgttggcc gattcattaa tgcagctggc
acgacaggtt 6060tcccgactgg aaagcgggca gtgagcgcaa cgcaattaat gtgagttagc
gcgaattgat 6120ctg
61234526DNAArtificial SequenceDescription of Artificial
Sequence Synthetic primer 45ggagaattac catggcggat accctg
264627DNAArtificial SequenceDescription of
Artificial Sequence Synthetic primer 46gggaatctag actaatgatg atggtgg
27475787DNAArtificial
SequenceDescription of Artificial Sequence Synthetic plasmid
pTrcoad-his 47gtttgacagc ttatcatcga ctgcacggtg caccaatgct tctggcgtca
ggcagccatc 60ggaagctgtg gtatggctgt gcaggtcgta aatcactgca taattcgtgt
cgctcaaggc 120gcactcccgt tctggataat gttttttgcg ccgacatcat aacggttctg
gcaaatattc 180tgaaatgagc tgttgacaat taatcatccg gctcgtataa tgtgtggaat
tgtgagcgga 240taacaatttc acacaggaaa cagaccatgg cggataccct gaccttcaac
accaaacatt 300tgttggaagc gctgtttgaa agcggtattc gtcacttcat tgtttctccg
ggttctcgta 360gcaccccgat tgcactgctg ctggcagaat acgcagagca gaataacgag
atcaagctgt 420tcgttgacgt ggatgagcgc agcgcgggtt tcttcgcact gggcatcgcg
aaaacgttgc 480tggagccggt tgttttgctg ggtacctcgg gtaccgccat cgctgaatac
atgcctgcgg 540tggcggaagc ctatgcggcg aatatcccgc tggtcgtgtt gagcaccgat
cgtccacagg 600aattgcagtt caacggcgca ccgcaaacta ttccgcaaag caacctgttc
ggtcaactga 660cgaaacaggc agtcctgatt cgcctgcaag acatgcacag cgatgtgacc
gagtacatcg 720actttatcgt tcagaaggtc gttaatctgt ccattaccgc gcctcgtggc
ccgattcaga 780tcaacctgcc gctgcgcaaa ccgctgatgc cggtcctgga tcgtcaagac
gaggtgcacg 840ttcagcgtgt cgtgtttgac aaacaaagcg tgcaatatcg cttgccgccg
attacggcga 900aacgcctgct gatcctggca ggtccgaatg tcctgaatag ctacgatgat
agcctgaaga 960agtttgcgat caaaaacaat gttccagtta tcgcggatgt tctgtcccaa
agccgccaca 1020cgtataccat tcacggtatt gacgttctgc tgcaagccca caagatcaac
gccgacctga 1080aaccggatct ggtggttcgt tttggcaaga cgccggttag cgcgcgtgtc
ctgcaatggc 1140tgaaagaaga gaacattctg acctggcacg tggacgagga cgcaggcgtg
gatcacaccc 1200gtcatatcgt ccgtgccatc aagatggcgc ctcatgattt tctggaatcc
atgcatctga 1260cgttgagcaa aaaccagatt gacttcaacc agaagtggct gtctctgccg
aaagtgatca 1320aaactcgtaa cgagatgaat atcatcaccg cattggacga cgcggtcccg
gatgacactc 1380acatcttcgt ggcgaacagc atgccgattc gtgacatgga caactttttc
acgggtaatc 1440acacccagcg catttacgct aatcgtggtg caaatggtat tgatggcgtg
attagctccg 1500ccctgggcat gagcgcggtc gtcaagcaac gttcggtcct gctgacgggt
gacctgacgc 1560tgtttcatga catgaacggt ctgatgatgg cgaagaatta ccagctgccg
ctggacatca 1620ttgtgattaa caacaatggc ggtggcattt tcagcttctt gccacaagcc
ggtgctccga 1680agtactttga gcagctgttc ggtaccccgc tgaacattga catcaaaaag
attgctgacc 1740tgtattacat cgattaccat cagttgaatg tgccggaggc cctgagccag
atcctgcaga 1800cgccgagcaa aaccacccgc ctgatcgagt ataagtccga tcaccagcgt
aaccgtgacg 1860atcatcgcga agtgctggag atgctgaaac accaccatca tcattagtct
agagtcgacc 1920tgcaggcatg caagcttggc tgttttggcg gatgagagaa gattttcagc
ctgatacaga 1980ttaaatcaga acgcagaagc ggtctgataa aacagaattt gcctggcggc
agtagcgcgg 2040tggtcccacc tgaccccatg ccgaactcag aagtgaaacg ccgtagcgcc
gatggtagtg 2100tggggtctcc ccatgcgaga gtagggaact gccaggcatc aaataaaacg
aaaggctcag 2160tcgaaagact gggcctttcg ttttatctgt tgtttgtcgg tgaacgctct
cctgagtagg 2220acaaatccgc cgggagcgga tttgaacgtt gcgaagcaac ggcccggagg
gtggcgggca 2280ggacgcccgc cataaactgc caggcatcaa attaagcaga aggccatcct
gacggatggc 2340ctttttgcgt ttctacaaac tctttttgtt tatttttcta aatacattca
aatatgtatc 2400cgctcatgag acaataaccc tgataaatgc ttcaataata ttgaaaaagg
aagagtatga 2460gtattcaaca tttccgtgtc gcccttattc ccttttttgc ggcattttgc
cttcctgttt 2520ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga agatcagttg
ggtgcacgag 2580tgggttacat cgaactggat ctcaacagcg gtaagatcct tgagagtttt
cgccccgaag 2640aacgttttcc aatgatgagc acttttaaag ttctgctatg tggcgcggta
ttatcccgtg 2700ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta ttctcagaat
gacttggttg 2760agtactcacc agtcacagaa aagcatctta cggatggcat gacagtaaga
gaattatgca 2820gtgctgccat aaccatgagt gataacactg cggccaactt acttctgaca
acgatcggag 2880gaccgaagga gctaaccgct tttttgcaca acatggggga tcatgtaact
cgccttgatc 2940gttgggaacc ggagctgaat gaagccatac caaacgacga gcgtgacacc
acgatgccta 3000cagcaatggc aacaacgttg cgcaaactat taactggcga actacttact
ctagcttccc 3060ggcaacaatt aatagactgg atggaggcgg ataaagttgc aggaccactt
ctgcgctcgg 3120cccttccggc tggctggttt attgctgata aatctggagc cggtgagcgt
gggtctcgcg 3180gtatcattgc agcactgggg ccagatggta agccctcccg tatcgtagtt
atctacacga 3240cggggagtca ggcaactatg gatgaacgaa atagacagat cgctgagata
ggtgcctcac 3300tgattaagca ttggtaactg tcagaccaag tttactcata tatactttag
attgatttaa 3360aacttcattt ttaatttaaa aggatctagg tgaagatcct ttttgataat
ctcatgacca 3420aaatccctta acgtgagttt tcgttccact gagcgtcaga ccccgtagaa
aagatcaaag 3480gatcttcttg agatcctttt tttctgcgcg taatctgctg cttgcaaaca
aaaaaaccac 3540cgctaccagc ggtggtttgt ttgccggatc aagagctacc aactcttttt
ccgaaggtaa 3600ctggcttcag cagagcgcag ataccaaata ctgtccttct agtgtagccg
tagttaggcc 3660accacttcaa gaactctgta gcaccgccta catacctcgc tctgctaatc
ctgttaccag 3720tggctgctgc cagtggcgat aagtcgtgtc ttaccgggtt ggactcaaga
cgatagttac 3780cggataaggc gcagcggtcg ggctgaacgg ggggttcgtg cacacagccc
agcttggagc 3840gaacgaccta caccgaactg agatacctac agcgtgagct atgagaaagc
gccacgcttc 3900ccgaagggag aaaggcggac aggtatccgg taagcggcag ggtcggaaca
ggagagcgca 3960cgagggagct tccaggggga aacgcctggt atctttatag tcctgtcggg
tttcgccacc 4020tctgacttga gcgtcgattt ttgtgatgct cgtcaggggg gcggagccta
tggaaaaacg 4080ccagcaacgc ggccttttta cggttcctgg ccttttgctg gccttttgct
cacatgttct 4140ttcctgcgtt atcccctgat tctgtggata accgtattac cgcctttgag
tgagctgata 4200ccgctcgccg cagccgaacg accgagcgca gcgagtcagt gagcgaggaa
gcggaagagc 4260gcctgatgcg gtattttctc cttacgcatc tgtgcggtat ttcacaccgc
atatggtgca 4320ctctcagtac aatctgctct gatgccgcat agttaagcca gtatacactc
cgctatcgct 4380acgtgactgg gtcatggctg cgccccgaca cccgccaaca cccgctgacg
cgccctgacg 4440ggcttgtctg ctcccggcat ccgcttacag acaagctgtg accgtctccg
ggagctgcat 4500gtgtcagagg ttttcaccgt catcaccgaa acgcgcgagg cagcagatca
attcgcgcgc 4560gaaggcgaag cggcatgcat ttacgttgac accatcgaat ggtgcaaaac
ctttcgcggt 4620atggcatgat agcgcccgga agagagtcaa ttcagggtgg tgaatgtgaa
accagtaacg 4680ttatacgatg tcgcagagta tgccggtgtc tcttatcaga ccgtttcccg
cgtggtgaac 4740caggccagcc acgtttctgc gaaaacgcgg gaaaaagtgg aagcggcgat
ggcggagctg 4800aattacattc ccaaccgcgt ggcacaacaa ctggcgggca aacagtcgtt
gctgattggc 4860gttgccacct ccagtctggc cctgcacgcg ccgtcgcaaa ttgtcgcggc
gattaaatct 4920cgcgccgatc aactgggtgc cagcgtggtg gtgtcgatgg tagaacgaag
cggcgtcgaa 4980gcctgtaaag cggcggtgca caatcttctc gcgcaacgcg tcagtgggct
gatcattaac 5040tatccgctgg atgaccagga tgccattgct gtggaagctg cctgcactaa
tgttccggcg 5100ttatttcttg atgtctctga ccagacaccc atcaacagta ttattttctc
ccatgaagac 5160ggtacgcgac tgggcgtgga gcatctggtc gcattgggtc accagcaaat
cgcgctgtta 5220gcgggcccat taagttctgt ctcggcgcgt ctgcgtctgg ctggctggca
taaatatctc 5280actcgcaatc aaattcagcc gatagcggaa cgggaaggcg actggagtgc
catgtccggt 5340tttcaacaaa ccatgcaaat gctgaatgag ggcatcgttc ccactgcgat
gctggttgcc 5400aacgatcaga tggcgctggg cgcaatgcgc gccattaccg agtccgggct
gcgcgttggt 5460gcggatatct cggtagtggg atacgacgat accgaagaca gctcatgtta
tatcccgccg 5520tcaaccacca tcaaacagga ttttcgcctg ctggggcaaa ccagcgtgga
ccgcttgctg 5580caactctctc agggccaggc ggtgaagggc aatcagctgt tgcccgtctc
actggtgaaa 5640agaaaaacca ccctggcgcc caatacgcaa accgcctctc cccgcgcgtt
ggccgattca 5700ttaatgcagc tggcacgaca ggtttcccga ctggaaagcg ggcagtgagc
gcaacgcaat 5760taatgtgagt tagcgcgaat tgatctg
57874826DNAArtificial SequenceDescription of Artificial
Sequence Synthetic primer 48aggttcccat ggtgactcag gacccg
264940DNAArtificial SequenceDescription of
Artificial Sequence Synthetic primer 49gtaagcttag tggtgatggt
gatgaccgaa cgcttcgtcc 40504614DNAArtificial
SequenceDescription of Artificial Sequence Synthetic plasmid
pKK223cterm-His 50ttgacaatta atcatcggct cgtataatgt gtggaattgt gagcggataa
caatttcaca 60caggaaacag aaccatggga cataacggat ctagatctca ccatcaccac
cattagtcga 120cctgcagcca agcttggctg ttttggcgga tgagagaaga ttttcagcct
gatacagatt 180aaatcagaac gcagaagcgg tctgataaaa cagaatttgc ctggcggcag
tagcgcggtg 240gtcccacctg accccatgcc gaactcagaa gtgaaacgcc gtagcgccga
tggtagtgtg 300gggtctcccc atgcgagagt agggaactgc caggcatcaa ataaaacgaa
aggctcagtc 360gaaagactgg gcctttcgtt ttatctgttg tttgtcggtg aacgctctcc
tgagtaggac 420aaatccgccg ggagcggatt tgaacgttgc gaagcaacgg cccggagggt
ggcgggcagg 480acgcccgcca taaactgcca ggcatcaaat taagcagaag gccatcctga
cggatggcct 540ttttgcgttt ctacaaactc ttttgtttat ttttctaaat acattcaaat
atgtatccgc 600tcatgagaca ataaccctga taaatgcttc aataatattg aaaaaggaag
agtatgagta 660ttcaacattt ccgtgtcgcc cttattccct tttttgcggc attttgcctt
cctgtttttg 720ctcacccaga aacgctggtg aaagtaaaag atgctgaaga tcagttgggt
gcacgagtgg 780gttacatcga actggatctc aacagcggta agatccttga gagttttcgc
cccgaagaac 840gttttccaat gatgagcact tttaaagttc tgctatgtgg cgcggtatta
tcccgtgttg 900acgccgggca agagcaactc ggtcgccgca tacactattc tcagaatgac
ttggttgagt 960actcaccagt cacagaaaag catcttacgg atggcatgac agtaagagaa
ttatgcagtg 1020ctgccataac catgagtgat aacactgcgg ccaacttact tctgacaacg
atcggaggac 1080cgaaggagct aaccgctttt ttgcacaaca tgggggatca tgtaactcgc
cttgatcgtt 1140gggaaccgga gctgaatgaa gccataccaa acgacgagcg tgacaccacg
atgctgtagc 1200aatggcaaca acgttgcgca aactattaac tggcgaacta cttactctag
cttcccggca 1260acaattaata gactggatgg aggcggataa agttgcagga ccacttctgc
gctcggccct 1320tccggctggc tggtttattg ctgataaatc tggagccggt gagcgtgggt
ctcgcggtat 1380cattgcagca ctggggccag atggtaagcc ctcccgtatc gtagttatct
acacgacggg 1440gagtcaggca actatggatg aacgaaatag acagatcgct gagataggtg
cctcactgat 1500taagcattgg taactgtcag accaagttta ctcatatata ctttagattg
atttaaaact 1560tcatttttaa tttaaaagga tctaggtgaa gatccttttt gataatctca
tgaccaaaat 1620cccttaacgt gagttttcgt tccactgagc gtcagacccc gtagaaaaga
tcaaaggatc 1680ttcttgagat cctttttttc tgcgcgtaat ctgctgcttg caaacaaaaa
aaccaccgct 1740accagcggtg gtttgtttgc cggatcaaga gctaccaact ctttttccga
aggtaactgg 1800cttcagcaga gcgcagatac caaatactgt ccttctagtg tagccgtagt
taggccacca 1860cttcaagaac tctgtagcac cgcctacata cctcgctctg ctaatcctgt
taccagtggc 1920tgctgccagt ggcgataagt cgtgtcttac cgggttggac tcaagacgat
agttaccgga 1980taaggcgcag cggtcgggct gaacgggggg ttcgtgcaca cagcccagct
tggagcgaac 2040gacctacacc gaactgagat acctacagcg tgagcattga gaaagcgcca
cgcttcccga 2100agggagaaag gcggacaggt atccggtaag cggcagggtc ggaacaggag
agcgcacgag 2160ggagcttcca gggggaaacg cctggtatct ttatagtcct gtcgggtttc
gccacctctg 2220acttgagcgt cgatttttgt gatgctcgtc aggggggcgg agcctatgga
aaaacgccag 2280caacgcggcc tttttacggt tcctggcctt ttgctggcct tttgctcaca
tgttctttcc 2340tgcgttatcc cctgattctg tggataaccg tattaccgcc tttgagtgag
ctgataccgc 2400tcgccgcagc cgaacgaccg agcgcagcga gtcagtgagc gaggaagcgg
aagagcgcct 2460gatgcggtat tttctcctta cgcatctgtg cggtatttca caccgcatat
ggtgcactct 2520cagtacaatc tgctctgatg ccgcatagtt aagccagtat acactccgct
atcgctacgt 2580gactgggtca tggctgcgcc ccgacacccg ccaacacccg ctgacgcgcc
ctgacgggct 2640tgtctgctcc cggcatccgc ttacagacaa gctgtgaccg tctccgggag
ctgcatgtgt 2700cagaggtttt caccgtcatc accgaaacgc gcgaggcagc tgcggtaaag
ctcatcagcg 2760tggtcgtgaa gcgattcaca gatgtctgcc tgttcatccg cgtccagctc
gttgagtttc 2820tccagaagcg ttaatgtctg gcttctgata aagcgggcca tgttaagggc
ggttttttcc 2880tgtttggtca ctgatgcctc cgtgtaaggg ggatttctgt tcatgggggt
aatgataccg 2940atgaaacgag agaggatgct cacgatacgg gttactgatg atgaacatgc
ccggttactg 3000gaacgttgtg agggtaaaca actggcggta tggatgcggc gggaccagag
aaaaatcact 3060cagggtcaat gccagcgctt cgttaataca gatgtaggtg ttccacaggg
tagccagcag 3120catcctgcga tgcagatccg gaacataatg gtgcagggcg ctgacttccg
cgtttccaga 3180ctttacgaaa cacggaaacc gaagaccatt catgttgttg ctcaggtcgc
agacgttttg 3240cagcagcagt cgcttcacgt tcgctcgcgt atcggtgatt cattctgcta
accagtaagg 3300caaccccgcc agcctagccg ggtcctcaac gacaggagca cgatcatgcg
cacccgtggc 3360caggacccaa cgctgcccga gatgcgccgc gtgcggctgc tggagatggc
ggacgcgatg 3420gatatgttct gccaagggtt ggtttgcgca ttcacagttc tccgcaagaa
ttgattggct 3480ccaattcttg gagtggtgaa tccgttagcg aggtgccgcc ggcttccatt
caggtcgagg 3540tggcccggct ccatgcaccg cgacgcaacg cggggaggca gacaaggtat
agggcggcgc 3600ctacaatcca tgccaacccg ttccatgtgc tcgccgaggc ggcataaatc
gccgtgacga 3660tcagcggtcc agtgatcgaa gttaggctgg taagagccgc gagcgatcct
tgaagctgtc 3720cctgatggtc gtcatctacc tgcctggaca gcatggcctg caacgcgggc
atcccgatgc 3780cgccggaagc gagaagaatc ataatgggga aggccatcca gcctcgcgtc
gcgaacgcca 3840gcaagacgta gcccagcgcg tcggccgcca tgccggcgat aatggcctgc
ttctcgccga 3900aacgtttggt ggcgggacca gtgacgaagg cttgagcgag ggcgtgcaag
attccgaata 3960ccgcaagcga caggccgatc atcgtcgcgc tccagcgaaa gcggtcctcg
ccgaaaatga 4020cccagagcgc tgccggcacc tgtcctacga gttgcatgat aaagaagaca
gtcataagtg 4080cggcgacgat agtcatgccc cgcgcccacc ggaaggagct gactgggttg
aaggctctca 4140agggcatcgg tcgacgctct cccttatgcg actcctgcat taggaagcag
cccagtagta 4200ggttgaggcc gttgagcacc gccgccgcaa ggaatggtgc atgcaaggag
atggcgccca 4260acagtccccc ggccacgggg cctgccacca tacccacgcc gaaacaagcg
ctcatgagcc 4320cgaagtggcg agcccgatct tccccatcgg tgatgtcggc gatataggcg
ccagcaaccg 4380cacctgtggc gccggtgatg ccggccacga tgcgtccggc gtagaggatc
cgggcttatc 4440gactgcacgg tgcaccaatg cttctggcgt caggcagcca tcggaagctg
tggtatggct 4500gtgcaggtcg taaatcactg cataattcgt gtcgctcaag gcgcactccc
gttctggata 4560atgttttttg cgccgacatc ataacggttc tggcaaatat tctgaaatga
gctg 46145126DNAArtificial SequenceDescription of Artificial
Sequence Synthetic primer 51aggttcccat ggtgactcag gacccg
265240DNAArtificial SequenceDescription of
Artificial Sequence Synthetic primer 52gtaagcttag tggtgatggt
gatgaccgaa cgcttcgtcc 40538241DNAArtificial
SequenceDescription of Artificial Sequence Synthetic plasmid
pKK223kgd-his 53atggatcgta aatttcgtga tgatccgtct tccgttgacc caagctggca
tgagtttctg 60gtggactata gcccggagcc gacttcccaa ccggctgctg agccgactcg
tgtgaccagc 120cctttggtcg ctgagcgtgc ggcagcggcg gctccgcaag cgccgccgaa
accggccgac 180acggccgcag cgggcaacgg tgtcgtggcg gctctggccg ccaagaccgc
ggtgcctccg 240ccagccgaag gcgatgaagt tgctgtcctg cgtggtgcgg cggctgctgt
cgtgaagaac 300atgagcgcgt cgctggaagt gcctacggcg acctcggtgc gtgctgtgcc
agcgaaattg 360ttgatcgata atcgtatcgt gatcaataat cagttgaaac gcacgcgtgg
tggtaaaatc 420agctttaccc atttgctggg ttatgctctg gttcaggcgg ttaagaagtt
cccgaatatg 480aatcgccact acacggaggt ggacggtaag ccgacggcgg ttactcctgc
gcacaccaat 540ctgggtctgg ctattgacct gcaaggtaaa gatggcaagc gctctctggt
ggtggccggt 600atcaaacgtt gcgagacgat gcgcttcgcg cagttcgtta ccgcgtacga
ggacatcgtc 660cgtcgtgctc gtgatggtaa actgaccacc gaggacttcg ctggcgttac
gatttcgctg 720acgaatccgg gcacgattgg cacggtgcac tctgtgccac gcctgatgcc
aggtcaaggt 780gctatcatcg gtgtcggtgc gatggaatat ccggcggagt tccaaggcgc
gtccgaggaa 840cgcatcgctg agctgggtat tggcaaactg attaccctga cttctaccta
cgaccaccgt 900atcattcaag gtgcggaaag cggtgacttc ctgcgtacca ttcatgaact
gctgttgtct 960gatggcttct gggatgaggt cttccgcgag ctgagcatcc cgtacctgcc
agtgcgttgg 1020agcaccgata atccggactc gatcgttgac aagaacgcac gtgtgatgaa
cctgatcgcg 1080gcctatcgca accgcggcca tctgatggcg gacaccgacc cgttgcgctt
ggataaggcg 1140cgctttcgtt cgcatccgga tttggaggtg ttgacgcatg gcctgactct
gtgggacctg 1200gaccgcgtct ttaaggttga cggtttcgcg ggtgctcaat acaaaaagct
gcgtgacgtg 1260ctgggtttgc tgcgcgacgc gtactgccgt cacatcggtg ttgagtacgc
gcacattctg 1320gacccggaac aaaaggagtg gctggagcag cgtgtcgaga cgaagcacgt
caaaccgacc 1380gttgctcaac aaaagtacat cttgtcgaaa ctgaacgcag cggangcgtt
tgaaaccttc 1440ctgcaaacca agtatgtggg ccaaaagcgt tttagcctgg agggtgcgga
atctgttatt 1500ccaatgatgg acgcggcgat tgaccaatgt gccgagcatg gtttggatga
agtggtcatt 1560ggcatgccgc accgtggtcg cttgaatgtc ctggcgaaca tcgtgggcaa
accgtattcg 1620caaatcttta cggagtttga aggcaatctg aacccgtccc aggctcacgg
ttccggcgat 1680gtcaagtacc acctgggtgc gacgggtctg tatctgcaaa tgtttggcga
taacgatatt 1740caggtttccc tgacggcaaa tccgtcccat ttggaagcgg tcgatccagt
gctggaaggt 1800ttggtgcgtg cgaagcaaga cctgctggac cacggttcca ttgactccga
cggccaacgt 1860gcattctccg tcgttccgtt gatgctgcac ggtgatgcgg ccttcgcggg
tcaaggcgtt 1920gttgcggaga cgctgaatct ggcaaatctg cctggctatc gtgtgggtgg
tacgattcac 1980atcattgtga acaaccaaat cggcttcacc accgctccgg aatactctcg
ttcgagcgag 2040tactgcacgg atgtggcgaa gatgatcggt gcgccaatct tccacgtgaa
cggcgacgat 2100ccagaagcgt gtgtctgggt tgcgcgtttg gcagttgact tccgccaacg
ctttaaaaag 2160gacgttgtta ttgacatgct gtgctatcgt cgtcgtggtc ataatgaggg
tgacgatcca 2220tctatgacca atccgtatgt ctatgacgtc gtggacacca agcgtggcgc
ccgtaaatcc 2280tacaccgagg ctctgatcgg tcgtggcgac atcagcatga aagaggcgga
ggatgcgctg 2340cgcgattacc agggccaact ggagcgtgtg ttcaacgaag tgcgcgaact
ggagaagcac 2400ggcgtccaac cgtcggagag cgtcgagtcg gaccaaatga tcccagcggg
cctggcgacc 2460gctgttgaca aatccctgct ggcacgtatc ggtgatgcct tcctggcgct
gcctaacggt 2520ttcacggcac acccgcgcgt tcagccggtg ctggaaaagc gtcgtgagat
ggcgtatgag 2580ggtaagattg actgggcgtt tggtgagttg ctggcgctgg gcagcctggt
tgctgagggc 2640aagctggtcc gtctgagcgg tcaagactct cgtcgtggca cctttagcca
gcgtcattct 2700gtgctgatcg accgtcacac cggcgaggag ttcaccccgc tgcagctgct
ggcgactaac 2760agcgacggca gcccgaccgg tggtaagttc ctggtttatg attcgccgtt
gtcggaatac 2820gcggctgtgg gttttgagta cggttatacc gttggcaatc cagacgcggt
tgtgctgtgg 2880gaggcgcaat tcggcgactt tgttaatggt gcccagtcga tcattgacga
gttcatcagc 2940tcgggcgaag cgaaatgggg tcagctgtcc aacgttgttc tgttgctgcc
acacggccac 3000gagggtcagg gcccggacca cacgtcggcg cgcattgagc gctttttgca
gctgtgggct 3060gagggtagca tgacgatcgc gatgccgagc accccgtcca attactttca
cttgctgcgc 3120cgccacgcgt tggacggcat ccaacgtccg ttgatcgtgt ttaccccgaa
atccatgctg 3180cgccacaagg cggcggtttc ggagattaag gattttaccg agattaaatt
ccgtagcgtc 3240ctggaggaac cgacctacga ggatggtatc ggcgaccgca ataaagttag
ccgtattctg 3300ttgacctccg gtaagttgta ttatgaattg gcggcacgca aggcgaaaga
caaccgtaat 3360gatctggcaa ttgtgcgtct ggagcaactg gcgccgttgc cgcgtcgtcg
tctgcgtgaa 3420accctggatc gttacgaaaa tgtcaaagag ttcttttggg tccaggagga
accagcgaac 3480cagggcgcct ggccgcgttt cggtttggag ttgccggagt tgctgccgga
caagttggcg 3540ggcatcaagc gtatctcccg tcgcgcgatg tctgcgccga gcagcggttc
ctcgaaggtt 3600catgccgtgg agcagcaaga aatcttggac gaagcgttcg gatctagatc
tcaccatcac 3660caccattagt cgacctgcag ccaagcttgg ctgttttggc ggatgagaga
agattttcag 3720cctgatacag attaaatcag aacgcagaag cggtctgata aaacagaatt
tgcctggcgg 3780cagtagcgcg gtggtcccac ctgaccccat gccgaactca gaagtgaaac
gccgtagcgc 3840cgatggtagt gtggggtctc cccatgcgag agtagggaac tgccaggcat
caaataaaac 3900gaaaggctca gtcgaaagac tgggcctttc gttttatctg ttgtttgtcg
gtgaacgctc 3960tcctgagtag gacaaatccg ccgggagcgg atttgaacgt tgcgaagcaa
cggcccggag 4020ggtggcgggc aggacgcccg ccataaactg ccaggcatca aattaagcag
aaggccatcc 4080tgacggatgg cctttttgcg tttctacaaa ctcttttgtt tatttttcta
aatacattca 4140aatatgtatc cgctcatgag acaataaccc tgataaatgc ttcaataata
ttgaaaaagg 4200aagagtatga gtattcaaca tttccgtgtc gcccttattc ccttttttgc
ggcattttgc 4260cttcctgttt ttgctcaccc agaaacgctg gtgaaagtaa aagatgctga
agatcagttg 4320ggtgcacgag tgggttacat cgaactggat ctcaacagcg gtaagatcct
tgagagtttt 4380cgccccgaag aacgttttcc aatgatgagc acttttaaag ttctgctatg
tggcgcggta 4440ttatcccgtg ttgacgccgg gcaagagcaa ctcggtcgcc gcatacacta
ttctcagaat 4500gacttggttg agtactcacc agtcacagaa aagcatctta cggatggcat
gacagtaaga 4560gaattatgca gtgctgccat aaccatgagt gataacactg cggccaactt
acttctgaca 4620acgatcggag gaccgaagga gctaaccgct tttttgcaca acatggggga
tcatgtaact 4680cgccttgatc gttgggaacc ggagctgaat gaagccatac caaacgacga
gcgtgacacc 4740acgatgctgt agcaatggca acaacgttgc gcaaactatt aactggcgaa
ctacttactc 4800tagcttcccg gcaacaatta atagactgga tggaggcgga taaagttgca
ggaccacttc 4860tgcgctcggc ccttccggct ggctggttta ttgctgataa atctggagcc
ggtgagcgtg 4920ggtctcgcgg tatcattgca gcactggggc cagatggtaa gccctcccgt
atcgtagtta 4980tctacacgac ggggagtcag gcaactatgg atgaacgaaa tagacagatc
gctgagatag 5040gtgcctcact gattaagcat tggtaactgt cagaccaagt ttactcatat
atactttaga 5100ttgatttaaa acttcatttt taatttaaaa ggatctaggt gaagatcctt
tttgataatc 5160tcatgaccaa aatcccttaa cgtgagtttt cgttccactg agcgtcagac
cccgtagaaa 5220agatcaaagg atcttcttga gatccttttt ttctgcgcgt aatctgctgc
ttgcaaacaa 5280aaaaaccacc gctaccagcg gtggtttgtt tgccggatca agagctacca
actctttttc 5340cgaaggtaac tggcttcagc agagcgcaga taccaaatac tgtccttcta
gtgtagccgt 5400agttaggcca ccacttcaag aactctgtag caccgcctac atacctcgct
ctgctaatcc 5460tgttaccagt ggctgctgcc agtggcgata agtcgtgtct taccgggttg
gactcaagac 5520gatagttacc ggataaggcg cagcggtcgg gctgaacggg gggttcgtgc
acacagccca 5580gcttggagcg aacgacctac accgaactga gatacctaca gcgtgagcat
tgagaaagcg 5640ccacgcttcc cgaagggaga aaggcggaca ggtatccggt aagcggcagg
gtcggaacag 5700gagagcgcac gagggagctt ccagggggaa acgcctggta tctttatagt
cctgtcgggt 5760ttcgccacct ctgacttgag cgtcgatttt tgtgatgctc gtcagggggg
cggagcctat 5820ggaaaaacgc cagcaacgcg gcctttttac ggttcctggc cttttgctgg
ccttttgctc 5880acatgttctt tcctgcgtta tcccctgatt ctgtggataa ccgtattacc
gcctttgagt 5940gagctgatac cgctcgccgc agccgaacga ccgagcgcag cgagtcagtg
agcgaggaag 6000cggaagagcg cctgatgcgg tattttctcc ttacgcatct gtgcggtatt
tcacaccgca 6060tatggtgcac tctcagtaca atctgctctg atgccgcata gttaagccag
tatacactcc 6120gctatcgcta cgtgactggg tcatggctgc gccccgacac ccgccaacac
ccgctgacgc 6180gccctgacgg gcttgtctgc tcccggcatc cgcttacaga caagctgtga
ccgtctccgg 6240gagctgcatg tgtcagaggt tttcaccgtc atcaccgaaa cgcgcgaggc
agctgcggta 6300aagctcatca gcgtggtcgt gaagcgattc acagatgtct gcctgttcat
ccgcgtccag 6360ctcgttgagt ttctccagaa gcgttaatgt ctggcttctg ataaagcggg
ccatgttaag 6420ggcggttttt tcctgtttgg tcactgatgc ctccgtgtaa gggggatttc
tgttcatggg 6480ggtaatgata ccgatgaaac gagagaggat gctcacgata cgggttactg
atgatgaaca 6540tgcccggtta ctggaacgtt gtgagggtaa acaactggcg gtatggatgc
ggcgggacca 6600gagaaaaatc actcagggtc aatgccagcg cttcgttaat acagatgtag
gtgttccaca 6660gggtagccag cagcatcctg cgatgcagat ccggaacata atggtgcagg
gcgctgactt 6720ccgcgtttcc agactttacg aaacacggaa accgaagacc attcatgttg
ttgctcaggt 6780cgcagacgtt ttgcagcagc agtcgcttca cgttcgctcg cgtatcggtg
attcattctg 6840ctaaccagta aggcaacccc gccagcctag ccgggtcctc aacgacagga
gcacgatcat 6900gcgcacccgt ggccaggacc caacgctgcc cgagatgcgc cgcgtgcggc
tgctggagat 6960ggcggacgcg atggatatgt tctgccaagg gttggtttgc gcattcacag
ttctccgcaa 7020gaattgattg gctccaattc ttggagtggt gaatccgtta gcgaggtgcc
gccggcttcc 7080attcaggtcg aggtggcccg gctccatgca ccgcgacgca acgcggggag
gcagacaagg 7140tatagggcgg cgcctacaat ccatgccaac ccgttccatg tgctcgccga
ggcggcataa 7200atcgccgtga cgatcagcgg tccagtgatc gaagttaggc tggtaagagc
cgcgagcgat 7260ccttgaagct gtccctgatg gtcgtcatct acctgcctgg acagcatggc
ctgcaacgcg 7320ggcatcccga tgccgccgga agcgagaaga atcataatgg ggaaggccat
ccagcctcgc 7380gtcgcgaacg ccagcaagac gtagcccagc gcgtcggccg ccatgccggc
gataatggcc 7440tgcttctcgc cgaaacgttt ggtggcggga ccagtgacga aggcttgagc
gagggcgtgc 7500aagattccga ataccgcaag cgacaggccg atcatcgtcg cgctccagcg
aaagcggtcc 7560tcgccgaaaa tgacccagag cgctgccggc acctgtccta cgagttgcat
gataaagaag 7620acagtcataa gtgcggcgac gatagtcatg ccccgcgccc accggaagga
gctgactggg 7680ttgaaggctc tcaagggcat cggtcgacgc tctcccttat gcgactcctg
cattaggaag 7740cagcccagta gtaggttgag gccgttgagc accgccgccg caaggaatgg
tgcatgcaag 7800gagatggcgc ccaacagtcc cccggccacg gggcctgcca ccatacccac
gccgaaacaa 7860gcgctcatga gcccgaagtg gcgagcccga tcttccccat cggtgatgtc
ggcgatatag 7920gcgccagcaa ccgcacctgt ggcgccggtg atgccggcca cgatgcgtcc
ggcgtagagg 7980atccgggctt atcgactgca cggtgcacca atgcttctgg cgtcaggcag
ccatcggaag 8040ctgtggtatg gctgtgcagg tcgtaaatca ctgcataatt cgtgtcgctc
aaggcgcact 8100cccgttctgg ataatgtttt ttgcgccgac atcataacgg ttctggcaaa
tattctgaaa 8160tgagctgttg acaattaatc atcggctcgt ataatgtgtg gaattgtgag
cggataacaa 8220tttcacacag gaaacagaac c
824154541PRTLeuconostoc mesenteroides 54Met Thr Asp Thr Leu
Thr Phe Asn Thr Lys His Leu Leu Glu Ala Leu 1 5
10 15 Phe Glu Ser Gly Ile Arg His Phe Ile Val
Ser Pro Gly Ser Arg Ser 20 25
30 Thr Pro Ile Ala Leu Leu Leu Ala Glu Tyr Ala Glu Gln Thr Asn
Glu 35 40 45 Ile
Lys Leu Phe Val Asp Val Asp Glu Arg Ser Ala Gly Phe Phe Ala 50
55 60 Leu Gly Ile Ala Lys Thr
Leu Leu Glu Pro Val Val Leu Leu Gly Thr 65 70
75 80 Ser Gly Thr Ala Ile Ala Glu Tyr Met Pro Ala
Val Ala Glu Ala Tyr 85 90
95 Ala Ala Asn Ile Pro Leu Val Val Leu Ser Thr Asp Arg Pro Gln Glu
100 105 110 Leu Gln
Phe Asn Gly Ala Pro Gln Thr Ile Pro Gln Ser Asn Leu Phe 115
120 125 Gly Gln Leu Thr Lys Gln Ala
Val Leu Ile Arg Leu Gln Asp Met His 130 135
140 Ser Asp Val Thr Glu Tyr Ile Asp Phe Ile Val Gln
Lys Val Val Asn 145 150 155
160 Leu Ser Ile Thr Ala Pro Arg Gly Pro Ile Gln Ile Asn Leu Pro Leu
165 170 175 Arg Lys Pro
Leu Met Pro Val Leu Asp Arg Gln Asp Glu Val His Val 180
185 190 Gln Arg Val Val Phe Asp Lys Gln
Ser Val Gln Tyr Arg Leu Pro Pro 195 200
205 Ile Thr Ala Lys Arg Leu Leu Ile Leu Ala Gly Pro Asn
Val Leu Asn 210 215 220
Ser Tyr Asp Asp Ser Leu Lys Lys Phe Ala Ile Lys Asn Asn Val Pro 225
230 235 240 Val Ile Ala Asp
Val Leu Ser Gln Ser Leu His Thr Tyr Thr Ile His 245
250 255 Gly Ile Asp Val Leu Leu Gln Ala His
Lys Ile Asn Ala Asp Leu Lys 260 265
270 Pro Asp Leu Val Val Arg Phe Gly Lys Thr Pro Val Ser Ala
Arg Val 275 280 285
Leu Gln Trp Leu Lys Glu Glu Asn Ile Leu Thr Trp His Val Gly Glu 290
295 300 Asp Ala Gly Val Asp
His Thr Arg His Ile Val Arg Ala Ile Lys Met 305 310
315 320 Ala Pro His Asp Phe Leu Glu Ser Met His
Leu Thr Leu Ser Lys Asn 325 330
335 Gln Ile Asp Phe Asn Gln Lys Trp Leu Ser Leu Pro Lys Val Ile
Lys 340 345 350 Thr
Arg Asn Glu Met Asn Ile Ile Thr Ala Leu Asp Asp Ala Val Pro 355
360 365 Asp Asp Thr His Ile Phe
Val Ala Asn Ser Met Pro Ile Arg Asp Met 370 375
380 Asp Asn Phe Phe Thr Gly Asn His Thr Gln Arg
Ile Tyr Ala Asn Arg 385 390 395
400 Gly Ala Asn Gly Ile Asp Gly Val Ile Ser Ser Ala Leu Gly Met Ser
405 410 415 Ala Val
Ala Lys Gln Arg Ser Val Leu Leu Thr Gly Asp Leu Thr Leu 420
425 430 Phe His Asp Met Asn Gly Leu
Met Met Ala Lys Asn Tyr Gln Leu Pro 435 440
445 Leu Asp Ile Ile Val Ile Asn Asn Asn Gly Gly Gly
Ile Phe Ser Phe 450 455 460
Leu Pro Gln Ala Gly Ala Pro Lys Tyr Phe Glu Gln Gln Phe Gly Thr 465
470 475 480 Pro Leu Asn
Ile Asp Ile Lys Lys Ile Ala Asp Leu Tyr Tyr Ile Asp 485
490 495 Tyr His Gln Leu Asn Val Pro Glu
Ala Leu Ser Gln Ile Leu Gln Thr 500 505
510 Pro Ser Lys Thr Thr Arg Leu Ile Glu Tyr Lys Ser Asp
His Gln Arg 515 520 525
Asn Arg Asp Asp His Arg Glu Val Leu Glu Met Leu Lys 530
535 540 55541PRTLeuconostoc mesenteroides 55Met
Thr Asp Thr Leu Thr Phe Asn Thr Lys His Leu Leu Glu Ala Leu 1
5 10 15 Phe Glu Ser Gly Ile Arg
His Phe Ile Val Ser Pro Gly Ser Arg Ser 20
25 30 Thr Pro Ile Ala Leu Leu Leu Ala Glu Tyr
Ala Glu Gln Asn Asn Glu 35 40
45 Ile Lys Leu Phe Val Asp Val Asp Glu Arg Ser Ala Gly Phe
Phe Ala 50 55 60
Leu Gly Ile Ala Lys Thr Leu Leu Glu Pro Val Val Leu Leu Gly Thr 65
70 75 80 Ser Gly Thr Ala Ile
Ala Glu Tyr Met Pro Ala Val Ala Glu Ala Tyr 85
90 95 Ala Ala Asn Ile Pro Leu Val Val Leu Ser
Thr Asp Arg Pro Gln Glu 100 105
110 Leu Gln Phe Asn Gly Ala Pro Gln Thr Ile Pro Gln Ser Asn Leu
Phe 115 120 125 Gly
Gln Leu Thr Lys Gln Ala Val Leu Ile Arg Leu Gln Asp Met His 130
135 140 Ser Asp Val Thr Glu Tyr
Ile Asp Phe Ile Val Gln Lys Val Val Asn 145 150
155 160 Leu Ser Ile Thr Ala Pro Arg Gly Pro Ile Gln
Ile Asn Leu Pro Leu 165 170
175 Arg Lys Pro Leu Met Pro Val Leu Asp Arg Gln Asp Glu Val His Val
180 185 190 Gln Arg
Val Val Phe Asp Lys Gln Ser Val Gln Tyr Arg Leu Pro Pro 195
200 205 Ile Thr Ala Lys Arg Leu Leu
Ile Leu Ala Gly Pro Asn Val Leu Asn 210 215
220 Ser Tyr Asp Asp Ser Leu Lys Lys Phe Ala Ile Lys
Asn Asn Val Pro 225 230 235
240 Val Ile Ala Asp Val Leu Ser Gln Ser Arg His Thr Tyr Thr Ile His
245 250 255 Gly Ile Asp
Val Leu Leu Gln Ala His Lys Ile Asn Ala Asp Leu Lys 260
265 270 Pro Asp Leu Val Val Arg Phe Gly
Lys Thr Pro Val Ser Ala Arg Val 275 280
285 Leu Gln Trp Leu Lys Glu Glu Asn Ile Leu Thr Trp His
Val Asp Glu 290 295 300
Asp Ala Gly Val Asp His Thr Arg His Ile Val Arg Ala Ile Lys Met 305
310 315 320 Ala Pro His Asp
Phe Leu Glu Ser Met His Leu Thr Leu Ser Lys Asn 325
330 335 Gln Ile Asp Phe Asn Gln Lys Trp Leu
Ser Leu Pro Lys Val Ile Lys 340 345
350 Thr Arg Asn Glu Met Asn Ile Ile Thr Ala Leu Asp Asp Ala
Val Pro 355 360 365
Asp Asp Thr His Ile Phe Val Ala Asn Ser Met Pro Ile Arg Asp Met 370
375 380 Asp Asn Phe Phe Thr
Gly Asn His Thr Gln Arg Ile Tyr Ala Asn Arg 385 390
395 400 Gly Ala Asn Gly Ile Asp Gly Val Ile Ser
Ser Ala Leu Gly Met Ser 405 410
415 Ala Val Val Lys Gln Arg Ser Val Leu Leu Thr Gly Asp Leu Thr
Leu 420 425 430 Phe
His Asp Met Asn Gly Leu Met Met Ala Lys Asn Tyr Gln Leu Pro 435
440 445 Leu Asp Ile Ile Val Ile
Asn Asn Asn Gly Gly Gly Ile Phe Ser Phe 450 455
460 Leu Pro Gln Ala Gly Ala Pro Lys Tyr Phe Glu
Gln Leu Phe Gly Asn 465 470 475
480 Pro Leu Asn Ile Asp Ile Lys Lys Ile Ala Asp Leu Tyr Tyr Ile Asp
485 490 495 Tyr His
Gln Leu Asn Val Pro Glu Ala Leu Ser Gln Ile Leu Gln Thr 500
505 510 Pro Ser Lys Thr Thr Arg Leu
Ile Glu Tyr Lys Ser Asp His Gln Arg 515 520
525 Asn Arg Asp Asp His Arg Glu Val Leu Glu Met Leu
Lys 530 535 540
56541PRTLeuconostoc mesenteroides 56Met Thr Asp Thr Leu Thr Phe Asn Thr
Lys His Leu Leu Glu Ala Leu 1 5 10
15 Phe Glu Ser Gly Ile Arg His Phe Ile Val Ser Pro Gly Ser
Arg Ser 20 25 30
Thr Pro Ile Ala Leu Leu Leu Ala Glu Tyr Ala Glu Gln Asn Asn Glu
35 40 45 Ile Lys Leu Phe
Val Asp Val Asp Glu Arg Ser Ala Gly Phe Phe Ala 50
55 60 Leu Gly Ile Ala Lys Thr Leu Leu
Glu Pro Val Val Leu Leu Gly Thr 65 70
75 80 Ser Gly Thr Ala Ile Ala Glu Tyr Met Pro Ala Val
Ala Glu Ala Tyr 85 90
95 Ala Ala Asn Ile Pro Leu Val Val Leu Ser Thr Asp Arg Pro Gln Glu
100 105 110 Leu Gln Phe
Asn Gly Ala Pro Gln Thr Ile Pro Gln Ser Asn Leu Phe 115
120 125 Gly Gln Leu Thr Lys Gln Ala Val
Leu Ile Arg Leu Gln Asp Met His 130 135
140 Ser Asp Val Thr Glu Tyr Ile Asp Phe Ile Val Gln Lys
Val Val Asn 145 150 155
160 Leu Ser Ile Thr Ala Pro Arg Gly Pro Ile Gln Ile Asn Leu Pro Leu
165 170 175 Arg Lys Pro Leu
Met Pro Val Leu Asp Arg Gln Asp Glu Val His Val 180
185 190 Gln Arg Val Val Phe Asp Lys Gln Ser
Val Gln Tyr Arg Leu Pro Pro 195 200
205 Ile Thr Ala Lys Arg Leu Leu Ile Leu Ala Gly Pro Asn Val
Leu Asn 210 215 220
Ser Tyr Asp Asp Ser Leu Lys Lys Phe Ala Ile Lys Asn Asn Val Pro 225
230 235 240 Val Ile Ala Asp Val
Leu Ser Gln Ser Arg His Thr Tyr Thr Ile His 245
250 255 Gly Ile Asp Val Leu Leu Gln Ala His Lys
Ile Asn Ala Asp Leu Lys 260 265
270 Pro Asp Leu Val Val Arg Phe Gly Lys Thr Pro Val Ser Ala Arg
Val 275 280 285 Leu
Gln Trp Leu Lys Glu Glu Asn Ile Leu Thr Trp His Val Asp Glu 290
295 300 Asp Ala Gly Val Asp His
Thr Arg His Ile Val Arg Ala Ile Lys Met 305 310
315 320 Ala Pro His Asp Phe Leu Glu Ser Met His Leu
Thr Leu Ser Lys Asn 325 330
335 Gln Ile Asp Phe Asn Gln Lys Trp Leu Ser Leu Pro Lys Val Ile Lys
340 345 350 Thr Arg
Asn Glu Met Asn Ile Ile Thr Ala Leu Asp Asp Ala Val Pro 355
360 365 Asp Asp Thr His Ile Phe Val
Ala Asn Ser Met Pro Ile Arg Asp Met 370 375
380 Asp Asn Phe Phe Thr Gly Asn His Thr Gln Cys Ile
Tyr Ala Asn Arg 385 390 395
400 Gly Ala Asn Gly Ile Asp Gly Val Ile Ser Ser Ala Leu Gly Met Ser
405 410 415 Ala Val Val
Lys Gln Arg Ser Val Leu Leu Thr Gly Asp Leu Thr Leu 420
425 430 Phe His Gly Met Asn Gly Leu Met
Met Ala Lys Asn Tyr Gln Leu Pro 435 440
445 Leu Asp Ile Ile Val Ile Asn Asn Asn Gly Gly Gly Ile
Phe Ser Phe 450 455 460
Leu Pro Gln Ala Gly Ala Pro Lys Tyr Phe Glu Gln Leu Phe Gly Thr 465
470 475 480 Pro Leu Asn Ile
Asp Ile Lys Lys Ile Ala Asp Leu Tyr Tyr Ile Asp 485
490 495 Tyr His Gln Leu Asn Val Pro Glu Ala
Leu Ser Gln Ile Leu Gln Ala 500 505
510 Pro Ser Lys Thr Thr Arg Leu Ile Glu Tyr Lys Ser Asp His
Gln Arg 515 520 525
Asn Arg Asp Asp His Arg Glu Val Leu Glu Met Leu Lys 530
535 540 57219PRTEscherichia coli 57Met Lys Glu Ile
Ile Asp Gly Phe Leu Lys Phe Gln Arg Glu Ala Phe 1 5
10 15 Pro Lys Arg Glu Ala Leu Phe Lys Gln
Leu Ala Thr Gln Gln Ser Pro 20 25
30 Arg Thr Leu Phe Ile Ser Cys Ser Asp Ser Arg Leu Val Pro
Glu Leu 35 40 45
Val Thr Gln Arg Glu Pro Gly Asp Leu Phe Val Ile Arg Asn Ala Gly 50
55 60 Asn Ile Val Pro Ser
Tyr Gly Pro Glu Pro Gly Gly Val Ser Ala Ser 65 70
75 80 Val Glu Tyr Ala Val Ala Ala Leu Arg Val
Ser Asp Ile Val Ile Cys 85 90
95 Gly His Ser Asn Cys Gly Ala Met Thr Ala Ile Ala Ser Cys Gln
Cys 100 105 110 Met
Asp His Met Pro Ala Val Ser His Trp Leu Arg Tyr Ala Asp Ser 115
120 125 Ala Arg Val Val Asn Glu
Ala Arg Pro His Ser Asp Leu Pro Ser Lys 130 135
140 Ala Ala Ala Met Val Arg Glu Asn Val Ile Ala
Gln Leu Ala Asn Leu 145 150 155
160 Gln Thr His Pro Ser Val Arg Leu Ala Leu Glu Glu Gly Arg Ile Ala
165 170 175 Leu His
Gly Trp Val Tyr Asp Ile Glu Ser Gly Ser Ile Ala Ala Phe 180
185 190 Asp Gly Ala Thr Arg Gln Phe
Val Pro Leu Ala Ala Asn Pro Arg Val 195 200
205 Cys Ala Ile Pro Leu Arg Gln Pro Thr Ala Ala
210 215 58219PRTPsuedomonas
fluorescensCarbonic anhydrase P. fluorescens Pf-5 58Met Gln Asn Ile Ile
Asp Gly Phe Leu Lys Phe Gln Arg Glu Ala Phe 1 5
10 15 Pro Gln Arg Ser Glu Leu Phe Lys Gln Leu
Ala Ser Thr Gln Asn Pro 20 25
30 Gly Thr Leu Phe Val Thr Cys Ser Asp Ser Arg Val Val Pro Glu
Leu 35 40 45 Leu
Thr Gln Gln Glu Pro Gly Asp Leu Phe Val Ile Arg Asn Ala Gly 50
55 60 Asn Ile Val Pro Ser Tyr
Gly Pro Glu Pro Gly Gly Val Ser Ala Thr 65 70
75 80 Val Glu Tyr Ala Val Ala Val Leu Gly Val Ser
Asp Ile Val Ile Cys 85 90
95 Gly His Ser Asp Cys Gly Ala Met Thr Ala Ile Ser Thr Cys Lys Cys
100 105 110 Leu Asp
His Leu Pro Ala Val Ala Asn Trp Leu Arg His Ala Glu Ser 115
120 125 Ala Lys Val Ile Asn Ala Ala
Arg Gln His Ala Ser Pro Ala Glu His 130 135
140 Leu Asp Ala Leu Val Arg Asp Asn Val Ile Ala Gln
Leu Ala Asn Leu 145 150 155
160 Lys Thr His Pro Ser Val Ala Leu Ala Leu Glu Gln Gly Arg Leu Asn
165 170 175 Leu His Gly
Trp Val Tyr Asp Ile Glu Ser Gly Ala Ile Val Ala Leu 180
185 190 Asp Gly Asn Thr Gln Arg Phe Val
Ser Leu Ala Glu Tyr Pro His Thr 195 200
205 Cys Ala Leu Ala Ser Gln Ala Ser Ser Ala Ala 210
215 59213PRTYersinia ruckeriCarbonic
anhydrase Y. ruckeri ATCC 29473 59Met Gln Asp Ile Ile Asp Gly Phe Leu Lys
Phe Gln Arg Glu Val Phe 1 5 10
15 Pro Gln Arg Ser Glu Leu Phe Lys Arg Leu Ala Asp Thr Gln His
Pro 20 25 30 Gly
Ala Leu Phe Val Thr Cys Ser Asp Ser Arg Val Val Pro Glu Leu 35
40 45 Leu Thr Gln Arg Glu Pro
Gly Glu Leu Phe Val Ile Arg Asn Ala Gly 50 55
60 Asn Ile Val Pro Ser Tyr Gly Pro Glu Pro Gly
Gly Val Ser Ala Thr 65 70 75
80 Val Glu Tyr Ala Val Ala Val Leu Gly Val Thr Asp Val Val Ile Cys
85 90 95 Gly His
Ser Asn Cys Gly Ala Met Ser Ala Ile Ala Glu Cys Gln Cys 100
105 110 Leu Asp His Leu Pro Ala Val
Ala Ala Trp Leu Arg His Ala Asp Ser 115 120
125 Ala Lys Leu Val Asn Ala Ala Leu Pro His Ala Ser
Pro Lys Asp Arg 130 135 140
Leu Asn Ser Leu Val Arg Glu Asn Val Ile Ala Gln Leu Ala Asn Ile 145
150 155 160 Lys Thr His
Pro Ser Val Ala Leu Ala Cys Ala Gln Gly Arg Leu Arg 165
170 175 Leu His Gly Trp Val Tyr Asp Ile
Glu Thr Gly Ser Ile Asp Val Leu 180 185
190 Asp Glu Leu Thr Arg Thr Phe Ser Pro Leu Ser Ala Tyr
Ser Val Val 195 200 205
Ser Lys Pro Thr Glu 210 60219PRTHahella
chejuensisCarbonic anhydrase Hahella chejuensis KCTC 2396 60Met Lys Asp
Ile Ile Glu Gly Phe Leu Lys Phe Gln Arg Glu Ala Phe 1 5
10 15 Pro Glu Arg Lys Glu Leu Phe Lys
Asp Leu Ala Asn Gln Gln Gln Pro 20 25
30 Arg Thr Leu Phe Ile Ser Cys Ser Asp Ser Arg Leu Val
Pro Glu Leu 35 40 45
Val Thr Gln Arg Glu Pro Gly Asp Leu Phe Val Ile Arg Asn Ala Gly 50
55 60 Asn Ile Val Pro
Pro Tyr Gly Pro Glu Pro Gly Gly Val Ser Ala Ser 65 70
75 80 Val Glu Tyr Ala Val Ala Ala Leu Arg
Val Ala Asp Ile Val Val Cys 85 90
95 Gly His Ser Asn Cys Gly Ala Met Thr Ala Val Ala Thr Cys
Gln Cys 100 105 110
Ile Asp His Met Pro Ala Val Ala His Trp Leu Arg Tyr Ala Asp Ser
115 120 125 Ala Lys Val Val
Asn Gln Ala Arg Lys His Ala Ser Glu Arg Ala Lys 130
135 140 Ile Glu Asp Met Val Arg Glu Asn
Val Ile Ala Gln Leu Ala Asn Leu 145 150
155 160 Gln Thr His Pro Ser Val Arg Leu Ala Leu Gln Glu
Gly Arg Leu Thr 165 170
175 Met His Gly Trp Phe Tyr Asp Ile Glu Ser Gly Gly Ile Asp Ala Tyr
180 185 190 Asp Gly Ser
Arg His Ala Phe Val Pro Leu Ala Glu His Pro Glu Ala 195
200 205 Arg Ala Ile Pro Gly Lys Leu Ser
His Ala Val 210 215
61219PRTBurkholderia cenocepaciaCarbonic anhydrase Burkholderia
cenocepacia J23152 61Met Lys Asp Ile Ile Glu Gly Phe Leu Lys Phe Gln
Arg Asp Ala Tyr 1 5 10
15 Pro Ala Arg Ala Ala Leu Phe Arg Asp Leu Ala Arg Ser Gln Asn Pro
20 25 30 Arg Ala Leu
Phe Ile Ser Cys Ser Asp Ser Arg Leu Val Pro Glu Leu 35
40 45 Val Thr Gln Arg Glu Pro Gly Asp
Leu Phe Val Ile Arg Asn Ala Gly 50 55
60 Asn Ile Val Pro Ser Tyr Gly Pro Glu Pro Gly Gly Val
Ser Ala Ser 65 70 75
80 Val Glu Tyr Ala Val Ala Ala Leu Arg Val Thr Asp Val Val Ile Cys
85 90 95 Gly His Ser Asp
Cys Gly Ala Met Thr Ala Ile Ala Thr Cys Gln Cys 100
105 110 Met Asp His Met Pro Ala Val Gly His
Trp Leu Arg Tyr Ala Asp Ser 115 120
125 Ala Arg Val Val Asn Glu Ala Arg Thr His Arg Ser Glu Arg
Glu Arg 130 135 140
Ile Asp Ser Met Val Arg Glu Asn Val Val Ala Gln Leu Ala Asn Leu 145
150 155 160 Lys Thr His Pro Ala
Val Arg Leu Ala Leu Glu Glu Gly Arg Leu Ala 165
170 175 Leu His Gly Trp Val Tyr Asp Ile Glu Ser
Gly Cys Ile Asp Ala Tyr 180 185
190 Asp Gly Ala Thr Gly Arg Phe Val Ser Leu Ala Asp His Pro Gly
Val 195 200 205 Arg
Ala Thr Pro Ala Thr Leu Pro Val Ala Ala 210 215
62215PRTYersinia frederikseniiCarbonic anhydrase Yersinia
frederiksenii ATCC 33461 62Met Lys Glu Ile Ile Asp Gly Phe Leu Lys
Phe Gln Arg Asp Ala Phe 1 5 10
15 Pro Glu Arg Ala Glu Leu Phe Arg Ser Leu Ala Thr Gln Gln Ser
Pro 20 25 30 Lys
Thr Leu Phe Ile Ser Cys Ser Asp Ser Arg Met Val Pro Glu Leu 35
40 45 Val Thr Gln Arg Glu Pro
Gly Asp Leu Phe Val Ile Arg Asn Ala Gly 50 55
60 Asn Ile Val Pro Ser Tyr Gly Pro Glu Pro Gly
Gly Ile Ser Ala Ser 65 70 75
80 Val Glu Tyr Ala Val Thr Ala Leu Lys Val Thr Asp Ile Val Ile Cys
85 90 95 Gly His
Ser Asp Cys Gly Ala Met Thr Ala Ile Ala Lys Cys His Cys 100
105 110 Leu Asp His Met Pro Ala Val
Lys His Trp Leu Gln Tyr Ala Asp Ser 115 120
125 Ala Lys Val Val Asn Glu Ser Arg Glu Tyr Lys Asn
Ile His Asp Lys 130 135 140
Thr Ile Ser Met Val His Glu Asn Val Val Ala Gln Leu Ala Asn Ile 145
150 155 160 Gln Thr His
Pro Ser Val Arg Leu Ala Leu Glu Glu Gly Arg Leu Thr 165
170 175 Ile His Gly Trp Val Tyr Asp Ile
Glu Ser Gly Leu Ile Ser Ala Phe 180 185
190 Asp Arg Ala Ser Arg Gln Phe Val Ser Leu Ala Ala Asn
Pro Asn Val 195 200 205
Arg Ala Val Pro Ala His Asn 210 215
63219PRTPseudomonas entomophilaCarbonic anhydrase Pseudomonas entomophila
L48 63Met Gln Asp Ile Ile Asp Gly Phe Leu Lys Phe Gln Arg Asp Ala Phe 1
5 10 15 Pro Glu Arg
Val Lys Leu Phe Lys Asp Leu Ala Thr Gln Gln Ser Pro 20
25 30 Arg Ala Leu Phe Ile Ser Cys Ser
Asp Ser Arg Leu Val Pro Glu Leu 35 40
45 Val Thr Gln Arg Glu Pro Gly Asp Leu Phe Val Ile Arg
Asn Ala Gly 50 55 60
Asn Ile Val Pro Ser Tyr Gly Pro Glu Pro Gly Gly Val Ser Ala Ser 65
70 75 80 Val Glu Tyr Ala
Val Ala Ala Leu Gln Val Ala Asp Ile Val Ile Cys 85
90 95 Gly His Ser Asp Cys Gly Ala Met Thr
Ala Ile Ala Thr Cys Lys Cys 100 105
110 Leu Asp His Met Pro Ala Val Ala Gly Trp Leu Arg Tyr Ala
Asp Ser 115 120 125
Ala Arg Val Val Asn Glu Ala Arg Gln His Gln Ser Pro His Ala Lys 130
135 140 Val Glu Ala Met Val
Arg Glu Asn Val Ile Ala Gln Leu Ala Asn Ile 145 150
155 160 Gln Thr His Pro Ser Val Arg Leu Ala Leu
Glu Glu Gly Arg Val Ala 165 170
175 Leu His Gly Trp Ile Tyr Asp Ile Glu Ser Gly Arg Ile Asp Ala
Phe 180 185 190 Asp
Gly Arg Thr Gly Gln Phe Val Ser Leu Ala Asp Asn Pro Glu Val 195
200 205 Arg Ala Val Ser His Ala
Ser Arg His Val Ala 210 215
64219PRTPseudomonas putidaCarbonate dehydratase Pseudomonas putida W619
64Met Lys Ala Ile Ile Asp Gly Phe Leu Lys Phe Gln Lys Asn Ala Phe 1
5 10 15 Pro Glu Arg Val
Lys Leu Phe Lys Asp Leu Ala Asn Gln Gln Ala Pro 20
25 30 Lys Ala Leu Phe Ile Ser Cys Ser Asp
Ser Arg Leu Val Pro Glu Leu 35 40
45 Val Thr Gln Arg Glu Pro Gly Asp Leu Phe Val Ile Arg Asn
Ala Gly 50 55 60
Asn Ile Val Pro Ser Tyr Gly Pro Glu Pro Gly Gly Val Ser Ala Ser 65
70 75 80 Val Glu Tyr Ala Val
Ala Gly Leu Asn Val Ala Asp Ile Val Ile Cys 85
90 95 Gly His Ser Asp Cys Gly Ala Met Thr Ala
Ile Ala Thr Cys Lys Cys 100 105
110 Leu Asp His Met Pro Ala Val Ala Gly Trp Leu Arg Tyr Ala Asp
Ser 115 120 125 Ala
Lys Val Val Asn Glu Ala Arg His His Val Asp Lys Pro Ser Lys 130
135 140 Val Ala Ser Met Val Arg
Glu Asn Val Ile Ala Gln Leu Ala Asn Ile 145 150
155 160 Gln Thr His Pro Ser Val Arg Leu Ala Leu Glu
Glu Gly Arg Val Thr 165 170
175 Leu His Gly Trp Ile Tyr Asp Ile Glu Thr Gly Gly Ile Asp Ala Phe
180 185 190 Asp Gly
Ser Thr Gly Thr Phe Val Ser Leu Ala Glu Asn Pro Glu Val 195
200 205 His Ala Val Ser Gln Gln Ala
Arg His Val Ala 210 215
65217PRTSerratia proteamaculansCarbonate dehydratase Serratia
proteamaculans 568 65Met Lys Glu Val Ile Glu Gly Phe Leu Lys Phe Gln
Arg Glu Ala Phe 1 5 10
15 Val Glu Arg Thr Ala Leu Phe Gln Arg Leu Ala Thr Gln Gln Ser Pro
20 25 30 Arg Thr Leu
Phe Ile Ser Cys Ser Asp Ser Arg Leu Val Pro Glu Leu 35
40 45 Ile Thr Gln Arg Glu Pro Gly Asp
Leu Phe Val Ile Arg Asn Ala Gly 50 55
60 Asn Ile Val Pro Ser Phe Gly Pro Glu Pro Gly Gly Val
Ser Ala Ser 65 70 75
80 Val Glu Tyr Ala Val Ser Ala Leu Gly Val Glu Asp Ile Val Ile Cys
85 90 95 Gly His Ser Asp
Cys Gly Ala Met Thr Ala Ile Ala Thr Cys Gln Cys 100
105 110 Leu Gln His Met Pro Thr Val Ala Asn
Trp Leu Arg Tyr Ala Asp Ser 115 120
125 Ala Lys Val Val Asn Gln Ala Tyr Gln His Ala Ser Glu Asn
Glu Lys 130 135 140
Val Ser Ser Met Val Arg Glu Asn Val Ile Ala Gln Leu Asn Asn Ile 145
150 155 160 Lys Thr His Pro Ser
Val Ala Leu Ala Leu Glu Gln Gly Arg Leu Lys 165
170 175 Leu His Gly Trp Val Tyr Asp Ile Ala Ser
Gly Gly Ile Glu Ala Leu 180 185
190 Asp Gly Glu Thr Arg Arg Phe Ile Pro Leu Ala Thr Asn Pro Glu
Val 195 200 205 Thr
Ala Thr Pro Ala Val Ser Arg Phe 210 215
66211PRTErwinia tasmaniensisCarbonate dehydratase Erwinia tasmaniensis
Et1/99 66Met Gln His Ile Val Glu Gly Phe Leu Asn Phe Gln Lys Asp Ile
Phe 1 5 10 15 Pro
Glu Gln Lys Glu Leu Phe Arg Ser Leu Ala Ser Ser Gln Asn Pro
20 25 30 Lys Ala Leu Phe Ile
Ser Cys Ser Asp Ser Arg Leu Val Pro Glu Leu 35
40 45 Val Thr Gln Gln Asp Pro Gly Gln Leu
Phe Val Ile Arg Asn Ala Gly 50 55
60 Asn Ile Val Pro Ser Phe Gly Pro Glu Pro Gly Gly Val
Ser Ala Thr 65 70 75
80 Ile Glu Tyr Ala Val Val Ala Leu Gly Val Ser Asp Ile Val Ile Cys
85 90 95 Gly His Ser Asn
Cys Gly Ala Met Lys Ala Ile Ala Thr Cys Gln Cys 100
105 110 Leu Ala Pro Met Pro Ala Val Glu His
Trp Leu Arg Tyr Ala Asp Ala 115 120
125 Ala Lys Ala Val Val Glu Lys Lys Asn Tyr Asp Thr Glu Glu
Asp Lys 130 135 140
Val Asn Ala Met Val Gln Glu Asn Val Ile Ala Gln Leu Asn Asn Ile 145
150 155 160 Lys Thr His Pro Ser
Val Ala Val Gly Leu Arg Asn Asn Ala Leu Arg 165
170 175 Leu His Gly Trp Val Tyr Asp Ile Glu Ser
Gly Ala Ile Arg Ala Leu 180 185
190 Asp Lys Asp Ser Lys Lys Phe Val Leu Leu Ser Asp Asn Pro Gln
Val 195 200 205 His
Phe Glu 210 67220PRTPseudomonas aeruginosaCarbonate dehydratase
Pseudomonas aeruginosa PA7 67Met Arg Asp Ile Ile Asp Gly Phe Leu Arg
Phe Gln Arg Asp Ala Tyr 1 5 10
15 Pro Ala Arg Ser Gln Leu Phe Lys Ser Leu Ala Thr Arg Gln Ala
Pro 20 25 30 Lys
Ala Leu Phe Ile Ala Cys Ser Asp Ser Arg Val Val Pro Glu Leu 35
40 45 Leu Thr Gln Arg Glu Pro
Gly Glu Leu Phe Val Ile Arg Asn Ala Gly 50 55
60 Asn Ile Val Pro Gly Tyr Gly Pro Gln Pro Gly
Gly Val Ser Ala Ser 65 70 75
80 Val Glu Tyr Ala Val Ala Val Leu Gly Val Ala Asp Ile Val Val Cys
85 90 95 Gly His
Ser Asp Cys Gly Ala Met Gly Ala Ile Ala Ser Cys Ala Cys 100
105 110 Leu Asp His Leu Pro Ala Val
Ala Gly Trp Leu Arg His Ala Glu Ala 115 120
125 Ala Arg Ala Met Asn Ser Ala His Glu His Ser Ser
Asp Ala Ala Arg 130 135 140
Leu Asp Ala Leu Val Arg His Asn Val Ile Ala Gln Leu Ala Asn Leu 145
150 155 160 Arg Thr His
Pro Ser Val Ala Arg Ala Leu Glu Gln Gly Arg Leu Asn 165
170 175 Leu His Gly Trp Val Tyr Asp Ile
Glu Ser Gly Arg Ile Asp Ala Leu 180 185
190 Asp Gly Ala Ser Arg Arg Phe Val Ser Leu Ala Glu His
Pro Gly Val 195 200 205
Arg Ala Val Gly Gly Glu Pro Gly Gln Ala Val Ala 210
215 220 68202PRTStreptosporangium roseumCarbonate
dehydratase Streptosporangium roseum DSM 43021 68Met Gln Asp Leu Glu
Glu Gly Val Ala Arg Phe Gln Arg Asp Val Phe 1 5
10 15 Pro Ala Lys Thr Glu Leu Phe Thr Arg Leu
Ala Thr Ala His Gln Pro 20 25
30 Ala Thr Leu Phe Ile Ser Cys Ser Asp Ala Arg Val Val Pro Glu
Leu 35 40 45 Ile
Thr Gln Ser Glu Pro Gly Glu Leu Phe Val Ile Arg Thr Ala Gly 50
55 60 Asn Leu Val Pro Ala Tyr
Ala Pro Gly Ser Ala Asp Gly Val Ala Ala 65 70
75 80 Gly Ile Glu Tyr Ala Val Ala Val Leu Gly Val
Ser Asp Ile Val Val 85 90
95 Cys Gly His Ser Gly Cys Gly Ala Met Thr Ala Val Ala Asp Gly Leu
100 105 110 Asp Pro
Ala Ala Leu Pro Ala Val Ala Gly Trp Leu Arg His Ala Asp 115
120 125 Ala Ser Arg Ala Arg Val Thr
Thr Thr Glu Thr Gly Thr Gly Glu Val 130 135
140 Ala Ala Leu Val Arg Gln Asn Val Leu Thr Gln Leu
Ala Asn Leu Ala 145 150 155
160 Thr His Pro Ser Val Ala His Ala Leu Ala Gly Lys Thr Val Thr Leu
165 170 175 His Gly Trp
Ile Tyr Asp Ile Gly Thr Gly Thr Val Ala Glu Leu Asp 180
185 190 Ala Thr Gly Arg Pro Ser Ala Leu
Ala Val 195 200
User Contributions:
Comment about this patent or add new information about this topic: