Patent application title: Novel Beta-Glucosidase and Uses thereof
Inventors:
Su-May Yu (Taipei, CN)
Tuan-Hua David Ho (Chesterfield, MO, US)
Hsion-Wen Kuo (Chiayi, TW)
Ng I-Son Wu (Yung-Kang City, TW)
Chen-Wei Li (Erlin Township, TW)
Yu-Ming Ju (Shi-Jr City, TW)
Assignees:
Academia Sinica
IPC8 Class: AC12N924FI
USPC Class:
435 99
Class name: Micro-organism, tissue cell culture or enzyme using process to synthesize a desired chemical compound or composition preparing compound containing saccharide radical produced by the action of a carbohydrase (e.g., maltose by the action of alpha amylase on starch, etc.)
Publication date: 2013-02-21
Patent application number: 20130045510
Abstract:
A novel beta-glucosidase and nucleic acids encoding the beta-glucosidase.
Also disclosed are cells, compositions, and methods relating to using the
beta-glucosidase to convert ligocellulosic material to fermentable
sugars.Claims:
1. An isolated polypeptide comprising a sequence at least 70% identical
to SEQ ID NO: 1, wherein the polypeptide has a β-glucosidase
activity.
2. The isolated polypeptide of claim 1, wherein the sequence is at least 80% identical to SEQ ID NO: 1.
3. The isolated polypeptide of claim 2, wherein the sequence is at least 90% identical to SEQ ID NO: 1.
4. The isolated polypeptide of claim 3, wherein the sequence is at least 95% identical to SEQ ID NO: 1.
5. The isolated polypeptide of claim 4, wherein the sequence is at least 99% identical to SEQ ID NO: 1.
6. The isolated polypeptide of claim 5, wherein the polypeptide includes the sequence of SEQ ID NO: 1.
7. An isolated nucleic acid encoding the polypeptide of claim 1.
8. The nucleic acid of claim 7, wherein the nucleic acid comprises the sequence of SEQ ID NO: 2, 3, 5, or 6.
9. An expression vector comprising a nucleic acid of claim 7.
10. A host cell comprising a nucleic acid of claim 7.
11. A method of producing a polypeptide, comprising culturing in a medium a host cell that contains a nucleic acid encoding the polypeptide of claim 1 under conditions permitting expression of the polypeptide encoded by the nucleic acid, and purifying the polypeptide from the cultured cell or the medium of the cell.
12. The method of claim 11, wherein the polypeptide has a sequence that is 90% identical to SEQ ID NO: 1.
13. The method of claim 11, wherein the polypeptide has a sequence that is 95% identical to SEQ ID NO: 1.
14. The method of claim 11, wherein the polypeptide has the sequence of SEQ ID NO: 1.
15. A composition comprising a polypeptide of claim 1.
16. The composition of claim 15, wherein the composition further comprises an exoglucanase and an endoglucanase.
17. A method of producing fermentable sugars from lignocellulosic material, comprising providing a composition of claim 15, and contacting the composition with lignocellulosic material to produce fermentable sugars.
18. The method of claim 17, wherein the fermentable sugar is elected from the group consisting of glucose, xylose, arabinose, galactose, mannose, rhamnose, sucrose, fructose, lactose, maltose, trehalose, and cellobiose.
19. The method of claim 17, wherein the polypeptide has the amino acid sequence of SEQ ID NO:1.
20. The method of claim 17, wherein the lignocellulosic material is selected from the group consisting of cellulosic animal waste, municipal solid waste, waste paper, yard waste, an agricultural residue, and a forestry residue.
21. The method of claim 17, further comprising converting the fermentable sugar to a fermentation product.
22. The method of claim 21, wherein the converting step is performed by microorganism fermentation or enzyme treatment.
23. A method of producing energy from lignocellulosic material, comprising providing a composition of claim 15; contacting the composition with the lignocellulosic material to produce a fermentable sugar; converting the fermentable sugar to produce a combustible fermentation product, and combusting the combustible fermentation product to produce energy.
24. A bioreactor containing a lignocellulosic material and a composition of claim 15.
Description:
CROSS REFERENCE TO RELATED APPLICATIONS
[0001] This application claims the priority of U.S. Provisional Application No. 61/299,007, filed on Jan. 28, 2010. The contents of the application are hereby incorporated by reference in its entirety.
BACKGROUND
[0002] Developments of alternative energy resources (e.g., solar energy, biofuel, hydropower, wind power etc.) have been encouraged for years to cope with arising energy problems related to vastly consumptions of fossil fuels. Among many possible resources, plant biomass is of particular interest as it is renewable. Plant mass contains a high amount of cellulose, a starting material for making biofuel. To convert cellulose to biofuel, it is first degraded to fermentable sugars, such as cellobiose and glucose, by the cellulolytic system of microorganisms. This system includes three major types of hydrolases, i.e., endoglucanases (EC 3.2.1.4), exoglucanases (EC 3.2.1.91), and β-glucosidases (EC 3.2.1.21). Although many glucosidases have been isolated from various microorganisms, their efficiencies are not satisfactory. Thus, there is a need for a high efficient glucosidase.
SUMMARY
[0003] This invention relates to a novel β-glucosidase, BGL-Cr-D2, which is isolated from a Taiwan-indigenous fungus Chaetomella raphigera. Shown in FIG. 2 are the sequences of the mature BGL-Cr-D2 protein (722 a.a. resides/SEQ ID NO: 1), the corresponding bgl-Cr-D2 cDNA (2166 bps/SEQ ID NO: 2), the signal peptide sequence (SEQ ID NO: 7), the corresponding cDNA (SEQ ID NO: 8), the BGL-Cr-D2 protein with its signal peptide (SEQ ID NO: 4), and the corresponding cDNA (SEQ ID NO: 5). Shown in FIG. 1 is the genomic DNA region (2851 bps, SEQ ID NO: 3) in Chaetomella raphigera encoding the BGL-Cr-D2 enzyme, which contains 12 intron sections, and the genomic DNA region corresponding to SEQ ID NO: 4 (SEQ ID NO: 6).
[0004] Accordingly, one aspect of this invention features an isolated polypeptide containing a sequence exhibiting at least 70% (e.g., 80%, 90%, 95%, or 99%) amino acid identity to SEQ ID NO:1 as determined by the BLAST algorithm. In one example, the polypeptide has a β-glucosidase activity.
[0005] This invention also encompasses (i) an isolated nucleic acid including a nucleotide sequence that encodes the polypeptide described above, and (ii) a host cell containing such an isolated nucleic acid. In one example, the nucleotide sequence contains the sequence of SEQ ID NO: 2, 3, 5, or 6. The isolated nucleic acid of this invention can be an expression vector, in which the nucleotide sequence is operably linked to a suitable promoter sequence (i.e., a sequence capable of initiating transcription in a host cell).
[0006] The term "isolated polypeptide" or "isolated nucleic acid" used herein refers to a polypeptide or nucleic acid substantially free from naturally associated molecules, i.e., the naturally associated molecules constituting at most 20% by dry weight of a preparation containing the polypeptide or nucleic acid. Purity can be measured by any appropriate method, e.g., column chromatography, polyacrylamide gel electrophoresis, and HPLC.
[0007] Also within the scope of this invention are (A) a composition containing the polypeptide described above, an exo-glucanase, an endo-glucanase, and/or a β-glucosidase, and (B) a method of producing a fermentable sugar from a ligocellulosic material. The method includes (i) providing the just-described composition, and (ii) contacting this composition with a lignocellulosic material to produce a fermentable sugar, e.g., glucose, xylose, arabinose, galactose, mannose, rhamnose, surose, fructose, lactose, maltose, trehalose, or cellobiose. The fermentable sugar can be converted to a fermentation product, such as alcohol, by microbial fermentation or enzyme treatment. Examples of the lignocellulosic material used in this method include, but are not limited to, orchard prunings, chaparral, mill waste, urban wood waste, municipal waste, logging waste, forest thinnings, short-rotation woody crops, industrial waste, wheat, wheat straw, oat straw, rice straw, barley straw, rye straw, flax straw, soy hulls, rice hulls, oat hulls, sugar cane, corn, corn stover, corn stalks, corn gluten feed, corn cobs, corn husks, corn kernel, fiber from kernels, prairie grass, gamagrass, foxtail, sugar beet pulp, citrus fruit pulp, seed hulls, cellulosic animal wastes, lawn clippings, cotton, seaweed, trees, shrubs, grasses, sugar cane bagasse, products and by-products from wet or dry milling of grains, municipal solid waste, waste paper, yard waste, herbaceous material, agricultural residues, forestry residues, municipal solid wastes, waste paper, pulp, paper mill residues, branches, bushes, canes, corn, corn husks, energy crops, forests, fruits, flowers, grains, grasses, herbaceous crops, leaves, bark, needles, logs, roots, saplings, shrubs, switch grasses, trees, vegetables, fruit peels, vines, sugar beet pulp, wheat midlings, oat hulls, hard and soft woods, organic waste materials generated from agricultural processes, forestry wood waste, or combinations thereof. If the fermentation product is combustible, the method further contains a step of combusting the combustible fermentation product to produce energy.
[0008] This invention also features a bioreactor containing a lignocellulosic material and a composition, including the polypeptide described above, an exo-glucanase, an endo-glucanase, and/or a β-glucosidase.
[0009] The details of one or more embodiments of the invention are set forth in the accompanying drawings and the description below. Other features, objects, and advantages of the invention will be apparent from the description and drawings, and from the claims.
DESCRIPTION OF DRAWINGS
[0010] FIG. 1 is a diagram showing the genomic DNA region (2851 bps, SEQ ID NO: 3) in Chaetomella raphigera encoding the BGL-Cr-D2 enzyme, which contains 12 intron sections (as indicated with "White" sections in the diagram and "Small Caps/Gray/Strikethrough" sections in the sequence). A region encoding signal peptide is indicated with "Black" in the diagram and "Underline" in the sequence.
[0011] FIG. 2 is a diagram showing the bgl-Cr-D2 cDNA (2166 bps/SEQ ID NO: 2 excluding signal sequence indicated with underline/SEQ ID NO: 8) and its encoding amino acid sequences (722 a.a. resides/SEQ ID NO: 1, excluding signal peptide/SEQ ID NO: 7) for a Taiwan-indigenous fungus Chaetomella raphigera.
[0012] FIG. 3 is a diagram of the phylogenetic tree based on amino acid sequences of beta-glucosidase (BGL) from glycoside hydrolase (GH) family 3, showing that the BGL-Cr-D2-Pp secreted from reconstructed Pichia pastoris (strain SMD 1168) was related to the beta-D-glucoside glucohydrolase (i.e., BGL) of T. stipitatua, A. terreus NIH2624, and A. fumigates A1163 (i.e., 64-65% similarity in AA sequences).
[0013] FIG. 4 is a diagram of a vector map of reconstructed pGAPZαC, showing that the bgl-Cr-D2 gene was cloned for further transformation to P. pastoris expression system. (In the simplified nucleotide sequence showed below the map, the lengths of shortened regions, i.e., GAP promoter, α-factor signal sequence, and bgl-Cr-D2, do not correlate to the real sizes as scaled in the diagram map.)
[0014] FIGS. 5A and 5B are diagrams showing growth and pNPGase activity versus time for the colonies of (A) Pichia pastoris strain SMD1168 and (B) P. pastoris strain SMD1168 carrying bgl-Cr-D2; they show similar growth profile but significant pNPGase activity only for the colony cloned with bgl-Cr-D2.
[0015] FIGS. 6A-C are photographs showing enzymes sizes and activities of native PAGE (A), MUG zymogram (B) and SDS PAGE (C). (Pp: Pichia pastoris; Cr-D2: Chaetomella raphigera fungus strain D2; BGL: beta-glucosidase).
[0016] FIG. 7 is a diagram showing that the recombinant enzyme BGL-Cr-D2-Pp was stable over a wide pH range (4-9) based on 20-hour pH stability assessment (incubated the enzyme at 4° C. under each of the particular pH levels for 20 hours prior to performing standard pNPGase activity measurement, i.e., pH5 and 55° C. for 10 mins). The optimal pH level for pNPGase activity of the BGL-Cr-D2-Pp was around 5 (measured pNPGase activity of the enzyme at each of the particular pH levels).
[0017] FIG. 8 is a diagram showing that the recombinant enzyme BGL-Cr-D2-Pp was stable under a temperature range of 4-55° C. based on 4-hour thermostability assessment (incubated the enzyme under each of the particular temperature for 4 hours prior to performing standard pNPGase activity measurement). Optimal temperature for pNPGase activity of the BGL-Cr-D2-Pp was around 70° C. (measured pNPGase activity of the enzyme at each of the particular temperature); but, at this high temperature, the enzyme would become inactive after longer incubation time (e.g., ≧2 hrs, FIG. 9).
[0018] FIG. 9 is a 3-D diagram showing that thermostability of the recombinant β glucosidase BGL-Cr-D2-Pp under temperature ranged from 4 to 75° C. over a 9-days period. Under a temperature ≦50° C., the BGL-Cr-D2-Pp remained stable (relative activity >80%) over 9 days.
DETAILED DESCRIPTION
[0019] This invention is based, at least in part, on the unexpected discoveries of a novel beta-glucosidase and its high enzymatic activity or efficiency. Beta-glucosidases (or beta-D-glucoside glucohydrolase, BGL) are a group of the key enzymes for processing lignocellulosic saccharization. This type of enzyme can liberate D-glucose units from the cellulose-hydrolyzing media such as cellobiose, cello-oligosaccharides, and other glucosides after the actions of endo-glucanases (EG) (mainly degrading the amorphous parts of cellulose fiber) and exo-glucanases (or cellobiohydrolases, CBH) (mainly degrading the crystalline cellulose). For industrial-scale production of biofuel from lignocelluloses, finding low-cost cellulases (i.e., CBH, EG, and BGL) with high activity and productivity is essential.
[0020] This invention discloses new discoveries in polypeptides with beta-glucosidase activity, polynucleotides encoding the polypeptides, and enhancement in beta-glucosidase specific activity for recombinant beta-glucosidase expressed from yeast. Explicitly, a Taiwan-indigenous fungus, i.e., Chaetomella raphigera strain D2, was found capable of secreting an active beta-1,4-glucosidase (BGL-Cr-D2). The gene (bgl-Cr-D2, size=2166 bps) encoding this enzyme was cloned from reverse-transcribed cDNA and has been successfully expressed in a yeast expression system using Pichia pastoris strain SMD1168 as the host. Recombinant enzyme BGL-Cr-D2-Pp secreted from P. pastoris had a higher β-glucosidase activity (based on pNPGase activity measurement) than the native enzyme (approx. 3×) and the benchmark Novozyme-188 (approx. 17×). This invention provides a new recombinant β-glucosidase enzyme that has potential to be utilized together with other cellulases for cellulosic saccharization.
1. Biofuels from Lignocellulose
[0021] Developments of renewable power resources (e.g., solar energy, biofuel, hydropower, wind power etc.) have been encouraged for years to cope with arising energy problems related to vastly consumptions of fossil fuels. Among the currently-exploited renewable energy forms/sources, biofuel has a unique advantage in reducing agricultural/municipal/industrial organic wastes while generating cleanliness and energy-rich fuels (e.g., methane, ethanol, butanol, and hydrogen). Thus, it has been listed as one of important sustainable energies (Antoni et al., 2007, Appl Microbiol Biotechnol 77:23-35; and Rubin, 2008, Natures 454:841-845). Not only in Taiwan, production of bioethanol from lignocellulose is urgent worldwide as about 5×1012 tons of cellulose-rich agriculture wastes are produced on the earth every year and supplement of ethanol in gasoline is ongoing or on-its-way in many countries. Prior to further fermentation for ethanol production, various enzyme systems (e.g., ligninase, cellulases, and hemicellulases) are required for complete breakdown of lignocelluloses (containing about 5-30% of lignin, approximately 35-50% of cellulose, and approximately 20-35% of hemicellulose) to release fermentable glucose (Lynd et al., 2002, Microbiol Mol Biol Rev 66:506-577).
2. Cellulases
[0022] For bioethanol production from cellulose, efficient saccharification (degradation of polymeric cellulose fibers to release smaller glucose molecules) by cellulases is crucial. Cellulases are a group of enzymes responsible for complex cellulose hydrolysis. Multiple cellulases have been classified into three groups, i.e., (i) exoglucanases (including 1,4-β-D-glucan cellobiohydrolases, CBH, EC 3.2.1.91 and 1,4-β-D-glucan glucanohydrolases, EC 3.2.1.74), (ii) endoglucanases (EG) (EC 3.2.1.4), and (iii) beta-glucosidases (or β-D-glucoside glucohydrolase, BGL) (EC 3.2.1.21) based on structural and functional properties of the enzymes (Coughlan et al., 1988, In Biochemistry and Genetics of Cellulose Degradation, pp. 11-30; and Henrissat et al., 1989, Gene 81:83-95). The CBH mainly act on the crystalline sections at reducing or nonreducing end of cellulose fiber while EG randomly attack the amorphous parts of cellulose. More specifically, these cellulases can cleave the β-1,4-glucan or β-D-glucosidic linkages between glucosyl residues within celluloses to form short-chain media including cellobiose (i.e., disaccharides), other cellodextrins (i.e., cello-oligosaccharides), and glucosides (e.g., alcoholic, cyanogenic, or phenolic glycosides). These short-chain intermediates can be further hydrolyzed to glucose (i.e., monosaccharide) by BGL that primarily catalyzes the transfer of glycosyl group between oxygen nucleophiles (Bhatia et al., 2002, Crit Rev Biotechnol 22(4):375-407). For cellulose saccharification, insufficient BGL activity can result in not only shortage of glucose but also accumulation of cellobiose that is a potent inhibitor for cellulose hydrolysis by CBH and EG (Harris et al., 2007, U.S. Pat. No. 7,244,605B2). Thus, BGL plays an important role in achieving high ethanol yield from cellulose (Lynd et al., 2002, Microbiol Mol Biol Rev 66:506-577; and Hong et al., 2007, Appl Microbiol Biotechnol 73:1331-1339).
3. Sources of β-glucosidases (BGLs)
[0023] The enzyme BGL is ubiquitous and has been discovered in all the living kingdoms from microbes, insects, and plants to highly-evolved mammals (Esen, 1993, In Biochemistry and Molecular Biology, American Chemical Society, 1-14; Bhatia et al., 2002, Crit Rev Biotechnol 22(4):375-407). One of major BGL-producing sources is from bacteria including some strains of genus Agrobacterium, Bacillus, Butyrivibrio, Cellovibrio, Clostridium, Erwinia, Pseudomonas, Pyrococcus, Ruminococcus, Streptomyces, Thermotoga, Thermobifida etc. (Hashimoto et al., 1998, Arch Biochem Biophys 360:1-9; Srivastava et al., 1999, Biotechnol Lett 21:293-297; and Yun et al., 2001, Biosci Biotechnol Biochem 65(9):2028-2032). Fungal strains (molds or yeasts) with BGL-producing ability include those from genus Aspergillus, Candida, Humicola, Penicillium, Pichia, Saccharomycopsis, Talaromyces, Trichoderma etc. (Dan et al., 2000, J Biol Chem 275(7):4973-4980; and Dunn-Coleman et al., 2008, U.S. Patent Publication 2008/0095889A1). Among these microbes, fungal species Aspergillus niger, Aspergillus fumigates, and Trichoderma reesei are the most well-known and efficient BGL producers making them the major sources for commercial BGLs in the world (Dan et al., 2000, J Biol Chem 275(7):4973-4980; Harris et al., 2007, U.S. Pat. No. 7,244,605B2; and Dunn-Coleman et al., 2008, U.S. Patent Publication 2008/0095889A1). In addition, few plants like barley, Costus, and Maize have been found containing BGL as well.
[0024] Beside above-mentioned native sources, some recombinant BGLs have been expressed in different hosts (e.g., E. coli, B. subtilis, P. pastoris, S. cerevisiae, A. kawachii, and T. reesei) (Pandey et al., 1995, J Ferment Bioeng 80(5):446-453; Murray et al., 2004, Protein Expres Purif 38:248-257; and Roy et al., 2005, Biochem Bioph Res Co 336:299-308). For example, bgl genes from Bacillus sp. had been successfully cloned and expressed in E. coli system (Hashimoto et al., 1998, Arch Biochem Biophys 360:1-9; and Srivastava et al., 1999, Biotechnol Lett 21:293-297). In addition, an Aspergillus niger BGL coding gene, bgl1, had been expressed in P. pastoris and S. cerevisiae (Dan et al., 2000, J Biol Chem 275(7):4973-4980). Clone BGL genes from diverse sources and then express in different hosts has been a practical approach for gaining higher BGL productivity (over-expression) and/or activity (advantageous post-translational modification improve kinetic features of the enzymes).
4. Properties of BGLs
[0025] The properties of BGLs are rather diverse by far. In general, BGLs from different orders and kingdoms appear to differ in their specificities for the aglycone part (aryl-, alkyl-, or amino-) linked to the glycosyl group (Bhatia et al., 2002, Crit Rev Biotechnol 22(4):375-407). They have been found to be intracellular, extracellular, cytosol-associated, membrane-bound, or periplasmically-localized. Their catalytic functionalities on hydrolyzing short-chain cellobiose and oligosaccharides are quite similar in cellulosic microorganisms although some other various functions have been noticed in different organisms (e.g., synthesis of glycosyl-bond between different molecules in some microbes, releasing cyanides from cyano-glucoside precursors in insects and plants, and hydrolysis of glycosylceramides for treating Gaucher's disease in humans) (Bhatia et al., 2002, Crit Rev Biotechnol 22(4):375-407). For cellulose hydrolysis, BGLs are able to hydrolyze their substrates with net retention of anomeric configuration that occurs presumably via a two-step, double-displacement mechanism involving two carboxylic acid residues at key active site (Davies et al., 1998, Comprehensive Biological Catalysis Vol. 1, pp. 119-208; and Withers, 2001, Carbohyd Polym 44:325-337). It has been noted that most of BGLs have different substrate specificity and can act on a wide spectrum of substrates (e.g., pNPG; cellobiose, salicin, MUG, arbutin, aryl-, alkyl-, and methyl-glucosides). In addition, most of BGLs have optimum pH within weak-acid levels and optimum temperature in mesophilic scopes despite some strong-acid and/or thermophilic properties have been reported. The BGL monomers from diverse organism sources have a very broad molecular size range (i.e., about 13-137 kDa) (Gabelsberger et al., 1993, Appl Microbiol Biotechnol 40:44-52; Iwashita et al., 1999, Appl Environ Microbiol 65:5546-5553; Srivastava et al., 1999, Biotechnol Lett 21:293-297; and Dan et al., 2000, J Biol Chem 275(7):4973-4980). Amino acid sequence similarities between BGLs of different organism species also vary widely (10s -90s %). Nevertheless, amino acid bases of aspartic acid (Asp, D) and glutamine acid (Glu, E) have been recognized as catalytic necleophile and catalytic proton donor of BGLs.
5. Classification of BGLs
[0026] Based on chemical reactions they catalyze, all BGLs have been assigned to EC 3.2.1.21 category (i.e., Enzyme Commission number of 3-hydrolases, 2-glycosylases, 1-glycosidases, and 21-glucosidase) according to numerical classification scheme for more than 3,000 enzymes published by the IUBMB (International Union of Biochemistry and Molecular Biology) (Webb, 1992, Enzyme nomenclature: recommendations of the Nomenclature Committee of the International Union of Biochemistry and Molecular Biology on the nomenclature and classification of enzymes, ISBN 0-12-227164-5; and ExPASy, 2009, www.expasy.org/enzyme/). Additionally, BGLs are versatile hydrolytic enzymes that have been further classified according to various criteria (e.g., substrate specificity and sequence identity). At first, BGLs had been grouped into Type I and II (Beguin et al., 1990, Ann Rev Microbiol 44:219-248) or subfamily A and B (Rojas et al., 1995, Biochem Mol Biol Int 35:1223-1231); but lately, these earlier classifications have been mostly replaced by the currently-acceptable scheme based on amino acid and/or nucleotide sequence identity (Henrissat et al., 1996, Biochem J 316:695-696).
[0027] Based on amino acid sequence and folding similarities, the BGLs have been placed in either family 1 or 3 among 115 up-to-date (November 2009) defined GH (glycoside-hydrolases) families with an exception of the glucosylceramidases (i.e., acid β-glucosidase placed in family 30) (Henrissat, 1991, Biochem J 280:309-316; Henrissat et al., 1993, Biochem J 293:781-788; Henrissat et al., 1996, Biochem J 316:695-696; Hong et al., 2007, Appl Microbiol Biotechnol 73:1331-1339; and Cantarel et al., 2009, Nucleic Acids Res. 37:D233-238). Family 1 (GH1) contains β-glucosidases from some archaea, bacteria, fungi, plants, and mammals whereas family 3 (GH3) comprises β-glucosidases of some bacterial, fungal, and plant origins (Henrissat et al., 1996, Biochem J 316:695-696; and Cantarel et al., 2009, Nucleic Acids Res. 37:D233-238).
6. Applications
[0028] Beside cellobiose/cellodextrins hydrolysis function used for cellulosic ethanol production, BGLs also can be broadly used in the fields of medicine, agriculture, and food industry wherein many reactions engaging cleavage or synthesis of glycosidic bonds are required. Applications based on hydrolytic activities of BGLs include (1) removal of bitterness from citrus fruit juices, (2) manufacture of low-viscosity gellan foods, (3) detoxification of cassava (the 3rd-4t largest source of calories in the tropics), (4) enhancement of aroma release for benefiting winemaking process, (5) feed additive for enhancing nutrient utilization in single-stomached animals, (6) hydrolysis of genistin to genistein as an antitumor agent, (7) production of melanin from phloridzin for reducing the risk of skin cancer and promoting dark color of hair, (8) production of hydroxytyrosol from oleuropein for preventing coronary heart disease and cancer, (9) hydrolysis of laminarin for the production of yeast extract and the conversion of algal biomass to fermentable sugars, (10) making pigments as natural food dyes in confectionary products, etc. (Bhatia et al., 2002, Crit Rev Biotechnol 22(4):375-407).
[0029] In addition, synthetic activities of BGLs have been widely used in the manufacture of pharmaceuticals, fine chemicals and food ingredients (as summarized by Bhatia et al., 2002, Crit Rev Biotechnol 22(4):375-407). For instance, synthesis of an aromatic n-alkyl glucoside ester that was effective in the treatment of fever, rheumatism, headache, and other ailments (Otto et al., 1998, Biotechnol Lett 20:437-440). Other applications of BGLs also have been reported such as sero-diagnosing histoplasmosis or post-diagnosing hepatic ischemia-reperfusion injury and recovery. Therefore, productions of high quality/quantity BGLs from diverse sources have been fundamental tasks for these multipurpose applications.
[0030] Described herein is an isolated polypeptide including an amino acid sequence at least 70% identical to that of BGL-Cr-D2 (SEQ ID NO:1) or a part thereof having at least 20 (e.g., 30, 50, 80, 100, 150, 200, 250, 300, and 350) contiguous amino acids. This polypeptide has a high beta-glucosidase activity.
[0031] The "percent identity" of two amino acid sequences is determined using the algorithm of Karlin and Altschul Proc. Natl. Acad. Sci. USA 87:2264-68, 1990, modified as in Karlin and Altschul Proc. Natl. Acad. Sci. USA 90:5873-77, 1993. Such an algorithm is incorporated into the NBLAST and XBLAST programs (version 2.0) of Altschul, et al. J. Mol. Biol. 215:403-10, 1990. BLAST protein searches can be performed with the XBLAST program, score=50, wordlength=3 to obtain amino acid sequences homologous to the protein molecules of the invention. Where gaps exist between two sequences, Gapped BLAST can be utilized as described in Altschul et al., Nucleic Acids Res. 25(17):3389-3402, 1997. When utilizing BLAST and Gapped BLAST programs, the default parameters of the respective programs (e.g., XBLAST and NBLAST) can be used.
[0032] The isolated polypeptide can be prepared by purification from a suitable microorganism, e.g., Chaetomella raphigera. It also can be prepared via conventional recombinant technology. An example follows. A DNA fragment encoding BGL-Cr-D2 can be prepared by polymerase chain reaction from Chaetomella raphigera cells and cloned into an expression vector. Upon insertion, the BGL-Cr-D2-encoding fragment is operably linked to a suitable promoter contained in the expression vector. The resultant DNA construct is then introduced into suitable host cells (e.g., E. coli cells, yeast cells, insect cells, and mammalian cells) for expression of BGL-Cr-D2, which can be purified from the cells by conventional methods. One example of yeast cells is Pichia pastoris, e.g., Pichia pastoris strain SMD1168.
[0033] To make a functional equivalent of BGL-Cr-D2, which is also within the scope of this invention, one or more conservative amino acid substitutions can be introduced into SEQ ID NO:1 without disrupting its β-glucosidase activity. A "conservative amino acid substitution" is one in which the amino acid residue is replaced with an amino acid residue having a similar side chain. Families of amino acid residues having similar side chains have been defined in the art. These families include amino acids with basic side chains (e.g., lysine, arginine, histidine), acidic side chains (e.g., aspartic acid, glutamic acid), uncharged polar side chains (e.g., glycine, asparagine, glutamine, serine, threonine, tyrosine, cysteine), nonpolar side chains (e.g., alanine, valine, leucine, isoleucine, proline, phenylalanine, methionine, tryptophan), beta-branched side chains (e.g., threonine, valine, isoleucine) and aromatic side chains (e.g., tyrosine, phenylalanine, tryptophan, histidine). Thus, a predicted nonessential amino acid residue in SEQ ID NO: 1 is preferably replaced with another amino acid residue from the same side chain family. Alternatively, mutations can be introduced randomly along all or part of SEQ ID NO: 1, such as by saturation mutagenesis, and the resultant mutants can be screened for the β-glucosidase activity to identify mutants that retain the activity as described below in the Example section below.
[0034] Fusion protein technology can be applied to improve expression efficiency and facilitate purification of the polypeptide of this invention. To prepare a fusion protein containing BGL-Cr-D2, a DNA fragment encoding this β-glucosidase can be linked to another DNA fragment encoding a fusion partner, e.g., glutathione-s-transferase (GST), 6×-His epitope tag, or M13 Gene 3 protein. The resultant fusion nucleic acid expresses in suitable host cells a fusion protein that can be isolated by methods known in the art. The isolated fusion protein can be further treated, e.g., by enzymatic digestion, to remove the fusion partner and obtain the recombinant polypeptide of this invention.
[0035] Also described herein is an isolated nucleic acid encoding the polypeptide of this invention. A nucleic acid refers to a DNA molecule (e.g., a cDNA or genomic DNA), an RNA molecule, or a DNA/RNA analog, which can be synthesized from nucleotide analogs. In one example, the nucleic acid of this invention is an expression vector in which a DNA fragment encoding the polypeptide is operably linked to a suitable promoter.
[0036] As used herein, the term "promoter" refers to a nucleotide sequence containing elements that initiate the transcription of an operably linked nucleic acid sequence in a desired host microorganism. At a minimum, a promoter contains an RNA polymerase binding site. It can further contain one or more enhancer elements which, by definition, enhance transcription, or one or more regulatory elements that control the on/off status of the promoter. When E. coli is used as the host microorganism, representative E. coli promoters include, but are not limited to, the β-lactamase and lactose promoter systems (see Chang et al., Nature 275:615-624, 1978), the SP6, T3, T5, and T7 RNA polymerase promoters (Studier et al., Meth. Enzymol. 185:60-89, 1990), the lambda promoter (Elvin et al., Gene 87:123-126, 1990), the trp promoter (Nichols and Yanofsky, Meth. in Enzymology 101:155-164, 1983), and the Tac and Trc promoters (Russell et al., Gene 20:231-243, 1982). When yeast is used as the host microorganism, exemplary yeast promoters include 3-phosphoglycerate kinase promoter, glyceraldehyde-3-phosphate dehydrogenase (GAPDH) promoter, galactokinase (GAL1) promoter, galactoepimerase promoter, and alcohol dehydrogenase (ADH) promoter. Promoters suitable for driving gene expression in other types of cells are also well known in the art.
[0037] A vector refers to a nucleic acid molecule capable of transporting another nucleic acid to which it has been linked. The vector can be capable of autonomous replication or integrate into a host DNA. Examples of the vector include a plasmid, cosmid, or viral vector. The vector of this invention includes a nucleotide sequence encoding BGL-Cr-D2 in a form suitable for expression of the nucleic acid in a host cell. Preferably the vector includes one or more regulatory sequences operatively linked to the encoding sequence. A "regulatory sequence" includes promoters, enhancers, and other expression control elements (e.g., polyadenylation signals). Regulatory sequences include those that direct constitutive expression of a nucleotide sequence, as well as tissue-specific regulatory and/or inducible sequences. The design of the expression vector can depend on such factors as the choice of the host cell to be transformed, the level of expression of protein desired, and the like. The expression vector can be introduced into host cells to produce the polypeptide of this invention.
[0038] Also within the scope of this invention is a host cell that contains the above-described nucleic acid. Examples include E. coli cells, insect cells (e.g., using baculovirus expression vectors), yeast cells, plant cells, or mammalian cells. See e.g., Goeddel, (1990) Gene Expression Technology: Methods in Enzymology 185, Academic Press, San Diego, Calif. To produce a polypeptide of this invention, one can culture a host cell in a medium under conditions permitting expression of the polypeptide encoded by a nucleic acid of this invention, and purify the polypeptide from the cultured cell or the medium of the cell. Alternatively, the nucleic acid of this invention can be transcribed and translated in vitro, for example, using T7 promoter regulatory sequences and T7 polymerase.
[0039] Further described herein is a method of converting lignocellulosic material to fermentable products (e.g., fermentable sugars) using a multi-enzyme composition containing the β-glucosidase described herein and other cellulolytic enzymes, such as exo-glucanase and endo-glucosidase. See, e.g., US Application Nos. 20070238155 and 20070250961. The term "cellulolytic enzyme" refers to an enzyme that hydrolyzes cellulose (a polysaccharide consisting of glucose units) into smaller sugar units. See Gilbert H J, Hazlewood G P, 1993 J Gen Microbiol 139:187-194; Olimiya K et al. 1997 Biotechnol Genet Eng Rev. 14:365-414. See also, e.g., US Application 2007016805. This multi-enzyme composition can be obtained from, e.g., a microbial, a plant, or a combination thereof, and will contain enzymes capable of degrading lignocellulosic material. In addition to the cellulolytic enzymes mentioned above, it can further include cellobiohydrolases, endoglucanase, beta.-glucosidases), hemicellulases (such as xylanases, including endoxylanases, exoxylanase, and beta-xylosidase), ligninases, amylases, alpha-arabinofuranosidases, alpha-glucuronidases, alpha-glucuronidases, arabinases, glucuronidases, proteases, esterases (including ferulic acid esterase and acetylxylan esterase), lipases, glucomannanases, or xylogluconases.
[0040] As used herein the term "lignocellulosic material" refers to materials containing cellulose and/or hemicellulose. Generally, these materials also contain xylan, lignin, protein, and carbohydrates, such as starch and sugar. Lignocellulose is found, for example, in the stems, leaves, hulls, husks, and cobs of plants or leaves, branches, and wood of trees. The process of converting a complex carbohydrate (such as starch, cellulose, or hemicellulose) into fermentable sugars is also referred to herein as "saccharification." Fermentable sugars, as used herein, refer to simple sugars, such as glucose, xylose, arabinose, galactose, mannose, rhamnose, surose, fructose, lactose, maltose, trehalose, or cellobiose. Lignocellulosic material can include virgin plant biomass and/or non-virgin plant biomass such as agricultural biomass, commercial organics, construction and demolition debris, municipal solid waste, waste paper, and yard waste. Common forms of lignocellulosic material include trees, shrubs and grasses, wheat, wheat straw, sugar cane bagasse, corn, corn husks, corn kernel including fiber from kernels, products and by-products from milling of grains such as corn, rice, wheat, and barley (including wet milling and dry milling), as well as municipal solid waste, waste paper, and yard waste. The lignocellulosic material can also be, but is not limited to, herbaceous material, agricultural residues, forestry residues, and paper mill residues. "Agricultural biomass" includes branches, bushes, canes, corn and corn husks, energy crops, forests, fruits, flowers, grains, grasses, herbaceous crops, leaves, bark, needles, logs, roots, saplings, short rotation woody crops, shrubs, switch grasses, trees, vegetables, fruit peels, vines, sugar beet pulp, wheat midlings, oat hulls, hard and soft woods (not including woods with deleterious materials), organic waste materials generated from agricultural processes including farming and forestry activities, specifically including forestry wood waste, or a mixture thereof.
[0041] The fermentable sugar produced in the method described above can be converted to useful value-added fermentation products via enzyme treatment or chemical reaction. Examples of the fermentation product include, but are not limited to amino acids, vitamins, pharmaceuticals, animal feed supplements, specialty chemicals, chemical feedstocks, plastics, solvents, fuels, or other organic polymers, lactic acid, and ethanol, including fuel ethanol. Specific value-added fermentation products that may be produced by the methods of the invention include, but not limited to, biofuels (including ethanol and butanol); lactic acid; plastics; specialty chemicals; organic acids, including citric acid, succinic acid and maleic acid; solvents; animal feed supplements; pharmaceuticals; vitamins; amino acids, such as lysine, methionine, tryptophan, threonine, and aspartic acid; chemical feedstocks. The fermentable sugar can also be used for culturing microbes that produce fermentation products, e.g., industrial enzymes, such as proteases, cellulases, amylases, glucanases, lactases, lipases, lyases, oxidoreductases, transferases and xylanases.
[0042] The invention also provides a method of producing energy from lignocellulosic material. This method include providing the multi-enzyme composition described above; contacting the composition with the lignocellulosic material to produce a fermentable product; fermenting the fermentable product to produce a combustible fermentation product, and combusting the combustible fermentation product to produce energy. This method can be performed in a bioreactor that contains all necessary components and may preferably be configured for anaerobic growth of microorganisms. Methods for making and using bioreactors are known in the art. See, e.g., US Application 20080131958.
[0043] The polypeptide and composition described above can also be used in the paper and pulp industry. For example, the polypeptide, which has a high β-glucosidase activity can be used in the deinking and refining of recycled paper. In this application, utilizing of them could reduce the amount of enzyme used per ton of paper substantially, and reduce the time of exposure to the enzyme needed to increase the brightness of the paper. Reducing the concentration of enzyme and the time of exposure to the enzyme in the refining process, correspondingly and desirably reduces the reaction of the cellulase on the fibrils themselves and processing costs.
[0044] The polypeptide of this invention has additional industrial applications. Given its high β-glucosidase activity, this polypeptide can function at a lower amount. The polypeptide, in combination with other enzymes, can be used, with enhanced yields, in extracting juice from fruits, or extracting juice or soup flavorings from vegetables. In combination with protease, it can be used to dissociate dried seaweed, which is then fermented with alcohol to produce vinegar. The polypeptide, mixed with other enzymes, can also serve as a dough conditioner in the baking industry. See, e.g., U.S. Pat. No. 6,602,700.
[0045] Moreover, the polypeptide of this invention can also be used in the textile industry. It can be used to brighten and soften cotton fabrics by removing microfibers on the surface, which causes a dull look of clothes. More specifically, it can be included as an additive in formulating enzyme-containing detergents for soil removal, fabric softening, and color brightening. For example, it can be used as a replacement to pumice in producing blue jeans having a "stone-washed" effect. Enzyme treatment causes less damage to the jean fabric than lengthy exposure to pumice. See U.S. Pat. Nos.5,232,851, 5,677,151, 6,451,063, and 7,226,773.
[0046] In another aspect, the present invention provides a transgenic plant, the genome of which is augmented with a recombinant polynucleotide encoding a polypeptide of this invention operably linked to a promoter sequence. The polynucleotide is optimized for expression in the plant and the polypeptide is produced at a level greater than 5% total soluble protein, greater than 10% total soluble protein or greater than 20% total soluble protein. The polypeptide may be expressed constitutively or tissue-specifically. For example, it may be expressed in a plant tissue selected from the group consisting of stems and leaves. It may also be expressed in a targeted sub-cellular compartment or organelle, such as apoplast, chloroplast, cell wall, or vacuole. The plant may be a monocotyledonous plant or a dicotyledonous plant. In certain embodiments, the plant is a crop plant. The plant may be selected from the group consisting of corn, switchgrass, sorghum, miscanthus, sugarcane, poplar, pine, wheat, rice, soy, cotton, barley, turf grass, tobacco, bamboo, rape, sugar beet, sunflower, willow, and eucalyptus. Methods for making transgenic plants are well known in the art.
[0047] The specific examples below are to be construed as merely illustrative, and not limitative of the remainder of the disclosure in any way whatsoever. Without further elaboration, it is believed that one skilled in the art can, based on the description herein, utilize the present invention to its fullest extent. All publications cited herein are hereby incorporated by reference in their entirety. Further, any mechanism proposed below does not in any way restrict the scope of the claimed invention.
I. Materials and methods
1. Cultivation of Native D2 Fungal Strain
[0048] The native fungal strain (i.e., Chaetomella raphigera D2) was cultured with MR (Mendels-Reese) medium (pH 5.0) to harvest fungal cells for DNA/RNA extractions and their crude enzymes for property analyses (e.g., amino acid sequences, cellulosic activities, and enzyme concentrations). Each one-liter MR medium contains 1 g soy peptone, 1.4 g (NH4)2SO4, 0.3 g urea, 2.0 g KH2PO4, 0.34 g CaCl2, 0.3 g MgSO4.7H2O, 5.0 mg FeSO4.7H2O, 1.6 mg MnSO4.7H2O, 1.4 mg ZnSO4.7H2O, 2 mg CoCl2.6H2O, and 0.72 g cellobiose. Fungal culture was grown at a temperature of 30° C. and a mixing rate of 125 rpm for 4 days prior to gather fungal cells and their crude enzymes.
2. Nucleic Acid Extraction and Crude Enzyme Collection
[0049] Genomic DNA of the C. raphigera D2 fungus was extracted using Wizard® Genomic DNA Purification Kit (Promega, USA) and fungal RNA was extracted using Plant Total RNA Miniprep Purification Kit (GeneMark, Taiwan). Concentrations of extracted nucleic acid were measured by a spectrophotometer (NanoDrop ND-1000, Thermo Scientific, USA). The DNA and RNA extracts were stored at -20° C. prior to further treatments (e.g., PCR and RT PCR) were conducted.
[0050] Crude enzymes were collected by filtering supernatant of 4-day fungal culture through Whatman No.1 filter paper (Whatman/GE Healthcare, USA), then 0.45 μm Supor® Membrane Disc Filter (PALL, USA), and followed by 30K NMWL (Nominal Molecular Weight Limit) Amico® Ultra-15 Centrifugal Filter (Millipore, USA). Enzyme concentrations were determined based on Bio-Rad Protein Assay using BSA (bovine serum albumin) as calibration standard (Bio-Rad, USA). The filtered enzymes were stored at 4° C. until further analyses were conducted.
3. Enzyme Electrophoresis and N-terminal Sequencing
[0051] Enzyme electrophoreses including native PAGE (polyacrylamide gel electrophoresis) and SDS (sodium dodecylsulfate) PAGE were performed based on the protocols suggested by Hoefer's Protein Electrophoresis Applications Guide (Hoefer Scientific Instruments, 1994). Before Coomassie blue staining for native PAGE, zymogram was carried out by immersing gel with 0.5 mM MUG (4-Methylumbelliferyl β-D-glucopyranoside) solution and incubating at 50° C. for 15 minutes and then followed by observation under UV light (Benoit et al., 1995, Curr Microbiol 30:305-312). Additionally, instead of coomassie blue staining for SDS-PAGE, silver staining using PlusOne Silver Staining Kit (GE Healthcare, USA) was also performed for observing some enzymes with low concentration.
[0052] For N-terminal sequence analysis, the target enzyme (i.e., BGL with MUG activity) was transferred from SDS-PAGE gel to PVDF membrane (iBlot Gel Transfer Stacks, Invitrogen, USA) and analyzed by Applied Biosystems Procise Protein Sequencer model 494 (Applied Biosystems, USA) based on Edman degradative chemistry. Part of the revealed N-terminal sequence, PGDGDWA (SEQ ID NO: 9), was used to design a degenerate primer D2-bgl-NT: CCN GGN GAY GGN GAY TGG GC (SEQ ID NO: 10) for further amplifying target bgl gene from genomic DNA and/or reverse transcribed cDNA.
4. Cloning and Sequencing for D2-bgl Gene
[0053] The bgl-Cr-D2 cDNA was first reverse transcribed (RT) from RNA sample using a poly-T primer: GGT TCT TGC CAC AGT CAC GAC TTT TTT TTT TTT TTT TTT (SEQ ID NO: 11) and SuperScript® III Reverse Transcriptase (Invitrogen, USA); and, subsequently amplified using a primer set of D2-bgl-NT and poly-T anchor primer: GGT TCT TGC CAC AGT CAC GAC (SEQ ID NO: 12) and a DNA polymerase (TaKaRa Ex Taq®, TaKaRa Bio Inc, Japan). PCR thermo-cycle conditions were 94° C. for 4 minutes, followed by 30 cycles at 94° C. for 1 minute, 58° C. for 30 seconds, and 72° C. for 3 minutes, and then a final elongation at 72° C. for 5 minutes. The RT-PCR product (i.e., amplicon of bgl-Cr-D2 cDNA) was cloned in pGEM®-T Easy vector (Promega, USA) and transferred into E. coli cells (strain DH5α) for preservation and further sequencing. Besides, the bgl-Cr-D2 gene (containing introns) was amplified from genomic DNA sample using a primer set of D2-bgl-f: CCT GGT GAT GGT GAT TGG GCA GC (SEQ ID NO: 13) and D2-bgl-r: ATG TCC ACC TTT CCG AAT ACC TTG GC (SEQ ID NO: 14) and TaKaRa Ex Tag®. The PCR product was also cloned in pGEM®-T Easy vector for sequencing. Intron sections were determined by comparing bgl-Cr-D2 cDNA (without introns) and bgl-Cr-D2 genomic DNA (with introns) sequences.
[0054] Additionally, signal sequence upstream of the bgl-Cr-D2 gene was revealed by randomly digesting genomic DNA with SacI restriction enzyme (NEB, New England Biolabs Inc., USA) and self-ligation with T4 DNA ligase (Promega, USA); and then, followed by inverse PCR using a primer set of D2-bgl233f: CGT TTC GTC CAA AAT GTA ACA GCA T (SEQ ID NO: 15) and D2-bgl232r: GAT GCT TTC ACC GTC AGT TCT GA (SEQ ID NO: 16). The PCR program was as follow: 5 minutes at 95° C., followed by 25 cycles consisting of 1 minute at 95° C., 1 minute at 55° C., and 6 minutes at 72° C. and a final cycle consisting of 10 minutes at 72° C. The inverse PCR product (should contain a sequence in the order of D2-bgl233f+partial D2-bgl+SacI+partial D2 genome+signal sequence of D2-bgl+partial D2-bgl+D2-bgl232r) was cloned in pGEM®-T Easy vector for further sequencing.
[0055] The putative amino acid sequence of BGL-Cr-D2 from revealed bgl-Cr-D2 cDNA was aligned with other GH3 family BGLs using Clustal X software (Thompson et al., 1997, Nucleic Acids Research 25(24): 4876-4882) to analyze phylogenetic relationships. The alignment result was used to build phylogenetic tree that was visualized in TreeView (Page, 1996, Computer Applications in the Biosciences 12:357-358).
5. Expression in Pichia pastoris
[0056] The bgl-Cr-D2 cDNA (cloned in the pGEM®-T Easy Vector) was further constructed in pGAPZα C vector for constitutive expression of recombinant BGL-Cr-D2-Pp in Pichia yeast (Invitrogen, USA). Briefly, the bgl-Cr-D2 cDNA in the Easy vector was PCR amplified with a primer set of D2-bgl-f-EcoRI: CGC TTG AAT TCG ATG CCT GGT GAT GGT GAT TGG (SEQ ID NO: 17) and D2-bgl-r-NotI: TTC AAG CGG CCG CAT GTC CAC CTT TCC GAA TAC C (SEQ ID NO: 18). This PCR product was joined into the pGAPZα C vector by double digesting amplicon and vector with EcoRI and NotI (NEB, USA) and then ligation with T4 DNA ligase. The constructed pGAPZα C vector carrying bgl-Cr-D2 cDNA was transformed into E. coli (strain DH5α) for preservation. Prior to subsequently transforming into Pichia host, the pGAPZα C vector with D2-bgl cDNA was purified with Plasmid Miniprep Purification Kit (GeneMark, Taiwan) and linearized with B spHI (NEB, USA). The linear plasmid DNA was transformed into Pichia pastoris (strains GS115 and SMD1168) via homologous recombination based on protocol for Pichia Expression Kit (Invitrogen, USA). The reconstructed Pichia pastoris yeast cells were grown in YPD.sub.zeocin100 medium and the conditions (e.g., 30° C. and 200 rpm) suggested in the Invitrogen protocol. Pichia cells and their enzymes were collected at different time points (0-14 days) to assess their growth and enzyme activity/quantity.
6. Activity Tests
[0057] Cellulolytic activities of crude enzymes from native D2 strain and recombinant enzymes from Pichia pastoris were tested using different substrates (p-nitrophenyl β-D-glucopyranoside, pNPG; cellobiose; carboxymethylcellulose, CMC; xylan; avicel; and, filter paper). The pNPGase activities of the enzymes were measured based on pNP releasing rate from 2 mM pNPG at 55° C. in 5 minutes. The pNP concentrations were calibrated based on spectrophotometric absorbance at OD405. The cellobiase activities were assessed by cellobiose reducing rate and/or glucose producing rate where cellobiose and glucose concentrations were determined by HPLC (high performance liquid chromatography) analyses. Additionally, CMCase (for endo-glucanase activity), xylanase (for xylose hydrolyzing activity), avicelase (for exo-glucanase activity), and FPase (representing total cellulase activity) were measured based on production rate of reducing sugars that were detected by DNS (dinitrosalicyclic acid) method.
II. Results
[0058] In this invention, a Taiwan-indigenous fungus Chaetomella raphigera strain D2 was first found capable of secreting crude enzyme with significant β-glucosidase activity based on assessment using pNPG (p-nitrophenyl-β-D-glucopyranoside) and cellobiose as substrates. The gene (bgl-Cr-D2) encoding the target β-glucosidase (BGL-Cr-D2) was derived from genomic DNA and reverse-transcribed cDNA. It showed that the bgl-Cr-D2 cDNA had a size of 2166 bps (not including signal sequence and stop codon) and the bgl-Cr-D2 genomic DNA has a size of 2851 bps (FIGS. 1 & 2). Twelve introns were identified after comparing the bgl-Cr-D2 genomic DNA with cDNA. The bgl-Cr-D2 gene would encode 722 bases of amino acid to form the BGL-Cr-D2 (FIG. 2). Based on the comparison of the amino acid sequences for the β-glucosidases derived from glycoside hydrolase (GH) family 3, the newly-found BGL-Cr-D2 was homologous to the β-D-glucoside glucohydrolase of Talaromyces stipitatua (65% similarity) and Aspergillus terreus (64% similarity) (FIG. 3). Based on the phylogenetic analyses, C. raphigena D2, T. stipitatua, and A. terreus formed a sub-group that could distinguish from other fungal and bacterial groups (e.g., Penicillium, Sacharomycopis, Clostridium, and Escherichia coli).
[0059] The bgl-Cr-D2 gene was further cloned into pGAPZαC vector (FIG. 4) and transformed to a yeast expression system using Pichia pastoris strain SMD1168 as host cells. When compared the growth and pNPGase activity for the Pichia pastoris strain SMD1168 and that carrying bgl-Cr-D2, significant pNPGase activity was observed only for the colony cloned with bgl-Cr-D2 while the growth profiles were similar for the both colonies (FIG. 5). The growth and activity reached plateau levels after incubating at 30° C. for around 4-6 days. Furthermore, the recombinant β-glucosidase (BGL-Cr-D2-Pp) had a molecular size of around 95 kDa (FIG. 6C) (larger than theoretical size of 79 kDa due to post-translational glycosylation); and, showed high specific activity based on assessments of saccharifying rate (Table 1) and observations from native PAGE (polyacrylamide gel electrophoresis) (FIG. 6A)/MUG (4-methylumbelliferyl-β-D-glucopyranoside) zymogram (FIG. 6B).
[0060] The recombinant BGL-Cr-D2-Pp remained active after 20-hour incubation under a wide pH range (i.e., pH4-pH9) (FIG. 7). However, the pNPGase activity of this enzyme significantly decreased at pH levels higher than 5.5. Besides, the BGL-Cr-D2-Pp had highest pNPGase activity at a temperature of 75° C.; but, at this high temperature, the activity dropped tremendously after short-term incubation (i.e., 2 hours) (FIGS. 8 & 9). Thus, the pNPGase activities of the recombinant BGL-Cr-D2-Pp were performed at pH 5 and a temperature of 55° C. for 10 minutes.
TABLE-US-00001 TABLE 1 Comparison of enzyme activities for native BGL-Cr-D2, Pichia pastoris strain SMD1168 secretes, recombinant BGL-Cr-D2-Pp secreted from P. pastoris, and Novozyme-188 for different substrates (all reactions were performed at pH 5 and 55° C.). Reaction time BGL-Cr- Pp BGL-Cr- Substrates (minutes) D2 SMD1168 D2-Pp N-188 Activity (U**/mg BGL) 2 mM pNPG* 5 1,029.9 41.1 2,963.9 171.7 2 mM 5 -- -- 57.5 11.1 Cellobiose Activity (U/mg enzymes.sup.††) 1% CMC.sup.† 30 121.1 1.0 0.7 1.1 1% Xylan 30 186.9 0.0 3.8 3.4 1% Avicel 120 0.0 0.2 0.0 0.0 FP.sup..dagger-dbl. 120 0.1 0.0 0.1 0.0 *p-nitrophenyl-β-D-glucopyranoside; .sup.†carboxymethylcellulose; .sup..dagger-dbl.filter paper; **1 U = 1 μmole pNP produced from pNPG, glucose produced from cellobiose, or reduced saccharides produced from CMC, xylan, avicel or FP per minute; .sup.††total enzymes except BGL.
Other Embodiments
[0061] All of the features disclosed in this specification may be combined in any combination. Each feature disclosed in this specification may be replaced by an alternative feature serving the same, equivalent, or similar purpose. Thus, unless expressly stated otherwise, each feature disclosed is only an example of a generic series of equivalent or similar features.
[0062] A number of embodiments of the invention have been described. Nevertheless, it will be understood that various modifications may be made without departing from the spirit and scope of the invention. Accordingly, other embodiments are within the scope of the following claims.
Sequence CWU
1
1
181722PRTChaetomella raphigera 1Pro Gly Asp Gly Asp Trp Ala Ala Ala Tyr
Lys Lys Ala Thr Ala Ala1 5 10
15Leu Ala Lys Leu Ser Asn Thr Asp Lys Ala Ser Ile Val Thr Gly Val
20 25 30Gly Trp Glu Lys Gly Pro
Cys Val Gly Asn Thr Ala Ala Val Ala Ser 35 40
45Ile Gly Leu Pro Glu Leu Cys Tyr Gln Asp Gly Pro Leu Gly
Ile Arg 50 55 60Phe Val Gln Asn Val
Thr Ala Phe Pro Thr Gly Ile Gln Thr Ala Ser65 70
75 80Thr Trp Asp Ile Ser Leu Ile Tyr Ser Arg
Gly Leu Ala Leu Gly Gln 85 90
95Glu Ala Lys Ala Leu Gly Ile Asn Val Gln Leu Gly Pro Val Ala Gly
100 105 110Pro Ile Gly Lys Ile
Pro Glu Ala Gly Arg Asn Trp Glu Gly Phe Ser 115
120 125Pro Asp Pro Tyr Leu Asn Gly Leu Ala Met Ser Asn
Thr Ile Thr Gly 130 135 140Met Gln Asp
Ala Gly Val Gln Ala Cys Ala Lys His Phe Ile Gly Asn145
150 155 160Glu Gln Glu Thr Asn Arg Asp
Thr Met Ser Ser Asn Ile Asp Asp Arg 165
170 175Thr Phe His Glu Leu Tyr Leu Trp Pro Phe Ala Asp
Ala Ile Lys Ala 180 185 190Asn
Val Ala Ser Ile Met Cys Ser Tyr Asn Lys Phe Asn Glu Thr Tyr 195
200 205Ala Cys Glu Asn Asn Phe Leu Thr Thr
Ile Leu Lys Gly Glu Leu Asp 210 215
220Phe Gln Gly Phe Val Val Ser Asp Trp Ala Ala Gln His Thr Thr Ile225
230 235 240Gly Ser Ala Asn
Ala Gly Leu Asp Val Ala Met Pro Gly Asp Asn Phe 245
250 255Gly Asp Asn Tyr Tyr Leu Trp Gly Ser Asn
Leu Leu Ala Ala Ile Ser 260 265
270Asn Gly Thr Val Ala Gln Ser Arg Leu Asp Asp Met Val Thr Arg Ile
275 280 285Leu Ala Ser Trp Tyr Phe Val
Gly Gln Asp Gln Gly Tyr Pro Ala Val 290 295
300Thr Trp Ser Ser Trp Asn Gly Gly Leu Gly Gly Pro Asn Val Gln
Ala305 310 315 320Asp His
Lys Gln Val Ala Arg Ala Ile Ala Arg Asp Gly Ile Val Leu
325 330 335Leu Thr Asn Lys Asn Lys Ala
Leu Pro Leu Lys Lys Pro Ala Ser Leu 340 345
350Ala Ile Ile Gly Gln Asp Ala Ile Asp Asn Pro Ala Gly Ile
Asn Ser 355 360 365Cys Ser Asp Arg
Gly Cys Asp Thr Gly His Leu Ala Met Gly Trp Gly 370
375 380Ser Gly Thr Ala Asp Phe Pro Tyr Leu Val Ala Pro
Leu Asp Ala Ile385 390 395
400Thr Pro Leu Ala Gln Ala Gln Gly Thr Lys Leu Val Leu Ser Thr Thr
405 410 415Asp Ser Thr Ser Ala
Ala Ala Ser Ala Ala Ala Ala Ala Glu Thr Ala 420
425 430Ile Val Phe Ile Thr Ala Asp Ser Gly Glu Gly Tyr
Ile Thr Val Asp 435 440 445Gly Gln
Leu Gly Asp Arg Asn Ser Leu Ala Pro Trp Asn Asn Gly Thr 450
455 460Ala Leu Val Gln Ala Val Ala Ser Ala Ser Lys
Asn Val Ile Val Val465 470 475
480Ile Asn Ser Val Gly Pro Leu Ile Leu Glu Asp Ile Leu Ala Leu Ser
485 490 495Ser Val Lys Ala
Ile Val Trp Ala Gly Val Ser Gly Gln Glu Ser Gly 500
505 510Asn Gly Leu Ala Asp Ile Leu Tyr Gly Ser Val
Ser Pro Ser Gly Lys 515 520 525Leu
Pro Tyr Thr Ile Ala Lys Gln Ala Ser Asp Tyr Gly Thr Ala Ile 530
535 540Val Pro Gly Asp Asp Asn Phe Pro Glu Gly
Leu Phe Val Asp Tyr Arg545 550 555
560His Phe Asp Gln Ala Asn Ile Gln Pro Arg Phe Glu Phe Gly Tyr
Gly 565 570 575Leu Ser Tyr
Thr Thr Phe Gln Tyr Ser Gln Leu Thr Ala Lys Tyr Ser 580
585 590Asp Thr Ser Ala Gly Ser Ser Thr Leu Ala
Pro Gly Gly Pro Lys Gly 595 600
605Leu Tyr Asp Ile Val Ala Thr Val Thr Ala Lys Val Thr Asn Ser Gly 610
615 620Thr Val Ser Gly Ala Glu Val Ala
Gln Leu Tyr Ile Gly Leu Pro Gly625 630
635 640Ser Ala Pro Ala Ser Pro Pro Lys Gln Leu Arg Gly
Phe Asp Lys Ile 645 650
655Ser Leu Lys Pro Gly Lys Ser Gly Thr Val Thr Phe Asn Leu Arg Arg
660 665 670Lys Asp Leu Ser Tyr Trp
Asp Thr Ala Ser Ala Gln Trp Val Thr Pro 675 680
685Thr Ser Gly Glu Phe Ser Leu Tyr Val Gly Ala Ser Ser Arg
Asp Ile 690 695 700Arg Leu Gln Gly Ser
Leu Lys Cys Ser Gly Gln Gly Ile Arg Lys Gly705 710
715 720Gly His22166DNAChaetomella raphigera
2cctggtgatg gtgattgggc agcggcttat aaaaaagcta ccgctgctct cgcaaaattg
60agcaacaccg acaaagccag tattgttact ggcgtgggtt gggaaaaggg tccttgcgtt
120gggaatactg ctgcagtagc atcaattgga cttccagagc tttgctatca agatggaccc
180ctaggcatcc gtttcgtcca aaatgtaaca gcatttccta caggtattca gactgcttcc
240acgtgggaca ttagtttgat ctacagtcga ggtttagctt taggtcagga ggcaaaggca
300cttggcatca atgtccagct cggtccagtt gctggtccca ttggtaaaat tccagaggcc
360ggtcgcaact gggaaggctt ttctccagat ccatacctga atggtctagc aatgtccaat
420accattaccg gaatgcaaga tgctggcgta caggcttgtg caaagcactt tattggaaac
480gagcaagaaa cgaatcgcga tacaatgagc tccaacatcg atgatcggac tttccatgag
540ttgtacctct ggcctttcgc agatgccatc aaggccaacg ttgcatccat aatgtgctca
600tacaacaagt tcaatgagac atacgcttgt gagaacaact tcttgacaac tattctcaag
660ggcgagcttg actttcaagg cttcgtcgtc agcgactggg ctgctcagca tactactatt
720ggaagtgcaa atgctggtct ggatgttgcg atgcctggtg acaattttgg cgacaactat
780tatctatggg gcagcaatct tctggcggcg atctccaacg gcactgttgc ccaatctcgc
840cttgatgaca tggtcactcg catcctggct tcatggtact ttgttggtca ggaccaaggc
900tatccagctg tgacctggtc ctcttggaat ggtggattgg gaggtcccaa tgtgcaggct
960gatcataagc aggtcgcacg cgccatcgcc cgagatggca ttgtcctact cacgaacaaa
1020aacaaagccc tgcccctgaa gaagcccgct agtttagcaa tcattggtca agatgccatc
1080gacaatcctg caggcatcaa ctcgtgctcc gaccgcggtt gtgatacggg tcacttggct
1140atgggttggg gctcgggcac agcagacttt ccatacctcg ttgcgccgct cgatgccatc
1200acacccttag ctcaggccca gggaacaaag cttgtattgt cgacgaccga cagtacttct
1260gccgctgcta gtgctgccgc cgccgcagag acagcaatcg tcttcatcac tgccgattca
1320ggagagggat acatcactgt tgacggccaa ttgggtgatc gcaactcact tgctccatgg
1380aataatggca ctgctctcgt tcaagcggta gccagcgcca gcaaaaatgt cattgtcgtg
1440atcaacagtg tcggcccatt gattctcgag gacattctcg ctctttccag cgtgaaagca
1500attgtctggg ctggcgtctc gggccaagaa tcgggcaatg gacttgctga tattctttac
1560ggttcagtat ctcccagtgg gaaactccca tacacaatcg ccaaacaggc cagcgactat
1620ggaacagcca ttgtgcccgg tgacgataac tttcccgaag gattgtttgt agactatcgt
1680catttcgacc aagcaaacat ccagccgcgt tttgaatttg gctatggcct ctcctatacg
1740acctttcaat actcgcagct taccgcaaag tactccgata cttccgcagg cagctccact
1800ctcgcccctg gcggacccaa ggggctgtat gatattgttg caacggtaac agcaaaggtg
1860acaaacagcg gcaccgtcag tggcgctgaa gtcgcacagc tatacattgg tttgccgggc
1920tcggcgcctg catctccacc caagcagcta cgcggctttg acaaaatcag cctcaagcca
1980ggcaaatcag gcacggtgac gtttaacctg cgccgaaagg acctcagcta ctgggatact
2040gcctcggctc agtgggttac accgaccagc ggcgagttct ccctgtatgt tggtgctagc
2100tcgagagata taaggttaca ggggtctttg aaatgctcgg gccaaggtat tcggaaaggt
2160ggacat
216632851DNAChaetomella raphigera 3cctggtgatg gtgattgggc agcggcttat
aaaaaagcta ccgctgctct cgcaaaattg 60agcaacaccg acaaagccag tattgttact
ggcgtgggtt gggaaaaggg tccttgcgtt 120gggaatactg ctgcagtagc atcaattgga
cttccagagc tttgctatca agatggaccc 180ctagggtagg ctgctggtta tttgtagtct
cagaactgac ggtgaaagca tccgtttcgt 240ccaaaatgta acagcatttc ctacaggtat
tcagactgct tccacgtggg acattagttt 300gatctacagt cgaggtttag ctttaggtaa
gtccaagtac gctgtcaaga tttgaattaa 360gagtattagg tcaggaggca aaggcacttg
gcatcaatgt ccagctcggt ccagttgctg 420gtcccattgg taaaattcca gaggtgtgtt
tagctggacc ctgtaaagtc ttgcgactga 480tatcgtacag gccggtcgca actgggaagg
cttttctcca gatccatacc tgaatggtct 540agcaatgtcc aatgtatgtg acgacttctt
cccctaggaa aaggcattga ttggtataga 600ccattaccgg aatgcaagat gctggcgtac
aggcttgtgc aaagcacttt agtaagttgg 660ttgacggcag ggtcatgcaa gatctgagac
tcttcagttg gaaacgagca agaaacgaat 720cgcgatacaa tgagctccaa catcgatgat
cggactttcc atgaggtagg tgatactcgc 780aaacaaagat actttggctg atacagatag
ttgtacctct ggcctttcgc agatgccatc 840aaggccaacg ttgcatccat aatgtgctca
tacaacaagt tcaatgagac atacgcttgt 900gagaacaact tcttgacaac tgtaagttct
aaggctcgtg catcaattct tcggaatcgc 960attaacgtct tcagattctc aagggcgagc
ttgactttca aggcttcgtc gtcagcggta 1020cttggtttcc tatcctgctg ttgaaataag
cacagagcta acgcatggtg cgctaagact 1080gggctgctca gcatactact attggaagtg
caaatgctgg tcttgtacgt tgacaaacat 1140gcgattcata ataccattgt agatctcgat
cccagcatga actgacacgt tttcaaggat 1200gttgcgatgc ctggtgacaa ttttggcgac
aactattatc tatggggcag caatcttctg 1260gcggcgatct ccaagtaagt tcctggttcg
caaatggcct tgacgttgct gatctctgtg 1320ccttccatag cggcactgtt gcccaatctc
gccttgatga catggtcact cgcatcctgg 1380cttcatggta ctttgttggt caggaccaag
gctatccagc tgtgacctgg tcctcttgga 1440atggtggatt gggaggtccc aatgtgcagg
ctgatcataa gcaggtcgca cgcgccatcg 1500cccgagatgg cattgtccta ctcacgaaca
aaaacaaagc cctgcccctg aagaagcccg 1560ctagtttagc aatcattggt caagatgcca
tcgacaatcc tgcaggcatc aactcgtgct 1620ccgaccgcgg ttgtgatacg ggtcacttgg
ctatgggttg gggctcgggc acagcagact 1680ttccatacct cgttgcgccg ctcgatgcca
tcacaccctt agctcaggcc cagggaacaa 1740agcttgtatt gtcgacgacc gacagtactt
ctgccgctgc tagtgctgcc gccgccgcag 1800agacagcaat cgtcttcatc actgccgatt
caggagaggg atacatcact gtgagtttct 1860ggcatcgagt gctgagcctc tacatttcct
aatcagtcac gcacaggttg acggccaatt 1920gggtgatcgc aactcacttg ctccatggaa
taatggcact gctctcgttc aagcggtagc 1980cagcgccagc aaaaatgtca ttgtcgtgat
caacagtgtc ggcccattga ttctcgagga 2040cattctcgct ctttccagcg tgaaagcaat
tgtctgggct ggcgtctcgg gccaagaatc 2100gggcaatgga cttgctgata ttctttacgg
ttcagtatct cccagtggga aactcccata 2160cacaatcgcc aaacaggcca gcgactatgg
aacagccatt gtgcccggtg acgataactt 2220tcccgaagga ttgtttgtag actatcgtca
tttcgaccaa gcaaacatcc agccgcgttt 2280tgaatttggc tatggcctct gtaagtccca
cccgcgcgcg ttcctttccc ccacgatgat 2340gataaggaga tgtgatatgt gaagagagtc
gaccttctca gccgctgcta tagttagacg 2400ctaacaaacg tcaacagcct atacgacctt
tcaatactcg cagcttaccg caaagtactc 2460cgatacttcc gcaggcagct ccactctcgc
ccctggcgga cccaaggggc tgtatgatat 2520tgttgcaacg gtaacagcaa aggtgacaaa
cagcggcacc gtcagtggcg ctgaagtcgc 2580acagctatac attggtttgc cgggctcggc
gcctgcatct ccacccaagc agctacgcgg 2640ctttgacaaa atcagcctca agccaggcaa
atcaggcacg gtgacgttta acctgcgccg 2700aaaggacctc agctactggg atactgcctc
ggctcagtgg gttacaccga ccagcggcga 2760gttctccctg tatgttggtg ctagctcgag
agatataagg ttacaggggt ctttgaaatg 2820ctcgggccaa ggtattcgga aaggtggaca t
28514740PRTChaetomella raphigera 4Met
Tyr Ser Arg Ala Phe Thr Phe Ala Ala Leu Ala Val Pro Ile Leu1
5 10 15Ala Ala Pro Gly Asp Gly Asp
Trp Ala Ala Ala Tyr Lys Lys Ala Thr 20 25
30Ala Ala Leu Ala Lys Leu Ser Asn Thr Asp Lys Ala Ser Ile
Val Thr 35 40 45Gly Val Gly Trp
Glu Lys Gly Pro Cys Val Gly Asn Thr Ala Ala Val 50 55
60Ala Ser Ile Gly Leu Pro Glu Leu Cys Tyr Gln Asp Gly
Pro Leu Gly65 70 75
80Ile Arg Phe Val Gln Asn Val Thr Ala Phe Pro Thr Gly Ile Gln Thr
85 90 95Ala Ser Thr Trp Asp Ile
Ser Leu Ile Tyr Ser Arg Gly Leu Ala Leu 100
105 110Gly Gln Glu Ala Lys Ala Leu Gly Ile Asn Val Gln
Leu Gly Pro Val 115 120 125Ala Gly
Pro Ile Gly Lys Ile Pro Glu Ala Gly Arg Asn Trp Glu Gly 130
135 140Phe Ser Pro Asp Pro Tyr Leu Asn Gly Leu Ala
Met Ser Asn Thr Ile145 150 155
160Thr Gly Met Gln Asp Ala Gly Val Gln Ala Cys Ala Lys His Phe Ile
165 170 175Gly Asn Glu Gln
Glu Thr Asn Arg Asp Thr Met Ser Ser Asn Ile Asp 180
185 190Asp Arg Thr Phe His Glu Leu Tyr Leu Trp Pro
Phe Ala Asp Ala Ile 195 200 205Lys
Ala Asn Val Ala Ser Ile Met Cys Ser Tyr Asn Lys Phe Asn Glu 210
215 220Thr Tyr Ala Cys Glu Asn Asn Phe Leu Thr
Thr Ile Leu Lys Gly Glu225 230 235
240Leu Asp Phe Gln Gly Phe Val Val Ser Asp Trp Ala Ala Gln His
Thr 245 250 255Thr Ile Gly
Ser Ala Asn Ala Gly Leu Asp Val Ala Met Pro Gly Asp 260
265 270Asn Phe Gly Asp Asn Tyr Tyr Leu Trp Gly
Ser Asn Leu Leu Ala Ala 275 280
285Ile Ser Asn Gly Thr Val Ala Gln Ser Arg Leu Asp Asp Met Val Thr 290
295 300Arg Ile Leu Ala Ser Trp Tyr Phe
Val Gly Gln Asp Gln Gly Tyr Pro305 310
315 320Ala Val Thr Trp Ser Ser Trp Asn Gly Gly Leu Gly
Gly Pro Asn Val 325 330
335Gln Ala Asp His Lys Gln Val Ala Arg Ala Ile Ala Arg Asp Gly Ile
340 345 350Val Leu Leu Thr Asn Lys
Asn Lys Ala Leu Pro Leu Lys Lys Pro Ala 355 360
365Ser Leu Ala Ile Ile Gly Gln Asp Ala Ile Asp Asn Pro Ala
Gly Ile 370 375 380Asn Ser Cys Ser Asp
Arg Gly Cys Asp Thr Gly His Leu Ala Met Gly385 390
395 400Trp Gly Ser Gly Thr Ala Asp Phe Pro Tyr
Leu Val Ala Pro Leu Asp 405 410
415Ala Ile Thr Pro Leu Ala Gln Ala Gln Gly Thr Lys Leu Val Leu Ser
420 425 430Thr Thr Asp Ser Thr
Ser Ala Ala Ala Ser Ala Ala Ala Ala Ala Glu 435
440 445Thr Ala Ile Val Phe Ile Thr Ala Asp Ser Gly Glu
Gly Tyr Ile Thr 450 455 460Val Asp Gly
Gln Leu Gly Asp Arg Asn Ser Leu Ala Pro Trp Asn Asn465
470 475 480Gly Thr Ala Leu Val Gln Ala
Val Ala Ser Ala Ser Lys Asn Val Ile 485
490 495Val Val Ile Asn Ser Val Gly Pro Leu Ile Leu Glu
Asp Ile Leu Ala 500 505 510Leu
Ser Ser Val Lys Ala Ile Val Trp Ala Gly Val Ser Gly Gln Glu 515
520 525Ser Gly Asn Gly Leu Ala Asp Ile Leu
Tyr Gly Ser Val Ser Pro Ser 530 535
540Gly Lys Leu Pro Tyr Thr Ile Ala Lys Gln Ala Ser Asp Tyr Gly Thr545
550 555 560Ala Ile Val Pro
Gly Asp Asp Asn Phe Pro Glu Gly Leu Phe Val Asp 565
570 575Tyr Arg His Phe Asp Gln Ala Asn Ile Gln
Pro Arg Phe Glu Phe Gly 580 585
590Tyr Gly Leu Ser Tyr Thr Thr Phe Gln Tyr Ser Gln Leu Thr Ala Lys
595 600 605Tyr Ser Asp Thr Ser Ala Gly
Ser Ser Thr Leu Ala Pro Gly Gly Pro 610 615
620Lys Gly Leu Tyr Asp Ile Val Ala Thr Val Thr Ala Lys Val Thr
Asn625 630 635 640Ser Gly
Thr Val Ser Gly Ala Glu Val Ala Gln Leu Tyr Ile Gly Leu
645 650 655Pro Gly Ser Ala Pro Ala Ser
Pro Pro Lys Gln Leu Arg Gly Phe Asp 660 665
670Lys Ile Ser Leu Lys Pro Gly Lys Ser Gly Thr Val Thr Phe
Asn Leu 675 680 685Arg Arg Lys Asp
Leu Ser Tyr Trp Asp Thr Ala Ser Ala Gln Trp Val 690
695 700Thr Pro Thr Ser Gly Glu Phe Ser Leu Tyr Val Gly
Ala Ser Ser Arg705 710 715
720Asp Ile Arg Leu Gln Gly Ser Leu Lys Cys Ser Gly Gln Gly Ile Arg
725 730 735Lys Gly Gly His
74052220DNAChaetomella raphigera 5atgtattccc gagccttcac tttcgcagcc
ctagctgtac caatcctagc tgctcctggt 60gatggtgatt gggcagcggc ttataaaaaa
gctaccgctg ctctcgcaaa attgagcaac 120accgacaaag ccagtattgt tactggcgtg
ggttgggaaa agggtccttg cgttgggaat 180actgctgcag tagcatcaat tggacttcca
gagctttgct atcaagatgg acccctaggc 240atccgtttcg tccaaaatgt aacagcattt
cctacaggta ttcagactgc ttccacgtgg 300gacattagtt tgatctacag tcgaggttta
gctttaggtc aggaggcaaa ggcacttggc 360atcaatgtcc agctcggtcc agttgctggt
cccattggta aaattccaga ggccggtcgc 420aactgggaag gcttttctcc agatccatac
ctgaatggtc tagcaatgtc caataccatt 480accggaatgc aagatgctgg cgtacaggct
tgtgcaaagc actttattgg aaacgagcaa 540gaaacgaatc gcgatacaat gagctccaac
atcgatgatc ggactttcca tgagttgtac 600ctctggcctt tcgcagatgc catcaaggcc
aacgttgcat ccataatgtg ctcatacaac 660aagttcaatg agacatacgc ttgtgagaac
aacttcttga caactattct caagggcgag 720cttgactttc aaggcttcgt cgtcagcgac
tgggctgctc agcatactac tattggaagt 780gcaaatgctg gtctggatgt tgcgatgcct
ggtgacaatt ttggcgacaa ctattatcta 840tggggcagca atcttctggc ggcgatctcc
aacggcactg ttgcccaatc tcgccttgat 900gacatggtca ctcgcatcct ggcttcatgg
tactttgttg gtcaggacca aggctatcca 960gctgtgacct ggtcctcttg gaatggtgga
ttgggaggtc ccaatgtgca ggctgatcat 1020aagcaggtcg cacgcgccat cgcccgagat
ggcattgtcc tactcacgaa caaaaacaaa 1080gccctgcccc tgaagaagcc cgctagttta
gcaatcattg gtcaagatgc catcgacaat 1140cctgcaggca tcaactcgtg ctccgaccgc
ggttgtgata cgggtcactt ggctatgggt 1200tggggctcgg gcacagcaga ctttccatac
ctcgttgcgc cgctcgatgc catcacaccc 1260ttagctcagg cccagggaac aaagcttgta
ttgtcgacga ccgacagtac ttctgccgct 1320gctagtgctg ccgccgccgc agagacagca
atcgtcttca tcactgccga ttcaggagag 1380ggatacatca ctgttgacgg ccaattgggt
gatcgcaact cacttgctcc atggaataat 1440ggcactgctc tcgttcaagc ggtagccagc
gccagcaaaa atgtcattgt cgtgatcaac 1500agtgtcggcc cattgattct cgaggacatt
ctcgctcttt ccagcgtgaa agcaattgtc 1560tgggctggcg tctcgggcca agaatcgggc
aatggacttg ctgatattct ttacggttca 1620gtatctccca gtgggaaact cccatacaca
atcgccaaac aggccagcga ctatggaaca 1680gccattgtgc ccggtgacga taactttccc
gaaggattgt ttgtagacta tcgtcatttc 1740gaccaagcaa acatccagcc gcgttttgaa
tttggctatg gcctctccta tacgaccttt 1800caatactcgc agcttaccgc aaagtactcc
gatacttccg caggcagctc cactctcgcc 1860cctggcggac ccaaggggct gtatgatatt
gttgcaacgg taacagcaaa ggtgacaaac 1920agcggcaccg tcagtggcgc tgaagtcgca
cagctataca ttggtttgcc gggctcggcg 1980cctgcatctc cacccaagca gctacgcggc
tttgacaaaa tcagcctcaa gccaggcaaa 2040tcaggcacgg tgacgtttaa cctgcgccga
aaggacctca gctactggga tactgcctcg 2100gctcagtggg ttacaccgac cagcggcgag
ttctccctgt atgttggtgc tagctcgaga 2160gatataaggt tacaggggtc tttgaaatgc
tcgggccaag gtattcggaa aggtggacat 222062905DNAChaetomella raphigera
6atgtattccc gagccttcac tttcgcagcc ctagctgtac caatcctagc tgctcctggt
60gatggtgatt gggcagcggc ttataaaaaa gctaccgctg ctctcgcaaa attgagcaac
120accgacaaag ccagtattgt tactggcgtg ggttgggaaa agggtccttg cgttgggaat
180actgctgcag tagcatcaat tggacttcca gagctttgct atcaagatgg acccctaggg
240taggctgctg gttatttgta gtctcagaac tgacggtgaa agcatccgtt tcgtccaaaa
300tgtaacagca tttcctacag gtattcagac tgcttccacg tgggacatta gtttgatcta
360cagtcgaggt ttagctttag gtaagtccaa gtacgctgtc aagatttgaa ttaagagtat
420taggtcagga ggcaaaggca cttggcatca atgtccagct cggtccagtt gctggtccca
480ttggtaaaat tccagaggtg tgtttagctg gaccctgtaa agtcttgcga ctgatatcgt
540acaggccggt cgcaactggg aaggcttttc tccagatcca tacctgaatg gtctagcaat
600gtccaatgta tgtgacgact tcttccccta ggaaaaggca ttgattggta tagaccatta
660ccggaatgca agatgctggc gtacaggctt gtgcaaagca ctttagtaag ttggttgacg
720gcagggtcat gcaagatctg agactcttca gttggaaacg agcaagaaac gaatcgcgat
780acaatgagct ccaacatcga tgatcggact ttccatgagg taggtgatac tcgcaaacaa
840agatactttg gctgatacag atagttgtac ctctggcctt tcgcagatgc catcaaggcc
900aacgttgcat ccataatgtg ctcatacaac aagttcaatg agacatacgc ttgtgagaac
960aacttcttga caactgtaag ttctaaggct cgtgcatcaa ttcttcggaa tcgcattaac
1020gtcttcagat tctcaagggc gagcttgact ttcaaggctt cgtcgtcagc ggtacttggt
1080ttcctatcct gctgttgaaa taagcacaga gctaacgcat ggtgcgctaa gactgggctg
1140ctcagcatac tactattgga agtgcaaatg ctggtcttgt acgttgacaa acatgcgatt
1200cataatacca ttgtagatct cgatcccagc atgaactgac acgttttcaa ggatgttgcg
1260atgcctggtg acaattttgg cgacaactat tatctatggg gcagcaatct tctggcggcg
1320atctccaagt aagttcctgg ttcgcaaatg gccttgacgt tgctgatctc tgtgccttcc
1380atagcggcac tgttgcccaa tctcgccttg atgacatggt cactcgcatc ctggcttcat
1440ggtactttgt tggtcaggac caaggctatc cagctgtgac ctggtcctct tggaatggtg
1500gattgggagg tcccaatgtg caggctgatc ataagcaggt cgcacgcgcc atcgcccgag
1560atggcattgt cctactcacg aacaaaaaca aagccctgcc cctgaagaag cccgctagtt
1620tagcaatcat tggtcaagat gccatcgaca atcctgcagg catcaactcg tgctccgacc
1680gcggttgtga tacgggtcac ttggctatgg gttggggctc gggcacagca gactttccat
1740acctcgttgc gccgctcgat gccatcacac ccttagctca ggcccaggga acaaagcttg
1800tattgtcgac gaccgacagt acttctgccg ctgctagtgc tgccgccgcc gcagagacag
1860caatcgtctt catcactgcc gattcaggag agggatacat cactgtgagt ttctggcatc
1920gagtgctgag cctctacatt tcctaatcag tcacgcacag gttgacggcc aattgggtga
1980tcgcaactca cttgctccat ggaataatgg cactgctctc gttcaagcgg tagccagcgc
2040cagcaaaaat gtcattgtcg tgatcaacag tgtcggccca ttgattctcg aggacattct
2100cgctctttcc agcgtgaaag caattgtctg ggctggcgtc tcgggccaag aatcgggcaa
2160tggacttgct gatattcttt acggttcagt atctcccagt gggaaactcc catacacaat
2220cgccaaacag gccagcgact atggaacagc cattgtgccc ggtgacgata actttcccga
2280aggattgttt gtagactatc gtcatttcga ccaagcaaac atccagccgc gttttgaatt
2340tggctatggc ctctgtaagt cccacccgcg cgcgttcctt tcccccacga tgatgataag
2400gagatgtgat atgtgaagag agtcgacctt ctcagccgct gctatagtta gacgctaaca
2460aacgtcaaca gcctatacga cctttcaata ctcgcagctt accgcaaagt actccgatac
2520ttccgcaggc agctccactc tcgcccctgg cggacccaag gggctgtatg atattgttgc
2580aacggtaaca gcaaaggtga caaacagcgg caccgtcagt ggcgctgaag tcgcacagct
2640atacattggt ttgccgggct cggcgcctgc atctccaccc aagcagctac gcggctttga
2700caaaatcagc ctcaagccag gcaaatcagg cacggtgacg tttaacctgc gccgaaagga
2760cctcagctac tgggatactg cctcggctca gtgggttaca ccgaccagcg gcgagttctc
2820cctgtatgtt ggtgctagct cgagagatat aaggttacag gggtctttga aatgctcggg
2880ccaaggtatt cggaaaggtg gacat
2905718PRTChaetomella raphigera 7Met Tyr Ser Arg Ala Phe Thr Phe Ala Ala
Leu Ala Val Pro Ile Leu1 5 10
15Ala Ala854DNAChaetomella raphigera 8atgtattccc gagccttcac
tttcgcagcc ctagctgtac caatcctagc tgct 5497PRTArtificial
sequenceSynthetic peptide 9Pro Gly Asp Gly Asp Trp Ala1
51020DNAArtificial sequenceSynthetic primer 10ccnggngayg gngaytgggc
201139DNAArtificial
sequenceSynthetic primer 11ggttcttgcc acagtcacga cttttttttt ttttttttt
391221DNAArtificial sequenceSynthetic primer
12ggttcttgcc acagtcacga c
211323DNAArtificial sequenceSynthetic primer 13cctggtgatg gtgattgggc agc
231426DNAArtificial
sequenceSynthetic primer 14atgtccacct ttccgaatac cttggc
261525DNAArtificial sequenceSynthetic primer
15cgtttcgtcc aaaatgtaac agcat
251623DNAArtificial sequenceSynthetic primer 16gatgctttca ccgtcagttc tga
231733DNAArtificial
sequenceSynthetic primer 17cgcttgaatt cgatgcctgg tgatggtgat tgg
331834DNAArtificial sequenceSynthetic primer
18ttcaagcggc cgcatgtcca cctttccgaa tacc
34
User Contributions:
Comment about this patent or add new information about this topic: