Inventors list

Assignees list

Classification tree browser

Top 100 Inventors

Top 100 Assignees

Patent application title: Environmental stress-responsive promoter and a gene encoding environmental stress-responsive transcriptional factor

Inventors:  Kazuo Shinozaki (Ibaraki, JP)  Motoaki Seki (Kanagawa, JP)  Miki Fujita (Ibaraki, JP)
IPC8 Class: AC12N1529FI
USPC Class: 800289
Class name: The polynucleotide confers resistance to heat or cold (e.g., chilling, etc.)
Publication date: 09/04/2008
Patent application number: 20080216196






Sign up to receive free email alerts when patent applications with chosen keywords are published SIGN UP

Abstract:

The present invention provides a method of regulating expression of a gene by (a) preparing a recombinant plant cell line, plant tissue or plant with an expression vector having an abiotic environmental stress-responsive promoter of SEQ ID NO: 27 and the gene downstream thereof; and (b) culturing and cultivating the recombinant plant cell, plant tissue or plant under an abiotic environmental stress, wherein the promoter regulates the expression of the gene under the abiotic environmental stress.

Claims:

1.-4. (canceled)

5. A method of regulating expression of a gene which comprises:(a) preparing a recombinant plant cell line, plant tissue or plant comprising an expression vector having an abiotic environmental stress-responsive promoter comprising SEQ ID NO: 63 which is operably linked to the coding sequence the gene; and(b) culturing and cultivating the recombinant plant cell, plant tissue or plant under an abiotic environmental stress, wherein the promoter regulates the expression of the gene under the abiotic environmental stress.

6. The method according to claim 5, wherein the abiotic environmental stress is cold stress, drought stress or salt stress.

7. The method according to claim 5, wherein the gene encodes a polypeptide that can confer increased environmental stress resistance compared to a plant cell line, plant tissue or plant lacking the expression vector.

8. The method according to claim 5, wherein the gene is a plant gene.

Description:

[0001]This application is a Divisional of co-pending application Ser. No. 10/470,154 filed on Sep. 5, 2006, that is a divisional of co-pending application Ser. No. 10/495,918 filed on May 18, 2004, and for which priority is claimed under 35 U.S.C. 120; and this application claims priority of International Application No. PCT/JP02/11955 filed on Nov. 5, 2002 under 35 U.S.C. 119; the entire contents of all are hereby incorporated by reference.

TECHNICAL FIELD

[0002]The present invention relates to an environmental stress-responsive promoter and a gene encoding environmental stress-responsive transcriptional factor.

BACKGROUND ART

[0003]Large quantities of genomic and cDNA sequences have been determined with respect to a number of organisms by gene sequencing projects. In a plant model, Arabidopsis thaliana, the complete genomic sequences of two chromosomes have been determined (Lin, X. et al., (1999), Nature 402, 761-768; and Mayer, K. et al., (1999), Nature 402, 769-777).

[0004]The expressed sequence tag (EST) project also has greatly contributed to the discovery of expression genes (Hofte, H. et al., (1993), Plant J. 4, 1051-1061; Newman, T. et al., (1994), Plant Physiol. 106, 1241-1255; and Cooke, R. et al., (1996), Plant J. 9, 101-124; and Asamizu, E. et al., (2000), DNA Res. 7, 175-180). For example, the database of EST (dbEST) of the National Center for Biotechnology Information (NCBI) includes partial cDNA sequences, in which more than half (about 28,000 genes) of the total genes are reproduced, (as estimated from the gene content of Arabidopsis thaliana chromosome 2 completely sequenced [Lin, X. et al., (1999), Nature 402, 761-768]).

[0005]Recently, microarray (DNA chip) technology has become a useful tool for analyzing genome-scale gene expression (Schena, M. et al., (1995), Science 270, 467-470; Eisen, M. B. and Brown, P. O. (1999), Methods Enzymol. 303, 179-205). In the technology using a DNA chip, cDNA sequences are arrayed on a slide glass in a density of not smaller than 1,000 genes/cm2. The cDNA sequences thus arrayed are hybridized simultaneously with a pair of cDNA probes tagged with two color fluorescent labels, which have been prepared from RNA samples of different types of cells or tissues. In this manner, a large amount of genes can be directly analyzed and compared for gene expression. This technology was demonstrated for the first time by analyzing 48 Arabidopsis genes for differential expression in root and shoots (Schena, M. et al., (1995), Science 270, 467-470). Furthermore, a microarray was used in investigating 1,000 clones randomly taken from a human cDNA library in order to identify a novel gene responsive to heat shock and protein kinase C activation (Schena, M. et al., (1996), Proc. Natl. Acad. Sci. USA, 93, 10614-10619).

[0006]In another method, a DNA chip is used in analyzing the expression profile of an inflammatory-disease associated gene under various induction conditions (Heller, R. A. et al., (1997), Proc. Natl. Acad. Sci. USA, 94, 2150-2155). Furthermore, using a microarray, a yeast genome having more than 6,000 coding sequences has been analyzed for dynamic expression (DeRisi, J. L. et al., (1997) Science 278, 680-686; and Wodicka, L. et al., (1997), Nature Biotechnol. 15, 1359-1367).

[0007]However, in the field of plant science, only a few reports have been made on microarray analysis (Schena, M. et al., (1995), Science 270, 467-470; Ruan, Y. et al., (1998), Plant J. 15, 821-833; Aharoni. A. et al., (2000), Plant Cell 12, 647-661; and Reymond, P. et al., (2000), Plant Cell 12, 707-719).

[0008]The growth of plants is significantly affected by environmental stresses such as drought, high salinity and low temperature. Of the stresses, drought or water deficiency is the most critical factor that limits growth of plants and production of crops. Such a drought stress causes various biochemical and physiological responses in plants.

[0009]To survive under these conditions of stress, plants acquire responsivity and adaptability to the stresses. Recently, several types of genes responsive to drought at a transcriptional level have been reported (Bohnert, H. J. et al., (1995), Plant Cell 7, 1099-1111; Ingram, J., and Bartels, D. (1996), Plant Mol. Biol. 47, 377-403; Bray, E. A. (1997), Trends Plant Sci. 2, 48-54; Shinozaki, K., and Yamaguchi-Shinozaki, K. (1997), Plant Physiol. 115, 327-334; Shinozaki, K., and Yamaguchi-Shinozaki, K. (1999), "Molecular responses to drought stress. Molecular responses to cold, drought, heat and salt stress in higher plants", edited by Shinozaki, K. and Yamaguchi-Shinozaki, K. R. G. Landes Company; and Shinozaki, K., and Yamaguchi-Shinozaki, K. (2000), Curr. Opin. Plant Biol. 3, 217-223).

[0010]On the other hand, in an attempt to improve stress resistance of plants by introducing a gene, stress-inducible genes have been used (Holmberg, N., and Bulow, L. (1998), Trends Plant Sci. 3, 61-66; and Bajaj, S. et al., (1999), Mol. Breed. 5, 493-503). Not only to further clarify the mechanism of stress resistance and stress responsivity of a higher plant at a molecular level but also to improve the stress resistance of a crop by gene manipulation, it is important to analyze the function of a stress-inducible gene.

[0011]Dehydration responsive element and C-repeat sequence (DRE/CRT) has been identified as an important cis-acting element when drought, high salt and cold stress-responsive genes are expressed in an ABA independent manner, where ABA refers to abscisic acid, a kind of plant hormone and serves as a signal transmission factor of seed dormancy and environmental stress (Yamaguchi-Shinozaki, K., and Shinozaki, K. (1994), Plant Cell 6, 251-264; Thomashow, M. F. et al., (1999), Plant Mol. Biol. 50, 571-599; and Shinozaki, K., and Yamaguchi-Shinozaki, K. (2000), Curr. Opin. Plant Biol. 3, 217-223). Furthermore, a transcriptional factor (DREB/CBF) involved in DRE/CRT responsive gene expression has been cloned (Stockinger. E. J. et al., (1997), Proc. Natl. Acad. Sci. USA 94, 1035-1040; Liu, Q. et al., (1998), Plant Cell 10, 1391-1406; Shinwari, Z. K. et al., (1998), Biochem. Biophys. Res. Commun. 250, 161-170; and Gilmour, S. J. et al., (1998), Plant J. 16, 433-443). DREB1/CBF is considered to function in cold-responsive gene expression, whereas DREB2 is involved in drought-responsive gene expression. Strong resistance to freezing stress was observed in a transgenic Arabidopis plant that overexpresses CBF1 (DREB1B) cDNA under the control of a cauliflower mosaic virus (CaMV) 35S promoter (Jaglo-Ottosen, K. R. et. al., (1998), Science 280, 104-106).

[0012]The present inventors have reported that when DREB1A (CBF3) cDNA molecules are overexpressed in transgenic plants under the control of a CaMV 35S promoter or a stress-inducible rd29A promoter, strong constitutive expression of stress-inducible DREB1A target genes are induced to improve resistance to freezing, drought and salt stresses (Liu, Q. et al., (1998), Plant Cell 10, 1391-1406; and Kasuga, M. et al., (1999), Nature Biotechnol. 17, 287-291). Furthermore, the present inventors have already identified six DREB1A target genes such as rd29A/lti78/cor78, kin1, kin2/cor6.6, cor15a, rd17/cor47, and erd10 (Kasuga, M. et al., (1999), Nature Biotechnol. 17, 287-291). However, it has not yet been sufficiently elucidated how the overexpressed DREB1A cDNA improves stress resistance to freezing, drought and salt in a transgenic plant. To investigate the molecular mechanisms of drought and freezing resistance, it is important to identify and analyze as many genes controlled by DREB1A as possible.

DISCLOSURE OF THE INVENTION

[0013]The present invention is directed to providing an environmental stress-responsive promoter and a gene encoding an environmental stress-responsive transcriptional factor.

[0014]The present inventors have intensively studied to solve the aforementioned problems. As a result, they succeeded in identifying novel genes responsive to cold, drought and salt stresses and isolating promoter regions thereof by using cDNA microarray analysis, thereby accomplishing the present invention.

[0015]More specifically, the present invention is directed to an environmental stress-responsive promoter comprising DNA of the following (a), (b) or (c):

(a) DNA consisting of any nucleotide sequence selected from SEQ ID NOS: 1 to 90;(b) DNA consisting of a nucleotide sequence comprising a deletion, substitution or addition of one or more nucleotides relative to any nucleotide sequence selected from SEQ ID NOS: 1 to 90, and functioning as an environmental stress responsive promoter; and(c) DNA hybridizing under stringent conditions to DNA consisting of any nucleotide sequence selected from SEQ ID NOS: 1 to 90, and functioning as an environmental stress responsive promoter.

[0016]Examples of environmental stress include at least one selected from the group consisting of cold stress, drought stress, and salt stress.

[0017]The present invention is also directed to an expression vector comprising the promoter mentioned above, or an expression vector having an arbitrary gene integrated therein.

[0018]Furthermore, the present invention is directed to a transformant comprising the expression vector.

[0019]Moreover, the present invention is directed to a transgenic plant, such as a plant body, plant organ, plant tissue or plant culture cell, comprising the expression vector.

[0020]The present invention is still further directed to a method for producing a stress-resistant plant, comprising culturing or cultivating the transgenic plant.

[0021]On the other hand, the present inventors identified novel genes encoding cold, drought and salt stress-responsive transcriptional factors by use of cDNA microarray analysis, thereby accomplishing the present invention.

[0022]More specifically, the present invention is directed to a gene encoding an environmental stress-responsive transcriptional factor comprising an amino acid of the following (a) or (b):

(a) any amino acid sequence selected from SEQ ID NOS: 2n (n is an integer from 47 to 82);(b) an amino acid sequence comprising a deletion, substitution or addition of one or more amino acids relative to any amino acid sequence selected from SEQ ID NOS: 2n (n is an integer from 47 to 82), functioning as an environmental stress-responsive transcriptional factor.

[0023]Also, the present invention is directed to a gene according to claim 1, comprising DNA of the following (a), (b) or (c):

(a) DNA consisting of any nucleotide sequence selected from SEQ ID NOS: 2n-1 (n is an integer from 47 to 82);(b) DNA consisting of a nucleotide sequence comprising a deletion, substitution or addition of one or more nucleotides relative to any nucleotide sequence selected from SEQ ID NOS: 2n-1 (n is an integer from 47 to 82), and encoding an environmental stress-responsive transcriptional factor; and(c) DNA hybridizing under stringent conditions to DNA consisting of any nucleotide sequence selected from SEQ ID NOS: 2n-1 (n is an integer from 47 to 82), and encoding an environmental stress-responsive transcriptional factor.

[0024]In the present invention, examples of environmental stress include at least one selected from the group consisting of cold stress, drought stress, and salt stress.

[0025]The present invention is also directed to an expression vector containing the gene, a transformant containing the expression vector, and a transgenic plant containing the expression vector.

[0026]Furthermore, the present invention is directed to a transgenic plant, such as a plant body, plant organ, plant tissue or plant culture cell.

[0027]Moreover, the present invention is directed to a method for producing a stress-resistant plant, comprising culturing or cultivating the transgenic plant.

[0028]Hereinafter, the present invention will be described in detail.

[0029]The present inventors constructed full-length cDNA libraries from Arabidopsis plants placed under different conditions, such as dehydration-treated plants and cold-treated plants (Seki. M. et al., (1998), Plant J. 15, 707-720), by the biotinylated CAP trapper method (Carninci. P. et al., (1996), Genomics, 37, 327-336); Then, Arabidopsis full-length cDNA microarrays were respectively prepared using about 1,300 full-length cDNA molecules and about 7,000 full-length cDNA molecules both containing stress-inducible genes. Besides using these dehydration and cold-inducible full-length cDNA molecules, another cDNA microarray was prepared using a DREB1A target gene, a transcriptional regulator for controlling expression of a stress-responsive gene. Thereafter, expression patterns of genes under drought and cold stress were monitored to exhaustively analyze stress-responsive genes. As a result, from the full-length cDNA microarray containing about 1,300 of full-length cDNA molecules, novel environmental stress-responsive genes, that is, 44 drought-inducible genes and 19 cold-inducible genes were isolated. 30 out of the 44 drought-inducible genes, and 10 out of the 19 cold-inducible genes were novel stress-inducible genes. Moreover, it was found that 12 stress-inducible genes were DREB1A target genes and 6 out of the 12 stress-inducible genes were novel genes. As a result of the analysis, 301 drought-inducible genes, 54 cold-inducible genes and 211 high salt-stress inducible genes were isolated from a cDNA microarray containing about 7,000 full-length cDNA molecules.

[0030]Thereafter, not only promoter regions but also environmental genes encoding environmental stress-responsive transcriptional factors were successfully isolated from these environmental stress-responsive genes.

[0031]As described above, a full-length cDNA microarray is useful tool for analyzing the expression manner of Arabidopsis thaliana drought- and cold-stress inducible genes and analyzing the target gene of a stress associated transcriptional regulator.

1. Isolation of Promoter

[0032]The promoter of the present invention contains a cis-element which is present upstream of a gene encoding a stress-responsive protein expressed by an environmental stress such as a cold, drought, or high salt stress and which activates the transcription of a gene present downstream thereof by binding of a transcriptional factor. Examples of such a cis-element include a dehydration responsive element (DRE), an abscisic acid responsive element (ABRE), and a cold-stress responsive element. Examples of genes encoding proteins binding to these elements include a DRE binding protein 1A gene (referred to also as a "DREB1A gene"), DRE binding protein 1C gene (referred to also as a "DREB1C gene"), DRE binding protein 2A gene (referred to also as a "DREB2A gene"), and DRE binding protein 2B gene (referred to also as a "DREB2B gene").

[0033]In isolating a promoter of the present invention, first, stress-responsive genes are isolated by using a microarray. In constructing a microarray, use may be made of about 1,300 cDNA molecules in total including genes isolated from Arabidopsis full-length cDNA libraries, responsive to dehydration (RD) genes, early responsive to dehydration (ERD) genes, kin1 genes, kin2 genes, and cor15a genes; and furthermore, α-tubulin genes as an internal standard; and moreover, mouse nicotinic acetylcholine receptor epsilon subunit (nAChRE) genes and mouse glucocorticoid receptor homologous genes, as negative controls.

[0034]As a microarray used in isolating the promoter of the present invention, use may be made of about 7,000 cDNA molecules in total including genes isolated from an Arabidopsis full-length cDNA library, responsive to dehydration (RD) genes, early responsive to dehydration (ERD) genes, and PCR amplification fragments as an internal standard obtained from λ control template DNA fragments (TX803, manufactured by Takara Shuzo); and mouse nicotinic acetylcholine receptor epsilon subunit (nAChRE) genes and mouse glucocorticoid receptor homologous genes, as negative controls.

[0035]A plasmid DNA extracted with a plasmid preparation device (manufactured by Kurabo) is sequenced by sequence analysis using a DNA sequencer (ABI PRISM 3700, PE Applied Biosystems, CA, USA). Based on the GenBank/EMBL database, the obtained sequence is screened for homology by using the BLAST program.

[0036]After poly A selection is performed, reverse transcription is carried out to synthesize double-stranded DNA molecules and a cDNA molecule is inserted into a vector.

[0037]The cDNA molecule inserted into a vector for constructing cDNA libraries is amplified by PCR using complementary primers to the sequences of vectors on both sides of the cDNA molecule. Examples of such vectors include λZAPII and λPS.

[0038]A microarray can be prepared according to a conventional method, which is not particularly limited. For example, using a gene tip microarray stamp machine GTMASS SYSTEM (manufactured by Nippon Laser & Electronics Lab.), the above obtained PCR product is loaded from a microtiter plate and spotted on a microslide glass at predetermined intervals. Then, to prevent a non-specific signal form being expressed, the slide is immersed into a blocking solution.

[0039]Examples of plant materials include a plant strain obtained by destroying specific genes as well as wild type plants. A transgenic plant having cDNA of DREB1A introduced therein may be used. Examples of plant species include Arabidopsis thaliana, tobacco and rice. Of them, Arabidopsis thaliana is preferable.

[0040]Dehydration- and cold-stress treatments can be carried out according to a known method (Yamaguchi-Shinozaki, K., and Shinozaki, K. (1994), Plant Cell 6, 251-264).

[0041]After plant bodies (wild type plants and DREB1A overexpression transformants) are exposed to stress, they are sampled and stored in cryogenic conditions with liquid nitrogen. The wild type and DREB1A overexpression transformants are used in an experiment to identify a DREB1A target gene. From plant bodies, mRNA is isolated and purified by a known method or a kit.

[0042]In the presence of Cy3 dUTP or Cy5 dUTP for labeling (Amersham Pharmacia), each of mRNA samples is subjected to reverse transcription and then used in hybridization.

[0043]After the hybridization, the microarray is scanned with a scanning laser microscope or the like. As a program for analyzing data of a microarray, Imagene Ver 2.0 (BioDiscovery) and QuantArray (GSI Lumonics) etc. may be used.

[0044]After the scanning, a plasmid having a target gene is prepared. In this way, the target genes are isolated.

[0045]A promoter region is determined by analyzing the nucleotide sequence of the gene isolated above and using a gene analysis program based on the genomic information of database (GenBank/EMBL, ABRC). The isolated genes can be classified into ones inducible by both dehydration and cold stress, ones inducible specifically by drought stress, and one inducible specifically by cold stress. According to the gene analysis program, from the genes mentioned above, 90 types of genes below can be identified.

[0046](FL03-07-F12, FL04-12-F24, FL04-14-N10, FL04-14-P24, FL04-17-103, FL04-17-M08, FL04-17-M22, FL05-05-A17, FL05-05-F20, FL05-05-G20, FL05-09-N09, FL05-10-J09, FL05-10-M08, FL05-11-H09, FL05-12-H13, FL05-13-I20, FL05-14-E15, FL05-14-E16, FL05-16-F03, FL05-16-H23, FL05-18-M07, FL05-18-O21, FL05-19-F21, FL05-19-O22, FL05-21-K17, FL06-10-F03, FL06-12-H12, FL07-12-123, FL08-08-H23, FL08-08-O14, FL08-09-M05, FL08-10-K08, FL08-11-P07, FL08-13-F10, FL08-19-D04, FL08-19-G15, FL09-06-B11, FL09-07-G17, FL09-10-A12, FL09-13-P15, FL02-05-I05, FL04-12-N15, FL04-16-P21, FL04-17-N22, FL04-20-P19, FL02-09-H01, FL05-01-D08, FL05-02-G08, FL05-02-O17, FL05-07-L13, FL05-08-B14, FL05-09-N10, FL05-11-L01, FL05-12-J09, FL05-14-D24, FL05-14-F20, FL05-14-108, FL05-15-C04, FL05-15-E19, FL05-18-A06, FL05-18-H15, FL05-19-C02, FL05-20-M16, FL05-20-N18, FL05-21-E06, FL05-21-L12, FL06-07-B08, FL06-08-H20, FL06-09-N04, FL06-11-K21, FL07-07-G15, FL07-12-D17, FL08-11-C23, FL08-13-G20, FL08-15-M21, FL08-18-N19, FL08-19-C07, FL08-19-P05, FL09-07-G09, FL09-07-G15, FL09-10-J18, FL09-11-I12, FL09-12-B03, FL09-16-I11, FL09-16-M04, FL11-01-J18, FL11-07-D13, FL11-07-F02, FL11-07-N15 and FL11-10-D10). The promoter regions of these genes are represented by SEQ ID NOS: 1 to 90, respectively.

[0047]As long as a promoter of the present invention acts as an environmental stress-responsive promoter, use may be made of any promoter having a nucleotide sequence selected from SEQ ID NOS: 1 to 90 wherein one or more nucleotides, preferably one or several nucleotides (for example 1 to 10, preferably 1 to 5) may be deleted, substituted or added. Furthermore, DNA hybridizing with the DNA comprising any nucleotide sequence selected from SEQ ID NOS: 1 to 90 under stringent conditions and acting as an environmental stress-responsive promoter is also included in the promoter of the present invention.

[0048]Once the nucleotide sequence of a promoter according to the present invention is determined, the promoter can be obtained by chemical synthesis, PCR using a cloned probe as a template, or hybridization using a DNA fragment having the nucleotide sequence as a probe. Furthermore, a mutant of the promoter of the present invention, which has the same functions as those of a non-mutated promoter, can be also synthesized by a site-specific mutagenesis or the like.

[0049]To introduce a mutation into a promoter sequence, a known method such as the Kunkel method, Gapped duplex method or an equivalent method may be employed. A mutation may be introduced by using a mutation-introducing kit (for example, Mutant-K manufactured by Takara or Mutant-G manufactured by Takara) which uses a site-specific mutagenesis or by using the LA PCR in vitro mutagenesis series kit (manufactured by Takara).

[0050]The term "functioning as an environmental stress-responsive promoter" used herein refers to a function of activating transcription caused by binding RNA polymerase to the promoter when the promoter is exposed to a predetermined environmental stress condition.

[0051]The term "environmental stress" used herein generally refers to an abiotic stress such as drought stress, cold stress, high salt stress, or intensive light stress. The term "drought" used herein refers to a state of water deficiency, and the term "cold" used herein refers to a state where an object is exposed to a lower temperature than the optimum living temperature for each organism (e.g., in the case of Arabidopsis thaliana, it is exposed to a temperature of -20 to +21° C. continuously for one hour to several weeks). The term "high salt" used herein refers to a state where a plant is treated with NaCl of 50 mM to 600 mM in concentration continuously for 0.5 hours to several weeks. The term "intensive light stress" used herein refers to a state where too intensive light to use for photosynthesis is applied to a plant, and corresponds to a case where, for example, light of 5,000 to 10,000 Lx or more is applied. These environmental stresses may be applied singly or in combination.

[0052]The plant promoter of the present invention includes a promoter having a nucleotide sequence represented by SEQ ID NOS: 1 to 90 wherein a nucleotide sequence may be added to the 3' end in order to increase transcriptional efficiency or a nucleotide sequence may be deleted from the 5' end to the extent not to lose the activity of a promoter.

[0053]Furthermore, the promoter of the present invention includes DNA which hybridizes with DNA consisting of any nucleotide sequence selected from SEQ ID NOS: 1 to 90 under stringent conditions and acts as an environmental stress-responsive promoter. The term "stringent conditions" used herein refers to the conditions of sodium concentration of 25 to 500 mM, preferably 25 to 300 mM, and a temperature of 42 to 68° C., preferably 42 to 65° C.; more preferably, conditions of 5×SSC (83 mM NaCl, 83 mM sodium citrate) and a temperature of 42° C.

2. Construction of Expression Vector

[0054]An expression vector of the present invention can be obtained by ligating (inserting) a promoter according to the present invention to an appropriate vector. The vector into which a promoter of the present invention is to be inserted is not particularly limited as long as it can be replicated in a host. Examples of such a vector include a plasmid, shuttle vector and helper plasmid.

[0055]Examples of such a plasmid DNA include plasmids derived from Escherichia coli (e.g., pBR322, pBR325, pUC118, pUC119, pUC18, pUC19, and pBluescript); plasmids derived from Bacillus subtilis (e.g., pUB110 and pTP5); and plasmids derived from yeasts (e.g., YEp13 and YCp50). Examples of a phage DNA include λ phages (Charon4A, Charon21A EMBL3, EMBL4, λgt10, λgt11, and λZAP). Further animal virus vectors such as retrovirus and a vaccinia virus and insect virus vectors such as a baculovirus can be also used.

[0056]To insert a promoter according to the present invention into a vector, use may be made of a method of digesting a purified DNA with appropriate restriction enzymes, inserting the obtained DNA fragment into the restriction site of a suitable vector DNA or a multi-cloning site, and ligating it to the vector.

[0057]In the present invention, to express an arbitrary gene, the arbitrary gene can be further inserted into the aforementioned expression vector. The technique inserting an arbitrary gene is the same as the method inserting a promoter into a vector. An arbitrary gene is not particularly limited. Examples of the gene include genes shown in Table 1 and known genes other than those.

[0058]In a case where a reporter gene, for example, a GUS gene, widely used in plants is linked to the 3' end of a promoter of the present invention, the strength of the promoter can be easily evaluated by checking GUS activity. As such a reporter gene other than the GUS gene, luciferase and a green fluorescent protein can be used.

[0059]As described above, various types of vectors can be used in the present invention. Further, a desired gene is ligated to the promoter of the present invention in a sense or antisense direction and then, the construction can be inserted into a vector such as pBI101 (Clonetech) called a binary vector.

3. Isolation of Transcriptional Factor

[0060]A transcriptional factor binds to a cis element which is present upstream of a gene and activates the transcription of the gene present downstream thereof. The transcriptional factors isolated in the present invention are induced by environmental stresses such as a low temperature, dehydration, and high salt concentration.

[0061]Environmental stress-responsive transcriptional factors are roughly divided into those belonging to a DREB family, ERF family, zinc finger family, WRKY family, MYB family, bHLH family, NAC family, homeo domain family and bZIP family.

[0062]In isolating a transcriptional factor, first, stress responsive genes are isolated by using a microarray. As a microarray, use may be made of about 7,000 cDNA molecules in total including genes isolated from Arabidopsis full-length cDNA libraries, responsive to dehydration (RD) genes, early responsive to dehydration (ERD) genes; PCR amplification fragments obtained from a λ control template DNA fragment (TX803, manufactured by Takara Shuzo), as an internal standard; and mouse nicotinic acetylcholine receptor epsilon subunit (nAChRE) genes and mouse glucocorticoid receptor homologous genes, as negative controls.

[0063]A plasmid DNA extracted by a plasmid preparation device (manufactured by Kurabo) is sequenced by sequence analysis using a DNA sequencer (ABI PRISM 3700, PE Applied Biosystems, CA, USA). Based on the GenBank/EMBL database, the obtained sequence is screened for homology by using the BLAST program.

[0064]After poly A selection is performed, reverse transcription is carried out to synthesize a double-stranded DNA molecule and a cDNA molecule is inserted into a vector.

[0065]The cDNA molecule inserted into a vector for constructing cDNA libraries is amplified by PCR using complementary primers to the sequences of vectors on both sides of the cDNA molecule. Examples of such vectors include λZAPII and λPS.

[0066]A microarray can be prepared according to a conventional method, which is not particularly limited. For example, using a gene tip microarray stamp machine GTMASS SYSTEM (manufactured by Nippon Laser & Electronics Lab.), the above obtained PCR product is loaded from the microtiter plate and spotted on a microslide glass at predetermined intervals. Then, to prevent a non-specific signal from being expressed, the slide is immersed into a blocking solution.

[0067]Examples of plant materials include a plant strain obtained by destroying a specific gene as well as wild type plants. A transgenic plant having a cDNA of DREB1A introduced therein may be used. Examples of plant species include Arabidopsis thaliana, tobacco and rice. Of them, Arabidopsis thaliana is preferable.

[0068]Dehydration- and cold-stress treatments can be carried out according to a known method (Yamaguchi-Shinozaki, K., and Shinozaki, K. (1994), Plant Cell 6, 251-264).

[0069]After plant bodies (wild type plants and DREB1A overexpression transformants) are exposed to stress, they are sampled and stored in cryogenic conditions with liquid nitrogen. The wild type and DREB1A overexpression transformants are used in an experiment to identify a DREB1A target gene. From plant bodies, mRNA is isolated and purified by a known method or a kit.

[0070]In the presence of Cy3 dUTP or Cy5 dUTP for labeling (Amersham Pharmacia), each of mRNA samples is subjected to reverse transcription and then used in hybridization.

[0071]After hybridization, the microarray is scanned with a scanning laser microscope or the like. As a program for analyzing data of a microarray, Imagene Ver 2.0 (BioDiscovery) and QuantArray (GSI Lumonics) etc., may be used.

[0072]After the scanning, a plasmid having a target gene is prepared. In this way, the target genes are isolated.

[0073]A transcriptional factor is determined by analyzing the nucleotide sequence of the gene isolated above and using a gene analysis program based on the genomic information of database (GenBank/EMBL, ABRC). The isolated genes can be classified into ones inducible by both drought and cold stress, ones inducible specifically by drought stress, and one inducible specifically by cold stress. According to the gene analysis program, from the genes mentioned above, genes encoding 36 types of transcriptional factors can be identified. The nucleotide sequences of the genes encoding 36 types of transcriptional factors are represented by SEQ ID NOS: 2n-1 (n is an integer of 47 to 82) and amino acid sequences of the transcriptional factors are represented by SEQ ID NOS: 2n (n is an integer of 47 to 82). Sequence ID numbers and the names of genes encoding transcriptional factors are shown in Table 1.

TABLE-US-00001 TABLE 1 Name of gene SEQ ID NO: RAFL05-11-M11 SEQ ID NO: 93 RAFL06-11-K21 SEQ ID NO: 95 RAFL05-16-H23 SEQ ID NO: 97 RAFL08-16-D06 SEQ ID NO: 99 RAFL08-16-G17 SEQ ID NO: 101 RAFL06-08-H20 SEQ ID NO: 103 RAFL07-10-G04 SEQ ID NO: 105 RAFL04-17-D16 SEQ ID NO: 107 RAFL05-19-M20 SEQ ID NO: 109 RAFL08-11-M13 SEQ ID NO: 111 RAFL04-15-K19 SEQ ID NO: 113 RAFL05-11-L01 SEQ ID NO: 115 RAFL05-14-C11 SEQ ID NO: 117 RAFL05-19-G24 SEQ ID NO: 119 RAFL05-20-N02 SEQ ID NO: 121 RAFL05-18-H12 SEQ ID NO: 123 RAFL06-10-D22 SEQ ID NO: 127 RAFL06-12-M01 SEQ ID NO: 129 RAFL05-14-D24 SEQ ID NO: 131 RAFL05-20-N17 SEQ ID NO: 133 RAFL04-17-F21 SEQ ID NO: 135 RAFL09-12-N16 SEQ ID NO: 137 RAFL05-19-I05 SEQ ID NO: 139 RAFL05-21-I22 SEQ ID NO: 141 RAFL08-11-H20 SEQ ID NO: 143 RAFL05-21-C17 SEQ ID NO: 145 RAFL05-08-D06 SEQ ID NO: 147 RAFL05-20-M16 SEQ ID NO: 149 RAFL11-01-J18 SEQ ID NO: 151 RAFL11-09-C20 SEQ ID NO: 153 RAFL05-18-N16 SEQ ID NO: 155 RAFL11-10-D10 SEQ ID NO: 157 RAFL04-17-N22 SEQ ID NO: 159 RAFL05-09-G15 SEQ ID NO: 161 RAFL05-21-L12 SEQ ID NO: 163

[0074]Note that as long as a transcriptional factor of the present invention functions as an environmental stress-responsive transcriptional factor, use may be made of any transcriptional factor having a nucleotide sequence selected from SEQ ID NOS: 2n-1 (n is an integer of 47 to 82) wherein one or more nucleotides, preferably one or several nucleotides (for example 1 to 10, preferably 1 to 5) have been deleted, substituted or added. Furthermore, DNA hybridizing with the DNA comprising any nucleotide sequence selected from SEQ. ID NOS. 2n-1 (n is an integer of 47 to 82) under stringent conditions and encoding an environmental stress-responsive transcriptional factor is also included in the transcriptional factor of the present invention. The term "stringent conditions" used herein refers to the conditions of sodium concentration of 25 to 500 mM, preferably 25 to 300 mM, and a temperature of 42 to 68° C., preferably 42 to 65° C.; more preferably, conditions of 5×SSC (83 mM NaCl, 83 mM sodium citrate) and a temperature of 42° C.

[0075]36 types of transcriptional factors isolated in the present invention may be classified as follows.

(1) DREB family: RAFL05-11-M11, RAFL06-11-K21, RAFL05-16-H23, RAFL08-16-D16;(2) ERF family: RAFL08-16-G17, RAFL06-08-H20;(3) Zinc finger family: RAFL07-10-G04, RAFL04-17-D16, RAFL05-19-M20, RAFL08-11-M13, RAFL04-15-K19, RAFL05-11-L01, RAFL05-14-C11, RAFL05-19-G24, RAFL05-20-N02;(4) WRKY family: RAFL05-18-H112, RAFL05-19-E19, RAFL06-10-D22, RAFL06-12-M01;(5) MYB family: RAFL05-14-D24, RAFL05-20-N17, RAFL04-17-F21;(6) bHLH family: RAFL09-12-N16;(7) NAC family: RAFL05-19-I05, RAFL05-21-I22, RAFL08-11-H20, RAFL05-21-C17, RAFL05-08-D06;(8) Homeo domain family: RAFL05-20-M16, RAFL11-01-J18, RAFL11-09-C20; and(9) bZIP family: RAFL05-18-N16, RAFL11-10-D10, RAFL04-17-N22, RAFL05-09-G15.

[0076]Note that RAFL05-21-L12 cannot be classified into (1) to (9).

[0077]Once the nucleotide sequence of a gene encoding a transcriptional factor according to the present invention is determined, the gene encoding a transcriptional factor according to the present invention can be obtained by chemical synthesis, PCR using a cloned probe as a template, or hybridizing a DNA fragment having the nucleotide sequence as a probe. Furthermore, a mutant of the gene encoding a transcriptional factor according to the present invention, and having the same functions as those of a non-mutated transcriptional factor, can be also synthesized by a site-specific mutagenesis or the like.

[0078]To introduce a mutation into a nucleotide sequence of a gene encoding a transcriptional factor, a known method such as the Kunkel method, Gapped duplex method, or an equivalent method may be employed. A mutation may be introduced by using a mutation-introducing kit (for example, Mutant-K manufactured by Takara and Mutant-G manufactured by Takara) which uses a site-specific mutagenesis or by using the LA PCR in vitro mutagenesis series kit (manufactured by Takara).

[0079]The term "environmental stress" used herein generally refers to an abiotic stress such as drought stress, cold stress, high salt stress, or intensive light stress. The term "drought" used herein refers to a state of water deficiency, the term "cold" used herein refers to a state where an object is exposed to a lower temperature than the optimum living temperature of each organism (e.g., in the case of Arabidopsis thaliana, e.g., in the case of Arabidopsis thaliana, it is exposed to a temperature of -20 to +21° C. continuously for one hour to several weeks). The term "high salt" used herein refers to a state where a plant is treated with NaCl of 50 mM to 600 mM in concentration continuously for 0.5 hours to several weeks. The term "intensive light stress" used herein refers to a state where too intensive light to use for photosynthesis is applied to a plant, and corresponds to a case where, for example, light of 5,000 to 10,000 Lx or more is applied. These environmental stresses may be applied singly or in combination.

4. Construction of Expression Vector

[0080]The expression vector of the present invention can be obtained by ligating (inserting) a gene encoding a transcriptional factor according to the present invention to an appropriate vector. The vector into which a gene encoding a transcriptional factor of the present invention is inserted is not particularly limited as long as it can be replicated in a host. Examples of such a vector include a plasmid, shuttle vector and helper plasmid.

[0081]Examples of such a plasmid DNA include plasmids derived from Escherichia coli (e.g., pBR322, pBR325, pUC118, pUC119, pUC118, pUC119, and pBluescript), plasmids derived from Bacillus subtilis (e.g., pUB110 and pTP5); and plasmids derived from yeasts (e.g., YEp13 and YCp50). Examples of a phage DNA include λ phages (Charon4A, Charon21A EMBL3, EMBL4, λgt10, λgt11, and λZAP). Further animal virus vectors such as retrovirus and a vaccinia virus and insect virus vectors such as a baculovirus can be also used.

[0082]To insert a transcriptional factor of the present invention into a vector, use may be made of a method of digesting a purified DNA with appropriate restriction enzymes, inserting the obtained DNA fragment into the restriction site of a suitable vector DNA or a multi-cloning site, and ligating it to the vector.

[0083]In a case where a reporter gene, for example, a GUS gene, widely used in plants is linked to the 3' end of the gene encoding a transcriptional factor of the present invention, the strength of the gene expression can be easily evaluated by checking GUS activity. As such a reporter gene other than the GUS gene, luciferase and a green fluorescent protein can be used.

5. Preparation of Transformant

[0084]A transformant of the present invention can be obtained by introducing an expression vector of the present invention into a host. The host used herein is not particularly limited as long as it can express a promoter, a gene of interest, or an environmental stress-responsive transcriptional factor. Of them, a plant is preferable. In a case of a plant host, a transformant plant (transgenic plant) can be obtained as follows.

[0085]A plant to be transformed in the present invention refers to an entire plant, a plant organ (such as leaf, petal, stem, root, or seed), a plant tissue (such as the epidermis, phloem, parenchyma, xylem, or vascular bundle), or a plant culture cell. Examples of plants used for transformation include plants belonging to the Brassicaceae, Gramineae, Solanaceae and Leguminosae (see below); however they are not limited to these plants.

[0086]Brassicaceae: Arabidopsis thaliana

[0087]Gramineae: Nicotiana tabacum

[0088]Solanaceae: Zea mays, Oryza sativa

[0089]Leguminosae: Glycine max

[0090]The aforementioned recombinant vector can be introduced into a plant by a conventional transformation method such as electroporation, Agrobacterium method, particle gun method, or PEG method.

[0091]For example, where electroporation is used, a gene is introduced into a host by treating a vector by an electroporation device equipped with a pulse controller under conditions: a voltage of 500 to 1,600 V, 25 to 1,000 μF, and 20 to 30 msec.

[0092]When a particle gun method is used, a plant body, organ and tissue may be directly used. Alternatively, they may be used after they are sectioned to pieces or after protoplasts of them are prepared. The samples thus prepared may be processed by a gene-introduction device (for example, PDS-1000/He manufactured by Bio-Rad). Processing conditions vary depending upon a plant or sample. Generally, processing is performed at a pressure of about 1,000 to 1800 psi and a distance of about 5 to 6 cm.

[0093]Furthermore, a gene of interest can be introduced into a plant by using a plant virus as a vector. Examples of available plant viruses include a cauliflower mosaic virus. More specifically, a virus genome is inserted into a vector derived from Escherichia coli to prepare a recombinant and then such a gene of interest is inserted into the virus genome. The virus genome thus modified is excised out from the recombinant with restriction enzymes and inoculated into a plant host. In this manner the gene of interest can be introduced into the plant host.

[0094]In the method using a Ti plasmid of the Agrobacterium, when bacteria belonging to the Agrobacterium are transfected to a plant, a portion of plasmid DNA of the bacteria is transferred into a plant genome. Using such a characteristic, a gene of interest is introduced into a plant host. Of bacteria belonging to the Agrobacterium, Agrobacterium tumefaciens, when it is introduced into a plant by transfection, produces a tumor called a crown gall. Also, a plant when it is transfected with Agrobacterium rhizogenes, it produces hairy roots. These phenomena are caused by transferring a region called a T-DNA region (transferred DNA region) present in a plasmid such as a Ti plasmid or Ri plasmid present in each bacterium into a plant and incorporating the region into a plant genome at a time of transfection.

[0095]By inserting desired DNA, which is to be incorporated into a plant genome, into the T-DNA region on a Ti or Ri plasmid, the desired DNA can be incorporated into a plant genome, when the host is transfected with Agrobacterium bacteria.

[0096]Tumoral tissues, shoots and hairy roots obtained as a result of transformation can be directly used in cell culture, tissue culture, or organ culture. Also, when a plant hormone such as auxin, cytokinin, gibberellin, abscisic acid, ethylene, or brassinoride, is administered to them in an appropriate concentration by using a conventional plant tissue culture method, a plant body can be regenerated from them.

[0097]A vector according to the present invention can be not only incorporated into the plant hosts mentioned above but also introduced into bacteria belonging to the Escherichia such as Escherichia coli, the Bacillus such as Bacillus subtilis and the Pseudomonas such as Pseudomonas putida; yeasts such as Saccharomyces cerevisiae and Schizosaccharomyces pombe; animal cells such as COS cells and CHO cells; and insect cells such as Sf9 cells, to obtain a transformant. Where a bacterium such as Escherichia coli or yeast is used as a host, it is preferable that a recombinant vector according to the present invention can be self-replicated in the bacterium and, at the same time, is comprised of a promoter of the present invention, a ribosome binding sequence, a gene of interest and a transcription termination sequence. Furthermore, a gene regulating the promoter may be included in the bacterium.

[0098]A method for introducing a recombinant vector into bacteria is not particularly limited as long as it is a method which can introduce DNA into bacteria. Examples of such a method include a method of using calcium ions and an electroporation method.

[0099]When a yeast is used as a host, Saccharomyces cerevisiae and Schizosaccharomyces pombe may be used. A method for introducing a recombinant vector is not particularly limited as long as it is a method for introducing DNA into a yeast. Examples of such a method include electroporation, spheroplast method, and lithium acetate method.

[0100]Where an animal cell is used as a host, a monkey COS-7 cell, Vero, Chinese hamster ovary cell (CHO cell), and mouse L cell etc. are used. Examples of methods for introducing a recombinant vector into an animal cell include electroporation, calcium phosphate method, and lipofection method.

[0101]When an insect cell is used as a host, a Sf9 cell and the like may be used. Examples of method for introducing a recombinant vector into an insect cell include a calcium phosphate method, lipofection method, and electroporation method.

[0102]Whether a gene is incorporated into a host or not is confirmed by a PCR method, Southern hybridization, Northern hybridization method or the like. For example, PCR is performed by preparing DNA from a transformant, and designing DNA specific primers. PCR is carried out under the same conditions as used for preparing the plasmid mentioned above. Thereafter, the obtained amplified product is subjected to agarose gel electrophoresis, polyacrylamide gel electrophoresis or capillary electrophoresis, and stained with ethidium bromide, or SYBR Green solution, etc. If the amplified product is found as a single band, it is confirmed that a transformant is obtained. Alternatively, the amplified product can be also detected by PCR using primers previously stained with a fluorescent dye or the like. Furthermore, there may be employed a method in which the amplified product is bound to a solid phase such as a microplate and confirmed by fluorescent or an enzymatic reaction.

4. Production of Plant

[0103]In the present invention, a transformed plant body can be regenerated from the above transformed plant cell or the like. As a regeneration method, use is made of one in which callus-form transformed cells are transferred to a medium having a different hormone in a different concentration and cultured to form an adventitious embryo, from which an entire plant body is obtained. Examples of the medium to be used herein include an LS medium and an MS medium.

[0104]The "method for producing a plant body" of the present invention comprises steps of: introducing a plant expression vector, into which the above plant promoter or a gene encoding an environmental stress-responsive transcriptional factor is inserted, into a host cell to obtain a transformed plant cell; regenerating a transformed plant body from the transformed plant cell; obtaining a plant seed from the transformed plant body; and producing a plant body from the plant seed.

[0105]To obtain plant seeds from a transformed plant body, for example, the transformed plant body is collected from a rooting medium and transferred to a pot having soil containing water placed therein. Then, the transformed plant body is grown at constant temperature to form flowers. Finally seeds are obtained. To produce a plant body from a seed, for example, when a seed formed on a transformed plant body has matured, the seed is isolated and seeded in soil containing water, followed by growing at constant temperature under illumination. The plant thus bred becomes an environmental stress-resistant plant exhibiting the stress resistance corresponding to the responsivity of a promoter introduced therein or a gene encoding the environmental stress-responsive transcriptional factor introduced therein.

BRIEF DESCRIPTION OF THE DRAWINGS

[0106]FIG. 1 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL03-07-F12;

[0107]FIG. 2 is a characteristic graph showing the relationship between cold treatment time and expression ratio regarding FL04-12-F24;

[0108]FIG. 3 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL04-14-N10;

[0109]FIG. 4 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL04-14-P24;

[0110]FIG. 5 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL04-17-I03;

[0111]FIG. 6 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL04-17-I03;

[0112]FIG. 7 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL04-17-M08;

[0113]FIG. 8 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL04-17-M22;

[0114]FIG. 9 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-05-A17;

[0115]FIG. 10 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-05-F20;

[0116]FIG. 11 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-05-G20;

[0117]FIG. 12 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-09-N09;

[0118]FIG. 13 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-10-J09;

[0119]FIG. 14 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL05-10-J09;

[0120]FIG. 15 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL05-10-M08;

[0121]FIG. 16 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL05-11-H09;

[0122]FIG. 17 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-12-H13;

[0123]FIG. 18 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL05-12-H13;

[0124]FIG. 19 is a characteristic graph showing the relationship between ABA treatment time and expression ratio regarding FL05-13-I20;

[0125]FIG. 20 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-14-E15;

[0126]FIG. 21 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-14-E16;

[0127]FIG. 22 is a characteristic graph showing the relationship between cold treatment time and expression ratio regarding FL05-14-E16;

[0128]FIG. 23 is a characteristic graph showing the relationship between ABA treatment time and expression ratio regarding FL05-14-E16;

[0129]FIG. 24 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-16-F03;

[0130]FIG. 25 is a characteristic graph showing the relationship between ABA treatment time and expression ratio regarding FL05-16-F03;

[0131]FIG. 26 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-16-H23;

[0132]FIG. 27 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL05-16-H23;

[0133]FIG. 28 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-18-M07;

[0134]FIG. 29 is a characteristic graph showing the relationship between ABA treatment time and expression ratio regarding FL05-18-M07;

[0135]FIG. 30 is a characteristic graph showing the relationship between ABA treatment time and expression ratio regarding FL05-18-021;

[0136]FIG. 31 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-19-F21;

[0137]FIG. 32 is a characteristic graph showing the relationship between ABA treatment time and expression ratio regarding FL05-19-F21;

[0138]FIG. 33 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-19-022;

[0139]FIG. 34 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL05-19-022;

[0140]FIG. 35 is a characteristic graph showing the relationship between ABA treatment time and expression ratio regarding FL05-19-022;

[0141]FIG. 36 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL05-21-K17;

[0142]FIG. 37 is a characteristic graph showing the relationship between ABA treatment time and expression ratio regarding FL06-10-F03;

[0143]FIG. 38 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL06-12-H12;

[0144]FIG. 39 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL06-12-H12;

[0145]FIG. 40 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL07-12-123;

[0146]FIG. 41 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL08-08-H23;

[0147]FIG. 42 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL08-08-O14;

[0148]FIG. 43 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL08-09-M05;

[0149]FIG. 44 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL08-10-K08;

[0150]FIG. 45 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL08-11-P07;

[0151]FIG. 46 is a characteristic graph showing the relationship between cold treatment time and expression ratio regarding FL08-11-P07;

[0152]FIG. 47 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL08-13-F10;

[0153]FIG. 48 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL08-13-F10;

[0154]FIG. 49 is a characteristic graph showing the relationship between ABA treatment time and expression ratio regarding FL08-13-F10;

[0155]FIG. 50 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL08-19-D04;

[0156]FIG. 51 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL08-19-G15;

[0157]FIG. 52 is a characteristic graph showing the relationship between ABA treatment time and expression ratio regarding FL09-06-B11;

[0158]FIG. 53 is a characteristic graph showing the relationship between ABA treatment time and expression ratio regarding FL09-07-G17;

[0159]FIG. 54 is a characteristic graph showing the relationship between ABA treatment time and expression ratio regarding FL09-10-A12;

[0160]FIG. 55 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL09-13-P15;

[0161]FIG. 56 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL02-05-105;

[0162]FIG. 57 is a characteristic graph showing the relationship between cold treatment time and expression ratio regarding FL04-12-N15;

[0163]FIG. 58 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL04-16-P21;

[0164]FIG. 59 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL04-17-N22;

[0165]FIG. 60 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL04-20-P19;

[0166]FIG. 61 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL02-09-H01;

[0167]FIG. 62 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-01-D08;

[0168]FIG. 63 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL05-02-G08;

[0169]FIG. 64 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-02-O17;

[0170]FIG. 65 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL05-07-L13;

[0171]FIG. 66 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-08-B14;

[0172]FIG. 67 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-09-N10;

[0173]FIG. 68 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-11-L01;

[0174]FIG. 69 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-12-J09;

[0175]FIG. 70 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-14-D24;

[0176]FIG. 71 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-14-F20;

[0177]FIG. 72 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-14-I08;

[0178]FIG. 73 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-15-C04;

[0179]FIG. 74 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-15-E19;

[0180]FIG. 75 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-18-A06;

[0181]FIG. 76 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL05-18-H15;

[0182]FIG. 77 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-19-C02;

[0183]FIG. 78 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-20-M16;

[0184]FIG. 79 is a characteristic graph showing the relationship between cold treatment time and expression ratio regarding FL05-20-N18;

[0185]FIG. 80 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-21-E06;

[0186]FIG. 81 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-21-L12;

[0187]FIG. 82 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL06-07-B08;

[0188]FIG. 83 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL06-08-H20;

[0189]FIG. 84 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL06-09-N04;

[0190]FIG. 85 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL06-11-K21;

[0191]FIG. 86 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL07-07-G15;

[0192]FIG. 87 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL07-12-D17;

[0193]FIG. 88 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL08-11-C23;

[0194]FIG. 89 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL08-13-G20;

[0195]FIG. 90 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL08-15-M21;

[0196]FIG. 91 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL08-18-N19;

[0197]FIG. 92 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL08-19-C07;

[0198]FIG. 93 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL08-19-P05;

[0199]FIG. 94 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL09-07-G09;

[0200]FIG. 95 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL09-07-G15;

[0201]FIG. 96 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL09-10-J18;

[0202]FIG. 97 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL09-11-I12;

[0203]FIG. 98 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL09-12-B03;

[0204]FIG. 99 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL09-16-I11;

[0205]FIG. 100 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL09-16-M04;

[0206]FIG. 101 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL11-01-J18;

[0207]FIG. 102 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL11-07-D13;

[0208]FIG. 103 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL11-07-F02;

[0209]FIG. 104 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL11-07-N15;

[0210]FIG. 105 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL11-10-D10;

[0211]FIG. 106 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL08-16-G17;

[0212]FIG. 107 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-11-M11;

[0213]FIG. 108 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-11-M11;

[0214]FIG. 109 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL06-11-K21;

[0215]FIG. 110 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL06-11-K21;

[0216]FIG. 111 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL06-08-H20;

[0217]FIG. 112 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL06-08-H20;

[0218]FIG. 113 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-16-H23;

[0219]FIG. 114 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-16-H23;

[0220]FIG. 115 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL08-16-D06;

[0221]FIG. 116 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL07-10-G04;

[0222]FIG. 117 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL04-17-D16;

[0223]FIG. 118 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-19-M20;

[0224]FIG. 119 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL08-1-M13;

[0225]FIG. 120 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL04-15-K19;

[0226]FIG. 121 is a characteristic graph showing the relationship between cold stress and expression ratio regarding RAFL04-15-K19;

[0227]FIG. 122 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-11-L01;

[0228]FIG. 123 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-11-L01;

[0229]FIG. 124 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-14-C11;

[0230]FIG. 125 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-19-G24;

[0231]FIG. 126 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-19-G24;

[0232]FIG. 127 is a characteristic graph showing the relationship between cold stress and expression ratio regarding RAFL05-19-G24;

[0233]FIG. 128 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-20-N02;

[0234]FIG. 129 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-18-H12;

[0235]FIG. 130 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-18-H12;

[0236]FIG. 131 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-19-E19;

[0237]FIG. 132 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL06-10-D22;

[0238]FIG. 133 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL06-12-M01;

[0239]FIG. 134 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL06-12-M01;

[0240]FIG. 135 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-14-D24;

[0241]FIG. 136 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-14-D24;

[0242]FIG. 137 is a characteristic graph showing the relationship between cold stress and expression ratio regarding RAFL05-20-N17;

[0243]FIG. 138 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-20-N17;

[0244]FIG. 139 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL04-17-F21;

[0245]FIG. 140 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL09-12-N16;

[0246]FIG. 141 is a characteristic graph showing the relationship between drought stress and expression ratio regarding AFL05-19-105;

[0247]FIG. 142 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-19-I05;

[0248]FIG. 143 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-21-I22;

[0249]FIG. 144 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL08-11-H20;

[0250]FIG. 145 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL08-11-H20;

[0251]FIG. 146 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-21-C17;

[0252]FIG. 147 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-21-C17;

[0253]FIG. 148 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-08-D06;

[0254]FIG. 149 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-20-M16;

[0255]FIG. 150 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-20-M16;

[0256]FIG. 151 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL11-01-J18;

[0257]FIG. 152 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL11-01-J18;

[0258]FIG. 153 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL11-09-C20;

[0259]FIG. 154 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-18-N16;

[0260]FIG. 155 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL11-10-D10;

[0261]FIG. 156 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL11-10-D10;

[0262]FIG. 157 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL04-17-N22;

[0263]FIG. 158 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL04-17-N22;

[0264]FIG. 159 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-09-G15;

[0265]FIG. 160 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-09-G15;

[0266]FIG. 161 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-21-L12; and

[0267]FIG. 162 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-21-L12.

BEST MODE FOR CARRYING OUT THE INVENTION

[0268]Hereinafter, the present invention will be further explained in detail by way of examples, which should not be construed as limiting the scope of the present invention.

Example 1

Isolation of Promoter

1. Materials and Methods

[0269](1) Arabidopsis cDNA Clone

[0270]A microarray was constructed by using about 7,000 cDNA molecules in total including genes isolated from an Arabidopsis full-length cDNA libraries, responsive-to-dehydration (RD) genes, early responsive-to-dehydration (ERD) genes, kin 1 genes, kin2 genes, and cor15a genes; α-tubulin genes as an internal standard; and mouse nicotinic acetylcholine receptor epsilon subunit (nAChRE) genes and mouse glucocorticoid receptor homologous genes, as negative controls.

[0271]Positive control: dehydration-inducible genes (responsive-to-dehydration genes: rd, and early responsive-to-dehydration genes: erd)

[0272]Internal standard: α-tubulin gene

[0273]Negative control: mouse nicotinic acetylcholine receptor epsilon subunit (nAChRE) genes and mouse glucocorticoid receptor homologous genes, which do not substantially have homology with any given sequence in an Arabidopsis database for analyzing non-specific hybridization.

(2) Arabidopsis Full-Length cDNA Microarray

[0274]The present inventors have constructed full-length cDNA libraries from an Arabidopsis plant body under different conditions (e.g., dehydration treatment, cold treatment and non-treatment in different growth stages from budding to maturation of seeds) by the biotinylated CAP trapper method. From the full-length cDNA libraries, the present inventors isolated individually about 7,000 independent Arabidopsis full-length cDNA molecules. The cDNA fragments, which were amplified by PCR, were arranged on a slide glass in accordance with a known method (Eisen and Brown, 1999). The present inventors prepared a full-length cDNA microarray containing about 7,000 Arabidopsis full-length cDNA molecules, which contain the genes below.

(3) Isolation of Dehydration-, Cold-, High Salt-, and ABA-Inducible Genes Using cDNA Microarray

[0275]In this example, dehydration-, cold-, high salt-, and ABA-inducible genes were isolated by using a full-length cDNA microarray containing about 7,000 Arabidopsis full-length cDNA molecules.

[0276]Probes of a plant treated with different stresses and an untreated plant with stress and labeled with Cy3 and Cy5 fluorescent dyes were mixed. The probes were hybridized with the full-length cDNA microarray containing about 7,000 Arabidopsis full-length cDNA molecules. By such a double labeling of a pair of cDNA probes wherein one of the mRNA samples was labeled with Cy3-dUTP and the other was labeled with Cy5-dUTP, hybridization with DNA elements on a microarray can be performed simultaneously, with the result that quantitative determination of gene expression under two different conditions (that is, stressed and unstressed conditions) can be directly and easily performed. The hybridized microarray was scanned by two discrete laser channels for Cy3 and Cy5 emission from each of DNA elements. Subsequently, the intensity ratio between two fluorescent signals from each DNA element was determined. Based on the relative value of the intensity ratio, a change of differential expression of genes represented as a cDNA spot on the microarray was determined. In this example, an α-tubulin gene, whose expression level was almost equivalent under two different experimental conditions was used, as an internal control gene.

[0277]In the full-length cDNA microarray containing about 7,000 Arabidopsis full-length cDNA molecules, a procedure for identifying dehydration-, cold-, high salt-, and ABA-inducible genes will be explained.

[0278]1) Both mRNA molecules derived from a plant treated with one of the stresses mentioned above and mRNA molecules derived from a wild-type plant unstressed were used to prepare Cy3-labeled cDNA and Cy5-labeled cDNA probes, respectively. These cDNA probes were mixed and hybridized with the cDNA microarray. In this example, an α-tubulin gene, which exhibits almost the same expression level under two type conditions, was as used as an internal control gene. A gene that exhibits the expression ratio of dehydration:unstressed, cold:unstressed, or high salt:unstressed more than double of that of the α-tubulin gene was defined as an inducible gene by a stress given to the gene.

[0279]2) Both mRNA molecules derived from a 35S:DREB1A transgenic plant and mRNA molecules derived from a wild-type plant unstressed were used to prepare Cy3-labeled cDNA and Cy5-labeled cDNA probes, respectively. These cDNA probes were mixed and hybridized with a cDNA microarray. In this example, an α-tubulin gene exhibiting almost the same expression level under two type conditions was used as an internal control gene. A gene of 35S:DREB1A transgenic plant exhibiting an expression ratio more than double of that of a gene of the wild type plant unstressed was defined as a DREB1A target gene.

[0280]Both mRNA molecules derived from a plant treated with a stress and mRNA molecules derived from a wild-type plant unstressed were used to prepare Cy3-labeled cDNA and Cy5-labeled cDNA probes, respectively. These cDNA probes were mixed and hybridized with a cDNA microarray. The same experiment was repeated three times to evaluate the reproducibility of microarray analysis. When the same mRNA sample was hybridized with various microarrays, a good correlation was observed. A gene that exhibits an expression ratio (dehydration/unstressed, cold/unstressed) more than double of that of the α-tubulin gene was defined as an inducible gene by a stress given to the gene.

(4) Analysis of Sequence

[0281]Plasmid DNA extracted by a plasmid preparation device (NA 100) manufactured by Kurabo was sequenced to find homology of gene sequences. The DNA sequence was determined by a dye terminator cycle sequencing method using a DNA sequencer (ABI PRISM 3700. PE Applied Biosystems, CA, USA). Based on the GenBank/EMBL database, homology of sequences was found by using the BLAST program.

(5) Amplification of cDNA

[0282]ZAPII (Carninci et al., 1996) was used as a vector for constructing a cDNA library. The cDNA inserted in a vector for the library was amplified by PCR using complementary primers to the sequences of both sides of the cDNA.

[0283]The sequences of primers are as follows:

TABLE-US-00002 (SEQ ID NO: 91) FL forward 1224: 5'-CGCCAGGGTTTTCCCAGTCACGA (SEQ ID NO: 92) FL reverse 1233: 5'-AGCGGATAACAATTTCACACAGGA

[0284]To 100 μl of a PCR solution mixture (0.25 mM dNTP, 0.2 μM PCR primer, 1×Ex Taq Buffer, and 1.25 U Ex Taq polymerase (manufactured by Takara Shuzo)), a plasmid (1 to 2 ng) was added as a template. PCR was performed under the following conditions: an initial reaction at 94° C. for 3 minutes, 35 cycles each consisting of 95° C. for one minute, 60° C. for 30 seconds and 72° C. for 3 minutes, and a final reaction at 72° C. for 3 minutes. After a PCR product was precipitated with ethanol, the precipitate was dissolved in 25 μl of 3×SSC and then subjected to electrophoresis using 0.7% agarose gel. The quality of the DNA obtained and amplification efficiency of PCR were confirmed.

(6) Construction of cDNA Microarray

[0285]Using a gene tip microarray stamp machine GTMASS SYSTEM (manufactured by Nippon Laser & Electronics Lab.), 0.5 μl of a PCR product (100 to 500 ng/ml) was loaded from a 384-well microtiter plate to form spots of the PCR product (5 nl for each) at intervals of 280 μm on 6 micro slide glasses (S7444, manufactured Matsunami) coated with poly-L lysine. To spot DNA in an equivalent amount, the slide after printing was placed in a beaker containing heated distilled water to moisten it and placed at 100° C. for 3 seconds to dry it. After the slide was placed on a slide rack, the rack was transferred into a glass chamber. To the glass chamber, a blocking solution (15 ml of 1M sodium borate salt (pH 8.0), 5.5 g succinic anhydrous compound (Wako), and 335 ml of 1-methyl-2-pyrrolidon (Wako)) was poured. After the glass chamber housing the slide rack was shaken up and down 5 times and gently shaking for 15 minutes, the slide rack was transferred to a glass chamber containing boiling water, shaken 5 times, and allowed to stand alone for 2 minutes. Thereafter, the slide rack was transferred to a glass chamber containing 95% ethanol, shaken 5 times, and centrifuged at 800 rpm for 30 minutes.

(7) Plant Material and Isolation of RNA

[0286]As a plant material, use was made of a wild type Arabidopsis thaliana plant body which was seeded on an agar medium and grown for 3 weeks (Yamaguchi-Shinozaki and Shinozaki, 1994) and an Arabidopsis thaliana (Colombian species) plant body into which DREB1A cDNA (Kasuga et al., 1999) connected to a 35S promoter of a cauliflower mosaic virus was introduced. Dehydration- and cold-stress treatments were performed in accordance with the method of Yamaguchi-Shinozaki and Shinozaki (1994). More specifically, dehydration treatment was performed by pulling a plant body out of the agar medium, placing it on a filter, and dried at a temperature of 22° C. and a relative humidity of 60%. The cold treatment was performed by transferring a plant body grown at 22° C. to 4° C. High salt stress treatment was performed by growing a plant body at an aqueous solution containing 250 mM NaCl.

[0287]After wild type plant bodies were exposed to stress-treatment for 2 or 10 hours, a sample was taken from each of plant bodies and stored in cryogenic conditions with liquid nitrogen. Furthermore, wild type and DREB1A overexpression-type transformants cultured in an agar medium without kanamycin were subjected to an experiment for identifying a DREB1A target gene. The DREB1A overexpression-type transformant was not treated with stresses. The total RNA was isolated from a plant body by using ISOGEN (Nippon gene, Tokyo, Japan) and mRNA was isolated and purified by Oligotex-dT30 mRNA purification kit (Takara, Tokyo, Japan).

(8) Fluorescent Labeling of Probe

[0288]Each of the mRNA samples was subjected to a reverse transcription reaction in the presence of Cy3 dUTP or Cy5 dUTP (Amersham Pharmacia). The composition of the buffer (30 μl) used in the reverse transcription reaction is shown in Table 2.

TABLE-US-00003 TABLE 2 poly(A)+ RNA with 6 μg oligo(dT) 18-mer 1 μg 10 mM DTT 500 μM dATP, dCTP and dGTP 200 μM dTTP 100 μM Cy3 dUTP or Cy5 dUTP 400 units of SuperScript II Reverse Transcriptase (Life technologies) 1× Superscript First Strand Synthesis Buffer (Life technologies) Total 30 μL

[0289]After reaction was performed at 42° C. for one hour, two samples (labeled with Cy3 and Cy5) were mixed to obtain a reaction mixture. To this reaction mixture, 15 μl of 0.1 M NaOH and 1.5 μl of 20 mM EDTA were added and treated at 70° C. for 10 minutes. Further, 15 μl of 0.1 M HCl was added to the reaction mixture, a sample was taken and transferred to a Micro con 30 micro concentrator (Amicon). 400 μl of TE buffer was added to the sample and centrifuged until the volume of the buffer reached 10 to 20 μl. The effluent was discarded. 400 μl of TE buffer and 20 μl of 1 mg/ml human Cot-1 DNA (Gibco BRL) were added to the resultant mixture and the mixture was again centrifuged. The labeled samples were centrifugally collected and several μl of distilled water was added thereto. The obtained probes, 2 μl of 10 μg/μl yeast tRNA, 2 μl of 1 μg/μl pd(A)12-18 (Amersham Pharmacia), 3.4 ml of 20×SSC, and 0.6 μl of 10% SDS were added. Further, the samples were denatured at 100° C. for 1 minute and placed at room temperature for 30 minutes and thereafter used in hybridization.

(9) Microarray Hybridization and Scanning

[0290]A probe was subjected to high-speed centrifugation for one minute by a benchtop micro centrifuge. To avoid generation of bubbles, the probe was placed at the center of an array and a cover slip was placed thereon. Four drops of 5 μl of 3×SSC were dropped on a slide glass and a chamber was kept at a suitable humidity to prevent the probe from being dried during hybridization. After the slide glass was placed in a cassette for hybridization (THC-1, BM machine) and the cassette was sealed, hybridization treatment was performed at 65° C. for 12 to 16 hours. The slide glass was taken out from the cassette and placed on the slide rack. After the cover slip was carefully removed in solution 1 (2×SSC, 0.1% SDS), the rack was washed while shaking and transferred into solution 2 (1×SSC) to wash for 2 minutes. The rack was further transferred to solution 3 (0.2×SSC), allowed to stand for 2 minutes, and centrifuged at 800 rpm for 1 min to dry.

[0291]The microarray was scanned at a resolution of 10 μm per pixel by use of a scanning laser microscope (ScanArray 4000; GSI Lumonics, Watertown, Mass.). As a program for analyzing microarray data, Imagene Ver 2.0 (BioDiscovery) and QuantArray (GSI Lumonics) were used.

(10) Northern Analysis

[0292]Northern analysis was performed using total RNA, (Yamaguchi-Shinozaki and Shinozaki, 1994). DNA fragments were isolated from the Arabidopsis thaliana full-length cDNA library by a PCR method and used as probes for Northern hybridization.

(11) Determination of Promoter Region

[0293]Based on the genomic information of Arabidopsis thaliana in a data base (GenBank/EMBL, ABRC), a promoter region was analyzed by using the BLAST program for gene analysis.

2. Results

(1) Stress-Inducible Gene

[0294]Fluorescent-labeled cDNA was prepared by subjecting mRNA isolated from an unstressed Arabidopsis thaliana plant to a reverse transcription reaction in the presence of Cy5-dUTP. A second probe labeled with Cy3-dUTP was prepared from a plant treated at low temperature for 2 hours. Both probes were simultaneously hybridized with a cDNA microarray comprising about 7,000 Arabidopsis thaliana cDNA clones and then a pseudo color image was created.

[0295]Genes induced and suppressed by a stress are represented by a red signal and green signal, respectively. Genes expressed at virtually the same level in both treatments are represented by a yellow signal. The intensity of each spot corresponds to the absolute value of the expression level of each gene. It is shown that a cold-inducible gene (rd29A) is represented by a red signal whereas an α-tubulin gene (an internal control) is represented by a yellow signal.

(2) Identification of Promoter Region

[0296]As a result of identifying a promoter region, the promoter gene regions of 90 types of genes were obtained in a full-length cDNA microarray containing about 7,000 of Arabidopsis full-length cDNA molecules. The name of these 90 types of genes and their promoter sequences are summarized in Table 3

TABLE-US-00004 TABLE 3 Name of gene SEQ ID NO: FL03-07-F12 SEQ ID NO: 1 FL04-12-F24 SEQ ID NO: 2 FL04-14-N10 SEQ ID NO: 3 FL04-14-P24 SEQ ID NO: 4 FL04-17-I03 SEQ ID NO: 5 FL04-17-M08 SEQ ID NO: 6 FL04-17-M22 SEQ ID NO: 7 FL05-05-A17 SEQ ID NO: 8 FL05-05-F20 SEQ ID NO: 9 FL05-05-G20 SEQ ID NO: 10 FL05-09-N09 SEQ ID NO: 11 FL05-10-J09 SEQ ID NO: 12 FL05-10-M08 SEQ ID NO: 13 FL05-11-H09 SEQ ID NO: 14 FL05-12-H13 SEQ ID NO: 15 FL05-13-I20 SEQ ID NO: 16 FL05-14-E15 SEQ ID NO: 17 FL05-14-E16 SEQ ID NO: 18 FL05-16-F03 SEQ ID NO: 19 FL05-16-H23 SEQ ID NO: 20 FL05-18-M07 SEQ ID NO: 21 FL05-18-O21 SEQ ID NO: 22 FL05-19-F21 SEQ ID NO: 23 FL05-19-O22 SEQ ID NO: 24 FL05-21-K17 SEQ ID NO: 25 FL06-10-F03 SEQ ID NO: 26 FL06-12-H12 SEQ ID NO: 27 FL07-12-I23 SEQ ID NO: 28 FL08-08-H23 SEQ ID NO: 29 FL08-08-O14 SEQ ID NO: 30 FL08-09-M05 SEQ ID NO: 31 FL08-10-K08 SEQ ID NO: 32 FL08-11-P07 SEQ ID NO: 33 FL08-13-F10 SEQ ID NO: 34 FL08-19-D04 SEQ ID NO: 35 FL08-19-G15 SEQ ID NO: 36 FL09-06-B11 SEQ ID NO: 37 FL09-07-G17 SEQ ID NO: 38 FL09-10-A12 SEQ ID NO: 39 FL09-13-P15 SEQ ID NO: 40 FL02-05-I05 SEQ ID NO: 41 FL04-12-N15 SEQ ID NO: 42 FL04-16-P21 SEQ ID NO: 43 FL04-17-N22 SEQ ID NO: 44 FL04-20-P19 SEQ ID NO: 45 FL02-09-H01 SEQ ID NO: 46 FL05-01-D08 SEQ ID NO: 47 FL05-02-G08 SEQ ID NO: 48 FL05-02-O17 SEQ ID NO: 49 FL05-07-L13 SEQ ID NO: 50 FL05-08-B14 SEQ ID NO: 51 FL05-09-N10 SEQ ID NO: 52 FL05-11-L01 SEQ ID NO: 53 FL05-12-J09 SEQ ID NO: 54 FL05-14-D24 SEQ ID NO: 55 FL05-14-F20 SEQ ID NO: 56 FL05-14-I08 SEQ ID NO: 57 FL05-15-C04 SEQ ID NO: 58 FL05-15-E19 SEQ ID NO: 59 FL05-18-A06 SEQ ID NO: 60 FL05-18-H15 SEQ ID NO: 61 FL05-19-C02 SEQ ID NO: 62 FL05-20-M16 SEQ ID NO: 63 FL05-20-N18 SEQ ID NO: 64 FL05-21-E06 SEQ ID NO: 65 FL05-21-L12 SEQ ID NO: 66 FL06-07-B08 SEQ ID NO: 67 FL06-08-H20 SEQ ID NO: 68 FL06-09-N04 SEQ ID NO: 69 FL06-11-K21 SEQ ID NO: 70 FL07-07-G15 SEQ ID NO: 71 FL07-12-D17 SEQ ID NO: 72 FL08-11-C23 SEQ ID NO: 73 FL08-13-G20 SEQ ID NO: 74 FL08-15-M21 SEQ ID NO: 75 FL08-18-N19 SEQ ID NO: 76 FL08-19-C07 SEQ ID NO: 77 FL08-19-P05 SEQ ID NO: 78 FL09-07-G09 SEQ ID NO: 79 FL09-07-G15 SEQ ID NO: 80 FL09-10-J18 SEQ ID NO: 81 FL09-11-I12 SEQ ID NO: 82 FL09-12-B03 SEQ ID NO: 83 FL09-16-I11 SEQ ID NO: 84 FL09-16-M04 SEQ ID NO: 85 FL11-01-J18 SEQ ID NO: 86 FL11-07-D13 SEQ ID NO: 87 FL11-07-F02 SEQ ID NO: 88 FL11-07-N15 SEQ ID NO: 89 FL11-10-D10 SEQ ID NO: 90

(3) The Relationship Between Stress Treatment Time and Expression Ratio

[0297]The 90 types of stress inducible genes isolated above were analyzed for the relationship between stress treatment time and expression ratio. The results are shown in FIGS. 1 to 105. The relationship between 90 types of genes and stress treatment are shown in Table 4.

TABLE-US-00005 TABLE 4 Name of gene Type of applied stress Drawing FL03-07-F12 Dehydration FIG. 1 FL04-12-F24 Exposure to cold FIG. 2 FL04-14-N10 Dehydration FIG. 3 FL04-14-P24 Dehydration FIG. 4 FL04-17-I03 Dehydration, Exposure to a high level salt solution FIGS. 5, 6 FL04-17-M08 Exposure to a high level salt solution FIG. 7 FL04-17-M22 Dehydration FIG. 8 FL05-05-A17 Dehydration FIG. 9 FL05-05-F20 Dehydration FIG. 10 FL05-05-G20 Dehydration FIG. 11 FL05-09-N09 Dehydration FIG. 12 FL05-10-J09 Dehydration, Exposure to a high level salt solution FIGS. 13, 14 FL05-10-M08 Exposure to a high level salt solution FIG. 15 FL05-11-H09 Exposure to a high level salt solution FIG. 16 FL05-12-H13 Dehydration, Exposure to a high level salt solution FIGS. 17, 18 FL05-13-I20 ABA treatment FIG. 19 FL05-14-E15 Dehydration FIG. 20 FL05-14-E16 Dehydration, Exposure to cold, ABA treatment FIGS. 21-23 FL05-16-F03 Dehydration, ABA treatment FIGS. 24, 25 FL05-16-H23 Dehydration, Exposure to a high level salt solution FIGS. 26, 27 FL05-18-M07 Dehydration, ABA treatment FIGS. 28, 29 FL05-18-O21 ABA treatment FIG. 30 FL05-19-F21 Dehydration, ABA treatment FIGS. 31, 32 FL05-19-O22 Dehydration, Exposure to a high level salt solution, ABA FIGS. 33-35 treatment FL05-21-K17 Exposure to a high level salt solution FIG. 36 FL06-10-F03 ABA treatment FIG. 37 FL06-12-H12 Dehydration, Exposure to a high level salt solution FIGS. 38, 39 FL07-12-I23 Exposure to a high level salt solution FIG. 40 FL08-08-H23 Exposure to a high level salt solution FIG. 41 FL08-08-O14 Dehydration FIG. 42 FL08-09-M05 Dehydration FIG. 43 FL08-10-K08 Exposure to a high level salt solution FIG. 44 FL08-11-P07 Dehydration, Exposure to cold FIGS. 45, 46 FL08-13-F10 Dehydration, Exposure to a high level salt solution, ABA FIGS. 47-49 treatment FL08-19-D04 Dehydration FIG. 50 FL08-19-G15 Exposure to a high level salt solution FIG. 51 FL09-06-B11 ABA treatment FIG. 52 FL09-07-G17 ABA treatment FIG. 53 FL09-10-A12 ABA treatment FIG. 54 FL09-13-P15 Dehydration FIG. 55 FL02-05-I05 Exposure to a high level salt solution FIG. 56 FL04-12-N15 Exposure to cold FIG. 57 FL04-16-P21 Dehydration FIG. 58 FL04-17-N22 Exposure to a high level salt solution FIG. 59 FL04-20-P19 Dehydration FIG. 60 FL02-09-H01 Dehydration FIG. 61 FL05-01-D08 Dehydration FIG. 62 FL05-02-G08 Exposure to a high level salt solution FIG. 63 FL05-02-O17 Dehydration FIG. 64 FL05-07-L13 Exposure to a high level salt solution FIG. 65 FL05-08-B14 Dehydration FIG. 66 FL05-09-N10 Dehydration FIG. 67 FL05-11-L01 Dehydration FIG. 68 FL05-12-J09 Dehydration FIG. 69 FL05-14-D24 Dehydration FIG. 70 FL05-14-F20 Dehydration FIG. 71 FL05-14-I08 Dehydration FIG. 72 FL05-15-C04 Dehydration FIG. 73 FL05-15-E19 Dehydration FIG. 74 FL05-18-A06 Dehydration FIG. 75 FL05-18-H15 Exposure to a high level salt solution FIG. 76 FL05-19-C02 Dehydration FIG. 77 FL05-20-M16 Dehydration FIG. 78 FL05-20-N18 Exposure to cold FIG. 79 FL05-21-E06 Dehydration FIG. 80 FL05-21-L12 Dehydration FIG. 81 FL06-07-B08 Dehydration FIG. 82 FL06-08-H20 Dehydration FIG. 83 FL06-09-N04 Dehydration FIG. 84 FL06-11-K21 Dehydration FIG. 85 FL07-07-G15 Exposure to a high level salt solution FIG. 86 FL07-12-D17 Exposure to a high level salt solution FIG. 87 FL08-11-C23 Dehydration FIG. 88 FL08-13-G20 Dehydration FIG. 89 FL08-15-M21 Dehydration FIG. 90 FL08-18-N19 Dehydration FIG. 91 FL08-19-C07 Dehydration FIG. 92 FL08-19-P05 Exposure to a high level salt solution FIG. 93 FL09-07-G09 Exposure to a high level salt solution FIG. 94 FL09-07-G15 Dehydration FIG. 95 FL09-10-J18 Exposure to a high level salt solution FIG. 96 FL09-11-I12 Dehydration FIG. 97 FL09-12-B03 Dehydration FIG. 98 FL09-16-I11 Exposure to a high level salt solution FIG. 99 FL09-16-M04 Exposure to a high level salt solution FIG. 100 FL11-01-J18 Dehydration FIG. 101 FL11-07-D13 Exposure to a high level salt solution FIG. 102 FL11-07-F02 Exposure to a high level salt solution FIG. 103 FL11-07-N15 Exposure to a high level salt solution FIG. 104 FL11-10-D10 Exposure to a high level salt solution FIG. 105

[0298]In FIGS. 1 to 105, the vertical axis shows the expression ratio of a gene, which is calculated as follows:

Expression ratio=[(FI of a cDNA molecule under stress)/(FI of a cDNA molecule under no stress)]/[(FI of α-tubulin under stress)/(FI of α-tubulin under no stress)]

where FI is the intensity of fluorescence.

[0299]As shown in FIGS. 1 to 105, the stress inducible genes isolated by a method according to the present invention exhibit different profiles; however, it is found that expression is induced by adding each stress. From this, it is demonstrated that the nucleotide sequences positioned upstream of these 90 types of genes and represented by SEQ ID NO: 1 to 90 serve as stress responsive promoters.

Example 2

Isolation of Gene Encoding Environmental Stress Responsive Transcriptional Factor

1. Materials and Methods

[0300](1) Arabidopsis cDNA Clone

[0301]A microarray was constructed by using about 7,000 cDNA molecules in total including genes isolated from Arabidopsis full-length cDNA libraries, responsive to dehydration (RD) genes, early responsive to dehydration (ERD) genes, kin 1 genes, kin2 genes, and cor15a genes; fragments amplified from λ control template DNA by PCR as an internal standard; and mouse nicotinic acetylcholine receptor epsilon subunit (nAChRE) genes and mouse glucocorticoid receptor homologous genes, as negative controls.

[0302]Positive control: dehydration-inducible genes (responsive to dehydration genes: rd, and early responsive to dehydration genes: erd);

[0303]Internal standard: fragments amplified from λ control template DNA by PCR (TX803, manufactured by Takara Shuzo, hereinafter referred to as a "control fragment");

[0304]Negative control: mouse nicotinic acetylcholine receptor epsilon subunit (nAChRE) genes and mouse glucocorticoid receptor homologous genes, as negative controls, which do not substantially have homology with any given sequence in an Arabidopsis database for analyzing non-specific hybridization.

(2) Arabidopsis Full-Length cDNA Microarray

[0305]The present inventors have constructed full-length cDNA libraries from an Arabidopsis plant body under different conditions (e.g., dehydration treatment, cold treatment and non-treatment in different growth stages from budding to maturation of seeds) by the biotinylated CAP trapper method. From the full-length cDNA libraries, the present inventors isolated individually about 7,000 independent Arabidopsis full-length cDNA molecules. The cDNA fragments, which were amplified by PCR, were arranged on a slide glass in accordance with a known method (Eisen and Brown, 1999). The present inventors prepared a full-length cDNA microarray containing about 7,000 Arabidopsis full-length cDNA molecules, which contain the genes below.

(3) Dehydration-, Cold-, and High Salt-Inducible Genes Using cDNA Microarray

[0306]In this example, dehydration-, cold- and high salt-inducible genes were isolated by using a full length cDNA microarray containing about 7,000 Arabidopsis full-length cDNA molecules.

[0307]Probes of plants treated with different stresses and an untreated plant with stress and labeled with Cy3 and Cy5 fluorescent dyes were mixed. The probes were hybridized with the full-length cDNA microarray containing about 7,000 Arabidopsis full-length cDNA molecules. By such a double labeling of a pair of cDNA probes wherein one of the mRNA samples was labeled with Cy3-dUTP and the other was labeled with Cy5-dUTP, hybridization with DNA elements on a microarray can be performed simultaneously, with the result that quantitative determination of gene expression under two different conditions (that is, stressed and unstressed conditions) can be directly and easily performed. The hybridized microarray was scanned by two discrete laser channels for Cy3 and Cy5 emission from each of DNA elements. Subsequently, the intensity ratio between two fluorescent signals from each DNA element was determined. Based on the relative value of the intensity ratio, a change of differential expression of genes represented as a cDNA spot on the microarray was determined. In this example, an α-tubulin gene, whose expression level was almost equivalent under two different experimental conditions, was used as an internal control gene.

[0308]In the full-length cDNA microarray containing about 7,000 Arabidopsis full-length cDNA molecules, a procedure for identifying dehydration-, cold-, and high salt-inducible genes will be explained.

[0309]Both mRNA molecules derived from a plant treated with one of the stresses mentioned above and mRNA molecules derived from a wild-type plant unstressed were used to prepare Cy3-labeled cDNA and Cy5-labeled cDNA probes, respectively. These cDNA probes were mixed and hybridized with a cDNA microarray. In this example, a control fragment, which exhibits almost the same expression level under two type conditions, was used as an internal control gene. A gene that exhibits the expression ratio (dehydration/unstressed, cold/unstressed or high salt/unstressed) more than 5 times of that of the control fragment was defined as an inducible gene by a stress given to the gene.

(4) Analysis of Sequence

[0310]Plasmid DNA extracted by a DNA extraction device (model Biomek, manufactured by Beckman Coulter) and purified by use of a multiscreen 96-hole filter plate (manufactured by Millipore) was sequenced to find homology of gene sequences. A DNA sequence was determined by a dye terminator cycle sequencing method using a DNA sequencer (ABI PRISM 3700. PE Applied Biosystems, CA, USA). Based on the GenBank/EMBL database, the homology of sequences was found by using the BLAST program.

(5) Amplification of cDNA

[0311]ZAP and λFLC-1 were used as a vector for constructing a cDNA library. The cDNA inserted in a vector for the library was amplified by PCR using complementary primers to the sequences of both sides of the cDNA.

[0312]The sequences of primers are as follows:

TABLE-US-00006 (SEQ ID NO: 165) FL forward 1224: 5'-CGCCAGGGTTTTCCCAGTCACGA (SEQ ID NO: 166) FL reverse 1233: 5'-AGCGGATAACAATTTCACACAGGA

[0313]To 100 μl of a PCR solution mixture (0.25 mM dNTP, 0.2 μM PCR primer, 1×Ex Taq Buffer, and 1.25 U of Ex Taq polymerase (manufactured by Takara Shuzo)), a plasmid (1 to 2 ng) was added as a template. PCR was performed under the following conditions: initial reaction at 94° C. for 3 minutes, 35 cycles each consisting of 95° C. for one minute, 60° C. for 30 seconds, and 72° C. for 3 minutes, and a final reaction at 72° C. for 3 minutes. After a PCR product was precipitated with ethanol, the precipitate was dissolved in 25 μl of 3×SSC and subjected to electrophoresis using 0.7% agarose gel. The quality of the DNA obtained and amplification efficiency of PCR were conformed.

(6) Construction of cDNA Microarray

[0314]Using a gene tip microarray stamp machine GTMASS SYSTEM (manufactured by Nippon Laser & Electronics Lab.), 0.5 μl of a PCR product (500-1,000 ng/ml) was loaded from a 384-well microtiter plate and form spots of the PCR product (5 nl for each) at intervals of 300 μm on 48 micro slide glasses (model Super Aldehyde substrate, manufactured by Telechem International). After spotting, the slide was dried in an atmosphere having a relative humidity of 30% or less and irradiated with ultraviolet rays for mediating a cross-linking reaction.

[0315]Thereafter, the slide was treated in 0.2% SDS with shaking for 2 minutes three times and soaked in distilled water twice. Subsequently, the slides were placed on a slide rack, which was the transferred into a chamber containing hot water and allowed to stand for 2 minutes. Subsequently, to the chamber, a blocking solution (containing 1 g borohydride, 300 ml of PBS, and 90 ml of 100% ethanol) was poured. After the glass chamber housing the slide rack was moderately shaken, the slide rack was transferred to a chamber containing 0.2% SDS and gently shaken for one minute 3 times. Thereafter, the slide rack was transferred to a glass chamber containing distilled water, moderately shaken for one minute, and centrifuged for 20 minutes to dry.

(7) Plant Material and Isolation of RNA

[0316]As a plant material, use was made of a wild type Arabidopsis thaliana plant body which was seeded on an agar medium and grown for 3 weeks (Yamaguchi-Shinozaki and Shinozaki, 1994) and an Arabidopsis thaliana (Colombian species) plant body into which DREB1A cDNA (Kasuga et al., 1999) connected to a 35S promoter of a cauliflower mosaic virus was introduced. Dehydration- and cold-stress treatments were performed in accordance with the method of Yamaguchi-Shinozaki and Shinozaki (1994). More specifically, dehydration treatment was performed by pulling a plant body out of the agar medium, placing it on a filter, and dried at a temperature of 22° C. and a relative humidity of 60%. The cold treatment was performed by transferring a plant body grown at 22° C. to 4° C. High salt stress treatment was performed by growing a plant body at an aqueous solution containing 250 mM NaCl.

[0317]After wild type plant bodies were exposed to stress-treatment for 2 or 10 hours, a sample was taken from each of plant bodies and stored in cryogenic conditions with liquid nitrogen. Furthermore, wild type and DREB1A overexpression-type transformants cultured in an agar medium without kanamycin were subjected to an experiment for identifying a DREB1A target gene. The DREB1A overexpression-type transformant was not treated with stresses. The total RNA was isolated from the plant body by using ISOGEN (Nippon gene, Tokyo, Japan) and mRNA was isolated and purified by Oligotex-dT30 mRNA purification kit (Takara, Tokyo, Japan).

(8) Fluorescent Labeling of Probe

[0318]Each of the mRNA samples was subjected to a reverse transcription reaction in the presence of Cy3 dUTP or Cy5 dUTP (Amersham Pharmacia). More specifically, the reverse transcription reaction was performed in a total amount of 20 μl of 1× Superscript first-stand buffer (containing 50 mM Tris-HCl, pH 8.3, 75 mM KCl, 3 mM MgCl2, and 20 mM DTT, manufactured by Life Technology), which contained:

1 μg of denatured poly (A)+ which contains 1 ng of λ poly A+RNA-A (TX802, manufactured by Takara Shuzo) serving as an internal standard;50 ng/μl 12 to 18 mer oligo dT primer (manufactured by Life Technology);0.5 mM dATP, 0.5 mM dGDP, 0.5 mM dCTP, and 0.2 mM dTTP;0.1 mM Cy3 dUTP or Cy5 dUTP;100 U of Rnase inhibitor;

10 mM DTT; and

[0319]200 U of Superscript II reverse transcriptase.

[0320]After the reaction solution of the aforementioned composition was incubated at 42° C. for 35 minutes, 200 U of Superscript II reverse transcriptase was added and further incubated at 42° C. for 35 minutes. To this reaction mixture, subsequently, 5 μl of 0.5 M EDTA, 10 μl of 1N NaOH, and 20 μl of distilled water were added, thereby terminating the enzyme reaction taking place in the reaction solution and simultaneously decomposing a template. The reaction solution was then incubated at 65° C. for 1 hour and thereafter neutralized with 1M Tris-HCL (pH 7.5).

[0321]The reaction solution was transferred to a Microcon 30 micro concentrator (manufactured by Amicon). 250 μl of TE buffer was added and centrifuged until the amount of the buffer reached 10 μl. The effluent was discarded. This step was repeated 4 times. Probes contained in the reaction solution were centrifugally collected and several μl of distilled water was added. To the obtained probes, 5.1 μl of 20×SSC, 2 μg/μl of Yeast tRNA, and 4.8 μl of 2% SDS were added. Further, the samples were denatured at 100° C. for 2 minutes, placed at room temperature for 5 minutes, and thereafter used in hybridization.

(9) Microarray Hybridization and Scanning

[0322]A probe was centrifuged for one minute by a benchtop micro centrifuge. To avoid generation of bubbles, the probe was placed at the center of an array and a cover slip was placed thereon. Four drops of 5 μl of 3×SSC were dropped on a slide glass and a chamber was kept at a suitable humidity to prevent the probe from being dried during hybridization. After the slide glass was placed in a cassette for hybridization (THC-1, BM machine) and the cassette was sealed, hybridization treatment was performed at 65° C. for 12 to 16 hours. The slide glass was taken out from the cassette and placed on the slide rack. After the cover slip was carefully removed in solution 1 (2×SSC, 0.03% SDS), the rack was washed while shaking and transferred into solution 2 (1×SSC) to wash for 2 minutes. The rack was further transferred to solution 3 (0.05×SSC), allowed to stand for 2 minutes, and centrifuged at 2500 g for 1 min to dry.

[0323]The microarray was scanned at a resolution of 10 μm per pixel by use of a scanning laser microscope (ScanArray 4000; GSI Lumonics, Watertown, Mass.). As a program for analyzing microarray data, QuantArray, Ver 2.0 (GSI Lumonics) was used. The background fluorescence was obtained through calculation based on fluorescent signals obtained from negative control genes (mouse nicotinic acetylcholine receptor epsilon subunit (nAChRE) gene and mouse glucocorticoid receptor homologous gene). Samples giving a fluorescent signal value of less than 1,000, which is equal to less than twice the background signal value, were not subjected to analysis. The cluster analysis of genes was performed by Genespring (manufactured by Silicon Genetic).

(10) Northern Analysis

[0324]Northern analysis was performed using total RNA, (Yamaguchi-Shinozaki and Shinozaki, 1994). DNA fragments were isolated from an Arabidopsis thaliana full-length cDNA library by a PCR method and used as probes for Northern hybridization.

(11) Determination of Gene Encoding Transcriptional Factor

[0325]Based on the genomic information of Arabidopsis thaliana in a data base (GenBank/EMBL, ABRC), a gene encoding transcriptional factor was analyzed by using the BLAST program for gene analysis.

2. Results

(1) Identification of Stress-Inducible Gene

[0326]Fluorescence-labeled cDNA was prepared by subjecting mRNA isolated from unstressed Arabidopsis thaliana to a reverse transcription reaction in the presence of Cy5-dUTP. A second probe labeled with Cy3-dUTP was prepared from a plant stress with dehydration, cold or high-salt. Both probes were simultaneously hybridized with a cDNA microarray containing about 7,000 Arabidopsis thaliana cDNA clones and pseudo color image was created.

[0327]Genes induced and suppressed by a stress are represented by a red signal and a green signal, respectively. Genes expressed at virtually the same level in both treatments are represented by a yellow signal. The intensity of each spot corresponds to the absolute value of the expression level of each gene. It is shown that a cold-inducible gene (rd29A) is represented by a red signal, whereas a control fragment (an internal control) is represented by a yellow signal.

[0328]As a result of scanning the microarray, 277 genes induced by dehydration treatment, 53 genes induced by cold treatment, and 194 genes induced by high salt treatment were identified. Note that genes whose expression ratio are not less than 5 times as large as that of a control fragment were determined as ones induced by a variety of stresses.

[0329]As a result of analysis using a database, 35 transcriptional factors, which were classified into the following families were identified. Note that RAFL05-21-L12 was not classified into the following families. However, when the nucleic acid base sequence, which was searched by the BLAST X based on amino acid sequence data registered in the GenBank Database, it exhibited E-value of e-100, which means that RAFL05-21-L12 is homologous to a gene encoding a known transcriptional factor, that is, heat shock transcriptional factor-like protein. As a result, RAFL05-21-L12 was identified as a transcriptional factor. In conclusion, 36 types of transcriptional factors were identified in this example.

(1) DREB family: RAFL05-11-M11, RAFL06-11-K21, RAFL05-16-H23, RAFL08-16-D06;(2) ERF family: RAFL08-16-G17, RAFL06-08-H20;(3) Zinc finger family: RAFL07-10-G04, RAFL04-17-D16, RAFL05-19-M20, RAFL08-11-M13, RAFL04-15-K19, RAFL05-11-L01, RAFL05-14-C11, RAFL05-19-G24, RAFL05-20-N02;(4) WRKY family: RAFL05-18-H12, RAFL05-19-E19, RAFL06-10-D22, RAFL06-12-M01;(5) MYB family: RAFL05-14-D24, RAFL05-20-N17, RAFL04-17-F21;(6) bHLH family: RAFL09-12-N16;(7) NAC family: RAFL05-19-105, RAFL05-21-122, RAFL08-11-H20, RAFL05-21-C17, RAFL05-08-D06;(8) Homeo domain family: RAFL05-20-M16, RAFL1-01-J18; RAFL11-09-C20; and(9) bZIP family: RAFL05-18-N16, RAFL11-10-D10, RAFL04-17-N22, RAFL05-09-G15.(3) The Relationship Between Treatment Time with Each Stress and Expression Ratio

[0330]Genes encoding 36 types of stress responsive transcriptional factors isolated as described above were analyzed for the relationship between treatment time with each stress and expression ratio. The results are shown in FIGS. 106 to 162. The correspondence between the names of genes and stress treatment shown in FIGS. 106 to 162 is listed in Table 5.

TABLE-US-00007 TABLE 5 Number of figure Name of gene Type of stress FIG. 106 RAFL08-16-G17 High level salt solution FIG. 107 RAFL05-11-M11 Dehydration FIG. 108 RAFL05-11-M11 High level salt solution FIG. 109 RAFL06-11-K21 High level salt solution FIG. 110 RAFL06-11-K21 Dehydration FIG. 111 RAFL06-08-H20 Dehydration FIG. 112 RAFL06-08-H20 High level salt solution FIG. 113 RAFL05-16-H23 High level salt solution FIG. 114 RAFL05-16-H23 Dehydration FIG. 115 RAFL08-16-D06 Dehydration FIG. 116 RAFL07-10-G04 Dehydration FIG. 117 RAFL04-17-D16 Dehydration FIG. 118 RAFL05-19-M20 Dehydration FIG. 119 RAFL08-11-M13 High level salt solution FIG. 120 RAFL04-15-K19 Dehydration FIG. 121 RAFL04-15-K19 Cold FIG. 122 RAFL05-11-L01 Dehydration FIG. 123 RAFL05-11-L01 High level salt solution FIG. 124 RAFL05-14-C11 Dehydration FIG. 125 RAFL05-19-G24 High level salt solution FIG. 126 RAFL05-19-G24 Dehydration FIG. 127 RAFL05-19-G24 Cold FIG. 128 RAFL05-20-N02 Dehydration FIG. 129 RAFL05-18-H12 Dehydration FIG. 130 RAFL05-18-H12 High level salt solution FIG. 131 RAFL05-19-E19 High level salt solution FIG. 132 RAFL06-10-D22 High level salt solution FIG. 133 RAFL06-12-M01 High level salt solution FIG. 134 RAFL06-12-M01 Dehydration FIG. 135 RAFL05-14-D24 Dehydration FIG. 136 RAFL05-14-D24 High level salt solution FIG. 137 RAFL05-20-N17 Cold FIG. 138 RAFL05-20-N17 Dehydration FIG. 139 RAFL04-17-F21 Dehydration FIG. 140 RAFL09-12-N16 Dehydration FIG. 141 RAFL05-19-I05 Dehydration FIG. 142 RAFL05-19-I05 High level salt solution FIG. 143 RAFL05-21-I22 High level salt solution FIG. 144 RAFL08-11-H20 Dehydration FIG. 145 RAFL08-11-H20 High level salt solution FIG. 146 RAFL05-21-C17 High level salt solution FIG. 147 RAFL05-21-C17 Dehydration FIG. 148 RAFL05-08-D06 High level salt solution FIG. 149 RAFL05-20-M16 Dehydration FIG. 150 RAFL05-20-M16 High level salt solution FIG. 151 RAFL11-01-J18 Dehydration FIG. 152 RAFL11-01-J18 High level salt solution FIG. 153 RAFL11-09-C20 High level salt solution FIG. 154 RAFL05-18-N16 High level salt solution FIG. 155 RAFL11-10-D10 Dehydration FIG. 156 RAFL11-10-D10 High level salt solution FIG. 157 RAFL04-17-N22 Dehydration FIG. 158 RAFL04-17-N22 High level salt solution FIG. 159 RAFL05-09-G15 Dehydration FIG. 160 RAFL05-09-G15 High level salt solution FIG. 161 RAFL05-21-L12 Dehydration FIG. 162 RAFL05-21-L12 High level salt solution

[0331]In FIGS. 106 to 162, the vertical axis shows the expression ratio of a gene, which is calculated as follows:

Expression ratio=[(FI of cDNA molecule under stress)/(FI of cDNA molecule under no stress)]/[(FI of control fragment under stress)/(FI of control fragment under no stress)]

where FI is the intensity of fluorescence.

[0332]As shown in FIGS. 106 to 162, the genes encoding stress responsive transcriptional factors isolated by a method according to the present invention exhibit different profiles; however, it is found that expression is induced by adding each stress.

INDUSTRIAL APPLICABILITY

[0333]A stress responsive promoter and an environmental stress responsive transcriptional factor are provided by the present invention. The promoter of the present invention is useful in that it can be used for breeding of environmental stress resistant plants in a molecular level.

Sequencing Free Text

[0334]SEQ ID NOS: 91, 92, 165 and 166 are synthetic primers.

Sequence CWU 1

16611000DNAArabidopsis thaliana 1gagaatttta gaaaaagaag ttccgtgaat atcgcaaaca ttaaggcaag aacatttgca 60aaaaaaaaag agaaacaaat tcaacatcca gcagaactgc agaagtaatg ggggtgttta 120gggcagaaga caagaacatt agcaaaaaaa aggtaaagat tgactggttc catatgcatt 180catgtcatgc aaaatattga atcgaaccat cacaagcata aacattaagg atttctgtta 240gacaaattaa cgttgtaagt actaaaacta gctagatttg tgattgtcac atatcaactt 300gttctacata aagtttgtaa gagctaaaac tttaagcgtg agctttttgt gaaacaaatg 360aaagttagat attaaccgtt ctcttattct cgagatgatt ttcaatcgat ggttatgcta 420caaagttggc tcgttcaata taatgagctc ttgtgtgttc aacatgaata catgatattt 480ttgtgctcgt taaatactcg gtttttacga aatgaatttt tttaagagtt cgaacctaaa 540acccgaactt gattgaccaa tgtacattca tagtagaata tttgctttgt acctgaatct 600gcgtttatga aactacgtct aaagattgaa ccaaacaaat aaaccgagtt aaaccaaaat 660ctagatccta ccaaatcaaa acgaatgaac attcaattta gaaaccaaaa aaataaaccg 720gacattcctc attagattct ctatattatt cttttgtctg taattgcgtt tgtacaaaac 780cacgtttgcg tttagattgt ttaaagatat ttattgaaaa aagacccgcg tgattaaaat 840gtgtaaagaa agttcctccg ttattaccgt ccaagtggat ctctaagaga ccgtacaact 900aactccactc acggcggcgc attcccaatc tgctgacacg tttctgctta tcactctctc 960tttttcttct cttcttacca cacctggttg actcaaccac 100021000DNAArabidopsis thaliana 2tatggtagaa aaatatgaga gagaatgatt atttacataa tgaaaagtga ttgaagttac 60cttttaaatg taactaaatt acatgagtta caattctaat ttgtttgaat gacaatgagt 120gaagaagaat ataattaata cacaattagt atgatagtta caatttagta ttcaatggtt 180gattaataaa tgtgaataaa tttattgttt ttgtgggtta attggagata aatattattc 240tatttttatt tttgaaaaat tggttgttaa aaatgagacc ttatttttat atatacatga 300ttaaactatt ttgaacttga atccatttgt tttggattta ctcacttttc caatgttact 360tatagtttcg ccagtggaat ataaggttag aaaatatgag agtaaaaatc aggaattagg 420acatgtaaaa tccagaaata tctgaggacg gatctcacga ttgattataa gattcgcatt 480acattttctc ttcatttttt tcagataaga aaattggatt ctatcaattt gtcgtcctat 540tgttcttttc ttgtttgata tacaaatacg agtcactgat tttttttagt tgattggtag 600agtcttaaag gaaacatcaa attcagacag tccttttaca accatttcat ccatgtcggc 660tcattatttc gcaccacaga agaataaccc aaaattatta gaaaacttgg accactataa 720gcacttgaac tgtggtcgtg cactgactaa tgtgagcccg ttttggtaat tctcagctgt 780gtgcgtaatt gatgtcggcg tcaccttcag aaacttcgaa attatgtgtg atacaccgtg 840tttcataaat tcatacacgt cattttagat acaaaattag atattttcat ttgataaaac 900cgaaagaaat aataaacttt ttctttcttg ttctcatttt aagtctctct ttatatcact 960ttctatatat attgaatcat gcaaggaata cacataaaat 100031000DNAArabidopsis thaliana 3tgggtgacca actctttcct acttttctaa cagtttttgg tttgttttgt tcagattcgg 60acaaaatgat gagtgttgtg ttgtgaacag agaactggtc cgtttgtaat atttggggtc 120ggtggtcttt gtgacggtct aggctttggt tgttgaacta cttcctttga ttaaggttta 180tggagttttg ctatatctac acaataatac ataattgttt tgtattattg acttgaccaa 240taacaggtgt cgaaaattct cagttagtac atcatctttt acaagtcttt tgacgttcta 300gaccaaatta tgttgatttt tgaatcaatg gtgagttatt ttagtcatct ggttgccgtg 360ttttgaaact aagaaaagtt gaattatata ccgtaaagaa gaagataata ctaaaaatcc 420gaatacgacc tacaatgaaa ctaagaaaat cagtagccaa gaatgatttt gagtcaatgg 480tactaaatgg caaaaaacat tacaaactaa ataaatgact tttggtggac gtctaataaa 540gtaatagcaa tgaataaaag taaagttttt gcaatagagt ctttcattta attgtaaatg 600gatgtcttgt taaccaaaca atattttgat tgttttcatt gctatgtgct ttattgagag 660gatgagagca catgcatgtc ttgttttgta agctgtacct ttttttgtca taactcaaca 720tgatagtaat ggaatggaat ctctttgatc tcttgttaag ttaccaaaac agaacaaccg 780tgagagagtg gttcatctaa tacaataaca agacatgtat ctaccacaat cctcacatac 840acatgtacgc atcccattgg ccaattcact tccaccaacg accccaatta acaaaaacca 900acatgagtta gttattagta gttgacctat actcacttct tttatcttcc ctccactctc 960tccaatctat aaaaccaaat ccaacaattg cttccttctc 100041000DNAArabidopsis thaliana 4ttgaattgaa tgaagggtgt ggtcggaaga gaagacgtgt agaagagacg agacaagtaa 60atttaagcat tggccccatt tacagccaca agtccgctac aacaaattat ttccaagaaa 120ctctgagata acgtcgtgat gaaacggctc atgctgctgt tgtgattcgt gaattagagg 180tttatctttt gggtttttga atgttactta attggacggt cgatttttca aactgggtgt 240gaaatgtgaa tgggtcattc ataatgggct tttgttttaa tgtgaagcca ttcacacact 300ctttgtcctt cttttctatt attcataact gtcactcttt gttcttcgaa atagtaaaga 360gcaaatcgat tctttgttga tctgggccgt aaaatttcca tggttgtggg aagtattctc 420gcagctgatc tgggccgtca atgctacagt ttcatgtcag agagaggtca agaatcaaca 480cgtggccaac catgatttta aaccaaagca aacacacgat tagaccccac attgtttgtt 540caccaacccc cgtggaccct cctttagccg acgtgtccac gtcaatagtg gtttttcttc 600ctttcaaagt acacaaattc cattctttct cattttactt tttggattac gttgttgtta 660taaactggta aaatgaatta tgaatgcaaa taaatttcat ttaagttttg ttggcttcta 720atattttttt cacctaaaat tctaataaac tacacagcca tgagccatcg tatgaaaaga 780agaagaaaaa aaatgtcttt ttctagaagg atctttcaac gactaaaaaa gattttaagc 840ttttgactaa ttttgtcaat aatatacaca aatttacact caattatagc catcaaatgt 900gtgctatgca gaaacaccaa ttatttcatc acacatacgc atacgttacg tttccaactt 960tctctatata tatatatagt aatacacaca cataaacagc 100051000DNAArabidopsis thaliana 5tttgtcgaat tggtggggtc ttaaggctaa aaaagggttc accacatgtg tatgaatcga 60aatctactca tccgctgtcg ggaccaaact acatatcaac taccacgttt ttctcatatg 120aatattcaaa gctaccgtaa atttttataa aattacgtca ttagcataag aatgtatgta 180catattaatg actattataa accctaaggc acggatacct tatccttcca ccaccgaatt 240tcacggattc catatccttt caccggacta gacgaatatg aagctaagat attaatagat 300tggtgaccaa atttgaatct tttcggataa aacttgtttg gcatactttc ccttagagtc 360aattattctt caaacaatgt ttaacttaaa tctcaagatc ttatgtagtt aatggtagat 420atcgtacagt ttttttcgtg tttgtctttt gatcgtgtgt aaagtgttgg aaaatctaac 480tttgtatata agttctacac ctccaaatta acgttcgagc ttatcggtat tactgctagt 540gctttctcaa attccttttt agatcatata gtagcccact aggaaacctt acaccaaaac 600gcccaaataa tatgaaaagc ccatcatatt gtccatccaa agtaggacaa ttgctgaaaa 660agcccaacga atcaaacctg cattgttacc ttgcacttgg aaaatgaatt aaacggcgac 720gcacctatgg atggagacta agttcactca tcaaaaatct aatttcaact ctaattatcg 780tatttaaggt cattggagat tcaactcttg tacctgtagt ttcttcttct gaaaatcagc 840aacaaactac aaatcatcca atgatagaat ttcagcttta attatcaaac aaattaaaga 900taaccccatt gataatatcg ccacgtctca tcggatgaaa tagtactatc tgacacgaca 960cgaatctctt atgagagaaa cagaagagca cggaatctcc 100061000DNAArabidopsis thaliana 6attcatttct ccgagaagaa tatgtacttt tttactttcg acaaaagaac catattttta 60gcttgggaat cttggattaa atattaggac aatattttta acttgagaat cttggtttaa 120atatgagaac catatttcag aaaaaaaaag tttatatatg tgtaattttg tcggaaccgg 180atttactaac attcgtaggt agtcggattt ggtcccgtaa aataaaatgg taaataaaac 240aaaattaata gtaaagaaaa tgaaaaataa tttacaaaca caaattaaat tgaatacaaa 300ctagctatca taatattttt caagttggtt ttcatgaaca ttttagagca ctaagctata 360taacaaaata ataataatta aacatctttt aatcattaag tatttaaaga tccatacaaa 420tgttggttac ttggtacaac caacaaggca aaggcggtac tgaataagaa tataagatac 480atgaaaaatt gatctactat actttacaaa acgcgttaga tattatataa ttgctactcg 540ttcaattcat ggggatgtag ctcagatggt agagcgctcg cttagcatgc gagaggcacg 600gggatcgata ccccgcatct ccacttttat tgttttcttt ttaaggttct ttttctttta 660atttattttc aatcttctag tcttcaattc tgtttttgtt tctgtcggaa tctcttaaaa 720gtcaatttaa atactttaaa ttccttgcac aacactcaat tttgatactt tgaattcctt 780gcacaacaca acacgtcgct atcaatcaca gacaccacaa acttggacac ttatctttag 840ttctgtctta aactgaatct ccctcttatc aattctgttt ttgtttctgt cggaatatct 900taaaagtcaa ttttaatact ttgaattcct tgcacaacac aacacgtcgc tctcaatcac 960agacaccaca aacttcgaca cttatcttta gttccgtctt 100071000DNAArabidopsis thaliana 7ggtttttata ataaaaagtt gaaatttgtg cccttttcaa gtctttcttc atcatcttca 60ttctcattct catttccatc ctctgcttgc tctcttctca cagccccatc ctctttaatt 120accatctcca atcccctgaa ccaggattta taccattgcg attccgcaat ccaacactac 180tatctctcta gtttcctttc ttcattagga aagagcaaaa gcattctttt atggtaaagt 240ttcgattttt caaattcata cactgtcgga ttcataccct tttcgctccg ggtctggaat 300ttgggttttt gaggtttcta cggagtcaat tcgagtctgc ggaaactcaa tttgatagac 360tcgtttcaaa atgttgctta agggttttct tttgttcatg gattgaagtt ttgcttaaac 420cagttacgag gtttcttggt tcttccatgt atccgtgcgt tatctttgat gacccttcac 480tcaaagtctt cgttttttca tgtcaagttg cttcctagac ttagaaaatt ctcgttttta 540atggattttt gttcatagaa caaacattta acgtttcatg ctttttctgt gagtcacatg 600tttttttttc ttgagctgat taagtctctt tttttgtttg cagccatcaa ttgaattata 660tgtcttttag ccaataattc tgataaacgt caagaggtag gtgagaaaat caaaactctt 720gccttgttat cggttttcga taaccacatc tatcaaacga tgtcatttgc ttgtttgttt 780ctgcttggac attagtttcc aaattattag agagttttag tcacgtagaa ttatgaagtt 840tgtagtatgg cagatggctc gtgagcttgt ttaactttat cttatgaatt tagtggacga 900gaaagaagtt gggttgtttt tggttggaca tttattaatt ttcgtctctt tgtgccatgt 960tccatgaagt atcatttcat catctctagc ttaatctctg 100081000DNAArabidopsis thaliana 8ataaaatttt tagatattta tatttataaa tatgactgca gaagttttaa gtttatgata 60gtttttaaaa agcaattatg atagttagta acaatttaga aacacgtata atttaatttc 120tttgtttatt tttcaaaaat atacaaaaaa tttatgtcag cgattttatt ttattttgta 180tggtatatga ctaatttaaa agtggtacta attagtatta taaaaaatta ttaaaaagta 240tttttaacct tacgactcta tttttttaca agttacaaat gtttcataaa attttattgt 300taaaacacta tgatctaaga tattgaaacc cacggtcaaa gtattgctaa catggtcatt 360acattgaaaa agaaaattaa ttgtctttac tcatgtttat tctatacaaa taaaaatatt 420aaccaaccat cgcactaaca aaatagaaat cttattctaa tcacttaatt gttgacaatt 480aaatcattga aaaatacact taaatgtcaa atattcgttt tgcatacttt tcaatttaaa 540tacatttaaa gttcgacaag ttgcgtttac tatcatagaa aactaaatct cctaccaaag 600cgaaatgaaa ctactaaagc gacaggcagg ttacataacc taacaaatct ccacgtgtca 660attaccaaga gaaaaaaaga gaagataagc ggaacacgtg gtagcacaaa aaagataatg 720tgatttaaat taaaaaacaa aaacaaagac acgtgacgac ctgacgctgc aacatcccac 780cttacaacgt aataaccact gaacataaga cacgtgtacg atcttgtctt tgttttctcg 840atgaaaacca cgtgggtgct caaagtcctt gggtcagagt cttccatgat tccacgtgtc 900gttaatgcac caaacaaggg tactttcggt attttggctt ccgcaaatta gacaaaacag 960ctttttgttt gattgatttt tctcttctct ttttccatct 100091000DNAArabidopsis thaliana 9cgtatatagc agatatagtt aaatctgttt tgtatgttga taaactgctt gatataacga 60attgttatat ggaagattca aaattgatga tcctatgata aagatcaaca tggatacaag 120acacgttttt tcttgcgata gtacttatat ctgacgtcag taatagtcgt ggtggctatg 180tcagaagagg acacatctgt gactcatgtt atgtggtcga gaatgaagtc tcgtgaaatt 240gtatttacac tttgttgtca tgtagggttt gactttactt agtcggcaac gtatatatcc 300gatttatttt attttcttca aactggaagc ttttagacca aaatttaaac taattttctg 360aaccacagaa atccaatctt tttttgtcac tgaaagcggt tataatcata aatgtccaaa 420atgtgatttg tttttgaaat agttaaaagg aagtgaattt ctaattaacc ttaatccacg 480taaaaacttt atatacctca aacaaaatcc cagcacaacg aggttaaaat caaatatagc 540cagcttcaat tttaatttat tctaaaatgt cgaagggtcc ggaaagcagt caaattgtcg 600cacattcata tttacgttaa ttagagaagt cgcttcttac tcacgtttct cgttccaaac 660acaatctttt catgtttctc ttttaatttg accgtcattt tttatgatgt gaaatattaa 720ttgcgcgaat acttcaaacg tacatctgtg atctgtccgt cattttcctt cacgttacga 780taagtttcaa tctaaaaact aacatcgtcg ccttcgttga atcaatgcat ggattcgtcg 840cagcatttat tctttattag acaactggcc cttcaagcga aatgagggat acgtgtatat 900atatgcatcc acataaacaa taagaaaaat ataaccctta tttttggatg ttatataaaa 960ttgctttacc tatatgtgag ctagcaacat ttactcatac 1000101000DNAArabidopsis thaliana 10gggaatctag cagaattttt cttcctaata actattttcg agctttctgt ttttgttctt 60tctttttaaa aaacttatta agttcttatg aataatgact tgtgaagttt gagttcgtct 120ccttcacaag caagttgtat tggtgttttc tactttatga atatgggttt tatataccta 180aagacttgtt atgttattat tcttaaatgt tgctgctatg atgattacta ttatcgattt 240ttactattat atttgttttt tttaacgatt cgttgaaaaa taaacgagat tgacactaat 300gtagcaaaat gtgcaatatt atcttcaaaa cattacaaaa ttcagttgtc ttaatctttt 360tatttccgat gacaaattat ggttacgtaa atagtactat attaaaaaaa aaagaaaaaa 420aaagacgaaa ataaaacaaa aagggagaat ttgagtatta aaccaaaaaa aaagggagaa 480gtggactttt gatcaaaaga agagaacaac ttgcatgagt aaattagttt tagagctgac 540tatttaaaat tttaagaact aaaaagaaag tggtagtata atttaacaaa aggatgatga 600ctcatatgaa taatagtagg caaaaaaaac ttcaaaacca caatttcttg ttttcactta 660gaaccttatt ctgccataat tacgtttagt taaattaaga aaaaattgct taaaacgaca 720gtacaagaag aaggaaacat tttctttcac tcaaaatcct attatcgcgc aacagaggag 780tctaagtgaa ataataccgt gtcgtttgcg gatcagaaat cgagagaacc acgtatgtgt 840cttttatgtc ctggtcaaac ggttatccaa cacgtggatc ccaaaactag ccaccgactg 900cagcaagttt ctcgaaccga tgcacgtgtc ctcgacacgt ggagactccc aagcacgtct 960ctgggttata aatagcaacc caatgctcca atcattttcc 1000111000DNAArabidopsis thaliana 11gtcaacattg atttttcaat ttattaagat tttttttact aatgtaacat agatgaaagt 60gaggttttct cgttacaata aaaaaaaaag aagctacata gccagtaagc cagagaacta 120aacgtttctt gtacatgcat gttgcatgag aactcttcta gtgagagaac tcttggccca 180aacaatcaat tttttgaaat attaaattaa gcttacaaat ttgcaagagt gatgattgct 240atattgcata agaggtattt aacttggatt tttttaaaaa aaaaaacaaa ttgggacgaa 300aatgtagtct ttcttttttt gtgtgtgtaa agattaatat gttagaagaa tagcttatat 360attgctaaaa aaaataaaaa tgaatagctt atatataaaa tagcaaaatc aagaaaacga 420cattactttt aagctgaata gatatgaatt ttttgaaatg ccaaaagtta ttatataata 480gatgaacaaa gtttttttta aaatatagtt ataattttaa aacgtacggg ataaattaaa 540agatcacttt ttcacgtttt ctttttttaa caaaactttt gttagataat ctgaaatatt 600actcattatt tcccatattc tattaagtat ttgtgaataa attgtataca gctaaacaca 660cataaattac aaatttattg atttaatata ctttactact ttggttaaaa agtaaacata 720ttcaaattcc tttatttctt ctactttgga gctgcattaa acaacttcct tgaataatta 780cgtaattaaa aacaaattac caaatattta gtcaaaaaat aatttagtga atatatcaaa 840aataaaaaat aaaggaaagt cggacgatac taacaaaatt attaaaggaa gatgaatttc 900tctttccttg tgcgcaaaat agcaacaagt ctccgcgttt cctctctttc tctctttccc 960ttatttaaac acacttcgtc ttctacctct aaacaatctc 1000121000DNAArabidopsis thaliana 12caacatgttc agaataataa ttcacagctg ggtaaatttt aatttacaat tattgagttt 60tttttttttt ttgtttatgt caaactcaaa actagtgcca cgtggaggaa gctaagccta 120taaacgatta tgaccgacag ttgtcactct agaagtagcg gtccttgtga cggcataagc 180tggtcataac cgcttagtgg tgatgatttg acgaaattgc ccattaagcg actgtgagat 240gtcggtagac tacgatggcc acgttccaca gagagtgaca ctaacaacca aaacctattt 300tattactatt ttttcttttg tcattagtat tttattacta attgtttgaa tatttcctcg 360tggtctatta ttattaaaca ctctagtcct cttattgttc attaaaattg tctgacgctt 420gtctcttcct ttcgttttct actagaaaat tcagctattt tcttaatgtt ttatatctcg 480taaactctta agttgtaaca gtgtttcaat ataatttgaa cacatattga agtttagtcc 540aaaattcttc ttctccagaa tatatccaaa tcgagtcaca atggaaaaca aattctcctt 600tgtaggtaaa tagaaacaaa caataattgt ggcattaaag tgttttaaaa cgagatccca 660ataatttaca tcagtaagat ttcaaaagtg tgtaattaga ataaacaata aacctccaca 720atttggaaga tattttgttt ccatatttgt cgcgtataat ttattatatt gtcaattctc 780gagaattttt aacgttttgt tttgaccagg taacgactgt tccggtaaat atgtaggtcc 840aaaaacttcc cgcggactta actcgttttg aaatccgctg tttggttttc tttttttatt 900accggctcta accggtccta cccaaaacct tgtcatgtcc tctcattcct ccaccgttta 960taagtaggcc cttaactcca attcccaccg tttcttcgct 1000131000DNAArabidopsis thaliana 13tcacgtgtta cggctgagag cttcgaagcc cacgtcattg ctgacatatc ttcattcgta 60cggatcatgt aatctgttaa atttcccaaa acgtttgcta cctaaacccg tgaattattt 120ggaagtcaaa cttcctcttt aatttcggaa atatttacaa attcaggtca atctaatgta 180tttcatacaa cgagtgtaaa tgtattattt tgacaaattt gcgtatcttc tagttgggtt 240gacaatttaa tatactttta ggtcaatttt gcatagtaga gatatggata agaaagtcta 300tctatataca cacgcacatc tacgtattat aacaactgta catcacattc acactatata 360cagaacattc atctaaaaca accgtattta ttgtatccac acgttttact atgggtgtgt 420ggtcctttta attatcagtt actataaatt ataaacaaaa ttccactagc taatcaatat 480gaaattacat ttgagtttag aatagtcaat aggaaacgat ttaaccaaaa taaaataact 540aattcaattt aaatcactgt ttttttttgt atccatcaac aatcatatga accatttgac 600tacattctct ggtcaatttg aattggtatt tactactact aatagatact cttataccaa 660tttagataaa taaacttaat cctgagtacg gactaaagga caatttgtac agtgagttaa 720aaatagaaaa tttacctaac aatacttttt aaatcagtca aacatttttt gactttaata 780agaaataata tttaatattg gccttgtaaa acaagactac tctcacaaca gtcaacacac 840acagctaagc atacgcgtca gcttccggtt taaacaaaaa aaaacacaaa ccgtaatttg 900gtttccgttt cttcctcttg taacgaccgg ttaaataaaa tgctgacgtc accatcttct 960tctatatata tcctacctgg aaccaactct gtatatacgc 1000141000DNAArabidopsis thaliana 14tttatacatt tcacatatta taaatattat aaaacgtttt tccgttaaat attttaaatg 60tttaactaac tataatattt atttttaata aagtataata ttaataaatt cctaagattt 120ctccttttta aaagtaatta gaattgatgg aaatatatta gataaagaaa tgctaataat 180taagaatatt aattagttat aataattaaa tatcaatgac atccactgta aataagtccc 240aacttaagga tttatttact aaaatagctc caaaaatgta tatatagatg ttatcaaaat 300tcgaattaaa aagttgtaaa aagactcgga ggcggtcaaa atcaattttt ttattttctt 360gcataaattt gtgaactcca gagaattaag aatgaccagt gatagagctc cacgtggata 420gattgggttc caaagtggga cggcgttaaa tataaaaagg ccggaagttg tcgaagtagt 480cttttttccc attatttctg ttttcggttc cttattttgt tcctttgtaa taatgagtga 540catcaaaaat atttggggac gaattaacag gtaatgtaaa tagaaagaaa tagaaaattt 600tctttgattt tacaatttgg attcggattg cataagcaat gacatcaaca gtaatacata 660gctgagaggc tgagatgcag tgtctctctc tcagggaaga tgacaaaaag aatctgatat 720acacgtacgt atgttgcgag ttacatattg atatatcaca tgtcataagt cacatgctaa 780cgataaacta ataaactaaa acaattatgt caacatatat cggtgaattg cattatcatc 840gacctcggtc actgttccgc acatcttggt cgccacggaa catagacaat ttttggattg 900tagtccaatt aatgagtccc ccaccaaagc cgtagtaaaa tcgaagtcgt tctctaatcc 960aatcatatat atatccataa cacacaccaa caacacccac 1000151000DNAArabidopsis thaliana 15tgccaaaacc cattcttgga tcctgttgtc accaactgca accattactt ctgcgacaag 60tgtgctctaa aggtaatctt ttccttacaa caagtttcgt aatttctgaa aacacactca 120aaacagattt tccttgttat tttttttcat aatgcagcat cacactgaga acgatacttg 180ttttgtgtgt aacgagccaa ctctagggct tttcgacaca gccgtggaga tcaaggaaag 240aatagaggaa gaacgtgaga aagccagagg tttgttaaaa acgcaaaact ctctgttttg 300acaccaaaaa cctactctta caaaactcgc

tcggtgatgt ctctttgtag ccatggtaaa 360ggaagtgaca gcaatgttag aaaaggcatc gacaatggcg gatgacgcca agggcgtagc 420gcaaaaggtt gtaaaaatgg tggaggagat tgaaacaatg gtggaaaaag tggcggctat 480ggccacaaag gcgggagaaa cggcgacaat ggcggcagat atggtgaaag aagctgagga 540gacgatggaa acagctaaag ctaatatgtc caaagccttt gtggtaatga agtcggtgaa 600ttggaacgtg taaatcgggt caaaacagag ttttattgtg atctcatgct gacatcagct 660acaatcttta tctaataaga tagattctca caagattctt ttatctatct actttttaga 720gaagatgaga tcatacttcg gagatagata ggtgtcgtaa aaattgggaa gcttacttgg 780caggagaaga gagatataca cacgtgctta aagtcaacag ctaatccaaa aaggtagacg 840acaaaccagt taaactaaga cacgtaatct atcttaaaga tttgtcggtt catcgtaaat 900ccgttggtgg attaatatat gtcggtgtct tcgttgattt tcttagccta accaaccaaa 960cataaataaa aatcgaagct ttactttgtt tgcttctatt 1000161000DNAArabidopsis thaliana 16ataatataat tctgaaaata actaataatt tactcttttc aagtaattat tcaaaataaa 60tttagaaaac taattcaatg tatttcctca ttgtcactct tttatttccc tctacaaaat 120tatatatatg aagtagtttt ttttaaaata gagccccata actaacatta ggggtccata 180tatttgcatt gtaatttata aacacgtgta gaacattata attatttaaa ataaaattta 240gaacaataac ttaatgtatt tcctcctcgt tagttttcct ttttttccct ctacaaaatt 300acatatatgg aatagttgtt ctttacttct tttaagtaga gccccttaac taacacatta 360ggggtccata tatttgcatt ataatttata aacacgtgta gaacctggca ctgaagtctg 420atgaatatat atccatttgt tttcacaact accctagctt tttgtatcat cggttacttt 480ttttttatca caagaaattt ctcgattttg attttgattt ttcaatgttg tgaagttgag 540aagttcaaca tgttttggac agatgttttt gaacagtaaa ttagtatttt aaacatgtga 600atctatttgc aaacgtaatg acgtgtcgtc tgaccatttc ttgaatgtta gttaattttt 660tagctctagt tttgattaat ctaccaatta ggtcattcat ttcagtttaa gtcaaatcaa 720caattaggct cgtcctaacg ttttcattta gccaaaacaa aaaaaaagat ttcaacagta 780taaataattc gtccttatca aaaaattaat aaataaaaaa gatatacatc attacttacg 840taattgtttg tattttgata gttttcgata atttctatgt gacggttttt aaaataaaaa 900taaatgttga ataacaaaaa atagtagtac aatagaatac tgaattggtt tgtgtagggt 960caaaccaaca aatatggaaa gaaaagtatt atttagttac 1000171000DNAArabidopsis thaliana 17gccctagcct aaatcaaacc agagataact gtggaaatga gactagagga ttcaataaaa 60acacaaactt ttttggatgt gcaaaaaaat caaaattggt acctcttgtt cctttttaag 120atcttgttct ggcttaatcc agagaaagta agcaatagtt ccggcgacga cccaagaagc 180ggcactctga ccacctctaa gacctcccat ggaattaccg atgaaagatc tcacgtttcc 240aattgatctt ctccagctgc tagccattga ttgatgccca actatacgaa ctggtctaag 300gttcttctgg atctcttctc tgacaattgc ttgactttga tgatgattcg atttggaact 360gaaattgcaa gaaaagtaaa agcaggcgat ttgattttga ttcgcatttg cttggtgaac 420actgccaaag gctaaaccgg tcgccgagac ggttaatctg atgtttggga ttgacacatc 480atgcggctgt tgtttctcca cgtaacattt ggagacccgc tcaaatatat tgtgggccgg 540agaacattgt tttaccatat agggcccata gactttgaat atgtgtaggt aaataaaata 600caatctgtgg aacaaaaatg gcaaaaatat ttgaagtcag caggattggt taacaatttg 660agcacagaaa tacatttggt aacatctgag catatcattc atatcatatc gctgtcgaat 720ttgaaggaaa aaaaaaagac taaggaagtg gatatattgg tgaatgctga tgatgagaaa 780acttattatt atatacgaaa aatttactaa agacagcaat attccaaaat aatgataggg 840aagattctga gatggtgtgt ggtcagtgtg gaggcactac agatggtcaa acagtagcat 900acccacctgg ctatttggac aaggacaagg aagaagaaaa atcgagaacc atcttctcag 960ccgattttag taaccatctt cttgtttgct ccgaacaatc 1000181000DNAArabidopsis thaliana 18aataataata actaattagt aattagtggg tattttaaat accattatgc catttagcta 60aggtaaaaaa tcaattgtta attataacac ccaccaccaa catagattcg tctatggttc 120attcatcgtg tttaagagtt gaaagaaata aattcgccca tcgtgaataa tagatattta 180ttcatctcaa acttataact caaatatatt tcaactatat atcgatcaaa tagtatagct 240tttggttaat gacgaatttt tattcttttg atccataatt cttcttcttc ttttttttct 300gtccggatac atgaaattga tgatacacat ttaaaacaac aattcgaatc agggaacaaa 360ttacagccac aaccagtgat gatagaccat atatatgatt tagtttattt aataaataat 420gtgtaagcaa atttggcttt gctgtaaaaa aagaacacga attttggcaa aagtttctgt 480taggaatctg gttctattct cctctgcaac ctccagtctc tcatgaatct ggttcggatt 540ctctttttcc ttgtttctat ataatttaga tacatggttt tataattcta tcatatgtct 600attttggata tagtatttta aaaatatata tatttttcat aaatggttat ggtctattct 660atgttaatga taatcattag tctttttgtc aactatgttt ttttttccaa caaatttagt 720atgtaaactt ttttttacta ccgttttatt aaatcgacgg ttgatcagat caactccggt 780ataacacaac ataagtttcg ttatcaaaac aaaaacaaaa acagattttt ttttgtcaac 840taccagtgaa gattagtctt acgtgtcaag aaaccggata aaaatatata acgtatttgg 900gcaatcagct aagatattaa ctaacgcgga gttcattatt aaaatggagt aatgatgttt 960tcagttttct atataaatca cgtcgagacc gtagagtctt 1000191000DNAArabidopsis thaliana 19taattttttt ttctttattc aaattatttc aaaatacaat tatatataat ttattttttg 60tttgaacaaa attttacatt tattccgaaa aaaaaaacta tgcaaaagaa aattttaatc 120caacttctca tcatcctcaa aataatgata acaatatatc cgcttcatta gtaactgtaa 180ccttcttcga tcaaacaact catctgcaaa gactagacta tttcatttat ctaacattac 240cgttaacaaa caaaaaacaa aaatttactt gggagtcgaa gaagtcaaat atctagaaga 300atacttcttc ttacaaaatt ataaggaaaa tttcccctac tgcttcgaga aatacctttt 360tccccattca cgttatgtta tgcaacgtgt gaggataacg agcggacaac acagccctta 420tccttatttt attggtcaac caaccccaaa acagatttta agacggagtg tatttctttt 480gtcaactgtt tacatgggtt aaataaaaca gtttcactaa aacttattaa attatcatgt 540catattgtca tgtatataag ctatagcata tatacggatg gaatatcgat taaaccattt 600tctttttcac cattggacca taatcagata atatacatag attgggaacc ccaaatattt 660tcacattttg acatatttag tagcgacatt aaactacttc gtgaaaatac ataggatatt 720atgtcaaatt gtcaacgaaa ctttttatat ttaataagaa aacgaaatta aataataatt 780aacagaaaat actcacaagt gactttcaat gattttgtct ctaacgtttt caatgtttat 840tttttggtct ctgactcttc aaggaaactg gtaaaaccgg tgtcccggta gtcatcactc 900acatctcgtc caccacgtgt cactcttaca cgttcataag ttttcccacc tttctacgtc 960cgattctgta tttatatacc ctaaacccat cgtcttctat 1000201000DNAArabidopsis thaliana 20gaataatact cttatagaga ttttagtaga ttttcttctg cacccatctt cttttttgcc 60actagaaaaa gttacataaa aactgtaagt tgataaataa ataaaactgt aaattccaac 120aattagtttg gattttccac ctgaaatctt tatagactat gtttattcct cacaataaat 180agaagatatg aagaataaca tcacgcatag tcgatatgca cgtgagataa tatgttgaga 240tttagttacc aaaaaagaat gattttgtct gagaaatgaa attcctctag ttgtgaattt 300ttcaaacttg gtcggtcttg tgagttgagt atctttttga ggggaggtgg gaataataat 360aagcaaaaat atcttaaaaa aaagcgaaag ctgagtttga ccaatcaccg acccggaaga 420ttgaaactat aattagtact acgtttggtt gtcatcaaaa tggtcattct tatggttatg 480ggatattatc ctttgccgcg gctagccgac cattacacca acatcatttt tttttgtctc 540cattcaattt gcataaacct cttaaattat agatgttttt tacagtctaa tcttatcttt 600taactttgta agtggtgtcg ccttacgaaa attaaagctt ggaaaataac taaataaagt 660tgaaggatat gaactaatga atgccttcac cgtccaaaaa aacagagaat gccacgtatt 720cgtacggacg ttaatgtttt catctctgcg cgttgtgttt ttatgtgtca atatctctct 780atttttcaaa tgctcatttt cttaactttc ttattggcca atgaattgga gtaaaagcaa 840aggaataacc ccaaaaagat agttttgaat agtcgtcaat agatagatgg gacatacaaa 900ttacaaaagc ggttagcaaa atctttcgtt gaaaaaaata aatatctaat tcttgccctc 960atatataaac ccatgtagag tcgtctctct cttaccaaat 1000211000DNAArabidopsis thaliana 21aatggatgaa aggatagatc atacttacgt ttgattcttg attttgattt tgattttcgt 60tagtttgggt agatgccata attgagagaa tagggtttca atatattgat taggattttc 120tgataagaat aagatatata gggtttgggt ggaagacata attattttag ttaattaaaa 180aaaaaaatta acctaaccgc taccgcccgc aaccgcaaac gcttgcggga aggaactttt 240aaaatatggc gatttcgagc ggtccaaagc ggtatctaac ggtttttatg attggtgtca 300aacgctaaca actgctacca cccgcaaacg cagcgtttgc gggtgacaga gggagaacca 360atcaagctct aagactaatc ggaccaattt agcaacttgt gtgcttcgat gtgttggact 420gggctagatt gggcccaaat agcctgtttt aacattccca aagatccaaa caattccttc 480ttctttcttc cgaacaattc aatgtagatc cgagcgaatc tcctaaatta ctaaaatgca 540caattgtgct cagcttacca aaaaggccca aaagaccaaa acatattctg attttttctc 600tttagataca ataaaagtaa tttatataga ttagaggccc aataaggttc acgtaggccc 660aaacatatta aaagtaactc agaagagatc catcaaaatt ttgattcaca catctgttta 720ggaattaaaa atattatttg atcggttatt cattcctttt catgaaatca tgcaaaaaat 780caaaaatcat ttttttctct agaaactacg tggcgagaaa gcagagcacc agttgtcttc 840ttgctctgat tatctcgttg aaaccgcttt caaagcagag caaaagagac gacaccggag 900cctccactgc tttacttttc ctttaaactg tgactgcttt catttatata ataaaataca 960tacactctca gagtcacatg tactctcctc taacataaac 1000221000DNAArabidopsis thaliana 22aggacacctc attctcatta cataatgtga cgagattgag ctgtcgtttt gttgtaaata 60tatggtcggg ggtcaattat tcatatgcag ttttaggtca tttataaaga ccattaaaaa 120cgtctttaat atttcaacaa tcttataatt caattattaa gaagcatcta gatatggatt 180cacattagag ttcgtattct ttgacagcaa attcacatta gacttttgat atatatattg 240tacatttgta atatagtata cgaaaatatt acttgaaaac tgatgttatg tgttgccatg 300atgccattgg tccatgactt ttccacacga aggccaaagc atactcttat attatatgag 360ttgagtcatt cttttttcct tttgacggca agttgagtaa ttcaattctc ttgtcactta 420ccggtgatat aagagttaat cttataaata gttttctgaa acttaatata ctataacaat 480gtaaaagtcg tcgctttgtt atttgaagtg aaaattaagc aatgttatga tatttttact 540aattaactca atatgaaaaa caaaaatcct cttaactaaa acagaaacat aaaagacgac 600ttagtttttg ctttagatct agactcataa ctcaaaaaac aatttcatta taaacttttg 660tagatcttac aattttaaaa taaaatgtac attaatgttg aaaagcaaaa tcttaaatta 720gtgtatacta ctactttttt tttatcaccg tgatagatca ttagatcctt aacctcaatc 780cctagagcct gcttattgcc tttaagcatt gtgcaatcac taccaaacac actcaaaact 840aaataaatat aatttataac ttatcaaata aaataaatac tatcaagtgt gtatgaaatc 900gatgacaatt tatttattac cttattacca tttgtgtctt gaagtacacg ttaagctatt 960tttatgtgtt ataaaggctt cttcgtcaac caattacgaa 1000231000DNAArabidopsis thaliana 23acatgcatga ttgttaacgt tttctttttg tacaaggaaa aaaaaaaacg tgtacctcta 60gacaagagtt agagagaaat gagaaactgg accaggagat gtgaatggtt tcattcctac 120tttggttaat cattgcaaat gccaataatt atatagagaa cggcgtcaat tttctgcaaa 180gaaatatctt tactggtact tttttgtgct acctataaat gtgttggttt aatctagcca 240tcacacgtat atttctagct atgttagatc tttaatcatt ctcacctaga ctcctttaca 300aaaaccaata tagatgaaac cagtcccacc cctctcattt aagtcatagt atatatttgt 360gtaacgagaa tacagacaaa acaattaaat tcattgtaat catacttgaa aatatagtca 420atgaaattaa ttatcctaca tttttgaaaa cagtgtctat aattattatg tacattgaat 480agatacgtgg gatattttgg attccagttc atatatatgt agattaaaaa taaagagagg 540gcgataaagc ctaaagaaaa tgtacaaatc gtaatgtaat ctagtgatga tttcctctat 600ataggttgat gttgggtaga cttttggtca ccatgatatt tatctatcgt ctataaagta 660caaaactgtg gtactaaatg tgatttatga aagctaatta aaaagaaggc gtgaagaaaa 720tgaaatcgta atagacgacg cgcgtacgag atgagagtag tggaggaaag aagtaagatt 780gagtgaatga taaaatgcaa acgctaccta ctaatatctc cacttgtcat gcaaccagac 840tgagttcgtt ttccttttcg agtcttattt ttttgttttt tattctactc aacacgttac 900acgcttctaa taaactctaa acattaaaat caaaatattt tgactacaat ggttattttg 960gagctatata taaaccacct gagcctcctc agtttcctcc 1000241000DNAArabidopsis thaliana 24catacatata ttacgatgca aacaccgtat tttagagcta cccataaaat tgcttacaca 60aacaatagct gtctgttttc acaaatttat aactccaaac cccacaatta caataaccaa 120gagacactta accagatttc ggtttacact agaattatat ccccattggt catctggctc 180ggtccggtag agttccccag gcacaccggt tacatacatg tatttcggcg gaggtggaga 240tgaagatgac ttgctgcagt atgtctttct tggtggtggt ggccgcaacg gagttgtaat 300tccggtatgt cctccgaggt aagaatcgga attggtttga ttagtgagat agtgtaacac 360gaggaacatt gagaaaaaga ctagaatcag aggtttaagt ttcatcatga tcatgattat 420tatgattaat gggaaatgtc ttttgatttc aagatttgca aaaaataaca catataaagc 480atacatatat aaacgaaatg catgtgcatg taaagtatat gcagcgtcat atctctacat 540cacctatatg catttggtgg agaccagatc atacacatta aaatgcaaaa ttgtaaggtt 600cttcagtttt gaataagtct tagattgacg aggtcaattt tttctggtta ccattagtcc 660tatataaggt tataggattt ttgtaatgga ctggcctggt tgcttttgaa tccggcctgt 720tttaaggccc atctcttttt cagtgaagaa gctaaaaaaa gagttttaat ttcgtttgaa 780aattttcggg agcataaaga actatcgaag caacacgtat ccaccttaaa ccacatgtat 840actctagaac acaacacgtg ttaaagtgaa agatgagaat aatacggacc gaccctacgt 900acttgataaa aacaattaag aagaagaatt taaaaagaga gtgaaatgat tcgcttgagt 960ctctatataa acatcacaat ccaactctcc atcatcttcc 1000251000DNAArabidopsis thaliana 25aggtagtgat aaatttatta agataagttt gtttgtaatt tattataaat cgtcatttgc 60tccttgcaca agtggtaatg tgtctatggc cttggcaagg tcaaaggatt ccaaattcgt 120ctttgcaata ttgttctttt tttgtttaat gtccaaaaca aacatagttg cattactcta 180acatatttag cagcacaaat atacaatttt gtactgatta aaacacaaaa tatttttctg 240attttcataa actaactcaa acttggaaaa tgatctttta attcttaact caaacaattt 300tgacatgact gatcgttttt ttggatacgt aaatattatg atagaagtaa taagaaatgt 360gagaaatttt tagttttaag aagatgaaag acctagtaaa tagatgttca aactttgaat 420acatcaaaca gatgagttaa tgctagtgag caaaattcca tagtcaagtc ttccaaaaaa 480aagtcaaaac aatcatgtta taaaagtttg taaccaatat ttacatttaa cacatttata 540aaatgtgaca tcaaaaacgc ggctataaaa taacgttacg atattaaaaa tcaaaggact 600aataaaactc agaatttaat ttggggaaag aaaaataaac cctgcaagat ttgaagtgtt 660cctcctaatc aaccgtaatt tagaaatggt ccttaatgga cggtggtcgt cgtttcacac 720tctccgtcca ccattttttt tttttgtttc tcctcctcta aaaagcaatg gtctttcaag 780ctccatcaat ggctactact acttagcgtc tacgaataat aaccccttta ccataagtcc 840atagatttat ctctcttctt ctcaccactc atgtcccttt ctttgtcttc ttcctctcac 900cgccttcaaa gttttcatct ttatccaaat aaaaaaatcg tttctttttt agtttcttct 960taacatattt aaaggcgtac cctttctctt acttatcctc 1000261000DNAArabidopsis thaliana 26cactaacgcc gtttactttc ccgcttggac gataacaccc ttgatatatg gattagtctc 60tcttttgctt ttggatatct taatatattt taatatgtcc atttttatat tgacgaccat 120ttaattaggt ctcagtttgt attattgatt tttaggtgcc aagcattact ttcgttacgt 180gtaatctttt gattattcaa tcaaattatc atatggttca tggcatttta tcaattgtga 240cggtcgacat gtgggttgac tttgatgtat gacttttagc ttgtgcctca tttttattcc 300aactcgacaa ctagcagact ttatatttta atttatttct ttctagatat tagacaattc 360ttagcttgtc tctctgcatg aatcacgcgt taatatactt ataatagtaa gttttaaaat 420gtttcgagta taatatcata ttagatttca attataattg taagattatt tagtctcaat 480gaagagcatt taaaaatgtg aaagagttta ggctttgact gttgactggt tctaacgcgt 540tttgcttctt atttatacaa aaagatttta ttttatttta attagaaata ttaagtactt 600tttagacgtg tttaggttga ttaatttctt gttaacatat ccgataaaaa aaaacagcac 660caaaatcacg acgagaatta tgcctacgtt acgttgataa taaagattag agataaaatg 720taacctctct tctctttaca tcattacatg catcttgtgt tttatttgac tttattctta 780ccaacaaaca accaatggat aaattaggct cgtgaaggag ggcaaactcg taatttcaaa 840gaaacggaaa gcgtcagaac gtggagaaca gtgaaccgag acacattccc gatttttcta 900aagaaacaaa ataaaacacg tgttttttat accataagta aagtagtagt atttattaaa 960ctattaatat atagctttgt tatattggag gtaggaagaa 1000271000DNAArabidopsis thaliana 27aaataaagtt ggacaaagaa aagaaaaaag tgtttggcat ttaataaaac gtctcattca 60ttgcgaagag attagatagt agagaggtca aattcaatgt cgcagccgac aatagataag 120aagagataag gtcttcgtta tatttttttg tttgtttttt aactgtcacg tgaaactgat 180atccacaaag agcgaatgag agacgtagaa gagtcaaaga ttaaaaaccc aacgaacttt 240gactcatcct ttgaaggtgt tcgttaatta tccatttttt ggtctgactc gtggcatgtg 300gcaactaccg accttagata agcctggtcc tactctcgta ttcgatcacc acgtgtcgat 360cggcttatcc gacacctcga gtgggagttg tatgttagta actagataac ggtcataggt 420acgattatga cattgacatg aaatcatatt ccaactatca acgttagtgt ccttgttttt 480atcccctgta attcagtcaa ttaagccatc gtaccaggtg agtctttgat attgttgttg 540tctacgaaaa accattagat gatctctaat tgatatttga ttcaacctat ggtaaaatta 600tcccaaaact caaatattac ttcaattgat atcatcccaa atattaccta gagaggatca 660agctttttaa tcgtcaattt tggttataca aaacgataaa aaaaaaattg taagccaaaa 720ataaaaagta aaacgaaatt gtgaattttt aataattctt ttgcataata cacaaaagaa 780aaaaaactca tactccacat gtcaagtgat gacacaataa atgtctaaat ttttacaatc 840aaaaacaaaa aaatgtataa aaaattcgtg taaccttttt tttttgttgt ctaaaaaaat 900gacatgattt tggtaaatag ccaacaaatt tgtagtagag tagtaaagtt aggtttcatc 960atccatctct ataaattctc aagaccgacc tatacatttt 1000281000DNAArabidopsis thaliana 28tcaatcatta atgtatttat gatttgatcg aataacaata atatagctta ttttgttcta 60ttctatcgta tttgattctt ctttcgtttt ttttttgttt gacttaagaa accgattgtt 120tatagtagta aacatttgtt tttaatgttg ctcgattcca gtgcacatgt ccaggctaga 180cacttgtcgt tataaaggtt gctttggttc aatattgatc cactagagat gttacaacta 240ttgttgacat ctgagattgt gtgataagaa aatatgaaac tggatttagt gaaagttaca 300atatataatc atacatcata gataggaaat aaggaaatgt cagatatact tgaagaatac 360atcaaataga caaggtcctt tttcttattg tcgactatta tagagccgta cagaaccttt 420tcacgtcttt agtaattagt acattctcca tttcggctct ctcttatttt ttttccatct 480cttttacttc tccaaataat aacaataaaa gcttcgattt tgtgtgtgtt tgtatttaca 540tcttgacatc gatattcttt tcatcaattt tttaccaaaa atgtaataaa aacaaaaaaa 600aaccaacgct gaacacagac atggtttctc catccgttta tattcatcgt ttgtatgttt 660acttaacaac ttatttcaaa atagtacata tcatggttgt gtttttaaaa aaagtataca 720gaacagaaaa gcacatggta gacaaaataa tgaagccaaa attaatacaa agaagaagtt 780caacttgtat ttattaacac attttctttc cttgtcaaag acatgcaaat tggttttgtt 840ttcttattcc catttttttt ttataataaa aagaagaaga gtaaaacaaa aaaactatca 900tttcttctta tcgcaaaact cttatctaag caagaaaccg acaaaaccta tatctacata 960tattctcatc aacatctctt gagacatatt cattttggtt 1000291000DNAArabidopsis thaliana 29cttttgtctt tttccagcct ttgtttacgt ggaatctcgg tttctaggac ccactacact 60ccttaaataa gaggtgcata attttatatt taacaaatac aattagcatt agattagtcc 120aaatatccgg agtgattttt tttggttttt agaatattta tttttttatt gtaaattata 180aaaaatgtta tatataattt ttttatagat aattttttta attttattag atttatgata 240tacaatgtat tcttttttta tgttataaat atttttaata aaatttgttt ttaataaaat 300tttggtgaac gattataatt atattttgtt gtaaattttt tatacattta tttttattaa 360tctgtaaatt tgtaataaat tataatattt gaaattcgaa ttttaaattt tttggtataa 420aattattaat aaacttttaa tcaattaata gatcaaaaaa atttatgtcc aaattaatta 480attaaaattt ttttaaccta cactattgtt taatgttatc acaaattata aatttttaaa 540ttaattattt tgtttggtcg gcaaattaag atattattat caaaaacatt tttttttttt 600tcgcaacaca ttagttacta aatgaactat taagttcact tatcttatcc aatttgtgtc 660taccctacat aaaccttgta cttatcccta

aatcacttta gataaattgt tgaaaattta 720atttataaat tttgtattta ccaaattagg aaaacaaata tctgaaaata tttattttta 780atatcttaac aactcgaaga actgagaaac gcgacaaaac caatcgtcct cttccgatag 840ccacaaaaca aaaatcagac aagaagaaag aagaacgttt cttctaacag atagagatta 900caatcaaatt gactcttaat ttctcaattc cgtatctctc atttcatctt cttcttcttc 960tcctttactt aaggatctct ggtttctctt tctctcctct 1000301000DNAArabidopsis thaliana 30ttagcaataa tatataaccc aaaaacttat ctctgaacaa attaattagg aagtaacgca 60caaaaagaaa tgcacaaaaa taaatgtcag aagtgggatt tgaacccacg ccctctttcg 120aagaccagaa cttgagtctg gcgccttaga ccactcggcc atcctgactt tttgttaaac 180ttatacgtat atagtgtaaa taatcgtata attttcaata tccataccaa tcttcagact 240cttttttttg gtgacaaagt gtatgtatta ttttcagaag ctatatgttt ttcctataaa 300acatccaaga ctggcccata catattttta agaaacccta tagtgtgtat atgacttcaa 360aatttcgaat ggttagtttt tctttcgagg accaaaacaa aaagcccatt caatcactag 420aaaaatatca ctagtcaatc aatagaccaa aagattgaaa gtaggatata tttgtttaat 480aatgcctacg attctgcgaa gacaggagaa gccatacctt tcaatctaag ccgtcaactt 540gttcccttac gtgggatcct attatacaat ccaacggttc taaatgagcc acgccttcca 600gatctaacac agtcatgctt tctacagtct gcaccccttt tttttttagt gttttatcta 660cattttttcc tttgtgttta attttgtgcc aacatctata acttacccct ataaaaatat 720tcaattatca cagaataccc acaatcgaaa acaaaattta ccggaataat ttaattaaag 780ctggactata atgacaattc cgaaactatc aaggaataaa ttaaagaaac taaaaaacta 840aagggcatta gagtaaagaa gcggcaacat cagaattaaa aaactgccga aaaaccaacc 900tagtagccgt ttatatgaca acacgtacgc aaagtctcgg taatgactca tcagttttca 960tgtgcaaaca tattaccccc atgaaataaa aaagcagaga 1000311000DNAArabidopsis thaliana 31aagtgagtat aggtcaacta ctaataacta actcatgttg gtttttgtta attggggtcg 60ttggtggaag tgaattggcc aatgggatgc gtacatgtgt atgtgaggat tgtggtagat 120acatgtcttg ttattgtatt agatgaacca ctctctcacg gttgttctgt tttggtaact 180taacaagaga tcaaagagat tccattccat tactatcatg ttgagttatg acaaaaaaag 240gtacagctta caaaacaaga catgcatgtg ctctcatcct ctcaataaag cacatagcaa 300tgaaaacaat caaaatattg tttggttaac aagacatcca tttacaatta aatgaaagac 360tctattgcaa aaactttact tttattcatt gctattactt tattagacgt ccaccaaaag 420tcatttattt agtttgtaat gttttttgcc atttagtacc attgactcaa aatcattctt 480ggctactgat tttcttagtt tcattgtagg tcgtattcgg atttttagta ttatcttctt 540ctttacggta tataattcaa cttttcttag tttcaaaaca cggcaaccag atgactaaaa 600taactcacca ttgattcaaa aatcaacata atttggtcta gaacgtcaaa agacttgtaa 660aagatgatgt actaactgag aattttcgac acctgttatt ggtcaagtca ataatacaaa 720acaattatgt attattgtgt agatatagca aaactccata aaccttaatc aaaggaagta 780gttcaacaac caaagcctag accgtcacaa agaccaccga ccccaaatat tacaaacgga 840ccagttctct gttcacaaca caacactcat cattttgtcc gaatctgaac aaaacaaacc 900aaaaactgtt agaaaagtag gaaagagttg gtcacccaaa ttaagccacc tttgcttcct 960caattcctta tttataccat caaaagactc cacaatatcc 1000321000DNAArabidopsis thaliana 32aaaacatcaa atctaagtac aatgatagac gacaacacag ttcctgcaac catggaactc 60aaattgaata tgcaatgact cttggttaaa gacagagcct ttgggatatt acatacataa 120acatataact taagcactaa ttcatgaatt tacatgttat atatgtatga tgacgaagtt 180ccaattccaa gaaagatcaa aaactacaaa gttacctctt ttacttctgg cggagtcgtt 240gacggagatt cactccggct cgttcttgct gcgacggtcg aaaaccacag aaaattgtcg 300caggtgggtc tggttagtgt aaaaaccgat aaaggaggga ttttgacatt ttgttttgga 360ggaggaagac gaggagattt cttgaaccga atgtcccggt ttttctaaaa ccacttctca 420gactcccggt tcatattaac cagcgcgtgc gtgtcgtctg gttttcatca tgaccaacgg 480tcaagatcaa aaggatcgcc ttttttagta gatggtgttc gatgaaatgc tcgttagaga 540cgttaaagtt gtagtgacaa caaattagtg tttatcattt tcagaatcgg tataatggaa 600atgtcagttt cttcattcat ttgaaattgt gacaagatcc aactccatca tttcaccacg 660caataactaa actgttaggg acaaaacgca acatgaatga atcaatgtac acaagtgtga 720gaacctgata ttcctatcat tgaattgtat ggaacatgcc aaacttagag agtaaagtgt 780acgtgttcta ttttgtaatg caaatatgat gcaattcaaa tcaaacacga gtttatctcc 840tttggcacgt ttctcaatat agattgctgc accagaaaaa gacaattttt tctgtttgaa 900gcctcttttg acatgaagcg aaatgcggca agagaaccaa gaagacaata gcatttttgc 960ttttctttgg ttaccactct ctaacaagat aaagaaatgt 1000331000DNAArabidopsis thaliana 33ccatgttagc taaattgatc catttggaat tttggattct atgtatttag ttcattatca 60atgattcgtt ccaagagaaa aaaaaaaaca cttaatagat tattagtttt ttttgttttg 120aacaccaaaa tttttagaaa ttactcaaca gaaataattt atcttttgtt tttttagtcg 180acccaaactt gtgcgtttta gtctttctct taatctttgg aaaaaaatat ctgcagtgtt 240tttatatcta ctcaaaagcc caactttcaa agttctaaaa ttcaaagccc aaagcctaac 300agttaatcca gttaaaacta agttcggaat ccatgtaagg cttaatggtc cggtcccgtc 360ccggtctcaa aacggtagtc gtgactcgtg atatcttgca gaatccgtaa ttccgtatac 420ttgacgaggt gctgtaagta ttagatgtta tcgtgatgtt tctgctcttc ttcgatcgcc 480acatcaaaat tataccatat ttttgaatct aaagattgtt acgaatgcat aagcatatat 540gtaagaaaac aataatcacc tatatttatc aaaattaaag atctatcgat tgatctttct 600gacttgcaaa gagagactcc acataaatgt aaacacggtt aaaatacttg aatacctcaa 660tcattgtcgt gccacgccga catagtcaat tagtcataac ttgaccctcc aaaaaagcct 720aacgtaagag agtttcacaa acttatttcc aattatctta agccacgtgt ttagtgacct 780ttgtgctggc cctgaaacgt gtctatctta tcgacccctc tttaaactca aaactcaaaa 840ataataatta aatagtaaca ggccgacata ataacccaaa aatatctata acgattaaaa 900atagtaacac actgacccta cgtggaaaca agaaagatat taaaaaaact catatgattc 960gtctctataa aagatccaac tctcacaaat caaacaaatc 1000341000DNAArabidopsis thaliana 34ttatccaatc cttttcaggc cattgactta ttttctaatt aacctaatta tatatataag 60aaaaagctaa ggcatcttat atttatatat acatatatca attgattgct aataatgttt 120tgattaactc tgcatatggc atgttttcca gcgatttaag ttcacaaaga agattttcgt 180ttttgtcatc actaaaaatt atctagttct ttctactttt tcagcgtaca ataatctttt 240tttctatcaa ttggtacgtt ttttattttc tcaattttaa ttttgttttc taggtacatt 300tttaatcgaa ttgcaccaaa atatccaaac aaatcaataa aacaggcaga atgaggtatt 360accgtattag ttaaacgaga agtggctact ttcctcagcg tgattaaaat gcttatgggt 420tcagtttcat aattctcata tacggaacta cgcatttgac ctcatgctct ctataaatat 480taagatctga tgttgagagc ccattagaac taagtttacg aatgagccca tgatagttta 540acttgggcct aattagtgtc taatctgctt ctaacgttac cgagtcaaac gcggtctaaa 600ctaatttgta gagtagtgta tttgtccaaa gcgtacccac acaattgctg agaaacttca 660aaccctaagt gtgtgtctag tattttgcaa tatacgacgc ataaagaaat atcctaacat 720tgacaacaac aaaaaaaaat cttagcggag atatagacgt gtgagacaag agctcacacg 780tgttgcacat tcgagacttg ttgataatga cgacaacaag aggttctaga atattcgtgt 840tggtttcatt attattagca aataagacca agtcaacaaa ctaacttaaa cattgtggat 900cgtccacttg tccgactctt acgtcaaacc catcattgtt ccacatatct ccacgcgtca 960atctctctat aaatacgaac aaatgaacac gtatcaattc 1000351000DNAArabidopsis thaliana 35catctggtac ggatcttcta gtttctatca ataatcagag tacacaagaa ctcatcatct 60cgaatatgta ttaacgtcta tattgtatgt ttcaattcag aaatcagtaa actctggtat 120tttgaagatg aaccggtccc taagtctcac atggataaat ttacatcctg gtacgagaat 180taccgaagtg aaatgtccca agcgatgatg gaaactgata aggttaagag gaatcaatta 240accaatgaag ttatccagag gtacaagcag gtaagctctc gctcaatcca atgcaaagat 300ctgaccaata gatttcatga atcacatttt ttgttgatta gtccatataa tgcttcttaa 360cacaactcat cggttgaacg gtttcttctg tctgtatcag gatttctatg gcgctgcagg 420gtttgaagac agcaataaaa gtctggaaga actctacccc caagccttgg cactctacaa 480cgtcgtttac gattatgcca ttcaggaagg tgttgcgaaa tgtacatttg cctggaatgt 540tgcaggaccg gtcctgtgca aattttacct taagaaaacg aaggataaat cagtagtggc 600ttcaacgtct gtgcttaaaa agcttttggg ttgaacagac ttacttgtcc tgtttgttgt 660cttcatgtat cataagatgc gttagacaag tatctgacct aggtaaccga acttataggc 720gaagctatgt ggtgtacttc ctttgtttaa gttataactt aagatttgtc atctagtgta 780caagtaattc cgtgtgtatg tttgcttatg gaataaataa agaaaactaa tgcttatatt 840taatttcaat taattgtacg tggcactttc agagtccatt tggtgtacaa agctgtcttc 900ttcagtgtga cacctctctg catcctcaag cttccatttg tcttttccag acctttcttc 960tttccattaa gtttcttcct ttgtgatcct agaaaaatct 1000361000DNAArabidopsis thaliana 36cacataatct ctctctacta cccacaggta tattagagaa actacatata acatgcacat 60aagtctttat tagttctcaa gcctgtcatc ttgttcaggt gtttgaaacg ctggttagca 120agatgacccg gtcgcaagat tcagttgtga gagcagcatg ctcatcagcc tttgggttac 180tgctaagatc aagcaagtca acactgtgga gaggagctcg acttgacggg accgactcag 240gtagaaaagc caatgatctc gaatctgtta agaaataaca tgtgcattac ggtctgtgtg 300aataccaacc agttactgcc gaaaactttg ggggggtttg cgatatatat ccataggtac 360agtgtgcaat gagaaaccat ccaggttttt gtgcttgttg taaagtgtat tatgttttca 420tcttgtaaag gatattgtca ctcgtgtgtc atctctatta ccaaactcag gaaagaaaca 480ttcacacgaa attgttgtag gctgtggtcc tctttttatt gattataatt tttgaggcta 540gtgttggata tacttgtctt gacagagtcc aacgtttgat ttatcttcag ttagtgtttc 600atatgagagt ttaaacggtc tttgaagttt caaacacaca ttagattttg gtaacattgg 660taatttattt atgtggcctc tatgtttatt ttgatttttg agacttgacg atagatgtag 720ctatcactat cagtgagccc tccaagttgt tgttttgtgt atgtgaatta tcttcgtttt 780ctttatgaag atatgtttct aaacttttcc tgagaaggaa ggtcactacg atggtcacat 840acggacaaac taagtagcaa gcacaatctg tggacttaaa aagaggtgtt tggcaagtaa 900aaaactgttt acgtcatctg ttacgtcttc tgttcacttt atgttttact ctccacgcat 960cttatccttt ataagctcgc acaaatctta accaaaacca 1000371000DNAArabidopsis thaliana 37atacccaata tatctaaact ttaaaactta atttggtcca acaaacgtca atcgatacaa 60tcatgtgttc ttctagctta tggattctat ttgacgacat catgaagagt ttgtacaaat 120caattaattt ggatattaca taaactaaaa atatgttgtt tttttttacg gatgatcttt 180taataactga agatgaaaat taattttagt tagaaatgtc ttgaataata attaatgtca 240cagaaaagtc agaacgtacc aaagatttcc gcgtgtatta aacttaataa atagacacaa 300gccaagtctt agcagtgtac tactactaac acacatccac atcaatgtgt catctttgtt 360gtatgacaca agttagtatg acaatcagtg tttgagtagt gatggtttaa atggaatgtt 420tgagaatcca atacaattgg cggtctgcta atggccagaa tatctgccaa agttattttt 480tatctactgc ttataaatcg tacaacaatc tatttagttc tctttagttt tgaccttgaa 540gtccatgaat cttaaggacc tcttctccac atcattcaaa ttcacatatc ggaaacacag 600tactgcttga atgcccctcc catacaatat gtctatccac tgatccacaa taactcgatg 660gtgcatgtgt gtgtgtgtgt gtgtgtgtgt gtgtgaactt cctggctgta ataaatttaa 720taaatactta aatagacagg cgaagtcctc gtccactttt ctacacctag catcaatcat 780caatgtgcca tcttattgtt tcaatttttc ttaccataac tttgtccttg catgtaactt 840ttccaattaa atacaagtct gtcttttatt aacccccgag aatttactaa atgatattga 900cataatctat tggacggaat gagttgtcaa aaaacgagag ctgtcttttc ctgctccact 960gtcctcttat atatcccaca acaatctttg ctacattttc 1000381000DNAArabidopsis thaliana 38taccttgtag agttccaaaa cattgtcaca aaatatttat aaagaattta ttttaactaa 60ttaggtcgtt aattgtccaa gggtttttca tagttgatat agttctgttc aaatatagcc 120atccttaatc gattcatggg atcgtaaatt actacttcga gtgttgtaaa aaaaaatgaa 180acttctacat tacaaactcg aatttaatgc atctggagtg atactataaa agtagggatg 240ctctcaggtc gcatttgaga gacacagaaa tgattttaat ggaattaata tattttcagt 300ttttcacaaa aaaaaattgt gtttataaca actgcagatt caatgctgat tttatgagtc 360tcacctatag aatttatatt tctatattca tagaggcagt ataggtgttg acccaacatc 420gaaagaacac ttcgtaaaaa attctttgga acaaggctga aaatttactc ccaaatttag 480ctatccgatg aagataaatc atttaccgtt tattaaagaa ttatcgagat tttagtccaa 540accaaaagag attatgagcc taagattttg aatttgtatt ggtaaaagaa attgaacgaa 600aatttcagaa aaaaatatta ataaattgaa cgatagagtt cacttactac atagtcaact 660agtgcctagc tataatagtt tcaaaagaca aaaaaaaaca aaatcggtta actacttccg 720tgacataatt ctcattttga tttttgaatc cagtctaatt tgaaaagtat attcaaaatc 780tttaaatcca ttaatgataa cttttataat acgttgacac acgcaattgt atatacaata 840ttcttgaatt ttaaatgtaa attctagaat atattgcgat caccacacta atcaaaatct 900ttgggacaac ttgaacccac atttgacttt tcttggtcaa atattttggc atcatgcatg 960atcttctcta taaaaaccaa aaggcctcaa cgacattcat 1000391000DNAArabidopsis thaliana 39ttcactgatt tggaccaaat cgaacgccag gtcattctgg tgagaaaatc ttaaggctat 60ttatgccttt tctttcttct attcattcag gagattacac gtggcattac ttgtgtggtt 120cgcatgtcgt tgtttctaca ttgcaaattg tcacaaattg gtccataatg agtgtttcta 180gtacctttcc ctttgtctgt gtttatttat gaataataat tcatgaatag aataaactat 240ttttccaaaa atattcatat atgtccgaaa actcacatcc atattgtaaa tttaatcgaa 300tctaatatat tttccaaata atatattagt ttttgccaaa atttaaacag aaaatcctta 360atacagtttc gaaaattctt tttctttttc tgttgaaatc atagtcaaat tcatttattt 420gtagatatat atatatagtc aaattttctt tttatacaaa ttgataagaa aataaatgtt 480aacctacaaa attaatgtat tatgagctgt caaattatta ttttgacttt atgagttgtc 540atattataca tagcatatca ttcttaagaa aaacaaaaac aaaaacaaaa acaaaaaaat 600tatcatcaat gatccgcttc ataaagtaac caccaacaca attattttgg acgcctcaaa 660tccaaacctt attttcaata attgtgaaaa aaaagaaaaa aaaaagctat attgtggtag 720gtggttcgaa ccactcaaaa atatttatga taataacacc caaaaacaaa ctaaactatt 780atttatttat tgtggtcatg agatagttac agacacagag aatacatctg tcagaaagct 840gactcattgt atcttatctt tcccatttcc tcttaccttt tttcctacgt gttgccacaa 900aactctcatc ttcacccact caatctccgc cacgtgtact ctctaaccct tcagttcggt 960ctttttaagg cttccccaag ccttcgcatc caaatctctc 1000401000DNAArabidopsis thaliana 40tcatcagaat taaaaaaaaa aagtcagtct aaaaatatac cctaataaac tgacatacct 60atttcaacga tacaattcta taatgagtta ttaaattgat ttgatttcgt attctctaaa 120atatactcat agacatagca gaaactagtt aatttgctta tatgttcata ttcttgtaca 180tcatgtgatc tcacgtggaa taaaaataat gtgaccatct ctaattctct acttcactta 240gtatctaagt atccactgtg ataccttgtt ttctgctcaa agttgtacat gcgtagatta 300ataaatacat ttcacccctt caattgatcc caaaagcctc tacaagacag ccaaaacaaa 360tcaaggattg cgcaaagcat tcaagtttcg aaacaataat ttttttaatg gttgacaaaa 420agcaataatt tgtaaagcga ttcataggtc agcttctaaa tgttctcgaa ataactcgaa 480ccaacaccta aaccatttgt tctaacgata atagacgaac gtattagaga ctcgttagga 540aaacactact tgtatatatt cacttttaca tacatatttg tttccttcat atatttattt 600atttacataa caaaccaata tgaaaaacgt acgtttcatg aaagagaacg ttgtagaaca 660ctatcaccgt ttgcttgtaa gaaaaaggct tatctcacct ataaactcgt gagtcatatt 720tatactctta acaataataa ataaaataaa actctatttt ggtaaccaca aaaaatattc 780tgaggctaaa agcgcgtctg ctggaaagtt cacgcgctct gctaacccca ccgtccattc 840tgactctttt agatcttaac cgtgtcttta tccacatcat acgtttatac acgtgtcaca 900cgatcaccta ttctataaaa aacattattt taatcttatc ctttttatgc ttcccctata 960aattatcttc tcttcctcct tctccgatct ctcctccgtc 1000411000DNAArabidopsis thaliana 41gactaattgt cagtattcta acaacgttaa tttcgtggtg cacgaaaaca aaatgacgat 60gtggttgcac aaggtccacg ttcttgtata ttgtttttta atgtgctaat tgtgcgctaa 120aataaattta attgtataaa attagtatag tttatatttt tttaatatgt atatgttcat 180tcaaaacatt tggataatat atgtttacta tgtgatagac tgatggtgga acattttatc 240catcttatat gcatatgaat aaaaattatt caaaatgttt gttgttgttc agattattaa 300ttattaaaga attagaaaat tttggaatta caggtatata tgtatgtgtt gtatattgta 360tatttaaatt attatgtata tttatattta aatttttcta ttttcataaa tcgttagtat 420ataattgtgt cacatctaat attattagaa gacaagttgg acttttcata ataaaataaa 480acatatcttt tatcaaatta accttttcct ttaggtgccc aaatccgtag caccatcaaa 540tacctaagat atacttttta tttccaaaaa tcctttcgtt tgatctaata ttaataatat 600aaacgatcaa acaaaacgat ttttagaaat caaacgttca agtttgatcg ttggtcgtcg 660ctaatttaag tttgatcgac tgctcatgtt ccgtgacaat gttaaaactg aatacatgaa 720acatgttttt tttttgtgca tttgatttta ataatttttt aaaaactgat aacacccgaa 780aaaaacagca agaaactaaa caaggaagaa aagaacggtc ggtgaggaac tctcgtgaaa 840gtcttcaaaa gacttttaac ggaaaaacaa aaaacaaaaa acaaaaactc ttgtgaacgt 900aagtgcatgg accgaattgc acccgatcac gtcggttcca caatcctctc ctatattaag 960gccttcactg ctccttatca tctcaaactt ctcatttatt 1000421000DNAArabidopsis thaliana 42aacaaaaatg gatttgatca agtgaaaact ttggtccaca tactccactg agatatttac 60ttggcttaca acatatgtca caagatatat attccaaaga gatattttac ttggctcaca 120taattcccaa gatatttcat atgttttggt tttgttttcg atgaagataa gatatttcaa 180atgcttctcc acctaaaccc aacaactttt gagaattcca aattctgatc aagatagaaa 240ttttttatat tgtttttcga attaattgtt tttcatatca ctacaaaaca aaaatattat 300attaattaaa gtataaaaaa tggtactaat ttaaattact caaaataaat gatacacata 360tacatggtgt tatatcaatt acaataattg acactattat tgaattgttt gacattgatt 420taataaaatc ggtacaaatt cacattaatt tgcatctttt tcaaaatatt attacaacaa 480catttaaatc actatgttct taatcgtctt ttcgtaataa taattaaatt ttgaggtttt 540tcgttttttt agataagaaa aaaataatgt aaaaaaaatt gacgagatca tattttggtt 600gatcacaagt gaggaaaatg aagttattat gtaaagaagg attctataat cgattgacca 660acattcctcc aaagaaatat attttatttt ataaaaaaat ctaataaata attggaaata 720aaatcatcat tcaccaaact ttatactgtt ttacaaatct acgttttagt caatttcttc 780cccatacttg acctcatgaa gcaatgaaca caaatctatg tttcgatttt accatggaag 840agggtcttag tgatatgaat ttaaatgtcg ggataaaata taagattata aaactataaa 900ggtttatttt gaaatacagc acacacaaga aaaaaggccc atcaacctcg cagtcctaat 960tagataaata cacgtcaact tctctgggaa caaaaaaact 1000431000DNAArabidopsis thaliana 43agttaaaacg ggcttacgaa accataaaaa acatgaacat ggatgctgat tctgtcttat 60ggagttctgt tctcggaagc tgtaaacttc atggagattt tgtgttaggc aaggaaattg 120cagagtatct cattggacta aacatcaaaa actcaggaat ttatgttctt ctttcgaaca 180tatatgcatc ggtgggtgat tatgaaggtg ttgcaaaagt gaggaacttg atgaaggaga 240aaggaatagt gaaggagcct ggtataagca ccatagaaat cgaaaacaaa gttcatgaat 300tccgagctgg agatagagaa cactcgaaaa gcaaagaaat ttacacaatg ctaagaaaga 360tcagcgagcg gatcaagtct catggttatg ttccaaatac aaacacagtc ttgcaggatc 420ttgaagagac agagaaagag caatctctac aagttcacag cgagagactc gcgattgcat 480acggtctaat cagcaccaaa ccgggaagtc ccttgaaaat attcaagaac ctacgagtat 540gttctgattg tcatacggtg acaaaactga tatcgaaaat cacaggacgg aaaatcgtta 600tgagagatcg gaataggttc catcatttca cagatggttc ttgctcttgt ggcgatttct 660ggtaaaacca aacaccacac attgtaattt gtacgtgaaa agaagatcaa agtgaacaaa 720tttagttgga ttatctaaat atcattcttt catgggaatt atgttgaaca cgtggagggc 780attatttgtg aagttgaaac gaggactatt tttcaaactt tcttgaggaa caagacttga 840cttttgtaac cataaagata gaatcaagtc agattttgct acttcgagaa cattggtcaa 900tgagtcaaaa tgtcaaatag tcaatactaa aatcaaaatt gaactttgtt tattaatttt 960gttctaatct atttaacccc acaaaagttt ccaaaactgc 1000441000DNAArabidopsis thaliana

44acgaatctcg tattcattac tatactgcta atgaatgcat caattttaat ttttattggt 60tatctatcct taaaaaaaag ttataagaat attttgataa tttgttccaa gatgtagtac 120tagtagtagt acttggtctc gtaaacgacg acgccgttta ggaaatcaaa cggccaaaaa 180tgcttccact ttgcgaaatt ttattggtgg gaaacgcaaa tgtatcgata atgatttaaa 240gaaaaagcgg cgtttctaat tggaccacgt tgcccaatct ctacggctca cgcacgatct 300gacgtcaaat tggcataata ctataataac cttatcgaga ataaaataat agtcgcatcc 360aaacaaaaga aaccaaccct tttatatcac gacgagtgtc acgccatgga cacttgtcca 420tctctctcgc tctgacccct ttcaggtatc ttttatttct caaagaggaa ttattgattt 480tccatttcca aagaaaaaaa ataaattcga aggtcaggaa aattaacaaa aaacttcctt 540tttttttttg ttagtttgtg tgactgagct gcttcatttt ttttctttct tttttttttt 600ggtttgatga atcgattttt gttgtctatt actgattggt tttcttgttc agattcactg 660attcgaagag aatcatgatt tttttttccc gctgaataat aagcatatga ttgggtgttt 720tggagatttg tttactgatt aaaaggagat tcctttccat tttcaccatt tgctctgttt 780gacttcattg tgcttatatt tcatttagat cttttgtttg ggtttagctt tggaactgat 840aaaaatctga ttttgtctca cggctttgga tttggttctt aaattttggt actttaaaac 900tggataaaga tcagtgcttt tttagattct tcgtttgttg atgaatttat ggatgtatgt 960ataattaaac cataatctct ctgcttgttt gttttcttat 1000451000DNAArabidopsis thaliana 45ttgaatttta tgtaaagtta ttctgttttt aggactttta tcctcactga attatgttgt 60gttaaagtag tagcttatat ggtcacatat ctcacacgtg tttccatttt ttttctatct 120aagttttcgg tttttacatt gtatacaaaa ataaatggca tgccttgggc cttggccact 180gattcacatg atcacgtaga gtttgtgtgt gattcaaatc ctagatgaga agatcatcca 240attgatctaa gatgtgtctt tcttatatca cacgttttat tttaataata ctcttcagtt 300gggtttgatt ccatttatga tttatctatg aaaacagaat atgtcctcat gctgattatt 360tcgtttgtgg ttgcatttta ccatctgtag ttttttttgt ttactgacaa acaactaaga 420ttttatgaat tcatttgatg ataaatagtt aatcaaatat attattatga gtgactcgat 480ggcgaaagct ggtattatct tatggctgct aaagtctact tttagaagaa gaaaaaaaaa 540gggtctgttg cattaccctg ttgttgcgac tctcctacaa taactctcac gtggcccgtt 600tcttggtttt tgccctattt tcacaaaagt aaacagagat tttcactagc aaattgggcc 660caattggaga caacttcatg tggtttgaat cgtatatttc agatagatta gacatctagt 720cttctttgat gtgcggtgag gatcttaagt cgctataacg ttgtgaaaaa agaaaagaaa 780aagtcactat aacatagatc gcaactaaac aaataaagga cactatacat gtacgtaata 840atattcattt attttcaacg acaatgttcc ctagggtggt aacccaagta tttttcatag 900atattatact tctagcgaaa agccatcaat tgagaaaaaa ggcctccacg tttgacccgt 960tttgcctata aaaggattct ttaagatatg gaccttactc 1000461000DNAArabidopsis thaliana 46caaagctcct aaatgatata gcttcaagca aagattcaca gagactatca catgcgagta 60tttaatcatg cctactataa aagagaggac aattagctac ataaatcaaa cgctataatc 120aataaagcga tggataatac cgaaaacgac ggcataatcg ctgttgagga catcaccgca 180agtgccacca ccaatagggc aaaacgcatc ggagccagtg accttgaggt aagtcaaata 240cgcagtgtct aacatcccaa taccgccgat tccggtgtac cagttgtatg tcgaactgtt 300actagtagat acttcacttg tagacgatga ggaggaggaa gacagcgacg gtgccgaatc 360ctcaccgttt tccggctcag atgaagaaca tttaatcgga atcgcaggaa atctccgatc 420agagacctaa ttcagaacat ctgacttcaa atgaccgatc acacactact gtcactgaag 480aaaacacatg acaagaattt acctcgaatc gtctaggata agaagttacc gacggaggag 540aaacctcacg gaagccgaaa tgaaactgac aagacgacac agaaacaaac ctcgccatca 600tcgtcgtcga agaatcgcta ttccaccatt gttaacttca tcatcttctc tcctccgctc 660cactatctca gccacttgtt tcaatattgg gcctaaacga ggcccaaata cgataatttg 720gatccataat tagtttatca caattgggcc ttacggtctt cttatccgtc cgagtcagca 780agttatcctc ttcaacacga ctcacgaggc accgagttgg acttatcatg attcattatc 840tacgcattgt ttacattttg gaatcgactt tggtacataa ttgaatttat ccatagaaat 900gatagagact cgaagatcgg acataaaact tgctgacgtg acacccatac ctttgtgtct 960gtatgtatta tataaacgct atagaactaa gagaagaagt 1000471000DNAArabidopsis thaliana 47ataccctcgt gacctttatc gaaggtttcc accttcacac tcaagatact tctcctagga 60tcttctctac acacattcct gtagggtcac tccctgagtc agttaaggac tcgtcctgta 120aagttgtgta ttgctgcagg aacccgaagg acgcgtttgt ctcactttgg catttcatga 180aaaatttgat tgttaaggag atggttggat gcacaatgga ggaaatggtg aggttttttt 240gccgagggtc tagtatatac ggaccctttt gggatcatgt attacagtat tggaaagaaa 300gccgagaaaa cccgaaaaag gttatgtttg taatgtatga agagatgaga gagcagcctc 360aggagtgggt gatgcggatc gccgagttct tgggatattc ttttacagaa gaagagatag 420agaatggagt attggaagat atcataaagc tgtgtagtct tgagaatctg agtaaattgg 480aggttaatga gaagggtaag ttactgaatg gaatggagac taaggcattt tttaggaaag 540gagagattgg tggatggaga gatactttaa ctcctttgtt agcagaggaa attgataaaa 600ccactaagga gaaactaatt ggttctgatt ttagattctt ttgctaagaa tctgctttac 660caacttagtt ctcaatttct tgagcttttt tttttcttct ttttattggc ttaatacaag 720taaggttgtt gacttgttgt gatggaccaa agtcaccaaa ctggtctgtt gagaaataaa 780actattgttt ttgagtttta gacataagag tcgtgattct ccttctttct tgttttgtta 840tctttaaata ttatattcta cgtgctcttc aagtaaagga tgtgttgacc atttcaccac 900ccaaactaat atcttagaag caaaagcatt gtgaaattta cttgtagaac aaccttctca 960ttcatatata aaagcagaca agcattagca tcatagatac 1000481000DNAArabidopsis thaliana 48cagttgatat tcataacatt ttgaaagaac ataacccaaa ttagtagtat atttctattt 60tagttcgttg aagaaactgg atttatttct cgaaatatta ctaaatcgaa aaaagaaatg 120aaaacgtgca catgttgtta tagtgtttaa cactgataaa tttcagtttt accttcttcc 180tttttttaag gagtctagca tgacaagaga tctttgttat cattcggaat atgaacgtat 240tataagaaaa tgattgtatt caaaaataga ataataaata aataaacgag aaatgagctg 300tgctttctag aagaacatcg atatctcttt ttcgtatttt tcgtctttaa ggctggaaac 360aaacgaaggc gtactcctac gcctcaaatc tttcatgcct tttggctttt cccaattaag 420attatacata ctatttagca aacaaaacaa tggtagtagc aattaatcca acaaaataag 480aatttttaga aacgtaagta ttcctccaaa aaagataaaa taggtgatta gtttaccttg 540ccagacaact aaataaaatt aatgtgaaac ttaagataaa aggcagacga gattttttag 600gttctgtgaa gacaaatccc tttctatttc gttttgacat gcatagaagt taacacatac 660tacggaaatg ctaaaaaaac gagatagtgt cgccatgata actgcaatgc tgctgaaacc 720ctcaacttta ttaatgaact aatgaagcca aatcattttt attttatgat taaaaagttg 780ccaagattcg tgctcactct gtcaaagaca acaactttat catttttcaa acaatagttg 840attaaactaa actaaaaatg ggtatacaaa atgagttaca acgataaaga caacaacgac 900tttccctttg tttgattatt taaaacaagt ccgtgataag aagaagatcc cactaatgga 960aaataaaatg ttataaaaac tggcggcaat acacaattgt 1000491000DNAArabidopsis thaliana 49tggatgggga tggcgtacta tgcatggctg ttgacatttt acccacagaa tttgcaaaag 60aggtatgtat gaaggttaca gttatagtac ttaagattaa atctaaagtt aaaaaccttg 120tattgagtgg gagttcttgt gtcctgaaaa aggcatccca gcattttgga gatattcttt 180ccggatttgt cggtagtttg gcttcaatga ctgaaatttc agatctacca gcacatctga 240agagggcttg cataagctat aggggagaat tgacatcttt gtatgagtat attccacgta 300tgaggaagtc aaatccagag tatgttctgc ttcgagcgtt acttcatctg aaatatttag 360gcctcttctc taaactatgt tttcatcttt acccacttta actgcagaga ggcacaagat 420aatattatcg ccaacggggt ttccagccag agaacattca acatattggt tagttttgat 480gaagaaagta tatataacta gtttccgaat catatgattt aagctaatga attaagaaaa 540tatatagttc aagacttatg attcatatct ctatcaactt tttgaccaaa gattgatact 600ttttcgacat ctgtcacagc attttgtgat gattttgatt gagacaaatc atttgtaggt 660atctctgagc ggacacctat ttgataagtt tctgataaac gaagctcttg atatgatcga 720agcggctggt ggctcatttc atttggctaa atgtgaactg gggcagagcg ctgatgctga 780atcgtactca gaacttgaag taagtttctt tctggataaa acctaatcat tcacatggaa 840caactgtcaa gagtttttaa tgtcacgttt aggttcaatg tccttttcac taagtctcgt 900aagtttttaa aacaagtaaa caaactacaa gccaaaaaca ttctggcccc acattaacct 960attcccactt gttaaagaac ccatcttgca ttatcttggt 1000501000DNAArabidopsis thaliana 50gtaattttta tttaagtttt ggtttaggtt tggtttgatt aaaaaccgta aaaccgaacg 60ttttttttgt ttttgattta aattaaaaat aattgtatat atatatatat aatgttcatt 120tgataacatg atatctatca aactatcgaa aaacaaaacc ctaactgtaa cctaaactaa 180aattctatat aaattacatg ccgtcattta ggatttgagt ttacaaatta gattttgatt 240ttatttatgc atcacactta taattttttt tggtaaaaac atgaaaaaac cggaaccaaa 300ccggaaccga tccgaaccaa aatacatatg gtttttaaat ggttttaatt ttttaaaacc 360aaaaactgta aaactgttaa aaccgaaccg taaccaaacc gaattttata tggtttttat 420atggttttac ttttcttaaa atcgaaaaac cgtaaaacct aaaaccgaaa cgaaaccaaa 480ccgaaaaact gaacacccag cccttaaata taatgaaaat cgaataaatt tgtttgaaag 540aatcgaacaa aattgacaat aaaatctaat taggactatt ttcgtctaat tttgacttag 600ttgaaacaga atattagcaa aaatactaaa acaccacaac gcgtaataat acccacacac 660gatatcatta aatttgacca ataagaatct agctcttggc gaccacgcaa gtatcttcca 720tcttgctctc caagaaaaat ctacaccggc tttaaattta cataaacacc ctcagtcaaa 780gaaaagtcgt aaacatagtc tctctcatga ccacaagggt aacacagtca tcctaaatat 840aaaccacaca agaaaactgt tatactttat acacgtgtca tagtctcatt acatctacgt 900gaagagtttc gatcatcaac cgttcgtttt cttactatat aaaccttgct cgagacctgc 960gtgtgaagcg tataaagacg acaaagtaaa ccaaaaaaaa 1000511000DNAArabidopsis thaliana 51acgaaacaac ttgcattaaa caatttgctt ctactatact tatattgatc tatcgtgggc 60ggtgagtaaa atgagttggg acatggattt gtcatttagg tcatttttag taaatgtgaa 120acttgtaacc aaaatatgca taaaaacttt agcctttcta aaagctaaaa atggtttgag 180gaacaccaag acgtaagctc ttttaagcta tttgatgttt gatgtgataa aataactaaa 240aactatttgt tatgtttttt cctcccctta aagatataag agaagagaag gtaaagaagt 300tgtgtaagtt agtcacaatc aattgttgga aataaatttg aaaactctgt taaaagtcac 360ccaaatgagt ttcaaacaag agttaaaaga ttaaaaaaaa aaagtaactg atataaagaa 420tggtagttta cttttttttt aaagaatggt agttttttaa aaactattta ggttgatggt 480atttttgtaa agaaatccaa aaattgtagt tgatacaaga aaattttgat ttgtgttgat 540ttatttctct tttggcatgt gaatgtaagt aatctttttc cacatgattt ctaattctgg 600acttgcatgt cttgttacgc cagtcaacat tcgttggata agttacaatg tcatttacaa 660cttgcagatt aattcggaaa tcaagaagga aaaaatctga attagaaata gtaggttgga 720tttttcacaa aaaaaaaaaa agtacaatat tttgagatat gaccattaga ttcttaaatt 780ttacgattct aacttgtaag atttcaaatt aaaccatgta taattcaatt taggggtaat 840ttaatacttt agggtagtaa cgataaaata gataattaaa ttgaagttta aggaccatcc 900tgtaaattca agttcggtac cggaaacacg tcagcattga tctaatagat acgagtcctc 960ctttctctat aaataacgta atctctctcc cgtcttcatt 1000521000DNAArabidopsis thaliana 52aatcctactt tttcaaattg tttttttttt ctttgtatta atgctcccaa gatgtttatt 60gaactgaaca cactagaatg aattatagct tttgtaatca gtcgagtaga ctagtttttt 120tttttccgat ttaattttat taaaatgtca acataacaaa cttccaaaca gaagtctttt 180aagaattttg gaggctcaaa gatttttttt tctcgatttt tttttactct aatttggacc 240atttggtctg ggaatacatt tagaaattga ttcatctcta tctcccgtat atattttaat 300atatttagaa aaaatgctaa aataaagaaa agtagacgag cttataattt tatatattaa 360ataaaaactt aaataaaagc ttataatttt atatattaaa taaaatcttc accttaaaac 420tattaaaata aataaatata tatatatata tatatatata tatatatata tatatagttt 480tgttatcggt attaaagctc tattttgtag tcgaaagaat aaaatcatat aaaacaattt 540tgctttagct ttatatcgat tttattttgt gactaaatac tagttttgca aattactatt 600cattatgtta atttaaaata ttttcaaata gaaataaaat aattattaat attttttaaa 660aagatataaa aaaaatcgat tacaaacaat gatataacga gaaaaggata gcattaaaaa 720ttcattccta acaattttgt actcaattat tttccatttt tagaaaaaca actgcttccc 780aaatatttga tcttagaata tttttccaag cttttaaaat ctttttgaca gatacgtggc 840acataaaact aggtggtgac atagatatcg acgtgtgtaa cgtttcttgt gtcctaaatc 900aattaaaaaa gcctcacatc tctttgtaca ctacaaagaa aagatcattg attcactcac 960aactttcatc tatttatagt ctttaaccca tttcagtatc 1000531000DNAArabidopsis thaliana 53tttgaaaacg ttaaaaatat aactaaaaga attattgtta taaaatataa ttactcccaa 60ccagaacatc aaactaaaaa cagaaactca tgagaaagat ttgatctgat taatgacaat 120acctcaggcg atgtaaaaga catcattgta ccatcaggac cgggtctagt cataacactg 180tcgtcggcat accacatcat ctgaactacc tcgggataga ctcctagatt caggaagtaa 240gattggagaa agcaattggc taatgcttcg gaagtattta agttcgtgaa gtgggattga 300ataaagccat tggctaacaa tacggaagga tcaacatctt tgatcatgga atccatatct 360gcaccagtaa ttatgcccta ggggagacga agaaatatcc aaagtaaacc ctagctagat 420atcaaaatct cttaaaaaca aacttatctc tctgttgaga tcaaccgctg atctataggc 480taaaggaatc accggaagag agagagagag aggaaagagg cggaagttgt gacttgtgag 540acggtttgga ttttggaaac ctacgttcca ttgaacttct ttatatattg gtttttcatt 600atgggatata gtaatatctt tataaaaaaa aaaaaaaaaa ttatatatat atatatatct 660ttaatacatt attaaattaa ataatatacg gataggaatt actataaatc tttataaaat 720aaaaataaaa ggagaagtcc cttttatcta atttggatgg ttaatttgtg aattttccat 780atttatctaa tttcgatttg aaacatatcc aacgttaagt cagattcttg tggtgaaatt 840attgcatcat aacataaagt catgagagta ggagagaaag taataagtgg caacatcaaa 900acgacgacgt ttttaagcta gtgcgtcagc aacagttaaa ccatatgaag ttaatggtct 960gtgttaatga agacaaaaac tatatcctca ggttagcgat 1000541000DNAArabidopsis thaliana 54ttataccatc gtaatccaca cgtacacatt gtttttaact tttgacaaaa taattgttta 60gctacctaac aaattgacat attgtgtggc aaatttataa ataaataatt acatatatct 120cactgagacg ccatgcagat tcgtatgttt atacaatgcc tcacccattt cttgaattct 180ctcaattgat cttcccatga tgttcattta ttatttgctt tatgtatatc tttgtagaat 240ggtagtatta ctattaaagt atcaaattac catgacgacc tatagaaaaa tgtttttaga 300aaagcatagg gatatgtgtc ttatggacag tggataagag agtggtccta aaaacagacc 360ataaattatg ggatcttccc ttagttcctt tattgttatc atattttaat tactcttatt 420agtatttttt taatggtcaa gttgtttatt ggaattacgc atttaattat ttttctaatt 480cgtatacttg tgagaaacag aacagcacgc tttgcttctt ttactagaca aacccttaat 540tagcgatatt tgaggcagaa aaaaagacaa ggttgcgtgt gttttactcc acaaaagaac 600tcctaaaacc aaaaacgtta aaacccactt cttttttttc acagtattct cttttcatgt 660caagaacaat tactgttttt cttggtatgc aatctgtaaa tttattgtag ttttcatttc 720tacttttggg gtttaggagt aaaagtggag tcactctact ttgctttacc tttttctgag 780tttttttttg ttcatacaaa tccaaatttc cttttctttg tattcctatt ttttctcttc 840tcgtcccttt cagaatcttt agagagagag aacaacaaat tgagaaagaa aaaaaggtaa 900ggctgtgtga gagagaaccc accaaagcgt gcacgagaga gagaaaaaaa aagctttgat 960tttgtataaa atcccaccac tgctctctta ccataccttc 1000551000DNAArabidopsis thaliana 55tgtgaaaaca gccaaaagct acggagagcg acactcctcg acgacgtcca tgtctcttta 60gattgtgatt gcattgttaa ttatcatatt tatcaattga ttagttaagt cttaagtatc 120atcacacaag tttatacatt aatctataca gtgtttaaca ctacaaaact ccccaaaatc 180acgtgtttca tatcatacgt acatttgtct accgtcttta ttaacaatat taaaacagaa 240tttatcaatt ttctagttga atagactata tactattata gagactctta taaattatga 300aactcctaat caattaaaaa tagttaattt gaaacatatc actcgaaact agggtattag 360ttggattgac gatatttttc ttattaatat caacgagaag aatcaagtgt cgttagctca 420attggtaaaa aatttaggca aagcttagag atgtctattt gagtgacgct tggaacaaaa 480attacatgct atggtttcag gcctagagaa atgcttcgat ccagaatctc ttgatattca 540aaaaaaaaaa aaaaatcaac aagaagaatg tataaatggt ttctatacta tctaagttat 600gaaaaatatt tttggttacc caaattacaa gatttgtacc gaagtgtcac actagattca 660catgcatgca tagttataat taacttaaaa cataaatgac ttacaaaaat acttacaaac 720aaaaaccaaa ttaagttatt tagtggccag ctttgttact agagttttag ccattatcat 780aattcacaat aaggtgtagg aaaaaacatt aatatatagt atgtatacat ccaaaaaaaa 840ctttttggaa aaaacaaatt ttgggattat aaaggacacg tagcccaatc gatgaacaga 900ggacggcacc gaatcaattt gacttcacct tttttcccca tatgataaga gtctcatctc 960tttcctatat atatttcgta cctctcttta aggaccctac 1000561000DNAArabidopsis thaliana 56aagactatgg cgtgaagcgc atgagctgta tagaaaaatg caaaaagagg atgggcttat 60gccaattaag tagacgtata aaagaggtaa atatgagaca ccttagagat agcgagttaa 120gggttcacca tcaccacaga tgcgtcttaa ttttaagttt ggctgttgta tgatacgaaa 180ttggacaaac attaagctag gaggcatgat ggatttaatc atattttaga tgggaatgag 240aggatattaa ggtaccaacc ctcaaggcat attgtagact ggtttggttt ggacatgtga 300agagttttga agctgaagtg tttggtcact ctagcattag atgttgcagg tgtagttttg 360tacatacatg taaatgatgt ctctttctta cgcacatttg ttgacataga gaagctaata 420tttgcttaag catgttgtaa gctgtaactt tagaatttaa cctccactgt aacatattac 480atatgcaatc aggcaaacga agattcgatc aaacgggtca ttgcaaatcc agaagtgtga 540gaagattgac cattaggaaa cttataaagt ggttcagtct tttacaaatt aaatttcctg 600tttcaaacac caatcaaatc caccgaatta ccaaagattt catgtttgcc tctatccata 660atggttggaa aattttcaat tttttaagta atagtttgat attgtgggga aaaaatacta 720ttctaaacgg tagataggtc ttacacgtga gcattccgta actggacaag gatcaaccaa 780agagaaagca gggagatcca tcacggaaca ttatctttgt aatcaaaatc tgatcgtaca 840cgtgtacact atgatgtcca ctaacagtcc acatctgttg acctgacatt ctcagacaca 900cgagaggatc ttttttcttt ctaacctcta aaatatttta attttaaaat aaaataataa 960aatccgtata tgtgtctatc gttagcttct acagctcaac 1000571000DNAArabidopsis thaliana 57ctaatctgat ttgatttttt tttattttgt ctacgattct tgagttacga aatgttcatc 60atttgttact tcttgttgtc gggattgagt agttgcattt ttttagttgt ggagctgtga 120caaagaggcc agctagtgtg tggaatgtgg atcatcatcc tttggttgat tccttgtaga 180atcctttttg taaacattta aactataata aagtgttatg attaagaaaa ctagttactt 240atttttttag tcatattgac tctgagccat aaactccata aaagtcctcc tcgattcaat 300tcggaattta aacagtttca gttgaactct gaacaagaaa aaaaaaggaa aaagcaaaag 360gagtccgtgc aagataactc tcattttatt ctctttgttt ttgaactatt tattttgcac 420aaattaggat tgatattaaa attttaatta aatcaaataa taactattaa ttttgcacca 480attaggatta atactaaaaa cattaaatta aatcaaataa tagcattttt gtttagtaaa 540tataaaattt aatctaaaat aattaaaata cacaatacat taaaaaaaaa aaactcaaaa 600ggcaaagcat aaaagaagaa acagagtgga ccatgtgtcc gcgtggaacc aaagcttcca 660aataactcga ccgtttagtt tattggaaac gaaaaaagaa aaatcatggc aacacaacgg 720ccataaacat ccatcacgcg tctactaaac gaacacgcgt acggtccaga ttctcaaatg 780acagctgtca tagagccgtt agcgccgtag tcaaagggtt taagcccaac cgcgtttctc 840tttttcacct ctttcttttg tagcataata tctcagccgt tagatttaag tcggtcctta 900ctttttcata tcaacggtca cgattcacta aacacctcaa tacagtgtac cccacttctt 960ctctctatat aaagaacacc atctcctagg tttcgttaac 1000581000DNAArabidopsis thaliana 58aaatttcaac tataccatta atacatgtat tatattctag aaaccacggt tcgattcaaa 60ttttgaccta acaaatatta atataatgtt tggatgtttc gtaatattgt aaccattaat 120ggagatgtcg tatatcattt gagttttttt gttaaccatt caaagtacac tataattaat 180ttagttactt taatataggt ctattttcct tgcgtcatag tctctatcgt ggttttgtcg 240gagatatgcc acgttctaaa actttggact tctgcgtcgc caaagcgttt gcaacacgtc 300aaattaggaa acacaaaagc ggcttagata gcaaacgagc gataaataaa gggtcaagta

360agccaaacaa tatgtaataa acgaataaat taattaaaag aaatgtttga tgtcctatta 420attttataaa ttcttttttt ttgttttttt tttaatttta taaattcctt atacacaaat 480gtggtttttt caggaaagtt gttcaatgtg cttattacaa tattaatcat ttgatgatac 540atactcacag ttacactaac agttcaatat cctgatggag cttatcacta aaagagaaaa 600tctatactac agtattagga actcgaaaca atatcgtcat cctaaaagag acaaaatagc 660ctgactaaac atttatattt tcgttaggac aattctcaaa aaaagatact atatattaga 720ctaaaaaaag gaatgaaaca gtgacataca gctacactaa agacaaaaaa ggagataaaa 780taatatctcc gtagacacat aatagtataa taccgtgtta cgtcgatcaa aatgtgatat 840aagaaatcaa attaatacta aaagcgatat gtttcgggac agattggtcc ggacccgcca 900cctttccctt tctatataca cacacgcaat acccaccaag aacacacaca aacacttaat 960tagcaatata aaaagcacat tcacaaactc tttccaacac 1000591000DNAArabidopsis thaliana 59ccgctataca cttgtgggaa aaggcttctt gtagtacagt ttcgtcacgg cgtcttatta 60attgcaaatc tgtgacgtta cggttttcga ttcgggttac tattttaatg gaaatctttt 120gataatagct agagcctaga gtatgtaacc ctaaggttag atggtaatac cgaatctacc 180taaatttttg ttattttcgg tttttcatga gatatataga ctgcgtaatt taaaatctca 240aaatttggtt tcactttatt tttatttttt tttagtttcg aggtaggtat acaagcatgg 300accttttgac attatttaaa attttcataa tttagaatct tcatgttaaa cgtgtatggc 360tacgatcttc ggaactttat tcggtttgta tgctcaaagg ccaacgccat ttattttcta 420attgattttt tcccgttacg gaaacaatct ttttgcatta ttgaacttaa ttaccaaagt 480ttatgaaaag aaacaataaa gtctaaattg gcctattttg atatactgat cacttacgcg 540aaaacatcta cattgttatc tactagacta gtcattaact cgaagataga atttcatttc 600ttgaccaaaa aaaaactcaa aagatagtta taatcacaga ttcattattg acttttgctt 660ttaacactgt agccatttat gtactattta tacaaatacc taacttttca aatttgtgat 720atttttcacc ctacatttta gcctataggt tggatcattt tgtcatattg catcactgtt 780tactagtacc aaacttaaat atcacacgag cagtaaattt gctatcaatt ttttcgtttt 840cacaaacata tatctagttt ccctaaccaa taattaagtt aaattgcatt aggaatttag 900gatttataat aaatagatat ataacccact aacgacaaaa gactatgtct tgctacgtct 960ctgcctttat tatcatgcca caagaaaata agaggggaac 1000601000DNAArabidopsis thaliana 60actcctatga aatcaagcct tagagacact attgttagcc tcaaggagaa atgtctttta 60tgattctgtt cgtatatttg taactattat gtgtattttt attttgttag tattactaat 120tcaagtggtt taagttgttg agactcttta aaatctaagc attttataaa caataatata 180taattattgt ttaggctaaa tttgtcacta attaaggttt ggatacatag tgtctaaact 240aagctaataa tatcacttaa cgtttacttg taacgctagg tgatgatgtc gtcaagtcaa 300ttggtacaag gaataaacga gtggtcatat gacattatga ccatatgaat tcaaactcca 360gtaatccaat ggtaattgga ttcaatgatc aagacttgaa ccacgtaatc cacccttatc 420cttagaagct cataaatatc actaaaggga caggcaacac ttaaccagta gttgtccaat 480aatttagttt tccaaaatga aaaattattg ttgtcatcta ttttaggtgt tttagttcaa 540tgtggattcc tcgtcctaac aaatacttga cgaatatatc tagactataa aattggttat 600gagttctact tttttttgtt tgtgaaatta tcaaaatttg ttatatttat ttatttattc 660tcattaattt gagtactaat ttttaaatta tttatactaa aaacaattac taagatacaa 720aaatggataa gagcatggtg tatagatatt taatgggata gaatatttcc cataattgta 780tgtgtgtgag aggttttgtt ttcgtaagga aagaaacaaa aaccatttga ccaaagaaaa 840gcaaaagaag gcaaggaatc aaacaacaaa tgttgcaagg cagaaataat ggacgttatg 900ttaatgtagt gtcgtcacac gtgacttaaa agagacgagt ctgcgtgtca aactaaaaat 960gtatgcaact ataaaaatgg gatttgatta tctttttagt 1000611000DNAArabidopsis thaliana 61aaagaacaaa atagtcccgc aggttaaaac ctagtatttt acaaaaatat agacaaataa 60aataaaaatg aactatcgtg gatttagtta gagaaatcca tacaaaaata aattgcattg 120agaaacttta gatgacgact aagtaatata agattttgca ttgagaaagc ttacatgtca 180ccctaatttc tatcaaaagg gtttctgata ctaattggaa ccatgtgaga aactatccat 240agaaatgatg atcttatcta gttatcatga gctgatgact ttgaatattc cacagtcact 300aaacgcatgt ttatttctca gctttatgaa tctgtatgta gacacacttt cgtattttct 360tcacttttgt gtatttgtat tgcccattcc tcttttacca accaataaaa aaagcagtct 420tttttttgtt tggtcaagca ttggcactct ttgtcaatca attacccaag ggatatataa 480taataaattg gaataatgaa agtagttcat cataaggttc gtgattatta acttctactt 540ttgtttatat tcaatagatt aagatgccgt gtatgataag gagagcaaaa gtaatccgat 600agagatcaca gcaatattca ttacaaaata gttttcaaat aaaaacacat gatcaaaaag 660tgactcacaa aaacacatga tacgtacgtg ggatgctaat cctaaagcaa caaggccaca 720agggcaaaat atgtagcttt catgctttgg caaagcgaaa actctagtta agcttaccta 780acagaaaaaa taacccccaa aaaagcgtta cgttccaaac atttagacag ggtacacgtg 840aatgactccc acttttttta aaaaaataat agtactatca attcaaatgt gtaaacaaca 900aaaaaaaaaa atgtgtaagt aacgtctgta attgatttga taagataaaa aaatcttgat 960taatttttct ataaattgaa gcctttcctc tttacatttc 1000621000DNAArabidopsis thaliana 62ttcatgcatg cattatttta agatattaaa gagtaatagt ctcgtcgaaa ttataacatt 60aaaaagtgga ttattcgcct cgaagatatc cgatgtcaca attttgtctt cttttagaac 120ataatgttga atccgagaat cgaggtatac ctgaatatct cgccaatgtg ataatgtgat 180ataatcgaga tacctgaata cccactgaac cagactcgtg aattgacttt gtccttaagt 240ttttcaaatt tgatcagcaa tacagatatg tcatttccta ggactataaa agatatcaaa 300caaatattaa ttaaagatat gtaaaactta tgggtgaatg atcggtgatg tatgtgtata 360tgttactggc attccataat acacgcgatt caactaggaa aaatatacta taaaaagagg 420cgtatggaac tacgttacac gtttcttgta catttgagct tcgttacgat gatttgtatc 480aagagtgaaa aataactacg atttattatt tacagtttca gaatatgatt tgattcgatg 540tttaatatga tttcttgcac agctagaaac aaccataatt tcaacatttc atattagcaa 600ttcctccgtt aacagccgtt tgacattttt aaagtaggct aaaatttggg taaactgatt 660ttttaactaa tattttttaa ttttatatta actacaacta ttattattat tcgtttataa 720aaatatacga aattcaattt aaatgctagt agtattttgg tatttttacc aaaaaaaaaa 780ctgacacaat cgaaagaacg tagtaaaaaa caaagtatat tttttttgta atgagttgtc 840tctgattcag caataggaga gcgacaaaaa tatcacgatt ttttctagca acttttccat 900ctccacggaa aaagagtaat aatatgacac gtggcaacat cgcaacggct cttaccaaat 960ctcctcgcat atataaagcg tcttctccgc caccttcatt 1000631000DNAArabidopsis thaliana 63atagtattgt caatttttca atctgaagta aatatctttt tgtaaggaag gggtcaacaa 60atgatcacaa cagagttggc aaaaagttat caaatcgcat gcacggaagt tttacgtgtg 120gtgaaggtaa acttgtatta cacttatcta taaaaattag tttaggcttt gattctaaat 180caaatctccg attagaaaaa attgcgtaag caaatagctg gaaaaaattg tatcccatca 240tacttaagtc acaatgtttt gtttttgaga tttgtgatgt aatcaatata tgttttacaa 300tgcaagtata ataatattaa agtcacattc taagaaaatt atgatttgtg tcatacgtat 360acaaaaacac ccgtcacaca tcctgacttc tgaacgttaa atctgtcgca cacaatcata 420aaaatttaaa aattcaccag agatgtactg aaaagaatat aattaatcac atgatgatat 480atgcatagga gatgaggatt attcattttc tgaaattccc tatatgaacc attataattg 540tttagtaatc agttcagaaa tgctaatcat tatatgaacc attataattc ccttcatttt 600tatttaagat ccacttaaca ggatttgtta atatgcaccc acatcactaa atacattggt 660acgcaaccgt tgttccattt ccattttcac atcgaccaga atgtttacta tgcggtaaat 720tgtgtagtat gcagattttt ttgtatcatt taattttcta acacttgtta agtcgaaact 780aattttgtca caagtaaaag aaataaaaaa ggtggaaatt attaatcagt agttagatga 840ttagtttcga gttgaaatga aactcgactt aacaagtgat agcgacgact ctagaaacag 900ccaaaatccg ccctattgct acctgtcgac ccacaaatcg tttactcaaa aatgaataaa 960aaatttacga taaagcaaac ccaaagttat atcttattat 1000641000DNAArabidopsis thaliana 64ttttgtgacg aaccaataaa agaggaactg ctttttttct tcgcatgtcc actatttact 60gtttggtatg acatcgaatt ggtatgatac attgttgtac caaaaacttc aaaatttggt 120ggcttgtaat caacattgat ccacttagcc attccaagca gtgaattaca atccgtggaa 180ggaaaaatta tggtccggca ccaacaaatt atcacctcat ctcccgtcac aaacatatca 240tctagaagtc ggtatgccac accacataat taaggagcac attactcaca tacccaacca 300atgtgggaca tatctaataa gctcattctt ggctggtaca tgctctattt catatttatc 360aaaaaaaaaa atattaggca ttgtaaatag cgtttttgct gttgagcaaa atagttatat 420ttgaaagtaa cattggtatt tataattata gtataacaat taggcattga agtgtgagtt 480tttgtttttg tttatttaac attggagtat taggttctta gaaatatatc tatatactat 540tagtagttta actacagttt gtacttaatt gaaaaaatgt taaaagttgt tttaacctag 600ctaattgcta aaaatgacta aatagacata cacaaagact tgtacatttt cagcttaacg 660actaatacat ttttccttta tatatatatc tctatcgagt ctagttatta atgttgaaag 720ttgcaaataa aacagaaatg ctaacatgta aatatcgtag ccaaaaatgc taacatgtgt 780ataacggtta taaccacaac ttgatggccg acctcttttt tcttttggta accatagaaa 840tggttacacg taactagtac gaaccaacga aaactcttct tattcgatag ttaaagataa 900tagcaatgcg caaaaatatc tagcactcac acgtgtagtt ttggattctc attggtcgag 960agatctataa aacgatacta ttggaggtta gatttttctc 1000651000DNAArabidopsis thaliana 65aatatccaaa agaaggttac atgcataagc ttagactggt tttgctgatt aggtgcagct 60gaaagtaatc tcatgaaatt gatttccgtt ttgccgattc ccattttaag atatctgatc 120ataggaatgc cacattcaga tggaagccag acaccagaac agaaacaaag ttgaggtaat 180caaagaagcc atcttgagag caaagtggtt ttgattccaa gaactcatat ttatcagcat 240aacctctatg ttcatgaaac ccagaagagc acaatacata actatcacca ctcaatctgt 300tattaccaga actagcagtg ttcaacttca ctcaaatgtg gacgaattac agtaaattgg 360ctaatcccac tactgaattg accttcaagg taaaccccaa tcatttgcca ctacgatcaa 420ccaaaaatat agattcttca ttcactctta tcaaaagcta tacatgaaat tcagagtgtt 480ctcattccta aatatggaaa gttgaattct atcaacacaa tcagatcatg cgacaatgaa 540ctagaaacca cgaaccagaa attgttggta atcgtttagt ggacgagatt gaatcaaagg 600ttcaagtggt aatcgttttc tcctgacgca aaatcgaaag aaaaaagatc ggtagcgtcg 660catcctaatc gggtgacccg gaaaccaata gttgattcgt tttagtggcg gtaaaacccg 720gtttgatgaa caaatattaa tgggcctggc ccatacgagg atgatcgtgg caatgtcgat 780gataacaaca actcctctat tcgggtttat gttgacccgg aaaacgaaag cataggacac 840gtgacacatg tgatgtgagt gaagccaaaa ataataatat tgggaaagga tgaacacagc 900agctcagctt tcgtcttctc cgtcaatcca ataaaaaaat cagcaaccgt tgtttgtttt 960taagcttttt ttacaaaaga cgtacacgtc tctctctctc 1000661000DNAArabidopsis thaliana 66gattgtgggt agaagtagag cgagtgctct tactttagcg gtatcaactt ggaagctacc 60ctgcccattg gattggatct aagcttcata tttgtggatt aaaaaggcct tagtgggact 120catgtctcct ccggtgggac tagtgtcttt tacaatgtca agtttaaaaa attggataac 180aacaccgata aaaattcaca tttgcaaatt ttattcagtc ggaatatata tttgaaacaa 240gttttgaaat ccattggacg attaaaattc attgttgaga ggataaatat ggatttgttc 300atctgaacca tgtcgttgat tagtgattga ctaccatgaa aaatatgtta tgaaaagtat 360aacaactttt gataaatcac atttattaac aataaatcaa gacaaaatat gtcaacaata 420atagtagtag aagatattaa ttcaaattca tccgtaacaa caaaaaatca taccacaatt 480aagtgtacag aaaaaccttt tggatatatt tattgtcgct tttcaatgat tttcgtgaaa 540aggatatatt tgtgtaaaat aagaaggatc ttgacgggtg taaaaacatg cacaattctt 600aatttagacc aatcagaaga caacacgaac acttctttat tataagctat taaacaaaat 660cttgcctatt ttgcttagaa taatatgaag agtgactcat cagggagtgg aaaatatctc 720aggatttgct tttagctcta acatgtcaaa ctatctagat gccaacaaca caaagtgcaa 780attcttttaa tatgaaaaca acaataatat ttctaataga aaattaaaaa gggaaataaa 840atattttttt aaaatataca aaagaagaag gaatccatca tcaaagtttt ataaaattgt 900aatataatac aaacttgttt gcttccttgt ctctccctct gtctctctca tctctcctat 960cttctccata tatacttcat cttcacaccc aaaactccac 1000671000DNAArabidopsis thaliana 67agtctataac tgataaataa ttagtcaatc agatcaaaca atagagaaat aattagtctg 60caatcagcga ttattaaaaa ataaaataaa atagaaacat atctcagcac cgatttttca 120acagccggcc cacgatccag aaccgtccag cttgaagccg gcgccgcgtc gacttgttcc 180accgttcgga tattgccacg tgtgtgagag acacctgcca ttacctcgtg gaacatgttt 240tgttagtata agttactaac caagtcgcta tcttcaattc ttcattacct aagagttgaa 300taaaaaaaca accatatacg atgagacacg caaataactg atatacgaaa caattattta 360ttgatgtaca ttcttctaac tataatacta caagattctg aatagtttta gatgtaaaaa 420gtacgagaag agttatgatc agagtaagtt aagtgataag atcatgacac gtaggcagag 480gaaataaagt agaaactatc gacaacataa taattgagtg agtacaaaaa ccacgtaata 540attaggtgta cactgtagtc tgtagatcga ttgtacaaaa tgaaaaaact atatgattta 600aggaactaga tatacatatg catcgactca agaacagttg ctttatgtga agactatcaa 660ttacattatc ggctaaattt ggcttcatgt gttgaataga tggattgtgt gctcaggtag 720ttgtgtcagt taattataac agtaaacctt cactgcttca catatataaa atttgggtta 780aaaaaacttt aacattttat ttttttaatt gacaacaaac aaattctgta acggttacat 840ataatagtgc ttcagttaaa ctcctttttt aggcaactct aattccttta tacccttaat 900ccattattat tctatcgaac ttatcttttt aaaagcttcc gcgttacgcc atcacttctc 960caccatttaa atacctaaac cactttcttt caaatttctt 1000681000DNAArabidopsis thaliana 68taaaaactaa gttgtccaac tactaacgca attgacaacc acaaccaaca ccaatattaa 60gtatgattac cgaccaaatt tgaattttat aaacatatag aatttttttt gccaactcat 120aatcatattt ttatatgtat tttaatataa ggtgaatgat tatcttttgt ttcatgtcat 180attgggcata acaaaactag tacaacaaag ttgtgtgaac cactcctttt atatagtttt 240gttgttgtct tttcttccaa ggaagattac tagacttaga caagtggatt atatatatat 300atatatatat atatatatat atatatatat atatattaac taatctttca acacttgtgg 360attgataatt attcatactt ttaataaaaa ttccatagtt caatgaatct gtaacaaaac 420caagaagtga cgtgatatac ttttggaact ataatcatat tgtgagctca atggataaga 480attgattgta gtaaatgatg tcggcagtta caagtgtagt tagatatatg caataactca 540aatcggacat gaactcattt taacttatct aaagagacca ccgacgaatc attttgggtt 600cacaaaattg tacttcgatt tctaagcctg aatgtgaacg cacgtttttg aatatttcaa 660cacgtgtttc atatttcatt acatgcatta taacataaat attacatctt tgagtcttta 720actagttgac caacaaaaaa aaaaacttta actaagtcta gctagttttg ttactacata 780tataaaaaca aaaccgaaat aaatatttaa aatttataat atatttgtgt ggctaaatca 840atcaacgtgt catgaaggtc taattcaagt tggtaaggaa atcttttgtt tatgtccatt 900tcccacgtgt cactatttgt atgacggcta gagaaagaca tgttgaatta actagtgact 960ccggattata taagcaagca tctactaaaa agataggaac 1000691000DNAArabidopsis thaliana 69gaatcgttct tgatctcaat ttggtttgtg atgagaatga ctttgatatt tatagtgatg 60aaggaggcta gggttgttgg atctaatatt ttcaaaaata ttcgaaaata tcttgcagga 120taagttgcaa atcaaagatt ggatcctcac taatgattcg aatttggttt agttcataat 180gattggctat atttagataa gtggtaaata ttgtttagtt gattatggtt agttttagat 240ttaagagaat aaatttataa gatttttttc taatttatgg aatgtattat cgtgttatgg 300tagttagaag aatcgaataa gctcgtctat gtcgatttta tagtgattag tgaggagggt 360gaatatatga ttgaagattc ttaaaatcgt aattaaatat gcagactatt ttgatggaat 420atgcgtgtat tagatactaa aaattgttga gatatacaac taatcgtgtt atggttaaga 480ttgcattact tgatggtaac ctattatgta gttttacatg tgatcaaaag cccattaaac 540ataactcacc cgtctatgta aaacgcggat cctgtttaat ttagttattt ggtttggacg 600attttaccct tgataaggaa gaacaaaatt caaaattgaa atcaaaattg aggctaaatt 660agacaaaaat ttaatttgta cttctttttt aataataaga ggatatgact atgatacctc 720gcctaattct ggaaataatt ttttaaatta atcaagattt ttttcttaat aggtaagcca 780aagatagaac acgtctcgta ctattacaga aatactcttc aatctgtcac aataaaataa 840gaattagtat catagatggt aacgtatctc catttatctt ttaatattag acgttactta 900attcttagta agagaaatcg taatttactt ttgtacttta attgtggtcc cttgatacca 960cattttccta taagtaagta cacgcaacag catcaacaac 1000701000DNAArabidopsis thaliana 70catatttgga atccatatta tgcaaggaga catcggtttt actaatgcga attgaggggc 60ttgtataaaa tcggtctgct tcagaaaaca ttccattttt caacccttga aagtccattc 120agtttaccat ccaagagttt ttccttgttg tgaatgataa gaaaatccat aacatttatc 180taattttatc gtattggaaa aataatttaa tatttctgaa tacatttatt aaaaatatag 240ttaaaaacga tgtattttta gcgttttata attatttagc attatttaga tgttttcgtt 300attttgttat tacatcattt taattattat tcatgtgact tgaaattaat tactttttaa 360ggtcgaatct ttttttacgc ctgcctttac acttcttatt agaaggttat gtatcgaaac 420agccatccaa aatttaaata aattctaaaa gaagttatat atattatgaa tgtatggtga 480tactgacatg ggtagggtac attctttttc tattgagtta gttgatacgt taagttggat 540gagatataac ttatactatc atataagctt tttttaattt gtgtgtttca agtttggtcg 600gttttgttta tgtcttttgt aggtggggta tacagaaaaa taagagaaaa aaaaacaaag 660tcaaagctga ctttgacgac aaatcgctaa ccttcgagag atgatgataa caagcctaat 720taaaatacac gttacgtcgc aatcaattgc ttcaatctca ttctcattgc atgtcatcct 780tagccgcggt tagccgacca tactatcaac gatgtgcata ctatcaacaa tgtatttgta 840ataattaagt agaaaaataa agaatgttcc accgtccaaa agaaacacaa gtacggtcct 900gagagggctt gagatatttt gcttagtgat atatgtcaac atcttagtta agccccaccg 960gttctgatat aaaagtgggg aaaatatttc ataaccacac 1000711000DNAArabidopsis thaliana 71atctcgagat agcctgattc agctcaaaga aagagtgtac gagaacattg ccacagtgcc 60tctagttgta gagagaatgc gagagagtaa agagagaatc gacaagtcca tttctttcaa 120tggaactact atgcacccgg ctttcacaag gcggaaagct agctaactga tcagtttcta 180cttctcccta ttggctttac cactcaaaga aagctaatgc ataggaggag atatggctgg 240tatttctttc tactatatta gtatcaacca acatctctcg catttagaat aagagtatac 300cataacctct gttttgattg tgttctacat tgaaacaggt ctttctctga ttttaatatg 360gactcttata gaatcttggg ctgattctac accctgtatg tatatgtaag aatcacataa 420tcagatggtt gcacaattgt caagaacacc caaatccaag cttgcgattt agcaatctca 480cccgtgtaag aaacttaaaa ccataaaacc aattgatata agagcttttc tagcaaaaac 540ttaaaaatat aattaacacc gtcgaaaatg gtggaggcta ctagagcgag atataaactt 600tatcaccgtt taatcgtgtg ggattttgaa atgatatgaa cctattgcaa aaagacaaaa 660aaaaaatata aacggaggaa ctaggcagtg ggaccggaga tccaccgacc ggaaacgaag 720aatccatagt aagatagagc ccaggggttc ggtcgtatcg gaccggtcgg atttggattc 780taacgacaag tgagtggacc cccagaaacg acagacgtaa gcaatgacaa catcaaactc 840atcgccacgt cagcaacgga gttctacttg cgaaacgtgt cccggcagcg tcatccaccc 900acgtgtgttc tagaagctct tctcctcctc cgtatctcga cacgtgtcca ctccctctcc 960ttatataatt gacttttctc tcattctttc caagtttcaa 1000721000DNAArabidopsis thaliana 72gcggttgttg gatggctgaa ggtaaaagct gctatgagat ggggattctt cataaggagg 60aaagctgctg agaggagagc acagattgtg gagcttcatg ataataacga tggtaaatga 120atagtcaggt tgcttgctag tgtatagaag ccatgtacag tttagggatg tataacaaca 180tcacttaccg atatggattt gcttgattta gcttattaga caagaatata atcgatgaac 240catgagaaat ttgaagacta accttcagca gcaagacaag aagctgattt tctgtctctt 300taagtttcaa ctttcaactt gcagatatat tagaagactg actaagtatg ggcctataag 360cccattaatg tctctactga acgcccaact tgtgaaaaac atgtctctca tctcaatttt 420gttttctttt cacatcaact taaagctacg agaaccaaac aagtgatgtt tttggtcaaa 480caggacgaaa catgtataca taaaaagagg taattatgtt cgataaacat aggtataaat 540aaactataag aatttgttac aacattcttt agatttgata ttaactgctg tttacatttg 600aagaactcag ttagttattt aatattttat tttatagggt ctagaaatta cttgcgtgtg 660gggactgatc accgattcta aagaaggctc atcctttgga taatagtatg aaacttgtca

720ataaagataa gtcatcacaa gtagggagat cttagctgtg ttccatacgc ccatctagaa 780aaagcgacga tggtcaagat taaataactg tatttgaaaa accaaaaccg cgtcaccaac 840tccaaagcca ttaccattag ccatcacttt ccatcttcca gctgttcgaa tcaggacgcc 900cctttttctt caccaaaccc atcggccgat aacgaacctt cctctctgac tgcctctgct 960cttactataa atacaaccaa tacgacctca tccaaaaccc 1000731000DNAArabidopsis thaliana 73gaccaccggt ttctcaagtt tccagaaata tgcaaggatc tggtggtccc ggcggcagat 60tctcagggag aggagatcca ggatcaggcc ctgtttcaat ctttggtgct tctacttcca 120agatcagtgt agatgcttct ttagccggtg ccatcattgg aaaaggtgga atccattcca 180aacagatatg ccgtgaaaca ggagcaaaat tatcgattaa agatcatgaa agagacccaa 240acttgaagat tatcgagctg gaaggaacat ttgaacagat caatgtagcg agtgggatgg 300tgagagagct tatagggagg cttggatcag tgaagaaacc tcaagggatt ggtggtcctg 360aagggaaacc acatcctggg agcaactaca aaaccaagat ctgtgatagg tactctaaag 420ggaactgtac atatggagat agatgccatt ttgctcatgg tgaatctgag ctgcgcaggt 480caggaatcgc ttagttatgt ctttagactc ttgagaacag attatgcatt gttagctcca 540ttatcattgt gactttttgc tctctctttt tgttttatca atttgtttta tgcgactcgc 600tttgaaaact ttagcccatt ctgtattgag ctctgaagat tcgacgagtt ctgtaagtta 660ccgatcacag ttaaaagact ttgatatgtt aaaaccctta tattacagct acatactatt 720tttgtcttaa ctcttaagat atcatgcaca ataatatact tgttttgtct taacctatcg 780attacaaaac cgggttaacc gccgacatga ggcgaactct aaagcctaac acgcgtcaac 840atctatcttc tcaacgactc aaaggctttc caacacgtgt aggaccaata actgaaacac 900aaagcctacc acctcttctt cctcttcttt catgacacgt ctcactgacg tgtcgtcaag 960aacgtaatta aatattaaac tatcgtgacg aacgcgaggc 1000741000DNAArabidopsis thaliana 74taaggtttta ggccaatgat agtgattata cctcttttgt ttgtacttga ttcgtcacca 60ttattctgct tgcgtttatt ttagaattta gaggtggctt tcggaattaa aaacaacaag 120acaacaacat tttataattt tgatttgtat actaataata gaaccaaact ctgcttctct 180cagctattat tatctaagta ctgatatatt tcaaattgta gcataagttc tgtatatgta 240ttagatagag ttatcttttg atttattatc atatacggat taaatttgca gtttagagca 300atgtatttgc aattttgtga aaagtgaatt tgctttattt ttgtcttcat tactaaagtt 360gaattagcgg tttgatgagt cattatcgct attaatttgt ttccgtaata ataatgataa 420gcaatcatgt cacgaacata ttactaatat tgtggtgatt cttttgccct cctcatccat 480attcattgat tgtaaaagca cacgtaggtc acatacgtat catcctaaac cccaaattaa 540aaaaaaaaac taagcaattt ttatacatgg cctttggcga tccaagcatg caaaaatatt 600ccataaatgg ttaacacaaa atcatcgatt ctgaggggtt aactagggtc cactttatta 660ccgtatggaa atagagacac gtcaaagcta gtatttggac ctacgtacgc tcatgtattc 720cgccacgtgt cgtggtttca ctgacataaa acacatgtcg gtcagataat tctagctctc 780acgacgatga agcaatgtag gctcacttcc actcaccgcc gcctatagca gtgacacacg 840tcctgtcacc ggcgtgtcac tcaataatta tcagattttc attttaatta aataatgtta 900tcctatattt gttaggtgga tcccaacggg agggctttat ggtaattacc atttactcct 960gagaagcttt tatttttgtg tgtggagtag taggagaagt 1000751000DNAArabidopsis thaliana 75cagggtggtg aagcttctcc tgtttggcca accgatgcaa caatgcaatc aatctctact 60caaaccactc tcaaatgctt atcccggatg ctagaggaaa gcatactaac cgatgtaatc 120atccacacag ccgatggaac actctctgct cacaaagcta tcctctcagc tagctcaact 180gttttcaaaa gcatgttcca ccacgacctc atggaaaagg aatcatccac aatccacata 240gacgacatgt cgagagaatc ttgcatggct ctactaagtt acctctacgg aaacataacc 300caagaagagt tctggaaaca caggctcgcc ctcctcggcg ctgcaaacaa gtacgacata 360acggatttga aagcagcctg cgaggagagc ctaatggaag atataaactc gagtaatgtg 420ctcgagaggc tacaagaagc ttggctttat cagctggaga aactgaagaa agggtgtttg 480atgtatttgt ttgatttcgg aaagatttat gatgtcagag aagaaatcag cagtttcttc 540aggcaagctg atcgagaact gatgctggag atgtttcaag aggttttgtc tgtgtggaaa 600ccggtctaat atacacttac acacatactt tgaacatttc agcttgatgg tgttttgttt 660atagtcttct tgttctcttc tgtgtgtgta actaacaaca actctggatt ttgttgtatc 720actctgtcta atgtatagta ttgagtggat ttgcctctag tttggtgtac cgagcctctc 780aaatgggccg agcctttcaa atgggctgtg actaggccta atgactgtcg acatcgaatg 840tcggcacaaa atatgataat tttgtcgatg ttgaaatcgt acgttttttg ttttaattag 900atgagtcagc atcaggagga tgaagtatcg accgacacgt gtccgttgcg tcccaaagta 960aagcttataa cgtgaaggca tttgtatcgt ctcttccgcc 1000761000DNAArabidopsis thaliana 76ttctttattg gtaacttttt aatatattca tttggtcaaa gctggcagga aaaaagtttt 60caacttataa ttttcttatg aagaagatac tgatacaatg catattaagg tcaaagcaag 120agtctaacaa tttattattc aaggaatata aataattgtt aggtaggttc ttattgaaat 180gaagcatttc acaccgtaat aatgatatag ctgtagaata tgaacaatgc gcgtttgctt 240gactgtttcg ttgtaagttt gtaacaataa tggccaccaa gtaatctgtc actagaatat 300tgcaacaaaa taaaaaccat aaatataact ataatgatct actaatttat taatttaatt 360aacatgaaaa tatgacagct tttgccagcc cttgctcgta taatcagtcc agataacaca 420ttacaacgaa acatataaaa taactttttt caacagaaac gattataata tacatctaaa 480ataattaaca gttgaaaatt ttgatagaca tactatatat gaatatgaac ttaaataatg 540acccattttt cgtataatgt taattattta ctcgtaaacg cgttatttcc acgaaacatt 600aggcaaaact caagttaatt tacgcctggc attgtaacgc ggttaaccaa aaagcaaatt 660acgcagagtc aaatcatatc taaaaaccaa tataaacata acacgtgtca atacttaact 720gatctcagaa ttaacatcgt taagagaaaa cacgtggcag agatctgtgt atccgtttgg 780tgctccttca tgtagatgat tcttcaagaa aacttcaaaa actcaaacac gtcaagttta 840agaaagaaaa aagacaacaa ttattttaaa ccgccattga aaagctaagc catgttgtat 900ttttgtatgt ggttcgcatg attagtgtca caccaataat taattattaa ctatttccca 960accatcgcgt atatatagag ctctcttctc tcattgttct 1000771000DNAArabidopsis thaliana 77tacttacaag caattgtgaa agagactctt cgtttgcatc ctgcagctcc tttgatccct 60agaaaatcag aatccgatgt tcagattatg ggtttccttg ttcctaaaaa cacccaggtt 120tttctcatac ttatctttat tacaccaaca ttcattactt gagatattac accaataaaa 180aagtttggtt tggttaaaat cttccacctc tgtttctcga gattaggttg ttgtgaacgt 240atgggcgata ggacgagacg cgagcgtgtg ggaaaatcca atgaagttcg agccagagag 300gttcttgtta cgagaaaccg atgtaaaagg cagagatttc gagttgatac cgtttggatc 360aggaagaagg atgtgtccgg gaatctcgat ggctcttaag acaatgcata tggtgcttgc 420ctctcttctc tattcctttg actggaagct tcaaaacggt gtcgtccccg gaaacattga 480catgagcgag actttcggtc ttaccttaca caaggccaaa tctctttgtg ccgtacccgt 540caagaaacct acaatatcgt cttcttatta ataatcgtat caatataaag ttcgtgtacg 600gatcaatatt aataattgaa gagaatgaaa ttataaaaga tactgctttt tatgtttcaa 660gtaaaaatgt taatattaga aagtcaagaa agaggaaaca caatgcaatt tgataaggtg 720ttttatcgtc tccgggattg aattgcgtag gtcaaaaaaa gacatttgcc atttgatatc 780gacatataat cactcattca cggatacgta tctcttgtga atggtcgtcg gtcaattaca 840actttactgc tcggctttaa ctgacaagtg gcgcctcctg gtttcattca gaccctaccg 900gtaggatgta atttggacca taagtaagaa atatttgtga aatatttact acacgaatat 960ctttacgtat ttaagttggc ttttgaagct ttgataattc 1000781000DNAArabidopsis thaliana 78gataaaaaca gatactttta gcgaatactt ataacaacac tgtgttttgt ccaattcagg 60ggaggttcta cgatcaaccc aatgcaacaa ttctatatgc ttccgtttca taaaaattct 120gaatatcaat ctaaacctat acaattacat ccttactaat ctagataaag tgtatacaat 180aaaatgagag caataaggtt cattaatctt tatacttagc tcctccacag gacgatatac 240tttgagactg acacaaaaca aaataaggaa gaaagcttac gtttttgaag gaaaatcaca 300ggagggacgc ggaagcaaga ttggtctgag aggaaaattg aggaagaggg gttaatcgag 360atgaaaatgg agtttgaagt aatagagaga acgttgcgca gatcgaggaa gaacagagga 420taggatcaac acaaggatct tgtgaaaatg aaaatggctg aggaaatgaa ggactatttg 480ttaaataaga tataatatta tttctgaaat atttccatct gactccttta atttatacaa 540gcctcctttt ttgtacatct attttcagaa gatccaaata attgtttctt ctatttgtgt 600atttttgata tttaaacgta aatctttgga ttttgatcaa taatagacaa ttagccaagt 660tcagttttca ttaattagat ttataatttt aattattttt agcattcgtc taaaattaaa 720taatgcggca aaagaggaga aattaatttt tgttgttatt aattcaaagc gttacaaata 780aaaaaggaaa ttccacgctg gcaataaaat aagcgaaaat tccacgtgac atctacctgt 840cggatcaaaa agagtggaat tgacatttgt ttctctcaca ctctctctcg aattctctgg 900tagcttctag ttctatgcaa aaaaacgacg atagttctct atctttccag atgaatctcc 960ttccatatac aaaagcagtc atgcctcctc gctctctcgc 1000791000DNAArabidopsis thaliana 79aattcacatg tgatccggta tattaaaata tacatgattt atggatacac tcgaatactt 60ctccgatatt catttcgcgg aaagttgcat atgatccact atctcctata tatttgttat 120ataacacatg gcaaatctct ctatatatat tttttaaata cacatatatc caaaataggc 180ttaactataa cacaaagact tttgtttact gtatattagt aagaagtaaa tgtatttttt 240aatattatga taaagtttgt gaaatcacca tttgcaatag ccatataggg tcgtgtttta 300attttacagt ttgtattgtt ataattcgat tccaaggttg agaatatgtg tgtactatta 360gactatacaa ataataattc gttgacgata ttgaatattt actaattata ggaagagaaa 420attatttact aactatagta cgatatattt cttctatatg tgtttttaac gttttttttt 480ttttaaattt aagtcttaac tttacttctc atttttaatc aaaaggaaaa aaataccaat 540caatttttcc taacacagtt tacttatcat tttcatttga aatgtgttca ctttctgata 600aaatgctaat cctacaatca aatacaccat tgtcgtgata acacgtgtac ggctctaaag 660caatcagaac aatcattgga cagtttttac accgtcagat aagtacctat ccacttgctg 720actcagccgg ataaacccta aaccggaagt ttgccccacc gtcaaaattg gaagaaaccg 780gacaaaagag aatgtaaaga ctaagaagta agaacccatc ggacgtcgta agaaggttaa 840ttaacacgtg gaaacagctg gtcagagtta tccggtaact tatccggtta caagtaaaaa 900aataatttgt tcccatacac gactccttca gaaccaaacg cgacatcacg gcgccgttta 960gtgtctataa atagagcaat cggtcgtaga aaaccaagac 1000801000DNAArabidopsis thaliana 80aagagatttt gaaaggtcgt cgaggagtta gtttattatc ttacaattat tatggatttc 60ttctgactaa agttttagtg caatagtttt ctgagttatt atatattttt cgaaattcat 120ccaaaggcta aaactttgta aggtattctg agtctattac gctagaaata gacacgctta 180cgtttttaag tcattttaat ctgatatata tatatatata tatatactac aattttataa 240ccaattatct ttgtttcaga caatttatat aaattcatat aaactatcag aaataaagag 300acaattggct gtaaaattgt attattttgc atctatgaaa tttttatttc tctttatcaa 360aattgaagac gagttgaaat tctatttctt ttttaaaaaa atttatatat gggagtgtca 420aaaaataatt ttcctttgta agagaatatt cgtatttgga cgagtcttga tttgtgtttt 480ccgttagcgt tccacattag ttgtaaagag cgagaaagat ttaagtgata attaaaaaga 540caagatatgc attttgaata tactctagat attaatctta agcatttttc atattggtta 600tattttatgg tcgaattttg acatccgaaa aatacatccc ttattcttca atttatattg 660aatacttaag tttttgatag taaatatgta taaataaatt ttaaacaatt gtcaaaaaga 720cattataact tatacggtta tagttgttaa aaaaaaaact tatacggtta taccaataga 780taaactgaca ttttagattt gagttttttt tttttttact ttccatgtag atagcgtgga 840ataatacctc atttatatgt atcgttagtt gaaataattg acaaaaggaa actattcgtt 900tgctgttcta gataaaagcc tttaattgag aaaaagaaaa tgcctccagc tttgacccgt 960tttgcctata aaaggattct ctaagatatg gactttcctc 1000811000DNAArabidopsis thaliana 81aataatcctt cgttgccagc agtctaatgg gactgtgtta taggttaatc tctctttcac 60ggactattaa tttttggctc aacctcaatg taatttattt aaattatatt tcaacgcata 120cacgtgaatt tatacatctt ttcataggat ttgcaattcc ggattctcat ccgtcctagg 180aggttgtata tgtatattat tggtaactac agcattaaac aactttaaca ctaagtacga 240tgatattata tgtacatgta cgtatgctat agattgatat aagaaaaaag taaagagaga 300gcggatgatt gttgattgaa ctaattaaca atagtgtatt aggtaggctt gtatccgaca 360gtcaacttaa acttcatagt tgaaaccaat aaaaccagaa gaaaaaaaat ttatatacta 420atcataatca gcatgatgtt agacgatttg atatctaact ttcttcgtta tgatagagaa 480ataatattcg caataaatca catataaaaa aaattacttt acgtcataga ttatcaactt 540gcctccaaac acaacaacat gatcccctac gctacaatac ataatatgaa aaataaatta 600atcgttgatt aacttaaagt cactccgtta ataacgttat aagcaaagct tattagtttt 660tttgaccgtc attatcgacg accaaacaaa gaaagagtcc caacaaatta tgccacgtaa 720gtccagaacg acgcctatga aaccaataag acaaaaattc aaaaaccaaa acatccgtga 780aaactcttac gtggcggtga catgtgggtc ccttatgtct ccattaacta acataagctt 840attgtcgact gtaccttttg tcccccgtct cggctacaac cctgtgacaa acaaagatca 900agtcaaaata tctaacctaa agtactctcg tccacgtgga cattgctgac gtggaacttc 960tcccttctat aaatacaggc ttctcttcgc cgaattatac 1000821000DNAArabidopsis thaliana 82caagagcttg actcaacagc tcaacaaagg atgagatggg tgcagaggaa ctacatgatc 60tataattatt gtacggatgc gaagaggttc cctcaaggtc ttcctaaaga gtgcttagct 120gcatagagag agtaaagagt tgagagagga acaagatttt atttttcttt gtggttataa 180aattctattc attttattgt agatcacgtg aattttattg atttgttttg tagtatactc 240tatagttcgt taaagttata atattctctt tgttacaatg tgcttttttg tttttttaag 300tctctcgtag ttcgttccgt tatattctcc gagtcagtcc aaaaatggcc gcttctgttt 360taattttgtt ctagggttcc acagtcaaac tcaaaaccaa acccaatgga gcaataacct 420tttttttagt tttataatcg aaatcaaacc ggaattttgc atgtaatttg attggtgtcg 480ttactttaaa tctttaatcc acaaaacaaa atttactcga ttttagtatt aaccgaacca 540attatagttt attgaaattt aattttaatt ctatcaaatt gcatatgtat tcttgagtta 600ttttttataa aaatactgaa accaactaaa ataatagagt ttggcggaac taccgtacca 660aatttgattg tatttggagt atcatttttg caaacctaat tagcctgaag actgagatat 720ccttgtccac tcttatgaag aaccaattta acaaggtgaa aaccagaatc tctaaaccaa 780acatggcatc aactgaaccg gatcaggcag acttaaacca aaacaaagaa caagcacacg 840tagcatgagg caaaattaag cacatgcttg ctttacttca aaacaaaaac cagctgttca 900cagctaaaac tacacaagag tcacaaacgg cgaactatac tacaaaaaga ctaagacttg 960cctcccttat ataaaacccc ccaacacata aggtcccaat 1000831000DNAArabidopsis thaliana 83tctacgacta ttatacattc atactttgaa tttggattac aaaaaaaaat cttcgagtat 60gagaaaacta ttaaaatagt tttattgaaa aatcatataa aataaaaata aatttagtaa 120tagtataatt aaatatgtaa aataaaatta aataagaaat atatcattaa cctataaatt 180agaagttgaa gagcttacat taattattta atagacacat gtcaaatgct aaagtgatga 240tgtgtcaatc atatgaagag agttggccaa ctttcatata tatgattctt agtttcgtac 300caattagttt ctcaattcag tgcatgtgta gtcaaatgtc caaaagattg tttctaagca 360attaaaatct tatcaaaact tcattgccca aaatattacg aaagcttgtt tctgtgtatg 420atatacccgg taaaattaaa atgatcatta tacagaacaa atcagcatga ttttcggata 480aagatgacac acatttgaaa tcgtagccgt actacgcgaa atacatgcac tcttcgttat 540gttaacactt taacagtgaa cgtagccata atgttgacca cattcaacag tcaacacaaa 600cattacttta cacacaaata tatgattata tatacatatg tacatgtaag tgaatgtgag 660caataatgac gggaatattc agagaagacg atggtgaatg ttagcagtga tcacgggcac 720attcaaaact gactgtggac aaaaaaagct cctggcctta aatatgattg tgccaaaaat 780agtacaaaac taagaaccca aaatggaatt cgagacctat ataataatat atatgtatat 840agtctttcct tggaaagaaa tcttatgtta ttaagaaaaa tactataagt tatctctcta 900tctagatatg atatatatgt ccaaacattt ccacgtagat gacgtatatt accgaggata 960atcctctata taaggaagag aagctcgagt aataaatctc 1000841000DNAArabidopsis thaliana 84gtgtatattt tgtcatgcaa actatatcgt ggagaaaata atgttgctta tgacttttga 60tagttgggct tacatttgga taatggatag ggtagacaaa gataggagga aagcaataat 120agcgaaatga agaacgaata tttggggaaa taggacaaat gaatatactt ctctttgaaa 180tggagattca cctaaattat taatactaaa gccatgcaat gcatccaaac aaatcagtgg 240tcaagcacac tcaattatat gtccacgaag acctttagaa tcttcacaac caaaagctat 300tttctacgct acctgataat tctgactcaa ttcttcttca taaaacgtat aatgaagctt 360tatgaatgat taattataga cacaaccggc cctatctgcg atttctacaa acaatagaac 420acaaaacttt aaaagttact acaaaatacc gaattgacta tatatatcat attatcagta 480taaacatgat tagattgatc atgtttatca gtaatcatga aagacaaaga gtgtgactat 540tgtaaaccaa attttagaat aaaataaata atttatcata ctatatacag tattttgtta 600agtatatgtc atccaatagt aacattatca tttaaactga aaaatgtttc agctacttta 660aggaattata gctttattaa aagtatatac ttttaggtca cgtgtttaga ggtgaagaac 720aataataatt actcaataag ttcaccagtc acactccaac atcttattca aattcctttt 780aaaagctttt taaccgtggc tgtttgatga ccatttgaca aaatttagta tattagaaaa 840aaacaatagg atagggataa tataggacat tagactatta gatggacaaa atgaagtatt 900atttaatttt ccaatgtacc aaccaataag aaagaagtga cgcacagtaa acgacaaaaa 960gctcaagcat aaaaacccaa accttctctg ctttctaaac 1000851000DNAArabidopsis thaliana 85gtaatcttaa accattcatt aattatgcga ataaaacgta aaaaacgtta tcaggtttca 60actttattta ggctgcaatg ggaactatgg atggcataat tgatacagtt tctgcatcgc 120attcgatttc acctttgatt ggactactca aatccaacgg taaacttgtt ctactcggtg 180caacggagaa gccatttgat atatctgcat tttccttaat attgggtatg catatataac 240cctacgtaat agaacgttat attagtcttt cacatctaga ctatgtatcg agtatgatga 300aaccatgatg cgacaggacg aaaatcgata gcgggaagtg gtattggagg aatgcaagag 360acacaagaga tgattgactt tgcggcagag catggcataa aagcagaaat tgagatcata 420tccatggatt atgtgaacac cgccatggat agacttgcca aaggggatgt tagatatcga 480tttgtcattg acatttctaa tacattggct gctactcgat cttaattaaa gtcgatgttc 540tatatgtatt caaaataatc tggatttcaa tcccacaaaa cttaaggata tatatatata 600tatatatata gtctatttta tataaatgga gtatagtcaa ataaatatgc attatcaacg 660atatatagtc ttctattaca tagatacgtg ggagttcacc caacgtagat acgttcggtt 720gaaacaagtc aatttcatca atgcctcttc caaaaaaaaa acaaattgca ttattgatga 780acacatgcat cattatcaaa taggttggtt aaaatgacca agatgactaa agccaatcac 840actactacca gatcgagtaa ccattaggga ccattaattc acgtggacgt agtgaatatg 900gtccttgtga attaatgagt acgtaattgt cctcattcat atatggatcg gttccacaaa 960catttcctgt ataaaattct acatctttcc tctcattatt 1000861000DNAArabidopsis thaliana 86cagcttacca catgcggcca caattttaac catgattaat cttcttttgg aatttagcga 60gataaatcaa gagtaggttt attttagaac atagtatata atacactaat gaaaagtctc 120gttaaaacgt gtttcacgta accgacctct aaagtatggc gtgcgagtaa cgacaaatga 180aatctcaaat gaaaatggag attattttaa tcttaattat aacacactaa ttaacacccg 240tcaaacatcc tgatatgtga acgtggaatc tcgttacaca atcgaataaa attattgctt 300aatacgatcc gtctttctct ctctttcgta atgattatct atcaatatta tcttgactaa 360taaaagtatc gcaaggcctt ggcagtcaca tgttaaatga tgatatacgt tgttgcatgt 420taaaacatag tacaacgatc atctaggctg ctgctaaatt ctattttcaa aaatgccttt 480tatatgcaag aaaagcaaaa taatcgagtt tttttttttg tttcattata aactgcttca 540tttttcttag gaacggccaa actgttaaaa agtaaaatat gtatggtgat taattgatgt 600aaccaaagcc agttcgcccg tttgattgtc aacccacgtt atcattcact tgatcacttc 660catgaaacat ataaaagctg ataatactta tattataaag gaaaaaaagt atgaaaatat 720tatcagtagt tagatgatta gttcacatct aaatgaaata cgacttaaac tgaaagagac 780atgagccaat tcgtgtcgag ccacaaattt tggacgtata tttttagccc gtggttccac 840aatatttgca ggtcttttta ttcaatgagt ttattttgtc ttggttgaat aatgaaattt 900ccaaatataa aaaataatag aaatccgagg ccctacacaa gcacacatag taactcccac 960attatatata agcggccaat atcagcaact cagagattcc 1000871000DNAArabidopsis thaliana 87gagataggac atagtattca tcactttaaa

acactcttat tgataaatcc aagagtcact 60taactttttc tggtaatcgc tgttaaagac tccaactcaa aaacaatcac attattccaa 120gtcttaacat tctgatacga ttcattacaa gaaactgagt ttatctaaga ttttcttctc 180acttgtgcat taacatatat acacacaaca taacacagat atgatacaat ttcacttctt 240tgaagaacgc ttgggagtaa cagcattaaa gatagcttca atctcttctt tcagtcgttg 300aaagaaacca ggtcctttca cataatcaat tggagtgctc tcatcaatgt cgtcacttgt 360tccatgagtc tctttgtcat ggtgacgact cttggacttt tcatgatgac ccattgcctc 420tatttcttct ttaactcgtt caaacacatt tggcgatttc acagtcttct cctctgtgga 480aatacccaat taaaacaaag ggtttagatt cgtgtttcat aagagacaaa acagagcaac 540gaatcaaaaa tagacttatc atggcgtgtt tctattgttt ctttagagaa atttccgatc 600aaaaccatcc attaatagac gtgtaatttt gattgtgaaa tttgaggaaa cgaacataac 660ttgatcaaaa agtcttcctt tttaacaata tgcataaacc tatgatgaga tctgatcgtt 720cattgtgttt acctgatcgg ttttgatccg ccatcgatgg tttttgtctt cttgatcttg 780cagctttcta tttttggttc ctgattcggt ggcgttttgc gtcagatgca aaagagtctt 840tacagtataa atcaaaatcg atttcaactt aaatgggcct tgttgggtta taatgggcct 900ttttgtttgt aagtgagtcc aaggaaaaaa aacaccagac cagattgaaa ttgaatcact 960tgcaaaagca aaattgatta ctctgttttg gttttggatc 1000881000DNAArabidopsis thaliana 88aaagtaaagc tcaaagactg tcctcactta tagtttatga atcagcaaaa gaggaactta 60ccggcaaagc aactaaatca taaacttgat cagcacattt cgagataaga acatcagcaa 120gcaatctggt ccctgaggat ccttcaacct ctagcttctg ctccacagaa gcaacagtaa 180catcagcacc agctcgccgt aaaacatcaa ctaacacaac agcttctatt tcctccgtcc 240catatccaat tggaacaagc acctacatag tttctcagtt atctccaatc tcaatctctc 300tacacaaaca agctggacaa ttttagttat agataacaag aaaaaaggag aaaccttttt 360agtagttgca gaagacccga cgccaacatc ggaatctagc gttgttacca tacccggaga 420catggaagat ctaagcttta aagttcggtc cctccttgtt gtccctagtg aagaagaaac 480tacagagact gaagaaaaac taggagcaac agtcatcgag acacaaccca tcgaagagat 540aagccttgac tccatcaaag ttggactcag cgaagctatc attgaaatcg aatatcctaa 600agaccccatt gacgatagct aatgcagcag tctgataatc ccgccgctag acggagagtg 660agttgcgtct tatctactag tatacacttc ttgaagaaag ctaaaaacgt cgtcgtatcg 720ttgtcttatt gcttgattgg ataaagcttc aattttttcc tccaattttt gtcgaaaatg 780taacattttt tttgctttcc actgtgagat tgtcacacac gattctctaa agaaatgaga 840cgaattttga aaacgattta taaaaatagt atttttgttt ccttgagctt atctcctctt 900ttctctttgg tttgagatat gaattacaga caatcaagta gtcagtcaca acagtgtgta 960taaatatgtc ttttaagtga aatcggtttc gagagacagt 1000891000DNAArabidopsis thaliana 89ttgtgtttat ctcaatgttt agatgttgaa aagtatatat gctgattaat aaataacaat 60ttcttagtca acaactcaat gatagaatgt tatataactc tcttacatca tttacatttt 120taaattgaat gaattctcat actatgcaat ctacctggtt agggtgaagt cacaacaaca 180acaacattaa ctaataacaa caatttaagc tgcatgtcct aatcagaatc tcggttattg 240gtcgctcata agaacagtta gaggtctttc agtcacaatt gagagacgac atacataatt 300ggatgggaat caaaatcagt ttaattgcgc tatacgcaac ttgaaacgtc attatacacc 360gataagcttt aaaaataaaa tgttgtgatg gacacgcgca ctatatatgt atgtgtttgg 420acatgattga agctaaacaa gataatattt attaagaaga aaatacataa ttattactaa 480ataatgtgta ttattatacg attccaactt ttatttgata atgatttttg agcaaatcag 540gttgataggt atgattgaga ccatcgccca aactacacct taaggtctta ctattagaaa 600caaacaattc atttgggaat tgaaaccaaa gtcatctatg gaagtaacct caaaatttat 660ctaagaaaca gtgatgtatt agaatggtca agtcaactta ttcacaaaat tacaaaacta 720tcgtttataa atatataatc tatttttcac ttgtgttaca cgattcctct tttgatatgc 780aggaagaaga gaagatctta caacggctag aaatagaaaa cttaatgtgc tattaattac 840aacttaccaa ctaattagaa ctttctatat gaaccggatc ttctacagca aatcacggtc 900aattcaaaat attcatctct ctctctctct ccacatggac aagacatgat taagaaagta 960tataaaagaa acagagcaga gaaagacatt ttgttccggt 1000901000DNAArabidopsis thaliana 90gtgagcagat aaggatcgtg tattttcaag tattatatta attaataaat tatcgcccat 60cttctattgt tttgtatttt catttgaaaa gaaagcaggg ctggcttatc aatatataca 120gtctctttct tttagtttac taaaatcatt tcagataaga cccacacaaa agtaatatca 180aatggaattt tgctttcaac atggataatt ttctttaata ataaaattgt caaccatgtt 240aagcgtaatc tctaaatccc caaaattaaa aacactaaaa atagtcggtc attgctccac 300gtttaacctg tccaattcca aaagaacttc gtgtttcatt gacgatatag tttacgtacg 360tatgattctt tctaaatgag gtatttttgt cccattagca atttaaaatt taaaatggtt 420gtgtaacttt ttgaaatgct tagtagttaa aaccgaacta agaagaaata atgatcggaa 480cgtgtttatg tcagctaatg cattacccac tgacattaac tgatccagtt tactgatttt 540agacaatcat actctatttt tactggagtt gcagaaaaag ggacttacaa aacgatgaat 600taaatttgga caagtatata tacataatta gaaggttata gaatctagtt ttcgtaggat 660atttcttgaa ttatcctacg agtgtgacaa ctaaatgcac aacaaaacaa caatttaaat 720gctaaaaagc aaaaatagta attactggtc acctattaag gaacaacacc aacacctatt 780tagctgttat gtcctaataa aaataaaata aaataaaaac aataaaccaa aaaagtaaaa 840aataaaaatg tttgactaaa attctctttc ttttccgatt cgtcattcga tgaatcttcc 900tttcttatac aatttggata atcttttcta atgtaaaaat tcgagatttt tttttataat 960gttaaatcta taaataagag caacctaaaa cgaacaactt 10009123DNAArtificial SequenceDescription of Artificial Sequenceprimer 91cgccagggtt ttcccagtca cga 239224DNAArtificial SequenceDescription of Artificial Sequenceprimer 92agcggataac aatttcacac agga 2493588DNAArabidopsis thalianaCDS(1)..(585) 93atg gtg aag caa gcg atg aag gaa gag gag aag aag aga aac acg gcg 48Met Val Lys Gln Ala Met Lys Glu Glu Glu Lys Lys Arg Asn Thr Ala 1 5 10 15atg cag tca aag tac aaa gga gtg agg aag agg aaa tgg gga aaa tgg 96Met Gln Ser Lys Tyr Lys Gly Val Arg Lys Arg Lys Trp Gly Lys Trp 20 25 30gta tcg gag atc aga ctt cca cac agc aga gaa cga att tgg tta ggc 144Val Ser Glu Ile Arg Leu Pro His Ser Arg Glu Arg Ile Trp Leu Gly 35 40 45tct tac gac act ccc gag aag gcg gcg cgt gct ttc gac gcc gct caa 192Ser Tyr Asp Thr Pro Glu Lys Ala Ala Arg Ala Phe Asp Ala Ala Gln 50 55 60ttt tgt ctc cgc ggc ggc gat gct aat ttc aat ttc cct aat aat cca 240Phe Cys Leu Arg Gly Gly Asp Ala Asn Phe Asn Phe Pro Asn Asn Pro 65 70 75 80ccg tcg atc tcc gta gaa aag tcg ttg acg cct ccg gag att cag gaa 288Pro Ser Ile Ser Val Glu Lys Ser Leu Thr Pro Pro Glu Ile Gln Glu 85 90 95gct gct gct aga ttc gct aac aca ttc caa gac att gtc aag gga gaa 336Ala Ala Ala Arg Phe Ala Asn Thr Phe Gln Asp Ile Val Lys Gly Glu 100 105 110gaa gaa tcg ggt tta gta ccc gga tcc gag atc cga cca gag tct cct 384Glu Glu Ser Gly Leu Val Pro Gly Ser Glu Ile Arg Pro Glu Ser Pro 115 120 125tct aca tct gca tct gtt gct aca tcg acg gtg gat tat gat ttt tcg 432Ser Thr Ser Ala Ser Val Ala Thr Ser Thr Val Asp Tyr Asp Phe Ser 130 135 140ttt ttg gat ttg ctt ccg atg aat ttc ggg ttt gat tcc ttc tcc gac 480Phe Leu Asp Leu Leu Pro Met Asn Phe Gly Phe Asp Ser Phe Ser Asp145 150 155 160gac ttc tct ggc ttc tcc ggt ggt gat cga ttt aca gag att tta ccc 528Asp Phe Ser Gly Phe Ser Gly Gly Asp Arg Phe Thr Glu Ile Leu Pro 165 170 175atc gaa gat tac gga gga gag agt tta tta gat gaa tct ttg att ctt 576Ile Glu Asp Tyr Gly Gly Glu Ser Leu Leu Asp Glu Ser Leu Ile Leu 180 185 190tgg gat ttt tga 588Trp Asp Phe 19594195PRTArabidopsis thaliana 94Met Val Lys Gln Ala Met Lys Glu Glu Glu Lys Lys Arg Asn Thr Ala 1 5 10 15Met Gln Ser Lys Tyr Lys Gly Val Arg Lys Arg Lys Trp Gly Lys Trp 20 25 30Val Ser Glu Ile Arg Leu Pro His Ser Arg Glu Arg Ile Trp Leu Gly 35 40 45Ser Tyr Asp Thr Pro Glu Lys Ala Ala Arg Ala Phe Asp Ala Ala Gln 50 55 60Phe Cys Leu Arg Gly Gly Asp Ala Asn Phe Asn Phe Pro Asn Asn Pro 65 70 75 80Pro Ser Ile Ser Val Glu Lys Ser Leu Thr Pro Pro Glu Ile Gln Glu 85 90 95Ala Ala Ala Arg Phe Ala Asn Thr Phe Gln Asp Ile Val Lys Gly Glu 100 105 110Glu Glu Ser Gly Leu Val Pro Gly Ser Glu Ile Arg Pro Glu Ser Pro 115 120 125Ser Thr Ser Ala Ser Val Ala Thr Ser Thr Val Asp Tyr Asp Phe Ser 130 135 140Phe Leu Asp Leu Leu Pro Met Asn Phe Gly Phe Asp Ser Phe Ser Asp145 150 155 160Asp Phe Ser Gly Phe Ser Gly Gly Asp Arg Phe Thr Glu Ile Leu Pro 165 170 175Ile Glu Asp Tyr Gly Gly Glu Ser Leu Leu Asp Glu Ser Leu Ile Leu 180 185 190Trp Asp Phe 195951290DNAArabidopsis thalianaCDS(81)..(1088) 95aacaaatctc tctgtttctc ccgctcttgc tctgttttct caaagacaaa agaggacatc 60gtcgttgact cctcttctct atg gct act gct aag aac aag gga aaa tca atc 113 Met Ala Thr Ala Lys Asn Lys Gly Lys Ser Ile 1 5 10agg gtc ctt ggt acc agt gaa gca gag aaa aag gat gag atg gag ttg 161Arg Val Leu Gly Thr Ser Glu Ala Glu Lys Lys Asp Glu Met Glu Leu 15 20 25gag gag gag ttc cag ttt agt agc ggc aag tat aaa gat tcg ggt cct 209Glu Glu Glu Phe Gln Phe Ser Ser Gly Lys Tyr Lys Asp Ser Gly Pro 30 35 40ggc tcg gac atg tgg tta gga gat gct tcc tct acg tct cca aga agt 257Gly Ser Asp Met Trp Leu Gly Asp Ala Ser Ser Thr Ser Pro Arg Ser 45 50 55ctt agg aag act aga acc ttt gac cga cat aat ccc tat ctc gta tct 305Leu Arg Lys Thr Arg Thr Phe Asp Arg His Asn Pro Tyr Leu Val Ser 60 65 70 75tct tat gct act cct cag ccg cca aca aca act aca tgc tct gtc tct 353Ser Tyr Ala Thr Pro Gln Pro Pro Thr Thr Thr Thr Cys Ser Val Ser 80 85 90ttt ccc ttt tac ctc cct cca gcg att caa aat caa caa cga ttt tta 401Phe Pro Phe Tyr Leu Pro Pro Ala Ile Gln Asn Gln Gln Arg Phe Leu 95 100 105cac ccg aat gac cct tca gga caa aga cag caa caa atg atc tcg ttt 449His Pro Asn Asp Pro Ser Gly Gln Arg Gln Gln Gln Met Ile Ser Phe 110 115 120gat cct caa caa cag gtg caa cca tat gtt gca caa cag cag caa caa 497Asp Pro Gln Gln Gln Val Gln Pro Tyr Val Ala Gln Gln Gln Gln Gln 125 130 135caa caa cat cta ttg cag tac tgg aga gac att ctg aag ctg agt ccg 545Gln Gln His Leu Leu Gln Tyr Trp Arg Asp Ile Leu Lys Leu Ser Pro140 145 150 155agc gga aga atg atg atg atg aac atg tta aga caa gaa agc gat ctg 593Ser Gly Arg Met Met Met Met Asn Met Leu Arg Gln Glu Ser Asp Leu 160 165 170cca ctg acg agg cca ccg gtt caa ccc ttc agc gcc acc aag cta tat 641Pro Leu Thr Arg Pro Pro Val Gln Pro Phe Ser Ala Thr Lys Leu Tyr 175 180 185aga ggt gtc agg caa cgc cac tgg gga aaa tgg gtt gcc gag atc cgt 689Arg Gly Val Arg Gln Arg His Trp Gly Lys Trp Val Ala Glu Ile Arg 190 195 200aag cca cga aac agg aca cgt ctc tgg cta ggg aca ttc gat aca gca 737Lys Pro Arg Asn Arg Thr Arg Leu Trp Leu Gly Thr Phe Asp Thr Ala 205 210 215gaa gaa gcc gcc atg gcc tac gac cgc gag gcc ttc aag ttg agg gga 785Glu Glu Ala Ala Met Ala Tyr Asp Arg Glu Ala Phe Lys Leu Arg Gly220 225 230 235gag acc gct agg ctc aat ttc cct gaa ctt ttt ctc aat aaa caa gag 833Glu Thr Ala Arg Leu Asn Phe Pro Glu Leu Phe Leu Asn Lys Gln Glu 240 245 250cca act ccc gtg cat cag aaa caa tgt gag acg ggg act act agt gaa 881Pro Thr Pro Val His Gln Lys Gln Cys Glu Thr Gly Thr Thr Ser Glu 255 260 265gac tca agc aga aga gga gag gat gat tcg agc acg gca ttg gca gta 929Asp Ser Ser Arg Arg Gly Glu Asp Asp Ser Ser Thr Ala Leu Ala Val 270 275 280gga ggg gtg agt gag gag acg ggt tgg gct gag gca tgg ttc aat gca 977Gly Gly Val Ser Glu Glu Thr Gly Trp Ala Glu Ala Trp Phe Asn Ala 285 290 295att cca gag gaa tgg gga cct gga agc cct cta tgg gat gat tac cac 1025Ile Pro Glu Glu Trp Gly Pro Gly Ser Pro Leu Trp Asp Asp Tyr His300 305 310 315ttt ccc att tct aac cat aag gac gat ctt gac gcc aca caa aac tct 1073Phe Pro Ile Ser Asn His Lys Asp Asp Leu Asp Ala Thr Gln Asn Ser 320 325 330tct tct gat aca att taggaccttt gttagaatat agatatgctt agttgtatga 1128Ser Ser Asp Thr Ile 335ctgatctagc ttgtgttttt tttttgggtg gagacagttt ttgtcatctt ccacatttta 1188gattctattt tcgaccatca tttttttctt gatcggtgac tatgaatcta atggggtcaa 1248tcattttcac atataaaact taagtatttg gtgtttgtac tt 129096336PRTArabidopsis thaliana 96Met Ala Thr Ala Lys Asn Lys Gly Lys Ser Ile Arg Val Leu Gly Thr 1 5 10 15Ser Glu Ala Glu Lys Lys Asp Glu Met Glu Leu Glu Glu Glu Phe Gln 20 25 30Phe Ser Ser Gly Lys Tyr Lys Asp Ser Gly Pro Gly Ser Asp Met Trp 35 40 45Leu Gly Asp Ala Ser Ser Thr Ser Pro Arg Ser Leu Arg Lys Thr Arg 50 55 60Thr Phe Asp Arg His Asn Pro Tyr Leu Val Ser Ser Tyr Ala Thr Pro 65 70 75 80Gln Pro Pro Thr Thr Thr Thr Cys Ser Val Ser Phe Pro Phe Tyr Leu 85 90 95Pro Pro Ala Ile Gln Asn Gln Gln Arg Phe Leu His Pro Asn Asp Pro 100 105 110Ser Gly Gln Arg Gln Gln Gln Met Ile Ser Phe Asp Pro Gln Gln Gln 115 120 125Val Gln Pro Tyr Val Ala Gln Gln Gln Gln Gln Gln Gln His Leu Leu 130 135 140Gln Tyr Trp Arg Asp Ile Leu Lys Leu Ser Pro Ser Gly Arg Met Met145 150 155 160Met Met Asn Met Leu Arg Gln Glu Ser Asp Leu Pro Leu Thr Arg Pro 165 170 175Pro Val Gln Pro Phe Ser Ala Thr Lys Leu Tyr Arg Gly Val Arg Gln 180 185 190Arg His Trp Gly Lys Trp Val Ala Glu Ile Arg Lys Pro Arg Asn Arg 195 200 205Thr Arg Leu Trp Leu Gly Thr Phe Asp Thr Ala Glu Glu Ala Ala Met 210 215 220Ala Tyr Asp Arg Glu Ala Phe Lys Leu Arg Gly Glu Thr Ala Arg Leu225 230 235 240Asn Phe Pro Glu Leu Phe Leu Asn Lys Gln Glu Pro Thr Pro Val His 245 250 255Gln Lys Gln Cys Glu Thr Gly Thr Thr Ser Glu Asp Ser Ser Arg Arg 260 265 270Gly Glu Asp Asp Ser Ser Thr Ala Leu Ala Val Gly Gly Val Ser Glu 275 280 285Glu Thr Gly Trp Ala Glu Ala Trp Phe Asn Ala Ile Pro Glu Glu Trp 290 295 300Gly Pro Gly Ser Pro Leu Trp Asp Asp Tyr His Phe Pro Ile Ser Asn305 310 315 320His Lys Asp Asp Leu Asp Ala Thr Gln Asn Ser Ser Ser Asp Thr Ile 325 330 33597879DNAArabidopsis thalianaCDS(1)..(876) 97atg gac ttt gac gag gag cta aat ctt tgt att acg aaa ggt aaa aat 48Met Asp Phe Asp Glu Glu Leu Asn Leu Cys Ile Thr Lys Gly Lys Asn 1 5 10 15gtt gat cat tct ttt gga gga gaa gct tct tcc acg tcc cca aga tct 96Val Asp His Ser Phe Gly Gly Glu Ala Ser Ser Thr Ser Pro Arg Ser 20 25 30atg aag aaa atg aag agt cct agt cgt cct aaa ccc tat ttc caa tcc 144Met Lys Lys Met Lys Ser Pro Ser Arg Pro Lys Pro Tyr Phe Gln Ser 35 40 45tct tct tct cct tat tcg tta gag gct ttc cct ttt tct ctc gat cca 192Ser Ser Ser Pro Tyr Ser Leu Glu Ala Phe Pro Phe Ser Leu Asp Pro 50 55 60aca ctt cag aat cag caa caa caa ctc gga tca tac gtt ccg gta ctt 240Thr Leu Gln Asn Gln Gln Gln Gln Leu Gly Ser Tyr Val Pro Val Leu 65 70 75 80gag caa cga caa gac ccg aca atg caa ggc cag aag caa atg atc tcc 288Glu Gln Arg Gln Asp Pro Thr Met Gln Gly Gln Lys Gln Met Ile Ser 85 90 95ttt agt cct caa caa caa caa cag cag cag cag tat atg gcc cag tac 336Phe Ser Pro Gln Gln Gln Gln Gln Gln Gln Gln Tyr Met Ala Gln Tyr 100 105 110tgg agt gac aca ttg aat ctg agt cca aga gga aga atg atg atg atg 384Trp Ser Asp Thr Leu Asn Leu Ser Pro Arg Gly Arg Met Met Met Met 115 120 125atg agc caa gaa gct gtt caa cct tac atc gca acg aag ctg tac aga 432Met Ser Gln Glu Ala Val Gln Pro Tyr Ile Ala Thr Lys Leu Tyr Arg 130 135 140gga gtg aga caa cgt caa tgg gga aaa tgg gtc gca gag atc cgt aag 480Gly Val Arg Gln Arg Gln Trp Gly Lys Trp Val Ala Glu Ile Arg Lys145 150 155 160cca cga agc agg gca cgt ctt tgg ctt ggt acc ttt gat aca gct gaa 528Pro Arg Ser Arg Ala Arg Leu Trp Leu Gly Thr Phe Asp Thr Ala Glu 165

170 175gaa gct gcc atg gcc tac gac cgc caa gcc ttc aaa tta cga ggc cac 576Glu Ala Ala Met Ala Tyr Asp Arg Gln Ala Phe Lys Leu Arg Gly His 180 185 190agc gca aca ctg aat ttc ccg gag cat ttt gtg aat aag gaa agc gag 624Ser Ala Thr Leu Asn Phe Pro Glu His Phe Val Asn Lys Glu Ser Glu 195 200 205ctg cat gat tca aac tcg tcg gat cag aaa gaa cct gaa acg cca cag 672Leu His Asp Ser Asn Ser Ser Asp Gln Lys Glu Pro Glu Thr Pro Gln 210 215 220cca agc gag gtt aac ttg gag agc aag gaa cta ccg gtg att gat gtt 720Pro Ser Glu Val Asn Leu Glu Ser Lys Glu Leu Pro Val Ile Asp Val225 230 235 240ggg aga gag gaa ggt atg gct gag gca tgg tac aat gcc att aca tcg 768Gly Arg Glu Glu Gly Met Ala Glu Ala Trp Tyr Asn Ala Ile Thr Ser 245 250 255gga tgg ggt cct gaa agt cct ctt tgg gat gat ttg gat agt tct cat 816Gly Trp Gly Pro Glu Ser Pro Leu Trp Asp Asp Leu Asp Ser Ser His 260 265 270cag ttt tca tca gaa agc tca tct tct tct cct ctc tct tgt cct atg 864Gln Phe Ser Ser Glu Ser Ser Ser Ser Ser Pro Leu Ser Cys Pro Met 275 280 285agg cct ttc ttt tga 879Arg Pro Phe Phe 29098292PRTArabidopsis thaliana 98Met Asp Phe Asp Glu Glu Leu Asn Leu Cys Ile Thr Lys Gly Lys Asn 1 5 10 15Val Asp His Ser Phe Gly Gly Glu Ala Ser Ser Thr Ser Pro Arg Ser 20 25 30Met Lys Lys Met Lys Ser Pro Ser Arg Pro Lys Pro Tyr Phe Gln Ser 35 40 45Ser Ser Ser Pro Tyr Ser Leu Glu Ala Phe Pro Phe Ser Leu Asp Pro 50 55 60Thr Leu Gln Asn Gln Gln Gln Gln Leu Gly Ser Tyr Val Pro Val Leu 65 70 75 80Glu Gln Arg Gln Asp Pro Thr Met Gln Gly Gln Lys Gln Met Ile Ser 85 90 95Phe Ser Pro Gln Gln Gln Gln Gln Gln Gln Gln Tyr Met Ala Gln Tyr 100 105 110Trp Ser Asp Thr Leu Asn Leu Ser Pro Arg Gly Arg Met Met Met Met 115 120 125Met Ser Gln Glu Ala Val Gln Pro Tyr Ile Ala Thr Lys Leu Tyr Arg 130 135 140Gly Val Arg Gln Arg Gln Trp Gly Lys Trp Val Ala Glu Ile Arg Lys145 150 155 160Pro Arg Ser Arg Ala Arg Leu Trp Leu Gly Thr Phe Asp Thr Ala Glu 165 170 175Glu Ala Ala Met Ala Tyr Asp Arg Gln Ala Phe Lys Leu Arg Gly His 180 185 190Ser Ala Thr Leu Asn Phe Pro Glu His Phe Val Asn Lys Glu Ser Glu 195 200 205Leu His Asp Ser Asn Ser Ser Asp Gln Lys Glu Pro Glu Thr Pro Gln 210 215 220Pro Ser Glu Val Asn Leu Glu Ser Lys Glu Leu Pro Val Ile Asp Val225 230 235 240Gly Arg Glu Glu Gly Met Ala Glu Ala Trp Tyr Asn Ala Ile Thr Ser 245 250 255Gly Trp Gly Pro Glu Ser Pro Leu Trp Asp Asp Leu Asp Ser Ser His 260 265 270Gln Phe Ser Ser Glu Ser Ser Ser Ser Ser Pro Leu Ser Cys Pro Met 275 280 285Arg Pro Phe Phe 290991499DNAArabidopsis thalianaCDS(644)..(1222) 99tggtatcggt gaggctgaga gttattcact tacaaaaaaa aaaaaaaact tgagtgtaac 60caaaaaaaaa aagttgatat actttctggt tttctcctta acttttattc tttacaaatc 120catccccctt agatctgttt atttcccgct actttgattc atttctgtta gtaatctgtc 180tttcgtatag aagaaaactg atttcttggt ttgtattttc ttaaagagat caatcttttt 240ttatttttga tcttcttgtg tttttttttc tttgtagaat taatcgtttg tgagggtatt 300tttttaattc cctcctctca gaaatctaca cagaggtttt ttattttata aacctctttt 360ttcgattttc ttgaaaacaa aaaatcctgt tctttacttt ttttacaaga acaagggaaa 420aaaatttctt tttattagaa atgacaactt ctatggattt ttacagtaac aaaacgtttc 480aacaatctga tccattcggt ggtgaattaa tggaagcgct ttacctttta tcaaaagccc 540ttccaacgat tcatccgcgt ttgcgttctc tctacccgct ccaatttcat acgggtcgga 600tctccactca ttttctcacc atcttagtcc taaaccggtc tca atg aaa caa acc 655 Met Lys Gln Thr 1ggt act tcc gcg gct aaa ccg acg aag cta tac aga gga gtg aga caa 703Gly Thr Ser Ala Ala Lys Pro Thr Lys Leu Tyr Arg Gly Val Arg Gln 5 10 15 20cgt cac tgg gga aaa tgg gtg gct gag att cgt tta ccg agg aat cga 751Arg His Trp Gly Lys Trp Val Ala Glu Ile Arg Leu Pro Arg Asn Arg 25 30 35act cga ctt tgg ctc gga aca ttc gac acg gcg gag gaa gct gct tta 799Thr Arg Leu Trp Leu Gly Thr Phe Asp Thr Ala Glu Glu Ala Ala Leu 40 45 50gct tat gac aag gcg gcg tat aag ctc cga gga gat ttt gcg cgg ctt 847Ala Tyr Asp Lys Ala Ala Tyr Lys Leu Arg Gly Asp Phe Ala Arg Leu 55 60 65aat ttc cct gat ctc cgt cat aac gac gag tat caa cct ctt caa tca 895Asn Phe Pro Asp Leu Arg His Asn Asp Glu Tyr Gln Pro Leu Gln Ser 70 75 80tca gtc gac gct aag ctt gaa gct att tgt caa aac tta gct gag acg 943Ser Val Asp Ala Lys Leu Glu Ala Ile Cys Gln Asn Leu Ala Glu Thr 85 90 95 100acg cag aaa cag gtg aga tca acg aag aag tct tct tct cgg aaa cgt 991Thr Gln Lys Gln Val Arg Ser Thr Lys Lys Ser Ser Ser Arg Lys Arg 105 110 115tca tca acc gtc gca gtg aaa cta ccg gag gag gac tac tct agc gcc 1039Ser Ser Thr Val Ala Val Lys Leu Pro Glu Glu Asp Tyr Ser Ser Ala 120 125 130gga tct tcg ccg ctg tta acg gag agt tat gga tct ggt gga tct tct 1087Gly Ser Ser Pro Leu Leu Thr Glu Ser Tyr Gly Ser Gly Gly Ser Ser 135 140 145tcg ccg ttg tcg gag ctg acg ttt ggt gat acg gag gag gag att cag 1135Ser Pro Leu Ser Glu Leu Thr Phe Gly Asp Thr Glu Glu Glu Ile Gln 150 155 160ccg ccg tgg aac gag aac gcg ttg gag aag tat ccg tcg tac gag atc 1183Pro Pro Trp Asn Glu Asn Ala Leu Glu Lys Tyr Pro Ser Tyr Glu Ile165 170 175 180gat tgg gat tcg att ctt cag tgt tcg agt ctt gta aat tagatgttgc 1232Asp Trp Asp Ser Ile Leu Gln Cys Ser Ser Leu Val Asn 185 190cataggggta ttttagggac tttagagctc tctgcgatgg agtttttggt cattgcagag 1292attttattat tattaagggg gtttgttatg ttaatatcaa ataagtttat ctactttgat 1352gttaattagt gttaatctct gcgtcggtcc aagctgtttt tttttggcat gcttcgaccg 1412tgtgagattt cttatgtaat ttttgtagtt ccttgatttt cttagttcaa gttaaattgg 1472cacaaaagag caaaaaaaaa aaaaaaa 1499100193PRTArabidopsis thaliana 100Met Lys Gln Thr Gly Thr Ser Ala Ala Lys Pro Thr Lys Leu Tyr Arg 1 5 10 15Gly Val Arg Gln Arg His Trp Gly Lys Trp Val Ala Glu Ile Arg Leu 20 25 30Pro Arg Asn Arg Thr Arg Leu Trp Leu Gly Thr Phe Asp Thr Ala Glu 35 40 45Glu Ala Ala Leu Ala Tyr Asp Lys Ala Ala Tyr Lys Leu Arg Gly Asp 50 55 60Phe Ala Arg Leu Asn Phe Pro Asp Leu Arg His Asn Asp Glu Tyr Gln 65 70 75 80Pro Leu Gln Ser Ser Val Asp Ala Lys Leu Glu Ala Ile Cys Gln Asn 85 90 95Leu Ala Glu Thr Thr Gln Lys Gln Val Arg Ser Thr Lys Lys Ser Ser 100 105 110Ser Arg Lys Arg Ser Ser Thr Val Ala Val Lys Leu Pro Glu Glu Asp 115 120 125Tyr Ser Ser Ala Gly Ser Ser Pro Leu Leu Thr Glu Ser Tyr Gly Ser 130 135 140Gly Gly Ser Ser Ser Pro Leu Ser Glu Leu Thr Phe Gly Asp Thr Glu145 150 155 160Glu Glu Ile Gln Pro Pro Trp Asn Glu Asn Ala Leu Glu Lys Tyr Pro 165 170 175Ser Tyr Glu Ile Asp Trp Asp Ser Ile Leu Gln Cys Ser Ser Leu Val 180 185 190Asn101962DNAArabidopsis thalianaCDS(77)..(880) 101acaccggaca ttttgaaatc tcaacaagaa ccaaaccaaa caacaaaaaa acattcttaa 60taattatctt tctgtt atg tcg atg acg gcg gat tct caa tct gat tat gct 112 Met Ser Met Thr Ala Asp Ser Gln Ser Asp Tyr Ala 1 5 10ttt ctt gag tcc ata cga cga cac tta cta gga gaa tcg gag ccg ata 160Phe Leu Glu Ser Ile Arg Arg His Leu Leu Gly Glu Ser Glu Pro Ile 15 20 25ctc agt gag tcg aca gcg agt tcg gtt act caa tct tgt gta acc ggt 208Leu Ser Glu Ser Thr Ala Ser Ser Val Thr Gln Ser Cys Val Thr Gly 30 35 40cag agc att aaa ccg gtg tac gga cga aac cct agc ttt agc aaa ctg 256Gln Ser Ile Lys Pro Val Tyr Gly Arg Asn Pro Ser Phe Ser Lys Leu 45 50 55 60tat cct tgc ttc acc gag agc tgg gga gat ttg ccg ttg aaa gaa aac 304Tyr Pro Cys Phe Thr Glu Ser Trp Gly Asp Leu Pro Leu Lys Glu Asn 65 70 75gat tct gag gat atg tta gtt tac ggt atc ctc aac gac gcc ttt cac 352Asp Ser Glu Asp Met Leu Val Tyr Gly Ile Leu Asn Asp Ala Phe His 80 85 90ggc ggt tgg gag ccg tct tct tcg tct tcc gac gaa gat cgt agc tct 400Gly Gly Trp Glu Pro Ser Ser Ser Ser Ser Asp Glu Asp Arg Ser Ser 95 100 105ttc ccg agt gtt aag atc gag act ccg gag agt ttc gcg gcg gtg gat 448Phe Pro Ser Val Lys Ile Glu Thr Pro Glu Ser Phe Ala Ala Val Asp 110 115 120tct gtt ccg gtc aag aag gag aag acg agt cct gtt tcg gcg gcg gtg 496Ser Val Pro Val Lys Lys Glu Lys Thr Ser Pro Val Ser Ala Ala Val125 130 135 140acg gcg gcg aag gga aag cat tat aga gga gtg aga caa agg ccg tgg 544Thr Ala Ala Lys Gly Lys His Tyr Arg Gly Val Arg Gln Arg Pro Trp 145 150 155ggg aaa ttt gcg gcg gag att aga gac ccg gcg aag aac gga gct agg 592Gly Lys Phe Ala Ala Glu Ile Arg Asp Pro Ala Lys Asn Gly Ala Arg 160 165 170gtt tgg tta gga acg ttt gag acg gcg gag gac gcg gcg ttg gct tac 640Val Trp Leu Gly Thr Phe Glu Thr Ala Glu Asp Ala Ala Leu Ala Tyr 175 180 185gac aga gct gct ttc agg atg cgt ggt tcc cgc gct ttg ttg aat ttt 688Asp Arg Ala Ala Phe Arg Met Arg Gly Ser Arg Ala Leu Leu Asn Phe 190 195 200ccg ttg aga gtt aat tca gga gaa ccc gac ccg gtt cga atc aag tcc 736Pro Leu Arg Val Asn Ser Gly Glu Pro Asp Pro Val Arg Ile Lys Ser205 210 215 220aag aga tct tct ttt tct tct tct aac gag aac gga gct ccg aag aag 784Lys Arg Ser Ser Phe Ser Ser Ser Asn Glu Asn Gly Ala Pro Lys Lys 225 230 235agg aga acg gtg gcc gcc ggt ggt gga atg gat aag gga ttg acg gtg 832Arg Arg Thr Val Ala Ala Gly Gly Gly Met Asp Lys Gly Leu Thr Val 240 245 250aag tgc gag gtt gtt gaa gtg gca cgt ggc gat cgt tta ttg gtt tta 880Lys Cys Glu Val Val Glu Val Ala Arg Gly Asp Arg Leu Leu Val Leu 255 260 265taattttgat ttttctttgt tggatgatta tatgattctt caaaaaagaa gaacgttaat 940aaaaaaattc gtttattatt gt 962102268PRTArabidopsis thaliana 102Met Ser Met Thr Ala Asp Ser Gln Ser Asp Tyr Ala Phe Leu Glu Ser 1 5 10 15Ile Arg Arg His Leu Leu Gly Glu Ser Glu Pro Ile Leu Ser Glu Ser 20 25 30Thr Ala Ser Ser Val Thr Gln Ser Cys Val Thr Gly Gln Ser Ile Lys 35 40 45Pro Val Tyr Gly Arg Asn Pro Ser Phe Ser Lys Leu Tyr Pro Cys Phe 50 55 60Thr Glu Ser Trp Gly Asp Leu Pro Leu Lys Glu Asn Asp Ser Glu Asp 65 70 75 80Met Leu Val Tyr Gly Ile Leu Asn Asp Ala Phe His Gly Gly Trp Glu 85 90 95Pro Ser Ser Ser Ser Ser Asp Glu Asp Arg Ser Ser Phe Pro Ser Val 100 105 110Lys Ile Glu Thr Pro Glu Ser Phe Ala Ala Val Asp Ser Val Pro Val 115 120 125Lys Lys Glu Lys Thr Ser Pro Val Ser Ala Ala Val Thr Ala Ala Lys 130 135 140Gly Lys His Tyr Arg Gly Val Arg Gln Arg Pro Trp Gly Lys Phe Ala145 150 155 160Ala Glu Ile Arg Asp Pro Ala Lys Asn Gly Ala Arg Val Trp Leu Gly 165 170 175Thr Phe Glu Thr Ala Glu Asp Ala Ala Leu Ala Tyr Asp Arg Ala Ala 180 185 190Phe Arg Met Arg Gly Ser Arg Ala Leu Leu Asn Phe Pro Leu Arg Val 195 200 205Asn Ser Gly Glu Pro Asp Pro Val Arg Ile Lys Ser Lys Arg Ser Ser 210 215 220Phe Ser Ser Ser Asn Glu Asn Gly Ala Pro Lys Lys Arg Arg Thr Val225 230 235 240Ala Ala Gly Gly Gly Met Asp Lys Gly Leu Thr Val Lys Cys Glu Val 245 250 255Val Glu Val Ala Arg Gly Asp Arg Leu Leu Val Leu 260 265103834DNAArabidopsis thalianaCDS(171)..(746) 103aacacaattt gattacactg agcaacacaa aactggcgaa ccaacgtgac tctaacgaag 60aaaccggcaa tggccagtat cactacaatg ccgaagaaat aacaagaatc ataaacgagc 120cagaatatta tcccccgggt tacaacttgt ctaccaccgc aatttcaaac atg gtg 176 Met Val 1tct atg ctg act aat gtt gtc tct ggt gag acc gaa ccc tcg gca tct 224Ser Met Leu Thr Asn Val Val Ser Gly Glu Thr Glu Pro Ser Ala Ser 5 10 15gcg aca tgg acg atg ggt cat aag aga gaa aga gaa gag ttt tct ttg 272Ala Thr Trp Thr Met Gly His Lys Arg Glu Arg Glu Glu Phe Ser Leu 20 25 30cct cct caa cca ttg att acc ggt tca gct gtg act aaa gaa tgt gaa 320Pro Pro Gln Pro Leu Ile Thr Gly Ser Ala Val Thr Lys Glu Cys Glu 35 40 45 50agc tca atg tcc ttg gag agg cca aaa aaa tat aga gga gta agg caa 368Ser Ser Met Ser Leu Glu Arg Pro Lys Lys Tyr Arg Gly Val Arg Gln 55 60 65cga cca tgg gga aaa tgg gcg gcg gag att cga gac cca cac aag gcg 416Arg Pro Trp Gly Lys Trp Ala Ala Glu Ile Arg Asp Pro His Lys Ala 70 75 80aca cgt gta tgg ctt ggg aca ttc gag aca gcc gag gcc gcc gca aga 464Thr Arg Val Trp Leu Gly Thr Phe Glu Thr Ala Glu Ala Ala Ala Arg 85 90 95gcc tat gat gcg gca gca ctt cgc ttt aga gga agc aaa gca aag ctt 512Ala Tyr Asp Ala Ala Ala Leu Arg Phe Arg Gly Ser Lys Ala Lys Leu 100 105 110aat ttc ccc gaa aat gtt gga act cag acg att caa cga aat tct cat 560Asn Phe Pro Glu Asn Val Gly Thr Gln Thr Ile Gln Arg Asn Ser His115 120 125 130ttc ttg caa aac tct atg caa cct tct ctg aca tac atc gat caa tgt 608Phe Leu Gln Asn Ser Met Gln Pro Ser Leu Thr Tyr Ile Asp Gln Cys 135 140 145cca act cta tta tct tac tct cga tgt atg gag caa caa caa cca tta 656Pro Thr Leu Leu Ser Tyr Ser Arg Cys Met Glu Gln Gln Gln Pro Leu 150 155 160gta ggc atg ttg cag cca aca gaa gag gaa aat cac ttt ttc gaa aaa 704Val Gly Met Leu Gln Pro Thr Glu Glu Glu Asn His Phe Phe Glu Lys 165 170 175cca tgg acc gaa tat gat caa tac aat tac tcc tct ttt ggt 746Pro Trp Thr Glu Tyr Asp Gln Tyr Asn Tyr Ser Ser Phe Gly 180 185 190taactaacat attgtcaacg ctttgtattt ctacttattc gatctaccaa ttttttctct 806cccaatacaa cttcagtctg attattgc 834104192PRTArabidopsis thaliana 104Met Val Ser Met Leu Thr Asn Val Val Ser Gly Glu Thr Glu Pro Ser 1 5 10 15Ala Ser Ala Thr Trp Thr Met Gly His Lys Arg Glu Arg Glu Glu Phe 20 25 30Ser Leu Pro Pro Gln Pro Leu Ile Thr Gly Ser Ala Val Thr Lys Glu 35 40 45Cys Glu Ser Ser Met Ser Leu Glu Arg Pro Lys Lys Tyr Arg Gly Val 50 55 60Arg Gln Arg Pro Trp Gly Lys Trp Ala Ala Glu Ile Arg Asp Pro His 65 70 75 80Lys Ala Thr Arg Val Trp Leu Gly Thr Phe Glu Thr Ala Glu Ala Ala 85 90 95Ala Arg Ala Tyr Asp Ala Ala Ala Leu Arg Phe Arg Gly Ser Lys Ala 100 105 110Lys Leu Asn Phe Pro Glu Asn Val Gly Thr Gln Thr Ile Gln Arg Asn 115 120 125Ser His Phe Leu Gln Asn Ser Met Gln Pro Ser Leu Thr Tyr Ile Asp 130 135 140Gln Cys Pro Thr Leu Leu Ser Tyr Ser Arg Cys Met Glu Gln Gln Gln145 150 155 160Pro Leu Val Gly Met Leu Gln Pro Thr Glu Glu Glu

Asn His Phe Phe 165 170 175Glu Lys Pro Trp Thr Glu Tyr Asp Gln Tyr Asn Tyr Ser Ser Phe Gly 180 185 1901051085DNAArabidopsis thalianaCDS(120)..(953) 105acttcactct ctaatttcct tctctctatc tctcaccata ttcgcgatta aaaactctca 60acttttctct caaatttctg atcctttgat ccaacagtta gaagaagatt catctgatc 119atg gcc ctc gaa gcg atg aac act cca act tct tct ttc acc aga atc 167Met Ala Leu Glu Ala Met Asn Thr Pro Thr Ser Ser Phe Thr Arg Ile 1 5 10 15gaa acg aaa gaa gat ttg atg aac gac gcc gtt ttc att gag ccg tgg 215Glu Thr Lys Glu Asp Leu Met Asn Asp Ala Val Phe Ile Glu Pro Trp 20 25 30ctt aaa cgc aaa cgc tcc aaa cgt cag cgt tct cac agc cct tct tcg 263Leu Lys Arg Lys Arg Ser Lys Arg Gln Arg Ser His Ser Pro Ser Ser 35 40 45tct tct tcc tca ccg cct cga tct cga ccc aaa tcc cag aat caa gat 311Ser Ser Ser Ser Pro Pro Arg Ser Arg Pro Lys Ser Gln Asn Gln Asp 50 55 60ctt acg gaa gaa gag tat ctc gct ctt tgt ctc ctc atg ctc gct aaa 359Leu Thr Glu Glu Glu Tyr Leu Ala Leu Cys Leu Leu Met Leu Ala Lys 65 70 75 80gat caa ccg tcg caa acg cga ttt cat caa cag tcg caa tcg tta acg 407Asp Gln Pro Ser Gln Thr Arg Phe His Gln Gln Ser Gln Ser Leu Thr 85 90 95ccg ccg cca gaa tca aag aac ctt ccg tac aag tgt aac gtc tgt gaa 455Pro Pro Pro Glu Ser Lys Asn Leu Pro Tyr Lys Cys Asn Val Cys Glu 100 105 110aaa gcg ttt cct tcc tat cag gct tta ggc ggt cac aaa gca agt cac 503Lys Ala Phe Pro Ser Tyr Gln Ala Leu Gly Gly His Lys Ala Ser His 115 120 125cga atc aaa cca cca acc gta atc tca aca acc gcc gat gat tca aca 551Arg Ile Lys Pro Pro Thr Val Ile Ser Thr Thr Ala Asp Asp Ser Thr 130 135 140gct ccg acc atc tcc atc gtc gcc gga gaa aaa cat ccg att gct gcc 599Ala Pro Thr Ile Ser Ile Val Ala Gly Glu Lys His Pro Ile Ala Ala145 150 155 160tcc gga aag atc cac gag tgt tca atc tgt cat aaa gtg ttt ccg acg 647Ser Gly Lys Ile His Glu Cys Ser Ile Cys His Lys Val Phe Pro Thr 165 170 175ggt caa gct tta ggc ggt cac aaa cgt tgt cac tac gaa ggc aac ctc 695Gly Gln Ala Leu Gly Gly His Lys Arg Cys His Tyr Glu Gly Asn Leu 180 185 190ggc ggc gga gga gga gga gga agc aaa tca atc agt cac agt gga agc 743Gly Gly Gly Gly Gly Gly Gly Ser Lys Ser Ile Ser His Ser Gly Ser 195 200 205gtg tcg agc acg gta tcg gaa gaa agg agc cac cgt gga ttc atc gat 791Val Ser Ser Thr Val Ser Glu Glu Arg Ser His Arg Gly Phe Ile Asp 210 215 220cta aac cta ccg gcg tta cct gaa ctc agc ctt cat cac aat cca atc 839Leu Asn Leu Pro Ala Leu Pro Glu Leu Ser Leu His His Asn Pro Ile225 230 235 240gtc gac gaa gag atc ttg agt ccg ttg acc ggt aaa aaa acc gct ttt 887Val Asp Glu Glu Ile Leu Ser Pro Leu Thr Gly Lys Lys Thr Ala Phe 245 250 255gtt gac cga tca cga cca agt cat caa gaa aga aga ttt atc ttt aaa 935Val Asp Arg Ser Arg Pro Ser His Gln Glu Arg Arg Phe Ile Phe Lys 260 265 270aat cta ata ctc gac tat taattcttgt gtgatttttt tcgttacaac 983Asn Leu Ile Leu Asp Tyr 275catagtttca ttttcatttt tttagttaca aatttttaat tgttctgatt tggattgaat 1043attggtatat tgttaggggt tgatacaaaa aaaaaaaaaa aa 1085106278PRTArabidopsis thaliana 106Met Ala Leu Glu Ala Met Asn Thr Pro Thr Ser Ser Phe Thr Arg Ile 1 5 10 15Glu Thr Lys Glu Asp Leu Met Asn Asp Ala Val Phe Ile Glu Pro Trp 20 25 30Leu Lys Arg Lys Arg Ser Lys Arg Gln Arg Ser His Ser Pro Ser Ser 35 40 45Ser Ser Ser Ser Pro Pro Arg Ser Arg Pro Lys Ser Gln Asn Gln Asp 50 55 60Leu Thr Glu Glu Glu Tyr Leu Ala Leu Cys Leu Leu Met Leu Ala Lys 65 70 75 80Asp Gln Pro Ser Gln Thr Arg Phe His Gln Gln Ser Gln Ser Leu Thr 85 90 95Pro Pro Pro Glu Ser Lys Asn Leu Pro Tyr Lys Cys Asn Val Cys Glu 100 105 110Lys Ala Phe Pro Ser Tyr Gln Ala Leu Gly Gly His Lys Ala Ser His 115 120 125Arg Ile Lys Pro Pro Thr Val Ile Ser Thr Thr Ala Asp Asp Ser Thr 130 135 140Ala Pro Thr Ile Ser Ile Val Ala Gly Glu Lys His Pro Ile Ala Ala145 150 155 160Ser Gly Lys Ile His Glu Cys Ser Ile Cys His Lys Val Phe Pro Thr 165 170 175Gly Gln Ala Leu Gly Gly His Lys Arg Cys His Tyr Glu Gly Asn Leu 180 185 190Gly Gly Gly Gly Gly Gly Gly Ser Lys Ser Ile Ser His Ser Gly Ser 195 200 205Val Ser Ser Thr Val Ser Glu Glu Arg Ser His Arg Gly Phe Ile Asp 210 215 220Leu Asn Leu Pro Ala Leu Pro Glu Leu Ser Leu His His Asn Pro Ile225 230 235 240Val Asp Glu Glu Ile Leu Ser Pro Leu Thr Gly Lys Lys Thr Ala Phe 245 250 255Val Asp Arg Ser Arg Pro Ser His Gln Glu Arg Arg Phe Ile Phe Lys 260 265 270Asn Leu Ile Leu Asp Tyr 2751071783DNAArabidopsis thalianaCDS(162)..(1529) 107attgttaaaa gctctcacac aaccaccgtt ctccgtcacg gtggcgcttt attctctcat 60cggagcgcct caccggtggc agacggtgtc gttgttcctc tcctaaaacc tccatcaatc 120accatctctt tacacagagc tctaaccaaa atcttcgaga g atg ggg cag agt atg 176 Met Gly Gln Ser Met 1 5agc tgt gga agt cga ccg gag cac gga ata ttc gcc tct gta cag tgc 224Ser Cys Gly Ser Arg Pro Glu His Gly Ile Phe Ala Ser Val Gln Cys 10 15 20ggc gat atc atc act atc cgt cgt gtg atg gcg acg gag cct agt ctg 272Gly Asp Ile Ile Thr Ile Arg Arg Val Met Ala Thr Glu Pro Ser Leu 25 30 35ttg aat caa act act cct tat gat cgt cac tct gtt ctt cat gtc gct 320Leu Asn Gln Thr Thr Pro Tyr Asp Arg His Ser Val Leu His Val Ala 40 45 50gct gct aat ggt cag atc gag att ttg tca ttg ctt ttg gaa cga ttt 368Ala Ala Asn Gly Gln Ile Glu Ile Leu Ser Leu Leu Leu Glu Arg Phe 55 60 65acg aat cca gat ttg ttg aat cgt cac aag cag act ccg tta atg ttg 416Thr Asn Pro Asp Leu Leu Asn Arg His Lys Gln Thr Pro Leu Met Leu 70 75 80 85gct gcg atg tat gga aga atc tct tgt gtg aag aag cta gct gaa gtt 464Ala Ala Met Tyr Gly Arg Ile Ser Cys Val Lys Lys Leu Ala Glu Val 90 95 100gga gct aat att ttg atg ttt gat tct gtg aat cga aga aca tgt ttg 512Gly Ala Asn Ile Leu Met Phe Asp Ser Val Asn Arg Arg Thr Cys Leu 105 110 115cat tac gct gct tat tat gga cat gct aat tgt gtt caa gct att ctc 560His Tyr Ala Ala Tyr Tyr Gly His Ala Asn Cys Val Gln Ala Ile Leu 120 125 130tct gct gct caa tca agt cct gtt gct gtc cat tgg gga tat gcg aga 608Ser Ala Ala Gln Ser Ser Pro Val Ala Val His Trp Gly Tyr Ala Arg 135 140 145ttt gtg aac ata aga gat gat aaa gga gcg act ccg ttg cat tta gct 656Phe Val Asn Ile Arg Asp Asp Lys Gly Ala Thr Pro Leu His Leu Ala150 155 160 165gct cga cag aga cga cct gaa tgt gtg aat gtt ttg ttg gat agt ggt 704Ala Arg Gln Arg Arg Pro Glu Cys Val Asn Val Leu Leu Asp Ser Gly 170 175 180tct ctt gtt tgt gca tct act agt gta tat ggt tct cca gga agc aca 752Ser Leu Val Cys Ala Ser Thr Ser Val Tyr Gly Ser Pro Gly Ser Thr 185 190 195cct ctt cat tta gca gct aga agt gga tct ata gat tgt gtc aga aag 800Pro Leu His Leu Ala Ala Arg Ser Gly Ser Ile Asp Cys Val Arg Lys 200 205 210ttg ctt gct tgg ggt gct gat cgt ctt caa cga gac gct tct ggg aga 848Leu Leu Ala Trp Gly Ala Asp Arg Leu Gln Arg Asp Ala Ser Gly Arg 215 220 225ata cct tat gtg gtt gcg atg aag cat aag cat gga gca tgt gga gcc 896Ile Pro Tyr Val Val Ala Met Lys His Lys His Gly Ala Cys Gly Ala230 235 240 245tta ctt aat ccg tcc tct gca gag cca ctt gtt tgg cca tca cca tta 944Leu Leu Asn Pro Ser Ser Ala Glu Pro Leu Val Trp Pro Ser Pro Leu 250 255 260aag ttc atc agt gag ctt aat gac gag gcg aaa ctt ctc tta gag cag 992Lys Phe Ile Ser Glu Leu Asn Asp Glu Ala Lys Leu Leu Leu Glu Gln 265 270 275gct tta atg gag gct aac agg gag aga gag aaa acc atc ctc aaa gga 1040Ala Leu Met Glu Ala Asn Arg Glu Arg Glu Lys Thr Ile Leu Lys Gly 280 285 290aca gct tat tcc tta cca tca ccc tct ttc tct gac acg gat gat aac 1088Thr Ala Tyr Ser Leu Pro Ser Pro Ser Phe Ser Asp Thr Asp Asp Asn 295 300 305atg tcc gag gtg agt gat acg gaa ctg tgc tgc att tgc ttt gag caa 1136Met Ser Glu Val Ser Asp Thr Glu Leu Cys Cys Ile Cys Phe Glu Gln310 315 320 325gta tgt aca att gaa gtt aaa gac tgt ggt cac caa atg tgt gca caa 1184Val Cys Thr Ile Glu Val Lys Asp Cys Gly His Gln Met Cys Ala Gln 330 335 340tgc aca ctt gca ctg tgc tgt cac aac aaa cca aac cca acg acc tca 1232Cys Thr Leu Ala Leu Cys Cys His Asn Lys Pro Asn Pro Thr Thr Ser 345 350 355acc gtg act cca ccg gtc tgt ccg ttc tgt aga agc acc att gca tgt 1280Thr Val Thr Pro Pro Val Cys Pro Phe Cys Arg Ser Thr Ile Ala Cys 360 365 370tta gtc gtc gcc cag aac aac aac aac aac aac gaa aag agc aaa agc 1328Leu Val Val Ala Gln Asn Asn Asn Asn Asn Asn Glu Lys Ser Lys Ser 375 380 385cta gat gat gtt gtt gtt gtt gat cgt gag gca ggt gat gtt agc tcc 1376Leu Asp Asp Val Val Val Val Asp Arg Glu Ala Gly Asp Val Ser Ser390 395 400 405tcc aaa ttc aga aaa cat aga aga tca ata aac ctt ggc gaa gaa agc 1424Ser Lys Phe Arg Lys His Arg Arg Ser Ile Asn Leu Gly Glu Glu Ser 410 415 420agc agc ttc atg gga cta tca act att gga tca ttc ggt agg ata acc 1472Ser Ser Phe Met Gly Leu Ser Thr Ile Gly Ser Phe Gly Arg Ile Thr 425 430 435ggc cgt ggc tcg gga agg atc gca gcc gaa aac gag ctg atg gac aaa 1520Gly Arg Gly Ser Gly Arg Ile Ala Ala Glu Asn Glu Leu Met Asp Lys 440 445 450cca ata ttg tgagggatcg attccgtttt aagggacatt ttggggcatg 1569Pro Ile Leu 455ggggagcaat aaaaaagatg aggggatgaa attgtgagaa tgtataaaat atagatgaat 1629ttatgttaga tcttttgttg aagggaggaa gattgaaata aggaaaaaga tgtggggagg 1689tgtgtaatgc aaggatttgt tgtttctttg attaagtttg gccaaaattg tttgttgttg 1749ttattatttg gttacttgat atgaaaggga aacc 1783108456PRTArabidopsis thaliana 108Met Gly Gln Ser Met Ser Cys Gly Ser Arg Pro Glu His Gly Ile Phe 1 5 10 15Ala Ser Val Gln Cys Gly Asp Ile Ile Thr Ile Arg Arg Val Met Ala 20 25 30Thr Glu Pro Ser Leu Leu Asn Gln Thr Thr Pro Tyr Asp Arg His Ser 35 40 45Val Leu His Val Ala Ala Ala Asn Gly Gln Ile Glu Ile Leu Ser Leu 50 55 60Leu Leu Glu Arg Phe Thr Asn Pro Asp Leu Leu Asn Arg His Lys Gln 65 70 75 80Thr Pro Leu Met Leu Ala Ala Met Tyr Gly Arg Ile Ser Cys Val Lys 85 90 95Lys Leu Ala Glu Val Gly Ala Asn Ile Leu Met Phe Asp Ser Val Asn 100 105 110Arg Arg Thr Cys Leu His Tyr Ala Ala Tyr Tyr Gly His Ala Asn Cys 115 120 125Val Gln Ala Ile Leu Ser Ala Ala Gln Ser Ser Pro Val Ala Val His 130 135 140Trp Gly Tyr Ala Arg Phe Val Asn Ile Arg Asp Asp Lys Gly Ala Thr145 150 155 160Pro Leu His Leu Ala Ala Arg Gln Arg Arg Pro Glu Cys Val Asn Val 165 170 175Leu Leu Asp Ser Gly Ser Leu Val Cys Ala Ser Thr Ser Val Tyr Gly 180 185 190Ser Pro Gly Ser Thr Pro Leu His Leu Ala Ala Arg Ser Gly Ser Ile 195 200 205Asp Cys Val Arg Lys Leu Leu Ala Trp Gly Ala Asp Arg Leu Gln Arg 210 215 220Asp Ala Ser Gly Arg Ile Pro Tyr Val Val Ala Met Lys His Lys His225 230 235 240Gly Ala Cys Gly Ala Leu Leu Asn Pro Ser Ser Ala Glu Pro Leu Val 245 250 255Trp Pro Ser Pro Leu Lys Phe Ile Ser Glu Leu Asn Asp Glu Ala Lys 260 265 270Leu Leu Leu Glu Gln Ala Leu Met Glu Ala Asn Arg Glu Arg Glu Lys 275 280 285Thr Ile Leu Lys Gly Thr Ala Tyr Ser Leu Pro Ser Pro Ser Phe Ser 290 295 300Asp Thr Asp Asp Asn Met Ser Glu Val Ser Asp Thr Glu Leu Cys Cys305 310 315 320Ile Cys Phe Glu Gln Val Cys Thr Ile Glu Val Lys Asp Cys Gly His 325 330 335Gln Met Cys Ala Gln Cys Thr Leu Ala Leu Cys Cys His Asn Lys Pro 340 345 350Asn Pro Thr Thr Ser Thr Val Thr Pro Pro Val Cys Pro Phe Cys Arg 355 360 365Ser Thr Ile Ala Cys Leu Val Val Ala Gln Asn Asn Asn Asn Asn Asn 370 375 380Glu Lys Ser Lys Ser Leu Asp Asp Val Val Val Val Asp Arg Glu Ala385 390 395 400Gly Asp Val Ser Ser Ser Lys Phe Arg Lys His Arg Arg Ser Ile Asn 405 410 415Leu Gly Glu Glu Ser Ser Ser Phe Met Gly Leu Ser Thr Ile Gly Ser 420 425 430Phe Gly Arg Ile Thr Gly Arg Gly Ser Gly Arg Ile Ala Ala Glu Asn 435 440 445Glu Leu Met Asp Lys Pro Ile Leu 450 455109714DNAArabidopsis thalianaCDS(1)..(711) 109atg aag ata caa tgt gat gtg tgt gag aaa gct ccg gcc acg ctt ata 48Met Lys Ile Gln Cys Asp Val Cys Glu Lys Ala Pro Ala Thr Leu Ile 1 5 10 15tgt tgt gct gat gaa gct gct ctc tgc gct aaa tgt gac gtt gag gtt 96Cys Cys Ala Asp Glu Ala Ala Leu Cys Ala Lys Cys Asp Val Glu Val 20 25 30cat gct gct aat aaa ctc gct agc aaa cac caa cgc ctt ttt ctt gac 144His Ala Ala Asn Lys Leu Ala Ser Lys His Gln Arg Leu Phe Leu Asp 35 40 45tct ctc tca act aaa ttc cct ccc tgc gac atc tgc ctt gag aag gca 192Ser Leu Ser Thr Lys Phe Pro Pro Cys Asp Ile Cys Leu Glu Lys Ala 50 55 60gct ttc ata ttc tgt gta gag gat agg gct ctg ctc tgc aga gat tgc 240Ala Phe Ile Phe Cys Val Glu Asp Arg Ala Leu Leu Cys Arg Asp Cys 65 70 75 80gat gag gcg acc cat gcg cca aat act cgc tct gct aat cac cag agg 288Asp Glu Ala Thr His Ala Pro Asn Thr Arg Ser Ala Asn His Gln Arg 85 90 95ttc tta gcc act gga atc cga gtt gct ctt agt tcc act agt tgc aat 336Phe Leu Ala Thr Gly Ile Arg Val Ala Leu Ser Ser Thr Ser Cys Asn 100 105 110caa gaa gtg gaa aag aat cac ttt gac cca tct aat cag cag agt ctc 384Gln Glu Val Glu Lys Asn His Phe Asp Pro Ser Asn Gln Gln Ser Leu 115 120 125tct aaa ccg cca act cag caa ccc gct gct cca tct cct ttg tgg gct 432Ser Lys Pro Pro Thr Gln Gln Pro Ala Ala Pro Ser Pro Leu Trp Ala 130 135 140acc gat gaa ttc ttc agc tac tct gat ctt gac tgc agt aat aag aaa 480Thr Asp Glu Phe Phe Ser Tyr Ser Asp Leu Asp Cys Ser Asn Lys Lys145 150 155 160gag caa ctc gat ctc ggg gag ctg gat tgg ctt gca gag atg ggt ctg 528Glu Gln Leu Asp Leu Gly Glu Leu Asp Trp Leu Ala Glu Met Gly Leu 165 170 175ttt ggt gac cag cct gat caa gag gct cta ccg gta gcc gaa gtt ccc 576Phe Gly Asp Gln Pro Asp Gln Glu Ala Leu Pro Val Ala Glu Val Pro 180 185 190gag ctt tcc ttt tca cat ttg gct cat gct cat tcc tac aac aga cct 624Glu Leu Ser Phe Ser His Leu Ala His Ala His Ser Tyr Asn Arg Pro 195 200 205atg aag tcc aat gta ccc aac aag aag cag agg ctt gag tac cgg tat 672Met Lys Ser Asn Val Pro Asn Lys Lys Gln Arg Leu Glu Tyr Arg Tyr 210 215 220gat gat gaa gaa gag cac ttc cta gtc ccc gac cta ggc taa 714Asp Asp Glu Glu Glu His Phe Leu Val Pro Asp

Leu Gly225 230 235110237PRTArabidopsis thaliana 110Met Lys Ile Gln Cys Asp Val Cys Glu Lys Ala Pro Ala Thr Leu Ile 1 5 10 15Cys Cys Ala Asp Glu Ala Ala Leu Cys Ala Lys Cys Asp Val Glu Val 20 25 30His Ala Ala Asn Lys Leu Ala Ser Lys His Gln Arg Leu Phe Leu Asp 35 40 45Ser Leu Ser Thr Lys Phe Pro Pro Cys Asp Ile Cys Leu Glu Lys Ala 50 55 60Ala Phe Ile Phe Cys Val Glu Asp Arg Ala Leu Leu Cys Arg Asp Cys 65 70 75 80Asp Glu Ala Thr His Ala Pro Asn Thr Arg Ser Ala Asn His Gln Arg 85 90 95Phe Leu Ala Thr Gly Ile Arg Val Ala Leu Ser Ser Thr Ser Cys Asn 100 105 110Gln Glu Val Glu Lys Asn His Phe Asp Pro Ser Asn Gln Gln Ser Leu 115 120 125Ser Lys Pro Pro Thr Gln Gln Pro Ala Ala Pro Ser Pro Leu Trp Ala 130 135 140Thr Asp Glu Phe Phe Ser Tyr Ser Asp Leu Asp Cys Ser Asn Lys Lys145 150 155 160Glu Gln Leu Asp Leu Gly Glu Leu Asp Trp Leu Ala Glu Met Gly Leu 165 170 175Phe Gly Asp Gln Pro Asp Gln Glu Ala Leu Pro Val Ala Glu Val Pro 180 185 190Glu Leu Ser Phe Ser His Leu Ala His Ala His Ser Tyr Asn Arg Pro 195 200 205Met Lys Ser Asn Val Pro Asn Lys Lys Gln Arg Leu Glu Tyr Arg Tyr 210 215 220Asp Asp Glu Glu Glu His Phe Leu Val Pro Asp Leu Gly225 230 235111829DNAArabidopsis thalianaCDS(99)..(584) 111atcacaacta ctatcacacc aaactcaaaa aacacaaacc acaagaggat catttcattt 60tttattgttt cgttttaatc atcatcatca gaagaaaa atg gtt gcg ata tcg gag 116 Met Val Ala Ile Ser Glu 1 5atc aag tcg acg gtg gat gtc acg gcg gcg aat tgt ttg atg ctt tta 164Ile Lys Ser Thr Val Asp Val Thr Ala Ala Asn Cys Leu Met Leu Leu 10 15 20tct aga gtt gga caa gaa aac gtt gac ggt ggc gat caa aaa cgc gtt 212Ser Arg Val Gly Gln Glu Asn Val Asp Gly Gly Asp Gln Lys Arg Val 25 30 35ttc aca tgt aaa acg tgt ttg aag cag ttt cat tcg ttc caa gcc tta 260Phe Thr Cys Lys Thr Cys Leu Lys Gln Phe His Ser Phe Gln Ala Leu 40 45 50gga ggt cac cgt gcg agt cac aag aag cct aac aac gac gct ttg tcg 308Gly Gly His Arg Ala Ser His Lys Lys Pro Asn Asn Asp Ala Leu Ser 55 60 65 70tct gga ttg atg aag aag gtg aaa acg tcg tcg cat cct tgt ccc ata 356Ser Gly Leu Met Lys Lys Val Lys Thr Ser Ser His Pro Cys Pro Ile 75 80 85tgt gga gtg gag ttt ccg atg gga caa gct ttg gga gga cac atg agg 404Cys Gly Val Glu Phe Pro Met Gly Gln Ala Leu Gly Gly His Met Arg 90 95 100aga cac agg aac gag agt ggg gct gct ggt ggc gcg ttg gtt aca cgc 452Arg His Arg Asn Glu Ser Gly Ala Ala Gly Gly Ala Leu Val Thr Arg 105 110 115gct ttg ttg ccg gag ccc acg gtg act acg ttg aag aaa tct agc agt 500Ala Leu Leu Pro Glu Pro Thr Val Thr Thr Leu Lys Lys Ser Ser Ser 120 125 130ggg aag aga gtg gct tgt ttg gat ctg agt cta ggg atg gtg gac aat 548Gly Lys Arg Val Ala Cys Leu Asp Leu Ser Leu Gly Met Val Asp Asn135 140 145 150ttg aat ctc aag ttg gag ctt gga aga aca gtt tat tgattttatt 594Leu Asn Leu Lys Leu Glu Leu Gly Arg Thr Val Tyr 155 160tattttcctt aaattttctg aatatatttg tttctctcat tctttgaatt tttcttaata 654ttctagatta tacatacatc cgcagattta ggaaactttc atagagtgta atcttttctt 714tctgtaaaaa tatattttac ttgtagcatt ggagatttgt tatgagatta tcttacttag 774catttagtga ataatctatt agcctatttt gccgacgcga aaaaaaaaaa aaaaa 829112162PRTArabidopsis thaliana 112Met Val Ala Ile Ser Glu Ile Lys Ser Thr Val Asp Val Thr Ala Ala 1 5 10 15Asn Cys Leu Met Leu Leu Ser Arg Val Gly Gln Glu Asn Val Asp Gly 20 25 30Gly Asp Gln Lys Arg Val Phe Thr Cys Lys Thr Cys Leu Lys Gln Phe 35 40 45His Ser Phe Gln Ala Leu Gly Gly His Arg Ala Ser His Lys Lys Pro 50 55 60Asn Asn Asp Ala Leu Ser Ser Gly Leu Met Lys Lys Val Lys Thr Ser 65 70 75 80Ser His Pro Cys Pro Ile Cys Gly Val Glu Phe Pro Met Gly Gln Ala 85 90 95Leu Gly Gly His Met Arg Arg His Arg Asn Glu Ser Gly Ala Ala Gly 100 105 110Gly Ala Leu Val Thr Arg Ala Leu Leu Pro Glu Pro Thr Val Thr Thr 115 120 125Leu Lys Lys Ser Ser Ser Gly Lys Arg Val Ala Cys Leu Asp Leu Ser 130 135 140Leu Gly Met Val Asp Asn Leu Asn Leu Lys Leu Glu Leu Gly Arg Thr145 150 155 160Val Tyr113881DNAArabidopsis thalianaCDS(89)..(769) 113agcaattaaa caatttcttc actgcaattc acaagcaacc ttcaaactaa aactcgagag 60acaagaaatc ctcagaatct ttaactta atg gcg ctc gag gct ctt aca tca 112 Met Ala Leu Glu Ala Leu Thr Ser 1 5cca aga tta gct tct ccg att cct cct ttg ttc gaa gat tct tca gtc 160Pro Arg Leu Ala Ser Pro Ile Pro Pro Leu Phe Glu Asp Ser Ser Val 10 15 20ttc cat gga gtc gag cac tgg aca aag ggt aag cga tct aag aga tca 208Phe His Gly Val Glu His Trp Thr Lys Gly Lys Arg Ser Lys Arg Ser 25 30 35 40aga tcc gat ttc cac cac caa aac ctc act gag gaa gag tat cta gct 256Arg Ser Asp Phe His His Gln Asn Leu Thr Glu Glu Glu Tyr Leu Ala 45 50 55ttt tgc ctc atg ctt ctc gct cgc gac aac cgt cag cct cct cct cct 304Phe Cys Leu Met Leu Leu Ala Arg Asp Asn Arg Gln Pro Pro Pro Pro 60 65 70ccg gcg gtg gag aag ttg agc tac aag tgt agc gtc tgc gac aag acg 352Pro Ala Val Glu Lys Leu Ser Tyr Lys Cys Ser Val Cys Asp Lys Thr 75 80 85ttc tct tct tac caa gct ctc ggt ggt cac aag gca agc cac cgt aag 400Phe Ser Ser Tyr Gln Ala Leu Gly Gly His Lys Ala Ser His Arg Lys 90 95 100aac tta tca cag act ctc tcc ggc gga gga gat gat cat tca acc tcg 448Asn Leu Ser Gln Thr Leu Ser Gly Gly Gly Asp Asp His Ser Thr Ser105 110 115 120tcg gcg aca acc aca tcc gcc gtg act act gga agt ggg aaa tca cac 496Ser Ala Thr Thr Thr Ser Ala Val Thr Thr Gly Ser Gly Lys Ser His 125 130 135gtt tgc acc atc tgt aac aag tct ttt cct tcc ggt caa gct ctc ggc 544Val Cys Thr Ile Cys Asn Lys Ser Phe Pro Ser Gly Gln Ala Leu Gly 140 145 150gga cac aag cgg tgc cac tac gaa gga aac aac aac atc aac act agt 592Gly His Lys Arg Cys His Tyr Glu Gly Asn Asn Asn Ile Asn Thr Ser 155 160 165agc gtg tcc aac tcc gaa ggt gcg ggg tcc act agc cac gtt agc agt 640Ser Val Ser Asn Ser Glu Gly Ala Gly Ser Thr Ser His Val Ser Ser 170 175 180agc cac cgt ggg ttt gac ctc aac atc cct ccg atc cct gaa ttc tcg 688Ser His Arg Gly Phe Asp Leu Asn Ile Pro Pro Ile Pro Glu Phe Ser185 190 195 200atg gtc aac gga gac gac gaa gtc atg agc cct atg ccg gcg aag aag 736Met Val Asn Gly Asp Asp Glu Val Met Ser Pro Met Pro Ala Lys Lys 205 210 215cct cgg ttt gac ttt ccg gtc aaa ctt caa ctt taaggaaatt tacttagacg 789Pro Arg Phe Asp Phe Pro Val Lys Leu Gln Leu 220 225ataagatttc gtttgtatac tgttgagagt tgtgtaggaa tttgttgact gtacatacca 849aattggactt tgactcaaaa aaaaaaaaaa aa 881114227PRTArabidopsis thaliana 114Met Ala Leu Glu Ala Leu Thr Ser Pro Arg Leu Ala Ser Pro Ile Pro 1 5 10 15Pro Leu Phe Glu Asp Ser Ser Val Phe His Gly Val Glu His Trp Thr 20 25 30Lys Gly Lys Arg Ser Lys Arg Ser Arg Ser Asp Phe His His Gln Asn 35 40 45Leu Thr Glu Glu Glu Tyr Leu Ala Phe Cys Leu Met Leu Leu Ala Arg 50 55 60Asp Asn Arg Gln Pro Pro Pro Pro Pro Ala Val Glu Lys Leu Ser Tyr 65 70 75 80Lys Cys Ser Val Cys Asp Lys Thr Phe Ser Ser Tyr Gln Ala Leu Gly 85 90 95Gly His Lys Ala Ser His Arg Lys Asn Leu Ser Gln Thr Leu Ser Gly 100 105 110Gly Gly Asp Asp His Ser Thr Ser Ser Ala Thr Thr Thr Ser Ala Val 115 120 125Thr Thr Gly Ser Gly Lys Ser His Val Cys Thr Ile Cys Asn Lys Ser 130 135 140Phe Pro Ser Gly Gln Ala Leu Gly Gly His Lys Arg Cys His Tyr Glu145 150 155 160Gly Asn Asn Asn Ile Asn Thr Ser Ser Val Ser Asn Ser Glu Gly Ala 165 170 175Gly Ser Thr Ser His Val Ser Ser Ser His Arg Gly Phe Asp Leu Asn 180 185 190Ile Pro Pro Ile Pro Glu Phe Ser Met Val Asn Gly Asp Asp Glu Val 195 200 205Met Ser Pro Met Pro Ala Lys Lys Pro Arg Phe Asp Phe Pro Val Lys 210 215 220Leu Gln Leu2251151107DNAArabidopsis thalianaCDS(139)..(957) 115aactcggtcc gagtcgactc ggtccgacaa gcgattttcg tgactccgtc gtcaccacgc 60ctatgttaag tgagtaaatc cgactcttct ccgtagagat ttctcctact ttcaggctct 120cttgtgaatt tggaagac atg agc ttt gtt ttc cgg gga agt aga gga gat 171 Met Ser Phe Val Phe Arg Gly Ser Arg Gly Asp 1 5 10tta gaa agc gga ttc tcg ggt ggt ttt cta ccc gaa aga cga gct atg 219Leu Glu Ser Gly Phe Ser Gly Gly Phe Leu Pro Glu Arg Arg Ala Met 15 20 25cgt gtt cat gga gct cga cca gtt aat tct aat tcc ctc gct ttt ctg 267Arg Val His Gly Ala Arg Pro Val Asn Ser Asn Ser Leu Ala Phe Leu 30 35 40gtt aca gtt ctt ttg ctg ttt atg att ctc aat tcg cat cag atg cct 315Val Thr Val Leu Leu Leu Phe Met Ile Leu Asn Ser His Gln Met Pro 45 50 55cct aat ttc ctg ctg tgg ctt gtg ctt ggg gtg ttt ttg atg gca acg 363Pro Asn Phe Leu Leu Trp Leu Val Leu Gly Val Phe Leu Met Ala Thr 60 65 70 75acg ctt agg atg tat gct act tgc caa caa ctt caa gct cat gct cag 411Thr Leu Arg Met Tyr Ala Thr Cys Gln Gln Leu Gln Ala His Ala Gln 80 85 90gct cag gct gca gca gca agt ggc ctc ttt agc cat act gag ctg agg 459Ala Gln Ala Ala Ala Ala Ser Gly Leu Phe Ser His Thr Glu Leu Arg 95 100 105ttg cat gtg cct cct tcc att gct ctt gct acg aga ggg cgt ctt cag 507Leu His Val Pro Pro Ser Ile Ala Leu Ala Thr Arg Gly Arg Leu Gln 110 115 120gga ctt agg ctc cag ctg gct ctt ctt gat cgg gaa ttt gat gac tta 555Gly Leu Arg Leu Gln Leu Ala Leu Leu Asp Arg Glu Phe Asp Asp Leu 125 130 135gat tat gaa act tta aga gca ctt gat tct gat aat gtt tcc aca act 603Asp Tyr Glu Thr Leu Arg Ala Leu Asp Ser Asp Asn Val Ser Thr Thr140 145 150 155tct atg agc gag gaa gag ata aat gca ctt cca gtt cac aag tac aag 651Ser Met Ser Glu Glu Glu Ile Asn Ala Leu Pro Val His Lys Tyr Lys 160 165 170gtg ttg gat cct gaa aat ggt tgc tct ttg gca aag caa gcg tca acc 699Val Leu Asp Pro Glu Asn Gly Cys Ser Leu Ala Lys Gln Ala Ser Thr 175 180 185tca tcc tca gct gag aag atg cta gat tct gcc aat gaa agt aaa aaa 747Ser Ser Ser Ala Glu Lys Met Leu Asp Ser Ala Asn Glu Ser Lys Lys 190 195 200gga aca gaa gat gag ctc aca tgt agt gtt tgc cta gaa caa gtt acc 795Gly Thr Glu Asp Glu Leu Thr Cys Ser Val Cys Leu Glu Gln Val Thr 205 210 215gta ggg gaa atc gtt cgc acc tta cct tgt ttg cat cag ttt cat gca 843Val Gly Glu Ile Val Arg Thr Leu Pro Cys Leu His Gln Phe His Ala220 225 230 235gga tgt atc gat cca tgg ttg aga cag caa gga aca tgt cct gtc tgt 891Gly Cys Ile Asp Pro Trp Leu Arg Gln Gln Gly Thr Cys Pro Val Cys 240 245 250aaa ttt aga gct cat tca gga tgg caa gaa caa gat gag att gat gat 939Lys Phe Arg Ala His Ser Gly Trp Gln Glu Gln Asp Glu Ile Asp Asp 255 260 265gat gct tcc gac atg gtt tgaaaagttt ggtttcgaac acttgtttat 987Asp Ala Ser Asp Met Val 270gttattaatg tgcgtgcgaa ttaaacccca aacaaatcat gtaatggtca ataactcaat 1047atgatgtaac tgatgactct ccctttccaa aagtttgatt tagattcatc atgaaaaagt 1107116273PRTArabidopsis thaliana 116Met Ser Phe Val Phe Arg Gly Ser Arg Gly Asp Leu Glu Ser Gly Phe 1 5 10 15Ser Gly Gly Phe Leu Pro Glu Arg Arg Ala Met Arg Val His Gly Ala 20 25 30Arg Pro Val Asn Ser Asn Ser Leu Ala Phe Leu Val Thr Val Leu Leu 35 40 45Leu Phe Met Ile Leu Asn Ser His Gln Met Pro Pro Asn Phe Leu Leu 50 55 60Trp Leu Val Leu Gly Val Phe Leu Met Ala Thr Thr Leu Arg Met Tyr 65 70 75 80Ala Thr Cys Gln Gln Leu Gln Ala His Ala Gln Ala Gln Ala Ala Ala 85 90 95Ala Ser Gly Leu Phe Ser His Thr Glu Leu Arg Leu His Val Pro Pro 100 105 110Ser Ile Ala Leu Ala Thr Arg Gly Arg Leu Gln Gly Leu Arg Leu Gln 115 120 125Leu Ala Leu Leu Asp Arg Glu Phe Asp Asp Leu Asp Tyr Glu Thr Leu 130 135 140Arg Ala Leu Asp Ser Asp Asn Val Ser Thr Thr Ser Met Ser Glu Glu145 150 155 160Glu Ile Asn Ala Leu Pro Val His Lys Tyr Lys Val Leu Asp Pro Glu 165 170 175Asn Gly Cys Ser Leu Ala Lys Gln Ala Ser Thr Ser Ser Ser Ala Glu 180 185 190Lys Met Leu Asp Ser Ala Asn Glu Ser Lys Lys Gly Thr Glu Asp Glu 195 200 205Leu Thr Cys Ser Val Cys Leu Glu Gln Val Thr Val Gly Glu Ile Val 210 215 220Arg Thr Leu Pro Cys Leu His Gln Phe His Ala Gly Cys Ile Asp Pro225 230 235 240Trp Leu Arg Gln Gln Gly Thr Cys Pro Val Cys Lys Phe Arg Ala His 245 250 255Ser Gly Trp Gln Glu Gln Asp Glu Ile Asp Asp Asp Ala Ser Asp Met 260 265 270Val117531DNAArabidopsis thalianaCDS(1)..(528) 117atg gga tcg gaa caa aac gat agc aca agc ttc acg caa tcg caa gct 48Met Gly Ser Glu Gln Asn Asp Ser Thr Ser Phe Thr Gln Ser Gln Ala 1 5 10 15tca gag cca aag cta tgt gtt aaa gga tgt ggt ttc ttt gga tca cca 96Ser Glu Pro Lys Leu Cys Val Lys Gly Cys Gly Phe Phe Gly Ser Pro 20 25 30tca aac atg gat ctc tgt tct aaa tgt tac aga ggc att tgt gct gag 144Ser Asn Met Asp Leu Cys Ser Lys Cys Tyr Arg Gly Ile Cys Ala Glu 35 40 45gaa gct caa aca gca gtt gct aaa gct gct gtt gaa aaa tct ttc aag 192Glu Ala Gln Thr Ala Val Ala Lys Ala Ala Val Glu Lys Ser Phe Lys 50 55 60cct tct cct cct cgt agt ctc ttc ata gca gaa cct cct gct gtt gtt 240Pro Ser Pro Pro Arg Ser Leu Phe Ile Ala Glu Pro Pro Ala Val Val 65 70 75 80gtg gaa ccc aaa ccc gaa aag gcg gca gtt gtt gtt gtc tcg gcc gag 288Val Glu Pro Lys Pro Glu Lys Ala Ala Val Val Val Val Ser Ala Glu 85 90 95cca tct tcc tcg gcg gtt cct gag gcg aac gag cca tcg aga cct gca 336Pro Ser Ser Ser Ala Val Pro Glu Ala Asn Glu Pro Ser Arg Pro Ala 100 105 110cga acc aac cgg tgt ttg tgt tgt aac aag aag gtt ggg atc atg ggg 384Arg Thr Asn Arg Cys Leu Cys Cys Asn Lys Lys Val Gly Ile Met Gly 115 120 125ttt aag tgc aaa tgc ggg agc act ttc tgc ggc gaa cat cgg tac ccg 432Phe Lys Cys Lys Cys Gly Ser Thr Phe Cys Gly Glu His Arg Tyr Pro 130 135 140gag act cat gat tgc agc ttt gat ttc aaa gaa gtt gga cgt gga gag 480Glu Thr His Asp Cys Ser Phe Asp Phe Lys Glu Val Gly Arg Gly Glu145 150 155 160att gcc aaa gct aat cct gtg gtt aag gct gat aaa att caa agg ttc 528Ile Ala Lys Ala Asn Pro Val Val Lys Ala Asp Lys Ile Gln Arg Phe 165 170 175tga 531118176PRTArabidopsis thaliana 118Met Gly

Ser Glu Gln Asn Asp Ser Thr Ser Phe Thr Gln Ser Gln Ala 1 5 10 15Ser Glu Pro Lys Leu Cys Val Lys Gly Cys Gly Phe Phe Gly Ser Pro 20 25 30Ser Asn Met Asp Leu Cys Ser Lys Cys Tyr Arg Gly Ile Cys Ala Glu 35 40 45Glu Ala Gln Thr Ala Val Ala Lys Ala Ala Val Glu Lys Ser Phe Lys 50 55 60Pro Ser Pro Pro Arg Ser Leu Phe Ile Ala Glu Pro Pro Ala Val Val 65 70 75 80Val Glu Pro Lys Pro Glu Lys Ala Ala Val Val Val Val Ser Ala Glu 85 90 95Pro Ser Ser Ser Ala Val Pro Glu Ala Asn Glu Pro Ser Arg Pro Ala 100 105 110Arg Thr Asn Arg Cys Leu Cys Cys Asn Lys Lys Val Gly Ile Met Gly 115 120 125Phe Lys Cys Lys Cys Gly Ser Thr Phe Cys Gly Glu His Arg Tyr Pro 130 135 140Glu Thr His Asp Cys Ser Phe Asp Phe Lys Glu Val Gly Arg Gly Glu145 150 155 160Ile Ala Lys Ala Asn Pro Val Val Lys Ala Asp Lys Ile Gln Arg Phe 165 170 1751191068DNAArabidopsis thalianaCDS(1)..(1065) 119atg ttg aaa gta gag agt aac tgg gca caa gcc tgt gat aca tgc cga 48Met Leu Lys Val Glu Ser Asn Trp Ala Gln Ala Cys Asp Thr Cys Arg 1 5 10 15tca gcc gcc tgc acc gtg tac tgc cgg gct gat tct gcc tac ttg tgc 96Ser Ala Ala Cys Thr Val Tyr Cys Arg Ala Asp Ser Ala Tyr Leu Cys 20 25 30tcc agt tgt gat gct caa gtt cat gct gcc aat cgt ctt gct tcc cgc 144Ser Ser Cys Asp Ala Gln Val His Ala Ala Asn Arg Leu Ala Ser Arg 35 40 45cat gaa cgt gtt cga gtc tgt caa tca tgt gag cga gcc ccg gct gcc 192His Glu Arg Val Arg Val Cys Gln Ser Cys Glu Arg Ala Pro Ala Ala 50 55 60ttt ttc tgc aag gca gat gct gca tct cta tgc aca acc tgt gat tca 240Phe Phe Cys Lys Ala Asp Ala Ala Ser Leu Cys Thr Thr Cys Asp Ser 65 70 75 80gag att cat tcc gca aac cca ctt gct aga cgc cat caa cga gtt cca 288Glu Ile His Ser Ala Asn Pro Leu Ala Arg Arg His Gln Arg Val Pro 85 90 95att ctg ccc att tct gag tac tct tac agt tcc acg gcc act aac cat 336Ile Leu Pro Ile Ser Glu Tyr Ser Tyr Ser Ser Thr Ala Thr Asn His 100 105 110tca tgt gag aca aca gtg aca gat cca gag aac aga ctt gtg ctt ggt 384Ser Cys Glu Thr Thr Val Thr Asp Pro Glu Asn Arg Leu Val Leu Gly 115 120 125caa gaa gaa gag gat gaa gat gaa gca gag gcg gct tca tgg ttg ttg 432Gln Glu Glu Glu Asp Glu Asp Glu Ala Glu Ala Ala Ser Trp Leu Leu 130 135 140cct aat tca ggg aaa aac agt ggt aac aac aat ggc ttc tcg att ggg 480Pro Asn Ser Gly Lys Asn Ser Gly Asn Asn Asn Gly Phe Ser Ile Gly145 150 155 160gat gag ttt ctg aac ctt gtt gat tat agt tcg agt gat aag caa ttc 528Asp Glu Phe Leu Asn Leu Val Asp Tyr Ser Ser Ser Asp Lys Gln Phe 165 170 175aca gat caa tcc aat cag tat caa cta gac tgc aac gta cct cag agg 576Thr Asp Gln Ser Asn Gln Tyr Gln Leu Asp Cys Asn Val Pro Gln Arg 180 185 190agc tat ggg gaa gat gga gtt gtt cca ctt caa att gaa gta tca aag 624Ser Tyr Gly Glu Asp Gly Val Val Pro Leu Gln Ile Glu Val Ser Lys 195 200 205ggc atg tac caa gag caa cag aac ttt cag ctg agt atc aac tgt ggc 672Gly Met Tyr Gln Glu Gln Gln Asn Phe Gln Leu Ser Ile Asn Cys Gly 210 215 220tcc tgg gga gct ctt cga agc tcc aat ggt tcc ctc agt cat atg gtg 720Ser Trp Gly Ala Leu Arg Ser Ser Asn Gly Ser Leu Ser His Met Val225 230 235 240aat gtt tca tct atg gac ctg gga gtt gtg ccg gag tca aca acg agt 768Asn Val Ser Ser Met Asp Leu Gly Val Val Pro Glu Ser Thr Thr Ser 245 250 255gac gca aca gta tca aac cca aga tcg ccc aaa gcg gta aca gac caa 816Asp Ala Thr Val Ser Asn Pro Arg Ser Pro Lys Ala Val Thr Asp Gln 260 265 270cca cct tac cct cca gct cag atg ctc agt cca agg gac aga gaa gct 864Pro Pro Tyr Pro Pro Ala Gln Met Leu Ser Pro Arg Asp Arg Glu Ala 275 280 285aga gtc ctg aga tac aga gag aag aag aag atg agg aaa ttt gag aag 912Arg Val Leu Arg Tyr Arg Glu Lys Lys Lys Met Arg Lys Phe Glu Lys 290 295 300acg ata aga tat gct tca agg aaa gcg tat gca gag aaa aga cca cgg 960Thr Ile Arg Tyr Ala Ser Arg Lys Ala Tyr Ala Glu Lys Arg Pro Arg305 310 315 320atc aag ggc cgg ttt gca aag aag aaa gat gtc gat gaa gag gca aac 1008Ile Lys Gly Arg Phe Ala Lys Lys Lys Asp Val Asp Glu Glu Ala Asn 325 330 335caa gct ttc tcc aca atg ata aca ttt gac acc gga tat gga att gtt 1056Gln Ala Phe Ser Thr Met Ile Thr Phe Asp Thr Gly Tyr Gly Ile Val 340 345 350cca tca ttc tga 1068Pro Ser Phe 355120355PRTArabidopsis thaliana 120Met Leu Lys Val Glu Ser Asn Trp Ala Gln Ala Cys Asp Thr Cys Arg 1 5 10 15Ser Ala Ala Cys Thr Val Tyr Cys Arg Ala Asp Ser Ala Tyr Leu Cys 20 25 30Ser Ser Cys Asp Ala Gln Val His Ala Ala Asn Arg Leu Ala Ser Arg 35 40 45His Glu Arg Val Arg Val Cys Gln Ser Cys Glu Arg Ala Pro Ala Ala 50 55 60Phe Phe Cys Lys Ala Asp Ala Ala Ser Leu Cys Thr Thr Cys Asp Ser 65 70 75 80Glu Ile His Ser Ala Asn Pro Leu Ala Arg Arg His Gln Arg Val Pro 85 90 95Ile Leu Pro Ile Ser Glu Tyr Ser Tyr Ser Ser Thr Ala Thr Asn His 100 105 110Ser Cys Glu Thr Thr Val Thr Asp Pro Glu Asn Arg Leu Val Leu Gly 115 120 125Gln Glu Glu Glu Asp Glu Asp Glu Ala Glu Ala Ala Ser Trp Leu Leu 130 135 140Pro Asn Ser Gly Lys Asn Ser Gly Asn Asn Asn Gly Phe Ser Ile Gly145 150 155 160Asp Glu Phe Leu Asn Leu Val Asp Tyr Ser Ser Ser Asp Lys Gln Phe 165 170 175Thr Asp Gln Ser Asn Gln Tyr Gln Leu Asp Cys Asn Val Pro Gln Arg 180 185 190Ser Tyr Gly Glu Asp Gly Val Val Pro Leu Gln Ile Glu Val Ser Lys 195 200 205Gly Met Tyr Gln Glu Gln Gln Asn Phe Gln Leu Ser Ile Asn Cys Gly 210 215 220Ser Trp Gly Ala Leu Arg Ser Ser Asn Gly Ser Leu Ser His Met Val225 230 235 240Asn Val Ser Ser Met Asp Leu Gly Val Val Pro Glu Ser Thr Thr Ser 245 250 255Asp Ala Thr Val Ser Asn Pro Arg Ser Pro Lys Ala Val Thr Asp Gln 260 265 270Pro Pro Tyr Pro Pro Ala Gln Met Leu Ser Pro Arg Asp Arg Glu Ala 275 280 285Arg Val Leu Arg Tyr Arg Glu Lys Lys Lys Met Arg Lys Phe Glu Lys 290 295 300Thr Ile Arg Tyr Ala Ser Arg Lys Ala Tyr Ala Glu Lys Arg Pro Arg305 310 315 320Ile Lys Gly Arg Phe Ala Lys Lys Lys Asp Val Asp Glu Glu Ala Asn 325 330 335Gln Ala Phe Ser Thr Met Ile Thr Phe Asp Thr Gly Tyr Gly Ile Val 340 345 350Pro Ser Phe 3551211116DNAArabidopsis thalianaCDS(191)..(934) 121atcccaccta cttgttcccc acaaaacact ctctccctct ttgttctttc atcttctcta 60agctctttct ctgaacctac gcttctgcta agctattcta agagaagcca gactagcaat 120aaacccttca ttttaagcat tctgtttcct tcttgagaaa cctagatatt ttggtttctt 180gtatccggtg atg aag ata cag tgt gat gtg tgt gag aaa gct ccg gcg 229 Met Lys Ile Gln Cys Asp Val Cys Glu Lys Ala Pro Ala 1 5 10acg gtg att tgt tgc gcc gac gaa gct gct ctc tgt cct caa tgc gac 277Thr Val Ile Cys Cys Ala Asp Glu Ala Ala Leu Cys Pro Gln Cys Asp 15 20 25atc gag att cac gcc gct aac aaa ctc gct agc aag cac caa cgt ctt 325Ile Glu Ile His Ala Ala Asn Lys Leu Ala Ser Lys His Gln Arg Leu 30 35 40 45cat ctt aat tcc ctc tcc acc aaa ttc cct cgt tgc gat atc tgc caa 373His Leu Asn Ser Leu Ser Thr Lys Phe Pro Arg Cys Asp Ile Cys Gln 50 55 60gag aag gca gct ttc att ttc tgt gta gag gat aga gct ctg ctt tgc 421Glu Lys Ala Ala Phe Ile Phe Cys Val Glu Asp Arg Ala Leu Leu Cys 65 70 75agg gac tgc gat gaa tcc atc cac gtg gct aat tct cga tct gct aat 469Arg Asp Cys Asp Glu Ser Ile His Val Ala Asn Ser Arg Ser Ala Asn 80 85 90cac cag agg ttc tta gcc act ggg atc aaa gta gct ctg acc tca act 517His Gln Arg Phe Leu Ala Thr Gly Ile Lys Val Ala Leu Thr Ser Thr 95 100 105ata tgt agt aaa gaa att gag aag aat caa cct gag cct tcc aac aac 565Ile Cys Ser Lys Glu Ile Glu Lys Asn Gln Pro Glu Pro Ser Asn Asn110 115 120 125caa cag aag gct aat cag att cct gct aaa tcc aca agc cag cag caa 613Gln Gln Lys Ala Asn Gln Ile Pro Ala Lys Ser Thr Ser Gln Gln Gln 130 135 140caa caa cct tct tct gct act cca ctt ccc tgg gct gtt gac gat ttc 661Gln Gln Pro Ser Ser Ala Thr Pro Leu Pro Trp Ala Val Asp Asp Phe 145 150 155ttt cac ttc tct gat att gaa tcc acc gac aag aaa gga cag ctt gat 709Phe His Phe Ser Asp Ile Glu Ser Thr Asp Lys Lys Gly Gln Leu Asp 160 165 170ctt ggg gca ggg gag ttg gat tgg ttt tca gac atg gga ttc ttc ggt 757Leu Gly Ala Gly Glu Leu Asp Trp Phe Ser Asp Met Gly Phe Phe Gly 175 180 185gat cag att aat gac aag gct ctt cct gca gct gaa gtt cct gag ctt 805Asp Gln Ile Asn Asp Lys Ala Leu Pro Ala Ala Glu Val Pro Glu Leu190 195 200 205tct gtt tcg cat tta ggt cat gtt cat tca tac aaa cct atg aag tca 853Ser Val Ser His Leu Gly His Val His Ser Tyr Lys Pro Met Lys Ser 210 215 220aat gtt tca cac aag aag ccg agg ttt gag acc aga tat gat gat gat 901Asn Val Ser His Lys Lys Pro Arg Phe Glu Thr Arg Tyr Asp Asp Asp 225 230 235gat gag gaa cac ttc att gtc cct gat ctt ggc taaaaagcta tatgtaatct 954Asp Glu Glu His Phe Ile Val Pro Asp Leu Gly 240 245atgtgtagac attcttcaat gtaaaagaac aaacaagaaa cctatctgca tgtgtggagt 1014taatgtcata tacattttag ttttgtctta agttgtgtaa gatatgttga gagcttataa 1074caaatgtctg tgtttgagtt ttgttcaaaa aaaaaaaaaa aa 1116122248PRTArabidopsis thaliana 122Met Lys Ile Gln Cys Asp Val Cys Glu Lys Ala Pro Ala Thr Val Ile 1 5 10 15Cys Cys Ala Asp Glu Ala Ala Leu Cys Pro Gln Cys Asp Ile Glu Ile 20 25 30His Ala Ala Asn Lys Leu Ala Ser Lys His Gln Arg Leu His Leu Asn 35 40 45Ser Leu Ser Thr Lys Phe Pro Arg Cys Asp Ile Cys Gln Glu Lys Ala 50 55 60Ala Phe Ile Phe Cys Val Glu Asp Arg Ala Leu Leu Cys Arg Asp Cys 65 70 75 80Asp Glu Ser Ile His Val Ala Asn Ser Arg Ser Ala Asn His Gln Arg 85 90 95Phe Leu Ala Thr Gly Ile Lys Val Ala Leu Thr Ser Thr Ile Cys Ser 100 105 110Lys Glu Ile Glu Lys Asn Gln Pro Glu Pro Ser Asn Asn Gln Gln Lys 115 120 125Ala Asn Gln Ile Pro Ala Lys Ser Thr Ser Gln Gln Gln Gln Gln Pro 130 135 140Ser Ser Ala Thr Pro Leu Pro Trp Ala Val Asp Asp Phe Phe His Phe145 150 155 160Ser Asp Ile Glu Ser Thr Asp Lys Lys Gly Gln Leu Asp Leu Gly Ala 165 170 175Gly Glu Leu Asp Trp Phe Ser Asp Met Gly Phe Phe Gly Asp Gln Ile 180 185 190Asn Asp Lys Ala Leu Pro Ala Ala Glu Val Pro Glu Leu Ser Val Ser 195 200 205His Leu Gly His Val His Ser Tyr Lys Pro Met Lys Ser Asn Val Ser 210 215 220His Lys Lys Pro Arg Phe Glu Thr Arg Tyr Asp Asp Asp Asp Glu Glu225 230 235 240His Phe Ile Val Pro Asp Leu Gly 245123909DNAArabidopsis thalianaCDS(1)..(906) 123atg gat cag tac tca tcc tct ttg gtc gat act tca tta gat ctc act 48Met Asp Gln Tyr Ser Ser Ser Leu Val Asp Thr Ser Leu Asp Leu Thr 1 5 10 15att ggc gtt act cgt atg cga gtt gaa gaa gat cca ccg aca agt gct 96Ile Gly Val Thr Arg Met Arg Val Glu Glu Asp Pro Pro Thr Ser Ala 20 25 30ttg gtg gaa gaa tta aac cga gtt agt gct gag aac aag aag ctc tcg 144Leu Val Glu Glu Leu Asn Arg Val Ser Ala Glu Asn Lys Lys Leu Ser 35 40 45gag atg cta act ttg atg tgt gac aac tac aac gtc ttg agg aag caa 192Glu Met Leu Thr Leu Met Cys Asp Asn Tyr Asn Val Leu Arg Lys Gln 50 55 60ctt atg gaa tat gtt aac aag agc aac ata acc gag agg gat caa atc 240Leu Met Glu Tyr Val Asn Lys Ser Asn Ile Thr Glu Arg Asp Gln Ile 65 70 75 80agc cct ccc aag aaa cgc aaa tcc ccg gcg aga gag gac gca ttc agc 288Ser Pro Pro Lys Lys Arg Lys Ser Pro Ala Arg Glu Asp Ala Phe Ser 85 90 95tgc gcg gtt att ggc gga gtg tcg gag agt agc tca acg gat caa gat 336Cys Ala Val Ile Gly Gly Val Ser Glu Ser Ser Ser Thr Asp Gln Asp 100 105 110gag tat ttg tgt aag aag cag aga gaa gag act gtc gtg aag gag aaa 384Glu Tyr Leu Cys Lys Lys Gln Arg Glu Glu Thr Val Val Lys Glu Lys 115 120 125gtc tca agg gtc tat tac aag acc gaa gct tct gac act acc ctc gtt 432Val Ser Arg Val Tyr Tyr Lys Thr Glu Ala Ser Asp Thr Thr Leu Val 130 135 140gtg aaa gat ggg tat caa tgg agg aaa tat gga cag aaa gtg act aga 480Val Lys Asp Gly Tyr Gln Trp Arg Lys Tyr Gly Gln Lys Val Thr Arg145 150 155 160gac aat cca tct cca aga gct tac ttc aaa tgt gct tgt gct cca agc 528Asp Asn Pro Ser Pro Arg Ala Tyr Phe Lys Cys Ala Cys Ala Pro Ser 165 170 175tgt tct gtc aaa aag aag gtt cag aga agt gtg gag gat cag tcc gtg 576Cys Ser Val Lys Lys Lys Val Gln Arg Ser Val Glu Asp Gln Ser Val 180 185 190tta gtt gca act tat gag ggt gaa cac aac cat cca atg cca tcg cag 624Leu Val Ala Thr Tyr Glu Gly Glu His Asn His Pro Met Pro Ser Gln 195 200 205atc gat tca aac aat ggc tta aac cgc cac atc tct cat ggt ggt tca 672Ile Asp Ser Asn Asn Gly Leu Asn Arg His Ile Ser His Gly Gly Ser 210 215 220gct tca aca ccc gtt gca gca aac aga aga agt agc ttg act gtg ccg 720Ala Ser Thr Pro Val Ala Ala Asn Arg Arg Ser Ser Leu Thr Val Pro225 230 235 240gtg act acc gta gat atg att gaa tcg aag aaa gtg acg agc cca acg 768Val Thr Thr Val Asp Met Ile Glu Ser Lys Lys Val Thr Ser Pro Thr 245 250 255tca aga atc gat ttt ccc caa gtt cag aaa ctt ttg gtg gag caa atg 816Ser Arg Ile Asp Phe Pro Gln Val Gln Lys Leu Leu Val Glu Gln Met 260 265 270gct tct tcc tta acc aaa gat cct aac ttt aca gca gct tta gca gca 864Ala Ser Ser Leu Thr Lys Asp Pro Asn Phe Thr Ala Ala Leu Ala Ala 275 280 285gct gtt acc gga aaa ttg tat caa cag aat cat acc gag aaa tag 909Ala Val Thr Gly Lys Leu Tyr Gln Gln Asn His Thr Glu Lys 290 295 300124302PRTArabidopsis thaliana 124Met Asp Gln Tyr Ser Ser Ser Leu Val Asp Thr Ser Leu Asp Leu Thr 1 5 10 15Ile Gly Val Thr Arg Met Arg Val Glu Glu Asp Pro Pro Thr Ser Ala 20 25 30Leu Val Glu Glu Leu Asn Arg Val Ser Ala Glu Asn Lys Lys Leu Ser 35 40 45Glu Met Leu Thr Leu Met Cys Asp Asn Tyr Asn Val Leu Arg Lys Gln 50 55 60Leu Met Glu Tyr Val Asn Lys Ser Asn Ile Thr Glu Arg Asp Gln Ile 65 70 75 80Ser Pro Pro Lys Lys Arg Lys Ser Pro Ala Arg Glu Asp Ala Phe Ser 85 90 95Cys Ala Val Ile Gly Gly Val Ser Glu Ser Ser Ser Thr Asp Gln Asp 100 105 110Glu Tyr Leu Cys Lys Lys Gln Arg Glu Glu Thr Val Val Lys Glu Lys 115 120 125Val Ser Arg Val Tyr Tyr Lys Thr Glu Ala Ser

Asp Thr Thr Leu Val 130 135 140Val Lys Asp Gly Tyr Gln Trp Arg Lys Tyr Gly Gln Lys Val Thr Arg145 150 155 160Asp Asn Pro Ser Pro Arg Ala Tyr Phe Lys Cys Ala Cys Ala Pro Ser 165 170 175Cys Ser Val Lys Lys Lys Val Gln Arg Ser Val Glu Asp Gln Ser Val 180 185 190Leu Val Ala Thr Tyr Glu Gly Glu His Asn His Pro Met Pro Ser Gln 195 200 205Ile Asp Ser Asn Asn Gly Leu Asn Arg His Ile Ser His Gly Gly Ser 210 215 220Ala Ser Thr Pro Val Ala Ala Asn Arg Arg Ser Ser Leu Thr Val Pro225 230 235 240Val Thr Thr Val Asp Met Ile Glu Ser Lys Lys Val Thr Ser Pro Thr 245 250 255Ser Arg Ile Asp Phe Pro Gln Val Gln Lys Leu Leu Val Glu Gln Met 260 265 270Ala Ser Ser Leu Thr Lys Asp Pro Asn Phe Thr Ala Ala Leu Ala Ala 275 280 285Ala Val Thr Gly Lys Leu Tyr Gln Gln Asn His Thr Glu Lys 290 295 3001251462DNAArabidopsis thalianaCDS(382)..(1161) 125accgaccttc ttggttcttc cggcgttgac tgttacgaag atgatgaaga cttgagagtt 60tctgggtcga gttttggtgg gtactatcca gagagaaccg ggtctggttt acctaagttc 120aagacggctc aaccaccacc tcttccgatt tcacaatctt ctcataactt cactttctcc 180gattaccttg attctcctct gcttctcagc tcctcacaca gtttgatatc tccaacaaca 240ggaacgtttc cattgcaagg ctttaatgga acaacaaaca atcactcaga ttttccctgg 300cagctacaat ctcaaccatc aaacgcttct tctgctttgc aagaaacata tggtgttcaa 360gatcacgaga agaagcagga g atg att cct aat gag att gca aca caa aac 411 Met Ile Pro Asn Glu Ile Ala Thr Gln Asn 1 5 10aac aat caa agt ttt gga aca gaa cgt cag ata aag ata cca gca tac 459Asn Asn Gln Ser Phe Gly Thr Glu Arg Gln Ile Lys Ile Pro Ala Tyr 15 20 25atg gtg agt agg aac tct aat gat ggt tat ggt tgg aga aaa tac ggt 507Met Val Ser Arg Asn Ser Asn Asp Gly Tyr Gly Trp Arg Lys Tyr Gly 30 35 40cag aaa caa gtg aag aag agc gaa aac cct agg agt tac ttc aag tgt 555Gln Lys Gln Val Lys Lys Ser Glu Asn Pro Arg Ser Tyr Phe Lys Cys 45 50 55acg tat cct gat tgt gtt tcc aag aag att gtt gag acg gct tct gat 603Thr Tyr Pro Asp Cys Val Ser Lys Lys Ile Val Glu Thr Ala Ser Asp 60 65 70gga cag atc act gag atc att tat aaa ggt ggt cat aat cat cct aag 651Gly Gln Ile Thr Glu Ile Ile Tyr Lys Gly Gly His Asn His Pro Lys 75 80 85 90cct gag ttc acc aag aga cca tct caa tct tca tta cca tca tcg gtt 699Pro Glu Phe Thr Lys Arg Pro Ser Gln Ser Ser Leu Pro Ser Ser Val 95 100 105aat ggg agg cgc ttg ttt aat cct gct tct gtt gtt agt gaa cct cat 747Asn Gly Arg Arg Leu Phe Asn Pro Ala Ser Val Val Ser Glu Pro His 110 115 120gat caa tca gag aac tct tcg att tcg ttt gac tat agt gat ctt gag 795Asp Gln Ser Glu Asn Ser Ser Ile Ser Phe Asp Tyr Ser Asp Leu Glu 125 130 135cag aaa agt ttt aaa tca gag tat ggt gag ata gat gaa gag gag gaa 843Gln Lys Ser Phe Lys Ser Glu Tyr Gly Glu Ile Asp Glu Glu Glu Glu 140 145 150caa cct gag atg aag agg atg aaa aga gaa ggt gaa gat gaa ggg atg 891Gln Pro Glu Met Lys Arg Met Lys Arg Glu Gly Glu Asp Glu Gly Met155 160 165 170tct ata gaa gta agc aaa gga gtt aaa gag cca aga gtt gtg gtt cag 939Ser Ile Glu Val Ser Lys Gly Val Lys Glu Pro Arg Val Val Val Gln 175 180 185aca ata agt gat att gat gtt ctt ata gat ggc ttt aga tgg agg aaa 987Thr Ile Ser Asp Ile Asp Val Leu Ile Asp Gly Phe Arg Trp Arg Lys 190 195 200tat ggt caa aaa gtt gtc aaa gga aat act aat cca agg agc tac tac 1035Tyr Gly Gln Lys Val Val Lys Gly Asn Thr Asn Pro Arg Ser Tyr Tyr 205 210 215aag tgc aca ttc caa ggt tgt gga gtg aag aag caa gtg gaa aga tcc 1083Lys Cys Thr Phe Gln Gly Cys Gly Val Lys Lys Gln Val Glu Arg Ser 220 225 230gca gca gac gag aga gca gtt ctc act acc tat gaa gga aga cac aat 1131Ala Ala Asp Glu Arg Ala Val Leu Thr Thr Tyr Glu Gly Arg His Asn235 240 245 250cac gat atc cca acc gcg cta cgt cgc tcg tgaaattatt gggacttagt 1181His Asp Ile Pro Thr Ala Leu Arg Arg Ser 255 260cactagtaat atgatttagg ctttctaaaa acaaaaaatc ttactatggc ttatcttttg 1241tgctcattca cagtttgttt atttgtttgt tacacagtca atactttgtt ttgtacagag 1301tggtgcttag tagtgttttt attattatct tggccttata gaataacctc tcttctcatc 1361tgtgtgactt taaacacttg agagtccatt ttatagttct tgtgtattgg tcttttgttt 1421gatttatgta catttttaat attcgaaaaa aaaaaaaaaa a 1462126260PRTArabidopsis thaliana 126Met Ile Pro Asn Glu Ile Ala Thr Gln Asn Asn Asn Gln Ser Phe Gly 1 5 10 15Thr Glu Arg Gln Ile Lys Ile Pro Ala Tyr Met Val Ser Arg Asn Ser 20 25 30Asn Asp Gly Tyr Gly Trp Arg Lys Tyr Gly Gln Lys Gln Val Lys Lys 35 40 45Ser Glu Asn Pro Arg Ser Tyr Phe Lys Cys Thr Tyr Pro Asp Cys Val 50 55 60Ser Lys Lys Ile Val Glu Thr Ala Ser Asp Gly Gln Ile Thr Glu Ile 65 70 75 80Ile Tyr Lys Gly Gly His Asn His Pro Lys Pro Glu Phe Thr Lys Arg 85 90 95Pro Ser Gln Ser Ser Leu Pro Ser Ser Val Asn Gly Arg Arg Leu Phe 100 105 110Asn Pro Ala Ser Val Val Ser Glu Pro His Asp Gln Ser Glu Asn Ser 115 120 125Ser Ile Ser Phe Asp Tyr Ser Asp Leu Glu Gln Lys Ser Phe Lys Ser 130 135 140Glu Tyr Gly Glu Ile Asp Glu Glu Glu Glu Gln Pro Glu Met Lys Arg145 150 155 160Met Lys Arg Glu Gly Glu Asp Glu Gly Met Ser Ile Glu Val Ser Lys 165 170 175Gly Val Lys Glu Pro Arg Val Val Val Gln Thr Ile Ser Asp Ile Asp 180 185 190Val Leu Ile Asp Gly Phe Arg Trp Arg Lys Tyr Gly Gln Lys Val Val 195 200 205Lys Gly Asn Thr Asn Pro Arg Ser Tyr Tyr Lys Cys Thr Phe Gln Gly 210 215 220Cys Gly Val Lys Lys Gln Val Glu Arg Ser Ala Ala Asp Glu Arg Ala225 230 235 240Val Leu Thr Thr Tyr Glu Gly Arg His Asn His Asp Ile Pro Thr Ala 245 250 255Leu Arg Arg Ser 260127438DNAArabidopsis thalianaCDS(1)..(435) 127atg gag gga tat gat aat ggg tcg ttg tat gct cct ttt ttg tcg ttg 48Met Glu Gly Tyr Asp Asn Gly Ser Leu Tyr Ala Pro Phe Leu Ser Leu 1 5 10 15aaa tct cat tcg aaa cca gag ctg cat caa ggc gaa gaa gag agc tca 96Lys Ser His Ser Lys Pro Glu Leu His Gln Gly Glu Glu Glu Ser Ser 20 25 30aag gtt aga tca gaa ggt tgt tcg aaa agc gtg gag tcg tcg aaa aag 144Lys Val Arg Ser Glu Gly Cys Ser Lys Ser Val Glu Ser Ser Lys Lys 35 40 45aag ggg aag aaa caa agg tat gcg ttt caa aca agg agc caa gtg gat 192Lys Gly Lys Lys Gln Arg Tyr Ala Phe Gln Thr Arg Ser Gln Val Asp 50 55 60att ctt gat gat ggt tat cga tgg agg aaa tat ggc caa aag gcc gtc 240Ile Leu Asp Asp Gly Tyr Arg Trp Arg Lys Tyr Gly Gln Lys Ala Val 65 70 75 80aag aac aac aag ttc cct agg agt tac tat agg tgt aca tat gga gga 288Lys Asn Asn Lys Phe Pro Arg Ser Tyr Tyr Arg Cys Thr Tyr Gly Gly 85 90 95tgc aat gtg aag aag caa gtg caa aga tta aca gtg gac caa gaa gtg 336Cys Asn Val Lys Lys Gln Val Gln Arg Leu Thr Val Asp Gln Glu Val 100 105 110gtc gtg aca acc tac gaa gga gtg cat tcg cat ccc atc gag aaa tcc 384Val Val Thr Thr Tyr Glu Gly Val His Ser His Pro Ile Glu Lys Ser 115 120 125acc gaa aac ttc gag cat att ctc act caa atg caa atc tac tct tct 432Thr Glu Asn Phe Glu His Ile Leu Thr Gln Met Gln Ile Tyr Ser Ser 130 135 140ttc tag 438Phe145128145PRTArabidopsis thaliana 128Met Glu Gly Tyr Asp Asn Gly Ser Leu Tyr Ala Pro Phe Leu Ser Leu 1 5 10 15Lys Ser His Ser Lys Pro Glu Leu His Gln Gly Glu Glu Glu Ser Ser 20 25 30Lys Val Arg Ser Glu Gly Cys Ser Lys Ser Val Glu Ser Ser Lys Lys 35 40 45Lys Gly Lys Lys Gln Arg Tyr Ala Phe Gln Thr Arg Ser Gln Val Asp 50 55 60Ile Leu Asp Asp Gly Tyr Arg Trp Arg Lys Tyr Gly Gln Lys Ala Val 65 70 75 80Lys Asn Asn Lys Phe Pro Arg Ser Tyr Tyr Arg Cys Thr Tyr Gly Gly 85 90 95Cys Asn Val Lys Lys Gln Val Gln Arg Leu Thr Val Asp Gln Glu Val 100 105 110Val Val Thr Thr Tyr Glu Gly Val His Ser His Pro Ile Glu Lys Ser 115 120 125Thr Glu Asn Phe Glu His Ile Leu Thr Gln Met Gln Ile Tyr Ser Ser 130 135 140Phe1451291254DNAArabidopsis thalianaCDS(137)..(1090) 129aaaagtccaa gcaccaatct agacctctta ggaaaaaaac ctaaaaacct aatccccaaa 60cctaaaaggc ttatctcatc tcttcttctt tgtcttcttt actctttttt tacctctctc 120ttcattgttc ttcacc atg tct aat gaa acc aga gat ctc tac aac tac caa 172 Met Ser Asn Glu Thr Arg Asp Leu Tyr Asn Tyr Gln 1 5 10tac cct tca tcg ttt tcg ttg cac gaa atg atg aat ctg cct act tca 220Tyr Pro Ser Ser Phe Ser Leu His Glu Met Met Asn Leu Pro Thr Ser 15 20 25aat cca tct tct tat gga aac ctc cca tca caa aac ggt ttt aat cca 268Asn Pro Ser Ser Tyr Gly Asn Leu Pro Ser Gln Asn Gly Phe Asn Pro 30 35 40tct act tat tcc ttc acc gat tgt ctc caa agt tct cca gca gcg tat 316Ser Thr Tyr Ser Phe Thr Asp Cys Leu Gln Ser Ser Pro Ala Ala Tyr 45 50 55 60gaa tct cta ctt cag aaa act ttt ggt ctt tct ccc tct tcc tca gag 364Glu Ser Leu Leu Gln Lys Thr Phe Gly Leu Ser Pro Ser Ser Ser Glu 65 70 75gtt ttc aat tct tcg atc gat caa gaa ccg aac cgt gat gtt act aat 412Val Phe Asn Ser Ser Ile Asp Gln Glu Pro Asn Arg Asp Val Thr Asn 80 85 90gac gta atc aat ggt ggt gca tgc aac gag act gaa act agg gtt tct 460Asp Val Ile Asn Gly Gly Ala Cys Asn Glu Thr Glu Thr Arg Val Ser 95 100 105cct tct aat tct tcc tct agt gag gct gat cac ccc ggt gaa gat tcc 508Pro Ser Asn Ser Ser Ser Ser Glu Ala Asp His Pro Gly Glu Asp Ser 110 115 120ggt aag agc cgg agg aaa cga gag tta gtc ggt gaa gaa gat caa att 556Gly Lys Ser Arg Arg Lys Arg Glu Leu Val Gly Glu Glu Asp Gln Ile125 130 135 140tcc aaa aaa gtt ggg aaa acg aaa aag act gag gtg aag aaa caa aga 604Ser Lys Lys Val Gly Lys Thr Lys Lys Thr Glu Val Lys Lys Gln Arg 145 150 155gag cca cga gtc tcg ttt atg act aaa agt gaa gtt gat cat ctt gaa 652Glu Pro Arg Val Ser Phe Met Thr Lys Ser Glu Val Asp His Leu Glu 160 165 170gat ggt tat aga tgg aga aaa tac ggc caa aag gct gta aaa aat agc 700Asp Gly Tyr Arg Trp Arg Lys Tyr Gly Gln Lys Ala Val Lys Asn Ser 175 180 185cct tat cca agg agt tac tat aga tgt aca aca caa aag tgc aac gtg 748Pro Tyr Pro Arg Ser Tyr Tyr Arg Cys Thr Thr Gln Lys Cys Asn Val 190 195 200aag aaa cga gtg gag aga tcg ttc caa gat cca acg gtt gtg att aca 796Lys Lys Arg Val Glu Arg Ser Phe Gln Asp Pro Thr Val Val Ile Thr205 210 215 220act tac gag ggt caa cac aac cac ccg att ccg act aat ctt cga gga 844Thr Tyr Glu Gly Gln His Asn His Pro Ile Pro Thr Asn Leu Arg Gly 225 230 235agt tct gcc gcg gct gct atg ttc tcc gca gac ctc atg act cca aga 892Ser Ser Ala Ala Ala Ala Met Phe Ser Ala Asp Leu Met Thr Pro Arg 240 245 250agc ttt gca cat gat atg ttt agg acg gca gct tat act aac ggc ggt 940Ser Phe Ala His Asp Met Phe Arg Thr Ala Ala Tyr Thr Asn Gly Gly 255 260 265tct gtg gcg gcg gct ttg gat tat gga tat gga caa agt ggt tat ggt 988Ser Val Ala Ala Ala Leu Asp Tyr Gly Tyr Gly Gln Ser Gly Tyr Gly 270 275 280agt gtg aat tca aac cct agt tct cac caa gtg tat cat caa ggg ggt 1036Ser Val Asn Ser Asn Pro Ser Ser His Gln Val Tyr His Gln Gly Gly285 290 295 300gag tat gag ctc ttg agg gag att ttt cct tca att ttc ttt aag caa 1084Glu Tyr Glu Leu Leu Arg Glu Ile Phe Pro Ser Ile Phe Phe Lys Gln 305 310 315gag cct tgatcgatca ttgttataac tacatatatt atatatattg agagagagag 1140Glu Progtagagaaaa aaaaaactta tatgtaactt aagatcttat tttgtctctc ttatttgcat 1200gtacatattt tttcatgaaa gaatgagaca gttgggcttg cttaaaaaaa aaat 1254130318PRTArabidopsis thaliana 130Met Ser Asn Glu Thr Arg Asp Leu Tyr Asn Tyr Gln Tyr Pro Ser Ser 1 5 10 15Phe Ser Leu His Glu Met Met Asn Leu Pro Thr Ser Asn Pro Ser Ser 20 25 30Tyr Gly Asn Leu Pro Ser Gln Asn Gly Phe Asn Pro Ser Thr Tyr Ser 35 40 45Phe Thr Asp Cys Leu Gln Ser Ser Pro Ala Ala Tyr Glu Ser Leu Leu 50 55 60Gln Lys Thr Phe Gly Leu Ser Pro Ser Ser Ser Glu Val Phe Asn Ser 65 70 75 80Ser Ile Asp Gln Glu Pro Asn Arg Asp Val Thr Asn Asp Val Ile Asn 85 90 95Gly Gly Ala Cys Asn Glu Thr Glu Thr Arg Val Ser Pro Ser Asn Ser 100 105 110Ser Ser Ser Glu Ala Asp His Pro Gly Glu Asp Ser Gly Lys Ser Arg 115 120 125Arg Lys Arg Glu Leu Val Gly Glu Glu Asp Gln Ile Ser Lys Lys Val 130 135 140Gly Lys Thr Lys Lys Thr Glu Val Lys Lys Gln Arg Glu Pro Arg Val145 150 155 160Ser Phe Met Thr Lys Ser Glu Val Asp His Leu Glu Asp Gly Tyr Arg 165 170 175Trp Arg Lys Tyr Gly Gln Lys Ala Val Lys Asn Ser Pro Tyr Pro Arg 180 185 190Ser Tyr Tyr Arg Cys Thr Thr Gln Lys Cys Asn Val Lys Lys Arg Val 195 200 205Glu Arg Ser Phe Gln Asp Pro Thr Val Val Ile Thr Thr Tyr Glu Gly 210 215 220Gln His Asn His Pro Ile Pro Thr Asn Leu Arg Gly Ser Ser Ala Ala225 230 235 240Ala Ala Met Phe Ser Ala Asp Leu Met Thr Pro Arg Ser Phe Ala His 245 250 255Asp Met Phe Arg Thr Ala Ala Tyr Thr Asn Gly Gly Ser Val Ala Ala 260 265 270Ala Leu Asp Tyr Gly Tyr Gly Gln Ser Gly Tyr Gly Ser Val Asn Ser 275 280 285Asn Pro Ser Ser His Gln Val Tyr His Gln Gly Gly Glu Tyr Glu Leu 290 295 300Leu Arg Glu Ile Phe Pro Ser Ile Phe Phe Lys Gln Glu Pro305 310 3151311148DNAArabidopsis thalianaCDS(104)..(1075) 131acgtctctct ctttctctct actctctgtt tcctcataat tcaatcacta tattttttta 60aaaacatttg acttcatcga tcggttaaca attaatcaaa aag atg gga cga tca 115 Met Gly Arg Ser 1cca tgt tgt gag aag aag aat ggt ctc aag aaa gga cca tgg act cct 163Pro Cys Cys Glu Lys Lys Asn Gly Leu Lys Lys Gly Pro Trp Thr Pro 5 10 15 20gag gag gat caa aag ctc att gat tat atc aat ata cat ggt tat gga 211Glu Glu Asp Gln Lys Leu Ile Asp Tyr Ile Asn Ile His Gly Tyr Gly 25 30 35aat tgg aga act ctt ccc aag aat gct ggg tta caa aga tgt ggt aag 259Asn Trp Arg Thr Leu Pro Lys Asn Ala Gly Leu Gln Arg Cys Gly Lys 40 45 50agt tgt cgt ctc cgg tgg acc aac tat ctc cga cca gat att aag cgt 307Ser Cys Arg Leu Arg Trp Thr Asn Tyr Leu Arg Pro Asp Ile Lys Arg 55 60 65gga aga ttc tct ttt gaa gaa gaa gaa acc att att caa ctt cac agc 355Gly Arg Phe Ser Phe Glu Glu Glu Glu Thr Ile Ile Gln Leu His Ser 70 75 80atc atg gga aac aag tgg tct gcg att gcg gct cgt ttg cct gga aga 403Ile Met Gly Asn Lys Trp Ser Ala Ile Ala Ala Arg Leu Pro Gly Arg 85 90 95

100aca gac aac gag atc aaa aac tat tgg aac act cac atc aga aaa aga 451Thr Asp Asn Glu Ile Lys Asn Tyr Trp Asn Thr His Ile Arg Lys Arg 105 110 115ctt cta aag atg gga atc gac ccg gtt aca cac act cca cgt ctt gat 499Leu Leu Lys Met Gly Ile Asp Pro Val Thr His Thr Pro Arg Leu Asp 120 125 130ctt ctc gat atc tcc tcc att ctc agc tca tct atc tac aac tct tcg 547Leu Leu Asp Ile Ser Ser Ile Leu Ser Ser Ser Ile Tyr Asn Ser Ser 135 140 145cat cat cat cat cat cat cat caa caa cat atg aac atg tcg agg ctc 595His His His His His His His Gln Gln His Met Asn Met Ser Arg Leu 150 155 160atg atg agt gat ggt aat cat caa cca ttg gtt aac ccc gag ata ctc 643Met Met Ser Asp Gly Asn His Gln Pro Leu Val Asn Pro Glu Ile Leu165 170 175 180aaa ctc gca acc tct ctc ttt tca aac caa aac cac ccc aac aac aca 691Lys Leu Ala Thr Ser Leu Phe Ser Asn Gln Asn His Pro Asn Asn Thr 185 190 195cac gag aac aac acg gtt aac caa acc gaa gta aac caa tac caa acc 739His Glu Asn Asn Thr Val Asn Gln Thr Glu Val Asn Gln Tyr Gln Thr 200 205 210ggt tac aac atg cct ggt aat gaa gaa tta caa tct tgg ttc cct atc 787Gly Tyr Asn Met Pro Gly Asn Glu Glu Leu Gln Ser Trp Phe Pro Ile 215 220 225atg gat caa ttc acg aat ttc caa gac ctc atg cca atg aag acg acg 835Met Asp Gln Phe Thr Asn Phe Gln Asp Leu Met Pro Met Lys Thr Thr 230 235 240gtc caa aat tca ttg tca tac gat gat gat tgt tcg aag tcc aat ttt 883Val Gln Asn Ser Leu Ser Tyr Asp Asp Asp Cys Ser Lys Ser Asn Phe245 250 255 260gta tta gaa cct tat tac tcc gac ttt gct tca gtc ttg acc aca cct 931Val Leu Glu Pro Tyr Tyr Ser Asp Phe Ala Ser Val Leu Thr Thr Pro 265 270 275tct tca agc ccg act ccg tta aac tca agt tcc tca act tac atc aat 979Ser Ser Ser Pro Thr Pro Leu Asn Ser Ser Ser Ser Thr Tyr Ile Asn 280 285 290agt agc act tgc agc acc gag gat gaa aaa gag agt tat tac agt gat 1027Ser Ser Thr Cys Ser Thr Glu Asp Glu Lys Glu Ser Tyr Tyr Ser Asp 295 300 305aat atc act aat tat tcg ttt gat gtt aat ggt ttt ctc caa ttc caa 1075Asn Ile Thr Asn Tyr Ser Phe Asp Val Asn Gly Phe Leu Gln Phe Gln 310 315 320taaacaaaac gccattggaa tagagttatg taaacatgca atcattgtat ttgttatata 1135gattttgtta cat 1148132324PRTArabidopsis thaliana 132Met Gly Arg Ser Pro Cys Cys Glu Lys Lys Asn Gly Leu Lys Lys Gly 1 5 10 15Pro Trp Thr Pro Glu Glu Asp Gln Lys Leu Ile Asp Tyr Ile Asn Ile 20 25 30His Gly Tyr Gly Asn Trp Arg Thr Leu Pro Lys Asn Ala Gly Leu Gln 35 40 45Arg Cys Gly Lys Ser Cys Arg Leu Arg Trp Thr Asn Tyr Leu Arg Pro 50 55 60Asp Ile Lys Arg Gly Arg Phe Ser Phe Glu Glu Glu Glu Thr Ile Ile 65 70 75 80Gln Leu His Ser Ile Met Gly Asn Lys Trp Ser Ala Ile Ala Ala Arg 85 90 95Leu Pro Gly Arg Thr Asp Asn Glu Ile Lys Asn Tyr Trp Asn Thr His 100 105 110Ile Arg Lys Arg Leu Leu Lys Met Gly Ile Asp Pro Val Thr His Thr 115 120 125Pro Arg Leu Asp Leu Leu Asp Ile Ser Ser Ile Leu Ser Ser Ser Ile 130 135 140Tyr Asn Ser Ser His His His His His His His Gln Gln His Met Asn145 150 155 160Met Ser Arg Leu Met Met Ser Asp Gly Asn His Gln Pro Leu Val Asn 165 170 175Pro Glu Ile Leu Lys Leu Ala Thr Ser Leu Phe Ser Asn Gln Asn His 180 185 190Pro Asn Asn Thr His Glu Asn Asn Thr Val Asn Gln Thr Glu Val Asn 195 200 205Gln Tyr Gln Thr Gly Tyr Asn Met Pro Gly Asn Glu Glu Leu Gln Ser 210 215 220Trp Phe Pro Ile Met Asp Gln Phe Thr Asn Phe Gln Asp Leu Met Pro225 230 235 240Met Lys Thr Thr Val Gln Asn Ser Leu Ser Tyr Asp Asp Asp Cys Ser 245 250 255Lys Ser Asn Phe Val Leu Glu Pro Tyr Tyr Ser Asp Phe Ala Ser Val 260 265 270Leu Thr Thr Pro Ser Ser Ser Pro Thr Pro Leu Asn Ser Ser Ser Ser 275 280 285Thr Tyr Ile Asn Ser Ser Thr Cys Ser Thr Glu Asp Glu Lys Glu Ser 290 295 300Tyr Tyr Ser Asp Asn Ile Thr Asn Tyr Ser Phe Asp Val Asn Gly Phe305 310 315 320Leu Gln Phe Gln1332684DNAArabidopsis thalianaCDS(898)..(2475) 133acttcttctt cttcttcttc tcgatttctt actgttttct tatccaacga aatctggaat 60taaaaatgga atctttatcg aatccaagct gattttgttt ctttcattga atcatctctc 120taaagtggaa ttttgtaaag agaagatctg aagttgtgta gaggagctta gtgatggaga 180caaattcgtc tggagaagat ctggttatta agactcggaa gccatatacg ataacaaagc 240aacgtgaaag gtggactgag gaagaacata atagattcat tgaagctttg aggctttatg 300gtagagcatg gcagaagatt gaagaacatg tagcaacaaa aactgctgtc cagataagaa 360gtcacgctca gaaatttttc tccaaggtaa aatcggttaa ttttgaaatg atgttctcat 420cttcattggc ttaatgctta agacttattg aaagccaggc aagttttctg cttcttttgc 480ttcttagtca ggagatagat agattacgtt tttagagttt agtaatgagc aataagtctt 540aaaatagttg gagaaatgac gagatgtaat cgttttcttt tgtttatgcc tatatcttgt 600taatccacaa acatgtacat agattcttca gaagaatgtt agtttcttta gattcttcag 660ataaacttgt gtcttcttac cgattctgag gtagtggcaa aagtgggctg agtgctagaa 720atttttgaat gttccttgtg ataagccata gaggtaaacc atttttgatt ttccagttct 780gtcatttaaa cttgttagtg tcattagatt tttgtttgtt tacgtttgtt tagagggtaa 840caaaactact ctcatctctc tcaggtagag aaagaggctg aagctaaagg tgtagct 897atg ggt caa gcg cta gac ata gct att cct cct cca cgg cct aag cgt 945Met Gly Gln Ala Leu Asp Ile Ala Ile Pro Pro Pro Arg Pro Lys Arg 1 5 10 15aaa cca aac aat cct tat cct cga aag acg gga agt gga acg atc ctt 993Lys Pro Asn Asn Pro Tyr Pro Arg Lys Thr Gly Ser Gly Thr Ile Leu 20 25 30atg tca aaa acg ggt gtg aat gat gga aaa gag tcc ctt gga tca gaa 1041Met Ser Lys Thr Gly Val Asn Asp Gly Lys Glu Ser Leu Gly Ser Glu 35 40 45aaa gtg tcg cat cct gag atg gcc aat gaa gat cga caa caa tca aag 1089Lys Val Ser His Pro Glu Met Ala Asn Glu Asp Arg Gln Gln Ser Lys 50 55 60cct gaa gag aaa act ctg cag gaa gac aac tgt tca gat tgt ttc act 1137Pro Glu Glu Lys Thr Leu Gln Glu Asp Asn Cys Ser Asp Cys Phe Thr 65 70 75 80cat cag tat ctc tct gct gca tcc tcc atg aat aaa agt tgt ata gag 1185His Gln Tyr Leu Ser Ala Ala Ser Ser Met Asn Lys Ser Cys Ile Glu 85 90 95aca tca aac gca agc act ttc cgc gag ttc ttg cct tca cgg gaa gag 1233Thr Ser Asn Ala Ser Thr Phe Arg Glu Phe Leu Pro Ser Arg Glu Glu 100 105 110gga agt cag aat aac agg gta aga aag gag tca aac tca gat ttg aat 1281Gly Ser Gln Asn Asn Arg Val Arg Lys Glu Ser Asn Ser Asp Leu Asn 115 120 125gca aaa tct ctg gaa aac ggt aat gag caa gga cct cag act tat ccg 1329Ala Lys Ser Leu Glu Asn Gly Asn Glu Gln Gly Pro Gln Thr Tyr Pro 130 135 140atg cat atc cct gtg cta gtg cca ttg ggg agc tca ata aca agt tct 1377Met His Ile Pro Val Leu Val Pro Leu Gly Ser Ser Ile Thr Ser Ser145 150 155 160cta tca cat cct cct tca gag cca gat agt cat ccc cac aca gtt gca 1425Leu Ser His Pro Pro Ser Glu Pro Asp Ser His Pro His Thr Val Ala 165 170 175gga gat tat cag tcg ttt cct aat cat ata atg tca acc ctt tta caa 1473Gly Asp Tyr Gln Ser Phe Pro Asn His Ile Met Ser Thr Leu Leu Gln 180 185 190aca ccg gct ctt tat act gcc gca act ttc gcc tca tca ttt tgg cct 1521Thr Pro Ala Leu Tyr Thr Ala Ala Thr Phe Ala Ser Ser Phe Trp Pro 195 200 205ccc gat tct agt ggt ggc tca cct gtt cca ggg aac tca cct ccg aat 1569Pro Asp Ser Ser Gly Gly Ser Pro Val Pro Gly Asn Ser Pro Pro Asn 210 215 220ctg gct gcc atg gcc gca gcc act gtt gca gct gct agt gct tgg tgg 1617Leu Ala Ala Met Ala Ala Ala Thr Val Ala Ala Ala Ser Ala Trp Trp225 230 235 240gct gcc aat gga tta tta cct tta tgt gct cct ctt agt tca ggt ggt 1665Ala Ala Asn Gly Leu Leu Pro Leu Cys Ala Pro Leu Ser Ser Gly Gly 245 250 255ttc act agt cat cct cca tct act ttt gga cca tca tgt gat gta gag 1713Phe Thr Ser His Pro Pro Ser Thr Phe Gly Pro Ser Cys Asp Val Glu 260 265 270tac aca aaa gca agc act tta caa cat ggt tct gtg cag agc cga gag 1761Tyr Thr Lys Ala Ser Thr Leu Gln His Gly Ser Val Gln Ser Arg Glu 275 280 285caa gaa cac ttc gag gca tca aag gct cga tct tca ctg gac tca gag 1809Gln Glu His Phe Glu Ala Ser Lys Ala Arg Ser Ser Leu Asp Ser Glu 290 295 300gat gtt gaa aat aag agt aaa cca gtt tgt cat gag cag cct tct gca 1857Asp Val Glu Asn Lys Ser Lys Pro Val Cys His Glu Gln Pro Ser Ala305 310 315 320aca cct gag agt gat gca aag ggt tca gat gga gca gga gac aga aaa 1905Thr Pro Glu Ser Asp Ala Lys Gly Ser Asp Gly Ala Gly Asp Arg Lys 325 330 335caa gtt gac cgg tcc tcg tgt ggc tca aac act ccg tcg agt agt gat 1953Gln Val Asp Arg Ser Ser Cys Gly Ser Asn Thr Pro Ser Ser Ser Asp 340 345 350gat gtt gag gcg gat gca tca gaa agg caa gag gat ggc acc aat ggt 2001Asp Val Glu Ala Asp Ala Ser Glu Arg Gln Glu Asp Gly Thr Asn Gly 355 360 365gag gtg aaa gaa acg aat gaa gac act aat aaa cct caa act tca gag 2049Glu Val Lys Glu Thr Asn Glu Asp Thr Asn Lys Pro Gln Thr Ser Glu 370 375 380tcc aat gca cgc cgc agt aga atc agc tcc aat ata acc gat cca tgg 2097Ser Asn Ala Arg Arg Ser Arg Ile Ser Ser Asn Ile Thr Asp Pro Trp385 390 395 400aag tct gtg tct gac gag ggt cga att gcc ttc caa gct ctc ttc tcc 2145Lys Ser Val Ser Asp Glu Gly Arg Ile Ala Phe Gln Ala Leu Phe Ser 405 410 415aga gag gta ttg ccg caa agt ttt aca tat cga gaa gaa cac aga gag 2193Arg Glu Val Leu Pro Gln Ser Phe Thr Tyr Arg Glu Glu His Arg Glu 420 425 430gaa gaa caa caa caa caa gaa caa aga tat cca atg gca ctt gat ctt 2241Glu Glu Gln Gln Gln Gln Glu Gln Arg Tyr Pro Met Ala Leu Asp Leu 435 440 445aac ttc aca gct cag tta aca cca gtt gat gat caa gag gag aag aga 2289Asn Phe Thr Ala Gln Leu Thr Pro Val Asp Asp Gln Glu Glu Lys Arg 450 455 460aac aca gga ttt ctt gga atc gga tta gat gct tca aag cta atg agt 2337Asn Thr Gly Phe Leu Gly Ile Gly Leu Asp Ala Ser Lys Leu Met Ser465 470 475 480aga gga aga aca ggt ttt aaa cca tac aaa aga tgt tcc atg gaa gcc 2385Arg Gly Arg Thr Gly Phe Lys Pro Tyr Lys Arg Cys Ser Met Glu Ala 485 490 495aaa gaa agt aga atc ctc aac aac aat cct atc att cat gtg gaa cag 2433Lys Glu Ser Arg Ile Leu Asn Asn Asn Pro Ile Ile His Val Glu Gln 500 505 510aaa gat ccc aaa cgg atg cgg ttg gaa act caa gct tcc aca 2475Lys Asp Pro Lys Arg Met Arg Leu Glu Thr Gln Ala Ser Thr 515 520 525tgagactcta ttttcatctg atctgttgtt tgtactctgt ttttaagttt tcaagaccac 2535tgctacattt tctttttctt ttgaggcctt tgtatttgtt tccttgtcca tagtcttcct 2595gtaacatttg actctgtatt attcaacaaa tcataaactg tttaatcttt ttttttccag 2655aaaaaaaaaa aagaaaaaaa aaaaaaaaa 2684134526PRTArabidopsis thaliana 134Met Gly Gln Ala Leu Asp Ile Ala Ile Pro Pro Pro Arg Pro Lys Arg 1 5 10 15Lys Pro Asn Asn Pro Tyr Pro Arg Lys Thr Gly Ser Gly Thr Ile Leu 20 25 30Met Ser Lys Thr Gly Val Asn Asp Gly Lys Glu Ser Leu Gly Ser Glu 35 40 45Lys Val Ser His Pro Glu Met Ala Asn Glu Asp Arg Gln Gln Ser Lys 50 55 60Pro Glu Glu Lys Thr Leu Gln Glu Asp Asn Cys Ser Asp Cys Phe Thr 65 70 75 80His Gln Tyr Leu Ser Ala Ala Ser Ser Met Asn Lys Ser Cys Ile Glu 85 90 95Thr Ser Asn Ala Ser Thr Phe Arg Glu Phe Leu Pro Ser Arg Glu Glu 100 105 110Gly Ser Gln Asn Asn Arg Val Arg Lys Glu Ser Asn Ser Asp Leu Asn 115 120 125Ala Lys Ser Leu Glu Asn Gly Asn Glu Gln Gly Pro Gln Thr Tyr Pro 130 135 140Met His Ile Pro Val Leu Val Pro Leu Gly Ser Ser Ile Thr Ser Ser145 150 155 160Leu Ser His Pro Pro Ser Glu Pro Asp Ser His Pro His Thr Val Ala 165 170 175Gly Asp Tyr Gln Ser Phe Pro Asn His Ile Met Ser Thr Leu Leu Gln 180 185 190Thr Pro Ala Leu Tyr Thr Ala Ala Thr Phe Ala Ser Ser Phe Trp Pro 195 200 205Pro Asp Ser Ser Gly Gly Ser Pro Val Pro Gly Asn Ser Pro Pro Asn 210 215 220Leu Ala Ala Met Ala Ala Ala Thr Val Ala Ala Ala Ser Ala Trp Trp225 230 235 240Ala Ala Asn Gly Leu Leu Pro Leu Cys Ala Pro Leu Ser Ser Gly Gly 245 250 255Phe Thr Ser His Pro Pro Ser Thr Phe Gly Pro Ser Cys Asp Val Glu 260 265 270Tyr Thr Lys Ala Ser Thr Leu Gln His Gly Ser Val Gln Ser Arg Glu 275 280 285Gln Glu His Phe Glu Ala Ser Lys Ala Arg Ser Ser Leu Asp Ser Glu 290 295 300Asp Val Glu Asn Lys Ser Lys Pro Val Cys His Glu Gln Pro Ser Ala305 310 315 320Thr Pro Glu Ser Asp Ala Lys Gly Ser Asp Gly Ala Gly Asp Arg Lys 325 330 335Gln Val Asp Arg Ser Ser Cys Gly Ser Asn Thr Pro Ser Ser Ser Asp 340 345 350Asp Val Glu Ala Asp Ala Ser Glu Arg Gln Glu Asp Gly Thr Asn Gly 355 360 365Glu Val Lys Glu Thr Asn Glu Asp Thr Asn Lys Pro Gln Thr Ser Glu 370 375 380Ser Asn Ala Arg Arg Ser Arg Ile Ser Ser Asn Ile Thr Asp Pro Trp385 390 395 400Lys Ser Val Ser Asp Glu Gly Arg Ile Ala Phe Gln Ala Leu Phe Ser 405 410 415Arg Glu Val Leu Pro Gln Ser Phe Thr Tyr Arg Glu Glu His Arg Glu 420 425 430Glu Glu Gln Gln Gln Gln Glu Gln Arg Tyr Pro Met Ala Leu Asp Leu 435 440 445Asn Phe Thr Ala Gln Leu Thr Pro Val Asp Asp Gln Glu Glu Lys Arg 450 455 460Asn Thr Gly Phe Leu Gly Ile Gly Leu Asp Ala Ser Lys Leu Met Ser465 470 475 480Arg Gly Arg Thr Gly Phe Lys Pro Tyr Lys Arg Cys Ser Met Glu Ala 485 490 495Lys Glu Ser Arg Ile Leu Asn Asn Asn Pro Ile Ile His Val Glu Gln 500 505 510Lys Asp Pro Lys Arg Met Arg Leu Glu Thr Gln Ala Ser Thr 515 520 5251351938DNAArabidopsis thalianaCDS(1)..(1935) 135atg gat act aat aca tct gga gaa gaa tta tta gct aag gca aga aag 48Met Asp Thr Asn Thr Ser Gly Glu Glu Leu Leu Ala Lys Ala Arg Lys 1 5 10 15cca tat aca ata aca aag cag cga gag cga tgg act gag gat gag cat 96Pro Tyr Thr Ile Thr Lys Gln Arg Glu Arg Trp Thr Glu Asp Glu His 20 25 30gag agg ttt cta gaa gcc ttg agg ctt tat gga aga gct tgg caa cga 144Glu Arg Phe Leu Glu Ala Leu Arg Leu Tyr Gly Arg Ala Trp Gln Arg 35 40 45att gaa gaa cat att ggg aca aag act gct gtt cag atc aga agt cat 192Ile Glu Glu His Ile Gly Thr Lys Thr Ala Val Gln Ile Arg Ser His 50 55 60gca caa aag ttc ttc aca aag ttg gag aaa gag gct gaa gtt aaa ggc 240Ala Gln Lys Phe Phe Thr Lys Leu Glu Lys Glu Ala Glu Val Lys Gly 65 70 75 80atc cct gtt tgc caa gct ttg gac ata gaa att ccg cct cct cgt cct 288Ile Pro Val Cys Gln Ala Leu Asp Ile Glu Ile Pro Pro Pro Arg Pro 85 90 95aaa cga aaa ccc aat act cct tat cct cga aag cct ggg aac aac ggt 336Lys Arg Lys Pro Asn Thr Pro Tyr Pro Arg Lys Pro Gly Asn Asn Gly 100 105 110aca tct tcc tct caa gta tca tca gca aaa gat gca aaa ctt gtt tca

384Thr Ser Ser Ser Gln Val Ser Ser Ala Lys Asp Ala Lys Leu Val Ser 115 120 125tcg gcc tct tct tca cag ttg aat cag gcg ttc ttg gat ttg gaa aaa 432Ser Ala Ser Ser Ser Gln Leu Asn Gln Ala Phe Leu Asp Leu Glu Lys 130 135 140atg ccg ttc tct gag aaa aca tca act gga aaa gaa aat caa gat gag 480Met Pro Phe Ser Glu Lys Thr Ser Thr Gly Lys Glu Asn Gln Asp Glu145 150 155 160aat tgc tcg ggt gtt tct act gtg aac aag tat ccc tta cca acg aaa 528Asn Cys Ser Gly Val Ser Thr Val Asn Lys Tyr Pro Leu Pro Thr Lys 165 170 175cag gta agt ggc gac att gaa aca agt aag acc tca act gtg gac aac 576Gln Val Ser Gly Asp Ile Glu Thr Ser Lys Thr Ser Thr Val Asp Asn 180 185 190gcg gtt caa gat gtt ccc aag aag aac aaa gac aaa gat ggt aac gat 624Ala Val Gln Asp Val Pro Lys Lys Asn Lys Asp Lys Asp Gly Asn Asp 195 200 205ggt act act gtg cac agc atg caa aac tac cct tgg cat ttc cac gca 672Gly Thr Thr Val His Ser Met Gln Asn Tyr Pro Trp His Phe His Ala 210 215 220gat att gtg aac ggg aat ata gca aaa tgc cct caa aat cat ccc tca 720Asp Ile Val Asn Gly Asn Ile Ala Lys Cys Pro Gln Asn His Pro Ser225 230 235 240ggt atg gta tct caa gac ttc atg ttt cat cct atg aga gaa gaa act 768Gly Met Val Ser Gln Asp Phe Met Phe His Pro Met Arg Glu Glu Thr 245 250 255cac ggg cac gca aat ctt caa gct aca aca gca tct gct act act aca 816His Gly His Ala Asn Leu Gln Ala Thr Thr Ala Ser Ala Thr Thr Thr 260 265 270gct tct cat caa gcg ttt cca gct tgt cat tca cag gat gat tac cgt 864Ala Ser His Gln Ala Phe Pro Ala Cys His Ser Gln Asp Asp Tyr Arg 275 280 285tcg ttt ctc cag ata tca tct act ttc tcc aat ctt att atg tca act 912Ser Phe Leu Gln Ile Ser Ser Thr Phe Ser Asn Leu Ile Met Ser Thr 290 295 300ctc cta cag aat cct gca gct cat gct gca gct aca ttc gct gct tcg 960Leu Leu Gln Asn Pro Ala Ala His Ala Ala Ala Thr Phe Ala Ala Ser305 310 315 320gtc tgg cct tat gcg agt gtc ggg aat tct ggt gat tca tca acc cca 1008Val Trp Pro Tyr Ala Ser Val Gly Asn Ser Gly Asp Ser Ser Thr Pro 325 330 335atg agc tct tct cct cca agt ata act gcc att gcc gct gct aca gta 1056Met Ser Ser Ser Pro Pro Ser Ile Thr Ala Ile Ala Ala Ala Thr Val 340 345 350gct gct gca act gct tgg tgg gct tct cat gga ctt ctt cct gta tgc 1104Ala Ala Ala Thr Ala Trp Trp Ala Ser His Gly Leu Leu Pro Val Cys 355 360 365gct cca gct cca ata aca tgt gtt cca ttc tca act gtt gca gtt cca 1152Ala Pro Ala Pro Ile Thr Cys Val Pro Phe Ser Thr Val Ala Val Pro 370 375 380act cca gca atg act gaa atg gat acc gtt gaa aat act caa ccg ttt 1200Thr Pro Ala Met Thr Glu Met Asp Thr Val Glu Asn Thr Gln Pro Phe385 390 395 400gag aaa caa aac aca gct ctg caa gat caa aac ttg gct tcg aaa tct 1248Glu Lys Gln Asn Thr Ala Leu Gln Asp Gln Asn Leu Ala Ser Lys Ser 405 410 415cca gct tca tca tct gat gat tca gat gag act gga gta acc aag cta 1296Pro Ala Ser Ser Ser Asp Asp Ser Asp Glu Thr Gly Val Thr Lys Leu 420 425 430aat gcc gac tca aaa acc aat gat gat aaa att gag gag gtt gtt gtt 1344Asn Ala Asp Ser Lys Thr Asn Asp Asp Lys Ile Glu Glu Val Val Val 435 440 445act gcc gct gtg cat gac tca aac act gcc cag aag aaa aat ctt gtg 1392Thr Ala Ala Val His Asp Ser Asn Thr Ala Gln Lys Lys Asn Leu Val 450 455 460gac cgc tca tcc tgt ggc tca aat aca cct tca ggg agt gac gca gaa 1440Asp Arg Ser Ser Cys Gly Ser Asn Thr Pro Ser Gly Ser Asp Ala Glu465 470 475 480act gat gca tta gat aaa atg gag aaa gat aaa gag gat gtg aag gag 1488Thr Asp Ala Leu Asp Lys Met Glu Lys Asp Lys Glu Asp Val Lys Glu 485 490 495aca gat gag aat cag cca gat gtt att gag tta aat aac cgt aag att 1536Thr Asp Glu Asn Gln Pro Asp Val Ile Glu Leu Asn Asn Arg Lys Ile 500 505 510aaa atg aga gac aac aac agc aac aac aat gca act act gat tcg tgg 1584Lys Met Arg Asp Asn Asn Ser Asn Asn Asn Ala Thr Thr Asp Ser Trp 515 520 525aag gaa gtc tcc gaa gag ggt cgt ata gcg ttt cag gct ctc ttt gca 1632Lys Glu Val Ser Glu Glu Gly Arg Ile Ala Phe Gln Ala Leu Phe Ala 530 535 540aga gaa aga ttg cct caa agc ttt tcg cct cct caa gtg gca gag aat 1680Arg Glu Arg Leu Pro Gln Ser Phe Ser Pro Pro Gln Val Ala Glu Asn545 550 555 560gtg aat aga aaa caa agt gac acg tca atg cca ttg gct cct aat ttc 1728Val Asn Arg Lys Gln Ser Asp Thr Ser Met Pro Leu Ala Pro Asn Phe 565 570 575aaa agc cag gat tct tgt gct gca gac caa gaa gga gta gta atg atc 1776Lys Ser Gln Asp Ser Cys Ala Ala Asp Gln Glu Gly Val Val Met Ile 580 585 590ggt gtt gga aca tgc aag agt ctt aaa acg aga cag aca gga ttt aag 1824Gly Val Gly Thr Cys Lys Ser Leu Lys Thr Arg Gln Thr Gly Phe Lys 595 600 605cca tac aag aga tgt tca atg gaa gtg aaa gag agc caa gtt ggg aac 1872Pro Tyr Lys Arg Cys Ser Met Glu Val Lys Glu Ser Gln Val Gly Asn 610 615 620ata aac aat caa agt gat gaa aaa gtc tgc aaa agg ctt cga ttg gaa 1920Ile Asn Asn Gln Ser Asp Glu Lys Val Cys Lys Arg Leu Arg Leu Glu625 630 635 640gga gaa gct tct aca tga 1938Gly Glu Ala Ser Thr 645136645PRTArabidopsis thaliana 136Met Asp Thr Asn Thr Ser Gly Glu Glu Leu Leu Ala Lys Ala Arg Lys 1 5 10 15Pro Tyr Thr Ile Thr Lys Gln Arg Glu Arg Trp Thr Glu Asp Glu His 20 25 30Glu Arg Phe Leu Glu Ala Leu Arg Leu Tyr Gly Arg Ala Trp Gln Arg 35 40 45Ile Glu Glu His Ile Gly Thr Lys Thr Ala Val Gln Ile Arg Ser His 50 55 60Ala Gln Lys Phe Phe Thr Lys Leu Glu Lys Glu Ala Glu Val Lys Gly 65 70 75 80Ile Pro Val Cys Gln Ala Leu Asp Ile Glu Ile Pro Pro Pro Arg Pro 85 90 95Lys Arg Lys Pro Asn Thr Pro Tyr Pro Arg Lys Pro Gly Asn Asn Gly 100 105 110Thr Ser Ser Ser Gln Val Ser Ser Ala Lys Asp Ala Lys Leu Val Ser 115 120 125Ser Ala Ser Ser Ser Gln Leu Asn Gln Ala Phe Leu Asp Leu Glu Lys 130 135 140Met Pro Phe Ser Glu Lys Thr Ser Thr Gly Lys Glu Asn Gln Asp Glu145 150 155 160Asn Cys Ser Gly Val Ser Thr Val Asn Lys Tyr Pro Leu Pro Thr Lys 165 170 175Gln Val Ser Gly Asp Ile Glu Thr Ser Lys Thr Ser Thr Val Asp Asn 180 185 190Ala Val Gln Asp Val Pro Lys Lys Asn Lys Asp Lys Asp Gly Asn Asp 195 200 205Gly Thr Thr Val His Ser Met Gln Asn Tyr Pro Trp His Phe His Ala 210 215 220Asp Ile Val Asn Gly Asn Ile Ala Lys Cys Pro Gln Asn His Pro Ser225 230 235 240Gly Met Val Ser Gln Asp Phe Met Phe His Pro Met Arg Glu Glu Thr 245 250 255His Gly His Ala Asn Leu Gln Ala Thr Thr Ala Ser Ala Thr Thr Thr 260 265 270Ala Ser His Gln Ala Phe Pro Ala Cys His Ser Gln Asp Asp Tyr Arg 275 280 285Ser Phe Leu Gln Ile Ser Ser Thr Phe Ser Asn Leu Ile Met Ser Thr 290 295 300Leu Leu Gln Asn Pro Ala Ala His Ala Ala Ala Thr Phe Ala Ala Ser305 310 315 320Val Trp Pro Tyr Ala Ser Val Gly Asn Ser Gly Asp Ser Ser Thr Pro 325 330 335Met Ser Ser Ser Pro Pro Ser Ile Thr Ala Ile Ala Ala Ala Thr Val 340 345 350Ala Ala Ala Thr Ala Trp Trp Ala Ser His Gly Leu Leu Pro Val Cys 355 360 365Ala Pro Ala Pro Ile Thr Cys Val Pro Phe Ser Thr Val Ala Val Pro 370 375 380Thr Pro Ala Met Thr Glu Met Asp Thr Val Glu Asn Thr Gln Pro Phe385 390 395 400Glu Lys Gln Asn Thr Ala Leu Gln Asp Gln Asn Leu Ala Ser Lys Ser 405 410 415Pro Ala Ser Ser Ser Asp Asp Ser Asp Glu Thr Gly Val Thr Lys Leu 420 425 430Asn Ala Asp Ser Lys Thr Asn Asp Asp Lys Ile Glu Glu Val Val Val 435 440 445Thr Ala Ala Val His Asp Ser Asn Thr Ala Gln Lys Lys Asn Leu Val 450 455 460Asp Arg Ser Ser Cys Gly Ser Asn Thr Pro Ser Gly Ser Asp Ala Glu465 470 475 480Thr Asp Ala Leu Asp Lys Met Glu Lys Asp Lys Glu Asp Val Lys Glu 485 490 495Thr Asp Glu Asn Gln Pro Asp Val Ile Glu Leu Asn Asn Arg Lys Ile 500 505 510Lys Met Arg Asp Asn Asn Ser Asn Asn Asn Ala Thr Thr Asp Ser Trp 515 520 525Lys Glu Val Ser Glu Glu Gly Arg Ile Ala Phe Gln Ala Leu Phe Ala 530 535 540Arg Glu Arg Leu Pro Gln Ser Phe Ser Pro Pro Gln Val Ala Glu Asn545 550 555 560Val Asn Arg Lys Gln Ser Asp Thr Ser Met Pro Leu Ala Pro Asn Phe 565 570 575Lys Ser Gln Asp Ser Cys Ala Ala Asp Gln Glu Gly Val Val Met Ile 580 585 590Gly Val Gly Thr Cys Lys Ser Leu Lys Thr Arg Gln Thr Gly Phe Lys 595 600 605Pro Tyr Lys Arg Cys Ser Met Glu Val Lys Glu Ser Gln Val Gly Asn 610 615 620Ile Asn Asn Gln Ser Asp Glu Lys Val Cys Lys Arg Leu Arg Leu Glu625 630 635 640Gly Glu Ala Ser Thr 6451371989DNAArabidopsis thalianaCDS(1)..(1986) 137atg aat atg agt gat tta ggt tgg gat gat gaa gat aaa tcg gtg gtt 48Met Asn Met Ser Asp Leu Gly Trp Asp Asp Glu Asp Lys Ser Val Val 1 5 10 15agt gct gtt tta ggg cat tta gct tct gat ttt ctt cga gca aac tct 96Ser Ala Val Leu Gly His Leu Ala Ser Asp Phe Leu Arg Ala Asn Ser 20 25 30aat tcg aat cag aat ctc ttt ctt gtt atg gga act gat gat act ctg 144Asn Ser Asn Gln Asn Leu Phe Leu Val Met Gly Thr Asp Asp Thr Leu 35 40 45aat aag aag ctc tct agt ctc gtt gat tgg cca aac tcg gag aat ttc 192Asn Lys Lys Leu Ser Ser Leu Val Asp Trp Pro Asn Ser Glu Asn Phe 50 55 60agc tgg aac tac gct att ttc tgg caa caa acc atg tct aga tcc gga 240Ser Trp Asn Tyr Ala Ile Phe Trp Gln Gln Thr Met Ser Arg Ser Gly 65 70 75 80caa caa gtc tta ggt tgg gga gat ggg tgt tgt cga gag cct aat gag 288Gln Gln Val Leu Gly Trp Gly Asp Gly Cys Cys Arg Glu Pro Asn Glu 85 90 95gaa gag gaa tca aaa gtt gtt agg tct tat aat ttt aac aac atg ggg 336Glu Glu Glu Ser Lys Val Val Arg Ser Tyr Asn Phe Asn Asn Met Gly 100 105 110gca gag gaa gag aca tgg caa gat atg agg aag aga gtg ttg cag aag 384Ala Glu Glu Glu Thr Trp Gln Asp Met Arg Lys Arg Val Leu Gln Lys 115 120 125ctt cat agg ttg ttt ggt gga tct gat gaa gac aat tat gct ttg agc 432Leu His Arg Leu Phe Gly Gly Ser Asp Glu Asp Asn Tyr Ala Leu Ser 130 135 140tta gag aaa gtt act gct act gag att ttc ttc tta gct tcc atg tat 480Leu Glu Lys Val Thr Ala Thr Glu Ile Phe Phe Leu Ala Ser Met Tyr145 150 155 160ttc ttc ttc aat cac ggt gaa ggc ggt cct ggg agg tgt tat tct tca 528Phe Phe Phe Asn His Gly Glu Gly Gly Pro Gly Arg Cys Tyr Ser Ser 165 170 175ggg aaa cat gtg tgg ctc tct gat gca gtt aac tct gag tct gac tat 576Gly Lys His Val Trp Leu Ser Asp Ala Val Asn Ser Glu Ser Asp Tyr 180 185 190tgt ttc agg tct ttt atg gcg aaa tct gcg gga atc aga acg atc gtt 624Cys Phe Arg Ser Phe Met Ala Lys Ser Ala Gly Ile Arg Thr Ile Val 195 200 205atg gtt cct act gat gct ggt gtt ctt gag ctt ggt tct gtt tgg tct 672Met Val Pro Thr Asp Ala Gly Val Leu Glu Leu Gly Ser Val Trp Ser 210 215 220ttg cct gaa aac att ggc ttg gtt aag tct gtt caa gct ttg ttc atg 720Leu Pro Glu Asn Ile Gly Leu Val Lys Ser Val Gln Ala Leu Phe Met225 230 235 240agg aga gtt acg caa cca gta atg gtg act tca aac act aac atg act 768Arg Arg Val Thr Gln Pro Val Met Val Thr Ser Asn Thr Asn Met Thr 245 250 255gga ggg att cac aag ctt ttc ggg cag gat ttg agt gga gct cac gcg 816Gly Gly Ile His Lys Leu Phe Gly Gln Asp Leu Ser Gly Ala His Ala 260 265 270tat cct aag aag ctc gaa gtg aga aga aac ttg gat gag aga ttc act 864Tyr Pro Lys Lys Leu Glu Val Arg Arg Asn Leu Asp Glu Arg Phe Thr 275 280 285cct caa agt tgg gaa ggc tat aat aac aat aaa ggt cca aca ttt ggt 912Pro Gln Ser Trp Glu Gly Tyr Asn Asn Asn Lys Gly Pro Thr Phe Gly 290 295 300tac aca cct cag agg gat gat gtg aaa gtg cta gag aat gtg aat atg 960Tyr Thr Pro Gln Arg Asp Asp Val Lys Val Leu Glu Asn Val Asn Met305 310 315 320gtt gta gat aat aac aat tac aag acg cag att gag ttt gcg gga tca 1008Val Val Asp Asn Asn Asn Tyr Lys Thr Gln Ile Glu Phe Ala Gly Ser 325 330 335tca gtt gct gct tct tcg aat cca tct aca aac act cag caa gaa aaa 1056Ser Val Ala Ala Ser Ser Asn Pro Ser Thr Asn Thr Gln Gln Glu Lys 340 345 350tca gaa tct tgt aca gag aaa aga cca gtg agc ttg tta gca gga gca 1104Ser Glu Ser Cys Thr Glu Lys Arg Pro Val Ser Leu Leu Ala Gly Ala 355 360 365gga ata gtt tct gtt gtt gat gag aag aga ccg aga aag aga ggg aga 1152Gly Ile Val Ser Val Val Asp Glu Lys Arg Pro Arg Lys Arg Gly Arg 370 375 380aag cct gca aac gga aga gaa gag cca ttg aac cat gtg gaa gct gag 1200Lys Pro Ala Asn Gly Arg Glu Glu Pro Leu Asn His Val Glu Ala Glu385 390 395 400agg cag aga cgc gag aag ctt aac caa aga ttc tac gct tta cga tca 1248Arg Gln Arg Arg Glu Lys Leu Asn Gln Arg Phe Tyr Ala Leu Arg Ser 405 410 415gtt gtt cca aac att tct aaa atg gac aag gct tct cta ctt gga gac 1296Val Val Pro Asn Ile Ser Lys Met Asp Lys Ala Ser Leu Leu Gly Asp 420 425 430gca att tct tac atc aaa gag ctt caa gag aaa gtc aag ata atg gaa 1344Ala Ile Ser Tyr Ile Lys Glu Leu Gln Glu Lys Val Lys Ile Met Glu 435 440 445gat gaa aga gta gga aca gat aag agc tta tca gaa tca aac aca ata 1392Asp Glu Arg Val Gly Thr Asp Lys Ser Leu Ser Glu Ser Asn Thr Ile 450 455 460aca gta gaa gaa agt cca gaa gtt gac att caa gct atg aat gaa gag 1440Thr Val Glu Glu Ser Pro Glu Val Asp Ile Gln Ala Met Asn Glu Glu465 470 475 480gtt gtt gta aga gta atc tcg cct ttg gat tca cat cca gct tca aga 1488Val Val Val Arg Val Ile Ser Pro Leu Asp Ser His Pro Ala Ser Arg 485 490 495atc ata caa gca atg aga aac tca aat gtt agt cta atg gag gct aag 1536Ile Ile Gln Ala Met Arg Asn Ser Asn Val Ser Leu Met Glu Ala Lys 500 505 510tta tca tta gct gaa gac aca atg ttt cac act ttt gtg ata aag tct 1584Leu Ser Leu Ala Glu Asp Thr Met Phe His Thr Phe Val Ile Lys Ser 515 520 525aac aac ggg tcg gat cca ttg acg aaa gag aag ctt ata gca gcg ttt 1632Asn Asn Gly Ser Asp Pro Leu Thr Lys Glu Lys Leu Ile Ala Ala Phe 530 535 540tac ccc gag acc agc tcg acg caa ccg cca ttg cct tct tct agt tca 1680Tyr Pro Glu Thr Ser Ser Thr Gln Pro Pro Leu Pro Ser Ser Ser Ser545 550 555 560cag gtc tct ggg ttt gtt tct ccg atc aga tcc aat ttc aat cct tct 1728Gln Val Ser Gly Phe Val Ser Pro Ile Arg Ser Asn Phe Asn Pro Ser 565 570 575ccg tca gat tat ctc ctc ggc cac gat gtc tct ccg ccg cac tat cct 1776Pro Ser Asp Tyr Leu Leu Gly His Asp Val Ser Pro Pro His Tyr Pro 580 585 590cga tct cca ccg tca aac cca gag agc cac tct ctc aaa gtc tct acc 1824Arg Ser Pro Pro Ser Asn Pro Glu Ser His Ser Leu Lys Val Ser Thr

595 600 605ttt ctc gat gag tca cat ctc ctc cgc atc cgc cac cgc cgc cgt gag 1872Phe Leu Asp Glu Ser His Leu Leu Arg Ile Arg His Arg Arg Arg Glu 610 615 620aaa ccc act ggg aag aga cct ctc ttc aat ccc ctt cgc aca agc cca 1920Lys Pro Thr Gly Lys Arg Pro Leu Phe Asn Pro Leu Arg Thr Ser Pro625 630 635 640gaa act gaa acc aga ttc aac caa tct cgt cac cga tcg ctc gat ttc 1968Glu Thr Glu Thr Arg Phe Asn Gln Ser Arg His Arg Ser Leu Asp Phe 645 650 655gtc ttc aat tgg gca atc tga 1989Val Phe Asn Trp Ala Ile 660138662PRTArabidopsis thaliana 138Met Asn Met Ser Asp Leu Gly Trp Asp Asp Glu Asp Lys Ser Val Val 1 5 10 15Ser Ala Val Leu Gly His Leu Ala Ser Asp Phe Leu Arg Ala Asn Ser 20 25 30Asn Ser Asn Gln Asn Leu Phe Leu Val Met Gly Thr Asp Asp Thr Leu 35 40 45Asn Lys Lys Leu Ser Ser Leu Val Asp Trp Pro Asn Ser Glu Asn Phe 50 55 60Ser Trp Asn Tyr Ala Ile Phe Trp Gln Gln Thr Met Ser Arg Ser Gly 65 70 75 80Gln Gln Val Leu Gly Trp Gly Asp Gly Cys Cys Arg Glu Pro Asn Glu 85 90 95Glu Glu Glu Ser Lys Val Val Arg Ser Tyr Asn Phe Asn Asn Met Gly 100 105 110Ala Glu Glu Glu Thr Trp Gln Asp Met Arg Lys Arg Val Leu Gln Lys 115 120 125Leu His Arg Leu Phe Gly Gly Ser Asp Glu Asp Asn Tyr Ala Leu Ser 130 135 140Leu Glu Lys Val Thr Ala Thr Glu Ile Phe Phe Leu Ala Ser Met Tyr145 150 155 160Phe Phe Phe Asn His Gly Glu Gly Gly Pro Gly Arg Cys Tyr Ser Ser 165 170 175Gly Lys His Val Trp Leu Ser Asp Ala Val Asn Ser Glu Ser Asp Tyr 180 185 190Cys Phe Arg Ser Phe Met Ala Lys Ser Ala Gly Ile Arg Thr Ile Val 195 200 205Met Val Pro Thr Asp Ala Gly Val Leu Glu Leu Gly Ser Val Trp Ser 210 215 220Leu Pro Glu Asn Ile Gly Leu Val Lys Ser Val Gln Ala Leu Phe Met225 230 235 240Arg Arg Val Thr Gln Pro Val Met Val Thr Ser Asn Thr Asn Met Thr 245 250 255Gly Gly Ile His Lys Leu Phe Gly Gln Asp Leu Ser Gly Ala His Ala 260 265 270Tyr Pro Lys Lys Leu Glu Val Arg Arg Asn Leu Asp Glu Arg Phe Thr 275 280 285Pro Gln Ser Trp Glu Gly Tyr Asn Asn Asn Lys Gly Pro Thr Phe Gly 290 295 300Tyr Thr Pro Gln Arg Asp Asp Val Lys Val Leu Glu Asn Val Asn Met305 310 315 320Val Val Asp Asn Asn Asn Tyr Lys Thr Gln Ile Glu Phe Ala Gly Ser 325 330 335Ser Val Ala Ala Ser Ser Asn Pro Ser Thr Asn Thr Gln Gln Glu Lys 340 345 350Ser Glu Ser Cys Thr Glu Lys Arg Pro Val Ser Leu Leu Ala Gly Ala 355 360 365Gly Ile Val Ser Val Val Asp Glu Lys Arg Pro Arg Lys Arg Gly Arg 370 375 380Lys Pro Ala Asn Gly Arg Glu Glu Pro Leu Asn His Val Glu Ala Glu385 390 395 400Arg Gln Arg Arg Glu Lys Leu Asn Gln Arg Phe Tyr Ala Leu Arg Ser 405 410 415Val Val Pro Asn Ile Ser Lys Met Asp Lys Ala Ser Leu Leu Gly Asp 420 425 430Ala Ile Ser Tyr Ile Lys Glu Leu Gln Glu Lys Val Lys Ile Met Glu 435 440 445Asp Glu Arg Val Gly Thr Asp Lys Ser Leu Ser Glu Ser Asn Thr Ile 450 455 460Thr Val Glu Glu Ser Pro Glu Val Asp Ile Gln Ala Met Asn Glu Glu465 470 475 480Val Val Val Arg Val Ile Ser Pro Leu Asp Ser His Pro Ala Ser Arg 485 490 495Ile Ile Gln Ala Met Arg Asn Ser Asn Val Ser Leu Met Glu Ala Lys 500 505 510Leu Ser Leu Ala Glu Asp Thr Met Phe His Thr Phe Val Ile Lys Ser 515 520 525Asn Asn Gly Ser Asp Pro Leu Thr Lys Glu Lys Leu Ile Ala Ala Phe 530 535 540Tyr Pro Glu Thr Ser Ser Thr Gln Pro Pro Leu Pro Ser Ser Ser Ser545 550 555 560Gln Val Ser Gly Phe Val Ser Pro Ile Arg Ser Asn Phe Asn Pro Ser 565 570 575Pro Ser Asp Tyr Leu Leu Gly His Asp Val Ser Pro Pro His Tyr Pro 580 585 590Arg Ser Pro Pro Ser Asn Pro Glu Ser His Ser Leu Lys Val Ser Thr 595 600 605Phe Leu Asp Glu Ser His Leu Leu Arg Ile Arg His Arg Arg Arg Glu 610 615 620Lys Pro Thr Gly Lys Arg Pro Leu Phe Asn Pro Leu Arg Thr Ser Pro625 630 635 640Glu Thr Glu Thr Arg Phe Asn Gln Ser Arg His Arg Ser Leu Asp Phe 645 650 655Val Phe Asn Trp Ala Ile 660139870DNAArabidopsis thalianaCDS(1)..(867) 139atg tca gaa tta tta cag ttg cct cca ggt ttc cga ttt cac cct acc 48Met Ser Glu Leu Leu Gln Leu Pro Pro Gly Phe Arg Phe His Pro Thr 1 5 10 15gat gaa gag ctt gtc atg cac tat ctc tgc cgc aaa tgt gcc tct cag 96Asp Glu Glu Leu Val Met His Tyr Leu Cys Arg Lys Cys Ala Ser Gln 20 25 30tcc atc gcc gtt ccg atc atc gct gag atc gat ctc tac aaa tac gat 144Ser Ile Ala Val Pro Ile Ile Ala Glu Ile Asp Leu Tyr Lys Tyr Asp 35 40 45cca tgg gag ctt cct ggt tta gcc ttg tat ggt gag aag gaa tgg tac 192Pro Trp Glu Leu Pro Gly Leu Ala Leu Tyr Gly Glu Lys Glu Trp Tyr 50 55 60ttc ttc tct ccc agg gac aga aaa tat ccc aac ggt tcg cgt cct aac 240Phe Phe Ser Pro Arg Asp Arg Lys Tyr Pro Asn Gly Ser Arg Pro Asn 65 70 75 80cgg tcc gct ggt tct ggt tac tgg aaa gct acc gga gct gat aaa ccg 288Arg Ser Ala Gly Ser Gly Tyr Trp Lys Ala Thr Gly Ala Asp Lys Pro 85 90 95atc gga cta cct aaa ccg gtc gga att aag aaa gct ctt gtt ttc tac 336Ile Gly Leu Pro Lys Pro Val Gly Ile Lys Lys Ala Leu Val Phe Tyr 100 105 110gcc ggc aaa gct cca aag gga gag aaa acc aat tgg atc atg cac gag 384Ala Gly Lys Ala Pro Lys Gly Glu Lys Thr Asn Trp Ile Met His Glu 115 120 125tac cgt ctc gcc gac gtt gac cgg tcc gtt cgc aag aag aag aat agt 432Tyr Arg Leu Ala Asp Val Asp Arg Ser Val Arg Lys Lys Lys Asn Ser 130 135 140ctc agg ctg gat gat tgg gtt ctc tgc cgg att tac aac aaa aaa gga 480Leu Arg Leu Asp Asp Trp Val Leu Cys Arg Ile Tyr Asn Lys Lys Gly145 150 155 160gct acc gag agg cgg gga cca ccg cct ccg gtt gtt tac ggc gac gaa 528Ala Thr Glu Arg Arg Gly Pro Pro Pro Pro Val Val Tyr Gly Asp Glu 165 170 175atc atg gag gag aag ccg aag gtg acg gag atg gtt atg cct ccg ccg 576Ile Met Glu Glu Lys Pro Lys Val Thr Glu Met Val Met Pro Pro Pro 180 185 190ccg caa cag aca agt gag ttc gcg tat ttc gac acg tcg gat tcg gtg 624Pro Gln Gln Thr Ser Glu Phe Ala Tyr Phe Asp Thr Ser Asp Ser Val 195 200 205ccg aag ctg cat act acg gat tcg agt tgc tcg gag cag gtg gtg tcg 672Pro Lys Leu His Thr Thr Asp Ser Ser Cys Ser Glu Gln Val Val Ser 210 215 220ccg gag ttc acg agc gag gtt cag agc gag ccc aag tgg aaa gat tgg 720Pro Glu Phe Thr Ser Glu Val Gln Ser Glu Pro Lys Trp Lys Asp Trp225 230 235 240tcg gcc gta agt aat gac aat aac aat acc ctt gat ttt ggg ttt aat 768Ser Ala Val Ser Asn Asp Asn Asn Asn Thr Leu Asp Phe Gly Phe Asn 245 250 255tac att gat gcc acc gtg gat aac gcg ttt gga gga gga ggg agt agt 816Tyr Ile Asp Ala Thr Val Asp Asn Ala Phe Gly Gly Gly Gly Ser Ser 260 265 270aat cag atg ttt ccg cta cag gat atg ttc atg tac atg cag aag cct 864Asn Gln Met Phe Pro Leu Gln Asp Met Phe Met Tyr Met Gln Lys Pro 275 280 285tac tag 870Tyr140289PRTArabidopsis thaliana 140Met Ser Glu Leu Leu Gln Leu Pro Pro Gly Phe Arg Phe His Pro Thr 1 5 10 15Asp Glu Glu Leu Val Met His Tyr Leu Cys Arg Lys Cys Ala Ser Gln 20 25 30Ser Ile Ala Val Pro Ile Ile Ala Glu Ile Asp Leu Tyr Lys Tyr Asp 35 40 45Pro Trp Glu Leu Pro Gly Leu Ala Leu Tyr Gly Glu Lys Glu Trp Tyr 50 55 60Phe Phe Ser Pro Arg Asp Arg Lys Tyr Pro Asn Gly Ser Arg Pro Asn 65 70 75 80Arg Ser Ala Gly Ser Gly Tyr Trp Lys Ala Thr Gly Ala Asp Lys Pro 85 90 95Ile Gly Leu Pro Lys Pro Val Gly Ile Lys Lys Ala Leu Val Phe Tyr 100 105 110Ala Gly Lys Ala Pro Lys Gly Glu Lys Thr Asn Trp Ile Met His Glu 115 120 125Tyr Arg Leu Ala Asp Val Asp Arg Ser Val Arg Lys Lys Lys Asn Ser 130 135 140Leu Arg Leu Asp Asp Trp Val Leu Cys Arg Ile Tyr Asn Lys Lys Gly145 150 155 160Ala Thr Glu Arg Arg Gly Pro Pro Pro Pro Val Val Tyr Gly Asp Glu 165 170 175Ile Met Glu Glu Lys Pro Lys Val Thr Glu Met Val Met Pro Pro Pro 180 185 190Pro Gln Gln Thr Ser Glu Phe Ala Tyr Phe Asp Thr Ser Asp Ser Val 195 200 205Pro Lys Leu His Thr Thr Asp Ser Ser Cys Ser Glu Gln Val Val Ser 210 215 220Pro Glu Phe Thr Ser Glu Val Gln Ser Glu Pro Lys Trp Lys Asp Trp225 230 235 240Ser Ala Val Ser Asn Asp Asn Asn Asn Thr Leu Asp Phe Gly Phe Asn 245 250 255Tyr Ile Asp Ala Thr Val Asp Asn Ala Phe Gly Gly Gly Gly Ser Ser 260 265 270Asn Gln Met Phe Pro Leu Gln Asp Met Phe Met Tyr Met Gln Lys Pro 275 280 285Tyr1411202DNAArabidopsis thalianaCDS(112)..(918) 141tcctcgattt caatctttga gataaaccac aaagatcctc cgattcgaag gtttataaaa 60actcaaaatc gaatcttatc cacaagaaaa caacaaggta cttttccaaa a atg aag 117 Met Lys 1gcg gag ttg aat ttg ccg gcg gga ttc cga ttt cat ccg acg gac gaa 165Ala Glu Leu Asn Leu Pro Ala Gly Phe Arg Phe His Pro Thr Asp Glu 5 10 15gag ctt gtc aag ttc tat ctt tgc cgg aga tgt gcg tca gaa ccg att 213Glu Leu Val Lys Phe Tyr Leu Cys Arg Arg Cys Ala Ser Glu Pro Ile 20 25 30aac gtt ccg gtt atc gca gag att gac ttg tac aaa ttc aat cca agg 261Asn Val Pro Val Ile Ala Glu Ile Asp Leu Tyr Lys Phe Asn Pro Arg 35 40 45 50gag ctt cca gaa atg gcg ttg tac ggt gag aaa gaa tgg tac ttc ttc 309Glu Leu Pro Glu Met Ala Leu Tyr Gly Glu Lys Glu Trp Tyr Phe Phe 55 60 65tcg cat aga gac cgg aaa tac cca aac ggg tcg aga cca aac cgg gca 357Ser His Arg Asp Arg Lys Tyr Pro Asn Gly Ser Arg Pro Asn Arg Ala 70 75 80gct gga acc ggt tat tgg aaa gcg act gga gct gat aaa ccg atc gga 405Ala Gly Thr Gly Tyr Trp Lys Ala Thr Gly Ala Asp Lys Pro Ile Gly 85 90 95aaa ccg aag acg tta ggg att aag aaa gca ctc gtc ttc tac gca gga 453Lys Pro Lys Thr Leu Gly Ile Lys Lys Ala Leu Val Phe Tyr Ala Gly 100 105 110aaa gct ccg aaa ggg att aaa acg aat tgg att atg cac gag tat cgt 501Lys Ala Pro Lys Gly Ile Lys Thr Asn Trp Ile Met His Glu Tyr Arg115 120 125 130ctc gct aat gtc gat cga tct gct tct acc aac aag aag aac aac tta 549Leu Ala Asn Val Asp Arg Ser Ala Ser Thr Asn Lys Lys Asn Asn Leu 135 140 145aga ctt gat gat tgg gtt ttg tgt cgg ata tac aat aag aaa gga aca 597Arg Leu Asp Asp Trp Val Leu Cys Arg Ile Tyr Asn Lys Lys Gly Thr 150 155 160atg gag aag tat tta ccg gcg gcg gct gag aaa ccg acg gaa aag atg 645Met Glu Lys Tyr Leu Pro Ala Ala Ala Glu Lys Pro Thr Glu Lys Met 165 170 175agt acg tcg gac tca aga tgc tca agt cac gtg att tca ccg gac gtc 693Ser Thr Ser Asp Ser Arg Cys Ser Ser His Val Ile Ser Pro Asp Val 180 185 190acg tgt tct gat aac tgg gag gtt gag agt gag ccc aaa tgg att aat 741Thr Cys Ser Asp Asn Trp Glu Val Glu Ser Glu Pro Lys Trp Ile Asn195 200 205 210ctg gaa gac gcg tta gag gca ttt aat gat gac acg tcc atg ttt agt 789Leu Glu Asp Ala Leu Glu Ala Phe Asn Asp Asp Thr Ser Met Phe Ser 215 220 225tcc att ggt ttg ttg caa aat gac gcc ttt gtt cct cag ttt cag tac 837Ser Ile Gly Leu Leu Gln Asn Asp Ala Phe Val Pro Gln Phe Gln Tyr 230 235 240cag tcc tcc gat ttc gtc gat tcg ttt cag gac ccg ttc gag cag aaa 885Gln Ser Ser Asp Phe Val Asp Ser Phe Gln Asp Pro Phe Glu Gln Lys 245 250 255ccg ttc ttg aat tgg aat ttt gct cct caa ggg taaaaataat cggcaaaaag 938Pro Phe Leu Asn Trp Asn Phe Ala Pro Gln Gly 260 265ttgaagcttt tcagagtctt cgatcaccgg cattgtgtcg gatcctgacc cggagaccaa 998gtcgggtcat acgattacat aatcgggtta ttgagatttc cacatttgga tttccgagac 1058taaccaactt aacggattct ggggtaattg gggggttttg cacaggtgaa tcacactgag 1118tcagcaagtt tcgatttttt ggttttgttt tgtaatgatt gattaaatgt ctaaagatat 1178cacgaagtta aaaaaaaaaa aaaa 1202142269PRTArabidopsis thaliana 142Met Lys Ala Glu Leu Asn Leu Pro Ala Gly Phe Arg Phe His Pro Thr 1 5 10 15Asp Glu Glu Leu Val Lys Phe Tyr Leu Cys Arg Arg Cys Ala Ser Glu 20 25 30Pro Ile Asn Val Pro Val Ile Ala Glu Ile Asp Leu Tyr Lys Phe Asn 35 40 45Pro Arg Glu Leu Pro Glu Met Ala Leu Tyr Gly Glu Lys Glu Trp Tyr 50 55 60Phe Phe Ser His Arg Asp Arg Lys Tyr Pro Asn Gly Ser Arg Pro Asn 65 70 75 80Arg Ala Ala Gly Thr Gly Tyr Trp Lys Ala Thr Gly Ala Asp Lys Pro 85 90 95Ile Gly Lys Pro Lys Thr Leu Gly Ile Lys Lys Ala Leu Val Phe Tyr 100 105 110Ala Gly Lys Ala Pro Lys Gly Ile Lys Thr Asn Trp Ile Met His Glu 115 120 125Tyr Arg Leu Ala Asn Val Asp Arg Ser Ala Ser Thr Asn Lys Lys Asn 130 135 140Asn Leu Arg Leu Asp Asp Trp Val Leu Cys Arg Ile Tyr Asn Lys Lys145 150 155 160Gly Thr Met Glu Lys Tyr Leu Pro Ala Ala Ala Glu Lys Pro Thr Glu 165 170 175Lys Met Ser Thr Ser Asp Ser Arg Cys Ser Ser His Val Ile Ser Pro 180 185 190Asp Val Thr Cys Ser Asp Asn Trp Glu Val Glu Ser Glu Pro Lys Trp 195 200 205Ile Asn Leu Glu Asp Ala Leu Glu Ala Phe Asn Asp Asp Thr Ser Met 210 215 220Phe Ser Ser Ile Gly Leu Leu Gln Asn Asp Ala Phe Val Pro Gln Phe225 230 235 240Gln Tyr Gln Ser Ser Asp Phe Val Asp Ser Phe Gln Asp Pro Phe Glu 245 250 255Gln Lys Pro Phe Leu Asn Trp Asn Phe Ala Pro Gln Gly 260 2651431188DNAArabidopsis thalianaCDS(124)..(978) 143aaaaacctca actttcttct ctcttctcaa aaacccttcc ctcttcgtct ccaaacaaca 60acaaacacaa caacaacaaa aatcttacaa gaagatcatt tttagaaacc ctattaggat 120aaa atg gat tac gag gca tca aga atc gtc gaa atg gta gaa gat gaa 168 Met Asp Tyr Glu Ala Ser Arg Ile Val Glu Met Val Glu Asp Glu 1 5 10 15gaa cat ata gat cta cca cca gga ttc aga ttt cac cct act gat gaa 216Glu His Ile Asp Leu Pro Pro Gly Phe Arg Phe His Pro Thr Asp Glu 20 25 30gaa ctc ata act cac tac ctc aaa cca aag gtt ttc aac act ttc ttc 264Glu Leu Ile Thr His Tyr Leu Lys Pro Lys Val Phe Asn Thr Phe Phe 35 40 45tct gct act gcc att ggt gaa gtt gat ctc aac aag att gag cct tgg 312Ser Ala Thr Ala Ile Gly Glu Val Asp Leu Asn Lys Ile Glu Pro Trp 50 55 60gac tta cca tgg aag gct aag atg gga gaa aaa gaa tgg tat

ttc ttc 360Asp Leu Pro Trp Lys Ala Lys Met Gly Glu Lys Glu Trp Tyr Phe Phe 65 70 75tgt gtg aga gac cgg aaa tac ccg acc ggt tta agg aca aac cgg gcg 408Cys Val Arg Asp Arg Lys Tyr Pro Thr Gly Leu Arg Thr Asn Arg Ala 80 85 90 95aca gaa gcc ggt tat tgg aaa gcc aca gga aaa gac aaa gag ata ttc 456Thr Glu Ala Gly Tyr Trp Lys Ala Thr Gly Lys Asp Lys Glu Ile Phe 100 105 110aag gga aaa tca ctt gtg ggt atg aag aaa act ttg gtt ttc tat aaa 504Lys Gly Lys Ser Leu Val Gly Met Lys Lys Thr Leu Val Phe Tyr Lys 115 120 125gga aga gct cct aaa gga gtt aaa acc aat tgg gtt atg cat gaa tat 552Gly Arg Ala Pro Lys Gly Val Lys Thr Asn Trp Val Met His Glu Tyr 130 135 140cgt tta gaa ggc aaa tat tgt att gaa aat ctt ccc caa aca gct aag 600Arg Leu Glu Gly Lys Tyr Cys Ile Glu Asn Leu Pro Gln Thr Ala Lys 145 150 155aac gaa tgg gtt ata tgt cgt gtt ttc caa aaa cgt gcc gat ggt aca 648Asn Glu Trp Val Ile Cys Arg Val Phe Gln Lys Arg Ala Asp Gly Thr160 165 170 175aag gtt cca atg tca atg ctt gat cca cac att aac cga atg gaa cca 696Lys Val Pro Met Ser Met Leu Asp Pro His Ile Asn Arg Met Glu Pro 180 185 190gcc ggt tta cct tcg tta atg gat tgt tct caa cga gac tcc ttc acc 744Ala Gly Leu Pro Ser Leu Met Asp Cys Ser Gln Arg Asp Ser Phe Thr 195 200 205ggt tcg tcg tct cac gtg acc tgc ttc tcc gac caa gaa acc gaa gac 792Gly Ser Ser Ser His Val Thr Cys Phe Ser Asp Gln Glu Thr Glu Asp 210 215 220aaa aga ctt gtc cac gag tcc aaa gac ggt ttt ggt tct ctg ttt tac 840Lys Arg Leu Val His Glu Ser Lys Asp Gly Phe Gly Ser Leu Phe Tyr 225 230 235tcg gat cct ctg ttt tta caa gac aat tat tcg cta atg aag ctg ttg 888Ser Asp Pro Leu Phe Leu Gln Asp Asn Tyr Ser Leu Met Lys Leu Leu240 245 250 255ctt gac ggt caa gaa act caa ttc tcc ggc aaa cct ttc gac ggt cgt 936Leu Asp Gly Gln Glu Thr Gln Phe Ser Gly Lys Pro Phe Asp Gly Arg 260 265 270gat tcg tcc ggt aca gaa gaa ttg gat tgc gtt tgg aat ttc 978Asp Ser Ser Gly Thr Glu Glu Leu Asp Cys Val Trp Asn Phe 275 280 285tgagttgtat aagttatgtt gtagacttgt agtagtcatg tgttcgtgtg tgtgaatgaa 1038tattcttgtt acattttttt gtaaaaaagg agaaaaaaat atgctagaaa gtcaattgct 1098tttgttatgt agcattagtg ttttttatgt actcaataga cttcctaatt aaataaaaat 1158cttaatttat ttgcaaaaaa aaaaaaaaaa 1188144285PRTArabidopsis thaliana 144Met Asp Tyr Glu Ala Ser Arg Ile Val Glu Met Val Glu Asp Glu Glu 1 5 10 15His Ile Asp Leu Pro Pro Gly Phe Arg Phe His Pro Thr Asp Glu Glu 20 25 30Leu Ile Thr His Tyr Leu Lys Pro Lys Val Phe Asn Thr Phe Phe Ser 35 40 45Ala Thr Ala Ile Gly Glu Val Asp Leu Asn Lys Ile Glu Pro Trp Asp 50 55 60Leu Pro Trp Lys Ala Lys Met Gly Glu Lys Glu Trp Tyr Phe Phe Cys 65 70 75 80Val Arg Asp Arg Lys Tyr Pro Thr Gly Leu Arg Thr Asn Arg Ala Thr 85 90 95Glu Ala Gly Tyr Trp Lys Ala Thr Gly Lys Asp Lys Glu Ile Phe Lys 100 105 110Gly Lys Ser Leu Val Gly Met Lys Lys Thr Leu Val Phe Tyr Lys Gly 115 120 125Arg Ala Pro Lys Gly Val Lys Thr Asn Trp Val Met His Glu Tyr Arg 130 135 140Leu Glu Gly Lys Tyr Cys Ile Glu Asn Leu Pro Gln Thr Ala Lys Asn145 150 155 160Glu Trp Val Ile Cys Arg Val Phe Gln Lys Arg Ala Asp Gly Thr Lys 165 170 175Val Pro Met Ser Met Leu Asp Pro His Ile Asn Arg Met Glu Pro Ala 180 185 190Gly Leu Pro Ser Leu Met Asp Cys Ser Gln Arg Asp Ser Phe Thr Gly 195 200 205Ser Ser Ser His Val Thr Cys Phe Ser Asp Gln Glu Thr Glu Asp Lys 210 215 220Arg Leu Val His Glu Ser Lys Asp Gly Phe Gly Ser Leu Phe Tyr Ser225 230 235 240Asp Pro Leu Phe Leu Gln Asp Asn Tyr Ser Leu Met Lys Leu Leu Leu 245 250 255Asp Gly Gln Glu Thr Gln Phe Ser Gly Lys Pro Phe Asp Gly Arg Asp 260 265 270Ser Ser Gly Thr Glu Glu Leu Asp Cys Val Trp Asn Phe 275 280 285145894DNAArabidopsis thalianaCDS(1)..(891) 145atg ggt gtt aga gag aaa gat ccg tta gcc cag ttg agt ttg cca cca 48Met Gly Val Arg Glu Lys Asp Pro Leu Ala Gln Leu Ser Leu Pro Pro 1 5 10 15ggt ttt aga ttt tat ccg aca gat gaa gag ctt ctt gtt cag tat cta 96Gly Phe Arg Phe Tyr Pro Thr Asp Glu Glu Leu Leu Val Gln Tyr Leu 20 25 30tgt cgg aaa gtt gca ggc tat cat ttc tct ctc cag gtc atc gga gac 144Cys Arg Lys Val Ala Gly Tyr His Phe Ser Leu Gln Val Ile Gly Asp 35 40 45atc gat ctc tac aag ttc gat cct tgg gat ttg cca agt aag gct ttg 192Ile Asp Leu Tyr Lys Phe Asp Pro Trp Asp Leu Pro Ser Lys Ala Leu 50 55 60ttt gga gag aag gaa tgg tat ttc ttt agc cca aga gat cgg aaa tat 240Phe Gly Glu Lys Glu Trp Tyr Phe Phe Ser Pro Arg Asp Arg Lys Tyr 65 70 75 80ccg aac ggg tca aga ccc aat aga gta gcc ggg tcg ggt tat tgg aaa 288Pro Asn Gly Ser Arg Pro Asn Arg Val Ala Gly Ser Gly Tyr Trp Lys 85 90 95gca acg ggt act gac aaa att atc acg gcg gat ggt cgt cgt gtc ggg 336Ala Thr Gly Thr Asp Lys Ile Ile Thr Ala Asp Gly Arg Arg Val Gly 100 105 110att aaa aaa gct ctg gtc ttt tac gcc gga aaa gct ccc aaa ggc act 384Ile Lys Lys Ala Leu Val Phe Tyr Ala Gly Lys Ala Pro Lys Gly Thr 115 120 125aaa acc aac tgg att atg cac gag tat cgc tta ata gaa cat tct cgt 432Lys Thr Asn Trp Ile Met His Glu Tyr Arg Leu Ile Glu His Ser Arg 130 135 140agc cat gga agc tcc aag ttg gat gat tgg gtg ttg tgt cga att tac 480Ser His Gly Ser Ser Lys Leu Asp Asp Trp Val Leu Cys Arg Ile Tyr145 150 155 160aag aaa aca tct gga tct cag aga caa gct gtt act cct gtt caa gct 528Lys Lys Thr Ser Gly Ser Gln Arg Gln Ala Val Thr Pro Val Gln Ala 165 170 175tgt cgt gaa gag cat agc acg aat ggg tcg tca tcg tct tct tca tca 576Cys Arg Glu Glu His Ser Thr Asn Gly Ser Ser Ser Ser Ser Ser Ser 180 185 190cag ctt gac gac gtt ctt gat tcg ttc ccg gag ata aaa gac cag tct 624Gln Leu Asp Asp Val Leu Asp Ser Phe Pro Glu Ile Lys Asp Gln Ser 195 200 205ttt aat ctt cct cgg atg aat tcg ctc agg acg att ctt aac ggg aac 672Phe Asn Leu Pro Arg Met Asn Ser Leu Arg Thr Ile Leu Asn Gly Asn 210 215 220ttt gat tgg gct agc ttg gca ggt ctt aat cca att cca gag cta gct 720Phe Asp Trp Ala Ser Leu Ala Gly Leu Asn Pro Ile Pro Glu Leu Ala225 230 235 240ccg acc aat gga tta ccg agt tac ggt ggt tac gat gcg ttt cga gcg 768Pro Thr Asn Gly Leu Pro Ser Tyr Gly Gly Tyr Asp Ala Phe Arg Ala 245 250 255gcg gaa ggt gag gcg gag agt ggg cat gtg aat cgg cag cag aac tcg 816Ala Glu Gly Glu Ala Glu Ser Gly His Val Asn Arg Gln Gln Asn Ser 260 265 270agc ggg ttg act cag agt ttc ggg tac agc tcg agt ggg ttt ggt gtt 864Ser Gly Leu Thr Gln Ser Phe Gly Tyr Ser Ser Ser Gly Phe Gly Val 275 280 285tcg ggt caa aca ttc gag ttt agg caa tga 894Ser Gly Gln Thr Phe Glu Phe Arg Gln 290 295146297PRTArabidopsis thaliana 146Met Gly Val Arg Glu Lys Asp Pro Leu Ala Gln Leu Ser Leu Pro Pro 1 5 10 15Gly Phe Arg Phe Tyr Pro Thr Asp Glu Glu Leu Leu Val Gln Tyr Leu 20 25 30Cys Arg Lys Val Ala Gly Tyr His Phe Ser Leu Gln Val Ile Gly Asp 35 40 45Ile Asp Leu Tyr Lys Phe Asp Pro Trp Asp Leu Pro Ser Lys Ala Leu 50 55 60Phe Gly Glu Lys Glu Trp Tyr Phe Phe Ser Pro Arg Asp Arg Lys Tyr 65 70 75 80Pro Asn Gly Ser Arg Pro Asn Arg Val Ala Gly Ser Gly Tyr Trp Lys 85 90 95Ala Thr Gly Thr Asp Lys Ile Ile Thr Ala Asp Gly Arg Arg Val Gly 100 105 110Ile Lys Lys Ala Leu Val Phe Tyr Ala Gly Lys Ala Pro Lys Gly Thr 115 120 125Lys Thr Asn Trp Ile Met His Glu Tyr Arg Leu Ile Glu His Ser Arg 130 135 140Ser His Gly Ser Ser Lys Leu Asp Asp Trp Val Leu Cys Arg Ile Tyr145 150 155 160Lys Lys Thr Ser Gly Ser Gln Arg Gln Ala Val Thr Pro Val Gln Ala 165 170 175Cys Arg Glu Glu His Ser Thr Asn Gly Ser Ser Ser Ser Ser Ser Ser 180 185 190Gln Leu Asp Asp Val Leu Asp Ser Phe Pro Glu Ile Lys Asp Gln Ser 195 200 205Phe Asn Leu Pro Arg Met Asn Ser Leu Arg Thr Ile Leu Asn Gly Asn 210 215 220Phe Asp Trp Ala Ser Leu Ala Gly Leu Asn Pro Ile Pro Glu Leu Ala225 230 235 240Pro Thr Asn Gly Leu Pro Ser Tyr Gly Gly Tyr Asp Ala Phe Arg Ala 245 250 255Ala Glu Gly Glu Ala Glu Ser Gly His Val Asn Arg Gln Gln Asn Ser 260 265 270Ser Gly Leu Thr Gln Ser Phe Gly Tyr Ser Ser Ser Gly Phe Gly Val 275 280 285Ser Gly Gln Thr Phe Glu Phe Arg Gln 290 2951471920DNAArabidopsis thalianaCDS(62)..(1645) 147acagaaaaat cgaaactttt tagggttttt tttttttgtg ataacgagag agaaaaaagt 60g atg gac ttg tcg gtt gag aac gga ggt tta gct cca ggt ttt agg ttt 109 Met Asp Leu Ser Val Glu Asn Gly Gly Leu Ala Pro Gly Phe Arg Phe 1 5 10 15cat ccg acg gac gaa gaa ctt gtc gtc tat tat ctc aaa aga aag atc 157His Pro Thr Asp Glu Glu Leu Val Val Tyr Tyr Leu Lys Arg Lys Ile 20 25 30 cgt cgg aaa aaa ctc aga gtc gaa gca atc ggc gag act gat gtc tat 205Arg Arg Lys Lys Leu Arg Val Glu Ala Ile Gly Glu Thr Asp Val Tyr 35 40 45aag ttt gat cct gag gaa tta cct gag aaa gcg ttg tat aag act aga 253Lys Phe Asp Pro Glu Glu Leu Pro Glu Lys Ala Leu Tyr Lys Thr Arg 50 55 60gat cgt caa tgg ttc ttt ttc agc tta agg gat agg aaa cat gga agt 301Asp Arg Gln Trp Phe Phe Phe Ser Leu Arg Asp Arg Lys His Gly Ser65 70 75 80agg tca agt aga gct act gaa cgt ggc tat tgg aaa gca aca ggg aag 349Arg Ser Ser Arg Ala Thr Glu Arg Gly Tyr Trp Lys Ala Thr Gly Lys 85 90 95gat aga gtc att cat tgt gat tcg aga ccc gtt gga gag aag aag act 397Asp Arg Val Ile His Cys Asp Ser Arg Pro Val Gly Glu Lys Lys Thr 100 105 110ctt gtt ttc cat aga ggc agg gca cct aat ggc gaa cgg act aat tgg 445Leu Val Phe His Arg Gly Arg Ala Pro Asn Gly Glu Arg Thr Asn Trp 115 120 125gtg atg cat gag tat aca ttg cac aaa gag gag ctc aag agg tgt ggt 493Val Met His Glu Tyr Thr Leu His Lys Glu Glu Leu Lys Arg Cys Gly 130 135 140ggt gaa gat gtt aag gat gct tat gtt ctt tac aag att tat aag aaa 541Gly Glu Asp Val Lys Asp Ala Tyr Val Leu Tyr Lys Ile Tyr Lys Lys145 150 155 160agt ggg tct ggt cct aag aat ggt gag caa tat gga gct cct ttt att 589Ser Gly Ser Gly Pro Lys Asn Gly Glu Gln Tyr Gly Ala Pro Phe Ile 165 170 175gaa gaa gaa tgg gct gaa gat gat gat gat gat gtt gat gag cct gct 637Glu Glu Glu Trp Ala Glu Asp Asp Asp Asp Asp Val Asp Glu Pro Ala 180 185 190aat cag ctc gtt gtt tcg gct agt gtt gat aat agt tta tgg ggg aaa 685Asn Gln Leu Val Val Ser Ala Ser Val Asp Asn Ser Leu Trp Gly Lys 195 200 205ggg ctt aac caa tct gaa ttg gat gat aat gat att gaa gag ctg atg 733Gly Leu Asn Gln Ser Glu Leu Asp Asp Asn Asp Ile Glu Glu Leu Met 210 215 220 agt cag gtt aga gat cag tct ggt cca aca ttg cag cag aat ggg gtg 781Ser Gln Val Arg Asp Gln Ser Gly Pro Thr Leu Gln Gln Asn Gly Val225 230 235 240tct gga ctg aac tct cat gta gac acg tat aat ctg gag aac ctg gag 829Ser Gly Leu Asn Ser His Val Asp Thr Tyr Asn Leu Glu Asn Leu Glu 245 250 255gaa gat atg tat ttg gaa atc aat gat ctt atg gaa cct gaa cct gga 877Glu Asp Met Tyr Leu Glu Ile Asn Asp Leu Met Glu Pro Glu Pro Gly 260 265 270cca act tct gtg gaa gtc atg gag aat aac tgg aac gag gat ggt tct 925Pro Thr Ser Val Glu Val Met Glu Asn Asn Trp Asn Glu Asp Gly Ser 275 280 285ggt ctc ctg aat gat gat gat ttc gtt ggt gct gat tca tat ttc ctt 973Gly Leu Leu Asn Asp Asp Asp Phe Val Gly Ala Asp Ser Tyr Phe Leu 290 295 300gat ttg gga gtg aca aat cct cag tta gat ttt gtt agt ggt gat ttg 1021Asp Leu Gly Val Thr Asn Pro Gln Leu Asp Phe Val Ser Gly Asp Leu305 310 315 320aaa aat ggg ttt gca caa agt ctt cag gtg aat act tct tta atg act 1069Lys Asn Gly Phe Ala Gln Ser Leu Gln Val Asn Thr Ser Leu Met Thr 325 330 335tac cag gcc aat aat aac cag ttc cag cag caa tca ggg aag aac caa 1117Tyr Gln Ala Asn Asn Asn Gln Phe Gln Gln Gln Ser Gly Lys Asn Gln 340 345 350 gct agt aac tgg cca ctc cgt aac agc tat acc aga cag ata aac aat 1165Ala Ser Asn Trp Pro Leu Arg Asn Ser Tyr Thr Arg Gln Ile Asn Asn 355 360 365gga tca tcg tgg gtg cag gag cta aac aat gac gga ctt acc gtt acc 1213Gly Ser Ser Trp Val Gln Glu Leu Asn Asn Asp Gly Leu Thr Val Thr 370 375 380cgg ttt ggt gag gcg cct ggt aca ggt gat tca tct gaa ttc cta aac 1261Arg Phe Gly Glu Ala Pro Gly Thr Gly Asp Ser Ser Glu Phe Leu Asn385 390 395 400cct gtt cct tct ggt ata agt aca act aat gaa gat gac ccg tca aaa 1309Pro Val Pro Ser Gly Ile Ser Thr Thr Asn Glu Asp Asp Pro Ser Lys 405 410 415gac gag tct agt aag ttt gct tct agt gta tgg act ttc ctg gaa tcc 1357Asp Glu Ser Ser Lys Phe Ala Ser Ser Val Trp Thr Phe Leu Glu Ser 420 425 430att cct gca aag cca gca tat gca tca gag aat cca ttt gtg aag ctg 1405Ile Pro Ala Lys Pro Ala Tyr Ala Ser Glu Asn Pro Phe Val Lys Leu 435 440 445aac ctt gtt aga atg tca acc agt ggt ggt cgt ttc agg ttt act tct 1453Asn Leu Val Arg Met Ser Thr Ser Gly Gly Arg Phe Arg Phe Thr Ser 450 455 460aaa agc aca ggt aat aat gtt gtt gtt atg gat agt gac tca gca gtg 1501Lys Ser Thr Gly Asn Asn Val Val Val Met Asp Ser Asp Ser Ala Val465 470 475 480aag agg aac aag tct gga gga aac aac gat aag aag aag aag aag aac 1549Lys Arg Asn Lys Ser Gly Gly Asn Asn Asp Lys Lys Lys Lys Lys Asn 485 490 495aaa ggt ttc ttt tgc tta tcg atc att ggg gct tta tgt gct ttg ttt 1597Lys Gly Phe Phe Cys Leu Ser Ile Ile Gly Ala Leu Cys Ala Leu Phe 500 505 510tgg gtg atc ata gga aca atg gga ggt tca ggg agg cct ttg tta tgg 1645Trp Val Ile Ile Gly Thr Met Gly Gly Ser Gly Arg Pro Leu Leu Trp 515 520 525tgagaaccga aaaatccaag aggttaagag acataaaggc ttggttttgt gtgaaccatt 1705agagagtcaa gtcattgtaa ttattctctt ggattattag attcagaagc tgtttagtat 1765cacagtttat gcttggaagt tttctctggt attgttaaaa aagtaccaat agaaataagc 1825aaaagaattt tcttatcttt taggatattt gaacaaatga tgttacttaa ctagattatt 1885aacttggagg ttgttgtaca aaaaaaaaaa aaaaa 1920148528PRTArabidopsis thaliana 148Met Asp Leu Ser Val Glu Asn Gly Gly Leu Ala Pro Gly Phe Arg Phe 1 5 10 15His Pro Thr Asp Glu Glu Leu Val Val Tyr Tyr Leu Lys Arg Lys Ile 20 25 30Arg Arg Lys Lys Leu Arg Val Glu Ala Ile Gly Glu Thr Asp Val Tyr 35 40 45Lys Phe Asp Pro Glu Glu Leu Pro Glu Lys Ala Leu Tyr Lys Thr Arg 50 55 60Asp Arg Gln Trp Phe Phe Phe Ser Leu Arg Asp Arg Lys His Gly Ser 65 70 75 80Arg Ser Ser Arg Ala Thr Glu Arg Gly Tyr Trp Lys Ala Thr

Gly Lys 85 90 95Asp Arg Val Ile His Cys Asp Ser Arg Pro Val Gly Glu Lys Lys Thr 100 105 110Leu Val Phe His Arg Gly Arg Ala Pro Asn Gly Glu Arg Thr Asn Trp 115 120 125Val Met His Glu Tyr Thr Leu His Lys Glu Glu Leu Lys Arg Cys Gly 130 135 140Gly Glu Asp Val Lys Asp Ala Tyr Val Leu Tyr Lys Ile Tyr Lys Lys145 150 155 160Ser Gly Ser Gly Pro Lys Asn Gly Glu Gln Tyr Gly Ala Pro Phe Ile 165 170 175Glu Glu Glu Trp Ala Glu Asp Asp Asp Asp Asp Val Asp Glu Pro Ala 180 185 190Asn Gln Leu Val Val Ser Ala Ser Val Asp Asn Ser Leu Trp Gly Lys 195 200 205Gly Leu Asn Gln Ser Glu Leu Asp Asp Asn Asp Ile Glu Glu Leu Met 210 215 220Ser Gln Val Arg Asp Gln Ser Gly Pro Thr Leu Gln Gln Asn Gly Val225 230 235 240Ser Gly Leu Asn Ser His Val Asp Thr Tyr Asn Leu Glu Asn Leu Glu 245 250 255Glu Asp Met Tyr Leu Glu Ile Asn Asp Leu Met Glu Pro Glu Pro Gly 260 265 270Pro Thr Ser Val Glu Val Met Glu Asn Asn Trp Asn Glu Asp Gly Ser 275 280 285Gly Leu Leu Asn Asp Asp Asp Phe Val Gly Ala Asp Ser Tyr Phe Leu 290 295 300Asp Leu Gly Val Thr Asn Pro Gln Leu Asp Phe Val Ser Gly Asp Leu305 310 315 320Lys Asn Gly Phe Ala Gln Ser Leu Gln Val Asn Thr Ser Leu Met Thr 325 330 335Tyr Gln Ala Asn Asn Asn Gln Phe Gln Gln Gln Ser Gly Lys Asn Gln 340 345 350Ala Ser Asn Trp Pro Leu Arg Asn Ser Tyr Thr Arg Gln Ile Asn Asn 355 360 365Gly Ser Ser Trp Val Gln Glu Leu Asn Asn Asp Gly Leu Thr Val Thr 370 375 380Arg Phe Gly Glu Ala Pro Gly Thr Gly Asp Ser Ser Glu Phe Leu Asn385 390 395 400Pro Val Pro Ser Gly Ile Ser Thr Thr Asn Glu Asp Asp Pro Ser Lys 405 410 415Asp Glu Ser Ser Lys Phe Ala Ser Ser Val Trp Thr Phe Leu Glu Ser 420 425 430Ile Pro Ala Lys Pro Ala Tyr Ala Ser Glu Asn Pro Phe Val Lys Leu 435 440 445Asn Leu Val Arg Met Ser Thr Ser Gly Gly Arg Phe Arg Phe Thr Ser 450 455 460Lys Ser Thr Gly Asn Asn Val Val Val Met Asp Ser Asp Ser Ala Val465 470 475 480Lys Arg Asn Lys Ser Gly Gly Asn Asn Asp Lys Lys Lys Lys Lys Asn 485 490 495Lys Gly Phe Phe Cys Leu Ser Ile Ile Gly Ala Leu Cys Ala Leu Phe 500 505 510Trp Val Ile Ile Gly Thr Met Gly Gly Ser Gly Arg Pro Leu Leu Trp 515 520 5251491219DNAArabidopsis thalianaCDS(177)..(950) 149atgctaagga gccctcccaa aaaagaacaa caaatcacat ttttatataa ctgttaacat 60aataatctca gcctcatcaa cacacatata tagatagcca acatcacaca aacatagaga 120ttccaaaaaa taaaaataaa gaaaacataa atcctctgag gaaaaattcc gatgag atg 179 Met 1aca gaa ggt gga gaa tat tct ccg gcg atg atg tca gca gag cca ttc 227Thr Glu Gly Gly Glu Tyr Ser Pro Ala Met Met Ser Ala Glu Pro Phe 5 10 15ttg acc atg aag aag atg aag aag agc aac cac aac aag aac aat cag 275Leu Thr Met Lys Lys Met Lys Lys Ser Asn His Asn Lys Asn Asn Gln 20 25 30aga agg ttt agc gac gag cag atc aag tca ctg gag atg atg ttt gag 323Arg Arg Phe Ser Asp Glu Gln Ile Lys Ser Leu Glu Met Met Phe Glu 35 40 45tct gag aca agg ctt gag cca agg aag aag gtt caa tta gct aga gag 371Ser Glu Thr Arg Leu Glu Pro Arg Lys Lys Val Gln Leu Ala Arg Glu50 55 60 65cta ggg ttg cag ccg agg caa gtg gct ata tgg ttt cag aac aag agg 419Leu Gly Leu Gln Pro Arg Gln Val Ala Ile Trp Phe Gln Asn Lys Arg 70 75 80gct cgt tgg aaa tcc aag cag ctc gag act gag tac aac att ctc aga 467Ala Arg Trp Lys Ser Lys Gln Leu Glu Thr Glu Tyr Asn Ile Leu Arg 85 90 95caa aac tac gac aac ttg gct tct cag ttc gag tcc tta aag aaa gaa 515Gln Asn Tyr Asp Asn Leu Ala Ser Gln Phe Glu Ser Leu Lys Lys Glu 100 105 110aaa caa gct tta gtc tct gag ttg cag agg cta aaa gag gcg acg caa 563Lys Gln Ala Leu Val Ser Glu Leu Gln Arg Leu Lys Glu Ala Thr Gln 115 120 125aag aag aca cag gag gag gaa agg cag tgt agt gga gat caa gcg gtg 611Lys Lys Thr Gln Glu Glu Glu Arg Gln Cys Ser Gly Asp Gln Ala Val130 135 140 145gtt gct cta agc agc aca cat cat gaa tca gaa aac gaa gag aac cgg 659Val Ala Leu Ser Ser Thr His His Glu Ser Glu Asn Glu Glu Asn Arg 150 155 160agg cgt aaa ccg gaa gag gtt aga ccg gag atg gag atg aaa gat gat 707Arg Arg Lys Pro Glu Glu Val Arg Pro Glu Met Glu Met Lys Asp Asp 165 170 175aag ggt cat cat ggg gtt atg tgt gat cat cat gat tat gaa gat gat 755Lys Gly His His Gly Val Met Cys Asp His His Asp Tyr Glu Asp Asp 180 185 190gat aat ggt tat agt aac aac atc aag aga gag tat ttt ggt ggg ttt 803Asp Asn Gly Tyr Ser Asn Asn Ile Lys Arg Glu Tyr Phe Gly Gly Phe 195 200 205gag gaa gaa cca gat cac tta atg aac att gtt gaa cca gct gat agt 851Glu Glu Glu Pro Asp His Leu Met Asn Ile Val Glu Pro Ala Asp Ser210 215 220 225tgt ttg aca tca tct gat gat tgg aga ggt ttc aaa tca gat act act 899Cys Leu Thr Ser Ser Asp Asp Trp Arg Gly Phe Lys Ser Asp Thr Thr 230 235 240act ctc ttg gac caa tcc agc aac aat tac cct tgg cgg gat ttt tgg 947Thr Leu Leu Asp Gln Ser Ser Asn Asn Tyr Pro Trp Arg Asp Phe Trp 245 250 255tca tgaaaacaat aaactctaaa caagaagatg aaacagattg agactaaaga 1000Serttggatatat acatattcaa atcgaaattt accggtctac atcgcatgaa ccgagccacg 1060gatatagaga tattcggtcc agcaaatgac tcgtttctca gcgagaattt tgcaggattt 1120tgagctgaaa ttgtatggtt ttgtctgtat aaatgatgtg tttagaaaga cgtatattct 1180caaaaaaaaa aaaaaaaaaa aagaaaaaaa aaaaaaaaa 1219150258PRTArabidopsis thaliana 150Met Thr Glu Gly Gly Glu Tyr Ser Pro Ala Met Met Ser Ala Glu Pro 1 5 10 15Phe Leu Thr Met Lys Lys Met Lys Lys Ser Asn His Asn Lys Asn Asn 20 25 30Gln Arg Arg Phe Ser Asp Glu Gln Ile Lys Ser Leu Glu Met Met Phe 35 40 45Glu Ser Glu Thr Arg Leu Glu Pro Arg Lys Lys Val Gln Leu Ala Arg 50 55 60Glu Leu Gly Leu Gln Pro Arg Gln Val Ala Ile Trp Phe Gln Asn Lys 65 70 75 80Arg Ala Arg Trp Lys Ser Lys Gln Leu Glu Thr Glu Tyr Asn Ile Leu 85 90 95Arg Gln Asn Tyr Asp Asn Leu Ala Ser Gln Phe Glu Ser Leu Lys Lys 100 105 110Glu Lys Gln Ala Leu Val Ser Glu Leu Gln Arg Leu Lys Glu Ala Thr 115 120 125Gln Lys Lys Thr Gln Glu Glu Glu Arg Gln Cys Ser Gly Asp Gln Ala 130 135 140Val Val Ala Leu Ser Ser Thr His His Glu Ser Glu Asn Glu Glu Asn145 150 155 160Arg Arg Arg Lys Pro Glu Glu Val Arg Pro Glu Met Glu Met Lys Asp 165 170 175Asp Lys Gly His His Gly Val Met Cys Asp His His Asp Tyr Glu Asp 180 185 190Asp Asp Asn Gly Tyr Ser Asn Asn Ile Lys Arg Glu Tyr Phe Gly Gly 195 200 205Phe Glu Glu Glu Pro Asp His Leu Met Asn Ile Val Glu Pro Ala Asp 210 215 220Ser Cys Leu Thr Ser Ser Asp Asp Trp Arg Gly Phe Lys Ser Asp Thr225 230 235 240Thr Thr Leu Leu Asp Gln Ser Ser Asn Asn Tyr Pro Trp Arg Asp Phe 245 250 255Trp Ser151978DNAArabidopsis thalianaCDS(89)..(793) 151aaagaaagaa agaaaaaaaa gaaacaaata attccaaaac cttctctctt aatcaaaatc 60aagaaactta caagatctgg tgaaaacc atg gaa gaa gga gat ttt ttc aac 112 Met Glu Glu Gly Asp Phe Phe Asn 1 5tgc tgt ttc agc gag att agt agt ggc atg acc atg aat aag aag aag 160Cys Cys Phe Ser Glu Ile Ser Ser Gly Met Thr Met Asn Lys Lys Lys 10 15 20atg aag aag agc aat aac caa aag agg ttt agc gag gaa cag atc aag 208Met Lys Lys Ser Asn Asn Gln Lys Arg Phe Ser Glu Glu Gln Ile Lys 25 30 35 40tca ctt gag ctt ata ttt gag tct gag acg agg ctt gag ccg agg aag 256Ser Leu Glu Leu Ile Phe Glu Ser Glu Thr Arg Leu Glu Pro Arg Lys 45 50 55aag gtt cag gta gct aga gag cta ggg ctg caa cca aga caa gtg gct 304Lys Val Gln Val Ala Arg Glu Leu Gly Leu Gln Pro Arg Gln Val Ala 60 65 70ata tgg ttt caa aac aag agg gct cga tgg aaa act aag caa ctt gag 352Ile Trp Phe Gln Asn Lys Arg Ala Arg Trp Lys Thr Lys Gln Leu Glu 75 80 85aaa gag tat aac act ctt aga gcc aat tac aac aat ttg gct tca caa 400Lys Glu Tyr Asn Thr Leu Arg Ala Asn Tyr Asn Asn Leu Ala Ser Gln 90 95 100ttt gaa atc atg aag aaa gaa aag caa tct ctg gtc tct gag ctg cag 448Phe Glu Ile Met Lys Lys Glu Lys Gln Ser Leu Val Ser Glu Leu Gln105 110 115 120aga cta aac gaa gag atg caa agg cct aaa gaa gaa aag cat cat gag 496Arg Leu Asn Glu Glu Met Gln Arg Pro Lys Glu Glu Lys His His Glu 125 130 135tgt tgt ggt gat caa gga ctg gct cta agc agc agc aca gag tcg cat 544Cys Cys Gly Asp Gln Gly Leu Ala Leu Ser Ser Ser Thr Glu Ser His 140 145 150aat gga aag agt gag cca gaa ggg agg tta gac caa ggg agt gtt cta 592Asn Gly Lys Ser Glu Pro Glu Gly Arg Leu Asp Gln Gly Ser Val Leu 155 160 165tgt aat gat ggt gat tac aac aac aac att aaa aca gag tat ttt ggg 640Cys Asn Asp Gly Asp Tyr Asn Asn Asn Ile Lys Thr Glu Tyr Phe Gly 170 175 180ttc gag gaa gag act gat cat gag ctg atg aac att gtg gag aaa gct 688Phe Glu Glu Glu Thr Asp His Glu Leu Met Asn Ile Val Glu Lys Ala185 190 195 200gat gat agt tgc ttg aca tct tct gag aat tgg gga ggt ttc aat tct 736Asp Asp Ser Cys Leu Thr Ser Ser Glu Asn Trp Gly Gly Phe Asn Ser 205 210 215gat tct ctc tta gac caa tct agc agc aat tac cct aac tgg tgg gag 784Asp Ser Leu Leu Asp Gln Ser Ser Ser Asn Tyr Pro Asn Trp Trp Glu 220 225 230ttt tgg tca taaaagcata taagaaaaaa acagaacata agcgaagaga 833Phe Trp Ser 235aagagtgtga atagtttgta aattatgtgt taagaaaaat aaatttagtt tagtttaaat 893cttgtttcga tctatgtatc tactatgttc aatactcttt gtagctaatt agtagcttat 953aatgagacta gaaaagtttt gaagc 978152235PRTArabidopsis thaliana 152Met Glu Glu Gly Asp Phe Phe Asn Cys Cys Phe Ser Glu Ile Ser Ser 1 5 10 15Gly Met Thr Met Asn Lys Lys Lys Met Lys Lys Ser Asn Asn Gln Lys 20 25 30Arg Phe Ser Glu Glu Gln Ile Lys Ser Leu Glu Leu Ile Phe Glu Ser 35 40 45Glu Thr Arg Leu Glu Pro Arg Lys Lys Val Gln Val Ala Arg Glu Leu 50 55 60Gly Leu Gln Pro Arg Gln Val Ala Ile Trp Phe Gln Asn Lys Arg Ala 65 70 75 80Arg Trp Lys Thr Lys Gln Leu Glu Lys Glu Tyr Asn Thr Leu Arg Ala 85 90 95Asn Tyr Asn Asn Leu Ala Ser Gln Phe Glu Ile Met Lys Lys Glu Lys 100 105 110Gln Ser Leu Val Ser Glu Leu Gln Arg Leu Asn Glu Glu Met Gln Arg 115 120 125Pro Lys Glu Glu Lys His His Glu Cys Cys Gly Asp Gln Gly Leu Ala 130 135 140Leu Ser Ser Ser Thr Glu Ser His Asn Gly Lys Ser Glu Pro Glu Gly145 150 155 160Arg Leu Asp Gln Gly Ser Val Leu Cys Asn Asp Gly Asp Tyr Asn Asn 165 170 175Asn Ile Lys Thr Glu Tyr Phe Gly Phe Glu Glu Glu Thr Asp His Glu 180 185 190Leu Met Asn Ile Val Glu Lys Ala Asp Asp Ser Cys Leu Thr Ser Ser 195 200 205Glu Asn Trp Gly Gly Phe Asn Ser Asp Ser Leu Leu Asp Gln Ser Ser 210 215 220Ser Asn Tyr Pro Asn Trp Trp Glu Phe Trp Ser225 230 2351532043DNAArabidopsis thalianaCDS(1)..(2040) 153atg gct gct tac ttt cac gga aac cca ccg gag atc tct gcc gga tcc 48Met Ala Ala Tyr Phe His Gly Asn Pro Pro Glu Ile Ser Ala Gly Ser 1 5 10 15gac ggt ggt ctt caa acg ttg atc ctc atg aat cca act act tac gtt 96Asp Gly Gly Leu Gln Thr Leu Ile Leu Met Asn Pro Thr Thr Tyr Val 20 25 30cag tac acc caa caa gac aac gac tcg aac aac aac aac aac agc aac 144Gln Tyr Thr Gln Gln Asp Asn Asp Ser Asn Asn Asn Asn Asn Ser Asn 35 40 45aat agc aac aac aac aac aca aac aca aac aca aac aac aac aac agt 192Asn Ser Asn Asn Asn Asn Thr Asn Thr Asn Thr Asn Asn Asn Asn Ser 50 55 60agt ttc gtt ttc ctc gat tcc cac gcg ccg cag cca aac gcg agc cag 240Ser Phe Val Phe Leu Asp Ser His Ala Pro Gln Pro Asn Ala Ser Gln 65 70 75 80cag ttc gtc gga ata cca ctc tca ggt cac gaa gct gct tcc att aca 288Gln Phe Val Gly Ile Pro Leu Ser Gly His Glu Ala Ala Ser Ile Thr 85 90 95gcc gcc gac aac atc tcc gta ctt cac ggt tat cct ccg cgc gtg cag 336Ala Ala Asp Asn Ile Ser Val Leu His Gly Tyr Pro Pro Arg Val Gln 100 105 110tac agt ctc tac ggt agc cac caa gtg gat ccc act cac cag caa gcc 384Tyr Ser Leu Tyr Gly Ser His Gln Val Asp Pro Thr His Gln Gln Ala 115 120 125gcg tgt gag acg cca cgc gcg cag caa ggc ctc tct tta acc ctc tcg 432Ala Cys Glu Thr Pro Arg Ala Gln Gln Gly Leu Ser Leu Thr Leu Ser 130 135 140tct caa cag cag cag caa cag caa cat cat caa caa cac cag cct att 480Ser Gln Gln Gln Gln Gln Gln Gln His His Gln Gln His Gln Pro Ile145 150 155 160cac gtc gga ttc ggg tcc gga cat gga gaa gat atc cgg gtc ggg tct 528His Val Gly Phe Gly Ser Gly His Gly Glu Asp Ile Arg Val Gly Ser 165 170 175ggc tct aca gga tcg ggg gta aca aac ggt ata gct aat ctt gtt agc 576Gly Ser Thr Gly Ser Gly Val Thr Asn Gly Ile Ala Asn Leu Val Ser 180 185 190tcc aag tac ttg aag gca gca caa gag ctt ctt gac gaa gta gtc aac 624Ser Lys Tyr Leu Lys Ala Ala Gln Glu Leu Leu Asp Glu Val Val Asn 195 200 205gct gat tcc gat gac atg aac gct aaa tcc caa cta ttc tca tcg aaa 672Ala Asp Ser Asp Asp Met Asn Ala Lys Ser Gln Leu Phe Ser Ser Lys 210 215 220aag ggt agt tgc gga aat gat aaa cct gtc gga gaa tca tcg gcc ggc 720Lys Gly Ser Cys Gly Asn Asp Lys Pro Val Gly Glu Ser Ser Ala Gly225 230 235 240gct gga gga gaa ggt tcc ggt ggc gga gca gaa gca gcc ggg aaa cgt 768Ala Gly Gly Glu Gly Ser Gly Gly Gly Ala Glu Ala Ala Gly Lys Arg 245 250 255ccg gtg gag cta ggc acg gca gag aga caa gaa ata cag atg aag aaa 816Pro Val Glu Leu Gly Thr Ala Glu Arg Gln Glu Ile Gln Met Lys Lys 260 265 270gca aaa ctt agt aac atg ctt cat gag gtg gag cag aga tat aga cag 864Ala Lys Leu Ser Asn Met Leu His Glu Val Glu Gln Arg Tyr Arg Gln 275 280 285tac cac cag cag atg cag atg gtg atc tct tcg ttc gag caa gcg gca 912Tyr His Gln Gln Met Gln Met Val Ile Ser Ser Phe Glu Gln Ala Ala 290 295 300ggg ata gga tca gcg aag tca tac acg tcg cta gca ttg aaa acc ata 960Gly Ile Gly Ser Ala Lys Ser Tyr Thr Ser Leu Ala Leu Lys Thr Ile305 310 315 320tca aga cag ttc cgt tgc ttg aaa gag gcg atc gct ggt cag ata aaa 1008Ser Arg Gln Phe Arg Cys Leu Lys Glu Ala Ile Ala Gly Gln Ile Lys 325 330 335gcg gcc aac aag agt ctt ggg gag gaa gat tca gtg tct ggt gtt ggg 1056Ala Ala Asn Lys Ser Leu Gly Glu Glu Asp Ser Val Ser Gly Val Gly 340

345 350agg ttt gag ggg tcg agg ctc aag ttc gtg gac cac cac ttg aga cag 1104Arg Phe Glu Gly Ser Arg Leu Lys Phe Val Asp His His Leu Arg Gln 355 360 365caa aga gct ctt caa caa ctg gga atg att caa cat cct tcc aat aat 1152Gln Arg Ala Leu Gln Gln Leu Gly Met Ile Gln His Pro Ser Asn Asn 370 375 380gct tgg aga cct caa cgt ggt ctc cca gaa cga gcc gtc tca gtt ctc 1200Ala Trp Arg Pro Gln Arg Gly Leu Pro Glu Arg Ala Val Ser Val Leu385 390 395 400cgt gct tgg ctc ttc gaa cac ttt ctt cat cca tac cct aag gat tcg 1248Arg Ala Trp Leu Phe Glu His Phe Leu His Pro Tyr Pro Lys Asp Ser 405 410 415gac aag cac atg cta gct aag caa aca gga ctc act cgt agc cag gtg 1296Asp Lys His Met Leu Ala Lys Gln Thr Gly Leu Thr Arg Ser Gln Val 420 425 430tcg aac tgg ttt ata aac gcg aga gtt cgg tta tgg aaa cca atg gtg 1344Ser Asn Trp Phe Ile Asn Ala Arg Val Arg Leu Trp Lys Pro Met Val 435 440 445gag gag atg tac atg gag gaa atg aag gag cag gca aag aac atg gga 1392Glu Glu Met Tyr Met Glu Glu Met Lys Glu Gln Ala Lys Asn Met Gly 450 455 460tcc atg gaa aag act cct ttg gat caa agc aac gaa gat tct gct tca 1440Ser Met Glu Lys Thr Pro Leu Asp Gln Ser Asn Glu Asp Ser Ala Ser465 470 475 480aag tca aca agt aac caa gaa aag agc cca atg gcg gac act aat tac 1488Lys Ser Thr Ser Asn Gln Glu Lys Ser Pro Met Ala Asp Thr Asn Tyr 485 490 495cat atg aat ccc aat cac aac ggt gac cta gaa ggc gtc act gga atg 1536His Met Asn Pro Asn His Asn Gly Asp Leu Glu Gly Val Thr Gly Met 500 505 510caa gga agc ccc aag aga cta aga acc agc gac gag aca atg atg cag 1584Gln Gly Ser Pro Lys Arg Leu Arg Thr Ser Asp Glu Thr Met Met Gln 515 520 525cca ata aat gcg gat ttc agc tcc aac gag aag ctc acg atg aag att 1632Pro Ile Asn Ala Asp Phe Ser Ser Asn Glu Lys Leu Thr Met Lys Ile 530 535 540cta gaa gaa cgg caa ggg ata aga tca gat ggt ggc tac cct ttc atg 1680Leu Glu Glu Arg Gln Gly Ile Arg Ser Asp Gly Gly Tyr Pro Phe Met545 550 555 560ggt aat ttc ggg caa tac caa atg gat gag atg tca aga ttt gat gta 1728Gly Asn Phe Gly Gln Tyr Gln Met Asp Glu Met Ser Arg Phe Asp Val 565 570 575gtc tca gac cag gag ctc atg gcg caa agg tac tca gga aac aac aat 1776Val Ser Asp Gln Glu Leu Met Ala Gln Arg Tyr Ser Gly Asn Asn Asn 580 585 590ggc gtg tcc ctc acg tta ggt tta cct cat tgt gat agc ttg tcg tcc 1824Gly Val Ser Leu Thr Leu Gly Leu Pro His Cys Asp Ser Leu Ser Ser 595 600 605acg cac cat cag ggt ttc atg cag acc cac cat ggg att cct ata ggg 1872Thr His His Gln Gly Phe Met Gln Thr His His Gly Ile Pro Ile Gly 610 615 620aga aga gtg aaa ata gga gaa aca gag gaa tat gga ccc gcc acc atc 1920Arg Arg Val Lys Ile Gly Glu Thr Glu Glu Tyr Gly Pro Ala Thr Ile625 630 635 640aat ggt ggt agc tcg acc aca acc gca cat tca tca gcg gca gct gcc 1968Asn Gly Gly Ser Ser Thr Thr Thr Ala His Ser Ser Ala Ala Ala Ala 645 650 655gcg gct tac aat ggg atg aac ata cag aac cag aag aga tat gtg gct 2016Ala Ala Tyr Asn Gly Met Asn Ile Gln Asn Gln Lys Arg Tyr Val Ala 660 665 670cag tta ttg ccc gac ttc gtt gca taa 2043Gln Leu Leu Pro Asp Phe Val Ala 675 680154680PRTArabidopsis thaliana 154Met Ala Ala Tyr Phe His Gly Asn Pro Pro Glu Ile Ser Ala Gly Ser 1 5 10 15Asp Gly Gly Leu Gln Thr Leu Ile Leu Met Asn Pro Thr Thr Tyr Val 20 25 30Gln Tyr Thr Gln Gln Asp Asn Asp Ser Asn Asn Asn Asn Asn Ser Asn 35 40 45Asn Ser Asn Asn Asn Asn Thr Asn Thr Asn Thr Asn Asn Asn Asn Ser 50 55 60Ser Phe Val Phe Leu Asp Ser His Ala Pro Gln Pro Asn Ala Ser Gln 65 70 75 80Gln Phe Val Gly Ile Pro Leu Ser Gly His Glu Ala Ala Ser Ile Thr 85 90 95Ala Ala Asp Asn Ile Ser Val Leu His Gly Tyr Pro Pro Arg Val Gln 100 105 110Tyr Ser Leu Tyr Gly Ser His Gln Val Asp Pro Thr His Gln Gln Ala 115 120 125Ala Cys Glu Thr Pro Arg Ala Gln Gln Gly Leu Ser Leu Thr Leu Ser 130 135 140Ser Gln Gln Gln Gln Gln Gln Gln His His Gln Gln His Gln Pro Ile145 150 155 160His Val Gly Phe Gly Ser Gly His Gly Glu Asp Ile Arg Val Gly Ser 165 170 175Gly Ser Thr Gly Ser Gly Val Thr Asn Gly Ile Ala Asn Leu Val Ser 180 185 190Ser Lys Tyr Leu Lys Ala Ala Gln Glu Leu Leu Asp Glu Val Val Asn 195 200 205Ala Asp Ser Asp Asp Met Asn Ala Lys Ser Gln Leu Phe Ser Ser Lys 210 215 220Lys Gly Ser Cys Gly Asn Asp Lys Pro Val Gly Glu Ser Ser Ala Gly225 230 235 240Ala Gly Gly Glu Gly Ser Gly Gly Gly Ala Glu Ala Ala Gly Lys Arg 245 250 255Pro Val Glu Leu Gly Thr Ala Glu Arg Gln Glu Ile Gln Met Lys Lys 260 265 270Ala Lys Leu Ser Asn Met Leu His Glu Val Glu Gln Arg Tyr Arg Gln 275 280 285Tyr His Gln Gln Met Gln Met Val Ile Ser Ser Phe Glu Gln Ala Ala 290 295 300Gly Ile Gly Ser Ala Lys Ser Tyr Thr Ser Leu Ala Leu Lys Thr Ile305 310 315 320Ser Arg Gln Phe Arg Cys Leu Lys Glu Ala Ile Ala Gly Gln Ile Lys 325 330 335Ala Ala Asn Lys Ser Leu Gly Glu Glu Asp Ser Val Ser Gly Val Gly 340 345 350Arg Phe Glu Gly Ser Arg Leu Lys Phe Val Asp His His Leu Arg Gln 355 360 365Gln Arg Ala Leu Gln Gln Leu Gly Met Ile Gln His Pro Ser Asn Asn 370 375 380Ala Trp Arg Pro Gln Arg Gly Leu Pro Glu Arg Ala Val Ser Val Leu385 390 395 400Arg Ala Trp Leu Phe Glu His Phe Leu His Pro Tyr Pro Lys Asp Ser 405 410 415Asp Lys His Met Leu Ala Lys Gln Thr Gly Leu Thr Arg Ser Gln Val 420 425 430Ser Asn Trp Phe Ile Asn Ala Arg Val Arg Leu Trp Lys Pro Met Val 435 440 445Glu Glu Met Tyr Met Glu Glu Met Lys Glu Gln Ala Lys Asn Met Gly 450 455 460Ser Met Glu Lys Thr Pro Leu Asp Gln Ser Asn Glu Asp Ser Ala Ser465 470 475 480Lys Ser Thr Ser Asn Gln Glu Lys Ser Pro Met Ala Asp Thr Asn Tyr 485 490 495His Met Asn Pro Asn His Asn Gly Asp Leu Glu Gly Val Thr Gly Met 500 505 510Gln Gly Ser Pro Lys Arg Leu Arg Thr Ser Asp Glu Thr Met Met Gln 515 520 525Pro Ile Asn Ala Asp Phe Ser Ser Asn Glu Lys Leu Thr Met Lys Ile 530 535 540Leu Glu Glu Arg Gln Gly Ile Arg Ser Asp Gly Gly Tyr Pro Phe Met545 550 555 560Gly Asn Phe Gly Gln Tyr Gln Met Asp Glu Met Ser Arg Phe Asp Val 565 570 575Val Ser Asp Gln Glu Leu Met Ala Gln Arg Tyr Ser Gly Asn Asn Asn 580 585 590Gly Val Ser Leu Thr Leu Gly Leu Pro His Cys Asp Ser Leu Ser Ser 595 600 605Thr His His Gln Gly Phe Met Gln Thr His His Gly Ile Pro Ile Gly 610 615 620Arg Arg Val Lys Ile Gly Glu Thr Glu Glu Tyr Gly Pro Ala Thr Ile625 630 635 640Asn Gly Gly Ser Ser Thr Thr Thr Ala His Ser Ser Ala Ala Ala Ala 645 650 655Ala Ala Tyr Asn Gly Met Asn Ile Gln Asn Gln Lys Arg Tyr Val Ala 660 665 670Gln Leu Leu Pro Asp Phe Val Ala 675 6801551162DNAArabidopsis thalianaCDS(49)..(933) 155aaattcgctt ttttttttct tctttgtata tttttttttt ttttgacc atg gcg gag 57 Met Ala Glu 1gaa ttt gga agc ata gat tta ctc gga gat gaa gat ttc ttc ttc gat 105Glu Phe Gly Ser Ile Asp Leu Leu Gly Asp Glu Asp Phe Phe Phe Asp 5 10 15ttc gat cct tca atc gta att gat tct ctt ccg gcg gag gat ttt ctt 153Phe Asp Pro Ser Ile Val Ile Asp Ser Leu Pro Ala Glu Asp Phe Leu 20 25 30 35cag tct tca ccg gat tca tgg atc gga gaa atc gag aat caa ttg atg 201Gln Ser Ser Pro Asp Ser Trp Ile Gly Glu Ile Glu Asn Gln Leu Met 40 45 50aac gat gag aat cat caa gag gag agt ttt gtg gaa ttg gat cag caa 249Asn Asp Glu Asn His Gln Glu Glu Ser Phe Val Glu Leu Asp Gln Gln 55 60 65tcg gtt tca gat ttc ata gcg gat cta ctc gtt gat tat cca act agc 297Ser Val Ser Asp Phe Ile Ala Asp Leu Leu Val Asp Tyr Pro Thr Ser 70 75 80gat tct ggc tcc gtt gat ttg gcg gct gat aaa gtt cta acc gtc gat 345Asp Ser Gly Ser Val Asp Leu Ala Ala Asp Lys Val Leu Thr Val Asp 85 90 95tct ccc gcc gcc gct gat gat tcc ggg aag gag aat tcg gat ttg gtt 393Ser Pro Ala Ala Ala Asp Asp Ser Gly Lys Glu Asn Ser Asp Leu Val100 105 110 115gtt gag aag aag tct aat gat tct ggt agc gag att cat gat gat gat 441Val Glu Lys Lys Ser Asn Asp Ser Gly Ser Glu Ile His Asp Asp Asp 120 125 130gac gaa gaa gga gac gat gat gct gtg gct aaa aaa cga aga agg aga 489Asp Glu Glu Gly Asp Asp Asp Ala Val Ala Lys Lys Arg Arg Arg Arg 135 140 145gta aga aat aga gat gcg gcg gtt aga tcg aga gag agg aag aag gaa 537Val Arg Asn Arg Asp Ala Ala Val Arg Ser Arg Glu Arg Lys Lys Glu 150 155 160tat gta caa gat tta gag aag aag agt aag tat ctc gaa aga gaa tgc 585Tyr Val Gln Asp Leu Glu Lys Lys Ser Lys Tyr Leu Glu Arg Glu Cys 165 170 175ttg aga cta gga cgt atg ctt gag tgc ttc gtt gct gaa aac cag tct 633Leu Arg Leu Gly Arg Met Leu Glu Cys Phe Val Ala Glu Asn Gln Ser180 185 190 195cta cgt tac tgt ttg caa aag ggt aat ggc aat aat act acc atg atg 681Leu Arg Tyr Cys Leu Gln Lys Gly Asn Gly Asn Asn Thr Thr Met Met 200 205 210tcg aag cag gag tct gct gtg ctc ttg ttg gaa tcc ctg ctg ttg ggt 729Ser Lys Gln Glu Ser Ala Val Leu Leu Leu Glu Ser Leu Leu Leu Gly 215 220 225tcc ctg ctt tgg ctt ctg gga gta aac ttc att tgc cta ttc cct tat 777Ser Leu Leu Trp Leu Leu Gly Val Asn Phe Ile Cys Leu Phe Pro Tyr 230 235 240atg tcc cac aca aag tgt tgc ctc cta cgt cca gaa cca gaa aag ctg 825Met Ser His Thr Lys Cys Cys Leu Leu Arg Pro Glu Pro Glu Lys Leu 245 250 255gtt cta aac ggg ctc ggg agt agt agc aaa ccg tct tat acc ggc gtt 873Val Leu Asn Gly Leu Gly Ser Ser Ser Lys Pro Ser Tyr Thr Gly Val260 265 270 275agt cgg aga tgt aag ggt tcg agg cct agg atg aaa tac caa atc tta 921Ser Arg Arg Cys Lys Gly Ser Arg Pro Arg Met Lys Tyr Gln Ile Leu 280 285 290acc ctt gcg gcg tgacaacgcc ttttttaact gcttcttttg cgcattttga 973Thr Leu Ala Ala 295gttgtagatg agtgtctttt agttttctct ctcttgtttt gtatttcgct gttgaaagtt 1033ttctgtctaa tatcgataag ttaacagtga atgtgggtct tatggttatg gatgatatct 1093atctaataat gcttctgcct ttaaaatgtt gattttgagg cataacttca ggcaaaaaaa 1153aaaaagaaa 1162156295PRTArabidopsis thaliana 156Met Ala Glu Glu Phe Gly Ser Ile Asp Leu Leu Gly Asp Glu Asp Phe 1 5 10 15Phe Phe Asp Phe Asp Pro Ser Ile Val Ile Asp Ser Leu Pro Ala Glu 20 25 30Asp Phe Leu Gln Ser Ser Pro Asp Ser Trp Ile Gly Glu Ile Glu Asn 35 40 45Gln Leu Met Asn Asp Glu Asn His Gln Glu Glu Ser Phe Val Glu Leu 50 55 60Asp Gln Gln Ser Val Ser Asp Phe Ile Ala Asp Leu Leu Val Asp Tyr 65 70 75 80Pro Thr Ser Asp Ser Gly Ser Val Asp Leu Ala Ala Asp Lys Val Leu 85 90 95Thr Val Asp Ser Pro Ala Ala Ala Asp Asp Ser Gly Lys Glu Asn Ser 100 105 110Asp Leu Val Val Glu Lys Lys Ser Asn Asp Ser Gly Ser Glu Ile His 115 120 125Asp Asp Asp Asp Glu Glu Gly Asp Asp Asp Ala Val Ala Lys Lys Arg 130 135 140Arg Arg Arg Val Arg Asn Arg Asp Ala Ala Val Arg Ser Arg Glu Arg145 150 155 160Lys Lys Glu Tyr Val Gln Asp Leu Glu Lys Lys Ser Lys Tyr Leu Glu 165 170 175Arg Glu Cys Leu Arg Leu Gly Arg Met Leu Glu Cys Phe Val Ala Glu 180 185 190Asn Gln Ser Leu Arg Tyr Cys Leu Gln Lys Gly Asn Gly Asn Asn Thr 195 200 205Thr Met Met Ser Lys Gln Glu Ser Ala Val Leu Leu Leu Glu Ser Leu 210 215 220Leu Leu Gly Ser Leu Leu Trp Leu Leu Gly Val Asn Phe Ile Cys Leu225 230 235 240Phe Pro Tyr Met Ser His Thr Lys Cys Cys Leu Leu Arg Pro Glu Pro 245 250 255Glu Lys Leu Val Leu Asn Gly Leu Gly Ser Ser Ser Lys Pro Ser Tyr 260 265 270Thr Gly Val Ser Arg Arg Cys Lys Gly Ser Arg Pro Arg Met Lys Tyr 275 280 285Gln Ile Leu Thr Leu Ala Ala 290 295157438DNAArabidopsis thalianaCDS(1)..(435) 157atg gca aac gca gag aag aca agt tca ggt tcc gac ata gat gag aag 48Met Ala Asn Ala Glu Lys Thr Ser Ser Gly Ser Asp Ile Asp Glu Lys 1 5 10 15aaa aga aaa cgc aag tta tca aac cgc gaa tct gca agg agg tcg cgt 96Lys Arg Lys Arg Lys Leu Ser Asn Arg Glu Ser Ala Arg Arg Ser Arg 20 25 30ttg aag aaa cag aag tta atg gaa gac acg att cat gag atc tcc agt 144Leu Lys Lys Gln Lys Leu Met Glu Asp Thr Ile His Glu Ile Ser Ser 35 40 45ctt gaa cga cga atc aaa gag aac agt gag aga tgt cga gct gta aaa 192Leu Glu Arg Arg Ile Lys Glu Asn Ser Glu Arg Cys Arg Ala Val Lys 50 55 60cag agg ctt gac tcg gtc gaa acg gag aac gcg ggt ctt aga tcg gag 240Gln Arg Leu Asp Ser Val Glu Thr Glu Asn Ala Gly Leu Arg Ser Glu 65 70 75 80aag att tgg ctc tcg agt tac gtt agc gat tta gag aat atg att gct 288Lys Ile Trp Leu Ser Ser Tyr Val Ser Asp Leu Glu Asn Met Ile Ala 85 90 95acg acg agt tta acg ctg acg cag agt ggt ggt ggc gat tgt gtc gac 336Thr Thr Ser Leu Thr Leu Thr Gln Ser Gly Gly Gly Asp Cys Val Asp 100 105 110gat cag aac gca aac gcg gga ata gcg gtt gga gat tgt aga cgt aca 384Asp Gln Asn Ala Asn Ala Gly Ile Ala Val Gly Asp Cys Arg Arg Thr 115 120 125ccg tgg aaa ttg agt tgt ggt tct cta caa cca atg gcg tcc ttt aag 432Pro Trp Lys Leu Ser Cys Gly Ser Leu Gln Pro Met Ala Ser Phe Lys 130 135 140aca tga 438Thr145158145PRTArabidopsis thaliana 158Met Ala Asn Ala Glu Lys Thr Ser Ser Gly Ser Asp Ile Asp Glu Lys 1 5 10 15Lys Arg Lys Arg Lys Leu Ser Asn Arg Glu Ser Ala Arg Arg Ser Arg 20 25 30Leu Lys Lys Gln Lys Leu Met Glu Asp Thr Ile His Glu Ile Ser Ser 35 40 45Leu Glu Arg Arg Ile Lys Glu Asn Ser Glu Arg Cys Arg Ala Val Lys 50 55 60Gln Arg Leu Asp Ser Val Glu Thr Glu Asn Ala Gly Leu Arg Ser Glu 65 70 75 80Lys Ile Trp Leu Ser Ser Tyr Val Ser Asp Leu Glu Asn Met Ile Ala 85 90 95Thr Thr Ser Leu Thr Leu Thr Gln Ser Gly Gly Gly Asp Cys Val Asp 100 105 110Asp Gln Asn Ala Asn Ala Gly Ile Ala Val Gly Asp Cys Arg Arg Thr 115 120 125Pro Trp Lys Leu Ser Cys Gly Ser Leu Gln Pro Met Ala Ser Phe Lys 130 135 140Thr1451591697DNAArabidopsis thalianaCDS(74)..(1435) 159ccaaagaaaa

aaaataaatt cgaaggtaaa tatccagaag cttgatcctc ctagttgtac 60gaaagcttga gta atg ggg tct aga tta aac ttc aag agc ttt gtt gat 109 Met Gly Ser Arg Leu Asn Phe Lys Ser Phe Val Asp 1 5 10ggt gtg agt gag cag cag cca acg gtg ggg act agt ctt cca ttg act 157 Gly Val Ser Glu Gln Gln Pro Thr Val Gly Thr Ser Leu Pro Leu Thr 15 20 25agg cag aac tct gtg ttc tcg tta acc ttt gat gag ttt cag aac tca 205Arg Gln Asn Ser Val Phe Ser Leu Thr Phe Asp Glu Phe Gln Asn Ser 30 35 40tgg ggt ggt gga att ggg aaa gat ttt ggg tct atg aac atg gat gag 253Trp Gly Gly Gly Ile Gly Lys Asp Phe Gly Ser Met Asn Met Asp Glu 45 50 55 60ctc ttg aag aac att tgg act gca gag gaa agt cat tca atg atg gga 301Leu Leu Lys Asn Ile Trp Thr Ala Glu Glu Ser His Ser Met Met Gly 65 70 75aac aat acc agt tac acc aac atc agc aat ggt aat agt gga aac act 349Asn Asn Thr Ser Tyr Thr Asn Ile Ser Asn Gly Asn Ser Gly Asn Thr 80 85 90gtt att aac ggc ggt ggt aac aac att ggt ggg tta gct gtt ggt gtg 397Val Ile Asn Gly Gly Gly Asn Asn Ile Gly Gly Leu Ala Val Gly Val 95 100 105gga gga gaa agt ggt ggt ttt ttc act ggt ggg agt ttg cag aga caa 445Gly Gly Glu Ser Gly Gly Phe Phe Thr Gly Gly Ser Leu Gln Arg Gln 110 115 120ggt tca ctt acc ttg cct cgg acg att agt cag aaa agg gtt gat gat 493Gly Ser Leu Thr Leu Pro Arg Thr Ile Ser Gln Lys Arg Val Asp Asp125 130 135 140gtc tgg aag gag ctg atg aag gag gat gac att gga aat ggt gtt gtt 541Val Trp Lys Glu Leu Met Lys Glu Asp Asp Ile Gly Asn Gly Val Val 145 150 155aat ggt ggg aca agc gga att ccg cag agg caa caa acg ctg gga gag 589Asn Gly Gly Thr Ser Gly Ile Pro Gln Arg Gln Gln Thr Leu Gly Glu 160 165 170atg act ttg gag gag ttt ttg gtc agg gct ggt gtg gtt agg gaa gaa 637Met Thr Leu Glu Glu Phe Leu Val Arg Ala Gly Val Val Arg Glu Glu 175 180 185cct caa ccg gtg gag agt gta act aac ttc aat ggc gga ttc tat gga 685Pro Gln Pro Val Glu Ser Val Thr Asn Phe Asn Gly Gly Phe Tyr Gly 190 195 200ttt ggc agt aat gga ggt ctt ggg aca gct agt aat ggg ttt gtt gca 733Phe Gly Ser Asn Gly Gly Leu Gly Thr Ala Ser Asn Gly Phe Val Ala205 210 215 220aac caa cct caa gat ttg tca gga aat gga gta gcg gtg aga cag gat 781Asn Gln Pro Gln Asp Leu Ser Gly Asn Gly Val Ala Val Arg Gln Asp 225 230 235ctg ctg act gct caa act cag cca cta cag atg cag cag cca cag atg 829Leu Leu Thr Ala Gln Thr Gln Pro Leu Gln Met Gln Gln Pro Gln Met 240 245 250gtg cag cag cca cag atg gtg cag cag ccg caa caa ctg ata cag acg 877Val Gln Gln Pro Gln Met Val Gln Gln Pro Gln Gln Leu Ile Gln Thr 255 260 265cag gag agg cct ttt ccc aaa cag acc act ata gca ttt tcc aac act 925Gln Glu Arg Pro Phe Pro Lys Gln Thr Thr Ile Ala Phe Ser Asn Thr 270 275 280gtt gat gtg gtt aac cgt tct caa cct gca aca cag tgc cag gaa gtg 973Val Asp Val Val Asn Arg Ser Gln Pro Ala Thr Gln Cys Gln Glu Val285 290 295 300aag cct tca ata ctt gga att cat aac cat cct atg aac aac aat cta 1021Lys Pro Ser Ile Leu Gly Ile His Asn His Pro Met Asn Asn Asn Leu 305 310 315ctg caa gct gtc gat ttt aaa aca gga gta acg gtt gca gca gta tct 1069Leu Gln Ala Val Asp Phe Lys Thr Gly Val Thr Val Ala Ala Val Ser 320 325 330cct gga agc cag atg tca cct gat ctg act cca aag agc gcc ctg gat 1117Pro Gly Ser Gln Met Ser Pro Asp Leu Thr Pro Lys Ser Ala Leu Asp 335 340 345gca tct ttg tcc cct gtt cct tac atg ttt ggg cga gtg aga aaa aca 1165Ala Ser Leu Ser Pro Val Pro Tyr Met Phe Gly Arg Val Arg Lys Thr 350 355 360ggt gca gtt ctg gag aaa gtg att gag aga agg caa aaa agg atg ata 1213Gly Ala Val Leu Glu Lys Val Ile Glu Arg Arg Gln Lys Arg Met Ile365 370 375 380aag aat agg gaa tca gct gca aga tcc cgc gct cgc aag caa gct tat 1261Lys Asn Arg Glu Ser Ala Ala Arg Ser Arg Ala Arg Lys Gln Ala Tyr 385 390 395acg atg gaa ctg gaa gca gaa att gcg caa ctc aaa gaa ttg aat gaa 1309Thr Met Glu Leu Glu Ala Glu Ile Ala Gln Leu Lys Glu Leu Asn Glu 400 405 410gag ttg cag aag aaa caa gtt gaa atc atg gaa aag cag aaa aat cag 1357Glu Leu Gln Lys Lys Gln Val Glu Ile Met Glu Lys Gln Lys Asn Gln 415 420 425ctt ctg gag cct ctg cgc cag cca tgg gga atg gga tgc aaa agg caa 1405Leu Leu Glu Pro Leu Arg Gln Pro Trp Gly Met Gly Cys Lys Arg Gln 430 435 440tgc ttg cga agg aca ttg acg ggt ccc tgg tagagcttat aatggcgtct 1455Cys Leu Arg Arg Thr Leu Thr Gly Pro Trp445 450aaggaaccca acaaagcgcc gaagttatag aacaactcag aagatagaaa gctagctttg 1515tacgtagttt aggcaggttc tgtgggtgat tgtaaatctt gaagtgtggc ggatttgaca 1575gagatagata aacacatatc tgttctattt tcctaaatct tttggtttta tcttcctgat 1635gtaatggatc tttatcattt gtcttgaaca tctttgtgac ttaaccagag tgaatttatc 1695tt 1697160454PRTArabidopsis thaliana 160Met Gly Ser Arg Leu Asn Phe Lys Ser Phe Val Asp Gly Val Ser Glu 1 5 10 15Gln Gln Pro Thr Val Gly Thr Ser Leu Pro Leu Thr Arg Gln Asn Ser 20 25 30Val Phe Ser Leu Thr Phe Asp Glu Phe Gln Asn Ser Trp Gly Gly Gly 35 40 45Ile Gly Lys Asp Phe Gly Ser Met Asn Met Asp Glu Leu Leu Lys Asn 50 55 60Ile Trp Thr Ala Glu Glu Ser His Ser Met Met Gly Asn Asn Thr Ser 65 70 75 80Tyr Thr Asn Ile Ser Asn Gly Asn Ser Gly Asn Thr Val Ile Asn Gly 85 90 95Gly Gly Asn Asn Ile Gly Gly Leu Ala Val Gly Val Gly Gly Glu Ser 100 105 110Gly Gly Phe Phe Thr Gly Gly Ser Leu Gln Arg Gln Gly Ser Leu Thr 115 120 125Leu Pro Arg Thr Ile Ser Gln Lys Arg Val Asp Asp Val Trp Lys Glu 130 135 140Leu Met Lys Glu Asp Asp Ile Gly Asn Gly Val Val Asn Gly Gly Thr145 150 155 160Ser Gly Ile Pro Gln Arg Gln Gln Thr Leu Gly Glu Met Thr Leu Glu 165 170 175Glu Phe Leu Val Arg Ala Gly Val Val Arg Glu Glu Pro Gln Pro Val 180 185 190Glu Ser Val Thr Asn Phe Asn Gly Gly Phe Tyr Gly Phe Gly Ser Asn 195 200 205Gly Gly Leu Gly Thr Ala Ser Asn Gly Phe Val Ala Asn Gln Pro Gln 210 215 220Asp Leu Ser Gly Asn Gly Val Ala Val Arg Gln Asp Leu Leu Thr Ala225 230 235 240Gln Thr Gln Pro Leu Gln Met Gln Gln Pro Gln Met Val Gln Gln Pro 245 250 255Gln Met Val Gln Gln Pro Gln Gln Leu Ile Gln Thr Gln Glu Arg Pro 260 265 270Phe Pro Lys Gln Thr Thr Ile Ala Phe Ser Asn Thr Val Asp Val Val 275 280 285Asn Arg Ser Gln Pro Ala Thr Gln Cys Gln Glu Val Lys Pro Ser Ile 290 295 300Leu Gly Ile His Asn His Pro Met Asn Asn Asn Leu Leu Gln Ala Val305 310 315 320Asp Phe Lys Thr Gly Val Thr Val Ala Ala Val Ser Pro Gly Ser Gln 325 330 335Met Ser Pro Asp Leu Thr Pro Lys Ser Ala Leu Asp Ala Ser Leu Ser 340 345 350Pro Val Pro Tyr Met Phe Gly Arg Val Arg Lys Thr Gly Ala Val Leu 355 360 365Glu Lys Val Ile Glu Arg Arg Gln Lys Arg Met Ile Lys Asn Arg Glu 370 375 380Ser Ala Ala Arg Ser Arg Ala Arg Lys Gln Ala Tyr Thr Met Glu Leu385 390 395 400Glu Ala Glu Ile Ala Gln Leu Lys Glu Leu Asn Glu Glu Leu Gln Lys 405 410 415Lys Gln Val Glu Ile Met Glu Lys Gln Lys Asn Gln Leu Leu Glu Pro 420 425 430Leu Arg Gln Pro Trp Gly Met Gly Cys Lys Arg Gln Cys Leu Arg Arg 435 440 445Thr Leu Thr Gly Pro Trp 4501611717DNAArabidopsis thalianaCDS(234)..(1379) 161aaaaaaaaaa aaaactgaac tctttttcgc tctggttttt ttagagagag agaaagatga 60aaatgcgttt aattgctgtt taggtttcga attcgcgatt taaatttctg ggtttctctc 120tgtttaagct tcttcttctt catcttctgc ttacgtttct tcttcaagga gctttcggat 180tcttgtagaa agagtcattg ttctcttgag tgggaaacct tgaaaccatt cct atg 236 Met 1gga aat agc agc gag gaa cca aag cct cct acc aaa tca gat aaa cca 284Gly Asn Ser Ser Glu Glu Pro Lys Pro Pro Thr Lys Ser Asp Lys Pro 5 10 15tct tca ccc ccg gtg gat caa aca aat gtt cat gtc tac cct gat tgg 332Ser Ser Pro Pro Val Asp Gln Thr Asn Val His Val Tyr Pro Asp Trp 20 25 30gca gct atg cag gca tat tat ggt cca aga gta gca atg cct cct tat 380Ala Ala Met Gln Ala Tyr Tyr Gly Pro Arg Val Ala Met Pro Pro Tyr 35 40 45tac aat tca gct atg gct gca tct ggt cat cct cct cct cct tac atg 428Tyr Asn Ser Ala Met Ala Ala Ser Gly His Pro Pro Pro Pro Tyr Met 50 55 60 65tgg aat cct cag cat atg atg tca cca tat gga gca ccc tat gct gct 476Trp Asn Pro Gln His Met Met Ser Pro Tyr Gly Ala Pro Tyr Ala Ala 70 75 80gtt tat cct cat gga gga gga gtt tac gct cat ccc ggt att ccc atg 524Val Tyr Pro His Gly Gly Gly Val Tyr Ala His Pro Gly Ile Pro Met 85 90 95gga tca ctg cct caa ggt caa aag gat cca cct tta aca act ccg ggg 572Gly Ser Leu Pro Gln Gly Gln Lys Asp Pro Pro Leu Thr Thr Pro Gly 100 105 110acg ctt ttg agc atc gac act cct act aaa tct aca ggg aac aca gac 620Thr Leu Leu Ser Ile Asp Thr Pro Thr Lys Ser Thr Gly Asn Thr Asp 115 120 125aat gga ttg atg aag aag ctg aaa gag ttt gat ggg ctt gct atg tct 668Asn Gly Leu Met Lys Lys Leu Lys Glu Phe Asp Gly Leu Ala Met Ser130 135 140 145cta gga aat ggg aat cct gaa aat ggt gca gat gaa cat aaa cga tca 716Leu Gly Asn Gly Asn Pro Glu Asn Gly Ala Asp Glu His Lys Arg Ser 150 155 160cgg aac agc tca gaa act gat ggt tct act gat gga agt gat ggg aat 764Arg Asn Ser Ser Glu Thr Asp Gly Ser Thr Asp Gly Ser Asp Gly Asn 165 170 175aca act ggg gca gat gaa ccg aaa ctt aaa aga agt cga gag gga act 812Thr Thr Gly Ala Asp Glu Pro Lys Leu Lys Arg Ser Arg Glu Gly Thr 180 185 190cca aca aaa gat ggg aaa caa ttg gtt caa gct agc tca ttt cat tct 860Pro Thr Lys Asp Gly Lys Gln Leu Val Gln Ala Ser Ser Phe His Ser 195 200 205gtt tct ccg tca agt ggt gat acc ggc gta aaa ctc att caa gga tct 908Val Ser Pro Ser Ser Gly Asp Thr Gly Val Lys Leu Ile Gln Gly Ser210 215 220 225gga gct ata ctc tct cct ggt gta agt gca aat tcc aac ccc ttc atg 956Gly Ala Ile Leu Ser Pro Gly Val Ser Ala Asn Ser Asn Pro Phe Met 230 235 240tca caa tct tta gcc atg gtt cct cct gaa act tgg ctt cag aac gag 1004Ser Gln Ser Leu Ala Met Val Pro Pro Glu Thr Trp Leu Gln Asn Glu 245 250 255aga gaa ctg aaa cgg gag cga agg aaa cag tct aat aga gaa tct gct 1052Arg Glu Leu Lys Arg Glu Arg Arg Lys Gln Ser Asn Arg Glu Ser Ala 260 265 270aga agg tca aga tta agg aaa cag gcc gag aca gaa gaa ctt gct agg 1100Arg Arg Ser Arg Leu Arg Lys Gln Ala Glu Thr Glu Glu Leu Ala Arg 275 280 285aaa gtg gaa gcc ttg aca gcc gaa aac atg gca tta aga tct gaa cta 1148Lys Val Glu Ala Leu Thr Ala Glu Asn Met Ala Leu Arg Ser Glu Leu290 295 300 305aac caa ctt aat gag aaa tct gat aaa cta aga gga gca aat gca acc 1196Asn Gln Leu Asn Glu Lys Ser Asp Lys Leu Arg Gly Ala Asn Ala Thr 310 315 320ttg ttg gac aaa ctg aaa tgc tcg gaa ccc gaa aag aga gtc ccc gca 1244Leu Leu Asp Lys Leu Lys Cys Ser Glu Pro Glu Lys Arg Val Pro Ala 325 330 335aat atg ttg tct aga gtt aag aac tca gga gct gga gat aag aac aag 1292Asn Met Leu Ser Arg Val Lys Asn Ser Gly Ala Gly Asp Lys Asn Lys 340 345 350aac caa gga gac aat gat tct aac tct aca agc aaa ttg cat caa ctg 1340Asn Gln Gly Asp Asn Asp Ser Asn Ser Thr Ser Lys Leu His Gln Leu 355 360 365ctc gat acg aag cct cga gct aaa gca gta gct gca ggc tgatcgatgg 1389Leu Asp Thr Lys Pro Arg Ala Lys Ala Val Ala Ala Gly370 375 380taattcatgt cgatttctac ttaatttgtc gacataaaca aagaaaataa gtgctactaa 1449tttcagaaaa acttgataga tagtatagta gagagagaga gagagagaga ggtgtgatga 1509ttattgatct ataaattttc ggagagagag agggagaaag agaaactttt cctccagatg 1569aaaatttggt gttatggttt gttactgtta atatagagag gcttttcttt ttttataaaa 1629tggcttcctt tgttgcattt ccttgtttta gacctgatgt aattttatga aatcggtgtt 1689attgctttgc gtaaaaaaaa aaaaaaaa 1717162382PRTArabidopsis thaliana 162Met Gly Asn Ser Ser Glu Glu Pro Lys Pro Pro Thr Lys Ser Asp Lys 1 5 10 15Pro Ser Ser Pro Pro Val Asp Gln Thr Asn Val His Val Tyr Pro Asp 20 25 30Trp Ala Ala Met Gln Ala Tyr Tyr Gly Pro Arg Val Ala Met Pro Pro 35 40 45Tyr Tyr Asn Ser Ala Met Ala Ala Ser Gly His Pro Pro Pro Pro Tyr 50 55 60Met Trp Asn Pro Gln His Met Met Ser Pro Tyr Gly Ala Pro Tyr Ala 65 70 75 80Ala Val Tyr Pro His Gly Gly Gly Val Tyr Ala His Pro Gly Ile Pro 85 90 95Met Gly Ser Leu Pro Gln Gly Gln Lys Asp Pro Pro Leu Thr Thr Pro 100 105 110Gly Thr Leu Leu Ser Ile Asp Thr Pro Thr Lys Ser Thr Gly Asn Thr 115 120 125Asp Asn Gly Leu Met Lys Lys Leu Lys Glu Phe Asp Gly Leu Ala Met 130 135 140Ser Leu Gly Asn Gly Asn Pro Glu Asn Gly Ala Asp Glu His Lys Arg145 150 155 160Ser Arg Asn Ser Ser Glu Thr Asp Gly Ser Thr Asp Gly Ser Asp Gly 165 170 175Asn Thr Thr Gly Ala Asp Glu Pro Lys Leu Lys Arg Ser Arg Glu Gly 180 185 190Thr Pro Thr Lys Asp Gly Lys Gln Leu Val Gln Ala Ser Ser Phe His 195 200 205Ser Val Ser Pro Ser Ser Gly Asp Thr Gly Val Lys Leu Ile Gln Gly 210 215 220Ser Gly Ala Ile Leu Ser Pro Gly Val Ser Ala Asn Ser Asn Pro Phe225 230 235 240Met Ser Gln Ser Leu Ala Met Val Pro Pro Glu Thr Trp Leu Gln Asn 245 250 255Glu Arg Glu Leu Lys Arg Glu Arg Arg Lys Gln Ser Asn Arg Glu Ser 260 265 270Ala Arg Arg Ser Arg Leu Arg Lys Gln Ala Glu Thr Glu Glu Leu Ala 275 280 285Arg Lys Val Glu Ala Leu Thr Ala Glu Asn Met Ala Leu Arg Ser Glu 290 295 300Leu Asn Gln Leu Asn Glu Lys Ser Asp Lys Leu Arg Gly Ala Asn Ala305 310 315 320Thr Leu Leu Asp Lys Leu Lys Cys Ser Glu Pro Glu Lys Arg Val Pro 325 330 335Ala Asn Met Leu Ser Arg Val Lys Asn Ser Gly Ala Gly Asp Lys Asn 340 345 350Lys Asn Gln Gly Asp Asn Asp Ser Asn Ser Thr Ser Lys Leu His Gln 355 360 365Leu Leu Asp Thr Lys Pro Arg Ala Lys Ala Val Ala Ala Gly 370 375 3801631554DNAArabidopsis thalianaCDS(162)..(1379) 163acaaaatatc tctccctcta tctgcaaatt ttccaaagtt gcatcctttc aatttccact 60cctctctaat ataattcaca ttttcccact attgctgatt catttttttt tgtgaattat 120ttcaaaccca cataaaaaaa tctttgttta aatttaaaac c atg gat cct tca ttt 176 Met Asp Pro Ser Phe 1 5agg ttc att aaa gag gag ttt cct gct gga ttc agt gat tct cca tca 224Arg Phe Ile Lys Glu Glu Phe Pro Ala Gly Phe Ser Asp Ser Pro Ser 10 15 20cca cca tct tct tct tca tac ctt tat tca tct tcc atg gct gaa gca 272Pro Pro Ser Ser Ser Ser Tyr Leu Tyr Ser Ser Ser Met Ala Glu Ala 25 30

35gcc ata aat gat cca aca aca ttg agc tat cca caa cca tta gaa ggt 320Ala Ile Asn Asp Pro Thr Thr Leu Ser Tyr Pro Gln Pro Leu Glu Gly 40 45 50ctc cat gaa tca ggg cca cct cca ttt ttg aca aag aca tat gac ttg 368Leu His Glu Ser Gly Pro Pro Pro Phe Leu Thr Lys Thr Tyr Asp Leu 55 60 65gtg gaa gat tca aga acc aat cat gtc gtg tct tgg agc aaa tcc aat 416Val Glu Asp Ser Arg Thr Asn His Val Val Ser Trp Ser Lys Ser Asn 70 75 80 85aac agc ttc att gtc tgg gat cca cag gcc ttt tct gta act ctc ctt 464Asn Ser Phe Ile Val Trp Asp Pro Gln Ala Phe Ser Val Thr Leu Leu 90 95 100ccc aga ttc ttc aag cac aat aac ttc tcc agt ttt gtc cgc cag ctc 512Pro Arg Phe Phe Lys His Asn Asn Phe Ser Ser Phe Val Arg Gln Leu 105 110 115aac aca tat ggt ttc aga aag gtg aat ccg gat cgg tgg gag ttt gca 560Asn Thr Tyr Gly Phe Arg Lys Val Asn Pro Asp Arg Trp Glu Phe Ala 120 125 130aac gaa ggg ttt ctt aga ggg caa aag cat ctc ctc aag aac ata agg 608Asn Glu Gly Phe Leu Arg Gly Gln Lys His Leu Leu Lys Asn Ile Arg 135 140 145aga aga aaa aca agt aat aat agt aat caa atg caa caa cct caa agt 656Arg Arg Lys Thr Ser Asn Asn Ser Asn Gln Met Gln Gln Pro Gln Ser150 155 160 165tct gaa caa caa tct cta gac aat ttt tgc ata gaa gtg ggt agg tac 704Ser Glu Gln Gln Ser Leu Asp Asn Phe Cys Ile Glu Val Gly Arg Tyr 170 175 180ggt cta gat gga gag atg gac agc cta agg cga gac aag caa gtg ttg 752Gly Leu Asp Gly Glu Met Asp Ser Leu Arg Arg Asp Lys Gln Val Leu 185 190 195atg atg gag cta gtg aga cta aga cag caa caa caa agc acc aaa atg 800Met Met Glu Leu Val Arg Leu Arg Gln Gln Gln Gln Ser Thr Lys Met 200 205 210tat ctc aca ttg att gaa gag aag ctc aag aag acc gag tca aaa caa 848Tyr Leu Thr Leu Ile Glu Glu Lys Leu Lys Lys Thr Glu Ser Lys Gln 215 220 225aaa caa atg atg agc ttc ctt gcc cgc gca atg cag aat cca gat ttt 896Lys Gln Met Met Ser Phe Leu Ala Arg Ala Met Gln Asn Pro Asp Phe230 235 240 245att cag cag cta gta gag cag aag gaa aag agg aaa gag atc gaa gag 944Ile Gln Gln Leu Val Glu Gln Lys Glu Lys Arg Lys Glu Ile Glu Glu 250 255 260gcg atc agc aag aag aga caa aga ccg atc gat caa gga aaa aga aat 992Ala Ile Ser Lys Lys Arg Gln Arg Pro Ile Asp Gln Gly Lys Arg Asn 265 270 275gtg gaa gat tat ggt gat gaa agt ggt tat ggg aat gat gtt gca gcc 1040Val Glu Asp Tyr Gly Asp Glu Ser Gly Tyr Gly Asn Asp Val Ala Ala 280 285 290tca tcc tca gca ttg att ggt atg agt cag gaa tat aca tat gga aac 1088Ser Ser Ser Ala Leu Ile Gly Met Ser Gln Glu Tyr Thr Tyr Gly Asn 295 300 305atg tct gaa ttc gag atg tcg gag ttg gac aaa ctt gct atg cac att 1136Met Ser Glu Phe Glu Met Ser Glu Leu Asp Lys Leu Ala Met His Ile310 315 320 325caa gga ctt gga gat aat tcc agt gct agg gaa gaa gtc ttg aat gtg 1184Gln Gly Leu Gly Asp Asn Ser Ser Ala Arg Glu Glu Val Leu Asn Val 330 335 340gaa aaa gga aat gat gag gaa gaa gta gaa gat caa caa caa ggg tac 1232Glu Lys Gly Asn Asp Glu Glu Glu Val Glu Asp Gln Gln Gln Gly Tyr 345 350 355cat aag gag aac aat gag att tat ggt gaa ggt ttt tgg gaa gat ttg 1280His Lys Glu Asn Asn Glu Ile Tyr Gly Glu Gly Phe Trp Glu Asp Leu 360 365 370tta aat gaa ggt caa aat ttt gat ttt gaa gga gat caa gaa aat gtt 1328Leu Asn Glu Gly Gln Asn Phe Asp Phe Glu Gly Asp Gln Glu Asn Val 375 380 385gat gtg tta att cag caa ctt ggt tat ttg ggt tct agt tca cac act 1376Asp Val Leu Ile Gln Gln Leu Gly Tyr Leu Gly Ser Ser Ser His Thr390 395 400 405aat taagaagaaa ttgaaatgat gactacttta agcatttgaa tcaacttgtt 1429Asntcctattagt aatttggctt tgtttcaatc aagtgagtcg tggactaact tattgaattt 1489gggggttaaa tccgtttctt atttttggaa ataaaattgc tttttgttta aaaaaaaaaa 1549aaaaa 1554164406PRTArabidopsis thaliana 164Met Asp Pro Ser Phe Arg Phe Ile Lys Glu Glu Phe Pro Ala Gly Phe 1 5 10 15Ser Asp Ser Pro Ser Pro Pro Ser Ser Ser Ser Tyr Leu Tyr Ser Ser 20 25 30Ser Met Ala Glu Ala Ala Ile Asn Asp Pro Thr Thr Leu Ser Tyr Pro 35 40 45Gln Pro Leu Glu Gly Leu His Glu Ser Gly Pro Pro Pro Phe Leu Thr 50 55 60Lys Thr Tyr Asp Leu Val Glu Asp Ser Arg Thr Asn His Val Val Ser 65 70 75 80Trp Ser Lys Ser Asn Asn Ser Phe Ile Val Trp Asp Pro Gln Ala Phe 85 90 95Ser Val Thr Leu Leu Pro Arg Phe Phe Lys His Asn Asn Phe Ser Ser 100 105 110Phe Val Arg Gln Leu Asn Thr Tyr Gly Phe Arg Lys Val Asn Pro Asp 115 120 125Arg Trp Glu Phe Ala Asn Glu Gly Phe Leu Arg Gly Gln Lys His Leu 130 135 140Leu Lys Asn Ile Arg Arg Arg Lys Thr Ser Asn Asn Ser Asn Gln Met145 150 155 160Gln Gln Pro Gln Ser Ser Glu Gln Gln Ser Leu Asp Asn Phe Cys Ile 165 170 175Glu Val Gly Arg Tyr Gly Leu Asp Gly Glu Met Asp Ser Leu Arg Arg 180 185 190Asp Lys Gln Val Leu Met Met Glu Leu Val Arg Leu Arg Gln Gln Gln 195 200 205Gln Ser Thr Lys Met Tyr Leu Thr Leu Ile Glu Glu Lys Leu Lys Lys 210 215 220Thr Glu Ser Lys Gln Lys Gln Met Met Ser Phe Leu Ala Arg Ala Met225 230 235 240Gln Asn Pro Asp Phe Ile Gln Gln Leu Val Glu Gln Lys Glu Lys Arg 245 250 255Lys Glu Ile Glu Glu Ala Ile Ser Lys Lys Arg Gln Arg Pro Ile Asp 260 265 270Gln Gly Lys Arg Asn Val Glu Asp Tyr Gly Asp Glu Ser Gly Tyr Gly 275 280 285Asn Asp Val Ala Ala Ser Ser Ser Ala Leu Ile Gly Met Ser Gln Glu 290 295 300Tyr Thr Tyr Gly Asn Met Ser Glu Phe Glu Met Ser Glu Leu Asp Lys305 310 315 320Leu Ala Met His Ile Gln Gly Leu Gly Asp Asn Ser Ser Ala Arg Glu 325 330 335Glu Val Leu Asn Val Glu Lys Gly Asn Asp Glu Glu Glu Val Glu Asp 340 345 350Gln Gln Gln Gly Tyr His Lys Glu Asn Asn Glu Ile Tyr Gly Glu Gly 355 360 365Phe Trp Glu Asp Leu Leu Asn Glu Gly Gln Asn Phe Asp Phe Glu Gly 370 375 380Asp Gln Glu Asn Val Asp Val Leu Ile Gln Gln Leu Gly Tyr Leu Gly385 390 395 400Ser Ser Ser His Thr Asn 40516523DNAArtificial SequenceDescription of Artificial Sequenceprimer 165cgccagggtt ttcccagtca cga 2316624DNAArtificial SequenceDescription of Artificial Sequenceprimer 166agcggataac aatttcacac agga 24


Patent applications by Kazuo Shinozaki, Ibaraki JP

Patent applications by Miki Fujita, Ibaraki JP

Patent applications by Motoaki Seki, Kanagawa JP

Patent applications in class The polynucleotide confers resistance to heat or cold (e.g., chilling, etc.)

Patent applications in all subclasses The polynucleotide confers resistance to heat or cold (e.g., chilling, etc.)


User Contributions:

Comment about this patent or add new information about this topic:

CAPTCHA