Patent application title: Environmental stress-responsive promoter and a gene encoding environmental stress-responsive transcriptional factor
Inventors:
Kazuo Shinozaki (Ibaraki, JP)
Motoaki Seki (Kanagawa, JP)
Miki Fujita (Ibaraki, JP)
IPC8 Class: AC12N1529FI
USPC Class:
800289
Class name: Multicellular living organisms and unmodified parts thereof and related processes method of introducing a polynucleotide molecule into or rearrangement of genetic material within a plant or plant part the polynucleotide confers resistance to heat or cold (e.g., chilling, etc.)
Publication date: 2008-09-04
Patent application number: 20080216196
Inventors list |
Agents list |
Assignees list |
List by place |
Classification tree browser |
Top 100 Inventors |
Top 100 Agents |
Top 100 Assignees |
Usenet FAQ Index |
Documents |
Other FAQs |
Patent application title: Environmental stress-responsive promoter and a gene encoding environmental stress-responsive transcriptional factor
Inventors:
Kazuo Shinozaki
Motoaki Seki
Miki Fujita
Agents:
BIRCH STEWART KOLASCH & BIRCH
Assignees:
Origin: FALLS CHURCH, VA US
IPC8 Class: AC12N1529FI
USPC Class:
800289
Abstract:
The present invention provides a method of regulating expression of a gene
by (a) preparing a recombinant plant cell line, plant tissue or plant
with an expression vector having an abiotic environmental
stress-responsive promoter of SEQ ID NO: 27 and the gene downstream
thereof; and (b) culturing and cultivating the recombinant plant cell,
plant tissue or plant under an abiotic environmental stress, wherein the
promoter regulates the expression of the gene under the abiotic
environmental stress.Claims:
1.-4. (canceled)
5. A method of regulating expression of a gene which comprises:(a) preparing a recombinant plant cell line, plant tissue or plant comprising an expression vector having an abiotic environmental stress-responsive promoter comprising SEQ ID NO: 63 which is operably linked to the coding sequence the gene; and(b) culturing and cultivating the recombinant plant cell, plant tissue or plant under an abiotic environmental stress, wherein the promoter regulates the expression of the gene under the abiotic environmental stress.
6. The method according to claim 5, wherein the abiotic environmental stress is cold stress, drought stress or salt stress.
7. The method according to claim 5, wherein the gene encodes a polypeptide that can confer increased environmental stress resistance compared to a plant cell line, plant tissue or plant lacking the expression vector.
8. The method according to claim 5, wherein the gene is a plant gene.
Description:
[0001]This application is a Divisional of co-pending application Ser. No.
10/470,154 filed on Sep. 5, 2006, that is a divisional of co-pending
application Ser. No. 10/495,918 filed on May 18, 2004, and for which
priority is claimed under 35 U.S.C. 120; and this application claims
priority of International Application No. PCT/JP02/11955 filed on Nov. 5,
2002 under 35 U.S.C. 119; the entire contents of all are hereby
incorporated by reference.
TECHNICAL FIELD
[0002]The present invention relates to an environmental stress-responsive promoter and a gene encoding environmental stress-responsive transcriptional factor.
BACKGROUND ART
[0003]Large quantities of genomic and cDNA sequences have been determined with respect to a number of organisms by gene sequencing projects. In a plant model, Arabidopsis thaliana, the complete genomic sequences of two chromosomes have been determined (Lin, X. et al., (1999), Nature 402, 761-768; and Mayer, K. et al., (1999), Nature 402, 769-777).
[0004]The expressed sequence tag (EST) project also has greatly contributed to the discovery of expression genes (Hofte, H. et al., (1993), Plant J. 4, 1051-1061; Newman, T. et al., (1994), Plant Physiol. 106, 1241-1255; and Cooke, R. et al., (1996), Plant J. 9, 101-124; and Asamizu, E. et al., (2000), DNA Res. 7, 175-180). For example, the database of EST (dbEST) of the National Center for Biotechnology Information (NCBI) includes partial cDNA sequences, in which more than half (about 28,000 genes) of the total genes are reproduced, (as estimated from the gene content of Arabidopsis thaliana chromosome 2 completely sequenced [Lin, X. et al., (1999), Nature 402, 761-768]).
[0005]Recently, microarray (DNA chip) technology has become a useful tool for analyzing genome-scale gene expression (Schena, M. et al., (1995), Science 270, 467-470; Eisen, M. B. and Brown, P. O. (1999), Methods Enzymol. 303, 179-205). In the technology using a DNA chip, cDNA sequences are arrayed on a slide glass in a density of not smaller than 1,000 genes/cm2. The cDNA sequences thus arrayed are hybridized simultaneously with a pair of cDNA probes tagged with two color fluorescent labels, which have been prepared from RNA samples of different types of cells or tissues. In this manner, a large amount of genes can be directly analyzed and compared for gene expression. This technology was demonstrated for the first time by analyzing 48 Arabidopsis genes for differential expression in root and shoots (Schena, M. et al., (1995), Science 270, 467-470). Furthermore, a microarray was used in investigating 1,000 clones randomly taken from a human cDNA library in order to identify a novel gene responsive to heat shock and protein kinase C activation (Schena, M. et al., (1996), Proc. Natl. Acad. Sci. USA, 93, 10614-10619).
[0006]In another method, a DNA chip is used in analyzing the expression profile of an inflammatory-disease associated gene under various induction conditions (Heller, R. A. et al., (1997), Proc. Natl. Acad. Sci. USA, 94, 2150-2155). Furthermore, using a microarray, a yeast genome having more than 6,000 coding sequences has been analyzed for dynamic expression (DeRisi, J. L. et al., (1997) Science 278, 680-686; and Wodicka, L. et al., (1997), Nature Biotechnol. 15, 1359-1367).
[0007]However, in the field of plant science, only a few reports have been made on microarray analysis (Schena, M. et al., (1995), Science 270, 467-470; Ruan, Y. et al., (1998), Plant J. 15, 821-833; Aharoni. A. et al., (2000), Plant Cell 12, 647-661; and Reymond, P. et al., (2000), Plant Cell 12, 707-719).
[0008]The growth of plants is significantly affected by environmental stresses such as drought, high salinity and low temperature. Of the stresses, drought or water deficiency is the most critical factor that limits growth of plants and production of crops. Such a drought stress causes various biochemical and physiological responses in plants.
[0009]To survive under these conditions of stress, plants acquire responsivity and adaptability to the stresses. Recently, several types of genes responsive to drought at a transcriptional level have been reported (Bohnert, H. J. et al., (1995), Plant Cell 7, 1099-1111; Ingram, J., and Bartels, D. (1996), Plant Mol. Biol. 47, 377-403; Bray, E. A. (1997), Trends Plant Sci. 2, 48-54; Shinozaki, K., and Yamaguchi-Shinozaki, K. (1997), Plant Physiol. 115, 327-334; Shinozaki, K., and Yamaguchi-Shinozaki, K. (1999), "Molecular responses to drought stress. Molecular responses to cold, drought, heat and salt stress in higher plants", edited by Shinozaki, K. and Yamaguchi-Shinozaki, K. R. G. Landes Company; and Shinozaki, K., and Yamaguchi-Shinozaki, K. (2000), Curr. Opin. Plant Biol. 3, 217-223).
[0010]On the other hand, in an attempt to improve stress resistance of plants by introducing a gene, stress-inducible genes have been used (Holmberg, N., and Bulow, L. (1998), Trends Plant Sci. 3, 61-66; and Bajaj, S. et al., (1999), Mol. Breed. 5, 493-503). Not only to further clarify the mechanism of stress resistance and stress responsivity of a higher plant at a molecular level but also to improve the stress resistance of a crop by gene manipulation, it is important to analyze the function of a stress-inducible gene.
[0011]Dehydration responsive element and C-repeat sequence (DRE/CRT) has been identified as an important cis-acting element when drought, high salt and cold stress-responsive genes are expressed in an ABA independent manner, where ABA refers to abscisic acid, a kind of plant hormone and serves as a signal transmission factor of seed dormancy and environmental stress (Yamaguchi-Shinozaki, K., and Shinozaki, K. (1994), Plant Cell 6, 251-264; Thomashow, M. F. et al., (1999), Plant Mol. Biol. 50, 571-599; and Shinozaki, K., and Yamaguchi-Shinozaki, K. (2000), Curr. Opin. Plant Biol. 3, 217-223). Furthermore, a transcriptional factor (DREB/CBF) involved in DRE/CRT responsive gene expression has been cloned (Stockinger. E. J. et al., (1997), Proc. Natl. Acad. Sci. USA 94, 1035-1040; Liu, Q. et al., (1998), Plant Cell 10, 1391-1406; Shinwari, Z. K. et al., (1998), Biochem. Biophys. Res. Commun. 250, 161-170; and Gilmour, S. J. et al., (1998), Plant J. 16, 433-443). DREB1/CBF is considered to function in cold-responsive gene expression, whereas DREB2 is involved in drought-responsive gene expression. Strong resistance to freezing stress was observed in a transgenic Arabidopis plant that overexpresses CBF1 (DREB1B) cDNA under the control of a cauliflower mosaic virus (CaMV) 35S promoter (Jaglo-Ottosen, K. R. et. al., (1998), Science 280, 104-106).
[0012]The present inventors have reported that when DREB1A (CBF3) cDNA molecules are overexpressed in transgenic plants under the control of a CaMV 35S promoter or a stress-inducible rd29A promoter, strong constitutive expression of stress-inducible DREB1A target genes are induced to improve resistance to freezing, drought and salt stresses (Liu, Q. et al., (1998), Plant Cell 10, 1391-1406; and Kasuga, M. et al., (1999), Nature Biotechnol. 17, 287-291). Furthermore, the present inventors have already identified six DREB1A target genes such as rd29A/lti78/cor78, kin1, kin2/cor6.6, cor15a, rd17/cor47, and erd10 (Kasuga, M. et al., (1999), Nature Biotechnol. 17, 287-291). However, it has not yet been sufficiently elucidated how the overexpressed DREB1A cDNA improves stress resistance to freezing, drought and salt in a transgenic plant. To investigate the molecular mechanisms of drought and freezing resistance, it is important to identify and analyze as many genes controlled by DREB1A as possible.
DISCLOSURE OF THE INVENTION
[0013]The present invention is directed to providing an environmental stress-responsive promoter and a gene encoding an environmental stress-responsive transcriptional factor.
[0014]The present inventors have intensively studied to solve the aforementioned problems. As a result, they succeeded in identifying novel genes responsive to cold, drought and salt stresses and isolating promoter regions thereof by using cDNA microarray analysis, thereby accomplishing the present invention.
[0015]More specifically, the present invention is directed to an environmental stress-responsive promoter comprising DNA of the following (a), (b) or (c):
(a) DNA consisting of any nucleotide sequence selected from SEQ ID NOS: 1 to 90;(b) DNA consisting of a nucleotide sequence comprising a deletion, substitution or addition of one or more nucleotides relative to any nucleotide sequence selected from SEQ ID NOS: 1 to 90, and functioning as an environmental stress responsive promoter; and(c) DNA hybridizing under stringent conditions to DNA consisting of any nucleotide sequence selected from SEQ ID NOS: 1 to 90, and functioning as an environmental stress responsive promoter.
[0016]Examples of environmental stress include at least one selected from the group consisting of cold stress, drought stress, and salt stress.
[0017]The present invention is also directed to an expression vector comprising the promoter mentioned above, or an expression vector having an arbitrary gene integrated therein.
[0018]Furthermore, the present invention is directed to a transformant comprising the expression vector.
[0019]Moreover, the present invention is directed to a transgenic plant, such as a plant body, plant organ, plant tissue or plant culture cell, comprising the expression vector.
[0020]The present invention is still further directed to a method for producing a stress-resistant plant, comprising culturing or cultivating the transgenic plant.
[0021]On the other hand, the present inventors identified novel genes encoding cold, drought and salt stress-responsive transcriptional factors by use of cDNA microarray analysis, thereby accomplishing the present invention.
[0022]More specifically, the present invention is directed to a gene encoding an environmental stress-responsive transcriptional factor comprising an amino acid of the following (a) or (b):
(a) any amino acid sequence selected from SEQ ID NOS: 2n (n is an integer from 47 to 82);(b) an amino acid sequence comprising a deletion, substitution or addition of one or more amino acids relative to any amino acid sequence selected from SEQ ID NOS: 2n (n is an integer from 47 to 82), functioning as an environmental stress-responsive transcriptional factor.
[0023]Also, the present invention is directed to a gene according to claim 1, comprising DNA of the following (a), (b) or (c):
(a) DNA consisting of any nucleotide sequence selected from SEQ ID NOS: 2n-1 (n is an integer from 47 to 82);(b) DNA consisting of a nucleotide sequence comprising a deletion, substitution or addition of one or more nucleotides relative to any nucleotide sequence selected from SEQ ID NOS: 2n-1 (n is an integer from 47 to 82), and encoding an environmental stress-responsive transcriptional factor; and(c) DNA hybridizing under stringent conditions to DNA consisting of any nucleotide sequence selected from SEQ ID NOS: 2n-1 (n is an integer from 47 to 82), and encoding an environmental stress-responsive transcriptional factor.
[0024]In the present invention, examples of environmental stress include at least one selected from the group consisting of cold stress, drought stress, and salt stress.
[0025]The present invention is also directed to an expression vector containing the gene, a transformant containing the expression vector, and a transgenic plant containing the expression vector.
[0026]Furthermore, the present invention is directed to a transgenic plant, such as a plant body, plant organ, plant tissue or plant culture cell.
[0027]Moreover, the present invention is directed to a method for producing a stress-resistant plant, comprising culturing or cultivating the transgenic plant.
[0028]Hereinafter, the present invention will be described in detail.
[0029]The present inventors constructed full-length cDNA libraries from Arabidopsis plants placed under different conditions, such as dehydration-treated plants and cold-treated plants (Seki. M. et al., (1998), Plant J. 15, 707-720), by the biotinylated CAP trapper method (Carninci. P. et al., (1996), Genomics, 37, 327-336); Then, Arabidopsis full-length cDNA microarrays were respectively prepared using about 1,300 full-length cDNA molecules and about 7,000 full-length cDNA molecules both containing stress-inducible genes. Besides using these dehydration and cold-inducible full-length cDNA molecules, another cDNA microarray was prepared using a DREB1A target gene, a transcriptional regulator for controlling expression of a stress-responsive gene. Thereafter, expression patterns of genes under drought and cold stress were monitored to exhaustively analyze stress-responsive genes. As a result, from the full-length cDNA microarray containing about 1,300 of full-length cDNA molecules, novel environmental stress-responsive genes, that is, 44 drought-inducible genes and 19 cold-inducible genes were isolated. 30 out of the 44 drought-inducible genes, and 10 out of the 19 cold-inducible genes were novel stress-inducible genes. Moreover, it was found that 12 stress-inducible genes were DREB1A target genes and 6 out of the 12 stress-inducible genes were novel genes. As a result of the analysis, 301 drought-inducible genes, 54 cold-inducible genes and 211 high salt-stress inducible genes were isolated from a cDNA microarray containing about 7,000 full-length cDNA molecules.
[0030]Thereafter, not only promoter regions but also environmental genes encoding environmental stress-responsive transcriptional factors were successfully isolated from these environmental stress-responsive genes.
[0031]As described above, a full-length cDNA microarray is useful tool for analyzing the expression manner of Arabidopsis thaliana drought- and cold-stress inducible genes and analyzing the target gene of a stress associated transcriptional regulator.
1. Isolation of Promoter
[0032]The promoter of the present invention contains a cis-element which is present upstream of a gene encoding a stress-responsive protein expressed by an environmental stress such as a cold, drought, or high salt stress and which activates the transcription of a gene present downstream thereof by binding of a transcriptional factor. Examples of such a cis-element include a dehydration responsive element (DRE), an abscisic acid responsive element (ABRE), and a cold-stress responsive element. Examples of genes encoding proteins binding to these elements include a DRE binding protein 1A gene (referred to also as a "DREB1A gene"), DRE binding protein 1C gene (referred to also as a "DREB1C gene"), DRE binding protein 2A gene (referred to also as a "DREB2A gene"), and DRE binding protein 2B gene (referred to also as a "DREB2B gene").
[0033]In isolating a promoter of the present invention, first, stress-responsive genes are isolated by using a microarray. In constructing a microarray, use may be made of about 1,300 cDNA molecules in total including genes isolated from Arabidopsis full-length cDNA libraries, responsive to dehydration (RD) genes, early responsive to dehydration (ERD) genes, kin1 genes, kin2 genes, and cor15a genes; and furthermore, α-tubulin genes as an internal standard; and moreover, mouse nicotinic acetylcholine receptor epsilon subunit (nAChRE) genes and mouse glucocorticoid receptor homologous genes, as negative controls.
[0034]As a microarray used in isolating the promoter of the present invention, use may be made of about 7,000 cDNA molecules in total including genes isolated from an Arabidopsis full-length cDNA library, responsive to dehydration (RD) genes, early responsive to dehydration (ERD) genes, and PCR amplification fragments as an internal standard obtained from λ control template DNA fragments (TX803, manufactured by Takara Shuzo); and mouse nicotinic acetylcholine receptor epsilon subunit (nAChRE) genes and mouse glucocorticoid receptor homologous genes, as negative controls.
[0035]A plasmid DNA extracted with a plasmid preparation device (manufactured by Kurabo) is sequenced by sequence analysis using a DNA sequencer (ABI PRISM 3700, PE Applied Biosystems, CA, USA). Based on the GenBank/EMBL database, the obtained sequence is screened for homology by using the BLAST program.
[0036]After poly A selection is performed, reverse transcription is carried out to synthesize double-stranded DNA molecules and a cDNA molecule is inserted into a vector.
[0037]The cDNA molecule inserted into a vector for constructing cDNA libraries is amplified by PCR using complementary primers to the sequences of vectors on both sides of the cDNA molecule. Examples of such vectors include λZAPII and λPS.
[0038]A microarray can be prepared according to a conventional method, which is not particularly limited. For example, using a gene tip microarray stamp machine GTMASS SYSTEM (manufactured by Nippon Laser & Electronics Lab.), the above obtained PCR product is loaded from a microtiter plate and spotted on a microslide glass at predetermined intervals. Then, to prevent a non-specific signal form being expressed, the slide is immersed into a blocking solution.
[0039]Examples of plant materials include a plant strain obtained by destroying specific genes as well as wild type plants. A transgenic plant having cDNA of DREB1A introduced therein may be used. Examples of plant species include Arabidopsis thaliana, tobacco and rice. Of them, Arabidopsis thaliana is preferable.
[0040]Dehydration- and cold-stress treatments can be carried out according to a known method (Yamaguchi-Shinozaki, K., and Shinozaki, K. (1994), Plant Cell 6, 251-264).
[0041]After plant bodies (wild type plants and DREB1A overexpression transformants) are exposed to stress, they are sampled and stored in cryogenic conditions with liquid nitrogen. The wild type and DREB1A overexpression transformants are used in an experiment to identify a DREB1A target gene. From plant bodies, mRNA is isolated and purified by a known method or a kit.
[0042]In the presence of Cy3 dUTP or Cy5 dUTP for labeling (Amersham Pharmacia), each of mRNA samples is subjected to reverse transcription and then used in hybridization.
[0043]After the hybridization, the microarray is scanned with a scanning laser microscope or the like. As a program for analyzing data of a microarray, Imagene Ver 2.0 (BioDiscovery) and QuantArray (GSI Lumonics) etc. may be used.
[0044]After the scanning, a plasmid having a target gene is prepared. In this way, the target genes are isolated.
[0045]A promoter region is determined by analyzing the nucleotide sequence of the gene isolated above and using a gene analysis program based on the genomic information of database (GenBank/EMBL, ABRC). The isolated genes can be classified into ones inducible by both dehydration and cold stress, ones inducible specifically by drought stress, and one inducible specifically by cold stress. According to the gene analysis program, from the genes mentioned above, 90 types of genes below can be identified.
[0046](FL03-07-F12, FL04-12-F24, FL04-14-N10, FL04-14-P24, FL04-17-103, FL04-17-M08, FL04-17-M22, FL05-05-A17, FL05-05-F20, FL05-05-G20, FL05-09-N09, FL05-10-J09, FL05-10-M08, FL05-11-H09, FL05-12-H13, FL05-13-I20, FL05-14-E15, FL05-14-E16, FL05-16-F03, FL05-16-H23, FL05-18-M07, FL05-18-O21, FL05-19-F21, FL05-19-O22, FL05-21-K17, FL06-10-F03, FL06-12-H12, FL07-12-123, FL08-08-H23, FL08-08-O14, FL08-09-M05, FL08-10-K08, FL08-11-P07, FL08-13-F10, FL08-19-D04, FL08-19-G15, FL09-06-B11, FL09-07-G17, FL09-10-A12, FL09-13-P15, FL02-05-I05, FL04-12-N15, FL04-16-P21, FL04-17-N22, FL04-20-P19, FL02-09-H01, FL05-01-D08, FL05-02-G08, FL05-02-O17, FL05-07-L13, FL05-08-B14, FL05-09-N10, FL05-11-L01, FL05-12-J09, FL05-14-D24, FL05-14-F20, FL05-14-108, FL05-15-C04, FL05-15-E19, FL05-18-A06, FL05-18-H15, FL05-19-C02, FL05-20-M16, FL05-20-N18, FL05-21-E06, FL05-21-L12, FL06-07-B08, FL06-08-H20, FL06-09-N04, FL06-11-K21, FL07-07-G15, FL07-12-D17, FL08-11-C23, FL08-13-G20, FL08-15-M21, FL08-18-N19, FL08-19-C07, FL08-19-P05, FL09-07-G09, FL09-07-G15, FL09-10-J18, FL09-11-I12, FL09-12-B03, FL09-16-I11, FL09-16-M04, FL11-01-J18, FL11-07-D13, FL11-07-F02, FL11-07-N15 and FL11-10-D10). The promoter regions of these genes are represented by SEQ ID NOS: 1 to 90, respectively.
[0047]As long as a promoter of the present invention acts as an environmental stress-responsive promoter, use may be made of any promoter having a nucleotide sequence selected from SEQ ID NOS: 1 to 90 wherein one or more nucleotides, preferably one or several nucleotides (for example 1 to 10, preferably 1 to 5) may be deleted, substituted or added. Furthermore, DNA hybridizing with the DNA comprising any nucleotide sequence selected from SEQ ID NOS: 1 to 90 under stringent conditions and acting as an environmental stress-responsive promoter is also included in the promoter of the present invention.
[0048]Once the nucleotide sequence of a promoter according to the present invention is determined, the promoter can be obtained by chemical synthesis, PCR using a cloned probe as a template, or hybridization using a DNA fragment having the nucleotide sequence as a probe. Furthermore, a mutant of the promoter of the present invention, which has the same functions as those of a non-mutated promoter, can be also synthesized by a site-specific mutagenesis or the like.
[0049]To introduce a mutation into a promoter sequence, a known method such as the Kunkel method, Gapped duplex method or an equivalent method may be employed. A mutation may be introduced by using a mutation-introducing kit (for example, Mutant-K manufactured by Takara or Mutant-G manufactured by Takara) which uses a site-specific mutagenesis or by using the LA PCR in vitro mutagenesis series kit (manufactured by Takara).
[0050]The term "functioning as an environmental stress-responsive promoter" used herein refers to a function of activating transcription caused by binding RNA polymerase to the promoter when the promoter is exposed to a predetermined environmental stress condition.
[0051]The term "environmental stress" used herein generally refers to an abiotic stress such as drought stress, cold stress, high salt stress, or intensive light stress. The term "drought" used herein refers to a state of water deficiency, and the term "cold" used herein refers to a state where an object is exposed to a lower temperature than the optimum living temperature for each organism (e.g., in the case of Arabidopsis thaliana, it is exposed to a temperature of -20 to +21° C. continuously for one hour to several weeks). The term "high salt" used herein refers to a state where a plant is treated with NaCl of 50 mM to 600 mM in concentration continuously for 0.5 hours to several weeks. The term "intensive light stress" used herein refers to a state where too intensive light to use for photosynthesis is applied to a plant, and corresponds to a case where, for example, light of 5,000 to 10,000 Lx or more is applied. These environmental stresses may be applied singly or in combination.
[0052]The plant promoter of the present invention includes a promoter having a nucleotide sequence represented by SEQ ID NOS: 1 to 90 wherein a nucleotide sequence may be added to the 3' end in order to increase transcriptional efficiency or a nucleotide sequence may be deleted from the 5' end to the extent not to lose the activity of a promoter.
[0053]Furthermore, the promoter of the present invention includes DNA which hybridizes with DNA consisting of any nucleotide sequence selected from SEQ ID NOS: 1 to 90 under stringent conditions and acts as an environmental stress-responsive promoter. The term "stringent conditions" used herein refers to the conditions of sodium concentration of 25 to 500 mM, preferably 25 to 300 mM, and a temperature of 42 to 68° C., preferably 42 to 65° C.; more preferably, conditions of 5×SSC (83 mM NaCl, 83 mM sodium citrate) and a temperature of 42° C.
2. Construction of Expression Vector
[0054]An expression vector of the present invention can be obtained by ligating (inserting) a promoter according to the present invention to an appropriate vector. The vector into which a promoter of the present invention is to be inserted is not particularly limited as long as it can be replicated in a host. Examples of such a vector include a plasmid, shuttle vector and helper plasmid.
[0055]Examples of such a plasmid DNA include plasmids derived from Escherichia coli (e.g., pBR322, pBR325, pUC118, pUC119, pUC18, pUC19, and pBluescript); plasmids derived from Bacillus subtilis (e.g., pUB110 and pTP5); and plasmids derived from yeasts (e.g., YEp13 and YCp50). Examples of a phage DNA include λ phages (Charon4A, Charon21A EMBL3, EMBL4, λgt10, λgt11, and λZAP). Further animal virus vectors such as retrovirus and a vaccinia virus and insect virus vectors such as a baculovirus can be also used.
[0056]To insert a promoter according to the present invention into a vector, use may be made of a method of digesting a purified DNA with appropriate restriction enzymes, inserting the obtained DNA fragment into the restriction site of a suitable vector DNA or a multi-cloning site, and ligating it to the vector.
[0057]In the present invention, to express an arbitrary gene, the arbitrary gene can be further inserted into the aforementioned expression vector. The technique inserting an arbitrary gene is the same as the method inserting a promoter into a vector. An arbitrary gene is not particularly limited. Examples of the gene include genes shown in Table 1 and known genes other than those.
[0058]In a case where a reporter gene, for example, a GUS gene, widely used in plants is linked to the 3' end of a promoter of the present invention, the strength of the promoter can be easily evaluated by checking GUS activity. As such a reporter gene other than the GUS gene, luciferase and a green fluorescent protein can be used.
[0059]As described above, various types of vectors can be used in the present invention. Further, a desired gene is ligated to the promoter of the present invention in a sense or antisense direction and then, the construction can be inserted into a vector such as pBI101 (Clonetech) called a binary vector.
3. Isolation of Transcriptional Factor
[0060]A transcriptional factor binds to a cis element which is present upstream of a gene and activates the transcription of the gene present downstream thereof. The transcriptional factors isolated in the present invention are induced by environmental stresses such as a low temperature, dehydration, and high salt concentration.
[0061]Environmental stress-responsive transcriptional factors are roughly divided into those belonging to a DREB family, ERF family, zinc finger family, WRKY family, MYB family, bHLH family, NAC family, homeo domain family and bZIP family.
[0062]In isolating a transcriptional factor, first, stress responsive genes are isolated by using a microarray. As a microarray, use may be made of about 7,000 cDNA molecules in total including genes isolated from Arabidopsis full-length cDNA libraries, responsive to dehydration (RD) genes, early responsive to dehydration (ERD) genes; PCR amplification fragments obtained from a λ control template DNA fragment (TX803, manufactured by Takara Shuzo), as an internal standard; and mouse nicotinic acetylcholine receptor epsilon subunit (nAChRE) genes and mouse glucocorticoid receptor homologous genes, as negative controls.
[0063]A plasmid DNA extracted by a plasmid preparation device (manufactured by Kurabo) is sequenced by sequence analysis using a DNA sequencer (ABI PRISM 3700, PE Applied Biosystems, CA, USA). Based on the GenBank/EMBL database, the obtained sequence is screened for homology by using the BLAST program.
[0064]After poly A selection is performed, reverse transcription is carried out to synthesize a double-stranded DNA molecule and a cDNA molecule is inserted into a vector.
[0065]The cDNA molecule inserted into a vector for constructing cDNA libraries is amplified by PCR using complementary primers to the sequences of vectors on both sides of the cDNA molecule. Examples of such vectors include λZAPII and λPS.
[0066]A microarray can be prepared according to a conventional method, which is not particularly limited. For example, using a gene tip microarray stamp machine GTMASS SYSTEM (manufactured by Nippon Laser & Electronics Lab.), the above obtained PCR product is loaded from the microtiter plate and spotted on a microslide glass at predetermined intervals. Then, to prevent a non-specific signal from being expressed, the slide is immersed into a blocking solution.
[0067]Examples of plant materials include a plant strain obtained by destroying a specific gene as well as wild type plants. A transgenic plant having a cDNA of DREB1A introduced therein may be used. Examples of plant species include Arabidopsis thaliana, tobacco and rice. Of them, Arabidopsis thaliana is preferable.
[0068]Dehydration- and cold-stress treatments can be carried out according to a known method (Yamaguchi-Shinozaki, K., and Shinozaki, K. (1994), Plant Cell 6, 251-264).
[0069]After plant bodies (wild type plants and DREB1A overexpression transformants) are exposed to stress, they are sampled and stored in cryogenic conditions with liquid nitrogen. The wild type and DREB1A overexpression transformants are used in an experiment to identify a DREB1A target gene. From plant bodies, mRNA is isolated and purified by a known method or a kit.
[0070]In the presence of Cy3 dUTP or Cy5 dUTP for labeling (Amersham Pharmacia), each of mRNA samples is subjected to reverse transcription and then used in hybridization.
[0071]After hybridization, the microarray is scanned with a scanning laser microscope or the like. As a program for analyzing data of a microarray, Imagene Ver 2.0 (BioDiscovery) and QuantArray (GSI Lumonics) etc., may be used.
[0072]After the scanning, a plasmid having a target gene is prepared. In this way, the target genes are isolated.
[0073]A transcriptional factor is determined by analyzing the nucleotide sequence of the gene isolated above and using a gene analysis program based on the genomic information of database (GenBank/EMBL, ABRC). The isolated genes can be classified into ones inducible by both drought and cold stress, ones inducible specifically by drought stress, and one inducible specifically by cold stress. According to the gene analysis program, from the genes mentioned above, genes encoding 36 types of transcriptional factors can be identified. The nucleotide sequences of the genes encoding 36 types of transcriptional factors are represented by SEQ ID NOS: 2n-1 (n is an integer of 47 to 82) and amino acid sequences of the transcriptional factors are represented by SEQ ID NOS: 2n (n is an integer of 47 to 82). Sequence ID numbers and the names of genes encoding transcriptional factors are shown in Table 1.
TABLE-US-00001 TABLE 1 Name of gene SEQ ID NO: RAFL05-11-M11 SEQ ID NO: 93 RAFL06-11-K21 SEQ ID NO: 95 RAFL05-16-H23 SEQ ID NO: 97 RAFL08-16-D06 SEQ ID NO: 99 RAFL08-16-G17 SEQ ID NO: 101 RAFL06-08-H20 SEQ ID NO: 103 RAFL07-10-G04 SEQ ID NO: 105 RAFL04-17-D16 SEQ ID NO: 107 RAFL05-19-M20 SEQ ID NO: 109 RAFL08-11-M13 SEQ ID NO: 111 RAFL04-15-K19 SEQ ID NO: 113 RAFL05-11-L01 SEQ ID NO: 115 RAFL05-14-C11 SEQ ID NO: 117 RAFL05-19-G24 SEQ ID NO: 119 RAFL05-20-N02 SEQ ID NO: 121 RAFL05-18-H12 SEQ ID NO: 123 RAFL06-10-D22 SEQ ID NO: 127 RAFL06-12-M01 SEQ ID NO: 129 RAFL05-14-D24 SEQ ID NO: 131 RAFL05-20-N17 SEQ ID NO: 133 RAFL04-17-F21 SEQ ID NO: 135 RAFL09-12-N16 SEQ ID NO: 137 RAFL05-19-I05 SEQ ID NO: 139 RAFL05-21-I22 SEQ ID NO: 141 RAFL08-11-H20 SEQ ID NO: 143 RAFL05-21-C17 SEQ ID NO: 145 RAFL05-08-D06 SEQ ID NO: 147 RAFL05-20-M16 SEQ ID NO: 149 RAFL11-01-J18 SEQ ID NO: 151 RAFL11-09-C20 SEQ ID NO: 153 RAFL05-18-N16 SEQ ID NO: 155 RAFL11-10-D10 SEQ ID NO: 157 RAFL04-17-N22 SEQ ID NO: 159 RAFL05-09-G15 SEQ ID NO: 161 RAFL05-21-L12 SEQ ID NO: 163
[0074]Note that as long as a transcriptional factor of the present invention functions as an environmental stress-responsive transcriptional factor, use may be made of any transcriptional factor having a nucleotide sequence selected from SEQ ID NOS: 2n-1 (n is an integer of 47 to 82) wherein one or more nucleotides, preferably one or several nucleotides (for example 1 to 10, preferably 1 to 5) have been deleted, substituted or added. Furthermore, DNA hybridizing with the DNA comprising any nucleotide sequence selected from SEQ. ID NOS. 2n-1 (n is an integer of 47 to 82) under stringent conditions and encoding an environmental stress-responsive transcriptional factor is also included in the transcriptional factor of the present invention. The term "stringent conditions" used herein refers to the conditions of sodium concentration of 25 to 500 mM, preferably 25 to 300 mM, and a temperature of 42 to 68° C., preferably 42 to 65° C.; more preferably, conditions of 5×SSC (83 mM NaCl, 83 mM sodium citrate) and a temperature of 42° C.
[0075]36 types of transcriptional factors isolated in the present invention may be classified as follows.
(1) DREB family: RAFL05-11-M11, RAFL06-11-K21, RAFL05-16-H23, RAFL08-16-D16;(2) ERF family: RAFL08-16-G17, RAFL06-08-H20;(3) Zinc finger family: RAFL07-10-G04, RAFL04-17-D16, RAFL05-19-M20, RAFL08-11-M13, RAFL04-15-K19, RAFL05-11-L01, RAFL05-14-C11, RAFL05-19-G24, RAFL05-20-N02;(4) WRKY family: RAFL05-18-H112, RAFL05-19-E19, RAFL06-10-D22, RAFL06-12-M01;(5) MYB family: RAFL05-14-D24, RAFL05-20-N17, RAFL04-17-F21;(6) bHLH family: RAFL09-12-N16;(7) NAC family: RAFL05-19-I05, RAFL05-21-I22, RAFL08-11-H20, RAFL05-21-C17, RAFL05-08-D06;(8) Homeo domain family: RAFL05-20-M16, RAFL11-01-J18, RAFL11-09-C20; and(9) bZIP family: RAFL05-18-N16, RAFL11-10-D10, RAFL04-17-N22, RAFL05-09-G15.
[0076]Note that RAFL05-21-L12 cannot be classified into (1) to (9).
[0077]Once the nucleotide sequence of a gene encoding a transcriptional factor according to the present invention is determined, the gene encoding a transcriptional factor according to the present invention can be obtained by chemical synthesis, PCR using a cloned probe as a template, or hybridizing a DNA fragment having the nucleotide sequence as a probe. Furthermore, a mutant of the gene encoding a transcriptional factor according to the present invention, and having the same functions as those of a non-mutated transcriptional factor, can be also synthesized by a site-specific mutagenesis or the like.
[0078]To introduce a mutation into a nucleotide sequence of a gene encoding a transcriptional factor, a known method such as the Kunkel method, Gapped duplex method, or an equivalent method may be employed. A mutation may be introduced by using a mutation-introducing kit (for example, Mutant-K manufactured by Takara and Mutant-G manufactured by Takara) which uses a site-specific mutagenesis or by using the LA PCR in vitro mutagenesis series kit (manufactured by Takara).
[0079]The term "environmental stress" used herein generally refers to an abiotic stress such as drought stress, cold stress, high salt stress, or intensive light stress. The term "drought" used herein refers to a state of water deficiency, the term "cold" used herein refers to a state where an object is exposed to a lower temperature than the optimum living temperature of each organism (e.g., in the case of Arabidopsis thaliana, e.g., in the case of Arabidopsis thaliana, it is exposed to a temperature of -20 to +21° C. continuously for one hour to several weeks). The term "high salt" used herein refers to a state where a plant is treated with NaCl of 50 mM to 600 mM in concentration continuously for 0.5 hours to several weeks. The term "intensive light stress" used herein refers to a state where too intensive light to use for photosynthesis is applied to a plant, and corresponds to a case where, for example, light of 5,000 to 10,000 Lx or more is applied. These environmental stresses may be applied singly or in combination.
4. Construction of Expression Vector
[0080]The expression vector of the present invention can be obtained by ligating (inserting) a gene encoding a transcriptional factor according to the present invention to an appropriate vector. The vector into which a gene encoding a transcriptional factor of the present invention is inserted is not particularly limited as long as it can be replicated in a host. Examples of such a vector include a plasmid, shuttle vector and helper plasmid.
[0081]Examples of such a plasmid DNA include plasmids derived from Escherichia coli (e.g., pBR322, pBR325, pUC118, pUC119, pUC118, pUC119, and pBluescript), plasmids derived from Bacillus subtilis (e.g., pUB110 and pTP5); and plasmids derived from yeasts (e.g., YEp13 and YCp50). Examples of a phage DNA include λ phages (Charon4A, Charon21A EMBL3, EMBL4, λgt10, λgt11, and λZAP). Further animal virus vectors such as retrovirus and a vaccinia virus and insect virus vectors such as a baculovirus can be also used.
[0082]To insert a transcriptional factor of the present invention into a vector, use may be made of a method of digesting a purified DNA with appropriate restriction enzymes, inserting the obtained DNA fragment into the restriction site of a suitable vector DNA or a multi-cloning site, and ligating it to the vector.
[0083]In a case where a reporter gene, for example, a GUS gene, widely used in plants is linked to the 3' end of the gene encoding a transcriptional factor of the present invention, the strength of the gene expression can be easily evaluated by checking GUS activity. As such a reporter gene other than the GUS gene, luciferase and a green fluorescent protein can be used.
5. Preparation of Transformant
[0084]A transformant of the present invention can be obtained by introducing an expression vector of the present invention into a host. The host used herein is not particularly limited as long as it can express a promoter, a gene of interest, or an environmental stress-responsive transcriptional factor. Of them, a plant is preferable. In a case of a plant host, a transformant plant (transgenic plant) can be obtained as follows.
[0085]A plant to be transformed in the present invention refers to an entire plant, a plant organ (such as leaf, petal, stem, root, or seed), a plant tissue (such as the epidermis, phloem, parenchyma, xylem, or vascular bundle), or a plant culture cell. Examples of plants used for transformation include plants belonging to the Brassicaceae, Gramineae, Solanaceae and Leguminosae (see below); however they are not limited to these plants.
[0086]Brassicaceae: Arabidopsis thaliana
[0087]Gramineae: Nicotiana tabacum
[0088]Solanaceae: Zea mays, Oryza sativa
[0089]Leguminosae: Glycine max
[0090]The aforementioned recombinant vector can be introduced into a plant by a conventional transformation method such as electroporation, Agrobacterium method, particle gun method, or PEG method.
[0091]For example, where electroporation is used, a gene is introduced into a host by treating a vector by an electroporation device equipped with a pulse controller under conditions: a voltage of 500 to 1,600 V, 25 to 1,000 μF, and 20 to 30 msec.
[0092]When a particle gun method is used, a plant body, organ and tissue may be directly used. Alternatively, they may be used after they are sectioned to pieces or after protoplasts of them are prepared. The samples thus prepared may be processed by a gene-introduction device (for example, PDS-1000/He manufactured by Bio-Rad). Processing conditions vary depending upon a plant or sample. Generally, processing is performed at a pressure of about 1,000 to 1800 psi and a distance of about 5 to 6 cm.
[0093]Furthermore, a gene of interest can be introduced into a plant by using a plant virus as a vector. Examples of available plant viruses include a cauliflower mosaic virus. More specifically, a virus genome is inserted into a vector derived from Escherichia coli to prepare a recombinant and then such a gene of interest is inserted into the virus genome. The virus genome thus modified is excised out from the recombinant with restriction enzymes and inoculated into a plant host. In this manner the gene of interest can be introduced into the plant host.
[0094]In the method using a Ti plasmid of the Agrobacterium, when bacteria belonging to the Agrobacterium are transfected to a plant, a portion of plasmid DNA of the bacteria is transferred into a plant genome. Using such a characteristic, a gene of interest is introduced into a plant host. Of bacteria belonging to the Agrobacterium, Agrobacterium tumefaciens, when it is introduced into a plant by transfection, produces a tumor called a crown gall. Also, a plant when it is transfected with Agrobacterium rhizogenes, it produces hairy roots. These phenomena are caused by transferring a region called a T-DNA region (transferred DNA region) present in a plasmid such as a Ti plasmid or Ri plasmid present in each bacterium into a plant and incorporating the region into a plant genome at a time of transfection.
[0095]By inserting desired DNA, which is to be incorporated into a plant genome, into the T-DNA region on a Ti or Ri plasmid, the desired DNA can be incorporated into a plant genome, when the host is transfected with Agrobacterium bacteria.
[0096]Tumoral tissues, shoots and hairy roots obtained as a result of transformation can be directly used in cell culture, tissue culture, or organ culture. Also, when a plant hormone such as auxin, cytokinin, gibberellin, abscisic acid, ethylene, or brassinoride, is administered to them in an appropriate concentration by using a conventional plant tissue culture method, a plant body can be regenerated from them.
[0097]A vector according to the present invention can be not only incorporated into the plant hosts mentioned above but also introduced into bacteria belonging to the Escherichia such as Escherichia coli, the Bacillus such as Bacillus subtilis and the Pseudomonas such as Pseudomonas putida; yeasts such as Saccharomyces cerevisiae and Schizosaccharomyces pombe; animal cells such as COS cells and CHO cells; and insect cells such as Sf9 cells, to obtain a transformant. Where a bacterium such as Escherichia coli or yeast is used as a host, it is preferable that a recombinant vector according to the present invention can be self-replicated in the bacterium and, at the same time, is comprised of a promoter of the present invention, a ribosome binding sequence, a gene of interest and a transcription termination sequence. Furthermore, a gene regulating the promoter may be included in the bacterium.
[0098]A method for introducing a recombinant vector into bacteria is not particularly limited as long as it is a method which can introduce DNA into bacteria. Examples of such a method include a method of using calcium ions and an electroporation method.
[0099]When a yeast is used as a host, Saccharomyces cerevisiae and Schizosaccharomyces pombe may be used. A method for introducing a recombinant vector is not particularly limited as long as it is a method for introducing DNA into a yeast. Examples of such a method include electroporation, spheroplast method, and lithium acetate method.
[0100]Where an animal cell is used as a host, a monkey COS-7 cell, Vero, Chinese hamster ovary cell (CHO cell), and mouse L cell etc. are used. Examples of methods for introducing a recombinant vector into an animal cell include electroporation, calcium phosphate method, and lipofection method.
[0101]When an insect cell is used as a host, a Sf9 cell and the like may be used. Examples of method for introducing a recombinant vector into an insect cell include a calcium phosphate method, lipofection method, and electroporation method.
[0102]Whether a gene is incorporated into a host or not is confirmed by a PCR method, Southern hybridization, Northern hybridization method or the like. For example, PCR is performed by preparing DNA from a transformant, and designing DNA specific primers. PCR is carried out under the same conditions as used for preparing the plasmid mentioned above. Thereafter, the obtained amplified product is subjected to agarose gel electrophoresis, polyacrylamide gel electrophoresis or capillary electrophoresis, and stained with ethidium bromide, or SYBR Green solution, etc. If the amplified product is found as a single band, it is confirmed that a transformant is obtained. Alternatively, the amplified product can be also detected by PCR using primers previously stained with a fluorescent dye or the like. Furthermore, there may be employed a method in which the amplified product is bound to a solid phase such as a microplate and confirmed by fluorescent or an enzymatic reaction.
4. Production of Plant
[0103]In the present invention, a transformed plant body can be regenerated from the above transformed plant cell or the like. As a regeneration method, use is made of one in which callus-form transformed cells are transferred to a medium having a different hormone in a different concentration and cultured to form an adventitious embryo, from which an entire plant body is obtained. Examples of the medium to be used herein include an LS medium and an MS medium.
[0104]The "method for producing a plant body" of the present invention comprises steps of: introducing a plant expression vector, into which the above plant promoter or a gene encoding an environmental stress-responsive transcriptional factor is inserted, into a host cell to obtain a transformed plant cell; regenerating a transformed plant body from the transformed plant cell; obtaining a plant seed from the transformed plant body; and producing a plant body from the plant seed.
[0105]To obtain plant seeds from a transformed plant body, for example, the transformed plant body is collected from a rooting medium and transferred to a pot having soil containing water placed therein. Then, the transformed plant body is grown at constant temperature to form flowers. Finally seeds are obtained. To produce a plant body from a seed, for example, when a seed formed on a transformed plant body has matured, the seed is isolated and seeded in soil containing water, followed by growing at constant temperature under illumination. The plant thus bred becomes an environmental stress-resistant plant exhibiting the stress resistance corresponding to the responsivity of a promoter introduced therein or a gene encoding the environmental stress-responsive transcriptional factor introduced therein.
BRIEF DESCRIPTION OF THE DRAWINGS
[0106]FIG. 1 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL03-07-F12;
[0107]FIG. 2 is a characteristic graph showing the relationship between cold treatment time and expression ratio regarding FL04-12-F24;
[0108]FIG. 3 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL04-14-N10;
[0109]FIG. 4 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL04-14-P24;
[0110]FIG. 5 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL04-17-I03;
[0111]FIG. 6 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL04-17-I03;
[0112]FIG. 7 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL04-17-M08;
[0113]FIG. 8 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL04-17-M22;
[0114]FIG. 9 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-05-A17;
[0115]FIG. 10 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-05-F20;
[0116]FIG. 11 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-05-G20;
[0117]FIG. 12 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-09-N09;
[0118]FIG. 13 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-10-J09;
[0119]FIG. 14 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL05-10-J09;
[0120]FIG. 15 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL05-10-M08;
[0121]FIG. 16 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL05-11-H09;
[0122]FIG. 17 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-12-H13;
[0123]FIG. 18 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL05-12-H13;
[0124]FIG. 19 is a characteristic graph showing the relationship between ABA treatment time and expression ratio regarding FL05-13-I20;
[0125]FIG. 20 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-14-E15;
[0126]FIG. 21 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-14-E16;
[0127]FIG. 22 is a characteristic graph showing the relationship between cold treatment time and expression ratio regarding FL05-14-E16;
[0128]FIG. 23 is a characteristic graph showing the relationship between ABA treatment time and expression ratio regarding FL05-14-E16;
[0129]FIG. 24 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-16-F03;
[0130]FIG. 25 is a characteristic graph showing the relationship between ABA treatment time and expression ratio regarding FL05-16-F03;
[0131]FIG. 26 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-16-H23;
[0132]FIG. 27 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL05-16-H23;
[0133]FIG. 28 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-18-M07;
[0134]FIG. 29 is a characteristic graph showing the relationship between ABA treatment time and expression ratio regarding FL05-18-M07;
[0135]FIG. 30 is a characteristic graph showing the relationship between ABA treatment time and expression ratio regarding FL05-18-021;
[0136]FIG. 31 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-19-F21;
[0137]FIG. 32 is a characteristic graph showing the relationship between ABA treatment time and expression ratio regarding FL05-19-F21;
[0138]FIG. 33 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-19-022;
[0139]FIG. 34 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL05-19-022;
[0140]FIG. 35 is a characteristic graph showing the relationship between ABA treatment time and expression ratio regarding FL05-19-022;
[0141]FIG. 36 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL05-21-K17;
[0142]FIG. 37 is a characteristic graph showing the relationship between ABA treatment time and expression ratio regarding FL06-10-F03;
[0143]FIG. 38 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL06-12-H12;
[0144]FIG. 39 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL06-12-H12;
[0145]FIG. 40 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL07-12-123;
[0146]FIG. 41 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL08-08-H23;
[0147]FIG. 42 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL08-08-O14;
[0148]FIG. 43 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL08-09-M05;
[0149]FIG. 44 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL08-10-K08;
[0150]FIG. 45 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL08-11-P07;
[0151]FIG. 46 is a characteristic graph showing the relationship between cold treatment time and expression ratio regarding FL08-11-P07;
[0152]FIG. 47 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL08-13-F10;
[0153]FIG. 48 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL08-13-F10;
[0154]FIG. 49 is a characteristic graph showing the relationship between ABA treatment time and expression ratio regarding FL08-13-F10;
[0155]FIG. 50 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL08-19-D04;
[0156]FIG. 51 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL08-19-G15;
[0157]FIG. 52 is a characteristic graph showing the relationship between ABA treatment time and expression ratio regarding FL09-06-B11;
[0158]FIG. 53 is a characteristic graph showing the relationship between ABA treatment time and expression ratio regarding FL09-07-G17;
[0159]FIG. 54 is a characteristic graph showing the relationship between ABA treatment time and expression ratio regarding FL09-10-A12;
[0160]FIG. 55 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL09-13-P15;
[0161]FIG. 56 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL02-05-105;
[0162]FIG. 57 is a characteristic graph showing the relationship between cold treatment time and expression ratio regarding FL04-12-N15;
[0163]FIG. 58 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL04-16-P21;
[0164]FIG. 59 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL04-17-N22;
[0165]FIG. 60 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL04-20-P19;
[0166]FIG. 61 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL02-09-H01;
[0167]FIG. 62 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-01-D08;
[0168]FIG. 63 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL05-02-G08;
[0169]FIG. 64 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-02-O17;
[0170]FIG. 65 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL05-07-L13;
[0171]FIG. 66 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-08-B14;
[0172]FIG. 67 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-09-N10;
[0173]FIG. 68 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-11-L01;
[0174]FIG. 69 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-12-J09;
[0175]FIG. 70 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-14-D24;
[0176]FIG. 71 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-14-F20;
[0177]FIG. 72 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-14-I08;
[0178]FIG. 73 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-15-C04;
[0179]FIG. 74 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-15-E19;
[0180]FIG. 75 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-18-A06;
[0181]FIG. 76 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL05-18-H15;
[0182]FIG. 77 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-19-C02;
[0183]FIG. 78 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-20-M16;
[0184]FIG. 79 is a characteristic graph showing the relationship between cold treatment time and expression ratio regarding FL05-20-N18;
[0185]FIG. 80 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-21-E06;
[0186]FIG. 81 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL05-21-L12;
[0187]FIG. 82 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL06-07-B08;
[0188]FIG. 83 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL06-08-H20;
[0189]FIG. 84 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL06-09-N04;
[0190]FIG. 85 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL06-11-K21;
[0191]FIG. 86 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL07-07-G15;
[0192]FIG. 87 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL07-12-D17;
[0193]FIG. 88 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL08-11-C23;
[0194]FIG. 89 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL08-13-G20;
[0195]FIG. 90 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL08-15-M21;
[0196]FIG. 91 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL08-18-N19;
[0197]FIG. 92 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL08-19-C07;
[0198]FIG. 93 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL08-19-P05;
[0199]FIG. 94 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL09-07-G09;
[0200]FIG. 95 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL09-07-G15;
[0201]FIG. 96 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL09-10-J18;
[0202]FIG. 97 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL09-11-I12;
[0203]FIG. 98 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL09-12-B03;
[0204]FIG. 99 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL09-16-I11;
[0205]FIG. 100 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL09-16-M04;
[0206]FIG. 101 is a characteristic graph showing the relationship between dehydration treatment time and expression ratio regarding FL11-01-J18;
[0207]FIG. 102 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL11-07-D13;
[0208]FIG. 103 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL11-07-F02;
[0209]FIG. 104 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL11-07-N15;
[0210]FIG. 105 is a characteristic graph showing the relationship between high salt treatment time and expression ratio regarding FL11-10-D10;
[0211]FIG. 106 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL08-16-G17;
[0212]FIG. 107 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-11-M11;
[0213]FIG. 108 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-11-M11;
[0214]FIG. 109 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL06-11-K21;
[0215]FIG. 110 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL06-11-K21;
[0216]FIG. 111 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL06-08-H20;
[0217]FIG. 112 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL06-08-H20;
[0218]FIG. 113 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-16-H23;
[0219]FIG. 114 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-16-H23;
[0220]FIG. 115 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL08-16-D06;
[0221]FIG. 116 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL07-10-G04;
[0222]FIG. 117 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL04-17-D16;
[0223]FIG. 118 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-19-M20;
[0224]FIG. 119 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL08-1-M13;
[0225]FIG. 120 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL04-15-K19;
[0226]FIG. 121 is a characteristic graph showing the relationship between cold stress and expression ratio regarding RAFL04-15-K19;
[0227]FIG. 122 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-11-L01;
[0228]FIG. 123 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-11-L01;
[0229]FIG. 124 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-14-C11;
[0230]FIG. 125 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-19-G24;
[0231]FIG. 126 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-19-G24;
[0232]FIG. 127 is a characteristic graph showing the relationship between cold stress and expression ratio regarding RAFL05-19-G24;
[0233]FIG. 128 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-20-N02;
[0234]FIG. 129 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-18-H12;
[0235]FIG. 130 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-18-H12;
[0236]FIG. 131 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-19-E19;
[0237]FIG. 132 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL06-10-D22;
[0238]FIG. 133 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL06-12-M01;
[0239]FIG. 134 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL06-12-M01;
[0240]FIG. 135 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-14-D24;
[0241]FIG. 136 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-14-D24;
[0242]FIG. 137 is a characteristic graph showing the relationship between cold stress and expression ratio regarding RAFL05-20-N17;
[0243]FIG. 138 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-20-N17;
[0244]FIG. 139 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL04-17-F21;
[0245]FIG. 140 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL09-12-N16;
[0246]FIG. 141 is a characteristic graph showing the relationship between drought stress and expression ratio regarding AFL05-19-105;
[0247]FIG. 142 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-19-I05;
[0248]FIG. 143 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-21-I22;
[0249]FIG. 144 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL08-11-H20;
[0250]FIG. 145 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL08-11-H20;
[0251]FIG. 146 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-21-C17;
[0252]FIG. 147 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-21-C17;
[0253]FIG. 148 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-08-D06;
[0254]FIG. 149 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-20-M16;
[0255]FIG. 150 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-20-M16;
[0256]FIG. 151 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL11-01-J18;
[0257]FIG. 152 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL11-01-J18;
[0258]FIG. 153 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL11-09-C20;
[0259]FIG. 154 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-18-N16;
[0260]FIG. 155 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL11-10-D10;
[0261]FIG. 156 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL11-10-D10;
[0262]FIG. 157 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL04-17-N22;
[0263]FIG. 158 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL04-17-N22;
[0264]FIG. 159 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-09-G15;
[0265]FIG. 160 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-09-G15;
[0266]FIG. 161 is a characteristic graph showing the relationship between drought stress and expression ratio regarding RAFL05-21-L12; and
[0267]FIG. 162 is a characteristic graph showing the relationship between high salt stress and expression ratio regarding RAFL05-21-L12.
BEST MODE FOR CARRYING OUT THE INVENTION
[0268]Hereinafter, the present invention will be further explained in detail by way of examples, which should not be construed as limiting the scope of the present invention.
Example 1
Isolation of Promoter
1. Materials and Methods
[0269](1) Arabidopsis cDNA Clone
[0270]A microarray was constructed by using about 7,000 cDNA molecules in total including genes isolated from an Arabidopsis full-length cDNA libraries, responsive-to-dehydration (RD) genes, early responsive-to-dehydration (ERD) genes, kin 1 genes, kin2 genes, and cor15a genes; α-tubulin genes as an internal standard; and mouse nicotinic acetylcholine receptor epsilon subunit (nAChRE) genes and mouse glucocorticoid receptor homologous genes, as negative controls.
[0271]Positive control: dehydration-inducible genes (responsive-to-dehydration genes: rd, and early responsive-to-dehydration genes: erd)
[0272]Internal standard: α-tubulin gene
[0273]Negative control: mouse nicotinic acetylcholine receptor epsilon subunit (nAChRE) genes and mouse glucocorticoid receptor homologous genes, which do not substantially have homology with any given sequence in an Arabidopsis database for analyzing non-specific hybridization.
(2) Arabidopsis Full-Length cDNA Microarray
[0274]The present inventors have constructed full-length cDNA libraries from an Arabidopsis plant body under different conditions (e.g., dehydration treatment, cold treatment and non-treatment in different growth stages from budding to maturation of seeds) by the biotinylated CAP trapper method. From the full-length cDNA libraries, the present inventors isolated individually about 7,000 independent Arabidopsis full-length cDNA molecules. The cDNA fragments, which were amplified by PCR, were arranged on a slide glass in accordance with a known method (Eisen and Brown, 1999). The present inventors prepared a full-length cDNA microarray containing about 7,000 Arabidopsis full-length cDNA molecules, which contain the genes below.
(3) Isolation of Dehydration-, Cold-, High Salt-, and ABA-Inducible Genes Using cDNA Microarray
[0275]In this example, dehydration-, cold-, high salt-, and ABA-inducible genes were isolated by using a full-length cDNA microarray containing about 7,000 Arabidopsis full-length cDNA molecules.
[0276]Probes of a plant treated with different stresses and an untreated plant with stress and labeled with Cy3 and Cy5 fluorescent dyes were mixed. The probes were hybridized with the full-length cDNA microarray containing about 7,000 Arabidopsis full-length cDNA molecules. By such a double labeling of a pair of cDNA probes wherein one of the mRNA samples was labeled with Cy3-dUTP and the other was labeled with Cy5-dUTP, hybridization with DNA elements on a microarray can be performed simultaneously, with the result that quantitative determination of gene expression under two different conditions (that is, stressed and unstressed conditions) can be directly and easily performed. The hybridized microarray was scanned by two discrete laser channels for Cy3 and Cy5 emission from each of DNA elements. Subsequently, the intensity ratio between two fluorescent signals from each DNA element was determined. Based on the relative value of the intensity ratio, a change of differential expression of genes represented as a cDNA spot on the microarray was determined. In this example, an α-tubulin gene, whose expression level was almost equivalent under two different experimental conditions was used, as an internal control gene.
[0277]In the full-length cDNA microarray containing about 7,000 Arabidopsis full-length cDNA molecules, a procedure for identifying dehydration-, cold-, high salt-, and ABA-inducible genes will be explained.
[0278]1) Both mRNA molecules derived from a plant treated with one of the stresses mentioned above and mRNA molecules derived from a wild-type plant unstressed were used to prepare Cy3-labeled cDNA and Cy5-labeled cDNA probes, respectively. These cDNA probes were mixed and hybridized with the cDNA microarray. In this example, an α-tubulin gene, which exhibits almost the same expression level under two type conditions, was as used as an internal control gene. A gene that exhibits the expression ratio of dehydration:unstressed, cold:unstressed, or high salt:unstressed more than double of that of the α-tubulin gene was defined as an inducible gene by a stress given to the gene.
[0279]2) Both mRNA molecules derived from a 35S:DREB1A transgenic plant and mRNA molecules derived from a wild-type plant unstressed were used to prepare Cy3-labeled cDNA and Cy5-labeled cDNA probes, respectively. These cDNA probes were mixed and hybridized with a cDNA microarray. In this example, an α-tubulin gene exhibiting almost the same expression level under two type conditions was used as an internal control gene. A gene of 35S:DREB1A transgenic plant exhibiting an expression ratio more than double of that of a gene of the wild type plant unstressed was defined as a DREB1A target gene.
[0280]Both mRNA molecules derived from a plant treated with a stress and mRNA molecules derived from a wild-type plant unstressed were used to prepare Cy3-labeled cDNA and Cy5-labeled cDNA probes, respectively. These cDNA probes were mixed and hybridized with a cDNA microarray. The same experiment was repeated three times to evaluate the reproducibility of microarray analysis. When the same mRNA sample was hybridized with various microarrays, a good correlation was observed. A gene that exhibits an expression ratio (dehydration/unstressed, cold/unstressed) more than double of that of the α-tubulin gene was defined as an inducible gene by a stress given to the gene.
(4) Analysis of Sequence
[0281]Plasmid DNA extracted by a plasmid preparation device (NA 100) manufactured by Kurabo was sequenced to find homology of gene sequences. The DNA sequence was determined by a dye terminator cycle sequencing method using a DNA sequencer (ABI PRISM 3700. PE Applied Biosystems, CA, USA). Based on the GenBank/EMBL database, homology of sequences was found by using the BLAST program.
(5) Amplification of cDNA
[0282]ZAPII (Carninci et al., 1996) was used as a vector for constructing a cDNA library. The cDNA inserted in a vector for the library was amplified by PCR using complementary primers to the sequences of both sides of the cDNA.
[0283]The sequences of primers are as follows:
TABLE-US-00002 (SEQ ID NO: 91) FL forward 1224: 5'-CGCCAGGGTTTTCCCAGTCACGA (SEQ ID NO: 92) FL reverse 1233: 5'-AGCGGATAACAATTTCACACAGGA
[0284]To 100 μl of a PCR solution mixture (0.25 mM dNTP, 0.2 μM PCR primer, 1×Ex Taq Buffer, and 1.25 U Ex Taq polymerase (manufactured by Takara Shuzo)), a plasmid (1 to 2 ng) was added as a template. PCR was performed under the following conditions: an initial reaction at 94° C. for 3 minutes, 35 cycles each consisting of 95° C. for one minute, 60° C. for 30 seconds and 72° C. for 3 minutes, and a final reaction at 72° C. for 3 minutes. After a PCR product was precipitated with ethanol, the precipitate was dissolved in 25 μl of 3×SSC and then subjected to electrophoresis using 0.7% agarose gel. The quality of the DNA obtained and amplification efficiency of PCR were confirmed.
(6) Construction of cDNA Microarray
[0285]Using a gene tip microarray stamp machine GTMASS SYSTEM (manufactured by Nippon Laser & Electronics Lab.), 0.5 μl of a PCR product (100 to 500 ng/ml) was loaded from a 384-well microtiter plate to form spots of the PCR product (5 nl for each) at intervals of 280 μm on 6 micro slide glasses (S7444, manufactured Matsunami) coated with poly-L lysine. To spot DNA in an equivalent amount, the slide after printing was placed in a beaker containing heated distilled water to moisten it and placed at 100° C. for 3 seconds to dry it. After the slide was placed on a slide rack, the rack was transferred into a glass chamber. To the glass chamber, a blocking solution (15 ml of 1M sodium borate salt (pH 8.0), 5.5 g succinic anhydrous compound (Wako), and 335 ml of 1-methyl-2-pyrrolidon (Wako)) was poured. After the glass chamber housing the slide rack was shaken up and down 5 times and gently shaking for 15 minutes, the slide rack was transferred to a glass chamber containing boiling water, shaken 5 times, and allowed to stand alone for 2 minutes. Thereafter, the slide rack was transferred to a glass chamber containing 95% ethanol, shaken 5 times, and centrifuged at 800 rpm for 30 minutes.
(7) Plant Material and Isolation of RNA
[0286]As a plant material, use was made of a wild type Arabidopsis thaliana plant body which was seeded on an agar medium and grown for 3 weeks (Yamaguchi-Shinozaki and Shinozaki, 1994) and an Arabidopsis thaliana (Colombian species) plant body into which DREB1A cDNA (Kasuga et al., 1999) connected to a 35S promoter of a cauliflower mosaic virus was introduced. Dehydration- and cold-stress treatments were performed in accordance with the method of Yamaguchi-Shinozaki and Shinozaki (1994). More specifically, dehydration treatment was performed by pulling a plant body out of the agar medium, placing it on a filter, and dried at a temperature of 22° C. and a relative humidity of 60%. The cold treatment was performed by transferring a plant body grown at 22° C. to 4° C. High salt stress treatment was performed by growing a plant body at an aqueous solution containing 250 mM NaCl.
[0287]After wild type plant bodies were exposed to stress-treatment for 2 or 10 hours, a sample was taken from each of plant bodies and stored in cryogenic conditions with liquid nitrogen. Furthermore, wild type and DREB1A overexpression-type transformants cultured in an agar medium without kanamycin were subjected to an experiment for identifying a DREB1A target gene. The DREB1A overexpression-type transformant was not treated with stresses. The total RNA was isolated from a plant body by using ISOGEN (Nippon gene, Tokyo, Japan) and mRNA was isolated and purified by Oligotex-dT30 mRNA purification kit (Takara, Tokyo, Japan).
(8) Fluorescent Labeling of Probe
[0288]Each of the mRNA samples was subjected to a reverse transcription reaction in the presence of Cy3 dUTP or Cy5 dUTP (Amersham Pharmacia). The composition of the buffer (30 μl) used in the reverse transcription reaction is shown in Table 2.
TABLE-US-00003 TABLE 2 poly(A)+ RNA with 6 μg oligo(dT) 18-mer 1 μg 10 mM DTT 500 μM dATP, dCTP and dGTP 200 μM dTTP 100 μM Cy3 dUTP or Cy5 dUTP 400 units of SuperScript II Reverse Transcriptase (Life technologies) 1× Superscript First Strand Synthesis Buffer (Life technologies) Total 30 μL
[0289]After reaction was performed at 42° C. for one hour, two samples (labeled with Cy3 and Cy5) were mixed to obtain a reaction mixture. To this reaction mixture, 15 μl of 0.1 M NaOH and 1.5 μl of 20 mM EDTA were added and treated at 70° C. for 10 minutes. Further, 15 μl of 0.1 M HCl was added to the reaction mixture, a sample was taken and transferred to a Micro con 30 micro concentrator (Amicon). 400 μl of TE buffer was added to the sample and centrifuged until the volume of the buffer reached 10 to 20 μl. The effluent was discarded. 400 μl of TE buffer and 20 μl of 1 mg/ml human Cot-1 DNA (Gibco BRL) were added to the resultant mixture and the mixture was again centrifuged. The labeled samples were centrifugally collected and several μl of distilled water was added thereto. The obtained probes, 2 μl of 10 μg/μl yeast tRNA, 2 μl of 1 μg/μl pd(A)12-18 (Amersham Pharmacia), 3.4 ml of 20×SSC, and 0.6 μl of 10% SDS were added. Further, the samples were denatured at 100° C. for 1 minute and placed at room temperature for 30 minutes and thereafter used in hybridization.
(9) Microarray Hybridization and Scanning
[0290]A probe was subjected to high-speed centrifugation for one minute by a benchtop micro centrifuge. To avoid generation of bubbles, the probe was placed at the center of an array and a cover slip was placed thereon. Four drops of 5 μl of 3×SSC were dropped on a slide glass and a chamber was kept at a suitable humidity to prevent the probe from being dried during hybridization. After the slide glass was placed in a cassette for hybridization (THC-1, BM machine) and the cassette was sealed, hybridization treatment was performed at 65° C. for 12 to 16 hours. The slide glass was taken out from the cassette and placed on the slide rack. After the cover slip was carefully removed in solution 1 (2×SSC, 0.1% SDS), the rack was washed while shaking and transferred into solution 2 (1×SSC) to wash for 2 minutes. The rack was further transferred to solution 3 (0.2×SSC), allowed to stand for 2 minutes, and centrifuged at 800 rpm for 1 min to dry.
[0291]The microarray was scanned at a resolution of 10 μm per pixel by use of a scanning laser microscope (ScanArray 4000; GSI Lumonics, Watertown, Mass.). As a program for analyzing microarray data, Imagene Ver 2.0 (BioDiscovery) and QuantArray (GSI Lumonics) were used.
(10) Northern Analysis
[0292]Northern analysis was performed using total RNA, (Yamaguchi-Shinozaki and Shinozaki, 1994). DNA fragments were isolated from the Arabidopsis thaliana full-length cDNA library by a PCR method and used as probes for Northern hybridization.
(11) Determination of Promoter Region
[0293]Based on the genomic information of Arabidopsis thaliana in a data base (GenBank/EMBL, ABRC), a promoter region was analyzed by using the BLAST program for gene analysis.
2. Results
(1) Stress-Inducible Gene
[0294]Fluorescent-labeled cDNA was prepared by subjecting mRNA isolated from an unstressed Arabidopsis thaliana plant to a reverse transcription reaction in the presence of Cy5-dUTP. A second probe labeled with Cy3-dUTP was prepared from a plant treated at low temperature for 2 hours. Both probes were simultaneously hybridized with a cDNA microarray comprising about 7,000 Arabidopsis thaliana cDNA clones and then a pseudo color image was created.
[0295]Genes induced and suppressed by a stress are represented by a red signal and green signal, respectively. Genes expressed at virtually the same level in both treatments are represented by a yellow signal. The intensity of each spot corresponds to the absolute value of the expression level of each gene. It is shown that a cold-inducible gene (rd29A) is represented by a red signal whereas an α-tubulin gene (an internal control) is represented by a yellow signal.
(2) Identification of Promoter Region
[0296]As a result of identifying a promoter region, the promoter gene regions of 90 types of genes were obtained in a full-length cDNA microarray containing about 7,000 of Arabidopsis full-length cDNA molecules. The name of these 90 types of genes and their promoter sequences are summarized in Table 3
TABLE-US-00004 TABLE 3 Name of gene SEQ ID NO: FL03-07-F12 SEQ ID NO: 1 FL04-12-F24 SEQ ID NO: 2 FL04-14-N10 SEQ ID NO: 3 FL04-14-P24 SEQ ID NO: 4 FL04-17-I03 SEQ ID NO: 5 FL04-17-M08 SEQ ID NO: 6 FL04-17-M22 SEQ ID NO: 7 FL05-05-A17 SEQ ID NO: 8 FL05-05-F20 SEQ ID NO: 9 FL05-05-G20 SEQ ID NO: 10 FL05-09-N09 SEQ ID NO: 11 FL05-10-J09 SEQ ID NO: 12 FL05-10-M08 SEQ ID NO: 13 FL05-11-H09 SEQ ID NO: 14 FL05-12-H13 SEQ ID NO: 15 FL05-13-I20 SEQ ID NO: 16 FL05-14-E15 SEQ ID NO: 17 FL05-14-E16 SEQ ID NO: 18 FL05-16-F03 SEQ ID NO: 19 FL05-16-H23 SEQ ID NO: 20 FL05-18-M07 SEQ ID NO: 21 FL05-18-O21 SEQ ID NO: 22 FL05-19-F21 SEQ ID NO: 23 FL05-19-O22 SEQ ID NO: 24 FL05-21-K17 SEQ ID NO: 25 FL06-10-F03 SEQ ID NO: 26 FL06-12-H12 SEQ ID NO: 27 FL07-12-I23 SEQ ID NO: 28 FL08-08-H23 SEQ ID NO: 29 FL08-08-O14 SEQ ID NO: 30 FL08-09-M05 SEQ ID NO: 31 FL08-10-K08 SEQ ID NO: 32 FL08-11-P07 SEQ ID NO: 33 FL08-13-F10 SEQ ID NO: 34 FL08-19-D04 SEQ ID NO: 35 FL08-19-G15 SEQ ID NO: 36 FL09-06-B11 SEQ ID NO: 37 FL09-07-G17 SEQ ID NO: 38 FL09-10-A12 SEQ ID NO: 39 FL09-13-P15 SEQ ID NO: 40 FL02-05-I05 SEQ ID NO: 41 FL04-12-N15 SEQ ID NO: 42 FL04-16-P21 SEQ ID NO: 43 FL04-17-N22 SEQ ID NO: 44 FL04-20-P19 SEQ ID NO: 45 FL02-09-H01 SEQ ID NO: 46 FL05-01-D08 SEQ ID NO: 47 FL05-02-G08 SEQ ID NO: 48 FL05-02-O17 SEQ ID NO: 49 FL05-07-L13 SEQ ID NO: 50 FL05-08-B14 SEQ ID NO: 51 FL05-09-N10 SEQ ID NO: 52 FL05-11-L01 SEQ ID NO: 53 FL05-12-J09 SEQ ID NO: 54 FL05-14-D24 SEQ ID NO: 55 FL05-14-F20 SEQ ID NO: 56 FL05-14-I08 SEQ ID NO: 57 FL05-15-C04 SEQ ID NO: 58 FL05-15-E19 SEQ ID NO: 59 FL05-18-A06 SEQ ID NO: 60 FL05-18-H15 SEQ ID NO: 61 FL05-19-C02 SEQ ID NO: 62 FL05-20-M16 SEQ ID NO: 63 FL05-20-N18 SEQ ID NO: 64 FL05-21-E06 SEQ ID NO: 65 FL05-21-L12 SEQ ID NO: 66 FL06-07-B08 SEQ ID NO: 67 FL06-08-H20 SEQ ID NO: 68 FL06-09-N04 SEQ ID NO: 69 FL06-11-K21 SEQ ID NO: 70 FL07-07-G15 SEQ ID NO: 71 FL07-12-D17 SEQ ID NO: 72 FL08-11-C23 SEQ ID NO: 73 FL08-13-G20 SEQ ID NO: 74 FL08-15-M21 SEQ ID NO: 75 FL08-18-N19 SEQ ID NO: 76 FL08-19-C07 SEQ ID NO: 77 FL08-19-P05 SEQ ID NO: 78 FL09-07-G09 SEQ ID NO: 79 FL09-07-G15 SEQ ID NO: 80 FL09-10-J18 SEQ ID NO: 81 FL09-11-I12 SEQ ID NO: 82 FL09-12-B03 SEQ ID NO: 83 FL09-16-I11 SEQ ID NO: 84 FL09-16-M04 SEQ ID NO: 85 FL11-01-J18 SEQ ID NO: 86 FL11-07-D13 SEQ ID NO: 87 FL11-07-F02 SEQ ID NO: 88 FL11-07-N15 SEQ ID NO: 89 FL11-10-D10 SEQ ID NO: 90
(3) The Relationship Between Stress Treatment Time and Expression Ratio
[0297]The 90 types of stress inducible genes isolated above were analyzed for the relationship between stress treatment time and expression ratio. The results are shown in FIGS. 1 to 105. The relationship between 90 types of genes and stress treatment are shown in Table 4.
TABLE-US-00005 TABLE 4 Name of gene Type of applied stress Drawing FL03-07-F12 Dehydration FIG. 1 FL04-12-F24 Exposure to cold FIG. 2 FL04-14-N10 Dehydration FIG. 3 FL04-14-P24 Dehydration FIG. 4 FL04-17-I03 Dehydration, Exposure to a high level salt solution FIGS. 5, 6 FL04-17-M08 Exposure to a high level salt solution FIG. 7 FL04-17-M22 Dehydration FIG. 8 FL05-05-A17 Dehydration FIG. 9 FL05-05-F20 Dehydration FIG. 10 FL05-05-G20 Dehydration FIG. 11 FL05-09-N09 Dehydration FIG. 12 FL05-10-J09 Dehydration, Exposure to a high level salt solution FIGS. 13, 14 FL05-10-M08 Exposure to a high level salt solution FIG. 15 FL05-11-H09 Exposure to a high level salt solution FIG. 16 FL05-12-H13 Dehydration, Exposure to a high level salt solution FIGS. 17, 18 FL05-13-I20 ABA treatment FIG. 19 FL05-14-E15 Dehydration FIG. 20 FL05-14-E16 Dehydration, Exposure to cold, ABA treatment FIGS. 21-23 FL05-16-F03 Dehydration, ABA treatment FIGS. 24, 25 FL05-16-H23 Dehydration, Exposure to a high level salt solution FIGS. 26, 27 FL05-18-M07 Dehydration, ABA treatment FIGS. 28, 29 FL05-18-O21 ABA treatment FIG. 30 FL05-19-F21 Dehydration, ABA treatment FIGS. 31, 32 FL05-19-O22 Dehydration, Exposure to a high level salt solution, ABA FIGS. 33-35 treatment FL05-21-K17 Exposure to a high level salt solution FIG. 36 FL06-10-F03 ABA treatment FIG. 37 FL06-12-H12 Dehydration, Exposure to a high level salt solution FIGS. 38, 39 FL07-12-I23 Exposure to a high level salt solution FIG. 40 FL08-08-H23 Exposure to a high level salt solution FIG. 41 FL08-08-O14 Dehydration FIG. 42 FL08-09-M05 Dehydration FIG. 43 FL08-10-K08 Exposure to a high level salt solution FIG. 44 FL08-11-P07 Dehydration, Exposure to cold FIGS. 45, 46 FL08-13-F10 Dehydration, Exposure to a high level salt solution, ABA FIGS. 47-49 treatment FL08-19-D04 Dehydration FIG. 50 FL08-19-G15 Exposure to a high level salt solution FIG. 51 FL09-06-B11 ABA treatment FIG. 52 FL09-07-G17 ABA treatment FIG. 53 FL09-10-A12 ABA treatment FIG. 54 FL09-13-P15 Dehydration FIG. 55 FL02-05-I05 Exposure to a high level salt solution FIG. 56 FL04-12-N15 Exposure to cold FIG. 57 FL04-16-P21 Dehydration FIG. 58 FL04-17-N22 Exposure to a high level salt solution FIG. 59 FL04-20-P19 Dehydration FIG. 60 FL02-09-H01 Dehydration FIG. 61 FL05-01-D08 Dehydration FIG. 62 FL05-02-G08 Exposure to a high level salt solution FIG. 63 FL05-02-O17 Dehydration FIG. 64 FL05-07-L13 Exposure to a high level salt solution FIG. 65 FL05-08-B14 Dehydration FIG. 66 FL05-09-N10 Dehydration FIG. 67 FL05-11-L01 Dehydration FIG. 68 FL05-12-J09 Dehydration FIG. 69 FL05-14-D24 Dehydration FIG. 70 FL05-14-F20 Dehydration FIG. 71 FL05-14-I08 Dehydration FIG. 72 FL05-15-C04 Dehydration FIG. 73 FL05-15-E19 Dehydration FIG. 74 FL05-18-A06 Dehydration FIG. 75 FL05-18-H15 Exposure to a high level salt solution FIG. 76 FL05-19-C02 Dehydration FIG. 77 FL05-20-M16 Dehydration FIG. 78 FL05-20-N18 Exposure to cold FIG. 79 FL05-21-E06 Dehydration FIG. 80 FL05-21-L12 Dehydration FIG. 81 FL06-07-B08 Dehydration FIG. 82 FL06-08-H20 Dehydration FIG. 83 FL06-09-N04 Dehydration FIG. 84 FL06-11-K21 Dehydration FIG. 85 FL07-07-G15 Exposure to a high level salt solution FIG. 86 FL07-12-D17 Exposure to a high level salt solution FIG. 87 FL08-11-C23 Dehydration FIG. 88 FL08-13-G20 Dehydration FIG. 89 FL08-15-M21 Dehydration FIG. 90 FL08-18-N19 Dehydration FIG. 91 FL08-19-C07 Dehydration FIG. 92 FL08-19-P05 Exposure to a high level salt solution FIG. 93 FL09-07-G09 Exposure to a high level salt solution FIG. 94 FL09-07-G15 Dehydration FIG. 95 FL09-10-J18 Exposure to a high level salt solution FIG. 96 FL09-11-I12 Dehydration FIG. 97 FL09-12-B03 Dehydration FIG. 98 FL09-16-I11 Exposure to a high level salt solution FIG. 99 FL09-16-M04 Exposure to a high level salt solution FIG. 100 FL11-01-J18 Dehydration FIG. 101 FL11-07-D13 Exposure to a high level salt solution FIG. 102 FL11-07-F02 Exposure to a high level salt solution FIG. 103 FL11-07-N15 Exposure to a high level salt solution FIG. 104 FL11-10-D10 Exposure to a high level salt solution FIG. 105
[0298]In FIGS. 1 to 105, the vertical axis shows the expression ratio of a gene, which is calculated as follows:
Expression ratio=[(FI of a cDNA molecule under stress)/(FI of a cDNA molecule under no stress)]/[(FI of α-tubulin under stress)/(FI of α-tubulin under no stress)]
where FI is the intensity of fluorescence.
[0299]As shown in FIGS. 1 to 105, the stress inducible genes isolated by a method according to the present invention exhibit different profiles; however, it is found that expression is induced by adding each stress. From this, it is demonstrated that the nucleotide sequences positioned upstream of these 90 types of genes and represented by SEQ ID NO: 1 to 90 serve as stress responsive promoters.
Example 2
Isolation of Gene Encoding Environmental Stress Responsive Transcriptional Factor
1. Materials and Methods
[0300](1) Arabidopsis cDNA Clone
[0301]A microarray was constructed by using about 7,000 cDNA molecules in total including genes isolated from Arabidopsis full-length cDNA libraries, responsive to dehydration (RD) genes, early responsive to dehydration (ERD) genes, kin 1 genes, kin2 genes, and cor15a genes; fragments amplified from λ control template DNA by PCR as an internal standard; and mouse nicotinic acetylcholine receptor epsilon subunit (nAChRE) genes and mouse glucocorticoid receptor homologous genes, as negative controls.
[0302]Positive control: dehydration-inducible genes (responsive to dehydration genes: rd, and early responsive to dehydration genes: erd);
[0303]Internal standard: fragments amplified from λ control template DNA by PCR (TX803, manufactured by Takara Shuzo, hereinafter referred to as a "control fragment");
[0304]Negative control: mouse nicotinic acetylcholine receptor epsilon subunit (nAChRE) genes and mouse glucocorticoid receptor homologous genes, as negative controls, which do not substantially have homology with any given sequence in an Arabidopsis database for analyzing non-specific hybridization.
(2) Arabidopsis Full-Length cDNA Microarray
[0305]The present inventors have constructed full-length cDNA libraries from an Arabidopsis plant body under different conditions (e.g., dehydration treatment, cold treatment and non-treatment in different growth stages from budding to maturation of seeds) by the biotinylated CAP trapper method. From the full-length cDNA libraries, the present inventors isolated individually about 7,000 independent Arabidopsis full-length cDNA molecules. The cDNA fragments, which were amplified by PCR, were arranged on a slide glass in accordance with a known method (Eisen and Brown, 1999). The present inventors prepared a full-length cDNA microarray containing about 7,000 Arabidopsis full-length cDNA molecules, which contain the genes below.
(3) Dehydration-, Cold-, and High Salt-Inducible Genes Using cDNA Microarray
[0306]In this example, dehydration-, cold- and high salt-inducible genes were isolated by using a full length cDNA microarray containing about 7,000 Arabidopsis full-length cDNA molecules.
[0307]Probes of plants treated with different stresses and an untreated plant with stress and labeled with Cy3 and Cy5 fluorescent dyes were mixed. The probes were hybridized with the full-length cDNA microarray containing about 7,000 Arabidopsis full-length cDNA molecules. By such a double labeling of a pair of cDNA probes wherein one of the mRNA samples was labeled with Cy3-dUTP and the other was labeled with Cy5-dUTP, hybridization with DNA elements on a microarray can be performed simultaneously, with the result that quantitative determination of gene expression under two different conditions (that is, stressed and unstressed conditions) can be directly and easily performed. The hybridized microarray was scanned by two discrete laser channels for Cy3 and Cy5 emission from each of DNA elements. Subsequently, the intensity ratio between two fluorescent signals from each DNA element was determined. Based on the relative value of the intensity ratio, a change of differential expression of genes represented as a cDNA spot on the microarray was determined. In this example, an α-tubulin gene, whose expression level was almost equivalent under two different experimental conditions, was used as an internal control gene.
[0308]In the full-length cDNA microarray containing about 7,000 Arabidopsis full-length cDNA molecules, a procedure for identifying dehydration-, cold-, and high salt-inducible genes will be explained.
[0309]Both mRNA molecules derived from a plant treated with one of the stresses mentioned above and mRNA molecules derived from a wild-type plant unstressed were used to prepare Cy3-labeled cDNA and Cy5-labeled cDNA probes, respectively. These cDNA probes were mixed and hybridized with a cDNA microarray. In this example, a control fragment, which exhibits almost the same expression level under two type conditions, was used as an internal control gene. A gene that exhibits the expression ratio (dehydration/unstressed, cold/unstressed or high salt/unstressed) more than 5 times of that of the control fragment was defined as an inducible gene by a stress given to the gene.
(4) Analysis of Sequence
[0310]Plasmid DNA extracted by a DNA extraction device (model Biomek, manufactured by Beckman Coulter) and purified by use of a multiscreen 96-hole filter plate (manufactured by Millipore) was sequenced to find homology of gene sequences. A DNA sequence was determined by a dye terminator cycle sequencing method using a DNA sequencer (ABI PRISM 3700. PE Applied Biosystems, CA, USA). Based on the GenBank/EMBL database, the homology of sequences was found by using the BLAST program.
(5) Amplification of cDNA
[0311]ZAP and λFLC-1 were used as a vector for constructing a cDNA library. The cDNA inserted in a vector for the library was amplified by PCR using complementary primers to the sequences of both sides of the cDNA.
[0312]The sequences of primers are as follows:
TABLE-US-00006 (SEQ ID NO: 165) FL forward 1224: 5'-CGCCAGGGTTTTCCCAGTCACGA (SEQ ID NO: 166) FL reverse 1233: 5'-AGCGGATAACAATTTCACACAGGA
[0313]To 100 μl of a PCR solution mixture (0.25 mM dNTP, 0.2 μM PCR primer, 1×Ex Taq Buffer, and 1.25 U of Ex Taq polymerase (manufactured by Takara Shuzo)), a plasmid (1 to 2 ng) was added as a template. PCR was performed under the following conditions: initial reaction at 94° C. for 3 minutes, 35 cycles each consisting of 95° C. for one minute, 60° C. for 30 seconds, and 72° C. for 3 minutes, and a final reaction at 72° C. for 3 minutes. After a PCR product was precipitated with ethanol, the precipitate was dissolved in 25 μl of 3×SSC and subjected to electrophoresis using 0.7% agarose gel. The quality of the DNA obtained and amplification efficiency of PCR were conformed.
(6) Construction of cDNA Microarray
[0314]Using a gene tip microarray stamp machine GTMASS SYSTEM (manufactured by Nippon Laser & Electronics Lab.), 0.5 μl of a PCR product (500-1,000 ng/ml) was loaded from a 384-well microtiter plate and form spots of the PCR product (5 nl for each) at intervals of 300 μm on 48 micro slide glasses (model Super Aldehyde substrate, manufactured by Telechem International). After spotting, the slide was dried in an atmosphere having a relative humidity of 30% or less and irradiated with ultraviolet rays for mediating a cross-linking reaction.
[0315]Thereafter, the slide was treated in 0.2% SDS with shaking for 2 minutes three times and soaked in distilled water twice. Subsequently, the slides were placed on a slide rack, which was the transferred into a chamber containing hot water and allowed to stand for 2 minutes. Subsequently, to the chamber, a blocking solution (containing 1 g borohydride, 300 ml of PBS, and 90 ml of 100% ethanol) was poured. After the glass chamber housing the slide rack was moderately shaken, the slide rack was transferred to a chamber containing 0.2% SDS and gently shaken for one minute 3 times. Thereafter, the slide rack was transferred to a glass chamber containing distilled water, moderately shaken for one minute, and centrifuged for 20 minutes to dry.
(7) Plant Material and Isolation of RNA
[0316]As a plant material, use was made of a wild type Arabidopsis thaliana plant body which was seeded on an agar medium and grown for 3 weeks (Yamaguchi-Shinozaki and Shinozaki, 1994) and an Arabidopsis thaliana (Colombian species) plant body into which DREB1A cDNA (Kasuga et al., 1999) connected to a 35S promoter of a cauliflower mosaic virus was introduced. Dehydration- and cold-stress treatments were performed in accordance with the method of Yamaguchi-Shinozaki and Shinozaki (1994). More specifically, dehydration treatment was performed by pulling a plant body out of the agar medium, placing it on a filter, and dried at a temperature of 22° C. and a relative humidity of 60%. The cold treatment was performed by transferring a plant body grown at 22° C. to 4° C. High salt stress treatment was performed by growing a plant body at an aqueous solution containing 250 mM NaCl.
[0317]After wild type plant bodies were exposed to stress-treatment for 2 or 10 hours, a sample was taken from each of plant bodies and stored in cryogenic conditions with liquid nitrogen. Furthermore, wild type and DREB1A overexpression-type transformants cultured in an agar medium without kanamycin were subjected to an experiment for identifying a DREB1A target gene. The DREB1A overexpression-type transformant was not treated with stresses. The total RNA was isolated from the plant body by using ISOGEN (Nippon gene, Tokyo, Japan) and mRNA was isolated and purified by Oligotex-dT30 mRNA purification kit (Takara, Tokyo, Japan).
(8) Fluorescent Labeling of Probe
[0318]Each of the mRNA samples was subjected to a reverse transcription reaction in the presence of Cy3 dUTP or Cy5 dUTP (Amersham Pharmacia). More specifically, the reverse transcription reaction was performed in a total amount of 20 μl of 1× Superscript first-stand buffer (containing 50 mM Tris-HCl, pH 8.3, 75 mM KCl, 3 mM MgCl2, and 20 mM DTT, manufactured by Life Technology), which contained:
1 μg of denatured poly (A)+ which contains 1 ng of λ poly A+RNA-A (TX802, manufactured by Takara Shuzo) serving as an internal standard;50 ng/μl 12 to 18 mer oligo dT primer (manufactured by Life Technology);0.5 mM dATP, 0.5 mM dGDP, 0.5 mM dCTP, and 0.2 mM dTTP;0.1 mM Cy3 dUTP or Cy5 dUTP;100 U of Rnase inhibitor;
10 mM DTT; and
[0319]200 U of Superscript II reverse transcriptase.
[0320]After the reaction solution of the aforementioned composition was incubated at 42° C. for 35 minutes, 200 U of Superscript II reverse transcriptase was added and further incubated at 42° C. for 35 minutes. To this reaction mixture, subsequently, 5 μl of 0.5 M EDTA, 10 μl of 1N NaOH, and 20 μl of distilled water were added, thereby terminating the enzyme reaction taking place in the reaction solution and simultaneously decomposing a template. The reaction solution was then incubated at 65° C. for 1 hour and thereafter neutralized with 1M Tris-HCL (pH 7.5).
[0321]The reaction solution was transferred to a Microcon 30 micro concentrator (manufactured by Amicon). 250 μl of TE buffer was added and centrifuged until the amount of the buffer reached 10 μl. The effluent was discarded. This step was repeated 4 times. Probes contained in the reaction solution were centrifugally collected and several μl of distilled water was added. To the obtained probes, 5.1 μl of 20×SSC, 2 μg/μl of Yeast tRNA, and 4.8 μl of 2% SDS were added. Further, the samples were denatured at 100° C. for 2 minutes, placed at room temperature for 5 minutes, and thereafter used in hybridization.
(9) Microarray Hybridization and Scanning
[0322]A probe was centrifuged for one minute by a benchtop micro centrifuge. To avoid generation of bubbles, the probe was placed at the center of an array and a cover slip was placed thereon. Four drops of 5 μl of 3×SSC were dropped on a slide glass and a chamber was kept at a suitable humidity to prevent the probe from being dried during hybridization. After the slide glass was placed in a cassette for hybridization (THC-1, BM machine) and the cassette was sealed, hybridization treatment was performed at 65° C. for 12 to 16 hours. The slide glass was taken out from the cassette and placed on the slide rack. After the cover slip was carefully removed in solution 1 (2×SSC, 0.03% SDS), the rack was washed while shaking and transferred into solution 2 (1×SSC) to wash for 2 minutes. The rack was further transferred to solution 3 (0.05×SSC), allowed to stand for 2 minutes, and centrifuged at 2500 g for 1 min to dry.
[0323]The microarray was scanned at a resolution of 10 μm per pixel by use of a scanning laser microscope (ScanArray 4000; GSI Lumonics, Watertown, Mass.). As a program for analyzing microarray data, QuantArray, Ver 2.0 (GSI Lumonics) was used. The background fluorescence was obtained through calculation based on fluorescent signals obtained from negative control genes (mouse nicotinic acetylcholine receptor epsilon subunit (nAChRE) gene and mouse glucocorticoid receptor homologous gene). Samples giving a fluorescent signal value of less than 1,000, which is equal to less than twice the background signal value, were not subjected to analysis. The cluster analysis of genes was performed by Genespring (manufactured by Silicon Genetic).
(10) Northern Analysis
[0324]Northern analysis was performed using total RNA, (Yamaguchi-Shinozaki and Shinozaki, 1994). DNA fragments were isolated from an Arabidopsis thaliana full-length cDNA library by a PCR method and used as probes for Northern hybridization.
(11) Determination of Gene Encoding Transcriptional Factor
[0325]Based on the genomic information of Arabidopsis thaliana in a data base (GenBank/EMBL, ABRC), a gene encoding transcriptional factor was analyzed by using the BLAST program for gene analysis.
2. Results
(1) Identification of Stress-Inducible Gene
[0326]Fluorescence-labeled cDNA was prepared by subjecting mRNA isolated from unstressed Arabidopsis thaliana to a reverse transcription reaction in the presence of Cy5-dUTP. A second probe labeled with Cy3-dUTP was prepared from a plant stress with dehydration, cold or high-salt. Both probes were simultaneously hybridized with a cDNA microarray containing about 7,000 Arabidopsis thaliana cDNA clones and pseudo color image was created.
[0327]Genes induced and suppressed by a stress are represented by a red signal and a green signal, respectively. Genes expressed at virtually the same level in both treatments are represented by a yellow signal. The intensity of each spot corresponds to the absolute value of the expression level of each gene. It is shown that a cold-inducible gene (rd29A) is represented by a red signal, whereas a control fragment (an internal control) is represented by a yellow signal.
[0328]As a result of scanning the microarray, 277 genes induced by dehydration treatment, 53 genes induced by cold treatment, and 194 genes induced by high salt treatment were identified. Note that genes whose expression ratio are not less than 5 times as large as that of a control fragment were determined as ones induced by a variety of stresses.
[0329]As a result of analysis using a database, 35 transcriptional factors, which were classified into the following families were identified. Note that RAFL05-21-L12 was not classified into the following families. However, when the nucleic acid base sequence, which was searched by the BLAST X based on amino acid sequence data registered in the GenBank Database, it exhibited E-value of e-100, which means that RAFL05-21-L12 is homologous to a gene encoding a known transcriptional factor, that is, heat shock transcriptional factor-like protein. As a result, RAFL05-21-L12 was identified as a transcriptional factor. In conclusion, 36 types of transcriptional factors were identified in this example.
(1) DREB family: RAFL05-11-M11, RAFL06-11-K21, RAFL05-16-H23, RAFL08-16-D06;(2) ERF family: RAFL08-16-G17, RAFL06-08-H20;(3) Zinc finger family: RAFL07-10-G04, RAFL04-17-D16, RAFL05-19-M20, RAFL08-11-M13, RAFL04-15-K19, RAFL05-11-L01, RAFL05-14-C11, RAFL05-19-G24, RAFL05-20-N02;(4) WRKY family: RAFL05-18-H12, RAFL05-19-E19, RAFL06-10-D22, RAFL06-12-M01;(5) MYB family: RAFL05-14-D24, RAFL05-20-N17, RAFL04-17-F21;(6) bHLH family: RAFL09-12-N16;(7) NAC family: RAFL05-19-105, RAFL05-21-122, RAFL08-11-H20, RAFL05-21-C17, RAFL05-08-D06;(8) Homeo domain family: RAFL05-20-M16, RAFL1-01-J18; RAFL11-09-C20; and(9) bZIP family: RAFL05-18-N16, RAFL11-10-D10, RAFL04-17-N22, RAFL05-09-G15.(3) The Relationship Between Treatment Time with Each Stress and Expression Ratio
[0330]Genes encoding 36 types of stress responsive transcriptional factors isolated as described above were analyzed for the relationship between treatment time with each stress and expression ratio. The results are shown in FIGS. 106 to 162. The correspondence between the names of genes and stress treatment shown in FIGS. 106 to 162 is listed in Table 5.
TABLE-US-00007 TABLE 5 Number of figure Name of gene Type of stress FIG. 106 RAFL08-16-G17 High level salt solution FIG. 107 RAFL05-11-M11 Dehydration FIG. 108 RAFL05-11-M11 High level salt solution FIG. 109 RAFL06-11-K21 High level salt solution FIG. 110 RAFL06-11-K21 Dehydration FIG. 111 RAFL06-08-H20 Dehydration FIG. 112 RAFL06-08-H20 High level salt solution FIG. 113 RAFL05-16-H23 High level salt solution FIG. 114 RAFL05-16-H23 Dehydration FIG. 115 RAFL08-16-D06 Dehydration FIG. 116 RAFL07-10-G04 Dehydration FIG. 117 RAFL04-17-D16 Dehydration FIG. 118 RAFL05-19-M20 Dehydration FIG. 119 RAFL08-11-M13 High level salt solution FIG. 120 RAFL04-15-K19 Dehydration FIG. 121 RAFL04-15-K19 Cold FIG. 122 RAFL05-11-L01 Dehydration FIG. 123 RAFL05-11-L01 High level salt solution FIG. 124 RAFL05-14-C11 Dehydration FIG. 125 RAFL05-19-G24 High level salt solution FIG. 126 RAFL05-19-G24 Dehydration FIG. 127 RAFL05-19-G24 Cold FIG. 128 RAFL05-20-N02 Dehydration FIG. 129 RAFL05-18-H12 Dehydration FIG. 130 RAFL05-18-H12 High level salt solution FIG. 131 RAFL05-19-E19 High level salt solution FIG. 132 RAFL06-10-D22 High level salt solution FIG. 133 RAFL06-12-M01 High level salt solution FIG. 134 RAFL06-12-M01 Dehydration FIG. 135 RAFL05-14-D24 Dehydration FIG. 136 RAFL05-14-D24 High level salt solution FIG. 137 RAFL05-20-N17 Cold FIG. 138 RAFL05-20-N17 Dehydration FIG. 139 RAFL04-17-F21 Dehydration FIG. 140 RAFL09-12-N16 Dehydration FIG. 141 RAFL05-19-I05 Dehydration FIG. 142 RAFL05-19-I05 High level salt solution FIG. 143 RAFL05-21-I22 High level salt solution FIG. 144 RAFL08-11-H20 Dehydration FIG. 145 RAFL08-11-H20 High level salt solution FIG. 146 RAFL05-21-C17 High level salt solution FIG. 147 RAFL05-21-C17 Dehydration FIG. 148 RAFL05-08-D06 High level salt solution FIG. 149 RAFL05-20-M16 Dehydration FIG. 150 RAFL05-20-M16 High level salt solution FIG. 151 RAFL11-01-J18 Dehydration FIG. 152 RAFL11-01-J18 High level salt solution FIG. 153 RAFL11-09-C20 High level salt solution FIG. 154 RAFL05-18-N16 High level salt solution FIG. 155 RAFL11-10-D10 Dehydration FIG. 156 RAFL11-10-D10 High level salt solution FIG. 157 RAFL04-17-N22 Dehydration FIG. 158 RAFL04-17-N22 High level salt solution FIG. 159 RAFL05-09-G15 Dehydration FIG. 160 RAFL05-09-G15 High level salt solution FIG. 161 RAFL05-21-L12 Dehydration FIG. 162 RAFL05-21-L12 High level salt solution
[0331]In FIGS. 106 to 162, the vertical axis shows the expression ratio of a gene, which is calculated as follows:
Expression ratio=[(FI of cDNA molecule under stress)/(FI of cDNA molecule under no stress)]/[(FI of control fragment under stress)/(FI of control fragment under no stress)]
where FI is the intensity of fluorescence.
[0332]As shown in FIGS. 106 to 162, the genes encoding stress responsive transcriptional factors isolated by a method according to the present invention exhibit different profiles; however, it is found that expression is induced by adding each stress.
INDUSTRIAL APPLICABILITY
[0333]A stress responsive promoter and an environmental stress responsive transcriptional factor are provided by the present invention. The promoter of the present invention is useful in that it can be used for breeding of environmental stress resistant plants in a molecular level.
Sequencing Free Text
[0334]SEQ ID NOS: 91, 92, 165 and 166 are synthetic primers.
Sequence CWU
1
16611000DNAArabidopsis thaliana 1gagaatttta gaaaaagaag ttccgtgaat
atcgcaaaca ttaaggcaag aacatttgca 60aaaaaaaaag agaaacaaat tcaacatcca
gcagaactgc agaagtaatg ggggtgttta 120gggcagaaga caagaacatt agcaaaaaaa
aggtaaagat tgactggttc catatgcatt 180catgtcatgc aaaatattga atcgaaccat
cacaagcata aacattaagg atttctgtta 240gacaaattaa cgttgtaagt actaaaacta
gctagatttg tgattgtcac atatcaactt 300gttctacata aagtttgtaa gagctaaaac
tttaagcgtg agctttttgt gaaacaaatg 360aaagttagat attaaccgtt ctcttattct
cgagatgatt ttcaatcgat ggttatgcta 420caaagttggc tcgttcaata taatgagctc
ttgtgtgttc aacatgaata catgatattt 480ttgtgctcgt taaatactcg gtttttacga
aatgaatttt tttaagagtt cgaacctaaa 540acccgaactt gattgaccaa tgtacattca
tagtagaata tttgctttgt acctgaatct 600gcgtttatga aactacgtct aaagattgaa
ccaaacaaat aaaccgagtt aaaccaaaat 660ctagatccta ccaaatcaaa acgaatgaac
attcaattta gaaaccaaaa aaataaaccg 720gacattcctc attagattct ctatattatt
cttttgtctg taattgcgtt tgtacaaaac 780cacgtttgcg tttagattgt ttaaagatat
ttattgaaaa aagacccgcg tgattaaaat 840gtgtaaagaa agttcctccg ttattaccgt
ccaagtggat ctctaagaga ccgtacaact 900aactccactc acggcggcgc attcccaatc
tgctgacacg tttctgctta tcactctctc 960tttttcttct cttcttacca cacctggttg
actcaaccac 100021000DNAArabidopsis thaliana
2tatggtagaa aaatatgaga gagaatgatt atttacataa tgaaaagtga ttgaagttac
60cttttaaatg taactaaatt acatgagtta caattctaat ttgtttgaat gacaatgagt
120gaagaagaat ataattaata cacaattagt atgatagtta caatttagta ttcaatggtt
180gattaataaa tgtgaataaa tttattgttt ttgtgggtta attggagata aatattattc
240tatttttatt tttgaaaaat tggttgttaa aaatgagacc ttatttttat atatacatga
300ttaaactatt ttgaacttga atccatttgt tttggattta ctcacttttc caatgttact
360tatagtttcg ccagtggaat ataaggttag aaaatatgag agtaaaaatc aggaattagg
420acatgtaaaa tccagaaata tctgaggacg gatctcacga ttgattataa gattcgcatt
480acattttctc ttcatttttt tcagataaga aaattggatt ctatcaattt gtcgtcctat
540tgttcttttc ttgtttgata tacaaatacg agtcactgat tttttttagt tgattggtag
600agtcttaaag gaaacatcaa attcagacag tccttttaca accatttcat ccatgtcggc
660tcattatttc gcaccacaga agaataaccc aaaattatta gaaaacttgg accactataa
720gcacttgaac tgtggtcgtg cactgactaa tgtgagcccg ttttggtaat tctcagctgt
780gtgcgtaatt gatgtcggcg tcaccttcag aaacttcgaa attatgtgtg atacaccgtg
840tttcataaat tcatacacgt cattttagat acaaaattag atattttcat ttgataaaac
900cgaaagaaat aataaacttt ttctttcttg ttctcatttt aagtctctct ttatatcact
960ttctatatat attgaatcat gcaaggaata cacataaaat
100031000DNAArabidopsis thaliana 3tgggtgacca actctttcct acttttctaa
cagtttttgg tttgttttgt tcagattcgg 60acaaaatgat gagtgttgtg ttgtgaacag
agaactggtc cgtttgtaat atttggggtc 120ggtggtcttt gtgacggtct aggctttggt
tgttgaacta cttcctttga ttaaggttta 180tggagttttg ctatatctac acaataatac
ataattgttt tgtattattg acttgaccaa 240taacaggtgt cgaaaattct cagttagtac
atcatctttt acaagtcttt tgacgttcta 300gaccaaatta tgttgatttt tgaatcaatg
gtgagttatt ttagtcatct ggttgccgtg 360ttttgaaact aagaaaagtt gaattatata
ccgtaaagaa gaagataata ctaaaaatcc 420gaatacgacc tacaatgaaa ctaagaaaat
cagtagccaa gaatgatttt gagtcaatgg 480tactaaatgg caaaaaacat tacaaactaa
ataaatgact tttggtggac gtctaataaa 540gtaatagcaa tgaataaaag taaagttttt
gcaatagagt ctttcattta attgtaaatg 600gatgtcttgt taaccaaaca atattttgat
tgttttcatt gctatgtgct ttattgagag 660gatgagagca catgcatgtc ttgttttgta
agctgtacct ttttttgtca taactcaaca 720tgatagtaat ggaatggaat ctctttgatc
tcttgttaag ttaccaaaac agaacaaccg 780tgagagagtg gttcatctaa tacaataaca
agacatgtat ctaccacaat cctcacatac 840acatgtacgc atcccattgg ccaattcact
tccaccaacg accccaatta acaaaaacca 900acatgagtta gttattagta gttgacctat
actcacttct tttatcttcc ctccactctc 960tccaatctat aaaaccaaat ccaacaattg
cttccttctc 100041000DNAArabidopsis thaliana
4ttgaattgaa tgaagggtgt ggtcggaaga gaagacgtgt agaagagacg agacaagtaa
60atttaagcat tggccccatt tacagccaca agtccgctac aacaaattat ttccaagaaa
120ctctgagata acgtcgtgat gaaacggctc atgctgctgt tgtgattcgt gaattagagg
180tttatctttt gggtttttga atgttactta attggacggt cgatttttca aactgggtgt
240gaaatgtgaa tgggtcattc ataatgggct tttgttttaa tgtgaagcca ttcacacact
300ctttgtcctt cttttctatt attcataact gtcactcttt gttcttcgaa atagtaaaga
360gcaaatcgat tctttgttga tctgggccgt aaaatttcca tggttgtggg aagtattctc
420gcagctgatc tgggccgtca atgctacagt ttcatgtcag agagaggtca agaatcaaca
480cgtggccaac catgatttta aaccaaagca aacacacgat tagaccccac attgtttgtt
540caccaacccc cgtggaccct cctttagccg acgtgtccac gtcaatagtg gtttttcttc
600ctttcaaagt acacaaattc cattctttct cattttactt tttggattac gttgttgtta
660taaactggta aaatgaatta tgaatgcaaa taaatttcat ttaagttttg ttggcttcta
720atattttttt cacctaaaat tctaataaac tacacagcca tgagccatcg tatgaaaaga
780agaagaaaaa aaatgtcttt ttctagaagg atctttcaac gactaaaaaa gattttaagc
840ttttgactaa ttttgtcaat aatatacaca aatttacact caattatagc catcaaatgt
900gtgctatgca gaaacaccaa ttatttcatc acacatacgc atacgttacg tttccaactt
960tctctatata tatatatagt aatacacaca cataaacagc
100051000DNAArabidopsis thaliana 5tttgtcgaat tggtggggtc ttaaggctaa
aaaagggttc accacatgtg tatgaatcga 60aatctactca tccgctgtcg ggaccaaact
acatatcaac taccacgttt ttctcatatg 120aatattcaaa gctaccgtaa atttttataa
aattacgtca ttagcataag aatgtatgta 180catattaatg actattataa accctaaggc
acggatacct tatccttcca ccaccgaatt 240tcacggattc catatccttt caccggacta
gacgaatatg aagctaagat attaatagat 300tggtgaccaa atttgaatct tttcggataa
aacttgtttg gcatactttc ccttagagtc 360aattattctt caaacaatgt ttaacttaaa
tctcaagatc ttatgtagtt aatggtagat 420atcgtacagt ttttttcgtg tttgtctttt
gatcgtgtgt aaagtgttgg aaaatctaac 480tttgtatata agttctacac ctccaaatta
acgttcgagc ttatcggtat tactgctagt 540gctttctcaa attccttttt agatcatata
gtagcccact aggaaacctt acaccaaaac 600gcccaaataa tatgaaaagc ccatcatatt
gtccatccaa agtaggacaa ttgctgaaaa 660agcccaacga atcaaacctg cattgttacc
ttgcacttgg aaaatgaatt aaacggcgac 720gcacctatgg atggagacta agttcactca
tcaaaaatct aatttcaact ctaattatcg 780tatttaaggt cattggagat tcaactcttg
tacctgtagt ttcttcttct gaaaatcagc 840aacaaactac aaatcatcca atgatagaat
ttcagcttta attatcaaac aaattaaaga 900taaccccatt gataatatcg ccacgtctca
tcggatgaaa tagtactatc tgacacgaca 960cgaatctctt atgagagaaa cagaagagca
cggaatctcc 100061000DNAArabidopsis thaliana
6attcatttct ccgagaagaa tatgtacttt tttactttcg acaaaagaac catattttta
60gcttgggaat cttggattaa atattaggac aatattttta acttgagaat cttggtttaa
120atatgagaac catatttcag aaaaaaaaag tttatatatg tgtaattttg tcggaaccgg
180atttactaac attcgtaggt agtcggattt ggtcccgtaa aataaaatgg taaataaaac
240aaaattaata gtaaagaaaa tgaaaaataa tttacaaaca caaattaaat tgaatacaaa
300ctagctatca taatattttt caagttggtt ttcatgaaca ttttagagca ctaagctata
360taacaaaata ataataatta aacatctttt aatcattaag tatttaaaga tccatacaaa
420tgttggttac ttggtacaac caacaaggca aaggcggtac tgaataagaa tataagatac
480atgaaaaatt gatctactat actttacaaa acgcgttaga tattatataa ttgctactcg
540ttcaattcat ggggatgtag ctcagatggt agagcgctcg cttagcatgc gagaggcacg
600gggatcgata ccccgcatct ccacttttat tgttttcttt ttaaggttct ttttctttta
660atttattttc aatcttctag tcttcaattc tgtttttgtt tctgtcggaa tctcttaaaa
720gtcaatttaa atactttaaa ttccttgcac aacactcaat tttgatactt tgaattcctt
780gcacaacaca acacgtcgct atcaatcaca gacaccacaa acttggacac ttatctttag
840ttctgtctta aactgaatct ccctcttatc aattctgttt ttgtttctgt cggaatatct
900taaaagtcaa ttttaatact ttgaattcct tgcacaacac aacacgtcgc tctcaatcac
960agacaccaca aacttcgaca cttatcttta gttccgtctt
100071000DNAArabidopsis thaliana 7ggtttttata ataaaaagtt gaaatttgtg
cccttttcaa gtctttcttc atcatcttca 60ttctcattct catttccatc ctctgcttgc
tctcttctca cagccccatc ctctttaatt 120accatctcca atcccctgaa ccaggattta
taccattgcg attccgcaat ccaacactac 180tatctctcta gtttcctttc ttcattagga
aagagcaaaa gcattctttt atggtaaagt 240ttcgattttt caaattcata cactgtcgga
ttcataccct tttcgctccg ggtctggaat 300ttgggttttt gaggtttcta cggagtcaat
tcgagtctgc ggaaactcaa tttgatagac 360tcgtttcaaa atgttgctta agggttttct
tttgttcatg gattgaagtt ttgcttaaac 420cagttacgag gtttcttggt tcttccatgt
atccgtgcgt tatctttgat gacccttcac 480tcaaagtctt cgttttttca tgtcaagttg
cttcctagac ttagaaaatt ctcgttttta 540atggattttt gttcatagaa caaacattta
acgtttcatg ctttttctgt gagtcacatg 600tttttttttc ttgagctgat taagtctctt
tttttgtttg cagccatcaa ttgaattata 660tgtcttttag ccaataattc tgataaacgt
caagaggtag gtgagaaaat caaaactctt 720gccttgttat cggttttcga taaccacatc
tatcaaacga tgtcatttgc ttgtttgttt 780ctgcttggac attagtttcc aaattattag
agagttttag tcacgtagaa ttatgaagtt 840tgtagtatgg cagatggctc gtgagcttgt
ttaactttat cttatgaatt tagtggacga 900gaaagaagtt gggttgtttt tggttggaca
tttattaatt ttcgtctctt tgtgccatgt 960tccatgaagt atcatttcat catctctagc
ttaatctctg 100081000DNAArabidopsis thaliana
8ataaaatttt tagatattta tatttataaa tatgactgca gaagttttaa gtttatgata
60gtttttaaaa agcaattatg atagttagta acaatttaga aacacgtata atttaatttc
120tttgtttatt tttcaaaaat atacaaaaaa tttatgtcag cgattttatt ttattttgta
180tggtatatga ctaatttaaa agtggtacta attagtatta taaaaaatta ttaaaaagta
240tttttaacct tacgactcta tttttttaca agttacaaat gtttcataaa attttattgt
300taaaacacta tgatctaaga tattgaaacc cacggtcaaa gtattgctaa catggtcatt
360acattgaaaa agaaaattaa ttgtctttac tcatgtttat tctatacaaa taaaaatatt
420aaccaaccat cgcactaaca aaatagaaat cttattctaa tcacttaatt gttgacaatt
480aaatcattga aaaatacact taaatgtcaa atattcgttt tgcatacttt tcaatttaaa
540tacatttaaa gttcgacaag ttgcgtttac tatcatagaa aactaaatct cctaccaaag
600cgaaatgaaa ctactaaagc gacaggcagg ttacataacc taacaaatct ccacgtgtca
660attaccaaga gaaaaaaaga gaagataagc ggaacacgtg gtagcacaaa aaagataatg
720tgatttaaat taaaaaacaa aaacaaagac acgtgacgac ctgacgctgc aacatcccac
780cttacaacgt aataaccact gaacataaga cacgtgtacg atcttgtctt tgttttctcg
840atgaaaacca cgtgggtgct caaagtcctt gggtcagagt cttccatgat tccacgtgtc
900gttaatgcac caaacaaggg tactttcggt attttggctt ccgcaaatta gacaaaacag
960ctttttgttt gattgatttt tctcttctct ttttccatct
100091000DNAArabidopsis thaliana 9cgtatatagc agatatagtt aaatctgttt
tgtatgttga taaactgctt gatataacga 60attgttatat ggaagattca aaattgatga
tcctatgata aagatcaaca tggatacaag 120acacgttttt tcttgcgata gtacttatat
ctgacgtcag taatagtcgt ggtggctatg 180tcagaagagg acacatctgt gactcatgtt
atgtggtcga gaatgaagtc tcgtgaaatt 240gtatttacac tttgttgtca tgtagggttt
gactttactt agtcggcaac gtatatatcc 300gatttatttt attttcttca aactggaagc
ttttagacca aaatttaaac taattttctg 360aaccacagaa atccaatctt tttttgtcac
tgaaagcggt tataatcata aatgtccaaa 420atgtgatttg tttttgaaat agttaaaagg
aagtgaattt ctaattaacc ttaatccacg 480taaaaacttt atatacctca aacaaaatcc
cagcacaacg aggttaaaat caaatatagc 540cagcttcaat tttaatttat tctaaaatgt
cgaagggtcc ggaaagcagt caaattgtcg 600cacattcata tttacgttaa ttagagaagt
cgcttcttac tcacgtttct cgttccaaac 660acaatctttt catgtttctc ttttaatttg
accgtcattt tttatgatgt gaaatattaa 720ttgcgcgaat acttcaaacg tacatctgtg
atctgtccgt cattttcctt cacgttacga 780taagtttcaa tctaaaaact aacatcgtcg
ccttcgttga atcaatgcat ggattcgtcg 840cagcatttat tctttattag acaactggcc
cttcaagcga aatgagggat acgtgtatat 900atatgcatcc acataaacaa taagaaaaat
ataaccctta tttttggatg ttatataaaa 960ttgctttacc tatatgtgag ctagcaacat
ttactcatac 1000101000DNAArabidopsis thaliana
10gggaatctag cagaattttt cttcctaata actattttcg agctttctgt ttttgttctt
60tctttttaaa aaacttatta agttcttatg aataatgact tgtgaagttt gagttcgtct
120ccttcacaag caagttgtat tggtgttttc tactttatga atatgggttt tatataccta
180aagacttgtt atgttattat tcttaaatgt tgctgctatg atgattacta ttatcgattt
240ttactattat atttgttttt tttaacgatt cgttgaaaaa taaacgagat tgacactaat
300gtagcaaaat gtgcaatatt atcttcaaaa cattacaaaa ttcagttgtc ttaatctttt
360tatttccgat gacaaattat ggttacgtaa atagtactat attaaaaaaa aaagaaaaaa
420aaagacgaaa ataaaacaaa aagggagaat ttgagtatta aaccaaaaaa aaagggagaa
480gtggactttt gatcaaaaga agagaacaac ttgcatgagt aaattagttt tagagctgac
540tatttaaaat tttaagaact aaaaagaaag tggtagtata atttaacaaa aggatgatga
600ctcatatgaa taatagtagg caaaaaaaac ttcaaaacca caatttcttg ttttcactta
660gaaccttatt ctgccataat tacgtttagt taaattaaga aaaaattgct taaaacgaca
720gtacaagaag aaggaaacat tttctttcac tcaaaatcct attatcgcgc aacagaggag
780tctaagtgaa ataataccgt gtcgtttgcg gatcagaaat cgagagaacc acgtatgtgt
840cttttatgtc ctggtcaaac ggttatccaa cacgtggatc ccaaaactag ccaccgactg
900cagcaagttt ctcgaaccga tgcacgtgtc ctcgacacgt ggagactccc aagcacgtct
960ctgggttata aatagcaacc caatgctcca atcattttcc
1000111000DNAArabidopsis thaliana 11gtcaacattg atttttcaat ttattaagat
tttttttact aatgtaacat agatgaaagt 60gaggttttct cgttacaata aaaaaaaaag
aagctacata gccagtaagc cagagaacta 120aacgtttctt gtacatgcat gttgcatgag
aactcttcta gtgagagaac tcttggccca 180aacaatcaat tttttgaaat attaaattaa
gcttacaaat ttgcaagagt gatgattgct 240atattgcata agaggtattt aacttggatt
tttttaaaaa aaaaaacaaa ttgggacgaa 300aatgtagtct ttcttttttt gtgtgtgtaa
agattaatat gttagaagaa tagcttatat 360attgctaaaa aaaataaaaa tgaatagctt
atatataaaa tagcaaaatc aagaaaacga 420cattactttt aagctgaata gatatgaatt
ttttgaaatg ccaaaagtta ttatataata 480gatgaacaaa gtttttttta aaatatagtt
ataattttaa aacgtacggg ataaattaaa 540agatcacttt ttcacgtttt ctttttttaa
caaaactttt gttagataat ctgaaatatt 600actcattatt tcccatattc tattaagtat
ttgtgaataa attgtataca gctaaacaca 660cataaattac aaatttattg atttaatata
ctttactact ttggttaaaa agtaaacata 720ttcaaattcc tttatttctt ctactttgga
gctgcattaa acaacttcct tgaataatta 780cgtaattaaa aacaaattac caaatattta
gtcaaaaaat aatttagtga atatatcaaa 840aataaaaaat aaaggaaagt cggacgatac
taacaaaatt attaaaggaa gatgaatttc 900tctttccttg tgcgcaaaat agcaacaagt
ctccgcgttt cctctctttc tctctttccc 960ttatttaaac acacttcgtc ttctacctct
aaacaatctc 1000121000DNAArabidopsis thaliana
12caacatgttc agaataataa ttcacagctg ggtaaatttt aatttacaat tattgagttt
60tttttttttt ttgtttatgt caaactcaaa actagtgcca cgtggaggaa gctaagccta
120taaacgatta tgaccgacag ttgtcactct agaagtagcg gtccttgtga cggcataagc
180tggtcataac cgcttagtgg tgatgatttg acgaaattgc ccattaagcg actgtgagat
240gtcggtagac tacgatggcc acgttccaca gagagtgaca ctaacaacca aaacctattt
300tattactatt ttttcttttg tcattagtat tttattacta attgtttgaa tatttcctcg
360tggtctatta ttattaaaca ctctagtcct cttattgttc attaaaattg tctgacgctt
420gtctcttcct ttcgttttct actagaaaat tcagctattt tcttaatgtt ttatatctcg
480taaactctta agttgtaaca gtgtttcaat ataatttgaa cacatattga agtttagtcc
540aaaattcttc ttctccagaa tatatccaaa tcgagtcaca atggaaaaca aattctcctt
600tgtaggtaaa tagaaacaaa caataattgt ggcattaaag tgttttaaaa cgagatccca
660ataatttaca tcagtaagat ttcaaaagtg tgtaattaga ataaacaata aacctccaca
720atttggaaga tattttgttt ccatatttgt cgcgtataat ttattatatt gtcaattctc
780gagaattttt aacgttttgt tttgaccagg taacgactgt tccggtaaat atgtaggtcc
840aaaaacttcc cgcggactta actcgttttg aaatccgctg tttggttttc tttttttatt
900accggctcta accggtccta cccaaaacct tgtcatgtcc tctcattcct ccaccgttta
960taagtaggcc cttaactcca attcccaccg tttcttcgct
1000131000DNAArabidopsis thaliana 13tcacgtgtta cggctgagag cttcgaagcc
cacgtcattg ctgacatatc ttcattcgta 60cggatcatgt aatctgttaa atttcccaaa
acgtttgcta cctaaacccg tgaattattt 120ggaagtcaaa cttcctcttt aatttcggaa
atatttacaa attcaggtca atctaatgta 180tttcatacaa cgagtgtaaa tgtattattt
tgacaaattt gcgtatcttc tagttgggtt 240gacaatttaa tatactttta ggtcaatttt
gcatagtaga gatatggata agaaagtcta 300tctatataca cacgcacatc tacgtattat
aacaactgta catcacattc acactatata 360cagaacattc atctaaaaca accgtattta
ttgtatccac acgttttact atgggtgtgt 420ggtcctttta attatcagtt actataaatt
ataaacaaaa ttccactagc taatcaatat 480gaaattacat ttgagtttag aatagtcaat
aggaaacgat ttaaccaaaa taaaataact 540aattcaattt aaatcactgt ttttttttgt
atccatcaac aatcatatga accatttgac 600tacattctct ggtcaatttg aattggtatt
tactactact aatagatact cttataccaa 660tttagataaa taaacttaat cctgagtacg
gactaaagga caatttgtac agtgagttaa 720aaatagaaaa tttacctaac aatacttttt
aaatcagtca aacatttttt gactttaata 780agaaataata tttaatattg gccttgtaaa
acaagactac tctcacaaca gtcaacacac 840acagctaagc atacgcgtca gcttccggtt
taaacaaaaa aaaacacaaa ccgtaatttg 900gtttccgttt cttcctcttg taacgaccgg
ttaaataaaa tgctgacgtc accatcttct 960tctatatata tcctacctgg aaccaactct
gtatatacgc 1000141000DNAArabidopsis thaliana
14tttatacatt tcacatatta taaatattat aaaacgtttt tccgttaaat attttaaatg
60tttaactaac tataatattt atttttaata aagtataata ttaataaatt cctaagattt
120ctccttttta aaagtaatta gaattgatgg aaatatatta gataaagaaa tgctaataat
180taagaatatt aattagttat aataattaaa tatcaatgac atccactgta aataagtccc
240aacttaagga tttatttact aaaatagctc caaaaatgta tatatagatg ttatcaaaat
300tcgaattaaa aagttgtaaa aagactcgga ggcggtcaaa atcaattttt ttattttctt
360gcataaattt gtgaactcca gagaattaag aatgaccagt gatagagctc cacgtggata
420gattgggttc caaagtggga cggcgttaaa tataaaaagg ccggaagttg tcgaagtagt
480cttttttccc attatttctg ttttcggttc cttattttgt tcctttgtaa taatgagtga
540catcaaaaat atttggggac gaattaacag gtaatgtaaa tagaaagaaa tagaaaattt
600tctttgattt tacaatttgg attcggattg cataagcaat gacatcaaca gtaatacata
660gctgagaggc tgagatgcag tgtctctctc tcagggaaga tgacaaaaag aatctgatat
720acacgtacgt atgttgcgag ttacatattg atatatcaca tgtcataagt cacatgctaa
780cgataaacta ataaactaaa acaattatgt caacatatat cggtgaattg cattatcatc
840gacctcggtc actgttccgc acatcttggt cgccacggaa catagacaat ttttggattg
900tagtccaatt aatgagtccc ccaccaaagc cgtagtaaaa tcgaagtcgt tctctaatcc
960aatcatatat atatccataa cacacaccaa caacacccac
1000151000DNAArabidopsis thaliana 15tgccaaaacc cattcttgga tcctgttgtc
accaactgca accattactt ctgcgacaag 60tgtgctctaa aggtaatctt ttccttacaa
caagtttcgt aatttctgaa aacacactca 120aaacagattt tccttgttat tttttttcat
aatgcagcat cacactgaga acgatacttg 180ttttgtgtgt aacgagccaa ctctagggct
tttcgacaca gccgtggaga tcaaggaaag 240aatagaggaa gaacgtgaga aagccagagg
tttgttaaaa acgcaaaact ctctgttttg 300acaccaaaaa cctactctta caaaactcgc
tcggtgatgt ctctttgtag ccatggtaaa 360ggaagtgaca gcaatgttag aaaaggcatc
gacaatggcg gatgacgcca agggcgtagc 420gcaaaaggtt gtaaaaatgg tggaggagat
tgaaacaatg gtggaaaaag tggcggctat 480ggccacaaag gcgggagaaa cggcgacaat
ggcggcagat atggtgaaag aagctgagga 540gacgatggaa acagctaaag ctaatatgtc
caaagccttt gtggtaatga agtcggtgaa 600ttggaacgtg taaatcgggt caaaacagag
ttttattgtg atctcatgct gacatcagct 660acaatcttta tctaataaga tagattctca
caagattctt ttatctatct actttttaga 720gaagatgaga tcatacttcg gagatagata
ggtgtcgtaa aaattgggaa gcttacttgg 780caggagaaga gagatataca cacgtgctta
aagtcaacag ctaatccaaa aaggtagacg 840acaaaccagt taaactaaga cacgtaatct
atcttaaaga tttgtcggtt catcgtaaat 900ccgttggtgg attaatatat gtcggtgtct
tcgttgattt tcttagccta accaaccaaa 960cataaataaa aatcgaagct ttactttgtt
tgcttctatt 1000161000DNAArabidopsis thaliana
16ataatataat tctgaaaata actaataatt tactcttttc aagtaattat tcaaaataaa
60tttagaaaac taattcaatg tatttcctca ttgtcactct tttatttccc tctacaaaat
120tatatatatg aagtagtttt ttttaaaata gagccccata actaacatta ggggtccata
180tatttgcatt gtaatttata aacacgtgta gaacattata attatttaaa ataaaattta
240gaacaataac ttaatgtatt tcctcctcgt tagttttcct ttttttccct ctacaaaatt
300acatatatgg aatagttgtt ctttacttct tttaagtaga gccccttaac taacacatta
360ggggtccata tatttgcatt ataatttata aacacgtgta gaacctggca ctgaagtctg
420atgaatatat atccatttgt tttcacaact accctagctt tttgtatcat cggttacttt
480ttttttatca caagaaattt ctcgattttg attttgattt ttcaatgttg tgaagttgag
540aagttcaaca tgttttggac agatgttttt gaacagtaaa ttagtatttt aaacatgtga
600atctatttgc aaacgtaatg acgtgtcgtc tgaccatttc ttgaatgtta gttaattttt
660tagctctagt tttgattaat ctaccaatta ggtcattcat ttcagtttaa gtcaaatcaa
720caattaggct cgtcctaacg ttttcattta gccaaaacaa aaaaaaagat ttcaacagta
780taaataattc gtccttatca aaaaattaat aaataaaaaa gatatacatc attacttacg
840taattgtttg tattttgata gttttcgata atttctatgt gacggttttt aaaataaaaa
900taaatgttga ataacaaaaa atagtagtac aatagaatac tgaattggtt tgtgtagggt
960caaaccaaca aatatggaaa gaaaagtatt atttagttac
1000171000DNAArabidopsis thaliana 17gccctagcct aaatcaaacc agagataact
gtggaaatga gactagagga ttcaataaaa 60acacaaactt ttttggatgt gcaaaaaaat
caaaattggt acctcttgtt cctttttaag 120atcttgttct ggcttaatcc agagaaagta
agcaatagtt ccggcgacga cccaagaagc 180ggcactctga ccacctctaa gacctcccat
ggaattaccg atgaaagatc tcacgtttcc 240aattgatctt ctccagctgc tagccattga
ttgatgccca actatacgaa ctggtctaag 300gttcttctgg atctcttctc tgacaattgc
ttgactttga tgatgattcg atttggaact 360gaaattgcaa gaaaagtaaa agcaggcgat
ttgattttga ttcgcatttg cttggtgaac 420actgccaaag gctaaaccgg tcgccgagac
ggttaatctg atgtttggga ttgacacatc 480atgcggctgt tgtttctcca cgtaacattt
ggagacccgc tcaaatatat tgtgggccgg 540agaacattgt tttaccatat agggcccata
gactttgaat atgtgtaggt aaataaaata 600caatctgtgg aacaaaaatg gcaaaaatat
ttgaagtcag caggattggt taacaatttg 660agcacagaaa tacatttggt aacatctgag
catatcattc atatcatatc gctgtcgaat 720ttgaaggaaa aaaaaaagac taaggaagtg
gatatattgg tgaatgctga tgatgagaaa 780acttattatt atatacgaaa aatttactaa
agacagcaat attccaaaat aatgataggg 840aagattctga gatggtgtgt ggtcagtgtg
gaggcactac agatggtcaa acagtagcat 900acccacctgg ctatttggac aaggacaagg
aagaagaaaa atcgagaacc atcttctcag 960ccgattttag taaccatctt cttgtttgct
ccgaacaatc 1000181000DNAArabidopsis thaliana
18aataataata actaattagt aattagtggg tattttaaat accattatgc catttagcta
60aggtaaaaaa tcaattgtta attataacac ccaccaccaa catagattcg tctatggttc
120attcatcgtg tttaagagtt gaaagaaata aattcgccca tcgtgaataa tagatattta
180ttcatctcaa acttataact caaatatatt tcaactatat atcgatcaaa tagtatagct
240tttggttaat gacgaatttt tattcttttg atccataatt cttcttcttc ttttttttct
300gtccggatac atgaaattga tgatacacat ttaaaacaac aattcgaatc agggaacaaa
360ttacagccac aaccagtgat gatagaccat atatatgatt tagtttattt aataaataat
420gtgtaagcaa atttggcttt gctgtaaaaa aagaacacga attttggcaa aagtttctgt
480taggaatctg gttctattct cctctgcaac ctccagtctc tcatgaatct ggttcggatt
540ctctttttcc ttgtttctat ataatttaga tacatggttt tataattcta tcatatgtct
600attttggata tagtatttta aaaatatata tatttttcat aaatggttat ggtctattct
660atgttaatga taatcattag tctttttgtc aactatgttt ttttttccaa caaatttagt
720atgtaaactt ttttttacta ccgttttatt aaatcgacgg ttgatcagat caactccggt
780ataacacaac ataagtttcg ttatcaaaac aaaaacaaaa acagattttt ttttgtcaac
840taccagtgaa gattagtctt acgtgtcaag aaaccggata aaaatatata acgtatttgg
900gcaatcagct aagatattaa ctaacgcgga gttcattatt aaaatggagt aatgatgttt
960tcagttttct atataaatca cgtcgagacc gtagagtctt
1000191000DNAArabidopsis thaliana 19taattttttt ttctttattc aaattatttc
aaaatacaat tatatataat ttattttttg 60tttgaacaaa attttacatt tattccgaaa
aaaaaaacta tgcaaaagaa aattttaatc 120caacttctca tcatcctcaa aataatgata
acaatatatc cgcttcatta gtaactgtaa 180ccttcttcga tcaaacaact catctgcaaa
gactagacta tttcatttat ctaacattac 240cgttaacaaa caaaaaacaa aaatttactt
gggagtcgaa gaagtcaaat atctagaaga 300atacttcttc ttacaaaatt ataaggaaaa
tttcccctac tgcttcgaga aatacctttt 360tccccattca cgttatgtta tgcaacgtgt
gaggataacg agcggacaac acagccctta 420tccttatttt attggtcaac caaccccaaa
acagatttta agacggagtg tatttctttt 480gtcaactgtt tacatgggtt aaataaaaca
gtttcactaa aacttattaa attatcatgt 540catattgtca tgtatataag ctatagcata
tatacggatg gaatatcgat taaaccattt 600tctttttcac cattggacca taatcagata
atatacatag attgggaacc ccaaatattt 660tcacattttg acatatttag tagcgacatt
aaactacttc gtgaaaatac ataggatatt 720atgtcaaatt gtcaacgaaa ctttttatat
ttaataagaa aacgaaatta aataataatt 780aacagaaaat actcacaagt gactttcaat
gattttgtct ctaacgtttt caatgtttat 840tttttggtct ctgactcttc aaggaaactg
gtaaaaccgg tgtcccggta gtcatcactc 900acatctcgtc caccacgtgt cactcttaca
cgttcataag ttttcccacc tttctacgtc 960cgattctgta tttatatacc ctaaacccat
cgtcttctat 1000201000DNAArabidopsis thaliana
20gaataatact cttatagaga ttttagtaga ttttcttctg cacccatctt cttttttgcc
60actagaaaaa gttacataaa aactgtaagt tgataaataa ataaaactgt aaattccaac
120aattagtttg gattttccac ctgaaatctt tatagactat gtttattcct cacaataaat
180agaagatatg aagaataaca tcacgcatag tcgatatgca cgtgagataa tatgttgaga
240tttagttacc aaaaaagaat gattttgtct gagaaatgaa attcctctag ttgtgaattt
300ttcaaacttg gtcggtcttg tgagttgagt atctttttga ggggaggtgg gaataataat
360aagcaaaaat atcttaaaaa aaagcgaaag ctgagtttga ccaatcaccg acccggaaga
420ttgaaactat aattagtact acgtttggtt gtcatcaaaa tggtcattct tatggttatg
480ggatattatc ctttgccgcg gctagccgac cattacacca acatcatttt tttttgtctc
540cattcaattt gcataaacct cttaaattat agatgttttt tacagtctaa tcttatcttt
600taactttgta agtggtgtcg ccttacgaaa attaaagctt ggaaaataac taaataaagt
660tgaaggatat gaactaatga atgccttcac cgtccaaaaa aacagagaat gccacgtatt
720cgtacggacg ttaatgtttt catctctgcg cgttgtgttt ttatgtgtca atatctctct
780atttttcaaa tgctcatttt cttaactttc ttattggcca atgaattgga gtaaaagcaa
840aggaataacc ccaaaaagat agttttgaat agtcgtcaat agatagatgg gacatacaaa
900ttacaaaagc ggttagcaaa atctttcgtt gaaaaaaata aatatctaat tcttgccctc
960atatataaac ccatgtagag tcgtctctct cttaccaaat
1000211000DNAArabidopsis thaliana 21aatggatgaa aggatagatc atacttacgt
ttgattcttg attttgattt tgattttcgt 60tagtttgggt agatgccata attgagagaa
tagggtttca atatattgat taggattttc 120tgataagaat aagatatata gggtttgggt
ggaagacata attattttag ttaattaaaa 180aaaaaaatta acctaaccgc taccgcccgc
aaccgcaaac gcttgcggga aggaactttt 240aaaatatggc gatttcgagc ggtccaaagc
ggtatctaac ggtttttatg attggtgtca 300aacgctaaca actgctacca cccgcaaacg
cagcgtttgc gggtgacaga gggagaacca 360atcaagctct aagactaatc ggaccaattt
agcaacttgt gtgcttcgat gtgttggact 420gggctagatt gggcccaaat agcctgtttt
aacattccca aagatccaaa caattccttc 480ttctttcttc cgaacaattc aatgtagatc
cgagcgaatc tcctaaatta ctaaaatgca 540caattgtgct cagcttacca aaaaggccca
aaagaccaaa acatattctg attttttctc 600tttagataca ataaaagtaa tttatataga
ttagaggccc aataaggttc acgtaggccc 660aaacatatta aaagtaactc agaagagatc
catcaaaatt ttgattcaca catctgttta 720ggaattaaaa atattatttg atcggttatt
cattcctttt catgaaatca tgcaaaaaat 780caaaaatcat ttttttctct agaaactacg
tggcgagaaa gcagagcacc agttgtcttc 840ttgctctgat tatctcgttg aaaccgcttt
caaagcagag caaaagagac gacaccggag 900cctccactgc tttacttttc ctttaaactg
tgactgcttt catttatata ataaaataca 960tacactctca gagtcacatg tactctcctc
taacataaac 1000221000DNAArabidopsis thaliana
22aggacacctc attctcatta cataatgtga cgagattgag ctgtcgtttt gttgtaaata
60tatggtcggg ggtcaattat tcatatgcag ttttaggtca tttataaaga ccattaaaaa
120cgtctttaat atttcaacaa tcttataatt caattattaa gaagcatcta gatatggatt
180cacattagag ttcgtattct ttgacagcaa attcacatta gacttttgat atatatattg
240tacatttgta atatagtata cgaaaatatt acttgaaaac tgatgttatg tgttgccatg
300atgccattgg tccatgactt ttccacacga aggccaaagc atactcttat attatatgag
360ttgagtcatt cttttttcct tttgacggca agttgagtaa ttcaattctc ttgtcactta
420ccggtgatat aagagttaat cttataaata gttttctgaa acttaatata ctataacaat
480gtaaaagtcg tcgctttgtt atttgaagtg aaaattaagc aatgttatga tatttttact
540aattaactca atatgaaaaa caaaaatcct cttaactaaa acagaaacat aaaagacgac
600ttagtttttg ctttagatct agactcataa ctcaaaaaac aatttcatta taaacttttg
660tagatcttac aattttaaaa taaaatgtac attaatgttg aaaagcaaaa tcttaaatta
720gtgtatacta ctactttttt tttatcaccg tgatagatca ttagatcctt aacctcaatc
780cctagagcct gcttattgcc tttaagcatt gtgcaatcac taccaaacac actcaaaact
840aaataaatat aatttataac ttatcaaata aaataaatac tatcaagtgt gtatgaaatc
900gatgacaatt tatttattac cttattacca tttgtgtctt gaagtacacg ttaagctatt
960tttatgtgtt ataaaggctt cttcgtcaac caattacgaa
1000231000DNAArabidopsis thaliana 23acatgcatga ttgttaacgt tttctttttg
tacaaggaaa aaaaaaaacg tgtacctcta 60gacaagagtt agagagaaat gagaaactgg
accaggagat gtgaatggtt tcattcctac 120tttggttaat cattgcaaat gccaataatt
atatagagaa cggcgtcaat tttctgcaaa 180gaaatatctt tactggtact tttttgtgct
acctataaat gtgttggttt aatctagcca 240tcacacgtat atttctagct atgttagatc
tttaatcatt ctcacctaga ctcctttaca 300aaaaccaata tagatgaaac cagtcccacc
cctctcattt aagtcatagt atatatttgt 360gtaacgagaa tacagacaaa acaattaaat
tcattgtaat catacttgaa aatatagtca 420atgaaattaa ttatcctaca tttttgaaaa
cagtgtctat aattattatg tacattgaat 480agatacgtgg gatattttgg attccagttc
atatatatgt agattaaaaa taaagagagg 540gcgataaagc ctaaagaaaa tgtacaaatc
gtaatgtaat ctagtgatga tttcctctat 600ataggttgat gttgggtaga cttttggtca
ccatgatatt tatctatcgt ctataaagta 660caaaactgtg gtactaaatg tgatttatga
aagctaatta aaaagaaggc gtgaagaaaa 720tgaaatcgta atagacgacg cgcgtacgag
atgagagtag tggaggaaag aagtaagatt 780gagtgaatga taaaatgcaa acgctaccta
ctaatatctc cacttgtcat gcaaccagac 840tgagttcgtt ttccttttcg agtcttattt
ttttgttttt tattctactc aacacgttac 900acgcttctaa taaactctaa acattaaaat
caaaatattt tgactacaat ggttattttg 960gagctatata taaaccacct gagcctcctc
agtttcctcc 1000241000DNAArabidopsis thaliana
24catacatata ttacgatgca aacaccgtat tttagagcta cccataaaat tgcttacaca
60aacaatagct gtctgttttc acaaatttat aactccaaac cccacaatta caataaccaa
120gagacactta accagatttc ggtttacact agaattatat ccccattggt catctggctc
180ggtccggtag agttccccag gcacaccggt tacatacatg tatttcggcg gaggtggaga
240tgaagatgac ttgctgcagt atgtctttct tggtggtggt ggccgcaacg gagttgtaat
300tccggtatgt cctccgaggt aagaatcgga attggtttga ttagtgagat agtgtaacac
360gaggaacatt gagaaaaaga ctagaatcag aggtttaagt ttcatcatga tcatgattat
420tatgattaat gggaaatgtc ttttgatttc aagatttgca aaaaataaca catataaagc
480atacatatat aaacgaaatg catgtgcatg taaagtatat gcagcgtcat atctctacat
540cacctatatg catttggtgg agaccagatc atacacatta aaatgcaaaa ttgtaaggtt
600cttcagtttt gaataagtct tagattgacg aggtcaattt tttctggtta ccattagtcc
660tatataaggt tataggattt ttgtaatgga ctggcctggt tgcttttgaa tccggcctgt
720tttaaggccc atctcttttt cagtgaagaa gctaaaaaaa gagttttaat ttcgtttgaa
780aattttcggg agcataaaga actatcgaag caacacgtat ccaccttaaa ccacatgtat
840actctagaac acaacacgtg ttaaagtgaa agatgagaat aatacggacc gaccctacgt
900acttgataaa aacaattaag aagaagaatt taaaaagaga gtgaaatgat tcgcttgagt
960ctctatataa acatcacaat ccaactctcc atcatcttcc
1000251000DNAArabidopsis thaliana 25aggtagtgat aaatttatta agataagttt
gtttgtaatt tattataaat cgtcatttgc 60tccttgcaca agtggtaatg tgtctatggc
cttggcaagg tcaaaggatt ccaaattcgt 120ctttgcaata ttgttctttt tttgtttaat
gtccaaaaca aacatagttg cattactcta 180acatatttag cagcacaaat atacaatttt
gtactgatta aaacacaaaa tatttttctg 240attttcataa actaactcaa acttggaaaa
tgatctttta attcttaact caaacaattt 300tgacatgact gatcgttttt ttggatacgt
aaatattatg atagaagtaa taagaaatgt 360gagaaatttt tagttttaag aagatgaaag
acctagtaaa tagatgttca aactttgaat 420acatcaaaca gatgagttaa tgctagtgag
caaaattcca tagtcaagtc ttccaaaaaa 480aagtcaaaac aatcatgtta taaaagtttg
taaccaatat ttacatttaa cacatttata 540aaatgtgaca tcaaaaacgc ggctataaaa
taacgttacg atattaaaaa tcaaaggact 600aataaaactc agaatttaat ttggggaaag
aaaaataaac cctgcaagat ttgaagtgtt 660cctcctaatc aaccgtaatt tagaaatggt
ccttaatgga cggtggtcgt cgtttcacac 720tctccgtcca ccattttttt tttttgtttc
tcctcctcta aaaagcaatg gtctttcaag 780ctccatcaat ggctactact acttagcgtc
tacgaataat aaccccttta ccataagtcc 840atagatttat ctctcttctt ctcaccactc
atgtcccttt ctttgtcttc ttcctctcac 900cgccttcaaa gttttcatct ttatccaaat
aaaaaaatcg tttctttttt agtttcttct 960taacatattt aaaggcgtac cctttctctt
acttatcctc 1000261000DNAArabidopsis thaliana
26cactaacgcc gtttactttc ccgcttggac gataacaccc ttgatatatg gattagtctc
60tcttttgctt ttggatatct taatatattt taatatgtcc atttttatat tgacgaccat
120ttaattaggt ctcagtttgt attattgatt tttaggtgcc aagcattact ttcgttacgt
180gtaatctttt gattattcaa tcaaattatc atatggttca tggcatttta tcaattgtga
240cggtcgacat gtgggttgac tttgatgtat gacttttagc ttgtgcctca tttttattcc
300aactcgacaa ctagcagact ttatatttta atttatttct ttctagatat tagacaattc
360ttagcttgtc tctctgcatg aatcacgcgt taatatactt ataatagtaa gttttaaaat
420gtttcgagta taatatcata ttagatttca attataattg taagattatt tagtctcaat
480gaagagcatt taaaaatgtg aaagagttta ggctttgact gttgactggt tctaacgcgt
540tttgcttctt atttatacaa aaagatttta ttttatttta attagaaata ttaagtactt
600tttagacgtg tttaggttga ttaatttctt gttaacatat ccgataaaaa aaaacagcac
660caaaatcacg acgagaatta tgcctacgtt acgttgataa taaagattag agataaaatg
720taacctctct tctctttaca tcattacatg catcttgtgt tttatttgac tttattctta
780ccaacaaaca accaatggat aaattaggct cgtgaaggag ggcaaactcg taatttcaaa
840gaaacggaaa gcgtcagaac gtggagaaca gtgaaccgag acacattccc gatttttcta
900aagaaacaaa ataaaacacg tgttttttat accataagta aagtagtagt atttattaaa
960ctattaatat atagctttgt tatattggag gtaggaagaa
1000271000DNAArabidopsis thaliana 27aaataaagtt ggacaaagaa aagaaaaaag
tgtttggcat ttaataaaac gtctcattca 60ttgcgaagag attagatagt agagaggtca
aattcaatgt cgcagccgac aatagataag 120aagagataag gtcttcgtta tatttttttg
tttgtttttt aactgtcacg tgaaactgat 180atccacaaag agcgaatgag agacgtagaa
gagtcaaaga ttaaaaaccc aacgaacttt 240gactcatcct ttgaaggtgt tcgttaatta
tccatttttt ggtctgactc gtggcatgtg 300gcaactaccg accttagata agcctggtcc
tactctcgta ttcgatcacc acgtgtcgat 360cggcttatcc gacacctcga gtgggagttg
tatgttagta actagataac ggtcataggt 420acgattatga cattgacatg aaatcatatt
ccaactatca acgttagtgt ccttgttttt 480atcccctgta attcagtcaa ttaagccatc
gtaccaggtg agtctttgat attgttgttg 540tctacgaaaa accattagat gatctctaat
tgatatttga ttcaacctat ggtaaaatta 600tcccaaaact caaatattac ttcaattgat
atcatcccaa atattaccta gagaggatca 660agctttttaa tcgtcaattt tggttataca
aaacgataaa aaaaaaattg taagccaaaa 720ataaaaagta aaacgaaatt gtgaattttt
aataattctt ttgcataata cacaaaagaa 780aaaaaactca tactccacat gtcaagtgat
gacacaataa atgtctaaat ttttacaatc 840aaaaacaaaa aaatgtataa aaaattcgtg
taaccttttt tttttgttgt ctaaaaaaat 900gacatgattt tggtaaatag ccaacaaatt
tgtagtagag tagtaaagtt aggtttcatc 960atccatctct ataaattctc aagaccgacc
tatacatttt 1000281000DNAArabidopsis thaliana
28tcaatcatta atgtatttat gatttgatcg aataacaata atatagctta ttttgttcta
60ttctatcgta tttgattctt ctttcgtttt ttttttgttt gacttaagaa accgattgtt
120tatagtagta aacatttgtt tttaatgttg ctcgattcca gtgcacatgt ccaggctaga
180cacttgtcgt tataaaggtt gctttggttc aatattgatc cactagagat gttacaacta
240ttgttgacat ctgagattgt gtgataagaa aatatgaaac tggatttagt gaaagttaca
300atatataatc atacatcata gataggaaat aaggaaatgt cagatatact tgaagaatac
360atcaaataga caaggtcctt tttcttattg tcgactatta tagagccgta cagaaccttt
420tcacgtcttt agtaattagt acattctcca tttcggctct ctcttatttt ttttccatct
480cttttacttc tccaaataat aacaataaaa gcttcgattt tgtgtgtgtt tgtatttaca
540tcttgacatc gatattcttt tcatcaattt tttaccaaaa atgtaataaa aacaaaaaaa
600aaccaacgct gaacacagac atggtttctc catccgttta tattcatcgt ttgtatgttt
660acttaacaac ttatttcaaa atagtacata tcatggttgt gtttttaaaa aaagtataca
720gaacagaaaa gcacatggta gacaaaataa tgaagccaaa attaatacaa agaagaagtt
780caacttgtat ttattaacac attttctttc cttgtcaaag acatgcaaat tggttttgtt
840ttcttattcc catttttttt ttataataaa aagaagaaga gtaaaacaaa aaaactatca
900tttcttctta tcgcaaaact cttatctaag caagaaaccg acaaaaccta tatctacata
960tattctcatc aacatctctt gagacatatt cattttggtt
1000291000DNAArabidopsis thaliana 29cttttgtctt tttccagcct ttgtttacgt
ggaatctcgg tttctaggac ccactacact 60ccttaaataa gaggtgcata attttatatt
taacaaatac aattagcatt agattagtcc 120aaatatccgg agtgattttt tttggttttt
agaatattta tttttttatt gtaaattata 180aaaaatgtta tatataattt ttttatagat
aattttttta attttattag atttatgata 240tacaatgtat tcttttttta tgttataaat
atttttaata aaatttgttt ttaataaaat 300tttggtgaac gattataatt atattttgtt
gtaaattttt tatacattta tttttattaa 360tctgtaaatt tgtaataaat tataatattt
gaaattcgaa ttttaaattt tttggtataa 420aattattaat aaacttttaa tcaattaata
gatcaaaaaa atttatgtcc aaattaatta 480attaaaattt ttttaaccta cactattgtt
taatgttatc acaaattata aatttttaaa 540ttaattattt tgtttggtcg gcaaattaag
atattattat caaaaacatt tttttttttt 600tcgcaacaca ttagttacta aatgaactat
taagttcact tatcttatcc aatttgtgtc 660taccctacat aaaccttgta cttatcccta
aatcacttta gataaattgt tgaaaattta 720atttataaat tttgtattta ccaaattagg
aaaacaaata tctgaaaata tttattttta 780atatcttaac aactcgaaga actgagaaac
gcgacaaaac caatcgtcct cttccgatag 840ccacaaaaca aaaatcagac aagaagaaag
aagaacgttt cttctaacag atagagatta 900caatcaaatt gactcttaat ttctcaattc
cgtatctctc atttcatctt cttcttcttc 960tcctttactt aaggatctct ggtttctctt
tctctcctct 1000301000DNAArabidopsis thaliana
30ttagcaataa tatataaccc aaaaacttat ctctgaacaa attaattagg aagtaacgca
60caaaaagaaa tgcacaaaaa taaatgtcag aagtgggatt tgaacccacg ccctctttcg
120aagaccagaa cttgagtctg gcgccttaga ccactcggcc atcctgactt tttgttaaac
180ttatacgtat atagtgtaaa taatcgtata attttcaata tccataccaa tcttcagact
240cttttttttg gtgacaaagt gtatgtatta ttttcagaag ctatatgttt ttcctataaa
300acatccaaga ctggcccata catattttta agaaacccta tagtgtgtat atgacttcaa
360aatttcgaat ggttagtttt tctttcgagg accaaaacaa aaagcccatt caatcactag
420aaaaatatca ctagtcaatc aatagaccaa aagattgaaa gtaggatata tttgtttaat
480aatgcctacg attctgcgaa gacaggagaa gccatacctt tcaatctaag ccgtcaactt
540gttcccttac gtgggatcct attatacaat ccaacggttc taaatgagcc acgccttcca
600gatctaacac agtcatgctt tctacagtct gcaccccttt tttttttagt gttttatcta
660cattttttcc tttgtgttta attttgtgcc aacatctata acttacccct ataaaaatat
720tcaattatca cagaataccc acaatcgaaa acaaaattta ccggaataat ttaattaaag
780ctggactata atgacaattc cgaaactatc aaggaataaa ttaaagaaac taaaaaacta
840aagggcatta gagtaaagaa gcggcaacat cagaattaaa aaactgccga aaaaccaacc
900tagtagccgt ttatatgaca acacgtacgc aaagtctcgg taatgactca tcagttttca
960tgtgcaaaca tattaccccc atgaaataaa aaagcagaga
1000311000DNAArabidopsis thaliana 31aagtgagtat aggtcaacta ctaataacta
actcatgttg gtttttgtta attggggtcg 60ttggtggaag tgaattggcc aatgggatgc
gtacatgtgt atgtgaggat tgtggtagat 120acatgtcttg ttattgtatt agatgaacca
ctctctcacg gttgttctgt tttggtaact 180taacaagaga tcaaagagat tccattccat
tactatcatg ttgagttatg acaaaaaaag 240gtacagctta caaaacaaga catgcatgtg
ctctcatcct ctcaataaag cacatagcaa 300tgaaaacaat caaaatattg tttggttaac
aagacatcca tttacaatta aatgaaagac 360tctattgcaa aaactttact tttattcatt
gctattactt tattagacgt ccaccaaaag 420tcatttattt agtttgtaat gttttttgcc
atttagtacc attgactcaa aatcattctt 480ggctactgat tttcttagtt tcattgtagg
tcgtattcgg atttttagta ttatcttctt 540ctttacggta tataattcaa cttttcttag
tttcaaaaca cggcaaccag atgactaaaa 600taactcacca ttgattcaaa aatcaacata
atttggtcta gaacgtcaaa agacttgtaa 660aagatgatgt actaactgag aattttcgac
acctgttatt ggtcaagtca ataatacaaa 720acaattatgt attattgtgt agatatagca
aaactccata aaccttaatc aaaggaagta 780gttcaacaac caaagcctag accgtcacaa
agaccaccga ccccaaatat tacaaacgga 840ccagttctct gttcacaaca caacactcat
cattttgtcc gaatctgaac aaaacaaacc 900aaaaactgtt agaaaagtag gaaagagttg
gtcacccaaa ttaagccacc tttgcttcct 960caattcctta tttataccat caaaagactc
cacaatatcc 1000321000DNAArabidopsis thaliana
32aaaacatcaa atctaagtac aatgatagac gacaacacag ttcctgcaac catggaactc
60aaattgaata tgcaatgact cttggttaaa gacagagcct ttgggatatt acatacataa
120acatataact taagcactaa ttcatgaatt tacatgttat atatgtatga tgacgaagtt
180ccaattccaa gaaagatcaa aaactacaaa gttacctctt ttacttctgg cggagtcgtt
240gacggagatt cactccggct cgttcttgct gcgacggtcg aaaaccacag aaaattgtcg
300caggtgggtc tggttagtgt aaaaaccgat aaaggaggga ttttgacatt ttgttttgga
360ggaggaagac gaggagattt cttgaaccga atgtcccggt ttttctaaaa ccacttctca
420gactcccggt tcatattaac cagcgcgtgc gtgtcgtctg gttttcatca tgaccaacgg
480tcaagatcaa aaggatcgcc ttttttagta gatggtgttc gatgaaatgc tcgttagaga
540cgttaaagtt gtagtgacaa caaattagtg tttatcattt tcagaatcgg tataatggaa
600atgtcagttt cttcattcat ttgaaattgt gacaagatcc aactccatca tttcaccacg
660caataactaa actgttaggg acaaaacgca acatgaatga atcaatgtac acaagtgtga
720gaacctgata ttcctatcat tgaattgtat ggaacatgcc aaacttagag agtaaagtgt
780acgtgttcta ttttgtaatg caaatatgat gcaattcaaa tcaaacacga gtttatctcc
840tttggcacgt ttctcaatat agattgctgc accagaaaaa gacaattttt tctgtttgaa
900gcctcttttg acatgaagcg aaatgcggca agagaaccaa gaagacaata gcatttttgc
960ttttctttgg ttaccactct ctaacaagat aaagaaatgt
1000331000DNAArabidopsis thaliana 33ccatgttagc taaattgatc catttggaat
tttggattct atgtatttag ttcattatca 60atgattcgtt ccaagagaaa aaaaaaaaca
cttaatagat tattagtttt ttttgttttg 120aacaccaaaa tttttagaaa ttactcaaca
gaaataattt atcttttgtt tttttagtcg 180acccaaactt gtgcgtttta gtctttctct
taatctttgg aaaaaaatat ctgcagtgtt 240tttatatcta ctcaaaagcc caactttcaa
agttctaaaa ttcaaagccc aaagcctaac 300agttaatcca gttaaaacta agttcggaat
ccatgtaagg cttaatggtc cggtcccgtc 360ccggtctcaa aacggtagtc gtgactcgtg
atatcttgca gaatccgtaa ttccgtatac 420ttgacgaggt gctgtaagta ttagatgtta
tcgtgatgtt tctgctcttc ttcgatcgcc 480acatcaaaat tataccatat ttttgaatct
aaagattgtt acgaatgcat aagcatatat 540gtaagaaaac aataatcacc tatatttatc
aaaattaaag atctatcgat tgatctttct 600gacttgcaaa gagagactcc acataaatgt
aaacacggtt aaaatacttg aatacctcaa 660tcattgtcgt gccacgccga catagtcaat
tagtcataac ttgaccctcc aaaaaagcct 720aacgtaagag agtttcacaa acttatttcc
aattatctta agccacgtgt ttagtgacct 780ttgtgctggc cctgaaacgt gtctatctta
tcgacccctc tttaaactca aaactcaaaa 840ataataatta aatagtaaca ggccgacata
ataacccaaa aatatctata acgattaaaa 900atagtaacac actgacccta cgtggaaaca
agaaagatat taaaaaaact catatgattc 960gtctctataa aagatccaac tctcacaaat
caaacaaatc 1000341000DNAArabidopsis thaliana
34ttatccaatc cttttcaggc cattgactta ttttctaatt aacctaatta tatatataag
60aaaaagctaa ggcatcttat atttatatat acatatatca attgattgct aataatgttt
120tgattaactc tgcatatggc atgttttcca gcgatttaag ttcacaaaga agattttcgt
180ttttgtcatc actaaaaatt atctagttct ttctactttt tcagcgtaca ataatctttt
240tttctatcaa ttggtacgtt ttttattttc tcaattttaa ttttgttttc taggtacatt
300tttaatcgaa ttgcaccaaa atatccaaac aaatcaataa aacaggcaga atgaggtatt
360accgtattag ttaaacgaga agtggctact ttcctcagcg tgattaaaat gcttatgggt
420tcagtttcat aattctcata tacggaacta cgcatttgac ctcatgctct ctataaatat
480taagatctga tgttgagagc ccattagaac taagtttacg aatgagccca tgatagttta
540acttgggcct aattagtgtc taatctgctt ctaacgttac cgagtcaaac gcggtctaaa
600ctaatttgta gagtagtgta tttgtccaaa gcgtacccac acaattgctg agaaacttca
660aaccctaagt gtgtgtctag tattttgcaa tatacgacgc ataaagaaat atcctaacat
720tgacaacaac aaaaaaaaat cttagcggag atatagacgt gtgagacaag agctcacacg
780tgttgcacat tcgagacttg ttgataatga cgacaacaag aggttctaga atattcgtgt
840tggtttcatt attattagca aataagacca agtcaacaaa ctaacttaaa cattgtggat
900cgtccacttg tccgactctt acgtcaaacc catcattgtt ccacatatct ccacgcgtca
960atctctctat aaatacgaac aaatgaacac gtatcaattc
1000351000DNAArabidopsis thaliana 35catctggtac ggatcttcta gtttctatca
ataatcagag tacacaagaa ctcatcatct 60cgaatatgta ttaacgtcta tattgtatgt
ttcaattcag aaatcagtaa actctggtat 120tttgaagatg aaccggtccc taagtctcac
atggataaat ttacatcctg gtacgagaat 180taccgaagtg aaatgtccca agcgatgatg
gaaactgata aggttaagag gaatcaatta 240accaatgaag ttatccagag gtacaagcag
gtaagctctc gctcaatcca atgcaaagat 300ctgaccaata gatttcatga atcacatttt
ttgttgatta gtccatataa tgcttcttaa 360cacaactcat cggttgaacg gtttcttctg
tctgtatcag gatttctatg gcgctgcagg 420gtttgaagac agcaataaaa gtctggaaga
actctacccc caagccttgg cactctacaa 480cgtcgtttac gattatgcca ttcaggaagg
tgttgcgaaa tgtacatttg cctggaatgt 540tgcaggaccg gtcctgtgca aattttacct
taagaaaacg aaggataaat cagtagtggc 600ttcaacgtct gtgcttaaaa agcttttggg
ttgaacagac ttacttgtcc tgtttgttgt 660cttcatgtat cataagatgc gttagacaag
tatctgacct aggtaaccga acttataggc 720gaagctatgt ggtgtacttc ctttgtttaa
gttataactt aagatttgtc atctagtgta 780caagtaattc cgtgtgtatg tttgcttatg
gaataaataa agaaaactaa tgcttatatt 840taatttcaat taattgtacg tggcactttc
agagtccatt tggtgtacaa agctgtcttc 900ttcagtgtga cacctctctg catcctcaag
cttccatttg tcttttccag acctttcttc 960tttccattaa gtttcttcct ttgtgatcct
agaaaaatct 1000361000DNAArabidopsis thaliana
36cacataatct ctctctacta cccacaggta tattagagaa actacatata acatgcacat
60aagtctttat tagttctcaa gcctgtcatc ttgttcaggt gtttgaaacg ctggttagca
120agatgacccg gtcgcaagat tcagttgtga gagcagcatg ctcatcagcc tttgggttac
180tgctaagatc aagcaagtca acactgtgga gaggagctcg acttgacggg accgactcag
240gtagaaaagc caatgatctc gaatctgtta agaaataaca tgtgcattac ggtctgtgtg
300aataccaacc agttactgcc gaaaactttg ggggggtttg cgatatatat ccataggtac
360agtgtgcaat gagaaaccat ccaggttttt gtgcttgttg taaagtgtat tatgttttca
420tcttgtaaag gatattgtca ctcgtgtgtc atctctatta ccaaactcag gaaagaaaca
480ttcacacgaa attgttgtag gctgtggtcc tctttttatt gattataatt tttgaggcta
540gtgttggata tacttgtctt gacagagtcc aacgtttgat ttatcttcag ttagtgtttc
600atatgagagt ttaaacggtc tttgaagttt caaacacaca ttagattttg gtaacattgg
660taatttattt atgtggcctc tatgtttatt ttgatttttg agacttgacg atagatgtag
720ctatcactat cagtgagccc tccaagttgt tgttttgtgt atgtgaatta tcttcgtttt
780ctttatgaag atatgtttct aaacttttcc tgagaaggaa ggtcactacg atggtcacat
840acggacaaac taagtagcaa gcacaatctg tggacttaaa aagaggtgtt tggcaagtaa
900aaaactgttt acgtcatctg ttacgtcttc tgttcacttt atgttttact ctccacgcat
960cttatccttt ataagctcgc acaaatctta accaaaacca
1000371000DNAArabidopsis thaliana 37atacccaata tatctaaact ttaaaactta
atttggtcca acaaacgtca atcgatacaa 60tcatgtgttc ttctagctta tggattctat
ttgacgacat catgaagagt ttgtacaaat 120caattaattt ggatattaca taaactaaaa
atatgttgtt tttttttacg gatgatcttt 180taataactga agatgaaaat taattttagt
tagaaatgtc ttgaataata attaatgtca 240cagaaaagtc agaacgtacc aaagatttcc
gcgtgtatta aacttaataa atagacacaa 300gccaagtctt agcagtgtac tactactaac
acacatccac atcaatgtgt catctttgtt 360gtatgacaca agttagtatg acaatcagtg
tttgagtagt gatggtttaa atggaatgtt 420tgagaatcca atacaattgg cggtctgcta
atggccagaa tatctgccaa agttattttt 480tatctactgc ttataaatcg tacaacaatc
tatttagttc tctttagttt tgaccttgaa 540gtccatgaat cttaaggacc tcttctccac
atcattcaaa ttcacatatc ggaaacacag 600tactgcttga atgcccctcc catacaatat
gtctatccac tgatccacaa taactcgatg 660gtgcatgtgt gtgtgtgtgt gtgtgtgtgt
gtgtgaactt cctggctgta ataaatttaa 720taaatactta aatagacagg cgaagtcctc
gtccactttt ctacacctag catcaatcat 780caatgtgcca tcttattgtt tcaatttttc
ttaccataac tttgtccttg catgtaactt 840ttccaattaa atacaagtct gtcttttatt
aacccccgag aatttactaa atgatattga 900cataatctat tggacggaat gagttgtcaa
aaaacgagag ctgtcttttc ctgctccact 960gtcctcttat atatcccaca acaatctttg
ctacattttc 1000381000DNAArabidopsis thaliana
38taccttgtag agttccaaaa cattgtcaca aaatatttat aaagaattta ttttaactaa
60ttaggtcgtt aattgtccaa gggtttttca tagttgatat agttctgttc aaatatagcc
120atccttaatc gattcatggg atcgtaaatt actacttcga gtgttgtaaa aaaaaatgaa
180acttctacat tacaaactcg aatttaatgc atctggagtg atactataaa agtagggatg
240ctctcaggtc gcatttgaga gacacagaaa tgattttaat ggaattaata tattttcagt
300ttttcacaaa aaaaaattgt gtttataaca actgcagatt caatgctgat tttatgagtc
360tcacctatag aatttatatt tctatattca tagaggcagt ataggtgttg acccaacatc
420gaaagaacac ttcgtaaaaa attctttgga acaaggctga aaatttactc ccaaatttag
480ctatccgatg aagataaatc atttaccgtt tattaaagaa ttatcgagat tttagtccaa
540accaaaagag attatgagcc taagattttg aatttgtatt ggtaaaagaa attgaacgaa
600aatttcagaa aaaaatatta ataaattgaa cgatagagtt cacttactac atagtcaact
660agtgcctagc tataatagtt tcaaaagaca aaaaaaaaca aaatcggtta actacttccg
720tgacataatt ctcattttga tttttgaatc cagtctaatt tgaaaagtat attcaaaatc
780tttaaatcca ttaatgataa cttttataat acgttgacac acgcaattgt atatacaata
840ttcttgaatt ttaaatgtaa attctagaat atattgcgat caccacacta atcaaaatct
900ttgggacaac ttgaacccac atttgacttt tcttggtcaa atattttggc atcatgcatg
960atcttctcta taaaaaccaa aaggcctcaa cgacattcat
1000391000DNAArabidopsis thaliana 39ttcactgatt tggaccaaat cgaacgccag
gtcattctgg tgagaaaatc ttaaggctat 60ttatgccttt tctttcttct attcattcag
gagattacac gtggcattac ttgtgtggtt 120cgcatgtcgt tgtttctaca ttgcaaattg
tcacaaattg gtccataatg agtgtttcta 180gtacctttcc ctttgtctgt gtttatttat
gaataataat tcatgaatag aataaactat 240ttttccaaaa atattcatat atgtccgaaa
actcacatcc atattgtaaa tttaatcgaa 300tctaatatat tttccaaata atatattagt
ttttgccaaa atttaaacag aaaatcctta 360atacagtttc gaaaattctt tttctttttc
tgttgaaatc atagtcaaat tcatttattt 420gtagatatat atatatagtc aaattttctt
tttatacaaa ttgataagaa aataaatgtt 480aacctacaaa attaatgtat tatgagctgt
caaattatta ttttgacttt atgagttgtc 540atattataca tagcatatca ttcttaagaa
aaacaaaaac aaaaacaaaa acaaaaaaat 600tatcatcaat gatccgcttc ataaagtaac
caccaacaca attattttgg acgcctcaaa 660tccaaacctt attttcaata attgtgaaaa
aaaagaaaaa aaaaagctat attgtggtag 720gtggttcgaa ccactcaaaa atatttatga
taataacacc caaaaacaaa ctaaactatt 780atttatttat tgtggtcatg agatagttac
agacacagag aatacatctg tcagaaagct 840gactcattgt atcttatctt tcccatttcc
tcttaccttt tttcctacgt gttgccacaa 900aactctcatc ttcacccact caatctccgc
cacgtgtact ctctaaccct tcagttcggt 960ctttttaagg cttccccaag ccttcgcatc
caaatctctc 1000401000DNAArabidopsis thaliana
40tcatcagaat taaaaaaaaa aagtcagtct aaaaatatac cctaataaac tgacatacct
60atttcaacga tacaattcta taatgagtta ttaaattgat ttgatttcgt attctctaaa
120atatactcat agacatagca gaaactagtt aatttgctta tatgttcata ttcttgtaca
180tcatgtgatc tcacgtggaa taaaaataat gtgaccatct ctaattctct acttcactta
240gtatctaagt atccactgtg ataccttgtt ttctgctcaa agttgtacat gcgtagatta
300ataaatacat ttcacccctt caattgatcc caaaagcctc tacaagacag ccaaaacaaa
360tcaaggattg cgcaaagcat tcaagtttcg aaacaataat ttttttaatg gttgacaaaa
420agcaataatt tgtaaagcga ttcataggtc agcttctaaa tgttctcgaa ataactcgaa
480ccaacaccta aaccatttgt tctaacgata atagacgaac gtattagaga ctcgttagga
540aaacactact tgtatatatt cacttttaca tacatatttg tttccttcat atatttattt
600atttacataa caaaccaata tgaaaaacgt acgtttcatg aaagagaacg ttgtagaaca
660ctatcaccgt ttgcttgtaa gaaaaaggct tatctcacct ataaactcgt gagtcatatt
720tatactctta acaataataa ataaaataaa actctatttt ggtaaccaca aaaaatattc
780tgaggctaaa agcgcgtctg ctggaaagtt cacgcgctct gctaacccca ccgtccattc
840tgactctttt agatcttaac cgtgtcttta tccacatcat acgtttatac acgtgtcaca
900cgatcaccta ttctataaaa aacattattt taatcttatc ctttttatgc ttcccctata
960aattatcttc tcttcctcct tctccgatct ctcctccgtc
1000411000DNAArabidopsis thaliana 41gactaattgt cagtattcta acaacgttaa
tttcgtggtg cacgaaaaca aaatgacgat 60gtggttgcac aaggtccacg ttcttgtata
ttgtttttta atgtgctaat tgtgcgctaa 120aataaattta attgtataaa attagtatag
tttatatttt tttaatatgt atatgttcat 180tcaaaacatt tggataatat atgtttacta
tgtgatagac tgatggtgga acattttatc 240catcttatat gcatatgaat aaaaattatt
caaaatgttt gttgttgttc agattattaa 300ttattaaaga attagaaaat tttggaatta
caggtatata tgtatgtgtt gtatattgta 360tatttaaatt attatgtata tttatattta
aatttttcta ttttcataaa tcgttagtat 420ataattgtgt cacatctaat attattagaa
gacaagttgg acttttcata ataaaataaa 480acatatcttt tatcaaatta accttttcct
ttaggtgccc aaatccgtag caccatcaaa 540tacctaagat atacttttta tttccaaaaa
tcctttcgtt tgatctaata ttaataatat 600aaacgatcaa acaaaacgat ttttagaaat
caaacgttca agtttgatcg ttggtcgtcg 660ctaatttaag tttgatcgac tgctcatgtt
ccgtgacaat gttaaaactg aatacatgaa 720acatgttttt tttttgtgca tttgatttta
ataatttttt aaaaactgat aacacccgaa 780aaaaacagca agaaactaaa caaggaagaa
aagaacggtc ggtgaggaac tctcgtgaaa 840gtcttcaaaa gacttttaac ggaaaaacaa
aaaacaaaaa acaaaaactc ttgtgaacgt 900aagtgcatgg accgaattgc acccgatcac
gtcggttcca caatcctctc ctatattaag 960gccttcactg ctccttatca tctcaaactt
ctcatttatt 1000421000DNAArabidopsis thaliana
42aacaaaaatg gatttgatca agtgaaaact ttggtccaca tactccactg agatatttac
60ttggcttaca acatatgtca caagatatat attccaaaga gatattttac ttggctcaca
120taattcccaa gatatttcat atgttttggt tttgttttcg atgaagataa gatatttcaa
180atgcttctcc acctaaaccc aacaactttt gagaattcca aattctgatc aagatagaaa
240ttttttatat tgtttttcga attaattgtt tttcatatca ctacaaaaca aaaatattat
300attaattaaa gtataaaaaa tggtactaat ttaaattact caaaataaat gatacacata
360tacatggtgt tatatcaatt acaataattg acactattat tgaattgttt gacattgatt
420taataaaatc ggtacaaatt cacattaatt tgcatctttt tcaaaatatt attacaacaa
480catttaaatc actatgttct taatcgtctt ttcgtaataa taattaaatt ttgaggtttt
540tcgttttttt agataagaaa aaaataatgt aaaaaaaatt gacgagatca tattttggtt
600gatcacaagt gaggaaaatg aagttattat gtaaagaagg attctataat cgattgacca
660acattcctcc aaagaaatat attttatttt ataaaaaaat ctaataaata attggaaata
720aaatcatcat tcaccaaact ttatactgtt ttacaaatct acgttttagt caatttcttc
780cccatacttg acctcatgaa gcaatgaaca caaatctatg tttcgatttt accatggaag
840agggtcttag tgatatgaat ttaaatgtcg ggataaaata taagattata aaactataaa
900ggtttatttt gaaatacagc acacacaaga aaaaaggccc atcaacctcg cagtcctaat
960tagataaata cacgtcaact tctctgggaa caaaaaaact
1000431000DNAArabidopsis thaliana 43agttaaaacg ggcttacgaa accataaaaa
acatgaacat ggatgctgat tctgtcttat 60ggagttctgt tctcggaagc tgtaaacttc
atggagattt tgtgttaggc aaggaaattg 120cagagtatct cattggacta aacatcaaaa
actcaggaat ttatgttctt ctttcgaaca 180tatatgcatc ggtgggtgat tatgaaggtg
ttgcaaaagt gaggaacttg atgaaggaga 240aaggaatagt gaaggagcct ggtataagca
ccatagaaat cgaaaacaaa gttcatgaat 300tccgagctgg agatagagaa cactcgaaaa
gcaaagaaat ttacacaatg ctaagaaaga 360tcagcgagcg gatcaagtct catggttatg
ttccaaatac aaacacagtc ttgcaggatc 420ttgaagagac agagaaagag caatctctac
aagttcacag cgagagactc gcgattgcat 480acggtctaat cagcaccaaa ccgggaagtc
ccttgaaaat attcaagaac ctacgagtat 540gttctgattg tcatacggtg acaaaactga
tatcgaaaat cacaggacgg aaaatcgtta 600tgagagatcg gaataggttc catcatttca
cagatggttc ttgctcttgt ggcgatttct 660ggtaaaacca aacaccacac attgtaattt
gtacgtgaaa agaagatcaa agtgaacaaa 720tttagttgga ttatctaaat atcattcttt
catgggaatt atgttgaaca cgtggagggc 780attatttgtg aagttgaaac gaggactatt
tttcaaactt tcttgaggaa caagacttga 840cttttgtaac cataaagata gaatcaagtc
agattttgct acttcgagaa cattggtcaa 900tgagtcaaaa tgtcaaatag tcaatactaa
aatcaaaatt gaactttgtt tattaatttt 960gttctaatct atttaacccc acaaaagttt
ccaaaactgc 1000441000DNAArabidopsis thaliana
44acgaatctcg tattcattac tatactgcta atgaatgcat caattttaat ttttattggt
60tatctatcct taaaaaaaag ttataagaat attttgataa tttgttccaa gatgtagtac
120tagtagtagt acttggtctc gtaaacgacg acgccgttta ggaaatcaaa cggccaaaaa
180tgcttccact ttgcgaaatt ttattggtgg gaaacgcaaa tgtatcgata atgatttaaa
240gaaaaagcgg cgtttctaat tggaccacgt tgcccaatct ctacggctca cgcacgatct
300gacgtcaaat tggcataata ctataataac cttatcgaga ataaaataat agtcgcatcc
360aaacaaaaga aaccaaccct tttatatcac gacgagtgtc acgccatgga cacttgtcca
420tctctctcgc tctgacccct ttcaggtatc ttttatttct caaagaggaa ttattgattt
480tccatttcca aagaaaaaaa ataaattcga aggtcaggaa aattaacaaa aaacttcctt
540tttttttttg ttagtttgtg tgactgagct gcttcatttt ttttctttct tttttttttt
600ggtttgatga atcgattttt gttgtctatt actgattggt tttcttgttc agattcactg
660attcgaagag aatcatgatt tttttttccc gctgaataat aagcatatga ttgggtgttt
720tggagatttg tttactgatt aaaaggagat tcctttccat tttcaccatt tgctctgttt
780gacttcattg tgcttatatt tcatttagat cttttgtttg ggtttagctt tggaactgat
840aaaaatctga ttttgtctca cggctttgga tttggttctt aaattttggt actttaaaac
900tggataaaga tcagtgcttt tttagattct tcgtttgttg atgaatttat ggatgtatgt
960ataattaaac cataatctct ctgcttgttt gttttcttat
1000451000DNAArabidopsis thaliana 45ttgaatttta tgtaaagtta ttctgttttt
aggactttta tcctcactga attatgttgt 60gttaaagtag tagcttatat ggtcacatat
ctcacacgtg tttccatttt ttttctatct 120aagttttcgg tttttacatt gtatacaaaa
ataaatggca tgccttgggc cttggccact 180gattcacatg atcacgtaga gtttgtgtgt
gattcaaatc ctagatgaga agatcatcca 240attgatctaa gatgtgtctt tcttatatca
cacgttttat tttaataata ctcttcagtt 300gggtttgatt ccatttatga tttatctatg
aaaacagaat atgtcctcat gctgattatt 360tcgtttgtgg ttgcatttta ccatctgtag
ttttttttgt ttactgacaa acaactaaga 420ttttatgaat tcatttgatg ataaatagtt
aatcaaatat attattatga gtgactcgat 480ggcgaaagct ggtattatct tatggctgct
aaagtctact tttagaagaa gaaaaaaaaa 540gggtctgttg cattaccctg ttgttgcgac
tctcctacaa taactctcac gtggcccgtt 600tcttggtttt tgccctattt tcacaaaagt
aaacagagat tttcactagc aaattgggcc 660caattggaga caacttcatg tggtttgaat
cgtatatttc agatagatta gacatctagt 720cttctttgat gtgcggtgag gatcttaagt
cgctataacg ttgtgaaaaa agaaaagaaa 780aagtcactat aacatagatc gcaactaaac
aaataaagga cactatacat gtacgtaata 840atattcattt attttcaacg acaatgttcc
ctagggtggt aacccaagta tttttcatag 900atattatact tctagcgaaa agccatcaat
tgagaaaaaa ggcctccacg tttgacccgt 960tttgcctata aaaggattct ttaagatatg
gaccttactc 1000461000DNAArabidopsis thaliana
46caaagctcct aaatgatata gcttcaagca aagattcaca gagactatca catgcgagta
60tttaatcatg cctactataa aagagaggac aattagctac ataaatcaaa cgctataatc
120aataaagcga tggataatac cgaaaacgac ggcataatcg ctgttgagga catcaccgca
180agtgccacca ccaatagggc aaaacgcatc ggagccagtg accttgaggt aagtcaaata
240cgcagtgtct aacatcccaa taccgccgat tccggtgtac cagttgtatg tcgaactgtt
300actagtagat acttcacttg tagacgatga ggaggaggaa gacagcgacg gtgccgaatc
360ctcaccgttt tccggctcag atgaagaaca tttaatcgga atcgcaggaa atctccgatc
420agagacctaa ttcagaacat ctgacttcaa atgaccgatc acacactact gtcactgaag
480aaaacacatg acaagaattt acctcgaatc gtctaggata agaagttacc gacggaggag
540aaacctcacg gaagccgaaa tgaaactgac aagacgacac agaaacaaac ctcgccatca
600tcgtcgtcga agaatcgcta ttccaccatt gttaacttca tcatcttctc tcctccgctc
660cactatctca gccacttgtt tcaatattgg gcctaaacga ggcccaaata cgataatttg
720gatccataat tagtttatca caattgggcc ttacggtctt cttatccgtc cgagtcagca
780agttatcctc ttcaacacga ctcacgaggc accgagttgg acttatcatg attcattatc
840tacgcattgt ttacattttg gaatcgactt tggtacataa ttgaatttat ccatagaaat
900gatagagact cgaagatcgg acataaaact tgctgacgtg acacccatac ctttgtgtct
960gtatgtatta tataaacgct atagaactaa gagaagaagt
1000471000DNAArabidopsis thaliana 47ataccctcgt gacctttatc gaaggtttcc
accttcacac tcaagatact tctcctagga 60tcttctctac acacattcct gtagggtcac
tccctgagtc agttaaggac tcgtcctgta 120aagttgtgta ttgctgcagg aacccgaagg
acgcgtttgt ctcactttgg catttcatga 180aaaatttgat tgttaaggag atggttggat
gcacaatgga ggaaatggtg aggttttttt 240gccgagggtc tagtatatac ggaccctttt
gggatcatgt attacagtat tggaaagaaa 300gccgagaaaa cccgaaaaag gttatgtttg
taatgtatga agagatgaga gagcagcctc 360aggagtgggt gatgcggatc gccgagttct
tgggatattc ttttacagaa gaagagatag 420agaatggagt attggaagat atcataaagc
tgtgtagtct tgagaatctg agtaaattgg 480aggttaatga gaagggtaag ttactgaatg
gaatggagac taaggcattt tttaggaaag 540gagagattgg tggatggaga gatactttaa
ctcctttgtt agcagaggaa attgataaaa 600ccactaagga gaaactaatt ggttctgatt
ttagattctt ttgctaagaa tctgctttac 660caacttagtt ctcaatttct tgagcttttt
tttttcttct ttttattggc ttaatacaag 720taaggttgtt gacttgttgt gatggaccaa
agtcaccaaa ctggtctgtt gagaaataaa 780actattgttt ttgagtttta gacataagag
tcgtgattct ccttctttct tgttttgtta 840tctttaaata ttatattcta cgtgctcttc
aagtaaagga tgtgttgacc atttcaccac 900ccaaactaat atcttagaag caaaagcatt
gtgaaattta cttgtagaac aaccttctca 960ttcatatata aaagcagaca agcattagca
tcatagatac 1000481000DNAArabidopsis thaliana
48cagttgatat tcataacatt ttgaaagaac ataacccaaa ttagtagtat atttctattt
60tagttcgttg aagaaactgg atttatttct cgaaatatta ctaaatcgaa aaaagaaatg
120aaaacgtgca catgttgtta tagtgtttaa cactgataaa tttcagtttt accttcttcc
180tttttttaag gagtctagca tgacaagaga tctttgttat cattcggaat atgaacgtat
240tataagaaaa tgattgtatt caaaaataga ataataaata aataaacgag aaatgagctg
300tgctttctag aagaacatcg atatctcttt ttcgtatttt tcgtctttaa ggctggaaac
360aaacgaaggc gtactcctac gcctcaaatc tttcatgcct tttggctttt cccaattaag
420attatacata ctatttagca aacaaaacaa tggtagtagc aattaatcca acaaaataag
480aatttttaga aacgtaagta ttcctccaaa aaagataaaa taggtgatta gtttaccttg
540ccagacaact aaataaaatt aatgtgaaac ttaagataaa aggcagacga gattttttag
600gttctgtgaa gacaaatccc tttctatttc gttttgacat gcatagaagt taacacatac
660tacggaaatg ctaaaaaaac gagatagtgt cgccatgata actgcaatgc tgctgaaacc
720ctcaacttta ttaatgaact aatgaagcca aatcattttt attttatgat taaaaagttg
780ccaagattcg tgctcactct gtcaaagaca acaactttat catttttcaa acaatagttg
840attaaactaa actaaaaatg ggtatacaaa atgagttaca acgataaaga caacaacgac
900tttccctttg tttgattatt taaaacaagt ccgtgataag aagaagatcc cactaatgga
960aaataaaatg ttataaaaac tggcggcaat acacaattgt
1000491000DNAArabidopsis thaliana 49tggatgggga tggcgtacta tgcatggctg
ttgacatttt acccacagaa tttgcaaaag 60aggtatgtat gaaggttaca gttatagtac
ttaagattaa atctaaagtt aaaaaccttg 120tattgagtgg gagttcttgt gtcctgaaaa
aggcatccca gcattttgga gatattcttt 180ccggatttgt cggtagtttg gcttcaatga
ctgaaatttc agatctacca gcacatctga 240agagggcttg cataagctat aggggagaat
tgacatcttt gtatgagtat attccacgta 300tgaggaagtc aaatccagag tatgttctgc
ttcgagcgtt acttcatctg aaatatttag 360gcctcttctc taaactatgt tttcatcttt
acccacttta actgcagaga ggcacaagat 420aatattatcg ccaacggggt ttccagccag
agaacattca acatattggt tagttttgat 480gaagaaagta tatataacta gtttccgaat
catatgattt aagctaatga attaagaaaa 540tatatagttc aagacttatg attcatatct
ctatcaactt tttgaccaaa gattgatact 600ttttcgacat ctgtcacagc attttgtgat
gattttgatt gagacaaatc atttgtaggt 660atctctgagc ggacacctat ttgataagtt
tctgataaac gaagctcttg atatgatcga 720agcggctggt ggctcatttc atttggctaa
atgtgaactg gggcagagcg ctgatgctga 780atcgtactca gaacttgaag taagtttctt
tctggataaa acctaatcat tcacatggaa 840caactgtcaa gagtttttaa tgtcacgttt
aggttcaatg tccttttcac taagtctcgt 900aagtttttaa aacaagtaaa caaactacaa
gccaaaaaca ttctggcccc acattaacct 960attcccactt gttaaagaac ccatcttgca
ttatcttggt 1000501000DNAArabidopsis thaliana
50gtaattttta tttaagtttt ggtttaggtt tggtttgatt aaaaaccgta aaaccgaacg
60ttttttttgt ttttgattta aattaaaaat aattgtatat atatatatat aatgttcatt
120tgataacatg atatctatca aactatcgaa aaacaaaacc ctaactgtaa cctaaactaa
180aattctatat aaattacatg ccgtcattta ggatttgagt ttacaaatta gattttgatt
240ttatttatgc atcacactta taattttttt tggtaaaaac atgaaaaaac cggaaccaaa
300ccggaaccga tccgaaccaa aatacatatg gtttttaaat ggttttaatt ttttaaaacc
360aaaaactgta aaactgttaa aaccgaaccg taaccaaacc gaattttata tggtttttat
420atggttttac ttttcttaaa atcgaaaaac cgtaaaacct aaaaccgaaa cgaaaccaaa
480ccgaaaaact gaacacccag cccttaaata taatgaaaat cgaataaatt tgtttgaaag
540aatcgaacaa aattgacaat aaaatctaat taggactatt ttcgtctaat tttgacttag
600ttgaaacaga atattagcaa aaatactaaa acaccacaac gcgtaataat acccacacac
660gatatcatta aatttgacca ataagaatct agctcttggc gaccacgcaa gtatcttcca
720tcttgctctc caagaaaaat ctacaccggc tttaaattta cataaacacc ctcagtcaaa
780gaaaagtcgt aaacatagtc tctctcatga ccacaagggt aacacagtca tcctaaatat
840aaaccacaca agaaaactgt tatactttat acacgtgtca tagtctcatt acatctacgt
900gaagagtttc gatcatcaac cgttcgtttt cttactatat aaaccttgct cgagacctgc
960gtgtgaagcg tataaagacg acaaagtaaa ccaaaaaaaa
1000511000DNAArabidopsis thaliana 51acgaaacaac ttgcattaaa caatttgctt
ctactatact tatattgatc tatcgtgggc 60ggtgagtaaa atgagttggg acatggattt
gtcatttagg tcatttttag taaatgtgaa 120acttgtaacc aaaatatgca taaaaacttt
agcctttcta aaagctaaaa atggtttgag 180gaacaccaag acgtaagctc ttttaagcta
tttgatgttt gatgtgataa aataactaaa 240aactatttgt tatgtttttt cctcccctta
aagatataag agaagagaag gtaaagaagt 300tgtgtaagtt agtcacaatc aattgttgga
aataaatttg aaaactctgt taaaagtcac 360ccaaatgagt ttcaaacaag agttaaaaga
ttaaaaaaaa aaagtaactg atataaagaa 420tggtagttta cttttttttt aaagaatggt
agttttttaa aaactattta ggttgatggt 480atttttgtaa agaaatccaa aaattgtagt
tgatacaaga aaattttgat ttgtgttgat 540ttatttctct tttggcatgt gaatgtaagt
aatctttttc cacatgattt ctaattctgg 600acttgcatgt cttgttacgc cagtcaacat
tcgttggata agttacaatg tcatttacaa 660cttgcagatt aattcggaaa tcaagaagga
aaaaatctga attagaaata gtaggttgga 720tttttcacaa aaaaaaaaaa agtacaatat
tttgagatat gaccattaga ttcttaaatt 780ttacgattct aacttgtaag atttcaaatt
aaaccatgta taattcaatt taggggtaat 840ttaatacttt agggtagtaa cgataaaata
gataattaaa ttgaagttta aggaccatcc 900tgtaaattca agttcggtac cggaaacacg
tcagcattga tctaatagat acgagtcctc 960ctttctctat aaataacgta atctctctcc
cgtcttcatt 1000521000DNAArabidopsis thaliana
52aatcctactt tttcaaattg tttttttttt ctttgtatta atgctcccaa gatgtttatt
60gaactgaaca cactagaatg aattatagct tttgtaatca gtcgagtaga ctagtttttt
120tttttccgat ttaattttat taaaatgtca acataacaaa cttccaaaca gaagtctttt
180aagaattttg gaggctcaaa gatttttttt tctcgatttt tttttactct aatttggacc
240atttggtctg ggaatacatt tagaaattga ttcatctcta tctcccgtat atattttaat
300atatttagaa aaaatgctaa aataaagaaa agtagacgag cttataattt tatatattaa
360ataaaaactt aaataaaagc ttataatttt atatattaaa taaaatcttc accttaaaac
420tattaaaata aataaatata tatatatata tatatatata tatatatata tatatagttt
480tgttatcggt attaaagctc tattttgtag tcgaaagaat aaaatcatat aaaacaattt
540tgctttagct ttatatcgat tttattttgt gactaaatac tagttttgca aattactatt
600cattatgtta atttaaaata ttttcaaata gaaataaaat aattattaat attttttaaa
660aagatataaa aaaaatcgat tacaaacaat gatataacga gaaaaggata gcattaaaaa
720ttcattccta acaattttgt actcaattat tttccatttt tagaaaaaca actgcttccc
780aaatatttga tcttagaata tttttccaag cttttaaaat ctttttgaca gatacgtggc
840acataaaact aggtggtgac atagatatcg acgtgtgtaa cgtttcttgt gtcctaaatc
900aattaaaaaa gcctcacatc tctttgtaca ctacaaagaa aagatcattg attcactcac
960aactttcatc tatttatagt ctttaaccca tttcagtatc
1000531000DNAArabidopsis thaliana 53tttgaaaacg ttaaaaatat aactaaaaga
attattgtta taaaatataa ttactcccaa 60ccagaacatc aaactaaaaa cagaaactca
tgagaaagat ttgatctgat taatgacaat 120acctcaggcg atgtaaaaga catcattgta
ccatcaggac cgggtctagt cataacactg 180tcgtcggcat accacatcat ctgaactacc
tcgggataga ctcctagatt caggaagtaa 240gattggagaa agcaattggc taatgcttcg
gaagtattta agttcgtgaa gtgggattga 300ataaagccat tggctaacaa tacggaagga
tcaacatctt tgatcatgga atccatatct 360gcaccagtaa ttatgcccta ggggagacga
agaaatatcc aaagtaaacc ctagctagat 420atcaaaatct cttaaaaaca aacttatctc
tctgttgaga tcaaccgctg atctataggc 480taaaggaatc accggaagag agagagagag
aggaaagagg cggaagttgt gacttgtgag 540acggtttgga ttttggaaac ctacgttcca
ttgaacttct ttatatattg gtttttcatt 600atgggatata gtaatatctt tataaaaaaa
aaaaaaaaaa ttatatatat atatatatct 660ttaatacatt attaaattaa ataatatacg
gataggaatt actataaatc tttataaaat 720aaaaataaaa ggagaagtcc cttttatcta
atttggatgg ttaatttgtg aattttccat 780atttatctaa tttcgatttg aaacatatcc
aacgttaagt cagattcttg tggtgaaatt 840attgcatcat aacataaagt catgagagta
ggagagaaag taataagtgg caacatcaaa 900acgacgacgt ttttaagcta gtgcgtcagc
aacagttaaa ccatatgaag ttaatggtct 960gtgttaatga agacaaaaac tatatcctca
ggttagcgat 1000541000DNAArabidopsis thaliana
54ttataccatc gtaatccaca cgtacacatt gtttttaact tttgacaaaa taattgttta
60gctacctaac aaattgacat attgtgtggc aaatttataa ataaataatt acatatatct
120cactgagacg ccatgcagat tcgtatgttt atacaatgcc tcacccattt cttgaattct
180ctcaattgat cttcccatga tgttcattta ttatttgctt tatgtatatc tttgtagaat
240ggtagtatta ctattaaagt atcaaattac catgacgacc tatagaaaaa tgtttttaga
300aaagcatagg gatatgtgtc ttatggacag tggataagag agtggtccta aaaacagacc
360ataaattatg ggatcttccc ttagttcctt tattgttatc atattttaat tactcttatt
420agtatttttt taatggtcaa gttgtttatt ggaattacgc atttaattat ttttctaatt
480cgtatacttg tgagaaacag aacagcacgc tttgcttctt ttactagaca aacccttaat
540tagcgatatt tgaggcagaa aaaaagacaa ggttgcgtgt gttttactcc acaaaagaac
600tcctaaaacc aaaaacgtta aaacccactt cttttttttc acagtattct cttttcatgt
660caagaacaat tactgttttt cttggtatgc aatctgtaaa tttattgtag ttttcatttc
720tacttttggg gtttaggagt aaaagtggag tcactctact ttgctttacc tttttctgag
780tttttttttg ttcatacaaa tccaaatttc cttttctttg tattcctatt ttttctcttc
840tcgtcccttt cagaatcttt agagagagag aacaacaaat tgagaaagaa aaaaaggtaa
900ggctgtgtga gagagaaccc accaaagcgt gcacgagaga gagaaaaaaa aagctttgat
960tttgtataaa atcccaccac tgctctctta ccataccttc
1000551000DNAArabidopsis thaliana 55tgtgaaaaca gccaaaagct acggagagcg
acactcctcg acgacgtcca tgtctcttta 60gattgtgatt gcattgttaa ttatcatatt
tatcaattga ttagttaagt cttaagtatc 120atcacacaag tttatacatt aatctataca
gtgtttaaca ctacaaaact ccccaaaatc 180acgtgtttca tatcatacgt acatttgtct
accgtcttta ttaacaatat taaaacagaa 240tttatcaatt ttctagttga atagactata
tactattata gagactctta taaattatga 300aactcctaat caattaaaaa tagttaattt
gaaacatatc actcgaaact agggtattag 360ttggattgac gatatttttc ttattaatat
caacgagaag aatcaagtgt cgttagctca 420attggtaaaa aatttaggca aagcttagag
atgtctattt gagtgacgct tggaacaaaa 480attacatgct atggtttcag gcctagagaa
atgcttcgat ccagaatctc ttgatattca 540aaaaaaaaaa aaaaatcaac aagaagaatg
tataaatggt ttctatacta tctaagttat 600gaaaaatatt tttggttacc caaattacaa
gatttgtacc gaagtgtcac actagattca 660catgcatgca tagttataat taacttaaaa
cataaatgac ttacaaaaat acttacaaac 720aaaaaccaaa ttaagttatt tagtggccag
ctttgttact agagttttag ccattatcat 780aattcacaat aaggtgtagg aaaaaacatt
aatatatagt atgtatacat ccaaaaaaaa 840ctttttggaa aaaacaaatt ttgggattat
aaaggacacg tagcccaatc gatgaacaga 900ggacggcacc gaatcaattt gacttcacct
tttttcccca tatgataaga gtctcatctc 960tttcctatat atatttcgta cctctcttta
aggaccctac 1000561000DNAArabidopsis thaliana
56aagactatgg cgtgaagcgc atgagctgta tagaaaaatg caaaaagagg atgggcttat
60gccaattaag tagacgtata aaagaggtaa atatgagaca ccttagagat agcgagttaa
120gggttcacca tcaccacaga tgcgtcttaa ttttaagttt ggctgttgta tgatacgaaa
180ttggacaaac attaagctag gaggcatgat ggatttaatc atattttaga tgggaatgag
240aggatattaa ggtaccaacc ctcaaggcat attgtagact ggtttggttt ggacatgtga
300agagttttga agctgaagtg tttggtcact ctagcattag atgttgcagg tgtagttttg
360tacatacatg taaatgatgt ctctttctta cgcacatttg ttgacataga gaagctaata
420tttgcttaag catgttgtaa gctgtaactt tagaatttaa cctccactgt aacatattac
480atatgcaatc aggcaaacga agattcgatc aaacgggtca ttgcaaatcc agaagtgtga
540gaagattgac cattaggaaa cttataaagt ggttcagtct tttacaaatt aaatttcctg
600tttcaaacac caatcaaatc caccgaatta ccaaagattt catgtttgcc tctatccata
660atggttggaa aattttcaat tttttaagta atagtttgat attgtgggga aaaaatacta
720ttctaaacgg tagataggtc ttacacgtga gcattccgta actggacaag gatcaaccaa
780agagaaagca gggagatcca tcacggaaca ttatctttgt aatcaaaatc tgatcgtaca
840cgtgtacact atgatgtcca ctaacagtcc acatctgttg acctgacatt ctcagacaca
900cgagaggatc ttttttcttt ctaacctcta aaatatttta attttaaaat aaaataataa
960aatccgtata tgtgtctatc gttagcttct acagctcaac
1000571000DNAArabidopsis thaliana 57ctaatctgat ttgatttttt tttattttgt
ctacgattct tgagttacga aatgttcatc 60atttgttact tcttgttgtc gggattgagt
agttgcattt ttttagttgt ggagctgtga 120caaagaggcc agctagtgtg tggaatgtgg
atcatcatcc tttggttgat tccttgtaga 180atcctttttg taaacattta aactataata
aagtgttatg attaagaaaa ctagttactt 240atttttttag tcatattgac tctgagccat
aaactccata aaagtcctcc tcgattcaat 300tcggaattta aacagtttca gttgaactct
gaacaagaaa aaaaaaggaa aaagcaaaag 360gagtccgtgc aagataactc tcattttatt
ctctttgttt ttgaactatt tattttgcac 420aaattaggat tgatattaaa attttaatta
aatcaaataa taactattaa ttttgcacca 480attaggatta atactaaaaa cattaaatta
aatcaaataa tagcattttt gtttagtaaa 540tataaaattt aatctaaaat aattaaaata
cacaatacat taaaaaaaaa aaactcaaaa 600ggcaaagcat aaaagaagaa acagagtgga
ccatgtgtcc gcgtggaacc aaagcttcca 660aataactcga ccgtttagtt tattggaaac
gaaaaaagaa aaatcatggc aacacaacgg 720ccataaacat ccatcacgcg tctactaaac
gaacacgcgt acggtccaga ttctcaaatg 780acagctgtca tagagccgtt agcgccgtag
tcaaagggtt taagcccaac cgcgtttctc 840tttttcacct ctttcttttg tagcataata
tctcagccgt tagatttaag tcggtcctta 900ctttttcata tcaacggtca cgattcacta
aacacctcaa tacagtgtac cccacttctt 960ctctctatat aaagaacacc atctcctagg
tttcgttaac 1000581000DNAArabidopsis thaliana
58aaatttcaac tataccatta atacatgtat tatattctag aaaccacggt tcgattcaaa
60ttttgaccta acaaatatta atataatgtt tggatgtttc gtaatattgt aaccattaat
120ggagatgtcg tatatcattt gagttttttt gttaaccatt caaagtacac tataattaat
180ttagttactt taatataggt ctattttcct tgcgtcatag tctctatcgt ggttttgtcg
240gagatatgcc acgttctaaa actttggact tctgcgtcgc caaagcgttt gcaacacgtc
300aaattaggaa acacaaaagc ggcttagata gcaaacgagc gataaataaa gggtcaagta
360agccaaacaa tatgtaataa acgaataaat taattaaaag aaatgtttga tgtcctatta
420attttataaa ttcttttttt ttgttttttt tttaatttta taaattcctt atacacaaat
480gtggtttttt caggaaagtt gttcaatgtg cttattacaa tattaatcat ttgatgatac
540atactcacag ttacactaac agttcaatat cctgatggag cttatcacta aaagagaaaa
600tctatactac agtattagga actcgaaaca atatcgtcat cctaaaagag acaaaatagc
660ctgactaaac atttatattt tcgttaggac aattctcaaa aaaagatact atatattaga
720ctaaaaaaag gaatgaaaca gtgacataca gctacactaa agacaaaaaa ggagataaaa
780taatatctcc gtagacacat aatagtataa taccgtgtta cgtcgatcaa aatgtgatat
840aagaaatcaa attaatacta aaagcgatat gtttcgggac agattggtcc ggacccgcca
900cctttccctt tctatataca cacacgcaat acccaccaag aacacacaca aacacttaat
960tagcaatata aaaagcacat tcacaaactc tttccaacac
1000591000DNAArabidopsis thaliana 59ccgctataca cttgtgggaa aaggcttctt
gtagtacagt ttcgtcacgg cgtcttatta 60attgcaaatc tgtgacgtta cggttttcga
ttcgggttac tattttaatg gaaatctttt 120gataatagct agagcctaga gtatgtaacc
ctaaggttag atggtaatac cgaatctacc 180taaatttttg ttattttcgg tttttcatga
gatatataga ctgcgtaatt taaaatctca 240aaatttggtt tcactttatt tttatttttt
tttagtttcg aggtaggtat acaagcatgg 300accttttgac attatttaaa attttcataa
tttagaatct tcatgttaaa cgtgtatggc 360tacgatcttc ggaactttat tcggtttgta
tgctcaaagg ccaacgccat ttattttcta 420attgattttt tcccgttacg gaaacaatct
ttttgcatta ttgaacttaa ttaccaaagt 480ttatgaaaag aaacaataaa gtctaaattg
gcctattttg atatactgat cacttacgcg 540aaaacatcta cattgttatc tactagacta
gtcattaact cgaagataga atttcatttc 600ttgaccaaaa aaaaactcaa aagatagtta
taatcacaga ttcattattg acttttgctt 660ttaacactgt agccatttat gtactattta
tacaaatacc taacttttca aatttgtgat 720atttttcacc ctacatttta gcctataggt
tggatcattt tgtcatattg catcactgtt 780tactagtacc aaacttaaat atcacacgag
cagtaaattt gctatcaatt ttttcgtttt 840cacaaacata tatctagttt ccctaaccaa
taattaagtt aaattgcatt aggaatttag 900gatttataat aaatagatat ataacccact
aacgacaaaa gactatgtct tgctacgtct 960ctgcctttat tatcatgcca caagaaaata
agaggggaac 1000601000DNAArabidopsis thaliana
60actcctatga aatcaagcct tagagacact attgttagcc tcaaggagaa atgtctttta
60tgattctgtt cgtatatttg taactattat gtgtattttt attttgttag tattactaat
120tcaagtggtt taagttgttg agactcttta aaatctaagc attttataaa caataatata
180taattattgt ttaggctaaa tttgtcacta attaaggttt ggatacatag tgtctaaact
240aagctaataa tatcacttaa cgtttacttg taacgctagg tgatgatgtc gtcaagtcaa
300ttggtacaag gaataaacga gtggtcatat gacattatga ccatatgaat tcaaactcca
360gtaatccaat ggtaattgga ttcaatgatc aagacttgaa ccacgtaatc cacccttatc
420cttagaagct cataaatatc actaaaggga caggcaacac ttaaccagta gttgtccaat
480aatttagttt tccaaaatga aaaattattg ttgtcatcta ttttaggtgt tttagttcaa
540tgtggattcc tcgtcctaac aaatacttga cgaatatatc tagactataa aattggttat
600gagttctact tttttttgtt tgtgaaatta tcaaaatttg ttatatttat ttatttattc
660tcattaattt gagtactaat ttttaaatta tttatactaa aaacaattac taagatacaa
720aaatggataa gagcatggtg tatagatatt taatgggata gaatatttcc cataattgta
780tgtgtgtgag aggttttgtt ttcgtaagga aagaaacaaa aaccatttga ccaaagaaaa
840gcaaaagaag gcaaggaatc aaacaacaaa tgttgcaagg cagaaataat ggacgttatg
900ttaatgtagt gtcgtcacac gtgacttaaa agagacgagt ctgcgtgtca aactaaaaat
960gtatgcaact ataaaaatgg gatttgatta tctttttagt
1000611000DNAArabidopsis thaliana 61aaagaacaaa atagtcccgc aggttaaaac
ctagtatttt acaaaaatat agacaaataa 60aataaaaatg aactatcgtg gatttagtta
gagaaatcca tacaaaaata aattgcattg 120agaaacttta gatgacgact aagtaatata
agattttgca ttgagaaagc ttacatgtca 180ccctaatttc tatcaaaagg gtttctgata
ctaattggaa ccatgtgaga aactatccat 240agaaatgatg atcttatcta gttatcatga
gctgatgact ttgaatattc cacagtcact 300aaacgcatgt ttatttctca gctttatgaa
tctgtatgta gacacacttt cgtattttct 360tcacttttgt gtatttgtat tgcccattcc
tcttttacca accaataaaa aaagcagtct 420tttttttgtt tggtcaagca ttggcactct
ttgtcaatca attacccaag ggatatataa 480taataaattg gaataatgaa agtagttcat
cataaggttc gtgattatta acttctactt 540ttgtttatat tcaatagatt aagatgccgt
gtatgataag gagagcaaaa gtaatccgat 600agagatcaca gcaatattca ttacaaaata
gttttcaaat aaaaacacat gatcaaaaag 660tgactcacaa aaacacatga tacgtacgtg
ggatgctaat cctaaagcaa caaggccaca 720agggcaaaat atgtagcttt catgctttgg
caaagcgaaa actctagtta agcttaccta 780acagaaaaaa taacccccaa aaaagcgtta
cgttccaaac atttagacag ggtacacgtg 840aatgactccc acttttttta aaaaaataat
agtactatca attcaaatgt gtaaacaaca 900aaaaaaaaaa atgtgtaagt aacgtctgta
attgatttga taagataaaa aaatcttgat 960taatttttct ataaattgaa gcctttcctc
tttacatttc 1000621000DNAArabidopsis thaliana
62ttcatgcatg cattatttta agatattaaa gagtaatagt ctcgtcgaaa ttataacatt
60aaaaagtgga ttattcgcct cgaagatatc cgatgtcaca attttgtctt cttttagaac
120ataatgttga atccgagaat cgaggtatac ctgaatatct cgccaatgtg ataatgtgat
180ataatcgaga tacctgaata cccactgaac cagactcgtg aattgacttt gtccttaagt
240ttttcaaatt tgatcagcaa tacagatatg tcatttccta ggactataaa agatatcaaa
300caaatattaa ttaaagatat gtaaaactta tgggtgaatg atcggtgatg tatgtgtata
360tgttactggc attccataat acacgcgatt caactaggaa aaatatacta taaaaagagg
420cgtatggaac tacgttacac gtttcttgta catttgagct tcgttacgat gatttgtatc
480aagagtgaaa aataactacg atttattatt tacagtttca gaatatgatt tgattcgatg
540tttaatatga tttcttgcac agctagaaac aaccataatt tcaacatttc atattagcaa
600ttcctccgtt aacagccgtt tgacattttt aaagtaggct aaaatttggg taaactgatt
660ttttaactaa tattttttaa ttttatatta actacaacta ttattattat tcgtttataa
720aaatatacga aattcaattt aaatgctagt agtattttgg tatttttacc aaaaaaaaaa
780ctgacacaat cgaaagaacg tagtaaaaaa caaagtatat tttttttgta atgagttgtc
840tctgattcag caataggaga gcgacaaaaa tatcacgatt ttttctagca acttttccat
900ctccacggaa aaagagtaat aatatgacac gtggcaacat cgcaacggct cttaccaaat
960ctcctcgcat atataaagcg tcttctccgc caccttcatt
1000631000DNAArabidopsis thaliana 63atagtattgt caatttttca atctgaagta
aatatctttt tgtaaggaag gggtcaacaa 60atgatcacaa cagagttggc aaaaagttat
caaatcgcat gcacggaagt tttacgtgtg 120gtgaaggtaa acttgtatta cacttatcta
taaaaattag tttaggcttt gattctaaat 180caaatctccg attagaaaaa attgcgtaag
caaatagctg gaaaaaattg tatcccatca 240tacttaagtc acaatgtttt gtttttgaga
tttgtgatgt aatcaatata tgttttacaa 300tgcaagtata ataatattaa agtcacattc
taagaaaatt atgatttgtg tcatacgtat 360acaaaaacac ccgtcacaca tcctgacttc
tgaacgttaa atctgtcgca cacaatcata 420aaaatttaaa aattcaccag agatgtactg
aaaagaatat aattaatcac atgatgatat 480atgcatagga gatgaggatt attcattttc
tgaaattccc tatatgaacc attataattg 540tttagtaatc agttcagaaa tgctaatcat
tatatgaacc attataattc ccttcatttt 600tatttaagat ccacttaaca ggatttgtta
atatgcaccc acatcactaa atacattggt 660acgcaaccgt tgttccattt ccattttcac
atcgaccaga atgtttacta tgcggtaaat 720tgtgtagtat gcagattttt ttgtatcatt
taattttcta acacttgtta agtcgaaact 780aattttgtca caagtaaaag aaataaaaaa
ggtggaaatt attaatcagt agttagatga 840ttagtttcga gttgaaatga aactcgactt
aacaagtgat agcgacgact ctagaaacag 900ccaaaatccg ccctattgct acctgtcgac
ccacaaatcg tttactcaaa aatgaataaa 960aaatttacga taaagcaaac ccaaagttat
atcttattat 1000641000DNAArabidopsis thaliana
64ttttgtgacg aaccaataaa agaggaactg ctttttttct tcgcatgtcc actatttact
60gtttggtatg acatcgaatt ggtatgatac attgttgtac caaaaacttc aaaatttggt
120ggcttgtaat caacattgat ccacttagcc attccaagca gtgaattaca atccgtggaa
180ggaaaaatta tggtccggca ccaacaaatt atcacctcat ctcccgtcac aaacatatca
240tctagaagtc ggtatgccac accacataat taaggagcac attactcaca tacccaacca
300atgtgggaca tatctaataa gctcattctt ggctggtaca tgctctattt catatttatc
360aaaaaaaaaa atattaggca ttgtaaatag cgtttttgct gttgagcaaa atagttatat
420ttgaaagtaa cattggtatt tataattata gtataacaat taggcattga agtgtgagtt
480tttgtttttg tttatttaac attggagtat taggttctta gaaatatatc tatatactat
540tagtagttta actacagttt gtacttaatt gaaaaaatgt taaaagttgt tttaacctag
600ctaattgcta aaaatgacta aatagacata cacaaagact tgtacatttt cagcttaacg
660actaatacat ttttccttta tatatatatc tctatcgagt ctagttatta atgttgaaag
720ttgcaaataa aacagaaatg ctaacatgta aatatcgtag ccaaaaatgc taacatgtgt
780ataacggtta taaccacaac ttgatggccg acctcttttt tcttttggta accatagaaa
840tggttacacg taactagtac gaaccaacga aaactcttct tattcgatag ttaaagataa
900tagcaatgcg caaaaatatc tagcactcac acgtgtagtt ttggattctc attggtcgag
960agatctataa aacgatacta ttggaggtta gatttttctc
1000651000DNAArabidopsis thaliana 65aatatccaaa agaaggttac atgcataagc
ttagactggt tttgctgatt aggtgcagct 60gaaagtaatc tcatgaaatt gatttccgtt
ttgccgattc ccattttaag atatctgatc 120ataggaatgc cacattcaga tggaagccag
acaccagaac agaaacaaag ttgaggtaat 180caaagaagcc atcttgagag caaagtggtt
ttgattccaa gaactcatat ttatcagcat 240aacctctatg ttcatgaaac ccagaagagc
acaatacata actatcacca ctcaatctgt 300tattaccaga actagcagtg ttcaacttca
ctcaaatgtg gacgaattac agtaaattgg 360ctaatcccac tactgaattg accttcaagg
taaaccccaa tcatttgcca ctacgatcaa 420ccaaaaatat agattcttca ttcactctta
tcaaaagcta tacatgaaat tcagagtgtt 480ctcattccta aatatggaaa gttgaattct
atcaacacaa tcagatcatg cgacaatgaa 540ctagaaacca cgaaccagaa attgttggta
atcgtttagt ggacgagatt gaatcaaagg 600ttcaagtggt aatcgttttc tcctgacgca
aaatcgaaag aaaaaagatc ggtagcgtcg 660catcctaatc gggtgacccg gaaaccaata
gttgattcgt tttagtggcg gtaaaacccg 720gtttgatgaa caaatattaa tgggcctggc
ccatacgagg atgatcgtgg caatgtcgat 780gataacaaca actcctctat tcgggtttat
gttgacccgg aaaacgaaag cataggacac 840gtgacacatg tgatgtgagt gaagccaaaa
ataataatat tgggaaagga tgaacacagc 900agctcagctt tcgtcttctc cgtcaatcca
ataaaaaaat cagcaaccgt tgtttgtttt 960taagcttttt ttacaaaaga cgtacacgtc
tctctctctc 1000661000DNAArabidopsis thaliana
66gattgtgggt agaagtagag cgagtgctct tactttagcg gtatcaactt ggaagctacc
60ctgcccattg gattggatct aagcttcata tttgtggatt aaaaaggcct tagtgggact
120catgtctcct ccggtgggac tagtgtcttt tacaatgtca agtttaaaaa attggataac
180aacaccgata aaaattcaca tttgcaaatt ttattcagtc ggaatatata tttgaaacaa
240gttttgaaat ccattggacg attaaaattc attgttgaga ggataaatat ggatttgttc
300atctgaacca tgtcgttgat tagtgattga ctaccatgaa aaatatgtta tgaaaagtat
360aacaactttt gataaatcac atttattaac aataaatcaa gacaaaatat gtcaacaata
420atagtagtag aagatattaa ttcaaattca tccgtaacaa caaaaaatca taccacaatt
480aagtgtacag aaaaaccttt tggatatatt tattgtcgct tttcaatgat tttcgtgaaa
540aggatatatt tgtgtaaaat aagaaggatc ttgacgggtg taaaaacatg cacaattctt
600aatttagacc aatcagaaga caacacgaac acttctttat tataagctat taaacaaaat
660cttgcctatt ttgcttagaa taatatgaag agtgactcat cagggagtgg aaaatatctc
720aggatttgct tttagctcta acatgtcaaa ctatctagat gccaacaaca caaagtgcaa
780attcttttaa tatgaaaaca acaataatat ttctaataga aaattaaaaa gggaaataaa
840atattttttt aaaatataca aaagaagaag gaatccatca tcaaagtttt ataaaattgt
900aatataatac aaacttgttt gcttccttgt ctctccctct gtctctctca tctctcctat
960cttctccata tatacttcat cttcacaccc aaaactccac
1000671000DNAArabidopsis thaliana 67agtctataac tgataaataa ttagtcaatc
agatcaaaca atagagaaat aattagtctg 60caatcagcga ttattaaaaa ataaaataaa
atagaaacat atctcagcac cgatttttca 120acagccggcc cacgatccag aaccgtccag
cttgaagccg gcgccgcgtc gacttgttcc 180accgttcgga tattgccacg tgtgtgagag
acacctgcca ttacctcgtg gaacatgttt 240tgttagtata agttactaac caagtcgcta
tcttcaattc ttcattacct aagagttgaa 300taaaaaaaca accatatacg atgagacacg
caaataactg atatacgaaa caattattta 360ttgatgtaca ttcttctaac tataatacta
caagattctg aatagtttta gatgtaaaaa 420gtacgagaag agttatgatc agagtaagtt
aagtgataag atcatgacac gtaggcagag 480gaaataaagt agaaactatc gacaacataa
taattgagtg agtacaaaaa ccacgtaata 540attaggtgta cactgtagtc tgtagatcga
ttgtacaaaa tgaaaaaact atatgattta 600aggaactaga tatacatatg catcgactca
agaacagttg ctttatgtga agactatcaa 660ttacattatc ggctaaattt ggcttcatgt
gttgaataga tggattgtgt gctcaggtag 720ttgtgtcagt taattataac agtaaacctt
cactgcttca catatataaa atttgggtta 780aaaaaacttt aacattttat ttttttaatt
gacaacaaac aaattctgta acggttacat 840ataatagtgc ttcagttaaa ctcctttttt
aggcaactct aattccttta tacccttaat 900ccattattat tctatcgaac ttatcttttt
aaaagcttcc gcgttacgcc atcacttctc 960caccatttaa atacctaaac cactttcttt
caaatttctt 1000681000DNAArabidopsis thaliana
68taaaaactaa gttgtccaac tactaacgca attgacaacc acaaccaaca ccaatattaa
60gtatgattac cgaccaaatt tgaattttat aaacatatag aatttttttt gccaactcat
120aatcatattt ttatatgtat tttaatataa ggtgaatgat tatcttttgt ttcatgtcat
180attgggcata acaaaactag tacaacaaag ttgtgtgaac cactcctttt atatagtttt
240gttgttgtct tttcttccaa ggaagattac tagacttaga caagtggatt atatatatat
300atatatatat atatatatat atatatatat atatattaac taatctttca acacttgtgg
360attgataatt attcatactt ttaataaaaa ttccatagtt caatgaatct gtaacaaaac
420caagaagtga cgtgatatac ttttggaact ataatcatat tgtgagctca atggataaga
480attgattgta gtaaatgatg tcggcagtta caagtgtagt tagatatatg caataactca
540aatcggacat gaactcattt taacttatct aaagagacca ccgacgaatc attttgggtt
600cacaaaattg tacttcgatt tctaagcctg aatgtgaacg cacgtttttg aatatttcaa
660cacgtgtttc atatttcatt acatgcatta taacataaat attacatctt tgagtcttta
720actagttgac caacaaaaaa aaaaacttta actaagtcta gctagttttg ttactacata
780tataaaaaca aaaccgaaat aaatatttaa aatttataat atatttgtgt ggctaaatca
840atcaacgtgt catgaaggtc taattcaagt tggtaaggaa atcttttgtt tatgtccatt
900tcccacgtgt cactatttgt atgacggcta gagaaagaca tgttgaatta actagtgact
960ccggattata taagcaagca tctactaaaa agataggaac
1000691000DNAArabidopsis thaliana 69gaatcgttct tgatctcaat ttggtttgtg
atgagaatga ctttgatatt tatagtgatg 60aaggaggcta gggttgttgg atctaatatt
ttcaaaaata ttcgaaaata tcttgcagga 120taagttgcaa atcaaagatt ggatcctcac
taatgattcg aatttggttt agttcataat 180gattggctat atttagataa gtggtaaata
ttgtttagtt gattatggtt agttttagat 240ttaagagaat aaatttataa gatttttttc
taatttatgg aatgtattat cgtgttatgg 300tagttagaag aatcgaataa gctcgtctat
gtcgatttta tagtgattag tgaggagggt 360gaatatatga ttgaagattc ttaaaatcgt
aattaaatat gcagactatt ttgatggaat 420atgcgtgtat tagatactaa aaattgttga
gatatacaac taatcgtgtt atggttaaga 480ttgcattact tgatggtaac ctattatgta
gttttacatg tgatcaaaag cccattaaac 540ataactcacc cgtctatgta aaacgcggat
cctgtttaat ttagttattt ggtttggacg 600attttaccct tgataaggaa gaacaaaatt
caaaattgaa atcaaaattg aggctaaatt 660agacaaaaat ttaatttgta cttctttttt
aataataaga ggatatgact atgatacctc 720gcctaattct ggaaataatt ttttaaatta
atcaagattt ttttcttaat aggtaagcca 780aagatagaac acgtctcgta ctattacaga
aatactcttc aatctgtcac aataaaataa 840gaattagtat catagatggt aacgtatctc
catttatctt ttaatattag acgttactta 900attcttagta agagaaatcg taatttactt
ttgtacttta attgtggtcc cttgatacca 960cattttccta taagtaagta cacgcaacag
catcaacaac 1000701000DNAArabidopsis thaliana
70catatttgga atccatatta tgcaaggaga catcggtttt actaatgcga attgaggggc
60ttgtataaaa tcggtctgct tcagaaaaca ttccattttt caacccttga aagtccattc
120agtttaccat ccaagagttt ttccttgttg tgaatgataa gaaaatccat aacatttatc
180taattttatc gtattggaaa aataatttaa tatttctgaa tacatttatt aaaaatatag
240ttaaaaacga tgtattttta gcgttttata attatttagc attatttaga tgttttcgtt
300attttgttat tacatcattt taattattat tcatgtgact tgaaattaat tactttttaa
360ggtcgaatct ttttttacgc ctgcctttac acttcttatt agaaggttat gtatcgaaac
420agccatccaa aatttaaata aattctaaaa gaagttatat atattatgaa tgtatggtga
480tactgacatg ggtagggtac attctttttc tattgagtta gttgatacgt taagttggat
540gagatataac ttatactatc atataagctt tttttaattt gtgtgtttca agtttggtcg
600gttttgttta tgtcttttgt aggtggggta tacagaaaaa taagagaaaa aaaaacaaag
660tcaaagctga ctttgacgac aaatcgctaa ccttcgagag atgatgataa caagcctaat
720taaaatacac gttacgtcgc aatcaattgc ttcaatctca ttctcattgc atgtcatcct
780tagccgcggt tagccgacca tactatcaac gatgtgcata ctatcaacaa tgtatttgta
840ataattaagt agaaaaataa agaatgttcc accgtccaaa agaaacacaa gtacggtcct
900gagagggctt gagatatttt gcttagtgat atatgtcaac atcttagtta agccccaccg
960gttctgatat aaaagtgggg aaaatatttc ataaccacac
1000711000DNAArabidopsis thaliana 71atctcgagat agcctgattc agctcaaaga
aagagtgtac gagaacattg ccacagtgcc 60tctagttgta gagagaatgc gagagagtaa
agagagaatc gacaagtcca tttctttcaa 120tggaactact atgcacccgg ctttcacaag
gcggaaagct agctaactga tcagtttcta 180cttctcccta ttggctttac cactcaaaga
aagctaatgc ataggaggag atatggctgg 240tatttctttc tactatatta gtatcaacca
acatctctcg catttagaat aagagtatac 300cataacctct gttttgattg tgttctacat
tgaaacaggt ctttctctga ttttaatatg 360gactcttata gaatcttggg ctgattctac
accctgtatg tatatgtaag aatcacataa 420tcagatggtt gcacaattgt caagaacacc
caaatccaag cttgcgattt agcaatctca 480cccgtgtaag aaacttaaaa ccataaaacc
aattgatata agagcttttc tagcaaaaac 540ttaaaaatat aattaacacc gtcgaaaatg
gtggaggcta ctagagcgag atataaactt 600tatcaccgtt taatcgtgtg ggattttgaa
atgatatgaa cctattgcaa aaagacaaaa 660aaaaaatata aacggaggaa ctaggcagtg
ggaccggaga tccaccgacc ggaaacgaag 720aatccatagt aagatagagc ccaggggttc
ggtcgtatcg gaccggtcgg atttggattc 780taacgacaag tgagtggacc cccagaaacg
acagacgtaa gcaatgacaa catcaaactc 840atcgccacgt cagcaacgga gttctacttg
cgaaacgtgt cccggcagcg tcatccaccc 900acgtgtgttc tagaagctct tctcctcctc
cgtatctcga cacgtgtcca ctccctctcc 960ttatataatt gacttttctc tcattctttc
caagtttcaa 1000721000DNAArabidopsis thaliana
72gcggttgttg gatggctgaa ggtaaaagct gctatgagat ggggattctt cataaggagg
60aaagctgctg agaggagagc acagattgtg gagcttcatg ataataacga tggtaaatga
120atagtcaggt tgcttgctag tgtatagaag ccatgtacag tttagggatg tataacaaca
180tcacttaccg atatggattt gcttgattta gcttattaga caagaatata atcgatgaac
240catgagaaat ttgaagacta accttcagca gcaagacaag aagctgattt tctgtctctt
300taagtttcaa ctttcaactt gcagatatat tagaagactg actaagtatg ggcctataag
360cccattaatg tctctactga acgcccaact tgtgaaaaac atgtctctca tctcaatttt
420gttttctttt cacatcaact taaagctacg agaaccaaac aagtgatgtt tttggtcaaa
480caggacgaaa catgtataca taaaaagagg taattatgtt cgataaacat aggtataaat
540aaactataag aatttgttac aacattcttt agatttgata ttaactgctg tttacatttg
600aagaactcag ttagttattt aatattttat tttatagggt ctagaaatta cttgcgtgtg
660gggactgatc accgattcta aagaaggctc atcctttgga taatagtatg aaacttgtca
720ataaagataa gtcatcacaa gtagggagat cttagctgtg ttccatacgc ccatctagaa
780aaagcgacga tggtcaagat taaataactg tatttgaaaa accaaaaccg cgtcaccaac
840tccaaagcca ttaccattag ccatcacttt ccatcttcca gctgttcgaa tcaggacgcc
900cctttttctt caccaaaccc atcggccgat aacgaacctt cctctctgac tgcctctgct
960cttactataa atacaaccaa tacgacctca tccaaaaccc
1000731000DNAArabidopsis thaliana 73gaccaccggt ttctcaagtt tccagaaata
tgcaaggatc tggtggtccc ggcggcagat 60tctcagggag aggagatcca ggatcaggcc
ctgtttcaat ctttggtgct tctacttcca 120agatcagtgt agatgcttct ttagccggtg
ccatcattgg aaaaggtgga atccattcca 180aacagatatg ccgtgaaaca ggagcaaaat
tatcgattaa agatcatgaa agagacccaa 240acttgaagat tatcgagctg gaaggaacat
ttgaacagat caatgtagcg agtgggatgg 300tgagagagct tatagggagg cttggatcag
tgaagaaacc tcaagggatt ggtggtcctg 360aagggaaacc acatcctggg agcaactaca
aaaccaagat ctgtgatagg tactctaaag 420ggaactgtac atatggagat agatgccatt
ttgctcatgg tgaatctgag ctgcgcaggt 480caggaatcgc ttagttatgt ctttagactc
ttgagaacag attatgcatt gttagctcca 540ttatcattgt gactttttgc tctctctttt
tgttttatca atttgtttta tgcgactcgc 600tttgaaaact ttagcccatt ctgtattgag
ctctgaagat tcgacgagtt ctgtaagtta 660ccgatcacag ttaaaagact ttgatatgtt
aaaaccctta tattacagct acatactatt 720tttgtcttaa ctcttaagat atcatgcaca
ataatatact tgttttgtct taacctatcg 780attacaaaac cgggttaacc gccgacatga
ggcgaactct aaagcctaac acgcgtcaac 840atctatcttc tcaacgactc aaaggctttc
caacacgtgt aggaccaata actgaaacac 900aaagcctacc acctcttctt cctcttcttt
catgacacgt ctcactgacg tgtcgtcaag 960aacgtaatta aatattaaac tatcgtgacg
aacgcgaggc 1000741000DNAArabidopsis thaliana
74taaggtttta ggccaatgat agtgattata cctcttttgt ttgtacttga ttcgtcacca
60ttattctgct tgcgtttatt ttagaattta gaggtggctt tcggaattaa aaacaacaag
120acaacaacat tttataattt tgatttgtat actaataata gaaccaaact ctgcttctct
180cagctattat tatctaagta ctgatatatt tcaaattgta gcataagttc tgtatatgta
240ttagatagag ttatcttttg atttattatc atatacggat taaatttgca gtttagagca
300atgtatttgc aattttgtga aaagtgaatt tgctttattt ttgtcttcat tactaaagtt
360gaattagcgg tttgatgagt cattatcgct attaatttgt ttccgtaata ataatgataa
420gcaatcatgt cacgaacata ttactaatat tgtggtgatt cttttgccct cctcatccat
480attcattgat tgtaaaagca cacgtaggtc acatacgtat catcctaaac cccaaattaa
540aaaaaaaaac taagcaattt ttatacatgg cctttggcga tccaagcatg caaaaatatt
600ccataaatgg ttaacacaaa atcatcgatt ctgaggggtt aactagggtc cactttatta
660ccgtatggaa atagagacac gtcaaagcta gtatttggac ctacgtacgc tcatgtattc
720cgccacgtgt cgtggtttca ctgacataaa acacatgtcg gtcagataat tctagctctc
780acgacgatga agcaatgtag gctcacttcc actcaccgcc gcctatagca gtgacacacg
840tcctgtcacc ggcgtgtcac tcaataatta tcagattttc attttaatta aataatgtta
900tcctatattt gttaggtgga tcccaacggg agggctttat ggtaattacc atttactcct
960gagaagcttt tatttttgtg tgtggagtag taggagaagt
1000751000DNAArabidopsis thaliana 75cagggtggtg aagcttctcc tgtttggcca
accgatgcaa caatgcaatc aatctctact 60caaaccactc tcaaatgctt atcccggatg
ctagaggaaa gcatactaac cgatgtaatc 120atccacacag ccgatggaac actctctgct
cacaaagcta tcctctcagc tagctcaact 180gttttcaaaa gcatgttcca ccacgacctc
atggaaaagg aatcatccac aatccacata 240gacgacatgt cgagagaatc ttgcatggct
ctactaagtt acctctacgg aaacataacc 300caagaagagt tctggaaaca caggctcgcc
ctcctcggcg ctgcaaacaa gtacgacata 360acggatttga aagcagcctg cgaggagagc
ctaatggaag atataaactc gagtaatgtg 420ctcgagaggc tacaagaagc ttggctttat
cagctggaga aactgaagaa agggtgtttg 480atgtatttgt ttgatttcgg aaagatttat
gatgtcagag aagaaatcag cagtttcttc 540aggcaagctg atcgagaact gatgctggag
atgtttcaag aggttttgtc tgtgtggaaa 600ccggtctaat atacacttac acacatactt
tgaacatttc agcttgatgg tgttttgttt 660atagtcttct tgttctcttc tgtgtgtgta
actaacaaca actctggatt ttgttgtatc 720actctgtcta atgtatagta ttgagtggat
ttgcctctag tttggtgtac cgagcctctc 780aaatgggccg agcctttcaa atgggctgtg
actaggccta atgactgtcg acatcgaatg 840tcggcacaaa atatgataat tttgtcgatg
ttgaaatcgt acgttttttg ttttaattag 900atgagtcagc atcaggagga tgaagtatcg
accgacacgt gtccgttgcg tcccaaagta 960aagcttataa cgtgaaggca tttgtatcgt
ctcttccgcc 1000761000DNAArabidopsis thaliana
76ttctttattg gtaacttttt aatatattca tttggtcaaa gctggcagga aaaaagtttt
60caacttataa ttttcttatg aagaagatac tgatacaatg catattaagg tcaaagcaag
120agtctaacaa tttattattc aaggaatata aataattgtt aggtaggttc ttattgaaat
180gaagcatttc acaccgtaat aatgatatag ctgtagaata tgaacaatgc gcgtttgctt
240gactgtttcg ttgtaagttt gtaacaataa tggccaccaa gtaatctgtc actagaatat
300tgcaacaaaa taaaaaccat aaatataact ataatgatct actaatttat taatttaatt
360aacatgaaaa tatgacagct tttgccagcc cttgctcgta taatcagtcc agataacaca
420ttacaacgaa acatataaaa taactttttt caacagaaac gattataata tacatctaaa
480ataattaaca gttgaaaatt ttgatagaca tactatatat gaatatgaac ttaaataatg
540acccattttt cgtataatgt taattattta ctcgtaaacg cgttatttcc acgaaacatt
600aggcaaaact caagttaatt tacgcctggc attgtaacgc ggttaaccaa aaagcaaatt
660acgcagagtc aaatcatatc taaaaaccaa tataaacata acacgtgtca atacttaact
720gatctcagaa ttaacatcgt taagagaaaa cacgtggcag agatctgtgt atccgtttgg
780tgctccttca tgtagatgat tcttcaagaa aacttcaaaa actcaaacac gtcaagttta
840agaaagaaaa aagacaacaa ttattttaaa ccgccattga aaagctaagc catgttgtat
900ttttgtatgt ggttcgcatg attagtgtca caccaataat taattattaa ctatttccca
960accatcgcgt atatatagag ctctcttctc tcattgttct
1000771000DNAArabidopsis thaliana 77tacttacaag caattgtgaa agagactctt
cgtttgcatc ctgcagctcc tttgatccct 60agaaaatcag aatccgatgt tcagattatg
ggtttccttg ttcctaaaaa cacccaggtt 120tttctcatac ttatctttat tacaccaaca
ttcattactt gagatattac accaataaaa 180aagtttggtt tggttaaaat cttccacctc
tgtttctcga gattaggttg ttgtgaacgt 240atgggcgata ggacgagacg cgagcgtgtg
ggaaaatcca atgaagttcg agccagagag 300gttcttgtta cgagaaaccg atgtaaaagg
cagagatttc gagttgatac cgtttggatc 360aggaagaagg atgtgtccgg gaatctcgat
ggctcttaag acaatgcata tggtgcttgc 420ctctcttctc tattcctttg actggaagct
tcaaaacggt gtcgtccccg gaaacattga 480catgagcgag actttcggtc ttaccttaca
caaggccaaa tctctttgtg ccgtacccgt 540caagaaacct acaatatcgt cttcttatta
ataatcgtat caatataaag ttcgtgtacg 600gatcaatatt aataattgaa gagaatgaaa
ttataaaaga tactgctttt tatgtttcaa 660gtaaaaatgt taatattaga aagtcaagaa
agaggaaaca caatgcaatt tgataaggtg 720ttttatcgtc tccgggattg aattgcgtag
gtcaaaaaaa gacatttgcc atttgatatc 780gacatataat cactcattca cggatacgta
tctcttgtga atggtcgtcg gtcaattaca 840actttactgc tcggctttaa ctgacaagtg
gcgcctcctg gtttcattca gaccctaccg 900gtaggatgta atttggacca taagtaagaa
atatttgtga aatatttact acacgaatat 960ctttacgtat ttaagttggc ttttgaagct
ttgataattc 1000781000DNAArabidopsis thaliana
78gataaaaaca gatactttta gcgaatactt ataacaacac tgtgttttgt ccaattcagg
60ggaggttcta cgatcaaccc aatgcaacaa ttctatatgc ttccgtttca taaaaattct
120gaatatcaat ctaaacctat acaattacat ccttactaat ctagataaag tgtatacaat
180aaaatgagag caataaggtt cattaatctt tatacttagc tcctccacag gacgatatac
240tttgagactg acacaaaaca aaataaggaa gaaagcttac gtttttgaag gaaaatcaca
300ggagggacgc ggaagcaaga ttggtctgag aggaaaattg aggaagaggg gttaatcgag
360atgaaaatgg agtttgaagt aatagagaga acgttgcgca gatcgaggaa gaacagagga
420taggatcaac acaaggatct tgtgaaaatg aaaatggctg aggaaatgaa ggactatttg
480ttaaataaga tataatatta tttctgaaat atttccatct gactccttta atttatacaa
540gcctcctttt ttgtacatct attttcagaa gatccaaata attgtttctt ctatttgtgt
600atttttgata tttaaacgta aatctttgga ttttgatcaa taatagacaa ttagccaagt
660tcagttttca ttaattagat ttataatttt aattattttt agcattcgtc taaaattaaa
720taatgcggca aaagaggaga aattaatttt tgttgttatt aattcaaagc gttacaaata
780aaaaaggaaa ttccacgctg gcaataaaat aagcgaaaat tccacgtgac atctacctgt
840cggatcaaaa agagtggaat tgacatttgt ttctctcaca ctctctctcg aattctctgg
900tagcttctag ttctatgcaa aaaaacgacg atagttctct atctttccag atgaatctcc
960ttccatatac aaaagcagtc atgcctcctc gctctctcgc
1000791000DNAArabidopsis thaliana 79aattcacatg tgatccggta tattaaaata
tacatgattt atggatacac tcgaatactt 60ctccgatatt catttcgcgg aaagttgcat
atgatccact atctcctata tatttgttat 120ataacacatg gcaaatctct ctatatatat
tttttaaata cacatatatc caaaataggc 180ttaactataa cacaaagact tttgtttact
gtatattagt aagaagtaaa tgtatttttt 240aatattatga taaagtttgt gaaatcacca
tttgcaatag ccatataggg tcgtgtttta 300attttacagt ttgtattgtt ataattcgat
tccaaggttg agaatatgtg tgtactatta 360gactatacaa ataataattc gttgacgata
ttgaatattt actaattata ggaagagaaa 420attatttact aactatagta cgatatattt
cttctatatg tgtttttaac gttttttttt 480ttttaaattt aagtcttaac tttacttctc
atttttaatc aaaaggaaaa aaataccaat 540caatttttcc taacacagtt tacttatcat
tttcatttga aatgtgttca ctttctgata 600aaatgctaat cctacaatca aatacaccat
tgtcgtgata acacgtgtac ggctctaaag 660caatcagaac aatcattgga cagtttttac
accgtcagat aagtacctat ccacttgctg 720actcagccgg ataaacccta aaccggaagt
ttgccccacc gtcaaaattg gaagaaaccg 780gacaaaagag aatgtaaaga ctaagaagta
agaacccatc ggacgtcgta agaaggttaa 840ttaacacgtg gaaacagctg gtcagagtta
tccggtaact tatccggtta caagtaaaaa 900aataatttgt tcccatacac gactccttca
gaaccaaacg cgacatcacg gcgccgttta 960gtgtctataa atagagcaat cggtcgtaga
aaaccaagac 1000801000DNAArabidopsis thaliana
80aagagatttt gaaaggtcgt cgaggagtta gtttattatc ttacaattat tatggatttc
60ttctgactaa agttttagtg caatagtttt ctgagttatt atatattttt cgaaattcat
120ccaaaggcta aaactttgta aggtattctg agtctattac gctagaaata gacacgctta
180cgtttttaag tcattttaat ctgatatata tatatatata tatatactac aattttataa
240ccaattatct ttgtttcaga caatttatat aaattcatat aaactatcag aaataaagag
300acaattggct gtaaaattgt attattttgc atctatgaaa tttttatttc tctttatcaa
360aattgaagac gagttgaaat tctatttctt ttttaaaaaa atttatatat gggagtgtca
420aaaaataatt ttcctttgta agagaatatt cgtatttgga cgagtcttga tttgtgtttt
480ccgttagcgt tccacattag ttgtaaagag cgagaaagat ttaagtgata attaaaaaga
540caagatatgc attttgaata tactctagat attaatctta agcatttttc atattggtta
600tattttatgg tcgaattttg acatccgaaa aatacatccc ttattcttca atttatattg
660aatacttaag tttttgatag taaatatgta taaataaatt ttaaacaatt gtcaaaaaga
720cattataact tatacggtta tagttgttaa aaaaaaaact tatacggtta taccaataga
780taaactgaca ttttagattt gagttttttt tttttttact ttccatgtag atagcgtgga
840ataatacctc atttatatgt atcgttagtt gaaataattg acaaaaggaa actattcgtt
900tgctgttcta gataaaagcc tttaattgag aaaaagaaaa tgcctccagc tttgacccgt
960tttgcctata aaaggattct ctaagatatg gactttcctc
1000811000DNAArabidopsis thaliana 81aataatcctt cgttgccagc agtctaatgg
gactgtgtta taggttaatc tctctttcac 60ggactattaa tttttggctc aacctcaatg
taatttattt aaattatatt tcaacgcata 120cacgtgaatt tatacatctt ttcataggat
ttgcaattcc ggattctcat ccgtcctagg 180aggttgtata tgtatattat tggtaactac
agcattaaac aactttaaca ctaagtacga 240tgatattata tgtacatgta cgtatgctat
agattgatat aagaaaaaag taaagagaga 300gcggatgatt gttgattgaa ctaattaaca
atagtgtatt aggtaggctt gtatccgaca 360gtcaacttaa acttcatagt tgaaaccaat
aaaaccagaa gaaaaaaaat ttatatacta 420atcataatca gcatgatgtt agacgatttg
atatctaact ttcttcgtta tgatagagaa 480ataatattcg caataaatca catataaaaa
aaattacttt acgtcataga ttatcaactt 540gcctccaaac acaacaacat gatcccctac
gctacaatac ataatatgaa aaataaatta 600atcgttgatt aacttaaagt cactccgtta
ataacgttat aagcaaagct tattagtttt 660tttgaccgtc attatcgacg accaaacaaa
gaaagagtcc caacaaatta tgccacgtaa 720gtccagaacg acgcctatga aaccaataag
acaaaaattc aaaaaccaaa acatccgtga 780aaactcttac gtggcggtga catgtgggtc
ccttatgtct ccattaacta acataagctt 840attgtcgact gtaccttttg tcccccgtct
cggctacaac cctgtgacaa acaaagatca 900agtcaaaata tctaacctaa agtactctcg
tccacgtgga cattgctgac gtggaacttc 960tcccttctat aaatacaggc ttctcttcgc
cgaattatac 1000821000DNAArabidopsis thaliana
82caagagcttg actcaacagc tcaacaaagg atgagatggg tgcagaggaa ctacatgatc
60tataattatt gtacggatgc gaagaggttc cctcaaggtc ttcctaaaga gtgcttagct
120gcatagagag agtaaagagt tgagagagga acaagatttt atttttcttt gtggttataa
180aattctattc attttattgt agatcacgtg aattttattg atttgttttg tagtatactc
240tatagttcgt taaagttata atattctctt tgttacaatg tgcttttttg tttttttaag
300tctctcgtag ttcgttccgt tatattctcc gagtcagtcc aaaaatggcc gcttctgttt
360taattttgtt ctagggttcc acagtcaaac tcaaaaccaa acccaatgga gcaataacct
420tttttttagt tttataatcg aaatcaaacc ggaattttgc atgtaatttg attggtgtcg
480ttactttaaa tctttaatcc acaaaacaaa atttactcga ttttagtatt aaccgaacca
540attatagttt attgaaattt aattttaatt ctatcaaatt gcatatgtat tcttgagtta
600ttttttataa aaatactgaa accaactaaa ataatagagt ttggcggaac taccgtacca
660aatttgattg tatttggagt atcatttttg caaacctaat tagcctgaag actgagatat
720ccttgtccac tcttatgaag aaccaattta acaaggtgaa aaccagaatc tctaaaccaa
780acatggcatc aactgaaccg gatcaggcag acttaaacca aaacaaagaa caagcacacg
840tagcatgagg caaaattaag cacatgcttg ctttacttca aaacaaaaac cagctgttca
900cagctaaaac tacacaagag tcacaaacgg cgaactatac tacaaaaaga ctaagacttg
960cctcccttat ataaaacccc ccaacacata aggtcccaat
1000831000DNAArabidopsis thaliana 83tctacgacta ttatacattc atactttgaa
tttggattac aaaaaaaaat cttcgagtat 60gagaaaacta ttaaaatagt tttattgaaa
aatcatataa aataaaaata aatttagtaa 120tagtataatt aaatatgtaa aataaaatta
aataagaaat atatcattaa cctataaatt 180agaagttgaa gagcttacat taattattta
atagacacat gtcaaatgct aaagtgatga 240tgtgtcaatc atatgaagag agttggccaa
ctttcatata tatgattctt agtttcgtac 300caattagttt ctcaattcag tgcatgtgta
gtcaaatgtc caaaagattg tttctaagca 360attaaaatct tatcaaaact tcattgccca
aaatattacg aaagcttgtt tctgtgtatg 420atatacccgg taaaattaaa atgatcatta
tacagaacaa atcagcatga ttttcggata 480aagatgacac acatttgaaa tcgtagccgt
actacgcgaa atacatgcac tcttcgttat 540gttaacactt taacagtgaa cgtagccata
atgttgacca cattcaacag tcaacacaaa 600cattacttta cacacaaata tatgattata
tatacatatg tacatgtaag tgaatgtgag 660caataatgac gggaatattc agagaagacg
atggtgaatg ttagcagtga tcacgggcac 720attcaaaact gactgtggac aaaaaaagct
cctggcctta aatatgattg tgccaaaaat 780agtacaaaac taagaaccca aaatggaatt
cgagacctat ataataatat atatgtatat 840agtctttcct tggaaagaaa tcttatgtta
ttaagaaaaa tactataagt tatctctcta 900tctagatatg atatatatgt ccaaacattt
ccacgtagat gacgtatatt accgaggata 960atcctctata taaggaagag aagctcgagt
aataaatctc 1000841000DNAArabidopsis thaliana
84gtgtatattt tgtcatgcaa actatatcgt ggagaaaata atgttgctta tgacttttga
60tagttgggct tacatttgga taatggatag ggtagacaaa gataggagga aagcaataat
120agcgaaatga agaacgaata tttggggaaa taggacaaat gaatatactt ctctttgaaa
180tggagattca cctaaattat taatactaaa gccatgcaat gcatccaaac aaatcagtgg
240tcaagcacac tcaattatat gtccacgaag acctttagaa tcttcacaac caaaagctat
300tttctacgct acctgataat tctgactcaa ttcttcttca taaaacgtat aatgaagctt
360tatgaatgat taattataga cacaaccggc cctatctgcg atttctacaa acaatagaac
420acaaaacttt aaaagttact acaaaatacc gaattgacta tatatatcat attatcagta
480taaacatgat tagattgatc atgtttatca gtaatcatga aagacaaaga gtgtgactat
540tgtaaaccaa attttagaat aaaataaata atttatcata ctatatacag tattttgtta
600agtatatgtc atccaatagt aacattatca tttaaactga aaaatgtttc agctacttta
660aggaattata gctttattaa aagtatatac ttttaggtca cgtgtttaga ggtgaagaac
720aataataatt actcaataag ttcaccagtc acactccaac atcttattca aattcctttt
780aaaagctttt taaccgtggc tgtttgatga ccatttgaca aaatttagta tattagaaaa
840aaacaatagg atagggataa tataggacat tagactatta gatggacaaa atgaagtatt
900atttaatttt ccaatgtacc aaccaataag aaagaagtga cgcacagtaa acgacaaaaa
960gctcaagcat aaaaacccaa accttctctg ctttctaaac
1000851000DNAArabidopsis thaliana 85gtaatcttaa accattcatt aattatgcga
ataaaacgta aaaaacgtta tcaggtttca 60actttattta ggctgcaatg ggaactatgg
atggcataat tgatacagtt tctgcatcgc 120attcgatttc acctttgatt ggactactca
aatccaacgg taaacttgtt ctactcggtg 180caacggagaa gccatttgat atatctgcat
tttccttaat attgggtatg catatataac 240cctacgtaat agaacgttat attagtcttt
cacatctaga ctatgtatcg agtatgatga 300aaccatgatg cgacaggacg aaaatcgata
gcgggaagtg gtattggagg aatgcaagag 360acacaagaga tgattgactt tgcggcagag
catggcataa aagcagaaat tgagatcata 420tccatggatt atgtgaacac cgccatggat
agacttgcca aaggggatgt tagatatcga 480tttgtcattg acatttctaa tacattggct
gctactcgat cttaattaaa gtcgatgttc 540tatatgtatt caaaataatc tggatttcaa
tcccacaaaa cttaaggata tatatatata 600tatatatata gtctatttta tataaatgga
gtatagtcaa ataaatatgc attatcaacg 660atatatagtc ttctattaca tagatacgtg
ggagttcacc caacgtagat acgttcggtt 720gaaacaagtc aatttcatca atgcctcttc
caaaaaaaaa acaaattgca ttattgatga 780acacatgcat cattatcaaa taggttggtt
aaaatgacca agatgactaa agccaatcac 840actactacca gatcgagtaa ccattaggga
ccattaattc acgtggacgt agtgaatatg 900gtccttgtga attaatgagt acgtaattgt
cctcattcat atatggatcg gttccacaaa 960catttcctgt ataaaattct acatctttcc
tctcattatt 1000861000DNAArabidopsis thaliana
86cagcttacca catgcggcca caattttaac catgattaat cttcttttgg aatttagcga
60gataaatcaa gagtaggttt attttagaac atagtatata atacactaat gaaaagtctc
120gttaaaacgt gtttcacgta accgacctct aaagtatggc gtgcgagtaa cgacaaatga
180aatctcaaat gaaaatggag attattttaa tcttaattat aacacactaa ttaacacccg
240tcaaacatcc tgatatgtga acgtggaatc tcgttacaca atcgaataaa attattgctt
300aatacgatcc gtctttctct ctctttcgta atgattatct atcaatatta tcttgactaa
360taaaagtatc gcaaggcctt ggcagtcaca tgttaaatga tgatatacgt tgttgcatgt
420taaaacatag tacaacgatc atctaggctg ctgctaaatt ctattttcaa aaatgccttt
480tatatgcaag aaaagcaaaa taatcgagtt tttttttttg tttcattata aactgcttca
540tttttcttag gaacggccaa actgttaaaa agtaaaatat gtatggtgat taattgatgt
600aaccaaagcc agttcgcccg tttgattgtc aacccacgtt atcattcact tgatcacttc
660catgaaacat ataaaagctg ataatactta tattataaag gaaaaaaagt atgaaaatat
720tatcagtagt tagatgatta gttcacatct aaatgaaata cgacttaaac tgaaagagac
780atgagccaat tcgtgtcgag ccacaaattt tggacgtata tttttagccc gtggttccac
840aatatttgca ggtcttttta ttcaatgagt ttattttgtc ttggttgaat aatgaaattt
900ccaaatataa aaaataatag aaatccgagg ccctacacaa gcacacatag taactcccac
960attatatata agcggccaat atcagcaact cagagattcc
1000871000DNAArabidopsis thaliana 87gagataggac atagtattca tcactttaaa
acactcttat tgataaatcc aagagtcact 60taactttttc tggtaatcgc tgttaaagac
tccaactcaa aaacaatcac attattccaa 120gtcttaacat tctgatacga ttcattacaa
gaaactgagt ttatctaaga ttttcttctc 180acttgtgcat taacatatat acacacaaca
taacacagat atgatacaat ttcacttctt 240tgaagaacgc ttgggagtaa cagcattaaa
gatagcttca atctcttctt tcagtcgttg 300aaagaaacca ggtcctttca cataatcaat
tggagtgctc tcatcaatgt cgtcacttgt 360tccatgagtc tctttgtcat ggtgacgact
cttggacttt tcatgatgac ccattgcctc 420tatttcttct ttaactcgtt caaacacatt
tggcgatttc acagtcttct cctctgtgga 480aatacccaat taaaacaaag ggtttagatt
cgtgtttcat aagagacaaa acagagcaac 540gaatcaaaaa tagacttatc atggcgtgtt
tctattgttt ctttagagaa atttccgatc 600aaaaccatcc attaatagac gtgtaatttt
gattgtgaaa tttgaggaaa cgaacataac 660ttgatcaaaa agtcttcctt tttaacaata
tgcataaacc tatgatgaga tctgatcgtt 720cattgtgttt acctgatcgg ttttgatccg
ccatcgatgg tttttgtctt cttgatcttg 780cagctttcta tttttggttc ctgattcggt
ggcgttttgc gtcagatgca aaagagtctt 840tacagtataa atcaaaatcg atttcaactt
aaatgggcct tgttgggtta taatgggcct 900ttttgtttgt aagtgagtcc aaggaaaaaa
aacaccagac cagattgaaa ttgaatcact 960tgcaaaagca aaattgatta ctctgttttg
gttttggatc 1000881000DNAArabidopsis thaliana
88aaagtaaagc tcaaagactg tcctcactta tagtttatga atcagcaaaa gaggaactta
60ccggcaaagc aactaaatca taaacttgat cagcacattt cgagataaga acatcagcaa
120gcaatctggt ccctgaggat ccttcaacct ctagcttctg ctccacagaa gcaacagtaa
180catcagcacc agctcgccgt aaaacatcaa ctaacacaac agcttctatt tcctccgtcc
240catatccaat tggaacaagc acctacatag tttctcagtt atctccaatc tcaatctctc
300tacacaaaca agctggacaa ttttagttat agataacaag aaaaaaggag aaaccttttt
360agtagttgca gaagacccga cgccaacatc ggaatctagc gttgttacca tacccggaga
420catggaagat ctaagcttta aagttcggtc cctccttgtt gtccctagtg aagaagaaac
480tacagagact gaagaaaaac taggagcaac agtcatcgag acacaaccca tcgaagagat
540aagccttgac tccatcaaag ttggactcag cgaagctatc attgaaatcg aatatcctaa
600agaccccatt gacgatagct aatgcagcag tctgataatc ccgccgctag acggagagtg
660agttgcgtct tatctactag tatacacttc ttgaagaaag ctaaaaacgt cgtcgtatcg
720ttgtcttatt gcttgattgg ataaagcttc aattttttcc tccaattttt gtcgaaaatg
780taacattttt tttgctttcc actgtgagat tgtcacacac gattctctaa agaaatgaga
840cgaattttga aaacgattta taaaaatagt atttttgttt ccttgagctt atctcctctt
900ttctctttgg tttgagatat gaattacaga caatcaagta gtcagtcaca acagtgtgta
960taaatatgtc ttttaagtga aatcggtttc gagagacagt
1000891000DNAArabidopsis thaliana 89ttgtgtttat ctcaatgttt agatgttgaa
aagtatatat gctgattaat aaataacaat 60ttcttagtca acaactcaat gatagaatgt
tatataactc tcttacatca tttacatttt 120taaattgaat gaattctcat actatgcaat
ctacctggtt agggtgaagt cacaacaaca 180acaacattaa ctaataacaa caatttaagc
tgcatgtcct aatcagaatc tcggttattg 240gtcgctcata agaacagtta gaggtctttc
agtcacaatt gagagacgac atacataatt 300ggatgggaat caaaatcagt ttaattgcgc
tatacgcaac ttgaaacgtc attatacacc 360gataagcttt aaaaataaaa tgttgtgatg
gacacgcgca ctatatatgt atgtgtttgg 420acatgattga agctaaacaa gataatattt
attaagaaga aaatacataa ttattactaa 480ataatgtgta ttattatacg attccaactt
ttatttgata atgatttttg agcaaatcag 540gttgataggt atgattgaga ccatcgccca
aactacacct taaggtctta ctattagaaa 600caaacaattc atttgggaat tgaaaccaaa
gtcatctatg gaagtaacct caaaatttat 660ctaagaaaca gtgatgtatt agaatggtca
agtcaactta ttcacaaaat tacaaaacta 720tcgtttataa atatataatc tatttttcac
ttgtgttaca cgattcctct tttgatatgc 780aggaagaaga gaagatctta caacggctag
aaatagaaaa cttaatgtgc tattaattac 840aacttaccaa ctaattagaa ctttctatat
gaaccggatc ttctacagca aatcacggtc 900aattcaaaat attcatctct ctctctctct
ccacatggac aagacatgat taagaaagta 960tataaaagaa acagagcaga gaaagacatt
ttgttccggt 1000901000DNAArabidopsis thaliana
90gtgagcagat aaggatcgtg tattttcaag tattatatta attaataaat tatcgcccat
60cttctattgt tttgtatttt catttgaaaa gaaagcaggg ctggcttatc aatatataca
120gtctctttct tttagtttac taaaatcatt tcagataaga cccacacaaa agtaatatca
180aatggaattt tgctttcaac atggataatt ttctttaata ataaaattgt caaccatgtt
240aagcgtaatc tctaaatccc caaaattaaa aacactaaaa atagtcggtc attgctccac
300gtttaacctg tccaattcca aaagaacttc gtgtttcatt gacgatatag tttacgtacg
360tatgattctt tctaaatgag gtatttttgt cccattagca atttaaaatt taaaatggtt
420gtgtaacttt ttgaaatgct tagtagttaa aaccgaacta agaagaaata atgatcggaa
480cgtgtttatg tcagctaatg cattacccac tgacattaac tgatccagtt tactgatttt
540agacaatcat actctatttt tactggagtt gcagaaaaag ggacttacaa aacgatgaat
600taaatttgga caagtatata tacataatta gaaggttata gaatctagtt ttcgtaggat
660atttcttgaa ttatcctacg agtgtgacaa ctaaatgcac aacaaaacaa caatttaaat
720gctaaaaagc aaaaatagta attactggtc acctattaag gaacaacacc aacacctatt
780tagctgttat gtcctaataa aaataaaata aaataaaaac aataaaccaa aaaagtaaaa
840aataaaaatg tttgactaaa attctctttc ttttccgatt cgtcattcga tgaatcttcc
900tttcttatac aatttggata atcttttcta atgtaaaaat tcgagatttt tttttataat
960gttaaatcta taaataagag caacctaaaa cgaacaactt
10009123DNAArtificial SequenceDescription of Artificial Sequenceprimer
91cgccagggtt ttcccagtca cga
239224DNAArtificial SequenceDescription of Artificial Sequenceprimer
92agcggataac aatttcacac agga
2493588DNAArabidopsis thalianaCDS(1)..(585) 93atg gtg aag caa gcg atg aag
gaa gag gag aag aag aga aac acg gcg 48Met Val Lys Gln Ala Met Lys
Glu Glu Glu Lys Lys Arg Asn Thr Ala 1 5
10 15atg cag tca aag tac aaa gga gtg agg aag agg aaa tgg
gga aaa tgg 96Met Gln Ser Lys Tyr Lys Gly Val Arg Lys Arg Lys Trp
Gly Lys Trp 20 25 30gta tcg
gag atc aga ctt cca cac agc aga gaa cga att tgg tta ggc 144Val Ser
Glu Ile Arg Leu Pro His Ser Arg Glu Arg Ile Trp Leu Gly 35
40 45tct tac gac act ccc gag aag gcg gcg cgt
gct ttc gac gcc gct caa 192Ser Tyr Asp Thr Pro Glu Lys Ala Ala Arg
Ala Phe Asp Ala Ala Gln 50 55 60ttt
tgt ctc cgc ggc ggc gat gct aat ttc aat ttc cct aat aat cca 240Phe
Cys Leu Arg Gly Gly Asp Ala Asn Phe Asn Phe Pro Asn Asn Pro 65
70 75 80ccg tcg atc tcc gta gaa
aag tcg ttg acg cct ccg gag att cag gaa 288Pro Ser Ile Ser Val Glu
Lys Ser Leu Thr Pro Pro Glu Ile Gln Glu 85
90 95gct gct gct aga ttc gct aac aca ttc caa gac att
gtc aag gga gaa 336Ala Ala Ala Arg Phe Ala Asn Thr Phe Gln Asp Ile
Val Lys Gly Glu 100 105 110gaa
gaa tcg ggt tta gta ccc gga tcc gag atc cga cca gag tct cct 384Glu
Glu Ser Gly Leu Val Pro Gly Ser Glu Ile Arg Pro Glu Ser Pro 115
120 125tct aca tct gca tct gtt gct aca tcg
acg gtg gat tat gat ttt tcg 432Ser Thr Ser Ala Ser Val Ala Thr Ser
Thr Val Asp Tyr Asp Phe Ser 130 135
140ttt ttg gat ttg ctt ccg atg aat ttc ggg ttt gat tcc ttc tcc gac
480Phe Leu Asp Leu Leu Pro Met Asn Phe Gly Phe Asp Ser Phe Ser Asp145
150 155 160gac ttc tct ggc
ttc tcc ggt ggt gat cga ttt aca gag att tta ccc 528Asp Phe Ser Gly
Phe Ser Gly Gly Asp Arg Phe Thr Glu Ile Leu Pro 165
170 175atc gaa gat tac gga gga gag agt tta tta
gat gaa tct ttg att ctt 576Ile Glu Asp Tyr Gly Gly Glu Ser Leu Leu
Asp Glu Ser Leu Ile Leu 180 185
190tgg gat ttt tga
588Trp Asp Phe 19594195PRTArabidopsis thaliana 94Met Val Lys Gln
Ala Met Lys Glu Glu Glu Lys Lys Arg Asn Thr Ala 1 5
10 15Met Gln Ser Lys Tyr Lys Gly Val Arg Lys
Arg Lys Trp Gly Lys Trp 20 25
30Val Ser Glu Ile Arg Leu Pro His Ser Arg Glu Arg Ile Trp Leu Gly
35 40 45Ser Tyr Asp Thr Pro Glu Lys
Ala Ala Arg Ala Phe Asp Ala Ala Gln 50 55
60Phe Cys Leu Arg Gly Gly Asp Ala Asn Phe Asn Phe Pro Asn Asn Pro
65 70 75 80Pro Ser Ile
Ser Val Glu Lys Ser Leu Thr Pro Pro Glu Ile Gln Glu 85
90 95Ala Ala Ala Arg Phe Ala Asn Thr Phe
Gln Asp Ile Val Lys Gly Glu 100 105
110Glu Glu Ser Gly Leu Val Pro Gly Ser Glu Ile Arg Pro Glu Ser Pro
115 120 125Ser Thr Ser Ala Ser Val
Ala Thr Ser Thr Val Asp Tyr Asp Phe Ser 130 135
140Phe Leu Asp Leu Leu Pro Met Asn Phe Gly Phe Asp Ser Phe Ser
Asp145 150 155 160Asp Phe
Ser Gly Phe Ser Gly Gly Asp Arg Phe Thr Glu Ile Leu Pro
165 170 175Ile Glu Asp Tyr Gly Gly Glu
Ser Leu Leu Asp Glu Ser Leu Ile Leu 180 185
190Trp Asp Phe 195951290DNAArabidopsis
thalianaCDS(81)..(1088) 95aacaaatctc tctgtttctc ccgctcttgc tctgttttct
caaagacaaa agaggacatc 60gtcgttgact cctcttctct atg gct act gct aag aac
aag gga aaa tca atc 113 Met Ala Thr Ala Lys Asn
Lys Gly Lys Ser Ile 1 5
10agg gtc ctt ggt acc agt gaa gca gag aaa aag gat gag atg gag ttg
161Arg Val Leu Gly Thr Ser Glu Ala Glu Lys Lys Asp Glu Met Glu Leu
15 20 25gag gag gag ttc cag ttt
agt agc ggc aag tat aaa gat tcg ggt cct 209Glu Glu Glu Phe Gln Phe
Ser Ser Gly Lys Tyr Lys Asp Ser Gly Pro 30 35
40ggc tcg gac atg tgg tta gga gat gct tcc tct acg tct cca
aga agt 257Gly Ser Asp Met Trp Leu Gly Asp Ala Ser Ser Thr Ser Pro
Arg Ser 45 50 55ctt agg aag act aga
acc ttt gac cga cat aat ccc tat ctc gta tct 305Leu Arg Lys Thr Arg
Thr Phe Asp Arg His Asn Pro Tyr Leu Val Ser 60 65
70 75tct tat gct act cct cag ccg cca aca aca
act aca tgc tct gtc tct 353Ser Tyr Ala Thr Pro Gln Pro Pro Thr Thr
Thr Thr Cys Ser Val Ser 80 85
90ttt ccc ttt tac ctc cct cca gcg att caa aat caa caa cga ttt tta
401Phe Pro Phe Tyr Leu Pro Pro Ala Ile Gln Asn Gln Gln Arg Phe Leu
95 100 105cac ccg aat gac cct tca
gga caa aga cag caa caa atg atc tcg ttt 449His Pro Asn Asp Pro Ser
Gly Gln Arg Gln Gln Gln Met Ile Ser Phe 110 115
120gat cct caa caa cag gtg caa cca tat gtt gca caa cag cag
caa caa 497Asp Pro Gln Gln Gln Val Gln Pro Tyr Val Ala Gln Gln Gln
Gln Gln 125 130 135caa caa cat cta ttg
cag tac tgg aga gac att ctg aag ctg agt ccg 545Gln Gln His Leu Leu
Gln Tyr Trp Arg Asp Ile Leu Lys Leu Ser Pro140 145
150 155agc gga aga atg atg atg atg aac atg tta
aga caa gaa agc gat ctg 593Ser Gly Arg Met Met Met Met Asn Met Leu
Arg Gln Glu Ser Asp Leu 160 165
170cca ctg acg agg cca ccg gtt caa ccc ttc agc gcc acc aag cta tat
641Pro Leu Thr Arg Pro Pro Val Gln Pro Phe Ser Ala Thr Lys Leu Tyr
175 180 185aga ggt gtc agg caa cgc
cac tgg gga aaa tgg gtt gcc gag atc cgt 689Arg Gly Val Arg Gln Arg
His Trp Gly Lys Trp Val Ala Glu Ile Arg 190 195
200aag cca cga aac agg aca cgt ctc tgg cta ggg aca ttc gat
aca gca 737Lys Pro Arg Asn Arg Thr Arg Leu Trp Leu Gly Thr Phe Asp
Thr Ala 205 210 215gaa gaa gcc gcc atg
gcc tac gac cgc gag gcc ttc aag ttg agg gga 785Glu Glu Ala Ala Met
Ala Tyr Asp Arg Glu Ala Phe Lys Leu Arg Gly220 225
230 235gag acc gct agg ctc aat ttc cct gaa ctt
ttt ctc aat aaa caa gag 833Glu Thr Ala Arg Leu Asn Phe Pro Glu Leu
Phe Leu Asn Lys Gln Glu 240 245
250cca act ccc gtg cat cag aaa caa tgt gag acg ggg act act agt gaa
881Pro Thr Pro Val His Gln Lys Gln Cys Glu Thr Gly Thr Thr Ser Glu
255 260 265gac tca agc aga aga gga
gag gat gat tcg agc acg gca ttg gca gta 929Asp Ser Ser Arg Arg Gly
Glu Asp Asp Ser Ser Thr Ala Leu Ala Val 270 275
280gga ggg gtg agt gag gag acg ggt tgg gct gag gca tgg ttc
aat gca 977Gly Gly Val Ser Glu Glu Thr Gly Trp Ala Glu Ala Trp Phe
Asn Ala 285 290 295att cca gag gaa tgg
gga cct gga agc cct cta tgg gat gat tac cac 1025Ile Pro Glu Glu Trp
Gly Pro Gly Ser Pro Leu Trp Asp Asp Tyr His300 305
310 315ttt ccc att tct aac cat aag gac gat ctt
gac gcc aca caa aac tct 1073Phe Pro Ile Ser Asn His Lys Asp Asp Leu
Asp Ala Thr Gln Asn Ser 320 325
330tct tct gat aca att taggaccttt gttagaatat agatatgctt agttgtatga
1128Ser Ser Asp Thr Ile 335ctgatctagc ttgtgttttt tttttgggtg
gagacagttt ttgtcatctt ccacatttta 1188gattctattt tcgaccatca tttttttctt
gatcggtgac tatgaatcta atggggtcaa 1248tcattttcac atataaaact taagtatttg
gtgtttgtac tt 129096336PRTArabidopsis thaliana 96Met
Ala Thr Ala Lys Asn Lys Gly Lys Ser Ile Arg Val Leu Gly Thr 1
5 10 15Ser Glu Ala Glu Lys Lys Asp
Glu Met Glu Leu Glu Glu Glu Phe Gln 20 25
30Phe Ser Ser Gly Lys Tyr Lys Asp Ser Gly Pro Gly Ser Asp
Met Trp 35 40 45Leu Gly Asp Ala
Ser Ser Thr Ser Pro Arg Ser Leu Arg Lys Thr Arg 50
55 60Thr Phe Asp Arg His Asn Pro Tyr Leu Val Ser Ser Tyr
Ala Thr Pro 65 70 75
80Gln Pro Pro Thr Thr Thr Thr Cys Ser Val Ser Phe Pro Phe Tyr Leu
85 90 95Pro Pro Ala Ile Gln Asn
Gln Gln Arg Phe Leu His Pro Asn Asp Pro 100
105 110Ser Gly Gln Arg Gln Gln Gln Met Ile Ser Phe Asp
Pro Gln Gln Gln 115 120 125Val Gln
Pro Tyr Val Ala Gln Gln Gln Gln Gln Gln Gln His Leu Leu 130
135 140Gln Tyr Trp Arg Asp Ile Leu Lys Leu Ser Pro
Ser Gly Arg Met Met145 150 155
160Met Met Asn Met Leu Arg Gln Glu Ser Asp Leu Pro Leu Thr Arg Pro
165 170 175Pro Val Gln Pro
Phe Ser Ala Thr Lys Leu Tyr Arg Gly Val Arg Gln 180
185 190Arg His Trp Gly Lys Trp Val Ala Glu Ile Arg
Lys Pro Arg Asn Arg 195 200 205Thr
Arg Leu Trp Leu Gly Thr Phe Asp Thr Ala Glu Glu Ala Ala Met 210
215 220Ala Tyr Asp Arg Glu Ala Phe Lys Leu Arg
Gly Glu Thr Ala Arg Leu225 230 235
240Asn Phe Pro Glu Leu Phe Leu Asn Lys Gln Glu Pro Thr Pro Val
His 245 250 255Gln Lys Gln
Cys Glu Thr Gly Thr Thr Ser Glu Asp Ser Ser Arg Arg 260
265 270Gly Glu Asp Asp Ser Ser Thr Ala Leu Ala
Val Gly Gly Val Ser Glu 275 280
285Glu Thr Gly Trp Ala Glu Ala Trp Phe Asn Ala Ile Pro Glu Glu Trp 290
295 300Gly Pro Gly Ser Pro Leu Trp Asp
Asp Tyr His Phe Pro Ile Ser Asn305 310
315 320His Lys Asp Asp Leu Asp Ala Thr Gln Asn Ser Ser
Ser Asp Thr Ile 325 330
33597879DNAArabidopsis thalianaCDS(1)..(876) 97atg gac ttt gac gag gag
cta aat ctt tgt att acg aaa ggt aaa aat 48Met Asp Phe Asp Glu Glu
Leu Asn Leu Cys Ile Thr Lys Gly Lys Asn 1 5
10 15gtt gat cat tct ttt gga gga gaa gct tct tcc acg
tcc cca aga tct 96Val Asp His Ser Phe Gly Gly Glu Ala Ser Ser Thr
Ser Pro Arg Ser 20 25 30atg
aag aaa atg aag agt cct agt cgt cct aaa ccc tat ttc caa tcc 144Met
Lys Lys Met Lys Ser Pro Ser Arg Pro Lys Pro Tyr Phe Gln Ser 35
40 45tct tct tct cct tat tcg tta gag gct
ttc cct ttt tct ctc gat cca 192Ser Ser Ser Pro Tyr Ser Leu Glu Ala
Phe Pro Phe Ser Leu Asp Pro 50 55
60aca ctt cag aat cag caa caa caa ctc gga tca tac gtt ccg gta ctt
240Thr Leu Gln Asn Gln Gln Gln Gln Leu Gly Ser Tyr Val Pro Val Leu 65
70 75 80gag caa cga caa
gac ccg aca atg caa ggc cag aag caa atg atc tcc 288Glu Gln Arg Gln
Asp Pro Thr Met Gln Gly Gln Lys Gln Met Ile Ser 85
90 95ttt agt cct caa caa caa caa cag cag cag
cag tat atg gcc cag tac 336Phe Ser Pro Gln Gln Gln Gln Gln Gln Gln
Gln Tyr Met Ala Gln Tyr 100 105
110tgg agt gac aca ttg aat ctg agt cca aga gga aga atg atg atg atg
384Trp Ser Asp Thr Leu Asn Leu Ser Pro Arg Gly Arg Met Met Met Met
115 120 125atg agc caa gaa gct gtt caa
cct tac atc gca acg aag ctg tac aga 432Met Ser Gln Glu Ala Val Gln
Pro Tyr Ile Ala Thr Lys Leu Tyr Arg 130 135
140gga gtg aga caa cgt caa tgg gga aaa tgg gtc gca gag atc cgt aag
480Gly Val Arg Gln Arg Gln Trp Gly Lys Trp Val Ala Glu Ile Arg Lys145
150 155 160cca cga agc agg
gca cgt ctt tgg ctt ggt acc ttt gat aca gct gaa 528Pro Arg Ser Arg
Ala Arg Leu Trp Leu Gly Thr Phe Asp Thr Ala Glu 165
170 175gaa gct gcc atg gcc tac gac cgc caa gcc
ttc aaa tta cga ggc cac 576Glu Ala Ala Met Ala Tyr Asp Arg Gln Ala
Phe Lys Leu Arg Gly His 180 185
190agc gca aca ctg aat ttc ccg gag cat ttt gtg aat aag gaa agc gag
624Ser Ala Thr Leu Asn Phe Pro Glu His Phe Val Asn Lys Glu Ser Glu
195 200 205ctg cat gat tca aac tcg tcg
gat cag aaa gaa cct gaa acg cca cag 672Leu His Asp Ser Asn Ser Ser
Asp Gln Lys Glu Pro Glu Thr Pro Gln 210 215
220cca agc gag gtt aac ttg gag agc aag gaa cta ccg gtg att gat gtt
720Pro Ser Glu Val Asn Leu Glu Ser Lys Glu Leu Pro Val Ile Asp Val225
230 235 240ggg aga gag gaa
ggt atg gct gag gca tgg tac aat gcc att aca tcg 768Gly Arg Glu Glu
Gly Met Ala Glu Ala Trp Tyr Asn Ala Ile Thr Ser 245
250 255gga tgg ggt cct gaa agt cct ctt tgg gat
gat ttg gat agt tct cat 816Gly Trp Gly Pro Glu Ser Pro Leu Trp Asp
Asp Leu Asp Ser Ser His 260 265
270cag ttt tca tca gaa agc tca tct tct tct cct ctc tct tgt cct atg
864Gln Phe Ser Ser Glu Ser Ser Ser Ser Ser Pro Leu Ser Cys Pro Met
275 280 285agg cct ttc ttt tga
879Arg Pro Phe Phe
29098292PRTArabidopsis thaliana 98Met Asp Phe Asp Glu Glu Leu Asn Leu Cys
Ile Thr Lys Gly Lys Asn 1 5 10
15Val Asp His Ser Phe Gly Gly Glu Ala Ser Ser Thr Ser Pro Arg Ser
20 25 30Met Lys Lys Met Lys
Ser Pro Ser Arg Pro Lys Pro Tyr Phe Gln Ser 35
40 45Ser Ser Ser Pro Tyr Ser Leu Glu Ala Phe Pro Phe Ser
Leu Asp Pro 50 55 60Thr Leu Gln Asn
Gln Gln Gln Gln Leu Gly Ser Tyr Val Pro Val Leu 65 70
75 80Glu Gln Arg Gln Asp Pro Thr Met Gln
Gly Gln Lys Gln Met Ile Ser 85 90
95Phe Ser Pro Gln Gln Gln Gln Gln Gln Gln Gln Tyr Met Ala Gln
Tyr 100 105 110Trp Ser Asp Thr
Leu Asn Leu Ser Pro Arg Gly Arg Met Met Met Met 115
120 125Met Ser Gln Glu Ala Val Gln Pro Tyr Ile Ala Thr
Lys Leu Tyr Arg 130 135 140Gly Val Arg
Gln Arg Gln Trp Gly Lys Trp Val Ala Glu Ile Arg Lys145
150 155 160Pro Arg Ser Arg Ala Arg Leu
Trp Leu Gly Thr Phe Asp Thr Ala Glu 165
170 175Glu Ala Ala Met Ala Tyr Asp Arg Gln Ala Phe Lys
Leu Arg Gly His 180 185 190Ser
Ala Thr Leu Asn Phe Pro Glu His Phe Val Asn Lys Glu Ser Glu 195
200 205Leu His Asp Ser Asn Ser Ser Asp Gln
Lys Glu Pro Glu Thr Pro Gln 210 215
220Pro Ser Glu Val Asn Leu Glu Ser Lys Glu Leu Pro Val Ile Asp Val225
230 235 240Gly Arg Glu Glu
Gly Met Ala Glu Ala Trp Tyr Asn Ala Ile Thr Ser 245
250 255Gly Trp Gly Pro Glu Ser Pro Leu Trp Asp
Asp Leu Asp Ser Ser His 260 265
270Gln Phe Ser Ser Glu Ser Ser Ser Ser Ser Pro Leu Ser Cys Pro Met
275 280 285Arg Pro Phe Phe
290991499DNAArabidopsis thalianaCDS(644)..(1222) 99tggtatcggt gaggctgaga
gttattcact tacaaaaaaa aaaaaaaact tgagtgtaac 60caaaaaaaaa aagttgatat
actttctggt tttctcctta acttttattc tttacaaatc 120catccccctt agatctgttt
atttcccgct actttgattc atttctgtta gtaatctgtc 180tttcgtatag aagaaaactg
atttcttggt ttgtattttc ttaaagagat caatcttttt 240ttatttttga tcttcttgtg
tttttttttc tttgtagaat taatcgtttg tgagggtatt 300tttttaattc cctcctctca
gaaatctaca cagaggtttt ttattttata aacctctttt 360ttcgattttc ttgaaaacaa
aaaatcctgt tctttacttt ttttacaaga acaagggaaa 420aaaatttctt tttattagaa
atgacaactt ctatggattt ttacagtaac aaaacgtttc 480aacaatctga tccattcggt
ggtgaattaa tggaagcgct ttacctttta tcaaaagccc 540ttccaacgat tcatccgcgt
ttgcgttctc tctacccgct ccaatttcat acgggtcgga 600tctccactca ttttctcacc
atcttagtcc taaaccggtc tca atg aaa caa acc 655
Met Lys Gln Thr
1ggt act tcc gcg gct aaa ccg acg aag cta tac aga gga gtg
aga caa 703Gly Thr Ser Ala Ala Lys Pro Thr Lys Leu Tyr Arg Gly Val
Arg Gln 5 10 15 20cgt
cac tgg gga aaa tgg gtg gct gag att cgt tta ccg agg aat cga 751Arg
His Trp Gly Lys Trp Val Ala Glu Ile Arg Leu Pro Arg Asn Arg
25 30 35act cga ctt tgg ctc gga aca
ttc gac acg gcg gag gaa gct gct tta 799Thr Arg Leu Trp Leu Gly Thr
Phe Asp Thr Ala Glu Glu Ala Ala Leu 40 45
50gct tat gac aag gcg gcg tat aag ctc cga gga gat ttt gcg
cgg ctt 847Ala Tyr Asp Lys Ala Ala Tyr Lys Leu Arg Gly Asp Phe Ala
Arg Leu 55 60 65aat ttc cct gat
ctc cgt cat aac gac gag tat caa cct ctt caa tca 895Asn Phe Pro Asp
Leu Arg His Asn Asp Glu Tyr Gln Pro Leu Gln Ser 70
75 80tca gtc gac gct aag ctt gaa gct att tgt caa aac tta
gct gag acg 943Ser Val Asp Ala Lys Leu Glu Ala Ile Cys Gln Asn Leu
Ala Glu Thr 85 90 95
100acg cag aaa cag gtg aga tca acg aag aag tct tct tct cgg aaa cgt
991Thr Gln Lys Gln Val Arg Ser Thr Lys Lys Ser Ser Ser Arg Lys Arg
105 110 115tca tca acc gtc gca
gtg aaa cta ccg gag gag gac tac tct agc gcc 1039Ser Ser Thr Val Ala
Val Lys Leu Pro Glu Glu Asp Tyr Ser Ser Ala 120
125 130gga tct tcg ccg ctg tta acg gag agt tat gga tct
ggt gga tct tct 1087Gly Ser Ser Pro Leu Leu Thr Glu Ser Tyr Gly Ser
Gly Gly Ser Ser 135 140 145tcg ccg
ttg tcg gag ctg acg ttt ggt gat acg gag gag gag att cag 1135Ser Pro
Leu Ser Glu Leu Thr Phe Gly Asp Thr Glu Glu Glu Ile Gln 150
155 160ccg ccg tgg aac gag aac gcg ttg gag aag tat
ccg tcg tac gag atc 1183Pro Pro Trp Asn Glu Asn Ala Leu Glu Lys Tyr
Pro Ser Tyr Glu Ile165 170 175
180gat tgg gat tcg att ctt cag tgt tcg agt ctt gta aat tagatgttgc
1232Asp Trp Asp Ser Ile Leu Gln Cys Ser Ser Leu Val Asn
185 190cataggggta ttttagggac tttagagctc tctgcgatgg
agtttttggt cattgcagag 1292attttattat tattaagggg gtttgttatg ttaatatcaa
ataagtttat ctactttgat 1352gttaattagt gttaatctct gcgtcggtcc aagctgtttt
tttttggcat gcttcgaccg 1412tgtgagattt cttatgtaat ttttgtagtt ccttgatttt
cttagttcaa gttaaattgg 1472cacaaaagag caaaaaaaaa aaaaaaa
1499100193PRTArabidopsis thaliana 100Met Lys Gln
Thr Gly Thr Ser Ala Ala Lys Pro Thr Lys Leu Tyr Arg 1 5
10 15Gly Val Arg Gln Arg His Trp Gly Lys
Trp Val Ala Glu Ile Arg Leu 20 25
30Pro Arg Asn Arg Thr Arg Leu Trp Leu Gly Thr Phe Asp Thr Ala Glu
35 40 45Glu Ala Ala Leu Ala Tyr
Asp Lys Ala Ala Tyr Lys Leu Arg Gly Asp 50 55
60Phe Ala Arg Leu Asn Phe Pro Asp Leu Arg His Asn Asp Glu Tyr
Gln 65 70 75 80Pro Leu
Gln Ser Ser Val Asp Ala Lys Leu Glu Ala Ile Cys Gln Asn
85 90 95Leu Ala Glu Thr Thr Gln Lys Gln
Val Arg Ser Thr Lys Lys Ser Ser 100 105
110Ser Arg Lys Arg Ser Ser Thr Val Ala Val Lys Leu Pro Glu Glu
Asp 115 120 125Tyr Ser Ser Ala Gly
Ser Ser Pro Leu Leu Thr Glu Ser Tyr Gly Ser 130 135
140Gly Gly Ser Ser Ser Pro Leu Ser Glu Leu Thr Phe Gly Asp
Thr Glu145 150 155 160Glu
Glu Ile Gln Pro Pro Trp Asn Glu Asn Ala Leu Glu Lys Tyr Pro
165 170 175Ser Tyr Glu Ile Asp Trp Asp
Ser Ile Leu Gln Cys Ser Ser Leu Val 180 185
190Asn101962DNAArabidopsis thalianaCDS(77)..(880)
101acaccggaca ttttgaaatc tcaacaagaa ccaaaccaaa caacaaaaaa acattcttaa
60taattatctt tctgtt atg tcg atg acg gcg gat tct caa tct gat tat gct
112 Met Ser Met Thr Ala Asp Ser Gln Ser Asp Tyr Ala
1 5 10ttt ctt gag tcc ata
cga cga cac tta cta gga gaa tcg gag ccg ata 160Phe Leu Glu Ser Ile
Arg Arg His Leu Leu Gly Glu Ser Glu Pro Ile 15
20 25ctc agt gag tcg aca gcg agt tcg gtt act caa tct tgt
gta acc ggt 208Leu Ser Glu Ser Thr Ala Ser Ser Val Thr Gln Ser Cys
Val Thr Gly 30 35 40cag agc att aaa
ccg gtg tac gga cga aac cct agc ttt agc aaa ctg 256Gln Ser Ile Lys
Pro Val Tyr Gly Arg Asn Pro Ser Phe Ser Lys Leu 45 50
55 60tat cct tgc ttc acc gag agc tgg gga
gat ttg ccg ttg aaa gaa aac 304Tyr Pro Cys Phe Thr Glu Ser Trp Gly
Asp Leu Pro Leu Lys Glu Asn 65 70
75gat tct gag gat atg tta gtt tac ggt atc ctc aac gac gcc ttt
cac 352Asp Ser Glu Asp Met Leu Val Tyr Gly Ile Leu Asn Asp Ala Phe
His 80 85 90ggc ggt tgg gag
ccg tct tct tcg tct tcc gac gaa gat cgt agc tct 400Gly Gly Trp Glu
Pro Ser Ser Ser Ser Ser Asp Glu Asp Arg Ser Ser 95
100 105ttc ccg agt gtt aag atc gag act ccg gag agt ttc
gcg gcg gtg gat 448Phe Pro Ser Val Lys Ile Glu Thr Pro Glu Ser Phe
Ala Ala Val Asp 110 115 120tct gtt ccg
gtc aag aag gag aag acg agt cct gtt tcg gcg gcg gtg 496Ser Val Pro
Val Lys Lys Glu Lys Thr Ser Pro Val Ser Ala Ala Val125
130 135 140acg gcg gcg aag gga aag cat
tat aga gga gtg aga caa agg ccg tgg 544Thr Ala Ala Lys Gly Lys His
Tyr Arg Gly Val Arg Gln Arg Pro Trp 145
150 155ggg aaa ttt gcg gcg gag att aga gac ccg gcg aag
aac gga gct agg 592Gly Lys Phe Ala Ala Glu Ile Arg Asp Pro Ala Lys
Asn Gly Ala Arg 160 165 170gtt
tgg tta gga acg ttt gag acg gcg gag gac gcg gcg ttg gct tac 640Val
Trp Leu Gly Thr Phe Glu Thr Ala Glu Asp Ala Ala Leu Ala Tyr 175
180 185gac aga gct gct ttc agg atg cgt ggt
tcc cgc gct ttg ttg aat ttt 688Asp Arg Ala Ala Phe Arg Met Arg Gly
Ser Arg Ala Leu Leu Asn Phe 190 195
200ccg ttg aga gtt aat tca gga gaa ccc gac ccg gtt cga atc aag tcc
736Pro Leu Arg Val Asn Ser Gly Glu Pro Asp Pro Val Arg Ile Lys Ser205
210 215 220aag aga tct tct
ttt tct tct tct aac gag aac gga gct ccg aag aag 784Lys Arg Ser Ser
Phe Ser Ser Ser Asn Glu Asn Gly Ala Pro Lys Lys 225
230 235agg aga acg gtg gcc gcc ggt ggt gga atg
gat aag gga ttg acg gtg 832Arg Arg Thr Val Ala Ala Gly Gly Gly Met
Asp Lys Gly Leu Thr Val 240 245
250aag tgc gag gtt gtt gaa gtg gca cgt ggc gat cgt tta ttg gtt tta
880Lys Cys Glu Val Val Glu Val Ala Arg Gly Asp Arg Leu Leu Val Leu
255 260 265taattttgat ttttctttgt
tggatgatta tatgattctt caaaaaagaa gaacgttaat 940aaaaaaattc gtttattatt
gt 962102268PRTArabidopsis
thaliana 102Met Ser Met Thr Ala Asp Ser Gln Ser Asp Tyr Ala Phe Leu Glu
Ser 1 5 10 15Ile Arg Arg
His Leu Leu Gly Glu Ser Glu Pro Ile Leu Ser Glu Ser 20
25 30Thr Ala Ser Ser Val Thr Gln Ser Cys Val
Thr Gly Gln Ser Ile Lys 35 40
45Pro Val Tyr Gly Arg Asn Pro Ser Phe Ser Lys Leu Tyr Pro Cys Phe 50
55 60Thr Glu Ser Trp Gly Asp Leu Pro Leu
Lys Glu Asn Asp Ser Glu Asp 65 70 75
80Met Leu Val Tyr Gly Ile Leu Asn Asp Ala Phe His Gly Gly
Trp Glu 85 90 95Pro Ser
Ser Ser Ser Ser Asp Glu Asp Arg Ser Ser Phe Pro Ser Val 100
105 110Lys Ile Glu Thr Pro Glu Ser Phe Ala
Ala Val Asp Ser Val Pro Val 115 120
125Lys Lys Glu Lys Thr Ser Pro Val Ser Ala Ala Val Thr Ala Ala Lys
130 135 140Gly Lys His Tyr Arg Gly Val
Arg Gln Arg Pro Trp Gly Lys Phe Ala145 150
155 160Ala Glu Ile Arg Asp Pro Ala Lys Asn Gly Ala Arg
Val Trp Leu Gly 165 170
175Thr Phe Glu Thr Ala Glu Asp Ala Ala Leu Ala Tyr Asp Arg Ala Ala
180 185 190Phe Arg Met Arg Gly Ser
Arg Ala Leu Leu Asn Phe Pro Leu Arg Val 195 200
205Asn Ser Gly Glu Pro Asp Pro Val Arg Ile Lys Ser Lys Arg
Ser Ser 210 215 220Phe Ser Ser Ser Asn
Glu Asn Gly Ala Pro Lys Lys Arg Arg Thr Val225 230
235 240Ala Ala Gly Gly Gly Met Asp Lys Gly Leu
Thr Val Lys Cys Glu Val 245 250
255Val Glu Val Ala Arg Gly Asp Arg Leu Leu Val Leu 260
265103834DNAArabidopsis thalianaCDS(171)..(746)
103aacacaattt gattacactg agcaacacaa aactggcgaa ccaacgtgac tctaacgaag
60aaaccggcaa tggccagtat cactacaatg ccgaagaaat aacaagaatc ataaacgagc
120cagaatatta tcccccgggt tacaacttgt ctaccaccgc aatttcaaac atg gtg
176 Met Val
1tct atg ctg act aat gtt
gtc tct ggt gag acc gaa ccc tcg gca tct 224Ser Met Leu Thr Asn Val
Val Ser Gly Glu Thr Glu Pro Ser Ala Ser 5 10
15gcg aca tgg acg atg ggt cat aag aga gaa aga gaa gag ttt
tct ttg 272Ala Thr Trp Thr Met Gly His Lys Arg Glu Arg Glu Glu Phe
Ser Leu 20 25 30cct cct caa cca ttg
att acc ggt tca gct gtg act aaa gaa tgt gaa 320Pro Pro Gln Pro Leu
Ile Thr Gly Ser Ala Val Thr Lys Glu Cys Glu 35 40
45 50agc tca atg tcc ttg gag agg cca aaa aaa
tat aga gga gta agg caa 368Ser Ser Met Ser Leu Glu Arg Pro Lys Lys
Tyr Arg Gly Val Arg Gln 55 60
65cga cca tgg gga aaa tgg gcg gcg gag att cga gac cca cac aag gcg
416Arg Pro Trp Gly Lys Trp Ala Ala Glu Ile Arg Asp Pro His Lys Ala
70 75 80aca cgt gta tgg ctt ggg
aca ttc gag aca gcc gag gcc gcc gca aga 464Thr Arg Val Trp Leu Gly
Thr Phe Glu Thr Ala Glu Ala Ala Ala Arg 85 90
95gcc tat gat gcg gca gca ctt cgc ttt aga gga agc aaa gca
aag ctt 512Ala Tyr Asp Ala Ala Ala Leu Arg Phe Arg Gly Ser Lys Ala
Lys Leu 100 105 110aat ttc ccc gaa aat
gtt gga act cag acg att caa cga aat tct cat 560Asn Phe Pro Glu Asn
Val Gly Thr Gln Thr Ile Gln Arg Asn Ser His115 120
125 130ttc ttg caa aac tct atg caa cct tct ctg
aca tac atc gat caa tgt 608Phe Leu Gln Asn Ser Met Gln Pro Ser Leu
Thr Tyr Ile Asp Gln Cys 135 140
145cca act cta tta tct tac tct cga tgt atg gag caa caa caa cca tta
656Pro Thr Leu Leu Ser Tyr Ser Arg Cys Met Glu Gln Gln Gln Pro Leu
150 155 160gta ggc atg ttg cag cca
aca gaa gag gaa aat cac ttt ttc gaa aaa 704Val Gly Met Leu Gln Pro
Thr Glu Glu Glu Asn His Phe Phe Glu Lys 165 170
175cca tgg acc gaa tat gat caa tac aat tac tcc tct ttt ggt
746Pro Trp Thr Glu Tyr Asp Gln Tyr Asn Tyr Ser Ser Phe Gly
180 185 190taactaacat attgtcaacg
ctttgtattt ctacttattc gatctaccaa ttttttctct 806cccaatacaa cttcagtctg
attattgc 834104192PRTArabidopsis
thaliana 104Met Val Ser Met Leu Thr Asn Val Val Ser Gly Glu Thr Glu Pro
Ser 1 5 10 15Ala Ser Ala
Thr Trp Thr Met Gly His Lys Arg Glu Arg Glu Glu Phe 20
25 30Ser Leu Pro Pro Gln Pro Leu Ile Thr Gly
Ser Ala Val Thr Lys Glu 35 40
45Cys Glu Ser Ser Met Ser Leu Glu Arg Pro Lys Lys Tyr Arg Gly Val 50
55 60Arg Gln Arg Pro Trp Gly Lys Trp Ala
Ala Glu Ile Arg Asp Pro His 65 70 75
80Lys Ala Thr Arg Val Trp Leu Gly Thr Phe Glu Thr Ala Glu
Ala Ala 85 90 95Ala Arg
Ala Tyr Asp Ala Ala Ala Leu Arg Phe Arg Gly Ser Lys Ala 100
105 110Lys Leu Asn Phe Pro Glu Asn Val Gly
Thr Gln Thr Ile Gln Arg Asn 115 120
125Ser His Phe Leu Gln Asn Ser Met Gln Pro Ser Leu Thr Tyr Ile Asp
130 135 140Gln Cys Pro Thr Leu Leu Ser
Tyr Ser Arg Cys Met Glu Gln Gln Gln145 150
155 160Pro Leu Val Gly Met Leu Gln Pro Thr Glu Glu Glu
Asn His Phe Phe 165 170
175Glu Lys Pro Trp Thr Glu Tyr Asp Gln Tyr Asn Tyr Ser Ser Phe Gly
180 185 1901051085DNAArabidopsis
thalianaCDS(120)..(953) 105acttcactct ctaatttcct tctctctatc tctcaccata
ttcgcgatta aaaactctca 60acttttctct caaatttctg atcctttgat ccaacagtta
gaagaagatt catctgatc 119atg gcc ctc gaa gcg atg aac act cca act tct
tct ttc acc aga atc 167Met Ala Leu Glu Ala Met Asn Thr Pro Thr Ser
Ser Phe Thr Arg Ile 1 5 10
15gaa acg aaa gaa gat ttg atg aac gac gcc gtt ttc att gag ccg tgg
215Glu Thr Lys Glu Asp Leu Met Asn Asp Ala Val Phe Ile Glu Pro Trp
20 25 30ctt aaa cgc aaa cgc tcc
aaa cgt cag cgt tct cac agc cct tct tcg 263Leu Lys Arg Lys Arg Ser
Lys Arg Gln Arg Ser His Ser Pro Ser Ser 35 40
45tct tct tcc tca ccg cct cga tct cga ccc aaa tcc cag aat
caa gat 311Ser Ser Ser Ser Pro Pro Arg Ser Arg Pro Lys Ser Gln Asn
Gln Asp 50 55 60ctt acg gaa gaa gag
tat ctc gct ctt tgt ctc ctc atg ctc gct aaa 359Leu Thr Glu Glu Glu
Tyr Leu Ala Leu Cys Leu Leu Met Leu Ala Lys 65 70
75 80gat caa ccg tcg caa acg cga ttt cat caa
cag tcg caa tcg tta acg 407Asp Gln Pro Ser Gln Thr Arg Phe His Gln
Gln Ser Gln Ser Leu Thr 85 90
95ccg ccg cca gaa tca aag aac ctt ccg tac aag tgt aac gtc tgt gaa
455Pro Pro Pro Glu Ser Lys Asn Leu Pro Tyr Lys Cys Asn Val Cys Glu
100 105 110aaa gcg ttt cct tcc tat
cag gct tta ggc ggt cac aaa gca agt cac 503Lys Ala Phe Pro Ser Tyr
Gln Ala Leu Gly Gly His Lys Ala Ser His 115 120
125cga atc aaa cca cca acc gta atc tca aca acc gcc gat gat
tca aca 551Arg Ile Lys Pro Pro Thr Val Ile Ser Thr Thr Ala Asp Asp
Ser Thr 130 135 140gct ccg acc atc tcc
atc gtc gcc gga gaa aaa cat ccg att gct gcc 599Ala Pro Thr Ile Ser
Ile Val Ala Gly Glu Lys His Pro Ile Ala Ala145 150
155 160tcc gga aag atc cac gag tgt tca atc tgt
cat aaa gtg ttt ccg acg 647Ser Gly Lys Ile His Glu Cys Ser Ile Cys
His Lys Val Phe Pro Thr 165 170
175ggt caa gct tta ggc ggt cac aaa cgt tgt cac tac gaa ggc aac ctc
695Gly Gln Ala Leu Gly Gly His Lys Arg Cys His Tyr Glu Gly Asn Leu
180 185 190ggc ggc gga gga gga gga
gga agc aaa tca atc agt cac agt gga agc 743Gly Gly Gly Gly Gly Gly
Gly Ser Lys Ser Ile Ser His Ser Gly Ser 195 200
205gtg tcg agc acg gta tcg gaa gaa agg agc cac cgt gga ttc
atc gat 791Val Ser Ser Thr Val Ser Glu Glu Arg Ser His Arg Gly Phe
Ile Asp 210 215 220cta aac cta ccg gcg
tta cct gaa ctc agc ctt cat cac aat cca atc 839Leu Asn Leu Pro Ala
Leu Pro Glu Leu Ser Leu His His Asn Pro Ile225 230
235 240gtc gac gaa gag atc ttg agt ccg ttg acc
ggt aaa aaa acc gct ttt 887Val Asp Glu Glu Ile Leu Ser Pro Leu Thr
Gly Lys Lys Thr Ala Phe 245 250
255gtt gac cga tca cga cca agt cat caa gaa aga aga ttt atc ttt aaa
935Val Asp Arg Ser Arg Pro Ser His Gln Glu Arg Arg Phe Ile Phe Lys
260 265 270aat cta ata ctc gac tat
taattcttgt gtgatttttt tcgttacaac 983Asn Leu Ile Leu Asp Tyr
275catagtttca ttttcatttt tttagttaca aatttttaat tgttctgatt tggattgaat
1043attggtatat tgttaggggt tgatacaaaa aaaaaaaaaa aa
1085106278PRTArabidopsis thaliana 106Met Ala Leu Glu Ala Met Asn Thr Pro
Thr Ser Ser Phe Thr Arg Ile 1 5 10
15Glu Thr Lys Glu Asp Leu Met Asn Asp Ala Val Phe Ile Glu Pro
Trp 20 25 30Leu Lys Arg Lys
Arg Ser Lys Arg Gln Arg Ser His Ser Pro Ser Ser 35
40 45Ser Ser Ser Ser Pro Pro Arg Ser Arg Pro Lys Ser
Gln Asn Gln Asp 50 55 60Leu Thr Glu
Glu Glu Tyr Leu Ala Leu Cys Leu Leu Met Leu Ala Lys 65
70 75 80Asp Gln Pro Ser Gln Thr Arg Phe
His Gln Gln Ser Gln Ser Leu Thr 85 90
95Pro Pro Pro Glu Ser Lys Asn Leu Pro Tyr Lys Cys Asn Val
Cys Glu 100 105 110Lys Ala Phe
Pro Ser Tyr Gln Ala Leu Gly Gly His Lys Ala Ser His 115
120 125Arg Ile Lys Pro Pro Thr Val Ile Ser Thr Thr
Ala Asp Asp Ser Thr 130 135 140Ala Pro
Thr Ile Ser Ile Val Ala Gly Glu Lys His Pro Ile Ala Ala145
150 155 160Ser Gly Lys Ile His Glu Cys
Ser Ile Cys His Lys Val Phe Pro Thr 165
170 175Gly Gln Ala Leu Gly Gly His Lys Arg Cys His Tyr
Glu Gly Asn Leu 180 185 190Gly
Gly Gly Gly Gly Gly Gly Ser Lys Ser Ile Ser His Ser Gly Ser 195
200 205Val Ser Ser Thr Val Ser Glu Glu Arg
Ser His Arg Gly Phe Ile Asp 210 215
220Leu Asn Leu Pro Ala Leu Pro Glu Leu Ser Leu His His Asn Pro Ile225
230 235 240Val Asp Glu Glu
Ile Leu Ser Pro Leu Thr Gly Lys Lys Thr Ala Phe 245
250 255Val Asp Arg Ser Arg Pro Ser His Gln Glu
Arg Arg Phe Ile Phe Lys 260 265
270Asn Leu Ile Leu Asp Tyr 2751071783DNAArabidopsis
thalianaCDS(162)..(1529) 107attgttaaaa gctctcacac aaccaccgtt ctccgtcacg
gtggcgcttt attctctcat 60cggagcgcct caccggtggc agacggtgtc gttgttcctc
tcctaaaacc tccatcaatc 120accatctctt tacacagagc tctaaccaaa atcttcgaga g
atg ggg cag agt atg 176
Met Gly Gln Ser Met 1
5agc tgt gga agt cga ccg gag cac gga ata ttc gcc tct gta cag tgc
224Ser Cys Gly Ser Arg Pro Glu His Gly Ile Phe Ala Ser Val Gln Cys
10 15 20ggc gat atc atc
act atc cgt cgt gtg atg gcg acg gag cct agt ctg 272Gly Asp Ile Ile
Thr Ile Arg Arg Val Met Ala Thr Glu Pro Ser Leu 25
30 35ttg aat caa act act cct tat gat cgt cac tct
gtt ctt cat gtc gct 320Leu Asn Gln Thr Thr Pro Tyr Asp Arg His Ser
Val Leu His Val Ala 40 45 50gct
gct aat ggt cag atc gag att ttg tca ttg ctt ttg gaa cga ttt 368Ala
Ala Asn Gly Gln Ile Glu Ile Leu Ser Leu Leu Leu Glu Arg Phe 55
60 65acg aat cca gat ttg ttg aat cgt cac aag
cag act ccg tta atg ttg 416Thr Asn Pro Asp Leu Leu Asn Arg His Lys
Gln Thr Pro Leu Met Leu 70 75 80
85gct gcg atg tat gga aga atc tct tgt gtg aag aag cta gct gaa
gtt 464Ala Ala Met Tyr Gly Arg Ile Ser Cys Val Lys Lys Leu Ala Glu
Val 90 95 100gga gct aat
att ttg atg ttt gat tct gtg aat cga aga aca tgt ttg 512Gly Ala Asn
Ile Leu Met Phe Asp Ser Val Asn Arg Arg Thr Cys Leu 105
110 115cat tac gct gct tat tat gga cat gct aat
tgt gtt caa gct att ctc 560His Tyr Ala Ala Tyr Tyr Gly His Ala Asn
Cys Val Gln Ala Ile Leu 120 125
130tct gct gct caa tca agt cct gtt gct gtc cat tgg gga tat gcg aga
608Ser Ala Ala Gln Ser Ser Pro Val Ala Val His Trp Gly Tyr Ala Arg 135
140 145ttt gtg aac ata aga gat gat aaa
gga gcg act ccg ttg cat tta gct 656Phe Val Asn Ile Arg Asp Asp Lys
Gly Ala Thr Pro Leu His Leu Ala150 155
160 165gct cga cag aga cga cct gaa tgt gtg aat gtt ttg
ttg gat agt ggt 704Ala Arg Gln Arg Arg Pro Glu Cys Val Asn Val Leu
Leu Asp Ser Gly 170 175
180tct ctt gtt tgt gca tct act agt gta tat ggt tct cca gga agc aca
752Ser Leu Val Cys Ala Ser Thr Ser Val Tyr Gly Ser Pro Gly Ser Thr
185 190 195cct ctt cat tta gca gct
aga agt gga tct ata gat tgt gtc aga aag 800Pro Leu His Leu Ala Ala
Arg Ser Gly Ser Ile Asp Cys Val Arg Lys 200 205
210ttg ctt gct tgg ggt gct gat cgt ctt caa cga gac gct tct
ggg aga 848Leu Leu Ala Trp Gly Ala Asp Arg Leu Gln Arg Asp Ala Ser
Gly Arg 215 220 225ata cct tat gtg gtt
gcg atg aag cat aag cat gga gca tgt gga gcc 896Ile Pro Tyr Val Val
Ala Met Lys His Lys His Gly Ala Cys Gly Ala230 235
240 245tta ctt aat ccg tcc tct gca gag cca ctt
gtt tgg cca tca cca tta 944Leu Leu Asn Pro Ser Ser Ala Glu Pro Leu
Val Trp Pro Ser Pro Leu 250 255
260aag ttc atc agt gag ctt aat gac gag gcg aaa ctt ctc tta gag cag
992Lys Phe Ile Ser Glu Leu Asn Asp Glu Ala Lys Leu Leu Leu Glu Gln
265 270 275gct tta atg gag gct aac
agg gag aga gag aaa acc atc ctc aaa gga 1040Ala Leu Met Glu Ala Asn
Arg Glu Arg Glu Lys Thr Ile Leu Lys Gly 280 285
290aca gct tat tcc tta cca tca ccc tct ttc tct gac acg gat
gat aac 1088Thr Ala Tyr Ser Leu Pro Ser Pro Ser Phe Ser Asp Thr Asp
Asp Asn 295 300 305atg tcc gag gtg agt
gat acg gaa ctg tgc tgc att tgc ttt gag caa 1136Met Ser Glu Val Ser
Asp Thr Glu Leu Cys Cys Ile Cys Phe Glu Gln310 315
320 325gta tgt aca att gaa gtt aaa gac tgt ggt
cac caa atg tgt gca caa 1184Val Cys Thr Ile Glu Val Lys Asp Cys Gly
His Gln Met Cys Ala Gln 330 335
340tgc aca ctt gca ctg tgc tgt cac aac aaa cca aac cca acg acc tca
1232Cys Thr Leu Ala Leu Cys Cys His Asn Lys Pro Asn Pro Thr Thr Ser
345 350 355acc gtg act cca ccg gtc
tgt ccg ttc tgt aga agc acc att gca tgt 1280Thr Val Thr Pro Pro Val
Cys Pro Phe Cys Arg Ser Thr Ile Ala Cys 360 365
370tta gtc gtc gcc cag aac aac aac aac aac aac gaa aag agc
aaa agc 1328Leu Val Val Ala Gln Asn Asn Asn Asn Asn Asn Glu Lys Ser
Lys Ser 375 380 385cta gat gat gtt gtt
gtt gtt gat cgt gag gca ggt gat gtt agc tcc 1376Leu Asp Asp Val Val
Val Val Asp Arg Glu Ala Gly Asp Val Ser Ser390 395
400 405tcc aaa ttc aga aaa cat aga aga tca ata
aac ctt ggc gaa gaa agc 1424Ser Lys Phe Arg Lys His Arg Arg Ser Ile
Asn Leu Gly Glu Glu Ser 410 415
420agc agc ttc atg gga cta tca act att gga tca ttc ggt agg ata acc
1472Ser Ser Phe Met Gly Leu Ser Thr Ile Gly Ser Phe Gly Arg Ile Thr
425 430 435ggc cgt ggc tcg gga agg
atc gca gcc gaa aac gag ctg atg gac aaa 1520Gly Arg Gly Ser Gly Arg
Ile Ala Ala Glu Asn Glu Leu Met Asp Lys 440 445
450cca ata ttg tgagggatcg attccgtttt aagggacatt ttggggcatg
1569Pro Ile Leu 455ggggagcaat aaaaaagatg aggggatgaa
attgtgagaa tgtataaaat atagatgaat 1629ttatgttaga tcttttgttg aagggaggaa
gattgaaata aggaaaaaga tgtggggagg 1689tgtgtaatgc aaggatttgt tgtttctttg
attaagtttg gccaaaattg tttgttgttg 1749ttattatttg gttacttgat atgaaaggga
aacc 1783108456PRTArabidopsis thaliana
108Met Gly Gln Ser Met Ser Cys Gly Ser Arg Pro Glu His Gly Ile Phe 1
5 10 15Ala Ser Val Gln Cys
Gly Asp Ile Ile Thr Ile Arg Arg Val Met Ala 20
25 30Thr Glu Pro Ser Leu Leu Asn Gln Thr Thr Pro Tyr
Asp Arg His Ser 35 40 45Val Leu
His Val Ala Ala Ala Asn Gly Gln Ile Glu Ile Leu Ser Leu 50
55 60Leu Leu Glu Arg Phe Thr Asn Pro Asp Leu Leu
Asn Arg His Lys Gln 65 70 75
80Thr Pro Leu Met Leu Ala Ala Met Tyr Gly Arg Ile Ser Cys Val Lys
85 90 95Lys Leu Ala Glu
Val Gly Ala Asn Ile Leu Met Phe Asp Ser Val Asn 100
105 110Arg Arg Thr Cys Leu His Tyr Ala Ala Tyr Tyr
Gly His Ala Asn Cys 115 120 125Val
Gln Ala Ile Leu Ser Ala Ala Gln Ser Ser Pro Val Ala Val His 130
135 140Trp Gly Tyr Ala Arg Phe Val Asn Ile Arg
Asp Asp Lys Gly Ala Thr145 150 155
160Pro Leu His Leu Ala Ala Arg Gln Arg Arg Pro Glu Cys Val Asn
Val 165 170 175Leu Leu Asp
Ser Gly Ser Leu Val Cys Ala Ser Thr Ser Val Tyr Gly 180
185 190Ser Pro Gly Ser Thr Pro Leu His Leu Ala
Ala Arg Ser Gly Ser Ile 195 200
205Asp Cys Val Arg Lys Leu Leu Ala Trp Gly Ala Asp Arg Leu Gln Arg 210
215 220Asp Ala Ser Gly Arg Ile Pro Tyr
Val Val Ala Met Lys His Lys His225 230
235 240Gly Ala Cys Gly Ala Leu Leu Asn Pro Ser Ser Ala
Glu Pro Leu Val 245 250
255Trp Pro Ser Pro Leu Lys Phe Ile Ser Glu Leu Asn Asp Glu Ala Lys
260 265 270Leu Leu Leu Glu Gln Ala
Leu Met Glu Ala Asn Arg Glu Arg Glu Lys 275 280
285Thr Ile Leu Lys Gly Thr Ala Tyr Ser Leu Pro Ser Pro Ser
Phe Ser 290 295 300Asp Thr Asp Asp Asn
Met Ser Glu Val Ser Asp Thr Glu Leu Cys Cys305 310
315 320Ile Cys Phe Glu Gln Val Cys Thr Ile Glu
Val Lys Asp Cys Gly His 325 330
335Gln Met Cys Ala Gln Cys Thr Leu Ala Leu Cys Cys His Asn Lys Pro
340 345 350Asn Pro Thr Thr Ser
Thr Val Thr Pro Pro Val Cys Pro Phe Cys Arg 355
360 365Ser Thr Ile Ala Cys Leu Val Val Ala Gln Asn Asn
Asn Asn Asn Asn 370 375 380Glu Lys Ser
Lys Ser Leu Asp Asp Val Val Val Val Asp Arg Glu Ala385
390 395 400Gly Asp Val Ser Ser Ser Lys
Phe Arg Lys His Arg Arg Ser Ile Asn 405
410 415Leu Gly Glu Glu Ser Ser Ser Phe Met Gly Leu Ser
Thr Ile Gly Ser 420 425 430Phe
Gly Arg Ile Thr Gly Arg Gly Ser Gly Arg Ile Ala Ala Glu Asn 435
440 445Glu Leu Met Asp Lys Pro Ile Leu
450 455109714DNAArabidopsis thalianaCDS(1)..(711) 109atg
aag ata caa tgt gat gtg tgt gag aaa gct ccg gcc acg ctt ata 48Met
Lys Ile Gln Cys Asp Val Cys Glu Lys Ala Pro Ala Thr Leu Ile 1
5 10 15tgt tgt gct gat gaa gct gct
ctc tgc gct aaa tgt gac gtt gag gtt 96Cys Cys Ala Asp Glu Ala Ala
Leu Cys Ala Lys Cys Asp Val Glu Val 20 25
30cat gct gct aat aaa ctc gct agc aaa cac caa cgc ctt ttt
ctt gac 144His Ala Ala Asn Lys Leu Ala Ser Lys His Gln Arg Leu Phe
Leu Asp 35 40 45tct ctc tca act
aaa ttc cct ccc tgc gac atc tgc ctt gag aag gca 192Ser Leu Ser Thr
Lys Phe Pro Pro Cys Asp Ile Cys Leu Glu Lys Ala 50
55 60gct ttc ata ttc tgt gta gag gat agg gct ctg ctc tgc
aga gat tgc 240Ala Phe Ile Phe Cys Val Glu Asp Arg Ala Leu Leu Cys
Arg Asp Cys 65 70 75
80gat gag gcg acc cat gcg cca aat act cgc tct gct aat cac cag agg
288Asp Glu Ala Thr His Ala Pro Asn Thr Arg Ser Ala Asn His Gln Arg
85 90 95ttc tta gcc act gga
atc cga gtt gct ctt agt tcc act agt tgc aat 336Phe Leu Ala Thr Gly
Ile Arg Val Ala Leu Ser Ser Thr Ser Cys Asn 100
105 110caa gaa gtg gaa aag aat cac ttt gac cca tct aat
cag cag agt ctc 384Gln Glu Val Glu Lys Asn His Phe Asp Pro Ser Asn
Gln Gln Ser Leu 115 120 125tct aaa
ccg cca act cag caa ccc gct gct cca tct cct ttg tgg gct 432Ser Lys
Pro Pro Thr Gln Gln Pro Ala Ala Pro Ser Pro Leu Trp Ala 130
135 140acc gat gaa ttc ttc agc tac tct gat ctt gac
tgc agt aat aag aaa 480Thr Asp Glu Phe Phe Ser Tyr Ser Asp Leu Asp
Cys Ser Asn Lys Lys145 150 155
160gag caa ctc gat ctc ggg gag ctg gat tgg ctt gca gag atg ggt ctg
528Glu Gln Leu Asp Leu Gly Glu Leu Asp Trp Leu Ala Glu Met Gly Leu
165 170 175ttt ggt gac cag cct
gat caa gag gct cta ccg gta gcc gaa gtt ccc 576Phe Gly Asp Gln Pro
Asp Gln Glu Ala Leu Pro Val Ala Glu Val Pro 180
185 190gag ctt tcc ttt tca cat ttg gct cat gct cat tcc
tac aac aga cct 624Glu Leu Ser Phe Ser His Leu Ala His Ala His Ser
Tyr Asn Arg Pro 195 200 205atg aag
tcc aat gta ccc aac aag aag cag agg ctt gag tac cgg tat 672Met Lys
Ser Asn Val Pro Asn Lys Lys Gln Arg Leu Glu Tyr Arg Tyr 210
215 220gat gat gaa gaa gag cac ttc cta gtc ccc gac
cta ggc taa 714Asp Asp Glu Glu Glu His Phe Leu Val Pro Asp
Leu Gly225 230 235110237PRTArabidopsis
thaliana 110Met Lys Ile Gln Cys Asp Val Cys Glu Lys Ala Pro Ala Thr Leu
Ile 1 5 10 15Cys Cys Ala
Asp Glu Ala Ala Leu Cys Ala Lys Cys Asp Val Glu Val 20
25 30His Ala Ala Asn Lys Leu Ala Ser Lys His
Gln Arg Leu Phe Leu Asp 35 40
45Ser Leu Ser Thr Lys Phe Pro Pro Cys Asp Ile Cys Leu Glu Lys Ala 50
55 60Ala Phe Ile Phe Cys Val Glu Asp Arg
Ala Leu Leu Cys Arg Asp Cys 65 70 75
80Asp Glu Ala Thr His Ala Pro Asn Thr Arg Ser Ala Asn His
Gln Arg 85 90 95Phe Leu
Ala Thr Gly Ile Arg Val Ala Leu Ser Ser Thr Ser Cys Asn 100
105 110Gln Glu Val Glu Lys Asn His Phe Asp
Pro Ser Asn Gln Gln Ser Leu 115 120
125Ser Lys Pro Pro Thr Gln Gln Pro Ala Ala Pro Ser Pro Leu Trp Ala
130 135 140Thr Asp Glu Phe Phe Ser Tyr
Ser Asp Leu Asp Cys Ser Asn Lys Lys145 150
155 160Glu Gln Leu Asp Leu Gly Glu Leu Asp Trp Leu Ala
Glu Met Gly Leu 165 170
175Phe Gly Asp Gln Pro Asp Gln Glu Ala Leu Pro Val Ala Glu Val Pro
180 185 190Glu Leu Ser Phe Ser His
Leu Ala His Ala His Ser Tyr Asn Arg Pro 195 200
205Met Lys Ser Asn Val Pro Asn Lys Lys Gln Arg Leu Glu Tyr
Arg Tyr 210 215 220Asp Asp Glu Glu Glu
His Phe Leu Val Pro Asp Leu Gly225 230
235111829DNAArabidopsis thalianaCDS(99)..(584) 111atcacaacta ctatcacacc
aaactcaaaa aacacaaacc acaagaggat catttcattt 60tttattgttt cgttttaatc
atcatcatca gaagaaaa atg gtt gcg ata tcg gag 116
Met Val Ala Ile Ser Glu
1 5atc aag tcg acg gtg gat gtc acg gcg gcg aat
tgt ttg atg ctt tta 164Ile Lys Ser Thr Val Asp Val Thr Ala Ala Asn
Cys Leu Met Leu Leu 10 15
20tct aga gtt gga caa gaa aac gtt gac ggt ggc gat caa aaa cgc gtt
212Ser Arg Val Gly Gln Glu Asn Val Asp Gly Gly Asp Gln Lys Arg Val
25 30 35ttc aca tgt aaa acg tgt ttg
aag cag ttt cat tcg ttc caa gcc tta 260Phe Thr Cys Lys Thr Cys Leu
Lys Gln Phe His Ser Phe Gln Ala Leu 40 45
50gga ggt cac cgt gcg agt cac aag aag cct aac aac gac gct ttg tcg
308Gly Gly His Arg Ala Ser His Lys Lys Pro Asn Asn Asp Ala Leu Ser 55
60 65 70tct gga ttg atg
aag aag gtg aaa acg tcg tcg cat cct tgt ccc ata 356Ser Gly Leu Met
Lys Lys Val Lys Thr Ser Ser His Pro Cys Pro Ile 75
80 85tgt gga gtg gag ttt ccg atg gga caa gct
ttg gga gga cac atg agg 404Cys Gly Val Glu Phe Pro Met Gly Gln Ala
Leu Gly Gly His Met Arg 90 95
100aga cac agg aac gag agt ggg gct gct ggt ggc gcg ttg gtt aca cgc
452Arg His Arg Asn Glu Ser Gly Ala Ala Gly Gly Ala Leu Val Thr Arg
105 110 115gct ttg ttg ccg gag ccc acg
gtg act acg ttg aag aaa tct agc agt 500Ala Leu Leu Pro Glu Pro Thr
Val Thr Thr Leu Lys Lys Ser Ser Ser 120 125
130ggg aag aga gtg gct tgt ttg gat ctg agt cta ggg atg gtg gac aat
548Gly Lys Arg Val Ala Cys Leu Asp Leu Ser Leu Gly Met Val Asp Asn135
140 145 150ttg aat ctc aag
ttg gag ctt gga aga aca gtt tat tgattttatt 594Leu Asn Leu Lys
Leu Glu Leu Gly Arg Thr Val Tyr 155
160tattttcctt aaattttctg aatatatttg tttctctcat tctttgaatt tttcttaata
654ttctagatta tacatacatc cgcagattta ggaaactttc atagagtgta atcttttctt
714tctgtaaaaa tatattttac ttgtagcatt ggagatttgt tatgagatta tcttacttag
774catttagtga ataatctatt agcctatttt gccgacgcga aaaaaaaaaa aaaaa
829112162PRTArabidopsis thaliana 112Met Val Ala Ile Ser Glu Ile Lys Ser
Thr Val Asp Val Thr Ala Ala 1 5 10
15Asn Cys Leu Met Leu Leu Ser Arg Val Gly Gln Glu Asn Val Asp
Gly 20 25 30Gly Asp Gln Lys
Arg Val Phe Thr Cys Lys Thr Cys Leu Lys Gln Phe 35
40 45His Ser Phe Gln Ala Leu Gly Gly His Arg Ala Ser
His Lys Lys Pro 50 55 60Asn Asn Asp
Ala Leu Ser Ser Gly Leu Met Lys Lys Val Lys Thr Ser 65
70 75 80Ser His Pro Cys Pro Ile Cys Gly
Val Glu Phe Pro Met Gly Gln Ala 85 90
95Leu Gly Gly His Met Arg Arg His Arg Asn Glu Ser Gly Ala
Ala Gly 100 105 110Gly Ala Leu
Val Thr Arg Ala Leu Leu Pro Glu Pro Thr Val Thr Thr 115
120 125Leu Lys Lys Ser Ser Ser Gly Lys Arg Val Ala
Cys Leu Asp Leu Ser 130 135 140Leu Gly
Met Val Asp Asn Leu Asn Leu Lys Leu Glu Leu Gly Arg Thr145
150 155 160Val Tyr113881DNAArabidopsis
thalianaCDS(89)..(769) 113agcaattaaa caatttcttc actgcaattc acaagcaacc
ttcaaactaa aactcgagag 60acaagaaatc ctcagaatct ttaactta atg gcg ctc
gag gct ctt aca tca 112 Met Ala Leu
Glu Ala Leu Thr Ser 1 5cca
aga tta gct tct ccg att cct cct ttg ttc gaa gat tct tca gtc 160Pro
Arg Leu Ala Ser Pro Ile Pro Pro Leu Phe Glu Asp Ser Ser Val 10
15 20ttc cat gga gtc gag cac tgg aca aag ggt
aag cga tct aag aga tca 208Phe His Gly Val Glu His Trp Thr Lys Gly
Lys Arg Ser Lys Arg Ser 25 30 35
40aga tcc gat ttc cac cac caa aac ctc act gag gaa gag tat cta
gct 256Arg Ser Asp Phe His His Gln Asn Leu Thr Glu Glu Glu Tyr Leu
Ala 45 50 55ttt tgc ctc
atg ctt ctc gct cgc gac aac cgt cag cct cct cct cct 304Phe Cys Leu
Met Leu Leu Ala Arg Asp Asn Arg Gln Pro Pro Pro Pro 60
65 70ccg gcg gtg gag aag ttg agc tac aag tgt
agc gtc tgc gac aag acg 352Pro Ala Val Glu Lys Leu Ser Tyr Lys Cys
Ser Val Cys Asp Lys Thr 75 80
85ttc tct tct tac caa gct ctc ggt ggt cac aag gca agc cac cgt aag
400Phe Ser Ser Tyr Gln Ala Leu Gly Gly His Lys Ala Ser His Arg Lys 90
95 100aac tta tca cag act ctc tcc ggc
gga gga gat gat cat tca acc tcg 448Asn Leu Ser Gln Thr Leu Ser Gly
Gly Gly Asp Asp His Ser Thr Ser105 110
115 120tcg gcg aca acc aca tcc gcc gtg act act gga agt
ggg aaa tca cac 496Ser Ala Thr Thr Thr Ser Ala Val Thr Thr Gly Ser
Gly Lys Ser His 125 130
135gtt tgc acc atc tgt aac aag tct ttt cct tcc ggt caa gct ctc ggc
544Val Cys Thr Ile Cys Asn Lys Ser Phe Pro Ser Gly Gln Ala Leu Gly
140 145 150gga cac aag cgg tgc cac
tac gaa gga aac aac aac atc aac act agt 592Gly His Lys Arg Cys His
Tyr Glu Gly Asn Asn Asn Ile Asn Thr Ser 155 160
165agc gtg tcc aac tcc gaa ggt gcg ggg tcc act agc cac gtt
agc agt 640Ser Val Ser Asn Ser Glu Gly Ala Gly Ser Thr Ser His Val
Ser Ser 170 175 180agc cac cgt ggg ttt
gac ctc aac atc cct ccg atc cct gaa ttc tcg 688Ser His Arg Gly Phe
Asp Leu Asn Ile Pro Pro Ile Pro Glu Phe Ser185 190
195 200atg gtc aac gga gac gac gaa gtc atg agc
cct atg ccg gcg aag aag 736Met Val Asn Gly Asp Asp Glu Val Met Ser
Pro Met Pro Ala Lys Lys 205 210
215cct cgg ttt gac ttt ccg gtc aaa ctt caa ctt taaggaaatt tacttagacg
789Pro Arg Phe Asp Phe Pro Val Lys Leu Gln Leu 220
225ataagatttc gtttgtatac tgttgagagt tgtgtaggaa tttgttgact
gtacatacca 849aattggactt tgactcaaaa aaaaaaaaaa aa
881114227PRTArabidopsis thaliana 114Met Ala Leu Glu Ala Leu
Thr Ser Pro Arg Leu Ala Ser Pro Ile Pro 1 5
10 15Pro Leu Phe Glu Asp Ser Ser Val Phe His Gly Val
Glu His Trp Thr 20 25 30Lys
Gly Lys Arg Ser Lys Arg Ser Arg Ser Asp Phe His His Gln Asn 35
40 45Leu Thr Glu Glu Glu Tyr Leu Ala Phe
Cys Leu Met Leu Leu Ala Arg 50 55
60Asp Asn Arg Gln Pro Pro Pro Pro Pro Ala Val Glu Lys Leu Ser Tyr 65
70 75 80Lys Cys Ser Val Cys
Asp Lys Thr Phe Ser Ser Tyr Gln Ala Leu Gly 85
90 95Gly His Lys Ala Ser His Arg Lys Asn Leu Ser
Gln Thr Leu Ser Gly 100 105
110Gly Gly Asp Asp His Ser Thr Ser Ser Ala Thr Thr Thr Ser Ala Val
115 120 125Thr Thr Gly Ser Gly Lys Ser
His Val Cys Thr Ile Cys Asn Lys Ser 130 135
140Phe Pro Ser Gly Gln Ala Leu Gly Gly His Lys Arg Cys His Tyr
Glu145 150 155 160Gly Asn
Asn Asn Ile Asn Thr Ser Ser Val Ser Asn Ser Glu Gly Ala
165 170 175Gly Ser Thr Ser His Val Ser
Ser Ser His Arg Gly Phe Asp Leu Asn 180 185
190Ile Pro Pro Ile Pro Glu Phe Ser Met Val Asn Gly Asp Asp
Glu Val 195 200 205Met Ser Pro Met
Pro Ala Lys Lys Pro Arg Phe Asp Phe Pro Val Lys 210
215 220Leu Gln Leu2251151107DNAArabidopsis
thalianaCDS(139)..(957) 115aactcggtcc gagtcgactc ggtccgacaa gcgattttcg
tgactccgtc gtcaccacgc 60ctatgttaag tgagtaaatc cgactcttct ccgtagagat
ttctcctact ttcaggctct 120cttgtgaatt tggaagac atg agc ttt gtt ttc cgg
gga agt aga gga gat 171 Met Ser Phe Val Phe Arg
Gly Ser Arg Gly Asp 1 5
10tta gaa agc gga ttc tcg ggt ggt ttt cta ccc gaa aga cga gct atg
219Leu Glu Ser Gly Phe Ser Gly Gly Phe Leu Pro Glu Arg Arg Ala Met
15 20 25cgt gtt cat gga gct cga
cca gtt aat tct aat tcc ctc gct ttt ctg 267Arg Val His Gly Ala Arg
Pro Val Asn Ser Asn Ser Leu Ala Phe Leu 30 35
40gtt aca gtt ctt ttg ctg ttt atg att ctc aat tcg cat cag
atg cct 315Val Thr Val Leu Leu Leu Phe Met Ile Leu Asn Ser His Gln
Met Pro 45 50 55cct aat ttc ctg ctg
tgg ctt gtg ctt ggg gtg ttt ttg atg gca acg 363Pro Asn Phe Leu Leu
Trp Leu Val Leu Gly Val Phe Leu Met Ala Thr 60 65
70 75acg ctt agg atg tat gct act tgc caa caa
ctt caa gct cat gct cag 411Thr Leu Arg Met Tyr Ala Thr Cys Gln Gln
Leu Gln Ala His Ala Gln 80 85
90gct cag gct gca gca gca agt ggc ctc ttt agc cat act gag ctg agg
459Ala Gln Ala Ala Ala Ala Ser Gly Leu Phe Ser His Thr Glu Leu Arg
95 100 105ttg cat gtg cct cct tcc
att gct ctt gct acg aga ggg cgt ctt cag 507Leu His Val Pro Pro Ser
Ile Ala Leu Ala Thr Arg Gly Arg Leu Gln 110 115
120gga ctt agg ctc cag ctg gct ctt ctt gat cgg gaa ttt gat
gac tta 555Gly Leu Arg Leu Gln Leu Ala Leu Leu Asp Arg Glu Phe Asp
Asp Leu 125 130 135gat tat gaa act tta
aga gca ctt gat tct gat aat gtt tcc aca act 603Asp Tyr Glu Thr Leu
Arg Ala Leu Asp Ser Asp Asn Val Ser Thr Thr140 145
150 155tct atg agc gag gaa gag ata aat gca ctt
cca gtt cac aag tac aag 651Ser Met Ser Glu Glu Glu Ile Asn Ala Leu
Pro Val His Lys Tyr Lys 160 165
170gtg ttg gat cct gaa aat ggt tgc tct ttg gca aag caa gcg tca acc
699Val Leu Asp Pro Glu Asn Gly Cys Ser Leu Ala Lys Gln Ala Ser Thr
175 180 185tca tcc tca gct gag aag
atg cta gat tct gcc aat gaa agt aaa aaa 747Ser Ser Ser Ala Glu Lys
Met Leu Asp Ser Ala Asn Glu Ser Lys Lys 190 195
200gga aca gaa gat gag ctc aca tgt agt gtt tgc cta gaa caa
gtt acc 795Gly Thr Glu Asp Glu Leu Thr Cys Ser Val Cys Leu Glu Gln
Val Thr 205 210 215gta ggg gaa atc gtt
cgc acc tta cct tgt ttg cat cag ttt cat gca 843Val Gly Glu Ile Val
Arg Thr Leu Pro Cys Leu His Gln Phe His Ala220 225
230 235gga tgt atc gat cca tgg ttg aga cag caa
gga aca tgt cct gtc tgt 891Gly Cys Ile Asp Pro Trp Leu Arg Gln Gln
Gly Thr Cys Pro Val Cys 240 245
250aaa ttt aga gct cat tca gga tgg caa gaa caa gat gag att gat gat
939Lys Phe Arg Ala His Ser Gly Trp Gln Glu Gln Asp Glu Ile Asp Asp
255 260 265gat gct tcc gac atg gtt
tgaaaagttt ggtttcgaac acttgtttat 987Asp Ala Ser Asp Met Val
270gttattaatg tgcgtgcgaa ttaaacccca aacaaatcat gtaatggtca ataactcaat
1047atgatgtaac tgatgactct ccctttccaa aagtttgatt tagattcatc atgaaaaagt
1107116273PRTArabidopsis thaliana 116Met Ser Phe Val Phe Arg Gly Ser Arg
Gly Asp Leu Glu Ser Gly Phe 1 5 10
15Ser Gly Gly Phe Leu Pro Glu Arg Arg Ala Met Arg Val His Gly
Ala 20 25 30Arg Pro Val Asn
Ser Asn Ser Leu Ala Phe Leu Val Thr Val Leu Leu 35
40 45Leu Phe Met Ile Leu Asn Ser His Gln Met Pro Pro
Asn Phe Leu Leu 50 55 60Trp Leu Val
Leu Gly Val Phe Leu Met Ala Thr Thr Leu Arg Met Tyr 65
70 75 80Ala Thr Cys Gln Gln Leu Gln Ala
His Ala Gln Ala Gln Ala Ala Ala 85 90
95Ala Ser Gly Leu Phe Ser His Thr Glu Leu Arg Leu His Val
Pro Pro 100 105 110Ser Ile Ala
Leu Ala Thr Arg Gly Arg Leu Gln Gly Leu Arg Leu Gln 115
120 125Leu Ala Leu Leu Asp Arg Glu Phe Asp Asp Leu
Asp Tyr Glu Thr Leu 130 135 140Arg Ala
Leu Asp Ser Asp Asn Val Ser Thr Thr Ser Met Ser Glu Glu145
150 155 160Glu Ile Asn Ala Leu Pro Val
His Lys Tyr Lys Val Leu Asp Pro Glu 165
170 175Asn Gly Cys Ser Leu Ala Lys Gln Ala Ser Thr Ser
Ser Ser Ala Glu 180 185 190Lys
Met Leu Asp Ser Ala Asn Glu Ser Lys Lys Gly Thr Glu Asp Glu 195
200 205Leu Thr Cys Ser Val Cys Leu Glu Gln
Val Thr Val Gly Glu Ile Val 210 215
220Arg Thr Leu Pro Cys Leu His Gln Phe His Ala Gly Cys Ile Asp Pro225
230 235 240Trp Leu Arg Gln
Gln Gly Thr Cys Pro Val Cys Lys Phe Arg Ala His 245
250 255Ser Gly Trp Gln Glu Gln Asp Glu Ile Asp
Asp Asp Ala Ser Asp Met 260 265
270Val117531DNAArabidopsis thalianaCDS(1)..(528) 117atg gga tcg gaa caa
aac gat agc aca agc ttc acg caa tcg caa gct 48Met Gly Ser Glu Gln
Asn Asp Ser Thr Ser Phe Thr Gln Ser Gln Ala 1 5
10 15tca gag cca aag cta tgt gtt aaa gga tgt ggt
ttc ttt gga tca cca 96Ser Glu Pro Lys Leu Cys Val Lys Gly Cys Gly
Phe Phe Gly Ser Pro 20 25
30tca aac atg gat ctc tgt tct aaa tgt tac aga ggc att tgt gct gag
144Ser Asn Met Asp Leu Cys Ser Lys Cys Tyr Arg Gly Ile Cys Ala Glu
35 40 45gaa gct caa aca gca gtt gct
aaa gct gct gtt gaa aaa tct ttc aag 192Glu Ala Gln Thr Ala Val Ala
Lys Ala Ala Val Glu Lys Ser Phe Lys 50 55
60cct tct cct cct cgt agt ctc ttc ata gca gaa cct cct gct gtt gtt
240Pro Ser Pro Pro Arg Ser Leu Phe Ile Ala Glu Pro Pro Ala Val Val 65
70 75 80gtg gaa ccc aaa
ccc gaa aag gcg gca gtt gtt gtt gtc tcg gcc gag 288Val Glu Pro Lys
Pro Glu Lys Ala Ala Val Val Val Val Ser Ala Glu 85
90 95cca tct tcc tcg gcg gtt cct gag gcg aac
gag cca tcg aga cct gca 336Pro Ser Ser Ser Ala Val Pro Glu Ala Asn
Glu Pro Ser Arg Pro Ala 100 105
110cga acc aac cgg tgt ttg tgt tgt aac aag aag gtt ggg atc atg ggg
384Arg Thr Asn Arg Cys Leu Cys Cys Asn Lys Lys Val Gly Ile Met Gly
115 120 125ttt aag tgc aaa tgc ggg agc
act ttc tgc ggc gaa cat cgg tac ccg 432Phe Lys Cys Lys Cys Gly Ser
Thr Phe Cys Gly Glu His Arg Tyr Pro 130 135
140gag act cat gat tgc agc ttt gat ttc aaa gaa gtt gga cgt gga gag
480Glu Thr His Asp Cys Ser Phe Asp Phe Lys Glu Val Gly Arg Gly Glu145
150 155 160att gcc aaa gct
aat cct gtg gtt aag gct gat aaa att caa agg ttc 528Ile Ala Lys Ala
Asn Pro Val Val Lys Ala Asp Lys Ile Gln Arg Phe 165
170 175tga
531118176PRTArabidopsis thaliana 118Met Gly
Ser Glu Gln Asn Asp Ser Thr Ser Phe Thr Gln Ser Gln Ala 1
5 10 15Ser Glu Pro Lys Leu Cys Val Lys
Gly Cys Gly Phe Phe Gly Ser Pro 20 25
30Ser Asn Met Asp Leu Cys Ser Lys Cys Tyr Arg Gly Ile Cys Ala
Glu 35 40 45Glu Ala Gln Thr Ala
Val Ala Lys Ala Ala Val Glu Lys Ser Phe Lys 50 55
60Pro Ser Pro Pro Arg Ser Leu Phe Ile Ala Glu Pro Pro Ala
Val Val 65 70 75 80Val
Glu Pro Lys Pro Glu Lys Ala Ala Val Val Val Val Ser Ala Glu
85 90 95Pro Ser Ser Ser Ala Val Pro
Glu Ala Asn Glu Pro Ser Arg Pro Ala 100 105
110Arg Thr Asn Arg Cys Leu Cys Cys Asn Lys Lys Val Gly Ile
Met Gly 115 120 125Phe Lys Cys Lys
Cys Gly Ser Thr Phe Cys Gly Glu His Arg Tyr Pro 130
135 140Glu Thr His Asp Cys Ser Phe Asp Phe Lys Glu Val
Gly Arg Gly Glu145 150 155
160Ile Ala Lys Ala Asn Pro Val Val Lys Ala Asp Lys Ile Gln Arg Phe
165 170 1751191068DNAArabidopsis
thalianaCDS(1)..(1065) 119atg ttg aaa gta gag agt aac tgg gca caa gcc tgt
gat aca tgc cga 48Met Leu Lys Val Glu Ser Asn Trp Ala Gln Ala Cys
Asp Thr Cys Arg 1 5 10
15tca gcc gcc tgc acc gtg tac tgc cgg gct gat tct gcc tac ttg tgc
96Ser Ala Ala Cys Thr Val Tyr Cys Arg Ala Asp Ser Ala Tyr Leu Cys
20 25 30tcc agt tgt gat gct caa gtt
cat gct gcc aat cgt ctt gct tcc cgc 144Ser Ser Cys Asp Ala Gln Val
His Ala Ala Asn Arg Leu Ala Ser Arg 35 40
45cat gaa cgt gtt cga gtc tgt caa tca tgt gag cga gcc ccg gct
gcc 192His Glu Arg Val Arg Val Cys Gln Ser Cys Glu Arg Ala Pro Ala
Ala 50 55 60ttt ttc tgc aag gca gat
gct gca tct cta tgc aca acc tgt gat tca 240Phe Phe Cys Lys Ala Asp
Ala Ala Ser Leu Cys Thr Thr Cys Asp Ser 65 70
75 80gag att cat tcc gca aac cca ctt gct aga cgc
cat caa cga gtt cca 288Glu Ile His Ser Ala Asn Pro Leu Ala Arg Arg
His Gln Arg Val Pro 85 90
95att ctg ccc att tct gag tac tct tac agt tcc acg gcc act aac cat
336Ile Leu Pro Ile Ser Glu Tyr Ser Tyr Ser Ser Thr Ala Thr Asn His
100 105 110tca tgt gag aca aca gtg
aca gat cca gag aac aga ctt gtg ctt ggt 384Ser Cys Glu Thr Thr Val
Thr Asp Pro Glu Asn Arg Leu Val Leu Gly 115 120
125caa gaa gaa gag gat gaa gat gaa gca gag gcg gct tca tgg
ttg ttg 432Gln Glu Glu Glu Asp Glu Asp Glu Ala Glu Ala Ala Ser Trp
Leu Leu 130 135 140cct aat tca ggg aaa
aac agt ggt aac aac aat ggc ttc tcg att ggg 480Pro Asn Ser Gly Lys
Asn Ser Gly Asn Asn Asn Gly Phe Ser Ile Gly145 150
155 160gat gag ttt ctg aac ctt gtt gat tat agt
tcg agt gat aag caa ttc 528Asp Glu Phe Leu Asn Leu Val Asp Tyr Ser
Ser Ser Asp Lys Gln Phe 165 170
175aca gat caa tcc aat cag tat caa cta gac tgc aac gta cct cag agg
576Thr Asp Gln Ser Asn Gln Tyr Gln Leu Asp Cys Asn Val Pro Gln Arg
180 185 190agc tat ggg gaa gat gga
gtt gtt cca ctt caa att gaa gta tca aag 624Ser Tyr Gly Glu Asp Gly
Val Val Pro Leu Gln Ile Glu Val Ser Lys 195 200
205ggc atg tac caa gag caa cag aac ttt cag ctg agt atc aac
tgt ggc 672Gly Met Tyr Gln Glu Gln Gln Asn Phe Gln Leu Ser Ile Asn
Cys Gly 210 215 220tcc tgg gga gct ctt
cga agc tcc aat ggt tcc ctc agt cat atg gtg 720Ser Trp Gly Ala Leu
Arg Ser Ser Asn Gly Ser Leu Ser His Met Val225 230
235 240aat gtt tca tct atg gac ctg gga gtt gtg
ccg gag tca aca acg agt 768Asn Val Ser Ser Met Asp Leu Gly Val Val
Pro Glu Ser Thr Thr Ser 245 250
255gac gca aca gta tca aac cca aga tcg ccc aaa gcg gta aca gac caa
816Asp Ala Thr Val Ser Asn Pro Arg Ser Pro Lys Ala Val Thr Asp Gln
260 265 270cca cct tac cct cca gct
cag atg ctc agt cca agg gac aga gaa gct 864Pro Pro Tyr Pro Pro Ala
Gln Met Leu Ser Pro Arg Asp Arg Glu Ala 275 280
285aga gtc ctg aga tac aga gag aag aag aag atg agg aaa ttt
gag aag 912Arg Val Leu Arg Tyr Arg Glu Lys Lys Lys Met Arg Lys Phe
Glu Lys 290 295 300acg ata aga tat gct
tca agg aaa gcg tat gca gag aaa aga cca cgg 960Thr Ile Arg Tyr Ala
Ser Arg Lys Ala Tyr Ala Glu Lys Arg Pro Arg305 310
315 320atc aag ggc cgg ttt gca aag aag aaa gat
gtc gat gaa gag gca aac 1008Ile Lys Gly Arg Phe Ala Lys Lys Lys Asp
Val Asp Glu Glu Ala Asn 325 330
335caa gct ttc tcc aca atg ata aca ttt gac acc gga tat gga att gtt
1056Gln Ala Phe Ser Thr Met Ile Thr Phe Asp Thr Gly Tyr Gly Ile Val
340 345 350cca tca ttc tga
1068Pro Ser Phe
355120355PRTArabidopsis thaliana 120Met Leu Lys Val Glu Ser Asn Trp Ala
Gln Ala Cys Asp Thr Cys Arg 1 5 10
15Ser Ala Ala Cys Thr Val Tyr Cys Arg Ala Asp Ser Ala Tyr Leu
Cys 20 25 30Ser Ser Cys Asp
Ala Gln Val His Ala Ala Asn Arg Leu Ala Ser Arg 35
40 45His Glu Arg Val Arg Val Cys Gln Ser Cys Glu Arg
Ala Pro Ala Ala 50 55 60Phe Phe Cys
Lys Ala Asp Ala Ala Ser Leu Cys Thr Thr Cys Asp Ser 65
70 75 80Glu Ile His Ser Ala Asn Pro Leu
Ala Arg Arg His Gln Arg Val Pro 85 90
95Ile Leu Pro Ile Ser Glu Tyr Ser Tyr Ser Ser Thr Ala Thr
Asn His 100 105 110Ser Cys Glu
Thr Thr Val Thr Asp Pro Glu Asn Arg Leu Val Leu Gly 115
120 125Gln Glu Glu Glu Asp Glu Asp Glu Ala Glu Ala
Ala Ser Trp Leu Leu 130 135 140Pro Asn
Ser Gly Lys Asn Ser Gly Asn Asn Asn Gly Phe Ser Ile Gly145
150 155 160Asp Glu Phe Leu Asn Leu Val
Asp Tyr Ser Ser Ser Asp Lys Gln Phe 165
170 175Thr Asp Gln Ser Asn Gln Tyr Gln Leu Asp Cys Asn
Val Pro Gln Arg 180 185 190Ser
Tyr Gly Glu Asp Gly Val Val Pro Leu Gln Ile Glu Val Ser Lys 195
200 205Gly Met Tyr Gln Glu Gln Gln Asn Phe
Gln Leu Ser Ile Asn Cys Gly 210 215
220Ser Trp Gly Ala Leu Arg Ser Ser Asn Gly Ser Leu Ser His Met Val225
230 235 240Asn Val Ser Ser
Met Asp Leu Gly Val Val Pro Glu Ser Thr Thr Ser 245
250 255Asp Ala Thr Val Ser Asn Pro Arg Ser Pro
Lys Ala Val Thr Asp Gln 260 265
270Pro Pro Tyr Pro Pro Ala Gln Met Leu Ser Pro Arg Asp Arg Glu Ala
275 280 285Arg Val Leu Arg Tyr Arg Glu
Lys Lys Lys Met Arg Lys Phe Glu Lys 290 295
300Thr Ile Arg Tyr Ala Ser Arg Lys Ala Tyr Ala Glu Lys Arg Pro
Arg305 310 315 320Ile Lys
Gly Arg Phe Ala Lys Lys Lys Asp Val Asp Glu Glu Ala Asn
325 330 335Gln Ala Phe Ser Thr Met Ile
Thr Phe Asp Thr Gly Tyr Gly Ile Val 340 345
350Pro Ser Phe 3551211116DNAArabidopsis
thalianaCDS(191)..(934) 121atcccaccta cttgttcccc acaaaacact ctctccctct
ttgttctttc atcttctcta 60agctctttct ctgaacctac gcttctgcta agctattcta
agagaagcca gactagcaat 120aaacccttca ttttaagcat tctgtttcct tcttgagaaa
cctagatatt ttggtttctt 180gtatccggtg atg aag ata cag tgt gat gtg tgt
gag aaa gct ccg gcg 229 Met Lys Ile Gln Cys Asp Val Cys
Glu Lys Ala Pro Ala 1 5 10acg
gtg att tgt tgc gcc gac gaa gct gct ctc tgt cct caa tgc gac 277Thr
Val Ile Cys Cys Ala Asp Glu Ala Ala Leu Cys Pro Gln Cys Asp 15
20 25atc gag att cac gcc gct aac aaa ctc gct
agc aag cac caa cgt ctt 325Ile Glu Ile His Ala Ala Asn Lys Leu Ala
Ser Lys His Gln Arg Leu 30 35 40
45cat ctt aat tcc ctc tcc acc aaa ttc cct cgt tgc gat atc tgc
caa 373His Leu Asn Ser Leu Ser Thr Lys Phe Pro Arg Cys Asp Ile Cys
Gln 50 55 60gag aag gca
gct ttc att ttc tgt gta gag gat aga gct ctg ctt tgc 421Glu Lys Ala
Ala Phe Ile Phe Cys Val Glu Asp Arg Ala Leu Leu Cys 65
70 75agg gac tgc gat gaa tcc atc cac gtg gct
aat tct cga tct gct aat 469Arg Asp Cys Asp Glu Ser Ile His Val Ala
Asn Ser Arg Ser Ala Asn 80 85
90cac cag agg ttc tta gcc act ggg atc aaa gta gct ctg acc tca act
517His Gln Arg Phe Leu Ala Thr Gly Ile Lys Val Ala Leu Thr Ser Thr 95
100 105ata tgt agt aaa gaa att gag aag
aat caa cct gag cct tcc aac aac 565Ile Cys Ser Lys Glu Ile Glu Lys
Asn Gln Pro Glu Pro Ser Asn Asn110 115
120 125caa cag aag gct aat cag att cct gct aaa tcc aca
agc cag cag caa 613Gln Gln Lys Ala Asn Gln Ile Pro Ala Lys Ser Thr
Ser Gln Gln Gln 130 135
140caa caa cct tct tct gct act cca ctt ccc tgg gct gtt gac gat ttc
661Gln Gln Pro Ser Ser Ala Thr Pro Leu Pro Trp Ala Val Asp Asp Phe
145 150 155ttt cac ttc tct gat att
gaa tcc acc gac aag aaa gga cag ctt gat 709Phe His Phe Ser Asp Ile
Glu Ser Thr Asp Lys Lys Gly Gln Leu Asp 160 165
170ctt ggg gca ggg gag ttg gat tgg ttt tca gac atg gga ttc
ttc ggt 757Leu Gly Ala Gly Glu Leu Asp Trp Phe Ser Asp Met Gly Phe
Phe Gly 175 180 185gat cag att aat gac
aag gct ctt cct gca gct gaa gtt cct gag ctt 805Asp Gln Ile Asn Asp
Lys Ala Leu Pro Ala Ala Glu Val Pro Glu Leu190 195
200 205tct gtt tcg cat tta ggt cat gtt cat tca
tac aaa cct atg aag tca 853Ser Val Ser His Leu Gly His Val His Ser
Tyr Lys Pro Met Lys Ser 210 215
220aat gtt tca cac aag aag ccg agg ttt gag acc aga tat gat gat gat
901Asn Val Ser His Lys Lys Pro Arg Phe Glu Thr Arg Tyr Asp Asp Asp
225 230 235gat gag gaa cac ttc att
gtc cct gat ctt ggc taaaaagcta tatgtaatct 954Asp Glu Glu His Phe Ile
Val Pro Asp Leu Gly 240 245atgtgtagac attcttcaat
gtaaaagaac aaacaagaaa cctatctgca tgtgtggagt 1014taatgtcata tacattttag
ttttgtctta agttgtgtaa gatatgttga gagcttataa 1074caaatgtctg tgtttgagtt
ttgttcaaaa aaaaaaaaaa aa 1116122248PRTArabidopsis
thaliana 122Met Lys Ile Gln Cys Asp Val Cys Glu Lys Ala Pro Ala Thr Val
Ile 1 5 10 15Cys Cys Ala
Asp Glu Ala Ala Leu Cys Pro Gln Cys Asp Ile Glu Ile 20
25 30His Ala Ala Asn Lys Leu Ala Ser Lys His
Gln Arg Leu His Leu Asn 35 40
45Ser Leu Ser Thr Lys Phe Pro Arg Cys Asp Ile Cys Gln Glu Lys Ala 50
55 60Ala Phe Ile Phe Cys Val Glu Asp Arg
Ala Leu Leu Cys Arg Asp Cys 65 70 75
80Asp Glu Ser Ile His Val Ala Asn Ser Arg Ser Ala Asn His
Gln Arg 85 90 95Phe Leu
Ala Thr Gly Ile Lys Val Ala Leu Thr Ser Thr Ile Cys Ser 100
105 110Lys Glu Ile Glu Lys Asn Gln Pro Glu
Pro Ser Asn Asn Gln Gln Lys 115 120
125Ala Asn Gln Ile Pro Ala Lys Ser Thr Ser Gln Gln Gln Gln Gln Pro
130 135 140Ser Ser Ala Thr Pro Leu Pro
Trp Ala Val Asp Asp Phe Phe His Phe145 150
155 160Ser Asp Ile Glu Ser Thr Asp Lys Lys Gly Gln Leu
Asp Leu Gly Ala 165 170
175Gly Glu Leu Asp Trp Phe Ser Asp Met Gly Phe Phe Gly Asp Gln Ile
180 185 190Asn Asp Lys Ala Leu Pro
Ala Ala Glu Val Pro Glu Leu Ser Val Ser 195 200
205His Leu Gly His Val His Ser Tyr Lys Pro Met Lys Ser Asn
Val Ser 210 215 220His Lys Lys Pro Arg
Phe Glu Thr Arg Tyr Asp Asp Asp Asp Glu Glu225 230
235 240His Phe Ile Val Pro Asp Leu Gly
245123909DNAArabidopsis thalianaCDS(1)..(906) 123atg gat cag tac tca
tcc tct ttg gtc gat act tca tta gat ctc act 48Met Asp Gln Tyr Ser
Ser Ser Leu Val Asp Thr Ser Leu Asp Leu Thr 1 5
10 15att ggc gtt act cgt atg cga gtt gaa gaa gat
cca ccg aca agt gct 96Ile Gly Val Thr Arg Met Arg Val Glu Glu Asp
Pro Pro Thr Ser Ala 20 25
30ttg gtg gaa gaa tta aac cga gtt agt gct gag aac aag aag ctc tcg
144Leu Val Glu Glu Leu Asn Arg Val Ser Ala Glu Asn Lys Lys Leu Ser
35 40 45gag atg cta act ttg atg tgt
gac aac tac aac gtc ttg agg aag caa 192Glu Met Leu Thr Leu Met Cys
Asp Asn Tyr Asn Val Leu Arg Lys Gln 50 55
60ctt atg gaa tat gtt aac aag agc aac ata acc gag agg gat caa atc
240Leu Met Glu Tyr Val Asn Lys Ser Asn Ile Thr Glu Arg Asp Gln Ile 65
70 75 80agc cct ccc aag
aaa cgc aaa tcc ccg gcg aga gag gac gca ttc agc 288Ser Pro Pro Lys
Lys Arg Lys Ser Pro Ala Arg Glu Asp Ala Phe Ser 85
90 95tgc gcg gtt att ggc gga gtg tcg gag agt
agc tca acg gat caa gat 336Cys Ala Val Ile Gly Gly Val Ser Glu Ser
Ser Ser Thr Asp Gln Asp 100 105
110gag tat ttg tgt aag aag cag aga gaa gag act gtc gtg aag gag aaa
384Glu Tyr Leu Cys Lys Lys Gln Arg Glu Glu Thr Val Val Lys Glu Lys
115 120 125gtc tca agg gtc tat tac aag
acc gaa gct tct gac act acc ctc gtt 432Val Ser Arg Val Tyr Tyr Lys
Thr Glu Ala Ser Asp Thr Thr Leu Val 130 135
140gtg aaa gat ggg tat caa tgg agg aaa tat gga cag aaa gtg act aga
480Val Lys Asp Gly Tyr Gln Trp Arg Lys Tyr Gly Gln Lys Val Thr Arg145
150 155 160gac aat cca tct
cca aga gct tac ttc aaa tgt gct tgt gct cca agc 528Asp Asn Pro Ser
Pro Arg Ala Tyr Phe Lys Cys Ala Cys Ala Pro Ser 165
170 175tgt tct gtc aaa aag aag gtt cag aga agt
gtg gag gat cag tcc gtg 576Cys Ser Val Lys Lys Lys Val Gln Arg Ser
Val Glu Asp Gln Ser Val 180 185
190tta gtt gca act tat gag ggt gaa cac aac cat cca atg cca tcg cag
624Leu Val Ala Thr Tyr Glu Gly Glu His Asn His Pro Met Pro Ser Gln
195 200 205atc gat tca aac aat ggc tta
aac cgc cac atc tct cat ggt ggt tca 672Ile Asp Ser Asn Asn Gly Leu
Asn Arg His Ile Ser His Gly Gly Ser 210 215
220gct tca aca ccc gtt gca gca aac aga aga agt agc ttg act gtg ccg
720Ala Ser Thr Pro Val Ala Ala Asn Arg Arg Ser Ser Leu Thr Val Pro225
230 235 240gtg act acc gta
gat atg att gaa tcg aag aaa gtg acg agc cca acg 768Val Thr Thr Val
Asp Met Ile Glu Ser Lys Lys Val Thr Ser Pro Thr 245
250 255tca aga atc gat ttt ccc caa gtt cag aaa
ctt ttg gtg gag caa atg 816Ser Arg Ile Asp Phe Pro Gln Val Gln Lys
Leu Leu Val Glu Gln Met 260 265
270gct tct tcc tta acc aaa gat cct aac ttt aca gca gct tta gca gca
864Ala Ser Ser Leu Thr Lys Asp Pro Asn Phe Thr Ala Ala Leu Ala Ala
275 280 285gct gtt acc gga aaa ttg tat
caa cag aat cat acc gag aaa tag 909Ala Val Thr Gly Lys Leu Tyr
Gln Gln Asn His Thr Glu Lys 290 295
300124302PRTArabidopsis thaliana 124Met Asp Gln Tyr Ser Ser Ser Leu Val
Asp Thr Ser Leu Asp Leu Thr 1 5 10
15Ile Gly Val Thr Arg Met Arg Val Glu Glu Asp Pro Pro Thr Ser
Ala 20 25 30Leu Val Glu Glu
Leu Asn Arg Val Ser Ala Glu Asn Lys Lys Leu Ser 35
40 45Glu Met Leu Thr Leu Met Cys Asp Asn Tyr Asn Val
Leu Arg Lys Gln 50 55 60Leu Met Glu
Tyr Val Asn Lys Ser Asn Ile Thr Glu Arg Asp Gln Ile 65
70 75 80Ser Pro Pro Lys Lys Arg Lys Ser
Pro Ala Arg Glu Asp Ala Phe Ser 85 90
95Cys Ala Val Ile Gly Gly Val Ser Glu Ser Ser Ser Thr Asp
Gln Asp 100 105 110Glu Tyr Leu
Cys Lys Lys Gln Arg Glu Glu Thr Val Val Lys Glu Lys 115
120 125Val Ser Arg Val Tyr Tyr Lys Thr Glu Ala Ser
Asp Thr Thr Leu Val 130 135 140Val Lys
Asp Gly Tyr Gln Trp Arg Lys Tyr Gly Gln Lys Val Thr Arg145
150 155 160Asp Asn Pro Ser Pro Arg Ala
Tyr Phe Lys Cys Ala Cys Ala Pro Ser 165
170 175Cys Ser Val Lys Lys Lys Val Gln Arg Ser Val Glu
Asp Gln Ser Val 180 185 190Leu
Val Ala Thr Tyr Glu Gly Glu His Asn His Pro Met Pro Ser Gln 195
200 205Ile Asp Ser Asn Asn Gly Leu Asn Arg
His Ile Ser His Gly Gly Ser 210 215
220Ala Ser Thr Pro Val Ala Ala Asn Arg Arg Ser Ser Leu Thr Val Pro225
230 235 240Val Thr Thr Val
Asp Met Ile Glu Ser Lys Lys Val Thr Ser Pro Thr 245
250 255Ser Arg Ile Asp Phe Pro Gln Val Gln Lys
Leu Leu Val Glu Gln Met 260 265
270Ala Ser Ser Leu Thr Lys Asp Pro Asn Phe Thr Ala Ala Leu Ala Ala
275 280 285Ala Val Thr Gly Lys Leu Tyr
Gln Gln Asn His Thr Glu Lys 290 295
3001251462DNAArabidopsis thalianaCDS(382)..(1161) 125accgaccttc
ttggttcttc cggcgttgac tgttacgaag atgatgaaga cttgagagtt 60tctgggtcga
gttttggtgg gtactatcca gagagaaccg ggtctggttt acctaagttc 120aagacggctc
aaccaccacc tcttccgatt tcacaatctt ctcataactt cactttctcc 180gattaccttg
attctcctct gcttctcagc tcctcacaca gtttgatatc tccaacaaca 240ggaacgtttc
cattgcaagg ctttaatgga acaacaaaca atcactcaga ttttccctgg 300cagctacaat
ctcaaccatc aaacgcttct tctgctttgc aagaaacata tggtgttcaa 360gatcacgaga
agaagcagga g atg att cct aat gag att gca aca caa aac 411
Met Ile Pro Asn Glu Ile Ala Thr Gln Asn
1 5 10aac aat caa agt ttt gga aca gaa
cgt cag ata aag ata cca gca tac 459Asn Asn Gln Ser Phe Gly Thr Glu
Arg Gln Ile Lys Ile Pro Ala Tyr 15 20
25atg gtg agt agg aac tct aat gat ggt tat ggt tgg aga aaa
tac ggt 507Met Val Ser Arg Asn Ser Asn Asp Gly Tyr Gly Trp Arg Lys
Tyr Gly 30 35 40cag aaa caa
gtg aag aag agc gaa aac cct agg agt tac ttc aag tgt 555Gln Lys Gln
Val Lys Lys Ser Glu Asn Pro Arg Ser Tyr Phe Lys Cys 45
50 55acg tat cct gat tgt gtt tcc aag aag att gtt
gag acg gct tct gat 603Thr Tyr Pro Asp Cys Val Ser Lys Lys Ile Val
Glu Thr Ala Ser Asp 60 65 70gga cag
atc act gag atc att tat aaa ggt ggt cat aat cat cct aag 651Gly Gln
Ile Thr Glu Ile Ile Tyr Lys Gly Gly His Asn His Pro Lys 75
80 85 90cct gag ttc acc aag aga cca
tct caa tct tca tta cca tca tcg gtt 699Pro Glu Phe Thr Lys Arg Pro
Ser Gln Ser Ser Leu Pro Ser Ser Val 95
100 105aat ggg agg cgc ttg ttt aat cct gct tct gtt gtt
agt gaa cct cat 747Asn Gly Arg Arg Leu Phe Asn Pro Ala Ser Val Val
Ser Glu Pro His 110 115 120gat
caa tca gag aac tct tcg att tcg ttt gac tat agt gat ctt gag 795Asp
Gln Ser Glu Asn Ser Ser Ile Ser Phe Asp Tyr Ser Asp Leu Glu 125
130 135cag aaa agt ttt aaa tca gag tat ggt
gag ata gat gaa gag gag gaa 843Gln Lys Ser Phe Lys Ser Glu Tyr Gly
Glu Ile Asp Glu Glu Glu Glu 140 145
150caa cct gag atg aag agg atg aaa aga gaa ggt gaa gat gaa ggg atg
891Gln Pro Glu Met Lys Arg Met Lys Arg Glu Gly Glu Asp Glu Gly Met155
160 165 170tct ata gaa gta
agc aaa gga gtt aaa gag cca aga gtt gtg gtt cag 939Ser Ile Glu Val
Ser Lys Gly Val Lys Glu Pro Arg Val Val Val Gln 175
180 185aca ata agt gat att gat gtt ctt ata gat
ggc ttt aga tgg agg aaa 987Thr Ile Ser Asp Ile Asp Val Leu Ile Asp
Gly Phe Arg Trp Arg Lys 190 195
200tat ggt caa aaa gtt gtc aaa gga aat act aat cca agg agc tac tac
1035Tyr Gly Gln Lys Val Val Lys Gly Asn Thr Asn Pro Arg Ser Tyr Tyr
205 210 215aag tgc aca ttc caa ggt tgt
gga gtg aag aag caa gtg gaa aga tcc 1083Lys Cys Thr Phe Gln Gly Cys
Gly Val Lys Lys Gln Val Glu Arg Ser 220 225
230gca gca gac gag aga gca gtt ctc act acc tat gaa gga aga cac aat
1131Ala Ala Asp Glu Arg Ala Val Leu Thr Thr Tyr Glu Gly Arg His Asn235
240 245 250cac gat atc cca
acc gcg cta cgt cgc tcg tgaaattatt gggacttagt 1181His Asp Ile Pro
Thr Ala Leu Arg Arg Ser 255 260cactagtaat
atgatttagg ctttctaaaa acaaaaaatc ttactatggc ttatcttttg 1241tgctcattca
cagtttgttt atttgtttgt tacacagtca atactttgtt ttgtacagag 1301tggtgcttag
tagtgttttt attattatct tggccttata gaataacctc tcttctcatc 1361tgtgtgactt
taaacacttg agagtccatt ttatagttct tgtgtattgg tcttttgttt 1421gatttatgta
catttttaat attcgaaaaa aaaaaaaaaa a
1462126260PRTArabidopsis thaliana 126Met Ile Pro Asn Glu Ile Ala Thr Gln
Asn Asn Asn Gln Ser Phe Gly 1 5 10
15Thr Glu Arg Gln Ile Lys Ile Pro Ala Tyr Met Val Ser Arg Asn
Ser 20 25 30Asn Asp Gly Tyr
Gly Trp Arg Lys Tyr Gly Gln Lys Gln Val Lys Lys 35
40 45Ser Glu Asn Pro Arg Ser Tyr Phe Lys Cys Thr Tyr
Pro Asp Cys Val 50 55 60Ser Lys Lys
Ile Val Glu Thr Ala Ser Asp Gly Gln Ile Thr Glu Ile 65
70 75 80Ile Tyr Lys Gly Gly His Asn His
Pro Lys Pro Glu Phe Thr Lys Arg 85 90
95Pro Ser Gln Ser Ser Leu Pro Ser Ser Val Asn Gly Arg Arg
Leu Phe 100 105 110Asn Pro Ala
Ser Val Val Ser Glu Pro His Asp Gln Ser Glu Asn Ser 115
120 125Ser Ile Ser Phe Asp Tyr Ser Asp Leu Glu Gln
Lys Ser Phe Lys Ser 130 135 140Glu Tyr
Gly Glu Ile Asp Glu Glu Glu Glu Gln Pro Glu Met Lys Arg145
150 155 160Met Lys Arg Glu Gly Glu Asp
Glu Gly Met Ser Ile Glu Val Ser Lys 165
170 175Gly Val Lys Glu Pro Arg Val Val Val Gln Thr Ile
Ser Asp Ile Asp 180 185 190Val
Leu Ile Asp Gly Phe Arg Trp Arg Lys Tyr Gly Gln Lys Val Val 195
200 205Lys Gly Asn Thr Asn Pro Arg Ser Tyr
Tyr Lys Cys Thr Phe Gln Gly 210 215
220Cys Gly Val Lys Lys Gln Val Glu Arg Ser Ala Ala Asp Glu Arg Ala225
230 235 240Val Leu Thr Thr
Tyr Glu Gly Arg His Asn His Asp Ile Pro Thr Ala 245
250 255Leu Arg Arg Ser
260127438DNAArabidopsis thalianaCDS(1)..(435) 127atg gag gga tat gat aat
ggg tcg ttg tat gct cct ttt ttg tcg ttg 48Met Glu Gly Tyr Asp Asn
Gly Ser Leu Tyr Ala Pro Phe Leu Ser Leu 1 5
10 15aaa tct cat tcg aaa cca gag ctg cat caa ggc gaa
gaa gag agc tca 96Lys Ser His Ser Lys Pro Glu Leu His Gln Gly Glu
Glu Glu Ser Ser 20 25 30aag
gtt aga tca gaa ggt tgt tcg aaa agc gtg gag tcg tcg aaa aag 144Lys
Val Arg Ser Glu Gly Cys Ser Lys Ser Val Glu Ser Ser Lys Lys 35
40 45aag ggg aag aaa caa agg tat gcg ttt
caa aca agg agc caa gtg gat 192Lys Gly Lys Lys Gln Arg Tyr Ala Phe
Gln Thr Arg Ser Gln Val Asp 50 55
60att ctt gat gat ggt tat cga tgg agg aaa tat ggc caa aag gcc gtc
240Ile Leu Asp Asp Gly Tyr Arg Trp Arg Lys Tyr Gly Gln Lys Ala Val 65
70 75 80aag aac aac aag
ttc cct agg agt tac tat agg tgt aca tat gga gga 288Lys Asn Asn Lys
Phe Pro Arg Ser Tyr Tyr Arg Cys Thr Tyr Gly Gly 85
90 95tgc aat gtg aag aag caa gtg caa aga tta
aca gtg gac caa gaa gtg 336Cys Asn Val Lys Lys Gln Val Gln Arg Leu
Thr Val Asp Gln Glu Val 100 105
110gtc gtg aca acc tac gaa gga gtg cat tcg cat ccc atc gag aaa tcc
384Val Val Thr Thr Tyr Glu Gly Val His Ser His Pro Ile Glu Lys Ser
115 120 125acc gaa aac ttc gag cat att
ctc act caa atg caa atc tac tct tct 432Thr Glu Asn Phe Glu His Ile
Leu Thr Gln Met Gln Ile Tyr Ser Ser 130 135
140ttc tag
438Phe145128145PRTArabidopsis thaliana 128Met Glu Gly Tyr Asp Asn Gly
Ser Leu Tyr Ala Pro Phe Leu Ser Leu 1 5
10 15Lys Ser His Ser Lys Pro Glu Leu His Gln Gly Glu Glu
Glu Ser Ser 20 25 30Lys Val
Arg Ser Glu Gly Cys Ser Lys Ser Val Glu Ser Ser Lys Lys 35
40 45Lys Gly Lys Lys Gln Arg Tyr Ala Phe Gln
Thr Arg Ser Gln Val Asp 50 55 60Ile
Leu Asp Asp Gly Tyr Arg Trp Arg Lys Tyr Gly Gln Lys Ala Val 65
70 75 80Lys Asn Asn Lys Phe Pro
Arg Ser Tyr Tyr Arg Cys Thr Tyr Gly Gly 85
90 95Cys Asn Val Lys Lys Gln Val Gln Arg Leu Thr Val
Asp Gln Glu Val 100 105 110Val
Val Thr Thr Tyr Glu Gly Val His Ser His Pro Ile Glu Lys Ser 115
120 125Thr Glu Asn Phe Glu His Ile Leu Thr
Gln Met Gln Ile Tyr Ser Ser 130 135
140Phe1451291254DNAArabidopsis thalianaCDS(137)..(1090) 129aaaagtccaa
gcaccaatct agacctctta ggaaaaaaac ctaaaaacct aatccccaaa 60cctaaaaggc
ttatctcatc tcttcttctt tgtcttcttt actctttttt tacctctctc 120ttcattgttc
ttcacc atg tct aat gaa acc aga gat ctc tac aac tac caa 172
Met Ser Asn Glu Thr Arg Asp Leu Tyr Asn Tyr Gln 1
5 10tac cct tca tcg ttt tcg ttg cac gaa
atg atg aat ctg cct act tca 220Tyr Pro Ser Ser Phe Ser Leu His Glu
Met Met Asn Leu Pro Thr Ser 15 20
25aat cca tct tct tat gga aac ctc cca tca caa aac ggt ttt aat cca
268Asn Pro Ser Ser Tyr Gly Asn Leu Pro Ser Gln Asn Gly Phe Asn Pro 30
35 40tct act tat tcc ttc acc gat tgt
ctc caa agt tct cca gca gcg tat 316Ser Thr Tyr Ser Phe Thr Asp Cys
Leu Gln Ser Ser Pro Ala Ala Tyr 45 50
55 60gaa tct cta ctt cag aaa act ttt ggt ctt tct ccc tct
tcc tca gag 364Glu Ser Leu Leu Gln Lys Thr Phe Gly Leu Ser Pro Ser
Ser Ser Glu 65 70 75gtt
ttc aat tct tcg atc gat caa gaa ccg aac cgt gat gtt act aat 412Val
Phe Asn Ser Ser Ile Asp Gln Glu Pro Asn Arg Asp Val Thr Asn
80 85 90gac gta atc aat ggt ggt gca tgc
aac gag act gaa act agg gtt tct 460Asp Val Ile Asn Gly Gly Ala Cys
Asn Glu Thr Glu Thr Arg Val Ser 95 100
105cct tct aat tct tcc tct agt gag gct gat cac ccc ggt gaa gat tcc
508Pro Ser Asn Ser Ser Ser Ser Glu Ala Asp His Pro Gly Glu Asp Ser
110 115 120ggt aag agc cgg agg aaa cga
gag tta gtc ggt gaa gaa gat caa att 556Gly Lys Ser Arg Arg Lys Arg
Glu Leu Val Gly Glu Glu Asp Gln Ile125 130
135 140tcc aaa aaa gtt ggg aaa acg aaa aag act gag gtg
aag aaa caa aga 604Ser Lys Lys Val Gly Lys Thr Lys Lys Thr Glu Val
Lys Lys Gln Arg 145 150
155gag cca cga gtc tcg ttt atg act aaa agt gaa gtt gat cat ctt gaa
652Glu Pro Arg Val Ser Phe Met Thr Lys Ser Glu Val Asp His Leu Glu
160 165 170gat ggt tat aga tgg aga
aaa tac ggc caa aag gct gta aaa aat agc 700Asp Gly Tyr Arg Trp Arg
Lys Tyr Gly Gln Lys Ala Val Lys Asn Ser 175 180
185cct tat cca agg agt tac tat aga tgt aca aca caa aag tgc
aac gtg 748Pro Tyr Pro Arg Ser Tyr Tyr Arg Cys Thr Thr Gln Lys Cys
Asn Val 190 195 200aag aaa cga gtg gag
aga tcg ttc caa gat cca acg gtt gtg att aca 796Lys Lys Arg Val Glu
Arg Ser Phe Gln Asp Pro Thr Val Val Ile Thr205 210
215 220act tac gag ggt caa cac aac cac ccg att
ccg act aat ctt cga gga 844Thr Tyr Glu Gly Gln His Asn His Pro Ile
Pro Thr Asn Leu Arg Gly 225 230
235agt tct gcc gcg gct gct atg ttc tcc gca gac ctc atg act cca aga
892Ser Ser Ala Ala Ala Ala Met Phe Ser Ala Asp Leu Met Thr Pro Arg
240 245 250agc ttt gca cat gat atg
ttt agg acg gca gct tat act aac ggc ggt 940Ser Phe Ala His Asp Met
Phe Arg Thr Ala Ala Tyr Thr Asn Gly Gly 255 260
265tct gtg gcg gcg gct ttg gat tat gga tat gga caa agt ggt
tat ggt 988Ser Val Ala Ala Ala Leu Asp Tyr Gly Tyr Gly Gln Ser Gly
Tyr Gly 270 275 280agt gtg aat tca aac
cct agt tct cac caa gtg tat cat caa ggg ggt 1036Ser Val Asn Ser Asn
Pro Ser Ser His Gln Val Tyr His Gln Gly Gly285 290
295 300gag tat gag ctc ttg agg gag att ttt cct
tca att ttc ttt aag caa 1084Glu Tyr Glu Leu Leu Arg Glu Ile Phe Pro
Ser Ile Phe Phe Lys Gln 305 310
315gag cct tgatcgatca ttgttataac tacatatatt atatatattg agagagagag
1140Glu Progtagagaaaa aaaaaactta tatgtaactt aagatcttat tttgtctctc
ttatttgcat 1200gtacatattt tttcatgaaa gaatgagaca gttgggcttg cttaaaaaaa
aaat 1254130318PRTArabidopsis thaliana 130Met Ser Asn Glu Thr Arg
Asp Leu Tyr Asn Tyr Gln Tyr Pro Ser Ser 1 5
10 15Phe Ser Leu His Glu Met Met Asn Leu Pro Thr Ser
Asn Pro Ser Ser 20 25 30Tyr
Gly Asn Leu Pro Ser Gln Asn Gly Phe Asn Pro Ser Thr Tyr Ser 35
40 45Phe Thr Asp Cys Leu Gln Ser Ser Pro
Ala Ala Tyr Glu Ser Leu Leu 50 55
60Gln Lys Thr Phe Gly Leu Ser Pro Ser Ser Ser Glu Val Phe Asn Ser 65
70 75 80Ser Ile Asp Gln Glu
Pro Asn Arg Asp Val Thr Asn Asp Val Ile Asn 85
90 95Gly Gly Ala Cys Asn Glu Thr Glu Thr Arg Val
Ser Pro Ser Asn Ser 100 105
110Ser Ser Ser Glu Ala Asp His Pro Gly Glu Asp Ser Gly Lys Ser Arg
115 120 125Arg Lys Arg Glu Leu Val Gly
Glu Glu Asp Gln Ile Ser Lys Lys Val 130 135
140Gly Lys Thr Lys Lys Thr Glu Val Lys Lys Gln Arg Glu Pro Arg
Val145 150 155 160Ser Phe
Met Thr Lys Ser Glu Val Asp His Leu Glu Asp Gly Tyr Arg
165 170 175Trp Arg Lys Tyr Gly Gln Lys
Ala Val Lys Asn Ser Pro Tyr Pro Arg 180 185
190Ser Tyr Tyr Arg Cys Thr Thr Gln Lys Cys Asn Val Lys Lys
Arg Val 195 200 205Glu Arg Ser Phe
Gln Asp Pro Thr Val Val Ile Thr Thr Tyr Glu Gly 210
215 220Gln His Asn His Pro Ile Pro Thr Asn Leu Arg Gly
Ser Ser Ala Ala225 230 235
240Ala Ala Met Phe Ser Ala Asp Leu Met Thr Pro Arg Ser Phe Ala His
245 250 255Asp Met Phe Arg Thr
Ala Ala Tyr Thr Asn Gly Gly Ser Val Ala Ala 260
265 270Ala Leu Asp Tyr Gly Tyr Gly Gln Ser Gly Tyr Gly
Ser Val Asn Ser 275 280 285Asn Pro
Ser Ser His Gln Val Tyr His Gln Gly Gly Glu Tyr Glu Leu 290
295 300Leu Arg Glu Ile Phe Pro Ser Ile Phe Phe Lys
Gln Glu Pro305 310
3151311148DNAArabidopsis thalianaCDS(104)..(1075) 131acgtctctct
ctttctctct actctctgtt tcctcataat tcaatcacta tattttttta 60aaaacatttg
acttcatcga tcggttaaca attaatcaaa aag atg gga cga tca 115
Met Gly Arg Ser
1cca tgt tgt gag aag aag aat ggt ctc aag aaa
gga cca tgg act cct 163Pro Cys Cys Glu Lys Lys Asn Gly Leu Lys Lys
Gly Pro Trp Thr Pro 5 10 15
20gag gag gat caa aag ctc att gat tat atc aat ata cat ggt tat gga
211Glu Glu Asp Gln Lys Leu Ile Asp Tyr Ile Asn Ile His Gly Tyr Gly
25 30 35aat tgg aga act ctt
ccc aag aat gct ggg tta caa aga tgt ggt aag 259Asn Trp Arg Thr Leu
Pro Lys Asn Ala Gly Leu Gln Arg Cys Gly Lys 40
45 50agt tgt cgt ctc cgg tgg acc aac tat ctc cga cca
gat att aag cgt 307Ser Cys Arg Leu Arg Trp Thr Asn Tyr Leu Arg Pro
Asp Ile Lys Arg 55 60 65gga aga
ttc tct ttt gaa gaa gaa gaa acc att att caa ctt cac agc 355Gly Arg
Phe Ser Phe Glu Glu Glu Glu Thr Ile Ile Gln Leu His Ser 70
75 80atc atg gga aac aag tgg tct gcg att gcg gct
cgt ttg cct gga aga 403Ile Met Gly Asn Lys Trp Ser Ala Ile Ala Ala
Arg Leu Pro Gly Arg 85 90 95
100aca gac aac gag atc aaa aac tat tgg aac act cac atc aga aaa aga
451Thr Asp Asn Glu Ile Lys Asn Tyr Trp Asn Thr His Ile Arg Lys Arg
105 110 115ctt cta aag atg gga
atc gac ccg gtt aca cac act cca cgt ctt gat 499Leu Leu Lys Met Gly
Ile Asp Pro Val Thr His Thr Pro Arg Leu Asp 120
125 130ctt ctc gat atc tcc tcc att ctc agc tca tct atc
tac aac tct tcg 547Leu Leu Asp Ile Ser Ser Ile Leu Ser Ser Ser Ile
Tyr Asn Ser Ser 135 140 145cat cat
cat cat cat cat cat caa caa cat atg aac atg tcg agg ctc 595His His
His His His His His Gln Gln His Met Asn Met Ser Arg Leu 150
155 160atg atg agt gat ggt aat cat caa cca ttg gtt
aac ccc gag ata ctc 643Met Met Ser Asp Gly Asn His Gln Pro Leu Val
Asn Pro Glu Ile Leu165 170 175
180aaa ctc gca acc tct ctc ttt tca aac caa aac cac ccc aac aac aca
691Lys Leu Ala Thr Ser Leu Phe Ser Asn Gln Asn His Pro Asn Asn Thr
185 190 195cac gag aac aac acg
gtt aac caa acc gaa gta aac caa tac caa acc 739His Glu Asn Asn Thr
Val Asn Gln Thr Glu Val Asn Gln Tyr Gln Thr 200
205 210ggt tac aac atg cct ggt aat gaa gaa tta caa tct
tgg ttc cct atc 787Gly Tyr Asn Met Pro Gly Asn Glu Glu Leu Gln Ser
Trp Phe Pro Ile 215 220 225atg gat
caa ttc acg aat ttc caa gac ctc atg cca atg aag acg acg 835Met Asp
Gln Phe Thr Asn Phe Gln Asp Leu Met Pro Met Lys Thr Thr 230
235 240gtc caa aat tca ttg tca tac gat gat gat tgt
tcg aag tcc aat ttt 883Val Gln Asn Ser Leu Ser Tyr Asp Asp Asp Cys
Ser Lys Ser Asn Phe245 250 255
260gta tta gaa cct tat tac tcc gac ttt gct tca gtc ttg acc aca cct
931Val Leu Glu Pro Tyr Tyr Ser Asp Phe Ala Ser Val Leu Thr Thr Pro
265 270 275tct tca agc ccg act
ccg tta aac tca agt tcc tca act tac atc aat 979Ser Ser Ser Pro Thr
Pro Leu Asn Ser Ser Ser Ser Thr Tyr Ile Asn 280
285 290agt agc act tgc agc acc gag gat gaa aaa gag agt
tat tac agt gat 1027Ser Ser Thr Cys Ser Thr Glu Asp Glu Lys Glu Ser
Tyr Tyr Ser Asp 295 300 305aat atc
act aat tat tcg ttt gat gtt aat ggt ttt ctc caa ttc caa 1075Asn Ile
Thr Asn Tyr Ser Phe Asp Val Asn Gly Phe Leu Gln Phe Gln 310
315 320taaacaaaac gccattggaa tagagttatg taaacatgca
atcattgtat ttgttatata 1135gattttgtta cat
1148132324PRTArabidopsis thaliana 132Met Gly Arg
Ser Pro Cys Cys Glu Lys Lys Asn Gly Leu Lys Lys Gly 1 5
10 15Pro Trp Thr Pro Glu Glu Asp Gln Lys
Leu Ile Asp Tyr Ile Asn Ile 20 25
30His Gly Tyr Gly Asn Trp Arg Thr Leu Pro Lys Asn Ala Gly Leu Gln
35 40 45Arg Cys Gly Lys Ser Cys
Arg Leu Arg Trp Thr Asn Tyr Leu Arg Pro 50 55
60Asp Ile Lys Arg Gly Arg Phe Ser Phe Glu Glu Glu Glu Thr Ile
Ile 65 70 75 80Gln Leu
His Ser Ile Met Gly Asn Lys Trp Ser Ala Ile Ala Ala Arg
85 90 95Leu Pro Gly Arg Thr Asp Asn Glu
Ile Lys Asn Tyr Trp Asn Thr His 100 105
110Ile Arg Lys Arg Leu Leu Lys Met Gly Ile Asp Pro Val Thr His
Thr 115 120 125Pro Arg Leu Asp Leu
Leu Asp Ile Ser Ser Ile Leu Ser Ser Ser Ile 130 135
140Tyr Asn Ser Ser His His His His His His His Gln Gln His
Met Asn145 150 155 160Met
Ser Arg Leu Met Met Ser Asp Gly Asn His Gln Pro Leu Val Asn
165 170 175Pro Glu Ile Leu Lys Leu Ala
Thr Ser Leu Phe Ser Asn Gln Asn His 180 185
190Pro Asn Asn Thr His Glu Asn Asn Thr Val Asn Gln Thr Glu
Val Asn 195 200 205Gln Tyr Gln Thr
Gly Tyr Asn Met Pro Gly Asn Glu Glu Leu Gln Ser 210
215 220Trp Phe Pro Ile Met Asp Gln Phe Thr Asn Phe Gln
Asp Leu Met Pro225 230 235
240Met Lys Thr Thr Val Gln Asn Ser Leu Ser Tyr Asp Asp Asp Cys Ser
245 250 255Lys Ser Asn Phe Val
Leu Glu Pro Tyr Tyr Ser Asp Phe Ala Ser Val 260
265 270Leu Thr Thr Pro Ser Ser Ser Pro Thr Pro Leu Asn
Ser Ser Ser Ser 275 280 285Thr Tyr
Ile Asn Ser Ser Thr Cys Ser Thr Glu Asp Glu Lys Glu Ser 290
295 300Tyr Tyr Ser Asp Asn Ile Thr Asn Tyr Ser Phe
Asp Val Asn Gly Phe305 310 315
320Leu Gln Phe Gln1332684DNAArabidopsis thalianaCDS(898)..(2475)
133acttcttctt cttcttcttc tcgatttctt actgttttct tatccaacga aatctggaat
60taaaaatgga atctttatcg aatccaagct gattttgttt ctttcattga atcatctctc
120taaagtggaa ttttgtaaag agaagatctg aagttgtgta gaggagctta gtgatggaga
180caaattcgtc tggagaagat ctggttatta agactcggaa gccatatacg ataacaaagc
240aacgtgaaag gtggactgag gaagaacata atagattcat tgaagctttg aggctttatg
300gtagagcatg gcagaagatt gaagaacatg tagcaacaaa aactgctgtc cagataagaa
360gtcacgctca gaaatttttc tccaaggtaa aatcggttaa ttttgaaatg atgttctcat
420cttcattggc ttaatgctta agacttattg aaagccaggc aagttttctg cttcttttgc
480ttcttagtca ggagatagat agattacgtt tttagagttt agtaatgagc aataagtctt
540aaaatagttg gagaaatgac gagatgtaat cgttttcttt tgtttatgcc tatatcttgt
600taatccacaa acatgtacat agattcttca gaagaatgtt agtttcttta gattcttcag
660ataaacttgt gtcttcttac cgattctgag gtagtggcaa aagtgggctg agtgctagaa
720atttttgaat gttccttgtg ataagccata gaggtaaacc atttttgatt ttccagttct
780gtcatttaaa cttgttagtg tcattagatt tttgtttgtt tacgtttgtt tagagggtaa
840caaaactact ctcatctctc tcaggtagag aaagaggctg aagctaaagg tgtagct
897atg ggt caa gcg cta gac ata gct att cct cct cca cgg cct aag cgt
945Met Gly Gln Ala Leu Asp Ile Ala Ile Pro Pro Pro Arg Pro Lys Arg 1
5 10 15aaa cca aac aat cct
tat cct cga aag acg gga agt gga acg atc ctt 993Lys Pro Asn Asn Pro
Tyr Pro Arg Lys Thr Gly Ser Gly Thr Ile Leu 20
25 30atg tca aaa acg ggt gtg aat gat gga aaa gag tcc
ctt gga tca gaa 1041Met Ser Lys Thr Gly Val Asn Asp Gly Lys Glu Ser
Leu Gly Ser Glu 35 40 45aaa gtg
tcg cat cct gag atg gcc aat gaa gat cga caa caa tca aag 1089Lys Val
Ser His Pro Glu Met Ala Asn Glu Asp Arg Gln Gln Ser Lys 50
55 60cct gaa gag aaa act ctg cag gaa gac aac tgt
tca gat tgt ttc act 1137Pro Glu Glu Lys Thr Leu Gln Glu Asp Asn Cys
Ser Asp Cys Phe Thr 65 70 75
80cat cag tat ctc tct gct gca tcc tcc atg aat aaa agt tgt ata gag
1185His Gln Tyr Leu Ser Ala Ala Ser Ser Met Asn Lys Ser Cys Ile Glu
85 90 95aca tca aac gca agc
act ttc cgc gag ttc ttg cct tca cgg gaa gag 1233Thr Ser Asn Ala Ser
Thr Phe Arg Glu Phe Leu Pro Ser Arg Glu Glu 100
105 110gga agt cag aat aac agg gta aga aag gag tca aac
tca gat ttg aat 1281Gly Ser Gln Asn Asn Arg Val Arg Lys Glu Ser Asn
Ser Asp Leu Asn 115 120 125gca aaa
tct ctg gaa aac ggt aat gag caa gga cct cag act tat ccg 1329Ala Lys
Ser Leu Glu Asn Gly Asn Glu Gln Gly Pro Gln Thr Tyr Pro 130
135 140atg cat atc cct gtg cta gtg cca ttg ggg agc
tca ata aca agt tct 1377Met His Ile Pro Val Leu Val Pro Leu Gly Ser
Ser Ile Thr Ser Ser145 150 155
160cta tca cat cct cct tca gag cca gat agt cat ccc cac aca gtt gca
1425Leu Ser His Pro Pro Ser Glu Pro Asp Ser His Pro His Thr Val Ala
165 170 175gga gat tat cag tcg
ttt cct aat cat ata atg tca acc ctt tta caa 1473Gly Asp Tyr Gln Ser
Phe Pro Asn His Ile Met Ser Thr Leu Leu Gln 180
185 190aca ccg gct ctt tat act gcc gca act ttc gcc tca
tca ttt tgg cct 1521Thr Pro Ala Leu Tyr Thr Ala Ala Thr Phe Ala Ser
Ser Phe Trp Pro 195 200 205ccc gat
tct agt ggt ggc tca cct gtt cca ggg aac tca cct ccg aat 1569Pro Asp
Ser Ser Gly Gly Ser Pro Val Pro Gly Asn Ser Pro Pro Asn 210
215 220ctg gct gcc atg gcc gca gcc act gtt gca gct
gct agt gct tgg tgg 1617Leu Ala Ala Met Ala Ala Ala Thr Val Ala Ala
Ala Ser Ala Trp Trp225 230 235
240gct gcc aat gga tta tta cct tta tgt gct cct ctt agt tca ggt ggt
1665Ala Ala Asn Gly Leu Leu Pro Leu Cys Ala Pro Leu Ser Ser Gly Gly
245 250 255ttc act agt cat cct
cca tct act ttt gga cca tca tgt gat gta gag 1713Phe Thr Ser His Pro
Pro Ser Thr Phe Gly Pro Ser Cys Asp Val Glu 260
265 270tac aca aaa gca agc act tta caa cat ggt tct gtg
cag agc cga gag 1761Tyr Thr Lys Ala Ser Thr Leu Gln His Gly Ser Val
Gln Ser Arg Glu 275 280 285caa gaa
cac ttc gag gca tca aag gct cga tct tca ctg gac tca gag 1809Gln Glu
His Phe Glu Ala Ser Lys Ala Arg Ser Ser Leu Asp Ser Glu 290
295 300gat gtt gaa aat aag agt aaa cca gtt tgt cat
gag cag cct tct gca 1857Asp Val Glu Asn Lys Ser Lys Pro Val Cys His
Glu Gln Pro Ser Ala305 310 315
320aca cct gag agt gat gca aag ggt tca gat gga gca gga gac aga aaa
1905Thr Pro Glu Ser Asp Ala Lys Gly Ser Asp Gly Ala Gly Asp Arg Lys
325 330 335caa gtt gac cgg tcc
tcg tgt ggc tca aac act ccg tcg agt agt gat 1953Gln Val Asp Arg Ser
Ser Cys Gly Ser Asn Thr Pro Ser Ser Ser Asp 340
345 350gat gtt gag gcg gat gca tca gaa agg caa gag gat
ggc acc aat ggt 2001Asp Val Glu Ala Asp Ala Ser Glu Arg Gln Glu Asp
Gly Thr Asn Gly 355 360 365gag gtg
aaa gaa acg aat gaa gac act aat aaa cct caa act tca gag 2049Glu Val
Lys Glu Thr Asn Glu Asp Thr Asn Lys Pro Gln Thr Ser Glu 370
375 380tcc aat gca cgc cgc agt aga atc agc tcc aat
ata acc gat cca tgg 2097Ser Asn Ala Arg Arg Ser Arg Ile Ser Ser Asn
Ile Thr Asp Pro Trp385 390 395
400aag tct gtg tct gac gag ggt cga att gcc ttc caa gct ctc ttc tcc
2145Lys Ser Val Ser Asp Glu Gly Arg Ile Ala Phe Gln Ala Leu Phe Ser
405 410 415aga gag gta ttg ccg
caa agt ttt aca tat cga gaa gaa cac aga gag 2193Arg Glu Val Leu Pro
Gln Ser Phe Thr Tyr Arg Glu Glu His Arg Glu 420
425 430gaa gaa caa caa caa caa gaa caa aga tat cca atg
gca ctt gat ctt 2241Glu Glu Gln Gln Gln Gln Glu Gln Arg Tyr Pro Met
Ala Leu Asp Leu 435 440 445aac ttc
aca gct cag tta aca cca gtt gat gat caa gag gag aag aga 2289Asn Phe
Thr Ala Gln Leu Thr Pro Val Asp Asp Gln Glu Glu Lys Arg 450
455 460aac aca gga ttt ctt gga atc gga tta gat gct
tca aag cta atg agt 2337Asn Thr Gly Phe Leu Gly Ile Gly Leu Asp Ala
Ser Lys Leu Met Ser465 470 475
480aga gga aga aca ggt ttt aaa cca tac aaa aga tgt tcc atg gaa gcc
2385Arg Gly Arg Thr Gly Phe Lys Pro Tyr Lys Arg Cys Ser Met Glu Ala
485 490 495aaa gaa agt aga atc
ctc aac aac aat cct atc att cat gtg gaa cag 2433Lys Glu Ser Arg Ile
Leu Asn Asn Asn Pro Ile Ile His Val Glu Gln 500
505 510aaa gat ccc aaa cgg atg cgg ttg gaa act caa gct
tcc aca 2475Lys Asp Pro Lys Arg Met Arg Leu Glu Thr Gln Ala
Ser Thr 515 520 525tgagactcta
ttttcatctg atctgttgtt tgtactctgt ttttaagttt tcaagaccac 2535tgctacattt
tctttttctt ttgaggcctt tgtatttgtt tccttgtcca tagtcttcct 2595gtaacatttg
actctgtatt attcaacaaa tcataaactg tttaatcttt ttttttccag 2655aaaaaaaaaa
aagaaaaaaa aaaaaaaaa
2684134526PRTArabidopsis thaliana 134Met Gly Gln Ala Leu Asp Ile Ala Ile
Pro Pro Pro Arg Pro Lys Arg 1 5 10
15Lys Pro Asn Asn Pro Tyr Pro Arg Lys Thr Gly Ser Gly Thr Ile
Leu 20 25 30Met Ser Lys Thr
Gly Val Asn Asp Gly Lys Glu Ser Leu Gly Ser Glu 35
40 45Lys Val Ser His Pro Glu Met Ala Asn Glu Asp Arg
Gln Gln Ser Lys 50 55 60Pro Glu Glu
Lys Thr Leu Gln Glu Asp Asn Cys Ser Asp Cys Phe Thr 65
70 75 80His Gln Tyr Leu Ser Ala Ala Ser
Ser Met Asn Lys Ser Cys Ile Glu 85 90
95Thr Ser Asn Ala Ser Thr Phe Arg Glu Phe Leu Pro Ser Arg
Glu Glu 100 105 110Gly Ser Gln
Asn Asn Arg Val Arg Lys Glu Ser Asn Ser Asp Leu Asn 115
120 125Ala Lys Ser Leu Glu Asn Gly Asn Glu Gln Gly
Pro Gln Thr Tyr Pro 130 135 140Met His
Ile Pro Val Leu Val Pro Leu Gly Ser Ser Ile Thr Ser Ser145
150 155 160Leu Ser His Pro Pro Ser Glu
Pro Asp Ser His Pro His Thr Val Ala 165
170 175Gly Asp Tyr Gln Ser Phe Pro Asn His Ile Met Ser
Thr Leu Leu Gln 180 185 190Thr
Pro Ala Leu Tyr Thr Ala Ala Thr Phe Ala Ser Ser Phe Trp Pro 195
200 205Pro Asp Ser Ser Gly Gly Ser Pro Val
Pro Gly Asn Ser Pro Pro Asn 210 215
220Leu Ala Ala Met Ala Ala Ala Thr Val Ala Ala Ala Ser Ala Trp Trp225
230 235 240Ala Ala Asn Gly
Leu Leu Pro Leu Cys Ala Pro Leu Ser Ser Gly Gly 245
250 255Phe Thr Ser His Pro Pro Ser Thr Phe Gly
Pro Ser Cys Asp Val Glu 260 265
270Tyr Thr Lys Ala Ser Thr Leu Gln His Gly Ser Val Gln Ser Arg Glu
275 280 285Gln Glu His Phe Glu Ala Ser
Lys Ala Arg Ser Ser Leu Asp Ser Glu 290 295
300Asp Val Glu Asn Lys Ser Lys Pro Val Cys His Glu Gln Pro Ser
Ala305 310 315 320Thr Pro
Glu Ser Asp Ala Lys Gly Ser Asp Gly Ala Gly Asp Arg Lys
325 330 335Gln Val Asp Arg Ser Ser Cys
Gly Ser Asn Thr Pro Ser Ser Ser Asp 340 345
350Asp Val Glu Ala Asp Ala Ser Glu Arg Gln Glu Asp Gly Thr
Asn Gly 355 360 365Glu Val Lys Glu
Thr Asn Glu Asp Thr Asn Lys Pro Gln Thr Ser Glu 370
375 380Ser Asn Ala Arg Arg Ser Arg Ile Ser Ser Asn Ile
Thr Asp Pro Trp385 390 395
400Lys Ser Val Ser Asp Glu Gly Arg Ile Ala Phe Gln Ala Leu Phe Ser
405 410 415Arg Glu Val Leu Pro
Gln Ser Phe Thr Tyr Arg Glu Glu His Arg Glu 420
425 430Glu Glu Gln Gln Gln Gln Glu Gln Arg Tyr Pro Met
Ala Leu Asp Leu 435 440 445Asn Phe
Thr Ala Gln Leu Thr Pro Val Asp Asp Gln Glu Glu Lys Arg 450
455 460Asn Thr Gly Phe Leu Gly Ile Gly Leu Asp Ala
Ser Lys Leu Met Ser465 470 475
480Arg Gly Arg Thr Gly Phe Lys Pro Tyr Lys Arg Cys Ser Met Glu Ala
485 490 495Lys Glu Ser Arg
Ile Leu Asn Asn Asn Pro Ile Ile His Val Glu Gln 500
505 510Lys Asp Pro Lys Arg Met Arg Leu Glu Thr Gln
Ala Ser Thr 515 520
5251351938DNAArabidopsis thalianaCDS(1)..(1935) 135atg gat act aat aca
tct gga gaa gaa tta tta gct aag gca aga aag 48Met Asp Thr Asn Thr
Ser Gly Glu Glu Leu Leu Ala Lys Ala Arg Lys 1 5
10 15cca tat aca ata aca aag cag cga gag cga tgg
act gag gat gag cat 96Pro Tyr Thr Ile Thr Lys Gln Arg Glu Arg Trp
Thr Glu Asp Glu His 20 25
30gag agg ttt cta gaa gcc ttg agg ctt tat gga aga gct tgg caa cga
144Glu Arg Phe Leu Glu Ala Leu Arg Leu Tyr Gly Arg Ala Trp Gln Arg
35 40 45att gaa gaa cat att ggg aca
aag act gct gtt cag atc aga agt cat 192Ile Glu Glu His Ile Gly Thr
Lys Thr Ala Val Gln Ile Arg Ser His 50 55
60gca caa aag ttc ttc aca aag ttg gag aaa gag gct gaa gtt aaa ggc
240Ala Gln Lys Phe Phe Thr Lys Leu Glu Lys Glu Ala Glu Val Lys Gly 65
70 75 80atc cct gtt tgc
caa gct ttg gac ata gaa att ccg cct cct cgt cct 288Ile Pro Val Cys
Gln Ala Leu Asp Ile Glu Ile Pro Pro Pro Arg Pro 85
90 95aaa cga aaa ccc aat act cct tat cct cga
aag cct ggg aac aac ggt 336Lys Arg Lys Pro Asn Thr Pro Tyr Pro Arg
Lys Pro Gly Asn Asn Gly 100 105
110aca tct tcc tct caa gta tca tca gca aaa gat gca aaa ctt gtt tca
384Thr Ser Ser Ser Gln Val Ser Ser Ala Lys Asp Ala Lys Leu Val Ser
115 120 125tcg gcc tct tct tca cag ttg
aat cag gcg ttc ttg gat ttg gaa aaa 432Ser Ala Ser Ser Ser Gln Leu
Asn Gln Ala Phe Leu Asp Leu Glu Lys 130 135
140atg ccg ttc tct gag aaa aca tca act gga aaa gaa aat caa gat gag
480Met Pro Phe Ser Glu Lys Thr Ser Thr Gly Lys Glu Asn Gln Asp Glu145
150 155 160aat tgc tcg ggt
gtt tct act gtg aac aag tat ccc tta cca acg aaa 528Asn Cys Ser Gly
Val Ser Thr Val Asn Lys Tyr Pro Leu Pro Thr Lys 165
170 175cag gta agt ggc gac att gaa aca agt aag
acc tca act gtg gac aac 576Gln Val Ser Gly Asp Ile Glu Thr Ser Lys
Thr Ser Thr Val Asp Asn 180 185
190gcg gtt caa gat gtt ccc aag aag aac aaa gac aaa gat ggt aac gat
624Ala Val Gln Asp Val Pro Lys Lys Asn Lys Asp Lys Asp Gly Asn Asp
195 200 205ggt act act gtg cac agc atg
caa aac tac cct tgg cat ttc cac gca 672Gly Thr Thr Val His Ser Met
Gln Asn Tyr Pro Trp His Phe His Ala 210 215
220gat att gtg aac ggg aat ata gca aaa tgc cct caa aat cat ccc tca
720Asp Ile Val Asn Gly Asn Ile Ala Lys Cys Pro Gln Asn His Pro Ser225
230 235 240ggt atg gta tct
caa gac ttc atg ttt cat cct atg aga gaa gaa act 768Gly Met Val Ser
Gln Asp Phe Met Phe His Pro Met Arg Glu Glu Thr 245
250 255cac ggg cac gca aat ctt caa gct aca aca
gca tct gct act act aca 816His Gly His Ala Asn Leu Gln Ala Thr Thr
Ala Ser Ala Thr Thr Thr 260 265
270gct tct cat caa gcg ttt cca gct tgt cat tca cag gat gat tac cgt
864Ala Ser His Gln Ala Phe Pro Ala Cys His Ser Gln Asp Asp Tyr Arg
275 280 285tcg ttt ctc cag ata tca tct
act ttc tcc aat ctt att atg tca act 912Ser Phe Leu Gln Ile Ser Ser
Thr Phe Ser Asn Leu Ile Met Ser Thr 290 295
300ctc cta cag aat cct gca gct cat gct gca gct aca ttc gct gct tcg
960Leu Leu Gln Asn Pro Ala Ala His Ala Ala Ala Thr Phe Ala Ala Ser305
310 315 320gtc tgg cct tat
gcg agt gtc ggg aat tct ggt gat tca tca acc cca 1008Val Trp Pro Tyr
Ala Ser Val Gly Asn Ser Gly Asp Ser Ser Thr Pro 325
330 335atg agc tct tct cct cca agt ata act gcc
att gcc gct gct aca gta 1056Met Ser Ser Ser Pro Pro Ser Ile Thr Ala
Ile Ala Ala Ala Thr Val 340 345
350gct gct gca act gct tgg tgg gct tct cat gga ctt ctt cct gta tgc
1104Ala Ala Ala Thr Ala Trp Trp Ala Ser His Gly Leu Leu Pro Val Cys
355 360 365gct cca gct cca ata aca tgt
gtt cca ttc tca act gtt gca gtt cca 1152Ala Pro Ala Pro Ile Thr Cys
Val Pro Phe Ser Thr Val Ala Val Pro 370 375
380act cca gca atg act gaa atg gat acc gtt gaa aat act caa ccg ttt
1200Thr Pro Ala Met Thr Glu Met Asp Thr Val Glu Asn Thr Gln Pro Phe385
390 395 400gag aaa caa aac
aca gct ctg caa gat caa aac ttg gct tcg aaa tct 1248Glu Lys Gln Asn
Thr Ala Leu Gln Asp Gln Asn Leu Ala Ser Lys Ser 405
410 415cca gct tca tca tct gat gat tca gat gag
act gga gta acc aag cta 1296Pro Ala Ser Ser Ser Asp Asp Ser Asp Glu
Thr Gly Val Thr Lys Leu 420 425
430aat gcc gac tca aaa acc aat gat gat aaa att gag gag gtt gtt gtt
1344Asn Ala Asp Ser Lys Thr Asn Asp Asp Lys Ile Glu Glu Val Val Val
435 440 445act gcc gct gtg cat gac tca
aac act gcc cag aag aaa aat ctt gtg 1392Thr Ala Ala Val His Asp Ser
Asn Thr Ala Gln Lys Lys Asn Leu Val 450 455
460gac cgc tca tcc tgt ggc tca aat aca cct tca ggg agt gac gca gaa
1440Asp Arg Ser Ser Cys Gly Ser Asn Thr Pro Ser Gly Ser Asp Ala Glu465
470 475 480act gat gca tta
gat aaa atg gag aaa gat aaa gag gat gtg aag gag 1488Thr Asp Ala Leu
Asp Lys Met Glu Lys Asp Lys Glu Asp Val Lys Glu 485
490 495aca gat gag aat cag cca gat gtt att gag
tta aat aac cgt aag att 1536Thr Asp Glu Asn Gln Pro Asp Val Ile Glu
Leu Asn Asn Arg Lys Ile 500 505
510aaa atg aga gac aac aac agc aac aac aat gca act act gat tcg tgg
1584Lys Met Arg Asp Asn Asn Ser Asn Asn Asn Ala Thr Thr Asp Ser Trp
515 520 525aag gaa gtc tcc gaa gag ggt
cgt ata gcg ttt cag gct ctc ttt gca 1632Lys Glu Val Ser Glu Glu Gly
Arg Ile Ala Phe Gln Ala Leu Phe Ala 530 535
540aga gaa aga ttg cct caa agc ttt tcg cct cct caa gtg gca gag aat
1680Arg Glu Arg Leu Pro Gln Ser Phe Ser Pro Pro Gln Val Ala Glu Asn545
550 555 560gtg aat aga aaa
caa agt gac acg tca atg cca ttg gct cct aat ttc 1728Val Asn Arg Lys
Gln Ser Asp Thr Ser Met Pro Leu Ala Pro Asn Phe 565
570 575aaa agc cag gat tct tgt gct gca gac caa
gaa gga gta gta atg atc 1776Lys Ser Gln Asp Ser Cys Ala Ala Asp Gln
Glu Gly Val Val Met Ile 580 585
590ggt gtt gga aca tgc aag agt ctt aaa acg aga cag aca gga ttt aag
1824Gly Val Gly Thr Cys Lys Ser Leu Lys Thr Arg Gln Thr Gly Phe Lys
595 600 605cca tac aag aga tgt tca atg
gaa gtg aaa gag agc caa gtt ggg aac 1872Pro Tyr Lys Arg Cys Ser Met
Glu Val Lys Glu Ser Gln Val Gly Asn 610 615
620ata aac aat caa agt gat gaa aaa gtc tgc aaa agg ctt cga ttg gaa
1920Ile Asn Asn Gln Ser Asp Glu Lys Val Cys Lys Arg Leu Arg Leu Glu625
630 635 640gga gaa gct tct
aca tga 1938Gly Glu Ala Ser
Thr 645136645PRTArabidopsis thaliana 136Met Asp Thr Asn
Thr Ser Gly Glu Glu Leu Leu Ala Lys Ala Arg Lys 1 5
10 15Pro Tyr Thr Ile Thr Lys Gln Arg Glu Arg
Trp Thr Glu Asp Glu His 20 25
30Glu Arg Phe Leu Glu Ala Leu Arg Leu Tyr Gly Arg Ala Trp Gln Arg
35 40 45Ile Glu Glu His Ile Gly Thr
Lys Thr Ala Val Gln Ile Arg Ser His 50 55
60Ala Gln Lys Phe Phe Thr Lys Leu Glu Lys Glu Ala Glu Val Lys Gly
65 70 75 80Ile Pro Val
Cys Gln Ala Leu Asp Ile Glu Ile Pro Pro Pro Arg Pro 85
90 95Lys Arg Lys Pro Asn Thr Pro Tyr Pro
Arg Lys Pro Gly Asn Asn Gly 100 105
110Thr Ser Ser Ser Gln Val Ser Ser Ala Lys Asp Ala Lys Leu Val Ser
115 120 125Ser Ala Ser Ser Ser Gln
Leu Asn Gln Ala Phe Leu Asp Leu Glu Lys 130 135
140Met Pro Phe Ser Glu Lys Thr Ser Thr Gly Lys Glu Asn Gln Asp
Glu145 150 155 160Asn Cys
Ser Gly Val Ser Thr Val Asn Lys Tyr Pro Leu Pro Thr Lys
165 170 175Gln Val Ser Gly Asp Ile Glu
Thr Ser Lys Thr Ser Thr Val Asp Asn 180 185
190Ala Val Gln Asp Val Pro Lys Lys Asn Lys Asp Lys Asp Gly
Asn Asp 195 200 205Gly Thr Thr Val
His Ser Met Gln Asn Tyr Pro Trp His Phe His Ala 210
215 220Asp Ile Val Asn Gly Asn Ile Ala Lys Cys Pro Gln
Asn His Pro Ser225 230 235
240Gly Met Val Ser Gln Asp Phe Met Phe His Pro Met Arg Glu Glu Thr
245 250 255His Gly His Ala Asn
Leu Gln Ala Thr Thr Ala Ser Ala Thr Thr Thr 260
265 270Ala Ser His Gln Ala Phe Pro Ala Cys His Ser Gln
Asp Asp Tyr Arg 275 280 285Ser Phe
Leu Gln Ile Ser Ser Thr Phe Ser Asn Leu Ile Met Ser Thr 290
295 300Leu Leu Gln Asn Pro Ala Ala His Ala Ala Ala
Thr Phe Ala Ala Ser305 310 315
320Val Trp Pro Tyr Ala Ser Val Gly Asn Ser Gly Asp Ser Ser Thr Pro
325 330 335Met Ser Ser Ser
Pro Pro Ser Ile Thr Ala Ile Ala Ala Ala Thr Val 340
345 350Ala Ala Ala Thr Ala Trp Trp Ala Ser His Gly
Leu Leu Pro Val Cys 355 360 365Ala
Pro Ala Pro Ile Thr Cys Val Pro Phe Ser Thr Val Ala Val Pro 370
375 380Thr Pro Ala Met Thr Glu Met Asp Thr Val
Glu Asn Thr Gln Pro Phe385 390 395
400Glu Lys Gln Asn Thr Ala Leu Gln Asp Gln Asn Leu Ala Ser Lys
Ser 405 410 415Pro Ala Ser
Ser Ser Asp Asp Ser Asp Glu Thr Gly Val Thr Lys Leu 420
425 430Asn Ala Asp Ser Lys Thr Asn Asp Asp Lys
Ile Glu Glu Val Val Val 435 440
445Thr Ala Ala Val His Asp Ser Asn Thr Ala Gln Lys Lys Asn Leu Val 450
455 460Asp Arg Ser Ser Cys Gly Ser Asn
Thr Pro Ser Gly Ser Asp Ala Glu465 470
475 480Thr Asp Ala Leu Asp Lys Met Glu Lys Asp Lys Glu
Asp Val Lys Glu 485 490
495Thr Asp Glu Asn Gln Pro Asp Val Ile Glu Leu Asn Asn Arg Lys Ile
500 505 510Lys Met Arg Asp Asn Asn
Ser Asn Asn Asn Ala Thr Thr Asp Ser Trp 515 520
525Lys Glu Val Ser Glu Glu Gly Arg Ile Ala Phe Gln Ala Leu
Phe Ala 530 535 540Arg Glu Arg Leu Pro
Gln Ser Phe Ser Pro Pro Gln Val Ala Glu Asn545 550
555 560Val Asn Arg Lys Gln Ser Asp Thr Ser Met
Pro Leu Ala Pro Asn Phe 565 570
575Lys Ser Gln Asp Ser Cys Ala Ala Asp Gln Glu Gly Val Val Met Ile
580 585 590Gly Val Gly Thr Cys
Lys Ser Leu Lys Thr Arg Gln Thr Gly Phe Lys 595
600 605Pro Tyr Lys Arg Cys Ser Met Glu Val Lys Glu Ser
Gln Val Gly Asn 610 615 620Ile Asn Asn
Gln Ser Asp Glu Lys Val Cys Lys Arg Leu Arg Leu Glu625
630 635 640Gly Glu Ala Ser Thr
6451371989DNAArabidopsis thalianaCDS(1)..(1986) 137atg aat atg agt
gat tta ggt tgg gat gat gaa gat aaa tcg gtg gtt 48Met Asn Met Ser
Asp Leu Gly Trp Asp Asp Glu Asp Lys Ser Val Val 1 5
10 15agt gct gtt tta ggg cat tta gct tct gat
ttt ctt cga gca aac tct 96Ser Ala Val Leu Gly His Leu Ala Ser Asp
Phe Leu Arg Ala Asn Ser 20 25
30aat tcg aat cag aat ctc ttt ctt gtt atg gga act gat gat act ctg
144Asn Ser Asn Gln Asn Leu Phe Leu Val Met Gly Thr Asp Asp Thr Leu
35 40 45aat aag aag ctc tct agt ctc
gtt gat tgg cca aac tcg gag aat ttc 192Asn Lys Lys Leu Ser Ser Leu
Val Asp Trp Pro Asn Ser Glu Asn Phe 50 55
60agc tgg aac tac gct att ttc tgg caa caa acc atg tct aga tcc gga
240Ser Trp Asn Tyr Ala Ile Phe Trp Gln Gln Thr Met Ser Arg Ser Gly 65
70 75 80caa caa gtc tta
ggt tgg gga gat ggg tgt tgt cga gag cct aat gag 288Gln Gln Val Leu
Gly Trp Gly Asp Gly Cys Cys Arg Glu Pro Asn Glu 85
90 95gaa gag gaa tca aaa gtt gtt agg tct tat
aat ttt aac aac atg ggg 336Glu Glu Glu Ser Lys Val Val Arg Ser Tyr
Asn Phe Asn Asn Met Gly 100 105
110gca gag gaa gag aca tgg caa gat atg agg aag aga gtg ttg cag aag
384Ala Glu Glu Glu Thr Trp Gln Asp Met Arg Lys Arg Val Leu Gln Lys
115 120 125ctt cat agg ttg ttt ggt gga
tct gat gaa gac aat tat gct ttg agc 432Leu His Arg Leu Phe Gly Gly
Ser Asp Glu Asp Asn Tyr Ala Leu Ser 130 135
140tta gag aaa gtt act gct act gag att ttc ttc tta gct tcc atg tat
480Leu Glu Lys Val Thr Ala Thr Glu Ile Phe Phe Leu Ala Ser Met Tyr145
150 155 160ttc ttc ttc aat
cac ggt gaa ggc ggt cct ggg agg tgt tat tct tca 528Phe Phe Phe Asn
His Gly Glu Gly Gly Pro Gly Arg Cys Tyr Ser Ser 165
170 175ggg aaa cat gtg tgg ctc tct gat gca gtt
aac tct gag tct gac tat 576Gly Lys His Val Trp Leu Ser Asp Ala Val
Asn Ser Glu Ser Asp Tyr 180 185
190tgt ttc agg tct ttt atg gcg aaa tct gcg gga atc aga acg atc gtt
624Cys Phe Arg Ser Phe Met Ala Lys Ser Ala Gly Ile Arg Thr Ile Val
195 200 205atg gtt cct act gat gct ggt
gtt ctt gag ctt ggt tct gtt tgg tct 672Met Val Pro Thr Asp Ala Gly
Val Leu Glu Leu Gly Ser Val Trp Ser 210 215
220ttg cct gaa aac att ggc ttg gtt aag tct gtt caa gct ttg ttc atg
720Leu Pro Glu Asn Ile Gly Leu Val Lys Ser Val Gln Ala Leu Phe Met225
230 235 240agg aga gtt acg
caa cca gta atg gtg act tca aac act aac atg act 768Arg Arg Val Thr
Gln Pro Val Met Val Thr Ser Asn Thr Asn Met Thr 245
250 255gga ggg att cac aag ctt ttc ggg cag gat
ttg agt gga gct cac gcg 816Gly Gly Ile His Lys Leu Phe Gly Gln Asp
Leu Ser Gly Ala His Ala 260 265
270tat cct aag aag ctc gaa gtg aga aga aac ttg gat gag aga ttc act
864Tyr Pro Lys Lys Leu Glu Val Arg Arg Asn Leu Asp Glu Arg Phe Thr
275 280 285cct caa agt tgg gaa ggc tat
aat aac aat aaa ggt cca aca ttt ggt 912Pro Gln Ser Trp Glu Gly Tyr
Asn Asn Asn Lys Gly Pro Thr Phe Gly 290 295
300tac aca cct cag agg gat gat gtg aaa gtg cta gag aat gtg aat atg
960Tyr Thr Pro Gln Arg Asp Asp Val Lys Val Leu Glu Asn Val Asn Met305
310 315 320gtt gta gat aat
aac aat tac aag acg cag att gag ttt gcg gga tca 1008Val Val Asp Asn
Asn Asn Tyr Lys Thr Gln Ile Glu Phe Ala Gly Ser 325
330 335tca gtt gct gct tct tcg aat cca tct aca
aac act cag caa gaa aaa 1056Ser Val Ala Ala Ser Ser Asn Pro Ser Thr
Asn Thr Gln Gln Glu Lys 340 345
350tca gaa tct tgt aca gag aaa aga cca gtg agc ttg tta gca gga gca
1104Ser Glu Ser Cys Thr Glu Lys Arg Pro Val Ser Leu Leu Ala Gly Ala
355 360 365gga ata gtt tct gtt gtt gat
gag aag aga ccg aga aag aga ggg aga 1152Gly Ile Val Ser Val Val Asp
Glu Lys Arg Pro Arg Lys Arg Gly Arg 370 375
380aag cct gca aac gga aga gaa gag cca ttg aac cat gtg gaa gct gag
1200Lys Pro Ala Asn Gly Arg Glu Glu Pro Leu Asn His Val Glu Ala Glu385
390 395 400agg cag aga cgc
gag aag ctt aac caa aga ttc tac gct tta cga tca 1248Arg Gln Arg Arg
Glu Lys Leu Asn Gln Arg Phe Tyr Ala Leu Arg Ser 405
410 415gtt gtt cca aac att tct aaa atg gac aag
gct tct cta ctt gga gac 1296Val Val Pro Asn Ile Ser Lys Met Asp Lys
Ala Ser Leu Leu Gly Asp 420 425
430gca att tct tac atc aaa gag ctt caa gag aaa gtc aag ata atg gaa
1344Ala Ile Ser Tyr Ile Lys Glu Leu Gln Glu Lys Val Lys Ile Met Glu
435 440 445gat gaa aga gta gga aca gat
aag agc tta tca gaa tca aac aca ata 1392Asp Glu Arg Val Gly Thr Asp
Lys Ser Leu Ser Glu Ser Asn Thr Ile 450 455
460aca gta gaa gaa agt cca gaa gtt gac att caa gct atg aat gaa gag
1440Thr Val Glu Glu Ser Pro Glu Val Asp Ile Gln Ala Met Asn Glu Glu465
470 475 480gtt gtt gta aga
gta atc tcg cct ttg gat tca cat cca gct tca aga 1488Val Val Val Arg
Val Ile Ser Pro Leu Asp Ser His Pro Ala Ser Arg 485
490 495atc ata caa gca atg aga aac tca aat gtt
agt cta atg gag gct aag 1536Ile Ile Gln Ala Met Arg Asn Ser Asn Val
Ser Leu Met Glu Ala Lys 500 505
510tta tca tta gct gaa gac aca atg ttt cac act ttt gtg ata aag tct
1584Leu Ser Leu Ala Glu Asp Thr Met Phe His Thr Phe Val Ile Lys Ser
515 520 525aac aac ggg tcg gat cca ttg
acg aaa gag aag ctt ata gca gcg ttt 1632Asn Asn Gly Ser Asp Pro Leu
Thr Lys Glu Lys Leu Ile Ala Ala Phe 530 535
540tac ccc gag acc agc tcg acg caa ccg cca ttg cct tct tct agt tca
1680Tyr Pro Glu Thr Ser Ser Thr Gln Pro Pro Leu Pro Ser Ser Ser Ser545
550 555 560cag gtc tct ggg
ttt gtt tct ccg atc aga tcc aat ttc aat cct tct 1728Gln Val Ser Gly
Phe Val Ser Pro Ile Arg Ser Asn Phe Asn Pro Ser 565
570 575ccg tca gat tat ctc ctc ggc cac gat gtc
tct ccg ccg cac tat cct 1776Pro Ser Asp Tyr Leu Leu Gly His Asp Val
Ser Pro Pro His Tyr Pro 580 585
590cga tct cca ccg tca aac cca gag agc cac tct ctc aaa gtc tct acc
1824Arg Ser Pro Pro Ser Asn Pro Glu Ser His Ser Leu Lys Val Ser Thr
595 600 605ttt ctc gat gag tca cat ctc
ctc cgc atc cgc cac cgc cgc cgt gag 1872Phe Leu Asp Glu Ser His Leu
Leu Arg Ile Arg His Arg Arg Arg Glu 610 615
620aaa ccc act ggg aag aga cct ctc ttc aat ccc ctt cgc aca agc cca
1920Lys Pro Thr Gly Lys Arg Pro Leu Phe Asn Pro Leu Arg Thr Ser Pro625
630 635 640gaa act gaa acc
aga ttc aac caa tct cgt cac cga tcg ctc gat ttc 1968Glu Thr Glu Thr
Arg Phe Asn Gln Ser Arg His Arg Ser Leu Asp Phe 645
650 655gtc ttc aat tgg gca atc tga
1989Val Phe Asn Trp Ala Ile
660138662PRTArabidopsis thaliana 138Met Asn Met Ser Asp Leu Gly Trp Asp
Asp Glu Asp Lys Ser Val Val 1 5 10
15Ser Ala Val Leu Gly His Leu Ala Ser Asp Phe Leu Arg Ala Asn
Ser 20 25 30Asn Ser Asn Gln
Asn Leu Phe Leu Val Met Gly Thr Asp Asp Thr Leu 35
40 45Asn Lys Lys Leu Ser Ser Leu Val Asp Trp Pro Asn
Ser Glu Asn Phe 50 55 60Ser Trp Asn
Tyr Ala Ile Phe Trp Gln Gln Thr Met Ser Arg Ser Gly 65
70 75 80Gln Gln Val Leu Gly Trp Gly Asp
Gly Cys Cys Arg Glu Pro Asn Glu 85 90
95Glu Glu Glu Ser Lys Val Val Arg Ser Tyr Asn Phe Asn Asn
Met Gly 100 105 110Ala Glu Glu
Glu Thr Trp Gln Asp Met Arg Lys Arg Val Leu Gln Lys 115
120 125Leu His Arg Leu Phe Gly Gly Ser Asp Glu Asp
Asn Tyr Ala Leu Ser 130 135 140Leu Glu
Lys Val Thr Ala Thr Glu Ile Phe Phe Leu Ala Ser Met Tyr145
150 155 160Phe Phe Phe Asn His Gly Glu
Gly Gly Pro Gly Arg Cys Tyr Ser Ser 165
170 175Gly Lys His Val Trp Leu Ser Asp Ala Val Asn Ser
Glu Ser Asp Tyr 180 185 190Cys
Phe Arg Ser Phe Met Ala Lys Ser Ala Gly Ile Arg Thr Ile Val 195
200 205Met Val Pro Thr Asp Ala Gly Val Leu
Glu Leu Gly Ser Val Trp Ser 210 215
220Leu Pro Glu Asn Ile Gly Leu Val Lys Ser Val Gln Ala Leu Phe Met225
230 235 240Arg Arg Val Thr
Gln Pro Val Met Val Thr Ser Asn Thr Asn Met Thr 245
250 255Gly Gly Ile His Lys Leu Phe Gly Gln Asp
Leu Ser Gly Ala His Ala 260 265
270Tyr Pro Lys Lys Leu Glu Val Arg Arg Asn Leu Asp Glu Arg Phe Thr
275 280 285Pro Gln Ser Trp Glu Gly Tyr
Asn Asn Asn Lys Gly Pro Thr Phe Gly 290 295
300Tyr Thr Pro Gln Arg Asp Asp Val Lys Val Leu Glu Asn Val Asn
Met305 310 315 320Val Val
Asp Asn Asn Asn Tyr Lys Thr Gln Ile Glu Phe Ala Gly Ser
325 330 335Ser Val Ala Ala Ser Ser Asn
Pro Ser Thr Asn Thr Gln Gln Glu Lys 340 345
350Ser Glu Ser Cys Thr Glu Lys Arg Pro Val Ser Leu Leu Ala
Gly Ala 355 360 365Gly Ile Val Ser
Val Val Asp Glu Lys Arg Pro Arg Lys Arg Gly Arg 370
375 380Lys Pro Ala Asn Gly Arg Glu Glu Pro Leu Asn His
Val Glu Ala Glu385 390 395
400Arg Gln Arg Arg Glu Lys Leu Asn Gln Arg Phe Tyr Ala Leu Arg Ser
405 410 415Val Val Pro Asn Ile
Ser Lys Met Asp Lys Ala Ser Leu Leu Gly Asp 420
425 430Ala Ile Ser Tyr Ile Lys Glu Leu Gln Glu Lys Val
Lys Ile Met Glu 435 440 445Asp Glu
Arg Val Gly Thr Asp Lys Ser Leu Ser Glu Ser Asn Thr Ile 450
455 460Thr Val Glu Glu Ser Pro Glu Val Asp Ile Gln
Ala Met Asn Glu Glu465 470 475
480Val Val Val Arg Val Ile Ser Pro Leu Asp Ser His Pro Ala Ser Arg
485 490 495Ile Ile Gln Ala
Met Arg Asn Ser Asn Val Ser Leu Met Glu Ala Lys 500
505 510Leu Ser Leu Ala Glu Asp Thr Met Phe His Thr
Phe Val Ile Lys Ser 515 520 525Asn
Asn Gly Ser Asp Pro Leu Thr Lys Glu Lys Leu Ile Ala Ala Phe 530
535 540Tyr Pro Glu Thr Ser Ser Thr Gln Pro Pro
Leu Pro Ser Ser Ser Ser545 550 555
560Gln Val Ser Gly Phe Val Ser Pro Ile Arg Ser Asn Phe Asn Pro
Ser 565 570 575Pro Ser Asp
Tyr Leu Leu Gly His Asp Val Ser Pro Pro His Tyr Pro 580
585 590Arg Ser Pro Pro Ser Asn Pro Glu Ser His
Ser Leu Lys Val Ser Thr 595 600
605Phe Leu Asp Glu Ser His Leu Leu Arg Ile Arg His Arg Arg Arg Glu 610
615 620Lys Pro Thr Gly Lys Arg Pro Leu
Phe Asn Pro Leu Arg Thr Ser Pro625 630
635 640Glu Thr Glu Thr Arg Phe Asn Gln Ser Arg His Arg
Ser Leu Asp Phe 645 650
655Val Phe Asn Trp Ala Ile 660139870DNAArabidopsis
thalianaCDS(1)..(867) 139atg tca gaa tta tta cag ttg cct cca ggt ttc cga
ttt cac cct acc 48Met Ser Glu Leu Leu Gln Leu Pro Pro Gly Phe Arg
Phe His Pro Thr 1 5 10
15gat gaa gag ctt gtc atg cac tat ctc tgc cgc aaa tgt gcc tct cag
96Asp Glu Glu Leu Val Met His Tyr Leu Cys Arg Lys Cys Ala Ser Gln
20 25 30tcc atc gcc gtt ccg atc atc
gct gag atc gat ctc tac aaa tac gat 144Ser Ile Ala Val Pro Ile Ile
Ala Glu Ile Asp Leu Tyr Lys Tyr Asp 35 40
45cca tgg gag ctt cct ggt tta gcc ttg tat ggt gag aag gaa tgg
tac 192Pro Trp Glu Leu Pro Gly Leu Ala Leu Tyr Gly Glu Lys Glu Trp
Tyr 50 55 60ttc ttc tct ccc agg gac
aga aaa tat ccc aac ggt tcg cgt cct aac 240Phe Phe Ser Pro Arg Asp
Arg Lys Tyr Pro Asn Gly Ser Arg Pro Asn 65 70
75 80cgg tcc gct ggt tct ggt tac tgg aaa gct acc
gga gct gat aaa ccg 288Arg Ser Ala Gly Ser Gly Tyr Trp Lys Ala Thr
Gly Ala Asp Lys Pro 85 90
95atc gga cta cct aaa ccg gtc gga att aag aaa gct ctt gtt ttc tac
336Ile Gly Leu Pro Lys Pro Val Gly Ile Lys Lys Ala Leu Val Phe Tyr
100 105 110gcc ggc aaa gct cca aag
gga gag aaa acc aat tgg atc atg cac gag 384Ala Gly Lys Ala Pro Lys
Gly Glu Lys Thr Asn Trp Ile Met His Glu 115 120
125tac cgt ctc gcc gac gtt gac cgg tcc gtt cgc aag aag aag
aat agt 432Tyr Arg Leu Ala Asp Val Asp Arg Ser Val Arg Lys Lys Lys
Asn Ser 130 135 140ctc agg ctg gat gat
tgg gtt ctc tgc cgg att tac aac aaa aaa gga 480Leu Arg Leu Asp Asp
Trp Val Leu Cys Arg Ile Tyr Asn Lys Lys Gly145 150
155 160gct acc gag agg cgg gga cca ccg cct ccg
gtt gtt tac ggc gac gaa 528Ala Thr Glu Arg Arg Gly Pro Pro Pro Pro
Val Val Tyr Gly Asp Glu 165 170
175atc atg gag gag aag ccg aag gtg acg gag atg gtt atg cct ccg ccg
576Ile Met Glu Glu Lys Pro Lys Val Thr Glu Met Val Met Pro Pro Pro
180 185 190ccg caa cag aca agt gag
ttc gcg tat ttc gac acg tcg gat tcg gtg 624Pro Gln Gln Thr Ser Glu
Phe Ala Tyr Phe Asp Thr Ser Asp Ser Val 195 200
205ccg aag ctg cat act acg gat tcg agt tgc tcg gag cag gtg
gtg tcg 672Pro Lys Leu His Thr Thr Asp Ser Ser Cys Ser Glu Gln Val
Val Ser 210 215 220ccg gag ttc acg agc
gag gtt cag agc gag ccc aag tgg aaa gat tgg 720Pro Glu Phe Thr Ser
Glu Val Gln Ser Glu Pro Lys Trp Lys Asp Trp225 230
235 240tcg gcc gta agt aat gac aat aac aat acc
ctt gat ttt ggg ttt aat 768Ser Ala Val Ser Asn Asp Asn Asn Asn Thr
Leu Asp Phe Gly Phe Asn 245 250
255tac att gat gcc acc gtg gat aac gcg ttt gga gga gga ggg agt agt
816Tyr Ile Asp Ala Thr Val Asp Asn Ala Phe Gly Gly Gly Gly Ser Ser
260 265 270aat cag atg ttt ccg cta
cag gat atg ttc atg tac atg cag aag cct 864Asn Gln Met Phe Pro Leu
Gln Asp Met Phe Met Tyr Met Gln Lys Pro 275 280
285tac tag
870Tyr140289PRTArabidopsis thaliana 140Met Ser Glu Leu Leu Gln
Leu Pro Pro Gly Phe Arg Phe His Pro Thr 1 5
10 15Asp Glu Glu Leu Val Met His Tyr Leu Cys Arg Lys
Cys Ala Ser Gln 20 25 30Ser
Ile Ala Val Pro Ile Ile Ala Glu Ile Asp Leu Tyr Lys Tyr Asp 35
40 45Pro Trp Glu Leu Pro Gly Leu Ala Leu
Tyr Gly Glu Lys Glu Trp Tyr 50 55
60Phe Phe Ser Pro Arg Asp Arg Lys Tyr Pro Asn Gly Ser Arg Pro Asn 65
70 75 80Arg Ser Ala Gly Ser
Gly Tyr Trp Lys Ala Thr Gly Ala Asp Lys Pro 85
90 95Ile Gly Leu Pro Lys Pro Val Gly Ile Lys Lys
Ala Leu Val Phe Tyr 100 105
110Ala Gly Lys Ala Pro Lys Gly Glu Lys Thr Asn Trp Ile Met His Glu
115 120 125Tyr Arg Leu Ala Asp Val Asp
Arg Ser Val Arg Lys Lys Lys Asn Ser 130 135
140Leu Arg Leu Asp Asp Trp Val Leu Cys Arg Ile Tyr Asn Lys Lys
Gly145 150 155 160Ala Thr
Glu Arg Arg Gly Pro Pro Pro Pro Val Val Tyr Gly Asp Glu
165 170 175Ile Met Glu Glu Lys Pro Lys
Val Thr Glu Met Val Met Pro Pro Pro 180 185
190Pro Gln Gln Thr Ser Glu Phe Ala Tyr Phe Asp Thr Ser Asp
Ser Val 195 200 205Pro Lys Leu His
Thr Thr Asp Ser Ser Cys Ser Glu Gln Val Val Ser 210
215 220Pro Glu Phe Thr Ser Glu Val Gln Ser Glu Pro Lys
Trp Lys Asp Trp225 230 235
240Ser Ala Val Ser Asn Asp Asn Asn Asn Thr Leu Asp Phe Gly Phe Asn
245 250 255Tyr Ile Asp Ala Thr
Val Asp Asn Ala Phe Gly Gly Gly Gly Ser Ser 260
265 270Asn Gln Met Phe Pro Leu Gln Asp Met Phe Met Tyr
Met Gln Lys Pro 275 280
285Tyr1411202DNAArabidopsis thalianaCDS(112)..(918) 141tcctcgattt
caatctttga gataaaccac aaagatcctc cgattcgaag gtttataaaa 60actcaaaatc
gaatcttatc cacaagaaaa caacaaggta cttttccaaa a atg aag 117
Met Lys
1gcg gag ttg aat ttg ccg gcg gga ttc
cga ttt cat ccg acg gac gaa 165Ala Glu Leu Asn Leu Pro Ala Gly Phe
Arg Phe His Pro Thr Asp Glu 5 10
15gag ctt gtc aag ttc tat ctt tgc cgg aga tgt gcg tca gaa ccg att
213Glu Leu Val Lys Phe Tyr Leu Cys Arg Arg Cys Ala Ser Glu Pro Ile 20
25 30aac gtt ccg gtt atc gca gag att
gac ttg tac aaa ttc aat cca agg 261Asn Val Pro Val Ile Ala Glu Ile
Asp Leu Tyr Lys Phe Asn Pro Arg 35 40
45 50gag ctt cca gaa atg gcg ttg tac ggt gag aaa gaa tgg
tac ttc ttc 309Glu Leu Pro Glu Met Ala Leu Tyr Gly Glu Lys Glu Trp
Tyr Phe Phe 55 60 65tcg
cat aga gac cgg aaa tac cca aac ggg tcg aga cca aac cgg gca 357Ser
His Arg Asp Arg Lys Tyr Pro Asn Gly Ser Arg Pro Asn Arg Ala
70 75 80gct gga acc ggt tat tgg aaa gcg
act gga gct gat aaa ccg atc gga 405Ala Gly Thr Gly Tyr Trp Lys Ala
Thr Gly Ala Asp Lys Pro Ile Gly 85 90
95aaa ccg aag acg tta ggg att aag aaa gca ctc gtc ttc tac gca gga
453Lys Pro Lys Thr Leu Gly Ile Lys Lys Ala Leu Val Phe Tyr Ala Gly
100 105 110aaa gct ccg aaa ggg att aaa
acg aat tgg att atg cac gag tat cgt 501Lys Ala Pro Lys Gly Ile Lys
Thr Asn Trp Ile Met His Glu Tyr Arg115 120
125 130ctc gct aat gtc gat cga tct gct tct acc aac aag
aag aac aac tta 549Leu Ala Asn Val Asp Arg Ser Ala Ser Thr Asn Lys
Lys Asn Asn Leu 135 140
145aga ctt gat gat tgg gtt ttg tgt cgg ata tac aat aag aaa gga aca
597Arg Leu Asp Asp Trp Val Leu Cys Arg Ile Tyr Asn Lys Lys Gly Thr
150 155 160atg gag aag tat tta ccg
gcg gcg gct gag aaa ccg acg gaa aag atg 645Met Glu Lys Tyr Leu Pro
Ala Ala Ala Glu Lys Pro Thr Glu Lys Met 165 170
175agt acg tcg gac tca aga tgc tca agt cac gtg att tca ccg
gac gtc 693Ser Thr Ser Asp Ser Arg Cys Ser Ser His Val Ile Ser Pro
Asp Val 180 185 190acg tgt tct gat aac
tgg gag gtt gag agt gag ccc aaa tgg att aat 741Thr Cys Ser Asp Asn
Trp Glu Val Glu Ser Glu Pro Lys Trp Ile Asn195 200
205 210ctg gaa gac gcg tta gag gca ttt aat gat
gac acg tcc atg ttt agt 789Leu Glu Asp Ala Leu Glu Ala Phe Asn Asp
Asp Thr Ser Met Phe Ser 215 220
225tcc att ggt ttg ttg caa aat gac gcc ttt gtt cct cag ttt cag tac
837Ser Ile Gly Leu Leu Gln Asn Asp Ala Phe Val Pro Gln Phe Gln Tyr
230 235 240cag tcc tcc gat ttc gtc
gat tcg ttt cag gac ccg ttc gag cag aaa 885Gln Ser Ser Asp Phe Val
Asp Ser Phe Gln Asp Pro Phe Glu Gln Lys 245 250
255ccg ttc ttg aat tgg aat ttt gct cct caa ggg taaaaataat
cggcaaaaag 938Pro Phe Leu Asn Trp Asn Phe Ala Pro Gln Gly 260
265ttgaagcttt tcagagtctt cgatcaccgg cattgtgtcg gatcctgacc
cggagaccaa 998gtcgggtcat acgattacat aatcgggtta ttgagatttc cacatttgga
tttccgagac 1058taaccaactt aacggattct ggggtaattg gggggttttg cacaggtgaa
tcacactgag 1118tcagcaagtt tcgatttttt ggttttgttt tgtaatgatt gattaaatgt
ctaaagatat 1178cacgaagtta aaaaaaaaaa aaaa
1202142269PRTArabidopsis thaliana 142Met Lys Ala Glu Leu Asn
Leu Pro Ala Gly Phe Arg Phe His Pro Thr 1 5
10 15Asp Glu Glu Leu Val Lys Phe Tyr Leu Cys Arg Arg
Cys Ala Ser Glu 20 25 30Pro
Ile Asn Val Pro Val Ile Ala Glu Ile Asp Leu Tyr Lys Phe Asn 35
40 45Pro Arg Glu Leu Pro Glu Met Ala Leu
Tyr Gly Glu Lys Glu Trp Tyr 50 55
60Phe Phe Ser His Arg Asp Arg Lys Tyr Pro Asn Gly Ser Arg Pro Asn 65
70 75 80Arg Ala Ala Gly Thr
Gly Tyr Trp Lys Ala Thr Gly Ala Asp Lys Pro 85
90 95Ile Gly Lys Pro Lys Thr Leu Gly Ile Lys Lys
Ala Leu Val Phe Tyr 100 105
110Ala Gly Lys Ala Pro Lys Gly Ile Lys Thr Asn Trp Ile Met His Glu
115 120 125Tyr Arg Leu Ala Asn Val Asp
Arg Ser Ala Ser Thr Asn Lys Lys Asn 130 135
140Asn Leu Arg Leu Asp Asp Trp Val Leu Cys Arg Ile Tyr Asn Lys
Lys145 150 155 160Gly Thr
Met Glu Lys Tyr Leu Pro Ala Ala Ala Glu Lys Pro Thr Glu
165 170 175Lys Met Ser Thr Ser Asp Ser
Arg Cys Ser Ser His Val Ile Ser Pro 180 185
190Asp Val Thr Cys Ser Asp Asn Trp Glu Val Glu Ser Glu Pro
Lys Trp 195 200 205Ile Asn Leu Glu
Asp Ala Leu Glu Ala Phe Asn Asp Asp Thr Ser Met 210
215 220Phe Ser Ser Ile Gly Leu Leu Gln Asn Asp Ala Phe
Val Pro Gln Phe225 230 235
240Gln Tyr Gln Ser Ser Asp Phe Val Asp Ser Phe Gln Asp Pro Phe Glu
245 250 255Gln Lys Pro Phe Leu
Asn Trp Asn Phe Ala Pro Gln Gly 260
2651431188DNAArabidopsis thalianaCDS(124)..(978) 143aaaaacctca actttcttct
ctcttctcaa aaacccttcc ctcttcgtct ccaaacaaca 60acaaacacaa caacaacaaa
aatcttacaa gaagatcatt tttagaaacc ctattaggat 120aaa atg gat tac gag gca
tca aga atc gtc gaa atg gta gaa gat gaa 168 Met Asp Tyr Glu
Ala Ser Arg Ile Val Glu Met Val Glu Asp Glu 1 5
10 15gaa cat ata gat cta cca cca gga ttc aga
ttt cac cct act gat gaa 216Glu His Ile Asp Leu Pro Pro Gly Phe Arg
Phe His Pro Thr Asp Glu 20 25
30gaa ctc ata act cac tac ctc aaa cca aag gtt ttc aac act ttc ttc
264Glu Leu Ile Thr His Tyr Leu Lys Pro Lys Val Phe Asn Thr Phe Phe
35 40 45tct gct act gcc att ggt
gaa gtt gat ctc aac aag att gag cct tgg 312Ser Ala Thr Ala Ile Gly
Glu Val Asp Leu Asn Lys Ile Glu Pro Trp 50 55
60gac tta cca tgg aag gct aag atg gga gaa aaa gaa tgg tat
ttc ttc 360Asp Leu Pro Trp Lys Ala Lys Met Gly Glu Lys Glu Trp Tyr
Phe Phe 65 70 75tgt gtg aga gac cgg
aaa tac ccg acc ggt tta agg aca aac cgg gcg 408Cys Val Arg Asp Arg
Lys Tyr Pro Thr Gly Leu Arg Thr Asn Arg Ala 80 85
90 95aca gaa gcc ggt tat tgg aaa gcc aca gga
aaa gac aaa gag ata ttc 456Thr Glu Ala Gly Tyr Trp Lys Ala Thr Gly
Lys Asp Lys Glu Ile Phe 100 105
110aag gga aaa tca ctt gtg ggt atg aag aaa act ttg gtt ttc tat aaa
504Lys Gly Lys Ser Leu Val Gly Met Lys Lys Thr Leu Val Phe Tyr Lys
115 120 125gga aga gct cct aaa gga
gtt aaa acc aat tgg gtt atg cat gaa tat 552Gly Arg Ala Pro Lys Gly
Val Lys Thr Asn Trp Val Met His Glu Tyr 130 135
140cgt tta gaa ggc aaa tat tgt att gaa aat ctt ccc caa aca
gct aag 600Arg Leu Glu Gly Lys Tyr Cys Ile Glu Asn Leu Pro Gln Thr
Ala Lys 145 150 155aac gaa tgg gtt ata
tgt cgt gtt ttc caa aaa cgt gcc gat ggt aca 648Asn Glu Trp Val Ile
Cys Arg Val Phe Gln Lys Arg Ala Asp Gly Thr160 165
170 175aag gtt cca atg tca atg ctt gat cca cac
att aac cga atg gaa cca 696Lys Val Pro Met Ser Met Leu Asp Pro His
Ile Asn Arg Met Glu Pro 180 185
190gcc ggt tta cct tcg tta atg gat tgt tct caa cga gac tcc ttc acc
744Ala Gly Leu Pro Ser Leu Met Asp Cys Ser Gln Arg Asp Ser Phe Thr
195 200 205ggt tcg tcg tct cac gtg
acc tgc ttc tcc gac caa gaa acc gaa gac 792Gly Ser Ser Ser His Val
Thr Cys Phe Ser Asp Gln Glu Thr Glu Asp 210 215
220aaa aga ctt gtc cac gag tcc aaa gac ggt ttt ggt tct ctg
ttt tac 840Lys Arg Leu Val His Glu Ser Lys Asp Gly Phe Gly Ser Leu
Phe Tyr 225 230 235tcg gat cct ctg ttt
tta caa gac aat tat tcg cta atg aag ctg ttg 888Ser Asp Pro Leu Phe
Leu Gln Asp Asn Tyr Ser Leu Met Lys Leu Leu240 245
250 255ctt gac ggt caa gaa act caa ttc tcc ggc
aaa cct ttc gac ggt cgt 936Leu Asp Gly Gln Glu Thr Gln Phe Ser Gly
Lys Pro Phe Asp Gly Arg 260 265
270gat tcg tcc ggt aca gaa gaa ttg gat tgc gtt tgg aat ttc
978Asp Ser Ser Gly Thr Glu Glu Leu Asp Cys Val Trp Asn Phe
275 280 285tgagttgtat aagttatgtt
gtagacttgt agtagtcatg tgttcgtgtg tgtgaatgaa 1038tattcttgtt acattttttt
gtaaaaaagg agaaaaaaat atgctagaaa gtcaattgct 1098tttgttatgt agcattagtg
ttttttatgt actcaataga cttcctaatt aaataaaaat 1158cttaatttat ttgcaaaaaa
aaaaaaaaaa 1188144285PRTArabidopsis
thaliana 144Met Asp Tyr Glu Ala Ser Arg Ile Val Glu Met Val Glu Asp Glu
Glu 1 5 10 15His Ile Asp
Leu Pro Pro Gly Phe Arg Phe His Pro Thr Asp Glu Glu 20
25 30Leu Ile Thr His Tyr Leu Lys Pro Lys Val
Phe Asn Thr Phe Phe Ser 35 40
45Ala Thr Ala Ile Gly Glu Val Asp Leu Asn Lys Ile Glu Pro Trp Asp 50
55 60Leu Pro Trp Lys Ala Lys Met Gly Glu
Lys Glu Trp Tyr Phe Phe Cys 65 70 75
80Val Arg Asp Arg Lys Tyr Pro Thr Gly Leu Arg Thr Asn Arg
Ala Thr 85 90 95Glu Ala
Gly Tyr Trp Lys Ala Thr Gly Lys Asp Lys Glu Ile Phe Lys 100
105 110Gly Lys Ser Leu Val Gly Met Lys Lys
Thr Leu Val Phe Tyr Lys Gly 115 120
125Arg Ala Pro Lys Gly Val Lys Thr Asn Trp Val Met His Glu Tyr Arg
130 135 140Leu Glu Gly Lys Tyr Cys Ile
Glu Asn Leu Pro Gln Thr Ala Lys Asn145 150
155 160Glu Trp Val Ile Cys Arg Val Phe Gln Lys Arg Ala
Asp Gly Thr Lys 165 170
175Val Pro Met Ser Met Leu Asp Pro His Ile Asn Arg Met Glu Pro Ala
180 185 190Gly Leu Pro Ser Leu Met
Asp Cys Ser Gln Arg Asp Ser Phe Thr Gly 195 200
205Ser Ser Ser His Val Thr Cys Phe Ser Asp Gln Glu Thr Glu
Asp Lys 210 215 220Arg Leu Val His Glu
Ser Lys Asp Gly Phe Gly Ser Leu Phe Tyr Ser225 230
235 240Asp Pro Leu Phe Leu Gln Asp Asn Tyr Ser
Leu Met Lys Leu Leu Leu 245 250
255Asp Gly Gln Glu Thr Gln Phe Ser Gly Lys Pro Phe Asp Gly Arg Asp
260 265 270Ser Ser Gly Thr Glu
Glu Leu Asp Cys Val Trp Asn Phe 275 280
285145894DNAArabidopsis thalianaCDS(1)..(891) 145atg ggt gtt aga gag
aaa gat ccg tta gcc cag ttg agt ttg cca cca 48Met Gly Val Arg Glu
Lys Asp Pro Leu Ala Gln Leu Ser Leu Pro Pro 1 5
10 15ggt ttt aga ttt tat ccg aca gat gaa gag ctt
ctt gtt cag tat cta 96Gly Phe Arg Phe Tyr Pro Thr Asp Glu Glu Leu
Leu Val Gln Tyr Leu 20 25
30tgt cgg aaa gtt gca ggc tat cat ttc tct ctc cag gtc atc gga gac
144Cys Arg Lys Val Ala Gly Tyr His Phe Ser Leu Gln Val Ile Gly Asp
35 40 45atc gat ctc tac aag ttc gat
cct tgg gat ttg cca agt aag gct ttg 192Ile Asp Leu Tyr Lys Phe Asp
Pro Trp Asp Leu Pro Ser Lys Ala Leu 50 55
60ttt gga gag aag gaa tgg tat ttc ttt agc cca aga gat cgg aaa tat
240Phe Gly Glu Lys Glu Trp Tyr Phe Phe Ser Pro Arg Asp Arg Lys Tyr 65
70 75 80ccg aac ggg tca
aga ccc aat aga gta gcc ggg tcg ggt tat tgg aaa 288Pro Asn Gly Ser
Arg Pro Asn Arg Val Ala Gly Ser Gly Tyr Trp Lys 85
90 95gca acg ggt act gac aaa att atc acg gcg
gat ggt cgt cgt gtc ggg 336Ala Thr Gly Thr Asp Lys Ile Ile Thr Ala
Asp Gly Arg Arg Val Gly 100 105
110att aaa aaa gct ctg gtc ttt tac gcc gga aaa gct ccc aaa ggc act
384Ile Lys Lys Ala Leu Val Phe Tyr Ala Gly Lys Ala Pro Lys Gly Thr
115 120 125aaa acc aac tgg att atg cac
gag tat cgc tta ata gaa cat tct cgt 432Lys Thr Asn Trp Ile Met His
Glu Tyr Arg Leu Ile Glu His Ser Arg 130 135
140agc cat gga agc tcc aag ttg gat gat tgg gtg ttg tgt cga att tac
480Ser His Gly Ser Ser Lys Leu Asp Asp Trp Val Leu Cys Arg Ile Tyr145
150 155 160aag aaa aca tct
gga tct cag aga caa gct gtt act cct gtt caa gct 528Lys Lys Thr Ser
Gly Ser Gln Arg Gln Ala Val Thr Pro Val Gln Ala 165
170 175tgt cgt gaa gag cat agc acg aat ggg tcg
tca tcg tct tct tca tca 576Cys Arg Glu Glu His Ser Thr Asn Gly Ser
Ser Ser Ser Ser Ser Ser 180 185
190cag ctt gac gac gtt ctt gat tcg ttc ccg gag ata aaa gac cag tct
624Gln Leu Asp Asp Val Leu Asp Ser Phe Pro Glu Ile Lys Asp Gln Ser
195 200 205ttt aat ctt cct cgg atg aat
tcg ctc agg acg att ctt aac ggg aac 672Phe Asn Leu Pro Arg Met Asn
Ser Leu Arg Thr Ile Leu Asn Gly Asn 210 215
220ttt gat tgg gct agc ttg gca ggt ctt aat cca att cca gag cta gct
720Phe Asp Trp Ala Ser Leu Ala Gly Leu Asn Pro Ile Pro Glu Leu Ala225
230 235 240ccg acc aat gga
tta ccg agt tac ggt ggt tac gat gcg ttt cga gcg 768Pro Thr Asn Gly
Leu Pro Ser Tyr Gly Gly Tyr Asp Ala Phe Arg Ala 245
250 255gcg gaa ggt gag gcg gag agt ggg cat gtg
aat cgg cag cag aac tcg 816Ala Glu Gly Glu Ala Glu Ser Gly His Val
Asn Arg Gln Gln Asn Ser 260 265
270agc ggg ttg act cag agt ttc ggg tac agc tcg agt ggg ttt ggt gtt
864Ser Gly Leu Thr Gln Ser Phe Gly Tyr Ser Ser Ser Gly Phe Gly Val
275 280 285tcg ggt caa aca ttc gag ttt
agg caa tga 894Ser Gly Gln Thr Phe Glu Phe
Arg Gln 290 295146297PRTArabidopsis thaliana 146Met
Gly Val Arg Glu Lys Asp Pro Leu Ala Gln Leu Ser Leu Pro Pro 1
5 10 15Gly Phe Arg Phe Tyr Pro Thr
Asp Glu Glu Leu Leu Val Gln Tyr Leu 20 25
30Cys Arg Lys Val Ala Gly Tyr His Phe Ser Leu Gln Val Ile
Gly Asp 35 40 45Ile Asp Leu Tyr
Lys Phe Asp Pro Trp Asp Leu Pro Ser Lys Ala Leu 50
55 60Phe Gly Glu Lys Glu Trp Tyr Phe Phe Ser Pro Arg Asp
Arg Lys Tyr 65 70 75
80Pro Asn Gly Ser Arg Pro Asn Arg Val Ala Gly Ser Gly Tyr Trp Lys
85 90 95Ala Thr Gly Thr Asp Lys
Ile Ile Thr Ala Asp Gly Arg Arg Val Gly 100
105 110Ile Lys Lys Ala Leu Val Phe Tyr Ala Gly Lys Ala
Pro Lys Gly Thr 115 120 125Lys Thr
Asn Trp Ile Met His Glu Tyr Arg Leu Ile Glu His Ser Arg 130
135 140Ser His Gly Ser Ser Lys Leu Asp Asp Trp Val
Leu Cys Arg Ile Tyr145 150 155
160Lys Lys Thr Ser Gly Ser Gln Arg Gln Ala Val Thr Pro Val Gln Ala
165 170 175Cys Arg Glu Glu
His Ser Thr Asn Gly Ser Ser Ser Ser Ser Ser Ser 180
185 190Gln Leu Asp Asp Val Leu Asp Ser Phe Pro Glu
Ile Lys Asp Gln Ser 195 200 205Phe
Asn Leu Pro Arg Met Asn Ser Leu Arg Thr Ile Leu Asn Gly Asn 210
215 220Phe Asp Trp Ala Ser Leu Ala Gly Leu Asn
Pro Ile Pro Glu Leu Ala225 230 235
240Pro Thr Asn Gly Leu Pro Ser Tyr Gly Gly Tyr Asp Ala Phe Arg
Ala 245 250 255Ala Glu Gly
Glu Ala Glu Ser Gly His Val Asn Arg Gln Gln Asn Ser 260
265 270Ser Gly Leu Thr Gln Ser Phe Gly Tyr Ser
Ser Ser Gly Phe Gly Val 275 280
285Ser Gly Gln Thr Phe Glu Phe Arg Gln 290
2951471920DNAArabidopsis thalianaCDS(62)..(1645) 147acagaaaaat cgaaactttt
tagggttttt tttttttgtg ataacgagag agaaaaaagt 60g atg gac ttg tcg gtt
gag aac gga ggt tta gct cca ggt ttt agg ttt 109 Met Asp Leu Ser Val
Glu Asn Gly Gly Leu Ala Pro Gly Phe Arg Phe 1 5
10 15cat ccg acg gac gaa gaa ctt gtc gtc tat tat
ctc aaa aga aag atc 157His Pro Thr Asp Glu Glu Leu Val Val Tyr Tyr
Leu Lys Arg Lys Ile 20 25
30 cgt cgg aaa aaa ctc aga gtc gaa gca atc ggc gag act gat gtc tat
205Arg Arg Lys Lys Leu Arg Val Glu Ala Ile Gly Glu Thr Asp Val Tyr
35 40 45aag ttt gat cct gag gaa tta
cct gag aaa gcg ttg tat aag act aga 253Lys Phe Asp Pro Glu Glu Leu
Pro Glu Lys Ala Leu Tyr Lys Thr Arg 50 55
60gat cgt caa tgg ttc ttt ttc agc tta agg gat agg aaa cat gga agt
301Asp Arg Gln Trp Phe Phe Phe Ser Leu Arg Asp Arg Lys His Gly Ser65
70 75 80agg tca agt aga
gct act gaa cgt ggc tat tgg aaa gca aca ggg aag 349Arg Ser Ser Arg
Ala Thr Glu Arg Gly Tyr Trp Lys Ala Thr Gly Lys 85
90 95gat aga gtc att cat tgt gat tcg aga ccc
gtt gga gag aag aag act 397Asp Arg Val Ile His Cys Asp Ser Arg Pro
Val Gly Glu Lys Lys Thr 100 105
110ctt gtt ttc cat aga ggc agg gca cct aat ggc gaa cgg act aat tgg
445Leu Val Phe His Arg Gly Arg Ala Pro Asn Gly Glu Arg Thr Asn Trp
115 120 125gtg atg cat gag tat aca ttg
cac aaa gag gag ctc aag agg tgt ggt 493Val Met His Glu Tyr Thr Leu
His Lys Glu Glu Leu Lys Arg Cys Gly 130 135
140ggt gaa gat gtt aag gat gct tat gtt ctt tac aag att tat aag aaa
541Gly Glu Asp Val Lys Asp Ala Tyr Val Leu Tyr Lys Ile Tyr Lys Lys145
150 155 160agt ggg tct ggt
cct aag aat ggt gag caa tat gga gct cct ttt att 589Ser Gly Ser Gly
Pro Lys Asn Gly Glu Gln Tyr Gly Ala Pro Phe Ile 165
170 175gaa gaa gaa tgg gct gaa gat gat gat gat
gat gtt gat gag cct gct 637Glu Glu Glu Trp Ala Glu Asp Asp Asp Asp
Asp Val Asp Glu Pro Ala 180 185
190aat cag ctc gtt gtt tcg gct agt gtt gat aat agt tta tgg ggg aaa
685Asn Gln Leu Val Val Ser Ala Ser Val Asp Asn Ser Leu Trp Gly Lys
195 200 205ggg ctt aac caa tct gaa ttg
gat gat aat gat att gaa gag ctg atg 733Gly Leu Asn Gln Ser Glu Leu
Asp Asp Asn Asp Ile Glu Glu Leu Met 210 215
220 agt cag gtt aga gat cag tct ggt cca aca ttg cag cag aat ggg gtg
781Ser Gln Val Arg Asp Gln Ser Gly Pro Thr Leu Gln Gln Asn Gly Val225
230 235 240tct gga ctg
aac tct cat gta gac acg tat aat ctg gag aac ctg gag 829Ser Gly Leu
Asn Ser His Val Asp Thr Tyr Asn Leu Glu Asn Leu Glu 245
250 255gaa gat atg tat ttg gaa atc aat gat
ctt atg gaa cct gaa cct gga 877Glu Asp Met Tyr Leu Glu Ile Asn Asp
Leu Met Glu Pro Glu Pro Gly 260 265
270cca act tct gtg gaa gtc atg gag aat aac tgg aac gag gat ggt tct
925Pro Thr Ser Val Glu Val Met Glu Asn Asn Trp Asn Glu Asp Gly Ser
275 280 285ggt ctc ctg aat gat gat gat
ttc gtt ggt gct gat tca tat ttc ctt 973Gly Leu Leu Asn Asp Asp Asp
Phe Val Gly Ala Asp Ser Tyr Phe Leu 290 295
300gat ttg gga gtg aca aat cct cag tta gat ttt gtt agt ggt gat ttg
1021Asp Leu Gly Val Thr Asn Pro Gln Leu Asp Phe Val Ser Gly Asp Leu305
310 315 320aaa aat ggg ttt
gca caa agt ctt cag gtg aat act tct tta atg act 1069Lys Asn Gly Phe
Ala Gln Ser Leu Gln Val Asn Thr Ser Leu Met Thr 325
330 335tac cag gcc aat aat aac cag ttc cag cag
caa tca ggg aag aac caa 1117Tyr Gln Ala Asn Asn Asn Gln Phe Gln Gln
Gln Ser Gly Lys Asn Gln 340 345
350 gct agt aac tgg cca ctc cgt aac agc tat acc aga cag ata aac aat
1165Ala Ser Asn Trp Pro Leu Arg Asn Ser Tyr Thr Arg Gln Ile Asn Asn
355 360 365gga tca tcg tgg gtg cag gag
cta aac aat gac gga ctt acc gtt acc 1213Gly Ser Ser Trp Val Gln Glu
Leu Asn Asn Asp Gly Leu Thr Val Thr 370 375
380cgg ttt ggt gag gcg cct ggt aca ggt gat tca tct gaa ttc cta aac
1261Arg Phe Gly Glu Ala Pro Gly Thr Gly Asp Ser Ser Glu Phe Leu Asn385
390 395 400cct gtt cct tct
ggt ata agt aca act aat gaa gat gac ccg tca aaa 1309Pro Val Pro Ser
Gly Ile Ser Thr Thr Asn Glu Asp Asp Pro Ser Lys 405
410 415gac gag tct agt aag ttt gct tct agt gta
tgg act ttc ctg gaa tcc 1357Asp Glu Ser Ser Lys Phe Ala Ser Ser Val
Trp Thr Phe Leu Glu Ser 420 425
430att cct gca aag cca gca tat gca tca gag aat cca ttt gtg aag ctg
1405Ile Pro Ala Lys Pro Ala Tyr Ala Ser Glu Asn Pro Phe Val Lys Leu
435 440 445aac ctt gtt aga atg tca acc
agt ggt ggt cgt ttc agg ttt act tct 1453Asn Leu Val Arg Met Ser Thr
Ser Gly Gly Arg Phe Arg Phe Thr Ser 450 455
460aaa agc aca ggt aat aat gtt gtt gtt atg gat agt gac tca gca gtg
1501Lys Ser Thr Gly Asn Asn Val Val Val Met Asp Ser Asp Ser Ala Val465
470 475 480aag agg aac
aag tct gga gga aac aac gat aag aag aag aag aag aac 1549Lys Arg Asn
Lys Ser Gly Gly Asn Asn Asp Lys Lys Lys Lys Lys Asn 485
490 495aaa ggt ttc ttt tgc tta tcg atc att
ggg gct tta tgt gct ttg ttt 1597Lys Gly Phe Phe Cys Leu Ser Ile Ile
Gly Ala Leu Cys Ala Leu Phe 500 505
510tgg gtg atc ata gga aca atg gga ggt tca ggg agg cct ttg tta tgg
1645Trp Val Ile Ile Gly Thr Met Gly Gly Ser Gly Arg Pro Leu Leu Trp
515 520 525tgagaaccga aaaatccaag
aggttaagag acataaaggc ttggttttgt gtgaaccatt 1705agagagtcaa gtcattgtaa
ttattctctt ggattattag attcagaagc tgtttagtat 1765cacagtttat gcttggaagt
tttctctggt attgttaaaa aagtaccaat agaaataagc 1825aaaagaattt tcttatcttt
taggatattt gaacaaatga tgttacttaa ctagattatt 1885aacttggagg ttgttgtaca
aaaaaaaaaa aaaaa 1920148528PRTArabidopsis
thaliana 148Met Asp Leu Ser Val Glu Asn Gly Gly Leu Ala Pro Gly Phe Arg
Phe 1 5 10 15His Pro Thr
Asp Glu Glu Leu Val Val Tyr Tyr Leu Lys Arg Lys Ile 20
25 30Arg Arg Lys Lys Leu Arg Val Glu Ala Ile
Gly Glu Thr Asp Val Tyr 35 40
45Lys Phe Asp Pro Glu Glu Leu Pro Glu Lys Ala Leu Tyr Lys Thr Arg 50
55 60Asp Arg Gln Trp Phe Phe Phe Ser Leu
Arg Asp Arg Lys His Gly Ser 65 70 75
80Arg Ser Ser Arg Ala Thr Glu Arg Gly Tyr Trp Lys Ala Thr
Gly Lys 85 90 95Asp Arg
Val Ile His Cys Asp Ser Arg Pro Val Gly Glu Lys Lys Thr 100
105 110Leu Val Phe His Arg Gly Arg Ala Pro
Asn Gly Glu Arg Thr Asn Trp 115 120
125Val Met His Glu Tyr Thr Leu His Lys Glu Glu Leu Lys Arg Cys Gly
130 135 140Gly Glu Asp Val Lys Asp Ala
Tyr Val Leu Tyr Lys Ile Tyr Lys Lys145 150
155 160Ser Gly Ser Gly Pro Lys Asn Gly Glu Gln Tyr Gly
Ala Pro Phe Ile 165 170
175Glu Glu Glu Trp Ala Glu Asp Asp Asp Asp Asp Val Asp Glu Pro Ala
180 185 190Asn Gln Leu Val Val Ser
Ala Ser Val Asp Asn Ser Leu Trp Gly Lys 195 200
205Gly Leu Asn Gln Ser Glu Leu Asp Asp Asn Asp Ile Glu Glu
Leu Met 210 215 220Ser Gln Val Arg Asp
Gln Ser Gly Pro Thr Leu Gln Gln Asn Gly Val225 230
235 240Ser Gly Leu Asn Ser His Val Asp Thr Tyr
Asn Leu Glu Asn Leu Glu 245 250
255Glu Asp Met Tyr Leu Glu Ile Asn Asp Leu Met Glu Pro Glu Pro Gly
260 265 270Pro Thr Ser Val Glu
Val Met Glu Asn Asn Trp Asn Glu Asp Gly Ser 275
280 285Gly Leu Leu Asn Asp Asp Asp Phe Val Gly Ala Asp
Ser Tyr Phe Leu 290 295 300Asp Leu Gly
Val Thr Asn Pro Gln Leu Asp Phe Val Ser Gly Asp Leu305
310 315 320Lys Asn Gly Phe Ala Gln Ser
Leu Gln Val Asn Thr Ser Leu Met Thr 325
330 335Tyr Gln Ala Asn Asn Asn Gln Phe Gln Gln Gln Ser
Gly Lys Asn Gln 340 345 350Ala
Ser Asn Trp Pro Leu Arg Asn Ser Tyr Thr Arg Gln Ile Asn Asn 355
360 365Gly Ser Ser Trp Val Gln Glu Leu Asn
Asn Asp Gly Leu Thr Val Thr 370 375
380Arg Phe Gly Glu Ala Pro Gly Thr Gly Asp Ser Ser Glu Phe Leu Asn385
390 395 400Pro Val Pro Ser
Gly Ile Ser Thr Thr Asn Glu Asp Asp Pro Ser Lys 405
410 415Asp Glu Ser Ser Lys Phe Ala Ser Ser Val
Trp Thr Phe Leu Glu Ser 420 425
430Ile Pro Ala Lys Pro Ala Tyr Ala Ser Glu Asn Pro Phe Val Lys Leu
435 440 445Asn Leu Val Arg Met Ser Thr
Ser Gly Gly Arg Phe Arg Phe Thr Ser 450 455
460Lys Ser Thr Gly Asn Asn Val Val Val Met Asp Ser Asp Ser Ala
Val465 470 475 480Lys Arg
Asn Lys Ser Gly Gly Asn Asn Asp Lys Lys Lys Lys Lys Asn
485 490 495Lys Gly Phe Phe Cys Leu Ser
Ile Ile Gly Ala Leu Cys Ala Leu Phe 500 505
510Trp Val Ile Ile Gly Thr Met Gly Gly Ser Gly Arg Pro Leu
Leu Trp 515 520
5251491219DNAArabidopsis thalianaCDS(177)..(950) 149atgctaagga gccctcccaa
aaaagaacaa caaatcacat ttttatataa ctgttaacat 60aataatctca gcctcatcaa
cacacatata tagatagcca acatcacaca aacatagaga 120ttccaaaaaa taaaaataaa
gaaaacataa atcctctgag gaaaaattcc gatgag atg 179
Met
1aca gaa ggt gga gaa tat tct ccg gcg atg
atg tca gca gag cca ttc 227Thr Glu Gly Gly Glu Tyr Ser Pro Ala Met
Met Ser Ala Glu Pro Phe 5 10
15ttg acc atg aag aag atg aag aag agc aac cac aac aag aac aat cag
275Leu Thr Met Lys Lys Met Lys Lys Ser Asn His Asn Lys Asn Asn Gln
20 25 30aga agg ttt agc gac gag cag
atc aag tca ctg gag atg atg ttt gag 323Arg Arg Phe Ser Asp Glu Gln
Ile Lys Ser Leu Glu Met Met Phe Glu 35 40
45tct gag aca agg ctt gag cca agg aag aag gtt caa tta gct aga gag
371Ser Glu Thr Arg Leu Glu Pro Arg Lys Lys Val Gln Leu Ala Arg Glu50
55 60 65cta ggg ttg cag
ccg agg caa gtg gct ata tgg ttt cag aac aag agg 419Leu Gly Leu Gln
Pro Arg Gln Val Ala Ile Trp Phe Gln Asn Lys Arg 70
75 80gct cgt tgg aaa tcc aag cag ctc gag act
gag tac aac att ctc aga 467Ala Arg Trp Lys Ser Lys Gln Leu Glu Thr
Glu Tyr Asn Ile Leu Arg 85 90
95caa aac tac gac aac ttg gct tct cag ttc gag tcc tta aag aaa gaa
515Gln Asn Tyr Asp Asn Leu Ala Ser Gln Phe Glu Ser Leu Lys Lys Glu
100 105 110aaa caa gct tta gtc tct gag
ttg cag agg cta aaa gag gcg acg caa 563Lys Gln Ala Leu Val Ser Glu
Leu Gln Arg Leu Lys Glu Ala Thr Gln 115 120
125aag aag aca cag gag gag gaa agg cag tgt agt gga gat caa gcg gtg
611Lys Lys Thr Gln Glu Glu Glu Arg Gln Cys Ser Gly Asp Gln Ala Val130
135 140 145gtt gct cta agc
agc aca cat cat gaa tca gaa aac gaa gag aac cgg 659Val Ala Leu Ser
Ser Thr His His Glu Ser Glu Asn Glu Glu Asn Arg 150
155 160agg cgt aaa ccg gaa gag gtt aga ccg gag
atg gag atg aaa gat gat 707Arg Arg Lys Pro Glu Glu Val Arg Pro Glu
Met Glu Met Lys Asp Asp 165 170
175aag ggt cat cat ggg gtt atg tgt gat cat cat gat tat gaa gat gat
755Lys Gly His His Gly Val Met Cys Asp His His Asp Tyr Glu Asp Asp
180 185 190gat aat ggt tat agt aac aac
atc aag aga gag tat ttt ggt ggg ttt 803Asp Asn Gly Tyr Ser Asn Asn
Ile Lys Arg Glu Tyr Phe Gly Gly Phe 195 200
205gag gaa gaa cca gat cac tta atg aac att gtt gaa cca gct gat agt
851Glu Glu Glu Pro Asp His Leu Met Asn Ile Val Glu Pro Ala Asp Ser210
215 220 225tgt ttg aca tca
tct gat gat tgg aga ggt ttc aaa tca gat act act 899Cys Leu Thr Ser
Ser Asp Asp Trp Arg Gly Phe Lys Ser Asp Thr Thr 230
235 240act ctc ttg gac caa tcc agc aac aat tac
cct tgg cgg gat ttt tgg 947Thr Leu Leu Asp Gln Ser Ser Asn Asn Tyr
Pro Trp Arg Asp Phe Trp 245 250
255tca tgaaaacaat aaactctaaa caagaagatg aaacagattg agactaaaga
1000Serttggatatat acatattcaa atcgaaattt accggtctac atcgcatgaa ccgagccacg
1060gatatagaga tattcggtcc agcaaatgac tcgtttctca gcgagaattt tgcaggattt
1120tgagctgaaa ttgtatggtt ttgtctgtat aaatgatgtg tttagaaaga cgtatattct
1180caaaaaaaaa aaaaaaaaaa aagaaaaaaa aaaaaaaaa
1219150258PRTArabidopsis thaliana 150Met Thr Glu Gly Gly Glu Tyr Ser Pro
Ala Met Met Ser Ala Glu Pro 1 5 10
15Phe Leu Thr Met Lys Lys Met Lys Lys Ser Asn His Asn Lys Asn
Asn 20 25 30Gln Arg Arg Phe
Ser Asp Glu Gln Ile Lys Ser Leu Glu Met Met Phe 35
40 45Glu Ser Glu Thr Arg Leu Glu Pro Arg Lys Lys Val
Gln Leu Ala Arg 50 55 60Glu Leu Gly
Leu Gln Pro Arg Gln Val Ala Ile Trp Phe Gln Asn Lys 65
70 75 80Arg Ala Arg Trp Lys Ser Lys Gln
Leu Glu Thr Glu Tyr Asn Ile Leu 85 90
95Arg Gln Asn Tyr Asp Asn Leu Ala Ser Gln Phe Glu Ser Leu
Lys Lys 100 105 110Glu Lys Gln
Ala Leu Val Ser Glu Leu Gln Arg Leu Lys Glu Ala Thr 115
120 125Gln Lys Lys Thr Gln Glu Glu Glu Arg Gln Cys
Ser Gly Asp Gln Ala 130 135 140Val Val
Ala Leu Ser Ser Thr His His Glu Ser Glu Asn Glu Glu Asn145
150 155 160Arg Arg Arg Lys Pro Glu Glu
Val Arg Pro Glu Met Glu Met Lys Asp 165
170 175Asp Lys Gly His His Gly Val Met Cys Asp His His
Asp Tyr Glu Asp 180 185 190Asp
Asp Asn Gly Tyr Ser Asn Asn Ile Lys Arg Glu Tyr Phe Gly Gly 195
200 205Phe Glu Glu Glu Pro Asp His Leu Met
Asn Ile Val Glu Pro Ala Asp 210 215
220Ser Cys Leu Thr Ser Ser Asp Asp Trp Arg Gly Phe Lys Ser Asp Thr225
230 235 240Thr Thr Leu Leu
Asp Gln Ser Ser Asn Asn Tyr Pro Trp Arg Asp Phe 245
250 255Trp Ser151978DNAArabidopsis
thalianaCDS(89)..(793) 151aaagaaagaa agaaaaaaaa gaaacaaata attccaaaac
cttctctctt aatcaaaatc 60aagaaactta caagatctgg tgaaaacc atg gaa gaa
gga gat ttt ttc aac 112 Met Glu Glu
Gly Asp Phe Phe Asn 1 5tgc
tgt ttc agc gag att agt agt ggc atg acc atg aat aag aag aag 160Cys
Cys Phe Ser Glu Ile Ser Ser Gly Met Thr Met Asn Lys Lys Lys 10
15 20atg aag aag agc aat aac caa aag agg ttt
agc gag gaa cag atc aag 208Met Lys Lys Ser Asn Asn Gln Lys Arg Phe
Ser Glu Glu Gln Ile Lys 25 30 35
40tca ctt gag ctt ata ttt gag tct gag acg agg ctt gag ccg agg
aag 256Ser Leu Glu Leu Ile Phe Glu Ser Glu Thr Arg Leu Glu Pro Arg
Lys 45 50 55aag gtt cag
gta gct aga gag cta ggg ctg caa cca aga caa gtg gct 304Lys Val Gln
Val Ala Arg Glu Leu Gly Leu Gln Pro Arg Gln Val Ala 60
65 70ata tgg ttt caa aac aag agg gct cga tgg
aaa act aag caa ctt gag 352Ile Trp Phe Gln Asn Lys Arg Ala Arg Trp
Lys Thr Lys Gln Leu Glu 75 80
85aaa gag tat aac act ctt aga gcc aat tac aac aat ttg gct tca caa
400Lys Glu Tyr Asn Thr Leu Arg Ala Asn Tyr Asn Asn Leu Ala Ser Gln 90
95 100ttt gaa atc atg aag aaa gaa aag
caa tct ctg gtc tct gag ctg cag 448Phe Glu Ile Met Lys Lys Glu Lys
Gln Ser Leu Val Ser Glu Leu Gln105 110
115 120aga cta aac gaa gag atg caa agg cct aaa gaa gaa
aag cat cat gag 496Arg Leu Asn Glu Glu Met Gln Arg Pro Lys Glu Glu
Lys His His Glu 125 130
135tgt tgt ggt gat caa gga ctg gct cta agc agc agc aca gag tcg cat
544Cys Cys Gly Asp Gln Gly Leu Ala Leu Ser Ser Ser Thr Glu Ser His
140 145 150aat gga aag agt gag cca
gaa ggg agg tta gac caa ggg agt gtt cta 592Asn Gly Lys Ser Glu Pro
Glu Gly Arg Leu Asp Gln Gly Ser Val Leu 155 160
165tgt aat gat ggt gat tac aac aac aac att aaa aca gag tat
ttt ggg 640Cys Asn Asp Gly Asp Tyr Asn Asn Asn Ile Lys Thr Glu Tyr
Phe Gly 170 175 180ttc gag gaa gag act
gat cat gag ctg atg aac att gtg gag aaa gct 688Phe Glu Glu Glu Thr
Asp His Glu Leu Met Asn Ile Val Glu Lys Ala185 190
195 200gat gat agt tgc ttg aca tct tct gag aat
tgg gga ggt ttc aat tct 736Asp Asp Ser Cys Leu Thr Ser Ser Glu Asn
Trp Gly Gly Phe Asn Ser 205 210
215gat tct ctc tta gac caa tct agc agc aat tac cct aac tgg tgg gag
784Asp Ser Leu Leu Asp Gln Ser Ser Ser Asn Tyr Pro Asn Trp Trp Glu
220 225 230ttt tgg tca taaaagcata
taagaaaaaa acagaacata agcgaagaga 833Phe Trp Ser
235aagagtgtga atagtttgta aattatgtgt taagaaaaat aaatttagtt tagtttaaat
893cttgtttcga tctatgtatc tactatgttc aatactcttt gtagctaatt agtagcttat
953aatgagacta gaaaagtttt gaagc
978152235PRTArabidopsis thaliana 152Met Glu Glu Gly Asp Phe Phe Asn Cys
Cys Phe Ser Glu Ile Ser Ser 1 5 10
15Gly Met Thr Met Asn Lys Lys Lys Met Lys Lys Ser Asn Asn Gln
Lys 20 25 30Arg Phe Ser Glu
Glu Gln Ile Lys Ser Leu Glu Leu Ile Phe Glu Ser 35
40 45Glu Thr Arg Leu Glu Pro Arg Lys Lys Val Gln Val
Ala Arg Glu Leu 50 55 60Gly Leu Gln
Pro Arg Gln Val Ala Ile Trp Phe Gln Asn Lys Arg Ala 65
70 75 80Arg Trp Lys Thr Lys Gln Leu Glu
Lys Glu Tyr Asn Thr Leu Arg Ala 85 90
95Asn Tyr Asn Asn Leu Ala Ser Gln Phe Glu Ile Met Lys Lys
Glu Lys 100 105 110Gln Ser Leu
Val Ser Glu Leu Gln Arg Leu Asn Glu Glu Met Gln Arg 115
120 125Pro Lys Glu Glu Lys His His Glu Cys Cys Gly
Asp Gln Gly Leu Ala 130 135 140Leu Ser
Ser Ser Thr Glu Ser His Asn Gly Lys Ser Glu Pro Glu Gly145
150 155 160Arg Leu Asp Gln Gly Ser Val
Leu Cys Asn Asp Gly Asp Tyr Asn Asn 165
170 175Asn Ile Lys Thr Glu Tyr Phe Gly Phe Glu Glu Glu
Thr Asp His Glu 180 185 190Leu
Met Asn Ile Val Glu Lys Ala Asp Asp Ser Cys Leu Thr Ser Ser 195
200 205Glu Asn Trp Gly Gly Phe Asn Ser Asp
Ser Leu Leu Asp Gln Ser Ser 210 215
220Ser Asn Tyr Pro Asn Trp Trp Glu Phe Trp Ser225 230
2351532043DNAArabidopsis thalianaCDS(1)..(2040) 153atg gct
gct tac ttt cac gga aac cca ccg gag atc tct gcc gga tcc 48Met Ala
Ala Tyr Phe His Gly Asn Pro Pro Glu Ile Ser Ala Gly Ser 1
5 10 15gac ggt ggt ctt caa acg ttg atc
ctc atg aat cca act act tac gtt 96Asp Gly Gly Leu Gln Thr Leu Ile
Leu Met Asn Pro Thr Thr Tyr Val 20 25
30cag tac acc caa caa gac aac gac tcg aac aac aac aac aac agc
aac 144Gln Tyr Thr Gln Gln Asp Asn Asp Ser Asn Asn Asn Asn Asn Ser
Asn 35 40 45aat agc aac aac aac
aac aca aac aca aac aca aac aac aac aac agt 192Asn Ser Asn Asn Asn
Asn Thr Asn Thr Asn Thr Asn Asn Asn Asn Ser 50 55
60agt ttc gtt ttc ctc gat tcc cac gcg ccg cag cca aac gcg
agc cag 240Ser Phe Val Phe Leu Asp Ser His Ala Pro Gln Pro Asn Ala
Ser Gln 65 70 75 80cag
ttc gtc gga ata cca ctc tca ggt cac gaa gct gct tcc att aca 288Gln
Phe Val Gly Ile Pro Leu Ser Gly His Glu Ala Ala Ser Ile Thr
85 90 95gcc gcc gac aac atc tcc gta
ctt cac ggt tat cct ccg cgc gtg cag 336Ala Ala Asp Asn Ile Ser Val
Leu His Gly Tyr Pro Pro Arg Val Gln 100 105
110tac agt ctc tac ggt agc cac caa gtg gat ccc act cac cag
caa gcc 384Tyr Ser Leu Tyr Gly Ser His Gln Val Asp Pro Thr His Gln
Gln Ala 115 120 125gcg tgt gag acg
cca cgc gcg cag caa ggc ctc tct tta acc ctc tcg 432Ala Cys Glu Thr
Pro Arg Ala Gln Gln Gly Leu Ser Leu Thr Leu Ser 130
135 140tct caa cag cag cag caa cag caa cat cat caa caa
cac cag cct att 480Ser Gln Gln Gln Gln Gln Gln Gln His His Gln Gln
His Gln Pro Ile145 150 155
160cac gtc gga ttc ggg tcc gga cat gga gaa gat atc cgg gtc ggg tct
528His Val Gly Phe Gly Ser Gly His Gly Glu Asp Ile Arg Val Gly Ser
165 170 175ggc tct aca gga tcg
ggg gta aca aac ggt ata gct aat ctt gtt agc 576Gly Ser Thr Gly Ser
Gly Val Thr Asn Gly Ile Ala Asn Leu Val Ser 180
185 190tcc aag tac ttg aag gca gca caa gag ctt ctt gac
gaa gta gtc aac 624Ser Lys Tyr Leu Lys Ala Ala Gln Glu Leu Leu Asp
Glu Val Val Asn 195 200 205gct gat
tcc gat gac atg aac gct aaa tcc caa cta ttc tca tcg aaa 672Ala Asp
Ser Asp Asp Met Asn Ala Lys Ser Gln Leu Phe Ser Ser Lys 210
215 220aag ggt agt tgc gga aat gat aaa cct gtc gga
gaa tca tcg gcc ggc 720Lys Gly Ser Cys Gly Asn Asp Lys Pro Val Gly
Glu Ser Ser Ala Gly225 230 235
240gct gga gga gaa ggt tcc ggt ggc gga gca gaa gca gcc ggg aaa cgt
768Ala Gly Gly Glu Gly Ser Gly Gly Gly Ala Glu Ala Ala Gly Lys Arg
245 250 255ccg gtg gag cta ggc
acg gca gag aga caa gaa ata cag atg aag aaa 816Pro Val Glu Leu Gly
Thr Ala Glu Arg Gln Glu Ile Gln Met Lys Lys 260
265 270gca aaa ctt agt aac atg ctt cat gag gtg gag cag
aga tat aga cag 864Ala Lys Leu Ser Asn Met Leu His Glu Val Glu Gln
Arg Tyr Arg Gln 275 280 285tac cac
cag cag atg cag atg gtg atc tct tcg ttc gag caa gcg gca 912Tyr His
Gln Gln Met Gln Met Val Ile Ser Ser Phe Glu Gln Ala Ala 290
295 300ggg ata gga tca gcg aag tca tac acg tcg cta
gca ttg aaa acc ata 960Gly Ile Gly Ser Ala Lys Ser Tyr Thr Ser Leu
Ala Leu Lys Thr Ile305 310 315
320tca aga cag ttc cgt tgc ttg aaa gag gcg atc gct ggt cag ata aaa
1008Ser Arg Gln Phe Arg Cys Leu Lys Glu Ala Ile Ala Gly Gln Ile Lys
325 330 335gcg gcc aac aag agt
ctt ggg gag gaa gat tca gtg tct ggt gtt ggg 1056Ala Ala Asn Lys Ser
Leu Gly Glu Glu Asp Ser Val Ser Gly Val Gly 340
345 350agg ttt gag ggg tcg agg ctc aag ttc gtg gac cac
cac ttg aga cag 1104Arg Phe Glu Gly Ser Arg Leu Lys Phe Val Asp His
His Leu Arg Gln 355 360 365caa aga
gct ctt caa caa ctg gga atg att caa cat cct tcc aat aat 1152Gln Arg
Ala Leu Gln Gln Leu Gly Met Ile Gln His Pro Ser Asn Asn 370
375 380gct tgg aga cct caa cgt ggt ctc cca gaa cga
gcc gtc tca gtt ctc 1200Ala Trp Arg Pro Gln Arg Gly Leu Pro Glu Arg
Ala Val Ser Val Leu385 390 395
400cgt gct tgg ctc ttc gaa cac ttt ctt cat cca tac cct aag gat tcg
1248Arg Ala Trp Leu Phe Glu His Phe Leu His Pro Tyr Pro Lys Asp Ser
405 410 415gac aag cac atg cta
gct aag caa aca gga ctc act cgt agc cag gtg 1296Asp Lys His Met Leu
Ala Lys Gln Thr Gly Leu Thr Arg Ser Gln Val 420
425 430tcg aac tgg ttt ata aac gcg aga gtt cgg tta tgg
aaa cca atg gtg 1344Ser Asn Trp Phe Ile Asn Ala Arg Val Arg Leu Trp
Lys Pro Met Val 435 440 445gag gag
atg tac atg gag gaa atg aag gag cag gca aag aac atg gga 1392Glu Glu
Met Tyr Met Glu Glu Met Lys Glu Gln Ala Lys Asn Met Gly 450
455 460tcc atg gaa aag act cct ttg gat caa agc aac
gaa gat tct gct tca 1440Ser Met Glu Lys Thr Pro Leu Asp Gln Ser Asn
Glu Asp Ser Ala Ser465 470 475
480aag tca aca agt aac caa gaa aag agc cca atg gcg gac act aat tac
1488Lys Ser Thr Ser Asn Gln Glu Lys Ser Pro Met Ala Asp Thr Asn Tyr
485 490 495cat atg aat ccc aat
cac aac ggt gac cta gaa ggc gtc act gga atg 1536His Met Asn Pro Asn
His Asn Gly Asp Leu Glu Gly Val Thr Gly Met 500
505 510caa gga agc ccc aag aga cta aga acc agc gac gag
aca atg atg cag 1584Gln Gly Ser Pro Lys Arg Leu Arg Thr Ser Asp Glu
Thr Met Met Gln 515 520 525cca ata
aat gcg gat ttc agc tcc aac gag aag ctc acg atg aag att 1632Pro Ile
Asn Ala Asp Phe Ser Ser Asn Glu Lys Leu Thr Met Lys Ile 530
535 540cta gaa gaa cgg caa ggg ata aga tca gat ggt
ggc tac cct ttc atg 1680Leu Glu Glu Arg Gln Gly Ile Arg Ser Asp Gly
Gly Tyr Pro Phe Met545 550 555
560ggt aat ttc ggg caa tac caa atg gat gag atg tca aga ttt gat gta
1728Gly Asn Phe Gly Gln Tyr Gln Met Asp Glu Met Ser Arg Phe Asp Val
565 570 575gtc tca gac cag gag
ctc atg gcg caa agg tac tca gga aac aac aat 1776Val Ser Asp Gln Glu
Leu Met Ala Gln Arg Tyr Ser Gly Asn Asn Asn 580
585 590ggc gtg tcc ctc acg tta ggt tta cct cat tgt gat
agc ttg tcg tcc 1824Gly Val Ser Leu Thr Leu Gly Leu Pro His Cys Asp
Ser Leu Ser Ser 595 600 605acg cac
cat cag ggt ttc atg cag acc cac cat ggg att cct ata ggg 1872Thr His
His Gln Gly Phe Met Gln Thr His His Gly Ile Pro Ile Gly 610
615 620aga aga gtg aaa ata gga gaa aca gag gaa tat
gga ccc gcc acc atc 1920Arg Arg Val Lys Ile Gly Glu Thr Glu Glu Tyr
Gly Pro Ala Thr Ile625 630 635
640aat ggt ggt agc tcg acc aca acc gca cat tca tca gcg gca gct gcc
1968Asn Gly Gly Ser Ser Thr Thr Thr Ala His Ser Ser Ala Ala Ala Ala
645 650 655gcg gct tac aat ggg
atg aac ata cag aac cag aag aga tat gtg gct 2016Ala Ala Tyr Asn Gly
Met Asn Ile Gln Asn Gln Lys Arg Tyr Val Ala 660
665 670cag tta ttg ccc gac ttc gtt gca taa
2043Gln Leu Leu Pro Asp Phe Val Ala 675
680154680PRTArabidopsis thaliana 154Met Ala Ala Tyr Phe His Gly
Asn Pro Pro Glu Ile Ser Ala Gly Ser 1 5
10 15Asp Gly Gly Leu Gln Thr Leu Ile Leu Met Asn Pro Thr
Thr Tyr Val 20 25 30Gln Tyr
Thr Gln Gln Asp Asn Asp Ser Asn Asn Asn Asn Asn Ser Asn 35
40 45Asn Ser Asn Asn Asn Asn Thr Asn Thr Asn
Thr Asn Asn Asn Asn Ser 50 55 60Ser
Phe Val Phe Leu Asp Ser His Ala Pro Gln Pro Asn Ala Ser Gln 65
70 75 80Gln Phe Val Gly Ile Pro
Leu Ser Gly His Glu Ala Ala Ser Ile Thr 85
90 95Ala Ala Asp Asn Ile Ser Val Leu His Gly Tyr Pro
Pro Arg Val Gln 100 105 110Tyr
Ser Leu Tyr Gly Ser His Gln Val Asp Pro Thr His Gln Gln Ala 115
120 125Ala Cys Glu Thr Pro Arg Ala Gln Gln
Gly Leu Ser Leu Thr Leu Ser 130 135
140Ser Gln Gln Gln Gln Gln Gln Gln His His Gln Gln His Gln Pro Ile145
150 155 160His Val Gly Phe
Gly Ser Gly His Gly Glu Asp Ile Arg Val Gly Ser 165
170 175Gly Ser Thr Gly Ser Gly Val Thr Asn Gly
Ile Ala Asn Leu Val Ser 180 185
190Ser Lys Tyr Leu Lys Ala Ala Gln Glu Leu Leu Asp Glu Val Val Asn
195 200 205Ala Asp Ser Asp Asp Met Asn
Ala Lys Ser Gln Leu Phe Ser Ser Lys 210 215
220Lys Gly Ser Cys Gly Asn Asp Lys Pro Val Gly Glu Ser Ser Ala
Gly225 230 235 240Ala Gly
Gly Glu Gly Ser Gly Gly Gly Ala Glu Ala Ala Gly Lys Arg
245 250 255Pro Val Glu Leu Gly Thr Ala
Glu Arg Gln Glu Ile Gln Met Lys Lys 260 265
270Ala Lys Leu Ser Asn Met Leu His Glu Val Glu Gln Arg Tyr
Arg Gln 275 280 285Tyr His Gln Gln
Met Gln Met Val Ile Ser Ser Phe Glu Gln Ala Ala 290
295 300Gly Ile Gly Ser Ala Lys Ser Tyr Thr Ser Leu Ala
Leu Lys Thr Ile305 310 315
320Ser Arg Gln Phe Arg Cys Leu Lys Glu Ala Ile Ala Gly Gln Ile Lys
325 330 335Ala Ala Asn Lys Ser
Leu Gly Glu Glu Asp Ser Val Ser Gly Val Gly 340
345 350Arg Phe Glu Gly Ser Arg Leu Lys Phe Val Asp His
His Leu Arg Gln 355 360 365Gln Arg
Ala Leu Gln Gln Leu Gly Met Ile Gln His Pro Ser Asn Asn 370
375 380Ala Trp Arg Pro Gln Arg Gly Leu Pro Glu Arg
Ala Val Ser Val Leu385 390 395
400Arg Ala Trp Leu Phe Glu His Phe Leu His Pro Tyr Pro Lys Asp Ser
405 410 415Asp Lys His Met
Leu Ala Lys Gln Thr Gly Leu Thr Arg Ser Gln Val 420
425 430Ser Asn Trp Phe Ile Asn Ala Arg Val Arg Leu
Trp Lys Pro Met Val 435 440 445Glu
Glu Met Tyr Met Glu Glu Met Lys Glu Gln Ala Lys Asn Met Gly 450
455 460Ser Met Glu Lys Thr Pro Leu Asp Gln Ser
Asn Glu Asp Ser Ala Ser465 470 475
480Lys Ser Thr Ser Asn Gln Glu Lys Ser Pro Met Ala Asp Thr Asn
Tyr 485 490 495His Met Asn
Pro Asn His Asn Gly Asp Leu Glu Gly Val Thr Gly Met 500
505 510Gln Gly Ser Pro Lys Arg Leu Arg Thr Ser
Asp Glu Thr Met Met Gln 515 520
525Pro Ile Asn Ala Asp Phe Ser Ser Asn Glu Lys Leu Thr Met Lys Ile 530
535 540Leu Glu Glu Arg Gln Gly Ile Arg
Ser Asp Gly Gly Tyr Pro Phe Met545 550
555 560Gly Asn Phe Gly Gln Tyr Gln Met Asp Glu Met Ser
Arg Phe Asp Val 565 570
575Val Ser Asp Gln Glu Leu Met Ala Gln Arg Tyr Ser Gly Asn Asn Asn
580 585 590Gly Val Ser Leu Thr Leu
Gly Leu Pro His Cys Asp Ser Leu Ser Ser 595 600
605Thr His His Gln Gly Phe Met Gln Thr His His Gly Ile Pro
Ile Gly 610 615 620Arg Arg Val Lys Ile
Gly Glu Thr Glu Glu Tyr Gly Pro Ala Thr Ile625 630
635 640Asn Gly Gly Ser Ser Thr Thr Thr Ala His
Ser Ser Ala Ala Ala Ala 645 650
655Ala Ala Tyr Asn Gly Met Asn Ile Gln Asn Gln Lys Arg Tyr Val Ala
660 665 670Gln Leu Leu Pro Asp
Phe Val Ala 675 6801551162DNAArabidopsis
thalianaCDS(49)..(933) 155aaattcgctt ttttttttct tctttgtata tttttttttt
ttttgacc atg gcg gag 57
Met Ala Glu
1gaa ttt gga agc ata gat tta ctc gga gat gaa gat ttc ttc ttc gat
105Glu Phe Gly Ser Ile Asp Leu Leu Gly Asp Glu Asp Phe Phe Phe Asp 5
10 15ttc gat cct tca atc gta att gat
tct ctt ccg gcg gag gat ttt ctt 153Phe Asp Pro Ser Ile Val Ile Asp
Ser Leu Pro Ala Glu Asp Phe Leu 20 25
30 35cag tct tca ccg gat tca tgg atc gga gaa atc gag aat
caa ttg atg 201Gln Ser Ser Pro Asp Ser Trp Ile Gly Glu Ile Glu Asn
Gln Leu Met 40 45 50aac
gat gag aat cat caa gag gag agt ttt gtg gaa ttg gat cag caa 249Asn
Asp Glu Asn His Gln Glu Glu Ser Phe Val Glu Leu Asp Gln Gln
55 60 65tcg gtt tca gat ttc ata gcg gat
cta ctc gtt gat tat cca act agc 297Ser Val Ser Asp Phe Ile Ala Asp
Leu Leu Val Asp Tyr Pro Thr Ser 70 75
80gat tct ggc tcc gtt gat ttg gcg gct gat aaa gtt cta acc gtc gat
345Asp Ser Gly Ser Val Asp Leu Ala Ala Asp Lys Val Leu Thr Val Asp
85 90 95tct ccc gcc gcc gct gat gat tcc
ggg aag gag aat tcg gat ttg gtt 393Ser Pro Ala Ala Ala Asp Asp Ser
Gly Lys Glu Asn Ser Asp Leu Val100 105
110 115gtt gag aag aag tct aat gat tct ggt agc gag att
cat gat gat gat 441Val Glu Lys Lys Ser Asn Asp Ser Gly Ser Glu Ile
His Asp Asp Asp 120 125
130gac gaa gaa gga gac gat gat gct gtg gct aaa aaa cga aga agg aga
489Asp Glu Glu Gly Asp Asp Asp Ala Val Ala Lys Lys Arg Arg Arg Arg
135 140 145gta aga aat aga gat gcg
gcg gtt aga tcg aga gag agg aag aag gaa 537Val Arg Asn Arg Asp Ala
Ala Val Arg Ser Arg Glu Arg Lys Lys Glu 150 155
160tat gta caa gat tta gag aag aag agt aag tat ctc gaa aga
gaa tgc 585Tyr Val Gln Asp Leu Glu Lys Lys Ser Lys Tyr Leu Glu Arg
Glu Cys 165 170 175ttg aga cta gga cgt
atg ctt gag tgc ttc gtt gct gaa aac cag tct 633Leu Arg Leu Gly Arg
Met Leu Glu Cys Phe Val Ala Glu Asn Gln Ser180 185
190 195cta cgt tac tgt ttg caa aag ggt aat ggc
aat aat act acc atg atg 681Leu Arg Tyr Cys Leu Gln Lys Gly Asn Gly
Asn Asn Thr Thr Met Met 200 205
210tcg aag cag gag tct gct gtg ctc ttg ttg gaa tcc ctg ctg ttg ggt
729Ser Lys Gln Glu Ser Ala Val Leu Leu Leu Glu Ser Leu Leu Leu Gly
215 220 225tcc ctg ctt tgg ctt ctg
gga gta aac ttc att tgc cta ttc cct tat 777Ser Leu Leu Trp Leu Leu
Gly Val Asn Phe Ile Cys Leu Phe Pro Tyr 230 235
240atg tcc cac aca aag tgt tgc ctc cta cgt cca gaa cca gaa
aag ctg 825Met Ser His Thr Lys Cys Cys Leu Leu Arg Pro Glu Pro Glu
Lys Leu 245 250 255gtt cta aac ggg ctc
ggg agt agt agc aaa ccg tct tat acc ggc gtt 873Val Leu Asn Gly Leu
Gly Ser Ser Ser Lys Pro Ser Tyr Thr Gly Val260 265
270 275agt cgg aga tgt aag ggt tcg agg cct agg
atg aaa tac caa atc tta 921Ser Arg Arg Cys Lys Gly Ser Arg Pro Arg
Met Lys Tyr Gln Ile Leu 280 285
290acc ctt gcg gcg tgacaacgcc ttttttaact gcttcttttg cgcattttga
973Thr Leu Ala Ala 295gttgtagatg agtgtctttt agttttctct
ctcttgtttt gtatttcgct gttgaaagtt 1033ttctgtctaa tatcgataag ttaacagtga
atgtgggtct tatggttatg gatgatatct 1093atctaataat gcttctgcct ttaaaatgtt
gattttgagg cataacttca ggcaaaaaaa 1153aaaaagaaa
1162156295PRTArabidopsis thaliana 156Met
Ala Glu Glu Phe Gly Ser Ile Asp Leu Leu Gly Asp Glu Asp Phe 1
5 10 15Phe Phe Asp Phe Asp Pro Ser
Ile Val Ile Asp Ser Leu Pro Ala Glu 20 25
30Asp Phe Leu Gln Ser Ser Pro Asp Ser Trp Ile Gly Glu Ile
Glu Asn 35 40 45Gln Leu Met Asn
Asp Glu Asn His Gln Glu Glu Ser Phe Val Glu Leu 50
55 60Asp Gln Gln Ser Val Ser Asp Phe Ile Ala Asp Leu Leu
Val Asp Tyr 65 70 75
80Pro Thr Ser Asp Ser Gly Ser Val Asp Leu Ala Ala Asp Lys Val Leu
85 90 95Thr Val Asp Ser Pro Ala
Ala Ala Asp Asp Ser Gly Lys Glu Asn Ser 100
105 110Asp Leu Val Val Glu Lys Lys Ser Asn Asp Ser Gly
Ser Glu Ile His 115 120 125Asp Asp
Asp Asp Glu Glu Gly Asp Asp Asp Ala Val Ala Lys Lys Arg 130
135 140Arg Arg Arg Val Arg Asn Arg Asp Ala Ala Val
Arg Ser Arg Glu Arg145 150 155
160Lys Lys Glu Tyr Val Gln Asp Leu Glu Lys Lys Ser Lys Tyr Leu Glu
165 170 175Arg Glu Cys Leu
Arg Leu Gly Arg Met Leu Glu Cys Phe Val Ala Glu 180
185 190Asn Gln Ser Leu Arg Tyr Cys Leu Gln Lys Gly
Asn Gly Asn Asn Thr 195 200 205Thr
Met Met Ser Lys Gln Glu Ser Ala Val Leu Leu Leu Glu Ser Leu 210
215 220Leu Leu Gly Ser Leu Leu Trp Leu Leu Gly
Val Asn Phe Ile Cys Leu225 230 235
240Phe Pro Tyr Met Ser His Thr Lys Cys Cys Leu Leu Arg Pro Glu
Pro 245 250 255Glu Lys Leu
Val Leu Asn Gly Leu Gly Ser Ser Ser Lys Pro Ser Tyr 260
265 270Thr Gly Val Ser Arg Arg Cys Lys Gly Ser
Arg Pro Arg Met Lys Tyr 275 280
285Gln Ile Leu Thr Leu Ala Ala 290
295157438DNAArabidopsis thalianaCDS(1)..(435) 157atg gca aac gca gag aag
aca agt tca ggt tcc gac ata gat gag aag 48Met Ala Asn Ala Glu Lys
Thr Ser Ser Gly Ser Asp Ile Asp Glu Lys 1 5
10 15aaa aga aaa cgc aag tta tca aac cgc gaa tct gca
agg agg tcg cgt 96Lys Arg Lys Arg Lys Leu Ser Asn Arg Glu Ser Ala
Arg Arg Ser Arg 20 25 30ttg
aag aaa cag aag tta atg gaa gac acg att cat gag atc tcc agt 144Leu
Lys Lys Gln Lys Leu Met Glu Asp Thr Ile His Glu Ile Ser Ser 35
40 45ctt gaa cga cga atc aaa gag aac agt
gag aga tgt cga gct gta aaa 192Leu Glu Arg Arg Ile Lys Glu Asn Ser
Glu Arg Cys Arg Ala Val Lys 50 55
60cag agg ctt gac tcg gtc gaa acg gag aac gcg ggt ctt aga tcg gag
240Gln Arg Leu Asp Ser Val Glu Thr Glu Asn Ala Gly Leu Arg Ser Glu 65
70 75 80aag att tgg ctc
tcg agt tac gtt agc gat tta gag aat atg att gct 288Lys Ile Trp Leu
Ser Ser Tyr Val Ser Asp Leu Glu Asn Met Ile Ala 85
90 95acg acg agt tta acg ctg acg cag agt ggt
ggt ggc gat tgt gtc gac 336Thr Thr Ser Leu Thr Leu Thr Gln Ser Gly
Gly Gly Asp Cys Val Asp 100 105
110gat cag aac gca aac gcg gga ata gcg gtt gga gat tgt aga cgt aca
384Asp Gln Asn Ala Asn Ala Gly Ile Ala Val Gly Asp Cys Arg Arg Thr
115 120 125ccg tgg aaa ttg agt tgt ggt
tct cta caa cca atg gcg tcc ttt aag 432Pro Trp Lys Leu Ser Cys Gly
Ser Leu Gln Pro Met Ala Ser Phe Lys 130 135
140aca tga
438Thr145158145PRTArabidopsis thaliana 158Met Ala Asn Ala Glu Lys Thr
Ser Ser Gly Ser Asp Ile Asp Glu Lys 1 5
10 15Lys Arg Lys Arg Lys Leu Ser Asn Arg Glu Ser Ala Arg
Arg Ser Arg 20 25 30Leu Lys
Lys Gln Lys Leu Met Glu Asp Thr Ile His Glu Ile Ser Ser 35
40 45Leu Glu Arg Arg Ile Lys Glu Asn Ser Glu
Arg Cys Arg Ala Val Lys 50 55 60Gln
Arg Leu Asp Ser Val Glu Thr Glu Asn Ala Gly Leu Arg Ser Glu 65
70 75 80Lys Ile Trp Leu Ser Ser
Tyr Val Ser Asp Leu Glu Asn Met Ile Ala 85
90 95Thr Thr Ser Leu Thr Leu Thr Gln Ser Gly Gly Gly
Asp Cys Val Asp 100 105 110Asp
Gln Asn Ala Asn Ala Gly Ile Ala Val Gly Asp Cys Arg Arg Thr 115
120 125Pro Trp Lys Leu Ser Cys Gly Ser Leu
Gln Pro Met Ala Ser Phe Lys 130 135
140Thr1451591697DNAArabidopsis thalianaCDS(74)..(1435) 159ccaaagaaaa
aaaataaatt cgaaggtaaa tatccagaag cttgatcctc ctagttgtac 60gaaagcttga
gta atg ggg tct aga tta aac ttc aag agc ttt gtt gat 109
Met Gly Ser Arg Leu Asn Phe Lys Ser Phe Val Asp 1
5 10ggt gtg agt gag cag cag cca acg gtg ggg act
agt ctt cca ttg act 157 Gly Val Ser Glu Gln Gln Pro Thr Val Gly
Thr Ser Leu Pro Leu Thr 15 20
25agg cag aac tct gtg ttc tcg tta acc ttt gat gag ttt cag aac tca
205Arg Gln Asn Ser Val Phe Ser Leu Thr Phe Asp Glu Phe Gln Asn Ser 30
35 40tgg ggt ggt gga att ggg aaa gat
ttt ggg tct atg aac atg gat gag 253Trp Gly Gly Gly Ile Gly Lys Asp
Phe Gly Ser Met Asn Met Asp Glu 45 50
55 60ctc ttg aag aac att tgg act gca gag gaa agt cat tca
atg atg gga 301Leu Leu Lys Asn Ile Trp Thr Ala Glu Glu Ser His Ser
Met Met Gly 65 70 75aac
aat acc agt tac acc aac atc agc aat ggt aat agt gga aac act 349Asn
Asn Thr Ser Tyr Thr Asn Ile Ser Asn Gly Asn Ser Gly Asn Thr
80 85 90gtt att aac ggc ggt ggt aac aac
att ggt ggg tta gct gtt ggt gtg 397Val Ile Asn Gly Gly Gly Asn Asn
Ile Gly Gly Leu Ala Val Gly Val 95 100
105gga gga gaa agt ggt ggt ttt ttc act ggt ggg agt ttg cag aga caa
445Gly Gly Glu Ser Gly Gly Phe Phe Thr Gly Gly Ser Leu Gln Arg Gln
110 115 120ggt tca ctt acc ttg cct cgg
acg att agt cag aaa agg gtt gat gat 493Gly Ser Leu Thr Leu Pro Arg
Thr Ile Ser Gln Lys Arg Val Asp Asp125 130
135 140gtc tgg aag gag ctg atg aag gag gat gac att gga
aat ggt gtt gtt 541Val Trp Lys Glu Leu Met Lys Glu Asp Asp Ile Gly
Asn Gly Val Val 145 150
155aat ggt ggg aca agc gga att ccg cag agg caa caa acg ctg gga gag
589Asn Gly Gly Thr Ser Gly Ile Pro Gln Arg Gln Gln Thr Leu Gly Glu
160 165 170atg act ttg gag gag ttt
ttg gtc agg gct ggt gtg gtt agg gaa gaa 637Met Thr Leu Glu Glu Phe
Leu Val Arg Ala Gly Val Val Arg Glu Glu 175 180
185cct caa ccg gtg gag agt gta act aac ttc aat ggc gga ttc
tat gga 685Pro Gln Pro Val Glu Ser Val Thr Asn Phe Asn Gly Gly Phe
Tyr Gly 190 195 200ttt ggc agt aat gga
ggt ctt ggg aca gct agt aat ggg ttt gtt gca 733Phe Gly Ser Asn Gly
Gly Leu Gly Thr Ala Ser Asn Gly Phe Val Ala205 210
215 220aac caa cct caa gat ttg tca gga aat gga
gta gcg gtg aga cag gat 781Asn Gln Pro Gln Asp Leu Ser Gly Asn Gly
Val Ala Val Arg Gln Asp 225 230
235ctg ctg act gct caa act cag cca cta cag atg cag cag cca cag atg
829Leu Leu Thr Ala Gln Thr Gln Pro Leu Gln Met Gln Gln Pro Gln Met
240 245 250gtg cag cag cca cag atg
gtg cag cag ccg caa caa ctg ata cag acg 877Val Gln Gln Pro Gln Met
Val Gln Gln Pro Gln Gln Leu Ile Gln Thr 255 260
265cag gag agg cct ttt ccc aaa cag acc act ata gca ttt tcc
aac act 925Gln Glu Arg Pro Phe Pro Lys Gln Thr Thr Ile Ala Phe Ser
Asn Thr 270 275 280gtt gat gtg gtt aac
cgt tct caa cct gca aca cag tgc cag gaa gtg 973Val Asp Val Val Asn
Arg Ser Gln Pro Ala Thr Gln Cys Gln Glu Val285 290
295 300aag cct tca ata ctt gga att cat aac cat
cct atg aac aac aat cta 1021Lys Pro Ser Ile Leu Gly Ile His Asn His
Pro Met Asn Asn Asn Leu 305 310
315ctg caa gct gtc gat ttt aaa aca gga gta acg gtt gca gca gta tct
1069Leu Gln Ala Val Asp Phe Lys Thr Gly Val Thr Val Ala Ala Val Ser
320 325 330cct gga agc cag atg tca
cct gat ctg act cca aag agc gcc ctg gat 1117Pro Gly Ser Gln Met Ser
Pro Asp Leu Thr Pro Lys Ser Ala Leu Asp 335 340
345gca tct ttg tcc cct gtt cct tac atg ttt ggg cga gtg aga
aaa aca 1165Ala Ser Leu Ser Pro Val Pro Tyr Met Phe Gly Arg Val Arg
Lys Thr 350 355 360ggt gca gtt ctg gag
aaa gtg att gag aga agg caa aaa agg atg ata 1213Gly Ala Val Leu Glu
Lys Val Ile Glu Arg Arg Gln Lys Arg Met Ile365 370
375 380aag aat agg gaa tca gct gca aga tcc cgc
gct cgc aag caa gct tat 1261Lys Asn Arg Glu Ser Ala Ala Arg Ser Arg
Ala Arg Lys Gln Ala Tyr 385 390
395acg atg gaa ctg gaa gca gaa att gcg caa ctc aaa gaa ttg aat gaa
1309Thr Met Glu Leu Glu Ala Glu Ile Ala Gln Leu Lys Glu Leu Asn Glu
400 405 410gag ttg cag aag aaa caa
gtt gaa atc atg gaa aag cag aaa aat cag 1357Glu Leu Gln Lys Lys Gln
Val Glu Ile Met Glu Lys Gln Lys Asn Gln 415 420
425ctt ctg gag cct ctg cgc cag cca tgg gga atg gga tgc aaa
agg caa 1405Leu Leu Glu Pro Leu Arg Gln Pro Trp Gly Met Gly Cys Lys
Arg Gln 430 435 440tgc ttg cga agg aca
ttg acg ggt ccc tgg tagagcttat aatggcgtct 1455Cys Leu Arg Arg Thr
Leu Thr Gly Pro Trp445 450aaggaaccca acaaagcgcc
gaagttatag aacaactcag aagatagaaa gctagctttg 1515tacgtagttt aggcaggttc
tgtgggtgat tgtaaatctt gaagtgtggc ggatttgaca 1575gagatagata aacacatatc
tgttctattt tcctaaatct tttggtttta tcttcctgat 1635gtaatggatc tttatcattt
gtcttgaaca tctttgtgac ttaaccagag tgaatttatc 1695tt
1697160454PRTArabidopsis
thaliana 160Met Gly Ser Arg Leu Asn Phe Lys Ser Phe Val Asp Gly Val Ser
Glu 1 5 10 15Gln Gln Pro
Thr Val Gly Thr Ser Leu Pro Leu Thr Arg Gln Asn Ser 20
25 30Val Phe Ser Leu Thr Phe Asp Glu Phe Gln
Asn Ser Trp Gly Gly Gly 35 40
45Ile Gly Lys Asp Phe Gly Ser Met Asn Met Asp Glu Leu Leu Lys Asn 50
55 60Ile Trp Thr Ala Glu Glu Ser His Ser
Met Met Gly Asn Asn Thr Ser 65 70 75
80Tyr Thr Asn Ile Ser Asn Gly Asn Ser Gly Asn Thr Val Ile
Asn Gly 85 90 95Gly Gly
Asn Asn Ile Gly Gly Leu Ala Val Gly Val Gly Gly Glu Ser 100
105 110Gly Gly Phe Phe Thr Gly Gly Ser Leu
Gln Arg Gln Gly Ser Leu Thr 115 120
125Leu Pro Arg Thr Ile Ser Gln Lys Arg Val Asp Asp Val Trp Lys Glu
130 135 140Leu Met Lys Glu Asp Asp Ile
Gly Asn Gly Val Val Asn Gly Gly Thr145 150
155 160Ser Gly Ile Pro Gln Arg Gln Gln Thr Leu Gly Glu
Met Thr Leu Glu 165 170
175Glu Phe Leu Val Arg Ala Gly Val Val Arg Glu Glu Pro Gln Pro Val
180 185 190Glu Ser Val Thr Asn Phe
Asn Gly Gly Phe Tyr Gly Phe Gly Ser Asn 195 200
205Gly Gly Leu Gly Thr Ala Ser Asn Gly Phe Val Ala Asn Gln
Pro Gln 210 215 220Asp Leu Ser Gly Asn
Gly Val Ala Val Arg Gln Asp Leu Leu Thr Ala225 230
235 240Gln Thr Gln Pro Leu Gln Met Gln Gln Pro
Gln Met Val Gln Gln Pro 245 250
255Gln Met Val Gln Gln Pro Gln Gln Leu Ile Gln Thr Gln Glu Arg Pro
260 265 270Phe Pro Lys Gln Thr
Thr Ile Ala Phe Ser Asn Thr Val Asp Val Val 275
280 285Asn Arg Ser Gln Pro Ala Thr Gln Cys Gln Glu Val
Lys Pro Ser Ile 290 295 300Leu Gly Ile
His Asn His Pro Met Asn Asn Asn Leu Leu Gln Ala Val305
310 315 320Asp Phe Lys Thr Gly Val Thr
Val Ala Ala Val Ser Pro Gly Ser Gln 325
330 335Met Ser Pro Asp Leu Thr Pro Lys Ser Ala Leu Asp
Ala Ser Leu Ser 340 345 350Pro
Val Pro Tyr Met Phe Gly Arg Val Arg Lys Thr Gly Ala Val Leu 355
360 365Glu Lys Val Ile Glu Arg Arg Gln Lys
Arg Met Ile Lys Asn Arg Glu 370 375
380Ser Ala Ala Arg Ser Arg Ala Arg Lys Gln Ala Tyr Thr Met Glu Leu385
390 395 400Glu Ala Glu Ile
Ala Gln Leu Lys Glu Leu Asn Glu Glu Leu Gln Lys 405
410 415Lys Gln Val Glu Ile Met Glu Lys Gln Lys
Asn Gln Leu Leu Glu Pro 420 425
430Leu Arg Gln Pro Trp Gly Met Gly Cys Lys Arg Gln Cys Leu Arg Arg
435 440 445Thr Leu Thr Gly Pro Trp
4501611717DNAArabidopsis thalianaCDS(234)..(1379) 161aaaaaaaaaa
aaaactgaac tctttttcgc tctggttttt ttagagagag agaaagatga 60aaatgcgttt
aattgctgtt taggtttcga attcgcgatt taaatttctg ggtttctctc 120tgtttaagct
tcttcttctt catcttctgc ttacgtttct tcttcaagga gctttcggat 180tcttgtagaa
agagtcattg ttctcttgag tgggaaacct tgaaaccatt cct atg 236
Met
1gga aat agc agc gag gaa cca aag cct
cct acc aaa tca gat aaa cca 284Gly Asn Ser Ser Glu Glu Pro Lys Pro
Pro Thr Lys Ser Asp Lys Pro 5 10
15tct tca ccc ccg gtg gat caa aca aat gtt cat gtc tac cct gat tgg
332Ser Ser Pro Pro Val Asp Gln Thr Asn Val His Val Tyr Pro Asp Trp
20 25 30gca gct atg cag gca tat tat
ggt cca aga gta gca atg cct cct tat 380Ala Ala Met Gln Ala Tyr Tyr
Gly Pro Arg Val Ala Met Pro Pro Tyr 35 40
45tac aat tca gct atg gct gca tct ggt cat cct cct cct cct tac atg
428Tyr Asn Ser Ala Met Ala Ala Ser Gly His Pro Pro Pro Pro Tyr Met 50
55 60 65tgg aat cct cag
cat atg atg tca cca tat gga gca ccc tat gct gct 476Trp Asn Pro Gln
His Met Met Ser Pro Tyr Gly Ala Pro Tyr Ala Ala 70
75 80gtt tat cct cat gga gga gga gtt tac gct
cat ccc ggt att ccc atg 524Val Tyr Pro His Gly Gly Gly Val Tyr Ala
His Pro Gly Ile Pro Met 85 90
95gga tca ctg cct caa ggt caa aag gat cca cct tta aca act ccg ggg
572Gly Ser Leu Pro Gln Gly Gln Lys Asp Pro Pro Leu Thr Thr Pro Gly
100 105 110acg ctt ttg agc atc gac act
cct act aaa tct aca ggg aac aca gac 620Thr Leu Leu Ser Ile Asp Thr
Pro Thr Lys Ser Thr Gly Asn Thr Asp 115 120
125aat gga ttg atg aag aag ctg aaa gag ttt gat ggg ctt gct atg tct
668Asn Gly Leu Met Lys Lys Leu Lys Glu Phe Asp Gly Leu Ala Met Ser130
135 140 145cta gga aat ggg
aat cct gaa aat ggt gca gat gaa cat aaa cga tca 716Leu Gly Asn Gly
Asn Pro Glu Asn Gly Ala Asp Glu His Lys Arg Ser 150
155 160cgg aac agc tca gaa act gat ggt tct act
gat gga agt gat ggg aat 764Arg Asn Ser Ser Glu Thr Asp Gly Ser Thr
Asp Gly Ser Asp Gly Asn 165 170
175aca act ggg gca gat gaa ccg aaa ctt aaa aga agt cga gag gga act
812Thr Thr Gly Ala Asp Glu Pro Lys Leu Lys Arg Ser Arg Glu Gly Thr
180 185 190cca aca aaa gat ggg aaa caa
ttg gtt caa gct agc tca ttt cat tct 860Pro Thr Lys Asp Gly Lys Gln
Leu Val Gln Ala Ser Ser Phe His Ser 195 200
205gtt tct ccg tca agt ggt gat acc ggc gta aaa ctc att caa gga tct
908Val Ser Pro Ser Ser Gly Asp Thr Gly Val Lys Leu Ile Gln Gly Ser210
215 220 225gga gct ata ctc
tct cct ggt gta agt gca aat tcc aac ccc ttc atg 956Gly Ala Ile Leu
Ser Pro Gly Val Ser Ala Asn Ser Asn Pro Phe Met 230
235 240tca caa tct tta gcc atg gtt cct cct gaa
act tgg ctt cag aac gag 1004Ser Gln Ser Leu Ala Met Val Pro Pro Glu
Thr Trp Leu Gln Asn Glu 245 250
255aga gaa ctg aaa cgg gag cga agg aaa cag tct aat aga gaa tct gct
1052Arg Glu Leu Lys Arg Glu Arg Arg Lys Gln Ser Asn Arg Glu Ser Ala
260 265 270aga agg tca aga tta agg aaa
cag gcc gag aca gaa gaa ctt gct agg 1100Arg Arg Ser Arg Leu Arg Lys
Gln Ala Glu Thr Glu Glu Leu Ala Arg 275 280
285aaa gtg gaa gcc ttg aca gcc gaa aac atg gca tta aga tct gaa cta
1148Lys Val Glu Ala Leu Thr Ala Glu Asn Met Ala Leu Arg Ser Glu Leu290
295 300 305aac caa ctt aat
gag aaa tct gat aaa cta aga gga gca aat gca acc 1196Asn Gln Leu Asn
Glu Lys Ser Asp Lys Leu Arg Gly Ala Asn Ala Thr 310
315 320ttg ttg gac aaa ctg aaa tgc tcg gaa ccc
gaa aag aga gtc ccc gca 1244Leu Leu Asp Lys Leu Lys Cys Ser Glu Pro
Glu Lys Arg Val Pro Ala 325 330
335aat atg ttg tct aga gtt aag aac tca gga gct gga gat aag aac aag
1292Asn Met Leu Ser Arg Val Lys Asn Ser Gly Ala Gly Asp Lys Asn Lys
340 345 350aac caa gga gac aat gat tct
aac tct aca agc aaa ttg cat caa ctg 1340Asn Gln Gly Asp Asn Asp Ser
Asn Ser Thr Ser Lys Leu His Gln Leu 355 360
365ctc gat acg aag cct cga gct aaa gca gta gct gca ggc tgatcgatgg
1389Leu Asp Thr Lys Pro Arg Ala Lys Ala Val Ala Ala Gly370
375 380taattcatgt cgatttctac ttaatttgtc gacataaaca
aagaaaataa gtgctactaa 1449tttcagaaaa acttgataga tagtatagta gagagagaga
gagagagaga ggtgtgatga 1509ttattgatct ataaattttc ggagagagag agggagaaag
agaaactttt cctccagatg 1569aaaatttggt gttatggttt gttactgtta atatagagag
gcttttcttt ttttataaaa 1629tggcttcctt tgttgcattt ccttgtttta gacctgatgt
aattttatga aatcggtgtt 1689attgctttgc gtaaaaaaaa aaaaaaaa
1717162382PRTArabidopsis thaliana 162Met Gly Asn
Ser Ser Glu Glu Pro Lys Pro Pro Thr Lys Ser Asp Lys 1 5
10 15Pro Ser Ser Pro Pro Val Asp Gln Thr
Asn Val His Val Tyr Pro Asp 20 25
30Trp Ala Ala Met Gln Ala Tyr Tyr Gly Pro Arg Val Ala Met Pro Pro
35 40 45Tyr Tyr Asn Ser Ala Met
Ala Ala Ser Gly His Pro Pro Pro Pro Tyr 50 55
60Met Trp Asn Pro Gln His Met Met Ser Pro Tyr Gly Ala Pro Tyr
Ala 65 70 75 80Ala Val
Tyr Pro His Gly Gly Gly Val Tyr Ala His Pro Gly Ile Pro
85 90 95Met Gly Ser Leu Pro Gln Gly Gln
Lys Asp Pro Pro Leu Thr Thr Pro 100 105
110Gly Thr Leu Leu Ser Ile Asp Thr Pro Thr Lys Ser Thr Gly Asn
Thr 115 120 125Asp Asn Gly Leu Met
Lys Lys Leu Lys Glu Phe Asp Gly Leu Ala Met 130 135
140Ser Leu Gly Asn Gly Asn Pro Glu Asn Gly Ala Asp Glu His
Lys Arg145 150 155 160Ser
Arg Asn Ser Ser Glu Thr Asp Gly Ser Thr Asp Gly Ser Asp Gly
165 170 175Asn Thr Thr Gly Ala Asp Glu
Pro Lys Leu Lys Arg Ser Arg Glu Gly 180 185
190Thr Pro Thr Lys Asp Gly Lys Gln Leu Val Gln Ala Ser Ser
Phe His 195 200 205Ser Val Ser Pro
Ser Ser Gly Asp Thr Gly Val Lys Leu Ile Gln Gly 210
215 220Ser Gly Ala Ile Leu Ser Pro Gly Val Ser Ala Asn
Ser Asn Pro Phe225 230 235
240Met Ser Gln Ser Leu Ala Met Val Pro Pro Glu Thr Trp Leu Gln Asn
245 250 255Glu Arg Glu Leu Lys
Arg Glu Arg Arg Lys Gln Ser Asn Arg Glu Ser 260
265 270Ala Arg Arg Ser Arg Leu Arg Lys Gln Ala Glu Thr
Glu Glu Leu Ala 275 280 285Arg Lys
Val Glu Ala Leu Thr Ala Glu Asn Met Ala Leu Arg Ser Glu 290
295 300Leu Asn Gln Leu Asn Glu Lys Ser Asp Lys Leu
Arg Gly Ala Asn Ala305 310 315
320Thr Leu Leu Asp Lys Leu Lys Cys Ser Glu Pro Glu Lys Arg Val Pro
325 330 335Ala Asn Met Leu
Ser Arg Val Lys Asn Ser Gly Ala Gly Asp Lys Asn 340
345 350Lys Asn Gln Gly Asp Asn Asp Ser Asn Ser Thr
Ser Lys Leu His Gln 355 360 365Leu
Leu Asp Thr Lys Pro Arg Ala Lys Ala Val Ala Ala Gly 370
375 3801631554DNAArabidopsis thalianaCDS(162)..(1379)
163acaaaatatc tctccctcta tctgcaaatt ttccaaagtt gcatcctttc aatttccact
60cctctctaat ataattcaca ttttcccact attgctgatt catttttttt tgtgaattat
120ttcaaaccca cataaaaaaa tctttgttta aatttaaaac c atg gat cct tca ttt
176 Met Asp Pro Ser Phe
1 5agg ttc att
aaa gag gag ttt cct gct gga ttc agt gat tct cca tca 224Arg Phe Ile
Lys Glu Glu Phe Pro Ala Gly Phe Ser Asp Ser Pro Ser 10
15 20cca cca tct tct tct tca tac ctt tat
tca tct tcc atg gct gaa gca 272Pro Pro Ser Ser Ser Ser Tyr Leu Tyr
Ser Ser Ser Met Ala Glu Ala 25 30
35gcc ata aat gat cca aca aca ttg agc tat cca caa cca tta gaa ggt
320Ala Ile Asn Asp Pro Thr Thr Leu Ser Tyr Pro Gln Pro Leu Glu Gly
40 45 50ctc cat gaa tca ggg cca cct
cca ttt ttg aca aag aca tat gac ttg 368Leu His Glu Ser Gly Pro Pro
Pro Phe Leu Thr Lys Thr Tyr Asp Leu 55 60
65gtg gaa gat tca aga acc aat cat gtc gtg tct tgg agc aaa tcc aat
416Val Glu Asp Ser Arg Thr Asn His Val Val Ser Trp Ser Lys Ser Asn 70
75 80 85aac agc ttc att
gtc tgg gat cca cag gcc ttt tct gta act ctc ctt 464Asn Ser Phe Ile
Val Trp Asp Pro Gln Ala Phe Ser Val Thr Leu Leu 90
95 100ccc aga ttc ttc aag cac aat aac ttc tcc
agt ttt gtc cgc cag ctc 512Pro Arg Phe Phe Lys His Asn Asn Phe Ser
Ser Phe Val Arg Gln Leu 105 110
115aac aca tat ggt ttc aga aag gtg aat ccg gat cgg tgg gag ttt gca
560Asn Thr Tyr Gly Phe Arg Lys Val Asn Pro Asp Arg Trp Glu Phe Ala
120 125 130aac gaa ggg ttt ctt aga ggg
caa aag cat ctc ctc aag aac ata agg 608Asn Glu Gly Phe Leu Arg Gly
Gln Lys His Leu Leu Lys Asn Ile Arg 135 140
145aga aga aaa aca agt aat aat agt aat caa atg caa caa cct caa agt
656Arg Arg Lys Thr Ser Asn Asn Ser Asn Gln Met Gln Gln Pro Gln Ser150
155 160 165tct gaa caa caa
tct cta gac aat ttt tgc ata gaa gtg ggt agg tac 704Ser Glu Gln Gln
Ser Leu Asp Asn Phe Cys Ile Glu Val Gly Arg Tyr 170
175 180ggt cta gat gga gag atg gac agc cta agg
cga gac aag caa gtg ttg 752Gly Leu Asp Gly Glu Met Asp Ser Leu Arg
Arg Asp Lys Gln Val Leu 185 190
195atg atg gag cta gtg aga cta aga cag caa caa caa agc acc aaa atg
800Met Met Glu Leu Val Arg Leu Arg Gln Gln Gln Gln Ser Thr Lys Met
200 205 210tat ctc aca ttg att gaa gag
aag ctc aag aag acc gag tca aaa caa 848Tyr Leu Thr Leu Ile Glu Glu
Lys Leu Lys Lys Thr Glu Ser Lys Gln 215 220
225aaa caa atg atg agc ttc ctt gcc cgc gca atg cag aat cca gat ttt
896Lys Gln Met Met Ser Phe Leu Ala Arg Ala Met Gln Asn Pro Asp Phe230
235 240 245att cag cag cta
gta gag cag aag gaa aag agg aaa gag atc gaa gag 944Ile Gln Gln Leu
Val Glu Gln Lys Glu Lys Arg Lys Glu Ile Glu Glu 250
255 260gcg atc agc aag aag aga caa aga ccg atc
gat caa gga aaa aga aat 992Ala Ile Ser Lys Lys Arg Gln Arg Pro Ile
Asp Gln Gly Lys Arg Asn 265 270
275gtg gaa gat tat ggt gat gaa agt ggt tat ggg aat gat gtt gca gcc
1040Val Glu Asp Tyr Gly Asp Glu Ser Gly Tyr Gly Asn Asp Val Ala Ala
280 285 290tca tcc tca gca ttg att ggt
atg agt cag gaa tat aca tat gga aac 1088Ser Ser Ser Ala Leu Ile Gly
Met Ser Gln Glu Tyr Thr Tyr Gly Asn 295 300
305atg tct gaa ttc gag atg tcg gag ttg gac aaa ctt gct atg cac att
1136Met Ser Glu Phe Glu Met Ser Glu Leu Asp Lys Leu Ala Met His Ile310
315 320 325caa gga ctt gga
gat aat tcc agt gct agg gaa gaa gtc ttg aat gtg 1184Gln Gly Leu Gly
Asp Asn Ser Ser Ala Arg Glu Glu Val Leu Asn Val 330
335 340gaa aaa gga aat gat gag gaa gaa gta gaa
gat caa caa caa ggg tac 1232Glu Lys Gly Asn Asp Glu Glu Glu Val Glu
Asp Gln Gln Gln Gly Tyr 345 350
355cat aag gag aac aat gag att tat ggt gaa ggt ttt tgg gaa gat ttg
1280His Lys Glu Asn Asn Glu Ile Tyr Gly Glu Gly Phe Trp Glu Asp Leu
360 365 370tta aat gaa ggt caa aat ttt
gat ttt gaa gga gat caa gaa aat gtt 1328Leu Asn Glu Gly Gln Asn Phe
Asp Phe Glu Gly Asp Gln Glu Asn Val 375 380
385gat gtg tta att cag caa ctt ggt tat ttg ggt tct agt tca cac act
1376Asp Val Leu Ile Gln Gln Leu Gly Tyr Leu Gly Ser Ser Ser His Thr390
395 400 405aat taagaagaaa
ttgaaatgat gactacttta agcatttgaa tcaacttgtt 1429Asntcctattagt
aatttggctt tgtttcaatc aagtgagtcg tggactaact tattgaattt 1489gggggttaaa
tccgtttctt atttttggaa ataaaattgc tttttgttta aaaaaaaaaa 1549aaaaa
1554164406PRTArabidopsis thaliana 164Met Asp Pro Ser Phe Arg Phe Ile Lys
Glu Glu Phe Pro Ala Gly Phe 1 5 10
15Ser Asp Ser Pro Ser Pro Pro Ser Ser Ser Ser Tyr Leu Tyr Ser
Ser 20 25 30Ser Met Ala Glu
Ala Ala Ile Asn Asp Pro Thr Thr Leu Ser Tyr Pro 35
40 45Gln Pro Leu Glu Gly Leu His Glu Ser Gly Pro Pro
Pro Phe Leu Thr 50 55 60Lys Thr Tyr
Asp Leu Val Glu Asp Ser Arg Thr Asn His Val Val Ser 65
70 75 80Trp Ser Lys Ser Asn Asn Ser Phe
Ile Val Trp Asp Pro Gln Ala Phe 85 90
95Ser Val Thr Leu Leu Pro Arg Phe Phe Lys His Asn Asn Phe
Ser Ser 100 105 110Phe Val Arg
Gln Leu Asn Thr Tyr Gly Phe Arg Lys Val Asn Pro Asp 115
120 125Arg Trp Glu Phe Ala Asn Glu Gly Phe Leu Arg
Gly Gln Lys His Leu 130 135 140Leu Lys
Asn Ile Arg Arg Arg Lys Thr Ser Asn Asn Ser Asn Gln Met145
150 155 160Gln Gln Pro Gln Ser Ser Glu
Gln Gln Ser Leu Asp Asn Phe Cys Ile 165
170 175Glu Val Gly Arg Tyr Gly Leu Asp Gly Glu Met Asp
Ser Leu Arg Arg 180 185 190Asp
Lys Gln Val Leu Met Met Glu Leu Val Arg Leu Arg Gln Gln Gln 195
200 205Gln Ser Thr Lys Met Tyr Leu Thr Leu
Ile Glu Glu Lys Leu Lys Lys 210 215
220Thr Glu Ser Lys Gln Lys Gln Met Met Ser Phe Leu Ala Arg Ala Met225
230 235 240Gln Asn Pro Asp
Phe Ile Gln Gln Leu Val Glu Gln Lys Glu Lys Arg 245
250 255Lys Glu Ile Glu Glu Ala Ile Ser Lys Lys
Arg Gln Arg Pro Ile Asp 260 265
270Gln Gly Lys Arg Asn Val Glu Asp Tyr Gly Asp Glu Ser Gly Tyr Gly
275 280 285Asn Asp Val Ala Ala Ser Ser
Ser Ala Leu Ile Gly Met Ser Gln Glu 290 295
300Tyr Thr Tyr Gly Asn Met Ser Glu Phe Glu Met Ser Glu Leu Asp
Lys305 310 315 320Leu Ala
Met His Ile Gln Gly Leu Gly Asp Asn Ser Ser Ala Arg Glu
325 330 335Glu Val Leu Asn Val Glu Lys
Gly Asn Asp Glu Glu Glu Val Glu Asp 340 345
350Gln Gln Gln Gly Tyr His Lys Glu Asn Asn Glu Ile Tyr Gly
Glu Gly 355 360 365Phe Trp Glu Asp
Leu Leu Asn Glu Gly Gln Asn Phe Asp Phe Glu Gly 370
375 380Asp Gln Glu Asn Val Asp Val Leu Ile Gln Gln Leu
Gly Tyr Leu Gly385 390 395
400Ser Ser Ser His Thr Asn 40516523DNAArtificial
SequenceDescription of Artificial Sequenceprimer 165cgccagggtt ttcccagtca
cga 2316624DNAArtificial
SequenceDescription of Artificial Sequenceprimer 166agcggataac aatttcacac
agga 24
User Contributions:
comments("1"); ?> comment_form("1"); ?>Inventors list |
Agents list |
Assignees list |
List by place |
Classification tree browser |
Top 100 Inventors |
Top 100 Agents |
Top 100 Assignees |
Usenet FAQ Index |
Documents |
Other FAQs |
User Contributions:
Comment about this patent or add new information about this topic: